Only the *01 allele of each functional, ORF and in-frame pseudogenes C-REGION is shown.
Dots indicate gaps according to the IMGT unique numbering. Blanks at the 5' and/or 3' end indicate partial sequences.
Letters in red correspond to amino acids which are polymorphic in the other alleles.
For C-REGIONs, letters in bold correspond to additional positions in the IMGT unique numbering.
For C-REGIONs, letters between parentheses correspond to amino acids resulting from the splicing.
N (Asn, asparagine) of potential N-glycosylation sites (NXS/T, where X is different from P), (N-linked glycosylation) is shown is green (site is underlined in CHS and in pages edited before 14/10/2009).
A AB B BC C CD D DE E EF F FG G CHS | |||||
(1-15) (16-26) (27-38) (39-45) (77-84) (85-96) (97-104) (105-117) (118-128) | |||||
——————————————> ——————————> ———————————> ——————> ———————> ———————————> ———————> ————————————> ——————————> | |||||
group | C-Genes | AccNum | 1 10 15 16 20 2326 27 38 3941 45 77 80 84 85 89 96 97 104 105 117 118121 130 140 150 |
||
87654321|........|....|123|...|..|..| |..........| |.|...|1234567|..|...|12345677654321|...|......|12|......| |...........| |..|........|.........|.........| | |||||
EX1 | |||||
TRA | TRAC*01 | IMGT000049 | F | (X)VKDPNPTVYQLRSPQ........SSDTSVCLFT DFDS.....NQV NMEKIMG.......SEGSTVHKTNSTVLN.MEILGSKSNGIVTWGN......TSDAGC EYTFNE.TIPFAS SL | |
TRD | TRDC*01 | IMGT000049 | F | (X)SQPAASPSVFVMKNG...........TNVACLVK EFYP....KDVT ISLQSSKKI.....IEYDPAIAISPG.......GKYSAVKLGQYGD......PDSVTC SVEHN...KQTWH STDFEPKKTIP | |
TRG | TRGC1*01 | D90409 | F | #c (1) | (E)RNLEADTSPKPTVFLPSIAEINH..DNAGTYLCLLE KFFP....DVIT VSWRAKNDKRAL..PSQQGNTMKTKD.......TYMKFSWLTVTEN....SMDKQHVC VVKHKKNIGGIDQ EIIFPSIKE |
TRGC2*01 | D90411 | F | #c (2) | (E)RNLEADTSPKPTVFLPSIAEINH..DNAGTYLCLLE KFFP....DVIT VSWRAKNDKRAL..PSQQGNTMKTKD.......TYMKLSWLTVTEN....SMDKQHVC VVKHKKNIGGIDQ EIIFPSIKE | |
TRGC3*01 | D90414 | F | #c | (D)RDLDIDMSPKPTMFLPSITEIKR..ENTGTYLCLLE NFFP....HVIK VYWREKRGNRVL..PSQQGNTVKTAD.......TYMKFSWLTVSGN....SMDKEHIC IVKHEKNKTGDNQ EILFPPVNE | |
TRGC4*01 | X63680 | F | #c | (D)RNLATDISPKPTIFLPSIAEINH..SKTGTYLCLLE KFFP....DIIK VYWKEKDGNKAL..PSQQGNTMKTTD.......TYMKLSWLTVTEN....SMDKEHIC VVQHERNIGGINQ EILFPSINE | |
EX2 | |||||
[EX2A] [EX2B] [EX2C] [EX2] | |||||
TRA | TRAC*01 | IMGT000049 | F | (E)ISCNAKLVEKSFET | |
TRD | TRDC*01 | IMGT000049 | F | (E)TTPKPMAYENSTKAEAPVTCQEPQ | |
TRG | TRGC1*01 | D90409 | F | #c | (V)VTSLVPTTEPPTTKPPTTEPPTTEPPNDCLTDES (K)LTGTGSKKACLKDGS (D)TNSTKACLEGES |
TRGC2*01 | D90411 | F | #c | (V)VTSLVPTTEPPTA..........EPPNDCLTDES (I)TDTGSKKACLKDGS (D)TNSTKACLEGKS | |
TRGC3*01 | D90414 | F | #c | (V)VTSVVTATKPP...............NDGLKDKK (K)QVPVVNSTKACLKDEN | |
TRGC4*01 | X63680 | F | #c | (V)VSSIVPTTESP...............SDCLNHDS (K)VTGTGSKKACLKDES (E)VTADNNSTKVCLKDES | |
CONNECTING_REGION | |||||
EX3 | |||||
[EX3] | |||||
TRA | TRAC*01 | IMGT000049 | F | (D)INLNSQNLSVIVFRILLLKVVGFNLLMTLRL.....WSS | |
TRD | TRDC*01 | IMGT000049 | F | (V)QPGKVNMMSLSVLGLRMLFAKSVAVNFLLTAKLFFF | |
TRG | TRGC1*01 | D90409 | F | #c | (S)TLQLQLMNTSAAYYTYLLLLFLSTVYFVVIISCVF....RRTGVC |
TRGC2*01 | D90411 | F | #c | (S)TLQLQLMNTSAAYYTYLLLLLLSTVYFAVIISCVF....RRTGVCCDRKIS | |
TRGC3*01 | D90414 | F | #c | (N)TLQLHLMNTSA.YYTYLLLLITSTVYLVIITSCVF....RRTGVCGIQKSS | |
TRGC4*01 | X63680 | F | #c | (N)TLQLQLMNTSA.YYTYLLLLLKSVVYCIIITSCVF....RRTGICCDGKNL | |
CONNECTING_REGION | TRANSMEMBRANE-REGION | CYTOPLASMIC-REGION |
#c: rearranged cDNA