IMGT Repertoire (IG and TR)

The TRGC protein display numbering follows the IMGT unique numbering for C-DOMAIN and C-LIKE-DOMAIN.

Only the *01 allele of each functional, ORF and in-frame pseudogene C-REGION is shown.

Letters in red correspond to amino acids which are polymorphic in the other alleles.
For C-REGIONs, letters in bold correspond to additional positions in the IMGT unique numbering.
For C-REGIONs, letters between parentheses correspond to amino acids resulting from the splicing.

N (Asn, asparagine) of potential N-glycosylation sites (N-linked glycosylation; NXS/T, where X is any amino acid except P) is shown in green (the site is underlined in CHS and in pages edited before 14/10/2009).
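
The NXS/T rule above can be sketched as a short scan over a displayed sequence. This is a minimal illustration, not IMGT's own tooling: the function name `potential_n_glyc_sites` is hypothetical, and it assumes that dots and spaces in the display are alignment gaps and that parentheses only mark splice-junction residues. Reported positions are 1-based in the ungapped string, not IMGT unique numbering positions.

```python
import re

# NXS/T sequon: N, then any residue except P, then S or T.
# The lookahead lets overlapping sequons (e.g. "NNTS") both be reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def potential_n_glyc_sites(display_seq):
    """Return 1-based positions (in the ungapped sequence) of N in NXS/T."""
    # Assumption: '.' and whitespace are alignment gaps, and parentheses only
    # flag splice-junction residues, so all of them are dropped before scanning.
    ungapped = re.sub(r"[.\s()]", "", display_seq)
    return [m.start() + 1 for m in SEQUON.finditer(ungapped)]


# EX2B segment of TRGC2 as displayed in the table, without the splice residue.
print(potential_n_glyc_sites("FAAINSTEASLHDEN"))
```

A match only marks a potential site; whether the asparagine is actually glycosylated cannot be decided from the sequence alone.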

                                                                                                                                     
                                                                                                                                     
TRGC genes  AccNum
                 A       AB      B         BC         C     CD      D          DE           E      EF   F          FG            G     
              (1-15)          (16-26)   (27-38)    (39-45)       (77-84)                 (85-96)     (97-104)   (105-117)    (118-128) 
          ——————————————>   ——————————>            ——————>       ———————>              ———————————>  ———————>               ——————————>
          1        10  15   16     2326 27      38 3941 45       77 80 84              85  89    96  97   104 105       117 118     128    
  87654321|........|....|123|......|..| |........| |.|...|1234567|..|...|12345677654321|...|......|12|......| |...........| |.........|
TRGC1  AAEX02028369  ORF (1)
(X)KSPDEDISPKLTAFLPSIAERTL..HMAGTYLCPFL PDV.....IK IDWKE.NGRTIL..QSQQGDTMKTKD.......KYMKFSWLTVTDV....SMDNEHKS IVKHKSNKGGVDQ EILFPSINK..
TRGC2  AAEX02028369  F
(D)KSP.EDISPKPTIFLPSIAEIKA..HQVGTYICLLE DIIP..DIFK IDWKEKNSKTIL..QSQQGNTVKTKD.......TYMKFSWVTVTEK....SMDKEHQC IVKDERNKERVNQ EIDFPSINK..
TRGC3  AAEX02028368  F
(X)KSANEDTSPKPTVFLPSISEIKI..HKAGTYLCLLE DFFP..EIIK VDWKEKNDQTVL..QSQQGNTMKTKD.......TYMKFSWLTVTGA....SMDKEHKC IVNHESDRGGINQ EILFPSINE..
TRGC4  AAEX02028368  F
(D)KSP.EDISPKPTIFLPSIAEIKV..HKAGTYLCHLE ENIP..DVFK IDWKEKNVKTIL..QSQQGNTVKTED.......TYMKFSWVIVTEE....SLDKEHQC IVKDERNKERVNQ EIDFPSINK..
TRGC5  AAEX02028366  F
(D)KNPDEDIPPKPTVFLPSITEIKD..HNTGTYLCLLE DFFP..DVIK IDWKEKDDKTVL..QSQQGDTMKTKD.......TYMKFSWLTVTQE....SMAKDHKC MVKHERNKGRVDQ EIDFPSINR..
TRGC7  AAEX02028365  F
(X)RSLDADNSPKPTTFFPSIAETTL..HNAGTYLCLLE NFFR..DVIK IDWKEKSGKTIP..QSQQGNTMKTKD.......TFMKFSWLTVMGA....SMDKGYKC VIKHERNKGRVDQ EILFSSQIK..
TRGC8  AAEX02028365  F
(D)RSLDTDISPKPTIFFPSIAEIKL..HKTGTYLCLVE NFFP..EVIK IHWKEKNGQMIL..KSQQGDTVKTND.......TFMKFSWLTVAKK....SMAEEQQC IITHENNKEGINK EILYRSMKK..
                            CONNECTING-REGION                                     | TRANSMEMBRANE-REGION |CYTOPLASMIC-REGION
                     [EX2A]                       [EX2B]                            [EX3]
TRGC1  AAEX02028369  ORF (1)
                                             (E)LTAINSTKTSLKDDN (A)PMQLQLMNISVCYIYTLLLFKSLVYSAIITTHFLGRPALYGNGKSS
TRGC2  AAEX02028369  F
(D)LVARMESDMDPAGHSEPKQNTEVVTLNPASSRSFRPVLSTV (E)FAAINSTEASLHDEN (D)PLQLQLMNTSAYYTYLLLLLKSLMYSIITAICLLGRSVFDGNGKSS
TRGC3  AAEX02028368  F
(D)LVARMESDVDSEGHSEAKQNTEVVTVSPLSSRSFPPGASTV (E)LVALNSTEASLDDEY (D)PLQLQLMNTSAYYTYLLLLLKSLTYSIIITIYLLGRSVLNGNGKSS
TRGC4  AAEX02028368  F
(D)LLARMESNVDPTGHSEAKQNTEVVTLNPSSSRSFRPVPSTV (E)LTAINSTEAFLDDEN (H)TLQLQLMNTSAYYTYLLLLLKSLMYSIIITICLLGRPVLDGNGKSS
TRGC5  AAEX02028366  F
(V)LVAAIQSNMDSQRHSETRRKRQVVTVSPLSSQSFSTVPSTV                    (D)NQQLQLMNTSAYYIYILLLFKSLMYSIIITICLLERPALDGNRKN
TRGC7  AAEX02028365  F
                                             (E)LTAITSTKYSLKDKN (D)PLQLQLMNTSAYYTYLLLLLKSLVYSVVITFYLLGRPAFCGNGKSS
TRGC8  AAEX02028365  F
                                             (E)LTAINSTKTDD     (D)ILQLQLKTTSAYYTYLLLFLKSLMYSIITAFCLFRRPAPCGNGKSS
IMGT note:
(1) ORF because the 2nd CYS is missing and because of a non-canonical ACCEPTOR-SPLICE (naann instead of nagnn) in EX3.
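
The splice pattern quoted in the note can be checked with a simple wildcard comparison. The sketch below is only an illustration under stated assumptions: the helper name `matches_acceptor` is hypothetical, the site is assumed to be given as exactly five nucleotides aligned to the pattern, and `n` is assumed to stand for any nucleotide.

```python
# Canonical ACCEPTOR-SPLICE pattern as written in the IMGT note.
# Assumption: 'n' is a wildcard for any nucleotide.
CANONICAL_ACCEPTOR = "nagnn"


def matches_acceptor(site, pattern=CANONICAL_ACCEPTOR):
    """Return True if a 5-nucleotide site fits the pattern ('n' = any base)."""
    site = site.lower()
    if len(site) != len(pattern):
        return False
    # Every non-wildcard position of the pattern must equal the site base.
    return all(p == "n" or p == s for p, s in zip(pattern, site))


print(matches_acceptor("caggt"))  # fits nagnn
print(matches_acceptor("caagt"))  # has aa where nagnn requires ag
```

A site of the naann form fails this check because its third position is a, not g, which is the deviation the note gives as one reason for the ORF functionality.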