IMGT reference sequences in FASTA format:

Amino acid sequences with gaps

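Each FASTA header contains the following fields, separated by "|":
IMGT accession number | gene and allele name | species | functionality | exon name | domain name | domain type | alternative splicing (if applicable) | partial (if applicable)
Some of the headers below omit fields that are not available or do not apply (for example, the accession number).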
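As an illustration of how such a header might be read programmatically, here is a minimal Python sketch. It is not an IMGT tool: the names `ImgtHeader` and `parse_header` are hypothetical, and the parsing rule is an assumption chosen because the number of fields varies between headers on this page. The sketch locates the gene and allele name by its "*" character, treats the field before it (if any) as the accession number, takes the next two fields as species and functionality, and keeps every remaining non-empty field in a list without trying to label it.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ImgtHeader:
    """Hypothetical container for the fields of one header line."""
    accession: Optional[str]   # None when the header has no accession field
    gene_allele: str           # e.g. "CD1A*02"
    species: str               # e.g. "Homo sapiens"
    functionality: str         # e.g. "F"
    extra: List[str]           # remaining non-empty fields, left unlabeled


def parse_header(line: str) -> ImgtHeader:
    """Split a header on "|" using the gene/allele field as the anchor.

    Assumption: the gene and allele name is the first field containing "*".
    """
    fields = line.strip().lstrip(">").split("|")
    idx = next((i for i, f in enumerate(fields) if "*" in f), None)
    if idx is None or idx + 2 >= len(fields):
        raise ValueError(f"unrecognized header: {line!r}")
    accession = fields[idx - 1] if idx >= 1 and fields[idx - 1] else None
    # Exon name, domain name, domain type, "partial", etc. are kept in order
    # but not assigned to named slots, because some headers skip fields.
    extra = [f for f in fields[idx + 3:] if f]
    return ImgtHeader(accession, fields[idx], fields[idx + 1],
                      fields[idx + 2], extra)


if __name__ == "__main__":
    print(parse_header(">P06126|CD1A*02|Homo sapiens|F|EX2|[D1]|G-LIKE-ALPHA1||"))
    print(parse_header(">CD1A*01|Homo sapiens|F|EX1||"))
```

Anchoring on the gene and allele name rather than on fixed positions is a design choice for this sketch only. If every header in a given file carries all nine fields, a plain positional split would be simpler.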
>CD1A*01|Homo sapiens|F|EX1||
MLFLLLPLLAVLPGDGNAD
>CD1A*01|Homo sapiens|F|EX2|[D1]|G-LIKE-ALPHA1||
..GLKEPLSFHVTWIASFYNHSWKQNLVSGWLSDLQTHTWDSN..SSTIVFLC.......
.PWSRGNFSNEEWKELETLFRIRTIRSFEGIRRYAHELQFE
>CD1A*01|Homo sapiens|F|EX3|[D2]|G-LIKE-ALPHA2||
....YPFEIQVTGGCELHSG.KVSGSFLQLAYQGSDFVSFQ.NN.SWLPYPVAGNMA...
.KHFCKV.LNQNQHENDITHNLLSDTCPRFILGLLDAGKAHLQRQ
>CD1A*01|Homo sapiens|F|EX4|C-LIKE||
.........VKPEAWLSHGPSPG.....PGHLQLVCHVSGFYP..KPVWVMWMRGEQEQQ
...GTQRGDILPSAD......GTWYLRATLEVAA.....GEAADLSCRVKHSS.......
......LEGQDIVLYW
>CD1A*01|Homo sapiens|F|EX5||
EHHSSVGFIILAVIVPLLLLIGLALWFRKR
>CD1A*01|Homo sapiens|F|EX6||
CFC
>CD1A*01|Homo sapiens|F||
MLFLLLPLLAVLPGDGNAD..GLKEPLSFHVTWIASFYNHSWKQNLVSGWLSDLQTHTWD
SN..SSTIVFLC........PWSRGNFSNEEWKELETLFRIRTIRSFEGIRRYAHELQFE
....YPFEIQVTGGCELHSG.KVSGSFLQLAYQGSDFVSFQ.NN.SWLPYPVAGNMA...
.KHFCKV.LNQNQHENDITHNLLSDTCPRFILGLLDAGKAHLQRQ.........VKPEAW
LSHGPSPG.....PGHLQLVCHVSGFYP..KPVWVMWMRGEQEQQ...GTQRGDILPSAD
......GTWYLRATLEVAA.....GEAADLSCRVKHSS.............LEGQDIVLY
WEHHSSVGFIILAVIVPLLLLIGLALWFRKRCFC
>P06126|CD1A*02|Homo sapiens|F|EX1||
MLFLLLPLLAVLPGDGNAD
>P06126|CD1A*02|Homo sapiens|F|EX2|[D1]|G-LIKE-ALPHA1||
..GLKEPLSFHVIWIASFYNHSWKQNLVSGWLSDLQTHTWDSN..SSTIVFLW.......
.PWSRGNFSNEEWKELETLFRIRTIRSFEGIRRYAHELQFE
>P06126|CD1A*02|Homo sapiens|F|EX3|[D2]|G-LIKE-ALPHA2||
....YPFEIQVTGGCELHSG.KVSGSFLQLAYQGSDFVSFQ.NN.SWLPYPVAGNMA...
.KHFCKV.LNQNQHENDITHNLLSDTCPRFILGLLDAGKAHLQRQ
>P06126|CD1A*02|Homo sapiens|F|EX4|C-LIKE||
.........VKPEAWLSHGPSPG.....PGHLQLVCHVSGFYP..KPVWVMWMRGEQEQQ
...GTQRGDILPSAD......GTWYLRATLEVAA.....GEAADLSCRVKHSS.......
......LEGQDIVLYW
>P06126|CD1A*02|Homo sapiens|F|EX5||
EHHSSVGFIILAVIVPLLLLIGLALWFRKR
>P06126|CD1A*02|Homo sapiens|F|EX6||
CFC
>P06126|CD1A*02|Homo sapiens|F||
MLFLLLPLLAVLPGDGNAD..GLKEPLSFHVIWIASFYNHSWKQNLVSGWLSDLQTHTWD
SN..SSTIVFLW........PWSRGNFSNEEWKELETLFRIRTIRSFEGIRRYAHELQFE
....YPFEIQVTGGCELHSG.KVSGSFLQLAYQGSDFVSFQ.NN.SWLPYPVAGNMA...
.KHFCKV.LNQNQHENDITHNLLSDTCPRFILGLLDAGKAHLQRQ.........VKPEAW
LSHGPSPG.....PGHLQLVCHVSGFYP..KPVWVMWMRGEQEQQ...GTQRGDILPSAD
......GTWYLRATLEVAA.....GEAADLSCRVKHSS.............LEGQDIVLY
W(E)HHSSVGFIILAVIVPLLLLIGLALWFRKR(C)FC
>CD1A*03|Homo sapiens|F|EX3|[D2]|G-LIKE-ALPHA2||
.....YPFEIQVTGGCELHSG.KVSGSFLQLAYQGSDFVSFQ.NN.SWLPYPVAGNMA..
..KHFCKV.LNQNQHENDITHNLLSDTCPRFILGLLDAGKAHLQRQ
>CD1A*03|Homo sapiens|F|EX4|C-LIKE||
.........VKPEAWLSHGPSPG.....PGHLQLVCHVSGFYP..KPVWVMWMRGEQEQQ
...GTQRGDILPSAD......GTWYLRATLEVAA.....GEAADLSCRVKHSS.......
......LEGQDIVLYW
>CD1A*03|Homo sapiens|F|EX5||
EHHSSVGFIILAVIVPLLLLIGLALWFRKR
>CD1A*03|Homo sapiens|F|EX6||
CFC
>CD1A*03|Homo sapiens|F|partial||
............................................................
............................................................
........YPFEIQVTGGCELHSG.KVSGSFLQLAYQGSDFVSFQ.NN.SWLPYPVAGNM
A....KHFCKV.LNQNQHENDITHNLLSDTCPRFILGLLDAGKAHLQRQ.........VK
PEAWLSHGPSPG.....PGHLQLVCHVSGFYP..KPVWVMWMRGEQEQQ...GTQRGDIL
PSAD......GTWYLRATLEVAA.....GEAADLSCRVKHSS.............LEGQD
IVLYWEHHSSVGFIILAVIVPLLLLIGLALWFRKRCFC
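
The records above wrap each sequence over several lines and use "." for gap positions. The following Python sketch shows one way to read such records and to obtain ungapped sequences. The function names `read_fasta` and `ungap` and the `SAMPLE` text (three records copied from this page) are illustrative assumptions, not part of any IMGT software. Only "." is removed; any other character, such as the parentheses that appear in one record above, is left untouched.

```python
from typing import Iterable, Iterator, List, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs, joining wrapped sequence lines."""
    header = None
    chunks: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def ungap(sequence: str, gap: str = ".") -> str:
    """Remove gap characters; all other characters are kept as they are."""
    return sequence.replace(gap, "")


# Three records copied from the listing above.
SAMPLE = """\
>CD1A*01|Homo sapiens|F|EX2|[D1]|G-LIKE-ALPHA1||
..GLKEPLSFHVTWIASFYNHSWKQNLVSGWLSDLQTHTWDSN..SSTIVFLC.......
.PWSRGNFSNEEWKELETLFRIRTIRSFEGIRRYAHELQFE
>CD1A*01|Homo sapiens|F|EX5||
EHHSSVGFIILAVIVPLLLLIGLALWFRKR
>CD1A*01|Homo sapiens|F|EX6||
CFC
"""

records = list(read_fasta(SAMPLE.splitlines()))

if __name__ == "__main__":
    for header, gapped in records:
        print(header, len(gapped), len(ungap(gapped)))
```

Keeping the gapped string alongside the ungapped one preserves the alignment positions, which matters if sequences of different alleles are to be compared column by column.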