Legend:
Allele names | Gene functionality | IMGT reference sequences | ||
---|---|---|---|---|
Exons | Accession numbers | Molecule type | ||
CEACAM1*01 | F | EX1-5, EX7-10 | AC004785 | gDNA |
CEACAM1*02 | F | EX1-4, EX6-10 | D12502 [1] | cDNA (1) splicing B |
IMGT reference sequences (in FASTA format) for the allele(s): CEACAM1*01 to CEACAM1*02
Allele names | Gene functionality | IMGT reference sequences | ||
---|---|---|---|---|
Exons | Accession numbers | Molecule type | ||
CEACAM1*01 | F | EX1-5, EX7-10 | J03858 | cDNA |
X16354 | cDNA | |||
EX1-5, EX7, EX9 | A43165 | gDNA Splicing C | ||
EX1-4, EX7-10 | X14831 | cDNA Splicing D | ||
EX1-2, EX7-10 | M76742 | cDNA Splicing E | ||
EX1-4, EX7, EX9 | AY766113 | cDNA Splicing F | ||
EX1-3, EX4L | M69176 | cDNA Splicing G | ||
EX1-4, EX5L | D90311 | cDNA Splicing H | ||
EX1-4, EX5p | D90313 | cDNA Splicing I | ||
EX1 | X67277 | gDNA | ||
EX1-5, EX7-10 | NM_001712 | cDNA | ||
EX1-5, EX7, EX9 | NM_001024912 | cDNA |
Allele Name | Accession numbers | Number of amino acids | Protein isoform | |
---|---|---|---|---|
Nucleotide databases | Protein databases | |||
CEACAM1*01 | J03858 | P13688 | 526aa | 1 |
NM_001712 | NP_001703 | 526aa | 1 | |
NM_001024912 | NP_001020083 | 464aa | 2 |
The coding region (CODING-REGION)
sequence starts from INIT-CODON
(or its encoded amino acid) to STOP-CODON
(not included).
The nucleotide sequence is extracted from cDNA (c) sequences and/or is
built by artificial exon joining from genomic DNA (g) sequences.
The amino acid sequence is the translation of the nucleotide sequence.
CEACAM1*01: AC004785(g)
1 atggggcacc tctcagcccc acttcacaga gtgcgtgtac cctggcaggg gcttctgctc 61 acagcctcac ttctaacctt ctggaacccg cccaccactg cccagctcac tactgaatcc 121 atgccattca atgttgcaga ggggaaggag gttcttctcc ttgtccacaa tctgccccag 181 caactttttg gctacagctg gtacaaaggg gaaagagtgg atggcaaccg tcaaattgta 241 ggatatgcaa taggaactca acaagctacc ccagggcccg caaacagcgg tcgagagaca 301 atatacccca atgcatccct gctgatccag aacgtcaccc agaatgacac aggattctac 361 accctacaag tcataaagtc agatcttgtg aatgaagaag caactggaca gttccatgta 421 tacccggagc tgcccaagcc ctccatctcc agcaacaact ccaaccctgt ggaggacaag 481 gatgctgtgg ccttcacctg tgaacctgag actcaggaca caacctacct gtggtggata 541 aacaatcaga gcctcccggt cagtcccagg ctgcagctgt ccaatggcaa caggaccctc 601 actctactca gtgtcacaag gaatgacaca ggaccctatg agtgtgaaat acagaaccca 661 gtgagtgcga accgcagtga cccagtcacc ttgaatgtca cctatggccc ggacaccccc 721 accatttccc cttcagacac ctattaccgt ccaggggcaa acctcagcct ctcctgctat 781 gcagcctcta acccacctgc acagtactcc tggcttatca atggaacatt ccagcaaagc 841 acacaagagc tctttatccc taacatcact gtgaataata gtggatccta tacctgccac 901 gccaataact cagtcactgg ctgcaacagg accacagtca agacgatcat agtcactgag 961 ctaagtccag tagtagcaaa gccccaaatc aaagccagca agaccacagt cacaggagat 1021 aaggactctg tgaacctgac ctgctccaca aatgacactg gaatctccat ccgttggttc 1081 ttcaaaaacc agagtctccc gtcctcggag aggatgaagc tgtcccaggg caacaccacc 1141 ctcagcataa accctgtcaa gagggaggat gctgggacgt attggtgtga ggtcttcaac 1201 ccaatcagta agaaccaaag cgaccccatc atgctgaacg taaactataa tgctctacca 1261 caagaaaatg gcctctcacc tggggccatt gctggcattg tgattggagt agtggccctg 1321 gttgctctga tagcagtagc cctggcatgt tttctgcatt tcgggaagac cggcagggca 1381 agcgaccagc gtgatctcac agagcacaaa ccctcagtct ccaaccacac tcaggaccac 1441 tccaatgacc cacctaacaa gatgaatgaa gttacttatt ctaccctgaa ctttgaagcc 1501 cagcaaccca cacaaccaac ttcagcctcc ccatccctaa cagccacaga aataatttat 1561 tcagaagtaa aaaagcagta a
1 MGHLSAPLHR VRVPWQGLLL TASLLTFWNP PTTAQLTTES MPFNVAEGKE VLLLVHNLPQ 61 QLFGYSWYKG ERVDGNRQIV GYAIGTQQAT PGPANSGRET IYPNASLLIQ NVTQNDTGFY 121 TLQVIKSDLV NEEATGQFHV YPELPKPSIS SNNSNPVEDK DAVAFTCEPE TQDTTYLWWI 181 NNQSLPVSPR LQLSNGNRTL TLLSVTRNDT GPYECEIQNP VSANRSDPVT LNVTYGPDTP 241 TISPSDTYYR PGANLSLSCY AASNPPAQYS WLINGTFQQS TQELFIPNIT VNNSGSYTCH 301 ANNSVTGCNR TTVKTIIVTE LSPVVAKPQI KASKTTVTGD KDSVNLTCST NDTGISIRWF 361 FKNQSLPSSE RMKLSQGNTT LSINPVKRED AGTYWCEVFN PISKNQSDPI MLNVNYNALP 421 QENGLSPGAI AGIVIGVVAL VALIAVALAC FLHFGKTGRA SDQRDLTEHK PSVSNHTQDH 481 SNDPPNKMNE VTYSTLNFEA QQPTQPTSAS PSLTATEIIY SEVKKQ*
Legend: