Here you are: IMGT Web resources > IMGT Scientific chart > 1. Sequence description

IMGT reference sequences

Definition and characteristics

Definition

IMGT reference sequences are chosen on the basis of one or, whenever possible, several of the following criteria:

For the immunoglobulins (IG) and T cell receptors (TR), IMGT reference sequences are defined for the germline V-GENEs, D-GENEs, J-GENEs, and for the C-GENEs.

Characteristics

Characteristics of the IMGT reference sequences are according to the IMGT-ONTOLOGY concepts.

Presentation

The presentation of the IMGT reference sequences is of three kinds:

IMGT/LIGM-DB reference sequences

They correspond to IMGT/LIGM-DB accession numbers of which any part of the sequence has been defined as IMGT reference sequence for (a) given gene(s). The IMGT/LIGM-DB reference sequences can be accessed from:

IMGT/GENE-DB reference sequences

The IMGT/GENE-DB sequences correspond to the coding region sequences of the Functional or ORF genes (V-REGION, D-REGION, J-REGION, C-REGION), isolated from the IMGT/LIGM-DB sequences. By definition, there is one sequence for each Functional or ORF allele. If the C-REGION is encoded by several exons, the sequence is given by exon.

IMGT/GENE-DB reference sequences are provided in FASTA format:

In order to facilitate the search of expressed (spliced) sequences by BLAST on IMGT/LIGM-DB, and to increase interoperability with Genew and external generalist expression databases, IMGT/GENE-DB reference sequences will also be provided, if there are several exons, with the exons being artificially joined.

Interoperability with genome databases:

IMGT reference directory sequences

The IMGT reference directory sequences correspond to sequence fragments according to IMGT Labels, isolated from the Functional and ORF IMGT/LIGM-DB reference sequences, in which gaps are inserted according to the IMGT unique numbering ('NUMEROTATION' concept of IMGT-ONTOLOGY).

By definition, the IMGT reference directory sets contain one sequence for each allele. Allele names of these sequences are shown in red in Alignments of alleles.

Sets of the IMGT reference directory are used in IMGT/V-QUEST and other IMGT tools. All IMGT reference directory sets can be downloaded in FASTA format.