Sequence submission for IMGT/LIGM-DB
IMGT/LIGM-DB sequence data are identified by the EMBL/GenBank/DDBJ accession number.
Immunoglobulin (IG) and T cell receptor (TR) sequences of all vertebrates should be
submitted to EMBL-Bank ENA (Webin), GenBank (BankIt) or DDBJ.
Authors are encouraged to check their sequences, if appropriate, with
before submitting them to EMBL, GenBank or DDBJ.
Data flow and IMGT/LIGM-DB annotations
The unique source of data for IMGT/LIGM-DB is EMBL which shares data with the other
two generalist databases GenBank and DDBJ.
IMGT/LIGM-DB contains the IG and TR sequences which are allowed by the
authors to be made public.
LIGM automatically retrieves by FTP IG and TR sequences
from EBI, the European Bioinformatics Institute. After control by LIGM curators, data
are scanned to store sequences, bibliographical references and taxonomic data, and
standardized IMGT/LIGM-DB keywords are assigned to all entries. Based on expert analysis,
specific detailed annotations are added to the IMGT/LIGM-DB flat files, according to the
IMGT Scientific chart rules, based on IMGT-ONTOLOGY.
Since August 1996, the IMGT/LIGM-DB content closely follows the EMBL one for the IG and
TR, with the following advantages: IMGT/LIGM-DB does not contain sequences which have
previously been wrongly assigned to IG and TR; conversely, IMGT/LIGM-DB contains IG and
TR entries which have disappeared from the generalist databases, as examples:
- the L36092 accession number which encompasses the complete human TRB locus is still present
in IMGT/LIGM-DB, whereas it has been deleted from EMBL/GeneBank/DDBJ due to its too large
size (684,973 bp)
- in 1999, IMGT/LIGM-DB detected the disappearance of
20 IG and TR sequences which inadvertently
had been lost by GenBank, and allowed the recuperation of these sequences in the generalist
databases (see below).
List of the rescued IG and TR sequences by LIGM
List of the accession numbers of the 20 IG and TR sequences for which the IMGT team
noticed the disappearance from the GenBank release, in February 2000
(e-mail from Marie-Paule Lefranc to Jim Ostell, on the 11/02/2000).
These sequences were also deleted from EMBL and DDBJ.
Except for Z75464, which was rightly deleted, the flat files of the other sequences were
reintroduced in GenBank and in EMBL/DDBJ in March-April 2000.
Marie-Paule Lefranc, IMGT founder and director