The NCBI BLAST family of programs includes:
Size | Description | Updated on |
---|---|---|
251,285 sequences; 415,693,160 total bases | IMGT/LIGM-DB nucleotide sequences | 2025-03-16 |
640 sequences; 327,909,921 total bases | IMGT/LIGM-DB nucleotide sequences (length >100 kb) | 2025-03-16 |
15,703 sequences; 3,577,577 total bases | IMGT/GENE-DB nucleotide sequences (F+ORF+inframeP) | 2025-03-16 |
20,288 sequences; 4,797,856 total bases | IMGT/GENE-DB nucleotide sequences (F+ORF+allP) | 2025-03-16 |
27,551 sequences; 6,047,730 total residues | IMGT/2Dstructure-DB and IMGT/3Dstructure-DB amino acid chain sequences | 2025-03-16 |
2,914 sequences; 918,603 total residues | IMGT/2Dstructure-DB amino acid chain sequences | 2025-03-16 |
24,637 sequences; 5,129,127 total residues | IMGT/3Dstructure-DB amino acid chain sequences | 2025-03-16 |
55,494 sequences; 5,783,343 total residues | IMGT/2Dstructure-DB and IMGT/3Dstructure-DB amino acid domain sequences | 2025-03-16 |
20,558 sequences; 1,929,379 total residues | IMGT/DomainDisplay amino acid reference sequences | 2025-03-16 |
When protein aligned to the nucleotide there are 6 possibilities of match at any point. In OOF alignment - upper sequence is DNAP - 3-frame translated DNA. Lower sequence is protein. At any position next protein base may be aligned to 6 possible bases in DNAP:
(TBO - traditional blast output)
0: 3 nucleotides missing - gap (TBO notation "-")
OOF alignment with DNAP: DTRGGDTPQKSVFSRAQNTLWGERGDTQKRGGAQRGDIFSLWGG-GVLCV | | | | | | | | | | | | | | | | | D G T K F A T G G Q G Q D S G K V V TBO: DGTKFATGGQGQDSG-VV DGTKFATGGQGQDSG VV DGTKFATGGQGQDSGKVV1: 2 nucleotides missing - "frameshift -2" (TBO notation "\\")
OOF alignment with DNAP: DTRGGDTPQKSVFSRAQNTLWGERGDTQKRGGAQRGDIFSLWGGGGVLCV | | | | | | | | | | | | | | |/ | | D G T K F A T G G Q G Q D S GK V V TBO: DGTKFATGGQGQDSG\\GVV DGTKFATGGQGQDSG VV DGTKFATGGQGQDSG KVV2: 1 nucletide missing - "frameshift -1" (TBO notation "\")
OOF alignment with DNAP: DTRGGDTPQKSVFSRAQNTLWGERGDTQKRGGAQRGDIFSLWGGERGV | | | | | | | | | | | | | | / | | D G T K F A T G G Q G Q D S G K V TBO: DGTKFATGGQGQDS\GEV DGTKFATGGQGQDS G V DGTKFATGGQGQDS GKV3: Complete match
OOF alignment with DNAP: DTRGGDTPQKSVFSRAQNTLWGERGDTQKRGGAQRGDIFSLWGGEKRGV | | | | | | | | | | | | | | | | | D G T K F A T G G Q G Q D S G K V TBO: DGTKFATGGQGQDSGKV DGTKFATGGQGQDSGKV DGTKFATGGQGQDSGKV4: 1 nucleotide insertion - "frameshift +1" (TBO notation "/")
OOF alignment with DNAP: DTRGGDTPQKSVFSRAQNTLWGERGDTQKRGGAQRGDIFSLWGGVEKRGV | | | | | | | | | | | | | | | \ D G T K F A T G G Q G Q D S G K V TBO: DGTKFATGGQGQDSG/KV DGTKFATGGQGQDSG KV DGTKFATGGQGQDSG KV5: 2 nucleotides insertion - "frameshift +2" (TBP notation "//")
OOF alignment with DNAP: DTRGGDTPQKSVFSRAQNTLWGERGDTQKRGGAQRGDIFSLFLWGGEKRGV | | | | | | | | | | | | | | \ | | D G T K F A T G G Q G Q D S G K V TBO: DGTKFATGGQGQDS//GKV DGTKFATGGQGQDS GKV DGTKFATGGQGQDS GKV
-G Cost to open a gap [Integer] default = 5 -E Cost to extend a gap [Integer] default = 2 -q Penalty for a mismatch in the blast portion of run [Integer] default = -3 -r Reward for a match in the blast portion of run [Integer] default = 1 -e Expectation value (E) [Real] default = 10.0 -W Word size, default is 11 for blastn, 3 for other programs. -v Number of one-line descriptions (V) [Integer] default = 100 -b Number of alignments to show (B) [Integer] default = 100
-G Cost to open a gap [Integer] default = 11 -E Cost to extend a gap [Integer] default = 1 -e Expectation value (E) [Real] default = 10.0 -W Word size, default is 11 for blastn, 3 for other programs. -v Number of one-line descriptions (V) [Integer] default = 100 -b Number of alignments to show (B) [Integer] default = 100 Limited values for gap existence and extension are supported for these three programs. Some supported and suggested values are: Existence Extension 10 1 10 2 11 1 8 2 9 2