The NCBI BLAST family of programs includes:
Size | Description | Updated on |
---|---|---|
246,719 sequences; 316,095,030 total bases | IMGT/LIGM-DB nucleotide sequences | 2023-03-19 |
515 sequences; 219,500,563 total bases | IMGT/LIGM-DB nucleotide sequences (length >100 kb) | 2023-03-19 |
12,786 sequences; 2,839,543 total bases | IMGT/GENE-DB nucleotide sequences (F+ORF+inframeP) | 2023-03-19 |
16,065 sequences; 3,718,564 total bases | IMGT/GENE-DB nucleotide sequences (F+ORF+allP) | 2023-03-19 |
26,586 sequences; 5,735,697 total residues | IMGT/2Dstructure-DB and IMGT/3Dstructure-DB amino acid chain sequences | 2023-03-19 |
2,167 sequences; 655,679 total residues | IMGT/2Dstructure-DB amino acid chain sequences | 2023-03-19 |
24,419 sequences; 5,080,018 total residues | IMGT/3Dstructure-DB amino acid chain sequences | 2023-03-19 |
52,597 sequences; 5,489,677 total residues | IMGT/2Dstructure-DB and IMGT/3Dstructure-DB amino acid domain sequences | 2023-03-19 |
18,387 sequences; 1,716,107 total residues | IMGT/DomainDisplay amino acid reference sequences | 2023-03-19 |
When protein aligned to the nucleotide there are 6 possibilities of match at any point. In OOF alignment - upper sequence is DNAP - 3-frame translated DNA. Lower sequence is protein. At any position next protein base may be aligned to 6 possible bases in DNAP:
(TBO - traditional blast output)
0: 3 nucleotides missing - gap (TBO notation "-")
OOF alignment with DNAP: DTRGGDTPQKSVFSRAQNTLWGERGDTQKRGGAQRGDIFSLWGG-GVLCV | | | | | | | | | | | | | | | | | D G T K F A T G G Q G Q D S G K V V TBO: DGTKFATGGQGQDSG-VV DGTKFATGGQGQDSG VV DGTKFATGGQGQDSGKVV1: 2 nucleotides missing - "frameshift -2" (TBO notation "\\")
OOF alignment with DNAP: DTRGGDTPQKSVFSRAQNTLWGERGDTQKRGGAQRGDIFSLWGGGGVLCV | | | | | | | | | | | | | | |/ | | D G T K F A T G G Q G Q D S GK V V TBO: DGTKFATGGQGQDSG\\GVV DGTKFATGGQGQDSG VV DGTKFATGGQGQDSG KVV2: 1 nucletide missing - "frameshift -1" (TBO notation "\")
OOF alignment with DNAP: DTRGGDTPQKSVFSRAQNTLWGERGDTQKRGGAQRGDIFSLWGGERGV | | | | | | | | | | | | | | / | | D G T K F A T G G Q G Q D S G K V TBO: DGTKFATGGQGQDS\GEV DGTKFATGGQGQDS G V DGTKFATGGQGQDS GKV3: Complete match
OOF alignment with DNAP: DTRGGDTPQKSVFSRAQNTLWGERGDTQKRGGAQRGDIFSLWGGEKRGV | | | | | | | | | | | | | | | | | D G T K F A T G G Q G Q D S G K V TBO: DGTKFATGGQGQDSGKV DGTKFATGGQGQDSGKV DGTKFATGGQGQDSGKV4: 1 nucleotide insertion - "frameshift +1" (TBO notation "/")
OOF alignment with DNAP: DTRGGDTPQKSVFSRAQNTLWGERGDTQKRGGAQRGDIFSLWGGVEKRGV | | | | | | | | | | | | | | | \ D G T K F A T G G Q G Q D S G K V TBO: DGTKFATGGQGQDSG/KV DGTKFATGGQGQDSG KV DGTKFATGGQGQDSG KV5: 2 nucleotides insertion - "frameshift +2" (TBP notation "//")
OOF alignment with DNAP: DTRGGDTPQKSVFSRAQNTLWGERGDTQKRGGAQRGDIFSLFLWGGEKRGV | | | | | | | | | | | | | | \ | | D G T K F A T G G Q G Q D S G K V TBO: DGTKFATGGQGQDS//GKV DGTKFATGGQGQDS GKV DGTKFATGGQGQDS GKV
-G Cost to open a gap [Integer] default = 5 -E Cost to extend a gap [Integer] default = 2 -q Penalty for a mismatch in the blast portion of run [Integer] default = -3 -r Reward for a match in the blast portion of run [Integer] default = 1 -e Expectation value (E) [Real] default = 10.0 -W Word size, default is 11 for blastn, 3 for other programs. -v Number of one-line descriptions (V) [Integer] default = 100 -b Number of alignments to show (B) [Integer] default = 100
-G Cost to open a gap [Integer] default = 11 -E Cost to extend a gap [Integer] default = 1 -e Expectation value (E) [Real] default = 10.0 -W Word size, default is 11 for blastn, 3 for other programs. -v Number of one-line descriptions (V) [Integer] default = 100 -b Number of alignments to show (B) [Integer] default = 100 Limited values for gap existence and extension are supported for these three programs. Some supported and suggested values are: Existence Extension 10 1 10 2 11 1 8 2 9 2