Tables

Line number  Name            Sequence                 Annotation               Comment
1            alpha-D         ATGCTGACCGACTCTGACAA...  (EEEEEEEEEEEEEEEEEEE...  /gene="alpha-D"/codo...
2            alpha-A         ATGGTGCTGTCTGCCAACGA...  (EEEEEEEEEEEEEEEEEEE...  /gene="alpha-A"/codo...
3            CMGLOAD_143     ATGCTGACCGCCGAGGACAA...  (EEEEEEEEEEEEEEEEEEE...  /codon_start=1/produ...
4            CIIHBADA2_367   ATGGTGCTGTCTGCGGCTGA...  (EEEEEEEEEEEEEEEEEEE...  /note="alpha-A globi...
5            GOTHBAI_917     ATGGTGCTGTCTGCCGCCGA...  (EEEEEEEEEEEEEEEEEEE...  /note="alpha-i globi...
6            GOTHBAII_745    ATGGTGCTGTCTGCCGCCGA...  (EEEEEEEEEEEEEEEEEEE...  /note="alpha-ii glob...
7            ESGLOB01_132    ATGGTGCTGTCTGCCGCCGA...  (EEEEEEEEEEEEEEEEEEE...  /codon_start=1/produ...
8            ECPZA2GL_3481   ATGGTGCTGTCTGCCGCCGA...  (EEEEEEEEEEEEEEEEEEE...  /codon_start=1/produ...
9            AF098919_17811  ATGGCACTGACCCAAGCTGA...  (EEEEEEEEEEEEEEEEEEE...  /codon_start=1/produ...
10           AF098919_21360  ATGCTGACTGCCGAGGACAA...  (EEEEEEEEEEEEEEEEEEE...  /codon_start=1/produ...
11           AF098919_24360  ATGGTGCTGTCCGCTGCTGA...  (EEEEEEEEEEEEEEEEEEE...  /codon_start=1/produ...

Table 1: Example output - overall file structure

Each line represents one individual feature and contains four tab-separated fields: name, sequence, annotation and comment. In this example the extracted features are protein-coding genes (CDS) from the following GenBank entries: AB001981, X01831, J00923, J00043, J00044, X01086, X07053, AF098919.
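A minimal sketch of how such a file could be read, assuming one feature per line and exactly four tab-separated fields as described above. The names Feature and parse_features are illustrative and are not part of the tool; the record at the end is a shortened, made-up line used only to exercise the parser.

```python
from typing import Iterable, Iterator, NamedTuple


class Feature(NamedTuple):
    """One extracted feature (hypothetical container, not part of the tool)."""
    name: str
    sequence: str
    annotation: str
    comment: str


def parse_features(lines: Iterable[str]) -> Iterator[Feature]:
    """Yield one Feature per non-empty line of tab-separated output."""
    for line_number, line in enumerate(lines, start=1):
        line = line.rstrip("\r\n")
        if not line:
            continue  # skip blank lines
        fields = line.split("\t")
        if len(fields) != 4:
            raise ValueError(
                f"line {line_number}: expected 4 fields, found {len(fields)}"
            )
        yield Feature(*fields)


# Shortened, made-up record; real lines carry the full sequence and comment.
example = 'alpha-D\tATGCTGACC\t(EEEEEEE)\t/gene="alpha-D"/codon_start=1\n'
for feature in parse_features([example]):
    print(feature.name, len(feature.sequence), feature.comment)
```

Passing an open file object to parse_features works the same way, since a file iterates over its lines. Raising an error on a wrong field count is a design choice of this sketch; a more lenient reader could skip such lines instead.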

 

For readability, the fields have been truncated after 20 characters.
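The truncation used in Table 1 can be reproduced with a small helper. This is only a sketch for display purposes: the function name truncate and the trailing "..." marker are inferred from the table, not taken from the tool itself.

```python
def truncate(field: str, width: int = 20) -> str:
    """Shorten a field to `width` characters, marking the cut with '...'."""
    if len(field) <= width:
        return field  # short fields are shown unchanged
    return field[:width] + "..."


print(truncate("alpha-D"))                        # alpha-D
print(truncate('/gene="alpha-D"/codon_start=1'))  # /gene="alpha-D"/codo...
```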

 

Field

Contents

Name

alpha-D

Sequence

ATGCTGACCGACTCTGACAAGAAGCTGGTCCTGCAGGTGTGGGAGAAGGTGATCCGCCACCCAGACTGTGGAGCCGAGG
CCCTGGAGAGGTGCGGGCTGAGCTTGGGGAAACCATGGGCAAGGGGGGCGACTGGGTGGGAGCCCTACAGGGCTGCTGG
GGGTTGTTCGGCTGGGGGTCAGCACTGACCATCCCGCTCCCGCAGCTGTTCACCACCTACCCCCAGACCAAGACCTACT
TCCCCCACTTCGACTTGCACCATGGCTCCGACCAGGTCCGCAACCACGGCAAGAAGGTGTTGGCCGCCTTGGGCAACGC
TGTCAAGAGCCTGGGCAACCTCAGCCAAGCCCTGTCTGACCTCAGCGACCTGCATGCCTACAACCTGCGTGTCGACCCT
GTCAACTTCAAGGCAGGCGGGGGACGGGGGTCAGGGGCCGGGGAGTTGGGGGCCAGGGACCTGGTTGGGGATCCGGGGC
CATGCCGGCGGTACTGAGCCCTGTTTTGCCTTGCAGCTGCTGGCGCAGTGCTTCCACGTGGTGCTGGCCACACACCTGG
GCAACGACTACACCCCGGAGGCACATGCTGCCTTCGACAAGTTCCTGTCGGCTGTGTGCACCGTGCTGGCCGAGAAGTA
CAGATAA

Annotation

(EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
EEEEEEEEE)DIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII
IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIA(EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
EEEEEEEEEEE)DIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII
IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIA(EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
EEEEEE)

Comment

/gene="alpha-D"/codon_start=1/product="alpha-D globin"/protein_id="BAA19668.1"/
db_xref="GI:1943997"/translation="MLTDSDKKLVLQVWEKVIRHPDCGAEALERLFTTYPQTKTYFPHF
DLHHGSDQVRNHGKKVLAALGNAVKSLGNLSQALSDLSDLHAYNLRVDPVNFKLLAQCFHVVLATHLGNDYTPEAHAAF
DKFLSAVCTVLAEKYR" /GenBank_acc="AB001981"; /Source="Columba livia (domestic pig
eon)"; /feature_type="CDS"; /strand="+"; /spliced_product="atgctgaccgactctgacaa
gaagctggtcctgcaggtgtgggagaaggtgatccgccacccagactgtggagccgaggccctggagaggctgttcacc
acctacccccagaccaagacctacttcccccacttcgacttgcaccatggctccgaccaggtccgcaaccacggcaaga
aggtgttggccgccttgggcaacgctgtcaagagcctgggcaacctcagccaagccctgtctgacctcagcgacctgca
tgcctacaacctgcgtgtcgaccctgtcaacttcaagctgctggcgcagtgcttccacgtggtgctggccacacacctg
ggcaacgactacaccccggaggcacatgctgccttcgacaagttcctgtcggctgtgtgcaccgtgctggccgagaagt
acagataa"; /spliced_annotation="(EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE)(EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
EEEEEEEEE)(EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE)";

 

 

Table 2: Example output - field details

Detailed example of data extracted from the GenBank entry AB001981 (first CDS).
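The comment and annotation fields shown in Table 2 can be broken down further. The sketch below is based only on the example row: it assumes that comment qualifiers have the form /key="value" or /key=value, and it summarises the annotation string as runs of identical symbols without assigning a meaning to any symbol, since the symbols are not defined in this section. The names QUALIFIER, parse_comment and annotation_runs are hypothetical.

```python
import re
from itertools import groupby

# Matches /key="quoted value" or /key=bare_value. The pattern is an
# assumption derived from the example row, not a documented grammar.
QUALIFIER = re.compile(r'/(\w+)=(?:"([^"]*)"|([^/;"\s]+))')


def parse_comment(comment: str) -> dict:
    """Map qualifier names to values; the first occurrence of a key wins."""
    qualifiers = {}
    for key, quoted, bare in QUALIFIER.findall(comment):
        qualifiers.setdefault(key, quoted if quoted else bare)
    return qualifiers


def annotation_runs(annotation: str) -> list:
    """Collapse an annotation string into (symbol, run_length) pairs."""
    return [(symbol, len(list(run))) for symbol, run in groupby(annotation)]


# Shortened values modelled on Table 2.
comment = ('/gene="alpha-D"/codon_start=1/product="alpha-D globin" '
           '/GenBank_acc="AB001981"; /feature_type="CDS"; /strand="+";')
print(parse_comment(comment)["product"])   # alpha-D globin
print(annotation_runs("(EEE)DIIA(EE)"))
```

Quoted values that themselves contain a double quote are not handled by this pattern and would need a stricter parser.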