LOCUS AB001981 5891 bp DNA linear VRT 25-DEC-2002 DEFINITION Columba livia DNA for alpha-D globin, alpha-A globin. ACCESSION AB001981 VERSION AB001981.1 GI:1943996 KEYWORDS alpha-D globin; alpha-D; alpha-A globin; alpha-A. SOURCE Columba livia (domestic pigeon) ORGANISM Columba livia Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Archosauria; Aves; Neognathae; Columbiformes; Columbidae; Columba. REFERENCE 1 AUTHORS Ikehara,T., Eguchi,Y., Kayo,S. and Takei,H. TITLE Isolation and sequencing of two alpha-globin genes alpha(A) and alpha(D) in pigeon and evidence for embryo-specific expression of the alpha(D)-globin gene JOURNAL Biochem. Biophys. Res. Commun. 234 (2), 450-453 (1997) PUBMED 9177291 REFERENCE 2 (bases 1 to 5891) AUTHORS Ikehara,T., Eguchi,Y., Kayo,S. and Takei,H. TITLE Direct Submission JOURNAL Submitted (19-MAR-1997) Tsuyoshi Ikehara, University of the Ryukyus, Department of Biochemistry,Faculty of Medicine; Okinawa, Nishihara-cho, Uehara,207, Nishihara-cho 903-01, Japan (E-mail:k940401@med.u-ryukyu.ac.jp, Tel:098-895-3331) FEATURES Location/Qualifiers source 1..5891 /organism="Columba livia" /mol_type="genomic DNA" /db_xref="taxon:8932" gene join(1104..1192,1306..1510,1614..1742) /gene="alpha-D" CDS join(1104..1192,1306..1510,1614..1742) /gene="alpha-D" /codon_start=1 /product="alpha-D globin" /protein_id="BAA19668.1" /db_xref="GI:1943997" /translation="MLTDSDKKLVLQVWEKVIRHPDCGAEALERLFTTYPQTKTYFPH FDLHHGSDQVRNHGKKVLAALGNAVKSLGNLSQALSDLSDLHAYNLRVDPVNFKLLAQ CFHVVLATHLGNDYTPEAHAAFDKFLSAVCTVLAEKYR" exon 1104..1192 /gene="alpha-D" /number=1 intron 1193..1305 /gene="alpha-D" /number=1 exon 1306..1510 /gene="alpha-D" /number=2 intron 1511..1613 /gene="alpha-D" /number=2 exon 1614..>1742 /gene="alpha-D" /number=3 gene join(4915..5009,5165..5369,5474..5602) /gene="alpha-A" CDS join(4915..5009,5165..5369,5474..5602) /gene="alpha-A" /codon_start=1 /product="alpha-A globin" /protein_id="BAA19669.1" /db_xref="GI:1943998" /translation="MVLSANDKSNVKAVFGKIGGQAGDLGGEALERLFITYPQTKTYF PHFDLSHGSAQIKGHGKKVAEALVEAANHIDDIAGALSKLSDLHAQKLRVDPVNFKLL GHCFLVVVAVHFPSLLTPEVHASLDKFVCAVGTVLTAKYR" exon <4915..5009 /gene="alpha-A" /number=1 intron 5010..5164 /gene="alpha-A" /number=1 exon 5165..5369 /gene="alpha-A" /number=2 intron 5370..5473 /gene="alpha-A" /number=2 exon 5474..>5602 /gene="alpha-A" /number=3 ORIGIN 1 cgatcaggtt acatttactg cccatgcctg tctcagagga attctgacac gaaaaggtgg 61 gcacaaattc ttaagcacac tctgatggta caacgtgagc tggcactaca agctgtgttc 121 ctcatcccgt ttacaaaatt ttgagactgt gtttgggcaa gggggagaga gacagtgcag 181 aagctctgaa gccactgaat ttctctaaat gtgtttagag aagcttttca aacatgctac 241 atttgtgggt ctcaagtaac ccgaacattt aaatccacag ctgaggtggt aggtcagcaa 301 ctggtgtatc cctaactcca agtttgtacc aaagacttta acagaggctg agtgaaggat 361 gtgacagtca ccagccacca tatgccctgc caaatgtccc agtgtttaca gagggatagc 421 aacacacact ggggtgggaa gaaggaaaga agaccaggcc tgacaagcat cacaggatgg 481 attttgggaa gactatggac ctgaaaagga gattcttccc cactcaggtc tctccaggat 541 gctggggaga tgctgtttcc tgtggtagat ccccagcatg aaccaggagg gcatgtccgt 601 ggctcctgct ctgaggctca cagtgtcttt gggtggagag gggatggatg cactggggtc 661 tggaggacat gagggactgg gggcgctcgt gggatctact ctgacacctg cagagacagg 721 gaggaccctg gcctggccag aagggaatgg tggatcccaa caggaagctt gaggatatgc 781 aggtttgtga ggccgaggct gtggcacccg tgggacatgc cgatggctgc tgttgaccat 841 ggggcagctc agccaagtgc tgcccccagc ccccagccca gcgtggggct ggtgcagtgc 901 ggcacatcag ggcagggcag ccgccccatt ggggccccct cggggctggg cctcccaggg 961 cagtcggggc cccctgaggc agtggccccc caccccttgg tgccgataag ataacgctgg 1021 ggcggaggtg ccgaccacta taagaggatg tcctggtggg ccctgctacc actgagccct 1081 gaccgccacc cccagccgcc accatgctga ccgactctga caagaagctg gtcctgcagg 1141 tgtgggagaa ggtgatccgc cacccagact gtggagccga ggccctggag aggtgcgggc 1201 tgagcttggg gaaaccatgg gcaagggggg cgactgggtg ggagccctac agggctgctg 1261 ggggttgttc ggctgggggt cagcactgac catcccgctc ccgcagctgt tcaccaccta 1321 cccccagacc aagacctact tcccccactt cgacttgcac catggctccg accaggtccg 1381 caaccacggc aagaaggtgt tggccgcctt gggcaacgct gtcaagagcc tgggcaacct 1441 cagccaagcc ctgtctgacc tcagcgacct gcatgcctac aacctgcgtg tcgaccctgt 1501 caacttcaag gcaggcgggg gacgggggtc aggggccggg gagttggggg ccagggacct 1561 ggttggggat ccggggccat gccggcggta ctgagccctg ttttgccttg cagctgctgg 1621 cgcagtgctt ccacgtggtg ctggccacac acctgggcaa cgactacacc ccggaggcac 1681 atgctgcctt cgacaagttc ctgtcggctg tgtgcaccgt gctggccgag aagtacagat 1741 aagccatcgc tcgtgccgaa gtgccgtcaa taaagacacc tttgctgcag catcgtgtcc 1801 gtctgtgctg gggccaggga cctgggtggg ctgtgctcct gtgggaggga gggaggccgt 1861 ggaacagggg gcagcactgg ccatgggttg cctgggtgcc ccaccaagag ccttgcccac 1921 ttccacaccc ccctctccag cttgggatgt ggctgatggt ggtagcaggg ccagagcgat 1981 ggagctcagc ctgtcacctc gccatgcctg caccctctgg ggagcgggag ctaaagatga 2041 agacaagcgg ttcttgcatc cccttgaagg actctccagg agggtagaat taatccttcc 2101 tggctccatc tatcatgact gttgtttagg actggggaag ctgttggagc tgagtgcctg 2161 catgagctca aaggcagtca gaaaagttta tggagtgaaa aacatccacc acagactatt 2221 aagcacaaac agtgttcttg ggctcaagcc cctgagccac aaatcccagc aactgccagg 2281 gaagtgcccc catacctgcc ctgcgtgctg gtttggctga acgaaggtaa ctttccatga 2341 ggttagttaa ttttcttatt gatagttagc atgggccttt gtttttgaat gtagtttgag 2401 aacaataata tagcatcttg ggacaaagtt aatgttttct tccagtaact ctccagtgtt 2461 tgggacttgg catgaaagaa atgtaagaac tcactgatgt tgatacttat tgatagggaa 2521 gccaaggtca tccctcagtt tctcatgttt tgcaagtaag caggtatata atgcagggag 2581 gaaaggatcc aggacagctg actcaggttg gccaacataa gtattcaata ccatgaatat 2641 atttaagttc catatttaag caagaaattt ctgggaacag ctctttcctc aacggccgcc 2701 gtccctgggg gggtcctgct cctgtccctg ctccccagac ctggttccag caccttccct 2761 gtgacctgct gttggtctct gcacctttac tgaactcgtt gtgcttacag tgtccacctc 2821 tggtgctcgt tgctgggaac acgtgttcta ctggctaagt atttgctttg tttctttttt 2881 cttttactat tactttatta aagctgtttc tatttcaacc cacaagcatt tcatcccctt 2941 tccctccaga gaagaaaagg tgagtgagca accgttttgc tgtttagctg ccggcccagc 3001 gctgaacagc agcagttgcg gtctagcaaa gggccacagg tatcagcagt ttggagtctg 3061 aattaaaaaa aaataaaatc attgtcttta gatcacaggt gatgggttag gagataatga 3121 actagccttt ggtgtgttct gtgctatgat accagaggga gaagagtgtt gtgagggatg 3181 ggagaaggtg catgaagagc agtttctgca gtggcaagca caggcagcag ccaggtaaca 3241 ggcacgtggg gaccctgcct gtatggaaac atgggactat gacagcccat gcttccccaa 3301 gtgcactcat gggcccacag tgcttttcct ccatccttag acacatccca gtctacattc 3361 cagtgtcttc ctgtgcattt acacccattt actcttgtgt caccagtttg aagggctctt 3421 ctgcctctgc agtttgtccc cgtgatgtac acacaggcag cagatgcacc accccgaatg 3481 ctgtttctct gggctccgat tctcagaaga ttggcccctc tgtcctcccc tcctcccctg 3541 ctctgaaatg gtgccagctc aagctaatct cccctgaaca caaatgataa gaaccatcag 3601 ccatggagga aacatgggtg tgaactgtcc cctgcctgtc tgcgccacct tttgaaccta 3661 ctggaagcac tgcagccctc ctcaccttcc tcaccctccc tgtcttcctc accatcacca 3721 ccttccttgc cttcctcacc atcatttgag gctcacaggg ctcaccatgg gctgctgcag 3781 cctccgtgcc tttctccctg ggctgtgagc tcctgctgct ggaggccagt gcttccacca 3841 gtgcccactg atccccgtgg ctctgctgag cgtcccctgg cacagagtca gggtgacctg 3901 gtcccccatg gcactgggat tggcacatgt gcagaaggga gggccagggc aggggagctc 3961 cagggtttgt atgcagagtg gtgcagctgt ggtttgggga gcaggaaaag ggcacacgct 4021 ttggggtaca ctggtttggg gtgcactgag ggaacgagag cagtaggtgc agtcaggcaa 4081 tatggatgta tgtatagggg ctgcactgga tggcagtgtg tgtatagggg cacatgtgcg 4141 ggcagagacg gtgtgtgcac aggggctgca ttggatggca gcgtgtgtag gggcactgtg 4201 cacacagggg cagctcacgt gtgtgcgcgg gggctgtgtt ggatgggtca tgcatgtgca 4261 ggtgcccacg gggcagcatg tgggtgttgg gggcgctggg caggttcagt ctgtggggca 4321 caggctccca ccatactatt gaagttataa tgccaccctc taccgctgtc ccctccatcc 4381 tgcccgcttg tccctgccag agcacatgca tgctgtgcag ggctgcctgc gcccggggga 4441 ccaagtcagt acctggctcc aagccctgaa cccaacctgt gccggggggg agacagactc 4501 ctccacctgt gccactgaag accgtaaacc taacctgaag cctaaaccta aaactaaacc 4561 aaagcctaag cctaaaccta agcctaaccc tgcagtcagt ctgtgccaga ggggatggac 4621 cactcagcct gcgctgatgt gatccctggc cctaatcctg actcgaaacc catcaggtac 4681 cgggggccag acccccccag ccggtgccgg tgccgcagag cggagcgggg tgcggtgctg 4741 gccggggggg ggcggctccg ctggccgggc tccagcggcg gcggggccgg agcggggcgg 4801 ggcgggccgg gccgggctgg gccgggggcg ggcgctgccc cggcacgcat ataagggaca 4861 gcggcggcca gcgagggcac ccgtgctggg ggctgccaac gcgaaggtga caccatggtg 4921 ctgtctgcca acgacaagag caacgtgaag gccgtcttcg gcaaaatcgg cggccaggcc 4981 ggtgacttgg gtggtgaagc cctggagagg tatgtggtca tccgtcatta ccccatctct 5041 tgtctgtctg tgactccatc ccatctgccc ccatactctc cccatccata actgtccctg 5101 ttctatgtgg ccctggctct gtctcatctg tccccaactg tccctgattg cctctgtccc 5161 ccaggttgtt catcacctac ccccagacca agacctactt cccccacttc gacctgtcac 5221 atggctccgc tcagatcaag gggcacggca agaaggtggc ggaggcactg gttgaggctg 5281 ccaaccacat cgatgacatc gctggtgccc tctccaagct gagcgacctc cacgcccaaa 5341 agctccgtgt ggaccccgtc aacttcaaag tgagcatctg ggaaggggtg accagtctgg 5401 ctcccctcct gcacacacct ctggctaccc cctcacctca cccccttgct caccatctcc 5461 ttttgccttt cagctgctgg gtcactgctt cctggtggtc gtggccgtcc acttcccctc 5521 tctcctgacc ccggaggtcc atgcttccct ggacaagttc gtgtgtgccg tgggcaccgt 5581 ccttactgcc aagtaccgtt aagatgcggc accatggcta gagctggaca caacctgctg 5641 ccagccctcc aacagtgagc aaccaaatga tctgaaataa aatctgttcc atttgtgctc 5701 catcgttggc gtcctgctct ggctgctgcc tgtggggagg gagcgggaga gatctgtgct 5761 aggggtccaa acagggggtc ccccctgcca ggtgggcatg taggtgaaat gggctcttgt 5821 tttgctgtct ctgagaggag gcagcctttg ggtggctgtg gtgcagacac tgctctgcta 5881 gctgtggagc t // LOCUS CMGLOAD 1185 bp DNA linear VRT 18-APR-2005 DEFINITION Cairina moschata (duck) gene for alpha-D globin. ACCESSION X01831 VERSION X01831.1 GI:62724 KEYWORDS alpha-globin; globin. SOURCE Cairina moschata (Muscovy duck) ORGANISM Cairina moschata Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Archosauria; Aves; Neognathae; Anseriformes; Anatidae; Cairina. REFERENCE 1 (bases 1 to 1185) AUTHORS Erbil,C. and Niessing,J. TITLE The primary structure of the duck alpha D-globin gene: an unusual 5' splice junction sequence JOURNAL EMBO J. 2 (8), 1339-1343 (1983) PUBMED 10872328 COMMENT Data kindly reviewed (13-NOV-1985) by J. Niessing. FEATURES Location/Qualifiers source 1..1185 /organism="Cairina moschata" /mol_type="genomic DNA" /db_xref="taxon:8855" CAAT_signal 20..24 TATA_signal 69..73 precursor_RNA 101..1114 /note="primary transcript" exon 101..234 /number=1 CDS join(143..234,387..591,939..1067) /codon_start=1 /product="alpha D-globin" /protein_id="CAA25966.2" /db_xref="GI:4455876" /db_xref="GOA:P02003" /db_xref="InterPro:IPR000971" /db_xref="InterPro:IPR002338" /db_xref="InterPro:IPR002340" /db_xref="InterPro:IPR009050" /db_xref="UniProtKB/Swiss-Prot:P02003" /translation="MLTAEDKKLIVQVWEKVAGHQEEFGSEALQRMFLAYPQTKTYFP HFDLHPGSEQVRGHGKKVAAALGNAVKSLDNLSQALSELSNLHAYNLRVDPVNFKLLA QCFQVVLAAHLGKDYSPEMHAAFDKFLSAVAAVLAEKYR" repeat_region 227..246 /note="direct repeat 1" intron 235..386 /number=1 repeat_region 289..309 /note="direct repeat 1" exon 387..591 /number=2 intron 592..939 /number=2 exon 940..1114 /number=3 polyA_signal 1095..1100 polyA_signal 1114 ORIGIN 1 ctgcgtggcc tcagcccctc cacccctcca cgctgataag ataaggccag ggcgggagcg 61 cagggtgcta taagagctcg gccccgcggg tgtctccacc acagaaaccc gtcagttgcc 121 agcctgccac gccgctgccg ccatgctgac cgccgaggac aagaagctca tcgtgcaggt 181 gtgggagaag gtggctggcc accaggagga attcggaagt gaagctctgc agaggtgtgg 241 gctgggccca gggggcactc acagggtggg cagcagggag caggagccct gcagcgggtg 301 tgggctggga cccagagcgc cacggggtgc gggctgagat gggcaaagca gcagggcacc 361 aaaactgact ggcctcgctc cggcaggatg ttcctcgcct acccccagac caagacctac 421 ttcccccact tcgacctgca tcccggctct gaacaggtcc gtggccatgg caagaaagtg 481 gcggctgccc tgggcaatgc cgtgaagagc ctggacaacc tcagccaggc cctgtctgag 541 ctcagcaacc tgcatgccta caacctgcgt gttgaccctg tcaacttcaa ggcaagcggg 601 gactagggtc cttgggtctg ggggtctgag ggtgtggggt gcagggtctg ggggtccagg 661 ggtctgagtt tcctggggtc tggcagtcct gggggctgag ggccagggtc ctgtggtctt 721 gggtaccagg gtcctggggg ccagcagcca gacagcaggg gctgggattg catctgggat 781 gtgggccaga ggctgggatt gtgtttggaa tgggagctgg gcaggggcta gggccagggt 841 gggggactca gggcctcagg gggactcggg gggggactga gggagactca gggccatctg 901 tccggagcag gggtactaag ccctggtttg ccttgcagct gctggcacag tgcttccagg 961 tggtgctggc cgcacacctg ggcaaagact acagccccga gatgcatgct gcctttgaca 1021 agttcttgtc cgccgtggct gccgtgctgg ctgaaaagta cagatgagcc actgcctgca 1081 cccttgcacc ttcaataaag acaccattac cacagctctg tgtctgtgtg tgctgggact 1141 gggcatcggg ggtcccaggg agggctgggt tgcttccaca catcc // LOCUS CIIHBADA2 1145 bp DNA linear VRT 28-APR-1993 DEFINITION Duck alpha-A-globin gene and 5' flank. ACCESSION J00923 J00924 VERSION J00923.1 GI:212911 KEYWORDS alpha-globin; globin. SEGMENT 2 of 2 SOURCE Cairina moschata (Muscovy duck) ORGANISM Cairina moschata Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Archosauria; Aves; Neognathae; Anseriformes; Anatidae; Cairina. REFERENCE 1 (bases 1 to 1145) AUTHORS Niessing,J., Erbil,C. and Neubauer,V. TITLE The isolation and partial characterization of linked alpha A- and alpha D-globin genes from a duck DNA recombinant library JOURNAL Gene 18 (2), 187-191 (1982) PUBMED 6290322 REFERENCE 2 (bases 1 to 1145) AUTHORS Erbil,C. and Niessing,J. TITLE The complete nucleotide sequence of the duck alpha A-globin gene JOURNAL Gene 20 (2), 211-217 (1982) PUBMED 7166233 COMMENT Original source text: Duck adult erythrocyte DNA, clone D-alpha-G-1. The alpha-A-globin gene is linked to the alpha-D-globin gene. [1] compared their alpha-A-globin gene sequence with chicken alpha-A- and alpha-S-globin gene sequences, as well as with other avian and mammalian alpha-A-globin gene sequences. FEATURES Location/Qualifiers source 1..1145 /organism="Cairina moschata" /mol_type="genomic DNA" /db_xref="taxon:8855" prim_transcript 331..1145 /note="alpha-A-globin mRNA" CDS join(367..461,612..816,921..1049) /note="alpha-A globin" /codon_start=1 /protein_id="AAA49148.1" /db_xref="GI:212914" /translation="MVLSAADKTNVKGVFSKIGGHAEEYGAETLERMFIAYPQTKTYF PHFDLQHGSAQIKAHGKKVAAALVEAVNHIDDIAGALSKLSDLHAQKLRVDPVNFKFL GHCFLVVVAIHHPAALTPEVHASLDKFMCAVGAVLTAKYR" exon <367..461 /note="alpha-A globin" /number=1 intron 462..611 /note="alpha-A-globin intron A" exon 612..816 /number=2 intron 817..920 /note="alpha-A-globin intron B" exon 921..>1049 /note="alpha-A globin" /number=3 ORIGIN About 3 kb after ; 285 bp upstream of HhaI site. 1 ctcatgctgg ggttgcctcc ccccctcaaa ccctaacctt aatcccatct cgtgctgggg 61 tcagaccccc ctaaccctaa cccagttcat gccgggatca gcccccccaa accctaaccc 121 taaacccatc tcgtgccggg gtcagacccc ccccaaccct aaccccgacc ccagttcatg 181 ccggggtcgc ccccccccgg tggtgccggt gccgcaggcg gggcagggcg gcggccccgc 241 ctggccgagg tccagccgcg acggggcggg cggggcgggg cggcgcccgg gccggcacgg 301 ggatataagg ccggcggcac cagtgggggc acccgtgctg ggggctgcca acgcggagct 361 gcaaccatgg tgctgtctgc ggctgacaag accaacgtca agggtgtctt ctccaaaatc 421 ggtggccatg ctgaggagta tggcgccgag accctggaga ggtaggtgtc tgtccccgtc 481 ctttgtccgt ccctgatcct ctcctctcta accccatgct ctcccccacc ataactgtcc 541 gtgtcctacc ccaccccatc catcccccct gtccgttgat cccgctggcc ctgactcgct 601 ctgctccaca ggatgttcat cgcctacccc cagaccaaga cctacttccc ccactttgac 661 ctgcagcacg gctctgctca gatcaaggcc catggcaaga aggtggcggc tgccctagtt 721 gaagctgtca accacatcga tgacattgcg ggtgctctct ccaagctcag tgacctccac 781 gcccaaaagc tccgtgtgga ccctgtcaac ttcaaagtga gtctggtgac tccccccagc 841 tcctcttcag cacccatcct gggccatccg gccacccctt tacctccccc actcgctcac 901 cgtctccttt tgcctttcag ttcctgggcc actgcttcct ggtggtggtt gccatccacc 961 accccgctgc cctgacccca gaggtccacg cttccctgga caagttcatg tgcgccgtgg 1021 gtgctgtgct gactgccaag taccgttaga cggcaccgtg gctagagctg gacccaccct 1081 gttgccagcc ttccaactgc aagcagccaa atgatctgaa ataaaatctg ttgcatttgt 1141 gctcc // LOCUS GOTHBAI 1894 bp DNA linear MAM 27-APR-1993 DEFINITION Goat adult alpha-i-globin gene, complete sequence. ACCESSION J00043 VERSION J00043.1 GI:164123 KEYWORDS alpha-globin; globin. SOURCE Capra hircus (goat) ORGANISM Capra hircus Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; Pecora; Bovidae; Caprinae; Capra. REFERENCE 1 (bases 1 to 1894) AUTHORS Schon,E.A., Wernke,S.M. and Lingrel,J.B. TITLE Gene conversion of two functional goat alpha-globin genes preserves only minimal flanking sequences JOURNAL J. Biol. Chem. 257 (12), 6825-6835 (1982) PUBMED 6282825 COMMENT [1] also determined complete nucleotide sequence of alpha-ii-globin gene. alpha-i and alpha-ii are nonallelic, but are 99% homologous. FEATURES Location/Qualifiers source 1..1894 /organism="Capra hircus" /mol_type="genomic DNA" /db_xref="taxon:9925" CDS join(917..1011,1120..1324,1429..1557) /note="alpha-i globin" /codon_start=1 /protein_id="AAA30909.1" /db_xref="GI:164124" /translation="MVLSAADKSNVKAAWGKVGGNAGAYGAEALERMFLSFPTTKTYF PHFDLSHGSAQVKGHGEKVAAALTKAVGHLDDLPGTLSDLSDLHAHKLRVDPVNFKLL SHSLLVTLACHLPNDFTPAVHASLDKFLANVSTVLTSKYR" exon <917..1011 /note="alpha-i globin" exon 1120..1324 exon 1429..>1557 /note="alpha-i globin" ORIGIN 1 gccctcggcc gaattagtct cttcctctga atctctgggg cagggacaca ttcctagtcc 61 acatagacac actgtgtgac actatgcatg tccctaagca atctgagcct tgtgcaccca 121 cccgaccaaa aaatccccaa cttcactggg tctttgaagt gatttttgaa ataataattc 181 taaggaaggc tggtgtcctg gcaggccgag tctacccgac ccccacccca gctccagcac 241 tctgggtcgg cctctgagcc cctggggcat agcgccccat cttacacaca gacagacaga 301 gacacacaca cagtcaatct ctccttggca gtactggagc agagatctta gcgatcagtc 361 cagagccttt gcaaagacaa gagcccaacc ctccaagctc tccagactct gaatgaggat 421 ggttccctgg atgacagtct cgggtatgtg cagggctgga cagtcagtga ccaagttcga 481 agggggatgc tggacactaa tccctcaccc agactgggca gaaggcccct cggtccatgc 541 cctttagtgc gaaagcagct tgcttcactt cggaaaagac tgtgctcctc tcccaggctg 601 gcaaacggaa aactcggttc cagggggttt gggtggctca gaaaagtgtg tgagctgttc 661 gacccacagg accagggtag aaggtaccat aaatagctgg attgcagcca aagagggaca 721 gaaaaggggc caccttggga ccccgacacc ctacaccctc tccagaccca cctttccagg 781 cctccacctc ctggccccgc gccagccaat gagcgcagcg cgggcgggcg tgcccctggc 841 gccgggcgca taaaggctcg cgcactcgca gccccgcact cttctggtcc tgacccagac 901 tcagagagaa tccaccatgg tgctgtctgc cgccgacaag tccaatgtca aggccgcctg 961 gggcaaggtt ggcggcaacg ctggagctta tggcgcagag gctctggaga ggtgagcacc 1021 gcacccgtcc cgaggggacc gggccgctcg ccgggcgccg ggggcgtcct tgtcccgggc 1081 cgctcggcct gagcccggct ttcccgcctc ttcacccagg atgttcctga gcttccccac 1141 caccaagacc tacttccccc acttcgacct gagccacggc tcggcccagg tcaagggcca 1201 cggcgagaag gtggccgccg cgctgaccaa agcggtgggc cacctggacg acctgcccgg 1261 tactctgtct gatctgagtg acctgcacgc ccacaagctg cgtgtggacc cggtcaactt 1321 taaggtgagc tcgcgggccg ggccgggaca gacctgggct agcggggcag agaatgccgc 1381 ggcgccccca cccagccccc gccccactga cgtcccctct ctcggcagct tctgagccac 1441 tccctgctgg tgaccctggc ctgccacctc cccaatgatt tcacccccgc ggtccacgcc 1501 tccctggaca agttcttggc caacgtgagc accgtgctga cctccaaata ccgttaagct 1561 ggagcctcgg ccacccctac cctggcctgg agcgcccttg cgctctgcgc actctcacct 1621 cctgatcttt gaataaagtc tgagtgggct gcagtgtctg tctgtagcct ccggtctctg 1681 cgtctgcgaa ccggccgggg tgggcgtggc tctcagtctc taggagtggg agggtggagg 1741 agggcgggga gctaaggctg agggtcccag aatctgctga accaagttcc cctcctggga 1801 gacttccaag ggttctctct gaggtgggga gtgctgaaat agccaccctg gttttgaaat 1861 ttttctgatc ccccttaaac atagatgaaa acaa // LOCUS GOTHBAII 1691 bp DNA linear MAM 27-APR-1993 DEFINITION Goat adult alpha-ii-globin gene, complete sequence. ACCESSION J00044 VERSION J00044.1 GI:164125 KEYWORDS alpha-globin; globin. SOURCE Capra hircus (goat) ORGANISM Capra hircus Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; Pecora; Bovidae; Caprinae; Capra. REFERENCE 1 (bases 1 to 1691) AUTHORS Schon,E.A., Wernke,S.M. and Lingrel,J.B. TITLE Gene conversion of two functional goat alpha-globin genes preserves only minimal flanking sequences JOURNAL J. Biol. Chem. 257 (12), 6825-6835 (1982) PUBMED 6282825 FEATURES Location/Qualifiers source 1..1691 /organism="Capra hircus" /mol_type="genomic DNA" /db_xref="taxon:9925" CDS join(745..839,941..1145,1250..1378) /note="alpha-ii globin" /codon_start=1 /protein_id="AAA30910.1" /db_xref="GI:164126" /translation="MVLSAADKSNVKAAWGKVGSNAGAYGAEALERMFLSFPTTKTYF PHFDLSHGSAQVKGHGEKVAAALTKAVGHLDDLPGTLSDLSDLHAHKLRVDPVNFKLL SHSLLVTLACHHPSDFTPAVHASLDKFLANVSTVLTSKYR" exon <745..839 /note="alpha-ii globin" exon 941..1145 exon 1250..>1378 /note="alpha-ii globin" ORIGIN 1 ctgcaggaac cagcacctgg gagaagagac ttgaacccgg acttgaactc cttgcaaatt 61 gctgtaaccc gctctcagta tctgttcctt ccaagactgc cactcagttg cacccaaaaa 121 ctctctgcgg aaagaaagga agctcgaagc gccaaggctg aagaggaaca ggagggttgg 181 acgggggtgg ggaggaattc gcgattacat gtgaacggtg agccaagtgt gttgcgtcgg 241 gctgcctctg gcatggacta ggcgcactca gtcgcccgtt ccttcactga tactgcccaa 301 gtttaaaatg cccagagtgt gccaagctta ggtccggggt gggtagacgg gctgacttac 361 tcccttccgt tctcaagaca gctggggaac tcctgcagga tgcaggagcg ggcatctacc 421 cagctccaca atcccgcccc tgccacctgg cgcgaggcta ccacgtccgg ggaaggtgga 481 cgcagcgggc gggaagcaga cggtggaagc aagaaccccc ggtcagagtc caggtctggg 541 tgggtgaggg aagcacccat cgcccggccg ggcgcaggtc ggactccgcg cgccccctgc 601 ggtcctggtc cggccgcgca tgccgcgtgc cagccaatga gcgcagcgcg ggcgggcgtg 661 cacctggagc cgggcgcata aaggctcgcg cactcgcagc cccgcactct tctggttctg 721 acccagactc agagagaatc caccatggtg ctgtctgccg ccgacaagtc caatgtcaag 781 gccgcctggg gcaaggttgg cagcaacgct ggagcttatg gcgcagaggc tctggagagg 841 tgagcaccgc acccgccccg aggggaccgg gccgctcgcc gggcgcgtcc ttgtaccggg 901 cctctcggcc tgagcccggc tttcccgcct cttcacccag gatgttcctg agcttcccca 961 ccaccaagac ctacttcccc cacttcgacc tgagccacgg ctcggcccag gtcaagggcc 1021 acggcgagaa ggtggccgcc gcgctgacca aagcggtggg ccacctggac gacctgcccg 1081 gtactctgtc tgatctgagt gacctgcacg cccacaagct gcgtgtggac ccggtcaact 1141 ttaaggtgag ctcgcgggcc gggccgggac agacctgggc tagcggggca gagaatgccg 1201 cggcggcccc acccagcccc cgccccactg acgtcccctc tctcggcagc ttctgagcca 1261 ctccctgctg gtgaccctgg cctgccacca ccccagtgat ttcacccccg cggtccacgc 1321 ctccctggac aagttcttgg ccaacgtgag caccgtgctg acctccaaat accgttaagc 1381 tggagcctcg gccaccccta ccctggcctg gagcgccctt gcgctctgcg cactctcacc 1441 tcctgatctt tgaataaagt ctgagtgggc tgcagtgtct gtctgtagcc tcgggtctct 1501 gtgtccgcga accggcccag gttctcattg cctcggacca aggagctctc aggcagctag 1561 agagagaagg ggaaaactgg acggaggggt gggggtgcag cctgccccac tgccactacc 1621 tgggattctc tgggcagccc tcaccctcag cctggagtga tttctgagta tcttggccct 1681 tccctgaatt c // LOCUS ESGLOB01 1331 bp DNA linear MAM 10-FEB-1999 DEFINITION Equine BII alpha-1 globin gene. ACCESSION X01086 VERSION X01086.1 GI:1078 KEYWORDS alpha-globin; globin. SOURCE Equus caballus (horse) ORGANISM Equus caballus Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus. REFERENCE 1 (bases 1 to 1331) AUTHORS Clegg,J.B., Goodbourn,S.E. and Braend,M. TITLE Genetic organization of the polymorphic equine alpha globin locus and sequence of the BII alpha 1 gene JOURNAL Nucleic Acids Res. 12 (20), 7847-7858 (1984) PUBMED 6093055 FEATURES Location/Qualifiers source 1..1331 /organism="Equus caballus" /mol_type="genomic DNA" /db_xref="taxon:9796" promoter 11..15 /note="put. CAAT-box" promoter 54..59 /note="put. TATA-box" CDS join(132..226,504..708,858..986) /codon_start=1 /product="alpha-1 globin" /protein_id="CAA25564.1" /db_xref="GI:1079" /db_xref="GOA:P01958" /db_xref="UniProtKB/Swiss-Prot:P01958" /translation="MVLSAADKTNVKAAWSKVGGHAGEFGAEALERMFLGFPTTKTYF PHFDLSHGSAQVKAHGKKVGDALTLAVGHLDDLPGALSNLSDLHAHKLRVDPVNFKLL SHCLLSTLAVHLPNDFTPAVHASLDKFLSSVSTVLTSKYR" variation 204..206 /note="aa24 Phe is Tyr in haplotype BI" variation 204..206 /note="aa24 Phe is Tyr in haplotype A" variation 204..206 /note="aa24 Phe is Tyr in haplotype mutant C1" intron 227..503 /note="intron I" variation 539..541 /note="aa60 Lys is Gln in haplotype A" intron 709..857 /note="intron II" misc_feature 1061..1066 /note="polyA signal" ORIGIN 1 tcctcgccag ccaatgagcg cggcccgggc gggcgtgccc cccgcgcccg gactataaag 61 ctgcgcgctc ggcccgccgc gtacgctgct gtgccgctgc tggtcctagc acagactcag 121 aaacagtcac catggtgctg tctgccgccg acaagaccaa cgtcaaggcc gcctggagta 181 aggttggcgg ccacgctggc gagtttggcg cagaggccct agagaggtga ggaccctcct 241 ttccccggcc gggaccctcg ggcacagcag ccgccccagg ggcctgccag caacccctcg 301 gtgggttctg gcccggctgg tgcaaagacc cccaagatct cagggtctga ccgcggacca 361 gccggaggag cccggccagc accttcttcc gaatccgagg ctccggaccc tgcccccgac 421 cccgccaccc cacacccacg ccggcccccc gccgccgccg cccccccccc ccccgccccc 481 cgctcactct cctctccctg caggatgttc ctgggcttcc ccaccaccaa gacctacttc 541 ccccacttcg atctgagcca cggctccgcc caggtcaagg cccacggcaa gaaggtgggc 601 gacgcgctga ctctcgccgt gggccacctg gacgacctgc ctggcgccct gtcgaatctg 661 agcgacctgc acgcacacaa gctgcgcgtg gaccccgtca acttcaaggt gagcccgggg 721 gccgggcctg gccgggcggg agagacgagc gggaggcgca gcgggccctc ccagagggca 781 gggaaccccg tgggtctcag aagtggagcg cgggcggccg cggccccgac gccccctgac 841 accccctcga tccgcagctt ctgagtcatt gcctgctgtc caccttggcc gtccacctcc 901 ccaacgattt cacccctgcc gtccacgcct ccctggacaa gttcttgagc agtgtgagca 961 ccgtgctgac ctccaaatac cgttaagctg gagccacggc gatccctgcc cgcggcccgg 1021 ggccctttgc gctccgcgtg cccgcacttc cctacctttc aataaagtct gatgggctgc 1081 atgcagcctg cgtgcctcgg ttctctgtgt ccgcgaatgt gccaggggtg ggggtggtct 1141 gtctgatcaa ggacctccca ggagcgggca gagagggaag ggaagaaaag ggtggaggag 1201 ggagtggagc tggtaggctg cctggggttt gctgcaaccc cccactgtca ccctggagag 1261 ctttgctatg cacgttggcc tcttcctgca tttcattcct tgcagactca acacgggttt 1321 attcaagcaa t // LOCUS ECPZA2GL 5089 bp DNA linear MAM 07-JUL-1993 DEFINITION Horse psi zeta and alpha 2 globin genes (BI allele). ACCESSION X07053 M17900 VERSION X07053.1 GI:1065 KEYWORDS alpha-globin; pseudogene; zeta globin. SOURCE Equus caballus (horse) ORGANISM Equus caballus Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus. REFERENCE 1 (bases 3260 to 4776) AUTHORS Clegg,J.B. TITLE Gene conversions in the horse alpha-globin gene complex JOURNAL Mol. Biol. Evol. 4 (5), 492-503 (1987) PUBMED 2835578 REFERENCE 2 (bases 1 to 5089) AUTHORS Flint,J., Taylor,A.M. and Clegg,J.B. TITLE Structure and evolution of the horse zeta globin locus JOURNAL J. Mol. Biol. 199 (3), 427-437 (1988) PUBMED 3351936 FEATURES Location/Qualifiers source 1..5089 /organism="Equus caballus" /mol_type="genomic DNA" /db_xref="taxon:9796" /clone_lib="lambda L47.1" /note="Allele: BI" misc_feature 1297..1658 /note="pseudo zeta gene; region of homology to zeta gene" misc_feature 1465..1559 /note="psi zeta globin coding region" conflict 3302 /note="a is g in [2]" /citation=[1] CDS join(3481..3575,3854..4058,4208..4336) /codon_start=1 /product="alpha 2 globin" /protein_id="CAA30097.1" /db_xref="GI:313500" /db_xref="GOA:P01958" /db_xref="UniProtKB/Swiss-Prot:P01958" /translation="MVLSAADKTNVKAAWSKVGGHAGEYGAEALERMFLGFPTTKTYF PHFDLSHGSAQVKAHGQKVGDALTLAVGHLDDLPGALSNLSDLHAHKLRVDPVNFKLL SHCLLSTLAVHLPNDFTPAVHASLDKFLSSVSTVLTSKYR" exon <3481..3575 /number=1 intron 3576..3853 /number=1 conflict 3688 /note="g is c in [2]" /citation=[1] exon 3854..4058 /number=2 intron 4059..4207 /number=2 exon 4208..>4336 /number=3 polyA_signal 4411..4416 /note="put. poly A signal" ORIGIN 1 cctgagtgca tctgaccagt gagaatggac caacagatct tacctgtcat gtgggaatgt 61 gcgacctcaa cagcgagtgg aaacagtgcc accacacgga tcgccatgac agccatcccc 121 cttcatgcgc tggctgggga gggagaaact ggtccacacg cagatgttag ggctcagtcc 181 agggtcagag cagtgagggg gtctctgccc ctttctccga gtacccaggg tctccctgag 241 gcagccaggg aggtggccgt cgggggatgt gtaacaggag gagccacaga agaagcttct 301 ccaccagact caggcctggg cttggtctgc cagccccctg tgcccagggg ccccaaactc 361 tcggctcaga tacaaagaca gggtcaggcg gcccacagct cagctccact gaacagcaac 421 acgtgctcca tgtggaaacc cagtgacaac acacgcaacc gaccaaccga caaggacacc 481 acaaccctcg tcacttaggg gtcattactg tcactagttt ggtgctgtcc tatccagccc 541 tttctactac tgaaaaacat tatcatattg tatgttctgt aaaaaatctt ctattttaag 601 aaagtgctta taaaaaatgt ttattataat tacaacattc aaatcaagct ataatttaac 661 ttaaaaatac atgcataaaa gtgcactttg caagaaccca taggaactga gaggaagata 721 caggctcatt agaatggccg ccttacggga agggaaggga agcgacgtgc gtggacaaga 781 gaacaagggt cttgtccagg ctgtgacagg tggaaccgta ggcgcgtgtg gcccagccgt 841 tgtggagaat ccaggctcct tgcggcctgg ggccctgctt ggaaagcagc ccccccaagc 901 tgctggggga cttggcccat gctggggaga acctcagaca gaccctcatc taggtgagga 961 gggggcaaca cagaagaggc gggagagggt ctgaggccaa gactgccact cacacccagt 1021 gatgcctggc ctcagtttcc ctccatgggc ctcggtgact catcaggtgg cgttgctgaa 1081 gctttaagga acagggtgct tctgtcaggg cccgacgggt ttcaggtcat ccttccattg 1141 caaggacaag gaggggaggg accagcatta cggaggccag tctctgggct gggctgacac 1201 gtggaccctc gacccaagcc tgggccccca tctccctggg ccagggctct ggctgtggcc 1261 cagtgggtga taccctcacc ccctcacccc aaactgctcg tcactggatc ggataagaga 1321 ccaccaccct cacggccccc tcccctcctg accaatggcc acactctggc ccccacccca 1381 gctccgtgta tataagggga ccctgggggc tgagcaccat ggaggccagt cctgaggaca 1441 cccagctcca gtccagccat caccatgtct ctgaccaagg ctgagaggac catggtcgtg 1501 tccatatggg gcaagatctc catgcaggcg gatgccgtgg gcaccgaggc cctgcagagg 1561 tgagtgccag acagcctggg acaggtgaca gtgtcccagg tgacactggt gtaggtgaca 1621 gcgtgagttt agtgaggaca ggggccagtg aggagggaca gcgaggaggg gtcagtgagg 1681 agggcacagt caggagggac agtggagaag gtcttgagtc gggaaaggtg aatcgctcgg 1741 atgtctggct gacccccgct aggcctagca gggacaggcc caggtcccca caaggagggg 1801 ctgcacacat ttctcccgag agaacagaag tgtcttccgg ggcgccaggc agcaacagca 1861 tcagggtctc gcattctggg ggcggtgtca gcagagggtc cccgcagaga ccttccttgg 1921 gacacggctg gagagggcag catttccggc agaagggtca acaggaagcc tgatcaaggg 1981 ctgggaagga ggccccttcg ccagaggaga ggtgacaggg gctgattgga agccaggagg 2041 caggcccgga gccgcccggc ctggaatggg gctgggggcc gaacacggcc cggcggcggg 2101 agggtgtgga cgccgaggag agggcgcctc cagacggcgg ggtggagctt cggtgggggg 2161 ccgtggtagg gcagagtggg tgctcggatc ggaggagagt caggcttcgg cttgcgcggt 2221 ggccccacct tctcaaagca cgtccgcagt cattcgttcg ttcagtcatt ctccctccat 2281 ccctacctcg cgcggaagcg gggaacgtga atctttgccc cttttgcaga cagacgcggg 2341 ctcccaggcc caggggccaa gttcctggcc tgaggttaaa cagaattgcg gccctgggtt 2401 cctcggacac ccatctctgt tccttttccg cttcccctcc ccctagcctg cagcaaatct 2461 tgccctgggc caaattatcc ccttcctctg aatctcctgg gggcaaggga tgggggtagg 2521 gaggaacttc tcgacggccg caatctggcg gaacaggttt ccaactcgat tcactgggca 2581 tgtccccgag caatctgagc ctaacgcatc aggtgggaaa acctcgggcc tcgctgggtc 2641 tttgaaataa tttacgcgat cagaatgcag ggtcggttcc aggctggccc cggccacccc 2701 acccccaccc agccgcagcc gcggcgcccc agtctccgcg cgtccccggc caccacgccc 2761 ggccccaggg tcgatctctc ccgcctgcgg caccggggca ggaatcttag cgctcaaccc 2821 agaacctctg cggacaaggg gacaggagcc cggagaggga ccctgggagc cctccaaact 2881 cgatgaggac cgttccggga ccatcggcgc ccaagtccga agggggacgc tggaggctaa 2941 gtccggacgg tggttcggcc gaaacccgcg cccacgcggt ccaggccgcc cgttacccct 3001 tgctgccggg agcgcctgcc caccaccggg ggaggagggc ttggaggacg tcccccacat 3061 tatctccttc cccgcgtcgt ttacacggac agtttgcagc tctcggcctg gggcaaagtt 3121 tggtgccaaa gcagcttgtt tcacttcgga acccgcaaat atacacattg gaagctatcc 3181 cccagcccga gccaaaggcg agtgacagtg ggtttgggtg gctcagaacc cgcgtgggct 3241 cccggaacca caggagagag aatcgagggt gcggaggaga gctggctccc ggcgcgggcg 3301 cagggagtcc tgcctgcccc cctcgctctg ctcagcgctt tcccagcccg cagctcccgg 3361 caccgcgcca gccaatgagc gcggcccggg cgggcgtgcc ccccgcgccc agaccataaa 3421 gctgcgcgct ctgtcagccg cgtacactgc tgtcccggca cagactcaga aacagtcacc 3481 atggtgctgt ctgccgccga caagaccaac gtcaaggccg cctggagtaa ggttggcggc 3541 cacgctggcg agtatggcgc agaggcccta gagaggtgag gaccctcctt tccccggccg 3601 ggaccctcgg gcacagcagc cgccccaggg gcctgccagc aacccctcgg tgggttctgg 3661 cccggctggt gcaaagaccc ccaagatgtc agggtctgac cgcggaccag ccggaggagc 3721 ccggccagca ccttcttccg aatccgaggc tccggaccct gcccccgacc ccgctacccc 3781 acacccacgc cggccccccg ccgccgccgc cccccccccc ccccgccccc cgctcactct 3841 cctctccctg caggatgttc ctgggcttcc ccaccaccaa gacctacttc ccccacttcg 3901 atctgagcca cggctccgcc caggtcaagg cccacggcca gaaggtgggc gacgcgctga 3961 ctctcgccgt gggccacctg gacgacctgc ctggcgccct gtcgaatctg agcgacctgc 4021 acgcacacaa gctgcgcgtg gaccccgtca acttcaaggt gagcccgggg gccgggcctg 4081 gccgggcggg agagacgagc gggaggcgca gcgggccctc ccagagggca gggaaccccg 4141 tgggtctcag aagtggagcg cgggcggccg cggccccgac gccccctgac accccctcga 4201 tccacagctc ctgagtcatt gcctgctgtc caccttggcc gtccacctcc ccaacgattt 4261 cacccctgcc gtccacgcct ccctggacaa gttcttgagc agtgtgagca ccgtgctgac 4321 ctccaaatac cgttaagctg gagccacggc gatccctgcc cgcggcccgg ggccctttgc 4381 gctccgcgtg cccgcacttc cctatctttc aataaagtct gatgggctgc atgcagcctg 4441 cgtgcctcgg ttctctgtgt ccgcgaatgt gccaggggtg ggggtggtct gtctgatcaa 4501 ggacctccca ggagcgggca gagagggaag ggaagaaaag ggtggaggag ggagtggagc 4561 taagacacat gcttcaggga atcgctgcac tgtgctcctg tgctgggagg tgcccgactt 4621 cgcagggttc ctccctggca tgggacgtgg ggagttgtaa gatcgccacc ctgtttgaag 4681 tctctgactg atgcccgttg aacatataat ctagatcaaa acaaaccaca aacataggcg 4741 gctcccgtgc aaatgcaggc attggacaat atctagaaga tccactcgtt ccaccagaag 4801 gaaggttgct tggaatgggc gccgcagcct ggcgcaccct ccctgcctca cagcaatcct 4861 tgggagcggc aggcaagtgg gggcggggga gagaggacct gagtcaaggc acttaggaga 4921 gcactgttcc ctcgttcctg caggttacca ggcattgcgt gcccagccct gcatccccca 4981 gctaattgct tccccattct ccagccagag gctcctgggg tctgccagcg gttctgcgcc 5041 gcgctgtgag ccaccttcag taaggcccct gtccaggtgc tgaagccct // LOCUS AF098919 29615 bp DNA linear VRT 05-APR-2002 DEFINITION Gallus gallus embryonic alpha-type globin pi, adult alpha D globin, and adult alpha A globin genes, complete cds. ACCESSION AF098919 AF067138 VERSION AF098919.2 GI:20043262 KEYWORDS . SOURCE Gallus gallus (chicken) ORGANISM Gallus gallus Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; Gallus. REFERENCE 1 (bases 1 to 29615) AUTHORS Zhao,Z., Sjakste,N., De Moura-Gallo,C.V., Ioudinkova,E.S., Razin,S.V. and Scherrer,K. TITLE Organization of the chicken domain of alpha-globin genes JOURNAL Unpublished REFERENCE 2 (bases 1 to 29615) AUTHORS Zhao,Z., Sjakste,N., De Moura-Gallo,C.V., Razin,S.V. and Scherrer,K. TITLE Direct Submission JOURNAL Submitted (14-OCT-1998) Biochemistry of Differentiation, Institut Jaques Monod, 2 Place Jussieu - Tour 43, Paris 75251, France REFERENCE 3 (bases 1 to 29615) AUTHORS Zhao,Z., Sjakste,N., De Moura-Gallo,C.V., Ioudinkova,E.S., Razin,S.V. and Scherrer,K. TITLE Direct Submission JOURNAL Submitted (05-APR-2002) Biochemistry of Differentiation, Institut Jaques Monod, 2 Place Jussieu - Tour 43, Paris 75251, France REMARK Sequence update by submitter COMMENT On or before Apr 5, 2002 this sequence version replaced gi:3172520, gi:4063698. FEATURES Location/Qualifiers source 1..29615 /organism="Gallus gallus" /mol_type="genomic DNA" /db_xref="taxon:9031" misc_feature 2502..6612 /note="region containing DNase I hypersensitive sites" misc_feature complement(3198..3357) /note="similar to exon 4 of the human -14 gene" mRNA join(<17811..17905,18483..18687,18982..>19110) /product="embryonic alpha-type globin pi" CDS join(17811..17905,18483..18687,18982..19110) /codon_start=1 /product="embryonic alpha-type globin pi" /protein_id="AAM09071.1" /db_xref="GI:20043263" /translation="MALTQAEKAAVTTIWAKVATQIESIGLESLERLFASYPQTKTYF PHFDVSQGSVQLRGHGSKVLNAIGEAVKNIDDIRGALAKLSELHAYILRVDPVNFKLL SHCILCSVAARYPSDFTPEVHAAWDKFLSSISSVLTEKYR" mRNA join(<21360..21451,21600..21804,22066..>22194) /product="adult alpha D globin" CDS join(21360..21451,21600..21804,22066..22194) /codon_start=1 /product="adult alpha D globin" /protein_id="AAM09072.1" /db_xref="GI:20043264" /translation="MLTAEDKKLIQQAWERAASHQEEFGAEALTRMFTTYPQTKTYFP HFDLSPGSDQVRGHGKKVLGALGNAVKNVDNLSQAMAELSNLHAYNLRVDPVNFKLLS QCIQVVLAVHMGKDYTPEVHAAFDKFLSAVSAVLAEKYR" mRNA join(<24360..24454,24595..24799,24909..>25037) /product="adult alpha A globin" CDS join(24360..24454,24595..24799,24909..25037) /codon_start=1 /product="adult alpha A globin" /protein_id="AAM09073.1" /db_xref="GI:20043265" /translation="MVLSAADKNNVKGIFTKIAGHAEEYGAETLERMFTTYPPTKTYF PHFDLSHGSAQIKGHGKKVVAALIEAANHIDDIAGTLSKLSDLHAHKLRVDPVNFKLL GQCFLVVVAIHHPAALTPEVHASLDKFLCAVGTVLTAKYR" ORIGIN 1 ggatccgttt tggatacctg aaaaagattt agaagaaatt ccataggctt gtgtgctttg 61 gaaccagtaa tgaagggaga aaaaagagag aaaggaaaaa aaaaaaaaaa aaccaacaga 121 aaagcctcaa tcagtcaagt aacagtttgc tgagagggca gacaaatcaa attacagcat 181 taaaaaatga cttgcaaaaa ataaaactgt gttactatat tttttattaa agtgtcacag 241 ttattctaca tcttccaaga agcagataaa gcatttcggt cagaacaata ggagtgtgta 301 tcagttaaga tagcagtctt ggaggtgtct tgtattgcat cttagtacca ttaggggcct 361 acacagagat tttcctcagc acagttgtca tatactgcta gcacagcaaa tttctcaccc 421 ctgtccacat ctatattctc catacccccc tccctactct ttctgcttcc aagtgtcaca 481 atctttaaca catgcacagg attttcttgt ggggagggct tgaggaagtc aagccaagca 541 aggtgaattt gcacgtccag tcataatctg tccgaatggg ctgagcacca ccacaagatg 601 gagcacaaaa tccagagccc attcctgtct tcccaaagct cctgatgaac atccagtcca 661 acctgaagtg aaaacaacct gaacagtcac ctaacctaat atcctgcctc tacccaacaa 721 atgtgcaccc agagctaatg taatacttga cacagggcaa aggcagtgaa gcctcatcca 781 gccttcctga caaacactga gcaccaccga ccaactgctc acaggtgaac tcttgaaccc 841 tggatgcctt cactcagtgg tcttcaaaat cagagaaact gaaagtagta acgcatatat 901 ttgtgcctct aaactgtaag tgaagttcta tcaaacagac agagactgga ggggtggtaa 961 catcacctta cagcgaccgg agggctcata aaatcacctt gttgtgtcca tgtgttctca 1021 agctacaggt tgtaggatgc cttgtgaact cagtaaagat ttaaatgcaa ttacgtgcag 1081 ggtctgatcc ttccttaaaa tcaaagaaac catcacaatt aacatcagct ggaattgagc 1141 taattttcac ctcacatctt gtgcaagaaa aaaagaataa aatactaaga tggaatggaa 1201 aagggggaaa gaatttgaag aattcagaac atcactacat aaaaggcagt ctgacaccta 1261 cagttaaata ctggtgctct tgtagtttct gaatctcatc tgaggaagaa tttgctgagc 1321 tgtcctgtgc agtaacttac tgtagtatga acactaagga aattcaaaac acagactgtc 1381 aggtcacttc agtcactgct tctgtactgt gggaaaggaa agaaacaatc cagcttcttc 1441 ctaactgcca aagggtcccc agttcagacc caaatgaagc agcagagatg gctgccacat 1501 ccagaacatt ctggaaacta gaaagtttta gggtgcctgc tggggtgata agacacaccc 1561 aagaaaaaga ttctactaga agcaacaaga aggtgtttcc agcactctgc catgttgctt 1621 atttacccag atgcaatttc atttagacaa attgaaaatg aagagaaatt tcatgcagtt 1681 acaataaaag tgactacaaa gcagagagaa tgtagaacat gtggtgaata aatgcaaata 1741 cgacttttca atttcattca gattcaccac accacattta ggaagctgtg ccttttggac 1801 agcactggca aaataaatga atctagtact gtgagagagc aggccagtgc tgtcagggcc 1861 ctgagggcac caaaaggtct cagtggcatt gaccagcacc acttgggtcc cctctttagc 1921 ccatggtcca cgagccatgc ttaggccttc aatccttgtg tgcgtgtcct cggccctgtc 1981 caaggatggg ccttggacct atgttgtggg tagcttgtct cctggatgga ccccaccaag 2041 tcctccagcc ctgtctctgt ctccatctcc ttttgctcct ggctgggctc cctgcatgga 2101 ccctgggcct ggtgcatctc tcacctcctg tggggttggt gacagaccct gaccactgtg 2161 gtcagaccca cacctctgtg tcccatctgg ggagggttct tagcacagac cgggctttct 2221 gaagtacctg cacagcttgg gggcacttga actctgtctt ccaaagcatc ctaaacattt 2281 gagagctgag cctcggtgat actcagccac atttcctgta ggctcacgaa tgagtttcgg 2341 aagtggaaca ccagcagctt tttttgatgt ttgatgccag cttctcactt ggatgaaagt 2401 agtcctacag caatccgatc ctctaaatat caagcagcaa gttgccaaca atctgtttgc 2461 acctctaatt atagatagaa aggatgacag aactggcaag cagatctttg cctggaaaac 2521 tgaacggttt tgtaaatttg atgaaaaatg agatgaaagc agttcagtga tacagattag 2581 tgttgtattc agaaattacc aagcctactt cattttcaca gtctgtcagg aagaaagaat 2641 gggaaaggga ttattttcca tatagaacca tgtaaattct ttcctagagt taagcagggc 2701 cttttccata agctgagtgg caattgcttt cattccaatc agctaaaacc caggccacag 2761 atcaaagtct caacacaaat agcagcctaa atttgtccat gggaaaatgt tagaccgctt 2821 cctaacattt gtgagggtgt tgttttgttc agtttttgta ctgccatgct aattggaggt 2881 gctacaaacc taccgtgcaa caactagcat cataagaaac acgttttttg aggtgggctg 2941 cttattacta cttctcttta cacacactgc ttttcttgag taaccagatg ggcttttctg 3001 ggaaccactg acacagcttc ctcggtttgc cactcactgt ggttatctgg aaatacatac 3061 agacagctcc gagcattgtg aagaatcaga aatcaggaag gagcaggatc aaaaggaaac 3121 atatccattc tgaattttaa gtcccctacc taagaataca caactggaac agaaagcatg 3181 cctgcaagca gatctcttac catacctgcc caagggcatg ctgaagcaac gtagggtgac 3241 caacaaagcg aacgttatca atcttcagtt caaatttttt gccacacata tctgactttg 3301 tagccaaaat tgttgccagg atgacatctg aaaaccttga agagcagaaa cagaagttat 3361 tacagttttc aaaaagcaca ccccttgaac gttcctccaa acacaaattc atttaatttg 3421 tgagtctttt taatgttgta aattgtgtct aaaataatgt acctggagta ccccaattat 3481 ggaaaattaa gaatccctta attatatctc tcagatcatc acagttatca aacaggaagc 3541 attcaagaca aagactgcag ggtcatgctt atttaagtac atgctcagca aaatacattg 3601 gaaagattag atgcagttgg tttaaatata aaaacagctt gagttagcca ttgcctggat 3661 ttagctgcag gttaatgaac ttactaagca actacatacc aaaacagcct gccactcaag 3721 cacaacaaag acctttaggc cacctactga gacacagggg aaggcaagac agagctgatt 3781 caaacaaatg aaatatatat tctacacttg tccctccttt tcaaatctaa cttcatgaaa 3841 gttatttcac ttaacagatc tcaggctgca gtcagatgcc cagtggcacc cttctgagag 3901 ctgcatggca tcacctgggc tcatggactg cactgaacca agacaaagtc atggcagcag 3961 gccctccttg gtgcccacct tgctgcagtg ggactgcgct gctccggaac tgtggtggga 4021 agaatgcagc cacaaaatag gaaggcaagt atctacaagg agtctgccaa ataaaataca 4081 ctcacagctg caaaactggg tcagaactgc aatgtgacat gggaaggatt ctgcaaccac 4141 tttccgatga cttcacaaca ttggcctggt ctttaaaaat actccttttc ctcagggcat 4201 cccctgagac agcagagctc aattccatag gacctcagtg agctgccagg ggaagctcat 4261 agttgcagga tgggagcagt gtgaggtttg atattagcaa tgttctttcc ttgtctgatt 4321 tccctgagac tcccatttta ccaaatggat atcaagtcag cttctgagga caccaccttt 4381 tttaatcatg gcaatattta tttcagtttc tttatttaca tccacgatgt gtgcctccat 4441 cagaccaccc cctccaccat gcaggtaccc cagataacac agtatcactc agaaaaaagg 4501 gcagacatgg gtgtgttcca cacccacctt atctctcggc cacaagcagg ccctcagtgc 4561 ccacagaagt agtcaatgta cctcattcct ggttcactct gagtgagtta tggtgcatta 4621 cagaaactga gacaaataca cagacaagtc ttctctgtga acccacctgc atatgaccca 4681 ggggctattt ttttccatta ggacctacaa acactgaagg tgtaggataa caggcctttc 4741 ttgagcagtc aaagcctctg atgcttgtac agtaaagtca tcctgaccac ggttggctag 4801 tggaaggggc agctccatac aaacttccaa atctctggct gctctgctcc aaacaggatg 4861 cccagcacag ctgagggcta gcagagagga ggggagagtg gccacagact cctggggcac 4921 tgagccccca gcgttcctca gccactgcac tgggctgggt tcaacaccac atggagttgg 4981 taaagaaaag gtaacgcttc ctcagactgg gcctggcttg gcagtccctg gtgatcacat 5041 ctgaaatagg aaagaaagtc accaagacag cagtgagcct cccctgttct ttctgccctg 5101 ctcagctgtg ccagttaaca ggctcatcgt ggagccaggc agactgtaag agcagcagat 5161 caaaaatgaa catctgccct gttaagtgga agcaggaggg taaactagca agtgcctgct 5221 tgctgctaag aaatgagtca cttccttccc tcagccttgg caaaggacat cttgggagct 5281 ggaaaaccaa acttcattat gatgacttac accactgtta cgatttgact ttaacatgga 5341 attacagcaa cagttacttt caggcagaac tgtggtgccg aacaattcag cagttccaca 5401 gcagagctca tgacactgac attcaccatc agcatgactt tgcatttggc tcagtaggaa 5461 gcagcagact gagaggttaa ttttgtaaag ccttccctat cttcttgcta gctgaaacct 5521 ctttctggga gaaagctaca acagcagcca gatgatgtgc tcagcatatg tataaccctc 5581 ttgacaggtt gcaggggttc cacctgatgg cacctccaga gcaacctgga ctccagacac 5641 gttctcctgg tctgacagcc tggattcatt ggtttggttg tctgatccta cctctcaccc 5701 cccaaaacca ctgcacatga tttatacgtt tctcaggcct ctggatgaca gggagattga 5761 gggacagcag atatgtcact cccatcattt tatctcagac aataaatatt ttagtttgga 5821 tattttaaaa ctggttttct aatctctgca gtgcattaca atgacataaa cacgttaact 5881 tactgaagcc cttgaagcag attgggtttc acagtcaggt tgtaacagtc cctactgccc 5941 cagcattagc ctgagctctg tgattctaca ggggagccaa cctttcctgt gctatggagc 6001 tcagcaggca gaaactaaat acagaaagca gaagttttga tttgcccaga atatcctacc 6061 tctcctatca tgagctcaac acagtgtgca gccacagcac acagtcctgc aggatgacac 6121 aaaccacacc agtgcaggct gcctactgcc accagcacct cctctggaac tggcacagct 6181 gaaacaacag agcctttgcc ttagcatgca gacaagaggg ggaaagtatc tcaaacagaa 6241 cttcagtgtt tccccaagca tggataatca cttttaaagt gccatagaaa ttcaaaaatt 6301 ctttataatt ttcaaattac aaagaatttg tattcgaggt agggtagagg gggagctgac 6361 tatttaggtg accatcttta tgctacacga ggactagaaa tgtatttcaa tacttttggg 6421 ccctcggcat gttcaggaag gggctataat gcagttgtca gttcagctta gaagggcatc 6481 aggaaggagc agctgctcac aaacagctct gcaacgagct ctgcataatt attgtgagat 6541 tataaatagt tcagtcactc atctgtactt tttggtcatt gcttgtcaga aggcaatttg 6601 cctcagggat ccttgctgga atagctccaa agccaaggct attcttgttg tgctcttgga 6661 ggatcacaat ttttctcagt tcacagaagg gacagtacca cactcattct ggttacccta 6721 cagaaagcac agaagtgtta gtaggacctc tgtaattatc ctagcattaa ctgttagcaa 6781 tacgtttcct atagacattg tgatttagca cataattacc tcatcatttg gtgtaaagta 6841 gaaaccagaa acaaagcaca ttgacaggat ggtgagaaaa gaaagccatt ctagacagag 6901 gtggtggtgg agaacatttg gtcttaccct gaaaccaatt gctcatcagt taggggacat 6961 tggtctctga aatgggaaca tggccataaa cacaaaacaa aaacacaaac aaaagaaaat 7021 aaacgacgca agtacataaa atacaaaaaa aaaaaaagag agagagaaaa agataatgtt 7081 aaaaactagg aggatctatt aagagttaaa ctcctgtaaa gaaaatgatg agtattgaga 7141 agaggcacaa gttaatggta ggtgagttgc tgggatgagc tgcaggctga ggttggtggc 7201 atttcagttc tctcagcaac gctatggcaa tgtgtcccca caacctgctg cacttatgga 7261 actgcattca ttaaacacat gcttatctta aaacagacaa acaagcacat aaacaataac 7321 aaaataacag aagaggtccc actctgattt agagcagatc ccaaaagtcg gtttgtttcc 7381 tagagcatca ccaggggctc aaaaggcttt taaaagcttt acgctgccca ctgtaggaag 7441 gggagatcaa atggaaacct ccaacagtta tttaatgcag gagacagctg atttccactg 7501 gttatgttga gaaggtagaa cgaccctaga gacaaactat acaaagaggg tgatattaca 7561 gaaagggatg gatgggtgac cggaaaggaa gaggcttaca gaaaggaagg ctttcagaac 7621 aaataacaag gagaacactg aggtcccagt gaaagaggga cagaggatat agaaacggta 7681 ccacaggaga acgactgttt gttagaaagg gacattcacg tccttgtttc agtagcttgg 7741 gtttctgtaa caaaagtatt tagctagtta atgacatcaa tgctgcaaag cttgctaaat 7801 ctctttggtt aagtcagtta tgctgctgaa aaatacaaca tctggtgcaa caggtttctt 7861 catgtgcatc ttcatcagaa tttttgctat tgtatcccct caaaaaaaaa aaaaaaacac 7921 caaacctaca aaagacattt caaaccaatt taacaaagct ttatcttctg gcatcaacag 7981 agctaatgca caacttcaca tttttggagg aaaaaaagag aacagcaagc atctccaata 8041 gctccagggg ggtatatatt attgggatag cactcaaatc tgaaggatgt ttttgaagac 8101 agcaaattac aaaagcatgg agtgccactg acaggtcatt agtttcaaca gtgacagatt 8161 tttctgagca cctctctgtt gcatggccac ccccaacaca ccttgaccac ttgtgttact 8221 tgacttgctt ttgagaggaa taaaagatct atattcaaaa gacctagggt tacaactctg 8281 tggaaaagta atcagtatag ccaatacaca ccagtaacct gcaggtagga cttcatgaat 8341 aggctgcaat cttcttttat gtaacagaaa gcataactga acaccaggaa attaatgtct 8401 gaagtattca agaaatccga attacgcata caaagatgga aatggagagt tatagcacct 8461 aaactgatgt aattttctgg agccattaag aaataactta caacaactca actgctccac 8521 aagagatcag ctcaaaagta accattctgg cttccaacag cttagaaaaa aaggttcgat 8581 attgacatgt tctctaggac agcatttgat cctagatatg ccagtgaatt gaagcgccag 8641 tctatcgcag tacaaaggca ggatggatcc aaccttcata caaacgactg cagcaactag 8701 atagatctga aataaccttt gttactaaac atgagaacat tatcaaactt gccatacacc 8761 atccactcag cttagcatct gccccctcca aagccccaat ttgtgctgcc aaaaaacatc 8821 catagcaact accgacatgt ggtgatttaa agggatcttt ttgctaaccc ttccttgaag 8881 cagccttcca gaaataacac agtgattcac aaatgacttt cacgtgggac aactccaatc 8941 agataacaac accagcagca cacagttggg acttcatgct gatgaacctc aggactactg 9001 ctccggggag agcactgctg cagctctgtg actgcaccag gacagaactc tatggcagac 9061 aggaggggct ctcacaaacc tgctcactcc tcagccttca atagaaacat aaaaaaacat 9121 tccaacctgg agcccagggg aaaaaaatca tacagcttct cagccagaat cccactggct 9181 caatgacaac actggtttta tggcatatgt tatttaagct ccttccttgc ctattccact 9241 cacaatgact acaaaccaca ggcatctccc tctctctcca accatcgtcc ctctgcctct 9301 gttcactccc aggatctgtt cttcacagtc accaactttg ttaccccccc acctgtctca 9361 tgcataccag aagcctgagc cacctgctgt gcacagccct aaagagacct tatagcttcc 9421 tgatggaaca tgtgcttgcg tacatagtgc cggaatatga tttctatcta tgtcattaag 9481 cctagtgaaa aaataaacct cttgcctaat gtgaattatc agtactcagt gtacaaagag 9541 catgggcatt tcttctactt ccaggtgctg ctgaagagaa ggatgtttac ttgttgcatt 9601 aaatgcacct gcaagcttcc tttccacatt cctgcgagag aagatggctt tttcagtgca 9661 ttcacagcac tgtgaatacg gggtagaggc ttttcataaa caaaattgaa acaaatgcag 9721 ttttgaagga cgctttgaat ttctgaatgc tgctgtgtga gcaggactag tagtacttca 9781 gagcaacaag atgcttctcc cactaccccc gtacctgcat ggcagctatt gccatatcca 9841 taggggataa tactaaggca cgtcctgcta acagacctgt gttgacaaac acctcaacct 9901 caggtacgta aatataaagc tcataaactc cagtaacctc aataatcatc acatgcagac 9961 agtgcaggat ggatttgtgt gaactgctgt gtaagactgc acaaacagca actgctttcc 10021 tctgagagcc atcactacag agggggattt ctgtctttgt attctttcac tcagagcttc 10081 ggaacagcac agatccacct ggtggagtgc taatgaattg atactttgtg cttagggaac 10141 tcagctccct aaaagcttaa tagctagtat ctgacccaag agacctgaat gctagccctc 10201 agcccctgat tcctgcacac cactacaaca gaaccatcta actctttgtt taaggcagac 10261 tacatgtcag gacgttcaac ctacttaaaa ctacattcag taaaatactc aactatgagt 10321 ggttgtctca tccgaccatt atttgagtga taaaatctct tgcttccatc actgcagtag 10381 ccaacagtcg atcacacagg ttcagcacca cagacaacat ccacgagcac agctctgcat 10441 tgctgtgtgt gttttgtcca accaccctac aaaggcagag gcagcaagaa ttgctatgag 10501 aaagccagcc tggcacactg tcatcaatct ctctagggaa atactagagc caagccgaac 10561 cagcaaccca accaaaaccc ctgagataac caccattcac tgagccctcc cagtgtgcat 10621 actcagcact gtccacgggg acgtctgaag tgccgttcaa cagaatctaa acttaagaaa 10681 acacattttg gctctcgttc agttttgcac tgtattactg tgttccatgg agacaataga 10741 atgctgcaat ggaaggtttc caaaatacca gaaatgtaat agacacaaat gacctcaagc 10801 aagatgcgtt acctctgtta tgctgttttc ataaggttcc agtcctggta aaatgtatga 10861 acttaatgaa gtcactgcca gggatggctt taatttcagt gaaactattc ttagtcagac 10921 ttaagaacca gcttacattt tgcaaggttg ccatctcaat tgccttttgg ttaaaagcag 10981 tgagtcaact tcttcattct tatatcagat ctgcaactga aactatggcc caggtttgtc 11041 tcttatctga taatgataca tttagaacac agcagatttc aaattttctg gtgttgacca 11101 gttagagggt ccatgtccgt gtattgcacg atcctgaaca acgaccacct atgctgttca 11161 gtctgacttt tgttaccctc taatttcctg cagttaaaaa actcagtctc taagtctcag 11221 agccctcctg agccactatt tcacttcatg gatatcaggc agctacatgc ccagaaacaa 11281 aatgagtcgc ccaggtagtg cacagcaatt ctgtatgcac tgttacatga gctaaggaag 11341 aacacgaggc agctgtgtaa gatatccagg tacaggtagg taggtggaga aaaatggaat 11401 tttgccaaaa ggattaatgc tctgaaggac gaccactgcc tttctaacaa gcacaactta 11461 cctgagctac actttgccca gaatcaacga gactcaccgg cattcactca tgagacatgt 11521 ctcattacca aacagtgatt ttctcctaac ctggcattac aatatccctt tgcaggcctt 11581 ctgtgcacta caggagtttg cctactgaac tacgacgaag aaaaaaggag gccatatcct 11641 ctcaagtttt aaagtcagac gaggagaatc attttctccc tacctggagt ccccatcttg 11701 gtcctctgag gtgtcaccag agctattcac cgcatatcga ctccgtggtt tatctgaggc 11761 aaagaataaa gacagctttc ttaaggagca gtcttccaac agtgtacagt gaggtcttgt 11821 tcttgttttt tcagttctca ctcacaaact catcaacaaa tgggagcaag acgcttacat 11881 tttcaaagga catagagaaa acaaaagcat tcaaagaaac aactttcctg acgagtacca 11941 gggaacaatt catgcttgca gaaccatgtt gcaattcaag gaagcaaaaa ttaaaaactc 12001 aactcattaa tttcccacat tcttgctcat gtttcaggac aatggtaagt tccaggagcc 12061 ccctctgcag tgacacatat ccaaagcttt taccaacaaa aagctgaaga gcaactggag 12121 ggaagcagac agcaaacaat aataataatc ttttcacact tcacttataa ggacaaaatg 12181 gggttcaaaa atgattcttc ctcagagaaa aatgactcac aatcttaaaa ggggaaatgg 12241 tttcaaacta aaagagggca gatttagatt ggatacaagg aagttcttta cagtaagggt 12301 ggtgaagcac tggcacaggc tgcccagaga ggtggtgggc gccctgctcc tggagacact 12361 cagtgtcagg ctgggtgggc tctgagtact gatggagcta cgggtgtccc tgttcatttc 12421 agggaagttg ggctgtaagg ccattaaaag cccattccaa ttcaaatgat tctgtgactc 12481 tgtgattctg tgacgtatgg cagttggcaa atgatccacg ggaaacaacc tcatcactgg 12541 ataacagcct cagatcacac acagggctca cccatcgcgt gcctctcatg atcactttgg 12601 tacgttctca gtgcccaaac tgttaaatgt tccagggcac caaactcaca ctgaaactga 12661 atgctgattc taacactgcc cggtgacagg ttaggtctgc agtggtgatg ctgtaatccc 12721 acaggtctga ccatgcgaag ccctcccaaa ccccctctgg gttcagcagc gcactgcaga 12781 gctcaggaca gaaacaccag ggctggctgc acatgtgggc acggctccac atctgtcacc 12841 gcagacccaa ggcagggccg tgcagaacgg atagaactca tagagcgtcc caagcaggag 12901 gggaccgcca gggattcggt ccacccttgg ctccactcca aacccaaacc ctgtgcctga 12961 gagcggggtc ccaacgctcc ttgagccctg gcggcgtggg gccgcgccca cagcccaggg 13021 ggttcgagcc gcccctcctc agaggcgcgc ggcaggaagg cgtctacccc gggccgtgct 13081 ggacagcggt gctgggccaa agcgtgcctc gggacgggag cctggggaca gcagcagccc 13141 aaggcacgct cacacagggc tgcacgagtg aggggacaac gggccaccaa acgacgaggc 13201 tgcgaggggg gtgccacgcc agcccgagcc gtgggaaggc cggggagggg ctgccaggac 13261 acgggccgga tcgcggccct gcggcacggg gcggccccgg gcccggcgcg cacttactgg 13321 cctgggcggc ggggtgctcg gcgccgcgct ggaaggggaa gcggaagagc agcttgttgc 13381 cgcggctgcc cgagctcgag ctcacaagga taacgctgat ggggctggtg ctctcgccca 13441 tgccgccgcg ccacagcgag caccgggcgg gcaacgacgg acgcggctcc gcggaaggcg 13501 gcccggcccg cgcgacttcc gcttccgcgc ctccgccgcc gccgccggtt cccccgggcc 13561 gcggccgagc ggcggggcgg agctgcgggc acagcgctcc ccgggcaggt cgcgctcaga 13621 ggccgggccg ccgcttcagc gccgtgccct cagtgcggcc cagcgccgtg cccgcagcgc 13681 tgcccacacg ccctcggggt gccccacggc tgctgcttgc tcccggtgcc cgccgttcct 13741 cccagcacct cgcagtgcag ccgtgcctga agtgcagccc agcacctcac acctcagccc 13801 cgggctccca gtacgaccag caggtcacgt tggagtctct tgtcctcaag actgcgcagt 13861 gtctcacctt tgagccttgt gccccccatt cagcccagca catcacactg tagcccttac 13921 accctcacca cagcacagca cctcacgttc aggccccagc acgtcaagat ggagccctgt 13981 gcccccagac agccagcatg gaaccatcaa atccttagag ttggaagatg tctgaatcct 14041 tgtgccccca gttcagcccg gcacctctca caccccactc aacactcttc agccaagagc 14101 ctacagctca acccagcacc tcacgccacc cagcagcact cccgccatca gcccagtgcc 14161 cccagtccgg atcggtacct ctcatgccca tgcacagtgc accagatcag cctagcacca 14221 ctagttcatt ccagcacctc acgtgcccac agccaaccac tccagcaccc ccggtgccct 14281 agtcacacct ctccgctgcc tcaaggttca ttcccacctc ttcccacatc ccctcacacc 14341 ccctcattat tttcatgtct cgcaatctcc tttggtcact tggagtcatt cagttatgac 14401 aactccagaa ctagaagctg ctggccagca gcaagtgcca caaactgtgt tcccccggca 14461 gctcttctgg ctcatttgtc ttattgtgtg tccagctgag atcagaaagc tatcggcaat 14521 tatgtcagag gatggcccag tttttcacat agatttgtct gtatttgata gcaatattta 14581 gtatttggtg ctccgagtat ccccactctg gatttttctc tgcaagattc ttcccttgga 14641 cttcaggcag agaaggggac tgaaagggag atgagcaccc gcagtgaggg cttaatctgc 14701 acggccattc tctgcaaggc aggtgataac aactgaagca agagaagctg tcattgaggg 14761 gagagagttg ttggtgagcg attaaagagc agtcacatta tcacagcaga gcattcatcg 14821 tggcccagtg ctggggagct acgttagaat tgcccagtgt gtctgcttcc cagcataact 14881 atgcattctt caattaaaaa actgcaggca tgtttgccat ttccagctct cggagatgag 14941 ttaaagcaaa gctctggaaa cctgcaagct ctctgagtgc tagtagaatg aaatgaaaga 15001 ataaagccag gatatagatt ctgcttttaa gctttctgaa acttaatgca cccaagttct 15061 tcttcagtta gagcagatgt atgtgccact gctacccacc gagattcccc tcgagagaga 15121 ggtggttcct gtgcagaaga ggcccatggg gacagttacc cgacctcaca gaggtacaac 15181 catccattta atgaactggg aattataatc tcacttttat aatgagcaaa aaggcctgtt 15241 gctgacttca actgacagcc tgaaaagatg ataatgaaaa gaagacctac attagtacaa 15301 aaagccactc ttcacatttg ctccgtgctt tgttcctgta ggagcgagac acggcaggcc 15361 ccaaagacag gctgctgtgc agcaggacga catgccagct cagcctgctg ctccttccca 15421 aggctctgag gacatctcac agcttcataa caaagataca cagcctgcac tgaagtgcac 15481 atcacgctct aggtttacga aacctcacta tgaaaagcaa ccgaaagaga gagagacaca 15541 tctgatgcag tttccatgta tcaatgctgc gaccagagga acacagtggc tgtgttgtcc 15601 aattcctcga gtagcgtagg ccaggagctc cagcctcggc accccgtggg cagctggaca 15661 gtggggcgag gaacagctga gaagtgtgtg tagggacaga gccccatcac tctccatgat 15721 tgctgcacgt ggctgcaaat tacactactc ccaccaggct cttttctgaa gagttcctga 15781 agctgtcccc agcacaactc tgatcctaga ggagcggtga gcccctccta gcaaaccact 15841 gtcacctttg ggctgctctg gcttttgtga tagtgcagga gcagctcttg gagaatgaat 15901 ccaaattcaa tgaagatttc agtgccgata tttcatatac aaaccaacaa gcctccggtg 15961 caggcatcac tctgatcttc atgcctacat ggaccttaat cgtccagtcg tggaggatca 16021 ctgtttgggt taaacagttg gcagcaaaat aggaaggagt attgctggca tctgcctggc 16081 ttatctgaag tcacggggac ttcccttgag cgctgtcagc agagcttgga gcacagccat 16141 gcagtgggcg atgtaagagg tagggctttc agaaggagac aacaacagca ttattttcaa 16201 agtattttcc tactggaaaa ctgcacgtgt tttttaaatg catgtaatta gagatgatga 16261 agacagcttc tctctcagta cctccatccg tttaagctgt gtaaactgaa tacagcattt 16321 tcctgtgtaa agtgcttagt gcctgtgttg tcagcagggc agccacatct ggagcacact 16381 gcctgcagcc gctggtagtg ctgtgaaaca caaacacatc ttggtaagga atctggctta 16441 tacatatcaa cctgctcagc acaagcatgc agcgctgttt gggcaaagaa cacgattgac 16501 acctgcccgt gccaccagct tgccatgcac tgcttctgag gattgatggg gtggcagaag 16561 gactcacctc tgcttcatgt gctggcacag tattccccaa gcattggtag caaccatctc 16621 cagagggtat ttccttgata ctttgcccac ttagctgtga ttgcagactg ctctggcccc 16681 tcctttactc gtttggccct ttacccatca cgtgtccatg cactgaagac acaaaccttc 16741 tagcacagga ccaggctttg tgtgcatggc cagcatgcag cacagtgcac accagcactg 16801 cctgaggtgc tgagggccgt ttcacagcac aagggataac tggctgtaag aatcacacag 16861 ccacaaatca aagcgatgcg gtatgaggca tttagtgatg cagctcagat ctgtatcggg 16921 gggaactgca gattaaggaa acaaactact gggcaatatc gatgttacca ttcaggaagc 16981 actcagcagt ttgaggatgt gtgactggtg aggaactgtg caactctgga ttgggctttc 17041 tcccataaac atgcttcttt ttagtactat acttgatttc ataagctctc tctccctgta 17101 tccacatgaa tcatgactac tggtgctgta aatccgcaat atcagtagtg gagctggtcc 17161 ctaactcttt gcaagaggcc actaaatgcc tgacagtgct ggaacgtgcc aaaatctcca 17221 ctgtttccca cagaatcagg gtctcactcc ttgccctgag ccatactgca catatcccac 17281 gtgtccccag cactaccggg gaagcattga atatgcagta ctaggccatg tagtggccat 17341 gtggattgga ggtatatgat atggcccaga aagacccagt ctggtgtgca catctgtgaa 17401 gcaaggtgac aaatacaaat tactgtccaa gcttcttagt ttgctcaaaa tccattgaaa 17461 ggcacgtcct aatcctccat cactagaatg ttccagcact atctctctgc tcatacgtgc 17521 ctggtcaact ttttcaggcc acacggccaa tacgtgttca gaagcaagaa gctgataaca 17581 ccagcaaacc aggaggccag tgcactgacc cttagcgaaa tgggctttgt ttcacaagga 17641 gataagggtc tcccacctcc atgggtgggc ctccggagtg accaatgagt gtggacagat 17701 gccaaggccc gctctcctct tcctctttat aaccggggct gcgagggcac tcagtacaac 17761 ctgctctggg tgttcactga agggagcctg agccagcact ctcctgcaca atggcactga 17821 cccaagctga gaaggctgcc gtgaccacca tctgggcaaa ggtggctacc cagattgagt 17881 ccattgggct ggaatcactg gagaggtaag tcacccacag caccccccca aagggtgccc 17941 cccctgactt tgctgttagg atgcatcttg tttcagtgct gtatgagtga gacccataca 18001 gtcgtgttag gactgatgag aactgcttga tgagctgcat gttttttaac atgatttttt 18061 cttactggag ttacacccgt gctatgaatt caaaggcatt catactcggt gtcccgaaat 18121 agggttatcg ctaccaggaa atgaactcaa atagatttat cacataagtc gccatgtaat 18181 ggacattaag agaatagctg tccacatttg ttgttgttca gagctattta aatgctattt 18241 acaatcaatc taacatatcg tgatatccct gctgggactc cgagtaccac tcataccaat 18301 cctggttcat tcagctcaca gcagtttgaa gaccttgcaa aattggctta cggtataata 18361 taggaaacgt ggtgtcagtt tcaagcttcc aaagctctat taattgctga aggagtgtat 18421 ttcaagtgcg gatcatgcgt gccttttcca ctgctgaact ttgttgtctt gttctcctcc 18481 aggctttttg ccagctatcc tcagacgaaa acctacttcc ctcactttga tgtcagccaa 18541 ggctcagttc agcttcgtgg tcacggctcc aaggtcctga atgccattgg ggaagctgtg 18601 aagaacatcg atgacattag aggtgctttg gccaaactca gcgagctgca tgcttacatc 18661 ctcagggtgg acccagtgaa cttcaaggtg agtgggcacg ctttcaggga tgaaaactac 18721 cagtcacaga actagaggcc acaggtcatt tagactaatg ggagcttcat ccctgactgt 18781 gtggtacagc ctcatgttgc tcctggttcc ccagctgatt tttaacctga gactggatct 18841 tctagcaaac catctaatcc agtttaagct gttacctggc tgggacatgt attttttttc 18901 attccagagc atgttgcaga ctcttagatt ggcatctctt actgattaca atggatttca 18961 tctttgtttt ttccctttca gctgctttcc cactgtatcc tgtgctctgt ggctgcccgc 19021 tatcccagtg atttcacccc agaagttcat gctgcgtggg acaagttcct gtccagcatt 19081 tcctctgttc tgactgagaa atacagataa atggcttcca cactgggtta gggacgtgca 19141 tccacggcac acaacagctg ccaagttctg gggtattctt cctatgcagt ccccccaact 19201 cccctgcgca ggggctcggc cacctgcaga ccacaataaa taattcaact gtgatctatg 19261 gtctctgtgc tttcatcttt catgttccac ctacaccttc gtgtttaggc agtgtgtttg 19321 gaaatcacta cgtaatacag aggggaaaat actaagctgt gtgagctcat gtacatcatg 19381 agaataacat actgctgagc agaagcccag aacaggctag agccctgtga ctgtcactgc 19441 tcacggttca gaagctatgt caaccaaata taaagctgtt caggaaacat tctttcgccc 19501 tgctaggata atgaatggac ccaggcttct tgtttagact ctacaaaatc cagcacagcc 19561 tattctgaac aacaactgca ttgccaaatg gcgtgatgtg cacagggcaa tgctgtgcat 19621 tgactgtgtg tgctcctctc caaagggtct ggacaaggtg aacaccttag gctatcacac 19681 ctgcttagtc ccacctgttt ggtaaaggga ccctcccctc agcttgacat ttcctatatt 19741 tctatcagga actttgcatg cttttgaaat ttgggaagac taatggctcc tggagatcag 19801 tgatgatatc tgagggacca aaattcccag tgtttggcat ttccctgagg ggaacgcatg 19861 gaccttccat gcctcttggc caacacccca gctcccagct gataaaagag acgtgaattt 19921 cattgtatgc atccttgctt cctcttccaa agtatttcca ggaccttaca gtggtacaca 19981 tcacctcttc actaaacaca caaaaaacac ctggagtgtt agagcagctt gaaactcaga 20041 agcccctcct ttgtttgttg atgtattctc tacctaagtg atccacgtgt tgtgggcagg 20101 ccaataagtg tggtatcatc cagtgcatct taggaagtgt ctgcgtctaa aggcagagtg 20161 attccagctt tatttactga gctatatccc tgctcttcca gtgagatcct tactgagtta 20221 cactcactgc ttatgcaggt ctcagaagac ttctgacatg aaaagatgtg tgcaaagcct 20281 caagcaaaca agcatgaact ggcactacaa ggtgcactgc tcatctcatt tacaaaatgt 20341 aagcaccgta tttgggcaag gggaagagaa atgagatgca gaagtctgct gatgttgccc 20401 agggcacctt cagtgagcag taagttttca aagcatccta tctttgtggg tctctgggta 20461 acctgaatgt ttaaatcgac aactgacagt gtaggtcagc atgaggtaca tcccttaagc 20521 ccagtgtatc agtgttttgt atcagtgttt tcatgcagag gctcagtgaa ggatatgaca 20581 gtccccaacc actgcatgcc ctatgggatg tcctgtcact tatggagagc agcaggagca 20641 cacgctggga tgggaacaag gcagcaagtc caggcctgac gagcaaacag gtatgaactt 20701 tggaaagact aggtgccaaa aaaagaggct cttccttatc taggtttctc tgggctgctg 20761 cagacagatg atgggagcac acagagggca aaggtggcgc aggaggggag cagacccagg 20821 ggctctttat ttggagaggg ctggatgtgg cagaggctgg aggattgcag ggcccaagac 20881 atgtggggga caaggagcca tgcaggctct tggcgaggga agggtgcatc ctgacaggag 20941 tgccaagcag caccacgagg ccaaggctgc cagcctgcag catccacagg acagtgactg 21001 ccaactgcag ggtgcagcct tgatgcgtgc cttccccaag cctggcgagg ggctgcctgg 21061 gatcccccct gcatgcagtg tggagctgcc tgtgagggtc aactcaactg gcagagcagg 21121 gcagggtagg gtaggacagg gcagggcagg gcagcagtgc cccctcaggg ccccctccgt 21181 gggcatcggg gcccctgagg caccgccgct ccctgctctc agcattgcac agccacggcc 21241 cctccgtgcg gataagataa ggccggggcg ggtgtacagg gagctataag aggtcggccc 21301 tgcaggctcc tccatcacac attgccacca gccaccagcc cgccccacca gctgccacca 21361 tgctgactgc cgaggacaag aagctcatcc agcaggcctg ggagagggcc gcttcccacc 21421 aggaggagtt tggagctgag gctctgacta ggtgcgagcc agcccagggg cacctggcgg 21481 ggtgggaatt ggggagttgg agccaagggg tgcgggccag gtcctggagt gttggggtgt 21541 gtgggttggg gcagggggca actgcggggc agcagagctg accagcctcc ttccggcagg 21601 atgttcacca cctatcccca gaccaagacc tacttccccc acttcgacct ttcgcctggc 21661 tctgaccagg tccgtggcca tggcaagaag gtgttgggtg ccctgggcaa cgccgtgaag 21721 aacgtggaca acctcagcca ggccatggct gagctgagca acctgcatgc ctacaacctg 21781 cgtgttgacc ccgtcaattt caaggcaagc aaaggctggg gtctgcgggt cctggggtcc 21841 tcaggtcagg ggatcttggg gttacaggat cctggaggtc ttggggtcag ggggtcctag 21901 agtaaagggg tcctgaggtc agggtgtgct gggttcctgg gagctgggat cctagggtac 21961 aggagccaga ggccaggggt ggtaccaggg ccagaatggg ggacggaatg gggctgagag 22021 tcctccgtcc agagcaaagg tactgagcct ctgtttgcct tgcagctgtt gtcgcagtgc 22081 atccaggtgg tgctggctgt acacatgggc aaagactaca cccctgaagt gcatgctgcc 22141 ttcgacaagt tcctgtctgc cgtgtctgct gtgctggctg agaagtacag ataagccacc 22201 gtctacaact tcaagtcttc aataaagaca ccattgctgc agcactgtgt ccatgtgtgc 22261 tggggctggg gacagggcat aggggtccag ggtgggctgg ggcacactca atgcttcccc 22321 acaccccttg ggagggaggg agaaaggaag ctgctagcag gtctggcatg gggcctgcct 22381 ggttaaacgt gatcctgaag ctggtagcag gactctgagg gcagagctca gcctgcacct 22441 tcccatgcct gcaaacctgc ttggggtggg tgaagccaaa tgtgtttctt tgaacccctt 22501 gaacagtttt ccatgggtga aggattgacc tctcctggct ctgtcaccat ggctgttaag 22561 gtcttgagtg gggaagctgt gggagctaaa tgcctgcgtg aattccgtgg gcaaccagac 22621 aagttcatag agtgaaaaaa atccactgta aaccatgaag tacaaacacc tttcatagcc 22681 tcgagtccct gagccacaac tccctagcaa ttgcctggga agggcccctc cacccctgcc 22741 ctgggtgcct gcttttccca tgccatttgg agagggtgga aatgtcacag gcagagctgt 22801 agtccaacat aaagctgctg cttttacatc attggcttta gatctgaggg agatggtggg 22861 ttataagaca gcaaactagc cctgggtttg ctctgcatgg tgaccagagg aggaaattgg 22921 gctttgtgag gaatgaggga agctgcacca ggaaatcagc acctgctttg gtagggagtc 22981 ccacgccgtg ccctgctccg gtgagcagtg ctccatactc caaagcacta tctacatctc 23041 agcatgtctg tgtccattta cacccagcta ttcctgtgcc ttcagtctga ggagctcttc 23101 tgcctttgca gtttgcccca ccgacgtgtg catgggcagc aaacacgcag cctccccagc 23161 atctgcttcc cagaagtgca gcccccctcc tccactcttc ccctcttcca gtgctctgac 23221 atggtgccag gtcaagctaa tctcccccaa acacagatga taagaaccgc tggaaatgga 23281 ggaaacacgg gtgtaaactg cccccccccc cgccagctct caaacctact ggaaccactg 23341 caactttttc accttcccag ccctccttat ccttctcatc ctcctcacct tcctcatcac 23401 cttcatcgtt tgggacacgc agagctcacc ggggtctgct ttggccatac attcccctct 23461 tcccctgctg cgagctcctg tagctgcagc catcaggtgc ctgcatcagg atccggcact 23521 gaccccgtgg ctctgctgcg tgccccattt ccagctgtca gggtcacccc atgcccccac 23581 agcagcagtg gcaaagaggg agctccagga tctgtttgca aagggttgca gctgcaggtc 23641 agggagcagg acatggagat gcactgcggt gtactggtgt gaggagtgca gaaggaatga 23701 gagtagaagg tgtgagaatg agtggtggtg tgtagggcta cattgcggag cagtgtgtgt 23761 agggacagtg catgtactgg ggtagtttgc atgtgtgcac aggctgtgct ggagaggtca 23821 gtctgtgcac agcatgagca cagggcatca taggggcagc tccttggggc agactcccac 23881 agtgctcctg aacctacagc ccccatattg ctgtcccctc cattctgcct gctcctcgcg 23941 tacagaacac atcccttctg gcctcctgag ggacccctca ttggtacctc agccctaacc 24001 cacagcagga ggctggacct actgctccac ctgcaccaag gcagacccta acctcaaccc 24061 atgccggtgc agacactaac cctaatccta accctaaatc cagctcatgc cagcatcaca 24121 ctgccccaac cctaacacca agctcaaccc tgatcctaac actaacccca gctcgcgtcg 24181 gggtccaacc cccccagcct gcgcagtatc gtgggtggcg agggcagcag ccctgcctgg 24241 ctggggtcca gaatctatgg ggcgggctgg ggggtgggcg gtggccagca cagcatataa 24301 ggctgacagc agacttcagg ggcacccgtg ctgggggctg ccaacacaga ggtgcaacca 24361 tggtgctgtc cgctgctgac aagaacaacg tcaagggcat cttcaccaaa atcgccggcc 24421 atgctgagga gtatggcgcc gagaccctgg aaaggtaggt gtccttctct gcctccggct 24481 gcctctctcc cctgatcccc ttcccgtcct cagctgcccc cgtcctatcc ctccctgcct 24541 tacccgtccc tctcccctcc tgccctgcta gccctgactc actgtgctcc gcaggatgtt 24601 caccacctac cccccaacca agacctactt cccccacttc gatctgtcac acggctccgc 24661 tcagatcaag gggcacggca agaaggtagt ggctgccttg atcgaggctg ccaaccacat 24721 tgatgacatc gccggcaccc tctccaagct cagcgacctc catgcccaca agctccgcgt 24781 ggaccctgtc aacttcaaag tgagtgtctg ggaaggggcg acccgccgcc cccaccgcag 24841 ccccgtcttg ggccatgcgg ccacccctcg cctcaccccc tcgctcatcc tctccttttg 24901 ccttgcagct cctgggccaa tgcttcctgg tggtggtggc catccaccac cctgctgccc 24961 tgaccccgga ggtccatgct tccctggaca agttcttgtg cgccgtgggc actgtgctga 25021 ccgccaagta ccgttaagac ggcacggtgg ctagagctgg ggccaaccca tcgccagccc 25081 tccgacagcg agcagccaaa tgagatgaaa taaaatctgt tgcatttgtg ctccagccct 25141 ggtgtcctgc tctggtttct gcctgcgggg agggagggga gagatctgct ttggggctct 25201 gtgcagctga accaggggcc cccggggcca ggtgggtgtg cggaggatat gggctcaggt 25261 ggtggtgttt gagtggtgtg cccagaagac agatgcgttt ctgttgtctg tgggcatgag 25321 tagagctctt acaggtgctg gtggacaaga aagctttatg gagagtggtt cttgagggct 25381 ccctgcagct ccagcccatc ctgagcttcc cagcagagcc acatactcat actgggatcc 25441 catgcactcc ttaccccatg aagcccaaac tgtagggaaa cattctggaa gggctcgcag 25501 gaggtggtgg ggaacatttc ccagcgtctg tcatcgtctg tgtggtgaca cagcgaggac 25561 atgctgagga ccttctgcac cagggcagct ttctgtgtat gcctacagac tgtggccctc 25621 ataggacttc cctgcttcca tctcctgatg gaacatgctt gctatgtgcc acttgtgggg 25681 gtcccagggc acactgagtg ggcagacagg ctggagaaga ccacaacact ccaacaggaa 25741 ccatcagcac ttggggctca gttcctgctc tttgggctat gaccctcagc agggcagggg 25801 gatgagcgca gggtgaagct gtgctgggtg ctgtgtgcgg cccccggctc ttatcacggc 25861 cagcagcagg gctggggatg gccatagcca gtgggggctg caggtggctg ataaagagct 25921 gacaggctct cctccagctc acgggtgtgg tgagggacgt gggcagcaga tagcctcggg 25981 tgggtaaaac ctgccaggtg ctgagtaact gcagtctgat cctgccaggt atcttcatgc 26041 agagatggag gtttgagaaa gccccagggt ggccagcacg gaaaaaaaca cagcatgaat 26101 ccggcctggg gttaccacgg gtcccttcat tcctcgccac acgtaccata caaatgggtc 26161 cagatggaaa gatggacact tgcctgagtc ccacagtctc cagtgacatc agggcccagt 26221 gctcacaggt gactccagag agcttcaggt gccacagctc caccaggcca tgctccaatc 26281 tccccttcca cctcacacca tccagataca tctatgggca ctgcacccca cagcagagag 26341 cctctgctcc tggggcagca gggagcacag ggaacagggg cacagtgcag gcagggcaca 26401 cgtgcactgg gacagtgtgc agcatggtaa gagccatcca aatctgcata gatcgggtct 26461 gctcaacctg ctgcagggga tccttgcagt cctgagcgtg gcaaaggcag tgctcgcaca 26521 agaggggtga agctgcatgt aaaggaatgc atgtcctcca gccttgagct ggagcaagca 26581 gcgggcagct catatggaac atatctgtat gtgtgcatgg aaagaaaagt gtcaggagca 26641 atgcaggggg atgatgtgtg caaggtctgg ctgttgtccc tgctgcctct gccagacctc 26701 tgcctccagt tgctctcaca ccttgctgtg acctcaggta ggactcagca gggacagtgt 26761 tttggaggag tcttggctct gtctggataa aaaacaccat gctgcaaaca gctgggtgtt 26821 ttgtcccagc tacttgtccc tgctgccaca agatggaggc ctcaagccgt ggtgccttta 26881 tcagagccag acagacatct cagctttgag ttggagcctc cagccttggc caggacgggc 26941 ttctatcagg ctctgagctg ctgagcacct tggggtgctg gtgctggctt ggaggggtgc 27001 acggaacccc tgctgcccaa ggggccagac ccactgccag gagggtctgc accgccggct 27061 aggcagagct gggcacggtg tgggaacagg aaccctgccc aggcccctca ccctcttcct 27121 acaatcatag aaatcataga aacacagaat ggttgggttg gaagggacct tacagccccc 27181 agctccaccc ctgctgtggg ctggctgccc ccaccagctc aggctgccca gagcccctcc 27241 atggccttgg gcacatccag ggatggggca ctcacagctc cgggcagcag tgccatcacc 27301 tcactgccct ctatgtgaag gatttcctcc tcacatctta cctaaatctc ccctcttttg 27361 gtttaaaagc attccctctc gtcctctcac tatctacccg tgtgaaaaag aagacaagag 27421 aaggcaagca tcctagagga gtttagcaag aatttcccat tcaaaatgta cttcctcatt 27481 tagcaaagaa gaaaacacag aattttcttc ctgagagcca attgtaacat accacttgag 27541 agatcctctt gctagagaga tgagggagag catcacctgc ttacaaaagg ttcttctacg 27601 tttgcatctg acactggcat tcagtaggcc aatagcaaaa cccattattg gtgcaaccta 27661 taggtcttct gctgaaaaaa tcctgtcaaa gcagcttgtt tgccttaggc ttggccagtt 27721 tttctttcct aaatgtcagc ctgaactgct ctctctgata tatcccaacc atttttaacc 27781 caacgctttg ggttgctcac agctcttcca gctcagatcc atctgaggtt ttaataagtg 27841 tattctctat tacaacacat aagatattga tgaaggaaca gtgaataatg atgttgaacc 27901 aagaactgga atctgtcaca gaaggaaaac agactcatct gaaatgattc attcttggca 27961 atagcagtgc ttattcatct cttacttcaa ttaaatgtcc cactgttagt tgtgaattga 28021 agttgaccgc cttatagttc cctggtttcc actttttacc ctgctaataa gaatagaccc 28081 tactgtctta ttctgcaacc ccatcaccct ctgccattgc tcaaagattc tgatggagtg 28141 aaatggcttc aatcagtcgc tgaggtaaag gggaatttct ccacgcagtc tgggaacact 28201 gaaatgattg aactatactt taatccactc ctttcgggat ctatctacct tccttaagtg 28261 ttaagtcctt tatcagaaat gacattttag tgaaggctga ctctattttc tctcttctgt 28321 gagtactctt cctgtccctg ctgcgtcaac accttccttg ggggatattt tctatttagg 28381 ctttaagacc cttgtgaatg ttggtcttgg tctttcgatc gtggccccgt gtgctcagtg 28441 ttccttcacc acaacgtcag ttttgatgga ggacttaaag tattttctct ccttcctttc 28501 ctgaaagcag agactgctcc cagtggagca gccacatcga gcatcagctg cacgggtccc 28561 tggagagaac agggccgata aaaacaagca ttatcagaac ttaaaacatt gtctgtgctt 28621 catggccatc accgaaagtt taatgccaca aacaatatct tagcaaaact ttttgctcgt 28681 ccccctctga gttgacctgg gagtaatcca agtcccatcc tgtgagttcc ctttgaccaa 28741 gaaggcatga agcacgctga accccctgat gctggagata cccatggcag aacccagccc 28801 ttggctttta cagctgagag gaggtggagc agacctgcag tagtttgacg ttggccacac 28861 attccttctc aaggctggca tctgggatgg cactggggag agcacacaag cctcgccatt 28921 ttaggctgtg ctcctccaac acctgtaaat gatttcagag agtaagttcc tatgcgttgc 28981 cttcagcatc tgtttatcga cctgttgttt gtgaatttgc tcatcccttt ataaacctgc 29041 tgaccccatt tgtttccaca gcctcctgtg gcagcgggtt ccagaagttc attctccctc 29101 cctgcctcac atgaagtccc ttttatccgt ttcaaaccaa cctactggac tcatcgatgg 29161 cctctaattc cagtactgca gaatccagtg agcaccaaga tccacattgc ccttacgtct 29221 gtgtaaagta ggccctttct gtgccctctc ctgacaaacc ccagtcttcc cagtccatca 29281 tcacaccgca gctgccccat gcccatctca cttgctctca ggacactcag ttgatcttag 29341 ggctctctct gcctgcactg tgtccataga tttagaggcc agaattgcac agccatccca 29401 ggagccaaac ggtgcctggc cagccacagg tggggctccc agggcagagc aagaaaggca 29461 aacctgccat aaatgtgatg catgggaaaa gacctgagac actgctggat cgcattaccc 29521 atacactaca atgggaagcc tttcaccact gtcctgcaat gaaggaacga agtctgatat 29581 tgagctgaag cgtacagctc tcagattttg gatcc //