According to FGENESH, the coding sequence boundaries are:
exon # | begin (in cb089c8.1) | end (in cb089c8.1) | begin (in gene 5) | end (in gene 5) | exon size | intron # | intron size |
1 | 24049 | 24163 | 1 | 115 | 115 | 1 | 215 |
2 | 24377 | 24432 | 329 | 384 | 56 | 2 | 48 |
3 | 24479 | 24602 | 431 | 554 | 124 | 3 | 78 |
4 | 24679 | 24743 | 631 | 695 | 65 | 4 | 51 |
5 | 24793 | 24880 | 745 | 832 | 88 | 5 | 1241 |
6 | 26120 | 26406 | 2072 | 2358 | 287 | 6 | 51 |
7 | 26456 | 26507 | 2408 | 2459 | 52 | 7 | 114 |
8 | 26620 | 26718 | 2572 | 2670 | 99 | 8 | 49 |
9 | 26766 | 26848 | 2718 | 2800 | 83 |
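The columns of the table are related by simple arithmetic, which the following minimal Python sketch reproduces from the clone coordinates alone. The names `EXONS`, `to_gene`, `exon_sizes` and `intron_gaps` are illustrative and are not part of FGENESH. It assumes 1-based inclusive coordinates, with position 1 of gene 5 at 24049 in cb089c8.1. Under that assumption the number of bases strictly between two exons is two smaller than the value in the "intron size" column, which appears to follow next begin minus previous end plus 1.

```python
# (begin, end) of each exon in cb089c8.1, copied from the table above.
EXONS = [
    (24049, 24163), (24377, 24432), (24479, 24602),
    (24679, 24743), (24793, 24880), (26120, 26406),
    (26456, 26507), (26620, 26718), (26766, 26848),
]

# Assumption: gene 5 position 1 is the first base of exon 1.
GENE_START = EXONS[0][0]


def to_gene(pos):
    """Convert a cb089c8.1 position to a 1-based gene 5 position."""
    return pos - GENE_START + 1


def exon_sizes(exons):
    """Inclusive length of each exon."""
    return [end - begin + 1 for begin, end in exons]


def intron_gaps(exons):
    """Number of bases strictly between consecutive exons."""
    return [nxt[0] - prev[1] - 1 for prev, nxt in zip(exons, exons[1:])]


if __name__ == "__main__":
    for number, (begin, end) in enumerate(EXONS, start=1):
        print(number, to_gene(begin), to_gene(end), end - begin + 1)
    print("gaps between exons:", intron_gaps(EXONS))
    print("total coding length:", sum(exon_sizes(EXONS)))
```

The total coding length comes out as a multiple of three, which is a quick sanity check that the nine exons together form a complete reading frame including the stop codon.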
NgoMI \
1 atgactccaggattcaccggatatatacctggcgcaaaatggcaggtcggaagtcgctacgttgccggctcacgg 75
M T P G F T G Y I P G A K W Q V G S R Y V A G S R
76 gaaaacagtggattctcaaataggaatgcttcggtacctg|gtaagaaaataagagtatgtatgagcattgccaaa 150
E N S G F S N R N A S V P E
151 aaattaaaaaaaaaggaaacgtatcatgtttcatatttgtgaattgattccttttttgagcggcttcggaagagg 225
226 ggaatttttcaccgtttgtaaatatggctcatgttggttagtgtgaaaccagtgagcgtagtaaattttttttaa 300
301 aaattgaagtttgactctattttttaag|aaggctcacaatatcaaacaccaaatgcgacgaatcaaaatttttct 375
G S Q Y Q T P N A T N Q N F S
376 gctgagaat|gtaaggatatatttgggtaaaaataatcaaatgagattttttgcag|agatcaatggcaatgggtgc 450
A E N R S M A M G A
451 aggagatgaacaaatgagaaactttacggaaatgacgaatgacgaacttcgcgaaaggctaatgaaaatgcaaat 525
G D E Q M R N F T E M T N D E L R E R L M K M Q M
526 ggatatgcagaaccttcaaatggcaatga|gtatgcagaaccagcaacaacaacactctggcaaacaccaaagttt 600
D M Q N L Q M A M S
601 tcgaaatgacgtgctaaattttattttcag|gcgctaacggcggagatggctcctccaggaatgcggttaattttg 675
A N G G D G S S R N A V N F G
BspEI \
676 gagacccatccggacgaatg|gtaatgagaaatgagatggagcttttctattcattatattcattttaag|gacaga 750
D P S G R M D R
751 agaacagaatatgaaaatgggcaaagaggtggtggtggacagcaggagcaactgaaacaaagaagcaagagtatg 825
R T E Y E N G Q R G G G G Q Q E Q L K Q R S K S M
826 ccaagaa|gtgagttgatgcggaaactgaaaagtgtcacgggcaccttttgtaattattgccccgaattctagttc 900
P R K
901 cgtgtagagcttcatcatcttgggagaacagaaaacaaattgttttctgttattaggtttatcacaaaaaaagag 975
976 cctgagaattgaagcaatgacataaaacccgtatatattaacagaatgcaggaaaacgagaagaagaaagaaaaa 1050
1051 caagcaatctgcgtcattcaaatttaatttcagttctagctaataacagcttttttcacctgatttcgacaaaac 1125
1126 aaaacattatgagaggcactgttcttgaacaatgagcttgaattttagaatgacatagaaaaattcaattttttg 1200
1201 gcagatataaaaatgaaaattgcagaattgaaagtgatgtcaataattaaaatagtttgcgacgtcatttggtga 1275
1276 gagaattactgaataagaaggccacgttcatctaatttcaattttttgaggacatatgtcctttttaaaactgaa 1350
BspEI \
1351 cgcagttaattgataattgattaatgaagtttggaacagaattcagaattcctacttgaatatttccggagtttt 1425
1426 taaaggaataatatcccaatcggtattgtatattcaaatttctttttcgtaattagaatttttcagactacagta 1500
SpeI \
1501 tgttaaaggcgcacgcaaaaagtttcttcgtggctctcgaatatgttttgcggttttgaaactagtttgttaatt 1575
1576 tttcaccacttttggtgatggcacagtgggtatctaatcaaaaatcttcgaaaattggaataattacgaaaattt 1650
1651 gaagtattcaaaattgtaatattcggaaacacggcaatttcaaaaatcggaatattccgaaaatccgaaattttg 1725
1726 aaaatcggataattttgaaattccaaaatattgaaaatcgaagtattccaaatcaaaaattttaaaactcgaaat 1800
1801 tttcaacatttccaaattcccctagatttagagaaaacttaaaaattacaccgattctagttttgttagttataa 1875
1876 tcgtcactaaaaagttttgtgacacaaaaaaacatcccaaaaaccaaagtaagatgtttgtaaacaacaactcaa 1950
NheI \
1951 gaaatcgccggaagattagttccgtaacttttagatcccttgttttttctagtccgtccagctagcatatgttac 2025
2026 aaatcacttcctaacttcttgatttccttaattttttttctttcag|aaatcgaaaacaatccgtttgatggaatc 2100
I E N N P F D G I
2101 gaaagtggatggtggagtaaaggagaagtgaagaggaatcaggttggtaccacagaaagaaaggcaactgaaacg 2175
E S G W W S K G E V K R N Q V G T T E R K A T E T
2176 aatgcgtatttgacgcttcgcttcttcattctctttatttttttttcgaacaaaaaagctcaagctcgatatttc 2250
N A Y L T L R F F I L F I F F S N K K A Q A R Y F
2251 aaggagcggcgggaaattgcgaacggcggacaaatggtgagcggcggcaattggaaccaatggccgccaagtcgt 2325
K E R R E I A N G G Q M V S G G N W N Q W P P S R
XhoI \
2326 cagcgagtttcaagaagaattggagcactcgag|gttagacgaaaatgattggtctcattgttgttttgtgctatt 2400
Q R V S R R I G A L E
2401 tttgcag|aaggatgagacggaagacattccaacagctgggtatagcgggcacattcaag|gttagaaatactaaag 2475
K D E T E D I P T A G Y S G H I Q G
2476 aaaagaagacaccagatgtcttgggtcgtaaaaactaacagtaatcctaatcaaacactaatataaaaatctaag 2550
2551 attctaaatacgaattttcag|gacttcgtcaacttggagttggaaaaccattcaacgtagcagcgaagcaggcta 2625
L R Q L G V G K P F N V A A K Q A K
2626 aaaaagagtacattgaacggagaagaacgtacagtggtagtcgag|gtaattggatttacggggttaaataaattt 2700
K E Y I E R R R T Y S G S R E
XbaI \
2701 taaaaaaacgagaaaag|aggcctacaaaaagacatctgcgcaaggtgtacgtgtcaaggaagttgagtgttctag 2775
A Y K K T S A Q G V R V K E V E C S R
2776 agaaaaaacacaaccaggcggttga 2850
E K T Q P G G *
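In the listing above, each `|` marks an exon/intron boundary and the amino acids are written under the exonic bases. The following minimal Python sketch shows one way to read that notation: it splits a `|`-marked stretch into exon and intron segments and translates exonic sequence with the standard genetic code. The helper names `split_marked` and `translate` are hypothetical, and the caller has to say whether the stretch starts inside an exon, because the marks alone do not carry that information.

```python
from itertools import product

# Standard genetic code, codons ordered with t, c, a, g at each position.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def split_marked(marked, starts_in_exon):
    """Split a '|'-marked sequence into (exons, introns).

    Segments alternate between exon and intron at every '|'.
    """
    segments = marked.split("|")
    offset = 0 if starts_in_exon else 1
    exons = segments[offset::2]
    introns = segments[1 - offset::2]
    return exons, introns


def translate(cds):
    """Translate lowercase DNA codon by codon, ignoring a trailing partial codon."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # Rows 2701 and 2776 of the listing, joined; the stretch starts in intron 8.
    marked = (
        "taaaaaaacgagaaaag|aggcctacaaaaagacatctgcgcaaggtgtacgtgtcaagg"
        "aagttgagtgttctag"
        "agaaaaaacacaaccaggcggttga"
    )
    exons, introns = split_marked(marked, starts_in_exon=False)
    # Exon 8 ends with one base ("g") of a split codon, so prepend it here.
    print(len(exons[0]), translate("g" + exons[0]))
```

Because a codon can be split by an intron, a single exon is generally not translatable on its own. In the usage example the last base of exon 8 is prepended to exon 9 so that the reading frame matches the one shown in the listing.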