Skip to another splice site:
1  
 5'  3'   
2  
 5'  3'   

Data associated with selected splice site

Sequence

 atgc   intronic sequence     ATGC   exonic sequence

...attaattttatttattataaatttgggtcttttgattgctttcatgttttggttcgagaacagttgtcttagtggtttggtttgttgttgtgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGCCCAAAAGGCCATGGGGACCAATGATGTGAGGGTGGATGTGAAGTTGAAC

Basic information

species Glycine max
transcript GLYMA01G00740.2
intron # 2
splice site 3'
intron type U2

Orthologous splice sites


 atgc   intronic sequence     ATGC   exonic sequence


upper sequence: GLYMA01G00740.2 (Glycine max), 3'ss of exon 2
lower sequence: LOC_Os06g21480.1 (Oryza sativa), 3'ss of exon 1
--attaattttatttattataaatttgggtcttttgattgctttcatgttttggttcgagaacagttgtcttagtggtttggtttgttgttgtgtaaa--acagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGCCCAAAAGGCCATGGGGACCAATGATGTGAGGGTGGATGTGAAGTTGAAC
| | | || | || | | | | || | | | | || ||| | || || ||| || |||| | || ||||||| || |||||||| |||||||| || || |||||||| ||||| ||||| || ||||||||||| |||| ||| || ||||| ||||||||| | |||
acacctacatagttaaataagtagacagctaactacatagtactaagatatttgtttaatg-caacaatc--agtaactta-cttgtccatacccaatccacagCACCTTCAAGAAGAAGGCTCCTAATGCCATCAAGGAGATCAGGAAGTTTGCACAGAAGGCCATGGGCACCATTGACGTCAGGGTTGATGTGAAGCTCAAC

upper sequence: GLYMA01G00740.2 (Glycine max), 3'ss of exon 2
lower sequence: GRMZM2G132623_T01 (Zea mays), 3'ss of exon 1
attaattttatttattataaatttgggtcttttgattgctttcatgttttggttcgagaacagttgtcttagtggtt--tggtttgttgttgtgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGCCCAAAAGGCCATGGGGACCAATGATGTGAGGGTGGATGTGAAGTTGAAC
|| | | | || | || | | | || | || | || | | || | | || | || | ||| ||||||||| |||||||| || || || || || |||||||| ||||| ||||| || ||||||||||| |||| || | ||| | ||||||||| | |||
atgacctctcgttcctgcttgacattgtactgttactgtaattgaaggttact--gaatgccgcgatcccacttcttaccaacattttctcatgtctggcagCACATTCAAGAAGAAGGCACCCAACGCCATCAAGGAGATCAGGAAGTTTGCGCAGAAGGCCATGGGCACCACGGACATTAGGATTGATGTGAAGCTCAAC

upper sequence: GLYMA01G00740.2 (Glycine max), 3'ss of exon 2
lower sequence: GRMZM2G132623_T03 (Zea mays), 3'ss of exon 2
attaattttatttattataaatttgggtcttttgattgctttcatgttttggtt--cgagaacagttgtcttagtggtt--tggtttgttgttgtgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGCCCAAAAGGCCATGGGGACCAATGATGTGAGGGTGGATGTGAAGTTGAAC
|| || | || | |||| || | | || | | || | || | ||| ||||||||| |||||||| || || || || || |||||||| ||||| ||||| || ||||||||||| |||| || | ||| | ||||||||| | |||
----------------------------------gttactgtaat-tgaaggttactgaatgccgcgatcccacttcttaccaacattttctcatgtctggcagCACATTCAAGAAGAAGGCACCCAACGCCATCAAGGAGATCAGGAAGTTTGCGCAGAAGGCCATGGGCACCACGGACATTAGGATTGATGTGAAGCTCAAC

upper sequence: GLYMA01G00740.2 (Glycine max), 3'ss of exon 2
lower sequence: PP1S30_365V6.1 (Physcomitrella patens), 3'ss of exon 1
attaattttatttattataaattt-gggtcttttgattgctttcatgttttggttcga-gaacagttgtcttagtggtttggtttgt-tgttgtgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGCCCAAAAGGCCATGGGGACCAATGATGTGAGGGTGGATGTGAAGTTGAAC
||| | | || | || | || | || || | | | |||| | | || ||| ||| ||| |||| | |||||| || ||||||| || || || ||| | |||||||| | || || ||||| || ||||||||||||| |||||| ||| ||||||||||| |||||
--aaatccttgtgatcgttgatctcaagtatattcatgatgcttttactcctgttcatcggtctcttcatttaaacacttgctttccctgttaca-accgcagCACTTTCAAGAAGATGGCCCCCAAGGCTGTGAAGGAGATTCGCAAGTTCGCCCAGAAAGCCATGGGGACCAGTGATGTTAGGTTGGATGTGAAGCTGAAC

upper sequence: GLYMA01G00740.2 (Glycine max), 3'ss of exon 2
lower sequence: PP1S50_89V6.1 (Physcomitrella patens), 3'ss of exon 1
-attaattttatttattataaatttgggtcttttgattgctttcatgttttggt-tcgagaacagttgtcttagtggtttggtttgttgttgtgtaaa-acagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGCCCAAAAGGCCATGGGGACCAATGATGTGAGGGTGGATGTGAAGTTGAAC
||| | |||| | | |||| | | | || || ||| | | | | | | | || | | | ||||| || | | |||||| || ||||| | || || ||||| | ||||| || | |||||||| |||||||||||||| || | |||||| ||| ||||||| |||||||||
cattgttaagttttagggttacattggatttgctag--aattaggtggactggaattgcaatcgctcgaccgcatgctatc-tcagttgtcgtttgattgcagCACTTTCAAGAAAATGGCACCCAAAGCGGTGAAGGAAATCCGTAAATTTGCGCAAAAGGCCATGGGCACGAGTGATGTAAGGTTGGATGTAAAGTTGAAC

upper sequence: GLYMA01G00740.2 (Glycine max), 3'ss of exon 2
lower sequence: PP1S58_106V6.1 (Physcomitrella patens), 3'ss of exon 1
attaattttatttattataaatttgggtcttttgattgctttcatgttttggttcgagaacagttgtcttagtggtttggtttgttgttg--tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGCCCAAAAGGCCATGGGGACCAATGATGTGAGGGTGGATGTGAAGTTGAAC
|||| | || || || | | | | ||| | || | | |||| | | | |||| || | | | || || || ||||||| ||||| || || | |||||||| ||||| |||||||| ||| |||||||||||| ||||||||| | ||||| |||||||||
attagagaaaaggcaagcaacttatggacacagaa--gttaggatgcggatactggaacattctctcactagtcgcctattgtgttttttgtttttacttagTACTTTCAAGAAGATGGCTCCCAAGGCAGTGAAGGAGATCAGGAAGTTTGCCCAGAAGACCATGGGGACCAGTGATGTGAGACTTGATGTTAAGTTGAAC

upper sequence: GLYMA01G00740.2 (Glycine max), 3'ss of exon 2
lower sequence: PP1S79_255V6.1 (Physcomitrella patens), 3'ss of exon 1
-------attaattttatttattataaatttgggtcttttgattgctttcatgttttggttcgagaacagttgtcttagtggtt-tggtttgttgttgtgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGCCCAAAAGGCCATGGGGACCAATGATGTGAGGGTGGATGTGAAGTTGAAC
| | | || | | | ||| | | |||| || || ||| | ||| | || || || ||| | | | | ||||| || ||||||| || || || || | |||||||| ||||| |||||||||||| | ||||| |||| ||||||||| | ||||| |||||||||
cctgtggagtgtagggaaagataagagaccgaagtcatgttattgtagcca--acgtgcaccgaattta---ttctcactgattatgaaacgttttccttt---atagCACCTTCAAGAAGATGGCGCCCAAGGCAGTGAAGGAGATCAGGAAGTTTGCCCAAAAGACAATGGGAACCAGTGATGTGAGACTTGATGTCAAGTTGAAC

upper sequence: GLYMA01G00740.2 (Glycine max), 3'ss of exon 2
lower sequence: EFJ04749 (Selaginella moellendorffii), 3'ss of exon 1
attaattttatttattataaatttgggtcttttgattgctttcatgttttggttcgagaacagttgtcttagtggtttggtttg-ttgttgtgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGCCCAAAAGGCCATGGGGACCAATGATGTGAGGGTGGATGTGAAGTTGAAC
| | ||| | || | | | | | | | | ||||| || |||||| || ||||||| ||||| || | |||||||| | |||||| || |||||||||||||| ||| ||||||||||||||||| ||| | |||
----------------------------------------------gtaagcttcaatcgca--cgctccattcgatcgaatcgattgttttgcc--gcagCACCTTCAAGAAGATGGCTCCCCGGGCGGTGAAGGAGATCAAGAAATTCGCGCAAAAGGCCATGGGAACCTCGGATGTGAGGGTGGATGTCAAGCTCAAC

Mapped EST sequences

Showing partial alignments of ESTs and genomic sequences. See full alignments


 ATGC     EST sequence
 ATGC     genomic sequence (exon)
 ATGC     genomic sequence (truncated intron)


EST: gi|38191162|gb|CF920368.1|CF920368
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|31308929|gb|CD394132.1|CD394132
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|12773734|gb|BG238661.1|BG238661
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|14992136|gb|BI317809.1|BI317809
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|209725879|gb|BW669688.1|BW669688
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|207718387|gb|GD700393.1|GD700393
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|7692591|gb|AW760694.1|AW760694
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|58021716|gb|CX708457.1|CX708457
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|151405794|gb|EV275609.1|EV275609
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|298182711|gb|HO031446.1|HO031446
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|7146575|gb|AW508497.1|AW508497
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|31474230|gb|CD416258.1|CD416258
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|193552387|gb|FK509872.1|FK509872
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|15001255|gb|BI322069.1|BI322069
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAATAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|192328127|gb|FK020402.1|FK020402
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|208029643|gb|GD832021.1|GD832021
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|254327000|gb|GR836609.1|GR836609
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|207796179|gb|GD769332.1|GD769332
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|193319920|gb|FK287597.1|FK287597
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|27423676|gb|CA935196.1|CA935196
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|192333827|gb|FK025088.1|FK025088
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|21256903|gb|BQ453791.1|BQ453791
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|4290608|gb|AI438098.1|AI438098
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|192302764|gb|FG993784.1|FG993784
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|38191100|gb|CF920306.1|CF920306
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|213611910|gb|DB958920.1|DB958920
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CGCATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|19268681|gb|BM884937.1|BM884937
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|58021531|gb|CX708272.1|CX708272
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|192334196|gb|FK026577.1|FK026577
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|213617197|gb|DB963760.1|DB963760
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|209725880|gb|BW669689.1|BW669689
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|209724485|gb|BW658202.1|BW658202
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|21602736|gb|BQ613067.1|BQ613067
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|4294266|gb|AI442166.1|AI442166
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|7042503|gb|AW472397.1|AW472397
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|207705317|gb|GD687349.1|GD687349
EST:     CCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: CCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|33387678|gb|CA850885.1|CA850885
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|208222771|gb|GE027474.1|GE027474
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|192296293|gb|FG991575.1|FG991575
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGGTAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|209722821|gb|BW682894.1|BW682894
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|9899531|gb|BE608499.1|BE608499
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|6666153|gb|AW277612.1|AW277612
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|18041108|gb|BM309402.1|BM309402
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|31474134|gb|CD416162.1|CD416162
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|9899082|gb|BE608050.1|BE608050
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|209724486|gb|BW658203.1|BW658203
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|4397030|gb|AI496027.1|AI496027
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAAATTTG
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAA-TTTG
EST: gi|208198798|gb|GE000748.1|GE000748
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|6846972|gb|AW349262.1|AW349262
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|209722822|gb|BW682895.1|BW682895
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|18730134|gb|BM525703.1|BM525703
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|18730475|gb|BM525897.1|BM525897
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|213601691|gb|DB972413.1|DB972413
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|24135982|gb|BU926492.1|BU926492
EST:     GTGGTTACCCGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|6726250|gb|AW310604.1|AW310604
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|6719031|gb|AW306678.1|AW306678
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|208178181|gb|GD978750.1|GD978750
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAG
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAG
EST: gi|13788680|gb|BG651271.1|BG651271
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|38191163|gb|CF920369.1|CF920369
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
EST: gi|38191241|gb|CF920447.1|CF920447
EST:     GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTG                         CACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC
genomic: GTGGTTACACGTGAGTACACCATTAACCTCCACAAGCGCCTCCATGGCTGgtaatctatt ... tgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGC


 atgc   intronic sequence     ATGC   exonic sequence

Intronic sequence truncated to 55 bases.

gttttggttcgagaacagttgtcttagtggtttggtttgttgttgtgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGCCCAAAAGGCCATGGGGACCAATGATGTGAGGGTGGATGTGAAGTTGAAC
                                             tgtaaaa  TA-rich tract
















Putative cis-regulatory sequences

 atgc intron ATGC exonic elements by Pertea et al.
 ATGC exon atgc putative intronic elements
 ATGC putative exonic elements identified for retained introns
        10        20        30        40        50        60        70        80        90        100       110       120       130       140       150       160       170       180       190       200       210       220 
---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------| 
attaattttatttattataaatttgggtcttttgattgctttcatgttttggttcgagaacagttgtcttagtggtttggtttgttgttgtgtaaaacagCACATTTAAGAAGAAAGCTCCTAAAGCTATTAAGGAGATAAGGAAATTTGCCCAAAAGGCCATGGGGACCAATGATGTGAGGGTGGATGTGAAGTTGAAC

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - GGTGGA