
Data associated with selected splice site

Sequence

 atgc   intronic sequence     ATGC   exonic sequence

gtacacatagtttcttcaaaaatttcttttaccaaagtgagcaaaccatgcatgagttaaggaactcaggttattatgtagttacttacaagttgatcttgttgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCGGCTTAATTACTTATTGTGATTTTGGCAATATGAGAGCATTAAGGATAAA

Basic information

species Glycine max
transcript GLYMA03G34300.1
intron # 8
splice site 3'
intron type U2
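
The sequence and the basic information above can be cross-checked with a short script. The sketch below splits the record's sequence at the lowercase/uppercase boundary and tests whether the intron has GT-AG borders, which is the canonical pattern for U2-type introns (GT-AG borders are consistent with, but do not by themselves prove, the U2 assignment). It assumes the lowercase run is the complete intron 8; the function names are illustrative and not part of the database.

```python
# Sequence copied from this record: lowercase = intronic, uppercase = exonic.
SEQ = (
    "gtacacatagtttcttcaaaaatttcttttaccaaagtgagcaaaccatgcatgagttaa"
    "ggaactcaggttattatgtagttacttacaagttgatcttgttgaggtgcag"
    "GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGC"
    "GGCTTAATTACTTATTGTGATTTTGGCAATATGAGAGCATTAAGGATAAA"
)


def split_at_3ss(seq):
    """Split at the first uppercase base, i.e. the 3' splice site."""
    for i, base in enumerate(seq):
        if base.isupper():
            return seq[:i], seq[i:]
    return seq, ""


def has_gt_ag_borders(intron):
    """True if the intron starts with GT and ends with AG (case-insensitive)."""
    intron = intron.lower()
    return intron.startswith("gt") and intron.endswith("ag")


intron, exon = split_at_3ss(SEQ)
print("intron length:", len(intron))
print("exon length shown:", len(exon))
print("GT-AG borders:", has_gt_ag_borders(intron))
```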

Orthologous splice sites


 atgc   intronic sequence     ATGC   exonic sequence


upper sequence: GLYMA03G34300.1 (Glycine max), 3'ss of exon 8
lower sequence: AT2G21170.2 (Arabidopsis thaliana), 3'ss of exon 7
gtacacatagtttcttcaaaaatttcttttaccaaagtg-agcaaaccatgcatgagttaaggaactcaggttattatgtagttacttacaagttgatcttgttgaggtgc-agGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCGGCTTAATTACTTATTGTGATTTTGGCAATATGAGAGCATTAAGGATAAA
| | ||||| || ||| || ||| || | | || || ||| | || ||| | | |||||||||||||||| |||||||| || |||||||| || ||||||||||| |||| ||| | | | | | | || | || | | | | |
-------------------------gtaaaagtaaagtttagacctttatggatatgttgttga---tgaatagaagtaaagcagctcacac-tcacctttattgtgactctagGGTCCTGAGTTTGCAACCATTGTGAACTCAGTCACGTCGAAGAAAGTTGCTGCTTGATTGAGAACTATCAGTAACGGAAATCGCTAGTCTCCATGGAACA

upper sequence: GLYMA03G34300.1 (Glycine max), 3'ss of exon 8
lower sequence: Vv03s0038g01780.t01 (Vitis vinifera), 3'ss of exon 8
gtacacatagtttcttcaaaaatttcttttaccaaagtgagcaaaccatgcatgagttaaggaactcaggttattatgtagttacttacaagttgatcttgttgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCGGCTTAATTACTTATTGTGATTTTGGCAATATGAGAGCATTAAGGATAAA
|| | | | ||||| ||| ||| |||| | || || | | || | ||| | | |||||| || | || | ||||||||||||||| |||||| ||||||||||| || ||||||||||||||||| |||| || ||| | || |||| ||| | ||| || || | |
-----gctattaaatggcacaatttggtttccca-----agcagtttcgata-aagatattaatgtttctttttggtgttgatgattacaacttagtatt-cggtggtgcagGGTCCTGAATTTGCTGTGATTGTCAATTCTGTAACATCCAAGAAAGTTGCTGCTTGATCACTCACACTGGTTTTTGCACTGGAAGAAAATAAAAGGCCGA
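
The match line between two aligned sequences can be regenerated from the gapped sequences themselves. The sketch below is a minimal illustration, assuming both rows have equal length and use "-" for gaps; `match_line` and `identity` are hypothetical helper names. The example input is the first 20 exonic bases of the Glycine max and Arabidopsis thaliana rows shown above.

```python
def match_line(top, bottom):
    """Return a string with '|' where aligned bases agree, ' ' elsewhere."""
    if len(top) != len(bottom):
        raise ValueError("aligned rows must have the same length")
    marks = []
    for a, b in zip(top.upper(), bottom.upper()):
        # A gap never counts as a match, even against another gap.
        marks.append("|" if a == b and a != "-" else " ")
    return "".join(marks)


def identity(top, bottom):
    """Fraction of matching columns among columns without a gap."""
    aligned = sum(1 for a, b in zip(top, bottom) if a != "-" and b != "-")
    if aligned == 0:
        return 0.0
    return match_line(top, bottom).count("|") / aligned


# First 20 exonic bases downstream of the 3' splice site.
glycine = "GGTCCTGAGTTTGCTACCAT"
arabidopsis = "GGTCCTGAGTTTGCAACCAT"

print(glycine)
print(match_line(glycine, arabidopsis))
print(arabidopsis)
print("identity:", identity(glycine, arabidopsis))
```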

Mapped EST sequences

Showing partial alignments of ESTs and genomic sequences.


 ATGC     EST sequence
 ATGC     genomic sequence (exon)
 atgc     genomic sequence (truncated intron, elided part shown as " ... ")


EST: gi|208251450|gb|GE044161.1|GE044161
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAG
EST: gi|31457431|gb|CD399459.1|CD399459
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|31467995|gb|CD410023.1|CD410023
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|298193423|gb|HO033857.1|HO033857
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|208235829|gb|GE028685.1|GE028685
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|193701495|gb|FK634587.1|FK634587
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAA-GTTGC
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGC
EST: gi|193594781|gb|FK547200.1|FK547200
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCCTTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|7028349|gb|AW458132.1|AW458132
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|208048148|gb|GD847891.1|GD847891
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCA
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCA
EST: gi|208048963|gb|GD848874.1|GD848874
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTA
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTA
EST: gi|193388015|gb|FK355224.1|FK355224
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTNCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|16284155|gb|BI945935.1|BI945935
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|193380681|gb|FK347406.1|FK347406
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|31455871|gb|CD397899.1|CD397899
EST:     CCAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCCAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|208219211|gb|GE018349.1|GE018349
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|31308297|gb|CD393500.1|CD393500
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|208107055|gb|GD907460.1|GD907460
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAG
EST: gi|207690328|gb|GD672945.1|GD672945
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|16279304|gb|BI943365.1|BI943365
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTTCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|31458560|gb|CD400588.1|CD400588
EST:     GAGG-GCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: GAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|208279665|gb|GE080046.1|GE080046
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTT
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTT
EST: gi|208159624|gb|GD957976.1|GD957976
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|207759952|gb|GD730811.1|GD730811
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCAATTAGTACAATTCAGTCA
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCA-TT-GT-CAATTCAGTCA
EST: gi|31465283|gb|CD407311.1|CD407311
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|193586115|gb|FK542495.1|FK542495
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAA
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAA
EST: gi|208058575|gb|GD860692.1|GD860692
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTT
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTT
EST: gi|16284094|gb|BI945902.1|BI945902
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCANTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|193312291|gb|FK276075.1|FK276075
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|254339332|gb|GR848049.1|GR848049
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTCCCATTGTCAATTCAGTCACATCCAAGACAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|208266934|gb|GE060985.1|GE060985
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCNTTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|208056628|gb|GD850345.1|GD850345
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|208074994|gb|GD875491.1|GD875491
EST:     TGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: TGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|31467830|gb|CD409858.1|CD409858
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|193334279|gb|FK299231.1|FK299231
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|254339331|gb|GR848048.1|GR848048
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|193461696|gb|FK425453.1|FK425453
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGA
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGA
EST: gi|208242981|gb|GE037457.1|GE037457
EST:     CAAAGCACGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|208322209|gb|GE124728.1|GE124728
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|208145583|gb|GD940926.1|GD940926
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|193642544|gb|FK593298.1|FK593298
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCCTTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|10231702|gb|BE800590.1|BE800590
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|207813878|gb|GD785113.1|GD785113
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGT
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGT
EST: gi|208157521|gb|GD953701.1|GD953701
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|31469935|gb|CD411963.1|CD411963
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|193548983|gb|FK505390.1|FK505390
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|208038668|gb|GD835002.1|GD835002
EST:     TATTGATGGATTTCTCGTTGGAGGTGC-TCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: TATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|208199466|gb|GE001612.1|GE001612
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGT
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGT
EST: gi|31306225|gb|CD391428.1|CD391428
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|208187723|gb|GD986005.1|GD986005
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAG
EST: gi|207806969|gb|GD778844.1|GD778844
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|193391555|gb|FK354144.1|FK354144
EST:     CAA-GCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|20814422|gb|BQ298900.1|BQ298900
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|31468627|gb|CD410655.1|CD410655
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAACTTCAGTCACATCCAAGAAAGTTGC
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAA-TTCAGTCACATCCAAGAAAGTTGC
EST: gi|193346012|gb|FK310806.1|FK310806
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|208269128|gb|GE068639.1|GE068639
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|254346362|gb|GR856084.1|GR856084
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATTCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|208131127|gb|GD927234.1|GD927234
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|193632145|gb|FK581463.1|FK581463
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGACTAACCATTGTCAATTCAGTCACATCCAAGAAAGTTG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTG-CTA-CCATTGTCAATTCAGTCACATCCAAGAAAGTTG
EST: gi|7640322|gb|AW734626.1|AW734626
EST:     CAAAGCAAG-A-ATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
EST: gi|193510154|gb|FK472126.1|FK472126
EST:     CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAG                         GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
genomic: CAAAGCAAGAAGATATTGATGGATTTCTCGTTGGAGGTGCTTCATTAAAGgtacacatag ... tgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCG
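
Differences between an EST and the genomic sequence can be listed column by column. The sketch below is one possible way to do this, assuming the two lines are column-aligned as printed, with the EST left blank over the 25-character truncated intron. `est_differences` is a hypothetical helper, and skipping N (ambiguous base calls) is a design choice rather than something the database states. The example is a shortened excerpt of the FK547200 pair.

```python
# The truncated intron exactly as printed on the genomic lines (25 columns).
INTRON_PLACEHOLDER = "gtacacatag ... tgaggtgcag"


def est_differences(est, genomic):
    """List (column, EST base, genomic base) for exonic columns that differ.

    Columns are 1-based positions in the printed line. Intronic columns
    (lowercase bases, spaces, dots) and ambiguous EST bases (N) are skipped;
    gap characters ('-') on either row are reported as differences.
    """
    diffs = []
    for col, (e, g) in enumerate(zip(est, genomic), start=1):
        if g.islower() or g in " .":
            continue  # truncated intron, not covered by the spliced EST
        if e == "N":
            continue  # ambiguous base call
        if e != g:
            diffs.append((col, e, g))
    return diffs


# Shortened excerpt of the FK547200 alignment around the splice junction.
est = "GGAGGTGCTTCCTTAAAG" + " " * len(INTRON_PLACEHOLDER) + "GGTCCTGAGT"
genomic = "GGAGGTGCTTCATTAAAG" + INTRON_PLACEHOLDER + "GGTCCTGAGT"

print(est_differences(est, genomic))
```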


 atgc   intronic sequence     ATGC   exonic sequence

Intronic sequence truncated to 55 bases.

taaggaactcaggttattatgtagttacttacaagttgatcttgttgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCGGCTTAATTACTTATTGTGATTTTGGCAATATGAGAGCATTAAGGATAAA
                                       tcttgtt  CT-rich tract
             ttattatgtagtta  TA-rich tract
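
The position of each annotated tract relative to the 3' splice site can be recovered from the truncated intron with a substring search. The sketch below is illustrative only: `locate_tract` and `base_fraction` are hypothetical helpers, and the database's own criteria for calling a tract CT-rich or TA-rich are not stated in this record.

```python
# The 55 intronic bases shown above, ending at the 3' splice site.
INTRON_55 = "taaggaactcaggttattatgtagttacttacaagttgatcttgttgaggtgcag"


def locate_tract(intron, tract):
    """Return (start, end, bases_to_splice_site) with 1-based coordinates.

    bases_to_splice_site counts the intronic bases between the end of the
    tract and the first exonic base. Returns None if the tract is absent.
    """
    start = intron.find(tract)
    if start < 0:
        return None
    end = start + len(tract)
    return start + 1, end, len(intron) - end


def base_fraction(tract, bases):
    """Fraction of positions in the tract occupied by any of the given bases."""
    return sum(1 for b in tract if b in bases) / len(tract)


for tract, bases in (("tcttgtt", "ct"), ("ttattatgtagtta", "ta")):
    print(tract, locate_tract(INTRON_55, tract), base_fraction(tract, bases))
```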
Putative cis-regulatory sequences

 atgc   intron     ATGC   exonic elements by Pertea et al.
 ATGC   exon       atgc   putative intronic elements
 ATGC   putative exonic elements identified for retained introns
        10        20        30        40        50        60        70        80        90        100       110       120       130       140       150       160       170       180       190       200       210       220 
---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------| 
gtacacatagtttcttcaaaaatttcttttaccaaagtgagcaaaccatgcatgagttaaggaactcaggttattatgtagttacttacaagttgatcttgttgaggtgcagGGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGCGGCTTAATTACTTATTGTGATTTTGGCAATATGAGAGCATTAAGGATAAA

AGAGCA  exonic element, positions 196-201 on the ruler above
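
The ruler coordinates of an element such as AGAGCA can be recovered with a case-sensitive substring search over the displayed sequence. The sketch below assumes 1-based positions that match the ruler; `motif_positions` is a hypothetical helper and not the method the database uses to predict elements.

```python
# Sequence as displayed under the ruler: lowercase intron, uppercase exon.
SEQ = (
    "gtacacatagtttcttcaaaaatttcttttaccaaagtgagcaaaccatgcatgagttaa"
    "ggaactcaggttattatgtagttacttacaagttgatcttgttgaggtgcag"
    "GGTCCTGAGTTTGCTACCATTGTCAATTCAGTCACATCCAAGAAAGTTGC"
    "GGCTTAATTACTTATTGTGATTTTGGCAATATGAGAGCATTAAGGATAAA"
)


def motif_positions(seq, motif):
    """Return 1-based (start, end) pairs for every occurrence of motif.

    The search is case-sensitive, so an uppercase motif only matches exonic
    bases and a lowercase motif only matches intronic bases. Overlapping
    occurrences are reported.
    """
    hits = []
    start = seq.find(motif)
    while start != -1:
        hits.append((start + 1, start + len(motif)))
        start = seq.find(motif, start + 1)
    return hits


print(motif_positions(SEQ, "AGAGCA"))
```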