Skip to another splice site:
1  
 5'  3'   
2  
 5'  3'   
3  
 5'  3'   
4  
 5'  3'   
5  
 5'  3'   
6  
 5'  3'   
7  
 5'  3'   
8  
 5'  3'   

Data associated with selected splice site

Sequence

 atgc   intronic sequence     ATGC   exonic sequence

gtaagaaagaagaacaatgtgtctgtgatggatcgtttacggtttaaactgttcttagtgtttgtgatgttcgggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGGAAATGCTTATGCCAAAAGACCCCAATGCCACCATCATCATG

Basic information

species Arabidopsis thaliana
transcript AT5G66190.2
intron # 4
splice site 3'
intron type U2

Orthologous splice sites


 atgc   intronic sequence     ATGC   exonic sequence


upper sequence: AT5G66190.2 (Arabidopsis thaliana), 3'ss of exon 4
lower sequence: LOC_Os02g01340.2 (Oryza sativa), 3'ss of exon 2
gtaagaaagaagaacaatgtgtctgtgatggatcgtttacggtttaaactgttctta-gtgtttgtgatgttcgggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGGAAATGCTTATGCCAAAAGACCCCAATGCCACCATCATCATG--------
| | | | ||||| || | | ||| | | ||| ||| ||| | | | | | || ||||||| ||||||| ||| || | |||||||| || || || || ||||| ||||| ||||| || |||||||| |||||||||||||||
---------gtaagctaggagtctg--atcatgtaatagcagtt-agattgtacttgtgtgctaattaatggagtataatttagGTGACCTGAAGCCTGGTTCGGACGTGAAGATCACGGGGCCAGTGGGGAAGGAGATGCTGATGCCCAAGGACCCCAACGCCACCATCATCATGCTGGGCAC

upper sequence: AT5G66190.2 (Arabidopsis thaliana), 3'ss of exon 4
lower sequence: LOC_Os06g01850.1 (Oryza sativa), 3'ss of exon 3
-------------gtaagaaagaagaa--caatgtgtctgtgatggatcgtttacggttta---aactgttcttagtgtttgtgatgttcgggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGGAAATGCTTATGCCAAAAGACCCCAATGCCACCATCATCATG
||| | | |||| || | || | | | ||| ||| | | | | | | | ||||||||||||||| ||| ||| | ||||| || ||||| || ||||| |||||||| ||||| ||||| |||||||| | || || |||
atgtaacctgcatataatgattctccattcaataagtttatgtctca-catctactttttgcggggtcatgtctcacatctacttcaccttgctaatatagGTGACTTGAAGCCTGGTTCTGATGTCAAGATAACCGGACCAGTAGGCAAAGAAATGCTCATGCCCAAAGATCCCAATGCTAATATTATAATG

upper sequence: AT5G66190.2 (Arabidopsis thaliana), 3'ss of exon 4
lower sequence: GRMZM2G059083_T02 (Zea mays), 3'ss of exon 3
gtaagaaagaagaacaatgtgtctgtgatggatcgttt-------acggtttaaactgttct-tagtgtttgtg-atgttcgggttttt-cagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGGAAATGCTTATGCCAAAAGACCCCAATGCCACCATCATCATG
|| || | || | ||| || || | | ||| | || | || | | || | | | | |||||||||||||||| || | ||| | |||||||| || || || |||||||| ||||| ||||| ||||||||||| || || |||||||||
gtcagttgtctagctttaatttccgccatgtctcagttcttccatgcagactaatatattatatactaagcatatatatacatactaatgcagGTGACTTGAAGCCAGGCGCTGAGGTGAAGATCACAGGGCCAGTGGGCAAGGAGATGCTCATGCCCAAAGACCCCAACGCAACAATCATCATG

upper sequence: AT5G66190.2 (Arabidopsis thaliana), 3'ss of exon 4
lower sequence: GRMZM2G059191_T01 (Zea mays), 3'ss of exon 3
-----------------------gtaagaaagaagaacaatgtgtctgtgatggatcgtttacggtttaaactgttcttagtgtttgtgatgttcgggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGGAAATGCTTATGCCAAAAGACCCCAATGCCACCATCATCATG
|| | |||| | | | | | || | | |||| ||| | | | | | | | | ||||||||||||| || || | || | |||||||| || || || |||||||| ||||| ||||| |||||||||||||| || ||||||||
ttttagctaactaactattagttctagtgcattcaaacaccccctaagcgtagaagcggtcaaa--ctaaaagaatctc----tctcttgtatcctgtgcatgcagGTGACTTGAAACCTGGCGCCGATGTGAAGATCACAGGGCCAGTGGGCAAGGAGATGCTCATGCCCAAAGACCCCAATGCAACAGTCATCATG

upper sequence: AT5G66190.2 (Arabidopsis thaliana), 3'ss of exon 4
lower sequence: GLYMA16G23710.1 (Glycine max), 3'ss of exon 4
---------------------------------------gtaagaaagaagaacaatgtgtctgtgatggatcgttt------------acggtttaaactgttct--tagtgtttgtg-atgttcgggttt-ttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGGAAATGCTTATGCCAAAAGACCCCAATGCCACCATCATCATG
| |||||| | || | || | ||| | | ||| | || |||| || ||| | || ||| ||| |||||||||| ||||||| || | ||||| | || |||||||||||||| || |||||||||||||||||||| || ||||||||||||||||||
gtaaggatgagtagttttagattttacataatgtaggctggaagaaatttagttatatcgtttttggttgatttactttgttatactaaatggtacatacccttcttttaccatttatttatattcctctttattcagGTGACCTGAAGCCAGGAGCTGAAGTAACAATAACTGGACCTGTTGGGAAAGAAATGCTTATGCCAAAAGATCCTAATGCCACCATCATCATG

upper sequence: AT5G66190.2 (Arabidopsis thaliana), 3'ss of exon 4
lower sequence: GLYMA02G05350.1 (Glycine max), 3'ss of exon 4
--------------------------------gtaagaaagaagaac----aatgtgtctgtgatggatcgtttac--------------ggtttaaactgttc--ttagtgtttgtg-atgttcgggttttt-cagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGGAAATGCTTATGCCAAAAGACCCCAATGCCACCATCATCATG
||| | | ||||| | | ||| | ||| | ||||| ||| | || ||| ||| || | || ||| ||| | |||||||| ||||||| || | ||||| | || |||||||||||||| || |||||||||||||||||||| || ||||||||||||||||||
gtaaggatgagtagttttagattttatataatgtaggctacaagaaatttagctatatcttttttggttattttactttgttatactaatggtatctacccttcatttaccaattatttatattcctctttatgcagGTGACCTGAAGCCAGGAGCTGAAGTAACAATAACTGGACCTGTTGGGAAAGAAATGCTTATGCCAAAAGATCCAAATGCCACCATCATCATG

upper sequence: AT5G66190.2 (Arabidopsis thaliana), 3'ss of exon 4
lower sequence: GLYMA11G08230.1 (Glycine max), 3'ss of exon 4
----------------gtaagaaagaagaacaatgtg-tctgtgatggatcgtttacggtttaaactgttcttagtgtttgtgatgttcgggttt-ttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGGAAATGCTTATGCCAAAAGACCCCAATGCCACCATCATCATG
| ||| | | ||| | |||| | || || |||| | | || | | | | || | |||| |||||| ||||||||||| || | |||| ||||| |||||||||||||| || |||||||||||||||||||| || ||||| ||| ||||||||
tagttatatttctcatctgagataattaatcaagaaaatgtgtggtttttccttggatatttacatt-ttatgacaatcacttctggccatgtttattcagGCGACTTGAAGCCAGGAGCCGAAGTAAAGATTACTGGACCTGTTGGTAAAGAAATGCTTATGCCAAAAGATCCTAATGCAACCGTCATCATG

upper sequence: AT5G66190.2 (Arabidopsis thaliana), 3'ss of exon 4
lower sequence: Vv04s0023g03510.t01 (Vitis vinifera), 3'ss of exon 4
--------------------------------------------------------------gtaagaaagaagaacaatgtgtctgtgatggatcgtttacggtttaaactgttcttagtgt--ttgtgatgttcgggttt-------ttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGGAAATGCTTATGCCAAAAGACCCCAATGCCACCATCATCATG
|| | | | | |||| | || |||| | | | || | ||| ||| || ||| | | | |||||||||||||||||| || | |||| ||||| || || ||||| || || ||||||||||||||||| || || |||||||| ||||| |||
gtaagtctgctcaaaattttcttaattgcataggaatccagcccaccttctgatttcaaacagttgtatgggaaatcaatctctcggtgaagacattagtctaagagatgcttctgttaatgtacttttgagctctaaatctcttccatttcagGTGACTTGAAGCCTGGGGCCGAAGTGAAGATTACAGGGCCTGTGGGGAAAGAAATGCTTATGCCAAAGGATCCAAATGCCACAATCATAATG

upper sequence: AT5G66190.2 (Arabidopsis thaliana), 3'ss of exon 4
lower sequence: Vv18s0001g14450.t01 (Vitis vinifera), 3'ss of exon 4
----------gtaagaaagaagaacaatgtgtctgtgatggatc---gtttacg----gtttaaactgt--tcttagtgtttgtgatgttcgggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGGAAATGCTTATGCCAAAAGACCCCAATGCCACCATCATCATG
| || | | || | || ||||| | ||||| | | || | | | || ||| | |||| | |||| |||||||||||||||| || | ||| | || ||||| ||||||||||| || |||||||||||||||||||| || |||||||| |||| |||
tttgtatagaatcagctctatcagcatcggtgctacgatggggccgagtttaggaaatgctttgattatcgtcctagcaactctgatccttttgttt--cagGTGACTTGAAGCCTGGGGCTGATGTCAAAATCACAGGACCTGTTGGGAAAGAAATGCTTATGCCAAAAGATCCAAATGCCACTGTCATAATG

Mapped EST sequences

Showing partial alignments of ESTs and genomic sequences. See full alignments


 ATGC     EST sequence
 ATGC     genomic sequence (exon)
 ATGC     genomic sequence (truncated intron)


EST: gi|86051155|gb|DR346910.1|DR346910
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|125031213|gb|EL103482.1|EL103482
EST:     ACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCCTTGT                         GTGACTTGAAG
genomic: ACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTC-TTGTgtaagaaaga ... ggtttttcagGTGACTTGAAG
EST: gi|124728539|gb|EH819941.1|EH819941
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGA
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGA
EST: gi|125161048|gb|EL188346.1|EL188346
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTG
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTG
EST: gi|125172853|gb|EL194801.1|EL194801
EST:     TGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: TGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|125275928|gb|EL290013.1|EL290013
EST:     TCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: TCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|124931597|gb|EL008940.1|EL008940
EST:     CAAATGATGGCGGAGTTAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGA
genomic: CAAATGATGGCGGAG--AGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGA
EST: gi|125285483|gb|EL299568.1|EL299568
EST:     CACAAATGATGGCGGAGAGATTGTTAA-GGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|124987127|gb|EL059602.1|EL059602
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAA
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAA
EST: gi|125058857|gb|EL127534.1|EL127534
EST:     ACAAATGATGGCGGAGAGATTGTTAAGGGGGTTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: ACAAATGATGGCGGAGAGATTGTTAAGGGGG-TCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|124957403|gb|EL032243.1|EL032243
EST:     ACAAATTGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCA
genomic: ACAAA-TGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCA
EST: gi|125250665|gb|EL264750.1|EL264750
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGC
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGC
EST: gi|125089340|gb|EL147746.1|EL147746
EST:     CACAAATGATGGCGGGGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGG
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGG
EST: gi|125120166|gb|EL168020.1|EL168020
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGA
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGA
EST: gi|124846026|gb|EH930463.1|EH930463
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACT
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACT
EST: gi|124992789|gb|EL065058.1|EL065058
EST:     TTGTTAAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: TTGTT-AAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|124725760|gb|EH817162.1|EH817162
EST:     ATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: ATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|124884730|gb|EH963632.1|EH963632
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAA
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAA
EST: gi|124886008|gb|EH964910.1|EH964910
EST:     ATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACT
genomic: ATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACT
EST: gi|124886443|gb|EH965345.1|EH965345
EST:     AAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: AAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|125083431|gb|EL143993.1|EL143993
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCA
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCA
EST: gi|125020891|gb|EL093160.1|EL093160
EST:     AGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: AGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|125236694|gb|EL250779.1|EL250779
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGA
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGA
EST: gi|124999297|gb|EL071566.1|EL071566
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGA
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGA
EST: gi|124879982|gb|EH958884.1|EH958884
EST:     CGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: CGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|124917012|gb|EH990886.1|EH990886
EST:     CAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: CAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|125033229|gb|EL105498.1|EL105498
EST:     ACAAATGATGGCGGAGAGATTGTTAAGGGGGTCGTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGA
genomic: ACAAATGATGGCGGAGAGATTGTTAAGGGGGTC-TGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGA
EST: gi|124814684|gb|EH899550.1|EH899550
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGC
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGC
EST: gi|125090052|gb|EL148191.1|EL148191
EST:     AGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: AGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|125306299|gb|EL320384.1|EL320384
EST:     GTTAAGGGGGTCGTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: GTTAAGGGGGTC-TGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|124736271|gb|EH827673.1|EH827673
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGA
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGA
EST: gi|124978506|gb|EL051673.1|EL051673
EST:     CGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: CGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|125244128|gb|EL258213.1|EL258213
EST:     CGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGAACCTGTTGGCAAG
genomic: CGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGA-CCTGTTGGCAAG
EST: gi|124880564|gb|EH959466.1|EH959466
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGAT
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGAT
EST: gi|125056368|gb|EL126125.1|EL126125
EST:     CCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: CCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|125248174|gb|EL262259.1|EL262259
EST:     CTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: CTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|116446281|gb|EG488873.1|EG488873
EST:     CAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: CAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|124826855|gb|EH911721.1|EH911721
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGAATGAAGCTAAGATCACTGGACCTGTTGG
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGA-TGAAGCTAAGATCACTGGACCTGTTGG
EST: gi|125198369|gb|EL213323.1|EL213323
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGC
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGC
EST: gi|125230915|gb|EL245000.1|EL245000
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGG
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGG
EST: gi|124996339|gb|EL068608.1|EL068608
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGA
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGA
EST: gi|125310182|gb|EL324267.1|EL324267
EST:     AAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: AAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|124989474|gb|EL061789.1|EL061789
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACT-GAAGACCGGGTGATGAAGCTAAGATCACTGGAC
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAG-CCGGGTGATGAAGCTAAGATCACTGGAC
EST: gi|124890309|gb|EH969211.1|EH969211
EST:     CAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: CAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|125024495|gb|EL096764.1|EL096764
EST:     GGTCTGCTCCAACTTCTTGT                         GTGACTTGTAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAG
genomic: GGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTG-AAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAG
EST: gi|124888944|gb|EH967846.1|EH967846
EST:     AAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: AAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|86051162|gb|DR346917.1|DR346917
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|124961966|gb|EL036423.1|EL036423
EST:     TCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: TCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|124997810|gb|EL070079.1|EL070079
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATC
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATC
EST: gi|125251847|gb|EL265932.1|EL265932
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCA
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCA
EST: gi|124851269|gb|EH935706.1|EH935706
EST:     TGGCGGAGAGATTNTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: TGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|125075060|gb|EL138770.1|EL138770
EST:     GAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: GAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|124869154|gb|EH953591.1|EH953591
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGA
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGA
EST: gi|125251435|gb|EL265520.1|EL265520
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAA
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAA
EST: gi|125053840|gb|EL124795.1|EL124795
EST:     AATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: AATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|124752950|gb|EH843080.1|EH843080
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGA
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGA
EST: gi|86051160|gb|DR346915.1|DR346915
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|125029470|gb|EL101739.1|EL101739
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTG
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTG
EST: gi|125224954|gb|EL239039.1|EL239039
EST:     TGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: TGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|86051158|gb|DR346913.1|DR346913
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|125004053|gb|EL076322.1|EL076322
EST:     ACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: ACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|124903565|gb|EH977645.1|EH977645
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCAC
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCAC
EST: gi|124913513|gb|EH987387.1|EH987387
EST:     TTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: TTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|86078472|gb|DR374229.1|DR374229
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|125242422|gb|EL256507.1|EL256507
EST:     CTCCAACTTCTTGT                         GTGACTTG-AGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: CTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|125248361|gb|EL262446.1|EL262446
EST:     CAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: CAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|124837397|gb|EH921836.1|EH921836
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGA
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGA
EST: gi|124834985|gb|EH919424.1|EH919424
EST:     ACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTACTTGT                         GTGACTTGAAGCCGGGTGATG
genomic: ACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTT-CTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATG
EST: gi|86051163|gb|DR346918.1|DR346918
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGANNTGAAAGC-GGGTGATGAAGCTAAGATCACTGGACCTGGTGGCAAG
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAA-GCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAG
EST: gi|125060167|gb|EL128844.1|EL128844
EST:     ATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: ATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|47829213|gb|CK118897.1|CK118897
EST:     CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: CACAAATGATGGCGGAGAGATTGTTAAGGGGGTCTGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
EST: gi|125284785|gb|EL298870.1|EL298870
EST:     TGCTCCAACTTCTTGT                         GTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG
genomic: TGCTCCAACTTCTTGTgtaagaaaga ... ggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGG


 atgc   intronic sequence     ATGC   exonic sequence

Intronic sequence truncated to 55 bases.

tggatcgtttacggtttaaactgttcttagtgtttgtgatgttcgggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGGAAATGCTTATGCCAAAAGACCCCAATGCCACCATCATCATG
                                               tttttc  CT-rich tract
 tttaaact  TA-rich tract
















Putative cis-regulatory sequences

 atgc intron ATGC exonic elements by Pertea et al.
 ATGC exon atgc putative intronic elements
 ATGC putative exonic elements identified for retained introns
        10        20        30        40        50        60        70        80        90        100       110       120       130       140       150       160       170       180       190       200       210       220 
---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------| 
gtaagaaagaagaacaatgtgtctgtgatggatcgtttacggtttaaactgttcttagtgtttgtgatgttcgggtttttcagGTGACTTGAAGCCGGGTGATGAAGCTAAGATCACTGGACCTGTTGGCAAGGAAATGCTTATGCCAAAAGACCCCAATGCCACCATCATCATG

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AAGGAA