Skip to another splice site:
1  
 5'  3'   
2  
 5'  3'   
3  
 5'  3'   
4  
 5'  3'   

Data associated with selected splice site

Sequence

 atgc   intronic sequence     ATGC   exonic sequence

...acttggttattcctcacataaacacttgactgtgctataagtaaatatccaatggttgcatgcattttaatttgattgtttgtgtgcatgtgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCTTACTCTAGGTCTATTGTTGGTGCTACTTTAGAAGTTATCCAGAAAAAGA

Basic information

species Glycine max
transcript GLYMA13G21520.1
intron # 3
splice site 3'
intron type U2

Orthologous splice sites


 atgc   intronic sequence     ATGC   exonic sequence


upper sequence: GLYMA13G21520.1 (Glycine max), 3'ss of exon 3
lower sequence: AT3G53020.1 (Arabidopsis thaliana), 3'ss of exon 3
-----------acttggttattcctcacataaacacttgactgtgctataagtaaatatccaatggttgcatgcattttaatttgattgtttgtgtgcatgtgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCTTACTCTAGGTCTATTGTTGGTGCTACTTTAGAAGTTATCCAGAAAAAGA
||| || ||| | | ||| ||| || | || | | || || | | | |||| ||| | | | | | |||||||| || ||||| |||||||||| || |||||||| ||||| || || ||||| ||||| |||||||||||||| || ||||| || ||||| ||||
gtaaaaaaaaaactcagtaatttgg-atctgttttgttgtttgtacttt--gttttttttcattgatc-ctgatttgttaac---attctaaaactctgtttcttttatagGATGCAGCACAAGAGGCTGTGAAGAGAAGGAGACGTGCCACCAAGAAGCCATACTCAAGGTCCATTGTTGGTGCTACCTTGGAAGTAATTCAGAAGAAGA

upper sequence: GLYMA13G21520.1 (Glycine max), 3'ss of exon 3
lower sequence: AT2G36620.1 (Arabidopsis thaliana), 3'ss of exon 3
acttggttattcctcacataaacacttgactgtgctataagtaaatatccaatggttgcatgcattttaatttgattgtttgtgtgcatgtgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCTTACTCTAGGTCTATTGTTGGTGCTACTTTAGAAGTTATCCAGAAAAAGA
| | ||| | || ||| | || | | | || | || || ||| | | |||| ||||| | | || |||| || ||||| |||||||||| || |||||||| || || || |||||||| ||||| ||||| ||||||||||| || ||||| ||||| |||
--gtaatgtttctttttttagttgcttcttgttttcatgatttagta--ctat-------tgaattctgttgtgatc-tttgttttctctcttcttacagGACGCAGCACAAGAGGCTGTGAAGAGAAGGAGACGTGCAACTAAGAAGCCTTACTCAAGGTCGATTGTCGGTGCTACTTTGGAGGTTATTCAGAAGAAGC

upper sequence: GLYMA13G21520.1 (Glycine max), 3'ss of exon 3
lower sequence: Vv08s0007g04210.t01 (Vitis vinifera), 3'ss of exon 3
acttggttattcctcacataaacacttgactgtgctata-agtaaatatc-caatggttgcatgcattttaatttgattgtttgtgtgcatgtgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCTTACTCTAGGTCTATTGTTGGTGCTACTTTAGAAGTTATCCAGAAAAAGA
| | | | | || || ||| | || || | |||| | ||| ||||| | | || | | ||| |||||||||||||||| ||||| |||||||| | ||||| |||||||| || ||||||||||| ||||| |||||||| || || ||||||||||| | |
-tgtatatgtgttttggtcactttctggaaaagaatatccactatggattatgtttgttgggagttatttgttttgagtcactaattgggt-taatgctcagGATATTGCTCAAGAGGCTGTAAAGAAGAGGCGTCGTGCCACCAAAAAGCCCTACTCTAGGTCCATTGTGGGTGCTACATTGGAGGTTATCCAGAAGAGAA

Mapped EST sequences

Showing partial alignments of ESTs and genomic sequences. See full alignments


 ATGC     EST sequence
 ATGC     genomic sequence (exon)
 ATGC     genomic sequence (truncated intron)


EST: gi|23057424|gb|BU578098.1|BU578098
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|209720831|gb|BW659525.1|BW659525
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|151411427|gb|EV281238.1|EV281238
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|13790867|gb|BG653458.1|BG653458
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|209715514|gb|BW682180.1|BW682180
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|33388019|gb|CA851226.1|CA851226
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGCGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|120529878|gb|EH258012.1|EH258012
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|19053213|gb|BM731880.1|BM731880
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|192329410|gb|FK023628.1|FK023628
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|254316158|gb|GR825967.1|GR825967
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|209723193|gb|BW675139.1|BW675139
EST:     CGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: CGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|13787312|gb|BG649904.1|BG649904
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|120531053|gb|EH259186.1|EH259186
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|9203781|gb|BE330005.1|BE330005
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATATGAAT                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|214004056|gb|DB984962.1|DB984962
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGCACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|213618473|gb|DB967710.1|DB967710
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|4397362|gb|AI496359.1|AI496359
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|298185626|gb|HO030839.1|HO030839
EST:     GGCCGTCAAAGCTCACGTGGACCGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGGAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|254341112|gb|GR854840.1|GR854840
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|17021663|gb|BM092697.1|BM092697
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|57575853|gb|CX548824.1|CX548824
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAACCCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|7926002|gb|AW832028.1|AW832028
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|6725370|gb|AW309769.1|AW309769
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|209722711|gb|BW657965.1|BW657965
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|213619695|gb|DB967317.1|DB967317
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|209705938|gb|BW661755.1|BW661755
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|151398218|gb|EV268096.1|EV268096
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAGAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|6566752|gb|AW234384.1|AW234384
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|7145806|gb|AW507728.1|AW507728
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|192306402|gb|FG997156.1|FG997156
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|192325754|gb|FK017976.1|FK017976
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGTTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|31456728|gb|CD398756.1|CD398756
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|31476383|gb|CD418411.1|CD418411
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|163926101|gb|EH038392.1|EH038392
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|57574256|gb|CX547231.1|CX547231
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|214005582|gb|DB984590.1|DB984590
EST:     CGAAAGCAGCATAAGAAG                         GATAGTGCTAAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAACATGG
genomic: CGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|57574098|gb|CX547073.1|CX547073
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|18040154|gb|BM308448.1|BM308448
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTATCATTAAAACC
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCA-AAAAACC
EST: gi|214005034|gb|DB985658.1|DB985658
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGCACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|213599777|gb|DB968427.1|DB968427
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|192324210|gb|FK017941.1|FK017941
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAA
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAA
EST: gi|33388095|gb|CA851302.1|CA851302
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|209715513|gb|BW682179.1|BW682179
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|17518736|gb|BM187778.1|BM187778
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|4396821|gb|AI495818.1|AI495818
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|5606228|gb|AI900326.1|AI900326
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|151397032|gb|EV266905.1|EV266905
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|12496182|gb|BG046936.1|BG046936
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|209702723|gb|BW652766.1|BW652766
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|58022509|gb|CX709250.1|CX709250
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|7041987|gb|AW471881.1|AW471881
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|58025291|gb|CX712032.1|CX712032
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|10848042|gb|BF070603.1|BF070603
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|31473123|gb|CD415151.1|CD415151
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|192307537|gb|FK000823.1|FK000823
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|4313982|gb|AI461101.1|AI461101
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|298164889|gb|HO010886.1|HO010886
EST:     GGCCGTCAGAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|19054246|gb|BM732913.1|BM732913
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|192298391|gb|FG989142.1|FG989142
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|209723192|gb|BW675138.1|BW675138
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|6482694|gb|AW201931.1|AW201931
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|16347827|gb|BI973422.1|BI973422
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTTTTTTAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|15664207|gb|BI701578.1|BI701578
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|213607190|gb|DB965104.1|DB965104
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|193529014|gb|FK486042.1|FK486042
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATATACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTAACCAAAAAACC
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTA-CCAAAAAACC
EST: gi|58022219|gb|CX708960.1|CX708960
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|209720832|gb|BW659526.1|BW659526
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|298165579|gb|HO011628.1|HO011628
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|20814420|gb|BQ298898.1|BQ298898
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|193699212|gb|FK637637.1|FK637637
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|209705937|gb|BW661754.1|BW661754
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
EST: gi|10846233|gb|BF069176.1|BF069176
EST:     AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAG                         GATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT
genomic: AGCCGTCAAAGCTCACGTGGACTGCAATGTACCGAAAGCAGCATAAGAAGgtgattattt ... tgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCT


 atgc   intronic sequence     ATGC   exonic sequence

Intronic sequence truncated to 55 bases.

tatccaatggttgcatgcattttaatttgattgtttgtgtgcatgtgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCTTACTCTAGGTCTATTGTTGGTGCTACTTTAGAAGTTATCCAGAAAAAGA
                  attttaatttgattgt  TA-rich tract
















Putative cis-regulatory sequences

 atgc intron ATGC exonic elements by Pertea et al.
 ATGC exon atgc putative intronic elements
 ATGC putative exonic elements identified for retained introns
        10        20        30        40        50        60        70        80        90        100       110       120       130       140       150       160       170       180       190       200       210       220 
---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------| 
acttggttattcctcacataaacacttgactgtgctataagtaaatatccaatggttgcatgcattttaatttgattgtttgtgtgcatgtgatgtatagGATATTGCTCAAGAAGCTGTGAAGAAGAGAAGACGTGCTACCAAAAAACCTTACTCTAGGTCTATTGTTGGTGCTACTTTAGAAGTTATCCAGAAAAAGA

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AGAAGT
- - - - - -cctcaca
- - - - - - - - - - - - - - actgtgc
- - - - - - - - - - - - - - - - - - - - - - - - - - - - -gcatgca
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - tgtgtgc