Skip to another splice site:
1  
 5'  3'   
2  
 5'  3'   
3  
 5'  3'   
4  
 5'  3'   
5  
 5'  3'   

Data associated with selected splice site

Sequence

 atgc   intronic sequence     ATGC   exonic sequence

gttcgtgtgcctctgttgtgttctagattcttctattttttcccctttggtattgaatattgatttcttttttcttggttatttgtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGTTCAAGTTTCCCGAGAACAGTGTTGAACTTTATGCTGAAAAGGTCAACAA

Basic information

species Glycine max
transcript GLYMA20G21190.1
intron # 2
splice site 3'
intron type U2

Orthologous splice sites


 atgc   intronic sequence     ATGC   exonic sequence


upper sequence: GLYMA20G21190.1 (Glycine max), 3'ss of exon 2
lower sequence: AT2G31610.1 (Arabidopsis thaliana), 3'ss of exon 2
gttcgtgtgcctctgttgtgttctagattcttctattttttcccctttggtattgaatattgatttcttttttcttggttatttgtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGTTCAAGTTTCCCGAGAACAGTGTTGAACTTTATGCTGAAAAGGTCAACAA
|| | |||| | | ||||||| | |||| |||| | || | | || ||| | || | || | |||| ||||||||||| |||||||| |||||| | || || | || |||||||| ||||||||||| | |||||||||| || |||||||| ||||| |||||
------gtatgtttgtttgatcat---ttcttct-ctctttcgattttg--aatgtgtttcgagatctgatatcgaaaatgttgggtttta--gGTGAGAAGGGGAGGAGAATTAGGGAATTGACATCTCTTGTCCAGAAGAGATTCAAGTTTCCAGTTGACAGTGTTGAGCTCTATGCTGAGAAGGTTAACAA

upper sequence: GLYMA20G21190.1 (Glycine max), 3'ss of exon 2
lower sequence: AT5G35530.1 (Arabidopsis thaliana), 3'ss of exon 2
----gttcgtgtgcctctgttg----tgttctagattcttctattttttcccctttggtattgaatattgatttcttttttcttggttatttgtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGTTCAAGTTTCCCGAGAACAGTGTTGAACTTTATGCTGAAAAGGTCAACAA
||| ||| | ||| || ||| | || ||||| || || ||| | | |||| | | | |||||| ||||||||||||||||||| || |||||| | || || | || || ||||| ||||| ||||| || |||||||||| ||||||||||| ||||| ||
gtatgttggtgcattttcgttaaccttgctctgattcgacctttttttgtttctatgatatcaatt--tgatgattctctgtttggtt-----------tagGTGAGAAGGGAAGGAGGATTAGGGAATTGACATCTCTTGTACAAAAGAGATTCAAATTTCCTCAGGACAGTGTTGAGCTTTATGCTGAGAAGGTTGCTAA

upper sequence: GLYMA20G21190.1 (Glycine max), 3'ss of exon 2
lower sequence: AT3G53870.1 (Arabidopsis thaliana), 3'ss of exon 2
gttcgtgtgcctctgttgtgttctagattcttctattttttcccctttggtattgaatattgatttcttttttcttggttatttgtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGTTCAAGTTTCCCGAGAACAGTGTTGAACTTTATGCTGAAAAGGTCAACAA
| |||| | ||| | || || | | |||| | | | || | | || |||||||| |||||||||||| |||||||| |||||| | || || | ||||||||||| ||||||||||| | |||||||||| |||||||| || ||||| |||||
---------------gtatgtttttgatcttcgagcttaatcatttcttctattatacggtagtgactgtaaaat---ttgattgtattt-cagGTGAGAAGGGGAGGAGAATTAGGGAATTGACTTCCCTTGTTCAGAAGAGATTCAAGTTTCCAGTTGACAGTGTTGAGCTTTATGCCGAGAAGGTTAACAA

upper sequence: GLYMA20G21190.1 (Glycine max), 3'ss of exon 2
lower sequence: PP1S127_74V6.1 (Physcomitrella patens), 3'ss of exon 2
--------------------------gttcg-tgtgcctctgtt-gtgttctagattcttctattttttcccctttggt----attgaatat-------tgatttcttttttcttggttatttgtattt--atagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGTTCAAGTTTCCCGAGAACAGTGTTGAACTTTATGCTGAAAAGGTCAACAA
| || | || |||| | || || |||| || ||||| | |||||| ||||| | | | || | || | || |||||||||||||| | || ||||||||||| ||||| ||||| ||||||||||| |||| || ||| | ||| || | || || || ||||| |||||
gtgagtctcttggctgcactgacattatgcgctatggatctggtagtctttgagataattagggttttttctctttggccggaggtgaatttgcgggggtagtcggattgaagtagggtgactgctccctggtagGTGAGAAGGGACGAAGGATCAGGGAACTGACCTCCGTGGTACAGAAGAGGTTTCAGTTCCCGGAGGGTACTGTGGAGTTGTACGCCGAGAAGGTGAACAA

Mapped EST sequences

Showing partial alignments of ESTs and genomic sequences. See full alignments


 ATGC     EST sequence
 ATGC     genomic sequence (exon)
 ATGC     genomic sequence (truncated intron)


EST: gi|298182456|gb|HO031191.1|HO031191
EST:     GATGCGCACCGAAATCATCGTCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|20498169|gb|BQ273099.1|BQ273099
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|58018699|gb|CX705441.1|CX705441
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|151394186|gb|EV264057.1|EV264057
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|21678027|gb|BQ630378.1|BQ630378
EST:     GATGCGCACCGAAATCATCATCATAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|24205643|gb|BU964896.1|BU964896
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|192316858|gb|FK006085.1|FK006085
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|10845025|gb|BF068214.1|BF068214
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|17023989|gb|BM095023.1|BM095023
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|254316297|gb|GR827834.1|GR827834
EST:     AGAACCCAAG-CGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGG-TCAGAAGAGGT
genomic: AGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|22541845|gb|BU091688.1|BU091688
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|298189761|gb|HO038164.1|HO038164
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|192325123|gb|FK019069.1|FK019069
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|48575088|gb|CO036228.1|CO036228
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATAAGGGAACTTACCTCTGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|37996532|gb|CF808121.1|CF808121
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|7041963|gb|AW471857.1|AW471857
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|192324902|gb|FK015383.1|FK015383
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|209718519|gb|BW674683.1|BW674683
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|213603394|gb|DB966569.1|DB966569
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|58024786|gb|CX711527.1|CX711527
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|13481301|gb|BG510644.1|BG510644
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|7925503|gb|AW831529.1|AW831529
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|26268284|gb|CA819347.1|CA819347
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|57576092|gb|CX549063.1|CX549063
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|9901298|gb|BE610266.1|BE610266
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|298174155|gb|HO017491.1|HO017491
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|18040358|gb|BM308652.1|BM308652
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|13240447|gb|BG359756.1|BG359756
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|192326793|gb|FK017360.1|FK017360
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|8404055|gb|BE059689.1|BE059689
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|13790896|gb|BG653487.1|BG653487
EST:     AGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: AGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|9986821|gb|BE660929.1|BE660929
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|209701714|gb|BW655895.1|BW655895
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|192316859|gb|FK006086.1|FK006086
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|19934412|gb|BQ079442.1|BQ079442
EST:     GATGCGCACCGAAATCATCATCAGAGTCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|18040338|gb|BM308632.1|BM308632
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|18040077|gb|BM308371.1|BM308371
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|192303477|gb|FG994483.1|FG994483
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGGACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|4292317|gb|AI437745.1|AI437745
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|6915481|gb|AW397011.1|AW397011
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|15663235|gb|BI700606.1|BI700606
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|21600670|gb|BQ611001.1|BQ611001
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|254316296|gb|GR827833.1|GR827833
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|192315548|gb|FK009700.1|FK009700
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|9820315|gb|BE555825.1|BE555825
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|192315549|gb|FK009701.1|FK009701
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|209721033|gb|BW666290.1|BW666290
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|298176664|gb|HO026785.1|HO026785
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
EST: gi|13562761|gb|BG550981.1|BG550981
EST:     GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCG                         GTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT
genomic: GATGCGCACCGAAATCATCATCAGAGCCACCAGAACCCAAGCCGTTCTCGgttcgtgtgc ... gtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGT


 atgc   intronic sequence     ATGC   exonic sequence

Intronic sequence truncated to 55 bases.

ttcccctttggtattgaatattgatttcttttttcttggttatttgtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGTTCAAGTTTCCCGAGAACAGTGTTGAACTTTATGCTGAAAAGGTCAACAA
                  tattgat  putative branch site (score: 3)
 tattgaatattgattt  TA-rich tract
















Putative cis-regulatory sequences

 atgc intron ATGC exonic elements by Pertea et al.
 ATGC exon atgc putative intronic elements
 ATGC putative exonic elements identified for retained introns
        10        20        30        40        50        60        70        80        90        100       110       120       130       140       150       160       170       180       190       200       210       220 
---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------| 
gttcgtgtgcctctgttgtgttctagattcttctattttttcccctttggtattgaatattgatttcttttttcttggttatttgtatttatagGTGAGAAGGGAAGGAGAATCAGGGAACTTACCTCGGTGGTTCAGAAGAGGTTCAAGTTTCCCGAGAACAGTGTTGAACTTTATGCTGAAAAGGTCAACAA

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - ATGCTG