Skip to another splice site:
1  
 5'  3'   
2  
 5'  3'   
3  
 5'  3'   

Data associated with selected splice site

Sequence

 atgc   intronic sequence     ATGC   exonic sequence

gtgagacactgaacttgagaaatagtggtttctgaatcacacttgagaaataatgttttctgaaacttgtttcttctgtttggatttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGACTCTTCAGAGGAAGAGAGCTAGAATTGCTGACAAGAAGAAGAAAATTGC

Basic information

species Arabidopsis thaliana
transcript AT4G31700.2
intron # 3
splice site 3'
intron type U2

Orthologous splice sites


 atgc   intronic sequence     ATGC   exonic sequence


upper sequence: AT4G31700.1 (Arabidopsis thaliana), 3'ss of exon 5
lower sequence: LOC_Os03g27260.1 (Oryza sativa), 3'ss of exon 5
gtgagacac-tgaacttgagaaatag--tggtttctgaatcacacttgagaaataatgtttt-ctgaaacttgtttcttctgtttggatttttg--acagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGACTCTTCAGAGGAAGAGAGCTAGAATTGCTGACAAGAAGAAGAAAATTGC
| | | | || || |||| | | | || | | || | || ||| |||| | | | | ||| ||||| |||||||| | |||||||| || |||||||||||||||||||| || || ||||||||||||||||||| || ||||| || | |||||| | | ||||||
atataatctgtaattttttgacatagcctagcatatgcagtcaatttaatggttagtgtagagctgacctatttaccaaccatttatttttttcctgcagGCAAGAAGGTTAGCAAGGCTCCTAAGATCCAGAGGCTTGTCACTCCCCTGACTCTTCAGAGGAAGAGGGCGAGAATCGCCCAGAAGAAGCAAAGAATTGC

upper sequence: AT4G31700.1 (Arabidopsis thaliana), 3'ss of exon 5
lower sequence: LOC_Os07g42950.1 (Oryza sativa), 3'ss of exon 5
----gtgagacactgaacttgagaaatagtggtttctgaatca-----cacttgagaaataatgttttctgaaacttgtttcttctgtttggatttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGACTCTTCAGAGGAAGAGAGCTAGAATTGCTGACAAGAAGAAGAAAATTGC
| | | || || | | |||| | ||| | | | || || | | || | | |||| | | | | |||||||| | || ||||| || |||||||||||| | |||||||| || | || || ||||||||| | || ||||| |||||||||||||||| || ||
caccatattatattgt-ctgcatgtttgtcagttttcaagtcaaaatatatcttccaggtagctttctgcttacctacctaatggtgtt--gttgtgttgcagGCAAGAAGGTGAGCAAGGCTCCTAAGATCCAGCGTCTTGTGACTCCCCTCACCCTCCAGAGGAAGCGTGCCAGAATCGCTGACAAGAAGAAGAGGATCGC

upper sequence: AT4G31700.1 (Arabidopsis thaliana), 3'ss of exon 5
lower sequence: GRMZM5G851698_T02 (Zea mays), 3'ss of exon 5
------gtgagacactgaacttgagaaatagtggtttctgaatcacacttgagaaataatgttttctgaaacttgtttcttctgtttggatttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGACTCTTCAGAGGAAGAGAGCTAGAATTGCTGACAAGAAGAAGAAAATTGC
|| | | | | || || | |||| || | | | | | | | | ||||||| | || |||||||| | || ||||| || |||||||||||| ||||||||||||| ||||| || ||||||||| | ||||||||||||||||||||||||| || ||
ctctgtgtttgttatttgttggatgggattgttgacaatgaaccatagtgttctgaagttttgcaaccacagcaagctaacctgtttgcttccttttcagGCAAGAAGGTGAGCAAGGCACCTAAGATCCAGCGGCTTGTGACCCCCTTGACCCTCCAGAGGAAGCGTGCTAGAATTGCTGACAAGAAGAAGAGGATCGC

upper sequence: AT4G31700.1 (Arabidopsis thaliana), 3'ss of exon 5
lower sequence: GRMZM2G054136_T02 (Zea mays), 3'ss of exon 4
----------gtgagacactgaacttgagaaatagtggtttctgaatcacacttgagaaataatgttttctgaaacttgtttcttctgtttggatttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGACTCTTCAGAGGAAGAGAGCTAGAATTGCTGACAAGAAGAAGAAAATTGC
|| | || ||| | | | | | ||| |||| || | | | | | | |||| || | ||| |||||||| | || ||||| || ||||||||||| |||||||||| || ||||| || ||||||||| | |||||||| |||||||||||||||| || |
tactctgcttgttatacgctggat----gggtttcttgacaatgagccacagttttctgaagttttcgaaccacaacaagctaacctgtatgtttccttggcagGCAAGAAGGTCAGCAAGGCACCTAAGATCCAACGGCTTGTGACACCCTTGACCCTCCAGAGGAAGCGTGCTAGAATCGCTGACAAGAAGAAGAGGATAAC

upper sequence: AT4G31700.1 (Arabidopsis thaliana), 3'ss of exon 5
lower sequence: GRMZM2G054136_T09 (Zea mays), 3'ss of exon 5
----------gtgagacactgaacttgagaaatagtggtttctgaatcacacttgagaaataatgttttctgaaacttgtttcttctgtttggatttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGACTCTTCAGAGGAAGAGAGCTAGAATTGCTGACAAGAAGAAGAAAATTGC
|| | | ||| | | | | | ||| |||| || | | || | | | |||| || | ||| |||||||| | || ||||| || ||||||||||| |||||||||| || ||||| || ||||||||| | |||||||| |||||||||||||||| || |
tactctgcttgttatgcgctggat----gggtttcttgacaatgagccacagttttctgaagtttttgaaccacaacaagctaacctgtatgtttccttggcagGCAAGAAGGTCAGCAAGGCACCTAAGATCCAACGGCTTGTGACACCCTTGACCCTCCAGAGGAAGCGTGCTAGAATCGCTGACAAGAAGAAGAGGATAAC

upper sequence: AT4G31700.1 (Arabidopsis thaliana), 3'ss of exon 5
lower sequence: GRMZM2G048770_T01 (Zea mays), 3'ss of exon 1
-------gtgagacactgaacttgagaaatagtggtttctgaatcacacttgagaaataatgttttctgaaacttgtttcttctgtttggatttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGACTCTTCAGAGGAAGAGAGCTAGAATTGCTGACAAGAAGAAGAAAATTGC-
| || ||| |||| | || ||| | || ||| || | | |||||||| || | || || |||||||| | || ||||| || ||||||||||| |||||||||||| ||||| |||||||||||| |||||||| |||||||||||||||| || ||
tcttctattagtactctgtgcttgttatatgttggatag-gattcatgacaatgagccactaatttctgaagttttccaagctaacctattttctttgcagGCAAGAATGTGAGCAAGGCACCTAAGATCCAATAGCTTGTGACCCC-TTGACCCTTCAGAGGAAGCACGCTAGAATCGCTGACAAGAAGAAGAGGATCGCT

upper sequence: AT4G31700.1 (Arabidopsis thaliana), 3'ss of exon 5
lower sequence: GRMZM2G019734_T01 (Zea mays), 3'ss of exon 2
gtgagacactgaacttgagaaatagtggtttctgaatcacacttgagaaataatgttttctgaaacttgtttcttctgtttggatttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGACTCTTCAGAGGAAGAGAGCTAGAATTGCTGACAAGAAGAAGAAAATTGC-------------------------------------------------
|| ||||| || |||||||| ||| ||||| ||||||| | ||| |||||||||||| | |||||||| |||| ||||||||||| || ||
------------------------------------------------------------------------------------------------------gtgagcaaggcacctaagattcagtggcttatgaccccgt-gacccttcagAGGAAGCGCGCTAGAATCACTGATAAGAAGAAGAAGATCGCTAAGGAGTCTGAGGCTGCCGAGTACCAAAATCTTCTTGCCAGAGGCTAA

upper sequence: AT4G31700.1 (Arabidopsis thaliana), 3'ss of exon 5
lower sequence: Vv03s0017g01350.t01 (Vitis vinifera), 3'ss of exon 5
-gtgagacac---tgaacttgagaaatagtggtttctgaatcacacttgagaaataa-tgttttctgaaacttgtttc--ttctgtttggatttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGACTCTTCAGAGGAAGAGAGCTAGAATTGCTGACAAGAAGAAGAAAATTGC
||| | | ||| | | || || | |||| | || ||| ||| | | | ||| ||| | || | | | |||| ||| || || |||||||||||||| |||||||| ||||||||||||||||| ||||||||| | || || ||||| || |||||||||| ||||||
actgaaattcatgtgatgcttaaaagtatta-tttcagtattttgctttttaaaagactaaagacagaagattgatatggtttcatcactgtgatcctcagGGAAGAAATGCAGTAAAGCCCCTAAGATTCAGAGGCTGGTGACCCCATTGACTCTGCAGAGGAAGCGTGCAAGGATTGCAGAGAAGAAGAAGAGAATTGC

upper sequence: AT4G31700.1 (Arabidopsis thaliana), 3'ss of exon 5
lower sequence: PP1S3195_1V6.1 (Physcomitrella patens), 3'ss of exon 2
-----------------------------gtgagacactgaacttgagaaatagtggtttctgaatcacacttgagaaataatgttttctgaaacttgtttcttctgtttggatttttga---cagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGACTCTTCAGAGGAAGAGAGCTAGAATTGCTGACAAGAAGAAGAAAATTGC
|| | ||| | || | || | | | | | |||| | | | | || || || | || |||||||| | ||||| || || |||||||||||||||||||| || |||||| | ||||||||| | |||| | || ||||||| | | ||
gtacagtcatggctcctgatgtgggtgtcgtattttgccatgcttcctttcgtgcagtcgccgatgtagagc-gggggccattgttgcgaggagccctctactgacgtctgcgtgcgtgctttcagGCAAGAAGAGGAGCAAGGCACCGAAGATCCAGAGGCTTGTGACTCCTTTGACTTTGCAGAGGAAGCGCCGTAGAGTGGCAATCAAGAAGGCGCGTGTCGC

Mapped EST sequences

Showing partial alignments of ESTs and genomic sequences. See full alignments


 ATGC     EST sequence
 ATGC     genomic sequence (exon)
 ATGC     genomic sequence (truncated intron)


EST: gi|164157029|gb|ES093007.1|ES093007
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAAGACAAAGCCCCTAAGATCCAGAGGCTTGTGACC
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTA-G-CAAAGCCCCTAAGATCCAGAGGCTTGTGACC
EST: gi|86037904|gb|DR333659.1|DR333659
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|125249014|gb|EL263099.1|EL263099
EST:     CAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: CAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|86079481|gb|DR375238.1|DR375238
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|86037944|gb|DR333699.1|DR333699
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|116429653|gb|EG472245.1|EG472245
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|124837314|gb|EH921753.1|EH921753
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGT
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGT
EST: gi|164065388|gb|ES171541.1|ES171541
EST:     GTCAGGACCTATGTCAACACTTACCGCCGCAAGTTACACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCT
genomic: GTCAGGACCTATGTCAACACTTACCGCCGCAAGTT-CACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCT
EST: gi|164197240|gb|ES094316.1|ES094316
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACC
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACC
EST: gi|125324581|gb|EL338661.1|EL338661
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGA
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGA
EST: gi|124971870|gb|EL045601.1|EL045601
EST:     CAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: CAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|164197664|gb|ES143446.1|ES143446
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCC
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCC
EST: gi|164122944|gb|ES166553.1|ES166553
EST:     TCAGGACCTATGTCAACACTTACCGCCGCAAGTTCAC-AACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTG-
genomic: TCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|125286878|gb|EL300963.1|EL300963
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|124734018|gb|EH825420.1|EH825420
EST:     CACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: CACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|164219056|gb|ES012692.1|ES012692
EST:     TCACAAACAAGAAGG                         GCAAGGAGGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: TCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|86040573|gb|DR336328.1|DR336328
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAG
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAG
EST: gi|124787990|gb|EH877669.1|EH877669
EST:     TACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: TACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|116379638|gb|EG422230.1|EG422230
EST:     GTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: GTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|164128067|gb|EL987965.1|EL987965
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCC
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCC
EST: gi|47828814|gb|CK118498.1|CK118498
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|124711289|gb|EH802691.1|EH802691
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCA
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCA
EST: gi|164212848|gb|ES059687.1|ES059687
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACC
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACC
EST: gi|124723349|gb|EH814751.1|EH814751
EST:     CAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: CAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|124794777|gb|EH884456.1|EH884456
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGA
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGA
EST: gi|86037895|gb|DR333650.1|DR333650
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|86037892|gb|DR333647.1|DR333647
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|164159043|gb|ES024896.1|ES024896
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGA
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGA
EST: gi|164144331|gb|ES102158.1|ES102158
EST:     AAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTCGTGACCCCATTGA
genomic: AAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|86037898|gb|DR333653.1|DR333653
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|86079571|gb|DR375328.1|DR375328
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|125030693|gb|EL102962.1|EL102962
EST:     CCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: CCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|47829514|gb|CK119198.1|CK119198
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|164131161|gb|ES092125.1|ES092125
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAG
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAG
EST: gi|86081299|gb|DR377056.1|DR377056
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|164219541|gb|EL968390.1|EL968390
EST:     AAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: AAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|125296713|gb|EL310798.1|EL310798
EST:     CGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: CGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|125052829|gb|EL124338.1|EL124338
EST:     AGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
genomic: AGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGA
EST: gi|164011767|gb|ES147528.1|ES147528
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGC
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGC
EST: gi|124841595|gb|EH926032.1|EH926032
EST:     TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGG                         GCAAGGAAGTTAGCAAAGCCCCTAAGAT
genomic: TGTCAGGACCTATGTCAACACTTACCGCCGCAAGTTCACAAACAAGAAGGgtgagacact ... tttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGAT


 atgc   intronic sequence     ATGC   exonic sequence

Intronic sequence truncated to 55 bases.

cacttgagaaataatgttttctgaaacttgtttcttctgtttggatttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGACTCTTCAGAGGAAGAGAGCTAGAATTGCTGACAAGAAGAAGAAAATTGC
                                             ttttt  CT-rich tract
 agaaataatgtttt  TA-rich tract
















Putative cis-regulatory sequences

 atgc intron ATGC exonic elements by Pertea et al.
 ATGC exon atgc putative intronic elements
 ATGC putative exonic elements identified for retained introns
        10        20        30        40        50        60        70        80        90        100       110       120       130       140       150       160       170       180       190       200       210       220 
---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------| 
gtgagacactgaacttgagaaatagtggtttctgaatcacacttgagaaataatgttttctgaaacttgtttcttctgtttggatttttgacagGCAAGGAAGTTAGCAAAGCCCCTAAGATCCAGAGGCTTGTGACCCCATTGACTCTTCAGAGGAAGAGAGCTAGAATTGCTGACAAGAAGAAGAAAATTGC

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AGAGGA
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - GGAAGA
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AAGAGA
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AGAGCT
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AAGAAG
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AAGAAG
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AAGAAA