Data associated with selected splice site

Sequence

 atgc   intronic sequence     ATGC   exonic sequence

gtttgttcggagagtcctggaatggtagaagaaatttgtttttactcccaagaaaactaattgtttgtgtgactatttcagttctctaattatcctccttataagctatgagagatattgctcaacgctaacgaaattgattcgggcatcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA

Basic information

species Physcomitrella patens
transcript PP1S31_150V6.5
intron # 4
splice site 3'
intron type U2
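
The case convention of the sequence and the fields listed above can be cross-checked with a few lines of code. The following is a minimal sketch in Python, assuming the sequence is supplied as one string with the intron in lowercase followed by the exon in uppercase. The helper name `split_at_splice_site` is hypothetical and not part of this resource. The check relies on the general fact that U2-type introns canonically end in AG at the 3' splice site.

```python
def split_at_splice_site(seq: str) -> tuple[str, str]:
    """Split a 3' splice-site sequence into (intron, exon) by letter case."""
    for i, base in enumerate(seq):
        if base.isupper():          # first uppercase base starts the exon
            return seq[:i], seq[i:]
    return seq, ""                  # no exonic (uppercase) part found


# Last 15 intronic and first 20 exonic bases of the sequence shown above.
site = "gggcatcttctgcag" + "ATTTATGACATCTTCCAGCT"
intron, exon = split_at_splice_site(site)
has_canonical_ag = intron.endswith("ag")  # canonical U2-type 3' intron end
print(intron, exon, has_canonical_ag)
```

The same split works on the full 258-base record; only a short excerpt is used here to keep the example readable.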

Orthologous splice sites


 atgc   intronic sequence     ATGC   exonic sequence


upper sequence: PP1S31_150V6.5 (Physcomitrella patens), 3'ss of exon 4
lower sequence: LOC_Os02g05330.1 (Oryza sativa), 3'ss of exon 4
gtttgttcggagagtcctggaatggtagaagaaatttgtttttactcccaagaaaactaattgtttgtgtgactatttcagttctctaattatcctccttataagctatgagagatattgctcaacgctaacgaaattgattcgggcatcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA
||||| ||| || | ||||| | | ||| | ||| || ||| | | |||| ||| | | | | | | | | ||||| ||||| ||||||||||| ||||||| ||| | || || || ||||| ||||| || ||||| || || || |||||||||||| | ||||||||||
--------------------------------------------------gtgtgtctaatggttgatgag-ctattatggaagttgtatttcatttctt-tatgctgtag----tgttgcatggtctgaacttacata--tggtgttcccttttcagATCTATGATATCTTCCAGCTCCTTCCACCAAAGATCCAGGTCGGAGTGTTCTCTGCTACCATGCCTCCTGAGGCCCTTGAGATCACCCGCAAGTTCATGA

upper sequence: PP1S31_150V6.5 (Physcomitrella patens), 3'ss of exon 4
lower sequence: GRMZM2G027995_T01 (Zea mays), 3'ss of exon 4
gtttgttcggagagtcctggaatggtagaagaaatttgtttttactcccaagaaaactaattgtttgtgtgactatttcagttctctaattatcctccttataagctatgagagat--attgctcaacgctaacgaaattgattcgggcatcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA
||| | | | | || || |||| | || | | |||| | ||| | | || | | || ||| | |||||||||||||||||||||||||||| ||| ||| |||| || || || || ||||| || |||||||| || || |||||||| ||| | ||||||||||
------------------------------------------------------aaccatatatcca---gtctcttggagttgctacaacgcgctatctgt---ttatggtactctgattcatttatgcctgctgacttatttctttggtggaatgcagATTTATGACATCTTCCAGCTTCTGCCATCCAAGATTCAGGTTGGAGTCTTCTCTGCTACCATGCCACCTGAGGCCCTTGAGATTACCCGCAAGTTCATGA

upper sequence: PP1S31_150V6.5 (Physcomitrella patens), 3'ss of exon 4
lower sequence: GRMZM2G116034_T01 (Zea mays), 3'ss of exon 4
gtttgttcggagagtcctggaatggtagaagaaatttgtttttactcccaagaaaactaattgtttgtgtgactatttcagttctctaattatcct-ccttataagctatgagagatattgctcaacgctaacgaaattgattcgggcatcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA
|| | ||| | | |||| ||| ||| | | | || | || || | |||| | | || || | || || | | |||||||||| ||||||||||||||||| ||| ||| |||| || || || || ||||| || ||||| || || || |||||||| ||| | ||||||||||
----------------------------------------------------tctacacttcttttatctt-ctatcgcagcaatctgaaaagcaaaccatttatcttactcta---attgatttatgcctactgactttcttt---cgtgaaatgcagATTTACGACATCTTCCAGCTTCTCCCAGCCAAGATTCAGGTTGGAGTCTTCTCTGCTACCATGCCCCCTGAGGCCCTTGAGATTACCCGCAAGTTCATGA

upper sequence: PP1S31_150V6.5 (Physcomitrella patens), 3'ss of exon 4
lower sequence: GLYMA09G07530.3 (Glycine max), 3'ss of exon 4
gtttgttcggagagtcctggaatggtagaagaaatttgtttttactcccaagaaaactaattgtttgtgtgactatttcagttctctaattatcctccttataagctatgagagatattgctcaacgctaacgaaattgattcgggcatcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA
||| | | | | | | | || ||| | | | | || | || | || || | | | |||| | | ||| | || ||| ||| || ||||| ||||| || |||||| | |||||| ||| |||||||||| || || ||||| |||||||| ||||| || ||||||||||| |||||||||||||
--------------------------------------tttgaattatcctggagatt--ttatttatat---tggcctatttgtttagtgtccc-------aaac----aaatttattcataagtgctt-taaccttaattt---catttttatcagATCTATGATATATTCCAGTTGCTTCCATCTAAGATTCAAGTGGGAGTTTTCTCTGCTACAATGCCTCCCGAGGCACTTGAGATCACAAGGAAGTTCATGA

upper sequence: PP1S31_150V6.5 (Physcomitrella patens), 3'ss of exon 4
lower sequence: GLYMA17G06110.1 (Glycine max), 3'ss of exon 4
gtttgttcggagagtcctggaatggtagaagaaatttgtttttactcccaagaaaactaattgtttgtgtgactatttcagttctctaattatcctccttataagctatgagagatattgctcaacgctaacgaaattgattcgggcatcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA
| ||||| | | | | | | | | | | || | ||| ||| || | | | ||||| ||| || | || ||||| || || || || ||||| |||||| ||| |||||||||| || || |||||||||||||| || || || |||||||||||||||||||| ||||
---------------------------------------------------aagtttcaattgacttttttccaagcctattgggtttgtagtgct----aaaagaattgatttatgtataactgctctaacc---ttggttttattgttatcatcagATCTACGATATATTTCAGCTGCTTCCATCTAAGATTCAAGTGGGAGTTTTCTCTGCCACAATGCCTCCTGAGGCCCTTGAGATCACCAGGAAGTTTATGA

upper sequence: PP1S31_150V6.5 (Physcomitrella patens), 3'ss of exon 4
lower sequence: GLYMA15G18760.3 (Glycine max), 3'ss of exon 4
gtttgttcggagagtcctggaatggtagaagaaatttgtttttactcccaagaaaactaattgtttgtgtgactatttcagttctctaattatcctccttataagctatg-agagatattgctcaacgctaacgaaattgattcgggcatcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA
|||| | | || | | | || | ||| | | | | | | |||| | | ||| | || ||| ||| || ||||| ||||| || |||||| | |||||| ||| |||||||||| || || ||||| |||||||| || || || ||||||||||| |||||||||||||
-------------------------------------------------------ttgaattatcctggagattttatttattattggcctatttgtttagtttcccaaacaaatttattcataagtgctt-taaccttaattt---catttttatcagATCTATGATATATTCCAGTTGCTTCCATCTAAGATTCAAGTGGGAGTTTTCTCTGCTACAATGCCTCCTGAGGCACTTGAGATCACAAGGAAGTTCATGA

upper sequence: PP1S31_150V6.5 (Physcomitrella patens), 3'ss of exon 4
lower sequence: GLYMA07G00950.1 (Glycine max), 3'ss of exon 4
gtttgttcggagagtcctggaatggtagaagaaatttgtttttactcccaaga--aaactaattgtttgtgtgactatttcagttctctaattatcctccttataagctatgagagatattgctcaacgctaacgaaattgattcgggcatcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA
| | |||| ||| | | ||| | | || | | | | ||| ||| | | ||| || |||| | | | ||| || | | ||||||| ||||||||||| ||||| || ||| || |||| || || ||||| ||||| ||||||||||| ||||| |||||||| || || ||||||||||
-------------------------------------------agttataagaagaaatacaatatttatttatctggacaggatacaagaatttccccctctt---ccatgcttgacattg--------tgaaaattttggttggtgt------tgcagATCTATGACATCTTTCAGCTGCTGCCATCCAAAATTCAGGTTGGAGTGTTCTCTGCTACAATGCCACCAGAAGCCCTTGAGATTACAAGAAAGTTCATGA

upper sequence: PP1S31_150V6.5 (Physcomitrella patens), 3'ss of exon 4
lower sequence: GLYMA08G20300.3 (Glycine max), 3'ss of exon 4
gtttgttcggagagtcctggaatggtagaagaaatttgtttttactcccaagaaaactaattgtttgtgtgactatttcagttctctaattatcctccttataagctatgagagatattgctcaacgctaacgaaattgattcgggcatcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA
|| | || | || || ||||| || | || | | |||| |||| || | |||| ||| || | ||||||| ||||||||||||||||| || ||| || |||| || || ||||| ||||| |||||||| || ||||| ||||| || || |||||||||||||
----------------------------------------------ggacaggatacaagaatttcccaaaacaatttctgtgagaaatgta--------acagtttatgttcttccatgcttgacattg-tgaaa---attttggttggtgttgcagATCTATGACATCTTCCAGCTGCTGCCATCCAAAATTCAGGTTGGAGTGTTCTCTGCTACAATGCCGCCAGAAGCCCTTGAAATTACAAGGAAGTTCATGA

upper sequence: PP1S31_150V6.5 (Physcomitrella patens), 3'ss of exon 4
lower sequence: GLYMA13G16570.1 (Glycine max), 3'ss of exon 4
gtttgttcggagagtcctggaatggtagaagaaatttgtttttactcccaagaaaactaattgtttgtgtgactatttcagttctctaattatcctccttataagctatgagagatattgctcaacgctaacgaaattgattcgggcatcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA
| | || || |||| ||| || | ||||| | | | || | || ||| ||| || | || ||||| ||| | || | ||||| || || || || ||||| || ||| ||| |||||||||| || || ||||| |||||||| || || || ||||||||||| |||||||| ||||
-------------------------------------------agtgcctttgaattgaattttttctgagcctatt--gggtttgtagtg-------ctaaaagaattgatttatgtataaatactctaacc---ttggctt---tattgttatcagATCTACGATATATTTCAGCTGCTGCCATCTAAGATTCAAGTGGGAGTTTTCTCTGCTACAATGCCTCCTGAGGCCCTTGAGATCACTAGGAAGTTTATGA

upper sequence: PP1S31_150V6.5 (Physcomitrella patens), 3'ss of exon 4
lower sequence: AT3G13920.2 (Arabidopsis thaliana), 3'ss of exon 4
gtttgttcggagagtcctggaatggtagaagaaatttgtttttactcccaagaaaactaattgtttgtgtgactatttcagttctctaattatcctccttataagctatgagagatattgctcaacgctaacgaaattgattcgggcatcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA
|| || | | ||||| | |||| | | || | ||| | | ||| || | || || | | || | ||||| |||||||| |||||||||||||||| ||| | ||||| |||||||| || || ||||||||||| ||||| ||||||||||| |||||||||||||
--------------------------------------------------------------gtatgcaaatatttctcagtgtttcaatt-ccattctcgt-------------tatagtttaacttgaatctcactggtttctgggtttttt-cagATCTATGACATATTCCAGCTTCTTCCACCAAAGATCCAAGTTGGTGTGTTCTCCGCAACAATGCCACCAGAAGCTCTTGAGATCACAAGGAAGTTCATGA

upper sequence: PP1S31_150V6.5 (Physcomitrella patens), 3'ss of exon 4
lower sequence: AT1G72730.1 (Arabidopsis thaliana), 3'ss of exon 4
gtttgttcggagagtcctggaatggtagaagaaatttgtttttactcccaagaaaactaa--ttgtttgtgtgactatttcagttctctaattatcctccttataagctatgagagatattgctcaacgctaacgaaattgattcgggcatcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA
||| |||||| | ||| || ||| || || || |||| | || || ||| || || || |||||| ||| ||||| || |||||||||||||||||||| ||| |||| || ||||| || ||||| |||||||| |||||||| ||||||||||| |||||||||||||
------------------------------------------------------gtttaatcttgtttaatctcccattcacattttct--ttttctctctgataaaat-caagtcat-ttgttc----ttat--aatttgatt------tctctgtcagATATACGACATCTTCCAGCTTCTTCCTTCCAAGGTTCAGGTTGGTGTTTTCTCTGCTACAATGCCTCCCGAAGCCCTTGAGATCACAAGGAAGTTCATGA

upper sequence: PP1S31_150V6.5 (Physcomitrella patens), 3'ss of exon 4
lower sequence: AT1G54270.2 (Arabidopsis thaliana), 3'ss of exon 4
gtttgttcggagagtcctggaatggtagaagaaatttgtttttactcccaagaaaactaattgtttgtgtgactatttcagttctctaattatcctccttataagctatgagagatattgctcaacgctaacgaaattgattcgggcatcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA
|||||| | || |||| | ||| | | | | | | ||| ||||||| | | || ||| || || ||||||| |||||||| ||||||||||| |||| ||| |||| || || ||||| ||||| |||||||| || ||||| | |||||||| || || |||||||
--------------------------------------------------------------gtttgt-tatcttcttcaaagtttgaatc-taccctctgtg-gttatctctgatattgacttggttttgtgttgtt--ttctggtgaatt-tgcagATCTATGACATATTCCAGCTTCTCCCACCAAAGATTCAGGTTGGAGTGTTCTCTGCGACAATGCCTCCGGAAGCTTTGGAGATCACAAGAAAATTCATGA

upper sequence: PP1S31_150V6.5 (Physcomitrella patens), 3'ss of exon 4
lower sequence: Vv11s0016g01920.t01 (Vitis vinifera), 3'ss of exon 3
gtttgttcggagagtcctggaatggtagaagaaatttgtttttactcccaagaaaactaattgtttgtgtgactatttcagttctctaattatcctccttataagctatgagagatattgctc----aacgctaacgaaattgattcgggcatcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA
|| ||| | | ||| | | | | | | |||| | || || | | || ||| | | || ||| ||| | | ||||||| ||||| || ||||| || || ||| || ||||||| || ||||| |||||||||||||||||||| || |||||||| || |||||||| ||||
------------------------------------------------------------ttatttttttaactctgtgttgtattccaagaacctca--acaacctttcaactttaatgcgtttatatcactgttctaataaattagtttcttccttgcagATCTATGATATTTTCCAACTCCTGCCATCAAAAGTTCAAGTTGGAGTGTTCTCTGCCACAATGCCACCCGAGGCCCTTGAGATTACAAGGAAGTTTATGA

upper sequence: PP1S31_150V6.5 (Physcomitrella patens), 3'ss of exon 4
lower sequence: Vv04s0008g01000.t01 (Vitis vinifera), 3'ss of exon 3
gtttgttcggagagtcctggaatggtagaagaaatttgtttttactcccaagaaaactaattgtttgtgtgactatttcagttctctaattatcctccttataagctatgagagatattgctcaacgctaacgaaattgattcgggcatcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA
|| | | ||| | | | | ||| ||| | | || | || ||| ||| | | || |||||| || || |||||| ||||| || |||||| | || ||| ||| |||| || || ||||| |||||||| |||||||| || || ||||||||||| ||||| |||||||
-----------------------------------------------tttagctatgtggttgcaagcattttgttgtgggttttctgactttcatgtgcatt--ctaatgctgatttcttgtcttgatactgaaatt---tca------tttggcagATCTATGATATTTTCCAGTTGCTGCCATCAAAGATTCAGGTTGGGGTGTTCTCTGCCACGATGCCACCTGAGGCCCTTGAGATCACAAGGAAATTCATGA

upper sequence: PP1S31_150V6.5 (Physcomitrella patens), 3'ss of exon 4
lower sequence: EFJ16567 (Selaginella moellendorffii), 3'ss of exon 3
gtttgttcggagagtcctggaatggtagaagaaatttgtttttactcccaagaaaactaattgtttgtgtgactatttcagttctctaattatcctccttataagctatgagagatattgctcaacgctaacgaaattgattcgggc--atcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA
|||| | || | || || | |||| | || | | | | | ||||||||| |||||||||||||| ||||| ||| | || ||||| | || || ||||||||||| ||||| ||||| ||||||||| | ||||||||||
----------------------------------------------------------------------------------------------------gtaaggtggaaggaaagctggtctata-taaccaccctgcctgtgccttgtgtgtcgcagATTTACGACATCTTCCAGCTCCTTCCCTCCAAGGTCCAGGTGGGCCTCTTCTCCGCCACAATGCCGCCCGAGGCGCTGGAGATCACCCGCAAGTTCATGA
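
A match line like the ones shown between each upper and lower sequence can be recomputed from the two aligned strings, which is useful when the column spacing of a rendered match line is not preserved. This is a minimal sketch in Python under the assumption that a column counts as a match when both bases are identical (ignoring case) and neither is a gap. The function names `match_line` and `percent_identity` are hypothetical, and the scoring rule is an assumption rather than the documented behaviour of this resource.

```python
def match_line(upper: str, lower: str) -> str:
    """Return '|' for identical aligned bases and ' ' otherwise."""
    if len(upper) != len(lower):
        raise ValueError("aligned sequences must have equal length")
    marks = []
    for a, b in zip(upper, lower):
        same = a != "-" and a.lower() == b.lower()
        marks.append("|" if same else " ")
    return "".join(marks)


def percent_identity(upper: str, lower: str) -> float:
    """Identity over columns where neither sequence has a gap."""
    cols = [(a, b) for a, b in zip(upper, lower) if a != "-" and b != "-"]
    same = sum(a.lower() == b.lower() for a, b in cols)
    return 100.0 * same / len(cols) if cols else 0.0


# Last 6 intronic and first 12 exonic columns of the Oryza sativa alignment.
upper = "ctgcagATTTATGACATC"
lower = "tttcagATCTATGATATC"
line = match_line(upper, lower)
identity = percent_identity(upper, lower)
print(upper)
print(line)
print(lower)
```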

Mapped EST sequences

Showing partial alignments of ESTs and genomic sequences.


 ATGC     EST sequence
 ATGC     genomic sequence (exon)
 atgc     genomic sequence (truncated intron)


EST: gi|208388678|gb|DC941505.1|DC941505
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGNATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCANGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|100388568|gb|BY952170.1|BY952170
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAAGCT-CAAGTGGGTGTGTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAA-GCTTCAAGTGGGTGTGTT
EST: gi|100405929|gb|BY991454.1|BY991454
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|67718483|gb|BJ968742.1|BJ968742
EST:     TCGTGCTGGACGANGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTC
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTC
EST: gi|7147757|gb|AW509622.1|AW509622
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|100362821|gb|BY947545.1|BY947545
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTTCA-GTGGGTGTGTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTT-CAAGTGGGTGTGTT
EST: gi|67718047|gb|BJ968306.1|BJ968306
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|162259491|gb|FC435002.1|FC435002
EST:     GGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCCTTCCAGCCTTCTTCCACAGAAGCTTCAAGTGGGTGTGT
genomic: GGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACAT-CTTCCAG-CTTCTTCCACAGAAGCTTCAAGTGGGTGTGT
EST: gi|67726661|gb|BJ976920.1|BJ976920
EST:     CGTGCTGGACGAGGCCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: CGTGCTGGACGAGG-CCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|67569151|gb|BJ941975.1|BJ941975
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|208375488|gb|DC937003.1|DC937003
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGC
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGC
EST: gi|208346400|gb|DC905305.1|DC905305
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTT
EST: gi|208400626|gb|DC904602.1|DC904602
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCANGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|100357901|gb|BY946631.1|BY946631
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAAAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|67718174|gb|BJ968433.1|BJ968433
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|100373426|gb|BY949320.1|BY949320
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCA-GGATCAG                         ATTTATGACATCTTCCAGCTTCTTTCACAGAAGCTTCA-GTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|208359138|gb|DC934641.1|DC934641
EST:     TCGTGCTGGACGANGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCA
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCA
EST: gi|208392366|gb|DC907885.1|DC907885
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGC
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGC
EST: gi|100394940|gb|BY990001.1|BY990001
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|208388645|gb|DC941486.1|DC941486
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCANGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|162220742|gb|FC395511.1|FC395511
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|208378518|gb|DC927215.1|DC927215
EST:     TCGTGCTNGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|208353687|gb|DC938754.1|DC938754
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCANGGATCAN                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|162224598|gb|FC400494.1|FC400494
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|100393761|gb|BY953501.1|BY953501
EST:     CGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTTCA-GGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCA
genomic: CGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATT-CAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCA
EST: gi|208395856|gb|DC911865.1|DC911865
EST:     TCGTGCTGGACGANGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTTCTTCCACAGAAGCTNCANGTGGGTGTGTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTT-CTTCCACAGAAGCTTCAAGTGGGTGTGTT
EST: gi|100401889|gb|BY954761.1|BY954761
EST:     CGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGGATTCANGGATCAG                         AATTATGACATCTTCCAGCTTCTTCCACAGAAAGCTTCA-GTGGGTGTGTT
genomic: CGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGG-ATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAA-GCTTCAAGTGGGTGTGTT
EST: gi|100403374|gb|BY991171.1|BY991171
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|208397355|gb|DC908245.1|DC908245
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTT
EST: gi|162198132|gb|FC373082.1|FC373082
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|67718173|gb|BJ968432.1|BJ968432
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|162218220|gb|FC395018.1|FC395018
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|100403879|gb|BY955147.1|BY955147
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCAACAGAAGCTTCAAAGTGG-TGTGTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAA-GTGGGTGTGTT
EST: gi|208392199|gb|DC957278.1|DC957278
EST:     CGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: CGTGCTGGACGAGGCCGATGAGATGCTTTCGA-GGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|100447413|gb|CJ971712.1|CJ971712
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|162221100|gb|FC397741.1|FC397741
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|100367028|gb|BY948332.1|BY948332
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTTCACAGAAGCTTTCA-GTGGGTGTGTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTT-CAAGTGGGTGTGTT
EST: gi|67726662|gb|BJ976921.1|BJ976921
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|208356832|gb|DC926843.1|DC926843
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|100396603|gb|BY990207.1|BY990207
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|208350849|gb|DC913453.1|DC913453
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGNATCAN                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCA
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCA
EST: gi|162272354|gb|FC449019.1|FC449019
EST:     TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAG                         ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: TCGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTCAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
EST: gi|100386440|gb|BY951236.1|BY951236
EST:     CGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATTTCA-GGATCAG                         AATTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
genomic: CGTGCTGGACGAGGCCGATGAGATGCTTTCGAGGGGATT-CAAGGATCAGgtttgttcgg ... tcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT
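
The EST/genomic pairs above differ in a few columns through substitutions, gaps (`-`) and uncalled bases (`N`). The following minimal Python sketch classifies such columns for one aligned pair. It assumes both strings are already aligned and of equal length, and it treats `N` as ambiguous rather than as a mismatch. The function name `compare_est` and the three categories are illustrative choices, not something defined by this resource.

```python
def compare_est(est: str, genomic: str) -> dict[str, list[int]]:
    """Classify differing columns of an aligned EST/genomic pair (0-based)."""
    result = {"mismatch": [], "gap": [], "ambiguous": []}
    for i, (e, g) in enumerate(zip(est.upper(), genomic.upper())):
        if e == g:
            continue
        if e == "-" or g == "-":
            result["gap"].append(i)
        elif e == "N" or g == "N":
            result["ambiguous"].append(i)  # uncalled base, not a true mismatch
        else:
            result["mismatch"].append(i)
    return result


# Exonic part downstream of the splice site for EST BY949320 (copied from above).
est = "ATTTATGACATCTTCCAGCTTCTTTCACAGAAGCTTCA-GTGGGTGTGTTT"
genomic = "ATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTT"
diffs = compare_est(est, genomic)
print(diffs)
```

Differences of this kind in single-pass EST reads are commonly sequencing errors, so the result is best read as a quality indicator for the read rather than as evidence of sequence variation.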


 atgc   intronic sequence     ATGC   exonic sequence

Intronic sequence truncated to 55 bases.

agctatgagagatattgctcaacgctaacgaaattgattcgggcatcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA
                                           catcttct  CT-rich tract
                              aaattgatt  TA-rich tract
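
The truncation to 55 intronic bases can be reproduced from the full record. This is a minimal sketch in Python, assuming the lowercase/uppercase convention used throughout this page. The function name `truncate_intron` and its `keep` parameter are hypothetical and only illustrate the idea.

```python
def truncate_intron(seq: str, keep: int = 55) -> str:
    """Keep only the last `keep` intronic (lowercase) bases before the exon."""
    # Index of the first uppercase (exonic) base; end of string if none.
    boundary = next((i for i, b in enumerate(seq) if b.isupper()), len(seq))
    start = max(0, boundary - keep)
    return seq[start:]


# Short excerpt around the splice site, truncated to 10 intronic bases.
site = "gaaattgattcgggcatcttctgcag" + "ATTTATGACA"
short = truncate_intron(site, keep=10)
print(short)
```

With the default `keep=55`, an intron shorter than 55 bases is returned unchanged.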

Putative cis-regulatory sequences

 atgc   intron     ATGC   exonic elements by Pertea et al.
 ATGC   exon       atgc   putative intronic elements
                   ATGC   putative exonic elements identified for retained introns
        10        20        30        40        50        60        70        80        90        100       110       120       130       140       150       160       170       180       190       200       210       220 
---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------| 
gtttgttcggagagtcctggaatggtagaagaaatttgtttttactcccaagaaaactaattgtttgtgtgactatttcagttctctaattatcctccttataagctatgagagatattgctcaacgctaacgaaattgattcgggcatcttctgcagATTTATGACATCTTCCAGCTTCTTCCACAGAAGCTTCAAGTGGGTGTGTTTTCTGCCACAATGCCACCCGAAGCGCTTGAGATCACCAGGAAGTTCATGA

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - GAGATC
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AGATCA