Sequence
atgc intronic sequence ATGC exonic sequencegtaagtcaagcactaatagaagaatatttgattataaatataacttcatttatggtttgacatgtattggtgcaaaaacagATGGGAATAAGCATAGTAGACTCATCAATTGCCGGATTAGGCGGCTGCCCATATGCAAAAGGAGCTTCAGGAAATGTAGCCACAGAAGATGTGGTTTACA
Basic information
species | Arabidopsis thaliana |
transcript | AT2G26800.2 |
intron # | 8 |
splice site | 3' |
intron type | U2 |
Orthologous splice sites
atgc intronic sequence ATGC exonic sequence
upper sequence: AT2G26800.2 (Arabidopsis thaliana), 3'ss of exon 8
lower sequence: GLYMA07G36340.1 (Glycine max), 3'ss of exon 7
------------gtaagtcaagcactaa--tagaagaatatttgattata----aatataacttcatttatggtttgacatgtattggtgca---aaaacagATGGGAATAAGCATAGTAGACTCATCAATTGCCGGATTAGGCGGCTGCCCATATGCAAAAGGAGCTTCAGGAAATGTAGCCACAGAAGATGTGGTTTACA
| | || | ||| | | | ||| || | | | |||| | || || | ||||| |||| | ||||||||||| || || ||| || || ||| |||| || |||| ||||| |||||||| || |||||||||||||||||||| || |||||||| || ||||
agatttgtataaccagtttaaactttaaactgggctgacattaaatcttttggcagtctaacattgtt-attaatggacat-cattgatattttctaaacagATGGGGATCAGTGCAGTTGATTCTTCAGTTGCTGGTCTAGGTGGCTGTCCATATGCTAAGGGAGCTTCAGGAAATGTAGCTACCGAAGATGTTGTGTACA
upper sequence: AT2G26800.2 (Arabidopsis thaliana), 3'ss of exon 8
lower sequence: GLYMA20G08360.1 (Glycine max), 3'ss of exon 9
--------------------gtaagtcaagcactaatagaagaatatttgattataaatataacttcatttatggtttgacatgtattggtgcaaaaacagATGGGAATAAGCATAGTAGACTCATCAATTGCCGGATTAGGCGGCTGCCCATATGCAAAAGGAGCTTCAGGAAATGTAGCCACAGAAGATGTGGTTTACA
||| || || || | ||| | | | | | ||| |||| || || | ||||||||||| || || ||| || || ||| |||| || |||| ||||| || ||||| || |||||||||||||||||||| || |||||||| || ||||
ttgtataacaagtttaaactttaaactgggccgacattaaaaatccttttgcagtcta-acagtgttattaatggacgttattgatttcttttctaaacagATGGGGATCAGTGCAGTTGATTCTTCAGTTGCTGGTCTAGGTGGCTGTCCTTATGCCAAGGGAGCTTCAGGAAATGTAGCTACCGAAGATGTTGTGTACA
upper sequence: AT2G26800.2 (Arabidopsis thaliana), 3'ss of exon 8
lower sequence: GLYMA14G04110.1 (Glycine max), 3'ss of exon 8
------------------------------------------------gtaagtcaagcactaatagaagaa---tatttgattataaatataactt--catttatggtttgacatgtattggtgcaaaaacagATGGGAATAAGCATAGTAGACTCATCAATTGCCGGATTAGGCGGCTGCCCATATGCAAAAGGAGCTTCAGGAAATGTAGCCACAGAAGATGTGGTTTACA
|| || || | | || | | |||| || | ||| || | || || | ||| || |||||||||| || |||| ||| || || || |||| || | || || || |||||||| || ||||||||||||||||| || || |||||||| ||||| |
gtaagtcatcaactaaatggcttggacttttgacatgcttgtcttttagtgtgtaaaaaaatgatttcatttttttgtttgctttgttttggcactcaacaatcattaagggatgttaatttgtatttgaacagATGGGGATTAGCACAGTGGATTCCTCTGTTGCTGGTCTTGGTGGGTGTCCATATGCTAAGGGAGCTTCAGGAAATGTTGCAACTGAAGATGTTGTTTATA
upper sequence: AT2G26800.2 (Arabidopsis thaliana), 3'ss of exon 8
lower sequence: PP1S138_43V6.1 (Physcomitrella patens), 3'ss of exon 5
-------------------------------------------------gtaagtcaagcactaatagaagaatatttgattataaatataacttcatttatggtttgacatg-tattggtgcaaaaacagATGGGAATAAGCATAGTAGACTCATCAATTGCCGGATTAGGCGGCTGCCCATATGCAAAAGGAGCTTCAGGAAATGTAGCCACAGAAGATGTGGTTTACA
| || | | | || || | || |||| ||| | || | ||||| | || |||||||| || || | || |||||||| | || || | ||||| ||||||||||| ||||||||| | |||||||| ||||| || |||||| | |||
gtatgaagcgtgcctactgcaccaaattatttcgtaccagaaaatgagttttggtgatcgttttacagttcaaaaaactgttggtaatagtactgaagcta--atcctacatggcgatcaactgaatgcagATGGGCATCAGTGTGGTGGACTCATCGGTGGCGGGCCTGGGCGGATGCCCATATGCCAAAGGAGCTACCGGAAATGTGGCCACCGAGGATGTGATCTACT atgc intronic sequence ATGC exonic sequenceIntronic sequence truncated to 55 bases.tttgattataaatataacttcatttatggtttgacatgtattggtgcaaaaacagATGGGAATAAGCATAGTAGACTCATCAATTGCCGGATTAGGCGGCTGCCCATATGCAAAAGGAGCTTCAGGAAATGTAGCCACAGAAGATGTGGTTTACA
ttataaatataacttc TA-rich tract
Putative cis-regulatory sequences
atgc | intron | ATGC | exonic elements by Pertea et al. |
ATGC | exon | atgc | putative intronic elements |
| | ATGC | putative exonic elements identified for retained introns |
10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220
---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
gtaagtcaagcactaatagaagaatatttgattataaatataacttcatttatggtttgacatgtattggtgcaaaaacagATGGGAATAAGCATAGTAGACTCATCAATTGCCGGATTAGGCGGCTGCCCATATGCAAAAGGAGCTTCAGGAAATGTAGCCACAGAAGATGTGGTTTACA
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AAGGAG
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - GGAGCT
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - GAGCTT
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AAGATG