Sequence
atgc intronic sequence ATGC exonic sequencegttatcctagttgtctaactcatgaattaatcaattttgttgtttacgtggttgctgataagaactaaccatttctttcctataatcatcttgcagACTAAAATATTGGTTTAGAGCAAAGGGAAAACTTTCTGATGATGATCAAGCTTTGCACAG
Basic information
species | Arabidopsis thaliana |
transcript | AT4G00520.2 |
intron # | 12 |
splice site | 3' |
intron type | U2 |
Orthologous splice sites
atgc intronic sequence ATGC exonic sequence
upper sequence: AT4G00520.2 (Arabidopsis thaliana), 3'ss of exon 12
lower sequence: LOC_Os04g47120.1 (Oryza sativa), 3'ss of exon 12
--gttatcctagttgt--ctaactcatgaattaatcaattttgttgtttacgtggttgctgataagaactaaccatttctttcctataatcatcttgcagACTAAAATATTGGTTTAGAGCAAAGGGAAAACTTTCTGATGATGATCAAGCTTTGCACAG
||||| | || ||| | | ||||| ||| | || || | || | | || | | | | |||| ||| |||| |||||||||||||| | |||||||||||||||||| | ||| |||| ||
aatttatcgcatttcctgctacattgcaatttaatacattgttctgattcattagtgcatctattccattatctaaaaaaatgtatccaatttctttcagCTTAAATTATTGGTTTAGAGCCAGAGGAAAACTTTCTGATGAT---CCAGCACTGCATAG
upper sequence: AT4G00520.2 (Arabidopsis thaliana), 3'ss of exon 12
lower sequence: GLYMA03G01210.1 (Glycine max), 3'ss of exon 12
--------------------------------------------------------------------------------------------gttatcctagttgtctaactcatgaatt--aat-caattttgttgttt-acgtggttgctgataagaactaaccatttctttcctataatcatcttgcagACTAAAATATTGGTTTAGAGCAAAGGGAAAACTTTCTGATGATGATCAAGCTTTGCACAG
| |||| | | || |||| | ||| || || || | | | | || || | ||| ||| || | | |||||| | | ||| |||||||||||||||||||||||||| |||||||| ||| ||||| ||
gttagtcagatgttgtgtctgcaatttttcacttacttttattatcagcatttatgcaatgtactgatcagaaagatttggaagggagcatcgccctcctttctcaccaaatcatttttccaaatgtaaccttaatgcctgattttttaaatgctagatattaatttgaacttgtgtaaatactgtttgcagTTTGAGATACTGGTTTAGAGCAAAGGGAAAACTTTCAGATGATGA---AGCCTTGCATAG
upper sequence: AT4G00520.2 (Arabidopsis thaliana), 3'ss of exon 12
lower sequence: GLYMA18G46610.1 (Glycine max), 3'ss of exon 12
-gttatcctagttgtctaactcatgaattaatcaattttgttgtttacgtggttgctgataagaacta--accatttctttcctataatcatcttgcagACTAAAATATTGGTTTAGAGCAAAGGGAAAACTTTCTGATGATGATCAAGCTTTGCACAG-
|| | || || || || || | | || | | | | || || | | | ||| || ||| ||||| | | ||||||||||||||||| |||||||||||| |||||||| ||| ||||| ||
ctttgccttactttctctgtaaatcttttctacagagccttaatgtatgggaattttaatggtgtatattatcttgacttaattaaaatttgaatgcagtTTGAGATATTGGTTTAGAGCAAGGGGAAAACTTTCAGATGATGA---AGCCTTGCATAGG
upper sequence: AT4G00520.2 (Arabidopsis thaliana), 3'ss of exon 12
lower sequence: GLYMA18G46620.1 (Glycine max), 3'ss of exon 10
-gttatcctagttgtctaactcatgaatt---aatcaattttgttgtttacgtggttgctgataagaactaaccatttctttcctataatcatcttgcagACTAAAATATTGGTTTAGAGCAAAGGGAAAACTTTCTGATGATGATCAAGCTTTGCACAG
|| ||| | ||| || | |||| | | ||| | | | ||| | | | | | | ||| || ||| ||||| | | ||| |||||||||||||||||||||||||| |||||| || || ||||| ||
cctttgccttactttctctctaaataatttctacagagccttgacatcttggaatttgaatgctacatattatcttaacttatttaaaatttaaatgcagTTTGAGATACTGGTTTAGAGCAAAGGGAAAACTTTCAGATGAT---CAGGCCTTGCATAGMapped EST sequences
Showing partial alignments of ESTs and genomic sequences. See full alignments
ATGC EST sequence
ATGC genomic sequence (exon)
ATGC genomic sequence (truncated intron)
EST:
gi|86079630|gb|DR375387.1|DR375387EST: CACTAAACAGAATAAGTCTCCTCCAAG ACTAAAATATTGGTTTAGAGCAAAGGGAAAACTTTCTGATGATGATCAAGC
genomic: CACTAAACAGAATAAGTCTCCTCCAAGgttatcctag ... catcttgcagACTAAAATATTGGTTTAGAGCAAAGGGAAAACTTTCTGATGATGATCAAGC
atgc intronic sequence ATGC exonic sequenceIntronic sequence truncated to 55 bases.gtttacgtggttgctgataagaactaaccatttctttcctataatcatcttgcagACTAAAATATTGGTTTAGAGCAAAGGGAAAACTTTCTGATGATGATCAAGCTTTGCACAG
aactaac putative branch site (score: 1)
tcatctt putative PPT
tataatcat TA-rich tract
Putative cis-regulatory sequences
atgc | intron | ATGC | exonic elements by Pertea et al. |
ATGC | exon | atgc | putative intronic elements |
| | ATGC | putative exonic elements identified for retained introns |
10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220
---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
gttatcctagttgtctaactcatgaattaatcaattttgttgtttacgtggttgctgataagaactaaccatttctttcctataatcatcttgcagACTAAAATATTGGTTTAGAGCAAAGGGAAAACTTTCTGATGATGATCAAGCTTTGCACAG
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AGAGCA
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - GCAAAG
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - CAAGCT