Tc09v2_t002690.3

Overview
NameTc09v2_t002690.3
Unique NameTc09v2_t002690.3
TypemRNA
OrganismTheobroma cacao (cacao)
Sequence length1140
Properties
Property NameValue
NoteDNA glycosylase superfamily protein isoform 1
Model evidenceSupporting evidence includes similarity to: 3 ESTs, 9 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 27 samples with support for all annotated introns
Productuncharacterized LOC18587847, transcript variant X3
Cross References
External references for this mRNA
DatabaseAccession
GeneID18587847
GenbankXM_007011876.2
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameSpeciesType
Tc09v2_g002690Tc09v2_g002690Theobroma cacaogene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameSpeciesType
Tc09v2_p002690.3Tc09v2_p002690.3Theobroma cacaopolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
exon-auto425017auto425017Theobroma cacaoexon
exon-auto425018auto425018Theobroma cacaoexon
exon-auto425019auto425019Theobroma cacaoexon
exon-auto425020auto425020Theobroma cacaoexon
exon-auto425021auto425021Theobroma cacaoexon
exon-auto425022auto425022Theobroma cacaoexon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
CDS-auto425023auto425023Theobroma cacaoCDS
CDS-auto425024auto425024Theobroma cacaoCDS
CDS-auto425025auto425025Theobroma cacaoCDS
CDS-auto425026auto425026Theobroma cacaoCDS
CDS-auto425027auto425027Theobroma cacaoCDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc09v2_t002690.3 ID=Tc09v2_t002690.3|Name=Tc09v2_t002690.3|organism=Theobroma cacao|type=mRNA|length=1140bp
ATGTCCGGTGCTCCAAGAATGAGGTCGATGAATGTGGCAGATTCTGAGGC
AAGGCCCGTACTGGGACCTGCAGGGAACAAGGCGGGTTCGTTGAGTGCAA
GAAAACCAGCTTCAAAGCCTTTGAGAAAGGTTGAAAAGTCTCCGGTTGAG
GTCACTGTAGCAGAGGAGAAAAAAGCCCTTCCATCATCTACTGTCAATTC
ACTTTCTCCAAAAACGCATTCTGTTAGTGTTCCATCAGTGCTACGCCGCC
ACGAGCAGTTATTACATTCTAATTTATCACTAAATGCTTCTTGTTCATCC
GATGCTTCCACGGATTCATTTCACAGTCGAGCATCTACTGGTAGGTTAAT
TCGGTCAAATAGTGTAGGAAATAGGAGGAAGCCATACGCATCAAAGCCGA
GAAGCGTTGTTTCTGACGGTGGTTTGGATTCACCACCTGATGGTTCACAT
CAGAAGAAGAGATGTGCCTGGGTGACACCAAATACAGATCCAAGTTATGT
TGCTTTCCATGATGAAGAGTGGGGAGTTCCTGTACATGACGACAGGAAAT
TGTTTGAGCTGCTTGTGCTTTCGGGTGCATTGTCTGAACTTACATGGCCT
GCTATACTAAGCAAAAGGCACATAGTTAGGGAAGTTTTTGTGGATTTTGA
TCCTGTTGCTGTATCAAAATTGAATGAGAAGAAGTTAGTAGCACCTGGTA
GCATCGCAAGTTCATTGTTATCAGAACTAAAGCTGCGAGCTATCATCGAG
AATGCGCGCCAAATATCTAAGGTTATAGATGAATTCGGGTCATTCGATGA
GTATATTTGGAGTTTTGTGAATCACAAGCCTATAGTTAGCAGATTCAGAT
ATCCTCGCCAGGTTCCGGTTAAAACTCCAAAAGCAGATGTCATAAGCAAA
GATCTGGTAAGAAGGGGCTTTCGAAGTGTGGGGCCCACGGTCATCTACTC
GTTCATGCAAGTGGCGGGAATAACCAACGACCATCTCACGAGTTGTTTCA
GATTCCAGGAATGCATAACAGCAGCAGAAGGAAAAGAGGAAAATGGTATC
AAAGATATGCCTGAAGAGAAAAAGACCGATAACGTGATGGAATCAAAGTT
ATCCATAGCTATCGACGAATTGAGCTTCTCATCAGAATGA
back to top

protein sequence of Tc09v2_p002690.3

>Tc09v2_p002690.3 ID=Tc09v2_p002690.3|Name=Tc09v2_p002690.3|organism=Theobroma cacao|type=polypeptide|length=380bp
MSGAPRMRSMNVADSEARPVLGPAGNKAGSLSARKPASKPLRKVEKSPVE
VTVAEEKKALPSSTVNSLSPKTHSVSVPSVLRRHEQLLHSNLSLNASCSS
DASTDSFHSRASTGRLIRSNSVGNRRKPYASKPRSVVSDGGLDSPPDGSH
QKKRCAWVTPNTDPSYVAFHDEEWGVPVHDDRKLFELLVLSGALSELTWP
AILSKRHIVREVFVDFDPVAVSKLNEKKLVAPGSIASSLLSELKLRAIIE
NARQISKVIDEFGSFDEYIWSFVNHKPIVSRFRYPRQVPVKTPKADVISK
DLVRRGFRSVGPTVIYSFMQVAGITNDHLTSCFRFQECITAAEGKEENGI
KDMPEEKKTDNVMESKLSIAIDELSFSSE*
back to top