Tc01v2_t002720.1

Overview
NameTc01v2_t002720.1
Unique NameTc01v2_t002720.1
TypemRNA
OrganismTheobroma cacao (cacao)
Sequence length1101
Properties
Property NameValue
Model evidenceSupporting evidence includes similarity to: 36 Proteins, and 72% coverage of the annotated genomic feature by RNAseq alignments
Productdihydroflavonol-4-reductase
NoteDihydroflavonol-4-reductase
Cross References
External references for this mRNA
DatabaseAccession
GeneID18610896
GenbankXM_018129818.1
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameSpeciesType
Tc01v2_g002720Tc01v2_g002720Theobroma cacaogene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameSpeciesType
Tc01v2_p002720.1Tc01v2_p002720.1Theobroma cacaopolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
exon-auto6362auto6362Theobroma cacaoexon
exon-auto6363auto6363Theobroma cacaoexon
exon-auto6364auto6364Theobroma cacaoexon
exon-auto6365auto6365Theobroma cacaoexon
exon-auto6366auto6366Theobroma cacaoexon
exon-auto6367auto6367Theobroma cacaoexon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
CDS-auto6368auto6368Theobroma cacaoCDS
CDS-auto6369auto6369Theobroma cacaoCDS
CDS-auto6370auto6370Theobroma cacaoCDS
CDS-auto6371auto6371Theobroma cacaoCDS
CDS-auto6372auto6372Theobroma cacaoCDS
CDS-auto6373auto6373Theobroma cacaoCDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc01v2_t002720.1 ID=Tc01v2_t002720.1|Name=Tc01v2_t002720.1|organism=Theobroma cacao|type=mRNA|length=1101bp
ATGGAAGGCCAAACAGCTGCCAGGGTTTGTGTCACAGGAGCTGCCGGTTT
CATTGGTTCCTGGCTTGTCATGAGGCTTCTTCAACGGGGCTACACAGTTA
AAGCAACCGTTCGCGACCCCGCAAATTTAAACAAGGTGAAGCATTTGCTA
GAGCTGCCAAAAGCTGATGAAAACTTGATATTGTGGAAGGCAGACCTTAT
GGAAGAAGGCAGCTTCGACGAGGCCGTCAGAGGTTGCTCGGGAGTCTTCC
ATGTAGCCACGCCTAAGGACTTCGAGTCACAAGACCCTGAGAATGAAGTG
ATTAAGCTGACAGTAGATGGGGCGTTGAGCATCTTAAAGTCATGCGCTAA
TGCAAAAACTGTGGAAAGGTTGTTGTTTGTGTCATCCGGTGGAACTGTTG
CCATGCAGGAGCGGAAACTTGTCCAGTATGATGAGACTTGTTGGAGCGAT
GTTGATTTCATCAGAGCTAAGAAGATGACCGGATGGATGTATTTCGAATC
CAAAACTTTGGCAGAGAAAGCAGCATGGGGAGCTGCTCAAGAGAACGACA
TTGATTTCATCAGCGTTAAACCAACTCTCGTTGTTGGCCCATTCATCATC
CCATCGATGCCACCAAGCCTGATCACTGCACTTTCCTTGATAACGAGAAA
TGAAGGGCATTACTCTATCATAAAGCAATGCCAGTATGTGCATTTGGATG
ATCTTTGTAATGCTCTTGTTTTCCTGTACGAGAACCCTGAAGCCCATGGT
CGATACATTTGCTCTTCTCATGATGCAACCATATTCGATCTGGCGGAAAT
GCTTCGGCAGAAATACCCAGGATACGACATCCCCATCCAGTTCAGGGGTG
TCGAAGAACACTTGGAAGTGATCTCTTTCTCCTCTAAGACATTGACGGAC
CTGGGGTTCCAGTTCAAGTATGGCTTCGAGAACATGTACACAGGAGCCAT
TAAAACCTGTATTGAGAAGGGACTGATTCCTCGTTGTTTAAACGACAATG
ACCATGCCGCTGTTGGCAAAATCCATAACATCATGGTTTTAGATGTTGGT
TTTACTTCTGTGGCAATAAATTTACTACCTGTTTGTTTGAAGGAAGCATA
G
back to top

protein sequence of Tc01v2_p002720.1

>Tc01v2_p002720.1 ID=Tc01v2_p002720.1|Name=Tc01v2_p002720.1|organism=Theobroma cacao|type=polypeptide|length=367bp
MEGQTAARVCVTGAAGFIGSWLVMRLLQRGYTVKATVRDPANLNKVKHLL
ELPKADENLILWKADLMEEGSFDEAVRGCSGVFHVATPKDFESQDPENEV
IKLTVDGALSILKSCANAKTVERLLFVSSGGTVAMQERKLVQYDETCWSD
VDFIRAKKMTGWMYFESKTLAEKAAWGAAQENDIDFISVKPTLVVGPFII
PSMPPSLITALSLITRNEGHYSIIKQCQYVHLDDLCNALVFLYENPEAHG
RYICSSHDATIFDLAEMLRQKYPGYDIPIQFRGVEEHLEVISFSSKTLTD
LGFQFKYGFENMYTGAIKTCIEKGLIPRCLNDNDHAAVGKIHNIMVLDVG
FTSVAINLLPVCLKEA*
back to top