Tc01v2_t031950.1

Overview
Name             Tc01v2_t031950.1
Unique Name      Tc01v2_t031950.1
Type             mRNA
Organism         Theobroma cacao (cacao)
Sequence length  2238
Properties
Property Name   Value
Model evidence  Supporting evidence includes similarity to: 33 ESTs, 21 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 27 samples with support for all annotated introns
Product         transketolase, chloroplastic
Note            Transketolase-2, chloroplastic
Cross References
External references for this mRNA
Database  Accession
GeneID    18614321
Genbank   XM_007052028.2
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Species          Type
Tc01v2_g031950  Tc01v2_g031950  Theobroma cacao  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name      Unique Name       Species          Type
Tc01v2_p031950.1  Tc01v2_p031950.1  Theobroma cacao  polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name    Unique Name  Species          Type
exon-auto71889  auto71889    Theobroma cacao  exon
exon-auto71890  auto71890    Theobroma cacao  exon
exon-auto71891  auto71891    Theobroma cacao  exon
exon-auto71892  auto71892    Theobroma cacao  exon
exon-auto71893  auto71893    Theobroma cacao  exon
exon-auto71894  auto71894    Theobroma cacao  exon
exon-auto71895  auto71895    Theobroma cacao  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name   Unique Name  Species          Type
CDS-auto71896  auto71896    Theobroma cacao  CDS
CDS-auto71897  auto71897    Theobroma cacao  CDS
CDS-auto71898  auto71898    Theobroma cacao  CDS
CDS-auto71899  auto71899    Theobroma cacao  CDS
CDS-auto71900  auto71900    Theobroma cacao  CDS
CDS-auto71901  auto71901    Theobroma cacao  CDS
CDS-auto71902  auto71902    Theobroma cacao  CDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc01v2_t031950.1 ID=Tc01v2_t031950.1|Name=Tc01v2_t031950.1|organism=Theobroma cacao|type=mRNA|length=2238bp
ATGGCCTCAACTTCGTCTACCACCCTATCCCAAGCCCTCTTGGCTCGCGC
CATCTCTTACCATGGCTCCACCCAGTCCTCCGACCACCGCGTCTCTCTCT
CCACCCTCTCTCTCCCCACCTTCTCTGGCCTCAAATCCACTACCTCACGT
GCTTCGGCTTCTCGTAGACGCCCTCCCGTGCGCTCCTACCAGAACCGCCA
AGTCCGTGCCGCCGCCGTGGAGACAATTGGTACTGCCGCGGAGACTTCCT
TGGTGGAGAAATCGGTCAATACGATCAGATTCCTGGCCATTGATGCTGTC
GAGAAGGCGAATTCGGGTCATCCTGGGTTGCCTATGGGCTGTGCTCCGAT
GGGCCACATTTTGTACGATGAAGTCATGAGGTATAACCCGAAGAACCCTT
ATTGGTTCAACCGTGACCGTTTCGTGTTGTCCGCTGGTCACGGTTGTATG
TTGCAGTATGCTCTGCTTCACCTCGCTGGTTACGACAGTGTCCTGGAAGA
AGATTTGAAGAATTTCCGTCAGTGGGGTAGCAAAACCCCAGGACATCCTG
AGAACTTTGAAACACTTGGAGTTGAAGTCACAACTGGTCCTCTTGGTCAA
GGTGTTGCGAATGCTGTCGGACTGGCTCTTGCGGAGAAACACTTGGCTGC
TAGATTCAACAAGCCAGACAATGAGATCGTTGACCACTACACATATGTTA
TTTTGGGAGATGGGTGTCAAATGGAGGGTATTGCAAATGAAGCATGTTCA
CTTGCTGGACACTGGGGACTTGGGAAGCTTATAGCTTTCTATGATGACAA
CCACATTTCCATTGATGGTGACACTGAAATTGCCTTTACTGAGAGTGTTG
ATAAGCGTTTTGAGGGGCTTGGGTGGCATGTCATCTGGGTCAAGAATGGA
AACACTGGCTATGATGATATTCGTGCTGCTATTAAGGAAGCAAAGGCTGT
TAAAGACAAACCCACTTTGATCAAGCTGACAACCACCATTGGTTATGGAT
CCCCGAACAAGGCAAACTCATACAGTGTACATGGGAGTGCACTGGGTGCC
AAGGAAGTGGATGCTACTAGGAAAAATCTTGGATGGCCATATGAGCCTTT
CCATGTACCTGAAGATGTTAAAACGCACTGGAGTCGCCATGTCCCTCAGG
GTGCTGCTCTTGAAGCCGAATGGAATGCCAAGTTTGCTGAATATGAGAAG
AAGTACAAAGAGGAAGCTGCAGAGCTCAAGACAATCATCACTGGTGAACT
ACCTGCTGGATGGGAGAAGGCACTTCCGACATACACTCCAGAGAGCCCAC
CTGATGCTACCAGAAATCTCTCTCAACAAAATCTCAATGCCCTTGTAAAA
GTACTCCCTGGTCTTCTTGGTGGAAGTGCAGACCTTGCTTCTTCCAACAT
GACCTTGCTCAAAATGTATGGTGATTTCCAGAAGGACACCCCTGAGGAAC
GCAATGTTAGGTTTGGTGTTAGGGAACATGGAATGGGAGCCATCTCAAAT
GGCATTGCCCTTCACAGCCCTGGTCTGATTCCATACTGTGCTACTTTCTT
TGTCTTTACTGACTACATGAGAGCTGCCATCAGGATTTCTGCCTTGTGTG
AAGCTGGAGTTATCTATGTTATGACCCACGATTCCATTGGTCTTGGGGAA
GATGGACCAACCCACCAGCCAATTGAGCACTTGGCGAGCTTCCGTGCAAT
GCCTAACATTTTAATGCTCCGTCCAGCTGATGGAAATGAAACTGCTGGTG
CATACAAGGTTGCTGTCCTCAACAGGAAGAGACCCTCAATTCTTGCTCTC
TCTCGGCAAAAGCTGCCCCAACTTGCTGGAACTTCCATTGAGGGAGTTGA
AAAGGGTGGCTACATTGTTTCAGACAATTCTTCAGGCAACAAGCCTGATG
TAATTCTGATTGGAACTGGTTCTGAGCTAGAGATTGCTGCTAAAGCTGCT
GAGGAACTAAGGAATGGAGGAAAGGCTGTTAGGGTTGTCTCCCTGGTTTC
TTGGGAGCTCTTTGATGAGCAATCTGATGCCTACAAGGAAAGTGTTTTGC
CATCTGCTGTATCAGCTAGGGTGAGTATTGAGGCTGGATCAACATTTGGA
TGGGAGAAGATAGTTGGATCCAAAGGAAAGTCAATAGGAATTGACCGGTT
TGGCGCAAGTGCACCAGCAGGCAGAATATACAAGGAATTTGGTTTAACCC
CAGAGGCTGTTGTTACAGCAGCGAAAGAACTCTGCTAG

protein sequence of Tc01v2_p031950.1

>Tc01v2_p031950.1 ID=Tc01v2_p031950.1|Name=Tc01v2_p031950.1|organism=Theobroma cacao|type=polypeptide|length=746bp
MASTSSTTLSQALLARAISYHGSTQSSDHRVSLSTLSLPTFSGLKSTTSR
ASASRRRPPVRSYQNRQVRAAAVETIGTAAETSLVEKSVNTIRFLAIDAV
EKANSGHPGLPMGCAPMGHILYDEVMRYNPKNPYWFNRDRFVLSAGHGCM
LQYALLHLAGYDSVLEEDLKNFRQWGSKTPGHPENFETLGVEVTTGPLGQ
GVANAVGLALAEKHLAARFNKPDNEIVDHYTYVILGDGCQMEGIANEACS
LAGHWGLGKLIAFYDDNHISIDGDTEIAFTESVDKRFEGLGWHVIWVKNG
NTGYDDIRAAIKEAKAVKDKPTLIKLTTTIGYGSPNKANSYSVHGSALGA
KEVDATRKNLGWPYEPFHVPEDVKTHWSRHVPQGAALEAEWNAKFAEYEK
KYKEEAAELKTIITGELPAGWEKALPTYTPESPPDATRNLSQQNLNALVK
VLPGLLGGSADLASSNMTLLKMYGDFQKDTPEERNVRFGVREHGMGAISN
GIALHSPGLIPYCATFFVFTDYMRAAIRISALCEAGVIYVMTHDSIGLGE
DGPTHQPIEHLASFRAMPNILMLRPADGNETAGAYKVAVLNRKRPSILAL
SRQKLPQLAGTSIEGVEKGGYIVSDNSSGNKPDVILIGTGSELEIAAKAA
EELRNGGKAVRVVSLVSWELFDEQSDAYKESVLPSAVSARVSIEAGSTFG
WEKIVGSKGKSIGIDRFGASAPAGRIYKEFGLTPEAVVTAAKELC*
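
The two records above can be cross-checked against each other. The following is a minimal sketch, not part of the database record, assuming the standard genetic code (NCBI translation table 1) and assuming that the header fields are separated by `|` with `key=value` pairs, as they appear in the headers above. The helper names `parse_header` and `translate` are hypothetical and chosen only for illustration. It parses the mRNA header, confirms that the stated length divides evenly into codons, and translates the first five and last four codons of the mRNA sequence so they can be compared with the ends of the protein sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order so that the last base varies fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def parse_header(header):
    """Split a '>id key=value|key=value' header into (id, dict of fields).

    Assumes the field layout seen in the headers above.
    """
    seq_id, _, rest = header.lstrip(">").partition(" ")
    fields = dict(item.split("=", 1) for item in rest.split("|"))
    return seq_id, fields


def translate(nucleotides):
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    return "".join(
        CODON_TABLE[nucleotides[i:i + 3]]
        for i in range(0, len(nucleotides) - 2, 3)
    )


header = (">Tc01v2_t031950.1 ID=Tc01v2_t031950.1|Name=Tc01v2_t031950.1"
          "|organism=Theobroma cacao|type=mRNA|length=2238bp")
seq_id, fields = parse_header(header)

# The length field carries a trailing 'bp' unit; drop it before converting.
length_text = fields["length"]
mrna_length = int(length_text[:-2] if length_text.endswith("bp") else length_text)
codon_count, remainder = divmod(mrna_length, 3)

# First 15 and last 12 nucleotides, copied from the mRNA sequence above.
first_codons = "ATGGCCTCAACTTCG"
last_codons = "GAACTCTGCTAG"

print(seq_id, fields["type"], mrna_length, codon_count, remainder)
print(translate(first_codons), translate(last_codons))
```

Under these assumptions, 2238 nucleotides correspond to 746 codons with no remainder, the last of which is a stop codon. This is consistent with the protein record, whose 746 characters include the trailing `*`.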