Tc01v2_t031940.1

Overview
Name             Tc01v2_t031940.1
Unique Name      Tc01v2_t031940.1
Type             mRNA
Organism         Theobroma cacao (cacao)
Sequence length  1566
Properties
Property Name   Value
Model evidence  Supporting evidence includes similarity to: 9 ESTs, 21 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 27 samples with support for all annotated introns
Product         3-phosphoshikimate 1-carboxyvinyltransferase 2
Note            3-phosphoshikimate 1-carboxyvinyltransferase, chloroplastic
Cross References
External references for this mRNA
Database  Accession
GeneID    18614319
Genbank   XM_007052025.2
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Species          Type
Tc01v2_g031940  Tc01v2_g031940  Theobroma cacao  gene


The following polypeptide feature(s) derive from this mRNA:

Feature Name      Unique Name       Species          Type
Tc01v2_p031940.1  Tc01v2_p031940.1  Theobroma cacao  polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name    Unique Name  Species          Type
exon-auto71870  auto71870    Theobroma cacao  exon
exon-auto71871  auto71871    Theobroma cacao  exon
exon-auto71872  auto71872    Theobroma cacao  exon
exon-auto71873  auto71873    Theobroma cacao  exon
exon-auto71874  auto71874    Theobroma cacao  exon
exon-auto71875  auto71875    Theobroma cacao  exon
exon-auto71876  auto71876    Theobroma cacao  exon
exon-auto71877  auto71877    Theobroma cacao  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name   Unique Name  Species          Type
CDS-auto71878  auto71878    Theobroma cacao  CDS
CDS-auto71879  auto71879    Theobroma cacao  CDS
CDS-auto71880  auto71880    Theobroma cacao  CDS
CDS-auto71881  auto71881    Theobroma cacao  CDS
CDS-auto71882  auto71882    Theobroma cacao  CDS
CDS-auto71883  auto71883    Theobroma cacao  CDS
CDS-auto71884  auto71884    Theobroma cacao  CDS
CDS-auto71885  auto71885    Theobroma cacao  CDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc01v2_t031940.1 ID=Tc01v2_t031940.1|Name=Tc01v2_t031940.1|organism=Theobroma cacao|type=mRNA|length=1566bp
ATGGCACACGCTAGCAAAATCTACAGTGGGACGCAAAAAACATGTGTTTT
ACCCAATGTCTCAAAATCCCAGAAGCCTAAATGTATACCTTCGGTTTCCT
TCAGATCAAATCTCAAAGGAAGCTTTTCAAGTTCTTGGGGTTTGGTTTTC
AAGAGCAACGGTAAGTTGGGAACAATTAAGGTTGGGCCTTTGCTGGTTTC
TGCTTCAATGGCAACAGCAGAGAAGCCATCCAGTGCATCAGAAATCGTGC
TTCAACCAATTAATGAAATTTCGGGTACGGTTAAGTTACCCGGGTCCAAA
TCACTGTCCAATCGAATTCTGCTTCTTGCTGCTCTATCTGAGGGAACTAC
TGTGGTGGACAATTTGTTGAACAGTGATGATGTTCATCACATGCTTGTTG
CCTTGGGAAAACTTGGGCTACGTGTGGAACATGACAGTGAACAGAAACGA
GCCATTGTGGAAGGTTGCGGTGGTCAATTTCCAGTGGGGAAAGGGGAAGG
TCAAGAAATTGAGCTTTTCCTTGGGAATGCAGGAACTGCAATGCGACCAC
TTACTGCTGCTATTACTGCTGCCGGTGGCAATTCAAGCTACATACTTGAT
GGTGTGCCCCGAATGAGAGAGAGACCAATTGGGGACTTAGTAACTGGTCT
TAAGCAGCTGGGTGCAGATGTAGATTGTACTCTTGGCACAAATTGTCCCC
CTGTTCTTATAAATGGAAAGGGTGGTCTTCCTGGGGGAAAGGTGAAACTT
TCTGGTTCAATCAGTAGTCAATACTTGACTGCTTTACTCATGGCAGCTCC
TTTGGCTCTTGGGGATGTGGAAATTGAGATAATTGATAAATTGATTTCAA
TCCCATATGTTGAAATGACTATAAAGTTGATGGAAAGGTTTGGGGTCAGT
GTGGAGCACACTGGTAGTTGGGATCGATTCTATATCCGAGGACGTCAAAA
GTACAAGTCTCCTGGAAAGGCTTATGTTGAAGGTGATGCTTCAAGTGCTA
GTTACTTCCTTGCTGGTGCAGCAGTCACTGGTGGGACTGTCACAGTAGAA
GGATGTGGTACAAGTAGCTTACAGGGTGATGTAAAATTTGCTGAGGTTCT
TGAGAAGATGGGTGCCAAAGTCACCTGGACTGAGAACAGCGTCACTGTCA
CTGGGCCACCAAGAAATTCCTCTGGGAAGAAACACTTGCGTGCCATTGAT
GTCAACATGAACAAAATGCCAGATGTTGCCATGACTCTCGCTGTTGTGGC
ACTTTATGCTGATGGTCCCACTGCCATAAGAGATGTGGCAAGTTGGAGGG
TGAAAGAGACTGAAAGGATGATTGCTATATGCACTGAACTCAGGAAGCTT
GGAGCAACAGTTGAAGAAGGACCAGATTATTGTGTGATCACTCCACCAGA
GAAATTAAATGTGACAGCAATAGACACTTATGATGATCACCGAATGGCCA
TGGCATTCTCTCTTGCTGCCTGTGCAGATGTTCCAGTTACCATCAATGAT
CCTGGTTGCACACGGAAAACCTTCCCTGACTACTTTGAAGTTCTCGAGAA
GGTTACAAAGCATTGA

protein sequence of Tc01v2_p031940.1

>Tc01v2_p031940.1 ID=Tc01v2_p031940.1|Name=Tc01v2_p031940.1|organism=Theobroma cacao|type=polypeptide|length=522bp
MAHASKIYSGTQKTCVLPNVSKSQKPKCIPSVSFRSNLKGSFSSSWGLVF
KSNGKLGTIKVGPLLVSASMATAEKPSSASEIVLQPINEISGTVKLPGSK
SLSNRILLLAALSEGTTVVDNLLNSDDVHHMLVALGKLGLRVEHDSEQKR
AIVEGCGGQFPVGKGEGQEIELFLGNAGTAMRPLTAAITAAGGNSSYILD
GVPRMRERPIGDLVTGLKQLGADVDCTLGTNCPPVLINGKGGLPGGKVKL
SGSISSQYLTALLMAAPLALGDVEIEIIDKLISIPYVEMTIKLMERFGVS
VEHTGSWDRFYIRGRQKYKSPGKAYVEGDASSASYFLAGAAVTGGTVTVE
GCGTSSLQGDVKFAEVLEKMGAKVTWTENSVTVTGPPRNSSGKKHLRAID
VNMNKMPDVAMTLAVVALYADGPTAIRDVASWRVKETERMIAICTELRKL
GATVEEGPDYCVITPPEKLNVTAIDTYDDHRMAMAFSLAACADVPVTIND
PGCTRKTFPDYFEVLEKVTKH*
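
The mRNA and polypeptide records above can be cross-checked against each other. The following is a minimal sketch, not part of the database record, that translates the first 30 and last 18 nucleotides of the listed mRNA with the standard genetic code and compares them with the ends of the listed protein. It assumes the standard nuclear code applies and that the mRNA is entirely coding (it starts with ATG and ends with TGA); the names `translate`, `first_30`, and `last_18` are illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(nt: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    nt = nt.upper().replace("U", "T")
    if len(nt) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, len(nt), 3))


# Fragments copied from the mRNA sequence listed above.
first_30 = "ATGGCACACGCTAGCAAAATCTACAGTGGG"
last_18 = "AAGGTTACAAAGCATTGA"

print(translate(first_30))  # MAHASKIYSG
print(translate(last_18))   # KVTKH*
print(1566 // 3)            # 522 codons, including the stop codon
```

Both fragments agree with the start (MAHASKIYSG) and end (KVTKH*) of the listed polypeptide. Since 1566 / 3 = 522, the length of 522 in the polypeptide header appears to count the terminal stop symbol, which would leave 521 amino acid residues.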