Tc04v2_t015660.4

Overview
Name: Tc04v2_t015660.4
Unique Name: Tc04v2_t015660.4
Type: mRNA
Organism: Theobroma cacao (cacao)
Sequence length: 1392 bp
Properties
Property Name | Value
Model evidence | Supporting evidence includes similarity to: 2 ESTs, 26 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 26 samples with support for all annotated introns
Product | crocetin glucosyltransferase, chloroplastic, transcript variant X1
Note | Crocetin glucosyltransferase, chloroplastic
Cross References
External references for this mRNA
Database | Accession
GeneID | 18602622
Genbank | XM_018119884.1
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name | Unique Name | Species | Type
Tc04v2_g015660 | Tc04v2_g015660 | Theobroma cacao | gene


The following polypeptide feature(s) derive from this mRNA:

Feature Name | Unique Name | Species | Type
Tc04v2_p015660.4 | Tc04v2_p015660.4 | Theobroma cacao | polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name | Unique Name | Species | Type
exon-auto230105 | auto230105 | Theobroma cacao | exon
exon-auto230106 | auto230106 | Theobroma cacao | exon
exon-auto230107 | auto230107 | Theobroma cacao | exon


The following CDS feature(s) are a part of this mRNA:

Feature Name | Unique Name | Species | Type
CDS-auto230108 | auto230108 | Theobroma cacao | CDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc04v2_t015660.4 ID=Tc04v2_t015660.4|Name=Tc04v2_t015660.4|organism=Theobroma cacao|type=mRNA|length=1392bp
ATGATGGCTCAACCTCACGTCCTTCTTGTAACACTCCCTGGCCAAGGTCA
CATAAACCCTTCCCTCCAATTCGCCAAGCGCCTAATACACCTAGGCCTTC
GTGTCACCTTTGCAACTGCCGTATCCGCCATCCGCCGCATGAAACCAATG
TCCCCTCTTGAGGGTTTAACATACGTCGCTGCCTACTCCGACGGTTACGA
CGATGGGTTGAAACCCGGTGATGATATTGACCGTTACATATTAGAATCCA
GGCGTAAGGGTCTCGAGACTTTGAGCGAGTTCATTGGTGCTAGCATTGAG
GAAGGCATACGGTTCACTTGTATTGTGTATGGAATCATGATGCCTTGGGT
GGCATTGGTGGCGCGTGAATTTCACATCCCATCAACATTACTTTGGAATC
AACCCGCCAGCGTTTTTGTCACCTATTATTATTACTTCAAAGATTATGGT
GATATCATAAGGAAAACTGTTAAAGACCCTTCATCGATCGTCGAATTGCC
AGGATTGCCACCTCTTGCTAGCCGTGACATGCCCTCGTTTTTCCTCCCTG
CAAATGAATACGATTGTGCCTTACCATCATTGAAGCAACATGTAGAGATC
CTTGATGAAGAAACGAAGCCAAAAGTTCTGGTTAATACCTTTGATGCTTT
GGAACCTGAGGCCATAAAAGTGATTGATAAGTACAATTTGGTTGGTATTG
GGCCCTTGATACCGTCTGCTTTCTTGGATGGAAATGATCACTCCGATTCT
TCATTCGGAGGTGATCTTTTCAAGGGCACAAACGACTTTGTCCAGTGGCT
GGACTCAATGCCAAAAAGTTCAGTTATATATGTATCGTTTGGAAGCATTC
TTATGTTGACAAAACAACAAATGGAGGAAATTGCAAATGGGTTATTAGGC
ACTGGCTACCCTTTTTTGTGGGTGATAAGGGAAGGGGCTGGAGAGAAAGA
AGAGAAACTCAGTCGTATCGAAGAATTGAAAAAGCAAGGGATGATAGTGC
CATGGTGTTCGCAAGTCGAGGTTCTTTCTCATCCATCGGTAGGGTGTTTC
TTAACTCACTGTGGATGGAATTCGGCCTTGGAAAGCTTGGTTTCTGGGGT
GCCAATGGTGACGTTTCCGCAGTTGACAGATCAAGGTACTAATGCGAAGC
TTGTGGAAGATTTGTGGAAGACTGGGGTAAGGGTGACTAGGAATCCTGAA
GAACGAATTGTCGTCGAAGGGCACGAGATTAAAAGGTGCTTGGAATTGAT
AATGGAAGGTGGAGAGAAAGGGGAGGAATTGAGAAAGAATGGCAAGAAAT
GGAAGTATTTGGCAAGGGAAGCTGTCAAGGAAGATGGGTCTTCACTCAAG
AACCTTGAGGCGTTCCTACATGGGCTTGGAAAAAGCTACTAA
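
The FASTA header of the mRNA record above packs its attributes into a `key=value` list separated by `|`. A minimal Python sketch for splitting such a header, assuming this layout holds for other records as well; the helper name `parse_header` and the `seq_id` key are illustrative and not part of the database:

```python
def parse_header(header: str) -> dict:
    """Split a '>id key=value|key=value' FASTA header into a dict.

    Hypothetical helper: assumes the first space separates the sequence
    identifier from the '|'-delimited attribute list.
    """
    seq_id, _, attrs = header.lstrip(">").partition(" ")
    fields = {"seq_id": seq_id}
    for pair in attrs.split("|"):
        key, sep, value = pair.partition("=")
        if sep:  # ignore fragments that carry no '='
            fields[key] = value
    return fields


# Header copied from the mRNA record in this entry.
header = (">Tc04v2_t015660.4 ID=Tc04v2_t015660.4|Name=Tc04v2_t015660.4"
          "|organism=Theobroma cacao|type=mRNA|length=1392bp")
fields = parse_header(header)

# The length attribute carries a 'bp' suffix; strip it before converting.
raw_length = fields["length"]
length = int(raw_length[:-2]) if raw_length.endswith("bp") else None
print(fields["organism"], fields["type"], length)
```

Only the first space is treated as a separator, so attribute values that contain spaces, such as the organism name, stay intact.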

protein sequence of Tc04v2_p015660.4

>Tc04v2_p015660.4 ID=Tc04v2_p015660.4|Name=Tc04v2_p015660.4|organism=Theobroma cacao|type=polypeptide|length=464bp
MMAQPHVLLVTLPGQGHINPSLQFAKRLIHLGLRVTFATAVSAIRRMKPM
SPLEGLTYVAAYSDGYDDGLKPGDDIDRYILESRRKGLETLSEFIGASIE
EGIRFTCIVYGIMMPWVALVAREFHIPSTLLWNQPASVFVTYYYYFKDYG
DIIRKTVKDPSSIVELPGLPPLASRDMPSFFLPANEYDCALPSLKQHVEI
LDEETKPKVLVNTFDALEPEAIKVIDKYNLVGIGPLIPSAFLDGNDHSDS
SFGGDLFKGTNDFVQWLDSMPKSSVIYVSFGSILMLTKQQMEEIANGLLG
TGYPFLWVIREGAGEKEEKLSRIEELKKQGMIVPWCSQVEVLSHPSVGCF
LTHCGWNSALESLVSGVPMVTFPQLTDQGTNAKLVEDLWKTGVRVTRNPE
ERIVVEGHEIKRCLELIMEGGEKGEELRKNGKKWKYLAREAVKEDGSSLK
NLEAFLHGLGKSY*
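
The lengths in this record fit a direct translation of the mRNA: 1392 nt / 3 = 464 codons, that is, 463 residues plus the stop codon shown as `*`. A minimal Python sketch that checks the first and last codons, assuming the standard genetic code (NCBI translation table 1) and reading frame 1; the record does not state how the database derived the polypeptide, and `translate` is an illustrative helper:

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(nt: str) -> str:
    """Translate a DNA-alphabet sequence in frame 1; '*' marks a stop codon."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and final 42 nt of the mRNA sequence in this record.
first_30 = "ATGATGGCTCAACCTCACGTCCTTCTTGTA"
last_42 = "AACCTTGAGGCGTTCCTACATGGGCTTGGAAAAAGCTACTAA"

print(translate(first_30))  # compare with the start of the protein sequence
print(translate(last_42))   # compare with the last line of the protein sequence
print(1392 // 3)            # number of codons, stop codon included
```

The final 42 nt are in frame because the preceding 1350 nt make up exactly 450 codons.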