Tc02v2_t004530.4

Overview
Name: Tc02v2_t004530.4
Unique Name: Tc02v2_t004530.4
Type: mRNA
Organism: Theobroma cacao (cacao)
Sequence length: 3462
Properties
Property Name    Value
Model evidence   Supporting evidence includes similarity to: 1 EST, 12 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 16 samples with support for all annotated introns
Product          probable ubiquitin-conjugating enzyme E2 23, transcript variant X1
Note             Probable ubiquitin-conjugating enzyme E2 23
Cross References
External references for this mRNA
Database    Accession
GeneID      18607524
Genbank     XM_018116064.1
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name      Unique Name       Species            Type
Tc02v2_g004530    Tc02v2_g004530    Theobroma cacao    gene


The following polypeptide feature(s) derive from this mRNA:

Feature Name        Unique Name         Species            Type
Tc02v2_p004530.4    Tc02v2_p004530.4    Theobroma cacao    polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name      Unique Name    Species            Type
exon-auto88597    auto88597      Theobroma cacao    exon
exon-auto88598    auto88598      Theobroma cacao    exon
exon-auto88599    auto88599      Theobroma cacao    exon
exon-auto88600    auto88600      Theobroma cacao    exon
exon-auto88601    auto88601      Theobroma cacao    exon
exon-auto88602    auto88602      Theobroma cacao    exon
exon-auto88603    auto88603      Theobroma cacao    exon
exon-auto88604    auto88604      Theobroma cacao    exon


The following CDS feature(s) are a part of this mRNA:

Feature Name     Unique Name    Species            Type
CDS-auto88605    auto88605      Theobroma cacao    CDS
CDS-auto88606    auto88606      Theobroma cacao    CDS
CDS-auto88607    auto88607      Theobroma cacao    CDS
CDS-auto88608    auto88608      Theobroma cacao    CDS
CDS-auto88609    auto88609      Theobroma cacao    CDS
CDS-auto88610    auto88610      Theobroma cacao    CDS
CDS-auto88611    auto88611      Theobroma cacao    CDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc02v2_t004530.4 ID=Tc02v2_t004530.4|Name=Tc02v2_t004530.4|organism=Theobroma cacao|type=mRNA|length=3462bp
ATGGGGATGGAGCAACATTTATTAGCTTCTGAGACTAATGAATCCACCAC
AACAAGTTTGCATGGAAGTAACACTTTGAGTCAGGGTGGTTCTTCAACAA
ATGCGTCTGTCAGTGATCAAAATGTAAATTACACTAATGTTGGAGTTAGT
AAACAAAATGAGACTTTCTGTAACTTGCATAGTGTTCCTTATATTTATCG
ACAAGATGTTGTCAGGAGTAATACAAGTGGTGCAATTGGGATTGTGAGTG
AAGTTGCTGGTGACTCAGATTCAGATGGAAGCATAACTGATGATGAAGAT
GACGAAGACGAAGATGATGAGGAGGATGGTGAAGATGAAAGCGGTAATGG
TGATGCAAATAGCAATGCTAATGAGAGTGGTGATGGGAACAAAGGTGGTA
ATTATAAGTGCGGTGATCTTCAAGCCGATCAAATTCGGGTACTCTGGATG
GATGACACTGAGCCAGTTCAAAGTATTAAGAATGTAAGTGTTGTTGATCG
GGGTTTCTTACATGGGGATTATGTTGCTGCTGCTTTAGATTCAACTGGAC
AGGTGGGTGTTGTGGTGGACGTCAATGTCTCTGTTGATCTGTTAGCTCCT
GATGGATCTATTTTAAATGATGTCTCAACTAGAGACTTGCAACGTGTGAG
GGATTTCACTGTGGGCGATTATGTGGTCCTTGGTCCCTGGCTGGGTAGAA
TAGATGATGTTTTAGATAATGTCAACGTGTTGTTTGATGATGGCTCTGTA
TGCAAAGTTACAAGGGCTGAACCATTGCGTCTTAAACCAATTACTAGAAA
TACCCTTGAAGACGATAGTAATTTTCCATACTATCCTGGTCAGCGAGTAA
GAGCAAGCTCTTCATCTGTTTTCAAGAATTCTAGGTGGTTATCTGGCTTA
TGGAAGGCAAATCGGTTGGAAGGTACAGTCACTAAAGTTACAGCTGGAGC
TGTGTTTATTTATTGGATAGCATCTGCTGGCTATGGGCCTGATTCTTCCA
CTGCCCCTGCTGAAGAGCAGAATCCAAAGAATCTAAAACTGTTGTCTTGT
TTTGCGCATGCAAATTGGCAAGTGGGTGATTGGTGTCTTCTTCCAACTTC
ATCGCAATGCATTCCTTTGGACAAGGGTTTGTCCAAACTGCAGCTTAATG
GTTCCATAAAAAATAGGGGAAATTGTGATAAGTTGGATAGTGAATGGGAT
TCCAAAGAGGTTATTCTGTATGAATCAAATGATAATAGTGAATCCATGGA
TCTTGATGCAACACCTACACCTGATGAAAACAATGCAACTATTGAAACTA
AAGACAATGGAGCTATTGGAACTAAAGCCTCACCTGAATCTAGCTCTTGT
AGTAGTTCATTATCAGTTTCAAAGGAGACTGTCCATGAACATTGGCCACA
TCACCGCAAGAAGATCCGGAAAGTTGTGATTAGGAAAGACAAGAAAGCAA
AAAAGAAAGTGGAGAATTTTGAAAGGGCACTTTTGATTGTCAATAGCAGA
ACAAGAGTTGATGTTGCATGGCAGGATGGAACAATCGAACGTGGAGTGGA
TGCAACGACATTGATCCCAATTGAAACTCCCGGTGATCATGAATTTGTTG
CAGAGCAGTATGTGGTGGAGAAGGCCTCTGATGATAGTGATGATGTATAT
GAACCCAGGCGTGTTGGGGTTGTCAAAAGTGTTAATGCAAAGGAGCGGAC
AGCTTGTATAAGGTGGATAAAGCCAGTTGCCAGGGCAGAGGACCCTCGAG
AGTTTGACAAGGAAGAAATTGTAAGTGTGTATGAGCTGGAAGGACATCCA
GATTATGATTATTGTTATGGTGATGTAGTAGTTCGATTATCCCCGGCTTC
TGTTCCCATGCAATCTGCTTCTGGTGAAGGCTTCATTGAGGAACCAAAGC
AGGAAGATGGATCAAAGGAGATAAAACGAGACTTGAAAAAGTGCTCAGGA
AGTAACAAAGTAGAAGGTGAATCACCAAATGAAGCTTCCATGGACTTCAC
GGATCTCTCTTGGGTTGGGAACATAACTGGCCTGAGAAATGGTGATATTG
AGGTTACATGGGCTGATGGGATGGTTTCAACGGTTGGACCTCAAGCAATT
TATGTTGTTGGCCGAGATGATGATGAGTCAATTGCTGCTGGGAGTGAAGT
AAGTGATGATGCTGCTAGTTGGGAAACGGTTAATGATGATGAGATGGATG
CTCTTGAGAATGCTCAAGAGGATCTGGAACCACAAAATGCCAGCAGTATT
ATTTCGGACGTTGAAGAGGGTATGGAGAATAATTCTGGAAGGAATGCAGC
ATTATCACTCCCCTTAGCTGCATTTGATTTTGTCACCAGACTGGCCAGTG
GATTTTTTTCAGGAAGACGAAAAAATATTGATCCAATTGATTTGGATTCC
AAAGGAGAAAATGAACTTCAGCCTGAGGGAAGAGATTTCAGCCATGAGTC
TAGCTCTCAAAAGTCTAATGTTCTTGATAATTTCAGTGGGGAAAGCGTTA
ATGAGAAAGGAGAGGAACATGTTGATGAAAAGGCCCACGAACTTTCACTT
CCATCAGATGTTTTATGCAATGTGAGGATTGAAGACTCAGATTCTAAAAC
AGGTGATGAGGATGATACTTGCAGTTTCAAGCGGTTTGATACAGCTAAAG
ATCCTCTAGATCATTATTTTCTTGGTGCAAATGGACAGAATAGTACTGGA
AGAAAGTGGCTAAAGAAGGTGCAGCAAGATTGGAACATCCTTCAGAACAA
CCTGCCAGATGGAATCTATGTACGGGTATATGAAGATCGGATGGACCTCT
TGAGGGCTGTAATAGTTGGGGCATATGGGACACCTTATCAAGATGGTCTC
TTCTTCTTTGATTTCCACCTTCCTCCTGAGTATCCAGATGTGCCACCGTC
AGCATACTATCATTCTGGCGGTTGGAGAATAAATCCTAATTTGTATGAGG
AAGGTAAGGTGTGCCTTAGCCTTCTAAATACATGGACTGGCAGGGGAAAC
GAAGTTTGGGATTCATTGTCCTCTAGCATCCTTCAAGTCCTAGTTTCACT
GCAGGGGTTAGTGCTAAATTCTAGGCCATATTTCAATGAAGCTGGGTATG
ATAAGCAGGTTGGAACAGCTGAAGGAGAGAAAAATTCATTAGCATACAAT
GAGAATACTTTCTTACTGAACTGCAAGTCAATGATGTATCTCATGCGGAA
GCCCCCAAAGGACTTTGAAGAACTTGTCAGAGACCATTTCAGGAGACGTG
GTTTTTACATCCTTAAAGCATGTGATGCATACATGAAAGGCTACTTAATT
GGCTCTCTAACTAAAGATGCCTCTTATAGTGATGCAAACAATGCAAACTC
CACTTCAGTTGGTTTCAAGCTGATGTTAGGCAAGATTGTACCTAAGCTTT
TATTGGCACTTAATGAAGTTGGAGCTGATTGTCAGGAATTTAAGCATTTC
CAGCAATCATAG

protein sequence of Tc02v2_p004530.4

>Tc02v2_p004530.4 ID=Tc02v2_p004530.4|Name=Tc02v2_p004530.4|organism=Theobroma cacao|type=polypeptide|length=1154bp
MGMEQHLLASETNESTTTSLHGSNTLSQGGSSTNASVSDQNVNYTNVGVS
KQNETFCNLHSVPYIYRQDVVRSNTSGAIGIVSEVAGDSDSDGSITDDED
DEDEDDEEDGEDESGNGDANSNANESGDGNKGGNYKCGDLQADQIRVLWM
DDTEPVQSIKNVSVVDRGFLHGDYVAAALDSTGQVGVVVDVNVSVDLLAP
DGSILNDVSTRDLQRVRDFTVGDYVVLGPWLGRIDDVLDNVNVLFDDGSV
CKVTRAEPLRLKPITRNTLEDDSNFPYYPGQRVRASSSSVFKNSRWLSGL
WKANRLEGTVTKVTAGAVFIYWIASAGYGPDSSTAPAEEQNPKNLKLLSC
FAHANWQVGDWCLLPTSSQCIPLDKGLSKLQLNGSIKNRGNCDKLDSEWD
SKEVILYESNDNSESMDLDATPTPDENNATIETKDNGAIGTKASPESSSC
SSSLSVSKETVHEHWPHHRKKIRKVVIRKDKKAKKKVENFERALLIVNSR
TRVDVAWQDGTIERGVDATTLIPIETPGDHEFVAEQYVVEKASDDSDDVY
EPRRVGVVKSVNAKERTACIRWIKPVARAEDPREFDKEEIVSVYELEGHP
DYDYCYGDVVVRLSPASVPMQSASGEGFIEEPKQEDGSKEIKRDLKKCSG
SNKVEGESPNEASMDFTDLSWVGNITGLRNGDIEVTWADGMVSTVGPQAI
YVVGRDDDESIAAGSEVSDDAASWETVNDDEMDALENAQEDLEPQNASSI
ISDVEEGMENNSGRNAALSLPLAAFDFVTRLASGFFSGRRKNIDPIDLDS
KGENELQPEGRDFSHESSSQKSNVLDNFSGESVNEKGEEHVDEKAHELSL
PSDVLCNVRIEDSDSKTGDEDDTCSFKRFDTAKDPLDHYFLGANGQNSTG
RKWLKKVQQDWNILQNNLPDGIYVRVYEDRMDLLRAVIVGAYGTPYQDGL
FFFDFHLPPEYPDVPPSAYYHSGGWRINPNLYEEGKVCLSLLNTWTGRGN
EVWDSLSSSILQVLVSLQGLVLNSRPYFNEAGYDKQVGTAEGEKNSLAYN
ENTFLLNCKSMMYLMRKPPKDFEELVRDHFRRRGFYILKACDAYMKGYLI
GSLTKDASYSDANNANSTSVGFKLMLGKIVPKLLLALNEVGADCQEFKHF
QQS*
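
The two FASTA headers on this page carry their metadata as `key=value` pairs separated by `|`, and their declared lengths can be cross-checked against each other. The following is a minimal Python sketch of that check, not part of the database record. The helper names `parse_fasta_header` and `declared_length` are hypothetical. It also assumes that the listed mRNA sequence is coding sequence only (it begins with ATG and ends with TAG) and that the polypeptide length, although labelled `bp` in the header, counts sequence symbols including the trailing `*` stop marker.

```python
def parse_fasta_header(header: str) -> dict:
    """Split a header such as '>ID key=value|key=value' into a dict."""
    # Everything before the first space is the record identifier.
    record_id, _, attrs = header.lstrip(">").partition(" ")
    fields = {"record_id": record_id}
    for pair in attrs.split("|"):
        key, sep, value = pair.partition("=")
        if sep:  # skip fragments that are not key=value pairs
            fields[key] = value
    return fields


def declared_length(fields: dict) -> int:
    """Return the numeric part of a 'length' field such as '3462bp'."""
    digits = "".join(ch for ch in fields["length"] if ch.isdigit())
    return int(digits)


# Header lines copied from the two sequence blocks on this page.
mrna_header = (">Tc02v2_t004530.4 ID=Tc02v2_t004530.4|Name=Tc02v2_t004530.4"
               "|organism=Theobroma cacao|type=mRNA|length=3462bp")
protein_header = (">Tc02v2_p004530.4 ID=Tc02v2_p004530.4|Name=Tc02v2_p004530.4"
                  "|organism=Theobroma cacao|type=polypeptide|length=1154bp")

mrna = parse_fasta_header(mrna_header)
protein = parse_fasta_header(protein_header)

mrna_len = declared_length(mrna)
protein_len = declared_length(protein)

# A coding sequence should divide evenly into three-base codons.
codons, remainder = divmod(mrna_len, 3)

print(mrna["type"], mrna_len, "->", codons, "codons, remainder", remainder)
print(protein["type"], protein_len, "symbols including the stop marker")
```

Under those assumptions the 3462 bases split into 1154 codons with no remainder, which matches the declared polypeptide length of 1154 when the final `*` is counted, that is, 1153 amino-acid residues followed by a stop.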