Tc02v2_t004530.1

Overview
Name             Tc02v2_t004530.1
Unique Name      Tc02v2_t004530.1
Type             mRNA
Organism         Theobroma cacao (cacao)
Sequence length  3390
Properties
Property Name   Value
Model evidence  Supporting evidence includes similarity to: 7 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 11 samples with support for all annotated introns
Product         probable ubiquitin-conjugating enzyme E2 23, transcript variant X4
Note            Probable ubiquitin-conjugating enzyme E2 23
Cross References
External references for this mRNA
Database  Accession
GeneID    18607524
Genbank   XM_007041733.2
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Species          Type
Tc02v2_g004530  Tc02v2_g004530  Theobroma cacao  gene


The following polypeptide feature(s) derive from this mRNA:

Feature Name      Unique Name       Species          Type
Tc02v2_p004530.1  Tc02v2_p004530.1  Theobroma cacao  polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name    Unique Name  Species          Type
exon-auto88560  auto88560    Theobroma cacao  exon
exon-auto88561  auto88561    Theobroma cacao  exon
exon-auto88562  auto88562    Theobroma cacao  exon
exon-auto88563  auto88563    Theobroma cacao  exon
exon-auto88564  auto88564    Theobroma cacao  exon
exon-auto88565  auto88565    Theobroma cacao  exon
exon-auto88566  auto88566    Theobroma cacao  exon
exon-auto88567  auto88567    Theobroma cacao  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name   Unique Name  Species          Type
CDS-auto88568  auto88568    Theobroma cacao  CDS
CDS-auto88569  auto88569    Theobroma cacao  CDS
CDS-auto88570  auto88570    Theobroma cacao  CDS
CDS-auto88571  auto88571    Theobroma cacao  CDS
CDS-auto88572  auto88572    Theobroma cacao  CDS
CDS-auto88573  auto88573    Theobroma cacao  CDS
CDS-auto88574  auto88574    Theobroma cacao  CDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc02v2_t004530.1 ID=Tc02v2_t004530.1|Name=Tc02v2_t004530.1|organism=Theobroma cacao|type=mRNA|length=3390bp
ATGGGGATGGAGCAACATTTATTAGCTTCTGAGACTAATGAATCCACCAC
AACAAGTTTGCATGGAAGTAACACTTTGAGTCAGGGTGGTTCTTCAACAA
ATGCGTCTGTCAGTGATCAAAATGTAAATTACACTAATGTTGGAGTTAGT
AAACAAAATGAGACTTTCTGTAACTTGCATAGTGTTCCTTATATTTATCG
ACAAGATGTTGTCAGGAGTAATACAAGTGGTGCAATTGGGATTGTGAGTG
AAGTTGCTGGTGACTCAGATTCAGATGGAAGCATAACTGATGATGAAGAT
GACGAAGACGAAGATGATGAGGAGGATGGTGAAGATGAAAGCGGTAATGG
TGATGCAAATAGCAATGCTAATGAGAGTGGTGATGGGAACAAAGGTGGTA
ATTATAAGTGCGGTGATCTTCAAGCCGATCAAATTCGGGTACTCTGGATG
GATGACACTGAGCCAGTTCAAAGTATTAAGAATGTAAGTGTTGTTGATCG
GGGTTTCTTACATGGGGATTATGTTGCTGCTGCTTTAGATTCAACTGGAC
AGGTGGGTGTTGTGGTGGACGTCAATGTCTCTGTTGATCTGTTAGCTCCT
GATGGATCTATTTTAAATGATGTCTCAACTAGAGACTTGCAACGTGTGAG
GGATTTCACTGTGGGCGATTATGTGGTCCTTGGTCCCTGGCTGGGTAGAA
TAGATGATGTTTTAGATAATGTCAACGTGTTGTTTGATGATGGCTCTGTA
TGCAAAGTTACAAGGGCTGAACCATTGCGTCTTAAACCAATTACTAGAAA
TACCCTTGAAGACGATAGTAATTTTCCATACTATCCTGGTCAGCGAGTAA
GAGCAAGCTCTTCATCTGTTTTCAAGAATTCTAGGTGGTTATCTGGCTTA
TGGAAGGCAAATCGGTTGGAAGGTACAGTCACTAAAGTTACAGCTGGAGC
TGTGTTTATTTATTGGATAGCATCTGCTGGCTATGGGCCTGATTCTTCCA
CTGCCCCTGCTGAAGAGCAGAATCCAAAGAATCTAAAACTGTTGTCTTGT
TTTGCGCATGCAAATTGGCAAGTGGGTGATTGGTGTCTTCTTCCAACTTC
ATCGCAATGCATTCCTTTGGACAAGGGTTTGTCCAAACTGCAGCTTAATG
GTTCCATAAAAAATAGGGGAAATTGTGATAAGTTGGATAGTGAATGGGAT
TCCAAAGAGGTTATTCTGTATGAATCAAATGATAATAGTGAATCCATGGA
TCTTGATGCAACACCTACACCTGATGAAAACAATGCAACTATTGAAACTA
AAGACAATGGAGCTATTGGAACTAAAGCCTCACCTGAATCTAGCTCTTGT
AGTAGTTCATTATCAGTTTCAAAGGAGACTGTCCATGAACATTGGCCACA
TCACCGCAAGAAGATCCGGAAAGTTGTGATTAGGAAAGACAAGAAAGCAA
AAAAGAAAGTGGAGAATTTTGAAAGGGCACTTTTGATTGTCAATAGCAGA
ACAAGAGTTGATGTTGCATGGCAGGATGGAACAATCGAACGTGGAGTGGA
TGCAACGACATTGATCCCAATTGAAACTCCCGGTGATCATGAATTTGTTG
CAGAGCAGTATGTGGTGGAGAAGGCCTCTGATGATAGTGATGATGTATAT
GAACCCAGGCGTGTTGGGGTTGTCAAAAGTGTTAATGCAAAGGAGCGGAC
AGCTTGTATAAGGTGGATAAAGCCAGTTGCCAGGGCAGAGGACCCTCGAG
AGTTTGACAAGGAAGAAATTGTAAGTGTGTATGAGCTGGAAGGACATCCA
GATTATGATTATTGTTATGGTGATGTAGTAGTTCGATTATCCCCGGCTTC
TGTTCCCATGCAATCTGCTTCTGGTGAAGGCTTCATTGAGGAACCAAAGC
AGGAAGATGGATCAAAGGAGATAAAACGAGACTTGAAAAAGTGCTCAGGA
AGTAACAAAGTAGAAGGTGAATCACCAAATGAAGCTTCCATGGACTTCAC
GGATCTCTCTTGGGTTGGGAACATAACTGGCCTGAGAAATGGTGATATTG
AGGTTACATGGGCTGATGGGATGGTTTCAACGGTTGGACCTCAAGCAATT
TATGTTGTTGGCCGAGATGATGATGAGTCAATTGCTGCTGGGAGTGAAGA
TCTGGAACCACAAAATGCCAGCAGTATTATTTCGGACGTTGAAGAGGGTA
TGGAGAATAATTCTGGAAGGAATGCAGCATTATCACTCCCCTTAGCTGCA
TTTGATTTTGTCACCAGACTGGCCAGTGGATTTTTTTCAGGAAGACGAAA
AAATATTGATCCAATTGATTTGGATTCCAAAGGAGAAAATGAACTTCAGC
CTGAGGGAAGAGATTTCAGCCATGAGTCTAGCTCTCAAAAGTCTAATGTT
CTTGATAATTTCAGTGGGGAAAGCGTTAATGAGAAAGGAGAGGAACATGT
TGATGAAAAGGCCCACGAACTTTCACTTCCATCAGATGTTTTATGCAATG
TGAGGATTGAAGACTCAGATTCTAAAACAGGTGATGAGGATGATACTTGC
AGTTTCAAGCGGTTTGATACAGCTAAAGATCCTCTAGATCATTATTTTCT
TGGTGCAAATGGACAGAATAGTACTGGAAGAAAGTGGCTAAAGAAGGTGC
AGCAAGATTGGAACATCCTTCAGAACAACCTGCCAGATGGAATCTATGTA
CGGGTATATGAAGATCGGATGGACCTCTTGAGGGCTGTAATAGTTGGGGC
ATATGGGACACCTTATCAAGATGGTCTCTTCTTCTTTGATTTCCACCTTC
CTCCTGAGTATCCAGATGTGCCACCGTCAGCATACTATCATTCTGGCGGT
TGGAGAATAAATCCTAATTTGTATGAGGAAGGTAAGGTGTGCCTTAGCCT
TCTAAATACATGGACTGGCAGGGGAAACGAAGTTTGGGATTCATTGTCCT
CTAGCATCCTTCAAGTCCTAGTTTCACTGCAGGGGTTAGTGCTAAATTCT
AGGCCATATTTCAATGAAGCTGGGTATGATAAGCAGGTTGGAACAGCTGA
AGGAGAGAAAAATTCATTAGCATACAATGAGAATACTTTCTTACTGAACT
GCAAGTCAATGATGTATCTCATGCGGAAGCCCCCAAAGGACTTTGAAGAA
CTTGTCAGAGACCATTTCAGGAGACGTGGTTTTTACATCCTTAAAGCATG
TGATGCATACATGAAAGGCTACTTAATTGGCTCTCTAACTAAAGATGCCT
CTTATAGTGATGCAAACAATGCAAACTCCACTTCAGTTGGTTTCAAGCTG
ATGTTAGGCAAGATTGTACCTAAGCTTTTATTGGCACTTAATGAAGTTGG
AGCTGATTGTCAGGAATTTAAGCATTTCCAGCAATCATAG
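
The FASTA header of the mRNA record above packs its metadata into `key=value` pairs separated by `|`. As a minimal sketch, assuming every record on this site follows that same layout, the header can be split into a dictionary as shown below. The function name `parse_fasta_header` and the `seq_id` key are illustrative choices, not part of the database.

```python
def parse_fasta_header(header: str) -> dict:
    """Split a '>id key=value|key=value' FASTA header into a dict."""
    if not header.startswith(">"):
        raise ValueError("FASTA header must start with '>'")
    # The sequence ID is everything up to the first space; the
    # attributes follow it (values such as the organism may contain spaces).
    seq_id, _, attr_text = header[1:].strip().partition(" ")
    fields = {"seq_id": seq_id}
    for pair in attr_text.split("|"):
        if "=" in pair:
            key, value = pair.split("=", 1)
            fields[key] = value
    return fields


# Header copied from the mRNA record above.
header = (">Tc02v2_t004530.1 ID=Tc02v2_t004530.1|Name=Tc02v2_t004530.1|"
          "organism=Theobroma cacao|type=mRNA|length=3390bp")
record = parse_fasta_header(header)
print(record["type"], record["length"])  # mRNA 3390bp
```

The `length` value is kept as the raw string from the header, unit suffix included, so nothing is reinterpreted.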

protein sequence of Tc02v2_p004530.1

>Tc02v2_p004530.1 ID=Tc02v2_p004530.1|Name=Tc02v2_p004530.1|organism=Theobroma cacao|type=polypeptide|length=1130bp
MGMEQHLLASETNESTTTSLHGSNTLSQGGSSTNASVSDQNVNYTNVGVS
KQNETFCNLHSVPYIYRQDVVRSNTSGAIGIVSEVAGDSDSDGSITDDED
DEDEDDEEDGEDESGNGDANSNANESGDGNKGGNYKCGDLQADQIRVLWM
DDTEPVQSIKNVSVVDRGFLHGDYVAAALDSTGQVGVVVDVNVSVDLLAP
DGSILNDVSTRDLQRVRDFTVGDYVVLGPWLGRIDDVLDNVNVLFDDGSV
CKVTRAEPLRLKPITRNTLEDDSNFPYYPGQRVRASSSSVFKNSRWLSGL
WKANRLEGTVTKVTAGAVFIYWIASAGYGPDSSTAPAEEQNPKNLKLLSC
FAHANWQVGDWCLLPTSSQCIPLDKGLSKLQLNGSIKNRGNCDKLDSEWD
SKEVILYESNDNSESMDLDATPTPDENNATIETKDNGAIGTKASPESSSC
SSSLSVSKETVHEHWPHHRKKIRKVVIRKDKKAKKKVENFERALLIVNSR
TRVDVAWQDGTIERGVDATTLIPIETPGDHEFVAEQYVVEKASDDSDDVY
EPRRVGVVKSVNAKERTACIRWIKPVARAEDPREFDKEEIVSVYELEGHP
DYDYCYGDVVVRLSPASVPMQSASGEGFIEEPKQEDGSKEIKRDLKKCSG
SNKVEGESPNEASMDFTDLSWVGNITGLRNGDIEVTWADGMVSTVGPQAI
YVVGRDDDESIAAGSEDLEPQNASSIISDVEEGMENNSGRNAALSLPLAA
FDFVTRLASGFFSGRRKNIDPIDLDSKGENELQPEGRDFSHESSSQKSNV
LDNFSGESVNEKGEEHVDEKAHELSLPSDVLCNVRIEDSDSKTGDEDDTC
SFKRFDTAKDPLDHYFLGANGQNSTGRKWLKKVQQDWNILQNNLPDGIYV
RVYEDRMDLLRAVIVGAYGTPYQDGLFFFDFHLPPEYPDVPPSAYYHSGG
WRINPNLYEEGKVCLSLLNTWTGRGNEVWDSLSSSILQVLVSLQGLVLNS
RPYFNEAGYDKQVGTAEGEKNSLAYNENTFLLNCKSMMYLMRKPPKDFEE
LVRDHFRRRGFYILKACDAYMKGYLIGSLTKDASYSDANNANSTSVGFKL
MLGKIVPKLLLALNEVGADCQEFKHFQQS*
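
The mRNA and polypeptide sequences above can be cross-checked by translation. The sketch below assumes the standard nuclear genetic code and assumes the listed mRNA is read in frame from its first base (it starts with ATG and ends with TAG). The names `CODON_TABLE` and `translate` are illustrative. Only the first 30 and last 39 nucleotides of the mRNA are used here. The listed length of 3390 nt corresponds to 1130 codons, which appears to match the `length=1130` in the polypeptide header if the trailing `*` (stop) is counted.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(mrna: str) -> str:
    """Translate an in-frame nucleotide sequence; '*' marks a stop codon."""
    seq = mrna.upper().replace("U", "T")
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 nt and last 39 nt of the mRNA sequence listed above.
first_codons = "ATGGGGATGGAGCAACATTTATTAGCTTCT"
last_codons = "GCTGATTGTCAGGAATTTAAGCATTTCCAGCAATCATAG"

print(translate(first_codons))  # MGMEQHLLAS
print(translate(last_codons))   # ADCQEFKHFQQS*
print(3390 // 3)                # 1130
```

Both fragments agree with the start and end of the Tc02v2_p004530.1 sequence shown above.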