Tc09v2_t029780.1

Overview
Name             Tc09v2_t029780.1
Unique Name      Tc09v2_t029780.1
Type             mRNA
Organism         Theobroma cacao (cacao)
Sequence length  3375 bp
Properties
Property Name   Value
Model evidence  Supporting evidence includes similarity to: 1 mRNA, 16 ESTs, 27 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 25 samples with support for all annotated introns
Comment         The sequence of the model RefSeq transcript was modified relative to this genomic sequence to represent the inferred CDS: inserted 1 base in 1 codon
Product         ubiquitin-activating enzyme E1 1
Note            Ubiquitin-activating enzyme E1 1
Cross References
External references for this mRNA
Database  Accession
GeneID    18591005
Genbank   XM_018127673.1
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Species          Type
Tc09v2_g029780  Tc09v2_g029780  Theobroma cacao  gene


The following polypeptide feature(s) derive from this mRNA:

Feature Name      Unique Name       Species          Type
Tc09v2_p029780.1  Tc09v2_p029780.1  Theobroma cacao  polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name     Unique Name  Species          Type
exon-auto483743  auto483743   Theobroma cacao  exon
exon-auto483744  auto483744   Theobroma cacao  exon
exon-auto483745  auto483745   Theobroma cacao  exon
exon-auto483746  auto483746   Theobroma cacao  exon
exon-auto483747  auto483747   Theobroma cacao  exon
exon-auto483748  auto483748   Theobroma cacao  exon
exon-auto483749  auto483749   Theobroma cacao  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name    Unique Name  Species          Type
CDS-auto483750  auto483750   Theobroma cacao  CDS
CDS-auto483751  auto483751   Theobroma cacao  CDS
CDS-auto483752  auto483752   Theobroma cacao  CDS
CDS-auto483753  auto483753   Theobroma cacao  CDS
CDS-auto483754  auto483754   Theobroma cacao  CDS
CDS-auto483755  auto483755   Theobroma cacao  CDS
CDS-auto483756  auto483756   Theobroma cacao  CDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc09v2_t029780.1 ID=Tc09v2_t029780.1|Name=Tc09v2_t029780.1|organism=Theobroma cacao|type=mRNA|length=3375bp
ATGTTTGGTATGGATGATAATAAGAAATTGTGTTCATTTGTGGTATTGAC
GGTAATTTTTGCGGGGTTTCGTGTTTTTGGCAGTTTACTGCACTATATGC
TTCCTAGAAAGAGAGCAGGTGAAGGAGAGGTTGTAGAAGGAGAGAGTGAA
AACAACAATAACAGCAACAACATAAAAGACGTAGCTGTCACGTCGCCGAT
CAAGAAGCATCGCTTCTCTGCCGCGGCAGCCGCCGATTTGACGGCTAATA
ACAACACTGTAGCCATAGGGAACAACAGCAGTAACCACAGTAGTGGTAGC
GTGCTCGAGCCGACGATCATGGCTCCGGGCGACGCTAACCACAATGATAT
TGATGAGGATCTGCACAGCCGGCAGCTCGCTGTGTATGGCCGTGAGACGA
TGAGGCTTCTTTTTGCCTCCAATATCCTTATCTCGGGGATGAATGGTCTC
GGTGCTGAAATTGCAAAGAATCTCATTCTTGCTGGTGTCAAGTCTGTGAC
CTTGCATGATGAAGGAGTGGTGGAGTTGTGGGATTTGTCCAGTAATTTTG
TTTTCTCTGAGAATGATGTTGGTAAGAACAGAGCACTTGCTTCTGTTCAG
AAGTTGCAGGAGCTCAACAATGCTGTTGTCATTTCCACCTTGACAACAAA
GTTGGCCAAACAACAACTTTCTCATTTCCAGGCTGTTGTATTCACTGATA
TAAGTCTTGAGAAAGCCTTTGAGTTTGATGACTACTGCCATAATCATCGG
CCTCCCATTTCCTTCATCAAGACTGAAGTAAGAGGCCTTTTTGGTTCTGT
CTTCTGTGACTTTGGTCCTGAGTTTACTGTTTTTGATGTTGATGGTGAGG
ATCCACATACGGGTATAATAGCATCCATCAGCAATGACAACCCTGCCCTA
GTATCATGTGTCGATGATGAAAGGCTTGAGTTTCAGGATGGGGATCTTGT
TGTGTTCTCTGAAGTTCATGGAATGACAGAGCTCAATGATGGAAAGCCGA
GGAAGATTAAAAGTGCAAGGCCGTACTCATTTACACTTGAGGAGGACACC
ACTAATTTTGGTACGTATTTCAAAGGTGGCATTGTCACACAAGTGAAACA
GCCCAAGGTGTTGAATTTCAAGCCATTGAGAGAAGCTCTTAAAGATCCTG
GTGATTTTCTTCTGAGTGATTTCTCAAAGTTTGACCATCCACCTATCCTA
CACATAGCATTCCAAGCATTGGATAAGTTTGTTTCTGAGTTAGGCCGCTT
CCCTGTGGCTGGATCAGAAGAAGATGCTCAGAAGCTCACATCTATTGCTG
CTAACGTCAATGAGTGCCTTGGAGAGGGAAAAATTGAAGATATTAACCCA
AAACTTCTGAGGCACTTTTCCTTTGGTTCCAGGGCAGTATTGAATCCCAT
GGCTGCCATGTTTGGAGGAATTGTGGGACAAGAGGTTGTCAAGGCATGTT
CTGGAAAATTTCACCCTCTTTTTCAGTTCTTCTATTTTGACTCAGTGGAG
TCCCTTCCTGCTGAACCGTTGGACCCCAGTGATTTTAAACCATTGAATAG
CCGATATGATGCACAAATATCGGTATTTGGCTCCAAACTTCAGAAGAAGC
TGGAGGATTCAAAAGTGTTTATAGTTGGATCTGGGGCCTTAGGCTGTGAG
TTCCTGAAAAATGTAGCATTGATGGGTGTTTCATGTGGCAGTCAAGGCAA
GCTAACTATCACTGATGATGATGTAATTGAGAAGAGCAACCTCAGCAGGC
AGTTTTTGTTCCGTGATTGGAACATTGGGCAGGCTAAATCAACTGTTGCA
GCTTCTGCTGCTGCATCTATAAATCCTCAGCTCAAGATTGAAGCTTTGCA
AAATCGTGTGGGTCCTGAAACTGAGAATGTGTTTAATGACACCTTCTGGG
AGAACCTAACAGTGGTCATTAATGCATTAGATAATGTCAATGCTAGGCTG
TATGTTGATCAGAGGTGCTTGTATTTCCAGAAACCACTTCTTGAATCAGG
AACTCTTGGTGCTAAATGCAACACCCAGATGGTGATTCCTCATCTAACTG
AGAACTATGGTGCTTCGAGAGACCCACCTGAGAAACAAGCACCCATGTGC
ACTGTGCATTCATTTCCACACAATATTGATCACTGCTTGACATGGGCTCG
ATCTGAGTTTGAGGGCTTGCTCGAGAAAACTCCTGCTGAAGTGAACGCCT
ATTTGTCCAACCCAGTTGAATATGCCGCTTCAATGAGAGATGCTGGTGAT
GCTCAGGCTAAGGATAACTTAGAGCGCATCTTGGAGTGCCTTGACCGTGA
AAAATGTGAGACATTCCAGGATTGTGTGGCATGGGCTCGCCTAAGATTTG
AGGACTATTTTGTTAATCGGGTGAAGCAGTTAATATATACATTCCCTGAA
GATGCTGCAACCAGTACTGGGGCTCCCTTCTGGTCTGCTCCAAAGCGATT
CCCGCATCCACTTCAGTTTTCATCTACTGATCCTAGCCATCTCCACTTTA
TTATGGCAGCATCTATACTTAGAGCTGAGACATTCGGTATCGCAGTCCCT
GACCAGGTCAAGAATCCGAAGATGTTGGCTGAGGCAATCGAGAATGTTAT
AGTCCCAGATTTTCAGCCAAAGGAAGGTGTTAAAATTAACACAGATGAGA
AGGATACTAGTCTCTCCACTGCCTCCGTGAATGATGAAGCCATGATTAAT
GAATTATTTTACAAGTTAGAGCTTTGCAAGAACAATCTGCCATCAGGATT
CAGGTTGAAACCAATTCAATTTGAAAAGGATGATGATACAAACTATCACA
TGGATCTTATTGCTGCGCTTGCCAACATGAGGGCAAGGAACTATAGCATT
CCTGAGGTGGATAAGCTTAAAGCCAAGTTTATAGCTGGAAGAATCATACC
AGCAATTGCCACTTCCACGGCTATGGCTACAGGCCTTGTCTGCCTTGAGC
TATATAAGGTTCTAGATGGAGCACATAAAGTGGAGGACTATCGAAATACA
TTTGCAAACTTAGCACTGCCTTTGTTCTCCATGGCTGAGCCGGTGCCCCC
CAAGGTCATGAAGCACCGGGAGATGAGCTGGACTGTATGGGACAGGTGGA
TCTTGAGAGACAATCCCACTCTGAGGGAACTCATCCAGTGGCTCAAAGAT
AAGGGGTTGAATGCTTACAGCATATCTTACGGAAGTTGCCTGCTCTTCAA
CAGCATGTTTCCCAAGCACAAAGAGCGACTGGACAAGAAGGTGGTGGATG
TGGCTCGAGAAGTTGCCAAGGCAGAATTGCCTCCCTACCGATCCCACTTG
GATGTGGTGGTGGCATGCGAGGACGATGAAGACAATGATATTGACATTCC
TCAAATTTCCATCTACTACCGTTGA
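
The FASTA header above packs its metadata into pipe-separated key=value pairs after the record identifier. A minimal sketch of how such a header could be split into fields is shown here; the function names parse_fasta_header and declared_length are hypothetical helpers written for illustration, not part of any database tooling, and the sketch assumes that field values never contain a "|" character.

```python
def parse_fasta_header(header: str) -> dict:
    """Split a '>id key=value|key=value' header into a dictionary.

    Hypothetical helper: assumes the record id is followed by one space
    and that fields are separated by '|' with no '|' inside values.
    """
    record_id, _, rest = header.lstrip(">").partition(" ")
    fields = {"record_id": record_id}
    for pair in rest.split("|"):
        key, sep, value = pair.partition("=")
        if sep:  # skip fragments that are not key=value pairs
            fields[key.strip()] = value.strip()
    return fields


def declared_length(fields: dict):
    """Return the numeric part of the 'length' field, or None if absent."""
    digits = "".join(ch for ch in fields.get("length", "") if ch.isdigit())
    return int(digits) if digits else None


header = (
    ">Tc09v2_t029780.1 ID=Tc09v2_t029780.1|Name=Tc09v2_t029780.1"
    "|organism=Theobroma cacao|type=mRNA|length=3375bp"
)
fields = parse_fasta_header(header)
mrna_length = declared_length(fields)
```

The parsed length can then be compared with the number of bases actually read from the sequence lines, which is a cheap check that a record was copied without truncation.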

protein sequence of Tc09v2_p029780.1

>Tc09v2_p029780.1 ID=Tc09v2_p029780.1|Name=Tc09v2_p029780.1|organism=Theobroma cacao|type=polypeptide|length=1125bp
MFGMDDNKKLCSFVVLTVIFAGFRVFGSLLHYMLPRKRAGEGEVVEGESE
NNNNSNNIKDVAVTSPIKKHRFSAAAAADLTANNNTVAIGNNSSNHSSGS
VLEPTIMAPGDANHNDIDEDLHSRQLAVYGRETMRLLFASNILISGMNGL
GAEIAKNLILAGVKSVTLHDEGVVELWDLSSNFVFSENDVGKNRALASVQ
KLQELNNAVVISTLTTKLAKQQLSHFQAVVFTDISLEKAFEFDDYCHNHR
PPISFIKTEVRGLFGSVFCDFGPEFTVFDVDGEDPHTGIIASISNDNPAL
VSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRKIKSARPYSFTLEEDT
TNFGTYFKGGIVTQVKQPKVLNFKPLREALKDPGDFLLSDFSKFDHPPIL
HIAFQALDKFVSELGRFPVAGSEEDAQKLTSIAANVNECLGEGKIEDINP
KLLRHFSFGSRAVLNPMAAMFGGIVGQEVVKACSGKFHPLFQFFYFDSVE
SLPAEPLDPSDFKPLNSRYDAQISVFGSKLQKKLEDSKVFIVGSGALGCE
FLKNVALMGVSCGSQGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKSTVA
ASAAASINPQLKIEALQNRVGPETENVFNDTFWENLTVVINALDNVNARL
YVDQRCLYFQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMC
TVHSFPHNIDHCLTWARSEFEGLLEKTPAEVNAYLSNPVEYAASMRDAGD
AQAKDNLERILECLDREKCETFQDCVAWARLRFEDYFVNRVKQLIYTFPE
DAATSTGAPFWSAPKRFPHPLQFSSTDPSHLHFIMAASILRAETFGIAVP
DQVKNPKMLAEAIENVIVPDFQPKEGVKINTDEKDTSLSTASVNDEAMIN
ELFYKLELCKNNLPSGFRLKPIQFEKDDDTNYHMDLIAALANMRARNYSI
PEVDKLKAKFIAGRIIPAIATSTAMATGLVCLELYKVLDGAHKVEDYRNT
FANLALPLFSMAEPVPPKVMKHREMSWTVWDRWILRDNPTLRELIQWLKD
KGLNAYSISYGSCLLFNSMFPKHKERLDKKVVDVAREVAKAELPPYRSHL
DVVVACEDDEDNDIDIPQISIYYR*
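
The two sequence records can be cross-checked arithmetically. The sketch below assumes that the displayed mRNA sequence consists only of coding sequence (it begins with ATG and ends with TGA) and that the trailing "*" in the protein record stands for the stop codon; under those assumptions the length of 1125 declared in the polypeptide header counts the stop symbol, and the translated product has one residue fewer. The helper names are hypothetical.

```python
def expected_residue_count(cds_length_nt: int) -> int:
    """Number of amino-acid residues encoded by a CDS that ends in a stop codon.

    Assumes the whole sequence is coding and its length is a multiple of 3.
    """
    codons, remainder = divmod(cds_length_nt, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the final codon is the stop and encodes no residue


def residue_count(protein: str) -> int:
    """Count residues in a protein string, ignoring a terminal '*' stop symbol."""
    return len(protein.rstrip("*"))


mrna_length_nt = 3375          # from the mRNA FASTA header
declared_protein_symbols = 1125  # from the polypeptide FASTA header

codons = mrna_length_nt // 3
residues = expected_residue_count(mrna_length_nt)
lengths_agree = (codons == declared_protein_symbols)
```

If the transcript carried untranslated regions, the same check would have to be run on the CDS coordinates rather than on the full mRNA length.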