Tc02v2_t001050.2

Overview
Name: Tc02v2_t001050.2
Unique Name: Tc02v2_t001050.2
Type: mRNA
Organism: Theobroma cacao (cacao)
Sequence length: 2097 bp
Properties
Property Name    Value
Model evidence   Supporting evidence includes similarity to: 2 ESTs, 6 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 21 samples with support for all annotated introns
Product          30-kDa cleavage and polyadenylation specificity factor 30, transcript variant X2
Note             30-kDa cleavage and polyadenylation specificity factor 30
Cross References
External references for this mRNA
Database   Accession
GeneID     18607089
Genbank    XM_007041078.2
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name      Unique Name       Species           Type
Tc02v2_g001050    Tc02v2_g001050    Theobroma cacao   gene


The following polypeptide feature(s) derive from this mRNA:

Feature Name        Unique Name         Species           Type
Tc02v2_p001050.2    Tc02v2_p001050.2    Theobroma cacao   polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name      Unique Name   Species           Type
exon-auto80795    auto80795     Theobroma cacao   exon
exon-auto80796    auto80796     Theobroma cacao   exon
exon-auto80797    auto80797     Theobroma cacao   exon
exon-auto80798    auto80798     Theobroma cacao   exon
exon-auto80799    auto80799     Theobroma cacao   exon
exon-auto80800    auto80800     Theobroma cacao   exon
exon-auto80801    auto80801     Theobroma cacao   exon
exon-auto80802    auto80802     Theobroma cacao   exon


The following CDS feature(s) are a part of this mRNA:

Feature Name     Unique Name   Species           Type
CDS-auto80803    auto80803     Theobroma cacao   CDS
CDS-auto80804    auto80804     Theobroma cacao   CDS
CDS-auto80805    auto80805     Theobroma cacao   CDS
CDS-auto80806    auto80806     Theobroma cacao   CDS
CDS-auto80807    auto80807     Theobroma cacao   CDS
CDS-auto80808    auto80808     Theobroma cacao   CDS
CDS-auto80809    auto80809     Theobroma cacao   CDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc02v2_t001050.2 ID=Tc02v2_t001050.2|Name=Tc02v2_t001050.2|organism=Theobroma cacao|type=mRNA|length=2097bp
ATGGATGACTCGGAGGGAGGTCTCAGCTTCGATTTCGAGGGCGGTCTCGA
TGCGGGTCCCGCCGCGCCAACTGCCTCCATGCCCGTTGTCAACTCCGATC
CTTCGGCCGCCGCCAACAACAACAGCAACAACAACTCGGCGGTCCCAGGC
GCTGCCCCTACTTCGACAAATGATCCCGCCGCCGCTGTGGGTGGCGGAGG
AGCTGGGCGAAGGAGCTTTCGTCAGACTGTATGCCGGCACTGGCTCCGCA
GCCTCTGCATGAAGGGCGACGCCTGCGGTTTCCTTCACCAGTACGACAAG
TCCCGCATGCCGGTCTGCCGGTTTTTTCGGCTTTTTGGTGAGTGCCGGGA
GCAAGATTGTGTCTACAAACACACCAACGAGGATATTAAGGAGTGTAACA
TGTACAAGCTAGGTTTTTGTCCTAATGGTGCTGACTGCCGATATAGGCAT
GCAAAGCTCCCAGGACCTCCACCTCCTGTGGAAGAAGTCCTGCAAAAGAT
TCAGCAATTGAGTTCTTACAATTATAACAAATTTTTTCAACAAAGGAATT
CTGGCTTTGCCCAACAAACAGAAAAATCTCAGATTCCACAAGGACAAAAC
AATGTAAACCAAGGAGCAGGTGGAAAACCTTCAACAACAGAATCTGCTAA
CATGCATCCACAGCAGCAAGTTCAACAGCCTCCACAACAGGTCAGCCAGA
CCCAGATACAAAATGTCCCCAACGGTCAGTCTAACCAGGCAAACAAAACT
GCTATACCTTTGCCTCAAGGAATATCTAGGTATTTTATTGTTAAAAGTTG
CAACCGCGAAAATCTGGAACTATCTGTTCAACAAGGAGTATGGGCAACTC
AAAGAAGCAATGAGGCTAAACTGAATGAAGCTTTTGATTCTGCTGAAAAT
GTGATTTTGATCTTCTCAGTTAATCGTACTCGGCATTTCCAGGGTTGTGC
CAAGATGACATCAAAAATTGGTGGATCTGTTGCTGGAGGGAACTGGAAAT
ATGCTCATGGAACTGCACATTATGGACGAAATTTCTCAGTAAAATGGTTA
AAATTATGTGAGCTGTCCTTCCACAAAACTCGCCATTTGAGGAACCCCTA
CAATGAGAACTTACCAGTAAAGATAAGTAGAGATTGTCAGGAGTTAGAGC
CTTCTATTGGTGAGCAGTTGGCCTCCTTGCTTTATCTTGAGCCAGATAGT
GAGCTTATGGCTATTTCGGTGGCAGCAGAATTAAAACGAGAAGAAGAGAA
AGCAAAGGGAGTCAATTCAGATAATGGAGGAGAGAACCCGGACATTGTGC
CATTTGAGGACAACGAAGAAGAAGAAGAGGAAGAAAGTGAGGAGGAGGAT
GAAAGCTTTAGTGCTGCAGCTCAGGGAAGAGGAAGAGGCAGAGGAGTCAT
GTGGCCCCCTCACATGCCACTTGCCCGAGGTGCCAGACCCATGCCTGGTA
TGCGAGGTTTTCCACCTATGATGATGGGTGGTGATGGCTTTTCGTATGGA
CCCGTAACACCTGATGGTTTTGGGGTGCCAGATCTTTTCGGTGCTCCCCG
TCCTTTTCCACCATATGGGCCAAGGTTTTCTGGAGATTTCACAGGTCCTG
CATCTGGCATGATGTTTCCAGGGCGACCTCCACAACCTGGGGCTATGTTT
CCTGCTGGTGGACTTGGGATGATGATGGGTCCAGGTCGTGCTCCATTTAT
GGGGGGTATGGGGCCCACAGGTGCAAATCCTGTTCGAGGTGGCCGCCCAG
TCAGCATGCCCCCTATGTTTCCCCCACCCCCAGCACCTTCATCCCAGAAT
TCTGGCCGGGCAGTTAAGAGAGATCAGAGGACACCAACCAATGACAGATA
TGGTGCAGGATCAGAGCAGGGTAGAGGTCAGGAAATGGCTGGTCCGGGAG
GCAGGTTGGATGATGAGACACAGTATCAGCAAGAAGGACAAAAGGCTCAC
CACGAAGATCAGTTTGCTGCTGGAAACAGCTTCAGAAATGATGAGAGTGA
AAGTGAGGATGAGGCACCGAGGAGGTCAAGGTATGGCGAAGGGAAGAAGA
AAAGAAGAAGCTTAGAAGGAGATGATGCCAATGGTTCTGATCACTAG
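
The FASTA header above packs its metadata into `key=value` pairs separated by `|`. A minimal sketch of how such a header could be split into fields is shown here; the function name `parse_header` is hypothetical, and the layout (record ID, a space, then pipe-delimited pairs) is assumed from this record alone rather than from any documented specification.

```python
def parse_header(line):
    """Split a '>'-prefixed FASTA header into (record_id, fields).

    Assumes the layout seen on this page: the record ID, one space,
    then 'key=value' pairs separated by '|'.
    """
    if not line.startswith(">"):
        raise ValueError("not a FASTA header line")
    record_id, _, description = line[1:].strip().partition(" ")
    fields = {}
    for item in description.split("|"):
        key, sep, value = item.partition("=")
        if sep:  # skip anything that is not a key=value pair
            fields[key] = value
    return record_id, fields


header = (">Tc02v2_t001050.2 ID=Tc02v2_t001050.2|Name=Tc02v2_t001050.2"
          "|organism=Theobroma cacao|type=mRNA|length=2097bp")
record_id, fields = parse_header(header)

# The length field carries a unit suffix; strip it before converting.
length_text = fields["length"]
if length_text.endswith("bp"):
    length_text = length_text[:-2]
length = int(length_text)

print(record_id, fields["type"], fields["organism"], length)
```

Splitting on `|` before `=` keeps values that contain spaces, such as the organism name, intact.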

protein sequence of Tc02v2_p001050.2

>Tc02v2_p001050.2 ID=Tc02v2_p001050.2|Name=Tc02v2_p001050.2|organism=Theobroma cacao|type=polypeptide|length=699bp
MDDSEGGLSFDFEGGLDAGPAAPTASMPVVNSDPSAAANNNSNNNSAVPG
AAPTSTNDPAAAVGGGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDK
SRMPVCRFFRLFGECREQDCVYKHTNEDIKECNMYKLGFCPNGADCRYRH
AKLPGPPPPVEEVLQKIQQLSSYNYNKFFQQRNSGFAQQTEKSQIPQGQN
NVNQGAGGKPSTTESANMHPQQQVQQPPQQVSQTQIQNVPNGQSNQANKT
AIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAEN
VILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWL
KLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDS
ELMAISVAAELKREEEKAKGVNSDNGGENPDIVPFEDNEEEEEEESEEED
ESFSAAAQGRGRGRGVMWPPHMPLARGARPMPGMRGFPPMMMGGDGFSYG
PVTPDGFGVPDLFGAPRPFPPYGPRFSGDFTGPASGMMFPGRPPQPGAMF
PAGGLGMMMGPGRAPFMGGMGPTGANPVRGGRPVSMPPMFPPPPAPSSQN
SGRAVKRDQRTPTNDRYGAGSEQGRGQEMAGPGGRLDDETQYQQEGQKAH
HEDQFAAGNSFRNDESESEDEAPRRSRYGEGKKKRRSLEGDDANGSDH*
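
The two sequences on this page can be checked against each other. The nucleotide sequence begins with ATG, ends with TAG, and its 2097 nt divide evenly into 699 codons, which is consistent with the polypeptide header's length of 699 if that figure counts the trailing stop symbol. The sketch below translates the first and last few codons copied from the mRNA sequence above. It assumes the standard genetic code, which the page does not state explicitly; the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest). This is an assumption: the
# record does not say which translation table was used.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nt):
    """Translate an in-frame nucleotide string; '*' marks a stop codon.

    Trailing bases that do not fill a codon are ignored. Ambiguous
    bases such as N are not handled and raise KeyError.
    """
    nt = nt.upper().replace("U", "T")
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


first_codons = "ATGGATGACTCGGAGGGAGGTCTC"  # first 24 nt of the mRNA sequence
last_codons = "GGTTCTGATCACTAG"            # last 15 nt of the mRNA sequence

print(translate(first_codons))  # MDDSEGGL, the start of Tc02v2_p001050.2
print(translate(last_codons))   # GSDH*, the end of Tc02v2_p001050.2

codons, remainder = divmod(2097, 3)
print(codons, remainder)        # 699 0
```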