Tc03v2_t000850.2

Overview
NameTc03v2_t000850.2
Unique NameTc03v2_t000850.2
TypemRNA
OrganismTheobroma cacao (cacao)
Sequence length4146
Properties
Property NameValue
Model evidenceSupporting evidence includes similarity to: 2 ESTs, 13 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 3 samples with support for all annotated introns
ProductDNA-directed RNA polymerase III subunit 1, transcript variant X2
NoteDNA-directed RNA polymerase III subunit 1
Cross References
External references for this mRNA
DatabaseAccession
GeneID18603906
GenbankXM_018117639.1
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameSpeciesType
Tc03v2_g000850Tc03v2_g000850Theobroma cacaogene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameSpeciesType
Tc03v2_p000850.2Tc03v2_p000850.2Theobroma cacaopolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
exon-auto141975auto141975Theobroma cacaoexon
exon-auto141976auto141976Theobroma cacaoexon
exon-auto141977auto141977Theobroma cacaoexon
exon-auto141978auto141978Theobroma cacaoexon
exon-auto141979auto141979Theobroma cacaoexon
exon-auto141980auto141980Theobroma cacaoexon
exon-auto141981auto141981Theobroma cacaoexon
exon-auto141982auto141982Theobroma cacaoexon
exon-auto141983auto141983Theobroma cacaoexon
exon-auto141984auto141984Theobroma cacaoexon
exon-auto141985auto141985Theobroma cacaoexon
exon-auto141986auto141986Theobroma cacaoexon
exon-auto141987auto141987Theobroma cacaoexon
exon-auto141988auto141988Theobroma cacaoexon
exon-auto141989auto141989Theobroma cacaoexon
exon-auto141990auto141990Theobroma cacaoexon
exon-auto141991auto141991Theobroma cacaoexon
exon-auto141992auto141992Theobroma cacaoexon
exon-auto141993auto141993Theobroma cacaoexon
exon-auto141994auto141994Theobroma cacaoexon
exon-auto141995auto141995Theobroma cacaoexon
exon-auto141996auto141996Theobroma cacaoexon
exon-auto141997auto141997Theobroma cacaoexon
exon-auto141998auto141998Theobroma cacaoexon
exon-auto141999auto141999Theobroma cacaoexon
exon-auto142000auto142000Theobroma cacaoexon
exon-auto142001auto142001Theobroma cacaoexon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
CDS-auto142002auto142002Theobroma cacaoCDS
CDS-auto142003auto142003Theobroma cacaoCDS
CDS-auto142004auto142004Theobroma cacaoCDS
CDS-auto142005auto142005Theobroma cacaoCDS
CDS-auto142006auto142006Theobroma cacaoCDS
CDS-auto142007auto142007Theobroma cacaoCDS
CDS-auto142008auto142008Theobroma cacaoCDS
CDS-auto142009auto142009Theobroma cacaoCDS
CDS-auto142010auto142010Theobroma cacaoCDS
CDS-auto142011auto142011Theobroma cacaoCDS
CDS-auto142012auto142012Theobroma cacaoCDS
CDS-auto142013auto142013Theobroma cacaoCDS
CDS-auto142014auto142014Theobroma cacaoCDS
CDS-auto142015auto142015Theobroma cacaoCDS
CDS-auto142016auto142016Theobroma cacaoCDS
CDS-auto142017auto142017Theobroma cacaoCDS
CDS-auto142018auto142018Theobroma cacaoCDS
CDS-auto142019auto142019Theobroma cacaoCDS
CDS-auto142020auto142020Theobroma cacaoCDS
CDS-auto142021auto142021Theobroma cacaoCDS
CDS-auto142022auto142022Theobroma cacaoCDS
CDS-auto142023auto142023Theobroma cacaoCDS
CDS-auto142024auto142024Theobroma cacaoCDS
CDS-auto142025auto142025Theobroma cacaoCDS
CDS-auto142026auto142026Theobroma cacaoCDS
CDS-auto142027auto142027Theobroma cacaoCDS
CDS-auto142028auto142028Theobroma cacaoCDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc03v2_t000850.2 ID=Tc03v2_t000850.2|Name=Tc03v2_t000850.2|organism=Theobroma cacao|type=mRNA|length=4146bp
ATGCAGCAAAAATTTACAAAACGGCCGTACATCGAAGATGTCGGGCCGCG
AAAAATAAAAAGCATTCAATTTTCTATGTTATCAGATTCGGAGATAGCCA
AAGCTGCTGAAGTTCAAGTTTATCAAGCTCTTTACTATGATCCTAAAAGC
CGCCCCATCGAAGGCGGCTTATTGGATCCCCGAATGGGTCCTGCAAATAA
AAGCGGGAAATGTGCAACCTGCCATGGAAATTTTGCGGATTGCCCAGGCC
ATTACGGATACTTATCTCTTGCCCTTCCTGTTTATAATGTTGGATATTTA
AGTACAATTTTAGACATTTTAAAGTGCATCTGTAAGTCTTGTTCTCGTAT
AATTTTGGATGAGAAATTATGCAAAGATTATCTGAAGAGGATGAGAAGTC
CGAAGATCGATGCATTAAAGAAGGGTGATATAATGAAAAGTATCGTGAAG
AAGTGTAGTGCTATGGCTAGTAGTAAAGCTGTGAAGTGCTGGAGATGTGG
ATATGTAAATGGTACGGTGAAGAAGGCTGTGGCAATGTTGGGCATTATTC
ATGATCGTTCAAAAATTAATGACAACAGTTTGGAAGAATTTAGATCAGCA
ATTTCCCACACAAAGGAGTCCAAGGCATCCTTCAACGTTGCTACTTATGT
TCTAAACCCTGTCAAAGTGCTTTCTCTTTTTAAAAGGATGACTGATTTGG
ATTGCGAATTGCTATATCTTTCTGATAGACCTGAGAAGCTCATAATTACA
AATATTGCTGTGCCACCTATACCTATCCGACCTTCAGTCATTATGGATGG
GTCACAGAGCAACGAAAATGACATTACTGAGAGGTTGAAACGAATTATTC
AGGCAAATGCTAGCCTTCGTCAGGAATTAGTAGAAACAAATGCTGCATTC
AAATGTCTGGGTGGCTGGGAGATGCTTCAAGTTGAAGTTGCACAGTACAT
TAATAGTGATGTTCGTGGTGTTCCATTTAGTATGCAAGTGTCAAAGCCGC
TGAGTGGTTTTGTTCAGCGCATCAAAGGGAAGCACGGACGCTTTCGTGGT
AACTTATCTGGCAAACGTGTTGAATATACTGGCCGGACTGTTATATCACC
TGACCCCAATCTGAAAATTACTGAGGTGGCTATCCCAATCCATATGGCTC
GGATTTTGACTTATCCAGAACGTGTTTCCAATCACAATATAGAGAAGCTA
AGGCAGTGTGTTCGTAATGGTCCTTCGAAATACCCTGGTGCAAGGATGGT
CAGATATCCTGATGGTTCAGCTAGGCTCTTGATAGGTGATTACAGAAAGC
GTCTTGCTGATGAACTAAAATTCGGTTGTGTAGTTGATCGCCATTTAGAA
GATGGAGATATTGTTCTTTTCAATAGACAGCCAAGCTTGCATAGAATGTC
TATCATGTGCCATAGGGCGAGAATCATGCCGTGGAGAACATTGCGATTCA
ATGAGTCTGTTTGTAACCCATATAATGCTGATTTTGATGGTGATGAAATG
AACATGCATGTCCCACAAACGGAGGAGGCTCGAACAGAGGCACTCATGTT
GATGGGGGTGCAAAATAATTTATGCACGCCAAAAAATGGAGAAATTTTGG
TTGCTTCTACTCAAGATTTTTTAACATCTTCCTTTCTCATTACAAGGAAG
GATATTTTCTATGATCGTGCAGCTTTTTCTCTTATATGCTCCTATATGGG
TGATGGCATGGATCTTATAGATTTGCCGACTCCAGCATTACTTAAGCCAA
TAGAGCTTTGGACTGGTAAGCAATTGTTTAGTGTTCTATTACGCCCACAT
GCGAGTGTGAGAGTCTACTTGAATCTTATTGTTAAGGAAAGGAACTACTC
CAAGAAGATTATCAAAAGGATTGGAAATAAGGAAATAGAAGTAGAAACAA
TGTGCCCAGACGATGGATTTGTCTATATTCGGAATAGTGAGCTTATATGT
GGGCAACTGGGGAAGGCTACTTTAGGAAATGGCAACAAGGATGGACTTTA
TTCTGTTCTTCTCAGGGACTACAATGCACATGCTGCTGCTGCCTGCATGA
ATCGGTTAGCTAAACTGAGTGCTCGATGGATAGGGAATCACGGCTTTTCA
ATTGGAATTGATGATGTCCAACCGGGGAAAAGGTTGAATGATGAGAAAGC
ATTAACAATTTCAGGAGATTATAAGAAATGTGATGAAGAGATACAGACGT
TCAATGAAGGAAAACTAAAGCCTAAACCTGGTTATGATGCTGCTCAAACA
CTAGAAGCTAATGTATGCATGAAAGAACTACATTGGAGAAACAGTCCATT
GATCATGTCGCAATGTGGTTCCAAGGGTTCTGCTATAAATATAAGTCAAA
TGATTGCATGTGTTGGTCAGCAATCAGTTGGTGGTCGTCGTGCTCCTAAT
GGATTCATAGATCGTAGCCTTCCTCATTTTCATAGAGGATCAAAAACCCC
TGCTGCTAAAGGCTTTGTTGCAAATTCATTCTACAGTGGTTTGACTGCTA
CAGAGTTTTTCTTTCACACGATGGCTGGGCGAGAAGGCCTTGTGGATACA
GCTGTAAAAACAGCTGAGACAGGATACATGTCTCGTAGACTGATCAAAGC
ATTGGAAGACTTGAGCATTCATTATGATAACACCGTTCGCAATGCAAGTG
GATGTATAGTTCAATTTATTTATGGAGATGATGGCATGGATCCTGCATGT
ATGGAGGGAAAAAGTGGATTTCCTCTGAATTTTGACAGATTGTTAATGAA
AGTAAAGGCTACCTGTCCTCCAATAGAACAGAAATGCTTACACGTTGGTT
CTATCATGCCAATGTTAGAGGAGCAGCTTGCTAAACATGATCCTGCTGGG
GTTTGCTCTGAAGCCTTCAAAAAATCTCTGAAAGGGTTCCTTAAAAGTCA
GACGAACGAACTAGACAGAGTGATGAAATTGGTTAACAATTGTGCACAGA
AGAGTGAGATACTTGAGAAAGTTGGCCATAAAATATCTGGTATATCTGAC
AGGCAGTTGGAGGTTTTTGTTAGTACTTGCATTTCTCGTTATCGCTCTAA
AGTAATTGAAGCTGGAACTGCCATTGGAGCTATTGGAGCTCAGAGTATTG
GTGAACCTGGGACACAGATGACGCTGAAGACATTTCACTTTGCTGGAGTT
GCGAGCATGAATATTACACAAGGAGTTCCTCGTATCAAAGAAATCATAAA
TGCAGCCAAAAGAATTAGTACTCCCGTAATTACTGCAGAACTTGAGTTTG
ATGATAATCCGAACATTGCACAAATAGTAAAAGGTCGAATTGAGAAAACC
GTTTTAGGGCAGGTTGCTAAGAGCATCAAGATTGTAATTACTTCAAGATC
AGCATCAGTTGTTATCACCCTTGACATGGAAATAATCCTAGATGCAGAAT
TGTATATAGATGCAAATATTGTGAAAGAATCAATTTTGCAAACTCCGAAA
ATTAAACTAAAGGAGCAGCATGTGAAGGTTTTGGATGGTAGAAAATTGGA
AGTTGTTCCTCCAGCTGATAGAAGTCAAATTCATTTTGAACTTCATTCTC
TTAAAAATCTGCTTCCACTGGTTGTGGTAAAGGGGATAAAAACTGTTGAA
CGCACTGTTGTTTATGACAAGAACAAAGAGAAGAAAAATCAGAAAGAAGA
AGAGACAACGAAGCATTTCCAGTTGCTTGTAGAAGGCATGGGGCTCCAAG
CAGTTATGGGCATTGAAGGAATTGATGGACGGAGGACATGGAGTAACCAT
GTAATGGAAATGGAGCAGATATTGGGAATTGAAGCTGCAAGGAAATGCAT
AATCGATGAGATAGCACAAACTATGGAACATCATGGAATGACTATAGACA
GACGCCATATGATGCTTCTAGCAGATGTGATGACATTTAGGGGGGAAGTT
CTTGGCATCACAAGATTTGGAATCCAAAAAATGGACAAGAGTATATTGAT
GCTGGCTTCATTTGAGAGGACAGCTGATCACCTTTTCAATGCTGCTGTTA
ACGGGAGGGATGACAAGATTGAGGGAGTTACTGAGTGCATCATCATGGGC
ATCCCAATGCAGATAGGCACTGGAATACTCAAAGTTATACAGAGAGTTGA
TCCACCTCCTATGCTACGATATGGACCAGATCCAGTTTTATCTTGA
back to top

protein sequence of Tc03v2_p000850.2

>Tc03v2_p000850.2 ID=Tc03v2_p000850.2|Name=Tc03v2_p000850.2|organism=Theobroma cacao|type=polypeptide|length=1382bp
MQQKFTKRPYIEDVGPRKIKSIQFSMLSDSEIAKAAEVQVYQALYYDPKS
RPIEGGLLDPRMGPANKSGKCATCHGNFADCPGHYGYLSLALPVYNVGYL
STILDILKCICKSCSRIILDEKLCKDYLKRMRSPKIDALKKGDIMKSIVK
KCSAMASSKAVKCWRCGYVNGTVKKAVAMLGIIHDRSKINDNSLEEFRSA
ISHTKESKASFNVATYVLNPVKVLSLFKRMTDLDCELLYLSDRPEKLIIT
NIAVPPIPIRPSVIMDGSQSNENDITERLKRIIQANASLRQELVETNAAF
KCLGGWEMLQVEVAQYINSDVRGVPFSMQVSKPLSGFVQRIKGKHGRFRG
NLSGKRVEYTGRTVISPDPNLKITEVAIPIHMARILTYPERVSNHNIEKL
RQCVRNGPSKYPGARMVRYPDGSARLLIGDYRKRLADELKFGCVVDRHLE
DGDIVLFNRQPSLHRMSIMCHRARIMPWRTLRFNESVCNPYNADFDGDEM
NMHVPQTEEARTEALMLMGVQNNLCTPKNGEILVASTQDFLTSSFLITRK
DIFYDRAAFSLICSYMGDGMDLIDLPTPALLKPIELWTGKQLFSVLLRPH
ASVRVYLNLIVKERNYSKKIIKRIGNKEIEVETMCPDDGFVYIRNSELIC
GQLGKATLGNGNKDGLYSVLLRDYNAHAAAACMNRLAKLSARWIGNHGFS
IGIDDVQPGKRLNDEKALTISGDYKKCDEEIQTFNEGKLKPKPGYDAAQT
LEANVCMKELHWRNSPLIMSQCGSKGSAINISQMIACVGQQSVGGRRAPN
GFIDRSLPHFHRGSKTPAAKGFVANSFYSGLTATEFFFHTMAGREGLVDT
AVKTAETGYMSRRLIKALEDLSIHYDNTVRNASGCIVQFIYGDDGMDPAC
MEGKSGFPLNFDRLLMKVKATCPPIEQKCLHVGSIMPMLEEQLAKHDPAG
VCSEAFKKSLKGFLKSQTNELDRVMKLVNNCAQKSEILEKVGHKISGISD
RQLEVFVSTCISRYRSKVIEAGTAIGAIGAQSIGEPGTQMTLKTFHFAGV
ASMNITQGVPRIKEIINAAKRISTPVITAELEFDDNPNIAQIVKGRIEKT
VLGQVAKSIKIVITSRSASVVITLDMEIILDAELYIDANIVKESILQTPK
IKLKEQHVKVLDGRKLEVVPPADRSQIHFELHSLKNLLPLVVVKGIKTVE
RTVVYDKNKEKKNQKEEETTKHFQLLVEGMGLQAVMGIEGIDGRRTWSNH
VMEMEQILGIEAARKCIIDEIAQTMEHHGMTIDRRHMMLLADVMTFRGEV
LGITRFGIQKMDKSILMLASFERTADHLFNAAVNGRDDKIEGVTECIIMG
IPMQIGTGILKVIQRVDPPPMLRYGPDPVLS*
back to top