Tc03v2_t000850.1

Overview
Name: Tc03v2_t000850.1
Unique Name: Tc03v2_t000850.1
Type: mRNA
Organism: Theobroma cacao (cacao)
Sequence length: 4188 bp
Properties
Property Name   Value
Model evidence  Supporting evidence includes similarity to: 2 ESTs, 19 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 3 samples with support for all annotated introns
Product         DNA-directed RNA polymerase III subunit 1, transcript variant X1
Note            DNA-directed RNA polymerase III subunit 1
Cross References
External references for this mRNA
Database  Accession
GeneID    18603906
Genbank   XM_018117638.1
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Species          Type
Tc03v2_g000850  Tc03v2_g000850  Theobroma cacao  gene


The following polypeptide feature(s) derive from this mRNA:

Feature Name      Unique Name       Species          Type
Tc03v2_p000850.1  Tc03v2_p000850.1  Theobroma cacao  polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name     Unique Name  Species          Type
exon-auto141919  auto141919   Theobroma cacao  exon
exon-auto141920  auto141920   Theobroma cacao  exon
exon-auto141921  auto141921   Theobroma cacao  exon
exon-auto141922  auto141922   Theobroma cacao  exon
exon-auto141923  auto141923   Theobroma cacao  exon
exon-auto141924  auto141924   Theobroma cacao  exon
exon-auto141925  auto141925   Theobroma cacao  exon
exon-auto141926  auto141926   Theobroma cacao  exon
exon-auto141927  auto141927   Theobroma cacao  exon
exon-auto141928  auto141928   Theobroma cacao  exon
exon-auto141929  auto141929   Theobroma cacao  exon
exon-auto141930  auto141930   Theobroma cacao  exon
exon-auto141931  auto141931   Theobroma cacao  exon
exon-auto141932  auto141932   Theobroma cacao  exon
exon-auto141933  auto141933   Theobroma cacao  exon
exon-auto141934  auto141934   Theobroma cacao  exon
exon-auto141935  auto141935   Theobroma cacao  exon
exon-auto141936  auto141936   Theobroma cacao  exon
exon-auto141937  auto141937   Theobroma cacao  exon
exon-auto141938  auto141938   Theobroma cacao  exon
exon-auto141939  auto141939   Theobroma cacao  exon
exon-auto141940  auto141940   Theobroma cacao  exon
exon-auto141941  auto141941   Theobroma cacao  exon
exon-auto141942  auto141942   Theobroma cacao  exon
exon-auto141943  auto141943   Theobroma cacao  exon
exon-auto141944  auto141944   Theobroma cacao  exon
exon-auto141945  auto141945   Theobroma cacao  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name    Unique Name  Species          Type
CDS-auto141946  auto141946   Theobroma cacao  CDS
CDS-auto141947  auto141947   Theobroma cacao  CDS
CDS-auto141948  auto141948   Theobroma cacao  CDS
CDS-auto141949  auto141949   Theobroma cacao  CDS
CDS-auto141950  auto141950   Theobroma cacao  CDS
CDS-auto141951  auto141951   Theobroma cacao  CDS
CDS-auto141952  auto141952   Theobroma cacao  CDS
CDS-auto141953  auto141953   Theobroma cacao  CDS
CDS-auto141954  auto141954   Theobroma cacao  CDS
CDS-auto141955  auto141955   Theobroma cacao  CDS
CDS-auto141956  auto141956   Theobroma cacao  CDS
CDS-auto141957  auto141957   Theobroma cacao  CDS
CDS-auto141958  auto141958   Theobroma cacao  CDS
CDS-auto141959  auto141959   Theobroma cacao  CDS
CDS-auto141960  auto141960   Theobroma cacao  CDS
CDS-auto141961  auto141961   Theobroma cacao  CDS
CDS-auto141962  auto141962   Theobroma cacao  CDS
CDS-auto141963  auto141963   Theobroma cacao  CDS
CDS-auto141964  auto141964   Theobroma cacao  CDS
CDS-auto141965  auto141965   Theobroma cacao  CDS
CDS-auto141966  auto141966   Theobroma cacao  CDS
CDS-auto141967  auto141967   Theobroma cacao  CDS
CDS-auto141968  auto141968   Theobroma cacao  CDS
CDS-auto141969  auto141969   Theobroma cacao  CDS
CDS-auto141970  auto141970   Theobroma cacao  CDS
CDS-auto141971  auto141971   Theobroma cacao  CDS
CDS-auto141972  auto141972   Theobroma cacao  CDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc03v2_t000850.1 ID=Tc03v2_t000850.1|Name=Tc03v2_t000850.1|organism=Theobroma cacao|type=mRNA|length=4188bp
ATGCAGCAAAAATTTACAAAACGGCCGTACATCGAAGATGTCGGGCCGCG
AAAAATAAAAAGCATTCAATTTTCTATGTTATCAGATTCGGAGATAGCCA
AAGCTGCTGAAGTTCAAGTTTATCAAGCTCTTTACTATGATCCTAAAAGC
CGCCCCATCGAAGGCGGCTTATTGGATCCCCGAATGGGTCCTGCAAATAA
AAGCGGGAAATGTGCAACCTGCCATGGAAATTTTGCGGATTGCCCAGGCC
ATTACGGATACTTATCTCTTGCCCTTCCTGTTTATAATGTTGGATATTTA
AGTACAATTTTAGACATTTTAAAGTGCATCTGTAAGTCTTGTTCTCGTAT
AATTTTGGATGAGAAATTATGCAAAGATTATCTGAAGAGGATGAGAAGTC
CGAAGATCGATGCATTAAAGAAGGGTGATATAATGAAAAGTATCGTGAAG
AAGTGTAGTGCTATGGCTAGTAGTAAAGCTGTGAAGTGCTGGAGATGTGG
ATATGTAAATGGTACGGTGAAGAAGGCTGTGGCAATGTTGGGCATTATTC
ATGATCGTTCAAAAATTAATGACAACAGTTTGGAAGAATTTAGATCAGCA
ATTTCCCACACAAAGGAGTCCAAGGCATCCTTCAACGTTGCTACTTATGT
TCTAAACCCTGTCAAAGTGCTTTCTCTTTTTAAAAGGATGACTGATTTGG
ATTGCGAATTGCTATATCTTTCTGATAGACCTGAGAAGCTCATAATTACA
AATATTGCTGTGCCACCTATACCTATCCGACCTTCAGTCATTATGGATGG
GTCACAGAGCAACGAAAATGACATTACTGAGAGGTTGAAACGAATTATTC
AGGCAAATGCTAGCCTTCGTCAGGAATTAGTAGAAACAAATGCTGCATTC
AAATGTCTGGGTGGCTGGGAGATGCTTCAAGTTGAAGTTGCACAGTACAT
TAATAGTGATGTTCGTGGTGTTCCATTTAGTATGCAAGTGTCAAAGCCGC
TGAGTGGTTTTGTTCAGCGCATCAAAGGGAAGCACGGACGCTTTCGTGGT
AACTTATCTGGCAAACGTGTTGAATATACTGGCCGGACTGTTATATCACC
TGACCCCAATCTGAAAATTACTGAGGTGGCTATCCCAATCCATATGGCTC
GGATTTTGACTTATCCAGAACGTGTTTCCAATCACAATATAGAGAAGCTA
AGGCAGTGTGTTCGTAATGGTCCTTCGAAATACCCTGGTGCAAGGATGGT
CAGATATCCTGATGGTTCAGCTAGGCTCTTGATAGGTGATTACAGAAAGC
GTCTTGCTGATGAACTAAAATTCGGTTGTGTAGTTGATCGCCATTTAGAA
GATGGAGATATTGTTCTTTTCAATAGACAGCCAAGCTTGCATAGAATGTC
TATCATGTGCCATAGGGCGAGAATCATGCCGTGGAGAACATTGCGATTCA
ATGAGTCTGTTTGTAACCCATATAATGCTGATTTTGATGGTGATGAAATG
AACATGCATGTCCCACAAACGGAGGAGGCTCGAACAGAGGCACTCATGTT
GATGGGGGTGCAAAATAATTTATGCACGCCAAAAAATGGAGAAATTTTGG
TTGCTTCTACTCAAGATTTTTTAACATCTTCCTTTCTCATTACAAGGAAG
GATATTTTCTATGATCGTGCAGCTTTTTCTCTTATATGCTCCTATATGGG
TGATGGCATGGATCTTATAGATTTGCCGACTCCAGCATTACTTAAGCCAA
TAGAGCTTTGGACTGGTAAGCAATTGTTTAGTGTTCTATTACGCCCACAT
GCGAGTGTGAGAGTCTACTTGAATCTTATTGTTAAGGAAAGGAACTACTC
CAAGAAGATTATCAAAAGGATTGGAAATAAGGAAATAGAAGTAGAAACAA
TGTGCCCAGACGATGGATTTGTCTATATTCGGAATAGTGAGCTTATATGT
GGGCAACTGGGGAAGGCTACTTTAGGAAATGGCAACAAGGATGGACTTTA
TTCTGTTCTTCTCAGGGACTACAATGCACATGCTGCTGCTGCCTGCATGA
ATCGGTTAGCTAAACTGAGTGCTCGATGGATAGGGAATCACGGCTTTTCA
ATTGGAATTGATGATGTCCAACCGGGGAAAAGGTTGAATGATGAGAAAGC
ATTAACAATTTCAGGAGATTATAAGAAATGTGATGAAGAGATACAGACGT
TCAATGAAGGAAAACTAAAGCCTAAACCTGGTTATGATGCTGCTCAAACA
CTAGAAGCTAATGTAACTGCAATATTGAATAACATTCGGGACAAAACGGG
GAAGGTATGCATGAAAGAACTACATTGGAGAAACAGTCCATTGATCATGT
CGCAATGTGGTTCCAAGGGTTCTGCTATAAATATAAGTCAAATGATTGCA
TGTGTTGGTCAGCAATCAGTTGGTGGTCGTCGTGCTCCTAATGGATTCAT
AGATCGTAGCCTTCCTCATTTTCATAGAGGATCAAAAACCCCTGCTGCTA
AAGGCTTTGTTGCAAATTCATTCTACAGTGGTTTGACTGCTACAGAGTTT
TTCTTTCACACGATGGCTGGGCGAGAAGGCCTTGTGGATACAGCTGTAAA
AACAGCTGAGACAGGATACATGTCTCGTAGACTGATCAAAGCATTGGAAG
ACTTGAGCATTCATTATGATAACACCGTTCGCAATGCAAGTGGATGTATA
GTTCAATTTATTTATGGAGATGATGGCATGGATCCTGCATGTATGGAGGG
AAAAAGTGGATTTCCTCTGAATTTTGACAGATTGTTAATGAAAGTAAAGG
CTACCTGTCCTCCAATAGAACAGAAATGCTTACACGTTGGTTCTATCATG
CCAATGTTAGAGGAGCAGCTTGCTAAACATGATCCTGCTGGGGTTTGCTC
TGAAGCCTTCAAAAAATCTCTGAAAGGGTTCCTTAAAAGTCAGACGAACG
AACTAGACAGAGTGATGAAATTGGTTAACAATTGTGCACAGAAGAGTGAG
ATACTTGAGAAAGTTGGCCATAAAATATCTGGTATATCTGACAGGCAGTT
GGAGGTTTTTGTTAGTACTTGCATTTCTCGTTATCGCTCTAAAGTAATTG
AAGCTGGAACTGCCATTGGAGCTATTGGAGCTCAGAGTATTGGTGAACCT
GGGACACAGATGACGCTGAAGACATTTCACTTTGCTGGAGTTGCGAGCAT
GAATATTACACAAGGAGTTCCTCGTATCAAAGAAATCATAAATGCAGCCA
AAAGAATTAGTACTCCCGTAATTACTGCAGAACTTGAGTTTGATGATAAT
CCGAACATTGCACAAATAGTAAAAGGTCGAATTGAGAAAACCGTTTTAGG
GCAGGTTGCTAAGAGCATCAAGATTGTAATTACTTCAAGATCAGCATCAG
TTGTTATCACCCTTGACATGGAAATAATCCTAGATGCAGAATTGTATATA
GATGCAAATATTGTGAAAGAATCAATTTTGCAAACTCCGAAAATTAAACT
AAAGGAGCAGCATGTGAAGGTTTTGGATGGTAGAAAATTGGAAGTTGTTC
CTCCAGCTGATAGAAGTCAAATTCATTTTGAACTTCATTCTCTTAAAAAT
CTGCTTCCACTGGTTGTGGTAAAGGGGATAAAAACTGTTGAACGCACTGT
TGTTTATGACAAGAACAAAGAGAAGAAAAATCAGAAAGAAGAAGAGACAA
CGAAGCATTTCCAGTTGCTTGTAGAAGGCATGGGGCTCCAAGCAGTTATG
GGCATTGAAGGAATTGATGGACGGAGGACATGGAGTAACCATGTAATGGA
AATGGAGCAGATATTGGGAATTGAAGCTGCAAGGAAATGCATAATCGATG
AGATAGCACAAACTATGGAACATCATGGAATGACTATAGACAGACGCCAT
ATGATGCTTCTAGCAGATGTGATGACATTTAGGGGGGAAGTTCTTGGCAT
CACAAGATTTGGAATCCAAAAAATGGACAAGAGTATATTGATGCTGGCTT
CATTTGAGAGGACAGCTGATCACCTTTTCAATGCTGCTGTTAACGGGAGG
GATGACAAGATTGAGGGAGTTACTGAGTGCATCATCATGGGCATCCCAAT
GCAGATAGGCACTGGAATACTCAAAGTTATACAGAGAGTTGATCCACCTC
CTATGCTACGATATGGACCAGATCCAGTTTTATCTTGA

protein sequence of Tc03v2_p000850.1

>Tc03v2_p000850.1 ID=Tc03v2_p000850.1|Name=Tc03v2_p000850.1|organism=Theobroma cacao|type=polypeptide|length=1396bp
MQQKFTKRPYIEDVGPRKIKSIQFSMLSDSEIAKAAEVQVYQALYYDPKS
RPIEGGLLDPRMGPANKSGKCATCHGNFADCPGHYGYLSLALPVYNVGYL
STILDILKCICKSCSRIILDEKLCKDYLKRMRSPKIDALKKGDIMKSIVK
KCSAMASSKAVKCWRCGYVNGTVKKAVAMLGIIHDRSKINDNSLEEFRSA
ISHTKESKASFNVATYVLNPVKVLSLFKRMTDLDCELLYLSDRPEKLIIT
NIAVPPIPIRPSVIMDGSQSNENDITERLKRIIQANASLRQELVETNAAF
KCLGGWEMLQVEVAQYINSDVRGVPFSMQVSKPLSGFVQRIKGKHGRFRG
NLSGKRVEYTGRTVISPDPNLKITEVAIPIHMARILTYPERVSNHNIEKL
RQCVRNGPSKYPGARMVRYPDGSARLLIGDYRKRLADELKFGCVVDRHLE
DGDIVLFNRQPSLHRMSIMCHRARIMPWRTLRFNESVCNPYNADFDGDEM
NMHVPQTEEARTEALMLMGVQNNLCTPKNGEILVASTQDFLTSSFLITRK
DIFYDRAAFSLICSYMGDGMDLIDLPTPALLKPIELWTGKQLFSVLLRPH
ASVRVYLNLIVKERNYSKKIIKRIGNKEIEVETMCPDDGFVYIRNSELIC
GQLGKATLGNGNKDGLYSVLLRDYNAHAAAACMNRLAKLSARWIGNHGFS
IGIDDVQPGKRLNDEKALTISGDYKKCDEEIQTFNEGKLKPKPGYDAAQT
LEANVTAILNNIRDKTGKVCMKELHWRNSPLIMSQCGSKGSAINISQMIA
CVGQQSVGGRRAPNGFIDRSLPHFHRGSKTPAAKGFVANSFYSGLTATEF
FFHTMAGREGLVDTAVKTAETGYMSRRLIKALEDLSIHYDNTVRNASGCI
VQFIYGDDGMDPACMEGKSGFPLNFDRLLMKVKATCPPIEQKCLHVGSIM
PMLEEQLAKHDPAGVCSEAFKKSLKGFLKSQTNELDRVMKLVNNCAQKSE
ILEKVGHKISGISDRQLEVFVSTCISRYRSKVIEAGTAIGAIGAQSIGEP
GTQMTLKTFHFAGVASMNITQGVPRIKEIINAAKRISTPVITAELEFDDN
PNIAQIVKGRIEKTVLGQVAKSIKIVITSRSASVVITLDMEIILDAELYI
DANIVKESILQTPKIKLKEQHVKVLDGRKLEVVPPADRSQIHFELHSLKN
LLPLVVVKGIKTVERTVVYDKNKEKKNQKEEETTKHFQLLVEGMGLQAVM
GIEGIDGRRTWSNHVMEMEQILGIEAARKCIIDEIAQTMEHHGMTIDRRH
MMLLADVMTFRGEVLGITRFGIQKMDKSILMLASFERTADHLFNAAVNGR
DDKIEGVTECIIMGIPMQIGTGILKVIQRVDPPPMLRYGPDPVLS*
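
The two records above can be cross-checked against each other. The following is a minimal sketch, not part of the database record: it assumes the FASTA header layout shown above (`ID=...|Name=...|...|length=...`) and the standard genetic code, and the helper names `parse_header` and `translate` are hypothetical. It parses the mRNA header, translates the first 48 and last 36 nucleotides of the mRNA sequence, and computes the codon count implied by the stated length.

```python
# Standard genetic code (NCBI translation table 1), listed in TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def parse_header(line):
    """Split a header of the form '>id key=value|key=value|...' (assumed layout)."""
    record_id, _, rest = line.lstrip(">").partition(" ")
    fields = dict(item.split("=", 1) for item in rest.split("|"))
    return record_id, fields


def translate(nt):
    """Translate a nucleotide string in frame 0; trailing partial codons are ignored."""
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


# Header line copied from the mRNA record above.
header = (">Tc03v2_t000850.1 ID=Tc03v2_t000850.1|Name=Tc03v2_t000850.1"
          "|organism=Theobroma cacao|type=mRNA|length=4188bp")
record_id, fields = parse_header(header)
mrna_length = int(fields["length"].rstrip("bp"))

# First 48 and last 36 nucleotides of the mRNA sequence above.
first_nt = "ATGCAGCAAAAATTTACAAAACGGCCGTACATCGAAGATGTCGGGCCG"
last_nt = "ATGCTACGATATGGACCAGATCCAGTTTTATCTTGA"

print(record_id, fields["type"], mrna_length)
print(translate(first_nt))   # compare with the start of the protein sequence
print(translate(last_nt))    # compare with the end of the protein sequence
print(mrna_length // 3)      # number of codons, stop codon included
```

Under these assumptions, the translated fragments agree with the start (`MQQKFTKRPYIEDVGP`) and end (`MLRYGPDPVLS*`) of the protein sequence, and 4188 / 3 = 1396 matches the length given in the polypeptide header, which therefore appears to count the terminal stop symbol (`*`) as a position.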