Tc06v2_t015520.1

Overview
NameTc06v2_t015520.1
Unique NameTc06v2_t015520.1
TypemRNA
OrganismTheobroma cacao (cacao)
Sequence length3570
Properties
Property NameValue
NoteDNA-directed RNA polymerase II subunit RPB2
Model evidenceSupporting evidence includes similarity to: 8 ESTs, 14 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 18 samples with support for all annotated introns
ProductDNA-directed RNA polymerase II subunit RPB2
Cross References
External references for this mRNA
DatabaseAccession
GeneID18596855
GenbankXM_007025569.2
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameSpeciesType
Tc06v2_g015520Tc06v2_g015520Theobroma cacaogene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameSpeciesType
Tc06v2_p015520.1Tc06v2_p015520.1Theobroma cacaopolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
exon-auto344102auto344102Theobroma cacaoexon
exon-auto344103auto344103Theobroma cacaoexon
exon-auto344104auto344104Theobroma cacaoexon
exon-auto344105auto344105Theobroma cacaoexon
exon-auto344106auto344106Theobroma cacaoexon
exon-auto344107auto344107Theobroma cacaoexon
exon-auto344108auto344108Theobroma cacaoexon
exon-auto344109auto344109Theobroma cacaoexon
exon-auto344110auto344110Theobroma cacaoexon
exon-auto344111auto344111Theobroma cacaoexon
exon-auto344112auto344112Theobroma cacaoexon
exon-auto344113auto344113Theobroma cacaoexon
exon-auto344114auto344114Theobroma cacaoexon
exon-auto344115auto344115Theobroma cacaoexon
exon-auto344116auto344116Theobroma cacaoexon
exon-auto344117auto344117Theobroma cacaoexon
exon-auto344118auto344118Theobroma cacaoexon
exon-auto344119auto344119Theobroma cacaoexon
exon-auto344120auto344120Theobroma cacaoexon
exon-auto344121auto344121Theobroma cacaoexon
exon-auto344122auto344122Theobroma cacaoexon
exon-auto344123auto344123Theobroma cacaoexon
exon-auto344124auto344124Theobroma cacaoexon
exon-auto344125auto344125Theobroma cacaoexon
exon-auto344126auto344126Theobroma cacaoexon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
CDS-auto344127auto344127Theobroma cacaoCDS
CDS-auto344128auto344128Theobroma cacaoCDS
CDS-auto344129auto344129Theobroma cacaoCDS
CDS-auto344130auto344130Theobroma cacaoCDS
CDS-auto344131auto344131Theobroma cacaoCDS
CDS-auto344132auto344132Theobroma cacaoCDS
CDS-auto344133auto344133Theobroma cacaoCDS
CDS-auto344134auto344134Theobroma cacaoCDS
CDS-auto344135auto344135Theobroma cacaoCDS
CDS-auto344136auto344136Theobroma cacaoCDS
CDS-auto344137auto344137Theobroma cacaoCDS
CDS-auto344138auto344138Theobroma cacaoCDS
CDS-auto344139auto344139Theobroma cacaoCDS
CDS-auto344140auto344140Theobroma cacaoCDS
CDS-auto344141auto344141Theobroma cacaoCDS
CDS-auto344142auto344142Theobroma cacaoCDS
CDS-auto344143auto344143Theobroma cacaoCDS
CDS-auto344144auto344144Theobroma cacaoCDS
CDS-auto344145auto344145Theobroma cacaoCDS
CDS-auto344146auto344146Theobroma cacaoCDS
CDS-auto344147auto344147Theobroma cacaoCDS
CDS-auto344148auto344148Theobroma cacaoCDS
CDS-auto344149auto344149Theobroma cacaoCDS
CDS-auto344150auto344150Theobroma cacaoCDS
CDS-auto344151auto344151Theobroma cacaoCDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc06v2_t015520.1 ID=Tc06v2_t015520.1|Name=Tc06v2_t015520.1|organism=Theobroma cacao|type=mRNA|length=3570bp
ATGGAGGACGACAGTGAGTACGATCCGCAACTTATGGACGACGAAGACGA
CGAGGAGATCACGCAGGAAGACGCGTGGGCGGTTATCTCAGCTTACTTCG
AAGAAAAAGGTCTGGTGCGTCAACAGCTCGACTCGTTCGATGAATTTATC
CAAAACACTATGCAAGAAATCGTCGACGAATCGGCCGATATTGAGATCAG
GCCAGAGTCACAGCACAATCCTGGTCACCAGTCCGACTTTGCTGAGACTA
TCTATAAGATTAGCTTTGGTCAGATCTACCTTAGTAAACCTATGATGACC
GAGTCAGATGGTGAAACTGCAACTTTATTTCCAAAAGCTGCAAGGTTGAG
GAATCTTACTTACTCAGCTCCATTGTATGTCGATGTAACTAAGAGAGTTA
TAAAGAAAGGGCATGATGGTGAAGAAGTCACTGAGACTCAGGATTTTACT
AAAGTGTTCATTGGGAAGGTTCCTATAATGCTCCGGTCAAGTTATTGCAC
ACTATATCAAAATTCAGAGAAGGATCTGACCGAGCTTGGGGAGTGTCCAT
ATGATCAAGGTGGGTATTTCATTATCAATGGGAGTGAAAAGGTTCTAATT
GCTCAGGAGAAGATGAGCACAAATCATGTCTATGTCTTCAAAAAGAGGCA
GCCGAACAAATATGCCTATGTGGCAGAAGTTCGGTCCATGGCAGAGTCCC
AGAATAGGCCACCAAGTACCATGTTTGTGCGGATGCTTTCTCGGACTAGT
GCCAAAGGGGGCTCTTCGGGGCAGTACATTCGTGCTACTCTTCCATATAT
TCGGACTGAAATTCCTATCATAATTGTCTTTCGGGCTTTGGGATTTGTTG
CTGACAAGGACATATTAGAGCATATATGCTATGACTTCTCCGACACCCAG
ATGATGGAGTTGCTTAGGCCTTCCTTAGAAGAAGCATTTGTGATTCAAAA
CCAGCAGGTTGCACTAGATTATATTGGTAAAAGAGGAGCAACTGTTGGTG
TTACCAGAGAAAAGAGGATTAAGTATGCTAAAGAGATCCTCCAAAAAGAA
ATGCTTCCTCACGTAGGTGTTGGAGATTTTTGCGAGACAAAGAAAGCTTA
TTATTTTGGATATATTATTCACCGGCTGCTTCTTTGTGCACTTGGCCGGA
GGGCGGAAGATGATAGAGATCATTATGGCAACAAGAGGTTGGACCTTGCT
GGTCCATTACTTGGAGGCCTCTTTAGAATGCTTTTTCGGAAGTTAACTAG
GGATGTGAGATCTTATGTGCAGAAGTGTGTTGATAACGGGAAGGATGTGA
ACCTGCAATTTGCTATCAAAGCGAAAACTATTACAAGTGGTCTTAAATAC
TCACTTGCTACTGGAAATTGGGGGCAAGCAAATGCAGCTGGTACTAGAGC
TGGAGTGTCACAGGTGTTAAACCGTTTGACATATGCCTCAACTTTGTCAC
ACTTGCGAAGGCTCAATTCTCCTATAGGACGTGAAGGGAAATTGGCTAAA
CCACGTCAGTTGCATAATTCACAGTGGGGAATGATGTGTCCAGCGGAAAC
ACCGGAAGGACAGGCCTGTGGACTTGTAAAGAATCTTGCCTTGATGGTAT
ACATAACTGTCGGATCAGCTGCATATCCTATTCTTGAATTTTTGGAAGAG
TGGGGTACGGAGAATTTTGAGGAAATCTCACCTGCAGTTATCCCTCAAGC
TACAAAAATTTTTGTCAATGGTTGCTGGGTTGGTGTACATCGGAATCCTG
ATATGCTTGTGACAACATTGAGACGGTTGAGAAGACGGGTTGATGTCAAT
ACTGAAGTTGGTGTTGTTAGAGATATCCGTCTAAAAGAACTTCGAATATA
TACTGACTATGGTCGTTGCAGTCGACCATTGTTCATCGTGGAGAAACAAA
GACTTCTCATAAAGAAGAAAGATATTCATGCACTGCAACAAAGAGAAAGC
CCAGAAGACGGTGGCTGGCATGATCTTGTAGCAAAGGGATTTATAGAATA
CATTGACACGGAAGAAGAGGAGACAACAATGATTTCCATGACCATCAATG
ATCTTGTACAAGCGAGAGTCAATCCAGAGGAAGCTTATTCTGAAACTTAT
ACCCATTGTGAGATCCACCCTTCATTGATTTTGGGTGTTTGTGCTTCAAT
TATACCATTTCCTGATCATAATCAGTCCCCGCGTAATACCTATCAATCTG
CTATGGGTAAGCAAGCAATGGGAATATATGTTACCAACTACCAATTTCGA
ATGGATACATTGGCCTATGTTCTCTATTATCCCCAAAAGCCACTTGTTAC
TACACGAGCTATGGAACATCTCCACTTTCGGCAGCTTCCAGCTGGCATTA
ATGCTATTGTTGCTATCGCCTGCTATTCTGGATATAACCAAGAAGATTCT
GTTATTATGAATCAATCATCAATAGACCGTGGATTCTTCCGATCACTTTT
CTTCCGCTCTTACCGAGATGAGGAGAAAAAAATGGGGACCCTTGTTAAAG
AAGATTTTGGTCGACCAGATAGGGCTAATACTATGGGAATGAGGCATGGC
TCTTATGATAAATTGGATGATGATGGTCTTGCACCTCCTGGAACAAGAGT
TTCAGGTGAGGATGTAATCATCGGAAAGACCACCCCGATTTCTCAGGAAG
AAGCTCAGGGACAAGCATCACGCTATTCAAGACGTGATCATAGCATAAGC
TTACGTCACAGTGAAACAGGCATAGTGGACCAAGTTCTATTGACAACTAA
TGCTGATGGGTTGAGATTTGTGAAAGTAAGGGTAAGATCTGTTCGCATTC
CCCAGATTGGGGACAAGTTTAGCAGTAGACATGGTCAAAAGGGGACAGTG
GGCATGACATACACGCAGGAAGACATGCCTTGGACTGTGGAAGGCATCAC
ACCCGATATCATTGTGAACCCACATGCTATTCCTTCTCGAATGACAATTG
GTCAGCTTATTGAATGTATCATGGGGAAAGTTGCAGCTCACATGGGCAAG
GAAGGGGATGCCACTCCTTTTACAGATGTCACCGTGGACAATATCAGCAG
AGCTCTTCATAAATGTGGATATCAAATGCGTGGTTTTGAGACCATGTATA
ATGGGCACACAGGCAGGCGCCTTTCTGCTATGATATTTTTGGGGCCCACA
TATTACCAAAGACTAAAGCACATGGTTGATGATAAGATCCATTCTCGTGG
TCGGGGCCCTGTGCAGATCCTGACAAGGCAGCCTGCAGAGGGACGATCCC
GTGATGGTGGTCTCCGTTTCGGAGAGATGGAAAGAGATTGCATGATTGCG
CATGGTGCTGCTCATTTCCTTAAAGAGAGATTGTTTGACCAAAGTGATGC
ATACAGGGTCCATGTGTGCGAGCGTTGTGGGTTGATTGCTATTGCAAATC
TAAAGAAGAACTCATTTGAGTGCAGAGGATGCAAGAATAAAACTGATATT
GTTCAGGTATACATTCCTTACGCCTGTAAGCTGCTCTTCCAAGAGCTTAT
GGCCATGGCAATTGCTCCAAGAATGCTCACAAAGGAACCTCCCAAAGACC
AAAAGAAGAAAGGAGCCTGA
back to top

protein sequence of Tc06v2_p015520.1

>Tc06v2_p015520.1 ID=Tc06v2_p015520.1|Name=Tc06v2_p015520.1|organism=Theobroma cacao|type=polypeptide|length=1190bp
MEDDSEYDPQLMDDEDDEEITQEDAWAVISAYFEEKGLVRQQLDSFDEFI
QNTMQEIVDESADIEIRPESQHNPGHQSDFAETIYKISFGQIYLSKPMMT
ESDGETATLFPKAARLRNLTYSAPLYVDVTKRVIKKGHDGEEVTETQDFT
KVFIGKVPIMLRSSYCTLYQNSEKDLTELGECPYDQGGYFIINGSEKVLI
AQEKMSTNHVYVFKKRQPNKYAYVAEVRSMAESQNRPPSTMFVRMLSRTS
AKGGSSGQYIRATLPYIRTEIPIIIVFRALGFVADKDILEHICYDFSDTQ
MMELLRPSLEEAFVIQNQQVALDYIGKRGATVGVTREKRIKYAKEILQKE
MLPHVGVGDFCETKKAYYFGYIIHRLLLCALGRRAEDDRDHYGNKRLDLA
GPLLGGLFRMLFRKLTRDVRSYVQKCVDNGKDVNLQFAIKAKTITSGLKY
SLATGNWGQANAAGTRAGVSQVLNRLTYASTLSHLRRLNSPIGREGKLAK
PRQLHNSQWGMMCPAETPEGQACGLVKNLALMVYITVGSAAYPILEFLEE
WGTENFEEISPAVIPQATKIFVNGCWVGVHRNPDMLVTTLRRLRRRVDVN
TEVGVVRDIRLKELRIYTDYGRCSRPLFIVEKQRLLIKKKDIHALQQRES
PEDGGWHDLVAKGFIEYIDTEEEETTMISMTINDLVQARVNPEEAYSETY
THCEIHPSLILGVCASIIPFPDHNQSPRNTYQSAMGKQAMGIYVTNYQFR
MDTLAYVLYYPQKPLVTTRAMEHLHFRQLPAGINAIVAIACYSGYNQEDS
VIMNQSSIDRGFFRSLFFRSYRDEEKKMGTLVKEDFGRPDRANTMGMRHG
SYDKLDDDGLAPPGTRVSGEDVIIGKTTPISQEEAQGQASRYSRRDHSIS
LRHSETGIVDQVLLTTNADGLRFVKVRVRSVRIPQIGDKFSSRHGQKGTV
GMTYTQEDMPWTVEGITPDIIVNPHAIPSRMTIGQLIECIMGKVAAHMGK
EGDATPFTDVTVDNISRALHKCGYQMRGFETMYNGHTGRRLSAMIFLGPT
YYQRLKHMVDDKIHSRGRGPVQILTRQPAEGRSRDGGLRFGEMERDCMIA
HGAAHFLKERLFDQSDAYRVHVCERCGLIAIANLKKNSFECRGCKNKTDI
VQVYIPYACKLLFQELMAMAIAPRMLTKEPPKDQKKKGA*
back to top