Tc04v2_t011400.1

Overview
NameTc04v2_t011400.1
Unique NameTc04v2_t011400.1
TypemRNA
OrganismTheobroma cacao (cacao)
Sequence length4386
Properties
Property NameValue
Model evidenceSupporting evidence includes similarity to: 1 EST, 23 Proteins, and 99% coverage of the annotated genomic feature by RNAseq alignments, including 1 sample with support for all annotated introns
Producthistone acetyltransferase HAC12
NoteHistone acetyltransferase of the CBP family 1, putative
Cross References
External references for this mRNA
DatabaseAccession
GeneID18602114
GenbankXM_007033300.2
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameSpeciesType
Tc04v2_g011400Tc04v2_g011400Theobroma cacaogene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameSpeciesType
Tc04v2_p011400.1Tc04v2_p011400.1Theobroma cacaopolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
exon-auto220254auto220254Theobroma cacaoexon
exon-auto220255auto220255Theobroma cacaoexon
exon-auto220256auto220256Theobroma cacaoexon
exon-auto220257auto220257Theobroma cacaoexon
exon-auto220258auto220258Theobroma cacaoexon
exon-auto220259auto220259Theobroma cacaoexon
exon-auto220260auto220260Theobroma cacaoexon
exon-auto220261auto220261Theobroma cacaoexon
exon-auto220262auto220262Theobroma cacaoexon
exon-auto220263auto220263Theobroma cacaoexon
exon-auto220264auto220264Theobroma cacaoexon
exon-auto220265auto220265Theobroma cacaoexon
exon-auto220266auto220266Theobroma cacaoexon
exon-auto220267auto220267Theobroma cacaoexon
exon-auto220268auto220268Theobroma cacaoexon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
CDS-auto220269auto220269Theobroma cacaoCDS
CDS-auto220270auto220270Theobroma cacaoCDS
CDS-auto220271auto220271Theobroma cacaoCDS
CDS-auto220272auto220272Theobroma cacaoCDS
CDS-auto220273auto220273Theobroma cacaoCDS
CDS-auto220274auto220274Theobroma cacaoCDS
CDS-auto220275auto220275Theobroma cacaoCDS
CDS-auto220276auto220276Theobroma cacaoCDS
CDS-auto220277auto220277Theobroma cacaoCDS
CDS-auto220278auto220278Theobroma cacaoCDS
CDS-auto220279auto220279Theobroma cacaoCDS
CDS-auto220280auto220280Theobroma cacaoCDS
CDS-auto220281auto220281Theobroma cacaoCDS
CDS-auto220282auto220282Theobroma cacaoCDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc04v2_t011400.1 ID=Tc04v2_t011400.1|Name=Tc04v2_t011400.1|organism=Theobroma cacao|type=mRNA|length=4386bp
ATGAACCAGCAGACTTTGGAGTACCCTTTCAAAGTTCTTAATGTAAATCC
TACTCCCTCGTTCATGCCCAACAGAATGGTGATGCCAACTTCGGGCTTGG
TGCAAGATGGCAACTTAGCCATTCCTGATGCTTTAAAAGTTCTTAATGTA
AATCCTACTCCCTCATTCATGTCGAACAGTATGATGATGCCAACTTCAGG
TTTGGTACAAGATGGCAATTTAGCCATTCCTGGTGGACTTTTTAATCAAA
ATCAGTTAAATGATGTGACAATGGATTACAAAGAGATTGTTGATGACCCT
AATCAATTTGGCTTCTTAAAAGAATTCTTTGGTGATAGACACAAGGCACA
AACTGCTGGATACACCACTATGGATTGGAATATTGCGCCAGCCTTGGGTA
TGCTGCCGGTTGAGGGTGAGATGGTCCTTTCCTCCGGACAGGCCTCAACC
ATCACTAGTTGTTATGGTGGGGATGGCTTTTCTGATGTCGGATCTCTTGA
AAGTAGGCCACCCTTTTCAAAGGGAAAATTACTACATCTTTATGATGGCA
AAATAAATATGATGGATCACATAGGTTGGTTGGGGAATAATCAGAGCTCT
GATTCCATGGTTTTCCCACATATACTAGGAGGATCCTTGGCATTTTCTGA
AGCAAATACACAGACCTCGCAGGAAATTTTACAGGAACTTTCTGAAGTGC
CTGATATATTGCCTGGTCTGAACTCAGCCATGGCTAGTTCAACTATTATT
CCGTACATGCAGTCTTCTAGACCATCTGAGGCTGAAGCAAAATGTTCTTT
CACTGGAAAAAATCAATCTTACTGTCCAGTAGAGGCAAAAACTGCAGGTC
ATTTTCCTCAGCTCCCTTTAGAAAATGCACCAGCAGATTCAAGGAATCAA
TTGACATGGGCTATCCAGCATAGGGTTTTACTTGCTTATATCCAGTACAA
GAAATCAATGGTGATTATAGGCAATTCTCAAGTTTCTTTCGTGAACCATA
TGCATTCTGCAACCTGCAACAAACATGCATGTAAATGTGAACAATTTTTC
TCACTTGTATCACATTTTGATGGCTGTCATGATGCTGATTGTAATATATG
CAGTCCTGTTTGGTATAGTTGTGTCACCAATAAACCTCACCCTAAGTTTG
AACGTGTAAAAAGAGGTCTTTTAAGGGATGGAGATTCTGACCAGCCCAGC
TGTGGTAGTTCAGAAACCATGCAACCTTCTTTGAAGCGTTTGAAGGTAGA
AAATCCTCTTTGTCCTAGCTTGACTGAGAATGGTATATGTTGTGCAAAGG
CTCCACTGAAGGTTCAACCATGTTATGCCAAGCTTCCACCCTTGCGGCAG
TTGCCAGAATCTCCTGTATCTAATAATTCTGAGGTTATGGAGGTGAACAT
GGAATTGCTACCAAAGCTTATAGAAGCTTCTATGAGCACTAAAGATATTA
GTTATAATGTTGCAGATAATTTTCCTATATTGCCTACTGAGAATTTGCCG
GGTGCTTCAGAAGTGGTTGTCTGCAGTTATAAATTGGAGGAAACAGATGC
TGTTGGCAGTGAAAAAGAAGGGGGTATGGACTTCAGAAGTGATACCGATA
TTGCAGACAATGTAATAGATCACTCCAACATTTTGGAATCCAATACTTTG
CCCAGCTTCTCTGAAGGACTTGCTGCTGGTTATGAAGAGGAAGAAACAGA
AGCCAGGACTAATTCCAACCAGGCAGAGCTAGCAATAGAGAATGAGCTCA
TTACACAAGAATCAAATTGCGGAAAGGAACTTTCTGCTGGTTGTGAAGAG
GGAGAAACAGAAGCCACAACTAATTCCAACCAGGCAGCTCTAGCAATAGA
GGATGAGCTCATTGCACAGGAATCAAATTGTGGAAAGGAACTTGATGCTG
GTTGTGAAGATGGAGAAACAGAAGCCAAGACTAATTCCAACCTGGCAGAG
CTAGCAATGGAGAATAAGCTCATTGCACCGGAATTGAATTGTGGAAAGGA
AATAGAGTTGGAGAGTCAAACAATAAGGGGTTTGTCCTTAATTGAAAATT
TCACAGCCCAGCAAATAAAGGAACATATATCAAGTCTCAGGCAGTGCATA
GATCAGGATATACCAAAGAAAGAAAGGGGAAAGAGAATAAGCAATGTCTA
CAGTGAGAACTCATGCCAGTTGTGTGGAGCAGATAAGCTTTCACTTGCCC
CAGCACCAATATATTGTTCATCATGTGGTAATCGTATCAGGCGCAGTGCA
AACTATTATATCACACCTGAAGAAAAGGACATCAGAATTTGTCTTTGTAC
CTCATGCTATAAGGTATCTCGGGGGAGGAGCATCGTGTTTTCTGGGATTG
CTCTTTCCAAGGCAAAGCTGGATAAAATTAAGAACGAGGAGGAAGCTGAA
GAATCGTGGGTTCAGTGTGATAAATGTGAAGGCTGGCAACACCAGATATG
TGCCCTCTTTAATGATAAAAATGATATGGAAGGAAAAGCTCAGTTCATCT
GCCCAATATGCTGCCTAAAAGAAATTCAAAGTGGAGAACGTATGCCCCCA
CTGATGAGTACTGTTTTTGGTGCAAAAGATCTCCCGTGTACCATGCTTAG
TGACCACATAGAGCAAAGACTCTTTAGGCGTCTTCAAAAAGAAAGAGAAG
AGAAAGCAAGGGTTACAGGAAAGCGCATTGATGAGGTTCCTGAAGCAGAA
GGTCTTGTTGTTAGAGTGGTCGTATCTGTTGACAAACATGTAAAAGTGAA
GAAGCAGCTTTTAGAAATAGTTCAGAATGAGAACTACCCTGCTGAGTTTC
CGTACAAGTCAAAGGTTATTCTTTTGTTTCAGAAGATTGACGGGGTAGAT
GTATGCCTTTTTAGCATGTATGTCCAGGAGTTTGGCTCAGAATGTGGTCA
CCCAAATCAACGCTGTGTTTATATTGCATATCTTGATTCTGTGAAGTACT
TTAGGCCTGAGACAAAAACTGCAGCTGGAGAAGCTCTTCGAACTGTTGTT
TACCATGAAATATTGATTGGATACCTTGAATACTGCAAGAAACGAGGGTT
TGCAACCTGCTATTTATGGGCCTGTCCACCTTTGAAAGGAGAAGATTATA
TCTTAAACTGCCACCCAGAGATTCAGAAAACGCCAAAGACCGATAAGCTG
CGGCAGTGGTATCAGTTCATGCTACAAAAGGCTGCTAAAGAGAAAGTGGT
GGTTGGTTTGACAAACTTGTATGATCACTTTTTTGTTTCCACTGGGAAAT
ACAACTCCAAGGTGACAGCAGCTCATTTGCCATATTTTGATGGTGACTAC
TGGTCTGGTGCTGCTGAGGATGTGATAAATAATATTGAGAAAGCAAGTTC
AGAAGACCCAAAAAAGATGGGCAAAAGAATAATGTCAAAGAGAACATTGA
AAGCTATGGGACACACAAATCCTTCTGGTGATGCCACTAAGGATATTCTG
CTGATGCAAAAGCTGGGGCAAACTATTTTACCTATTAAGGAGGACTTTAT
CATTGCCCACTTGCAGTTTGTGTGCATACATTGTCATAGAGCTATACTAT
CTGGATGGCGATGGTTTTGCAGCCTGTGTAAAGGCTTTCAGCTATGTGAA
AGGTGCCATGATGCAGAGCAAAATGTCTACAAGGATTGCTCTCACACTTT
ATGTAATGGGGAAAAACACGCACTGTGTAAGATTATGGTGGATGATGTGC
CTTCTGATACTGATGATACAGATGCCAGTATGGATAATGGTTTATTTGGA
AATAGGCATAGTTTTTTGAGCTTCTGTCAGAAGAACAGTCATCAGTTTGA
CACACTTCGTCGGGCCAAGCATTCCTCAATGATGATCCTACATTACCTTC
ACAATTCAACCTTGCTGACTGCTGAGACCACCTGTATTATTTGTTACAAG
GACACACCAATGGACCAGTCCTGGCTATGTGAGATCTGCCCCAATGTTGC
TGTTTGTGCTGCATGTTACCGAAGAGATGGTTGTTCTTTGCATATTCATA
AGTTGATTCTGCATTGTTCTGCAGTTGATTCTGCGACCAAAAATAGAGAG
GCCAAGAAGAAGGAATTACTGAAAATGCGACTGCTGGATGTTTTGCTGCA
TGCCTGTCAATGTCGCTCCCCCTGCTCCTACCCTAATTGTCTTCTCATCA
AAAAGCTATTCTTCCATGCAAAAAAGTGCACTGTCAGGATTTCTGGGGGT
TGTGAGCATTGTAAGAAGATGTGGCTCATATTGAGACTGCACTCCAGAAA
TTGCAAAGACTCTGATTGTGACGTACCACGCTGCAGGGATTTAAAGCAAC
ATGTCAACAGCCGTCTGCAACAATTGGAAGAGGCTGCACATGAAGAACCA
CCGATCGTACCTGATCAGATGGGTCAGAGAATTTAA
back to top

protein sequence of Tc04v2_p011400.1

>Tc04v2_p011400.1 ID=Tc04v2_p011400.1|Name=Tc04v2_p011400.1|organism=Theobroma cacao|type=polypeptide|length=1462bp
MNQQTLEYPFKVLNVNPTPSFMPNRMVMPTSGLVQDGNLAIPDALKVLNV
NPTPSFMSNSMMMPTSGLVQDGNLAIPGGLFNQNQLNDVTMDYKEIVDDP
NQFGFLKEFFGDRHKAQTAGYTTMDWNIAPALGMLPVEGEMVLSSGQAST
ITSCYGGDGFSDVGSLESRPPFSKGKLLHLYDGKINMMDHIGWLGNNQSS
DSMVFPHILGGSLAFSEANTQTSQEILQELSEVPDILPGLNSAMASSTII
PYMQSSRPSEAEAKCSFTGKNQSYCPVEAKTAGHFPQLPLENAPADSRNQ
LTWAIQHRVLLAYIQYKKSMVIIGNSQVSFVNHMHSATCNKHACKCEQFF
SLVSHFDGCHDADCNICSPVWYSCVTNKPHPKFERVKRGLLRDGDSDQPS
CGSSETMQPSLKRLKVENPLCPSLTENGICCAKAPLKVQPCYAKLPPLRQ
LPESPVSNNSEVMEVNMELLPKLIEASMSTKDISYNVADNFPILPTENLP
GASEVVVCSYKLEETDAVGSEKEGGMDFRSDTDIADNVIDHSNILESNTL
PSFSEGLAAGYEEEETEARTNSNQAELAIENELITQESNCGKELSAGCEE
GETEATTNSNQAALAIEDELIAQESNCGKELDAGCEDGETEAKTNSNLAE
LAMENKLIAPELNCGKEIELESQTIRGLSLIENFTAQQIKEHISSLRQCI
DQDIPKKERGKRISNVYSENSCQLCGADKLSLAPAPIYCSSCGNRIRRSA
NYYITPEEKDIRICLCTSCYKVSRGRSIVFSGIALSKAKLDKIKNEEEAE
ESWVQCDKCEGWQHQICALFNDKNDMEGKAQFICPICCLKEIQSGERMPP
LMSTVFGAKDLPCTMLSDHIEQRLFRRLQKEREEKARVTGKRIDEVPEAE
GLVVRVVVSVDKHVKVKKQLLEIVQNENYPAEFPYKSKVILLFQKIDGVD
VCLFSMYVQEFGSECGHPNQRCVYIAYLDSVKYFRPETKTAAGEALRTVV
YHEILIGYLEYCKKRGFATCYLWACPPLKGEDYILNCHPEIQKTPKTDKL
RQWYQFMLQKAAKEKVVVGLTNLYDHFFVSTGKYNSKVTAAHLPYFDGDY
WSGAAEDVINNIEKASSEDPKKMGKRIMSKRTLKAMGHTNPSGDATKDIL
LMQKLGQTILPIKEDFIIAHLQFVCIHCHRAILSGWRWFCSLCKGFQLCE
RCHDAEQNVYKDCSHTLCNGEKHALCKIMVDDVPSDTDDTDASMDNGLFG
NRHSFLSFCQKNSHQFDTLRRAKHSSMMILHYLHNSTLLTAETTCIICYK
DTPMDQSWLCEICPNVAVCAACYRRDGCSLHIHKLILHCSAVDSATKNRE
AKKKELLKMRLLDVLLHACQCRSPCSYPNCLLIKKLFFHAKKCTVRISGG
CEHCKKMWLILRLHSRNCKDSDCDVPRCRDLKQHVNSRLQQLEEAAHEEP
PIVPDQMGQRI*
back to top