Tc08v2_t008160.4

Overview
Name: Tc08v2_t008160.4
Unique Name: Tc08v2_t008160.4
Type: mRNA
Organism: Theobroma cacao (cacao)
Sequence length: 3627
Properties
Property Name: Value
Note: Set domain protein, putative isoform 1
Model evidence: Supporting evidence includes similarity to: 1 EST, 2 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 5 samples with support for all annotated introns
Product: histone-lysine N-methyltransferase ATXR7, transcript variant X4
Cross References
External references for this mRNA
Database    Accession
GeneID      18592056
Genbank     XM_018126069.1
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name      Unique Name       Species            Type
Tc08v2_g008160    Tc08v2_g008160    Theobroma cacao    gene


The following polypeptide feature(s) derive from this mRNA:

Feature Name        Unique Name         Species            Type
Tc08v2_p008160.4    Tc08v2_p008160.4    Theobroma cacao    polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name       Unique Name    Species            Type
exon-auto400621    auto400621     Theobroma cacao    exon
exon-auto400622    auto400622     Theobroma cacao    exon
exon-auto400623    auto400623     Theobroma cacao    exon
exon-auto400624    auto400624     Theobroma cacao    exon
exon-auto400625    auto400625     Theobroma cacao    exon
exon-auto400626    auto400626     Theobroma cacao    exon
exon-auto400627    auto400627     Theobroma cacao    exon
exon-auto400628    auto400628     Theobroma cacao    exon
exon-auto400629    auto400629     Theobroma cacao    exon
exon-auto400630    auto400630     Theobroma cacao    exon
exon-auto400631    auto400631     Theobroma cacao    exon
exon-auto400632    auto400632     Theobroma cacao    exon
exon-auto400633    auto400633     Theobroma cacao    exon
exon-auto400634    auto400634     Theobroma cacao    exon
exon-auto400635    auto400635     Theobroma cacao    exon
exon-auto400636    auto400636     Theobroma cacao    exon
exon-auto400637    auto400637     Theobroma cacao    exon
exon-auto400638    auto400638     Theobroma cacao    exon
exon-auto400639    auto400639     Theobroma cacao    exon
exon-auto400640    auto400640     Theobroma cacao    exon


The following CDS feature(s) are a part of this mRNA:

Feature Name      Unique Name    Species            Type
CDS-auto400641    auto400641     Theobroma cacao    CDS
CDS-auto400642    auto400642     Theobroma cacao    CDS
CDS-auto400643    auto400643     Theobroma cacao    CDS
CDS-auto400644    auto400644     Theobroma cacao    CDS
CDS-auto400645    auto400645     Theobroma cacao    CDS
CDS-auto400646    auto400646     Theobroma cacao    CDS
CDS-auto400647    auto400647     Theobroma cacao    CDS
CDS-auto400648    auto400648     Theobroma cacao    CDS
CDS-auto400649    auto400649     Theobroma cacao    CDS
CDS-auto400650    auto400650     Theobroma cacao    CDS
CDS-auto400651    auto400651     Theobroma cacao    CDS
CDS-auto400652    auto400652     Theobroma cacao    CDS
CDS-auto400653    auto400653     Theobroma cacao    CDS
CDS-auto400654    auto400654     Theobroma cacao    CDS
CDS-auto400655    auto400655     Theobroma cacao    CDS
CDS-auto400656    auto400656     Theobroma cacao    CDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc08v2_t008160.4 ID=Tc08v2_t008160.4|Name=Tc08v2_t008160.4|organism=Theobroma cacao|type=mRNA|length=3627bp
ATGGTTTCCTCAACATCCCCCTTTGACGAATATGATCACGTTCACGATTA
CCCTTTCTTTTCCAGAAAGCGGCTTAAAGTTTCAGATCGCAGATCAAATA
TCTATACAGGCCTCTCTCCTGATTCTGCATCTTCGATTTGCGGTGATGAA
CGATCCGCAACGGAGATGAGTTGCCAATCTAATGGCAATAGCAGTGGTGT
TCCTCAATCTTGTAATGACGGTGGAGGTTCATGCCAAGACAAGAGTTACT
CCAGCTATGCACCGTCTTCTTTTGCGAGTGGATGGATGTATGTAAACGAA
CATGGACAGATGTGTGGCCCTTACATCCAGCAGCAATTATACGAGGGTTT
ATCTACTGGTTTCCTGCCTGATGAGCTCCCTGTTTATCCTGTAGTCAATG
GGACTGTGAGTAATCCTGTTCCCTTAAAATACTTCAGGCAGTTCCCTGGT
CATGTCGCAACTGGATTTGTGTATCTCAGTTCAACCACAGCTTCCAATTG
TTTGAAGTCTTCCCATACAAATTTCCAGCACACCCTTTCCCAATCACAAA
TTAACCGCAATGGTTTTGATGCATCCAATGACCTCATTTCAAGCTCTCTT
TTGCAGTCAGGTGAAGATGCATGTTGGTTGTATGAGGATGATAAGAGTAC
GAAACATGGGCCTCATTCCCTTTTACAACTATATTCTTGGCATCGCTATG
GGTATCTTGCAGATTCTGTTATGATACATCATGCTGAAAACAGGTTTCGC
CCCATTAAGTTGCTGTCTGTTTTAAATGCTTGGAAAGGTAGTCAAGCTTA
TGCTGCTGAAAATGAACGGGACTTATCAGTGAACTTCATATCTGATATTT
CTGAAGAAGTTTCTTCTCAGCTCCATTCTGGGATTATGAAAGCAGCTCGT
AGAGTTGTGCTAGATGAAATAATCAGCAATATGATCTCAGAGTTTGTTAC
TGCAAAAAAATCTCAGAGACATCTAATGGTTGAATCATTCAACCAGGATG
CTAAAAGGTTTCCTGATGGAAAAAGGATTGAAAATGCCCCCGAAATAAAA
ATGCAGTGTATTCCCATGTTTGAGACGGCAGCCTCCCACAATGTATCTGA
CCAGCCATGCATTCAAGAATCTACATGTTCTCCTGCAAGTATAAAATCTG
TTGGAAGCATTGAAAATTTTTGGGGTTCTTATACAGTTGTTTGTAAGATG
CTTTTTGACTACTGCATGCAAGTTATGTGGAATGCTGTCTTTTATGACAG
TATAGCCGAGTATTCGTCTTCCTGGAGAAGGGGAAAACTTTGGTTTGGTC
ACCCTAATGTTATGCTGTCTGCTACTGACTCCAGGGATCATGGCAACGAG
ACTGAAAAAGTAACAGATAAACCTCTCTTATCTGGGATGGAATTGATTGC
TCATGACGTTGATTGTCCACCTGGTTTTGAGCTGGCAACAGTTGCTGGAG
TTGATTCTGCAGAAAAGTCATCTAAATCTTCATATGTTGTGCAGCAAATT
TTATCCAAACAGAAAACCCGATTGTGCAATAATGGCCTGTATGATGACAT
GGAATGCATCCTTGAAGGTGTTGAAAATGAGCTCCATTTATCTGTGAAGG
TGTTTATGGCCAAGTATGTTGACAATTTTGTTAAAAGTGAAGCAAGAAGA
GTGATTGGTTTGGAAAATGATGACAAATCGAAGGAAAATCTTGATGATGA
AGAAGCAGAGAAATCAGTTAATTTTTCAATAGATGATGAATTGAAAGAAT
TACAAAAGTTGCAAGATGCTGTTGGATCTTCCAGTCAATGCCATCTTGCT
TTAGAGTTTGATACTTTAGACATTTGTGGAGAGAAAAGGGTCAGTTTAAG
CAGAATGTCTGATTTATCTGGCAATCTACAGAATCCATTACAATCTTGGA
CACCCATTTGTCAGTCTGTGTCTGAAAATTTGTATGTTACAAGGCAGGAA
ACTTTCATGGCAGGTGCATTTAAGAGTTTGTTTTCACATTTAGGGGACGT
AATTGATGAACTAGAAGTTGATGAGCCACCACCTCCTGGACTTGAGGGTA
ATGCTGGGACGCTTGTTCCATCACACCTTTGTAAGTTTCGACCTTCAAGG
TCAGATGAGCGTAGCCCTAAGATTGGAGAATATGTTGCCGTGGCAATGTG
TCGGCAGAAGCTCCATGAGGATGTACTAAGAGAGTGGAAATCATCTTTTA
TTGATGCTACTCTTTATCAGTTTCTTACATCATGGCGTAGTTTGAAGAAA
CGCTGTAAGGCTGATAGCAAAGAGGAAAGGGCATTTAGTGTAGGAAGGGA
AATTCTCGCTGATTCTTCTGCCATAGGAGATAAGCTCAGGGAGAGGTCAA
AGAAGTCTCAGAGTTCAGGCTCCTCAGAAGTATCTTTAGTTACTGGTAAA
TATACATATTACCGCAAGAAAAAGTTGGTTCGTAAGAAGATAGGATCTAC
TCAGTCCACTATTGTCAATGGGTCACAGAATCATCCTGTTGAAAGGCCTC
GGAAAAAAGAGGCTTCCAGAAATTTGTTGGATCATGCAGATCCAGAACCA
ACTGCGGCCACTTCTAAAAAGAGTGCAGGTGGTCGCAAAAAAACCAAGGT
TACCCTTGCTGTTCAAAAAAATTTGGTCGGAGAAGGTGCGGTTCAAGTCT
CCAGGGAGAGAGCATCAACCTCTCAGAATTGTGATGTTAAGAAGGTTGTT
GGCAGGACTAACCATATTGTTGGAAGTGAAGTAGAGCTCACTAATGATTC
CCACAAGAAGACACTAAAAGCTCCCAAGGTATCAAGGGTAAAAAGGAAGC
AATTAGATAATGATGAGCCTCCATTGCTTCCAACCAAGGTACAGAAAGTG
GCAAATTCTGCTAGCAAGCATCCTTCTTCTAGAGGGAATGCAGATCGAAA
TACCCATTCAATTAGATCCAGGACAGCAAATTCCTGTCCCAGATCTGATG
GATGTGCGCGTTCTTCAATTAATGGCTGGGAGTGGCATAAATGGTCACTC
AATGCAAGTCCTGCTGAAAGAGCTCGTGTTAGAGGAATTCAGTGTACACA
CATGAAATATTCAGGCTCTGAGGTTAATAATATGATGCAGTTGTCAAACG
GTAAAGGTCTTTCTGCAAGAACAAACAGAGTGAAGCTGCGCAATCTTCTT
GCTGCTGCAGAGGGTGCTGATCTCTTAAAAGCAACTCAGTTGAAGGCAAG
GAAAAAGCGTCTACGTTTTCAGCGAAGCAAGATTCACGATTGGGGTCTCG
TTGCGCTTGAGCCAATTGAGGCTGAGGATTTTGTCATTGAATATGTTGGA
GAGTTGATTCGTCCCCGGATATCTGATATACGTGAACACTATTATGAGAA
GATGGGAATTGGTAGCAGTTATCTGTTTAGGCTTGATGATGGATACGTGG
TTGATGCTACAAAGCGTGGTGGGATTGCTAGATTTATAAATCATTCTTGT
GAGCCTAACTGTTACACAAAAGTTATTAGTGTTGAGGGCCAGAAGAAGAT
TTTCATCTATGCAAAACGGCACATAGCAGCTGGTGAAGAAATTACTTACA
ACTACAAGTTCCCTTTGGAGGAGAAAAAAATTCCTTGCAACTGTGGTTCA
AAGAAGTGTCGTGGATCTTTAAACTAG

protein sequence of Tc08v2_p008160.4

>Tc08v2_p008160.4 ID=Tc08v2_p008160.4|Name=Tc08v2_p008160.4|organism=Theobroma cacao|type=polypeptide|length=1209bp
MVSSTSPFDEYDHVHDYPFFSRKRLKVSDRRSNIYTGLSPDSASSICGDE
RSATEMSCQSNGNSSGVPQSCNDGGGSCQDKSYSSYAPSSFASGWMYVNE
HGQMCGPYIQQQLYEGLSTGFLPDELPVYPVVNGTVSNPVPLKYFRQFPG
HVATGFVYLSSTTASNCLKSSHTNFQHTLSQSQINRNGFDASNDLISSSL
LQSGEDACWLYEDDKSTKHGPHSLLQLYSWHRYGYLADSVMIHHAENRFR
PIKLLSVLNAWKGSQAYAAENERDLSVNFISDISEEVSSQLHSGIMKAAR
RVVLDEIISNMISEFVTAKKSQRHLMVESFNQDAKRFPDGKRIENAPEIK
MQCIPMFETAASHNVSDQPCIQESTCSPASIKSVGSIENFWGSYTVVCKM
LFDYCMQVMWNAVFYDSIAEYSSSWRRGKLWFGHPNVMLSATDSRDHGNE
TEKVTDKPLLSGMELIAHDVDCPPGFELATVAGVDSAEKSSKSSYVVQQI
LSKQKTRLCNNGLYDDMECILEGVENELHLSVKVFMAKYVDNFVKSEARR
VIGLENDDKSKENLDDEEAEKSVNFSIDDELKELQKLQDAVGSSSQCHLA
LEFDTLDICGEKRVSLSRMSDLSGNLQNPLQSWTPICQSVSENLYVTRQE
TFMAGAFKSLFSHLGDVIDELEVDEPPPPGLEGNAGTLVPSHLCKFRPSR
SDERSPKIGEYVAVAMCRQKLHEDVLREWKSSFIDATLYQFLTSWRSLKK
RCKADSKEERAFSVGREILADSSAIGDKLRERSKKSQSSGSSEVSLVTGK
YTYYRKKKLVRKKIGSTQSTIVNGSQNHPVERPRKKEASRNLLDHADPEP
TAATSKKSAGGRKKTKVTLAVQKNLVGEGAVQVSRERASTSQNCDVKKVV
GRTNHIVGSEVELTNDSHKKTLKAPKVSRVKRKQLDNDEPPLLPTKVQKV
ANSASKHPSSRGNADRNTHSIRSRTANSCPRSDGCARSSINGWEWHKWSL
NASPAERARVRGIQCTHMKYSGSEVNNMMQLSNGKGLSARTNRVKLRNLL
AAAEGADLLKATQLKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVG
ELIRPRISDIREHYYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSC
EPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCNCGS
KKCRGSLN*
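
The two FASTA headers on this page carry their metadata as pipe-separated key=value fields after the record identifier. A minimal sketch of reading those fields and cross-checking the declared lengths follows. The function names parse_fasta_header and declared_length are hypothetical helpers introduced for illustration and are not part of any database tooling; the only inputs are the two header lines shown above.

```python
def parse_fasta_header(header: str) -> dict:
    """Split a '>id key=value|key=value' header into a dict of fields.

    Assumes the layout seen on this page: an identifier, one space,
    then '|'-separated key=value pairs.
    """
    line = header.lstrip(">").strip()
    record_id, _, rest = line.partition(" ")
    fields = {"record_id": record_id}
    for part in rest.split("|"):
        key, sep, value = part.partition("=")
        if sep:  # skip anything that is not a key=value pair
            fields[key.strip()] = value.strip()
    return fields


def declared_length(fields: dict) -> int:
    """Return the numeric part of the 'length' field (e.g. '3627bp' -> 3627)."""
    digits = "".join(ch for ch in fields.get("length", "") if ch.isdigit())
    return int(digits)


mrna = parse_fasta_header(
    ">Tc08v2_t008160.4 ID=Tc08v2_t008160.4|Name=Tc08v2_t008160.4"
    "|organism=Theobroma cacao|type=mRNA|length=3627bp"
)
protein = parse_fasta_header(
    ">Tc08v2_p008160.4 ID=Tc08v2_p008160.4|Name=Tc08v2_p008160.4"
    "|organism=Theobroma cacao|type=polypeptide|length=1209bp"
)

# Number of complete codons implied by the declared mRNA length.
codons = declared_length(mrna) // 3
print(codons, declared_length(protein))
```

With the headers above, the 3627-nucleotide length divides evenly into 1209 codons, which matches the length declared for the polypeptide. That declared value appears to count the trailing stop symbol (*) in the protein listing, and its "bp" suffix is simply how the page labels the field.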