Tc03v2_t020090.4

Overview
NameTc03v2_t020090.4
Unique NameTc03v2_t020090.4
TypemRNA
OrganismTheobroma cacao (cacao)
Sequence length2787
Properties
Property NameValue
Model evidenceSupporting evidence includes similarity to: 2 ESTs, 4 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 3 samples with support for all annotated introns
Producthistone-lysine N-methyltransferase, H3 lysine-9 specific SUVH6, transcript variant X3
NoteSU(VAR)3-9, putative
Cross References
External references for this mRNA
DatabaseAccession
GeneID18606171
GenbankXM_018118242.1
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameSpeciesType
Tc03v2_g020090Tc03v2_g020090Theobroma cacaogene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameSpeciesType
Tc03v2_p020090.4Tc03v2_p020090.4Theobroma cacaopolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
exon-auto183152auto183152Theobroma cacaoexon
exon-auto183153auto183153Theobroma cacaoexon
exon-auto183154auto183154Theobroma cacaoexon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
CDS-auto183155auto183155Theobroma cacaoCDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc03v2_t020090.4 ID=Tc03v2_t020090.4|Name=Tc03v2_t020090.4|organism=Theobroma cacao|type=mRNA|length=2787bp
ATGGGCGTTTCCGATAATATGCTGCATAAAGAGACATTAAAAGTGGCAAG
TTGTAGTCATTCTGAGGGAAGGCTGGGAAGAGTGTCAACTGAAAATGGTC
ACTTTGCCCCTGCACCCAAGTATAAGCAGCGTAAAGTTTCTGTTGTTCGG
AATTTTCCACCAGGGTGTGGAAGGTTGGCTGCACCAATCGATAGACCAAG
TGAGCAAGCGGTGAAATCCCACCCTGGTGAGAGCAGTTTGGAGAAGACTT
CGGCAAGAAATTATCGTCCTCGTAGAGGAGTCACTGTTGTGAGAAACTTC
CCTCCATTTTGTGGAAGAAATGCTCCACCTCTTAGCGAGGAGGAGCGCAT
GAAATGGCTTACCTCTCTAAAGGACAAAGGTTTCAATCTAGAGAAGTTTG
TAAATGAGGAAAAGCCCTCAGAGAAGACTATATGTACTGATGTGAAACAA
GTGATAGAGGATGTTCAGGATGTCAATGCTCTAGAAGGTAAAATAGAAGG
TAGTGCTCCTACACTCTCTGCAGAAGAAATTCAATCTAAACCTGAAGAGC
TGGCTTCTGAAAAAATGAGGAAGCTATGTGCTTATGAAGCTTCATCCAGG
AATGATATGGATGAGGACAAGGAAGATATGAGAGAAAAGAGCATCAAGTC
TCCTTGCGAAACTTATCCAAATGAATTTGATAGCAAGTCCAAGCAAGTTA
GTGAGACAAGTGATGGATATGTTAGAGGTTTGGAGGAAAATCCAATACAT
GATATTGTAATCTATGCTGAGGACAAGAGTTTTGAGACAAAGCTTTCTGA
TTCACCTGCCTTTGAGGATCAATTGCTGGAGGAGGACTGTGGGAGTCAAG
AAGTTTTATTGGATGGGTCAATTGTGCAAGGCCTCATGGCTTCATCAACT
TGTCCTCTGCCACAAGGGAAAGTGACCTGCAAACGTGACCTGGGGGGTGT
TTCATTTAAAAGAAAAAGAAAGAACAATTTCATATTGCTACCAAGGGCAA
ACCATGCTTTAGTAGCAAATAAGAATGAAGCAGAGAGCCCTGAAGAAACA
TGTATTAAGAAGAATTCTTCTCCCACAAGGCCTTATAAAGGTCTTGGTCA
AGTGGTTATCAGGGACAAGGAAGAATCAATCCAACAGGATGGACTATACA
CAGATGATAATTTTGCTCTGAGATCATACAGTTATGATGTGAGTCTTCCT
CCTTCTTGTCCAAGTAGTGTGTGTCATGATAATGATGCAATTACTACTCG
GAACAAAGTGAGAGAGACATTACGCCTATTCCAAGCCATTTGTCGGAAGC
TTTTACAGGAAGAAGAATCAAAGTTGAATGGAGAAGGAAAGACCTTTAAG
AGGGTGGATATCCAAGCTGCAAAGATTCTCAAAGAGAAAGGGAAATACAT
TAACACAGGCAAACAGATCATTGGACCTGTACCAGGTGTTGAAGTTGGTG
ATGAGTTTCATTATTTTGTGGAGCTCAATATTGTTGGCCTTCATCGCCAA
AGTCAGGGTGGTATAGATTACGTAAAGCAAGGTGATAGGATCATTGCTAC
TAGTGTTATAGCATCAGGGGGCTATGACAATGACTTGGATAACTCAGATA
TCTTGACTTACATGGGTCAGGGAGGGAATGTTATGCAGAAAGGTAAGCAA
CCGGAAGACCAGAAACTTGAAAGAGGAAACCTTGCTTTGGCAAATAGCAT
ATTTGTTAAGAATCCAGTGAGGGTTATTCGCGGTGAGACAAGGTCTTCTG
ATTTGTTAGAAGGTAGGGGTAAAACATATGTTTATGATGGCCTCTATTTG
GTGGAGGAGTGTAAGCAAGAATCAGGACCACATGGTAAGCTTGTCTACAA
ATTTAAGCTGGTCAGAATTCCTGGTCAACCAGAGCTTGCTTGGAAAGTTG
TAAAAAAATCTAATAAATCTAAAGTGTGGGAAGGGCTGTGTGCACATGAT
ATCTCACAAGGGAAGGAGGTAATCCCCATTTGTGCTATAAACACCATAGA
TAGTGAAAAACCTCCACCATTTGTGTATGTACCTCACATGATCTATCCTG
ACTGGTGCCACCCTATTCCTCCCAAAGGTTGTGATTGTATTGATGGATGT
TCAGAATCTGGGAAATGTTCCTGTGCAATGAAGAATGGAGGAGAGATCCC
ATATAACCATAATGGGGCCATTGTTGAAGCAAAGCGCCTTGTCTATGAAT
GTGGTCCTACTTGCAAGTGTCCTGCTTCTTGCTATAATAGAGTGAGCCAG
CGTGGCATAAAATTTCAGCTTGAAATCTTTAAAACAGAATCGAGAGGCTG
GGGTGTTAGATCCCTAAATTCTATCCCTTCCGGAAGTTTCATCTGTGAGT
ATGCTGGAGAGCTCCTCGAAGATAGAGAAGCTGAAGAAAGAACAGGGAAT
GATGAGTATCTGTTTGATATTGGAAACAACTACAGTGAAAGTTCTCTGTG
GGATGGTCTTTCAACCCTAATGCCTGATGTGCATTCAAGTGTTTGCCAAG
TTGTGCAAGACAGTGGTTTTACCATCGATGCAGCACAGCATGGCAATGTA
GGGAGATTCATAAACCATAGTTGTTCACCTAATTTGTATGCACAAAATGT
CCTTTATGATCACGATGACAGGAGAATCCCACATATAATGCTCTTTGCTG
CTGAAAATATTCCTCCCTTGCAGGAGTTGACATACCATTACAATTATATG
ATAGATCAGGTTCGTGATGAGAATGGTAACATAAAGAAGAAATTTTGCTA
TTGTGGTTCTTCAGAGTGCACTGGTAGGCTGTATTGA
back to top

protein sequence of Tc03v2_p020090.4

>Tc03v2_p020090.4 ID=Tc03v2_p020090.4|Name=Tc03v2_p020090.4|organism=Theobroma cacao|type=polypeptide|length=929bp
MGVSDNMLHKETLKVASCSHSEGRLGRVSTENGHFAPAPKYKQRKVSVVR
NFPPGCGRLAAPIDRPSEQAVKSHPGESSLEKTSARNYRPRRGVTVVRNF
PPFCGRNAPPLSEEERMKWLTSLKDKGFNLEKFVNEEKPSEKTICTDVKQ
VIEDVQDVNALEGKIEGSAPTLSAEEIQSKPEELASEKMRKLCAYEASSR
NDMDEDKEDMREKSIKSPCETYPNEFDSKSKQVSETSDGYVRGLEENPIH
DIVIYAEDKSFETKLSDSPAFEDQLLEEDCGSQEVLLDGSIVQGLMASST
CPLPQGKVTCKRDLGGVSFKRKRKNNFILLPRANHALVANKNEAESPEET
CIKKNSSPTRPYKGLGQVVIRDKEESIQQDGLYTDDNFALRSYSYDVSLP
PSCPSSVCHDNDAITTRNKVRETLRLFQAICRKLLQEEESKLNGEGKTFK
RVDIQAAKILKEKGKYINTGKQIIGPVPGVEVGDEFHYFVELNIVGLHRQ
SQGGIDYVKQGDRIIATSVIASGGYDNDLDNSDILTYMGQGGNVMQKGKQ
PEDQKLERGNLALANSIFVKNPVRVIRGETRSSDLLEGRGKTYVYDGLYL
VEECKQESGPHGKLVYKFKLVRIPGQPELAWKVVKKSNKSKVWEGLCAHD
ISQGKEVIPICAINTIDSEKPPPFVYVPHMIYPDWCHPIPPKGCDCIDGC
SESGKCSCAMKNGGEIPYNHNGAIVEAKRLVYECGPTCKCPASCYNRVSQ
RGIKFQLEIFKTESRGWGVRSLNSIPSGSFICEYAGELLEDREAEERTGN
DEYLFDIGNNYSESSLWDGLSTLMPDVHSSVCQVVQDSGFTIDAAQHGNV
GRFINHSCSPNLYAQNVLYDHDDRRIPHIMLFAAENIPPLQELTYHYNYM
IDQVRDENGNIKKKFCYCGSSECTGRLY*
back to top