Tc02v2_t001120.5

Overview
NameTc02v2_t001120.5
Unique NameTc02v2_t001120.5
TypemRNA
OrganismTheobroma cacao (cacao)
Sequence length5463
Properties
Property NameValue
Model evidenceSupporting evidence includes similarity to: 9 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 1 sample with support for all annotated introns
Productuncharacterized LOC18607098, transcript variant X2
NoteProtein MKS1
Cross References
External references for this mRNA
DatabaseAccession
GeneID18607098
GenbankXM_018115282.1
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameSpeciesType
Tc02v2_g001120Tc02v2_g001120Theobroma cacaogene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameSpeciesType
Tc02v2_p001120.5Tc02v2_p001120.5Theobroma cacaopolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
exon-auto81122auto81122Theobroma cacaoexon
exon-auto81123auto81123Theobroma cacaoexon
exon-auto81124auto81124Theobroma cacaoexon
exon-auto81125auto81125Theobroma cacaoexon
exon-auto81126auto81126Theobroma cacaoexon
exon-auto81127auto81127Theobroma cacaoexon
exon-auto81128auto81128Theobroma cacaoexon
exon-auto81129auto81129Theobroma cacaoexon
exon-auto81130auto81130Theobroma cacaoexon
exon-auto81131auto81131Theobroma cacaoexon
exon-auto81132auto81132Theobroma cacaoexon
exon-auto81133auto81133Theobroma cacaoexon
exon-auto81134auto81134Theobroma cacaoexon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
CDS-auto81135auto81135Theobroma cacaoCDS
CDS-auto81136auto81136Theobroma cacaoCDS
CDS-auto81137auto81137Theobroma cacaoCDS
CDS-auto81138auto81138Theobroma cacaoCDS
CDS-auto81139auto81139Theobroma cacaoCDS
CDS-auto81140auto81140Theobroma cacaoCDS
CDS-auto81141auto81141Theobroma cacaoCDS
CDS-auto81142auto81142Theobroma cacaoCDS
CDS-auto81143auto81143Theobroma cacaoCDS
CDS-auto81144auto81144Theobroma cacaoCDS
CDS-auto81145auto81145Theobroma cacaoCDS
CDS-auto81146auto81146Theobroma cacaoCDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc02v2_t001120.5 ID=Tc02v2_t001120.5|Name=Tc02v2_t001120.5|organism=Theobroma cacao|type=mRNA|length=5463bp
ATGGACTCTCAATTTCACGCCGGGGAGCCACCGTCAAAGCGTCAGCTACA
AATCCAAGGTCCACGCCCACCCCCTCTAAAGGTCAGCAAAGACTCCCATA
AAATCAAAAAACCACCTCACCCACCATCCCACGCCGCTGGTCCCGCAGCA
GCAGCAGCAGACCAACGCCGTCCCGAGCCAGTAATCATCTACGCCGTCTC
ACCAAAAGTCATCCACGCTGAAGAATCCGATTTCATGTCTATCGTGCAAC
GCTACACCGGACTTTCATCCGGTACCTTCTCCGGAGACGGCGACGTCTAC
CGGGCGGCGAGGTTGGCAGCGACGGAAAAGGCGAGTTCTAGCTCGAGAGA
GAAAATAGGGGACGTGGGGGTCGCGGGTGAAGGTGCGATGGAAGAGGGTT
TGATTCGAGCACTGCCAGGGATACTTTCTCCGCCGCCGGAGACGTTGCCA
GAGGTGGCGGTGGGATTCTTTTCCCCCGTTCGTACGCCAGGGGGGACGTT
CTTATCGCCGGCGGCGAGGTTGGCAGGGACGACAAAGGGGAATCCTAGCT
GGAGAGAGAAAATAGGGGACGTGGGGGTCGCGGGTGAAGGTGGGATGAAA
GAGGGTTTGATTCGAGCACCGCCAGGGATACTTTCTCCTGCGTCGGAGAC
GTTGCCAGCGGTGGCGGCGGGAACATTCTTTTCGCCCGCCGTAGCAGGGG
GGACGTTCTTATCGCGGGCGTCGGAGGCGAGAATGATGTCGGCAGACATC
ATATATGGTCAATCAAATAGTTGCAGTGTGAAAGATGAGTCAAAGCCTGC
GGGACTTGCAAACATAATGTGTAAGTCCTTGTCTGGCTATAACTTGGAGG
AGCAAAAGCTCACAGTCAGTGATGCTTCCAAGAAAAGTTCTGCAGGTGAG
TTACTACCCACAGAAGTTGAGTTTAAAAAGCCCAAATCCTCGCATTATCT
TGGCGAAAGCCTCCCAAGTAAGGACTCTGACTATGATTTCAAAGAGGAGC
AAGAGCTCAAAGTCAATGAAACCTTCAACAAAAGTTGCAGTGTGAAAGAT
GAGACAAAGCCTGCAGGACTTGCAATCGCAATGTTTAAGTCCTTGTCTGA
CTATAACTTGGAGAAGCAAAAGCTCACAGTCAGCGATGCTTCCAAGAAAA
GTGCTGCAGGTGAGTTACAACCCACAGAAGTTGAGTTTAAAAAGCCCAAA
TCCTCGCATTATCTTGGCGAAAGCCTCCCAAGTAAGGACTCTGACTATGA
TTTCAAGGAGGAGCAAGAGCTCAAAGTCAATGAAACCTTCAACAAAAGTT
GCAGTGTGAAAGATGAGACAAAGCCTGCAGGACTTGCAATCGCAATGTTT
AAGTCCTTGTCTGACTATAACTTGGAGAAGCAAAAGCTCACAGTCAGCGA
TGCTTCCAAGAAAAGTGCTGCAGGTGAGTTACAACCCACAGAAGTTGAGT
TTAAAAAGCCCAAATCCTCGCATTATCTTGGCGAAAGCCTCCCAAGTAAG
GACTCTGACTATGATTTCAAGGAGGAGCAAGAGCTCAAAGTCAATGAAAC
CTTCAACAAAAGTTGCAGTGTGAAAGATGAGACAAAGCCTGCAGGACTTG
CAATCGCAATGTTTAAGTCCTTGTCTGACTATAACTTGGAGAAGCAAAAG
CTCACAGTCAGCGATGCTTCCAAGAAAAGTGCTGCAGGTGAGTCACAACC
CACAGAAGTTGAGTTTAAAAAGCCCAAATCCTCGCATTCTCTTGGCGAAA
GCCTCCCAAGTAAGGACTCTGACTATAATTTCAAGGAGCAGGAGCTCAAA
ATCAATGAGGGCTCCGGCAAAAGGAACGCTAGGCGGAGCTGGACAGTATT
GGTTCACTCTGATATGGTCCTAGGGGAACTTCCTTCTGATGGTTGCAATT
GGAGGAAATATGGACAGAAGGATATTCTTAATGCAAGATTTCCAAGAGAA
TACTACAGGTGCGCACATCGACACACTCAAGGCTGTTTTGCTACAAAGGA
AGTCCAAAGAGAGGATGAAGATCCAATGTTCATCACTGCTACTTACAAAG
GAATGCACACTTGCACACTAGCCCCAGATTTGATGCCTCCAGGACCACCT
GAGATACTAGCTCCTTTGGATACTGTACTTGGCGCCGATGGAAATGACAA
AAAGGATTCACAATCTAATCTTCAGTCAAGTGTACACAGTCCTGACAATC
AAACTTGCATTTCGTCAACCAAGCTAACAAGTGAGCTTCCAAATTTAGGG
CTCAACCTGAATGTGTTTCCTGAGAAATCATTTAAGTCATATCCAACGTG
GAAAAACTTTTATGAAAATGAAGTGAGGAAGAATTGGAAAGTGCTGAACA
GGAAGAAGGATGTGCTACTCTTACTTTCATCTTATCCTATGATTATGATT
GACAAATCTGACACAGATAAATGGATTATTGATGTATTGGCAACCATGAG
ACATGTGAAATCAACAGAAAAGATTTTATTTGGTGTAGGGGTAGCGAAAC
ATTGGCCAGGAATGACTACGCTGCAGGAACTTTCTGGACGCTTGCAGAAG
TTGCTGGATGTTCCCTTGATGAATGATATTGAGGGAGTTTTACCGGTAGA
CTTGGTTGAGAACCTATATAGGACTACAGAAGCTGATTTAAGACCCCTGT
TGGAAGTGGAACAGAATATAATTAGCGGAAAGACATCAAAATCTAGAGGT
TCAGCAAGTAACAGTGAAGGGGCAGCAATGGAAGCAGAAAAAGAATTGCA
GCCAATGCCTGCAAAATGTAAAACTCTGGTAGAAGATACTGAGTTGCCAG
CAAAAGGAACACTGAATGTACCAGAAGAAATATTTGACTTGGCAATTTAT
TTAGCTGTTCGTCAGATCTTAAAATGTATAAACAGGGGATATATCTGGTG
TATTACTATCAGTGGAAGAGATAAGAAAAGGGTGCTAGGAGCAGTAAAGC
AACACCAAGATATAGTTTCCGAGTTTGGATATATCATTGTATTCACTGTG
TCAGAAGATCAAAGTGGGGCAAATGTTCACGGTGTCTTTCAACTGCAGAA
GGGTTTTTGGCTAGGTGGATGCTTTGATTCTGTTGACCTTACACATGAAT
ATTTTGACAACTTGTGCTCCCCAGGAATCTTATTGCTTACAGAGGATGAT
TACGATAAGAACATGAACTTGGATCAGTCTACACTCCCACTTTTGATAAA
CCTTAACAAGTTGGTTGACCATAAACATAGCGATTCAAGGTTCATAATCT
TTACTTCTAAAATGGCAACAGACATGGAGATAAGAATGGAGGATCATTTG
TTGTCATGGAAATTGTTTTGTAGGATTGTGGGTGAAGGTTTGCTTTCTCC
TAGTATCCAACAGATAGCAGCAAGTTTGGTGAAAGAATACCGTGGCAATC
TACTCGCCATCATTCTAACGGCCAGGTCCTTGGAGAAAGTTACTGATGAT
GTCAACTTGTGGGAACTTGCTGTTAAAAGATTGACCATGCTACCTCCATC
TCAAATAGAAGATATAGACAATGTCCTGATTAATGCATTAACATTCATTT
GGGAACGTATGAACAATAAAACAAGACATTGCATTAAGTTTTTCACGTGG
TATCCCAAGGGACAGAAAATTAACAGAGTCTCACTAATACAACATTGGAT
CCAAGATCGTCTGGTTGATACCCATGATGAAGGTACCAATATTATCCAAA
ATCTTGTTGATACATCCCTGCTTAATATTGTGGAGTTAAATGGGGTCCAA
CTGCGAAGAGAGATCTATGATGTATTAGTAAACCCACTAATTCTTCAAAT
GCATCCATTTTATCTTTTGCTAGGCAGGGCAAGATTGATTAAACCACCAG
AAGAAGAGGAATGGGATGCCAAAGTGATCAATTTGATGGATAATAAATTA
TCTGACCTGCCAGAATCTCCAAGGTCACCCTCACTAATTGCATTGTACCT
TCAGCGTAACTTGGATCTCATGACTATCCCATCTTGTTTCTTCAAGCACA
TGCCTTTGCTTCAAATCCTAGACTTATCACACACCAGCATCAAATCTTTG
CCAGAGTCACTTTCTAGTTTGGTTAACCTTCGAGAACTCCTTTTGAAAGG
CTGTGAACTCTTCATACGACTCCCTAGCCATGTTGGAGAACTGAAGAATC
TTGAGAAGCTTGACCTTGATGAAACTCAGATTATTGATCTCCCAGCAGAG
ATTGGACAACTTTCCAAATTAAAAATTTTGAGGGTCTCATTCTATGGATA
TATGAACTGTAGCAAAACAAGGTTGCGGCAAGATACAATAATTCCCCCTG
GAACAATATCAGGTCTCTCTGAATTAACTGAATTAAGCATTGATGTTGAT
CCGGATGATGAACGCTGGAATGCAACGGTGAAAGATATTATTGAGGAAGC
TTGCAACTTGAAAACTTTAAGACAGCTTAATTTGTACCTGCCAAACATCG
AAATATTGTGGAAACGCAGAACCGGTAGCGCATCATTGCTCCATTACCCT
TTGCCACGTTTTAGATTTACTGTCGGTTATCACAAGCGGCAGGTCATATC
TCGAGTACCGGAAGAAGTAGAAGCTCACTTCAATAAAAGCAACAAATGCT
TGAAGTTTGTCAAAGGCAATGATATCCCAGCTGAAATGAAAAAGGTTCTG
AACCACAGCACAGCTTTTTTCCTGGAAGGTCATGCTACCGCTAGGAGTTT
GTCTGATTTCGGAATTGAGAATACCAGGCTGCTAAAATGTTGCTTATTGA
CAGAATGTAATGGAGTCAAAACCATCATTGATTTGTCACAAGGTGGTGGA
CACTCACAAGTTTACACAAGAGGAAAAGGGAAGAGCGAGTCACTGAAGTT
TCCTGAAGAACAAACTGATGCACTTGGAAATCTACAAGACTTGAATATAT
ATTACATGAAGAATTTAGAGAGCATTTGGAAGGGGCCTGTTCATAAGCAC
TGCCTAGCTAGCCTGAAGTTCCTTGCACTTCATAAATGCCCCAGATTGAG
TACCATTTTCTCACTAGATTTGGTTGCTAATCTTGACAATTTAGAAGAGC
TCATTGTTGAACACTGCCCTCAACTGACCAGTCTTGTGAGCCCGACGGGT
CATGTGTCCAGTAACTCAACACCACAACCAAATTGCTTTTTTCCTAGCTT
GAAAAGAATATCACTGCTTTACGTGCCAAATCTTGTTAGCATTTCTAGTG
GTTTGTGGATTGCTCCAGAACTGGAAAAAGTAGGCTTTTACAATTGCCCA
AAGCTTAAGAGTCTTTCCGCGATGGAAATGTCAAGTGACCATTTGACGAG
GATCAAAGGAGAAAGTCACTGGTGGGAAGCATTGGAGTGGAAAAACTCAG
AGTGGGGGAACCCGCTGGATTATCTGCAGAGTATCTTTTCCCCACTTATT
AAGGAGAGAGATGTGAAGGCGCAATTGGCAGAAGAAGGAATTATGCACCA
TGCTTCAACTTAA
back to top

protein sequence of Tc02v2_p001120.5

>Tc02v2_p001120.5 ID=Tc02v2_p001120.5|Name=Tc02v2_p001120.5|organism=Theobroma cacao|type=polypeptide|length=1821bp
MDSQFHAGEPPSKRQLQIQGPRPPPLKVSKDSHKIKKPPHPPSHAAGPAA
AAADQRRPEPVIIYAVSPKVIHAEESDFMSIVQRYTGLSSGTFSGDGDVY
RAARLAATEKASSSSREKIGDVGVAGEGAMEEGLIRALPGILSPPPETLP
EVAVGFFSPVRTPGGTFLSPAARLAGTTKGNPSWREKIGDVGVAGEGGMK
EGLIRAPPGILSPASETLPAVAAGTFFSPAVAGGTFLSRASEARMMSADI
IYGQSNSCSVKDESKPAGLANIMCKSLSGYNLEEQKLTVSDASKKSSAGE
LLPTEVEFKKPKSSHYLGESLPSKDSDYDFKEEQELKVNETFNKSCSVKD
ETKPAGLAIAMFKSLSDYNLEKQKLTVSDASKKSAAGELQPTEVEFKKPK
SSHYLGESLPSKDSDYDFKEEQELKVNETFNKSCSVKDETKPAGLAIAMF
KSLSDYNLEKQKLTVSDASKKSAAGELQPTEVEFKKPKSSHYLGESLPSK
DSDYDFKEEQELKVNETFNKSCSVKDETKPAGLAIAMFKSLSDYNLEKQK
LTVSDASKKSAAGESQPTEVEFKKPKSSHSLGESLPSKDSDYNFKEQELK
INEGSGKRNARRSWTVLVHSDMVLGELPSDGCNWRKYGQKDILNARFPRE
YYRCAHRHTQGCFATKEVQREDEDPMFITATYKGMHTCTLAPDLMPPGPP
EILAPLDTVLGADGNDKKDSQSNLQSSVHSPDNQTCISSTKLTSELPNLG
LNLNVFPEKSFKSYPTWKNFYENEVRKNWKVLNRKKDVLLLLSSYPMIMI
DKSDTDKWIIDVLATMRHVKSTEKILFGVGVAKHWPGMTTLQELSGRLQK
LLDVPLMNDIEGVLPVDLVENLYRTTEADLRPLLEVEQNIISGKTSKSRG
SASNSEGAAMEAEKELQPMPAKCKTLVEDTELPAKGTLNVPEEIFDLAIY
LAVRQILKCINRGYIWCITISGRDKKRVLGAVKQHQDIVSEFGYIIVFTV
SEDQSGANVHGVFQLQKGFWLGGCFDSVDLTHEYFDNLCSPGILLLTEDD
YDKNMNLDQSTLPLLINLNKLVDHKHSDSRFIIFTSKMATDMEIRMEDHL
LSWKLFCRIVGEGLLSPSIQQIAASLVKEYRGNLLAIILTARSLEKVTDD
VNLWELAVKRLTMLPPSQIEDIDNVLINALTFIWERMNNKTRHCIKFFTW
YPKGQKINRVSLIQHWIQDRLVDTHDEGTNIIQNLVDTSLLNIVELNGVQ
LRREIYDVLVNPLILQMHPFYLLLGRARLIKPPEEEEWDAKVINLMDNKL
SDLPESPRSPSLIALYLQRNLDLMTIPSCFFKHMPLLQILDLSHTSIKSL
PESLSSLVNLRELLLKGCELFIRLPSHVGELKNLEKLDLDETQIIDLPAE
IGQLSKLKILRVSFYGYMNCSKTRLRQDTIIPPGTISGLSELTELSIDVD
PDDERWNATVKDIIEEACNLKTLRQLNLYLPNIEILWKRRTGSASLLHYP
LPRFRFTVGYHKRQVISRVPEEVEAHFNKSNKCLKFVKGNDIPAEMKKVL
NHSTAFFLEGHATARSLSDFGIENTRLLKCCLLTECNGVKTIIDLSQGGG
HSQVYTRGKGKSESLKFPEEQTDALGNLQDLNIYYMKNLESIWKGPVHKH
CLASLKFLALHKCPRLSTIFSLDLVANLDNLEELIVEHCPQLTSLVSPTG
HVSSNSTPQPNCFFPSLKRISLLYVPNLVSISSGLWIAPELEKVGFYNCP
KLKSLSAMEMSSDHLTRIKGESHWWEALEWKNSEWGNPLDYLQSIFSPLI
KERDVKAQLAEEGIMHHAST*
back to top