Tc02v2_t001130.4

Overview
Name: Tc02v2_t001130.4
Unique Name: Tc02v2_t001130.4
Type: mRNA
Organism: Theobroma cacao (cacao)
Sequence length: 4410
Properties
Property Name   Value
Model evidence  Supporting evidence includes similarity to: 4 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 2 samples with support for all annotated introns
Product         uncharacterized LOC18607098, transcript variant X9
Note            Uncharacterized protein
Cross References
External references for this mRNA
Database  Accession
GeneID    18607098
Genbank   XM_018115289.1
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Species          Type
Tc02v2_g001120  Tc02v2_g001120  Theobroma cacao  gene


The following polypeptide feature(s) derive from this mRNA:

Feature Name      Unique Name       Species          Type
Tc02v2_p001130.4  Tc02v2_p001130.4  Theobroma cacao  polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name    Unique Name  Species          Type
exon-auto81246  auto81246    Theobroma cacao  exon
exon-auto81247  auto81247    Theobroma cacao  exon
exon-auto81248  auto81248    Theobroma cacao  exon
exon-auto81249  auto81249    Theobroma cacao  exon
exon-auto81250  auto81250    Theobroma cacao  exon
exon-auto81251  auto81251    Theobroma cacao  exon
exon-auto81252  auto81252    Theobroma cacao  exon
exon-auto81253  auto81253    Theobroma cacao  exon
exon-auto81254  auto81254    Theobroma cacao  exon
exon-auto81255  auto81255    Theobroma cacao  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name   Unique Name  Species          Type
CDS-auto81256  auto81256    Theobroma cacao  CDS
CDS-auto81257  auto81257    Theobroma cacao  CDS
CDS-auto81258  auto81258    Theobroma cacao  CDS
CDS-auto81259  auto81259    Theobroma cacao  CDS
CDS-auto81260  auto81260    Theobroma cacao  CDS
CDS-auto81261  auto81261    Theobroma cacao  CDS
CDS-auto81262  auto81262    Theobroma cacao  CDS
CDS-auto81263  auto81263    Theobroma cacao  CDS
CDS-auto81264  auto81264    Theobroma cacao  CDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc02v2_t001130.4 ID=Tc02v2_t001130.4|Name=Tc02v2_t001130.4|organism=Theobroma cacao|type=mRNA|length=4410bp
ATGGAGAGCTGGGAACGTAAGAATGTTAGAAATGAACTAAGACAAGGGAG
GAAGCTAGCAAAGCAGCTCCAAGCAAATCTCAAACGGTCATCTTCCGAAG
AAAATCCTGAGTTAGTCCAAAAGATCGTATCGTCCTTTGAAAAGGCACTT
TCAATGCTTAACTGCAGCACCTCCTCTATGGCAGCCAAGCTGCAGCCCAT
AGCTCACAGCTCGCTTGCTCGTGATGGAAGTCACCAGAGTAAGGACCCTG
ACCATGATATCAAGGAGCAAGACTTCAAAGTTAAAGATTTCTTCATGAAA
AGAGAAGTAACTGAGTCACAGCCTAAAGGACTTGCAACTGAAATGTTTAA
ATTCCCGCCTCCTTTCAGTGGAAGCTTCTCAAGTGCGGGAAATTTGGAAG
CCTTTGACGGTAAGGCAGCCGAATTAAAGCCAACAATAATTGGGACCGAA
TTTGACCATGATTTCATGGAGGAGCATGAGCTCAAAGTCAATGAAACCTT
CAATAAAAGTTGCAGTGTGAAAGATGAGTCAAAGCCTGCAGGACATGCAA
AAGTAATGTTTAAGTCCTTGTCTGACTATAACTTGGAGAAGAAAAAGCAC
ACCGTCGGCGATGCTTCCAAGAAAAGTGCTGCAGGTGAGTCACAACCCAC
AGAAGTTGAGTTTAAAAAGCCCAAATCCTCGCATTCTCTTGGCGAAAGCC
TCCCAAGTAAGGACTCTGACTATAATTTCAAGGAGCAGGAGCTCAAAATC
AATGAGGGCTCCGGCAAAAGGAACGCTAGGCGGAGCTGGACAGTATTGGT
TCACTCTGATATGGTCCTAGGGGAACTTCCTTCTGATGGTTGCAATTGGA
GGAAATATGGACAGAAGGATATTCTTAATGCAAGATTTCCAAGAGAATAC
TACAGGTGCGCACATCGACACACTCAAGGCTGTTTTGCTACAAAGGAAGT
CCAAAGAGAGGATGAAGATCCAATGTTCATCACTGCTACTTACAAAGGAA
TGCACACTTGCACACTAGCCCCAGATTTGATGCCTCCAGGACCACCTGAG
ATACTAGCTCCTTTGGATACTGTACTTGGCGCCGATGGAAATGACAAAAA
GGATTCACAATCTAATCTTCAGTCAAGTGTACACAGTCCTGACAATCAAA
CTTGCATTTCGTCAACCAAGCTAACAAGTGAGCTTCCAAATTTAGGGCTC
AACCTGAATGTGTTTCCTGAGAAATCATTTAAGTCATATCCAACGTGGAA
AAACTTTTATGAAAATGAAGTGAGGAAGAATTGGAAAGTGCTGAACAGGA
AGAAGGATGTGCTACTCTTACTTTCATCTTATCCTATGATTATGATTGAC
AAATCTGACACAGATAAATGGATTATTGATGTATTGGCAACCATGAGACA
TGTGAAATCAACAGAAAAGATTTTATTTGGTGTAGGGGTAGCGAAACATT
GGCCAGGAATGACTACGCTGCAGGAACTTTCTGGACGCTTGCAGAAGTTG
CTGGATGTTCCCTTGATGAATGATATTGAGGGAGTTTTACCGGTAGACTT
GGTTGAGAACCTATATAGGACTACAGAAGCTGATTTAAGACCCCTGTTGG
AAGTGGAACAGAATATAATTAGCGGAAAGACATCAAAATCTAGAGGTTCA
GCAAGTAACAGTGAAGGGGCAGCAATGGAAGCAGAAAAAGAATTGCAGCC
AATGCCTGCAAAATGTAAAACTCTGGTAGAAGATACTGAGTTGCCAGCAA
AAGGAACACTGAATGTACCAGAAGAAATATTTGACTTGGCAATTTATTTA
GCTGTTCGTCAGATCTTAAAATGTATAAACAGGGGATATATCTGGTGTAT
TACTATCAGTGGAAGAGATAAGAAAAGGGTGCTAGGAGCAGTAAAGCAAC
ACCAAGATATAGTTTCCGAGTTTGGATATATCATTGTATTCACTGTGTCA
GAAGATCAAAGTGGGGCAAATGTTCACGGTGTCTTTCAACTGCAGAAGGG
TTTTTGGCTAGGTGGATGCTTTGATTCTGTTGACCTTACACATGAATATT
TTGACAACTTGTGCTCCCCAGGAATCTTATTGCTTACAGAGGATGATTAC
GATAAGAACATGAACTTGGATCAGTCTACACTCCCACTTTTGATAAACCT
TAACAAGTTGGTTGACCATAAACATAGCGATTCAAGGTTCATAATCTTTA
CTTCTAAAATGGCAACAGACATGGAGATAAGAATGGAGGATCATTTGTTG
TCATGGAAATTGTTTTGTAGGATTGTGGGTGAAGGTTTGCTTTCTCCTAG
TATCCAACAGATAGCAGCAAGTTTGGTGAAAGAATACCGTGGCAATCTAC
TCGCCATCATTCTAACGGCCAGGTCCTTGGAGAAAGTTACTGATGATGTC
AACTTGTGGGAACTTGCTGTTAAAAGATTGACCATGCTACCTCCATCTCA
AATAGAAGATATAGACAATGTCCTGATTAATGCATTAACATTCATTTGGG
AACGTATGAACAATAAAACAAGACATTGCATTAAGTTTTTCACGTGGTAT
CCCAAGGGACAGAAAATTAACAGAGTCTCACTAATACAACATTGGATCCA
AGATCGTCTGGTTGATACCCATGATGAAGGTACCAATATTATCCAAAATC
TTGTTGATACATCCCTGCTTAATATTGTGGAGTTAAATGGGGTCCAACTG
CGAAGAGAGATCTATGATGTATTAGTAAACCCACTAATTCTTCAAATGCA
TCCATTTTATCTTTTGCTAGGCAGGGCAAGATTGATTAAACCACCAGAAG
AAGAGGAATGGGATGCCAAAGTGATCAATTTGATGGATAATAAATTATCT
GACCTGCCAGAATCTCCAAGGTCACCCTCACTAATTGCATTGTACCTTCA
GCGTAACTTGGATCTCATGACTATCCCATCTTGTTTCTTCAAGCACATGC
CTTTGCTTCAAATCCTAGACTTATCACACACCAGCATCAAATCTTTGCCA
GAGTCACTTTCTAGTTTGGTTAACCTTCGAGAACTCCTTTTGAAAGGCTG
TGAACTCTTCATACGACTCCCTAGCCATGTTGGAGAACTGAAGAATCTTG
AGAAGCTTGACCTTGATGAAACTCAGATTATTGATCTCCCAGCAGAGATT
GGACAACTTTCCAAATTAAAAATTTTGAGGGTCTCATTCTATGGATATAT
GAACTGTAGCAAAACAAGGTTGCGGCAAGATACAATAATTCCCCCTGGAA
CAATATCAGGTCTCTCTGAATTAACTGAATTAAGCATTGATGTTGATCCG
GATGATGAACGCTGGAATGCAACGGTGAAAGATATTATTGAGGAAGCTTG
CAACTTGAAAACTTTAAGACAGCTTAATTTGTACCTGCCAAACATCGAAA
TATTGTGGAAACGCAGAACCGGTAGCGCATCATTGCTCCATTACCCTTTG
CCACGTTTTAGATTTACTGTCGGTTATCACAAGCGGCAGGTCATATCTCG
AGTACCGGAAGAAGTAGAAGCTCACTTCAATAAAAGCAACAAATGCTTGA
AGTTTGTCAAAGGCAATGATATCCCAGCTGAAATGAAAAAGGTTCTGAAC
CACAGCACAGCTTTTTTCCTGGAAGGTCATGCTACCGCTAGGAGTTTGTC
TGATTTCGGAATTGAGAATACCAGGCTGCTAAAATGTTGCTTATTGACAG
AATGTAATGGAGTCAAAACCATCATTGATTTGTCACAAGGTGGTGGACAC
TCACAAGTTTACACAAGAGGAAAAGGGAAGAGCGAGTCACTGAAGTTTCC
TGAAGAACAAACTGATGCACTTGGAAATCTACAAGACTTGAATATATATT
ACATGAAGAATTTAGAGAGCATTTGGAAGGGGCCTGTTCATAAGCACTGC
CTAGCTAGCCTGAAGTTCCTTGCACTTCATAAATGCCCCAGATTGAGTAC
CATTTTCTCACTAGATTTGGTTGCTAATCTTGACAATTTAGAAGAGCTCA
TTGTTGAACACTGCCCTCAACTGACCAGTCTTGTGAGCCCGACGGGTCAT
GTGTCCAGTAACTCAACACCACAACCAAATTGCTTTTTTCCTAGCTTGAA
AAGAATATCACTGCTTTACGTGCCAAATCTTGTTAGCATTTCTAGTGGTT
TGTGGATTGCTCCAGAACTGGAAAAAGTAGGCTTTTACAATTGCCCAAAG
CTTAAGAGTCTTTCCGCGATGGAAATGTCAAGTGACCATTTGACGAGGAT
CAAAGGAGAAAGTCACTGGTGGGAAGCATTGGAGTGGAAAAACTCAGAGT
GGGGGAACCCGCTGGATTATCTGCAGAGTATCTTTTCCCCACTTATTAAG
GAGAGAGATGTGAAGGCGCAATTGGCAGAAGAAGGAATTATGCACCATGC
TTCAACTTAA

protein sequence of Tc02v2_p001130.4

>Tc02v2_p001130.4 ID=Tc02v2_p001130.4|Name=Tc02v2_p001130.4|organism=Theobroma cacao|type=polypeptide|length=1470bp
MESWERKNVRNELRQGRKLAKQLQANLKRSSSEENPELVQKIVSSFEKAL
SMLNCSTSSMAAKLQPIAHSSLARDGSHQSKDPDHDIKEQDFKVKDFFMK
REVTESQPKGLATEMFKFPPPFSGSFSSAGNLEAFDGKAAELKPTIIGTE
FDHDFMEEHELKVNETFNKSCSVKDESKPAGHAKVMFKSLSDYNLEKKKH
TVGDASKKSAAGESQPTEVEFKKPKSSHSLGESLPSKDSDYNFKEQELKI
NEGSGKRNARRSWTVLVHSDMVLGELPSDGCNWRKYGQKDILNARFPREY
YRCAHRHTQGCFATKEVQREDEDPMFITATYKGMHTCTLAPDLMPPGPPE
ILAPLDTVLGADGNDKKDSQSNLQSSVHSPDNQTCISSTKLTSELPNLGL
NLNVFPEKSFKSYPTWKNFYENEVRKNWKVLNRKKDVLLLLSSYPMIMID
KSDTDKWIIDVLATMRHVKSTEKILFGVGVAKHWPGMTTLQELSGRLQKL
LDVPLMNDIEGVLPVDLVENLYRTTEADLRPLLEVEQNIISGKTSKSRGS
ASNSEGAAMEAEKELQPMPAKCKTLVEDTELPAKGTLNVPEEIFDLAIYL
AVRQILKCINRGYIWCITISGRDKKRVLGAVKQHQDIVSEFGYIIVFTVS
EDQSGANVHGVFQLQKGFWLGGCFDSVDLTHEYFDNLCSPGILLLTEDDY
DKNMNLDQSTLPLLINLNKLVDHKHSDSRFIIFTSKMATDMEIRMEDHLL
SWKLFCRIVGEGLLSPSIQQIAASLVKEYRGNLLAIILTARSLEKVTDDV
NLWELAVKRLTMLPPSQIEDIDNVLINALTFIWERMNNKTRHCIKFFTWY
PKGQKINRVSLIQHWIQDRLVDTHDEGTNIIQNLVDTSLLNIVELNGVQL
RREIYDVLVNPLILQMHPFYLLLGRARLIKPPEEEEWDAKVINLMDNKLS
DLPESPRSPSLIALYLQRNLDLMTIPSCFFKHMPLLQILDLSHTSIKSLP
ESLSSLVNLRELLLKGCELFIRLPSHVGELKNLEKLDLDETQIIDLPAEI
GQLSKLKILRVSFYGYMNCSKTRLRQDTIIPPGTISGLSELTELSIDVDP
DDERWNATVKDIIEEACNLKTLRQLNLYLPNIEILWKRRTGSASLLHYPL
PRFRFTVGYHKRQVISRVPEEVEAHFNKSNKCLKFVKGNDIPAEMKKVLN
HSTAFFLEGHATARSLSDFGIENTRLLKCCLLTECNGVKTIIDLSQGGGH
SQVYTRGKGKSESLKFPEEQTDALGNLQDLNIYYMKNLESIWKGPVHKHC
LASLKFLALHKCPRLSTIFSLDLVANLDNLEELIVEHCPQLTSLVSPTGH
VSSNSTPQPNCFFPSLKRISLLYVPNLVSISSGLWIAPELEKVGFYNCPK
LKSLSAMEMSSDHLTRIKGESHWWEALEWKNSEWGNPLDYLQSIFSPLIK
ERDVKAQLAEEGIMHHAST*
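
The FASTA headers on this page pack their metadata into `key=value` pairs separated by `|`. The sketch below is one way to split such a header into its fields and read the declared length. The function names `parse_fasta_header` and `declared_length` are hypothetical, chosen only for illustration, and the header layout is assumed from the two headers shown here rather than from any documented specification. The length suffix is stripped generically because both headers use `bp`, including the polypeptide one.

```python
def parse_fasta_header(header: str) -> dict:
    """Split a header of the form '>record_id key=value|key=value|...' into a dict.

    Assumption: the layout matches the two headers shown on this page.
    """
    body = header.lstrip(">").strip()
    record_id, _, rest = body.partition(" ")
    fields = {"record_id": record_id}
    for part in rest.split("|"):
        if "=" in part:
            key, _, value = part.partition("=")
            fields[key] = value
    return fields


def declared_length(fields: dict) -> int:
    """Return the numeric part of the 'length' field, ignoring any unit suffix."""
    digits = "".join(ch for ch in fields["length"] if ch.isdigit())
    return int(digits)


# Header copied from the mRNA sequence on this page.
mrna_header = (
    ">Tc02v2_t001130.4 ID=Tc02v2_t001130.4|Name=Tc02v2_t001130.4"
    "|organism=Theobroma cacao|type=mRNA|length=4410bp"
)

fields = parse_fasta_header(mrna_header)
mrna_length = declared_length(fields)

# A coding sequence of this length holds mrna_length // 3 codons,
# including the stop codon (shown as '*' in the protein sequence).
codon_count = mrna_length // 3
```

With the mRNA header above, the declared length of 4410 corresponds to 1470 codons, which matches the length declared in the polypeptide header when the terminal `*` is counted.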