Tc01v2_t016100.3

Overview
NameTc01v2_t016100.3
Unique NameTc01v2_t016100.3
TypemRNA
OrganismTheobroma cacao (cacao)
Sequence length3816
Properties
Property NameValue
Model evidenceSupporting evidence includes similarity to: 1 EST, 33 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 12 samples with support for all annotated introns
Producthistidine kinase 2, transcript variant X3
NoteHistidine kinase 2
Cross References
External references for this mRNA
DatabaseAccession
GeneID18612437
GenbankXM_018124708.1
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameSpeciesType
Tc01v2_g016100Tc01v2_g016100Theobroma cacaogene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameSpeciesType
Tc01v2_p016100.3Tc01v2_p016100.3Theobroma cacaopolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
exon-auto36855auto36855Theobroma cacaoexon
exon-auto36856auto36856Theobroma cacaoexon
exon-auto36857auto36857Theobroma cacaoexon
exon-auto36858auto36858Theobroma cacaoexon
exon-auto36859auto36859Theobroma cacaoexon
exon-auto36860auto36860Theobroma cacaoexon
exon-auto36861auto36861Theobroma cacaoexon
exon-auto36862auto36862Theobroma cacaoexon
exon-auto36863auto36863Theobroma cacaoexon
exon-auto36864auto36864Theobroma cacaoexon
exon-auto36865auto36865Theobroma cacaoexon
exon-auto36866auto36866Theobroma cacaoexon
exon-auto36867auto36867Theobroma cacaoexon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
CDS-auto36868auto36868Theobroma cacaoCDS
CDS-auto36869auto36869Theobroma cacaoCDS
CDS-auto36870auto36870Theobroma cacaoCDS
CDS-auto36871auto36871Theobroma cacaoCDS
CDS-auto36872auto36872Theobroma cacaoCDS
CDS-auto36873auto36873Theobroma cacaoCDS
CDS-auto36874auto36874Theobroma cacaoCDS
CDS-auto36875auto36875Theobroma cacaoCDS
CDS-auto36876auto36876Theobroma cacaoCDS
CDS-auto36877auto36877Theobroma cacaoCDS
CDS-auto36878auto36878Theobroma cacaoCDS
CDS-auto36879auto36879Theobroma cacaoCDS
CDS-auto36880auto36880Theobroma cacaoCDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc01v2_t016100.3 ID=Tc01v2_t016100.3|Name=Tc01v2_t016100.3|organism=Theobroma cacao|type=mRNA|length=3816bp
ATGAGTTGTTCTTCTGGAACTGGGAATTTTGTGAAGCTCTCAAGGCTCCT
TGGGGAAATACGTAAGTGTGCTTTGGTCAAGATGTCTATGAACGGCAAGC
TTTCTGGTTCTAATTGTAGATTATCAGCAAATTTCAGGCTGAAGAAGGCA
AAAGAGACTATGCATGGGCCCAATTCTTTCAGGAAATGGAAGAGAAACCT
TCTCTTTCTCTGGCTTTTAGGCTTTGTTTCGACAGGAATTATTTGGTTTT
TCTTGAGTTTCAATAGTGTAGCTTCGGAGAGGAATGAGAAAAGTCCTGAT
TCTTGTGATGAGAAGGCAAGAATCTTGCTCCAACATTTCAATGTTAGCAA
GAACCAGTTTCATGCTCTAGCTTCTTTCTTCTACGAATCAGATCAGATAA
AATTCCTCGAATGTACCAGAGATTCAGGACCTAAAAAGCCATCAAGTGAT
GGTATTGCCTGTGCTCTTAAGGTACTGTGTTCAGAGCACCAAGACCTCAG
AAAGCAGCAGATGTGGGTTGTAAGAAATACAGAACTTAAGGATCAATGCC
CAGTCCAAGTTGAGAATATTCCCAGCGAGCATGACTTGTCATTGCTGGAG
CACGATACCTTATCATTTGTCTCACAAATTGCAGTTTCATTAGTATCATG
GGAGCATCACAGTGGTGGAAAGAACATCTCACAAAGAAGTGCACTAGGAG
TCGAATCAAAAGACAATTGTGAGAACTTGTCATTTTGTATGGTGAAAGGA
TGTTGGTTGCTTCTTGTTGGAGTGATACTGAGCTGGAAGATTCCTGGAGT
TCGTTTGAAGCTCTGGAGGAACAGAAAGAATGAGCCAGCTCTGCTGCAGC
CTGTGGCTCAGCAACTACCGCTGCTGCTGCAACAGAAGCAGCAGCAAACC
CAGAGCCCTCCTAAAGGTGCAGGGAAGTGGAGAAAGAAACTCTTAATAAC
ATTTGTATTTGTGGGGATCTTTACATCCTTCTGGTTATTTTGGCATTTAA
ACCAAAAGATCATTTTAAGGAGAGAAGAGACACTTGCCAACATGTGTGAT
GAAAGAGCACGGATGTTGCAGGATCAGTTCAATGTTAGCATGAATCATGT
TCATGCGTTGGCTATTCTCGTATCCACTTTTCACCATGGGAAGCATCCAT
CTGCTATTGATCAGAAAACATTTGGTGAATATACTGAAAGAACAGCTTTT
GAGAGGCCACTTACTAGTGGTGTCGCTTATGCACTGAAAGTTCTTCACTC
AGAGAGGGAGCAGTTTGAGAAGCAGCATGGATGGACAATAAAGAAAATGG
AAACTGAGGACCAGACTTTGGTCCAAGATTGCCTGACAGAAAATTTGGAT
CCTGCACCCATTAAAGATGAATATGCACCAGTAATATTTTCACAAGAAAC
TGTGTCTCATATTGTTTCTATTGACATGATGTCTGGAAAGGAAGACCGTG
AGAACATCCTGCGGGCAAGGGCAACTGGAAAGGGAGTATTGACATCTCCT
TTTAAGCTGTTAAAATCCAATCACCTTGGTGTTGTTCTCACATTTGCTGT
TTATAACAAGGATTTGCCTCCAAGTGCTACACCAAGGCAACGAACTGAAG
CTACTGTGGGGTACCTGGGTGCGTCTTATGATGTCCCCTCTCTGGTGGAG
AAGCTTCTGCACCAACTTGCCAGCAAGCAAACCATTGTTGTCAATGTTTA
CGACACAACCAATGCATCTGCTGCCATCAGCATGTACGGTACTGATGTAA
CTGATACTGGCCTACTGCATGTCAGTAGCCTTGATTTTGGAGATCCATTA
AGGAAGCATGAGATGCACTGCAGGTTCAAGCAAAAACCCCCGTTACCTTG
GACAGCAATTAATGCATCAGTAGGAGTCCTAGTTATTACTTTGCTTGTCG
GTCATATCTTCCATGCTGCTATATGTCGAATTGCAAAAGTAGAGAATGAC
TACCGTGAGATGATGGAGCTCAAAGCTCGTGCTGAAGCTGCAGATGTGGC
CAAATCTCAGTTTCTAGCAACTGTTTCCCATGAGATCAGGACTCCGATGA
ATGGTGTTTTAGGTATGCTGAAAATGCTGATGGATACAGAGCTTGATGCG
ATCCAAAGGGACTATGCTGAGACTGCTCATGCTAGTGGGAAAGATCTTAT
CTCACTGATAAATGAGGTCCTTGATCAGGCTAAGATAGAATCAGGCAGGC
TTGAGCTTGAGGATGTGCCCTTTGATCTACGCACTCTTCTTGATAACGTC
CTCTCACTTTCCTCAGACAAATCTAATTATAAAGGGATTGAGTTGGCAGT
TTATGTATCCGATCGGGTTCCTGAAGTTGTTGTTGGTGATCCCGGGCGGT
TTCGGCAAATAATTACAAATCTTGTTGGAAATTCAATTAAGTTCACGCAG
GATAAGGGACATATTTTTGTCTCAGTGCATCTGGTAGATGAAGTGAAGGG
TGCATTTGATGTGGGAGACAAGGTGCTGCAACCAGGCTTGAACTTAGTTC
AAGACATGTCAAGCAAAACATATAATACGTTAAGTGGGTTTCTAGTGGTG
GACAGGTGGAGAAGCTGGGAGAACTTTACAATACTAAATGGCAAAGACTC
AATGGAGGATCCTGAAAAGATTAAATTACTAGTAACAGTTGAGGACACAG
GTGTGGGAATTCGTTTAGATGCACAGGATCGAATTTTCACTCCTTTTGTG
CAAGCTGACAGTTCCACTTCACGACATTATGGTGGGACTGGAATAGGATT
GAGCATCAGCAAACGTCTTGTACAACTCATGCATGGGGAGATCGGGTTTG
TGAGTGAACCTGGCACTGGCAGTACTTTCTCATTCACTGCAGCTTTTGGA
AAAGGTGAAGCGAGTTCTCTGGATTCAAAGTGGAAGCAATATGATCCAGT
GATTTCGGAGTTCCAAGGTTTGGGAGCACTGATTATTGATAATAGAAGCA
TCCGAGCTGAGGTTACAAGATACCATCTTCGGAGATTGGGAATATCTGTG
GATATAACTTCCAGTATGGAGTTAGCGTACACCTATCTGTCAAGCACTTG
TGGCACAAGTGCATTTGCACATTTGGCCATGATTCTTATTGACAAAGATG
TTTGGAATCAGGAAACAGTTCTTCAGTTACGATCTTTGCTCAAAGATCAT
AGGCAAAATGACAGAGTAGATGTTTCGACAAACCTTCCAAAAATTTTTCT
CTTGGCTACCTCCATGAGCCCGATTGAGCGCTCCAAGCTTAAGACTGCTG
CTTTTGTAGATAATGTGCTGATGAAGCCACTTCGGTTGAGTGTCTTGATT
GCCTGTTTCCAAGAAGCCCTTGGAAATGGTAGAAAGGAGCAAGTACATAG
AGAGAGAATGTCTACGCTTGGGAGCTTACTACGAGAAAAGCGGATTTTAG
TGGTTGATGACAATAAGGTTAACAGAAGAGTGGCAGAAGGTGCTTTAAAG
AAATATGGAGCAATTGTTTCCTGTGTGGAAAGAGGCCAGGATGCGCTGCA
CAAGCTTAAGCCACCCCATAATTTTGATGCTTGCTTCATGGATCTCCAGA
TGCCAGAAATGGATGGGTTTGAAGCTACTAGGCAAATCCGCTGCGTGGAG
AGTGAGGTCAATGAAAAAATTGTTTCTGGAGAAGCATCCATTGAGATGTA
CGGAAATGTGCATCAATGGCACATTCCAATTTTAGCAATGACAGCTGATG
TCATCCAAACTACAAATGAAGAGTGCATGAAATGTGGGATGGATGGCTAT
GTGTCAAAGCCTTTTGAGGAAGAGCAACTTTATTCAGCTGTTGCAAGTTT
TTTTGAGTCTGGTTGA
back to top

protein sequence of Tc01v2_p016100.3

>Tc01v2_p016100.3 ID=Tc01v2_p016100.3|Name=Tc01v2_p016100.3|organism=Theobroma cacao|type=polypeptide|length=1272bp
MSCSSGTGNFVKLSRLLGEIRKCALVKMSMNGKLSGSNCRLSANFRLKKA
KETMHGPNSFRKWKRNLLFLWLLGFVSTGIIWFFLSFNSVASERNEKSPD
SCDEKARILLQHFNVSKNQFHALASFFYESDQIKFLECTRDSGPKKPSSD
GIACALKVLCSEHQDLRKQQMWVVRNTELKDQCPVQVENIPSEHDLSLLE
HDTLSFVSQIAVSLVSWEHHSGGKNISQRSALGVESKDNCENLSFCMVKG
CWLLLVGVILSWKIPGVRLKLWRNRKNEPALLQPVAQQLPLLLQQKQQQT
QSPPKGAGKWRKKLLITFVFVGIFTSFWLFWHLNQKIILRREETLANMCD
ERARMLQDQFNVSMNHVHALAILVSTFHHGKHPSAIDQKTFGEYTERTAF
ERPLTSGVAYALKVLHSEREQFEKQHGWTIKKMETEDQTLVQDCLTENLD
PAPIKDEYAPVIFSQETVSHIVSIDMMSGKEDRENILRARATGKGVLTSP
FKLLKSNHLGVVLTFAVYNKDLPPSATPRQRTEATVGYLGASYDVPSLVE
KLLHQLASKQTIVVNVYDTTNASAAISMYGTDVTDTGLLHVSSLDFGDPL
RKHEMHCRFKQKPPLPWTAINASVGVLVITLLVGHIFHAAICRIAKVEND
YREMMELKARAEAADVAKSQFLATVSHEIRTPMNGVLGMLKMLMDTELDA
IQRDYAETAHASGKDLISLINEVLDQAKIESGRLELEDVPFDLRTLLDNV
LSLSSDKSNYKGIELAVYVSDRVPEVVVGDPGRFRQIITNLVGNSIKFTQ
DKGHIFVSVHLVDEVKGAFDVGDKVLQPGLNLVQDMSSKTYNTLSGFLVV
DRWRSWENFTILNGKDSMEDPEKIKLLVTVEDTGVGIRLDAQDRIFTPFV
QADSSTSRHYGGTGIGLSISKRLVQLMHGEIGFVSEPGTGSTFSFTAAFG
KGEASSLDSKWKQYDPVISEFQGLGALIIDNRSIRAEVTRYHLRRLGISV
DITSSMELAYTYLSSTCGTSAFAHLAMILIDKDVWNQETVLQLRSLLKDH
RQNDRVDVSTNLPKIFLLATSMSPIERSKLKTAAFVDNVLMKPLRLSVLI
ACFQEALGNGRKEQVHRERMSTLGSLLREKRILVVDDNKVNRRVAEGALK
KYGAIVSCVERGQDALHKLKPPHNFDACFMDLQMPEMDGFEATRQIRCVE
SEVNEKIVSGEASIEMYGNVHQWHIPILAMTADVIQTTNEECMKCGMDGY
VSKPFEEEQLYSAVASFFESG*
back to top