Tc04v2_t014510.1

Overview
Name             Tc04v2_t014510.1
Unique Name      Tc04v2_t014510.1
Type             mRNA
Organism         Theobroma cacao (cacao)
Sequence length  2106
Properties
Property Name   Value
Model evidence  Supporting evidence includes similarity to: 8 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments
Product         DNA-directed RNA polymerase II subunit RPB1
Note            Uncharacterized protein
Cross References
External references for this mRNA
Database  Accession
GeneID    18602497
Genbank   XM_007033908.2
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Species          Type
Tc04v2_g014510  Tc04v2_g014510  Theobroma cacao  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name      Unique Name       Species          Type
Tc04v2_p014510.1  Tc04v2_p014510.1  Theobroma cacao  polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name     Unique Name  Species          Type
exon-auto227116  auto227116   Theobroma cacao  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name    Unique Name  Species          Type
CDS-auto227117  auto227117   Theobroma cacao  CDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc04v2_t014510.1 ID=Tc04v2_t014510.1|Name=Tc04v2_t014510.1|organism=Theobroma cacao|type=mRNA|length=2106bp
ATGGCCTTCACCAGTTGTTTCTTATTGGCTTTCTCTATTGTTGTCATGCT
TTCAGGCATTGATACAAACTTGGCAGCTCGCTATCTGCTTGACAAGACAC
GGTCTCCAGCTCCATCTGTGCTGTCCGTGTCTGCTCCACAATCATCACTG
CCTCCGCACAGTTCAAAATCAAGTTTCACAGCACCTGCAGAGCCTCCTTT
GGCCGGCTCTCAACCATCCGTTTCTAGTTCCATGCCCAGTTCATCAGAAT
CAACGACACCATCCTCAGCTTCAGATAAACCATCTCTGCCTAATTCTACG
CCAAGCTTGACACAGCCCGCAGCAGCGCCTAAAGCTTCTACTCAGCCATC
TTTATCTAAAACAATCCCAAGCTCAACACAGCCCACACCCTCTCAGCAAG
CTTCCCGTAATTCAATCCCTGGCTCGAGAGAACCAACAATGGCTCCAATG
GGTTCCTCTCATCCGTCCTTGTCTAATACAGAGCCAAGCTTGACACAACC
CGCAATGTCACCTTTAGCTAATGCCCCACCATCTTCGTCTACTAATAATC
CATCTCTGTCCAATTCAACAAACAACTTGGCACAGCCTGCTGTGGCACCT
ATGGCTTCTACTCATCCGTCTTCTCCAGATAAAGTACCAAGTTTGGCAGT
GCCTGCTACCTCGCCTTCAGCTACTACCCAACCATCGTTGCCTACTAATA
ATTCACCTTCGTCCAATTCAATGCCAGGCTTGGCACAGCCTGCTATGGCA
CCTATGGATTCTACTCATCCATCTGCACCAGGTAAAGTGCCAAGCTTGGC
ACTGCCTGCAACGTCTCCTTCAGAAACTACCCAACCATCGTTGCCTAATC
CATCTTCGTCCAACTCAACATCAGGCTTGACACAACATGGAATGGCACCT
ATGGCTTCCTCTCTGCCATCTTCACCAGATAAGATGCCAAGCTTGACACT
GCCTGCAACATCACCTTCTGCTACCAGCCAACCATCTTTGCCTACTAATA
GTCCATCATTCTCCAATTCAACGCCAAGCTTGGCACAACCTGGAATGGCA
CCTATGGCTTCCTCTCAACCATCATTGTCAAACTCAACGCCGAGCTTGGG
ACTGCCTTCAATGTCGCCTTCTGCTACCAGCCAACCATCTTTACCTGCAA
ATATTCCATCGTTCTCCAACTCAACACCAAGCTTGGCACAACCTGGAATG
GCACCTATGGCTTCCTCTCAACCATCATTGTCAAACTCAACGCCAAGCTT
GGGACTGCCTTCGATGTCGCCTTCTGCTACCAGCCAACCATCTTTGCCCA
CAAATATTCCATCGTTCTCCAACTCAACACCAAGCTTGGCACAACCTGGA
ATGCCACCTATGTCTTCCTCGCAACCATCATTACCAAACTCAACGCCGAG
CTTGACACAACCTGGTATGGCACCAATGGCTTCGTCTCAACCATCATTGT
CAAATTCAACTCCAAGCTTGGGACTGCCTTCAATGTCGCCTTCTGCTACC
AGCCAACCATCTTTGCCTACAAATAGCCCATCGTTCTCCAACTCAACGTC
AAGCTTGACACAACCTGGAATGGCACCTATAGCTTCCTCTCAACCATCTT
CGCCTGATAAAATCCCAAGCTTGGCATTGCCTTCAAATTCACCTTCTAAT
ACCACCCAACCGTCGTTGCCTAACAATATTAATCCATCATTATCCAACTC
AACCCCAAGCTTGACACAACCTGCAATATCACCTTCAGCTCATCAAGCTT
CGCCCCAATCCTCAATGACACCTACGGCTTCACCAAATCATACATCTCCG
TCTAATATTGCACCGAAGGCTTCCTCGCAACCATCTTTACCAAACACGAC
TCCAAGCTCGACACAATCAGCAGTTGCGCCTTCTCCTACTGCTCATCCAT
CTTCGTCTAATACAACGTCAGGGTTAAAACAACCCGCGATGGCGCCGCCA
AGGACATCTGAGACACCTTTGCGTGGAGCTTCCTTGCCTCCACTTTCTGG
CATGAACCCCACTACGCCAACAAATGCAAGCACAACGCTACCGTCAATCC
CAACGAAAATCTCTTTCCCATTCCTTCCGCCACCATCTACCAAAACTAGG
CCTTGA
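
The FASTA header of the mRNA record above packs its metadata into `key=value` pairs separated by `|`. As a minimal sketch, assuming that layout holds for other records from this database, the fields can be pulled apart as follows. The function name `parse_defline` is hypothetical and not part of any database tooling.

```python
def parse_defline(defline: str) -> dict:
    """Split a '>id key=value|key=value' FASTA header into a dict."""
    # The record identifier is everything before the first space.
    record_id, _, attrs = defline.lstrip(">").partition(" ")
    fields = {"id": record_id}
    for pair in attrs.split("|"):
        key, sep, value = pair.partition("=")
        if sep:  # skip fragments that are not key=value pairs
            fields[key] = value
    return fields


header = (">Tc04v2_t014510.1 ID=Tc04v2_t014510.1|Name=Tc04v2_t014510.1"
          "|organism=Theobroma cacao|type=mRNA|length=2106bp")
fields = parse_defline(header)

# The length field carries a "bp" suffix; strip it before converting.
raw = fields["length"]
length = int(raw[:-2]) if raw.endswith("bp") else int(raw)
print(fields["type"], fields["organism"], length)
```

The parsed length can then be compared with the number of bases actually read from the sequence lines as a basic integrity check.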

protein sequence of Tc04v2_p014510.1

>Tc04v2_p014510.1 ID=Tc04v2_p014510.1|Name=Tc04v2_p014510.1|organism=Theobroma cacao|type=polypeptide|length=702bp
MAFTSCFLLAFSIVVMLSGIDTNLAARYLLDKTRSPAPSVLSVSAPQSSL
PPHSSKSSFTAPAEPPLAGSQPSVSSSMPSSSESTTPSSASDKPSLPNST
PSLTQPAAAPKASTQPSLSKTIPSSTQPTPSQQASRNSIPGSREPTMAPM
GSSHPSLSNTEPSLTQPAMSPLANAPPSSSTNNPSLSNSTNNLAQPAVAP
MASTHPSSPDKVPSLAVPATSPSATTQPSLPTNNSPSSNSMPGLAQPAMA
PMDSTHPSAPGKVPSLALPATSPSETTQPSLPNPSSSNSTSGLTQHGMAP
MASSLPSSPDKMPSLTLPATSPSATSQPSLPTNSPSFSNSTPSLAQPGMA
PMASSQPSLSNSTPSLGLPSMSPSATSQPSLPANIPSFSNSTPSLAQPGM
APMASSQPSLSNSTPSLGLPSMSPSATSQPSLPTNIPSFSNSTPSLAQPG
MPPMSSSQPSLPNSTPSLTQPGMAPMASSQPSLSNSTPSLGLPSMSPSAT
SQPSLPTNSPSFSNSTSSLTQPGMAPIASSQPSSPDKIPSLALPSNSPSN
TTQPSLPNNINPSLSNSTPSLTQPAISPSAHQASPQSSMTPTASPNHTSP
SNIAPKASSQPSLPNTTPSSTQSAVAPSPTAHPSSSNTTSGLKQPAMAPP
RTSETPLRGASLPPLSGMNPTTPTNASTTLPSIPTKISFPFLPPPSTKTR
P*
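
The polypeptide is listed as deriving from the mRNA, and 2106 nucleotides correspond to 702 codons, so the header's length of 702 appears to count the terminal `*` (stop) symbol. The sketch below, which assumes the standard genetic code and an open reading frame that starts at the first base, translates the first and last few codons copied from the mRNA sequence above and compares them with the two ends of the protein. The `translate` helper is illustrative only.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X")
                   for i in range(0, usable, 3))


# First 15 and last 18 nucleotides of the mRNA sequence shown above.
start = translate("ATGGCCTTCACCAGT")
end = translate("ACCAAAACTAGGCCTTGA")
print(start, end)
```

Running the same function over the full 2106-nucleotide sequence would allow a residue-by-residue comparison with the protein record.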