Tc03v2_t019860.1

Overview
NameTc03v2_t019860.1
Unique NameTc03v2_t019860.1
TypemRNA
OrganismTheobroma cacao (cacao)
Sequence length2931
Properties
Property NameValue
Model evidenceSupporting evidence includes similarity to: 3 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 15 samples with support for all annotated introns
ProductDNA repair endonuclease UVH1
NoteDNA repair endonuclease UVH1
Cross References
External references for this mRNA
DatabaseAccession
GeneID18606146
GenbankXM_018117922.1
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameSpeciesType
Tc03v2_g019860Tc03v2_g019860Theobroma cacaogene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameSpeciesType
Tc03v2_p019860.1Tc03v2_p019860.1Theobroma cacaopolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
exon-auto182734auto182734Theobroma cacaoexon
exon-auto182735auto182735Theobroma cacaoexon
exon-auto182736auto182736Theobroma cacaoexon
exon-auto182737auto182737Theobroma cacaoexon
exon-auto182738auto182738Theobroma cacaoexon
exon-auto182739auto182739Theobroma cacaoexon
exon-auto182740auto182740Theobroma cacaoexon
exon-auto182741auto182741Theobroma cacaoexon
exon-auto182742auto182742Theobroma cacaoexon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
CDS-auto182743auto182743Theobroma cacaoCDS
CDS-auto182744auto182744Theobroma cacaoCDS
CDS-auto182745auto182745Theobroma cacaoCDS
CDS-auto182746auto182746Theobroma cacaoCDS
CDS-auto182747auto182747Theobroma cacaoCDS
CDS-auto182748auto182748Theobroma cacaoCDS
CDS-auto182749auto182749Theobroma cacaoCDS
CDS-auto182750auto182750Theobroma cacaoCDS
CDS-auto182751auto182751Theobroma cacaoCDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc03v2_t019860.1 ID=Tc03v2_t019860.1|Name=Tc03v2_t019860.1|organism=Theobroma cacao|type=mRNA|length=2931bp
ATGGTCTTGAAATTCCACGAACAAATAGTCTCCGACCTCCTCCAAGACCC
AAACGGCGGCCTTGTAATCCTTTCTTCCGGCCTTTCCCTCCCTAAACTAC
TCTCTTCTTTCCTCTCTTTCCACTCCCAATCGAACGGCTCCCTCCTCCTC
CTCCACTCCCCTCAATTCTCCTCCTCCCTCAAATCCCTCCTCCTTTCCCT
CTCCCCCAACCTCCCGCTCTCAGAAATTACCGCCGACCTACCTTCCTCCA
ACCGCCTCTCGCTCTACTCCTCCAATCGAGTCCTCCTCCTCTCCCCTCGT
ATCCTCATCGTCGACCTCTTAACCCAAAAGGCCCAAACTTCCTTAATTTC
CGGTGTCATTTTCCTCAACACCCATTCGCTCTCCGAAAGCTCAACTGAAT
CTTTCATTGTGAGAATCATCAAAACATTCAATAAAAATGCTTCTGTTTAC
GCGTTTTCAGACAAGCCTCACTCTATGGTTTCTGGGTTCGCGAAAACGGA
GAGGATAATGAAGAGTTTGTTCATTAAAAAGCTTCATCTTTGGCCGAGGT
TTCAAGTGAATGTATCGGAGGAATTGGAGAGGGATCCGCCTGAAGTGGTG
GATATAAGGGTGCCGATGAGTAAATACATGGTGGGGATTCAAAAGGCGAT
TGTGGAAGTCATGGATGCTTGTTTGAAGGAAATGAGGAAGACTAATAAGG
TTGATGTGGAGGACCTGACGTTGGAGAATGGGTTGTTTAAGTCATTTGAT
GAGATTGTGAGGAGACAATTGGATCCCATTTGGCATACTTTGGGGAAGAA
GACGAAGCAGCTCGTTTCGGATTTGAAGACTTTGAGGAAGTTGTTGGACT
ATCTTGTTAGGTATGATGCGGTGAGTTATTTGAAGTATTTGGATACGCTT
AGAGTGTCAGAGAGTTTTCGGTCTGTTTGGATATTTGCAGAGTCCAGTTA
TAAGATATTTGACTATGCAAGGAAGCGAGTTTATTGTTTTTCAAGGTCAG
ATGGAACCAAAATTAATAAGCCTAGTAAGAACGTGTCTGGCAAAAAGAGA
AAATTGAAGGAGGATGGTAGTATTAACGAAGGAGCAATTGCTGGTACTTC
ATCAACAGGTACAAGTAATGGAGTTGTTCTCGAAGAAGTTTTGGAAGAGC
CTCCAAAGTGGAAGGTGTTACGTGAGGTTCTTGAGGAGATAGAAGAGGAA
AGACAAAAGCAAGCATCATCAGAAGAACTTCTTTTGGATGTCGGAGAGGA
CAATAATGGAATTGTTTTAGTGGCGTGCAAAGATGAGTGCTCGTGCATGC
AACTTGAAGATTGCATTACTAACAGCCCACAAAAGGTCATGAGGGATGAA
TGGGAGAAATACCTCTTAAGCAAAGTAGAACTCCGTAGTGTGCAAACATC
TCACAAGAAAAAACCTAAAAAACCTAAAACACCTAAAGGTTATGGGATTC
TTGATGGTATTGTTCCTGTTACGTCTGCCCAAAATGCAGAACCTAGCAGT
GCATGCAAGCAGGAACATGAAGCATTGTTAGCAGCGGCATCAGAATTAAG
AAGAAACCAGACTAAAATGGAAAATGATGCTGCAGATGATCCTGAACCTC
ATGTTGGCAGCCGAGGACATGGGAAAGGAAGGGGAAGAGGAAGGATTAAA
AAAGGCCCTGCAAATACACGGTGTTCTAGGAATAAAGATGGCTCTCATAG
CACTGAGGCAGCAACAGATGATAGACCTGAAATTTCTGTTTCAGAAAATG
AAGGTCATAGAAATGAAATTAACCCTACTATTGGCAATGGGCTTTTTAGG
AAGCATATTGACAGGATTGATGATACGAAAACTGACAACTCTAAGCAATT
ACCACCTGTCCACTTTCATGCTCTGGAGAGAGATCAGCCTATACTAGATG
TGTTGAAGCCCTCTGTAATTATTGTTTACCATCCAGATACGACTTTTGTT
AGGGAAATTGAAGTCTACAAAGCAGAGAATCCTGGAAAAAGGTTGAAGGT
CTATTTTCTTTTCTATGAAGCTTCTACTGAAGTCCAAAAGTTTGAAGCAA
GTATTCGTAGAGAAAATGGAGCATTTGAATCCTTGATCCGGCAGAAATCA
ATGATGATGATTCCTGTTGATCAGGATGGGTTCTGCCTTGGTTCTAATTC
TTCCTCAGACCTACAAGGTTCAAGTTCCCAGAACTCAATCACTAGAAAGG
CAGGTGGAAGAAAGGAAGCTGAGAAAGAAAAGCAGGTTGTAGTGGACATG
AGGGAGTTCATGAGTAGTCTTCCAAATGTGCTCCATCAGAAGGGCATGCG
CATAATCCCAGTTACCTTAGAAGTTGGAGATTATGTTCTCTCACCACTTA
TTTGTGTTGAGAGGAAAAGCATTCAAGATCTTTTTATGAGTTTCACATCA
GGCCGCCTTTACCACCAAGTGGAGACTATGGTTCGTTATTATCGAATACC
AGTTCTTCTAATTGAGTTTTCACAAGACAAAAGCTTTTCATTTCAGTCTG
CAAGTGACATTGGGGATGATGTAACACCAAATAATATCATATCCAAACTG
TCATTACTTGTTCTGCATTTTCCCCGCCTACGAATCCTCTGGTCTCGCAG
CTTGCATGCAACTGCTGAAATATTTGCTTCTCTTAAGGCAAATCAGGATG
AACCTGATGAGGCAAAGGCAATGAGAGTGGGTGTACCCTCCGAAGAGGGT
TTCATAGAAAATGATGTTAGAGCTGAGAACTACAATACATCTGCTGTTGA
GTTTCTGAGACGACTTCCAGGAGTGACAGATTCTAACTACAGGGCTATAA
TGGATGGATGTAAGAGCTTGGCCGAACTTGCACTTCTTCCTATGGAGAAG
CTAGCTGAACTAATGGGTGGTCGGAAAGCTGCTCAGACTCTAAGAGATTT
CCTTGATGCAAAGTGTCCAACCTTGTTGTGA
back to top

protein sequence of Tc03v2_p019860.1

>Tc03v2_p019860.1 ID=Tc03v2_p019860.1|Name=Tc03v2_p019860.1|organism=Theobroma cacao|type=polypeptide|length=977bp
MVLKFHEQIVSDLLQDPNGGLVILSSGLSLPKLLSSFLSFHSQSNGSLLL
LHSPQFSSSLKSLLLSLSPNLPLSEITADLPSSNRLSLYSSNRVLLLSPR
ILIVDLLTQKAQTSLISGVIFLNTHSLSESSTESFIVRIIKTFNKNASVY
AFSDKPHSMVSGFAKTERIMKSLFIKKLHLWPRFQVNVSEELERDPPEVV
DIRVPMSKYMVGIQKAIVEVMDACLKEMRKTNKVDVEDLTLENGLFKSFD
EIVRRQLDPIWHTLGKKTKQLVSDLKTLRKLLDYLVRYDAVSYLKYLDTL
RVSESFRSVWIFAESSYKIFDYARKRVYCFSRSDGTKINKPSKNVSGKKR
KLKEDGSINEGAIAGTSSTGTSNGVVLEEVLEEPPKWKVLREVLEEIEEE
RQKQASSEELLLDVGEDNNGIVLVACKDECSCMQLEDCITNSPQKVMRDE
WEKYLLSKVELRSVQTSHKKKPKKPKTPKGYGILDGIVPVTSAQNAEPSS
ACKQEHEALLAAASELRRNQTKMENDAADDPEPHVGSRGHGKGRGRGRIK
KGPANTRCSRNKDGSHSTEAATDDRPEISVSENEGHRNEINPTIGNGLFR
KHIDRIDDTKTDNSKQLPPVHFHALERDQPILDVLKPSVIIVYHPDTTFV
REIEVYKAENPGKRLKVYFLFYEASTEVQKFEASIRRENGAFESLIRQKS
MMMIPVDQDGFCLGSNSSSDLQGSSSQNSITRKAGGRKEAEKEKQVVVDM
REFMSSLPNVLHQKGMRIIPVTLEVGDYVLSPLICVERKSIQDLFMSFTS
GRLYHQVETMVRYYRIPVLLIEFSQDKSFSFQSASDIGDDVTPNNIISKL
SLLVLHFPRLRILWSRSLHATAEIFASLKANQDEPDEAKAMRVGVPSEEG
FIENDVRAENYNTSAVEFLRRLPGVTDSNYRAIMDGCKSLAELALLPMEK
LAELMGGRKAAQTLRDFLDAKCPTLL*
back to top