Tc04v2_t009540.4

Overview
NameTc04v2_t009540.4
Unique NameTc04v2_t009540.4
TypemRNA
OrganismTheobroma cacao (cacao)
Sequence length2916
Properties
Property NameValue
Model evidenceSupporting evidence includes similarity to: 2 ESTs, 12 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 11 samples with support for all annotated introns
Productvaline--tRNA ligase, chloroplastic/mitochondrial 2, transcript variant X4
NoteValine--tRNA ligase, chloroplastic/mitochondrial 2
Cross References
External references for this mRNA
DatabaseAccession
GeneID18601866
GenbankXM_018120005.1
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameSpeciesType
Tc04v2_g009540Tc04v2_g009540Theobroma cacaogene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameSpeciesType
Tc04v2_p009540.4Tc04v2_p009540.4Theobroma cacaopolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
exon-auto216709auto216709Theobroma cacaoexon
exon-auto216710auto216710Theobroma cacaoexon
exon-auto216711auto216711Theobroma cacaoexon
exon-auto216712auto216712Theobroma cacaoexon
exon-auto216713auto216713Theobroma cacaoexon
exon-auto216714auto216714Theobroma cacaoexon
exon-auto216715auto216715Theobroma cacaoexon
exon-auto216716auto216716Theobroma cacaoexon
exon-auto216717auto216717Theobroma cacaoexon
exon-auto216718auto216718Theobroma cacaoexon
exon-auto216719auto216719Theobroma cacaoexon
exon-auto216720auto216720Theobroma cacaoexon
exon-auto216721auto216721Theobroma cacaoexon
exon-auto216722auto216722Theobroma cacaoexon
exon-auto216723auto216723Theobroma cacaoexon
exon-auto216724auto216724Theobroma cacaoexon
exon-auto216725auto216725Theobroma cacaoexon
exon-auto216726auto216726Theobroma cacaoexon
exon-auto216727auto216727Theobroma cacaoexon
exon-auto216728auto216728Theobroma cacaoexon
exon-auto216729auto216729Theobroma cacaoexon
exon-auto216730auto216730Theobroma cacaoexon
exon-auto216731auto216731Theobroma cacaoexon
exon-auto216732auto216732Theobroma cacaoexon
exon-auto216733auto216733Theobroma cacaoexon
exon-auto216734auto216734Theobroma cacaoexon
exon-auto216735auto216735Theobroma cacaoexon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
CDS-auto216736auto216736Theobroma cacaoCDS
CDS-auto216737auto216737Theobroma cacaoCDS
CDS-auto216738auto216738Theobroma cacaoCDS
CDS-auto216739auto216739Theobroma cacaoCDS
CDS-auto216740auto216740Theobroma cacaoCDS
CDS-auto216741auto216741Theobroma cacaoCDS
CDS-auto216742auto216742Theobroma cacaoCDS
CDS-auto216743auto216743Theobroma cacaoCDS
CDS-auto216744auto216744Theobroma cacaoCDS
CDS-auto216745auto216745Theobroma cacaoCDS
CDS-auto216746auto216746Theobroma cacaoCDS
CDS-auto216747auto216747Theobroma cacaoCDS
CDS-auto216748auto216748Theobroma cacaoCDS
CDS-auto216749auto216749Theobroma cacaoCDS
CDS-auto216750auto216750Theobroma cacaoCDS
CDS-auto216751auto216751Theobroma cacaoCDS
CDS-auto216752auto216752Theobroma cacaoCDS
CDS-auto216753auto216753Theobroma cacaoCDS
CDS-auto216754auto216754Theobroma cacaoCDS
CDS-auto216755auto216755Theobroma cacaoCDS
CDS-auto216756auto216756Theobroma cacaoCDS
CDS-auto216757auto216757Theobroma cacaoCDS
CDS-auto216758auto216758Theobroma cacaoCDS
CDS-auto216759auto216759Theobroma cacaoCDS
CDS-auto216760auto216760Theobroma cacaoCDS
CDS-auto216761auto216761Theobroma cacaoCDS
CDS-auto216762auto216762Theobroma cacaoCDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc04v2_t009540.4 ID=Tc04v2_t009540.4|Name=Tc04v2_t009540.4|organism=Theobroma cacao|type=mRNA|length=2916bp
ATGAGTCTCCACCAAATGGCTATCTCTCCTCCTTTTCTTTTATCTTCTCG
CTCTGCTTACACCCTCAACCCTCTTCTCTTTGCCAAACATCGCCGTTTTT
GCTTTCCTCTCTCTCAGTCTCGTTTCAGTTCGATAAAGCGTCGGAGCTTT
GCTGTTGTTGCATCAGAGAATGGCGTTTTCACTTCTCCAGAGTTGGCAAA
GTCTTTTGATTTTACTTCAGAGGAGCGGATATACAATTGGTGGCAGTCTC
AAGGGTATTTCAGGCCAAAATTTGACCGGGGAAGTGATCCTTTTGTCATA
TCAATGCCACCACCTAATGTTACTGGCTCTCTGCACATGGGACATGCAAT
GTTTGTGACTCTTGAGGATATCATGGTTAGGTACCATCGCATGAGGGGAA
GACCAACACTTTGGCTTCCTGGGACTGATCATGCTGGTATTGCGACTCAG
TTGGTTGTGGAAAGAATGTTGGCGTCTGAAGGAATAAAAAGGGCAGAACT
GGGCAGAGATGAATTTGCAAAACGAGTTTGGGAGTGGAAAGAGAAGTATG
GTGGGACCATTACAAATCAGATTAAAAGACTTGGGGCTTCTTGTGATTGG
ACTAGAGAAAGGTTCACCCTTGATGAGCAGCTAAGTCGAGCTGTTGTTGA
GGCGTTTGTTAAACTTCATGAAAAAGGTTTAATCTATCAAGGGTCTTATA
TGGTTAACTGGTCTCCCAAGTTACAGACTGCTGTTTCAGACTTGGAAGTA
GAATATTCTGAAGAGCCTGGTGCCCTATATTATATTAAGTATCGAGTTGC
TGGAGGTTCAAGGAGTGATTTCTTGACAATAGCAACGACGCGGCCTGAAA
CTTTGTTTGGTGATGTAGCTATTGCTGTGCATCCTCAGGATGAGCGATAT
TCCAAGTATGTTGGTCAAATGGCAATTGTTCCTATGACATATGGTCGTCA
TGTTCCCATTATCTCTGATAAGTTTGTTGATAAAGACTTTGGGACAGGTG
TGCTGAAGATAAGCCCTGGCCATGATCATAATGATTATCTTCTAGCTAGA
AAGCTTGGTCTTCCAATTCTTAATGTTATGAACAAGGATGGAACACTAAA
TGAGGTTGCCGGACTGTACTGTGGTCTTGATCGGTTTGAGGCACGGAAGA
AATTGTGGTGCGAACTTGAGGAGACTGACTTAGCTGTGAAAAAGGAACCT
TACACTTTACGAGTACCAAGATCCCAGCGTGGTGGAGAGGTAATAGAGCC
ATTAGTTAGCAAGCAATGGTTTGTAACAATGGAGCCCTTGGCTGAAAAGG
CCCTTCGTGCAGTTGAAAAGGGAGAACTGACGATTATGCCTGAAAGATTT
GAGAAGATTTATAATCATTGGCTATCAAATATAAAGGATTGGTGCATAAG
CAGACAGCTGTGGTGGGGACACCGCATACCTGTTTGGTACATTGTTGGAA
AAGACTGTGAAGAGGAATATATAGTTGCTAGGAGTGCTGAGGAAGCACTT
ATAAAGGCTTGTGATAAATATGGCAAAGAAATAGAAATATATCAGGATCC
AGATGTTCTTGACACTTGGTTCTCAAGTGCACTATGGCCTTTCAGTACTC
TTGGGTGGCCAGATGTGTCAGCAGAGGATTTTAAAAGGTTTTATCCAACA
ACAATGCTTGAAACTGGGCATGATATATTGTTCTTTTGGGTTGCAAGAAT
GGTTATGATGGGAATTGAATTCACAGGAACTGTTCCATTTTCGTATGTAT
ATCTTCATGGACTTATCCGCGACTCAGAAGGGCGTAAAATGTCTAAAACT
CTTGGGAATGTTATTGATCCCCTTGATACTATCGAGGAGTTTGGCACTGA
TGCCTTGCGATTCACTCTTGCTTTAGGAACTGCTGGTCAGGACCTTAATT
TATCTACTGAGAGGCTAACAGCAAACAAAGCCTTCACAAACAAATTGTGG
AATGCTGGCAAATTTGTGCTGCAGAATCTTCCTGATCGGGATAATGTTTC
TGGTTGGCAGACTATACAGGCATATAAGTTTGACATGGAGGAGTCTCTTT
TAAGGCTTCCGCTTTCAGAATGTTGGGTGGTCTCAAAACTGCATTTGCTT
ATTGATGCAGTCACTGAGAGTTATAACAAGTTTTTCTTTGGGGAAGTTGG
AAGAGAAACGTATGATTTCATTTGGGGTGATTTTGCTGACTGGTATATTG
AAGCGAGTAAAGCTCGCCTTTACCACTCTGGAGATGATTCAGTTGCTTTA
GTAGCACAGGCTGTTCTACTTTATGTGTTTGAGAGTATACTGAAACTATT
ACATCCATTCATGCCATTTGTAACTGAGGAGCTGTGGCAGGCACTTCCCA
ATCGGAAAGAAGCTCTTATAATATCTTCTTGGCCACAAATTTCTCTTCCC
AGGAACACTACTTTGGTAAAAAGATTTGAAAATTTACAAGCTCTGACTCG
AGCAATCCGGAATGCTAGAGCTGAGTATTCTGTTGAGCCAGCAAAGCGTA
TATCTGCTTCTATTGTTGCCAGTGAAGAAGTCATTCAGTATATATCTGAA
GAGAAGGAGGTTTTGGCTCTCTTATCCAGGCTAGATTTAGACAATATCCA
TTTCACTGATTCTCCTCCAGGGGATGCTAAACAATCAGTTCACCTTGTTG
CAAGTGAAGGACTAGAGGCATATCTGCCCCTCACTGATATGGTTGATATT
TCTGCTGAAGTCCAACGCCTTTCCAAGCGCCTATCTAAGATGCAAACAGA
GTATGAGGGACTTAAAGCTCGTCTCAAGTCCCCTAAATTCATAGAGAAAG
CTCCTGAGGATATTGTCCGTGGGGTTCAGCAAAAAGCAGCAGAAGCAGAA
GAGAAGATTAATTTGACCAAAAACCGTTTGGATTTCCTCAAATCAACTGT
TTTGGTTTCACAATAG
back to top

protein sequence of Tc04v2_p009540.4

>Tc04v2_p009540.4 ID=Tc04v2_p009540.4|Name=Tc04v2_p009540.4|organism=Theobroma cacao|type=polypeptide|length=972bp
MSLHQMAISPPFLLSSRSAYTLNPLLFAKHRRFCFPLSQSRFSSIKRRSF
AVVASENGVFTSPELAKSFDFTSEERIYNWWQSQGYFRPKFDRGSDPFVI
SMPPPNVTGSLHMGHAMFVTLEDIMVRYHRMRGRPTLWLPGTDHAGIATQ
LVVERMLASEGIKRAELGRDEFAKRVWEWKEKYGGTITNQIKRLGASCDW
TRERFTLDEQLSRAVVEAFVKLHEKGLIYQGSYMVNWSPKLQTAVSDLEV
EYSEEPGALYYIKYRVAGGSRSDFLTIATTRPETLFGDVAIAVHPQDERY
SKYVGQMAIVPMTYGRHVPIISDKFVDKDFGTGVLKISPGHDHNDYLLAR
KLGLPILNVMNKDGTLNEVAGLYCGLDRFEARKKLWCELEETDLAVKKEP
YTLRVPRSQRGGEVIEPLVSKQWFVTMEPLAEKALRAVEKGELTIMPERF
EKIYNHWLSNIKDWCISRQLWWGHRIPVWYIVGKDCEEEYIVARSAEEAL
IKACDKYGKEIEIYQDPDVLDTWFSSALWPFSTLGWPDVSAEDFKRFYPT
TMLETGHDILFFWVARMVMMGIEFTGTVPFSYVYLHGLIRDSEGRKMSKT
LGNVIDPLDTIEEFGTDALRFTLALGTAGQDLNLSTERLTANKAFTNKLW
NAGKFVLQNLPDRDNVSGWQTIQAYKFDMEESLLRLPLSECWVVSKLHLL
IDAVTESYNKFFFGEVGRETYDFIWGDFADWYIEASKARLYHSGDDSVAL
VAQAVLLYVFESILKLLHPFMPFVTEELWQALPNRKEALIISSWPQISLP
RNTTLVKRFENLQALTRAIRNARAEYSVEPAKRISASIVASEEVIQYISE
EKEVLALLSRLDLDNIHFTDSPPGDAKQSVHLVASEGLEAYLPLTDMVDI
SAEVQRLSKRLSKMQTEYEGLKARLKSPKFIEKAPEDIVRGVQQKAAEAE
EKINLTKNRLDFLKSTVLVSQ*
back to top