Tc04v2_t009540.6

Overview
NameTc04v2_t009540.6
Unique NameTc04v2_t009540.6
TypemRNA
OrganismTheobroma cacao (cacao)
Sequence length2700
Properties
Property NameValue
Model evidenceSupporting evidence includes similarity to: 2 ESTs, 7 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 3 samples with support for all annotated introns
Productvaline--tRNA ligase, chloroplastic/mitochondrial 2, transcript variant X5
NoteValine--tRNA ligase, chloroplastic/mitochondrial 2
Cross References
External references for this mRNA
DatabaseAccession
GeneID18601866
GenbankXM_018120006.1
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameSpeciesType
Tc04v2_g009540Tc04v2_g009540Theobroma cacaogene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameSpeciesType
Tc04v2_p009540.6Tc04v2_p009540.6Theobroma cacaopolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
exon-auto216817auto216817Theobroma cacaoexon
exon-auto216818auto216818Theobroma cacaoexon
exon-auto216819auto216819Theobroma cacaoexon
exon-auto216820auto216820Theobroma cacaoexon
exon-auto216821auto216821Theobroma cacaoexon
exon-auto216822auto216822Theobroma cacaoexon
exon-auto216823auto216823Theobroma cacaoexon
exon-auto216824auto216824Theobroma cacaoexon
exon-auto216825auto216825Theobroma cacaoexon
exon-auto216826auto216826Theobroma cacaoexon
exon-auto216827auto216827Theobroma cacaoexon
exon-auto216828auto216828Theobroma cacaoexon
exon-auto216829auto216829Theobroma cacaoexon
exon-auto216830auto216830Theobroma cacaoexon
exon-auto216831auto216831Theobroma cacaoexon
exon-auto216832auto216832Theobroma cacaoexon
exon-auto216833auto216833Theobroma cacaoexon
exon-auto216834auto216834Theobroma cacaoexon
exon-auto216835auto216835Theobroma cacaoexon
exon-auto216836auto216836Theobroma cacaoexon
exon-auto216837auto216837Theobroma cacaoexon
exon-auto216838auto216838Theobroma cacaoexon
exon-auto216839auto216839Theobroma cacaoexon
exon-auto216840auto216840Theobroma cacaoexon
exon-auto216841auto216841Theobroma cacaoexon
exon-auto216842auto216842Theobroma cacaoexon
exon-auto216843auto216843Theobroma cacaoexon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
CDS-auto216844auto216844Theobroma cacaoCDS
CDS-auto216845auto216845Theobroma cacaoCDS
CDS-auto216846auto216846Theobroma cacaoCDS
CDS-auto216847auto216847Theobroma cacaoCDS
CDS-auto216848auto216848Theobroma cacaoCDS
CDS-auto216849auto216849Theobroma cacaoCDS
CDS-auto216850auto216850Theobroma cacaoCDS
CDS-auto216851auto216851Theobroma cacaoCDS
CDS-auto216852auto216852Theobroma cacaoCDS
CDS-auto216853auto216853Theobroma cacaoCDS
CDS-auto216854auto216854Theobroma cacaoCDS
CDS-auto216855auto216855Theobroma cacaoCDS
CDS-auto216856auto216856Theobroma cacaoCDS
CDS-auto216857auto216857Theobroma cacaoCDS
CDS-auto216858auto216858Theobroma cacaoCDS
CDS-auto216859auto216859Theobroma cacaoCDS
CDS-auto216860auto216860Theobroma cacaoCDS
CDS-auto216861auto216861Theobroma cacaoCDS
CDS-auto216862auto216862Theobroma cacaoCDS
CDS-auto216863auto216863Theobroma cacaoCDS
CDS-auto216864auto216864Theobroma cacaoCDS
CDS-auto216865auto216865Theobroma cacaoCDS
CDS-auto216866auto216866Theobroma cacaoCDS
CDS-auto216867auto216867Theobroma cacaoCDS
CDS-auto216868auto216868Theobroma cacaoCDS
CDS-auto216869auto216869Theobroma cacaoCDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc04v2_t009540.6 ID=Tc04v2_t009540.6|Name=Tc04v2_t009540.6|organism=Theobroma cacao|type=mRNA|length=2700bp
ATGTGGCAGTCTCAAGGGTATTTCAGGCCAAAATTTGACCGGGGAAGTGA
TCCTTTTGTCATATCAATGCCACCACCTAATGTTACTGGCTCTCTGCACA
TGGGACATGCAATGTTTGTGACTCTTGAGGATATCATGGTTAGGTACCAT
CGCATGAGGGGAAGACCAACACTTTGGCTTCCTGGGACTGATCATGCTGG
TATTGCGACTCAGTTGGTTGTGGAAAGAATGTTGGCGTCTGAAGGAATAA
AAAGGGCAGAACTGGGCAGAGATGAATTTGCAAAACGAGTTTGGGAGTGG
AAAGAGAAGTATGGTGGGACCATTACAAATCAGATTAAAAGACTTGGGGC
TTCTTGTGATTGGACTAGAGAAAGGTTCACCCTTGATGAGCAGCTAAGTC
GAGCTGTTGTTGAGGCGTTTGTTAAACTTCATGAAAAAGGTTTAATCTAT
CAAGGGTCTTATATGGTTAACTGGTCTCCCAAGTTACAGACTGCTGTTTC
AGACTTGGAAGTAGAATATTCTGAAGAGCCTGGTGCCCTATATTATATTA
AGTATCGAGTTGCTGGAGGTTCAAGGAGTGATTTCTTGACAATAGCAACG
ACGCGGCCTGAAACTTTGTTTGGTGATGTAGCTATTGCTGTGCATCCTCA
GGATGAGCGATATTCCAAGTATGTTGGTCAAATGGCAATTGTTCCTATGA
CATATGGTCGTCATGTTCCCATTATCTCTGATAAGTTTGTTGATAAAGAC
TTTGGGACAGGTGTGCTGAAGATAAGCCCTGGCCATGATCATAATGATTA
TCTTCTAGCTAGAAAGCTTGGTCTTCCAATTCTTAATGTTATGAACAAGG
ATGGAACACTAAATGAGGTTGCCGGACTGTACTGTGGTCTTGATCGGTTT
GAGGCACGGAAGAAATTGTGGTGCGAACTTGAGGAGACTGACTTAGCTGT
GAAAAAGGAACCTTACACTTTACGAGTACCAAGATCCCAGCGTGGTGGAG
AGGTAATAGAGCCATTAGTTAGCAAGCAATGGTTTGTAACAATGGAGCCC
TTGGCTGAAAAGGCCCTTCGTGCAGTTGAAAAGGGAGAACTGACGATTAT
GCCTGAAAGATTTGAGAAGATTTATAATCATTGGCTATCAAATATAAAGG
ATTGGTGCATAAGCAGACAGCTGTGGTGGGGACACCGCATACCTGTTTGG
TACATTGTTGGAAAAGACTGTGAAGAGGAATATATAGTTGCTAGGAGTGC
TGAGGAAGCACTTATAAAGGCTTGTGATAAATATGGCAAAGAAATAGAAA
TATATCAGGATCCAGATGTTCTTGACACTTGGTTCTCAAGTGCACTATGG
CCTTTCAGTACTCTTGGGTGGCCAGATGTGTCAGCAGAGGATTTTAAAAG
GTTTTATCCAACAACAATGCTTGAAACTGGGCATGATATATTGTTCTTTT
GGGTTGCAAGAATGGTTATGATGGGAATTGAATTCACAGGAACTGTTCCA
TTTTCGTATGTATATCTTCATGGACTTATCCGCGACTCAGAAGGGCGTAA
AATGTCTAAAACTCTTGGGAATGTTATTGATCCCCTTGATACTATCGAGG
AGTTTGGCACTGATGCCTTGCGATTCACTCTTGCTTTAGGAACTGCTGGT
CAGGACCTTAATTTATCTACTGAGAGGCTAACAGCAAACAAAGCCTTCAC
AAACAAATTGTGGAATGCTGGCAAATTTGTGCTGCAGAATCTTCCTGATC
GGGATAATGTTTCTGGTTGGCAGACTATACAGGCATATAAGTTTGACATG
GAGGAGTCTCTTTTAAGGCTTCCGCTTTCAGAATGTTGGGTGGTCTCAAA
ACTGCATTTGCTTATTGATGCAGTCACTGAGAGTTATAACAAGTTTTTCT
TTGGGGAAGTTGGAAGAGAAACGTATGATTTCATTTGGGGTGATTTTGCT
GACTGGTATGTTGAATGCATTTATGAGTATATTGAAGCGAGTAAAGCTCG
CCTTTACCACTCTGGAGATGATTCAGTTGCTTTAGTAGCACAGGCTGTTC
TACTTTATGTGTTTGAGAGTATACTGAAACTATTACATCCATTCATGCCA
TTTGTAACTGAGGAGCTGTGGCAGGCACTTCCCAATCGGAAAGAAGCTCT
TATAATATCTTCTTGGCCACAAATTTCTCTTCCCAGGAACACTACTTTGG
TAAAAAGATTTGAAAATTTACAAGCTCTGACTCGAGCAATCCGGAATGCT
AGAGCTGAGTATTCTGTTGAGCCAGCAAAGCGTATATCTGCTTCTATTGT
TGCCAGTGAAGAAGTCATTCAGTATATATCTGAAGAGAAGGAGGTTTTGG
CTCTCTTATCCAGGCTAGATTTAGACAATATCCATTTCACTGATTCTCCT
CCAGGGGATGCTAAACAATCAGTTCACCTTGTTGCAAGTGAAGGACTAGA
GGCATATCTGCCCCTCACTGATATGGTTGATATTTCTGCTGAAGTCCAAC
GCCTTTCCAAGCGCCTATCTAAGATGCAAACAGAGTATGAGGGACTTAAA
GCTCGTCTCAAGTCCCCTAAATTCATAGAGAAAGCTCCTGAGGATATTGT
CCGTGGGGTTCAGCAAAAAGCAGCAGAAGCAGAAGAGAAGATTAATTTGA
CCAAAAACCGTTTGGATTTCCTCAAATCAACTGTTTTGGTTTCACAATAG
back to top

protein sequence of Tc04v2_p009540.6

>Tc04v2_p009540.6 ID=Tc04v2_p009540.6|Name=Tc04v2_p009540.6|organism=Theobroma cacao|type=polypeptide|length=900bp
MWQSQGYFRPKFDRGSDPFVISMPPPNVTGSLHMGHAMFVTLEDIMVRYH
RMRGRPTLWLPGTDHAGIATQLVVERMLASEGIKRAELGRDEFAKRVWEW
KEKYGGTITNQIKRLGASCDWTRERFTLDEQLSRAVVEAFVKLHEKGLIY
QGSYMVNWSPKLQTAVSDLEVEYSEEPGALYYIKYRVAGGSRSDFLTIAT
TRPETLFGDVAIAVHPQDERYSKYVGQMAIVPMTYGRHVPIISDKFVDKD
FGTGVLKISPGHDHNDYLLARKLGLPILNVMNKDGTLNEVAGLYCGLDRF
EARKKLWCELEETDLAVKKEPYTLRVPRSQRGGEVIEPLVSKQWFVTMEP
LAEKALRAVEKGELTIMPERFEKIYNHWLSNIKDWCISRQLWWGHRIPVW
YIVGKDCEEEYIVARSAEEALIKACDKYGKEIEIYQDPDVLDTWFSSALW
PFSTLGWPDVSAEDFKRFYPTTMLETGHDILFFWVARMVMMGIEFTGTVP
FSYVYLHGLIRDSEGRKMSKTLGNVIDPLDTIEEFGTDALRFTLALGTAG
QDLNLSTERLTANKAFTNKLWNAGKFVLQNLPDRDNVSGWQTIQAYKFDM
EESLLRLPLSECWVVSKLHLLIDAVTESYNKFFFGEVGRETYDFIWGDFA
DWYVECIYEYIEASKARLYHSGDDSVALVAQAVLLYVFESILKLLHPFMP
FVTEELWQALPNRKEALIISSWPQISLPRNTTLVKRFENLQALTRAIRNA
RAEYSVEPAKRISASIVASEEVIQYISEEKEVLALLSRLDLDNIHFTDSP
PGDAKQSVHLVASEGLEAYLPLTDMVDISAEVQRLSKRLSKMQTEYEGLK
ARLKSPKFIEKAPEDIVRGVQQKAAEAEEKINLTKNRLDFLKSTVLVSQ*
back to top