Tc01v2_t007550.1

Overview
NameTc01v2_t007550.1
Unique NameTc01v2_t007550.1
TypemRNA
OrganismTheobroma cacao (cacao)
Sequence length3174
Properties
Property NameValue
Model evidenceSupporting evidence includes similarity to: 1 EST, 12 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 15 samples with support for all annotated introns
Productuncharacterized LOC18611437, transcript variant X1
NoteMethylmalonate-semialdehyde dehydrogenase [acylating], mitochondrial
Cross References
External references for this mRNA
DatabaseAccession
GeneID18611437
GenbankXM_007047680.2
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameSpeciesType
Tc01v2_g007550Tc01v2_g007550Theobroma cacaogene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameSpeciesType
Tc01v2_p007550.1Tc01v2_p007550.1Theobroma cacaopolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
exon-auto16276auto16276Theobroma cacaoexon
exon-auto16277auto16277Theobroma cacaoexon
exon-auto16278auto16278Theobroma cacaoexon
exon-auto16279auto16279Theobroma cacaoexon
exon-auto16280auto16280Theobroma cacaoexon
exon-auto16281auto16281Theobroma cacaoexon
exon-auto16282auto16282Theobroma cacaoexon
exon-auto16283auto16283Theobroma cacaoexon
exon-auto16284auto16284Theobroma cacaoexon
exon-auto16285auto16285Theobroma cacaoexon
exon-auto16286auto16286Theobroma cacaoexon
exon-auto16287auto16287Theobroma cacaoexon
exon-auto16288auto16288Theobroma cacaoexon
exon-auto16289auto16289Theobroma cacaoexon
exon-auto16290auto16290Theobroma cacaoexon
exon-auto16291auto16291Theobroma cacaoexon
exon-auto16292auto16292Theobroma cacaoexon
exon-auto16293auto16293Theobroma cacaoexon
exon-auto16294auto16294Theobroma cacaoexon
exon-auto16295auto16295Theobroma cacaoexon
exon-auto16296auto16296Theobroma cacaoexon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
CDS-auto16297auto16297Theobroma cacaoCDS
CDS-auto16298auto16298Theobroma cacaoCDS
CDS-auto16299auto16299Theobroma cacaoCDS
CDS-auto16300auto16300Theobroma cacaoCDS
CDS-auto16301auto16301Theobroma cacaoCDS
CDS-auto16302auto16302Theobroma cacaoCDS
CDS-auto16303auto16303Theobroma cacaoCDS
CDS-auto16304auto16304Theobroma cacaoCDS
CDS-auto16305auto16305Theobroma cacaoCDS
CDS-auto16306auto16306Theobroma cacaoCDS
CDS-auto16307auto16307Theobroma cacaoCDS
CDS-auto16308auto16308Theobroma cacaoCDS
CDS-auto16309auto16309Theobroma cacaoCDS
CDS-auto16310auto16310Theobroma cacaoCDS
CDS-auto16311auto16311Theobroma cacaoCDS
CDS-auto16312auto16312Theobroma cacaoCDS
CDS-auto16313auto16313Theobroma cacaoCDS
CDS-auto16314auto16314Theobroma cacaoCDS
CDS-auto16315auto16315Theobroma cacaoCDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc01v2_t007550.1 ID=Tc01v2_t007550.1|Name=Tc01v2_t007550.1|organism=Theobroma cacao|type=mRNA|length=3174bp
ATGGAGACTCAAAATCAGCCAGAGTTTAGTGGACAGAAAAGGATGCTTCC
TCCACCTGCTGGAAATTTTCAAGATCGTGAGGAGCTCATTAAACACGTGC
GTGATTTTGGGGCTTCCCAAGGATATGTAGTGACTATTAAGAAATCTAGG
AAGGACAGAAGAGTCATCCTTGGGTGTGACAGAGGAGGTGTTTATCGCAA
TAGGCGTAAGATTGATGAAAGTAAACGCAAAAGGAAAGCATGCTCAAGGC
TTATCAATTGTCCTTTTGAAGCCATAGGTAAAAAGGAAGATGATGCATGG
GTTCTCACCATAAAAAACGGGGAGCATAATCACGAGCCTTTAAAAGATAT
GTCAGAGCATCCTTATAGTCGTCGTTTTACTGAGGAAGAAGTTAGGCAAA
TCAAATTAATGACTGAAGCTGGTATAAAACCACGTCAAGTGCTCAAGGCT
TTGAAACAAAGTAACCCAGAGTTGCAGTCAACTCCAAGGCATTTGTACAA
CCTTAAAGCTAAGATTCGTCAAGGAAATTTATCAGAGAAAAGTTTCAAAT
CATGGAGGCCTAACAGATCTGTTCCTGTAAGCACGAATGGAACTCTCACT
GGAGAGTTGTTAAGGCAAAACAACCAGCCGGTGAAAGTTCCTAATTTTAT
TGGAGGGAAATTTGTGCATTCACAAGGGTCCATGGTCATTGACGTAATTA
ATCCTGCAACACAAGAGGTTGTTTCTCAAGTTCCTTCAGCTACCTACGAA
GAGTTCAAAGATGCAGTTAATGCTGCCAAGCAAGCTTTTTCCTCTTGGAA
GAATACACCGGTTGCAACTCGCCAGCGCATCATGTTCAAGCTCCAGGAGC
TCATCCACAGAAATATTGATAAGCTTGCAATGAATATCACGATGGAACAG
GGAATGACTTTAAAGAGAGCCCAGGGTGATGTGTTGCGTGGTTTAGAGGT
TGTTGAACATGCTTGTGGACTGGCAACTCTGCAAATGGGGGAGTTTGTCC
CGAATGCATCTAATGGCATTGACACGTACTTCATTAGAGAACCACTCGGT
GTGTGTGCTGGGATATGTCCCTCTAACTTTCCTGCAATGATCCCTTTATT
GATGTTTCCTATTGCAGTTTCATGTGGCAATACATTTATTCTTAAGCCAT
GTGAAAAAAATCCAGGGGCTTCAATGATTCTTGCAGCACTAGCAAAGGAG
GCTGGTTTGCCTGATGGTGTCTTAAATATTGTTCATGGCACCAATGATAT
TGTCAATTATATTTGTGATGATGAGGATATAAAAGCTATATCTTTTGTTG
GTTCAAACACAGCTGGCATGCATATATATGCTAGGGCTGCTGCTAGAGGG
AAACGTATTCAGTCCAATGTAGGAGGCAAGAATTATGCAATTATCATGCC
TGATGCAAGCATAGATGCTACTTTAAGTTCTCTAGTTGCAGGCGGATTTG
GAGCTGCAGGGCAGAGGTGCATAGGTCTAAGTACAGCAGTTTTTGTTGGA
GGTTCAATGCCATGGGAAGAAGAACTTTTGGAGCGTGCCAAAGCACTTAA
AGTGAATGTAGGATCAGATCCTGGTGCAGATGTAGGTCCGGTGATTAGTA
AGGAGGTAAAGGATCGCATAAATAGATTAGTTCAAAGCAGTGTTGATGGT
GGTGCTAGACTTATTCTTGATGGGAGAAATATTGTGGTTCCTGGTTATGA
GAATGGGAATTTTATTGGTCCTACTATCATATGTGATGTTGCATCCAATA
TGGAGTGCTGCAAGGAAGAAATATTTGGACCGGTTCTCCTTTGTATGCAG
GCTGGGAGCCTAGAAGGGGCCATAGCCATTGTAAACAGAAACAAGTCCGT
GAATGGAGCTTCTATATTCACAACATCTGGCTATGCTGCAAGGAAGTTTC
AGAATGAAATCGAGTCCGGCTTGGTTGGGATCAATGTTCCTGTTCCCGTT
GCTATTCCAATGCCTTTTTCCTCTTTTAATGGACCAAGAACATCTTTTGC
CGGAGATCTTAATTTTTGTGGAAAGTCAGGTGTGCATTTTTACACCCAGA
TCAAAATGGTGGCACAGCAGTGGAGGGATTTACCAAGCCTAGGATTGTCC
TCGGGTTTGCATCTATCATCTGAGACAGATATTACAAGCCGGGGAGTCTC
TTCAGCATTGCCTCCATCATCAGAGAGAGATTCACCATACCGTAGAGTTT
CGCGGGCCATGTCTCCAGAATCAGAGGGTAATTCACCAAATCATGCATTG
TTGCTTTCTGTTGCTGCAACTTCAGAGAGGGATCTATCAAACCCGGTAAT
TACATCTCTGCCTCCAACTGCTGATGGTGATTTACCAAATCATGGAGCAT
CTCTCCTCATACCTCCGACATCAGAGATGGATTTGGAGAACCAAGATGCA
TCCCTAACCGTGCCATTAGGAAGAGAAACATCAAACCAAGGAGTGTCATC
AGCAACATCCCATCAATCTGAAAGGATGTATACGTCGCAAACATCACAGT
GGAATGAAACTCCGACACTAGCATCTCAAAGAAATGAGCCTATTCCTCCA
CCCTCTGAGAGGATTAATATACCTACAACATCTAAGAGGAATAGCAATGC
AGCTCCAACAGTTCCGAGGTCGGACACTGCAATAGGTTTAACTCATGAGC
GACTATATTTGCCTACATCCCATAAAAATGACAGTATGGTTCCCATTTCA
CATAGGAATGAAAGCATGTCTCCAACTTCCGAGAGAATATATATGATGGC
AACTTCTCACTTGAGCGACAGTATGGGTCAAACGTTTCAGAGGACTGATG
CCCCAATGTTTCCAACTTCTGAGAGGATGTATGTACCTGCCACTCCTCAC
AGGACCGACCACATGGGATCAACTTCTCAGAGGGCTGATGTTGCATTACA
GCCAGCCGCCGAGAGGTTATACATGCCTGCAACATCTCAAAGGAACGATA
ACATTGCTTCGTCTTCTCACCGGGCTGAGTCCATGCCCCAAAATTCCGAG
GGCCTGTATCTGTCTCCAATTATTCACAGAAATGCTGGTATGCCGCCAAC
ATCTGAGAGGTTATATATGCCTGCAGCATCTCAGAGGATGTATGCTCAAA
ACACAATAATTTCAATGGATGATTATCCCAGCCAAGGACCACCTATGACT
TTGCCTACTTCACAGAGGATATAG
back to top

protein sequence of Tc01v2_p007550.1

>Tc01v2_p007550.1 ID=Tc01v2_p007550.1|Name=Tc01v2_p007550.1|organism=Theobroma cacao|type=polypeptide|length=1058bp
METQNQPEFSGQKRMLPPPAGNFQDREELIKHVRDFGASQGYVVTIKKSR
KDRRVILGCDRGGVYRNRRKIDESKRKRKACSRLINCPFEAIGKKEDDAW
VLTIKNGEHNHEPLKDMSEHPYSRRFTEEEVRQIKLMTEAGIKPRQVLKA
LKQSNPELQSTPRHLYNLKAKIRQGNLSEKSFKSWRPNRSVPVSTNGTLT
GELLRQNNQPVKVPNFIGGKFVHSQGSMVIDVINPATQEVVSQVPSATYE
EFKDAVNAAKQAFSSWKNTPVATRQRIMFKLQELIHRNIDKLAMNITMEQ
GMTLKRAQGDVLRGLEVVEHACGLATLQMGEFVPNASNGIDTYFIREPLG
VCAGICPSNFPAMIPLLMFPIAVSCGNTFILKPCEKNPGASMILAALAKE
AGLPDGVLNIVHGTNDIVNYICDDEDIKAISFVGSNTAGMHIYARAAARG
KRIQSNVGGKNYAIIMPDASIDATLSSLVAGGFGAAGQRCIGLSTAVFVG
GSMPWEEELLERAKALKVNVGSDPGADVGPVISKEVKDRINRLVQSSVDG
GARLILDGRNIVVPGYENGNFIGPTIICDVASNMECCKEEIFGPVLLCMQ
AGSLEGAIAIVNRNKSVNGASIFTTSGYAARKFQNEIESGLVGINVPVPV
AIPMPFSSFNGPRTSFAGDLNFCGKSGVHFYTQIKMVAQQWRDLPSLGLS
SGLHLSSETDITSRGVSSALPPSSERDSPYRRVSRAMSPESEGNSPNHAL
LLSVAATSERDLSNPVITSLPPTADGDLPNHGASLLIPPTSEMDLENQDA
SLTVPLGRETSNQGVSSATSHQSERMYTSQTSQWNETPTLASQRNEPIPP
PSERINIPTTSKRNSNAAPTVPRSDTAIGLTHERLYLPTSHKNDSMVPIS
HRNESMSPTSERIYMMATSHLSDSMGQTFQRTDAPMFPTSERMYVPATPH
RTDHMGSTSQRADVALQPAAERLYMPATSQRNDNIASSSHRAESMPQNSE
GLYLSPIIHRNAGMPPTSERLYMPAASQRMYAQNTIISMDDYPSQGPPMT
LPTSQRI*
back to top