Tc03v2_t011920.4

Overview
NameTc03v2_t011920.4
Unique NameTc03v2_t011920.4
TypemRNA
OrganismTheobroma cacao (cacao)
Sequence length894
Properties
Property NameValue
Model evidenceSupporting evidence includes similarity to: 11 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 2 samples with support for all annotated introns
Productsuperoxide dismutase [Fe], chloroplastic, transcript variant X4
NoteSuperoxide dismutase [Fe] 2, chloroplastic
Cross References
External references for this mRNA
DatabaseAccession
GeneID18605241
GenbankXM_018116824.1
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameSpeciesType
Tc03v2_g011920Tc03v2_g011920Theobroma cacaogene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameSpeciesType
Tc03v2_p011920.4Tc03v2_p011920.4Theobroma cacaopolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
exon-auto164819auto164819Theobroma cacaoexon
exon-auto164820auto164820Theobroma cacaoexon
exon-auto164821auto164821Theobroma cacaoexon
exon-auto164822auto164822Theobroma cacaoexon
exon-auto164823auto164823Theobroma cacaoexon
exon-auto164824auto164824Theobroma cacaoexon
exon-auto164825auto164825Theobroma cacaoexon
exon-auto164826auto164826Theobroma cacaoexon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
CDS-auto164827auto164827Theobroma cacaoCDS
CDS-auto164828auto164828Theobroma cacaoCDS
CDS-auto164829auto164829Theobroma cacaoCDS
CDS-auto164830auto164830Theobroma cacaoCDS
CDS-auto164831auto164831Theobroma cacaoCDS
CDS-auto164832auto164832Theobroma cacaoCDS
CDS-auto164833auto164833Theobroma cacaoCDS
CDS-auto164834auto164834Theobroma cacaoCDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc03v2_t011920.4 ID=Tc03v2_t011920.4|Name=Tc03v2_t011920.4|organism=Theobroma cacao|type=mRNA|length=894bp
ATGGCGGCAGCTGCTTCCATGGCTACTTCACTTACGTTCCCGTTACTTCC
CTCCCAAGGTTGCTCTTACTTTATCATCAGACGCTTTTGGAGGAAGGTTG
CTACTAATTTGATAACGGCAAAGTTCGAGCTGAAGCCTCCTCCATATACG
CTGAATGCATTGGAACCACATATGAGCCGACAAACCCTGGAGTATCATTG
GGGAAAGCACCATAGAACCTATGTGGAGAACCTAAACAAGCAAATTGCAG
GAACAGAGCTAGAAGGGTTGCCCTTAGAAGACATTATAATTGTTTCATAC
AACAACGGTGATATACTCCCTGCCTTTAACAATGCTGCACAGGCCTGGAA
CCATGACTTCTTCTGGGAATCAATGAAACCAGGTGGTGGAGGAAAACCAT
CTGGAGATCTTCTAGATCTAATTGAAAGAGATTTTGGGTCTTTCGAACAA
TTTATCCAAGAGTTCAAGTCTGCTGCAGCTGCTCAATTTGGTTCTGGCTG
GGCATGGCTTGCATACAAGGCCAATAGGCTTGATGTGGAAAATGCGGTAA
ATCCTTGGCCATCAGAGAAGGACAAAAAGCTTGTAGTTGTAAAAAGTCCA
AATGCCGTGAATCCCCTTGTTTGGGACTACTTCCCACTACTCACTATTGA
TGTCTGGGAGCATGCTTATTACCTTGACTTCCAGAACCGACGACCGGATT
ACATTTCAATGTTCATGGAGAAGCTTATATCTTGGGAAGCAGTTAGTGCT
AGACTTGAAAAAGCAAAGGCCCTAGCTGCAGAGAGAGAAATGGAAGAAGA
GAGGAGGAAAAAAGAAGAAGAGGAGAAACAGACTGACGATGAAGCTGTAG
AGATGTACTTAGATAGTGACACTGATGATTCCGAGGCTGAATAG
back to top

protein sequence of Tc03v2_p011920.4

>Tc03v2_p011920.4 ID=Tc03v2_p011920.4|Name=Tc03v2_p011920.4|organism=Theobroma cacao|type=polypeptide|length=298bp
MAAAASMATSLTFPLLPSQGCSYFIIRRFWRKVATNLITAKFELKPPPYT
LNALEPHMSRQTLEYHWGKHHRTYVENLNKQIAGTELEGLPLEDIIIVSY
NNGDILPAFNNAAQAWNHDFFWESMKPGGGGKPSGDLLDLIERDFGSFEQ
FIQEFKSAAAAQFGSGWAWLAYKANRLDVENAVNPWPSEKDKKLVVVKSP
NAVNPLVWDYFPLLTIDVWEHAYYLDFQNRRPDYISMFMEKLISWEAVSA
RLEKAKALAAEREMEEERRKKEEEEKQTDDEAVEMYLDSDTDDSEAE*
back to top