Tc04v2_t010350.2

Overview
Name             Tc04v2_t010350.2
Unique Name      Tc04v2_t010350.2
Type             mRNA
Organism         Theobroma cacao (cacao)
Sequence length  3465 bp
Properties
Property Name   Value
Model evidence  Supporting evidence includes similarity to: 2 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 3 samples with support for all annotated introns
Product         putative disease resistance protein RGA1, transcript variant X2
Note            Putative disease resistance protein RGA4
Cross References
External references for this mRNA
Database  Accession
GeneID    18601973
Genbank   XM_018120268.1
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Species          Type
Tc04v2_g010350  Tc04v2_g010350  Theobroma cacao  gene


The following polypeptide feature(s) derive from this mRNA:

Feature Name      Unique Name       Species          Type
Tc04v2_p010350.2  Tc04v2_p010350.2  Theobroma cacao  polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name     Unique Name  Species          Type
exon-auto218126  auto218126   Theobroma cacao  exon
exon-auto218127  auto218127   Theobroma cacao  exon
exon-auto218128  auto218128   Theobroma cacao  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name    Unique Name  Species          Type
CDS-auto218129  auto218129   Theobroma cacao  CDS
CDS-auto218130  auto218130   Theobroma cacao  CDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc04v2_t010350.2 ID=Tc04v2_t010350.2|Name=Tc04v2_t010350.2|organism=Theobroma cacao|type=mRNA|length=3465bp
ATGGCAGATGCTCTTCTTTCAGCCGTTCTAAACACCATCTTGGAGAACAT
AAACTCCCTGTGGCTTGAAGAGTTTGGGATCACAGGGGGATTGAAAACCG
AGCTTGAGAGTCTGCAGAGCACCTTAAGTACGATCCAAGCGGTACTTCTT
GATGCAGAGGAGAAGCAGTGGAAGAGTGAGGCTATCAAGAATTGGCTGGG
AAAACTCAAAGATACGGCTTATCATCTAGATGATATACTAGATGAGTTTG
CAACAAACACTCAAAAAGAAAGGCTGCAGAGAGATGCTAGAAGCCAGGTA
TGCACCTTCCATTACCTTCCTAAACAGCTTTTATTTCGCTCAAAGATGGC
GCATAAGTTGAAGGATGTTAGAGAAAAATTGGATGCAGTTGCTGGGGAGA
GATCCAAGTTCCATTTGAGAGAAGGGATGGAGCCCTTGGAGGACAGAGAA
GTGAGTGATACAGAGTGGAGAAAAACTAGCTCACTTGTAAATGAATTAGA
GGTGTATGGAAGAGATAAGGAGCTGGATAGAATAATCAATATGCTGTTGA
ACAATTTAGCAGATCAGGATGGGATTTCTGTTTACACTATATGTGGCATG
GGAGGACTCGGAAAGACAACACTTGCTCAATTGGTTTATAATGATGAAAG
CATAAGAAAGGCTTTTGATTTGAGAATTTGGGTATGTGTATCCGATGATT
TTGATATTACAAGATTAACAAAAGCCATTATAGAGTCCATTGAAGGAAAG
TGCAGTATAGAAGAACTAGATCCCCTGCTAAGACACCTACAAGAAAAACT
AATTGGGAAAAGGTTTTTGCTTGTATTGGATGATGTGTGGAATGAATATC
ACGAAAAGTGGGAAGGATTGAAGGAAGCATTTAGATGCGGTGCGAAAGGA
AGCACAGTTATAGTCACTACCCGTATCGAGAAAGTTGCCCTTATGATGAC
AACTACTCCTATACACCACTTGGGAAGCTTGTCCTGTGATGATTCTTGGT
CCTTATTCAAGCAGCGTGCGTTTAGGATGGGAAAGAGCGAGGATTACCCA
CACTTAGAAGCACTTGGAAAGGAAATAGTTAAGAAGTGTGGGGGGGTGCC
CTTAGCACTAAAGGCTTTGGGAGGTTTGTTGCGTTTCAAAGAAAGAGAGA
GTGAGTGGCTATCGATCAAAGAAAGCGAGATGTGGGAATTGGCAGATGAG
GGGAGCAAAGTCTTATCTGTGTTGAATTTGAGTTACAGACGTCTAAAACC
GCATTTGAGACAATGTTTTACATTTTGCTCTATATTTCCCAAAGATTATA
TCATGAGTAAAGAGCAGTTGATACAACTTTGGATGGCTAATGGCTTTGTT
CCTGCAAGAGGACAAATGAATTTGCATGACATGGGCTGTGAAATCTTCAA
TGAATTAGCTTGGAGGTCCTTTTTCCAAGAACTCGTGGAGGATTTTGAAG
GAAATTCAACATGTAAAATGCATGACCTTATCCATGATCTTGCACAATCA
ATTATGAGTTGCGAGTGCTCTGTGACTGAACCAAGTCAGCTAGTGTTGAC
TGCGCCCAAAACAGTTCGTCACATGTTTGCTTCTGGTAATTCGTCTATAT
TTGCTCCTTCAAATGTGGACAACCTACCCAAAGTCTGTTCCTTGCGCACA
TTGTTTGTACGTAACAACTTCCATTGGAGAATTGCAACTAAACAGAAGCA
TCTGAGGGCATTACACGTTACATTTAATGGAGGAATGAAAATCTCAATTG
ATGATAAGTTCAGACATCTAAGGTATCTGAGCCTTGTTAATTCTGGAATT
GAAACACTGCCAGAATCACTATGCAGCTTCCAAAAATTGCAGACACTAAA
TCTGATATGTTGTTATCACCTTCGCAAATTACCCAAAGGTTTGAAGCTCT
TGAAAAGTCTTACATATTTAGACATAAAATATTGTAATGCACTTACTCGT
ATGCCTGTTGGCCTGGGGCAATTGTCTTGCTTGCGTAGGCTGAGCATGTT
CATTGTGGGAAAGGACCGTGGTTGCTGTATAGACGAATTAAAAGGGCTGG
CTCTTGAGGGAGAGCTTTGCATTGAAGAACTTGATAATGTAAAAAGTTTA
ATAGATGCTAAAAGTGCCAATCTGATAATGAAGCAAAATCTAAGATCACT
AGGCTTATCTTGGCGCAAAATCGACAATTGTTACCTACATGAAAATGCTG
AAGAGGTTCTTAGTGGTCTCCAACCTCATTCAAGTTTGAAGACGCTAAGC
ATACGAAATTACCATGGTCCAAAGTTTTCATATTGGTTGATGGATCTCCT
TGTTCCAAACCTAGTTGACATCACACTGGTAAATTGTGAAAGATGTGAAT
GCCTTCCACCTCTTGGTAAATTAGGCTTCCTCAAGTCCCTCACCATTACT
GGAATGGATGCTCTAAAATCTATTGATAATAGCTTCTATGGAGATGGCGA
GAGTTCATTCTCGTCACTGGAGAGTCTCTGTTTCGAGAATATGCTTTCTT
TCGAGGAATGGACAACAGTGAAGGGGAAGGAAAATTTTCCTCAGCTAAGA
TCATTAGTTATTAGAGATTGTCCGAAGCTAGTTGAAATGCCTATGCTTCA
ATCTCTGAAAATATTAGAAATTAGCAAAACCAGCGTCTCATTACTTAGCT
CCGTGATGCATTTCACTTTTCTCACCTCTCTCTTACTGGGCGGCTTTGAT
GGCTTGACGGTTATGCCAGATGGACTATTGCAAAATCACAAGCACCTTGA
AAGCTTGGAGATACGTTTTAAAAAGCTGAAATCTCTATCAAATCTTCTAG
ATAACCTATCTGCTCTCGAGCAATTGGATCTTCAGGACTGCCTAGAGCTT
GAAAATATTCCAGCAGGACTAGAAAACCTCAGCTCTTTGGAGAGATTGCA
TTTAAGTGAGTGTAACAGCCTTGTAACCCTTCCAGAAGATGGATTGCGTG
GTTTATCTTCCCTTTCTTCGCTGTGGTTTCAAGGGTGTCAGAAATTAGCC
TCTTTATCTGATGGAGTGAGATATCTGACTTCGCTCCGAGACTTACTCGT
CAATGATTGTCCAGAGTTAAACTCATTGCCCGAGTGTATCCAACATCTCT
CTGCACTTCGGAGTTTGAGGATTTGGCATTGTGAGAGATTAACTTCTCTG
CCAAATGGGATAGAAAACCTTGCCTTGCTTTCAGAATTGGAGATCATGCG
TTGCGATAATCTAATGTGTCTGCCTCAAGGGCTACAGAGTCTCACGGCAC
TCACAATACTGAGGATTGTAGGATGCCGACATCTGGAAAGGCGGTGCAGG
AGAGAGAGAGGAGAGGATTGGCCCATCATAGCCCACATTCCTTCTATTGT
AATCATGTCCCGTGGAGAGTACTTTTTTCGAGGACGAAGAAGGCCTCTTG
GCAATCTGTTAACAAGGGTTGGTGATTGGACAAATGGGCTCTCCAGAAAG
TTTTGGAAATCTTAG
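
The FASTA header of the mRNA sequence packs the record's metadata into `key=value` pairs separated by `|`. The following is a minimal Python sketch for splitting such a header, assuming this layout holds for other records as well. The function name `parse_fasta_header` is hypothetical and not part of any database API.

```python
import re


def parse_fasta_header(line):
    """Split a header of the form '>id key=value|key=value|...' into (id, fields).

    Hypothetical helper: it assumes the '|'-separated key=value layout
    seen in this record, which is not a general FASTA convention.
    """
    if not line.startswith(">"):
        raise ValueError("not a FASTA header line")
    # The record id is everything up to the first space.
    record_id, _, rest = line[1:].strip().partition(" ")
    fields = {}
    for item in rest.split("|"):
        key, sep, value = item.partition("=")
        if sep:  # skip fragments without '='
            fields[key] = value
    return record_id, fields


header = (
    ">Tc04v2_t010350.2 ID=Tc04v2_t010350.2|Name=Tc04v2_t010350.2"
    "|organism=Theobroma cacao|type=mRNA|length=3465bp"
)
record_id, fields = parse_fasta_header(header)

# The length field carries a unit suffix ('bp'); keep only the leading digits.
length_bp = int(re.match(r"\d+", fields["length"]).group())
codons, remainder = divmod(length_bp, 3)
print(record_id, fields["type"], length_bp, codons, remainder)
```

For this record, 3465 bp divides into 1155 codons with no remainder.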

protein sequence of Tc04v2_p010350.2

>Tc04v2_p010350.2 ID=Tc04v2_p010350.2|Name=Tc04v2_p010350.2|organism=Theobroma cacao|type=polypeptide|length=1155bp
MADALLSAVLNTILENINSLWLEEFGITGGLKTELESLQSTLSTIQAVLL
DAEEKQWKSEAIKNWLGKLKDTAYHLDDILDEFATNTQKERLQRDARSQV
CTFHYLPKQLLFRSKMAHKLKDVREKLDAVAGERSKFHLREGMEPLEDRE
VSDTEWRKTSSLVNELEVYGRDKELDRIINMLLNNLADQDGISVYTICGM
GGLGKTTLAQLVYNDESIRKAFDLRIWVCVSDDFDITRLTKAIIESIEGK
CSIEELDPLLRHLQEKLIGKRFLLVLDDVWNEYHEKWEGLKEAFRCGAKG
STVIVTTRIEKVALMMTTTPIHHLGSLSCDDSWSLFKQRAFRMGKSEDYP
HLEALGKEIVKKCGGVPLALKALGGLLRFKERESEWLSIKESEMWELADE
GSKVLSVLNLSYRRLKPHLRQCFTFCSIFPKDYIMSKEQLIQLWMANGFV
PARGQMNLHDMGCEIFNELAWRSFFQELVEDFEGNSTCKMHDLIHDLAQS
IMSCECSVTEPSQLVLTAPKTVRHMFASGNSSIFAPSNVDNLPKVCSLRT
LFVRNNFHWRIATKQKHLRALHVTFNGGMKISIDDKFRHLRYLSLVNSGI
ETLPESLCSFQKLQTLNLICCYHLRKLPKGLKLLKSLTYLDIKYCNALTR
MPVGLGQLSCLRRLSMFIVGKDRGCCIDELKGLALEGELCIEELDNVKSL
IDAKSANLIMKQNLRSLGLSWRKIDNCYLHENAEEVLSGLQPHSSLKTLS
IRNYHGPKFSYWLMDLLVPNLVDITLVNCERCECLPPLGKLGFLKSLTIT
GMDALKSIDNSFYGDGESSFSSLESLCFENMLSFEEWTTVKGKENFPQLR
SLVIRDCPKLVEMPMLQSLKILEISKTSVSLLSSVMHFTFLTSLLLGGFD
GLTVMPDGLLQNHKHLESLEIRFKKLKSLSNLLDNLSALEQLDLQDCLEL
ENIPAGLENLSSLERLHLSECNSLVTLPEDGLRGLSSLSSLWFQGCQKLA
SLSDGVRYLTSLRDLLVNDCPELNSLPECIQHLSALRSLRIWHCERLTSL
PNGIENLALLSELEIMRCDNLMCLPQGLQSLTALTILRIVGCRHLERRCR
RERGEDWPIIAHIPSIVIMSRGEYFFRGRRRPLGNLLTRVGDWTNGLSRK
FWKS*
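
As a quick consistency check between the two sequences in this record, the sketch below translates the first and last lines of the mRNA sequence and compares them with the ends of the polypeptide. It assumes the standard genetic code (NCBI translation table 1) and that the mRNA sequence as shown begins in frame at its first base. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, ignoring a trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the mRNA sequence listed in this record.
first_line = "ATGGCAGATGCTCTTCTTTCAGCCGTTCTAAACACCATCTTGGAGAACAT"
last_line = "TTTTGGAAATCTTAG"

start = translate(first_line)
end = translate(last_line)
print(start, end)
```

The translated start should match the opening residues of the protein sequence, and the translated end should match its final line, including the `*` stop symbol.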