Tc04v2_t010350.1

Overview
NameTc04v2_t010350.1
Unique NameTc04v2_t010350.1
TypemRNA
OrganismTheobroma cacao (cacao)
Sequence length3465
Properties
Property NameValue
Model evidenceSupporting evidence includes similarity to: 2 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 21 samples with support for all annotated introns
Productputative disease resistance protein RGA1, transcript variant X1
NotePutative disease resistance protein RGA4
Cross References
External references for this mRNA
DatabaseAccession
GeneID18601973
GenbankXM_018120267.1
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameSpeciesType
Tc04v2_g010350Tc04v2_g010350Theobroma cacaogene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameSpeciesType
Tc04v2_p010350.1Tc04v2_p010350.1Theobroma cacaopolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
exon-auto218120auto218120Theobroma cacaoexon
exon-auto218121auto218121Theobroma cacaoexon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
CDS-auto218122auto218122Theobroma cacaoCDS
CDS-auto218123auto218123Theobroma cacaoCDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc04v2_t010350.1 ID=Tc04v2_t010350.1|Name=Tc04v2_t010350.1|organism=Theobroma cacao|type=mRNA|length=3465bp
ATGGCAGATGCTCTTCTTTCAGCCGTTCTAAACACCATCTTGGAGAACAT
AAACTCCCTGTGGCTTGAAGAGTTTGGGATCACAGGGGGATTGAAAACCG
AGCTTGAGAGTCTGCAGAGCACCTTAAGTACGATCCAAGCGGTACTTCTT
GATGCAGAGGAGAAGCAGTGGAAGAGTGAGGCTATCAAGAATTGGCTGGG
AAAACTCAAAGATACGGCTTATCATCTAGATGATATACTAGATGAGTTTG
CAACAAACACTCAAAAAGAAAGGCTGCAGAGAGATGCTAGAAGCCAGGTA
TGCACCTTCCATTACCTTCCTAAACAGCTTTTATTTCGCTCAAAGATGGC
GCATAAGTTGAAGGATGTTAGAGAAAAATTGGATGCAGTTGCTGGGGAGA
GATCCAAGTTCCATTTGAGAGAAGGGATGGAGCCCTTGGAGGACAGAGAA
GTGAGTGATACAGAGTGGAGAAAAACTAGCTCACTTGTAAATGAATTAGA
GGTGTATGGAAGAGATAAGGAGCTGGATAGAATAATCAATATGCTGTTGA
ACAATTTAGCAGATCAGGATGGGATTTCTGTTTACACTATATGTGGCATG
GGAGGACTCGGAAAGACAACACTTGCTCAATTGGTTTATAATGATGAAAG
CATAAGAAAGGCTTTTGATTTGAGAATTTGGGTATGTGTATCCGATGATT
TTGATATTACAAGATTAACAAAAGCCATTATAGAGTCCATTGAAGGAAAG
TGCAGTATAGAAGAACTAGATCCCCTGCTAAGACACCTACAAGAAAAACT
AATTGGGAAAAGGTTTTTGCTTGTATTGGATGATGTGTGGAATGAATATC
ACGAAAAGTGGGAAGGATTGAAGGAAGCATTTAGATGCGGTGCGAAAGGA
AGCACAGTTATAGTCACTACCCGTATCGAGAAAGTTGCCCTTATGATGAC
AACTACTCCTATACACCACTTGGGAAGCTTGTCCTGTGATGATTCTTGGT
CCTTATTCAAGCAGCGTGCGTTTAGGATGGGAAAGAGCGAGGATTACCCA
CACTTAGAAGCACTTGGAAAGGAAATAGTTAAGAAGTGTGGGGGGGTGCC
CTTAGCACTAAAGGCTTTGGGAGGTTTGTTGCGTTTCAAAGAAAGAGAGA
GTGAGTGGCTATCGATCAAAGAAAGCGAGATGTGGGAATTGGCAGATGAG
GGGAGCAAAGTCTTATCTGTGTTGAATTTGAGTTACAGACGTCTAAAACC
GCATTTGAGACAATGTTTTACATTTTGCTCTATATTTCCCAAAGATTATA
TCATGAGTAAAGAGCAGTTGATACAACTTTGGATGGCTAATGGCTTTGTT
CCTGCAAGAGGACAAATGAATTTGCATGACATGGGCTGTGAAATCTTCAA
TGAATTAGCTTGGAGGTCCTTTTTCCAAGAACTCGTGGAGGATTTTGAAG
GAAATTCAACATGTAAAATGCATGACCTTATCCATGATCTTGCACAATCA
ATTATGAGTTGCGAGTGCTCTGTGACTGAACCAAGTCAGCTAGTGTTGAC
TGCGCCCAAAACAGTTCGTCACATGTTTGCTTCTGGTAATTCGTCTATAT
TTGCTCCTTCAAATGTGGACAACCTACCCAAAGTCTGTTCCTTGCGCACA
TTGTTTGTACGTAACAACTTCCATTGGAGAATTGCAACTAAACAGAAGCA
TCTGAGGGCATTACACGTTACATTTAATGGAGGAATGAAAATCTCAATTG
ATGATAAGTTCAGACATCTAAGGTATCTGAGCCTTGTTAATTCTGGAATT
GAAACACTGCCAGAATCACTATGCAGCTTCCAAAAATTGCAGACACTAAA
TCTGATATGTTGTTATCACCTTCGCAAATTACCCAAAGGTTTGAAGCTCT
TGAAAAGTCTTACATATTTAGACATAAAATATTGTAATGCACTTACTCGT
ATGCCTGTTGGCCTGGGGCAATTGTCTTGCTTGCGTAGGCTGAGCATGTT
CATTGTGGGAAAGGACCGTGGTTGCTGTATAGACGAATTAAAAGGGCTGG
CTCTTGAGGGAGAGCTTTGCATTGAAGAACTTGATAATGTAAAAAGTTTA
ATAGATGCTAAAAGTGCCAATCTGATAATGAAGCAAAATCTAAGATCACT
AGGCTTATCTTGGCGCAAAATCGACAATTGTTACCTACATGAAAATGCTG
AAGAGGTTCTTAGTGGTCTCCAACCTCATTCAAGTTTGAAGACGCTAAGC
ATACGAAATTACCATGGTCCAAAGTTTTCATATTGGTTGATGGATCTCCT
TGTTCCAAACCTAGTTGACATCACACTGGTAAATTGTGAAAGATGTGAAT
GCCTTCCACCTCTTGGTAAATTAGGCTTCCTCAAGTCCCTCACCATTACT
GGAATGGATGCTCTAAAATCTATTGATAATAGCTTCTATGGAGATGGCGA
GAGTTCATTCTCGTCACTGGAGAGTCTCTGTTTCGAGAATATGCTTTCTT
TCGAGGAATGGACAACAGTGAAGGGGAAGGAAAATTTTCCTCAGCTAAGA
TCATTAGTTATTAGAGATTGTCCGAAGCTAGTTGAAATGCCTATGCTTCA
ATCTCTGAAAATATTAGAAATTAGCAAAACCAGCGTCTCATTACTTAGCT
CCGTGATGCATTTCACTTTTCTCACCTCTCTCTTACTGGGCGGCTTTGAT
GGCTTGACGGTTATGCCAGATGGACTATTGCAAAATCACAAGCACCTTGA
AAGCTTGGAGATACGTTTTAAAAAGCTGAAATCTCTATCAAATCTTCTAG
ATAACCTATCTGCTCTCGAGCAATTGGATCTTCAGGACTGCCTAGAGCTT
GAAAATATTCCAGCAGGACTAGAAAACCTCAGCTCTTTGGAGAGATTGCA
TTTAAGTGAGTGTAACAGCCTTGTAACCCTTCCAGAAGATGGATTGCGTG
GTTTATCTTCCCTTTCTTCGCTGTGGTTTCAAGGGTGTCAGAAATTAGCC
TCTTTATCTGATGGAGTGAGATATCTGACTTCGCTCCGAGACTTACTCGT
CAATGATTGTCCAGAGTTAAACTCATTGCCCGAGTGTATCCAACATCTCT
CTGCACTTCGGAGTTTGAGGATTTGGCATTGTGAGAGATTAACTTCTCTG
CCAAATGGGATAGAAAACCTTGCCTTGCTTTCAGAATTGGAGATCATGCG
TTGCGATAATCTAATGTGTCTGCCTCAAGGGCTACAGAGTCTCACGGCAC
TCACAATACTGAGGATTGTAGGATGCCGACATCTGGAAAGGCGGTGCAGG
AGAGAGAGAGGAGAGGATTGGCCCATCATAGCCCACATTCCTTCTATTGT
AATCATGTCCCGTGGAGAGTACTTTTTTCGAGGACGAAGAAGGCCTCTTG
GCAATCTGTTAACAAGGGTTGGTGATTGGACAAATGGGCTCTCCAGAAAG
TTTTGGAAATCTTAG
back to top

protein sequence of Tc04v2_p010350.1

>Tc04v2_p010350.1 ID=Tc04v2_p010350.1|Name=Tc04v2_p010350.1|organism=Theobroma cacao|type=polypeptide|length=1155bp
MADALLSAVLNTILENINSLWLEEFGITGGLKTELESLQSTLSTIQAVLL
DAEEKQWKSEAIKNWLGKLKDTAYHLDDILDEFATNTQKERLQRDARSQV
CTFHYLPKQLLFRSKMAHKLKDVREKLDAVAGERSKFHLREGMEPLEDRE
VSDTEWRKTSSLVNELEVYGRDKELDRIINMLLNNLADQDGISVYTICGM
GGLGKTTLAQLVYNDESIRKAFDLRIWVCVSDDFDITRLTKAIIESIEGK
CSIEELDPLLRHLQEKLIGKRFLLVLDDVWNEYHEKWEGLKEAFRCGAKG
STVIVTTRIEKVALMMTTTPIHHLGSLSCDDSWSLFKQRAFRMGKSEDYP
HLEALGKEIVKKCGGVPLALKALGGLLRFKERESEWLSIKESEMWELADE
GSKVLSVLNLSYRRLKPHLRQCFTFCSIFPKDYIMSKEQLIQLWMANGFV
PARGQMNLHDMGCEIFNELAWRSFFQELVEDFEGNSTCKMHDLIHDLAQS
IMSCECSVTEPSQLVLTAPKTVRHMFASGNSSIFAPSNVDNLPKVCSLRT
LFVRNNFHWRIATKQKHLRALHVTFNGGMKISIDDKFRHLRYLSLVNSGI
ETLPESLCSFQKLQTLNLICCYHLRKLPKGLKLLKSLTYLDIKYCNALTR
MPVGLGQLSCLRRLSMFIVGKDRGCCIDELKGLALEGELCIEELDNVKSL
IDAKSANLIMKQNLRSLGLSWRKIDNCYLHENAEEVLSGLQPHSSLKTLS
IRNYHGPKFSYWLMDLLVPNLVDITLVNCERCECLPPLGKLGFLKSLTIT
GMDALKSIDNSFYGDGESSFSSLESLCFENMLSFEEWTTVKGKENFPQLR
SLVIRDCPKLVEMPMLQSLKILEISKTSVSLLSSVMHFTFLTSLLLGGFD
GLTVMPDGLLQNHKHLESLEIRFKKLKSLSNLLDNLSALEQLDLQDCLEL
ENIPAGLENLSSLERLHLSECNSLVTLPEDGLRGLSSLSSLWFQGCQKLA
SLSDGVRYLTSLRDLLVNDCPELNSLPECIQHLSALRSLRIWHCERLTSL
PNGIENLALLSELEIMRCDNLMCLPQGLQSLTALTILRIVGCRHLERRCR
RERGEDWPIIAHIPSIVIMSRGEYFFRGRRRPLGNLLTRVGDWTNGLSRK
FWKS*
back to top