Tc04v2_t024720.1

Overview
NameTc04v2_t024720.1
Unique NameTc04v2_t024720.1
TypemRNA
OrganismTheobroma cacao (cacao)
Sequence length3744
Properties
Property NameValue
Model evidenceSupporting evidence includes similarity to: 2 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 3 samples with support for all annotated introns
ProductFIP1[V]-like protein
NoteUncharacterized protein isoform 1
Cross References
External references for this mRNA
DatabaseAccession
GeneID18603647
GenbankXM_007035732.2
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameSpeciesType
Tc04v2_g024720Tc04v2_g024720Theobroma cacaogene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameSpeciesType
Tc04v2_p024720.1Tc04v2_p024720.1Theobroma cacaopolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
exon-auto249745auto249745Theobroma cacaoexon
exon-auto249746auto249746Theobroma cacaoexon
exon-auto249747auto249747Theobroma cacaoexon
exon-auto249748auto249748Theobroma cacaoexon
exon-auto249749auto249749Theobroma cacaoexon
exon-auto249750auto249750Theobroma cacaoexon
exon-auto249751auto249751Theobroma cacaoexon
exon-auto249752auto249752Theobroma cacaoexon
exon-auto249753auto249753Theobroma cacaoexon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesType
CDS-auto249754auto249754Theobroma cacaoCDS
CDS-auto249755auto249755Theobroma cacaoCDS
CDS-auto249756auto249756Theobroma cacaoCDS
CDS-auto249757auto249757Theobroma cacaoCDS
CDS-auto249758auto249758Theobroma cacaoCDS
CDS-auto249759auto249759Theobroma cacaoCDS
CDS-auto249760auto249760Theobroma cacaoCDS
CDS-auto249761auto249761Theobroma cacaoCDS
CDS-auto249762auto249762Theobroma cacaoCDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>Tc04v2_t024720.1 ID=Tc04v2_t024720.1|Name=Tc04v2_t024720.1|organism=Theobroma cacao|type=mRNA|length=3744bp
ATGGATTCGATGGATGATGATTTTGGTGACTTATACGCCGACGTTGAAAT
CCAAGCTAGCTCAGCGATCGACGCATTGTTCATTGAACCAGAAGACAATG
GCCGTAGCAATGGCGCCGAAAGCACTGACGGGGATGAGAAATTCGACCCC
GGTTCAGTTATGGAAGATAGTGACAGCGAGGACGATTTGAACATTTTGTT
GAACGATGATGACTGCGAGAAGTTTCCGGTTACCGGTGCGAGGAGTCACG
GTGGTGGCTATGAGGAGGATGAAGATAGCGGTTTTGGTGTGGAGGGAACT
GGGTCGGATAAGATTTCAAGGCGGGTGGAACCGGTTGGTGATGGGTCGGA
GCTGAATTGTAGTGGAAATGGTGTAGAAAGAGGGACTGGAGCTAAAACGC
AGTTTTCGCTCTTCAAGTATGTGAGACCTCATGGATTACCATTTTCAAGT
AATGTGAGAGTTACTGGATGTACTGGTGTTTCACCATTCTCTTCTACGTC
GGCAAGAGGTGATCGGGAAGATGATGTTTACAGCCAGAAAAAGGGTGGAA
GCTTGGTTCAGGTTGCTAACAGACATGCTACCACGAATTCATTACCACAT
CAATTTGGATATGGTTTTTCTCTTCCATGGTATAGGACAATATTGGATAT
GAAAATTGATGCATTTGAGGAGAAACCTTGGAGGCATCCCGGCATAGATA
TAACAGATTTCTTCAATTTTGGTTTCAATGAGGACAGTTGGAAACGATAC
TGTAACTCCCTGGAGAAATTTCGGCACCGGTCGTCCAGGCAGGCTAGGAT
TCCTGTTTATTTTTCTTCAAAACTTGATCAGGCTTATGAAGCTGAGGCTG
GGCTTGAGACAGCAACTCAAGAAGCTATGACTGAGGATGTATCTAAAGTT
GAACCATCATTTAAATGTGCTGATAGAGGAGAGATGCCCTTGGAATTGCC
AAAAGGAAGAGCAATTCAGGTCGAAGACAGCATCAATGAACGCCAACCAT
CCATGGATCTAAGGCGTCCACGTTTTCAGGATTCTGATGTTATTATACAG
ATAACTGTGCAGGATTTCACTGTGGATTCCTCTGAGTCTGCAAGGGAGGA
ACTAGGTCATGGTAGAAAGTGTGAAGTGTCAGAATCTGGGAAGTTGGATG
TGAAGGATGACAGAGATGTTTGCTTTTCTGTTAGTGCTGGCGGTGATGAC
CTGTCTGGAGAGCATTGTGCAAGGGTCAGAAATGCGTCCCTGTCTTGTCC
TTTGAGGTCTTTGCAGCCAACAACTGCATCTAATCAAACTTCACTGGAGA
CTAATAATCACAGAAATGACAAGCTCTCTGATATGAATGGGCGTTGTCAT
CCAAATATGGATGTTTGCATATCAGAAGGAATTGCTGAATCAATGGAAAC
AACATATAAGGAAAATGAAGTGGCTTGCAGAAATACTTACCAGTCAGATC
CTTGCATGATTGAACCAGAACAATCACTTGACGATCGGAGTCATTTTAGC
CCTACACTTTCCTTTTCTGAAAGCAATTCTGAAGAAAGATCCAAAGATAG
CGTTCATGCTGTTTCCATTGACGGTCCAAGTCCATTAAGAAGGCAATCAC
TAGATTATGGCTCTGAATTGCAGAAGTCAGTTGCATCTTATCATAAAAGT
TCCAGAATTGGTGGCAGCAAAACAAAATCAGATGATGGAGAAAGTTATTC
AATACATTCAAGTCCACTCAGAGACAAGCAAAAGCATGAGAGCTGGAGAC
ACCGACCTCTTGTGAAACAGAGGATCTTGCATGAAAGTGATGATGACATT
TCTCCAATACCAGATGCAGAGTGTGATAGGAAAAGATATCAAAGATGTAA
AAATCCCATTGAGGAAGAAAGGAAGCATCACCGTGGTAGACCTCACGGTA
TTACTGATCAGAAGATATATCCTGAAAACTGCTATAAAGCTTCCCCTTCA
TCAAATGCACTGAAACTCTGTGATAAAGATTACTCATCTGATTGTAGCAG
ACAGAAAGAAAGACTGCAAGATCTTGGTTATCATGACAGAGAAGGTTCCT
CATGCTACATGGAGAAAGGACCTTGTGTTAATGGCCATAAAAGGTTTGCT
GACAGCCATCTTCAGGCTGTCCGCACAAAAGGTCCTCTAAGCTTAAAAGA
AGATTCGGATCAGTTTGCTGGAAGAGAATGGAAAAAGGAGTTTTATCATG
GAAGAAGAGCTGGCATAGATAAAGAAGATGACATGGATGGGTTTTGGCAT
CATGGACAAAGACTTCCTGCTCAACAGGGTTTGTTTCCTCACACTTGCAG
GGAATCTGGGAGGTTAGTCTCAAGGTATTCCTCTGCTTCAAAAGAAAGAG
ATATTCAATGGAGAAGGGGATATGATGGACTCCAGCTTCGGAAGAAAACT
GATCATGATGATTGTCCATTAGATTATAAGCATGAAAATGAACGGTTAAA
AGAAACGTATGGTAGATCCATTCCATTCACTCGTTGTGAAAGGGATATGG
TTGAACCATATGAGAGATGGCTACCACCAATTAGGAGAGAATTCAAAGTT
TCTGGCAGAAAAGGTAGATATGTTGATCCTGCCTATTTCCCTTTGGATAG
ACCGTGGCCGATGGAAAGTGAAGAGTATCTGAGACACACGTATTGTAGAT
CTCTAGCCTTGGAGACTGACAGAGAACCTTCCGTACCTAATGGAAGAAGG
TGGCGTAACACTTTATTATCAAGAAATGAGGCATTTGACTCCAAGTTTAT
TAAAAGATACCATAGACATCAGAGAATAGTATGTCATGAGGAAGATGGAG
ACAATGGTCGATGTGGTTGTTATGATTATGTCGATGACAATGAAGATGGT
ATCTTGCCAAATGGGAATCAAGTTCAGTCGTGGAGAAGGGGCCATAGTCA
GCGAGGTAGAGTAGTACACTGGACGAAGGATAAACTACTTGGAAATGATA
GATTGTTAGCCCAATGGGTGTCTTTTTCCTGTCAAAAAACTTCTAAGCAT
GACTTAATTCATGCTAGGCATGGATCCCTCCGTGATGAGATGCTCATTAA
TGATTTGATGTTGGAGCATCACGGATATGAAATGATAACTGAAGGAAGTA
ATGCCAACTGTCATGAAAGAAATTCTATTATTAGGCAAAAGCAGAAGGTC
CTGAAGGACAGGGACTCAGTTGACTTGATTGTTGGGGAAGGAAAGTCTTC
TGTAAGGCACTTGGATGGTGGAAGCTTAATATGCAATGGAAGGCTTGAAA
AGATTGGCTTGGAATTTCCTATGGAGCAGAAATCTTTAAGGGATGTTAAT
GACTCTTGTGGAGGCAACAGAGTTAAGACAGACATCTCAAATACGGATGG
TAGCAGAACTATTGAGAAACAGCTTGATAAGTTTTCAGTTGCAGAGTGTA
ATCAAGATCTGGATATTGAGGAGGGTCAGATTATATGTGAAGAACAAAGT
ATTAACCTGGAAAAGGAAAATGTTTCTGAGACTATGGTGCAAAGGAGCAA
GGTCAAGATGAGAACATTGCATGTTGACAGTTCTGACGGAAATAGAGCTG
TGGGTGAATATGACAACAAACGGATAGTGGAGACACTAGCAAAGATGGAG
AAACGAAGGGAACGGTTTAAGGATCCCATCACAATAAAAATGGAGCCAGA
CAAGACTTCTGAGCCTCAAGTTGACTTGGTAGTTGACACTAATGAAATTA
AGCACCAAAGGCCTGCTCGAAAGAGGCGGTGGGGTGTAAGTTAG
back to top

protein sequence of Tc04v2_p024720.1

>Tc04v2_p024720.1 ID=Tc04v2_p024720.1|Name=Tc04v2_p024720.1|organism=Theobroma cacao|type=polypeptide|length=1248bp
MDSMDDDFGDLYADVEIQASSAIDALFIEPEDNGRSNGAESTDGDEKFDP
GSVMEDSDSEDDLNILLNDDDCEKFPVTGARSHGGGYEEDEDSGFGVEGT
GSDKISRRVEPVGDGSELNCSGNGVERGTGAKTQFSLFKYVRPHGLPFSS
NVRVTGCTGVSPFSSTSARGDREDDVYSQKKGGSLVQVANRHATTNSLPH
QFGYGFSLPWYRTILDMKIDAFEEKPWRHPGIDITDFFNFGFNEDSWKRY
CNSLEKFRHRSSRQARIPVYFSSKLDQAYEAEAGLETATQEAMTEDVSKV
EPSFKCADRGEMPLELPKGRAIQVEDSINERQPSMDLRRPRFQDSDVIIQ
ITVQDFTVDSSESAREELGHGRKCEVSESGKLDVKDDRDVCFSVSAGGDD
LSGEHCARVRNASLSCPLRSLQPTTASNQTSLETNNHRNDKLSDMNGRCH
PNMDVCISEGIAESMETTYKENEVACRNTYQSDPCMIEPEQSLDDRSHFS
PTLSFSESNSEERSKDSVHAVSIDGPSPLRRQSLDYGSELQKSVASYHKS
SRIGGSKTKSDDGESYSIHSSPLRDKQKHESWRHRPLVKQRILHESDDDI
SPIPDAECDRKRYQRCKNPIEEERKHHRGRPHGITDQKIYPENCYKASPS
SNALKLCDKDYSSDCSRQKERLQDLGYHDREGSSCYMEKGPCVNGHKRFA
DSHLQAVRTKGPLSLKEDSDQFAGREWKKEFYHGRRAGIDKEDDMDGFWH
HGQRLPAQQGLFPHTCRESGRLVSRYSSASKERDIQWRRGYDGLQLRKKT
DHDDCPLDYKHENERLKETYGRSIPFTRCERDMVEPYERWLPPIRREFKV
SGRKGRYVDPAYFPLDRPWPMESEEYLRHTYCRSLALETDREPSVPNGRR
WRNTLLSRNEAFDSKFIKRYHRHQRIVCHEEDGDNGRCGCYDYVDDNEDG
ILPNGNQVQSWRRGHSQRGRVVHWTKDKLLGNDRLLAQWVSFSCQKTSKH
DLIHARHGSLRDEMLINDLMLEHHGYEMITEGSNANCHERNSIIRQKQKV
LKDRDSVDLIVGEGKSSVRHLDGGSLICNGRLEKIGLEFPMEQKSLRDVN
DSCGGNRVKTDISNTDGSRTIEKQLDKFSVAECNQDLDIEEGQIICEEQS
INLEKENVSETMVQRSKVKMRTLHVDSSDGNRAVGEYDNKRIVETLAKME
KRRERFKDPITIKMEPDKTSEPQVDLVVDTNEIKHQRPARKRRWGVS*
back to top