CmoCh04G011100 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G011100
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: RNA-binding family protein isoform 2
Location: Cmo_Chr04 : 5655708 .. 5660406 (+)
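
The location string above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for annotation records of this kind, but not stated on this page); the helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re

# Pattern for strings such as "Cmo_Chr04 : 5655708 .. 5660406 (+)".
LOCATION = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Return (seqid, start, end, strand) parsed from a location string."""
    match = LOCATION.search(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cmo_Chr04 : 5655708 .. 5660406 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span covers 4699 positions on the plus strand of Cmo_Chr04.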
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAACGGACTGTCTTGATACGCTTTTGAATACAAAATTCTTCACTCCTTGCAATCTTCATCCTAGTCTCCAGACATATAAATTCTGCATTGATTGTAGTGTTAGCTTCTGCACGAATTGTACAGTCCGTAATCTTCATGGGCAGGTTCGTATCTGGAGATATTCCTATCATGATGTTGTGCGCGTTCAGGTAATGAAGAATCACTTTTGCTGTGCAGAGATTCAGGTATAATACATTCTTTCCCTTGTGCCTCCCGTGTCTGGGCCTTGCGACGTGTGCGAGTGCCGAGCAACTTTCCGACTATAATTGTCTTTGATAGAAAAGATGCCTGAATTCCATCTGTTTGTTGCTGATCTTTTTGCTTATGTTGTTGAAATCATCGACTTTAGAGATAGGTTTTCCTTTTTTCATTCAAACAGATAGTTTTTAAATTTGTTAGCCAAATTATCTGGTTTTTAAGCACTAGTAATTCTTGAATCGAAAATGATATGTTCTTGAGGAAATCTCACCAGCCTGATTTTGATCCTTGCCAGTTTGAATGGATGAATATACAATTTGATACTTTGATTCCTGTCCGATACCAGTTTTAGTAATAGGAATAAAGTTTGTTTGCTATCGAGTTGTTTTTTGGTTTGTTTTGTAAAAGTTTGCCTTTGTTCTATGAATCCATCTCTATGCCCCCTAACTTGAAGCCCCCAATTGCTTGATTCTGCAGTCATATCGAGTCAATGGTGAAATAGCTGTTCATCTAAACTCCCGTGGTCAACCCGTCGACACCAAATCACCAAAGCGGAAGTTTGGTGATTTTTGTGAAGATTGTGGCAGAAGTGTAAATGATCCTCATCCCTTCTGTTCCATCGCTTGCAAGGTTAGAAACATATGAACCTTATCCACATCAGTTGGACCTTACAAGAGCATATAAAATTCTTCCTAGCAGGCGGGTTTTAAAAACTGTGAGACCGACGCCTGTACATAACGAGTCAAATCTGACAATATCTGCTAATAGTAAGCTTAGACTGTTACAAATAATATCAAAATCAGTCCCCGAGTAGTGTGCCGCTGGGCTCACAAGCGGGTGGATTGTAAGATCCCTCGTCGATTGAAGAAAGGAATGAAACATTCTTTATAAGAGTGTAGAAACCTCTCTCTAAGAGACGCGTTTTAAAAATCGTGAGGTTGACGGCGCGGCCCGACAATATTCTAAACAATTTTATTACTGCATCACACACAGATTGTTAACAATTCAAATGCTGTAAAAACATCAGGTTTCTGTGAACTCAAAGTTCAAGGAACAGATCATTGGAACCATCATATATCCGAGCCCGGAATCCATTTACTTATCATTCAAGGAAAAAAGCAGCCCAGAAACAGAAGCATGGGAATTGGAATCAACCATATCAGTTGCAGAGTCCACAGAAGAGACCCAAGCTACCTCTTCATCTTCAAGGCCAAGAAAACGCAGAAGGAAAGGCATCCCTCGCAGGTGTCCATTTTTTTGACAAGAAGATAAGATGCAACATTTTCATCTCCTTTTTAACTTCCTTGTGGCTCTCAAATGGTATGCTTCTTAATTTTTATTTACCTTGACTCGTATATTTCATAGAATGTCTAGTAGCGAAATGAAATCATCAATATAAAAGCCAGAATTGTAGATCTCTTTGCTGCTGCATGGTTTTTTAAATTTCTTTTTCAAGTTTTGACAAGATTAGTTGGTTGACAAAGCTGATATTAACATATATTCTTTTACACTAATTCTTTGCCTCGTCCATGGCGAGTCTTGGCCGCTACCGACGATGGCGACGTGGGAAGGATGGAGGTGTTTATGCACGCAGCGATGATGGGAGGAGGTAATGGAGGTGTTTATGCACGCAGCGATGATGGGAGGAGGTAACCTCTAAATTAAAATTAAGTATTTTAATCAATTTAACAATGTTCAAATTAATTGGTAGTGCTATGTTAAACCAATTTAAATAAAAACAACAAAAATAGAGAAATAT
TTTTATTTTATATATTAAATAATTATCACCTTTACTTCGGTTGAAATTTGGATAATTTTCTATTTTATCAATGAGTGGCTCAGTGATTTGGTACAGTCAAATTTGGTCCCGCCTCCGCTAGAGCCCAAAAAAAAAGGCTCTTCCCTACAAGAATTTCGTCTCTTCTCTTCTTCTCTTCTTCTCCTCTTCTCCCCGCCGCTTGATCTCATCAACCTATCATCTTCCGCTATCTCTAATTCCCTTCTCAACCATTCTTTTACCTTTCAGAAATGTCGGTGAGTTCTTCTTCATCATCATCTTGTTCTTCTGTTATCTGTGAATCTTTGATGCAGATCTGCCGCATTTATGTTTAGTAATTGTTTTTTTGTTTCTGATGGGTAATGGCGATTTGGGGAATTGTAGTTGTAATGTGGAACTAGGGTTTTGTTTTTCTTTCTAATTTTGATTTTGATTTTGATTTTCAAGGTCTCTGGCTTCTACAATGACCTCTGATTCTGATTGGTTGCCGACAGAATTCCACTGATTTTTCTCCCCCAAATTTTGAATTTATCAACAAAACAGTCTCTTCGTCCCTTCCTTCCTCTCAAATCAGCTGCTTCAATCCCAACTTTTGTTTTCAGTTTCTTCTCAAAAAGGTTGAGTGAAGTGAAAATTAGCATGGTGAATTCACTGCTAAATCCTTTTTGTTTTTGGTTCTTAATTTTCTATTTCTGAAACCAGCTACCTGAATCTCACCCTAGTTCTTTCTTTTCACTTTGATGATGAACTCAACGGAGTCAACACTAAGTACTGTTCACTACTAAATCCAAAAACTTGATAAGTATGATGAAGCTCAACAATTATTAGAATGTTAATTCTCTAACTAGTGCTGCCTTTTAAAAAAATTTAAATACTAGTTTTTCGTACAAAAATTGTAGGGGTGTGGAGGGTAGAAGTATCAACTTTTGAGAAGATAAATGATAGTCTTATCTACTAAGTTTGTAATAGTTCGCCTACAGCTAGCCGACATTGTTCTTTTTGTGTTTTCCCTTTCGAGCTTTTCCTCGAGATTTTTAAAATACCTCTACTAGGAGAGGTTTCCATGCCCTTATAAAAAATGCTTCGTTCCCCTCTGTAATCGACATGAGATCCCACAATCCATCCCTCTAGGGCCCAGCGTTCTCGTTGGCATTGTGAGATTTGATATATACACATATATATATGTGTATTCTGTCATTCATAAGTTTAGGTATTAATTTGAAGAAGGAATGATATTATAAATGTACATTCGAACACTTGCATTTCTTAATTATTTTTAATGGGTTTTTTTCTTTTTTTCTTTTAAAAAAAATCTTATGTAGATGAGAACTGTCAAGGTCAGCAATATCTCAAAAGCTACTTCTGAGAGAGATATCAAAGAATTCCTTTCATTCTCTGGTCAAATCCTTTATGTTGAAATGCAAAGGTTAGTGATTATATTCTATCTGTTTGTTTTGTTTGGAGAATTATTATTATTATTATTATTGTTTTTATTTTGTGTAGAGAAACTGAAAATACTCAGGTTGCTTATGTTACATATGAGGATTCTCAAGGAGCAGATACTGCAATACTTTTAACTGTATGTTCCTTTTTAGGCACATTGGTCTCTTTATTTCTATAATTTATGTTAGTTTTTGTGAACCTTTCTCTCTTAATCTCTTCTCTTAGGGTGCAAAAATTGGTGATCTATTTGTATCAATTACTCTGGTTGAGAATTACCATCTTCCACCTGAAGCAATGTCATCAATCCTGGTAATAAACGTCTCGTGTATATGCATTCGTTGTCGACCCTTGAATTCCTGCACATAAACGAGTTACATTATTCGATTTGAAATCTTGATTATCAAGTAGCCGTCTTTGTCTTCAAGGAGGTATTGCAAACTATTTACTTAAAATCTTAATGACCTTTTGAAGGATAAAAGGAAAACTGCTACCAGGATTGCTCCGAGTCAGGCTGAAGACGTTGTAAGTACAATGCTCGC
AAAAGGCTTCATCTTGGGGAAGGATGCTCTCAACAAGGCAAAAGCGTTCGACGAACGACTCCAATTGACATCAAACGCTTCTGCAACTGTCGCTTCCATTGACAGGAAGATGGGTATAACTGAGAAAATAACTGCAGGCACAGCAGTGGTGAATGAAAAGGTCAGGGAGATGGATGAAATGTTCCAGGTTACGGAGAAGACGAAATCCGCTCTAGCAGTTGCCGAGCAGAAGGCAAGCAACGCTGGGACTGCGCTCATGAGCAACTATTACGTGTTAACTGGAGCAGCTTGGTTTTCAAATGCTGTAACTGCAGTTACAAAAGCTGCCGAGGAAGTGACCCAGATGACAAAGGCAAAAGTCGAGAAAGCCGAGGAGGAGAAAGAGGAGAGTATATACCACGAAAGAACCGGGATCATCAATAACTTCGCCGAGTTCCATCTCGACGAGCCTTTACCGGGGGAGCCTGCTATTGTTCCAGTTAATTCAGCTGATAGATAGACAGTAAGTTGTGAATCATACGCATGAAAACCTACACTTGTGAAATACATTGCTTTCCCCAATCAACTGTATGATATATACATTTTCATCTAGAGGTGCTCCAATTACAGGGTTTACTTCTTTATACACCTGCTCCTGTTGAAATGTCATGACGTTCTTTTGAGTGAAAAATTCACTTTTTGAATACTTGATTATACAAT

mRNA sequence

ATGGAAACGGACTGTCTTGATACGCTTTTGAATACAAAATTCTTCACTCCTTGCAATCTTCATCCTAGTCTCCAGACATATAAATTCTGCATTGATTGTAGTGTTAGCTTCTGCACGAATTGTACAGTCCGTAATCTTCATGGGCAGGTTCGTATCTGGAGATATTCCTATCATGATGTTGTGCGCGTTCAGTCATATCGAGTCAATGGTGAAATAGCTGTTCATCTAAACTCCCGTGGTCAACCCGTCGACACCAAATCACCAAAGCGGAAGTTTGGTGATTTTTGTGAAGATTGTGGCAGAAGTGTTTCTGTGAACTCAAAGTTCAAGGAACAGATCATTGGAACCATCATATATCCGAGCCCGGAATCCATTTACTTATCATTCAAGGAAAAAAGCAGCCCAGAAACAGAAGCATGGGAATTGGAATCAACCATATCAGTTGCAGAGTCCACAGAAGAGACCCAAGCTACCTCTTCATCTTCAAGGCCAAGAAAACGCAGAAGGAAAGGCATCCCTCGCAGTCTTGGCCGCTACCGACGATGGCGACGTGGGAAGGATGGAGGTGTTTATGCACGCAGCGATGATGGGAGGAGCAATATCTCAAAAGCTACTTCTGAGAGAGATATCAAAGAATTCCTTTCATTCTCTGGTCAAATCCTTTATGTTGAAATGCAAAGAGAAACTGAAAATACTCAGGTTGCTTATGTTACATATGAGGATTCTCAAGGAGCAGATACTGCAATACTTTTAACTGGTGCAAAAATTGGTGATCTATTTGTATCAATTACTCTGGTTGAGAATTACCATCTTCCACCTGAAGCAATGTCATCAATCCTGGATAAAAGGAAAACTGCTACCAGGATTGCTCCGAGTCAGGCTGAAGACGTTGTAAGTACAATGCTCGCAAAAGGCTTCATCTTGGGGAAGGATGCTCTCAACAAGGCAAAAGCGTTCGACGAACGACTCCAATTGACATCAAACGCTTCTGCAACTGTCGCTTCCATTGACAGGAAGATGGGTATAACTGAGAAAATAACTGCAGGCACAGCAGTGGTGAATGAAAAGGTCAGGGAGATGGATGAAATGTTCCAGGTTACGGAGAAGACGAAATCCGCTCTAGCAGTTGCCGAGCAGAAGGCAAGCAACGCTGGGACTGCGCTCATGAGCAACTATTACGTGTTAACTGGAGCAGCTTGGTTTTCAAATGCTGTAACTGCAGTTACAAAAGCTGCCGAGGAAGTGACCCAGATGACAAAGGCAAAAGTCGAGAAAGCCGAGGAGGAGAAAGAGGAGAGTATATACCACGAAAGAACCGGGATCATCAATAACTTCGCCGAGTTCCATCTCGACGAGCCTTTACCGGGGGAGCCTGCTATTGTTCCAGTTAATTCAGCTGATAGATAGACAGTAAGTTGTGAATCATACGCATGAAAACCTACACTTGTGAAATACATTGCTTTCCCCAATCAACTGTATGATATATACATTTTCATCTAGAGGTGCTCCAATTACAGGGTTTACTTCTTTATACACCTGCTCCTGTTGAAATGTCATGACGTTCTTTTGAGTGAAAAATTCACTTTTTGAATACTTGATTATACAAT

Coding sequence (CDS)

ATGGAAACGGACTGTCTTGATACGCTTTTGAATACAAAATTCTTCACTCCTTGCAATCTTCATCCTAGTCTCCAGACATATAAATTCTGCATTGATTGTAGTGTTAGCTTCTGCACGAATTGTACAGTCCGTAATCTTCATGGGCAGGTTCGTATCTGGAGATATTCCTATCATGATGTTGTGCGCGTTCAGTCATATCGAGTCAATGGTGAAATAGCTGTTCATCTAAACTCCCGTGGTCAACCCGTCGACACCAAATCACCAAAGCGGAAGTTTGGTGATTTTTGTGAAGATTGTGGCAGAAGTGTTTCTGTGAACTCAAAGTTCAAGGAACAGATCATTGGAACCATCATATATCCGAGCCCGGAATCCATTTACTTATCATTCAAGGAAAAAAGCAGCCCAGAAACAGAAGCATGGGAATTGGAATCAACCATATCAGTTGCAGAGTCCACAGAAGAGACCCAAGCTACCTCTTCATCTTCAAGGCCAAGAAAACGCAGAAGGAAAGGCATCCCTCGCAGTCTTGGCCGCTACCGACGATGGCGACGTGGGAAGGATGGAGGTGTTTATGCACGCAGCGATGATGGGAGGAGCAATATCTCAAAAGCTACTTCTGAGAGAGATATCAAAGAATTCCTTTCATTCTCTGGTCAAATCCTTTATGTTGAAATGCAAAGAGAAACTGAAAATACTCAGGTTGCTTATGTTACATATGAGGATTCTCAAGGAGCAGATACTGCAATACTTTTAACTGGTGCAAAAATTGGTGATCTATTTGTATCAATTACTCTGGTTGAGAATTACCATCTTCCACCTGAAGCAATGTCATCAATCCTGGATAAAAGGAAAACTGCTACCAGGATTGCTCCGAGTCAGGCTGAAGACGTTGTAAGTACAATGCTCGCAAAAGGCTTCATCTTGGGGAAGGATGCTCTCAACAAGGCAAAAGCGTTCGACGAACGACTCCAATTGACATCAAACGCTTCTGCAACTGTCGCTTCCATTGACAGGAAGATGGGTATAACTGAGAAAATAACTGCAGGCACAGCAGTGGTGAATGAAAAGGTCAGGGAGATGGATGAAATGTTCCAGGTTACGGAGAAGACGAAATCCGCTCTAGCAGTTGCCGAGCAGAAGGCAAGCAACGCTGGGACTGCGCTCATGAGCAACTATTACGTGTTAACTGGAGCAGCTTGGTTTTCAAATGCTGTAACTGCAGTTACAAAAGCTGCCGAGGAAGTGACCCAGATGACAAAGGCAAAAGTCGAGAAAGCCGAGGAGGAGAAAGAGGAGAGTATATACCACGAAAGAACCGGGATCATCAATAACTTCGCCGAGTTCCATCTCGACGAGCCTTTACCGGGGGAGCCTGCTATTGTTCCAGTTAATTCAGCTGATAGATAG
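
A coding sequence like the one above can be sanity-checked before further use. The sketch below is illustrative only: the function name `cds_problems` is hypothetical, the standard stop codons TAA, TAG and TGA are assumed, and the short sequences in the usage lines are toy inputs, not the full CDS of this gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumes the standard genetic code


def cds_problems(seq):
    """Return a list of structural problems found in a candidate CDS."""
    seq = seq.strip().upper()
    problems = []
    if len(seq) % 3 != 0:
        problems.append("length is not a multiple of 3")
    if not seq.startswith("ATG"):
        problems.append("does not start with ATG")
    # Split into complete codons, ignoring any trailing partial codon.
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    if not codons or codons[-1] not in STOP_CODONS:
        problems.append("does not end with a stop codon")
    if any(codon in STOP_CODONS for codon in codons[:-1]):
        problems.append("internal stop codon")
    return problems


# Toy inputs: the first four codons of the CDS above followed by a stop codon,
# and a deliberately broken sequence with an in-frame stop.
print(cds_problems("ATGGAAACGGACTAG"))
print(cds_problems("ATGTAGGAATAG"))
```

An empty list means none of the four checks failed; it does not by itself confirm that the annotated gene model is correct.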
BLAST of CmoCh04G011100 vs. Swiss-Prot
Match: BPA1_ARATH (Binding partner of ACD11 1 OS=Arabidopsis thaliana GN=BPA1 PE=1 SV=1)

HSP 1 Score: 208.8 bits (530), Expect = 1.3e-52
Identity = 118/225 (52.44%), Positives = 161/225 (71.56%), Query Frame = 1

Query: 200 NISKATSERDIKEFLSFSGQILYVEMQRETENTQVAYVTYEDSQGADTAILLTGAKIGDL 259
           N+S   +E DIKEF SFSG++  +++Q    N   AYVT++++QGA+TA+LL+GA I D 
Sbjct: 12  NLSSGATEHDIKEFFSFSGEVESIDIQ---SNEHSAYVTFKETQGAETAVLLSGASIADQ 71

Query: 260 FVSITLVENYHLPPEAMSSILDKRKTATRIAPSQAEDVVSTMLAKGFILGKDALNKAKAF 319
            V I L  NY  PP A  +  + + +       +AEDVVS+MLAKGFILGKDA+ KAKAF
Sbjct: 72  SVIIELAPNYS-PPAAPHA--ETQSSGAESVVQKAEDVVSSMLAKGFILGKDAVGKAKAF 131

Query: 320 DERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSALAVAEQ 379
           DE+   TS A+A VAS+D+K+G+++K+TAGT++VNEK++ +D+ FQVTE+TKS  A AEQ
Sbjct: 132 DEKHGFTSTATAGVASLDQKIGLSQKLTAGTSLVNEKIKAVDQNFQVTERTKSVYAAAEQ 191

Query: 380 KASNAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKAKVE 425
             S+AG+A+M N YVLTG +W + A   V +AA EV Q TK KVE
Sbjct: 192 TVSSAGSAVMKNRYVLTGVSWAAGAFNRVAQAAGEVGQKTKEKVE 230
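
The percentages on the HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from the text of such a line; the function name `parse_hsp_stats` is hypothetical, and the pattern accepts both the spelling "Positives" and the "Postives" variant that appears in some exports.

```python
import re

# Matches e.g. "Identity = 118/225 (52.44%), Positives = 161/225 (71.56%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Recompute identity and positives percentages from an HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": 100.0 * int(ident) / int(ident_len),
        "positives": 100.0 * int(pos) / int(pos_len),
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 118/225 (52.44%), Positives = 161/225 (71.56%), Query Frame = 1"
)
print(f"{stats['identity']:.2f} {stats['positives']:.2f}")
```

For the Swiss-Prot hit above, 118/225 and 161/225 reproduce the reported 52.44% and 71.56%.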

BLAST of CmoCh04G011100 vs. TrEMBL
Match: A0A0A0KIR7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G023900 PE=4 SV=1)

HSP 1 Score: 468.0 bits (1203), Expect = 1.3e-128
Identity = 251/270 (92.96%), Positives = 261/270 (96.67%), Query Frame = 1

Query: 199 SNISKATSERDIKEFLSFSGQILYVEMQRETENTQVAYVTYEDSQGADTAILLTGAKIGD 258
           SNISK TSERDIKEF SFSG+ILYVEMQRE+ENTQVAYVTY+DSQGADTAILLTGAKIGD
Sbjct: 9   SNISKLTSERDIKEFFSFSGEILYVEMQRESENTQVAYVTYKDSQGADTAILLTGAKIGD 68

Query: 259 LFVSITLVENYHLPPEAMSSILDKRKTATRIAPSQAEDVVSTMLAKGFILGKDALNKAKA 318
           L V+ITLVENYHLPPEAMSSILDKR+T T IAP+QAEDVVSTMLAKGFILGKDALNKAKA
Sbjct: 69  LSVTITLVENYHLPPEAMSSILDKRQTVTGIAPNQAEDVVSTMLAKGFILGKDALNKAKA 128

Query: 319 FDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSALAVAE 378
           FDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQV EKTKSALAVAE
Sbjct: 129 FDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVREKTKSALAVAE 188

Query: 379 QKASNAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKAKVEKAEEEKEESIYHER 438
           QKA++AGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTK KVEKAEEEK+ESIY ER
Sbjct: 189 QKATSAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKVKVEKAEEEKKESIYRER 248

Query: 439 TGIINNFAEFHLDEPLPGEPAIVPVNSADR 469
           TGII+NFAE HLDEPLPGEPAIVPVNSADR
Sbjct: 249 TGIISNFAELHLDEPLPGEPAIVPVNSADR 278

BLAST of CmoCh04G011100 vs. TrEMBL
Match: A0A061EAC5_THECC (RNA-binding family protein isoform 2 OS=Theobroma cacao GN=TCM_011268 PE=4 SV=1)

HSP 1 Score: 350.5 bits (898), Expect = 3.1e-93
Identity = 189/269 (70.26%), Positives = 223/269 (82.90%), Query Frame = 1

Query: 199 SNISKATSERDIKEFLSFSGQILYVEMQRETENTQVAYVTYEDSQGADTAILLTGAKIGD 258
           SNIS A S+RDIKEF SFSG I YVEM+RETEN QVAYVT++DSQGADTA+LLTGA I D
Sbjct: 8   SNISLAASQRDIKEFFSFSGDIQYVEMRRETENAQVAYVTFKDSQGADTAMLLTGATIVD 67

Query: 259 LFVSITLVENYHLPPEAMSSILDKRKTATRIAPSQAEDVVSTMLAKGFILGKDALNKAKA 318
           L V+IT VE+Y LPPEA+ S ++ +   T     +AEDV+STMLAKGF+LGKDA+NKAKA
Sbjct: 68  LSVNITPVEDYQLPPEALVSNMENKPAVTDSTVKKAEDVMSTMLAKGFVLGKDAINKAKA 127

Query: 319 FDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSALAVAE 378
           FDER   TSNASA V SID+KMG++EK++ GTAVVNEK+REM+E+FQV+EKTKSA AVAE
Sbjct: 128 FDERHHFTSNASAAVTSIDQKMGLSEKLSIGTAVVNEKMREMNEIFQVSEKTKSAFAVAE 187

Query: 379 QKASNAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKAKVEKAEEEKEESIYHER 438
           QKAS+AGTA+MSN YV TGA W SNA +AV KAAE+V  +TK KVEKAEEEK+E IY ER
Sbjct: 188 QKASSAGTAIMSNRYVSTGALWLSNAFSAVAKAAEDVGMLTKEKVEKAEEEKKEIIYRER 247

Query: 439 TGIINNFAEFHLDEPLPGEPAIVPVNSAD 468
           TGII++FA+FHLDE    EP IVPV+S D
Sbjct: 248 TGIISDFAQFHLDESSAAEPTIVPVDSTD 276

BLAST of CmoCh04G011100 vs. TrEMBL
Match: A0A061E8Q5_THECC (RNA-binding family protein isoform 1 OS=Theobroma cacao GN=TCM_011268 PE=4 SV=1)

HSP 1 Score: 350.5 bits (898), Expect = 3.1e-93
Identity = 189/269 (70.26%), Positives = 223/269 (82.90%), Query Frame = 1

Query: 199 SNISKATSERDIKEFLSFSGQILYVEMQRETENTQVAYVTYEDSQGADTAILLTGAKIGD 258
           SNIS A S+RDIKEF SFSG I YVEM+RETEN QVAYVT++DSQGADTA+LLTGA I D
Sbjct: 37  SNISLAASQRDIKEFFSFSGDIQYVEMRRETENAQVAYVTFKDSQGADTAMLLTGATIVD 96

Query: 259 LFVSITLVENYHLPPEAMSSILDKRKTATRIAPSQAEDVVSTMLAKGFILGKDALNKAKA 318
           L V+IT VE+Y LPPEA+ S ++ +   T     +AEDV+STMLAKGF+LGKDA+NKAKA
Sbjct: 97  LSVNITPVEDYQLPPEALVSNMENKPAVTDSTVKKAEDVMSTMLAKGFVLGKDAINKAKA 156

Query: 319 FDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSALAVAE 378
           FDER   TSNASA V SID+KMG++EK++ GTAVVNEK+REM+E+FQV+EKTKSA AVAE
Sbjct: 157 FDERHHFTSNASAAVTSIDQKMGLSEKLSIGTAVVNEKMREMNEIFQVSEKTKSAFAVAE 216

Query: 379 QKASNAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKAKVEKAEEEKEESIYHER 438
           QKAS+AGTA+MSN YV TGA W SNA +AV KAAE+V  +TK KVEKAEEEK+E IY ER
Sbjct: 217 QKASSAGTAIMSNRYVSTGALWLSNAFSAVAKAAEDVGMLTKEKVEKAEEEKKEIIYRER 276

Query: 439 TGIINNFAEFHLDEPLPGEPAIVPVNSAD 468
           TGII++FA+FHLDE    EP IVPV+S D
Sbjct: 277 TGIISDFAQFHLDESSAAEPTIVPVDSTD 305

BLAST of CmoCh04G011100 vs. TrEMBL
Match: W9RFG7_9ROSA (Protein vip1 OS=Morus notabilis GN=L484_006763 PE=4 SV=1)

HSP 1 Score: 343.6 bits (880), Expect = 3.8e-91
Identity = 191/269 (71.00%), Positives = 228/269 (84.76%), Query Frame = 1

Query: 199 SNISKATSERDIKEFLSFSGQILYVEMQRETENTQVAYVTYEDSQGADTAILLTGAKIGD 258
           +NIS A SERDIKEF SFSG I YVEM+RE+E+TQ+AYVT++DSQGADTAILL+GA IG+
Sbjct: 37  NNISLAVSERDIKEFFSFSGDIQYVEMRRESESTQLAYVTFKDSQGADTAILLSGAVIGN 96

Query: 259 LFVSITLVENYHLPPEAMSSILDKRKTATRIAPSQAEDVVSTMLAKGFILGKDALNKAKA 318
           L VSIT V++Y LPPEA+ S L+K  T T  A  +AEDVVS+MLAKGFILGKDA+NKAK+
Sbjct: 97  LSVSITPVDDYLLPPEALPSTLEKNPT-TSPAVKKAEDVVSSMLAKGFILGKDAINKAKS 156

Query: 319 FDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSALAVAE 378
           FDER  L SNASATVASID K+G++EK++ G AVVNEKVREMDE +QV+E TKSALAVAE
Sbjct: 157 FDERHHLMSNASATVASIDNKIGLSEKLSIGKAVVNEKVREMDERYQVSEMTKSALAVAE 216

Query: 379 QKASNAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKAKVEKAEEEKEESIYHER 438
           QKAS+AG+ALMSN+YVLTGA+W S+A +A  KAAE+V+ MTK K+EKAEEEKEE IY ER
Sbjct: 217 QKASSAGSALMSNHYVLTGASWVSSAFSAFAKAAEDVSIMTKEKLEKAEEEKEEIIYRER 276

Query: 439 TGIINNFAEFHLDEPLPGEPAIVPVNSAD 468
           TGII++FA+ HLDE     P IVPV+SAD
Sbjct: 277 TGIISDFAQIHLDESSSKGPPIVPVSSAD 304

BLAST of CmoCh04G011100 vs. TrEMBL
Match: A0A061E9N0_THECC (RNA-binding family protein isoform 3 (Fragment) OS=Theobroma cacao GN=TCM_011268 PE=4 SV=1)

HSP 1 Score: 341.3 bits (874), Expect = 1.9e-90
Identity = 188/269 (69.89%), Positives = 219/269 (81.41%), Query Frame = 1

Query: 199 SNISKATSERDIKEFLSFSGQILYVEMQRETENTQVAYVTYEDSQGADTAILLTGAKIGD 258
           SNIS A S+RDIKEF SFSG I YVEM+RETEN QVAYVT++DSQGADTA+LLTGA I D
Sbjct: 27  SNISLAASQRDIKEFFSFSGDIQYVEMRRETENAQVAYVTFKDSQGADTAMLLTGATIVD 86

Query: 259 LFVSITLVENYHLPPEAMSSILDKRKTATRIAPSQAEDVVSTMLAKGFILGKDALNKAKA 318
           L V+IT VE+Y LPPEA+ S             + AEDV+STMLAKGF+LGKDA+NKAKA
Sbjct: 87  LSVNITPVEDYQLPPEALVS-------------NMAEDVMSTMLAKGFVLGKDAINKAKA 146

Query: 319 FDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSALAVAE 378
           FDER   TSNASA V SID+KMG++EK++ GTAVVNEK+REM+E+FQV+EKTKSA AVAE
Sbjct: 147 FDERHHFTSNASAAVTSIDQKMGLSEKLSIGTAVVNEKMREMNEIFQVSEKTKSAFAVAE 206

Query: 379 QKASNAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKAKVEKAEEEKEESIYHER 438
           QKAS+AGTA+MSN YV TGA W SNA +AV KAAE+V  +TK KVEKAEEEK+E IY ER
Sbjct: 207 QKASSAGTAIMSNRYVSTGALWLSNAFSAVAKAAEDVGMLTKEKVEKAEEEKKEIIYRER 266

Query: 439 TGIINNFAEFHLDEPLPGEPAIVPVNSAD 468
           TGII++FA+FHLDE    EP IVPV+S D
Sbjct: 267 TGIISDFAQFHLDESSAAEPTIVPVDSTD 282

BLAST of CmoCh04G011100 vs. TAIR10
Match: AT4G17720.1 (AT4G17720.1 RNA-binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 250.0 bits (637), Expect = 2.9e-66
Identity = 142/258 (55.04%), Positives = 183/258 (70.93%), Query Frame = 1

Query: 199 SNISKATSERDIKEFLSFSGQILYVEMQRETENTQVAYVTYEDSQGADTAILLTGAKIGD 258
           SN+S   ++RD+KEF SFSG ILY+E Q ETE T++AYVT++D QGA+TA+LL+GA I D
Sbjct: 9   SNVSLGATDRDLKEFFSFSGDILYLETQSETERTKLAYVTFKDLQGAETAVLLSGATIVD 68

Query: 259 LFVSITLVENYHLPPEAMSSILDKRKTATRIAPS----QAEDVVSTMLAKGFILGKDALN 318
             V +++  +Y L PEA++S+  K    +  A      +AEDVVS+MLAKGFILGKDA+ 
Sbjct: 69  SSVIVSMAPDYQLSPEALASLEPKDSNKSPKAGDSVLRKAEDVVSSMLAKGFILGKDAIA 128

Query: 319 KAKAFDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSAL 378
           KAK+ DE+ QLTS ASA VAS D+K+G T+KI  GT VV EKVRE+D+ +QV+EKTKSA+
Sbjct: 129 KAKSVDEKHQLTSTASAKVASFDKKIGFTDKINTGTVVVGEKVREVDQKYQVSEKTKSAI 188

Query: 379 AVAEQKASNAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKAKVEKAEEEKEESI 438
           A AEQ  SNAG+A+M N YVLTGA W + A   V KAAEEV Q  K KV  AEEE     
Sbjct: 189 AAAEQTVSNAGSAIMKNRYVLTGATWVTGAFNKVAKAAEEVGQKAKEKVGMAEEE----- 248

Query: 439 YHERTGIINNFAEFHLDE 453
             ++  +++ FA  HL E
Sbjct: 249 --DKRKVVDEFARVHLSE 259

BLAST of CmoCh04G011100 vs. TAIR10
Match: AT5G46870.1 (AT5G46870.1 RNA-binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 246.5 bits (628), Expect = 3.2e-65
Identity = 138/262 (52.67%), Positives = 187/262 (71.37%), Query Frame = 1

Query: 199 SNISKATSERDIKEFLSFSGQILYVEMQRETENTQVAYVTYEDSQGADTAILLTGAKIGD 258
           SN+S   +ERD+KEF SFSG I Y+E Q E + +++AYVT++D QGA+TA+LLTG+ I D
Sbjct: 9   SNVSLEATERDLKEFFSFSGDIAYLETQSENDGSKLAYVTFKDLQGAETAVLLTGSTIVD 68

Query: 259 LFVSITLVENYHLPPEAMSSILDKRKTATRIAPSQ--------AEDVVSTMLAKGFILGK 318
             V++T+  +Y LPP+A++SI   +++    +P++        AEDVVS M++KGF+LGK
Sbjct: 69  SSVTVTMSPDYQLPPDALASIESLKESNKSSSPTREDVSVFRKAEDVVSGMISKGFVLGK 128

Query: 319 DALNKAKAFDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKT 378
           DA+ KAK+ DE+ QLTS ASA V S D+++G TEKI  GT VV+EKV+E+D+ FQVTEKT
Sbjct: 129 DAIAKAKSLDEKHQLTSTASARVTSFDKRIGFTEKINTGTTVVSEKVKEVDQKFQVTEKT 188

Query: 379 KSALAVAEQKASNAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKAKVEKAEEEK 438
           KSA+A AEQ  SNAG+A+M N YVLTGA W + A   V+KAAEEV Q  K KV  AEEE+
Sbjct: 189 KSAIAAAEQTVSNAGSAIMKNRYVLTGATWVTGAFNRVSKAAEEVGQKAKEKVGLAEEEE 248

Query: 439 EESIYHERTGIINNFAEFHLDE 453
           E     E+  +++  A  HL E
Sbjct: 249 E-----EKKKVVDEVAIVHLTE 265

BLAST of CmoCh04G011100 vs. TAIR10
Match: AT1G67950.3 (AT1G67950.3 RNA-binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 240.0 bits (611), Expect = 3.0e-63
Identity = 144/266 (54.14%), Positives = 187/266 (70.30%), Query Frame = 1

Query: 199 SNISKATSERDIKEFLSFSGQILYVEMQRETENTQVAYVTYEDSQGADTAILLTGAKIGD 258
           SN+S   S++D+KEF SFSG I YVEM+ ET+ +QVAYVT++DSQGA+TA+LLTGA I D
Sbjct: 34  SNVSLIVSKKDVKEFFSFSGDIQYVEMRSETQESQVAYVTFKDSQGAETAMLLTGAVIAD 93

Query: 259 LFVSITLVENYHLPPEAMSSILDKRKTATR-IAPSQAEDVVSTMLAKGFILGKDALNKAK 318
           L VSIT   NY LPPEA++  LD ++ +    +  +AEDVV+ M+ +G+ LGKDA+ KAK
Sbjct: 94  LRVSITPAVNYQLPPEALA--LDSQEHSFNGFSVKKAEDVVNIMVGRGYALGKDAMEKAK 153

Query: 319 AFDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSALAVA 378
           AFD+R  L SNASAT+AS+D KMG++EK++ GT VVNEK+R++DE +QV E TKSALA A
Sbjct: 154 AFDDRHNLISNASATIASLDDKMGLSEKLSIGTTVVNEKLRDIDERYQVREITKSALAAA 213

Query: 379 EQKASNAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKAKVEKAEEEKEESIYHE 438
           E+ A +A TALM+N YV +GA+WFSNA  AVTKA +E       KVE   E ++E I   
Sbjct: 214 EETAISARTALMANPYVSSGASWFSNAFGAVTKAVKE-------KVENGGEGRKEII--- 273

Query: 439 RTGIINNFAEFHLDEPLPGEPAIVPV 464
                       LD   P  PA+VPV
Sbjct: 274 -----------TLDPSSPKVPAVVPV 276

BLAST of CmoCh04G011100 vs. TAIR10
Match: AT5G16840.2 (AT5G16840.2 binding partner of acd11 1)

HSP 1 Score: 210.3 bits (534), Expect = 2.6e-54
Identity = 118/225 (52.44%), Positives = 163/225 (72.44%), Query Frame = 1

Query: 200 NISKATSERDIKEFLSFSGQILYVEMQRETENTQVAYVTYEDSQGADTAILLTGAKIGDL 259
           N+S   +E DIKEF SFSG++  +++Q   E++  AYVT++++QGA+TA+LL+GA I D 
Sbjct: 12  NLSSGATEHDIKEFFSFSGEVESIDIQSSNEHS--AYVTFKETQGAETAVLLSGASIADQ 71

Query: 260 FVSITLVENYHLPPEAMSSILDKRKTATRIAPSQAEDVVSTMLAKGFILGKDALNKAKAF 319
            V I L  NY  PP A  +  + + +       +AEDVVS+MLAKGFILGKDA+ KAKAF
Sbjct: 72  SVIIELAPNYS-PPAAPHA--ETQSSGAESVVQKAEDVVSSMLAKGFILGKDAVGKAKAF 131

Query: 320 DERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSALAVAEQ 379
           DE+   TS A+A VAS+D+K+G+++K+TAGT++VNEK++ +D+ FQVTE+TKS  A AEQ
Sbjct: 132 DEKHGFTSTATAGVASLDQKIGLSQKLTAGTSLVNEKIKAVDQNFQVTERTKSVYAAAEQ 191

Query: 380 KASNAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKAKVE 425
             S+AG+A+M N YVLTG +W + A   V +AA EV Q TK KVE
Sbjct: 192 TVSSAGSAVMKNRYVLTGVSWAAGAFNRVAQAAGEVGQKTKEKVE 231

BLAST of CmoCh04G011100 vs. TAIR10
Match: AT5G32450.1 (AT5G32450.1 RNA binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 178.7 bits (452), Expect = 8.2e-45
Identity = 105/245 (42.86%), Positives = 157/245 (64.08%), Query Frame = 1

Query: 199 SNISKATSERDIKEFLSFSGQILYVEMQRETENTQVAYVTYEDSQGADTAILLTGAKIGD 258
           +N+S   +ER+I EF SFSG I ++E+Q+E   +++A+VT+ D +  + A+LL+GA I D
Sbjct: 10  NNVSDLATEREIHEFFSFSGDIEHIEIQKEFGQSRIAFVTFTDPKALEIALLLSGATIVD 69

Query: 259 LFVSITLVENY---------HLPPEAMSSILDKRKTAT--------RIAPSQAEDVVSTM 318
             V+IT  ENY          +   AM   L +  T T        R   S+A+DVV+T+
Sbjct: 70  QIVTITRAENYVQRRETQEVRMLDNAMPLGLQESTTQTKTNMDGNSRAYVSKAQDVVATV 129

Query: 319 LAKGFILGKDALNKAKAFDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMD 378
           LAKG  LG+DA+NKAKAFDE+ QL +NASA V+S D+++G+TEK++ G + VNEKV+ +D
Sbjct: 130 LAKGSALGQDAVNKAKAFDEKHQLRANASAKVSSFDKRVGLTEKLSVGISAVNEKVKSVD 189

Query: 379 EMFQVTEKTKSALAVAEQKASNAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKA 427
           +  QV++KT +A+  AE+K ++ G+A+ S+ YV  GAAWFS A + V +  +     TK 
Sbjct: 190 QKLQVSDKTMAAIFAAERKLNDTGSAVKSSRYVTAGAAWFSGAFSKVARVGQVAGSKTKE 249

BLAST of CmoCh04G011100 vs. NCBI nr
Match: gi|659103004|ref|XP_008452424.1| (PREDICTED: protein vip1-like isoform X2 [Cucumis melo])

HSP 1 Score: 468.8 bits (1205), Expect = 1.1e-128
Identity = 251/270 (92.96%), Positives = 261/270 (96.67%), Query Frame = 1

Query: 199 SNISKATSERDIKEFLSFSGQILYVEMQRETENTQVAYVTYEDSQGADTAILLTGAKIGD 258
           SNISK TSERDIKEF SFSG+ILYVEMQRE+ENTQVAYVTY+DSQGADTAILLTGAKIGD
Sbjct: 7   SNISKLTSERDIKEFFSFSGEILYVEMQRESENTQVAYVTYKDSQGADTAILLTGAKIGD 66

Query: 259 LFVSITLVENYHLPPEAMSSILDKRKTATRIAPSQAEDVVSTMLAKGFILGKDALNKAKA 318
           L V+ITLVENYHLPPEAMSSILDKR+T T IAP+QAEDVVSTMLAKGFILGKDALNKAKA
Sbjct: 67  LSVTITLVENYHLPPEAMSSILDKRQTVTGIAPNQAEDVVSTMLAKGFILGKDALNKAKA 126

Query: 319 FDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSALAVAE 378
           FDERLQ TSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSALAVAE
Sbjct: 127 FDERLQFTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSALAVAE 186

Query: 379 QKASNAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKAKVEKAEEEKEESIYHER 438
           QKA++AGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTK KVEKAEEEK+ESIY ER
Sbjct: 187 QKATSAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKVKVEKAEEEKKESIYRER 246

Query: 439 TGIINNFAEFHLDEPLPGEPAIVPVNSADR 469
           TGII+NFAE HLDEPLPGEPAIVPVNSADR
Sbjct: 247 TGIISNFAELHLDEPLPGEPAIVPVNSADR 276

BLAST of CmoCh04G011100 vs. NCBI nr
Match: gi|659103002|ref|XP_008452423.1| (PREDICTED: protein vip1-like isoform X1 [Cucumis melo])

HSP 1 Score: 468.8 bits (1205), Expect = 1.1e-128
Identity = 251/270 (92.96%), Positives = 261/270 (96.67%), Query Frame = 1

Query: 199 SNISKATSERDIKEFLSFSGQILYVEMQRETENTQVAYVTYEDSQGADTAILLTGAKIGD 258
           SNISK TSERDIKEF SFSG+ILYVEMQRE+ENTQVAYVTY+DSQGADTAILLTGAKIGD
Sbjct: 9   SNISKLTSERDIKEFFSFSGEILYVEMQRESENTQVAYVTYKDSQGADTAILLTGAKIGD 68

Query: 259 LFVSITLVENYHLPPEAMSSILDKRKTATRIAPSQAEDVVSTMLAKGFILGKDALNKAKA 318
           L V+ITLVENYHLPPEAMSSILDKR+T T IAP+QAEDVVSTMLAKGFILGKDALNKAKA
Sbjct: 69  LSVTITLVENYHLPPEAMSSILDKRQTVTGIAPNQAEDVVSTMLAKGFILGKDALNKAKA 128

Query: 319 FDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSALAVAE 378
           FDERLQ TSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSALAVAE
Sbjct: 129 FDERLQFTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSALAVAE 188

Query: 379 QKASNAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKAKVEKAEEEKEESIYHER 438
           QKA++AGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTK KVEKAEEEK+ESIY ER
Sbjct: 189 QKATSAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKVKVEKAEEEKKESIYRER 248

Query: 439 TGIINNFAEFHLDEPLPGEPAIVPVNSADR 469
           TGII+NFAE HLDEPLPGEPAIVPVNSADR
Sbjct: 249 TGIISNFAELHLDEPLPGEPAIVPVNSADR 278

BLAST of CmoCh04G011100 vs. NCBI nr
Match: gi|449449026|ref|XP_004142266.1| (PREDICTED: uncharacterized protein LOC101203394 [Cucumis sativus])

HSP 1 Score: 468.0 bits (1203), Expect = 1.9e-128
Identity = 251/270 (92.96%), Positives = 261/270 (96.67%), Query Frame = 1

Query: 199 SNISKATSERDIKEFLSFSGQILYVEMQRETENTQVAYVTYEDSQGADTAILLTGAKIGD 258
           SNISK TSERDIKEF SFSG+ILYVEMQRE+ENTQVAYVTY+DSQGADTAILLTGAKIGD
Sbjct: 9   SNISKLTSERDIKEFFSFSGEILYVEMQRESENTQVAYVTYKDSQGADTAILLTGAKIGD 68

Query: 259 LFVSITLVENYHLPPEAMSSILDKRKTATRIAPSQAEDVVSTMLAKGFILGKDALNKAKA 318
           L V+ITLVENYHLPPEAMSSILDKR+T T IAP+QAEDVVSTMLAKGFILGKDALNKAKA
Sbjct: 69  LSVTITLVENYHLPPEAMSSILDKRQTVTGIAPNQAEDVVSTMLAKGFILGKDALNKAKA 128

Query: 319 FDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSALAVAE 378
           FDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQV EKTKSALAVAE
Sbjct: 129 FDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVREKTKSALAVAE 188

Query: 379 QKASNAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKAKVEKAEEEKEESIYHER 438
           QKA++AGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTK KVEKAEEEK+ESIY ER
Sbjct: 189 QKATSAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKVKVEKAEEEKKESIYRER 248

Query: 439 TGIINNFAEFHLDEPLPGEPAIVPVNSADR 469
           TGII+NFAE HLDEPLPGEPAIVPVNSADR
Sbjct: 249 TGIISNFAELHLDEPLPGEPAIVPVNSADR 278

BLAST of CmoCh04G011100 vs. NCBI nr
Match: gi|590697684|ref|XP_007045509.1| (RNA-binding family protein isoform 1 [Theobroma cacao])

HSP 1 Score: 350.5 bits (898), Expect = 4.5e-93
Identity = 189/269 (70.26%), Positives = 223/269 (82.90%), Query Frame = 1

Query: 199 SNISKATSERDIKEFLSFSGQILYVEMQRETENTQVAYVTYEDSQGADTAILLTGAKIGD 258
           SNIS A S+RDIKEF SFSG I YVEM+RETEN QVAYVT++DSQGADTA+LLTGA I D
Sbjct: 37  SNISLAASQRDIKEFFSFSGDIQYVEMRRETENAQVAYVTFKDSQGADTAMLLTGATIVD 96

Query: 259 LFVSITLVENYHLPPEAMSSILDKRKTATRIAPSQAEDVVSTMLAKGFILGKDALNKAKA 318
           L V+IT VE+Y LPPEA+ S ++ +   T     +AEDV+STMLAKGF+LGKDA+NKAKA
Sbjct: 97  LSVNITPVEDYQLPPEALVSNMENKPAVTDSTVKKAEDVMSTMLAKGFVLGKDAINKAKA 156

Query: 319 FDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSALAVAE 378
           FDER   TSNASA V SID+KMG++EK++ GTAVVNEK+REM+E+FQV+EKTKSA AVAE
Sbjct: 157 FDERHHFTSNASAAVTSIDQKMGLSEKLSIGTAVVNEKMREMNEIFQVSEKTKSAFAVAE 216

Query: 379 QKASNAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKAKVEKAEEEKEESIYHER 438
           QKAS+AGTA+MSN YV TGA W SNA +AV KAAE+V  +TK KVEKAEEEK+E IY ER
Sbjct: 217 QKASSAGTAIMSNRYVSTGALWLSNAFSAVAKAAEDVGMLTKEKVEKAEEEKKEIIYRER 276

Query: 439 TGIINNFAEFHLDEPLPGEPAIVPVNSAD 468
           TGII++FA+FHLDE    EP IVPV+S D
Sbjct: 277 TGIISDFAQFHLDESSAAEPTIVPVDSTD 305

BLAST of CmoCh04G011100 vs. NCBI nr
Match: gi|590697687|ref|XP_007045510.1| (RNA-binding family protein isoform 2 [Theobroma cacao])

HSP 1 Score: 350.5 bits (898), Expect = 4.5e-93
Identity = 189/269 (70.26%), Positives = 223/269 (82.90%), Query Frame = 1

Query: 199 SNISKATSERDIKEFLSFSGQILYVEMQRETENTQVAYVTYEDSQGADTAILLTGAKIGD 258
           SNIS A S+RDIKEF SFSG I YVEM+RETEN QVAYVT++DSQGADTA+LLTGA I D
Sbjct: 8   SNISLAASQRDIKEFFSFSGDIQYVEMRRETENAQVAYVTFKDSQGADTAMLLTGATIVD 67

Query: 259 LFVSITLVENYHLPPEAMSSILDKRKTATRIAPSQAEDVVSTMLAKGFILGKDALNKAKA 318
           L V+IT VE+Y LPPEA+ S ++ +   T     +AEDV+STMLAKGF+LGKDA+NKAKA
Sbjct: 68  LSVNITPVEDYQLPPEALVSNMENKPAVTDSTVKKAEDVMSTMLAKGFVLGKDAINKAKA 127

Query: 319 FDERLQLTSNASATVASIDRKMGITEKITAGTAVVNEKVREMDEMFQVTEKTKSALAVAE 378
           FDER   TSNASA V SID+KMG++EK++ GTAVVNEK+REM+E+FQV+EKTKSA AVAE
Sbjct: 128 FDERHHFTSNASAAVTSIDQKMGLSEKLSIGTAVVNEKMREMNEIFQVSEKTKSAFAVAE 187

Query: 379 QKASNAGTALMSNYYVLTGAAWFSNAVTAVTKAAEEVTQMTKAKVEKAEEEKEESIYHER 438
           QKAS+AGTA+MSN YV TGA W SNA +AV KAAE+V  +TK KVEKAEEEK+E IY ER
Sbjct: 188 QKASSAGTAIMSNRYVSTGALWLSNAFSAVAKAAEDVGMLTKEKVEKAEEEKKEIIYRER 247

Query: 439 TGIINNFAEFHLDEPLPGEPAIVPVNSAD 468
           TGII++FA+FHLDE    EP IVPV+S D
Sbjct: 248 TGIISDFAQFHLDESSAAEPTIVPVDSTD 276

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name  E-value  Identity  Description
BPA1_ARATH  1.3e-52  52.44  Binding partner of ACD11 1 OS=Arabidopsis thaliana GN=BPA1 PE=1 SV=1

vs. TrEMBL
Match Name  E-value  Identity  Description
A0A0A0KIR7_CUCSA  1.3e-128  92.96  Uncharacterized protein OS=Cucumis sativus GN=Csa_5G023900 PE=4 SV=1
A0A061EAC5_THECC  3.1e-93  70.26  RNA-binding family protein isoform 2 OS=Theobroma cacao GN=TCM_011268 PE=4 SV=1
A0A061E8Q5_THECC  3.1e-93  70.26  RNA-binding family protein isoform 1 OS=Theobroma cacao GN=TCM_011268 PE=4 SV=1
W9RFG7_9ROSA  3.8e-91  71.00  Protein vip1 OS=Morus notabilis GN=L484_006763 PE=4 SV=1
A0A061E9N0_THECC  1.9e-90  69.89  RNA-binding family protein isoform 3 (Fragment) OS=Theobroma cacao GN=TCM_011268...

vs. TAIR10
Match Name  E-value  Identity  Description
AT4G17720.1  2.9e-66  55.04  RNA-binding (RRM/RBD/RNP motifs) family protein
AT5G46870.1  3.2e-65  52.67  RNA-binding (RRM/RBD/RNP motifs) family protein
AT1G67950.3  3.0e-63  54.14  RNA-binding (RRM/RBD/RNP motifs) family protein
AT5G16840.2  2.6e-54  52.44  binding partner of acd11 1
AT5G32450.1  8.2e-45  42.86  RNA binding (RRM/RBD/RNP motifs) family protein

vs. NCBI nr
Match Name  E-value  Identity  Description
gi|659103004|ref|XP_008452424.1|  1.1e-128  92.96  PREDICTED: protein vip1-like isoform X2 [Cucumis melo]
gi|659103002|ref|XP_008452423.1|  1.1e-128  92.96  PREDICTED: protein vip1-like isoform X1 [Cucumis melo]
gi|449449026|ref|XP_004142266.1|  1.9e-128  92.96  PREDICTED: uncharacterized protein LOC101203394 [Cucumis sativus]
gi|590697684|ref|XP_007045509.1|  4.5e-93  70.26  RNA-binding family protein isoform 1 [Theobroma cacao]
gi|590697687|ref|XP_007045510.1|  4.5e-93  70.26  RNA-binding family protein isoform 2 [Theobroma cacao]

The following terms have been associated with this gene:

Vocabulary: INTERPRO
Term  Definition
IPR000315  Znf_B-box
IPR000504  RRM_dom
IPR012677  Nucleotide-bd_a/b_plait_sf

Vocabulary: Cellular Component
Term  Definition
GO:0005622  intracellular

Vocabulary: Molecular Function
Term  Definition
GO:0008270  zinc ion binding
GO:0003676  nucleic acid binding
GO:0000166  nucleotide binding

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0019344 cysteine biosynthetic process
biological_process GO:0008150 biological_process
cellular_component GO:0005622 intracellular
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0000166 nucleotide binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
CmoCh04G011100.1  CmoCh04G011100.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR000315  B-box-type zinc finger  PROFILE  PS50119  ZF_BBOX  coord: 18..63, score: 8
IPR000504  RNA recognition motif domain  PFAM  PF00076  RRM_1  coord: 200..258, score: 5.
IPR000504  RNA recognition motif domain  PROFILE  PS50102  RRM  coord: 199..268, score: 8
IPR012677  Nucleotide-binding alpha-beta plait domain  GENE3D  G3DSA:3.30.70.330  -  coord: 200..258, score: 1.
IPR012677  Nucleotide-binding alpha-beta plait domain  unknown  SSF54928  RNA-binding domain, RBD  coord: 199..268, score: 2.4
None  No IPR available  unknown  Coil  Coil  coord: 409..431, scor
None  No IPR available  PANTHER  PTHR31065  FAMILY NOT NAMED  coord: 6..405, score: 1.3
None  No IPR available  PANTHER  PTHR31065:SF15  PLATZ TRANSCRIPTION FACTOR FAMILY PROTEIN  coord: 6..405, score: 1.3
None  No IPR available  unknown  SSF57845  B-box zinc-binding domain  coord: 16..52, score: 4.1