CmoCh04G021720 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh04G021720
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Protein of unknown function (DUF789)
Location: Cmo_Chr04: 15821372 .. 15825745 (+)
RNA-Seq Expression: CmoCh04G021720
Synteny: CmoCh04G021720
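The Location field can be converted into a span length with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is common for genome browsers but is not stated on this page; the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split a location such as 'Cmo_Chr04: 15821372 .. 15825745 (+)' into parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Cmo_Chr04: 15821372 .. 15825745 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 4374 bp on the plus strand of Cmo_Chr04.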
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CCTTGGAAAAACCAGCGTACTGCTTCGCCTCCATCTCTTTCCTTTTTTCTTCCATTGCTCTCTTTTCTTCTCTCTGTTTCGACGACGCCATTGAAACCCTCGTTCTGTGTTTTAGATCCATTGCGTCTCAATCTGTTTAAGGTTTTTGCATTCGAACTTATCTTCTTCGATTCGATTTGCCTGGATTTTTTGGCGAACCACAGGAGGGCTCTGTTTTCGTCTCTCTCACATTTTGGAACCCGATTGCCTGTTTGATTCTATCTCGCAATCAATCGTTACTCTTCTGTGCTTCGGGATTTTGGTTCCAATCTGATTGGTTTTTAGCGAGATTGTTACTTTGTGCATTTCCTTGACTTTTATTGTGGATTTTAGGACTGCGGATTGCGATTTTCCGGAGCTGTTTTTCGTTTGGTTTTCGAAATATAATGTTAGGTGCTGGATTGCAGTTTGGTCGTGGTTGTGGTGATGATAGGTTTTACAATTCGACGAAAGCTCGTAGGGTGAATCAGGGACGTCAAAATGATCAGCTCCGTAGAGCTCAGAGCGACGTTTCTGCAGGTCAATCTCCTGTGGTTAAACCGACCACGGTGTCCTCGGTGACTAGAGAAACCGAGAGCGGAGATGCGTGTGAAGAGCTCCCCAAATCTATTTCGATGTCGGCCTTTGAGCCAGTGGTATCGTCGCTGAGTAATCTGCAGCGGTTTTTGCAGTCTATCGCGCCATCTGTACCTGCACAGTACCTCTCAAAGGTTTGAATCGATTGGAAGTTTCTCTTCCATTGGCTAATATTTTGTTTTCTTTAGTGTGAAAAAGTCGATTCTCAAATGTTTTAAGTCTTTACCTTTTGTGGCTCTCGGTATGGCCCTCCTAATTCTGATAAGCGGATTGATGTTTGAGTTTTTGGTTCAATTCTGAACCGTATAACAAATAAGGGATTATGCGAAGAAGAAAGCTAGTTTGGTTGTGGGAATTGCTTCGAGGGATAATATACAGATTGCAGTTGAAATTTTAATGTCTTTCCTTTTTCTATCGCCTCAAGTTTATCAAGGATCAGCGAATGAGGGTTTATTTTTTTTTCATAGCTAGTATCAACTTTAAAGTCTCAATTAATGATCCCAATAGTTTACTGTTCCTCTGATATTGTCTTCATTTGTTTTTCATTGCAGACAACGGTAAAGGGTTGGAGAACCTGTGACGGAGAATTTCAACCATACTTTGTCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGTGCAGGTGTGCCTCTTGTACTAAACGACAGTGACAGTGTTATCCAGTATTATGTACCATATTTATCCGGTATACAGATATATGGTGAATCCTTGAAGTCCTCTGCAAAGTTAAGGTATTCTCGTTAACTGGAAATAAATTTTGTCTATACAATTCTTTGCATTTTTTTTTCTAAATTATTTCTGATAAGCTGTTTGTATGAATTTATTGGTGAAATATAGCCAAATGTTCAAGTGAGGGTCAATTTCTGTGGATGTCTTTTTTGTTTTTTCATATCATGCATACTTATATTCTTGATTCAATCATTAGAAAGATTATCGAACTGTTCAAGAATTTTTCTTGGGCAAAAGAACTGTAGGAGTTGATAGCTAATATTTGGTTCGACGATGTTATAAATTGTAATAAGTTCGTCAATATGTGTTATGTAACACTTGAGAATTCAATCTAAAAGTCTTGTTGAAGAATAAGACCTTTAGTTATAAATTATCCAAGAATCTCAAAAACGATAAGGAATCGAGCCAAGGGAAAATTTTGGAACGGATTACTTTGATGGTCAAGATCAAGAGTACAAAAAAACTCGTTTCTAAACTCATGATTCAAATCACTCCACAAAATAGCTTAATCAAGGCTACTTGAATGACTCTGGCATGCAAATCAAAACATAAGAAATTTCAAGTAACAAACTTATGACCTAAAATGCATAATAGGTTACTTTCATTTTAATATCCAAAATCTTC
ATATTAAAGCAACAAGATGAATGACTATTTATAACCTTGAAAGTAAACCTCAGAGTAACTTAGGGTCAATTGAAAGGTTGTAACTCTCATGTTCATCAATGATCACTAGCTAATGGTAACTTATAACCTTAAAAATAATTCTCTAATAATAATAATAATAATAATAAATCTTCTTAAATCCGAGAATTAATATTTTTATAAGCTAACCGAAAGTTTGTAACCTATCTAAGAATAATTGAAAGGTCACTTGATGCAGTTTGAAGTAGATTGCCTTATTCCGTATCATGCAACCTGGTGAAGGGGTAAAGACCAAGCATTATTTTATTGACAGATTATGATAAATTCTCTCTGGTTTGATGTAAAAATTAACCAAATAAAGCCTTGCGTAAGTTCAATGAACCAATTTGAAGGATAACGTATTTTAATTCACTAAATAAGTGTTTAAAGTTTAACATTGAGTTGGTACTCAGTATACTGAATACTTGGTATTTTAGCATGGACTTTTCTATGTGATGTAGGCAACCAGGTGAGGACAGTGATAGTGATTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAAGCTGAACGAGCTCTAAAATACATGGAGAAACAACTCAATCATCACAACTTATCTTCTGAGATTTCTCGTAGAATGGATAGGATATCTTTGCGGGACCAGCTAATTGGACTTCAAGAAGACTGTTCTAGTGACGAGGCTGAATCTCCTAATCCTCAAGGCCAGCTACTATTTGAGCATCTTGAACGGGATTTGCCTTATAGTCGTGAACCTTTAGCAGATAAGGCATGTAAATGCTTAAACAGAACCCCTCCCCTATTGAAATCAGAAGAAAATAAAATACATAAATCCTTGCCTTTTATCCCTCATAACCTTACTGTAGTATATTTTCTCCAGATATCAGATCTTGCCTTCCAGTTCCCTGAGCTCAGGACATTACGAAGTTGTGATCTATTGCCTTCCAGCTGGTTTTCTGTGGCATGGTACTGTCAAGCCTTGAAGCTAATTGGATTAATTGGACACTTTTTTACTTGCATGAATCACTTATTGTGTTTTGGTGGTTCATTCTTAGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAAGGATCTGGATGCCTGCTTCCTCACCTTTCATTATTTGTCTACGCCAATGGGAGGTAGTCTATTTCTGTCTCTGCTTCTTAAATGTCTAATTTTTTTGTGTGGAGGTGTAGTTCATTTAATTTCCTTGTTTTCTTTTGCGCTTCATAATGTTTTTACCTGTCTTTATGTATATAGAAGACTTTTTCTGTCTCCCTGCTCTTCTGTCTTAACTGAGGTTGATGGCAGCTTCAAACAGTACTTTGACAATGCTTGGAATTAGGGATTCTTTGTGTAGCAACGCTCTATTATGTTTCACAACTTCATTTTCATATTTTTCTTCACTGAAACCATCTTAAGGAATACAACTTCCACTGCTAAGTTTATTGTCTGTATTTGTTATTTACTCAAACTTTATTGGAGATTGGTCGACGATTTTGTGTTAAAATGGTTTTCATTCATAAATTTGTGCTCTTCATGCAAGTATTCATGGATGACTTGGTATCTTAAGAACAAAATACCAATGGTTTATATTTTACCAAGTTATATATAGAAGATTTAGCCCGCACATTGCAACTGTTTTGATACAGACCAGCGTACACTAATCCAATAAGTTCTGACTTTTGTCTATGGCTTCGCTGTTATTTTGTTGCGTATGCATTTGAAATAAGCTGGTAATAAACTGGTAATTGTCTCCTCATGGTTGTAACTTTAAAACAAGTTCTAGTGCTCCATGCCACTCTGCGAATTTACAAATATAATTGCATCTGGGCAAGTTGTATCCAGATTGTTGATAATATGATGGGCCTCTTCACATCTTACATCTTAAATTTTTTTGGTACCGATTCTTCATTTATTTTAATTTCAGGGGGACGCAGTGTACAAGGTCCTG
TACTAACGTATCCCAGTGAGATAGATGGTATCCCTAAGATGTCACTGCCAGTTTTTGGTCTAGCTTCATACAAGTTTAGAGGGTCTTTATGGACTCCAAATGGCAGATTCGAGTGGCAATTGGCAAAGTCACTTTTGCAGGATGCTGAGGATTGGTTGAGACAACGCCAAGTAAACCACCCCGACTTCCTCTTCTTCAGGCGACGGTGAAGTTCTGGTAACTTCTACGAGATCTAAAGGTGGAAATCAAGATATCATAGTATTCAGTACCATGTCGTGATTTGTGGGCACTGTTCTAATTTTGGAAAATGGGAAGGAAAAAGGAAAAGGAAAAAAAAAAAAGAAAAAGGAAAAAGGAAAAAGGAAAAAGGAAAA

mRNA sequence

CCTTGGAAAAACCAGCGTACTGCTTCGCCTCCATCTCTTTCCTTTTTTCTTCCATTGCTCTCTTTTCTTCTCTCTGTTTCGACGACGCCATTGAAACCCTCGTTCTGTGTTTTAGATCCATTGCGTCTCAATCTGTTTAAGGTTTTTGCATTCGAACTTATCTTCTTCGATTCGATTTGCCTGGATTTTTTGGCGAACCACAGGAGGGCTCTGTTTTCGTCTCTCTCACATTTTGGAACCCGATTGCCTGTTTGATTCTATCTCGCAATCAATCGTTACTCTTCTGTGCTTCGGGATTTTGGTTCCAATCTGATTGGTTTTTAGCGAGATTGTTACTTTGTGCATTTCCTTGACTTTTATTGTGGATTTTAGGACTGCGGATTGCGATTTTCCGGAGCTGTTTTTCGTTTGGTTTTCGAAATATAATGTTAGGTGCTGGATTGCAGTTTGGTCGTGGTTGTGGTGATGATAGGTTTTACAATTCGACGAAAGCTCGTAGGGTGAATCAGGGACGTCAAAATGATCAGCTCCGTAGAGCTCAGAGCGACGTTTCTGCAGGTCAATCTCCTGTGGTTAAACCGACCACGGTGTCCTCGGTGACTAGAGAAACCGAGAGCGGAGATGCGTGTGAAGAGCTCCCCAAATCTATTTCGATGTCGGCCTTTGAGCCAGTGGTATCGTCGCTGAGTAATCTGCAGCGGTTTTTGCAGTCTATCGCGCCATCTGTACCTGCACAGTACCTCTCAAAGACAACGGTAAAGGGTTGGAGAACCTGTGACGGAGAATTTCAACCATACTTTGTCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGTGCAGGTGTGCCTCTTGTACTAAACGACAGTGACAGTGTTATCCAGTATTATGTACCATATTTATCCGGTATACAGATATATGGTGAATCCTTGAAGTCCTCTGCAAAGTTAAGGCAACCAGGTGAGGACAGTGATAGTGATTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAAGCTGAACGAGCTCTAAAATACATGGAGAAACAACTCAATCATCACAACTTATCTTCTGAGATTTCTCGTAGAATGGATAGGATATCTTTGCGGGACCAGCTAATTGGACTTCAAGAAGACTGTTCTAGTGACGAGGCTGAATCTCCTAATCCTCAAGGCCAGCTACTATTTGAGCATCTTGAACGGGATTTGCCTTATAGTCGTGAACCTTTAGCAGATAAGGCATTATATTTTCTCCAGATATCAGATCTTGCCTTCCAGTTCCCTGAGCTCAGGACATTACGAAGTTGTGATCTATTGCCTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAAGGATCTGGATGCCTGCTTCCTCACCTTTCATTATTTGTCTACGCCAATGGGAGGGGGACGCAGTGTACAAGGTCCTGTACTAACGTATCCCAGTGAGATAGATGGTATCCCTAAGATGTCACTGCCAGTTTTTGGTCTAGCTTCATACAAGTTTAGAGGGTCTTTATGGACTCCAAATGGCAGATTCGAGTGGCAATTGGCAAAGTCACTTTTGCAGGATGCTGAGGATTGGTTGAGACAACGCCAAGTAAACCACCCCGACTTCCTCTTCTTCAGGCGACGGTGAAGTTCTGGTAACTTCTACGAGATCTAAAGGTGGAAATCAAGATATCATAGTATTCAGTACCATGTCGTGATTTGTGGGCACTGTTCTAATTTTGGAAAATGGGAAGGAAAAAGGAAAAGGAAAAAAAAAAAAGAAAAAGGAAAAAGGAAAAAGGAAAAAGGAAAA

Coding sequence (CDS)

ATGTTAGGTGCTGGATTGCAGTTTGGTCGTGGTTGTGGTGATGATAGGTTTTACAATTCGACGAAAGCTCGTAGGGTGAATCAGGGACGTCAAAATGATCAGCTCCGTAGAGCTCAGAGCGACGTTTCTGCAGGTCAATCTCCTGTGGTTAAACCGACCACGGTGTCCTCGGTGACTAGAGAAACCGAGAGCGGAGATGCGTGTGAAGAGCTCCCCAAATCTATTTCGATGTCGGCCTTTGAGCCAGTGGTATCGTCGCTGAGTAATCTGCAGCGGTTTTTGCAGTCTATCGCGCCATCTGTACCTGCACAGTACCTCTCAAAGACAACGGTAAAGGGTTGGAGAACCTGTGACGGAGAATTTCAACCATACTTTGTCCTAGGTGATTTGTGGGAGTCTTTTAAGGAATGGAGTGCTTATGGTGCAGGTGTGCCTCTTGTACTAAACGACAGTGACAGTGTTATCCAGTATTATGTACCATATTTATCCGGTATACAGATATATGGTGAATCCTTGAAGTCCTCTGCAAAGTTAAGGCAACCAGGTGAGGACAGTGATAGTGATTTCAGAGATTCTAGTAGTGATGGTAGTAGTGATTCAGAAGCTGAACGAGCTCTAAAATACATGGAGAAACAACTCAATCATCACAACTTATCTTCTGAGATTTCTCGTAGAATGGATAGGATATCTTTGCGGGACCAGCTAATTGGACTTCAAGAAGACTGTTCTAGTGACGAGGCTGAATCTCCTAATCCTCAAGGCCAGCTACTATTTGAGCATCTTGAACGGGATTTGCCTTATAGTCGTGAACCTTTAGCAGATAAGGCATTATATTTTCTCCAGATATCAGATCTTGCCTTCCAGTTCCCTGAGCTCAGGACATTACGAAGTTGTGATCTATTGCCTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATTCCAACTGGACCAACATTAAAGGATCTGGATGCCTGCTTCCTCACCTTTCATTATTTGTCTACGCCAATGGGAGGGGGACGCAGTGTACAAGGTCCTGTACTAACGTATCCCAGTGAGATAGATGGTATCCCTAAGATGTCACTGCCAGTTTTTGGTCTAGCTTCATACAAGTTTAGAGGGTCTTTATGGACTCCAAATGGCAGATTCGAGTGGCAATTGGCAAAGTCACTTTTGCAGGATGCTGAGGATTGGTTGAGACAACGCCAAGTAAACCACCCCGACTTCCTCTTCTTCAGGCGACGGTGA

Protein sequence

MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR
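The protein sequence is the conceptual translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame 0, the translation can be reproduced as follows. The function name `translate` is illustrative, and only the first codons of the CDS above are used as input.

```python
# Standard genetic code, built in TCAG order (assumption: nuclear code, table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 11 codons of the CDS listed above.
print(translate("ATGTTAGGTGCTGGATTGCAGTTTGGTCGTGGT"))  # MLGAGLQFGRG
```

Running the same function on the full CDS should give the protein sequence shown above, with the terminal TGA codon ending translation.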
Homology
BLAST of CmoCh04G021720 vs. ExPASy TrEMBL
Match: A0A6J1HCB0 (uncharacterized protein LOC111462294 OS=Cucurbita moschata OX=3662 GN=LOC111462294 PE=4 SV=1)

HSP 1 Score: 822.4 bits (2123), Expect = 8.6e-235
Identity = 411/417 (98.56%), Positives = 411/417 (98.56%), Query Frame = 0

Query: 1   MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR 60
           MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR
Sbjct: 1   MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR 60

Query: 61  ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE 120
           ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE
Sbjct: 61  ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE 120

Query: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ 180
           FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ
Sbjct: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ 180

Query: 181 PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQE 240
           PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQE
Sbjct: 181 PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQE 240

Query: 241 DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL 300
           DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADK      ISDLAFQFPELRTLRSCDL
Sbjct: 241 DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADK------ISDLAFQFPELRTLRSCDL 300

Query: 301 LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP 360
           LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP
Sbjct: 301 LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP 360

Query: 361 KMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR 418
           KMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR
Sbjct: 361 KMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR 411
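The percentages in each HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from a summary line as a consistency check; the function name `hsp_percentages` is hypothetical, and the pattern is written only for the line layout used on this page.

```python
import re

# "Posi?tives" also accepts the "Postives" spelling found in some BLAST report renderers.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 406/417 (97.36%), Positives = 411/417 (98.56%), Query Frame = 0"
print(hsp_percentages(line))
```

For example, 406 identical positions over a 417-column alignment give 97.36% identity.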

BLAST of CmoCh04G021720 vs. ExPASy TrEMBL
Match: A0A6J1I465 (uncharacterized protein LOC111470464 OS=Cucurbita maxima OX=3661 GN=LOC111470464 PE=4 SV=1)

HSP 1 Score: 816.2 bits (2107), Expect = 6.2e-233
Identity = 406/417 (97.36%), Positives = 411/417 (98.56%), Query Frame = 0

Query: 1   MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR 60
           MLGAGLQFGRGCGDDRFYNSTKAR+V+QGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR
Sbjct: 1   MLGAGLQFGRGCGDDRFYNSTKARKVHQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR 60

Query: 61  ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE 120
           ETESGDAC+ELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE
Sbjct: 61  ETESGDACDELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE 120

Query: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ 180
           FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ
Sbjct: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ 180

Query: 181 PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQE 240
           PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSE+SRRMDRISLRDQLIGLQE
Sbjct: 181 PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSELSRRMDRISLRDQLIGLQE 240

Query: 241 DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL 300
           DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADK      ISDLAFQFPELRTLRSCDL
Sbjct: 241 DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADK------ISDLAFQFPELRTLRSCDL 300

Query: 301 LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP 360
           LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP
Sbjct: 301 LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP 360

Query: 361 KMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR 418
           KMSLPVFGLASYKFRGSLWTPNGR+EWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR
Sbjct: 361 KMSLPVFGLASYKFRGSLWTPNGRYEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR 411

BLAST of CmoCh04G021720 vs. ExPASy TrEMBL
Match: A0A5D3BPU4 (DUF789 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1728G00070 PE=4 SV=1)

HSP 1 Score: 745.0 bits (1922), Expect = 1.7e-211
Identity = 371/417 (88.97%), Positives = 387/417 (92.81%), Query Frame = 0

Query: 1   MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR 60
           MLGAGLQFGRGCGDDRFYN TKARRV+QGRQ DQLRRAQSDVSAGQS VVKP+ VSSV R
Sbjct: 1   MLGAGLQFGRGCGDDRFYNPTKARRVHQGRQKDQLRRAQSDVSAGQSLVVKPSAVSSVIR 60

Query: 61  ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE 120
           ETE G+ CEELPKSI+MS FEPVVSSLSNL+RFLQSIAPSVPAQYLSKTT+KGWRTCD E
Sbjct: 61  ETECGEGCEELPKSIAMSGFEPVVSSLSNLERFLQSIAPSVPAQYLSKTTMKGWRTCDME 120

Query: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ 180
           FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSV+QYYVPYLSGIQIYGES KSSAK RQ
Sbjct: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESSKSSAKSRQ 180

Query: 181 PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQE 240
           PGEDSDSDFRDSSSDGSSDSE ERALKYM KQLNHH+LSSE+SRRMD IS RDQLIGLQE
Sbjct: 181 PGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHHHLSSELSRRMDNISFRDQLIGLQE 240

Query: 241 DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL 300
           DCSSDEAES N QGQLLFEHLERDLPYSREPLADK      ISDLAFQFP+L+TLRSCDL
Sbjct: 241 DCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK------ISDLAFQFPKLKTLRSCDL 300

Query: 301 LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP 360
           LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLS+P GG RSVQ PV+TYPSEIDGIP
Sbjct: 301 LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSSPTGGARSVQCPVVTYPSEIDGIP 360

Query: 361 KMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR 418
           KMSLPVFGLASYKFRGSLWTPNG +EWQLA SLL DAEDWLR+RQVNHPDF+FF RR
Sbjct: 361 KMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLHDAEDWLRERQVNHPDFIFFSRR 411

BLAST of CmoCh04G021720 vs. ExPASy TrEMBL
Match: A0A6J1JV26 (uncharacterized protein LOC111489147 OS=Cucurbita maxima OX=3661 GN=LOC111489147 PE=4 SV=1)

HSP 1 Score: 744.2 bits (1920), Expect = 3.0e-211
Identity = 365/417 (87.53%), Positives = 389/417 (93.29%), Query Frame = 0

Query: 1   MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR 60
           M GAGLQFGRGCGDDRFYN TKARR +QGRQNDQLRR QSDVSA +SPV+KPTTVSS+ R
Sbjct: 1   MFGAGLQFGRGCGDDRFYNPTKARRSHQGRQNDQLRRVQSDVSASESPVLKPTTVSSMIR 60

Query: 61  ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE 120
           ETE GD CEELPKSI+MSAFEPVVSSLSNL+RFLQSIAPSVPAQY SKTT+KGWRTCD E
Sbjct: 61  ETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPSVPAQYPSKTTIKGWRTCDAE 120

Query: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ 180
            QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSV+QYYVPYLSGIQIYGESLKSSAK RQ
Sbjct: 121 LQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQ 180

Query: 181 PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQE 240
           PGEDSDSDFRDSSSDGSSDSE ERA+KYM  QLNHH+LSSE+SRRM+R+SLRDQLIGLQE
Sbjct: 181 PGEDSDSDFRDSSSDGSSDSEPERAVKYMGNQLNHHHLSSELSRRMERLSLRDQLIGLQE 240

Query: 241 DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL 300
           DCSSDEAES N QGQLLFEHLERDLPYSREPLADK      +SDLAF+FPEL+TLRSCDL
Sbjct: 241 DCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK------VSDLAFRFPELKTLRSCDL 300

Query: 301 LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP 360
           LPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFHYLSTPMGG RSVQGPV+TYPS+IDGIP
Sbjct: 301 LPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIP 360

Query: 361 KMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR 418
           +MSLPVFGLASYKFRGSLWTPNG +EWQLA SLLQDA+DWLR+R VNHPDF+FF RR
Sbjct: 361 RMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAKDWLRERCVNHPDFIFFSRR 411

BLAST of CmoCh04G021720 vs. ExPASy TrEMBL
Match: A0A6J1FG46 (uncharacterized protein LOC111445107 OS=Cucurbita moschata OX=3662 GN=LOC111445107 PE=4 SV=1)

HSP 1 Score: 741.5 bits (1913), Expect = 1.9e-210
Identity = 367/417 (88.01%), Positives = 386/417 (92.57%), Query Frame = 0

Query: 1   MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR 60
           MLGAGLQFGRGCGDDRFYN TKARR +QGRQNDQLRRAQSDVSA QSPV+KPTTVSSV R
Sbjct: 1   MLGAGLQFGRGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSASQSPVLKPTTVSSVIR 60

Query: 61  ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE 120
           ETE GD CEELP SI+MSAFEPVVSSLSNL+RFLQSI PSVPAQY SKTT+KGWRTCD E
Sbjct: 61  ETEYGDGCEELPISIAMSAFEPVVSSLSNLERFLQSIVPSVPAQYPSKTTMKGWRTCDAE 120

Query: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ 180
            QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSV+QYYVPYLSGIQIYGESLKSSAK RQ
Sbjct: 121 LQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQ 180

Query: 181 PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQE 240
           PGEDSDSDFRDSSSDGSSDSE ERALKYM  QLNHH+LSSE+SRR +R+SLRDQLIGLQE
Sbjct: 181 PGEDSDSDFRDSSSDGSSDSEPERALKYMGNQLNHHHLSSELSRRTERLSLRDQLIGLQE 240

Query: 241 DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL 300
           DC SDEAES N QGQLLFEHLERDLPYSREPLADK      +SDLAF+FPEL+TLRSCDL
Sbjct: 241 DCYSDEAESLNSQGQLLFEHLERDLPYSREPLADK------VSDLAFRFPELKTLRSCDL 300

Query: 301 LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP 360
           LPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFHYLSTPMGG RSVQGPV+TYPS+IDGIP
Sbjct: 301 LPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHYLSTPMGGARSVQGPVVTYPSDIDGIP 360

Query: 361 KMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR 418
           +MSLPVFGLASYKFRGSLWTPNG  EWQLA SLLQDAEDWLR+R VNHPDF+FF RR
Sbjct: 361 RMSLPVFGLASYKFRGSLWTPNGGHEWQLANSLLQDAEDWLRERCVNHPDFIFFSRR 411

BLAST of CmoCh04G021720 vs. NCBI nr
Match: XP_022961613.1 (uncharacterized protein LOC111462294 [Cucurbita moschata])

HSP 1 Score: 822.4 bits (2123), Expect = 1.8e-234
Identity = 411/417 (98.56%), Positives = 411/417 (98.56%), Query Frame = 0

Query: 1   MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR 60
           MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR
Sbjct: 1   MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR 60

Query: 61  ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE 120
           ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE
Sbjct: 61  ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE 120

Query: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ 180
           FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ
Sbjct: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ 180

Query: 181 PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQE 240
           PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQE
Sbjct: 181 PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQE 240

Query: 241 DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL 300
           DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADK      ISDLAFQFPELRTLRSCDL
Sbjct: 241 DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADK------ISDLAFQFPELRTLRSCDL 300

Query: 301 LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP 360
           LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP
Sbjct: 301 LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP 360

Query: 361 KMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR 418
           KMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR
Sbjct: 361 KMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR 411

BLAST of CmoCh04G021720 vs. NCBI nr
Match: KAG7032526.1 (hypothetical protein SDJN02_06575, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 821.2 bits (2120), Expect = 3.9e-234
Identity = 414/448 (92.41%), Positives = 417/448 (93.08%), Query Frame = 0

Query: 1   MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR 60
           MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR
Sbjct: 1   MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR 60

Query: 61  ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE 120
           ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE
Sbjct: 61  ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE 120

Query: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ 180
           FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ
Sbjct: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ 180

Query: 181 PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQE 240
           PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSE+SRRMDRISLRDQLIGLQE
Sbjct: 181 PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSELSRRMDRISLRDQLIGLQE 240

Query: 241 DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKA------------------------ 300
           DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKA                        
Sbjct: 241 DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKACKCLNRTPPLLKSEENKIHKSLPF 300

Query: 301 -------LYFLQISDLAFQFPELRTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACF 360
                  +YFLQISDLAFQFPELRTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACF
Sbjct: 301 IPHNLTVVYFLQISDLAFQFPELRTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACF 360

Query: 361 LTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQL 418
           LTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGR+EWQL
Sbjct: 361 LTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRYEWQL 420

BLAST of CmoCh04G021720 vs. NCBI nr
Match: XP_023554659.1 (uncharacterized protein LOC111811852 [Cucurbita pepo subsp. pepo] >KAG6601821.1 hypothetical protein SDJN03_07054, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 820.5 bits (2118), Expect = 6.7e-234
Identity = 409/417 (98.08%), Positives = 411/417 (98.56%), Query Frame = 0

Query: 1   MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR 60
           MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR
Sbjct: 1   MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR 60

Query: 61  ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE 120
           ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE
Sbjct: 61  ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE 120

Query: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ 180
           FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ
Sbjct: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ 180

Query: 181 PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQE 240
           PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSE+SRRMDRISLRDQLIGLQE
Sbjct: 181 PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSELSRRMDRISLRDQLIGLQE 240

Query: 241 DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL 300
           DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADK      ISDLAFQFPELRTLRSCDL
Sbjct: 241 DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADK------ISDLAFQFPELRTLRSCDL 300

Query: 301 LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP 360
           LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP
Sbjct: 301 LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP 360

Query: 361 KMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR 418
           KMSLPVFGLASYKFRGSLWTPNGR+EWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR
Sbjct: 361 KMSLPVFGLASYKFRGSLWTPNGRYEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR 411

BLAST of CmoCh04G021720 vs. NCBI nr
Match: XP_022971786.1 (uncharacterized protein LOC111470464 [Cucurbita maxima])

HSP 1 Score: 816.2 bits (2107), Expect = 1.3e-232
Identity = 406/417 (97.36%), Positives = 411/417 (98.56%), Query Frame = 0

Query: 1   MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR 60
           MLGAGLQFGRGCGDDRFYNSTKAR+V+QGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR
Sbjct: 1   MLGAGLQFGRGCGDDRFYNSTKARKVHQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR 60

Query: 61  ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE 120
           ETESGDAC+ELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE
Sbjct: 61  ETESGDACDELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE 120

Query: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ 180
           FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ
Sbjct: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ 180

Query: 181 PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQE 240
           PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSE+SRRMDRISLRDQLIGLQE
Sbjct: 181 PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSELSRRMDRISLRDQLIGLQE 240

Query: 241 DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL 300
           DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADK      ISDLAFQFPELRTLRSCDL
Sbjct: 241 DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADK------ISDLAFQFPELRTLRSCDL 300

Query: 301 LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP 360
           LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP
Sbjct: 301 LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP 360

Query: 361 KMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR 418
           KMSLPVFGLASYKFRGSLWTPNGR+EWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR
Sbjct: 361 KMSLPVFGLASYKFRGSLWTPNGRYEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR 411

BLAST of CmoCh04G021720 vs. NCBI nr
Match: XP_038874258.1 (uncharacterized protein LOC120066989 isoform X1 [Benincasa hispida])

HSP 1 Score: 753.4 bits (1944), Expect = 1.0e-213
Identity = 374/417 (89.69%), Positives = 390/417 (93.53%), Query Frame = 0

Query: 1   MLGAGLQFGRGCGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTR 60
           MLGAGLQF RGCGDDRFYN TKARR +QGRQNDQLRRAQSDVSAGQSP+VKP  VSSV R
Sbjct: 1   MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDQLRRAQSDVSAGQSPLVKPGVVSSVIR 60

Query: 61  ETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGE 120
           ETE GD CEELPKSI+MSAFEPVVSSLSNL+RFLQSIAPSVPAQYLSKTT+KGWRTCD E
Sbjct: 61  ETEYGDGCEELPKSIAMSAFEPVVSSLSNLERFLQSIAPSVPAQYLSKTTMKGWRTCDVE 120

Query: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQ 180
           FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSV+QYYVPYLSGIQIYGESLKSSAK RQ
Sbjct: 121 FQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQ 180

Query: 181 PGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQLIGLQE 240
           PGEDSDSDFRDSSSDGSSDSE ERALKYM KQLNHH+LSSE+ RRMDRIS RDQLIGLQE
Sbjct: 181 PGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHHHLSSELFRRMDRISFRDQLIGLQE 240

Query: 241 DCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDL 300
           DCSSDEAES N QGQLLFEHLERDLPYSREPLADK      ISDLAFQFPEL+TLRSCDL
Sbjct: 241 DCSSDEAESLNSQGQLLFEHLERDLPYSREPLADK------ISDLAFQFPELKTLRSCDL 300

Query: 301 LPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIP 360
           LPSSWFSVAWYPIYRIPTGPTL+DLDACFLTFH+LS+PMGG RSVQGPV+TYPSEIDGIP
Sbjct: 301 LPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHHLSSPMGGARSVQGPVVTYPSEIDGIP 360

Query: 361 KMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFFRRR 418
           KMSLPVFGLASYKFRGSLWTPNG +EWQLA SLLQDAE+WLR RQVNHPDF+FF RR
Sbjct: 361 KMSLPVFGLASYKFRGSLWTPNGGYEWQLANSLLQDAEEWLRDRQVNHPDFIFFSRR 411

BLAST of CmoCh04G021720 vs. TAIR 10
Match: AT2G01260.1 (Protein of unknown function (DUF789))

HSP 1 Score: 412.5 bits (1059), Expect = 3.9e-115
Identity = 248/423 (58.63%), Positives = 273/423 (64.54%), Query Frame = 0

Query: 1   MLGAGLQFGRG-CGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVT 60
           MLGAG Q  RG  GDD FY S K RR NQ  + DQLRRAQSDVS   S    P       
Sbjct: 1   MLGAGFQLTRGRHGDDPFYTSAKTRRANQ--RIDQLRRAQSDVSNVPSSAPSP------- 60

Query: 61  RETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCD- 120
                                EP   S SNL RFL+S+ PSVPAQ+LSKT ++  R  D 
Sbjct: 61  ----------------HKQQLEPSDLSSSNLDRFLESVTPSVPAQFLSKTLLRERRADDD 120

Query: 121 -GEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVIQYYVPYLSGIQIYGES--LKS 180
             +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D VIQYYVP LS IQIY  S  L S
Sbjct: 121 YNKLVPYFVLGDIWDSFAEWSAYGTGVPLVLNNNKDRVIQYYVPSLSAIQIYAHSHALDS 180

Query: 181 SAKLRQPGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQ 240
           S K R+PG+ SDSDFRDSSSD SSDS++ER                 +S R+D ISLRDQ
Sbjct: 181 SLKSRRPGDSSDSDFRDSSSDVSSDSDSER-----------------VSARVDCISLRDQ 240

Query: 241 LIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRT 300
               QED SSD+ E    QG+L+FE+LERDLPY REP ADK L      DLA QFPEL T
Sbjct: 241 ---HQEDSSSDDGEPLGSQGRLMFEYLERDLPYIREPFADKVL------DLAAQFPELMT 300

Query: 301 LRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPS 360
           LRSCDLL SSWFSVAWYPIYRIPTGPTLKDLDACFLT+H L T  GG  S Q   LT P 
Sbjct: 301 LRSCDLLRSSWFSVAWYPIYRIPTGPTLKDLDACFLTYHSLHTSFGGEGSEQSMSLTQPR 360

Query: 361 EIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNHPDFLFF 418
           E +   KMSLPVFGLASYKFRGSLWTP G  E QL  SL Q A+ WL    V+HPDFLFF
Sbjct: 361 ESE---KMSLPVFGLASYKFRGSLWTPIGGSEHQLVNSLFQAADKWLHSCHVSHPDFLFF 369

BLAST of CmoCh04G021720 vs. TAIR 10
Match: AT1G15030.1 (Protein of unknown function (DUF789))

HSP 1 Score: 379.0 bits (972), Expect = 4.8e-105
Identity = 224/389 (57.58%), Positives = 269/389 (69.15%), Query Frame = 0

Query: 34  QLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACEELPKSISMSAFEPVVSSLSNLQRF 93
           QL+RAQ DVS G          SS T++ E+G A   L   +S        +S SN++RF
Sbjct: 10  QLQRAQIDVSYG--------CRSSHTKDRENGSAL--LKHHVS-------EASSSNVERF 69

Query: 94  LQSIAPSVPAQYLSKTTVKGWRTCDGEFQ-PYFVLGDLWESFKEWSAYGAGVPLVLNDS- 153
           L S+ PSVPA YLSKT V+     D E Q PYF+LGD+WESF EWSAYG GVPL LN++ 
Sbjct: 70  LDSVTPSVPAHYLSKTIVRERGGSDVESQVPYFLLGDVWESFAEWSAYGIGVPLTLNNNK 129

Query: 154 DSVIQYYVPYLSGIQIYG--ESLKSSAKLRQPGEDSDSDFRDSSSDGSSDSEAERALKYM 213
           D V QYYVP LSGIQ+Y   ++L SS + R+ GE+S+SDFRDSSS+GSS SE+ER L Y 
Sbjct: 130 DRVFQYYVPSLSGIQVYADVDALTSSLQARRQGEESESDFRDSSSEGSS-SESERGLCYS 189

Query: 214 EKQLNHHNLSSEISRRMDRISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSR 273
           ++Q         IS RMD++SLR +    QED SSD+ E  + QG+L+FE+LERDLPY R
Sbjct: 190 KEQ---------ISARMDKLSLRKE---HQEDSSSDDGEPLSSQGRLIFEYLERDLPYVR 249

Query: 274 EPLADKALYFLQISDLAFQFPELRTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACF 333
           EP ADK      +SDLA +FPEL+TLRSCDLLPSSWFSVAWYPIY+IPTGPTLKDLDACF
Sbjct: 250 EPFADK------MSDLASRFPELKTLRSCDLLPSSWFSVAWYPIYKIPTGPTLKDLDACF 309

Query: 334 LTFHYLSTPMGGGRSVQGPV-LTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQ 393
           LT+H L TP  G     G + +  P E   + KM LPVFGLASYK RGS+WT  G    Q
Sbjct: 310 LTYHSLHTPFQGPGVTTGSMHVVQPRE--SVEKMELPVFGLASYKLRGSVWTSFGGSGHQ 360

Query: 394 LAKSLLQDAEDWLRQRQVNHPDFLFFRRR 418
           LA SL Q A++WLR RQVNHPDF+FF RR
Sbjct: 370 LANSLFQAADNWLRLRQVNHPDFIFFCRR 360

BLAST of CmoCh04G021720 vs. TAIR 10
Match: AT2G01260.2 (Protein of unknown function (DUF789))

HSP 1 Score: 329.3 bits (843), Expect = 4.4e-90
Identity = 201/347 (57.93%), Positives = 223/347 (64.27%), Query Frame = 0

Query: 1   MLGAGLQFGRG-CGDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVT 60
           MLGAG Q  RG  GDD FY S K RR NQ  + DQLRRAQSDVS   S    P       
Sbjct: 1   MLGAGFQLTRGRHGDDPFYTSAKTRRANQ--RIDQLRRAQSDVSNVPSSAPSP------- 60

Query: 61  RETESGDACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCD- 120
                                EP   S SNL RFL+S+ PSVPAQ+LSKT ++  R  D 
Sbjct: 61  ----------------HKQQLEPSDLSSSNLDRFLESVTPSVPAQFLSKTLLRERRADDD 120

Query: 121 -GEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVIQYYVPYLSGIQIYGES--LKS 180
             +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D VIQYYVP LS IQIY  S  L S
Sbjct: 121 YNKLVPYFVLGDIWDSFAEWSAYGTGVPLVLNNNKDRVIQYYVPSLSAIQIYAHSHALDS 180

Query: 181 SAKLRQPGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQ 240
           S K R+PG+ SDSDFRDSSSD SSDS++ER                 +S R+D ISLRDQ
Sbjct: 181 SLKSRRPGDSSDSDFRDSSSDVSSDSDSER-----------------VSARVDCISLRDQ 240

Query: 241 LIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRT 300
               QED SSD+ E    QG+L+FE+LERDLPY REP ADK L      DLA QFPEL T
Sbjct: 241 ---HQEDSSSDDGEPLGSQGRLMFEYLERDLPYIREPFADKVL------DLAAQFPELMT 296

Query: 301 LRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGG 342
           LRSCDLL SSWFSVAWYPIYRIPTGPTLKDLDACFLT+H L T  GG
Sbjct: 301 LRSCDLLRSSWFSVAWYPIYRIPTGPTLKDLDACFLTYHSLHTSFGG 296

BLAST of CmoCh04G021720 vs. TAIR 10
Match: AT4G16100.1 (Protein of unknown function (DUF789))

HSP 1 Score: 305.1 bits (780), Expect = 8.9e-83
Identity = 183/410 (44.63%), Positives = 246/410 (60.00%), Query Frame = 0

Query: 13  GDDRFYNSTKARRVNQGRQNDQLRRAQSDVSAGQSPVVKPTTVSSVTRETESGDACE--- 72
           G++RFYN    R++ Q R+  +L   + +    ++  +    +    +E +  + C    
Sbjct: 9   GENRFYNPPPMRKLQQEREKKRLEAEEIEKEKKKAKEILDRKIKVEEKEIKQPEECSTSD 68

Query: 73  -ELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGWRTCDGEFQPYFVLG 132
             +P  +S +      +S SNL RFL    P V  Q+L  T+ KGWRT + E++PYF+L 
Sbjct: 69  CSVPSRVSSTTTTTGTTS-SNLGRFLDCTTPIVSTQHLPLTSSKGWRTREPEYRPYFLLN 128

Query: 133 DLWESFKEWSAYGAGVPLVLNDSDSVIQYYVPYLSGIQIYGESLKSSAKLRQPGEDSDSD 192
           DLW+SF+EWSAYG GVPL+LN  DSV+QYYVPYLSGIQ+Y +  ++    R+ GE+SD D
Sbjct: 129 DLWDSFEEWSAYGVGVPLLLNGIDSVVQYYVPYLSGIQLYEDPSRACTTRRRVGEESDGD 188

Query: 193 F-RDSSSDGSSDSEAERALKYMEKQLNHHNLSSEISRRMDRISLRDQ-LIGLQEDCSSDE 252
             RD SSDGS+D                     E+S+ + R SL ++  IG     SSDE
Sbjct: 189 SPRDMSSDGSNDCR-------------------ELSQNLYRASLEEKPCIG----SSSDE 248

Query: 253 AE-SPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQFPELRTLRSCDLLPSSW 312
           +E S N  G+L+FE+LE  +P+ REPL DK      IS+L+ QFP LRT RSCDL PSSW
Sbjct: 249 SEASSNSPGELVFEYLEGAMPFGREPLTDK------ISNLSSQFPALRTYRSCDLSPSSW 308

Query: 313 FSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGPVLTYPSEIDGIPKMSLP 372
            SVAWYPIYRIP G +L++LDACFLTFH LSTP  G  + +G      S+     K+ LP
Sbjct: 309 VSVAWYPIYRIPLGQSLQNLDACFLTFHSLSTPCRGTSNEEG---QSSSKSVASAKLPLP 368

Query: 373 VFGLASYKFRGSLWTPNGRF-EWQLAKSLLQDAEDWLRQRQVNHPDFLFF 415
            FGLASYKF+ S W+P     E Q   +LL+ AE+WLR+ +V  PDF  F
Sbjct: 369 TFGLASYKFKLSEWSPESDVDENQRVGTLLRTAEEWLRRLKVILPDFRHF 385

BLAST of CmoCh04G021720 vs. TAIR 10
Match: AT5G49220.1 (Protein of unknown function (DUF789) )

HSP 1 Score: 282.3 bits (721), Expect = 6.2e-76
Identity = 192/426 (45.07%), Positives = 241/426 (56.57%), Query Frame = 0

Query: 13  GDDRFYNSTKARRVNQGRQ-NDQLRRAQ---------SDVSAGQSPVVKPTTV------- 72
           G++RFYN    RR+ Q  Q   Q+R  Q          D    ++  V P T        
Sbjct: 16  GENRFYNPPPMRRMQQEAQLQQQIREKQRRDDEDEVLMDKERRKAATVAPRTTRKGLGVS 75

Query: 73  SSVTRETESG-DACEELPKSISMSAFEPVVSSLSNLQRFLQSIAPSVPAQYLSKTTVKGW 132
            S +R   SG + C     S S S    V+S  SNL RFL+   P VPA+     +    
Sbjct: 76  ESKSRVVVSGSEVC--AGSSDSSSGSGRVLSDGSNLDRFLEHTTPVVPARLFPMRSRWEL 135

Query: 133 RTCDGEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVIQYYVPYLSGIQIYG 192
           +T + +   YFVL DLWESF EWSAYGAGV     PL ++ +DS +QYYVPYLSGIQ+Y 
Sbjct: 136 KTRESDCHTYFVLEDLWESFAEWSAYGAGVPLEMHPLEMHGNDSTVQYYVPYLSGIQLYV 195

Query: 193 ESLKSSAKLRQPGEDSDSDFRDSSSDGSSDSEAERALKYMEKQLNHHNLSSEIS-RRMDR 252
           + LK   K R P  D+     + SS+GSS               N   L  ++S   ++R
Sbjct: 196 DPLK---KPRNPVGDN-----EGSSEGSS---------------NSRTLPVDLSVGELNR 255

Query: 253 ISLRDQLIGLQEDCSSDEAESPNPQGQLLFEHLERDLPYSREPLADKALYFLQISDLAFQ 312
           ISL+DQ   +    SS EAE  NPQG+LLFE+LE + P+ REPLA+K      ISDLA +
Sbjct: 256 ISLKDQ--SITGSLSSGEAEISNPQGRLLFEYLEYEPPFGREPLANK------ISDLASR 315

Query: 313 FPELRTLRSCDLLPSSWFSVAWYPIYRIPTGPTLKDLDACFLTFHYLSTPMGGGRSVQGP 372
            PEL T RSCDLLPSSW SV+WYPIYRIP GPTL++LDACFLTFH LST     +S  G 
Sbjct: 316 VPELMTYRSCDLLPSSWVSVSWYPIYRIPVGPTLQNLDACFLTFHSLST--APPQSAMGC 375

Query: 373 VLTYPSEIDGIPKMSLPVFGLASYKFRGSLWTPNGRFEWQLAKSLLQDAEDWLRQRQVNH 415
             + PS      K+ LP FGLASYK + S+W  N   E Q   SLLQ A+ WL++ QV+H
Sbjct: 376 SDSQPS-----TKLPLPTFGLASYKLKVSVWNQNRIQESQKMTSLLQAADKWLKRLQVDH 401

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
A0A6J1HCB0  8.6e-235  98.56     uncharacterized protein LOC111462294 OS=Cucurbita moschata OX=3662 GN=LOC1114622...
A0A6J1I465  6.2e-233  97.36     uncharacterized protein LOC111470464 OS=Cucurbita maxima OX=3661 GN=LOC111470464...
A0A5D3BPU4  1.7e-211  88.97     DUF789 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676...
A0A6J1JV26  3.0e-211  87.53     uncharacterized protein LOC111489147 OS=Cucurbita maxima OX=3661 GN=LOC111489147...
A0A6J1FG46  1.9e-210  88.01     uncharacterized protein LOC111445107 OS=Cucurbita moschata OX=3662 GN=LOC1114451...

Match Name      E-value   Identity  Description
XP_022961613.1  1.8e-234  98.56     uncharacterized protein LOC111462294 [Cucurbita moschata]
KAG7032526.1    3.9e-234  92.41     hypothetical protein SDJN02_06575, partial [Cucurbita argyrosperma subsp. argyro...
XP_023554659.1  6.7e-234  98.08     uncharacterized protein LOC111811852 [Cucurbita pepo subsp. pepo] >KAG6601821.1 ...
XP_022971786.1  1.3e-232  97.36     uncharacterized protein LOC111470464 [Cucurbita maxima]
XP_038874258.1  1.0e-213  89.69     uncharacterized protein LOC120066989 isoform X1 [Benincasa hispida]

Match Name   E-value   Identity  Description
AT2G01260.1  3.9e-115  58.63     Protein of unknown function (DUF789)
AT1G15030.1  4.8e-105  57.58     Protein of unknown function (DUF789)
AT2G01260.2  4.4e-90   57.93     Protein of unknown function (DUF789)
AT4G16100.1  8.9e-83   44.63     Protein of unknown function (DUF789)
AT5G49220.1  6.2e-76   45.07     Protein of unknown function (DUF789)

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                     Source       Source Term    Source Description     Alignment
IPR008507  Protein of unknown function DUF789  PFAM         PF05623        DUF789                 coord: 88..414; e-value: 3.0E-100; score: 335.7
None       No IPR available                    MOBIDB_LITE  mobidb-lite    disorder_prediction    coord: 22..62
None       No IPR available                    MOBIDB_LITE  mobidb-lite    disorder_prediction    coord: 177..202
None       No IPR available                    MOBIDB_LITE  mobidb-lite    disorder_prediction    coord: 20..63
None       No IPR available                    PANTHER      PTHR31343      T15D22.8               coord: 1..415
None       No IPR available                    PANTHER      PTHR31343:SF5  DUF789 FAMILY PROTEIN  coord: 1..415

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G021720.1  CmoCh04G021720.1  mRNA