CmoCh16G011720 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh16G011720
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: WW domain-binding protein 11
Location: Cmo_Chr16: 8350872 .. 8352590 (+)
RNA-Seq Expression: CmoCh16G011720
Synteny: CmoCh16G011720
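The Location field can be turned into a feature length with a small amount of parsing. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re


def parse_location(text):
    """Split a location such as 'Cmo_Chr16: 8350872 .. 8352590 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1


seqid, start, end, strand = parse_location("Cmo_Chr16: 8350872 .. 8352590 (+)")
length = span_length(start, end)
codons, remainder = divmod(length, 3)
print(seqid, strand, length, codons, remainder)
```

Under that assumption the span is 1719 bp, which divides evenly into 573 codons. This is consistent with the 572-residue query length reported in the BLAST alignments plus one stop codon.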
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGAATCGGACGTTCCCGCCAAACCTCCAAATCTGCCGCCAAGGAAAGATCGGGCTACTCCAAGTAAGTTCAATAGCCATATCTTGTATAAAATTTTAATGGCGATTTTCTTTCTAGTGATTCTCCCTCTAGTCCCTTCCCAAGCCCCTGAGTTTGTTAATCAAACTTTACTCACTAGAACCTGGGAGCTTCTGCACCTTCTTTTCGTCGGAATCGCTGTTTCTTACGGCCTTTTTAGCCGGAGAAATGATGAGAAAGAGGATGAAATTAGTGTCTCTAATTTTGATAATGTTCAGTCTTATGTTTCTGGTTTGCTTCATGTTTCGTCTGTTTTTGATGATGAGGCTGAAACTCCATCTGCTAATGATGAATCCATGTCTTTGTCTGATGGAAATAAGGTCCAAACATGGAGTAATAGGTATTTTAGGAATGAGTCTGTGGCTGTTTCTGAAGAAAGTCCTGTTGTTAATGAGCAGAGAGTTAGGAGTGAGAAGCCTCTGCTTCTTCCTGTTCGTAGCTTGAAGTCTCGAGTTGTTGTAGATGATGAGTCTAGAACTGTTTCTGGATCTACGTCGAGAGTGAGTTCGAGAAGATTATTGAGCGATTCGAAGAGGAGTTCGAATGGGGAGGTTGGGGGAGTGAATCTTGGAGGAGTTGAGGATAATTTCAATGAAAATGTCGCTCTTCCATCCCCGGTTCCATGGCGATCGAGATCGGGGAGGACGGAGGTGCAAGAAGAAGCTGATAATCCTCCTATGTATTCCCCTGCTGTTCCCATGGAGGAATCTGAATCGAATTGGATCGATTCTAGGTCTTCTAGGCCTCAAACTTCAAGGTCCTCCCAAGCCAGTGCCATTAAGCTATCTCCTCCTTCTCCTTCTCCTTCTCCTTCTCCATCTCCAAGGAAGCCATCTCCTTCGCCCAATGTGTCGCCAGAATTAAAGGCCAAGAGTTCTGAGGGTTCGGTAAGGAAGAAGAGCTTCTTCCCGTCTCCTCCGCCTCCGCCGCCACCTCCACCCCCGCCACATGTTCGAAGAATTGCCTCAATGAAACCAAGCTCTTGGTTGAACGACAATGATGTACCTCATCAAAAGGATTTGAAGAGAAGCGTCACTACTAGCAAGCCCAGAAGCTCAATTCGTGCTACAGGAGATGACATTGATATGGTGATGGGTACTAACTCAAGTGCTGAAGCACTGCCAAGAAATTATGATGATAGTTTATCAATGGGGAAATCTACTAGAAAAATCAGACCTGGAGAAGTTGCGAATGAGCCACCAAGAAGAGGAAGAGAATTTGGTGGATATGATCAGTTGAAGGGGAAGATGATAGATCAGAACGCCCATGTCCAAGCTTTTGAAGAAAACCCCATTGAGTTTCCAAATGACAATAAAAAAGAACTGGTCGAAAAGCTCAGCATGGAGACCGACGACGACATGGAAAGCAAGGAAGAAGACAATAACATGGTGGGAAAGTTTATTAGGGAAGACAATGGAGAACCCTTTAATGTGAACCGTAGAGACAACGAAAGAAGCTCGAGTAATGAATTAGAAGCAGGAAGCTCTAGCAATCTGAGCAATGATGGAGGACCAGATGTAGATAAGAAGGCTGATGAGTTCATTGCCAAATTCAGAGAGCAAATCAGGCTTCAAAGGATTGAATCAATCAAACGATCAACTGGACAAATTCGTAGAAACACTTCAAAGCAAACTTGA

mRNA sequence

ATGGCGGAATCGGACGTTCCCGCCAAACCTCCAAATCTGCCGCCAAGGAAAGATCGGGCTACTCCAAGTAAGTTCAATAGCCATATCTTGTATAAAATTTTAATGGCGATTTTCTTTCTAGTGATTCTCCCTCTAGTCCCTTCCCAAGCCCCTGAGTTTGTTAATCAAACTTTACTCACTAGAACCTGGGAGCTTCTGCACCTTCTTTTCGTCGGAATCGCTGTTTCTTACGGCCTTTTTAGCCGGAGAAATGATGAGAAAGAGGATGAAATTAGTGTCTCTAATTTTGATAATGTTCAGTCTTATGTTTCTGGTTTGCTTCATGTTTCGTCTGTTTTTGATGATGAGGCTGAAACTCCATCTGCTAATGATGAATCCATGTCTTTGTCTGATGGAAATAAGGTCCAAACATGGAGTAATAGGTATTTTAGGAATGAGTCTGTGGCTGTTTCTGAAGAAAGTCCTGTTGTTAATGAGCAGAGAGTTAGGAGTGAGAAGCCTCTGCTTCTTCCTGTTCGTAGCTTGAAGTCTCGAGTTGTTGTAGATGATGAGTCTAGAACTGTTTCTGGATCTACGTCGAGAGTGAGTTCGAGAAGATTATTGAGCGATTCGAAGAGGAGTTCGAATGGGGAGGTTGGGGGAGTGAATCTTGGAGGAGTTGAGGATAATTTCAATGAAAATGTCGCTCTTCCATCCCCGGTTCCATGGCGATCGAGATCGGGGAGGACGGAGGTGCAAGAAGAAGCTGATAATCCTCCTATGTATTCCCCTGCTGTTCCCATGGAGGAATCTGAATCGAATTGGATCGATTCTAGGTCTTCTAGGCCTCAAACTTCAAGGTCCTCCCAAGCCAGTGCCATTAAGCTATCTCCTCCTTCTCCTTCTCCTTCTCCTTCTCCATCTCCAAGGAAGCCATCTCCTTCGCCCAATGTGTCGCCAGAATTAAAGGCCAAGAGTTCTGAGGGTTCGGTAAGGAAGAAGAGCTTCTTCCCGTCTCCTCCGCCTCCGCCGCCACCTCCACCCCCGCCACATGTTCGAAGAATTGCCTCAATGAAACCAAGCTCTTGGTTGAACGACAATGATGTACCTCATCAAAAGGATTTGAAGAGAAGCGTCACTACTAGCAAGCCCAGAAGCTCAATTCGTGCTACAGGAGATGACATTGATATGGTGATGGGTACTAACTCAAGTGCTGAAGCACTGCCAAGAAATTATGATGATAGTTTATCAATGGGGAAATCTACTAGAAAAATCAGACCTGGAGAAGTTGCGAATGAGCCACCAAGAAGAGGAAGAGAATTTGGTGGATATGATCAGTTGAAGGGGAAGATGATAGATCAGAACGCCCATGTCCAAGCTTTTGAAGAAAACCCCATTGAGTTTCCAAATGACAATAAAAAAGAACTGGTCGAAAAGCTCAGCATGGAGACCGACGACGACATGGAAAGCAAGGAAGAAGACAATAACATGGTGGGAAAGTTTATTAGGGAAGACAATGGAGAACCCTTTAATGTGAACCGTAGAGACAACGAAAGAAGCTCGAGTAATGAATTAGAAGCAGGAAGCTCTAGCAATCTGAGCAATGATGGAGGACCAGATGTAGATAAGAAGGCTGATGAGTTCATTGCCAAATTCAGAGAGCAAATCAGGCTTCAAAGGATTGAATCAATCAAACGATCAACTGGACAAATTCGTAGAAACACTTCAAAGCAAACTTGA

Coding sequence (CDS)

ATGGCGGAATCGGACGTTCCCGCCAAACCTCCAAATCTGCCGCCAAGGAAAGATCGGGCTACTCCAAGTAAGTTCAATAGCCATATCTTGTATAAAATTTTAATGGCGATTTTCTTTCTAGTGATTCTCCCTCTAGTCCCTTCCCAAGCCCCTGAGTTTGTTAATCAAACTTTACTCACTAGAACCTGGGAGCTTCTGCACCTTCTTTTCGTCGGAATCGCTGTTTCTTACGGCCTTTTTAGCCGGAGAAATGATGAGAAAGAGGATGAAATTAGTGTCTCTAATTTTGATAATGTTCAGTCTTATGTTTCTGGTTTGCTTCATGTTTCGTCTGTTTTTGATGATGAGGCTGAAACTCCATCTGCTAATGATGAATCCATGTCTTTGTCTGATGGAAATAAGGTCCAAACATGGAGTAATAGGTATTTTAGGAATGAGTCTGTGGCTGTTTCTGAAGAAAGTCCTGTTGTTAATGAGCAGAGAGTTAGGAGTGAGAAGCCTCTGCTTCTTCCTGTTCGTAGCTTGAAGTCTCGAGTTGTTGTAGATGATGAGTCTAGAACTGTTTCTGGATCTACGTCGAGAGTGAGTTCGAGAAGATTATTGAGCGATTCGAAGAGGAGTTCGAATGGGGAGGTTGGGGGAGTGAATCTTGGAGGAGTTGAGGATAATTTCAATGAAAATGTCGCTCTTCCATCCCCGGTTCCATGGCGATCGAGATCGGGGAGGACGGAGGTGCAAGAAGAAGCTGATAATCCTCCTATGTATTCCCCTGCTGTTCCCATGGAGGAATCTGAATCGAATTGGATCGATTCTAGGTCTTCTAGGCCTCAAACTTCAAGGTCCTCCCAAGCCAGTGCCATTAAGCTATCTCCTCCTTCTCCTTCTCCTTCTCCTTCTCCATCTCCAAGGAAGCCATCTCCTTCGCCCAATGTGTCGCCAGAATTAAAGGCCAAGAGTTCTGAGGGTTCGGTAAGGAAGAAGAGCTTCTTCCCGTCTCCTCCGCCTCCGCCGCCACCTCCACCCCCGCCACATGTTCGAAGAATTGCCTCAATGAAACCAAGCTCTTGGTTGAACGACAATGATGTACCTCATCAAAAGGATTTGAAGAGAAGCGTCACTACTAGCAAGCCCAGAAGCTCAATTCGTGCTACAGGAGATGACATTGATATGGTGATGGGTACTAACTCAAGTGCTGAAGCACTGCCAAGAAATTATGATGATAGTTTATCAATGGGGAAATCTACTAGAAAAATCAGACCTGGAGAAGTTGCGAATGAGCCACCAAGAAGAGGAAGAGAATTTGGTGGATATGATCAGTTGAAGGGGAAGATGATAGATCAGAACGCCCATGTCCAAGCTTTTGAAGAAAACCCCATTGAGTTTCCAAATGACAATAAAAAAGAACTGGTCGAAAAGCTCAGCATGGAGACCGACGACGACATGGAAAGCAAGGAAGAAGACAATAACATGGTGGGAAAGTTTATTAGGGAAGACAATGGAGAACCCTTTAATGTGAACCGTAGAGACAACGAAAGAAGCTCGAGTAATGAATTAGAAGCAGGAAGCTCTAGCAATCTGAGCAATGATGGAGGACCAGATGTAGATAAGAAGGCTGATGAGTTCATTGCCAAATTCAGAGAGCAAATCAGGCTTCAAAGGATTGAATCAATCAAACGATCAACTGGACAAATTCGTAGAAACACTTCAAAGCAAACTTGA

Protein sequence

MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLTRTWELLHLLFVGIAVSYGLFSRRNDEKEDEISVSNFDNVQSYVSGLLHVSSVFDDEAETPSANDESMSLSDGNKVQTWSNRYFRNESVAVSEESPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDNFNENVALPSPVPWRSRSGRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSSQASAIKLSPPSPSPSPSPSPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSSWLNDNDVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEALPRNYDDSLSMGKSTRKIRPGEVANEPPRRGREFGGYDQLKGKMIDQNAHVQAFEENPIEFPNDNKKELVEKLSMETDDDMESKEEDNNMVGKFIREDNGEPFNVNRRDNERSSSNELEAGSSSNLSNDGGPDVDKKADEFIAKFREQIRLQRIESIKRSTGQIRRNTSKQT
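
The relationship between the coding sequence and the protein sequence can be checked by translating codons with the standard genetic code. The following is a minimal sketch, not the method used by the database: the `translate` helper and the choice of the standard code (NCBI translation table 1) are assumptions, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 12 codons and last 4 codons of the CDS listed above.
print(translate("ATGGCGGAATCGGACGTTCCCGCCAAACCTCCAAAT"))
print(translate("AAGCAAACTTGA"))
```

The two fragments translate to MAESDVPAKPPN and KQT, which match the start and the end of the protein sequence shown above.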
Homology
BLAST of CmoCh16G011720 vs. ExPASy TrEMBL
Match: A0A6J1E6G0 (uncharacterized protein DDB_G0284459 OS=Cucurbita moschata OX=3662 GN=LOC111431041 PE=4 SV=1)

HSP 1 Score: 1079.3 bits (2790), Expect = 0.0e+00
Identity = 572/572 (100.00%), Positives = 572/572 (100.00%), Query Frame = 0

Query: 1   MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60
           MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT
Sbjct: 1   MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60

Query: 61  RTWELLHLLFVGIAVSYGLFSRRNDEKEDEISVSNFDNVQSYVSGLLHVSSVFDDEAETP 120
           RTWELLHLLFVGIAVSYGLFSRRNDEKEDEISVSNFDNVQSYVSGLLHVSSVFDDEAETP
Sbjct: 61  RTWELLHLLFVGIAVSYGLFSRRNDEKEDEISVSNFDNVQSYVSGLLHVSSVFDDEAETP 120

Query: 121 SANDESMSLSDGNKVQTWSNRYFRNESVAVSEESPVVNEQRVRSEKPLLLPVRSLKSRVV 180
           SANDESMSLSDGNKVQTWSNRYFRNESVAVSEESPVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESMSLSDGNKVQTWSNRYFRNESVAVSEESPVVNEQRVRSEKPLLLPVRSLKSRVV 180

Query: 181 VDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDNFNENVALPSPVPWRSRS 240
           VDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDNFNENVALPSPVPWRSRS
Sbjct: 181 VDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDNFNENVALPSPVPWRSRS 240

Query: 241 GRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSSQASAIKLSPPSPSPSPSP 300
           GRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSSQASAIKLSPPSPSPSPSP
Sbjct: 241 GRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSSQASAIKLSPPSPSPSPSP 300

Query: 301 SPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSSWLNDN 360
           SPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSSWLNDN
Sbjct: 301 SPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSSWLNDN 360

Query: 361 DVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEALPRNYDDSLSMGKSTRKIRP 420
           DVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEALPRNYDDSLSMGKSTRKIRP
Sbjct: 361 DVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEALPRNYDDSLSMGKSTRKIRP 420

Query: 421 GEVANEPPRRGREFGGYDQLKGKMIDQNAHVQAFEENPIEFPNDNKKELVEKLSMETDDD 480
           GEVANEPPRRGREFGGYDQLKGKMIDQNAHVQAFEENPIEFPNDNKKELVEKLSMETDDD
Sbjct: 421 GEVANEPPRRGREFGGYDQLKGKMIDQNAHVQAFEENPIEFPNDNKKELVEKLSMETDDD 480

Query: 481 MESKEEDNNMVGKFIREDNGEPFNVNRRDNERSSSNELEAGSSSNLSNDGGPDVDKKADE 540
           MESKEEDNNMVGKFIREDNGEPFNVNRRDNERSSSNELEAGSSSNLSNDGGPDVDKKADE
Sbjct: 481 MESKEEDNNMVGKFIREDNGEPFNVNRRDNERSSSNELEAGSSSNLSNDGGPDVDKKADE 540

Query: 541 FIAKFREQIRLQRIESIKRSTGQIRRNTSKQT 573
           FIAKFREQIRLQRIESIKRSTGQIRRNTSKQT
Sbjct: 541 FIAKFREQIRLQRIESIKRSTGQIRRNTSKQT 572

BLAST of CmoCh16G011720 vs. ExPASy TrEMBL
Match: A0A6J1L1K4 (uncharacterized protein DDB_G0284459-like OS=Cucurbita maxima OX=3661 GN=LOC111500278 PE=4 SV=1)

HSP 1 Score: 979.2 bits (2530), Expect = 7.5e-282
Identity = 530/574 (92.33%), Positives = 543/574 (94.60%), Query Frame = 0

Query: 1   MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60
           MAESDVPAKPPNLPP KD+ATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT
Sbjct: 1   MAESDVPAKPPNLPPGKDQATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60

Query: 61  RTWELLHLLFVGIAVSYGLFSRRNDEKEDEISVSNFDNVQSYVSGLLHVSSVFDDEAETP 120
           RTWELLHLLFVGIAVSYGLFSRRNDEKED ISVSNFDNVQSYVSGLLHVSSVFDDEAETP
Sbjct: 61  RTWELLHLLFVGIAVSYGLFSRRNDEKEDGISVSNFDNVQSYVSGLLHVSSVFDDEAETP 120

Query: 121 SANDESMSLSDGNKVQTWSNRYFRNESVAVSEESPVVNEQRVRSEKPLLLPVRSLKSRVV 180
           SANDESMS SDGNKVQTWSNRYFRNES+ V+EESPVVNEQRVRSEKPLLLPVRSL S+VV
Sbjct: 121 SANDESMSSSDGNKVQTWSNRYFRNESLVVAEESPVVNEQRVRSEKPLLLPVRSLNSQVV 180

Query: 181 VDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDNFNENVALPSPVPWRSRS 240
           VDDESRTVSGSTSRVSS RLLS+SKRSSNGE GG++L G+EDN NENV LPSPVPWRSRS
Sbjct: 181 VDDESRTVSGSTSRVSSGRLLSNSKRSSNGEFGGLSLEGIEDNLNENVVLPSPVPWRSRS 240

Query: 241 GRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSSQASAIKLSPPSPSPSPSP 300
           GRTEVQEEADNPP+YSPAVPMEESESNWIDSRSSRPQTSRS QASAIKLSP    PSPSP
Sbjct: 241 GRTEVQEEADNPPVYSPAVPMEESESNWIDSRSSRPQTSRSFQASAIKLSP----PSPSP 300

Query: 301 SPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSSWLNDN 360
            PRKPSPSPNVSPELKAKSSE SVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSS LNDN
Sbjct: 301 FPRKPSPSPNVSPELKAKSSEDSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSSLLNDN 360

Query: 361 DVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEALPRNYDDSLSMGKSTRKIRP 420
           DVPHQKDLKRSVTTSKPR SIR TGDDIDMVMGTNSSAEALPRNYDD LSMGKS RKIRP
Sbjct: 361 DVPHQKDLKRSVTTSKPRRSIRDTGDDIDMVMGTNSSAEALPRNYDDILSMGKSIRKIRP 420

Query: 421 GEVANEPPRRGREFGGYDQLKGKMIDQNAHVQAFEENPIEFPNDNKKELVEKLSMET--D 480
           GEVANEP RRGREFGG DQLKGKMIDQN HVQAFEENPIEFP+D+KKE VEKL MET  D
Sbjct: 421 GEVANEPTRRGREFGGNDQLKGKMIDQNTHVQAFEENPIEFPDDDKKEPVEKLGMETDDD 480

Query: 481 DDMESKEEDNNMVGKFIREDNGEPFNVNRRDNERSSSNELEAGSSSNLSNDGGPDVDKKA 540
           DDMES+EEDNNMVGKFIREDNGEPFNVNRRDNERSSSNE EAG SSNLSNDGGPDVDKKA
Sbjct: 481 DDMESEEEDNNMVGKFIREDNGEPFNVNRRDNERSSSNE-EAGGSSNLSNDGGPDVDKKA 540

Query: 541 DEFIAKFREQIRLQRIESIKRSTGQIRRNTSKQT 573
           DEFIAKFREQIRLQRIESIKRSTGQIRRNTSKQ+
Sbjct: 541 DEFIAKFREQIRLQRIESIKRSTGQIRRNTSKQS 569

BLAST of CmoCh16G011720 vs. ExPASy TrEMBL
Match: A0A6J1DZG8 (WW domain-binding protein 11 OS=Momordica charantia OX=3673 GN=LOC111024943 PE=4 SV=1)

HSP 1 Score: 781.9 bits (2018), Expect = 1.8e-222
Identity = 438/579 (75.65%), Positives = 491/579 (84.80%), Query Frame = 0

Query: 1   MAESDVPAKPPNLPPRKDR--ATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTL 60
           MAE+++ AK P L  R+ +  A PSKF+SH+LYK+L AIFFLVILPLVPS+APEF+NQTL
Sbjct: 1   MAETEIRAKSPTLALRQSQAEACPSKFHSHVLYKVLTAIFFLVILPLVPSRAPEFINQTL 60

Query: 61  LTRTWELLHLLFVGIAVSYGLFSRRNDEKEDEISVSNFDNVQSYVSGLLHVSSVFDDEAE 120
           LTR+WELLHLLFVGIAVSYGLFSRRN+EKE+E+S S FDNVQSYVSGLLHVSSVFDDE E
Sbjct: 61  LTRSWELLHLLFVGIAVSYGLFSRRNEEKENEVSGSKFDNVQSYVSGLLHVSSVFDDEPE 120

Query: 121 TPSANDESMSLSDGNKVQTWSNRYFRNESVAVSEESPVVNEQRVRSEKPLLLPVRSLKSR 180
           TPSANDES+S SD +KVQTWS+RYFRNESV V+EE P VNEQRVRSEKPLLLPVRSLKSR
Sbjct: 121 TPSANDESLSSSDESKVQTWSSRYFRNESVVVAEERPAVNEQRVRSEKPLLLPVRSLKSR 180

Query: 181 VVVD----DESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDNFNENVALPSPV 240
           VV D    DESR VSGS  R SSRRLLS SKRS+ GE GGVNL  +ED  NENV L SPV
Sbjct: 181 VVADDDLLDESRAVSGSKPRASSRRLLSKSKRSTEGEFGGVNLEEMEDKLNENVVLRSPV 240

Query: 241 PWRSRSGRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSSQASAIKLSPPSP 300
           PWRSRSGR E+QEEADNPPMYSP   MEESESNWIDSRSSRPQTSRS++A+AI     SP
Sbjct: 241 PWRSRSGRMEMQEEADNPPMYSPVAAMEESESNWIDSRSSRPQTSRSTRANAIG-QKLSP 300

Query: 301 SPSPSPSPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPS 360
           SPSPSP+P+KPSP P VSPEL+ K +E  VRKKSF+ SPPPPPPPPPPP VRRI+SMK S
Sbjct: 301 SPSPSPTPKKPSPPPTVSPELQGKGAEDFVRKKSFYRSPPPPPPPPPPPRVRRISSMKQS 360

Query: 361 SWLNDNDVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSS-AEALPRNYDDSLSMGK 420
           SWLNDNDVPHQKDL+RS  TSKPRSSIR TGDDIDM++G NSS     PRNY DS SMGK
Sbjct: 361 SWLNDNDVPHQKDLRRSF-TSKPRSSIRDTGDDIDMMVGPNSSVVNEPPRNYVDSQSMGK 420

Query: 421 STRKIRPGEVANEPPRRGREFGGYDQLKGKMIDQNAHVQAFEENPIEFPNDNKKELVEKL 480
           S R IRPGE+ NEPPRRGRE GG ++LKG+M  QN HVQ FEENPIEFP++ K+ELVEKL
Sbjct: 421 SVRTIRPGELVNEPPRRGRELGG-NELKGRMDHQNVHVQDFEENPIEFPDEEKEELVEKL 480

Query: 481 SMETDDDMESKEEDNNMVGKFIREDNGEPFNVNRRDNERSSSNELEAGSSSNLSNDGGPD 540
            METDDDME++EE++ M  +FIR+ NG  +   R+DNERSSSNE EAGSSS ++ DGGPD
Sbjct: 481 DMETDDDMETEEEEDTMATEFIRDKNGGTYTETRKDNERSSSNE-EAGSSS-MAGDGGPD 540

Query: 541 VDKKADEFIAKFREQIRLQRIESIKRSTGQIRRNTSKQT 573
           VDKKADEFIAKFREQIRLQRIESIKRS+GQIRRN+S+QT
Sbjct: 541 VDKKADEFIAKFREQIRLQRIESIKRSSGQIRRNSSRQT 574

BLAST of CmoCh16G011720 vs. ExPASy TrEMBL
Match: A0A6J1L1Z4 (uncharacterized protein DDB_G0284459-like OS=Cucurbita maxima OX=3661 GN=LOC111498378 PE=4 SV=1)

HSP 1 Score: 763.1 bits (1969), Expect = 8.5e-217
Identity = 427/571 (74.78%), Positives = 478/571 (83.71%), Query Frame = 0

Query: 1   MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60
           MAESDV AKP NL   + + TPSKF+SHILYK+L AIFFLVILPLVPSQAPEF+NQTLLT
Sbjct: 1   MAESDVHAKPSNLAAGQTQPTPSKFHSHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60

Query: 61  RTWELLHLLFVGIAVSYGLFSRRNDEKEDEISVSNFDNVQSYVSGLLHVSSVFDDEAETP 120
           R+WELLHLLFVGIAVSYGLFSRR DE EDEISVS FDNVQSYVS LLHVSSVFDDE  TP
Sbjct: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEISVSRFDNVQSYVSRLLHVSSVFDDEPGTP 120

Query: 121 SANDESMSLSDGNKVQTWSNRYFRNESVAVSEESPVVNEQRVRSEKPLLLPVRSLKSRVV 180
           SANDES+S  D +KVQTWSNRYFRNESV V+EE PVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESVSSPDESKVQTWSNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180

Query: 181 VDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDNFNENVALPSPVPWRSRS 240
           VD+E +T+SGS  RVSSRR LS   RSSN E+            NE + LPSPVPWRSRS
Sbjct: 181 VDEEYKTISGSKRRVSSRRSLSMPMRSSNEEL------------NEKIVLPSPVPWRSRS 240

Query: 241 GRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSSQASAIKLSPPSPSPSPSP 300
              EVQEEADN P+YSPA PMEESES+WIDSRSSRP TSRSS+ASAI     +   SPSP
Sbjct: 241 EWKEVQEEADNLPLYSPAAPMEESESSWIDSRSSRPPTSRSSRASAIS----TQKLSPSP 300

Query: 301 SPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSSWLNDN 360
           SPRKPSPSP VSPEL+AKS+E  VRKK+F+ SPPPPPPPPPPP VRRI+SMKP+SWL+++
Sbjct: 301 SPRKPSPSPTVSPELQAKSAEDLVRKKNFYSSPPPPPPPPPPPTVRRISSMKPNSWLHES 360

Query: 361 DVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEALPRNYDDSLSMGKSTRKIRP 420
           DV HQKDL+RS+  SKPR SIR TGD+ID++M  NSSAE LPRNY D  SMGKS R IRP
Sbjct: 361 DVSHQKDLRRSL-ISKPRRSIRDTGDEIDLMMDANSSAEVLPRNYVDGQSMGKSVRTIRP 420

Query: 421 GEVANEPPRRGREFGGYDQLKGKMIDQNAHVQAFEENPIEFPNDNKKELVEKLSMETDDD 480
           GEV NEPPRRGREFGG DQLKGKM +QNAH Q FEENPIE+P+++K +LVEKL+ME  DD
Sbjct: 421 GEVVNEPPRRGREFGGTDQLKGKM-EQNAHAQEFEENPIEYPDEDKADLVEKLAMEAGDD 480

Query: 481 MESKEEDNNMVGKFIREDNGEPFNVNRRDNERSSSNELEAGSSSNLSNDGGPDVDKKADE 540
           ME++EE++++VG+FIREDNGEPFNV RRD   SSS E     SSN++NDGGPDVDKKADE
Sbjct: 481 MENEEEEDDVVGQFIREDNGEPFNVKRRDFNESSSEE---AGSSNMANDGGPDVDKKADE 540

Query: 541 FIAKFREQIRLQRIESIKRSTGQIRRNTSKQ 572
           FIAKFREQIRLQRIE IK+S+GQI RNTS+Q
Sbjct: 541 FIAKFREQIRLQRIELIKKSSGQIGRNTSRQ 550

BLAST of CmoCh16G011720 vs. ExPASy TrEMBL
Match: A0A6J1GC23 (MAP7 domain-containing protein 1-like OS=Cucurbita moschata OX=3662 GN=LOC111452771 PE=4 SV=1)

HSP 1 Score: 755.4 bits (1949), Expect = 1.8e-214
Identity = 427/571 (74.78%), Positives = 474/571 (83.01%), Query Frame = 0

Query: 1   MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60
           MAESDV AK  NL   +++ TPSKF+SHILYK+L AIFFLVILPLVPSQAPEF+NQTLLT
Sbjct: 1   MAESDVHAKTSNLAAEQNQPTPSKFHSHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60

Query: 61  RTWELLHLLFVGIAVSYGLFSRRNDEKEDEISVSNFDNVQSYVSGLLHVSSVFDDEAETP 120
           R+WELLHLLFVGIAVSYGLFSRR DE EDEISVS FDNVQSYVS LLHVSSVFDDE  TP
Sbjct: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEISVSRFDNVQSYVSRLLHVSSVFDDEPGTP 120

Query: 121 SANDESMSLSDGNKVQTWSNRYFRNESVAVSEESPVVNEQRVRSEKPLLLPVRSLKSRVV 180
           SANDES+S  D NKVQTWSNRYFRNESV V+EE PVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESVSSPDENKVQTWSNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180

Query: 181 VDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDNFNENVALPSPVPWRSRS 240
           VDDE +T+SGS  RVSSRR LS   RSSN E+            NE V LPSPVPWRSRS
Sbjct: 181 VDDEYKTISGSKRRVSSRRSLSMPMRSSNEEM------------NEKVVLPSPVPWRSRS 240

Query: 241 GRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSSQASAIKLSPPSPSPSPSP 300
            R EVQEEA+N PMYSPA PMEESES+WIDSRSSRP TSRSS+ASAI     +   SPSP
Sbjct: 241 ERKEVQEEAENLPMYSPAAPMEESESSWIDSRSSRPPTSRSSRASAIS----TQKLSPSP 300

Query: 301 SPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSSWLNDN 360
           SPRKPSPSP VSPEL+AKS+E  VRKK+F+ SPPPPPPPPP   VRRI+SMKP+SWL+D+
Sbjct: 301 SPRKPSPSPTVSPELQAKSAEDLVRKKNFYSSPPPPPPPPPT--VRRISSMKPNSWLHDS 360

Query: 361 DVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEALPRNYDDSLSMGKSTRKIRP 420
           DV HQ DL+RS+TT KPR SIR TGD+ID++M  NSSAE  PRNY D  SMGKS R IRP
Sbjct: 361 DVSHQNDLRRSLTT-KPRRSIRDTGDEIDLMMDANSSAEVPPRNYVDGQSMGKSVRTIRP 420

Query: 421 GEVANEPPRRGREFGGYDQLKGKMIDQNAHVQAFEENPIEFPNDNKKELVEKLSMETDDD 480
           GEV NEPPRRGREFGG DQLKGKM +QN H Q FEENPIEFP++ K+ LVEKL+ME  DD
Sbjct: 421 GEVLNEPPRRGREFGGTDQLKGKM-EQNPHAQEFEENPIEFPDEYKENLVEKLAMEAGDD 480

Query: 481 MESKEEDNNMVGKFIREDNGEPFNVNRRDNERSSSNELEAGSSSNLSNDGGPDVDKKADE 540
           ME++EE++++VG+FIREDNGEPFNV RRD   +SS E     SSN++NDGGPDVDKKADE
Sbjct: 481 MENEEEEDDVVGQFIREDNGEPFNVKRRDFNETSSEE---AGSSNMANDGGPDVDKKADE 540

Query: 541 FIAKFREQIRLQRIESIKRSTGQIRRNTSKQ 572
           FIAKFREQIRLQRIE IK+S+GQI RNTS+Q
Sbjct: 541 FIAKFREQIRLQRIELIKKSSGQIGRNTSRQ 548

BLAST of CmoCh16G011720 vs. TAIR 10
Match: AT4G16790.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 161.4 bits (407), Expect = 2.2e-39
Identity = 199/585 (34.02%), Positives = 263/585 (44.96%), Query Frame = 0

Query: 1   MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60
           M E+    KP  L  ++D+  P KF S  ++K L+      ++P+  SQ PE  NQ   T
Sbjct: 1   MVEARSLKKPIQLGNKEDQ-NPRKFYSRFIFKALILTVLCAVVPVFLSQTPELANQ---T 60

Query: 61  RTWELLHLLFVGIAVSYGLFSRRNDEKEDEISVSNFD---------NVQSYVSGLLHVSS 120
           R  ELLHL+FVGIAVSYGLFSRRN +       SN D         N  SYV  +L VSS
Sbjct: 61  RLLELLHLVFVGIAVSYGLFSRRNYDGGGGGGTSNSDHNKADHSNNNSHSYVPKILEVSS 120

Query: 121 VFD--DEAETPSANDESMSLSDGNKVQTWSNRY-FRNESVAVSEESPVVNEQRVRSEKPL 180
           VF+   E+E+  ++D S    D  K QTW N+Y  +   V       V +E R   EKPL
Sbjct: 121 VFNVGHESESEPSDDSS---GDQRKFQTWKNKYHMKIPEVETRFVDRVSSENR---EKPL 180

Query: 181 LLPVRSLK-SRVVVDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDNFNEN 240
           LLPVRSL  SR  V D S   SG   +V S+R L  +    N +V               
Sbjct: 181 LLPVRSLNYSR--VSDSSGDNSGRWEKVRSKRELLKTLGDDNSDV--------------- 240

Query: 241 VALPSPVPWRSRSGRTEVQEEADNPPMYSPAVPMEESESNWIDSRSS-RPQTSRSSQASA 300
             LPSP+PWRSRS  +                    S S  ++S  S +  T+  SQ   
Sbjct: 241 --LPSPIPWRSRSSSS------------------SSSSSKEVESLPSVKNLTTVESQPLI 300

Query: 301 IKLSPPSPSPSPSPSPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPPPHVR 360
             L+P    PS   SPRK +P PN++ E              F PSPPPPPPPPPP    
Sbjct: 301 KNLTP----PSSFSSPRKSNPIPNLASE--------------FHPSPPPPPPPPPP---- 360

Query: 361 RIASMKPSSWLNDNDVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEALPRNYD 420
            + +   SS         +KD        +  SS+  T           +  E  P    
Sbjct: 361 -LPAFYNSS--------SRKDHPGIYRVERRESSVHKT---------KFAGGEFHP---- 420

Query: 421 DSLSMGKSTRKIRPGEVANEPPRRGREFGGYDQLKGKMIDQNAHVQAFEENPIEFPNDNK 480
                        P E    PP + R      +   + + +NA  + +  +PI       
Sbjct: 421 -----PPPPPPPPPVEYYKSPPTKFRLSNERRKSSEQKMKRNAPKKVWWSDPI------- 473

Query: 481 KELVEKLSMETDDDMESKEEDNNMVGKFIRE-DNGEPFNVNRRDNERSSSNELEAGSSSN 540
              VE      + D E  ++ +N+  K + E +NGE        ++      +E    S 
Sbjct: 481 ---VE----SKEQDTEKNDQRSNLGSKAVEESENGEQRRGENEIHDEVEKKIVEEEGVSE 473

Query: 541 LSNDGGPDVDKKADEFIAKFREQIRLQRIESIKRSTGQIRRNTSK 571
           ++N  G DVDKKADEFIAKFREQIRLQRIESIKRST +I  N+S+
Sbjct: 541 INN--GSDVDKKADEFIAKFREQIRLQRIESIKRSTNKISANSSR 473

BLAST of CmoCh16G011720 vs. TAIR 10
Match: AT3G60380.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT4G16790.1); Has 6102 Blast hits to 3981 proteins in 424 species: Archae - 6; Bacteria - 372; Metazoa - 2603; Fungi - 655; Plants - 291; Viruses - 28; Other Eukaryotes - 2147 (source: NCBI BLink). )

HSP 1 Score: 108.6 bits (270), Expect = 1.7e-23
Identity = 143/500 (28.60%), Positives = 214/500 (42.80%), Query Frame = 0

Query: 10  PPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLTRTWELLHLL 69
           PPN+         S        K ++   FL+ LPL PSQAP+FV +T+LT+ WEL+HLL
Sbjct: 13  PPNVVVPPQPRYKSIGGGGFFCKSVLFALFLLALPLFPSQAPDFVGETVLTKFWELIHLL 72

Query: 70  FVGIAVSYGLFSRRNDEKEDEISVSNFDNVQ-SYVSGLLHVSSVFDDEAETPSAN----- 129
           FVGIAV+YGLFSRRN E   ++ ++  D    SYVS +  VSSVFD+E +  S       
Sbjct: 73  FVGIAVAYGLFSRRNVESAVDLRMTRVDESSLSYVSRIFQVSSVFDEEFDDNSCEFVDVR 132

Query: 130 -------------------------DESMSLSDGNKVQTWSNRYFRNESVAVSEESPVVN 189
                                    +ES    + N+V+ W+++YF+ +S  V    P   
Sbjct: 133 SDESVSARASVVGKSESFVVESGELEESSEFGETNEVRAWNSQYFQGKSKVVVAR-PAYG 192

Query: 190 EQRVRSEKPLLLPVRSLKSRVVVDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLG 249
                  +PL LP+R L+S +             + +  +        + N E   +   
Sbjct: 193 LDGHVVHQPLGLPIRRLRSSL----------RDNAALQDKSFADSCDGAVNAEAESL--- 252

Query: 250 GVEDNFNENV--ALPSPVPWRSRSGRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRP 309
            + DNF + V  A  SPVPW++   R E+    DN P     + ++E+    + S SSR 
Sbjct: 253 -LADNFFDEVLAAPASPVPWQA---RPEMMGIGDNYPSNFQPISVDET----LKSISSRS 312

Query: 310 QTSRSSQASAIKLSPPSPSPSPSPSPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPP 369
             S SSQ S    +    SPS S S    S + NV   +K KS + S R  S  PS PP 
Sbjct: 313 TGSSSSQTSYASQNQNRFSPSRSVSAE--SLNSNVEELVKEKSRQSSSRSSS--PSLPPS 372

Query: 370 P---PPPPPPHVRRIASMKPSSWLNDNDVPH--------------QKDLKRSVTTSKPRS 429
           P   P PP P +    + + S  L  +D P               ++D++R        S
Sbjct: 373 PSLSPSPPSPELVPNDTRRRSPELVTDDTPRRASHSRHYSDGSLLEEDVRRGFENELEGS 432

Query: 430 SIRATGDDIDMVMGTNSSAEALPRNYDDSLSMGKSTRKIRPGEVANEPPRRGREFGGYDQ 458
            +R  G   +        +++L    + S    KS R   P  +++         GG D 
Sbjct: 433 KVR--GRKAEFFSKKERGSKSLNLAAESSRRGNKSRRSYPPESISS-------PVGGADD 477


HSP 2 Score: 40.4 bits (93), Expect = 5.6e-03
Identity = 24/41 (58.54%), Positives = 31/41 (75.61%), Query Frame = 0

Query: 522 SSSNLSNDGGPDVDKKADEFIAKFREQIRLQRIESIKRSTG 563
           S SN S+D   +VD+KA EFIAKFREQIRLQ++ S ++  G
Sbjct: 692 SQSNASHDHN-EVDRKAGEFIAKFREQIRLQKLISGEQPRG 731

The following BLAST results are available for this feature:

ExPASy TrEMBL
Match Name | E-value | Identity | Description
A0A6J1E6G0 | 0.0e+00 | 100.00 | uncharacterized protein DDB_G0284459 OS=Cucurbita moschata OX=3662 GN=LOC1114310...
A0A6J1L1K4 | 7.5e-282 | 92.33 | uncharacterized protein DDB_G0284459-like OS=Cucurbita maxima OX=3661 GN=LOC1115...
A0A6J1DZG8 | 1.8e-222 | 75.65 | WW domain-binding protein 11 OS=Momordica charantia OX=3673 GN=LOC111024943 PE=4...
A0A6J1L1Z4 | 8.5e-217 | 74.78 | uncharacterized protein DDB_G0284459-like OS=Cucurbita maxima OX=3661 GN=LOC1114...
A0A6J1GC23 | 1.8e-214 | 74.78 | MAP7 domain-containing protein 1-like OS=Cucurbita moschata OX=3662 GN=LOC111452...

TAIR 10
Match Name | E-value | Identity | Description
AT4G16790.1 | 2.2e-39 | 34.02 | hydroxyproline-rich glycoprotein family protein
AT3G60380.1 | 1.7e-23 | 28.60 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR008480 | Protein of unknown function DUF761, plant | PFAM | PF05553 | DUF761 | coord: 532..566; e-value: 2.4E-15; score: 55.8
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 397..411
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 455..539
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 183..436
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 184..198
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..20
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 419..436
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 329..347
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 486..515
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 266..290
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 460..479
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 516..530
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 291..309
None | No IPR available | PANTHER | PTHR34059:SF1 | EXPRESSED PROTEIN | coord: 1..570
None | No IPR available | PANTHER | PTHR34059 | EXPRESSED PROTEIN | coord: 1..570
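
Several of the MobiDB-lite disorder predictions listed above overlap or nest inside one another, so summing their lengths would overstate the disordered fraction. A minimal sketch of merging the ranges first is shown below. The function name `merge_intervals` is hypothetical, and the coordinates are assumed to be 1-based, inclusive residue positions.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(intervals):
        # With inclusive coordinates, a range starting at prev_end + 1 is contiguous.
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(iv) for iv in merged]


# MobiDB-lite coordinates from the InterPro table above.
disorder = [
    (397, 411), (455, 539), (183, 436), (184, 198), (1, 20), (419, 436),
    (329, 347), (486, 515), (266, 290), (460, 479), (516, 530), (291, 309),
]
merged = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in merged)
print(merged, covered)
```

The twelve ranges collapse to three regions, 1..20, 183..436 and 455..539, covering 359 residues. Against the 572-residue length reported in the BLAST alignments, that is roughly 63% of the protein.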

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmoCh16G011720.1 | CmoCh16G011720.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0016021 | integral component of membrane