CmoCh16G011720 (gene) Cucurbita moschata (Rifu)

Name: CmoCh16G011720
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Hydroxyproline-rich glycoprotein family protein, putative
Location: Cmo_Chr16 : 8350872 .. 8352590 (+)
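
The location field above can be read as a sequence identifier, a start, an end, and a strand. The following is a minimal sketch of how such a string might be parsed and its span computed. The helper names `parse_location` and `span_length` are hypothetical, and the calculation assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("Cmo_Chr16 : 8350872 .. 8352590 (+)")
length = span_length(start, end)
print(seqid, strand, length)  # Cmo_Chr16 + 1719
```

Under that assumption the feature spans 1719 bp, which is a whole number of codons (1719 / 3 = 573).
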
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGAATCGGACGTTCCCGCCAAACCTCCAAATCTGCCGCCAAGGAAAGATCGGGCTACTCCAAGTAAGTTCAATAGCCATATCTTGTATAAAATTTTAATGGCGATTTTCTTTCTAGTGATTCTCCCTCTAGTCCCTTCCCAAGCCCCTGAGTTTGTTAATCAAACTTTACTCACTAGAACCTGGGAGCTTCTGCACCTTCTTTTCGTCGGAATCGCTGTTTCTTACGGCCTTTTTAGCCGGAGAAATGATGAGAAAGAGGATGAAATTAGTGTCTCTAATTTTGATAATGTTCAGTCTTATGTTTCTGGTTTGCTTCATGTTTCGTCTGTTTTTGATGATGAGGCTGAAACTCCATCTGCTAATGATGAATCCATGTCTTTGTCTGATGGAAATAAGGTCCAAACATGGAGTAATAGGTATTTTAGGAATGAGTCTGTGGCTGTTTCTGAAGAAAGTCCTGTTGTTAATGAGCAGAGAGTTAGGAGTGAGAAGCCTCTGCTTCTTCCTGTTCGTAGCTTGAAGTCTCGAGTTGTTGTAGATGATGAGTCTAGAACTGTTTCTGGATCTACGTCGAGAGTGAGTTCGAGAAGATTATTGAGCGATTCGAAGAGGAGTTCGAATGGGGAGGTTGGGGGAGTGAATCTTGGAGGAGTTGAGGATAATTTCAATGAAAATGTCGCTCTTCCATCCCCGGTTCCATGGCGATCGAGATCGGGGAGGACGGAGGTGCAAGAAGAAGCTGATAATCCTCCTATGTATTCCCCTGCTGTTCCCATGGAGGAATCTGAATCGAATTGGATCGATTCTAGGTCTTCTAGGCCTCAAACTTCAAGGTCCTCCCAAGCCAGTGCCATTAAGCTATCTCCTCCTTCTCCTTCTCCTTCTCCTTCTCCATCTCCAAGGAAGCCATCTCCTTCGCCCAATGTGTCGCCAGAATTAAAGGCCAAGAGTTCTGAGGGTTCGGTAAGGAAGAAGAGCTTCTTCCCGTCTCCTCCGCCTCCGCCGCCACCTCCACCCCCGCCACATGTTCGAAGAATTGCCTCAATGAAACCAAGCTCTTGGTTGAACGACAATGATGTACCTCATCAAAAGGATTTGAAGAGAAGCGTCACTACTAGCAAGCCCAGAAGCTCAATTCGTGCTACAGGAGATGACATTGATATGGTGATGGGTACTAACTCAAGTGCTGAAGCACTGCCAAGAAATTATGATGATAGTTTATCAATGGGGAAATCTACTAGAAAAATCAGACCTGGAGAAGTTGCGAATGAGCCACCAAGAAGAGGAAGAGAATTTGGTGGATATGATCAGTTGAAGGGGAAGATGATAGATCAGAACGCCCATGTCCAAGCTTTTGAAGAAAACCCCATTGAGTTTCCAAATGACAATAAAAAAGAACTGGTCGAAAAGCTCAGCATGGAGACCGACGACGACATGGAAAGCAAGGAAGAAGACAATAACATGGTGGGAAAGTTTATTAGGGAAGACAATGGAGAACCCTTTAATGTGAACCGTAGAGACAACGAAAGAAGCTCGAGTAATGAATTAGAAGCAGGAAGCTCTAGCAATCTGAGCAATGATGGAGGACCAGATGTAGATAAGAAGGCTGATGAGTTCATTGCCAAATTCAGAGAGCAAATCAGGCTTCAAAGGATTGAATCAATCAAACGATCAACTGGACAAATTCGTAGAAACACTTCAAAGCAAACTTGA

mRNA sequence

ATGGCGGAATCGGACGTTCCCGCCAAACCTCCAAATCTGCCGCCAAGGAAAGATCGGGCTACTCCAAGTAAGTTCAATAGCCATATCTTGTATAAAATTTTAATGGCGATTTTCTTTCTAGTGATTCTCCCTCTAGTCCCTTCCCAAGCCCCTGAGTTTGTTAATCAAACTTTACTCACTAGAACCTGGGAGCTTCTGCACCTTCTTTTCGTCGGAATCGCTGTTTCTTACGGCCTTTTTAGCCGGAGAAATGATGAGAAAGAGGATGAAATTAGTGTCTCTAATTTTGATAATGTTCAGTCTTATGTTTCTGGTTTGCTTCATGTTTCGTCTGTTTTTGATGATGAGGCTGAAACTCCATCTGCTAATGATGAATCCATGTCTTTGTCTGATGGAAATAAGGTCCAAACATGGAGTAATAGGTATTTTAGGAATGAGTCTGTGGCTGTTTCTGAAGAAAGTCCTGTTGTTAATGAGCAGAGAGTTAGGAGTGAGAAGCCTCTGCTTCTTCCTGTTCGTAGCTTGAAGTCTCGAGTTGTTGTAGATGATGAGTCTAGAACTGTTTCTGGATCTACGTCGAGAGTGAGTTCGAGAAGATTATTGAGCGATTCGAAGAGGAGTTCGAATGGGGAGGTTGGGGGAGTGAATCTTGGAGGAGTTGAGGATAATTTCAATGAAAATGTCGCTCTTCCATCCCCGGTTCCATGGCGATCGAGATCGGGGAGGACGGAGGTGCAAGAAGAAGCTGATAATCCTCCTATGTATTCCCCTGCTGTTCCCATGGAGGAATCTGAATCGAATTGGATCGATTCTAGGTCTTCTAGGCCTCAAACTTCAAGGTCCTCCCAAGCCAGTGCCATTAAGCTATCTCCTCCTTCTCCTTCTCCTTCTCCTTCTCCATCTCCAAGGAAGCCATCTCCTTCGCCCAATGTGTCGCCAGAATTAAAGGCCAAGAGTTCTGAGGGTTCGGTAAGGAAGAAGAGCTTCTTCCCGTCTCCTCCGCCTCCGCCGCCACCTCCACCCCCGCCACATGTTCGAAGAATTGCCTCAATGAAACCAAGCTCTTGGTTGAACGACAATGATGTACCTCATCAAAAGGATTTGAAGAGAAGCGTCACTACTAGCAAGCCCAGAAGCTCAATTCGTGCTACAGGAGATGACATTGATATGGTGATGGGTACTAACTCAAGTGCTGAAGCACTGCCAAGAAATTATGATGATAGTTTATCAATGGGGAAATCTACTAGAAAAATCAGACCTGGAGAAGTTGCGAATGAGCCACCAAGAAGAGGAAGAGAATTTGGTGGATATGATCAGTTGAAGGGGAAGATGATAGATCAGAACGCCCATGTCCAAGCTTTTGAAGAAAACCCCATTGAGTTTCCAAATGACAATAAAAAAGAACTGGTCGAAAAGCTCAGCATGGAGACCGACGACGACATGGAAAGCAAGGAAGAAGACAATAACATGGTGGGAAAGTTTATTAGGGAAGACAATGGAGAACCCTTTAATGTGAACCGTAGAGACAACGAAAGAAGCTCGAGTAATGAATTAGAAGCAGGAAGCTCTAGCAATCTGAGCAATGATGGAGGACCAGATGTAGATAAGAAGGCTGATGAGTTCATTGCCAAATTCAGAGAGCAAATCAGGCTTCAAAGGATTGAATCAATCAAACGATCAACTGGACAAATTCGTAGAAACACTTCAAAGCAAACTTGA

Coding sequence (CDS)

ATGGCGGAATCGGACGTTCCCGCCAAACCTCCAAATCTGCCGCCAAGGAAAGATCGGGCTACTCCAAGTAAGTTCAATAGCCATATCTTGTATAAAATTTTAATGGCGATTTTCTTTCTAGTGATTCTCCCTCTAGTCCCTTCCCAAGCCCCTGAGTTTGTTAATCAAACTTTACTCACTAGAACCTGGGAGCTTCTGCACCTTCTTTTCGTCGGAATCGCTGTTTCTTACGGCCTTTTTAGCCGGAGAAATGATGAGAAAGAGGATGAAATTAGTGTCTCTAATTTTGATAATGTTCAGTCTTATGTTTCTGGTTTGCTTCATGTTTCGTCTGTTTTTGATGATGAGGCTGAAACTCCATCTGCTAATGATGAATCCATGTCTTTGTCTGATGGAAATAAGGTCCAAACATGGAGTAATAGGTATTTTAGGAATGAGTCTGTGGCTGTTTCTGAAGAAAGTCCTGTTGTTAATGAGCAGAGAGTTAGGAGTGAGAAGCCTCTGCTTCTTCCTGTTCGTAGCTTGAAGTCTCGAGTTGTTGTAGATGATGAGTCTAGAACTGTTTCTGGATCTACGTCGAGAGTGAGTTCGAGAAGATTATTGAGCGATTCGAAGAGGAGTTCGAATGGGGAGGTTGGGGGAGTGAATCTTGGAGGAGTTGAGGATAATTTCAATGAAAATGTCGCTCTTCCATCCCCGGTTCCATGGCGATCGAGATCGGGGAGGACGGAGGTGCAAGAAGAAGCTGATAATCCTCCTATGTATTCCCCTGCTGTTCCCATGGAGGAATCTGAATCGAATTGGATCGATTCTAGGTCTTCTAGGCCTCAAACTTCAAGGTCCTCCCAAGCCAGTGCCATTAAGCTATCTCCTCCTTCTCCTTCTCCTTCTCCTTCTCCATCTCCAAGGAAGCCATCTCCTTCGCCCAATGTGTCGCCAGAATTAAAGGCCAAGAGTTCTGAGGGTTCGGTAAGGAAGAAGAGCTTCTTCCCGTCTCCTCCGCCTCCGCCGCCACCTCCACCCCCGCCACATGTTCGAAGAATTGCCTCAATGAAACCAAGCTCTTGGTTGAACGACAATGATGTACCTCATCAAAAGGATTTGAAGAGAAGCGTCACTACTAGCAAGCCCAGAAGCTCAATTCGTGCTACAGGAGATGACATTGATATGGTGATGGGTACTAACTCAAGTGCTGAAGCACTGCCAAGAAATTATGATGATAGTTTATCAATGGGGAAATCTACTAGAAAAATCAGACCTGGAGAAGTTGCGAATGAGCCACCAAGAAGAGGAAGAGAATTTGGTGGATATGATCAGTTGAAGGGGAAGATGATAGATCAGAACGCCCATGTCCAAGCTTTTGAAGAAAACCCCATTGAGTTTCCAAATGACAATAAAAAAGAACTGGTCGAAAAGCTCAGCATGGAGACCGACGACGACATGGAAAGCAAGGAAGAAGACAATAACATGGTGGGAAAGTTTATTAGGGAAGACAATGGAGAACCCTTTAATGTGAACCGTAGAGACAACGAAAGAAGCTCGAGTAATGAATTAGAAGCAGGAAGCTCTAGCAATCTGAGCAATGATGGAGGACCAGATGTAGATAAGAAGGCTGATGAGTTCATTGCCAAATTCAGAGAGCAAATCAGGCTTCAAAGGATTGAATCAATCAAACGATCAACTGGACAAATTCGTAGAAACACTTCAAAGCAAACTTGA
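
The coding sequence above corresponds to the protein used as the query in the BLAST reports. As a rough check, the sketch below translates the first 48 nucleotides of the CDS with the standard genetic code. The `translate` helper is hypothetical and written only for illustration; it assumes the standard code applies and that the fragment starts in frame.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 48 nt of the CDS listed above.
cds_start = "ATGGCGGAATCGGACGTTCCCGCCAAACCTCCAAATCTGCCGCCAAGG"
print(translate(cds_start))  # MAESDVPAKPPNLPPR
```

The result agrees with the first 16 residues of the query sequence shown in the alignments (MAESDVPAKPPNLPPR).
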
BLAST of CmoCh16G011720 vs. TrEMBL
Match: A0A0A0L1H7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G019360 PE=4 SV=1)

HSP 1 Score: 726.5 bits (1874), Expect = 2.6e-206
Identity = 425/586 (72.53%), Positives = 475/586 (81.06%), Query Frame = 1

Query: 1   MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60
           MAESDV   P N        +PSKFN+H+LYK++ AIFFL+ILPLVPSQAPEFVNQTLLT
Sbjct: 1   MAESDVLTPPQN----HSTPSPSKFNTHLLYKLITAIFFLLILPLVPSQAPEFVNQTLLT 60

Query: 61  RTWELLHLLFVGIAVSYGLFSRRNDEKEDEISVSNFDNVQSYVSGLLHVSSVFDDEAETP 120
           R+WELLHLLFVGIAVSYGLFSRR+DEKEDEISVS FDNVQSYVSGLLHVSSVFDDE ETP
Sbjct: 61  RSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETP 120

Query: 121 SANDESMSLSDGNKVQTWSNRYFRNESVAVSEESPVVNEQRVRSEKPLLLPVRSLKSRVV 180
           SAND      D NKVQTW+NRYFRNESV V+EE PV NEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SAND------DENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVV 180

Query: 181 VDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGV-NLGGVEDNFNENVALPSPVPWRSR 240
           VDDE R    S  RVSSRRLLS+ KRSSN E GGV NL  ++D  NEN  LPSPVPWRSR
Sbjct: 181 VDDEFR----SKKRVSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSR 240

Query: 241 SGRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSSQASAIKLSPPSPSPSPS 300
           SGR E QEEADNP        ME+SESN I SRS +PQTS+SS+ASAI   P   SPSPS
Sbjct: 241 SGRMEKQEEADNP-------SMEDSESNRIGSRSPKPQTSKSSRASAI---PQRLSPSPS 300

Query: 301 PSPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSSWLND 360
           PSPRKPSPS NVSPEL+AKS+E  VRKKSF+ S PPPPPPPPPP VRR +SMKPSSW+N+
Sbjct: 301 PSPRKPSPSHNVSPELQAKSAEDLVRKKSFYRS-PPPPPPPPPPRVRRTSSMKPSSWVNE 360

Query: 361 NDVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEALPRNYDDSLSMGKSTRKIR 420
           +DVPHQK+L+RS  TSKPR+  R TGDD DM++G NSS E  PR+Y D LSMGKS R IR
Sbjct: 361 DDVPHQKELRRSY-TSKPRTITRDTGDDTDMMIGANSSGETQPRHYVDGLSMGKSVRTIR 420

Query: 421 PGEVANEPPRRGREFGGYDQLKGK-MIDQNAHVQAFEENPIEFPNDNKKELVEKLSMET- 480
            GE  NEPPRRGREF   DQLKGK M+++N HVQ FEENP+E P+++K+ELVEKL+M+T 
Sbjct: 421 AGEAVNEPPRRGREFSVNDQLKGKTMMNENTHVQDFEENPLESPDEDKEELVEKLTMDTD 480

Query: 481 -----DDDMESKEEDNNMVGKFIREDNGEPFNVNR--RDNERSSSN----ELEAGSSSNL 540
                DDDMES+ E N+MVGKFIREDNGEPF+V R  R++ER SSN    E EAGSSSN+
Sbjct: 481 VDEDDDDDMESEVEGNSMVGKFIREDNGEPFDVKRRNREDERGSSNEEEEEEEAGSSSNI 540

Query: 541 SNDGGPDVDKKADEFIAKFREQIRLQRIESIKRSTGQIRRNTSKQT 573
            NDGGPDVDKKADEFIAKFREQIRLQRIES KRS+GQIR+NT+KQ+
Sbjct: 541 GNDGGPDVDKKADEFIAKFREQIRLQRIESFKRSSGQIRKNTTKQS 560
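
The identity and positives figures in the HSP header above are simple ratios over the alignment length. The sketch below shows how such a header line might be parsed and the percentages recomputed. The function name `hsp_percentages` is hypothetical, and the regular expression is an assumption about this page's layout rather than a general BLAST parser.

```python
import re

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from an HSP summary line."""
    m = re.search(r"Identity = (\d+)/(\d+).*?Pos\w+ = (\d+)/(\d+)", line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    # Each percentage is matches divided by alignment length (gaps included).
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 425/586 (72.53%), Positives = 475/586 (81.06%), Query Frame = 1"
identity_pct, positives_pct = hsp_percentages(line)
print(identity_pct, positives_pct)  # 72.53 81.06
```
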

BLAST of CmoCh16G011720 vs. TrEMBL
Match: A0A061G512_THECC (Hydroxyproline-rich glycoprotein family protein, putative OS=Theobroma cacao GN=TCM_015918 PE=4 SV=1)

HSP 1 Score: 393.3 bits (1009), Expect = 5.1e-106
Identity = 288/602 (47.84%), Positives = 364/602 (60.47%), Query Frame = 1

Query: 1   MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60
           MA++D   K   L   +++  PSK+  H L K L+ I FLVI+P+ PSQAPEF+NQTLL 
Sbjct: 71  MADTDSYTKRQQLVKAQNQENPSKYYKHFLNKALVVIIFLVIIPVFPSQAPEFINQTLLN 130

Query: 61  RTWELLHLLFVGIAVSYGLFSRRND--EKEDEISVSNFDNVQSYVSGLLHVSSVFDDEAE 120
           R+WELLHLLFVGIAVSYGLFSRRND  EKE+  + S FDNVQS+VS  L VSSVFDDEAE
Sbjct: 131 RSWELLHLLFVGIAVSYGLFSRRNDEIEKENNNNQSKFDNVQSFVSRFLQVSSVFDDEAE 190

Query: 121 TPSANDESMSLSDGNKVQTWSNRYFRNE-SVAVSEESPVVNEQRVR----SEKPLLLPVR 180
               +DES       KVQTWSN+Y+RNE  V V++E  V++EQR      SEKPLLLPVR
Sbjct: 191 NLPGSDES-------KVQTWSNQYYRNEPPVVVAKEHAVLDEQRSSSSRISEKPLLLPVR 250

Query: 181 SLKSRVV----------VDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDN 240
           SLKSRV+              S ++S S S  SS+R    S + +NG +GG++   +E  
Sbjct: 251 SLKSRVLDANNLETSRENSSNSSSLSRSDSSFSSKRF---SNKGTNGALGGLDQDALEKK 310

Query: 241 FNE-NVALPSPVPWRSRSGRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSS 300
            NE NV LPSP+PWRSRSGR EV+++              ESE N ++SRS R QT+R S
Sbjct: 311 LNENNVVLPSPIPWRSRSGRMEVKDDI-------------ESEFNRLESRSFRSQTNRLS 370

Query: 301 QASAIKLSPP-SPSPSPSPSPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPP 360
           ++S++  SP  SPSP P  SP+K SPSP +S E +AKS+E  VRKKS + SPPPPPPPPP
Sbjct: 371 RSSSLSSSPKLSPSP-PLSSPKKLSPSPPLSMEAQAKSAEDVVRKKSIYRSPPPPPPPPP 430

Query: 361 PPHVRRIASMKPSSWLNDNDVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEAL 420
           PP + + +S+KPSS L D++V   KDL  +  +            D D +MGT       
Sbjct: 431 PPIIHKSSSLKPSSTLIDDEVSFDKDLPWNYASE---------DSDGDTLMGTQ------ 490

Query: 421 PRNYDDSLSMGKSTRKIRPGEVANEPPRRGREFGG-------YDQLKGKMIDQNAHVQAF 480
            R+Y D LS GKS + IRP +      + G    G       +DQ   +    N    +F
Sbjct: 491 -RDYVDGLSKGKSLKMIRPSDSLRGTRKDGEIENGINGKTVRFDQTSFRTEKLNRESVSF 550

Query: 481 EENP--IEFPNDNKKELVEKLSME-TDDDMESKEE---DNNMVGKFIREDNGEPFNVNRR 540
              P  +EFP + K E VEKL ME TDD+ ES+ E   D + +  F R  N E       
Sbjct: 551 MPKPTFMEFPQEQKHEFVEKLVMETTDDESESENEEVGDTSFLSSFERSPNIE------- 610

Query: 541 DNERSSSNELEAGSSSNLSNDGGPDVDKKADEFIAKFREQIRLQRIESIKRSTGQIRRNT 571
                     EA  SS +  DGG DVDKKADEFIAK REQIRLQRI+SIKRS+GQ++RN+
Sbjct: 611 ----------EASPSSGI--DGGSDVDKKADEFIAKVREQIRLQRIDSIKRSSGQMKRNS 613

BLAST of CmoCh16G011720 vs. TrEMBL
Match: A0A067F1T8_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g009613mg PE=4 SV=1)

HSP 1 Score: 377.9 bits (969), Expect = 2.2e-101
Identity = 277/591 (46.87%), Positives = 361/591 (61.08%), Query Frame = 1

Query: 1   MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60
           MA  D  AK   L   +++A PSKF +H  YK L+A  FLVILPL PSQAPEF+NQ+L T
Sbjct: 1   MANMDPYAKQKMLKTEENKANPSKFYTHFFYKALIASIFLVILPLFPSQAPEFINQSLFT 60

Query: 61  RTWELLHLLFVGIAVSYGLFSRRNDEKEDEI-SVSNFDNVQSYVSGLLHVSSVFDDEAET 120
           R+WELLHL+FVGIAVSYGLFS RNDE E E  + + FDN Q+YVS  L VSSVFDDE E 
Sbjct: 61  RSWELLHLVFVGIAVSYGLFSHRNDETEKETNNHTKFDNAQTYVSRFLQVSSVFDDENEN 120

Query: 121 PSANDESMSLSDGNKVQTWSNRYFRNESVAV--SEESPVVNEQRVR-------SEKPLLL 180
            S++DE       NKVQTWSN+Y+RNE V V   E S +  E  V         EKPLLL
Sbjct: 121 SSSSDE-------NKVQTWSNQYYRNEPVVVVAKEHSALDQEHMVSCSSFGGGGEKPLLL 180

Query: 181 PVRSLKSRVVVDD------ESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDNF 240
           PVRSLKSRV   D      E +++S S+S   SRR  S+S +  N E+GG +   +E+  
Sbjct: 181 PVRSLKSRVSDTDTVEPIKEFKSLSRSSSNSGSRRFSSNSNKRKNRELGGFDQVKLEEKL 240

Query: 241 NENVALPSPVPWRSRSGRTEVQEEADNP-PMYSPAVP-MEESESNWIDSRSSRPQTSRSS 300
           NENV LPSP+PWR+RSGR E++E+ D+P P  +P  P +EE++ N ++SRS R Q  RS+
Sbjct: 241 NENVVLPSPIPWRTRSGRMEMKEDVDSPVPPLNPLPPSVEETDLNGLESRSLRSQMPRSN 300

Query: 301 QASAIKLSPP-SPSPSPSPSPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPP 360
           + +A   SP  SPSPS S SP+K SPSP++S E +AK+SE  VRKKSF+   PPPPPPPP
Sbjct: 301 RPNATSSSPKLSPSPSLS-SPKKLSPSPSLSSESQAKNSEDLVRKKSFY--RPPPPPPPP 360

Query: 361 PPHVRRIASMKPSSWLNDNDVPHQKDLKRSVTTSKPRS--SIRATGDDIDMVMGTNSSAE 420
           PP + R ++MK SS L+++    +K+L RS TT   +S  ++R+T  + D    T+   E
Sbjct: 361 PPMMYRSSTMKVSSGLDNDGASFEKNLNRSFTTETKQSVRTMRSTDRETDQ---TSFKTE 420

Query: 421 ALPRNYDDSLSMGKSTRKIRPGEVANEPPRRGREFGGYDQLKGKMIDQNAHVQAFEENPI 480
            L R           +    P     + P + RE         + +D+   V+  E++  
Sbjct: 421 QLRR----------ESVNFMPNASFMQFPEQDRE---------EFVDK-VVVETDEDSGT 480

Query: 481 EFPNDNKKELVEKLSMETDDDMESKEEDNNMVGKFIREDNGEPFNVNRRDNERSSSNELE 540
           E+ ++ ++E  E +  ET                        PF  N    +RSS+NE  
Sbjct: 481 EYDDEEEEEEEEDVMGET------------------------PFISN---IDRSSNNEAA 531

Query: 541 AGSSSNLSNDGGPDVDKKADEFIAKFREQIRLQRIESIKRSTGQIRRNTSK 571
           A S S+  +DGGPDVDKKADEFIAKFREQIRLQRIESIKRS+ QI RN+S+
Sbjct: 541 AASISSSVSDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSAQISRNSSR 531

BLAST of CmoCh16G011720 vs. TrEMBL
Match: B9SZP6_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0006170 PE=4 SV=1)

HSP 1 Score: 372.1 bits (954), Expect = 1.2e-99
Identity = 277/576 (48.09%), Positives = 351/576 (60.94%), Query Frame = 1

Query: 22  PSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLTRTWELLHLLFVGIAVSYGLFS 81
           PSKF SH LYK L+   FLVILPL PSQAPEF+NQTL TR WE LHL+FVGIAVSYGLFS
Sbjct: 23  PSKFYSHFLYKALIVTIFLVILPLFPSQAPEFINQTLNTRGWEFLHLIFVGIAVSYGLFS 82

Query: 82  RRNDEKE-DEISVSNFDNVQSYVSGLLHVSSVFDDEAETPSANDESMSLSDGNKVQTWSN 141
           RRNDE E D  S S FDN QSYVS  L VSSVFDD+A++PS +D S S S    VQTW+N
Sbjct: 83  RRNDETEKDNSSNSKFDNAQSYVSRFLQVSSVFDDDADSPSKSDVSNSTS----VQTWNN 142

Query: 142 RYFRNESVAV-SEESPVVNEQRVRS------EKPLLLPVRSLKSRVV-----------VD 201
           +Y+RNE V V +EE     +Q  RS      EKPLLLP+RSLKSRV+           + 
Sbjct: 143 QYYRNEPVVVVAEEQHPAFDQEQRSTGSRIGEKPLLLPIRSLKSRVLDADGNEISKESIS 202

Query: 202 DESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDNFNENVALPSPVPWRSRSGR 261
             S ++S + S + S+R    S +S NGE GG+    +E+   +NV LPSP+PWRSRSGR
Sbjct: 203 SVSASISRTNSNLGSKRF---SSKSRNGEFGGLQHQDLEEKIKDNVVLPSPIPWRSRSGR 262

Query: 262 TEVQE---EADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSSQASAIKLSPP-SPSPSP 321
            E++E   E D+PP+Y+    MEESE N    R  R Q SRS ++++   SP  SPSPS 
Sbjct: 263 MEMKEAKEETDSPPLYTLPPSMEESEFN----RFFRSQVSRSPRSNSTASSPKLSPSPSM 322

Query: 322 SPSPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPP-PH-VRRIASMKPSSW 381
           S SP+K SP P+ S E +AKS+E  VR+KSF  SPPPPPPPPPP P  +R+  SMKP S 
Sbjct: 323 S-SPKKLSPPPSFSAETQAKSAEDFVRRKSFHRSPPPPPPPPPPFPQLIRKSRSMKPGSS 382

Query: 382 LNDNDVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEALPRNYDDSLSMGKSTR 441
              N     +D KRS T S+P+        +++ V  ++          +DS +M    +
Sbjct: 383 EIGNRDSVGRDFKRSFT-SEPK--------EMNWVGNSSMKKSIRTTRSNDSFAMASKEK 442

Query: 442 KIRPGEVANEPPRRGREFGGYDQLKGKMIDQNAHVQAFEENP--IEFPNDNKKELVEKLS 501
           +    +V N    +      +DQ   K    N     F   P  +E+P + K+E VEKL 
Sbjct: 443 EF--DDVINSKTEKK-----FDQAAFKT---NRDRVTFMPQPTYMEYPKEEKEEFVEKLV 502

Query: 502 METDDDMESKEEDNNMVGKFIREDNGEPFNVNRRDNERSSSNELEAGSSSNLSNDGGPDV 561
           +E+D+D+E  E+D +        DN +    N      S+S   E  +S N+S DGGPDV
Sbjct: 503 LESDEDLEETEDDLDGAADDNDNDN-DDIAGNSFVASTSASTNNEEPNSGNVS-DGGPDV 562

Query: 562 DKKADEFIAKFREQIRLQRIESIKRSTGQIRRNTSK 571
           DKKADEFIAKFREQIRLQRIESIK+S+GQI R  S+
Sbjct: 563 DKKADEFIAKFREQIRLQRIESIKKSSGQIHRKASR 565

BLAST of CmoCh16G011720 vs. TrEMBL
Match: U5GTV3_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s15480g PE=4 SV=1)

HSP 1 Score: 370.9 bits (951), Expect = 2.7e-99
Identity = 272/580 (46.90%), Positives = 362/580 (62.41%), Query Frame = 1

Query: 19  RATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLTRTWELLHLLFVGIAVSYG 78
           +A P+K+ ++ LYK L+   FL+IL L PSQAPEF+NQTL TR WE L L+FVGIAVSYG
Sbjct: 15  QANPTKYYTNFLYKALIVTIFLIILQLFPSQAPEFLNQTLNTRGWEFLRLVFVGIAVSYG 74

Query: 79  LFSRRNDEKEDEISVSN---FDNVQSYVSGLLHVSSVFDDEAETPSANDESMSLSDGNKV 138
           LFSRRNDE E EI+ SN   FDN QSYVS  L VSSVFDDE ++P  ++E+       KV
Sbjct: 75  LFSRRNDETEKEINNSNPSRFDNAQSYVSRFLQVSSVFDDEVDSPPESEET-------KV 134

Query: 139 QTWSNRYFRNESVAVSEE--SPVVNEQRVRS----EKPLLLPVRSLKSRVV---VDDESR 198
           QTWSN+Y+RN+ V V  E  S +  EQR  S    EKPLLLPVRSLKSRV+   VD+  +
Sbjct: 135 QTWSNQYYRNDPVVVVAEQNSALDKEQRATSSRIGEKPLLLPVRSLKSRVIDADVDETGK 194

Query: 199 TVSGSTSRVS-------SRRLLSDSKRSSNGEVGGVNLGGVEDNFNENVALPSPVPWRSR 258
             +G ++ +S       S+R  S+S ++ +GE GG     +E+   EN  LPSP+PWRSR
Sbjct: 195 ESAGGSASISRSNSDSGSKRFSSNSSKNKSGESGGSYCQELEEKLKENFVLPSPIPWRSR 254

Query: 259 SGRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSSQASAIKLSPPSPSPSPS 318
           SGR E++EEAD+P +YS    +E+SE N     S  PQ++RS  A+    S P  SPSPS
Sbjct: 255 SGRMEMKEEADSP-LYSLPPSLEKSEYNR-SFNSQVPQSARSISAT----SSPKLSPSPS 314

Query: 319 -PSPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPPPHVRR-IASMKPSSWL 378
             SP+K SPSP+ S E+  KS E  VRKKS + SPPPPPPPPPPP V R  +S+KP S  
Sbjct: 315 FSSPKKFSPSPSFSSEVLGKSVEDFVRKKSIYRSPPPPPPPPPPPPVNRESSSVKPISSA 374

Query: 379 NDNDVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEAL-----PRNYDDSLSMG 438
             ++V  +++LKRS TT +P+   R     +   + T  S + L      + +DD ++  
Sbjct: 375 VHDEVLLERELKRSFTT-EPKDLNRGGNLPMPKSVRTIRSNDLLGEARREKEFDDRIN-S 434

Query: 439 KSTRKIRPGEVANEPPRRGREFGGYDQLKGKMIDQNAHVQAFEENPI--EFPNDNKKELV 498
           K  ++++  E A    R GR+   +DQ   +   QN    +F   P   EF  +  +E V
Sbjct: 435 KEEKRLKEVE-ARGKERAGRKTVRFDQSSFQTEKQNRESVSFTPQPTFTEFHEEENEEFV 494

Query: 499 EKLSMETDDDMESKEEDNNMVGKFIREDNGEPFNVNRRDNERSSSNELEAGSSSNLSNDG 558
           EKL +E+D+  E++EE+N     F               +  ++S E +A ++S +++DG
Sbjct: 495 EKLVVESDEGSETEEEENIAGSSFA--------------SSTAASPEKDAAAAS-IASDG 554

Query: 559 GPDVDKKADEFIAKFREQIRLQRIESIKRSTGQIRRNTSK 571
           GPDVDKKADEFIAKFREQIRLQRIESIK+S+ QIRRN SK
Sbjct: 555 GPDVDKKADEFIAKFREQIRLQRIESIKKSSAQIRRNPSK 563

BLAST of CmoCh16G011720 vs. TAIR10
Match: AT4G16790.1 (AT4G16790.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 129.8 bits (325), Expect = 5.4e-30
Identity = 135/356 (37.92%), Positives = 170/356 (47.75%), Query Frame = 1

Query: 1   MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60
           M E+    KP  L  ++D+  P KF S  ++K L+      ++P+  SQ PE  NQT   
Sbjct: 1   MVEARSLKKPIQLGNKEDQ-NPRKFYSRFIFKALILTVLCAVVPVFLSQTPELANQT--- 60

Query: 61  RTWELLHLLFVGIAVSYGLFSRRNDEKEDEISVSNFD---------NVQSYVSGLLHVSS 120
           R  ELLHL+FVGIAVSYGLFSRRN +       SN D         N  SYV  +L VSS
Sbjct: 61  RLLELLHLVFVGIAVSYGLFSRRNYDGGGGGGTSNSDHNKADHSNNNSHSYVPKILEVSS 120

Query: 121 VFD--DEAETPSANDESMSLSDGNKVQTWSNRY-FRNESVAVSEESPVVNEQRVRSEKPL 180
           VF+   E+E+  ++D S    D  K QTW N+Y  +   V       V +E R   EKPL
Sbjct: 121 VFNVGHESESEPSDDSS---GDQRKFQTWKNKYHMKIPEVETRFVDRVSSENR---EKPL 180

Query: 181 LLPVRSLK-SRVVVDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDNFNEN 240
           LLPVRSL  SR  V D S   SG   +V S+R L  +    N +V               
Sbjct: 181 LLPVRSLNYSR--VSDSSGDNSGRWEKVRSKRELLKTLGDDNSDV--------------- 240

Query: 241 VALPSPVPWRSRSGRTEVQEEADNPPMYSPAVPMEESESNWIDSRSS-RPQTSRSSQASA 300
             LPSP+PWRSRS  +                    S S  ++S  S +  T+  SQ   
Sbjct: 241 --LPSPIPWRSRSSSS------------------SSSSSKEVESLPSVKNLTTVESQPLI 291

Query: 301 IKLSPPSPSPSPSPSPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPP 343
             L+P    PS   SPRK +P PN++ E              F PSPPPPPPPPPP
Sbjct: 301 KNLTP----PSSFSSPRKSNPIPNLASE--------------FHPSPPPPPPPPPP 291

BLAST of CmoCh16G011720 vs. TAIR10
Match: AT3G60380.1 (AT3G60380.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 102.1 bits (253), Expect = 1.2e-21
Identity = 145/487 (29.77%), Positives = 210/487 (43.12%), Query Frame = 1

Query: 10  PPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLTRTWELLHLL 69
           PPN+         S        K ++   FL+ LPL PSQAP+FV +T+LT+ WEL+HLL
Sbjct: 13  PPNVVVPPQPRYKSIGGGGFFCKSVLFALFLLALPLFPSQAPDFVGETVLTKFWELIHLL 72

Query: 70  FVGIAVSYGLFSRRNDEKEDEISVSNFDNVQ-SYVSGLLHVSSVFDDEAETPSANDESMS 129
           FVGIAV+YGLFSRRN E   ++ ++  D    SYVS +  VSSVFD+E +  S   E + 
Sbjct: 73  FVGIAVAYGLFSRRNVESAVDLRMTRVDESSLSYVSRIFQVSSVFDEEFDDNSC--EFVD 132

Query: 130 LSDGNKVQTWSNRYFRNESVAV-------SEESPVVNEQRV--------RSEKPLLLPVR 189
           +     V   ++   ++ES  V       S E    NE R         +S+  +  P  
Sbjct: 133 VRSDESVSARASVVGKSESFVVESGELEESSEFGETNEVRAWNSQYFQGKSKVVVARPAY 192

Query: 190 SLKSRVVVDDESRTVSGSTSRVSSRRLLSDSK--RSSNGEVGGVNLGGVEDNFNENV--A 249
            L   VV       +    S +     L D     S +G V       + DNF + V  A
Sbjct: 193 GLDGHVVHQPLGLPIRRLRSSLRDNAALQDKSFADSCDGAVNAEAESLLADNFFDEVLAA 252

Query: 250 LPSPVPWRSRSGRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSSQASAIKL 309
             SPVPW++   R E+    DN P     + ++E+    + S SSR   S SSQ S    
Sbjct: 253 PASPVPWQA---RPEMMGIGDNYPSNFQPISVDET----LKSISSRSTGSSSSQTSYASQ 312

Query: 310 SPPSPSPSPSPSPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPP---PPPPPPHVR 369
           +    SPS S S    S + NV   +K KS + S R  S  PS PP P   P PP P + 
Sbjct: 313 NQNRFSPSRSVSAE--SLNSNVEELVKEKSRQSSSRSSS--PSLPPSPSLSPSPPSPELV 372

Query: 370 RIASMKPSSWLNDNDVPH--------------QKDLKRSVTTSKPRSSIRATGDDIDMVM 429
              + + S  L  +D P               ++D++R        S +R  G   +   
Sbjct: 373 PNDTRRRSPELVTDDTPRRASHSRHYSDGSLLEEDVRRGFENELEGSKVR--GRKAEFFS 432

Query: 430 GTNSSAEALPRNYDDSLSMGKSTRKIRPGEVANEPPRRGREFGGYDQ--LKGKMIDQNAH 458
                +++L    + S    KS R   P  +++         GG D    + + + Q ++
Sbjct: 433 KKERGSKSLNLAAESSRRGNKSRRSYPPESISS-------PVGGADDSTTRRQDLQQKSN 477

BLAST of CmoCh16G011720 vs. NCBI nr
Match: gi|659095884|ref|XP_008448814.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein DDB_G0284459 [Cucumis melo])

HSP 1 Score: 726.9 bits (1875), Expect = 2.8e-206
Identity = 423/589 (71.82%), Positives = 478/589 (81.15%), Query Frame = 1

Query: 1   MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60
           MAESDV   P N        +PSKFN+H+ YK++ A+FFL+ILPLVPSQAPEFVNQTLLT
Sbjct: 1   MAESDVLTPPQN----HSTPSPSKFNTHLFYKLMTAVFFLLILPLVPSQAPEFVNQTLLT 60

Query: 61  RTWELLHLLFVGIAVSYGLFSRRNDEKEDEISVSNFDNVQSYVSGLLHVSSVFDDEAETP 120
           R+WELLHLLFVGIAVSYGLFSRR+DEKEDEISVS FDNVQSYVSGLLHVSSVFDDE ETP
Sbjct: 61  RSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETP 120

Query: 121 SANDESMSLSDGNKVQTWSNRYFRNESVAVSEESPVVNEQRVRSEKPLLLPVRSLKSRVV 180
           SAND      D NKVQTW+NRYFRNESV V+EE PVVNEQRVRSEKPLLLPVRSLKSRV+
Sbjct: 121 SAND------DENKVQTWNNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVI 180

Query: 181 VDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGV-NLGGVEDNFNENVALPSPVPWRSR 240
           VDDESR    S  RVSSRRLLS+ KR+SN E GGV NL  ++D  NENV LPSPVPWRSR
Sbjct: 181 VDDESR----SKKRVSSRRLLSNLKRTSNVEFGGVNNLDEIDDKLNENVVLPSPVPWRSR 240

Query: 241 SGRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSSQASAIKLSPPSPSPSPS 300
           SGR E QEEADNP        ME+SESN I SRS +PQTS++S+ASAI     SPSPSPS
Sbjct: 241 SGRLEKQEEADNP-------SMEDSESNRIGSRSPKPQTSKASRASAIP-QKLSPSPSPS 300

Query: 301 PSPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSSWLND 360
           PSPRKPSPS NVSPEL+AKS+E  VRKKSF+ SPPPPPPPP P HV        SSW+N+
Sbjct: 301 PSPRKPSPSHNVSPELQAKSAEDLVRKKSFYRSPPPPPPPPHPHHVFEELPRXKSSWVNE 360

Query: 361 NDVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEALPRNYDDSLSMGKSTRKIR 420
           +D+PHQK+L+RS  TSKPR+ IR TGDD DM++G+NSS E  PRNY DSLSMGKS R IR
Sbjct: 361 DDIPHQKELRRSF-TSKPRAIIRDTGDDTDMMLGSNSSGETQPRNYVDSLSMGKSVRTIR 420

Query: 421 PGEVANEPPRRGREFGGYDQLKGK-MIDQNAHVQAFEENPIEFPNDNKKELVEKLSMET- 480
           PGEV NEPPRRGREF   DQLKGK M+++N H+Q FEENPIEFP+++K+ELVEKL+++T 
Sbjct: 421 PGEVVNEPPRRGREFSVNDQLKGKMMMNENTHIQDFEENPIEFPDEDKEELVEKLTLDTD 480

Query: 481 ----DDDMESK-EEDNNMVGKFIREDNGEPFNVNR--RDNERSSSN-------ELEAGSS 540
               DDDMES+ EE+N+MVGKFIREDNGEPF+V R  RD+ER S N       E EAGS+
Sbjct: 481 VDDDDDDMESEIEENNSMVGKFIREDNGEPFDVKRRNRDDERGSRNEEEKEEEEEEAGSA 540

Query: 541 SNLSNDGGPDVDKKADEFIAKFREQIRLQRIESIKRSTGQIRRNTSKQT 573
           SN+ NDGGPDVDKKADEFIAKFREQIRLQRIESIKRS+GQIR+N +KQT
Sbjct: 541 SNIGNDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNITKQT 566

BLAST of CmoCh16G011720 vs. NCBI nr
Match: gi|778675313|ref|XP_011650387.1| (PREDICTED: uncharacterized protein DDB_G0284459 [Cucumis sativus])

HSP 1 Score: 726.5 bits (1874), Expect = 3.7e-206
Identity = 425/586 (72.53%), Positives = 475/586 (81.06%), Query Frame = 1

Query: 1   MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60
           MAESDV   P N        +PSKFN+H+LYK++ AIFFL+ILPLVPSQAPEFVNQTLLT
Sbjct: 1   MAESDVLTPPQN----HSTPSPSKFNTHLLYKLITAIFFLLILPLVPSQAPEFVNQTLLT 60

Query: 61  RTWELLHLLFVGIAVSYGLFSRRNDEKEDEISVSNFDNVQSYVSGLLHVSSVFDDEAETP 120
           R+WELLHLLFVGIAVSYGLFSRR+DEKEDEISVS FDNVQSYVSGLLHVSSVFDDE ETP
Sbjct: 61  RSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETP 120

Query: 121 SANDESMSLSDGNKVQTWSNRYFRNESVAVSEESPVVNEQRVRSEKPLLLPVRSLKSRVV 180
           SAND      D NKVQTW+NRYFRNESV V+EE PV NEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SAND------DENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVV 180

Query: 181 VDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGV-NLGGVEDNFNENVALPSPVPWRSR 240
           VDDE R    S  RVSSRRLLS+ KRSSN E GGV NL  ++D  NEN  LPSPVPWRSR
Sbjct: 181 VDDEFR----SKKRVSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSR 240

Query: 241 SGRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSSQASAIKLSPPSPSPSPS 300
           SGR E QEEADNP        ME+SESN I SRS +PQTS+SS+ASAI   P   SPSPS
Sbjct: 241 SGRMEKQEEADNP-------SMEDSESNRIGSRSPKPQTSKSSRASAI---PQRLSPSPS 300

Query: 301 PSPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSSWLND 360
           PSPRKPSPS NVSPEL+AKS+E  VRKKSF+ S PPPPPPPPPP VRR +SMKPSSW+N+
Sbjct: 301 PSPRKPSPSHNVSPELQAKSAEDLVRKKSFYRS-PPPPPPPPPPRVRRTSSMKPSSWVNE 360

Query: 361 NDVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEALPRNYDDSLSMGKSTRKIR 420
           +DVPHQK+L+RS  TSKPR+  R TGDD DM++G NSS E  PR+Y D LSMGKS R IR
Sbjct: 361 DDVPHQKELRRSY-TSKPRTITRDTGDDTDMMIGANSSGETQPRHYVDGLSMGKSVRTIR 420

Query: 421 PGEVANEPPRRGREFGGYDQLKGK-MIDQNAHVQAFEENPIEFPNDNKKELVEKLSMET- 480
            GE  NEPPRRGREF   DQLKGK M+++N HVQ FEENP+E P+++K+ELVEKL+M+T 
Sbjct: 421 AGEAVNEPPRRGREFSVNDQLKGKTMMNENTHVQDFEENPLESPDEDKEELVEKLTMDTD 480

Query: 481 -----DDDMESKEEDNNMVGKFIREDNGEPFNVNR--RDNERSSSN----ELEAGSSSNL 540
                DDDMES+ E N+MVGKFIREDNGEPF+V R  R++ER SSN    E EAGSSSN+
Sbjct: 481 VDEDDDDDMESEVEGNSMVGKFIREDNGEPFDVKRRNREDERGSSNEEEEEEEAGSSSNI 540

Query: 541 SNDGGPDVDKKADEFIAKFREQIRLQRIESIKRSTGQIRRNTSKQT 573
            NDGGPDVDKKADEFIAKFREQIRLQRIES KRS+GQIR+NT+KQ+
Sbjct: 541 GNDGGPDVDKKADEFIAKFREQIRLQRIESFKRSSGQIRKNTTKQS 560

BLAST of CmoCh16G011720 vs. NCBI nr
Match: gi|590676526|ref|XP_007039761.1| (Hydroxyproline-rich glycoprotein family protein, putative [Theobroma cacao])

HSP 1 Score: 393.3 bits (1009), Expect = 7.4e-106
Identity = 288/602 (47.84%), Positives = 364/602 (60.47%), Query Frame = 1

Query: 1   MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60
           MA++D   K   L   +++  PSK+  H L K L+ I FLVI+P+ PSQAPEF+NQTLL 
Sbjct: 71  MADTDSYTKRQQLVKAQNQENPSKYYKHFLNKALVVIIFLVIIPVFPSQAPEFINQTLLN 130

Query: 61  RTWELLHLLFVGIAVSYGLFSRRND--EKEDEISVSNFDNVQSYVSGLLHVSSVFDDEAE 120
           R+WELLHLLFVGIAVSYGLFSRRND  EKE+  + S FDNVQS+VS  L VSSVFDDEAE
Sbjct: 131 RSWELLHLLFVGIAVSYGLFSRRNDEIEKENNNNQSKFDNVQSFVSRFLQVSSVFDDEAE 190

Query: 121 TPSANDESMSLSDGNKVQTWSNRYFRNE-SVAVSEESPVVNEQRVR----SEKPLLLPVR 180
               +DES       KVQTWSN+Y+RNE  V V++E  V++EQR      SEKPLLLPVR
Sbjct: 191 NLPGSDES-------KVQTWSNQYYRNEPPVVVAKEHAVLDEQRSSSSRISEKPLLLPVR 250

Query: 181 SLKSRVV----------VDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDN 240
           SLKSRV+              S ++S S S  SS+R    S + +NG +GG++   +E  
Sbjct: 251 SLKSRVLDANNLETSRENSSNSSSLSRSDSSFSSKRF---SNKGTNGALGGLDQDALEKK 310

Query: 241 FNE-NVALPSPVPWRSRSGRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSS 300
            NE NV LPSP+PWRSRSGR EV+++              ESE N ++SRS R QT+R S
Sbjct: 311 LNENNVVLPSPIPWRSRSGRMEVKDDI-------------ESEFNRLESRSFRSQTNRLS 370

Query: 301 QASAIKLSPP-SPSPSPSPSPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPP 360
           ++S++  SP  SPSP P  SP+K SPSP +S E +AKS+E  VRKKS + SPPPPPPPPP
Sbjct: 371 RSSSLSSSPKLSPSP-PLSSPKKLSPSPPLSMEAQAKSAEDVVRKKSIYRSPPPPPPPPP 430

Query: 361 PPHVRRIASMKPSSWLNDNDVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEAL 420
           PP + + +S+KPSS L D++V   KDL  +  +            D D +MGT       
Sbjct: 431 PPIIHKSSSLKPSSTLIDDEVSFDKDLPWNYASE---------DSDGDTLMGTQ------ 490

Query: 421 PRNYDDSLSMGKSTRKIRPGEVANEPPRRGREFGG-------YDQLKGKMIDQNAHVQAF 480
            R+Y D LS GKS + IRP +      + G    G       +DQ   +    N    +F
Sbjct: 491 -RDYVDGLSKGKSLKMIRPSDSLRGTRKDGEIENGINGKTVRFDQTSFRTEKLNRESVSF 550

Query: 481 EENP--IEFPNDNKKELVEKLSME-TDDDMESKEE---DNNMVGKFIREDNGEPFNVNRR 540
              P  +EFP + K E VEKL ME TDD+ ES+ E   D + +  F R  N E       
Sbjct: 551 MPKPTFMEFPQEQKHEFVEKLVMETTDDESESENEEVGDTSFLSSFERSPNIE------- 610

Query: 541 DNERSSSNELEAGSSSNLSNDGGPDVDKKADEFIAKFREQIRLQRIESIKRSTGQIRRNT 571
                     EA  SS +  DGG DVDKKADEFIAK REQIRLQRI+SIKRS+GQ++RN+
Sbjct: 611 ----------EASPSSGI--DGGSDVDKKADEFIAKVREQIRLQRIDSIKRSSGQMKRNS 613

BLAST of CmoCh16G011720 vs. NCBI nr
Match: gi|1009157195|ref|XP_015896642.1| (PREDICTED: uncharacterized protein LOC107430318 [Ziziphus jujuba])

HSP 1 Score: 383.3 bits (983), Expect = 7.6e-103
Identity = 279/610 (45.74%), Positives = 370/610 (60.66%), Query Frame = 1

Query: 1   MAESDVPAKPPNLPPR-KDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLL 60
           MA++D   K   L    +++  PSKF  H LYK  +   FL+ILPL PSQAPEF+NQT+ 
Sbjct: 1   MADTDSYIKHQKLVNSDQNQQNPSKFYYHFLYKAAIVTVFLIILPLFPSQAPEFINQTVF 60

Query: 61  TRTWELLHLLFVGIAVSYGLFSRRNDEKEDEISVSNFDNVQSYVSGLLHVSSVFDDEAET 120
           TR+WELLHLLFVGIA+SYGLFSRRNDE E E + S FDN QSY+S  L VSSVFDD+ E 
Sbjct: 61  TRSWELLHLLFVGIAISYGLFSRRNDETEKENNSSKFDNAQSYMSRFLQVSSVFDDDTEI 120

Query: 121 PSANDESMSLSDGNKVQTWSNRYFRNES-VAVSEESPVVNEQRVR----SEKPLLLPVRS 180
            S +DE       NK+QTWS++Y+RNE  V V++E+ VV+EQR      SEKPLLLPVRS
Sbjct: 121 QSGSDE-------NKIQTWSSQYYRNEPVVVVAQENSVVDEQRGSSSRISEKPLLLPVRS 180

Query: 181 LKSRVVVDD-----------ESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDN 240
           LKSRV   D            S  +  S S+   RR   +S RS NG+ GG+    +ED 
Sbjct: 181 LKSRVPDTDGGVVSVNETSSNSVPLGRSNSKTVPRRFSINSSRSRNGDFGGLEHQELEDK 240

Query: 241 FNENVALPSPVPWRSRSGRTEVQEEADN--PPMYSPAVPMEESESNWIDSRSSRPQTSRS 300
             ENV LPSP+PWRSRSGR E++E+ DN  PP+Y+P   MEESE N ++SR SR   SRS
Sbjct: 241 SKENVVLPSPIPWRSRSGRMEMKEDVDNNSPPLYTPPPSMEESEFNRVESRVSRTHVSRS 300

Query: 301 SQASAIKLSPPSPSPSPS-PSPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPP 360
           S+ ++ K S P  SPSPS  SP+K +PS ++S E +AK++E SV KK+F+ S PPPPPPP
Sbjct: 301 SRPNS-KNSSPKLSPSPSLSSPKKLTPSSSLSSESQAKNAEDSV-KKTFYMSSPPPPPPP 360

Query: 361 PPPHVRRIASMKPSSWLNDNDVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEA 420
           PPP  +R +++KPSS   +  V  +K+ +RS T+     + R +G+        NS+AE+
Sbjct: 361 PPPMFQRSSTLKPSSGYINGVVSSEKEFRRSFTSESMDLNWR-SGETFG--GRVNSAAES 420

Query: 421 LPRNYDDSLSMGKSTRKIRPGEVANEPPRRGREFGGYDQLKGKM---------IDQNAHV 480
             R+  DSLSMG+S R I+PGE  +    RGRE  G   L G++         +     V
Sbjct: 421 QLRSQFDSLSMGRSVRTIKPGENVS---GRGREIVGERGLNGRVERKVLETTTLIGRKRV 480

Query: 481 QAFEENPIEFPNDNKKELVEKLSMETDDDMESKEEDNNMVGKFIREDNGEPFNVNRRDNE 540
              ++   +  + N+  +   +  E  +  E +EE+   +      +  E  +    D  
Sbjct: 481 GFEDQTSFKTEHLNRGSVWNPIYGEFQEQEEEEEEEEEDLAVEALIETEEEMDSEDDDQI 540

Query: 541 RSSSNELEAGSSSNLS-----------NDGGPDVDKKADEFIAKFREQIRLQRIESIKRS 571
                + + G ++ LS           +DGGPDVDKKADEFIAKFREQIRLQRIESIKRS
Sbjct: 541 GGGFMQNDIGENTKLSPKNEPTPSGSVSDGGPDVDKKADEFIAKFREQIRLQRIESIKRS 595

BLAST of CmoCh16G011720 vs. NCBI nr
Match: gi|641842444|gb|KDO61349.1| (hypothetical protein CISIN_1g009613mg [Citrus sinensis])

HSP 1 Score: 377.9 bits (969), Expect = 3.2e-101
Identity = 277/591 (46.87%), Positives = 361/591 (61.08%), Query Frame = 1

Query: 1   MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60
           MA  D  AK   L   +++A PSKF +H  YK L+A  FLVILPL PSQAPEF+NQ+L T
Sbjct: 1   MANMDPYAKQKMLKTEENKANPSKFYTHFFYKALIASIFLVILPLFPSQAPEFINQSLFT 60

Query: 61  RTWELLHLLFVGIAVSYGLFSRRNDEKEDEI-SVSNFDNVQSYVSGLLHVSSVFDDEAET 120
           R+WELLHL+FVGIAVSYGLFS RNDE E E  + + FDN Q+YVS  L VSSVFDDE E 
Sbjct: 61  RSWELLHLVFVGIAVSYGLFSHRNDETEKETNNHTKFDNAQTYVSRFLQVSSVFDDENEN 120

Query: 121 PSANDESMSLSDGNKVQTWSNRYFRNESVAV--SEESPVVNEQRVR-------SEKPLLL 180
            S++DE       NKVQTWSN+Y+RNE V V   E S +  E  V         EKPLLL
Sbjct: 121 SSSSDE-------NKVQTWSNQYYRNEPVVVVAKEHSALDQEHMVSCSSFGGGGEKPLLL 180

Query: 181 PVRSLKSRVVVDD------ESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDNF 240
           PVRSLKSRV   D      E +++S S+S   SRR  S+S +  N E+GG +   +E+  
Sbjct: 181 PVRSLKSRVSDTDTVEPIKEFKSLSRSSSNSGSRRFSSNSNKRKNRELGGFDQVKLEEKL 240

Query: 241 NENVALPSPVPWRSRSGRTEVQEEADNP-PMYSPAVP-MEESESNWIDSRSSRPQTSRSS 300
           NENV LPSP+PWR+RSGR E++E+ D+P P  +P  P +EE++ N ++SRS R Q  RS+
Sbjct: 241 NENVVLPSPIPWRTRSGRMEMKEDVDSPVPPLNPLPPSVEETDLNGLESRSLRSQMPRSN 300

Query: 301 QASAIKLSPP-SPSPSPSPSPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPP 360
           + +A   SP  SPSPS S SP+K SPSP++S E +AK+SE  VRKKSF+   PPPPPPPP
Sbjct: 301 RPNATSSSPKLSPSPSLS-SPKKLSPSPSLSSESQAKNSEDLVRKKSFY--RPPPPPPPP 360

Query: 361 PPHVRRIASMKPSSWLNDNDVPHQKDLKRSVTTSKPRS--SIRATGDDIDMVMGTNSSAE 420
           PP + R ++MK SS L+++    +K+L RS TT   +S  ++R+T  + D    T+   E
Sbjct: 361 PPMMYRSSTMKVSSGLDNDGASFEKNLNRSFTTETKQSVRTMRSTDRETDQ---TSFKTE 420

Query: 421 ALPRNYDDSLSMGKSTRKIRPGEVANEPPRRGREFGGYDQLKGKMIDQNAHVQAFEENPI 480
            L R           +    P     + P + RE         + +D+   V+  E++  
Sbjct: 421 QLRR----------ESVNFMPNASFMQFPEQDRE---------EFVDK-VVVETDEDSGT 480

Query: 481 EFPNDNKKELVEKLSMETDDDMESKEEDNNMVGKFIREDNGEPFNVNRRDNERSSSNELE 540
           E+ ++ ++E  E +  ET                        PF  N    +RSS+NE  
Sbjct: 481 EYDDEEEEEEEEDVMGET------------------------PFISN---IDRSSNNEAA 531

Query: 541 AGSSSNLSNDGGPDVDKKADEFIAKFREQIRLQRIESIKRSTGQIRRNTSK 571
           A S S+  +DGGPDVDKKADEFIAKFREQIRLQRIESIKRS+ QI RN+S+
Sbjct: 541 AASISSSVSDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSAQISRNSSR 531
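
The identity and positives percentages in the HSP header above are the match counts divided by the alignment length (591 columns, gaps included). A minimal sketch of that arithmetic follows; the helper name `percent` is hypothetical and is not part of BLAST or of this database.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP header: 277 identical and 361 positive columns out of 591.
identity = percent(277, 591)
positives = percent(361, 591)
print(identity, positives)
```

The same calculation applies to any HSP whose header reports counts in the `matches/length` form.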

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
A0A0A0L1H7_CUCSA  2.6e-206  72.53     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G019360 PE=4 SV=1
A0A061G512_THECC  5.1e-106  47.84     Hydroxyproline-rich glycoprotein family protein, putative OS=Theobroma cacao GN=...
A0A067F1T8_CITSI  2.2e-101  46.87     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g009613mg PE=4 SV=1
B9SZP6_RICCO      1.2e-99   48.09     Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0006170 PE=4 SV=1
U5GTV3_POPTR      2.7e-99   46.90     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s15480g PE=4 SV=1

Match Name   E-value  Identity  Description
AT4G16790.1  5.4e-30  37.92     hydroxyproline-rich glycoprotein family protein
AT3G60380.1  1.2e-21  29.77     FUNCTIONS IN: molecular_function unknown

Match Name                        E-value   Identity  Description
gi|659095884|ref|XP_008448814.1|  2.8e-206  71.82     PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein DDB_G0284459 [Cucumis me...
gi|778675313|ref|XP_011650387.1|  3.7e-206  72.53     PREDICTED: uncharacterized protein DDB_G0284459 [Cucumis sativus]
gi|590676526|ref|XP_007039761.1|  7.4e-106  47.84     Hydroxyproline-rich glycoprotein family protein, putative [Theobroma cacao]
gi|1009157195|ref|XP_015896642.1| 7.6e-103  45.74     PREDICTED: uncharacterized protein LOC107430318 [Ziziphus jujuba]
gi|641842444|gb|KDO61349.1|       3.2e-101  46.87     hypothetical protein CISIN_1g009613mg [Citrus sinensis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR008480  DUF761_pln
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh16G011720.1  CmoCh16G011720.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                            Source   Source Term    Source Description   Alignment
IPR008480  Protein of unknown function DUF761, plant  PFAM     PF05553        DUF761               coord: 532..566, score: 2.6
None       No IPR available                           PANTHER  PTHR34059      FAMILY NOT NAMED     coord: 502..570, score: 1.2E-142; coord: 1..480, score: 1.2E
None       No IPR available                           PANTHER  PTHR34059:SF4  SUBFAMILY NOT NAMED  coord: 1..480, score: 1.2E-142; coord: 502..570, score: 1.2E