CmoCh04G007840 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G007840
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Hydroxyproline-rich glycoprotein family protein
Location: Cmo_Chr04 : 3929395 .. 3931985 (-)
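The location field can be unpacked programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF-style annotation; the page itself does not state this). The helper name `parse_location` is hypothetical and not part of the database.

```python
import re

# Pattern for strings such as "Cmo_Chr04 : 3929395 .. 3931985 (-)"
_LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into sequence ID, start, end, strand and span length."""
    match = _LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
        "length": end - start + 1,
    }

location = parse_location("Cmo_Chr04 : 3929395 .. 3931985 (-)")
print(location["seqid"], location["strand"], location["length"])
```

Under that assumption the feature spans 2591 bases on the minus strand of Cmo_Chr04.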
The following sequences are available for this feature:

Gene sequence (with intron)

TTCTTCTCTCTTCTAACGAACTTTCTCTTCACTTTCTGACTGCAAATTCTCCTTATTTGTTCTGTGTTTTCCCCCGAAAAATTTCGTGTAAGAGGAACCACAACTTTCTTCTATGAACACGATCAGCGATTCCCTGGCTTCGATCAATGAGAGCGATGAGGCGGCGTGCGGATGCGGATGCGGATGCTGATGCTGATGCTGATGCTGATGCTGCTGATCTGAGGCCTATGAATAACACTTTTCAGACCATTACTGCGGCGGCCGATGCGATCGCGACCGTTGATCATCGTTTTCCTCGGGCTACTGCCGTCCAGGTATTCGTCATATCACTTCACCTTTTAATCAATCATTTGGAATTTTAGGTGTTAGTGTTAGTTACCTATGGATTGAGATACTGGCCTTGTTTTTGGTAATTATTGTTCGTTCTTGATTCCGTTAGGTATTAGGATGTAGTTGATTTTCGAGGGATGATATATAGCATATTGCGGAATGGAATCTGTGTTGTTTGTTGTGGATTGCGGAATTCTATATAATGCTTTTAATTTCCCCCTTTTTCTGCTACTTCTGTTTGTTTGCTTTGAAACTGTGAATCGTCACATGGATTCAAGCGAAAAAGGGAATCACGAGTTTCGTTTTCCCTCTTTTTCTCCTTTTGGATTTCTTCTCGCATATGATTATTAGATTTTTAAGGTTTGGAAAGTACCAAGATTAGTGTATTGGATTGGACGACTGTGTCCGGAAGCAATGCGCTGTTCCCAACCAATTTTGTTTGTCTTTTTGTAACATAACACTGCCCTTTTTCTTATCAGAAAGGCATCTCTCCATTCTTCTTTTTATCAAAAAGAGATACAGAGAGGTCTTTTTTTTTAAATGTTCTTCTATATAGAGGAATAGTAATGCTTTTTATATAAAGATTCCAACAAATAGGACTCTCATTTGCCCAAAATCTTTGCACTGTTCTGTACTGATGTCATTATCACTTTCCTCCTTTTCTGTTTGTTTGATTACCCCCAGAAAAGAAGATGGGGTAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAACGAATTGGGCATGCTGTCCTTGTCCCAGAACCAAGTCCTTCGCCTGAGGCTCATCAAAATTCATTGCAATCCCCAGACATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGTATCCTTTCTTCAATCAGAGCCACCTTCTGCTACACAATCACCTTCAAATATACTCTCCTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCGATTTTTGCCATTGGCCCATTTGCTCATGAGACACAGCTTGTATCTCCACCTCTGAATTTTTCTACTCTCACCACTGAACCATCGACTCCTTCCTTCACTCCTCCTGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTTTCTTCAACCTACCCTTCCGAAAGCTGAGTCTGATGACCAATATTCATGTCCTAATGATGACTTTCAATCTTATCAATTCTATCCTGGTAGCCCAGTTAGCAACCTCATATCGCCACGCTCTGCCATTTCTCTTTCTGGGGCATCTTCGCCTTGGACAGATTTAGATTTTGCTTCCTCTGCTTCTCAATTTTCTAATTTCTCATTGGATGTTCCACCTGCGCTGTTGAACCTTGACAGACAAGGCCAAAGTTCTGATTCTTGCACTCAAAATTCTGTAGGATTCAAATCGAATGATGATGATTTTGATTTGGATCCTCGAACTTCAGACTCAATGAATGAATCCCAAAATATTCAAATTCTCATTGATGGAAGCCAAATGGAGGAACCTGATGTTGCTAATCATAGATTCTCATTTGAGTTATCTGATGAAGATTCTTTATTAAGAAACATAGAAAGTAAGCCACTGGAGTCAAATGTTGCAGTTGCATCATCTCCAATGCATGAAACATTTGAAACGGCTAAAGAAACTTCTTCTGGTGGTGG
TCATAGCTCAAATGGTATAGAAGAAAAGGCAGCAGACGGTGAAGAAGCAAATCAGCATCAAGAACATCATCATTCTACTACTCTTGGGTCTGTGAATGAATTTAATTTTGATAATGGCAATGGAAGTAATGCACTTAAGCCTAATATCCACTCAGACTGGTGGGCTAATGCGAAAGATGTAGAGACAAAAGGCACGACCACGGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGAGCAAACTGGTGATTATCCTCTGGAATTTCCTCATGCCCATCATGTTTTGCAGTTGCAATTTAGTAGGTAATAGGTAAGACAAATAGCTAAAGGACTGGTGAGCTTTGAAGGTAAAAAAGAGGACAAATCATGAAAAGAGTAAAACCAGAAGCCATATTCTTTTCAACAATCTGACCTCCTAAACACAGGCAGGTCTGAATAGTATGATAATTAGAAACCTGTAGGCGACAATGGGCCCTATTAACATACAGTAGTGGCTCCTCACTTGAATTGTAACAGCTATTAGTATTCTGTAGAAATTGGAAGTGTGAAAATATGGTAATAAAAATTGTTTTTTATCTTTTGACAGC

mRNA sequence

TTCTTCTCTCTTCTAACGAACTTTCTCTTCACTTTCTGACTGCAAATTCTCCTTATTTGTTCTGTGTTTTCCCCCGAAAAATTTCGTGTAAGAGGAACCACAACTTTCTTCTATGAACACGATCAGCGATTCCCTGGCTTCGATCAATGAGAGCGATGAGGCGGCGTGCGGATGCGGATGCGGATGCTGATGCTGATGCTGATGCTGATGCTGCTGATCTGAGGCCTATGAATAACACTTTTCAGACCATTACTGCGGCGGCCGATGCGATCGCGACCGTTGATCATCGTTTTCCTCGGGCTACTGCCGTCCAGAAAAGAAGATGGGGTAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAACGAATTGGGCATGCTGTCCTTGTCCCAGAACCAAGTCCTTCGCCTGAGGCTCATCAAAATTCATTGCAATCCCCAGACATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGTATCCTTTCTTCAATCAGAGCCACCTTCTGCTACACAATCACCTTCAAATATACTCTCCTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCGATTTTTGCCATTGGCCCATTTGCTCATGAGACACAGCTTGTATCTCCACCTCTGAATTTTTCTACTCTCACCACTGAACCATCGACTCCTTCCTTCACTCCTCCTGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTTTCTTCAACCTACCCTTCCGAAAGCTGAGTCTGATGACCAATATTCATGTCCTAATGATGACTTTCAATCTTATCAATTCTATCCTGGTAGCCCAGTTAGCAACCTCATATCGCCACGCTCTGCCATTTCTCTTTCTGGGGCATCTTCGCCTTGGACAGATTTAGATTTTGCTTCCTCTGCTTCTCAATTTTCTAATTTCTCATTGGATGTTCCACCTGCGCTGTTGAACCTTGACAGACAAGGCCAAAGTTCTGATTCTTGCACTCAAAATTCTGTAGGATTCAAATCGAATGATGATGATTTTGATTTGGATCCTCGAACTTCAGACTCAATGAATGAATCCCAAAATATTCAAATTCTCATTGATGGAAGCCAAATGGAGGAACCTGATGTTGCTAATCATAGATTCTCATTTGAGTTATCTGATGAAGATTCTTTATTAAGAAACATAGAAAGTAAGCCACTGGAGTCAAATGTTGCAGTTGCATCATCTCCAATGCATGAAACATTTGAAACGGCTAAAGAAACTTCTTCTGGTGGTGGTCATAGCTCAAATGGTATAGAAGAAAAGGCAGCAGACGGTGAAGAAGCAAATCAGCATCAAGAACATCATCATTCTACTACTCTTGGGTCTGTGAATGAATTTAATTTTGATAATGGCAATGGAAGTAATGCACTTAAGCCTAATATCCACTCAGACTGGTGGGCTAATGCGAAAGATGTAGAGACAAAAGGCACGACCACGGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGAGCAAACTGGTGATTATCCTCTGGAATTTCCTCATGCCCATCATGTTTTGCAGTTGCAATTTAGTAGGTAATAGGTAAGACAAATAGCTAAAGGACTGGTGAGCTTTGAAGGTAAAAAAGAGGACAAATCATGAAAAGAGTAAAACCAGAAGCCATATTCTTTTCAACAATCTGACCTCCTAAACACAGGCAGGTCTGAATAGTATGATAATTAGAAACCTGTAGGCGACAATGGGCCCTATTAACATACAGTAGTGGCTCCTCACTTGAATTGTAACAGCTATTAGTATTCTGTAGAAATTGGAAGTGTGAAAATATGGTAATAAAAATTGTTTTTTATCTTTTGACAGC

Coding sequence (CDS)

ATGAGAGCGATGAGGCGGCGTGCGGATGCGGATGCGGATGCTGATGCTGATGCTGATGCTGATGCTGCTGATCTGAGGCCTATGAATAACACTTTTCAGACCATTACTGCGGCGGCCGATGCGATCGCGACCGTTGATCATCGTTTTCCTCGGGCTACTGCCGTCCAGAAAAGAAGATGGGGTAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAACGAATTGGGCATGCTGTCCTTGTCCCAGAACCAAGTCCTTCGCCTGAGGCTCATCAAAATTCATTGCAATCCCCAGACATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGTATCCTTTCTTCAATCAGAGCCACCTTCTGCTACACAATCACCTTCAAATATACTCTCCTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCGATTTTTGCCATTGGCCCATTTGCTCATGAGACACAGCTTGTATCTCCACCTCTGAATTTTTCTACTCTCACCACTGAACCATCGACTCCTTCCTTCACTCCTCCTGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTTTCTTCAACCTACCCTTCCGAAAGCTGAGTCTGATGACCAATATTCATGTCCTAATGATGACTTTCAATCTTATCAATTCTATCCTGGTAGCCCAGTTAGCAACCTCATATCGCCACGCTCTGCCATTTCTCTTTCTGGGGCATCTTCGCCTTGGACAGATTTAGATTTTGCTTCCTCTGCTTCTCAATTTTCTAATTTCTCATTGGATGTTCCACCTGCGCTGTTGAACCTTGACAGACAAGGCCAAAGTTCTGATTCTTGCACTCAAAATTCTGTAGGATTCAAATCGAATGATGATGATTTTGATTTGGATCCTCGAACTTCAGACTCAATGAATGAATCCCAAAATATTCAAATTCTCATTGATGGAAGCCAAATGGAGGAACCTGATGTTGCTAATCATAGATTCTCATTTGAGTTATCTGATGAAGATTCTTTATTAAGAAACATAGAAAGTAAGCCACTGGAGTCAAATGTTGCAGTTGCATCATCTCCAATGCATGAAACATTTGAAACGGCTAAAGAAACTTCTTCTGGTGGTGGTCATAGCTCAAATGGTATAGAAGAAAAGGCAGCAGACGGTGAAGAAGCAAATCAGCATCAAGAACATCATCATTCTACTACTCTTGGGTCTGTGAATGAATTTAATTTTGATAATGGCAATGGAAGTAATGCACTTAAGCCTAATATCCACTCAGACTGGTGGGCTAATGCGAAAGATGTAGAGACAAAAGGCACGACCACGGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGA
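As a quick sanity check, the CDS can be translated into the protein that serves as the query in the BLAST alignments. The sketch below assumes the standard nuclear genetic code. `translate` is a hypothetical helper written for illustration, not a function provided by the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order at each position.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon, if any
    # Codons containing ambiguous bases (e.g. N) are rendered as 'X'.
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First ten codons of the CDS shown above.
print(translate("ATGAGAGCGATGAGGCGGCGTGCGGATGCG"))  # MRAMRRRADA
```

The first ten residues, MRAMRRRADA, agree with the BLAST query lines, where the residues from position 4 onward read MRRRADAD.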
BLAST of CmoCh04G007840 vs. Swiss-Prot
Match: Y1666_ARATH (Uncharacterized protein At1g76660 OS=Arabidopsis thaliana GN=At1g76660 PE=2 SV=1)

HSP 1 Score: 140.6 bits (353), Expect = 4.4e-32
Identity = 128/316 (40.51%), Positives = 156/316 (49.37%), Query Frame = 1

Query: 56  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDIVL 115
           Q++RWG C  ++ CF S K  KRI  A  +PE      S    AHQ    N+  +  I L
Sbjct: 7   QRKRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINL 66

Query: 116 PFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSS-IFAIGPFAHETQLV 175
              APPSSP SF  S  PS TQSP+    + SL AN  SP GPSS ++A GP+AHETQLV
Sbjct: 67  SLLAPPSSPASFTNSALPSTTQSPN---CYLSLAAN--SPGGPSSSMYATGPYAHETQLV 126

Query: 176 SPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDD 235
           SPP+ FST TTEPST  FT PPE   LT PSSP+VP+A+FL  ++    S   +   ND 
Sbjct: 127 SPPV-FSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGHY--NDL 186

Query: 236 FQSYQFYPGSPVSNLISPRSAISLSGASSPWT--------------DLDFASSASQFSNF 295
             +Y  YPGSP S L SP S  S  G  SP                D +  S+  Q SNF
Sbjct: 187 QATYSLYPGSPASALRSPISRASGDGLLSPQNGKCSRSDSGNTFGYDTNGVSTPLQESNF 246

Query: 296 SLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLDPRTSDSMNESQNIQILIDGSQM 347
                 A   LD      D     + G  S   D D+ P T+   N +QN Q       M
Sbjct: 247 FCPETFAKFYLDH-----DPSVPQNGGRLSVSKDSDVYP-TNGYGNGNQNRQNRSPKQDM 306
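The percentages in the HSP header are the match counts divided by the alignment length. The sketch below reproduces that arithmetic for the hit above. `percent_from_fraction` is a hypothetical helper name, and the two-decimal formatting is an assumption based on how the page prints the values.

```python
def percent_from_fraction(fraction):
    """Convert a 'matches/length' string such as '128/316' to a percentage string."""
    matches, length = (int(part) for part in fraction.split("/"))
    if length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * matches / length:.2f}"

# Values taken from the HSP header of the Swiss-Prot hit above.
print(percent_from_fraction("128/316"))  # identical residues: 40.51
print(percent_from_fraction("156/316"))  # identical or conservative residues: 49.37
```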

BLAST of CmoCh04G007840 vs. TrEMBL
Match: A0A0A0KY57_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G047980 PE=4 SV=1)

HSP 1 Score: 457.6 bits (1176), Expect = 1.8e-125
Identity = 257/354 (72.60%), Positives = 286/354 (80.79%), Query Frame = 1

Query: 128 QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPE 187
           QSP+ ++SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPSTP FTPPE
Sbjct: 2   QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPE 61

Query: 188 SIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS 247
           SIHLTTPSSPEVPFAQF+QPTLPK ESD+QY+ PNDDFQSYQFYPGSPVS+LISPRS IS
Sbjct: 62  SIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVIS 121

Query: 248 LSGASSPWTDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNSVGFKS 307
            SGASSP  D DFAS  SQF NF L+VPP LLNLD       RQ QS+DSCTQ+S+ FKS
Sbjct: 122 RSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKS 181

Query: 308 NDDDFDLDPRTSDSM------NESQNIQILID--GSQMEEPDVANHRFSFELSDEDSLLR 367
           + +DF L+P+TS+SM      NESQNIQILID    + EEP   NHRFSFELSD D LL+
Sbjct: 182 S-NDFVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQ 241

Query: 368 NIESKPLESN-VAVASSPMHETFETAKETSSGGGHSSNGIEEKA-ADGEEANQHQEHHHS 427
           ++ SKPLESN +AV SSP+HE FET KE S  G H+SN IEEK  ADG+EA+Q QE HHS
Sbjct: 242 SVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQE-HHS 301

Query: 428 TTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR 465
            TLGSV EFNFDNGNGS+   PNI+S+WW NAKD  T+ T TG WSFFPM QQR
Sbjct: 302 VTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR 352

BLAST of CmoCh04G007840 vs. TrEMBL
Match: A0A067JZI1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_20571 PE=4 SV=1)

HSP 1 Score: 381.3 bits (978), Expect = 1.6e-102
Identity = 239/459 (52.07%), Positives = 291/459 (63.40%), Query Frame = 1

Query: 24  DLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAV 83
           D RP NN   TI AAA AIA+ ++R P+AT VQKRRWGSC+S+YWCFG  + RKRIGHAV
Sbjct: 7   DSRPSNNALDTINAAASAIASAENRVPQAT-VQKRRWGSCFSVYWCFGYNRHRKRIGHAV 66

Query: 84  LVPE---PSPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLT 143
           LVPE   P     A +NS Q+P I LPF APPSSP SFLQSEPPSA+QSP+ +LS TS++
Sbjct: 67  LVPETPGPRNDSSAAENSTQTPTITLPFVAPPSSPASFLQSEPPSASQSPTGVLSLTSIS 126

Query: 144 ANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEV 203
           ANMYSP GPSSIFAIGP+AHETQLVSPP+ FST TTEPST  FT PPES+HLTTPSSPEV
Sbjct: 127 ANMYSPSGPSSIFAIGPYAHETQLVSPPV-FSTFTTEPSTAPFTPPPESVHLTTPSSPEV 186

Query: 204 PFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLD 263
           PFAQ L P++   E+  ++   N +FQSYQ YPGSPV  LISP S IS SG SSP+ D +
Sbjct: 187 PFAQLLDPSIRNVEAGLRFPLSNYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDGE 246

Query: 264 FASSASQFSNFSLDVPPALLNLDR-----QGQSSDSCTQNSVGFKSNDDDFDLDPRTSDS 323
           FA+    F  F +  PP LLNLD+      G    S T      +     F  D   SD 
Sbjct: 247 FAAG---FLEFRMGEPPKLLNLDKLSTHEWGSRCGSGTLTPDAVRPTSCSFTPDRPFSDF 306

Query: 324 MNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKP----------LESNVA 383
           ++   +     +G+Q +E  V +HR SFEL+ E  +L   E  P          LE+   
Sbjct: 307 VSHKHS----DNGNQNDE--VGDHRLSFELAAE-GVLGCEEQNPASPVKIIGDSLENGTV 366

Query: 384 VASSPMHETFETAKETSSGGGHSSNGIEEKAA-DGEEANQHQEHHHSTTLGSVNEFNFDN 443
            A +   ++ E   +  S  G +SNG  EKA+ DGE+A    E H S TLGS+ EFNFDN
Sbjct: 367 AART--EDSTEVVDDFESRVGETSNGTPEKASTDGEKAPPRHEKHRSITLGSLKEFNFDN 426

Query: 444 GNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQ 463
            +G ++ KPN   DWWAN  D+  +   T  WSFFPM Q
Sbjct: 427 VDGGDSHKPNAGPDWWANGSDIGKEDGATKNWSFFPMMQ 451

BLAST of CmoCh04G007840 vs. TrEMBL
Match: B9RCD8_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1687100 PE=4 SV=1)

HSP 1 Score: 372.1 bits (954), Expect = 1.0e-99
Identity = 229/458 (50.00%), Positives = 286/458 (62.45%), Query Frame = 1

Query: 23  ADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHA 82
           AD RP NN   TI AAA  IA+ ++R P+AT +QKRRWGSCWS+YWCFG  + RKRIGHA
Sbjct: 9   ADSRPSNNALDTINAAASVIASAENRVPQAT-IQKRRWGSCWSVYWCFGYHRHRKRIGHA 68

Query: 83  VLVPEPSP----SPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTS 142
           VLVPE S     S  A   + Q+P I LPF APPSSP SFLQSEPPSA+QSP+ ILS TS
Sbjct: 69  VLVPENSAPGNDSSAAENPTTQAPTITLPFVAPPSSPASFLQSEPPSASQSPAGILSLTS 128

Query: 143 LTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSP 202
           ++A+MYSP GP+SIFAIGP+AHETQLVSPP  FST TTEPST  FT PPES+ LTTPSSP
Sbjct: 129 VSASMYSPSGPASIFAIGPYAHETQLVSPPA-FSTFTTEPSTAPFTPPPESVQLTTPSSP 188

Query: 203 EVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTD 262
           EVPFAQ L+P+    E+  ++   N +FQSYQFYPGSPV  LISP S IS SG SSP+ D
Sbjct: 189 EVPFAQLLEPSNRNGEAGLRFPFSNYEFQSYQFYPGSPVGQLISPSSGISGSGTSSPFPD 248

Query: 263 LDFASSASQFSNFSLDVPPALLNLDRQ-----GQSSDSCTQNSVGFKSNDDDFDLDPRTS 322
            +FA++  +F  F + VPP LLNLD+      G    S T      ++    F LD + S
Sbjct: 249 GEFAAAGPRFLEFQMAVPPKLLNLDKLSVHECGSRQGSGTLTPDAVRATSCSFPLDRQCS 308

Query: 323 DSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPL--------ESNVA 382
           D  +   +       ++ ++  VA+ R SF+LS ED+ LR  E KP              
Sbjct: 309 DIASNRHS------DNENKDDQVADLRVSFDLSAEDA-LRYAEPKPASPVKIMPESMKNE 368

Query: 383 VASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNG 442
           +A+  + ++ E         G +SNGI E+A+ G E     + H + TLG+  EFNFDN 
Sbjct: 369 IAAEKVQKSSEIRHNFECRVGETSNGILEQASTGGEKTPRHQKHRTLTLGTFKEFNFDNA 428

Query: 443 NGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQ 463
           +G    KP+   DWW N  DV  +  T   WSFFP+ Q
Sbjct: 429 DG--VPKPSAGPDWWDNGSDVGKEDFTAKNWSFFPVMQ 455

BLAST of CmoCh04G007840 vs. TrEMBL
Match: B9GKF9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s09590g PE=4 SV=2)

HSP 1 Score: 365.5 bits (937), Expect = 9.3e-98
Identity = 237/460 (51.52%), Positives = 295/460 (64.13%), Query Frame = 1

Query: 26  RPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLV 85
           R  NNT +TI AAA AIA+ ++R P+AT VQKRRWGSCWSIY CFG  K +K+IGHAVL 
Sbjct: 9   RAANNTLETINAAATAIASAENRVPQAT-VQKRRWGSCWSIYLCFGYQKHKKQIGHAVLF 68

Query: 86  PEPSP---SPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTAN 145
           PEPS       A +N  Q+P + LPFAAPPSSP SF QSEPPS TQSP+ ++S TS++A+
Sbjct: 69  PEPSAPGNGAPASENPTQAPAVTLPFAAPPSSPASFFQSEPPSVTQSPAGLVSLTSISAS 128

Query: 146 MYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPF 205
           MYSP GP+SIFAIGP+AHETQLVSPP+ FST TTEPST  FT PPES+HLTTPSSPEVPF
Sbjct: 129 MYSPSGPASIFAIGPYAHETQLVSPPV-FSTFTTEPSTAPFTPPPESVHLTTPSSPEVPF 188

Query: 206 AQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFA 265
           AQFL P+L   ++  ++     DFQSYQF+PGSPV  LISP S IS SG SSP+ D +FA
Sbjct: 189 AQFLDPSLRNGDTGLRFPF---DFQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFA 248

Query: 266 SSASQFSNFSLDVPPALLNLDRQG-------QSSDSCTQNSVGFKSNDDDFDLDPRTSDS 325
              + F  F +  PP LLNLD+         Q S + T  SV  +    +F L  + SD 
Sbjct: 249 VGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTPESV--RRGSPNFLLHRQFSDV 308

Query: 326 MNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMH-ET 385
            +  ++      G+  +   V NHR SFEL+ ED+  R +E KP  S   + + P + E 
Sbjct: 309 PSRPRS------GNGHKNGQVVNHRVSFELTAEDA-SRCVEEKPAFS---IKTVPEYVEN 368

Query: 386 FETAKETSSGG----------GHSSNGIEEKAA-DGEEANQHQEHHHSTTLGSVNEFNFD 445
              AKE  + G          G +SN   E A+ DGE A QH++   S TLGSV EFNFD
Sbjct: 369 GTQAKEEKNSGESIQSFECRVGVTSNDSPEMASTDGEAAPQHRK-QQSITLGSVKEFNFD 428

Query: 446 NGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQ 463
           N +  ++ KP+  S+WWAN   +  +G TT  WSFFPM Q
Sbjct: 429 NADEGDSRKPS-SSNWWANGSVIGKEGETTKNWSFFPMVQ 449

BLAST of CmoCh04G007840 vs. TrEMBL
Match: M5WM36_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005552mg PE=4 SV=1)

HSP 1 Score: 360.9 bits (925), Expect = 2.3e-96
Identity = 243/463 (52.48%), Positives = 288/463 (62.20%), Query Frame = 1

Query: 26  RPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLV 85
           R  NN  +TI AAA AIA  ++R P+AT VQKRRWGS WS+YWCFG  + +KRIGHAVLV
Sbjct: 9   RTGNNALETINAAASAIAAAENRVPQAT-VQKRRWGSWWSMYWCFGFQRHKKRIGHAVLV 68

Query: 86  PEPSP----SPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTA 145
           PE +     +P A +N +Q+P IVLPF APPSSP SFLQSEPPSATQSP+    F SLTA
Sbjct: 69  PETTDRGGDAPRA-ENPIQTPSIVLPFVAPPSSPASFLQSEPPSATQSPAG---FFSLTA 128

Query: 146 NMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVP 205
           +MYSP GP+SIFAIGP+AHETQLVSPP+ FST TTEPST  FT PPES+HLTTPSSPEVP
Sbjct: 129 SMYSPSGPTSIFAIGPYAHETQLVSPPV-FSTFTTEPSTAPFTPPPESVHLTTPSSPEVP 188

Query: 206 FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDF 265
           FAQ L P     E   ++   + +FQSYQ YPGSPV  LISP S IS SG SSP+ DL+F
Sbjct: 189 FAQLLDPHFRNGEGGQRFPLSHYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDLEF 248

Query: 266 ASSASQFSNFSLDVPPALLNLD-----RQGQSSDSCTQNSVGFKS-NDDDFDLDPRTSD- 325
           A+    F  F    PP LLNLD       G    S +    G KS + D F L P+T + 
Sbjct: 249 AARGHHFLEFRTGDPPKLLNLDILSTRDWGSRLGSGSVTPDGAKSTSSDGFLLKPQTPEV 308

Query: 326 -----SMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASS 385
                S N  +N  I I           NHR SFELS E+ ++R +E KP+    AV++S
Sbjct: 309 VLNPRSNNRGRNNDISI-----------NHRVSFELSSEE-VIRCVEKKPVALAEAVSTS 368

Query: 386 PMHETFETAKETSS--------GGGHSSNGIEEKA-ADGEEANQHQEHHHSTTLGSVNEF 445
                   +KE  S          G +SN   EKA ADGEEA  H +   S TLGSV EF
Sbjct: 369 LEDTEKAQSKEDPSKVVSSSICPVGETSNDAAEKAVADGEEAQLHPK-QRSITLGSVKEF 428

Query: 446 NFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQ 463
           NFDN +G ++   +I SDWWAN K    +   T  WSFFPM Q
Sbjct: 429 NFDNPDGGDS-GNSIGSDWWANEKVDAKENGPTKNWSFFPMMQ 451

BLAST of CmoCh04G007840 vs. TAIR10
Match: AT5G52430.1 (AT5G52430.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 221.5 bits (563), Expect = 1.1e-57
Identity = 178/463 (38.44%), Positives = 234/463 (50.54%), Query Frame = 1

Query: 28  MNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE 87
           +NN+ +T+ AAA AI T + R   +++ QK RWG CWS+Y CFG+ K  KRIG+AVLVPE
Sbjct: 5   VNNSVETVNAAATAIVTAESRVQPSSS-QKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPE 64

Query: 88  PSPS---PEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMY 147
           P  S       QNS  S  +VLPF APPSSP SFLQS+P S + SP   L   SLT+N +
Sbjct: 65  PVTSGVPVVTVQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPVGPL---SLTSNTF 124

Query: 148 SPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPP--ESIHLTTPSSPEVPFA 207
           SP  P S+F +GP+A+ETQ V+PP+ FS   TEPST  +TPP   S+H+TTPSSPEVPFA
Sbjct: 125 SPKEPQSVFTVGPYANETQPVTPPV-FSAFITEPSTAPYTPPPESSVHITTPSSPEVPFA 184

Query: 208 QFLQPTLPKAESD------DQYSCPNDDFQSYQFYPGSP-VSNLISPRSAISLSGASSPW 267
           Q L  +L     D       ++S  + +F+S Q  PGSP   NLISP S IS SG SSP+
Sbjct: 185 QLLTSSLELTRRDSTSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPY 244

Query: 268 TDLDFASSASQFSNFSLDVPPALLNLD-----RQGQSSDSCTQNSVGFKSNDDDFDLDPR 327
                    S    F +  PP  L  +     + G    S +   VG  S      L P 
Sbjct: 245 ------PGKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGLASGALTPN 304

Query: 328 --------------TSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIES 387
                         T    N+   +  L +     E  VA+HR SFEL+ ED + R + S
Sbjct: 305 GPEIVSGNLTPNNTTWPLQNQISEVASLANSDHGSEVMVADHRVSFELTGED-VARCLAS 364

Query: 388 KPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHHSTTLGSV 447
           K   S+  + ++   ET E      S        IE+++ D E      +   S+++GS 
Sbjct: 365 KLNRSHDRMNNNDRIETEE------SSSTDIRRNIEKRSGDRENEQHRIQKLSSSSIGSS 424

Query: 448 NEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFP 460
            EF FD                  N KD   +     +WSFFP
Sbjct: 425 KEFKFD------------------NTKDENIEKVAGNSWSFFP 431

BLAST of CmoCh04G007840 vs. TAIR10
Match: AT1G63720.1 (AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1))

HSP 1 Score: 196.4 bits (498), Expect = 3.8e-50
Identity = 140/291 (48.11%), Positives = 172/291 (59.11%), Query Frame = 1

Query: 29  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE 88
           NN F TI AAA AIA+ D R  +++ + +KR+W + WS+  CFGS +QRKRIG++VLVPE
Sbjct: 8   NNVFDTINAAASAIASSDDRLHQSSPIHKKRKWWNRWSLLKCFGSSRQRKRIGNSVLVPE 67

Query: 89  P---SPSPEAHQNS-LQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANM 148
           P   S S     NS  +S    LPF APPSSP SF QSEPPSATQSP  ILSF+ L  N 
Sbjct: 68  PVSMSSSNSTTSNSGYRSVITTLPFIAPPSSPASFFQSEPPSATQSPVGILSFSPLPCN- 127

Query: 149 YSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPP---ESIHL--TTPSSPE 208
                  SIFAIGP+AHETQLVSPP+ FST TTEPS+   TPP    SI+L  TTPSSPE
Sbjct: 128 ----NRPSIFAIGPYAHETQLVSPPV-FSTYTTEPSSAPITPPLDDSSIYLTTTTPSSPE 187

Query: 209 VPFAQFLQPTLPKAESDDQYSCPND-DFQSYQFYPGSPVSNLISPRSAISLSGASSPWTD 268
           VPFAQ             ++   +  +FQ YQ  PGSP+  LISP      SG +SP+ D
Sbjct: 188 VPFAQLFNSNHQTGSYGYKFPMSSSYEFQFYQLPPGSPLGQLISPSPG---SGPTSPFPD 247

Query: 269 LDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLD 309
                  S F +F +  PP LL+    G ++  C +  +        FDLD
Sbjct: 248 ----GETSLFPHFQVSDPPKLLSPKTAGVTT-PCKEQKIVRPHKPVSFDLD 284

BLAST of CmoCh04G007840 vs. TAIR10
Match: AT4G25620.1 (AT4G25620.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 184.5 bits (467), Expect = 1.5e-46
Identity = 168/466 (36.05%), Positives = 229/466 (49.14%), Query Frame = 1

Query: 29  NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEP 88
           N++  T+ AAA AI + + R  + ++VQK+R GS WS+YWCFGS K  KRIGHAVLVPEP
Sbjct: 6   NSSVDTVNAAASAIVSAESR-TQPSSVQKKR-GSWWSLYWCFGSKKNNKRIGHAVLVPEP 65

Query: 89  SPSPEA----HQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMY 148
           + S  A      +S  S  I +PF APPSSP SFL S PPSA+ +P   L   SLT N  
Sbjct: 66  AASGAAVAPVQNSSSNSTSIFMPFIAPPSSPASFLPSGPPSASHTPDPGL-LCSLTVN-- 125

Query: 149 SPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQF 208
               P S F IGP+AHETQ V+PP+ FS  TTEPST  FTPP      +PSSPEVPFAQ 
Sbjct: 126 ---EPPSAFTIGPYAHETQPVTPPV-FSAFTTEPSTAPFTPPPE----SPSSPEVPFAQL 185

Query: 209 LQPTLPKAE------SDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDL 268
           L  +L +A        + ++S  + +F+S Q YPGSP  NLISP      SG SSP+   
Sbjct: 186 LTSSLERARRNSGGGMNQKFSAAHYEFKSCQVYPGSPGGNLISPG-----SGTSSPY--- 245

Query: 269 DFASSASQFSNFSLDVPPALLNLD-----RQGQSSDSCTQNSVGFKSNDDDFDLDP---R 328
                      F +  PP  L  +     + G    S +    G  S      L P   +
Sbjct: 246 ---PGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAGQGSRLGSGALTPDGSK 305

Query: 329 TSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPM 388
            +  +      + +I  S      +       ++S+  SL  +       ++ A+   P 
Sbjct: 306 LTSGVVTPNGAETVIRMSYGNLTPLEGSLLDSQISEVASLANSDHGSSRHNDEALV-VPH 365

Query: 389 HETFETAKETSS---GGGHSSNGIEEKAA-----------DGEEANQHQEHHHSTTLGSV 448
             +FE   E  +       + +G  EKA+            GE  ++  +   S + GS 
Sbjct: 366 RVSFELTGEDVARCLASKLNRSGSHEKASGEHLRPNCCKTSGETESEQSQKLRSFSTGSN 425

Query: 449 NEFNFDNGNGSNALKPNIHSDWWANAKDVETKG--TTTGAWSFFPM 461
            EF FD+ N    +   I S+WWAN K V  KG  +   +W+FFP+
Sbjct: 426 KEFKFDSTN--EEMIEKIRSEWWANEK-VAGKGDHSPRNSWTFFPV 443

BLAST of CmoCh04G007840 vs. TAIR10
Match: AT1G76660.1 (AT1G76660.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 140.6 bits (353), Expect = 2.5e-33
Identity = 128/316 (40.51%), Positives = 156/316 (49.37%), Query Frame = 1

Query: 56  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDIVL 115
           Q++RWG C  ++ CF S K  KRI  A  +PE      S    AHQ    N+  +  I L
Sbjct: 7   QRKRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINL 66

Query: 116 PFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSS-IFAIGPFAHETQLV 175
              APPSSP SF  S  PS TQSP+    + SL AN  SP GPSS ++A GP+AHETQLV
Sbjct: 67  SLLAPPSSPASFTNSALPSTTQSPN---CYLSLAAN--SPGGPSSSMYATGPYAHETQLV 126

Query: 176 SPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDD 235
           SPP+ FST TTEPST  FT PPE   LT PSSP+VP+A+FL  ++    S   +   ND 
Sbjct: 127 SPPV-FSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGHY--NDL 186

Query: 236 FQSYQFYPGSPVSNLISPRSAISLSGASSPWT--------------DLDFASSASQFSNF 295
             +Y  YPGSP S L SP S  S  G  SP                D +  S+  Q SNF
Sbjct: 187 QATYSLYPGSPASALRSPISRASGDGLLSPQNGKCSRSDSGNTFGYDTNGVSTPLQESNF 246

Query: 296 SLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLDPRTSDSMNESQNIQILIDGSQM 347
                 A   LD      D     + G  S   D D+ P T+   N +QN Q       M
Sbjct: 247 FCPETFAKFYLDH-----DPSVPQNGGRLSVSKDSDVYP-TNGYGNGNQNRQNRSPKQDM 306

BLAST of CmoCh04G007840 vs. NCBI nr
Match: gi|449457656|ref|XP_004146564.1| (PREDICTED: uncharacterized protein LOC101220378 [Cucumis sativus])

HSP 1 Score: 643.7 bits (1659), Expect = 2.5e-181
Identity = 357/479 (74.53%), Positives = 390/479 (81.42%), Query Frame = 1

Query: 4   MRRRADADADADADADADAADLRPM-NNTFQTITAAADAIATVDHRFPRATAVQKRRWGS 63
           MRRR D D            D RP+ NNTFQTITAAADAIATVDHRFPRATAVQKRRWGS
Sbjct: 1   MRRRTDTD------------DFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGS 60

Query: 64  CWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSE 123
           C SIYWCFGS+KQRKRIGHAVLVPEPSPS E H+N+LQSPDIVLPFAAPPSSPVS LQSE
Sbjct: 61  CLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSE 120

Query: 124 PPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPS 183
           PPSA QSP+ ++SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPSTP 
Sbjct: 121 PPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP- 180

Query: 184 FTPPESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISP 243
           FTPPESIHLTTPSSPEVPFAQF+QPTLPK ESD+QY+ PNDDFQSYQFYPGSPVS+LISP
Sbjct: 181 FTPPESIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISP 240

Query: 244 RSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNS 303
           RS IS SGASSP  D DFAS  SQF NF L+VPP LLNLD       RQ QS+DSCTQ+S
Sbjct: 241 RSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDS 300

Query: 304 VGFKSNDDDFDLDPRTSDSM------NESQNIQILID--GSQMEEPDVANHRFSFELSDE 363
           + FKS+ +DF L+P+TS+SM      NESQNIQILID    + EEP   NHRFSFELSD 
Sbjct: 301 IEFKSS-NDFVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDG 360

Query: 364 DSLLRNIESKPLESN-VAVASSPMHETFETAKETSSGGGHSSNGIEEKA-ADGEEANQHQ 423
           D LL+++ SKPLESN +AV SSP+HE FET KE S  G H+SN IEEK  ADG+EA+Q Q
Sbjct: 361 DVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQ 420

Query: 424 EHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR 465
           E HHS TLGSV EFNFDNGNGS+   PNI+S+WW NAKD  T+ T TG WSFFPM QQR
Sbjct: 421 E-HHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR 464

BLAST of CmoCh04G007840 vs. NCBI nr
Match: gi|659102256|ref|XP_008452033.1| (PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo])

HSP 1 Score: 640.6 bits (1651), Expect = 2.2e-180
Identity = 353/478 (73.85%), Positives = 384/478 (80.33%), Query Frame = 1

Query: 4   MRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC 63
           MRRR D D            D RP+NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC
Sbjct: 1   MRRRTDTD------------DFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC 60

Query: 64  WSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEP 123
            SIYWCFGSLKQRKRIGHAVLVPEPSPS E H+N+LQSPDIVLPFAAPPSSPVS LQSEP
Sbjct: 61  LSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEP 120

Query: 124 PSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSF 183
           PSA QSP+ ++SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPSTP F
Sbjct: 121 PSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPF 180

Query: 184 TPPESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPR 243
           TPPESIHLTTPSSPEVPFAQF+ P+L K ESD+QY+ PNDDFQSYQFYPGSPVS+LISPR
Sbjct: 181 TPPESIHLTTPSSPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPR 240

Query: 244 SAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNSV 303
           S IS SGASSP  D DFAS  SQF NF L+VPP L NLD       RQ QS+DSCTQ+S+
Sbjct: 241 SVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSI 300

Query: 304 GFKSNDDDFDLDPRTSDSM------NESQNIQILID--GSQMEEPDVANHRFSFELSDED 363
            FKS+ +DF L+P TS+SM      NESQNIQILID    + EEP   NHRFSFELSD D
Sbjct: 301 EFKSS-NDFVLNPHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGD 360

Query: 364 SLLRNIESKPLESN-VAVASSPMHETFETAKETSSGGGHSSNGIEEKA-ADGEEANQHQE 423
            L +++ SKPLESN + V SSP+HE FET KE S  G H+SN IEEK  ADG+EA+QHQE
Sbjct: 361 VLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQE 420

Query: 424 HHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR 465
            HHS  LGSV EFNFDN NGS+   P I+SDWW NAKD  T+GTTTGAWSFFP  QQR
Sbjct: 421 -HHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 464

BLAST of CmoCh04G007840 vs. NCBI nr
Match: gi|659102254|ref|XP_008452032.1| (PREDICTED: uncharacterized protein LOC103493162 isoform X1 [Cucumis melo])

HSP 1 Score: 636.0 bits (1639), Expect = 5.3e-179
Identity = 353/479 (73.70%), Positives = 384/479 (80.17%), Query Frame = 1

Query: 4   MRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQ-KRRWGS 63
           MRRR D D            D RP+NNTFQTITAAADAIATVDHRFPRATAVQ KRRWGS
Sbjct: 1   MRRRTDTD------------DFRPVNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGS 60

Query: 64  CWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSE 123
           C SIYWCFGSLKQRKRIGHAVLVPEPSPS E H+N+LQSPDIVLPFAAPPSSPVS LQSE
Sbjct: 61  CLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSE 120

Query: 124 PPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPS 183
           PPSA QSP+ ++SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPSTP 
Sbjct: 121 PPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPP 180

Query: 184 FTPPESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISP 243
           FTPPESIHLTTPSSPEVPFAQF+ P+L K ESD+QY+ PNDDFQSYQFYPGSPVS+LISP
Sbjct: 181 FTPPESIHLTTPSSPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISP 240

Query: 244 RSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNS 303
           RS IS SGASSP  D DFAS  SQF NF L+VPP L NLD       RQ QS+DSCTQ+S
Sbjct: 241 RSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDS 300

Query: 304 VGFKSNDDDFDLDPRTSDSM------NESQNIQILID--GSQMEEPDVANHRFSFELSDE 363
           + FKS+ +DF L+P TS+SM      NESQNIQILID    + EEP   NHRFSFELSD 
Sbjct: 301 IEFKSS-NDFVLNPHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDG 360

Query: 364 DSLLRNIESKPLESN-VAVASSPMHETFETAKETSSGGGHSSNGIEEKA-ADGEEANQHQ 423
           D L +++ SKPLESN + V SSP+HE FET KE S  G H+SN IEEK  ADG+EA+QHQ
Sbjct: 361 DVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQ 420

Query: 424 EHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR 465
           E HHS  LGSV EFNFDN NGS+   P I+SDWW NAKD  T+GTTTGAWSFFP  QQR
Sbjct: 421 E-HHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 465

BLAST of CmoCh04G007840 vs. NCBI nr
Match: gi|700198179|gb|KGN53337.1| (hypothetical protein Csa_4G047980 [Cucumis sativus])

HSP 1 Score: 457.6 bits (1176), Expect = 2.6e-125
Identity = 257/354 (72.60%), Positives = 286/354 (80.79%), Query Frame = 1

Query: 128 QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPE 187
           QSP+ ++SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPSTP FTPPE
Sbjct: 2   QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPE 61

Query: 188 SIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS 247
           SIHLTTPSSPEVPFAQF+QPTLPK ESD+QY+ PNDDFQSYQFYPGSPVS+LISPRS IS
Sbjct: 62  SIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVIS 121

Query: 248 LSGASSPWTDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNSVGFKS 307
            SGASSP  D DFAS  SQF NF L+VPP LLNLD       RQ QS+DSCTQ+S+ FKS
Sbjct: 122 RSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKS 181

Query: 308 NDDDFDLDPRTSDSM------NESQNIQILID--GSQMEEPDVANHRFSFELSDEDSLLR 367
           + +DF L+P+TS+SM      NESQNIQILID    + EEP   NHRFSFELSD D LL+
Sbjct: 182 S-NDFVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQ 241

Query: 368 NIESKPLESN-VAVASSPMHETFETAKETSSGGGHSSNGIEEKA-ADGEEANQHQEHHHS 427
           ++ SKPLESN +AV SSP+HE FET KE S  G H+SN IEEK  ADG+EA+Q QE HHS
Sbjct: 242 SVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQE-HHS 301

Query: 428 TTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR 465
            TLGSV EFNFDNGNGS+   PNI+S+WW NAKD  T+ T TG WSFFPM QQR
Sbjct: 302 VTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR 352

BLAST of CmoCh04G007840 vs. NCBI nr
Match: gi|802738414|ref|XP_012086872.1| (PREDICTED: uncharacterized protein LOC105645786 [Jatropha curcas])

HSP 1 Score: 381.3 bits (978), Expect = 2.4e-102
Identity = 239/459 (52.07%), Positives = 291/459 (63.40%), Query Frame = 1

Query: 24  DLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAV 83
           D RP NN   TI AAA AIA+ ++R P+AT VQKRRWGSC+S+YWCFG  + RKRIGHAV
Sbjct: 7   DSRPSNNALDTINAAASAIASAENRVPQAT-VQKRRWGSCFSVYWCFGYNRHRKRIGHAV 66

Query: 84  LVPE---PSPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLT 143
           LVPE   P     A +NS Q+P I LPF APPSSP SFLQSEPPSA+QSP+ +LS TS++
Sbjct: 67  LVPETPGPRNDSSAAENSTQTPTITLPFVAPPSSPASFLQSEPPSASQSPTGVLSLTSIS 126

Query: 144 ANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEV 203
           ANMYSP GPSSIFAIGP+AHETQLVSPP+ FST TTEPST  FT PPES+HLTTPSSPEV
Sbjct: 127 ANMYSPSGPSSIFAIGPYAHETQLVSPPV-FSTFTTEPSTAPFTPPPESVHLTTPSSPEV 186

Query: 204 PFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLD 263
           PFAQ L P++   E+  ++   N +FQSYQ YPGSPV  LISP S IS SG SSP+ D +
Sbjct: 187 PFAQLLDPSIRNVEAGLRFPLSNYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDGE 246

Query: 264 FASSASQFSNFSLDVPPALLNLDR-----QGQSSDSCTQNSVGFKSNDDDFDLDPRTSDS 323
           FA+    F  F +  PP LLNLD+      G    S T      +     F  D   SD 
Sbjct: 247 FAAG---FLEFRMGEPPKLLNLDKLSTHEWGSRCGSGTLTPDAVRPTSCSFTPDRPFSDF 306

Query: 324 MNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKP----------LESNVA 383
           ++   +     +G+Q +E  V +HR SFEL+ E  +L   E  P          LE+   
Sbjct: 307 VSHKHS----DNGNQNDE--VGDHRLSFELAAE-GVLGCEEQNPASPVKIIGDSLENGTV 366

Query: 384 VASSPMHETFETAKETSSGGGHSSNGIEEKAA-DGEEANQHQEHHHSTTLGSVNEFNFDN 443
            A +   ++ E   +  S  G +SNG  EKA+ DGE+A    E H S TLGS+ EFNFDN
Sbjct: 367 AART--EDSTEVVDDFESRVGETSNGTPEKASTDGEKAPPRHEKHRSITLGSLKEFNFDN 426

Query: 444 GNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQ 463
            +G ++ KPN   DWWAN  D+  +   T  WSFFPM Q
Sbjct: 427 VDGGDSHKPNAGPDWWANGSDIGKEDGATKNWSFFPMMQ 451

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)    Description
Y1666_ARATH   4.4e-32    40.51           Uncharacterized protein At1g76660 OS=Arabidopsis thaliana GN=At1g76660 PE=2 SV=1

Match Name          E-value     Identity (%)    Description
A0A0A0KY57_CUCSA    1.8e-125    72.60           Uncharacterized protein OS=Cucumis sativus GN=Csa_4G047980 PE=4 SV=1
A0A067JZI1_JATCU    1.6e-102    52.07           Uncharacterized protein OS=Jatropha curcas GN=JCGZ_20571 PE=4 SV=1
B9RCD8_RICCO        1.0e-99     50.00           Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1687100 PE=4 SV=1
B9GKF9_POPTR        9.3e-98     51.52           Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s09590g PE=4 SV=2
M5WM36_PRUPE        2.3e-96     52.48           Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005552mg PE=4 SV=1

Match Name     E-value    Identity (%)    Description
AT5G52430.1    1.1e-57    38.44           hydroxyproline-rich glycoprotein family protein
AT1G63720.1    3.8e-50    48.11           BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glyc...
AT4G25620.1    1.5e-46    36.05           hydroxyproline-rich glycoprotein family protein
AT1G76660.1    2.5e-33    40.51           FUNCTIONS IN: molecular_function unknown

Match Name                         E-value     Identity (%)    Description
gi|449457656|ref|XP_004146564.1|   2.5e-181    74.53           PREDICTED: uncharacterized protein LOC101220378 [Cucumis sativus]
gi|659102256|ref|XP_008452033.1|   2.2e-180    73.85           PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo]
gi|659102254|ref|XP_008452032.1|   5.3e-179    73.70           PREDICTED: uncharacterized protein LOC103493162 isoform X1 [Cucumis melo]
gi|700198179|gb|KGN53337.1|        2.6e-125    72.60           hypothetical protein Csa_4G047980 [Cucumis sativus]
gi|802738414|ref|XP_012086872.1|   2.4e-102    52.07           PREDICTED: uncharacterized protein LOC105645786 [Jatropha curcas]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term    Definition

GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
biological_process    GO:0009725        response to hormone
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0003674        molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmoCh04G007840.1    CmoCh04G007840.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term    IPR Description     Source     Source Term      Source Description     Alignment
None        No IPR available    PANTHER    PTHR31798        FAMILY NOT NAMED       coord: 1..462, score: 1.5E
None        No IPR available    PANTHER    PTHR31798:SF2    SUBFAMILY NOT NAMED    coord: 1..462, score: 1.5E

The following gene(s) are paralogous to this gene:
Gene              Paralogue         Organism                     Block
CmoCh04G007840    CmoCh16G006450    Cucurbita moschata (Rifu)    cmocmoB285