Cp4.1LG01g06770 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g06770
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Hydroxyproline-rich glycoprotein family protein
Location: Cp4.1LG01 : 4112594 .. 4114658 (+)
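
The location above can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and the arithmetic assumes the coordinates are 1-based and inclusive, which is the usual genome-browser convention but is not stated on this page.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG01 : 4112594 .. 4114658 (+)")
gene_length = span_length(start, end)
print(seqid, strand, gene_length)
```

Under that assumption the feature spans 2065 bp on the plus strand of Cp4.1LG01.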
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGAGCGATGAGGCGGCGTGCGGATGCTGATGCTGCTGATCTGAGGCCTATGAATAACACTTTTCAGACCATTACTGCGGCGGCCGATGCGATCGCGACCGTTGATCATCGTTTTCCTCGGGCTACTGCCGTCCAGGTATTCGTCATATCACTTCACCTTTTAATCAATCATTTGGAATTTTAGGTGTTAGTGTTAGTTACCTATGGATTGAGATACTGGCCTTGTTTTTGGAAATTATTGTTCGTTCTTGATTCCGTTAGGTATTAGGATGTAGTTGATTTTCGAGGGATGATATATAGCATATAGCGGAATGGAATCTGTGTTGTTTGTTGTGGATTGCGGAATTCTATATAATGCTTTTAATTTCCCCCTTTTTCTGCTACTTCTGTTTGTTTGCTTTGAAACTGTGAATCGTCACATGGATTCAAGCGTAAAAGGGAATCACGAGTTTCGTTTTCCCTCTTTTTCTCCTTTTGGATTTCTTCTCGCATATGATTATTAGATTTTTAAGGTTTGGAAAGTACCAAGATTAGTGTATTGGATTGGACGACTGTGTCCGGAAGCAATGCGCTGTTCCCAACCAATTTTGTTTGTCTTTTTGTAATATAACACTGCCCTTTTTCTTATCAGAAAGGCATCTCTCCATTCTTCTTTTTATCAAAAAGAGATACAGAGAGGTCTTTTTTTAAAAATGTTCTTCTATATAGAGGAATAGTAATGCTTTTTATATAAAGATTCCAACAAATAGGACTCTCATTTGCCCAAAATCTTTGCACTGTTCTGTACTGATGTCATTATCACTTTCCTCCTTTTCTGTTTGTTTGATTACCCCCAGAAAAGAAGATGGGGTAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAACGAATTGGGCATGCTGTCCTTGTCCCAGAACCAAGTCCTTCGCCTGAGGCTCATCAAAATTCATTGCAATCCCCAGACATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGTATCCTTTCTTCAATCAGAGCCACCTTCTGCTACACAATCACCTTCAAATATACTCTCCTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCAATTTTTGCCATTGGCCCATTTGCTCATGAGACTCAGCTTGTATCTCCACCTCTGAATTTTTCTACTCTTACCACTGAACCATCGACTCCTTCCTTCACTCCTCCTGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTTTCTTCAACCTACCCTTCAGAAAGCTGAGTCTGATGACCAATATTCATGTCCTAATGATGACTTTCAATCTTATCAATTCTATCCTGGTAGCCCAGTTAGCAACCTCATATCGCCACGCTCTGCCATTTCTCTTTCTGGGGCATCTTCGCCTTTGCCAGATTTGGATTTTGCTTCCTCTGCTTCTCAATTTTCTAATTTCTCATTGGATGTTCCACCTGCGCTGTTGAACCTTGACAGACAAGGGCAAAGTTCTGATTCTTGCACTCAAAATTCTGTAGGATTCAAATCGAATGATGATGATTTTGATTTGAATCCTCGAACTTCAGACTCAATGAATGAATCCCAAAATATTCAAATTCTCATTGATGGAAGCCAAATGGAGGAACCTGATGTTACTAACCATAGATTCTCATTTGAGTTATCTGATGAAGATTCTTTATTAAGAAACGTAGAAAGTAAGCCACTGGAGTCAAATGTTGCAGTTGCATCATCTCCAATGCATGAAACATTTGAAACGGCTAAAGAAACTTCTTCCGGTGGTGGTCATAGCTCAAATGGTATAGAAGAAAAGGCAGCAGACGGTGAAGAAGCAAATCAGCATCAAGAACATCATCATTCTACTACTCTTGGGTCTGTGAATGAATTTAATTTTGATAATGGCAATGGAAGTAATGCACTTAAGCCTAATATCAACTCGGACTGGTGGGCTAATGCGAAAG
ATGTAGAGACAAAAGGCACGACCACGGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGA

mRNA sequence

ATGAGAGCGATGAGGCGGCGTGCGGATGCTGATGCTGCTGATCTGAGGCCTATGAATAACACTTTTCAGACCATTACTGCGGCGGCCGATGCGATCGCGACCGTTGATCATCGTTTTCCTCGGGCTACTGCCGTCCAGAGGAAACGAATTGGGCATGCTGTCCTTGTCCCAGAACCAAGTCCTTCGCCTGAGGCTCATCAAAATTCATTGCAATCCCCAGACATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGTATCCTTTCTTCAATCAGAGCCACCTTCTGCTACACAATCACCTTCAAATATACTCTCCTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCAATTTTTGCCATTGGCCCATTTGCTCATGAGACTCAGCTTGTATCTCCACCTCTGAATTTTTCTACTCTTACCACTGAACCATCGACTCCTTCCTTCACTCCTCCTGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTTTCTTCAACCTACCCTTCAGAAAGCTGAGTCTGATGACCAATATTCATGTCCTAATGATGACTTTCAATCTTATCAATTCTATCCTGGTAGCCCAGTTAGCAACCTCATATCGCCACGCTCTGCCATTTCTCTTTCTGGGGCATCTTCGCCTTTGCCAGATTTGGATTTTGCTTCCTCTGCTTCTCAATTTTCTAATTTCTCATTGGATGTTCCACCTGCGCTGTTGAACCTTGACAGACAAGGGCAAAGTTCTGATTCTTGCACTCAAAATTCTGTAGGATTCAAATCGAATGATGATGATTTTGATTTGAATCCTCGAACTTCAGACTCAATGAATGAATCCCAAAATATTCAAATTCTCATTGATGGAAGCCAAATGGAGGAACCTGATGTTACTAACCATAGATTCTCATTTGAGTTATCTGATGAAGATTCTTTATTAAGAAACGTAGAAAGTAAGCCACTGGAGTCAAATGTTGCAGTTGCATCATCTCCAATGCATGAAACATTTGAAACGGCTAAAGAAACTTCTTCCGGTGGTGGTCATAGCTCAAATGGTATAGAAGAAAAGGCAGCAGACGGTGAAGAAGCAAATCAGCATCAAGAACATCATCATTCTACTACTCTTGGGTCTGTGAATGAATTTAATTTTGATAATGGCAATGGAAGTAATGCACTTAAGCCTAATATCAACTCGGACTGGTGGGCTAATGCGAAAGATGTAGAGACAAAAGGCACGACCACGGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGA

Coding sequence (CDS)

ATGAGAGCGATGAGGCGGCGTGCGGATGCTGATGCTGCTGATCTGAGGCCTATGAATAACACTTTTCAGACCATTACTGCGGCGGCCGATGCGATCGCGACCGTTGATCATCGTTTTCCTCGGGCTACTGCCGTCCAGAGGAAACGAATTGGGCATGCTGTCCTTGTCCCAGAACCAAGTCCTTCGCCTGAGGCTCATCAAAATTCATTGCAATCCCCAGACATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGTATCCTTTCTTCAATCAGAGCCACCTTCTGCTACACAATCACCTTCAAATATACTCTCCTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCAATTTTTGCCATTGGCCCATTTGCTCATGAGACTCAGCTTGTATCTCCACCTCTGAATTTTTCTACTCTTACCACTGAACCATCGACTCCTTCCTTCACTCCTCCTGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTTTCTTCAACCTACCCTTCAGAAAGCTGAGTCTGATGACCAATATTCATGTCCTAATGATGACTTTCAATCTTATCAATTCTATCCTGGTAGCCCAGTTAGCAACCTCATATCGCCACGCTCTGCCATTTCTCTTTCTGGGGCATCTTCGCCTTTGCCAGATTTGGATTTTGCTTCCTCTGCTTCTCAATTTTCTAATTTCTCATTGGATGTTCCACCTGCGCTGTTGAACCTTGACAGACAAGGGCAAAGTTCTGATTCTTGCACTCAAAATTCTGTAGGATTCAAATCGAATGATGATGATTTTGATTTGAATCCTCGAACTTCAGACTCAATGAATGAATCCCAAAATATTCAAATTCTCATTGATGGAAGCCAAATGGAGGAACCTGATGTTACTAACCATAGATTCTCATTTGAGTTATCTGATGAAGATTCTTTATTAAGAAACGTAGAAAGTAAGCCACTGGAGTCAAATGTTGCAGTTGCATCATCTCCAATGCATGAAACATTTGAAACGGCTAAAGAAACTTCTTCCGGTGGTGGTCATAGCTCAAATGGTATAGAAGAAAAGGCAGCAGACGGTGAAGAAGCAAATCAGCATCAAGAACATCATCATTCTACTACTCTTGGGTCTGTGAATGAATTTAATTTTGATAATGGCAATGGAAGTAATGCACTTAAGCCTAATATCAACTCGGACTGGTGGGCTAATGCGAAAGATGTAGAGACAAAAGGCACGACCACGGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGA

Protein sequence

MRAMRRRADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTLQKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLNPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTTGAWSFFPMAQQR
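
The relationship between the coding sequence and the protein sequence can be checked by translating codons with the standard genetic code. The sketch below is illustrative only: `translate` is a hypothetical helper, the codon table is the standard nuclear code (assumed to apply to this gene), and only the first 48 and last 18 nucleotides of the CDS listed above are used rather than the full sequence.

```python
# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 48 and last 18 nucleotides of the CDS shown above.
cds_start = "ATGAGAGCGATGAGGCGGCGTGCGGATGCTGATGCTGCTGATCTGAGG"
cds_end = "ATGGCGCAGCAAAGATGA"

print(translate(cds_start))  # begins like the protein: MRAMRRRADADAADLR
print(translate(cds_end))    # ends like the protein, then the stop: MAQQR*
```

The same function applied to the full CDS should reproduce the protein sequence followed by a single terminal `*`.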
BLAST of Cp4.1LG01g06770 vs. Swiss-Prot
Match: Y1666_ARATH (Uncharacterized protein At1g76660 OS=Arabidopsis thaliana GN=At1g76660 PE=2 SV=1)

HSP 1 Score: 118.6 bits (296), Expect = 1.7e-25
Identity = 94/189 (49.74%), Positives = 110/189 (58.20%), Query Frame = 1

Query: 48  KRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDIVLPFAAPPSSPVSFLQSEPPSAT 107
           KRI  A  +PE      S    AHQ    N+  +  I L   APPSSP SF  S  PS T
Sbjct: 28  KRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINLSLLAPPSSPASFTNSALPSTT 87

Query: 108 QSPSNILSFTSLTANMYSPDGPSS-IFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-P 167
           QSP+    + SL AN  SP GPSS ++A GP+AHETQLVSPP+ FST TTEPST  FT P
Sbjct: 88  QSPN---CYLSLAAN--SPGGPSSSMYATGPYAHETQLVSPPV-FSTFTTEPSTAPFTPP 147

Query: 168 PESIHLTTPSSPEVPFAQFLQPTLQKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSA 226
           PE   LT PSSP+VP+A+FL  ++    S   +   ND   +Y  YPGSP S L SP S 
Sbjct: 148 PELARLTAPSSPDVPYARFLTSSMDLKNSGKGHY--NDLQATYSLYPGSPASALRSPISR 207

BLAST of Cp4.1LG01g06770 vs. TrEMBL
Match: A0A0A0KY57_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G047980 PE=4 SV=1)

HSP 1 Score: 464.2 bits (1193), Expect = 1.8e-127
Identity = 262/354 (74.01%), Positives = 288/354 (81.36%), Query Frame = 1

Query: 99  QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPE 158
           QSP+ ++SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPSTP FTPPE
Sbjct: 2   QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPE 61

Query: 159 SIHLTTPSSPEVPFAQFLQPTLQKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS 218
           SIHLTTPSSPEVPFAQF+QPTL K ESD+QY+ PNDDFQSYQFYPGSPVS+LISPRS IS
Sbjct: 62  SIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVIS 121

Query: 219 LSGASSPLPDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNSVGFKS 278
            SGASSPLPD DFAS  SQF NF L+VPP LLNLD       RQ QS+DSCTQ+S+ FKS
Sbjct: 122 RSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKS 181

Query: 279 NDDDFDLNPRTSDSM------NESQNIQILID--GSQMEEPDVTNHRFSFELSDEDSLLR 338
           + +DF LNP+TS+SM      NESQNIQILID    + EEP  TNHRFSFELSD D LL+
Sbjct: 182 S-NDFVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQ 241

Query: 339 NVESKPLESN-VAVASSPMHETFETAKETSSGGGHSSNGIEEKA-ADGEEANQHQEHHHS 398
           +V SKPLESN +AV SSP+HE FET KE S  G H+SN IEEK  ADG+EA+Q QE HHS
Sbjct: 242 SVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQE-HHS 301

Query: 399 TTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTTGAWSFFPMAQQR 436
            TLGSV EFNFDNGNGS+   PNINS+WW NAKD  T+ T TG WSFFPM QQR
Sbjct: 302 VTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR 352

BLAST of Cp4.1LG01g06770 vs. TrEMBL
Match: A0A067JZI1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_20571 PE=4 SV=1)

HSP 1 Score: 335.1 bits (858), Expect = 1.3e-88
Identity = 224/458 (48.91%), Positives = 275/458 (60.04%), Query Frame = 1

Query: 14  DLRPMNNTFQTITAAADAIATVDHRFPRATAVQR------------------KRIGHAVL 73
           D RP NN   TI AAA AIA+ ++R P+AT  +R                  KRIGHAVL
Sbjct: 7   DSRPSNNALDTINAAASAIASAENRVPQATVQKRRWGSCFSVYWCFGYNRHRKRIGHAVL 66

Query: 74  VPE---PSPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTA 133
           VPE   P     A +NS Q+P I LPF APPSSP SFLQSEPPSA+QSP+ +LS TS++A
Sbjct: 67  VPETPGPRNDSSAAENSTQTPTITLPFVAPPSSPASFLQSEPPSASQSPTGVLSLTSISA 126

Query: 134 NMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVP 193
           NMYSP GPSSIFAIGP+AHETQLVSPP+ FST TTEPST  FT PPES+HLTTPSSPEVP
Sbjct: 127 NMYSPSGPSSIFAIGPYAHETQLVSPPV-FSTFTTEPSTAPFTPPPESVHLTTPSSPEVP 186

Query: 194 FAQFLQPTLQKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDF 253
           FAQ L P+++  E+  ++   N +FQSYQ YPGSPV  LISP S IS SG SSP PD +F
Sbjct: 187 FAQLLDPSIRNVEAGLRFPLSNYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDGEF 246

Query: 254 ASSASQFSNFSLDVPPALLNLDR-----QGQSSDSCTQNSVGFKSNDDDFDLNPRTSDSM 313
           A+    F  F +  PP LLNLD+      G    S T      +     F  +   SD +
Sbjct: 247 AAG---FLEFRMGEPPKLLNLDKLSTHEWGSRCGSGTLTPDAVRPTSCSFTPDRPFSDFV 306

Query: 314 NESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKP----------LESNVAV 373
           +   +     +G+Q +E  V +HR SFEL+ E  +L   E  P          LE+    
Sbjct: 307 SHKHS----DNGNQNDE--VGDHRLSFELAAE-GVLGCEEQNPASPVKIIGDSLENGTVA 366

Query: 374 ASSPMHETFETAKETSSGGGHSSNGIEEKAA-DGEEANQHQEHHHSTTLGSVNEFNFDNG 433
           A +   ++ E   +  S  G +SNG  EKA+ DGE+A    E H S TLGS+ EFNFDN 
Sbjct: 367 ART--EDSTEVVDDFESRVGETSNGTPEKASTDGEKAPPRHEKHRSITLGSLKEFNFDNV 426

BLAST of Cp4.1LG01g06770 vs. TrEMBL
Match: U5GP58_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s09590g PE=4 SV=1)

HSP 1 Score: 323.6 bits (828), Expect = 3.8e-85
Identity = 223/458 (48.69%), Positives = 282/458 (61.57%), Query Frame = 1

Query: 16  RPMNNTFQTITAAADAIATVDHRFPRATAVQR-----------------KRIGHAVLVPE 75
           R  NNT +TI AAA AIA+ ++R P+AT  +R                 K+IGHAVL PE
Sbjct: 9   RAANNTLETINAAATAIASAENRVPQATVQRRWGSCWSIYLCFGYQKHKKQIGHAVLFPE 68

Query: 76  PSP---SPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMY 135
           PS       A +N  Q+P + LPFAAPPSSP SF QSEPPS TQSP+ ++S TS++A+MY
Sbjct: 69  PSAPGNGAPASENPTQAPAVTLPFAAPPSSPASFFQSEPPSVTQSPAGLVSLTSISASMY 128

Query: 136 SPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFAQ 195
           SP GP+SIFAIGP+AHETQLVSPP+ FST TTEPST  FT PPES+HLTTPSSPEVPFAQ
Sbjct: 129 SPSGPASIFAIGPYAHETQLVSPPV-FSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQ 188

Query: 196 FLQPTLQKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASS 255
           FL P+L+  ++  ++     DFQSYQF+PGSPV  LISP S IS SG SSP PD +FA  
Sbjct: 189 FLDPSLRNGDTGLRFPF---DFQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVG 248

Query: 256 ASQFSNFSLDVPPALLNLDRQG-------QSSDSCTQNSVGFKSNDDDFDLNPRTSDSMN 315
            + F  F +  PP LLNLD+         Q S + T  SV  +    +F L+ + SD  +
Sbjct: 249 GAHFPEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTPESV--RRGSPNFLLHRQFSDVPS 308

Query: 316 ESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMH-ETFE 375
             ++      G+  +   V NHR SFEL+ ED+  R VE KP  S   + + P + E   
Sbjct: 309 RPRS------GNGHKNGQVVNHRVSFELTAEDA-SRCVEEKPAFS---IKTVPEYVENGT 368

Query: 376 TAKETSSGG----------GHSSNGIEEKAA-DGEEANQHQEHHHSTTLGSVNEFNFDNG 434
            AKE  + G          G +SN   E A+ DGE A QH++   S TLGSV EFNFDN 
Sbjct: 369 QAKEEKNSGESIQSFECRVGVTSNDSPEMASTDGEAAPQHRK-QQSITLGSVKEFNFDNA 428

BLAST of Cp4.1LG01g06770 vs. TrEMBL
Match: B9GKF9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s09590g PE=4 SV=2)

HSP 1 Score: 323.2 bits (827), Expect = 5.0e-85
Identity = 223/459 (48.58%), Positives = 282/459 (61.44%), Query Frame = 1

Query: 16  RPMNNTFQTITAAADAIATVDHRFPRATAVQR------------------KRIGHAVLVP 75
           R  NNT +TI AAA AIA+ ++R P+AT  +R                  K+IGHAVL P
Sbjct: 9   RAANNTLETINAAATAIASAENRVPQATVQKRRWGSCWSIYLCFGYQKHKKQIGHAVLFP 68

Query: 76  EPSP---SPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANM 135
           EPS       A +N  Q+P + LPFAAPPSSP SF QSEPPS TQSP+ ++S TS++A+M
Sbjct: 69  EPSAPGNGAPASENPTQAPAVTLPFAAPPSSPASFFQSEPPSVTQSPAGLVSLTSISASM 128

Query: 136 YSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFA 195
           YSP GP+SIFAIGP+AHETQLVSPP+ FST TTEPST  FT PPES+HLTTPSSPEVPFA
Sbjct: 129 YSPSGPASIFAIGPYAHETQLVSPPV-FSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFA 188

Query: 196 QFLQPTLQKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFAS 255
           QFL P+L+  ++  ++     DFQSYQF+PGSPV  LISP S IS SG SSP PD +FA 
Sbjct: 189 QFLDPSLRNGDTGLRFPF---DFQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAV 248

Query: 256 SASQFSNFSLDVPPALLNLDRQG-------QSSDSCTQNSVGFKSNDDDFDLNPRTSDSM 315
             + F  F +  PP LLNLD+         Q S + T  SV  +    +F L+ + SD  
Sbjct: 249 GGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTPESV--RRGSPNFLLHRQFSDVP 308

Query: 316 NESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMH-ETF 375
           +  ++      G+  +   V NHR SFEL+ ED+  R VE KP  S   + + P + E  
Sbjct: 309 SRPRS------GNGHKNGQVVNHRVSFELTAEDA-SRCVEEKPAFS---IKTVPEYVENG 368

Query: 376 ETAKETSSGG----------GHSSNGIEEKAA-DGEEANQHQEHHHSTTLGSVNEFNFDN 434
             AKE  + G          G +SN   E A+ DGE A QH++   S TLGSV EFNFDN
Sbjct: 369 TQAKEEKNSGESIQSFECRVGVTSNDSPEMASTDGEAAPQHRK-QQSITLGSVKEFNFDN 428

BLAST of Cp4.1LG01g06770 vs. TrEMBL
Match: B9RCD8_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1687100 PE=4 SV=1)

HSP 1 Score: 321.6 bits (823), Expect = 1.4e-84
Identity = 213/457 (46.61%), Positives = 269/457 (58.86%), Query Frame = 1

Query: 13  ADLRPMNNTFQTITAAADAIATVDHRFPRATAVQR------------------KRIGHAV 72
           AD RP NN   TI AAA  IA+ ++R P+AT  +R                  KRIGHAV
Sbjct: 9   ADSRPSNNALDTINAAASVIASAENRVPQATIQKRRWGSCWSVYWCFGYHRHRKRIGHAV 68

Query: 73  LVPEPSP----SPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSL 132
           LVPE S     S  A   + Q+P I LPF APPSSP SFLQSEPPSA+QSP+ ILS TS+
Sbjct: 69  LVPENSAPGNDSSAAENPTTQAPTITLPFVAPPSSPASFLQSEPPSASQSPAGILSLTSV 128

Query: 133 TANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPE 192
           +A+MYSP GP+SIFAIGP+AHETQLVSPP  FST TTEPST  FT PPES+ LTTPSSPE
Sbjct: 129 SASMYSPSGPASIFAIGPYAHETQLVSPPA-FSTFTTEPSTAPFTPPPESVQLTTPSSPE 188

Query: 193 VPFAQFLQPTLQKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDL 252
           VPFAQ L+P+ +  E+  ++   N +FQSYQFYPGSPV  LISP S IS SG SSP PD 
Sbjct: 189 VPFAQLLEPSNRNGEAGLRFPFSNYEFQSYQFYPGSPVGQLISPSSGISGSGTSSPFPDG 248

Query: 253 DFASSASQFSNFSLDVPPALLNLDRQ-----GQSSDSCTQNSVGFKSNDDDFDLNPRTSD 312
           +FA++  +F  F + VPP LLNLD+      G    S T      ++    F L+ + SD
Sbjct: 249 EFAAAGPRFLEFQMAVPPKLLNLDKLSVHECGSRQGSGTLTPDAVRATSCSFPLDRQCSD 308

Query: 313 SMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPL--------ESNVAV 372
             +   +       ++ ++  V + R SF+LS ED+ LR  E KP              +
Sbjct: 309 IASNRHS------DNENKDDQVADLRVSFDLSAEDA-LRYAEPKPASPVKIMPESMKNEI 368

Query: 373 ASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGN 432
           A+  + ++ E         G +SNGI E+A+ G E     + H + TLG+  EFNFDN +
Sbjct: 369 AAEKVQKSSEIRHNFECRVGETSNGILEQASTGGEKTPRHQKHRTLTLGTFKEFNFDNAD 428

Query: 433 GSNALKPNINSDWWANAKDVETKGTTTGAWSFFPMAQ 434
           G    KP+   DWW N  DV  +  T   WSFFP+ Q
Sbjct: 429 G--VPKPSAGPDWWDNGSDVGKEDFTAKNWSFFPVMQ 455

BLAST of Cp4.1LG01g06770 vs. TAIR10
Match: AT5G52430.1 (AT5G52430.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 182.2 bits (461), Expect = 6.9e-46
Identity = 165/464 (35.56%), Positives = 218/464 (46.98%), Query Frame = 1

Query: 18  MNNTFQTITAAADAIATVDHRFPRA------------------TAVQRKRIGHAVLVPEP 77
           +NN+ +T+ AAA AI T + R   +                  T    KRIG+AVLVPEP
Sbjct: 5   VNNSVETVNAAATAIVTAESRVQPSSSQKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPEP 64

Query: 78  SPSPE---AHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYS 137
             S       QNS  S  +VLPF APPSSP SFLQS+P S + SP   LS   LT+N +S
Sbjct: 65  VTSGVPVVTVQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPVGPLS---LTSNTFS 124

Query: 138 PDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPP--ESIHLTTPSSPEVPFAQ 197
           P  P S+F +GP+A+ETQ V+PP+ FS   TEPST  +TPP   S+H+TTPSSPEVPFAQ
Sbjct: 125 PKEPQSVFTVGPYANETQPVTPPV-FSAFITEPSTAPYTPPPESSVHITTPSSPEVPFAQ 184

Query: 198 FLQPTLQKAESDD------QYSCPNDDFQSYQFYPGSPVS-NLISPRSAISLSGASSPLP 257
            L  +L+    D       ++S  + +F+S Q  PGSP   NLISP S IS SG SSP P
Sbjct: 185 LLTSSLELTRRDSTSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYP 244

Query: 258 DLDFASSASQFSNFSLDVPPALLNLDR---------------------QGQSSDSCTQNS 317
                   S    F +  PP  L  +                       G +S + T N 
Sbjct: 245 ------GKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGLASGALTPNG 304

Query: 318 VGFKSNDDDFDLNPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVE 377
               S +     N  T    N+   +  L +     E  V +HR SFEL+ ED + R + 
Sbjct: 305 PEIVSGN--LTPNNTTWPLQNQISEVASLANSDHGSEVMVADHRVSFELTGED-VARCLA 364

Query: 378 SKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHHSTTLGS 431
           SK   S+  + ++   ET E      S        IE+++ D E      +   S+++GS
Sbjct: 365 SKLNRSHDRMNNNDRIETEE------SSSTDIRRNIEKRSGDRENEQHRIQKLSSSSIGS 424

BLAST of Cp4.1LG01g06770 vs. TAIR10
Match: AT1G63720.1 (AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1))

HSP 1 Score: 167.5 bits (423), Expect = 1.8e-41
Identity = 132/291 (45.36%), Positives = 159/291 (54.64%), Query Frame = 1

Query: 19  NNTFQTITAAADAIATVDHRFPRATAV--------------------QRKRIGHAVLVPE 78
           NN F TI AAA AIA+ D R  +++ +                    QRKRIG++VLVPE
Sbjct: 8   NNVFDTINAAASAIASSDDRLHQSSPIHKKRKWWNRWSLLKCFGSSRQRKRIGNSVLVPE 67

Query: 79  P---SPSPEAHQNS-LQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANM 138
           P   S S     NS  +S    LPF APPSSP SF QSEPPSATQSP  ILSF+ L  N 
Sbjct: 68  PVSMSSSNSTTSNSGYRSVITTLPFIAPPSSPASFFQSEPPSATQSPVGILSFSPLPCN- 127

Query: 139 YSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPP---ESIHL--TTPSSPE 198
                  SIFAIGP+AHETQLVSPP+ FST TTEPS+   TPP    SI+L  TTPSSPE
Sbjct: 128 ----NRPSIFAIGPYAHETQLVSPPV-FSTYTTEPSSAPITPPLDDSSIYLTTTTPSSPE 187

Query: 199 VPFAQFLQPTLQKAESDDQYSCPND-DFQSYQFYPGSPVSNLISPRSAISLSGASSPLPD 258
           VPFAQ      Q      ++   +  +FQ YQ  PGSP+  LISP      SG +SP PD
Sbjct: 188 VPFAQLFNSNHQTGSYGYKFPMSSSYEFQFYQLPPGSPLGQLISPSPG---SGPTSPFPD 247

Query: 259 LDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLN 280
                  S F +F +  PP LL+    G ++  C +  +        FDL+
Sbjct: 248 ----GETSLFPHFQVSDPPKLLSPKTAGVTT-PCKEQKIVRPHKPVSFDLD 284

BLAST of Cp4.1LG01g06770 vs. TAIR10
Match: AT4G25620.1 (AT4G25620.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 144.4 bits (363), Expect = 1.6e-34
Identity = 112/263 (42.59%), Positives = 141/263 (53.61%), Query Frame = 1

Query: 19  NNTFQTITAAADAIATVDHRFPRATAVQRKR------------------IGHAVLVPEPS 78
           N++  T+ AAA AI + + R  + ++VQ+KR                  IGHAVLVPEP+
Sbjct: 6   NSSVDTVNAAASAIVSAESR-TQPSSVQKKRGSWWSLYWCFGSKKNNKRIGHAVLVPEPA 65

Query: 79  PSPEA----HQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYS 138
            S  A      +S  S  I +PF APPSSP SFL S PPSA+ +P   L   SLT N   
Sbjct: 66  ASGAAVAPVQNSSSNSTSIFMPFIAPPSSPASFLPSGPPSASHTPDPGL-LCSLTVNE-- 125

Query: 139 PDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFL 198
              P S F IGP+AHETQ V+PP+ FS  TTEPST  FTPP      +PSSPEVPFAQ L
Sbjct: 126 ---PPSAFTIGPYAHETQPVTPPV-FSAFTTEPSTAPFTPPPE----SPSSPEVPFAQLL 185

Query: 199 QPTLQKAE------SDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLD 254
             +L++A        + ++S  + +F+S Q YPGSP  NLISP      SG SSP P   
Sbjct: 186 TSSLERARRNSGGGMNQKFSAAHYEFKSCQVYPGSPGGNLISPG-----SGTSSPYP--- 245

BLAST of Cp4.1LG01g06770 vs. TAIR10
Match: AT1G76660.1 (AT1G76660.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 118.6 bits (296), Expect = 9.4e-27
Identity = 94/189 (49.74%), Positives = 110/189 (58.20%), Query Frame = 1

Query: 48  KRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDIVLPFAAPPSSPVSFLQSEPPSAT 107
           KRI  A  +PE      S    AHQ    N+  +  I L   APPSSP SF  S  PS T
Sbjct: 28  KRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINLSLLAPPSSPASFTNSALPSTT 87

Query: 108 QSPSNILSFTSLTANMYSPDGPSS-IFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-P 167
           QSP+    + SL AN  SP GPSS ++A GP+AHETQLVSPP+ FST TTEPST  FT P
Sbjct: 88  QSPN---CYLSLAAN--SPGGPSSSMYATGPYAHETQLVSPPV-FSTFTTEPSTAPFTPP 147

Query: 168 PESIHLTTPSSPEVPFAQFLQPTLQKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSA 226
           PE   LT PSSP+VP+A+FL  ++    S   +   ND   +Y  YPGSP S L SP S 
Sbjct: 148 PELARLTAPSSPDVPYARFLTSSMDLKNSGKGHY--NDLQATYSLYPGSPASALRSPISR 207

BLAST of Cp4.1LG01g06770 vs. NCBI nr
Match: gi|659102256|ref|XP_008452033.1| (PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo])

HSP 1 Score: 602.1 bits (1551), Expect = 8.0e-169
Identity = 342/468 (73.08%), Positives = 370/468 (79.06%), Query Frame = 1

Query: 4   MRRRADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQ----------------- 63
           MRRR D D  D RP+NNTFQTITAAADAIATVDHRFPRATAVQ                 
Sbjct: 1   MRRRTDTD--DFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSL 60

Query: 64  --RKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNI 123
             RKRIGHAVLVPEPSPS E H+N+LQSPDIVLPFAAPPSSPVS LQSEPPSA QSP+ +
Sbjct: 61  KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTAL 120

Query: 124 LSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTT 183
           +SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPSTP FTPPESIHLTT
Sbjct: 121 ISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTT 180

Query: 184 PSSPEVPFAQFLQPTLQKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASS 243
           PSSPEVPFAQF+ P+LQK ESD+QY+ PNDDFQSYQFYPGSPVS+LISPRS IS SGASS
Sbjct: 181 PSSPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASS 240

Query: 244 PLPDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNSVGFKSNDDDFD 303
           PLPD DFAS  SQF NF L+VPP L NLD       RQ QS+DSCTQ+S+ FKS+ +DF 
Sbjct: 241 PLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSS-NDFV 300

Query: 304 LNPRTSDSM------NESQNIQILID--GSQMEEPDVTNHRFSFELSDEDSLLRNVESKP 363
           LNP TS+SM      NESQNIQILID    + EEP  TNHRFSFELSD D L ++V SKP
Sbjct: 301 LNPHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKP 360

Query: 364 LESN-VAVASSPMHETFETAKETSSGGGHSSNGIEEKA-ADGEEANQHQEHHHSTTLGSV 423
           LESN + V SSP+HE FET KE S  G H+SN IEEK  ADG+EA+QHQE HHS  LGSV
Sbjct: 361 LESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQE-HHSVALGSV 420

Query: 424 NEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTTGAWSFFPMAQQR 436
            EFNFDN NGS+   P INSDWW NAKD  T+GTTTGAWSFFP  QQR
Sbjct: 421 KEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 464

BLAST of Cp4.1LG01g06770 vs. NCBI nr
Match: gi|659102254|ref|XP_008452032.1| (PREDICTED: uncharacterized protein LOC103493162 isoform X1 [Cucumis melo])

HSP 1 Score: 601.7 bits (1550), Expect = 1.0e-168
Identity = 342/469 (72.92%), Positives = 370/469 (78.89%), Query Frame = 1

Query: 4   MRRRADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQ----------------- 63
           MRRR D D  D RP+NNTFQTITAAADAIATVDHRFPRATAVQ                 
Sbjct: 1   MRRRTDTD--DFRPVNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGS 60

Query: 64  ---RKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSN 123
              RKRIGHAVLVPEPSPS E H+N+LQSPDIVLPFAAPPSSPVS LQSEPPSA QSP+ 
Sbjct: 61  LKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTA 120

Query: 124 ILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLT 183
           ++SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPSTP FTPPESIHLT
Sbjct: 121 LISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLT 180

Query: 184 TPSSPEVPFAQFLQPTLQKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGAS 243
           TPSSPEVPFAQF+ P+LQK ESD+QY+ PNDDFQSYQFYPGSPVS+LISPRS IS SGAS
Sbjct: 181 TPSSPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGAS 240

Query: 244 SPLPDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNSVGFKSNDDDF 303
           SPLPD DFAS  SQF NF L+VPP L NLD       RQ QS+DSCTQ+S+ FKS+ +DF
Sbjct: 241 SPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSS-NDF 300

Query: 304 DLNPRTSDSM------NESQNIQILID--GSQMEEPDVTNHRFSFELSDEDSLLRNVESK 363
            LNP TS+SM      NESQNIQILID    + EEP  TNHRFSFELSD D L ++V SK
Sbjct: 301 VLNPHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSK 360

Query: 364 PLESN-VAVASSPMHETFETAKETSSGGGHSSNGIEEKA-ADGEEANQHQEHHHSTTLGS 423
           PLESN + V SSP+HE FET KE S  G H+SN IEEK  ADG+EA+QHQE HHS  LGS
Sbjct: 361 PLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQE-HHSVALGS 420

Query: 424 VNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTTGAWSFFPMAQQR 436
           V EFNFDN NGS+   P INSDWW NAKD  T+GTTTGAWSFFP  QQR
Sbjct: 421 VKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 465

BLAST of Cp4.1LG01g06770 vs. NCBI nr
Match: gi|449457656|ref|XP_004146564.1| (PREDICTED: uncharacterized protein LOC101220378 [Cucumis sativus])

HSP 1 Score: 600.5 bits (1547), Expect = 2.3e-168
Identity = 345/469 (73.56%), Positives = 374/469 (79.74%), Query Frame = 1

Query: 4   MRRRADADAADLRPMNN-TFQTITAAADAIATVDHRFPRATAVQ---------------- 63
           MRRR D D  D RP+NN TFQTITAAADAIATVDHRFPRATAVQ                
Sbjct: 1   MRRRTDTD--DFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGS 60

Query: 64  ---RKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSN 123
              RKRIGHAVLVPEPSPS E H+N+LQSPDIVLPFAAPPSSPVS LQSEPPSA QSP+ 
Sbjct: 61  IKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTA 120

Query: 124 ILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLT 183
           ++SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPSTP FTPPESIHLT
Sbjct: 121 LISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPESIHLT 180

Query: 184 TPSSPEVPFAQFLQPTLQKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGAS 243
           TPSSPEVPFAQF+QPTL K ESD+QY+ PNDDFQSYQFYPGSPVS+LISPRS IS SGAS
Sbjct: 181 TPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGAS 240

Query: 244 SPLPDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNSVGFKSNDDDF 303
           SPLPD DFAS  SQF NF L+VPP LLNLD       RQ QS+DSCTQ+S+ FKS+ +DF
Sbjct: 241 SPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSS-NDF 300

Query: 304 DLNPRTSDSM------NESQNIQILID--GSQMEEPDVTNHRFSFELSDEDSLLRNVESK 363
            LNP+TS+SM      NESQNIQILID    + EEP  TNHRFSFELSD D LL++V SK
Sbjct: 301 VLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSK 360

Query: 364 PLESN-VAVASSPMHETFETAKETSSGGGHSSNGIEEKA-ADGEEANQHQEHHHSTTLGS 423
           PLESN +AV SSP+HE FET KE S  G H+SN IEEK  ADG+EA+Q QE HHS TLGS
Sbjct: 361 PLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQE-HHSVTLGS 420

Query: 424 VNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTTGAWSFFPMAQQR 436
           V EFNFDNGNGS+   PNINS+WW NAKD  T+ T TG WSFFPM QQR
Sbjct: 421 VKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR 464

BLAST of Cp4.1LG01g06770 vs. NCBI nr
Match: gi|700198179|gb|KGN53337.1| (hypothetical protein Csa_4G047980 [Cucumis sativus])

HSP 1 Score: 464.2 bits (1193), Expect = 2.6e-127
Identity = 262/354 (74.01%), Positives = 288/354 (81.36%), Query Frame = 1

Query: 99  QSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPE 158
           QSP+ ++SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPSTP FTPPE
Sbjct: 2   QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPE 61

Query: 159 SIHLTTPSSPEVPFAQFLQPTLQKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAIS 218
           SIHLTTPSSPEVPFAQF+QPTL K ESD+QY+ PNDDFQSYQFYPGSPVS+LISPRS IS
Sbjct: 62  SIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVIS 121

Query: 219 LSGASSPLPDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNSVGFKS 278
            SGASSPLPD DFAS  SQF NF L+VPP LLNLD       RQ QS+DSCTQ+S+ FKS
Sbjct: 122 RSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKS 181

Query: 279 NDDDFDLNPRTSDSM------NESQNIQILID--GSQMEEPDVTNHRFSFELSDEDSLLR 338
           + +DF LNP+TS+SM      NESQNIQILID    + EEP  TNHRFSFELSD D LL+
Sbjct: 182 S-NDFVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQ 241

Query: 339 NVESKPLESN-VAVASSPMHETFETAKETSSGGGHSSNGIEEKA-ADGEEANQHQEHHHS 398
           +V SKPLESN +AV SSP+HE FET KE S  G H+SN IEEK  ADG+EA+Q QE HHS
Sbjct: 242 SVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQE-HHS 301

Query: 399 TTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTTGAWSFFPMAQQR 436
            TLGSV EFNFDNGNGS+   PNINS+WW NAKD  T+ T TG WSFFPM QQR
Sbjct: 302 VTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR 352

BLAST of Cp4.1LG01g06770 vs. NCBI nr
Match: gi|802738414|ref|XP_012086872.1| (PREDICTED: uncharacterized protein LOC105645786 [Jatropha curcas])

HSP 1 Score: 335.1 bits (858), Expect = 1.8e-88
Identity = 224/458 (48.91%), Positives = 275/458 (60.04%), Query Frame = 1

Query: 14  DLRPMNNTFQTITAAADAIATVDHRFPRATAVQR------------------KRIGHAVL 73
           D RP NN   TI AAA AIA+ ++R P+AT  +R                  KRIGHAVL
Sbjct: 7   DSRPSNNALDTINAAASAIASAENRVPQATVQKRRWGSCFSVYWCFGYNRHRKRIGHAVL 66

Query: 74  VPE---PSPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTA 133
           VPE   P     A +NS Q+P I LPF APPSSP SFLQSEPPSA+QSP+ +LS TS++A
Sbjct: 67  VPETPGPRNDSSAAENSTQTPTITLPFVAPPSSPASFLQSEPPSASQSPTGVLSLTSISA 126

Query: 134 NMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVP 193
           NMYSP GPSSIFAIGP+AHETQLVSPP+ FST TTEPST  FT PPES+HLTTPSSPEVP
Sbjct: 127 NMYSPSGPSSIFAIGPYAHETQLVSPPV-FSTFTTEPSTAPFTPPPESVHLTTPSSPEVP 186

Query: 194 FAQFLQPTLQKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDF 253
           FAQ L P+++  E+  ++   N +FQSYQ YPGSPV  LISP S IS SG SSP PD +F
Sbjct: 187 FAQLLDPSIRNVEAGLRFPLSNYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDGEF 246

Query: 254 ASSASQFSNFSLDVPPALLNLDR-----QGQSSDSCTQNSVGFKSNDDDFDLNPRTSDSM 313
           A+    F  F +  PP LLNLD+      G    S T      +     F  +   SD +
Sbjct: 247 AAG---FLEFRMGEPPKLLNLDKLSTHEWGSRCGSGTLTPDAVRPTSCSFTPDRPFSDFV 306

Query: 314 NESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKP----------LESNVAV 373
           +   +     +G+Q +E  V +HR SFEL+ E  +L   E  P          LE+    
Sbjct: 307 SHKHS----DNGNQNDE--VGDHRLSFELAAE-GVLGCEEQNPASPVKIIGDSLENGTVA 366

Query: 374 ASSPMHETFETAKETSSGGGHSSNGIEEKAA-DGEEANQHQEHHHSTTLGSVNEFNFDNG 433
           A +   ++ E   +  S  G +SNG  EKA+ DGE+A    E H S TLGS+ EFNFDN 
Sbjct: 367 ART--EDSTEVVDDFESRVGETSNGTPEKASTDGEKAPPRHEKHRSITLGSLKEFNFDNV 426

The following BLAST results are available for this feature:

Swiss-Prot
Match Name     E-value   Identity (%)   Description
Y1666_ARATH    1.7e-25   49.74          Uncharacterized protein At1g76660 OS=Arabidopsis thaliana GN=At1g76660 PE=2 SV=1

TrEMBL
Match Name          E-value    Identity (%)   Description
A0A0A0KY57_CUCSA    1.8e-127   74.01          Uncharacterized protein OS=Cucumis sativus GN=Csa_4G047980 PE=4 SV=1
A0A067JZI1_JATCU    1.3e-88    48.91          Uncharacterized protein OS=Jatropha curcas GN=JCGZ_20571 PE=4 SV=1
U5GP58_POPTR        3.8e-85    48.69          Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s09590g PE=4 SV=1
B9GKF9_POPTR        5.0e-85    48.58          Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s09590g PE=4 SV=2
B9RCD8_RICCO        1.4e-84    46.61          Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1687100 PE=4 SV=1

TAIR10
Match Name     E-value   Identity (%)   Description
AT5G52430.1    6.9e-46   35.56          hydroxyproline-rich glycoprotein family protein
AT1G63720.1    1.8e-41   45.36          BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glyc...
AT4G25620.1    1.6e-34   42.59          hydroxyproline-rich glycoprotein family protein
AT1G76660.1    9.4e-27   49.74          FUNCTIONS IN: molecular_function unknown

NCBI nr
Match Name                         E-value    Identity (%)   Description
gi|659102256|ref|XP_008452033.1|   8.0e-169   73.08          PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo]
gi|659102254|ref|XP_008452032.1|   1.0e-168   72.92          PREDICTED: uncharacterized protein LOC103493162 isoform X1 [Cucumis melo]
gi|449457656|ref|XP_004146564.1|   2.3e-168   73.56          PREDICTED: uncharacterized protein LOC101220378 [Cucumis sativus]
gi|700198179|gb|KGN53337.1|        2.6e-127   74.01          hypothetical protein Csa_4G047980 [Cucumis sativus]
gi|802738414|ref|XP_012086872.1|   1.8e-88    48.91          PREDICTED: uncharacterized protein LOC105645786 [Jatropha curcas]

The following terms have been associated with this gene:
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession   Term Name
biological_process    GO:0009725       response to hormone
biological_process    GO:0008150       biological_process
cellular_component    GO:0005575       cellular_component
molecular_function    GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG01g06770.1    Cp4.1LG01g06770.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description    Source    Source Term     Source Description    Alignment
None       No IPR available   PANTHER   PTHR31798       FAMILY NOT NAMED      coord: 1..433; score: 1.2E
None       No IPR available   PANTHER   PTHR31798:SF2   SUBFAMILY NOT NAMED   coord: 1..433; score: 1.2E

The following gene(s) are paralogous to this gene:
Gene               Paralogue          Organism                     Block
Cp4.1LG01g06770    Cp4.1LG14g02100    Cucurbita pepo (Zucchini)    cpecpeB237