HG10016710 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10016710
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptioncysteine proteinase COT44-like
LocationChr03: 7336539 .. 7339169 (-)
RNA-Seq ExpressionHG10016710
SyntenyHG10016710
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCGCCGCCCCCTCTTTGCTCGCCCTCCTCTCCTTCTTCTTCCTTTCCATTTCCGCCTCCGCACTCAGCCGCCGGAGCGACGGCGAGGTTAGAGAAATCTATGACCTGTGGCTGGCGAAGCACGGCAAGGCCTATAACGGAATCGAAGAGCGGGAGAAGAGGTTTCAGATCTTCAAGGATAATCTGAACTTTATCGATGATCATAATTCTGAGAATCGGACGTATAAGGTTGGATTGAACAAGTTCGCCGATCTGACCAACGACGAGTATCGGGCTGTGTATTTGGGGACTAGGTCTCCCCCTGCTCGACGAGTCATGAAGGCAAAGTCCGCCAGCCGCCGATACGCCGTCAACAACCGCGATCGGTTGCCGGAATCTGTTGATTGGAGGTCCAGAGGTGCCGTTGCTCCAGTCAAAAATCAAGGAAGTTGCGGTGAGTTTTTTAAATATCAACCGCTGTTCAGAATATTTTTTTTTTCTAAATAATTTTCCCAAATCAATACGATTAAACTGGGCAATTGCAATTGGTAATATAAATTATAACAATATTCTAAAAAAATTGCAAAAATGGCAAAATTGATCTACATTTGGGTTTGATCACCCCAAAGAATATATATAGGCCCAAAGAATATAGGGAAAGGTACGGCCCAAAAATAAAAGAAGCTAAATTACAATTATACCCTTATACATCCTTAATATATATATATATGTTTTGAAAATATTAGTGGAGATTCAAGTCTAATCTCGAGTTAAAATAGGTAGGAGCAATTCTTTAAATTCAACTAAGCTATCACATGTGAACTACAAGAGTATAACTCAACTGGAATCAACCACGTACAATCTATTTTGGAGTACTCCCTCACCCTGCACCCCACATTTATGACTTTATATCCTTAATTTGTTGAGCTACTTTTAAAGCACATTTATTGACTCTATATCCGGTTATTTTAAATAGTTAAAGCTTTTTAAATGGATGCTTTATGCTAAAATAAATATGTATATATATATATATATATTTTAAAATTGATCCAAGTAGAATGATTGAAGATAATTAAAAAGGGTAATGATTAGGTCGATTTTCCATGCAACAATATTTTATTACTTAGTAAAAAAAAAAATTCATTTTTTAAAAAAAATGTTTGCTAAATATTTTATTACATAATAAAAAGAAGAAAAAATAATTTTTTTTTGCCACGTGATAAATTTTAATTGAATAACTAAGCAACACAAAAATGGTATAATCCAACTTGCACCTACCTATTTTTCCATGCAAGAGGGATGGCCTTGGACTTTTTATGCAAAGTAGTGAGCTTTCTTGTTTGATAAAAATTCAATTGGAAAAAAGGTTAATTATTTAAAGATGGAAAATATAAATCTCCTCTCATAAAGAACTCAAATTAATTACTTGGATTAGAAAATAAGTTAATTCATAACAAGGTGTGTTATTACTAATTAGTTTTGAGATTGAACTCCATATTTATTTAATATGGTATTGAGCACATAAAGTCCAGTTGACCGAGTATTCGATAAAAGAAAATTAATCCAAAAACGGTGAATTCAAATAAATGTCCCTTTGAGCATGTATATAAAAGTCAAACCAGCCTATACAAATGACAATGGACGTATACACGAATTTGTTGCACTGAAAATGTAAGAATATAAATTAATTTAGAAGATTATAAAATTAGTGTAGAGATCTCTTAAAATATGTTAATGATGTACGATAACCCATTACAATTGGAAACAAGTGAGCCCACTAATGTGAGATGTTTCTGTTGTTGGGGGAAACAGGGAGTTGTTGGGCATTCTCGACCATAGCAGCTGTTGAAGGCATAAATCAGATCGTTACTGGAGAACTCATCTCTCTCTCTGAACAAGAGCTTGTTAACTGTGACAAAAAGTACAATTCAGGTTGCAATGGAGGTCTTATGGACTATGCCTTCCAATTCATTATTGACAATGGCGGCTTGGACACTGAGGAAGATTATCCTTATGAGGGCGTCGATGGTCAATGCGATCCCACCAGGGTCAGATTATTTCTGCTTTCATTTTCCACTAATCTCTTTGACCCTTTTTACTTGGCTGAATTTTGATTTCCAACATCTCTCTCAGAAAAATGCCAAGGTTGTTAACATCGACGGATACGAGGATGTCCTTGCGAATGACGAGGAAGCATTGAAGAAGGCCATTGCTCACCAACCAGTTAGCGTCGCCATTGAAGCTGGTGGCTTGGCTTTGCAACTTTACCAGTCGGTGAGCAAACTTCTCTGACACACATATACTTTTTAAACTATCCACTTCAATCCATAAATAGTTATCACATTTGATTGAACTTTTCAGGGTGTATTCACTGGTAAATGTGGCTCAGCTCTCGACCATGGTGTCGTCGCTGTTGGTTACGGCACAGAGAACGGAGTTGATTATTGGCTTGTAAGGAACTCATGGGGCACAGAATGGGGTGAGGATGGCTACTTCAAACTAGAGCGCAATGTAAAGCACACTACCAATGGGAAGTGTGGGATCGCAATGATGGCTTCTTACCCTGTTAAGAATGGCAACAACCCAACAACATCATACTTAAGTTTGGAAACAACTACTGGGGACAAGAACAAGATCAACATTGCTTGA

mRNA sequence

ATGGCCGCCGCCCCCTCTTTGCTCGCCCTCCTCTCCTTCTTCTTCCTTTCCATTTCCGCCTCCGCACTCAGCCGCCGGAGCGACGGCGAGGTTAGAGAAATCTATGACCTGTGGCTGGCGAAGCACGGCAAGGCCTATAACGGAATCGAAGAGCGGGAGAAGAGGTTTCAGATCTTCAAGGATAATCTGAACTTTATCGATGATCATAATTCTGAGAATCGGACGTATAAGGTTGGATTGAACAAGTTCGCCGATCTGACCAACGACGAGTATCGGGCTGTGTATTTGGGGACTAGGTCTCCCCCTGCTCGACGAGTCATGAAGGCAAAGTCCGCCAGCCGCCGATACGCCGTCAACAACCGCGATCGGTTGCCGGAATCTGTTGATTGGAGGTCCAGAGGTGCCGTTGCTCCAGTCAAAAATCAAGGAAGTTGCGGGAGTTGTTGGGCATTCTCGACCATAGCAGCTGTTGAAGGCATAAATCAGATCGTTACTGGAGAACTCATCTCTCTCTCTGAACAAGAGCTTGTTAACTGTGACAAAAAGTACAATTCAGGTTGCAATGGAGGTCTTATGGACTATGCCTTCCAATTCATTATTGACAATGGCGGCTTGGACACTGAGGAAGATTATCCTTATGAGGGCGTCGATGGTCAATGCGATCCCACCAGGAAAAATGCCAAGGTTGTTAACATCGACGGATACGAGGATGTCCTTGCGAATGACGAGGAAGCATTGAAGAAGGCCATTGCTCACCAACCAGTTAGCGTCGCCATTGAAGCTGGTGGCTTGGCTTTGCAACTTTACCAGTCGGGTGTATTCACTGGTAAATGTGGCTCAGCTCTCGACCATGGTGTCGTCGCTGTTGGTTACGGCACAGAGAACGGAGTTGATTATTGGCTTGTAAGGAACTCATGGGGCACAGAATGGGGTGAGGATGGCTACTTCAAACTAGAGCGCAATGTAAAGCACACTACCAATGGGAAGTGTGGGATCGCAATGATGGCTTCTTACCCTGTTAAGAATGGCAACAACCCAACAACATCATACTTAAGTTTGGAAACAACTACTGGGGACAAGAACAAGATCAACATTGCTTGA

Coding sequence (CDS)

ATGGCCGCCGCCCCCTCTTTGCTCGCCCTCCTCTCCTTCTTCTTCCTTTCCATTTCCGCCTCCGCACTCAGCCGCCGGAGCGACGGCGAGGTTAGAGAAATCTATGACCTGTGGCTGGCGAAGCACGGCAAGGCCTATAACGGAATCGAAGAGCGGGAGAAGAGGTTTCAGATCTTCAAGGATAATCTGAACTTTATCGATGATCATAATTCTGAGAATCGGACGTATAAGGTTGGATTGAACAAGTTCGCCGATCTGACCAACGACGAGTATCGGGCTGTGTATTTGGGGACTAGGTCTCCCCCTGCTCGACGAGTCATGAAGGCAAAGTCCGCCAGCCGCCGATACGCCGTCAACAACCGCGATCGGTTGCCGGAATCTGTTGATTGGAGGTCCAGAGGTGCCGTTGCTCCAGTCAAAAATCAAGGAAGTTGCGGGAGTTGTTGGGCATTCTCGACCATAGCAGCTGTTGAAGGCATAAATCAGATCGTTACTGGAGAACTCATCTCTCTCTCTGAACAAGAGCTTGTTAACTGTGACAAAAAGTACAATTCAGGTTGCAATGGAGGTCTTATGGACTATGCCTTCCAATTCATTATTGACAATGGCGGCTTGGACACTGAGGAAGATTATCCTTATGAGGGCGTCGATGGTCAATGCGATCCCACCAGGAAAAATGCCAAGGTTGTTAACATCGACGGATACGAGGATGTCCTTGCGAATGACGAGGAAGCATTGAAGAAGGCCATTGCTCACCAACCAGTTAGCGTCGCCATTGAAGCTGGTGGCTTGGCTTTGCAACTTTACCAGTCGGGTGTATTCACTGGTAAATGTGGCTCAGCTCTCGACCATGGTGTCGTCGCTGTTGGTTACGGCACAGAGAACGGAGTTGATTATTGGCTTGTAAGGAACTCATGGGGCACAGAATGGGGTGAGGATGGCTACTTCAAACTAGAGCGCAATGTAAAGCACACTACCAATGGGAAGTGTGGGATCGCAATGATGGCTTCTTACCCTGTTAAGAATGGCAACAACCCAACAACATCATACTTAAGTTTGGAAACAACTACTGGGGACAAGAACAAGATCAACATTGCTTGA

Protein sequence

MAAAPSLLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKDNLNFIDDHNSENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNNRDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNGNNPTTSYLSLETTTGDKNKINIA
Homology
BLAST of HG10016710 vs. NCBI nr
Match: XP_038880922.1 (cysteine proteinase COT44-like [Benincasa hispida])

HSP 1 Score: 674.5 bits (1739), Expect = 5.2e-190
Identity = 338/367 (92.10%), Postives = 349/367 (95.10%), Query Frame = 0

Query: 1   MAAAPSLLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFK 60
           MA+A +LLALLSFFFLSIS+SALS RSD EVREIYDLWLAKHGKAYNGIEEREKRFQIFK
Sbjct: 1   MASATTLLALLSFFFLSISSSALSPRSDREVREIYDLWLAKHGKAYNGIEEREKRFQIFK 60

Query: 61  DNLNFIDDHNSENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNN 120
           +NL FIDDHNSENRTYKVGLN FADLTNDEYRA+YLGTRSPPARRVMKAK+ASRRYAVNN
Sbjct: 61  ENLKFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRYAVNN 120

Query: 121 RDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCD 180
           RDRLPES DWR+RGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELV+CD
Sbjct: 121 RDRLPESFDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCD 180

Query: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLA 240
           KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEG DGQCDPTRKNAKVV+IDGYEDV A
Sbjct: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFDGQCDPTRKNAKVVSIDGYEDVPA 240

Query: 241 NDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW 300
           NDEEALKKA+AHQPVSVAIEA GLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW
Sbjct: 241 NDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW 300

Query: 301 LVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNGNNPTTS-YLSLETTTGD 360
           LVRNSWGT WGEDGYFKLERNVKHTTNGKCGIAM ASYPVKN   PT S YLSLE T G+
Sbjct: 301 LVRNSWGTGWGEDGYFKLERNVKHTTNGKCGIAMQASYPVKNDKKPTKSYYLSLE-TAGE 360

Query: 361 KNKINIA 367
           KNKINIA
Sbjct: 361 KNKINIA 366

BLAST of HG10016710 vs. NCBI nr
Match: XP_008440309.1 (PREDICTED: cysteine proteinase COT44-like [Cucumis melo] >TYK12867.1 cysteine proteinase COT44-like [Cucumis melo var. makuwa])

HSP 1 Score: 671.0 bits (1730), Expect = 5.8e-189
Identity = 334/366 (91.26%), Postives = 346/366 (94.54%), Query Frame = 0

Query: 1   MAAAPSLLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFK 60
           MA A + LALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGI+EREKRFQIFK
Sbjct: 1   MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFK 60

Query: 61  DNLNFIDDHNSENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNN 120
           +NLNFIDDHNSENRTYKVGLN FADLTNDEYRA+YLGTRSPPARRVMKAK+ASRRYAVNN
Sbjct: 61  ENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRYAVNN 120

Query: 121 RDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCD 180
           RDRLPESVDWR+RGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTG+LISLSEQELV+CD
Sbjct: 121 RDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCD 180

Query: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLA 240
           KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYE  DGQCDPTRKNAKVV+ID YEDV A
Sbjct: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDSYEDVPA 240

Query: 241 NDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW 300
           NDEEALKKA+AHQPVSVAIEA GLALQLYQSGVFTGKCGSALDHGVVAVGYG ENGVDYW
Sbjct: 241 NDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGKENGVDYW 300

Query: 301 LVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNGNNPTTSYLSLETTTGDK 360
           LVRNSWGT WGEDGYFKLERNVKHTT GKCGIAM ASYPVKN NNPT SYLSL++   DK
Sbjct: 301 LVRNSWGTGWGEDGYFKLERNVKHTTEGKCGIAMQASYPVKNDNNPTKSYLSLKSVE-DK 360

Query: 361 NKINIA 367
            KIN A
Sbjct: 361 YKINSA 365

BLAST of HG10016710 vs. NCBI nr
Match: XP_004141903.1 (cysteine proteinase COT44 [Cucumis sativus] >KGN48548.1 hypothetical protein Csa_004136 [Cucumis sativus])

HSP 1 Score: 664.5 bits (1713), Expect = 5.4e-187
Identity = 330/366 (90.16%), Postives = 343/366 (93.72%), Query Frame = 0

Query: 1   MAAAPSLLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFK 60
           MA A + LALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGI+EREKRFQIFK
Sbjct: 1   MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFK 60

Query: 61  DNLNFIDDHNSENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNN 120
           +NL FIDDHNSENRTYKVGLN FADLTN+EYRA+YLGTRSPPARRVMKAK+ASRRYAVNN
Sbjct: 61  ENLKFIDDHNSENRTYKVGLNMFADLTNEEYRALYLGTRSPPARRVMKAKTASRRYAVNN 120

Query: 121 RDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCD 180
            DRLPES+DWR+RGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELV+CD
Sbjct: 121 LDRLPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCD 180

Query: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLA 240
           KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYE  DGQCDPTRKNAKVV+ID YEDV A
Sbjct: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDAYEDVPA 240

Query: 241 NDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW 300
           NDEE+LKKA+AHQPVSVAIEA GLALQLYQSGVFTGKCGSALDHGVVAVGYG ENGVDYW
Sbjct: 241 NDEESLKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGKENGVDYW 300

Query: 301 LVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNGNNPTTSYLSLETTTGDK 360
           LVRNSWGT WGEDGYFKLERNVKH T GKCGIAM ASYPVKN NNPT SYLSL+    DK
Sbjct: 301 LVRNSWGTSWGEDGYFKLERNVKHITEGKCGIAMQASYPVKNDNNPTKSYLSLKIAE-DK 360

Query: 361 NKINIA 367
           NKIN A
Sbjct: 361 NKINTA 365

BLAST of HG10016710 vs. NCBI nr
Match: XP_023518142.1 (cysteine proteinase COT44-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 644.4 bits (1661), Expect = 5.8e-181
Identity = 323/368 (87.77%), Postives = 342/368 (92.93%), Query Frame = 0

Query: 1   MAAAPSLLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFK 60
           MA A + LA LSFF LSI  SAL++R+DGEVREIYD+WLAKHGKAYNGIEEREKRF IFK
Sbjct: 1   MALATTFLAFLSFFVLSI--SALNQRTDGEVREIYDIWLAKHGKAYNGIEEREKRFLIFK 60

Query: 61  DNLNFIDDHNSENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNN 120
           DNLNFID+HNS+NRTY VGLN FADLTN+EYRA +LGTRS PARRVMKAKSASRRYAVN+
Sbjct: 61  DNLNFIDEHNSQNRTYTVGLNMFADLTNEEYRATFLGTRSHPARRVMKAKSASRRYAVND 120

Query: 121 RDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCD 180
            DRLPESVDWR+RGAVAP+KNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELV+CD
Sbjct: 121 ADRLPESVDWRTRGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCD 180

Query: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLA 240
            KYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEG+DGQCDPTR+NAKVV+IDGYEDV A
Sbjct: 181 TKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGLDGQCDPTRENAKVVSIDGYEDVPA 240

Query: 241 NDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW 300
           NDEEALKKA+AHQPVSVAIEA GLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW
Sbjct: 241 NDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW 300

Query: 301 LVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNG--NNPTTSYLSLETTTG 360
           LVRNSWGT WGEDGYFKLERNVKHTT+GKCGIAM ASYPVKNG  NNPT SYL LE   G
Sbjct: 301 LVRNSWGTGWGEDGYFKLERNVKHTTDGKCGIAMEASYPVKNGNNNNPTGSYLGLE-LAG 360

Query: 361 DKNKINIA 367
           DKNKI+ A
Sbjct: 361 DKNKISSA 365

BLAST of HG10016710 vs. NCBI nr
Match: KAG7027046.1 (Oryzain alpha chain [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 642.5 bits (1656), Expect = 2.2e-180
Identity = 321/368 (87.23%), Postives = 342/368 (92.93%), Query Frame = 0

Query: 1   MAAAPSLLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFK 60
           MA A + LA LSFF LSI  SAL++R+DGEVREIYD+WLAKHGKAYNGIEEREKRF IFK
Sbjct: 1   MALATTFLAFLSFFVLSI--SALNQRTDGEVREIYDIWLAKHGKAYNGIEEREKRFLIFK 60

Query: 61  DNLNFIDDHNSENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNN 120
           DNLNF+D+HNS+NRTY VGLN FADLTN+EYRA +LGTRS PARRVMKAKSASRRYAVN+
Sbjct: 61  DNLNFVDEHNSQNRTYTVGLNMFADLTNEEYRATFLGTRSHPARRVMKAKSASRRYAVND 120

Query: 121 RDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCD 180
            DRLPESVDWR++GAVAP+KNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELV+CD
Sbjct: 121 DDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCD 180

Query: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLA 240
            KYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEG+DGQCDPTR+NAKVV+IDGYEDV A
Sbjct: 181 TKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGLDGQCDPTRENAKVVSIDGYEDVPA 240

Query: 241 NDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW 300
           NDEEALKKA+AHQPVSVAIEA GLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW
Sbjct: 241 NDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW 300

Query: 301 LVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNG--NNPTTSYLSLETTTG 360
           LVRNSWGT WGEDGYFKLERNVKHTT+GKCGIAM ASYPVKNG  NNPT SYL LE   G
Sbjct: 301 LVRNSWGTGWGEDGYFKLERNVKHTTDGKCGIAMEASYPVKNGNNNNPTGSYLGLE-LAG 360

Query: 361 DKNKINIA 367
           DKNKI+ A
Sbjct: 361 DKNKISSA 365

BLAST of HG10016710 vs. ExPASy Swiss-Prot
Match: Q9FMH8 (Probable cysteine protease RD21B OS=Arabidopsis thaliana OX=3702 GN=RD21B PE=1 SV=1)

HSP 1 Score: 454.1 bits (1167), Expect = 1.5e-126
Identity = 220/325 (67.69%), Postives = 258/325 (79.38%), Query Frame = 0

Query: 26  RSDGEVREIYDLWLAKHGKA---YNGI-EEREKRFQIFKDNLNFIDDHNSENRTYKVGLN 85
           RSD EV  IY+ W+ +HGK     NG+  E+++RF+IFKDNL FID+HN++N +YK+GL 
Sbjct: 41  RSDSEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTKNLSYKLGLT 100

Query: 86  KFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNNRDRLPESVDWRSRGAVAPVKN 145
           +FADLTN+EYR++YLG +  P +RV+K    S RY     D LP+SVDWR  GAVA VK+
Sbjct: 101 RFADLTNEEYRSMYLGAK--PTKRVLK---TSDRYQARVGDALPDSVDWRKEGAVADVKD 160

Query: 146 QGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQFIID 205
           QGSCGSCWAFSTI AVEGIN+IVTG+LISLSEQELV+CD  YN GCNGGLMDYAF+FII 
Sbjct: 161 QGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIK 220

Query: 206 NGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVAIEA 265
           NGG+DTE DYPY+  DG+CD  RKNAKVV ID YEDV  N E +LKKA+AHQP+SVAIEA
Sbjct: 221 NGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEA 280

Query: 266 GGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTEWGEDGYFKLERN 325
           GG A QLY SGVF G CG+ LDHGVVAVGYGTENG DYW+VRNSWG  WGE GY K+ RN
Sbjct: 281 GGRAFQLYSSGVFDGLCGTELDHGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARN 340

Query: 326 VKHTTNGKCGIAMMASYPVKNGNNP 347
           ++  T GKCGIAM ASYP+K G NP
Sbjct: 341 IEAPT-GKCGIAMEASYPIKKGQNP 359

BLAST of HG10016710 vs. ExPASy Swiss-Prot
Match: Q94B08 (Germination-specific cysteine protease 1 OS=Arabidopsis thaliana OX=3702 GN=GCP1 PE=2 SV=2)

HSP 1 Score: 454.1 bits (1167), Expect = 1.5e-126
Identity = 220/329 (66.87%), Postives = 266/329 (80.85%), Query Frame = 0

Query: 26  RSDGEVREIYDLWLAKHGKAYNG----IEEREKRFQIFKDNLNFIDDHNSENR--TYKVG 85
           R+D EVR IY  W A+HGK  N     I +++KRF IFKDNL FID HN +N+  TYK+G
Sbjct: 40  RTDEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLG 99

Query: 86  LNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRY--AVNNRDRLPESVDWRSRGAVA 145
           L KF DLTNDEYR +YLG R+ PARR+ KAK+ +++Y  AVN ++ +PE+VDWR +GAV 
Sbjct: 100 LTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKE-VPETVDWRQKGAVN 159

Query: 146 PVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQ 205
           P+K+QG+CGSCWAFST AAVEGIN+IVTGELISLSEQELV+CDK YN GCNGGLMDYAFQ
Sbjct: 160 PIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQ 219

Query: 206 FIIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSV 265
           FI+ NGGL+TE+DYPY G  G+C+   KN++VV+IDGYEDV   DE ALKKAI++QPVSV
Sbjct: 220 FIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSV 279

Query: 266 AIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTEWGEDGYFK 325
           AIEAGG   Q YQSG+FTG CG+ LDH VVAVGYG+ENGVDYW+VRNSWG  WGE+GY +
Sbjct: 280 AIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIR 339

Query: 326 LERNVKHTTNGKCGIAMMASYPVKNGNNP 347
           +ERN+  + +GKCGIA+ ASYPVK   NP
Sbjct: 340 MERNLAASKSGKCGIAVEASYPVKYSPNP 367

BLAST of HG10016710 vs. ExPASy Swiss-Prot
Match: P25776 (Oryzain alpha chain OS=Oryza sativa subsp. japonica OX=39947 GN=Os04g0650000 PE=1 SV=2)

HSP 1 Score: 451.1 bits (1159), Expect = 1.2e-125
Identity = 218/348 (62.64%), Postives = 267/348 (76.72%), Query Frame = 0

Query: 3   AAPSLLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKDN 62
           AA +LL LLS     +S  +   RS+ E R +Y  W A+HGK+YN + E E+R+  F+DN
Sbjct: 8   AAAALLLLLSLAAADMSIVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDN 67

Query: 63  LNFIDDHNSEN----RTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAV 122
           L +ID+HN+       ++++GLN+FADLTN+EYR  YLG R+ P R     +  S RY  
Sbjct: 68  LRYIDEHNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRE----RKVSDRYLA 127

Query: 123 NNRDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVN 182
            + + LPESVDWR++GAVA +K+QG CGSCWAFS IAAVEGINQIVTG+LISLSEQELV+
Sbjct: 128 ADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVD 187

Query: 183 CDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDV 242
           CD  YN GCNGGLMDYAF FII+NGG+DTE+DYPY+G D +CD  RKNAKVV ID YEDV
Sbjct: 188 CDTSYNEGCNGGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDV 247

Query: 243 LANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVD 302
             N E +L+KA+A+QPVSVAIEAGG A QLY SG+FTGKCG+ALDHGV AVGYGTENG D
Sbjct: 248 TPNSETSLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKD 307

Query: 303 YWLVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNGNNP 347
           YW+VRNSWG  WGE GY ++ERN+K  ++GKCGIA+  SYP+K G NP
Sbjct: 308 YWIVRNSWGKSWGESGYVRMERNIK-ASSGKCGIAVEPSYPLKKGENP 350

BLAST of HG10016710 vs. ExPASy Swiss-Prot
Match: P43297 (Cysteine proteinase RD21A OS=Arabidopsis thaliana OX=3702 GN=RD21A PE=1 SV=1)

HSP 1 Score: 450.7 bits (1158), Expect = 1.6e-125
Identity = 212/328 (64.63%), Postives = 255/328 (77.74%), Query Frame = 0

Query: 21  SALSRRSDGEVREIYDLWLAKHGKA--YNGIEEREKRFQIFKDNLNFIDDHNSENRTYKV 80
           S    RS+ EV  IY+ WL KHGKA   N + E+++RF+IFKDNL F+D+HN +N +Y++
Sbjct: 36  STTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRL 95

Query: 81  GLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNNRDRLPESVDWRSRGAVAP 140
           GL +FADLTNDEYR+ YLG +          +  S RY     D LPES+DWR +GAVA 
Sbjct: 96  GLTRFADLTNDEYRSKYLGAKMEKKGE----RRTSLRYEARVGDELPESIDWRKKGAVAE 155

Query: 141 VKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQF 200
           VK+QG CGSCWAFSTI AVEGINQIVTG+LI+LSEQELV+CD  YN GCNGGLMDYAF+F
Sbjct: 156 VKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEF 215

Query: 201 IIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVA 260
           II NGG+DT++DYPY+GVDG CD  RKNAKVV ID YEDV    EE+LKKA+AHQP+S+A
Sbjct: 216 IIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIA 275

Query: 261 IEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTEWGEDGYFKL 320
           IEAGG A QLY SG+F G CG+ LDHGVVAVGYGTENG DYW+VRNSWG  WGE GY ++
Sbjct: 276 IEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRM 335

Query: 321 ERNVKHTTNGKCGIAMMASYPVKNGNNP 347
            RN+  +++GKCGIA+  SYP+KNG NP
Sbjct: 336 ARNIA-SSSGKCGIAIEPSYPIKNGENP 358

BLAST of HG10016710 vs. ExPASy Swiss-Prot
Match: Q9LT78 (Probable cysteine protease RD21C OS=Arabidopsis thaliana OX=3702 GN=RD21C PE=1 SV=1)

HSP 1 Score: 440.3 bits (1131), Expect = 2.2e-122
Identity = 220/348 (63.22%), Postives = 272/348 (78.16%), Query Frame = 0

Query: 8   LALLSFFFLSISASALS------RRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKD 67
           LALL F  L IS S  S       R++ E R +Y+ WL ++ K YNG+ E+E+RF+IFKD
Sbjct: 10  LALLIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERRFEIFKD 69

Query: 68  NLNFIDDHNS-ENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNN 127
           NL F+++H+S  NRTY+VGL +FADLTNDE+RA+YL ++    R  +K +    +Y    
Sbjct: 70  NLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGE----KYLYKV 129

Query: 128 RDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCD 187
            D LP+++DWR++GAV PVK+QGSCGSCWAFS I AVEGINQI TGELISLSEQELV+CD
Sbjct: 130 GDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCD 189

Query: 188 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVD-GQCDPTRKNAKVVNIDGYEDVL 247
             YN GC GGLMDYAF+FII+NGG+DTEEDYPY   D   C+  +KN +VV IDGYEDV 
Sbjct: 190 TSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDVP 249

Query: 248 ANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDY 307
            NDE++LKKA+A+QP+SVAIEAGG A QLY SGVFTG CG++LDHGVVAVGYG+E G DY
Sbjct: 250 QNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYGSEGGQDY 309

Query: 308 WLVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVK-NGNNP 347
           W+VRNSWG+ WGE GYFKLERN+K  ++GKCG+AMMASYP K +G+NP
Sbjct: 310 WIVRNSWGSNWGESGYFKLERNIKE-SSGKCGVAMMASYPTKSSGSNP 352

BLAST of HG10016710 vs. ExPASy TrEMBL
Match: A0A5D3CLQ0 (Cysteine proteinase COT44-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G004590 PE=3 SV=1)

HSP 1 Score: 671.0 bits (1730), Expect = 2.8e-189
Identity = 334/366 (91.26%), Postives = 346/366 (94.54%), Query Frame = 0

Query: 1   MAAAPSLLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFK 60
           MA A + LALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGI+EREKRFQIFK
Sbjct: 1   MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFK 60

Query: 61  DNLNFIDDHNSENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNN 120
           +NLNFIDDHNSENRTYKVGLN FADLTNDEYRA+YLGTRSPPARRVMKAK+ASRRYAVNN
Sbjct: 61  ENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRYAVNN 120

Query: 121 RDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCD 180
           RDRLPESVDWR+RGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTG+LISLSEQELV+CD
Sbjct: 121 RDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCD 180

Query: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLA 240
           KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYE  DGQCDPTRKNAKVV+ID YEDV A
Sbjct: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDSYEDVPA 240

Query: 241 NDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW 300
           NDEEALKKA+AHQPVSVAIEA GLALQLYQSGVFTGKCGSALDHGVVAVGYG ENGVDYW
Sbjct: 241 NDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGKENGVDYW 300

Query: 301 LVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNGNNPTTSYLSLETTTGDK 360
           LVRNSWGT WGEDGYFKLERNVKHTT GKCGIAM ASYPVKN NNPT SYLSL++   DK
Sbjct: 301 LVRNSWGTGWGEDGYFKLERNVKHTTEGKCGIAMQASYPVKNDNNPTKSYLSLKSVE-DK 360

Query: 361 NKINIA 367
            KIN A
Sbjct: 361 YKINSA 365

BLAST of HG10016710 vs. ExPASy TrEMBL
Match: A0A1S3B0U6 (cysteine proteinase COT44-like OS=Cucumis melo OX=3656 GN=LOC103484794 PE=3 SV=1)

HSP 1 Score: 671.0 bits (1730), Expect = 2.8e-189
Identity = 334/366 (91.26%), Postives = 346/366 (94.54%), Query Frame = 0

Query: 1   MAAAPSLLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFK 60
           MA A + LALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGI+EREKRFQIFK
Sbjct: 1   MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFK 60

Query: 61  DNLNFIDDHNSENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNN 120
           +NLNFIDDHNSENRTYKVGLN FADLTNDEYRA+YLGTRSPPARRVMKAK+ASRRYAVNN
Sbjct: 61  ENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRYAVNN 120

Query: 121 RDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCD 180
           RDRLPESVDWR+RGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTG+LISLSEQELV+CD
Sbjct: 121 RDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCD 180

Query: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLA 240
           KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYE  DGQCDPTRKNAKVV+ID YEDV A
Sbjct: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDSYEDVPA 240

Query: 241 NDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW 300
           NDEEALKKA+AHQPVSVAIEA GLALQLYQSGVFTGKCGSALDHGVVAVGYG ENGVDYW
Sbjct: 241 NDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGKENGVDYW 300

Query: 301 LVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNGNNPTTSYLSLETTTGDK 360
           LVRNSWGT WGEDGYFKLERNVKHTT GKCGIAM ASYPVKN NNPT SYLSL++   DK
Sbjct: 301 LVRNSWGTGWGEDGYFKLERNVKHTTEGKCGIAMQASYPVKNDNNPTKSYLSLKSVE-DK 360

Query: 361 NKINIA 367
            KIN A
Sbjct: 361 YKINSA 365

BLAST of HG10016710 vs. ExPASy TrEMBL
Match: A0A0A0KGD9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G491600 PE=3 SV=1)

HSP 1 Score: 664.5 bits (1713), Expect = 2.6e-187
Identity = 330/366 (90.16%), Postives = 343/366 (93.72%), Query Frame = 0

Query: 1   MAAAPSLLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFK 60
           MA A + LALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGI+EREKRFQIFK
Sbjct: 1   MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFK 60

Query: 61  DNLNFIDDHNSENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNN 120
           +NL FIDDHNSENRTYKVGLN FADLTN+EYRA+YLGTRSPPARRVMKAK+ASRRYAVNN
Sbjct: 61  ENLKFIDDHNSENRTYKVGLNMFADLTNEEYRALYLGTRSPPARRVMKAKTASRRYAVNN 120

Query: 121 RDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCD 180
            DRLPES+DWR+RGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELV+CD
Sbjct: 121 LDRLPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCD 180

Query: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLA 240
           KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYE  DGQCDPTRKNAKVV+ID YEDV A
Sbjct: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDAYEDVPA 240

Query: 241 NDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW 300
           NDEE+LKKA+AHQPVSVAIEA GLALQLYQSGVFTGKCGSALDHGVVAVGYG ENGVDYW
Sbjct: 241 NDEESLKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGKENGVDYW 300

Query: 301 LVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNGNNPTTSYLSLETTTGDK 360
           LVRNSWGT WGEDGYFKLERNVKH T GKCGIAM ASYPVKN NNPT SYLSL+    DK
Sbjct: 301 LVRNSWGTSWGEDGYFKLERNVKHITEGKCGIAMQASYPVKNDNNPTKSYLSLKIAE-DK 360

Query: 361 NKINIA 367
           NKIN A
Sbjct: 361 NKINTA 365

BLAST of HG10016710 vs. ExPASy TrEMBL
Match: A0A6J1HGZ6 (zingipain-2-like OS=Cucurbita moschata OX=3662 GN=LOC111463384 PE=3 SV=1)

HSP 1 Score: 642.1 bits (1655), Expect = 1.4e-180
Identity = 321/368 (87.23%), Postives = 342/368 (92.93%), Query Frame = 0

Query: 1   MAAAPSLLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFK 60
           MA A + LA LSFF LSI  SAL++R+DGEVREIYD+WLAKHGKAYNGIEEREKRF IFK
Sbjct: 1   MALATTFLAFLSFFVLSI--SALNQRTDGEVREIYDIWLAKHGKAYNGIEEREKRFLIFK 60

Query: 61  DNLNFIDDHNSENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNN 120
           DNLNF+D+HNS+NRTY VGLN FADLTN+EYRA +LGTRS PARRVMKAKSASRRYAVN+
Sbjct: 61  DNLNFLDEHNSQNRTYTVGLNMFADLTNEEYRATFLGTRSHPARRVMKAKSASRRYAVND 120

Query: 121 RDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCD 180
            DRLPESVDWR++GAVAP+KNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELV+CD
Sbjct: 121 DDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCD 180

Query: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLA 240
            KYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEG+DGQCDPTR+NAKVV+IDGYEDV A
Sbjct: 181 TKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGLDGQCDPTRENAKVVSIDGYEDVPA 240

Query: 241 NDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW 300
           NDEEALKKA+AHQPVSVAIEA GLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW
Sbjct: 241 NDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW 300

Query: 301 LVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNG--NNPTTSYLSLETTTG 360
           LVRNSWGT WGEDGYFKLERNVKHTT+GKCGIAM ASYPVKNG  NNPT SYL LE   G
Sbjct: 301 LVRNSWGTGWGEDGYFKLERNVKHTTDGKCGIAMEASYPVKNGNNNNPTGSYLGLE-LAG 360

Query: 361 DKNKINIA 367
           DKNKI+ A
Sbjct: 361 DKNKISSA 365

BLAST of HG10016710 vs. ExPASy TrEMBL
Match: A0A6J1KPD5 (cysteine proteinase COT44-like OS=Cucurbita maxima OX=3661 GN=LOC111497083 PE=3 SV=1)

HSP 1 Score: 637.9 bits (1644), Expect = 2.6e-179
Identity = 321/368 (87.23%), Postives = 339/368 (92.12%), Query Frame = 0

Query: 1   MAAAPSLLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFK 60
           MA A + LA LSFF LSI  SAL++RSDGEVREIYD+WLAKHGKAYNGIEE EKRF IFK
Sbjct: 1   MALATASLAFLSFFVLSI--SALNQRSDGEVREIYDMWLAKHGKAYNGIEELEKRFLIFK 60

Query: 61  DNLNFIDDHNSENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNN 120
           DNLNFID+HNS NRTY VGLN FADLTN+EYRA +LGTRS PARRVMKAKSASRRYAVN+
Sbjct: 61  DNLNFIDEHNSHNRTYTVGLNMFADLTNEEYRAAFLGTRSQPARRVMKAKSASRRYAVND 120

Query: 121 RDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCD 180
            DRLPESVDWR++GAVAP+KNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELV+CD
Sbjct: 121 GDRLPESVDWRTKGAVAPIKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCD 180

Query: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLA 240
            KYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEG+DGQCDPTR+NAKVV+IDGYEDV A
Sbjct: 181 TKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGLDGQCDPTRENAKVVSIDGYEDVPA 240

Query: 241 NDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW 300
           NDEEALKKA+AHQPVSVAIEA GLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW
Sbjct: 241 NDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW 300

Query: 301 LVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNG--NNPTTSYLSLETTTG 360
           L RNSWGT WGEDGYFKLERNVKHTT+GKCGIAM ASYPVKNG  NNPT SYL LE   G
Sbjct: 301 LARNSWGTGWGEDGYFKLERNVKHTTDGKCGIAMEASYPVKNGNNNNPTGSYLGLE-LGG 360

Query: 361 DKNKINIA 367
           DKNKI+ A
Sbjct: 361 DKNKISSA 365

BLAST of HG10016710 vs. TAIR 10
Match: AT5G43060.1 (Granulin repeat cysteine protease family protein )

HSP 1 Score: 454.1 bits (1167), Expect = 1.0e-127
Identity = 220/325 (67.69%), Postives = 258/325 (79.38%), Query Frame = 0

Query: 26  RSDGEVREIYDLWLAKHGKA---YNGI-EEREKRFQIFKDNLNFIDDHNSENRTYKVGLN 85
           RSD EV  IY+ W+ +HGK     NG+  E+++RF+IFKDNL FID+HN++N +YK+GL 
Sbjct: 41  RSDSEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTKNLSYKLGLT 100

Query: 86  KFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNNRDRLPESVDWRSRGAVAPVKN 145
           +FADLTN+EYR++YLG +  P +RV+K    S RY     D LP+SVDWR  GAVA VK+
Sbjct: 101 RFADLTNEEYRSMYLGAK--PTKRVLK---TSDRYQARVGDALPDSVDWRKEGAVADVKD 160

Query: 146 QGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQFIID 205
           QGSCGSCWAFSTI AVEGIN+IVTG+LISLSEQELV+CD  YN GCNGGLMDYAF+FII 
Sbjct: 161 QGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIK 220

Query: 206 NGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVAIEA 265
           NGG+DTE DYPY+  DG+CD  RKNAKVV ID YEDV  N E +LKKA+AHQP+SVAIEA
Sbjct: 221 NGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEA 280

Query: 266 GGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTEWGEDGYFKLERN 325
           GG A QLY SGVF G CG+ LDHGVVAVGYGTENG DYW+VRNSWG  WGE GY K+ RN
Sbjct: 281 GGRAFQLYSSGVFDGLCGTELDHGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARN 340

Query: 326 VKHTTNGKCGIAMMASYPVKNGNNP 347
           ++  T GKCGIAM ASYP+K G NP
Sbjct: 341 IEAPT-GKCGIAMEASYPIKKGQNP 359

BLAST of HG10016710 vs. TAIR 10
Match: AT4G36880.1 (cysteine proteinase1 )

HSP 1 Score: 453.4 bits (1165), Expect = 1.8e-127
Identity = 220/329 (66.87%), Postives = 265/329 (80.55%), Query Frame = 0

Query: 26  RSDGEVREIYDLWLAKHGKAYNG----IEEREKRFQIFKDNLNFIDDHNSENR--TYKVG 85
           R+D EVR IY  W A+HGK  N     I +++KRF IFKDNL FID HN  N+  TYK+G
Sbjct: 40  RTDEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLG 99

Query: 86  LNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRY--AVNNRDRLPESVDWRSRGAVA 145
           L KF DLTNDEYR +YLG R+ PARR+ KAK+ +++Y  AVN ++ +PE+VDWR +GAV 
Sbjct: 100 LTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKE-VPETVDWRQKGAVN 159

Query: 146 PVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQ 205
           P+K+QG+CGSCWAFST AAVEGIN+IVTGELISLSEQELV+CDK YN GCNGGLMDYAFQ
Sbjct: 160 PIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQ 219

Query: 206 FIIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSV 265
           FI+ NGGL+TE+DYPY G  G+C+   KN++VV+IDGYEDV   DE ALKKAI++QPVSV
Sbjct: 220 FIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSV 279

Query: 266 AIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTEWGEDGYFK 325
           AIEAGG   Q YQSG+FTG CG+ LDH VVAVGYG+ENGVDYW+VRNSWG  WGE+GY +
Sbjct: 280 AIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIR 339

Query: 326 LERNVKHTTNGKCGIAMMASYPVKNGNNP 347
           +ERN+  + +GKCGIA+ ASYPVK   NP
Sbjct: 340 MERNLAASKSGKCGIAVEASYPVKYSPNP 367

BLAST of HG10016710 vs. TAIR 10
Match: AT1G47128.1 (Granulin repeat cysteine protease family protein )

HSP 1 Score: 450.7 bits (1158), Expect = 1.1e-126
Identity = 212/328 (64.63%), Postives = 255/328 (77.74%), Query Frame = 0

Query: 21  SALSRRSDGEVREIYDLWLAKHGKA--YNGIEEREKRFQIFKDNLNFIDDHNSENRTYKV 80
           S    RS+ EV  IY+ WL KHGKA   N + E+++RF+IFKDNL F+D+HN +N +Y++
Sbjct: 36  STTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRL 95

Query: 81  GLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNNRDRLPESVDWRSRGAVAP 140
           GL +FADLTNDEYR+ YLG +          +  S RY     D LPES+DWR +GAVA 
Sbjct: 96  GLTRFADLTNDEYRSKYLGAKMEKKGE----RRTSLRYEARVGDELPESIDWRKKGAVAE 155

Query: 141 VKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQF 200
           VK+QG CGSCWAFSTI AVEGINQIVTG+LI+LSEQELV+CD  YN GCNGGLMDYAF+F
Sbjct: 156 VKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEF 215

Query: 201 IIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVA 260
           II NGG+DT++DYPY+GVDG CD  RKNAKVV ID YEDV    EE+LKKA+AHQP+S+A
Sbjct: 216 IIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIA 275

Query: 261 IEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTEWGEDGYFKL 320
           IEAGG A QLY SG+F G CG+ LDHGVVAVGYGTENG DYW+VRNSWG  WGE GY ++
Sbjct: 276 IEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRM 335

Query: 321 ERNVKHTTNGKCGIAMMASYPVKNGNNP 347
            RN+  +++GKCGIA+  SYP+KNG NP
Sbjct: 336 ARNIA-SSSGKCGIAIEPSYPIKNGENP 358

BLAST of HG10016710 vs. TAIR 10
Match: AT3G19390.1 (Granulin repeat cysteine protease family protein )

HSP 1 Score: 440.3 bits (1131), Expect = 1.5e-123
Identity = 220/348 (63.22%), Postives = 272/348 (78.16%), Query Frame = 0

Query: 8   LALLSFFFLSISASALS------RRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKD 67
           LALL F  L IS S  S       R++ E R +Y+ WL ++ K YNG+ E+E+RF+IFKD
Sbjct: 10  LALLIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERRFEIFKD 69

Query: 68  NLNFIDDHNS-ENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNN 127
           NL F+++H+S  NRTY+VGL +FADLTNDE+RA+YL ++    R  +K +    +Y    
Sbjct: 70  NLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGE----KYLYKV 129

Query: 128 RDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCD 187
            D LP+++DWR++GAV PVK+QGSCGSCWAFS I AVEGINQI TGELISLSEQELV+CD
Sbjct: 130 GDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCD 189

Query: 188 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVD-GQCDPTRKNAKVVNIDGYEDVL 247
             YN GC GGLMDYAF+FII+NGG+DTEEDYPY   D   C+  +KN +VV IDGYEDV 
Sbjct: 190 TSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDVP 249

Query: 248 ANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDY 307
            NDE++LKKA+A+QP+SVAIEAGG A QLY SGVFTG CG++LDHGVVAVGYG+E G DY
Sbjct: 250 QNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYGSEGGQDY 309

Query: 308 WLVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVK-NGNNP 347
           W+VRNSWG+ WGE GYFKLERN+K  ++GKCG+AMMASYP K +G+NP
Sbjct: 310 WIVRNSWGSNWGESGYFKLERNIKE-SSGKCGVAMMASYPTKSSGSNP 352

BLAST of HG10016710 vs. TAIR 10
Match: AT3G19400.1 (Cysteine proteinases superfamily protein )

HSP 1 Score: 392.1 bits (1006), Expect = 4.8e-109
Identity = 200/355 (56.34%), Postives = 260/355 (73.24%), Query Frame = 0

Query: 1   MAAAP-----SLLALLSFFFLSISASALS----RRSDGEVREIYDLWLAKHGKAYNGIEE 60
           MAA P     S L +LS   LS S    +     R++ EVR +Y+ WL ++ K YNG+ E
Sbjct: 1   MAATPIRVIVSALVILSVLLLSSSLGVATETEIERNETEVRLMYEQWLVENRKNYNGLGE 60

Query: 61  REKRFQIFKDNLNFIDDHNS-ENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAK 120
           +E+RF+IFKDNL F+D+HNS  +RT++VGL +FADLTN+E+RA+YL  +    +  +K  
Sbjct: 61  KERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYLRKKMERTKDSVK-- 120

Query: 121 SASRRYAVNNRDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELIS 180
             + RY     D LP+ VDWR+ GAV  VK+QG+CGSCWAFS + AVEGINQI TGELIS
Sbjct: 121 --TERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQITTGELIS 180

Query: 181 LSEQELVNCDKKY-NSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVD-GQCDPTR-KNA 240
           LSEQELV+CD+ + N+GC+GG+M+YAF+FI+ NGG++T++DYPY   D G C+  +  N 
Sbjct: 181 LSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNANDLGLCNADKNNNT 240

Query: 241 KVVNIDGYEDVLANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVV 300
           +VV IDGYEDV  +DE++LKKA+AHQPVSVAIEA   A QLY+SGV TG CG +LDHGVV
Sbjct: 241 RVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGVMTGTCGISLDHGVV 300

Query: 301 AVGYGTENGVDYWLVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKN 343
            VGYG+ +G DYW++RNSWG  WG+ GY KL+RN+     GKCGIAMM SYP K+
Sbjct: 301 VVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRNIDDPF-GKCGIAMMPSYPTKS 350

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038880922.15.2e-19092.10cysteine proteinase COT44-like [Benincasa hispida][more]
XP_008440309.15.8e-18991.26PREDICTED: cysteine proteinase COT44-like [Cucumis melo] >TYK12867.1 cysteine pr... [more]
XP_004141903.15.4e-18790.16cysteine proteinase COT44 [Cucumis sativus] >KGN48548.1 hypothetical protein Csa... [more]
XP_023518142.15.8e-18187.77cysteine proteinase COT44-like [Cucurbita pepo subsp. pepo][more]
KAG7027046.12.2e-18087.23Oryzain alpha chain [Cucurbita argyrosperma subsp. argyrosperma][more]
Match NameE-valueIdentityDescription
Q9FMH81.5e-12667.69Probable cysteine protease RD21B OS=Arabidopsis thaliana OX=3702 GN=RD21B PE=1 S... [more]
Q94B081.5e-12666.87Germination-specific cysteine protease 1 OS=Arabidopsis thaliana OX=3702 GN=GCP1... [more]
P257761.2e-12562.64Oryzain alpha chain OS=Oryza sativa subsp. japonica OX=39947 GN=Os04g0650000 PE=... [more]
P432971.6e-12564.63Cysteine proteinase RD21A OS=Arabidopsis thaliana OX=3702 GN=RD21A PE=1 SV=1[more]
Q9LT782.2e-12263.22Probable cysteine protease RD21C OS=Arabidopsis thaliana OX=3702 GN=RD21C PE=1 S... [more]
Match NameE-valueIdentityDescription
A0A5D3CLQ02.8e-18991.26Cysteine proteinase COT44-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s... [more]
A0A1S3B0U62.8e-18991.26cysteine proteinase COT44-like OS=Cucumis melo OX=3656 GN=LOC103484794 PE=3 SV=1[more]
A0A0A0KGD92.6e-18790.16Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G491600 PE=3 SV=1[more]
A0A6J1HGZ61.4e-18087.23zingipain-2-like OS=Cucurbita moschata OX=3662 GN=LOC111463384 PE=3 SV=1[more]
A0A6J1KPD52.6e-17987.23cysteine proteinase COT44-like OS=Cucurbita maxima OX=3661 GN=LOC111497083 PE=3 ... [more]
Match NameE-valueIdentityDescription
AT5G43060.11.0e-12767.69Granulin repeat cysteine protease family protein [more]
AT4G36880.11.8e-12766.87cysteine proteinase1 [more]
AT1G47128.11.1e-12664.63Granulin repeat cysteine protease family protein [more]
AT3G19390.11.5e-12363.22Granulin repeat cysteine protease family protein [more]
AT3G19400.14.8e-10956.34Cysteine proteinases superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000668Peptidase C1A, papain C-terminalPRINTSPR00705PAPAINcoord: 299..305
score: 69.74
coord: 142..157
score: 65.92
coord: 284..294
score: 58.59
IPR000668Peptidase C1A, papain C-terminalSMARTSM00645pept_c1coord: 124..340
e-value: 4.3E-125
score: 431.6
IPR000668Peptidase C1A, papain C-terminalPFAMPF00112Peptidase_C1coord: 124..340
e-value: 1.6E-83
score: 280.1
IPR013201Cathepsin propeptide inhibitor domain (I29)SMARTSM00848Inhibitor_I29_2coord: 35..91
e-value: 2.9E-26
score: 103.2
IPR013201Cathepsin propeptide inhibitor domain (I29)PFAMPF08246Inhibitor_I29coord: 35..91
e-value: 3.4E-18
score: 65.8
NoneNo IPR availableGENE3D3.90.70.10Cysteine proteinasescoord: 17..342
e-value: 7.0E-123
score: 412.2
NoneNo IPR availablePANTHERPTHR12411:SF749CYSTEINE PROTEASEcoord: 21..350
NoneNo IPR availablePANTHERPTHR12411CYSTEINE PROTEASE FAMILY C1-RELATEDcoord: 21..350
IPR000169Cysteine peptidase, cysteine active sitePROSITEPS00139THIOL_PROTEASE_CYScoord: 142..153
IPR025660Cysteine peptidase, histidine active sitePROSITEPS00639THIOL_PROTEASE_HIScoord: 282..292
IPR025661Cysteine peptidase, asparagine active sitePROSITEPS00640THIOL_PROTEASE_ASNcoord: 299..318
IPR039417Papain-like cysteine endopeptidaseCDDcd02248Peptidase_C1Acoord: 125..339
e-value: 6.06982E-112
score: 323.422
IPR038765Papain-like cysteine peptidase superfamilySUPERFAMILY54001Cysteine proteinasescoord: 28..340

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10016710.1HG10016710.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0051603 proteolysis involved in cellular protein catabolic process
biological_process GO:0006508 proteolysis
cellular_component GO:0005615 extracellular space
cellular_component GO:0005764 lysosome
molecular_function GO:0004197 cysteine-type endopeptidase activity
molecular_function GO:0008234 cysteine-type peptidase activity