CmaCh20G009810 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh20G009810
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Descriptionaspartic proteinase-like protein 2 isoform X1
LocationCma_Chr20: 5217563 .. 5223142 (+)
RNA-Seq ExpressionCmaCh20G009810
SyntenyCmaCh20G009810
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCTCGCTTTCTTTAAAATGAATCATCCTTCACCCAACTTTAACTGCAATTGTTTATTTTGAATCAATCAATCAGCTCTCACCTTGATTTCGCATCCCTTCAATCCATTCTCCTTCTTCCCTCTTCGATCTGCCTCCTTCCGTTGACCTTTCCCATATCCAATCGCCAATGGCACGAACACCCAATCTACTCCTCGCTGTTCTCCTCCATTTCTTGCACCTCACGCATTTCACACTCTCCGCCGATCCCATCTCCTCCAATCCTCTCCTTACGCCTTCGCATCGCGCAATGGTGTTGCCTCTTTATCGCTCTTCTCCCAATTCCTCCAAATTGATCTCCAAGCCTCACCGTCGTCTCCGCGGATTCCCCAATTCGAATAATCGTTCCAACGCTCGAATGCGGCTCTACGACGATCTCCTTCTCAATGGGTATGATCCACCTCCTTTAATTTTTGGATGGTTATGATTTTCGTTTGATGAAATTTGGTTTTTGGTTCAGGTATTATACGACACGGCTTTGGATCGGTACTCCGCCGCAGAAATTCGCGCTTATTGTTGATACGGGAAGTACGGTTACTTATGTTCCTTGCTCAACTTGCGAACTTTGTGGGAAGCACCAGGTTAAAATGTTTAGAATTTGATGCTTGCCAGATCATTTATAAGGCCTTCATATTCATATCCATATTCAATCATTTGCTCTGTTTCCTTTATCTGCTTTTTATGATATTGATTTGGAAATTGGTAATTTATCAGTCTGGTAAGGAAATAGATCAAATGATATGGATCCTAGAATGATATATGTTTATACTGCACCAAAGGGTAAATGATAAGGTCTCCTTCGATGGACAAATTGTTTTTTTCTGGGAAGAAGTGATCTCGGAATGCTGTCTGACTCTGGACGCTAGAATGTTGTTTCCCTCCAATTTAGAGAAATGTGGATAACTAAAAGGCATTATCTATTTAATGAATGTGACAGAATTGTATTGTCGGTATTCTATATGGGTCTTAAAGCTTGTTCCTACGATGGTTTTATGGATAAAATTTGTGCAGTCATTTATTACATTTATATTAAAGCATTTCTAGTTTTTTCTTCATAGGAACCTTGCCAGCTGTTCTTTCTACTTAAACTTTTATTTCTATCAGGACCCAAAGTTTGACCCAGAATTGTCAAGCACTTACCAACCTGTCAAATGCAATTCTGATTGCACTTGTGACAATGACGGAGTGCAGTGTGTCTATGAGAGGCAGTATGCTGAAATGAGTACTAGCAGTGGTGTCCTTGGTGATGATGTTATATCCTTTGGAAATCAGAGTGCACTCGTACCCCAGCGTGCTGTGTTTGGTTGTGAGAATGAGGAAACTGGTGATCTTTACAGTCAACGTGCTGATGGAATTATGGGTTTGGGCAGTGGTGATCTTAGTATTGTCGACCAACTAGTTGAAAAAGGTGTGATTAATGATTCTTTCTCATTATGCTATGGTGGTATGGATATTGGTGGTGGTGCTATGGTTCTTGGTGGAATCTCTCCTCCATCAGAGATGATTTTTAGCTACTCAGACCCTGTGAGAAGGTGTGTACTAAAGTCGTTTAACCTTAAAAAAATTGTACTTCATGCTCTTTCATAACAATTACAATTGATACATCTCACACATTTACTTTAGTTTCGGGATGGCAACAGTCCATATTACAATGTTGATTTGAAGGAGATACATGTTGCGGGTAAAAAGTTGCCTCTCGAGCCAAGCGTTTTTGACGGAAGATATGGATCTGTCTTGGATAGTGGTACAACTTATTCTTACCTACCACAGGAAGCCTTTGGACCTTTCAAGAATGCTGTAAGCTTTTATTTCCCAGAATTTAACATTCTCAGTCAGGAGTTTCTTTAGTAATGTCTGAATTTGAAGATTGTTGCATGCTTTGCCAAGGACATAGTAAGACAGTCTGATTCTTACTTTTCCAGAGTTATCTTTCTTCTATTTCTGTTTCAGTCTGATAATTGTTTGCATCGTTTTTTTATTTTCTTTTGTAAACTTGTTCTAGTTGCTTTACATTTTAACGATTTGATATGAATCAATTTACATTGTTTCCTTGTCAATCTTAATAAATGAGAACATTTGTCATATTTGTGGAGGCTTTCAATTCAGTGTTAAGCCAAGTTCTTGTGTTGCCTGAAAAAACTTCGTCTATACAATTAAATGGAGAAAGGAAGAGAGAGAAACGAAAAACTTTTGGCTCCTTATTAGAAAGGATCGATCCCTGTAGTAGTCAAACCATGACCAGGCATATATCCTGTCTCTTTTCTCTTGTTTTCTCTAATAGTGTTCCCTTGATTACAAAATTACAAATTTGATCTTGAGTCGTTAGCAGTTTGAAGTCCTGTTTTTTGCATCTTTGTTTTAGAATGCATAATGAATTTAAAAAGAAAAAATCTTTCAGATTATGAATGCGCTTCATTCTTTGAAGAAGATTGGTGGTCCTGACCCAAATTTTAAAGATACATGTTTTTCTGGTGCTGGAAGGTATGGTTAGGTAATACGTTGTCAGTTTTGAAAGTGATTTACAACAATAAGTAATTATGATTCGTTACTTTATGTTAGTGATGCTGCTGAATTATCAAAAACATTTCCGACAGTTGACTTGATATTTGACAATGGCCAAAAGTTGTCTCTAGCACCAGAAAATTACTTGTTCCGGGTAAGGAACATCCCTTGATTACTAAAGATTTTTGCGTTACTTCTTTGTATCTGTGTCTGTTTACATGGGATTAAGTGTGGGGTTGGGCAGTATAAGCCTCCATAACAAAAATAAGGGCCGTCAATTGATCTTTGTATTCAATTGAAATAGTGGTCATAAAAGTTTGAGCTAACGGATTCTTCTTTACTGTGAATTTTCTGAATCATTATTTCAGCACTCAAAGGTACATGGTGCATATTGTCTGGGAATTTTCGAGAATGGAAATAATGATCAAACTACTCTTCTAGGAGGTACATTTTTTTCATGGCACAGACGATTTTCGCTCGTGAATGCTTATTCTATAGGAATTTGGAATTTCTGACTGTTCATTTTGTGAATTCGCCTTTTTTTTTTTTTTTTTTTGTAATTTAATCACATATCACATTTTAGCATGACTGTTTTCTGTTACAGGGATCATTGTCCGCAACACTTTAGTGATGTATGACAGAGAGCATTCAAAAATTGGATTTTGGAAAACTAATTGTTCCGAGTTATGGGAAAGGCTTCACATTTCGGATGAAAATGCTCATGCTCCTTCAGTTTCAAATACATCACACGATACTGATACGGCACCTGCATCAGCTCCAAGCGAATCACCACATGATATGATTCCGGGTATGGTTAATTGCAAGGTTTATTTTTGCTTTTATGCTAATACGGAAGCTTGACAGCATGTCAGTTGGTTCTTTAGTAATATTCTTTTCCTTATTGTCGATATGGTCGTTGAGCTTAATTGATTTGGTAAGTTAAAACAAGTTATACCATCCTTCATATCATGATGACAGTTGAGGTTCTACCATTTTAGTCTTCTTTGATATCATGCTAATGTATATGATATAATATGTGAATTTCAAACGATGATTGTTTTAACAACCCCTTTTGTAATTTCATACTATCGATGAAATTATTTCTTATAAAAAAAAAAAAAAAAAGAAGAGTATTTTAACAAATTTATTATCTGTTGCAAGCTCTACCTTAAAGGTCTGTTTGTTAAAGTGTTTTCTTTTGTGCTACGTGCAGAAGATATCCAGATTGGACGTATCACATTTGATATCTTGTTGAACATAAGCTACAAACATCTGGAGCCTCATATTACACACCTTTCCGATCATATTGCTCAAGAGTTAAATGTTAGTCATTCACAGGTTAGTACACTTGACACATTGGGGATTGAAATTTATTTTTCAGTATTTAACAATAATTAATGTAATAACAAATAGGTTTTATTATAAATGTATTCCATTGTTATGGCTATTTGATCCCTAAGATTTATAAGTCATTAAACATTGACGTTAGTTAGTTAAAAGTCATTAATTATGGTTCTAAATAGTTCCTATATTGAGTTAATTTTTCACTAAATATTGACATGCCAAATGAGCTAACACTGGTTCAGTGAGCATATCATGTCATAGCTTAGTTTATAACATAAGTGGGAGGTGAAATCTATTAAAGTTTAAGGAGAATAAATTGATGCAACGATCAAAAGTCACGGGCAAAACTTATAATTTACCCTGAGAGACATCTCTCTCAGCTTTTCTTACTAATTGAGTTGTGACTGTATCCAACTTTTTACTGCATATCATTTTTAACCATTATTTCAGGTCCGTTTATTGAACTTTACCATGAGAGGAAATCATTCACTTATTCAGTTGGCCATACTCCCTAATGGATCCTCAGAATTTTTCTCACATGCGACTGCTACTGTAAGTGAATATACCTGCAACTGTAAACTGCACAACAACCGAAATTCACAAGCAACCCTTTAAAATATTGTTTCTCTGGCAGACGATAATTTCCCTGATCGTCGAGCATCACATGAAGCTACCTCCTAGGTATGGAAGTTACCAGGTCATTCGATGGAATGTCGAACCTCTAATGGATAGGTAAGTATGTAAATAGGTTTAAATGTTAAATTATGAAGTTAGTCAATAAATCTTCTTTAGTTGTGTCTATTTAGTTTTTAAACTTCAAAAACTATGAATTATGTTCTTAAAATTTTGGTTTTTATTCTAAGACGTTCCAGTCATGACATAGTGATAGAACTAACGCACTCACGTACTTGAGGGATTAAGCTAACTAGAAAAAAATAAGTGGACTAGATTTTGCTTGATTGTTGAGCCTTCTTGTGGGTCAAATGTCACGAGCCTTCTTTGCATCATTTACTAAAATATCAACCTGATTGTTACCTATTTAAATTCAAGGAAGTGATCTAAGTGTGATAAAATTTAAACTGAATTTTTTTACATTTTGTTTTATCAACTAGCCTACTCCCTCATGCGCCACCCGTAATCACATCACGTCAAATATAGTCCAGTTGAAACTTTTAAAAATTTATGGATTAAATAGACATATTGTCACGTCTTAAGGACTAAATGGTAATGTAGTCATACAAATGTTATGTTGTTACTACATTCCTAGTTAGTTAGAGTCTTGTATTCATGGATGGCTAAATGTTGCATAGCATTAATGTAATCCAATCTGTTTTATATTGTTTAGGTCATTGTGGAAGCGACTTTATGTTTTGGTGGGTTTAGCCATTATGGTCACGCTTATTCTTGGGTTGTCAGCAGTGGGAGTGTGGTTTATTTGGAGGAGGAGACAGCAAGCATTCCATTCATATAAGCCTGTCAATGCAGCAGCTCCAGAGCAGGAACTCCAGACCCTGTAGTAAGAGTCCCGCCAACAAATTTTTGTTTTGTTGTCACTGTTTGTATCATTTGCTTTGAATTTTTATAGGTTCAATGTTCAGTGGGAGACATTGAAAGAATATTATTATTTTCCACATTTTGTATTCATTTTATGCTCAAAGGTTCTAACCTTAAACAAATCGGAAGTTCCCTTTTTTCAC

mRNA sequence

CCTCGCTTTCTTTAAAATGAATCATCCTTCACCCAACTTTAACTGCAATTGTTTATTTTGAATCAATCAATCAGCTCTCACCTTGATTTCGCATCCCTTCAATCCATTCTCCTTCTTCCCTCTTCGATCTGCCTCCTTCCGTTGACCTTTCCCATATCCAATCGCCAATGGCACGAACACCCAATCTACTCCTCGCTGTTCTCCTCCATTTCTTGCACCTCACGCATTTCACACTCTCCGCCGATCCCATCTCCTCCAATCCTCTCCTTACGCCTTCGCATCGCGCAATGGTGTTGCCTCTTTATCGCTCTTCTCCCAATTCCTCCAAATTGATCTCCAAGCCTCACCGTCGTCTCCGCGGATTCCCCAATTCGAATAATCGTTCCAACGCTCGAATGCGGCTCTACGACGATCTCCTTCTCAATGGGTATTATACGACACGGCTTTGGATCGGTACTCCGCCGCAGAAATTCGCGCTTATTGTTGATACGGGAAGTACGGTTACTTATGTTCCTTGCTCAACTTGCGAACTTTGTGGGAAGCACCAGGACCCAAAGTTTGACCCAGAATTGTCAAGCACTTACCAACCTGTCAAATGCAATTCTGATTGCACTTGTGACAATGACGGAGTGCAGTGTGTCTATGAGAGGCAGTATGCTGAAATGAGTACTAGCAGTGGTGTCCTTGGTGATGATGTTATATCCTTTGGAAATCAGAGTGCACTCGTACCCCAGCGTGCTGTGTTTGGTTGTGAGAATGAGGAAACTGGTGATCTTTACAGTCAACGTGCTGATGGAATTATGGGTTTGGGCAGTGGTGATCTTAGTATTGTCGACCAACTAGTTGAAAAAGGTGTGATTAATGATTCTTTCTCATTATGCTATGGTGGTATGGATATTGGTGGTGGTGCTATGGTTCTTGGTGGAATCTCTCCTCCATCAGAGATGATTTTTAGCTACTCAGACCCTGTGAGAAGTCCATATTACAATGTTGATTTGAAGGAGATACATGTTGCGGGTAAAAAGTTGCCTCTCGAGCCAAGCGTTTTTGACGGAAGATATGGATCTGTCTTGGATAGTGGTACAACTTATTCTTACCTACCACAGGAAGCCTTTGGACCTTTCAAGAATGCTATTATGAATGCGCTTCATTCTTTGAAGAAGATTGGTGGTCCTGACCCAAATTTTAAAGATACATGTTTTTCTGGTGCTGGAAGTGATGCTGCTGAATTATCAAAAACATTTCCGACAGTTGACTTGATATTTGACAATGGCCAAAAGTTGTCTCTAGCACCAGAAAATTACTTGTTCCGGCACTCAAAGGTACATGGTGCATATTGTCTGGGAATTTTCGAGAATGGAAATAATGATCAAACTACTCTTCTAGGAGGGATCATTGTCCGCAACACTTTAGTGATGTATGACAGAGAGCATTCAAAAATTGGATTTTGGAAAACTAATTGTTCCGAGTTATGGGAAAGGCTTCACATTTCGGATGAAAATGCTCATGCTCCTTCAGTTTCAAATACATCACACGATACTGATACGGCACCTGCATCAGCTCCAAGCGAATCACCACATGATATGATTCCGGAAGATATCCAGATTGGACGTATCACATTTGATATCTTGTTGAACATAAGCTACAAACATCTGGAGCCTCATATTACACACCTTTCCGATCATATTGCTCAAGAGTTAAATGTTAGTCATTCACAGGTCCGTTTATTGAACTTTACCATGAGAGGAAATCATTCACTTATTCAGTTGGCCATACTCCCTAATGGATCCTCAGAATTTTTCTCACATGCGACTGCTACTACGATAATTTCCCTGATCGTCGAGCATCACATGAAGCTACCTCCTAGGTATGGAAGTTACCAGGTCATTCGATGGAATGTCGAACCTCTAATGGATAGGTCATTGTGGAAGCGACTTTATGTTTTGGTGGGTTTAGCCATTATGGTCACGCTTATTCTTGGGTTGTCAGCAGTGGGAGTGTGGTTTATTTGGAGGAGGAGACAGCAAGCATTCCATTCATATAAGCCTGTCAATGCAGCAGCTCCAGAGCAGGAACTCCAGACCCTGTAGTAAGAGTCCCGCCAACAAATTTTTGTTTTGTTGTCACTGTTTGTATCATTTGCTTTGAATTTTTATAGGTTCAATGTTCAGTGGGAGACATTGAAAGAATATTATTATTTTCCACATTTTGTATTCATTTTATGCTCAAAGGTTCTAACCTTAAACAAATCGGAAGTTCCCTTTTTTCAC

Coding sequence (CDS)

ATGGCACGAACACCCAATCTACTCCTCGCTGTTCTCCTCCATTTCTTGCACCTCACGCATTTCACACTCTCCGCCGATCCCATCTCCTCCAATCCTCTCCTTACGCCTTCGCATCGCGCAATGGTGTTGCCTCTTTATCGCTCTTCTCCCAATTCCTCCAAATTGATCTCCAAGCCTCACCGTCGTCTCCGCGGATTCCCCAATTCGAATAATCGTTCCAACGCTCGAATGCGGCTCTACGACGATCTCCTTCTCAATGGGTATTATACGACACGGCTTTGGATCGGTACTCCGCCGCAGAAATTCGCGCTTATTGTTGATACGGGAAGTACGGTTACTTATGTTCCTTGCTCAACTTGCGAACTTTGTGGGAAGCACCAGGACCCAAAGTTTGACCCAGAATTGTCAAGCACTTACCAACCTGTCAAATGCAATTCTGATTGCACTTGTGACAATGACGGAGTGCAGTGTGTCTATGAGAGGCAGTATGCTGAAATGAGTACTAGCAGTGGTGTCCTTGGTGATGATGTTATATCCTTTGGAAATCAGAGTGCACTCGTACCCCAGCGTGCTGTGTTTGGTTGTGAGAATGAGGAAACTGGTGATCTTTACAGTCAACGTGCTGATGGAATTATGGGTTTGGGCAGTGGTGATCTTAGTATTGTCGACCAACTAGTTGAAAAAGGTGTGATTAATGATTCTTTCTCATTATGCTATGGTGGTATGGATATTGGTGGTGGTGCTATGGTTCTTGGTGGAATCTCTCCTCCATCAGAGATGATTTTTAGCTACTCAGACCCTGTGAGAAGTCCATATTACAATGTTGATTTGAAGGAGATACATGTTGCGGGTAAAAAGTTGCCTCTCGAGCCAAGCGTTTTTGACGGAAGATATGGATCTGTCTTGGATAGTGGTACAACTTATTCTTACCTACCACAGGAAGCCTTTGGACCTTTCAAGAATGCTATTATGAATGCGCTTCATTCTTTGAAGAAGATTGGTGGTCCTGACCCAAATTTTAAAGATACATGTTTTTCTGGTGCTGGAAGTGATGCTGCTGAATTATCAAAAACATTTCCGACAGTTGACTTGATATTTGACAATGGCCAAAAGTTGTCTCTAGCACCAGAAAATTACTTGTTCCGGCACTCAAAGGTACATGGTGCATATTGTCTGGGAATTTTCGAGAATGGAAATAATGATCAAACTACTCTTCTAGGAGGGATCATTGTCCGCAACACTTTAGTGATGTATGACAGAGAGCATTCAAAAATTGGATTTTGGAAAACTAATTGTTCCGAGTTATGGGAAAGGCTTCACATTTCGGATGAAAATGCTCATGCTCCTTCAGTTTCAAATACATCACACGATACTGATACGGCACCTGCATCAGCTCCAAGCGAATCACCACATGATATGATTCCGGAAGATATCCAGATTGGACGTATCACATTTGATATCTTGTTGAACATAAGCTACAAACATCTGGAGCCTCATATTACACACCTTTCCGATCATATTGCTCAAGAGTTAAATGTTAGTCATTCACAGGTCCGTTTATTGAACTTTACCATGAGAGGAAATCATTCACTTATTCAGTTGGCCATACTCCCTAATGGATCCTCAGAATTTTTCTCACATGCGACTGCTACTACGATAATTTCCCTGATCGTCGAGCATCACATGAAGCTACCTCCTAGGTATGGAAGTTACCAGGTCATTCGATGGAATGTCGAACCTCTAATGGATAGGTCATTGTGGAAGCGACTTTATGTTTTGGTGGGTTTAGCCATTATGGTCACGCTTATTCTTGGGTTGTCAGCAGTGGGAGTGTGGTTTATTTGGAGGAGGAGACAGCAAGCATTCCATTCATATAAGCCTGTCAATGCAGCAGCTCCAGAGCAGGAACTCCAGACCCTGTAG

Protein sequence

MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPHRRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQIGRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNGSSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMVTLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL
Homology
BLAST of CmaCh20G009810 vs. ExPASy Swiss-Prot
Match: Q4V3D2 (Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1)

HSP 1 Score: 167.2 bits (422), Expect = 6.2e-40
Identity = 122/372 (32.80%), Postives = 184/372 (49.46%), Query Frame = 0

Query: 87  GYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQD-----PKFDPELSSTYQP 146
           G Y T++ +G+PP+++ + VDTGS + +V C+ C  C    D       +D + SST + 
Sbjct: 76  GLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKN 135

Query: 147 VKCNSD-CT----CDNDGVQ--CVYERQYAEMSTSSGVLGDDVISF----GN-QSALVPQ 206
           V C  D C+     +  G +  C Y   Y + STS G    D I+     GN ++A + Q
Sbjct: 136 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 195

Query: 207 RAVFGCENEETGDL--YSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGG 266
             VFGC   ++G L       DGIMG G  + SI+ QL   G     FS C   M+ GGG
Sbjct: 196 EVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMN-GGG 255

Query: 267 AMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSV--FDGRYGSVLDSG 326
              +G +  P  ++ +        +YNV LK + V G  + L PS+   +G  G+++DSG
Sbjct: 256 IFAVGEVESP--VVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSG 315

Query: 327 TTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFPTVDLI 386
           TT +YLPQ  +    N+++  + + +++          CF    S  +   K FP V+L 
Sbjct: 316 TTLAYLPQNLY----NSLIEKITAKQQVKLHMVQETFACF----SFTSNTDKAFPVVNLH 375

Query: 387 FDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQ----TTLLGGIIVRNTLVMYDRE 434
           F++  KLS+ P +YLF  S     YC G    G   Q      LLG +++ N LV+YD E
Sbjct: 376 FEDSLKLSVYPHDYLF--SLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLE 434

BLAST of CmaCh20G009810 vs. ExPASy Swiss-Prot
Match: Q9S9K4 (Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2)

HSP 1 Score: 149.1 bits (375), Expect = 1.8e-34
Identity = 116/404 (28.71%), Postives = 188/404 (46.53%), Query Frame = 0

Query: 61  RRLRGFPNSNNRSNARMRLYDDLLLN--------GYYTTRLWIGTPPQKFALIVDTGSTV 120
           + L  F + + R ++RM    DL L         G Y T++ +G+PP+++ + VDTGS +
Sbjct: 38  KNLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDI 97

Query: 121 TYVPCSTCELCGKHQD-----PKFDPELSSTYQPVKCNSD-CT--CDNDGVQ----CVYE 180
            ++ C  C  C    +       FD   SST + V C+ D C+    +D  Q    C Y 
Sbjct: 98  LWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYH 157

Query: 181 RQYAEMSTSSGVLGDDVISFGN-----QSALVPQRAVFGCENEETGDLYS--QRADGIMG 240
             YA+ STS G    D+++        ++  + Q  VFGC ++++G L +     DG+MG
Sbjct: 158 IVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMG 217

Query: 241 LGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYY 300
            G  + S++ QL   G     FS C   +  GGG   +G +  P   + +        +Y
Sbjct: 218 FGQSNTSVLSQLAATGDAKRVFSHCLDNVK-GGGIFAVGVVDSPK--VKTTPMVPNQMHY 277

Query: 301 NVDLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKI 360
           NV L  + V G  L L  S+     G+++DSGTT +Y P+  +      I+       K+
Sbjct: 278 NVMLMGMDVDGTSLDLPRSIVRNG-GTIVDSGTTLAYFPKVLYDSLIETIL--ARQPVKL 337

Query: 361 GGPDPNFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLG 420
              +  F+  CF    S +  + + FP V   F++  KL++ P +YLF  +     YC G
Sbjct: 338 HIVEETFQ--CF----SFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLF--TLEEELYCFG 397

Query: 421 IFENG----NNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCS 434
               G       +  LLG +++ N LV+YD ++  IG+   NCS
Sbjct: 398 WQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCS 427

BLAST of CmaCh20G009810 vs. ExPASy Swiss-Prot
Match: Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 147.9 bits (372), Expect = 3.9e-34
Identity = 107/361 (29.64%), Postives = 165/361 (45.71%), Query Frame = 0

Query: 86  NGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSSTYQPVKCN 145
           +G Y  R+ +G+PP+   +++D+GS + +V C  C+LC K  DP FDP  S +Y  V C 
Sbjct: 128 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCG 187

Query: 146 SDCTCD---NDGVQ---CVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCENEE 205
           S   CD   N G     C YE  Y + S + G L  + ++F   +  V +    GC +  
Sbjct: 188 SS-VCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTF---AKTVVRNVAMGCGHRN 247

Query: 206 TGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCY--GGMDIGGGAMVLGGISPP 265
            G      A G++G+G G +S V QL   G    +F  C    G D   G++V G  + P
Sbjct: 248 RGMFIG--AAGLLGIGGGSMSFVGQL--SGQTGGAFGYCLVSRGTD-STGSLVFGREALP 307

Query: 266 --SEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFD----GRYGSVLDSGTTYSYL 325
             +  +    +P    +Y V LK + V G ++PL   VFD    G  G V+D+GT  + L
Sbjct: 308 VGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRL 367

Query: 326 PQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQK 385
           P  A+  F++   +   +L +  G   +  DTC+  +G     +S   PTV   F  G  
Sbjct: 368 PTAAYVAFRDGFKSQTANLPRASG--VSIFDTCYDLSGF----VSVRVPTVSFYFTEGPV 427

Query: 386 LSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTN 433
           L+L   N+L       G YC     +      +++G I      V +D  +  +GF    
Sbjct: 428 LTLPARNFLMPVDD-SGTYCFAFAASPTG--LSIIGNIQQEGIQVSFDGANGFVGFGPNV 470

BLAST of CmaCh20G009810 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 147.5 bits (371), Expect = 5.1e-34
Identity = 106/365 (29.04%), Postives = 175/365 (47.95%), Query Frame = 0

Query: 86  NGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSSTYQPVKCN 145
           +G Y +R+ +GTP ++  L++DTGS V ++ C  C  C +  DP F+P  SSTY+ + C+
Sbjct: 159 SGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCS 218

Query: 146 S-DCT------CDNDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCENE 205
           +  C+      C ++  +C+Y+  Y + S + G L  D ++FGN   +       GC ++
Sbjct: 219 APQCSLLETSACRSN--KCLYQVSYGDGSFTVGELATDTVTFGNSGKI--NNVALGCGHD 278

Query: 206 ETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMV------LG 265
             G L++  A G++GLG G LSI +Q+        SFS C    D G  + +      LG
Sbjct: 279 NEG-LFTGAA-GLLGLGGGVLSITNQMKA-----TSFSYCLVDRDSGKSSSLDFNSVQLG 338

Query: 266 GISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFD----GRYGSVLDSGTTY 325
           G    + ++    +     +Y V L    V G+K+ L  ++FD    G  G +LD GT  
Sbjct: 339 GGDATAPLL---RNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAV 398

Query: 326 SYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKT-FPTVDLIFD 385
           + L  +A+   ++A +    +LKK G    +  DTC+     D + LS    PTV   F 
Sbjct: 399 TRLQTQAYNSLRDAFLKLTVNLKK-GSSSISLFDTCY-----DFSSLSTVKVPTVAFHFT 458

Query: 386 NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGF 433
            G+ L L  +NYL       G +C       ++   +++G +  + T + YD   + IG 
Sbjct: 459 GGKSLDLPAKNYLIPVDD-SGTFCFAFAPTSSS--LSIIGNVQQQGTRITYDLSKNVIGL 500

BLAST of CmaCh20G009810 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 142.5 bits (358), Expect = 1.6e-32
Identity = 111/369 (30.08%), Postives = 169/369 (45.80%), Query Frame = 0

Query: 86  NGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSSTYQPVKCN 145
           +G Y   L IGTP Q F+ I+DTGS + +  C  C  C     P F+P+ SS++  + C+
Sbjct: 92  DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCS 151

Query: 146 SDC-------TCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCENE 205
           S         TC N+   C Y   Y + S + G +G + ++FG+ S  +P    FGC   
Sbjct: 152 SQLCQALSSPTCSNN--FCQYTYGYGDGSETQGSMGTETLTFGSVS--IP-NITFGCGEN 211

Query: 206 ETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGA---MVLGGI- 265
             G      A G++G+G G LS+  QL       D     Y    IG      ++LG + 
Sbjct: 212 NQGFGQGNGA-GLVGMGRGPLSLPSQL-------DVTKFSYCMTPIGSSTPSNLLLGSLA 271

Query: 266 ------SPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVF-----DGRYGSVLD 325
                 SP + +I S   P    +Y + L  + V   +LP++PS F     +G  G ++D
Sbjct: 272 NSVTAGSPNTTLIQSSQIPT---FYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIID 331

Query: 326 SGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFPTVD 385
           SGTT +Y    A+   +   ++ + +L  + G    F D CF    SD + L    PT  
Sbjct: 332 SGTTLTYFVNNAYQSVRQEFISQI-NLPVVNGSSSGF-DLCFQ-TPSDPSNLQ--IPTFV 391

Query: 386 LIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHS 433
           + FD G  L L  ENY    S  +G  CL +    ++   ++ G I  +N LV+YD  +S
Sbjct: 392 MHFDGGD-LELPSENYFI--SPSNGLICLAM--GSSSQGMSIFGNIQQQNMLVVYDTGNS 434

BLAST of CmaCh20G009810 vs. ExPASy TrEMBL
Match: A0A6J1JE38 (aspartic proteinase nepenthesin-1-like OS=Cucurbita maxima OX=3661 GN=LOC111483616 PE=3 SV=1)

HSP 1 Score: 1306.2 bits (3379), Expect = 0.0e+00
Identity = 640/640 (100.00%), Postives = 640/640 (100.00%), Query Frame = 0

Query: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
           MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH
Sbjct: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60

Query: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
           RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC
Sbjct: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120

Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
           ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF
Sbjct: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180

Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
           GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG
Sbjct: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240

Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
           GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS
Sbjct: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300

Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
           VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP
Sbjct: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360

Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
           TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR
Sbjct: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420

Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480
           EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI
Sbjct: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480

Query: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540
           GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG
Sbjct: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540

Query: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600
           SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV
Sbjct: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600

Query: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
           TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL
Sbjct: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 640

BLAST of CmaCh20G009810 vs. ExPASy TrEMBL
Match: A0A6J1EYA5 (aspartic proteinase nepenthesin-1-like OS=Cucurbita moschata OX=3662 GN=LOC111439463 PE=3 SV=1)

HSP 1 Score: 1279.2 bits (3309), Expect = 0.0e+00
Identity = 625/640 (97.66%), Postives = 633/640 (98.91%), Query Frame = 0

Query: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
           MARTPNLLLA+LLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH
Sbjct: 1   MARTPNLLLALLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60

Query: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
           RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC
Sbjct: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120

Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
           ELCGKHQDPKFDPELSSTYQPVKCNSDCTCD DGVQCVYERQYAEMSTSSGVLGDDVISF
Sbjct: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDGDGVQCVYERQYAEMSTSSGVLGDDVISF 180

Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
           GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG
Sbjct: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240

Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
           GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS
Sbjct: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300

Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
           VLDSGTTYSYLPQEAFGPFKNAI+NALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP
Sbjct: 301 VLDSGTTYSYLPQEAFGPFKNAILNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360

Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
           TVDL+FDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR
Sbjct: 361 TVDLVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420

Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480
           EHSKIGFWKTNCSELWERLHISD+NA APSVSNTSHDTD APASAPSESPHDMIPED+QI
Sbjct: 421 EHSKIGFWKTNCSELWERLHISDDNADAPSVSNTSHDTDMAPASAPSESPHDMIPEDLQI 480

Query: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540
           GRITFDILLNISYKHLEPHIT LSDHIA ELNVSHSQVRLLNFTMRGNHSLIQLAILPNG
Sbjct: 481 GRITFDILLNISYKHLEPHITQLSDHIAHELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540

Query: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600
           SSEFFS ATATTIISLIV HHMKLPP+YGSYQVIRWNVEPLMDRSLWKRLY+LVGLAIMV
Sbjct: 541 SSEFFSPATATTIISLIVGHHMKLPPKYGSYQVIRWNVEPLMDRSLWKRLYILVGLAIMV 600

Query: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
           TLILGLSA+GVWFIWRRRQQAFHSYKPVNAAAPEQELQTL
Sbjct: 601 TLILGLSAMGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 640

BLAST of CmaCh20G009810 vs. ExPASy TrEMBL
Match: A0A5A7TUH1 (Aspartic proteinase CDR1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold243G003080 PE=3 SV=1)

HSP 1 Score: 1084.7 bits (2804), Expect = 0.0e+00
Identity = 531/640 (82.97%), Postives = 573/640 (89.53%), Query Frame = 0

Query: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
           MA++P LLL  +     L HF LSADPIS NPL+TPSHRAMVLPLY SS NSSK IS PH
Sbjct: 1   MAKSPFLLLPAI-----LLHFFLSADPISPNPLITPSHRAMVLPLYLSSSNSSKFISNPH 60

Query: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
           R LR FP S+NRSNARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPCSTC
Sbjct: 61  RHLRQFPTSDNRSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTC 120

Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
           E CG+HQDPKFDPE SSTY+P+KCN DCTCD+DGVQCVYERQYAEMSTSSGVLG+DVISF
Sbjct: 121 EQCGRHQDPKFDPESSSTYKPIKCNIDCTCDSDGVQCVYERQYAEMSTSSGVLGEDVISF 180

Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
           GNQS L+PQRAVFGCEN ETGDL+SQRADGIMGLG+GDLS+VDQLVEKG INDSFSLCYG
Sbjct: 181 GNQSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYG 240

Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
           GMDIGGGAMVLGGISPPS+MIF+YSDPVRSPYYNVDLKEIHVAGKKLPL  S+FDGRYG+
Sbjct: 241 GMDIGGGAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSSIFDGRYGT 300

Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
           VLDSGTTY+YLP EAFG FK+AIM+ LHSLKKI GPDPNFKD CFSGAGSDAAELS  FP
Sbjct: 301 VLDSGTTYAYLPAEAFGAFKDAIMDELHSLKKIDGPDPNFKDICFSGAGSDAAELSNIFP 360

Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
           TVD++F+NGQKLSLAPENY FRHSKVHGAYCLGIFENG NDQTTLLGGI+VRNTLVMYDR
Sbjct: 361 TVDMVFENGQKLSLAPENYFFRHSKVHGAYCLGIFENG-NDQTTLLGGIVVRNTLVMYDR 420

Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480
            HSKIGFWKTNCSELWERL  SD+NAHAPS+S  SH +D APASAP ESPH  IP ++QI
Sbjct: 421 AHSKIGFWKTNCSELWERLRTSDDNAHAPSISTKSHGSDMAPASAPIESPHYTIPGELQI 480

Query: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540
           GRITF+ILLN SY  LEPHIT LSDHIAQELNVSHSQV LLNFTMRGN SLI+LAI+P G
Sbjct: 481 GRITFEILLNKSYTDLEPHITELSDHIAQELNVSHSQVLLLNFTMRGNDSLIKLAIIPYG 540

Query: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600
           SSE FSHAT  TIIS IVEHHM+LPP +GSYQV+RWNVEP M+RS+WKRLYVLVGLAI+V
Sbjct: 541 SSEIFSHATVNTIISKIVEHHMQLPPTFGSYQVVRWNVEPPMERSMWKRLYVLVGLAIIV 600

Query: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
             ILGLSA+G WFI R RQQA +SYKPVNAA PEQELQ L
Sbjct: 601 IFILGLSALGAWFILRSRQQAINSYKPVNAAVPEQELQPL 634

BLAST of CmaCh20G009810 vs. ExPASy TrEMBL
Match: A0A1S3C7L5 (aspartic proteinase CDR1-like OS=Cucumis melo OX=3656 GN=LOC103497389 PE=3 SV=1)

HSP 1 Score: 1084.7 bits (2804), Expect = 0.0e+00
Identity = 531/640 (82.97%), Postives = 573/640 (89.53%), Query Frame = 0

Query: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
           MA++P LLL  +     L HF LSADPIS NPL+TPSHRAMVLPLY SS NSSK IS PH
Sbjct: 1   MAKSPFLLLPAI-----LLHFFLSADPISPNPLITPSHRAMVLPLYLSSSNSSKFISNPH 60

Query: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
           R LR FP S+NRSNARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPCSTC
Sbjct: 61  RHLRQFPTSDNRSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTC 120

Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
           E CG+HQDPKFDPE SSTY+P+KCN DCTCD+DGVQCVYERQYAEMSTSSGVLG+DVISF
Sbjct: 121 EQCGRHQDPKFDPESSSTYKPIKCNIDCTCDSDGVQCVYERQYAEMSTSSGVLGEDVISF 180

Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
           GNQS L+PQRAVFGCEN ETGDL+SQRADGIMGLG+GDLS+VDQLVEKG INDSFSLCYG
Sbjct: 181 GNQSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYG 240

Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
           GMDIGGGAMVLGGISPPS+MIF+YSDPVRSPYYNVDLKEIHVAGKKLPL  S+FDGRYG+
Sbjct: 241 GMDIGGGAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSSIFDGRYGT 300

Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
           VLDSGTTY+YLP EAFG FK+AIM+ LHSLKKI GPDPNFKD CFSGAGSDAAELS  FP
Sbjct: 301 VLDSGTTYAYLPAEAFGAFKDAIMDELHSLKKIDGPDPNFKDICFSGAGSDAAELSNIFP 360

Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
           TVD++F+NGQKLSLAPENY FRHSKVHGAYCLGIFENG NDQTTLLGGI+VRNTLVMYDR
Sbjct: 361 TVDMVFENGQKLSLAPENYFFRHSKVHGAYCLGIFENG-NDQTTLLGGIVVRNTLVMYDR 420

Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480
            HSKIGFWKTNCSELWERL  SD+NAHAPS+S  SH +D APASAP ESPH  IP ++QI
Sbjct: 421 AHSKIGFWKTNCSELWERLRTSDDNAHAPSISTKSHGSDMAPASAPIESPHYTIPGELQI 480

Query: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540
           GRITF+ILLN SY  LEPHIT LSDHIAQELNVSHSQV LLNFTMRGN SLI+LAI+P G
Sbjct: 481 GRITFEILLNKSYTDLEPHITELSDHIAQELNVSHSQVLLLNFTMRGNDSLIKLAIIPYG 540

Query: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600
           SSE FSHAT  TIIS IVEHHM+LPP +GSYQV+RWNVEP M+RS+WKRLYVLVGLAI+V
Sbjct: 541 SSEIFSHATVNTIISKIVEHHMQLPPTFGSYQVVRWNVEPPMERSMWKRLYVLVGLAIIV 600

Query: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
             ILGLSA+G WFI R RQQA +SYKPVNAA PEQELQ L
Sbjct: 601 IFILGLSALGAWFILRSRQQAINSYKPVNAAVPEQELQPL 634

BLAST of CmaCh20G009810 vs. ExPASy TrEMBL
Match: A0A0A0LJB9 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G277070 PE=3 SV=1)

HSP 1 Score: 1073.5 bits (2775), Expect = 2.4e-310
Identity = 526/640 (82.19%), Postives = 569/640 (88.91%), Query Frame = 0

Query: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
           MA++P L+ A+LLH        LSADPIS NPLL+PSHRAMVLPLY SSPNSSK IS PH
Sbjct: 1   MAKSPFLVAAILLHIF------LSADPISPNPLLSPSHRAMVLPLYLSSPNSSKFISNPH 60

Query: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
           RRLR FP S+N SNARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPCSTC
Sbjct: 61  RRLRQFPTSDNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTC 120

Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
           E CG+HQDPKFDPE SSTY+P+KCN DC CD+DGVQCVYERQYAEMSTSSGVLG+DVISF
Sbjct: 121 EQCGRHQDPKFDPESSSTYKPIKCNIDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISF 180

Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
           GNQS L+PQRAVFGCEN ETGDL+SQRADGIMGLG+GDLS+VDQLVEKG INDSFSLCYG
Sbjct: 181 GNQSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYG 240

Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
           GMDIGGGAMVLGGISPPS+MIF+YSDPVRSPYYNVDLKEIHVAGKKLPL   +FDGRYG+
Sbjct: 241 GMDIGGGAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGA 300

Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
           VLDSGTTY+YLP EAF  FK+AIM+ +HSLKKI GPDPNFKD CFSGAGSDAAELS  FP
Sbjct: 301 VLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFP 360

Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
           TVD++F+NGQKLSL PENY FRHSKVHGAYCLGIFENG NDQTTLLGGI+VRNTLVMYDR
Sbjct: 361 TVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENG-NDQTTLLGGIVVRNTLVMYDR 420

Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480
            +SKIGFWKTNCSELWERL ISD+NA  PSVS  SHD+D APASAPSE PH  IP ++QI
Sbjct: 421 ANSKIGFWKTNCSELWERLRISDDNADGPSVSTKSHDSDIAPASAPSERPHYTIPGELQI 480

Query: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540
           GRITF ILLN SY  LEPHIT LSDHIAQELNVSHSQV +LNFTMRGN SLIQLAILP G
Sbjct: 481 GRITFAILLNKSYTDLEPHITELSDHIAQELNVSHSQVIILNFTMRGNDSLIQLAILPYG 540

Query: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600
           SSE FSHATA TIIS IVEHHM+LPP +GSYQV+RWNVEP M+RS+WKRLYVLVGL I+V
Sbjct: 541 SSEIFSHATANTIISKIVEHHMQLPPTFGSYQVVRWNVEPPMERSMWKRLYVLVGLVIVV 600

Query: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
             ILGLSA+G WF+ R RQQA +SYKPVNAA PEQELQ L
Sbjct: 601 IFILGLSALGAWFVLRSRQQAINSYKPVNAAVPEQELQPL 633

BLAST of CmaCh20G009810 vs. NCBI nr
Match: XP_022985603.1 (aspartic proteinase nepenthesin-1-like [Cucurbita maxima])

HSP 1 Score: 1306.2 bits (3379), Expect = 0.0e+00
Identity = 640/640 (100.00%), Postives = 640/640 (100.00%), Query Frame = 0

Query: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
           MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH
Sbjct: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60

Query: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
           RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC
Sbjct: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120

Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
           ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF
Sbjct: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180

Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
           GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG
Sbjct: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240

Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
           GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS
Sbjct: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300

Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
           VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP
Sbjct: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360

Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
           TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR
Sbjct: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420

Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480
           EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI
Sbjct: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480

Query: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540
           GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG
Sbjct: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540

Query: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600
           SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV
Sbjct: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600

Query: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
           TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL
Sbjct: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 640

BLAST of CmaCh20G009810 vs. NCBI nr
Match: XP_023512095.1 (aspartic proteinase nepenthesin-1-like [Cucurbita pepo subsp. pepo] >XP_023522201.1 aspartic proteinase nepenthesin-1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1282.7 bits (3318), Expect = 0.0e+00
Identity = 626/640 (97.81%), Postives = 635/640 (99.22%), Query Frame = 0

Query: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
           MARTPNLLLA+LLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH
Sbjct: 1   MARTPNLLLALLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60

Query: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
           RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC
Sbjct: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120

Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
           ELCGKHQDPKFDPELSSTYQPVKCNSDCTCD DGVQCVYERQYAEMSTSSGVLGDDVISF
Sbjct: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDGDGVQCVYERQYAEMSTSSGVLGDDVISF 180

Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
           GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG
Sbjct: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240

Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
           GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS
Sbjct: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300

Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
           VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP
Sbjct: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360

Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
           TVDL+FDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR
Sbjct: 361 TVDLVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420

Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480
           EHSKIGFWKTNCSELWERLHISD+NAHAPSVSNTSHDTD APASAPSESP+DMIPED+QI
Sbjct: 421 EHSKIGFWKTNCSELWERLHISDDNAHAPSVSNTSHDTDMAPASAPSESPYDMIPEDLQI 480

Query: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540
           GRITFDILLNISYKHLEPHIT LSDHIA ELNVSHSQVRLLNFTMRGNHSLIQLAILPNG
Sbjct: 481 GRITFDILLNISYKHLEPHITQLSDHIAHELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540

Query: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600
           SSEFFS ATATTIISLIVEHHMKLPP+YGSY+VIRWNVEPLMDRSLWKRLY+LVGLAIMV
Sbjct: 541 SSEFFSPATATTIISLIVEHHMKLPPKYGSYRVIRWNVEPLMDRSLWKRLYILVGLAIMV 600

Query: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
           TLILGLSA+GVWFIWRRRQQAFHSYKPVNAAAPEQELQTL
Sbjct: 601 TLILGLSAMGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 640

BLAST of CmaCh20G009810 vs. NCBI nr
Match: KAG6571239.1 (Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1281.9 bits (3316), Expect = 0.0e+00
Identity = 626/640 (97.81%), Postives = 634/640 (99.06%), Query Frame = 0

Query: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
           MARTPNLLLA+LLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH
Sbjct: 1   MARTPNLLLALLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60

Query: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
           RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC
Sbjct: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120

Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
           ELCGKHQDPKFDPELSSTYQPVKCNSDCTCD DGVQCVYERQYAEMSTSSGVLGDDVISF
Sbjct: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDGDGVQCVYERQYAEMSTSSGVLGDDVISF 180

Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
           GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG
Sbjct: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240

Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
           GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS
Sbjct: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300

Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
           VLDSGTTYSYLPQEAFGPFKNAI+NALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP
Sbjct: 301 VLDSGTTYSYLPQEAFGPFKNAILNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360

Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
           TVDL+FDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR
Sbjct: 361 TVDLVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420

Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480
           EHSKIGFWKTNCSELWERLHISD+NA APSVSNTSHDTD APASAPSESPHDMIPED+QI
Sbjct: 421 EHSKIGFWKTNCSELWERLHISDDNADAPSVSNTSHDTDMAPASAPSESPHDMIPEDLQI 480

Query: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540
           GRITFDILLNISYKHLEPHIT LSDHIA ELNVSHSQVRLLNFTMRGNHSLIQLAILPNG
Sbjct: 481 GRITFDILLNISYKHLEPHITQLSDHIAHELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540

Query: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600
           SSEFFS ATATTIISLIVEHHMKLPP+YGSYQVIRWNVEPLMDRSLWKRLY+LVGLAIMV
Sbjct: 541 SSEFFSPATATTIISLIVEHHMKLPPKYGSYQVIRWNVEPLMDRSLWKRLYILVGLAIMV 600

Query: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
           TLILGLSA+GVWFIWRRRQQAFHSYKPVNAAAPEQELQTL
Sbjct: 601 TLILGLSAMGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 640

BLAST of CmaCh20G009810 vs. NCBI nr
Match: XP_022932899.1 (aspartic proteinase nepenthesin-1-like [Cucurbita moschata])

HSP 1 Score: 1279.2 bits (3309), Expect = 0.0e+00
Identity = 625/640 (97.66%), Postives = 633/640 (98.91%), Query Frame = 0

Query: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
           MARTPNLLLA+LLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH
Sbjct: 1   MARTPNLLLALLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60

Query: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
           RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC
Sbjct: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120

Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
           ELCGKHQDPKFDPELSSTYQPVKCNSDCTCD DGVQCVYERQYAEMSTSSGVLGDDVISF
Sbjct: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDGDGVQCVYERQYAEMSTSSGVLGDDVISF 180

Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
           GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG
Sbjct: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240

Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
           GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS
Sbjct: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300

Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
           VLDSGTTYSYLPQEAFGPFKNAI+NALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP
Sbjct: 301 VLDSGTTYSYLPQEAFGPFKNAILNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360

Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
           TVDL+FDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR
Sbjct: 361 TVDLVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420

Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480
           EHSKIGFWKTNCSELWERLHISD+NA APSVSNTSHDTD APASAPSESPHDMIPED+QI
Sbjct: 421 EHSKIGFWKTNCSELWERLHISDDNADAPSVSNTSHDTDMAPASAPSESPHDMIPEDLQI 480

Query: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540
           GRITFDILLNISYKHLEPHIT LSDHIA ELNVSHSQVRLLNFTMRGNHSLIQLAILPNG
Sbjct: 481 GRITFDILLNISYKHLEPHITQLSDHIAHELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540

Query: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600
           SSEFFS ATATTIISLIV HHMKLPP+YGSYQVIRWNVEPLMDRSLWKRLY+LVGLAIMV
Sbjct: 541 SSEFFSPATATTIISLIVGHHMKLPPKYGSYQVIRWNVEPLMDRSLWKRLYILVGLAIMV 600

Query: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
           TLILGLSA+GVWFIWRRRQQAFHSYKPVNAAAPEQELQTL
Sbjct: 601 TLILGLSAMGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 640

BLAST of CmaCh20G009810 vs. NCBI nr
Match: KAG7011039.1 (Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1278.8 bits (3308), Expect = 0.0e+00
Identity = 625/640 (97.66%), Postives = 633/640 (98.91%), Query Frame = 0

Query: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
           MARTPNLLLA+LLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH
Sbjct: 1   MARTPNLLLALLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60

Query: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
           RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC
Sbjct: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120

Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
           ELCGKHQDPKFDPELSSTYQPVKCNSDCTCD DGVQCVYERQYAEMSTSSGVLGDDVISF
Sbjct: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDGDGVQCVYERQYAEMSTSSGVLGDDVISF 180

Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
           GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG
Sbjct: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240

Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
           GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS
Sbjct: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300

Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
           VLDSGTTYSYLPQEAFGPFKNAI+NALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP
Sbjct: 301 VLDSGTTYSYLPQEAFGPFKNAILNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360

Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
           TVDL+FDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR
Sbjct: 361 TVDLVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420

Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480
           EHSKIGFWKTNCSELWERLHISD+NA APSVSNTSHDTD APASAPSESPHDMIPED+QI
Sbjct: 421 EHSKIGFWKTNCSELWERLHISDDNADAPSVSNTSHDTDMAPASAPSESPHDMIPEDLQI 480

Query: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540
           GRITFDILLNISYKHLEPHIT LSDHIA ELNVSHSQVRLLNFTMRGNHSLIQLAILPNG
Sbjct: 481 GRITFDILLNISYKHLEPHITQLSDHIAHELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540

Query: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600
           SSEFFS ATATTIISLIVEHHMKLPP+YGSYQVIRW VEPLMDRSLWKRLY+LVGLAIMV
Sbjct: 541 SSEFFSPATATTIISLIVEHHMKLPPKYGSYQVIRWYVEPLMDRSLWKRLYILVGLAIMV 600

Query: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
           TLILGLSA+GVWFIWRRRQQAFHSYKPVNAAAPEQELQTL
Sbjct: 601 TLILGLSAMGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 640

BLAST of CmaCh20G009810 vs. TAIR 10
Match: AT3G50050.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 714.9 bits (1844), Expect = 5.7e-206
Identity = 357/607 (58.81%), Postives = 451/607 (74.30%), Query Frame = 0

Query: 37  SHRAMVLPLYRSSPN-SSKLISKPHRRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWI 96
           S R MV PL+ S PN SS+ IS PHR+L    +S +  ++RMRLYDDLL+NGYYTTRLWI
Sbjct: 41  SRRPMVFPLFLSQPNSSSRSISIPHRKLHK-SDSKSLPHSRMRLYDDLLINGYYTTRLWI 100

Query: 97  GTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGV 156
           GTPPQ FALIVD+GSTVTYVPCS CE CGKHQDPKF PE+SSTYQPVKCN DC CD+D  
Sbjct: 101 GTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKCNMDCNCDDDRE 160

Query: 157 QCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLG 216
           QCVYER+YAE S+S GVLG+D+ISFGN+S L PQRAVFGCE  ETGDLYSQRADGI+GLG
Sbjct: 161 QCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETVETGDLYSQRADGIIGLG 220

Query: 217 SGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNV 276
            GDLS+VDQLV+KG+I++SF LCYGGMD+GGG+M+LGG   PS+M+F+ SDP RSPYYN+
Sbjct: 221 QGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILGGFDYPSDMVFTDSDPDRSPYYNI 280

Query: 277 DLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGG 336
           DL  I VAGK+L L   VFDG +G+VLDSGTTY+YLP  AF  F+ A+M  + +LK+I G
Sbjct: 281 DLTGIRVAGKQLSLHSRVFDGEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDG 340

Query: 337 PDPNFKDTCFSGAGSD-AAELSKTFPTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGI 396
           PDPNFKDTCF  A S+  +ELSK FP+V+++F +GQ   L+PENY+FRHSKVHGAYCLG+
Sbjct: 341 PDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGV 400

Query: 397 FENGNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCSELWERLHISDENAHAPSVSNT 456
           F NG  D TTLLGGI+VRNTLV+YDRE+SK+GFW+TNCSEL +RLHI      A   SN 
Sbjct: 401 FPNG-KDHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCSELSDRLHIDGAPPPATLPSND 460

Query: 457 SHDTDTAPASAPSESPHDMIPEDIQIGRITFDILLNISYKHLEPHITHLSDHIAQELNVS 516
           S+         PS +    +    Q+G+I  DI L ++  +L+P I  LS   ++EL+V 
Sbjct: 461 SN---------PSHNSSSNLSGVTQVGQINLDIQLTVNSSYLKPRIEDLSKIFSKELDVK 520

Query: 517 HSQVRLLNFTMRGNHSLIQLAILPNGSSEFFSHATATTIISLIVEHHMKLPPRYGSYQVI 576
            SQV L N T +GN SL+++ +LP   S +FS+ TAT I+S    H +KLP  +G+YQ++
Sbjct: 521 SSQVSLSNLTSKGNESLVRMVVLPPEPSTWFSNVTATNIVSRFTNHQIKLPEIFGNYQLV 580

Query: 577 RWNVEPLMDRSLWKRLYVLVGLAIMVTLILGLSAVGVWFIWRRRQQAFHSYKPVN-AAAP 636
            + +EP   R+    + + +G+   + +I+GLSA G W IW+R+Q +   YKPV+ A   
Sbjct: 581 NYKLEPPRKRTNNNIVVIAIGI---IAVIVGLSAYGAWLIWKRKQTSI-PYKPVDEAIVA 632

Query: 637 EQELQTL 641
           EQELQ +
Sbjct: 641 EQELQPI 632

BLAST of CmaCh20G009810 vs. TAIR 10
Match: AT5G43100.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 705.3 bits (1819), Expect = 4.5e-203
Identity = 350/626 (55.91%), Postives = 455/626 (72.68%), Query Frame = 0

Query: 16  LHLTHFTLSADPISSNPLLTPSHRAMVLPL-YRSSPNSSKLISKPHRRLRGFPNSNNRSN 75
           L L  FT +   I    L T     M+ PL Y S P   ++     RRL    + +   N
Sbjct: 6   LLLLLFTTTTISIFFFDLTTADESPMIFPLSYSSLPPRPRVEDFRRRRL----HQSQLPN 65

Query: 76  ARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPE 135
           A M+LYDDLL NGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPCSTC+ CGKHQDPKF PE
Sbjct: 66  AHMKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPE 125

Query: 136 LSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFG 195
           LS++YQ +KCN DC CD++G  CVYER+YAEMS+SSGVL +D+ISFGN+S L PQRAVFG
Sbjct: 126 LSTSYQALKCNPDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFG 185

Query: 196 CENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGI 255
           CENEETGDL+SQRADGIMGLG G LS+VDQLV+KGVI D FSLCYGGM++GGGAMVLG I
Sbjct: 186 CENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKI 245

Query: 256 SPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQE 315
           SPP  M+FS+SDP RSPYYN+DLK++HVAGK L L P VF+G++G+VLDSGTTY+Y P+E
Sbjct: 246 SPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYFPKE 305

Query: 316 AFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQKLSL 375
           AF   K+A++  + SLK+I GPDPN+ D CFSGAG D AE+   FP + + F NGQKL L
Sbjct: 306 AFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLIL 365

Query: 376 APENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCSE 435
           +PENYLFRH+KV GAYCLGIF   + D TTLLGGI+VRNTLV YDRE+ K+GF KTNCS+
Sbjct: 366 SPENYLFRHTKVRGAYCLGIFP--DRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCSD 425

Query: 436 LWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQIGRITFDILLNISYK 495
           +W RL   +  A    +S  +  ++ +P+ A SESP   +P   ++G ITF++ ++++  
Sbjct: 426 IWRRLAAPESPAPTSPISQ-NKSSNISPSPATSESPTSHLPGVFRVGVITFEVSISVNNS 485

Query: 496 HLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNGSSEFFSHATATTII 555
            L+P  + ++D IA EL++  +QVRLLNF+  GN   ++  + P  SSE+ S+ TA  I+
Sbjct: 486 SLKPKFSEIADFIAHELDIQSAQVRLLNFSSSGNEYRLKWGVFPPQSSEYISNTTALNIM 545

Query: 556 SLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMVTLILGLSAVGVWFI 615
            L+ E+ ++LP ++GSY+++ W  E    +S W++  + V    M++L++    + +  +
Sbjct: 546 LLLKENRLRLPGQFGSYKLLEWKAEQKKKQSWWEKHLLGVVGGAMISLLVTSVMIKLALV 605

Query: 616 WRRRQQAFHSYKPVNAAAPEQELQTL 641
           WRRR+Q   +Y+PVNAA  EQELQ L
Sbjct: 606 WRRRKQEEATYEPVNAAIKEQELQPL 624

BLAST of CmaCh20G009810 vs. TAIR 10
Match: AT5G22850.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 201.4 bits (511), Expect = 2.1e-51
Identity = 142/418 (33.97%), Postives = 210/418 (50.24%), Query Frame = 0

Query: 82  DLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPK-----FDPELS 141
           D  + G Y T+L +GTPP+ F + VDTGS V +V C++C  C +    +     FDP  S
Sbjct: 74  DPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSS 133

Query: 142 STYQPVKC----------NSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGN--QS 201
            T  P+ C          +SD  C      C Y  QY + S +SG    DV+ F     S
Sbjct: 134 VTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGS 193

Query: 202 ALVPQR---AVFGCENEETGDLY-SQRA-DGIMGLGSGDLSIVDQLVEKGVINDSFSLCY 261
           +LVP      VFGC   +TGDL  S RA DGI G G   +S++ QL  +G+    FS C 
Sbjct: 194 SLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL 253

Query: 262 GGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVF--DGR 321
            G + GGG +VLG I  P+ M+F+   P   P+YNV+L  I V G+ LP+ PSVF     
Sbjct: 254 KGENGGGGILVLGEIVEPN-MVFTPLVP-SQPHYNVNLLSISVNGQALPINPSVFSTSNG 313

Query: 322 YGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSK 381
            G+++D+GTT +YL + A+ PF  AI NA+    +   P  +  + C+    S    +  
Sbjct: 314 QGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVR---PVVSKGNQCYVITTS----VGD 373

Query: 382 TFPTVDLIFDNGQKLSLAPENYLFRHSKVHG--AYCLGIFENGNNDQTTLLGGIIVRNTL 441
            FP V L F  G  + L P++YL + + V G   +C+G F+   N   T+LG +++++ +
Sbjct: 374 IFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIG-FQRIQNQGITILGDLVLKDKI 433

Query: 442 VMYDREHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDM 474
            +YD    +IG+   +CS        +  N  A S S  S   +    S  + +P  +
Sbjct: 434 FVYDLVGQRIGWANYDCS--------TSVNVSATSSSGRSEYVNAGQFSENAAAPQKL 473

BLAST of CmaCh20G009810 vs. TAIR 10
Match: AT1G08210.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 199.9 bits (507), Expect = 6.2e-51
Identity = 152/456 (33.33%), Postives = 226/456 (49.56%), Query Frame = 0

Query: 7   LLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRS--SPNSSKLISKPHRRLR 66
           ++ AVLL  L  T     +D +     L P +  + L   R+  S    +L+  P   + 
Sbjct: 11  IIAAVLL--LAATTLACGSDAVLKLERLIPPNHELGLTELRAFDSARHGRLLQSPVGGVV 70

Query: 67  GFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCG 126
            FP              D  L G Y T++ +GTPP++F + +DTGS V +V C++C  C 
Sbjct: 71  NFPVDG---------ASDPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCP 130

Query: 127 KHQDPK-----FDPELSSTYQPVKCN-----------SDCTCDNDGVQCVYERQYAEMST 186
           K  + +     FDP +SS+   V C+           S C+ +N    C Y  +Y + S 
Sbjct: 131 KTSELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTESGCSPNN---LCSYSFKYGDGSG 190

Query: 187 SSGVLGDDVISFGN--QSALVPQRA---VFGCENEETGDLYSQR--ADGIMGLGSGDLSI 246
           +SG    D +SF     S L    +   VFGC N ++GDL   R   DGI GLG G LS+
Sbjct: 191 TSGYYISDFMSFDTVITSTLAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSV 250

Query: 247 VDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGISPPSEMIFSYSDPVRS-PYYNVDLKEI 306
           + QL  +G+    FS C  G   GGG MVLG I  P  +   Y+  V S P+YNV+L+ I
Sbjct: 251 ISQLAVQGLAPRVFSHCLKGDKSGGGIMVLGQIKRPDTV---YTPLVPSQPHYNVNLQSI 310

Query: 307 HVAGKKLPLEPSVFD--GRYGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDP 366
            V G+ LP++PSVF      G+++D+GTT +YLP EA+ PF  A+ NA   + + G P  
Sbjct: 311 AVNGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVANA---VSQYGRPIT 370

Query: 367 NFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQKLSLAPENYL-FRHSKVHGAYCLGIFEN 426
                CF     D       FP V L F  G  + L P  YL    S     +C+G F+ 
Sbjct: 371 YESYQCFEITAGDV----DVFPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIG-FQR 430

Query: 427 GNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCS 434
            ++ + T+LG +++++ +V+YD    +IG+ + +CS
Sbjct: 431 MSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441

BLAST of CmaCh20G009810 vs. TAIR 10
Match: AT2G36670.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 182.2 bits (461), Expect = 1.3e-45
Identity = 127/377 (33.69%), Postives = 198/377 (52.52%), Query Frame = 0

Query: 89  YTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQD-----PKFDPELSSTYQPVK 148
           Y T++ +G+PP +F + +DTGS + +V CS+C  C            FD   S T   V 
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 164

Query: 149 CNSDCTCD----------NDGVQCVYERQYAEMSTSSG-----------VLGDDVISFGN 208
           C SD  C           ++  QC Y  +Y + S +SG           +LG+ +++  N
Sbjct: 165 C-SDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVA--N 224

Query: 209 QSALVPQRAVFGCENEETGDL--YSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 268
            SA +    VFGC   ++GDL    +  DGI G G G LS+V QL  +G+    FS C  
Sbjct: 225 SSAPI----VFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLK 284

Query: 269 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFD--GRY 328
           G   GGG  VLG I  P  M++S   P   P+YN++L  I V G+ LPL+ +VF+     
Sbjct: 285 GDGSGGGVFVLGEILVPG-MVYSPLVP-SQPHYNLNLLSIGVNGQMLPLDAAVFEASNTR 344

Query: 329 GSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKT 388
           G+++D+GTT +YL +EA+  F NAI N   S+ ++  P  +  + C+  + S    +S  
Sbjct: 345 GTIVDTGTTLTYLVKEAYDLFLNAISN---SVSQLVTPIISNGEQCYLVSTS----ISDM 404

Query: 389 FPTVDLIFDNGQKLSLAPENYLFRHSKVHGA--YCLGIFENGNNDQTTLLGGIIVRNTLV 434
           FP+V L F  G  + L P++YLF +    GA  +C+G F+    +Q T+LG +++++ + 
Sbjct: 405 FPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIG-FQKAPEEQ-TILGDLVLKDKVF 463

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q4V3D26.2e-4032.80Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1[more]
Q9S9K41.8e-3428.71Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2[more]
Q9LHE33.9e-3429.64Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Q9LS405.1e-3429.04Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Q766C31.6e-3230.08Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Match NameE-valueIdentityDescription
A0A6J1JE380.0e+00100.00aspartic proteinase nepenthesin-1-like OS=Cucurbita maxima OX=3661 GN=LOC1114836... [more]
A0A6J1EYA50.0e+0097.66aspartic proteinase nepenthesin-1-like OS=Cucurbita moschata OX=3662 GN=LOC11143... [more]
A0A5A7TUH10.0e+0082.97Aspartic proteinase CDR1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_sc... [more]
A0A1S3C7L50.0e+0082.97aspartic proteinase CDR1-like OS=Cucumis melo OX=3656 GN=LOC103497389 PE=3 SV=1[more]
A0A0A0LJB92.4e-31082.19Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G27707... [more]
Match NameE-valueIdentityDescription
XP_022985603.10.0e+00100.00aspartic proteinase nepenthesin-1-like [Cucurbita maxima][more]
XP_023512095.10.0e+0097.81aspartic proteinase nepenthesin-1-like [Cucurbita pepo subsp. pepo] >XP_02352220... [more]
KAG6571239.10.0e+0097.81Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. soror... [more]
XP_022932899.10.0e+0097.66aspartic proteinase nepenthesin-1-like [Cucurbita moschata][more]
KAG7011039.10.0e+0097.66Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. argyr... [more]
Match NameE-valueIdentityDescription
AT3G50050.15.7e-20658.81Eukaryotic aspartyl protease family protein [more]
AT5G43100.14.5e-20355.91Eukaryotic aspartyl protease family protein [more]
AT5G22850.12.1e-5133.97Eukaryotic aspartyl protease family protein [more]
AT1G08210.16.2e-5133.33Eukaryotic aspartyl protease family protein [more]
AT2G36670.11.3e-4533.69Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 404..419
score: 25.16
coord: 300..311
score: 34.66
coord: 247..260
score: 24.48
coord: 95..115
score: 50.68
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 18..473
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 70..252
e-value: 3.2E-48
score: 166.2
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 261..436
e-value: 3.8E-46
score: 159.1
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 85..437
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 273..427
e-value: 1.5E-26
score: 93.1
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 89..253
e-value: 1.1E-37
score: 129.9
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 447..465
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 447..472
NoneNo IPR availablePANTHERPTHR13683:SF817OS07G0592200 PROTEINcoord: 18..473
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 104..115
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 89..428
score: 46.514915
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 88..432
e-value: 7.47453E-77
score: 244.095

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh20G009810.1CmaCh20G009810.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0045944 positive regulation of transcription by RNA polymerase II
biological_process GO:0006508 proteolysis
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005634 nucleus
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0046983 protein dimerization activity
molecular_function GO:0000977 RNA polymerase II transcription regulatory region sequence-specific DNA binding