Moc10g02970 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc10g02970
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionaspartic proteinase CDR1-like
Locationchr10: 2041320 .. 2044171 (+)
RNA-Seq ExpressionMoc10g02970
SyntenyMoc10g02970
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCATCCTCCATATTTCTCATATTTCTCACACTACTTTCCATCACCCACAAATCCGCCGGCACCGGCGGTCTGAGCCTGGAACTAATCCGCAACGACTCTCCAAAATCCGCCTTCTTCCGCCGCCGCCCCTCCTCCCAAGCTTGCCCGGAGACCGCCCAGTCGACAATAACGCCGGATAACAGCCAGTTTCTGGTGAAACTGGCCGTCGGAACACCGCCGAGAGACGTGTTCGCAATTCTCGACACCGGCAGCGATCTCTTCTGGACTCAATGCCTCCCCTGTGCGAATTGCTACCCACAAACAAATCCCATTTTCGATCCCTCGAGATCGGAGAGCTTCCGGGAGCTGCCGTGCCCAGCGGCGCTGTGCCACCTGCAGGGCTCCGGGGCGGTGTGCTCCGGCGGCGGCGCGTGTGGGTACAGCTACGGGTACGGAGGGGGGCTGACGGAGGGGAATTTGGCGACGGAAACGGTGGCCGTGAGTTCGAGACTTGGAGAGAGGGTTTGGCCGTTTCAGAACGTCGTGTTTGGTTGTGGTCATAATAACAGCGGCGGTTTTAATCAGAATGAAATGGGATTGATTGGCTTTGGTAGAGGACCCGTTTCCTTCATTTCTCAGGTAAATTTTTTGTTTTTGTTTTTGTTTTTGTTTTTTCTTTTAATCACTACGATTTCTAGCAAAACAAAGAATAAAATTTGATGCTTTTTAATTTTTTCTGAAAAATAGAAAAAATAGTAACCTAAGTTTTCTTTTTATTTTATTTTATTTTTTTTTGCAAAACTAATAAGAAAAATAGCGGGTATAAAATTTAATAAGAAAAATTTCATTCCATAGCATAATTTTAATAAGTAATTTTTTTAAATATCTTTTTTAACGTAACGTAATATTGATTTTTAACGTGTTTTCATTTAAATTTTACTATTTTGTTTTGGTTCTAAACTTTTTGATATATTTGAAAATAAATGTTTATTTTATGATATTTATGTAATTTTCAAAAAAAAAAATAATAATAAAATGAATTACTGTCTATGGAATTGGTAGGGTGTATCAGTTTGTATGAGACTTCTGATTTCATCAAATTAGAATTAAATTTGGATGAGACTTTATCTTAAAGTTTAATGAATGTTGAAATTTTTACTTGCTTTGGTTTGAAGAATGGTTAAGAGAAAAAAGTATTTCGATGGTGTTTAATGAATTTTAAAGCATTTTTTTTTTGTAAGATTAAATTGATGGATCCAGGGAAATATAACTCTAGAGCTATTTTTTTTGAGAGATTTGTTCAATGGTTCGTCTGGTATATTTTGGTCATGCTTAATTTTGTACCATGTTTATTTGTTTGTTTTTTTTAACCTAATTAAGTGAAAATTGCAATACATTTGTCTAAGTATAGGTATCAATTTGAAATTGATACACTTTGTGAATTTCATCCATATAAATTGGTTTTTTCTTTAAATTAATTATATTATAAATTATGATTAGTAAATACATAAAAATATATATATATATATATATACTTGTTATTAATTTTTAAAATATTTTTTAATATTTATAATTATTTTTTTTAAAATAACAGTATGAATTTCTTAAAATTATTCAACTTAAAGTGAAGTGATAACTAACAATAATTCTACGATAACTATTTCTTATAATTCCATGATTTTTTCCTCTAAATGTATTCATAATATTTGGATTTGGAATTCTCACCCTTGTGCTGATTGAATCAAAGTTCATAACATGTTATTTATTTTCCTTTTTTTAGTACATGATATTCGAACTCACGTTATTCTGGTTGAACAAAGGTATTTTTATTTTTATTTGTTGAACTATATTCATAGGTTGGCAATAGCTATGTTATTTATTTTTTCTAAATTTGTTTTTTTAATGAGAACTATGTTACTTGTCAATAATAATAACAATAATGATGATGGATGAATAAAAACAAACCTAGTTTAAAGTTTATAAAAGTAATTGAAATTTTGTATATATTGAGAAATTAATAATTATGTACTGTTTATTTAGTCTTTATAAGTTATTGTTAATCTAAACATATTATATATGTACCAAAAAGATTAGATGTTGGAAGCACCACTTCTTGAAATTAACAAAAAAGAAAAAAGAAAAGAAAAAGTAGATAATTTTAAGAAATACTGTCTATTAAAATGACGAGTTTATTATATTCTTATTGAAATAATTGTTTTATTTAATCATAATTTTCAAAATCCAAATTAAAGTCCACGTATTCACAGATAGGCCCATCCATAGGCGGCAGAAAGTTCTCCCACTGCCTGATGCCATTCAACACCGACCCGAGAATTTCAAGCAGCCTCCAACTGGGGTCGGGTTCGGAAGTTCGGGGCCCCGGAGTCATCACGATCCAACTGGTTCCCACGCCCGACCCGACGTTCTACGCTCTCACCCTCACCGGAATCAGCGTCGGAAAAACCTTCCTCCCGTACAGTTCGTCGGGACCGGCGGCGCAGGGGAACGTGATTCTCGATTCCGGCACGCCGCCAACTCTCCTCCCGGAGGATTTTTACAGCCGTTTCGCCGCCGAGGTGCGGCGGCGGATCCGGTGGCGGCCGGTCGGAGCGGGTCTTTGCTACAGAAATGTGAGGAGGTTCGCGGCGCCGCCGGTGACTCTGCACTTCGACGGCGGAGTGGAGTTGCCGCTGAGTACGGTTCAGACGTTCATCCGGAATCGAGATGGGTCGTTTTGCTTCGCTGTGGCAGGAATTTCCGGCACCGGCGGGATCATCGGAAACTTTATGCTGGCGAATTTTTTGGTTGGGTATGATATTGACGAGATGACGGTCTCGTTTAAGAAAGCTGATTGCACTAAAATTGGTTGA

mRNA sequence

ATGGCATCCTCCATATTTCTCATATTTCTCACACTACTTTCCATCACCCACAAATCCGCCGGCACCGGCGGTCTGAGCCTGGAACTAATCCGCAACGACTCTCCAAAATCCGCCTTCTTCCGCCGCCGCCCCTCCTCCCAAGCTTGCCCGGAGACCGCCCAGTCGACAATAACGCCGGATAACAGCCAGTTTCTGGTGAAACTGGCCGTCGGAACACCGCCGAGAGACGTGTTCGCAATTCTCGACACCGGCAGCGATCTCTTCTGGACTCAATGCCTCCCCTGTGCGAATTGCTACCCACAAACAAATCCCATTTTCGATCCCTCGAGATCGGAGAGCTTCCGGGAGCTGCCGTGCCCAGCGGCGCTGTGCCACCTGCAGGGCTCCGGGGCGGTGTGCTCCGGCGGCGGCGCGTGTGGGTACAGCTACGGGTACGGAGGGGGGCTGACGGAGGGGAATTTGGCGACGGAAACGGTGGCCGTGAGTTCGAGACTTGGAGAGAGGGTTTGGCCGTTTCAGAACGTCGTGTTTGGTTGTGGTCATAATAACAGCGGCGGTTTTAATCAGAATGAAATGGGATTGATTGGCTTTGGTAGAGGACCCGTTTCCTTCATTTCTCAGATAGGCCCATCCATAGGCGGCAGAAAGTTCTCCCACTGCCTGATGCCATTCAACACCGACCCGAGAATTTCAAGCAGCCTCCAACTGGGGTCGGGTTCGGAAGTTCGGGGCCCCGGAGTCATCACGATCCAACTGGTTCCCACGCCCGACCCGACGTTCTACGCTCTCACCCTCACCGGAATCAGCGTCGGAAAAACCTTCCTCCCGTACAGTTCGTCGGGACCGGCGGCGCAGGGGAACGTGATTCTCGATTCCGGCACGCCGCCAACTCTCCTCCCGGAGGATTTTTACAGCCGTTTCGCCGCCGAGGTGCGGCGGCGGATCCGGTGGCGGCCGGTCGGAGCGGGTCTTTGCTACAGAAATGTGAGGAGGTTCGCGGCGCCGCCGGTGACTCTGCACTTCGACGGCGGAGTGGAGTTGCCGCTGAGTACGGTTCAGACGTTCATCCGGAATCGAGATGGGTCGTTTTGCTTCGCTGTGGCAGGAATTTCCGGCACCGGCGGGATCATCGGAAACTTTATGCTGGCGAATTTTTTGGTTGGGTATGATATTGACGAGATGACGGTCTCGTTTAAGAAAGCTGATTGCACTAAAATTGGTTGA

Coding sequence (CDS)

ATGGCATCCTCCATATTTCTCATATTTCTCACACTACTTTCCATCACCCACAAATCCGCCGGCACCGGCGGTCTGAGCCTGGAACTAATCCGCAACGACTCTCCAAAATCCGCCTTCTTCCGCCGCCGCCCCTCCTCCCAAGCTTGCCCGGAGACCGCCCAGTCGACAATAACGCCGGATAACAGCCAGTTTCTGGTGAAACTGGCCGTCGGAACACCGCCGAGAGACGTGTTCGCAATTCTCGACACCGGCAGCGATCTCTTCTGGACTCAATGCCTCCCCTGTGCGAATTGCTACCCACAAACAAATCCCATTTTCGATCCCTCGAGATCGGAGAGCTTCCGGGAGCTGCCGTGCCCAGCGGCGCTGTGCCACCTGCAGGGCTCCGGGGCGGTGTGCTCCGGCGGCGGCGCGTGTGGGTACAGCTACGGGTACGGAGGGGGGCTGACGGAGGGGAATTTGGCGACGGAAACGGTGGCCGTGAGTTCGAGACTTGGAGAGAGGGTTTGGCCGTTTCAGAACGTCGTGTTTGGTTGTGGTCATAATAACAGCGGCGGTTTTAATCAGAATGAAATGGGATTGATTGGCTTTGGTAGAGGACCCGTTTCCTTCATTTCTCAGATAGGCCCATCCATAGGCGGCAGAAAGTTCTCCCACTGCCTGATGCCATTCAACACCGACCCGAGAATTTCAAGCAGCCTCCAACTGGGGTCGGGTTCGGAAGTTCGGGGCCCCGGAGTCATCACGATCCAACTGGTTCCCACGCCCGACCCGACGTTCTACGCTCTCACCCTCACCGGAATCAGCGTCGGAAAAACCTTCCTCCCGTACAGTTCGTCGGGACCGGCGGCGCAGGGGAACGTGATTCTCGATTCCGGCACGCCGCCAACTCTCCTCCCGGAGGATTTTTACAGCCGTTTCGCCGCCGAGGTGCGGCGGCGGATCCGGTGGCGGCCGGTCGGAGCGGGTCTTTGCTACAGAAATGTGAGGAGGTTCGCGGCGCCGCCGGTGACTCTGCACTTCGACGGCGGAGTGGAGTTGCCGCTGAGTACGGTTCAGACGTTCATCCGGAATCGAGATGGGTCGTTTTGCTTCGCTGTGGCAGGAATTTCCGGCACCGGCGGGATCATCGGAAACTTTATGCTGGCGAATTTTTTGGTTGGGTATGATATTGACGAGATGACGGTCTCGTTTAAGAAAGCTGATTGCACTAAAATTGGTTGA

Protein sequence

MASSIFLIFLTLLSITHKSAGTGGLSLELIRNDSPKSAFFRRRPSSQACPETAQSTITPDNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSESFRELPCPAALCHLQGSGAVCSGGGACGYSYGYGGGLTEGNLATETVAVSSRLGERVWPFQNVVFGCGHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRISSSLQLGSGSEVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPYSSSGPAAQGNVILDSGTPPTLLPEDFYSRFAAEVRRRIRWRPVGAGLCYRNVRRFAAPPVTLHFDGGVELPLSTVQTFIRNRDGSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKKADCTKIG
Homology
BLAST of Moc10g02970 vs. NCBI nr
Match: XP_022140900.1 (aspartic proteinase CDR1-like [Momordica charantia])

HSP 1 Score: 833.2 bits (2151), Expect = 9.8e-238
Identity = 407/407 (100.00%), Postives = 407/407 (100.00%), Query Frame = 0

Query: 1   MASSIFLIFLTLLSITHKSAGTGGLSLELIRNDSPKSAFFRRRPSSQACPETAQSTITPD 60
           MASSIFLIFLTLLSITHKSAGTGGLSLELIRNDSPKSAFFRRRPSSQACPETAQSTITPD
Sbjct: 1   MASSIFLIFLTLLSITHKSAGTGGLSLELIRNDSPKSAFFRRRPSSQACPETAQSTITPD 60

Query: 61  NSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSESFRELPCP 120
           NSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSESFRELPCP
Sbjct: 61  NSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSESFRELPCP 120

Query: 121 AALCHLQGSGAVCSGGGACGYSYGYGGGLTEGNLATETVAVSSRLGERVWPFQNVVFGCG 180
           AALCHLQGSGAVCSGGGACGYSYGYGGGLTEGNLATETVAVSSRLGERVWPFQNVVFGCG
Sbjct: 121 AALCHLQGSGAVCSGGGACGYSYGYGGGLTEGNLATETVAVSSRLGERVWPFQNVVFGCG 180

Query: 181 HNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRISSSLQLGSGS 240
           HNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRISSSLQLGSGS
Sbjct: 181 HNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRISSSLQLGSGS 240

Query: 241 EVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPYSSSGPAAQGNVILDSGTPPTLLP 300
           EVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPYSSSGPAAQGNVILDSGTPPTLLP
Sbjct: 241 EVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPYSSSGPAAQGNVILDSGTPPTLLP 300

Query: 301 EDFYSRFAAEVRRRIRWRPVGAGLCYRNVRRFAAPPVTLHFDGGVELPLSTVQTFIRNRD 360
           EDFYSRFAAEVRRRIRWRPVGAGLCYRNVRRFAAPPVTLHFDGGVELPLSTVQTFIRNRD
Sbjct: 301 EDFYSRFAAEVRRRIRWRPVGAGLCYRNVRRFAAPPVTLHFDGGVELPLSTVQTFIRNRD 360

Query: 361 GSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKKADCTKIG 408
           GSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKKADCTKIG
Sbjct: 361 GSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKKADCTKIG 407

BLAST of Moc10g02970 vs. NCBI nr
Match: XP_023538771.1 (aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 515.0 bits (1325), Expect = 5.9e-142
Identity = 263/408 (64.46%), Postives = 302/408 (74.02%), Query Frame = 0

Query: 1   MASSIFLIFLTLLSITHKSAGT-GGLSLELIRNDSPKSAFFRRRPSSQACPETAQSTITP 60
           MA SIFL+ L LLSI    AG  GGL LELIR         RR       P  AQS I P
Sbjct: 1   MALSIFLL-LALLSIAESIAGKGGGLKLELIR---------RRLSPDNVSPMAAQSQIWP 60

Query: 61  DNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSESFRELPC 120
           + SQF++K+AVGTPP +V AI DTGSDLFWTQCLPCANCY QTNPI+DPS+S +F+ L C
Sbjct: 61  EASQFIIKIAVGTPPTEVHAIFDTGSDLFWTQCLPCANCYRQTNPIYDPSKSSTFQTLSC 120

Query: 121 PAALCHLQGSGAVCSGGGACGYSYGYGGGLTEGNLATETVAVSSRLGERVWPFQNVVFGC 180
            +  CHL GSGA CSG   C Y+YGYG GLT+G LATE +AV+S  G    PF  VVFGC
Sbjct: 121 ESPQCHLTGSGATCSGTDMCKYNYGYGSGLTQGELATEQMAVTSSSGATT-PFPEVVFGC 180

Query: 181 GHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRISSSLQLGSG 240
           GHNN+G FN NEMGLIGFGRG +SFISQIGPS+GGRKFS CLMP NTDP ISSS+ +GSG
Sbjct: 181 GHNNTGTFNANEMGLIGFGRGAISFISQIGPSVGGRKFSLCLMPTNTDPTISSSISIGSG 240

Query: 241 SEVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPYSSSGPAAQGNVILDSGTPPTLL 300
           SEV+GPGVIT QLV   DPT+Y+LTLTGISVGKT +PYS SGP A+GN +LD+GTPPTLL
Sbjct: 241 SEVKGPGVITTQLVQISDPTYYSLTLTGISVGKTLVPYSMSGPPAKGNAVLDTGTPPTLL 300

Query: 301 PEDFYSRFAAEVRRRIRWRPVGAGLCYRNVRRFAAPPVTLHFDGGVELPLSTVQTFIRNR 360
           P++ Y R  AEVRR+I   P+G  LCY++        +TLHFDG V+LPLSTVQTF +  
Sbjct: 301 PKELYERLTAEVRRQIPSEPIGDTLCYKD--NLGDLVMTLHFDGDVDLPLSTVQTFNQMP 360

Query: 361 DGSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKKADCTKIG 408
           DGSFCFA  G+     +IGN M+ANFLVGYDID M VSFK  DCTKIG
Sbjct: 361 DGSFCFAAMGVDDDSALIGNSMMANFLVGYDIDNMMVSFKPTDCTKIG 395

BLAST of Moc10g02970 vs. NCBI nr
Match: KAG6601733.1 (Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 508.1 bits (1307), Expect = 7.2e-140
Identity = 260/409 (63.57%), Postives = 302/409 (73.84%), Query Frame = 0

Query: 1   MASSIFLIFLTLLSITHKSAGT-GGLSLELIRNDSPKSAFFRRRPSSQACPETAQSTITP 60
           MA +IF++ L LLSI   + G  GGL LELI+         RR       P  A+S I P
Sbjct: 1   MAPTIFIV-LALLSIAESTVGKGGGLKLELIQ---------RRLSPGNVSPMAAKSQIWP 60

Query: 61  DNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSESFRELPC 120
           + S+F+VK+AVGTPP +V AILDTGSDLFW QC PCA CY QTNPI+DPS+S +FR L C
Sbjct: 61  ETSEFIVKIAVGTPPTEVHAILDTGSDLFWAQCRPCAKCYRQTNPIYDPSKSSTFRTLSC 120

Query: 121 PAALCHLQGSGAVCSGGGACGYSYGYGGGLTEGNLATETVAVSSRLGERVWPFQNVVFGC 180
            +  CHL+GSGA CSG   C Y YGYG G T+G LATE +AV+SR G    PF  VVFGC
Sbjct: 121 KSPQCHLRGSGAACSGTDTCKYGYGYGSGSTQGELATEKMAVTSRSGATT-PFSGVVFGC 180

Query: 181 GHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRISSSLQLGSG 240
           GHNNSG FN NEMGLIGFGRG +SF+SQIGPS+GGRKFS CLMP+NTDPRISSSL +GSG
Sbjct: 181 GHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSG 240

Query: 241 SEVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPYSSSGPAAQGNVILDSGTPPTLL 300
           SEV+GPGVIT QLV TPD T Y+LTLTGISVGKT +PYS SGP A+GN +LD+GTPPTLL
Sbjct: 241 SEVKGPGVITAQLVRTPDQTSYSLTLTGISVGKTLVPYSMSGPPAKGNAVLDTGTPPTLL 300

Query: 301 PEDFYSRFAAEVRRRIRWRPVGAG-LCYRNVRRFAAPPVTLHFDGGVELPLSTVQTFIRN 360
           P++ Y R AAEVRR I  +PV    LCY++        +TLHF+GGV+L LSTVQTF + 
Sbjct: 301 PKELYGRLAAEVRRHIPSKPVDDDTLCYKD--NLGDLVMTLHFEGGVDLRLSTVQTFNKM 360

Query: 361 RDGSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKKADCTKIG 408
            DGSFCF   G+     +IGN M+ANFLVGYDID MTVSFK  DCTKIG
Sbjct: 361 SDGSFCFTAMGVDDKDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG 396

BLAST of Moc10g02970 vs. NCBI nr
Match: XP_022929935.1 (aspartic proteinase CDR1-like [Cucurbita moschata])

HSP 1 Score: 508.1 bits (1307), Expect = 7.2e-140
Identity = 259/409 (63.33%), Postives = 304/409 (74.33%), Query Frame = 0

Query: 1   MASSIFLIFLTLLSITHKSAGT-GGLSLELIRNDSPKSAFFRRRPSSQACPETAQSTITP 60
           MA +IF++ L LLS    +AG  GGL LELI+         RR P     P  A+S I P
Sbjct: 1   MAPTIFIV-LALLSTAESTAGKGGGLKLELIQ---------RRLPPGNVSPMAAKSQIWP 60

Query: 61  DNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSESFRELPC 120
           + S+F+VK+AVGTPP +V AILDTGSDLFW QC PCA CY QTNPI+DPS+S +FR L C
Sbjct: 61  ETSEFIVKIAVGTPPTEVHAILDTGSDLFWAQCRPCAKCYRQTNPIYDPSKSLTFRTLSC 120

Query: 121 PAALCHLQGSGAVCSGGGACGYSYGYGGGLTEGNLATETVAVSSRLGERVWPFQNVVFGC 180
            +  CHL+GSGA CSG   C Y YGYG G T+G LATE +AV+SR G +   F  VVFGC
Sbjct: 121 KSPQCHLRGSGAACSGTDTCKYGYGYGSGSTQGELATEKMAVTSRSGAKT-SFSGVVFGC 180

Query: 181 GHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRISSSLQLGSG 240
           GHNNSG FN NEMGLIGFGRG +SF+SQIGPS+GGRKFS CLMP+NTDPRISSSL +GSG
Sbjct: 181 GHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSG 240

Query: 241 SEVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPYSSSGPAAQGNVILDSGTPPTLL 300
           SEV+GPGVIT QLV TPD T Y+LTLTGISVGKT +PYS+SGP A+GN +LD+GTPPTLL
Sbjct: 241 SEVKGPGVITTQLVRTPDQTSYSLTLTGISVGKTLVPYSTSGPPAKGNAVLDTGTPPTLL 300

Query: 301 PEDFYSRFAAEVRRRIRWRPVGAG-LCYRNVRRFAAPPVTLHFDGGVELPLSTVQTFIRN 360
           P++ Y R AAEVRR I  +P+    LCY++        +TLHFDGGV+L LSTVQTF + 
Sbjct: 301 PKELYGRLAAEVRRHIPSKPIDDDTLCYKD--NLGDLVMTLHFDGGVDLRLSTVQTFNKM 360

Query: 361 RDGSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKKADCTKIG 408
            DGSFCF   G+     +IGN ++ANFLVGYDID MTVSFK  DCTKIG
Sbjct: 361 SDGSFCFTAMGVDDKDALIGNSIMANFLVGYDIDNMTVSFKPTDCTKIG 396

BLAST of Moc10g02970 vs. NCBI nr
Match: KAG6573507.1 (Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 506.9 bits (1304), Expect = 1.6e-139
Identity = 257/407 (63.14%), Postives = 302/407 (74.20%), Query Frame = 0

Query: 1   MASSIFLIFLTLLSITHKSAGT-GGLSLELIRNDSPKSAFFRRRPSSQACPETAQSTITP 60
           MA SIFL+ L LLSI   +A   GGL LELIR         RR       P  AQS I P
Sbjct: 1   MAPSIFLL-LALLSIVKSTAEKGGGLKLELIR---------RRLSPGNVSPMAAQSQIWP 60

Query: 61  DNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSESFRELPC 120
           + SQF++K+AVGTPP +V AI DTGSDLFWTQCLPCANCY QTNPI++PS+S +F+ L C
Sbjct: 61  EASQFIIKIAVGTPPTEVHAIFDTGSDLFWTQCLPCANCYRQTNPIYNPSKSSTFQTLSC 120

Query: 121 PAALCHLQGSGAVCSGGGACGYSYGYGGGLTEGNLATETVAVSSRLGERVWPFQNVVFGC 180
            +  CHL GSGA CSG   C Y+YGYG GLT+G LATE +AV+S  G  + PF  VVFGC
Sbjct: 121 ESPQCHLTGSGAACSGTDTCKYNYGYGSGLTQGELATEKMAVTSSFG-AMTPFPGVVFGC 180

Query: 181 GHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRISSSLQLGSG 240
           GHNN+G FN NEMGLIGFGRG +SF+SQIGPS+GGRKFS CLMP NTDP ISSS+ +G+G
Sbjct: 181 GHNNTGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPTNTDPTISSSISIGTG 240

Query: 241 SEVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPYSSSGPAAQGNVILDSGTPPTLL 300
           S+V GPGVIT QLV   DPT+Y+LTLTGISV  T +PYS+SGP A+GN +LD+GTPPTLL
Sbjct: 241 SQVEGPGVITAQLVQISDPTYYSLTLTGISVENTLVPYSTSGPPAKGNAVLDTGTPPTLL 300

Query: 301 PEDFYSRFAAEVRRRIRWRPVGAGLCYRNVRRFAAPPVTLHFDGGVELPLSTVQTFIRNR 360
           P++ Y R  AEVRR+I   P+G  LCY++        +TLHFDGGV+LPLSTVQTF +  
Sbjct: 301 PKELYGRLTAEVRRQIPSEPIGDTLCYKD--NLGDLVMTLHFDGGVDLPLSTVQTFNQMP 360

Query: 361 DGSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKKADCTKI 407
           DGSFCFA  G+     +IGN M+ANFLVGYDID MTVSFK  DCTKI
Sbjct: 361 DGSFCFAAMGVDDDSALIGNSMMANFLVGYDIDNMTVSFKPTDCTKI 394

BLAST of Moc10g02970 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 268.5 bits (685), Expect = 1.3e-70
Identity = 146/362 (40.33%), Postives = 210/362 (58.01%), Query Frame = 0

Query: 54  QSTITPDNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSES 113
           Q  +T ++ ++L+ +++GTPP  + AI DTGSDL WTQC PC +CY Q +P+FDP  S +
Sbjct: 80  QIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSST 139

Query: 114 FRELPCPAALCHLQGSGAVCS-GGGACGYSYGYG-GGLTEGNLATETVAVSSRLGERVWP 173
           ++++ C ++ C    + A CS     C YS  YG    T+GN+A +T+ + S    R   
Sbjct: 140 YKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSS-DTRPMQ 199

Query: 174 FQNVVFGCGHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRIS 233
            +N++ GCGHNN+G FN+   G++G G GPVS I Q+G SI G KFS+CL+P  +    +
Sbjct: 200 LKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDG-KFSYCLVPLTSKKDQT 259

Query: 234 SSLQLGSGSEVRGPGVITIQLV-PTPDPTFYALTLTGISVGKTFLPYS-SSGPAAQGNVI 293
           S +  G+ + V G GV++  L+      TFY LTL  ISVG   + YS S   +++GN+I
Sbjct: 260 SKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNII 319

Query: 294 LDSGTPPTLLPEDFYSRF----AAEVRRRIRWRP-VGAGLCYRNVRRFAAPPVTLHFDGG 353
           +DSGT  TLLP +FYS      A+ +    +  P  G  LCY        P +T+HFD G
Sbjct: 320 IDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHFD-G 379

Query: 354 VELPLSTVQTFIRNRDGSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKKADCT 407
            ++ L +   F++  +   CFA  G S +  I GN    NFLVGYD    TVSFK  DC 
Sbjct: 380 ADVKLDSSNAFVQVSEDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 437

BLAST of Moc10g02970 vs. ExPASy Swiss-Prot
Match: Q3EBM5 (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 241.9 bits (616), Expect = 1.3e-62
Identity = 160/448 (35.71%), Postives = 228/448 (50.89%), Query Frame = 0

Query: 1   MASSIFLIFLTLLSITHKSAG-TGGLSLELIRNDSPKS---------------AFFR--- 60
           MA+ I L F    S+T  S+G     S+ELI  DSP S               AF R   
Sbjct: 1   MATQILLCFFLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVS 60

Query: 61  --RRPSSQACPETAQSTITPDNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCY 120
             RR + Q      QS +   + +F + + +GTPP  VFAI DTGSDL W QC PC  CY
Sbjct: 61  RSRRFNHQLSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCY 120

Query: 121 PQTNPIFDPSRSESFRELPCPAALCH-LQGSGAVC-SGGGACGYSYGYGG-GLTEGNLAT 180
            +  PIFD  +S +++  PC +  C  L  +   C      C Y Y YG    ++G++AT
Sbjct: 121 KENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVAT 180

Query: 181 ETVAVSSRLGERVWPFQNVVFGCGHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRK 240
           ETV++ S  G  V  F   VFGCG+NN G F++   G+IG G G +S ISQ+G SI  +K
Sbjct: 181 ETVSIDSASGSPV-SFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSI-SKK 240

Query: 241 FSHCLMPFNTDPRISSSLQLGS----GSEVRGPGVITIQLVPTPDPTFYALTLTGISVGK 300
           FS+CL   +     +S + LG+     S  +  GV++  LV     T+Y LTL  ISVGK
Sbjct: 241 FSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGK 300

Query: 301 TFLPYSSSG---------PAAQGNVILDSGTPPTLLPEDFYSRFAAEVRRRIRWR----- 360
             +PY+ S              GN+I+DSGT  TLL   F+ +F++ V   +        
Sbjct: 301 KKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSD 360

Query: 361 PVG-AGLCYRN-VRRFAAPPVTLHFDGGVELPLSTVQTFIRNRDGSFCFAVAGISGTGGI 405
           P G    C+++       P +T+HF  G ++ LS +  F++  +   C ++   +    I
Sbjct: 361 PQGLLSHCFKSGSAEIGLPEITVHFT-GADVRLSPINAFVKLSEDMVCLSMVPTTEV-AI 420

BLAST of Moc10g02970 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 214.9 bits (546), Expect = 1.7e-54
Identity = 137/369 (37.13%), Postives = 184/369 (49.86%), Query Frame = 0

Query: 50  PETAQSTITPDNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPS 109
           P   ++++   + ++L+ L++GTP +   AI+DTGSDL WTQC PC  C+ Q+ PIF+P 
Sbjct: 81  PSGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQ 140

Query: 110 RSESFRELPCPAALCHLQGSGAVCSGGGACGYSYGYG-GGLTEGNLATETVAVSSRLGER 169
            S SF  LPC + LC    S   CS    C Y+YGYG G  T+G++ TET+   S     
Sbjct: 141 GSSSFSTLPCSSQLCQAL-SSPTCS-NNFCQYTYGYGDGSETQGSMGTETLTFGS----- 200

Query: 170 VWPFQNVVFGCGHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDP 229
                N+ FGCG NN G    N  GL+G GRGP+S  SQ+  +    KFS+C+ P  +  
Sbjct: 201 -VSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT----KFSYCMTPIGSS- 260

Query: 230 RISSSLQLGS-GSEVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPYSSSGPA---- 289
              S+L LGS  + V      T  +  +  PTFY +TL G+SVG T LP   S  A    
Sbjct: 261 -TPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSN 320

Query: 290 -AQGNVILDSGTPPTLLPEDFYSRFAAEVRRRIRWRPV-----GAGLCYR---NVRRFAA 349
              G +I+DSGT  T    + Y     E   +I    V     G  LC++   +      
Sbjct: 321 NGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQI 380

Query: 350 PPVTLHFDGGVELPLSTVQTFIRNRDGSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEM 404
           P   +HFDGG +L L +   FI   +G  C A+   S    I GN    N LV YD    
Sbjct: 381 PTFVMHFDGG-DLELPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNS 434

BLAST of Moc10g02970 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 192.2 bits (487), Expect = 1.2e-47
Identity = 135/381 (35.43%), Postives = 187/381 (49.08%), Query Frame = 0

Query: 41  RRRPSSQACPETAQSTITP---DNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCAN 100
           RR  S  A  +++    TP    + ++L+ +A+GTP     AI+DTGSDL WTQC PC  
Sbjct: 70  RRMRSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQ 129

Query: 101 CYPQTNPIFDPSRSESFRELPCPAALCHLQGSGAVCSGGGACGYSYGYG-GGLTEGNLAT 160
           C+ Q  PIF+P  S SF  LPC +  C  Q   +       C Y+YGYG G  T+G +AT
Sbjct: 130 CFSQPTPIFNPQDSSSFSTLPCESQYC--QDLPSETCNNNECQYTYGYGDGSTTQGYMAT 189

Query: 161 ETVAVSSRLGERVWPFQNVVFGCGHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRK 220
           ET    +          N+ FGCG +N G    N  GLIG G GP+S  SQ+G  +G  +
Sbjct: 190 ETFTFETS------SVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLG--VG--Q 249

Query: 221 FSHCLMPFNTDPRISSSLQLGSGSEVRGPGVITIQLVPTP-DPTFYALTLTGISVGKTFL 280
           FS+C+  + +     S+L LGS +     G  +  L+ +  +PT+Y +TL GI+VG   L
Sbjct: 250 FSYCMTSYGSSS--PSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNL 309

Query: 281 PYSSSGPAAQ----GNVILDSGTPPTLLPEDFYSRFAAEVRRRIRWRPV-----GAGLCY 340
              SS    Q    G +I+DSGT  T LP+D Y+  A     +I    V     G   C+
Sbjct: 310 GIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCF 369

Query: 341 R---NVRRFAAPPVTLHFDGGVELPLSTVQTFIRNRDGSFCFAVAGISGTG-GIIGNFML 400
           +   +      P +++ FDGGV L L      I   +G  C A+   S  G  I GN   
Sbjct: 370 QQPSDGSTVQVPEISMQFDGGV-LNLGEQNILISPAEGVICLAMGSSSQLGISIFGNIQQ 429

Query: 401 ANFLVGYDIDEMTVSFKKADC 404
               V YD+  + VSF    C
Sbjct: 430 QETQVLYDLQNLAVSFVPTQC 435

BLAST of Moc10g02970 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 178.7 bits (452), Expect = 1.3e-43
Identity = 129/363 (35.54%), Postives = 176/363 (48.48%), Query Frame = 0

Query: 55  STITPDNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSESF 114
           S ++  + ++  +L VGTP R V+ +LDTGSD+ W QC PC  CY Q++PIFDP +S+++
Sbjct: 133 SGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTY 192

Query: 115 RELPCPAALCHLQGSGAVCSGGGACGYSYGYG-GGLTEGNLATETVAVSSRLGERVWPFQ 174
             +PC +  C    S    +    C Y   YG G  T G+ +TET+        RV   +
Sbjct: 193 ATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRR---NRV---K 252

Query: 175 NVVFGCGHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRISSS 234
            V  GCGH+N G F     GL+G G+G +SF  Q G     +KFS+CL+  +   +  SS
Sbjct: 253 GVALGCGHDNEGLF-VGAAGLLGLGKGKLSFPGQTGHRF-NQKFSYCLVDRSASSK-PSS 312

Query: 235 LQLGSGSEVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPYSSSG-----PAAQGNV 294
           +  G+ +  R      +   P  D TFY + L GISVG T +P  ++          G V
Sbjct: 313 VVFGNAAVSRIARFTPLLSNPKLD-TFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGV 372

Query: 295 ILDSGTPPTLL--PEDFYSRFAAEVRRRIRWRPVGAGL---CY--RNVRRFAAPPVTLHF 354
           I+DSGT  T L  P     R A  V  +   R     L   C+   N+     P V LHF
Sbjct: 373 IIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHF 432

Query: 355 DGG-VELPLSTVQTFIRNRDGSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKK 404
            G  V LP +T      + +G FCFA AG  G   IIGN     F V YD+    V F  
Sbjct: 433 RGADVSLP-ATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAP 484

BLAST of Moc10g02970 vs. ExPASy TrEMBL
Match: A0A6J1CGG6 (aspartic proteinase CDR1-like OS=Momordica charantia OX=3673 GN=LOC111011455 PE=3 SV=1)

HSP 1 Score: 833.2 bits (2151), Expect = 4.7e-238
Identity = 407/407 (100.00%), Postives = 407/407 (100.00%), Query Frame = 0

Query: 1   MASSIFLIFLTLLSITHKSAGTGGLSLELIRNDSPKSAFFRRRPSSQACPETAQSTITPD 60
           MASSIFLIFLTLLSITHKSAGTGGLSLELIRNDSPKSAFFRRRPSSQACPETAQSTITPD
Sbjct: 1   MASSIFLIFLTLLSITHKSAGTGGLSLELIRNDSPKSAFFRRRPSSQACPETAQSTITPD 60

Query: 61  NSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSESFRELPCP 120
           NSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSESFRELPCP
Sbjct: 61  NSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSESFRELPCP 120

Query: 121 AALCHLQGSGAVCSGGGACGYSYGYGGGLTEGNLATETVAVSSRLGERVWPFQNVVFGCG 180
           AALCHLQGSGAVCSGGGACGYSYGYGGGLTEGNLATETVAVSSRLGERVWPFQNVVFGCG
Sbjct: 121 AALCHLQGSGAVCSGGGACGYSYGYGGGLTEGNLATETVAVSSRLGERVWPFQNVVFGCG 180

Query: 181 HNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRISSSLQLGSGS 240
           HNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRISSSLQLGSGS
Sbjct: 181 HNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRISSSLQLGSGS 240

Query: 241 EVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPYSSSGPAAQGNVILDSGTPPTLLP 300
           EVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPYSSSGPAAQGNVILDSGTPPTLLP
Sbjct: 241 EVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPYSSSGPAAQGNVILDSGTPPTLLP 300

Query: 301 EDFYSRFAAEVRRRIRWRPVGAGLCYRNVRRFAAPPVTLHFDGGVELPLSTVQTFIRNRD 360
           EDFYSRFAAEVRRRIRWRPVGAGLCYRNVRRFAAPPVTLHFDGGVELPLSTVQTFIRNRD
Sbjct: 301 EDFYSRFAAEVRRRIRWRPVGAGLCYRNVRRFAAPPVTLHFDGGVELPLSTVQTFIRNRD 360

Query: 361 GSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKKADCTKIG 408
           GSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKKADCTKIG
Sbjct: 361 GSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKKADCTKIG 407

BLAST of Moc10g02970 vs. ExPASy TrEMBL
Match: A0A6J1EVM9 (aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111436390 PE=3 SV=1)

HSP 1 Score: 508.1 bits (1307), Expect = 3.5e-140
Identity = 259/409 (63.33%), Postives = 304/409 (74.33%), Query Frame = 0

Query: 1   MASSIFLIFLTLLSITHKSAGT-GGLSLELIRNDSPKSAFFRRRPSSQACPETAQSTITP 60
           MA +IF++ L LLS    +AG  GGL LELI+         RR P     P  A+S I P
Sbjct: 1   MAPTIFIV-LALLSTAESTAGKGGGLKLELIQ---------RRLPPGNVSPMAAKSQIWP 60

Query: 61  DNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSESFRELPC 120
           + S+F+VK+AVGTPP +V AILDTGSDLFW QC PCA CY QTNPI+DPS+S +FR L C
Sbjct: 61  ETSEFIVKIAVGTPPTEVHAILDTGSDLFWAQCRPCAKCYRQTNPIYDPSKSLTFRTLSC 120

Query: 121 PAALCHLQGSGAVCSGGGACGYSYGYGGGLTEGNLATETVAVSSRLGERVWPFQNVVFGC 180
            +  CHL+GSGA CSG   C Y YGYG G T+G LATE +AV+SR G +   F  VVFGC
Sbjct: 121 KSPQCHLRGSGAACSGTDTCKYGYGYGSGSTQGELATEKMAVTSRSGAKT-SFSGVVFGC 180

Query: 181 GHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRISSSLQLGSG 240
           GHNNSG FN NEMGLIGFGRG +SF+SQIGPS+GGRKFS CLMP+NTDPRISSSL +GSG
Sbjct: 181 GHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSG 240

Query: 241 SEVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPYSSSGPAAQGNVILDSGTPPTLL 300
           SEV+GPGVIT QLV TPD T Y+LTLTGISVGKT +PYS+SGP A+GN +LD+GTPPTLL
Sbjct: 241 SEVKGPGVITTQLVRTPDQTSYSLTLTGISVGKTLVPYSTSGPPAKGNAVLDTGTPPTLL 300

Query: 301 PEDFYSRFAAEVRRRIRWRPVGAG-LCYRNVRRFAAPPVTLHFDGGVELPLSTVQTFIRN 360
           P++ Y R AAEVRR I  +P+    LCY++        +TLHFDGGV+L LSTVQTF + 
Sbjct: 301 PKELYGRLAAEVRRHIPSKPIDDDTLCYKD--NLGDLVMTLHFDGGVDLRLSTVQTFNKM 360

Query: 361 RDGSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKKADCTKIG 408
            DGSFCF   G+     +IGN ++ANFLVGYDID MTVSFK  DCTKIG
Sbjct: 361 SDGSFCFTAMGVDDKDALIGNSIMANFLVGYDIDNMTVSFKPTDCTKIG 396

BLAST of Moc10g02970 vs. ExPASy TrEMBL
Match: A0A6J1JIJ5 (aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111484909 PE=3 SV=1)

HSP 1 Score: 506.5 bits (1303), Expect = 1.0e-139
Identity = 260/409 (63.57%), Postives = 303/409 (74.08%), Query Frame = 0

Query: 1   MASSIFLIFLTLLSITHKSAGT-GGLSLELIRNDSPKSAFFRRRPSSQACPETAQSTITP 60
           MA +IFL+ LTLLSI   +AG  GGL LELIR         RR       P  A+S I P
Sbjct: 1   MAPTIFLL-LTLLSIAESTAGKGGGLKLELIR---------RRLSPGNVSPMAAKSQIWP 60

Query: 61  DNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSESFRELPC 120
           + S+F+VK+A+GTPP +V AILDTGSDLFW QC PCA CY QTNPI+DPS+S +FR L C
Sbjct: 61  ETSEFIVKIAIGTPPTEVHAILDTGSDLFWAQCRPCAKCYQQTNPIYDPSKSSTFRTLSC 120

Query: 121 PAALCHLQGSGAVCSGGGACGYSYGYGGGLTEGNLATETVAVSSRLGERVWPFQNVVFGC 180
            +  CHL+GSGA CSG   C YSYGYG G T+G LA+E +AV+SR G    PF  VVFGC
Sbjct: 121 KSPQCHLRGSGAACSGTDTCKYSYGYGSGSTQGELASEKMAVTSRSGATT-PFPGVVFGC 180

Query: 181 GHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRISSSLQLGSG 240
           GHNNSG FN NEMGLIGFGRG +SF+SQIGPS+GGRKFS CLMP+NTDPRISSSL +GSG
Sbjct: 181 GHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSG 240

Query: 241 SEVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPYSSSGPAAQGNVILDSGTPPTLL 300
           SEV+GPGVIT QLV T D T Y+LTLTGISV KT +PYS+SGP A+GN +LD+GTPPTLL
Sbjct: 241 SEVKGPGVITAQLVRTSDQTSYSLTLTGISVRKTLVPYSTSGPPAKGNAVLDTGTPPTLL 300

Query: 301 PEDFYSRFAAEVRRRIRWRPVGAG-LCYRNVRRFAAPPVTLHFDGGVELPLSTVQTFIRN 360
           P++ Y R AAEVRR I  +P+    LCY++        +TLHFDGGV+L LSTVQTF + 
Sbjct: 301 PKELYGRLAAEVRRHIPSKPIDDDTLCYKD--NLGDLVMTLHFDGGVDLRLSTVQTFNKM 360

Query: 361 RDGSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKKADCTKIG 408
            DGSFCF   G+     +IGN M+ANFLVGYDID MTVSFK  DCTK G
Sbjct: 361 PDGSFCFTAMGVDDKDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKAG 396

BLAST of Moc10g02970 vs. ExPASy TrEMBL
Match: A0A6J1EQ75 (aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111436389 PE=3 SV=1)

HSP 1 Score: 503.4 bits (1295), Expect = 8.6e-139
Identity = 258/409 (63.08%), Postives = 302/409 (73.84%), Query Frame = 0

Query: 1   MASSIFLIFLTLLSITHKSAGT-GGLSLELIRNDSPKSAFFRRRPSSQACPETAQSTITP 60
           MA +IF++ L LLSI   +AG  GGL LELI+         RR       P  A+S I P
Sbjct: 1   MAPTIFIV-LALLSIAESTAGKGGGLKLELIQ---------RRLSPGNVSPMAAKSQIWP 60

Query: 61  DNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSESFRELPC 120
           + S+F+VK+AVGTPP +V +ILDTGSDLFW QC PCA CY QTNPI+DPS+S +FR L C
Sbjct: 61  ETSEFIVKIAVGTPPTEVHSILDTGSDLFWAQCRPCAKCYRQTNPIYDPSKSSTFRTLSC 120

Query: 121 PAALCHLQGSGAVCSGGGACGYSYGYGGGLTEGNLATETVAVSSRLGERVWPFQNVVFGC 180
            +  CHL+GSGA CSG   C Y YGYG G T+G LATE +AV+SR G    PF  VVFGC
Sbjct: 121 KSPQCHLRGSGAACSGTNTCKYGYGYGSGSTQGELATEKMAVTSRSGATT-PFSGVVFGC 180

Query: 181 GHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRISSSLQLGSG 240
           GHNNSG FN NEMGLIG GRG +SF+SQIGPS+GG+KFS CLMP+NTDPRISSSL +GSG
Sbjct: 181 GHNNSGTFNANEMGLIGLGRGAISFVSQIGPSVGGKKFSLCLMPYNTDPRISSSLSIGSG 240

Query: 241 SEVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPYSSSGPAAQGNVILDSGTPPTLL 300
           SEV+G GVIT QLV TPD T Y+LTLTGISVGKT +PYS+SGP A+GN +LD+GTPPTLL
Sbjct: 241 SEVKGLGVITAQLVRTPDQTSYSLTLTGISVGKTLVPYSTSGPPAKGNAVLDTGTPPTLL 300

Query: 301 PEDFYSRFAAEVRRRIRWRPVGAG-LCYRNVRRFAAPPVTLHFDGGVELPLSTVQTFIRN 360
           P++ Y R AAEVRR I  +PV    LCY++        +TLHFDGGV+L LSTVQTF + 
Sbjct: 301 PKELYGRLAAEVRRHIPSKPVDDDTLCYKD--HLGDLVMTLHFDGGVDLRLSTVQTFNKM 360

Query: 361 RDGSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKKADCTKIG 408
            DGSFCF   G+     +IGN M+ANFLVGYDID MTVSFK  DCTKIG
Sbjct: 361 SDGSFCFTAMGVDDKDALIGNNMMANFLVGYDIDNMTVSFKPTDCTKIG 396

BLAST of Moc10g02970 vs. ExPASy TrEMBL
Match: A0A6J1HV99 (aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111466512 PE=3 SV=1)

HSP 1 Score: 502.7 bits (1293), Expect = 1.5e-138
Identity = 243/355 (68.45%), Postives = 281/355 (79.15%), Query Frame = 0

Query: 53  AQSTITPDNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSE 112
           AQS I P+ SQF++K+AVGTPP +V AILDTGSDLFWTQCLPCANCY QTNPI+DPS+S 
Sbjct: 3   AQSQIWPEASQFIIKIAVGTPPTEVHAILDTGSDLFWTQCLPCANCYRQTNPIYDPSKSS 62

Query: 113 SFRELPCPAALCHLQGSGAVCSGGGACGYSYGYGGGLTEGNLATETVAVSSRLGERVWPF 172
           +F+ L C    CHL GSGA CSG   C Y+YGYG GLT+G LATE +AV+SR G  +  F
Sbjct: 63  TFQTLSCELPQCHLTGSGAACSGTDTCKYNYGYGSGLTQGELATEQMAVTSRSG-AMKLF 122

Query: 173 QNVVFGCGHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRISS 232
           Q VVFGCGHNNSG FN NEMGLIG GRG +SF+SQIGPSIGGRKFS CLMP NTDP ISS
Sbjct: 123 QGVVFGCGHNNSGTFNPNEMGLIGLGRGAISFVSQIGPSIGGRKFSLCLMPTNTDPTISS 182

Query: 233 SLQLGSGSEVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPYSSSGPAAQGNVILDS 292
           S+ +GSGSEV+GPGVIT QLV   DPT+Y+LTLTGISVG T +PYS+SGP AQGN +LD+
Sbjct: 183 SISIGSGSEVKGPGVITAQLVQISDPTYYSLTLTGISVGNTLIPYSTSGPPAQGNTVLDT 242

Query: 293 GTPPTLLPEDFYSRFAAEVRRRIRWRPVGAGLCYRNVRRFAAPPVTLHFDGGVELPLSTV 352
           GTPPTLLP++ Y R AAEVRR+I   P+G  LCY++        +TLHFDGGV+LPLST+
Sbjct: 243 GTPPTLLPKELYERLAAEVRRQIPSEPIGDTLCYKD--NLGDLVMTLHFDGGVDLPLSTI 302

Query: 353 QTFIRNRDGSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKKADCTKIG 408
           QTF +  DGSFCFA  G+     +IGN M+ANFLVGYDID MTVSFK  DCTKIG
Sbjct: 303 QTFNQMPDGSFCFAAMGVDDDSALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG 354

BLAST of Moc10g02970 vs. TAIR 10
Match: AT1G64830.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 281.6 bits (719), Expect = 1.0e-75
Identity = 161/435 (37.01%), Postives = 240/435 (55.17%), Query Frame = 0

Query: 1   MASSIFLIFLTLLSITHKSA-GTGGLSLELIRNDSPKSAFFRRRPSS------------- 60
           MAS IF   L+LL +++ +A    G +++LI  DSPKS F+    +S             
Sbjct: 1   MASLIFATLLSLLLLSNVNAYPKDGFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRSAR 60

Query: 61  --------QACPETAQSTITPDNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANC 120
                    A P + QS IT +  ++L+ +++GTPP  + AI DTGSDL WTQC PC +C
Sbjct: 61  STLQFSNDDASPNSPQSFITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDC 120

Query: 121 YPQTNPIFDPSRSESFRELPCPAALCHLQGSGAVCSGGGACGYSYGYG-GGLTEGNLATE 180
           Y QT+P+FDP  S ++R++ C ++ C      +  +    C Y+  YG    T+G++A +
Sbjct: 121 YQQTSPLFDPKESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVD 180

Query: 181 TVAVSSRLGERVWPFQNVVFGCGHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKF 240
           TV + S  G R    +N++ GCGH N+G F+    G+IG G G  S +SQ+  SI G KF
Sbjct: 181 TVTMGSS-GRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSING-KF 240

Query: 241 SHCLMPFNTDPRISSSLQLGSGSEVRGPGVITIQLVPTPDPTFYALTLTGISVGKTFLPY 300
           S+CL+PF ++  ++S +  G+   V G GV++  +V     T+Y L L  ISVG   + +
Sbjct: 241 SYCLVPFTSETGLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQF 300

Query: 301 SSS-GPAAQGNVILDSGTPPTLLPEDFYSRFAAEVRRRIRWRPVG-----AGLCYRNVRR 360
           +S+     +GN+++DSGT  TLLP +FY    + V   I+   V        LCYR+   
Sbjct: 301 TSTIFGTGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSSS 360

Query: 361 FAAPPVTLHFDGGVELPLSTVQTFIRNRDGSFCFAVAGISGTGGIIGNFMLANFLVGYDI 407
           F  P +T+HF GG ++ L  + TF+   +   CFA A  +    I GN    NFLVGYD 
Sbjct: 361 FKVPDITVHFKGG-DVKLGNLNTFVAVSEDVSCFAFAA-NEQLTIFGNLAQMNFLVGYDT 420

BLAST of Moc10g02970 vs. TAIR 10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 268.5 bits (685), Expect = 9.0e-72
Identity = 146/362 (40.33%), Postives = 210/362 (58.01%), Query Frame = 0

Query: 54  QSTITPDNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSES 113
           Q  +T ++ ++L+ +++GTPP  + AI DTGSDL WTQC PC +CY Q +P+FDP  S +
Sbjct: 80  QIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSST 139

Query: 114 FRELPCPAALCHLQGSGAVCS-GGGACGYSYGYG-GGLTEGNLATETVAVSSRLGERVWP 173
           ++++ C ++ C    + A CS     C YS  YG    T+GN+A +T+ + S    R   
Sbjct: 140 YKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSS-DTRPMQ 199

Query: 174 FQNVVFGCGHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRIS 233
            +N++ GCGHNN+G FN+   G++G G GPVS I Q+G SI G KFS+CL+P  +    +
Sbjct: 200 LKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDG-KFSYCLVPLTSKKDQT 259

Query: 234 SSLQLGSGSEVRGPGVITIQLV-PTPDPTFYALTLTGISVGKTFLPYS-SSGPAAQGNVI 293
           S +  G+ + V G GV++  L+      TFY LTL  ISVG   + YS S   +++GN+I
Sbjct: 260 SKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNII 319

Query: 294 LDSGTPPTLLPEDFYSRF----AAEVRRRIRWRP-VGAGLCYRNVRRFAAPPVTLHFDGG 353
           +DSGT  TLLP +FYS      A+ +    +  P  G  LCY        P +T+HFD G
Sbjct: 320 IDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHFD-G 379

Query: 354 VELPLSTVQTFIRNRDGSFCFAVAGISGTGGIIGNFMLANFLVGYDIDEMTVSFKKADCT 407
            ++ L +   F++  +   CFA  G S +  I GN    NFLVGYD    TVSFK  DC 
Sbjct: 380 ADVKLDSSNAFVQVSEDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 437

BLAST of Moc10g02970 vs. TAIR 10
Match: AT2G35615.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 241.9 bits (616), Expect = 9.0e-64
Identity = 160/448 (35.71%), Postives = 228/448 (50.89%), Query Frame = 0

Query: 1   MASSIFLIFLTLLSITHKSAG-TGGLSLELIRNDSPKS---------------AFFR--- 60
           MA+ I L F    S+T  S+G     S+ELI  DSP S               AF R   
Sbjct: 1   MATQILLCFFLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVS 60

Query: 61  --RRPSSQACPETAQSTITPDNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCY 120
             RR + Q      QS +   + +F + + +GTPP  VFAI DTGSDL W QC PC  CY
Sbjct: 61  RSRRFNHQLSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCY 120

Query: 121 PQTNPIFDPSRSESFRELPCPAALCH-LQGSGAVC-SGGGACGYSYGYGG-GLTEGNLAT 180
            +  PIFD  +S +++  PC +  C  L  +   C      C Y Y YG    ++G++AT
Sbjct: 121 KENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVAT 180

Query: 181 ETVAVSSRLGERVWPFQNVVFGCGHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRK 240
           ETV++ S  G  V  F   VFGCG+NN G F++   G+IG G G +S ISQ+G SI  +K
Sbjct: 181 ETVSIDSASGSPV-SFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSI-SKK 240

Query: 241 FSHCLMPFNTDPRISSSLQLGS----GSEVRGPGVITIQLVPTPDPTFYALTLTGISVGK 300
           FS+CL   +     +S + LG+     S  +  GV++  LV     T+Y LTL  ISVGK
Sbjct: 241 FSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGK 300

Query: 301 TFLPYSSSG---------PAAQGNVILDSGTPPTLLPEDFYSRFAAEVRRRIRWR----- 360
             +PY+ S              GN+I+DSGT  TLL   F+ +F++ V   +        
Sbjct: 301 KKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSD 360

Query: 361 PVG-AGLCYRN-VRRFAAPPVTLHFDGGVELPLSTVQTFIRNRDGSFCFAVAGISGTGGI 405
           P G    C+++       P +T+HF  G ++ LS +  F++  +   C ++   +    I
Sbjct: 361 PQGLLSHCFKSGSAEIGLPEITVHFT-GADVRLSPINAFVKLSEDMVCLSMVPTTEV-AI 420

BLAST of Moc10g02970 vs. TAIR 10
Match: AT1G31450.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 238.4 bits (607), Expect = 9.9e-63
Identity = 155/450 (34.44%), Postives = 229/450 (50.89%), Query Frame = 0

Query: 1   MASSIFLIFLTLLSI-----THKSAGTGGLSLELIRNDSPKS---------------AFF 60
           MA+  FL + +LL+I     ++ SA    L++ELI  DSP S               AF 
Sbjct: 1   MATKTFL-YCSLLAISFFFASNSSANRENLTVELIHRDSPHSPLYNPHHTVSDRLNAAFL 60

Query: 61  RRRPSSQ--ACPETAQSTITPDNSQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANC 120
           R    S+        QS +  +  ++ + +++GTPP  VFAI DTGSDL W QC PC  C
Sbjct: 61  RSISRSRRFTTKTDLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQC 120

Query: 121 YPQTNPIFDPSRSESFRELPCPAALC-----HLQGSGAVCSGGGACGYSYGYG-GGLTEG 180
           Y Q +P+FD  +S +++   C +  C     H +G          C Y Y YG    T+G
Sbjct: 121 YKQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCD---ESKDICKYRYSYGDNSFTKG 180

Query: 181 NLATETVAVSSRLGERVWPFQNVVFGCGHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSI 240
           ++ATET+++ S  G  V  F   VFGCG+NN G F +   G+IG G GP+S +SQ+G SI
Sbjct: 181 DVATETISIDSSSGSSV-SFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSI 240

Query: 241 GGRKFSHCLMPFNTDPRISSSLQLGSGSEVRGP----GVITIQLVPTPDPTFYALTLTGI 300
            G+KFS+CL         +S + LG+ S    P      +T  L+     T+Y LTL  +
Sbjct: 241 -GKKFSYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAV 300

Query: 301 SVGKTFLPYSSSG-------PAAQGNVILDSGTPPTLLPEDFYSRFAAEVRRRIRWR--- 360
           +VGKT LPY+  G           GN+I+DSGT  TLL   FY  F   V   +      
Sbjct: 301 TVGKTKLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRV 360

Query: 361 --PVG-AGLCYRN-VRRFAAPPVTLHFDGGVELPLSTVQTFIRNRDGSFCFAVAGISGTG 405
             P G    C+++  +    P +T+HF    ++ LS +  F++  + + C ++   +   
Sbjct: 361 SDPQGLLTHCFKSGDKEIGLPAITMHFT-NADVKLSPINAFVKLNEDTVCLSMIPTTEV- 420

BLAST of Moc10g02970 vs. TAIR 10
Match: AT2G28040.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 213.8 bits (543), Expect = 2.6e-55
Identity = 146/415 (35.18%), Postives = 213/415 (51.33%), Query Frame = 0

Query: 4   SIFLIFLTLLSITHKSAGTGGLSLELI--RNDSPKSAFFRRRPSSQACPETAQSTITPDN 63
           +IFL  +T   IT  ++   G +++LI  R+++  S  F  +  S        +    D 
Sbjct: 9   AIFLQIITYFLITTTASSPQGFTIDLIHRRSNASSSRVFNTQLGS------PYADTVFDT 68

Query: 64  SQFLVKLAVGTPPRDVFAILDTGSDLFWTQCLPCANCYPQTNPIFDPSRSESFRELPCPA 123
            ++L+KL +GTPP ++ A+LDTGS+  WTQCLPC +CY QT PIFDPS+S +F+E+ C  
Sbjct: 69  YEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCD- 128

Query: 124 ALCHLQGSGAVCSGGGACGYSYGYGG-GLTEGNLATETVAVSSRLGERVWPF--QNVVFG 183
                       +   +C Y   YGG   T+G L TETV + S  G+   PF     + G
Sbjct: 129 ------------THDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQ---PFVMPETIIG 188

Query: 184 CGHNNSGGFNQNEMGLIGFGRGPVSFISQIGPSIGGRKFSHCLMPFNTDPRISSSLQLGS 243
           CG NNS GF     G++G  RGP S I+Q+G    G      LM +    + +S +  G+
Sbjct: 189 CGRNNS-GFKPGFAGVVGLDRGPKSLITQMGGEYPG------LMSYCFAGKGTSKINFGA 248

Query: 244 GSEVRGPGVI-TIQLVPTPDPTFYALTLTGISVGKTFL-PYSSSGPAAQGNVILDSGTPP 303
            + V G GV+ T   V T  P FY L L  +SVG T +    +   A +GN+++DSG+  
Sbjct: 249 NAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGSTL 308

Query: 304 TLLPEDFYS---RFAAEVRRRIRWRPVGAGLCYRNVRRFAAPPVTLHFDGGVELPLSTVQ 363
           T  PE + +   +   +V   +R+ P    LCY +      P +T+HF GG +L L    
Sbjct: 309 TYFPESYCNLVRKAVEQVVTAVRF-PRSDILCYYSKTIDIFPVITMHFSGGADLVLDKYN 368

Query: 364 TFI-RNRDGSFCFAVAGISG-TGGIIGNFMLANFLVGYDIDEMTVSFKKADCTKI 407
            ++  N  G FC A+   S     I GN    NFLVGYD   + VSFK  +C+ +
Sbjct: 369 MYVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCSAL 393

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022140900.19.8e-238100.00aspartic proteinase CDR1-like [Momordica charantia][more]
XP_023538771.15.9e-14264.46aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo][more]
KAG6601733.17.2e-14063.57Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia][more]
XP_022929935.17.2e-14063.33aspartic proteinase CDR1-like [Cucurbita moschata][more]
KAG6573507.11.6e-13963.14Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia][more]
Match NameE-valueIdentityDescription
Q6XBF81.3e-7040.33Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1[more]
Q3EBM51.3e-6235.71Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561... [more]
Q766C31.7e-5437.13Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q766C21.2e-4735.43Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q9LNJ31.3e-4335.54Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
Match NameE-valueIdentityDescription
A0A6J1CGG64.7e-238100.00aspartic proteinase CDR1-like OS=Momordica charantia OX=3673 GN=LOC111011455 PE=... [more]
A0A6J1EVM93.5e-14063.33aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111436390 PE=3... [more]
A0A6J1JIJ51.0e-13963.57aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111484909 PE=3 S... [more]
A0A6J1EQ758.6e-13963.08aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111436389 PE=3... [more]
A0A6J1HV991.5e-13868.45aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111466512 PE=3 S... [more]
Match NameE-valueIdentityDescription
AT1G64830.11.0e-7537.01Eukaryotic aspartyl protease family protein [more]
AT5G33340.19.0e-7240.33Eukaryotic aspartyl protease family protein [more]
AT2G35615.19.0e-6435.71Eukaryotic aspartyl protease family protein [more]
AT1G31450.19.9e-6334.44Eukaryotic aspartyl protease family protein [more]
AT2G28040.12.6e-5535.18Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 375..390
score: 24.88
coord: 288..299
score: 38.59
coord: 70..90
score: 41.25
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 245..406
e-value: 1.1E-35
score: 124.9
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 41..237
e-value: 5.0E-48
score: 165.6
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 56..403
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 261..399
e-value: 5.4E-23
score: 81.6
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 64..238
e-value: 2.5E-48
score: 164.6
NoneNo IPR availablePANTHERPTHR47967OS07G0603500 PROTEIN-RELATEDcoord: 8..405
NoneNo IPR availablePANTHERPTHR47967:SF39ASPARTYL PROTEASE FAMILY PROTEIN, PUTATIVE-RELATEDcoord: 8..405
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 288..299
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 64..399
score: 34.528416
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 63..403
e-value: 4.74724E-69
score: 217.131

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc10g02970.1Moc10g02970.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0004190 aspartic-type endopeptidase activity