Sgr021321 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr021321
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: aspartic proteinase-like protein 2 isoform X1
Location: tig00153654: 938821 .. 944859 (+)
RNA-Seq Expression: Sgr021321
Synteny: Sgr021321
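
The Location field gives the start and end of the gene on contig tig00153654. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the genomic span can be computed as follows; `feature_length` is a hypothetical helper name, not part of any database API.

```python
import re


def feature_length(location: str) -> int:
    """Return the span covered by a 'start .. end' location string.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record).
    """
    match = re.search(r"(\d+)\s*\.\.\s*(\d+)", location)
    if match is None:
        raise ValueError(f"no coordinates found in {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


print(feature_length("tig00153654: 938821 .. 944859 (+)"))  # 6039
```

Under that assumption the gene spans 6039 bp, introns included.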
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACTAAGGCTGAAGCTATGCTCAGGTTGACAACTACATTTTTTTTTATTAAATTCTTAATTTTCAAAATTTATTTTTTTAAATATCATATCACAAATTCAAGGAAAAATTCCAACTATGTCCTCTTAAAAATACCCCATGTTGCTCAAATTCTTGATTTGAACTTGGAAAAATTGAAAAAATGATAATAATAATTCCATCGTTCTCAACGAAGTTTGACTTTCCCTCTCTTTCTTTAAAATGAATCATTCGCCAAACTGCATTTTTTATGTTTGAAGCAATCAGCTGGCACTTCGATTTCGACTCCGTTCAACCCATACTCCGTCTTCTTCCCTCTTCGATCCCCTTCACTCCGTTGACCTCCGCCGTTCGCAGTCCCTTCCGTTCTCGCCGGAGCCAGCGGACTCCATGGCACTAACATCCAAACTACTCGCTGCAATTCTCCTCCTCCATCTCATGCACCTCATGCATTTCACTCTCTCCGCCGATGATCCCATCACCTCCAATCTTCTCCTTTCCCATTCTCATCGAGCCATGGTGTTGCCCCTTTATCTGTCTTCTCCTAATTCCTCTAGATTGATCTCCAAGCCTCGTCGCCATCTGCGGGAATCCAATTCGAATAATCGCTCCAACGCTCGCATGAGGCTATACGACGACCTCCTTCTCAATGGGTATGTTCCCCTGCTTAAAATTTTGATGATTAAAATTTTCTCTGATAAAATGTGTTCTGTGAAGATACTATACAACGCGGCTTTGGATCGGAACTCCACCGCAGCAATTCGCGCTTATAGTTGATACGGGGAGTACGGTTACCTATGTTCCATGCTCAACTTGCGAACAGTGTGGGAGGCACCAGGTTAAAATGTTTAAAAATTGTTTATTGCCAGAGCATTTATATGGTCTTCATATTCATATTAAATCATCTGTTTTTATTTTCCTGTAATTCAATTATCTCGATAGGAATAGGACATTTCAGTATATCCCTTCTGTGGAGCTTGGTGATCTAATACAACCTTGTATGGAAATAGATCGTATTGATCCTAGTATGATAAGGGTAATTAATAACTTCCCTTTGATGGACAAATTGTTTTTCTCCGAACAAGTGATCTTGGTATCTAACTCTGGAGAGTCTTGAATAATGATACATTTCCCTCCTATTTAGAGAATTATGAATAACTAAAAAGCACTATGAATAATAAATGAGACACAATCATATCTTCGGTATTCTATATGGTTCTTGAAGCTTGTCTATTTGTTAGTTTTATGTATAAATTTGTCCAGTCATTTATTACATTTATATTAAAATATTTCTAGTTTTGTCTTCATAGGTACCTTGCAATGTGCTGTAGTTGGTACTTAAACTTTTATTTCTATCAGGACCCAAAGTTTGATCCAGAATTGTCAAGCACTTACCAACCTGTCAAATGCAATATTGATTGCACTTGTGACAGTAATGGAGCGCAGTGTATCTACGAAAGGCAGTATGCTGAAATGAGTACTAGCAGTGGTGTCCTTGGTGAGGATATTATATCCTTTGGCAATCAGAGTGAACTTGTACCCCAGCGTGCTGTGTTTGGTTGTGAGAATGTGGAAACTGGTGATCTTTACAGTCAACATGCTGATGGAATTATGGGTCTGGGCAGTGGTGATCTCAGTATCGTCGACCAACTTGTTGAAAAAAGTGTGATTAATGATTCTTTCTCATTATGCTATGGTGGTATGGATATTGGTGGTGGTGCTATGGTTCTTGGTGGAATCTCGCCTCCATCAGATATGATATTTAGCCACTCAGACCCTGTGAGAAGGTGTGTACAAAAGTCGTTTAGCCTTACAAATTGTACTTCATGCTCATTCATAACAATTACAATTGATACATCTCACACATTTACTTTAGTTTCGGGATGGCAACAGTCCATATTACAATGTTGATTTGAAGGAGATACATGTTGCGGGTAAAAAATTGCTTCTGAGTCCAAGCGTTTTTGATGGAAAATATGGA
ACTGTCTTAGATAGTGGTACAACTTATGCTTACCTTCCAGAGGCAGCGTTTGGAGCTTTCAAGGATGCTGTAAGCTTCTATTCTTCTGAATTTAACATTCACTGCCCGGAGTAGCTTTACTTCTGTCTGAATTTGACGATTGTTGCATGCTGTCCCAAGGACATTGTAAGATGATCCAATACTTACTTTTCCAGGATTATCTTTTTGCAACTTCTGTCTCAATCTAATAATTGTTTGCATTGCATTTTTCTTGACTTGTTCTTGCTGTTTTACATTTTAATGATTTGATATGGCTCAATTTACATTATTTCCTTCACAACCTTGATAAATGGGATCATTTGTTATATTTGCAGATGATTTCAATTCATTGTGATACAGTGAAATGACAAGTTTTTTTTTTTTTTCTTTTTTAATTTTTTATCGGATGTCTCTTAAAAACTTTATCTAGACAATGAAATGGAGAAAGGAACAGAATGAAATGAAGAAACTTTTGGTTCCTTATCATAAAAAGGAAGGATTGATCCCTATAGTCAAACCATGACTAGGCATATAACCTGTGTCTCTTTTCTCTTGCTCTCTCTAATGTTGTTGCCTTGTATGTTTTCTTTACCATTGATTGTGCAAATAGTCTCTAGATTAAGTGCAATAGATTTTTCTTCTTTTTGCAAGAGTCTTGATTTTAAAGTTACTCATGCAATCTTGAGTTGTCAGCTGTTTGGACTACTGTTATTGACATTTGTTTTAGAATGCATAATGAATTTAAAAAGAAAAATTCCTTCAGATTATGGATGAGCTTCATTCTTTGGAGAAGATTGGTGGTCCTGACCCAAATTTTAACGATATATGTTTTTCTGGTGCTGGAAGGTACGGTTAGCTAATAAGTTTTGTCAGTTTTAAAAGTGATTTTCATCAATGAGTAATTAGTCCGTTACTTTATGTTAGTGATGTTGCTGAATTATCAAAGACATTCCCAGCAGTTGACATGGTATTTGATAATGGTCAAAAGTTGTCTCTAGCACCGGAAAATTACTTGTTTCGGGTAAGGAATTACTCTTGATTATTTATTATTTTTACCATATTTTGCTGTTCGTTTGTGTGTGTGTTTGTGTCTGTTTACATGGGATTATTTGTGGGGTAGGGGTAAGTTGAGCCTCCTTAACAAAAATTAGAGCCTCTGATTGATTTTTTATTCAATTGATATGGTGGTTATAAAATTTTATGTACTTGCTAAGCATAGGACTAAGTTAACTGATTCTTCTGTACTGTGAATTCTGAATCATTACTCCAGCATTCAAAGGTACATGGTGCATATTGTCTGGGAATTTTTGAGAATGGAAATGACCAAACTACTCTTCTGGGAGGTATGGATTTTTCATGGTCGAGATGATTTTTCCCCCGTGAATGCTTATTCTATGGGAAATTTGGCTTTTTTGTCTGTTCATTTTATGAAATTGCCTAATTTTTAATTTGATCACATATTCATTTTAGCATGACTATCGTTTGTGACAGGAATTGTTGTCCGCAACACTCTAGTGATGTATGACAGAGAGCATTCAAAAATTGGATTTTGGAAAACGAACTGTTCCGAGTTATGGGAAAGGCTTCACATTTCTGATGATACTGCCCCTGCTCCTTCAGTTTCAAATACATCACGTGATACTGAAATGGCACCTGCATCTGCTCCGAGCGAGTCACCACATTATATGATTCCAGGTTTAGATAAGTGCAGAGGATTTCTTTTTGCTTTTATGCTAGTATGGCTGCTTGACTGCATATCGGTTGGTTCTTTAGTAATTTTCTTTCCGTGAAACATCGATTTATTCGATTGAACTTAATTTACTTGGAAAGTTTTGAAAAAGGAGCTATTATTTCAGGCCATCCTTCATGTCATTATGACAATTGAGGTTCTGTCAGAATTTATGCCACAAGTAATGACCTTTGAACATTTTAGTCGACTTTGACATTCAGCGGAATTTGATGATTGATAGTATAAACAATTT
GGAATTTCAAGTTTCATATGATGATTATTTTAATAAATTTATGATCTGTTGCAATCTCTACCTTAAATGGCCTGTTTGTGAAGTGTTTTCTTCTGTATTTCGTGCAGGAGAGCTCCAGATTGGACGTATCATATTTGAAATCTTGTTGAACATAAGCTACACGGATCTGGAGCCTCATATTACAGAACTTTCTGATCATATTGATCGCGAGTTAAATATTAGTTATTCACAGGTTAGTGCACGTGACACATCGGGGATTGAAAAGTCTTTTGGCAGTATTTGATGATAATTTTGGTGCAATCTTTAACATTTGCATTTTTAATAGTTTAAATGAACAATGGTGGTGATAACAAATAGGTTATATTGCAAAATTAGTCCATTATTGTGACTATTTCATCCTAGGCTTTAAAAGCATTATGTCTTCCAATAGTCAGTATTGTTAACTAAAAAATGGTATTGCATACCTATTGGATAACGTCGGTTGATTTGACATGATGTGGCAGACAAGGGGTAGAAAAAACTAGACAATTAGATAAAATAAAATAGCATGTACAACATACTTTTTTTTTTTAGCAACATGCCAAATCCTTTTTATAGATTGTTTCACTCTTTTTTATCAGTTATCCACTCTGTTCTACTATTTATTTGCTAAGTTGTGCCAAATTAACTAACATTAGTTCAATGGGCATATCATTTCATCAATTAACAGTATTAACGTAGAGACTGCAAAGAATCACAATTCAAAGTTGAGAATGTAATTGAAATATTTTAAAATTTAAAGAATAAAATAGATGCAACCATCCAAATTGGCTAAATTTGTAATTTAACCTAACAAATATATATTTCATTTTTTTCTTAATCTTGGAGTTTGTGATCATATCTAACTTTTAACTGCATGTCATTTTTAACCATGATTTCAGGTCCGTTTATTGAATTTTACCATGAGAGGAAATGATTCCCTTATTCGGTTGGCCATACTCCCAACTGGATCTTCAGAATTTTTCTCACATGCGACGGCCACTGTAAGTGAATATGCTTGCAACTGTAAACTGCATGACAACAGAAAATTCACAAGCATGCTTTCTAAAATCTCGTTTCTCTTTGGCAGACGATAATTGCCCTGATCGTGGAGCATCACGTGCAGCTACCTCCTACATTTGGAAGTTATCAGGTCATTCGATGGAATGTCGAGCCTCTAATGAAAAGGTAAATATATATATATATATATATATAAAAGTTAAATTATAAATTTGGCTATTGAACTACTTTTTTTACCGTTGAATCTATTTAGCTTTGAACTTTAAAAAATTTAAATTATGCTTTAAACTTTTAATTTTGATTCTAATAAATTTCTGTTATTAATACTTTTATTTAGAGATGTAACATGCTGATTAGACTAACAGTGATGTTTTAGCAAGTTGCACGAGGGAGTAAACTAGCTAATAAAATAAGGATGGGTGGGCCAAAAAATTCAATGAGCCAAGCGGGCAGAATATATTTTTAAACTAATTTTTTTTACCTACCTGCCTTTATTTTATCTAGTTGATCTACTCTCTTATGCCACTTATTATCATGTCATGTCAAATTATTATTCATTAGTCTAATTAATGTAGTCATGTCAACATTTAAGTCATGGTGTTAACGGCATGAACTTATTAGAATAAAAAATTGAAAGCTTAGAATTGTAATCAAAACATTTGAAGGTTTAAGGATCAAATAGATACAATGATCGAAGTTTTGGGACTAAATTTGTAATTTGACGATAGATATATTTTCCCTACCTATATCCTGCATTTAAGTTGAGTGTTCATGGATAGCCAAATGTTGAATAGCTTTAATGTAGTCTAATCTCTTGTTCATGGTTTAGGTCAATTTGGAAGCAGCTTTATCCTTGGGTGGTCTTAGCTATTATTGTCACACTTATTCTTGGGTTGTCAGCATTGGGAGTGTGGATTATTTGGAGAAGGAGACAGCAGTCCTTCAATTCATATAAGCCTGTC
AATGCAGCAATTCCAGAGCAAGAACTCCAGCCCCTGTAA

mRNA sequence

ATGACTAAGGCTGAAGCTATGCTCAGCAATCAGCTGGCACTTCGATTTCGACTCCGTTCAACCCATACTCCGTCTTCTTCCCTCTTCGATCCCCTTCACTCCGTTGACCTCCGCCGTTCGCAGTCCCTTCCGTTCTCGCCGGAGCCAGCGGACTCCATGGCACTAACATCCAAACTACTCGCTGCAATTCTCCTCCTCCATCTCATGCACCTCATGCATTTCACTCTCTCCGCCGATGATCCCATCACCTCCAATCTTCTCCTTTCCCATTCTCATCGAGCCATGGTGTTGCCCCTTTATCTGTCTTCTCCTAATTCCTCTAGATTGATCTCCAAGCCTCGTCGCCATCTGCGGGAATCCAATTCGAATAATCGCTCCAACGCTCGCATGAGGCTATACGACGACCTCCTTCTCAATGGATACTATACAACGCGGCTTTGGATCGGAACTCCACCGCAGCAATTCGCGCTTATAGTTGATACGGGGAGTACGGTTACCTATGTTCCATGCTCAACTTGCGAACAGTGTGGGAGGCACCAGGACCCAAAGTTTGATCCAGAATTGTCAAGCACTTACCAACCTGTCAAATGCAATATTGATTGCACTTGTGACAGTAATGGAGCGCAGTGTATCTACGAAAGGCAGTATGCTGAAATGAGTACTAGCAGTGGTGTCCTTGGTGAGGATATTATATCCTTTGGCAATCAGAGTGAACTTGTACCCCAGCGTGCTGTGTTTGGTTGTGAGAATGTGGAAACTGGTGATCTTTACAGTCAACATGCTGATGGAATTATGGGTCTGGGCAGTGGTGATCTCAGTATCGTCGACCAACTTGTTGAAAAAAGTGTGATTAATGATTCTTTCTCATTATGCTATGGTGGTATGGATATTGGTGGTGGTGCTATGGTTCTTGGTGGAATCTCGCCTCCATCAGATATGATATTTAGCCACTCAGACCCTTTTCGGGATGGCAACAGTCCATATTACAATGTTGATTTGAAGGAGATACATGTTGCGGGTAAAAAATTGCTTCTGAGTCCAAGCGTTTTTGATGGAAAATATGGAACTGTCTTAGATAGTGGTACAACTTATGCTTACCTTCCAGAGGCAGCGTTTGGAGCTTTCAAGGATGCTATTATGGATGAGCTTCATTCTTTGGAGAAGATTGGTGGTCCTGACCCAAATTTTAACGATATATGTTTTTCTGGTGCTGGAAGTGATGTTGCTGAATTATCAAAGACATTCCCAGCAGTTGACATGGTATTTGATAATGGTCAAAAGTTGTCTCTAGCACCGGAAAATTACTTGTTTCGGCATTCAAAGGTACATGGTGCATATTGTCTGGGAATTTTTGAGAATGGAAATGACCAAACTACTCTTCTGGGAGGAATTGTTGTCCGCAACACTCTAGTGATGTATGACAGAGAGCATTCAAAAATTGGATTTTGGAAAACGAACTGTTCCGAGTTATGGGAAAGGCTTCACATTTCTGATGATACTGCCCCTGCTCCTTCAGTTTCAAATACATCACGTGATACTGAAATGGCACCTGCATCTGCTCCGAGCGAGTCACCACATTATATGATTCCAGGAGAGCTCCAGATTGGACGTATCATATTTGAAATCTTGTTGAACATAAGCTACACGGATCTGGAGCCTCATATTACAGAACTTTCTGATCATATTGATCGCGAGTTAAATATTAGTTATTCACAGGTCCGTTTATTGAATTTTACCATGAGAGGAAATGATTCCCTTATTCGGTTGGCCATACTCCCAACTGGATCTTCAGAATTTTTCTCACATGCGACGGCCACTACGATAATTGCCCTGATCGTGGAGCATCACGTGCAGCTACCTCCTACATTTGGAAGTTATCAGCTTTATCCTTGGGTGGTCTTAGCTATTATTGTCACACTTATTCTTGGGTTGTCAGCATTGGGAGTGTGGATTATTTGGAGAAGGAGACAGCAGTCCTTCAATTCATATAAGCCTGTCAA
TGCAGCAATTCCAGAGCAAGAACTCCAGCCCCTGTAA

Coding sequence (CDS)

ATGACTAAGGCTGAAGCTATGCTCAGCAATCAGCTGGCACTTCGATTTCGACTCCGTTCAACCCATACTCCGTCTTCTTCCCTCTTCGATCCCCTTCACTCCGTTGACCTCCGCCGTTCGCAGTCCCTTCCGTTCTCGCCGGAGCCAGCGGACTCCATGGCACTAACATCCAAACTACTCGCTGCAATTCTCCTCCTCCATCTCATGCACCTCATGCATTTCACTCTCTCCGCCGATGATCCCATCACCTCCAATCTTCTCCTTTCCCATTCTCATCGAGCCATGGTGTTGCCCCTTTATCTGTCTTCTCCTAATTCCTCTAGATTGATCTCCAAGCCTCGTCGCCATCTGCGGGAATCCAATTCGAATAATCGCTCCAACGCTCGCATGAGGCTATACGACGACCTCCTTCTCAATGGATACTATACAACGCGGCTTTGGATCGGAACTCCACCGCAGCAATTCGCGCTTATAGTTGATACGGGGAGTACGGTTACCTATGTTCCATGCTCAACTTGCGAACAGTGTGGGAGGCACCAGGACCCAAAGTTTGATCCAGAATTGTCAAGCACTTACCAACCTGTCAAATGCAATATTGATTGCACTTGTGACAGTAATGGAGCGCAGTGTATCTACGAAAGGCAGTATGCTGAAATGAGTACTAGCAGTGGTGTCCTTGGTGAGGATATTATATCCTTTGGCAATCAGAGTGAACTTGTACCCCAGCGTGCTGTGTTTGGTTGTGAGAATGTGGAAACTGGTGATCTTTACAGTCAACATGCTGATGGAATTATGGGTCTGGGCAGTGGTGATCTCAGTATCGTCGACCAACTTGTTGAAAAAAGTGTGATTAATGATTCTTTCTCATTATGCTATGGTGGTATGGATATTGGTGGTGGTGCTATGGTTCTTGGTGGAATCTCGCCTCCATCAGATATGATATTTAGCCACTCAGACCCTTTTCGGGATGGCAACAGTCCATATTACAATGTTGATTTGAAGGAGATACATGTTGCGGGTAAAAAATTGCTTCTGAGTCCAAGCGTTTTTGATGGAAAATATGGAACTGTCTTAGATAGTGGTACAACTTATGCTTACCTTCCAGAGGCAGCGTTTGGAGCTTTCAAGGATGCTATTATGGATGAGCTTCATTCTTTGGAGAAGATTGGTGGTCCTGACCCAAATTTTAACGATATATGTTTTTCTGGTGCTGGAAGTGATGTTGCTGAATTATCAAAGACATTCCCAGCAGTTGACATGGTATTTGATAATGGTCAAAAGTTGTCTCTAGCACCGGAAAATTACTTGTTTCGGCATTCAAAGGTACATGGTGCATATTGTCTGGGAATTTTTGAGAATGGAAATGACCAAACTACTCTTCTGGGAGGAATTGTTGTCCGCAACACTCTAGTGATGTATGACAGAGAGCATTCAAAAATTGGATTTTGGAAAACGAACTGTTCCGAGTTATGGGAAAGGCTTCACATTTCTGATGATACTGCCCCTGCTCCTTCAGTTTCAAATACATCACGTGATACTGAAATGGCACCTGCATCTGCTCCGAGCGAGTCACCACATTATATGATTCCAGGAGAGCTCCAGATTGGACGTATCATATTTGAAATCTTGTTGAACATAAGCTACACGGATCTGGAGCCTCATATTACAGAACTTTCTGATCATATTGATCGCGAGTTAAATATTAGTTATTCACAGGTCCGTTTATTGAATTTTACCATGAGAGGAAATGATTCCCTTATTCGGTTGGCCATACTCCCAACTGGATCTTCAGAATTTTTCTCACATGCGACGGCCACTACGATAATTGCCCTGATCGTGGAGCATCACGTGCAGCTACCTCCTACATTTGGAAGTTATCAGCTTTATCCTTGGGTGGTCTTAGCTATTATTGTCACACTTATTCTTGGGTTGTCAGCATTGGGAGTGTGGATTATTTGGAGAAGGAGACAGCAGTCCTTCAATTCATATAAGCCTGTCAA
TGCAGCAATTCCAGAGCAAGAACTCCAGCCCCTGTAA

Protein sequence

MTKAEAMLSNQLALRFRLRSTHTPSSSLFDPLHSVDLRRSQSLPFSPEPADSMALTSKLLAAILLLHLMHLMHFTLSADDPITSNLLLSHSHRAMVLPLYLSSPNSSRLISKPRRHLRESNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPELSSTYQPVKCNIDCTCDSNGAQCIYERQYAEMSTSSGVLGEDIISFGNQSELVPQRAVFGCENVETGDLYSQHADGIMGLGSGDLSIVDQLVEKSVINDSFSLCYGGMDIGGGAMVLGGISPPSDMIFSHSDPFRDGNSPYYNVDLKEIHVAGKKLLLSPSVFDGKYGTVLDSGTTYAYLPEAAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSDVAELSKTFPAVDMVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDREHSKIGFWKTNCSELWERLHISDDTAPAPSVSNTSRDTEMAPASAPSESPHYMIPGELQIGRIIFEILLNISYTDLEPHITELSDHIDRELNISYSQVRLLNFTMRGNDSLIRLAILPTGSSEFFSHATATTIIALIVEHHVQLPPTFGSYQLYPWVVLAIIVTLILGLSALGVWIIWRRRQQSFNSYKPVNAAIPEQELQPL
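
The protein sequence above should correspond to a translation of the coding sequence. The sketch below shows one way to check this, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `CODON_TABLE` and `translate` are illustrative; whitespace is stripped first because the sequences in this record are wrapped across lines.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 12 codons of the CDS listed above.
print(translate("ATGACTAAGGCTGAAGCTATGCTCAGCAATCAGCTG"))  # MTKAEAMLSNQL
```

Applied to the start of the CDS, this reproduces the first twelve residues of the protein sequence (MTKAEAMLSNQL). Passing the full CDS would allow the whole polypeptide to be compared.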
Homology
BLAST of Sgr021321 vs. NCBI nr
Match: XP_038902862.1 (aspartic proteinase CDR1-like isoform X1 [Benincasa hispida])

HSP 1 Score: 1064.3 bits (2751), Expect = 4.4e-307
Identity = 538/644 (83.54%), Positives = 570/644 (88.51%), Query Frame = 0

Query: 53  MALTSKLLAAILLLHLMHLMHFTLSADDPITSNLLLSHSHRAMVLPLYLSSPNSSRLISK 112
           MA +  LLAAILL       HF L A DPI+SN LL+ SHRAMVLPLYLSSPNSS+LIS 
Sbjct: 1   MAQSPYLLAAILL-------HFLLFA-DPISSNPLLTPSHRAMVLPLYLSSPNSSKLISN 60

Query: 113 PRRHLRE-SNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCS 172
           P RHLR+  +SNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCS
Sbjct: 61  PHRHLRQFPSSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCS 120

Query: 173 TCEQCGRHQDPKFDPELSSTYQPVKCNIDCTCDSNGAQCIYERQYAEMSTSSGVLGEDII 232
           TCE+CGRHQDPKF+PE SSTY+PVKCNIDCTCD++G QC+YERQYAEMSTSSGVLGED+I
Sbjct: 121 TCEECGRHQDPKFEPESSSTYEPVKCNIDCTCDNDGLQCVYERQYAEMSTSSGVLGEDVI 180

Query: 233 SFGNQSELVPQRAVFGCENVETGDLYSQHADGIMGLGSGDLSIVDQLVEKSVINDSFSLC 292
           SFGNQSEL+PQRAVFGCENVETGDL+SQ ADGIMGLG+GDLSIVDQLVEK VINDSFSLC
Sbjct: 181 SFGNQSELIPQRAVFGCENVETGDLFSQRADGIMGLGTGDLSIVDQLVEKGVINDSFSLC 240

Query: 293 YGGMDIGGGAMVLGGISPPSDMIFSHSDPFRDGNSPYYNVDLKEIHVAGKKLLLSPSVFD 352
           YGGMDIGGGAMVLGGISPPSDMIFS+SDP R   SPYYNVDLKEIHVAGK+LLL+PS+FD
Sbjct: 241 YGGMDIGGGAMVLGGISPPSDMIFSYSDPVR---SPYYNVDLKEIHVAGKRLLLTPSIFD 300

Query: 353 GKYGTVLDSGTTYAYLPEAAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSDVAEL 412
           G+YGTVLDSGTTYAYLP  AFGAFKDAIMDELHSL+KI GPDPNF DICFSGAGSD AEL
Sbjct: 301 GRYGTVLDSGTTYAYLPVEAFGAFKDAIMDELHSLKKIDGPDPNFKDICFSGAGSDAAEL 360

Query: 413 SKTFPAVDMVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLV 472
           S  FP VDMVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLV
Sbjct: 361 SNIFPTVDMVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLV 420

Query: 473 MYDREHSKIGFWKTNCSELWERLHISDDTAPAPSVSNTSRDTEMAPASAPSESPHYMIPG 532
           MYDR HSKIGFWKTNCSELWERLHISDD A APSVSNTS DT++APASAP ESPHY IPG
Sbjct: 421 MYDRAHSKIGFWKTNCSELWERLHISDDHAHAPSVSNTSHDTDIAPASAPDESPHYTIPG 480

Query: 533 ELQIGRIIFEILLNISYTDLEPHITELSDHIDRELNISYSQVRLLNFTMRGNDSLIRLAI 592
           ELQIGRI FEILLNISYTDLEPHITELSDHI  ELN+S+SQV LLNFTMRGNDSLI+LAI
Sbjct: 481 ELQIGRITFEILLNISYTDLEPHITELSDHIAHELNVSHSQVLLLNFTMRGNDSLIQLAI 540

Query: 593 LPTGSSEFFSHATATTIIALIVEHHVQLPPTFGSYQLYPW-----------------VVL 652
           LP   SEFFSHATA TII+LIVEHH+QLPPTFGSYQ+  W                 V L
Sbjct: 541 LPNEPSEFFSHATAITIISLIVEHHMQLPPTFGSYQVLQWKIEPLMERSLWKRLYIMVGL 600

Query: 653 AIIVTLILGLSALGVWIIWRRRQQSFNSYKPVNAAIPEQELQPL 679
           AIIVTLILGLSALG W I RRRQ +FNSY PVNAA+PEQELQPL
Sbjct: 601 AIIVTLILGLSALGAWFILRRRQAAFNSYMPVNAAVPEQELQPL 633

BLAST of Sgr021321 vs. NCBI nr
Match: XP_022149434.1 (aspartic proteinase-like protein 2 isoform X1 [Momordica charantia])

HSP 1 Score: 1063.1 bits (2748), Expect = 9.7e-307
Identity = 542/683 (79.36%), Positives = 580/683 (84.92%), Query Frame = 0

Query: 53  MALTSKLLAAILLLHLMHLMHFTLSADDPITSNLLLSHSHRAMVLPLYLSSPNSSRLISK 112
           MAL S LL AI   H + LMH T SA DPI +NLLL+  HRAMVLPLYLSSPNSSRLISK
Sbjct: 1   MALQSNLLPAI-AFHFILLMHSTFSA-DPIAANLLLTPPHRAMVLPLYLSSPNSSRLISK 60

Query: 113 PRRHLRESNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCST 172
           PRRHLRESNS N SNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGS+VTYVPC+ 
Sbjct: 61  PRRHLRESNSYNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCAN 120

Query: 173 CEQCGRHQDPKFDPELSSTYQPVKCNIDCTCDSNGAQCIYERQYAEMSTSSGVLGEDIIS 232
           CEQCGRHQDPKFDP+LSST++PVKCN+DC+CD +G  C+YERQYAEMSTSSG+LGEDIIS
Sbjct: 121 CEQCGRHQDPKFDPDLSSTFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIIS 180

Query: 233 FGNQSELVPQRAVFGCENVETGDLYSQHADGIMGLGSGDLSIVDQLVEKSVINDSFSLCY 292
           FGNQSELVPQRA FGCE VETGDLYSQ ADGIMGLGSG+LSIVDQLVEK VIND+FSLCY
Sbjct: 181 FGNQSELVPQRATFGCETVETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCY 240

Query: 293 GGMDIGGGAMVLGGISPPSDMIFSHSDPFRDGNSPYYNVDLKEIHVAGKKLLLSPSVFDG 352
           GGMDIGGGAMVLGGIS PSDM FS SD  R   SPYYNVDLKEI VAGKKLLL+PSVFDG
Sbjct: 241 GGMDIGGGAMVLGGISTPSDMAFSFSDRMR---SPYYNVDLKEIRVAGKKLLLNPSVFDG 300

Query: 353 KYGTVLDSGTTYAYLPEAAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSDVAELS 412
           K+GTVLDSGTTYAYLP+ AFGAFKDAIMDE+HSL+KIGGPDPNFNDICFSGAGSDVAELS
Sbjct: 301 KFGTVLDSGTTYAYLPQPAFGAFKDAIMDEVHSLKKIGGPDPNFNDICFSGAGSDVAELS 360

Query: 413 KTFPAVDMVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVM 472
           KTFPAVDMVF+NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGI+VRNTLVM
Sbjct: 361 KTFPAVDMVFENGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVM 420

Query: 473 YDREHSKIGFWKTNCSELWERLHISDDTAPAPSVSNTSRDTEM----------------- 532
           YDRE+SKIGFWKTNCSELWERLHIS+DTA APSVSNTS DTEM                 
Sbjct: 421 YDRENSKIGFWKTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASAPSEA 480

Query: 533 -----------------------APASAPSESPHYMIPGELQIGRIIFEILLNISYTDLE 592
                                  APASAPSE+PHYMIPGELQ+GRI FEILLNISY DLE
Sbjct: 481 PASAPSEAPASAPSEAPATAPSEAPASAPSEAPHYMIPGELQVGRITFEILLNISYEDLE 540

Query: 593 PHITELSDHIDRELNISYSQVRLLNFTMRGNDSLIRLAILPTGSSEFFSHATATTIIALI 652
           PHITELSD I +ELN+SYSQVRLLNFTM+GNDSLI+LAI+P GSSEFFSHATATTIIA I
Sbjct: 541 PHITELSDLIAQELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSHATATTIIAQI 600

Query: 653 VEHHVQLPPTFGSYQLYPW-----------------VVLAIIVTLILGLSALGVWIIWRR 679
           VEHH+QLPPTFGSYQ+  W                 V++A+IVTL+LGLSALGVW+IWRR
Sbjct: 601 VEHHMQLPPTFGSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLSALGVWLIWRR 660

BLAST of Sgr021321 vs. NCBI nr
Match: KAG7011039.1 (Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1051.2 bits (2717), Expect = 3.8e-303
Identity = 532/645 (82.48%), Positives = 570/645 (88.37%), Query Frame = 0

Query: 53  MALTSKLLAAILLLHLMHLMHFTLSADDPITSNLLLSHSHRAMVLPLYLSSPNSSRLISK 112
           MA T  LL A LLLH +HL HFTLSA DPI+SN LL+ SHRAMVLPLY SSPNSS+LISK
Sbjct: 1   MARTPNLLLA-LLLHFLHLTHFTLSA-DPISSNPLLTPSHRAMVLPLYRSSPNSSKLISK 60

Query: 113 PRRHLRE-SNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCS 172
           P R LR   NSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPCS
Sbjct: 61  PHRRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCS 120

Query: 173 TCEQCGRHQDPKFDPELSSTYQPVKCNIDCTCDSNGAQCIYERQYAEMSTSSGVLGEDII 232
           TCE CG+HQDPKFDPELSSTYQPVKCN DCTCD +G QC+YERQYAEMSTSSGVLG+D+I
Sbjct: 121 TCELCGKHQDPKFDPELSSTYQPVKCNSDCTCDGDGVQCVYERQYAEMSTSSGVLGDDVI 180

Query: 233 SFGNQSELVPQRAVFGCENVETGDLYSQHADGIMGLGSGDLSIVDQLVEKSVINDSFSLC 292
           SFGNQS LVPQRAVFGCEN ETGDLYSQ ADGIMGLGSGDLSIVDQLVEK VINDSFSLC
Sbjct: 181 SFGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLC 240

Query: 293 YGGMDIGGGAMVLGGISPPSDMIFSHSDPFRDGNSPYYNVDLKEIHVAGKKLLLSPSVFD 352
           YGGMDIGGGAMVLGGISPPS+MIFS+SDP R   SPYYNVDLKEIHVAGKKL L PSVFD
Sbjct: 241 YGGMDIGGGAMVLGGISPPSEMIFSYSDPVR---SPYYNVDLKEIHVAGKKLPLEPSVFD 300

Query: 353 GKYGTVLDSGTTYAYLPEAAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSDVAEL 412
           G+YG+VLDSGTTY+YLP+ AFG FK+AI++ LHSL+KIGGPDPNF D CFSGAGSD AEL
Sbjct: 301 GRYGSVLDSGTTYSYLPQEAFGPFKNAILNALHSLKKIGGPDPNFKDTCFSGAGSDAAEL 360

Query: 413 SKTFPAVDMVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENG-NDQTTLLGGIVVRNTL 472
           SKTFP VD+VFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENG NDQTTLLGGI+VRNTL
Sbjct: 361 SKTFPTVDLVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTL 420

Query: 473 VMYDREHSKIGFWKTNCSELWERLHISDDTAPAPSVSNTSRDTEMAPASAPSESPHYMIP 532
           VMYDREHSKIGFWKTNCSELWERLHISDD A APSVSNTS DT+MAPASAPSESPH MIP
Sbjct: 421 VMYDREHSKIGFWKTNCSELWERLHISDDNADAPSVSNTSHDTDMAPASAPSESPHDMIP 480

Query: 533 GELQIGRIIFEILLNISYTDLEPHITELSDHIDRELNISYSQVRLLNFTMRGNDSLIRLA 592
            +LQIGRI F+ILLNISY  LEPHIT+LSDHI  ELN+S+SQVRLLNFTMRGN SLI+LA
Sbjct: 481 EDLQIGRITFDILLNISYKHLEPHITQLSDHIAHELNVSHSQVRLLNFTMRGNHSLIQLA 540

Query: 593 ILPTGSSEFFSHATATTIIALIVEHHVQLPPTFGSYQLYPWVV----------------- 652
           ILP GSSEFFS ATATTII+LIVEHH++LPP +GSYQ+  W V                 
Sbjct: 541 ILPNGSSEFFSPATATTIISLIVEHHMKLPPKYGSYQVIRWYVEPLMDRSLWKRLYILVG 600

Query: 653 LAIIVTLILGLSALGVWIIWRRRQQSFNSYKPVNAAIPEQELQPL 679
           LAI+VTLILGLSA+GVW IWRRRQQ+F+SYKPVNAA PEQELQ L
Sbjct: 601 LAIMVTLILGLSAMGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 640

BLAST of Sgr021321 vs. NCBI nr
Match: KAG6571239.1 (Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1050.4 bits (2715), Expect = 6.5e-303
Identity = 532/645 (82.48%), Positives = 570/645 (88.37%), Query Frame = 0

Query: 53  MALTSKLLAAILLLHLMHLMHFTLSADDPITSNLLLSHSHRAMVLPLYLSSPNSSRLISK 112
           MA T  LL A LLLH +HL HFTLSA DPI+SN LL+ SHRAMVLPLY SSPNSS+LISK
Sbjct: 1   MARTPNLLLA-LLLHFLHLTHFTLSA-DPISSNPLLTPSHRAMVLPLYRSSPNSSKLISK 60

Query: 113 PRRHLRE-SNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCS 172
           P R LR   NSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPCS
Sbjct: 61  PHRRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCS 120

Query: 173 TCEQCGRHQDPKFDPELSSTYQPVKCNIDCTCDSNGAQCIYERQYAEMSTSSGVLGEDII 232
           TCE CG+HQDPKFDPELSSTYQPVKCN DCTCD +G QC+YERQYAEMSTSSGVLG+D+I
Sbjct: 121 TCELCGKHQDPKFDPELSSTYQPVKCNSDCTCDGDGVQCVYERQYAEMSTSSGVLGDDVI 180

Query: 233 SFGNQSELVPQRAVFGCENVETGDLYSQHADGIMGLGSGDLSIVDQLVEKSVINDSFSLC 292
           SFGNQS LVPQRAVFGCEN ETGDLYSQ ADGIMGLGSGDLSIVDQLVEK VINDSFSLC
Sbjct: 181 SFGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLC 240

Query: 293 YGGMDIGGGAMVLGGISPPSDMIFSHSDPFRDGNSPYYNVDLKEIHVAGKKLLLSPSVFD 352
           YGGMDIGGGAMVLGGISPPS+MIFS+SDP R   SPYYNVDLKEIHVAGKKL L PSVFD
Sbjct: 241 YGGMDIGGGAMVLGGISPPSEMIFSYSDPVR---SPYYNVDLKEIHVAGKKLPLEPSVFD 300

Query: 353 GKYGTVLDSGTTYAYLPEAAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSDVAEL 412
           G+YG+VLDSGTTY+YLP+ AFG FK+AI++ LHSL+KIGGPDPNF D CFSGAGSD AEL
Sbjct: 301 GRYGSVLDSGTTYSYLPQEAFGPFKNAILNALHSLKKIGGPDPNFKDTCFSGAGSDAAEL 360

Query: 413 SKTFPAVDMVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENG-NDQTTLLGGIVVRNTL 472
           SKTFP VD+VFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENG NDQTTLLGGI+VRNTL
Sbjct: 361 SKTFPTVDLVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTL 420

Query: 473 VMYDREHSKIGFWKTNCSELWERLHISDDTAPAPSVSNTSRDTEMAPASAPSESPHYMIP 532
           VMYDREHSKIGFWKTNCSELWERLHISDD A APSVSNTS DT+MAPASAPSESPH MIP
Sbjct: 421 VMYDREHSKIGFWKTNCSELWERLHISDDNADAPSVSNTSHDTDMAPASAPSESPHDMIP 480

Query: 533 GELQIGRIIFEILLNISYTDLEPHITELSDHIDRELNISYSQVRLLNFTMRGNDSLIRLA 592
            +LQIGRI F+ILLNISY  LEPHIT+LSDHI  ELN+S+SQVRLLNFTMRGN SLI+LA
Sbjct: 481 EDLQIGRITFDILLNISYKHLEPHITQLSDHIAHELNVSHSQVRLLNFTMRGNHSLIQLA 540

Query: 593 ILPTGSSEFFSHATATTIIALIVEHHVQLPPTFGSYQLYPW-----------------VV 652
           ILP GSSEFFS ATATTII+LIVEHH++LPP +GSYQ+  W                 V 
Sbjct: 541 ILPNGSSEFFSPATATTIISLIVEHHMKLPPKYGSYQVIRWNVEPLMDRSLWKRLYILVG 600

Query: 653 LAIIVTLILGLSALGVWIIWRRRQQSFNSYKPVNAAIPEQELQPL 679
           LAI+VTLILGLSA+GVW IWRRRQQ+F+SYKPVNAA PEQELQ L
Sbjct: 601 LAIMVTLILGLSAMGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 640

BLAST of Sgr021321 vs. NCBI nr
Match: XP_022985603.1 (aspartic proteinase nepenthesin-1-like [Cucurbita maxima])

HSP 1 Score: 1049.7 bits (2713), Expect = 1.1e-302
Identity = 529/645 (82.02%), Positives = 571/645 (88.53%), Query Frame = 0

Query: 53  MALTSKLLAAILLLHLMHLMHFTLSADDPITSNLLLSHSHRAMVLPLYLSSPNSSRLISK 112
           MA T  LL A+ LLH +HL HFTLSA DPI+SN LL+ SHRAMVLPLY SSPNSS+LISK
Sbjct: 1   MARTPNLLLAV-LLHFLHLTHFTLSA-DPISSNPLLTPSHRAMVLPLYRSSPNSSKLISK 60

Query: 113 PRRHLRE-SNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCS 172
           P R LR   NSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPCS
Sbjct: 61  PHRRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCS 120

Query: 173 TCEQCGRHQDPKFDPELSSTYQPVKCNIDCTCDSNGAQCIYERQYAEMSTSSGVLGEDII 232
           TCE CG+HQDPKFDPELSSTYQPVKCN DCTCD++G QC+YERQYAEMSTSSGVLG+D+I
Sbjct: 121 TCELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVI 180

Query: 233 SFGNQSELVPQRAVFGCENVETGDLYSQHADGIMGLGSGDLSIVDQLVEKSVINDSFSLC 292
           SFGNQS LVPQRAVFGCEN ETGDLYSQ ADGIMGLGSGDLSIVDQLVEK VINDSFSLC
Sbjct: 181 SFGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLC 240

Query: 293 YGGMDIGGGAMVLGGISPPSDMIFSHSDPFRDGNSPYYNVDLKEIHVAGKKLLLSPSVFD 352
           YGGMDIGGGAMVLGGISPPS+MIFS+SDP R   SPYYNVDLKEIHVAGKKL L PSVFD
Sbjct: 241 YGGMDIGGGAMVLGGISPPSEMIFSYSDPVR---SPYYNVDLKEIHVAGKKLPLEPSVFD 300

Query: 353 GKYGTVLDSGTTYAYLPEAAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSDVAEL 412
           G+YG+VLDSGTTY+YLP+ AFG FK+AIM+ LHSL+KIGGPDPNF D CFSGAGSD AEL
Sbjct: 301 GRYGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAEL 360

Query: 413 SKTFPAVDMVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENG-NDQTTLLGGIVVRNTL 472
           SKTFP VD++FDNGQKLSLAPENYLFRHSKVHGAYCLGIFENG NDQTTLLGGI+VRNTL
Sbjct: 361 SKTFPTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTL 420

Query: 473 VMYDREHSKIGFWKTNCSELWERLHISDDTAPAPSVSNTSRDTEMAPASAPSESPHYMIP 532
           VMYDREHSKIGFWKTNCSELWERLHISD+ A APSVSNTS DT+ APASAPSESPH MIP
Sbjct: 421 VMYDREHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIP 480

Query: 533 GELQIGRIIFEILLNISYTDLEPHITELSDHIDRELNISYSQVRLLNFTMRGNDSLIRLA 592
            ++QIGRI F+ILLNISY  LEPHIT LSDHI +ELN+S+SQVRLLNFTMRGN SLI+LA
Sbjct: 481 EDIQIGRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLA 540

Query: 593 ILPTGSSEFFSHATATTIIALIVEHHVQLPPTFGSYQLYPW-----------------VV 652
           ILP GSSEFFSHATATTII+LIVEHH++LPP +GSYQ+  W                 V 
Sbjct: 541 ILPNGSSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVG 600

Query: 653 LAIIVTLILGLSALGVWIIWRRRQQSFNSYKPVNAAIPEQELQPL 679
           LAI+VTLILGLSA+GVW IWRRRQQ+F+SYKPVNAA PEQELQ L
Sbjct: 601 LAIMVTLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 640

BLAST of Sgr021321 vs. ExPASy Swiss-Prot
Match: Q4V3D2 (Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1)

HSP 1 Score: 161.8 bits (408), Expect = 2.8e-38
Identity = 122/375 (32.53%), Positives = 187/375 (49.87%), Query Frame = 0

Query: 140 GYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQD-----PKFDPELSSTYQP 199
           G Y T++ +G+PP+++ + VDTGS + +V C+ C +C    D       +D + SST + 
Sbjct: 76  GLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKN 135

Query: 200 VKCNID-CT----CDSNGAQ--CIYERQYAEMSTSSGVLGEDIISF----GN-QSELVPQ 259
           V C  D C+     ++ GA+  C Y   Y + STS G   +D I+     GN ++  + Q
Sbjct: 136 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 195

Query: 260 RAVFGCENVETGDL--YSQHADGIMGLGSGDLSIVDQLVEKSVINDSFSLCYGGMDIGGG 319
             VFGC   ++G L       DGIMG G  + SI+ QL         FS C   M+ GGG
Sbjct: 196 EVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMN-GGG 255

Query: 320 AMVLGGISPPSDMIFSHSDPFRDGNSPYYNVDLKEIHVAGKKLLLSPSV--FDGKYGTVL 379
              +G +  P       + P    N  +YNV LK + V G  + L PS+   +G  GT++
Sbjct: 256 IFAVGEVESP----VVKTTPIVP-NQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTII 315

Query: 380 DSGTTYAYLPEAAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSDVAELSKTFPAV 439
           DSGTT AYLP+  +    +++++++ + +++          CF    S  +   K FP V
Sbjct: 316 DSGTTLAYLPQNLY----NSLIEKITAKQQVKLHMVQETFACF----SFTSNTDKAFPVV 375

Query: 440 DMVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENG-----NDQTTLLGGIVVRNTLVMY 489
           ++ F++  KLS+ P +YLF  S     YC G    G          LLG +V+ N LV+Y
Sbjct: 376 NLHFEDSLKLSVYPHDYLF--SLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVY 434

BLAST of Sgr021321 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 148.3 bits (373), Expect = 3.2e-34
Identity = 107/367 (29.16%), Positives = 172/367 (46.87%), Query Frame = 0

Query: 139 NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPELSSTYQPVKCN 198
           +G Y +R+ +GTP ++  L++DTGS V ++ C  C  C +  DP F+P  SSTY+ + C+
Sbjct: 159 SGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCS 218

Query: 199 I-DCT------CDSNGAQCIYERQYAEMSTSSGVLGEDIISFGNQSELVPQRAVFGCENV 258
              C+      C SN  +C+Y+  Y + S + G L  D ++FGN  ++       GC + 
Sbjct: 219 APQCSLLETSACRSN--KCLYQVSYGDGSFTVGELATDTVTFGNSGKI--NNVALGCGHD 278

Query: 259 ETGDLYSQHADGIMGLGSGDLSIVDQLVEKSVINDSFSLCYGGMDIGGGAMV------LG 318
             G L++  A G++GLG G LSI +Q+        SFS C    D G  + +      LG
Sbjct: 279 NEG-LFT-GAAGLLGLGGGVLSITNQMKA-----TSFSYCLVDRDSGKSSSLDFNSVQLG 338

Query: 319 GISPPSDMIFSHSDPFRDGNSPYYNVDLKEIHVAGKKLLLSPSVFD----GKYGTVLDSG 378
           G    + ++ +           +Y V L    V G+K++L  ++FD    G  G +LD G
Sbjct: 339 GGDATAPLLRNKK------IDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCG 398

Query: 379 TTYAYLPEAAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSDVAELSKT-FPAVDM 438
           T    L   A+ + +DA +    +L+K G    +  D C+     D + LS    P V  
Sbjct: 399 TAVTRLQTQAYNSLRDAFLKLTVNLKK-GSSSISLFDTCY-----DFSSLSTVKVPTVAF 458

Query: 439 VFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDREHSKI 488
            F  G+ L L  +NYL       G +C   F   +   +++G +  + T + YD   + I
Sbjct: 459 HFTGGKSLDLPAKNYLIPVDD-SGTFCFA-FAPTSSSLSIIGNVQQQGTRITYDLSKNVI 500

BLAST of Sgr021321 vs. ExPASy Swiss-Prot
Match: Q9S9K4 (Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2)

HSP 1 Score: 147.5 bits (371), Expect = 5.4e-34
Identity = 120/402 (29.85%), Positives = 188/402 (46.77%), Query Frame = 0

Query: 123 NNRSNARMRLYDDLLLN--------GYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCE 182
           + R ++RM    DL L         G Y T++ +G+PP+++ + VDTGS + ++ C  C 
Sbjct: 47  DTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCP 106

Query: 183 QCGRHQD-----PKFDPELSSTYQPVKCNID-CT--CDSNGAQ----CIYERQYAEMSTS 242
           +C    +       FD   SST + V C+ D C+    S+  Q    C Y   YA+ STS
Sbjct: 107 KCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVYADESTS 166

Query: 243 SGVLGEDIISFGN-----QSELVPQRAVFGCENVETGDLYS--QHADGIMGLGSGDLSIV 302
            G    D+++        ++  + Q  VFGC + ++G L +     DG+MG G  + S++
Sbjct: 167 DGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVL 226

Query: 303 DQLVEKSVINDSFSLCYGGMDIGGGAMVLGGISPPSDMIFSHSDPFRDGNSPYYNVDLKE 362
            QL         FS C   +  GGG   +G +  P       + P    N  +YNV L  
Sbjct: 227 SQLAATGDAKRVFSHCLDNVK-GGGIFAVGVVDSPK----VKTTPMVP-NQMHYNVMLMG 286

Query: 363 IHVAGKKLLLSPSVFDGKYGTVLDSGTTYAYLPEAAFGAFKDAIMD----ELHSLEKIGG 422
           + V G  L L  S+     GT++DSGTT AY P+  + +  + I+     +LH +E+   
Sbjct: 287 MDVDGTSLDLPRSIVRNG-GTIVDSGTTLAYFPKVLYDSLIETILARQPVKLHIVEE--- 346

Query: 423 PDPNFNDICFSGAGSDVAELSKTFPAVDMVFDNGQKLSLAPENYLFRHSKVHGAYCLGIF 482
               F   CFS +      + + FP V   F++  KL++ P +YLF  +     YC G  
Sbjct: 347 ---TFQ--CFSFS----TNVDEAFPPVSFEFEDSVKLTVYPHDYLF--TLEEELYCFGWQ 406

Query: 483 ENG-----NDQTTLLGGIVVRNTLVMYDREHSKIGFWKTNCS 489
             G       +  LLG +V+ N LV+YD ++  IG+   NCS
Sbjct: 407 AGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCS 427

BLAST of Sgr021321 vs. ExPASy Swiss-Prot
Match: Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 139.8 bits (351), Expect = 1.1e-31
Identity = 103/360 (28.61%), Positives = 162/360 (45.00%), Query Frame = 0

Query: 139 NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPELSSTYQPVKCN 198
           +G Y  R+ +G+PP+   +++D+GS + +V C  C+ C +  DP FDP  S +Y  V C 
Sbjct: 128 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCG 187

Query: 199 IDCTCD------SNGAQCIYERQYAEMSTSSGVLGEDIISFGNQSELVPQRAVFGCENVE 258
               CD       +   C YE  Y + S + G L  + ++F   ++ V +    GC +  
Sbjct: 188 -SSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTF---AKTVVRNVAMGCGHRN 247

Query: 259 TGDLYSQHADGIMGLGSGDLSIVDQLVEKSVINDSFSLCYGGMDIGGGAMVLGGISPPSD 318
            G      A G++G+G G +S V QL  ++     + L   G D   G++V G  + P  
Sbjct: 248 RGMFIG--AAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTD-STGSLVFGREALPVG 307

Query: 319 MIFSHSDPFRDGNSP-YYNVDLKEIHVAGKKLLLSPSVFD----GKYGTVLDSGTTYAYL 378
              S     R+  +P +Y V LK + V G ++ L   VFD    G  G V+D+GT    L
Sbjct: 308 A--SWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRL 367

Query: 379 PEAAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSDVAELSKTFPAVDMVFDNGQK 438
           P AA+ AF+D    +  +L +  G   +  D C+  +G     +S   P V   F  G  
Sbjct: 368 PTAAYVAFRDGFKSQTANLPRASG--VSIFDTCYDLSGF----VSVRVPTVSFYFTEGPV 427

Query: 439 LSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDREHSKIGFWKTNC 488
           L+L   N+L       G YC   F       +++G I      V +D  +  +GF    C
Sbjct: 428 LTLPARNFLMPVDD-SGTYCFA-FAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of Sgr021321 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 137.5 bits (345), Expect = 5.6e-31
Identity = 120/403 (29.78%), Positives = 182/403 (45.16%), Query Frame = 0

Query: 108 RLISKPRRHLRESNSNNRSNARMR--LYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTV 167
           R I +  R +R  N+  +S++ +   +Y     +G Y   + IGTP   F+ I+DTGS +
Sbjct: 63  RAIKRGERRMRSINAMLQSSSGIETPVYAG---DGEYLMNVAIGTPDSSFSAIMDTGSDL 122

Query: 168 TYVPCSTCEQCGRHQDPKFDPELSSTYQPVKCNID-C------TCDSNGAQCIYERQYAE 227
            +  C  C QC     P F+P+ SS++  + C    C      TC++N  +C Y   Y +
Sbjct: 123 IWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNN--ECQYTYGYGD 182

Query: 228 MSTSSGVLGEDIISFGNQSELVPQRAVFGCENVETGDLYSQHADGIMGLGSGDLSIVDQL 287
            ST+ G +  +  +F   S  VP  A FGC     G     +  G++G+G G LS+  QL
Sbjct: 183 GSTTQGYMATETFTFETSS--VPNIA-FGCGEDNQG-FGQGNGAGLIGMGWGPLSLPSQL 242

Query: 288 VEKSVINDSFSLC---YGG-----MDIGGGAMVLGGISPPSDMIFSHSDPFRDGNSPYYN 347
                    FS C   YG      + +G  A  +   SP + +I S  +P       YY 
Sbjct: 243 GV-----GQFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNP------TYYY 302

Query: 348 VDLKEIHVAGKKLLLSPSVF----DGKYGTVLDSGTTYAYLPEAAFGAFKDAIMDELHSL 407
           + L+ I V G  L +  S F    DG  G ++DSGTT  YLP+ A+ A   A  D++ +L
Sbjct: 303 ITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI-NL 362

Query: 408 EKIGGPDPNFNDICFS--GAGSDVAELSKTFPAVDMVFDNGQKLSLAPENYLFRHSKVHG 467
             +       +  CF     GS V       P + M FD G  L+L  +N L   S   G
Sbjct: 363 PTVDESSSGLS-TCFQQPSDGSTV-----QVPEISMQFDGG-VLNLGEQNILI--SPAEG 422

Query: 468 AYCLGIFENGNDQTTLLGGIVVRNTLVMYDREHSKIGFWKTNC 488
             CL +  +     ++ G I  + T V+YD ++  + F  T C
Sbjct: 423 VICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of Sgr021321 vs. ExPASy TrEMBL
Match: A0A6J1D718 (aspartic proteinase-like protein 2 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111017863 PE=3 SV=1)

HSP 1 Score: 1063.1 bits (2748), Expect = 4.7e-307
Identity = 542/683 (79.36%), Positives = 580/683 (84.92%), Query Frame = 0

Query: 53  MALTSKLLAAILLLHLMHLMHFTLSADDPITSNLLLSHSHRAMVLPLYLSSPNSSRLISK 112
           MAL S LL AI   H + LMH T SA DPI +NLLL+  HRAMVLPLYLSSPNSSRLISK
Sbjct: 1   MALQSNLLPAI-AFHFILLMHSTFSA-DPIAANLLLTPPHRAMVLPLYLSSPNSSRLISK 60

Query: 113 PRRHLRESNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCST 172
           PRRHLRESNS N SNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGS+VTYVPC+ 
Sbjct: 61  PRRHLRESNSYNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCAN 120

Query: 173 CEQCGRHQDPKFDPELSSTYQPVKCNIDCTCDSNGAQCIYERQYAEMSTSSGVLGEDIIS 232
           CEQCGRHQDPKFDP+LSST++PVKCN+DC+CD +G  C+YERQYAEMSTSSG+LGEDIIS
Sbjct: 121 CEQCGRHQDPKFDPDLSSTFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIIS 180

Query: 233 FGNQSELVPQRAVFGCENVETGDLYSQHADGIMGLGSGDLSIVDQLVEKSVINDSFSLCY 292
           FGNQSELVPQRA FGCE VETGDLYSQ ADGIMGLGSG+LSIVDQLVEK VIND+FSLCY
Sbjct: 181 FGNQSELVPQRATFGCETVETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCY 240

Query: 293 GGMDIGGGAMVLGGISPPSDMIFSHSDPFRDGNSPYYNVDLKEIHVAGKKLLLSPSVFDG 352
           GGMDIGGGAMVLGGIS PSDM FS SD  R   SPYYNVDLKEI VAGKKLLL+PSVFDG
Sbjct: 241 GGMDIGGGAMVLGGISTPSDMAFSFSDRMR---SPYYNVDLKEIRVAGKKLLLNPSVFDG 300

Query: 353 KYGTVLDSGTTYAYLPEAAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSDVAELS 412
           K+GTVLDSGTTYAYLP+ AFGAFKDAIMDE+HSL+KIGGPDPNFNDICFSGAGSDVAELS
Sbjct: 301 KFGTVLDSGTTYAYLPQPAFGAFKDAIMDEVHSLKKIGGPDPNFNDICFSGAGSDVAELS 360

Query: 413 KTFPAVDMVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVM 472
           KTFPAVDMVF+NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGI+VRNTLVM
Sbjct: 361 KTFPAVDMVFENGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVM 420

Query: 473 YDREHSKIGFWKTNCSELWERLHISDDTAPAPSVSNTSRDTEM----------------- 532
           YDRE+SKIGFWKTNCSELWERLHIS+DTA APSVSNTS DTEM                 
Sbjct: 421 YDRENSKIGFWKTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASAPSEA 480

Query: 533 -----------------------APASAPSESPHYMIPGELQIGRIIFEILLNISYTDLE 592
                                  APASAPSE+PHYMIPGELQ+GRI FEILLNISY DLE
Sbjct: 481 PASAPSEAPASAPSEAPATAPSEAPASAPSEAPHYMIPGELQVGRITFEILLNISYEDLE 540

Query: 593 PHITELSDHIDRELNISYSQVRLLNFTMRGNDSLIRLAILPTGSSEFFSHATATTIIALI 652
           PHITELSD I +ELN+SYSQVRLLNFTM+GNDSLI+LAI+P GSSEFFSHATATTIIA I
Sbjct: 541 PHITELSDLIAQELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSHATATTIIAQI 600

Query: 653 VEHHVQLPPTFGSYQLYPW-----------------VVLAIIVTLILGLSALGVWIIWRR 679
           VEHH+QLPPTFGSYQ+  W                 V++A+IVTL+LGLSALGVW+IWRR
Sbjct: 601 VEHHMQLPPTFGSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLSALGVWLIWRR 660

BLAST of Sgr021321 vs. ExPASy TrEMBL
Match: A0A6J1JE38 (aspartic proteinase nepenthesin-1-like OS=Cucurbita maxima OX=3661 GN=LOC111483616 PE=3 SV=1)

HSP 1 Score: 1049.7 bits (2713), Expect = 5.4e-303
Identity = 529/645 (82.02%), Positives = 571/645 (88.53%), Query Frame = 0

Query: 53  MALTSKLLAAILLLHLMHLMHFTLSADDPITSNLLLSHSHRAMVLPLYLSSPNSSRLISK 112
           MA T  LL A+ LLH +HL HFTLSA DPI+SN LL+ SHRAMVLPLY SSPNSS+LISK
Sbjct: 1   MARTPNLLLAV-LLHFLHLTHFTLSA-DPISSNPLLTPSHRAMVLPLYRSSPNSSKLISK 60

Query: 113 PRRHLRE-SNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCS 172
           P R LR   NSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPCS
Sbjct: 61  PHRRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCS 120

Query: 173 TCEQCGRHQDPKFDPELSSTYQPVKCNIDCTCDSNGAQCIYERQYAEMSTSSGVLGEDII 232
           TCE CG+HQDPKFDPELSSTYQPVKCN DCTCD++G QC+YERQYAEMSTSSGVLG+D+I
Sbjct: 121 TCELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVI 180

Query: 233 SFGNQSELVPQRAVFGCENVETGDLYSQHADGIMGLGSGDLSIVDQLVEKSVINDSFSLC 292
           SFGNQS LVPQRAVFGCEN ETGDLYSQ ADGIMGLGSGDLSIVDQLVEK VINDSFSLC
Sbjct: 181 SFGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLC 240

Query: 293 YGGMDIGGGAMVLGGISPPSDMIFSHSDPFRDGNSPYYNVDLKEIHVAGKKLLLSPSVFD 352
           YGGMDIGGGAMVLGGISPPS+MIFS+SDP R   SPYYNVDLKEIHVAGKKL L PSVFD
Sbjct: 241 YGGMDIGGGAMVLGGISPPSEMIFSYSDPVR---SPYYNVDLKEIHVAGKKLPLEPSVFD 300

Query: 353 GKYGTVLDSGTTYAYLPEAAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSDVAEL 412
           G+YG+VLDSGTTY+YLP+ AFG FK+AIM+ LHSL+KIGGPDPNF D CFSGAGSD AEL
Sbjct: 301 GRYGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAEL 360

Query: 413 SKTFPAVDMVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENG-NDQTTLLGGIVVRNTL 472
           SKTFP VD++FDNGQKLSLAPENYLFRHSKVHGAYCLGIFENG NDQTTLLGGI+VRNTL
Sbjct: 361 SKTFPTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTL 420

Query: 473 VMYDREHSKIGFWKTNCSELWERLHISDDTAPAPSVSNTSRDTEMAPASAPSESPHYMIP 532
           VMYDREHSKIGFWKTNCSELWERLHISD+ A APSVSNTS DT+ APASAPSESPH MIP
Sbjct: 421 VMYDREHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIP 480

Query: 533 GELQIGRIIFEILLNISYTDLEPHITELSDHIDRELNISYSQVRLLNFTMRGNDSLIRLA 592
            ++QIGRI F+ILLNISY  LEPHIT LSDHI +ELN+S+SQVRLLNFTMRGN SLI+LA
Sbjct: 481 EDIQIGRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLA 540

Query: 593 ILPTGSSEFFSHATATTIIALIVEHHVQLPPTFGSYQLYPW-----------------VV 652
           ILP GSSEFFSHATATTII+LIVEHH++LPP +GSYQ+  W                 V 
Sbjct: 541 ILPNGSSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVG 600

Query: 653 LAIIVTLILGLSALGVWIIWRRRQQSFNSYKPVNAAIPEQELQPL 679
           LAI+VTLILGLSA+GVW IWRRRQQ+F+SYKPVNAA PEQELQ L
Sbjct: 601 LAIMVTLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 640

BLAST of Sgr021321 vs. ExPASy TrEMBL
Match: A0A6J1EYA5 (aspartic proteinase nepenthesin-1-like OS=Cucurbita moschata OX=3662 GN=LOC111439463 PE=3 SV=1)

HSP 1 Score: 1047.7 bits (2708), Expect = 2.0e-302
Identity = 531/645 (82.33%), Positives = 569/645 (88.22%), Query Frame = 0

Query: 53  MALTSKLLAAILLLHLMHLMHFTLSADDPITSNLLLSHSHRAMVLPLYLSSPNSSRLISK 112
           MA T  LL A LLLH +HL HFTLSA DPI+SN LL+ SHRAMVLPLY SSPNSS+LISK
Sbjct: 1   MARTPNLLLA-LLLHFLHLTHFTLSA-DPISSNPLLTPSHRAMVLPLYRSSPNSSKLISK 60

Query: 113 PRRHLRE-SNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCS 172
           P R LR   NSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPCS
Sbjct: 61  PHRRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCS 120

Query: 173 TCEQCGRHQDPKFDPELSSTYQPVKCNIDCTCDSNGAQCIYERQYAEMSTSSGVLGEDII 232
           TCE CG+HQDPKFDPELSSTYQPVKCN DCTCD +G QC+YERQYAEMSTSSGVLG+D+I
Sbjct: 121 TCELCGKHQDPKFDPELSSTYQPVKCNSDCTCDGDGVQCVYERQYAEMSTSSGVLGDDVI 180

Query: 233 SFGNQSELVPQRAVFGCENVETGDLYSQHADGIMGLGSGDLSIVDQLVEKSVINDSFSLC 292
           SFGNQS LVPQRAVFGCEN ETGDLYSQ ADGIMGLGSGDLSIVDQLVEK VINDSFSLC
Sbjct: 181 SFGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLC 240

Query: 293 YGGMDIGGGAMVLGGISPPSDMIFSHSDPFRDGNSPYYNVDLKEIHVAGKKLLLSPSVFD 352
           YGGMDIGGGAMVLGGISPPS+MIFS+SDP R   SPYYNVDLKEIHVAGKKL L PSVFD
Sbjct: 241 YGGMDIGGGAMVLGGISPPSEMIFSYSDPVR---SPYYNVDLKEIHVAGKKLPLEPSVFD 300

Query: 353 GKYGTVLDSGTTYAYLPEAAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSDVAEL 412
           G+YG+VLDSGTTY+YLP+ AFG FK+AI++ LHSL+KIGGPDPNF D CFSGAGSD AEL
Sbjct: 301 GRYGSVLDSGTTYSYLPQEAFGPFKNAILNALHSLKKIGGPDPNFKDTCFSGAGSDAAEL 360

Query: 413 SKTFPAVDMVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENG-NDQTTLLGGIVVRNTL 472
           SKTFP VD+VFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENG NDQTTLLGGI+VRNTL
Sbjct: 361 SKTFPTVDLVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTL 420

Query: 473 VMYDREHSKIGFWKTNCSELWERLHISDDTAPAPSVSNTSRDTEMAPASAPSESPHYMIP 532
           VMYDREHSKIGFWKTNCSELWERLHISDD A APSVSNTS DT+MAPASAPSESPH MIP
Sbjct: 421 VMYDREHSKIGFWKTNCSELWERLHISDDNADAPSVSNTSHDTDMAPASAPSESPHDMIP 480

Query: 533 GELQIGRIIFEILLNISYTDLEPHITELSDHIDRELNISYSQVRLLNFTMRGNDSLIRLA 592
            +LQIGRI F+ILLNISY  LEPHIT+LSDHI  ELN+S+SQVRLLNFTMRGN SLI+LA
Sbjct: 481 EDLQIGRITFDILLNISYKHLEPHITQLSDHIAHELNVSHSQVRLLNFTMRGNHSLIQLA 540

Query: 593 ILPTGSSEFFSHATATTIIALIVEHHVQLPPTFGSYQLYPW-----------------VV 652
           ILP GSSEFFS ATATTII+LIV HH++LPP +GSYQ+  W                 V 
Sbjct: 541 ILPNGSSEFFSPATATTIISLIVGHHMKLPPKYGSYQVIRWNVEPLMDRSLWKRLYILVG 600

Query: 653 LAIIVTLILGLSALGVWIIWRRRQQSFNSYKPVNAAIPEQELQPL 679
           LAI+VTLILGLSA+GVW IWRRRQQ+F+SYKPVNAA PEQELQ L
Sbjct: 601 LAIMVTLILGLSAMGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 640

BLAST of Sgr021321 vs. ExPASy TrEMBL
Match: A0A5A7TUH1 (Aspartic proteinase CDR1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold243G003080 PE=3 SV=1)

HSP 1 Score: 1025.8 bits (2651), Expect = 8.3e-296
Identity = 511/626 (81.63%), Positives = 547/626 (87.38%), Query Frame = 0

Query: 71  LMHFTLSADDPITSNLLLSHSHRAMVLPLYLSSPNSSRLISKPRRHLRE-SNSNNRSNAR 130
           L+HF LSA DPI+ N L++ SHRAMVLPLYLSS NSS+ IS P RHLR+   S+NRSNAR
Sbjct: 13  LLHFFLSA-DPISPNPLITPSHRAMVLPLYLSSSNSSKFISNPHRHLRQFPTSDNRSNAR 72

Query: 131 MRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPELS 190
           MRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPE S
Sbjct: 73  MRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESS 132

Query: 191 STYQPVKCNIDCTCDSNGAQCIYERQYAEMSTSSGVLGEDIISFGNQSELVPQRAVFGCE 250
           STY+P+KCNIDCTCDS+G QC+YERQYAEMSTSSGVLGED+ISFGNQSEL+PQRAVFGCE
Sbjct: 133 STYKPIKCNIDCTCDSDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCE 192

Query: 251 NVETGDLYSQHADGIMGLGSGDLSIVDQLVEKSVINDSFSLCYGGMDIGGGAMVLGGISP 310
           N+ETGDL+SQ ADGIMGLG+GDLS+VDQLVEK  INDSFSLCYGGMDIGGGAMVLGGISP
Sbjct: 193 NMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGISP 252

Query: 311 PSDMIFSHSDPFRDGNSPYYNVDLKEIHVAGKKLLLSPSVFDGKYGTVLDSGTTYAYLPE 370
           PSDMIF++SDP R   SPYYNVDLKEIHVAGKKL LS S+FDG+YGTVLDSGTTYAYLP 
Sbjct: 253 PSDMIFTYSDPVR---SPYYNVDLKEIHVAGKKLPLSSSIFDGRYGTVLDSGTTYAYLPA 312

Query: 371 AAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSDVAELSKTFPAVDMVFDNGQKLS 430
            AFGAFKDAIMDELHSL+KI GPDPNF DICFSGAGSD AELS  FP VDMVF+NGQKLS
Sbjct: 313 EAFGAFKDAIMDELHSLKKIDGPDPNFKDICFSGAGSDAAELSNIFPTVDMVFENGQKLS 372

Query: 431 LAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDREHSKIGFWKTNCSE 490
           LAPENY FRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDR HSKIGFWKTNCSE
Sbjct: 373 LAPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRAHSKIGFWKTNCSE 432

Query: 491 LWERLHISDDTAPAPSVSNTSRDTEMAPASAPSESPHYMIPGELQIGRIIFEILLNISYT 550
           LWERL  SDD A APS+S  S  ++MAPASAP ESPHY IPGELQIGRI FEILLN SYT
Sbjct: 433 LWERLRTSDDNAHAPSISTKSHGSDMAPASAPIESPHYTIPGELQIGRITFEILLNKSYT 492

Query: 551 DLEPHITELSDHIDRELNISYSQVRLLNFTMRGNDSLIRLAILPTGSSEFFSHATATTII 610
           DLEPHITELSDHI +ELN+S+SQV LLNFTMRGNDSLI+LAI+P GSSE FSHAT  TII
Sbjct: 493 DLEPHITELSDHIAQELNVSHSQVLLLNFTMRGNDSLIKLAIIPYGSSEIFSHATVNTII 552

Query: 611 ALIVEHHVQLPPTFGSYQLYPW-----------------VVLAIIVTLILGLSALGVWII 670
           + IVEHH+QLPPTFGSYQ+  W                 V LAIIV  ILGLSALG W I
Sbjct: 553 SKIVEHHMQLPPTFGSYQVVRWNVEPPMERSMWKRLYVLVGLAIIVIFILGLSALGAWFI 612

Query: 671 WRRRQQSFNSYKPVNAAIPEQELQPL 679
            R RQQ+ NSYKPVNAA+PEQELQPL
Sbjct: 613 LRSRQQAINSYKPVNAAVPEQELQPL 634

BLAST of Sgr021321 vs. ExPASy TrEMBL
Match: A0A1S3C7L5 (aspartic proteinase CDR1-like OS=Cucumis melo OX=3656 GN=LOC103497389 PE=3 SV=1)

HSP 1 Score: 1025.8 bits (2651), Expect = 8.3e-296
Identity = 511/626 (81.63%), Positives = 547/626 (87.38%), Query Frame = 0

Query: 71  LMHFTLSADDPITSNLLLSHSHRAMVLPLYLSSPNSSRLISKPRRHLRE-SNSNNRSNAR 130
           L+HF LSA DPI+ N L++ SHRAMVLPLYLSS NSS+ IS P RHLR+   S+NRSNAR
Sbjct: 13  LLHFFLSA-DPISPNPLITPSHRAMVLPLYLSSSNSSKFISNPHRHLRQFPTSDNRSNAR 72

Query: 131 MRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPELS 190
           MRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPE S
Sbjct: 73  MRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESS 132

Query: 191 STYQPVKCNIDCTCDSNGAQCIYERQYAEMSTSSGVLGEDIISFGNQSELVPQRAVFGCE 250
           STY+P+KCNIDCTCDS+G QC+YERQYAEMSTSSGVLGED+ISFGNQSEL+PQRAVFGCE
Sbjct: 133 STYKPIKCNIDCTCDSDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCE 192

Query: 251 NVETGDLYSQHADGIMGLGSGDLSIVDQLVEKSVINDSFSLCYGGMDIGGGAMVLGGISP 310
           N+ETGDL+SQ ADGIMGLG+GDLS+VDQLVEK  INDSFSLCYGGMDIGGGAMVLGGISP
Sbjct: 193 NMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGISP 252

Query: 311 PSDMIFSHSDPFRDGNSPYYNVDLKEIHVAGKKLLLSPSVFDGKYGTVLDSGTTYAYLPE 370
           PSDMIF++SDP R   SPYYNVDLKEIHVAGKKL LS S+FDG+YGTVLDSGTTYAYLP 
Sbjct: 253 PSDMIFTYSDPVR---SPYYNVDLKEIHVAGKKLPLSSSIFDGRYGTVLDSGTTYAYLPA 312

Query: 371 AAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSDVAELSKTFPAVDMVFDNGQKLS 430
            AFGAFKDAIMDELHSL+KI GPDPNF DICFSGAGSD AELS  FP VDMVF+NGQKLS
Sbjct: 313 EAFGAFKDAIMDELHSLKKIDGPDPNFKDICFSGAGSDAAELSNIFPTVDMVFENGQKLS 372

Query: 431 LAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDREHSKIGFWKTNCSE 490
           LAPENY FRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDR HSKIGFWKTNCSE
Sbjct: 373 LAPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRAHSKIGFWKTNCSE 432

Query: 491 LWERLHISDDTAPAPSVSNTSRDTEMAPASAPSESPHYMIPGELQIGRIIFEILLNISYT 550
           LWERL  SDD A APS+S  S  ++MAPASAP ESPHY IPGELQIGRI FEILLN SYT
Sbjct: 433 LWERLRTSDDNAHAPSISTKSHGSDMAPASAPIESPHYTIPGELQIGRITFEILLNKSYT 492

Query: 551 DLEPHITELSDHIDRELNISYSQVRLLNFTMRGNDSLIRLAILPTGSSEFFSHATATTII 610
           DLEPHITELSDHI +ELN+S+SQV LLNFTMRGNDSLI+LAI+P GSSE FSHAT  TII
Sbjct: 493 DLEPHITELSDHIAQELNVSHSQVLLLNFTMRGNDSLIKLAIIPYGSSEIFSHATVNTII 552

Query: 611 ALIVEHHVQLPPTFGSYQLYPW-----------------VVLAIIVTLILGLSALGVWII 670
           + IVEHH+QLPPTFGSYQ+  W                 V LAIIV  ILGLSALG W I
Sbjct: 553 SKIVEHHMQLPPTFGSYQVVRWNVEPPMERSMWKRLYVLVGLAIIVIFILGLSALGAWFI 612

Query: 671 WRRRQQSFNSYKPVNAAIPEQELQPL 679
            R RQQ+ NSYKPVNAA+PEQELQPL
Sbjct: 613 LRSRQQAINSYKPVNAAVPEQELQPL 634

BLAST of Sgr021321 vs. TAIR 10
Match: AT3G50050.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 731.9 bits (1888), Expect = 4.8e-211
Identity = 373/639 (58.37%), Positives = 470/639 (73.55%), Query Frame = 0

Query: 57  SKLLAAILLLHLMHLMHFTLSADDPITSNLLLSHSHRAMVLPLYLSSPN-SSRLISKPRR 116
           S + A   LL  + L +   + ++ +      + S R MV PL+LS PN SSR IS P R
Sbjct: 7   SSIGATFSLLIYLSLPYSITAGENNLLHQSPTARSRRPMVFPLFLSQPNSSSRSISIPHR 66

Query: 117 HLRESNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQ 176
            L +S+S +  ++RMRLYDDLL+NGYYTTRLWIGTPPQ FALIVD+GSTVTYVPCS CEQ
Sbjct: 67  KLHKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQ 126

Query: 177 CGRHQDPKFDPELSSTYQPVKCNIDCTCDSNGAQCIYERQYAEMSTSSGVLGEDIISFGN 236
           CG+HQDPKF PE+SSTYQPVKCN+DC CD +  QC+YER+YAE S+S GVLGED+ISFGN
Sbjct: 127 CGKHQDPKFQPEMSSTYQPVKCNMDCNCDDDREQCVYEREYAEHSSSKGVLGEDLISFGN 186

Query: 237 QSELVPQRAVFGCENVETGDLYSQHADGIMGLGSGDLSIVDQLVEKSVINDSFSLCYGGM 296
           +S+L PQRAVFGCE VETGDLYSQ ADGI+GLG GDLS+VDQLV+K +I++SF LCYGGM
Sbjct: 187 ESQLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGM 246

Query: 297 DIGGGAMVLGGISPPSDMIFSHSDPFRDGNSPYYNVDLKEIHVAGKKLLLSPSVFDGKYG 356
           D+GGG+M+LGG   PSDM+F+ SDP R   SPYYN+DL  I VAGK+L L   VFDG++G
Sbjct: 247 DVGGGSMILGGFDYPSDMVFTDSDPDR---SPYYNIDLTGIRVAGKQLSLHSRVFDGEHG 306

Query: 357 TVLDSGTTYAYLPEAAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSD-VAELSKT 416
            VLDSGTTYAYLP+AAF AF++A+M E+ +L++I GPDPNF D CF  A S+ V+ELSK 
Sbjct: 307 AVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKI 366

Query: 417 FPAVDMVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYD 476
           FP+V+MVF +GQ   L+PENY+FRHSKVHGAYCLG+F NG D TTLLGGIVVRNTLV+YD
Sbjct: 367 FPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYD 426

Query: 477 REHSKIGFWKTNCSELWERLHISDDTAPAPSVSNTSRDTEMAPASAPSESPHYMIPGELQ 536
           RE+SK+GFW+TNCSEL +RLHI     PA   SN S          PS +    + G  Q
Sbjct: 427 RENSKVGFWRTNCSELSDRLHIDGAPPPATLPSNDSN---------PSHNSSSNLSGVTQ 486

Query: 537 IGRIIFEILLNISYTDLEPHITELSDHIDRELNISYSQVRLLNFTMRGNDSLIRLAILPT 596
           +G+I  +I L ++ + L+P I +LS    +EL++  SQV L N T +GN+SL+R+ +LP 
Sbjct: 487 VGQINLDIQLTVNSSYLKPRIEDLSKIFSKELDVKSSQVSLSNLTSKGNESLVRMVVLPP 546

Query: 597 GSSEFFSHATATTIIALIVEHHVQLPPTFGSYQLYPW-------------VVLAI-IVTL 656
             S +FS+ TAT I++    H ++LP  FG+YQL  +             VV+AI I+ +
Sbjct: 547 EPSTWFSNVTATNIVSRFTNHQIKLPEIFGNYQLVNYKLEPPRKRTNNNIVVIAIGIIAV 606

Query: 657 ILGLSALGVWIIWRRRQQSFNSYKPVNAAI-PEQELQPL 679
           I+GLSA G W+IW+R+Q S   YKPV+ AI  EQELQP+
Sbjct: 607 IVGLSAYGAWLIWKRKQTSI-PYKPVDEAIVAEQELQPI 632

BLAST of Sgr021321 vs. TAIR 10
Match: AT5G43100.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 714.9 bits (1844), Expect = 6.0e-206
Identity = 355/602 (58.97%), Positives = 449/602 (74.58%), Query Frame = 0

Query: 95  MVLPL-YLSSPNSSRLISKPRRHLRESNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQ 154
           M+ PL Y S P   R+    RR L +S      NA M+LYDDLL NGYYTTRLWIGTPPQ
Sbjct: 31  MIFPLSYSSLPPRPRVEDFRRRRLHQS---QLPNAHMKLYDDLLSNGYYTTRLWIGTPPQ 90

Query: 155 QFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPELSSTYQPVKCNIDCTCDSNGAQCIYE 214
           +FALIVDTGSTVTYVPCSTC+QCG+HQDPKF PELS++YQ +KCN DC CD  G  C+YE
Sbjct: 91  EFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCNPDCNCDDEGKLCVYE 150

Query: 215 RQYAEMSTSSGVLGEDIISFGNQSELVPQRAVFGCENVETGDLYSQHADGIMGLGSGDLS 274
           R+YAEMS+SSGVL ED+ISFGN+S+L PQRAVFGCEN ETGDL+SQ ADGIMGLG G LS
Sbjct: 151 RRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEETGDLFSQRADGIMGLGRGKLS 210

Query: 275 IVDQLVEKSVINDSFSLCYGGMDIGGGAMVLGGISPPSDMIFSHSDPFRDGNSPYYNVDL 334
           +VDQLV+K VI D FSLCYGGM++GGGAMVLG ISPP  M+FSHSDPFR   SPYYN+DL
Sbjct: 211 VVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKISPPPGMVFSHSDPFR---SPYYNIDL 270

Query: 335 KEIHVAGKKLLLSPSVFDGKYGTVLDSGTTYAYLPEAAFGAFKDAIMDELHSLEKIGGPD 394
           K++HVAGK L L+P VF+GK+GTVLDSGTTYAY P+ AF A KDA++ E+ SL++I GPD
Sbjct: 271 KQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPD 330

Query: 395 PNFNDICFSGAGSDVAELSKTFPAVDMVFDNGQKLSLAPENYLFRHSKVHGAYCLGIFEN 454
           PN++D+CFSGAG DVAE+   FP + M F NGQKL L+PENYLFRH+KV GAYCLGIF +
Sbjct: 331 PNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPD 390

Query: 455 GNDQTTLLGGIVVRNTLVMYDREHSKIGFWKTNCSELWERLHISDDTAPAPSVSNTSRDT 514
             D TTLLGGIVVRNTLV YDRE+ K+GF KTNCS++W RL   +  AP   +S  ++ +
Sbjct: 391 -RDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCSDIWRRLAAPESPAPTSPISQ-NKSS 450

Query: 515 EMAPASAPSESPHYMIPGELQIGRIIFEILLNISYTDLEPHITELSDHIDRELNISYSQV 574
            ++P+ A SESP   +PG  ++G I FE+ ++++ + L+P  +E++D I  EL+I  +QV
Sbjct: 451 NISPSPATSESPTSHLPGVFRVGVITFEVSISVNNSSLKPKFSEIADFIAHELDIQSAQV 510

Query: 575 RLLNFTMRGNDSLIRLAILPTGSSEFFSHATATTIIALIVEHHVQLPPTFGSYQLYPW-- 634
           RLLNF+  GN+  ++  + P  SSE+ S+ TA  I+ L+ E+ ++LP  FGSY+L  W  
Sbjct: 511 RLLNFSSSGNEYRLKWGVFPPQSSEYISNTTALNIMLLLKENRLRLPGQFGSYKLLEWKA 570

Query: 635 ---------------VVLAIIVTLILGLSALGVWIIWRRRQQSFNSYKPVNAAIPEQELQ 679
                          VV   +++L++    + + ++WRRR+Q   +Y+PVNAAI EQELQ
Sbjct: 571 EQKKKQSWWEKHLLGVVGGAMISLLVTSVMIKLALVWRRRKQEEATYEPVNAAIKEQELQ 624

BLAST of Sgr021321 vs. TAIR 10
Match: AT5G22850.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 195.3 bits (495), Expect = 1.6e-49
Identity = 128/380 (33.68%), Positives = 192/380 (50.53%), Query Frame = 0

Query: 135 DLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPK-----FDPELS 194
           D  + G Y T+L +GTPP+ F + VDTGS V +V C++C  C +    +     FDP  S
Sbjct: 74  DPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSS 133

Query: 195 STYQPVKC----------NIDCTCDSNGAQCIYERQYAEMSTSSGVLGEDIISFGN--QS 254
            T  P+ C          + D  C      C Y  QY + S +SG    D++ F     S
Sbjct: 134 VTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGS 193

Query: 255 ELVPQR---AVFGCENVETGDLY--SQHADGIMGLGSGDLSIVDQLVEKSVINDSFSLCY 314
            LVP      VFGC   +TGDL    +  DGI G G   +S++ QL  + +    FS C 
Sbjct: 194 SLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL 253

Query: 315 GGMDIGGGAMVLGGISPPSDMIFSHSDPFRDGNSPYYNVDLKEIHVAGKKLLLSPSVF-- 374
            G + GGG +VLG I  P +M+F+   P    + P+YNV+L  I V G+ L ++PSVF  
Sbjct: 254 KGENGGGGILVLGEIVEP-NMVFTPLVP----SQPHYNVNLLSISVNGQALPINPSVFST 313

Query: 375 DGKYGTVLDSGTTYAYLPEAAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSDVAE 434
               GT++D+GTT AYL EAA+  F +AI + +    +   P  +  + C+    S    
Sbjct: 314 SNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVR---PVVSKGNQCYVITTS---- 373

Query: 435 LSKTFPAVDMVFDNGQKLSLAPENYLFRHSKVHG--AYCLGIFENGNDQTTLLGGIVVRN 489
           +   FP V + F  G  + L P++YL + + V G   +C+G     N   T+LG +V+++
Sbjct: 374 VGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKD 433

BLAST of Sgr021321 vs. TAIR 10
Match: AT1G08210.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 193.4 bits (490), Expect = 6.1e-49
Identity = 130/378 (34.39%), Positives = 196/378 (51.85%), Query Frame = 0

Query: 135 DLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPK-----FDPELS 194
           D  L G Y T++ +GTPP++F + +DTGS V +V C++C  C +  + +     FDP +S
Sbjct: 77  DPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVS 136

Query: 195 STYQPVKCNIDCTCDSN---------GAQCIYERQYAEMSTSSGVLGEDIISFGN--QSE 254
           S+   V C+ D  C SN            C Y  +Y + S +SG    D +SF     S 
Sbjct: 137 SSASLVSCS-DRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITST 196

Query: 255 LVPQRA---VFGCENVETGDLY--SQHADGIMGLGSGDLSIVDQLVEKSVINDSFSLCYG 314
           L    +   VFGC N+++GDL    +  DGI GLG G LS++ QL  + +    FS C  
Sbjct: 197 LAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLK 256

Query: 315 GMDIGGGAMVLGGISPPSDMIFSHSDPFRDGNSPYYNVDLKEIHVAGKKLLLSPSVFD-- 374
           G   GGG MVLG I  P D +++   P    + P+YNV+L+ I V G+ L + PSVF   
Sbjct: 257 GDKSGGGIMVLGQIKRP-DTVYTPLVP----SQPHYNVNLQSIAVNGQILPIDPSVFTIA 316

Query: 375 GKYGTVLDSGTTYAYLPEAAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSDVAEL 434
              GT++D+GTT AYLP+ A+  F  A+    +++ + G P    +  CF     DV   
Sbjct: 317 TGDGTIIDTGTTLAYLPDEAYSPFIQAV---ANAVSQYGRPITYESYQCFEITAGDV--- 376

Query: 435 SKTFPAVDMVFDNGQKLSLAPENYL-FRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTL 489
              FP V + F  G  + L P  YL    S     +C+G     + + T+LG +V+++ +
Sbjct: 377 -DVFPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKV 436

BLAST of Sgr021321 vs. TAIR 10
Match: AT2G36670.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 176.0 bits (445), Expect = 1.0e-43
Identity = 123/383 (32.11%), Positives = 193/383 (50.39%), Query Frame = 0

Query: 142 YTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQC----GRHQDPKF------------- 201
           Y T++ +G+PP +F + +DTGS + +V CS+C  C    G   D  F             
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 164

Query: 202 --DPELSSTYQPVKCNIDCTCDSNGAQCIYERQYAEMSTSSG-----------VLGEDII 261
             DP  SS +Q         C  N  QC Y  +Y + S +SG           +LGE ++
Sbjct: 165 CSDPICSSVFQTTAAQ----CSENN-QCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLV 224

Query: 262 SFGNQSELVPQRAVFGCENVETGDL--YSQHADGIMGLGSGDLSIVDQLVEKSVINDSFS 321
           +  N S  +    VFGC   ++GDL    +  DGI G G G LS+V QL  + +    FS
Sbjct: 225 A--NSSAPI----VFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFS 284

Query: 322 LCYGGMDIGGGAMVLGGISPPSDMIFSHSDPFRDGNSPYYNVDLKEIHVAGKKLLLSPSV 381
            C  G   GGG  VLG I  P  M++S   P    + P+YN++L  I V G+ L L  +V
Sbjct: 285 HCLKGDGSGGGVFVLGEILVPG-MVYSPLVP----SQPHYNLNLLSIGVNGQMLPLDAAV 344

Query: 382 FDGK--YGTVLDSGTTYAYLPEAAFGAFKDAIMDELHSLEKIGGPDPNFNDICFSGAGSD 441
           F+     GT++D+GTT  YL + A+  F +AI    +S+ ++  P  +  + C+  + S 
Sbjct: 345 FEASNTRGTIVDTGTTLTYLVKEAYDLFLNAIS---NSVSQLVTPIISNGEQCYLVSTS- 404

Query: 442 VAELSKTFPAVDMVFDNGQKLSLAPENYLFRHSKVHGA--YCLGIFENGNDQTTLLGGIV 489
              +S  FP+V + F  G  + L P++YLF +    GA  +C+G F+   ++ T+LG +V
Sbjct: 405 ---ISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIG-FQKAPEEQTILGDLV 463

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038902862.1 | 4.4e-307 | 83.54 | aspartic proteinase CDR1-like isoform X1 [Benincasa hispida]
XP_022149434.1 | 9.7e-307 | 79.36 | aspartic proteinase-like protein 2 isoform X1 [Momordica charantia]
KAG7011039.1 | 3.8e-303 | 82.48 | Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. argyr...
KAG6571239.1 | 6.5e-303 | 82.48 | Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. soror...
XP_022985603.1 | 1.1e-302 | 82.02 | aspartic proteinase nepenthesin-1-like [Cucurbita maxima]

Match Name | E-value | Identity | Description
Q4V3D2 | 2.8e-38 | 32.53 | Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1
Q9LS40 | 3.2e-34 | 29.16 | Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP...
Q9S9K4 | 5.4e-34 | 29.85 | Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2
Q9LHE3 | 1.1e-31 | 28.61 | Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP...
Q766C2 | 5.6e-31 | 29.78 | Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...

Match Name | E-value | Identity | Description
A0A6J1D718 | 4.7e-307 | 79.36 | aspartic proteinase-like protein 2 isoform X1 OS=Momordica charantia OX=3673 GN=...
A0A6J1JE38 | 5.4e-303 | 82.02 | aspartic proteinase nepenthesin-1-like OS=Cucurbita maxima OX=3661 GN=LOC1114836...
A0A6J1EYA5 | 2.0e-302 | 82.33 | aspartic proteinase nepenthesin-1-like OS=Cucurbita moschata OX=3662 GN=LOC11143...
A0A5A7TUH1 | 8.3e-296 | 81.63 | Aspartic proteinase CDR1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_sc...
A0A1S3C7L5 | 8.3e-296 | 81.63 | aspartic proteinase CDR1-like OS=Cucumis melo OX=3656 GN=LOC103497389 PE=3 SV=1

Match Name | E-value | Identity | Description
AT3G50050.1 | 4.8e-211 | 58.37 | Eukaryotic aspartyl protease family protein
AT5G43100.1 | 6.0e-206 | 58.97 | Eukaryotic aspartyl protease family protein
AT5G22850.1 | 1.6e-49 | 33.68 | Eukaryotic aspartyl protease family protein
AT1G08210.1 | 6.1e-49 | 34.39 | Eukaryotic aspartyl protease family protein
AT2G36670.1 | 1.0e-43 | 32.11 | Eukaryotic aspartyl protease family protein
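
The Identity column in these summary tables is the percentage reported on each hit's HSP summary line, that is, identical aligned positions divided by alignment length. The following Python sketch shows one way to parse such a line and recompute the percentages. It is illustrative only: the names `HSP_STATS`, `parse_hsp_stats`, and `recomputed_pct` are hypothetical and not part of any tool used to produce this page.

```python
import re

# Matches the per-HSP summary line; "Posi?tives" also accepts the
# "Postives" spelling that appears in some report exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and reported percentages from one HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, ident_pct, pos, _pos_length, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def recomputed_pct(count, length):
    """Percentage of aligned columns, rounded to two decimals like the report."""
    return round(100.0 * count / length, 2)


# Example using the nepenthesin-2 (Q766C2) hit from this page.
stats = parse_hsp_stats(
    "Identity = 120/403 (29.78%), Positives = 182/403 (45.16%), Query Frame = 0"
)
print(stats["identity_pct"],
      recomputed_pct(stats["identities"], stats["alignment_length"]))
```

Comparing the reported and recomputed values is a quick sanity check when the alignments are copied into other tools.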
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001461 | Aspartic peptidase A1 family | PRINTS | PR00792 | PEPSIN | coord: 300..313, score: 24.48; coord: 356..367, score: 34.29; coord: 148..168, score: 49.98; coord: 459..474, score: 25.12
IPR001461 | Aspartic peptidase A1 family | PANTHER | PTHR13683 | ASPARTYL PROTEASES | coord: 71..541
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 316..491, e-value: 3.7E-45, score: 155.9
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 116..305, e-value: 3.4E-48, score: 166.2
IPR021109 | Aspartic peptidase domain superfamily | SUPERFAMILY | 50630 | Acid proteases | coord: 138..492
IPR032861 | Xylanase inhibitor, N-terminal | PFAM | PF14543 | TAXi_N | coord: 142..306, e-value: 2.1E-37, score: 129.1
IPR032799 | Xylanase inhibitor, C-terminal | PFAM | PF14541 | TAXi_C | coord: 329..482, e-value: 1.9E-26, score: 92.8
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 501..517
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 501..523
None | No IPR available | PANTHER | PTHR13683:SF805 | EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEIN | coord: 71..541
IPR001969 | Aspartic peptidase, active site | PROSITE | PS00141 | ASP_PROTEASE | coord: 157..168
IPR033121 | Peptidase family A1 domain | PROSITE | PS51767 | PEPTIDASE_A1 | coord: 142..483, score: 46.60437
IPR034161 | Pepsin-like domain, plant | CDD | cd05476 | pepsin_A_like_plant | coord: 141..487, e-value: 8.43895E-77, score: 244.865
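
The alignment coordinates above are inclusive residue ranges on the predicted protein, so simple interval arithmetic shows how the annotations relate to one another. The Python sketch below is a minimal illustration under that assumption; `parse_coord`, `contains`, and `overlap_length` are hypothetical helper names, and the ranges are copied from the InterPro entries on this page.

```python
def parse_coord(text):
    """Turn 'coord: 157..168' (or '157..168') into an inclusive (start, end) pair."""
    span = text.split(":")[-1].strip()
    start, end = span.split("..")
    return int(start), int(end)


def contains(outer, inner):
    """True if the inner range lies entirely within the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


def overlap_length(a, b):
    """Number of residues shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


# Ranges taken from the annotation entries for this gene.
active_site = parse_coord("coord: 157..168")   # PROSITE PS00141
taxi_n = parse_coord("coord: 142..306")        # PFAM PF14543
taxi_c = parse_coord("coord: 329..482")        # PFAM PF14541

print(contains(taxi_n, active_site))
print(overlap_length(taxi_n, taxi_c))
```

The same helpers can be applied to any pair of rows, for example to compare the two GENE3D ranges with the PFAM ranges.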

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr021321.1 | Sgr021321.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0045944 | positive regulation of transcription by RNA polymerase II
biological_process | GO:0006508 | proteolysis
cellular_component | GO:0016021 | integral component of membrane
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0004190 | aspartic-type endopeptidase activity
molecular_function | GO:0046983 | protein dimerization activity
molecular_function | GO:0000977 | RNA polymerase II transcription regulatory region sequence-specific DNA binding