Sgr029920 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr029920
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Proline iminopeptidase
Location: tig00153554: 1140899 .. 1148787 (-)
RNA-Seq Expression: Sgr029920
Synteny: Sgr029920
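
The Location field above packs the contig name, start, end, and strand into one string. A minimal sketch of how it could be split and the genomic span computed follows. It assumes the coordinates are 1-based and inclusive, which the record itself does not state, and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'contig: start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "contig": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("tig00153554: 1140899 .. 1148787 (-)")
print(loc["contig"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 7889 bases on the minus strand of contig tig00153554.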
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGGTTAGGCTTTTGCCCTAAGAATTCATATTCTCCTCCGTTTTTCGCCTTCTCCAATTCGCACTATCGTCACTGCCTCTGTCTCTTCCCCGTCTCTCGTCTTTCCAACCATATCTGCATCTCAGGTCTGATCTTCTTTTCCACCATTATTTTCTGTTTCTGGTATTTGATCGTCTGTAACTCTTCCTTTTATCGTTTTTCTTTATATTTTACTAGTCTACTATATTCTTTAGTTCTTGTACTGGATTTCGCAGCTTGGCTTTTTATTTCTGTACTTAACTAGTTTGTTTTCCAGAATTTTAATGGAAAAACTATGATATCAGTTCAAAGTACACAATTAATGCTTTTTAAATGTTCATTTTAAACGACGTTTTAGCTTCTTTTTTTTTCCGTAAATCATCAAATTTACAAGTTGACTAATCGGAGGAATTATTAATTACATCTCATACGTTTATTTTCGAAAGTATTATAATGCCGTGGAATGAGAATACTAAATTGTAAGAATTCGAATGTGATACTAAGTGATTTGTTATTTCTGAAAGGGAAAAAACTGATTGATTTTTTGAATTGAAGTTGCAAGATGGAATGTGCATGGAGAGGGATAGTCGAATCAGATTGACTGAAATGTTTTTCTCGAGAGTTCTTGTATGATCATCTAAATTATTTTTATTGTTTCCAAGTGAATTTGAAAATGTATTTGCCACTCTCCTTTTGGATTCAAAAGTCAAAACTCCCTTCTTGAGTTATTTTATCCAGTAGGATGGAGTGGGAGGGATAGGTGGGTCAGAAATTCATATTGTTGTATTTTGCTATTGCAGGGGGAAAAGGTTTGGTCTTAAGTGCTCATTTTGGTTATAAGAGCGATAATCAGAGTGAGTTCAAATCAGAGGACTTGATGGCTCGAGAACAGGAATTTTCAGAGACAAACAGAAACCCTTACCCACCTATAGAACCATACAGTACTGGTTTTTTGAAGGTGTCGGATCTTCATACTATTTATTGGGAGGAATCAGGGAATCCCACTGGTCATGTTAGTATATTCTACCATACAACCATGTTTGAATTTTTCCCAATTTTTTAAAAATATTGTTCTATTATTTTTTCATTTTCACTGGCTATGCCAATAATTGGCTGGTTGACTGTTTCTCCATAAAATTTAATTTGGATCTTAAAGGAAGATTTAAATTGGCTGATGGCTGTTTTTGATGTGGGAGTAGTTGCCTTAATAGCGTTTTTAACCGAGGATCATCTAGGGGGAGCACCTTTCTTTAGGCAATTGTTTGGTTGCTATTTGTCATCTTTAATGCTTTAATTAAGAAGGATACTCTTTATTTTTTTTGTAATGCATTTGACTAATCTCATTTATTTTGAGTATAAGAGAGATCACTAAAGTAATTAGTCAAATTAACATTGATGATTCATGTGACATGCTTGCTTTAACGCGCATTATGCTTTGAGAGCCTAGTCCTACATCCTTTTCGGAAGTTAAATGTTTGACTTCACAATTGCCTAATTTGAAAGTATCTGATGTGTCACCCTGTATAGTTGAAGAAGTATGCATACCTATTAGATTATTGGCATTTTTTTCCCCGGAAATTTCTGCCAACAGGTTGTAATGAACTTCTGATTAGGAAAAAGAACATGTGTCTCCAACAAAACCGTTCTAATATGACCATCCCTCATTTCTGGTTTTTACTTGAGACTTGAGAGAGAGTTACCAATGTTACTTTGGTCTTTGGGACTGTCGTGCAAACAAGTTGGAAAAACTTTGGATTATATTTGGTTGGAACTTGGAAATCAAAAGAAATATATGAATGCATTTCATCCAGTGCAAGCTGCCAGTAGCAATGAAAAATAATTAAAGCTGGCAGATGCAGCTAGCTGTATAGTTTAGGATTCTATTAAAATGGACATGCTCTTGAAGAGTTGCAGTGTTAATGGCTTGTTTTACAGATATCATTTCTGTAAAACAAACCATTTAACATTCTTCTGTACTTG
GCGTAATCAGAGATAGTAAATTGTCGAATGTGTAAAATTCAGATTCATATTAATCATCTTATTTAACTGATCTTGCAGCCGGTGGTCTTTCTACATGGGGGACCAGGGGGAGGAACTTCTCCAGGCAATAGAAGATTCTTTGACCCAGATTTTTATAGAATTATTTTGTTTGATCAGGTTTCTTCTCAATCTTTATTTTGATCATTAGTATGGTGATGGTCTATCTCTGAAGTTAGAAATTTCTCATTTCTTGCAGCGAGGTGCAGGGAAAAGTACCCCACATGCTTGCTTGGAGAATAATACCACATGGGACCTCATTGATGACATTGAGAAGCTAAGAGAACACTTGGAAATTCCAGAGTGGCAGGTCACAATTTCTTGAACTTGCTCCTTTCCTCCCCCCTGGTTACCATAATCACGATTCCTGTTTCTTGAGGCTGTGGTGTCACGTGCAGTCTGTTCAACAAATTTTTTTAGTGAGCTCAGGACTTGTTAATATTTGTGGTTGTGTAGGTCTTTGGAGGTTCCTGGGGTAGTACGCTGGCTCTTGCTTATAGTCAATCTCATCCTGAAAAGGTGCAGACATTTGCCAAGTTTACCATTCTTCTCCGCAGTGGAACTCCATATATTTGAGAAACATGCCATTAGAAGAATTTCCTCAACATTTTTGCAGTTTTGATTATATAATTTTGGTTTAATCCAAGGTTACGGGATTAGTTCTTAGAGGGATCTTTCTTCTGCGGAAAAAAGAAATTGATTGGTTCTATGAAGGTGGTGCTGCTGCTATATATCCTGATGGTATGCTTGTTCACTAATTTTTCTCTCACAGAAATATACTATGGTATGATCAATGAAATTTGTAATTTAAGAAGTGTTTAAATTTGCAACATCTGTAACATATTGAATCTAATTTGATGATATTCCAAGGGTATAAATTTTTAGATTATAATTGCTTCTAATCTTTACGATGGAGGAAGCAATTTTGTAAAAATTGGTTTGTAGTAAAATGGACATTACTGTTGTGTTCACTTAACTTTTTCTTCACATGCAGCTTGGGAGTCTTTTAGAGATCTCATTCCCGAAAGTGAGAGAGGATGTTTTGTTGATGCTTATAGTAAGAGATTAAATTCAAATGACATGGAAACCCAAGTAAGTTCTATTATCCGATTTATTAGCTGCATGGCTAGTATTTTTTCTCTTATGATTTTCCTAGATCTTTATTTTGAGTTAGTATTTGGAATCGAGTTTCTTTCTTTCAGACTTATTTGTCCACAATGTTGAACTTTTATACATCTGAAAATATGAAATATTCTGAGAATGAAATAATCAGAAGTCAGAACCATGGTCTAATATGACAAAAATATATGTGAAATAAAAAATAACAGTTTTTTTATAAAAAATTAAAAAGTGAATTTATGGGAGGTTCATAAATGGCTTACACCTGAGTGGATACTTCCCCAATCGAACACACAGACAGGACGTGTAACACGAGATGAGTCAAGAATAACCAATGGGAGACAAAATAAATGGCTCACACAACCACACACATAGGTGCGTGAGACTAGATTAGATGCTTCTGACAACGTGTAAATCCCATGAAAGACAACTCCTTTTGATAATTAACAACGGTAATTCAACTTGTCGGCTGGTGACACTCCTCTTGGGGTGGATGGTTTTTGACAAACAGTCACTCCAAAGTGGTGATTCATTTTCAGACCCTAGCCTCCCTTGGAAATGAATCTCCTGATGATACCAAAAGAAACCCTGATCCTACTCTCTGGTGCCATGTAATATAACAGAATGTTATGAGAGATTGCTCAATATTATTTTATTCACTAAAGTCATTTATATGACATACATAAGAACCCTAAACTAAACTAATAGAATGTAAAATTATAATGAAGGACATTATTGTAGCCTCCCAAAAATTAACTCCACTCCAGGCTTTTAGAAATTTTGCAATTTCATATTTGCATGTCAGAGGCAGAAGCTCTGTGGAC
AGTCGAATTAATGAATGCTCATTACATTTTGGTACAATATTACCTAGACTCTACTGAATGTCACATGTAATGTGCAGATAGTAAACTTAACCTGCATGGTTAATATTCATTTGTGCTTGTTGGCCTTTCTTCTGTTCTGTCATTTATCTTAATGAAATTTGGTTTTTATTGTAAAAGAAATTGTAAATATTCATTCATATGAGAGGAGAATCATCTGTCAGCAATGCATGTAGTGAAGATTTTGCTAAAAAGTGATAGCAGCACCTTCATCTTTATGATTGCTACATGCATAAGATTGAGTTTTTATGGCTGATGGCATATGGTAAGATTTGAAAATTGATTTTGTTTGTTGATTTAGCCATCTAACTATTCTCTGTTCTTTTGTCACAATATTTTGCATTTGATAACGAAGTTAGAAGCTGTCAAGCATGGTGAAAACAGTTTAAATATTGAGGATAGATTAATTGTCAGCCATTCTATCTGAACCTAATAAATGTTAAAATTGTATTTTGTAATAGTATTTTTAGAAGACTGCAATCTTACCATGGTTGTTTAACCGTATCTTATGCTACTGAAGTATGTCTAATTGTCTAGTATGCAGCCGCAAGAGCATGGACCAAATGGGAAATGATGACTGCTCATCTATTGCCAAATGAGGAGAACATTAAGAGAGGGGAAGATGATAATTTTTCATTGGTAAATTGTTTTTTTGCCTGTAAATTCCTATTTGATATCCGGAAATTCCCTGAACTGTCAAAGACGTGTGTTTGGAACTCTTGCAAGTTTCTATAAGTTGAGCTCTTTCTTAGAAGTAATAGTACATTTTTCTTTTCCGAAAAAGAACTGCAAAGAAGTATTTCTGTTAGATTTCTGGAAGGTTCCAGCTTTTCTACAGGCTGTAAATCTGCCAACATTTAGTGAAGTTCCTTTTTCAGTTCATATAATTATGTAGATTTGAAGGGATCGTAACTTAGATTTTATAGATGTCACTAGGTGCTTAACCCTACCATATAACGTACGGTTTTTTTTTTTTTTAATGGTATCAATTCTGTTCGTTCTATTTTTTGTAACAAAGCCGCTACTTAGTTTGATTTTAAAAATCTCATTGTTTGGACTTATTTGTTTCATGGAATAGAGTGTGTTCATCGATTACTTGGGTGCTTTGGAGATGGCCACTTCCTTTTGATCGTTTGAGTTGCCTCTGGATTTTAAGAAAAGGATGAAAAAGAATGAAGATATTGTGTGAAATACTTAAAAGTTGTGCAATTGAACATATTGTTTAAATATCAAAATCATGAAACAATTGGGAGGATGATCGGATGGTTTTGCAGTGGCAGAAAGGGAAGAAGACGGAGAGGTTTTTAGATCAATGGAGAGCTGGGTTTATTGTTTCAAAATGAAATGATGGAAGAGGAAAACTCGGAAATTACTCTTTTGGAAGGATGTGCTGATAGGTCTGGTGCAACGGAAAAAAATAGAGGAAAATGTAGAATTTCATCCTTTCTTGAGTTTAGGAGTTGGGTGTCATATAAATAGGTGAAAGTATTTTTGACGACTAGTTTCCTTGATTATCTGAAGAATTAGGGGTAAGGTTATCACCAAACCAAGTGGGAACAAAAGGAAGAAAGTCGTAGGAAGAAATTCCACTAAAGTGTGCGGAGAGTTTGGAAATTTGGTGTCTACAGTAAGTTTGAACGATAAAGGAGGCCTTTGTAGGAAGCTCTTGAGAATTCTTTAGTATTTCTAGGAGGTTTTTTGAAAAGTTTTGTGGGGAAGAAGACCATTACGTAAGTCCTTGTTCAGTGAGGATTCTTTCTTGTATTGTCTCAAAAGGGGCCCTTATAGTTTTTTGGTTTTCTTATTGTTTGGGAATCTTTGTATACACATTACGGGAGCTTTGTACTTTTCACTAATTTAATGAAAAAGTTTTTTCCTTTCTGAAAAAAGAAAAAAAAAACCATGAAACCGTTGAGCTGTCACAAGGTGGAAATAGATAGAAT
GTGTTCATAGCAAGTTGTACTCGAGATCATGCCTGTGGGAAAGCTTCTCACCATTGTGTGGAAAGATGAGATGGTATATGGGTGTAAAAAATTAGGATGTCTTTTTTGTTTTATTCTATCATTTTTCTTTAGCTATTTCGTGTTCTGAATTTTTCAACTTGAATGATAGTTATTTTGCTGGTCAGTAACTCATGATGTTTGTTTCTTTCTTTTGGTTTGACCCAGGCATTTGCAAGGATCGAAAACCATTACTTTGTAAATAAGGGGTTTTTCCCTTCTGATTCCTTTCTGCTAGATAATGTTGACAAGATACGACATATCAATGCTATAATTGTACAGGTACATTTGCCTATTCCACATATCAATGCTATAAATGTACACGTAAATGTGCCTATCCCATCCAACATATTGGCTTGGGCACCTTTTTTCATTTGCCTATTTTGGTTGATTGGATGTGATATCCAACCTTTTGTTTTACTATTGAAAGGTTTCTTGTTTCCTATTAAAGGAAAGAAATTTGAGAATTTGAGGCAGTCATTTTCAAGTTTGGAGTTATAGTTTTATGTGGTCTATGAAACAAAGAGTATTTTGATATGCCTTTGATTTCCCTTGTTAGTGCTATAAATGGTAAGCTTTGATGGCTTGTTTGAGTGTCAATTTTGTAAGATTAATTTATGACTTTAGGTGTGATAACACTAACTACATGAGATGTACTTCTCTTCCTCGATATCTTTTGTGATGGTCGTGAATTTATTTTCTTTCCTTTTCTTTGTTATCATGTCATCCAAACATTGTTGGAAGAAGTGATAACTAAAAATTTAAAATGTTATCAAGTTAATTCTGTGGTTTTGTGTCATTGATCACATACCCGTCAGGACTGAGTAGAATCTGAGCTAGATAACAAGATACGCAGCTACTTTTTATCTTCATTTTGCAGGGAAGATATGATGTTTGCTGCCCCATGATGTCAGCTTGGGATCTTCATAAAGTGTGGCCAGAGGCTGAGTTAAAGGTAAAAGAAACTGCTGCGGCCTATTTCTTTGTCCAATATAGGTTCCTATTTCCCAACACTTCCCTTTCATTGGGCTTGGATGCAAATATCAACATGGAACTAGGAATTTATGGCATGATATTTGGACTATTATTTAACTTAATGTCCATACTTATGAGTTTATTATTTTCACTCTGCATCAAGGATCAAGAACCAAATTGTGGATTATGTACTTTCATTGTGGACGACATTAATAGAACATAGATGTAGCGTGTATTCTAACGGCCCTCAAACTGATTCTTTTTCCTGGTAGGAAACAATTTTATAGAAGTGACGAAATTACAAAAGAGAGGAGGAAAACCAAATCTCGTGCCAAAGGAGAGTAAAAAATATTCTTCCAATTGATCAAAAGAGAATACTATTATTGTTGAAAGAAGAAGAAAATTTACAAAGAAAGCCAAAAACGTTAAAAGATTCTATAAACAAATGACAGATTTTTCTTTGTTGTTGAAGATACTTTTGTTCCTTTCAAGCCAAAGGATCCAAAAAGAAAAAAAAGTCTCTAACAAAGTGTAGCCATATGACCTTGAAGCAAGCACTCGGGATGACAAATCTGTACCATTTTCCTGATGCGTTCCTATATATATTTTTGTATCATTAGATTATTCTCAAATGTTGGTGAATCTTTTCTTGCTGAAAAGAAAACGAATAAAAAAAGTAGTTGCTGAATATTTTCTTGCTGACAAGAAAAAGAATAAAAAAAGTAGTTGCTGAATCTTACCTCTCTACGCAGATCATTCCAGACGCAGGCCATTCAGCTAATGAACCTGGAATAGCTGCAGAGCTTGTCGCCGCAAATGAGAATCTGAAGAACATTCTCCAGAAGAATGGACCATAA

mRNA sequence

ATGAGGTTAGGCTTTTGCCCTAAGAATTCATATTCTCCTCCGTTTTTCGCCTTCTCCAATTCGCACTATCGTCACTGCCTCTGTCTCTTCCCCGTCTCTCGTCTTTCCAACCATATCTGCATCTCAGGGGGAAAAGGTTTGGTCTTAAGTGCTCATTTTGGTTATAAGAGCGATAATCAGAGTGAGTTCAAATCAGAGGACTTGATGGCTCGAGAACAGGAATTTTCAGAGACAAACAGAAACCCTTACCCACCTATAGAACCATACAGTACTGGTTTTTTGAAGGTGTCGGATCTTCATACTATTTATTGGGAGGAATCAGGGAATCCCACTGGTCATCCGGTGGTCTTTCTACATGGGGGACCAGGGGGAGGAACTTCTCCAGGCAATAGAAGATTCTTTGACCCAGATTTTTATAGAATTATTTTGTTTGATCAGCGAGGTGCAGGGAAAAGTACCCCACATGCTTGCTTGGAGAATAATACCACATGGGACCTCATTGATGACATTGAGAAGCTAAGAGAACACTTGGAAATTCCAGAGTGGCAGGTCTTTGGAGGTTCCTGGGGTAGTACGCTGGCTCTTGCTTATAGTCAATCTCATCCTGAAAAGGTTACGGGATTAGTTCTTAGAGGGATCTTTCTTCTGCGGAAAAAAGAAATTGATTGGTTCTATGAAGGTGGTGCTGCTGCTATATATCCTGATGCTTGGGAGTCTTTTAGAGATCTCATTCCCGAAAGTGAGAGAGGATGTTTTGTTGATGCTTATAGTAAGAGATTAAATTCAAATGACATGGAAACCCAATATGCAGCCGCAAGAGCATGGACCAAATGGGAAATGATGACTGCTCATCTATTGCCAAATGAGGAGAACATTAAGAGAGGGGAAGATGATAATTTTTCATTGGCATTTGCAAGGATCGAAAACCATTACTTTGTAAATAAGGGGTTTTTCCCTTCTGATTCCTTTCTGCTAGATAATGTTGACAAGATACGACATATCAATGCTATAATTGTACAGGGAAGATATGATGTTTGCTGCCCCATGATGTCAGCTTGGGATCTTCATAAAGTGTGGCCAGAGGCTGAGTTAAAGATCATTCCAGACGCAGGCCATTCAGCTAATGAACCTGGAATAGCTGCAGAGCTTGTCGCCGCAAATGAGAATCTGAAGAACATTCTCCAGAAGAATGGACCATAA

Coding sequence (CDS)

ATGAGGTTAGGCTTTTGCCCTAAGAATTCATATTCTCCTCCGTTTTTCGCCTTCTCCAATTCGCACTATCGTCACTGCCTCTGTCTCTTCCCCGTCTCTCGTCTTTCCAACCATATCTGCATCTCAGGGGGAAAAGGTTTGGTCTTAAGTGCTCATTTTGGTTATAAGAGCGATAATCAGAGTGAGTTCAAATCAGAGGACTTGATGGCTCGAGAACAGGAATTTTCAGAGACAAACAGAAACCCTTACCCACCTATAGAACCATACAGTACTGGTTTTTTGAAGGTGTCGGATCTTCATACTATTTATTGGGAGGAATCAGGGAATCCCACTGGTCATCCGGTGGTCTTTCTACATGGGGGACCAGGGGGAGGAACTTCTCCAGGCAATAGAAGATTCTTTGACCCAGATTTTTATAGAATTATTTTGTTTGATCAGCGAGGTGCAGGGAAAAGTACCCCACATGCTTGCTTGGAGAATAATACCACATGGGACCTCATTGATGACATTGAGAAGCTAAGAGAACACTTGGAAATTCCAGAGTGGCAGGTCTTTGGAGGTTCCTGGGGTAGTACGCTGGCTCTTGCTTATAGTCAATCTCATCCTGAAAAGGTTACGGGATTAGTTCTTAGAGGGATCTTTCTTCTGCGGAAAAAAGAAATTGATTGGTTCTATGAAGGTGGTGCTGCTGCTATATATCCTGATGCTTGGGAGTCTTTTAGAGATCTCATTCCCGAAAGTGAGAGAGGATGTTTTGTTGATGCTTATAGTAAGAGATTAAATTCAAATGACATGGAAACCCAATATGCAGCCGCAAGAGCATGGACCAAATGGGAAATGATGACTGCTCATCTATTGCCAAATGAGGAGAACATTAAGAGAGGGGAAGATGATAATTTTTCATTGGCATTTGCAAGGATCGAAAACCATTACTTTGTAAATAAGGGGTTTTTCCCTTCTGATTCCTTTCTGCTAGATAATGTTGACAAGATACGACATATCAATGCTATAATTGTACAGGGAAGATATGATGTTTGCTGCCCCATGATGTCAGCTTGGGATCTTCATAAAGTGTGGCCAGAGGCTGAGTTAAAGATCATTCCAGACGCAGGCCATTCAGCTAATGAACCTGGAATAGCTGCAGAGCTTGTCGCCGCAAATGAGAATCTGAAGAACATTCTCCAGAAGAATGGACCATAA

Protein sequence

MRLGFCPKNSYSPPFFAFSNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP
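
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. The sketch below assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a plant nuclear gene but is not stated in this record. `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS above.
print(translate("ATGAGGTTAGGCTTTTGCCCTAAG"))
# Last four codons of the CDS above, including the TAA stop.
print(translate("AATGGACCATAA"))
```

The two fragments translate to `MRLGFCPK` and `NGP`, matching the start and end of the protein sequence listed above.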
Homology
BLAST of Sgr029920 vs. NCBI nr
Match: XP_022149275.1 (proline iminopeptidase [Momordica charantia])

HSP 1 Score: 770.4 bits (1988), Expect = 7.6e-219
Identity = 361/400 (90.25%), Positives = 376/400 (94.00%), Query Frame = 0

Query: 1   MRLGFCPKNSYSPPFFAFSNSH-YRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDN 60
           M LGFCP  S S PF + SNSH  RHC+ L  VSR+SNH  +SGGKGLVLSAHFGYKSD 
Sbjct: 1   MSLGFCPNISSSHPFSSVSNSHSRRHCIRLCSVSRVSNHFSVSGGKGLVLSAHFGYKSDR 60

Query: 61  QSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLH 120
            SEF++EDLMARE+E SE NRNPYPPIEPYS GFLKVSD+HTIYWE+SGNP GHPVVFLH
Sbjct: 61  LSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLH 120

Query: 121 GGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEI 180
           GGPGGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTWDLIDDIEKLREHLEI
Sbjct: 121 GGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI 180

Query: 181 PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWES 240
           PEWQVFGGSWGSTLALAY QSHPEKVTGLVLRGIFLLRKKE+DWFYEGGAAAIYPDAWES
Sbjct: 181 PEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWES 240

Query: 241 FRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN 300
           FRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHL+PNEENIKRGEDDN
Sbjct: 241 FRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDN 300

Query: 301 FSLAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVW 360
           FSLAFARIENHYF+NKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVW
Sbjct: 301 FSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVW 360

Query: 361 PEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP 400
           PEAELKII +AGHSANEPGIAAELVAANE LKNILQKN P
Sbjct: 361 PEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP 400

BLAST of Sgr029920 vs. NCBI nr
Match: XP_023552784.1 (proline iminopeptidase [Cucurbita pepo subsp. pepo])

HSP 1 Score: 770.0 bits (1987), Expect = 1.0e-218
Identity = 355/398 (89.20%), Positives = 376/398 (94.47%), Query Frame = 0

Query: 3   LGFCPKNSYSPPFFAF-SNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQS 62
           LG CP NS++ P  +F SNSHYRHC  LFPVSR+SNH C+SGGKGLVL+A FGYKSD+QS
Sbjct: 6   LGLCPNNSFAAPLLSFVSNSHYRHCPRLFPVSRVSNHFCVSGGKGLVLAAQFGYKSDSQS 65

Query: 63  EFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGG 122
           EF+ +DLMA E+E    N+ PYPPIEPYSTG LKVSDLHTIYWE+SGNP GHPVVFLHGG
Sbjct: 66  EFQRKDLMAGEKEIPGINKTPYPPIEPYSTGLLKVSDLHTIYWEQSGNPAGHPVVFLHGG 125

Query: 123 PGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPE 182
           PGGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTWDLIDDIEKLREHL+IPE
Sbjct: 126 PGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLKIPE 185

Query: 183 WQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFR 242
           WQVFGGSWGSTLALAYSQ+HPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FR
Sbjct: 186 WQVFGGSWGSTLALAYSQTHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWETFR 245

Query: 243 DLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS 302
           DLIPESERGCFVDAY KRLNS+DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FS
Sbjct: 246 DLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDKFS 305

Query: 303 LAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPE 362
           LAFARIENHYFVNKGFFPSDSFLLDN+DKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPE
Sbjct: 306 LAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPE 365

Query: 363 AELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP 400
           AELKIIPDAGHSANEPG+AAELVAANE LKNILQKNGP
Sbjct: 366 AELKIIPDAGHSANEPGVAAELVAANEKLKNILQKNGP 403

BLAST of Sgr029920 vs. NCBI nr
Match: XP_038905843.1 (proline iminopeptidase isoform X1 [Benincasa hispida])

HSP 1 Score: 769.6 bits (1986), Expect = 1.3e-218
Identity = 357/396 (90.15%), Positives = 373/396 (94.19%), Query Frame = 0

Query: 3   LGFCPKNSYSPPFFAFSNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSE 62
           LG CP NS SP F   SN H+RHCL LFPV R+SNH C+ GGKGL L+AHFGYKSD+QSE
Sbjct: 4   LGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVPGGKGLALTAHFGYKSDSQSE 63

Query: 63  FKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGP 122
           F+ +DLMA E+E S  NRNPYPPIEPYSTGFLKVSDLHTIYWE+SGNP GHPVVFLHGGP
Sbjct: 64  FQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGP 123

Query: 123 GGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEW 182
           GGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTW+LIDDIEKLREHLEIPEW
Sbjct: 124 GGGTTPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEW 183

Query: 183 QVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRD 242
           QVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRD
Sbjct: 184 QVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRD 243

Query: 243 LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL 302
           LIPESERGCFVDAY KRLNS DMETQYAAARAWTKWEMMTAHLLPNEENIKRG+DDNFSL
Sbjct: 244 LIPESERGCFVDAYYKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGDDDNFSL 303

Query: 303 AFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEA 362
           AFARIENHYFVNKGFFPSDSFLLDN+DKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEA
Sbjct: 304 AFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEA 363

Query: 363 ELKIIPDAGHSANEPGIAAELVAANENLKNILQKNG 399
           ELKII +AGHSANEPGIAAELVAANE LKNILQKNG
Sbjct: 364 ELKIISNAGHSANEPGIAAELVAANEKLKNILQKNG 399

BLAST of Sgr029920 vs. NCBI nr
Match: KAG7014918.1 (Proline iminopeptidase [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 766.5 bits (1978), Expect = 1.1e-217
Identity = 354/398 (88.94%), Positives = 374/398 (93.97%), Query Frame = 0

Query: 3   LGFCPKNSYSPPFFAF-SNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQS 62
           LG CP NS++ P  +F SN HYRHC  LFPVSR+SNH C+SGGKGLVL+A FGYKSD+QS
Sbjct: 4   LGLCPNNSFAAPLLSFVSNLHYRHCPRLFPVSRVSNHFCVSGGKGLVLAAQFGYKSDSQS 63

Query: 63  EFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGG 122
           EF+ +DLMA E+E    N+ PYPPIEPYSTG LKVSDLHTIYWE+SGNP GHPVVFLHGG
Sbjct: 64  EFQRKDLMAGEKEIPGINKTPYPPIEPYSTGLLKVSDLHTIYWEQSGNPAGHPVVFLHGG 123

Query: 123 PGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPE 182
           PGGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLI DIEKLREHL+IPE
Sbjct: 124 PGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIADIEKLREHLKIPE 183

Query: 183 WQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFR 242
           WQVFGGSWGSTLALAYSQ+HPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FR
Sbjct: 184 WQVFGGSWGSTLALAYSQTHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWETFR 243

Query: 243 DLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS 302
           DLIPESERGCFVDAY KRLNS+DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FS
Sbjct: 244 DLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDKFS 303

Query: 303 LAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPE 362
           LAFARIENHYFVNKGFFPSDSFLLDN+DKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPE
Sbjct: 304 LAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPE 363

Query: 363 AELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP 400
           AELKIIPDAGHSANEPG+AAELVAANE LKNILQKNGP
Sbjct: 364 AELKIIPDAGHSANEPGVAAELVAANEKLKNILQKNGP 401

BLAST of Sgr029920 vs. NCBI nr
Match: XP_022922976.1 (proline iminopeptidase [Cucurbita moschata])

HSP 1 Score: 765.0 bits (1974), Expect = 3.2e-217
Identity = 353/398 (88.69%), Positives = 373/398 (93.72%), Query Frame = 0

Query: 3   LGFCPKNSYSPPFFAF-SNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQS 62
           LG CP NS++ P  +F SN HYRHC  LFPVSR+SNH C+SGGKGLVL+A FGYKSD+QS
Sbjct: 4   LGLCPNNSFAAPLLSFVSNLHYRHCPRLFPVSRVSNHFCVSGGKGLVLAAQFGYKSDSQS 63

Query: 63  EFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGG 122
           EF+ +DLMA E+E    N+ PYPPIEPYSTG LKVSDLHTIYWE+SGNP GHPVVFLHGG
Sbjct: 64  EFQRKDLMAGEKEIPGINKTPYPPIEPYSTGLLKVSDLHTIYWEQSGNPAGHPVVFLHGG 123

Query: 123 PGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPE 182
           PGGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLI DIEKLREHL+IPE
Sbjct: 124 PGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIADIEKLREHLKIPE 183

Query: 183 WQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFR 242
           WQVFGGSWGSTLALAYSQ+HPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FR
Sbjct: 184 WQVFGGSWGSTLALAYSQTHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWETFR 243

Query: 243 DLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS 302
           DLIPESERGCFVDAY KRLNS+DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FS
Sbjct: 244 DLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDKFS 303

Query: 303 LAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPE 362
           LAFARIENHYFVNKGFFPSDSFLLDN+DKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPE
Sbjct: 304 LAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPE 363

Query: 363 AELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP 400
           AELKIIPDAGHSANEPG+AAELVA NE LKNILQKNGP
Sbjct: 364 AELKIIPDAGHSANEPGVAAELVAGNEKLKNILQKNGP 401

BLAST of Sgr029920 vs. ExPASy Swiss-Prot
Match: P93732 (Proline iminopeptidase OS=Arabidopsis thaliana OX=3702 GN=PIP PE=2 SV=3)

HSP 1 Score: 587.8 bits (1514), Expect = 9.2e-167
Identity = 277/370 (74.86%), Positives = 311/370 (84.05%), Query Frame = 0

Query: 26  CLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPP 85
           C+  FP +  + ++   G + + +S   G KS+     KS+ +   E E     R  Y P
Sbjct: 15  CVRFFPSNHNNLNLLFPGQRKIQVSC--GGKSE---VLKSDTMEPHEAETFVNKRTLYAP 74

Query: 86  IEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFD 145
           IEPYS+G LKVSD+HT+YWE+SG P GHPVVFLHGGPGGGT+P NRRFFDP+FYRI+LFD
Sbjct: 75  IEPYSSGNLKVSDVHTLYWEQSGKPDGHPVVFLHGGPGGGTAPSNRRFFDPEFYRIVLFD 134

Query: 146 QRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKV 205
           QRGAGKSTPHACLE NTTWDL++DIEKLREHL+IPEW VFGGSWGSTLALAYSQSHP+KV
Sbjct: 135 QRGAGKSTPHACLEENTTWDLVNDIEKLREHLKIPEWLVFGGSWGSTLALAYSQSHPDKV 194

Query: 206 TGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSND 265
           TGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS+D
Sbjct: 195 TGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWEEFRDLIPENERGSSLVDAYHKRLNSDD 254

Query: 266 METQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFL 325
           +E QYAAARAWTKWEMMTA+L PN EN+++ EDD FSLAFARIENHYFVNKGFFPSDS L
Sbjct: 255 LEIQYAAARAWTKWEMMTAYLRPNLENVQKAEDDKFSLAFARIENHYFVNKGFFPSDSHL 314

Query: 326 LDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELV 385
           LDNVDKIRHI   IVQGRYDVCCPMMSAWDLHK WPEAELKI+ DAGHSANEPGI+AELV
Sbjct: 315 LDNVDKIRHIKTTIVQGRYDVCCPMMSAWDLHKAWPEAELKIVYDAGHSANEPGISAELV 374

Query: 386 AANENLKNIL 395
            ANE +K ++
Sbjct: 375 VANEKMKALM 379

BLAST of Sgr029920 vs. ExPASy Swiss-Prot
Match: O83041 (Probable proline iminopeptidase OS=Leptolyngbya boryana OX=1184 GN=pip PE=3 SV=1)

HSP 1 Score: 409.1 bits (1050), Expect = 5.9e-113
Identity = 189/309 (61.17%), Positives = 228/309 (73.79%), Query Frame = 0

Query: 80  RNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFY 139
           R  YP I PY +G L VS LHTIY+E+SGNP G PVVFLHGGPGGGT P  R++FDP  +
Sbjct: 2   RQLYPAIAPYQSGMLPVSALHTIYYEQSGNPNGKPVVFLHGGPGGGTIPTYRQYFDPSKW 61

Query: 140 RIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ 199
           RIILFDQRGAGKSTPHA L  NTTWDL+ DIEKLR HL I  W VFGGSWGSTL+LAYSQ
Sbjct: 62  RIILFDQRGAGKSTPHAELRENTTWDLVSDIEKLRSHLNIDRWFVFGGSWGSTLSLAYSQ 121

Query: 200 SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKR 259
           +HP++  GL+LRGIFLLR+KEI WFY+ GA+ I+PDAWE + + IP  ER   + AY +R
Sbjct: 122 THPDRCLGLILRGIFLLRRKEILWFYQDGASWIFPDAWEHYLEPIPPEERDDMISAYYRR 181

Query: 260 LNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFP 319
           L S D E +  AA+AW+ WE  T+ L+ +     +  DD F+ AFARIE HYF+N+GFF 
Sbjct: 182 LTSKDAEIRSTAAKAWSVWEGTTSRLIVDPSLQSKFADDEFADAFARIECHYFINRGFFE 241

Query: 320 SDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGI 379
           +D  LL N D+I HI  +IVQGRYDV CPM SAW LHK  PE+EL ++PDAGHS  E GI
Sbjct: 242 TDDQLLQNCDRIAHIPTVIVQGRYDVVCPMTSAWALHKALPESELIVVPDAGHSMMEAGI 301

Query: 380 AAELVAANE 389
            + L+ A +
Sbjct: 302 LSALIDATD 310

BLAST of Sgr029920 vs. ExPASy Swiss-Prot
Match: Q9PD69 (Proline iminopeptidase OS=Xylella fastidiosa (strain 9a5c) OX=160492 GN=pip PE=3 SV=1)

HSP 1 Score: 383.3 bits (983), Expect = 3.4e-105
Identity = 178/310 (57.42%), Positives = 225/310 (72.58%), Query Frame = 0

Query: 80  RNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFY 139
           R  YP + P+  G L V D H +Y+E+ GNP G PVV LHGGPG G +   RRF DPD Y
Sbjct: 2   RTLYPEVTPFEHGILCVDDNHRLYYEQCGNPHGKPVVILHGGPGSGCNDKMRRFHDPDKY 61

Query: 140 RIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ 199
           RI+LFDQRGAG+STPHA L NNTTWDL+ DIEKLR  L I  WQVFGGSWGSTLALAY+Q
Sbjct: 62  RIVLFDQRGAGRSTPHANLTNNTTWDLVADIEKLRVALGITRWQVFGGSWGSTLALAYAQ 121

Query: 200 SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKR 259
           +HPE+ T LVLRGIF+LR+ E++WFY+ GA+ ++PDAW+ +  +IP  ER   + A+ +R
Sbjct: 122 THPEQTTELVLRGIFMLRRWELEWFYQEGASHLFPDAWDRYIAVIPPVERHDLISAFHRR 181

Query: 260 LNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFP 319
           L S D  T+ AAA+AW+ WE  T+ L  +++ I   E+ +F+LAFARIENHYFVN GFF 
Sbjct: 182 LTSEDEATRLAAAQAWSLWEGATSCLYMDQDFIASHENPHFALAFARIENHYFVNGGFFE 241

Query: 320 SDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGI 379
            ++ LL +  +I +I  +IV GRYDV CP+ +AWDLHKVWP+A LKI P AGHSA EP  
Sbjct: 242 VENQLLRDAQRIANIPGVIVHGRYDVVCPLQNAWDLHKVWPKASLKITPGAGHSAFEPQN 301

Query: 380 AAELVAANEN 390
              LV A ++
Sbjct: 302 IDALVCATDS 311

BLAST of Sgr029920 vs. ExPASy Swiss-Prot
Match: O32449 (Proline iminopeptidase OS=Serratia marcescens OX=615 GN=pip PE=1 SV=1)

HSP 1 Score: 382.5 bits (981), Expect = 5.9e-105
Identity = 173/312 (55.45%), Positives = 222/312 (71.15%), Query Frame = 0

Query: 77  ETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDP 136
           E  R  YPP+  Y +G+L   D H IYWE SGNP G P VF+HGGPGGG SP +R+ FDP
Sbjct: 2   EQLRGLYPPLAAYDSGWLDTGDGHRIYWELSGNPNGKPAVFIHGGPGGGISPHHRQLFDP 61

Query: 137 DFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALA 196
           + Y+++LFDQRG G+S PHA L+NNTTW L+ DIE+LRE   + +W VFGGSWGSTLALA
Sbjct: 62  ERYKVLLFDQRGCGRSRPHASLDNNTTWHLVADIERLREMAGVEQWLVFGGSWGSTLALA 121

Query: 197 YSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAY 256
           Y+Q+HPE+V+ +VLRGIF LRK+ + W+Y+ GA+  +P+ WE    ++ + ER   + AY
Sbjct: 122 YAQTHPERVSEMVLRGIFTLRKQRLHWYYQDGASRFFPEKWERVLSILSDDERKDVIAAY 181

Query: 257 SKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKG 316
            +RL S D + Q  AA+ W+ WE  T  LLP+ E+   GEDD F+LAFARIENHYF + G
Sbjct: 182 RQRLTSADPQVQLEAAKLWSVWEGETVTLLPSRESASFGEDD-FALAFARIENHYFTHLG 241

Query: 317 FFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANE 376
           F  SD  LL NV  IRHI A+IV GRYD+ C + +AWDL K WPEAEL I+  AGHS +E
Sbjct: 242 FLESDDQLLRNVPLIRHIPAVIVHGRYDMACQVQNAWDLAKAWPEAELHIVEGAGHSYDE 301

Query: 377 PGIAAELVAANE 389
           PGI  +L+ A +
Sbjct: 302 PGILHQLMIATD 312

BLAST of Sgr029920 vs. ExPASy Swiss-Prot
Match: Q87DF8 (Proline iminopeptidase OS=Xylella fastidiosa (strain Temecula1 / ATCC 700964) OX=183190 GN=pip PE=3 SV=1)

HSP 1 Score: 381.7 bits (979), Expect = 1.0e-104
Identity = 177/310 (57.10%), Positives = 223/310 (71.94%), Query Frame = 0

Query: 80  RNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFY 139
           R  YP + P+  G L V D H +Y+E+ GNP G PVV LHGGPGGG +   RRF DPD Y
Sbjct: 2   RTLYPEVTPFDHGMLCVDDSHRLYYEQCGNPHGKPVVILHGGPGGGCNDKMRRFHDPDKY 61

Query: 140 RIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ 199
           RI+LFDQRGAG+S PHA L NNTTWDL+ DIEKLR  L I  WQVFGGSWGSTLALAY+Q
Sbjct: 62  RIVLFDQRGAGRSMPHANLTNNTTWDLVADIEKLRVALGITRWQVFGGSWGSTLALAYAQ 121

Query: 200 SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKR 259
           +HPE+ T LVLRGIF+LR+ E++WFY+ GA+ ++PDAW+ +   IP  ER   + A+ +R
Sbjct: 122 THPEQTTELVLRGIFMLRRWELEWFYQEGASRLFPDAWDRYIAAIPPVERHDLISAFHRR 181

Query: 260 LNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFP 319
           L S+D  T+ AAA+AW+ WE  T+ L  +++ I   E+ +F+LAFARIENHYFVN GFF 
Sbjct: 182 LTSDDEATRLAAAQAWSLWEGATSCLYMDQDFIASHENPHFALAFARIENHYFVNGGFFE 241

Query: 320 SDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGI 379
            +  LL +  +I +I  +IV GRYDV CP+ +AWDLHK WP+A LKI P AGHSA EP  
Sbjct: 242 VEDQLLRDAQRIANIPGVIVHGRYDVVCPLQNAWDLHKAWPKASLKITPGAGHSAFEPQN 301

Query: 380 AAELVAANEN 390
              LV A ++
Sbjct: 302 IDALVCATDS 311

BLAST of Sgr029920 vs. ExPASy TrEMBL
Match: A0A6J1D5A1 (Proline iminopeptidase OS=Momordica charantia OX=3673 GN=LOC111017736 PE=3 SV=1)

HSP 1 Score: 770.4 bits (1988), Expect = 3.7e-219
Identity = 361/400 (90.25%), Positives = 376/400 (94.00%), Query Frame = 0

Query: 1   MRLGFCPKNSYSPPFFAFSNSH-YRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDN 60
           M LGFCP  S S PF + SNSH  RHC+ L  VSR+SNH  +SGGKGLVLSAHFGYKSD 
Sbjct: 1   MSLGFCPNISSSHPFSSVSNSHSRRHCIRLCSVSRVSNHFSVSGGKGLVLSAHFGYKSDR 60

Query: 61  QSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLH 120
            SEF++EDLMARE+E SE NRNPYPPIEPYS GFLKVSD+HTIYWE+SGNP GHPVVFLH
Sbjct: 61  LSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLH 120

Query: 121 GGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEI 180
           GGPGGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTWDLIDDIEKLREHLEI
Sbjct: 121 GGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI 180

Query: 181 PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWES 240
           PEWQVFGGSWGSTLALAY QSHPEKVTGLVLRGIFLLRKKE+DWFYEGGAAAIYPDAWES
Sbjct: 181 PEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWES 240

Query: 241 FRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN 300
           FRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHL+PNEENIKRGEDDN
Sbjct: 241 FRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDN 300

Query: 301 FSLAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVW 360
           FSLAFARIENHYF+NKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVW
Sbjct: 301 FSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVW 360

Query: 361 PEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP 400
           PEAELKII +AGHSANEPGIAAELVAANE LKNILQKN P
Sbjct: 361 PEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP 400

BLAST of Sgr029920 vs. ExPASy TrEMBL
Match: A0A6J1E5K8 (Proline iminopeptidase OS=Cucurbita moschata OX=3662 GN=LOC111430799 PE=3 SV=1)

HSP 1 Score: 765.0 bits (1974), Expect = 1.6e-217
Identity = 353/398 (88.69%), Positives = 373/398 (93.72%), Query Frame = 0

Query: 3   LGFCPKNSYSPPFFAF-SNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQS 62
           LG CP NS++ P  +F SN HYRHC  LFPVSR+SNH C+SGGKGLVL+A FGYKSD+QS
Sbjct: 4   LGLCPNNSFAAPLLSFVSNLHYRHCPRLFPVSRVSNHFCVSGGKGLVLAAQFGYKSDSQS 63

Query: 63  EFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGG 122
           EF+ +DLMA E+E    N+ PYPPIEPYSTG LKVSDLHTIYWE+SGNP GHPVVFLHGG
Sbjct: 64  EFQRKDLMAGEKEIPGINKTPYPPIEPYSTGLLKVSDLHTIYWEQSGNPAGHPVVFLHGG 123

Query: 123 PGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPE 182
           PGGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLI DIEKLREHL+IPE
Sbjct: 124 PGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIADIEKLREHLKIPE 183

Query: 183 WQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFR 242
           WQVFGGSWGSTLALAYSQ+HPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FR
Sbjct: 184 WQVFGGSWGSTLALAYSQTHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWETFR 243

Query: 243 DLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS 302
           DLIPESERGCFVDAY KRLNS+DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FS
Sbjct: 244 DLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDKFS 303

Query: 303 LAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPE 362
           LAFARIENHYFVNKGFFPSDSFLLDN+DKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPE
Sbjct: 304 LAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPE 363

Query: 363 AELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP 400
           AELKIIPDAGHSANEPG+AAELVA NE LKNILQKNGP
Sbjct: 364 AELKIIPDAGHSANEPGVAAELVAGNEKLKNILQKNGP 401

BLAST of Sgr029920 vs. ExPASy TrEMBL
Match: A0A6J1J3D7 (Proline iminopeptidase OS=Cucurbita maxima OX=3661 GN=LOC111483031 PE=3 SV=1)

HSP 1 Score: 761.1 bits (1964), Expect = 2.2e-216
Identity = 351/398 (88.19%), Positives = 373/398 (93.72%), Query Frame = 0

Query: 3   LGFCPKNSYSPPFFAF-SNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQS 62
           LG CP NS++ P  +F SNSHYRHC  LFPVSR+ N  C+SGGKGLVL+A FGYKSD+QS
Sbjct: 4   LGLCPNNSFAAPLLSFISNSHYRHCPRLFPVSRVYNRFCVSGGKGLVLAAQFGYKSDSQS 63

Query: 63  EFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGG 122
           +F+ +DLMA E+E   TN+ PYPPIEPYSTG LKVSDLHTIYWE+SGNP GHPVVFLHGG
Sbjct: 64  DFQRKDLMAGEKEIPGTNKTPYPPIEPYSTGLLKVSDLHTIYWEQSGNPAGHPVVFLHGG 123

Query: 123 PGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPE 182
           PGGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLI DIEKLREHL+IPE
Sbjct: 124 PGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIADIEKLREHLKIPE 183

Query: 183 WQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFR 242
           WQVFGGSWGSTLALAYSQ+HPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FR
Sbjct: 184 WQVFGGSWGSTLALAYSQTHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWETFR 243

Query: 243 DLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS 302
           DLIPESERGCFVDAY KRLNS+DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FS
Sbjct: 244 DLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDKFS 303

Query: 303 LAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPE 362
           LAFARIENHYFV+KGFFPSDSFLLDN+DKIRHINA+IVQGRYDVCCPMMSAWDLHK WPE
Sbjct: 304 LAFARIENHYFVHKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKAWPE 363

Query: 363 AELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP 400
           AELKIIPDAGHSANEPG+AAELVAANE LKNILQKNGP
Sbjct: 364 AELKIIPDAGHSANEPGVAAELVAANEKLKNILQKNGP 401

BLAST of Sgr029920 vs. ExPASy TrEMBL
Match: A0A0A0KW74 (Proline iminopeptidase OS=Cucumis sativus OX=3659 GN=Csa_4G179110 PE=3 SV=1)

HSP 1 Score: 753.1 bits (1943), Expect = 6.1e-214
Identity = 355/397 (89.42%), Positives = 367/397 (92.44%), Query Frame = 0

Query: 3   LGFCPKNSYSPPFFAFSNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSE 62
           LG CP NS SP F  FSNSH R      PV RLSN  C+SG KG V +A  GYKSD+QSE
Sbjct: 4   LGLCPNNSSSPLFSFFSNSHLR-----LPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSE 63

Query: 63  FKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGP 122
           F+ +DLMA E+E S   RNPYPPIEPYSTGFLKVSDLHTIYWE+SGNPTGHPVVFLHGGP
Sbjct: 64  FQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGP 123

Query: 123 GGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEW 182
           GGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTW+LIDDIEKLREHLEIPEW
Sbjct: 124 GGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEW 183

Query: 183 QVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRD 242
           QVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRD
Sbjct: 184 QVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRD 243

Query: 243 LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL 302
           LIPESERGCFVDAYSKRLNS DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL
Sbjct: 244 LIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL 303

Query: 303 AFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEA 362
           AFARIENHYFVNKGFFPSDSFLLDN+DKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEA
Sbjct: 304 AFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEA 363

Query: 363 ELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP 400
           ELKII DAGHSANEPGIAAELVAANE LKNILQKNGP
Sbjct: 364 ELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP 395

BLAST of Sgr029920 vs. ExPASy TrEMBL
Match: A0A5D3E1M3 (Proline iminopeptidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold655G00220 PE=3 SV=1)

HSP 1 Score: 751.9 bits (1940), Expect = 1.4e-213
Identity = 354/397 (89.17%), Positives = 367/397 (92.44%), Query Frame = 0

Query: 3   LGFCPKNSYSPPFFAFSNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSE 62
           LG CP NS S P F+FSN H+R      PV RL NH C+ G KG V +A  GYKSD QSE
Sbjct: 4   LGLCPNNSPS-PLFSFSNFHFR-----LPVPRLYNHCCLKGAKGPVFTAQLGYKSDRQSE 63

Query: 63  FKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGP 122
           F+ +DLMA E+E S  NRNPYPPIEPYSTGFLKVSDLHTIYWE+SGNPTGHPVVFLHGGP
Sbjct: 64  FQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGP 123

Query: 123 GGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEW 182
           GGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTW+LIDDIEKLREHLEIPEW
Sbjct: 124 GGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEW 183

Query: 183 QVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRD 242
           QVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRD
Sbjct: 184 QVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRD 243

Query: 243 LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL 302
           LIPESERGCFVDAYSKRLNS DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL
Sbjct: 244 LIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL 303

Query: 303 AFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEA 362
           AFARIENHYFVNKGFFPSDSFLLDN+DKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEA
Sbjct: 304 AFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEA 363

Query: 363 ELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP 400
           ELKII DAGHSANEPGIAAELVAANE LKNILQKNGP
Sbjct: 364 ELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP 394

BLAST of Sgr029920 vs. TAIR 10
Match: AT2G14260.1 (proline iminopeptidase )

HSP 1 Score: 587.8 bits (1514), Expect = 6.6e-168
Identity = 277/370 (74.86%), Positives = 311/370 (84.05%), Query Frame = 0

Query: 26  CLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPP 85
           C+  FP +  + ++   G + + +S   G KS+     KS+ +   E E     R  Y P
Sbjct: 15  CVRFFPSNHNNLNLLFPGQRKIQVSC--GGKSE---VLKSDTMEPHEAETFVNKRTLYAP 74

Query: 86  IEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFD 145
           IEPYS+G LKVSD+HT+YWE+SG P GHPVVFLHGGPGGGT+P NRRFFDP+FYRI+LFD
Sbjct: 75  IEPYSSGNLKVSDVHTLYWEQSGKPDGHPVVFLHGGPGGGTAPSNRRFFDPEFYRIVLFD 134

Query: 146 QRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKV 205
           QRGAGKSTPHACLE NTTWDL++DIEKLREHL+IPEW VFGGSWGSTLALAYSQSHP+KV
Sbjct: 135 QRGAGKSTPHACLEENTTWDLVNDIEKLREHLKIPEWLVFGGSWGSTLALAYSQSHPDKV 194

Query: 206 TGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSND 265
           TGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS+D
Sbjct: 195 TGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWEEFRDLIPENERGSSLVDAYHKRLNSDD 254

Query: 266 METQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFL 325
           +E QYAAARAWTKWEMMTA+L PN EN+++ EDD FSLAFARIENHYFVNKGFFPSDS L
Sbjct: 255 LEIQYAAARAWTKWEMMTAYLRPNLENVQKAEDDKFSLAFARIENHYFVNKGFFPSDSHL 314

Query: 326 LDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELV 385
           LDNVDKIRHI   IVQGRYDVCCPMMSAWDLHK WPEAELKI+ DAGHSANEPGI+AELV
Sbjct: 315 LDNVDKIRHIKTTIVQGRYDVCCPMMSAWDLHKAWPEAELKIVYDAGHSANEPGISAELV 374

Query: 386 AANENLKNIL 395
            ANE +K ++
Sbjct: 375 VANEKMKALM 379

BLAST of Sgr029920 vs. TAIR 10
Match: AT2G14260.2 (proline iminopeptidase )

HSP 1 Score: 580.9 bits (1496), Expect = 8.0e-166
Identity = 267/324 (82.41%), Positives = 290/324 (89.51%), Query Frame = 0

Query: 72  EQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNR 131
           E E     R  Y PIEPYS+G LKVSD+HT+YWE+SG P GHPVVFLHGGPGGGT+P NR
Sbjct: 5   EAETFVNKRTLYAPIEPYSSGNLKVSDVHTLYWEQSGKPDGHPVVFLHGGPGGGTAPSNR 64

Query: 132 RFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGS 191
           RFFDP+FYRI+LFDQRGAGKSTPHACLE NTTWDL++DIEKLREHL+IPEW VFGGSWGS
Sbjct: 65  RFFDPEFYRIVLFDQRGAGKSTPHACLEENTTWDLVNDIEKLREHLKIPEWLVFGGSWGS 124

Query: 192 TLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERG- 251
           TLALAYSQSHP+KVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE FRDLIPE+ERG 
Sbjct: 125 TLALAYSQSHPDKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWEEFRDLIPENERGS 184

Query: 252 CFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENH 311
             VDAY KRLNS+D+E QYAAARAWTKWEMMTA+L PN EN+++ EDD FSLAFARIENH
Sbjct: 185 SLVDAYHKRLNSDDLEIQYAAARAWTKWEMMTAYLRPNLENVQKAEDDKFSLAFARIENH 244

Query: 312 YFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDA 371
           YFVNKGFFPSDS LLDNVDKIRHI   IVQGRYDVCCPMMSAWDLHK WPEAELKI+ DA
Sbjct: 245 YFVNKGFFPSDSHLLDNVDKIRHIKTTIVQGRYDVCCPMMSAWDLHKAWPEAELKIVYDA 304

Query: 372 GHSANEPGIAAELVAANENLKNIL 395
           GHSANEPGI+AELV ANE +K ++
Sbjct: 305 GHSANEPGISAELVVANEKMKALM 328

BLAST of Sgr029920 vs. TAIR 10
Match: AT3G61540.1 (alpha/beta-Hydrolases superfamily protein )

HSP 1 Score: 49.3 bits (116), Expect = 8.4e-06
Identity = 49/182 (26.92%), Positives = 76/182 (41.76%), Query Frame = 0

Query: 58  DNQSEFKSEDLMAREQEFSETNRNPYPPIEP--YSTGFLKVS----DLHTIYWEESGNPT 117
           D   E KSE +  +     E     +  I P  YS    K++    ++  +  EE   P 
Sbjct: 60  DVAGESKSEHVTGKWFSVPELRLRDHRFIVPLDYSKSSPKITVFAREIVAVGKEEQAMPY 119

Query: 118 GHPVVFLHGGPG-GGTSPGNRRFFDP---DFYRIILFDQRGAGKSTPHAC---LENNTTW 177
              +++L GGPG  G  P     +     + +R++L DQRG G STP  C   L+  +  
Sbjct: 120 ---LLYLQGGPGFEGPRPSEASGWIQRACEEFRVVLLDQRGTGLSTPLTCSSMLQFKSAK 179

Query: 178 DLID------------DIEKLREHL--EIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVL 213
           +L D            D E +R  L  +   W + G S+G   AL Y    PE +  +++
Sbjct: 180 ELADYLVHFRADNIVKDAEFIRVRLVPKADPWTILGQSFGGFCALTYLSFAPEGLKQVLI 238

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_022149275.1  7.6e-219  90.25         proline iminopeptidase [Momordica charantia]
XP_023552784.1  1.0e-218  89.20         proline iminopeptidase [Cucurbita pepo subsp. pepo]
XP_038905843.1  1.3e-218  90.15         proline iminopeptidase isoform X1 [Benincasa hispida]
KAG7014918.1    1.1e-217  88.94         Proline iminopeptidase [Cucurbita argyrosperma subsp. argyrosperma]
XP_022922976.1  3.2e-217  88.69         proline iminopeptidase [Cucurbita moschata]
Match Name  E-value   Identity (%)  Description
P93732      9.2e-167  74.86         Proline iminopeptidase OS=Arabidopsis thaliana OX=3702 GN=PIP PE=2 SV=3
O83041      5.9e-113  61.17         Probable proline iminopeptidase OS=Leptolyngbya boryana OX=1184 GN=pip PE=3 SV=1
Q9PD69      3.4e-105  57.42         Proline iminopeptidase OS=Xylella fastidiosa (strain 9a5c) OX=160492 GN=pip PE=3...
O32449      5.9e-105  55.45         Proline iminopeptidase OS=Serratia marcescens OX=615 GN=pip PE=1 SV=1
Q87DF8      1.0e-104  57.10         Proline iminopeptidase OS=Xylella fastidiosa (strain Temecula1 / ATCC 700964) OX...
Match Name  E-value   Identity (%)  Description
A0A6J1D5A1  3.7e-219  90.25         Proline iminopeptidase OS=Momordica charantia OX=3673 GN=LOC111017736 PE=3 SV=1
A0A6J1E5K8  1.6e-217  88.69         Proline iminopeptidase OS=Cucurbita moschata OX=3662 GN=LOC111430799 PE=3 SV=1
A0A6J1J3D7  2.2e-216  88.19         Proline iminopeptidase OS=Cucurbita maxima OX=3661 GN=LOC111483031 PE=3 SV=1
A0A0A0KW74  6.1e-214  89.42         Proline iminopeptidase OS=Cucumis sativus OX=3659 GN=Csa_4G179110 PE=3 SV=1
A0A5D3E1M3  1.4e-213  89.17         Proline iminopeptidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold6...
Match Name   E-value   Identity (%)  Description
AT2G14260.1  6.6e-168  74.86         proline iminopeptidase
AT2G14260.2  8.0e-166  82.41         proline iminopeptidase
AT3G61540.1  8.4e-06   26.92         alpha/beta-Hydrolases superfamily protein
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description              Source       Source Term    Source Description      Alignment
IPR000073  Alpha/beta hydrolase fold-1  PRINTS       PR00111        ABHYDROLASE             coord: 184..197, score: 37.44; coord: 198..211, score: 31.53; coord: 138..153, score: 38.36
IPR000073  Alpha/beta hydrolase fold-1  PFAM         PF00561        Abhydrolase_1           coord: 114..375, e-value: 1.3E-27, score: 97.1
IPR002410  Peptidase S33                PRINTS       PR00793        PROAMNOPTASE            coord: 184..198, score: 57.78; coord: 141..152, score: 56.94; coord: 115..123, score: 69.91
IPR029058  Alpha/Beta hydrolase fold    GENE3D       3.40.50.1820   alpha/beta hydrolase    coord: 80..390, e-value: 1.0E-106, score: 359.2
IPR029058  Alpha/Beta hydrolase fold    SUPERFAMILY  53474          alpha/beta-Hydrolases   coord: 80..375
IPR005944  Proline iminopeptidase       TIGRFAM      TIGR01249      TIGR01249               coord: 87..390, e-value: 3.2E-157, score: 520.4
IPR005944  Proline iminopeptidase       PANTHER      PTHR43722      PROLINE IMINOPEPTIDASE  coord: 64..393
None       No IPR available             PIRSR        PIRSR006431-1  PIRSR006431-1           coord: 88..386, e-value: 5.1E-133, score: 440.9

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr029920.1   Sgr029920.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
cellular_component  GO:0005737      cytoplasm
molecular_function  GO:0004177      aminopeptidase activity
molecular_function  GO:0008233      peptidase activity