Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGCCTCGTGCTTCTGCAACCATTGATGATTTAAAAAATGTAGTCGTGAATATCAATTGTCTTGAGAAAGAGAGAAGAATTCCTCCTTGGGATGAAGAACACATCCCCGAGGAAATTGTCAGTACCATGCCAGATCCAGGGTAATCTATAATTTTTATAACTCCCCTGCAACGTTATGTTGAAACATTGAAATCTCAAGACCCTTGGTTCTCGTGTCCTTACTATTTTGCCATCTAATTAGAATTTGTTTCAGAACTGTTTTTTAAATATTTTTTAATGAAAAAAATCAATAAAACTATGTTTGAATAACTATTTTTAAAAGTTCTTTTTCCTCTTTTATGTGTTCTCACATTTGACTTTTTTTTAGAATAAATAAAATGGTAAATCTTTCCAAAATTATGAGTTTCTAACTAGAGATTTCAACATATTCAATAACATTAAATTTACAAATTCTTCAATGCTAGTTCATAAAAATAATTTTGGAAAGATAATCAGTATACTATTTTAGATGTAGTTTCCCAAACAGGCCTCTGATGTTCCAATATTCAACTTATAGGTAGAAACATATACGAAGTGAGTAAAATGATGTAAAATAAAGAGTTTCACATTTTCTTGGTTATGTTTAATTCCTTATGCTAAACTCTTCACTTCATTAAGTTTTGAATCCAGATGTTCACCAGTGGCTATCATTTTTATAATGGCATGTGGTAACTACTACACTGTTAAAATAATGTTTCAGGAATATTCTGCAGCAGTTGGAATATCCAAATATTCGAGCTACTGAAATATTTCTATCAAATGGCATGCGAGTTTGCTACAAGTGTACAGACTTTCTTGATGACCAGGTACCTTTTTAATTTTGTTAAAACTTTTCCTCAGATGCAATGGTCCGCTATTCGAATTGTTCATCTTCTATTGCTGCACCCGAGTCTAAGTTTCTCCTGTAAGTCTGACATAAAATTTTGTCTTAACAGGTAATATTTACAGGGTTCTCTTATGGGGCCTTATCTGAACTCCCAGAGAGAGAGTATAGTTCATGCTCGATGGGTTCAACCATTGCTGGAGAAATTGGAGTGTTTGGTTATCGACCTTCTGTACTTATGGACATGCTGGCTGGTAAGAGGGCTGAAGTTGGTACAAAGCTTGGAGCATACATGAGAACCTTTTCTGGTGATTGTTCACCGTCTGATCTGGAAACTGCCCTGCAGGTTTCCTTTCTGTCCTCATCAAATTAAGTACCCCCAGCTCTCCATTACTTATTATAGCCTTTAAACGAAAGAACACAATAAAAATATTTGTTCCAACAGAACATGATATGACTGGCATTGCCCCCTTGAAGCTCCTCTAGTAATAATTTCTGGTTCATACCTAAACTTGCTATGTTTTCTTTTAAGATGATTTCCCAATTAAAGGATTTCTGTTGCTATTCCTTTTTTCTTTATCATACGTGGGGTTTTGCTATAACTAACTGACTTAGGTTGATGTTGTGCTTCCTGCTCGTGCATGGAAATACCTTTATCTCCAAGCAAGTCTGTTGCTCAATTTCAGTTCTGGGTCATCTATTTCTTATTTCATTCTCACTGCCCTTTACTTTTACGCTTATTGTTTGTAATGCATATGGTTTTACAGCTAATTGCAGATTATGAGGTAGTATTCCGTTATCAATACCCTATATGAGTGATAAAAAAAAAAAGAACTACCTTATCATGAAAAAAAAATCATAAGCAGCTGACTGTGCCTAGTCAATGAGGGCCTACAAATTTTTAATCTATAAATAAGCATTTGAACTATGTAGGAATTCAGTCATTGTTCTAGCAGCATATATCATGCTTCTAATTTAATGCTAATGTAATTTCAGAAGAATTAAAAATAAAGATTTTTCCCTTTTGACCATTTCTTGACATTTTGTCTCGTTACATGTCTTGAGCGCATTCTGTTTCTCTGCAGCTGGTTTATCAACTATTCACAACAAATGTGACACCAGGAGAGGAGGATGTCAAAATTGTTATGCAAATGGCAGAAGAAGCTGTTCGTGCTCAGGAGAGGGATCCTTATACTGCATTTGCAACCGTGTGAAGGAGCTCAATTATGGAAACTCCTACTTTTTTAGGGTATGCTTGTTACCGTGCTTTTAGATTTTTATTTTTTCCTATAATTTTAACTTTTTTATTTTTATTTCTTACTCTCCAGCCAATTAGGTTAAGTGACCTTCGAAAGGTTGATCCACAAAAGGCTTGTGAATATTTCAACAACTGTTTCAGAGATCCATCCAGTTTTACTGTTGTAATTGTTGGGAATATTAATCCTTCTATAGCACTTCCTTTAATCCAGCAGTATTTGGTACGTGGAGCTTTTTATTCAGTGTGCTTACACGCAGCAACATATAGTGTTCGTGATATTTAAGGTTCTGATTTTCCTAAGCAGGGTGGAATCCAAAGCCTCCTGAACCAGTTATGGACTTCAATCGTGACGATCTGAAAGGCTTGCCATTCACTTTTCCTACAAGCATAGTTCGGTATAGTTTTCTAGACCATGTATATGAATCCATTGATGGTAGACATAAAACATGTTTCCAAATGTGTGATATAATTAATCAGTGGGAGGGAGGGTAGTGAAAATAAAATTAAAATAGTGAACCAAATCATGTTTCACTCTGTTTCTCATCAATGGCTTTCATTTTCCCCTATATTGTTTCTTAAGGTGTTGCTTAGAAGTGGTCTAGTATGAATGTGCATGTTTTCTGGTTCGTACAGTTATTACATTCATTTGGGATTTCCTTTCTCTAAACATGTTTAATGTGCTTGCATTGTAGAGAAGTGGTATATAGCCCCATGGTTGAAGCTCAATGTTCAGTTCAGCTTTGCTTTCCTGTGGAGCTCAAAAATGGAACCATGGTATTTCTTTCCTTTTTATTGTCAATTGACAGTGTACATTGTAATTGAAGTGTGTCCTGTTTATTACTTTATTTTGTTTATCCAATGGTTTAGAACGTGTTTTGTTCAAAACAAAAATGGCAGGTTGAGGAAATTCATTTCGTAGGGTTTTTGAGCAAATTGCTTGAGACAAAAATGATGCAAGTTCTGCGTTTCAAGCATGGACAGGTGTTTATGCATCTCGTGGTATTGTGAGTATTTGTTTTTTTTTCCTATTCTTTTGCTTAATTCTTTGGTTACCTTTTTGTACCAGATCTATTCTGCTGGGGTTTCAGTATTCCTTGGAGGTAACAAGCCTTCAAGAATTGGTCCTGTTCGTGGTGATATTAGCATAAACTTTTCTTGTGATCCAGAAATCTCATCGAAGCTGGTTTATACATTATCACAATCAACCTTCTTTTACTTTAATTCCATTAGCAAATTTAGCTGACCTTCTTTGTGATCCTGAATTTTTTAAAAGGTTGATCTTGCTTTGGATGAAATATTACGTCTTCAAGAAGAAGGGCCTACGGATCAAGATGTTTCCACTGTCCTAGAGATTGAGCAAAGGGCCCATGAAAATGGACTGCAGGTTATTTTTCATTTATGAGAATTTATTATTACTACATGAAGGGTAAAATGAAAATAGCATTCCACTTCAAGATATGTTTAACAAAATCTCCTCGACACTTGTTTTCCAAATAAAAATCATAATCACATCTTTAGCAACAACTGTCATTTTGGTCAACATTTTTGTTAATGATTTATGGGGAGGGGAACCTGTGTTGCAGTGGCACGCGTGGATTTGTGTCCAGCCACATAACTTATGTGGCAATTATGTGACATAATTACCACAAAGCTCTCTCTTCTCACTCTATTTGAAATGATAGATTCCCAAATCTGAATAATATAGCTAATTGTGGATGATCGCTTGGCTCTTAACCCTTGTTTTGTTCTGTGAATCACAATCCTTAAATCACTCTCTTTCCCCCTATCTTTTTCTATCCCCTCCCCATTTGGTATGGATAATAAAACAAAAACAAAAGTACTTAGATGAAAGAGACTTTCGCCTTCAAATGAAAAAGGTCCTCTTGCAGATGTTTCAAGGTTATATTTTGCCCTTATTAACATGACTGCAGACGTGGTTAGAAGAGTTATGATCCATTTTGTTGCCAACTACAAGAATAAAAGGAGTTGGTTTTCAGTTGTTCCTAGTTTATATATGGGCAATATGCCTAGCCATGAGGAATTTTATGGTTCCTGTACAACCAGTTAACTGTTTTGGGTTTTGGCTACCAGGAAAATTATTACTGGCTGGACAGGATTTTACGCAGCTACCAGTCAAGGATATACTCCGGTGATGTTGGAACTTCTTTTGAGGTTGGTGGCATTTAATAAGTTTTTATCTCTCCCCTTTTTATCTGTCAATACTTAATGTTGCTCTCTATGCAAGTGGCTTTCACCACTTATGGTGTGTTTATTTATTAGGTAAGTTCTTGATGAATCGTTTATGTTTTACAGGATGTGATAAAAGTTCCAATCAGCATTTCATTATTTCCGTTTGATTTAGGAGAAATGAATGTAAATATGTTTATCTACCATGAGATAAATTGTTTAGTTTCACTATACAATGCTGTGCCATATTTTGGAATCTCAATATATAGGTAAAGATGGACCCTGACATTCTATTCTCATCATTTTCTGTAAATTTTTCTCGATGCAGATCCAGGATGAGGGACGTTTGAAGGTCAGAAATGCCTTGACACCATCAACAGCACAGTTGGCACTACGAAGGATACTGCCATTTCCTTGCACAAAACAATATACTGCAGTAGTTCTGTTGCCAAGATCATATCAGTTCAGAAAACTGAAATCATTCTCCAACTCGGTCTATCAAATCATGGCAGAGATGCAAAGGTTAGTTTCATAG
mRNA sequence
ATGGAGCCTCGTGCTTCTGCAACCATTGATGATTTAAAAAATGTAGTCGTGAATATCAATTGTCTTGAGAAAGAGAGAAGAATTCCTCCTTGGGATGAAGAACACATCCCCGAGGAAATTGTCAGTACCATGCCAGATCCAGGGAATATTCTGCAGCAGTTGGAATATCCAAATATTCGAGCTACTGAAATATTTCTATCAAATGGCATGCGAGTTTGCTACAAGTGTACAGACTTTCTTGATGACCAGGTAATATTTACAGGGTTCTCTTATGGGGCCTTATCTGAACTCCCAGAGAGAGAGTATAGTTCATGCTCGATGGGTTCAACCATTGCTGGAGAAATTGGAGTGTTTGGTTATCGACCTTCTGTACTTATGGACATGCTGGCTGGTAAGAGGGCTGAAGTTGGTACAAAGCTTGGAGCATACATGAGAACCTTTTCTGGTGATTGTTCACCGTCTGATCTGGAAACTGCCCTGCAGCTGGTTTATCAACTATTCACAACAAATGTGACACCAGGAGAGGAGGATGTCAAAATTGTTATGCAAATGGCAGAAGAAGCTGTTCGTGCTCAGGAGAGGGATCCTTATACTGCATTTGCAACCCCAATTAGGTTAAGTGACCTTCGAAAGGTTGATCCACAAAAGGCTTGTGAATATTTCAACAACTGTTTCAGAGATCCATCCAGTTTTACTGTTGTAATTGTTGGGAATATTAATCCTTCTATAGCACTTCCTTTAATCCAGCAGTATTTGCCTCCTGAACCAGTTATGGACTTCAATCGTGACGATCTGAAAGGCTTGCCATTCACTTTTCCTACAAGCATAGTTCGAGAAGTGGTATATAGCCCCATGGTTGAAGCTCAATGTTCAGTTCAGCTTTGCTTTCCTGTGGAGCTCAAAAATGGAACCATGGTTGAGGAAATTCATTTCGTAGGGTTTTTGAGCAAATTGCTTGAGACAAAAATGATGCAAGTTCTGCGTTTCAAGCATGGACAGATCTATTCTGCTGGGGTTTCAGTATTCCTTGGAGGTAACAAGCCTTCAAGAATTGGTCCTGTTCGTGGTGATATTAGCATAAACTTTTCTTGTGATCCAGAAATCTCATCGAAGCTGGTTGATCTTGCTTTGGATGAAATATTACGTCTTCAAGAAGAAGGGCCTACGGATCAAGATGTTTCCACTGTCCTAGAGATTGAGCAAAGGGCCCATGAAAATGGACTGCAGGAAAATTATTACTGGCTGGACAGGATTTTACGCAGCTACCAGTCAAGGATATACTCCGGTGATGTTGGAACTTCTTTTGAGATCCAGGATGAGGGACGTTTGAAGGTCAGAAATGCCTTGACACCATCAACAGCACAGTTGGCACTACGAAGGATACTGCCATTTCCTTGCACAAAACAATATACTGCAGTAGTTCTGTTGCCAAGATCATATCAGTTCAGAAAACTGAAATCATTCTCCAACTCGGTCTATCAAATCATGGCAGAGATGCAAAGGTTAGTTTCATAG
Coding sequence (CDS)
ATGGAGCCTCGTGCTTCTGCAACCATTGATGATTTAAAAAATGTAGTCGTGAATATCAATTGTCTTGAGAAAGAGAGAAGAATTCCTCCTTGGGATGAAGAACACATCCCCGAGGAAATTGTCAGTACCATGCCAGATCCAGGGAATATTCTGCAGCAGTTGGAATATCCAAATATTCGAGCTACTGAAATATTTCTATCAAATGGCATGCGAGTTTGCTACAAGTGTACAGACTTTCTTGATGACCAGGTAATATTTACAGGGTTCTCTTATGGGGCCTTATCTGAACTCCCAGAGAGAGAGTATAGTTCATGCTCGATGGGTTCAACCATTGCTGGAGAAATTGGAGTGTTTGGTTATCGACCTTCTGTACTTATGGACATGCTGGCTGGTAAGAGGGCTGAAGTTGGTACAAAGCTTGGAGCATACATGAGAACCTTTTCTGGTGATTGTTCACCGTCTGATCTGGAAACTGCCCTGCAGCTGGTTTATCAACTATTCACAACAAATGTGACACCAGGAGAGGAGGATGTCAAAATTGTTATGCAAATGGCAGAAGAAGCTGTTCGTGCTCAGGAGAGGGATCCTTATACTGCATTTGCAACCCCAATTAGGTTAAGTGACCTTCGAAAGGTTGATCCACAAAAGGCTTGTGAATATTTCAACAACTGTTTCAGAGATCCATCCAGTTTTACTGTTGTAATTGTTGGGAATATTAATCCTTCTATAGCACTTCCTTTAATCCAGCAGTATTTGCCTCCTGAACCAGTTATGGACTTCAATCGTGACGATCTGAAAGGCTTGCCATTCACTTTTCCTACAAGCATAGTTCGAGAAGTGGTATATAGCCCCATGGTTGAAGCTCAATGTTCAGTTCAGCTTTGCTTTCCTGTGGAGCTCAAAAATGGAACCATGGTTGAGGAAATTCATTTCGTAGGGTTTTTGAGCAAATTGCTTGAGACAAAAATGATGCAAGTTCTGCGTTTCAAGCATGGACAGATCTATTCTGCTGGGGTTTCAGTATTCCTTGGAGGTAACAAGCCTTCAAGAATTGGTCCTGTTCGTGGTGATATTAGCATAAACTTTTCTTGTGATCCAGAAATCTCATCGAAGCTGGTTGATCTTGCTTTGGATGAAATATTACGTCTTCAAGAAGAAGGGCCTACGGATCAAGATGTTTCCACTGTCCTAGAGATTGAGCAAAGGGCCCATGAAAATGGACTGCAGGAAAATTATTACTGGCTGGACAGGATTTTACGCAGCTACCAGTCAAGGATATACTCCGGTGATGTTGGAACTTCTTTTGAGATCCAGGATGAGGGACGTTTGAAGGTCAGAAATGCCTTGACACCATCAACAGCACAGTTGGCACTACGAAGGATACTGCCATTTCCTTGCACAAAACAATATACTGCAGTAGTTCTGTTGCCAAGATCATATCAGTTCAGAAAACTGAAATCATTCTCCAACTCGGTCTATCAAATCATGGCAGAGATGCAAAGGTTAGTTTCATAG
Protein sequence
MEPRASATIDDLKNVVVNINCLEKERRIPPWDEEHIPEEIVSTMPDPGNILQQLEYPNIRATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGYRPSVLMDMLAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKIVMQMAEEAVRAQERDPYTAFATPIRLSDLRKVDPQKACEYFNNCFRDPSSFTVVIVGNINPSIALPLIQQYLPPEPVMDFNRDDLKGLPFTFPTSIVREVVYSPMVEAQCSVQLCFPVELKNGTMVEEIHFVGFLSKLLETKMMQVLRFKHGQIYSAGVSVFLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALDEILRLQEEGPTDQDVSTVLEIEQRAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRLKVRNALTPSTAQLALRRILPFPCTKQYTAVVLLPRSYQFRKLKSFSNSVYQIMAEMQRLVS
Homology
BLAST of Sgr022307 vs. NCBI nr
Match:
XP_038890062.1 (zinc protease PQQL-like isoform X3 [Benincasa hispida])
HSP 1 Score: 923.7 bits (2386), Expect = 6.8e-265
Identity = 460/507 (90.73%), Postives = 479/507 (94.48%), Query Frame = 0
Query: 1 MEPRASATIDDLKNVVVNINCLEKERRIPPWDEEHIPEEIVSTMPDPGNILQQLEYPNIR 60
+EPRASATIDDLKNVV+NI+CLEKER IPPWDEEHIPEEIVSTMP+PGNILQQ EYPNI
Sbjct: 298 IEPRASATIDDLKNVVMNISCLEKERSIPPWDEEHIPEEIVSTMPNPGNILQQQEYPNIG 357
Query: 61 ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 120
ATEIFLSNGMRVCYKCTDFLDDQV+FTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY
Sbjct: 358 ATEIFLSNGMRVCYKCTDFLDDQVVFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 417
Query: 121 RPSVLMDMLAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI 180
RPSVLMD+LAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI
Sbjct: 418 RPSVLMDILAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI 477
Query: 181 VMQMAEEAVRAQERDPYTAFAT--------------PIRLSDLRKVDPQKACEYFNNCFR 240
VMQMAEEAVRAQERDPYTAF PIRLSDLRKVDPQ+ACEYFNNCFR
Sbjct: 478 VMQMAEEAVRAQERDPYTAFVNRVKELNYGNSYFFRPIRLSDLRKVDPQRACEYFNNCFR 537
Query: 241 DPSSFTVVIVGNINPSIALPLIQQYL-----PPEPVMDFNRDDLKGLPFTFPTSIVREVV 300
DPS+FTVVIVGNINPSIALPLIQQYL PPEP+M FNRDDLKGLPFTFPTSIVREVV
Sbjct: 538 DPSNFTVVIVGNINPSIALPLIQQYLGGIPKPPEPIMKFNRDDLKGLPFTFPTSIVREVV 597
Query: 301 YSPMVEAQCSVQLCFPVELKNGTMVEEIHFVGFLSKLLETKMMQVLRFKHGQIYSAGVSV 360
YSPMVEAQCSVQLCFPVEL NGTMVEEIH+VGFLSKLLET+MMQVLRFKHGQIYSAGVSV
Sbjct: 598 YSPMVEAQCSVQLCFPVELTNGTMVEEIHYVGFLSKLLETRMMQVLRFKHGQIYSAGVSV 657
Query: 361 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALDEILRLQEEGPTDQDVSTVLEIEQ 420
FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLAL+EILRLQEEGPTDQDVS++LEIEQ
Sbjct: 658 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALNEILRLQEEGPTDQDVSSILEIEQ 717
Query: 421 RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRLKVRNALTPSTAQLALRRI 480
RAHENGLQENYYWLDRILRSYQSRIYSGDVG+SFEIQDEGRL VRN+LTP TAQLAL+RI
Sbjct: 718 RAHENGLQENYYWLDRILRSYQSRIYSGDVGSSFEIQDEGRLNVRNSLTPLTAQLALQRI 777
Query: 481 LPFPCTKQYTAVVLLPRSYQFRKLKSF 489
LPFPCTKQYTAV+LLPRSY+FRKLKSF
Sbjct: 778 LPFPCTKQYTAVILLPRSYRFRKLKSF 804
BLAST of Sgr022307 vs. NCBI nr
Match:
XP_038890061.1 (zinc protease PQQL-like isoform X2 [Benincasa hispida])
HSP 1 Score: 923.7 bits (2386), Expect = 6.8e-265
Identity = 460/507 (90.73%), Postives = 479/507 (94.48%), Query Frame = 0
Query: 1 MEPRASATIDDLKNVVVNINCLEKERRIPPWDEEHIPEEIVSTMPDPGNILQQLEYPNIR 60
+EPRASATIDDLKNVV+NI+CLEKER IPPWDEEHIPEEIVSTMP+PGNILQQ EYPNI
Sbjct: 417 IEPRASATIDDLKNVVMNISCLEKERSIPPWDEEHIPEEIVSTMPNPGNILQQQEYPNIG 476
Query: 61 ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 120
ATEIFLSNGMRVCYKCTDFLDDQV+FTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY
Sbjct: 477 ATEIFLSNGMRVCYKCTDFLDDQVVFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 536
Query: 121 RPSVLMDMLAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI 180
RPSVLMD+LAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI
Sbjct: 537 RPSVLMDILAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI 596
Query: 181 VMQMAEEAVRAQERDPYTAFAT--------------PIRLSDLRKVDPQKACEYFNNCFR 240
VMQMAEEAVRAQERDPYTAF PIRLSDLRKVDPQ+ACEYFNNCFR
Sbjct: 597 VMQMAEEAVRAQERDPYTAFVNRVKELNYGNSYFFRPIRLSDLRKVDPQRACEYFNNCFR 656
Query: 241 DPSSFTVVIVGNINPSIALPLIQQYL-----PPEPVMDFNRDDLKGLPFTFPTSIVREVV 300
DPS+FTVVIVGNINPSIALPLIQQYL PPEP+M FNRDDLKGLPFTFPTSIVREVV
Sbjct: 657 DPSNFTVVIVGNINPSIALPLIQQYLGGIPKPPEPIMKFNRDDLKGLPFTFPTSIVREVV 716
Query: 301 YSPMVEAQCSVQLCFPVELKNGTMVEEIHFVGFLSKLLETKMMQVLRFKHGQIYSAGVSV 360
YSPMVEAQCSVQLCFPVEL NGTMVEEIH+VGFLSKLLET+MMQVLRFKHGQIYSAGVSV
Sbjct: 717 YSPMVEAQCSVQLCFPVELTNGTMVEEIHYVGFLSKLLETRMMQVLRFKHGQIYSAGVSV 776
Query: 361 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALDEILRLQEEGPTDQDVSTVLEIEQ 420
FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLAL+EILRLQEEGPTDQDVS++LEIEQ
Sbjct: 777 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALNEILRLQEEGPTDQDVSSILEIEQ 836
Query: 421 RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRLKVRNALTPSTAQLALRRI 480
RAHENGLQENYYWLDRILRSYQSRIYSGDVG+SFEIQDEGRL VRN+LTP TAQLAL+RI
Sbjct: 837 RAHENGLQENYYWLDRILRSYQSRIYSGDVGSSFEIQDEGRLNVRNSLTPLTAQLALQRI 896
Query: 481 LPFPCTKQYTAVVLLPRSYQFRKLKSF 489
LPFPCTKQYTAV+LLPRSY+FRKLKSF
Sbjct: 897 LPFPCTKQYTAVILLPRSYRFRKLKSF 923
BLAST of Sgr022307 vs. NCBI nr
Match:
XP_038890060.1 (zinc protease PQQL-like isoform X1 [Benincasa hispida])
HSP 1 Score: 923.7 bits (2386), Expect = 6.8e-265
Identity = 460/507 (90.73%), Postives = 479/507 (94.48%), Query Frame = 0
Query: 1 MEPRASATIDDLKNVVVNINCLEKERRIPPWDEEHIPEEIVSTMPDPGNILQQLEYPNIR 60
+EPRASATIDDLKNVV+NI+CLEKER IPPWDEEHIPEEIVSTMP+PGNILQQ EYPNI
Sbjct: 465 IEPRASATIDDLKNVVMNISCLEKERSIPPWDEEHIPEEIVSTMPNPGNILQQQEYPNIG 524
Query: 61 ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 120
ATEIFLSNGMRVCYKCTDFLDDQV+FTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY
Sbjct: 525 ATEIFLSNGMRVCYKCTDFLDDQVVFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 584
Query: 121 RPSVLMDMLAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI 180
RPSVLMD+LAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI
Sbjct: 585 RPSVLMDILAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI 644
Query: 181 VMQMAEEAVRAQERDPYTAFAT--------------PIRLSDLRKVDPQKACEYFNNCFR 240
VMQMAEEAVRAQERDPYTAF PIRLSDLRKVDPQ+ACEYFNNCFR
Sbjct: 645 VMQMAEEAVRAQERDPYTAFVNRVKELNYGNSYFFRPIRLSDLRKVDPQRACEYFNNCFR 704
Query: 241 DPSSFTVVIVGNINPSIALPLIQQYL-----PPEPVMDFNRDDLKGLPFTFPTSIVREVV 300
DPS+FTVVIVGNINPSIALPLIQQYL PPEP+M FNRDDLKGLPFTFPTSIVREVV
Sbjct: 705 DPSNFTVVIVGNINPSIALPLIQQYLGGIPKPPEPIMKFNRDDLKGLPFTFPTSIVREVV 764
Query: 301 YSPMVEAQCSVQLCFPVELKNGTMVEEIHFVGFLSKLLETKMMQVLRFKHGQIYSAGVSV 360
YSPMVEAQCSVQLCFPVEL NGTMVEEIH+VGFLSKLLET+MMQVLRFKHGQIYSAGVSV
Sbjct: 765 YSPMVEAQCSVQLCFPVELTNGTMVEEIHYVGFLSKLLETRMMQVLRFKHGQIYSAGVSV 824
Query: 361 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALDEILRLQEEGPTDQDVSTVLEIEQ 420
FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLAL+EILRLQEEGPTDQDVS++LEIEQ
Sbjct: 825 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALNEILRLQEEGPTDQDVSSILEIEQ 884
Query: 421 RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRLKVRNALTPSTAQLALRRI 480
RAHENGLQENYYWLDRILRSYQSRIYSGDVG+SFEIQDEGRL VRN+LTP TAQLAL+RI
Sbjct: 885 RAHENGLQENYYWLDRILRSYQSRIYSGDVGSSFEIQDEGRLNVRNSLTPLTAQLALQRI 944
Query: 481 LPFPCTKQYTAVVLLPRSYQFRKLKSF 489
LPFPCTKQYTAV+LLPRSY+FRKLKSF
Sbjct: 945 LPFPCTKQYTAVILLPRSYRFRKLKSF 971
BLAST of Sgr022307 vs. NCBI nr
Match:
XP_022999427.1 (zinc protease PQQL-like isoform X1 [Cucurbita maxima])
HSP 1 Score: 917.9 bits (2371), Expect = 3.7e-263
Identity = 457/507 (90.14%), Postives = 478/507 (94.28%), Query Frame = 0
Query: 1 MEPRASATIDDLKNVVVNINCLEKERRIPPWDEEHIPEEIVSTMPDPGNILQQLEYPNIR 60
+EPRASAT+D LKNVV+NIN LEKER IPPWDEEHIPEEIV+TMP+PGNILQQ EYPNI
Sbjct: 503 IEPRASATVDGLKNVVMNINSLEKERSIPPWDEEHIPEEIVTTMPNPGNILQQQEYPNIG 562
Query: 61 ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 120
ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY
Sbjct: 563 ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 622
Query: 121 RPSVLMDMLAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI 180
RPSVLMD+LAGKRAEVGTKLGAYMRTFSGDCSPSDLETA+QLVYQLFTTNVTPGEEDVKI
Sbjct: 623 RPSVLMDILAGKRAEVGTKLGAYMRTFSGDCSPSDLETAMQLVYQLFTTNVTPGEEDVKI 682
Query: 181 VMQMAEEAVRAQERDPYTAFAT--------------PIRLSDLRKVDPQKACEYFNNCFR 240
VMQMAEEAVRAQERDPYTAFA PIRLSDL+KVDPQKACEYFNNCFR
Sbjct: 683 VMQMAEEAVRAQERDPYTAFANRVKELNYGNSYFFRPIRLSDLQKVDPQKACEYFNNCFR 742
Query: 241 DPSSFTVVIVGNINPSIALPLIQQYL-----PPEPVMDFNRDDLKGLPFTFPTSIVREVV 300
DPS+FTVV+VGNINPSIALPLIQQYL PPEP+M+FNRDDLKGLPFTFPTSIVREVV
Sbjct: 743 DPSNFTVVVVGNINPSIALPLIQQYLGGIPKPPEPIMNFNRDDLKGLPFTFPTSIVREVV 802
Query: 301 YSPMVEAQCSVQLCFPVELKNGTMVEEIHFVGFLSKLLETKMMQVLRFKHGQIYSAGVSV 360
YSPMVEAQCSVQLCFPVEL NGTMVEEIHFVGFLSKLLET+MMQVLRFKHGQIYSAGVSV
Sbjct: 803 YSPMVEAQCSVQLCFPVELTNGTMVEEIHFVGFLSKLLETRMMQVLRFKHGQIYSAGVSV 862
Query: 361 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALDEILRLQEEGPTDQDVSTVLEIEQ 420
FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLAL+EILRLQEEGPTDQDVS++LEIEQ
Sbjct: 863 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALNEILRLQEEGPTDQDVSSILEIEQ 922
Query: 421 RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRLKVRNALTPSTAQLALRRI 480
RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRL VRN+LTP TAQLAL+RI
Sbjct: 923 RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRLNVRNSLTPLTAQLALQRI 982
Query: 481 LPFPCTKQYTAVVLLPRSYQFRKLKSF 489
LPFPCTKQYTAV+LLP SY+F+KLKSF
Sbjct: 983 LPFPCTKQYTAVILLPSSYRFKKLKSF 1009
BLAST of Sgr022307 vs. NCBI nr
Match:
XP_022999428.1 (zinc protease PQQL-like isoform X2 [Cucurbita maxima])
HSP 1 Score: 917.9 bits (2371), Expect = 3.7e-263
Identity = 457/507 (90.14%), Postives = 478/507 (94.28%), Query Frame = 0
Query: 1 MEPRASATIDDLKNVVVNINCLEKERRIPPWDEEHIPEEIVSTMPDPGNILQQLEYPNIR 60
+EPRASAT+D LKNVV+NIN LEKER IPPWDEEHIPEEIV+TMP+PGNILQQ EYPNI
Sbjct: 465 IEPRASATVDGLKNVVMNINSLEKERSIPPWDEEHIPEEIVTTMPNPGNILQQQEYPNIG 524
Query: 61 ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 120
ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY
Sbjct: 525 ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 584
Query: 121 RPSVLMDMLAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI 180
RPSVLMD+LAGKRAEVGTKLGAYMRTFSGDCSPSDLETA+QLVYQLFTTNVTPGEEDVKI
Sbjct: 585 RPSVLMDILAGKRAEVGTKLGAYMRTFSGDCSPSDLETAMQLVYQLFTTNVTPGEEDVKI 644
Query: 181 VMQMAEEAVRAQERDPYTAFAT--------------PIRLSDLRKVDPQKACEYFNNCFR 240
VMQMAEEAVRAQERDPYTAFA PIRLSDL+KVDPQKACEYFNNCFR
Sbjct: 645 VMQMAEEAVRAQERDPYTAFANRVKELNYGNSYFFRPIRLSDLQKVDPQKACEYFNNCFR 704
Query: 241 DPSSFTVVIVGNINPSIALPLIQQYL-----PPEPVMDFNRDDLKGLPFTFPTSIVREVV 300
DPS+FTVV+VGNINPSIALPLIQQYL PPEP+M+FNRDDLKGLPFTFPTSIVREVV
Sbjct: 705 DPSNFTVVVVGNINPSIALPLIQQYLGGIPKPPEPIMNFNRDDLKGLPFTFPTSIVREVV 764
Query: 301 YSPMVEAQCSVQLCFPVELKNGTMVEEIHFVGFLSKLLETKMMQVLRFKHGQIYSAGVSV 360
YSPMVEAQCSVQLCFPVEL NGTMVEEIHFVGFLSKLLET+MMQVLRFKHGQIYSAGVSV
Sbjct: 765 YSPMVEAQCSVQLCFPVELTNGTMVEEIHFVGFLSKLLETRMMQVLRFKHGQIYSAGVSV 824
Query: 361 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALDEILRLQEEGPTDQDVSTVLEIEQ 420
FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLAL+EILRLQEEGPTDQDVS++LEIEQ
Sbjct: 825 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALNEILRLQEEGPTDQDVSSILEIEQ 884
Query: 421 RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRLKVRNALTPSTAQLALRRI 480
RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRL VRN+LTP TAQLAL+RI
Sbjct: 885 RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRLNVRNSLTPLTAQLALQRI 944
Query: 481 LPFPCTKQYTAVVLLPRSYQFRKLKSF 489
LPFPCTKQYTAV+LLP SY+F+KLKSF
Sbjct: 945 LPFPCTKQYTAVILLPSSYRFKKLKSF 971
BLAST of Sgr022307 vs. ExPASy Swiss-Prot
Match:
Q9FJT9 (Zinc protease PQQL-like OS=Arabidopsis thaliana OX=3702 GN=At5g56730 PE=1 SV=1)
HSP 1 Score: 700.7 bits (1807), Expect = 1.2e-200
Identity = 348/510 (68.24%), Postives = 410/510 (80.39%), Query Frame = 0
Query: 1 MEPRASATIDDLKNVVVNINCLEKERRIPPWDEEHIPEEIVSTMPDPGNILQQLEYPNIR 60
MEP+++ATID ++NVV +N LE+E+ I PWDEE+IPEEIVS P PG+I QLEYP +
Sbjct: 439 MEPKSAATIDHMRNVVSKVNSLEEEKMIAPWDEENIPEEIVSEKPTPGDITHQLEYPEVG 498
Query: 61 ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 120
TE+ LSNGM+VCYK TDFLDDQV+FTGFSYG LSELPE +Y SCSMGSTIAGEIG+FGY
Sbjct: 499 VTELTLSNGMQVCYKSTDFLDDQVLFTGFSYGGLSELPESDYISCSMGSTIAGEIGMFGY 558
Query: 121 RPSVLMDMLAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI 180
+PSVLMDMLA DLETALQLVYQLFTTNV P EE+V I
Sbjct: 559 KPSVLMDMLA------------------------DLETALQLVYQLFTTNVMPQEEEVGI 618
Query: 181 VMQMAEEAVRAQERDPYTAFAT--------------PIRLSDLRKVDPQKACEYFNNCFR 240
VMQMAEE+VRA+ERDPYT FA PIR+S+LRKVDP KACEYFN+CFR
Sbjct: 619 VMQMAEESVRARERDPYTVFANRVKELNYGNSYFFRPIRISELRKVDPLKACEYFNSCFR 678
Query: 241 DPSSFTVVIVGNINPSIALPLIQQYL-----PPEPVMDFNRDDLKGLPFTFPTSIVREVV 300
DPS+FTVVIVGN++P+IALPLI QYL PP+PV++FNRDDLKGLPFTFPT I +E V
Sbjct: 679 DPSTFTVVIVGNLDPTIALPLILQYLGGIPKPPQPVLNFNRDDLKGLPFTFPTKITKEFV 738
Query: 301 YSPMVEAQCSVQLCFPVELKNGTMVEEIHFVGFLSKLLETKMMQVLRFKHGQIYSAGVSV 360
SPMVEAQCSVQLCFPV+L NGTM+EEIH +GFL KLLETK++Q LRF+HGQIYSA VSV
Sbjct: 739 RSPMVEAQCSVQLCFPVQLTNGTMIEEIHCIGFLGKLLETKIIQFLRFEHGQIYSAEVSV 798
Query: 361 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALDEILRLQEEGPTDQDVSTVLEIEQ 420
FLGGNKPSR +RGDIS+NFSCDPEISSKLVDLAL+EI+RLQ+EGP+ +D+S +LEIEQ
Sbjct: 799 FLGGNKPSRTADLRGDISVNFSCDPEISSKLVDLALEEIVRLQKEGPSQEDISAILEIEQ 858
Query: 421 RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRLKVRNALTPSTAQLALRRI 480
RAHENG+QENYYWLDRI+R YQSR+Y+GD+G S +I +EGRL++R +L P TAQ AL+RI
Sbjct: 859 RAHENGMQENYYWLDRIIRGYQSRVYAGDLGASCKILEEGRLRMRESLAPQTAQAALQRI 918
Query: 481 LPFPCTKQYTAVVLLPRSYQFRKLKSFSNS 492
LP PC KQYTAV+L+P+ +F L S +S
Sbjct: 919 LPHPCKKQYTAVILMPQRSRFGFLSSIFSS 924
BLAST of Sgr022307 vs. ExPASy TrEMBL
Match:
A0A6J1KAU8 (zinc protease PQQL-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111493804 PE=4 SV=1)
HSP 1 Score: 917.9 bits (2371), Expect = 1.8e-263
Identity = 457/507 (90.14%), Postives = 478/507 (94.28%), Query Frame = 0
Query: 1 MEPRASATIDDLKNVVVNINCLEKERRIPPWDEEHIPEEIVSTMPDPGNILQQLEYPNIR 60
+EPRASAT+D LKNVV+NIN LEKER IPPWDEEHIPEEIV+TMP+PGNILQQ EYPNI
Sbjct: 503 IEPRASATVDGLKNVVMNINSLEKERSIPPWDEEHIPEEIVTTMPNPGNILQQQEYPNIG 562
Query: 61 ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 120
ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY
Sbjct: 563 ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 622
Query: 121 RPSVLMDMLAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI 180
RPSVLMD+LAGKRAEVGTKLGAYMRTFSGDCSPSDLETA+QLVYQLFTTNVTPGEEDVKI
Sbjct: 623 RPSVLMDILAGKRAEVGTKLGAYMRTFSGDCSPSDLETAMQLVYQLFTTNVTPGEEDVKI 682
Query: 181 VMQMAEEAVRAQERDPYTAFAT--------------PIRLSDLRKVDPQKACEYFNNCFR 240
VMQMAEEAVRAQERDPYTAFA PIRLSDL+KVDPQKACEYFNNCFR
Sbjct: 683 VMQMAEEAVRAQERDPYTAFANRVKELNYGNSYFFRPIRLSDLQKVDPQKACEYFNNCFR 742
Query: 241 DPSSFTVVIVGNINPSIALPLIQQYL-----PPEPVMDFNRDDLKGLPFTFPTSIVREVV 300
DPS+FTVV+VGNINPSIALPLIQQYL PPEP+M+FNRDDLKGLPFTFPTSIVREVV
Sbjct: 743 DPSNFTVVVVGNINPSIALPLIQQYLGGIPKPPEPIMNFNRDDLKGLPFTFPTSIVREVV 802
Query: 301 YSPMVEAQCSVQLCFPVELKNGTMVEEIHFVGFLSKLLETKMMQVLRFKHGQIYSAGVSV 360
YSPMVEAQCSVQLCFPVEL NGTMVEEIHFVGFLSKLLET+MMQVLRFKHGQIYSAGVSV
Sbjct: 803 YSPMVEAQCSVQLCFPVELTNGTMVEEIHFVGFLSKLLETRMMQVLRFKHGQIYSAGVSV 862
Query: 361 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALDEILRLQEEGPTDQDVSTVLEIEQ 420
FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLAL+EILRLQEEGPTDQDVS++LEIEQ
Sbjct: 863 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALNEILRLQEEGPTDQDVSSILEIEQ 922
Query: 421 RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRLKVRNALTPSTAQLALRRI 480
RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRL VRN+LTP TAQLAL+RI
Sbjct: 923 RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRLNVRNSLTPLTAQLALQRI 982
Query: 481 LPFPCTKQYTAVVLLPRSYQFRKLKSF 489
LPFPCTKQYTAV+LLP SY+F+KLKSF
Sbjct: 983 LPFPCTKQYTAVILLPSSYRFKKLKSF 1009
BLAST of Sgr022307 vs. ExPASy TrEMBL
Match:
A0A6J1KJP3 (zinc protease PQQL-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111493804 PE=3 SV=1)
HSP 1 Score: 917.9 bits (2371), Expect = 1.8e-263
Identity = 457/507 (90.14%), Postives = 478/507 (94.28%), Query Frame = 0
Query: 1 MEPRASATIDDLKNVVVNINCLEKERRIPPWDEEHIPEEIVSTMPDPGNILQQLEYPNIR 60
+EPRASAT+D LKNVV+NIN LEKER IPPWDEEHIPEEIV+TMP+PGNILQQ EYPNI
Sbjct: 465 IEPRASATVDGLKNVVMNINSLEKERSIPPWDEEHIPEEIVTTMPNPGNILQQQEYPNIG 524
Query: 61 ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 120
ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY
Sbjct: 525 ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 584
Query: 121 RPSVLMDMLAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI 180
RPSVLMD+LAGKRAEVGTKLGAYMRTFSGDCSPSDLETA+QLVYQLFTTNVTPGEEDVKI
Sbjct: 585 RPSVLMDILAGKRAEVGTKLGAYMRTFSGDCSPSDLETAMQLVYQLFTTNVTPGEEDVKI 644
Query: 181 VMQMAEEAVRAQERDPYTAFAT--------------PIRLSDLRKVDPQKACEYFNNCFR 240
VMQMAEEAVRAQERDPYTAFA PIRLSDL+KVDPQKACEYFNNCFR
Sbjct: 645 VMQMAEEAVRAQERDPYTAFANRVKELNYGNSYFFRPIRLSDLQKVDPQKACEYFNNCFR 704
Query: 241 DPSSFTVVIVGNINPSIALPLIQQYL-----PPEPVMDFNRDDLKGLPFTFPTSIVREVV 300
DPS+FTVV+VGNINPSIALPLIQQYL PPEP+M+FNRDDLKGLPFTFPTSIVREVV
Sbjct: 705 DPSNFTVVVVGNINPSIALPLIQQYLGGIPKPPEPIMNFNRDDLKGLPFTFPTSIVREVV 764
Query: 301 YSPMVEAQCSVQLCFPVELKNGTMVEEIHFVGFLSKLLETKMMQVLRFKHGQIYSAGVSV 360
YSPMVEAQCSVQLCFPVEL NGTMVEEIHFVGFLSKLLET+MMQVLRFKHGQIYSAGVSV
Sbjct: 765 YSPMVEAQCSVQLCFPVELTNGTMVEEIHFVGFLSKLLETRMMQVLRFKHGQIYSAGVSV 824
Query: 361 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALDEILRLQEEGPTDQDVSTVLEIEQ 420
FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLAL+EILRLQEEGPTDQDVS++LEIEQ
Sbjct: 825 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALNEILRLQEEGPTDQDVSSILEIEQ 884
Query: 421 RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRLKVRNALTPSTAQLALRRI 480
RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRL VRN+LTP TAQLAL+RI
Sbjct: 885 RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRLNVRNSLTPLTAQLALQRI 944
Query: 481 LPFPCTKQYTAVVLLPRSYQFRKLKSF 489
LPFPCTKQYTAV+LLP SY+F+KLKSF
Sbjct: 945 LPFPCTKQYTAVILLPSSYRFKKLKSF 971
BLAST of Sgr022307 vs. ExPASy TrEMBL
Match:
A0A6J1DPE6 (zinc protease PQQL-like isoform X1 OS=Momordica charantia OX=3673 GN=LOC111021875 PE=3 SV=1)
HSP 1 Score: 910.2 bits (2351), Expect = 3.8e-261
Identity = 459/507 (90.53%), Postives = 472/507 (93.10%), Query Frame = 0
Query: 1 MEPRASATIDDLKNVVVNINCLEKERRIPPWDEEHIPEEIVSTMPDPGNILQQLEYPNIR 60
+EPRASATIDDLKNVVVNINCLEKER IPPWDEEHIPEEIV + P GNILQQLEYPNI
Sbjct: 465 IEPRASATIDDLKNVVVNINCLEKERSIPPWDEEHIPEEIVISKPGLGNILQQLEYPNIG 524
Query: 61 ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 120
A+EIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREY+SCSMGSTIAGEIGVFGY
Sbjct: 525 ASEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYTSCSMGSTIAGEIGVFGY 584
Query: 121 RPSVLMDMLAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI 180
RPSVLMDMLAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFT NVTPGEEDVKI
Sbjct: 585 RPSVLMDMLAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTKNVTPGEEDVKI 644
Query: 181 VMQMAEEAVRAQERDPYTAFAT--------------PIRLSDLRKVDPQKACEYFNNCFR 240
VMQMAEEAVRAQERDPYTAFA PIRL DL+KVDPQKACEYFN CFR
Sbjct: 645 VMQMAEEAVRAQERDPYTAFANRVKELNYGNSYFFRPIRLRDLQKVDPQKACEYFNKCFR 704
Query: 241 DPSSFTVVIVGNINPSIALPLIQQYL-----PPEPVMDFNRDDLKGLPFTFPTSIVREVV 300
DPS+FTVVIVGNINPSIALPLIQQYL PPEPVMDFNRDDLKGLPFTF T IVREVV
Sbjct: 705 DPSTFTVVIVGNINPSIALPLIQQYLGGIPKPPEPVMDFNRDDLKGLPFTFSTGIVREVV 764
Query: 301 YSPMVEAQCSVQLCFPVELKNGTMVEEIHFVGFLSKLLETKMMQVLRFKHGQIYSAGVSV 360
YSPMVEAQCSVQLCFPVELKNGTMVEEIHFVGFLSKLLETKMMQVLRFKHGQIYSAGVSV
Sbjct: 765 YSPMVEAQCSVQLCFPVELKNGTMVEEIHFVGFLSKLLETKMMQVLRFKHGQIYSAGVSV 824
Query: 361 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALDEILRLQEEGPTDQDVSTVLEIEQ 420
FLGGNKPSR PVRGDISINFSCDPEISSKLVDLAL+EILRLQEEGPTDQDVS+VLEIEQ
Sbjct: 825 FLGGNKPSRNDPVRGDISINFSCDPEISSKLVDLALNEILRLQEEGPTDQDVSSVLEIEQ 884
Query: 421 RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRLKVRNALTPSTAQLALRRI 480
RAHENGLQENYYWLDRILRSYQSRIYSGDVG SFEIQDEGRLKVRN+LTP TAQLAL+RI
Sbjct: 885 RAHENGLQENYYWLDRILRSYQSRIYSGDVGNSFEIQDEGRLKVRNSLTPLTAQLALQRI 944
Query: 481 LPFPCTKQYTAVVLLPRSYQFRKLKSF 489
LPFPCTKQYTAV+LLPRS++FRKLKSF
Sbjct: 945 LPFPCTKQYTAVILLPRSFRFRKLKSF 971
BLAST of Sgr022307 vs. ExPASy TrEMBL
Match:
A0A6J1DM98 (zinc protease PQQL-like isoform X2 OS=Momordica charantia OX=3673 GN=LOC111021875 PE=4 SV=1)
HSP 1 Score: 910.2 bits (2351), Expect = 3.8e-261
Identity = 459/507 (90.53%), Postives = 472/507 (93.10%), Query Frame = 0
Query: 1 MEPRASATIDDLKNVVVNINCLEKERRIPPWDEEHIPEEIVSTMPDPGNILQQLEYPNIR 60
+EPRASATIDDLKNVVVNINCLEKER IPPWDEEHIPEEIV + P GNILQQLEYPNI
Sbjct: 404 IEPRASATIDDLKNVVVNINCLEKERSIPPWDEEHIPEEIVISKPGLGNILQQLEYPNIG 463
Query: 61 ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 120
A+EIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREY+SCSMGSTIAGEIGVFGY
Sbjct: 464 ASEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYTSCSMGSTIAGEIGVFGY 523
Query: 121 RPSVLMDMLAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI 180
RPSVLMDMLAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFT NVTPGEEDVKI
Sbjct: 524 RPSVLMDMLAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTKNVTPGEEDVKI 583
Query: 181 VMQMAEEAVRAQERDPYTAFAT--------------PIRLSDLRKVDPQKACEYFNNCFR 240
VMQMAEEAVRAQERDPYTAFA PIRL DL+KVDPQKACEYFN CFR
Sbjct: 584 VMQMAEEAVRAQERDPYTAFANRVKELNYGNSYFFRPIRLRDLQKVDPQKACEYFNKCFR 643
Query: 241 DPSSFTVVIVGNINPSIALPLIQQYL-----PPEPVMDFNRDDLKGLPFTFPTSIVREVV 300
DPS+FTVVIVGNINPSIALPLIQQYL PPEPVMDFNRDDLKGLPFTF T IVREVV
Sbjct: 644 DPSTFTVVIVGNINPSIALPLIQQYLGGIPKPPEPVMDFNRDDLKGLPFTFSTGIVREVV 703
Query: 301 YSPMVEAQCSVQLCFPVELKNGTMVEEIHFVGFLSKLLETKMMQVLRFKHGQIYSAGVSV 360
YSPMVEAQCSVQLCFPVELKNGTMVEEIHFVGFLSKLLETKMMQVLRFKHGQIYSAGVSV
Sbjct: 704 YSPMVEAQCSVQLCFPVELKNGTMVEEIHFVGFLSKLLETKMMQVLRFKHGQIYSAGVSV 763
Query: 361 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALDEILRLQEEGPTDQDVSTVLEIEQ 420
FLGGNKPSR PVRGDISINFSCDPEISSKLVDLAL+EILRLQEEGPTDQDVS+VLEIEQ
Sbjct: 764 FLGGNKPSRNDPVRGDISINFSCDPEISSKLVDLALNEILRLQEEGPTDQDVSSVLEIEQ 823
Query: 421 RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRLKVRNALTPSTAQLALRRI 480
RAHENGLQENYYWLDRILRSYQSRIYSGDVG SFEIQDEGRLKVRN+LTP TAQLAL+RI
Sbjct: 824 RAHENGLQENYYWLDRILRSYQSRIYSGDVGNSFEIQDEGRLKVRNSLTPLTAQLALQRI 883
Query: 481 LPFPCTKQYTAVVLLPRSYQFRKLKSF 489
LPFPCTKQYTAV+LLPRS++FRKLKSF
Sbjct: 884 LPFPCTKQYTAVILLPRSFRFRKLKSF 910
BLAST of Sgr022307 vs. ExPASy TrEMBL
Match:
A0A1S3C9Q7 (zinc protease PQQL-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103498391 PE=3 SV=1)
HSP 1 Score: 907.5 bits (2344), Expect = 2.5e-260
Identity = 452/507 (89.15%), Postives = 474/507 (93.49%), Query Frame = 0
Query: 1 MEPRASATIDDLKNVVVNINCLEKERRIPPWDEEHIPEEIVSTMPDPGNILQQLEYPNIR 60
+EPRASATIDDLKNVV+NI CLEKER IPPWDEE+IPEEIVSTMP+PGNI+QQ EYPNI
Sbjct: 464 IEPRASATIDDLKNVVMNITCLEKERSIPPWDEENIPEEIVSTMPNPGNIVQQKEYPNIG 523
Query: 61 ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 120
ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY
Sbjct: 524 ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 583
Query: 121 RPSVLMDMLAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI 180
RPSVLMD+LAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI
Sbjct: 584 RPSVLMDILAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI 643
Query: 181 VMQMAEEAVRAQERDPYTAFAT--------------PIRLSDLRKVDPQKACEYFNNCFR 240
VMQMAEEAVRAQERDPYTAFA PIRL DL+KVDPQ+ACEYFN CFR
Sbjct: 644 VMQMAEEAVRAQERDPYTAFANRVKELNYGNSYFFRPIRLRDLKKVDPQRACEYFNKCFR 703
Query: 241 DPSSFTVVIVGNINPSIALPLIQQYL-----PPEPVMDFNRDDLKGLPFTFPTSIVREVV 300
DPS+FTVV+VGNINPSIALPLIQQYL PPEP+M+FNRDDLKGLPF FPT IVREVV
Sbjct: 704 DPSNFTVVVVGNINPSIALPLIQQYLGGIPKPPEPIMNFNRDDLKGLPFKFPTRIVREVV 763
Query: 301 YSPMVEAQCSVQLCFPVELKNGTMVEEIHFVGFLSKLLETKMMQVLRFKHGQIYSAGVSV 360
YSPMVEAQCSVQLCFPVEL NGTMVEEIH+VGFLSKLLET+MMQVLRFKHGQIYSAGVSV
Sbjct: 764 YSPMVEAQCSVQLCFPVELTNGTMVEEIHYVGFLSKLLETRMMQVLRFKHGQIYSAGVSV 823
Query: 361 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALDEILRLQEEGPTDQDVSTVLEIEQ 420
FLGGNKPSR GPVRGDISINFSCDPEISSKLVDLAL+EILRLQEEGPTDQDVS++LEIEQ
Sbjct: 824 FLGGNKPSRSGPVRGDISINFSCDPEISSKLVDLALNEILRLQEEGPTDQDVSSILEIEQ 883
Query: 421 RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRLKVRNALTPSTAQLALRRI 480
RAHENGLQENYYWLDRILRSYQSRIYSGDVG+SFEIQDEGRL VRN+LTP TAQLAL+RI
Sbjct: 884 RAHENGLQENYYWLDRILRSYQSRIYSGDVGSSFEIQDEGRLNVRNSLTPLTAQLALQRI 943
Query: 481 LPFPCTKQYTAVVLLPRSYQFRKLKSF 489
LPFPCTKQYTAV+LLP SY+FRKLKSF
Sbjct: 944 LPFPCTKQYTAVILLPASYRFRKLKSF 970
BLAST of Sgr022307 vs. TAIR 10
Match:
AT5G56730.1 (Insulinase (Peptidase family M16) protein )
HSP 1 Score: 700.7 bits (1807), Expect = 8.8e-202
Identity = 348/510 (68.24%), Postives = 410/510 (80.39%), Query Frame = 0
Query: 1 MEPRASATIDDLKNVVVNINCLEKERRIPPWDEEHIPEEIVSTMPDPGNILQQLEYPNIR 60
MEP+++ATID ++NVV +N LE+E+ I PWDEE+IPEEIVS P PG+I QLEYP +
Sbjct: 439 MEPKSAATIDHMRNVVSKVNSLEEEKMIAPWDEENIPEEIVSEKPTPGDITHQLEYPEVG 498
Query: 61 ATEIFLSNGMRVCYKCTDFLDDQVIFTGFSYGALSELPEREYSSCSMGSTIAGEIGVFGY 120
TE+ LSNGM+VCYK TDFLDDQV+FTGFSYG LSELPE +Y SCSMGSTIAGEIG+FGY
Sbjct: 499 VTELTLSNGMQVCYKSTDFLDDQVLFTGFSYGGLSELPESDYISCSMGSTIAGEIGMFGY 558
Query: 121 RPSVLMDMLAGKRAEVGTKLGAYMRTFSGDCSPSDLETALQLVYQLFTTNVTPGEEDVKI 180
+PSVLMDMLA DLETALQLVYQLFTTNV P EE+V I
Sbjct: 559 KPSVLMDMLA------------------------DLETALQLVYQLFTTNVMPQEEEVGI 618
Query: 181 VMQMAEEAVRAQERDPYTAFAT--------------PIRLSDLRKVDPQKACEYFNNCFR 240
VMQMAEE+VRA+ERDPYT FA PIR+S+LRKVDP KACEYFN+CFR
Sbjct: 619 VMQMAEESVRARERDPYTVFANRVKELNYGNSYFFRPIRISELRKVDPLKACEYFNSCFR 678
Query: 241 DPSSFTVVIVGNINPSIALPLIQQYL-----PPEPVMDFNRDDLKGLPFTFPTSIVREVV 300
DPS+FTVVIVGN++P+IALPLI QYL PP+PV++FNRDDLKGLPFTFPT I +E V
Sbjct: 679 DPSTFTVVIVGNLDPTIALPLILQYLGGIPKPPQPVLNFNRDDLKGLPFTFPTKITKEFV 738
Query: 301 YSPMVEAQCSVQLCFPVELKNGTMVEEIHFVGFLSKLLETKMMQVLRFKHGQIYSAGVSV 360
SPMVEAQCSVQLCFPV+L NGTM+EEIH +GFL KLLETK++Q LRF+HGQIYSA VSV
Sbjct: 739 RSPMVEAQCSVQLCFPVQLTNGTMIEEIHCIGFLGKLLETKIIQFLRFEHGQIYSAEVSV 798
Query: 361 FLGGNKPSRIGPVRGDISINFSCDPEISSKLVDLALDEILRLQEEGPTDQDVSTVLEIEQ 420
FLGGNKPSR +RGDIS+NFSCDPEISSKLVDLAL+EI+RLQ+EGP+ +D+S +LEIEQ
Sbjct: 799 FLGGNKPSRTADLRGDISVNFSCDPEISSKLVDLALEEIVRLQKEGPSQEDISAILEIEQ 858
Query: 421 RAHENGLQENYYWLDRILRSYQSRIYSGDVGTSFEIQDEGRLKVRNALTPSTAQLALRRI 480
RAHENG+QENYYWLDRI+R YQSR+Y+GD+G S +I +EGRL++R +L P TAQ AL+RI
Sbjct: 859 RAHENGMQENYYWLDRIIRGYQSRVYAGDLGASCKILEEGRLRMRESLAPQTAQAALQRI 918
Query: 481 LPFPCTKQYTAVVLLPRSYQFRKLKSFSNS 492
LP PC KQYTAV+L+P+ +F L S +S
Sbjct: 919 LPHPCKKQYTAVILMPQRSRFGFLSSIFSS 924
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9FJT9 | 1.2e-200 | 68.24 | Zinc protease PQQL-like OS=Arabidopsis thaliana OX=3702 GN=At5g56730 PE=1 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1KAU8 | 1.8e-263 | 90.14 | zinc protease PQQL-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111493804 P... | [more] |
A0A6J1KJP3 | 1.8e-263 | 90.14 | zinc protease PQQL-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111493804 P... | [more] |
A0A6J1DPE6 | 3.8e-261 | 90.53 | zinc protease PQQL-like isoform X1 OS=Momordica charantia OX=3673 GN=LOC11102187... | [more] |
A0A6J1DM98 | 3.8e-261 | 90.53 | zinc protease PQQL-like isoform X2 OS=Momordica charantia OX=3673 GN=LOC11102187... | [more] |
A0A1S3C9Q7 | 2.5e-260 | 89.15 | zinc protease PQQL-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103498391 PE=3 ... | [more] |
Match Name | E-value | Identity | Description | |
AT5G56730.1 | 8.8e-202 | 68.24 | Insulinase (Peptidase family M16) protein | [more] |