Cla97C05G086740 (gene) Watermelon (97103) v2

NameCla97C05G086740
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
Descriptionbasic 7S globulin-like
LocationCla97Chr05 : 5026986 .. 5028293 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGGCCTCAACTTCCCTCTCTTTCTTCTCCTCTATTCTCTTCCTCCTCTTCTCCATTTCCATTGCTGCGACCTCCTTCCGCCCCAAATCCCTCCTTCTCCCCGTCACCAAACACCCATCTCTCCAATACATCACCCACATCCACCAACGAACCCCTCTCGTTCCGCTCAAGCTCACGGTTGACCTCGGCGGTCAGTTCATGTGGGTCGACTGTGACCGTGGCTACGTTTCTTCCACCTACAAGCCTGCCCGTTGCCGCTCCGCCCAATGCCACCTCGCCTCTAAATCCACTACCTGCGGCGAGTGCTTTTCGCCCCCGCGCCCCGGTTGCAACAATAACACGTGCGGCCTCTTCCCCGGCAATACCATTATCGGCCTCTCCACTAGCGGAGAAGTCGCTTCCGATGTCGTCTCCGTTTCCTCCACCAACGGCTTTATTCCAACCAGAGCCGTGTCGGTGCCCAATTTCCTCTTCGTCTGTGGCTCGACGTTTCTCCTCGACGGTCTTGCCGGCGGCGTAACTGGAATGGCCGGATTCGGAAGAACCGGAATCTCTCTGCCTTCACAATTCGCAGCCGCGTTCAGCTTTAACCGGAAATTCGCCGTTTGCTTGAGCGGCTCCACCAGATCCCCTGGCGTCATCTTTTCCGGGAATGGCCCTTACAATTTCTTACCCAACGTCGACTTAACAAAATCCCTCACTTATACCCCACTCTTCATCAACCCCGTCAGCACCGCCGGCGTCTCCTCCGCCGGAGATAAATCCTCCGAGTATTTCATCGGCGTTAAATCCATCGTCATCAACTCCAAAACCGTCCCACTCAATACCACTCTCCTCAAAATCGACGAAAACGGAAACGGCGGTACAAAAATCAGCACAGTGAATCCATACACCGTACTCGAATCGTCGATCTACAACGCGGTGGTGAAAACGTTCACGACGGAGCTGTCGAAAATTCCGAGAGTGGCAGCGGTGGCGCCGTTTGGGGTTTGTTATAGTTCAAAGAGCTTTTCGAGTACTCGATTGGGGGCGGGCGTGCCGTCGATCGATTTGATTTTGCAGAACAAGAAAGTGATTTGGAGAATATTCGGTGCGAACTCAATGGTGTCTATAAACGACGAAGTTTTGTGCTTGGGATTTGTTGACGGTGGAGTTGAACCAAGAACGGCGATTGTTATTGGGGCCCACCAAATTGAGGATAATTTGCTTGAATTTGATTTGGCCTCTTCCAGACTTGGATTTAGCTCCACTCTTCTAGGTCGGATGACTAATTGTGGTAATTTCAACTTTACTTCTAGCCCTTGA

mRNA sequence

ATGGCGGCCTCAACTTCCCTCTCTTTCTTCTCCTCTATTCTCTTCCTCCTCTTCTCCATTTCCATTGCTGCGACCTCCTTCCGCCCCAAATCCCTCCTTCTCCCCGTCACCAAACACCCATCTCTCCAATACATCACCCACATCCACCAACGAACCCCTCTCGTTCCGCTCAAGCTCACGGTTGACCTCGGCGGTCAGTTCATGTGGGTCGACTGTGACCGTGGCTACGTTTCTTCCACCTACAAGCCTGCCCGTTGCCGCTCCGCCCAATGCCACCTCGCCTCTAAATCCACTACCTGCGGCGAGTGCTTTTCGCCCCCGCGCCCCGGTTGCAACAATAACACGTGCGGCCTCTTCCCCGGCAATACCATTATCGGCCTCTCCACTAGCGGAGAAGTCGCTTCCGATGTCGTCTCCGTTTCCTCCACCAACGGCTTTATTCCAACCAGAGCCGTGTCGGTGCCCAATTTCCTCTTCGTCTGTGGCTCGACGTTTCTCCTCGACGGTCTTGCCGGCGGCGTAACTGGAATGGCCGGATTCGGAAGAACCGGAATCTCTCTGCCTTCACAATTCGCAGCCGCGTTCAGCTTTAACCGGAAATTCGCCGTTTGCTTGAGCGGCTCCACCAGATCCCCTGGCGTCATCTTTTCCGGGAATGGCCCTTACAATTTCTTACCCAACGTCGACTTAACAAAATCCCTCACTTATACCCCACTCTTCATCAACCCCGTCAGCACCGCCGGCGTCTCCTCCGCCGGAGATAAATCCTCCGAGTATTTCATCGGCGTTAAATCCATCGTCATCAACTCCAAAACCGTCCCACTCAATACCACTCTCCTCAAAATCGACGAAAACGGAAACGGCGGTACAAAAATCAGCACAGTGAATCCATACACCGTACTCGAATCGTCGATCTACAACGCGGTGGTGAAAACGTTCACGACGGAGCTGTCGAAAATTCCGAGAGTGGCAGCGGTGGCGCCGTTTGGGGTTTGTTATAGTTCAAAGAGCTTTTCGAGTACTCGATTGGGGGCGGGCGTGCCGTCGATCGATTTGATTTTGCAGAACAAGAAAGTGATTTGGAGAATATTCGGTGCGAACTCAATGGTGTCTATAAACGACGAAGTTTTGTGCTTGGGATTTGTTGACGGTGGAGTTGAACCAAGAACGGCGATTGTTATTGGGGCCCACCAAATTGAGGATAATTTGCTTGAATTTGATTTGGCCTCTTCCAGACTTGGATTTAGCTCCACTCTTCTAGGTCGGATGACTAATTGTGGTAATTTCAACTTTACTTCTAGCCCTTGA

Coding sequence (CDS)

ATGGCGGCCTCAACTTCCCTCTCTTTCTTCTCCTCTATTCTCTTCCTCCTCTTCTCCATTTCCATTGCTGCGACCTCCTTCCGCCCCAAATCCCTCCTTCTCCCCGTCACCAAACACCCATCTCTCCAATACATCACCCACATCCACCAACGAACCCCTCTCGTTCCGCTCAAGCTCACGGTTGACCTCGGCGGTCAGTTCATGTGGGTCGACTGTGACCGTGGCTACGTTTCTTCCACCTACAAGCCTGCCCGTTGCCGCTCCGCCCAATGCCACCTCGCCTCTAAATCCACTACCTGCGGCGAGTGCTTTTCGCCCCCGCGCCCCGGTTGCAACAATAACACGTGCGGCCTCTTCCCCGGCAATACCATTATCGGCCTCTCCACTAGCGGAGAAGTCGCTTCCGATGTCGTCTCCGTTTCCTCCACCAACGGCTTTATTCCAACCAGAGCCGTGTCGGTGCCCAATTTCCTCTTCGTCTGTGGCTCGACGTTTCTCCTCGACGGTCTTGCCGGCGGCGTAACTGGAATGGCCGGATTCGGAAGAACCGGAATCTCTCTGCCTTCACAATTCGCAGCCGCGTTCAGCTTTAACCGGAAATTCGCCGTTTGCTTGAGCGGCTCCACCAGATCCCCTGGCGTCATCTTTTCCGGGAATGGCCCTTACAATTTCTTACCCAACGTCGACTTAACAAAATCCCTCACTTATACCCCACTCTTCATCAACCCCGTCAGCACCGCCGGCGTCTCCTCCGCCGGAGATAAATCCTCCGAGTATTTCATCGGCGTTAAATCCATCGTCATCAACTCCAAAACCGTCCCACTCAATACCACTCTCCTCAAAATCGACGAAAACGGAAACGGCGGTACAAAAATCAGCACAGTGAATCCATACACCGTACTCGAATCGTCGATCTACAACGCGGTGGTGAAAACGTTCACGACGGAGCTGTCGAAAATTCCGAGAGTGGCAGCGGTGGCGCCGTTTGGGGTTTGTTATAGTTCAAAGAGCTTTTCGAGTACTCGATTGGGGGCGGGCGTGCCGTCGATCGATTTGATTTTGCAGAACAAGAAAGTGATTTGGAGAATATTCGGTGCGAACTCAATGGTGTCTATAAACGACGAAGTTTTGTGCTTGGGATTTGTTGACGGTGGAGTTGAACCAAGAACGGCGATTGTTATTGGGGCCCACCAAATTGAGGATAATTTGCTTGAATTTGATTTGGCCTCTTCCAGACTTGGATTTAGCTCCACTCTTCTAGGTCGGATGACTAATTGTGGTAATTTCAACTTTACTTCTAGCCCTTGA

Protein sequence

MAASTSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTVLESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTNCGNFNFTSSP
BLAST of Cla97C05G086740 vs. NCBI nr
Match: XP_004134154.1 (PREDICTED: basic 7S globulin-like [Cucumis sativus] >KGN57020.1 Xyloglucan-specific endoglucanase inhibitor protein [Cucumis sativus])

HSP 1 Score: 761.5 bits (1965), Expect = 1.5e-216
Identity = 383/434 (88.25%), Postives = 407/434 (93.78%), Query Frame = 0

Query: 1   MAASTSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYITHIHQRTPLVPLKLT 60
           MA+STS SFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYIT IHQRTPLVP+KLT
Sbjct: 1   MASSTSFSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYITEIHQRTPLVPVKLT 60

Query: 61  VDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGLFP 120
           VDLGGQFMWVDCDRGYVSS+YKPARCRSAQC LASKS+ CG+CFSPPRPGCNNNTC LFP
Sbjct: 61  VDLGGQFMWVDCDRGYVSSSYKPARCRSAQCSLASKSSACGQCFSPPRPGCNNNTCSLFP 120

Query: 121 GNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGF 180
           GNTII LSTSGEVASDVVSVSSTNGF PTRAVS+PNFLFVCGSTFLL+GLA GVTGMAGF
Sbjct: 121 GNTIIRLSTSGEVASDVVSVSSTNGFNPTRAVSIPNFLFVCGSTFLLEGLAPGVTGMAGF 180

Query: 181 GRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLF 240
           GR GISLPSQFAAAFSFNRKFAVCLSGST SPGVIFSGNGPY+FLPN+DLT S TYTPLF
Sbjct: 181 GRNGISLPSQFAAAFSFNRKFAVCLSGSTSSPGVIFSGNGPYHFLPNIDLTNSFTYTPLF 240

Query: 241 INPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTV 300
           INPVSTAGVSSAG+KS+EYFIGV SIV+NSK VPLNTTLLKID NGNGGTKISTVNP+TV
Sbjct: 241 INPVSTAGVSSAGEKSTEYFIGVTSIVVNSKPVPLNTTLLKIDSNGNGGTKISTVNPFTV 300

Query: 301 LESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVI 360
           LESSIY A+VK FTTE+SK+PRV AVAPF VCYSSKSF STRLGAGVP+IDL+LQNKKVI
Sbjct: 301 LESSIYKALVKAFTTEVSKVPRVGAVAPFEVCYSSKSFPSTRLGAGVPTIDLVLQNKKVI 360

Query: 361 WRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLL 420
           W +FGANSMV +NDEVLCLGFVDGGV+ RTAIVIGAHQIED LLEFDLA+SRLGF+ TLL
Sbjct: 361 WSMFGANSMVQVNDEVLCLGFVDGGVDVRTAIVIGAHQIEDKLLEFDLATSRLGFTPTLL 420

Query: 421 GRMTNCGNFNFTSS 435
           GRMT C NFNFTS+
Sbjct: 421 GRMTTCANFNFTSN 434

BLAST of Cla97C05G086740 vs. NCBI nr
Match: XP_008438718.1 (PREDICTED: basic 7S globulin-like [Cucumis melo])

HSP 1 Score: 758.4 bits (1957), Expect = 1.3e-215
Identity = 380/434 (87.56%), Postives = 409/434 (94.24%), Query Frame = 0

Query: 1   MAASTSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYITHIHQRTPLVPLKLT 60
           MA+STS SFFSSILFLLFSISIAATSFRPKSL+LPVTKHPSLQYIT +HQRTPLVP+KLT
Sbjct: 1   MASSTSFSFFSSILFLLFSISIAATSFRPKSLVLPVTKHPSLQYITEVHQRTPLVPVKLT 60

Query: 61  VDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGLFP 120
           VDLGGQFMWVDCDR YVSS+YKPARCRSAQC LASKS++CG+CFSPPRPGCNN+TCGLFP
Sbjct: 61  VDLGGQFMWVDCDRDYVSSSYKPARCRSAQCSLASKSSSCGQCFSPPRPGCNNDTCGLFP 120

Query: 121 GNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGF 180
            NTII LSTSGEVASDVVSVSSTNGF PTRAVS+PNFLFVCGSTFLL+GLA GVTGMAGF
Sbjct: 121 SNTIIRLSTSGEVASDVVSVSSTNGFNPTRAVSIPNFLFVCGSTFLLEGLAPGVTGMAGF 180

Query: 181 GRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLF 240
           GR GISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPY+FLPN+DLT S TYTPLF
Sbjct: 181 GRNGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYHFLPNIDLTNSFTYTPLF 240

Query: 241 INPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTV 300
           INPVSTAGVSSAG+KS+EYFIGV SIV+NSK VPLNTTLLKID NGNGGTKISTVNP+TV
Sbjct: 241 INPVSTAGVSSAGEKSTEYFIGVTSIVVNSKPVPLNTTLLKIDSNGNGGTKISTVNPFTV 300

Query: 301 LESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVI 360
           LESSIY A+VK FTTE+SK+PRVAAVAPF VCY+SKSF STRLGAGVP+IDL+LQNKKVI
Sbjct: 301 LESSIYKALVKAFTTEVSKVPRVAAVAPFEVCYNSKSFPSTRLGAGVPTIDLVLQNKKVI 360

Query: 361 WRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLL 420
           W IFGANSMV +ND+VLCLGFVDGGV+ RTAIVIGAHQIED LLEFDLA+SRLGF+ TLL
Sbjct: 361 WSIFGANSMVQVNDDVLCLGFVDGGVDVRTAIVIGAHQIEDKLLEFDLATSRLGFTPTLL 420

Query: 421 GRMTNCGNFNFTSS 435
           GRMT C NFNFTS+
Sbjct: 421 GRMTTCANFNFTSN 434

BLAST of Cla97C05G086740 vs. NCBI nr
Match: XP_008438715.1 (PREDICTED: basic 7S globulin-like [Cucumis melo])

HSP 1 Score: 755.0 bits (1948), Expect = 1.4e-214
Identity = 386/436 (88.53%), Postives = 408/436 (93.58%), Query Frame = 0

Query: 1   MAAST--SLSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYITHIHQRTPLVPLK 60
           MAAST  S SFFSSILFLLFSISIAATSFRPKSL+LPVTKHPS QYIT I QRTPLVP+K
Sbjct: 1   MAASTSFSFSFFSSILFLLFSISIAATSFRPKSLVLPVTKHPSGQYITQIRQRTPLVPVK 60

Query: 61  LTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGL 120
           LTVDLGG+FMWVDCD GYVSS+YKP RCRSAQC L SKST+CGECFSPPRPGCNNNTCG 
Sbjct: 61  LTVDLGGRFMWVDCDSGYVSSSYKPVRCRSAQCSL-SKSTSCGECFSPPRPGCNNNTCGH 120

Query: 121 FPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMA 180
           FPGNTII LSTSGEV +DVVSVSSTNGF PTRAVSVPNF+FVCG TFLL+GL GGV+GMA
Sbjct: 121 FPGNTIIQLSTSGEVTTDVVSVSSTNGFNPTRAVSVPNFIFVCGPTFLLEGLNGGVSGMA 180

Query: 181 GFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTP 240
           GFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPY+FLPNVDLTKSLTYTP
Sbjct: 181 GFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYHFLPNVDLTKSLTYTP 240

Query: 241 LFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPY 300
           LFINPVSTAGVS++G+KSSEYFIGVKSIV NSKTVPLNTTLLKID NGNGGTKIST++PY
Sbjct: 241 LFINPVSTAGVSTSGEKSSEYFIGVKSIVFNSKTVPLNTTLLKIDRNGNGGTKISTIHPY 300

Query: 301 TVLESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKK 360
           TVLESSIYNA+VKT TTEL  IPRVAAVAPFGVCY SKSF STRLG G+PSIDLILQNKK
Sbjct: 301 TVLESSIYNALVKTITTELRNIPRVAAVAPFGVCYKSKSFGSTRLGPGMPSIDLILQNKK 360

Query: 361 VIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSST 420
           VIWRIFGANSMV +ND+VLCLGFVDGGVE RTAIVIGAHQ+EDNLLEFDLA+SRLGFSST
Sbjct: 361 VIWRIFGANSMVQVNDDVLCLGFVDGGVEARTAIVIGAHQMEDNLLEFDLATSRLGFSST 420

Query: 421 LLGRMTNCGNFNFTSS 435
           LLGRMT C NFNFTS+
Sbjct: 421 LLGRMTTCANFNFTSA 435

BLAST of Cla97C05G086740 vs. NCBI nr
Match: XP_022936844.1 (basic 7S globulin-like [Cucurbita moschata])

HSP 1 Score: 749.2 bits (1933), Expect = 7.8e-213
Identity = 374/435 (85.98%), Postives = 406/435 (93.33%), Query Frame = 0

Query: 1   MAASTSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYITHIHQRTPLVPLKLT 60
           MA S SLSFFSS+LFLL S +IAATSFRPK+L+LPVTKHPSLQYIT I QRTPLVP+KLT
Sbjct: 1   MATSISLSFFSSVLFLLLSSAIAATSFRPKALVLPVTKHPSLQYITQIRQRTPLVPVKLT 60

Query: 61  VDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGLFP 120
           VDLG QFMWVDCDRGY+SSTYKPARCRSAQC+LASKS+ CG+CFSPPRPGCNNNTC LFP
Sbjct: 61  VDLGSQFMWVDCDRGYISSTYKPARCRSAQCNLASKSSGCGQCFSPPRPGCNNNTCSLFP 120

Query: 121 GNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGF 180
           GNTII LSTSGE+ASDVVSVSST+GF PT+ V+VPNFLFVCGSTFLLDGLAGGVTGMAGF
Sbjct: 121 GNTIIHLSTSGELASDVVSVSSTDGFNPTKPVTVPNFLFVCGSTFLLDGLAGGVTGMAGF 180

Query: 181 GRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLF 240
           GR GISLPSQF+AAFSFNRKFAVCLSGSTR PGVIFSGNGPY+FLPN+DLT SLTYTPLF
Sbjct: 181 GRNGISLPSQFSAAFSFNRKFAVCLSGSTRFPGVIFSGNGPYHFLPNIDLTDSLTYTPLF 240

Query: 241 INPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTV 300
           INPVSTAGV +AG+KS+EYFIGVKSIVINSKTVPLNTTLLKID NG GGTKISTV+PYTV
Sbjct: 241 INPVSTAGVFTAGEKSTEYFIGVKSIVINSKTVPLNTTLLKIDSNGIGGTKISTVDPYTV 300

Query: 301 LESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVI 360
           LESSIYNAV+KTFTTEL  +PRVAAVAPFG C+++KS SSTRLG GVPSI+LILQNKKVI
Sbjct: 301 LESSIYNAVLKTFTTELKNVPRVAAVAPFGACFNAKSISSTRLGPGVPSIELILQNKKVI 360

Query: 361 WRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLL 420
           WRIFGANSMV + D+VLCLGFVDGGV PRT+IVIGAHQIEDNLLEFD+A+SRLGFS+TLL
Sbjct: 361 WRIFGANSMVQVKDDVLCLGFVDGGVNPRTSIVIGAHQIEDNLLEFDMATSRLGFSATLL 420

Query: 421 GRMTNCGNFNFTSSP 436
           GRMT C NFNFTS P
Sbjct: 421 GRMTTCANFNFTSKP 435

BLAST of Cla97C05G086740 vs. NCBI nr
Match: XP_023538677.1 (basic 7S globulin-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 748.4 bits (1931), Expect = 1.3e-212
Identity = 374/435 (85.98%), Postives = 407/435 (93.56%), Query Frame = 0

Query: 1   MAASTSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYITHIHQRTPLVPLKLT 60
           MA STSLSFFSS+LFLLFS ++A TSFRPK+L+LPVTKHPSLQYIT I QRTPLVP+KLT
Sbjct: 1   MATSTSLSFFSSVLFLLFS-AVATTSFRPKALVLPVTKHPSLQYITQIRQRTPLVPVKLT 60

Query: 61  VDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGLFP 120
           VDLG QFMWVDCDRGY+SSTYKPARCRSAQC+LASKS+ CG+CFSPPRPGCNNNTC LFP
Sbjct: 61  VDLGSQFMWVDCDRGYISSTYKPARCRSAQCNLASKSSGCGQCFSPPRPGCNNNTCSLFP 120

Query: 121 GNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGF 180
           GNTII LSTSGE+ASDVVSVSST+GF PT+ V+VPNFLFVCGSTFLLDGLAGGVTGMAGF
Sbjct: 121 GNTIIHLSTSGELASDVVSVSSTDGFNPTKPVTVPNFLFVCGSTFLLDGLAGGVTGMAGF 180

Query: 181 GRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLF 240
           GR GISLPSQF+AAFSFNRKFAVCLSGSTR PGVIFSGNGPY+FLPN+DLT SLTYTPLF
Sbjct: 181 GRNGISLPSQFSAAFSFNRKFAVCLSGSTRFPGVIFSGNGPYHFLPNIDLTDSLTYTPLF 240

Query: 241 INPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTV 300
           INPVSTAGV SAG+KS+EYFIGVKSI+INSKTVPLNTTLLKID NG GGTKISTV+PYTV
Sbjct: 241 INPVSTAGVFSAGEKSTEYFIGVKSILINSKTVPLNTTLLKIDSNGVGGTKISTVDPYTV 300

Query: 301 LESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVI 360
           LESSIYNAV+KTFTTEL  +PRVAAVAPFG C+++KS SSTRLG GVPSI+LILQNKKVI
Sbjct: 301 LESSIYNAVLKTFTTELKNVPRVAAVAPFGACFNAKSISSTRLGPGVPSIELILQNKKVI 360

Query: 361 WRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLL 420
           WRIFGANSMV + D+VLCLGFVDGGV PRT+IVIGAHQIEDNLLEFD+A+SRLGFS+TLL
Sbjct: 361 WRIFGANSMVQVKDDVLCLGFVDGGVNPRTSIVIGAHQIEDNLLEFDMATSRLGFSATLL 420

Query: 421 GRMTNCGNFNFTSSP 436
           GRMT C NFNFTS P
Sbjct: 421 GRMTTCANFNFTSKP 434

BLAST of Cla97C05G086740 vs. TrEMBL
Match: tr|A0A0A0L515|A0A0A0L515_CUCSA (Xyloglucan-specific endoglucanase inhibitor protein OS=Cucumis sativus OX=3659 GN=Csa_3G150000 PE=3 SV=1)

HSP 1 Score: 761.5 bits (1965), Expect = 1.0e-216
Identity = 383/434 (88.25%), Postives = 407/434 (93.78%), Query Frame = 0

Query: 1   MAASTSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYITHIHQRTPLVPLKLT 60
           MA+STS SFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYIT IHQRTPLVP+KLT
Sbjct: 1   MASSTSFSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYITEIHQRTPLVPVKLT 60

Query: 61  VDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGLFP 120
           VDLGGQFMWVDCDRGYVSS+YKPARCRSAQC LASKS+ CG+CFSPPRPGCNNNTC LFP
Sbjct: 61  VDLGGQFMWVDCDRGYVSSSYKPARCRSAQCSLASKSSACGQCFSPPRPGCNNNTCSLFP 120

Query: 121 GNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGF 180
           GNTII LSTSGEVASDVVSVSSTNGF PTRAVS+PNFLFVCGSTFLL+GLA GVTGMAGF
Sbjct: 121 GNTIIRLSTSGEVASDVVSVSSTNGFNPTRAVSIPNFLFVCGSTFLLEGLAPGVTGMAGF 180

Query: 181 GRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLF 240
           GR GISLPSQFAAAFSFNRKFAVCLSGST SPGVIFSGNGPY+FLPN+DLT S TYTPLF
Sbjct: 181 GRNGISLPSQFAAAFSFNRKFAVCLSGSTSSPGVIFSGNGPYHFLPNIDLTNSFTYTPLF 240

Query: 241 INPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTV 300
           INPVSTAGVSSAG+KS+EYFIGV SIV+NSK VPLNTTLLKID NGNGGTKISTVNP+TV
Sbjct: 241 INPVSTAGVSSAGEKSTEYFIGVTSIVVNSKPVPLNTTLLKIDSNGNGGTKISTVNPFTV 300

Query: 301 LESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVI 360
           LESSIY A+VK FTTE+SK+PRV AVAPF VCYSSKSF STRLGAGVP+IDL+LQNKKVI
Sbjct: 301 LESSIYKALVKAFTTEVSKVPRVGAVAPFEVCYSSKSFPSTRLGAGVPTIDLVLQNKKVI 360

Query: 361 WRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLL 420
           W +FGANSMV +NDEVLCLGFVDGGV+ RTAIVIGAHQIED LLEFDLA+SRLGF+ TLL
Sbjct: 361 WSMFGANSMVQVNDEVLCLGFVDGGVDVRTAIVIGAHQIEDKLLEFDLATSRLGFTPTLL 420

Query: 421 GRMTNCGNFNFTSS 435
           GRMT C NFNFTS+
Sbjct: 421 GRMTTCANFNFTSN 434

BLAST of Cla97C05G086740 vs. TrEMBL
Match: tr|A0A1S3AWQ8|A0A1S3AWQ8_CUCME (basic 7S globulin-like OS=Cucumis melo OX=3656 GN=LOC103483742 PE=3 SV=1)

HSP 1 Score: 758.4 bits (1957), Expect = 8.5e-216
Identity = 380/434 (87.56%), Postives = 409/434 (94.24%), Query Frame = 0

Query: 1   MAASTSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYITHIHQRTPLVPLKLT 60
           MA+STS SFFSSILFLLFSISIAATSFRPKSL+LPVTKHPSLQYIT +HQRTPLVP+KLT
Sbjct: 1   MASSTSFSFFSSILFLLFSISIAATSFRPKSLVLPVTKHPSLQYITEVHQRTPLVPVKLT 60

Query: 61  VDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGLFP 120
           VDLGGQFMWVDCDR YVSS+YKPARCRSAQC LASKS++CG+CFSPPRPGCNN+TCGLFP
Sbjct: 61  VDLGGQFMWVDCDRDYVSSSYKPARCRSAQCSLASKSSSCGQCFSPPRPGCNNDTCGLFP 120

Query: 121 GNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGF 180
            NTII LSTSGEVASDVVSVSSTNGF PTRAVS+PNFLFVCGSTFLL+GLA GVTGMAGF
Sbjct: 121 SNTIIRLSTSGEVASDVVSVSSTNGFNPTRAVSIPNFLFVCGSTFLLEGLAPGVTGMAGF 180

Query: 181 GRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLF 240
           GR GISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPY+FLPN+DLT S TYTPLF
Sbjct: 181 GRNGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYHFLPNIDLTNSFTYTPLF 240

Query: 241 INPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTV 300
           INPVSTAGVSSAG+KS+EYFIGV SIV+NSK VPLNTTLLKID NGNGGTKISTVNP+TV
Sbjct: 241 INPVSTAGVSSAGEKSTEYFIGVTSIVVNSKPVPLNTTLLKIDSNGNGGTKISTVNPFTV 300

Query: 301 LESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVI 360
           LESSIY A+VK FTTE+SK+PRVAAVAPF VCY+SKSF STRLGAGVP+IDL+LQNKKVI
Sbjct: 301 LESSIYKALVKAFTTEVSKVPRVAAVAPFEVCYNSKSFPSTRLGAGVPTIDLVLQNKKVI 360

Query: 361 WRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLL 420
           W IFGANSMV +ND+VLCLGFVDGGV+ RTAIVIGAHQIED LLEFDLA+SRLGF+ TLL
Sbjct: 361 WSIFGANSMVQVNDDVLCLGFVDGGVDVRTAIVIGAHQIEDKLLEFDLATSRLGFTPTLL 420

Query: 421 GRMTNCGNFNFTSS 435
           GRMT C NFNFTS+
Sbjct: 421 GRMTTCANFNFTSN 434

BLAST of Cla97C05G086740 vs. TrEMBL
Match: tr|A0A1S3AXR0|A0A1S3AXR0_CUCME (basic 7S globulin-like OS=Cucumis melo OX=3656 GN=LOC103483739 PE=3 SV=1)

HSP 1 Score: 755.0 bits (1948), Expect = 9.4e-215
Identity = 386/436 (88.53%), Postives = 408/436 (93.58%), Query Frame = 0

Query: 1   MAAST--SLSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYITHIHQRTPLVPLK 60
           MAAST  S SFFSSILFLLFSISIAATSFRPKSL+LPVTKHPS QYIT I QRTPLVP+K
Sbjct: 1   MAASTSFSFSFFSSILFLLFSISIAATSFRPKSLVLPVTKHPSGQYITQIRQRTPLVPVK 60

Query: 61  LTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGL 120
           LTVDLGG+FMWVDCD GYVSS+YKP RCRSAQC L SKST+CGECFSPPRPGCNNNTCG 
Sbjct: 61  LTVDLGGRFMWVDCDSGYVSSSYKPVRCRSAQCSL-SKSTSCGECFSPPRPGCNNNTCGH 120

Query: 121 FPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMA 180
           FPGNTII LSTSGEV +DVVSVSSTNGF PTRAVSVPNF+FVCG TFLL+GL GGV+GMA
Sbjct: 121 FPGNTIIQLSTSGEVTTDVVSVSSTNGFNPTRAVSVPNFIFVCGPTFLLEGLNGGVSGMA 180

Query: 181 GFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTP 240
           GFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPY+FLPNVDLTKSLTYTP
Sbjct: 181 GFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYHFLPNVDLTKSLTYTP 240

Query: 241 LFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPY 300
           LFINPVSTAGVS++G+KSSEYFIGVKSIV NSKTVPLNTTLLKID NGNGGTKIST++PY
Sbjct: 241 LFINPVSTAGVSTSGEKSSEYFIGVKSIVFNSKTVPLNTTLLKIDRNGNGGTKISTIHPY 300

Query: 301 TVLESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKK 360
           TVLESSIYNA+VKT TTEL  IPRVAAVAPFGVCY SKSF STRLG G+PSIDLILQNKK
Sbjct: 301 TVLESSIYNALVKTITTELRNIPRVAAVAPFGVCYKSKSFGSTRLGPGMPSIDLILQNKK 360

Query: 361 VIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSST 420
           VIWRIFGANSMV +ND+VLCLGFVDGGVE RTAIVIGAHQ+EDNLLEFDLA+SRLGFSST
Sbjct: 361 VIWRIFGANSMVQVNDDVLCLGFVDGGVEARTAIVIGAHQMEDNLLEFDLATSRLGFSST 420

Query: 421 LLGRMTNCGNFNFTSS 435
           LLGRMT C NFNFTS+
Sbjct: 421 LLGRMTTCANFNFTSA 435

BLAST of Cla97C05G086740 vs. TrEMBL
Match: tr|A0A0A0L5N4|A0A0A0L5N4_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G149980 PE=3 SV=1)

HSP 1 Score: 746.1 bits (1925), Expect = 4.4e-212
Identity = 381/434 (87.79%), Postives = 405/434 (93.32%), Query Frame = 0

Query: 1   MAASTSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYITHIHQRTPLVPLKLT 60
           MAASTS SF  SILFLLFSIS AATSFRPKSLLLPVTKHPS QYIT I QRTPLVP+KLT
Sbjct: 1   MAASTSFSF--SILFLLFSISFAATSFRPKSLLLPVTKHPSGQYITQIRQRTPLVPVKLT 60

Query: 61  VDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGLFP 120
           VDLGGQFMWVDCDRGYVSS+YKP RCRSAQC L SKST+CG+CFSPPRPGCNNNTCG FP
Sbjct: 61  VDLGGQFMWVDCDRGYVSSSYKPVRCRSAQCSL-SKSTSCGDCFSPPRPGCNNNTCGHFP 120

Query: 121 GNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGF 180
           GNTII LSTSGEV SDV+SVSSTNGF PTRAVS+PNFLFVCG TFLL+GLAGGV+GMAGF
Sbjct: 121 GNTIIQLSTSGEVTSDVLSVSSTNGFNPTRAVSIPNFLFVCGPTFLLEGLAGGVSGMAGF 180

Query: 181 GRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLF 240
           GRTGISLPSQF+AAFSFNRKFAVCLSGSTRSPGVIFSGNGPY+FL NVD+TKSLTYTPLF
Sbjct: 181 GRTGISLPSQFSAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYHFLQNVDVTKSLTYTPLF 240

Query: 241 INPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTV 300
           INPVSTAGVS++G+KSSEYFIGVKSIV NSKTVP+NTTLLKID NGNGGTKISTV+PYTV
Sbjct: 241 INPVSTAGVSTSGEKSSEYFIGVKSIVFNSKTVPINTTLLKIDSNGNGGTKISTVHPYTV 300

Query: 301 LESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVI 360
           LESSIYNA+VKT T EL  IPRVAAVAPFGVCY SKSF STRLG G+PSIDLILQNKKVI
Sbjct: 301 LESSIYNALVKTITRELRNIPRVAAVAPFGVCYKSKSFGSTRLGPGMPSIDLILQNKKVI 360

Query: 361 WRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLL 420
           WRIFGANSMV +N+EVLCLGFVDGGVE RTAIVIGA+Q+EDNLLEFDLA+SRLGFSSTLL
Sbjct: 361 WRIFGANSMVQVNEEVLCLGFVDGGVEARTAIVIGAYQMEDNLLEFDLATSRLGFSSTLL 420

Query: 421 GRMTNCGNFNFTSS 435
           GRMT C NFNFTS+
Sbjct: 421 GRMTTCANFNFTST 431

BLAST of Cla97C05G086740 vs. TrEMBL
Match: tr|A0A1S3AXS3|A0A1S3AXS3_CUCME (basic 7S globulin-like OS=Cucumis melo OX=3656 GN=LOC103483740 PE=3 SV=1)

HSP 1 Score: 680.2 bits (1754), Expect = 3.0e-192
Identity = 354/430 (82.33%), Postives = 377/430 (87.67%), Query Frame = 0

Query: 3   ASTSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYITHIHQRTPLVPLKLTVD 62
           AS S S  SSILFLLFSISIAATSF PKSL+LPV KHPSLQYI  IHQRTPLVP+ LTVD
Sbjct: 2   ASFSFSSSSSILFLLFSISIAATSFTPKSLVLPVIKHPSLQYIIQIHQRTPLVPVNLTVD 61

Query: 63  LGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGLFPGN 122
           LGG+FMWVDCDRGYVSS+YKPARCRSAQC+LA KS +CG+C+ PP PGCNN+TC L   N
Sbjct: 62  LGGRFMWVDCDRGYVSSSYKPARCRSAQCYLA-KSISCGKCYLPPHPGCNNHTCHLPAEN 121

Query: 123 TIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGR 182
           T+I LS+SGEV SDVVSVSSTN F PTRA+SV NFLFVC STFLL+GLAGGVTGMAGFGR
Sbjct: 122 TVIQLSSSGEVTSDVVSVSSTNDFNPTRALSVHNFLFVCSSTFLLEGLAGGVTGMAGFGR 181

Query: 183 TGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFIN 242
           T ISLPSQFAAAFSFNRKF VCLSGSTR PGVIFSG GPY+FLPN DLT SLTYTPLFIN
Sbjct: 182 TRISLPSQFAAAFSFNRKFTVCLSGSTRFPGVIFSGYGPYHFLPNTDLTNSLTYTPLFIN 241

Query: 243 PVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTVLE 302
           P     +  AG+KSSEYFIGVKSI  NSKTVPLNTTLLKID NGNGGTKISTVNPYTVLE
Sbjct: 242 P-----LGFAGEKSSEYFIGVKSIEFNSKTVPLNTTLLKIDRNGNGGTKISTVNPYTVLE 301

Query: 303 SSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWR 362
            SIYNA+VKTFTTEL  IPRV AVAPF VCYSSKSF ST LG GV SIDLILQNKKVIWR
Sbjct: 302 PSIYNALVKTFTTELGNIPRVDAVAPFEVCYSSKSFGSTELGPGVASIDLILQNKKVIWR 361

Query: 363 IFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGR 422
           +FGANSMV +NDEVLCLGFVDGGVE +TA+VIGAHQIEDNLLEFDLA+SRLGFSSTLLGR
Sbjct: 362 MFGANSMVLVNDEVLCLGFVDGGVEAKTAVVIGAHQIEDNLLEFDLATSRLGFSSTLLGR 421

Query: 423 MTNCGNFNFT 433
            TNC NFN +
Sbjct: 422 NTNCANFNLS 425

BLAST of Cla97C05G086740 vs. Swiss-Prot
Match: sp|P13917|7SB1_SOYBN (Basic 7S globulin OS=Glycine max OX=3847 GN=BG PE=1 SV=2)

HSP 1 Score: 251.9 bits (642), Expect = 1.3e-65
Identity = 169/445 (37.98%), Postives = 241/445 (54.16%), Query Frame = 0

Query: 1   MAASTSLSFFSSILFLLFSISIAATSFRPKSL-LLPVTKHPSL-QYITHIHQRTPLVPLK 60
           +A S S SF       LF +S + T  +P +L +LPV    S   +  ++ +RTPL+ + 
Sbjct: 9   LALSLSCSF-------LFFLSDSVTPTKPINLVVLPVQNDGSTGLHWANLQKRTPLMQVP 68

Query: 61  LTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGL 120
           + VDL G  +WV+C++ Y S TY+   C S QC  A+ +  C  C +  RPGC+ NTCGL
Sbjct: 69  VLVDLNGNHLWVNCEQQYSSKTYQAPFCHSTQCSRAN-THQCLSCPAASRPGCHKNTCGL 128

Query: 121 FPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRA--VSVPNFLFVCGSTFLLD-GLAGGVT 180
              N I   +  GE+  DV+++ +T G        V+VP FLF C  +FL+  GL     
Sbjct: 129 MSTNPITQQTGLGELGEDVLAIHATQGSTQQLGPLVTVPQFLFSCAPSFLVQKGLPRNTQ 188

Query: 181 GMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNF--LPNVDLTKS 240
           G+AG G   ISLP+Q A+ F   R+F  CLS    S G I  G+ P N     N D+   
Sbjct: 189 GVAGLGHAPISLPNQLASHFGLQRQFTTCLSRYPTSKGAIIFGDAPNNMRQFQNQDIFHD 248

Query: 241 LTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTV-PLNTTLLKIDENGNGGTKI 300
           L +TPL I                EY + V SI IN  +V PLN     I  + +GGT I
Sbjct: 249 LAFTPLTIT------------LQGEYNVRVNSIRINQHSVFPLNKISSTIVGSTSGGTMI 308

Query: 301 STVNPYTVLESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDL 360
           ST  P+ VL+ S+Y A  + F  +L K  +V +VAPFG+C++S   ++       PS+DL
Sbjct: 309 STSTPHMVLQQSVYQAFTQVFAQQLPKQAQVKSVAPFGLCFNSNKINA------YPSVDL 368

Query: 361 ILQNKK-VIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASS 420
           ++      +WRI G + MV     V CLG ++GG++PR  I +GA Q+E+NL+ FDLA S
Sbjct: 369 VMDKPNGPVWRISGEDLMVQAQPGVTCLGVMNGGMQPRAEITLGARQLEENLVVFDLARS 427

Query: 421 RLGFS-STLLGRMTNCGN-FNFTSS 435
           R+GFS S+L      C + FNF ++
Sbjct: 429 RVGFSTSSLHSHGVKCADLFNFANA 427

BLAST of Cla97C05G086740 vs. Swiss-Prot
Match: sp|Q8RVH5|7SBG2_SOYBN (Basic 7S globulin 2 OS=Glycine max OX=3847 PE=1 SV=1)

HSP 1 Score: 241.9 bits (616), Expect = 1.3e-62
Identity = 158/438 (36.07%), Postives = 229/438 (52.28%), Query Frame = 0

Query: 10  FSSILFLLFSISIAATSFRPKS----LLLPVTKHPSL-QYITHIHQRTPLVPLKLTVDLG 69
           FS + FL  S+ I      P      L+LPV    S   +  ++ +RTPL+ + + VDL 
Sbjct: 15  FSFLFFLSDSVPIPQHHTNPTKPINLLVLPVQNDASTGLHWANLQKRTPLMQVPVLVDLN 74

Query: 70  GQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGLFPGNTI 129
           G  +WV+C++ Y S TY+   C S QC  A+ +  C  C +  RPG   NTCGL   N I
Sbjct: 75  GNHLWVNCEQHYSSKTYQAPFCHSTQCSRAN-THQCLSCPAASRPGXXXNTCGLMSTNPI 134

Query: 130 IGLSTSGEVASDVVSVSSTNGFIPTRA--VSVPNFLFVCGSTFLLD-GLAGGVTGMAGFG 189
              +  GE+  DV+++ +T G        V+VP FLF C  +FLL  GL   + G+AG G
Sbjct: 135 TQQTGLGELGQDVLAIHATQGSTQQLGPLVTVPQFLFSCAPSFLLQKGLPRNIQGVAGLG 194

Query: 190 RTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNF--LPNVDLTKSLTYTPL 249
              ISLP+Q A+ F    +F  CLS    S G +  G+ P N     N D+   L +TPL
Sbjct: 195 HAPISLPNQLASHFGLQHQFTTCLSRYPTSKGALIFGDAPNNMQQFHNQDIFHDLAFTPL 254

Query: 250 FINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYT 309
            + P              EY + V SI IN  +V     +       +GGT IST  P+ 
Sbjct: 255 TVTP------------QGEYNVRVSSIRINQHSVFPPNKISSTIVGSSGGTMISTSTPHM 314

Query: 310 VLESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKK- 369
           VL+ S+Y A  + F  +L K  +V +VAPFG+C++S   ++       PS+DL++     
Sbjct: 315 VLQQSLYQAFTQVFAQQLEKQAQVKSVAPFGLCFNSNKINA------YPSVDLVMDKPNG 374

Query: 370 VIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFS-S 429
            +WRI G + MV     V CLG ++GG++PR  + +G  Q+E+ L+ FDLA SR+GFS S
Sbjct: 375 PVWRISGEDLMVQAQPGVTCLGVMNGGMQPRAEVTLGTRQLEEKLMVFDLARSRVGFSTS 433

Query: 430 TLLGRMTNCGN-FNFTSS 435
           +L      CG+ FNF ++
Sbjct: 435 SLHSHGVKCGDLFNFANA 433

BLAST of Cla97C05G086740 vs. Swiss-Prot
Match: sp|Q9LNJ3|APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 78.2 bits (191), Expect = 2.5e-13
Identity = 95/396 (23.99%), Postives = 157/396 (39.65%), Query Frame = 0

Query: 43  QYITHIHQRTPLVPLKLTVDLGGQFMWVD---CDRGY----------VSSTYKPARCRSA 102
           +Y T +   TP   + + +D G   +W+    C R Y           S TY    C S 
Sbjct: 141 EYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSP 200

Query: 103 QCHLASKSTTCGECFSPPRPGCN--NNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFI 162
            C     +            GCN    TC L+  +   G  T G+ +++ ++        
Sbjct: 201 HCRRLDSA------------GCNTRRKTC-LYQVSYGDGSFTVGDFSTETLTF------- 260

Query: 163 PTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCL-- 222
             R   V      CG     +GL  G  G+ G G+  +S P Q      FN+KF+ CL  
Sbjct: 261 --RRNRVKGVALGCGHD--NEGLFVGAAGLLGLGKGKLSFPGQ--TGHRFNQKFSYCLVD 320

Query: 223 -SGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVK 282
            S S++   V+F          N  +++   +TPL  NP             + Y++G+ 
Sbjct: 321 RSASSKPSSVVFG---------NAAVSRIARFTPLLSNP----------KLDTFYYVGLL 380

Query: 283 SIVINSKTVP-LNTTLLKIDENGNGGTKISTVNPYTVLESSIYNAVVKTFTTELSKIPRV 342
            I +    VP +  +L K+D+ GNGG  I +    T L    Y A+   F      + R 
Sbjct: 381 GISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRA 440

Query: 343 AAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSIN-DEVLCLGFV 402
              + F  C+   + +  +    VP++ L  +   V   +   N ++ ++ +   C  F 
Sbjct: 441 PDFSLFDTCFDLSNMNEVK----VPTVVLHFRGADV--SLPATNYLIPVDTNGKFCFAFA 480

Query: 403 D--GGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFS 417
              GG+      +IG  Q +   + +DLASSR+GF+
Sbjct: 501 GTMGGLS-----IIGNIQQQGFRVVYDLASSRVGFA 480

BLAST of Cla97C05G086740 vs. Swiss-Prot
Match: sp|Q9LZL3|PCS1L_ARATH (Aspartic proteinase PCS1 OS=Arabidopsis thaliana OX=3702 GN=PCS1 PE=2 SV=1)

HSP 1 Score: 78.2 bits (191), Expect = 2.5e-13
Identity = 90/393 (22.90%), Postives = 153/393 (38.93%), Query Frame = 0

Query: 52  TPLVPLKLTVDLGGQFMWVDCDRG-----------YVSSTYKPARCRSAQCHLASKSTTC 111
           TP   + + +D G +  W+ C+R              SS+Y P  C S  C   ++    
Sbjct: 81  TPPQNISMVIDTGSELSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD--- 140

Query: 112 GECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSV-SSTNGFIPTRAVSVPNFLF 171
              F  P   C+++       +     S+ G +A+++    +STN           N +F
Sbjct: 141 ---FLIP-ASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTND---------SNLIF 200

Query: 172 VC-GSTFLLDGLAG-GVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFS 231
            C GS    D       TG+ G  R  +S  SQ         KF+ C+SG+   PG +  
Sbjct: 201 GCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFP-----KFSYCISGTDDFPGFLLL 260

Query: 232 GNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNT 291
           G+  + +L           TPL   P+              Y + +  I +N K +P+  
Sbjct: 261 GDSNFTWL-----------TPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPK 320

Query: 292 TLLKIDENGNGGTKISTVNPYTVLESSIYNAVVKTFTTELSKI------PRVAAVAPFGV 351
           ++L  D  G G T + +   +T L   +Y A+   F    + I      P         +
Sbjct: 321 SVLVPDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDL 380

Query: 352 CYSSKSFSSTRLGAGV----PSIDLILQNKKVIWR----IFGANSMVSINDEVLCLGFVD 411
           CY     S  R+ +G+    P++ L+ +  ++       ++    +   ND V C  F +
Sbjct: 381 CY---RISPVRIRSGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGN 438

Query: 412 GGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFS 417
             +    A VIG H  ++  +EFDL  SR+G +
Sbjct: 441 SDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLA 438

BLAST of Cla97C05G086740 vs. Swiss-Prot
Match: sp|Q8S9J6|ASPA_ARATH (Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At5g10770 PE=2 SV=1)

HSP 1 Score: 74.7 bits (182), Expect = 2.8e-12
Identity = 93/405 (22.96%), Postives = 149/405 (36.79%), Query Frame = 0

Query: 30  KSLLLPVTKHPSL---QYITHIHQRTPLVPLKLTVDLGGQFMWVDCD------------- 89
           KS  LP     +L    YI  +   TP   L L  D G    W  C              
Sbjct: 115 KSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPI 174

Query: 90  -RGYVSSTYKPARCRSAQC-HLASKSTTCGECFSPPRPGCNNNTCGLFPGNTIIGLSTSG 149
                S++Y    C SA C  L+S +   G C +                N I G+    
Sbjct: 175 FNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSA---------------SNCIYGIQYGD 234

Query: 150 EVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQF 209
           +  S  V   +   F  T +       F CG      GL  GV G+ G GR  +S PSQ 
Sbjct: 235 QSFS--VGFLAKEKFTLTNSDVFDGVYFGCGEN--NQGLFTGVAGLLGLGRDKLSFPSQT 294

Query: 210 AAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSS 269
           A A  +N+ F+ CL  S    G +  G        +  +++S+ +TP          +S+
Sbjct: 295 ATA--YNKIFSYCLPSSASYTGHLTFG--------SAGISRSVKFTP----------IST 354

Query: 270 AGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTVLESSIYNAVVK 329
             D +S Y + + +I +  + +P+ +T+         G  I +    T L    Y A+  
Sbjct: 355 ITDGTSFYGLNIVAITVGGQKLPIPSTVF-----STPGALIDSGTVITRLPPKAYAALRS 414

Query: 330 TFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVS 389
           +F  ++SK P  + V+    C+    F +      +P +        V+  +        
Sbjct: 415 SFKAKMSKYPTTSGVSILDTCFDLSGFKTVT----IPKVAFSFSGGAVV-ELGSKGIFYV 469

Query: 390 INDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFS 417
                +CL F  G  +   A + G  Q +   + +D A  R+GF+
Sbjct: 475 FKISQVCLAFA-GNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFA 469

BLAST of Cla97C05G086740 vs. TAIR10
Match: AT1G03220.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 521.9 bits (1343), Expect = 3.7e-148
Identity = 277/430 (64.42%), Postives = 326/430 (75.81%), Query Frame = 0

Query: 10  FSSILFLLFSISIAA-TSFRPKSLLLPVTKHPS-LQYITHIHQRTPLVPLKLTVDLGGQF 69
           FS +L  +FS+S +A T FRPK+LLLPVTK  S LQY T I+QRTPLVP  +  DLGG+ 
Sbjct: 8   FSVLLLFIFSLSSSAQTPFRPKALLLPVTKDQSTLQYTTVINQRTPLVPASVVFDLGGRE 67

Query: 70  MWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGLFPGNTIIGL 129
           +WVDCD+GYVSSTY+  RC SA C  A  ST+CG CFSPPRPGC+NNTCG  P NT+ G 
Sbjct: 68  LWVDCDKGYVSSTYQSPRCNSAVCSRAG-STSCGTCFSPPRPGCSNNTCGGIPDNTVTGT 127

Query: 130 STSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISL 189
           +TSGE A DVVS+ STNG  P R V +PN +F CG+TFLL GLA G  GMAG GR  I L
Sbjct: 128 ATSGEFALDVVSIQSTNGSNPGRVVKIPNLIFDCGATFLLKGLAKGTVGMAGMGRHNIGL 187

Query: 190 PSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTA 249
           PSQFAAAFSF+RKFAVCL   T   GV F GNGPY FLP + ++ SL  TPL INPVSTA
Sbjct: 188 PSQFAAAFSFHRKFAVCL---TSGKGVAFFGNGPYVFLPGIQIS-SLQTTPLLINPVSTA 247

Query: 250 GVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKID-ENGNGGTKISTVNPYTVLESSIY 309
              S G+KSSEYFIGV +I I  KTVP+N TLLKI+   G GGTKIS+VNPYTVLESSIY
Sbjct: 248 SAFSQGEKSSEYFIGVTAIQIVEKTVPINPTLLKINASTGIGGTKISSVNPYTVLESSIY 307

Query: 310 NAVVKTFTTELS--KIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIF 369
           NA    F  + +   I RVA+V PFG C+S+K+   TRLG  VP I+L+L +K V+WRIF
Sbjct: 308 NAFTSEFVKQAAARSIKRVASVKPFGACFSTKNVGVTRLGYAVPEIELVLHSKDVVWRIF 367

Query: 370 GANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMT 429
           GANSMVS++D+V+CLGFVDGGV  RT++VIG  Q+EDNL+EFDLAS++ GFSSTLLGR T
Sbjct: 368 GANSMVSVSDDVICLGFVDGGVNARTSVVIGGFQLEDNLIEFDLASNKFGFSSTLLGRQT 427

Query: 430 NCGNFNFTSS 435
           NC NFNFTS+
Sbjct: 428 NCANFNFTST 432

BLAST of Cla97C05G086740 vs. TAIR10
Match: AT1G03230.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 501.5 bits (1290), Expect = 5.2e-142
Identity = 270/437 (61.78%), Postives = 321/437 (73.46%), Query Frame = 0

Query: 3   ASTSLSFFSSILFLLFSISIAA-TSFRPKSLLLPVTKHPS-LQYITHIHQRTPLVPLKLT 62
           AS+ +  FS +L  +FS+S +A  SFRPK+LLLPVTK PS LQY T I+QRTPLVP  + 
Sbjct: 2   ASSRIIIFSVLLLSIFSLSSSAQPSFRPKALLLPVTKDPSTLQYTTVINQRTPLVPASVV 61

Query: 63  VDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGLFP 122
            DLGG+  WVDCD+GYVS+TY+  RC SA C  A  S  CG CFSPPRPGC+NNTCG FP
Sbjct: 62  FDLGGREFWVDCDQGYVSTTYRSPRCNSAVCSRAG-SIACGTCFSPPRPGCSNNTCGAFP 121

Query: 123 GNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGF 182
            N+I G +TSGE A DVVS+ STNG  P R V +PN +F CGST LL GLA G  GMAG 
Sbjct: 122 DNSITGWATSGEFALDVVSIQSTNGSNPGRFVKIPNLIFSCGSTSLLKGLAKGAVGMAGM 181

Query: 183 GRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLF 242
           GR  I LP QFAAAFSFNRKFAVCL   T   GV F GNGPY FLP + +++ L  TPL 
Sbjct: 182 GRHNIGLPLQFAAAFSFNRKFAVCL---TSGRGVAFFGNGPYVFLPGIQISR-LQKTPLL 241

Query: 243 INPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKID-ENGNGGTKISTVNPYT 302
           INP +T    S G+KS EYFIGV +I I  KT+P++ TLLKI+   G GGTKIS+VNPYT
Sbjct: 242 INPGTTVFEFSKGEKSPEYFIGVTAIKIVEKTLPIDPTLLKINASTGIGGTKISSVNPYT 301

Query: 303 VLESSIYNAVVKTFTTELS--KIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNK 362
           VLESSIY A    F  + +   I RVA+V PFG C+S+K+   TRLG  VP I L+L +K
Sbjct: 302 VLESSIYKAFTSEFIRQAAARSIKRVASVKPFGACFSTKNVGVTRLGYAVPEIQLVLHSK 361

Query: 363 KVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSS 422
            V+WRIFGANSMVS++D+V+CLGFVDGGV P  ++VIG  Q+EDNL+EFDLAS++ GFSS
Sbjct: 362 DVVWRIFGANSMVSVSDDVICLGFVDGGVNPGASVVIGGFQLEDNLIEFDLASNKFGFSS 421

Query: 423 TLLGRMTNCGNFNFTSS 435
           TLLGR TNC NFNFTS+
Sbjct: 422 TLLGRQTNCANFNFTST 433

BLAST of Cla97C05G086740 vs. TAIR10
Match: AT5G19100.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 191.4 bits (485), Expect = 1.1e-48
Identity = 150/425 (35.29%), Postives = 210/425 (49.41%), Query Frame = 0

Query: 13  ILFLLFS---ISIAATSF---RPKSLLLPVTKHPSLQYITHIHQRTPLVPLKLTVDLGGQ 72
           ++FLL S   + +A TS    + +S L P+ K  +    T           K  +DL G 
Sbjct: 5   VIFLLLSLVFLYLANTSHSLRKFQSFLHPIYKDTAKNIYTIPLSIGSTSSEKFVLDLNGA 64

Query: 73  F-MWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGLFPGNTII 132
             +  +C     S+TY P RC S +C  A+           P   C NN   +    T+ 
Sbjct: 65  APLLQNCPTAAKSTTYHPIRCGSTRCKYAN-----------PNFPCPNNV--IAKKRTVC 124

Query: 133 GLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGI 192
             S +  +  D V +  T   + TR   + + L    +    DG         G   T +
Sbjct: 125 LSSDNSRLFRDTVPLLYTFNGVYTRDSEMSSSL----TLTCTDGAPALKQRTIGLANTHL 184

Query: 193 SLPSQFAAAFSFNRKFAVCLSGSTRSP---GVIFSGNGPYNFLP-NVDLTKSLTYTPLFI 252
           S+PSQ  + +    K A+CL  + RS    G ++ G G Y +LP + D++K    TPL  
Sbjct: 185 SIPSQLISMYQLPHKIALCLPSTERSQSHNGDLWIGKGEYYYLPYDKDVSKIFASTPLIG 244

Query: 253 NPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTVL 312
           N            KS EY I VKSI I +KTVP+            G TKIST+ PYTV 
Sbjct: 245 N-----------GKSGEYLIDVKSIQIGAKTVPI----------PYGATKISTLAPYTVF 304

Query: 313 ESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIW 372
           ++S+Y A++  FT  + KI +  AV PFG C+ S        G GVP IDL+L      W
Sbjct: 305 QTSLYKALLTAFTENI-KIAKAPAVKPFGACFYSNG------GRGVPVIDLVLSG-GAKW 364

Query: 373 RIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLG 427
           RI+G+NS+V +N  V+CLGFVDGGV+P+  IVIG  Q+EDNL+EFDL +S+  FSS+LL 
Sbjct: 365 RIYGSNSLVKVNKNVVCLGFVDGGVKPKYPIVIGGFQMEDNLVEFDLEASKFSFSSSLLL 383

BLAST of Cla97C05G086740 vs. TAIR10
Match: AT5G19120.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 183.0 bits (463), Expect = 4.1e-46
Identity = 143/416 (34.38%), Postives = 194/416 (46.63%), Query Frame = 0

Query: 1   MAASTSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKH-PSLQYITHIHQRTPLVPLKL 60
           MA+S+ L+ F         IS +  S     ++ PV K  P+ QY+  I       P+KL
Sbjct: 1   MASSSCLNLFFFSFLSALIISKSQISDSVNGVVFPVVKDLPTGQYLAQIRLGDSPDPVKL 60

Query: 61  TVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGLF 120
            VDL G  +W DC   +VSS+       S+ C  A          S  R    N  C L 
Sbjct: 61  VVDLAGSILWFDCSSRHVSSSRNLISGSSSGCLKAKVGNERVSSSSSSRKD-QNADCELL 120

Query: 121 PGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAG 180
             N   G++  GE+ SDV+SV S        +    + LF C   +LL GLA G  G+ G
Sbjct: 121 VKNDAFGITARGELFSDVMSVGSVT------SPGTVDLLFACTPPWLLRGLASGAQGVMG 180

Query: 181 FGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPL 240
            GR  ISLPSQ AA  +  R+  V LS      GV+ + +    F   V  ++SL YTPL
Sbjct: 181 LGRAQISLPSQLAAETNERRRLTVYLSPLN---GVVSTSSVEEVF--GVAASRSLVYTPL 240

Query: 241 FINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYT 300
                           S  Y I VKSI +N +         K+   G    ++STV PYT
Sbjct: 241 LTG------------SSGNYVINVKSIRVNGE---------KLSVEGPLAVELSTVVPYT 300

Query: 301 VLESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKV 360
           +LESSIY    + +     +   V  VAPFG+C++S            P++DL LQ++ V
Sbjct: 301 ILESSIYKVFAEAYAKAAGEATSVPPVAPFGLCFTS--------DVDFPAVDLALQSEMV 360

Query: 361 IWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGF 416
            WRI G N MV +   V C G VDGG      IV+G  Q+E  +L+FDL +S +GF
Sbjct: 361 RWRIHGKNLMVDVGGGVRCSGIVDGGSSRVNPIVMGGLQLEGFILDFDLGNSMMGF 375

BLAST of Cla97C05G086740 vs. TAIR10
Match: AT5G19110.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 179.9 bits (455), Expect = 3.4e-45
Identity = 141/430 (32.79%), Postives = 212/430 (49.30%), Query Frame = 0

Query: 11  SSILFLLFSISI-AATSFRPKS-LLLPVTKH--PSLQYITHIHQRTPLVPLKLTVDLGGQ 70
           SS+  LL  +SI AA + +  S  LLP+TKH   +L Y T         P+ L +DLG  
Sbjct: 3   SSLTRLLVFLSIFAAIALKSNSQYLLPITKHEPTNLFYTTFNVGSAAKSPVNLLLDLGTN 62

Query: 71  FMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGLFPGNTIIG 130
             W+DC +    S+ +   C+S+ C             S P  GC   +C L+     +G
Sbjct: 63  LTWLDCRKLKSLSSLRLVTCQSSTCK------------SIPGNGCAGKSC-LYKQPNPLG 122

Query: 131 LS--TSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTG 190
            +   +G V  D  S+ +T+G      VSV +F F C     L GL   V G+       
Sbjct: 123 QNPVVTGRVVQDRASLYTTDGGKFLSQVSVRHFTFSCAGEKALQGLPPPVDGVLALSPGS 182

Query: 191 ISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPV 250
            S   Q  +AF+   KF++CL  S       F   G + F+P  + +          NP+
Sbjct: 183 SSFTKQVTSAFNVIPKFSLCLPSSGTGH---FYIAGIHYFIPPFNSSD---------NPI 242

Query: 251 STAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTVLESS 310
                   G  S +Y I VKSI +    + LN  LL       GG K+STV  YTVL++ 
Sbjct: 243 PRTLTPIKGTDSGDYLITVKSIYVGGTALKLNPDLL------TGGAKLSTVVHYTVLQTD 302

Query: 311 IYNAVVKTFTTELSK--IPRVAAVAPFGVCYSSKSF-SSTRLGAGVPSIDLILQNK--KV 370
           IYNA+ ++FT +     I +V +VAPF  C+ S++   +   G  VP I++ L  +  +V
Sbjct: 303 IYNALAQSFTLKAKAMGIAKVPSVAPFKHCFDSRTAGKNLTAGPNVPVIEIGLPGRIGEV 362

Query: 371 IWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTL 430
            W  +GAN++V + + V+CL F+DGG  P+  +VIG HQ++D++LEFD + + L FS +L
Sbjct: 363 KWGFYGANTVVKVKETVMCLAFIDGGKTPKDLMVIGTHQLQDHMLEFDFSGTVLAFSESL 401

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004134154.11.5e-21688.25PREDICTED: basic 7S globulin-like [Cucumis sativus] >KGN57020.1 Xyloglucan-speci... [more]
XP_008438718.11.3e-21587.56PREDICTED: basic 7S globulin-like [Cucumis melo][more]
XP_008438715.11.4e-21488.53PREDICTED: basic 7S globulin-like [Cucumis melo][more]
XP_022936844.17.8e-21385.98basic 7S globulin-like [Cucurbita moschata][more]
XP_023538677.11.3e-21285.98basic 7S globulin-like [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
tr|A0A0A0L515|A0A0A0L515_CUCSA1.0e-21688.25Xyloglucan-specific endoglucanase inhibitor protein OS=Cucumis sativus OX=3659 G... [more]
tr|A0A1S3AWQ8|A0A1S3AWQ8_CUCME8.5e-21687.56basic 7S globulin-like OS=Cucumis melo OX=3656 GN=LOC103483742 PE=3 SV=1[more]
tr|A0A1S3AXR0|A0A1S3AXR0_CUCME9.4e-21588.53basic 7S globulin-like OS=Cucumis melo OX=3656 GN=LOC103483739 PE=3 SV=1[more]
tr|A0A0A0L5N4|A0A0A0L5N4_CUCSA4.4e-21287.79Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G149980 PE=3 SV=1[more]
tr|A0A1S3AXS3|A0A1S3AXS3_CUCME3.0e-19282.33basic 7S globulin-like OS=Cucumis melo OX=3656 GN=LOC103483740 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
sp|P13917|7SB1_SOYBN1.3e-6537.98Basic 7S globulin OS=Glycine max OX=3847 GN=BG PE=1 SV=2[more]
sp|Q8RVH5|7SBG2_SOYBN1.3e-6236.07Basic 7S globulin 2 OS=Glycine max OX=3847 PE=1 SV=1[more]
sp|Q9LNJ3|APF2_ARATH2.5e-1323.99Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
sp|Q9LZL3|PCS1L_ARATH2.5e-1322.90Aspartic proteinase PCS1 OS=Arabidopsis thaliana OX=3702 GN=PCS1 PE=2 SV=1[more]
sp|Q8S9J6|ASPA_ARATH2.8e-1222.96Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At... [more]
Match NameE-valueIdentityDescription
AT1G03220.13.7e-14864.42Eukaryotic aspartyl protease family protein[more]
AT1G03230.15.2e-14261.78Eukaryotic aspartyl protease family protein[more]
AT5G19100.11.1e-4835.29Eukaryotic aspartyl protease family protein[more]
AT5G19120.14.1e-4634.38Eukaryotic aspartyl protease family protein[more]
AT5G19110.13.4e-4532.79Eukaryotic aspartyl protease family protein[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
Vocabulary: INTERPRO
TermDefinition
IPR033868Xylanase_inhibitor_I-like
IPR033121PEPTIDASE_A1
IPR001461Aspartic_peptidase_A1
IPR032861TAXi_N
IPR032799TAXi_C
IPR021109Peptidase_aspartic_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C05G086740.1Cla97C05G086740.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 20..219
e-value: 9.8E-49
score: 168.1
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 220..433
e-value: 6.4E-66
score: 223.6
IPR021109Aspartic peptidase domain superfamilySUPERFAMILYSSF50630Acid proteasescoord: 38..426
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 258..416
e-value: 2.1E-56
score: 190.1
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 44..218
e-value: 3.4E-42
score: 144.6
NoneNo IPR availablePANTHERPTHR13683:SF532EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEINcoord: 6..433
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 6..433
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 44..416
score: 18.928
IPR033868Xylanase inhibitor I-likeCDDcd05489xylanase_inhibitor_I_likecoord: 49..419
e-value: 5.16445E-145
score: 420.219

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None