Csor.00g033420 (gene) Silver-seed gourd (wild; sororia) v1

Overview
NameCsor.00g033420
Typegene
OrganismCucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
Descriptionaspartic proteinase-like protein 2
LocationCsor_Chr07: 4125286 .. 4129812 (+)
RNA-Seq ExpressionCsor.00g033420
SyntenyCsor.00g033420
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSinitialstart_codonpolypeptideintroninternalterminalstop_codon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCGCCTCTTCTTTTGCCTCATCTCCTTCTTACTCTCCTTCTTCCTCACCGGCACGGTCCCCGTCTCCGCCGCCGCCGCCTTCTCCGCCCACCATTTCCCTCTCCACAGAGCCTTCCCCCACCCTCCCACTCCCCATTTTCATTCCCTCAGAGCTCGCGACCGTCTCCGCCATTCCCGCGTCTTGCGCCGATTGCGTGGTGGCATCGTCGACTTCTCCGTTAAAGGCTCCTCCGATCAATTCGTCGGGTGCGTCCTCTTCTTTCTTTCAATTCTCTTCGTCTTTGCTAATTTTGTGCGATTGGGTATTCTGTTTTGTTGCAATATCACGTGCCCGATTGTTTTAATTCTTCCATTATCTGCACCACGATCGTTTGTTTTAATATTTCTATATCTATTTATAGAATGTTGGGCGGAGAAATAATTCGTGGATGGTACTTAATGCTACGAAGACAGTTTTCATTTTCTTCTGATTAATTAAGCGAGTTGGGTTGCAGTTAGTTTTGTTAATCATCAACTCTATCAATATTCTTCGTAAAATTATGAGAAGAACTCTCTTATCAAAAAAAAAAAAAATTATGAGAAGAACTCTTTCGATTTTGTTAAAAGATAATTTCAATTGACCTATGTTTAGATAAATTCAATTTATCTTGAATTCTATTTGCCCTTTTTTCTTTGCCTACAAGTTAATGGAATTGAAAGGGTTTACTGCAAAATTTGTACTGTCTGGTGTGGTGTTATTCAACTCCGGGTCTTGGTTTTCATAGGCTTTATTACACCAAAGTGAAGTTGGGAAATCCCCAGAGGGAATTCAACGTGCAGATCGATACTGGGAGTGACATTTTGTGGGTCAATTGCAGTCCTTGCGATGGTTGTCCCCAATCAAGCGGACTTGGAGTAATGTTCTGTTTTTCTTTCATGAACAAAACCTTGCAGAATGGAGATTTCTTGCTTGACCCCTAAGTTTTTATGATGTTTTATCTTTACAGATAGAACTCAACTTGTTTGATACTGCAATGTCGTCGTCTGCTAGGCTTGTTTCCTGCTCTGATCCGATATGTTCTGCAGTTCCGACCACCACAAATCAATGCTTATCCCAGAATGACAATTGCAATTACACCTTCCAGTATCGAGATAGAAGTGCGACGTCGGGCTTTTATGTTACTGATTCAATGTATTTTGATATGATTCTTGGGGAGTCTGTGATTGCAAACTCCTCAGCCGCTATTGTTTTTGGGTAAGTTGATAAATCTTATTTAAACGTTATATTTCTGTGAAAGATTGGATTTGAACTGCTCGATAGTAAACACGGGGGAAGAACACTCTTATCTTAAAGAAATTCCAGTTATTTGGTAGTTTATTTTGCAGATCAGATAATTTTTTGTTTCTAGAGCGAGATATGATTTGGGCTACTTGTGGAGGTTGAAGTTGTTGTGTACTCATAATGTTGTTTACACGTTTGTAATTTCGAAGGAAGAACGACATTCTTGGAGGGAAAATAACTTTTTGATAAATAATTATTAAAACAGTTTTTCCCTTGGCTTGCCTTTTTAACAAAGAAACGAAAACAATTTTTTCTCCGAACCTCACTAACGACCTCTAGTAAGACTTGGTCTTCCTGATGTTCGTCACCTAAGGGCTAAATTCGTTGCAGAATTCAGCTGAAACATCATATTCAAACCATCCGAAGCCATTTTTTACGTCATTGCTCCTTATTATTCTGATTTATTGCCAAGAGTAAAGTATTTGGGTTGTAGTATGTTCTGTGGTTGATAATATACTAGATTCATGTTTATCTAGTTGAAAGAAAACGTTTGCTCTAATTATATACTCTACTTGATCTTTTGATTTCATGTATGTATTTATTCATTTTTGGTTGGGAACTTGAAAGTACTGTTTCATTTGGTTGTCAATGCAGGTGTAGCATATATCAGTATGGGGATTTGACTCGAACAACCGTAGCACTTGATGGAATTTTTGGATTTGGTCGAGGGGAGTTTTCAGTTATATCACAATTATCGTCTCGAGGGATAACACCGAAAGTTTTCTCTCACTGTTTGAAAGGAGGGGAAAATGGAGGGGGTATCTTGGTTCTTGGTGAGATTCTGGAGCCCAGCATTGTTTATAGTCCACTTATTCCATCTCAGTATGTCCTCTATCTTGTGAAGGATTTGGTTATTGCTTCTACTTATTCTTTATAAACCATACATGACTTGCATGAATGACTCCAATAGGATTTGGTTATTGCTCCTCTGTCCTTGGAAATATTCTCGTAATCAATATTTTACGATCACTTAGAAGTCTCTAGTTTAGACCACATGTCATTAAGAGGGATAAGTTGTGTGATCCCACATCGGTTGGAGAGGGGAACGAAACATTCTTTATAAGGGTGTGGAAATCTCTCCCTAGCAGACGCGTTTTCAAAACTTTCAGGAGAAGCCCGAAGAGAACAATATTTGCTAGCGGTGTGCTTGAGTTGTTACAAATAGTATCAAAGCCAGACACCGGGCGGTGTGCTAGCGAGGACGTTGGGCCCCAAGGGGGTGGATTGTGAGATTCCACATAGGTTGGAGAGGGGAACAAAGCATTCTTGATAAGGGTGTTAAAATCTCTCCCTAGCATACACGTTTTAAAAACCTTGAGGGAAAGTTCGAAAGGGAAAGCCCAAAAAGGACAATATCGGCTAGCGGTAGGCTTGGGCTCTTACATAAGCCTTATAATTTTTTAATTGACCTCCACGGTTCATGGCTATTCATTCATCAGGTTATGATTTCTACTTTACCATGGAAACTCAGATTGAAATATTGTACTAGCCTTAAACCCCTTGCCGATTTGTCACTCTTTCAATCCGTTTTTGTCGTCTCTGGCAACAATTTTTTCTTTTTCTGTTTCTTTTCTCCTGTGTTGTTGTCTTGTTAGACATTCATGTGAGATGCATTCCACAGTTTATAGCTATATAGTTATATGTTTACTTATTTACTTTTTAATTGATATCAGGCCGCACTATACCTTAAATCTACAGAGTATTGCAATCAGCGGGCAACCATTTCCAAATCCCACAGTTTTTTCAATATCAAATGCAGGAGGAACCATTATTGACTCTGGAACAACTTTAGCATATCTTGTGGAAGATGTTTACAACTGGATTGTCTCTGTAGTAAGTTTTTTTTTTTTTCCTATTCTTGTATGTTTCCTGTGGTTAGGTTTGATTTTTTTGGGGAGAAGTTGGGACTCCCATTGCTTGCTTGCAATAAATTTCATGATTATAAATTATCATTATCGAAGCGACGGTTCAACTAAACCGTACCAGTTGGTGTTGACTCAACACGTTTACTGATTGTGCTATAGTTTTCCATATATATATATGTATGTATGTATGTATTTTGTTAATGCTTATCTACATGCACTGGTTCTGATTTTACTATCGCTTGCTTGTAGATAACTTCTGCTGTTTCTCAATCAACCACCCCTACAATTTCCAGGGGTAGCCAATGTTATCGAGTCTCGACAAGGTGTTCTCTCTTGTTCGACTTAACCGAATAATATTTGAAATTATGATTGATTTTCGAAGAGAACGGTGCCTTCGTTCTTTTCTTTTTCCAGGAACTAAACGAGTACTTGTTTTTATGTTCAGTGTATCAGAAGTATTTCCTGTGATCAGCTTTAATTTTGAGGGTATTGCATCCATGGTGCTGAAACCTGAAGAATATCTTCAGTTTGACAGCATAGTAAGTTGCTCAACAATTTTGCCAGCATATCATTTCGAATATTTTTGTTCATGCACGACTTATGAGAGTTTGAATATTGAATTATGGCAGTCGTATAACGCTAAGCACCTTAATCTTGCAGGAACCTGCTTTACGGTGCATCGGTTTTCAGAAAGCCGAGGATGGAATAAATATTTTAGGAGGTTAAATTCCAAATTTAGTCTCGAATTTGAAAACCATGGACAAACAGCCTCACACATCTTCATTTTCATTGATGAACACCCTTTCATTTTGTAGATCTTGTTTTGAAAGATAAGATCGTCGTCTATGACTTAGCTCGACAACGAATCGGATGGGCCAATTATGACTGTAAGTTAGACCAACTGTAGAATTCTACATGAAGTATTTCCCTCGAAATCCTTTCTGAATAAAGATAACTGCTGTTCGACAGGTTCATCATCTGTAAACGTTTCTGTAACGTCCGGGAAGGACGTGTTCATCGGTGGACAGCTGAGTGTCAGCAGCTCCTCAAGAAAGCATTTCTATCAGCTGCTCCACATCGTCGTTGTACTACTAATACATTTGAAACTGTTCTGAAGTCCAATAACTCTGAGGATTCCCTTGTGTTCATTATTTGTGCTAGCCACCTTCACCTGGCCAATCTGTATTTTGAATGCTTGGTCGGGTTGGTTATCACCCAAGCTCGAAGAGCAAATGTTTGTGCACCTCCTGCATCTTTCATCTGGCTAAATGCAGGCATCCTTTTCTTCAACCTCGTCGTTCTGCTGCGACCCACAATGAGGTCTGCTATTATAAGTTGA

mRNA sequence

ATGCGCCTCTTCTTTTGCCTCATCTCCTTCTTACTCTCCTTCTTCCTCACCGGCACGGTCCCCGTCTCCGCCGCCGCCGCCTTCTCCGCCCACCATTTCCCTCTCCACAGAGCCTTCCCCCACCCTCCCACTCCCCATTTTCATTCCCTCAGAGCTCGCGACCGTCTCCGCCATTCCCGCGTCTTGCGCCGATTGCGTGGTGGCATCGTCGACTTCTCCGTTAAAGGCTCCTCCGATCAATTCGTCGGGCTTTATTACACCAAAGTGAAGTTGGGAAATCCCCAGAGGGAATTCAACGTGCAGATCGATACTGGGAGTGACATTTTGTGGGTCAATTGCAGTCCTTGCGATGGTTGTCCCCAATCAAGCGGACTTGGAATAGAACTCAACTTGTTTGATACTGCAATGTCGTCGTCTGCTAGGCTTGTTTCCTGCTCTGATCCGATATGTTCTGCAGTTCCGACCACCACAAATCAATGCTTATCCCAGAATGACAATTGCAATTACACCTTCCAGTATCGAGATAGAAGTGCGACGTCGGGCTTTTATGTTACTGATTCAATGTATTTTGATATGATTCTTGGGGAGTCTGTGATTGCAAACTCCTCAGCCGCTATTGTTTTTGGGTGTAGCATATATCAGTATGGGGATTTGACTCGAACAACCGTAGCACTTGATGGAATTTTTGGATTTGGTCGAGGGGAGTTTTCAGTTATATCACAATTATCGTCTCGAGGGATAACACCGAAAGTTTTCTCTCACTGTTTGAAAGGAGGGGAAAATGGAGGGGGTATCTTGGTTCTTGGTGAGATTCTGGAGCCCAGCATTGTTTATAGTCCACTTATTCCATCTCAGCCGCACTATACCTTAAATCTACAGAGTATTGCAATCAGCGGGCAACCATTTCCAAATCCCACAGTTTTTTCAATATCAAATGCAGGAGGAACCATTATTGACTCTGGAACAACTTTAGCATATCTTGTGGAAGATGTTTACAACTGGATTGTCTCTGTAATAACTTCTGCTGTTTCTCAATCAACCACCCCTACAATTTCCAGGGGTAGCCAATGTTATCGAGTCTCGACAAGTGTATCAGAAGTATTTCCTGTGATCAGCTTTAATTTTGAGGGTATTGCATCCATGGTGCTGAAACCTGAAGAATATCTTCAGTTTGACAGCATAGAACCTGCTTTACGGTGCATCGGTTTTCAGAAAGCCGAGGATGGAATAAATATTTTAGGAGATCTTGTTTTGAAAGATAAGATCGTCGTCTATGACTTAGCTCGACAACGAATCGGATGGGCCAATTATGACTGTTCATCATCTGTAAACGTTTCTGTAACGTCCGGGAAGGACGTGTTCATCGGTGGACAGCTGAGTTCCAATAACTCTGAGGATTCCCTTGTGTTCATTATTTGTGCTAGCCACCTTCACCTGGCCAATCTGTATTTTGAATGCTTGGTCGGGTTGGTTATCACCCAAGCTCGAAGAGCAAATGTTTGTGCACCTCCTGCATCTTTCATCTGGCTAAATGCAGGCATCCTTTTCTTCAACCTCGTCGTTCTGCTGCGACCCACAATGAGGTCTGCTATTATAAGTTGA

Coding sequence (CDS)

ATGCGCCTCTTCTTTTGCCTCATCTCCTTCTTACTCTCCTTCTTCCTCACCGGCACGGTCCCCGTCTCCGCCGCCGCCGCCTTCTCCGCCCACCATTTCCCTCTCCACAGAGCCTTCCCCCACCCTCCCACTCCCCATTTTCATTCCCTCAGAGCTCGCGACCGTCTCCGCCATTCCCGCGTCTTGCGCCGATTGCGTGGTGGCATCGTCGACTTCTCCGTTAAAGGCTCCTCCGATCAATTCGTCGGGCTTTATTACACCAAAGTGAAGTTGGGAAATCCCCAGAGGGAATTCAACGTGCAGATCGATACTGGGAGTGACATTTTGTGGGTCAATTGCAGTCCTTGCGATGGTTGTCCCCAATCAAGCGGACTTGGAATAGAACTCAACTTGTTTGATACTGCAATGTCGTCGTCTGCTAGGCTTGTTTCCTGCTCTGATCCGATATGTTCTGCAGTTCCGACCACCACAAATCAATGCTTATCCCAGAATGACAATTGCAATTACACCTTCCAGTATCGAGATAGAAGTGCGACGTCGGGCTTTTATGTTACTGATTCAATGTATTTTGATATGATTCTTGGGGAGTCTGTGATTGCAAACTCCTCAGCCGCTATTGTTTTTGGGTGTAGCATATATCAGTATGGGGATTTGACTCGAACAACCGTAGCACTTGATGGAATTTTTGGATTTGGTCGAGGGGAGTTTTCAGTTATATCACAATTATCGTCTCGAGGGATAACACCGAAAGTTTTCTCTCACTGTTTGAAAGGAGGGGAAAATGGAGGGGGTATCTTGGTTCTTGGTGAGATTCTGGAGCCCAGCATTGTTTATAGTCCACTTATTCCATCTCAGCCGCACTATACCTTAAATCTACAGAGTATTGCAATCAGCGGGCAACCATTTCCAAATCCCACAGTTTTTTCAATATCAAATGCAGGAGGAACCATTATTGACTCTGGAACAACTTTAGCATATCTTGTGGAAGATGTTTACAACTGGATTGTCTCTGTAATAACTTCTGCTGTTTCTCAATCAACCACCCCTACAATTTCCAGGGGTAGCCAATGTTATCGAGTCTCGACAAGTGTATCAGAAGTATTTCCTGTGATCAGCTTTAATTTTGAGGGTATTGCATCCATGGTGCTGAAACCTGAAGAATATCTTCAGTTTGACAGCATAGAACCTGCTTTACGGTGCATCGGTTTTCAGAAAGCCGAGGATGGAATAAATATTTTAGGAGATCTTGTTTTGAAAGATAAGATCGTCGTCTATGACTTAGCTCGACAACGAATCGGATGGGCCAATTATGACTGTTCATCATCTGTAAACGTTTCTGTAACGTCCGGGAAGGACGTGTTCATCGGTGGACAGCTGAGTTCCAATAACTCTGAGGATTCCCTTGTGTTCATTATTTGTGCTAGCCACCTTCACCTGGCCAATCTGTATTTTGAATGCTTGGTCGGGTTGGTTATCACCCAAGCTCGAAGAGCAAATGTTTGTGCACCTCCTGCATCTTTCATCTGGCTAAATGCAGGCATCCTTTTCTTCAACCTCGTCGTTCTGCTGCGACCCACAATGAGGTCTGCTATTATAAGTTGA

Protein sequence

MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSRVLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCPQSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDRSATSGFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQPFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRVSTSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKDKIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIGGQLSSNNSEDSLVFIICASHLHLANLYFECLVGLVITQARRANVCAPPASFIWLNAGILFFNLVVLLRPTMRSAIIS
Homology
BLAST of Csor.00g033420 vs. ExPASy Swiss-Prot
Match: Q4V3D2 (Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1)

HSP 1 Score: 300.8 bits (769), Expect = 3.0e-80
Identity = 162/417 (38.85%), Postives = 250/417 (59.95%), Query Frame = 0

Query: 50  LRARDRLRHSRVLRRLRGGIVDFSVKGSS-DQFVGLYYTKVKLGNPQREFNVQIDTGSDI 109
           L++ D  RH+R+L       +D  + G S    +GLY+TK+KLG+P +E+ VQ+DTGSDI
Sbjct: 47  LKSHDSFRHARMLAN-----IDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDI 106

Query: 110 LWVNCSPCDGCPQSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCN 169
           LWVNC+PC  CP  + LGI L+L+D+  SS+++ V C D  CS +    ++       C+
Sbjct: 107 LWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSFI--MQSETCGAKKPCS 166

Query: 170 YTFQYRDRSATSGFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGI 229
           Y   Y D S + G ++ D++  + + G    A  +  +VFGC   Q G L +T  A+DGI
Sbjct: 167 YHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGI 226

Query: 230 FGFGRGEFSVISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHY 289
            GFG+   S+ISQL++ G T ++FSHCL    NGGGI  +GE+  P +  +P++P+Q HY
Sbjct: 227 MGFGQSNTSIISQLAAGGSTKRIFSHCL-DNMNGGGIFAVGEVESPVVKTTPIVPNQVHY 286

Query: 290 TLNLQSIAISGQPFP-NPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQST 349
            + L+ + + G P    P++ S +  GGTIIDSGTTLAYL +++YN ++  IT A  Q  
Sbjct: 287 NVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKIT-AKQQVK 346

Query: 350 TPTISRGSQCYRVSTSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQK-- 409
              +     C+  +++  + FPV++ +FE    + + P +YL   S+   + C G+Q   
Sbjct: 347 LHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYL--FSLREDMYCFGWQSGG 406

Query: 410 --AEDGINI--LGDLVLKDKIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIGGQ 459
              +DG ++  LGDLVL +K+VVYDL  + IGWA+++CSSS+ V   SG    +G +
Sbjct: 407 MTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSSSIKVKDGSGAAYQLGAE 452

BLAST of Csor.00g033420 vs. ExPASy Swiss-Prot
Match: Q9S9K4 (Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2)

HSP 1 Score: 282.3 bits (721), Expect = 1.1e-74
Identity = 158/419 (37.71%), Postives = 232/419 (55.37%), Query Frame = 0

Query: 51  RARDRLRHSRVLRRLRGGIVDFSVKGSSD-QFVGLYYTKVKLGNPQREFNVQIDTGSDIL 110
           ++ D  RHSR+L       +D  + G S    VGLY+TK+KLG+P +E++VQ+DTGSDIL
Sbjct: 44  KSHDTRRHSRML-----ASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDIL 103

Query: 111 WVNCSPCDGCPQSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDN--- 170
           W+NC PC  CP  + L   L+LFD   SS+++ V C D  CS +        SQ+D+   
Sbjct: 104 WINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFI--------SQSDSCQP 163

Query: 171 ---CNYTFQYRDRSATSGFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTV 230
              C+Y   Y D S + G ++ D +  + + G+         +VFGC   Q G L     
Sbjct: 164 ALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDS 223

Query: 231 ALDGIFGFGRGEFSVISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIP 290
           A+DG+ GFG+   SV+SQL++ G   +VFSHCL     GGGI  +G +  P +  +P++P
Sbjct: 224 AVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL-DNVKGGGIFAVGVVDSPKVKTTPMVP 283

Query: 291 SQPHYTLNLQSIAISGQPFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAV 350
           +Q HY + L  + + G     P   SI   GGTI+DSGTTLAY  + +Y+ ++  I  A 
Sbjct: 284 NQMHYNVMLMGMDVDGTSLDLPR--SIVRNGGTIVDSGTTLAYFPKVLYDSLIETIL-AR 343

Query: 351 SQSTTPTISRGSQCYRVSTSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGF 410
                  +    QC+  ST+V E FP +SF FE    + + P +YL   ++E  L C G+
Sbjct: 344 QPVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYL--FTLEEELYCFGW 403

Query: 411 QKA------EDGINILGDLVLKDKIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIG 457
           Q           + +LGDLVL +K+VVYDL  + IGWA+++CSSS+ +   SG    +G
Sbjct: 404 QAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCSSSIKIKDGSGGVYSVG 443

BLAST of Csor.00g033420 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 137.5 bits (345), Expect = 4.4e-31
Identity = 119/404 (29.46%), Postives = 186/404 (46.04%), Query Frame = 0

Query: 57  RHSRVLRRLRGGIVDFS-VKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSP 116
           R SR L+RL   +   S V+ S     G Y   + +G P + F+  +DTGSD++W  C P
Sbjct: 66  RGSRRLQRLEAMLNGPSGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQP 125

Query: 117 CDGCPQSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRD 176
           C  C   S       +F+   SSS   + CS  +C A+ + T      N+ C YT+ Y D
Sbjct: 126 CTQCFNQS-----TPIFNPQGSSSFSTLPCSSQLCQALSSPT----CSNNFCQYTYGYGD 185

Query: 177 RSATSGFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGE 236
            S T G   T+++ F  +        S   I FGC     G          G+ G GRG 
Sbjct: 186 GSETQGSMGTETLTFGSV--------SIPNITFGCGENNQGFGQGNGA---GLVGMGRGP 245

Query: 237 FSVISQLSSRGITPKVFSHCLKG-GENGGGILVLGEIL--------EPSIVYSPLIPSQP 296
            S+ SQL    +T   FS+C+   G +    L+LG +           +++ S  IP+  
Sbjct: 246 LSLPSQLD---VTK--FSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFY 305

Query: 297 HYTLNLQSIAISGQPFPNPTVFSISN---AGGTIIDSGTTLAYLVEDVYNWIVSVITSAV 356
           + TLN  S+  +  P  +P+ F++++    GG IIDSGTTL Y V + Y    SV    +
Sbjct: 306 YITLNGLSVGSTRLPI-DPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQ---SVRQEFI 365

Query: 357 SQSTTPTISRGSQ----CYRVSTSVSEV-FPVISFNFEGIASMVLKPEEYLQFDSIEPAL 416
           SQ   P ++  S     C++  +  S +  P    +F+G   + L  E Y  F S    L
Sbjct: 366 SQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDG-GDLELPSENY--FISPSNGL 425

Query: 417 RCIGFQKAEDGINILGDLVLKDKIVVYDLARQRIGWANYDCSSS 443
            C+    +  G++I G++  ++ +VVYD     + +A+  C +S
Sbjct: 426 ICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQCGAS 437

BLAST of Csor.00g033420 vs. ExPASy Swiss-Prot
Match: Q3EBM5 (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 135.6 bits (340), Expect = 1.7e-30
Identity = 131/463 (28.29%), Postives = 203/463 (43.84%), Query Frame = 0

Query: 11  LLSFFLTGTVPVSAAA---AFSAHHFPLHRAFPHPPTPHFHSLRARDR-----LRHSRVL 70
           LL FFL  +V +S++     FS     +HR  P  P  +   +   DR     LR     
Sbjct: 6   LLCFFLFFSVTLSSSGHPKNFSVE--LIHRDSPLSPI-YNPQITVTDRLNAAFLRSVSRS 65

Query: 71  RRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCPQS 130
           RR    +    ++       G ++  + +G P  +     DTGSD+ WV C PC  C + 
Sbjct: 66  RRFNHQLSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKE 125

Query: 131 SGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDRSATSGF 190
           +G      +FD   SS+ +   C    C A+ +T   C   N+ C Y + Y D+S + G 
Sbjct: 126 NG-----PIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGD 185

Query: 191 YVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVISQL 250
             T+++  D   G  V   S    VFGC     G    T     GI G G G  S+ISQL
Sbjct: 186 VATETVSIDSASGSPV---SFPGTVFGCGYNNGGTFDETG---SGIIGLGGGHLSLISQL 245

Query: 251 SSRGITPKVFSHCL---KGGENGGGILVLGEILEPS-------IVYSPLIPSQP--HYTL 310
            S     K FS+CL       NG  ++ LG    PS       +V +PL+  +P  +Y L
Sbjct: 246 GSS--ISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYL 305

Query: 311 NLQSIAISGQPFP------NPTVFSI--SNAGGTIIDSGTTLAYLVEDVYNWIVSVITSA 370
            L++I++  +  P      NP    I    +G  IIDSGTTL  L    ++   S +  +
Sbjct: 306 TLEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEES 365

Query: 371 VSQSTTPTISRG--SQCYRVSTSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRC 430
           V+ +   +  +G  S C++ S S     P I+ +F G A + L P     F  +   + C
Sbjct: 366 VTGAKRVSDPQGLLSHCFK-SGSAEIGLPEITVHFTG-ADVRLSPIN--AFVKLSEDMVC 425

Query: 431 IGFQKAEDGINILGDLVLKDKIVVYDLARQRIGWANYDCSSSV 444
           +      + + I G+    D +V YDL  + + + + DCS+++
Sbjct: 426 LSMVPTTE-VAIYGNFAQMDFLVGYDLETRTVSFQHMDCSANL 447

BLAST of Csor.00g033420 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 133.7 bits (335), Expect = 6.4e-30
Identity = 107/374 (28.61%), Postives = 165/374 (44.12%), Query Frame = 0

Query: 83  GLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCPQSSGLGIELNLFDTAMSSSARL 142
           G Y   V +G P   F+  +DTGSD++W  C PC  C           +F+   SSS   
Sbjct: 94  GEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQC-----FSQPTPIFNPQDSSSFST 153

Query: 143 VSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDRSATSGFYVTDSMYFDMILGESVIANS 202
           + C    C  +P+ T      N+ C YT+ Y D S T G+  T++  F+         +S
Sbjct: 154 LPCESQYCQDLPSET----CNNNECQYTYGYGDGSTTQGYMATETFTFE--------TSS 213

Query: 203 SAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVISQLSSRGITPKVFSHCLKG-GEN 262
              I FGC     G          G+ G G G  S+ SQL   G+    FS+C+   G +
Sbjct: 214 VPNIAFGCGEDNQGFGQGNGA---GLIGMGWGPLSLPSQL---GVGQ--FSYCMTSYGSS 273

Query: 263 GGGILVLGEIL--------EPSIVYSPLIPSQPHYTLNLQSIAISGQPFPNP-TVFSISN 322
               L LG             ++++S L P+  +Y + LQ I + G     P + F + +
Sbjct: 274 SPSTLALGSAASGVPEGSPSTTLIHSSLNPT--YYYITLQGITVGGDNLGIPSSTFQLQD 333

Query: 323 --AGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRG-SQCYRVSTSVSEV-F 382
              GG IIDSGTTL YL +D YN +    T  ++  T    S G S C++  +  S V  
Sbjct: 334 DGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQV 393

Query: 383 PVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKDKIVVYDLA 442
           P IS  F+G   ++   E+ +     E  +       ++ GI+I G++  ++  V+YDL 
Sbjct: 394 PEISMQFDG--GVLNLGEQNILISPAEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQ 438

BLAST of Csor.00g033420 vs. NCBI nr
Match: KAG6595193.1 (Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1058 bits (2737), Expect = 0.0
Identity = 533/533 (100.00%), Postives = 533/533 (100.00%), Query Frame = 0

Query: 1   MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR 60
           MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR
Sbjct: 1   MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR 60

Query: 61  VLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP 120
           VLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP
Sbjct: 61  VLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP 120

Query: 121 QSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDRSATS 180
           QSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDRSATS
Sbjct: 121 QSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDRSATS 180

Query: 181 GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS 240
           GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS
Sbjct: 181 GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS 240

Query: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ 300
           QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ
Sbjct: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ 300

Query: 301 PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV 360
           PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV
Sbjct: 301 PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV 360

Query: 361 STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD 420
           STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD
Sbjct: 361 STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD 420

Query: 421 KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIGGQLSSNNSEDSLVFIICASHLHLA 480
           KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIGGQLSSNNSEDSLVFIICASHLHLA
Sbjct: 421 KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIGGQLSSNNSEDSLVFIICASHLHLA 480

Query: 481 NLYFECLVGLVITQARRANVCAPPASFIWLNAGILFFNLVVLLRPTMRSAIIS 533
           NLYFECLVGLVITQARRANVCAPPASFIWLNAGILFFNLVVLLRPTMRSAIIS
Sbjct: 481 NLYFECLVGLVITQARRANVCAPPASFIWLNAGILFFNLVVLLRPTMRSAIIS 533

BLAST of Csor.00g033420 vs. NCBI nr
Match: KAG7027232.1 (Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 912 bits (2358), Expect = 0.0
Identity = 459/464 (98.92%), Postives = 461/464 (99.35%), Query Frame = 0

Query: 1   MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR 60
           MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR
Sbjct: 1   MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR 60

Query: 61  VLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP 120
           VLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP
Sbjct: 61  VLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP 120

Query: 121 QSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDRSATS 180
           QSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPT TNQCLSQNDNCNYTFQYRDRSATS
Sbjct: 121 QSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTITNQCLSQNDNCNYTFQYRDRSATS 180

Query: 181 GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS 240
           GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS
Sbjct: 181 GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS 240

Query: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ 300
           QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ
Sbjct: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ 300

Query: 301 PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV 360
           PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV
Sbjct: 301 PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV 360

Query: 361 STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD 420
           STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD
Sbjct: 361 STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD 420

Query: 421 KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIGGQLSSNNS 464
           KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFI GQLS ++S
Sbjct: 421 KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIDGQLSVSSS 464

BLAST of Csor.00g033420 vs. NCBI nr
Match: XP_022962843.1 (aspartic proteinase-like protein 2 [Cucurbita moschata])

HSP 1 Score: 912 bits (2356), Expect = 0.0
Identity = 458/464 (98.71%), Postives = 461/464 (99.35%), Query Frame = 0

Query: 1   MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR 60
           MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR
Sbjct: 1   MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR 60

Query: 61  VLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP 120
           VLRRLRGGIVDFSVKGSS+QFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP
Sbjct: 61  VLRRLRGGIVDFSVKGSSEQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP 120

Query: 121 QSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDRSATS 180
           QSSGLGIELNLFDTAMSSSARLVSCSDPICSA PTTTNQCLSQNDNCNYTFQYRDRSATS
Sbjct: 121 QSSGLGIELNLFDTAMSSSARLVSCSDPICSAAPTTTNQCLSQNDNCNYTFQYRDRSATS 180

Query: 181 GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS 240
           GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS
Sbjct: 181 GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS 240

Query: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ 300
           QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ
Sbjct: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ 300

Query: 301 PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV 360
           PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV
Sbjct: 301 PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV 360

Query: 361 STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD 420
           STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD
Sbjct: 361 STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD 420

Query: 421 KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIGGQLSSNNS 464
           KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFI GQLS ++S
Sbjct: 421 KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIDGQLSVSSS 464

BLAST of Csor.00g033420 vs. NCBI nr
Match: XP_023517316.1 (aspartic proteinase-like protein 2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 901 bits (2329), Expect = 0.0
Identity = 456/468 (97.44%), Postives = 460/468 (98.29%), Query Frame = 0

Query: 1   MRLFFCLISFLLSFFLTGTVPVS----AAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRL 60
           MRLFFCLISFL SFFLTGTVPVS    AAAAFSAHHFPLHRAFPH PTPHFHSLRARDRL
Sbjct: 1   MRLFFCLISFLPSFFLTGTVPVSTAAAAAAAFSAHHFPLHRAFPHSPTPHFHSLRARDRL 60

Query: 61  RHSRVLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPC 120
           RHSRVLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPC
Sbjct: 61  RHSRVLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPC 120

Query: 121 DGCPQSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDR 180
           DGCPQSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDR
Sbjct: 121 DGCPQSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDR 180

Query: 181 SATSGFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEF 240
           SATSGFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEF
Sbjct: 181 SATSGFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEF 240

Query: 241 SVISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIA 300
           SVISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIA
Sbjct: 241 SVISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIA 300

Query: 301 ISGQPFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQ 360
           ISGQPFPNPTVFSISNAGGTIIDSGTTLAYLVE+VYNWIVSVITSAVSQSTTPTISRGSQ
Sbjct: 301 ISGQPFPNPTVFSISNAGGTIIDSGTTLAYLVEEVYNWIVSVITSAVSQSTTPTISRGSQ 360

Query: 361 CYRVSTSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDL 420
           CYRVSTSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDL
Sbjct: 361 CYRVSTSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDL 420

Query: 421 VLKDKIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIGGQLSSNNS 464
           VLKDKIVVYDLARQR+GWANYDCSSSVNVSVTSGKDVFI GQLS ++S
Sbjct: 421 VLKDKIVVYDLARQRVGWANYDCSSSVNVSVTSGKDVFIDGQLSVSSS 468

BLAST of Csor.00g033420 vs. NCBI nr
Match: XP_022972768.1 (aspartic proteinase-like protein 2 [Cucurbita maxima])

HSP 1 Score: 892 bits (2306), Expect = 0.0
Identity = 450/464 (96.98%), Postives = 457/464 (98.49%), Query Frame = 0

Query: 1   MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR 60
           MRLFFCLISFLLSFFLTGTVPVSAAA FSAHHFPLHRA PH PTPHF+SLRARDRLRHSR
Sbjct: 1   MRLFFCLISFLLSFFLTGTVPVSAAA-FSAHHFPLHRALPHSPTPHFYSLRARDRLRHSR 60

Query: 61  VLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP 120
           VLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP
Sbjct: 61  VLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP 120

Query: 121 QSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDRSATS 180
           QSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTT NQCLSQNDNCNYTFQYRDRSATS
Sbjct: 121 QSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTANQCLSQNDNCNYTFQYRDRSATS 180

Query: 181 GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS 240
           GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS
Sbjct: 181 GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS 240

Query: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ 300
           QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ
Sbjct: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ 300

Query: 301 PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV 360
           PFPNPTVFSIS+AGGTIIDSGTTLAYLVE+VYNWIVSVITSAVSQSTTPTISRGSQCYRV
Sbjct: 301 PFPNPTVFSISSAGGTIIDSGTTLAYLVEEVYNWIVSVITSAVSQSTTPTISRGSQCYRV 360

Query: 361 STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD 420
           STS+SEVFPVISF FEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD
Sbjct: 361 STSISEVFPVISFKFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD 420

Query: 421 KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIGGQLSSNNS 464
           KIVVYDLARQR+GWANYDCSSSVNVSVTSGKDVFI GQLS ++S
Sbjct: 421 KIVVYDLARQRVGWANYDCSSSVNVSVTSGKDVFIDGQLSVSSS 463

BLAST of Csor.00g033420 vs. ExPASy TrEMBL
Match: A0A6J1HFZ7 (aspartic proteinase-like protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111463213 PE=3 SV=1)

HSP 1 Score: 912 bits (2356), Expect = 0.0
Identity = 458/464 (98.71%), Postives = 461/464 (99.35%), Query Frame = 0

Query: 1   MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR 60
           MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR
Sbjct: 1   MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR 60

Query: 61  VLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP 120
           VLRRLRGGIVDFSVKGSS+QFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP
Sbjct: 61  VLRRLRGGIVDFSVKGSSEQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP 120

Query: 121 QSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDRSATS 180
           QSSGLGIELNLFDTAMSSSARLVSCSDPICSA PTTTNQCLSQNDNCNYTFQYRDRSATS
Sbjct: 121 QSSGLGIELNLFDTAMSSSARLVSCSDPICSAAPTTTNQCLSQNDNCNYTFQYRDRSATS 180

Query: 181 GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS 240
           GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS
Sbjct: 181 GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS 240

Query: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ 300
           QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ
Sbjct: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ 300

Query: 301 PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV 360
           PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV
Sbjct: 301 PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV 360

Query: 361 STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD 420
           STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD
Sbjct: 361 STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD 420

Query: 421 KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIGGQLSSNNS 464
           KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFI GQLS ++S
Sbjct: 421 KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIDGQLSVSSS 464

BLAST of Csor.00g033420 vs. ExPASy TrEMBL
Match: A0A6J1IB21 (aspartic proteinase-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111471277 PE=3 SV=1)

HSP 1 Score: 892 bits (2306), Expect = 0.0
Identity = 450/464 (96.98%), Postives = 457/464 (98.49%), Query Frame = 0

Query: 1   MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR 60
           MRLFFCLISFLLSFFLTGTVPVSAAA FSAHHFPLHRA PH PTPHF+SLRARDRLRHSR
Sbjct: 1   MRLFFCLISFLLSFFLTGTVPVSAAA-FSAHHFPLHRALPHSPTPHFYSLRARDRLRHSR 60

Query: 61  VLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP 120
           VLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP
Sbjct: 61  VLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP 120

Query: 121 QSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDRSATS 180
           QSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTT NQCLSQNDNCNYTFQYRDRSATS
Sbjct: 121 QSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTANQCLSQNDNCNYTFQYRDRSATS 180

Query: 181 GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS 240
           GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS
Sbjct: 181 GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS 240

Query: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ 300
           QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ
Sbjct: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ 300

Query: 301 PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV 360
           PFPNPTVFSIS+AGGTIIDSGTTLAYLVE+VYNWIVSVITSAVSQSTTPTISRGSQCYRV
Sbjct: 301 PFPNPTVFSISSAGGTIIDSGTTLAYLVEEVYNWIVSVITSAVSQSTTPTISRGSQCYRV 360

Query: 361 STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD 420
           STS+SEVFPVISF FEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD
Sbjct: 361 STSISEVFPVISFKFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD 420

Query: 421 KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIGGQLSSNNS 464
           KIVVYDLARQR+GWANYDCSSSVNVSVTSGKDVFI GQLS ++S
Sbjct: 421 KIVVYDLARQRVGWANYDCSSSVNVSVTSGKDVFIDGQLSVSSS 463

BLAST of Csor.00g033420 vs. ExPASy TrEMBL
Match: A0A1S3CH65 (aspartic proteinase-like protein 2 OS=Cucumis melo OX=3656 GN=LOC103500850 PE=3 SV=1)

HSP 1 Score: 773 bits (1997), Expect = 2.91e-278
Identity = 389/465 (83.66%), Postives = 424/465 (91.18%), Query Frame = 0

Query: 1   MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR 60
           MRLFFC I  L+S      V V+  AA S +HF LHRAFPH P+P FHSL+ARDRLRHSR
Sbjct: 1   MRLFFCFIYALVS---VVAVAVAGTAAISPNHFLLHRAFPHFPSPQFHSLKARDRLRHSR 60

Query: 61  VLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP 120
           +LRRL GGIV+FSVKGSS+ FVGLY+TKVKLGNP+REFNVQIDTGSDILWV CSPCDGCP
Sbjct: 61  LLRRLAGGIVNFSVKGSSNPFVGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCP 120

Query: 121 QSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDRSATS 180
           ++SGLGIELNLFDT  SSSAR++ C+DPIC+AV TTT+QCLSQ D+C+YTF YRDRS TS
Sbjct: 121 ETSGLGIELNLFDTTKSSSARVLPCTDPICAAVSTTTDQCLSQIDHCSYTFHYRDRSGTS 180

Query: 181 GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS 240
           GFYVTDSM+FD++LGES IANSSA IVFGCSIYQYGDLTR T ALDGIFGFGRGEFSVIS
Sbjct: 181 GFYVTDSMHFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGRGEFSVIS 240

Query: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ 300
           QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIA+SGQ
Sbjct: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQ 300

Query: 301 PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV 360
            FPNPT F ISNAG TIIDSGTTLAYLVE+VY+WIVSVITSAVSQS TPTISRGSQC+RV
Sbjct: 301 LFPNPTTFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRV 360

Query: 361 STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD 420
           STSV+E+FPV+SFNFEG+ASMV+ PEEYLQFDSIEPAL CIGFQKAEDG+NILGDLVLKD
Sbjct: 361 STSVAEIFPVLSFNFEGVASMVVTPEEYLQFDSIEPALWCIGFQKAEDGLNILGDLVLKD 420

Query: 421 KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIG-GQLSSNNS 464
           KI+VYDLARQRIGWANYDCSSSVNVSVTSGKDVFI  GQLS ++S
Sbjct: 421 KIIVYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLSVSSS 462

BLAST of Csor.00g033420 vs. ExPASy TrEMBL
Match: A0A5D3C7C6 (Aspartic proteinase-like protein 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold202G00740 PE=3 SV=1)

HSP 1 Score: 768 bits (1984), Expect = 3.09e-276
Identity = 387/465 (83.23%), Postives = 422/465 (90.75%), Query Frame = 0

Query: 1   MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR 60
           MRLFFC I  L+S  +   V V+  AA   +HF LHRAFPH P+P FHSL+ARDRLRHSR
Sbjct: 1   MRLFFCFIYALVSV-VAVAVAVAGTAAIFPNHFLLHRAFPHFPSPQFHSLKARDRLRHSR 60

Query: 61  VLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP 120
           +LRRL GGIV+FSVKGSS+ FVGLY+TKVKLGNP+REFNVQIDTGSDILWV CSPCDGCP
Sbjct: 61  LLRRLAGGIVNFSVKGSSNPFVGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCP 120

Query: 121 QSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDRSATS 180
           ++SGLGIELNLFDT  SSSAR++ C+DPIC+AV TT +QCLSQ D+C+YTF YRDRS TS
Sbjct: 121 ETSGLGIELNLFDTTKSSSARVLPCTDPICAAVSTTADQCLSQIDHCSYTFHYRDRSGTS 180

Query: 181 GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS 240
           GFYVTDSM+FD++LGES IANSSA IVFGCSIYQYGDLTR T ALDGIFGFG+GEFSVIS
Sbjct: 181 GFYVTDSMHFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVIS 240

Query: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ 300
           QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIA+SGQ
Sbjct: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQ 300

Query: 301 PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV 360
            FPNPT+F ISNAG TIIDSGTTLAYLVE+VY+WIVSVITSAVSQS TPTISRGSQC+RV
Sbjct: 301 LFPNPTMFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRV 360

Query: 361 STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD 420
           STSV+E+FPV+SFNFEGIASMV+ PEEYLQFDSIEPAL CIGFQKAEDG+NILGDLVLKD
Sbjct: 361 STSVAEIFPVLSFNFEGIASMVVTPEEYLQFDSIEPALWCIGFQKAEDGLNILGDLVLKD 420

Query: 421 KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIG-GQLSSNNS 464
           KI+VYDLARQRIGWANYDCSSSVNVSVTSGKDVFI  GQLS   S
Sbjct: 421 KIIVYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLSGKRS 464

BLAST of Csor.00g033420 vs. ExPASy TrEMBL
Match: A0A0A0KF78 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G448710 PE=3 SV=1)

HSP 1 Score: 758 bits (1958), Expect = 2.69e-272
Identity = 384/468 (82.05%), Postives = 419/468 (89.53%), Query Frame = 0

Query: 1   MRLFFCLISFLLSFF---LTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLR 60
           MRLFFC I  L S     L GT  +S       +HF LHRAFPH P+PHFHSL+ARDRLR
Sbjct: 1   MRLFFCFIYALASVVALTLAGTAVISPGP----NHFLLHRAFPHFPSPHFHSLKARDRLR 60

Query: 61  HSRVLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCD 120
           HSR+LRRL GGIV+FSVKGSS+ FVGLY+TKVKLGNP REFNVQIDTGSDILWV CSPCD
Sbjct: 61  HSRLLRRLAGGIVNFSVKGSSNPFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCD 120

Query: 121 GCPQSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDRS 180
           GCP SSGLGIELNLFDT  SSSAR++ C+DPIC+AV TTT+QCL+Q D+C+Y+F YRDRS
Sbjct: 121 GCPDSSGLGIELNLFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRS 180

Query: 181 ATSGFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFS 240
            TSGFYVTDSM+FD++LGES IANSSA IVFGCSIYQYGDLTR T ALDGIFGFG+GEFS
Sbjct: 181 GTSGFYVTDSMHFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFS 240

Query: 241 VISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAI 300
           VISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTL LQSIA+
Sbjct: 241 VISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLKLQSIAL 300

Query: 301 SGQPFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQC 360
           SGQ FPNPT+F ISNAG TIIDSGTTLAYLVE+VY+WIVSVITSAVSQS TPTISRGSQC
Sbjct: 301 SGQLFPNPTMFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQC 360

Query: 361 YRVSTSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLV 420
           +RVS SV+++FPV+ FNFEGIASMV+ PEEYLQFDSIEPAL CIGFQKAEDG+NILGDLV
Sbjct: 361 FRVSMSVADIFPVLRFNFEGIASMVVTPEEYLQFDSIEPALWCIGFQKAEDGLNILGDLV 420

Query: 421 LKDKIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIG-GQLSSNNS 464
           LKDKI+VYDLARQRIGWANYDCSSSVNVSVTSGKDVFI  GQLS ++S
Sbjct: 421 LKDKIIVYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLSVSSS 464

BLAST of Csor.00g033420 vs. TAIR 10
Match: AT2G36670.2 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 526.9 bits (1356), Expect = 1.8e-149
Identity = 281/496 (56.65%), Postives = 352/496 (70.97%), Query Frame = 0

Query: 8   ISFLLSFFLTGTVPVSAA--AAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSRVL--- 67
           ++  ++ F    +P + A  AA      PL RAFP         LRARDR+RH+R+L   
Sbjct: 15  VALAVTGFAASPLPSAYAKYAAGPTKILPLQRAFPLDELVELSELRARDRVRHARILLGG 74

Query: 68  --RRLRGGIVDFSVKGSSDQF-VGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGC 127
             +   GG+VDF V+GSSD + VGLY+TKVKLG+P  EFNVQIDTGSDILWV CS C  C
Sbjct: 75  GRQSSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNC 134

Query: 128 PQSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDRSAT 187
           P SSGLGI+L+ FD   S +A  V+CSDPICS+V  TT    S+N+ C Y+F+Y D S T
Sbjct: 135 PHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGT 194

Query: 188 SGFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVI 247
           SG+Y+TD+ YFD ILGES++ANSSA IVFGCS YQ GDLT++  A+DGIFGFG+G+ SV+
Sbjct: 195 SGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVV 254

Query: 248 SQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISG 307
           SQLSSRGITP VFSHCLKG  +GGG+ VLGEIL P +VYSPL+PSQPHY LNL SI ++G
Sbjct: 255 SQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNG 314

Query: 308 QPFP-NPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCY 367
           Q  P +  VF  SN  GTI+D+GTTL YLV++ Y+  ++ I+++VSQ  TP IS G QCY
Sbjct: 315 QMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQCY 374

Query: 368 RVSTSVSEVFPVISFNFEGIASMVLKPEEYLQFDSI--EPALRCIGFQKAEDGINILGDL 427
            VSTS+S++FP +S NF G ASM+L+P++YL    I    ++ CIGFQKA +   ILGDL
Sbjct: 375 LVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDL 434

Query: 428 VLKDKIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIGGQLSSNNSEDSLVFIICASH 487
           VLKDK+ VYDLARQRIGWA+YDCS SVNVS+TSGKD+   GQ   N S   +        
Sbjct: 435 VLKDKVFVYDLARQRIGWASYDCSMSVNVSITSGKDIVNSGQPCLNISTRDI-------- 494

Query: 488 LHLANLYFECLVGLVI 493
             L  L+F  L GL++
Sbjct: 495 --LIRLFFSILFGLLL 500

BLAST of Csor.00g033420 vs. TAIR 10
Match: AT2G36670.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 520.8 bits (1340), Expect = 1.3e-147
Identity = 281/501 (56.09%), Postives = 352/501 (70.26%), Query Frame = 0

Query: 8   ISFLLSFFLTGTVPVSAA--AAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSRVL--- 67
           ++  ++ F    +P + A  AA      PL RAFP         LRARDR+RH+R+L   
Sbjct: 15  VALAVTGFAASPLPSAYAKYAAGPTKILPLQRAFPLDELVELSELRARDRVRHARILLGG 74

Query: 68  --RRLRGGIVDFSVKGSSDQF-VG-----LYYTKVKLGNPQREFNVQIDTGSDILWVNCS 127
             +   GG+VDF V+GSSD + VG     LY+TKVKLG+P  EFNVQIDTGSDILWV CS
Sbjct: 75  GRQSSVGGVVDFPVQGSSDPYLVGSKMTMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCS 134

Query: 128 PCDGCPQSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYR 187
            C  CP SSGLGI+L+ FD   S +A  V+CSDPICS+V  TT    S+N+ C Y+F+Y 
Sbjct: 135 SCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQCSENNQCGYSFRYG 194

Query: 188 DRSATSGFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRG 247
           D S TSG+Y+TD+ YFD ILGES++ANSSA IVFGCS YQ GDLT++  A+DGIFGFG+G
Sbjct: 195 DGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKG 254

Query: 248 EFSVISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQS 307
           + SV+SQLSSRGITP VFSHCLKG  +GGG+ VLGEIL P +VYSPL+PSQPHY LNL S
Sbjct: 255 KLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLS 314

Query: 308 IAISGQPFP-NPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISR 367
           I ++GQ  P +  VF  SN  GTI+D+GTTL YLV++ Y+  ++ I+++VSQ  TP IS 
Sbjct: 315 IGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISN 374

Query: 368 GSQCYRVSTSVSEVFPVISFNFEGIASMVLKPEEYLQFDSI--EPALRCIGFQKAEDGIN 427
           G QCY VSTS+S++FP +S NF G ASM+L+P++YL    I    ++ CIGFQKA +   
Sbjct: 375 GEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQT 434

Query: 428 ILGDLVLKDKIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIGGQLSSNNSEDSLVFI 487
           ILGDLVLKDK+ VYDLARQRIGWA+YDCS SVNVS+TSGKD+   GQ   N S   +   
Sbjct: 435 ILGDLVLKDKVFVYDLARQRIGWASYDCSMSVNVSITSGKDIVNSGQPCLNISTRDI--- 494

Query: 488 ICASHLHLANLYFECLVGLVI 493
                  L  L+F  L GL++
Sbjct: 495 -------LIRLFFSILFGLLL 505

BLAST of Csor.00g033420 vs. TAIR 10
Match: AT5G22850.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 483.4 bits (1243), Expect = 2.3e-136
Identity = 256/450 (56.89%), Postives = 327/450 (72.67%), Query Frame = 0

Query: 26  AAFSAHHFP----LHRAFPHPPTPHFHSLRARDRLRHSRVLRRLRGGIVDFSVKGSSDQF 85
           AA  ++ FP    L R  P         L+ARD  RH R+L+ L GG++DF V G+ D F
Sbjct: 18  AAVLSYGFPAALKLERVIPANHEMELSQLKARDEARHGRLLQSL-GGVIDFPVDGTFDPF 77

Query: 86  -VGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCPQSSGLGIELNLFDTAMSSSA 145
            VGLYYTK++LG P R+F VQ+DTGSD+LWV+C+ C+GCPQ+SGL I+LN FD   S +A
Sbjct: 78  VVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTA 137

Query: 146 RLVSCSDPICS-AVPTTTNQCLSQNDNCNYTFQYRDRSATSGFYVTDSMYFDMILGESVI 205
             +SCSD  CS  + ++ + C  QN+ C YTFQY D S TSGFYV+D + FDMI+G S++
Sbjct: 138 SPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLV 197

Query: 206 ANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVISQLSSRGITPKVFSHCLKGG 265
            NS+A +VFGCS  Q GDL ++  A+DGIFGFG+   SVISQL+S+GI P+VFSHCLKG 
Sbjct: 198 PNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGE 257

Query: 266 ENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQPFP-NPTVFSISNAGGTII 325
             GGGILVLGEI+EP++V++PL+PSQPHY +NL SI+++GQ  P NP+VFS SN  GTII
Sbjct: 258 NGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTII 317

Query: 326 DSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRVSTSVSEVFPVISFNFEGI 385
           D+GTTLAYL E  Y   V  IT+AVSQS  P +S+G+QCY ++TSV ++FP +S NF G 
Sbjct: 318 DTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGG 377

Query: 386 ASMVLKPEEYL--QFDSIEPALRCIGFQKAED-GINILGDLVLKDKIVVYDLARQRIGWA 445
           ASM L P++YL  Q +    A+ CIGFQ+ ++ GI ILGDLVLKDKI VYDL  QRIGWA
Sbjct: 378 ASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLVGQRIGWA 437

Query: 446 NYDCSSSVNVSVT--SGKDVFI-GGQLSSN 463
           NYDCS+SVNVS T  SG+  ++  GQ S N
Sbjct: 438 NYDCSTSVNVSATSSSGRSEYVNAGQFSEN 466

BLAST of Csor.00g033420 vs. TAIR 10
Match: AT1G08210.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 446.4 bits (1147), Expect = 3.1e-125
Identity = 238/438 (54.34%), Postives = 304/438 (69.41%), Query Frame = 0

Query: 35  LHRAFPHPPTPHFHSLRARDRLRHSRVLRRLRGGIVDFSVKGSSDQF-VGLYYTKVKLGN 94
           L R  P         LRA D  RH R+L+   GG+V+F V G+SD F VGLYYTKVKLG 
Sbjct: 33  LERLIPPNHELGLTELRAFDSARHGRLLQSPVGGVVNFPVDGASDPFLVGLYYTKVKLGT 92

Query: 95  PQREFNVQIDTGSDILWVNCSPCDGCPQSSGLGIELNLFDTAMSSSARLVSCSDPICSAV 154
           P REFNVQIDTGSD+LWV+C+ C+GCP++S L I+L+ FD  +SSSA LVSCSD  C + 
Sbjct: 93  PPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYSN 152

Query: 155 PTTTNQCLSQNDNCNYTFQYRDRSATSGFYVTDSMYFDMILGESVIANSSAAIVFGCSIY 214
             T + C S N+ C+Y+F+Y D S TSG+Y++D M FD ++  ++  NSSA  VFGCS  
Sbjct: 153 FQTESGC-SPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINSSAPFVFGCSNL 212

Query: 215 QYGDLTRTTVALDGIFGFGRGEFSVISQLSSRGITPKVFSHCLKGGENGGGILVLGEILE 274
           Q GDL R   A+DGIFG G+G  SVISQL+ +G+ P+VFSHCLKG ++GGGI+VLG+I  
Sbjct: 213 QSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGGIMVLGQIKR 272

Query: 275 PSIVYSPLIPSQPHYTLNLQSIAISGQPFP-NPTVFSISNAGGTIIDSGTTLAYLVEDVY 334
           P  VY+PL+PSQPHY +NLQSIA++GQ  P +P+VF+I+   GTIID+GTTLAYL ++ Y
Sbjct: 273 PDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAY 332

Query: 335 NWIVSVITSAVSQSTTPTISRGSQCYRVSTSVSEVFPVISFNFEGIASMVLKPEEYLQ-F 394
           +  +  + +AVSQ   P      QC+ ++    +VFP +S +F G ASMVL P  YLQ F
Sbjct: 333 SPFIQAVANAVSQYGRPITYESYQCFEITAGDVDVFPQVSLSFAGGASMVLGPRAYLQIF 392

Query: 395 DSIEPALRCIGFQK-AEDGINILGDLVLKDKIVVYDLARQRIGWANYDCSSSVNVSVTSG 454
            S   ++ CIGFQ+ +   I ILGDLVLKDK+VVYDL RQRIGWA YDCS  VNVS + G
Sbjct: 393 SSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCSLEVNVSASRG 452

Query: 455 ---KDVFIGGQLSSNNSE 466
              KDV   GQ   + SE
Sbjct: 453 GRSKDVINTGQWRESGSE 469

BLAST of Csor.00g033420 vs. TAIR 10
Match: AT5G36260.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 300.8 bits (769), Expect = 2.1e-81
Identity = 162/417 (38.85%), Postives = 250/417 (59.95%), Query Frame = 0

Query: 50  LRARDRLRHSRVLRRLRGGIVDFSVKGSS-DQFVGLYYTKVKLGNPQREFNVQIDTGSDI 109
           L++ D  RH+R+L       +D  + G S    +GLY+TK+KLG+P +E+ VQ+DTGSDI
Sbjct: 47  LKSHDSFRHARMLAN-----IDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDI 106

Query: 110 LWVNCSPCDGCPQSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCN 169
           LWVNC+PC  CP  + LGI L+L+D+  SS+++ V C D  CS +    ++       C+
Sbjct: 107 LWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSFI--MQSETCGAKKPCS 166

Query: 170 YTFQYRDRSATSGFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGI 229
           Y   Y D S + G ++ D++  + + G    A  +  +VFGC   Q G L +T  A+DGI
Sbjct: 167 YHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGI 226

Query: 230 FGFGRGEFSVISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHY 289
            GFG+   S+ISQL++ G T ++FSHCL    NGGGI  +GE+  P +  +P++P+Q HY
Sbjct: 227 MGFGQSNTSIISQLAAGGSTKRIFSHCL-DNMNGGGIFAVGEVESPVVKTTPIVPNQVHY 286

Query: 290 TLNLQSIAISGQPFP-NPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQST 349
            + L+ + + G P    P++ S +  GGTIIDSGTTLAYL +++YN ++  IT A  Q  
Sbjct: 287 NVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKIT-AKQQVK 346

Query: 350 TPTISRGSQCYRVSTSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQK-- 409
              +     C+  +++  + FPV++ +FE    + + P +YL   S+   + C G+Q   
Sbjct: 347 LHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYL--FSLREDMYCFGWQSGG 406

Query: 410 --AEDGINI--LGDLVLKDKIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIGGQ 459
              +DG ++  LGDLVL +K+VVYDL  + IGWA+++CSSS+ V   SG    +G +
Sbjct: 407 MTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSSSIKVKDGSGAAYQLGAE 452

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q4V3D23.0e-8038.85Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1[more]
Q9S9K41.1e-7437.71Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2[more]
Q766C34.4e-3129.46Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q3EBM51.7e-3028.29Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561... [more]
Q766C26.4e-3028.61Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Match NameE-valueIdentityDescription
KAG6595193.10.0100.00Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. soror... [more]
KAG7027232.10.098.92Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. argyr... [more]
XP_022962843.10.098.71aspartic proteinase-like protein 2 [Cucurbita moschata][more]
XP_023517316.10.097.44aspartic proteinase-like protein 2 [Cucurbita pepo subsp. pepo][more]
XP_022972768.10.096.98aspartic proteinase-like protein 2 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1HFZ70.098.71aspartic proteinase-like protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111463213... [more]
A0A6J1IB210.096.98aspartic proteinase-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111471277 P... [more]
A0A1S3CH652.91e-27883.66aspartic proteinase-like protein 2 OS=Cucumis melo OX=3656 GN=LOC103500850 PE=3 ... [more]
A0A5D3C7C63.09e-27683.23Aspartic proteinase-like protein 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E56... [more]
A0A0A0KF782.69e-27282.05Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G44871... [more]
Match NameE-valueIdentityDescription
AT2G36670.21.8e-14956.65Eukaryotic aspartyl protease family protein [more]
AT2G36670.11.3e-14756.09Eukaryotic aspartyl protease family protein [more]
AT5G22850.12.3e-13656.89Eukaryotic aspartyl protease family protein [more]
AT1G08210.13.1e-12554.34Eukaryotic aspartyl protease family protein [more]
AT5G36260.12.1e-8138.85Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Silver-seed gourd (sororia) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 91..111
score: 48.95
coord: 316..327
score: 42.62
coord: 411..426
score: 39.97
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 18..458
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 70..269
e-value: 9.2E-48
score: 164.8
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 270..447
e-value: 8.2E-40
score: 138.2
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 80..443
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 287..435
e-value: 1.4E-22
score: 80.2
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 85..269
e-value: 1.7E-41
score: 142.4
NoneNo IPR availablePANTHERPTHR13683:SF851EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEINcoord: 18..458
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 85..435
score: 41.41618
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 85..439
e-value: 1.12864E-56
score: 188.626

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csor.00g033420.m01Csor.00g033420.m01mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0016020 membrane
molecular_function GO:0004190 aspartic-type endopeptidase activity