Cp4.1LG05g03970.1 (mRNA) Cucurbita pepo (Zucchini)

Name: Cp4.1LG05g03970.1
Type: mRNA
Organism: Cucurbita pepo (Zucchini)
Description: Subtilisin-like serine protease
Location: Cp4.1LG05 : 1636417 .. 1639607 (+)
Sequence length: 2424
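
The location and sequence length fields can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the sequence length refers to the spliced transcript; neither convention is stated on this page, and the function name is made up for the example.

```python
# Assumption: coordinates are 1-based and inclusive on both ends.
def span_length(start: int, end: int) -> int:
    """Number of bases covered by an inclusive genomic interval."""
    return end - start + 1

genomic_span = span_length(1636417, 1639607)   # bases covered on Cp4.1LG05
transcript_length = 2424                       # "Sequence length" field

# Assumption: the difference is sequence removed by splicing.
non_transcript_bases = genomic_span - transcript_length

# A length divisible by 3 is consistent with a complete reading frame.
codons, remainder = divmod(transcript_length, 3)

print(genomic_span, non_transcript_bases, codons, remainder)
```

Under these assumptions the interval covers 3191 bases, and 2424 bases correspond to 808 codons, i.e. 807 residues plus a stop codon, which agrees with the query coordinates ending at 807 in the BLAST alignments.
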
The following sequences are available for this feature:

Gene sequence (with intron)

TTTCATACACAATTGCCGGCCAGCCCTGTCTCCCTTCCCCAATATAAACCATCCACTTTGGTGAGTTCACCAAGTGCATCAAGCATTAGCCTTTTCCCTTACACAATGGTTGCTTTCCCTTCTCTTTTTCTTCTTCTATTGCTCAACTTTCATTATGTTGGGGCTTTAGTGACTGAGCTTCCGTTGATCAATCTTCAAACGTACATTGTCCACGTGGAGAAACCAGAGACGACGGATGATCTTGAGAGCTGGCATCGATCGTTCTTACCGNTTTTTATTGTTAAAGCCCAAATTTACATCTATGAGGAATCGTTTTATGGGCTTTTATTGTTAAGCCCAAATTTACATCTATGAGGAATCGTTTGATGGGCTTTTTACTGTTCAGCCCAAATTTACATCCGTGAGGAATCAATTGATGGGCTTTTTATTGTTCAGCTCAAATTTACATCATGATGAATCGCTTGATGGGCTTTTTTTATTGTTAAGCCCAAATTTACATCTATGGGGAATCGCTTGATGGGCTTTTTATTGTTCGGCCCAAATTTACATCCATGAGGAGTCCACGTATTGTTATCTTATTACTCATATTTACGCATGCGCCATTTATAAGCTTTTTTATTAAAAAAAAATCCCGAGTTAAATATTGTTGAGATTTAAAAAAACAAGAGCTAAGTTTCACAGCTAAATGCCGATCATTGTAAGAAGTCGAGTTAGATAACAACAAATTTCAAAGCCTGTAAACTAATACTTCACAAACTTAATAGTCCATTCATGTTGCTATCAGTACGAATCATTCAACTTGACAAGCCAAAACTCCTTGCAGGCCTCACCACAACCTTCCAATTGCTATCTTTTAATGCGTTTGAAAGTTTCATACACAATTGCCGGCCAGCCCTGTCTCCCTTCCCCAATATAAACCATCCACTTTGGTGAGTTCACCAAGTGCATCAAGCATTAGCCTTTTCCCTTACACAATGGTTGCTTTCCCTTCTCTTTTTCTTCTTCTATTGCTCAACTTTCATTATGTTGGGGCTTTAGTGACTGAGCTTCCGTTGATCAATCTTCAAACGTACATTGTCCACGTGGAGAAACCAGAGACGACGGATGATCTTGAGAGCTGGCATCGATCGTTCTTACCGTCGAGTTCGAGTTTGCTATACTCGTATCGAAATGTGATGAGTGGTTTTGCTGCAAGACTAAGTGAAGAACAAGTGAAAGCAATGGAAGAGAAGGATGGTTTTGTGTCAGCAAGGCGTGAAAGGATATTGCAATTGCATACAACGCATACCCCTGATTTTCTTGGTTTGAATCGCCAATTTGGGTTTTGGAAAGATTCAAACTTTGGAAAGGGAGTGATCATAGGAGTGTTGGATGGCGGAATTGCGCCAAGCCATCCTTCATTTGATGATGTGGGAATGCCACCACCGCCACCCAAATGGAAAGGAAGATGCGAGTTTAATTTCTCAGCCTGCAACAACAAGCTTATAGGTGCGAGATCTTTTAATCTCGCAACAAAAGTCTTGAAAGGAGAGACAATGATGGATGACTCTCCTATTGATGAAGATGGTCATGGGACTCATACAGCGAGCACAGCTGCTGGTGCCTTCGTCAAGGGCGCTGAGGCATTGGGAAACGCTAAAGGCACGGCTGTTGGCATGGCCCCTTTAGCTCATCTAGCAATTTACAAAGTTTGCTTTGGAGAAGACTGCCCTGACACCGACATTCTCGCTGCACTTGATGCCGCTATTGAAGACGGCGTCGATGTGCTCTCACTCTCGCTTGGGAGCCCATCGGTTCCATTCTTCCAAGACTTAGTCGCCATAGGTGCATTCGCAGCCATTCAAAAGGGGATTTTTGTGAGTTGCTCAGCTGCTAATTCAGGCCCTTTTAAAGCCACATTGTCCAACGAAGCCCCGTGGATTCTAACGGTTGCAGCCAGCACCATTGATCGAAGAATCAAAGCCGCTGCAAAGCTTGGAAATGGAGAAGAATTTGATGGC
GAATCTCTGTTCCAACCAAGCGATTTCCCACCAACATTTTTGCCACTTGTTTACGCTGGTGAGAAGAATCAAACAGCCGCTTTGTGTGGAGAAGGATCATTGAAAGACATCGACGTAAAGGGAAAAATTGTGGTATGCGAAAGAGGAGGAGGAATTGCAAGAATTGCAAAAGGGACTGAAGTCAAGAACGCAGGGGGCGCCGCCATGATCCTCCTAAACCAACAACAAGATGGGTTCAGTACTGAAGCAGATGCTCACGTTCTTCCGGCAAGCCACGTCAGCCACAAGGCGGCGTTGAAGATAAAAGCCTACATAAACTCAACAACATACCCAACAGCCACAATTCTTTTCAAAGGAACCGTAATTGGCGACGACAACTTTTCACCCGCCATAGCTTCTTTTTCATCTCGAGGTCCCAGCGTTGCAAGCCCTGGAATTTTGAAGCCCGACATAACCGGTCCAGGTGTCAGCATTTTAGCCGCATGGCCATTTCCATTAGACAAAAACAGTAACACAAAATCAACATTCAACATAATTTCAGGAACATCCATGTCCTGTCCTCATCTCAGCGGCATTGCAGCTCTAATCAAAAGCTCTCACCCTGATTGGTCACCGGCCGCCATTAAATCCGCCATAATGACAACCGCTGACATAACAAATCTTGAAGGCCAGCCAATTGTCGACGAAAATTTGCAGCCGGCGGATTTGTTTGCAACTGGCGCTGGTCATGTCAATCCATCAAAAGCAGCCGACCCAGGATTAGTTTATGACATTCAACCCGATGATTACATTCCTTATCTTTGTGGATTGGGATACAAAAGTAACGAAGTTGCAACAATTGCCCGTAAACCAATTAATTGTTTGGCAAAACCAAGCATTCCCGAAGGAGACCTCAACTATCCGTCATTTACGGTCGTTTTAGGACCGCCGCAAACATTTACAAGAACGGTGACAAATGTCGGCTGTGGACGTGAAGTTTATACTGCCGTGGTCGAAGCACCGCCGTCCATTTCTGTAACAATCCGACCAAGTAAGATATTCTTCTCAAAGATTAACGAAAAAGTGACATATTCAGTGACGTTCAAGAGAATTGGTTCGATCAGTCCCTCAACAGAATTTGGTAAAGGATATCTCAAATGGGTTTCCGACAAACACGTCGTTAGAAGTCCGATCTCTTTTAAGTTTGCATGA

mRNA sequence

TTTCATACACAATTGCCGGCCAGCCCTGTCTCCCTTCCCCAATATAAACCATCCACTTTGGTGAGTTCACCAAGTGCATCAAGCATTAGCCTTTTCCCTTACACAATGGTTGCTTTCCCTTCTCTTTTTCTTCTTCTATTGCTCAACTTTCATTATGTTGGGGCTTTAGTGACTGAGCTTCCGTTGATCAATCTTCAAACGTACATTGTCCACGTGGAGAAACCAGAGACGACGGATGATCTTGAGAGCTGGCATCGATCGTTCTTACCGNTTTTTATTCTTCCGTTGATCAATCTTCAAACGTACATTGTCCACGTGGAGAAACCAGAGACGACGGATGATCTTGAGAGCTGGCATCGATCGTTCTTACCGTCGAGTTCGAGTTTGCTATACTCGTATCGAAATGTGATGAGTGGTTTTGCTGCAAGACTAAGTGAAGAACAAGTGAAAGCAATGGAAGAGAAGGATGGTTTTGTGTCAGCAAGGCGTGAAAGGATATTGCAATTGCATACAACGCATACCCCTGATTTTCTTGGTTTGAATCGCCAATTTGGGTTTTGGAAAGATTCAAACTTTGGAAAGGGAGTGATCATAGGAGTGTTGGATGGCGGAATTGCGCCAAGCCATCCTTCATTTGATGATGTGGGAATGCCACCACCGCCACCCAAATGGAAAGGAAGATGCGAGTTTAATTTCTCAGCCTGCAACAACAAGCTTATAGGTGCGAGATCTTTTAATCTCGCAACAAAAGTCTTGAAAGGAGAGACAATGATGGATGACTCTCCTATTGATGAAGATGGTCATGGGACTCATACAGCGAGCACAGCTGCTGGTGCCTTCGTCAAGGGCGCTGAGGCATTGGGAAACGCTAAAGGCACGGCTGTTGGCATGGCCCCTTTAGCTCATCTAGCAATTTACAAAGTTTGCTTTGGAGAAGACTGCCCTGACACCGACATTCTCGCTGCACTTGATGCCGCTATTGAAGACGGCGTCGATGTGCTCTCACTCTCGCTTGGGAGCCCATCGGTTCCATTCTTCCAAGACTTAGTCGCCATAGGTGCATTCGCAGCCATTCAAAAGGGGATTTTTGTGAGTTGCTCAGCTGCTAATTCAGGCCCTTTTAAAGCCACATTGTCCAACGAAGCCCCGTGGATTCTAACGGTTGCAGCCAGCACCATTGATCGAAGAATCAAAGCCGCTGCAAAGCTTGGAAATGGAGAAGAATTTGATGGCGAATCTCTGTTCCAACCAAGCGATTTCCCACCAACATTTTTGCCACTTGTTTACGCTGGTGAGAAGAATCAAACAGCCGCTTTGTGTGGAGAAGGATCATTGAAAGACATCGACGTAAAGGGAAAAATTGTGGTATGCGAAAGAGGAGGAGGAATTGCAAGAATTGCAAAAGGGACTGAAGTCAAGAACGCAGGGGGCGCCGCCATGATCCTCCTAAACCAACAACAAGATGGGTTCAGTACTGAAGCAGATGCTCACGTTCTTCCGGCAAGCCACGTCAGCCACAAGGCGGCGTTGAAGATAAAAGCCTACATAAACTCAACAACATACCCAACAGCCACAATTCTTTTCAAAGGAACCGTAATTGGCGACGACAACTTTTCACCCGCCATAGCTTCTTTTTCATCTCGAGGTCCCAGCGTTGCAAGCCCTGGAATTTTGAAGCCCGACATAACCGGTCCAGGTGTCAGCATTTTAGCCGCATGGCCATTTCCATTAGACAAAAACAGTAACACAAAATCAACATTCAACATAATTTCAGGAACATCCATGTCCTGTCCTCATCTCAGCGGCATTGCAGCTCTAATCAAAAGCTCTCACCCTGATTGGTCACCGGCCGCCATTAAATCCGCCATAATGACAACCGCTGACATAACAAATCTTGAAGGCCAGCCAATTGTCGACGAAAATTTGCAGCCGGCGGATTTGTTTGCAACTGGCGCTGGTCATGTCAATCCATCAAAAGCAGCCGACCCAGGATTAGTTTA
TGACATTCAACCCGATGATTACATTCCTTATCTTTGTGGATTGGGATACAAAAGTAACGAAGTTGCAACAATTGCCCGTAAACCAATTAATTGTTTGGCAAAACCAAGCATTCCCGAAGGAGACCTCAACTATCCGTCATTTACGGTCGTTTTAGGACCGCCGCAAACATTTACAAGAACGGTGACAAATGTCGGCTGTGGACGTGAAGTTTATACTGCCGTGGTCGAAGCACCGCCGTCCATTTCTGTAACAATCCGACCAAGTAAGATATTCTTCTCAAAGATTAACGAAAAAGTGACATATTCAGTGACGTTCAAGAGAATTGGTTCGATCAGTCCCTCAACAGAATTTGGTAAAGGATATCTCAAATGGGTTTCCGACAAACACGTCGTTAGAAGTCCGATCTCTTTTAAGTTTGCATGA

Coding sequence (CDS)

TTTCATACACAATTGCCGGCCAGCCCTGTCTCCCTTCCCCAATATAAACCATCCACTTTGGTGAGTTCACCAAGTGCATCAAGCATTAGCCTTTTCCCTTACACAATGGTTGCTTTCCCTTCTCTTTTTCTTCTTCTATTGCTCAACTTTCATTATGTTGGGGCTTTAGTGACTGAGCTTCCGTTGATCAATCTTCAAACGTACATTGTCCACGTGGAGAAACCAGAGACGACGGATGATCTTGAGAGCTGGCATCGATCGTTCTTACCGNTTTTTATTCTTCCGTTGATCAATCTTCAAACGTACATTGTCCACGTGGAGAAACCAGAGACGACGGATGATCTTGAGAGCTGGCATCGATCGTTCTTACCGTCGAGTTCGAGTTTGCTATACTCGTATCGAAATGTGATGAGTGGTTTTGCTGCAAGACTAAGTGAAGAACAAGTGAAAGCAATGGAAGAGAAGGATGGTTTTGTGTCAGCAAGGCGTGAAAGGATATTGCAATTGCATACAACGCATACCCCTGATTTTCTTGGTTTGAATCGCCAATTTGGGTTTTGGAAAGATTCAAACTTTGGAAAGGGAGTGATCATAGGAGTGTTGGATGGCGGAATTGCGCCAAGCCATCCTTCATTTGATGATGTGGGAATGCCACCACCGCCACCCAAATGGAAAGGAAGATGCGAGTTTAATTTCTCAGCCTGCAACAACAAGCTTATAGGTGCGAGATCTTTTAATCTCGCAACAAAAGTCTTGAAAGGAGAGACAATGATGGATGACTCTCCTATTGATGAAGATGGTCATGGGACTCATACAGCGAGCACAGCTGCTGGTGCCTTCGTCAAGGGCGCTGAGGCATTGGGAAACGCTAAAGGCACGGCTGTTGGCATGGCCCCTTTAGCTCATCTAGCAATTTACAAAGTTTGCTTTGGAGAAGACTGCCCTGACACCGACATTCTCGCTGCACTTGATGCCGCTATTGAAGACGGCGTCGATGTGCTCTCACTCTCGCTTGGGAGCCCATCGGTTCCATTCTTCCAAGACTTAGTCGCCATAGGTGCATTCGCAGCCATTCAAAAGGGGATTTTTGTGAGTTGCTCAGCTGCTAATTCAGGCCCTTTTAAAGCCACATTGTCCAACGAAGCCCCGTGGATTCTAACGGTTGCAGCCAGCACCATTGATCGAAGAATCAAAGCCGCTGCAAAGCTTGGAAATGGAGAAGAATTTGATGGCGAATCTCTGTTCCAACCAAGCGATTTCCCACCAACATTTTTGCCACTTGTTTACGCTGGTGAGAAGAATCAAACAGCCGCTTTGTGTGGAGAAGGATCATTGAAAGACATCGACGTAAAGGGAAAAATTGTGGTATGCGAAAGAGGAGGAGGAATTGCAAGAATTGCAAAAGGGACTGAAGTCAAGAACGCAGGGGGCGCCGCCATGATCCTCCTAAACCAACAACAAGATGGGTTCAGTACTGAAGCAGATGCTCACGTTCTTCCGGCAAGCCACGTCAGCCACAAGGCGGCGTTGAAGATAAAAGCCTACATAAACTCAACAACATACCCAACAGCCACAATTCTTTTCAAAGGAACCGTAATTGGCGACGACAACTTTTCACCCGCCATAGCTTCTTTTTCATCTCGAGGTCCCAGCGTTGCAAGCCCTGGAATTTTGAAGCCCGACATAACCGGTCCAGGTGTCAGCATTTTAGCCGCATGGCCATTTCCATTAGACAAAAACAGTAACACAAAATCAACATTCAACATAATTTCAGGAACATCCATGTCCTGTCCTCATCTCAGCGGCATTGCAGCTCTAATCAAAAGCTCTCACCCTGATTGGTCACCGGCCGCCATTAAATCCGCCATAATGACAACCGCTGACATAACAAATCTTGAAGGCCAGCCAATTGTCGACGAAAATTTGCAGCCGGCGGATTTGTTTGCAACTGGCGCTGGTCATGTCAATCCATCAAAAGCAGCCGACCCAGGATTAGTTTA
TGACATTCAACCCGATGATTACATTCCTTATCTTTGTGGATTGGGATACAAAAGTAACGAAGTTGCAACAATTGCCCGTAAACCAATTAATTGTTTGGCAAAACCAAGCATTCCCGAAGGAGACCTCAACTATCCGTCATTTACGGTCGTTTTAGGACCGCCGCAAACATTTACAAGAACGGTGACAAATGTCGGCTGTGGACGTGAAGTTTATACTGCCGTGGTCGAAGCACCGCCGTCCATTTCTGTAACAATCCGACCAAGTAAGATATTCTTCTCAAAGATTAACGAAAAAGTGACATATTCAGTGACGTTCAAGAGAATTGGTTCGATCAGTCCCTCAACAGAATTTGGTAAAGGATATCTCAAATGGGTTTCCGACAAACACGTCGTTAGAAGTCCGATCTCTTTTAAGTTTGCATGA

Protein sequence

FHTQLPASPVSLPQYKPSTLVSSPSASSISLFPYTMVAFPSLFLLLLLNFHYVGALVTELPLINLQTYIVHVEKPETTDDLESWHRSFLPXFILPLINLQTYIVHVEKPETTDDLESWHRSFLPSSSSLLYSYRNVMSGFAARLSEEQVKAMEEKDGFVSARRERILQLHTTHTPDFLGLNRQFGFWKDSNFGKGVIIGVLDGGIAPSHPSFDDVGMPPPPPKWKGRCEFNFSACNNKLIGARSFNLATKVLKGETMMDDSPIDEDGHGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGEDCPDTDILAALDAAIEDGVDVLSLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATLSNEAPWILTVAASTIDRRIKAAAKLGNGEEFDGESLFQPSDFPPTFLPLVYAGEKNQTAALCGEGSLKDIDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADAHVLPASHVSHKAALKIKAYINSTTYPTATILFKGTVIGDDNFSPAIASFSSRGPSVASPGILKPDITGPGVSILAAWPFPLDKNSNTKSTFNIISGTSMSCPHLSGIAALIKSSHPDWSPAAIKSAIMTTADITNLEGQPIVDENLQPADLFATGAGHVNPSKAADPGLVYDIQPDDYIPYLCGLGYKSNEVATIARKPINCLAKPSIPEGDLNYPSFTVVLGPPQTFTRTVTNVGCGREVYTAVVEAPPSISVTIRPSKIFFSKINEKVTYSVTFKRIGSISPSTEFGKGYLKWVSDKHVVRSPISFKFA
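
The relationship between the coding sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch, assuming the standard genetic code and reading frame 1; the function name `translate` is illustrative and not part of any tool used to build this record. Codons containing an ambiguous base such as N are reported as X, which matches the X in the listed protein.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 1.

    Whitespace (including line breaks inside the sequence) is ignored.
    Codons not found in the table, e.g. those containing N, become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First four codons of the coding sequence listed above.
print(translate("TTTCATACACAA"))
```

Pasting the full coding sequence into `translate` (line breaks are tolerated) is one way to check it against the listed protein.
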
BLAST of Cp4.1LG05g03970.1 vs. Swiss-Prot
Match: SBT18_ARATH (Subtilisin-like protease SBT1.8 OS=Arabidopsis thaliana GN=SBT1.8 PE=2 SV=1)

HSP 1 Score: 613.6 bits (1581), Expect = 3.1e-174
Identity = 343/730 (46.99%), Positives = 460/730 (63.01%), Query Frame = 1

Query: 100 QTYIV---HVEKPETTDDLESWHRSFLPSSSSLLYSYRNVMSGFAARL-SEEQVKAMEEK 159
           +TYI+   H +KPE+      W+ S L S SSLLY+Y     GF+A L S E    +   
Sbjct: 28  KTYIIRVNHSDKPESFLTHHDWYTSQLNSESSLLYTYTTSFHGFSAYLDSTEADSLLSSS 87

Query: 160 DGFVSARRERILQLHTTHTPDFLGLNRQFGFWKDSNFGKGVIIGVLDGGIAPSHPSFDDV 219
           +  +    + +  LHTT TP+FLGLN +FG     +   GVIIGVLD G+ P   SFDD 
Sbjct: 88  NSILDIFEDPLYTLHTTRTPEFLGLNSEFGVHDLGSSSNGVIIGVLDTGVWPESRSFDDT 147

Query: 220 GMPPPPPKWKGRCE----FNFSACNNKLIGARSFNLATKVLKGETMMDD----SPIDEDG 279
            MP  P KWKG CE    F+   CN KLIGARSF+   ++  G          SP D DG
Sbjct: 148 DMPEIPSKWKGECESGSDFDSKLCNKKLIGARSFSKGFQMASGGGFSSKRESVSPRDVDG 207

Query: 280 HGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGEDCPDTDILAALDAAI 339
           HGTHT++TAAG+ V+ A  LG A GTA GMA  A +A YKVC+   C  +DILAA+D AI
Sbjct: 208 HGTHTSTTAAGSAVRNASFLGYAAGTARGMATRARVATYKVCWSTGCFGSDILAAMDRAI 267

Query: 340 EDGVDVLSLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATLSNEAPWILT 399
            DGVDVLSLSLG  S P+++D +AIGAF+A+++G+FVSCSA NSGP +A+++N APW++T
Sbjct: 268 LDGVDVLSLSLGGGSAPYYRDTIAIGAFSAMERGVFVSCSAGNSGPTRASVANVAPWVMT 327

Query: 400 VAASTIDRRIKAAAKLGNGEEFDGESLFQPSDFPPTFLPLVYAGEKNQTAALCGEGSLKD 459
           V A T+DR   A A LGNG+   G SL+         L LVY    + ++ LC  GSL  
Sbjct: 328 VGAGTLDRDFPAFANLGNGKRLTGVSLYSGVGMGTKPLELVYNKGNSSSSNLCLPGSLDS 387

Query: 460 IDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADAHVLPASHVSHK 519
             V+GKIVVC+RG   AR+ KG  V++AGG  MI+ N    G    AD+H+LPA  V  K
Sbjct: 388 SIVRGKIVVCDRGVN-ARVEKGAVVRDAGGLGMIMANTAASGEELVADSHLLPAIAVGKK 447

Query: 520 AALKIKAYINSTTYPTATILFKGTVIGDDNFSPAIASFSSRGPSVASPGILKPDITGPGV 579
               ++ Y+ S + PTA ++FKGTV+ D   SP +A+FSSRGP+  +P ILKPD+ GPGV
Sbjct: 448 TGDLLREYVKSDSKPTALLVFKGTVL-DVKPSPVVAAFSSRGPNTVTPEILKPDVIGPGV 507

Query: 580 SILAAW-----PFPLDKNSNTKSTFNIISGTSMSCPHLSGIAALIKSSHPDWSPAAIKSA 639
           +ILA W     P  LDK+S  ++ FNI+SGTSMSCPH+SG+A L+K++HP+WSP+AIKSA
Sbjct: 508 NILAGWSDAIGPTGLDKDSR-RTQFNIMSGTSMSCPHISGLAGLLKAAHPEWSPSAIKSA 567

Query: 640 IMTTA---DITNLEGQPIVDENLQPADLFATGAGHVNPSKAADPGLVYDIQPDDYIPYLC 699
           +MTTA   D TN       D +L  ++ +A G+GHV+P KA  PGLVYDI  ++YI +LC
Sbjct: 568 LMTTAYVLDNTNAPLHDAADNSL--SNPYAHGSGHVDPQKALSPGLVYDISTEEYIRFLC 627

Query: 700 GLGYKSNEVATIARKP-INCLAKPSIPEGDLNYPSFTVVLGPPQT--FTRTVTNVGCGRE 759
            L Y  + +  I ++P +NC  K S P G LNYPSF+V+ G  +   +TR VTNVG    
Sbjct: 628 SLDYTVDHIVAIVKRPSVNCSKKFSDP-GQLNYPSFSVLFGGKRVVRYTREVTNVGAASS 687

Query: 760 VYTAVVEAPPSISVTIRPSKIFFSKINEKVTYSVTFKRIGSISPSTEFGKGYLKWVSDKH 807
           VY   V   PS+ ++++PSK+ F  + EK  Y+VTF     +S + +   G + W + +H
Sbjct: 688 VYKVTVNGAPSVGISVKPSKLSFKSVGEKKRYTVTFVSKKGVSMTNKAEFGSITWSNPQH 747
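
The percentages in an HSP summary line are simply the counts divided by the alignment length. The sketch below parses such a line and recomputes both values; the regular expression and the function name are assumptions made for illustration, not part of BLAST or of this database.

```python
import re

# "Pos\w*" tolerates spelling variants of the "Positives" label.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w*\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)

def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP summary line and recompute percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, length, _, positive, length2, _ = m.groups()
    identical, positive = int(identical), int(positive)
    length, length2 = int(length), int(length2)
    if length != length2:
        raise ValueError("alignment lengths disagree")
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positive_pct": round(100 * positive / length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 343/730 (46.99%), Positives = 460/730 (63.01%), Query Frame = 1"
)
print(stats)
```

For the SBT18_ARATH hit this recomputes 46.99% identity and 63.01% positives over 730 aligned columns, in agreement with the reported values.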

BLAST of Cp4.1LG05g03970.1 vs. Swiss-Prot
Match: SBT14_ARATH (Subtilisin-like protease SBT1.4 OS=Arabidopsis thaliana GN=SBT1.4 PE=2 SV=1)

HSP 1 Score: 594.3 bits (1531), Expect = 1.9e-168
Identity = 346/747 (46.32%), Positives = 459/747 (61.45%), Query Frame = 1

Query: 99  LQTYIVHVE---KPETTDDLESWHRSFL------PSSSSLLYSYRNVMSGFAARLSEEQV 158
           L++YIVHV+   KP       +WH S L      P  ++LLYSY   + GF+ARLS  Q 
Sbjct: 30  LESYIVHVQRSHKPSLFSSHNNWHVSLLRSLPSSPQPATLLYSYSRAVHGFSARLSPIQT 89

Query: 159 KAMEEKDGFVSARRERILQLHTTHTPDFLGLNRQFGFWKDSNFGKGVIIGVLDGGIAPSH 218
            A+      +S   ++  ++HTTHTP FLG ++  G W +SN+G+ VI+GVLD GI P H
Sbjct: 90  AALRRHPSVISVIPDQAREIHTTHTPAFLGFSQNSGLWSNSNYGEDVIVGVLDTGIWPEH 149

Query: 219 PSFDDVGMPPPPPKWKGRCE----FNFSACNNKLIGARSFNLATKVLKGETMM-----DD 278
           PSF D G+ P P  WKG CE    F  S+CN KLIGAR+F       +  T         
Sbjct: 150 PSFSDSGLGPIPSTWKGECEIGPDFPASSCNRKLIGARAFYRGYLTQRNGTKKHAAKESR 209

Query: 279 SPIDEDGHGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGEDCPDTDIL 338
           SP D +GHGTHTASTAAG+ V  A     A+GTA GMA  A +A YK+C+   C D+DIL
Sbjct: 210 SPRDTEGHGTHTASTAAGSVVANASLYQYARGTATGMASKARIAAYKICWTGGCYDSDIL 269

Query: 339 AALDAAIEDGVDVLSLSLG-SPSVP-FFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATL 398
           AA+D A+ DGV V+SLS+G S S P +  D +AIGAF A + GI VSCSA NSGP   T 
Sbjct: 270 AAMDQAVADGVHVISLSVGASGSAPEYHTDSIAIGAFGATRHGIVVSCSAGNSGPNPETA 329

Query: 399 SNEAPWILTVAASTIDRRIKAAAKLGNGEEFDGESLFQPSDFPPTFLPLVYAGEKNQTAA 458
           +N APWILTV AST+DR   A A  G+G+ F G SL+     P + L LVY+G+    + 
Sbjct: 330 TNIAPWILTVGASTVDREFAANAITGDGKVFTGTSLYAGESLPDSQLSLVYSGDCG--SR 389

Query: 459 LCGEGSLKDIDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADAHV 518
           LC  G L    V+GKIV+C+RGG  AR+ KG+ VK AGGA MIL N  + G    AD+H+
Sbjct: 390 LCYPGKLNSSLVEGKIVLCDRGGN-ARVEKGSAVKLAGGAGMILANTAESGEELTADSHL 449

Query: 519 LPASHVSHKAALKIKAYINSTTYPTATILFKGTVIGDDNFSPAIASFSSRGPSVASPGIL 578
           +PA+ V  KA  +I+ YI ++  PTA I F GT+IG    SP +A+FSSRGP+  +P IL
Sbjct: 450 VPATMVGAKAGDQIRDYIKTSDSPTAKISFLGTLIGPSPPSPRVAAFSSRGPNHLTPVIL 509

Query: 579 KPDITGPGVSILAAW-----PFPLDKNSNTKSTFNIISGTSMSCPHLSGIAALIKSSHPD 638
           KPD+  PGV+ILA W     P  LD +   +  FNIISGTSMSCPH+SG+AAL++ +HPD
Sbjct: 510 KPDVIAPGVNILAGWTGMVGPTDLDIDPR-RVQFNIISGTSMSCPHVSGLAALLRKAHPD 569

Query: 639 WSPAAIKSAIMTTADITNLEGQPIVD-ENLQPADLFATGAGHVNPSKAADPGLVYDIQPD 698
           WSPAAIKSA++TTA      G+PI D    + ++ F  GAGHV+P+KA +PGLVYDI+  
Sbjct: 570 WSPAAIKSALVTTAYDVENSGEPIEDLATGKSSNSFIHGAGHVDPNKALNPGLVYDIEVK 629

Query: 699 DYIPYLCGLGYKSNEVATIARKPI---NCLAKPSIPEGDLNYPSFTVVL---GPPQTFTR 758
           +Y+ +LC +GY+   +    + P     C        GDLNYPSF+VV    G    + R
Sbjct: 630 EYVAFLCAVGYEFPGILVFLQDPTLYDACETSKLRTAGDLNYPSFSVVFASTGEVVKYKR 689

Query: 759 TVTNVGCGRE-VYTAVVEAPPSISVTIRPSKIFFSKINEKVTYSVTFKR------IGSIS 807
            V NVG   + VY   V++P ++ + + PSK+ FSK    + Y VTFK       +GS+ 
Sbjct: 690 VVKNVGSNVDAVYEVGVKSPANVEIDVSPSKLAFSKEKSVLEYEVTFKSVVLGGGVGSV- 749

BLAST of Cp4.1LG05g03970.1 vs. Swiss-Prot
Match: SBT17_ARATH (Subtilisin-like protease SBT1.7 OS=Arabidopsis thaliana GN=SBT1.7 PE=1 SV=1)

HSP 1 Score: 585.5 bits (1508), Expect = 8.9e-166
Identity = 337/730 (46.16%), Positives = 452/730 (61.92%), Query Frame = 1

Query: 101 TYIVHVEK---PETTDDLESWHRSFLPS---SSSLLYSYRNVMSGFAARLSEEQVKAMEE 160
           TYIVH+ K   P + D   +W+ S L S   S+ LLY+Y N + GF+ RL++E+  ++  
Sbjct: 31  TYIVHMAKSQMPSSFDLHSNWYDSSLRSISDSAELLYTYENAIHGFSTRLTQEEADSLMT 90

Query: 161 KDGFVSARRERILQLHTTHTPDFLGLNRQFG-FWKDSNFGKGVIIGVLDGGIAPSHPSFD 220
           + G +S   E   +LHTT TP FLGL+      + ++     V++GVLD G+ P   S+ 
Sbjct: 91  QPGVISVLPEHRYELHTTRTPLFLGLDEHTADLFPEAGSYSDVVVGVLDTGVWPESKSYS 150

Query: 221 DVGMPPPPPKWKGRCE----FNFSACNNKLIGARSFNLATKVLKG---ETMMDDSPIDED 280
           D G  P P  WKG CE    F  S CN KLIGAR F    +   G   E+    SP D+D
Sbjct: 151 DEGFGPIPSSWKGGCEAGTNFTASLCNRKLIGARFFARGYESTMGPIDESKESRSPRDDD 210

Query: 281 GHGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGEDCPDTDILAALDAA 340
           GHGTHT+STAAG+ V+GA  LG A GTA GMAP A +A+YKVC+   C  +DILAA+D A
Sbjct: 211 GHGTHTSSTAAGSVVEGASLLGYASGTARGMAPRARVAVYKVCWLGGCFSSDILAAIDKA 270

Query: 341 IEDGVDVLSLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATLSNEAPWIL 400
           I D V+VLS+SLG     +++D VAIGAFAA+++GI VSCSA N+GP  ++LSN APWI 
Sbjct: 271 IADNVNVLSMSLGGGMSDYYRDGVAIGAFAAMERGILVSCSAGNAGPSSSSLSNVAPWIT 330

Query: 401 TVAASTIDRRIKAAAKLGNGEEFDGESLFQPSDFPPTFLPLVYAGEKNQ--TAALCGEGS 460
           TV A T+DR   A A LGNG+ F G SLF+    P   LP +YAG  +      LC  G+
Sbjct: 331 TVGAGTLDRDFPALAILGNGKNFTGVSLFKGEALPDKLLPFIYAGNASNATNGNLCMTGT 390

Query: 461 LKDIDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADAHVLPASHV 520
           L    VKGKIV+C+RG   AR+ KG  VK AGG  MIL N   +G    ADAH+LPA+ V
Sbjct: 391 LIPEKVKGKIVMCDRGIN-ARVQKGDVVKAAGGVGMILANTAANGEELVADAHLLPATTV 450

Query: 521 SHKAALKIKAYINSTTYPTATILFKGTVIGDDNFSPAIASFSSRGPSVASPGILKPDITG 580
             KA   I+ Y+ +   PTA+I   GTV+G    SP +A+FSSRGP+  +P ILKPD+  
Sbjct: 451 GEKAGDIIRHYVTTDPNPTASISILGTVVGVKP-SPVVAAFSSRGPNSITPNILKPDLIA 510

Query: 581 PGVSILAAW-----PFPLDKNSNTKSTFNIISGTSMSCPHLSGIAALIKSSHPDWSPAAI 640
           PGV+ILAAW     P  L  +S  +  FNIISGTSMSCPH+SG+AAL+KS HP+WSPAAI
Sbjct: 511 PGVNILAAWTGAAGPTGLASDSR-RVEFNIISGTSMSCPHVSGLAALLKSVHPEWSPAAI 570

Query: 641 KSAIMTTADITNLEGQPIVD-ENLQPADLFATGAGHVNPSKAADPGLVYDIQPDDYIPYL 700
           +SA+MTTA  T  +G+P++D    +P+  F  GAGHV+P+ A +PGL+YD+  +DY+ +L
Sbjct: 571 RSALMTTAYKTYKDGKPLLDIATGKPSTPFDHGAGHVSPTTATNPGLIYDLTTEDYLGFL 630

Query: 701 CGLGYKSNEVATIARKPINCLAKPSIPEGDLNYPSFTVVLG--PPQTFTRTVTNVGCGRE 760
           C L Y S ++ +++R+   C    S    DLNYPSF V +       +TRTVT+VG    
Sbjct: 631 CALNYTSPQIRSVSRRNYTCDPSKSYSVADLNYPSFAVNVDGVGAYKYTRTVTSVGGAGT 690

Query: 761 VYTAVVEAPPSISVTIRPSKIFFSKINEKVTYSVTFKRIGSISPSTEFGKGYLKWVSDKH 807
               V      + +++ P+ + F + NEK +Y+VTF  + S  PS     G ++W   KH
Sbjct: 691 YSVKVTSETTGVKISVEPAVLNFKEANEKKSYTVTF-TVDSSKPSGSNSFGSIEWSDGKH 750

BLAST of Cp4.1LG05g03970.1 vs. Swiss-Prot
Match: SBT12_ARATH (Subtilisin-like protease SBT1.2 OS=Arabidopsis thaliana GN=SBT1.2 PE=2 SV=1)

HSP 1 Score: 583.9 bits (1504), Expect = 2.6e-165
Identity = 332/753 (44.09%), Positives = 457/753 (60.69%), Query Frame = 1

Query: 96  LINLQTYIVHV----EKPETTDDLESWHRSFLPS------------SSSLLYSYRNVMSG 155
           ++  QTYIV +    E  +T      WH SFL              SS LLYSY + + G
Sbjct: 22  ILQKQTYIVQLHPNSETAKTFASKFDWHLSFLQEAVLGVEEEEEEPSSRLLYSYGSAIEG 81

Query: 156 FAARLSEEQVKAMEEKDGFVSARRERILQLHTTHTPDFLGLNR--QFGFWKDSNFGKGVI 215
           FAA+L+E + + +      V+ R + +LQ+ TT++  FLGL+     G W  S FG+G I
Sbjct: 82  FAAQLTESEAEILRYSPEVVAVRPDHVLQVQTTYSYKFLGLDGFGNSGVWSKSRFGQGTI 141

Query: 216 IGVLDGGIAPSHPSFDDVGMPPPPPKWKGRCE----FNFSACNNKLIGARSFNLATKVLK 275
           IGVLD G+ P  PSFDD GMP  P KWKG C+    F+ S+CN KLIGAR F    +V  
Sbjct: 142 IGVLDTGVWPESPSFDDTGMPSIPRKWKGICQEGESFSSSSCNRKLIGARFFIRGHRVAN 201

Query: 276 GETMMDDSPI------DEDGHGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYK 335
                 + P       D  GHGTHTAST  G+ V  A  LGN  G A GMAP AH+A+YK
Sbjct: 202 SPEESPNMPREYISARDSTGHGTHTASTVGGSSVSMANVLGNGAGVARGMAPGAHIAVYK 261

Query: 336 VCFGEDCPDTDILAALDAAIEDGVDVLSLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCS 395
           VC+   C  +DILAA+D AI+D VDVLSLSLG   +P + D +AIG F A+++GI V C+
Sbjct: 262 VCWFNGCYSSDILAAIDVAIQDKVDVLSLSLGGFPIPLYDDTIAIGTFRAMERGISVICA 321

Query: 396 AANSGPFKATLSNEAPWILTVAASTIDRRIKAAAKLGNGEEFDGESLFQPSDFPPT--FL 455
           A N+GP +++++N APW+ T+ A T+DRR  A  +L NG+   GESL+           +
Sbjct: 322 AGNNGPIESSVANTAPWVSTIGAGTLDRRFPAVVRLANGKLLYGESLYPGKGIKNAGREV 381

Query: 456 PLVYAGEKNQTAALCGEGSLKDIDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQ 515
            ++Y    ++ +  C  GSL   +++GK+V+C+RG    R  KG  VK AGG AMIL N 
Sbjct: 382 EVIYVTGGDKGSEFCLRGSLPREEIRGKMVICDRGVN-GRSEKGEAVKEAGGVAMILANT 441

Query: 516 QQDGFSTEADAHVLPASHVSHKAALKIKAYINSTTYPTATILFKGTVIGDDNFSPAIASF 575
           + +      D H+LPA+ + +  ++ +KAY+N+T  P A I+F GTVIG    +P +A F
Sbjct: 442 EINQEEDSIDVHLLPATLIGYTESVLLKAYVNATVKPKARIIFGGTVIGRSR-APEVAQF 501

Query: 576 SSRGPSVASPGILKPDITGPGVSILAAWPFPLDKN----SNTKSTFNIISGTSMSCPHLS 635
           S+RGPS+A+P ILKPD+  PGV+I+AAWP  L        + +  F ++SGTSMSCPH+S
Sbjct: 502 SARGPSLANPSILKPDMIAPGVNIIAAWPQNLGPTGLPYDSRRVNFTVMSGTSMSCPHVS 561

Query: 636 GIAALIKSSHPDWSPAAIKSAIMTTADITNLEGQPIVDENLQPADLFATGAGHVNPSKAA 695
           GI ALI+S++P+WSPAAIKSA+MTTAD+ + +G+ I D N +PA +FA GAGHVNP KA 
Sbjct: 562 GITALIRSAYPNWSPAAIKSALMTTADLYDRQGKAIKDGN-KPAGVFAIGAGHVNPQKAI 621

Query: 696 DPGLVYDIQPDDYIPYLCGLGYKSNEVATIARKPINC---LAKPSIPEGDLNYPSFTVVL 755
           +PGLVY+IQP DYI YLC LG+  +++  I  K ++C   L K   P   LNYPS  V+ 
Sbjct: 622 NPGLVYNIQPVDYITYLCTLGFTRSDILAITHKNVSCNGILRKN--PGFSLNYPSIAVIF 681

Query: 756 GPPQT---FTRTVTNVGCGREVYTAVVEAPPSISVTIRPSKIFFSKINEKVTYSVTF--K 804
              +T    TR VTNVG    +Y+  V+AP  I V + P ++ F  +++ ++Y V F  K
Sbjct: 682 KRGKTTEMITRRVTNVGSPNSIYSVNVKAPEGIKVIVNPKRLVFKHVDQTLSYRVWFVLK 741

BLAST of Cp4.1LG05g03970.1 vs. Swiss-Prot
Match: SBT15_ARATH (Subtilisin-like protease SBT1.5 OS=Arabidopsis thaliana GN=SBT1.5 PE=2 SV=1)

HSP 1 Score: 571.6 bits (1472), Expect = 1.3e-161
Identity = 336/746 (45.04%), Positives = 453/746 (60.72%), Query Frame = 1

Query: 98  NLQTYIVHVE---KPETTDDLESWHRSFLPSSSS----LLYSYRNVMSGFAARLSEEQVK 157
           N  TYIVHV+   KP        W+ S L S +S    ++++Y  V  GF+ARL+ +   
Sbjct: 24  NSLTYIVHVDHEAKPSIFPTHFHWYTSSLASLTSSPPSIIHTYDTVFHGFSARLTSQDAS 83

Query: 158 AMEEKDGFVSARRERILQLHTTHTPDFLGLNR--QFGFWKDSNFGKGVIIGVLDGGIAPS 217
            + +    +S   E++  LHTT +P+FLGL    + G  ++S+FG  ++IGV+D G+ P 
Sbjct: 84  QLLDHPHVISVIPEQVRHLHTTRSPEFLGLRSTDKAGLLEESDFGSDLVIGVIDTGVWPE 143

Query: 218 HPSFDDVGMPPPPPKWKGRC----EFNFSACNNKLIGARSF---NLATKVLKGETMMDDS 277
            PSFDD G+ P P KWKG+C    +F  SACN KL+GAR F     AT     ET    S
Sbjct: 144 RPSFDDRGLGPVPIKWKGQCIASQDFPESACNRKLVGARFFCGGYEATNGKMNETTEFRS 203

Query: 278 PIDEDGHGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGEDCPDTDILA 337
           P D DGHGTHTAS +AG +V  A  LG A G A GMAP A LA YKVC+   C D+DILA
Sbjct: 204 PRDSDGHGTHTASISAGRYVFPASTLGYAHGVAAGMAPKARLAAYKVCWNSGCYDSDILA 263

Query: 338 ALDAAIEDGVDVLSLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATLSNE 397
           A D A+ DGVDV+SLS+G   VP++ D +AIGAF AI +GIFVS SA N GP   T++N 
Sbjct: 264 AFDTAVADGVDVISLSVGGVVVPYYLDAIAIGAFGAIDRGIFVSASAGNGGPGALTVTNV 323

Query: 398 APWILTVAASTIDRRIKAAAKLGNGEEFDGESLF-QPSDFPPTFLPLVYAGE----KNQT 457
           APW+ TV A TIDR   A  KLGNG+   G S++  P   P    PLVY G        +
Sbjct: 324 APWMTTVGAGTIDRDFPANVKLGNGKMISGVSVYGGPGLDPGRMYPLVYGGSLLGGDGYS 383

Query: 458 AALCGEGSLKDIDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADA 517
           ++LC EGSL    VKGKIV+C+RG   +R  KG  V+  GG  MI+ N   DG    AD 
Sbjct: 384 SSLCLEGSLDPNLVKGKIVLCDRGIN-SRATKGEIVRKNGGLGMIIANGVFDGEGLVADC 443

Query: 518 HVLPASHVSHKAALKIKAYIN------STTYPTATILFKGTVIGDDNFSPAIASFSSRGP 577
           HVLPA+ V      +I+ YI+      S+ +PTATI+FKGT +G    +P +ASFS+RGP
Sbjct: 444 HVLPATSVGASGGDEIRRYISESSKSRSSKHPTATIVFKGTRLG-IRPAPVVASFSARGP 503

Query: 578 SVASPGILKPDITGPGVSILAAWPFPLD----KNSNTKSTFNIISGTSMSCPHLSGIAAL 637
           +  +P ILKPD+  PG++ILAAWP  +      + N ++ FNI+SGTSM+CPH+SG+AAL
Sbjct: 504 NPETPEILKPDVIAPGLNILAAWPDRIGPSGVTSDNRRTEFNILSGTSMACPHVSGLAAL 563

Query: 638 IKSSHPDWSPAAIKSAIMTTADITNLEGQPIVDENL-QPADLFATGAGHVNPSKAADPGL 697
           +K++HPDWSPAAI+SA++TTA   +  G+P++DE+    + +   G+GHV+P+KA DPGL
Sbjct: 564 LKAAHPDWSPAAIRSALITTAYTVDNSGEPMMDESTGNTSSVMDYGSGHVHPTKAMDPGL 623

Query: 698 VYDIQPDDYIPYLCGLGYKSNEVATIARKPINC-LAKPSIPEGDLNYPSFTVVLGP---- 757
           VYDI   DYI +LC   Y    + TI R+  +C  A+ +   G+LNYPSF+VV       
Sbjct: 624 VYDITSYDYINFLCNSNYTRTNIVTITRRQADCDGARRAGHVGNLNYPSFSVVFQQYGES 683

Query: 758 --PQTFTRTVTNVGCGREVYTAVVEAPPSISVTIRPSKIFFSKINEKVTYSVTFKRIG-S 803
                F RTVTNVG    VY   +  P   +VT+ P K+ F ++ +K+++ V  K     
Sbjct: 684 KMSTHFIRTVTNVGDSDSVYEIKIRPPRGTTVTVEPEKLSFRRVGQKLSFVVRVKTTEVK 743

BLAST of Cp4.1LG05g03970.1 vs. TrEMBL
Match: A0A0A0KKE3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G157240 PE=4 SV=1)

HSP 1 Score: 1219.5 bits (3154), Expect = 0.0e+00
Identity = 616/772 (79.79%), Positives = 665/772 (86.14%), Query Frame = 1

Query: 36  MVAFPSLFLLLLLNFHYVGALVTELPLINLQTYIVHVEKPETTDDLESWHRSFLPXFILP 95
           MV  PSLFLLLLLNFH   A VTELP  NL TYIVHV+KPE  DDLESWHRSFLP     
Sbjct: 1   MVLLPSLFLLLLLNFHVYEAQVTELPFSNLHTYIVHVKKPEVVDDLESWHRSFLP----- 60

Query: 96  LINLQTYIVHVEKPETTDDLESWHRSFLPSSSSLLYSYRNVMSGFAARLSEEQVKAMEEK 155
                T + + E+  T                 LLYSYRNVMSGF+ARL+EE VKAMEEK
Sbjct: 61  -----TSLENSEEQPT-----------------LLYSYRNVMSGFSARLTEEHVKAMEEK 120

Query: 156 DGFVSARRERILQLHTTHTPDFLGLNRQFGFWKDSNFGKGVIIGVLDGGIAPSHPSFDDV 215
           DGFVSARRE I+ LHTTH+P+FLGLNRQFGFWKDSNFGKGVIIGVLDGGI PSHPSF D 
Sbjct: 121 DGFVSARRETIVHLHTTHSPNFLGLNRQFGFWKDSNFGKGVIIGVLDGGITPSHPSFVDA 180

Query: 216 GMPPPPPKWKGRCEFNFSACNNKLIGARSFNLATKVLKGE-TMMDDSPIDEDGHGTHTAS 275
           GMP PP KWKGRCEFNFSACNNKLIGARS NLA++ LKG+ T +DDSPIDEDGHGTHTAS
Sbjct: 181 GMPQPPAKWKGRCEFNFSACNNKLIGARSLNLASQALKGKITTLDDSPIDEDGHGTHTAS 240

Query: 276 TAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGEDCPDTDILAALDAAIEDGVDVL 335
           TAAG FV GAEALGNA GTAVGMAPLAHLAIYKVCFGE C + DILA LDAA+EDGVDVL
Sbjct: 241 TAAGTFVDGAEALGNAFGTAVGMAPLAHLAIYKVCFGESCSNVDILAGLDAAVEDGVDVL 300

Query: 336 SLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATLSNEAPWILTVAASTID 395
           S+SLG P VPFF D+ AIGAFAAIQKGIFVSCSAANSGPF ATLSNEAPWILTVAASTID
Sbjct: 301 SISLGGPPVPFFADITAIGAFAAIQKGIFVSCSAANSGPFNATLSNEAPWILTVAASTID 360

Query: 396 RRIKAAAKLGNGEEFDGESLFQPSDFPPTFLPLVYAGEKNQTAALCGEGSLKDIDVKGKI 455
           R+I A AKLGNGEEFDGESLFQP+DFP TFLPLV+ GEKN+T ALC EGSLK+IDVKGK+
Sbjct: 361 RKITATAKLGNGEEFDGESLFQPNDFPQTFLPLVFPGEKNETVALCAEGSLKNIDVKGKV 420

Query: 456 VVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADAHVLPASHVSHKAALKIKA 515
           VVC+RGGGIARIAKG EVKNAGGAAMILLN + DGF+TEADAHVLPASHVSH AALKIKA
Sbjct: 421 VVCDRGGGIARIAKGVEVKNAGGAAMILLNAESDGFTTEADAHVLPASHVSHTAALKIKA 480

Query: 516 YINSTTYPTATILFKGTVIGDDNFSPAIASFSSRGPSVASPGILKPDITGPGVSILAAWP 575
           YINSTTYPTATI+FKGT IGDD FSPAIA+FSSRGPS+ASPGILKPDITGPGVSILAAWP
Sbjct: 481 YINSTTYPTATIVFKGTTIGDD-FSPAIAAFSSRGPSLASPGILKPDITGPGVSILAAWP 540

Query: 576 FPLDKNSNTKSTFNIISGTSMSCPHLSGIAALIKSSHPDWSPAAIKSAIMTTADITNLEG 635
           FPLD N+NTKSTFNI+SGTSMSCPHLSGIAALIKS+HPDWSPAAIKS+IMTTA+ITNLEG
Sbjct: 541 FPLDNNTNTKSTFNIVSGTSMSCPHLSGIAALIKSAHPDWSPAAIKSSIMTTANITNLEG 600

Query: 636 QPIVDENLQPADLFATGAGHVNPSKAADPGLVYDIQPDDYIPYLCGLGYKSNEVATIARK 695
            PIVD+ LQPADLFA GAGHVNPSKA DPGLVYDIQPDDYIPYLCGLGY +N+V+ IA K
Sbjct: 601 NPIVDQTLQPADLFAIGAGHVNPSKAVDPGLVYDIQPDDYIPYLCGLGYTNNQVSLIAHK 660

Query: 696 PINCLAKPSIPEGDLNYPSFTVVLGPPQTFTRTVTNVGCGREVYTAVVEAPPSISVTIRP 755
           PI+CL   SIPEG+LNYPSF V LG  QTF+RTVT VG GREVY  V+EAP  +SVT+RP
Sbjct: 661 PIDCLTTTSIPEGELNYPSFMVKLGQVQTFSRTVTYVGSGREVYNVVIEAPEGVSVTVRP 720

Query: 756 SKIFFSKINEKVTYSVTFKRIGSISPSTEFGKGYLKWVSDKHVVRSPISFKF 807
            K+ FS +N+K TYSVTFKRIGSISPSTEF +GYLKWVS KH+VRSPIS KF
Sbjct: 721 RKVIFSALNQKATYSVTFKRIGSISPSTEFAEGYLKWVSAKHLVRSPISVKF 744

BLAST of Cp4.1LG05g03970.1 vs. TrEMBL
Match: B9GW37_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s11870g PE=4 SV=2)

HSP 1 Score: 1031.6 bits (2666), Expect = 5.3e-298
Identity = 517/721 (71.71%), Positives = 592/721 (82.11%), Query Frame = 1

Query: 99  LQTYIVHVEKPETT-----DDLESWHRSFLPSSSS-------LLYSYRNVMSGFAARLSE 158
           L  YIVHV KPE       +DLESW++SFLP S++       +LY+Y+NVMSGFAARL++
Sbjct: 35  LLNYIVHVAKPEGRTMAEFEDLESWYQSFLPVSTASSEKQQRMLYAYQNVMSGFAARLTQ 94

Query: 159 EQVKAMEEKDGFVSARRERILQLHTTHTPDFLGLNRQFGFWKDSNFGKGVIIGVLDGGIA 218
           E+VK+MEEKDGF+SAR ERIL L TTHTP FLGL+++ GFWK+SNFGKGVIIGVLDGGI 
Sbjct: 95  EEVKSMEEKDGFLSARPERILHLQTTHTPRFLGLHQELGFWKESNFGKGVIIGVLDGGIF 154

Query: 219 PSHPSFDDVGMPPPPPKWKGRCEFNFSACNNKLIGARSFNLATKVLKGETMMDDSPIDED 278
           PSHPSF D GMPPPP KWKGRC+FN S CNNKLIGARSFN+A K  KG    +  PID D
Sbjct: 155 PSHPSFSDEGMPPPPAKWKGRCDFNASDCNNKLIGARSFNIAAKAKKGSAATEP-PIDVD 214

Query: 279 GHGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGE---DCPDTDILAAL 338
           GHGTHTASTAAGAFVK AE LGNA+GTAVG+AP AHLAIYKVCFG+   DCP++DILA L
Sbjct: 215 GHGTHTASTAAGAFVKDAEVLGNARGTAVGIAPHAHLAIYKVCFGDPGDDCPESDILAGL 274

Query: 339 DAAIEDGVDVLSLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATLSNEAP 398
           DAA++DGVDVLSLSLG  SVP F D +AIG+FAAIQKGIFVSCSA NSGPF  TLSNEAP
Sbjct: 275 DAAVQDGVDVLSLSLGEDSVPLFNDTIAIGSFAAIQKGIFVSCSAGNSGPFNGTLSNEAP 334

Query: 399 WILTVAASTIDRRIKAAAKLGNGEEFDGESLFQPSDFPPTFLPLVYAGEKNQ-TAALCGE 458
           WILTV AST+DRR  A A+LGNGE+ DGESL Q S+FP T LPLVYAG   +  ++LCGE
Sbjct: 335 WILTVGASTVDRRFSATARLGNGEQIDGESLSQHSNFPSTLLPLVYAGMSGKPNSSLCGE 394

Query: 459 GSLKDIDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADAHVLPAS 518
           G+L+ +DVKGKIV+CERGGGI RIAKG EVKNAGGAAMIL+N++ DGFST AD HVLPA+
Sbjct: 395 GALEGMDVKGKIVLCERGGGIGRIAKGGEVKNAGGAAMILMNEEADGFSTNADVHVLPAT 454

Query: 519 HVSHKAALKIKAYINSTTYPTATILFKGTVIGDDNFSPAIASFSSRGPSVASPGILKPDI 578
           HVS  A LKIKAYINST  P ATILFKGTVIGD + SP +ASFSSRGPS+ASPGILKPDI
Sbjct: 455 HVSFAAGLKIKAYINSTQAPMATILFKGTVIGDSS-SPFVASFSSRGPSLASPGILKPDI 514

Query: 579 TGPGVSILAAWPFPLDKNSNTKSTFNIISGTSMSCPHLSGIAALIKSSHPDWSPAAIKSA 638
            GPGVSILAAWPFPLD N+N+KSTFNIISGTSMSCPHLSGIAAL+KSSHP WSPAAIKSA
Sbjct: 515 IGPGVSILAAWPFPLDNNTNSKSTFNIISGTSMSCPHLSGIAALLKSSHPYWSPAAIKSA 574

Query: 639 IMTTADITNLEGQPIVDENLQPADLFATGAGHVNPSKAADPGLVYDIQPDDYIPYLCGLG 698
           IMTTAD  N+EG+ IVD+ LQPAD+FATGAGHVNPS+A +PGLVYDIQPDDYIPYLCGLG
Sbjct: 575 IMTTADTLNMEGKLIVDQTLQPADIFATGAGHVNPSRANNPGLVYDIQPDDYIPYLCGLG 634

Query: 699 YKSNEVATIARKPINCLAKPSIPEGDLNYPSFTVVLGPPQTFTRTVTNVGCGREVYTAVV 758
           Y  NEV+ I  + + C  KPSIPEG+LNYPSF V LGP QTFTRTVTNVG     Y   +
Sbjct: 635 YADNEVSIIVHEQVKCSEKPSIPEGELNYPSFAVTLGPSQTFTRTVTNVGDVNSAYEVAI 694

Query: 759 EAPPSISVTIRPSKIFFSKINEKVTYSVTFKRIGSISPSTEFGKGYLKWVSDKHVVRSPI 804
            +PP + VT++PSK++FSK+N+K TYSV F R      ++E  +GY+ W S K+ VRSPI
Sbjct: 695 VSPPGVDVTVKPSKLYFSKVNQKATYSVAFSRTEYGGKTSETAQGYIVWASAKYTVRSPI 753

BLAST of Cp4.1LG05g03970.1 vs. TrEMBL
Match: W9RYM8_9ROSA (Subtilisin-like protease SDD1 OS=Morus notabilis GN=L484_023157 PE=4 SV=1)

HSP 1 Score: 1029.2 bits (2660), Expect = 2.6e-297
Identity = 529/787 (67.22%), Positives = 613/787 (77.89%), Query Frame = 1

Query: 34  YTMVAFPSLFLLLLLNFHYVGALVTELPLINLQTYIVHVEKPETTDDLESWHRSFLPXFI 93
           + M     L  + +LNF +V AL +E+           +   +TT+              
Sbjct: 83  FNMATTSLLSFIFVLNFFHVIALQSEV-----------ISVSQTTESS------------ 142

Query: 94  LPLINLQTYIVHVEKPE-----TTDDLESWHRSFLPSSSS--------LLYSYRNVMSGF 153
               +LQ YI+HV+ P+      ++DLESW+RSFLP++++        +LY+YRNV+ GF
Sbjct: 143 ----SLQNYIIHVKPPKGRVLSQSEDLESWYRSFLPATTAASSDNQPRMLYAYRNVLRGF 202

Query: 154 AARLSEEQVKAMEEKDGFVSARRERILQLHTTHTPDFLGLNRQFGFWKDSNFGKGVIIGV 213
           AARL+++QV+AME KDGF+SAR ERIL+  TTHTP+FLGL++Q GFW+DSNFGKGVIIGV
Sbjct: 203 AARLTQDQVRAMEGKDGFISARPERILKKLTTHTPNFLGLHQQKGFWRDSNFGKGVIIGV 262

Query: 214 LDGGIAPSHPSFDDVGMPPPPPKWKGRCEFNFSACNNKLIGARSFNLATKVLKGETMMDD 273
           LDGGI PSHPSF D GMPPPP KWKGRC+FN S CNNKLIGARSFNLA K  KG+    +
Sbjct: 263 LDGGIFPSHPSFSDEGMPPPPAKWKGRCDFNVSDCNNKLIGARSFNLAAKATKGDKA--E 322

Query: 274 SPIDEDGHGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGEDCPDTDIL 333
            PIDEDGHGTHTASTAAG FV  A+ LGNAKGTAVGMAP AHLAIYKVCFGEDCPD DIL
Sbjct: 323 PPIDEDGHGTHTASTAAGGFVNYADVLGNAKGTAVGMAPYAHLAIYKVCFGEDCPDADIL 382

Query: 334 AALDAAIEDGVDVLSLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATLSN 393
           AALDAA+EDGVDVLSLSLG  S PFF D +AIGAFAA +KGI VSCSA NSGP  +TLSN
Sbjct: 383 AALDAAVEDGVDVLSLSLGDVSRPFFNDSLAIGAFAATEKGILVSCSAGNSGPVNSTLSN 442

Query: 394 EAPWILTVAASTIDRRIKAAAKLGNGEEFDGESLFQPSDFPPTFLPLVYAGEKNQT-AAL 453
           EAPWILTV ASTIDR+I A AKLGN EEFDGES+ +  DFP T  PLVYAG   +  +A 
Sbjct: 443 EAPWILTVGASTIDRKIIATAKLGNDEEFDGESIHR-GDFPQTSWPLVYAGINGKADSAF 502

Query: 454 CGEGSLKDIDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADAHVL 513
           C EGSLKDIDVK K+V+CERGGG+ RIAKG EVKNAGGAAMIL+NQ+ DGFSTEAD H L
Sbjct: 503 CAEGSLKDIDVKNKVVLCERGGGVGRIAKGEEVKNAGGAAMILVNQESDGFSTEADPHAL 562

Query: 514 PASHVSHKAALKIKAYINSTTYPTATILFKGTVIGDDNFSPAIASFSSRGPSVASPGILK 573
           PA+HVS    LKIKAYINST  PTAT+ FKGTVIGD + +P IASFSSRGP++ASPGILK
Sbjct: 563 PAAHVSFADGLKIKAYINSTATPTATLFFKGTVIGD-SLAPFIASFSSRGPNLASPGILK 622

Query: 574 PDITGPGVSILAAWPFPLDKNSNTKSTFNIISGTSMSCPHLSGIAALIKSSHPDWSPAAI 633
           PDI GPGVSILAAWPFPLD N+N KS FNI+SGTSMSCPHLSGIA L+KSSHP WSPAAI
Sbjct: 623 PDIIGPGVSILAAWPFPLDNNTNPKSPFNIMSGTSMSCPHLSGIAVLLKSSHPYWSPAAI 682

Query: 634 KSAIMTTADITNLEGQPIVDENLQPADLFATGAGHVNPSKAADPGLVYDIQPDDYIPYLC 693
           KSAIMTTADI NLEG+ I+D+ L PAD+FATGAGHVNP KA DPGL+YD+QPDDYIPYLC
Sbjct: 683 KSAIMTTADIVNLEGKAILDQALTPADVFATGAGHVNPIKANDPGLIYDLQPDDYIPYLC 742

Query: 694 GLGYKSNEVATIARKPINCLAKPSIPEGDLNYPSFTVVLGPPQTFTRTVTNVGCGREVYT 753
           GLGY   EV  +AR+PI C  KPSIPEG+LNYPSF+V LGP QTFTRTVTNVG     YT
Sbjct: 743 GLGYNDKEVGIVARRPIKCSEKPSIPEGELNYPSFSVTLGPSQTFTRTVTNVGEAYSTYT 802

Query: 754 AVVEAPPSISVTIRPSKIFFSKINEKVTYSVTFKRIGSISPSTEFGKGYLKWVSDKHVVR 807
           A + AP  + V+++PSK++FSK+N+K TYSV F RI S   +  +G+G+L WVS +H VR
Sbjct: 803 ANIMAPDGVYVSVKPSKLYFSKVNQKATYSVNFSRITSSGETGPYGQGFLTWVSARHCVR 838

BLAST of Cp4.1LG05g03970.1 vs. TrEMBL
Match: A0A067JNI9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_20707 PE=4 SV=1)

HSP 1 Score: 1024.6 bits (2648), Expect = 6.4e-296
Identity = 513/726 (70.66%), Positives = 591/726 (81.40%), Query Frame = 1

Query: 98  NLQTYIVHVEKPE-----TTDDLESWHRSFLPSSSS--------LLYSYRNVMSGFAARL 157
           NLQ YIVHV  PE       ++LE+WH+SFLP S++        +LYSY N++SGF+ARL
Sbjct: 34  NLQAYIVHVSPPEGRTFSQRENLENWHKSFLPFSTASSEKQQKRMLYSYHNIISGFSARL 93

Query: 158 SEEQVKAMEEKDGFVSARRERILQLHTTHTPDFLGLNRQFGFWKDSNFGKGVIIGVLDGG 217
           + E+VKAMEE +GFV AR ER L L TTHTP FLGL+RQ GFWK+SNFGKGVIIGVLDGG
Sbjct: 94  THEEVKAMEEINGFVLARPERKLHLQTTHTPSFLGLHRQMGFWKESNFGKGVIIGVLDGG 153

Query: 218 IAPSHPSFDDVGMPPPPPKWKGRCEFNFSACNNKLIGARSFNLATKVLKGETMMDDSPID 277
           + PSHPSF+D GMPPPP KWKGRCEFN S CNNKLIGARSFNLA K +KG  +  ++PID
Sbjct: 154 VFPSHPSFNDKGMPPPPAKWKGRCEFNASKCNNKLIGARSFNLAAKAMKG--IAAETPID 213

Query: 278 EDGHGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGE---DCPDTDILA 337
            DGHGTHTASTAAG+FV  A  LGNAKGTAVGMAP AHLAIYKVCFG+   DCP++DILA
Sbjct: 214 VDGHGTHTASTAAGSFVYNANVLGNAKGTAVGMAPYAHLAIYKVCFGDPNDDCPESDILA 273

Query: 338 ALDAAIEDGVDVLSLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATLSNE 397
            LDAAI+DGVDVLSLS+G  S+PFFQD +AIG+FAAIQKGIFVSC+A NSGPF  TLSNE
Sbjct: 274 GLDAAIQDGVDVLSLSIGDISMPFFQDNIAIGSFAAIQKGIFVSCAAGNSGPFNGTLSNE 333

Query: 398 APWILTVAASTIDRRIKAAAKLGNGEEFDGESLFQPSDFPPTFLPLVYAGEKNQT-AALC 457
           APWILTV ASTIDR+I A AKLGNGEE DGES+ QPS+FP T LPLVY G   +T +A C
Sbjct: 334 APWILTVGASTIDRKIAATAKLGNGEELDGESVLQPSNFPTTLLPLVYPGMNGKTESAFC 393

Query: 458 GEGSLKDIDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADAHVLP 517
            E +++ +DVK K+V+CERGGGI R+AKG EVKNAGGAAMIL+N +  GFST ADAHVLP
Sbjct: 394 SERAVQGMDVKDKVVLCERGGGIGRVAKGEEVKNAGGAAMILINDEISGFSTIADAHVLP 453

Query: 518 ASHVSHKAALKIKAYINSTTYPTATILFKGTVIGDDNFSPAIASFSSRGPSVASPGILKP 577
           A+HVS  A L+IKAYINST  P ATILFKGTVIGD   SPA+ SFSSRGP++ASPGILKP
Sbjct: 454 ATHVSFAAGLQIKAYINSTKTPMATILFKGTVIGDP-LSPAVTSFSSRGPNLASPGILKP 513

Query: 578 DITGPGVSILAAWPFPLDKNSNTKSTFNIISGTSMSCPHLSGIAALIKSSHPDWSPAAIK 637
           DI GPGVSILAAWPFPLD  +NTKSTFN++SGTSM+CPHLSGIAAL+KSSHP WSPAAIK
Sbjct: 514 DIIGPGVSILAAWPFPLDNTTNTKSTFNLVSGTSMACPHLSGIAALLKSSHPYWSPAAIK 573

Query: 638 SAIMTTADITNLEGQPIVDENLQPADLFATGAGHVNPSKAADPGLVYDIQPDDYIPYLCG 697
           SAIMTTADI N+EG PIVDE  QPADLF  GAGHVNPS+A DPGL+YDIQPDDYIPYLCG
Sbjct: 574 SAIMTTADIFNMEGSPIVDEKHQPADLFTIGAGHVNPSRANDPGLIYDIQPDDYIPYLCG 633

Query: 698 LGYKSNEVATIARKPINCLAKPSIPEGDLNYPSFTVVLGPPQTFTRTVTNVGCGREVYTA 757
           LGYK  +V+ IA + I C  K SIPEG LNYPSF+V LG  QTFTRTVTNVG    VY A
Sbjct: 634 LGYKEEQVSIIAHRRIKCSEKLSIPEGQLNYPSFSVTLGASQTFTRTVTNVGEANSVYAA 693

Query: 758 VVEAPPSISVTIRPSKIFFSKINEKVTYSVTFKRIGSISPSTEFGKGYLKWVSDKHVVRS 807
            +  PP ++VT+ P +++FS++N+KVTYSVTF   GS   ++EF +GY+ W S KH+VRS
Sbjct: 694 TIVPPPGVAVTVEPYRLYFSQVNQKVTYSVTFSPTGSSGKTSEFAQGYILWSSAKHLVRS 753

BLAST of Cp4.1LG05g03970.1 vs. TrEMBL
Match: B9RBY5_RICCO (Cucumisin, putative OS=Ricinus communis GN=RCOM_1682320 PE=4 SV=1)

HSP 1 Score: 1021.1 bits (2639), Expect = 7.1e-295
Identity = 522/789 (66.16%), Positives = 617/789 (78.20%), Query Frame = 1

Query: 36  MVAFPSLFLLLLLNFHYVGALVTELPLINLQTYIVHVEKPETTDDLESWHRSFLPXFILP 95
           M   P +FL+ L NF+ + A   E                 TT+ +E             
Sbjct: 10  MTVVPFIFLIFLFNFYPLIAQSAE----------------HTTETIEKK----------- 69

Query: 96  LINLQTYIVHVEKPE-----TTDDLESWHRSFLPSSSS---------LLYSYRNVMSGFA 155
             NLQTYIVHV +PE       +DL++WH+SFL  S++         +LYSY+N++SGF+
Sbjct: 70  --NLQTYIVHVNQPEGRTFSQPEDLKNWHKSFLSFSTASSEEEQQQRMLYSYQNIISGFS 129

Query: 156 ARLSEEQVKAMEEKDGFVSARRERILQLHTTHTPDFLGLNRQFGFWKDSNFGKGVIIGVL 215
           ARL++E+VKAMEE  GFVSA  ER L+L TTHTP FLGL++Q G WKDS+FGKGVIIG+L
Sbjct: 130 ARLTQEEVKAMEEITGFVSACLERKLRLQTTHTPSFLGLHQQMGLWKDSDFGKGVIIGIL 189

Query: 216 DGGIAPSHPSFDDVGMPPPPPKWKGRCEFNFSACNNKLIGARSFNLATKVLKGETMMDDS 275
           DGG+ PSHPSF D GMP PP KWKGRCEFN S CNNKLIGAR+FNLA K +KG     + 
Sbjct: 190 DGGVYPSHPSFSDEGMPLPPAKWKGRCEFNASECNNKLIGARTFNLAAKTMKGAPT--EP 249

Query: 276 PIDEDGHGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGE---DCPDTD 335
           PID DGHGTHTASTAAG FV  ++ LGNAKGTAVGMAP AHLAIYKVCFG+   DCP++D
Sbjct: 250 PIDVDGHGTHTASTAAGGFVYNSDVLGNAKGTAVGMAPFAHLAIYKVCFGDPNDDCPESD 309

Query: 336 ILAALDAAIEDGVDVLSLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATL 395
           +LA LDAA++DGVDVLSLSLG  S+PFFQD +AIG+FAAIQKGIFVSCSA NSGP K+TL
Sbjct: 310 VLAGLDAAVDDGVDVLSLSLGDVSMPFFQDNIAIGSFAAIQKGIFVSCSAGNSGPSKSTL 369

Query: 396 SNEAPWILTVAASTIDRRIKAAAKLGNGEEFDGESLFQPSDFPPTFLPLVYAGEKNQT-A 455
           SNEAPWILTV ASTIDRRI A AKLGNGEE DGES+ QPS+FP T LP+VYAG  ++  +
Sbjct: 370 SNEAPWILTVGASTIDRRIVAIAKLGNGEELDGESVSQPSNFPTTLLPIVYAGMNSKPDS 429

Query: 456 ALCGEGSLKDIDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADAH 515
           A CGEG+L+ ++VK K+V+CERGGGI RIAKG EVKNAGGAAMIL+N + +GFST ADAH
Sbjct: 430 AFCGEGALEGMNVKDKVVMCERGGGIGRIAKGDEVKNAGGAAMILVNDETNGFSTIADAH 489

Query: 516 VLPASHVSHKAALKIKAYINSTTYPTATILFKGTVIGDDNFSPAIASFSSRGPSVASPGI 575
           VLPA+HVS  A LKIKAYINST  P ATILFKGTVIGD + SPA+ SFSSRGPS+ASPGI
Sbjct: 490 VLPATHVSFAAGLKIKAYINSTKTPMATILFKGTVIGDSS-SPAVTSFSSRGPSLASPGI 549

Query: 576 LKPDITGPGVSILAAWPFPLDKNSNTKSTFNIISGTSMSCPHLSGIAALIKSSHPDWSPA 635
           LKPDI GPGVSILAAWPFPLD N+NTK TFNI+SGTSMSCPHLSGIAAL+KSSHP WSPA
Sbjct: 550 LKPDIIGPGVSILAAWPFPLDNNTNTKLTFNIMSGTSMSCPHLSGIAALLKSSHPYWSPA 609

Query: 636 AIKSAIMTTADITNLEGQPIVDENLQPADLFATGAGHVNPSKAADPGLVYDIQPDDYIPY 695
           AIKSAI+TTADI N+EG+PIVDE  QPAD FATGAGHVNPS+A DPGLVYDIQPDDYIPY
Sbjct: 610 AIKSAIVTTADILNMEGKPIVDETHQPADFFATGAGHVNPSRANDPGLVYDIQPDDYIPY 669

Query: 696 LCGLGYKSNEVATIARKPINCLAKPSIPEGDLNYPSFTVVLGPPQTFTRTVTNVGCGREV 755
           LCGL Y   +V+ IA +PI+C    +I EG LNYPSF+V LGPPQTF RTVTNVG    V
Sbjct: 670 LCGLNYTDEQVSIIAHRPISCSTIQTIAEGQLNYPSFSVTLGPPQTFIRTVTNVGYANSV 729

Query: 756 YTAVVEAPPSISVTIRPSKIFFSKINEKVTYSVTFKRIGSISPSTEFGKGYLKWVSDKHV 807
           + A + +PP ++V+++PS+++FSK+N+K TYS+TF   G  + ++EFG+GY+ WVSDK+ 
Sbjct: 730 FAATITSPPGVAVSVKPSRLYFSKLNQKATYSITFSHTGYGAKTSEFGQGYITWVSDKYF 766

BLAST of Cp4.1LG05g03970.1 vs. TAIR10
Match: AT2G05920.1 (AT2G05920.1 Subtilase family protein)

HSP 1 Score: 613.6 bits (1581), Expect = 1.7e-175
Identity = 343/730 (46.99%), Positives = 460/730 (63.01%), Query Frame = 1

Query: 100 QTYIV---HVEKPETTDDLESWHRSFLPSSSSLLYSYRNVMSGFAARL-SEEQVKAMEEK 159
           +TYI+   H +KPE+      W+ S L S SSLLY+Y     GF+A L S E    +   
Sbjct: 28  KTYIIRVNHSDKPESFLTHHDWYTSQLNSESSLLYTYTTSFHGFSAYLDSTEADSLLSSS 87

Query: 160 DGFVSARRERILQLHTTHTPDFLGLNRQFGFWKDSNFGKGVIIGVLDGGIAPSHPSFDDV 219
           +  +    + +  LHTT TP+FLGLN +FG     +   GVIIGVLD G+ P   SFDD 
Sbjct: 88  NSILDIFEDPLYTLHTTRTPEFLGLNSEFGVHDLGSSSNGVIIGVLDTGVWPESRSFDDT 147

Query: 220 GMPPPPPKWKGRCE----FNFSACNNKLIGARSFNLATKVLKGETMMDD----SPIDEDG 279
            MP  P KWKG CE    F+   CN KLIGARSF+   ++  G          SP D DG
Sbjct: 148 DMPEIPSKWKGECESGSDFDSKLCNKKLIGARSFSKGFQMASGGGFSSKRESVSPRDVDG 207

Query: 280 HGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGEDCPDTDILAALDAAI 339
           HGTHT++TAAG+ V+ A  LG A GTA GMA  A +A YKVC+   C  +DILAA+D AI
Sbjct: 208 HGTHTSTTAAGSAVRNASFLGYAAGTARGMATRARVATYKVCWSTGCFGSDILAAMDRAI 267

Query: 340 EDGVDVLSLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATLSNEAPWILT 399
            DGVDVLSLSLG  S P+++D +AIGAF+A+++G+FVSCSA NSGP +A+++N APW++T
Sbjct: 268 LDGVDVLSLSLGGGSAPYYRDTIAIGAFSAMERGVFVSCSAGNSGPTRASVANVAPWVMT 327

Query: 400 VAASTIDRRIKAAAKLGNGEEFDGESLFQPSDFPPTFLPLVYAGEKNQTAALCGEGSLKD 459
           V A T+DR   A A LGNG+   G SL+         L LVY    + ++ LC  GSL  
Sbjct: 328 VGAGTLDRDFPAFANLGNGKRLTGVSLYSGVGMGTKPLELVYNKGNSSSSNLCLPGSLDS 387

Query: 460 IDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADAHVLPASHVSHK 519
             V+GKIVVC+RG   AR+ KG  V++AGG  MI+ N    G    AD+H+LPA  V  K
Sbjct: 388 SIVRGKIVVCDRGVN-ARVEKGAVVRDAGGLGMIMANTAASGEELVADSHLLPAIAVGKK 447

Query: 520 AALKIKAYINSTTYPTATILFKGTVIGDDNFSPAIASFSSRGPSVASPGILKPDITGPGV 579
               ++ Y+ S + PTA ++FKGTV+ D   SP +A+FSSRGP+  +P ILKPD+ GPGV
Sbjct: 448 TGDLLREYVKSDSKPTALLVFKGTVL-DVKPSPVVAAFSSRGPNTVTPEILKPDVIGPGV 507

Query: 580 SILAAW-----PFPLDKNSNTKSTFNIISGTSMSCPHLSGIAALIKSSHPDWSPAAIKSA 639
           +ILA W     P  LDK+S  ++ FNI+SGTSMSCPH+SG+A L+K++HP+WSP+AIKSA
Sbjct: 508 NILAGWSDAIGPTGLDKDSR-RTQFNIMSGTSMSCPHISGLAGLLKAAHPEWSPSAIKSA 567

Query: 640 IMTTA---DITNLEGQPIVDENLQPADLFATGAGHVNPSKAADPGLVYDIQPDDYIPYLC 699
           +MTTA   D TN       D +L  ++ +A G+GHV+P KA  PGLVYDI  ++YI +LC
Sbjct: 568 LMTTAYVLDNTNAPLHDAADNSL--SNPYAHGSGHVDPQKALSPGLVYDISTEEYIRFLC 627

Query: 700 GLGYKSNEVATIARKP-INCLAKPSIPEGDLNYPSFTVVLGPPQT--FTRTVTNVGCGRE 759
            L Y  + +  I ++P +NC  K S P G LNYPSF+V+ G  +   +TR VTNVG    
Sbjct: 628 SLDYTVDHIVAIVKRPSVNCSKKFSDP-GQLNYPSFSVLFGGKRVVRYTREVTNVGAASS 687

Query: 760 VYTAVVEAPPSISVTIRPSKIFFSKINEKVTYSVTFKRIGSISPSTEFGKGYLKWVSDKH 807
           VY   V   PS+ ++++PSK+ F  + EK  Y+VTF     +S + +   G + W + +H
Sbjct: 688 VYKVTVNGAPSVGISVKPSKLSFKSVGEKKRYTVTFVSKKGVSMTNKAEFGSITWSNPQH 747

BLAST of Cp4.1LG05g03970.1 vs. TAIR10
Match: AT3G14067.1 (AT3G14067.1 Subtilase family protein)

HSP 1 Score: 594.3 bits (1531), Expect = 1.1e-169
Identity = 346/747 (46.32%), Positives = 459/747 (61.45%), Query Frame = 1

Query: 99  LQTYIVHVE---KPETTDDLESWHRSFL------PSSSSLLYSYRNVMSGFAARLSEEQV 158
           L++YIVHV+   KP       +WH S L      P  ++LLYSY   + GF+ARLS  Q 
Sbjct: 30  LESYIVHVQRSHKPSLFSSHNNWHVSLLRSLPSSPQPATLLYSYSRAVHGFSARLSPIQT 89

Query: 159 KAMEEKDGFVSARRERILQLHTTHTPDFLGLNRQFGFWKDSNFGKGVIIGVLDGGIAPSH 218
            A+      +S   ++  ++HTTHTP FLG ++  G W +SN+G+ VI+GVLD GI P H
Sbjct: 90  AALRRHPSVISVIPDQAREIHTTHTPAFLGFSQNSGLWSNSNYGEDVIVGVLDTGIWPEH 149

Query: 219 PSFDDVGMPPPPPKWKGRCE----FNFSACNNKLIGARSFNLATKVLKGETMM-----DD 278
           PSF D G+ P P  WKG CE    F  S+CN KLIGAR+F       +  T         
Sbjct: 150 PSFSDSGLGPIPSTWKGECEIGPDFPASSCNRKLIGARAFYRGYLTQRNGTKKHAAKESR 209

Query: 279 SPIDEDGHGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGEDCPDTDIL 338
           SP D +GHGTHTASTAAG+ V  A     A+GTA GMA  A +A YK+C+   C D+DIL
Sbjct: 210 SPRDTEGHGTHTASTAAGSVVANASLYQYARGTATGMASKARIAAYKICWTGGCYDSDIL 269

Query: 339 AALDAAIEDGVDVLSLSLG-SPSVP-FFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATL 398
           AA+D A+ DGV V+SLS+G S S P +  D +AIGAF A + GI VSCSA NSGP   T 
Sbjct: 270 AAMDQAVADGVHVISLSVGASGSAPEYHTDSIAIGAFGATRHGIVVSCSAGNSGPNPETA 329

Query: 399 SNEAPWILTVAASTIDRRIKAAAKLGNGEEFDGESLFQPSDFPPTFLPLVYAGEKNQTAA 458
           +N APWILTV AST+DR   A A  G+G+ F G SL+     P + L LVY+G+    + 
Sbjct: 330 TNIAPWILTVGASTVDREFAANAITGDGKVFTGTSLYAGESLPDSQLSLVYSGDCG--SR 389

Query: 459 LCGEGSLKDIDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADAHV 518
           LC  G L    V+GKIV+C+RGG  AR+ KG+ VK AGGA MIL N  + G    AD+H+
Sbjct: 390 LCYPGKLNSSLVEGKIVLCDRGGN-ARVEKGSAVKLAGGAGMILANTAESGEELTADSHL 449

Query: 519 LPASHVSHKAALKIKAYINSTTYPTATILFKGTVIGDDNFSPAIASFSSRGPSVASPGIL 578
           +PA+ V  KA  +I+ YI ++  PTA I F GT+IG    SP +A+FSSRGP+  +P IL
Sbjct: 450 VPATMVGAKAGDQIRDYIKTSDSPTAKISFLGTLIGPSPPSPRVAAFSSRGPNHLTPVIL 509

Query: 579 KPDITGPGVSILAAW-----PFPLDKNSNTKSTFNIISGTSMSCPHLSGIAALIKSSHPD 638
           KPD+  PGV+ILA W     P  LD +   +  FNIISGTSMSCPH+SG+AAL++ +HPD
Sbjct: 510 KPDVIAPGVNILAGWTGMVGPTDLDIDPR-RVQFNIISGTSMSCPHVSGLAALLRKAHPD 569

Query: 639 WSPAAIKSAIMTTADITNLEGQPIVD-ENLQPADLFATGAGHVNPSKAADPGLVYDIQPD 698
           WSPAAIKSA++TTA      G+PI D    + ++ F  GAGHV+P+KA +PGLVYDI+  
Sbjct: 570 WSPAAIKSALVTTAYDVENSGEPIEDLATGKSSNSFIHGAGHVDPNKALNPGLVYDIEVK 629

Query: 699 DYIPYLCGLGYKSNEVATIARKPI---NCLAKPSIPEGDLNYPSFTVVL---GPPQTFTR 758
           +Y+ +LC +GY+   +    + P     C        GDLNYPSF+VV    G    + R
Sbjct: 630 EYVAFLCAVGYEFPGILVFLQDPTLYDACETSKLRTAGDLNYPSFSVVFASTGEVVKYKR 689

Query: 759 TVTNVGCGRE-VYTAVVEAPPSISVTIRPSKIFFSKINEKVTYSVTFKR------IGSIS 807
            V NVG   + VY   V++P ++ + + PSK+ FSK    + Y VTFK       +GS+ 
Sbjct: 690 VVKNVGSNVDAVYEVGVKSPANVEIDVSPSKLAFSKEKSVLEYEVTFKSVVLGGGVGSV- 749

BLAST of Cp4.1LG05g03970.1 vs. TAIR10
Match: AT5G67360.1 (AT5G67360.1 Subtilase family protein)

HSP 1 Score: 585.5 bits (1508), Expect = 5.0e-167
Identity = 337/730 (46.16%), Positives = 452/730 (61.92%), Query Frame = 1

Query: 101 TYIVHVEK---PETTDDLESWHRSFLPS---SSSLLYSYRNVMSGFAARLSEEQVKAMEE 160
           TYIVH+ K   P + D   +W+ S L S   S+ LLY+Y N + GF+ RL++E+  ++  
Sbjct: 31  TYIVHMAKSQMPSSFDLHSNWYDSSLRSISDSAELLYTYENAIHGFSTRLTQEEADSLMT 90

Query: 161 KDGFVSARRERILQLHTTHTPDFLGLNRQFG-FWKDSNFGKGVIIGVLDGGIAPSHPSFD 220
           + G +S   E   +LHTT TP FLGL+      + ++     V++GVLD G+ P   S+ 
Sbjct: 91  QPGVISVLPEHRYELHTTRTPLFLGLDEHTADLFPEAGSYSDVVVGVLDTGVWPESKSYS 150

Query: 221 DVGMPPPPPKWKGRCE----FNFSACNNKLIGARSFNLATKVLKG---ETMMDDSPIDED 280
           D G  P P  WKG CE    F  S CN KLIGAR F    +   G   E+    SP D+D
Sbjct: 151 DEGFGPIPSSWKGGCEAGTNFTASLCNRKLIGARFFARGYESTMGPIDESKESRSPRDDD 210

Query: 281 GHGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGEDCPDTDILAALDAA 340
           GHGTHT+STAAG+ V+GA  LG A GTA GMAP A +A+YKVC+   C  +DILAA+D A
Sbjct: 211 GHGTHTSSTAAGSVVEGASLLGYASGTARGMAPRARVAVYKVCWLGGCFSSDILAAIDKA 270

Query: 341 IEDGVDVLSLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATLSNEAPWIL 400
           I D V+VLS+SLG     +++D VAIGAFAA+++GI VSCSA N+GP  ++LSN APWI 
Sbjct: 271 IADNVNVLSMSLGGGMSDYYRDGVAIGAFAAMERGILVSCSAGNAGPSSSSLSNVAPWIT 330

Query: 401 TVAASTIDRRIKAAAKLGNGEEFDGESLFQPSDFPPTFLPLVYAGEKNQ--TAALCGEGS 460
           TV A T+DR   A A LGNG+ F G SLF+    P   LP +YAG  +      LC  G+
Sbjct: 331 TVGAGTLDRDFPALAILGNGKNFTGVSLFKGEALPDKLLPFIYAGNASNATNGNLCMTGT 390

Query: 461 LKDIDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADAHVLPASHV 520
           L    VKGKIV+C+RG   AR+ KG  VK AGG  MIL N   +G    ADAH+LPA+ V
Sbjct: 391 LIPEKVKGKIVMCDRGIN-ARVQKGDVVKAAGGVGMILANTAANGEELVADAHLLPATTV 450

Query: 521 SHKAALKIKAYINSTTYPTATILFKGTVIGDDNFSPAIASFSSRGPSVASPGILKPDITG 580
             KA   I+ Y+ +   PTA+I   GTV+G    SP +A+FSSRGP+  +P ILKPD+  
Sbjct: 451 GEKAGDIIRHYVTTDPNPTASISILGTVVGVKP-SPVVAAFSSRGPNSITPNILKPDLIA 510

Query: 581 PGVSILAAW-----PFPLDKNSNTKSTFNIISGTSMSCPHLSGIAALIKSSHPDWSPAAI 640
           PGV+ILAAW     P  L  +S  +  FNIISGTSMSCPH+SG+AAL+KS HP+WSPAAI
Sbjct: 511 PGVNILAAWTGAAGPTGLASDSR-RVEFNIISGTSMSCPHVSGLAALLKSVHPEWSPAAI 570

Query: 641 KSAIMTTADITNLEGQPIVD-ENLQPADLFATGAGHVNPSKAADPGLVYDIQPDDYIPYL 700
           +SA+MTTA  T  +G+P++D    +P+  F  GAGHV+P+ A +PGL+YD+  +DY+ +L
Sbjct: 571 RSALMTTAYKTYKDGKPLLDIATGKPSTPFDHGAGHVSPTTATNPGLIYDLTTEDYLGFL 630

Query: 701 CGLGYKSNEVATIARKPINCLAKPSIPEGDLNYPSFTVVLG--PPQTFTRTVTNVGCGRE 760
           C L Y S ++ +++R+   C    S    DLNYPSF V +       +TRTVT+VG    
Sbjct: 631 CALNYTSPQIRSVSRRNYTCDPSKSYSVADLNYPSFAVNVDGVGAYKYTRTVTSVGGAGT 690

Query: 761 VYTAVVEAPPSISVTIRPSKIFFSKINEKVTYSVTFKRIGSISPSTEFGKGYLKWVSDKH 807
               V      + +++ P+ + F + NEK +Y+VTF  + S  PS     G ++W   KH
Sbjct: 691 YSVKVTSETTGVKISVEPAVLNFKEANEKKSYTVTF-TVDSSKPSGSNSFGSIEWSDGKH 750

BLAST of Cp4.1LG05g03970.1 vs. TAIR10
Match: AT1G04110.1 (AT1G04110.1 Subtilase family protein)

HSP 1 Score: 583.9 bits (1504), Expect = 1.5e-166
Identity = 332/753 (44.09%), Positives = 457/753 (60.69%), Query Frame = 1

Query: 96  LINLQTYIVHV----EKPETTDDLESWHRSFLPS------------SSSLLYSYRNVMSG 155
           ++  QTYIV +    E  +T      WH SFL              SS LLYSY + + G
Sbjct: 22  ILQKQTYIVQLHPNSETAKTFASKFDWHLSFLQEAVLGVEEEEEEPSSRLLYSYGSAIEG 81

Query: 156 FAARLSEEQVKAMEEKDGFVSARRERILQLHTTHTPDFLGLNR--QFGFWKDSNFGKGVI 215
           FAA+L+E + + +      V+ R + +LQ+ TT++  FLGL+     G W  S FG+G I
Sbjct: 82  FAAQLTESEAEILRYSPEVVAVRPDHVLQVQTTYSYKFLGLDGFGNSGVWSKSRFGQGTI 141

Query: 216 IGVLDGGIAPSHPSFDDVGMPPPPPKWKGRCE----FNFSACNNKLIGARSFNLATKVLK 275
           IGVLD G+ P  PSFDD GMP  P KWKG C+    F+ S+CN KLIGAR F    +V  
Sbjct: 142 IGVLDTGVWPESPSFDDTGMPSIPRKWKGICQEGESFSSSSCNRKLIGARFFIRGHRVAN 201

Query: 276 GETMMDDSPI------DEDGHGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYK 335
                 + P       D  GHGTHTAST  G+ V  A  LGN  G A GMAP AH+A+YK
Sbjct: 202 SPEESPNMPREYISARDSTGHGTHTASTVGGSSVSMANVLGNGAGVARGMAPGAHIAVYK 261

Query: 336 VCFGEDCPDTDILAALDAAIEDGVDVLSLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCS 395
           VC+   C  +DILAA+D AI+D VDVLSLSLG   +P + D +AIG F A+++GI V C+
Sbjct: 262 VCWFNGCYSSDILAAIDVAIQDKVDVLSLSLGGFPIPLYDDTIAIGTFRAMERGISVICA 321

Query: 396 AANSGPFKATLSNEAPWILTVAASTIDRRIKAAAKLGNGEEFDGESLFQPSDFPPT--FL 455
           A N+GP +++++N APW+ T+ A T+DRR  A  +L NG+   GESL+           +
Sbjct: 322 AGNNGPIESSVANTAPWVSTIGAGTLDRRFPAVVRLANGKLLYGESLYPGKGIKNAGREV 381

Query: 456 PLVYAGEKNQTAALCGEGSLKDIDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQ 515
            ++Y    ++ +  C  GSL   +++GK+V+C+RG    R  KG  VK AGG AMIL N 
Sbjct: 382 EVIYVTGGDKGSEFCLRGSLPREEIRGKMVICDRGVN-GRSEKGEAVKEAGGVAMILANT 441

Query: 516 QQDGFSTEADAHVLPASHVSHKAALKIKAYINSTTYPTATILFKGTVIGDDNFSPAIASF 575
           + +      D H+LPA+ + +  ++ +KAY+N+T  P A I+F GTVIG    +P +A F
Sbjct: 442 EINQEEDSIDVHLLPATLIGYTESVLLKAYVNATVKPKARIIFGGTVIGRSR-APEVAQF 501

Query: 576 SSRGPSVASPGILKPDITGPGVSILAAWPFPLDKN----SNTKSTFNIISGTSMSCPHLS 635
           S+RGPS+A+P ILKPD+  PGV+I+AAWP  L        + +  F ++SGTSMSCPH+S
Sbjct: 502 SARGPSLANPSILKPDMIAPGVNIIAAWPQNLGPTGLPYDSRRVNFTVMSGTSMSCPHVS 561

Query: 636 GIAALIKSSHPDWSPAAIKSAIMTTADITNLEGQPIVDENLQPADLFATGAGHVNPSKAA 695
           GI ALI+S++P+WSPAAIKSA+MTTAD+ + +G+ I D N +PA +FA GAGHVNP KA 
Sbjct: 562 GITALIRSAYPNWSPAAIKSALMTTADLYDRQGKAIKDGN-KPAGVFAIGAGHVNPQKAI 621

Query: 696 DPGLVYDIQPDDYIPYLCGLGYKSNEVATIARKPINC---LAKPSIPEGDLNYPSFTVVL 755
           +PGLVY+IQP DYI YLC LG+  +++  I  K ++C   L K   P   LNYPS  V+ 
Sbjct: 622 NPGLVYNIQPVDYITYLCTLGFTRSDILAITHKNVSCNGILRKN--PGFSLNYPSIAVIF 681

Query: 756 GPPQT---FTRTVTNVGCGREVYTAVVEAPPSISVTIRPSKIFFSKINEKVTYSVTF--K 804
              +T    TR VTNVG    +Y+  V+AP  I V + P ++ F  +++ ++Y V F  K
Sbjct: 682 KRGKTTEMITRRVTNVGSPNSIYSVNVKAPEGIKVIVNPKRLVFKHVDQTLSYRVWFVLK 741

BLAST of Cp4.1LG05g03970.1 vs. TAIR10
Match: AT3G14240.1 (AT3G14240.1 Subtilase family protein)

HSP 1 Score: 571.6 bits (1472), Expect = 7.5e-163
Identity = 336/746 (45.04%), Positives = 453/746 (60.72%), Query Frame = 1

Query: 98  NLQTYIVHVE---KPETTDDLESWHRSFLPSSSS----LLYSYRNVMSGFAARLSEEQVK 157
           N  TYIVHV+   KP        W+ S L S +S    ++++Y  V  GF+ARL+ +   
Sbjct: 24  NSLTYIVHVDHEAKPSIFPTHFHWYTSSLASLTSSPPSIIHTYDTVFHGFSARLTSQDAS 83

Query: 158 AMEEKDGFVSARRERILQLHTTHTPDFLGLNR--QFGFWKDSNFGKGVIIGVLDGGIAPS 217
            + +    +S   E++  LHTT +P+FLGL    + G  ++S+FG  ++IGV+D G+ P 
Sbjct: 84  QLLDHPHVISVIPEQVRHLHTTRSPEFLGLRSTDKAGLLEESDFGSDLVIGVIDTGVWPE 143

Query: 218 HPSFDDVGMPPPPPKWKGRC----EFNFSACNNKLIGARSF---NLATKVLKGETMMDDS 277
            PSFDD G+ P P KWKG+C    +F  SACN KL+GAR F     AT     ET    S
Sbjct: 144 RPSFDDRGLGPVPIKWKGQCIASQDFPESACNRKLVGARFFCGGYEATNGKMNETTEFRS 203

Query: 278 PIDEDGHGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGEDCPDTDILA 337
           P D DGHGTHTAS +AG +V  A  LG A G A GMAP A LA YKVC+   C D+DILA
Sbjct: 204 PRDSDGHGTHTASISAGRYVFPASTLGYAHGVAAGMAPKARLAAYKVCWNSGCYDSDILA 263

Query: 338 ALDAAIEDGVDVLSLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATLSNE 397
           A D A+ DGVDV+SLS+G   VP++ D +AIGAF AI +GIFVS SA N GP   T++N 
Sbjct: 264 AFDTAVADGVDVISLSVGGVVVPYYLDAIAIGAFGAIDRGIFVSASAGNGGPGALTVTNV 323

Query: 398 APWILTVAASTIDRRIKAAAKLGNGEEFDGESLF-QPSDFPPTFLPLVYAGE----KNQT 457
           APW+ TV A TIDR   A  KLGNG+   G S++  P   P    PLVY G        +
Sbjct: 324 APWMTTVGAGTIDRDFPANVKLGNGKMISGVSVYGGPGLDPGRMYPLVYGGSLLGGDGYS 383

Query: 458 AALCGEGSLKDIDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADA 517
           ++LC EGSL    VKGKIV+C+RG   +R  KG  V+  GG  MI+ N   DG    AD 
Sbjct: 384 SSLCLEGSLDPNLVKGKIVLCDRGIN-SRATKGEIVRKNGGLGMIIANGVFDGEGLVADC 443

Query: 518 HVLPASHVSHKAALKIKAYIN------STTYPTATILFKGTVIGDDNFSPAIASFSSRGP 577
           HVLPA+ V      +I+ YI+      S+ +PTATI+FKGT +G    +P +ASFS+RGP
Sbjct: 444 HVLPATSVGASGGDEIRRYISESSKSRSSKHPTATIVFKGTRLG-IRPAPVVASFSARGP 503

Query: 578 SVASPGILKPDITGPGVSILAAWPFPLD----KNSNTKSTFNIISGTSMSCPHLSGIAAL 637
           +  +P ILKPD+  PG++ILAAWP  +      + N ++ FNI+SGTSM+CPH+SG+AAL
Sbjct: 504 NPETPEILKPDVIAPGLNILAAWPDRIGPSGVTSDNRRTEFNILSGTSMACPHVSGLAAL 563

Query: 638 IKSSHPDWSPAAIKSAIMTTADITNLEGQPIVDENL-QPADLFATGAGHVNPSKAADPGL 697
           +K++HPDWSPAAI+SA++TTA   +  G+P++DE+    + +   G+GHV+P+KA DPGL
Sbjct: 564 LKAAHPDWSPAAIRSALITTAYTVDNSGEPMMDESTGNTSSVMDYGSGHVHPTKAMDPGL 623

Query: 698 VYDIQPDDYIPYLCGLGYKSNEVATIARKPINC-LAKPSIPEGDLNYPSFTVVLGP---- 757
           VYDI   DYI +LC   Y    + TI R+  +C  A+ +   G+LNYPSF+VV       
Sbjct: 624 VYDITSYDYINFLCNSNYTRTNIVTITRRQADCDGARRAGHVGNLNYPSFSVVFQQYGES 683

Query: 758 --PQTFTRTVTNVGCGREVYTAVVEAPPSISVTIRPSKIFFSKINEKVTYSVTFKRIG-S 803
                F RTVTNVG    VY   +  P   +VT+ P K+ F ++ +K+++ V  K     
Sbjct: 684 KMSTHFIRTVTNVGDSDSVYEIKIRPPRGTTVTVEPEKLSFRRVGQKLSFVVRVKTTEVK 743

BLAST of Cp4.1LG05g03970.1 vs. NCBI nr
Match: gi|449459724|ref|XP_004147596.1| (PREDICTED: subtilisin-like protease SBT1.2 [Cucumis sativus])

HSP 1 Score: 1219.5 bits (3154), Expect = 0.0e+00
Identity = 616/772 (79.79%), Positives = 665/772 (86.14%), Query Frame = 1

Query: 36  MVAFPSLFLLLLLNFHYVGALVTELPLINLQTYIVHVEKPETTDDLESWHRSFLPXFILP 95
           MV  PSLFLLLLLNFH   A VTELP  NL TYIVHV+KPE  DDLESWHRSFLP     
Sbjct: 1   MVLLPSLFLLLLLNFHVYEAQVTELPFSNLHTYIVHVKKPEVVDDLESWHRSFLP----- 60

Query: 96  LINLQTYIVHVEKPETTDDLESWHRSFLPSSSSLLYSYRNVMSGFAARLSEEQVKAMEEK 155
                T + + E+  T                 LLYSYRNVMSGF+ARL+EE VKAMEEK
Sbjct: 61  -----TSLENSEEQPT-----------------LLYSYRNVMSGFSARLTEEHVKAMEEK 120

Query: 156 DGFVSARRERILQLHTTHTPDFLGLNRQFGFWKDSNFGKGVIIGVLDGGIAPSHPSFDDV 215
           DGFVSARRE I+ LHTTH+P+FLGLNRQFGFWKDSNFGKGVIIGVLDGGI PSHPSF D 
Sbjct: 121 DGFVSARRETIVHLHTTHSPNFLGLNRQFGFWKDSNFGKGVIIGVLDGGITPSHPSFVDA 180

Query: 216 GMPPPPPKWKGRCEFNFSACNNKLIGARSFNLATKVLKGE-TMMDDSPIDEDGHGTHTAS 275
           GMP PP KWKGRCEFNFSACNNKLIGARS NLA++ LKG+ T +DDSPIDEDGHGTHTAS
Sbjct: 181 GMPQPPAKWKGRCEFNFSACNNKLIGARSLNLASQALKGKITTLDDSPIDEDGHGTHTAS 240

Query: 276 TAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGEDCPDTDILAALDAAIEDGVDVL 335
           TAAG FV GAEALGNA GTAVGMAPLAHLAIYKVCFGE C + DILA LDAA+EDGVDVL
Sbjct: 241 TAAGTFVDGAEALGNAFGTAVGMAPLAHLAIYKVCFGESCSNVDILAGLDAAVEDGVDVL 300

Query: 336 SLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATLSNEAPWILTVAASTID 395
           S+SLG P VPFF D+ AIGAFAAIQKGIFVSCSAANSGPF ATLSNEAPWILTVAASTID
Sbjct: 301 SISLGGPPVPFFADITAIGAFAAIQKGIFVSCSAANSGPFNATLSNEAPWILTVAASTID 360

Query: 396 RRIKAAAKLGNGEEFDGESLFQPSDFPPTFLPLVYAGEKNQTAALCGEGSLKDIDVKGKI 455
           R+I A AKLGNGEEFDGESLFQP+DFP TFLPLV+ GEKN+T ALC EGSLK+IDVKGK+
Sbjct: 361 RKITATAKLGNGEEFDGESLFQPNDFPQTFLPLVFPGEKNETVALCAEGSLKNIDVKGKV 420

Query: 456 VVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADAHVLPASHVSHKAALKIKA 515
           VVC+RGGGIARIAKG EVKNAGGAAMILLN + DGF+TEADAHVLPASHVSH AALKIKA
Sbjct: 421 VVCDRGGGIARIAKGVEVKNAGGAAMILLNAESDGFTTEADAHVLPASHVSHTAALKIKA 480

Query: 516 YINSTTYPTATILFKGTVIGDDNFSPAIASFSSRGPSVASPGILKPDITGPGVSILAAWP 575
           YINSTTYPTATI+FKGT IGDD FSPAIA+FSSRGPS+ASPGILKPDITGPGVSILAAWP
Sbjct: 481 YINSTTYPTATIVFKGTTIGDD-FSPAIAAFSSRGPSLASPGILKPDITGPGVSILAAWP 540

Query: 576 FPLDKNSNTKSTFNIISGTSMSCPHLSGIAALIKSSHPDWSPAAIKSAIMTTADITNLEG 635
           FPLD N+NTKSTFNI+SGTSMSCPHLSGIAALIKS+HPDWSPAAIKS+IMTTA+ITNLEG
Sbjct: 541 FPLDNNTNTKSTFNIVSGTSMSCPHLSGIAALIKSAHPDWSPAAIKSSIMTTANITNLEG 600

Query: 636 QPIVDENLQPADLFATGAGHVNPSKAADPGLVYDIQPDDYIPYLCGLGYKSNEVATIARK 695
            PIVD+ LQPADLFA GAGHVNPSKA DPGLVYDIQPDDYIPYLCGLGY +N+V+ IA K
Sbjct: 601 NPIVDQTLQPADLFAIGAGHVNPSKAVDPGLVYDIQPDDYIPYLCGLGYTNNQVSLIAHK 660

Query: 696 PINCLAKPSIPEGDLNYPSFTVVLGPPQTFTRTVTNVGCGREVYTAVVEAPPSISVTIRP 755
           PI+CL   SIPEG+LNYPSF V LG  QTF+RTVT VG GREVY  V+EAP  +SVT+RP
Sbjct: 661 PIDCLTTTSIPEGELNYPSFMVKLGQVQTFSRTVTYVGSGREVYNVVIEAPEGVSVTVRP 720

Query: 756 SKIFFSKINEKVTYSVTFKRIGSISPSTEFGKGYLKWVSDKHVVRSPISFKF 807
            K+ FS +N+K TYSVTFKRIGSISPSTEF +GYLKWVS KH+VRSPIS KF
Sbjct: 721 RKVIFSALNQKATYSVTFKRIGSISPSTEFAEGYLKWVSAKHLVRSPISVKF 744

BLAST of Cp4.1LG05g03970.1 vs. NCBI nr
Match: gi|659073656|ref|XP_008437181.1| (PREDICTED: subtilisin-like protease [Cucumis melo])

HSP 1 Score: 1219.5 bits (3154), Expect = 0.0e+00
Identity = 619/772 (80.18%), Positives = 662/772 (85.75%), Query Frame = 1

Query: 36  MVAFPSLFLLLLLNFHYVGALVTELPLINLQTYIVHVEKPETTDDLESWHRSFLPXFILP 95
           MV  PSLFLLLLLNFH   A VTELPL NL TYIVHV+KPE  DDLE WHRSFLP     
Sbjct: 1   MVLLPSLFLLLLLNFHGYEAQVTELPLSNLHTYIVHVKKPEVVDDLEIWHRSFLP----- 60

Query: 96  LINLQTYIVHVEKPETTDDLESWHRSFLPSSSSLLYSYRNVMSGFAARLSEEQVKAMEEK 155
                          T+ D E           +LLYSYRNVMSGF+ARL+EE VKAMEEK
Sbjct: 61  ---------------TSLDNEE-------EQPTLLYSYRNVMSGFSARLTEEHVKAMEEK 120

Query: 156 DGFVSARRERILQLHTTHTPDFLGLNRQFGFWKDSNFGKGVIIGVLDGGIAPSHPSFDDV 215
           DGFVSARRE I+ LHTTHTPDFLGLNRQFGFWKDSNFGKGVIIGVLDGGI P+HPSFDD 
Sbjct: 121 DGFVSARRETIVHLHTTHTPDFLGLNRQFGFWKDSNFGKGVIIGVLDGGITPNHPSFDDA 180

Query: 216 GMPPPPPKWKGRCEFNFSACNNKLIGARSFNLATKVLKGE-TMMDDSPIDEDGHGTHTAS 275
           GM  PP KWKGRCEFNFSACNNKLIGARS NLA++ LKG+ T +DDSPIDEDGHGTHTAS
Sbjct: 181 GMAQPPAKWKGRCEFNFSACNNKLIGARSMNLASQALKGKITTLDDSPIDEDGHGTHTAS 240

Query: 276 TAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGEDCPDTDILAALDAAIEDGVDVL 335
           TAAG FV GAEALGNA GTAVGMAPLAHLAIYKVCFGEDC D DILA LDAA+EDGVDVL
Sbjct: 241 TAAGTFVDGAEALGNAFGTAVGMAPLAHLAIYKVCFGEDCSDVDILAGLDAAVEDGVDVL 300

Query: 336 SLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATLSNEAPWILTVAASTID 395
           S+SLG PSVPFF D+ AIG+FAAIQKGIFVSCSAANSGPF ATLSNEAPWILTVAASTID
Sbjct: 301 SISLGGPSVPFFADITAIGSFAAIQKGIFVSCSAANSGPFNATLSNEAPWILTVAASTID 360

Query: 396 RRIKAAAKLGNGEEFDGESLFQPSDFPPTFLPLVYAGEKNQTAALCGEGSLKDIDVKGKI 455
           R+I A AKLGNGEEFDGESLFQP+DFP T LPLV+ GEKN+T ALC EGSLK+IDVKGK+
Sbjct: 361 RKITATAKLGNGEEFDGESLFQPNDFPQTLLPLVFPGEKNETVALCAEGSLKNIDVKGKV 420

Query: 456 VVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADAHVLPASHVSHKAALKIKA 515
           VVCERGGGIARIAKG EVKN GGAAMILLN + DGF+TE DAHVLPASHVSH AALKIKA
Sbjct: 421 VVCERGGGIARIAKGVEVKNGGGAAMILLNAESDGFTTEVDAHVLPASHVSHTAALKIKA 480

Query: 516 YINSTTYPTATILFKGTVIGDDNFSPAIASFSSRGPSVASPGILKPDITGPGVSILAAWP 575
           YINSTTYPTATILFKGT IGDD FSPAIASFSSRGPS+ASPGILKPDITGPGVSILAAWP
Sbjct: 481 YINSTTYPTATILFKGTTIGDD-FSPAIASFSSRGPSLASPGILKPDITGPGVSILAAWP 540

Query: 576 FPLDKNSNTKSTFNIISGTSMSCPHLSGIAALIKSSHPDWSPAAIKSAIMTTADITNLEG 635
           FPLD N+NTKSTFNIISGTSMSCPHLSGIAALIKS+HPDWSPAAIKS+IMTTA+ITNLEG
Sbjct: 541 FPLDNNTNTKSTFNIISGTSMSCPHLSGIAALIKSAHPDWSPAAIKSSIMTTANITNLEG 600

Query: 636 QPIVDENLQPADLFATGAGHVNPSKAADPGLVYDIQPDDYIPYLCGLGYKSNEVATIARK 695
            PI+DE LQPADLFA GAGHVNPSKA DPGLVYDIQPDDYIPYLCGLGY +N+V+ IA K
Sbjct: 601 NPILDETLQPADLFAIGAGHVNPSKAIDPGLVYDIQPDDYIPYLCGLGYTNNQVSLIAHK 660

Query: 696 PINCLAKPSIPEGDLNYPSFTVVLGPPQTFTRTVTNVGCGREVYTAVVEAPPSISVTIRP 755
           PI+CL   SIPEG+LNYPSF V LGP QTF+RTVT+VG GR VY  V+EAP  +SVT+RP
Sbjct: 661 PIDCLTTSSIPEGELNYPSFMVKLGPVQTFSRTVTSVGSGRVVYNVVIEAPEGVSVTVRP 720

Query: 756 SKIFFSKINEKVTYSVTFKRIGSISPSTEFGKGYLKWVSDKHVVRSPISFKF 807
            K+ FS +N+K TYSVTFKR GSISPS EF +GYLKWVS KHVVRSPIS KF
Sbjct: 721 RKLSFSALNQKATYSVTFKRSGSISPSIEFAEGYLKWVSAKHVVRSPISVKF 744

BLAST of Cp4.1LG05g03970.1 vs. NCBI nr
Match: gi|731379755|ref|XP_002275471.2| (PREDICTED: uncharacterized protein LOC100242816 [Vitis vinifera])

HSP 1 Score: 1049.3 bits (2712), Expect = 3.5e-303
Identity = 520/721 (72.12%), Positives = 600/721 (83.22%), Query Frame = 1

Query: 99   LQTYIVHVEKPETT-----DDLESWHRSFLPSSSS-------LLYSYRNVMSGFAARLSE 158
            LQTYIVHV++ E +     ++LESWHRSFLP +++       L+YSY+NV+SGFAARL+E
Sbjct: 767  LQTYIVHVKQLERSTTAQQENLESWHRSFLPVATATSDNQERLVYSYKNVISGFAARLTE 826

Query: 159  EQVKAMEEKDGFVSARRERILQLHTTHTPDFLGLNRQFGFWKDSNFGKGVIIGVLDGGIA 218
            E+V+AME  DGF+SA  E++L L TTH+PDFLGL+++ GFWK+SNFGKGVIIGVLD G+ 
Sbjct: 827  EEVRAMENMDGFISASPEKMLPLLTTHSPDFLGLHQEMGFWKESNFGKGVIIGVLDSGVL 886

Query: 219  PSHPSFDDVGMPPPPPKWKGRCEFNFSACNNKLIGARSFNLATKVLKGETMMDDSPIDED 278
            PSHPSF   G+PPPP KWKG CEF  S CNNKLIGARSFN+  K  KG T   + P+D+D
Sbjct: 887  PSHPSFSGEGIPPPPAKWKGSCEFMASECNNKLIGARSFNVGAKATKGVTA--EPPLDDD 946

Query: 279  GHGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGEDCPDTDILAALDAA 338
            GHGTHTASTAAGAFVK A+ LGNAKGTAVGMAP AHLAIYKVCFG DCP++D++A LDAA
Sbjct: 947  GHGTHTASTAAGAFVKNADVLGNAKGTAVGMAPYAHLAIYKVCFGPDCPESDVIAGLDAA 1006

Query: 339  IEDGVDVLSLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATLSNEAPWIL 398
            +EDGVDV+S+SLG P+VPFFQD +A+G+FAA+QKGIFVSCSA NSGPF  TLSNEAPWIL
Sbjct: 1007 VEDGVDVISISLGDPAVPFFQDNIAVGSFAAMQKGIFVSCSAGNSGPFNTTLSNEAPWIL 1066

Query: 399  TVAASTIDRRIKAAAKLGNGEEFDGESLFQPSDFPPTFLPLVYAGEKNQT-AALCGEGSL 458
            TV AS+IDR IKAAAKLGNGE+FDGE+LFQPSDFP T LPLVYAG   +  +A+CGEGSL
Sbjct: 1067 TVGASSIDRTIKAAAKLGNGEQFDGETLFQPSDFPATQLPLVYAGMNGKPESAVCGEGSL 1126

Query: 459  KDIDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADAHVLPASHVS 518
            K+IDVKGK+V+C+RGGGIARI KGTEVKNAGGAAMIL+NQ+ DGFST ADAHVLPA+HVS
Sbjct: 1127 KNIDVKGKVVLCDRGGGIARIDKGTEVKNAGGAAMILVNQESDGFSTLADAHVLPATHVS 1186

Query: 519  HKAALKIKAYINSTTYPTATILFKGTVIGDDNFSPAIASFSSRGPSVASPGILKPDITGP 578
            + A LKIKAYINST  PTA ILFKGTVIG+   SPAI SFSSRGPS ASPGILKPDI GP
Sbjct: 1187 YAAGLKIKAYINSTATPTAAILFKGTVIGNP-LSPAITSFSSRGPSFASPGILKPDIIGP 1246

Query: 579  GVSILAAWPFPLDKNSNTKSTFNIISGTSMSCPHLSGIAALIKSSHPDWSPAAIKSAIMT 638
            GVSILAAWPFPLD N N+KSTFNIISGTSMSCPHLSGIAAL+KSSHPDWSPAAIKSAIMT
Sbjct: 1247 GVSILAAWPFPLDNNINSKSTFNIISGTSMSCPHLSGIAALLKSSHPDWSPAAIKSAIMT 1306

Query: 639  TADITNLEGQPIVDENLQPADLFATGAGHVNPSKAADPGLVYDIQPDDYIPYLCGLGYKS 698
            TAD+ N+ G+PIVDE L PAD+FATGAGHVNPS+A DPGLVYDI+PDDYIPYLCGLGY  
Sbjct: 1307 TADLLNVGGKPIVDERLLPADIFATGAGHVNPSRANDPGLVYDIEPDDYIPYLCGLGYTD 1366

Query: 699  NEVATIARKPINCLAKPSIPEGDLNYPSFTVVLGPPQTFTRTVTNVGCGREVYTAVVEAP 758
             EV  +A + I C  + SIPEG+LNYPSF+V LGPPQTFTRTVTNVG     YT     P
Sbjct: 1367 TEVGILAHRSIKCSEESSIPEGELNYPSFSVALGPPQTFTRTVTNVGEAYSSYTVTAIVP 1426

Query: 759  PSISVTIRPSKIFFSKINEKVTYSVTFKRIGSISPSTEFGKGYLKWVSDKHVVRSPISFK 807
              + V++ P K++FSK+N+K+TYSVTF    S   S++F +GYLKWVS KH V SPIS  
Sbjct: 1427 QGVDVSVNPDKLYFSKVNQKLTYSVTFSHNSSSGKSSKFAQGYLKWVSGKHSVGSPISIM 1484

BLAST of Cp4.1LG05g03970.1 vs. NCBI nr
Match: gi|566162226|ref|XP_002303551.2| (hypothetical protein POPTR_0003s11870g [Populus trichocarpa])

HSP 1 Score: 1031.6 bits (2666), Expect = 7.5e-298
Identity = 517/721 (71.71%), Positives = 592/721 (82.11%), Query Frame = 1

Query: 99  LQTYIVHVEKPETT-----DDLESWHRSFLPSSSS-------LLYSYRNVMSGFAARLSE 158
           L  YIVHV KPE       +DLESW++SFLP S++       +LY+Y+NVMSGFAARL++
Sbjct: 35  LLNYIVHVAKPEGRTMAEFEDLESWYQSFLPVSTASSEKQQRMLYAYQNVMSGFAARLTQ 94

Query: 159 EQVKAMEEKDGFVSARRERILQLHTTHTPDFLGLNRQFGFWKDSNFGKGVIIGVLDGGIA 218
           E+VK+MEEKDGF+SAR ERIL L TTHTP FLGL+++ GFWK+SNFGKGVIIGVLDGGI 
Sbjct: 95  EEVKSMEEKDGFLSARPERILHLQTTHTPRFLGLHQELGFWKESNFGKGVIIGVLDGGIF 154

Query: 219 PSHPSFDDVGMPPPPPKWKGRCEFNFSACNNKLIGARSFNLATKVLKGETMMDDSPIDED 278
           PSHPSF D GMPPPP KWKGRC+FN S CNNKLIGARSFN+A K  KG    +  PID D
Sbjct: 155 PSHPSFSDEGMPPPPAKWKGRCDFNASDCNNKLIGARSFNIAAKAKKGSAATEP-PIDVD 214

Query: 279 GHGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGE---DCPDTDILAAL 338
           GHGTHTASTAAGAFVK AE LGNA+GTAVG+AP AHLAIYKVCFG+   DCP++DILA L
Sbjct: 215 GHGTHTASTAAGAFVKDAEVLGNARGTAVGIAPHAHLAIYKVCFGDPGDDCPESDILAGL 274

Query: 339 DAAIEDGVDVLSLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATLSNEAP 398
           DAA++DGVDVLSLSLG  SVP F D +AIG+FAAIQKGIFVSCSA NSGPF  TLSNEAP
Sbjct: 275 DAAVQDGVDVLSLSLGEDSVPLFNDTIAIGSFAAIQKGIFVSCSAGNSGPFNGTLSNEAP 334

Query: 399 WILTVAASTIDRRIKAAAKLGNGEEFDGESLFQPSDFPPTFLPLVYAGEKNQ-TAALCGE 458
           WILTV AST+DRR  A A+LGNGE+ DGESL Q S+FP T LPLVYAG   +  ++LCGE
Sbjct: 335 WILTVGASTVDRRFSATARLGNGEQIDGESLSQHSNFPSTLLPLVYAGMSGKPNSSLCGE 394

Query: 459 GSLKDIDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADAHVLPAS 518
           G+L+ +DVKGKIV+CERGGGI RIAKG EVKNAGGAAMIL+N++ DGFST AD HVLPA+
Sbjct: 395 GALEGMDVKGKIVLCERGGGIGRIAKGGEVKNAGGAAMILMNEEADGFSTNADVHVLPAT 454

Query: 519 HVSHKAALKIKAYINSTTYPTATILFKGTVIGDDNFSPAIASFSSRGPSVASPGILKPDI 578
           HVS  A LKIKAYINST  P ATILFKGTVIGD + SP +ASFSSRGPS+ASPGILKPDI
Sbjct: 455 HVSFAAGLKIKAYINSTQAPMATILFKGTVIGDSS-SPFVASFSSRGPSLASPGILKPDI 514

Query: 579 TGPGVSILAAWPFPLDKNSNTKSTFNIISGTSMSCPHLSGIAALIKSSHPDWSPAAIKSA 638
            GPGVSILAAWPFPLD N+N+KSTFNIISGTSMSCPHLSGIAAL+KSSHP WSPAAIKSA
Sbjct: 515 IGPGVSILAAWPFPLDNNTNSKSTFNIISGTSMSCPHLSGIAALLKSSHPYWSPAAIKSA 574

Query: 639 IMTTADITNLEGQPIVDENLQPADLFATGAGHVNPSKAADPGLVYDIQPDDYIPYLCGLG 698
           IMTTAD  N+EG+ IVD+ LQPAD+FATGAGHVNPS+A +PGLVYDIQPDDYIPYLCGLG
Sbjct: 575 IMTTADTLNMEGKLIVDQTLQPADIFATGAGHVNPSRANNPGLVYDIQPDDYIPYLCGLG 634

Query: 699 YKSNEVATIARKPINCLAKPSIPEGDLNYPSFTVVLGPPQTFTRTVTNVGCGREVYTAVV 758
           Y  NEV+ I  + + C  KPSIPEG+LNYPSF V LGP QTFTRTVTNVG     Y   +
Sbjct: 635 YADNEVSIIVHEQVKCSEKPSIPEGELNYPSFAVTLGPSQTFTRTVTNVGDVNSAYEVAI 694

Query: 759 EAPPSISVTIRPSKIFFSKINEKVTYSVTFKRIGSISPSTEFGKGYLKWVSDKHVVRSPI 804
            +PP + VT++PSK++FSK+N+K TYSV F R      ++E  +GY+ W S K+ VRSPI
Sbjct: 695 VSPPGVDVTVKPSKLYFSKVNQKATYSVAFSRTEYGGKTSETAQGYIVWASAKYTVRSPI 753
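
The percentages in an HSP summary line follow directly from the match counts (matches divided by alignment length). The sketch below recomputes them for the Populus trichocarpa hit above. The function name `parse_hsp_summary` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Summary line of the Populus trichocarpa HSP, copied from the report above.
HSP_LINE = "Identity = 517/721 (71.71%), Positives = 592/721 (82.11%), Query Frame = 1"


def parse_hsp_summary(line):
    """Return {label: (matches, alignment_length, percent)} for each 'X = a/b' field.

    Hypothetical helper: the percent is recomputed from the counts rather than
    read from the parenthesised value, so it can be used as a consistency check.
    """
    stats = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(matches), int(length)
        stats[label] = (matches, length, round(100 * matches / length, 2))
    return stats


summary = parse_hsp_summary(HSP_LINE)
for label, (matches, length, percent) in summary.items():
    print(f"{label}: {matches}/{length} = {percent}%")
```

Fields without an `a/b` count, such as the query frame, are ignored by the pattern.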

BLAST of Cp4.1LG05g03970.1 vs. NCBI nr
Match: gi|703143649|ref|XP_010108071.1| (Subtilisin-like protease SDD1 [Morus notabilis])

HSP 1 Score: 1029.2 bits (2660), Expect = 3.7e-297
Identity = 529/787 (67.22%), Positives = 613/787 (77.89%), Query Frame = 1

Query: 34  YTMVAFPSLFLLLLLNFHYVGALVTELPLINLQTYIVHVEKPETTDDLESWHRSFLPXFI 93
           + M     L  + +LNF +V AL +E+           +   +TT+              
Sbjct: 83  FNMATTSLLSFIFVLNFFHVIALQSEV-----------ISVSQTTESS------------ 142

Query: 94  LPLINLQTYIVHVEKPE-----TTDDLESWHRSFLPSSSS--------LLYSYRNVMSGF 153
               +LQ YI+HV+ P+      ++DLESW+RSFLP++++        +LY+YRNV+ GF
Sbjct: 143 ----SLQNYIIHVKPPKGRVLSQSEDLESWYRSFLPATTAASSDNQPRMLYAYRNVLRGF 202

Query: 154 AARLSEEQVKAMEEKDGFVSARRERILQLHTTHTPDFLGLNRQFGFWKDSNFGKGVIIGV 213
           AARL+++QV+AME KDGF+SAR ERIL+  TTHTP+FLGL++Q GFW+DSNFGKGVIIGV
Sbjct: 203 AARLTQDQVRAMEGKDGFISARPERILKKLTTHTPNFLGLHQQKGFWRDSNFGKGVIIGV 262

Query: 214 LDGGIAPSHPSFDDVGMPPPPPKWKGRCEFNFSACNNKLIGARSFNLATKVLKGETMMDD 273
           LDGGI PSHPSF D GMPPPP KWKGRC+FN S CNNKLIGARSFNLA K  KG+    +
Sbjct: 263 LDGGIFPSHPSFSDEGMPPPPAKWKGRCDFNVSDCNNKLIGARSFNLAAKATKGDKA--E 322

Query: 274 SPIDEDGHGTHTASTAAGAFVKGAEALGNAKGTAVGMAPLAHLAIYKVCFGEDCPDTDIL 333
            PIDEDGHGTHTASTAAG FV  A+ LGNAKGTAVGMAP AHLAIYKVCFGEDCPD DIL
Sbjct: 323 PPIDEDGHGTHTASTAAGGFVNYADVLGNAKGTAVGMAPYAHLAIYKVCFGEDCPDADIL 382

Query: 334 AALDAAIEDGVDVLSLSLGSPSVPFFQDLVAIGAFAAIQKGIFVSCSAANSGPFKATLSN 393
           AALDAA+EDGVDVLSLSLG  S PFF D +AIGAFAA +KGI VSCSA NSGP  +TLSN
Sbjct: 383 AALDAAVEDGVDVLSLSLGDVSRPFFNDSLAIGAFAATEKGILVSCSAGNSGPVNSTLSN 442

Query: 394 EAPWILTVAASTIDRRIKAAAKLGNGEEFDGESLFQPSDFPPTFLPLVYAGEKNQT-AAL 453
           EAPWILTV ASTIDR+I A AKLGN EEFDGES+ +  DFP T  PLVYAG   +  +A 
Sbjct: 443 EAPWILTVGASTIDRKIIATAKLGNDEEFDGESIHR-GDFPQTSWPLVYAGINGKADSAF 502

Query: 454 CGEGSLKDIDVKGKIVVCERGGGIARIAKGTEVKNAGGAAMILLNQQQDGFSTEADAHVL 513
           C EGSLKDIDVK K+V+CERGGG+ RIAKG EVKNAGGAAMIL+NQ+ DGFSTEAD H L
Sbjct: 503 CAEGSLKDIDVKNKVVLCERGGGVGRIAKGEEVKNAGGAAMILVNQESDGFSTEADPHAL 562

Query: 514 PASHVSHKAALKIKAYINSTTYPTATILFKGTVIGDDNFSPAIASFSSRGPSVASPGILK 573
           PA+HVS    LKIKAYINST  PTAT+ FKGTVIGD + +P IASFSSRGP++ASPGILK
Sbjct: 563 PAAHVSFADGLKIKAYINSTATPTATLFFKGTVIGD-SLAPFIASFSSRGPNLASPGILK 622

Query: 574 PDITGPGVSILAAWPFPLDKNSNTKSTFNIISGTSMSCPHLSGIAALIKSSHPDWSPAAI 633
           PDI GPGVSILAAWPFPLD N+N KS FNI+SGTSMSCPHLSGIA L+KSSHP WSPAAI
Sbjct: 623 PDIIGPGVSILAAWPFPLDNNTNPKSPFNIMSGTSMSCPHLSGIAVLLKSSHPYWSPAAI 682

Query: 634 KSAIMTTADITNLEGQPIVDENLQPADLFATGAGHVNPSKAADPGLVYDIQPDDYIPYLC 693
           KSAIMTTADI NLEG+ I+D+ L PAD+FATGAGHVNP KA DPGL+YD+QPDDYIPYLC
Sbjct: 683 KSAIMTTADIVNLEGKAILDQALTPADVFATGAGHVNPIKANDPGLIYDLQPDDYIPYLC 742

Query: 694 GLGYKSNEVATIARKPINCLAKPSIPEGDLNYPSFTVVLGPPQTFTRTVTNVGCGREVYT 753
           GLGY   EV  +AR+PI C  KPSIPEG+LNYPSF+V LGP QTFTRTVTNVG     YT
Sbjct: 743 GLGYNDKEVGIVARRPIKCSEKPSIPEGELNYPSFSVTLGPSQTFTRTVTNVGEAYSTYT 802

Query: 754 AVVEAPPSISVTIRPSKIFFSKINEKVTYSVTFKRIGSISPSTEFGKGYLKWVSDKHVVR 807
           A + AP  + V+++PSK++FSK+N+K TYSV F RI S   +  +G+G+L WVS +H VR
Sbjct: 803 ANIMAPDGVYVSVKPSKLYFSKVNQKATYSVNFSRITSSGETGPYGQGFLTWVSARHCVR 838

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
SBT18_ARATH	3.1e-174	46.99	Subtilisin-like protease SBT1.8 OS=Arabidopsis thaliana GN=SBT1.8 PE=2 SV=1
SBT14_ARATH	1.9e-168	46.32	Subtilisin-like protease SBT1.4 OS=Arabidopsis thaliana GN=SBT1.4 PE=2 SV=1
SBT17_ARATH	8.9e-166	46.16	Subtilisin-like protease SBT1.7 OS=Arabidopsis thaliana GN=SBT1.7 PE=1 SV=1
SBT12_ARATH	2.6e-165	44.09	Subtilisin-like protease SBT1.2 OS=Arabidopsis thaliana GN=SBT1.2 PE=2 SV=1
SBT15_ARATH	1.3e-161	45.04	Subtilisin-like protease SBT1.5 OS=Arabidopsis thaliana GN=SBT1.5 PE=2 SV=1

Match Name	E-value	Identity	Description
A0A0A0KKE3_CUCSA	0.0e+00	79.79	Uncharacterized protein OS=Cucumis sativus GN=Csa_5G157240 PE=4 SV=1
B9GW37_POPTR	5.3e-298	71.71	Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s11870g PE=4 SV=2
W9RYM8_9ROSA	2.6e-297	67.22	Subtilisin-like protease SDD1 OS=Morus notabilis GN=L484_023157 PE=4 SV=1
A0A067JNI9_JATCU	6.4e-296	70.66	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_20707 PE=4 SV=1
B9RBY5_RICCO	7.1e-295	66.16	Cucumisin, putative OS=Ricinus communis GN=RCOM_1682320 PE=4 SV=1

Match Name	E-value	Identity	Description
AT2G05920.1	1.7e-175	46.99	Subtilase family protein
AT3G14067.1	1.1e-169	46.32	Subtilase family protein
AT5G67360.1	5.0e-167	46.16	Subtilase family protein
AT1G04110.1	1.5e-166	44.09	Subtilase family protein
AT3G14240.1	7.5e-163	45.04	Subtilase family protein

Match Name	E-value	Identity	Description
gi|449459724|ref|XP_004147596.1|	0.0e+00	79.79	PREDICTED: subtilisin-like protease SBT1.2 [Cucumis sativus]
gi|659073656|ref|XP_008437181.1|	0.0e+00	80.18	PREDICTED: subtilisin-like protease [Cucumis melo]
gi|731379755|ref|XP_002275471.2|	3.5e-303	72.12	PREDICTED: uncharacterized protein LOC100242816 [Vitis vinifera]
gi|566162226|ref|XP_002303551.2|	7.5e-298	71.71	hypothetical protein POPTR_0003s11870g [Populus trichocarpa]
gi|703143649|ref|XP_010108071.1|	3.7e-297	67.22	Subtilisin-like protease SDD1 [Morus notabilis]

The following terms have been associated with this mRNA:
Vocabulary: Biological Process
Term	Definition
GO:0006508	proteolysis
Vocabulary: Molecular Function
Term	Definition
GO:0004252	serine-type endopeptidase activity
Vocabulary: INTERPRO
Term	Definition
IPR015500	Peptidase_S8_subtilisin-rel
IPR010259	S8pro/Inhibitor_I9
IPR009020	Proteinase propeptide inhibitor
IPR003137	PA_domain
IPR000209	Peptidase_S8/S53_dom
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004252 serine-type endopeptidase activity

This mRNA is a part of the following gene feature(s):

Feature Name	Unique Name	Type
Cp4.1LG05g03970	Cp4.1LG05g03970	gene


The following CDS feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
Cp4.1LG05g03970.1:cds:002	Cp4.1LG05g03970.1:cds:002	CDS
Cp4.1LG05g03970.1:cds:001	Cp4.1LG05g03970.1:cds:001	CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name	Unique Name	Type
Cp4.1LG05g03970.1	Cp4.1LG05g03970.1-protein	polypeptide


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000209	Peptidase S8/S53 domain	GENE3D	G3DSA:3.40.50.200		coord: 144..393, score: 8.5E-78; coord: 539..665, score: 8.5
IPR000209	Peptidase S8/S53 domain	PFAM	PF00082	Peptidase_S8	coord: 193..652, score: 4.8
IPR000209	Peptidase S8/S53 domain	unknown	SSF52743	Subtilisin-like	coord: 169..432, score: 4.71E-80; coord: 542..665, score: 4.71
IPR003137	PA domain	PFAM	PF02225	PA	coord: 426..511, score: 8.4
IPR009020	Proteinase propeptide inhibitor	unknown	SSF54897	Protease propeptides/inhibitors	coord: 101..155, score: 6.3
IPR010259	Peptidase S8 propeptide/proteinase inhibitor I9	PFAM	PF05922	Inhibitor_I9	coord: 101..170, score: 3.0
IPR015500	Peptidase S8, subtilisin-related	PRINTS	PR00723	SUBTILISIN	coord: 264..277, score: 9.9E-19; coord: 591..607, score: 9.9E-19; coord: 193..212, score: 9.9
IPR015500	Peptidase S8, subtilisin-related	PANTHER	PTHR10795	PROPROTEIN CONVERTASE SUBTILISIN/KEXIN	coord: 15..803, score:
None	No IPR available	GENE3D	G3DSA:3.50.30.30		coord: 426..520, score: 3.4
None	No IPR available	PANTHER	PTHR10795:SF370	SUBFAMILY NOT NAMED	coord: 15..803, score:
None	No IPR available	unknown	SSF52025	PA domain	coord: 412..507, score: 7.5
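
The Pfam coordinates listed above can be turned into domain lengths and checked for nesting. The sketch below does this for the three Pfam hits, assuming the coordinates are 1-based, inclusive residue positions on the predicted protein (the page does not state the convention). The helper names `span_length` and `contains` are hypothetical.

```python
# Pfam hits copied from the InterPro table above: name -> (start, end).
# Assumption: coordinates are 1-based and inclusive.
DOMAINS = {
    "Inhibitor_I9 (PF05922)": (101, 170),
    "Peptidase_S8 (PF00082)": (193, 652),
    "PA (PF02225)": (426, 511),
}


def span_length(start, end):
    """Number of residues covered by an inclusive coordinate range."""
    return end - start + 1


def contains(outer, inner):
    """True if the range `inner` lies entirely within the range `outer`."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


for name, (start, end) in DOMAINS.items():
    print(f"{name}: {start}..{end} ({span_length(start, end)} residues)")

# Check whether the PA domain hit falls inside the Peptidase_S8 hit.
pa_inside_s8 = contains(DOMAINS["Peptidase_S8 (PF00082)"], DOMAINS["PA (PF02225)"])
print("PA domain nested in Peptidase_S8:", pa_inside_s8)
```

With these coordinates the PA hit (426..511) lies inside the Peptidase_S8 hit (193..652), while the Inhibitor_I9 hit ends before the Peptidase_S8 hit begins.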