Cp4.1LG02g12240.1 (mRNA) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG02g12240.1
TypemRNA
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Descriptionaspartic proteinase-like protein 2
LocationCp4.1LG02: 11890149 .. 11896384 (-)
Sequence length2102
RNA-Seq ExpressionCp4.1LG02g12240.1
SyntenyCp4.1LG02g12240.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGCTTGAGGTTCTTCTCTTCGTCTGTTCATTGAAAATAGCATTGCCTAATAAAACTCTTCATTGACTTCGCTAAATCTCTCTCTCTCTCTCTGTTTCTCTCTCCGGTTTCTTTTTCTAGTAGTTAATAGGTAGAGAATTTCTTCTTGGGGCCTGGCCTGCAGTCAATCACAGTGGAATCGAAGGTTGTTCGTAAGGCCAGGGAATTGAATTGACTGAGAAGCACTGCCGGATTCAAAATCAACCGTTCTATCTATTTGTTTATTCAAAATCAAGGGGAAGTTTGAGACTAAGACTTTGATCTTTCTATATTCCCCTGTTTCTTATTTGGTTTATCCATTGCTGTTAGTCTATAAATCTGAGCTGAGCATCGATTTTAGTTGATTTTTGGTTCTGGGTTTTGGTTTAATCTCTGTTTGATTTGGGGGTTTGGTCTTAGTTATGGAGATTGCAAGATTTTCTATAGTGAACTTGTTGTTAGTGATTTCGTTTTTGTCGAGTGGGGATTGTAATTTGGTGTTTAAGGTTCAGCATAAATTCAATGGTCGACAGAGGTCGTTGAGTGCGTTCAAAGCCCACGATATGCACCGTCGCGGTAGATTTCTTTCTGCTATTGATCTTGAATTGGGTGGCAATGGACATCCTTCTGAATCTGGGTTAGTCTTGATTTCTATTGTGATTATTTGCTATACACTGAAAATTATGGTGTTTTTTATGCGGTAATCTTGATTAGTAATTGCTTATTTAGTGGATAATTTATTCCATTCTGATGTTCTTTCAAAAAGGGCTGGTGTTATTGGATTCTTTTCTACCTCTTTTTAGTAAACACTGTATGGAATCTTTATTTTTTCCTTTTTGTTTTGGCTTTGTGAAATCAATCACTGCTCAAGTTCTTAATTGCTCAAGTTTTTAATTGCTTCAGATCGTAAGAGGAGATTTTTTTCGTGCTTTTTTTGGAAATCCTTGCCTCAAAGTTCTCTTATTCATATTCATTATTGGATGGTGGAAGTAATTATATGCTTTCAGGCTGTACTTTGCTAAAATTGGGCTTGGGACACCAGCCAAGGACTATTATGTTCAAGTGGATACAGGAAGTGACCTCTTATGGGTGAATTGTGCAGGCTGTACAAACTGTCCAAGGAAAAGTGATCTTGGTGTATGTCTCCTGAACTTTAAATTCTTGTTGAATTTCATATTATATTCAGAAGTACACCAACCATTTGTAACTTGTTTTCGTATTGCACCGATCATAGTTTCCGAGCTATACTCTGATTATTTTTGAAGAAACGTTAATTATACGATTGTTCGTTACAGATAGAACTGACTTTATACAATCCATCAAGCTCTAGTACTTCAAACCTGGTAACTTGTGATCAAGATTTTTGCACTTCCTCACATGATGGTCCAATTCCCGGCTGCACGCCTGATCTGAACTGTGAATATACAGTTTCATATGGAGATGGAAGCTCAACTACTGGGTATTTTGTGGAGGATCGTGTAGTACTTGATCAAGTAACGGGAAATTTTCGAACTGCATCTACAAATGGGAGTATAGTATTTGGGTAAGAATATCATTTTCTACGTTACCATTTATTTCAGCACTGTGTCGAGCAACCGAACTTTTGGGTCCTGTTGCTTAGTTTAAGGAATGTGATGCCTTTGAGGGATATTTTGACGTTGACTGTCACAAAAAGAACAGAGGACTTTTTAGATGCTTTGGGATTGTTGATGTTTATGATATACCTTGAACAAGTAAGTTCAAGTCATTTGGAGCATAAACTATATACTTTCTGAAATTCTTTTATTCGTTGGAGCTTTACCATCTCTGAAGTTGAGATTTTTGAAGTATAAACTACATAGCTTTATTTTTTGTGAGAAGGACGCTGGGCCCGAAGTAGGGTGGACTGTGAGATCCCACATCGATTGGGGAGGAGAACGAACCATTCTTTATAAGAGTGTGGAAACCTCTCCCTAGCAGACGCGTTTTAAAAACCTTGAGGGAAGGTCGGAAGGAAAAACCTAAAGAGGACAATATCTGCTAGTGGTGGACTTGGGTCGTTACATTTTTAGATGAATATTAGAAAACCTATCCTGTTATTTGTTTGTGTTTGAAGTAGTATATAGTAGAAAACCTAGCGGTGGGCTTGGGCCGTTATTTGTTTTTGTTCTTCTATTTCTTCATGCTTATTTTTGTTTTCCTTGATGGATAGTTGTGGTGCTCAACAATCTGGCCAACTAGGTGCAACATCTGCTGCGGTCGATGGGATACTTGGTTTCGGACAATCAAATTCATCCATGATTTTGCAGCTGGCTTCATTAGGAAAAGTTAGAAGGATTTTTGCTCATTGCTTGGATAACATTAAAGGAGGTGGAATTTTTGCCATTGGGGAGGTGGTGCAGCCAAAAGTCCACACCACTCCATTGGTGCAGCAACAGTATGTTATACTTCCCTGCATTTAGTAATCTTTGTCATCTTGAATCTAAACTGTTGATTTACTTTAACTAGTACAAGTGATGAGTTAATAGCATATTTTGTTTTGAACTTTTTCATATCTATGTGAAAAATTCAAGAGGGAAAAAGGAATTAACCATCTGTTTGTGGAAGATAAATTGAACATTGTTCCATTATTGTTCTCTGCCGAAAGGAATTCATGAAACCTTATCCCTTTCAAGGAGAATTTCATGGAGAAAAATAATATCCTTAGCAAAGAGTTGAACAATAGATGAAGTTGTAAGAAAGAGGAGTTGGAAAAAGGTGTTGGGTTGAACAAAAATGATATGATATCTTTAACAATCCATCTCAACAAAAAATGACCCGGATCATATCATGTAGAGTCGTGACTTTGATCGAAGACTGGCGCAATTTTTTTAAGGTGCTCAGCTTTTGTTAGCTCCTAGAGTTGCAAAGAGGCGTTTCAGGCGCTTTTCTTTCATCTGCCCTTTTGTGATAAAGGGAGGTTTTTGTCGCAAACTTCTGTGTGTGCTATTTTGTGGGGTCTTTGAGTAGAGAGAGACAACATAATTTTCGGAGGACGTAAGAGTTCTTTTAGCGATGTTTGGTCCCTTGTTAGATTCTTTTTTTTTTGGGTCCCTCAAAGGCAGGGTGAAAGCGTTTGAAAATAATCCAATTACGATACGCTATCGGTATCCTCAATAGTGTTTTTGTTGTTATCGAACATGGGGTAGAATTTTGGATGCTCGTAGGGAAGGAGAATTGCAAGCAATCAGCCCCGTTCCGGTGCTTTGTTGCTTATTTAACCCTTCTCAAACACAGATTAGTATTTTACATGCATTTATATATCTTCTTTTTAAATATTTCGCTTTCTTTTCTTAATTAGTAGCCTTGGACTTAGTTATTTCATCATCAACTTTAGGGCACATTACAATGTGGCTATGAAGGCAATTGAGGTTGGCGGTGACATGCTGAATCTCCCCACAGATGTTTTTGACACTGATTTAAGCAGAGGAACAATTATTGACAGTGGCACGACGTTAGCTTATCTTCCAGACGAGATTTATGAACCATTAATAGCAAAGGTACACTAAATAATTTTTTTTTATTTTACGTTTTTGTTAATCGTTTCCCGTCAAAAACAATATTTAAAGTTCGTATTTTTAGAAACTGAAATTGTCTTTACTATGTTAGATTTTTGCAGGGCAGACTGGACTGAAGTTGCATACTGTTGAAAAACAGTTTACTTGCTTTCAATACAACGGAAAGTAAGATGACATGATACCTTCCTGGCTAACTTCTATTCGTCTTTGTTAAATGTATAAAATAGCTATCTTATTGTTCAACCTTTGATGTTCATTTCGGGGTCGGTCTTATTTCATTCACATGATCGTTAAATGTGTTGCTGGTTGAGGTTCTTATTTATTTCTTTTTATTTGGTCAGTTTTTTCTAGATTCATGAGCAAAGATTTGTTAGTGTTCTTCCCGAGTAAATGATATCCATGTCAAAAAATAGTAGTACAAAGTGTCTGGGGAAAAAGAGATATCTCGAATTGGGGCACAAACTGTTCATCATATTAAAAATAGTTAAAATGGATTCTTGTTAAAAGTCATGTTAGAGATACCCTCTAGGATCTTTGGTTCAAGTCCGAGCTTAATAACTTGAAACTCTGTCATCTCTAGAGTTGGTCTTGGGATAGGAGGACCCTCTCTAGTTGGTGTAACAACCTAAGCCCAGCACTAGCCGATATTGTTCTCTTTGAACTTTCCCTTCTGGCCTTCCCCTCAAAGTTTTTAGAACGTGTCTGTTAGGGAGAGGTTTCCACACCCTTATGAATAATGATGTGAGATTTCACAATCCACTCCCCTTCGGGGCCCAGCGTCCTTGCTTGCACATCGCCTCGTGTCCACCTCCACACCTTTACAAAGAATGTTTCGTTTTCCTCCCCAACCGATGTGAGATTTCACATGTGGTCATGAGCATAATGCAAAAAGTGAGGCCCCAAACTCTGCATCTCTTAAATATAATAAGCATCATAAGAGTTTAGCATATTGCATAGTTTACTTTACCTAATCAAAACATCCTTCAACATGTTGAGACTAGAACATCTTATTAATATCGCTACGCTTAAAACATTCATTATCTGAACATTTTCTTAAAAGTTTCTCATTTGGCTTTTATGTTCTTCATGCAGCATTGATGACGGATTTCCTGATGTTACATTTCACTTTGAAGGTTCACTATCTTTGAGAGTTCGTCCTCGTGAGTATCTGTTTGATATTGATGTGAGTGAATTGAACTTTGAAGTATGACCAGACTTCCTTTTCTCATGCTTTATTGATGCTGACCCCAATAATAAAAATTTGGGCTAATAATGTCAATGATAGGAATATTAGCAAAAAAAAAAAAAAATTAATTTGATAATTACTTAAAGAAGAACATATGAGAATGTCTAAGCTGTGATTTGATAAATTTTGCAGAGTGATAAATGGTGTGTCGGTTGGCAAAACAGTGGTGCCCAATCTAGGGATGGAAAGGGTATGATTCTGTTGGGAGGTCAGTGAAAAGACAATGAACTCGTTTATTATTTCATCACTTCAAATTATGAAATTCTGCTGCTGCAATTTGCCTTACTTGGGTGTTCCTTCTTGGGATACACATGATTGTTAACCAGTTTTCTTTTTTCATTTGTTTCTGATTTCAGATTTGGTGCTTCAAAATAGACTCGTGCTATACGACTTGGAAAATCAGACCATCGGTTGGACAGAGTACAACTGTGAGTACACTCGACTCTTATTTAGCATTAGATAAATTTCGAGTACACTTGTAACAGCCCAAGTCCAAAGATACGCTTTTTAAAAATCTTGAGGGGAAGCCTGAAAGGGAAAGCCCAAAGAGGACACAATAATTGCTAGCGGTGGGCTTGGGCCGTTACAACAACACTTGACTCTTATTTTCTATGTTTCATACGACCCTGCCTTACAAATTTACATGCCTGTGATGCCTATGCCCAGCTTTCTGATTGAATTGCTTTACTGGTTGAAAACAGTAGATTTTTATGCTCTTTCTATTCAAACTTTTGTTAACTATTTGTTTAAACTTGAAAAAGAAATTCATTGGCGTCATATGTTAGCAAGCACTTATTGTTTACGTCTGATGGACCAACAAGTAATGATTTGGTACAATGTGACATAAAGGGTAGGCCAATTATAGATAAAGTAAGGGCAGGTAGGACAAAAACGAGTTCGACGGGTCGGTGGCGGGTTGCTTTTAAATTGGTTTTGGTCTTACTCTTTTTGGCTGGCTTCTCTGTTGTACAAGGTTGTGTTAAATCATGTAGGTCCATGTTCCTCAATATAGTAACAGTGTTGGATGGTTCTGTTGAAATGTTAATATGTTTGAATGGGGAATTGCAGGCTCTTCAAGCATTAAGGTGAGGGATGAGAAGAGCGGAGGCATATATACAGTTGGGCCTCATGATCTTTCTTCAGCTTCTTCTCTAAGAACTGGAAGAACATCAGTGGTGTTGTTCATACTAATGCTTACCATGCTCCATTCTTTTACAAACTAAAAACTAAACAAACATAACATTGTTTTCAAAGTGACTTGAGTTCAACATAAAAAATAGTATGAACAACACCACTCATTCTTTTTCATCATTGAATTGGTTCTTTAAGATCCATTTCTCAATACTGAATTATTCTTTGGGCTTTGTGTAAACAACTTATCCTTTCCATTCATAGGCCTCAACTTAATTCATATATATAAAAACTTTATTGTAATGT

mRNA sequence

TGCTTGAGGTTCTTCTCTTCGTCTGTTCATTGAAAATAGCATTGCCTAATAAAACTCTTCATTGACTTCGCTAAATCTCTCTCTCTCTCTCTGTTTCTCTCTCCGGTTTCTTTTTCTAGTAGTTAATAGGTAGAGAATTTCTTCTTGGGGCCTGGCCTGCAGTCAATCACAGTGGAATCGAAGGTTGTTCGTAAGGCCAGGGAATTGAATTGACTGAGAAGCACTGCCGGATTCAAAATCAACCGTTCTATCTATTTGTTTATTCAAAATCAAGGGGAAGTTTGAGACTAAGACTTTGATCTTTCTATATTCCCCTGTTTCTTATTTGGTTTATCCATTGCTGTTAGTCTATAAATCTGAGCTGAGCATCGATTTTAGTTGATTTTTGGTTCTGGGTTTTGGTTTAATCTCTGTTTGATTTGGGGGTTTGGTCTTAGTTATGGAGATTGCAAGATTTTCTATAGTGAACTTGTTGTTAGTGATTTCGTTTTTGTCGAGTGGGGATTGTAATTTGGTGTTTAAGGTTCAGCATAAATTCAATGGTCGACAGAGGTCGTTGAGTGCGTTCAAAGCCCACGATATGCACCGTCGCGGTAGATTTCTTTCTGCTATTGATCTTGAATTGGGTGGCAATGGACATCCTTCTGAATCTGGGCTGTACTTTGCTAAAATTGGGCTTGGGACACCAGCCAAGGACTATTATGTTCAAGTGGATACAGGAAGTGACCTCTTATGGGTGAATTGTGCAGGCTGTACAAACTGTCCAAGGAAAAGTGATCTTGGTATAGAACTGACTTTATACAATCCATCAAGCTCTAGTACTTCAAACCTGGTAACTTGTGATCAAGATTTTTGCACTTCCTCACATGATGGTCCAATTCCCGGCTGCACGCCTGATCTGAACTGTGAATATACAGTTTCATATGGAGATGGAAGCTCAACTACTGGGTATTTTGTGGAGGATCGTGTAGTACTTGATCAAGTAACGGGAAATTTTCGAACTGCATCTACAAATGGGAGTATAGTATTTGGTTGTGGTGCTCAACAATCTGGCCAACTAGGTGCAACATCTGCTGCGGTCGATGGGATACTTGGTTTCGGACAATCAAATTCATCCATGATTTTGCAGCTGGCTTCATTAGGAAAAGTTAGAAGGATTTTTGCTCATTGCTTGGATAACATTAAAGGAGGTGGAATTTTTGCCATTGGGGAGGTGGTGCAGCCAAAAGTCCACACCACTCCATTGGTGCAGCAACAGGCACATTACAATGTGGCTATGAAGGCAATTGAGGTTGGCGGTGACATGCTGAATCTCCCCACAGATGTTTTTGACACTGATTTAAGCAGAGGAACAATTATTGACAGTGGCACGACGTTAGCTTATCTTCCAGACGAGATTTATGAACCATTAATAGCAAAGATTTTTGCAGGGCAGACTGGACTGAAGTTGCATACTGTTGAAAAACAGTTTACTTGCTTTCAATACAACGGAAACATTGATGACGGATTTCCTGATGTTACATTTCACTTTGAAGGTTCACTATCTTTGAGAGTTCGTCCTCGTGAGTATCTGTTTGATATTGATAGTGATAAATGGTGTGTCGGTTGGCAAAACAGTGGTGCCCAATCTAGGGATGGAAAGGGTATGATTCTGTTGGGAGATTTGGTGCTTCAAAATAGACTCGTGCTATACGACTTGGAAAATCAGACCATCGGTTGGACAGAGTACAACTGCTCTTCAAGCATTAAGGTGAGGGATGAGAAGAGCGGAGGCATATATACAGTTGGGCCTCATGATCTTTCTTCAGCTTCTTCTCTAAGAACTGGAAGAACATCAGTGGTGTTGTTCATACTAATGCTTACCATGCTCCATTCTTTTACAAACTAAAAACTAAACAAACATAACATTGTTTTCAAAGTGACTTGAGTTCAACATAAAAAATAGTATGAACAACACCACTCATTCTTTTTCATCATTGAATTGGTTCTTTAAGATCCATTTCTCAATACTGAATTATTCTTTGGGCTTTGTGTAAACAACTTATCCTTTCCATTCATAGGCCTCAACTTAATTCATATATATAAAAACTTTATTGTAATGT

Coding sequence (CDS)

ATGGAGATTGCAAGATTTTCTATAGTGAACTTGTTGTTAGTGATTTCGTTTTTGTCGAGTGGGGATTGTAATTTGGTGTTTAAGGTTCAGCATAAATTCAATGGTCGACAGAGGTCGTTGAGTGCGTTCAAAGCCCACGATATGCACCGTCGCGGTAGATTTCTTTCTGCTATTGATCTTGAATTGGGTGGCAATGGACATCCTTCTGAATCTGGGCTGTACTTTGCTAAAATTGGGCTTGGGACACCAGCCAAGGACTATTATGTTCAAGTGGATACAGGAAGTGACCTCTTATGGGTGAATTGTGCAGGCTGTACAAACTGTCCAAGGAAAAGTGATCTTGGTATAGAACTGACTTTATACAATCCATCAAGCTCTAGTACTTCAAACCTGGTAACTTGTGATCAAGATTTTTGCACTTCCTCACATGATGGTCCAATTCCCGGCTGCACGCCTGATCTGAACTGTGAATATACAGTTTCATATGGAGATGGAAGCTCAACTACTGGGTATTTTGTGGAGGATCGTGTAGTACTTGATCAAGTAACGGGAAATTTTCGAACTGCATCTACAAATGGGAGTATAGTATTTGGTTGTGGTGCTCAACAATCTGGCCAACTAGGTGCAACATCTGCTGCGGTCGATGGGATACTTGGTTTCGGACAATCAAATTCATCCATGATTTTGCAGCTGGCTTCATTAGGAAAAGTTAGAAGGATTTTTGCTCATTGCTTGGATAACATTAAAGGAGGTGGAATTTTTGCCATTGGGGAGGTGGTGCAGCCAAAAGTCCACACCACTCCATTGGTGCAGCAACAGGCACATTACAATGTGGCTATGAAGGCAATTGAGGTTGGCGGTGACATGCTGAATCTCCCCACAGATGTTTTTGACACTGATTTAAGCAGAGGAACAATTATTGACAGTGGCACGACGTTAGCTTATCTTCCAGACGAGATTTATGAACCATTAATAGCAAAGATTTTTGCAGGGCAGACTGGACTGAAGTTGCATACTGTTGAAAAACAGTTTACTTGCTTTCAATACAACGGAAACATTGATGACGGATTTCCTGATGTTACATTTCACTTTGAAGGTTCACTATCTTTGAGAGTTCGTCCTCGTGAGTATCTGTTTGATATTGATAGTGATAAATGGTGTGTCGGTTGGCAAAACAGTGGTGCCCAATCTAGGGATGGAAAGGGTATGATTCTGTTGGGAGATTTGGTGCTTCAAAATAGACTCGTGCTATACGACTTGGAAAATCAGACCATCGGTTGGACAGAGTACAACTGCTCTTCAAGCATTAAGGTGAGGGATGAGAAGAGCGGAGGCATATATACAGTTGGGCCTCATGATCTTTCTTCAGCTTCTTCTCTAAGAACTGGAAGAACATCAGTGGTGTTGTTCATACTAATGCTTACCATGCTCCATTCTTTTACAAACTAA

Protein sequence

MEIARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDLELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTLYNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLDQVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRIFAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTDLSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDVTFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDLENQTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSFTN
Homology
BLAST of Cp4.1LG02g12240.1 vs. ExPASy Swiss-Prot
Match: Q4V3D2 (Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1)

HSP 1 Score: 503.8 bits (1296), Expect = 2.1e-141
Identity = 255/480 (53.12%), Postives = 323/480 (67.29%), Query Frame = 0

Query: 3   IARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDLEL 62
           I+R   V  +LVI  +S    N VF V HKF G+++ LS  K+HD  R  R L+ IDL L
Sbjct: 10  ISRIVAVVFVLVIQVVSG---NFVFNVTHKFAGKEKQLSELKSHDSFRHARMLANIDLPL 69

Query: 63  GGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTLYN 122
           GG+      GLYF KI LG+P K+YYVQVDTGSD+LWVNCA C  CP K+DLGI L+LY+
Sbjct: 70  GGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYD 129

Query: 123 PSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLDQV 182
             +SSTS  V C+ DFC+         C     C Y V YGDGS++ G F++D + L+QV
Sbjct: 130 SKTSSTSKNVGCEDDFCSFIMQSET--CGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQV 189

Query: 183 TGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRIFA 242
           TGN RTA     +VFGCG  QSGQLG T +AVDGI+GFGQSN+S+I QLA+ G  +RIF+
Sbjct: 190 TGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFS 249

Query: 243 HCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTDLS 302
           HCLDN+ GGGIFA+GEV  P V TTP+V  Q HYNV +K ++V GD ++LP  +  T+  
Sbjct: 250 HCLDNMNGGGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGD 309

Query: 303 RGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDVTF 362
            GTIIDSGTTLAYLP  +Y  LI KI A Q  +KLH V++ F CF +  N D  FP V  
Sbjct: 310 GGTIIDSGTTLAYLPQNLYNSLIEKITAKQQ-VKLHMVQETFACFSFTSNTDKAFPVVNL 369

Query: 363 HFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDLEN 422
           HFE SL L V P +YLF +  D +C GWQ+ G  ++DG  +ILLGDLVL N+LV+YDLEN
Sbjct: 370 HFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLEN 429

Query: 423 QTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSFTN 482
           + IGW ++NCSSSIKV+D  SG  Y +G  +L SA+S     T V L  +++ + HSFT+
Sbjct: 430 EVIGWADHNCSSSIKVKD-GSGAAYQLGAENLISAASSVMNGTLVTLLSILIWVFHSFTS 482

BLAST of Cp4.1LG02g12240.1 vs. ExPASy Swiss-Prot
Match: Q9S9K4 (Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2)

HSP 1 Score: 471.1 bits (1211), Expect = 1.5e-131
Identity = 231/482 (47.93%), Postives = 322/482 (66.80%), Query Frame = 0

Query: 1   MEIARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL 60
           ME+ R   + + + +  +     N VFK QHKF G++++L  FK+HD  R  R L++IDL
Sbjct: 1   MELRRKLCIVVAVFVIVIEFASANFVFKAQHKFAGKKKNLEHFKSHDTRRHSRMLASIDL 60

Query: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120
            LGG+      GLYF KI LG+P K+Y+VQVDTGSD+LW+NC  C  CP K++L   L+L
Sbjct: 61  PLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSL 120

Query: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD 180
           ++ ++SSTS  V CD DFC  S       C P L C Y + Y D S++ G F+ D + L+
Sbjct: 121 FDMNASSTSKKVGCDDDFC--SFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLE 180

Query: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRI 240
           QVTG+ +T      +VFGCG+ QSGQLG   +AVDG++GFGQSN+S++ QLA+ G  +R+
Sbjct: 181 QVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRV 240

Query: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300
           F+HCLDN+KGGGIFA+G V  PKV TTP+V  Q HYNV +  ++V G  L+LP  +    
Sbjct: 241 FSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIV--- 300

Query: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV 360
            + GTI+DSGTTLAY P  +Y+ LI  I A Q  +KLH VE+ F CF ++ N+D+ FP V
Sbjct: 301 RNGGTIVDSGTTLAYFPKVLYDSLIETILARQP-VKLHIVEETFQCFSFSTNVDEAFPPV 360

Query: 361 TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420
           +F FE S+ L V P +YLF ++ + +C GWQ  G  + +   +ILLGDLVL N+LV+YDL
Sbjct: 361 SFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDL 420

Query: 421 ENQTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF 480
           +N+ IGW ++NCSSSIK++D  SGG+Y+VG  +LSSA  L     + +L IL   ++ +F
Sbjct: 421 DNEVIGWADHNCSSSIKIKD-GSGGVYSVGADNLSSAPRLL--MITKLLTILSPLIVMAF 473

Query: 481 TN 483
           T+
Sbjct: 481 TS 473

BLAST of Cp4.1LG02g12240.1 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 153.3 bits (386), Expect = 7.0e-36
Identity = 113/379 (29.82%), Postives = 172/379 (45.38%), Query Frame = 0

Query: 65  NGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTLYNPS 124
           +G    SG YF++IG+GTPAK+ Y+ +DTGSD+ W+ C  C +C ++SD      ++NP+
Sbjct: 153 SGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSD-----PVFNPT 212

Query: 125 SSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLDQVTG 184
           SSST   +TC    C+         C  +  C Y VSYGDGS T G    D V       
Sbjct: 213 SSSTYKSLTCSAPQCSLLETS---ACRSN-KCLYQVSYGDGSFTVGELATDTV------- 272

Query: 185 NFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRIFAHC 244
            F  +    ++  GCG    G    T AA  G+LG G    S+  Q+ +       F++C
Sbjct: 273 TFGNSGKINNVALGCGHDNEGLF--TGAA--GLLGLGGGVLSITNQMKATS-----FSYC 332

Query: 245 L---DNIKGGGIFAIGEVVQPKVHTTPLVQQQ---AHYNVAMKAIEVGGDMLNLPTDVFD 304
           L   D+ K   +      +     T PL++ +     Y V +    VGG+ + LP  +FD
Sbjct: 333 LVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFD 392

Query: 305 TDL--SRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLK--LHTVEKQFTCFQYNGNID 364
            D   S G I+D GT +  L  + Y  L          LK    ++    TC+ ++    
Sbjct: 393 VDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLST 452

Query: 365 DGFPDVTFHFEGSLSLRVRPREYLFDI-DSDKWCVGWQNSGAQSRDGKGMILLGDLVLQN 424
              P V FHF G  SL +  + YL  + DS  +C  +  + +       + ++G++  Q 
Sbjct: 453 VKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSS------SLSIIGNVQQQG 500

Query: 425 RLVLYDLENQTIGWTEYNC 433
             + YDL    IG +   C
Sbjct: 513 TRITYDLSKNVIGLSGNKC 500

BLAST of Cp4.1LG02g12240.1 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 147.9 bits (372), Expect = 2.9e-34
Identity = 121/406 (29.80%), Postives = 186/406 (45.81%), Query Frame = 0

Query: 50  RRGRFLSAIDLELGGNGHP--SESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTN 109
           RR R ++A+     G   P  +  G Y   + +GTP   +   +DTGSDL+W  C  CT 
Sbjct: 70  RRMRSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQ 129

Query: 110 CPRKSDLGIELTLYNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSS 169
           C           ++NP  SS+ + + C+  +C    D P   C  +  C+YT  YGDGS+
Sbjct: 130 C-----FSQPTPIFNPQDSSSFSTLPCESQYC---QDLPSETCNNN-ECQYTYGYGDGST 189

Query: 170 TTGYFVEDRVVLDQVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSM 229
           T GY   +       T  F T+S   +I FGCG    G  G  + A  G++G G    S+
Sbjct: 190 TQGYMATE-------TFTFETSSV-PNIAFGCGEDNQG-FGQGNGA--GLIGMGWGPLSL 249

Query: 230 ILQLASLGKVRRIFAHCLDNI--KGGGIFAIGEV---VQPKVHTTPLVQQQ---AHYNVA 289
             QL  +G+    F++C+ +         A+G     V     +T L+       +Y + 
Sbjct: 250 PSQL-GVGQ----FSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYIT 309

Query: 290 MKAIEVGGDMLNLPTDVF--DTDLSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKL 349
           ++ I VGGD L +P+  F    D + G IIDSGTTL YLP + Y   +A+ F  Q  + L
Sbjct: 310 LQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYN-AVAQAFTDQ--INL 369

Query: 350 HTVEKQ----FTCFQYNGNIDDG----FPDVTFHFEGSLSLRVRPREYLFDIDSDKWCVG 409
            TV++      TCFQ      DG     P+++  F+G + L +  +  L        C+ 
Sbjct: 370 PTVDESSSGLSTCFQ---QPSDGSTVQVPEISMQFDGGV-LNLGEQNILISPAEGVICLA 429

Query: 410 WQNSGAQSRDGKGMILLGDLVLQNRLVLYDLENQTIGWTEYNCSSS 436
             +S        G+ + G++  Q   VLYDL+N  + +    C +S
Sbjct: 430 MGSSSQ-----LGISIFGNIQQQETQVLYDLQNLAVSFVPTQCGAS 438

BLAST of Cp4.1LG02g12240.1 vs. ExPASy Swiss-Prot
Match: Q9M9A8 (Aspartyl protease APCB1 OS=Arabidopsis thaliana OX=3702 GN=APCB1 PE=1 SV=1)

HSP 1 Score: 142.1 bits (357), Expect = 1.6e-32
Identity = 123/462 (26.62%), Postives = 198/462 (42.86%), Query Frame = 0

Query: 24  NLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDLEL--------------------- 83
           + VF V HK   R+      +         F+ ++DLEL                     
Sbjct: 129 SFVFPVYHKLRAREFHERILEEDLGLENENFVESMDLELVNPVKVNDVLSTSAGSIDSST 188

Query: 84  -----GGNGHPSESGLYFAKIGLGTP--AKDYYVQVDTGSDLLWVNC-AGCTNCPRKSDL 143
                GGN +P   GLY+ +I +G P   + Y++ +DTGS+L W+ C A CT+C + ++ 
Sbjct: 189 TIFPVGGNVYP--DGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGAN- 248

Query: 144 GIELTLYNPSSSSTSNLVTCDQDFCTSSHDGPI-PGCTPDLNCEYTVSYGDGSSTTGYFV 203
                LY P      NLV   + FC       +   C     C+Y + Y D S + G   
Sbjct: 249 ----QLYKPRK---DNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLT 308

Query: 204 EDRVVLDQVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLAS 263
           +D+  L    G+         IVFGCG  Q G L  T    DGILG  ++  S+  QLAS
Sbjct: 309 KDKFHLKLHNGSL----AESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLAS 368

Query: 264 LGKVRRIFAHCL-DNIKGGGIFAIGEVVQPKVHTT--PLVQQQA--HYNVAMKAIEVGGD 323
            G +  +  HCL  ++ G G   +G  + P    T  P++       Y + +  +  G  
Sbjct: 369 RGIISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQG 428

Query: 324 MLNLPTDVFDTDLSR--GTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFT- 383
           ML+L     D +  R    + D+G++  Y P++ Y  L+  +    +GL+L   +   T 
Sbjct: 429 MLSL-----DGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSL-QEVSGLELTRDDSDETL 488

Query: 384 --CFQYNGN--------IDDGFPDVTFHFEG-----SLSLRVRPREYLFDIDSDKWCVGW 433
             C++   N        +   F  +T          S  L ++P +YL   +    C+G 
Sbjct: 489 PICWRAKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGI 548

BLAST of Cp4.1LG02g12240.1 vs. NCBI nr
Match: XP_023525733.1 (aspartic proteinase-like protein 2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 973 bits (2516), Expect = 0.0
Identity = 482/482 (100.00%), Postives = 482/482 (100.00%), Query Frame = 0

Query: 1   MEIARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL 60
           MEIARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL
Sbjct: 1   MEIARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL 60

Query: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120
           ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL
Sbjct: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120

Query: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD 180
           YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD
Sbjct: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD 180

Query: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRI 240
           QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRI
Sbjct: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRI 240

Query: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300
           FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD
Sbjct: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300

Query: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV 360
           LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV
Sbjct: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV 360

Query: 361 TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420
           TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL
Sbjct: 361 TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420

Query: 421 ENQTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF 480
           ENQTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF
Sbjct: 421 ENQTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF 480

Query: 481 TN 482
           TN
Sbjct: 481 TN 482

BLAST of Cp4.1LG02g12240.1 vs. NCBI nr
Match: KAG6607130.1 (Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. sororia] >KAG7036820.1 Aspartic proteinase-like protein 2 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 971 bits (2509), Expect = 0.0
Identity = 480/482 (99.59%), Postives = 481/482 (99.79%), Query Frame = 0

Query: 1   MEIARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL 60
           MEIARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL
Sbjct: 1   MEIARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL 60

Query: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120
           ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL
Sbjct: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120

Query: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD 180
           YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD
Sbjct: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD 180

Query: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRI 240
           QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMI QLASLGKVRRI
Sbjct: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMISQLASLGKVRRI 240

Query: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300
           FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD
Sbjct: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300

Query: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV 360
           LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV
Sbjct: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV 360

Query: 361 TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420
           TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL
Sbjct: 361 TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420

Query: 421 ENQTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF 480
           ENQTIGWTEYNCSSSIKVRDEKSGG+YTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF
Sbjct: 421 ENQTIGWTEYNCSSSIKVRDEKSGGVYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF 480

Query: 481 TN 482
           TN
Sbjct: 481 TN 482

BLAST of Cp4.1LG02g12240.1 vs. NCBI nr
Match: XP_022948415.1 (aspartic proteinase-like protein 2 [Cucurbita moschata])

HSP 1 Score: 965 bits (2494), Expect = 0.0
Identity = 477/482 (98.96%), Postives = 479/482 (99.38%), Query Frame = 0

Query: 1   MEIARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL 60
           M+IARFSIVNLLL ISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL
Sbjct: 1   MDIARFSIVNLLLGISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL 60

Query: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120
           ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGC NCPRKSDLGIELTL
Sbjct: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCANCPRKSDLGIELTL 120

Query: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD 180
           YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD
Sbjct: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD 180

Query: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRI 240
           QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMI QLASLGKVRRI
Sbjct: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMISQLASLGKVRRI 240

Query: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300
           FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD
Sbjct: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300

Query: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV 360
           LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV
Sbjct: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV 360

Query: 361 TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420
           TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL
Sbjct: 361 TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420

Query: 421 ENQTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF 480
           ENQTIGWTEYNCSSSIKVRDEKSGG+YTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF
Sbjct: 421 ENQTIGWTEYNCSSSIKVRDEKSGGVYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF 480

Query: 481 TN 482
           TN
Sbjct: 481 TN 482

BLAST of Cp4.1LG02g12240.1 vs. NCBI nr
Match: XP_022998947.1 (aspartic proteinase-like protein 2 [Cucurbita maxima])

HSP 1 Score: 945 bits (2443), Expect = 0.0
Identity = 470/482 (97.51%), Postives = 473/482 (98.13%), Query Frame = 0

Query: 1   MEIARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL 60
           MEIARFSIVNLLLVISFLS G CNLVFKVQHKFNGRQRSLSAF AHDMHRRGRFLSAIDL
Sbjct: 1   MEIARFSIVNLLLVISFLSIGGCNLVFKVQHKFNGRQRSLSAFTAHDMHRRGRFLSAIDL 60

Query: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120
           ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL
Sbjct: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120

Query: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD 180
           YNPSSSSTSNLVTCDQDFCTSSHDGP P CTPDLNCEYTVSYGDGSSTTG+FVEDRVVLD
Sbjct: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPNPDCTPDLNCEYTVSYGDGSSTTGHFVEDRVVLD 180

Query: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRI 240
           QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMI QLASLGKVRRI
Sbjct: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMISQLASLGKVRRI 240

Query: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300
           FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD
Sbjct: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300

Query: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV 360
           LSRGTIIDSGTTLAYLPDEIYEPLIAKIFA QTGLKL++VEKQFTCFQYNGNIDDGFPDV
Sbjct: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFARQTGLKLYSVEKQFTCFQYNGNIDDGFPDV 360

Query: 361 TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420
           TFHFEGSLSLRVRPREYLFDIDSD WCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL
Sbjct: 361 TFHFEGSLSLRVRPREYLFDIDSDNWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420

Query: 421 ENQTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF 480
           ENQTIGWTEYNCSSSIKV DEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF
Sbjct: 421 ENQTIGWTEYNCSSSIKVMDEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF 480

Query: 481 TN 482
           TN
Sbjct: 481 TN 482

BLAST of Cp4.1LG02g12240.1 vs. NCBI nr
Match: KAG6598979.1 (Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 837 bits (2163), Expect = 4.31e-304
Identity = 411/484 (84.92%), Postives = 444/484 (91.74%), Query Frame = 0

Query: 1   MEIARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL 60
           MEIAR ++V+ LL ISFLS+GDCNLVF V HKF GR+RSL AFKAHD+ RRGRFLSAIDL
Sbjct: 1   MEIARLAVVSFLLAISFLSTGDCNLVFNVHHKFKGRERSLEAFKAHDVLRRGRFLSAIDL 60

Query: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120
            LGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL
Sbjct: 61  NLGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120

Query: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD 180
           YNPSSSSTSNLVTC QDFCTS++DGPIPGC P+L CEY V+YGDGSSTTGYFV+D VVLD
Sbjct: 121 YNPSSSSTSNLVTCGQDFCTSTYDGPIPGCRPELLCEYKVAYGDGSSTTGYFVKDHVVLD 180

Query: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRI 240
           +VTGNF+T STNGSIVFGCGAQQSGQLGATSAA+DGILGFGQ+NSSMI QLAS GK++RI
Sbjct: 181 RVTGNFQTESTNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKIKRI 240

Query: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300
           FAHCLDNI GGGIFAIGEVVQPKV TTPLVQQQAHYNV MKAIEVG ++LNLPTDVFDTD
Sbjct: 241 FAHCLDNINGGGIFAIGEVVQPKVRTTPLVQQQAHYNVFMKAIEVGNEVLNLPTDVFDTD 300

Query: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV 360
           L +GTIIDSGTTLAYLPD IYEPLIAKIFA QTGLKLHTVE+QFTCF+Y+GN+DDGFP +
Sbjct: 301 LKKGTIIDSGTTLAYLPDVIYEPLIAKIFARQTGLKLHTVEEQFTCFEYDGNVDDGFPTI 360

Query: 361 TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420
           TFHFE SLSL V P EYLFDI S+KWCVGWQNSGAQSRDGK MILLGDLVLQNRLVLYDL
Sbjct: 361 TFHFEDSLSLTVHPHEYLFDIASNKWCVGWQNSGAQSRDGKDMILLGDLVLQNRLVLYDL 420

Query: 421 ENQTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLF--ILMLTMLH 480
           ENQTIGWTEYNCSSSIKVRDE SG IYTVGPH+LSSAS++R G  S +L   +L+LT+LH
Sbjct: 421 ENQTIGWTEYNCSSSIKVRDEHSGAIYTVGPHNLSSASTVRVGTISSMLLSILLLLTVLH 480

Query: 481 SFTN 482
           SFTN
Sbjct: 481 SFTN 484

BLAST of Cp4.1LG02g12240.1 vs. ExPASy TrEMBL
Match: A0A6J1G9S6 (aspartic proteinase-like protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111452105 PE=3 SV=1)

HSP 1 Score: 965 bits (2494), Expect = 0.0
Identity = 477/482 (98.96%), Postives = 479/482 (99.38%), Query Frame = 0

Query: 1   MEIARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL 60
           M+IARFSIVNLLL ISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL
Sbjct: 1   MDIARFSIVNLLLGISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL 60

Query: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120
           ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGC NCPRKSDLGIELTL
Sbjct: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCANCPRKSDLGIELTL 120

Query: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD 180
           YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD
Sbjct: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD 180

Query: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRI 240
           QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMI QLASLGKVRRI
Sbjct: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMISQLASLGKVRRI 240

Query: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300
           FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD
Sbjct: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300

Query: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV 360
           LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV
Sbjct: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV 360

Query: 361 TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420
           TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL
Sbjct: 361 TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420

Query: 421 ENQTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF 480
           ENQTIGWTEYNCSSSIKVRDEKSGG+YTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF
Sbjct: 421 ENQTIGWTEYNCSSSIKVRDEKSGGVYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF 480

Query: 481 TN 482
           TN
Sbjct: 481 TN 482

BLAST of Cp4.1LG02g12240.1 vs. ExPASy TrEMBL
Match: A0A6J1K9F9 (aspartic proteinase-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111493453 PE=3 SV=1)

HSP 1 Score: 945 bits (2443), Expect = 0.0
Identity = 470/482 (97.51%), Postives = 473/482 (98.13%), Query Frame = 0

Query: 1   MEIARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL 60
           MEIARFSIVNLLLVISFLS G CNLVFKVQHKFNGRQRSLSAF AHDMHRRGRFLSAIDL
Sbjct: 1   MEIARFSIVNLLLVISFLSIGGCNLVFKVQHKFNGRQRSLSAFTAHDMHRRGRFLSAIDL 60

Query: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120
           ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL
Sbjct: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120

Query: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD 180
           YNPSSSSTSNLVTCDQDFCTSSHDGP P CTPDLNCEYTVSYGDGSSTTG+FVEDRVVLD
Sbjct: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPNPDCTPDLNCEYTVSYGDGSSTTGHFVEDRVVLD 180

Query: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRI 240
           QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMI QLASLGKVRRI
Sbjct: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMISQLASLGKVRRI 240

Query: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300
           FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD
Sbjct: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300

Query: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV 360
           LSRGTIIDSGTTLAYLPDEIYEPLIAKIFA QTGLKL++VEKQFTCFQYNGNIDDGFPDV
Sbjct: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFARQTGLKLYSVEKQFTCFQYNGNIDDGFPDV 360

Query: 361 TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420
           TFHFEGSLSLRVRPREYLFDIDSD WCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL
Sbjct: 361 TFHFEGSLSLRVRPREYLFDIDSDNWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420

Query: 421 ENQTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF 480
           ENQTIGWTEYNCSSSIKV DEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF
Sbjct: 421 ENQTIGWTEYNCSSSIKVMDEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF 480

Query: 481 TN 482
           TN
Sbjct: 481 TN 482

BLAST of Cp4.1LG02g12240.1 vs. ExPASy TrEMBL
Match: A0A6J1G3Q0 (aspartic proteinase-like protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111450495 PE=3 SV=1)

HSP 1 Score: 834 bits (2155), Expect = 3.45e-303
Identity = 410/484 (84.71%), Postives = 443/484 (91.53%), Query Frame = 0

Query: 1   MEIARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL 60
           M IAR ++V+ LLVIS LS+GDCNLVF V HKF GR+RSL AFKAHD+ RRGRFLSAIDL
Sbjct: 1   MAIARLAVVSFLLVISLLSTGDCNLVFNVHHKFKGRERSLEAFKAHDVLRRGRFLSAIDL 60

Query: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120
            LGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL
Sbjct: 61  NLGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120

Query: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD 180
           YNPSSSSTSNLVTC QDFCTS++DGPIPGC P+L CEY V+YGDGSSTTGYFV+D VVLD
Sbjct: 121 YNPSSSSTSNLVTCGQDFCTSTYDGPIPGCRPELLCEYKVAYGDGSSTTGYFVKDHVVLD 180

Query: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRI 240
           +VTGNF+T STNGSIVFGCGAQQSGQLGATSAA+DGILGFGQ+NSSMI QLAS GK++RI
Sbjct: 181 RVTGNFQTESTNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKIKRI 240

Query: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300
           FAHCLDNI GGGIFAIGEVVQPKV TTPLVQQQAHYNV MKAIEVG ++LNLPTDVFDTD
Sbjct: 241 FAHCLDNINGGGIFAIGEVVQPKVRTTPLVQQQAHYNVFMKAIEVGNEVLNLPTDVFDTD 300

Query: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV 360
           L +GTIIDSGTTLAYLPD IYEPLIAKIFA QTGLKLHTVE+QFTCF+Y+GN+DDGFP +
Sbjct: 301 LKKGTIIDSGTTLAYLPDVIYEPLIAKIFARQTGLKLHTVEEQFTCFEYDGNVDDGFPTI 360

Query: 361 TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420
           TFHFE SLSL V P EYLFDI S+KWCVGWQNSGAQSRDGK MILLGDLVLQNRLVLYDL
Sbjct: 361 TFHFEDSLSLTVHPHEYLFDIASNKWCVGWQNSGAQSRDGKDMILLGDLVLQNRLVLYDL 420

Query: 421 ENQTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLF--ILMLTMLH 480
           ENQTIGWTEYNCSSSIKVRDE SG IYTVGPH+LSSAS++R G  S +L   +L+LT+LH
Sbjct: 421 ENQTIGWTEYNCSSSIKVRDEHSGAIYTVGPHNLSSASTVRVGTISSMLLSILLLLTVLH 480

Query: 481 SFTN 482
           SFTN
Sbjct: 481 SFTN 484

BLAST of Cp4.1LG02g12240.1 vs. ExPASy TrEMBL
Match: A0A6J1IBA4 (aspartic proteinase-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111471359 PE=3 SV=1)

HSP 1 Score: 832 bits (2148), Expect = 4.02e-302
Identity = 410/484 (84.71%), Postives = 442/484 (91.32%), Query Frame = 0

Query: 1   MEIARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL 60
           MEIAR ++V+ LLVIS LS+GDCNLVF V HKF GR+RSL AFKAHD+ RRGRFLSAIDL
Sbjct: 1   MEIARLAVVSFLLVISLLSTGDCNLVFNVHHKFKGRERSLEAFKAHDVLRRGRFLSAIDL 60

Query: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120
            LGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL
Sbjct: 61  NLGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120

Query: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD 180
           YNPSSSSTSNLVTC QDFCTS++DGPIPGC P+L CEY V+YGDGSSTTGYFV+D VVLD
Sbjct: 121 YNPSSSSTSNLVTCGQDFCTSTYDGPIPGCRPELLCEYKVAYGDGSSTTGYFVKDHVVLD 180

Query: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRI 240
           +VTGNF+T STNGSIVFGCGAQQS QLGATSAA+DGILGFGQ+NSSMI QLAS GK++RI
Sbjct: 181 RVTGNFKTESTNGSIVFGCGAQQSVQLGATSAALDGILGFGQANSSMISQLASSGKIKRI 240

Query: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300
           FAHCLDNI GGGIFAIGEVVQPKV TTPLVQQQAHYNV MKAIEVG ++LNLPTDVFDTD
Sbjct: 241 FAHCLDNINGGGIFAIGEVVQPKVRTTPLVQQQAHYNVFMKAIEVGDEVLNLPTDVFDTD 300

Query: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV 360
           L +GTIIDSGTTLAYLPD IYEPLIAKIFA QTGLKLHTVE+QFTCF+Y+GN+DDGFP +
Sbjct: 301 LRKGTIIDSGTTLAYLPDVIYEPLIAKIFARQTGLKLHTVEEQFTCFEYDGNVDDGFPTI 360

Query: 361 TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420
           TFHFE SLSL V P EYLFDI S+KWCVGWQNSGAQSRDGK MILLGDLVLQNRLVLYDL
Sbjct: 361 TFHFEDSLSLTVHPHEYLFDIASNKWCVGWQNSGAQSRDGKDMILLGDLVLQNRLVLYDL 420

Query: 421 ENQTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLF--ILMLTMLH 480
           ENQTIGWTEYNCSSSIKVRDE SG IYTVGPH+L SASS+R G  S +L   +L+LT+LH
Sbjct: 421 ENQTIGWTEYNCSSSIKVRDEHSGAIYTVGPHNLYSASSVRVGTISSMLLSILLLLTVLH 480

Query: 481 SFTN 482
           SFTN
Sbjct: 481 SFTN 484

BLAST of Cp4.1LG02g12240.1 vs. ExPASy TrEMBL
Match: A0A0A0LJW2 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G099480 PE=3 SV=1)

HSP 1 Score: 828 bits (2140), Expect = 6.17e-301
Identity = 404/482 (83.82%), Postives = 440/482 (91.29%), Query Frame = 0

Query: 1   MEIARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL 60
           MEIARF++V+  LVISF SSGDCNLV KVQHKF GR+RSL AFKAHD+ RRGRFLSAIDL
Sbjct: 1   MEIARFAVVSFFLVISFFSSGDCNLVLKVQHKFKGRERSLEAFKAHDIQRRGRFLSAIDL 60

Query: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120
           +LGGNGHPSESGLYFAKIGLGTP +DYYVQVDTGSD+LWVNCAGCTNCP+KSDLGIEL+L
Sbjct: 61  QLGGNGHPSESGLYFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSL 120

Query: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD 180
           Y+PSSSSTSN VTC+QDFCTS++DGPIPGCTP+L CEY V+YGDGSST GYFV D VVLD
Sbjct: 121 YSPSSSSTSNRVTCNQDFCTSTYDGPIPGCTPELLCEYRVAYGDGSSTAGYFVRDHVVLD 180

Query: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRI 240
           +VTGNF+T STNGSIVFGCGAQQSGQLGATSAA+DGILGFGQ+NSSMI QLAS GKV+R+
Sbjct: 181 RVTGNFQTTSTNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRV 240

Query: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300
           FAHCLDNI GGGIFAIGEVVQPKV TTPLV QQAHYNV MKAIEV  ++LNLPTDVFDTD
Sbjct: 241 FAHCLDNINGGGIFAIGEVVQPKVRTTPLVPQQAHYNVFMKAIEVDNEVLNLPTDVFDTD 300

Query: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV 360
           L +GTIIDSGTTLAY PD IYEPLI+KIFA Q+ LKLHTVE+QFTCF+Y+GN+DDGFP V
Sbjct: 301 LRKGTIIDSGTTLAYFPDVIYEPLISKIFARQSTLKLHTVEEQFTCFEYDGNVDDGFPTV 360

Query: 361 TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420
           TFHFE SLSL V P EYLFDIDS+KWCVGWQNSGAQSRDGK MILLGDLVLQNRLV+YDL
Sbjct: 361 TFHFEDSLSLTVYPHEYLFDIDSNKWCVGWQNSGAQSRDGKDMILLGDLVLQNRLVMYDL 420

Query: 421 ENQTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF 480
           ENQTIGWTEYNCSSSIKVRDE SG IYTVG HDLSSASSLR  R SV+L IL+LT+LHSF
Sbjct: 421 ENQTIGWTEYNCSSSIKVRDEHSGAIYTVGSHDLSSASSLRVERISVLLLILLLTILHSF 480

Query: 481 TN 482
            N
Sbjct: 481 RN 482

BLAST of Cp4.1LG02g12240.1 vs. TAIR 10
Match: AT1G05840.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 553.5 bits (1425), Expect = 1.7e-157
Identity = 273/481 (56.76%), Postives = 344/481 (71.52%), Query Frame = 0

Query: 1   MEIARFSIVNLLLVISFLS---SGDCNL-VFKVQHKFNGRQRSLSAFKAHDMHRRGRFLS 60
           + I  F I     +I FL+   S  CN  VF V++++   Q SL+A K HD  R+   L+
Sbjct: 3   LSIVSFPICGRFTLIWFLTALVSVSCNPGVFNVKYRYPRLQGSLTALKEHDDRRQLTILA 62

Query: 61  AIDLELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGI 120
            IDL LGG G P   GLY+AKIG+GTPAK YYVQVDTGSD++WVNC  C  CPR+S LGI
Sbjct: 63  GIDLPLGGTGRPDIPGLYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGI 122

Query: 121 ELTLYNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDR 180
           ELTLYN   S +  LV+CD DFC     GP+ GC  +++C Y   YGDGSST GYFV+D 
Sbjct: 123 ELTLYNIDESDSGKLVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDV 182

Query: 181 VVLDQVTGNFRTASTNGSIVFGCGAQQSGQLGATS-AAVDGILGFGQSNSSMILQLASLG 240
           V  D V G+ +T + NGS++FGCGA+QSG L +++  A+DGILGFG++NSSMI QLAS G
Sbjct: 183 VQYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSG 242

Query: 241 KVRRIFAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTD 300
           +V++IFAHCLD   GGGIFAIG VVQPKV+ TPLV  Q HYNV M A++VG + L +P D
Sbjct: 243 RVKKIFAHCLDGRNGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPAD 302

Query: 301 VFDTDLSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDD 360
           +F     +G IIDSGTTLAYLP+ IYEPL+ KI + +  LK+H V+K + CFQY+G +D+
Sbjct: 303 LFQPGDRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYKCFQYSGRVDE 362

Query: 361 GFPDVTFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRL 420
           GFP+VTFHFE S+ LRV P +YLF  +   WC+GWQNS  QSRD + M LLGDLVL N+L
Sbjct: 363 GFPNVTFHFENSVFLRVYPHDYLFPHEG-MWCIGWQNSAMQSRDRRNMTLLGDLVLSNKL 422

Query: 421 VLYDLENQTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLT 477
           VLYDLENQ IGWTEYNCSSSIKV+DE +G ++ VG H +SSA  L T    +   +L++T
Sbjct: 423 VLYDLENQLIGWTEYNCSSSIKVKDEGTGTVHLVGSHFISSALPLDTSMCLLFSLLLLMT 482

BLAST of Cp4.1LG02g12240.1 vs. TAIR 10
Match: AT3G02740.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 542.0 bits (1395), Expect = 5.0e-154
Identity = 263/451 (58.31%), Postives = 331/451 (73.39%), Query Frame = 0

Query: 24  NLVFKVQHKFNG-RQRSLSAFKAHDMHRRGRFLSAIDLELGGNGHPSESGLYFAKIGLGT 83
           NLVF+V+ KF G R + L A +AHD+HR  R LSAID+ LGG+  P   GLYFAKIGLGT
Sbjct: 34  NLVFEVRSKFAGKRVKDLGALRAHDVHRHSRLLSAIDIPLGGDSQPESIGLYFAKIGLGT 93

Query: 84  PAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTLYNPSSSSTSNLVTCDQDFCTSS 143
           P++D++VQVDTGSD+LWVNCAGC  CPRKSDL +ELT Y+  +SST+  V+C  +FC  S
Sbjct: 94  PSRDFHVQVDTGSDILWVNCAGCIRCPRKSDL-VELTPYDVDASSTAKSVSCSDNFC--S 153

Query: 144 HDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLDQVTGNFRTASTNGSIVFGCGAQ 203
           +      C     C+Y + YGDGSST GY V+D V LD VTGN +T STNG+I+FGCG++
Sbjct: 154 YVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSK 213

Query: 204 QSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRIFAHCLDNIKGGGIFAIGEVVQP 263
           QSGQLG + AAVDGI+GFGQSNSS I QLAS GKV+R FAHCLDN  GGGIFAIGEVV P
Sbjct: 214 QSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGEVVSP 273

Query: 264 KVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTDLSRGTIIDSGTTLAYLPDEIYE 323
           KV TTP++ + AHY+V + AIEVG  +L L ++ FD+   +G IIDSGTTL YLPD +Y 
Sbjct: 274 KVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYN 333

Query: 324 PLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDVTFHFEGSLSLRVRPREYLFDID 383
           PL+ +I A    L LHTV++ FTCF Y   + D FP VTF F+ S+SL V PREYLF + 
Sbjct: 334 PLLNEILASHPELTLHTVQESFTCFHYTDKL-DRFPTVTFQFDKSVSLAVYPREYLFQVR 393

Query: 384 SDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDLENQTIGWTEYNCSSSIKVRDEK 443
            D WC GWQN G Q++ G  + +LGD+ L N+LV+YD+ENQ IGWT +NCS  I+V+DE+
Sbjct: 394 EDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCSGGIQVKDEE 453

Query: 444 SGGIYTVGPHDLSSASSLRTGRTSVVLFILM 474
           SG IYTVG H+LS +SSL   +   ++ +L+
Sbjct: 454 SGAIYTVGAHNLSWSSSLAITKLLTLVSLLI 480

BLAST of Cp4.1LG02g12240.1 vs. TAIR 10
Match: AT5G36260.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 503.8 bits (1296), Expect = 1.5e-142
Identity = 255/480 (53.12%), Postives = 323/480 (67.29%), Query Frame = 0

Query: 3   IARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDLEL 62
           I+R   V  +LVI  +S    N VF V HKF G+++ LS  K+HD  R  R L+ IDL L
Sbjct: 10  ISRIVAVVFVLVIQVVSG---NFVFNVTHKFAGKEKQLSELKSHDSFRHARMLANIDLPL 69

Query: 63  GGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTLYN 122
           GG+      GLYF KI LG+P K+YYVQVDTGSD+LWVNCA C  CP K+DLGI L+LY+
Sbjct: 70  GGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYD 129

Query: 123 PSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLDQV 182
             +SSTS  V C+ DFC+         C     C Y V YGDGS++ G F++D + L+QV
Sbjct: 130 SKTSSTSKNVGCEDDFCSFIMQSET--CGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQV 189

Query: 183 TGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRIFA 242
           TGN RTA     +VFGCG  QSGQLG T +AVDGI+GFGQSN+S+I QLA+ G  +RIF+
Sbjct: 190 TGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFS 249

Query: 243 HCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTDLS 302
           HCLDN+ GGGIFA+GEV  P V TTP+V  Q HYNV +K ++V GD ++LP  +  T+  
Sbjct: 250 HCLDNMNGGGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGD 309

Query: 303 RGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDVTF 362
            GTIIDSGTTLAYLP  +Y  LI KI A Q  +KLH V++ F CF +  N D  FP V  
Sbjct: 310 GGTIIDSGTTLAYLPQNLYNSLIEKITAKQQ-VKLHMVQETFACFSFTSNTDKAFPVVNL 369

Query: 363 HFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDLEN 422
           HFE SL L V P +YLF +  D +C GWQ+ G  ++DG  +ILLGDLVL N+LV+YDLEN
Sbjct: 370 HFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLEN 429

Query: 423 QTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSFTN 482
           + IGW ++NCSSSIKV+D  SG  Y +G  +L SA+S     T V L  +++ + HSFT+
Sbjct: 430 EVIGWADHNCSSSIKVKD-GSGAAYQLGAENLISAASSVMNGTLVTLLSILIWVFHSFTS 482

BLAST of Cp4.1LG02g12240.1 vs. TAIR 10
Match: AT1G65240.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 471.1 bits (1211), Expect = 1.1e-132
Identity = 231/482 (47.93%), Postives = 322/482 (66.80%), Query Frame = 0

Query: 1   MEIARFSIVNLLLVISFLSSGDCNLVFKVQHKFNGRQRSLSAFKAHDMHRRGRFLSAIDL 60
           ME+ R   + + + +  +     N VFK QHKF G++++L  FK+HD  R  R L++IDL
Sbjct: 1   MELRRKLCIVVAVFVIVIEFASANFVFKAQHKFAGKKKNLEHFKSHDTRRHSRMLASIDL 60

Query: 61  ELGGNGHPSESGLYFAKIGLGTPAKDYYVQVDTGSDLLWVNCAGCTNCPRKSDLGIELTL 120
            LGG+      GLYF KI LG+P K+Y+VQVDTGSD+LW+NC  C  CP K++L   L+L
Sbjct: 61  PLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSL 120

Query: 121 YNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTPDLNCEYTVSYGDGSSTTGYFVEDRVVLD 180
           ++ ++SSTS  V CD DFC  S       C P L C Y + Y D S++ G F+ D + L+
Sbjct: 121 FDMNASSTSKKVGCDDDFC--SFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLE 180

Query: 181 QVTGNFRTASTNGSIVFGCGAQQSGQLGATSAAVDGILGFGQSNSSMILQLASLGKVRRI 240
           QVTG+ +T      +VFGCG+ QSGQLG   +AVDG++GFGQSN+S++ QLA+ G  +R+
Sbjct: 181 QVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRV 240

Query: 241 FAHCLDNIKGGGIFAIGEVVQPKVHTTPLVQQQAHYNVAMKAIEVGGDMLNLPTDVFDTD 300
           F+HCLDN+KGGGIFA+G V  PKV TTP+V  Q HYNV +  ++V G  L+LP  +    
Sbjct: 241 FSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIV--- 300

Query: 301 LSRGTIIDSGTTLAYLPDEIYEPLIAKIFAGQTGLKLHTVEKQFTCFQYNGNIDDGFPDV 360
            + GTI+DSGTTLAY P  +Y+ LI  I A Q  +KLH VE+ F CF ++ N+D+ FP V
Sbjct: 301 RNGGTIVDSGTTLAYFPKVLYDSLIETILARQP-VKLHIVEETFQCFSFSTNVDEAFPPV 360

Query: 361 TFHFEGSLSLRVRPREYLFDIDSDKWCVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDL 420
           +F FE S+ L V P +YLF ++ + +C GWQ  G  + +   +ILLGDLVL N+LV+YDL
Sbjct: 361 SFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDL 420

Query: 421 ENQTIGWTEYNCSSSIKVRDEKSGGIYTVGPHDLSSASSLRTGRTSVVLFILMLTMLHSF 480
           +N+ IGW ++NCSSSIK++D  SGG+Y+VG  +LSSA  L     + +L IL   ++ +F
Sbjct: 421 DNEVIGWADHNCSSSIKIKD-GSGGVYSVGADNLSSAPRLL--MITKLLTILSPLIVMAF 473

Query: 481 TN 483
           T+
Sbjct: 481 TS 473

BLAST of Cp4.1LG02g12240.1 vs. TAIR 10
Match: AT5G22850.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 329.7 bits (844), Expect = 3.9e-90
Identity = 171/419 (40.81%), Postives = 240/419 (57.28%), Query Frame = 0

Query: 37  QRSLSAFKAHDMHRRGRFLSA----IDLELGGNGHPSESGLYFAKIGLGTPAKDYYVQVD 96
           +  LS  KA D  R GR L +    ID  + G   P   GLY+ K+ LGTP +D+YVQVD
Sbjct: 40  EMELSQLKARDEARHGRLLQSLGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYVQVD 99

Query: 97  TGSDLLWVNCAGCTNCPRKSDLGIELTLYNPSSSSTSNLVTCDQDFCTSSHDGPIPGCTP 156
           TGSD+LWV+CA C  CP+ S L I+L  ++P SS T++ ++C    C+        GC+ 
Sbjct: 100 TGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSV 159

Query: 157 DLN-CEYTVSYGDGSSTTGYFVEDRVVLDQVTGNFRTASTNGSIVFGCGAQQSGQLGATS 216
             N C YT  YGDGS T+G++V D +  D + G+    ++   +VFGC   Q+G L  + 
Sbjct: 160 QNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSD 219

Query: 217 AAVDGILGFGQSNSSMILQLASLGKVRRIFAHCLDNIK-GGGIFAIGEVVQPKVHTTPLV 276
            AVDGI GFGQ   S+I QLAS G   R+F+HCL     GGGI  +GE+V+P +  TPLV
Sbjct: 220 RAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGEIVEPNMVFTPLV 279

Query: 277 QQQAHYNVAMKAIEVGGDMLNLPTDVFDTDLSRGTIIDSGTTLAYLPDEIYEPLIAKIFA 336
             Q HYNV + +I V G  L +   VF T   +GTIID+GTTLAYL +  Y P +  I  
Sbjct: 280 PSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITN 339

Query: 337 GQTGLKLHTVEKQFTCFQYNGNIDDGFPDVTFHFEGSLSLRVRPREYLFDID----SDKW 396
             +      V K   C+    ++ D FP V+ +F G  S+ + P++YL   +    +  W
Sbjct: 340 AVSQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVW 399

Query: 397 CVGWQNSGAQSRDGKGMILLGDLVLQNRLVLYDLENQTIGWTEYNCSSSIKVRDEKSGG 446
           C+G+Q         +G+ +LGDLVL++++ +YDL  Q IGW  Y+CS+S+ V    S G
Sbjct: 400 CIGFQRI-----QNQGITILGDLVLKDKIFVYDLVGQRIGWANYDCSTSVNVSATSSSG 453

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q4V3D22.1e-14153.13Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1[more]
Q9S9K41.5e-13147.93Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2[more]
Q9LS407.0e-3629.82Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Q766C22.9e-3429.80Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q9M9A81.6e-3226.62Aspartyl protease APCB1 OS=Arabidopsis thaliana OX=3702 GN=APCB1 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
XP_023525733.10.0100.00aspartic proteinase-like protein 2 [Cucurbita pepo subsp. pepo][more]
KAG6607130.10.099.59Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. soror... [more]
XP_022948415.10.098.96aspartic proteinase-like protein 2 [Cucurbita moschata][more]
XP_022998947.10.097.51aspartic proteinase-like protein 2 [Cucurbita maxima][more]
KAG6598979.14.31e-30484.92Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. soror... [more]
Match NameE-valueIdentityDescription
A0A6J1G9S60.098.96aspartic proteinase-like protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111452105... [more]
A0A6J1K9F90.097.51aspartic proteinase-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111493453 P... [more]
A0A6J1G3Q03.45e-30384.71aspartic proteinase-like protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111450495... [more]
A0A6J1IBA44.02e-30284.71aspartic proteinase-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111471359 P... [more]
A0A0A0LJW26.17e-30183.82Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G09948... [more]
Match NameE-valueIdentityDescription
AT1G05840.11.7e-15756.76Eukaryotic aspartyl protease family protein [more]
AT3G02740.15.0e-15458.31Eukaryotic aspartyl protease family protein [more]
AT5G36260.11.5e-14253.13Eukaryotic aspartyl protease family protein [more]
AT1G65240.11.1e-13247.93Eukaryotic aspartyl protease family protein [more]
AT5G22850.13.9e-9040.81Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 305..316
score: 42.62
coord: 404..419
score: 29.42
coord: 80..100
score: 48.68
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 9..450
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 74..257
e-value: 4.8E-41
score: 140.9
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 258..439
e-value: 1.7E-45
score: 156.7
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 64..257
e-value: 9.5E-50
score: 171.3
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 69..436
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 275..428
e-value: 6.6E-24
score: 84.5
NoneNo IPR availablePANTHERPTHR13683:SF685EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEINcoord: 9..450
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 74..428
score: 45.513062
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 74..432
e-value: 2.87735E-67
score: 214.819

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cp4.1LG02g12240Cp4.1LG02g12240gene


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG02g12240.1:three_prime_utr:001Cp4.1LG02g12240.1:three_prime_utr:001three_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG02g12240.1:exon:010Cp4.1LG02g12240.1:exon:010exon
Cp4.1LG02g12240.1:exon:009Cp4.1LG02g12240.1:exon:009exon
Cp4.1LG02g12240.1:exon:008Cp4.1LG02g12240.1:exon:008exon
Cp4.1LG02g12240.1:exon:007Cp4.1LG02g12240.1:exon:007exon
Cp4.1LG02g12240.1:exon:006Cp4.1LG02g12240.1:exon:006exon
Cp4.1LG02g12240.1:exon:005Cp4.1LG02g12240.1:exon:005exon
Cp4.1LG02g12240.1:exon:004Cp4.1LG02g12240.1:exon:004exon
Cp4.1LG02g12240.1:exon:003Cp4.1LG02g12240.1:exon:003exon
Cp4.1LG02g12240.1:exon:002Cp4.1LG02g12240.1:exon:002exon
Cp4.1LG02g12240.1:exon:001Cp4.1LG02g12240.1:exon:001exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG02g12240.1:cds:010Cp4.1LG02g12240.1:cds:010CDS
Cp4.1LG02g12240.1:cds:009Cp4.1LG02g12240.1:cds:009CDS
Cp4.1LG02g12240.1:cds:008Cp4.1LG02g12240.1:cds:008CDS
Cp4.1LG02g12240.1:cds:007Cp4.1LG02g12240.1:cds:007CDS
Cp4.1LG02g12240.1:cds:006Cp4.1LG02g12240.1:cds:006CDS
Cp4.1LG02g12240.1:cds:005Cp4.1LG02g12240.1:cds:005CDS
Cp4.1LG02g12240.1:cds:004Cp4.1LG02g12240.1:cds:004CDS
Cp4.1LG02g12240.1:cds:003Cp4.1LG02g12240.1:cds:003CDS
Cp4.1LG02g12240.1:cds:002Cp4.1LG02g12240.1:cds:002CDS
Cp4.1LG02g12240.1:cds:001Cp4.1LG02g12240.1:cds:001CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG02g12240.1:five_prime_utr:001Cp4.1LG02g12240.1:five_prime_utr:001five_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cp4.1LG02g12240.1Cp4.1LG02g12240.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004190 aspartic-type endopeptidase activity