Cp4.1LG20g02270 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG20g02270
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: protease Do-like 9
Location: Cp4.1LG20: 1306482 .. 1313815 (+)
RNA-Seq Expression: Cp4.1LG20g02270
Synteny: Cp4.1LG20g02270
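
The Location field can be turned into a sequence identifier, coordinates, and strand with a few lines of code. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this record does not state explicitly.

```python
import re


def parse_location(text):
    """Split 'seqid: start .. end (strand)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG20: 1306482 .. 1313815 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 7334 bases on the forward strand of Cp4.1LG20.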
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAAGAAGTTAGTATAAAGAGAGGAGCCATTTGCGCGCTCCAATCTACCTTTCCCTTATTCTGAGTTTCGGGGCTCAGCGGAGGAGCGGAAATTGAAAACCCATTTTCTCCACTCTGATAAACCCTCTCCGATCTCTCTCCCCTTCAATTCTTCGGAGATGGGAGAAGTCAAACGTAAAAGAGGTCGAAAGCCGAAAAATTCCAAGGCGGATTCTCTAGATTTTCCTCCGCCGACCACTGCTACCGTCACCAGTGTTGCGATGGTTATGGACGACGTTTTTTCAGTCAGCAACGTCGAGCTCATGGACCCGGCCTCCACTTCTAAACCTCACCAGAACCGCCGTGGGAGGCCGAAGAAGCTTTCGAAACATTTGGAAAATCCCGACAAGTTCCCGCAATTGTCTCCTTCTAGACGTGGCCCTCGTGGTGTCGAAAATGGCGAATTTACCGCTTCCGGCGACGTCTTTCCATCTGATATCGCTTTGGAACGGGTGCAGCCGGAGTGGCCGGGCATCGTGAGAGTCGTGCCGGCGATGGATGCTGTGGTCAAGGTATTTTGTGTGCACACGGAGCCGAATTTCTCACTTCCTTGGCAGAGAAAGAGACAATATAGTTCGAGTAGCAGTGGATTTGTGATTGGTGGAAGGAGAGTGCTCACTAATGCTCATTCCGTCGAGCATCATACCCAGGTTAAGCTTAAGAAGCGTGGGTCGGATACGAAGTACTTGGCGACCGTACTTGCGATTGGAACTGAATGCGATATCGGTAATGATATTCGTTGTCCTTCATTTTTCTCTCCTTCTTCGTTGTCTTTGGGTTTTTATTAAATTCACGAGGTTCGAAATGGAATATGACGGTGAAATCTTCACATTAGATCATGCTCGACCTATTTTCTATGTTTATCTCAATCTGAAATTATGCGAACGTTATCATGCATTCCGGCTATTCATGACCATATTGTTGCTAGAGTTGAGAATACTGGTCATATCTTCGTGGGCTTAACTGCACGGAAATTCATTATTTAGGCTTAGGAAATGGTGAAATTCATGTTGGTTTAGGAAAGGCTCGATAACATGTTATATACACTAAACTTGGTTATTTTAGATGAGATTCAGTGTGCACGTATATTTGGTTGCTATTAAGAAACTTAATGATGGATAAATATGTAAGCTTAATATGAAAGAGAGACAAGGATTGATAAAACTCTCCATAATTAAAGATGACTGGTGTAAGTGGATCAGGATGCATGTAATTATAGGCTTACCTACCATTTATGTTGTTGCCTTGTACTTGCATGAATTTATTGTTCCATTCATTTATTGTGTTGTGAACTCAAAGCATTGAGTATTATGATGTATGTGTTGCAGCAATGCTTACTGTTGATGATGACGAGTTTTGGGTGGGAGTTTCACCGGTAGAATTTGGGGAGTTACCTGCACTGCAAGATGCAGTAACTGTTGTTGGTTACCCAATTGGAGGTGATACAATTTCTGTCACAAGTGGGGTTGTATCGCGGATAGAGATCCTGTCTTATGTTCATGGGTCTACTGAGCTTCTCGGTCTGCAGGTTTTGAGCTTTTGAGACTCGACCATATTTGCTGCTATTTAGTTTTTTAGGAATAAGGTTAATGGCCGATGCTTTACTTCAACTTTTTTCTGCTGTAGATAGATGCTGCTATAAACTCGGGTAATTCTGGTGGGCCTGCCTTTAATGATAAAGGAAACTGTGTGGGCATTGCATTTCAGTCGCTCAAGCATGAAGATGCAGAAAATATAGGTTATGTCATACCAACGCCAGTTATCTTGCATTTTATACGGGATTACGAGAAGAATGGAGCATATACAGGTATTTGGGCACTTGGAGTGGGATGAAGATCAGACTGTGGTTCTTTTATGTTATCGTCACTGATTGGCATTGGTCTTTGTTAATGATTTACTTGGTTCTTTACTTAAAGCAATAACATTCTTGGATGCCAGGCTTTCCAATTCTTGGTCTTG
AGTGGCAGAAAATGGAAAATCCTGATCTTTGTGAGGCTATGGGGATGAAAAAAGATCAGAAGGGTGTCCGTATTAGACGTATTGATCCCACTGGCCCAGAATCCAAAGTTTTGAAGCCATCAGATATTATTCTCGGCTTTGATGGGGTTGATATTGCTAATGATGGAACAGGTAAATATATGTTTTATCATTTTAACACAATGTAAAAACACAATCCTTGAACTGGCAACCATCTATAGACTACTTTAGTCTCTTGTTGAATAATAGTAATTACATATTTGGATTACTTCCATTAACACTGACAAGATTAGCAAAATATACCTACGTATGGCTCACTAATTACCAAAAGTACAAAGAAAGGTAAATAGACTAACCTAATCAATAGGATGACATGTTATCTTAGGATGAGCAATATGGGAAGGTGATTGATTACAATGTCCATTCCATAATCCATGAAGATGAAATCAGAGAAATAGAAACCTCCTACTATAGCAAAATTCACTTCCCTTTCCCACAAAAATTCTTTTGTTTCCTTCTAGCTATAATTCCTAAAGCCAAGGTATTATACTGCACTTTGGCTTTCTTCATGGAGAGTAAATCACCAACGAATTGAGTAAAAAGCTCCCCTACGGTAGAAGCCTGGAGGTTAAAGCATAATTGCTGAGTACCGTTATTCGAAGCAATATTTTTAAAAAGCAAGCTTTTCGCATCTTTGTTTCTTTTCTTCTTCTAAATACGAAAAGAGAAAAATGGTGGGTGCCTATTTCTTTGGAAGATTTCTCATCTCATTTATTAAAATTTCCTTTATGTCATGCTTCATTATACAATATTTTTTATATATTTTCTTTTCTTTTCACCCCACTTCACCGAAGCTGGCTCTTTTTTCGGTTTTTTTGGCATCTTGTATTGCACTTAATCTCTAGAGGCTATTGAGCATTTAATATATCTGGCGCCTTTATAAAACAGTGATTTGCGGGCTAATGCTAGTGCATGGCAGAATCTAATCCTTAGATTGTTTCTTCTGTCTGTTTTGGAAAAAGAATTCTGTAGGTTCCTCTTCTTTTTTATTTGATAGCCCACTCTCTGTTGCTGTCTTGCAGTTCCTTTCCGGCACGGTGAGCGTATAGGATTCAGTTACCTTGTCTCCCAGAAATATACTGGTGATAGTGCAACAATAAAAGTTCTGCGCAACTCTGAGACACTCAGTTTTAATTACCAGCTTGCAACATACAGAAGGCTCATTCCTGCACATAATGAGGGCAAACCCCCTTCTTATTACATTATTGCAGGATTTGTTTTTTCCACTGTCTCCGTTCCTTATCTCCGTTCTGAGGTTTGTTCCCAGAATACTTTCCCTTCCCCCCAAATTCTATGACAAGGCTTTAGTTGTTTTTTTTTTTCTCTTGATGATTTTCTTTTCTGCCTTTTTTAGATTTTACATGATGGATTTTAAAAGGGTATGAAATAGAATCTTAATGTGTTAAAAGTTTGTTTATAAAAAATTTTCATGAAATTTTGAAGTAGCATAATGTTGTTTGGCATGATTCTCTTATTGTCTGTGCAACTTCTATAGCTCTTGGCTCTTGGCTCTCTATGCACAGTTTTTACCTGCTCTTTCGGTACTGTTTCATTATTATATTGTGAGTGGAGAGTAAATCAAACTTTGAAACTTGTAGCAGCAGCATCCTTGAGAACACTATAGCTGTCTTGATGTGTCAGATCTTAGGCCTATTAAGACGATTCATTCTTGTGTGGTATGTTCTGGATGAGTTTCTTGGGGATTTCTTTCAGGAGCTAGTTTGGAGGGTGGCTTGCATTGAGGTTTTTTTCTTTTTGGTTTTTTACAATTTCATGGATGGTATGAAATTACAAAAGAAGGGGAACCCCAAAAGAGAGATTACAAAGGGCCTCCCCAATTGGAGAAGAGAGGAGAGGAGAGAAGAGAAGAGTAGCTATAGGGGTGGAAAAAGAGGGGAACAGTTACAGCAAGAAACAACCTT
ATTGATTATAGCCTAATAAAATTATGGAAGGAGAGACTTTTATGTTGGCAGATACGCCTATTTCTTTCTTGCCAAGTGTTTCAATAGAAAACTTTAATAAAATTCTCCCACAGAGCCTTCTTTTCTTTCTTGAAGGGATATTCCCCAAGGGTGAGATTAAGCAGTAACCGTGGCATTAGTTGAGTTAGTTGAAGGAAAATGGAGGCTTAGCATTGGAGGTATGGGAAAAAGAAATGTTGCCTTCCTCTGTGAAAGGCTTTACAGGGTAAATAGACATCAGTATATGAACGAATCTTTCGATAGCAAGTTTATGGAACAATCCTGAGTGTTTAAGCAAATAATTTCAAATCAGTCGTAAATACTGAATGGTGACAGCATAGTACAAAAGAGAATAAATTTTAGTAGAGATCACTAGCATCACTCTTCTTCTCAAATGTTCACTTTTACGAGTTATGATTTGTATATATATGCATGCGCGCGCACACACACACATATTTGAATTTTGAAGTCTTTTACTGATTTGATGTTAAGTTTTCTTACACCATGATACTGAACTTTCTGAAAATGCTTCGAATTGTAGTACGGAAAGGATTATGAATATGAAGCTCCAGTCAAACTATTGGACAAATTATTGCATTCAATGCCACAATCACCAGATGAGCAGCTAGTGGTGGTTTCTCAGGTTCTCTCCTTTACTCTCTTCTCTGTCTTCACTTCAGTGCGCTACATAATGTTGTGTATTTCTTTTTGGAAACTATATAATGTGTATTCTTTTTGGACTGTCAACTTCTCAACCTATTCTCCTTTTATCATGTAAATTTCTTGCAGGTACTCGTGGCTGATATCAACATTGGATATGAAGACATTGTTAACACCCAGGTTTGTATTATATCTTGCTAATAGCCTACGGAAAGACACCATTTTAACTGCACGTAATTACAATGATTGATCATCTTTGTGCACCATAGGTTCTTGCTTTCAATGGTAAACCTGTGAAGAACCTCAAGAGCTTGGCTAACATGGTTGAAAGTTGCGATGATGAGTTTTTGAAGTTCGATTTAGAATATCAACAGGTTCTATCCCCACTCCTGTCTTAAGAATCATCTCCTCCATTTTTTTTAATGATATCCCAAGTCTGATTTTTTTGTTTGCTGCAAATCTTCCAGATAGTTGTCCTCCGTACAAGCACAGCGAAAGCAGCCACTTTAGATATTCTGGCCACACACTGCATACCCTCAGCTATGTCTAACGATCTCAAGACCTAACTTCAGAATGAAAAATTAGGTATACTTGATTACGTAGGTTATCATATTTCGTTTTCCACCCTGCCGGGTATGCCCAGGTTCTGAAGGGGTTCCAGTATGCGTATGCTTTCCTTCTTCATGACGTTGGGTTTAATACGAATGGTAGGAAGTCGAAGTTCGAAAGGATTAGGATGAAGAATAGAATGAATCGAGAGTTAAAATTGCATTGTATCGACTTCTTAGCTCTTGGAAGTTTTGATTAGATGATATTAGTTGTTGGTAAGGCGGATTTGATGGTTTTCTCAATCCATCTTTTACCATTGATAATGGTATGAAAATTAGCCAACTTCTCTAAGTAGCTTGAAGTATGAAGCATGGTGTAGTTTCTCCTCATGGGTTGTTTTTTCTCACGAATAATAATGTGATGAAACATATCCTTATAAAACTTTATTGGGTACCATAACCTTAACCAAGCTTGCACTCCCTTCTCCTTAATGTTCTTTCACTTTTTACATTAAAGCTAGGTGTCATAGTCCTTTTTATATATGTTAAATGTATGAGATGATTTTAGGATTACTTTTTAAAGACATGTAGACCAAATCTAAAGAGGTATGGTAGTTTGTGTACTCACCCATCACATAATGCAAAAAGTTGTTATCCTCTTCCCACGTGTAATTCCACATCGAGTGGGCCTTCCTCCTTAATCTCATTATACAATCTTGAACTACGATACAAGAAATTCTGACCTTAACTAAA
GCGTTAGTATCCAAGACTGACCGAATTTACCATGTTAGCTGTCATGAACTTCTAACGACAATAACTAATAGGTAAACTAGCTTGCTTAAGCTTCTTTCAGAAGTGTGTCAATGGTAGTTGTCTACGGCTAAGACATTGTACCTTGTCATCCATGAATTTGTTCGGAGTTACAAAGCTTCGGTACCACGACGACCTTTCGGGTAACTTGGTCTGACATATGATGAAAGAGGTGAGTTATTCTTATTGTCTTGTTATCTCCTTCATGTTGAGGCATATGATAAAGTCATTTGTATTCAACATTAAATACAGATAAGGGAGAGACAAGGTAGAAGTTAGAGAGGTGGATGTATTACTTCCTTTCTTCCCATTCTCTTCGATTCCACATGGGGTTTTATTTGGCAGCGTTGAAGTTTCTTGTCCATTCCAAGACTTAATACTTTATTTTGAGGTTTGAAGGGAATGTCTCATTATGAGGTAGGTCCCTGCCAGTCCATTGAGCTGTTTCTCCCTTTGCTTTCAGCATTCAATGAAATTTGTCTTTCTATCAGTCCATTGCTATCTGCAGATGTATTTGCACAGATGAAATGATCCATTGTTGAATCAGAAACACTTCTACTGACACAACAATCATCTAACTCTGCTGGACTGATCATATCTCTGTTCTTGTTTAGAATCTGAAAGTTTGAATGAATCTGCTCTGCTAGTGTGTTTTGAACTGAATTGATTCATTCCTGAGTGACCCAAATGGTTATACAGATCAGAATTCCACTTCCCTTCTCTTCAAGGAACTGGATAAGCCACAGCATGTGGGTGATGACCAAGTAACCAGCTCGTGAGCTGAAGGACAGATGTTGGAATGCTGGCAAACAAACAAGTGGCCATGATTTTGCATTCAAAATCTGCCTGACACGTGTACAGAGCCCACAACCCAGCTGAGCCTCTGTTCTGAGTTCTTATATAAACAAACCTCCAAACCATTCCCAAACTCATCTCTTTTAGTATTGAATCTATGGGTTCCTCTTGCCATCTGAATGTGCCTGCTTTGAAGAGGAAGAGGAGAGAAGCGCGCCAGGTTCGCCGGAAAGTGCAGAAGCTCCGGTGGGTGGTGCCCGGTGGGAGAGGGTTGCAACAGGAACATCTCTTCGCCAAAACGGCTCATTATATACTGCATTTGAGATTGAAAGTCTGTGCTCTTCAAACCATCCTCAAATTGGAGGATGAGGCTTGAGAAATGTATGATATGATTTGATATGATTTTCTCATCGTTGGGGAGTGTTTATTGTATATTTATATGGGTTTGGTATATTATATTTATAAATCATAAAATTAGTTGT

mRNA sequence

ATGAAAGAATTTCGGGGCTCAGCGGAGGAGCGGAAATTGAAAACCCATTTTCTCCACTCTGATAAACCCTCTCCGATCTCTCTCCCCTTCAATTCTTCGGAGATGGGAGAAGTCAAACGTAAAAGAGGTCGAAAGCCGAAAAATTCCAAGGCGGATTCTCTAGATTTTCCTCCGCCGACCACTGCTACCGTCACCAGTGTTGCGATGGTTATGGACGACGTTTTTTCAGTCAGCAACGTCGAGCTCATGGACCCGGCCTCCACTTCTAAACCTCACCAGAACCGCCGTGGGAGGCCGAAGAAGCTTTCGAAACATTTGGAAAATCCCGACAAGTTCCCGCAATTGTCTCCTTCTAGACGTGGCCCTCGTGGTGTCGAAAATGGCGAATTTACCGCTTCCGGCGACGTCTTTCCATCTGATATCGCTTTGGAACGGGTGCAGCCGGAGTGGCCGGGCATCGTGAGAGTCGTGCCGGCGATGGATGCTGTGGTCAAGGTATTTTGTGTGCACACGGAGCCGAATTTCTCACTTCCTTGGCAGAGAAAGAGACAATATAGTTCGAGTAGCAGTGGATTTGTGATTGGTGGAAGGAGAGTGCTCACTAATGCTCATTCCGTCGAGCATCATACCCAGGTTAAGCTTAAGAAGCGTGGGTCGGATACGAAGTACTTGGCGACCGTACTTGCGATTGGAACTGAATGCGATATCGCAATGCTTACTGTTGATGATGACGAGTTTTGGGTGGGAGTTTCACCGGTAGAATTTGGGGAGTTACCTGCACTGCAAGATGCAGTAACTGTTGTTGGTTACCCAATTGGAGGTGATACAATTTCTGTCACAAGTGGGGTTGTATCGCGGATAGAGATCCTGTCTTATGTTCATGGGTCTACTGAGCTTCTCGGTCTGCAGATAGATGCTGCTATAAACTCGGGTAATTCTGGTGGGCCTGCCTTTAATGATAAAGGAAACTGTGTGGGCATTGCATTTCAGTCGCTCAAGCATGAAGATGCAGAAAATATAGGTTATGTCATACCAACGCCAGTTATCTTGCATTTTATACGGGATTACGAGAAGAATGGAGCATATACAGGCTTTCCAATTCTTGGTCTTGAGTGGCAGAAAATGGAAAATCCTGATCTTTGTGAGGCTATGGGGATGAAAAAAGATCAGAAGGGTGTCCGTATTAGACGTATTGATCCCACTGGCCCAGAATCCAAAGTTTTGAAGCCATCAGATATTATTCTCGGCTTTGATGGGGTTGATATTGCTAATGATGGAACAGTTCCTTTCCGGCACGGTGAGCGTATAGGATTCAGTTACCTTGTCTCCCAGAAATATACTGGTGATAGTGCAACAATAAAAGTTCTGCGCAACTCTGAGACACTCAGTTTTAATTACCAGCTTGCAACATACAGAAGGCTCATTCCTGCACATAATGAGGGCAAACCCCCTTCTTATTACATTATTGCAGGATTTGTTTTTTCCACTGTCTCCGTTCCTTATCTCCGTTCTGAGTACGGAAAGGATTATGAATATGAAGCTCCAGTCAAACTATTGGACAAATTATTGCATTCAATGCCACAATCACCAGATGAGCAGCTAGTGGTGGTTTCTCAGGTACTCGTGGCTGATATCAACATTGGATATGAAGACATTGTTAACACCCAGGTTCTTGCTTTCAATGGTAAACCTGTGAAGAACCTCAAGAGCTTGGCTAACATGGTTGAAAGTTGCGATGATGAGTTTTTGAAGTTCGATTTAGAATATCAACAGATAGTTGTCCTCCGTACAAGCACAGCGAAAGCAGCCACTTTAGATATTCTGGCCACACACTGCATACCCTCAGCTATGTCTAACGATCTCAAGACCTAACTTCAGAATGAAAAATTAGGTATACTTGATTACGTAGGTTATCATATTTCGTTTTCCACCCTGCCGGGTATGCCCAGGTTCTGAAGGGGTTCCAGTATGCGTATGCTTTCCTTCTTCATGACGTTGGG
TTTAATACGAATGGTAGGAAGTCGAAGTTCGAAAGGATTAGGATGAAGAATAGAATGAATCGAGAGTTAAAATTGCATTGTATCGACTTCTTAGCTCTTGGAAGTTTTGATTAGATGATATTAGTTGTTGGTAAGGCGGATTTGATGGTTTTCTCAATCCATCTTTTACCATTGATAATGGTATGAAAATTAGCCAACTTCTCTAAGTAGCTTGAAGTATGAAGCATGGTGTAGTTTCTCCTCATGGGTTGTTTTTTCTCACGAATAATAATGTGATGAAACATATCCTTATAAAACTTTATTGGGTACCATAACCTTAACCAAGCTTGCACTCCCTTCTCCTTAATGTTCTTTCACTTTTTACATTAAAGCTAGGTGTCATAGTCCTTTTTATATATGTTAAATGTATGAGATGATTTTAGGATTACTTTTTAAAGACATGTAGACCAAATCTAAAGAGGTATGGTAGTTTGTGTACTCACCCATCACATAATGCAAAAAGTTGTTATCCTCTTCCCACGTGTAATTCCACATCGAGTGGGCCTTCCTCCTTAATCTCATTATACAATCTTGAACTACGATACAAGAAATTCTGACCTTAACTAAAGCGTTAGTATCCAAGACTGACCGAATTTACCATGTTAGCTGTCATGAACTTCTAACGACAATAACTAATAGGTAAACTAGCTTGCTTAAGCTTCTTTCAGAAGTGTGTCAATGGTAGTTGTCTACGGCTAAGACATTGTACCTTGTCATCCATGAATTTGTTCGGAGTTACAAAGCTTCGGTACCACGACGACCTTTCGGGTAACTTGGTCTGACATATGATGAAAGAGATAAGGGAGAGACAAGGTAGAAGTTAGAGAGGTGGATGTATTACTTCCTTTCTTCCCATTCTCTTCGATTCCACATGGGGTTTTATTTGGCAGCGTTGAAGTTTCTTGTCCATTCCAAGACTTAATACTTTATTTTGAGGTTTGAAGGGAATGTCTCATTATGAGATCAGAATTCCACTTCCCTTCTCTTCAAGGAACTGGATAAGCCACAGCATGTGGGTGATGACCAAGTAACCAGCTCGTGAGCTGAAGGACAGATGTTGGAATGCTGGCAAACAAACAAGTGGCCATGATTTTGCATTCAAAATCTGCCTGACACGTGTACAGAGCCCACAACCCAGCTGAGCCTCTGTTCTGAGTTCTTATATAAACAAACCTCCAAACCATTCCCAAACTCATCTCTTTTAGTATTGAATCTATGGGTTCCTCTTGCCATCTGAATGTGCCTGCTTTGAAGAGGAAGAGGAGAGAAGCGCGCCAGGTTCGCCGGAAAGTGCAGAAGCTCCGGTGGGTGGTGCCCGGTGGGAGAGGGTTGCAACAGGAACATCTCTTCGCCAAAACGGCTCATTATATACTGCATTTGAGATTGAAAGTCTGTGCTCTTCAAACCATCCTCAAATTGGAGGATGAGGCTTGAGAAATGTATGATATGATTTGATATGATTTTCTCATCGTTGGGGAGTGTTTATTGTATATTTATATGGGTTTGGTATATTATATTTATAAATCATAAAATTAGTTGT

Coding sequence (CDS)

ATGAAAGAATTTCGGGGCTCAGCGGAGGAGCGGAAATTGAAAACCCATTTTCTCCACTCTGATAAACCCTCTCCGATCTCTCTCCCCTTCAATTCTTCGGAGATGGGAGAAGTCAAACGTAAAAGAGGTCGAAAGCCGAAAAATTCCAAGGCGGATTCTCTAGATTTTCCTCCGCCGACCACTGCTACCGTCACCAGTGTTGCGATGGTTATGGACGACGTTTTTTCAGTCAGCAACGTCGAGCTCATGGACCCGGCCTCCACTTCTAAACCTCACCAGAACCGCCGTGGGAGGCCGAAGAAGCTTTCGAAACATTTGGAAAATCCCGACAAGTTCCCGCAATTGTCTCCTTCTAGACGTGGCCCTCGTGGTGTCGAAAATGGCGAATTTACCGCTTCCGGCGACGTCTTTCCATCTGATATCGCTTTGGAACGGGTGCAGCCGGAGTGGCCGGGCATCGTGAGAGTCGTGCCGGCGATGGATGCTGTGGTCAAGGTATTTTGTGTGCACACGGAGCCGAATTTCTCACTTCCTTGGCAGAGAAAGAGACAATATAGTTCGAGTAGCAGTGGATTTGTGATTGGTGGAAGGAGAGTGCTCACTAATGCTCATTCCGTCGAGCATCATACCCAGGTTAAGCTTAAGAAGCGTGGGTCGGATACGAAGTACTTGGCGACCGTACTTGCGATTGGAACTGAATGCGATATCGCAATGCTTACTGTTGATGATGACGAGTTTTGGGTGGGAGTTTCACCGGTAGAATTTGGGGAGTTACCTGCACTGCAAGATGCAGTAACTGTTGTTGGTTACCCAATTGGAGGTGATACAATTTCTGTCACAAGTGGGGTTGTATCGCGGATAGAGATCCTGTCTTATGTTCATGGGTCTACTGAGCTTCTCGGTCTGCAGATAGATGCTGCTATAAACTCGGGTAATTCTGGTGGGCCTGCCTTTAATGATAAAGGAAACTGTGTGGGCATTGCATTTCAGTCGCTCAAGCATGAAGATGCAGAAAATATAGGTTATGTCATACCAACGCCAGTTATCTTGCATTTTATACGGGATTACGAGAAGAATGGAGCATATACAGGCTTTCCAATTCTTGGTCTTGAGTGGCAGAAAATGGAAAATCCTGATCTTTGTGAGGCTATGGGGATGAAAAAAGATCAGAAGGGTGTCCGTATTAGACGTATTGATCCCACTGGCCCAGAATCCAAAGTTTTGAAGCCATCAGATATTATTCTCGGCTTTGATGGGGTTGATATTGCTAATGATGGAACAGTTCCTTTCCGGCACGGTGAGCGTATAGGATTCAGTTACCTTGTCTCCCAGAAATATACTGGTGATAGTGCAACAATAAAAGTTCTGCGCAACTCTGAGACACTCAGTTTTAATTACCAGCTTGCAACATACAGAAGGCTCATTCCTGCACATAATGAGGGCAAACCCCCTTCTTATTACATTATTGCAGGATTTGTTTTTTCCACTGTCTCCGTTCCTTATCTCCGTTCTGAGTACGGAAAGGATTATGAATATGAAGCTCCAGTCAAACTATTGGACAAATTATTGCATTCAATGCCACAATCACCAGATGAGCAGCTAGTGGTGGTTTCTCAGGTACTCGTGGCTGATATCAACATTGGATATGAAGACATTGTTAACACCCAGGTTCTTGCTTTCAATGGTAAACCTGTGAAGAACCTCAAGAGCTTGGCTAACATGGTTGAAAGTTGCGATGATGAGTTTTTGAAGTTCGATTTAGAATATCAACAGATAGTTGTCCTCCGTACAAGCACAGCGAAAGCAGCCACTTTAGATATTCTGGCCACACACTGCATACCCTCAGCTATGTCTAACGATCTCAAGACCTAA

Protein sequence

MKEFRGSAEERKLKTHFLHSDKPSPISLPFNSSEMGEVKRKRGRKPKNSKADSLDFPPPTTATVTSVAMVMDDVFSVSNVELMDPASTSKPHQNRRGRPKKLSKHLENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIVRVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKLKKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGGDTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKHEDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVRIRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIKVLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEAPVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANMVESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT
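
The protein sequence above corresponds to the coding sequence read three bases at a time. As a minimal sketch of that relationship, the code below translates a DNA string using the standard genetic code; the `translate` function and `CODON_TABLE` name are hypothetical helpers written for this illustration, and the use of the standard nuclear code is an assumption, not something stated in the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS up to (not including) the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons and last four codons of the CDS shown above.
print(translate("ATGAAAGAATTTCGG"))
print(translate("CTCAAGACCTAA"))
```

The two fragments translate to MKEFR and LKT, matching the start and end of the listed protein sequence.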
Homology
BLAST of Cp4.1LG20g02270 vs. ExPASy Swiss-Prot
Match: Q9FL12 (Protease Do-like 9 OS=Arabidopsis thaliana OX=3702 GN=DEGP9 PE=1 SV=1)

HSP 1 Score: 837.0 bits (2161), Expect = 1.4e-241
Identity = 429/594 (72.22%), Positives = 486/594 (81.82%), Query Frame = 0

Query: 41  KRGRKPKNSKADSLDFP----PPTTATVTSVAMVMDDVFSVSNVELMDPASTSKPHQNRR 100
           KRGRK K   A S +         +A   S+    + V           AS + P  +RR
Sbjct: 6   KRGRKHKRQDASSAENAGGEVKEASANEASLPQSPEPV----------SASEANPSPSRR 65

Query: 101 GRPKKLSKHLENPDK-------FPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPE 160
            R +   + L N  +        P+ S SR      +NG+  ++G +  +        P 
Sbjct: 66  SRGRGKKRRLNNESEAGNQRTSSPERSRSRLHHSDTKNGD-CSNGMIVSTTTESIPAAPS 125

Query: 161 WPGIVRVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHH 220
           W  +V+VVP+MDAVVKVFCVHTEPNFSLPWQRKRQYSS SSGF+IGGRRVLTNAHSVEHH
Sbjct: 126 WETVVKVVPSMDAVVKVFCVHTEPNFSLPWQRKRQYSSGSSGFIIGGRRVLTNAHSVEHH 185

Query: 221 TQVKLKKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVG 280
           TQVKLKKRGSDTKYLATVLAIGTECDIA+LTV DDEFW GVSPVEFG+LPALQDAVTVVG
Sbjct: 186 TQVKLKKRGSDTKYLATVLAIGTECDIALLTVTDDEFWEGVSPVEFGDLPALQDAVTVVG 245

Query: 281 YPIGGDTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAF 340
           YPIGGDTISVTSGVVSR+EILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKG CVGIAF
Sbjct: 246 YPIGGDTISVTSGVVSRMEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGKCVGIAF 305

Query: 341 QSLKHEDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKD 400
           QSLKHEDAENIGYVIPTPVI+HFI+DYEK+  YTGFP+LG+EWQKMENPDL ++MGM+  
Sbjct: 306 QSLKHEDAENIGYVIPTPVIVHFIQDYEKHDKYTGFPVLGIEWQKMENPDLRKSMGMESH 365

Query: 401 QKGVRIRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGD 460
           QKGVRIRRI+PT PES+VLKPSDIIL FDGV+IANDGTVPFRHGERIGFSYL+SQKYTGD
Sbjct: 366 QKGVRIRRIEPTAPESQVLKPSDIILSFDGVNIANDGTVPFRHGERIGFSYLISQKYTGD 425

Query: 461 SATIKVLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKD 520
           SA +KVLRN E L FN +LA ++RLIPAH  GKPPSY+I+AGFVF+TVSVPYLRSEYGK+
Sbjct: 426 SALVKVLRNKEILEFNIKLAIHKRLIPAHISGKPPSYFIVAGFVFTTVSVPYLRSEYGKE 485

Query: 521 YEYEAPVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLK 580
           YE++APVKLL+K LH+M QS DEQLVVVSQVLV+DINIGYE+IVNTQV+AFNGKPVKNLK
Sbjct: 486 YEFDAPVKLLEKHLHAMAQSVDEQLVVVSQVLVSDINIGYEEIVNTQVVAFNGKPVKNLK 545

Query: 581 SLANMVESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 624
            LA MVE+C+DE++KF+L+Y QIVVL T TAK ATLDIL THCIPSAMS+DLKT
Sbjct: 546 GLAGMVENCEDEYMKFNLDYDQIVVLDTKTAKEATLDILTTHCIPSAMSDDLKT 588
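
The percentages in each HSP summary line are the matched-column count divided by the alignment length. The sketch below, with a hypothetical `parse_hsp_stats` helper, reads the "label = matched/length (percent%)" fields from such a line and recomputes each percentage; it assumes the line follows the layout used in this record and is not tied to any particular BLAST parser library.

```python
import re

# Matches fields such as "Identity = 429/594 (72.22%)"; the label is not fixed.
STAT_FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {label: {matched, length, reported, recomputed}} for one line."""
    stats = {}
    for label, matched, length, reported in STAT_FIELD.findall(line):
        matched, length = int(matched), int(length)
        stats[label] = {
            "matched": matched,
            "length": length,
            "reported": float(reported),
            "recomputed": round(100 * matched / length, 2),
        }
    return stats


line = "Identity = 429/594 (72.22%), Positives = 486/594 (81.82%), Query Frame = 0"
for label, values in parse_hsp_stats(line).items():
    print(label, values["reported"], values["recomputed"])
```

For the Q9FL12 hit, 429 identical columns out of 594 aligned columns give 72.22%, and 486 positive columns give 81.82%, in agreement with the reported values.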

BLAST of Cp4.1LG20g02270 vs. ExPASy Swiss-Prot
Match: O82261 (Protease Do-like 2, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DEGP2 PE=1 SV=2)

HSP 1 Score: 500.4 bits (1287), Expect = 3.0e-140
Identity = 245/477 (51.36%), Positives = 333/477 (69.81%), Query Frame = 0

Query: 145 RVQPEWPGIVRVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAH 204
           R Q   P  +     ++AVVKV+C HT P++SLPWQ++RQ++S+ S F+IG  ++LTNAH
Sbjct: 100 RDQQTDPAKIHDASFLNAVVKVYCTHTAPDYSLPWQKQRQFTSTGSAFMIGDGKLLTNAH 159

Query: 205 SVEHHTQVKLKKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDA 264
            VEH TQVK+K+RG D KY+A VL  G +CDIA+L+V+ ++FW G  P+  G LP LQD+
Sbjct: 160 CVEHDTQVKVKRRGDDRKYVAKVLVRGVDCDIALLSVESEDFWKGAEPLRLGHLPRLQDS 219

Query: 265 VTVVGYPIGGDTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNC 324
           VTVVGYP+GGDTISVT GVVSRIE+ SY HGS++LLG+QIDAAIN GNSGGPAFND+G C
Sbjct: 220 VTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGEC 279

Query: 325 VGIAFQSLKHEDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAM 384
           +G+AFQ  + E+ ENIGYVIPT V+ HF+ DYE+NG YTG+P LG+  QK+ENP L E +
Sbjct: 280 IGVAFQVYRSEETENIGYVIPTTVVSHFLTDYERNGKYTGYPCLGVLLQKLENPALRECL 339

Query: 385 GMKKDQKGVRIRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQ 444
            +  ++ GV +RR++PT   SKVLK  D+I+ FD + +  +GTVPFR  ERI F YL+SQ
Sbjct: 340 KVPTNE-GVLVRRVEPTSDASKVLKEGDVIVSFDDLHVGCEGTVPFRSSERIAFRYLISQ 399

Query: 445 KYTGDSATIKVLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRS 504
           K+ GD A I ++R  E       L     L+P H +G  PSY I+AG VF+ +S P +  
Sbjct: 400 KFAGDIAEIGIIRAGEHKKVQVVLRPRVHLVPYHIDGGQPSYIIVAGLVFTPLSEPLIEE 459

Query: 505 EYGKDYEYEAPVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKP 564
           E     E    +KLL K  +S+ +   EQ+V++SQVL  ++NIGYED+ N QVL FNG P
Sbjct: 460 E----CEDTIGLKLLTKARYSVARFRGEQIVILSQVLANEVNIGYEDMNNQQVLKFNGIP 519

Query: 565 VKNLKSLANMVESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDL 622
           ++N+  LA++++ C D++L F+ E   + VL    + +A+L IL  + IPS  S DL
Sbjct: 520 IRNIHHLAHLIDMCKDKYLVFEFEDNYVAVLEREASNSASLCILKDYGIPSERSADL 571

BLAST of Cp4.1LG20g02270 vs. ExPASy Swiss-Prot
Match: Q9FIV6 (Protease Do-like 10, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=DEGP10 PE=2 SV=1)

HSP 1 Score: 441.4 bits (1134), Expect = 1.7e-122
Identity = 233/465 (50.11%), Positives = 310/465 (66.67%), Query Frame = 0

Query: 159 AMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKLKKRG 218
           A+D+VVK+F V T P++ LPWQ K Q  S  SGFVI GR+++TNAH V  H+ V ++K G
Sbjct: 111 ALDSVVKIFTVSTSPSYFLPWQNKSQRESMGSGFVISGRKIITNAHVVADHSFVLVRKHG 170

Query: 219 SDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGGDTIS 278
           S  K+ A V A+G ECD+A+L VD + FW G++ +E G++P LQ+AV VVGYP GGD IS
Sbjct: 171 SSIKHRAEVQAVGHECDLAILVVDSEVFWEGMNALELGDIPFLQEAVAVVGYPQGGDNIS 230

Query: 279 VTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCV-GIAFQSLKHEDA 338
           VT GVVSR+E   YVHG+T+L+ +QIDAAIN GNSGGPA    GN V G+AFQ+L    A
Sbjct: 231 VTKGVVSRVEPTQYVHGATQLMAIQIDAAINPGNSGGPAI--MGNKVAGVAFQNL--SGA 290

Query: 339 ENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVRIRR 398
           ENIGY+IPTPVI HFI   E+ G Y GF  +G+  Q MEN +L     M  +  GV + +
Sbjct: 291 ENIGYIIPTPVIKHFINGVEECGKYIGFCSMGVSCQPMENGELRSGFQMSSEMTGVLVSK 350

Query: 399 IDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIKVLR 458
           I+P     K+LK  D++L FDGV IANDGTVPFR+ ERI F +LVS K   ++A +KVLR
Sbjct: 351 INPLSDAHKILKKDDVLLAFDGVPIANDGTVPFRNRERITFDHLVSMKKPDETALVKVLR 410

Query: 459 NSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEAPVK 518
             +   F+  L   + L+P H   + PSYYI AGFVF  ++ PYL  EYG+D+   +P  
Sbjct: 411 EGKEHEFSITLRPLQPLVPVHQFDQLPSYYIFAGFVFVPLTQPYLH-EYGEDWYNTSPRT 470

Query: 519 LLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANMVES 578
           L  + L  +P+   +QLV+VSQVL+ DIN GYE +   QV   NG  V NL+ L  ++E+
Sbjct: 471 LCHRALKDLPKKAGQQLVIVSQVLMDDINTGYERLAELQVNKVNGVEVNNLRHLCQLIEN 530

Query: 579 CDDEFLKFDLEYQ-QIVVLRTSTAKAATLDILATHCIPSAMSNDL 622
           C+ E L+ DL+ + +++VL   +AK AT  IL  H I SA+S+DL
Sbjct: 531 CNTEKLRIDLDDESRVIVLNYQSAKIATSLILKRHRIASAISSDL 570

BLAST of Cp4.1LG20g02270 vs. ExPASy Swiss-Prot
Match: Q9SHZ1 (Putative protease Do-like 3, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=DEGP3 PE=3 SV=1)

HSP 1 Score: 392.5 bits (1007), Expect = 8.9e-108
Identity = 214/465 (46.02%), Positives = 288/465 (61.94%), Query Frame = 0

Query: 159 AMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKLKKRG 218
           A+++VVKVF V ++P    PWQ   Q  S+ SGFVI G+++LTNAH V + T VK++K G
Sbjct: 93  ALNSVVKVFTVSSKPRLFQPWQITMQSESTGSGFVISGKKILTNAHVVANQTSVKVRKHG 152

Query: 219 SDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGGDTIS 278
           S TKY A V A+G ECD+A+L +D+D+FW G++P+E G++P++QD V VVGYP GGDTIS
Sbjct: 153 STTKYKAKVQAVGHECDLAILEIDNDKFWEGMNPLELGDIPSMQDTVYVVGYPKGGDTIS 212

Query: 279 VTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCV-GIAFQSLKHEDA 338
           V+ GVVSR+  + Y H  TELL +QIDAAIN+GNSGGP     GN V G+AF+SL + D 
Sbjct: 213 VSKGVVSRVGPIKYSHSGTELLAIQIDAAINNGNSGGPVI--MGNKVAGVAFESLCYSD- 272

Query: 339 ENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVRIRR 398
            +IGY+IPTPVI HF+   E++G    F  + L +QKM+N  L +   M     G+ I +
Sbjct: 273 -SIGYIIPTPVIRHFLNAIEESGEDVSFGSINLTYQKMDNDQLRKDFKMSDKMTGILINK 332

Query: 399 IDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIKVLR 458
           I+P     KVLK  DIIL  DGV I ND +V FR  ERI F +LVS K   ++A +KVLR
Sbjct: 333 INPLSDVHKVLKKDDIILAIDGVPIGNDSSVHFRKKERITFKHLVSMKKPCETALLKVLR 392

Query: 459 NSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEAPVK 518
             +   FN  L +   L+P     K  SYYI  G VF  ++ PY+ S             
Sbjct: 393 EGKEYEFNSSLKSVPPLVPKRQYDKSASYYIFGGLVFLPLTKPYIDSSC----------- 452

Query: 519 LLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANMVES 578
           + +  L  MP+   EQ+V++SQ+L  DIN GY    + QV   NG  V NLK L  +VE 
Sbjct: 453 VSESALGKMPKKAGEQVVIISQILEDDINTGYSIFEDFQVKKVNGVQVHNLKHLYKLVEE 512

Query: 579 CDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLK 623
           C  E ++ DLE  +++ L   +AK  T  IL +  IPSA+S DL+
Sbjct: 513 CCTETVRMDLEKDKVITLDYKSAKKVTSKILKSLKIPSAVSEDLQ 542

BLAST of Cp4.1LG20g02270 vs. ExPASy Swiss-Prot
Match: Q9SHZ0 (Protease Do-like 4, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=DEGP4 PE=2 SV=1)

HSP 1 Score: 378.3 bits (970), Expect = 1.7e-103
Identity = 207/462 (44.81%), Positives = 277/462 (59.96%), Query Frame = 0

Query: 159 AMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKLKKRG 218
           A+++VVKVF V++ P+   PW+   Q  S  SGFVI G+++LTNAH V  H  ++++K G
Sbjct: 71  AVNSVVKVFTVYSMPSVLQPWRNWPQQESGGSGFVISGKKILTNAHVVADHIFLQVRKHG 130

Query: 219 SDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGGDTIS 278
           S TKY A V AIG ECD+A+L +D++EFW  + P+E GE+P+L ++V V GYP GGD++S
Sbjct: 131 SPTKYKAQVRAIGHECDLAILEIDNEEFWEDMIPLELGEIPSLDESVAVFGYPTGGDSVS 190

Query: 279 VTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKHEDAE 338
           +T G VSR+E   Y HG T LL +Q DAAIN GNSGGPA        G+AFQ  K   A+
Sbjct: 191 ITKGYVSRVEYTRYAHGGTTLLAIQTDAAINPGNSGGPAIIG-NKMAGVAFQ--KDPSAD 250

Query: 339 NIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVRIRRI 398
           NIGY+IPTPVI HF+   E+NG Y GF  L + +Q MEN  L     M  +  G+ I  I
Sbjct: 251 NIGYIIPTPVIKHFLTAVEENGQYGGFCTLDISYQLMENSQLRNHFKMGPEMTGILINEI 310

Query: 399 DPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIKVLRN 458
           +P     K L+  DIIL  D V I ND  V FR+ ERI F++ VS K   ++  ++VLR+
Sbjct: 311 NPLSDAYKRLRKDDIILAIDDVLIGNDAKVTFRNKERINFNHFVSMKKLDETVLLQVLRD 370

Query: 459 SETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEAPVKL 518
            +   F+  +     L+P H   K PSYYI AGFVF  ++ PY+ S             +
Sbjct: 371 GKEHEFHIMVKPVPPLVPGHQYDKLPSYYIFAGFVFVPLTQPYIDS-----------TLI 430

Query: 519 LDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANMVESC 578
            +  +  MP+   EQL     VL  DIN GY D  N +V+  NG  V+NLK L  +VE+C
Sbjct: 431 CNCAIKYMPEKAGEQL-----VLADDINAGYTDFKNLKVIKVNGVQVENLKHLTELVETC 490

Query: 579 DDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSND 621
             E L+ DLE +++VVL  + AK AT  IL  H IPSA   D
Sbjct: 491 WTEDLRLDLENEKVVVLNYANAKEATSLILELHRIPSANEYD 513

BLAST of Cp4.1LG20g02270 vs. NCBI nr
Match: XP_023519580.1 (protease Do-like 9 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1174 bits (3037), Expect = 0.0
Identity = 589/589 (100.00%), Positives = 589/589 (100.00%), Query Frame = 0

Query: 35  MGEVKRKRGRKPKNSKADSLDFPPPTTATVTSVAMVMDDVFSVSNVELMDPASTSKPHQN 94
           MGEVKRKRGRKPKNSKADSLDFPPPTTATVTSVAMVMDDVFSVSNVELMDPASTSKPHQN
Sbjct: 1   MGEVKRKRGRKPKNSKADSLDFPPPTTATVTSVAMVMDDVFSVSNVELMDPASTSKPHQN 60

Query: 95  RRGRPKKLSKHLENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV 154
           RRGRPKKLSKHLENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV
Sbjct: 61  RRGRPKKLSKHLENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV 120

Query: 155 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL 214
           RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL
Sbjct: 121 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL 180

Query: 215 KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 274
           KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG
Sbjct: 181 KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 240

Query: 275 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 334
           DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH
Sbjct: 241 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 300

Query: 335 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR 394
           EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR
Sbjct: 301 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR 360

Query: 395 IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK 454
           IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK
Sbjct: 361 IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK 420

Query: 455 VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA 514
           VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA
Sbjct: 421 VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA 480

Query: 515 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 574
           PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM
Sbjct: 481 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 540

Query: 575 VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 623
           VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT
Sbjct: 541 VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 589

BLAST of Cp4.1LG20g02270 vs. NCBI nr
Match: XP_022923859.1 (protease Do-like 9 isoform X1 [Cucurbita moschata] >KAG6584234.1 Protease Do-like 9, partial [Cucurbita argyrosperma subsp. sororia] >KAG7019831.1 Protease Do-like 9 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1172 bits (3033), Expect = 0.0
Identity = 588/589 (99.83%), Positives = 588/589 (99.83%), Query Frame = 0

Query: 35  MGEVKRKRGRKPKNSKADSLDFPPPTTATVTSVAMVMDDVFSVSNVELMDPASTSKPHQN 94
           MGEVKRKRGRKPKNSKADSLDFPPPTTAT TSVAMVMDDVFSVSNVELMDPASTSKPHQN
Sbjct: 1   MGEVKRKRGRKPKNSKADSLDFPPPTTATATSVAMVMDDVFSVSNVELMDPASTSKPHQN 60

Query: 95  RRGRPKKLSKHLENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV 154
           RRGRPKKLSKHLENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV
Sbjct: 61  RRGRPKKLSKHLENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV 120

Query: 155 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL 214
           RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL
Sbjct: 121 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL 180

Query: 215 KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 274
           KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG
Sbjct: 181 KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 240

Query: 275 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 334
           DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH
Sbjct: 241 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 300

Query: 335 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR 394
           EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR
Sbjct: 301 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR 360

Query: 395 IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK 454
           IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK
Sbjct: 361 IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK 420

Query: 455 VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA 514
           VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA
Sbjct: 421 VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA 480

Query: 515 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 574
           PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM
Sbjct: 481 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 540

Query: 575 VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 623
           VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT
Sbjct: 541 VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 589

BLAST of Cp4.1LG20g02270 vs. NCBI nr
Match: XP_023001040.1 (protease Do-like 9 [Cucurbita maxima])

HSP 1 Score: 1172 bits (3031), Expect = 0.0
Identity = 587/589 (99.66%), Positives = 588/589 (99.83%), Query Frame = 0

Query: 35  MGEVKRKRGRKPKNSKADSLDFPPPTTATVTSVAMVMDDVFSVSNVELMDPASTSKPHQN 94
           MGEVKRKRGRKPKNSKADSLDFPPPTTAT TSVAMVMDDVFSVSNVELMDPASTSKPHQN
Sbjct: 1   MGEVKRKRGRKPKNSKADSLDFPPPTTATATSVAMVMDDVFSVSNVELMDPASTSKPHQN 60

Query: 95  RRGRPKKLSKHLENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV 154
           RRGRPKKLSKH+ENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV
Sbjct: 61  RRGRPKKLSKHMENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV 120

Query: 155 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL 214
           RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL
Sbjct: 121 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL 180

Query: 215 KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 274
           KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG
Sbjct: 181 KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 240

Query: 275 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 334
           DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH
Sbjct: 241 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 300

Query: 335 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR 394
           EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR
Sbjct: 301 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR 360

Query: 395 IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK 454
           IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK
Sbjct: 361 IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK 420

Query: 455 VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA 514
           VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA
Sbjct: 421 VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA 480

Query: 515 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 574
           PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM
Sbjct: 481 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 540

Query: 575 VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 623
           VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT
Sbjct: 541 VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 589

BLAST of Cp4.1LG20g02270 vs. NCBI nr
Match: XP_038894378.1 (protease Do-like 9 [Benincasa hispida])

HSP 1 Score: 1120 bits (2898), Expect = 0.0
Identity = 560/588 (95.24%), Positives = 574/588 (97.62%), Query Frame = 0

Query: 35  MGEVKRKRGRKPKNSKADSLDFPPPTTATVTSVAMVMDDVFSVSNVELMDPASTSKPHQN 94
           MGE+KRKRGRKPK+SKA++LDFPPPTTAT TSVA+ MDDVFSVSNVELMDPASTSK HQN
Sbjct: 1   MGEIKRKRGRKPKDSKAEALDFPPPTTATSTSVAVAMDDVFSVSNVELMDPASTSKHHQN 60

Query: 95  RRGRPKKLSKHLENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV 154
           RRGRPKKLSKH+ENPDKFPQLSPSRRGPRGVENG+FTASGD  P+ I  ERVQPEWPGI 
Sbjct: 61  RRGRPKKLSKHVENPDKFPQLSPSRRGPRGVENGDFTASGDALPTSIVSERVQPEWPGIA 120

Query: 155 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL 214
           RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVI GRRVLTNAHSVEH+TQVKL
Sbjct: 121 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVISGRRVLTNAHSVEHYTQVKL 180

Query: 215 KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 274
           KKRGSDTKYLATVLAIGTECDIAMLTV+DDEFWVGVSPVEFGELPALQDAVTVVGYPIGG
Sbjct: 181 KKRGSDTKYLATVLAIGTECDIAMLTVEDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 240

Query: 275 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 334
           DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH
Sbjct: 241 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 300

Query: 335 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR 394
           EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDL EAMGMK+DQKGVR
Sbjct: 301 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLREAMGMKQDQKGVR 360

Query: 395 IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK 454
           IRRIDPTGPESKVLKP DIIL FDGVDIANDGTVPFRHGERIGFSYLVSQKYTG+SATIK
Sbjct: 361 IRRIDPTGPESKVLKPGDIILSFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGNSATIK 420

Query: 455 VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA 514
           VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYI+AGFVFSTVSVPYLRSE+GKDYEYEA
Sbjct: 421 VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIVAGFVFSTVSVPYLRSEHGKDYEYEA 480

Query: 515 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 574
           PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM
Sbjct: 481 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 540

Query: 575 VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLK 622
           VESCDDEFLKFDLEYQQIVVLRTSTAKAAT DILATHCIPSAMSNDLK
Sbjct: 541 VESCDDEFLKFDLEYQQIVVLRTSTAKAATSDILATHCIPSAMSNDLK 588
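
The percentages in each HSP statistics line are the stated counts divided by the alignment length (for this hit, 560 identical columns out of 588). The following Python sketch shows one way to read such a line and recompute the percentages. The function names `parse_hsp_stats` and `recomputed_pct` are hypothetical helpers written for this illustration and are not part of any BLAST tool; the pattern is written to accept either spelling of the positives label.

```python
import re

# Pattern for lines such as:
#   Identity = 560/588 (95.24%), Positives = 574/588 (97.62%), Query Frame = 0
# "Posi?tives" also accepts the "Postives" spelling found in some page exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, aligned, id_pct, positives, aligned2, pos_pct = match.groups()
    if aligned != aligned2:
        raise ValueError("identity and positives use different lengths")
    return {
        "identical": int(identical),
        "positives": int(positives),
        "aligned": int(aligned),
        "identity_pct": float(id_pct),
        "positives_pct": float(pos_pct),
    }


def recomputed_pct(count, aligned):
    """Percentage of aligned columns, rounded to two decimals as on the page."""
    return round(100.0 * count / aligned, 2)


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 560/588 (95.24%), Positives = 574/588 (97.62%), Query Frame = 0"
    )
    print(stats["identity_pct"], recomputed_pct(stats["identical"], stats["aligned"]))
```

Comparing the reported and recomputed values is a quick consistency check when these records are scraped into a table.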

BLAST of Cp4.1LG20g02270 vs. NCBI nr
Match: XP_022137201.1 (protease Do-like 9 [Momordica charantia] >XP_022137202.1 protease Do-like 9 [Momordica charantia])

HSP 1 Score: 1119 bits (2894), Expect = 0.0
Identity = 559/589 (94.91%), Positives = 571/589 (96.94%), Query Frame = 0

Query: 35  MGEVKRKRGRKPKNSKADSLDFPPPTTATVTSVAMVMDDVFSVSNVELMDPASTSKPHQN 94
           MGE+KRKRGRKPKNSKA++LDFPPPTTAT TS A+ MDDVFSV NVELMDPASTSK HQN
Sbjct: 1   MGEIKRKRGRKPKNSKAEALDFPPPTTATATSAAITMDDVFSVGNVELMDPASTSKHHQN 60

Query: 95  RRGRPKKLSKHLENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV 154
           RRGRPKKLSKHLENPDKFPQLSPSRR PRGVENGEFTASGDV P  I  ER QPEWPGI 
Sbjct: 61  RRGRPKKLSKHLENPDKFPQLSPSRRAPRGVENGEFTASGDVLPPAIVSERAQPEWPGIA 120

Query: 155 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL 214
           RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEH+TQVKL
Sbjct: 121 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHYTQVKL 180

Query: 215 KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 274
           KKRGSDTKYLATVLAIGTECDIA+LTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG
Sbjct: 181 KKRGSDTKYLATVLAIGTECDIALLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 240

Query: 275 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 334
           DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH
Sbjct: 241 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 300

Query: 335 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR 394
           EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDL +AMGMK+DQKGVR
Sbjct: 301 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLRKAMGMKQDQKGVR 360

Query: 395 IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK 454
           IRRIDPTGPESKVLKP+DIIL FDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK
Sbjct: 361 IRRIDPTGPESKVLKPADIILSFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK 420

Query: 455 VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA 514
           VLRN ETLSFNYQL+TYRRLIPAHNEG+PPSYYIIAGFVF+TVSVPYLRSEYGKDYEYEA
Sbjct: 421 VLRNCETLSFNYQLSTYRRLIPAHNEGRPPSYYIIAGFVFTTVSVPYLRSEYGKDYEYEA 480

Query: 515 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 574
           PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLK LA M
Sbjct: 481 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKCLATM 540

Query: 575 VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 623
           VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT
Sbjct: 541 VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 589
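
In these alignments the middle line of each three-line segment marks every column: a letter is an identical residue, `+` is a conservative substitution, and a space is a mismatch or gap. The Python sketch below counts these marks for one segment of the alignment above (query positions 455 to 514). The function name `midline_counts` is a hypothetical helper for this illustration, and it assumes the query string is passed in so that trailing spaces stripped from the middle line can be restored.

```python
def midline_counts(query, midline):
    """Return (identical, positives, columns) for one alignment segment.

    identical: columns where the middle line shows a letter
    positives: identical columns plus columns marked '+'
    columns:   segment width, taken from the query line
    """
    # Restore trailing spaces that text exports often strip from the midline.
    padded = midline.ljust(len(query))
    identical = sum(1 for ch in padded if ch.isalpha())
    conservative = padded.count("+")
    return identical, identical + conservative, len(query)


if __name__ == "__main__":
    query = "VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA"
    midline = "VLRN ETLSFNYQL+TYRRLIPAHNEG+PPSYYIIAGFVF+TVSVPYLRSEYGKDYEYEA"
    print(midline_counts(query, midline))
```

Summing these per-segment counts over a whole HSP should reproduce the Identity and Positives numerators in its statistics line.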

BLAST of Cp4.1LG20g02270 vs. ExPASy TrEMBL
Match: A0A6J1E7J6 (protease Do-like 9 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111431450 PE=3 SV=1)

HSP 1 Score: 1172 bits (3033), Expect = 0.0
Identity = 588/589 (99.83%), Positives = 588/589 (99.83%), Query Frame = 0

Query: 35  MGEVKRKRGRKPKNSKADSLDFPPPTTATVTSVAMVMDDVFSVSNVELMDPASTSKPHQN 94
           MGEVKRKRGRKPKNSKADSLDFPPPTTAT TSVAMVMDDVFSVSNVELMDPASTSKPHQN
Sbjct: 1   MGEVKRKRGRKPKNSKADSLDFPPPTTATATSVAMVMDDVFSVSNVELMDPASTSKPHQN 60

Query: 95  RRGRPKKLSKHLENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV 154
           RRGRPKKLSKHLENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV
Sbjct: 61  RRGRPKKLSKHLENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV 120

Query: 155 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL 214
           RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL
Sbjct: 121 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL 180

Query: 215 KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 274
           KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG
Sbjct: 181 KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 240

Query: 275 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 334
           DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH
Sbjct: 241 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 300

Query: 335 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR 394
           EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR
Sbjct: 301 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR 360

Query: 395 IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK 454
           IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK
Sbjct: 361 IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK 420

Query: 455 VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA 514
           VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA
Sbjct: 421 VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA 480

Query: 515 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 574
           PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM
Sbjct: 481 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 540

Query: 575 VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 623
           VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT
Sbjct: 541 VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 589

BLAST of Cp4.1LG20g02270 vs. ExPASy TrEMBL
Match: A0A6J1KJZ8 (protease Do-like 9 OS=Cucurbita maxima OX=3661 GN=LOC111495295 PE=3 SV=1)

HSP 1 Score: 1172 bits (3031), Expect = 0.0
Identity = 587/589 (99.66%), Positives = 588/589 (99.83%), Query Frame = 0

Query: 35  MGEVKRKRGRKPKNSKADSLDFPPPTTATVTSVAMVMDDVFSVSNVELMDPASTSKPHQN 94
           MGEVKRKRGRKPKNSKADSLDFPPPTTAT TSVAMVMDDVFSVSNVELMDPASTSKPHQN
Sbjct: 1   MGEVKRKRGRKPKNSKADSLDFPPPTTATATSVAMVMDDVFSVSNVELMDPASTSKPHQN 60

Query: 95  RRGRPKKLSKHLENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV 154
           RRGRPKKLSKH+ENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV
Sbjct: 61  RRGRPKKLSKHMENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV 120

Query: 155 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL 214
           RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL
Sbjct: 121 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL 180

Query: 215 KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 274
           KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG
Sbjct: 181 KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 240

Query: 275 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 334
           DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH
Sbjct: 241 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 300

Query: 335 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR 394
           EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR
Sbjct: 301 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR 360

Query: 395 IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK 454
           IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK
Sbjct: 361 IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK 420

Query: 455 VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA 514
           VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA
Sbjct: 421 VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA 480

Query: 515 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 574
           PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM
Sbjct: 481 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 540

Query: 575 VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 623
           VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT
Sbjct: 541 VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 589

BLAST of Cp4.1LG20g02270 vs. ExPASy TrEMBL
Match: A0A6J1C5U5 (protease Do-like 9 OS=Momordica charantia OX=3673 GN=LOC111008725 PE=3 SV=1)

HSP 1 Score: 1119 bits (2894), Expect = 0.0
Identity = 559/589 (94.91%), Positives = 571/589 (96.94%), Query Frame = 0

Query: 35  MGEVKRKRGRKPKNSKADSLDFPPPTTATVTSVAMVMDDVFSVSNVELMDPASTSKPHQN 94
           MGE+KRKRGRKPKNSKA++LDFPPPTTAT TS A+ MDDVFSV NVELMDPASTSK HQN
Sbjct: 1   MGEIKRKRGRKPKNSKAEALDFPPPTTATATSAAITMDDVFSVGNVELMDPASTSKHHQN 60

Query: 95  RRGRPKKLSKHLENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV 154
           RRGRPKKLSKHLENPDKFPQLSPSRR PRGVENGEFTASGDV P  I  ER QPEWPGI 
Sbjct: 61  RRGRPKKLSKHLENPDKFPQLSPSRRAPRGVENGEFTASGDVLPPAIVSERAQPEWPGIA 120

Query: 155 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL 214
           RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEH+TQVKL
Sbjct: 121 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHYTQVKL 180

Query: 215 KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 274
           KKRGSDTKYLATVLAIGTECDIA+LTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG
Sbjct: 181 KKRGSDTKYLATVLAIGTECDIALLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 240

Query: 275 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 334
           DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH
Sbjct: 241 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 300

Query: 335 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR 394
           EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDL +AMGMK+DQKGVR
Sbjct: 301 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLRKAMGMKQDQKGVR 360

Query: 395 IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK 454
           IRRIDPTGPESKVLKP+DIIL FDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK
Sbjct: 361 IRRIDPTGPESKVLKPADIILSFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK 420

Query: 455 VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA 514
           VLRN ETLSFNYQL+TYRRLIPAHNEG+PPSYYIIAGFVF+TVSVPYLRSEYGKDYEYEA
Sbjct: 421 VLRNCETLSFNYQLSTYRRLIPAHNEGRPPSYYIIAGFVFTTVSVPYLRSEYGKDYEYEA 480

Query: 515 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 574
           PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLK LA M
Sbjct: 481 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKCLATM 540

Query: 575 VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 623
           VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT
Sbjct: 541 VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 589

BLAST of Cp4.1LG20g02270 vs. ExPASy TrEMBL
Match: A0A5D3BK86 (Protease Do-like 9 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G003470 PE=3 SV=1)

HSP 1 Score: 1108 bits (2867), Expect = 0.0
Identity = 556/589 (94.40%), Positives = 569/589 (96.60%), Query Frame = 0

Query: 35  MGEVKRKRGRKPKNSKADSLDFPPPTTATVTSVAMVMDDVFSVSNVELMDPASTSKPHQN 94
           MGE+KRKRGRK K+SK ++LDFPPPTTAT T   + MDDVFSVSNVELM+PASTSK HQN
Sbjct: 1   MGEIKRKRGRKAKDSKPEALDFPPPTTATAT---VAMDDVFSVSNVELMEPASTSKHHQN 60

Query: 95  RRGRPKKLSKHLENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV 154
           RRGRPKKLSKH+ENPDKFPQLSPSRRGPR VENGEF ASGD  PS I  ERVQPEWPG+ 
Sbjct: 61  RRGRPKKLSKHVENPDKFPQLSPSRRGPRAVENGEFAASGDALPSSIVSERVQPEWPGMA 120

Query: 155 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL 214
           RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEH+TQVKL
Sbjct: 121 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHYTQVKL 180

Query: 215 KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 274
           KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG
Sbjct: 181 KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 240

Query: 275 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 334
           DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH
Sbjct: 241 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 300

Query: 335 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR 394
           EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDL EAMGMK+DQKGVR
Sbjct: 301 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLREAMGMKQDQKGVR 360

Query: 395 IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK 454
           IRRIDPTGPESKVLKP+DIIL FDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSA IK
Sbjct: 361 IRRIDPTGPESKVLKPADIILSFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSAAIK 420

Query: 455 VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA 514
           VLRNSETLSFNYQLATYRRLIPAHNEG+PPSYYI+AGFVFSTVSVPYLRSEYGKDYEYEA
Sbjct: 421 VLRNSETLSFNYQLATYRRLIPAHNEGRPPSYYIVAGFVFSTVSVPYLRSEYGKDYEYEA 480

Query: 515 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 574
           PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM
Sbjct: 481 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 540

Query: 575 VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 623
           VESCDDEFLKFDLEYQQIVVLRTSTAKAAT DILATHCIPSAMSNDLKT
Sbjct: 541 VESCDDEFLKFDLEYQQIVVLRTSTAKAATSDILATHCIPSAMSNDLKT 586

BLAST of Cp4.1LG20g02270 vs. ExPASy TrEMBL
Match: A0A1S3AVL9 (protease Do-like 9 OS=Cucumis melo OX=3656 GN=LOC103483324 PE=3 SV=1)

HSP 1 Score: 1108 bits (2867), Expect = 0.0
Identity = 556/589 (94.40%), Positives = 569/589 (96.60%), Query Frame = 0

Query: 35  MGEVKRKRGRKPKNSKADSLDFPPPTTATVTSVAMVMDDVFSVSNVELMDPASTSKPHQN 94
           MGE+KRKRGRK K+SK ++LDFPPPTTAT T   + MDDVFSVSNVELM+PASTSK HQN
Sbjct: 1   MGEIKRKRGRKAKDSKPEALDFPPPTTATAT---VAMDDVFSVSNVELMEPASTSKHHQN 60

Query: 95  RRGRPKKLSKHLENPDKFPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPEWPGIV 154
           RRGRPKKLSKH+ENPDKFPQLSPSRRGPR VENGEF ASGD  PS I  ERVQPEWPG+ 
Sbjct: 61  RRGRPKKLSKHVENPDKFPQLSPSRRGPRAVENGEFAASGDALPSSIVSERVQPEWPGMA 120

Query: 155 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKL 214
           RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEH+TQVKL
Sbjct: 121 RVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHYTQVKL 180

Query: 215 KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 274
           KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG
Sbjct: 181 KKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGG 240

Query: 275 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 334
           DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH
Sbjct: 241 DTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAFQSLKH 300

Query: 335 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVR 394
           EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDL EAMGMK+DQKGVR
Sbjct: 301 EDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLREAMGMKQDQKGVR 360

Query: 395 IRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIK 454
           IRRIDPTGPESKVLKP+DIIL FDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSA IK
Sbjct: 361 IRRIDPTGPESKVLKPADIILSFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSAAIK 420

Query: 455 VLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEA 514
           VLRNSETLSFNYQLATYRRLIPAHNEG+PPSYYI+AGFVFSTVSVPYLRSEYGKDYEYEA
Sbjct: 421 VLRNSETLSFNYQLATYRRLIPAHNEGRPPSYYIVAGFVFSTVSVPYLRSEYGKDYEYEA 480

Query: 515 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 574
           PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM
Sbjct: 481 PVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANM 540

Query: 575 VESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 623
           VESCDDEFLKFDLEYQQIVVLRTSTAKAAT DILATHCIPSAMSNDLKT
Sbjct: 541 VESCDDEFLKFDLEYQQIVVLRTSTAKAATSDILATHCIPSAMSNDLKT 586

BLAST of Cp4.1LG20g02270 vs. TAIR 10
Match: AT5G40200.1 (DegP protease 9 )

HSP 1 Score: 837.0 bits (2161), Expect = 9.7e-243
Identity = 429/594 (72.22%), Positives = 486/594 (81.82%), Query Frame = 0

Query: 41  KRGRKPKNSKADSLDFP----PPTTATVTSVAMVMDDVFSVSNVELMDPASTSKPHQNRR 100
           KRGRK K   A S +         +A   S+    + V           AS + P  +RR
Sbjct: 6   KRGRKHKRQDASSAENAGGEVKEASANEASLPQSPEPV----------SASEANPSPSRR 65

Query: 101 GRPKKLSKHLENPDK-------FPQLSPSRRGPRGVENGEFTASGDVFPSDIALERVQPE 160
            R +   + L N  +        P+ S SR      +NG+  ++G +  +        P 
Sbjct: 66  SRGRGKKRRLNNESEAGNQRTSSPERSRSRLHHSDTKNGD-CSNGMIVSTTTESIPAAPS 125

Query: 161 WPGIVRVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHH 220
           W  +V+VVP+MDAVVKVFCVHTEPNFSLPWQRKRQYSS SSGF+IGGRRVLTNAHSVEHH
Sbjct: 126 WETVVKVVPSMDAVVKVFCVHTEPNFSLPWQRKRQYSSGSSGFIIGGRRVLTNAHSVEHH 185

Query: 221 TQVKLKKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVG 280
           TQVKLKKRGSDTKYLATVLAIGTECDIA+LTV DDEFW GVSPVEFG+LPALQDAVTVVG
Sbjct: 186 TQVKLKKRGSDTKYLATVLAIGTECDIALLTVTDDEFWEGVSPVEFGDLPALQDAVTVVG 245

Query: 281 YPIGGDTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCVGIAF 340
           YPIGGDTISVTSGVVSR+EILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKG CVGIAF
Sbjct: 246 YPIGGDTISVTSGVVSRMEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGKCVGIAF 305

Query: 341 QSLKHEDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKD 400
           QSLKHEDAENIGYVIPTPVI+HFI+DYEK+  YTGFP+LG+EWQKMENPDL ++MGM+  
Sbjct: 306 QSLKHEDAENIGYVIPTPVIVHFIQDYEKHDKYTGFPVLGIEWQKMENPDLRKSMGMESH 365

Query: 401 QKGVRIRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGD 460
           QKGVRIRRI+PT PES+VLKPSDIIL FDGV+IANDGTVPFRHGERIGFSYL+SQKYTGD
Sbjct: 366 QKGVRIRRIEPTAPESQVLKPSDIILSFDGVNIANDGTVPFRHGERIGFSYLISQKYTGD 425

Query: 461 SATIKVLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKD 520
           SA +KVLRN E L FN +LA ++RLIPAH  GKPPSY+I+AGFVF+TVSVPYLRSEYGK+
Sbjct: 426 SALVKVLRNKEILEFNIKLAIHKRLIPAHISGKPPSYFIVAGFVFTTVSVPYLRSEYGKE 485

Query: 521 YEYEAPVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLK 580
           YE++APVKLL+K LH+M QS DEQLVVVSQVLV+DINIGYE+IVNTQV+AFNGKPVKNLK
Sbjct: 486 YEFDAPVKLLEKHLHAMAQSVDEQLVVVSQVLVSDINIGYEEIVNTQVVAFNGKPVKNLK 545

Query: 581 SLANMVESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLKT 624
            LA MVE+C+DE++KF+L+Y QIVVL T TAK ATLDIL THCIPSAMS+DLKT
Sbjct: 546 GLAGMVENCEDEYMKFNLDYDQIVVLDTKTAKEATLDILTTHCIPSAMSDDLKT 588

BLAST of Cp4.1LG20g02270 vs. TAIR 10
Match: AT2G47940.1 (DEGP protease 2 )

HSP 1 Score: 500.4 bits (1287), Expect = 2.1e-141
Identity = 245/477 (51.36%), Positives = 333/477 (69.81%), Query Frame = 0

Query: 145 RVQPEWPGIVRVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAH 204
           R Q   P  +     ++AVVKV+C HT P++SLPWQ++RQ++S+ S F+IG  ++LTNAH
Sbjct: 100 RDQQTDPAKIHDASFLNAVVKVYCTHTAPDYSLPWQKQRQFTSTGSAFMIGDGKLLTNAH 159

Query: 205 SVEHHTQVKLKKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDA 264
            VEH TQVK+K+RG D KY+A VL  G +CDIA+L+V+ ++FW G  P+  G LP LQD+
Sbjct: 160 CVEHDTQVKVKRRGDDRKYVAKVLVRGVDCDIALLSVESEDFWKGAEPLRLGHLPRLQDS 219

Query: 265 VTVVGYPIGGDTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNC 324
           VTVVGYP+GGDTISVT GVVSRIE+ SY HGS++LLG+QIDAAIN GNSGGPAFND+G C
Sbjct: 220 VTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGEC 279

Query: 325 VGIAFQSLKHEDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAM 384
           +G+AFQ  + E+ ENIGYVIPT V+ HF+ DYE+NG YTG+P LG+  QK+ENP L E +
Sbjct: 280 IGVAFQVYRSEETENIGYVIPTTVVSHFLTDYERNGKYTGYPCLGVLLQKLENPALRECL 339

Query: 385 GMKKDQKGVRIRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQ 444
            +  ++ GV +RR++PT   SKVLK  D+I+ FD + +  +GTVPFR  ERI F YL+SQ
Sbjct: 340 KVPTNE-GVLVRRVEPTSDASKVLKEGDVIVSFDDLHVGCEGTVPFRSSERIAFRYLISQ 399

Query: 445 KYTGDSATIKVLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRS 504
           K+ GD A I ++R  E       L     L+P H +G  PSY I+AG VF+ +S P +  
Sbjct: 400 KFAGDIAEIGIIRAGEHKKVQVVLRPRVHLVPYHIDGGQPSYIIVAGLVFTPLSEPLIEE 459

Query: 505 EYGKDYEYEAPVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKP 564
           E     E    +KLL K  +S+ +   EQ+V++SQVL  ++NIGYED+ N QVL FNG P
Sbjct: 460 E----CEDTIGLKLLTKARYSVARFRGEQIVILSQVLANEVNIGYEDMNNQQVLKFNGIP 519

Query: 565 VKNLKSLANMVESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDL 622
           ++N+  LA++++ C D++L F+ E   + VL    + +A+L IL  + IPS  S DL
Sbjct: 520 IRNIHHLAHLIDMCKDKYLVFEFEDNYVAVLEREASNSASLCILKDYGIPSERSADL 571

BLAST of Cp4.1LG20g02270 vs. TAIR 10
Match: AT2G47940.2 (DEGP protease 2 )

HSP 1 Score: 500.4 bits (1287), Expect = 2.1e-141
Identity = 245/477 (51.36%), Positives = 333/477 (69.81%), Query Frame = 0

Query: 145 RVQPEWPGIVRVVPAMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAH 204
           R Q   P  +     ++AVVKV+C HT P++SLPWQ++RQ++S+ S F+IG  ++LTNAH
Sbjct: 99  RDQQTDPAKIHDASFLNAVVKVYCTHTAPDYSLPWQKQRQFTSTGSAFMIGDGKLLTNAH 158

Query: 205 SVEHHTQVKLKKRGSDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDA 264
            VEH TQVK+K+RG D KY+A VL  G +CDIA+L+V+ ++FW G  P+  G LP LQD+
Sbjct: 159 CVEHDTQVKVKRRGDDRKYVAKVLVRGVDCDIALLSVESEDFWKGAEPLRLGHLPRLQDS 218

Query: 265 VTVVGYPIGGDTISVTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNC 324
           VTVVGYP+GGDTISVT GVVSRIE+ SY HGS++LLG+QIDAAIN GNSGGPAFND+G C
Sbjct: 219 VTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGEC 278

Query: 325 VGIAFQSLKHEDAENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAM 384
           +G+AFQ  + E+ ENIGYVIPT V+ HF+ DYE+NG YTG+P LG+  QK+ENP L E +
Sbjct: 279 IGVAFQVYRSEETENIGYVIPTTVVSHFLTDYERNGKYTGYPCLGVLLQKLENPALRECL 338

Query: 385 GMKKDQKGVRIRRIDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQ 444
            +  ++ GV +RR++PT   SKVLK  D+I+ FD + +  +GTVPFR  ERI F YL+SQ
Sbjct: 339 KVPTNE-GVLVRRVEPTSDASKVLKEGDVIVSFDDLHVGCEGTVPFRSSERIAFRYLISQ 398

Query: 445 KYTGDSATIKVLRNSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRS 504
           K+ GD A I ++R  E       L     L+P H +G  PSY I+AG VF+ +S P +  
Sbjct: 399 KFAGDIAEIGIIRAGEHKKVQVVLRPRVHLVPYHIDGGQPSYIIVAGLVFTPLSEPLIEE 458

Query: 505 EYGKDYEYEAPVKLLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKP 564
           E     E    +KLL K  +S+ +   EQ+V++SQVL  ++NIGYED+ N QVL FNG P
Sbjct: 459 E----CEDTIGLKLLTKARYSVARFRGEQIVILSQVLANEVNIGYEDMNNQQVLKFNGIP 518

Query: 565 VKNLKSLANMVESCDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDL 622
           ++N+  LA++++ C D++L F+ E   + VL    + +A+L IL  + IPS  S DL
Sbjct: 519 IRNIHHLAHLIDMCKDKYLVFEFEDNYVAVLEREASNSASLCILKDYGIPSERSADL 570

BLAST of Cp4.1LG20g02270 vs. TAIR 10
Match: AT5G36950.1 (DegP protease 10 )

HSP 1 Score: 441.4 bits (1134), Expect = 1.2e-123
Identity = 233/465 (50.11%), Positives = 310/465 (66.67%), Query Frame = 0

Query: 159 AMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKLKKRG 218
           A+D+VVK+F V T P++ LPWQ K Q  S  SGFVI GR+++TNAH V  H+ V ++K G
Sbjct: 111 ALDSVVKIFTVSTSPSYFLPWQNKSQRESMGSGFVISGRKIITNAHVVADHSFVLVRKHG 170

Query: 219 SDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGGDTIS 278
           S  K+ A V A+G ECD+A+L VD + FW G++ +E G++P LQ+AV VVGYP GGD IS
Sbjct: 171 SSIKHRAEVQAVGHECDLAILVVDSEVFWEGMNALELGDIPFLQEAVAVVGYPQGGDNIS 230

Query: 279 VTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCV-GIAFQSLKHEDA 338
           VT GVVSR+E   YVHG+T+L+ +QIDAAIN GNSGGPA    GN V G+AFQ+L    A
Sbjct: 231 VTKGVVSRVEPTQYVHGATQLMAIQIDAAINPGNSGGPAI--MGNKVAGVAFQNL--SGA 290

Query: 339 ENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVRIRR 398
           ENIGY+IPTPVI HFI   E+ G Y GF  +G+  Q MEN +L     M  +  GV + +
Sbjct: 291 ENIGYIIPTPVIKHFINGVEECGKYIGFCSMGVSCQPMENGELRSGFQMSSEMTGVLVSK 350

Query: 399 IDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIKVLR 458
           I+P     K+LK  D++L FDGV IANDGTVPFR+ ERI F +LVS K   ++A +KVLR
Sbjct: 351 INPLSDAHKILKKDDVLLAFDGVPIANDGTVPFRNRERITFDHLVSMKKPDETALVKVLR 410

Query: 459 NSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEAPVK 518
             +   F+  L   + L+P H   + PSYYI AGFVF  ++ PYL  EYG+D+   +P  
Sbjct: 411 EGKEHEFSITLRPLQPLVPVHQFDQLPSYYIFAGFVFVPLTQPYLH-EYGEDWYNTSPRT 470

Query: 519 LLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANMVES 578
           L  + L  +P+   +QLV+VSQVL+ DIN GYE +   QV   NG  V NL+ L  ++E+
Sbjct: 471 LCHRALKDLPKKAGQQLVIVSQVLMDDINTGYERLAELQVNKVNGVEVNNLRHLCQLIEN 530

Query: 579 CDDEFLKFDLEYQ-QIVVLRTSTAKAATLDILATHCIPSAMSNDL 622
           C+ E L+ DL+ + +++VL   +AK AT  IL  H I SA+S+DL
Sbjct: 531 CNTEKLRIDLDDESRVIVLNYQSAKIATSLILKRHRIASAISSDL 570

BLAST of Cp4.1LG20g02270 vs. TAIR 10
Match: AT1G65630.1 (DegP protease 3 )

HSP 1 Score: 392.5 bits (1007), Expect = 6.3e-109
Identity = 214/465 (46.02%), Positives = 288/465 (61.94%), Query Frame = 0

Query: 159 AMDAVVKVFCVHTEPNFSLPWQRKRQYSSSSSGFVIGGRRVLTNAHSVEHHTQVKLKKRG 218
           A+++VVKVF V ++P    PWQ   Q  S+ SGFVI G+++LTNAH V + T VK++K G
Sbjct: 93  ALNSVVKVFTVSSKPRLFQPWQITMQSESTGSGFVISGKKILTNAHVVANQTSVKVRKHG 152

Query: 219 SDTKYLATVLAIGTECDIAMLTVDDDEFWVGVSPVEFGELPALQDAVTVVGYPIGGDTIS 278
           S TKY A V A+G ECD+A+L +D+D+FW G++P+E G++P++QD V VVGYP GGDTIS
Sbjct: 153 STTKYKAKVQAVGHECDLAILEIDNDKFWEGMNPLELGDIPSMQDTVYVVGYPKGGDTIS 212

Query: 279 VTSGVVSRIEILSYVHGSTELLGLQIDAAINSGNSGGPAFNDKGNCV-GIAFQSLKHEDA 338
           V+ GVVSR+  + Y H  TELL +QIDAAIN+GNSGGP     GN V G+AF+SL + D 
Sbjct: 213 VSKGVVSRVGPIKYSHSGTELLAIQIDAAINNGNSGGPVI--MGNKVAGVAFESLCYSD- 272

Query: 339 ENIGYVIPTPVILHFIRDYEKNGAYTGFPILGLEWQKMENPDLCEAMGMKKDQKGVRIRR 398
            +IGY+IPTPVI HF+   E++G    F  + L +QKM+N  L +   M     G+ I +
Sbjct: 273 -SIGYIIPTPVIRHFLNAIEESGEDVSFGSINLTYQKMDNDQLRKDFKMSDKMTGILINK 332

Query: 399 IDPTGPESKVLKPSDIILGFDGVDIANDGTVPFRHGERIGFSYLVSQKYTGDSATIKVLR 458
           I+P     KVLK  DIIL  DGV I ND +V FR  ERI F +LVS K   ++A +KVLR
Sbjct: 333 INPLSDVHKVLKKDDIILAIDGVPIGNDSSVHFRKKERITFKHLVSMKKPCETALLKVLR 392

Query: 459 NSETLSFNYQLATYRRLIPAHNEGKPPSYYIIAGFVFSTVSVPYLRSEYGKDYEYEAPVK 518
             +   FN  L +   L+P     K  SYYI  G VF  ++ PY+ S             
Sbjct: 393 EGKEYEFNSSLKSVPPLVPKRQYDKSASYYIFGGLVFLPLTKPYIDSSC----------- 452

Query: 519 LLDKLLHSMPQSPDEQLVVVSQVLVADINIGYEDIVNTQVLAFNGKPVKNLKSLANMVES 578
           + +  L  MP+   EQ+V++SQ+L  DIN GY    + QV   NG  V NLK L  +VE 
Sbjct: 453 VSESALGKMPKKAGEQVVIISQILEDDINTGYSIFEDFQVKKVNGVQVHNLKHLYKLVEE 512

Query: 579 CDDEFLKFDLEYQQIVVLRTSTAKAATLDILATHCIPSAMSNDLK 623
           C  E ++ DLE  +++ L   +AK  T  IL +  IPSA+S DL+
Sbjct: 513 CCTETVRMDLEKDKVITLDYKSAKKVTSKILKSLKIPSAVSEDLQ 542

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q9FL12	1.4e-241	72.22	Protease Do-like 9 OS=Arabidopsis thaliana OX=3702 GN=DEGP9 PE=1 SV=1
O82261	3.0e-140	51.36	Protease Do-like 2, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DEGP2 PE=1 ...
Q9FIV6	1.7e-122	50.11	Protease Do-like 10, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=DEGP10 PE=...
Q9SHZ1	8.9e-108	46.02	Putative protease Do-like 3, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=DE...
Q9SHZ0	1.7e-103	44.81	Protease Do-like 4, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=DEGP4 PE=2 ...

Match Name	E-value	Identity	Description
XP_023519580.1	0.0	100.00	protease Do-like 9 [Cucurbita pepo subsp. pepo]
XP_022923859.1	0.0	99.83	protease Do-like 9 isoform X1 [Cucurbita moschata] >KAG6584234.1 Protease Do-lik...
XP_023001040.1	0.0	99.66	protease Do-like 9 [Cucurbita maxima]
XP_038894378.1	0.0	95.24	protease Do-like 9 [Benincasa hispida]
XP_022137201.1	0.0	94.91	protease Do-like 9 [Momordica charantia] >XP_022137202.1 protease Do-like 9 [Mom...

Match Name	E-value	Identity	Description
A0A6J1E7J6	0.0	99.83	protease Do-like 9 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111431450 PE=3...
A0A6J1KJZ8	0.0	99.66	protease Do-like 9 OS=Cucurbita maxima OX=3661 GN=LOC111495295 PE=3 SV=1
A0A6J1C5U5	0.0	94.91	protease Do-like 9 OS=Momordica charantia OX=3673 GN=LOC111008725 PE=3 SV=1
A0A5D3BK86	0.0	94.40	Protease Do-like 9 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G0...
A0A1S3AVL9	0.0	94.40	protease Do-like 9 OS=Cucumis melo OX=3656 GN=LOC103483324 PE=3 SV=1

Match Name	E-value	Identity	Description
AT5G40200.1	9.7e-243	72.22	DegP protease 9
AT2G47940.1	2.1e-141	51.36	DEGP protease 2
AT2G47940.2	2.1e-141	51.36	DEGP protease 2
AT5G36950.1	1.2e-123	50.11	DegP protease 10
AT1G65630.1	6.3e-109	46.02	DegP protease 3
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001940	Peptidase S1C	PRINTS	PR00834	PROTEASES2C	coord: 407..419, score: 34.33
IPR001940	Peptidase S1C	PRINTS	PR00834	PROTEASES2C	coord: 298..315, score: 56.5
IPR001940	Peptidase S1C	PRINTS	PR00834	PROTEASES2C	coord: 219..239, score: 24.16
IPR001940	Peptidase S1C	PRINTS	PR00834	PROTEASES2C	coord: 191..203, score: 24.39
IPR001940	Peptidase S1C	PRINTS	PR00834	PROTEASES2C	coord: 261..285, score: 21.46
IPR036034	PDZ superfamily	GENE3D	2.30.42.10	-	coord: 365..473, e-value: 1.0E-12, score: 49.8
IPR036034	PDZ superfamily	SUPERFAMILY	50156	PDZ domain-like	coord: 391..472
IPR043504	Peptidase S1, PA clan, chymotrypsin-like fold	GENE3D	2.40.10.10	-	coord: 147..246, e-value: 1.1E-12, score: 49.6
IPR043504	Peptidase S1, PA clan, chymotrypsin-like fold	GENE3D	2.40.10.10	-	coord: 248..358, e-value: 1.2E-23, score: 85.0
None	No IPR available	GENE3D	3.20.190.20	-	coord: 474..623, e-value: 5.8E-58, score: 196.8
None	No IPR available	PFAM	PF13365	Trypsin_2	coord: 190..327, e-value: 1.8E-20, score: 74.4
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..19
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..60
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 84..132
None	No IPR available	PANTHER	PTHR45980:SF13	PROTEASE DO-LIKE 9	coord: 38..623
None	No IPR available	PANTHER	PTHR45980	FAMILY NOT NAMED	coord: 38..623
None	No IPR available	CDD	cd00987	PDZ_serine_protease	coord: 366..463, e-value: 6.21414E-12, score: 59.9608
IPR041517	Protease Do-like, PDZ domain	PFAM	PF17815	PDZ_3	coord: 475..620, e-value: 4.4E-47, score: 159.6
IPR009003	Peptidase S1, PA clan	SUPERFAMILY	50494	Trypsin-like serine proteases	coord: 159..360
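
Each InterPro hit above is located by a `coord: start..end` range on the protein. As a small illustration, the Python sketch below extracts those ranges and reports the span length of each hit. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly, and `span_lengths` is a hypothetical helper name.

```python
import re

# Matches coordinate fields such as "coord: 365..473".
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def span_lengths(text):
    """Return (start, end, length) for every coord range found in text.

    Assumption: coordinates are 1-based and inclusive, so the length
    is end - start + 1.
    """
    spans = []
    for start, end in COORD.findall(text):
        start, end = int(start), int(end)
        spans.append((start, end, end - start + 1))
    return spans


if __name__ == "__main__":
    # Two hits copied from the table: GENE3D 2.30.42.10 and PFAM PF17815.
    for span in span_lengths("coord: 365..473 coord: 475..620"):
        print(span)
```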

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG20g02270.1	Cp4.1LG20g02270.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0004252 serine-type endopeptidase activity