Cp4.1LG09g03660 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG09g03660
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: proline-, glutamic acid- and leucine-rich protein 1-like
Location: Cp4.1LG09: 2248284 .. 2255323 (+)
RNA-Seq Expression: Cp4.1LG09g03660
Synteny: Cp4.1LG09g03660
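
The Location field can be turned into a span length with a little arithmetic. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(text):
    """Split 'SEQID: START .. END (STRAND)' into (seqid, start, end, strand)."""
    match = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG09: 2248284 .. 2255323 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 7,040 bp on the plus strand of Cp4.1LG09.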
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, CDS, polypeptide, three_prime_UTR
GGTAAAGCCACGTGGACGGTACGTGAAAAGGAAAAGAAAAGAAGAGAAAAAACCTAACGTCTCATTCTCCCTTCCTCTCTCACTGTTTTCAGGCCGCCGGCGACTTGCAGGACCGCGACGACGCTACGGCCTTAGGCATCTGCCTTCCCCGTAGGGTTCTGTAGCGCTCATCACCACGAGCTTAAGAGCGACGGATACACACGCGGCGGCGGACTCAAGAGACCTTCTCCGGCGAACCACGAGCGTTGCCGGCGCAGCTTCCTGTCCATCAACGGCGGCTCATTAAAAAAAAGAAAACCAGTGTTTAAAGGGAACGGGTGTATTCGGTGTTGTTTTTGTGCACTATATTGGCAGTCGAGATTGCATTAGCATCTTACATTTGCCTAATCGTTCTGATAGTTAGTACACACTGAAATCTCAGTGGTTCAACCGAAATGGCGGCTTTCAATCTCGTTGCGAATATGTATGATCCGGCTTTGAAGCCTCGCTTGCTACACAAGCTTCTCAGGGAACATGTTCCGGACGATAAACAGACGTTTAGTGATCATTCAGAACTCTCGAAGGTGGTTTCTATGGTCAAAATCCATAATCTCCTCTCTGAATCTTCATCTTCCATGGACCAAAAACTGATGGATAGCTGGAAATCCGCCGTTGATTCCTGGGTCAACCGCTTGCTTGTTCTGCTCTCTAATGATATGGTATGAACATGCATTTCATCTAATTCTGTTTAAATTACAAGAACCCTAGGGATAGGGAGGGACTTTAGCTTTTACATTAAAAAGATAGTATTATGATTTAGAACAAATGGAAACTTTTAGGGTGCCTTAAACTGTGTATTCTATAGACATAATGGGATTCATTGAGGCTATTCCAAACTTCATCAGGTGAGATTTTACAACAAGAAATTGATATTGTTGGACAAAGTTACCTGAGAAACTCAAGATTTCGAACTTGCTTGAAGATTTCAAAATTGATATTGTTGCATAATATTGACTATATTTTTTGAGAAATTGCACAATTTCATTTAGCGGTGACTAATGTTAACTTCCAATTCTTTTAAAGCCTGATAAATGTTGGGCTGGAATCATTTTACTGGGCGTGACTTGTCAACAATGCAGCTCGAGTCGTTTCTTGGCATCGTATGCAGATTGGCTTCACAAGCTTCTGCCTCACTTGCAGGTAATAATTATTTTTTTCTTGGATGGATTCCTCAGTATTTAGCTACACGTATTGAGTGGCGAAGATCTTAAATGATGAAGTATGTCTAAAGAAGAACAAGACTTATTATACATTGGACAGAAGTAAGATACCGGGGGAGAAGCAATGATTCTGGATAGAATTTGTTATAATCGAAGTGATTATTGCTCGTTGGAAAGAATGCATAATAACTTGCTGGAGGAATTTGCTAAAAATTGCCATAAGATAACCTTTGGCTGTTCATTAATGCTAATGGAACTAGGGAACTGGCACATAGCTATTTTCGAGCATACTAATTTGATGTTTGTCATGATCGATGATGCGAGAGTTCTGAGTCGGGGGATATGCTAATAATTTTCACGTGGTTCATTTTATCAAGTATAAAGTGGGAATGCTATGTAATTACAGAAAGTTGTCAAGTTCCTTCTTTTGCTGTTTGATTTATTTATATAGTTTTCATCTTTAGCTTTTCAACTCATAGATCATCATGCTTTCTTATACTTGAATGCTACCAAAAATTTTACACTATTTTGTTTCTTGTGTTTTTTCTTTGTTCTATAGACAGATTCTCAGTTTCTGAAGGTGGCCACGTGTGCTTCGATCTCAGATTTATTCTTGAGGTACATCTTCATCTGACCTTGGACAGAAATTTATCATACAATTTGTCCCTCAAACATAAATCAATTAGTTAAATATACCCTTATGTGAGATCCCACATCGGTTGAAGAGGGGAACGAAACGTTCTTTATATGGGTGTGGAAACCTCTCCCTAACATACGCCTTTTAAAACCGCGAGGCTGACA
ACGATACGTAATGGGCCAAAACAGACAATATCTGCTAGCGGTGGGCTTGGCTATTACAAATGGTATCAGAACCAGACATCGGGCGGTGTGCTAGCGAGGACACTAGGCCCCCAAGGGGGGTGGATTGTGATATCCCACATCGGTTGGAAAGGGGAACAAAACCGTCCTTATAAGGGTGTGGAAACTTCTCCCTAAGAGATGTGTTTTAAAACCGTGAGGCTGACGGCAATAGGTAACGGACTGAAGCGGACAATATTTGCTAGCGGTAGGCTTGGGCTGTTACACCTTCCCAAAAGAAAAATATCAATGTTAAACCTTTTGCAAATGAAAAATAATTTTTAGGATTTGTATATTGTCGTTTCTATATAGACTTTGAGATCTGTAGATTCAACTTTTGAATGCTGGCTTAAGAATAATAGACAACTGGTTTAATCTCATTCTGCAGGTTATTACTTTTAGTTTATTTTGTCTATAACTTTAAAGTAACATGCTTGACCATAATTTGGCTGGTTTCATGACTTGTAATGATCTCTCGTTTACAAACTAGATTGGGCCGATTTCCAAACGTGAAGAAAGATGGGACTTCTTGTGCTGGGAAGGTCATTCAACCAGTTATTAAGCTGTTGCATGATGATAATACAGAAGCTGTTTTGGTAAGTAGCACAGGTGAACTCACATTTTTCGTAGACTATTTCCATGTGTTCAGGTGCTCAATAATTTTGGAACTACGATATTCGGTGATCTGTGCCAACGCCTCATGTCCTTTTGTACTCTTTGCTCGGTAGTTAGAACTTTCGTTACTGATTCATATATGTAATGATCTGCATGTTGGTTGATGCAGGATGCAGCAGTTAATCTATTATGCACTCTGATTGCTTTCTTTCCCTTTACAATCCATCGTCATTACGACTCTGTAAGTGCATTATTGAATAATGTGATTTCCATTCTATTATAAATTGCATTTATGGGACTAATGGTAGTTCTCCGACTTATCCGTATATCGATGGTGTAATCTAAGCTGCTAAAAGAAAAATTCACAAAAAAACTGCAATGAAAAGCTGGAAAGAGTATACCAATACTAGCTAGAAGAATTTAATTTTGTCCCACAACTCTACTCCGAGCTTAGCAGTTCAACTTTGTTGGTATTCCAACAATTTTTATAAAATTTCATAGCAAATCTTTTTGTCCCAGAGGTATGAGGGCTGTCATTTATATTACTTGGCTGGCCTTTCTCAACATCTTGGCACGTACGGCTTCATGTTGTCAGCATCTTATTTATCTGCATAAATGAAAAATCTAAGAACGTAGAAATCCCGGACAAAGTTTGTCTTAATGTTGAAATAATCGATGTTTATTTCTTCTTGCTGAATTGACTATTGAAATTTTTTGCAGGCTGAAGCTGCAATTGTTTCAAAAATCTTTTCAGGGAAGTGTAGTTTCAATATGCTGAAGGTATTTGGCCTCTTTTTAATTTAATTTAATCATAGATACCACTATAAAAACATCAAATTTCTCCTTTAATTTTCTGTCTCTGTTTCATATCCAATATTTAGAAGCTTGCTCATTGCCTAGCATCACTTCCAAAATCAAAAGGAGATGAAGATAGCTGGACTGTACTAATGCAGAAGATTTTGTTATCAATTGACATACACTTGAATGAGGCCTTCCAAGGCATTGGTGAAGGTAAACATGAACTGTATATCACGTAGAGCTATCCTAATTTATGAAGATGGTGAGAATGATCCAATGATCCTTGATTGTTTTTGTAGACTCAAGAGGTAACGAAGTTGTAAGGTTACTGATTCCACCCGGAAAAGAACCTCCACCACCGTTAGGTTGTAATTCATCGGCAGAAGGCTCCTTTGATAAACTAACAAAGAGCTCAGAGCGAATGTTAACTTCTATTATCTCGACCTTGATGTTTTGCTGTTCTACAATGATAACAAGTTCATACCCCCATCAGGTAGCATCATGTCATCCATTTTTTACTATTTATAAAT
ATATTGAAATAAAGAAGAAAAAGGATGTTTAATGGATATTTTGTATGTATTTTCTCAGGTGGCAGTTCCGATTCGCCCGTTATTAGCTCTTGTTGAGAGAATGCTGATGGTGGATGGTTCTCTGCCGCCCGCTTCAGTGCCATTTATGACATCTCTACAGCAAGAGTCAATGTGTTCGGAACTCCCGACCCTGCATTCGGACAGTTTGGATCTTCTCATTGCCATAATTAAGAGCCTTCGCAGGCAAGCCATCTACTATTCCTAAACTACATTTAATATCTAACATAGAAACCTCGATGCAAATTCTTTCACTAATCCTTCTTCATCCATAGATATGCCTTTCTTCCTCCATCTCTGGAAAATGCAACCCCCACTGTTATAGCCGATGATCTTTCACGTTCACGCGAATTAAAAAATAATCTAATAGTAATACTACAAATATTCGCGCTATATTTTTGTTAATATTATTTATTATTAAATTCCTTTGCAGTCAATTGTTACCACATGCTGCATTTATTGTGCGACTCATTGTGAAGTACTTCAAGAAATGTGTGTCTGCAGAATTGAGAGTAAAGGCCTACGCAGTTGCTAAATTATTGATGATGTCTTTGGGCGTTGGTAAGCAGAAGTACATTTATTTGTATATTTACCTATTTGTCAATTATTCTTCTTTTCTTAGAGAGATCAATTAGGCGCTATCCAAGATAGTTAAGGTTCAGTCTTTCGGTAACGGCTCAAGCCCACTGCTAGCAGATATTGTCCTCTTTTGGTTTTCCTTTTTGGACTTCTTCTCAAAGTTTTAAGAACGCGTCTGCTACAGAGAGGTTTTCACACTCTTATAAAAAATGTTTCGTTCTCCTCCTCAACCATGTGGGATCTCACAATCCATTGATGGTTTTTAAAATGCAAGGCTAGCAAATATTGTCCTGTTGGGTTTTCCTTTTTGAGCTTCCTCTCAAGATTTTTCAAACGCGTCTGATAGAGAGAGGTTTTCACACCCTTATGAAGAATGTTTCATTTTCCTCCCTAATTGATGTGAGATCTCACAATCTATTGATGGTTTTTAAAGGATTCTCAGAATTCAATTTGATCTTTAGTTTCTTTTTGGAGCTAACTGAAATATTTTTAGGCAATGTGAGGAATATAGGATAGGATCTTCCTTTGTTGTCCTTGATAAAATCATTAGAAAATGTAACAAATATATTTTTTGTGTTTCATTTGGTAAGGATATTATTATTATTATAGTTTTCTTTTTGTCTATATGGAAATGATGTTTTGGGATGCATGCTTACTCGACGCTACGTCTGAATCAGGAATGGCTGCATCTCTTGCACGAGATGTGATCGACAATGTACTAGTCGATTTGAATCCTGTTGATAACGAGAGTTGTGCTCCATCTAGTGTGAATCCGAAGGACGCCCAAAGAGAATTGCCGCAACACCATAAGAAGAGGAAACGGCCTTTAGTTCCCACTTCATTTAAAGAGCAGCATGAGGGACATGGATCAAGAGACATTACCAGCAGCTGTATGTCCACTTCTGTCCCCTTGAGGATAGCTGCACTTGAGGCTTTGGAGACTCTTCTTACATTGGTAGGCATTTTATAACTAATCTGTGGCTTTTTGTAAGGTTGTGCTTTGGGACATTTTTGGTCCCGTTTTGTTTCTGATATTGTTGCTAAATGATATTGTTGCATAGGCTGGTGCTTTGAGAACTGAAGAAGGATGGCGTGCGAAAGTCAAACATCTTTTAATAACAGCTGCAACGTCGTCTTTCGAATGGCCACTGGCCTCAGATGACGTCTTTTTCCAAACTAATGAATCTATTGAGGTTTGGGCGGATTATCAGTTGGCAGCATTTCGTGCGCTACTGGCTTCGTTTTTGTCTGCGGTCCATATACGCCCTCTGGCCTTAGCTCAAGGTCTTGATCTTTTCCGTAGAGGTAAATCTCTTTTGATGTTTCATTTGTTAACTATAACGGCCCAAGCCCACTGCTGACCG
ATATTGTTTTCTTTGGGTTTTTTCCGTGAACCATACCCTTATAAAGAATGTTTTGTTCTCCTCCTCAACCGACGTGGGATCTGTCGTAAACTATTTATCAGAGAAAAATTATGCATAGAACAGCAGATGTTAGCTATTTTCTGATGGGTATTTTTATCTTCAGGTAAACAAGAACTTGGAACCAAACTTCCTGAATTCTGTGCGCATGCACTCTTAGCCTTGGAGGTTCTAATACATCCAAGAGTACTTCCCTTGTCGGATTTCTTGCCCGTGCATTTGAGCTCTCCCGAACCACAAGCTACCTATAAAATCCCGGAAGATATGTACTTCGGTGGTATGAATTCGAGCAAATCGTTGAAGATCATCGACACTCTCGGCATGGACCAGAGTGCCCCTGATTTGGACGACGATTTCCTGTATGATAGAGAAGTTGCAGATGACATCGAAGAGGCTCCAATTAGAGATGCAAGTAATGAGATAAATAACAATGCAACGACATATAACACGTCAAACAATCTCGAAACAGGACCTTCTGCCGATGCCCTACAGACTACAGAAACCCCCAAGAGGACAGAGCAGGAGGACACTGCAGCAGCCATCACAGATGCTGCAGGGATTGTAGAGAAAGATGATGTATTTGCTAATGCAAGAATGAACAGTTCTCCCGTGTCGTTAAAGTCCGACTCGAACTTATTGCCAGAAGATGATTTCCCCGACATTATTGATGCAGATCCTGATACAGACTGTGAGTGAACAAAGGTACTAACAATCTCAAATCTCAATTTTGTAGCAATAAGGATGTTGTTTTAAAGTTCAATATTAACTATTTTTTGTTGTGTTGTATTTCATGTTACCATAGTTTAAGCTAATCATGGAAGAAGAAGAAGAAGAAACTATGTATATAAAGAGCATAGGATCTGACATTTTTGGTCATAAAATTAGGGTTTTCTTTCTCTTAAGACATTGTTCATTCCCACCAAGCAAAATTAATTGTCTTTGAATTATAATTTTTACACATTTTTGGACTAAATTTATTTTTG

mRNA sequence

GGTAAAGCCACGTGGACGGCCGCCGGCGACTTGCAGGACCGCGACGACGCTACGGCCTTAGGCATCTGCCTTCCCCGGTTCTGTAGCGCTCATCACCACGAGCTTAAGAGCGACGGATACACACGCGGCGGCGGACTCAAGAGACCTTCTCCGGCGAACCACGAGCGTTGCCGGCGCAGCTTCCTGTCCATCAACGGCGGCTCATTAAAAAAAAGAAAACCAGTGTTTAAAGGGAACGGTGGTTCAACCGAAATGGCGGCTTTCAATCTCGTTGCGAATATGTATGATCCGGCTTTGAAGCCTCGCTTGCTACACAAGCTTCTCAGGGAACATGTTCCGGACGATAAACAGACGTTTAGTGATCATTCAGAACTCTCGAAGGTGGTTTCTATGGTCAAAATCCATAATCTCCTCTCTGAATCTTCATCTTCCATGGACCAAAAACTGATGGATAGCTGGAAATCCGCCGTTGATTCCTGGGTCAACCGCTTGCTTGTTCTGCTCTCTAATGATATGCCTGATAAATGTTGGGCTGGAATCATTTTACTGGGCGTGACTTGTCAACAATGCAGCTCGAGTCGTTTCTTGGCATCGTATGCAGATTGGCTTCACAAGCTTCTGCCTCACTTGCAGACAGATTCTCAGTTTCTGAAGGTGGCCACGTGTGCTTCGATCTCAGATTTATTCTTGAGATTGGGCCGATTTCCAAACGTGAAGAAAGATGGGACTTCTTGTGCTGGGAAGGTCATTCAACCAGTTATTAAGCTGTTGCATGATGATAATACAGAAGCTGTTTTGGATGCAGCAGTTAATCTATTATGCACTCTGATTGCTTTCTTTCCCTTTACAATCCATCGTCATTACGACTCTGCTGAAGCTGCAATTGTTTCAAAAATCTTTTCAGGGAAGTGTAGTTTCAATATGCTGAAGAAGCTTGCTCATTGCCTAGCATCACTTCCAAAATCAAAAGGAGATGAAGATAGCTGGACTGTACTAATGCAGAAGATTTTGTTATCAATTGACATACACTTGAATGAGGCCTTCCAAGGCATTGGTGAAGACTCAAGAGGTAACGAAGTTGTAAGGTTACTGATTCCACCCGGAAAAGAACCTCCACCACCGTTAGGTTGTAATTCATCGGCAGAAGGCTCCTTTGATAAACTAACAAAGAGCTCAGAGCGAATGTTAACTTCTATTATCTCGACCTTGATGTTTTGCTGTTCTACAATGATAACAAGTTCATACCCCCATCAGGTGGCAGTTCCGATTCGCCCGTTATTAGCTCTTGTTGAGAGAATGCTGATGGTGGATGGTTCTCTGCCGCCCGCTTCAGTGCCATTTATGACATCTCTACAGCAAGAGTCAATTCAATTGTTACCACATGCTGCATTTATTGTGCGACTCATTGTGAAGTACTTCAAGAAATGTGTGTCTGCAGAATTGAGAGTAAAGGCCTACGCAGTTGCTAAATTATTGATGATGTCTTTGGGCGTTGGAATGGCTGCATCTCTTGCACGAGATGTGATCGACAATGTACTAGTCGATTTGAATCCTGTTGATAACGAGAGTTGTGCTCCATCTAGTGTGAATCCGAAGGACGCCCAAAGAGAATTGCCGCAACACCATAAGAAGAGGAAACGGCCTTTAGTTCCCACTTCATTTAAAGAGCAGCATGAGGGACATGGATCAAGAGACATTACCAGCAGCTGTATGTCCACTTCTGTCCCCTTGAGGATAGCTGCACTTGAGGCTTTGGAGACTCTTCTTACATTGGCTGGTGCTTTGAGAACTGAAGAAGGATGGCGTGCGAAAGTCAAACATCTTTTAATAACAGCTGCAACGTCGTCTTTCGAATGGCCACTGGCCTCAGATGACGTCTTTTTCCAAACTAATGAATCTATTGAGGTTTGGGCGGATTATCAGTTGGCAGCATTTCGTAAACAAGAACTTGGAACCAAACTTCCTGAATTCTGTGCGCATGCACTCTTAGCCTTGGAGGT
TCTAATACATCCAAGAGTACTTCCCTTGTCGGATTTCTTGCCCGTGCATTTGAGCTCTCCCGAACCACAAGCTACCTATAAAATCCCGGAAGATATGTACTTCGGTGGTATGAATTCGAGCAAATCGTTGAAGATCATCGACACTCTCGGCATGGACCAGAGTGCCCCTGATTTGGACGACGATTTCCTGTATGATAGAGAAGTTGCAGATGACATCGAAGAGGCTCCAATTAGAGATGCAAGTAATGAGATAAATAACAATGCAACGACATATAACACGTCAAACAATCTCGAAACAGGACCTTCTGCCGATGCCCTACAGACTACAGAAACCCCCAAGAGGACAGAGCAGGAGGACACTGCAGCAGCCATCACAGATGCTGCAGGGATTGTAGAGAAAGATGATGTATTTGCTAATGCAAGAATGAACAGTTCTCCCGTGTCGTTAAAGTCCGACTCGAACTTATTGCCAGAAGATGATTTCCCCGACATTATTGATGCAGATCCTGATACAGACTGTGAGTGAACAAAGGTACTAACAATCTCAAATCTCAATTTTGTAGCAATAAGGATGTTGTTTTAAAGTTCAATATTAACTATTTTTTGTTGTGTTGTATTTCATGTTACCATAGTTTAAGCTAATCATGGAAGAAGAAGAAGAAGAAACTATGTATATAAAGAGCATAGGATCTGACATTTTTGGTCATAAAATTAGGGTTTTCTTTCTCTTAAGACATTGTTCATTCCCACCAAGCAAAATTAATTGTCTTTGAATTATAATTTTTACACATTTTTGGACTAAATTTATTTTTG

Coding sequence (CDS)

GGTAAAGCCACGTGGACGGCCGCCGGCGACTTGCAGGACCGCGACGACGCTACGGCCTTAGGCATCTGCCTTCCCCGGTTCTGTAGCGCTCATCACCACGAGCTTAAGAGCGACGGATACACACGCGGCGGCGGACTCAAGAGACCTTCTCCGGCGAACCACGAGCGTTGCCGGCGCAGCTTCCTGTCCATCAACGGCGGCTCATTAAAAAAAAGAAAACCAGTGTTTAAAGGGAACGGTGGTTCAACCGAAATGGCGGCTTTCAATCTCGTTGCGAATATGTATGATCCGGCTTTGAAGCCTCGCTTGCTACACAAGCTTCTCAGGGAACATGTTCCGGACGATAAACAGACGTTTAGTGATCATTCAGAACTCTCGAAGGTGGTTTCTATGGTCAAAATCCATAATCTCCTCTCTGAATCTTCATCTTCCATGGACCAAAAACTGATGGATAGCTGGAAATCCGCCGTTGATTCCTGGGTCAACCGCTTGCTTGTTCTGCTCTCTAATGATATGCCTGATAAATGTTGGGCTGGAATCATTTTACTGGGCGTGACTTGTCAACAATGCAGCTCGAGTCGTTTCTTGGCATCGTATGCAGATTGGCTTCACAAGCTTCTGCCTCACTTGCAGACAGATTCTCAGTTTCTGAAGGTGGCCACGTGTGCTTCGATCTCAGATTTATTCTTGAGATTGGGCCGATTTCCAAACGTGAAGAAAGATGGGACTTCTTGTGCTGGGAAGGTCATTCAACCAGTTATTAAGCTGTTGCATGATGATAATACAGAAGCTGTTTTGGATGCAGCAGTTAATCTATTATGCACTCTGATTGCTTTCTTTCCCTTTACAATCCATCGTCATTACGACTCTGCTGAAGCTGCAATTGTTTCAAAAATCTTTTCAGGGAAGTGTAGTTTCAATATGCTGAAGAAGCTTGCTCATTGCCTAGCATCACTTCCAAAATCAAAAGGAGATGAAGATAGCTGGACTGTACTAATGCAGAAGATTTTGTTATCAATTGACATACACTTGAATGAGGCCTTCCAAGGCATTGGTGAAGACTCAAGAGGTAACGAAGTTGTAAGGTTACTGATTCCACCCGGAAAAGAACCTCCACCACCGTTAGGTTGTAATTCATCGGCAGAAGGCTCCTTTGATAAACTAACAAAGAGCTCAGAGCGAATGTTAACTTCTATTATCTCGACCTTGATGTTTTGCTGTTCTACAATGATAACAAGTTCATACCCCCATCAGGTGGCAGTTCCGATTCGCCCGTTATTAGCTCTTGTTGAGAGAATGCTGATGGTGGATGGTTCTCTGCCGCCCGCTTCAGTGCCATTTATGACATCTCTACAGCAAGAGTCAATTCAATTGTTACCACATGCTGCATTTATTGTGCGACTCATTGTGAAGTACTTCAAGAAATGTGTGTCTGCAGAATTGAGAGTAAAGGCCTACGCAGTTGCTAAATTATTGATGATGTCTTTGGGCGTTGGAATGGCTGCATCTCTTGCACGAGATGTGATCGACAATGTACTAGTCGATTTGAATCCTGTTGATAACGAGAGTTGTGCTCCATCTAGTGTGAATCCGAAGGACGCCCAAAGAGAATTGCCGCAACACCATAAGAAGAGGAAACGGCCTTTAGTTCCCACTTCATTTAAAGAGCAGCATGAGGGACATGGATCAAGAGACATTACCAGCAGCTGTATGTCCACTTCTGTCCCCTTGAGGATAGCTGCACTTGAGGCTTTGGAGACTCTTCTTACATTGGCTGGTGCTTTGAGAACTGAAGAAGGATGGCGTGCGAAAGTCAAACATCTTTTAATAACAGCTGCAACGTCGTCTTTCGAATGGCCACTGGCCTCAGATGACGTCTTTTTCCAAACTAATGAATCTATTGAGGTTTGGGCGGATTATCAGTTGGCAGCATTTCGTAAACAAGAACTTGGAACCAAACTTCCTGAATTCTGTGCGCATGCACTCTTAGCCTTGGAGGT
TCTAATACATCCAAGAGTACTTCCCTTGTCGGATTTCTTGCCCGTGCATTTGAGCTCTCCCGAACCACAAGCTACCTATAAAATCCCGGAAGATATGTACTTCGGTGGTATGAATTCGAGCAAATCGTTGAAGATCATCGACACTCTCGGCATGGACCAGAGTGCCCCTGATTTGGACGACGATTTCCTGTATGATAGAGAAGTTGCAGATGACATCGAAGAGGCTCCAATTAGAGATGCAAGTAATGAGATAAATAACAATGCAACGACATATAACACGTCAAACAATCTCGAAACAGGACCTTCTGCCGATGCCCTACAGACTACAGAAACCCCCAAGAGGACAGAGCAGGAGGACACTGCAGCAGCCATCACAGATGCTGCAGGGATTGTAGAGAAAGATGATGTATTTGCTAATGCAAGAATGAACAGTTCTCCCGTGTCGTTAAAGTCCGACTCGAACTTATTGCCAGAAGATGATTTCCCCGACATTATTGATGCAGATCCTGATACAGACTGTGAGTGA

Protein sequence

GKATWTAAGDLQDRDDATALGICLPRFCSAHHHELKSDGYTRGGGLKRPSPANHERCRRSFLSINGGSLKKRKPVFKGNGGSTEMAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFSDHSELSKVVSMVKIHNLLSESSSSMDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWLHKLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWTVLMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGSFDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERMLMVDGSLPPASVPFMTSLQQESIQLLPHAAFIVRLIVKYFKKCVSAELRVKAYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRELPQHHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALRTEEGWRAKVKHLLITAATSSFEWPLASDDVFFQTNESIEVWADYQLAAFRKQELGTKLPEFCAHALLALEVLIHPRVLPLSDFLPVHLSSPEPQATYKIPEDMYFGGMNSSKSLKIIDTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDASNEINNNATTYNTSNNLETGPSADALQTTETPKRTEQEDTAAAITDAAGIVEKDDVFANARMNSSPVSLKSDSNLLPEDDFPDIIDADPDTDCE
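
The protein sequence above appears to be the translation of the listed CDS read in frame from its first base. The sketch below checks this for the first twelve codons only, using the standard genetic code. The `translate` helper and the variable names are hypothetical, and the two short strings are copied from the start of the CDS and protein sequences on this page.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate DNA in frame 0; unknown codons become 'X', a trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


cds_prefix = "GGTAAAGCCACGTGGACGGCCGCCGGCGACTTGCAG"  # first 36 nt of the CDS
protein_prefix = "GKATWTAAGDLQ"                      # first 12 residues of the protein
print(translate(cds_prefix) == protein_prefix)
```

The same function can be run on the full CDS to compare against the full protein sequence. The first methionine of the listed protein is at residue 85, which is where the BLAST query coordinates in the Homology section begin.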
Homology
BLAST of Cp4.1LG09g03660 vs. NCBI nr
Match: XP_023542346.1 (proline-, glutamic acid- and leucine-rich protein 1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1440 bits (3728), Expect = 0.0
Identity = 756/808 (93.56%), Positives = 757/808 (93.69%), Query Frame = 0

Query: 85  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFSDHSELSKVVSMVKIHNLLSESSSS 144
           MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFSDHSELSKVVSMVKIHNLLSESSSS
Sbjct: 1   MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFSDHSELSKVVSMVKIHNLLSESSSS 60

Query: 145 MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWLH 204
           MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWLH
Sbjct: 61  MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWLH 120

Query: 205 KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA 264
           KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA
Sbjct: 121 KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA 180

Query: 265 VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG 324
           VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG
Sbjct: 181 VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG 240

Query: 325 DEDSWTVLMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS 384
           DEDSWTVLMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS
Sbjct: 241 DEDSWTVLMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS 300

Query: 385 FDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERMLMVDGSLPPAS 444
           FDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERMLMVDGSLPPAS
Sbjct: 301 FDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERMLMVDGSLPPAS 360

Query: 445 VPFMTSLQQESI------------------------QLLPHAAFIVRLIVKYFKKCVSAE 504
           VPFMTSLQQES+                        QLLPHAAFIVRLIVKYFKKCVSAE
Sbjct: 361 VPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLPHAAFIVRLIVKYFKKCVSAE 420

Query: 505 LRVKAYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRELPQ 564
           LRVKAYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRELPQ
Sbjct: 421 LRVKAYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRELPQ 480

Query: 565 HHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALRTEEG 624
           HHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALRTEEG
Sbjct: 481 HHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALRTEEG 540

Query: 625 WRAKVKHLLITAATSSFEWPLASDDVFFQTNESIEVWADYQLAAFR-------------- 684
           WRAKVKHLLITAATSSFEWPLASDDVFFQTNESIEVWADYQLAAFR              
Sbjct: 541 WRAKVKHLLITAATSSFEWPLASDDVFFQTNESIEVWADYQLAAFRALLASFLSAVHIRP 600

Query: 685 -------------KQELGTKLPEFCAHALLALEVLIHPRVLPLSDFLPVHLSSPEPQATY 744
                        KQELGTKLPEFCAHALLALEVLIHPRVLPLSDFLPVHLSSPEPQATY
Sbjct: 601 LALAQGLDLFRRGKQELGTKLPEFCAHALLALEVLIHPRVLPLSDFLPVHLSSPEPQATY 660

Query: 745 KIPEDMYFGGMNSSKSLKIIDTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDASNEINN 804
           KIPEDMYFGGMNSSKSLKIIDTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDASNEINN
Sbjct: 661 KIPEDMYFGGMNSSKSLKIIDTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDASNEINN 720

Query: 805 NATTYNTSNNLETGPSADALQTTETPKRTEQEDTAAAITDAAGIVEKDDVFANARMNSSP 841
           NATTYNTSNNLETGPSADALQTTETPKRTEQEDTAAAITDAAGIVEKDDVFANARMNSSP
Sbjct: 721 NATTYNTSNNLETGPSADALQTTETPKRTEQEDTAAAITDAAGIVEKDDVFANARMNSSP 780
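
The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length. The sketch below reproduces them for two of the hits on this page. The `percent` helper is hypothetical, and the counts are copied from the HSP headers rather than recomputed from the alignments.

```python
def percent(count, aligned_length):
    """Return count as a percentage of the alignment length."""
    if aligned_length <= 0:
        raise ValueError("alignment length must be positive")
    return 100.0 * count / aligned_length


# (identities, positives, alignment length) as reported in the HSP headers.
hits = {
    "XP_023542346.1": (756, 757, 808),
    "KAG7012950.1": (740, 745, 808),
}

for accession, (identities, positives, length) in hits.items():
    print(
        f"{accession}: identity {percent(identities, length):.2f}%, "
        f"positives {percent(positives, length):.2f}%"
    )
```

Note that the alignment length (808 for most hits here) includes gap columns, so it can differ from the length of either sequence.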

BLAST of Cp4.1LG09g03660 vs. NCBI nr
Match: KAG7012950.1 (hypothetical protein SDJN02_25703 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1412 bits (3655), Expect = 0.0
Identity = 740/808 (91.58%), Positives = 745/808 (92.20%), Query Frame = 0

Query: 85  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFSDHSELSKVVSMVKIHNLLSESSSS 144
           MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTF+DHSELSKVVSMVKIHNLLSESSSS
Sbjct: 1   MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFNDHSELSKVVSMVKIHNLLSESSSS 60

Query: 145 MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWLH 204
           MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLG TCQQCSSSRFLASYADWLH
Sbjct: 61  MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGTTCQQCSSSRFLASYADWLH 120

Query: 205 KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA 264
           KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA
Sbjct: 121 KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA 180

Query: 265 VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG 324
           VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG
Sbjct: 181 VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG 240

Query: 325 DEDSWTVLMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS 384
           DEDSWT+LMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS
Sbjct: 241 DEDSWTILMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS 300

Query: 385 FDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERMLMVDGSLPPAS 444
           FDKLTKSSERMLTSIISTLM CCSTMITSSYPHQVAVPIRPLLALVERML VDGSLPPAS
Sbjct: 301 FDKLTKSSERMLTSIISTLMLCCSTMITSSYPHQVAVPIRPLLALVERMLTVDGSLPPAS 360

Query: 445 VPFMTSLQQESI------------------------QLLPHAAFIVRLIVKYFKKCVSAE 504
           VPFMTSLQQES+                        QLLPHAA IVRLIVKYFKKCVSAE
Sbjct: 361 VPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLPHAALIVRLIVKYFKKCVSAE 420

Query: 505 LRVKAYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRELPQ 564
           LRVK YAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRE PQ
Sbjct: 421 LRVKVYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQREFPQ 480

Query: 565 HHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALRTEEG 624
           HHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALRTEEG
Sbjct: 481 HHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALRTEEG 540

Query: 625 WRAKVKHLLITAATSSFEWPLASDDVFFQTNESIEVWADYQLAAFR-------------- 684
           WRAKV+HLLITAATSSFEWPLASDDVFFQTNESIEVWADYQLAAFR              
Sbjct: 541 WRAKVEHLLITAATSSFEWPLASDDVFFQTNESIEVWADYQLAAFRALLASFLSAVHIRP 600

Query: 685 -------------KQELGTKLPEFCAHALLALEVLIHPRVLPLSDFLPVHLSSPEPQATY 744
                        KQELGTKLPEFCAHALLALEVLIHPRVLPLSDF PVHLSSPEPQATY
Sbjct: 601 LALAQGLDLFRRGKQELGTKLPEFCAHALLALEVLIHPRVLPLSDFSPVHLSSPEPQATY 660

Query: 745 KIPEDMYFGGMNSSKSLKIIDTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDASNEINN 804
           KIPEDMY GGMNS KSLKI DTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDA NEINN
Sbjct: 661 KIPEDMYIGGMNSGKSLKINDTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDAGNEINN 720

Query: 805 NATTYNTSNNLETGPSADALQTTETPKRTEQEDTAAAITDAAGIVEKDDVFANARMNSSP 841
           N TTYNTSNNLETGPSADALQTTETPKRT+QEDTAAAITDAAGIVEKDDVFANARMNSSP
Sbjct: 721 NVTTYNTSNNLETGPSADALQTTETPKRTKQEDTAAAITDAAGIVEKDDVFANARMNSSP 780

BLAST of Cp4.1LG09g03660 vs. NCBI nr
Match: XP_022945087.1 (proline-, glutamic acid- and leucine-rich protein 1-like [Cucurbita moschata])

HSP 1 Score: 1410 bits (3649), Expect = 0.0
Identity = 739/808 (91.46%), Positives = 744/808 (92.08%), Query Frame = 0

Query: 85  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFSDHSELSKVVSMVKIHNLLSESSSS 144
           MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTF+DHSELSKVVSMVKIHNLLSESSSS
Sbjct: 1   MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFNDHSELSKVVSMVKIHNLLSESSSS 60

Query: 145 MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWLH 204
           MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLG TCQQCSSSRFLASYADWLH
Sbjct: 61  MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGTTCQQCSSSRFLASYADWLH 120

Query: 205 KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA 264
           KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA
Sbjct: 121 KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA 180

Query: 265 VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG 324
           VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSG CSFNMLKKLAHCLASLPKSKG
Sbjct: 181 VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGNCSFNMLKKLAHCLASLPKSKG 240

Query: 325 DEDSWTVLMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS 384
           DEDSWT+LMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS
Sbjct: 241 DEDSWTILMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS 300

Query: 385 FDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERMLMVDGSLPPAS 444
           FDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERML VDGSLPPAS
Sbjct: 301 FDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERMLTVDGSLPPAS 360

Query: 445 VPFMTSLQQESI------------------------QLLPHAAFIVRLIVKYFKKCVSAE 504
           VPFMTSLQQES+                        QLLPHAAFIVRLIVKYFKKCVSAE
Sbjct: 361 VPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLPHAAFIVRLIVKYFKKCVSAE 420

Query: 505 LRVKAYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRELPQ 564
           LRVK YAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRELPQ
Sbjct: 421 LRVKVYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRELPQ 480

Query: 565 HHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALRTEEG 624
           HHKKRKRPLVPTSFKEQHEGHGSRDITSSC STSVPLRIAALEALETLLTLAGALRTEEG
Sbjct: 481 HHKKRKRPLVPTSFKEQHEGHGSRDITSSCTSTSVPLRIAALEALETLLTLAGALRTEEG 540

Query: 625 WRAKVKHLLITAATSSFEWPLASDDVFFQTNESIEVWADYQLAAFR-------------- 684
           W AKV+HLLITAA SSFEWPLASDDVFFQTNESIEVWADYQLAAFR              
Sbjct: 541 WHAKVEHLLITAAMSSFEWPLASDDVFFQTNESIEVWADYQLAAFRALLASFLSAVHIRP 600

Query: 685 -------------KQELGTKLPEFCAHALLALEVLIHPRVLPLSDFLPVHLSSPEPQATY 744
                        KQELGTKLPEFCAHALLALEVLIHPRVLPLSDF PVHLSSPEPQATY
Sbjct: 601 LALAQGLDLFRRGKQELGTKLPEFCAHALLALEVLIHPRVLPLSDFSPVHLSSPEPQATY 660

Query: 745 KIPEDMYFGGMNSSKSLKIIDTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDASNEINN 804
           KIPEDMY GGMNS KSLKI DTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDA NEINN
Sbjct: 661 KIPEDMYIGGMNSGKSLKINDTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDAGNEINN 720

Query: 805 NATTYNTSNNLETGPSADALQTTETPKRTEQEDTAAAITDAAGIVEKDDVFANARMNSSP 841
           N TTYNTSNNLETGPSADALQTTETPKRT+QEDTAAAITDAAGIVEKDDVFANARMNSSP
Sbjct: 721 NVTTYNTSNNLETGPSADALQTTETPKRTKQEDTAAAITDAAGIVEKDDVFANARMNSSP 780

BLAST of Cp4.1LG09g03660 vs. NCBI nr
Match: KAG6573885.1 (hypothetical protein SDJN03_27772, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1405 bits (3636), Expect = 0.0
Identity = 737/806 (91.44%), Positives = 742/806 (92.06%), Query Frame = 0

Query: 85  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFSDHSELSKVVSMVKIHNLLSESSSS 144
           MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTF+DHSELSKVVSMVKIHNLLSESSSS
Sbjct: 1   MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFNDHSELSKVVSMVKIHNLLSESSSS 60

Query: 145 MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWLH 204
           MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLG TCQQCSSSRFLASYADWLH
Sbjct: 61  MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGTTCQQCSSSRFLASYADWLH 120

Query: 205 KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA 264
           KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQP IKLLHDDNTEA
Sbjct: 121 KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPAIKLLHDDNTEA 180

Query: 265 VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG 324
           VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG
Sbjct: 181 VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG 240

Query: 325 DEDSWTVLMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS 384
           DEDSWT+LMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS
Sbjct: 241 DEDSWTILMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS 300

Query: 385 FDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERMLMVDGSLPPAS 444
           FDKLTKSSERMLTSIISTLM CCSTMITSSYPHQVAVPIRPLLALVERML VDGSLPPAS
Sbjct: 301 FDKLTKSSERMLTSIISTLMLCCSTMITSSYPHQVAVPIRPLLALVERMLTVDGSLPPAS 360

Query: 445 VPFMTSLQQESI------------------------QLLPHAAFIVRLIVKYFKKCVSAE 504
           VPFMTSLQQES+                        QLLPHAA IVRLIVKYFKKCVSAE
Sbjct: 361 VPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLPHAALIVRLIVKYFKKCVSAE 420

Query: 505 LRVKAYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRELPQ 564
           LRVK YAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRE PQ
Sbjct: 421 LRVKVYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQREFPQ 480

Query: 565 HHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALRTEEG 624
           HHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALRTEEG
Sbjct: 481 HHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALRTEEG 540

Query: 625 WRAKVKHLLITAATSSFEWPLASDDVFFQTNESIEVWADYQLAAFR-------------- 684
           WRAKV+HLLITAATSSFEWPLASDDVFFQTNESIEVWADYQLAAFR              
Sbjct: 541 WRAKVEHLLITAATSSFEWPLASDDVFFQTNESIEVWADYQLAAFRALLASFLSAVHIRP 600

Query: 685 -------------KQELGTKLPEFCAHALLALEVLIHPRVLPLSDFLPVHLSSPEPQATY 744
                        KQELGTKLPEFCAHALLALEVLIHPRVLPLSDF PVHLSSPEPQATY
Sbjct: 601 LALAQGLDLFRRGKQELGTKLPEFCAHALLALEVLIHPRVLPLSDFSPVHLSSPEPQATY 660

Query: 745 KIPEDMYFGGMNSSKSLKIIDTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDASNEINN 804
           KIPEDMY GGMNS KSLKI DTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDA NEINN
Sbjct: 661 KIPEDMYIGGMNSGKSLKINDTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDAGNEINN 720

Query: 805 NATTYNTSNNLETGPSADALQTTETPKRTEQEDTAAAITDAAGIVEKDDVFANARMNSSP 839
           N TTYNTSNNLETGPSADALQTTETPKRT+QEDTAAAITDAAGIVEKDDVFANARMNSSP
Sbjct: 721 NVTTYNTSNNLETGPSADALQTTETPKRTKQEDTAAAITDAAGIVEKDDVFANARMNSSP 780

BLAST of Cp4.1LG09g03660 vs. NCBI nr
Match: XP_022968338.1 (proline-, glutamic acid- and leucine-rich protein 1 [Cucurbita maxima])

HSP 1 Score: 1384 bits (3581), Expect = 0.0
Identity = 727/808 (89.98%), Positives = 737/808 (91.21%), Query Frame = 0

Query: 85  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFSDHSELSKVVSMVKIHNLLSESSSS 144
           MAAFNLV NMYDPALKPRL+HKLLREHVPDDKQTF+DHSELSKVVSMVKIHNLLSESSSS
Sbjct: 1   MAAFNLVVNMYDPALKPRLIHKLLREHVPDDKQTFNDHSELSKVVSMVKIHNLLSESSSS 60

Query: 145 MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWLH 204
           MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWLH
Sbjct: 61  MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWLH 120

Query: 205 KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA 264
           KLLPHLQTDS FLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTE 
Sbjct: 121 KLLPHLQTDSLFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEV 180

Query: 265 VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG 324
           VLD AVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG
Sbjct: 181 VLDTAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG 240

Query: 325 DEDSWTVLMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS 384
           DEDSWTVLMQKILLSID+HLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS
Sbjct: 241 DEDSWTVLMQKILLSIDVHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS 300

Query: 385 FDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERMLMVDGSLPPAS 444
           FDKLTKSSE+MLTSIISTLMFCCSTMITSSYP+QVAVPIRPLLALVERML VDGSLPPAS
Sbjct: 301 FDKLTKSSEQMLTSIISTLMFCCSTMITSSYPNQVAVPIRPLLALVERMLTVDGSLPPAS 360

Query: 445 VPFMTSLQQESI------------------------QLLPHAAFIVRLIVKYFKKCVSAE 504
           VPFMTSLQQES+                        QLLPHAAFIVRLIVKYFKKCVSAE
Sbjct: 361 VPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLPHAAFIVRLIVKYFKKCVSAE 420

Query: 505 LRVKAYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRELPQ 564
           LRVK YAVAKLLMMSLGVGMAASL RDVIDNVL DLNPVDNESC PSSVNPKDAQ ELPQ
Sbjct: 421 LRVKVYAVAKLLMMSLGVGMAASLTRDVIDNVLADLNPVDNESCTPSSVNPKDAQGELPQ 480

Query: 565 HHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALRTEEG 624
           HHKKRKRPLVPTSFKEQHEGHGSRDITSS MSTSVPLRIAALEALETLLTLAGALRTEEG
Sbjct: 481 HHKKRKRPLVPTSFKEQHEGHGSRDITSSFMSTSVPLRIAALEALETLLTLAGALRTEEG 540

Query: 625 WRAKVKHLLITAATSSFEWPLASDDVFFQTNESIEVWADYQLAAFR-------------- 684
           WRAKV+HLLITAATSSFEWPLASDD+FFQTNESIEVWADYQLAAFR              
Sbjct: 541 WRAKVEHLLITAATSSFEWPLASDDIFFQTNESIEVWADYQLAAFRALLASFLSAVHIRP 600

Query: 685 -------------KQELGTKLPEFCAHALLALEVLIHPRVLPLSDFLPVHLSSPEPQATY 744
                        KQELGTKLP+FCAHALLALEVLIHPRVLPLSDF PVHLSSPEPQATY
Sbjct: 601 LALAQGLELFRRGKQELGTKLPKFCAHALLALEVLIHPRVLPLSDFSPVHLSSPEPQATY 660

Query: 745 KIPEDMYFGGMNSSKSLKIIDTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDASNEINN 804
           KIPEDMYFGGMNS KSLKI DT  MDQSAPDLDDDFLYDREVADDIEEAPIRDA NEINN
Sbjct: 661 KIPEDMYFGGMNSGKSLKINDTRDMDQSAPDLDDDFLYDREVADDIEEAPIRDAGNEINN 720

Query: 805 NATTYNTSNNLETGPSADALQTTETPKRTEQEDTAAAITDAAGIVEKDDVFANARMNSSP 841
           N TTYNTSNNLETGPSADALQTTETPKRTEQEDTAAAITDAAGIVEKDDVFANARM+SS 
Sbjct: 721 NVTTYNTSNNLETGPSADALQTTETPKRTEQEDTAAAITDAAGIVEKDDVFANARMSSSL 780

BLAST of Cp4.1LG09g03660 vs. ExPASy TrEMBL
Match: A0A6J1FZZ0 (proline-, glutamic acid- and leucine-rich protein 1-like OS=Cucurbita moschata OX=3662 GN=LOC111449429 PE=3 SV=1)

HSP 1 Score: 1410 bits (3649), Expect = 0.0
Identity = 739/808 (91.46%), Positives = 744/808 (92.08%), Query Frame = 0

Query: 85  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFSDHSELSKVVSMVKIHNLLSESSSS 144
           MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTF+DHSELSKVVSMVKIHNLLSESSSS
Sbjct: 1   MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFNDHSELSKVVSMVKIHNLLSESSSS 60

Query: 145 MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWLH 204
           MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLG TCQQCSSSRFLASYADWLH
Sbjct: 61  MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGTTCQQCSSSRFLASYADWLH 120

Query: 205 KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA 264
           KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA
Sbjct: 121 KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA 180

Query: 265 VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG 324
           VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSG CSFNMLKKLAHCLASLPKSKG
Sbjct: 181 VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGNCSFNMLKKLAHCLASLPKSKG 240

Query: 325 DEDSWTVLMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS 384
           DEDSWT+LMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS
Sbjct: 241 DEDSWTILMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS 300

Query: 385 FDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERMLMVDGSLPPAS 444
           FDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERML VDGSLPPAS
Sbjct: 301 FDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERMLTVDGSLPPAS 360

Query: 445 VPFMTSLQQESI------------------------QLLPHAAFIVRLIVKYFKKCVSAE 504
           VPFMTSLQQES+                        QLLPHAAFIVRLIVKYFKKCVSAE
Sbjct: 361 VPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLPHAAFIVRLIVKYFKKCVSAE 420

Query: 505 LRVKAYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRELPQ 564
           LRVK YAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRELPQ
Sbjct: 421 LRVKVYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRELPQ 480

Query: 565 HHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALRTEEG 624
           HHKKRKRPLVPTSFKEQHEGHGSRDITSSC STSVPLRIAALEALETLLTLAGALRTEEG
Sbjct: 481 HHKKRKRPLVPTSFKEQHEGHGSRDITSSCTSTSVPLRIAALEALETLLTLAGALRTEEG 540

Query: 625 WRAKVKHLLITAATSSFEWPLASDDVFFQTNESIEVWADYQLAAFR-------------- 684
           W AKV+HLLITAA SSFEWPLASDDVFFQTNESIEVWADYQLAAFR              
Sbjct: 541 WHAKVEHLLITAAMSSFEWPLASDDVFFQTNESIEVWADYQLAAFRALLASFLSAVHIRP 600

Query: 685 -------------KQELGTKLPEFCAHALLALEVLIHPRVLPLSDFLPVHLSSPEPQATY 744
                        KQELGTKLPEFCAHALLALEVLIHPRVLPLSDF PVHLSSPEPQATY
Sbjct: 601 LALAQGLDLFRRGKQELGTKLPEFCAHALLALEVLIHPRVLPLSDFSPVHLSSPEPQATY 660

Query: 745 KIPEDMYFGGMNSSKSLKIIDTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDASNEINN 804
           KIPEDMY GGMNS KSLKI DTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDA NEINN
Sbjct: 661 KIPEDMYIGGMNSGKSLKINDTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDAGNEINN 720

Query: 805 NATTYNTSNNLETGPSADALQTTETPKRTEQEDTAAAITDAAGIVEKDDVFANARMNSSP 841
           N TTYNTSNNLETGPSADALQTTETPKRT+QEDTAAAITDAAGIVEKDDVFANARMNSSP
Sbjct: 721 NVTTYNTSNNLETGPSADALQTTETPKRTKQEDTAAAITDAAGIVEKDDVFANARMNSSP 780

BLAST of Cp4.1LG09g03660 vs. ExPASy TrEMBL
Match: A0A6J1HXR1 (proline-, glutamic acid- and leucine-rich protein 1 OS=Cucurbita maxima OX=3661 GN=LOC111467603 PE=3 SV=1)

HSP 1 Score: 1384 bits (3581), Expect = 0.0
Identity = 727/808 (89.98%), Positives = 737/808 (91.21%), Query Frame = 0

Query: 85  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFSDHSELSKVVSMVKIHNLLSESSSS 144
           MAAFNLV NMYDPALKPRL+HKLLREHVPDDKQTF+DHSELSKVVSMVKIHNLLSESSSS
Sbjct: 1   MAAFNLVVNMYDPALKPRLIHKLLREHVPDDKQTFNDHSELSKVVSMVKIHNLLSESSSS 60

Query: 145 MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWLH 204
           MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWLH
Sbjct: 61  MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWLH 120

Query: 205 KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA 264
           KLLPHLQTDS FLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTE 
Sbjct: 121 KLLPHLQTDSLFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEV 180

Query: 265 VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG 324
           VLD AVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG
Sbjct: 181 VLDTAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG 240

Query: 325 DEDSWTVLMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS 384
           DEDSWTVLMQKILLSID+HLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS
Sbjct: 241 DEDSWTVLMQKILLSIDVHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS 300

Query: 385 FDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERMLMVDGSLPPAS 444
           FDKLTKSSE+MLTSIISTLMFCCSTMITSSYP+QVAVPIRPLLALVERML VDGSLPPAS
Sbjct: 301 FDKLTKSSEQMLTSIISTLMFCCSTMITSSYPNQVAVPIRPLLALVERMLTVDGSLPPAS 360

Query: 445 VPFMTSLQQESI------------------------QLLPHAAFIVRLIVKYFKKCVSAE 504
           VPFMTSLQQES+                        QLLPHAAFIVRLIVKYFKKCVSAE
Sbjct: 361 VPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLPHAAFIVRLIVKYFKKCVSAE 420

Query: 505 LRVKAYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRELPQ 564
           LRVK YAVAKLLMMSLGVGMAASL RDVIDNVL DLNPVDNESC PSSVNPKDAQ ELPQ
Sbjct: 421 LRVKVYAVAKLLMMSLGVGMAASLTRDVIDNVLADLNPVDNESCTPSSVNPKDAQGELPQ 480

Query: 565 HHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALRTEEG 624
           HHKKRKRPLVPTSFKEQHEGHGSRDITSS MSTSVPLRIAALEALETLLTLAGALRTEEG
Sbjct: 481 HHKKRKRPLVPTSFKEQHEGHGSRDITSSFMSTSVPLRIAALEALETLLTLAGALRTEEG 540

Query: 625 WRAKVKHLLITAATSSFEWPLASDDVFFQTNESIEVWADYQLAAFR-------------- 684
           WRAKV+HLLITAATSSFEWPLASDD+FFQTNESIEVWADYQLAAFR              
Sbjct: 541 WRAKVEHLLITAATSSFEWPLASDDIFFQTNESIEVWADYQLAAFRALLASFLSAVHIRP 600

Query: 685 -------------KQELGTKLPEFCAHALLALEVLIHPRVLPLSDFLPVHLSSPEPQATY 744
                        KQELGTKLP+FCAHALLALEVLIHPRVLPLSDF PVHLSSPEPQATY
Sbjct: 601 LALAQGLELFRRGKQELGTKLPKFCAHALLALEVLIHPRVLPLSDFSPVHLSSPEPQATY 660

Query: 745 KIPEDMYFGGMNSSKSLKIIDTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDASNEINN 804
           KIPEDMYFGGMNS KSLKI DT  MDQSAPDLDDDFLYDREVADDIEEAPIRDA NEINN
Sbjct: 661 KIPEDMYFGGMNSGKSLKINDTRDMDQSAPDLDDDFLYDREVADDIEEAPIRDAGNEINN 720

Query: 805 NATTYNTSNNLETGPSADALQTTETPKRTEQEDTAAAITDAAGIVEKDDVFANARMNSSP 841
           N TTYNTSNNLETGPSADALQTTETPKRTEQEDTAAAITDAAGIVEKDDVFANARM+SS 
Sbjct: 721 NVTTYNTSNNLETGPSADALQTTETPKRTEQEDTAAAITDAAGIVEKDDVFANARMSSSL 780
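
The percentages in an HSP summary line follow directly from the raw counts (identical or positive positions divided by alignment length). The sketch below recomputes them for the Cucurbita maxima hit above. The function names `parse_hsp_summary` and `percent` are hypothetical helpers written for this illustration, not part of any BLAST tool, and the regular expression assumes the summary line layout shown in this report.

```python
import re

# Matches a summary line such as:
#   Identity = 727/808 (89.98%), Positives = 737/808 (91.21%), Query Frame = 0
# The optional "i" tolerates a misspelled "Postives" label in some reports.
SUMMARY_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_summary(line):
    """Extract raw counts and reported percentages from an HSP summary line."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(length),
        "positives_length": int(pos_length),
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }


line = "Identity = 727/808 (89.98%), Positives = 737/808 (91.21%), Query Frame = 0"
hsp = parse_hsp_summary(line)
identity = percent(hsp["identical"], hsp["length"])
positives = percent(hsp["positives"], hsp["positives_length"])
print(identity, positives)
```

The recomputed values can be compared with the reported ones as a sanity check when copying figures into a summary table.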

BLAST of Cp4.1LG09g03660 vs. ExPASy TrEMBL
Match: A0A6J1GYU8 (proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111458494 PE=3 SV=1)

HSP 1 Score: 1176 bits (3043), Expect = 0.0
Identity = 639/819 (78.02%), Positives = 684/819 (83.52%), Query Frame = 0

Query: 85  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFSDHSELSKVVSMVKIHNLLSESSSS 144
           MAAFNLVANMYDPALKPRL+HKLLREHVPDDK+ F+DHSELSKVVSM+KIHNLLSES  S
Sbjct: 1   MAAFNLVANMYDPALKPRLIHKLLREHVPDDKRAFNDHSELSKVVSMIKIHNLLSESLHS 60

Query: 145 MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWLH 204
           MDQKL+DSWKSAVDSWVNRL +LLSNDMPDKCWAGIILLGVTCQQCSSSRFLASY +WLH
Sbjct: 61  MDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYTEWLH 120

Query: 205 KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA 264
           +LLPH+QTDSQFLKVA+CASISDLFLRLGRF +VKKDGTSCAGKVIQPVIKLLHDDNTEA
Sbjct: 121 RLLPHVQTDSQFLKVASCASISDLFLRLGRFQSVKKDGTSCAGKVIQPVIKLLHDDNTEA 180

Query: 265 VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG 324
           VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKI+SGKC  NMLKKLAHCLASLPKSKG
Sbjct: 181 VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIYSGKCGSNMLKKLAHCLASLPKSKG 240

Query: 325 DEDSWTVLMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS 384
           DEDSW++LMQKILLSID HLNEAFQGIGEDS+G+EV+RLLIPPGK PPPPLGCNS +E S
Sbjct: 241 DEDSWSLLMQKILLSIDSHLNEAFQGIGEDSKGHEVLRLLIPPGKNPPPPLGCNSLSEDS 300

Query: 385 FDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERMLMVDGSLPPAS 444
           FDK+T+SSERMLT  ISTLMFCCSTMITSSY HQVAVPIRPLLA+V+R+L VDGSLPP S
Sbjct: 301 FDKITRSSERMLTPSISTLMFCCSTMITSSYNHQVAVPIRPLLAIVKRVLTVDGSLPPTS 360

Query: 445 VPFMTSLQQESI------------------------QLLPHAAFIVRLIVKYFKKCVSAE 504
           VPFMTSLQQES+                        QLLPHAA IVRLIVKYFKKCVSAE
Sbjct: 361 VPFMTSLQQESMCSELPALHSDSLDLLIAIVKRLRSQLLPHAASIVRLIVKYFKKCVSAE 420

Query: 505 LRVKAYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRELPQ 564
           LRVK YAVAKLLMMSLGVGMAASLARDVIDN LVDLNPVDNESC PSSVNPK+AQREL Q
Sbjct: 421 LRVKVYAVAKLLMMSLGVGMAASLARDVIDNALVDLNPVDNESCDPSSVNPKEAQRELLQ 480

Query: 565 HHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALRTEEG 624
           H+KKRKRP VPTS K QHE HGS DITSSCMSTSV LRIAALEALETLLTLAGALRTEEG
Sbjct: 481 HYKKRKRPSVPTSMKGQHERHGSGDITSSCMSTSVHLRIAALEALETLLTLAGALRTEEG 540

Query: 625 WRAKVKHLLITAATSSFEWPLASDDVFFQTNESIEVWADYQLAAFR-------------- 684
           WRAKV+HLLITAATSSFEWP ASDD+FF+ NE IEVWADYQLAAFR              
Sbjct: 541 WRAKVEHLLITAATSSFEWPQASDDIFFRANEFIEVWADYQLAAFRALLASFLSSVHVRP 600

Query: 685 -------------KQELGTKLPEFCAHALLALEVLIHPRVLPLSDFLPVHLSSPEPQATY 744
                        KQE G+KL EFCAHALLA+EVLIHPRVLPLSDFLPV LSSPEPQATY
Sbjct: 601 LALAQGLELFRKGKQENGSKLAEFCAHALLAMEVLIHPRVLPLSDFLPVRLSSPEPQATY 660

Query: 745 KIPEDMYFGGMNSSKSLKIIDTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDAS-NEIN 804
           K  EDMYFG M SSK LKI DT GM+QS P+LDD+F YDR  A++IEEAPIRDA+ N IN
Sbjct: 661 KFQEDMYFGSMTSSKLLKI-DTQGMEQSDPELDDEFSYDRVFANNIEEAPIRDATGNPIN 720

Query: 805 NNATTYNTSNNLETGPSADALQTTETPKRTEQEDTAAAITDAAGIVEKDDVFANARMNSS 841
           +   TYN SN+LE  P A+ L + ETPK TEQ  TA A+T+  G+VEK DVFA      S
Sbjct: 721 DYEMTYNISNDLEKEPYANGLVSIETPKTTEQAATA-AVTEV-GVVEKVDVFA------S 780

BLAST of Cp4.1LG09g03660 vs. ExPASy TrEMBL
Match: A0A6J1DBX6 (proline-, glutamic acid- and leucine-rich protein 1 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111018683 PE=3 SV=1)

HSP 1 Score: 1167 bits (3018), Expect = 0.0
Identity = 626/818 (76.53%), Positives = 678/818 (82.89%), Query Frame = 0

Query: 85  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFSDHSELSKVVSMVKIHNLLSESSSS 144
           MAAFNLVANMYDPALKPRLLHKLLREHVPDDK+TF DHSELS  VSM+KIHNLLSESSSS
Sbjct: 1   MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSNAVSMIKIHNLLSESSSS 60

Query: 145 MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWLH 204
            DQKL+DSWKSAVDSWV+RL +LLSNDMPDKCWAGIILLGVTCQQCSSSRFLASY +WL 
Sbjct: 61  KDQKLIDSWKSAVDSWVDRLFLLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYTEWLQ 120

Query: 205 KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA 264
           KLLPH+QTDSQFLKVA CAS+SDLF RL RF NVKKDGTSCAGK+IQPV+KLLHDDN+EA
Sbjct: 121 KLLPHIQTDSQFLKVAACASVSDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDDNSEA 180

Query: 265 VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG 324
           V +AAVNLL TLIAFFPFT+HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG
Sbjct: 181 VWEAAVNLLHTLIAFFPFTVHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG 240

Query: 325 DEDSWTVLMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS 384
           DEDSW++LMQKILLSID HLNEAFQGIGEDSRG+EVVRLLIPPGK+PPPPLGCNS   GS
Sbjct: 241 DEDSWSLLMQKILLSIDNHLNEAFQGIGEDSRGSEVVRLLIPPGKDPPPPLGCNSLPGGS 300

Query: 385 FDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERMLMVDGSLPPAS 444
           FDK+TKSSER+LTS ISTLMFCCSTMITSSYPHQVAVPIRPLLALVER+LMVDGSLPP S
Sbjct: 301 FDKITKSSERLLTSSISTLMFCCSTMITSSYPHQVAVPIRPLLALVERVLMVDGSLPPTS 360

Query: 445 VPFMTSLQQESI------------------------QLLPHAAFIVRLIVKYFKKCVSAE 504
           VPFMTSLQQESI                        QLLP+AA IVRLIVKYFKKCVSAE
Sbjct: 361 VPFMTSLQQESICSELPTLHSNCLDLLIAIIKSLRSQLLPYAASIVRLIVKYFKKCVSAE 420

Query: 505 LRVKAYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRELPQ 564
           LRVK YAVAKLLMMSLGVGMAASLARDV++N L+DLNPVDNE+ APSSVN KD QRE  Q
Sbjct: 421 LRVKVYAVAKLLMMSLGVGMAASLARDVMENALIDLNPVDNENFAPSSVNSKDTQREFMQ 480

Query: 565 HHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALRTEEG 624
           HHKKRKRP VPTS ++Q E HGS D+ +  MST VPLRIAALEALETLLTLAGALR+EEG
Sbjct: 481 HHKKRKRPSVPTSLQQQQERHGSGDVDNIIMSTPVPLRIAALEALETLLTLAGALRSEEG 540

Query: 625 WRAKVKHLLITAATSSFEWPLASDDVFFQTNESIEVWADYQLAAFR-------------- 684
           WR K++ LL TAATSSF+WP ASD+  FQT+ESIEVW DYQLAAFR              
Sbjct: 541 WRGKIEQLLATAATSSFDWPRASDNGSFQTDESIEVWTDYQLAAFRTLLASFLSAVHVRP 600

Query: 685 -------------KQELGTKLPEFCAHALLALEVLIHPRVLPLSDFLPVHLSSPEPQATY 744
                        KQE GTKL EFCAHALLA+EVLIHPRVLPLSDFLPVHLSS E Q+TY
Sbjct: 601 LALAQGLELFRRGKQESGTKLAEFCAHALLAMEVLIHPRVLPLSDFLPVHLSSSERQSTY 660

Query: 745 KIPEDMYFGGMNSSKSLKIIDTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDASNEINN 804
           K  E+M+F G+NSSK LKI    G++QSAPDLDDDFL++ EVADDIEEAPIR+A NEIN+
Sbjct: 661 KFEENMFFDGLNSSKVLKIDTMQGVEQSAPDLDDDFLFNNEVADDIEEAPIREAGNEIND 720

Query: 805 NATTYNTSNNLETGPSADALQTTETPKRTEQEDTAAAITDAAGIVEKDDVFANARMNSSP 841
             TTYNTSN+     S     +TETPKR+EQE TAAAITD  G+VEKDD F NA +N SP
Sbjct: 721 GETTYNTSNDSSKEASVLGPSSTETPKRSEQE-TAAAITDV-GVVEKDDAFGNASINDSP 780

BLAST of Cp4.1LG09g03660 vs. ExPASy TrEMBL
Match: A0A6J1GXZ0 (proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111458494 PE=3 SV=1)

HSP 1 Score: 1150 bits (2975), Expect = 0.0
Identity = 627/818 (76.65%), Positives = 667/818 (81.54%), Query Frame = 0

Query: 85  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFSDHSELSKVVSMVKIHNLLSESSSS 144
           MAAFNLVANMYDPALKPRL+HKLLREHVPDDK+ F+DHSELSKVVSM+KIHNLLSES  S
Sbjct: 1   MAAFNLVANMYDPALKPRLIHKLLREHVPDDKRAFNDHSELSKVVSMIKIHNLLSESLHS 60

Query: 145 MDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWLH 204
           MDQKL+DSWKSAVDSWVNRL +LLSNDMPDKCWAGIILLGVTCQQCSSSRFLASY +WLH
Sbjct: 61  MDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYTEWLH 120

Query: 205 KLLPHLQTDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDNTEA 264
           +LLPH+QTDSQFLKVA+CASISDLFLRLGRF +VKKDGTSCAGKVIQPVIKLLHDDNTEA
Sbjct: 121 RLLPHVQTDSQFLKVASCASISDLFLRLGRFQSVKKDGTSCAGKVIQPVIKLLHDDNTEA 180

Query: 265 VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKG 324
           VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKI+SGKC  NMLKKLAHCLASLPKSKG
Sbjct: 181 VLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIYSGKCGSNMLKKLAHCLASLPKSKG 240

Query: 325 DEDSWTVLMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSAEGS 384
           DEDSW++LMQKILLSID HLNEAFQGIGEDS+G+EV+RLLIPPGK PPPPLGCNS +E S
Sbjct: 241 DEDSWSLLMQKILLSIDSHLNEAFQGIGEDSKGHEVLRLLIPPGKNPPPPLGCNSLSEDS 300

Query: 385 FDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERMLMVDGSLPPAS 444
           FDK+T+SSERMLT  ISTLMFCCSTMITSSY HQVAVPIRPLLA+V+R+L VDGSLPP S
Sbjct: 301 FDKITRSSERMLTPSISTLMFCCSTMITSSYNHQVAVPIRPLLAIVKRVLTVDGSLPPTS 360

Query: 445 VPFMTSLQQESI------------------------QLLPHAAFIVRLIVKYFKKCVSAE 504
           VPFMTSLQQES+                        QLLPHAA IVRLIVKYFKKCVSAE
Sbjct: 361 VPFMTSLQQESMCSELPALHSDSLDLLIAIVKRLRSQLLPHAASIVRLIVKYFKKCVSAE 420

Query: 505 LRVKAYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESCAPSSVNPKDAQRELPQ 564
           LRVK YAVAKLLMMSLGVGMAASLARDVIDN LVDLNPVDNESC PSSVNPK+AQREL Q
Sbjct: 421 LRVKVYAVAKLLMMSLGVGMAASLARDVIDNALVDLNPVDNESCDPSSVNPKEAQRELLQ 480

Query: 565 HHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALRTEEG 624
           H+KKRKRP VPTS K QHE HGS DITSSCMSTSV LRIAALEALETLLTLAGALRTEEG
Sbjct: 481 HYKKRKRPSVPTSMKGQHERHGSGDITSSCMSTSVHLRIAALEALETLLTLAGALRTEEG 540

Query: 625 WRAKVKHLLITAATSSFEWPLASDDVFFQTNESIEVWADYQLAAFR-------------- 684
           WRAKV+HLLITAATSSFEWP ASDD+FF+ NE IEVWADYQLAAFR              
Sbjct: 541 WRAKVEHLLITAATSSFEWPQASDDIFFRANEFIEVWADYQLAAFRALLASFLSSVHVRP 600

Query: 685 -------------KQELGTKLPEFCAHALLALEVLIHPRVLPLSDFLPVHLSSPEPQATY 744
                        KQE G+KL EFCAHALLA+EVLIHPRVLPLSDFLPV LSSPEPQATY
Sbjct: 601 LALAQGLELFRKGKQENGSKLAEFCAHALLAMEVLIHPRVLPLSDFLPVRLSSPEPQATY 660

Query: 745 KIPEDMYFGGMNSSKSLKIIDTLGMDQSAPDLDDDFLYDREVADDIEEAPIRDASNEINN 804
           K  EDMYFG M SSK LKI DT GM+QS P+LDD+F YDR  A++IEEAPIRDA      
Sbjct: 661 KFQEDMYFGSMTSSKLLKI-DTQGMEQSDPELDDEFSYDRVFANNIEEAPIRDA------ 720

Query: 805 NATTYNTSNNLETGPSADALQTTETPKRTEQEDTAAAITDAAGIVEKDDVFANARMNSSP 841
                                 TETPK TEQ  TA A+T+  G+VEK DVFA      SP
Sbjct: 721 ----------------------TETPKTTEQAATA-AVTEV-GVVEKVDVFA------SP 780

BLAST of Cp4.1LG09g03660 vs. TAIR 10
Match: AT1G30240.2 (unknown protein; Has 169 Blast hits to 168 proteins in 75 species: Archae - 0; Bacteria - 0; Metazoa - 49; Fungi - 68; Plants - 46; Viruses - 0; Other Eukaryotes - 6 (source: NCBI BLink). )

HSP 1 Score: 499.2 bits (1284), Expect = 6.5e-141
Identity = 295/674 (43.77%), Positives = 408/674 (60.53%), Query Frame = 0

Query: 85  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFSDHSELSKVVSMVKIHNLLSES-SS 144
           MA+F    +M D  LKP++L  LL E+VP++KQ  ++   LSKVVS +  H LLSES  +
Sbjct: 1   MASFERFDDMCDLRLKPKILRNLLSEYVPNEKQPLTNFLSLSKVVSTISTHKLLSESPPA 60

Query: 145 SMDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWL 204
           S+DQKL    KSAVD WV RL  L+S+DMPDK W GI L+GVTCQ+CSS RF  SY+ W 
Sbjct: 61  SIDQKLHAKSKSAVDDWVARLSALISSDMPDKSWVGICLIGVTCQECSSDRFFKSYSVWF 120

Query: 205 HKLLPHLQ--TDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDN 264
           + LL HL+    S+ ++VA+C SISDL  RL RF N KKD  S A K+I P+IKLL +D+
Sbjct: 121 NSLLSHLKNPASSRIVRVASCTSISDLLTRLSRFSNTKKDAVSHASKLILPIIKLLDEDS 180

Query: 265 TEAVLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPK 324
           +EA+L+  V+LL T++  FP   H +YD  EAAI SKIFS K S NMLKK AH LA LPK
Sbjct: 181 SEALLEGIVHLLSTIVLLFPAAFHSNYDKIEAAIASKIFSAKTSSNMLKKFAHFLALLPK 240

Query: 325 SKGDEDSWTVLMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSA 384
           +KGDE +W+++MQK+L+SI++HLN  FQG+ E+++G + ++ L PPGK+ P PLG     
Sbjct: 241 AKGDEGTWSLMMQKLLISINVHLNNFFQGLEEETKGTKAIQRLTPPGKDSPLPLG---GQ 300

Query: 385 EGSFDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERMLMVDGSLP 444
            G  D  + +SE+++ S +S LMFC STM+T+SY  ++ +P+  LL+LVER+L+V+GSLP
Sbjct: 301 NGGLDDASWNSEQLIVSRVSALMFCTSTMLTTSYKSKINIPVGSLLSLVERVLLVNGSLP 360

Query: 445 PASVPFMTSLQQESI------------------------QLLPHAAFIVRLIVKYFKKCV 504
            A  PFMT +QQE +                        QLLP+AA +VRL+  YF+KC 
Sbjct: 361 RAMSPFMTGIQQELVCAELPALHSSALELLCATLKSIRSQLLPYAASVVRLVSSYFRKCS 420

Query: 505 SAELRVKAYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESC-APSSVNPKDAQR 564
             ELR+K Y++   L+ S+G+GMA  LA++V+ N  VDL+    E+    SS NP     
Sbjct: 421 LPELRIKLYSITTTLLKSMGIGMAMQLAQEVVINASVDLDQTSLEAFDVASSKNPSLTNG 480

Query: 565 ELPQHHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALR 624
            L Q   K+++     S  E         I  + + + + L+IA+LEALETLLT+ GAL 
Sbjct: 481 ALLQACSKKRK----HSGVEAENSVFELRIPHNHLRSPISLKIASLEALETLLTIGGALG 540

Query: 625 TEEGWRAKVKHLLITAATSSFEWPLASDDVFF-QTNESIEVWADYQLAAFR--------- 684
           + + WR  V +LL+T AT++ E   A+ + +    N+S     ++QLAA R         
Sbjct: 541 S-DSWRESVDNLLLTTATNACEGRWANAETYHCLPNKSTTDLVEFQLAALRAFSASLVSP 600

Query: 685 ------------------KQELGTKLPEFCAHALLALEVLIHPRVLPLSDFLPVHLSSPE 703
                             K + G K+  FCAHAL++LEV+IHPR LPL            
Sbjct: 601 SRVRPAFLAEGLELFRTGKLQAGMKVAGFCAHALMSLEVVIHPRALPLDGL--------- 657

BLAST of Cp4.1LG09g03660 vs. TAIR 10
Match: AT1G30240.1 (FUNCTIONS IN: binding; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Armadillo-type fold (InterPro:IPR016024); Has 165 Blast hits to 164 proteins in 73 species: Archae - 0; Bacteria - 0; Metazoa - 47; Fungi - 68; Plants - 46; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 490.7 bits (1262), Expect = 2.3e-138
Identity = 294/674 (43.62%), Positives = 406/674 (60.24%), Query Frame = 0

Query: 85  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKQTFSDHSELSKVVSMVKIHNLLSES-SS 144
           MA+F    +M D  LKP++L  LL E+VP++KQ  ++   LSKVVS +  H LLSES  +
Sbjct: 1   MASFERFDDMCDLRLKPKILRNLLSEYVPNEKQPLTNFLSLSKVVSTISTHKLLSESPPA 60

Query: 145 SMDQKLMDSWKSAVDSWVNRLLVLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYADWL 204
           S+DQKL    KSAVD WV RL  L+S+DMPDK W GI L+GVTCQ+CSS RF  SY+ W 
Sbjct: 61  SIDQKLHAKSKSAVDDWVARLSALISSDMPDKSWVGICLIGVTCQECSSDRFFKSYSVWF 120

Query: 205 HKLLPHLQ--TDSQFLKVATCASISDLFLRLGRFPNVKKDGTSCAGKVIQPVIKLLHDDN 264
           + LL HL+    S+ ++VA+C SISDL  RL RF N KKD  S A K+I P+IKLL +D+
Sbjct: 121 NSLLSHLKNPASSRIVRVASCTSISDLLTRLSRFSNTKKDAVSHASKLILPIIKLLDEDS 180

Query: 265 TEAVLDAAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPK 324
           +EA+L+  V+LL T++  FP   H +YD  EAAI SKIFS K S NMLKK AH LA LPK
Sbjct: 181 SEALLEGIVHLLSTIVLLFPAAFHSNYDKIEAAIASKIFSAKTSSNMLKKFAHFLALLPK 240

Query: 325 SKGDEDSWTVLMQKILLSIDIHLNEAFQGIGEDSRGNEVVRLLIPPGKEPPPPLGCNSSA 384
           +KGDE +W+++MQK+L+SI++HLN  FQG+ E+++G + ++ L PPGK+ P PLG     
Sbjct: 241 AKGDEGTWSLMMQKLLISINVHLNNFFQGLEEETKGTKAIQRLTPPGKDSPLPLG---GQ 300

Query: 385 EGSFDKLTKSSERMLTSIISTLMFCCSTMITSSYPHQVAVPIRPLLALVERMLMVDGSLP 444
            G  D  + +SE+++ S +S LMFC STM+T+SY  ++ +P+  LL+LVER+L+V+GSLP
Sbjct: 301 NGGLDDASWNSEQLIVSRVSALMFCTSTMLTTSYKSKINIPVGSLLSLVERVLLVNGSLP 360

Query: 445 PASVPFMTSLQQESI------------------------QLLPHAAFIVRLIVKYFKKCV 504
            A  PFMT +QQE +                        QLLP+AA +VRL+  YF+KC 
Sbjct: 361 RAMSPFMTGIQQELVCAELPALHSSALELLCATLKSIRSQLLPYAASVVRLVSSYFRKCS 420

Query: 505 SAELRVKAYAVAKLLMMSLGVGMAASLARDVIDNVLVDLNPVDNESC-APSSVNPKDAQR 564
             ELR+K Y++   L+ S+  GMA  LA++V+ N  VDL+    E+    SS NP     
Sbjct: 421 LPELRIKLYSITTTLLKSM--GMAMQLAQEVVINASVDLDQTSLEAFDVASSKNPSLTNG 480

Query: 565 ELPQHHKKRKRPLVPTSFKEQHEGHGSRDITSSCMSTSVPLRIAALEALETLLTLAGALR 624
            L Q   K+++     S  E         I  + + + + L+IA+LEALETLLT+ GAL 
Sbjct: 481 ALLQACSKKRK----HSGVEAENSVFELRIPHNHLRSPISLKIASLEALETLLTIGGALG 540

Query: 625 TEEGWRAKVKHLLITAATSSFEWPLASDDVFF-QTNESIEVWADYQLAAFR--------- 684
           + + WR  V +LL+T AT++ E   A+ + +    N+S     ++QLAA R         
Sbjct: 541 S-DSWRESVDNLLLTTATNACEGRWANAETYHCLPNKSTTDLVEFQLAALRAFSASLVSP 600

Query: 685 ------------------KQELGTKLPEFCAHALLALEVLIHPRVLPLSDFLPVHLSSPE 703
                             K + G K+  FCAHAL++LEV+IHPR LPL            
Sbjct: 601 SRVRPAFLAEGLELFRTGKLQAGMKVAGFCAHALMSLEVVIHPRALPLDGL--------- 655

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_023542346.1 | 0.0 | 93.56 | proline-, glutamic acid- and leucine-rich protein 1-like [Cucurbita pepo subsp. ...
KAG7012950.1 | 0.0 | 91.58 | hypothetical protein SDJN02_25703 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022945087.1 | 0.0 | 91.46 | proline-, glutamic acid- and leucine-rich protein 1-like [Cucurbita moschata]
KAG6573885.1 | 0.0 | 91.44 | hypothetical protein SDJN03_27772, partial [Cucurbita argyrosperma subsp. sorori...
XP_022968338.1 | 0.0 | 89.98 | proline-, glutamic acid- and leucine-rich protein 1 [Cucurbita maxima]

Match Name | E-value | Identity (%) | Description
A0A6J1FZZ0 | 0.0 | 91.46 | proline-, glutamic acid- and leucine-rich protein 1-like OS=Cucurbita moschata O...
A0A6J1HXR1 | 0.0 | 89.98 | proline-, glutamic acid- and leucine-rich protein 1 OS=Cucurbita maxima OX=3661 ...
A0A6J1GYU8 | 0.0 | 78.02 | proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 OS=Cucurbita...
A0A6J1DBX6 | 0.0 | 76.53 | proline-, glutamic acid- and leucine-rich protein 1 isoform X1 OS=Momordica char...
A0A6J1GXZ0 | 0.0 | 76.65 | proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 OS=Cucurbita...

Match Name | E-value | Identity (%) | Description
AT1G30240.2 | 6.5e-141 | 43.77 | unknown protein; Has 169 Blast hits to 168 proteins in 75 species: Archae - 0; B...
AT1G30240.1 | 2.3e-138 | 43.62 | FUNCTIONS IN: binding; INVOLVED IN: biological_process unknown; LOCATED IN: cell...
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011989 | Armadillo-like helical | GENE3D | 1.25.10.10 | - | coord: 116..617; e-value: 4.3E-8; score: 33.5
IPR012583 | Pre-rRNA-processing protein RIX1, N-terminal | PFAM | PF08167 | RIX1 | coord: 104..305; e-value: 1.2E-38; score: 132.7
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 747..777
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 745..786
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 826..841
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 810..841
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 523..569
None | No IPR available | PANTHER | PTHR34105 | PROLINE-, GLUTAMIC ACID- AND LEUCINE-RICH PROTEIN 1 | coord: 88..823
None | No IPR available | PANTHER | PTHR34105:SF1 | PROLINE-, GLUTAMIC ACID- AND LEUCINE-RICH PROTEIN 1 | coord: 88..823
IPR016024 | Armadillo-type fold | SUPERFAMILY | 48371 | ARM repeat | coord: 132..602

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG09g03660.1 | Cp4.1LG09g03660.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0005634 | nucleus