Cp4.1LG02g14560 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG02g14560
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: DNA-directed RNA polymerase subunit beta
Location: Cp4.1LG02: 13910086 .. 13920651 (-)
RNA-Seq Expression: Cp4.1LG02g14560
Synteny: Cp4.1LG02g14560
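
The Location value above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Pattern for a value such as "Cp4.1LG02: 13910086 .. 13920651 (-)".
# The layout (seqid, colon, start, "..", end, strand in parentheses) is
# taken from the Location field of this record.
LOCATION_RE = re.compile(r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into seqid, start, end, strand and length.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

if __name__ == "__main__":
    print(parse_location("Cp4.1LG02: 13910086 .. 13920651 (-)"))
```

Under that assumption the gene spans 10566 bases on the minus strand of Cp4.1LG02.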
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTTAAATTACCAATTTAATCCACTCCATTGTCAAGAAAAAAAAGAAAAAAAAGAAAAAAAAGAAAAAAAAAGTAAAAAATTAATTTACCCCCTGTTGTTCAGTTGCTCAAAAAAGGGAAGGAGATGAAGCCATGGGAAGCCCAAGATTCCAATTTCCAAGCGCCAAGCAAACTCTTCGTTAAGAACTCGACCATAGCAACGCCAGCATGAGCTTCGACACTATGAGAGTTCAATCCTCAACCCCTCAAAGCCCCACTTCCAATCGGAGGCTCGAGCGTGCCGTCTCTTCTCGCAGAGCCCCTCATCACAGCGGCGATTTTGATGACGACGATGACCATGATGTTTCCAAGACGAAGAAGACCAGATTTTCCTTATTCACCCACCGCCTTTCCATTTACTTCACTCGAATCGGACCCATTTGGGCCTGCCTTGCGCTCGTTGGTTTAATCCTTCTCATGATTTCGTCCTTCATATTCTTTCACTCCCGCAGATTTGTTTGCGTTTCGTCTTATGATCCTGTTTCCCGCTCTGGGTTCTTTGGCATGGATGGGCTCGATTCCGATTTCGGTTCTCTTGGTGTGCCCTGGTGTAAGTTTTCTTGCTCATCACTTTTGGATCTGATCCATTGTAGTTCTCTGGAAATGTTGAGTTTTAGATCCGTGTTCCGGGTTTATGGCGGTCTGTATATAATGCAAAAAATGGGGAAGAACTATCGTTAATTTGTTTTGGGTATGATGAATTGGTTACATCGAAAACGGGGCGTTGTTTATAGAATTGAATCTGTGTTATCGTAGTTTGAATTTGTTTCGTATCATTTGAGCTTTTCCTAATTTGGAACATTAAATGTTTTCTTTACTCAATCTTGGACTGTCTATTGTGGCCTCAGCATTGATCCGTTCCTCGAGTAATGGCTAGGCTATTGTTGAGCAACCTTTAATTTCAAGTTATGTGATAGAAGTGAAGAGTCAAGGCAATATTTAATAATGTTAGATTTGTTGGTTTACGATTAAGATGCCTTATAAATGAACTACTATTCTCATTGACTTCACAGTCCTGGGTTCTAGTTCTAGTCTGATTAAATGTTATGAAATTCCTAAAATTTTTGCTTGTTTGAATCCTTATAATCCCATCCTTGGCAAAAGGCAGATCGAAACATGGAAAGACAGTTGAATGGACTGCAAAAGATTTACTAAAGGGCTTGGAAGAGTTTGTACCAATTTATGAGACTCGACCAATAAAGAACAACCTGTTTGGTATGGGCTTTGATCATAGCTTTGGCCTTTGGTTCATTGCTCGTTGGCTAAAACCAGATTTGATGATTGAAAGTGGCGCATTCAAGGGACATTCAACTTGGGTGTTGCGGCAAGCAATGCCAGACACACCGATTATTTCACTCTCACCCCGTCACCCTGAAAAATACTTGAAGAAGGGACCTGCTTATGTTGATGCTAACTGCACATATTTTGCTGGAAAGGACTTTGTAGATTTTGGAAGTGTTTCCTGGAATAGTGTGATGAAGCAACATGGAATTGATGATCTTAGCCGTGTTCTTGTATTTTTCGATGACCATCAGAATGAATTAAAGAGGTATCCTTGAGTTTCATTGCAATCAATTATTTTTGTTCACCGTCTATTTATGCACACTCTATTATGCTGAAAATTTGTCTTGTAATTTGAGGGCAGAATAAGTCAGGCTCTGAAAGCTGGCTTTCAACACCTTGTTTTTGAGGATAACTACGATACTGGCACAGGAGATCACTATTCTTTAAGGCAGATGTGCGATCAGTTCTATATTAGAGGTGCGTGCGCTTCCACTTGTATCTCACTCCAGTTTGCTTTGTTTTTATACTCTATATGAACACTCCTAGGTAAAAAGACAATACAATTTGTTATTCGCACCATCTATTTGGCTAATCAGATAATGGTTGTCTCATAAAGAGCCTTCCTTAATACAAAGAAACTGAAACCTAATGAGAAATTTAGAACTGATTTATGT
GATCCTCGTAATATTGTCCTCTTTGGGCTTTCCCTTTTGGGCTTCCCCTCAAGGTTTTAAAACGCGTCTCTTCCCTTCAAGGTTTTTGAACTTTCCCTTATGGGCTTCCCCATTACGCGTCTACTAGGGGAGGTTTTCACACCCTTATAAGGATTGCTTCGTTCCCCTCTCCAACCGATGTGGGATCTCACAATCCACCTCCCTTGGGGGCCAGCGTCCTCGCTGGCACACTGCCTGGAACCTGACTCTGATATCGTTTGTGACGGCTCAAGCCCACCACTAGCAGATATTATCCTCTTTGGGCTTTCCCCTCAAGGTTTTTAAAATGCGTCTTTTAGAGTTACATAGTGGAACCAATACTCATCCAAATATTGTCTTCTTTGGGCTTTCTCTTTCGGACTTCCCCTCAAGGTTTTTAAAATGCGTCTTTCAGAGTTACACAACAAAACGAATACTCATCCAAATACTGCCCTCTTTAGGCTTTCCATTTCGGGCTTCCCCTCAAGATTTTTGGGCTTTCCTATATGGGGTTCTCCATTGCACATCTACTGTGGAGAGGATCTAGCTCTACTTCAACTAGTGCCTTGCACCGTCCGGTGATTGGCTTTGATACCATTTGTAACGGCCCAAGCCCACCGCTAGCAAATACTGTCTTCTTTGGGTTTTTCTTTTGGGCTTCCCCTCAAGGTTTTAAAACGTGTCTCTTCCCCACACCCACCCTTATAAGGAATGTTTCGTTCTCCTCTCCAACTGATAGTGAGATTTTTAGGGCATATTAAAATCAAAATTATATCCCAGTGCTACTTCCAGTGATTTGGAAGTGATTCTGGCAGTCTGACAATTTAAGTTGAAGTTAATTTTAATGCTGGTATATTTCTGATTTAGGAGGTGGGCACAGTTGCTTCAAGGACAGCGATGAAGCCAGAATCAGAGCAAAAAGGAAGTTGTTCTGGGAAAAGGCAGTGGATGTAGAAGAACTTTGTGGACCGTATGAGGCTTGGTGGGGTGTCCGAGGCTACATGCGTGATGATTTTAACCACAGCAATAGGGCTATCTCCCACGCAGAGCACCTCCAGAACAGCAGGTACTTGGAGTCGATTCTTGATGTGTATTGGGAGCTCCCTCCAGTTGCTGGCCCTTCTTTAACACATCAGACTAGATACGATCCCGCTCGTGTTTCGATCCCTATTGTGGAAGATGGCAGGTACGGTTTGTTCCAGCGACTTGGTTTAACTCGACTTGAGACTTCTGTATTTAATGGATACACACAAATGGTCTATATTCAGATATCTAAACAATAGTTGTTAGGCATTTTACTCTGCTTGTCAATAGCTTGTAGTCTTTTTTCCCATACCACACTGTTATTCAAATTTTGAGTCAATTAAGGGACAAATGCTTTGTGTTCAAACATATTCCTAGTGGTTTTGTTGGTGGCAGTTTTATATATTAGCCACACTCAGATCCCTTGCTTGTCTTATCTCGCACTGTTGTAGCATATCGGACCTCAATTGAGTTTAGCAAACTGTTTACCCCTTTGATCTCCTGTTACCATTATATGTTAACGTCAAATTGTTGGTCTCGTGAAAATATAATTTGACCCATAAATTTTAACAAGTTTGGACCTTTTGATACATTTAATAAATTTATAGTTTATACTTTGCTTAACAAAATAATTGTTAATAAAATACGCTTTTTCTGTGCATTTAACTCAAGTATATTCCTATAGTCGTAAATGATTATTAATACCGTCCTTATACAATTAATGAATTTTAAATTAATACCAAACTCATTTTTTAATGGTTAAAAGTGAATATAATAATTTAATTTGATTTAATTGAGAGGGTGGCGTCATATTTATCCCAATTTGCCACATTTATTTTATTTTAGAGGAAAAAATAATTGGACCGTGTTTGGAGAAGAGGATCCAATTTCTGCTGCCCGTGAAAAAAAGGGCACACAGACCCACGCGGAGCCGATTGATGAGGTAGCGGTTGCTCCGC
GTTTGGCGTTCTGTACGAGAGCAGAGCGTCGCTCTTTACCAATCACTCCTCTTCGTCTCCACTTACTCTCCCGCCATTAACAGCGCCACTGTCCTCTCCCTTCTTCGCTTCTCGCTGCAACTCATCGATTTGTCATGGTCATAGAATCCTTAGCTGCAGCTCCCGCCAGCTCCTCCTCTTCGTCTTCGCCTTCCACCAGCACCTCCCATCTTTCTCCACCTCCTGAAGATGCATGGAGTCTTGCTTATCAGCGGCTACTTCCTCGCTGGAAATCTCTTTCCCAGTCTCACTTGGTACCTTCATCCCTCTCTCTTTGTCTCGCACCCAGTCTAATTTTGTCTGCTTATTTGTTGCCTTTTTTATTTGTTCATAGTTGCCTTTTGATTCTGTAGCGTAGGTCTTTTTGTCCGTTCGTTTCTATGTATATACTAGAGTTTGTTTCTAATTAATAGCTTTACCGAAGATCATTTCTGACACATGGTTGGGAAAGATTGCATTCGTGTTAATTGAGCTTCACTTTCTGTTTGTTTCGTTGTCCAAGCCATGCGAGAGTTAAATTTTCTTCTACTTGCACCTTGAACAGTATATGTATTGTAATTTTATATTGGTTGATACATCTGTACTAATTTATGATACTGGAGTCTTTTTTCTNTGTCCTCTCCCTTCTTCGCTTCTCGCTGCAACTCATCGATTTGTCATGGTCATAGAATCCTTAGCTGCAGCTCCCGCCAGCTCCTCCTCTTCGTCTTCGCCTTCCACCAGCACCTCCCATCTTTCTCCACCTCCTGAAGATGCATGGAGTCTTGCTTATCAGCGGCTACTTCCTCGCTGGAAATCTCTTTCCCAGTCTCACTTGGTACCTTCATCCCTCTCTCTTTGTCTCGCACCCAGTCTAATTTTGTCTGCTTATTTGTTGCCTTTTTTATTTGTTCATAGTTGCCTTTTGATTCTGTAGCGTAGGTCTTTTTGTCCGTTCGTTTCTATGTATATACTAGAGTTTGTTTCTAATTAATAGCTTTACCGAAGATCATTTCTGACACATGGTTGGGAAAGATTGCATTCGTGTTAATTGAGCTTCACTTTCTGTTTGTTTCGTTGTCCAAGCCATGCGAGAGTTAAATTTTCTTCTACTTGCACCTTGAACAGTATATGTATTGTAATTTTATATTGGTTGATACATCTGTACTAATTTATGATACTGGAGTCTTTTTTCTTTTTCTTTTTCTTTTTCTTTTCTATCTATTCTGCAGTCGCCAATTCCAATTTCGATATCGAAAGTTAATCAAGTGGATGCGGCGCGGCTAGATATTGAAATGTCAGCCATGTTGAAAGAGCAGTTGGTTAAGGTCTTCGCTTTGATGAAGGTATAAGTTGCTTGTTATTCGGGCATTGAGATGTAGATACTTGGCTAGTAAAGTTATTTAGGCGTCATAAAGTATTTTCTTCTACTTTGATGGATTAGCCAGGAATGTTGTTTCAATATGAAGCAGAGCTTGATGCTTTTCTGGAGTTCCTTATTTGGCGCTTTTCAATTTGGGTAGACAAGCCCACACCAGGAATTGCTCTGATGAATCTGCGGTATAGAGATGAGCGTGCAATGGAAATTCCGGGAAAAGGTGAAATGATCTAAACAGCTCCCTGCCATGCTCTACTTGTTTAACTGATTGTTACGGTCATTTCGGTGTAGGAATGTAATTGAATTTATCCAAACTGACATGTAAAATAAATAATTATAAATTGAGTATGAAGTTTAGAATTAAAACTGTAGCTACCATGGAACTGCTGTTGTCAAATCTTCATAGCAATCTGTGGCAGTCACTTACTAATTCCTTTATTGAAAACTATAATCAATTTTTTCTGGCTCTGGACTTCTTCTGCAATACAGTCAGAACTGGATTGGAAGGACCTGGCCTCACAGTTGCTCAAAAGATTTGGTATTGCGTGGCCACTGTGGGTGGTCAATACATTTGGACTCGGTTACAATCGTTTTCTGCTTTTCG
TAGATGGGGAGATTCAGAGCAGGTACTGGTTTTCAGAAACTCACCACCCCTACCATGTGGACTACCTTCTTGTTGTGCTGGTATGGTTGTTGAATAAAATAGGTATTCATATGTCCCTGTATTTTTTTTCAGAGGTCCTTGGCAAGGCGAGCATGGCTTTTGATTCAGCGCATTGAAGGAATATACAAAGCTGCTGCATTTGGCAACTTGCTCATATTTCTTTACACAGGAAGGTAGGTTTTCTGCATCTGATAAGCCGAATTAGAGTATCGATATCAAGATTTCAAATTTTCACATACCTTGATGAATATTTTTGTGAATAAATTTTTTTTATATAAATTATTTAAATTAATAATTAAGTTTTTCCCCCTTTTTCTTAACTTTTAAACTAAGTCATAGATCTTGTTATCAATGTTTTAAAAGGCTTAAGGCGGGCCTTGGGGCATGAGGTGGTATGAGGCATAAGCCTTATTTTAAATTTAAAAAATGTACATAAAACATAAAGCATAATATTCTCCTAACGATATAATATTTCTTAATGTACCGGACATACAAATGTTCAATAACTAATGCATAATAGTAGCAAGAACTAAAAACCAATTAGACATTTGGGAAATAAAAAGTCTTCTTCAAAAGAAAACAATGAATAATAGTAGTAAGTAGCTAGTTAGAACCATTAAGAAAGAGAACTTTAGGCGGGAAAACAGATCAGAACATAAAAATAGCTGCTAATAAAAAGAAGAAAACAGAAAAAAATTGAAAGGATAATTATTTAACTAATATTAAGAAAGAAAACAAAGAAGACAATCGAATGGGAAAGAAAACAATAATCAAGATTCAAATATGGAGAGAAAAAAAAAGCAAAATAGAATGGAAAGAAAAATAATAATCGAAACTCAAAAGAAAAGAAAAGAAAGTCGAGCAAAATGAAAATGGAAGTAGTAAGAAAAAATATTAAACTCGCTACACAAGTGAACAAATAAAACAAAGACAAGAACAACAAGATGAAGAAGAATGAAACTCGCTTTTGTTGTGTTGTGTTTTCGTCAAATAGTGATTCCTTGCAGTTTTTTTCTATGCGTACTGTTCACTTCAAATAACACACAAAAAGGAGAAGCTTTCGAGTATTTTGTTTACCAAGTACAGTTCTTTATGTTTTCTTCAATATTCTCGCATGGCCAACCACTACCTTAAATTAGTATGAAGACAACACCAACTTCAATTAAATGCAGCGTTGGATGATGCATTGCATGGGCACTTGGCCCACCAACCAATATCCAAAAGGCTTACGCCTTGTAGCCTTTGAGACTTACGCCTCTCTCAACAGTGGCGCGCAAGCCTCATATTACACTCTCAAGGCGTAAGCCTTAAAGCATGAATCTGCCGCATCACCTTGAGGCGTGCCGTAAGATTTATGCCATGACCGATTTTTAAAACATGGCCTGTTATTAGTTTTTATATTTATATTGTGATTGTATATAAAAGATATTAGAGGTATCGATTAATCTTCAATATTTATGTTGAACTCTCCAATTTACGGAAATATCATCATATTAATGGTTGACGGATATTTTCATCCTTGCTGATAAGTTAACTTTTCAGTGAACTGTTATACTCTGAGCGGCTTGAAGTGATTATTTGTTAAATATTAATGCTAAAAGGGCATTTTCGAAAGAATCTGCCCAAGTAAAAATCATACTTACCGGTGTCAAATGACATCTCTTCAACTATTATGTTCCAGTTTGCAGTGGAAAAGGGAAGTTATTTTTCCATCAATAATGTAGAAGTTTATGTCAGTAGAAACAAAAGAATGGATGCTTCATTTTTATTTATCATTTAAAAATTAATATTTAATGCATCTTGCTAATGTATAGGGAGCGTTGAGTTATTTTGTCATCAGTCTGAATTGGTATTTTGCAGATACATCTGACTGCCCCACTTTTCACTCTATTGCTGAATGTTGTGCATTGTTGGTTGTCACATATGCTTTTCACTCAGTT
GTATTGGTTAAATTGTTGTAAGTTTTTGTGTAGTGGAACTCAAATAAAAATTGGTGGCTTGTCGACTATCATAAGTAGTTTACTTTCTGCCCAACTCAGCATCGTATTTTTTTGTGTGGTGCAGGTATAGAAATCTTGTCGAGAGAGTTCTCAGAGCCAGGCTTGTTTATGGGAGTCCTAATATGAACAGGGCTGTCAGCTTTGAGTATATGAATCGCCAGTTAGTGTGGAATGAATTCTCGGTAATTTTCTGTCCTTTATAGTTAGTTGCAGAAGTATTGGAACTCTTATTTTGGCTTAGAACTTATTCACTTCTTGCTTCATTTCAACGAAATATATTCTCGGAATGTACTATTTTTCTTAACCATATGATGATTAGCTCACCCTCTTCCCTTTCCTCCTGAAGTAGAATTCTATTTCAAGTCGAGCATGGTATATATTCTCAGATTGTGTGCTGTTTTAAGTTTAGCATGCAAATCAACTACAAGTTTCAAGTAGTTCATATAGGATATTCATTTGGTTTACAGTGGTTAATATCGTCTACTATACTCATGCATGCATGAATATGGGAAGGGTTTCAATTTCCTTTTGCCCAATGCTTGATCACAATTTTATTATCCATAGTAAGGAACATTTGCCCTCGAGAATTTATTTTTCTGTGCTAATGGCAGGAAATGTTGCTGTTGCTTCTTCCTCTTCTAAATTCTTCCTCTGTTAGAAACTTTCTTCGTCCATTTTCCAAGGAGAAGTCCTCAAGCTCAGCCGAGGATGACAGTGCTTGTCCAATTTGCCTGGCAAGTCCTACGATTCCATTTCTGGCTTTGCCTTGTCAACACAGGTCAGGCCTTTTTATCACATCACTCTATCTTTTAAGCCAAAAAACTACTATTTGTTATATAAAAAAGGCAGAAACTTAACGATAGAATTAGGATGCATTTGTTCGAAAGACAGACAAAACAAGGAGGATACGTGACGATAGAATTATTTCTTTAATGAATATAGCTTAGACGTGCACATTACAGATTTCGTTCAAAAACAAAAAATTGACTATGATAATAAAATAAACGTTCTAAATATAACAACGTTGATTGATGTAATTAGTTCTAAATGTTAAAACTTCTTAAAGTGAAAGTATATATGCTAACCAAATAAACAAATTTTAAATAATTCAATGTCTTAACGTAAAGTTTTGTAATGAAAGTTCCAGAGAGATTTATGTCCATACGATCATATATTTAAAATAGAAACATAAGAAACAAGTCTATTTCGCATTCATCAGCTTTCACCAATGTTTTTGAGTTGTTTCGACAGTCCAGGCTTTTTATGTTCACAATTAGACATTGAAAAGAACAACAGAATATTTGATTTAAGATTTAAGATTTAAGATTCTGATTTGACTGTCGGTTATTGTCGACCAACGCTCGTAGACAAAGATGGCTGCATTTCATGTCTATATAATTGTTTACATTGAATCTCACAGGTTCAACTGGGGTGAGATGGAGAAAAGTATTGCTAACGTCCATTGGTTTTTTTTTTTTTTTTTTTTCTGACAGATACTGTTACTATTGCCTCCGAACACGATGCATGGCAGCTCAATCATTTAGATGTTCAAGATGCAGCGAGCCTGTGGTGGCCATGCAGCGGTATATCGAAGGCACTAGTGCAAATCCCAAACGGTAATCCCTGGGCAGAGGGAGCAAATACTATTTATAGTGATTATAATAAGAAAAAGGAAAATTGCTTCAATGTTGCAATTAATTTTTTTTTTTTTTGTTGCTTGAAGCATAAATGCTTACAGAAAATCAAATTCCCTTGAAAAGCATATATGTTGTATTGTGTAGTAGCCGTAGGGGGGCTCATTCTTTTTTAGTTTCCTAGTACATCTGTAGTATAGGTACTCGCTGATTATCTTGTGATTAATTCCCTGCAGTTACAAGAGGAAACATTATGCTGTCTTTCTTTCTTTTTCCTTTTTTTCTTATAAAGGCACAAAGAGAGGT
AGGGGAGTATTTCAACCCAAGATAGAAATTTATATATTTTCAGTTCCTGGTTCGACCATGGATCAATTAGTTTAGTCATGTATCCTCAACCAAGAGGTTAGATGTTCGAAGCCTCCTCAAAACATGTTCTGGAACTCGAAATCTTGTGCCATCCCTGGTCGATAATAAATCTTGTTTGTTTGGTATAAAATCAATTTTACCTGTTTAAACTTCATAATGATAAACTTTTGTGCAAGTTTAATTCTAGTTTTGTTGTTCGCCTATATAATTATTTTAACCTGTTTTATTGATCCTCTTCTCTAGTTCTGTGTCTATAGTCTGTGTAACAGGGCAATTGGACCACTTTAGACGGTGTAGATTGAAAATCAACACTTTCTGACTTCTTTGGTCTTTCCATTTCCACAGAGATGGATAAATCTTGGAATGAGTCTAATCATCAACAACTTTAGAATATTGGATTGATGATGTTTGAAATATTGGTGACAGTGGTTGCACTTTCAAAACATGTAATAATGGTAGTTTTGCACTTTCCACGCTTGTTTATTTTTTTCTCAGGACTAGGTGGT

mRNA sequence

ATGGTTGCTCAAAAAAGGGAAGGAGATGAAGCCATGGGAAGCCCAAGATTCCAATTTCCAAGCGCCAAGCAAACTCTTCAACTCGACCATAGCAACGCCAGCATGAGCTTCGACACTATGAGAGTTCAATCCTCAACCCCTCAAAGCCCCACTTCCAATCGGAGGCTCGAGCGTGCCGTCTCTTCTCGCAGAGCCCCTCATCACAGCGGCGATTTTGATGACGACGATGACCATGATGTTTCCAAGACGAAGAAGACCAGATTTTCCTTATTCACCCACCGCCTTTCCATTTACTTCACTCGAATCGGACCCATTTGGGCCTGCCTTGCGCTCGTTGGTTTAATCCTTCTCATGATTTCGTCCTTCATATTCTTTCACTCCCGCAGATTTGTTTGCGTTTCGTCTTATGATCCTGTTTCCCGCTCTGGGTTCTTTGGCATGGATGGGCTCGATTCCGATTTCGGTTCTCTTGGCAGATCGAAACATGGAAAGACAGTTGAATGGACTGCAAAAGATTTACTAAAGGGCTTGGAAGAGTTTGTACCAATTTATGAGACTCGACCAATAAAGAACAACCTGTTTGGTATGGGCTTTGATCATAGCTTTGGCCTTTGGTTCATTGCTCGTTGGCTAAAACCAGATTTGATGATTGAAAGTGGCGCATTCAAGGGACATTCAACTTGGGTGTTGCGGCAAGCAATGCCAGACACACCGATTATTTCACTCTCACCCCGTCACCCTGAAAAATACTTGAAGAAGGGACCTGCTTATGTTGATGCTAACTGCACATATTTTGCTGGAAAGGACTTTGTAGATTTTGGAAGTGTTTCCTGGAATAGTGTGATGAAGCAACATGGAATTGATGATCTTAGCCGTGTTCTTGTATTTTTCGATGACCATCAGAATGAATTAAAGAGAATAAGTCAGGCTCTGAAAGCTGGCTTTCAACACCTTGTTTTTGAGGATAACTACGATACTGGCACAGGAGATCACTATTCTTTAAGGCAGATGTGCGATCAGTTCTATATTAGAGGAGGTGGGCACAGTTGCTTCAAGGACAGCGATGAAGCCAGAATCAGAGCAAAAAGGAAGTTGTTCTGGGAAAAGGCAGTGGATGTAGAAGAACTTTGTGGACCGTATGAGGCTTGGTGGGGTGTCCGAGGCTACATGCGTGATGATTTTAACCACAGCAATAGGGCTATCTCCCACGCAGAGCACCTCCAGAACAGCAGGTACTTGGAGTCGATTCTTGATGTGTATTGGGAGCTCCCTCCAGTTGCTGGCCCTTCTTTAACACATCAGACTAGATACGATCCCGCTCGTGTTTCGATCCCTATTGTGGAAGATGGCAGGTACGGTTTGTTCCAGCGACTTGGTTTAACTCGACTTGAGACTTCTGTATTTAATGGATACACACAAATGCTCCCGCCAGCTCCTCCTCTTCGTCTTCGCCTTCCACCAGCACCTCCCATCTTTCTCCACCTCCTGAAGATGCATGGAGTCTTGCTTATCAGCGGCTACTTCCTCGCTGGAAATCTCTTTCCCAAATCCTTAGCTGCAGCTCCCGCCAGCTCCTCCTCTTCGTCTTCGCCTTCCACCAGCACCTCCCATCTTTCTCCACCTCCTGAAGATGCATGGAGTCTTGCTTATCAGCGGCTACTTCCTCGCTGGAAATCTCTTTCCCAGTCTCACTTGTCGCCAATTCCAATTTCGATATCGAAAGTTAATCAAGTGGATGCGGCGCGGCTAGATATTGAAATGTCAGCCATGTTGAAAGAGCAGTTGGTTAAGGTCTTCGCTTTGATGAAGCCAGGAATGTTGTTTCAATATGAAGCAGAGCTTGATGCTTTTCTGGAGTTCCTTATTTGGCGCTTTTCAATTTGGGTAGACAAGCCCACACCAGGAATTGCTCTGATGAATCTGCGGTCCTTGGCAAGGCGAGCATGGCTTTTGATTCAGCGCATTGAAGGAATATACAAAGCTGCTGCATTTGGCAACTT
GCTCATATTTCTTTACACAGGAAGGTATAGAAATCTTGTCGAGAGAGTTCTCAGAGCCAGGCTTGTTTATGGGAGTCCTAATATGAACAGGGCTGTCAGCTTTGAGTATATGAATCGCCAGTTAGTGTGGAATGAATTCTCGGAAATGTTGCTGTTGCTTCTTCCTCTTCTAAATTCTTCCTCTGTTAGAAACTTTCTTCGTCCATTTTCCAAGGAGAAGTCCTCAAGCTCAGCCGAGGATGACAGTGCTTGTCCAATTTGCCTGGCAAGTCCTACGATTCCATTTCTGGCTTTGCCTTGTCAACACAGATACTGTTACTATTGCCTCCGAACACGATGCATGGCAGCTCAATCATTTAGATGTTCAAGATGCAGCGAGCCTGTGGTGGCCATGCAGCGGTATATCGAAGGCACTAGTGCAAATCCCAAACGGTAATCCCTGGGCAGAGGGAGCAAATACTATTTATAGTGATTATAATAAGAAAAAGGAAAATTGCTTCAATGTTGCAATTAATTTTTTTTTTTTTTGTTGCTTGAAGCATAAATGCTTACAGAAAATCAAATTCCCTTGAAAAGCATATATGTTGTATTGTGTAGTAGCCGTAGGGGGGCTCATTCTTTTTTAGTTTCCTAGTACATCTGTAGTATAGGTACTCGCTGATTATCTTGTGATTAATTCCCTGCAGTTACAAGAGGAAACATTATGCTGTCTTTCTTTCTTTTTCCTTTTTTTCTTATAAAGGCACAAAGAGAGGTAGGGGAGTATTTCAACCCAAGATAGAAATTTATATATTTTCAGTTCCTGGTTCGACCATGGATCAATTAGTTTAGTCATGTATCCTCAACCAAGAGGTTAGATGTTCGAAGCCTCCTCAAAACATGTTCTGGAACTCGAAATCTTGTGCCATCCCTGGTCGATAATAAATCTTGTTTGTTTGGTATAAAATCAATTTTACCTGTTTAAACTTCATAATGATAAACTTTTGTGCAAGTTTAATTCTAGTTTTGTTGTTCGCCTATATAATTATTTTAACCTGTTTTATTGATCCTCTTCTCTAGTTCTGTGTCTATAGTCTGTGTAACAGGGCAATTGGACCACTTTAGACGGTGTAGATTGAAAATCAACACTTTCTGACTTCTTTGGTCTTTCCATTTCCACAGAGATGGATAAATCTTGGAATGAGTCTAATCATCAACAACTTTAGAATATTGGATTGATGATGTTTGAAATATTGGTGACAGTGGTTGCACTTTCAAAACATGTAATAATGGTAGTTTTGCACTTTCCACGCTTGTTTATTTTTTTCTCAGGACTAGGTGGT

Coding sequence (CDS)

ATGGTTGCTCAAAAAAGGGAAGGAGATGAAGCCATGGGAAGCCCAAGATTCCAATTTCCAAGCGCCAAGCAAACTCTTCAACTCGACCATAGCAACGCCAGCATGAGCTTCGACACTATGAGAGTTCAATCCTCAACCCCTCAAAGCCCCACTTCCAATCGGAGGCTCGAGCGTGCCGTCTCTTCTCGCAGAGCCCCTCATCACAGCGGCGATTTTGATGACGACGATGACCATGATGTTTCCAAGACGAAGAAGACCAGATTTTCCTTATTCACCCACCGCCTTTCCATTTACTTCACTCGAATCGGACCCATTTGGGCCTGCCTTGCGCTCGTTGGTTTAATCCTTCTCATGATTTCGTCCTTCATATTCTTTCACTCCCGCAGATTTGTTTGCGTTTCGTCTTATGATCCTGTTTCCCGCTCTGGGTTCTTTGGCATGGATGGGCTCGATTCCGATTTCGGTTCTCTTGGCAGATCGAAACATGGAAAGACAGTTGAATGGACTGCAAAAGATTTACTAAAGGGCTTGGAAGAGTTTGTACCAATTTATGAGACTCGACCAATAAAGAACAACCTGTTTGGTATGGGCTTTGATCATAGCTTTGGCCTTTGGTTCATTGCTCGTTGGCTAAAACCAGATTTGATGATTGAAAGTGGCGCATTCAAGGGACATTCAACTTGGGTGTTGCGGCAAGCAATGCCAGACACACCGATTATTTCACTCTCACCCCGTCACCCTGAAAAATACTTGAAGAAGGGACCTGCTTATGTTGATGCTAACTGCACATATTTTGCTGGAAAGGACTTTGTAGATTTTGGAAGTGTTTCCTGGAATAGTGTGATGAAGCAACATGGAATTGATGATCTTAGCCGTGTTCTTGTATTTTTCGATGACCATCAGAATGAATTAAAGAGAATAAGTCAGGCTCTGAAAGCTGGCTTTCAACACCTTGTTTTTGAGGATAACTACGATACTGGCACAGGAGATCACTATTCTTTAAGGCAGATGTGCGATCAGTTCTATATTAGAGGAGGTGGGCACAGTTGCTTCAAGGACAGCGATGAAGCCAGAATCAGAGCAAAAAGGAAGTTGTTCTGGGAAAAGGCAGTGGATGTAGAAGAACTTTGTGGACCGTATGAGGCTTGGTGGGGTGTCCGAGGCTACATGCGTGATGATTTTAACCACAGCAATAGGGCTATCTCCCACGCAGAGCACCTCCAGAACAGCAGGTACTTGGAGTCGATTCTTGATGTGTATTGGGAGCTCCCTCCAGTTGCTGGCCCTTCTTTAACACATCAGACTAGATACGATCCCGCTCGTGTTTCGATCCCTATTGTGGAAGATGGCAGGTACGGTTTGTTCCAGCGACTTGGTTTAACTCGACTTGAGACTTCTGTATTTAATGGATACACACAAATGCTCCCGCCAGCTCCTCCTCTTCGTCTTCGCCTTCCACCAGCACCTCCCATCTTTCTCCACCTCCTGAAGATGCATGGAGTCTTGCTTATCAGCGGCTACTTCCTCGCTGGAAATCTCTTTCCCAAATCCTTAGCTGCAGCTCCCGCCAGCTCCTCCTCTTCGTCTTCGCCTTCCACCAGCACCTCCCATCTTTCTCCACCTCCTGAAGATGCATGGAGTCTTGCTTATCAGCGGCTACTTCCTCGCTGGAAATCTCTTTCCCAGTCTCACTTGTCGCCAATTCCAATTTCGATATCGAAAGTTAATCAAGTGGATGCGGCGCGGCTAGATATTGAAATGTCAGCCATGTTGAAAGAGCAGTTGGTTAAGGTCTTCGCTTTGATGAAGCCAGGAATGTTGTTTCAATATGAAGCAGAGCTTGATGCTTTTCTGGAGTTCCTTATTTGGCGCTTTTCAATTTGGGTAGACAAGCCCACACCAGGAATTGCTCTGATGAATCTGCGGTCCTTGGCAAGGCGAGCATGGCTTTTGATTCAGCGCATTGAAGGAATATACAAAGCTGCTGCATTTGGCAACTT
GCTCATATTTCTTTACACAGGAAGGTATAGAAATCTTGTCGAGAGAGTTCTCAGAGCCAGGCTTGTTTATGGGAGTCCTAATATGAACAGGGCTGTCAGCTTTGAGTATATGAATCGCCAGTTAGTGTGGAATGAATTCTCGGAAATGTTGCTGTTGCTTCTTCCTCTTCTAAATTCTTCCTCTGTTAGAAACTTTCTTCGTCCATTTTCCAAGGAGAAGTCCTCAAGCTCAGCCGAGGATGACAGTGCTTGTCCAATTTGCCTGGCAAGTCCTACGATTCCATTTCTGGCTTTGCCTTGTCAACACAGATACTGTTACTATTGCCTCCGAACACGATGCATGGCAGCTCAATCATTTAGATGTTCAAGATGCAGCGAGCCTGTGGTGGCCATGCAGCGGTATATCGAAGGCACTAGTGCAAATCCCAAACGGTAA

Protein sequence

MVAQKREGDEAMGSPRFQFPSAKQTLQLDHSNASMSFDTMRVQSSTPQSPTSNRRLERAVSSRRAPHHSGDFDDDDDHDVSKTKKTRFSLFTHRLSIYFTRIGPIWACLALVGLILLMISSFIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDFGSLGRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARWLKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDFVDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKAGFQHLVFEDNYDTGTGDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYMRDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDGRYGLFQRLGLTRLETSVFNGYTQMLPPAPPLRLRLPPAPPIFLHLLKMHGVLLISGYFLAGNLFPKSLAAAPASSSSSSSPSTSTSHLSPPPEDAWSLAYQRLLPRWKSLSQSHLSPIPISISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLEFLIWRFSIWVDKPTPGIALMNLRSLARRAWLLIQRIEGIYKAAAFGNLLIFLYTGRYRNLVERVLRARLVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSSVRNFLRPFSKEKSSSSAEDDSACPICLASPTIPFLALPCQHRYCYYCLRTRCMAAQSFRCSRCSEPVVAMQRYIEGTSANPKR
Homology
BLAST of Cp4.1LG02g14560 vs. ExPASy Swiss-Prot
Match: Q9CA86 (Peroxisome biogenesis protein 2 OS=Arabidopsis thaliana OX=3702 GN=PEX2 PE=1 SV=1)

HSP 1 Score: 382.5 bits (981), Expect = 1.2e-104
Identity = 206/330 (62.42%), Positives = 231/330 (70.00%), Query Frame = 0

Query: 539 SPPPEDAWSLAYQRLLPRWKSLSQSHLSPIPISISKVNQVDAARLDIEMSAMLKEQLVKV 598
           S P +DAW  +YQRLLP  +SL  S  S IP++IS+VNQ DAARLD+EMSAMLKEQLVKV
Sbjct: 4   STPADDAWIRSYQRLLPESQSLLASRRSVIPVAISRVNQFDAARLDVEMSAMLKEQLVKV 63

Query: 599 FALMKPGMLFQYEAELDAFLEFLIWRFSIWVDKPTPGIALMNL----------------- 658
           F LMKPGMLFQYE ELDAFLEFLIWRFSIWVDKPTPG ALMNL                 
Sbjct: 64  FTLMKPGMLFQYEPELDAFLEFLIWRFSIWVDKPTPGNALMNLRYRDERGVVAQHLGKVR 123

Query: 659 --------------------------------------------RSLARRAWLLIQRIEG 718
                                                       R LARR W L+QRIEG
Sbjct: 124 TGLEGPGLTSPQKIWYCVASVGGQYLFSRLQSFSAFRRWGDSEQRPLARRLWTLVQRIEG 183

Query: 719 IYKAAAFGNLLIFLYTGRYRNLVERVLRARLVYGSPNMNRAVSFEYMNRQLVWNEFSEML 778
           IYKAA+F NLL FLYTGRYRNL+E+ L+ARLVY SP+MNR+VSFEYMNRQLVWNEFSEML
Sbjct: 184 IYKAASFLNLLSFLYTGRYRNLIEKALKARLVYRSPHMNRSVSFEYMNRQLVWNEFSEML 243

Query: 779 LLLLPLLNSSSVRNFLRPFSKEKSSSSAEDDSACPICLASPTIPFLALPCQHRYCYYCLR 808
           LLLLPLLNSS+V+N L PF+K+KSSS+ ED   CPIC   P IPF+ALPCQHRYCYYC+R
Sbjct: 244 LLLLPLLNSSAVKNILSPFAKDKSSSTKEDTVTCPICQVDPAIPFIALPCQHRYCYYCIR 303
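
The percentages on each HSP summary line follow from the reported counts (for example, 206 identical positions over an alignment length of 330). The sketch below recomputes them as a consistency check. It is illustrative only: the function name `hsp_percentages` is hypothetical, and the regular expression assumes the summary-line layout shown on this page, accepting both the "Positives" and "Postives" spellings.

```python
import re

# Matches the HSP summary line as printed in this report, e.g.
# "Identity = 206/330 (62.42%), Positives = 231/330 (70.00%), Query Frame = 0"
HSP_STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return reported and recomputed identity/positive percentages."""
    match = HSP_STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity_reported": float(ident_pct),
        "identity_computed": round(100 * int(ident) / int(ident_len), 2),
        "positives_reported": float(pos_pct),
        "positives_computed": round(100 * int(pos) / int(pos_len), 2),
    }

if __name__ == "__main__":
    line = "Identity = 206/330 (62.42%), Positives = 231/330 (70.00%), Query Frame = 0"
    print(hsp_percentages(line))
```

For the Q9CA86 hit the recomputed values agree with the reported ones to two decimal places.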

BLAST of Cp4.1LG02g14560 vs. ExPASy Swiss-Prot
Match: Q75JQ3 (Peroxisome biogenesis factor 2 OS=Dictyostelium discoideum OX=44689 GN=pex2 PE=3 SV=1)

HSP 1 Score: 136.0 bits (341), Expect = 1.9e-30
Identity = 112/421 (26.60%), Positives = 176/421 (41.81%), Query Frame = 0

Query: 465 TSVFNGYTQMLPPAPPLRLRLPPAPPIFLHLLKMHGVLLISGYFLAGNLFPKSLAAAPAS 524
           TS        + P PP    LPP PPI   L   +   LI        +   +    P+S
Sbjct: 15  TSTTTTTNTTITPTPP----LPPPPPISNILDNNNNNNLIKNDIKNDKVAVSNSNVRPSS 74

Query: 525 SSSSSSPSTSTSHLSPPPEDAWSLAYQRLLPRWKSLSQS--HLSPIPISISKVNQVDAAR 584
           SS S   S             W+  Y     +   +++   ++     SI +V+Q+D+AR
Sbjct: 75  SSVSYENSD------------WNKVYNSEREKLHEVNKQILNIKRPSTSIVRVSQLDSAR 134

Query: 585 LDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLEFLIWRFSIWVDKPTPGIALMNL- 644
           LD E+  +L+ Q +K+F   KP  +  ++ E++  L+ +I++ SI+    T G  L NL 
Sbjct: 135 LDEEILDLLRSQFMKIFTFFKPNFIHNFQPEINLVLKSVIYKLSIFNLGTTYGNQLQNLT 194

Query: 645 -------------------------------------------------------RSLAR 704
                                                                    + +
Sbjct: 195 YRNEKAFDPIRGSDQLNKLTMRQKWLSGLINIGGEWLWTRINRYLINNNWSEHPPNDIRK 254

Query: 705 RAWLLIQRIEGIYKAAAFGNLLIFLYTGRYRNLVERVLRARLVYGSPNMNRAVSFEYMNR 764
           + W  +   E  YKA A  N L FL+ G+Y  LV R+L  RLVY  P ++R +SFEYMNR
Sbjct: 255 KFWNFLNFAESAYKALALLNFLTFLFNGKYVTLVNRILHMRLVYAHPTLSRNISFEYMNR 314

Query: 765 QLVWNEFSEMLLLLLPLLNSSSVRNFLRPFSKEKS---SSSAEDDSA------------- 802
            LVW+ F+E +L ++PL+N   +++FL     + S   SS   +++A             
Sbjct: 315 LLVWHGFTEFILFIMPLINIDRIKSFLYRLLVKTSFGNSSGNNNNTASNPLQQLQKQQLL 374

BLAST of Cp4.1LG02g14560 vs. ExPASy Swiss-Prot
Match: P24392 (Peroxisome biogenesis factor 2 OS=Rattus norvegicus OX=10116 GN=Pex2 PE=2 SV=1)

HSP 1 Score: 117.1 bits (292), Expect = 9.4e-25
Identity = 77/279 (27.60%), Positives = 131/279 (46.95%), Query Frame = 0

Query: 572 ISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLEFLIWRFSIWVDK 631
           + +++Q+DA  L+  +  ++  Q  + F   KPG+L ++E E+ AFL   +WRF+I+   
Sbjct: 14  VLRISQLDALELNKALEQLVWSQFTQCFHGFKPGLLARFEPEVKAFLWLFLWRFTIYSKN 73

Query: 632 PTPGIALMNL------------------------------RSLARRAWLLIQR------- 691
            T G +++N+                              R L  R + L +        
Sbjct: 74  ATVGQSVLNIQYKNDSSPNPVYQPPSKNQKLLYAVCTIGGRWLEERCYDLFRNRHLASFG 133

Query: 692 --------IEGIYKAAAFGNLLIFLYTGRYRNLVERVLRARLVYGSPNMNRAVSFEYMNR 751
                   + G+ K     N LIFL  G++  L ER+L    V+  P   R V FEYMNR
Sbjct: 134 KAKQCMNFVVGLLKLGELMNFLIFLQKGKFATLTERLLGIHSVFCKPQSMREVGFEYMNR 193

Query: 752 QLVWNEFSEMLLLLLPLLNSSSVRNFLRPFSKEKSSSSAEDDS------ACPICLASPTI 800
           +L+W+ F+E L+ LLPL+N   ++  L  +    +S++  D +       C +C   PT+
Sbjct: 194 ELLWHGFAEFLVFLLPLINIQKLKAKLSSWCIPLTSTAGSDSTLGSSGKECALCGEWPTM 253

BLAST of Cp4.1LG02g14560 vs. ExPASy Swiss-Prot
Match: P55098 (Peroxisome biogenesis factor 2 OS=Mus musculus OX=10090 GN=Pex2 PE=2 SV=1)

HSP 1 Score: 114.0 bits (284), Expect = 7.9e-24
Identity = 76/279 (27.24%), Positives = 130/279 (46.59%), Query Frame = 0

Query: 572 ISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLEFLIWRFSIWVDK 631
           + +++Q+DA  L+  +  ++  Q  + F   KPG+L ++E E+ AFL   +WRF+I+   
Sbjct: 14  VLRISQLDALELNKALEQLVWSQFTQCFHGFKPGLLARFEPEVKAFLWLFLWRFTIYSKN 73

Query: 632 PTPGIALMNL------------------------------RSLARRAWLLIQR------- 691
            T G +++N+                              R L  R + L +        
Sbjct: 74  ATVGQSVLNIQHKNDSSPNPVYQPPSKNQKLLYAVCTIGGRWLEERCYDLFRNRHLASFG 133

Query: 692 --------IEGIYKAAAFGNLLIFLYTGRYRNLVERVLRARLVYGSPNMNRAVSFEYMNR 751
                   + G+ K     N LIFL  G++  L ER+L    V+  P   R V FEYMNR
Sbjct: 134 KAKQCMNFVVGLLKLGELMNFLIFLQKGKFATLTERLLGIHSVFCKPQNMREVGFEYMNR 193

Query: 752 QLVWNEFSEMLLLLLPLLNSSSVRNFLRPFSKEKSSSSAEDDS------ACPICLASPTI 800
           +L+W+ F+E L+ LLPL+N   ++  L  +    + ++  D +       C +C   PT+
Sbjct: 194 ELLWHGFAEFLIFLLPLINIQKLKAKLSSWCTLCTGAAGHDSTLGSSGKECALCGEWPTM 253

BLAST of Cp4.1LG02g14560 vs. ExPASy Swiss-Prot
Match: Q06438 (Peroxisome biogenesis factor 2 OS=Cricetulus griseus OX=10029 GN=PEX2 PE=1 SV=1)

HSP 1 Score: 112.8 bits (281), Expect = 1.8e-23
Identity = 79/279 (28.32%), Positives = 134/279 (48.03%), Query Frame = 0

Query: 572 ISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLEFLIWRFSIWVDK 631
           + +++Q+DA  L+  +  ++  Q  + F   KPG+L ++E E+ A L   +WRF+I+   
Sbjct: 13  VLRISQLDALELNKALEQLVWSQFTQCFHGFKPGLLARFEPEVKACLWLFLWRFTIYSKN 72

Query: 632 PTPGIALMNLR--------------SLARRAWLLIQRIEGIY------------KAAAFG 691
            T G +++N++              S  ++ W  +  I G +              A+FG
Sbjct: 73  ATVGQSVLNIQYKNDFSSNSRYQPPSKNQKLWYAVCTIGGRWLEERCYDLFRNRHLASFG 132

Query: 692 -------------------NLLIFLYTGRYRNLVERVLRARLVYGSPNMNRAVSFEYMNR 751
                              N LIFL  G++  L ER+L    V+  P   R V F+YMNR
Sbjct: 133 KVKQCMNVMVGLLKLGELINFLIFLQKGKFATLTERLLGIHSVFCKPQNIREVGFDYMNR 192

Query: 752 QLVWNEFSEMLLLLLPLLN----SSSVRNFLRPFSKEKSSSSAEDDSA--CPICLASPTI 800
           +L+W+ F+E L+ LLPL+N     + + ++  P +   SS SA   S   C +C   PT+
Sbjct: 193 ELLWHGFAEFLIFLLPLINIQKFKAKLSSWCIPLTGAASSDSALASSGKECALCGEWPTM 252

BLAST of Cp4.1LG02g14560 vs. NCBI nr
Match: KAG6606758.1 (Peroxisome biogenesis protein 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1370 bits (3546), Expect = 0.0
Identity = 718/843 (85.17%), Positives = 731/843 (86.71%), Query Frame = 0

Query: 35  MSFDTMRVQSSTPQSPTSNRRLERAVSSRRAPHHSGDFDDDDDHDVSKTKKTRFSLFTHR 94
           MSFDTMRVQSSTPQSPTSNRRLERA SSRRAPHHSGDFDDDDDHDVSKTKK RFSLFTHR
Sbjct: 1   MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR 60

Query: 95  LSIYFTRIGPIWACLALVGLILLMISSFIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 154
           LSIYFTRIGPIWACLALVGLILLMISS IFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF
Sbjct: 61  LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 120

Query: 155 GSLG----RSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 214
           GSLG    RSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW
Sbjct: 121 GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 180

Query: 215 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 274
           LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF
Sbjct: 181 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 240

Query: 275 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKAGFQHLVFEDNYDTGTGD 334
           VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALK GFQHLVFEDNYDTGTGD
Sbjct: 241 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGD 300

Query: 335 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 394
           HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM
Sbjct: 301 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 360

Query: 395 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG 454
           RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG
Sbjct: 361 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG 420

Query: 455 RYGLFQRLGLTRLETSVFNGYTQMLPPAPPLRLRLPPAPPIFLHL-LKMHGVLLISGYFL 514
                    ++ L+ S+ N     LP      L LPP   +++++ L  HG L  S    
Sbjct: 421 --------SISDLKLSLAN----CLPLRSISDLLLPP---LYVNVKLLRHGPL-PSSLRA 480

Query: 515 AGNLFP---KSLAAAPASSSSSSSPSTSTSHLSPPPEDAWSLAYQRLLPRWKSLSQSHLS 574
           A + F    +SLAAAPASSSSSSSPSTS SHLSPPPEDAW+LAYQRLLPRWKSLSQSHLS
Sbjct: 481 ATHPFVMVIESLAAAPASSSSSSSPSTSASHLSPPPEDAWTLAYQRLLPRWKSLSQSHLS 540

Query: 575 PIPISISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLEFLIWRFS 634
           PIPISISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLEF IWRFS
Sbjct: 541 PIPISISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLEFFIWRFS 600

Query: 635 IWVDKPTPGIALMNLR-------------------------------------------- 694
           IWVDKPTPGIALMNLR                                            
Sbjct: 601 IWVDKPTPGIALMNLRYRDERAMEIPGKVRTGLEGPGLTVAQKIWYCVATVGGQYIWTRL 660

Query: 695 ---------------SLARRAWLLIQRIEGIYKAAAFGNLLIFLYTGRYRNLVERVLRAR 754
                          SLARRAWLLIQRIEGIYKAAAFGNLLIFLYTGRYRNLVERVLRAR
Sbjct: 661 QSFSAFRRWGDSEQRSLARRAWLLIQRIEGIYKAAAFGNLLIFLYTGRYRNLVERVLRAR 720

Query: 755 LVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSSVRNFLRPFSKEKSSSSAED 810
           LVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSSVRNFLRPFSKEKSSSSAED
Sbjct: 721 LVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSSVRNFLRPFSKEKSSSSAED 780

BLAST of Cp4.1LG02g14560 vs. NCBI nr
Match: KAG7018321.1 (Peroxisome biogenesis protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1313 bits (3398), Expect = 0.0
Identity = 683/863 (79.14%), Positives = 719/863 (83.31%), Query Frame = 0

Query: 21  SAKQT--LQLDHSNASMSFDT-MRVQSSTPQSPTSNRRLERAVSSRRAPHHSGDFDDDDD 80
           S KQT  L++DH+N +MSFDT MR QSSTPQSPTS R L+RA+SSRR PHHSGD DDDDD
Sbjct: 10  SPKQTVRLEIDHTNTAMSFDTTMRAQSSTPQSPTSKRMLDRALSSRRVPHHSGDLDDDDD 69

Query: 81  HD-VSKTKKTRFSLFTHRLSIYFTRIGPIWACLALVGLILLMISSFIFFHSRRFVCVSSY 140
            D VSKTKK  FS FTHRLS YF RIGPI ACLAL+ LILL+ISS IFFHSRRFVCVSSY
Sbjct: 70  DDDVSKTKKHNFSFFTHRLSNYFARIGPISACLALLALILLLISSLIFFHSRRFVCVSSY 129

Query: 141 DPVSRSGFFGMDGLDSDFGSLG----RSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNN 200
           D +SRSGFFG+DGLDSDFGSLG    RSKHGKTVEWT KDLLKGLEEFVPIYETRPI+NN
Sbjct: 130 DHISRSGFFGVDGLDSDFGSLGVPWCRSKHGKTVEWTTKDLLKGLEEFVPIYETRPIQNN 189

Query: 201 LFGMGFDHSFGLWFIARWLKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLK 260
           ++GMGFDHSFGLWFIARWLKPDLMIESGAFKGHSTWVLRQAMPDT IISLSPRHPEKYLK
Sbjct: 190 MYGMGFDHSFGLWFIARWLKPDLMIESGAFKGHSTWVLRQAMPDTAIISLSPRHPEKYLK 249

Query: 261 KGPAYVDANCTYFAGKDFVDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALK 320
           KGPAYVDANCTYFAGKDFVDFGSV+W  VMK+HGIDDLSRVLVFFDDHQNELKRI QA+K
Sbjct: 250 KGPAYVDANCTYFAGKDFVDFGSVAWKKVMKEHGIDDLSRVLVFFDDHQNELKRIKQAVK 309

Query: 321 AGFQHLVFEDNYDTGTGDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVD 380
           AGFQHLVFEDNYDTGTGDHYSLRQMCDQFYI+GGGHSCFKDSDEARIRAKRKLFWEKAVD
Sbjct: 310 AGFQHLVFEDNYDTGTGDHYSLRQMCDQFYIKGGGHSCFKDSDEARIRAKRKLFWEKAVD 369

Query: 381 VEELCGPYEAWWGVRGYMRDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLT 440
           +EELCGPYE+WWGVRGYMRDDFNHSNRAISHAEH QNSRYLESILDVYWELPPVAGPSLT
Sbjct: 370 IEELCGPYESWWGVRGYMRDDFNHSNRAISHAEHFQNSRYLESILDVYWELPPVAGPSLT 429

Query: 441 HQTRYDPARVSIPIVEDGRYGLFQRLGLTRLETSVFNGYTQMLPPAPPLRLRLPPAPPIF 500
           HQTRYDPARVS PIVEDGRYGLF+RLGL +LETSVFNGYTQM                ++
Sbjct: 430 HQTRYDPARVSSPIVEDGRYGLFRRLGLAQLETSVFNGYTQM----------------VY 489

Query: 501 LHLLKMH----GVLLISGYFLAGNLFPKSLAAAPASSSSSSSPSTSTSHLSPPPEDAWSL 560
           +  L  H    G    S    A  +F   L A  A+SS++S PSTS  +L PPPEDAWS 
Sbjct: 490 IQFLFRHYQHDGAPSSSPLLSATLIFVMVLEALAAASSTASPPSTSNFNLPPPPEDAWSR 549

Query: 561 AYQRLLPRWKSLSQSHLSPIPISISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLF 620
           AYQRL PRWKSLS SHLS IPISISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLF
Sbjct: 550 AYQRLHPRWKSLSHSHLSAIPISISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLF 609

Query: 621 QYEAELDAFLEFLIWRFSIWVDKPTPGIALMNLR-------------------------- 680
           QYEAELDAFLEFLIWRFSIWVDKPTPGI+LMNLR                          
Sbjct: 610 QYEAELDAFLEFLIWRFSIWVDKPTPGISLMNLRYRDERALEVPGKVRTGLEGPGLTVAQ 669

Query: 681 ---------------------------------SLARRAWLLIQRIEGIYKAAAFGNLLI 740
                                            SLARRAWLLIQRIEGIYKAAAFGNLLI
Sbjct: 670 KIWYCVATVGGQYMWTRLQSFSAFRRWGDSEQRSLARRAWLLIQRIEGIYKAAAFGNLLI 729

Query: 741 FLYTGRYRNLVERVLRARLVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSSV 800
           FLYTGRYRNLVERVLRARLVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSS+V
Sbjct: 730 FLYTGRYRNLVERVLRARLVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSTV 789

Query: 801 RNFLRPFSKEKSSSSAEDDSACPICLASPTIPFLALPCQHRYCYYCLRTRCMAAQSFRCS 811
           RNFLRPFSK+K SSSA+DDSACPICLA+PTIPFLALPCQHRYCYYCLRTRCMAAQSFRCS
Sbjct: 790 RNFLRPFSKDKPSSSAKDDSACPICLANPTIPFLALPCQHRYCYYCLRTRCMAAQSFRCS 849

BLAST of Cp4.1LG02g14560 vs. NCBI nr
Match: KAG6581888.1 (Peroxisome biogenesis protein 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1313 bits (3397), Expect = 0.0
Identity = 683/863 (79.14%), Positives = 719/863 (83.31%), Query Frame = 0

Query: 21  SAKQT--LQLDHSNASMSFDT-MRVQSSTPQSPTSNRRLERAVSSRRAPHHSGDFDDDDD 80
           S KQT  L++DH+N +MSFDT MR QSSTPQSPTS R L+RA+SSRR PHHSGD DDDDD
Sbjct: 10  SPKQTVRLEIDHTNTAMSFDTTMRAQSSTPQSPTSKRMLDRALSSRRVPHHSGDLDDDDD 69

Query: 81  HD-VSKTKKTRFSLFTHRLSIYFTRIGPIWACLALVGLILLMISSFIFFHSRRFVCVSSY 140
            D VSKTKK  FS FTHRLS YF RIGPI ACLAL+ LILL+ISS IFFHSRRFVCVSSY
Sbjct: 70  DDDVSKTKKHNFSFFTHRLSNYFARIGPISACLALLALILLLISSLIFFHSRRFVCVSSY 129

Query: 141 DPVSRSGFFGMDGLDSDFGSLG----RSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNN 200
           D +SRSGFFG+DGLDSDFGSLG    RSKHGKTVEWT KDLLKGLEEFVPIYETRPI+NN
Sbjct: 130 DHISRSGFFGVDGLDSDFGSLGVPWCRSKHGKTVEWTTKDLLKGLEEFVPIYETRPIQNN 189

Query: 201 LFGMGFDHSFGLWFIARWLKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLK 260
           ++GMGFDHSFGLWFIARWLKPDLMIESGAFKGHSTWVLRQAMPDT IISLSPRHPEKYLK
Sbjct: 190 MYGMGFDHSFGLWFIARWLKPDLMIESGAFKGHSTWVLRQAMPDTAIISLSPRHPEKYLK 249

Query: 261 KGPAYVDANCTYFAGKDFVDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALK 320
           KGPAYVDANCTYFAGKDFVDFGSV+W  VMK+HGIDDLSRVLVFFDDHQNELKRI QA+K
Sbjct: 250 KGPAYVDANCTYFAGKDFVDFGSVAWKKVMKEHGIDDLSRVLVFFDDHQNELKRIKQAVK 309

Query: 321 AGFQHLVFEDNYDTGTGDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVD 380
           AGFQHLVFEDNYDTGTGDHYSLRQMCDQFYI+GGGHSCFKDSDEARIRAKRKLFWEKAVD
Sbjct: 310 AGFQHLVFEDNYDTGTGDHYSLRQMCDQFYIKGGGHSCFKDSDEARIRAKRKLFWEKAVD 369

Query: 381 VEELCGPYEAWWGVRGYMRDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLT 440
           +EELCGPYE+WWGVRGYMRDDFNHSNRAISHAEH QNSRYLESILDVYWELPPVAGPSLT
Sbjct: 370 IEELCGPYESWWGVRGYMRDDFNHSNRAISHAEHFQNSRYLESILDVYWELPPVAGPSLT 429

Query: 441 HQTRYDPARVSIPIVEDGRYGLFQRLGLTRLETSVFNGYTQMLPPAPPLRLRLPPAPPIF 500
           HQTRYDPARVS PIVEDGRYGLF+RLGL +LETSVFNGYTQM                ++
Sbjct: 430 HQTRYDPARVSSPIVEDGRYGLFRRLGLAQLETSVFNGYTQM----------------VY 489

Query: 501 LHLLKMH----GVLLISGYFLAGNLFPKSLAAAPASSSSSSSPSTSTSHLSPPPEDAWSL 560
           +  L  H    G    S    A  +F   L A  A+SS++S PSTS  +L PPPEDAWS 
Sbjct: 490 IQFLFRHYQHDGAPSSSPLLSATLIFVMVLEALAAASSTASPPSTSNFNLPPPPEDAWSR 549

Query: 561 AYQRLLPRWKSLSQSHLSPIPISISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLF 620
           AYQRL PRWKSLS SHLS IPISISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLF
Sbjct: 550 AYQRLHPRWKSLSHSHLSAIPISISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLF 609

Query: 621 QYEAELDAFLEFLIWRFSIWVDKPTPGIALMNLR-------------------------- 680
           QYEAELDAFLEFLIWRFSIWVDKPTPGI+LMNLR                          
Sbjct: 610 QYEAELDAFLEFLIWRFSIWVDKPTPGISLMNLRYRDERALEVPGKVRTGLEGPGLTVAQ 669

Query: 681 ---------------------------------SLARRAWLLIQRIEGIYKAAAFGNLLI 740
                                            SLARRAWLLIQRIEGIYKAAAFGNLLI
Sbjct: 670 KIWYCVATVGGQYMWTRLQSFSAFRRWGDSEQRSLARRAWLLIQRIEGIYKAAAFGNLLI 729

Query: 741 FLYTGRYRNLVERVLRARLVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSSV 800
           FLYTGRYRNLVERVLRARLVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNS +V
Sbjct: 730 FLYTGRYRNLVERVLRARLVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSFTV 789

Query: 801 RNFLRPFSKEKSSSSAEDDSACPICLASPTIPFLALPCQHRYCYYCLRTRCMAAQSFRCS 811
           RNFLRPFSK+K SSSAEDDSACPICLA+PTIPFLALPCQHRYCYYCLRTRCMAAQSFRCS
Sbjct: 790 RNFLRPFSKDKPSSSAEDDSACPICLANPTIPFLALPCQHRYCYYCLRTRCMAAQSFRCS 849
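
The percentages on the statistics line of each HSP are the identical and positive-scoring column counts divided by the alignment length. A minimal Python sketch for reading that line and recomputing the percentages follows. The names `HSP_STATS`, `parse_hsp_stats` and `percent` are hypothetical helpers introduced here for illustration and are not part of any BLAST tool or of this database.

```python
import re

# Hypothetical helper pattern for the statistics line printed under each
# "HSP n Score: ..." header. It tolerates a dropped "i" in "Positives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts, alignment length and reported percentages of one HSP."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, length, ident_pct, pos, pos_length, pos_pct = match.groups()
    if length != pos_length:
        raise ValueError("identity and positives use different alignment lengths")
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(length),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


def percent(count, length):
    """Recompute a percentage from raw counts, rounded to two decimals."""
    return round(100.0 * count / length, 2)


line = "Identity = 683/863 (79.14%), Positives = 719/863 (83.31%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(percent(stats["identical"], stats["length"]))
print(percent(stats["positives"], stats["length"]))
```

Comparing the recomputed values with the reported ones is a quick consistency check when the statistics are copied into other tables.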

BLAST of Cp4.1LG02g14560 vs. NCBI nr
Match: XP_023524868.1 (uncharacterized protein LOC111788671 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 910 bits (2352), Expect = 0.0
Identity = 440/445 (98.88%), Positives = 441/445 (99.10%), Query Frame = 0

Query: 35  MSFDTMRVQSSTPQSPTSNRRLERAVSSRRAPHHSGDFDDDDDHDVSKTKKTRFSLFTHR 94
           MSFDTMRVQSSTPQSPTSNRRLERAVSSRRAPHHSGDFDDDDDHDVSKTKKTRFSLFTHR
Sbjct: 1   MSFDTMRVQSSTPQSPTSNRRLERAVSSRRAPHHSGDFDDDDDHDVSKTKKTRFSLFTHR 60

Query: 95  LSIYFTRIGPIWACLALVGLILLMISSFIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 154
           LSIYFTRIGPIWACLALVGLILLMISSFIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF
Sbjct: 61  LSIYFTRIGPIWACLALVGLILLMISSFIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 120

Query: 155 GSLG----RSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 214
           GSLG    RSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW
Sbjct: 121 GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 180

Query: 215 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 274
           LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF
Sbjct: 181 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 240

Query: 275 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKAGFQHLVFEDNYDTGTGD 334
           VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKAGFQHLVFEDNYDTGTGD
Sbjct: 241 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKAGFQHLVFEDNYDTGTGD 300

Query: 335 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 394
           HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM
Sbjct: 301 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 360

Query: 395 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG 454
           RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG
Sbjct: 361 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG 420

Query: 455 RYGLFQRLGLTRLETSVFNGYTQML 475
           RYGLFQRLGLTRLETSVFNGYTQM+
Sbjct: 421 RYGLFQRLGLTRLETSVFNGYTQMV 445

BLAST of Cp4.1LG02g14560 vs. NCBI nr
Match: XP_022949517.1 (uncharacterized protein LOC111452845 [Cucurbita moschata])

HSP 1 Score: 900 bits (2327), Expect = 0.0
Identity = 435/445 (97.75%), Positives = 436/445 (97.98%), Query Frame = 0

Query: 35  MSFDTMRVQSSTPQSPTSNRRLERAVSSRRAPHHSGDFDDDDDHDVSKTKKTRFSLFTHR 94
           MSFDTMRVQSSTPQSPTSNRRLERA SSRRAPHHSGDFDDDDDHDVSKTKK RFSLFTHR
Sbjct: 1   MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR 60

Query: 95  LSIYFTRIGPIWACLALVGLILLMISSFIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 154
           LSIYFTRIGPIWACLALVGLILLMISS IFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF
Sbjct: 61  LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 120

Query: 155 GSLG----RSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 214
           GSLG    RSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW
Sbjct: 121 GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 180

Query: 215 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 274
           LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF
Sbjct: 181 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 240

Query: 275 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKAGFQHLVFEDNYDTGTGD 334
           VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALK GFQHLVFEDNYDTGTGD
Sbjct: 241 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGD 300

Query: 335 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 394
           HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM
Sbjct: 301 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 360

Query: 395 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG 454
           RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG
Sbjct: 361 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG 420

Query: 455 RYGLFQRLGLTRLETSVFNGYTQML 475
           RYGLFQRLGLTRLET VFNGYTQM+
Sbjct: 421 RYGLFQRLGLTRLETCVFNGYTQMV 445

BLAST of Cp4.1LG02g14560 vs. ExPASy TrEMBL
Match: A0A6J1GC96 (uncharacterized protein LOC111452845 OS=Cucurbita moschata OX=3662 GN=LOC111452845 PE=4 SV=1)

HSP 1 Score: 900 bits (2327), Expect = 0.0
Identity = 435/445 (97.75%), Positives = 436/445 (97.98%), Query Frame = 0

Query: 35  MSFDTMRVQSSTPQSPTSNRRLERAVSSRRAPHHSGDFDDDDDHDVSKTKKTRFSLFTHR 94
           MSFDTMRVQSSTPQSPTSNRRLERA SSRRAPHHSGDFDDDDDHDVSKTKK RFSLFTHR
Sbjct: 1   MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR 60

Query: 95  LSIYFTRIGPIWACLALVGLILLMISSFIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 154
           LSIYFTRIGPIWACLALVGLILLMISS IFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF
Sbjct: 61  LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 120

Query: 155 GSLG----RSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 214
           GSLG    RSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW
Sbjct: 121 GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 180

Query: 215 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 274
           LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF
Sbjct: 181 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 240

Query: 275 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKAGFQHLVFEDNYDTGTGD 334
           VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALK GFQHLVFEDNYDTGTGD
Sbjct: 241 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGD 300

Query: 335 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 394
           HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM
Sbjct: 301 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 360

Query: 395 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG 454
           RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG
Sbjct: 361 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG 420

Query: 455 RYGLFQRLGLTRLETSVFNGYTQML 475
           RYGLFQRLGLTRLET VFNGYTQM+
Sbjct: 421 RYGLFQRLGLTRLETCVFNGYTQMV 445

BLAST of Cp4.1LG02g14560 vs. ExPASy TrEMBL
Match: A0A6J1KCQ5 (uncharacterized protein LOC111492689 OS=Cucurbita maxima OX=3661 GN=LOC111492689 PE=4 SV=1)

HSP 1 Score: 897 bits (2319), Expect = 0.0
Identity = 433/445 (97.30%), Positives = 436/445 (97.98%), Query Frame = 0

Query: 35  MSFDTMRVQSSTPQSPTSNRRLERAVSSRRAPHHSGDFDDDDDHDVSKTKKTRFSLFTHR 94
           MSFDTMRVQSSTPQSPTSNRRLERA SSRRAPHHSGDFDDDDDHDVSKTKK RFSLFTHR
Sbjct: 1   MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR 60

Query: 95  LSIYFTRIGPIWACLALVGLILLMISSFIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 154
           LSIYFTRIGPIWACLALVGLILLMISS IFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF
Sbjct: 61  LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 120

Query: 155 GSLG----RSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 214
           GSLG    RSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW
Sbjct: 121 GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 180

Query: 215 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 274
           LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF
Sbjct: 181 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 240

Query: 275 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKAGFQHLVFEDNYDTGTGD 334
           VDFGS+SWNSVMKQHGI+DLS VLVFFDDHQNELKRISQALKAGFQHLVFEDNYDTGTGD
Sbjct: 241 VDFGSISWNSVMKQHGINDLSHVLVFFDDHQNELKRISQALKAGFQHLVFEDNYDTGTGD 300

Query: 335 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 394
           HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM
Sbjct: 301 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 360

Query: 395 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG 454
           RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVS PIVEDG
Sbjct: 361 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSSPIVEDG 420

Query: 455 RYGLFQRLGLTRLETSVFNGYTQML 475
           RYGLFQRLGLTRLETSVFNGYTQM+
Sbjct: 421 RYGLFQRLGLTRLETSVFNGYTQMV 445

BLAST of Cp4.1LG02g14560 vs. ExPASy TrEMBL
Match: A0A1S3BHM5 (uncharacterized protein LOC103489955 OS=Cucumis melo OX=3656 GN=LOC103489955 PE=4 SV=1)

HSP 1 Score: 851 bits (2198), Expect = 2.21e-304
Identity = 414/468 (88.46%), Positives = 435/468 (92.95%), Query Frame = 0

Query: 16  RFQFPSAKQTL--QLDHSNASMSFDTMRVQSST-PQSPTSNRRLERAVSSRRAPHHSGDF 75
           +F+ PS K TL  +LDH+N  MSFDTMRVQSST PQSPTS+R LERA+SSRR PHHSGD 
Sbjct: 5   QFKIPSPKNTLRLELDHTNTGMSFDTMRVQSSTTPQSPTSSRMLERALSSRRVPHHSGDI 64

Query: 76  DDDDDHD-VSKTKKTRFSLFTHRLSIYFTRIGPIWACLALVGLILLMISSFIFFHSRRFV 135
           DDDDD D VSKTKK  FS FTHR+S YF RIGPIWACLALV LILL+ISS IFFHSRRFV
Sbjct: 65  DDDDDDDDVSKTKKHNFSFFTHRISNYFVRIGPIWACLALVALILLLISSLIFFHSRRFV 124

Query: 136 CVSSYDPVSRSGFFGMDGLDSDFGSLG----RSKHGKTVEWTAKDLLKGLEEFVPIYETR 195
           CVSSYDPVSRSGFFGMDGLDSDFGSLG    RSK GKTVEWTAKDLLK LEEFVPIYETR
Sbjct: 125 CVSSYDPVSRSGFFGMDGLDSDFGSLGVPWCRSKQGKTVEWTAKDLLKALEEFVPIYETR 184

Query: 196 PIKNNLFGMGFDHSFGLWFIARWLKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHP 255
           PIKNN++GMGFDHSFGLWFIARWLKPDLMIESGAFKGHSTWVLRQAMP T IISLSPRHP
Sbjct: 185 PIKNNMYGMGFDHSFGLWFIARWLKPDLMIESGAFKGHSTWVLRQAMPYTRIISLSPRHP 244

Query: 256 EKYLKKGPAYVDANCTYFAGKDFVDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRI 315
           EKYLKKGPAYVDANCTYFAGKDFVDFGSV+W +VMK+HGIDDLS+VLVFFDDHQNELKRI
Sbjct: 245 EKYLKKGPAYVDANCTYFAGKDFVDFGSVAWKNVMKEHGIDDLSQVLVFFDDHQNELKRI 304

Query: 316 SQALKAGFQHLVFEDNYDTGTGDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFW 375
            QAL AGF+HLVFEDNYDTGTGDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFW
Sbjct: 305 KQALNAGFRHLVFEDNYDTGTGDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFW 364

Query: 376 EKAVDVEELCGPYEAWWGVRGYMRDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVA 435
           EKAVD+EELCGPYE+WWGV+GYMRDDFNHSNRAISHAEH QNSRYLESILDVYWE+PPVA
Sbjct: 365 EKAVDIEELCGPYESWWGVQGYMRDDFNHSNRAISHAEHFQNSRYLESILDVYWEVPPVA 424

Query: 436 GPSLTHQTRYDPARVSIPIVEDGRYGLFQRLGLTRLETSVFNGYTQML 475
           GPSLTHQTRYDPARVS PIVEDGRYGLFQRLGLT+LETSVFNGYTQM+
Sbjct: 425 GPSLTHQTRYDPARVSSPIVEDGRYGLFQRLGLTQLETSVFNGYTQMV 472

BLAST of Cp4.1LG02g14560 vs. ExPASy TrEMBL
Match: A0A0A0LAU9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G469200 PE=4 SV=1)

HSP 1 Score: 850 bits (2197), Expect = 2.91e-304
Identity = 412/467 (88.22%), Positives = 434/467 (92.93%), Query Frame = 0

Query: 16  RFQFPSAKQTL--QLDHSNASMSFDTMRVQSST-PQSPTSNRRLERAVSSRRAPHHSGDF 75
           +F+ PS K TL  +LDH+N  MSFDTMRVQSST PQSPTS+R LERA+SSRR PHH+GD 
Sbjct: 5   QFKIPSPKNTLRLELDHTNTGMSFDTMRVQSSTTPQSPTSSRMLERALSSRRVPHHTGDI 64

Query: 76  DDDDDHDVSKTKKTRFSLFTHRLSIYFTRIGPIWACLALVGLILLMISSFIFFHSRRFVC 135
           DDDDD DVSKTKK  FS FTHR+S YF RIGPIWACLA+V LILL+I S IFFHSRRFVC
Sbjct: 65  DDDDD-DVSKTKKHHFSFFTHRISNYFVRIGPIWACLAIVALILLLIFSLIFFHSRRFVC 124

Query: 136 VSSYDPVSRSGFFGMDGLDSDFGSLG----RSKHGKTVEWTAKDLLKGLEEFVPIYETRP 195
           VSSYDPVSRSGFFGMDGLDSDFGSLG    RSKHGKTVEWTAKDLLK LEEFVPIYETRP
Sbjct: 125 VSSYDPVSRSGFFGMDGLDSDFGSLGVPWCRSKHGKTVEWTAKDLLKALEEFVPIYETRP 184

Query: 196 IKNNLFGMGFDHSFGLWFIARWLKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPE 255
           IKNN++GMGFDHSFGLWFIARWLKPDL+IESGAFKGHSTWVLRQAMP T IISLSPRHPE
Sbjct: 185 IKNNMYGMGFDHSFGLWFIARWLKPDLLIESGAFKGHSTWVLRQAMPYTRIISLSPRHPE 244

Query: 256 KYLKKGPAYVDANCTYFAGKDFVDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRIS 315
           KYLKKGPAYVDANCTYFAGKDFVDFGSV+W +VMK+HGI+DLSRVLVFFDDHQNELKRI 
Sbjct: 245 KYLKKGPAYVDANCTYFAGKDFVDFGSVAWKNVMKEHGINDLSRVLVFFDDHQNELKRIK 304

Query: 316 QALKAGFQHLVFEDNYDTGTGDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWE 375
           QAL AGFQHLVFEDNYDTGTGDHYSLRQMCDQFYIRGGGHSCFKDSDEARIR KRKLFWE
Sbjct: 305 QALNAGFQHLVFEDNYDTGTGDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRGKRKLFWE 364

Query: 376 KAVDVEELCGPYEAWWGVRGYMRDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAG 435
           KAVD+EELCGPYE+WWGV+GYMRDDFNHSNRAISHAEH QNSRYLESILDVYWE+PPVAG
Sbjct: 365 KAVDIEELCGPYESWWGVQGYMRDDFNHSNRAISHAEHFQNSRYLESILDVYWEVPPVAG 424

Query: 436 PSLTHQTRYDPARVSIPIVEDGRYGLFQRLGLTRLETSVFNGYTQML 475
           PSLTHQTRYDPARVS PIVEDGRYGLFQRLGLTRLETSVFNGYTQM+
Sbjct: 425 PSLTHQTRYDPARVSSPIVEDGRYGLFQRLGLTRLETSVFNGYTQMV 470

BLAST of Cp4.1LG02g14560 vs. ExPASy TrEMBL
Match: A0A6J1GVV2 (uncharacterized protein LOC111457951 OS=Cucurbita moschata OX=3662 GN=LOC111457951 PE=4 SV=1)

HSP 1 Score: 837 bits (2163), Expect = 4.09e-299
Identity = 409/463 (88.34%), Positives = 430/463 (92.87%), Query Frame = 0

Query: 21  SAKQT--LQLDHSNASMSFDT-MRVQSSTPQSPTSNRRLERAVSSRRAPHHSGDFDDDDD 80
           S KQT  L++DH+N +MSFDT MR QSSTPQSPTS R L+RA+SSRR PHHSGD DDDDD
Sbjct: 10  SPKQTVRLEIDHTNTAMSFDTTMRAQSSTPQSPTSKRMLDRALSSRRVPHHSGDLDDDDD 69

Query: 81  HD-VSKTKKTRFSLFTHRLSIYFTRIGPIWACLALVGLILLMISSFIFFHSRRFVCVSSY 140
            D VSKTKK  FS FTHRLS YF RIGPI ACLAL+ LILL+ISS IFFHSRRFVCVSSY
Sbjct: 70  DDDVSKTKKHNFS-FTHRLSNYFARIGPISACLALLALILLLISSLIFFHSRRFVCVSSY 129

Query: 141 DPVSRSGFFGMDGLDSDFGSLG----RSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNN 200
           D +SRSGFFG+DGLDSDFGSLG    RSKHGKTVEWT KDLLKGLEEFVPIYETRPIKNN
Sbjct: 130 DHISRSGFFGVDGLDSDFGSLGVPWCRSKHGKTVEWTTKDLLKGLEEFVPIYETRPIKNN 189

Query: 201 LFGMGFDHSFGLWFIARWLKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLK 260
           ++GMGFDHSFGLWFIARWLKPDLMIESGAFKGHSTWVLRQAMPDT IISLSPRHPEKYLK
Sbjct: 190 MYGMGFDHSFGLWFIARWLKPDLMIESGAFKGHSTWVLRQAMPDTAIISLSPRHPEKYLK 249

Query: 261 KGPAYVDANCTYFAGKDFVDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALK 320
           KGPAYVDANCTYFAGKDFVDFGSV+W  VMK+HGIDDLSRVLVFFDDHQNELKRI QA+K
Sbjct: 250 KGPAYVDANCTYFAGKDFVDFGSVAWKKVMKEHGIDDLSRVLVFFDDHQNELKRIKQAVK 309

Query: 321 AGFQHLVFEDNYDTGTGDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVD 380
           AGFQHLVFEDNYDTGTGDHYSLRQMCDQFYI+GGGHSCFKDSDEARIRAKRKLFWEKAVD
Sbjct: 310 AGFQHLVFEDNYDTGTGDHYSLRQMCDQFYIKGGGHSCFKDSDEARIRAKRKLFWEKAVD 369

Query: 381 VEELCGPYEAWWGVRGYMRDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLT 440
           +EELCGPYE+WWGVRGYMRDDFNHSNRAISHAEH QNSRYLESILDVYWELPPVAGPSLT
Sbjct: 370 IEELCGPYESWWGVRGYMRDDFNHSNRAISHAEHFQNSRYLESILDVYWELPPVAGPSLT 429

Query: 441 HQTRYDPARVSIPIVEDGRYGLFQRLGLTRLETSVFNGYTQML 475
           HQTRYDPARVS PIVEDGRYGLF+RLGL +LETSVFNGYTQM+
Sbjct: 430 HQTRYDPARVSSPIVEDGRYGLFRRLGLAQLETSVFNGYTQMV 471

BLAST of Cp4.1LG02g14560 vs. TAIR 10
Match: AT3G16200.1 (unknown protein; Has 97 Blast hits to 97 proteins in 15 species: Archae - 0; Bacteria - 8; Metazoa - 0; Fungi - 0; Plants - 36; Viruses - 0; Other Eukaryotes - 53 (source: NCBI BLink). )

HSP 1 Score: 664.5 bits (1713), Expect = 1.1e-190
Identity = 318/441 (72.11%), Positives = 370/441 (83.90%), Query Frame = 0

Query: 44  SSTPQSPTSNRRLERAVSSRRAPHHSGDF--DDDDDHDVSKTKKTRFSLFTHRLSIYFTR 103
           S +P++PT+   L+RA+SSRR PH   D     +   D SKTK+    L     S + +R
Sbjct: 14  SQSPKTPTT--MLDRALSSRR-PHSDADLSASGESGTDESKTKRPHIYLLA---SNFLSR 73

Query: 104 IGPIW---ACLALVGLILLMISSFIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDFGSLG 163
           IG  W     LAL+ L+LL + S + FHS  FVC+S +DP +R GFFG+DGL+SDFG+LG
Sbjct: 74  IGHQWWPCLILALLFLVLLFLIS-VAFHSHSFVCISRFDPAARIGFFGLDGLESDFGALG 133

Query: 164 ----RSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARWLKPD 223
               RSKHGK VEWT+KDLLKGLEEFVPIYETRPIKNN++GMGFDHSFGLWF+ARWLKPD
Sbjct: 134 VPWCRSKHGKEVEWTSKDLLKGLEEFVPIYETRPIKNNMYGMGFDHSFGLWFMARWLKPD 193

Query: 224 LMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDFVDFG 283
           +MIESGAFKGHSTWVLRQAMPDTP+ISL+PRHPEKYL+KGPAYVD NCTYFAGKDFVDFG
Sbjct: 194 MMIESGAFKGHSTWVLRQAMPDTPMISLTPRHPEKYLRKGPAYVDGNCTYFAGKDFVDFG 253

Query: 284 SVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKAGFQHLVFEDNYDTGTGDHYSL 343
           SV W +V+++HGI DLSRV+VFFDDHQNELKR+ QALKAGF+HL+FEDNYDTGTGDHYSL
Sbjct: 254 SVDWKNVLRKHGITDLSRVIVFFDDHQNELKRLKQALKAGFRHLIFEDNYDTGTGDHYSL 313

Query: 344 RQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYMRDDF 403
           RQ+CDQ +IRGGGHSCFKDSDEAR+R+KRK FWEKAVD EELCGP E WWGV+G MRDDF
Sbjct: 314 RQICDQSHIRGGGHSCFKDSDEARMRSKRKKFWEKAVDTEELCGPGETWWGVKGEMRDDF 373

Query: 404 NHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDGRYGL 463
           NH+N  IS+ +H QNSRY+ESILDVYWELPPVAGPSLTHQ+RYDPAR + PIV DG++ L
Sbjct: 374 NHTNTPISYNQHFQNSRYVESILDVYWELPPVAGPSLTHQSRYDPARATPPIVADGKHRL 433

Query: 464 FQRLGLTRLETSVFNGYTQML 476
           FQR+GL RL+ SVFNGYTQM+
Sbjct: 434 FQRIGLGRLDKSVFNGYTQMV 447

BLAST of Cp4.1LG02g14560 vs. TAIR 10
Match: AT1G79810.1 (Pex2/Pex12 N-terminal domain-containing protein / zinc finger (C3HC4-type RING finger) family protein )

HSP 1 Score: 382.5 bits (981), Expect = 8.5e-106
Identity = 206/330 (62.42%), Positives = 231/330 (70.00%), Query Frame = 0

Query: 539 SPPPEDAWSLAYQRLLPRWKSLSQSHLSPIPISISKVNQVDAARLDIEMSAMLKEQLVKV 598
           S P +DAW  +YQRLLP  +SL  S  S IP++IS+VNQ DAARLD+EMSAMLKEQLVKV
Sbjct: 4   STPADDAWIRSYQRLLPESQSLLASRRSVIPVAISRVNQFDAARLDVEMSAMLKEQLVKV 63

Query: 599 FALMKPGMLFQYEAELDAFLEFLIWRFSIWVDKPTPGIALMNL----------------- 658
           F LMKPGMLFQYE ELDAFLEFLIWRFSIWVDKPTPG ALMNL                 
Sbjct: 64  FTLMKPGMLFQYEPELDAFLEFLIWRFSIWVDKPTPGNALMNLRYRDERGVVAQHLGKVR 123

Query: 659 --------------------------------------------RSLARRAWLLIQRIEG 718
                                                       R LARR W L+QRIEG
Sbjct: 124 TGLEGPGLTSPQKIWYCVASVGGQYLFSRLQSFSAFRRWGDSEQRPLARRLWTLVQRIEG 183

Query: 719 IYKAAAFGNLLIFLYTGRYRNLVERVLRARLVYGSPNMNRAVSFEYMNRQLVWNEFSEML 778
           IYKAA+F NLL FLYTGRYRNL+E+ L+ARLVY SP+MNR+VSFEYMNRQLVWNEFSEML
Sbjct: 184 IYKAASFLNLLSFLYTGRYRNLIEKALKARLVYRSPHMNRSVSFEYMNRQLVWNEFSEML 243

Query: 779 LLLLPLLNSSSVRNFLRPFSKEKSSSSAEDDSACPICLASPTIPFLALPCQHRYCYYCLR 808
           LLLLPLLNSS+V+N L PF+K+KSSS+ ED   CPIC   P IPF+ALPCQHRYCYYC+R
Sbjct: 244 LLLLPLLNSSAVKNILSPFAKDKSSSTKEDTVTCPICQVDPAIPFIALPCQHRYCYYCIR 303

BLAST of Cp4.1LG02g14560 vs. TAIR 10
Match: AT1G79810.2 (Pex2/Pex12 N-terminal domain-containing protein / zinc finger (C3HC4-type RING finger) family protein )

HSP 1 Score: 329.3 bits (843), Expect = 8.5e-90
Identity = 177/282 (62.77%), Positives = 195/282 (69.15%), Query Frame = 0

Query: 587 MSAMLKEQLVKVFALMKPGMLFQYEAELDAFLEFLIWRFSIWVDKPTPGIALMNL----- 646
           MSAMLKEQLVKVF LMKPGMLFQYE ELDAFLEFLIWRFSIWVDKPTPG ALMNL     
Sbjct: 1   MSAMLKEQLVKVFTLMKPGMLFQYEPELDAFLEFLIWRFSIWVDKPTPGNALMNLRYRDE 60

Query: 647 --------------------------------------------------------RSLA 706
                                                                   R LA
Sbjct: 61  RGVVAQHLGKVRTGLEGPGLTSPQKIWYCVASVGGQYLFSRLQSFSAFRRWGDSEQRPLA 120

Query: 707 RRAWLLIQRIEGIYKAAAFGNLLIFLYTGRYRNLVERVLRARLVYGSPNMNRAVSFEYMN 766
           RR W L+QRIEGIYKAA+F NLL FLYTGRYRNL+E+ L+ARLVY SP+MNR+VSFEYMN
Sbjct: 121 RRLWTLVQRIEGIYKAASFLNLLSFLYTGRYRNLIEKALKARLVYRSPHMNRSVSFEYMN 180

Query: 767 RQLVWNEFSEMLLLLLPLLNSSSVRNFLRPFSKEKSSSSAEDDSACPICLASPTIPFLAL 808
           RQLVWNEFSEMLLLLLPLLNSS+V+N L PF+K+KSSS+ ED   CPIC   P IPF+AL
Sbjct: 181 RQLVWNEFSEMLLLLLPLLNSSAVKNILSPFAKDKSSSTKEDTVTCPICQVDPAIPFIAL 240

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
Q9CA86          1.2e-104   62.42     Peroxisome biogenesis protein 2 OS=Arabidopsis thaliana OX=3702 GN=PEX2 PE=1 SV=...
Q75JQ3          1.9e-30    26.60     Peroxisome biogenesis factor 2 OS=Dictyostelium discoideum OX=44689 GN=pex2 PE=3...
P24392          9.4e-25    27.60     Peroxisome biogenesis factor 2 OS=Rattus norvegicus OX=10116 GN=Pex2 PE=2 SV=1
P55098          7.9e-24    27.24     Peroxisome biogenesis factor 2 OS=Mus musculus OX=10090 GN=Pex2 PE=2 SV=1
Q06438          1.8e-23    28.32     Peroxisome biogenesis factor 2 OS=Cricetulus griseus OX=10029 GN=PEX2 PE=1 SV=1

Match Name      E-value    Identity  Description
KAG6606758.1    0.0        85.17     Peroxisome biogenesis protein 2, partial [Cucurbita argyrosperma subsp. sororia]
KAG7018321.1    0.0        79.14     Peroxisome biogenesis protein 2, partial [Cucurbita argyrosperma subsp. argyrosp...
KAG6581888.1    0.0        79.14     Peroxisome biogenesis protein 2, partial [Cucurbita argyrosperma subsp. sororia]
XP_023524868.1  0.0        98.88     uncharacterized protein LOC111788671 [Cucurbita pepo subsp. pepo]
XP_022949517.1  0.0        97.75     uncharacterized protein LOC111452845 [Cucurbita moschata]

Match Name      E-value    Identity  Description
A0A6J1GC96      0.0        97.75     uncharacterized protein LOC111452845 OS=Cucurbita moschata OX=3662 GN=LOC1114528...
A0A6J1KCQ5      0.0        97.30     uncharacterized protein LOC111492689 OS=Cucurbita maxima OX=3661 GN=LOC111492689...
A0A1S3BHM5      2.21e-304  88.46     uncharacterized protein LOC103489955 OS=Cucumis melo OX=3656 GN=LOC103489955 PE=...
A0A0A0LAU9      2.91e-304  88.22     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G469200 PE=4 SV=1
A0A6J1GVV2      4.09e-299  88.34     uncharacterized protein LOC111457951 OS=Cucurbita moschata OX=3662 GN=LOC1114579...

Match Name      E-value    Identity  Description
AT3G16200.1     1.1e-190   72.11     unknown protein; Has 97 Blast hits to 97 proteins in 15 species: Archae - 0; Bac...
AT1G79810.1     8.5e-106   62.42     Pex2/Pex12 N-terminal domain-containing protein / zinc finger (C3HC4-type RING f...
AT1G79810.2     8.5e-90    62.77     Pex2/Pex12 N-terminal domain-containing protein / zinc finger (C3HC4-type RING f...
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                         Source       Source Term    Source Description                            Alignment
IPR001841  Zinc finger, RING-type                  SMART        SM00184        ring_2                                        coord: 751..791; e-value: 0.0084; score: 25.3
IPR001841  Zinc finger, RING-type                  PROSITE      PS50089        ZF_RING_2                                     coord: 751..792; score: 10.055033
IPR018957  Zinc finger, C3HC4 RING-type            PFAM         PF00097        zf-C3HC4                                      coord: 751..791; e-value: 5.9E-5; score: 22.8
IPR006845  Pex, N-terminal                         PFAM         PF04757        Pex2_Pex12                                    coord: 644..734; e-value: 6.4E-16; score: 58.6
IPR006845  Pex, N-terminal                         PFAM         PF04757        Pex2_Pex12                                    coord: 586..649; e-value: 4.3E-6; score: 26.5
IPR013083  Zinc finger, RING/FYVE/PHD-type         GENE3D       3.30.40.10     Zinc/RING finger domain, C3HC4 (zinc finger)  coord: 727..807; e-value: 7.6E-9; score: 37.1
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                           coord: 57..80
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                           coord: 1..23
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                           coord: 37..56
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                           coord: 37..80
None       No IPR available                        PANTHER      PTHR36362:SF2  BNAC05G37190D PROTEIN                         coord: 40..475
None       No IPR available                        PANTHER      PTHR36362      DNA-DIRECTED RNA POLYMERASE SUBUNIT BETA      coord: 40..475
None       No IPR available                        CDD          cd16526        RING-HC_PEX2                                  coord: 751..792; e-value: 4.63275E-19; score: 78.9674
None       No IPR available                        SUPERFAMILY  57850          RING/U-box                                    coord: 744..798
IPR017907  Zinc finger, RING-type, conserved site  PROSITE      PS00518        ZF_RING_1                                     coord: 767..776

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG02g14560.1  Cp4.1LG02g14560.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0032774      RNA biosynthetic process
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003899      DNA-directed 5'-3' RNA polymerase activity
molecular_function  GO:0046872      metal ion binding