Cp4.1LG06g08410 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG06g08410
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: INCENP_ARK-bind domain-containing protein
Location: Cp4.1LG06: 5275328 .. 5285653 (+)
RNA-Seq Expression: Cp4.1LG06g08410
Synteny: Cp4.1LG06g08410
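
The Location field combines the linkage group, the start and end coordinates, and the strand in a single string. The following is a minimal sketch of how such a string could be parsed; the names `Location`, `parse_location`, and `span_length` are illustrative, and the 1-based, inclusive coordinate convention (as in GFF3) is an assumption that this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # linkage group / scaffold name
    start: int   # first coordinate of the feature
    end: int     # last coordinate of the feature
    strand: str  # "+" or "-"


def parse_location(text: str) -> Location:
    """Parse a string such as 'Cp4.1LG06: 5275328 .. 5285653 (+)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    # Assumption: coordinates are 1-based and inclusive (GFF3-style).
    return loc.end - loc.start + 1


loc = parse_location("Cp4.1LG06: 5275328 .. 5285653 (+)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under that assumed convention, the gene spans 10,326 bp on the plus strand of Cp4.1LG06.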
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGAACGAGGACCGAAGTCTCCCAAAATTTGAATTCACTTCGCCCGCTACTCGCGAAAGTGTTTCTTCTTCGTCGGAAAACCCTCCATTGAAACTCAACTCTTCTTCCTCTCTGTTACTCGAACTGTTCATCTCTCTCTCTCTCTCTTTTCATGGCCAATCTGTGAAACATGGCGGCGATGGAGAAGCTATTCGTGCAGATCTTTGAGAGGAAGAAGTGGATCATTGACCAGGCCAAGCACCAGATCGATCTCTTCGACCAGCAACTTGCATCCAAGCTCATTATCGATGGAATTGTTCCTCCGCCTTGGCTTCACTCGCCTTTTCTTCATTCCAACATTTCGTATTTTGAAGGTAACTTCGCGTTTTTGTCGTAATTTTTCATTTCTTATGCTCCATGTTTAGTTAGGTCGAAGCAAGAATTTCTCCATTTCGCAACTTTTTTGCTCAATTGGTTTCTTTTAGGTGTAGGAGTGAGCAGGAATTTTGTTCCTGGAGTTGAGGTCCCACGGTCGCCGCTTCAGACCCATTGTTCTAGTTTGAATGAGGCATTTGTTGCAAACAGTGGGGAGGAGTTGCAGCAAAGGTCGAATGAAGATGCTGGTTCTTTAAACGATGATTTTGATGCAGGAAATAGGCCTGCAGTTTTACCTCAGTGCAATGTAAGTGACGCCCGTGTCTTTAATTGCGCACCTCGTGTTGACACAAGTCCTGTTTCTCCTCAAGGTCGAGGAGGCGGAGTTTTAGAAAATTACCAAGATCCTACTCTGTCACGGGCACGGTTACATAGATCTAAATCTAGGCAAAGGGCTTTAGAGTTGCGTAATAGCGCGAAATCTGCTAGGTGCCACTCCCGATATGAGAACAAGAATGATTCCGTTGCTGATGGGATTGTGGGATCTGCTATTAGTTTGCTGCAGGCTGATCACGAAGATGAATCAGAGTTGGCAAAGCCTTCTAGCAGCTGTAAGGGTATTGGTTCTATGGAAGAGGAGACTAATGTTTGTTGCGAGCAGAAGAATATCTCTACTTGCGCTGATAAATCTAGGCAAAGGGCTTTAGAGTTGCGTAATAGTGTGAAATCTTCTAGGTGCCACTCCCGGTATGAGAACAAGAATGATTCCGTTGCTGATGGGATTGTGGGATCTGCTATTAGTTTGCTGCAAGCTGATCACGAAGATGAATCAGAGTTGGCAAAGCCTTCTAGCAGCTGTAAGGGAATTGGTTCTATGGAAGAGGAGACTGATGTTTGTTGCGAGCAGAAGAATATCTCTATTTGCTCTGATAAATCTAGGCAAAGGGCTTTAGAGTTGCGTAATAGTGTGAAATCTTCTAGGTGCCACTCCCGGTATGAGAACAAGAATGATTCCGTTGCTGATGGGATTGTGGGATCTGCTATTAGTTTGCTGCAAGCTGATCACGAAGATGAATCAGAGTTGGCAAAGCCTTCTAGCAGCTGTAAGGGAATTGGTTCTATGGAAGAGGAGACTGATGTTTGTTGCGAGCAGAAGAATATCTCTATTTGCTCTGATAAATCTAGGCAAAGGGCTTTAGAGTTGCGTAATAGTGTGAAATCTTCTAGGTGCCACTCCCGGTATGAGAACAAGAATGATTCCGTTGCTGATGGGATTGTGGGATCTGCTATTAGTTTGCTGCAAGCTGATCACGAAGATGAATCAGAGTTGGCAAAGCCTTCTAGCAGCTGTAAGGGAATTGGTTCAGTGGAAGAGGAAACTAATGTTTGTTGCGAGCAGAAGAAGATCTCTATTTGCTCTGGTAAAGTTACAATAGTTGGAAGCCCTGGGTTGCAAAGTAGCTCTATTGATGTGGTTAATTCTTTAAATATTTACTTAGAAAATGAAGGGTTATGTGTAGCGGAAGGTTCAATGCAGAATTCTTATAAAGTGAATGAGCAATTTGACTCGCCTAGAACTTCTTCGGGAAAGATTGGATACTGTGAAGAAGGGCCGGCATGTTGCAGGAGTCAGGAATCTAATTTTGA
TAATGCTGAACTGTCTAGGTTGCAATGTAGCTCTTTGGATGTGGATAAATCTTCACGCATTCCCCCTGAAGATGGAAGAGAGTGTCCTATAGGAGGATCAAAATTGCATTCTGATCAAGTGGATGAGCAACTGGACTTGCCTAAACCTTCTTCTGACAATGTTGAGTGCTGTGAAGAGGCAAAATTAGTAGACTGCAGGAGCCAGGAATGTAATCTTGACAATGCTCTACAGTCCAAGTCACAACGAAGTTCTCTGGATGTGGACGATTCAGCATGCATTGACGCCAATGATGGAAGATTATTGGACTCGCCTAACCCTTCTTCTAGCAATGTCAAATGCTGTGAAGAAACTGTTGTAGGACATTGTAGGAGCCAGGAATGCAATTTTGATAATGCCCGAGAGGCTGGGTCGCTATACAACTCCCAGGATGTAGATAAGTCTTCATACGTCCACTCTGAGGACGGACAATCATGTCCTAATGGAAGTTCAGAAGTGCATTCTGATGAAGTGAAAGAGCAATTGGACTTATCTAAATCTTCTTCCGACAATATGGAGTGTTGTGAAGAAGAAATATTAGGAGATTTCAGGAGTCAGGAGTATAATTTTAATAATGCTCAAAAGTCAGAGATGCAACATGACACCCTGGATGCGGATAATTCATCATGCTTTTCTTCTGAAAATGGAACTTGTTCTGTTGGAAGTTCAAAACTACATTCCGATCGAGTAAGTGAGCCGTCGGAGTTGTTTAGGCCTTCTTCTGCCAATGTTGAATGCCATGAAGTAGGACTAGGAGACTGTAGGACCCAAGATTGTAATTTTGATAATAATGCAGAAAAGTCTGGTTTAGACAAAATTTCCAGTTCACCAATAACGGAAGTAAGGGAGAAAACATCAGATAAGAAGCCCTCCACTTCCGTGGATAACAAGAGGGATGTTAATGAAAAAGAAAAATGCAATTCACCCCTTCACATGCCTATGCCGCAGATTCAGGTCGACTCAGTGAACGAAGACAAACATCATAAAGGTATATGTGAATCTCAAAGTGAAAAGAGATATGATAAAGAAGTAGCTACTTGTTCTTTGCTGCAAAGTGATGAACCTGTAGAACAAAATATTTCTTTGAAAGATGGAGTGCCGAATTTGCAGTATTCCCATGAAAATGCAGTTGAAATTCAACTAGTGGATACAGACGATGCATCAATTCTGATAAGAGATACAGAAACGTTTAGAGATCAAATGGTCATGGCTCCTTGTGTTCCTTCCGCTGGTGAGGGGGATAGTAATTTGGAGCAGAAACAAAAAAGTTCAGGCATAACTCAGTGTGAAGATTCAGATTCCTTTGAGGGTTGCACTGATCACATGGTCATGGCTCCTTGTGTTCCTTCTGCTGGTGAGGGGGATAGTAATTTGGAGCAGCCACTGAAAAGTTCAGGCATAACTCAGTGTGAAGATTCAGATTCGTTTGAGGGCTTCACTGAGCACTTGAATGGTAACCATCATTACGTATCAACAGAGTGCCAGACTGCAGAGACATCAATAAAGTCAAAAACTTTCAGCTCAGTTTTGAGGGCATCTAGTTCTGACGAAAAGGAGATAGAGGTTGAGCTGCAATTGGACAATGGTATTCCAGCGTCTTTAGGCTTGAGGAGTGAGCAACTTCAAATCAACAGGAGTCCTATAGATAAAAACTTGATGCAGGAATTTGACACTGAAAAACCTGTCCTTGAACTTCAACGATTATCATTTTGTGAAGAAGGATACCAACAACCAAATGTGAGCATCGGCCCTATTGAAATGTTGCTATTGGAAAAAGAAGCTCGCTTGATTCAGAGGTCTGATTCTTCACCCACGCTTCCAGTCAAAGAGGTATGTATTACAACGGGAAGATTTTTCTAGCTAGTGTTTTGAAGTGTGAACTGCAGTAACTTCTTAGGGCTTAATTGCTGGATGCAGGAAATTTTTTTTGAAGGTTGAAGCACACGATAATAAAATATACATT
AAAATACATATAATATTAAGAACTAGCATGATGTATGAGGTATAGAAAAAAAGAGGTATTTGATACCCCCCTCCTTTAGGTCATTAACATTTCTGTTTCTTGCTTTCGTTTTTTGTGTTCTTCTGGGTAATAACCATTTTCTGTTTCTCATTTGCAAGAAAATTTTAGAAAAGAAATCTGCTTGTTTCCAATTTTTCACAAACATTTAAAAGTAGTATTTAAATAATTTTAACTAATGAAGTACCTATGAAATATAAAAAATTGAAAATATAAATACAATTTTGAATGTTTTACTTTCAAAAAAATTATATTTTCAAGTAGAATTTTCATTTTGTTATATCATATGTTAGAGAATAAGAAGTCAGACCAAACACATAAAAAATGAAGGATTAGAAAGAAAAAAAAAACTGCCAAACACATTTTATTTCCGTATCAGAAAATGAAAAACAAGAGATCAGACACTATCAAACATGCCTTGGCAGGTTGACTATTAAAAACATAGTTCATGCATAGTCTTCAAAATCTTTAAAAATAAACTACAATATCACGAGGCAGACCCAGAACCAGTCATCATCACCAAATAGATTCTCACCTCCAATTTTTCCATCATTACTTATAATCTCTTGTATTTTTAACCATTAGACTCTATAGCCAAGTATTTCCTTCTTTAGTCTAGTTATTTTCATGTGCCTCTACGTCTGCAAAGCTTTAGCAGTCATTCAGCTGGTGATATGCATTTTACTTCACGATTAAGCTAAGGCACTTAGGCACTTAGGTGTACTTATGAAGTTAAGATCAAACAATTGAACTATGTGTTGTTTTCAAGTATAATGATCTCTTTGGTGAAACATTGGTATACTAAAATGCATATCACTTACATTTATTGCTAAAGTACCTGTGGAAGAATGTGAAAATAATTCTTCCAATTCATATAACAAAAGTTGATTTTGTTCTTTTTAATATGTTGTTTTCTGCATACTGTTCTTGAAAGGATGGAGCCAGATTTAGTAAGGGAGATGATGGTGGTCATTCTGATTGTGCTTTTATTAGGGTGTAATGTTCACTAAATGCTGACGTATCAGTTTCTAACAAATTTCTTGGCAGGGATTTGATTGATTTTTCCTCCTTTTTTACTTTTTCTATTTACTTCACGATCTTGTTTCTTTGTGCATGATTGACAGGATCTCTCTCGGTTCGGAAGCAATAACAGAGGTACACCATTGCAAAATGGTATGCTAGAGAGCCAAAGTTTGGTTCCCGAAGAAAATTTTCAGTGTGGAGATATTGAACTTCCTATGGATACTGGGAAAACTGATGGAATGGAGGAAAAGGGGAAACTTACTTTGTGCTCGCTTCATACTCCACTTACCCAAACTTCTCATTATCTTGGTGCAGACAAGGATATGCCTGCTTTAGAGGGGTTCCTAATGCGATCTGATGACGAAGAGCCATGCATTTCTGTTGGTGGAATCAACTTTGACAAATTAGATCTTTCAAAATGTATGATAGAACGTGCTAGCATCTTGGAGAAAATTTGTAAATCTGCTTGTATAAACAGTACATTATCCTCACCTTCAGAAAGTTTTAGGCTGAACAAGGTGACAGATTTGTACAATTCTCTTCCTAATGGTCTACTAGAGTGCATGGACTTGAAGAATAACCTTCTGATGAATGATCAAAATAAGCTACTGAAGGATGGTAGTAACTCTTTGAATGGAGAAGTCAACTTCTCTCCTCATGGGTCTTCTTTTGATTGCCTGCAAAGCTTTAACAGTCATTCAGCTGGTGATCTCAGGAAGCCATTTGCATCTCCATTTGGTAAGTTGTTGGATAGAAATTCATTAAATTTGTCAAGTTCTGGAAAACGAAGTGGCCAGAACATAGAGCTTCCTTGCATTAGTGAGGAAGCTGAGAATACCGATGAGATTGATAACGAATTTTCGAAGGATATGAGATCGAGCAAGCGAGCACCACTTGTTGACATTACAGAAGATGCAAAT
GTTGAGGTAACAGTTTCTGAAGCTGCGGCGGTTGCTGATAGATTGAGTTTAGAATCTTTAAACATAGAACTCAGCAACACAAGGACTCATATTGGGACCAAAGAGAATCTGGGAAACCAGAAAAGCAGCAAGAGGAAATATGTGAATGAGGCTGTGAGTCGTGATACCTTGCCAGGAGAAAACGGTGCTAAAAGAGTCACTAGATCATCCTATAATATATTTAGCCGGTCAGATTTATCCTGTAAAAAAGATTTCAGAAAGGAAGGTCCTCGATTCTCTGAAAAGGAATCCAAGCATAGAAATATCGTGTCCAATATAACTTCTTTTATTCCTCTTGTCCAACAAAGAGAAGCTGCAACTATTTTGAAAGGTATGTTTATTTATTTATCTTGACGTACATGGATTGATCATGAACCTTAAGCATTTCTTGGATATAGTTTGAACATCGTGTATCTGATGATCTGTGTATTGTCTGAATTGTAATTTATATTTTGTAACTTGTGCATTCCTGTAGGGAAGAGAGATATTAAGGTGAAGGCCATCGAAGCTGCTGAGGCTGCAAAACGCCTTGCAGAAAAGAAAGAAAATGAACGTCAAATGAAGAAAGAAGCCCTGAAACTTGAAAGAGCAAGAATGGAACAAGAGAATTTGAGGCAGATTGAACTGGATAAAAAGAAGAAAGAAGAAGAGCGGAAGAAGAAGGAGGAAGAAAGGAAGAAAAAGGAGGTTGATATGGCAGCAAAGAAAAGACAGAGGGAAGAAGAAGAGAGGAAGGAGAAAGAAAGAAAAAGAATGCGTGTTGAAGAAGTTAGGAGACGATTACGAGAGCATGGTGGGAAGTTACGATCTGATAAAGAGAATAAGGAAGCAAAACCCCAAGCCAATGTAAGATGCTATATGTGACAACGTTTGAACTTTCCTTGTGCTTATGGGCTTTGAAAGGTAAAAACCGCTTATTATTTTCTTCTGTAGGACCAAAAACCACGTGACAGAAAGGGATGTAAGGATGGGACTGTCAAACTGGTCAAGGAAAGTGGCCATGACAGCTTTCACAAACTCTCAGTTACCGAGTCTAAGACTACTTCTACAAGCGATGCTGTGAGGGGAAGCTTTGTTGTGGAGGACTCACAACCAACGAGTGTTGATTTTCTAGAGGCAGAGGTAAATTGCTTTGTGAGTGCAAATTTCCAAAATATATGACGTGATTTAGGTGAAAAGTTATATTCCCATGATAGGAACGGGTAATTAGCTAGTATCCTTCCATTTTCCTAATCCTCACCAGTCTTCTACCTCAAAATATCACATATATTTTTGTGATATTGTGAGTGCTAAGTTATATTCCCAGAGGTAAATTGCTTTGTGAGTGCAAATTTCCAAAATATATGACGTGATTTAGAGTGAAAAGGTATACTCCCATGATAGGAATGGGTAATTAGCTAGTATCCTTCCATTTTCTTGATCCTCACCAGTCTTCTACCTCAAAATATCACACATATATGTACTTTTTATAGAATTGTCATGATCTTTTTGAAGGATAATAGATGGAGGAACTACATTGAAATGTTGTGGAGGTATTCAGTGCTATATCTGAAAGATTTTGGCTTCTAGATGAAGTAGTTGTTGGAATTGTCCTAAGATATCTTTTGTAGATAAACAATAAGATATTTGAAAAAGGACAGAACTCACAACACCTTAGGCTTCCAGCTGGTTTTAGATCTTAATCCACAAGAAACATGGAAGTCATTTTCTAGAGAATTAAAGCTTTATTGATTTTGCTGTGTCCTGTTACATAGTTGTATCCAAAGAAAATTAATTCTAATTCATCTAGAAGTTCTGTGGGACTTTGAAGAGGAAATCATAGTGAAACCTGGAAATCTAATTGAACCAAGAATTTCAGGACACCATATAATAATGACGTTTGTTTAACGCTTGATCTTATCATATGTAGCTTAATAAATATGAGAAATGAAATAATTTATACGTTTGAACTCCCTTGAGAA
TTTGAATACATTGGAAAATACTGTTTTTCACTCCAATTGCAATCCAACGTAAATTGTGTTGTTTCTCTTCTGTCCTAAGATGACTGCCCAAGTATTTGGTAGTTTAAGGTGAGAATTAGTTGGCAAATAAAGATAGATCTTGCTTGGTTTGAGCAAAATGGAATAAATTTTTTACAATTGTAATACACCTTTAGAGGTTTGATATTAAATCGTCATTTTGCTTGGATGATTTTTCCTGTTCTCCTTTGAGCTTCATATGAGTGTAACCTTCACAACCATTTCTTAAATATTAAAAGTTTATCTGCTTCATGCGGGAATTTATGATCTTCTATAGGCACTTGAAAATGTGATGGAACATAGAATCTCCGAAACAAGTGAAGAACAATCATATCAGATTTCTCCTTACAAAGCTTCTGATGATGAAGATGAAGAGGATGACGATGACGGCATACAAAATAATAAATTTGTTCCTTCATGGGCCAGGTGTGTAGCTCGGCCTTATGTTTTCATATAAGGAAATAACAAATGTGATTTGAACTTCAATAATGCAATTAGTACATTTTGCATTTATTTTGTCAAAGATTATCCCGAAGAACATTTCAAATTTGCCAATTTTTTCTGTTAGAACTAATATCTTCATACTTCTTTATGCAGTAAGGATCGCTTAGCTGTTCTTTTTGCTTCCCAGAAAAAATTGGATCCAGAAATTATCTTTCCACCGAAAAGTTTTTGTGACATAGCTGAAGGTGAAATTAACGCATAAAGATGGTTGCAGCTTTTAATTCTTGTTAAATAAAAACACACTGAGTTAGTTTAACTGTTTTGCAGTTCTCTTGCCTCGACAACATCAGTCTAAATAGACAATAGACCGAACCTTCACAATGTGGATAGATTTTTTATCTGCAACACAAACATTCCCCTTTCTGCCTGTGCAAGAGACTTGCCTAGGTACGCTTTTACTAAACTGCTGTTTTTCTCTGACTTGAACATCATATCGTGGTTTATCATTTTTGCCTTGTTTAATAAATTAGTTGCAACCCCACTTGTTTGGACTTCTTGGCATTAGAGGATCACTGTTAGTAAAGTAGGCCTATGTACATAACGTCGACTTTAGTATTTGAGCAAATCTTTTAGCCACCCTTCACTTATTTCACTCGGTAAATAGGCTTCTCTCTTGAACCTTTTTTCACTCGGTAAAGGTGGATGCTATCGGTCGAAAATCTCTTCTTTCCTCTCCCTTCAAACACTGTAAACGAAGAAAATGAAAGGATGTTCGAGCGTTTTCGCTCCTTCACCTATTTATTTTTGAACTGTAAACTCGGTGGCTTCGCTCCTTAGCTTTTGTTTTTTCTCCTTCACCTCGTAAACAAGAAAGACTCAAGATTAATTCGATTGAATTCAATCACTTAAAGAACGGTAAACTCGGTCTTCGTCTTCTTCCTTTTCCTCTCCCTTCAAATACTGACTCTTTTCTTTTCTTTCTCTTTTGAAGGATAGATTTAGTTACCTAGTATCTAGGAGATTTAGTTACCTAATATCTAGAGATTATTGTATCTAAAGATTATTATTTAGATCTATAGTTACCAATAACTAGAGATTTATCTTTATCGTGTTATCTTATTACCTATTGTTTATGGATTTATTTAGATTACTAGTACTTAGGAAATTTAGATTATCTATTGATTTAGACTTAGTTAGCCTTTCTCACGTAGACATGTAATTCTCTGAATAATAATAAGCCAGCTAATTTAGACTTGCAACCGTTTTTATTTGTGCCATGTCTCATTGAAATGATAGAGTGCTATATTATGTGGCAGATTTTGCTGATTAATTTCTCAAGGAATTGCTGCTCAGGTGATTAGTTTCTTGCTGCTCTTTAACTCTGAATAGCTAAATTGAGATGATCATTGTACAACTACGCAGCTTTATATGTAGGGTCAATTTCTTAGCACCAAAATAAACTATACAACCCGATTCTCACCACCCTCCCTCCTTATGAC
TGGATATCTCAACCATGAAACAGTCGGGGAAAAAGGCTGTAGTGAATTCTTGGTGTTAACCTTCCCTTCTTTTGCTTTGGGGTAGGCAGGGGGAGGATTTAGGTTACTCCAACTTGTAGGTGTAAAGCACTAGTTTCTAACATAGCTTCTCTATGTACAATTTTTTTTTTTAATATATTATCTAATGAAGAAGATGAAAGTGCAGATGAGATTGTTCAACTTGAAGAGGCAAATTTTGATTATATTCTAGAAAAAGAAATGAGTAACCTTAACGTCGTAACGGCATAAGCATTGTTACCTAGGTCTATGTCGGAGGAAACCATAGT

mRNA sequence

CGAACGAGGACCGAAGTCTCCCAAAATTTGAATTCACTTCGCCCGCTACTCGCGAAAGTGTTTCTTCTTCGTCGGAAAACCCTCCATTGAAACTCAACTCTTCTTCCTCTCTGTTACTCGAACTGTTCATCTCTCTCTCTCTCTCTTTTCATGGCCAATCTGTGAAACATGGCGGCGATGGAGAAGCTATTCGTGCAGATCTTTGAGAGGAAGAAGTGGATCATTGACCAGGCCAAGCACCAGATCGATCTCTTCGACCAGCAACTTGCATCCAAGCTCATTATCGATGGAATTGTTCCTCCGCCTTGGCTTCACTCGCCTTTTCTTCATTCCAACATTTCGTATTTTGAAGGTGTAGGAGTGAGCAGGAATTTTGTTCCTGGAGTTGAGGTCCCACGGTCGCCGCTTCAGACCCATTGTTCTAGTTTGAATGAGGCATTTGTTGCAAACAGTGGGGAGGAGTTGCAGCAAAGGTCGAATGAAGATGCTGGTTCTTTAAACGATGATTTTGATGCAGGAAATAGGCCTGCAGTTTTACCTCAGTGCAATGTAAGTGACGCCCGTGTCTTTAATTGCGCACCTCGTGTTGACACAAGTCCTGTTTCTCCTCAAGGTCGAGGAGGCGGAGTTTTAGAAAATTACCAAGATCCTACTCTGTCACGGGCACGGTTACATAGATCTAAATCTAGGCAAAGGGCTTTAGAGTTGCGTAATAGCGCGAAATCTGCTAGGTGCCACTCCCGATATGAGAACAAGAATGATTCCGTTGCTGATGGGATTGTGGGATCTGCTATTAGTTTGCTGCAGGCTGATCACGAAGATGAATCAGAGTTGGCAAAGCCTTCTAGCAGCTGTAAGGGTATTGGTTCTATGGAAGAGGAGACTAATGTTTGTTGCGAGCAGAAGAATATCTCTACTTGCGCTGATAAATCTAGGCAAAGGGCTTTAGAGTTGCGTAATAGTGTGAAATCTTCTAGGTGCCACTCCCGGTATGAGAACAAGAATGATTCCGTTGCTGATGGGATTGTGGGATCTGCTATTAGTTTGCTGCAAGCTGATCACGAAGATGAATCAGAGTTGGCAAAGCCTTCTAGCAGCTGTAAGGGAATTGGTTCTATGGAAGAGGAGACTGATGTTTGTTGCGAGCAGAAGAATATCTCTATTTGCTCTGATAAATCTAGGCAAAGGGCTTTAGAGTTGCGTAATAGTGTGAAATCTTCTAGGTGCCACTCCCGGTATGAGAACAAGAATGATTCCGTTGCTGATGGGATTGTGGGATCTGCTATTAGTTTGCTGCAAGCTGATCACGAAGATGAATCAGAGTTGGCAAAGCCTTCTAGCAGCTGTAAGGGAATTGGTTCTATGGAAGAGGAGACTGATGTTTGTTGCGAGCAGAAGAATATCTCTATTTGCTCTGATAAATCTAGGCAAAGGGCTTTAGAGTTGCGTAATAGTGTGAAATCTTCTAGGTGCCACTCCCGGTATGAGAACAAGAATGATTCCGTTGCTGATGGGATTGTGGGATCTGCTATTAGTTTGCTGCAAGCTGATCACGAAGATGAATCAGAGTTGGCAAAGCCTTCTAGCAGCTGTAAGGGAATTGGTTCAGTGGAAGAGGAAACTAATGTTTGTTGCGAGCAGAAGAAGATCTCTATTTGCTCTGGTAAAGTTACAATAGTTGGAAGCCCTGGGTTGCAAAGTAGCTCTATTGATGTGGTTAATTCTTTAAATATTTACTTAGAAAATGAAGGGTTATGTGTAGCGGAAGGTTCAATGCAGAATTCTTATAAAGTGAATGAGCAATTTGACTCGCCTAGAACTTCTTCGGGAAAGATTGGATACTGTGAAGAAGGGCCGGCATGTTGCAGGAGTCAGGAATCTAATTTTGATAATGCTGAACTGTCTAGGTTGCAATGTAGCTCTTTGGATGTGGATAAATCTTCACGCATTCCCCCTGAAGATGGAAGAGAGTGTCCTATAGGAGGATCAAAATTGCATTC
TGATCAAGTGGATGAGCAACTGGACTTGCCTAAACCTTCTTCTGACAATGTTGAGTGCTGTGAAGAGGCAAAATTAGTAGACTGCAGGAGCCAGGAATGTAATCTTGACAATGCTCTACAGTCCAAGTCACAACGAAGTTCTCTGGATGTGGACGATTCAGCATGCATTGACGCCAATGATGGAAGATTATTGGACTCGCCTAACCCTTCTTCTAGCAATGTCAAATGCTGTGAAGAAACTGTTGTAGGACATTGTAGGAGCCAGGAATGCAATTTTGATAATGCCCGAGAGGCTGGGTCGCTATACAACTCCCAGGATGTAGATAAGTCTTCATACGTCCACTCTGAGGACGGACAATCATGTCCTAATGGAAGTTCAGAAGTGCATTCTGATGAAGTGAAAGAGCAATTGGACTTATCTAAATCTTCTTCCGACAATATGGAGTGTTGTGAAGAAGAAATATTAGGAGATTTCAGGAGTCAGGAGTATAATTTTAATAATGCTCAAAAGTCAGAGATGCAACATGACACCCTGGATGCGGATAATTCATCATGCTTTTCTTCTGAAAATGGAACTTGTTCTGTTGGAAGTTCAAAACTACATTCCGATCGAGTAAGTGAGCCGTCGGAGTTGTTTAGGCCTTCTTCTGCCAATGTTGAATGCCATGAAGTAGGACTAGGAGACTGTAGGACCCAAGATTGTAATTTTGATAATAATGCAGAAAAGTCTGGTTTAGACAAAATTTCCAGTTCACCAATAACGGAAGTAAGGGAGAAAACATCAGATAAGAAGCCCTCCACTTCCGTGGATAACAAGAGGGATGTTAATGAAAAAGAAAAATGCAATTCACCCCTTCACATGCCTATGCCGCAGATTCAGGTCGACTCAGTGAACGAAGACAAACATCATAAAGGTATATGTGAATCTCAAAGTGAAAAGAGATATGATAAAGAAGTAGCTACTTGTTCTTTGCTGCAAAGTGATGAACCTGTAGAACAAAATATTTCTTTGAAAGATGGAGTGCCGAATTTGCAGTATTCCCATGAAAATGCAGTTGAAATTCAACTAGTGGATACAGACGATGCATCAATTCTGATAAGAGATACAGAAACGTTTAGAGATCAAATGGTCATGGCTCCTTGTGTTCCTTCCGCTGGTGAGGGGGATAGTAATTTGGAGCAGAAACAAAAAAGTTCAGGCATAACTCAGTGTGAAGATTCAGATTCCTTTGAGGGTTGCACTGATCACATGGTCATGGCTCCTTGTGTTCCTTCTGCTGGTGAGGGGGATAGTAATTTGGAGCAGCCACTGAAAAGTTCAGGCATAACTCAGTGTGAAGATTCAGATTCGTTTGAGGGCTTCACTGAGCACTTGAATGGTAACCATCATTACGTATCAACAGAGTGCCAGACTGCAGAGACATCAATAAAGTCAAAAACTTTCAGCTCAGTTTTGAGGGCATCTAGTTCTGACGAAAAGGAGATAGAGGTTGAGCTGCAATTGGACAATGGTATTCCAGCGTCTTTAGGCTTGAGGAGTGAGCAACTTCAAATCAACAGGAGTCCTATAGATAAAAACTTGATGCAGGAATTTGACACTGAAAAACCTGTCCTTGAACTTCAACGATTATCATTTTGTGAAGAAGGATACCAACAACCAAATGTGAGCATCGGCCCTATTGAAATGTTGCTATTGGAAAAAGAAGCTCGCTTGATTCAGAGGTCTGATTCTTCACCCACGCTTCCAGTCAAAGAGGATCTCTCTCGGTTCGGAAGCAATAACAGAGGTACACCATTGCAAAATGGTATGCTAGAGAGCCAAAGTTTGGTTCCCGAAGAAAATTTTCAGTGTGGAGATATTGAACTTCCTATGGATACTGGGAAAACTGATGGAATGGAGGAAAAGGGGAAACTTACTTTGTGCTCGCTTCATACTCCACTTACCCAAACTTCTCATTATCTTGGTGCAGACAAGGATATGCCTGCTTTAGAGGGGTTCC
TAATGCGATCTGATGACGAAGAGCCATGCATTTCTGTTGGTGGAATCAACTTTGACAAATTAGATCTTTCAAAATGTATGATAGAACGTGCTAGCATCTTGGAGAAAATTTGTAAATCTGCTTGTATAAACAGTACATTATCCTCACCTTCAGAAAGTTTTAGGCTGAACAAGGTGACAGATTTGTACAATTCTCTTCCTAATGGTCTACTAGAGTGCATGGACTTGAAGAATAACCTTCTGATGAATGATCAAAATAAGCTACTGAAGGATGGTAGTAACTCTTTGAATGGAGAAGTCAACTTCTCTCCTCATGGGTCTTCTTTTGATTGCCTGCAAAGCTTTAACAGTCATTCAGCTGGTGATCTCAGGAAGCCATTTGCATCTCCATTTGGTAAGTTGTTGGATAGAAATTCATTAAATTTGTCAAGTTCTGGAAAACGAAGTGGCCAGAACATAGAGCTTCCTTGCATTAGTGAGGAAGCTGAGAATACCGATGAGATTGATAACGAATTTTCGAAGGATATGAGATCGAGCAAGCGAGCACCACTTGTTGACATTACAGAAGATGCAAATGTTGAGGTAACAGTTTCTGAAGCTGCGGCGGTTGCTGATAGATTGAGTTTAGAATCTTTAAACATAGAACTCAGCAACACAAGGACTCATATTGGGACCAAAGAGAATCTGGGAAACCAGAAAAGCAGCAAGAGGAAATATGTGAATGAGGCTGTGAGTCGTGATACCTTGCCAGGAGAAAACGGTGCTAAAAGAGTCACTAGATCATCCTATAATATATTTAGCCGGTCAGATTTATCCTGTAAAAAAGATTTCAGAAAGGAAGGTCCTCGATTCTCTGAAAAGGAATCCAAGCATAGAAATATCGTGTCCAATATAACTTCTTTTATTCCTCTTGTCCAACAAAGAGAAGCTGCAACTATTTTGAAAGGGAAGAGAGATATTAAGGTGAAGGCCATCGAAGCTGCTGAGGCTGCAAAACGCCTTGCAGAAAAGAAAGAAAATGAACGTCAAATGAAGAAAGAAGCCCTGAAACTTGAAAGAGCAAGAATGGAACAAGAGAATTTGAGGCAGATTGAACTGGATAAAAAGAAGAAAGAAGAAGAGCGGAAGAAGAAGGAGGAAGAAAGGAAGAAAAAGGAGGTTGATATGGCAGCAAAGAAAAGACAGAGGGAAGAAGAAGAGAGGAAGGAGAAAGAAAGAAAAAGAATGCGTGTTGAAGAAGTTAGGAGACGATTACGAGAGCATGGTGGGAAGTTACGATCTGATAAAGAGAATAAGGAAGCAAAACCCCAAGCCAATGACCAAAAACCACGTGACAGAAAGGGATGTAAGGATGGGACTGTCAAACTGGTCAAGGAAAGTGGCCATGACAGCTTTCACAAACTCTCAGTTACCGAGTCTAAGACTACTTCTACAAGCGATGCTGTGAGGGGAAGCTTTGTTGTGGAGGACTCACAACCAACGAGTGTTGATTTTCTAGAGGCAGAGGCACTTGAAAATGTGATGGAACATAGAATCTCCGAAACAAGTGAAGAACAATCATATCAGATTTCTCCTTACAAAGCTTCTGATGATGAAGATGAAGAGGATGACGATGACGGCATACAAAATAATAAATTTGTTCCTTCATGGGCCAGTAAGGATCGCTTAGCTGTTCTTTTTGCTTCCCAGAAAAAATTGGATCCAGAAATTATCTTTCCACCGAAAAGTTTTTGTGACATAGCTGAAGTTCTCTTGCCTCGACAACATCAGTCTAAATAGACAATAGACCGAACCTTCACAATGTGGATAGATTTTTTATCTGCAACACAAACATTCCCCTTTCTGCCTGTGCAAGAGACTTGCCTAGATTTTGCTGATTAATTTCTCAAGGAATTGCTGCTCAGGTGATTAGTTTCTTGCTGCTCTTTAACTCTGAATAGCTAAATTGAGATGATCATTGTACAACTACGCAGCTTTATATGTAGGGTCAATTTCTTAGCA
CCAAAATAAACTATACAACCCGATTCTCACCACCCTCCCTCCTTATGACTGGATATCTCAACCATGAAACAGTCGGGGAAAAAGGCTGTAGTGAATTCTTGGTGTTAACCTTCCCTTCTTTTGCTTTGGGGTAGGCAGGGGGAGGATTTAGGTTACTCCAACTTGTAGGTGTAAAGCACTAGTTTCTAACATAGCTTCTCTATGTACAATTTTTTTTTTTAATATATTATCTAATGAAGAAGATGAAAGTGCAGATGAGATTGTTCAACTTGAAGAGGCAAATTTTGATTATATTCTAGAAAAAGAAATGAGTAACCTTAACGTCGTAACGGCATAAGCATTGTTACCTAGGTCTATGTCGGAGGAAACCATAGT

Coding sequence (CDS)

ATGGCGGCGATGGAGAAGCTATTCGTGCAGATCTTTGAGAGGAAGAAGTGGATCATTGACCAGGCCAAGCACCAGATCGATCTCTTCGACCAGCAACTTGCATCCAAGCTCATTATCGATGGAATTGTTCCTCCGCCTTGGCTTCACTCGCCTTTTCTTCATTCCAACATTTCGTATTTTGAAGGTGTAGGAGTGAGCAGGAATTTTGTTCCTGGAGTTGAGGTCCCACGGTCGCCGCTTCAGACCCATTGTTCTAGTTTGAATGAGGCATTTGTTGCAAACAGTGGGGAGGAGTTGCAGCAAAGGTCGAATGAAGATGCTGGTTCTTTAAACGATGATTTTGATGCAGGAAATAGGCCTGCAGTTTTACCTCAGTGCAATGTAAGTGACGCCCGTGTCTTTAATTGCGCACCTCGTGTTGACACAAGTCCTGTTTCTCCTCAAGGTCGAGGAGGCGGAGTTTTAGAAAATTACCAAGATCCTACTCTGTCACGGGCACGGTTACATAGATCTAAATCTAGGCAAAGGGCTTTAGAGTTGCGTAATAGCGCGAAATCTGCTAGGTGCCACTCCCGATATGAGAACAAGAATGATTCCGTTGCTGATGGGATTGTGGGATCTGCTATTAGTTTGCTGCAGGCTGATCACGAAGATGAATCAGAGTTGGCAAAGCCTTCTAGCAGCTGTAAGGGTATTGGTTCTATGGAAGAGGAGACTAATGTTTGTTGCGAGCAGAAGAATATCTCTACTTGCGCTGATAAATCTAGGCAAAGGGCTTTAGAGTTGCGTAATAGTGTGAAATCTTCTAGGTGCCACTCCCGGTATGAGAACAAGAATGATTCCGTTGCTGATGGGATTGTGGGATCTGCTATTAGTTTGCTGCAAGCTGATCACGAAGATGAATCAGAGTTGGCAAAGCCTTCTAGCAGCTGTAAGGGAATTGGTTCTATGGAAGAGGAGACTGATGTTTGTTGCGAGCAGAAGAATATCTCTATTTGCTCTGATAAATCTAGGCAAAGGGCTTTAGAGTTGCGTAATAGTGTGAAATCTTCTAGGTGCCACTCCCGGTATGAGAACAAGAATGATTCCGTTGCTGATGGGATTGTGGGATCTGCTATTAGTTTGCTGCAAGCTGATCACGAAGATGAATCAGAGTTGGCAAAGCCTTCTAGCAGCTGTAAGGGAATTGGTTCTATGGAAGAGGAGACTGATGTTTGTTGCGAGCAGAAGAATATCTCTATTTGCTCTGATAAATCTAGGCAAAGGGCTTTAGAGTTGCGTAATAGTGTGAAATCTTCTAGGTGCCACTCCCGGTATGAGAACAAGAATGATTCCGTTGCTGATGGGATTGTGGGATCTGCTATTAGTTTGCTGCAAGCTGATCACGAAGATGAATCAGAGTTGGCAAAGCCTTCTAGCAGCTGTAAGGGAATTGGTTCAGTGGAAGAGGAAACTAATGTTTGTTGCGAGCAGAAGAAGATCTCTATTTGCTCTGGTAAAGTTACAATAGTTGGAAGCCCTGGGTTGCAAAGTAGCTCTATTGATGTGGTTAATTCTTTAAATATTTACTTAGAAAATGAAGGGTTATGTGTAGCGGAAGGTTCAATGCAGAATTCTTATAAAGTGAATGAGCAATTTGACTCGCCTAGAACTTCTTCGGGAAAGATTGGATACTGTGAAGAAGGGCCGGCATGTTGCAGGAGTCAGGAATCTAATTTTGATAATGCTGAACTGTCTAGGTTGCAATGTAGCTCTTTGGATGTGGATAAATCTTCACGCATTCCCCCTGAAGATGGAAGAGAGTGTCCTATAGGAGGATCAAAATTGCATTCTGATCAAGTGGATGAGCAACTGGACTTGCCTAAACCTTCTTCTGACAATGTTGAGTGCTGTGAAGAGGCAAAATTAGTAGACTGCAGGAGCCAGGAATGTAATCTTGACAATGCTCTACAGTCCAAGTCACAACGAAGTTCTCTGGATGTGGACGATTCAGCATGCAT
TGACGCCAATGATGGAAGATTATTGGACTCGCCTAACCCTTCTTCTAGCAATGTCAAATGCTGTGAAGAAACTGTTGTAGGACATTGTAGGAGCCAGGAATGCAATTTTGATAATGCCCGAGAGGCTGGGTCGCTATACAACTCCCAGGATGTAGATAAGTCTTCATACGTCCACTCTGAGGACGGACAATCATGTCCTAATGGAAGTTCAGAAGTGCATTCTGATGAAGTGAAAGAGCAATTGGACTTATCTAAATCTTCTTCCGACAATATGGAGTGTTGTGAAGAAGAAATATTAGGAGATTTCAGGAGTCAGGAGTATAATTTTAATAATGCTCAAAAGTCAGAGATGCAACATGACACCCTGGATGCGGATAATTCATCATGCTTTTCTTCTGAAAATGGAACTTGTTCTGTTGGAAGTTCAAAACTACATTCCGATCGAGTAAGTGAGCCGTCGGAGTTGTTTAGGCCTTCTTCTGCCAATGTTGAATGCCATGAAGTAGGACTAGGAGACTGTAGGACCCAAGATTGTAATTTTGATAATAATGCAGAAAAGTCTGGTTTAGACAAAATTTCCAGTTCACCAATAACGGAAGTAAGGGAGAAAACATCAGATAAGAAGCCCTCCACTTCCGTGGATAACAAGAGGGATGTTAATGAAAAAGAAAAATGCAATTCACCCCTTCACATGCCTATGCCGCAGATTCAGGTCGACTCAGTGAACGAAGACAAACATCATAAAGGTATATGTGAATCTCAAAGTGAAAAGAGATATGATAAAGAAGTAGCTACTTGTTCTTTGCTGCAAAGTGATGAACCTGTAGAACAAAATATTTCTTTGAAAGATGGAGTGCCGAATTTGCAGTATTCCCATGAAAATGCAGTTGAAATTCAACTAGTGGATACAGACGATGCATCAATTCTGATAAGAGATACAGAAACGTTTAGAGATCAAATGGTCATGGCTCCTTGTGTTCCTTCCGCTGGTGAGGGGGATAGTAATTTGGAGCAGAAACAAAAAAGTTCAGGCATAACTCAGTGTGAAGATTCAGATTCCTTTGAGGGTTGCACTGATCACATGGTCATGGCTCCTTGTGTTCCTTCTGCTGGTGAGGGGGATAGTAATTTGGAGCAGCCACTGAAAAGTTCAGGCATAACTCAGTGTGAAGATTCAGATTCGTTTGAGGGCTTCACTGAGCACTTGAATGGTAACCATCATTACGTATCAACAGAGTGCCAGACTGCAGAGACATCAATAAAGTCAAAAACTTTCAGCTCAGTTTTGAGGGCATCTAGTTCTGACGAAAAGGAGATAGAGGTTGAGCTGCAATTGGACAATGGTATTCCAGCGTCTTTAGGCTTGAGGAGTGAGCAACTTCAAATCAACAGGAGTCCTATAGATAAAAACTTGATGCAGGAATTTGACACTGAAAAACCTGTCCTTGAACTTCAACGATTATCATTTTGTGAAGAAGGATACCAACAACCAAATGTGAGCATCGGCCCTATTGAAATGTTGCTATTGGAAAAAGAAGCTCGCTTGATTCAGAGGTCTGATTCTTCACCCACGCTTCCAGTCAAAGAGGATCTCTCTCGGTTCGGAAGCAATAACAGAGGTACACCATTGCAAAATGGTATGCTAGAGAGCCAAAGTTTGGTTCCCGAAGAAAATTTTCAGTGTGGAGATATTGAACTTCCTATGGATACTGGGAAAACTGATGGAATGGAGGAAAAGGGGAAACTTACTTTGTGCTCGCTTCATACTCCACTTACCCAAACTTCTCATTATCTTGGTGCAGACAAGGATATGCCTGCTTTAGAGGGGTTCCTAATGCGATCTGATGACGAAGAGCCATGCATTTCTGTTGGTGGAATCAACTTTGACAAATTAGATCTTTCAAAATGTATGATAGAACGTGCTAGCATCTTGGAGAAAATTTGTAAATCTGCTTGTATAAACAGTACATTATCCTCACCTTCAGAAAGTTTTAGGCTGA
ACAAGGTGACAGATTTGTACAATTCTCTTCCTAATGGTCTACTAGAGTGCATGGACTTGAAGAATAACCTTCTGATGAATGATCAAAATAAGCTACTGAAGGATGGTAGTAACTCTTTGAATGGAGAAGTCAACTTCTCTCCTCATGGGTCTTCTTTTGATTGCCTGCAAAGCTTTAACAGTCATTCAGCTGGTGATCTCAGGAAGCCATTTGCATCTCCATTTGGTAAGTTGTTGGATAGAAATTCATTAAATTTGTCAAGTTCTGGAAAACGAAGTGGCCAGAACATAGAGCTTCCTTGCATTAGTGAGGAAGCTGAGAATACCGATGAGATTGATAACGAATTTTCGAAGGATATGAGATCGAGCAAGCGAGCACCACTTGTTGACATTACAGAAGATGCAAATGTTGAGGTAACAGTTTCTGAAGCTGCGGCGGTTGCTGATAGATTGAGTTTAGAATCTTTAAACATAGAACTCAGCAACACAAGGACTCATATTGGGACCAAAGAGAATCTGGGAAACCAGAAAAGCAGCAAGAGGAAATATGTGAATGAGGCTGTGAGTCGTGATACCTTGCCAGGAGAAAACGGTGCTAAAAGAGTCACTAGATCATCCTATAATATATTTAGCCGGTCAGATTTATCCTGTAAAAAAGATTTCAGAAAGGAAGGTCCTCGATTCTCTGAAAAGGAATCCAAGCATAGAAATATCGTGTCCAATATAACTTCTTTTATTCCTCTTGTCCAACAAAGAGAAGCTGCAACTATTTTGAAAGGGAAGAGAGATATTAAGGTGAAGGCCATCGAAGCTGCTGAGGCTGCAAAACGCCTTGCAGAAAAGAAAGAAAATGAACGTCAAATGAAGAAAGAAGCCCTGAAACTTGAAAGAGCAAGAATGGAACAAGAGAATTTGAGGCAGATTGAACTGGATAAAAAGAAGAAAGAAGAAGAGCGGAAGAAGAAGGAGGAAGAAAGGAAGAAAAAGGAGGTTGATATGGCAGCAAAGAAAAGACAGAGGGAAGAAGAAGAGAGGAAGGAGAAAGAAAGAAAAAGAATGCGTGTTGAAGAAGTTAGGAGACGATTACGAGAGCATGGTGGGAAGTTACGATCTGATAAAGAGAATAAGGAAGCAAAACCCCAAGCCAATGACCAAAAACCACGTGACAGAAAGGGATGTAAGGATGGGACTGTCAAACTGGTCAAGGAAAGTGGCCATGACAGCTTTCACAAACTCTCAGTTACCGAGTCTAAGACTACTTCTACAAGCGATGCTGTGAGGGGAAGCTTTGTTGTGGAGGACTCACAACCAACGAGTGTTGATTTTCTAGAGGCAGAGGCACTTGAAAATGTGATGGAACATAGAATCTCCGAAACAAGTGAAGAACAATCATATCAGATTTCTCCTTACAAAGCTTCTGATGATGAAGATGAAGAGGATGACGATGACGGCATACAAAATAATAAATTTGTTCCTTCATGGGCCAGTAAGGATCGCTTAGCTGTTCTTTTTGCTTCCCAGAAAAAATTGGATCCAGAAATTATCTTTCCACCGAAAAGTTTTTGTGACATAGCTGAAGTTCTCTTGCCTCGACAACATCAGTCTAAATAG

Protein sequence

MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISYFEGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRSNEDAGSLNDDFDAGNRPAVLPQCNVSDARVFNCAPRVDTSPVSPQGRGGGVLENYQDPTLSRARLHRSKSRQRALELRNSAKSARCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETNVCCEQKNISTCADKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCCEQKKISICSGKVTIVGSPGLQSSSIDVVNSLNIYLENEGLCVAEGSMQNSYKVNEQFDSPRTSSGKIGYCEEGPACCRSQESNFDNAELSRLQCSSLDVDKSSRIPPEDGRECPIGGSKLHSDQVDEQLDLPKPSSDNVECCEEAKLVDCRSQECNLDNALQSKSQRSSLDVDDSACIDANDGRLLDSPNPSSSNVKCCEETVVGHCRSQECNFDNAREAGSLYNSQDVDKSSYVHSEDGQSCPNGSSEVHSDEVKEQLDLSKSSSDNMECCEEEILGDFRSQEYNFNNAQKSEMQHDTLDADNSSCFSSENGTCSVGSSKLHSDRVSEPSELFRPSSANVECHEVGLGDCRTQDCNFDNNAEKSGLDKISSSPITEVREKTSDKKPSTSVDNKRDVNEKEKCNSPLHMPMPQIQVDSVNEDKHHKGICESQSEKRYDKEVATCSLLQSDEPVEQNISLKDGVPNLQYSHENAVEIQLVDTDDASILIRDTETFRDQMVMAPCVPSAGEGDSNLEQKQKSSGITQCEDSDSFEGCTDHMVMAPCVPSAGEGDSNLEQPLKSSGITQCEDSDSFEGFTEHLNGNHHYVSTECQTAETSIKSKTFSSVLRASSSDEKEIEVELQLDNGIPASLGLRSEQLQINRSPIDKNLMQEFDTEKPVLELQRLSFCEEGYQQPNVSIGPIEMLLLEKEARLIQRSDSSPTLPVKEDLSRFGSNNRGTPLQNGMLESQSLVPEENFQCGDIELPMDTGKTDGMEEKGKLTLCSLHTPLTQTSHYLGADKDMPALEGFLMRSDDEEPCISVGGINFDKLDLSKCMIERASILEKICKSACINSTLSSPSESFRLNKVTDLYNSLPNGLLECMDLKNNLLMNDQNKLLKDGSNSLNGEVNFSPHGSSFDCLQSFNSHSAGDLRKPFASPFGKLLDRNSLNLSSSGKRSGQNIELPCISEEAENTDEIDNEFSKDMRSSKRAPLVDITEDANVEVTVSEAAAVADRLSLESLNIELSNTRTHIGTKENLGNQKSSKRKYVNEAVSRDTLPGENGAKRVTRSSYNIFSRSDLSCKKDFRKEGPRFSEKESKHRNIVSNITSFIPLVQQREAATILKGKRDIKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQIELDKKKKEEERKKKEEERKKKEVDMAAKKRQREEEERKEKERKRMRVEEVRRRLREHGGKLRSDKENKEAKPQANDQKPRDRKGCKDGTVKLVKESGHDSFHKLSVTESKTTSTSDAVRGSFVVEDSQPTSVDFLEAEALENVMEHRISETSEEQSYQISPYKASDDEDEEDDDDGIQNNKFVPSWASKDRLAVLFASQKKLDPEIIFPPKSFCDIAEVLLPRQHQSK
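
The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The sketch below illustrates that relationship on the first and last few codons of the CDS. It assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and are not part of the database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code (NCBI translation table 1),
# listed with the first, second and third codon base each cycling through T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        residues.append(aa)
    return "".join(residues)


# First ten codons of the CDS shown above.
print(translate("ATGGCGGCGATGGAGAAGCTATTCGTGCAG"))  # MAAMEKLFVQ
# Last four codons of the CDS, including the TAG stop codon.
print(translate("CAGTCTAAATAG"))  # QSK
```

Both fragments match the start (MAAMEKLFVQ) and end (QSK) of the protein sequence listed above.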
Homology
BLAST of Cp4.1LG06g08410 vs. NCBI nr
Match: XP_023535899.1 (uncharacterized protein LOC111797188 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 3603 bits (9342), Expect = 0.0
Identity = 1869/1869 (100.00%), Positives = 1869/1869 (100.00%), Query Frame = 0

Query: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISYF 60
            MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISYF
Sbjct: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISYF 60

Query: 61   EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRSNEDAGSLNDDFDAGNRP 120
            EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRSNEDAGSLNDDFDAGNRP
Sbjct: 61   EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRSNEDAGSLNDDFDAGNRP 120

Query: 121  AVLPQCNVSDARVFNCAPRVDTSPVSPQGRGGGVLENYQDPTLSRARLHRSKSRQRALEL 180
            AVLPQCNVSDARVFNCAPRVDTSPVSPQGRGGGVLENYQDPTLSRARLHRSKSRQRALEL
Sbjct: 121  AVLPQCNVSDARVFNCAPRVDTSPVSPQGRGGGVLENYQDPTLSRARLHRSKSRQRALEL 180

Query: 181  RNSAKSARCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETN 240
            RNSAKSARCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETN
Sbjct: 181  RNSAKSARCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETN 240

Query: 241  VCCEQKNISTCADKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHED 300
            VCCEQKNISTCADKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHED
Sbjct: 241  VCCEQKNISTCADKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHED 300

Query: 301  ESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSRQRALELRNSVKSSRCHSRYENK 360
            ESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSRQRALELRNSVKSSRCHSRYENK
Sbjct: 301  ESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSRQRALELRNSVKSSRCHSRYENK 360

Query: 361  NDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSR 420
            NDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSR
Sbjct: 361  NDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSR 420

Query: 421  QRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGS 480
            QRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGS
Sbjct: 421  QRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGS 480

Query: 481  VEEETNVCCEQKKISICSGKVTIVGSPGLQSSSIDVVNSLNIYLENEGLCVAEGSMQNSY 540
            VEEETNVCCEQKKISICSGKVTIVGSPGLQSSSIDVVNSLNIYLENEGLCVAEGSMQNSY
Sbjct: 481  VEEETNVCCEQKKISICSGKVTIVGSPGLQSSSIDVVNSLNIYLENEGLCVAEGSMQNSY 540

Query: 541  KVNEQFDSPRTSSGKIGYCEEGPACCRSQESNFDNAELSRLQCSSLDVDKSSRIPPEDGR 600
            KVNEQFDSPRTSSGKIGYCEEGPACCRSQESNFDNAELSRLQCSSLDVDKSSRIPPEDGR
Sbjct: 541  KVNEQFDSPRTSSGKIGYCEEGPACCRSQESNFDNAELSRLQCSSLDVDKSSRIPPEDGR 600

Query: 601  ECPIGGSKLHSDQVDEQLDLPKPSSDNVECCEEAKLVDCRSQECNLDNALQSKSQRSSLD 660
            ECPIGGSKLHSDQVDEQLDLPKPSSDNVECCEEAKLVDCRSQECNLDNALQSKSQRSSLD
Sbjct: 601  ECPIGGSKLHSDQVDEQLDLPKPSSDNVECCEEAKLVDCRSQECNLDNALQSKSQRSSLD 660

Query: 661  VDDSACIDANDGRLLDSPNPSSSNVKCCEETVVGHCRSQECNFDNAREAGSLYNSQDVDK 720
            VDDSACIDANDGRLLDSPNPSSSNVKCCEETVVGHCRSQECNFDNAREAGSLYNSQDVDK
Sbjct: 661  VDDSACIDANDGRLLDSPNPSSSNVKCCEETVVGHCRSQECNFDNAREAGSLYNSQDVDK 720

Query: 721  SSYVHSEDGQSCPNGSSEVHSDEVKEQLDLSKSSSDNMECCEEEILGDFRSQEYNFNNAQ 780
            SSYVHSEDGQSCPNGSSEVHSDEVKEQLDLSKSSSDNMECCEEEILGDFRSQEYNFNNAQ
Sbjct: 721  SSYVHSEDGQSCPNGSSEVHSDEVKEQLDLSKSSSDNMECCEEEILGDFRSQEYNFNNAQ 780

Query: 781  KSEMQHDTLDADNSSCFSSENGTCSVGSSKLHSDRVSEPSELFRPSSANVECHEVGLGDC 840
            KSEMQHDTLDADNSSCFSSENGTCSVGSSKLHSDRVSEPSELFRPSSANVECHEVGLGDC
Sbjct: 781  KSEMQHDTLDADNSSCFSSENGTCSVGSSKLHSDRVSEPSELFRPSSANVECHEVGLGDC 840

Query: 841  RTQDCNFDNNAEKSGLDKISSSPITEVREKTSDKKPSTSVDNKRDVNEKEKCNSPLHMPM 900
            RTQDCNFDNNAEKSGLDKISSSPITEVREKTSDKKPSTSVDNKRDVNEKEKCNSPLHMPM
Sbjct: 841  RTQDCNFDNNAEKSGLDKISSSPITEVREKTSDKKPSTSVDNKRDVNEKEKCNSPLHMPM 900

Query: 901  PQIQVDSVNEDKHHKGICESQSEKRYDKEVATCSLLQSDEPVEQNISLKDGVPNLQYSHE 960
            PQIQVDSVNEDKHHKGICESQSEKRYDKEVATCSLLQSDEPVEQNISLKDGVPNLQYSHE
Sbjct: 901  PQIQVDSVNEDKHHKGICESQSEKRYDKEVATCSLLQSDEPVEQNISLKDGVPNLQYSHE 960

Query: 961  NAVEIQLVDTDDASILIRDTETFRDQMVMAPCVPSAGEGDSNLEQKQKSSGITQCEDSDS 1020
            NAVEIQLVDTDDASILIRDTETFRDQMVMAPCVPSAGEGDSNLEQKQKSSGITQCEDSDS
Sbjct: 961  NAVEIQLVDTDDASILIRDTETFRDQMVMAPCVPSAGEGDSNLEQKQKSSGITQCEDSDS 1020

Query: 1021 FEGCTDHMVMAPCVPSAGEGDSNLEQPLKSSGITQCEDSDSFEGFTEHLNGNHHYVSTEC 1080
            FEGCTDHMVMAPCVPSAGEGDSNLEQPLKSSGITQCEDSDSFEGFTEHLNGNHHYVSTEC
Sbjct: 1021 FEGCTDHMVMAPCVPSAGEGDSNLEQPLKSSGITQCEDSDSFEGFTEHLNGNHHYVSTEC 1080

Query: 1081 QTAETSIKSKTFSSVLRASSSDEKEIEVELQLDNGIPASLGLRSEQLQINRSPIDKNLMQ 1140
            QTAETSIKSKTFSSVLRASSSDEKEIEVELQLDNGIPASLGLRSEQLQINRSPIDKNLMQ
Sbjct: 1081 QTAETSIKSKTFSSVLRASSSDEKEIEVELQLDNGIPASLGLRSEQLQINRSPIDKNLMQ 1140

Query: 1141 EFDTEKPVLELQRLSFCEEGYQQPNVSIGPIEMLLLEKEARLIQRSDSSPTLPVKEDLSR 1200
            EFDTEKPVLELQRLSFCEEGYQQPNVSIGPIEMLLLEKEARLIQRSDSSPTLPVKEDLSR
Sbjct: 1141 EFDTEKPVLELQRLSFCEEGYQQPNVSIGPIEMLLLEKEARLIQRSDSSPTLPVKEDLSR 1200

Query: 1201 FGSNNRGTPLQNGMLESQSLVPEENFQCGDIELPMDTGKTDGMEEKGKLTLCSLHTPLTQ 1260
            FGSNNRGTPLQNGMLESQSLVPEENFQCGDIELPMDTGKTDGMEEKGKLTLCSLHTPLTQ
Sbjct: 1201 FGSNNRGTPLQNGMLESQSLVPEENFQCGDIELPMDTGKTDGMEEKGKLTLCSLHTPLTQ 1260

Query: 1261 TSHYLGADKDMPALEGFLMRSDDEEPCISVGGINFDKLDLSKCMIERASILEKICKSACI 1320
            TSHYLGADKDMPALEGFLMRSDDEEPCISVGGINFDKLDLSKCMIERASILEKICKSACI
Sbjct: 1261 TSHYLGADKDMPALEGFLMRSDDEEPCISVGGINFDKLDLSKCMIERASILEKICKSACI 1320

Query: 1321 NSTLSSPSESFRLNKVTDLYNSLPNGLLECMDLKNNLLMNDQNKLLKDGSNSLNGEVNFS 1380
            NSTLSSPSESFRLNKVTDLYNSLPNGLLECMDLKNNLLMNDQNKLLKDGSNSLNGEVNFS
Sbjct: 1321 NSTLSSPSESFRLNKVTDLYNSLPNGLLECMDLKNNLLMNDQNKLLKDGSNSLNGEVNFS 1380

Query: 1381 PHGSSFDCLQSFNSHSAGDLRKPFASPFGKLLDRNSLNLSSSGKRSGQNIELPCISEEAE 1440
            PHGSSFDCLQSFNSHSAGDLRKPFASPFGKLLDRNSLNLSSSGKRSGQNIELPCISEEAE
Sbjct: 1381 PHGSSFDCLQSFNSHSAGDLRKPFASPFGKLLDRNSLNLSSSGKRSGQNIELPCISEEAE 1440

Query: 1441 NTDEIDNEFSKDMRSSKRAPLVDITEDANVEVTVSEAAAVADRLSLESLNIELSNTRTHI 1500
            NTDEIDNEFSKDMRSSKRAPLVDITEDANVEVTVSEAAAVADRLSLESLNIELSNTRTHI
Sbjct: 1441 NTDEIDNEFSKDMRSSKRAPLVDITEDANVEVTVSEAAAVADRLSLESLNIELSNTRTHI 1500

Query: 1501 GTKENLGNQKSSKRKYVNEAVSRDTLPGENGAKRVTRSSYNIFSRSDLSCKKDFRKEGPR 1560
            GTKENLGNQKSSKRKYVNEAVSRDTLPGENGAKRVTRSSYNIFSRSDLSCKKDFRKEGPR
Sbjct: 1501 GTKENLGNQKSSKRKYVNEAVSRDTLPGENGAKRVTRSSYNIFSRSDLSCKKDFRKEGPR 1560

Query: 1561 FSEKESKHRNIVSNITSFIPLVQQREAATILKGKRDIKVKAIEAAEAAKRLAEKKENERQ 1620
            FSEKESKHRNIVSNITSFIPLVQQREAATILKGKRDIKVKAIEAAEAAKRLAEKKENERQ
Sbjct: 1561 FSEKESKHRNIVSNITSFIPLVQQREAATILKGKRDIKVKAIEAAEAAKRLAEKKENERQ 1620

Query: 1621 MKKEALKLERARMEQENLRQIELDKKKKEEERKKKEEERKKKEVDMAAKKRQREEEERKE 1680
            MKKEALKLERARMEQENLRQIELDKKKKEEERKKKEEERKKKEVDMAAKKRQREEEERKE
Sbjct: 1621 MKKEALKLERARMEQENLRQIELDKKKKEEERKKKEEERKKKEVDMAAKKRQREEEERKE 1680

Query: 1681 KERKRMRVEEVRRRLREHGGKLRSDKENKEAKPQANDQKPRDRKGCKDGTVKLVKESGHD 1740
            KERKRMRVEEVRRRLREHGGKLRSDKENKEAKPQANDQKPRDRKGCKDGTVKLVKESGHD
Sbjct: 1681 KERKRMRVEEVRRRLREHGGKLRSDKENKEAKPQANDQKPRDRKGCKDGTVKLVKESGHD 1740

Query: 1741 SFHKLSVTESKTTSTSDAVRGSFVVEDSQPTSVDFLEAEALENVMEHRISETSEEQSYQI 1800
            SFHKLSVTESKTTSTSDAVRGSFVVEDSQPTSVDFLEAEALENVMEHRISETSEEQSYQI
Sbjct: 1741 SFHKLSVTESKTTSTSDAVRGSFVVEDSQPTSVDFLEAEALENVMEHRISETSEEQSYQI 1800

Query: 1801 SPYKASDDEDEEDDDDGIQNNKFVPSWASKDRLAVLFASQKKLDPEIIFPPKSFCDIAEV 1860
            SPYKASDDEDEEDDDDGIQNNKFVPSWASKDRLAVLFASQKKLDPEIIFPPKSFCDIAEV
Sbjct: 1801 SPYKASDDEDEEDDDDGIQNNKFVPSWASKDRLAVLFASQKKLDPEIIFPPKSFCDIAEV 1860

Query: 1861 LLPRQHQSK 1869
            LLPRQHQSK
Sbjct: 1861 LLPRQHQSK 1869

BLAST of Cp4.1LG06g08410 vs. NCBI nr
Match: XP_023535901.1 (uncharacterized protein LOC111797188 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 3594 bits (9319), Expect = 0.0
Identity = 1867/1869 (99.89%), Positives = 1867/1869 (99.89%), Query Frame = 0

Query: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISYF 60
            MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISYF
Sbjct: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISYF 60

Query: 61   EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRSNEDAGSLNDDFDAGNRP 120
            EGV  SRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRSNEDAGSLNDDFDAGNRP
Sbjct: 61   EGV--SRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRSNEDAGSLNDDFDAGNRP 120

Query: 121  AVLPQCNVSDARVFNCAPRVDTSPVSPQGRGGGVLENYQDPTLSRARLHRSKSRQRALEL 180
            AVLPQCNVSDARVFNCAPRVDTSPVSPQGRGGGVLENYQDPTLSRARLHRSKSRQRALEL
Sbjct: 121  AVLPQCNVSDARVFNCAPRVDTSPVSPQGRGGGVLENYQDPTLSRARLHRSKSRQRALEL 180

Query: 181  RNSAKSARCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETN 240
            RNSAKSARCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETN
Sbjct: 181  RNSAKSARCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETN 240

Query: 241  VCCEQKNISTCADKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHED 300
            VCCEQKNISTCADKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHED
Sbjct: 241  VCCEQKNISTCADKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHED 300

Query: 301  ESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSRQRALELRNSVKSSRCHSRYENK 360
            ESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSRQRALELRNSVKSSRCHSRYENK
Sbjct: 301  ESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSRQRALELRNSVKSSRCHSRYENK 360

Query: 361  NDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSR 420
            NDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSR
Sbjct: 361  NDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSR 420

Query: 421  QRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGS 480
            QRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGS
Sbjct: 421  QRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGS 480

Query: 481  VEEETNVCCEQKKISICSGKVTIVGSPGLQSSSIDVVNSLNIYLENEGLCVAEGSMQNSY 540
            VEEETNVCCEQKKISICSGKVTIVGSPGLQSSSIDVVNSLNIYLENEGLCVAEGSMQNSY
Sbjct: 481  VEEETNVCCEQKKISICSGKVTIVGSPGLQSSSIDVVNSLNIYLENEGLCVAEGSMQNSY 540

Query: 541  KVNEQFDSPRTSSGKIGYCEEGPACCRSQESNFDNAELSRLQCSSLDVDKSSRIPPEDGR 600
            KVNEQFDSPRTSSGKIGYCEEGPACCRSQESNFDNAELSRLQCSSLDVDKSSRIPPEDGR
Sbjct: 541  KVNEQFDSPRTSSGKIGYCEEGPACCRSQESNFDNAELSRLQCSSLDVDKSSRIPPEDGR 600

Query: 601  ECPIGGSKLHSDQVDEQLDLPKPSSDNVECCEEAKLVDCRSQECNLDNALQSKSQRSSLD 660
            ECPIGGSKLHSDQVDEQLDLPKPSSDNVECCEEAKLVDCRSQECNLDNALQSKSQRSSLD
Sbjct: 601  ECPIGGSKLHSDQVDEQLDLPKPSSDNVECCEEAKLVDCRSQECNLDNALQSKSQRSSLD 660

Query: 661  VDDSACIDANDGRLLDSPNPSSSNVKCCEETVVGHCRSQECNFDNAREAGSLYNSQDVDK 720
            VDDSACIDANDGRLLDSPNPSSSNVKCCEETVVGHCRSQECNFDNAREAGSLYNSQDVDK
Sbjct: 661  VDDSACIDANDGRLLDSPNPSSSNVKCCEETVVGHCRSQECNFDNAREAGSLYNSQDVDK 720

Query: 721  SSYVHSEDGQSCPNGSSEVHSDEVKEQLDLSKSSSDNMECCEEEILGDFRSQEYNFNNAQ 780
            SSYVHSEDGQSCPNGSSEVHSDEVKEQLDLSKSSSDNMECCEEEILGDFRSQEYNFNNAQ
Sbjct: 721  SSYVHSEDGQSCPNGSSEVHSDEVKEQLDLSKSSSDNMECCEEEILGDFRSQEYNFNNAQ 780

Query: 781  KSEMQHDTLDADNSSCFSSENGTCSVGSSKLHSDRVSEPSELFRPSSANVECHEVGLGDC 840
            KSEMQHDTLDADNSSCFSSENGTCSVGSSKLHSDRVSEPSELFRPSSANVECHEVGLGDC
Sbjct: 781  KSEMQHDTLDADNSSCFSSENGTCSVGSSKLHSDRVSEPSELFRPSSANVECHEVGLGDC 840

Query: 841  RTQDCNFDNNAEKSGLDKISSSPITEVREKTSDKKPSTSVDNKRDVNEKEKCNSPLHMPM 900
            RTQDCNFDNNAEKSGLDKISSSPITEVREKTSDKKPSTSVDNKRDVNEKEKCNSPLHMPM
Sbjct: 841  RTQDCNFDNNAEKSGLDKISSSPITEVREKTSDKKPSTSVDNKRDVNEKEKCNSPLHMPM 900

Query: 901  PQIQVDSVNEDKHHKGICESQSEKRYDKEVATCSLLQSDEPVEQNISLKDGVPNLQYSHE 960
            PQIQVDSVNEDKHHKGICESQSEKRYDKEVATCSLLQSDEPVEQNISLKDGVPNLQYSHE
Sbjct: 901  PQIQVDSVNEDKHHKGICESQSEKRYDKEVATCSLLQSDEPVEQNISLKDGVPNLQYSHE 960

Query: 961  NAVEIQLVDTDDASILIRDTETFRDQMVMAPCVPSAGEGDSNLEQKQKSSGITQCEDSDS 1020
            NAVEIQLVDTDDASILIRDTETFRDQMVMAPCVPSAGEGDSNLEQKQKSSGITQCEDSDS
Sbjct: 961  NAVEIQLVDTDDASILIRDTETFRDQMVMAPCVPSAGEGDSNLEQKQKSSGITQCEDSDS 1020

Query: 1021 FEGCTDHMVMAPCVPSAGEGDSNLEQPLKSSGITQCEDSDSFEGFTEHLNGNHHYVSTEC 1080
            FEGCTDHMVMAPCVPSAGEGDSNLEQPLKSSGITQCEDSDSFEGFTEHLNGNHHYVSTEC
Sbjct: 1021 FEGCTDHMVMAPCVPSAGEGDSNLEQPLKSSGITQCEDSDSFEGFTEHLNGNHHYVSTEC 1080

Query: 1081 QTAETSIKSKTFSSVLRASSSDEKEIEVELQLDNGIPASLGLRSEQLQINRSPIDKNLMQ 1140
            QTAETSIKSKTFSSVLRASSSDEKEIEVELQLDNGIPASLGLRSEQLQINRSPIDKNLMQ
Sbjct: 1081 QTAETSIKSKTFSSVLRASSSDEKEIEVELQLDNGIPASLGLRSEQLQINRSPIDKNLMQ 1140

Query: 1141 EFDTEKPVLELQRLSFCEEGYQQPNVSIGPIEMLLLEKEARLIQRSDSSPTLPVKEDLSR 1200
            EFDTEKPVLELQRLSFCEEGYQQPNVSIGPIEMLLLEKEARLIQRSDSSPTLPVKEDLSR
Sbjct: 1141 EFDTEKPVLELQRLSFCEEGYQQPNVSIGPIEMLLLEKEARLIQRSDSSPTLPVKEDLSR 1200

Query: 1201 FGSNNRGTPLQNGMLESQSLVPEENFQCGDIELPMDTGKTDGMEEKGKLTLCSLHTPLTQ 1260
            FGSNNRGTPLQNGMLESQSLVPEENFQCGDIELPMDTGKTDGMEEKGKLTLCSLHTPLTQ
Sbjct: 1201 FGSNNRGTPLQNGMLESQSLVPEENFQCGDIELPMDTGKTDGMEEKGKLTLCSLHTPLTQ 1260

Query: 1261 TSHYLGADKDMPALEGFLMRSDDEEPCISVGGINFDKLDLSKCMIERASILEKICKSACI 1320
            TSHYLGADKDMPALEGFLMRSDDEEPCISVGGINFDKLDLSKCMIERASILEKICKSACI
Sbjct: 1261 TSHYLGADKDMPALEGFLMRSDDEEPCISVGGINFDKLDLSKCMIERASILEKICKSACI 1320

Query: 1321 NSTLSSPSESFRLNKVTDLYNSLPNGLLECMDLKNNLLMNDQNKLLKDGSNSLNGEVNFS 1380
            NSTLSSPSESFRLNKVTDLYNSLPNGLLECMDLKNNLLMNDQNKLLKDGSNSLNGEVNFS
Sbjct: 1321 NSTLSSPSESFRLNKVTDLYNSLPNGLLECMDLKNNLLMNDQNKLLKDGSNSLNGEVNFS 1380

Query: 1381 PHGSSFDCLQSFNSHSAGDLRKPFASPFGKLLDRNSLNLSSSGKRSGQNIELPCISEEAE 1440
            PHGSSFDCLQSFNSHSAGDLRKPFASPFGKLLDRNSLNLSSSGKRSGQNIELPCISEEAE
Sbjct: 1381 PHGSSFDCLQSFNSHSAGDLRKPFASPFGKLLDRNSLNLSSSGKRSGQNIELPCISEEAE 1440

Query: 1441 NTDEIDNEFSKDMRSSKRAPLVDITEDANVEVTVSEAAAVADRLSLESLNIELSNTRTHI 1500
            NTDEIDNEFSKDMRSSKRAPLVDITEDANVEVTVSEAAAVADRLSLESLNIELSNTRTHI
Sbjct: 1441 NTDEIDNEFSKDMRSSKRAPLVDITEDANVEVTVSEAAAVADRLSLESLNIELSNTRTHI 1500

Query: 1501 GTKENLGNQKSSKRKYVNEAVSRDTLPGENGAKRVTRSSYNIFSRSDLSCKKDFRKEGPR 1560
            GTKENLGNQKSSKRKYVNEAVSRDTLPGENGAKRVTRSSYNIFSRSDLSCKKDFRKEGPR
Sbjct: 1501 GTKENLGNQKSSKRKYVNEAVSRDTLPGENGAKRVTRSSYNIFSRSDLSCKKDFRKEGPR 1560

Query: 1561 FSEKESKHRNIVSNITSFIPLVQQREAATILKGKRDIKVKAIEAAEAAKRLAEKKENERQ 1620
            FSEKESKHRNIVSNITSFIPLVQQREAATILKGKRDIKVKAIEAAEAAKRLAEKKENERQ
Sbjct: 1561 FSEKESKHRNIVSNITSFIPLVQQREAATILKGKRDIKVKAIEAAEAAKRLAEKKENERQ 1620

Query: 1621 MKKEALKLERARMEQENLRQIELDKKKKEEERKKKEEERKKKEVDMAAKKRQREEEERKE 1680
            MKKEALKLERARMEQENLRQIELDKKKKEEERKKKEEERKKKEVDMAAKKRQREEEERKE
Sbjct: 1621 MKKEALKLERARMEQENLRQIELDKKKKEEERKKKEEERKKKEVDMAAKKRQREEEERKE 1680

Query: 1681 KERKRMRVEEVRRRLREHGGKLRSDKENKEAKPQANDQKPRDRKGCKDGTVKLVKESGHD 1740
            KERKRMRVEEVRRRLREHGGKLRSDKENKEAKPQANDQKPRDRKGCKDGTVKLVKESGHD
Sbjct: 1681 KERKRMRVEEVRRRLREHGGKLRSDKENKEAKPQANDQKPRDRKGCKDGTVKLVKESGHD 1740

Query: 1741 SFHKLSVTESKTTSTSDAVRGSFVVEDSQPTSVDFLEAEALENVMEHRISETSEEQSYQI 1800
            SFHKLSVTESKTTSTSDAVRGSFVVEDSQPTSVDFLEAEALENVMEHRISETSEEQSYQI
Sbjct: 1741 SFHKLSVTESKTTSTSDAVRGSFVVEDSQPTSVDFLEAEALENVMEHRISETSEEQSYQI 1800

Query: 1801 SPYKASDDEDEEDDDDGIQNNKFVPSWASKDRLAVLFASQKKLDPEIIFPPKSFCDIAEV 1860
            SPYKASDDEDEEDDDDGIQNNKFVPSWASKDRLAVLFASQKKLDPEIIFPPKSFCDIAEV
Sbjct: 1801 SPYKASDDEDEEDDDDGIQNNKFVPSWASKDRLAVLFASQKKLDPEIIFPPKSFCDIAEV 1860

Query: 1861 LLPRQHQSK 1869
            LLPRQHQSK
Sbjct: 1861 LLPRQHQSK 1867

BLAST of Cp4.1LG06g08410 vs. NCBI nr
Match: KAG7030270.1 (hypothetical protein SDJN02_08617 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 3392 bits (8794), Expect = 0.0
Identity = 1773/1870 (94.81%), Positives = 1816/1870 (97.11%), Query Frame = 0

Query: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISYF 60
            MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWL SPFLHSNIS+F
Sbjct: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLQSPFLHSNISHF 60

Query: 61   EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRSNEDAGSLNDDFDAGNRP 120
            EGV V+RNFVPGVEVPRSPLQTH SSLNE  VANSGEELQQRSNEDAGSLNDDFDAG RP
Sbjct: 61   EGVEVNRNFVPGVEVPRSPLQTHRSSLNEVLVANSGEELQQRSNEDAGSLNDDFDAGIRP 120

Query: 121  AVLPQCNVSDARVFNCAPRVDTSPVSPQGRGGGVLENYQDPTLSRARLHRSKSRQRALEL 180
            AVLPQCNVSDARVFNCAPRVDTSPVSPQGRGG VLENYQDPTLSRARLHRSKSRQRALEL
Sbjct: 121  AVLPQCNVSDARVFNCAPRVDTSPVSPQGRGGVVLENYQDPTLSRARLHRSKSRQRALEL 180

Query: 181  RNSAKSARCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETN 240
            RNSAKSARCHSR+ENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETN
Sbjct: 181  RNSAKSARCHSRFENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETN 240

Query: 241  VCCEQKNISTCADKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHED 300
            VCCEQKNIS C+DKSRQRALELR SVKS+RCHSRYENKNDSVADGIVGS+ISLLQADH+D
Sbjct: 241  VCCEQKNISICSDKSRQRALELRKSVKSARCHSRYENKNDSVADGIVGSSISLLQADHDD 300

Query: 301  ESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSRQRALELRNSVKSSRCHSRYENK 360
            ESELAKPSSSCKGIGS+EEE++VCCEQKNISICSDKSRQRALELRNS KS+RCHSRYENK
Sbjct: 301  ESELAKPSSSCKGIGSVEEESNVCCEQKNISICSDKSRQRALELRNSAKSARCHSRYENK 360

Query: 361  NDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSR 420
            NDSV DGIVGSAIS L+ADHE+ESELAKPSSSCKGIGS+EEET++CCEQKNISICSDKSR
Sbjct: 361  NDSV-DGIVGSAISSLRADHEEESELAKPSSSCKGIGSVEEETNICCEQKNISICSDKSR 420

Query: 421  QRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGS 480
            QRALELR SVKS+RCHSRYENKNDSVADGIVGS+ISLLQADH+DESELAKPSSSCKGIGS
Sbjct: 421  QRALELRKSVKSARCHSRYENKNDSVADGIVGSSISLLQADHDDESELAKPSSSCKGIGS 480

Query: 481  VEEETNVCCEQKKISICSGKVTIVGSPGLQSSSIDVVNSLNIYLENEGLCVAEGSMQNSY 540
            VEEETN+CCEQK ISICS KVTIVGSPGLQSSSIDVVNSLNI LENEGLCVAEGSMQNSY
Sbjct: 481  VEEETNICCEQKNISICSDKVTIVGSPGLQSSSIDVVNSLNICLENEGLCVAEGSMQNSY 540

Query: 541  KVNEQFDSPRTSSGKIGYCEEGPACCRSQESNFDNAELSRLQCSSLDVDKSSRIPPEDGR 600
            KV+EQFDSPRTSSGKIGYCEEGPACCRSQE NFDNAELSRLQCSSLDVDKSSRIPPEDGR
Sbjct: 541  KVDEQFDSPRTSSGKIGYCEEGPACCRSQEPNFDNAELSRLQCSSLDVDKSSRIPPEDGR 600

Query: 601  ECPIGGSKLHSDQVDEQLDLPKPSSDNVECCEEAKLVDCRSQECNLDNALQSKSQRSSLD 660
             CPI GSKLHSDQVDEQLDLPKPSSDNVECCEEA LVDCRSQECNLDNALQS+ QRSSLD
Sbjct: 601  GCPIRGSKLHSDQVDEQLDLPKPSSDNVECCEEAVLVDCRSQECNLDNALQSERQRSSLD 660

Query: 661  VDDSACIDANDGRLLDSPNPSSSNVKCCEETVVGHCRSQECNFDNAREAGSLYNSQDVDK 720
            VDDSACIDA DGRLLD  NPSS NVKCCEETV+GHCRSQECNFDNAREAGSL NSQDVDK
Sbjct: 661  VDDSACIDATDGRLLDLSNPSSGNVKCCEETVIGHCRSQECNFDNAREAGSLCNSQDVDK 720

Query: 721  SSYVHSEDGQSCPNGSSEVHSDEVKEQLDLSKSSSDNMECCEEEILGDFRSQEYNFNNAQ 780
            SSYVHSEDG+SCPNGSSEVHSDE+KEQLDLSKSSSDNMECC+EEILGDFRSQEYNFNNAQ
Sbjct: 721  SSYVHSEDGRSCPNGSSEVHSDELKEQLDLSKSSSDNMECCKEEILGDFRSQEYNFNNAQ 780

Query: 781  KSEMQHDTLDADNSSCFSSENGTCSVGSSKLHSDRVSEPSELFRPSSANVECHEVGLGDC 840
             S MQH++LDADNSSCFSSENGT SVGSSKLHS +VSEPSELFRPSSAN+ECHE GLGDC
Sbjct: 781  MSGMQHNSLDADNSSCFSSENGTRSVGSSKLHSGQVSEPSELFRPSSANIECHEEGLGDC 840

Query: 841  RTQDCNFDNNAEKSGLDKISSSPITEVREKTSDKKPSTSVDNKRDVNEKEKCNSPLHMPM 900
            RTQDCNFDNNAEKSGLDKISSSPITEVREKTSDKKPSTSVD+KRDVNEKEKCNSPLHMPM
Sbjct: 841  RTQDCNFDNNAEKSGLDKISSSPITEVREKTSDKKPSTSVDDKRDVNEKEKCNSPLHMPM 900

Query: 901  PQIQVDSVNEDKHHKGICESQSEKRYDKEVATCSLLQSDEPVEQNISLKDGVPNLQYSHE 960
            PQIQVDS+NED++ KG+ ESQSEKRYDKEVATCSLLQSDEP EQNISLKDGVPNLQYSHE
Sbjct: 901  PQIQVDSLNEDEYDKGVYESQSEKRYDKEVATCSLLQSDEPAEQNISLKDGVPNLQYSHE 960

Query: 961  NAVEIQLVDTDDASILIRDTETFRDQMVMAPCVPSAGEGDSNLEQKQKSSGITQCEDSDS 1020
            NAVEIQLVDTDDASILIRDTETFRDQMVMAPCVPSAGEGDSNLEQ+ KSSGITQCEDS S
Sbjct: 961  NAVEIQLVDTDDASILIRDTETFRDQMVMAPCVPSAGEGDSNLEQQLKSSGITQCEDSGS 1020

Query: 1021 FEGCTDHMVMAPCVPSAGEGDSNLEQPLKSSGITQCEDSDSFEGFTEHLNGNHHYVSTEC 1080
            FEGCTDHMVMAPCVPSAGEGDSNLE+PLKSSGITQCEDSDSFEG TE  NGNHHYVSTEC
Sbjct: 1021 FEGCTDHMVMAPCVPSAGEGDSNLEKPLKSSGITQCEDSDSFEGCTEQ-NGNHHYVSTEC 1080

Query: 1081 QTAETSIKSKTFSSVLRASSSDEKEIEVELQLDNGIPASLGLRSEQLQI-NRSPIDKNLM 1140
            QTAETSI+ KTFSSVLRASSS+EKEIEVELQLDNGIPASLGLR EQLQI NRSPIDKNLM
Sbjct: 1081 QTAETSIELKTFSSVLRASSSNEKEIEVELQLDNGIPASLGLRIEQLQIINRSPIDKNLM 1140

Query: 1141 QEFDTEKPVLELQRLSFCEEGYQQPNVSIGPIEMLLLEKEARLIQRSDSSPTLPVKEDLS 1200
            QEFDTEKPVLELQRLSFCEEGYQQPNVSIGP E+LLLEKEARLIQ SDSS TLPVKEDLS
Sbjct: 1141 QEFDTEKPVLELQRLSFCEEGYQQPNVSIGPTEILLLEKEARLIQGSDSSSTLPVKEDLS 1200

Query: 1201 RFGSNNRGTPLQNGMLESQSLVPEENFQCGDIELPMDTGKTDGMEEKGKLTLCSLHTPLT 1260
            RFGSNNRGTPLQNGMLESQSLVPEENFQCGDIELPMDTGKTDGMEEKGKLTLCSLHTPLT
Sbjct: 1201 RFGSNNRGTPLQNGMLESQSLVPEENFQCGDIELPMDTGKTDGMEEKGKLTLCSLHTPLT 1260

Query: 1261 QTSHYLGADKDMPALEGFLMRSDDEEPCISVGGINFDKLDLSKCMIERASILEKICKSAC 1320
            QTSHYLGADKDMPALEGFLMRSDDEEPCISVGGINFDKLDLSKCMIERASILEKICKSAC
Sbjct: 1261 QTSHYLGADKDMPALEGFLMRSDDEEPCISVGGINFDKLDLSKCMIERASILEKICKSAC 1320

Query: 1321 INSTLSSPSESFRLNKVTDLYNSLPNGLLECMDLKNNLLMNDQNKLLKDGSNSLNGEVNF 1380
            INSTLSSPSESFRLNKVTDLYNSLPNGLLECMDLKNNLLMNDQNKLLKDGSNSLNGEVN 
Sbjct: 1321 INSTLSSPSESFRLNKVTDLYNSLPNGLLECMDLKNNLLMNDQNKLLKDGSNSLNGEVNC 1380

Query: 1381 SPHGSSFDCLQSFNSHSAGDLRKPFASPFGKLLDRNSLNLSSSGKRSGQNIELPCISEEA 1440
            SPHGSSFDCLQSFN+HSAGDLRKPFASPFGKLLDRNSLN SSSGKRS QNIELPCISEEA
Sbjct: 1381 SPHGSSFDCLQSFNNHSAGDLRKPFASPFGKLLDRNSLNSSSSGKRSSQNIELPCISEEA 1440

Query: 1441 ENTDEIDNEFSKDMRSSKRAPLVDITEDANVEVTVSEAAAVADRLSLESLNIELSNTRTH 1500
            ENTDEIDNEF K MRSSKRAPLVDITEDANVEVTVSEA AVADRLSLESLNIELSNTRTH
Sbjct: 1441 ENTDEIDNEFLKAMRSSKRAPLVDITEDANVEVTVSEAVAVADRLSLESLNIELSNTRTH 1500

Query: 1501 IGTKENLGNQKSSKRKYVNEAVSRDTLPGENGAKRVTRSSYNIFSRSDLSCKKDFRKEGP 1560
             GTKENLGNQKSSKRKYVNEAVSRDTLPGENGAKRVTRSSYN FSRSDLSCKKDFRKEGP
Sbjct: 1501 NGTKENLGNQKSSKRKYVNEAVSRDTLPGENGAKRVTRSSYNTFSRSDLSCKKDFRKEGP 1560

Query: 1561 RFSEKESKHRNIVSNITSFIPLVQQREAATILKGKRDIKVKAIEAAEAAKRLAEKKENER 1620
            RFSEKESKHRNIVSNITSFIPLVQQREAATILKGKRDIKVKAIEAAEAAKRLAEKKENER
Sbjct: 1561 RFSEKESKHRNIVSNITSFIPLVQQREAATILKGKRDIKVKAIEAAEAAKRLAEKKENER 1620

Query: 1621 QMKKEALKLERARMEQENLRQIELDKKKKEEERKKKEEERKKKEVDMAAKKRQREEEERK 1680
            QMKKEALKLERARMEQENLRQIELDKKKKEEERKKKEEERKKKEVDMAAKKRQREEEERK
Sbjct: 1621 QMKKEALKLERARMEQENLRQIELDKKKKEEERKKKEEERKKKEVDMAAKKRQREEEERK 1680

Query: 1681 EKERKRMRVEEVRRRLREHGGKLRSDKENKEAKPQANDQKPRDRKGCKDGTVKLVKESGH 1740
            EKERKRMRVEEVRRRLREHGGKLRSDKENKEAKP+ANDQKPRDRKGCKDGTVKLVKESGH
Sbjct: 1681 EKERKRMRVEEVRRRLREHGGKLRSDKENKEAKPRANDQKPRDRKGCKDGTVKLVKESGH 1740

Query: 1741 DSFHKLSVTESKTTSTSDAVRGSFVVEDSQPTSVDFLEAEALENVMEHRISETSEEQSYQ 1800
            DSFHKLSVTESKTTSTSDAVRGSFVVEDSQPTSVDFLEAEALENVMEHRISETSEEQSYQ
Sbjct: 1741 DSFHKLSVTESKTTSTSDAVRGSFVVEDSQPTSVDFLEAEALENVMEHRISETSEEQSYQ 1800

Query: 1801 ISPYKASDDEDEEDDDDGIQNNKFVPSWASKDRLAVLFASQKKLDPEIIFPPKSFCDIAE 1860
            ISPYKASDDEDEEDDDDGIQNNKFVPSWASKDRLAVLFASQK+LDPEIIFPPKSFCDIAE
Sbjct: 1801 ISPYKASDDEDEEDDDDGIQNNKFVPSWASKDRLAVLFASQKRLDPEIIFPPKSFCDIAE 1860

Query: 1861 VLLPRQHQSK 1869
            VLLPRQHQ K
Sbjct: 1861 VLLPRQHQFK 1868

BLAST of Cp4.1LG06g08410 vs. NCBI nr
Match: XP_022936495.1 (uncharacterized protein LOC111443094 isoform X1 [Cucurbita moschata])

HSP 1 Score: 3263 bits (8459), Expect = 0.0
Identity = 1774/2198 (80.71%), Positives = 1814/2198 (82.53%), Query Frame = 0

Query: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISYF 60
            MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNIS+F
Sbjct: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISHF 60

Query: 61   EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRSNEDAGSLNDDFDAGNRP 120
            EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQR NEDAGSLNDDFDAG RP
Sbjct: 61   EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRLNEDAGSLNDDFDAGIRP 120

Query: 121  AVLPQCNVSDARVFNCAPRVDTSPVSPQGRGGGVLENYQDPTLSRARLHRSKSRQRALEL 180
            AVLPQCNVSDARVFNCAPRVDT+PVSPQGRGG VLENYQDPTLSRARLHRSKSRQRALEL
Sbjct: 121  AVLPQCNVSDARVFNCAPRVDTTPVSPQGRGGVVLENYQDPTLSRARLHRSKSRQRALEL 180

Query: 181  RNSAKSARCHSRYENKND------------------------------------------ 240
            RNSAKSARCHSRYENKND                                          
Sbjct: 181  RNSAKSARCHSRYENKNDFVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSVEEETN 240

Query: 241  ------------------------------------------------------------ 300
                                                                        
Sbjct: 241  VCCEKNNISICSDKSRQRPLELRNSGKSSRCHSRYENKNDSIADGIVGSAISLLQADHED 300

Query: 301  ------------------------------------------------------------ 360
                                                                        
Sbjct: 301  ESELAKPSSSCKGIGSVEEETNVCCEKNNISICSDKSRQRPLELRNSGKSSRCHSRYENK 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  NDSIADGIVGSAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCCEKNNISICSDKSR 420

Query: 421  ------------------------SVADGIVGSAISLLQADHEDESELAKPSSSCKGIGS 480
                                    S+ADGIVGSAISLLQADHEDESELAKPSSSCKGIGS
Sbjct: 421  QRPLELRNSGKSSRCHSRYENKNDSIADGIVGSAISLLQADHEDESELAKPSSSCKGIGS 480

Query: 481  MEEETNVCCEQKNISTCADKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLL 540
            +EEETNVCCE+ NIS C+DKSRQR LELRNSVKSSRCHSRYENKNDS+ADGIVGSAISLL
Sbjct: 481  VEEETNVCCEKNNISICSDKSRQRPLELRNSVKSSRCHSRYENKNDSIADGIVGSAISLL 540

Query: 541  QADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSRQRALELRNSVKSSRCH 600
            QADHEDESELAKPSSSCKGIGS+EEET+VCCE+ NISICSDKSRQR LELRNSVKSSRCH
Sbjct: 541  QADHEDESELAKPSSSCKGIGSVEEETNVCCEKNNISICSDKSRQRPLELRNSVKSSRCH 600

Query: 601  SRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISI 660
            SRYENKNDS+ADGIVGSAISLLQADHEDESELAKPSSSCKGIGS+EEET+VCCEQKNISI
Sbjct: 601  SRYENKNDSIADGIVGSAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCCEQKNISI 660

Query: 661  CS---------------------------------------------------------- 720
            CS                                                          
Sbjct: 661  CSHKSRQRSLELRNSAKSARCHSPYENKNDSVADGIVGSAISSLRADHEDESELAKPSSS 720

Query: 721  ------------------------DKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVG 780
                                    DKSRQR LELRNSVKSSRCHSRYENKNDS+ADGIVG
Sbjct: 721  CKGIGSVEEETNVCCEKNNISICSDKSRQRPLELRNSVKSSRCHSRYENKNDSIADGIVG 780

Query: 781  SAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCCEQKKISICSGKVTIVGSPGLQSS 840
            SAISLLQADHEDESELAKPSSSCKGIGSVEEETNVC EQK ISICS KVTIVGSPGLQSS
Sbjct: 781  SAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCYEQKNISICSDKVTIVGSPGLQSS 840

Query: 841  SIDVVNSLNIYLENEGLCVAEGSMQNSYKVNEQFDSPRTSSGKIGYCEEGPACCRSQESN 900
            SIDVVNSLNIY+ENEGLCVAEGS +NSYKVNEQFDSP TSSGKIGYCEEGPA CRSQESN
Sbjct: 841  SIDVVNSLNIYIENEGLCVAEGSTRNSYKVNEQFDSPSTSSGKIGYCEEGPASCRSQESN 900

Query: 901  FDNAELSRLQCSSLDVDKSSRIPPEDGRECPIGGSKLHSDQVDEQLDLPKPSSDNVECCE 960
            FDNAELSRLQCSSLDVDKSSRIPPEDGR  PIGGSKLHSDQVDEQLDLPKPSSDNVECCE
Sbjct: 901  FDNAELSRLQCSSLDVDKSSRIPPEDGRGYPIGGSKLHSDQVDEQLDLPKPSSDNVECCE 960

Query: 961  EAKLVDCRSQECNLDNALQSKSQRSSLDVDDSACIDANDGRLLDSPNPSSSNVKCCEETV 1020
            EA LVDCRSQECNLDNALQS+SQRSS DVDDSACIDA DGRLLD  NPSS NVKCCEET+
Sbjct: 961  EAVLVDCRSQECNLDNALQSESQRSSPDVDDSACIDATDGRLLDLSNPSSGNVKCCEETI 1020

Query: 1021 VGHCRSQECNFDNAREAGSLYNSQDVDKSSYVHSEDGQSCPNGSSEVHSDEVKEQLDLSK 1080
            +GHCRSQECNFDNAREAGSLYNSQDVDKSSYVH ED +SCPNGSSEVHSDE+KE+LDLSK
Sbjct: 1021 LGHCRSQECNFDNAREAGSLYNSQDVDKSSYVHPEDRRSCPNGSSEVHSDELKERLDLSK 1080

Query: 1081 SSSDNMECCEEEILGDFRSQEYNFNNAQKSEMQHDTLDADNSSCFSSENGTCSVGSSKLH 1140
            SSSDNMECCEEEILGDFRSQEYNFNNAQKS MQH++LDADNSSCFSSENGT SVGSSKLH
Sbjct: 1081 SSSDNMECCEEEILGDFRSQEYNFNNAQKSGMQHNSLDADNSSCFSSENGTRSVGSSKLH 1140

Query: 1141 SDRVSEPSELFRPSSANVECHEVGLGDCRTQDCNFDNNAEKSGLDKISSSPITEVREKTS 1200
            SD+VSEPSELFRPSSAN+ECHE GLGDCRTQDCNFDNNAEKSGLDKISSSPITEVREKTS
Sbjct: 1141 SDQVSEPSELFRPSSANIECHEEGLGDCRTQDCNFDNNAEKSGLDKISSSPITEVREKTS 1200

Query: 1201 DKKPSTSVDNKRDVNEKEKCNSPLHMPMPQIQVDSVNEDKHHKGICESQSEKRYDKEVAT 1260
            DKKPSTSVD+KRDVNEKEKCNSPLHMPMPQIQVDS+NED++ KG+ ESQSEKRYDKEVAT
Sbjct: 1201 DKKPSTSVDDKRDVNEKEKCNSPLHMPMPQIQVDSLNEDEYDKGVYESQSEKRYDKEVAT 1260

Query: 1261 CSLLQSDEPVEQNISLKDGVPNLQYSHENAVEIQLVDTDDASILIRDTETFRDQMVMAPC 1320
            CSLLQSDEP EQNISLKDGVPNLQYSHENAVEIQLVDTDDASILIRDTETFRDQMVMAPC
Sbjct: 1261 CSLLQSDEPAEQNISLKDGVPNLQYSHENAVEIQLVDTDDASILIRDTETFRDQMVMAPC 1320

Query: 1321 VPSAGEGDSNLEQKQKSSGITQCEDSDSFEGCTDHMVMAPCVPSAGEGDSNLEQPLKSSG 1380
            VPSAGEGDSNLEQ+ KSSGITQCEDS SFEGCTDHMVMAPCVP AGEGDSNLE+PLKSSG
Sbjct: 1321 VPSAGEGDSNLEQQLKSSGITQCEDSGSFEGCTDHMVMAPCVPCAGEGDSNLEKPLKSSG 1380

Query: 1381 ITQCEDSDSFEGFTEHLNGNHHYVSTECQTAETSIKSKTFSSVLRASSSDEKEIEVELQL 1440
            ITQCEDSDSFEG TE  NGNHHYVSTECQTAETSI+ KTFSSVLRASSS+EKEIEVELQL
Sbjct: 1381 ITQCEDSDSFEGCTEQ-NGNHHYVSTECQTAETSIELKTFSSVLRASSSNEKEIEVELQL 1440

Query: 1441 DNGIPASLGLRSEQLQI-NRSPIDKNLMQEFDTEKPVLELQRLSFCEEGYQQPNVSIGPI 1500
            DNGIPAS GLR EQLQI NRSPIDK+LMQEFDTEKPVLELQRLSFCEEGYQQPNVSIGP 
Sbjct: 1441 DNGIPASFGLRIEQLQIINRSPIDKDLMQEFDTEKPVLELQRLSFCEEGYQQPNVSIGPT 1500

Query: 1501 EMLLLEKEARLIQRSDSSPTLPVKEDLSRFGSNNRGTPLQNGMLESQSLVPEENFQCGDI 1560
            E+L LEKEARLIQ SDSS TLPVKEDLSRFGSNNRGTPLQNGMLESQSLVPEENFQCGDI
Sbjct: 1501 EILRLEKEARLIQGSDSSSTLPVKEDLSRFGSNNRGTPLQNGMLESQSLVPEENFQCGDI 1560

Query: 1561 ELPMDTGKTDGMEEKGKLTLCSLHTPLTQTSHYLGADKDMPALEGFLMRSDDEEPCISVG 1620
            ELP DTGKTDGMEEKGKL LCSLHTPLTQTSHYLGADKDMPALEGFLMRSDDEEPCISVG
Sbjct: 1561 ELPTDTGKTDGMEEKGKLALCSLHTPLTQTSHYLGADKDMPALEGFLMRSDDEEPCISVG 1620

Query: 1621 GINFDKLDLSKCMIERASILEKICKSACINSTLSSPSESFRLNKVTDLYNSLPNGLLECM 1680
            GINFDKLDLSKCMIERASILEKICKSACINSTLSSPSESFRLNKVTDLYNSLPNGLLECM
Sbjct: 1621 GINFDKLDLSKCMIERASILEKICKSACINSTLSSPSESFRLNKVTDLYNSLPNGLLECM 1680

Query: 1681 DLKNNLLMNDQNKLLKDGSNSLNGEVNFSPHGSSFDCLQSFNSHSAGDLRKPFASPFGKL 1740
            DLKNNLLMNDQNKLLKDGSNSLNGEVN SPHGSSFDCLQSFN+HSAGDLRKPFASPFGKL
Sbjct: 1681 DLKNNLLMNDQNKLLKDGSNSLNGEVNCSPHGSSFDCLQSFNNHSAGDLRKPFASPFGKL 1740

Query: 1741 LDRNSLNLSSSGKRSGQNIELPCISEEAENTDEIDNEFSKDMRSSKRAPLVDITEDANVE 1800
            LDRNSLN SSSGKRS QNIELPCISEEAENTDEIDNEFSK MRSSKRAPLVDITEDANVE
Sbjct: 1741 LDRNSLNSSSSGKRSSQNIELPCISEEAENTDEIDNEFSKAMRSSKRAPLVDITEDANVE 1800

Query: 1801 VTVSEAAAVADRLSLESLNIELSNTRTHIGTKENLGNQKSSKRKYVNEAVSRDTLPGENG 1860
            VTVSEA AVADRLSLESLNIELSNTRTH GTKENLGNQKSSKRKYVNEAVSRD+LPGENG
Sbjct: 1801 VTVSEAVAVADRLSLESLNIELSNTRTHNGTKENLGNQKSSKRKYVNEAVSRDSLPGENG 1860

Query: 1861 AKRVTRSSYNIFSRSDLSCKKDFRKEGPRFSEKESKHRNIVSNITSFIPLVQQREAATIL 1869
            AKRVTRSSYN FSRSDLSCKKDFRKEGPRFSEKESKHRNIVSNITSFIPLVQQREAATIL
Sbjct: 1861 AKRVTRSSYNTFSRSDLSCKKDFRKEGPRFSEKESKHRNIVSNITSFIPLVQQREAATIL 1920

BLAST of Cp4.1LG06g08410 vs. NCBI nr
Match: XP_022936496.1 (uncharacterized protein LOC111443094 isoform X2 [Cucurbita moschata])

HSP 1 Score: 3254 bits (8436), Expect = 0.0
Identity = 1772/2198 (80.62%), Positives = 1812/2198 (82.44%), Query Frame = 0

Query: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISYF 60
            MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNIS+F
Sbjct: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISHF 60

Query: 61   EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRSNEDAGSLNDDFDAGNRP 120
            EGV  SRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQR NEDAGSLNDDFDAG RP
Sbjct: 61   EGV--SRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRLNEDAGSLNDDFDAGIRP 120

Query: 121  AVLPQCNVSDARVFNCAPRVDTSPVSPQGRGGGVLENYQDPTLSRARLHRSKSRQRALEL 180
            AVLPQCNVSDARVFNCAPRVDT+PVSPQGRGG VLENYQDPTLSRARLHRSKSRQRALEL
Sbjct: 121  AVLPQCNVSDARVFNCAPRVDTTPVSPQGRGGVVLENYQDPTLSRARLHRSKSRQRALEL 180

Query: 181  RNSAKSARCHSRYENKND------------------------------------------ 240
            RNSAKSARCHSRYENKND                                          
Sbjct: 181  RNSAKSARCHSRYENKNDFVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSVEEETN 240

Query: 241  ------------------------------------------------------------ 300
                                                                        
Sbjct: 241  VCCEKNNISICSDKSRQRPLELRNSGKSSRCHSRYENKNDSIADGIVGSAISLLQADHED 300

Query: 301  ------------------------------------------------------------ 360
                                                                        
Sbjct: 301  ESELAKPSSSCKGIGSVEEETNVCCEKNNISICSDKSRQRPLELRNSGKSSRCHSRYENK 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  NDSIADGIVGSAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCCEKNNISICSDKSR 420

Query: 421  ------------------------SVADGIVGSAISLLQADHEDESELAKPSSSCKGIGS 480
                                    S+ADGIVGSAISLLQADHEDESELAKPSSSCKGIGS
Sbjct: 421  QRPLELRNSGKSSRCHSRYENKNDSIADGIVGSAISLLQADHEDESELAKPSSSCKGIGS 480

Query: 481  MEEETNVCCEQKNISTCADKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLL 540
            +EEETNVCCE+ NIS C+DKSRQR LELRNSVKSSRCHSRYENKNDS+ADGIVGSAISLL
Sbjct: 481  VEEETNVCCEKNNISICSDKSRQRPLELRNSVKSSRCHSRYENKNDSIADGIVGSAISLL 540

Query: 541  QADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSRQRALELRNSVKSSRCH 600
            QADHEDESELAKPSSSCKGIGS+EEET+VCCE+ NISICSDKSRQR LELRNSVKSSRCH
Sbjct: 541  QADHEDESELAKPSSSCKGIGSVEEETNVCCEKNNISICSDKSRQRPLELRNSVKSSRCH 600

Query: 601  SRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISI 660
            SRYENKNDS+ADGIVGSAISLLQADHEDESELAKPSSSCKGIGS+EEET+VCCEQKNISI
Sbjct: 601  SRYENKNDSIADGIVGSAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCCEQKNISI 660

Query: 661  CS---------------------------------------------------------- 720
            CS                                                          
Sbjct: 661  CSHKSRQRSLELRNSAKSARCHSPYENKNDSVADGIVGSAISSLRADHEDESELAKPSSS 720

Query: 721  ------------------------DKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVG 780
                                    DKSRQR LELRNSVKSSRCHSRYENKNDS+ADGIVG
Sbjct: 721  CKGIGSVEEETNVCCEKNNISICSDKSRQRPLELRNSVKSSRCHSRYENKNDSIADGIVG 780

Query: 781  SAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCCEQKKISICSGKVTIVGSPGLQSS 840
            SAISLLQADHEDESELAKPSSSCKGIGSVEEETNVC EQK ISICS KVTIVGSPGLQSS
Sbjct: 781  SAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCYEQKNISICSDKVTIVGSPGLQSS 840

Query: 841  SIDVVNSLNIYLENEGLCVAEGSMQNSYKVNEQFDSPRTSSGKIGYCEEGPACCRSQESN 900
            SIDVVNSLNIY+ENEGLCVAEGS +NSYKVNEQFDSP TSSGKIGYCEEGPA CRSQESN
Sbjct: 841  SIDVVNSLNIYIENEGLCVAEGSTRNSYKVNEQFDSPSTSSGKIGYCEEGPASCRSQESN 900

Query: 901  FDNAELSRLQCSSLDVDKSSRIPPEDGRECPIGGSKLHSDQVDEQLDLPKPSSDNVECCE 960
            FDNAELSRLQCSSLDVDKSSRIPPEDGR  PIGGSKLHSDQVDEQLDLPKPSSDNVECCE
Sbjct: 901  FDNAELSRLQCSSLDVDKSSRIPPEDGRGYPIGGSKLHSDQVDEQLDLPKPSSDNVECCE 960

Query: 961  EAKLVDCRSQECNLDNALQSKSQRSSLDVDDSACIDANDGRLLDSPNPSSSNVKCCEETV 1020
            EA LVDCRSQECNLDNALQS+SQRSS DVDDSACIDA DGRLLD  NPSS NVKCCEET+
Sbjct: 961  EAVLVDCRSQECNLDNALQSESQRSSPDVDDSACIDATDGRLLDLSNPSSGNVKCCEETI 1020

Query: 1021 VGHCRSQECNFDNAREAGSLYNSQDVDKSSYVHSEDGQSCPNGSSEVHSDEVKEQLDLSK 1080
            +GHCRSQECNFDNAREAGSLYNSQDVDKSSYVH ED +SCPNGSSEVHSDE+KE+LDLSK
Sbjct: 1021 LGHCRSQECNFDNAREAGSLYNSQDVDKSSYVHPEDRRSCPNGSSEVHSDELKERLDLSK 1080

Query: 1081 SSSDNMECCEEEILGDFRSQEYNFNNAQKSEMQHDTLDADNSSCFSSENGTCSVGSSKLH 1140
            SSSDNMECCEEEILGDFRSQEYNFNNAQKS MQH++LDADNSSCFSSENGT SVGSSKLH
Sbjct: 1081 SSSDNMECCEEEILGDFRSQEYNFNNAQKSGMQHNSLDADNSSCFSSENGTRSVGSSKLH 1140

Query: 1141 SDRVSEPSELFRPSSANVECHEVGLGDCRTQDCNFDNNAEKSGLDKISSSPITEVREKTS 1200
            SD+VSEPSELFRPSSAN+ECHE GLGDCRTQDCNFDNNAEKSGLDKISSSPITEVREKTS
Sbjct: 1141 SDQVSEPSELFRPSSANIECHEEGLGDCRTQDCNFDNNAEKSGLDKISSSPITEVREKTS 1200

Query: 1201 DKKPSTSVDNKRDVNEKEKCNSPLHMPMPQIQVDSVNEDKHHKGICESQSEKRYDKEVAT 1260
            DKKPSTSVD+KRDVNEKEKCNSPLHMPMPQIQVDS+NED++ KG+ ESQSEKRYDKEVAT
Sbjct: 1201 DKKPSTSVDDKRDVNEKEKCNSPLHMPMPQIQVDSLNEDEYDKGVYESQSEKRYDKEVAT 1260

Query: 1261 CSLLQSDEPVEQNISLKDGVPNLQYSHENAVEIQLVDTDDASILIRDTETFRDQMVMAPC 1320
            CSLLQSDEP EQNISLKDGVPNLQYSHENAVEIQLVDTDDASILIRDTETFRDQMVMAPC
Sbjct: 1261 CSLLQSDEPAEQNISLKDGVPNLQYSHENAVEIQLVDTDDASILIRDTETFRDQMVMAPC 1320

Query: 1321 VPSAGEGDSNLEQKQKSSGITQCEDSDSFEGCTDHMVMAPCVPSAGEGDSNLEQPLKSSG 1380
            VPSAGEGDSNLEQ+ KSSGITQCEDS SFEGCTDHMVMAPCVP AGEGDSNLE+PLKSSG
Sbjct: 1321 VPSAGEGDSNLEQQLKSSGITQCEDSGSFEGCTDHMVMAPCVPCAGEGDSNLEKPLKSSG 1380

Query: 1381 ITQCEDSDSFEGFTEHLNGNHHYVSTECQTAETSIKSKTFSSVLRASSSDEKEIEVELQL 1440
            ITQCEDSDSFEG TE  NGNHHYVSTECQTAETSI+ KTFSSVLRASSS+EKEIEVELQL
Sbjct: 1381 ITQCEDSDSFEGCTEQ-NGNHHYVSTECQTAETSIELKTFSSVLRASSSNEKEIEVELQL 1440

Query: 1441 DNGIPASLGLRSEQLQI-NRSPIDKNLMQEFDTEKPVLELQRLSFCEEGYQQPNVSIGPI 1500
            DNGIPAS GLR EQLQI NRSPIDK+LMQEFDTEKPVLELQRLSFCEEGYQQPNVSIGP 
Sbjct: 1441 DNGIPASFGLRIEQLQIINRSPIDKDLMQEFDTEKPVLELQRLSFCEEGYQQPNVSIGPT 1500

Query: 1501 EMLLLEKEARLIQRSDSSPTLPVKEDLSRFGSNNRGTPLQNGMLESQSLVPEENFQCGDI 1560
            E+L LEKEARLIQ SDSS TLPVKEDLSRFGSNNRGTPLQNGMLESQSLVPEENFQCGDI
Sbjct: 1501 EILRLEKEARLIQGSDSSSTLPVKEDLSRFGSNNRGTPLQNGMLESQSLVPEENFQCGDI 1560

Query: 1561 ELPMDTGKTDGMEEKGKLTLCSLHTPLTQTSHYLGADKDMPALEGFLMRSDDEEPCISVG 1620
            ELP DTGKTDGMEEKGKL LCSLHTPLTQTSHYLGADKDMPALEGFLMRSDDEEPCISVG
Sbjct: 1561 ELPTDTGKTDGMEEKGKLALCSLHTPLTQTSHYLGADKDMPALEGFLMRSDDEEPCISVG 1620

Query: 1621 GINFDKLDLSKCMIERASILEKICKSACINSTLSSPSESFRLNKVTDLYNSLPNGLLECM 1680
            GINFDKLDLSKCMIERASILEKICKSACINSTLSSPSESFRLNKVTDLYNSLPNGLLECM
Sbjct: 1621 GINFDKLDLSKCMIERASILEKICKSACINSTLSSPSESFRLNKVTDLYNSLPNGLLECM 1680

Query: 1681 DLKNNLLMNDQNKLLKDGSNSLNGEVNFSPHGSSFDCLQSFNSHSAGDLRKPFASPFGKL 1740
            DLKNNLLMNDQNKLLKDGSNSLNGEVN SPHGSSFDCLQSFN+HSAGDLRKPFASPFGKL
Sbjct: 1681 DLKNNLLMNDQNKLLKDGSNSLNGEVNCSPHGSSFDCLQSFNNHSAGDLRKPFASPFGKL 1740

Query: 1741 LDRNSLNLSSSGKRSGQNIELPCISEEAENTDEIDNEFSKDMRSSKRAPLVDITEDANVE 1800
            LDRNSLN SSSGKRS QNIELPCISEEAENTDEIDNEFSK MRSSKRAPLVDITEDANVE
Sbjct: 1741 LDRNSLNSSSSGKRSSQNIELPCISEEAENTDEIDNEFSKAMRSSKRAPLVDITEDANVE 1800

Query: 1801 VTVSEAAAVADRLSLESLNIELSNTRTHIGTKENLGNQKSSKRKYVNEAVSRDTLPGENG 1860
            VTVSEA AVADRLSLESLNIELSNTRTH GTKENLGNQKSSKRKYVNEAVSRD+LPGENG
Sbjct: 1801 VTVSEAVAVADRLSLESLNIELSNTRTHNGTKENLGNQKSSKRKYVNEAVSRDSLPGENG 1860

Query: 1861 AKRVTRSSYNIFSRSDLSCKKDFRKEGPRFSEKESKHRNIVSNITSFIPLVQQREAATIL 1869
            AKRVTRSSYN FSRSDLSCKKDFRKEGPRFSEKESKHRNIVSNITSFIPLVQQREAATIL
Sbjct: 1861 AKRVTRSSYNTFSRSDLSCKKDFRKEGPRFSEKESKHRNIVSNITSFIPLVQQREAATIL 1920

BLAST of Cp4.1LG06g08410 vs. ExPASy TrEMBL
Match: A0A6J1FDU9 (uncharacterized protein LOC111443094 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111443094 PE=3 SV=1)

HSP 1 Score: 3263 bits (8459), Expect = 0.0
Identity = 1774/2198 (80.71%), Positives = 1814/2198 (82.53%), Query Frame = 0

Query: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISYF 60
            MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNIS+F
Sbjct: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISHF 60

Query: 61   EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRSNEDAGSLNDDFDAGNRP 120
            EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQR NEDAGSLNDDFDAG RP
Sbjct: 61   EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRLNEDAGSLNDDFDAGIRP 120

Query: 121  AVLPQCNVSDARVFNCAPRVDTSPVSPQGRGGGVLENYQDPTLSRARLHRSKSRQRALEL 180
            AVLPQCNVSDARVFNCAPRVDT+PVSPQGRGG VLENYQDPTLSRARLHRSKSRQRALEL
Sbjct: 121  AVLPQCNVSDARVFNCAPRVDTTPVSPQGRGGVVLENYQDPTLSRARLHRSKSRQRALEL 180

Query: 181  RNSAKSARCHSRYENKND------------------------------------------ 240
            RNSAKSARCHSRYENKND                                          
Sbjct: 181  RNSAKSARCHSRYENKNDFVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSVEEETN 240

Query: 241  ------------------------------------------------------------ 300
                                                                        
Sbjct: 241  VCCEKNNISICSDKSRQRPLELRNSGKSSRCHSRYENKNDSIADGIVGSAISLLQADHED 300

Query: 301  ------------------------------------------------------------ 360
                                                                        
Sbjct: 301  ESELAKPSSSCKGIGSVEEETNVCCEKNNISICSDKSRQRPLELRNSGKSSRCHSRYENK 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  NDSIADGIVGSAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCCEKNNISICSDKSR 420

Query: 421  ------------------------SVADGIVGSAISLLQADHEDESELAKPSSSCKGIGS 480
                                    S+ADGIVGSAISLLQADHEDESELAKPSSSCKGIGS
Sbjct: 421  QRPLELRNSGKSSRCHSRYENKNDSIADGIVGSAISLLQADHEDESELAKPSSSCKGIGS 480

Query: 481  MEEETNVCCEQKNISTCADKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLL 540
            +EEETNVCCE+ NIS C+DKSRQR LELRNSVKSSRCHSRYENKNDS+ADGIVGSAISLL
Sbjct: 481  VEEETNVCCEKNNISICSDKSRQRPLELRNSVKSSRCHSRYENKNDSIADGIVGSAISLL 540

Query: 541  QADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSRQRALELRNSVKSSRCH 600
            QADHEDESELAKPSSSCKGIGS+EEET+VCCE+ NISICSDKSRQR LELRNSVKSSRCH
Sbjct: 541  QADHEDESELAKPSSSCKGIGSVEEETNVCCEKNNISICSDKSRQRPLELRNSVKSSRCH 600

Query: 601  SRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISI 660
            SRYENKNDS+ADGIVGSAISLLQADHEDESELAKPSSSCKGIGS+EEET+VCCEQKNISI
Sbjct: 601  SRYENKNDSIADGIVGSAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCCEQKNISI 660

Query: 661  CS---------------------------------------------------------- 720
            CS                                                          
Sbjct: 661  CSHKSRQRSLELRNSAKSARCHSPYENKNDSVADGIVGSAISSLRADHEDESELAKPSSS 720

Query: 721  ------------------------DKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVG 780
                                    DKSRQR LELRNSVKSSRCHSRYENKNDS+ADGIVG
Sbjct: 721  CKGIGSVEEETNVCCEKNNISICSDKSRQRPLELRNSVKSSRCHSRYENKNDSIADGIVG 780

Query: 781  SAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCCEQKKISICSGKVTIVGSPGLQSS 840
            SAISLLQADHEDESELAKPSSSCKGIGSVEEETNVC EQK ISICS KVTIVGSPGLQSS
Sbjct: 781  SAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCYEQKNISICSDKVTIVGSPGLQSS 840

Query: 841  SIDVVNSLNIYLENEGLCVAEGSMQNSYKVNEQFDSPRTSSGKIGYCEEGPACCRSQESN 900
            SIDVVNSLNIY+ENEGLCVAEGS +NSYKVNEQFDSP TSSGKIGYCEEGPA CRSQESN
Sbjct: 841  SIDVVNSLNIYIENEGLCVAEGSTRNSYKVNEQFDSPSTSSGKIGYCEEGPASCRSQESN 900

Query: 901  FDNAELSRLQCSSLDVDKSSRIPPEDGRECPIGGSKLHSDQVDEQLDLPKPSSDNVECCE 960
            FDNAELSRLQCSSLDVDKSSRIPPEDGR  PIGGSKLHSDQVDEQLDLPKPSSDNVECCE
Sbjct: 901  FDNAELSRLQCSSLDVDKSSRIPPEDGRGYPIGGSKLHSDQVDEQLDLPKPSSDNVECCE 960

Query: 961  EAKLVDCRSQECNLDNALQSKSQRSSLDVDDSACIDANDGRLLDSPNPSSSNVKCCEETV 1020
            EA LVDCRSQECNLDNALQS+SQRSS DVDDSACIDA DGRLLD  NPSS NVKCCEET+
Sbjct: 961  EAVLVDCRSQECNLDNALQSESQRSSPDVDDSACIDATDGRLLDLSNPSSGNVKCCEETI 1020

Query: 1021 VGHCRSQECNFDNAREAGSLYNSQDVDKSSYVHSEDGQSCPNGSSEVHSDEVKEQLDLSK 1080
            +GHCRSQECNFDNAREAGSLYNSQDVDKSSYVH ED +SCPNGSSEVHSDE+KE+LDLSK
Sbjct: 1021 LGHCRSQECNFDNAREAGSLYNSQDVDKSSYVHPEDRRSCPNGSSEVHSDELKERLDLSK 1080

Query: 1081 SSSDNMECCEEEILGDFRSQEYNFNNAQKSEMQHDTLDADNSSCFSSENGTCSVGSSKLH 1140
            SSSDNMECCEEEILGDFRSQEYNFNNAQKS MQH++LDADNSSCFSSENGT SVGSSKLH
Sbjct: 1081 SSSDNMECCEEEILGDFRSQEYNFNNAQKSGMQHNSLDADNSSCFSSENGTRSVGSSKLH 1140

Query: 1141 SDRVSEPSELFRPSSANVECHEVGLGDCRTQDCNFDNNAEKSGLDKISSSPITEVREKTS 1200
            SD+VSEPSELFRPSSAN+ECHE GLGDCRTQDCNFDNNAEKSGLDKISSSPITEVREKTS
Sbjct: 1141 SDQVSEPSELFRPSSANIECHEEGLGDCRTQDCNFDNNAEKSGLDKISSSPITEVREKTS 1200

Query: 1201 DKKPSTSVDNKRDVNEKEKCNSPLHMPMPQIQVDSVNEDKHHKGICESQSEKRYDKEVAT 1260
            DKKPSTSVD+KRDVNEKEKCNSPLHMPMPQIQVDS+NED++ KG+ ESQSEKRYDKEVAT
Sbjct: 1201 DKKPSTSVDDKRDVNEKEKCNSPLHMPMPQIQVDSLNEDEYDKGVYESQSEKRYDKEVAT 1260

Query: 1261 CSLLQSDEPVEQNISLKDGVPNLQYSHENAVEIQLVDTDDASILIRDTETFRDQMVMAPC 1320
            CSLLQSDEP EQNISLKDGVPNLQYSHENAVEIQLVDTDDASILIRDTETFRDQMVMAPC
Sbjct: 1261 CSLLQSDEPAEQNISLKDGVPNLQYSHENAVEIQLVDTDDASILIRDTETFRDQMVMAPC 1320

Query: 1321 VPSAGEGDSNLEQKQKSSGITQCEDSDSFEGCTDHMVMAPCVPSAGEGDSNLEQPLKSSG 1380
            VPSAGEGDSNLEQ+ KSSGITQCEDS SFEGCTDHMVMAPCVP AGEGDSNLE+PLKSSG
Sbjct: 1321 VPSAGEGDSNLEQQLKSSGITQCEDSGSFEGCTDHMVMAPCVPCAGEGDSNLEKPLKSSG 1380

Query: 1381 ITQCEDSDSFEGFTEHLNGNHHYVSTECQTAETSIKSKTFSSVLRASSSDEKEIEVELQL 1440
            ITQCEDSDSFEG TE  NGNHHYVSTECQTAETSI+ KTFSSVLRASSS+EKEIEVELQL
Sbjct: 1381 ITQCEDSDSFEGCTEQ-NGNHHYVSTECQTAETSIELKTFSSVLRASSSNEKEIEVELQL 1440

Query: 1441 DNGIPASLGLRSEQLQI-NRSPIDKNLMQEFDTEKPVLELQRLSFCEEGYQQPNVSIGPI 1500
            DNGIPAS GLR EQLQI NRSPIDK+LMQEFDTEKPVLELQRLSFCEEGYQQPNVSIGP 
Sbjct: 1441 DNGIPASFGLRIEQLQIINRSPIDKDLMQEFDTEKPVLELQRLSFCEEGYQQPNVSIGPT 1500

Query: 1501 EMLLLEKEARLIQRSDSSPTLPVKEDLSRFGSNNRGTPLQNGMLESQSLVPEENFQCGDI 1560
            E+L LEKEARLIQ SDSS TLPVKEDLSRFGSNNRGTPLQNGMLESQSLVPEENFQCGDI
Sbjct: 1501 EILRLEKEARLIQGSDSSSTLPVKEDLSRFGSNNRGTPLQNGMLESQSLVPEENFQCGDI 1560

Query: 1561 ELPMDTGKTDGMEEKGKLTLCSLHTPLTQTSHYLGADKDMPALEGFLMRSDDEEPCISVG 1620
            ELP DTGKTDGMEEKGKL LCSLHTPLTQTSHYLGADKDMPALEGFLMRSDDEEPCISVG
Sbjct: 1561 ELPTDTGKTDGMEEKGKLALCSLHTPLTQTSHYLGADKDMPALEGFLMRSDDEEPCISVG 1620

Query: 1621 GINFDKLDLSKCMIERASILEKICKSACINSTLSSPSESFRLNKVTDLYNSLPNGLLECM 1680
            GINFDKLDLSKCMIERASILEKICKSACINSTLSSPSESFRLNKVTDLYNSLPNGLLECM
Sbjct: 1621 GINFDKLDLSKCMIERASILEKICKSACINSTLSSPSESFRLNKVTDLYNSLPNGLLECM 1680

Query: 1681 DLKNNLLMNDQNKLLKDGSNSLNGEVNFSPHGSSFDCLQSFNSHSAGDLRKPFASPFGKL 1740
            DLKNNLLMNDQNKLLKDGSNSLNGEVN SPHGSSFDCLQSFN+HSAGDLRKPFASPFGKL
Sbjct: 1681 DLKNNLLMNDQNKLLKDGSNSLNGEVNCSPHGSSFDCLQSFNNHSAGDLRKPFASPFGKL 1740

Query: 1741 LDRNSLNLSSSGKRSGQNIELPCISEEAENTDEIDNEFSKDMRSSKRAPLVDITEDANVE 1800
            LDRNSLN SSSGKRS QNIELPCISEEAENTDEIDNEFSK MRSSKRAPLVDITEDANVE
Sbjct: 1741 LDRNSLNSSSSGKRSSQNIELPCISEEAENTDEIDNEFSKAMRSSKRAPLVDITEDANVE 1800

Query: 1801 VTVSEAAAVADRLSLESLNIELSNTRTHIGTKENLGNQKSSKRKYVNEAVSRDTLPGENG 1860
            VTVSEA AVADRLSLESLNIELSNTRTH GTKENLGNQKSSKRKYVNEAVSRD+LPGENG
Sbjct: 1801 VTVSEAVAVADRLSLESLNIELSNTRTHNGTKENLGNQKSSKRKYVNEAVSRDSLPGENG 1860

Query: 1861 AKRVTRSSYNIFSRSDLSCKKDFRKEGPRFSEKESKHRNIVSNITSFIPLVQQREAATIL 1869
            AKRVTRSSYN FSRSDLSCKKDFRKEGPRFSEKESKHRNIVSNITSFIPLVQQREAATIL
Sbjct: 1861 AKRVTRSSYNTFSRSDLSCKKDFRKEGPRFSEKESKHRNIVSNITSFIPLVQQREAATIL 1920

BLAST of Cp4.1LG06g08410 vs. ExPASy TrEMBL
Match: A0A6J1F7M4 (uncharacterized protein LOC111443094 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111443094 PE=3 SV=1)

HSP 1 Score: 3254 bits (8436), Expect = 0.0
Identity = 1772/2198 (80.62%), Positives = 1812/2198 (82.44%), Query Frame = 0

Query: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISYF 60
            MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNIS+F
Sbjct: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISHF 60

Query: 61   EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRSNEDAGSLNDDFDAGNRP 120
            EGV  SRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQR NEDAGSLNDDFDAG RP
Sbjct: 61   EGV--SRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRLNEDAGSLNDDFDAGIRP 120

Query: 121  AVLPQCNVSDARVFNCAPRVDTSPVSPQGRGGGVLENYQDPTLSRARLHRSKSRQRALEL 180
            AVLPQCNVSDARVFNCAPRVDT+PVSPQGRGG VLENYQDPTLSRARLHRSKSRQRALEL
Sbjct: 121  AVLPQCNVSDARVFNCAPRVDTTPVSPQGRGGVVLENYQDPTLSRARLHRSKSRQRALEL 180

Query: 181  RNSAKSARCHSRYENKND------------------------------------------ 240
            RNSAKSARCHSRYENKND                                          
Sbjct: 181  RNSAKSARCHSRYENKNDFVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSVEEETN 240

Query: 241  ------------------------------------------------------------ 300
                                                                        
Sbjct: 241  VCCEKNNISICSDKSRQRPLELRNSGKSSRCHSRYENKNDSIADGIVGSAISLLQADHED 300

Query: 301  ------------------------------------------------------------ 360
                                                                        
Sbjct: 301  ESELAKPSSSCKGIGSVEEETNVCCEKNNISICSDKSRQRPLELRNSGKSSRCHSRYENK 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  NDSIADGIVGSAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCCEKNNISICSDKSR 420

Query: 421  ------------------------SVADGIVGSAISLLQADHEDESELAKPSSSCKGIGS 480
                                    S+ADGIVGSAISLLQADHEDESELAKPSSSCKGIGS
Sbjct: 421  QRPLELRNSGKSSRCHSRYENKNDSIADGIVGSAISLLQADHEDESELAKPSSSCKGIGS 480

Query: 481  MEEETNVCCEQKNISTCADKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLL 540
            +EEETNVCCE+ NIS C+DKSRQR LELRNSVKSSRCHSRYENKNDS+ADGIVGSAISLL
Sbjct: 481  VEEETNVCCEKNNISICSDKSRQRPLELRNSVKSSRCHSRYENKNDSIADGIVGSAISLL 540

Query: 541  QADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSRQRALELRNSVKSSRCH 600
            QADHEDESELAKPSSSCKGIGS+EEET+VCCE+ NISICSDKSRQR LELRNSVKSSRCH
Sbjct: 541  QADHEDESELAKPSSSCKGIGSVEEETNVCCEKNNISICSDKSRQRPLELRNSVKSSRCH 600

Query: 601  SRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISI 660
            SRYENKNDS+ADGIVGSAISLLQADHEDESELAKPSSSCKGIGS+EEET+VCCEQKNISI
Sbjct: 601  SRYENKNDSIADGIVGSAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCCEQKNISI 660

Query: 661  CS---------------------------------------------------------- 720
            CS                                                          
Sbjct: 661  CSHKSRQRSLELRNSAKSARCHSPYENKNDSVADGIVGSAISSLRADHEDESELAKPSSS 720

Query: 721  ------------------------DKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVG 780
                                    DKSRQR LELRNSVKSSRCHSRYENKNDS+ADGIVG
Sbjct: 721  CKGIGSVEEETNVCCEKNNISICSDKSRQRPLELRNSVKSSRCHSRYENKNDSIADGIVG 780

Query: 781  SAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCCEQKKISICSGKVTIVGSPGLQSS 840
            SAISLLQADHEDESELAKPSSSCKGIGSVEEETNVC EQK ISICS KVTIVGSPGLQSS
Sbjct: 781  SAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCYEQKNISICSDKVTIVGSPGLQSS 840

Query: 841  SIDVVNSLNIYLENEGLCVAEGSMQNSYKVNEQFDSPRTSSGKIGYCEEGPACCRSQESN 900
            SIDVVNSLNIY+ENEGLCVAEGS +NSYKVNEQFDSP TSSGKIGYCEEGPA CRSQESN
Sbjct: 841  SIDVVNSLNIYIENEGLCVAEGSTRNSYKVNEQFDSPSTSSGKIGYCEEGPASCRSQESN 900

Query: 901  FDNAELSRLQCSSLDVDKSSRIPPEDGRECPIGGSKLHSDQVDEQLDLPKPSSDNVECCE 960
            FDNAELSRLQCSSLDVDKSSRIPPEDGR  PIGGSKLHSDQVDEQLDLPKPSSDNVECCE
Sbjct: 901  FDNAELSRLQCSSLDVDKSSRIPPEDGRGYPIGGSKLHSDQVDEQLDLPKPSSDNVECCE 960

Query: 961  EAKLVDCRSQECNLDNALQSKSQRSSLDVDDSACIDANDGRLLDSPNPSSSNVKCCEETV 1020
            EA LVDCRSQECNLDNALQS+SQRSS DVDDSACIDA DGRLLD  NPSS NVKCCEET+
Sbjct: 961  EAVLVDCRSQECNLDNALQSESQRSSPDVDDSACIDATDGRLLDLSNPSSGNVKCCEETI 1020

Query: 1021 VGHCRSQECNFDNAREAGSLYNSQDVDKSSYVHSEDGQSCPNGSSEVHSDEVKEQLDLSK 1080
            +GHCRSQECNFDNAREAGSLYNSQDVDKSSYVH ED +SCPNGSSEVHSDE+KE+LDLSK
Sbjct: 1021 LGHCRSQECNFDNAREAGSLYNSQDVDKSSYVHPEDRRSCPNGSSEVHSDELKERLDLSK 1080

Query: 1081 SSSDNMECCEEEILGDFRSQEYNFNNAQKSEMQHDTLDADNSSCFSSENGTCSVGSSKLH 1140
            SSSDNMECCEEEILGDFRSQEYNFNNAQKS MQH++LDADNSSCFSSENGT SVGSSKLH
Sbjct: 1081 SSSDNMECCEEEILGDFRSQEYNFNNAQKSGMQHNSLDADNSSCFSSENGTRSVGSSKLH 1140

Query: 1141 SDRVSEPSELFRPSSANVECHEVGLGDCRTQDCNFDNNAEKSGLDKISSSPITEVREKTS 1200
            SD+VSEPSELFRPSSAN+ECHE GLGDCRTQDCNFDNNAEKSGLDKISSSPITEVREKTS
Sbjct: 1141 SDQVSEPSELFRPSSANIECHEEGLGDCRTQDCNFDNNAEKSGLDKISSSPITEVREKTS 1200

Query: 1201 DKKPSTSVDNKRDVNEKEKCNSPLHMPMPQIQVDSVNEDKHHKGICESQSEKRYDKEVAT 1260
            DKKPSTSVD+KRDVNEKEKCNSPLHMPMPQIQVDS+NED++ KG+ ESQSEKRYDKEVAT
Sbjct: 1201 DKKPSTSVDDKRDVNEKEKCNSPLHMPMPQIQVDSLNEDEYDKGVYESQSEKRYDKEVAT 1260

Query: 1261 CSLLQSDEPVEQNISLKDGVPNLQYSHENAVEIQLVDTDDASILIRDTETFRDQMVMAPC 1320
            CSLLQSDEP EQNISLKDGVPNLQYSHENAVEIQLVDTDDASILIRDTETFRDQMVMAPC
Sbjct: 1261 CSLLQSDEPAEQNISLKDGVPNLQYSHENAVEIQLVDTDDASILIRDTETFRDQMVMAPC 1320

Query: 1321 VPSAGEGDSNLEQKQKSSGITQCEDSDSFEGCTDHMVMAPCVPSAGEGDSNLEQPLKSSG 1380
            VPSAGEGDSNLEQ+ KSSGITQCEDS SFEGCTDHMVMAPCVP AGEGDSNLE+PLKSSG
Sbjct: 1321 VPSAGEGDSNLEQQLKSSGITQCEDSGSFEGCTDHMVMAPCVPCAGEGDSNLEKPLKSSG 1380

Query: 1381 ITQCEDSDSFEGFTEHLNGNHHYVSTECQTAETSIKSKTFSSVLRASSSDEKEIEVELQL 1440
            ITQCEDSDSFEG TE  NGNHHYVSTECQTAETSI+ KTFSSVLRASSS+EKEIEVELQL
Sbjct: 1381 ITQCEDSDSFEGCTEQ-NGNHHYVSTECQTAETSIELKTFSSVLRASSSNEKEIEVELQL 1440

Query: 1441 DNGIPASLGLRSEQLQI-NRSPIDKNLMQEFDTEKPVLELQRLSFCEEGYQQPNVSIGPI 1500
            DNGIPAS GLR EQLQI NRSPIDK+LMQEFDTEKPVLELQRLSFCEEGYQQPNVSIGP 
Sbjct: 1441 DNGIPASFGLRIEQLQIINRSPIDKDLMQEFDTEKPVLELQRLSFCEEGYQQPNVSIGPT 1500

Query: 1501 EMLLLEKEARLIQRSDSSPTLPVKEDLSRFGSNNRGTPLQNGMLESQSLVPEENFQCGDI 1560
            E+L LEKEARLIQ SDSS TLPVKEDLSRFGSNNRGTPLQNGMLESQSLVPEENFQCGDI
Sbjct: 1501 EILRLEKEARLIQGSDSSSTLPVKEDLSRFGSNNRGTPLQNGMLESQSLVPEENFQCGDI 1560

Query: 1561 ELPMDTGKTDGMEEKGKLTLCSLHTPLTQTSHYLGADKDMPALEGFLMRSDDEEPCISVG 1620
            ELP DTGKTDGMEEKGKL LCSLHTPLTQTSHYLGADKDMPALEGFLMRSDDEEPCISVG
Sbjct: 1561 ELPTDTGKTDGMEEKGKLALCSLHTPLTQTSHYLGADKDMPALEGFLMRSDDEEPCISVG 1620

Query: 1621 GINFDKLDLSKCMIERASILEKICKSACINSTLSSPSESFRLNKVTDLYNSLPNGLLECM 1680
            GINFDKLDLSKCMIERASILEKICKSACINSTLSSPSESFRLNKVTDLYNSLPNGLLECM
Sbjct: 1621 GINFDKLDLSKCMIERASILEKICKSACINSTLSSPSESFRLNKVTDLYNSLPNGLLECM 1680

Query: 1681 DLKNNLLMNDQNKLLKDGSNSLNGEVNFSPHGSSFDCLQSFNSHSAGDLRKPFASPFGKL 1740
            DLKNNLLMNDQNKLLKDGSNSLNGEVN SPHGSSFDCLQSFN+HSAGDLRKPFASPFGKL
Sbjct: 1681 DLKNNLLMNDQNKLLKDGSNSLNGEVNCSPHGSSFDCLQSFNNHSAGDLRKPFASPFGKL 1740

Query: 1741 LDRNSLNLSSSGKRSGQNIELPCISEEAENTDEIDNEFSKDMRSSKRAPLVDITEDANVE 1800
            LDRNSLN SSSGKRS QNIELPCISEEAENTDEIDNEFSK MRSSKRAPLVDITEDANVE
Sbjct: 1741 LDRNSLNSSSSGKRSSQNIELPCISEEAENTDEIDNEFSKAMRSSKRAPLVDITEDANVE 1800

Query: 1801 VTVSEAAAVADRLSLESLNIELSNTRTHIGTKENLGNQKSSKRKYVNEAVSRDTLPGENG 1860
            VTVSEA AVADRLSLESLNIELSNTRTH GTKENLGNQKSSKRKYVNEAVSRD+LPGENG
Sbjct: 1801 VTVSEAVAVADRLSLESLNIELSNTRTHNGTKENLGNQKSSKRKYVNEAVSRDSLPGENG 1860

Query: 1861 AKRVTRSSYNIFSRSDLSCKKDFRKEGPRFSEKESKHRNIVSNITSFIPLVQQREAATIL 1869
            AKRVTRSSYN FSRSDLSCKKDFRKEGPRFSEKESKHRNIVSNITSFIPLVQQREAATIL
Sbjct: 1861 AKRVTRSSYNTFSRSDLSCKKDFRKEGPRFSEKESKHRNIVSNITSFIPLVQQREAATIL 1920

BLAST of Cp4.1LG06g08410 vs. ExPASy TrEMBL
Match: A0A6J1F8M1 (uncharacterized protein LOC111443094 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111443094 PE=3 SV=1)

HSP 1 Score: 3174 bits (8230), Expect = 0.0
Identity = 1738/2198 (79.07%), Positives = 1777/2198 (80.85%), Query Frame = 0

Query: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISYF 60
            MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNIS+F
Sbjct: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISHF 60

Query: 61   EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRSNEDAGSLNDDFDAGNRP 120
            EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQR NEDAGSLNDDFDAG RP
Sbjct: 61   EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRLNEDAGSLNDDFDAGIRP 120

Query: 121  AVLPQCNVSDARVFNCAPRVDTSPVSPQGRGGGVLENYQDPTLSRARLHRSKSRQRALEL 180
            AVLPQCNVSDARVFNCAPRVDT+PVSPQGRGG VLENYQDPTLSRARLHRSKSRQRALEL
Sbjct: 121  AVLPQCNVSDARVFNCAPRVDTTPVSPQGRGGVVLENYQDPTLSRARLHRSKSRQRALEL 180

Query: 181  RNSAKSARCHSRYENKND------------------------------------------ 240
            RNSAKSARCHSRYENKND                                          
Sbjct: 181  RNSAKSARCHSRYENKNDFVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSVEEETN 240

Query: 241  ------------------------------------------------------------ 300
                                                                        
Sbjct: 241  VCCEKNNISICSDKSRQRPLELRNSGKSSRCHSRYENKNDSIADGIVGSAISLLQADHED 300

Query: 301  ------------------------------------------------------------ 360
                                                                        
Sbjct: 301  ESELAKPSSSCKGIGSVEEETNVCCEKNNISICSDKSRQRPLELRNSGKSSRCHSRYENK 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  NDSIADGIVGSAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCCEKNNISICSDKSR 420

Query: 421  ------------------------SVADGIVGSAISLLQADHEDESELAKPSSSCKGIGS 480
                                    S+ADGIVGSAISLLQADHEDESELAKPSSSCKGIGS
Sbjct: 421  QRPLELRNSGKSSRCHSRYENKNDSIADGIVGSAISLLQADHEDESELAKPSSSCKGIGS 480

Query: 481  MEEETNVCCEQKNISTCADKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLL 540
            +EEETNVCCE+ NIS C+DKSRQR LELRNSVKSSRCHSRYENKNDS+ADGIVGSAISLL
Sbjct: 481  VEEETNVCCEKNNISICSDKSRQRPLELRNSVKSSRCHSRYENKNDSIADGIVGSAISLL 540

Query: 541  QADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSRQRALELRNSVKSSRCH 600
            QADHEDESELAKPSSSCKGIGS+EEET+VCCE+ NISICSDKSRQR LELRNSVKSSRCH
Sbjct: 541  QADHEDESELAKPSSSCKGIGSVEEETNVCCEKNNISICSDKSRQRPLELRNSVKSSRCH 600

Query: 601  SRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISI 660
            SRYENKNDS+ADGIVGSAISLLQADHEDESELAKPSSSCKGIGS+EEET+VCCEQKNISI
Sbjct: 601  SRYENKNDSIADGIVGSAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCCEQKNISI 660

Query: 661  CS---------------------------------------------------------- 720
            CS                                                          
Sbjct: 661  CSHKSRQRSLELRNSAKSARCHSPYENKNDSVADGIVGSAISSLRADHEDESELAKPSSS 720

Query: 721  ------------------------DKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVG 780
                                    DKSRQR LELRNSVKSSRCHSRYENKNDS+ADGIVG
Sbjct: 721  CKGIGSVEEETNVCCEKNNISICSDKSRQRPLELRNSVKSSRCHSRYENKNDSIADGIVG 780

Query: 781  SAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCCEQKKISICSGKVTIVGSPGLQSS 840
            SAISLLQADHEDESELAKPSSSCKGIGSVEEETNVC EQK ISICS KVTIVGSPGLQSS
Sbjct: 781  SAISLLQADHEDESELAKPSSSCKGIGSVEEETNVCYEQKNISICSDKVTIVGSPGLQSS 840

Query: 841  SIDVVNSLNIYLENEGLCVAEGSMQNSYKVNEQFDSPRTSSGKIGYCEEGPACCRSQESN 900
            SIDVVNSLNIY+ENEGLCVAEGS +NSYKVNEQFDSP TSSGKIGYCEEGPA CRSQESN
Sbjct: 841  SIDVVNSLNIYIENEGLCVAEGSTRNSYKVNEQFDSPSTSSGKIGYCEEGPASCRSQESN 900

Query: 901  FDNAELSRLQCSSLDVDKSSRIPPEDGRECPIGGSKLHSDQVDEQLDLPKPSSDNVECCE 960
            FDNAELSRLQCSSLDVDKSSRIPPEDGR  PIGGSKLHSDQVDEQLDLPKPSSDNVECCE
Sbjct: 901  FDNAELSRLQCSSLDVDKSSRIPPEDGRGYPIGGSKLHSDQVDEQLDLPKPSSDNVECCE 960

Query: 961  EAKLVDCRSQECNLDNALQSKSQRSSLDVDDSACIDANDGRLLDSPNPSSSNVKCCEETV 1020
            EA LVDCRSQECNLDNALQS+SQRSS DVDDSACIDA DGRLLD  NPSS NVKCCEET+
Sbjct: 961  EAVLVDCRSQECNLDNALQSESQRSSPDVDDSACIDATDGRLLDLSNPSSGNVKCCEETI 1020

Query: 1021 VGHCRSQECNFDNAREAGSLYNSQDVDKSSYVHSEDGQSCPNGSSEVHSDEVKEQLDLSK 1080
            +GHCRSQECNFDNAREAGSLYNSQDVDKSSYVH ED +SCPNGSSEVHSDE+KE+LDLSK
Sbjct: 1021 LGHCRSQECNFDNAREAGSLYNSQDVDKSSYVHPEDRRSCPNGSSEVHSDELKERLDLSK 1080

Query: 1081 SSSDNMECCEEEILGDFRSQEYNFNNAQKSEMQHDTLDADNSSCFSSENGTCSVGSSKLH 1140
            SSSDNMECCEEEILGDFRSQEYNFNNAQKS MQH++LDADNSSCFSSENGT SVGSSKLH
Sbjct: 1081 SSSDNMECCEEEILGDFRSQEYNFNNAQKSGMQHNSLDADNSSCFSSENGTRSVGSSKLH 1140

Query: 1141 SDRVSEPSELFRPSSANVECHEVGLGDCRTQDCNFDNNAEKSGLDKISSSPITEVREKTS 1200
            SD+VSEPSELFRPSSAN+ECHE GLGDCRTQDCNFDNNAEKSGLDKISSSPITEVREKTS
Sbjct: 1141 SDQVSEPSELFRPSSANIECHEEGLGDCRTQDCNFDNNAEKSGLDKISSSPITEVREKTS 1200

Query: 1201 DKKPSTSVDNKRDVNEKEKCNSPLHMPMPQIQVDSVNEDKHHKGICESQSEKRYDKEVAT 1260
            DKKPSTSVD+KRDVNEKEKCNSPLHMPMPQIQVDS+NED++ KG+ ESQSEKRYDKEVAT
Sbjct: 1201 DKKPSTSVDDKRDVNEKEKCNSPLHMPMPQIQVDSLNEDEYDKGVYESQSEKRYDKEVAT 1260

Query: 1261 CSLLQSDEPVEQNISLKDGVPNLQYSHENAVEIQLVDTDDASILIRDTETFRDQMVMAPC 1320
            CSLLQSDEP EQNISLKDGVPNLQYSHENAVEIQLVDTDDASILIRDTETFRDQMVMAPC
Sbjct: 1261 CSLLQSDEPAEQNISLKDGVPNLQYSHENAVEIQLVDTDDASILIRDTETFRDQMVMAPC 1320

Query: 1321 VPSAGEGDSNLEQKQKSSGITQCEDSDSFEGCTDHMVMAPCVPSAGEGDSNLEQPLKSSG 1380
            VPSAGEGDSNLEQ+ KSSGITQCEDS SFEGCTDHM                        
Sbjct: 1321 VPSAGEGDSNLEQQLKSSGITQCEDSGSFEGCTDHM------------------------ 1380

Query: 1381 ITQCEDSDSFEGFTEHLNGNHHYVSTECQTAETSIKSKTFSSVLRASSSDEKEIEVELQL 1440
                             NGNHHYVSTECQTAETSI+ KTFSSVLRASSS+EKEIEVELQL
Sbjct: 1381 -----------------NGNHHYVSTECQTAETSIELKTFSSVLRASSSNEKEIEVELQL 1440

Query: 1441 DNGIPASLGLRSEQLQI-NRSPIDKNLMQEFDTEKPVLELQRLSFCEEGYQQPNVSIGPI 1500
            DNGIPAS GLR EQLQI NRSPIDK+LMQEFDTEKPVLELQRLSFCEEGYQQPNVSIGP 
Sbjct: 1441 DNGIPASFGLRIEQLQIINRSPIDKDLMQEFDTEKPVLELQRLSFCEEGYQQPNVSIGPT 1500

Query: 1501 EMLLLEKEARLIQRSDSSPTLPVKEDLSRFGSNNRGTPLQNGMLESQSLVPEENFQCGDI 1560
            E+L LEKEARLIQ SDSS TLPVKEDLSRFGSNNRGTPLQNGMLESQSLVPEENFQCGDI
Sbjct: 1501 EILRLEKEARLIQGSDSSSTLPVKEDLSRFGSNNRGTPLQNGMLESQSLVPEENFQCGDI 1560

Query: 1561 ELPMDTGKTDGMEEKGKLTLCSLHTPLTQTSHYLGADKDMPALEGFLMRSDDEEPCISVG 1620
            ELP DTGKTDGMEEKGKL LCSLHTPLTQTSHYLGADKDMPALEGFLMRSDDEEPCISVG
Sbjct: 1561 ELPTDTGKTDGMEEKGKLALCSLHTPLTQTSHYLGADKDMPALEGFLMRSDDEEPCISVG 1620

Query: 1621 GINFDKLDLSKCMIERASILEKICKSACINSTLSSPSESFRLNKVTDLYNSLPNGLLECM 1680
            GINFDKLDLSKCMIERASILEKICKSACINSTLSSPSESFRLNKVTDLYNSLPNGLLECM
Sbjct: 1621 GINFDKLDLSKCMIERASILEKICKSACINSTLSSPSESFRLNKVTDLYNSLPNGLLECM 1680

Query: 1681 DLKNNLLMNDQNKLLKDGSNSLNGEVNFSPHGSSFDCLQSFNSHSAGDLRKPFASPFGKL 1740
            DLKNNLLMNDQNKLLKDGSNSLNGEVN SPHGSSFDCLQSFN+HSAGDLRKPFASPFGKL
Sbjct: 1681 DLKNNLLMNDQNKLLKDGSNSLNGEVNCSPHGSSFDCLQSFNNHSAGDLRKPFASPFGKL 1740

Query: 1741 LDRNSLNLSSSGKRSGQNIELPCISEEAENTDEIDNEFSKDMRSSKRAPLVDITEDANVE 1800
            LDRNSLN SSSGKRS QNIELPCISEEAENTDEIDNEFSK MRSSKRAPLVDITEDANVE
Sbjct: 1741 LDRNSLNSSSSGKRSSQNIELPCISEEAENTDEIDNEFSKAMRSSKRAPLVDITEDANVE 1800

Query: 1801 VTVSEAAAVADRLSLESLNIELSNTRTHIGTKENLGNQKSSKRKYVNEAVSRDTLPGENG 1860
            VTVSEA AVADRLSLESLNIELSNTRTH GTKENLGNQKSSKRKYVNEAVSRD+LPGENG
Sbjct: 1801 VTVSEAVAVADRLSLESLNIELSNTRTHNGTKENLGNQKSSKRKYVNEAVSRDSLPGENG 1860

Query: 1861 AKRVTRSSYNIFSRSDLSCKKDFRKEGPRFSEKESKHRNIVSNITSFIPLVQQREAATIL 1869
            AKRVTRSSYN FSRSDLSCKKDFRKEGPRFSEKESKHRNIVSNITSFIPLVQQREAATIL
Sbjct: 1861 AKRVTRSSYNTFSRSDLSCKKDFRKEGPRFSEKESKHRNIVSNITSFIPLVQQREAATIL 1920

BLAST of Cp4.1LG06g08410 vs. ExPASy TrEMBL
Match: A0A6J1IL11 (uncharacterized protein LOC111476581 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111476581 PE=3 SV=1)

HSP 1 Score: 3010 bits (7803), Expect = 0.0
Identity = 1610/1870 (86.10%), Positives = 1647/1870 (88.07%), Query Frame = 0

Query: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISYF 60
            MAAMEKLFVQIFERKKWIIDQAKHQ DLFDQQLASKLIIDGIVPP WLHSPFLHSNIS+F
Sbjct: 1    MAAMEKLFVQIFERKKWIIDQAKHQTDLFDQQLASKLIIDGIVPPTWLHSPFLHSNISHF 60

Query: 61   EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRSNEDAGSLNDDFDAGNRP 120
            EGVGVSRNFVPGVEVPRSPLQTH SSLNEAFVANSGEELQQRSNEDAGSLNDDFDAG RP
Sbjct: 61   EGVGVSRNFVPGVEVPRSPLQTHRSSLNEAFVANSGEELQQRSNEDAGSLNDDFDAGIRP 120

Query: 121  AVLPQCNVSDARVFNCAPRVDTSPVSPQGRGGGVLENYQDPTLSRARLHRSKSRQRALEL 180
             VLPQC+ SDA V NCA RVDTSPVSPQGRGG VLENYQDPTLSRARLHRSKSRQRALEL
Sbjct: 121  PVLPQCDTSDACVLNCATRVDTSPVSPQGRGGVVLENYQDPTLSRARLHRSKSRQRALEL 180

Query: 181  RNSAKSARCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETN 240
            RNSAKSARCHSRYENKNDSVADG+ GSAISLLQAD EDES                    
Sbjct: 181  RNSAKSARCHSRYENKNDSVADGLGGSAISLLQADDEDES-------------------- 240

Query: 241  VCCEQKNISTCADKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHED 300
                                                                        
Sbjct: 241  ------------------------------------------------------------ 300

Query: 301  ESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSRQRALELRNSVKSSRCHSRYENK 360
                                                                        
Sbjct: 301  ------------------------------------------------------------ 360

Query: 361  NDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSR 420
                                     LAKPSSSCKGIGSMEEET+VCCEQKNISICSDKSR
Sbjct: 361  -------------------------LAKPSSSCKGIGSMEEETNVCCEQKNISICSDKSR 420

Query: 421  QRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGS 480
            QRALELRNSVKSSRC+SRYEN+NDSVADGIV SAISLLQADHEDESELAKPSSSCKGIGS
Sbjct: 421  QRALELRNSVKSSRCNSRYENENDSVADGIVRSAISLLQADHEDESELAKPSSSCKGIGS 480

Query: 481  VEEETNVCCEQKKISICSGKVTIVGSPGLQSSSIDVVNSLNIYLENEGLCVAEGSMQNSY 540
            VEEETNVCCEQK ISICS KVTIVGSPGLQSSSID+VNSLNIYLENEGLCVAEGSMQNSY
Sbjct: 481  VEEETNVCCEQKNISICSDKVTIVGSPGLQSSSIDLVNSLNIYLENEGLCVAEGSMQNSY 540

Query: 541  KVNEQFDSPRTSSGKIGYCEEGPACCRSQESNFDNAELSRLQCSSLDVDKSSRIPPEDGR 600
            KVNEQFDSPRTSSGKIGYCEEGPACCRSQESNFDNAELSR+QCSSLDVDKS RIPPEDGR
Sbjct: 541  KVNEQFDSPRTSSGKIGYCEEGPACCRSQESNFDNAELSRMQCSSLDVDKSPRIPPEDGR 600

Query: 601  ECPIGGSKLHSDQVDEQLDLPKPSSDNVECCEEAKLVDCRSQECNLDNALQSKSQRSSLD 660
             CPIGGSKLHSDQVDEQLDLPKPSSDNVEC EEA LVDCRSQECNLDNALQS+SQRSSLD
Sbjct: 601  GCPIGGSKLHSDQVDEQLDLPKPSSDNVECGEEAVLVDCRSQECNLDNALQSESQRSSLD 660

Query: 661  VDDSACIDANDGRLLDSPNPSSSNVKCCEETVVGHCRSQECNFDNAREAGSLYNSQDVDK 720
            VDDSACIDA DGRLLD  NPSS NVKC EET++GHCRS E NFDNAREAG LYNSQDVDK
Sbjct: 661  VDDSACIDATDGRLLDLSNPSSGNVKCREETLLGHCRSHEFNFDNAREAGLLYNSQDVDK 720

Query: 721  SSYVHSEDGQSCPNGSSEVHSDEVKEQLDLSKSSSDNMECCEEEILGDFRSQEYNFNNAQ 780
            SSYVHSEDG+SCPNGSSEVHSDE+KEQLDLSKSSSDNMECCEEEILGDFR+QEYNFNNAQ
Sbjct: 721  SSYVHSEDGRSCPNGSSEVHSDELKEQLDLSKSSSDNMECCEEEILGDFRNQEYNFNNAQ 780

Query: 781  KSEMQHDTLDADNSSCFSSENGTCSVGSSKLHSDRVSEPSELFRPSSANVECHEVGLGDC 840
            KS MQH++LDADNSSCFSSENGTCSVGSSKLHSDRVSEP ELFR SS NVECHE GLG+C
Sbjct: 781  KSGMQHNSLDADNSSCFSSENGTCSVGSSKLHSDRVSEPLELFRSSSTNVECHEEGLGNC 840

Query: 841  RTQDCNFDNNAEKSGLDKISSSPITEVREKTSDKKPSTSVDNKRDVNEKEKCNSPLHMPM 900
            RTQDCNFDN AEKSGLDKISSSPITEVREKTSDKKPSTSVDNKRDVNEKEKCNS LHMP+
Sbjct: 841  RTQDCNFDNTAEKSGLDKISSSPITEVREKTSDKKPSTSVDNKRDVNEKEKCNSSLHMPI 900

Query: 901  PQIQVDSVNEDKHHKGICESQSEKRYDKEVATCSLLQSDEPVEQNISLKDGVPNLQYSHE 960
            PQIQVDS+NED++ K + ESQSEKRYDKEVATCSLLQSDEP EQ ISLKDGVPNLQYSHE
Sbjct: 901  PQIQVDSLNEDEYDKDVYESQSEKRYDKEVATCSLLQSDEPAEQKISLKDGVPNLQYSHE 960

Query: 961  NAVEIQLVDTDDASILIRDTETFRDQMVMAPCVPSAGEGDSNLEQKQKSSGITQCEDSDS 1020
            NAVEIQLVDTDDASILIRDTETFRDQMVMAPCVPSAGEGDSNLEQ+ KSSGITQCEDSDS
Sbjct: 961  NAVEIQLVDTDDASILIRDTETFRDQMVMAPCVPSAGEGDSNLEQQLKSSGITQCEDSDS 1020

Query: 1021 FEGCTDHMVMAPCVPSAGEGDSNLEQPLKSSGITQCEDSDSFEGFTEHLNGNHHYVSTEC 1080
            FEGCTDHMVMAPCVPSAGEGDSNLE+PLKSS ITQCEDSDSFEG T+HLNGNHHY+STEC
Sbjct: 1021 FEGCTDHMVMAPCVPSAGEGDSNLEKPLKSSSITQCEDSDSFEGCTDHLNGNHHYLSTEC 1080

Query: 1081 QTAETSIKSKTFSSVLRASSSDEKEIEVELQLDNGIPASLGLRSEQLQI-NRSPIDKNLM 1140
            QTAETSI+ KTFSSVLRASSSD+KEIEVELQLDNGIPASLGLRSEQLQI NRSPIDKNLM
Sbjct: 1081 QTAETSIELKTFSSVLRASSSDQKEIEVELQLDNGIPASLGLRSEQLQIINRSPIDKNLM 1140

Query: 1141 QEFDTEKPVLELQRLSFCEEGYQQPNVSIGPIEMLLLEKEARLIQRSDSSPTLPVKEDLS 1200
            QEFDTEKPVLELQRLSFCEEGYQQPNVSIGPIEMLLLEKEA +IQ SDSSPTLPVKEDLS
Sbjct: 1141 QEFDTEKPVLELQRLSFCEEGYQQPNVSIGPIEMLLLEKEAHIIQGSDSSPTLPVKEDLS 1200

Query: 1201 RFGSNNRGTPLQNGMLESQSLVPEENFQCGDIELPMDTGKTDGMEEKGKLTLCSLHTPLT 1260
            RFGSNNRGTPLQNGMLESQSLVPEENFQCGDIELPMDTGKTDGMEEKGKLTLCSLHTPLT
Sbjct: 1201 RFGSNNRGTPLQNGMLESQSLVPEENFQCGDIELPMDTGKTDGMEEKGKLTLCSLHTPLT 1260

Query: 1261 QTSHYLGADKDMPALEGFLMRSDDEEPCISVGGINFDKLDLSKCMIERASILEKICKSAC 1320
            QTSHYLGADKDMPALEGFLM+SDDEEPCISVGGINFDKLDLSKCMIERASILEKICKSAC
Sbjct: 1261 QTSHYLGADKDMPALEGFLMQSDDEEPCISVGGINFDKLDLSKCMIERASILEKICKSAC 1320

Query: 1321 INSTLSSPSESFRLNKVTDLYNSLPNGLLECMDLKNNLLMNDQNKLLKDGSNSLNGEVNF 1380
            +NSTLSSPSESFRLNKVTDLYNSLPNGLLECMDLKNNLLMNDQNKLLK+GSN LNGEVN 
Sbjct: 1321 MNSTLSSPSESFRLNKVTDLYNSLPNGLLECMDLKNNLLMNDQNKLLKNGSNFLNGEVNC 1380

Query: 1381 SPHGSSFDCLQSFNSHSAGDLRKPFASPFGKLLDRNSLNLSSSGKRSGQNIELPCISEEA 1440
            SPHGSSFDCLQSF+SHSAGDLRKPFASPFGKLLDRNSLN SSSGKRS QNIELPCISEEA
Sbjct: 1381 SPHGSSFDCLQSFSSHSAGDLRKPFASPFGKLLDRNSLNSSSSGKRSSQNIELPCISEEA 1440

Query: 1441 ENTDEIDNEFSKDMRSSKRAPLVDITEDANVEVTVSEAAAVADRLSLESLNIELSNTRTH 1500
            ENTDEIDNEFSKD+RSSKRAPLVDITEDANV+VTVSEAA VADRLSLESL IELSNT TH
Sbjct: 1441 ENTDEIDNEFSKDIRSSKRAPLVDITEDANVKVTVSEAATVADRLSLESLIIELSNTGTH 1500

Query: 1501 IGTKENLGNQKSSKRKYVNEAVSRDTLPGENGAKRVTRSSYNIFSRSDLSCKKDFRKEGP 1560
            IGTKENLGNQKSSKRKYVNEAVSRDTLPGENGAKRVTRSSYN FSRSDLSCKKDFRKEGP
Sbjct: 1501 IGTKENLGNQKSSKRKYVNEAVSRDTLPGENGAKRVTRSSYNTFSRSDLSCKKDFRKEGP 1560

Query: 1561 RFSEKESKHRNIVSNITSFIPLVQQREAATILKGKRDIKVKAIEAAEAAKRLAEKKENER 1620
            RFSEKESKHRNIVSNITSFIPLVQQREAATILKGKRDIKVKAIEAAEAAKR+AEKKENER
Sbjct: 1561 RFSEKESKHRNIVSNITSFIPLVQQREAATILKGKRDIKVKAIEAAEAAKRVAEKKENER 1620

Query: 1621 QMKKEALKLERARMEQENLRQIELDKKKKEEERKKKEEERKKKEVDMAAKKRQREEEERK 1680
            QMKKEALKLERARMEQENLRQIELDKKKKEEERKKKEEERKKKEVDMAAKKRQREEEERK
Sbjct: 1621 QMKKEALKLERARMEQENLRQIELDKKKKEEERKKKEEERKKKEVDMAAKKRQREEEERK 1680

Query: 1681 EKERKRMRVEEVRRRLREHGGKLRSDKENKEAKPQANDQKPRDRKGCKDGTVKLVKESGH 1740
            EKERKRMRVEEVRRRLREHGGKLRSDKENKEAKPQANDQKPRDRKGCKDGTVKLVKESGH
Sbjct: 1681 EKERKRMRVEEVRRRLREHGGKLRSDKENKEAKPQANDQKPRDRKGCKDGTVKLVKESGH 1705

Query: 1741 DSFHKLSVTESKTTSTSDAVRGSFVVEDSQPTSVDFLEAEALENVMEHRISETSEEQSYQ 1800
            DSFHK SVTESKTTSTSDAVRGSFVVEDSQPTSVDFLEAEALENVMEHRI ETSEEQSYQ
Sbjct: 1741 DSFHKFSVTESKTTSTSDAVRGSFVVEDSQPTSVDFLEAEALENVMEHRIYETSEEQSYQ 1705

Query: 1801 ISPYKASDDEDEEDDDDGIQNNKFVPSWASKDRLAVLFASQKKLDPEIIFPPKSFCDIAE 1860
            ISPYKASDDEDEEDDDDGIQNNKFVPSWASKDRLAVLFAS +KLDPEIIFPPKSFCDIAE
Sbjct: 1801 ISPYKASDDEDEEDDDDGIQNNKFVPSWASKDRLAVLFASHQKLDPEIIFPPKSFCDIAE 1705

Query: 1861 VLLPRQHQSK 1869
            VLLPRQHQ K
Sbjct: 1861 VLLPRQHQFK 1705

BLAST of Cp4.1LG06g08410 vs. ExPASy TrEMBL
Match: A0A6J1IMH2 (uncharacterized protein LOC111476581 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111476581 PE=3 SV=1)

HSP 1 Score: 3001 bits (7780), Expect = 0.0
Identity = 1608/1870 (85.99%), Positives = 1645/1870 (87.97%), Query Frame = 0

Query: 1    MAAMEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISYF 60
            MAAMEKLFVQIFERKKWIIDQAKHQ DLFDQQLASKLIIDGIVPP WLHSPFLHSNIS+F
Sbjct: 1    MAAMEKLFVQIFERKKWIIDQAKHQTDLFDQQLASKLIIDGIVPPTWLHSPFLHSNISHF 60

Query: 61   EGVGVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQRSNEDAGSLNDDFDAGNRP 120
            EGV  SRNFVPGVEVPRSPLQTH SSLNEAFVANSGEELQQRSNEDAGSLNDDFDAG RP
Sbjct: 61   EGV--SRNFVPGVEVPRSPLQTHRSSLNEAFVANSGEELQQRSNEDAGSLNDDFDAGIRP 120

Query: 121  AVLPQCNVSDARVFNCAPRVDTSPVSPQGRGGGVLENYQDPTLSRARLHRSKSRQRALEL 180
             VLPQC+ SDA V NCA RVDTSPVSPQGRGG VLENYQDPTLSRARLHRSKSRQRALEL
Sbjct: 121  PVLPQCDTSDACVLNCATRVDTSPVSPQGRGGVVLENYQDPTLSRARLHRSKSRQRALEL 180

Query: 181  RNSAKSARCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETN 240
            RNSAKSARCHSRYENKNDSVADG+ GSAISLLQAD EDES                    
Sbjct: 181  RNSAKSARCHSRYENKNDSVADGLGGSAISLLQADDEDES-------------------- 240

Query: 241  VCCEQKNISTCADKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHED 300
                                                                        
Sbjct: 241  ------------------------------------------------------------ 300

Query: 301  ESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSRQRALELRNSVKSSRCHSRYENK 360
                                                                        
Sbjct: 301  ------------------------------------------------------------ 360

Query: 361  NDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSR 420
                                     LAKPSSSCKGIGSMEEET+VCCEQKNISICSDKSR
Sbjct: 361  -------------------------LAKPSSSCKGIGSMEEETNVCCEQKNISICSDKSR 420

Query: 421  QRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGS 480
            QRALELRNSVKSSRC+SRYEN+NDSVADGIV SAISLLQADHEDESELAKPSSSCKGIGS
Sbjct: 421  QRALELRNSVKSSRCNSRYENENDSVADGIVRSAISLLQADHEDESELAKPSSSCKGIGS 480

Query: 481  VEEETNVCCEQKKISICSGKVTIVGSPGLQSSSIDVVNSLNIYLENEGLCVAEGSMQNSY 540
            VEEETNVCCEQK ISICS KVTIVGSPGLQSSSID+VNSLNIYLENEGLCVAEGSMQNSY
Sbjct: 481  VEEETNVCCEQKNISICSDKVTIVGSPGLQSSSIDLVNSLNIYLENEGLCVAEGSMQNSY 540

Query: 541  KVNEQFDSPRTSSGKIGYCEEGPACCRSQESNFDNAELSRLQCSSLDVDKSSRIPPEDGR 600
            KVNEQFDSPRTSSGKIGYCEEGPACCRSQESNFDNAELSR+QCSSLDVDKS RIPPEDGR
Sbjct: 541  KVNEQFDSPRTSSGKIGYCEEGPACCRSQESNFDNAELSRMQCSSLDVDKSPRIPPEDGR 600

Query: 601  ECPIGGSKLHSDQVDEQLDLPKPSSDNVECCEEAKLVDCRSQECNLDNALQSKSQRSSLD 660
             CPIGGSKLHSDQVDEQLDLPKPSSDNVEC EEA LVDCRSQECNLDNALQS+SQRSSLD
Sbjct: 601  GCPIGGSKLHSDQVDEQLDLPKPSSDNVECGEEAVLVDCRSQECNLDNALQSESQRSSLD 660

Query: 661  VDDSACIDANDGRLLDSPNPSSSNVKCCEETVVGHCRSQECNFDNAREAGSLYNSQDVDK 720
            VDDSACIDA DGRLLD  NPSS NVKC EET++GHCRS E NFDNAREAG LYNSQDVDK
Sbjct: 661  VDDSACIDATDGRLLDLSNPSSGNVKCREETLLGHCRSHEFNFDNAREAGLLYNSQDVDK 720

Query: 721  SSYVHSEDGQSCPNGSSEVHSDEVKEQLDLSKSSSDNMECCEEEILGDFRSQEYNFNNAQ 780
            SSYVHSEDG+SCPNGSSEVHSDE+KEQLDLSKSSSDNMECCEEEILGDFR+QEYNFNNAQ
Sbjct: 721  SSYVHSEDGRSCPNGSSEVHSDELKEQLDLSKSSSDNMECCEEEILGDFRNQEYNFNNAQ 780

Query: 781  KSEMQHDTLDADNSSCFSSENGTCSVGSSKLHSDRVSEPSELFRPSSANVECHEVGLGDC 840
            KS MQH++LDADNSSCFSSENGTCSVGSSKLHSDRVSEP ELFR SS NVECHE GLG+C
Sbjct: 781  KSGMQHNSLDADNSSCFSSENGTCSVGSSKLHSDRVSEPLELFRSSSTNVECHEEGLGNC 840

Query: 841  RTQDCNFDNNAEKSGLDKISSSPITEVREKTSDKKPSTSVDNKRDVNEKEKCNSPLHMPM 900
            RTQDCNFDN AEKSGLDKISSSPITEVREKTSDKKPSTSVDNKRDVNEKEKCNS LHMP+
Sbjct: 841  RTQDCNFDNTAEKSGLDKISSSPITEVREKTSDKKPSTSVDNKRDVNEKEKCNSSLHMPI 900

Query: 901  PQIQVDSVNEDKHHKGICESQSEKRYDKEVATCSLLQSDEPVEQNISLKDGVPNLQYSHE 960
            PQIQVDS+NED++ K + ESQSEKRYDKEVATCSLLQSDEP EQ ISLKDGVPNLQYSHE
Sbjct: 901  PQIQVDSLNEDEYDKDVYESQSEKRYDKEVATCSLLQSDEPAEQKISLKDGVPNLQYSHE 960

Query: 961  NAVEIQLVDTDDASILIRDTETFRDQMVMAPCVPSAGEGDSNLEQKQKSSGITQCEDSDS 1020
            NAVEIQLVDTDDASILIRDTETFRDQMVMAPCVPSAGEGDSNLEQ+ KSSGITQCEDSDS
Sbjct: 961  NAVEIQLVDTDDASILIRDTETFRDQMVMAPCVPSAGEGDSNLEQQLKSSGITQCEDSDS 1020

Query: 1021 FEGCTDHMVMAPCVPSAGEGDSNLEQPLKSSGITQCEDSDSFEGFTEHLNGNHHYVSTEC 1080
            FEGCTDHMVMAPCVPSAGEGDSNLE+PLKSS ITQCEDSDSFEG T+HLNGNHHY+STEC
Sbjct: 1021 FEGCTDHMVMAPCVPSAGEGDSNLEKPLKSSSITQCEDSDSFEGCTDHLNGNHHYLSTEC 1080

Query: 1081 QTAETSIKSKTFSSVLRASSSDEKEIEVELQLDNGIPASLGLRSEQLQI-NRSPIDKNLM 1140
            QTAETSI+ KTFSSVLRASSSD+KEIEVELQLDNGIPASLGLRSEQLQI NRSPIDKNLM
Sbjct: 1081 QTAETSIELKTFSSVLRASSSDQKEIEVELQLDNGIPASLGLRSEQLQIINRSPIDKNLM 1140

Query: 1141 QEFDTEKPVLELQRLSFCEEGYQQPNVSIGPIEMLLLEKEARLIQRSDSSPTLPVKEDLS 1200
            QEFDTEKPVLELQRLSFCEEGYQQPNVSIGPIEMLLLEKEA +IQ SDSSPTLPVKEDLS
Sbjct: 1141 QEFDTEKPVLELQRLSFCEEGYQQPNVSIGPIEMLLLEKEAHIIQGSDSSPTLPVKEDLS 1200

Query: 1201 RFGSNNRGTPLQNGMLESQSLVPEENFQCGDIELPMDTGKTDGMEEKGKLTLCSLHTPLT 1260
            RFGSNNRGTPLQNGMLESQSLVPEENFQCGDIELPMDTGKTDGMEEKGKLTLCSLHTPLT
Sbjct: 1201 RFGSNNRGTPLQNGMLESQSLVPEENFQCGDIELPMDTGKTDGMEEKGKLTLCSLHTPLT 1260

Query: 1261 QTSHYLGADKDMPALEGFLMRSDDEEPCISVGGINFDKLDLSKCMIERASILEKICKSAC 1320
            QTSHYLGADKDMPALEGFLM+SDDEEPCISVGGINFDKLDLSKCMIERASILEKICKSAC
Sbjct: 1261 QTSHYLGADKDMPALEGFLMQSDDEEPCISVGGINFDKLDLSKCMIERASILEKICKSAC 1320

Query: 1321 INSTLSSPSESFRLNKVTDLYNSLPNGLLECMDLKNNLLMNDQNKLLKDGSNSLNGEVNF 1380
            +NSTLSSPSESFRLNKVTDLYNSLPNGLLECMDLKNNLLMNDQNKLLK+GSN LNGEVN 
Sbjct: 1321 MNSTLSSPSESFRLNKVTDLYNSLPNGLLECMDLKNNLLMNDQNKLLKNGSNFLNGEVNC 1380

Query: 1381 SPHGSSFDCLQSFNSHSAGDLRKPFASPFGKLLDRNSLNLSSSGKRSGQNIELPCISEEA 1440
            SPHGSSFDCLQSF+SHSAGDLRKPFASPFGKLLDRNSLN SSSGKRS QNIELPCISEEA
Sbjct: 1381 SPHGSSFDCLQSFSSHSAGDLRKPFASPFGKLLDRNSLNSSSSGKRSSQNIELPCISEEA 1440

Query: 1441 ENTDEIDNEFSKDMRSSKRAPLVDITEDANVEVTVSEAAAVADRLSLESLNIELSNTRTH 1500
            ENTDEIDNEFSKD+RSSKRAPLVDITEDANV+VTVSEAA VADRLSLESL IELSNT TH
Sbjct: 1441 ENTDEIDNEFSKDIRSSKRAPLVDITEDANVKVTVSEAATVADRLSLESLIIELSNTGTH 1500

Query: 1501 IGTKENLGNQKSSKRKYVNEAVSRDTLPGENGAKRVTRSSYNIFSRSDLSCKKDFRKEGP 1560
            IGTKENLGNQKSSKRKYVNEAVSRDTLPGENGAKRVTRSSYN FSRSDLSCKKDFRKEGP
Sbjct: 1501 IGTKENLGNQKSSKRKYVNEAVSRDTLPGENGAKRVTRSSYNTFSRSDLSCKKDFRKEGP 1560

Query: 1561 RFSEKESKHRNIVSNITSFIPLVQQREAATILKGKRDIKVKAIEAAEAAKRLAEKKENER 1620
            RFSEKESKHRNIVSNITSFIPLVQQREAATILKGKRDIKVKAIEAAEAAKR+AEKKENER
Sbjct: 1561 RFSEKESKHRNIVSNITSFIPLVQQREAATILKGKRDIKVKAIEAAEAAKRVAEKKENER 1620

Query: 1621 QMKKEALKLERARMEQENLRQIELDKKKKEEERKKKEEERKKKEVDMAAKKRQREEEERK 1680
            QMKKEALKLERARMEQENLRQIELDKKKKEEERKKKEEERKKKEVDMAAKKRQREEEERK
Sbjct: 1621 QMKKEALKLERARMEQENLRQIELDKKKKEEERKKKEEERKKKEVDMAAKKRQREEEERK 1680

Query: 1681 EKERKRMRVEEVRRRLREHGGKLRSDKENKEAKPQANDQKPRDRKGCKDGTVKLVKESGH 1740
            EKERKRMRVEEVRRRLREHGGKLRSDKENKEAKPQANDQKPRDRKGCKDGTVKLVKESGH
Sbjct: 1681 EKERKRMRVEEVRRRLREHGGKLRSDKENKEAKPQANDQKPRDRKGCKDGTVKLVKESGH 1703

Query: 1741 DSFHKLSVTESKTTSTSDAVRGSFVVEDSQPTSVDFLEAEALENVMEHRISETSEEQSYQ 1800
            DSFHK SVTESKTTSTSDAVRGSFVVEDSQPTSVDFLEAEALENVMEHRI ETSEEQSYQ
Sbjct: 1741 DSFHKFSVTESKTTSTSDAVRGSFVVEDSQPTSVDFLEAEALENVMEHRIYETSEEQSYQ 1703

Query: 1801 ISPYKASDDEDEEDDDDGIQNNKFVPSWASKDRLAVLFASQKKLDPEIIFPPKSFCDIAE 1860
            ISPYKASDDEDEEDDDDGIQNNKFVPSWASKDRLAVLFAS +KLDPEIIFPPKSFCDIAE
Sbjct: 1801 ISPYKASDDEDEEDDDDGIQNNKFVPSWASKDRLAVLFASHQKLDPEIIFPPKSFCDIAE 1703

Query: 1861 VLLPRQHQSK 1869
            VLLPRQHQ K
Sbjct: 1861 VLLPRQHQFK 1703

BLAST of Cp4.1LG06g08410 vs. TAIR 10
Match: AT5G55820.1 (CONTAINS InterPro DOMAIN/s: Inner centromere protein, ARK-binding region (InterPro:IPR005635); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 280.8 bits (717), Expect = 8.0e-75
Identity = 525/1974 (26.60%), Positives = 827/1974 (41.89%), Query Frame = 0

Query: 4    MEKLFVQIFERKKWIIDQAKHQIDLFDQQLASKLIIDGIVPPPWLHSPFLHSNISYFEGV 63
            +E LFVQIFERK+ I++Q + Q+DL+DQ LASK ++ G+ PP WL SP L S  S     
Sbjct: 48   IENLFVQIFERKRRIVEQVQQQVDLYDQHLASKCLLAGVSPPSWLWSPSLPSQTSELN-- 107

Query: 64   GVSRNFVPGVEVPRSPLQTHCSSLNEAFVANSGEELQQR-SNEDAGSLNDDFDAGNRPAV 123
                  +  +  P S     C S            L      +D  S+ ++         
Sbjct: 108  --KEEIISELLFPSSRPSIVCPSSRPFSYQRPVRFLADNVVRQDLTSVVNNPLEEQLLEE 167

Query: 124  LPQCNVSDARVFNCAPRVDTSPVSPQGRGGGVLENYQDPTLSRARLHRSKSRQRALELRN 183
             PQ N+S     N   +V                + QD  ++  R    K R       +
Sbjct: 168  EPQHNLS----HNLVRQVSNH------------SHEQDVNIASPRDVHEKERLPESVSID 227

Query: 184  SAKSARCHSRYENKNDSVADGIVGSAISLLQADHEDESELAKPSSSCKGIGSMEEETNVC 243
              ++  C S   +KN  V   +  ++    Q       E      S  G          C
Sbjct: 228  CRENQSCSSPEHSKNQRVETNLDATSPGCSQ------GEKVPKCVSTTGCKRKSSSLGYC 287

Query: 244  CEQKNISTCAD-----------KSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAI 303
             E+    TC D           +SRQ+ALELR+S K+S+  S   N+      G +G  I
Sbjct: 288  QEEIEPDTCIDPGLSLAKMQRSRSRQKALELRSSAKASKSRSNSRNELKPSPGGDIGFGI 347

Query: 304  SLLQADHEDESELAKPSSSCKGIGSMEEETDVCCEQKNISICSDKSRQRALELRNSVKSS 363
            + L++D   E +L K           +E  + C E+   S    K   + +++    +S 
Sbjct: 348  ASLRSDSVSEIKLFK----------HDENDEECREEVENSNSQGKRGDQCIKISVPTESF 407

Query: 364  RCHSRYENKNDSVADGIVGSAI--SLLQADHEDESELAKPSSSC-KGIGSMEEETDVCCE 423
              H   ++ + S +     S +   LL++ H ++ ++ +   +  +  G ++E+ D   +
Sbjct: 408  TLHHEVDSVSISSSGDAYASIVPECLLESGHVNDIDILQSIETIDEASGKVDEQVD---D 467

Query: 424  QKNISICSDKSRQRALELRNSVKSSRCHSRYENKNDSVADGIVGSAISLLQADHEDESEL 483
             K+ S         +   ++S++ +      ++ N    + ++ ++     ADHE E   
Sbjct: 468  PKSRSCYETAYLDGSTRSKSSIQDNSKRKHQKSSNSFSGNFLLTNSNPSHWADHEVELPQ 527

Query: 484  AKPSSS----CKGIGSVEEETNVCCEQKKISICSGKVTIVGSPGLQSSSID--------- 543
            A  ++S        G+   ++ +    +  +    +     S  ++SSSI+         
Sbjct: 528  AITTTSEVSMVTDAGTSIFQSEIIARSRS-NARENRSKTEHSGSVESSSINLEPRDSIPV 587

Query: 544  -----VVNSLN-IYLENEGLCVAE-GSMQNSYKVNEQFDSPRTSSGKIGYCEEGPACCRS 603
                 V +SLN   ++ EGL V    S   S +  E  D+ R SS +             
Sbjct: 588  LQGSHVKDSLNPSSVDAEGLVVENITSSDQSKETGECVDTNRCSSAE----RVSQTGISP 647

Query: 604  QESNFDNA-ELSRLQCSSLDVDKSSRIPPEDGRECPIGGSKLHSDQVDEQLDLPKPSSDN 663
             E+ F  A + S  Q   L   +SS I  +         S+    Q D++  L KP + N
Sbjct: 648  DETTFAGAIQDSISQIELLSFVESSSIELQ---------SRHSVKQSDDESVLLKPVTVN 707

Query: 664  VECCEEAKLVDCRSQECNLDNALQSKSQRSSLDVDDSACIDANDGRLLD---SPNPSSSN 723
                 EA LV+  +   + + +  SKS RS    D +  +      +L+   +P     +
Sbjct: 708  ----GEALLVEEDNNGESTEISGISKS-RSLSQTDITVVLPVVVESILNESGTPEKLIDH 767

Query: 724  VKCCEETVVGHCRSQECN-FDNAREAGSLYNSQDVD--KSSYVHSE---DGQSCPNGSSE 783
             K C+ +    C S+E     +  E GS  +   +   +SS +  E   D ++  +GS+ 
Sbjct: 768  SKRCDIS----CGSKEVQPLGSLTEVGSNQSHGIISRARSSLIEEESANDYKALSDGSNH 827

Query: 784  VHSDEVKEQLDLSKSSSDNMECCEEEILGDFRSQEYNFNNAQKSEMQHDTLDADNSSCFS 843
              +D   +QL++ + +S  +   +  +  D    E   N+ +KS M+     A  +  F 
Sbjct: 828  KSAD---KQLEVREGNS-LLRTPDRPVFVD-NFDEVPENSREKSSMEKVPTPAPTARVFD 887

Query: 844  SENGTCSVGSSKLHSDR--VSEPSELFRPSSANVECHEVGLGDCRTQDCNFDNNAEKSGL 903
              + T S  +   +++   + + + L     A +E +    G    ++   ++N     +
Sbjct: 888  VPSLTDSGVNLSANNEMNDIEDHNGLNIEMVAEMESYASHPGLKVGENEPTESNTFTGHI 947

Query: 904  DKISSSPITEVREKTSDKKPSTSVDNKRDV--NEKEKCNSPLHMPMPQIQVDSVNEDKHH 963
            D ++  P    + +TS +K    +  KRDV   E ++C+  L  P+ +    S       
Sbjct: 948  DALTKRP----QHETSSEKAVPPI--KRDVTCTEADECHD-LESPIQEFFCSS-----SP 1007

Query: 964  KGICESQSEKRYDKEVATCSLLQSDEPVEQNISLKDGVPNLQYSHENAVEIQLVDTDDAS 1023
             G    Q+++R   E  T   L S      +I   D V    +  E A     VD  D  
Sbjct: 1008 MGGSMRQNKRRRILEKPTRRELSSSP--GGDILESDYVREAVHHREEAA-CHNVDNYDVE 1067

Query: 1024 I--LIRDTETFRDQMVMAPCVPSAGEGDSNLEQKQKSSGITQCEDSDSFEGCTDHMVMAP 1083
            +  LI    +    + +   + SA   +   E+           +SD       H   A 
Sbjct: 1068 LQKLIGSASSHHYSVELQKMIGSASSAELRFEE-------GDILESDYVREAVHHREEAA 1127

Query: 1084 CVPSAGEGDSNLEQPLKSSGITQCEDSDSFEGFTEHLNGNHHYVSTECQTAETSIKSKTF 1143
            C  +    D  L++ + S+                    +HHY S E Q           
Sbjct: 1128 C-HNVDNYDVELQKLIGSA-------------------SSHHY-SVELQ----------- 1187

Query: 1144 SSVLRASSSDEKEIEVELQLDNGI--PASLGLRSEQLQINRSPIDKNLMQEFDTEKPVLE 1203
              +  ASS++ +  E  L  + G+  PASL  R+EQL + RS I                
Sbjct: 1188 KMIGSASSAELRFEESYLLKEAGLMSPASLSYRTEQLSVQRSQIAP-------------- 1247

Query: 1204 LQRLSFCEEGYQQPNVSIGPIEMLLLEKEARLIQR-SDSSPTLPVKEDLSRFGSNNRGTP 1263
                   +      N++  P         A  I R SDSSP L               TP
Sbjct: 1248 -------DHRVGSENINFFPYAGETSHGLASCIVRDSDSSPCL---------------TP 1307

Query: 1264 LQNGMLESQSLVPEENFQCGDIELPMDTGKTDGMEEKGKLTLCSLHTPLTQTSHYLGADK 1323
            L  G++ S                                                  D 
Sbjct: 1308 L--GLISSD-------------------------------------------------DG 1367

Query: 1324 DMPALEGFLMRSDDEEPCISVGGINFDKLDLSKCMIERASILEKICKSACINSTLSSPSE 1383
              P LEGF++++DDE    S   +N D   L +   E A+++E+ICKSAC+N+     ++
Sbjct: 1368 SPPVLEGFIIQTDDENQSGSKNQLNHDSFQLPRTTAESAAMIEQICKSACMNTPSLHLAK 1427

Query: 1384 SFRLNKVTDLYNSLPNGLLECMDLKNNLLMNDQNKLLKDGSNSL-NGEVNFSPHGSSF-D 1443
            +F+ ++  DL  S+   L + M    NL          +GS+   N  +N    G S+ D
Sbjct: 1428 TFKFDEKLDLDQSVSTELFDGMFFSQNL----------EGSSVFDNLGINHDYTGRSYTD 1487

Query: 1444 CLQSFNSHSAGDLRKPFASPFGKLLDRNSLNLSSSGKRSGQNIELPCISEEAENTDE--- 1503
             L    + S+ + R P  SP  KL  R+    SSS KRS Q  +LPCISEE EN +E   
Sbjct: 1488 SLP--GTGSSAEARNPCMSPTEKLWYRSLQKSSSSEKRSTQTPDLPCISEENENIEEEAE 1547

Query: 1504 -IDNEFSKDMRSSKRA---------------------------------------PLVDI 1563
             +     K MRS KR                                        PL D+
Sbjct: 1548 NLCTNTPKSMRSEKRGSSIPELPCIAEENENIDEISDAVNEASGSERENVSAERKPLGDV 1607

Query: 1564 TED-ANVEVTVSEAAAVADRLSLESLNIELSNTRTHIGTKENLGNQKSSKRKYVNEAVSR 1623
             ED   +  +VSEA   ADR SL+S++   S +      K  +G  K S R++  +    
Sbjct: 1608 NEDPMKLLPSVSEAKIPADRQSLDSVSTAFSFSAKCNSVKSKVG--KLSNRRFTGKGKEN 1667

Query: 1624 DTLPGENGAKRVTRSSYNIFSRSDLSCKKDFRKEGPRFSEKESKHRNIVSNITSFIPLVQ 1683
                G  GAKR  +   + FS+  LSC       GPR  EKE +H NIVSNITSF+PLVQ
Sbjct: 1668 Q---GGAGAKRNVKPPSSRFSKPKLSCNSSLTTVGPRLQEKEPRHNNIVSNITSFVPLVQ 1727

Query: 1684 QRE-AATILKGKRDIKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQIE 1743
            Q++ A  ++ GKRD+KVKA+EAAEA+KR+AE+KEN+R++KKEA+KLERA+ EQENL++ E
Sbjct: 1728 QQKPAPALITGKRDVKVKALEAAEASKRIAEQKENDRKLKKEAMKLERAKQEQENLKKQE 1785

Query: 1744 LDKKKKEE---------------ERKKKEEERKKKEVDMAAKKRQREEEERKEKE-RKRM 1803
            ++KKKKEE               E+KKKEEERK+KE +MA +KRQREEE+++ KE +KR 
Sbjct: 1788 IEKKKKEEDRKKKEAEMAWKQEMEKKKKEEERKRKEFEMADRKRQREEEDKRLKEAKKRQ 1785

Query: 1804 RVEEVRRRLREHGGKLRSDKENKEAKPQANDQKPRDRKGCKDGTVKLVKESGHDSFHKLS 1859
            R+ + +R+ RE   KL+++   KE K QA D + + +K  K+      K    +S  ++ 
Sbjct: 1848 RIADFQRQQREADEKLQAE---KELKRQAMDARIKAQKELKEDQNNAEKTRQANS--RIP 1785
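
The Identity and Positives figures in the HSP header can be checked against the alignment itself. The sketch below assumes the usual BLAST pairwise text convention for the middle line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The helper names `midline_counts` and `percent` are hypothetical and exist only for this illustration.

```python
def midline_counts(midline):
    """Count identities and positives in one BLAST midline string.

    Assumed convention: letter = identical residue, '+' = conservative
    substitution, space = mismatch or gap.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


def percent(part, total):
    """Return part/total as a percentage."""
    return 100.0 * part / total


# First 13 columns of the first midline of the TAIR 10 alignment above
# (query MEKLFVQIFERKK against subject IENLFVQIFERKR).
excerpt = "+E LFVQIFERK+"
ident, pos = midline_counts(excerpt)

# Totals reported in the HSP header: 525 identities and 827 positives
# over 1974 alignment columns.
identity_pct = f"{percent(525, 1974):.2f}"
positives_pct = f"{percent(827, 1974):.2f}"
print(ident, pos, identity_pct, positives_pct)
```

Summing `midline_counts` over every midline of the HSP and dividing by the total number of alignment columns should reproduce the header percentages, provided the midlines are copied with their spacing intact.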

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_023535899.1	0.0	100.00	uncharacterized protein LOC111797188 isoform X1 [Cucurbita pepo subsp. pepo]
XP_023535901.1	0.0	99.89	uncharacterized protein LOC111797188 isoform X2 [Cucurbita pepo subsp. pepo]
KAG7030270.1	0.0	94.81	hypothetical protein SDJN02_08617 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022936495.1	0.0	80.71	uncharacterized protein LOC111443094 isoform X1 [Cucurbita moschata]
XP_022936496.1	0.0	80.62	uncharacterized protein LOC111443094 isoform X2 [Cucurbita moschata]

Match Name	E-value	Identity	Description
A0A6J1FDU9	0.0	80.71	uncharacterized protein LOC111443094 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1F7M4	0.0	80.62	uncharacterized protein LOC111443094 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1F8M1	0.0	79.07	uncharacterized protein LOC111443094 isoform X3 OS=Cucurbita moschata OX=3662 GN...
A0A6J1IL11	0.0	86.10	uncharacterized protein LOC111476581 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1IMH2	0.0	85.99	uncharacterized protein LOC111476581 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...

Match Name	E-value	Identity	Description
AT5G55820.1	8.0e-75	26.60	CONTAINS InterPro DOMAIN/s: Inner centromere protein, ARK-binding region (InterP...

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 1598..1698
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1786..1823
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 723..742
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1628..1758
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 851..899
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 863..892
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 96..116
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1628..1746
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 589..608
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1187..1206
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 723..738
None	No IPR available	PANTHER	PTHR13738	TROPONIN I	coord: 418..1811
None	No IPR available	PANTHER	PTHR13738:SF1	TROPONIN I	coord: 418..1811
IPR005635	Inner centromere protein, ARK-binding domain	PFAM	PF03941	INCENP_ARK-bind	coord: 1806..1860; e-value: 2.4E-9; score: 36.9
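
Several of the MobiDB-lite rows listed above overlap (for example 723..742 and 723..738). A minimal sketch of how the `coord:` ranges could be merged into non-overlapping disordered regions follows. It assumes the coordinates are 1-based, inclusive residue positions; the helper names `parse_coord` and `merge_intervals` are hypothetical and not part of any InterPro tooling.

```python
import re


def parse_coord(text):
    """Extract (start, end) from a string such as 'coord: 723..742'."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(match.group(1)), int(match.group(2))


def merge_intervals(intervals):
    """Merge overlapping inclusive intervals into a sorted list."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# MobiDB-lite alignment coordinates as listed in the InterPro table.
mobidb_rows = [
    "coord: 1786..1823", "coord: 723..742", "coord: 1628..1758",
    "coord: 851..899", "coord: 863..892", "coord: 96..116",
    "coord: 1628..1746", "coord: 589..608", "coord: 1187..1206",
    "coord: 723..738",
]

disorder = merge_intervals(parse_coord(row) for row in mobidb_rows)
# Inclusive coordinates, so each region spans end - start + 1 residues.
total_residues = sum(end - start + 1 for start, end in disorder)
print(disorder, total_residues)
```

Merging only joins regions that share at least one residue; regions that are merely adjacent stay separate, which is a design choice of this sketch rather than a property of the annotation.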

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG06g08410.1	Cp4.1LG06g08410.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0005737 cytoplasm
cellular_component GO:0005634 nucleus
cellular_component GO:0005819 spindle