Cp4.1LG06g06470 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG06g06470
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionUnknown protein
LocationCp4.1LG06: 4079925 .. 4088158 (+)
RNA-Seq ExpressionCp4.1LG06g06470
SyntenyCp4.1LG06g06470
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAGAGATATATATAGAGAGAGAGAGAGATATATATAGAGAGAGAGAGAGAGAGCACATAAATCATAAGCGCCTATTTGTGATTTATGATCGGAGAGGATGAGACAAGAGCGATGATCCCCAACCTCCATTACTCTTGATTCCGTCATCTTTCCACCAATGAATCCCTACTCCGAGGAAAGACTCACCGAAGAGGTTCTCTATCTTCACTCTCTGTGGCGGCGAGGTCCGCCGAGGGGCCCTAAGCCCACTCGCTATTATTTATCCACCGCCGTCGCCGCTGCTACGAATAAGAGACCCAGAGACACAAAGAATCGAAAGCAAAAGAAGAAGAAGCCACGCCTCGAGCCATTACAAGACACCGGCCCCGAATGGCCCTGCCCGGAGCCAGTGCAAAATCAGCCCTCGACGTCATCTGGGTGGCCGCCAATGCCCTGTGCTACTCCGGCGGCTCGGCTGGTGTCGTCTGAAGAGCGAGCAAATCGTGTGGCGTTGCAATTGCAGTACAAGGGTATCGAGGCTTGCCGGAGATTTCTCATTAGAAATGCCGATTCAGGGAGTGATGAAGAGGTGGAGGAGGAAGAGGGGAATGATGGGGAGATTATGGAAAGTGAAGAGTACAAATTCTTTTTGAATCTGTTTATGGAGAATGATGAACTTAGGGGCTATTACGAGAAGAATTCTGAAGATGGGTTGTTTTGTTGCTTGGTTTGTGATGGAATGGGGAAGAAGAAATCTGGGAAAAGGTTTAAGAACTGCATTGGGCTTGTTCATCATTCGAATTCGATATCTAGAACGAAGAAGAAGGTGGCTCATAGGGCTTTTGGACAGGCCGTATGCAGGGTTTTTGGTTGGGATATTGATCGACTTCCAACCATTGTGTTGAATGGCGAGCCTCTCAGTCGATCATTAGCCAATTCTGGAGATTTCAAGGTACTTTACTGGTCTGCCCATGTGTTTGAACTTGAACTTGTTTATAATGGCATTCAATACAAAATTGCTTTCGATTATAATGTATTGTAGGATCAGCCAGAGGAAAATCAGGTGGCTGAAGAACATGATTCTTGGGTTCATAATGAAAATGTAGCCATTTTGAATGATGAAATTGATATGAAGAATGAACAGAAATGGGAGGAAGAAAAGACAGCTGAAGATTTGATTTCTGGCGAGGTTAGTTCATTTACTTCCTTTATATGAATCAGTTGAGGTGTTCTTGATGATCATGCAATTGTCTTGTCTTATGTTCATTAGAAAACGAAGAACAATGATTCCTCGGCAGTCGTAACCGAATGCCGAAAACATGTAGTTTCTGCTGATGAGCTGATACAGTTGAATGTGTTGCAGGTACCCGAGTCGATTATGGAAGCATGTGAAGAATTTTTTGCTGCCTCCTTGACATCTATGGCTGACGACGATGTTAGTGAAAACAACGCAATCGAGGAACGCGAAGAGTTCAAATTCTTTTTAAAGCTGTTCATTGAGAATGAAAGCTTGAGAAGATATTACAAGAACAAGTATGATGATGGAGAATTTTCGTGTTTAGTTTGTGAAGGAGCGGGAAAGAAAACGTTGAGGAGTTTTAAGACGTGCGTTCGCCTTCTCCGACATACAACTTATCCTGGGAAGAACAAAACAGGGAAAAAACGGGTTAAGCCTCACATTGCTAAGATGTTGAAAGTAAAGATGCTGGCTCATAGAGCATATAGTTTAGTTATATGCCAGGTTCTTGGTTGGGACATAGAAAAGCTTCCTGCAATCGTGTTAAAAGGCGAAGGCCATGGTTGTTCGTTAAAGAAGCTAGACGTGTTGAAAGTATGCTAATTCATTTGATCAATCTCTTCCTATGCTCAAACTTTCATGTTCTTGATTCATACTAACAATGCTGGAATTGCAGGACGACCCGGTTGGCAATGCAGGTGATAATACGAACGAAGTAGATGATCCTGTGAGAGATGACTCTACTGAGATCGACTAAGTTCACAACCAATCCGTCGGTGCAGTCAGGATGATACCGGAGAAGATGACTCGAAAAAGGTTAGTGAAGTGTTCTTGTTCTTGTTGTTTCTGTGTTAGACTGGTAGATGAAGGTTTTGTGGCATTTTGAGTGTGAGATATGATATTTAGGCCTATTCAAAAGAAGCCCTGTTGGCTCCCATTTTCTCCCCCCATTCCCTTCATTCTATTGGTTTCTCATGAATCTTTAGGTTGTTGATAAATGCATAGCTAGGAGTCTCATGTTGGTTTCTCGTGGTCACCTTTAAAACTCCGATCCTGCTGCTAGGGAGAGGGAGAGGTTTCCATATCCTTATAAGGAATGTTCTGTTTCCCTTTCCAACTGACGTAGGACCTCACAATTCAAATCTCTTTTTGCCCACCGTCCTCATTGGCATACTGCCCGGTGTGTAGCTCTGATATCATTTGTAACCGTTCAAGCCCACCGCTAGCAAATATTGTTCATTTGGCGTCAGCCTCACGGTTTTCAAACATGTCTATTAGGGAGAGGTTTCCACACCCTTATAAGAAATGCTCTGTTCCCCTGTTCCCCTCTCCAACTGATGTGGGACCTCATAATCCCCCCCCCNTCGGTGTCTGGCTGCGATACCATTTGTAACAGCCTAAGCCTGCCGCTAGCAGATATTGTCTGCTTTGGCATCAACCTCACGGTTTTAAAATGCGTCCATTAGAGAGAGGTTTCCACATCCTTATAAGGAATGCTTCGTTCCCCTCTCTATCTGACGTGGGACCTCACAATCCACCCTTCTTGGGACCCAGTGTCTGGCTCTAATACCATTTGTAACAGCCCAAACTCACCGTTAGCAGATAATGTCCGCTTTGGTGTCAGCCTCACGGTTTTAAAATGCGTCTACTAAGAAGAAGTTTCCACATCCTTATAAGGAATGTTTCGTTCCCCCCTCTAACCGACAAGGGACTTCATAATCTACCCCCTTGGATCCCAGCGTTCTCGCTGGCACATTGCCCCGTGTGTAGCTCTAATACTATTTGTAACAGCCCAAACCCACTACTAACAGATATTGTCTGTTTTGGCCCGTAACGTATCACCGTCAGCCTCACAGTTTTAAAACGCTTCTACTAAGGAGAGGTTTCCCCACGCTGTTCCACTCTCCAACCGATGTGGGACCTCACAGAACTAATCACATGGATAATACTACATAAGAAGGCACCATGTTTGCCATCTAGCTAAGAACGATAACGTTAAATTCCCTATACCATGAACATTTCAATCGATTCATTGAATAATTGTATCATATCAACTTTCAATACTCAACTCTACAAGGGAGATCACCAGAATTAACACACCTCATCTCTACAGGTTTTGGCTCAGCCCCAAGCTCGTTATAACCAAGTCGGTCTCTAATGGCGACGGAAGATGAGAGGAGCAGGGCAAGGATGATTCACAACGTAATAGAGAGGAGAGTTTCGGTTTCTTTCATTTCTAATTTTCTTTTTAAGTTCATTTCATTTATATCGAGTCGAATCAAATTACATGAATTTCAAACAAATTTAGATTGAAATTCATCGACTCCCGTTAATTTCTGGTCAAAATTCATGAATCCCGACCTTTTTCGATGCATATTTATAAATTTTGATCGAGATTATGGTAGTGAACAAACATAGAACAGATATCTTGACGGAGACACCGACATTATTTTATTTTCAAATGGGTTTCAAATTTAAAATTTTGATTAGTGATTTTTTTTTTCATTTAATAATAATTTTTAAAATAACTTAATTATTACACAAAAAAATATTATTATTATTATTATTATTTATCATTTTTTCAAATACAACCTTGACTATAATTCTTATAAATAATTTTAAAATAAAAACTATTACTTTTCATCATGAAAATTATTAAACTAATTATTTAATTAAAAAAAATTATTTTCTAAAAAAAAAATAATAATAAAAATAAAGGAAGAATCATTTCTAAGTTTGGGAGCCAAAAACCGTTTTTTTAGAATCTGAAATAATATTATAATTTAGAGATTAAACGTCTCTGCTTTTGCCTCTGCCATTTAAAGATAAACTAAAACCCTGTTCCTGTGCAGCTGCCGATCAAGAACACAACCCATCCAACCCCATGATTATGGACTGACCTCCGCGGCACCTCAGGTTGATTTCTTCTTCCAACCCCTTTTTGATTCTGCTGTTCGCTTTCTGTGAATGTTGCGTAAATATTCAATTTCGCAAAACAATGTGATTCTGCTGATCAAATTTGAGAATCTGCTCCTCGAATGCCTTAAATCTCTATATTTTCGCTCCATCTTCAGTTTGGTTCTATGATCGTTTAAGAACACTGAGAACTTCGAGGATCTTTTGTTATTTTTTTTTCTAACAAAACTCTCATTCAATGAGTTGAAAGATTAGAAAATCATTCCTTGTGGTACTGTTCTTTGGTTATAAGGATTATGAATCTTGCAGGTGAACAAAAATGGATTTGAAACATAAGGGTATATCATGGGTTGGAAACATGTTCCAAAAATTTGAAGCAGTGTGCCAGGAAGTGGATAATATTATAAACCAGGTTATCCTCATTTCCTTCTCTTTTTTACGTTATGTGATCCAACATTTTGACAATATAGTTCATGTGGTAGCTTTCTTATTGGGTTGATTGTGTCACTGAATCGTTCATCTTTGTCGTGCAATTTAGTAATTTTTCGCTGAGGTTTGCAGTAGGAATAAACTGTTAAACTGATCTTATGCACAGGATAAGGTTGAATATGTTGAAAATCGGGTTAGTTCAGCAAGTGTTAATGTGAAGAGATTAGATGTTGTTCAAGGTTTACTTCCTCCTACAGAGGGTTCTGTGAAATATGAAGCTAAAGCAGTGGCTCCGAGGGGACGTACATATTTCAAGTCACTGTCATACAATGAAGAAAAATCTGCACATAATGTTGCTGATAAATCATCTGTGGGGCATGGTACTATCAATCATCAAGCTTCTTGTAAAGTTCTCTTTGTAAATGAAGAAGTTGCTCGAGTTCCTAATCGTTCTTCTCTTCGGTTGAATGCTGGTTTACATGAGAACAAAAAAGAAAAACCTGTTAATGAACTACTTTCGGAGAAAAGTGATGGCTCATTGACTGATAAGTTTGCGTTCGTGGAGTCGGATGCTATTGATCCTTTGAATCGATCACTGAGAAATGTAAGTCGTGAAGTTAATGAAATTAATAAAAGTTGTTCTCCGGTTTTTGATGACTCCGATCTGCAATTGGTGGATAATGTACTCTTAGTAGGGAACAACAATGGGGCTTTGACAAATAATGATGCAAGTAAGAGTTCTAAAGAGGATACGACCATAGAGTTCAATGCTAGTGATCCGTTGAACCATACGGCTAATCATAAATCTTGTCAAGTTAAAGTTACAAATGGAGAAGAATTTTTTATTTTGGATAACTCTCATCTGCCAATGGAATCTTCCAGATTCTCGTCGAAGGACGACGACTTGTCAAATGAAAACACCAATGAGTTTGTAAAGAAGGTTGGGATCATGGAACCTAATGCTGCTGATCATTTGAACGACAAACATCTTAGTCATGTATGGAGCAGTACAAACTTCGTAAGTAAAGAAGCTGATAATTCTAATATGCTTTTGAAGTCCGAGGTACCTTCAAGCAGAATCGATCATGCCTTGATAGATAAAGATTTCAATGAGGGTCCTGTAAAGGATGCTATCTTTGAGGATGATCTTGAAAGTTATTTATTGAATCTTCCCAGTGAAGAAGCTATGATTTCTAATGGAAACCATCTGCAAATGGAGCCTGAACTACTTGCTAGAAACAATGATGATGCTTTGACAGATGCATACTCTAATGAAAGTTTAGAAAAGGATACCATTTTGGAGTTGGAGTATGATGCAAGTTATCCTTTAAAGAACCAGCCAAGACGTATATCAAGCAGCGTAAAATATAAAAATGAAGAAGTTTCTTCAGTTTCAATAGATAGGGCATCAGATGCAAGTTGTAAAGAACAAGACAATTTAGAATTATCAACTGAGTTAACTTTGCATTGTGATGAAGAGTCGATTAAGGGCAGTTCGTGCATTTATGGTAATGAACGTGACGGGGATATTGCGACCTCAACTCGAAATCCACAGGAAACTTCGGTTCATGGTGCTGATGTTGAATCCATCCATAAAGTAGGAGAACCTCCTAGCATCTCGTTGAACAATTTAGTTGACTTATCACCTAGGATGGAGACACATTTGAGGTACTTCGAAAATGTTCCACATGCTACTTCTTCTGAACTGGCTTCTGTAGTTTTAGCTAGTGGAGAAACTGTAAAAGAGACAAAGTCAGTCTCCTCTCTGAAACCGCTACCGAAGGGTCCGTTTTCTGCTTCCAGAAGTTCGGTCGACAACTTTTCTAGTACCACCGTTCATGAAAAACCAGTCGATCAGCGTGCATACATTGAGTGTAGATCTCATCCATCTTTCGAAGTGGTCACTCGTGCATCTAATGGAAACAAGGCTTCGGAGACGAGATTTAACTCCTCCAGAAGCTCCTTATCATCATTTGAATCGCTTGGTCTGTACTCTTATCTTCTATCCTAGGATAAGTGTGTTTTCTTATGATTTTCTTAAACAATTCATGATTCAGCAGGAACTCATGCCAGTAGCCAGGTTGAGTTTTCCAAATCTACTGGTTCTGGGATTCTAAGTTTCTCTACTGAAGTAGGTATGTCCCAAAGCTTGTGTTCATACAATTTTGTAAATTCCCGGCTGTTATCTATAATTCTTATGATGTGGTGGTAGTTCAATCTGTTGGCTATGGAGAAAATCTTTCATTAGGAATAAAGCAACAATCTTAAAAACTTTTCTCAGGTTGTCTGTATGATTCGAGTGGCCATATTCTGGATTTTGAAATGGAAACAGTGGATTTGGGACATAAGGTGACCGTCGAAGACGAGTGTGGCGTTATTGACTATAAAGCTCTCCATGCTGTCTCTCGCCGAACCCAAAAGCTCCATTCTTACAAGGTCCATATATTATCTAAAGTTACATATATGAGCTTCATCTTGCAATGTTAAGTTTATTTAAAACCTGGAAACACCTCTCGTTCTTTAAGATGATAATGTTTAACGATAACTTTTACCGTCCGTTTCTGATCTATTTTTCTTTTCTTTCTGAAAATAGAAGAGAATCCAGGATGCTTTTACTACCAAAAAGAGGTTGGCAAAGGATTATGAACAGCTAGCAATCTGGTATGGAGATACTGATCTGGACTCCATCACAGACAGTTCCCAGAAGTCGGACAAGAAGAACGCATCCGATTCCGAGTGGGAGCTCCTGTAAATAAGACAGCTAATTCACTTCGTCTCGGCAATCAAACTTGTTTCCAGGTGGAGGAGAATCTTATATGCTGGAGATGAAGAGGAAGCTCGTCTGTTAATACCTACTCAAGAATAAGGTTCCTCACTTTATCTATTGAAGTGCATAAGTTACCTTGGAAATTCCTAAATAAACGAGTTGCAAGAAATTTTGCACATATTGGCACTCTTTCTGGGCTATGAACCTTTGATACTTTAATTAATTTCATATCCTTGGTAAATAAAGCTGTCTTGTTTTATCTTTTTTGGAAAGATCTACTCTTTGTTTGACCTTTTGAACAGAACACTCGTGGGCTTGTCGAGTTCAACAATTAGAAGCGCTTTTGGTGTTCGTTCTTCCCATCCAGACACTTGGACGACATTGCATTCCACGTTCGACCACTCTCAAGCTTACAGGCTCCCATTCTTTGCTCATGATGTCAATGAAATAGAAAGTGACAAACAGTAAGTCTTAACTTTGCATTTTGTGGCTCTGTCGTATACTTTTTATTGTCATCTGAACTTTTACTACAAAGATTAGCATCTGACTGCCATCATATTGGAATGATTGCCTCTGTCTCTTCGTGTCGGGACAGCCTGGAACTGCTATGTAGTATAGTTTTACTTCTCATTGCTCCAATAAACAAAATTCAAACTCAAATCACATGACTTTGGGGATCTGGTTTTTTTGAAGTGAAGAAGATCATCACTGTGTTTTTGCCAAACTAATGTGAAAGAGTGTTGATTCAATGGTCTCATTCAGGATTCTGTTTTTGGATATCTTTATTAAAGTAGTGGACTAGCCATATCCTCTTGAGATTCTAAAATAGTGTAGCTTTACAGGAGCACGAGAATTACTGTAGGAGTTTGAAAAGGATTGTCTTTTGCCTATCTCCAAAGGCTGATA

mRNA sequence

GAGAGATATATATAGAGAGAGAGAGAGATATATATAGAGAGAGAGAGAGAGAGCACATAAATCATAAGCGCCTATTTGTGATTTATGATCGGAGAGGATGAGACAAGAGCGATGATCCCCAACCTCCATTACTCTTGATTCCGTCATCTTTCCACCAATGAATCCCTACTCCGAGGAAAGACTCACCGAAGAGGTTCTCTATCTTCACTCTCTGTGGCGGCGAGGTCCGCCGAGGGGCCCTAAGCCCACTCGCTATTATTTATCCACCGCCGTCGCCGCTGCTACGAATAAGAGACCCAGAGACACAAAGAATCGAAAGCAAAAGAAGAAGAAGCCACGCCTCGAGCCATTACAAGACACCGGCCCCGAATGGCCCTGCCCGGAGCCAGTGCAAAATCAGCCCTCGACGTCATCTGGGTGGCCGCCAATGCCCTGTGCTACTCCGGCGGCTCGGCTGGTGTCGTCTGAAGAGCGAGCAAATCGTGTGGCGTTGCAATTGCAGTACAAGGGTATCGAGGCTTGCCGGAGATTTCTCATTAGAAATGCCGATTCAGGGAGTGATGAAGAGGTGGAGGAGGAAGAGGGGAATGATGGGGAGATTATGGAAAGTGAAGAGTACAAATTCTTTTTGAATCTGTTTATGGAGAATGATGAACTTAGGGGCTATTACGAGAAGAATTCTGAAGATGGGTTGTTTTGTTGCTTGGTTTGTGATGGAATGGGGAAGAAGAAATCTGGGAAAAGGTTTAAGAACTGCATTGGGCTTGTTCATCATTCGAATTCGATATCTAGAACGAAGAAGAAGGTGGCTCATAGGGCTTTTGGACAGGCCGTATGCAGGGTTTTTGGTTGGGATATTGATCGACTTCCAACCATTGTGTTGAATGGCGAGCCTCTCAGTCGATCATTAGCCAATTCTGGAGATTTCAAGGATCAGCCAGAGGAAAATCAGGTGGCTGAAGAACATGATTCTTGGGTTCATAATGAAAATGTAGCCATTTTGAATGATGAAATTGATATGAAGAATGAACAGAAATGGGAGGAAGAAAAGACAGCTGAAGATTTGATTTCTGGCGAGAAAACGAAGAACAATGATTCCTCGGCAGTCGTAACCGAATGCCGAAAACATGTAGTTTCTGCTGATGAGCTGATACAGTTGAATGTGTTGCAGGTACCCGAGTCGATTATGGAAGCATGTGAAGAATTTTTTGCTGCCTCCTTGACATCTATGGCTGACGACGATGTTAGTGAAAACAACGCAATCGAGGAACGCGAAGAGTTCAAATTCTTTTTAAAGCTGTTCATTGAGAATGAAAGCTTGAGAAGATATTACAAGAACAAGTATGATGATGGAGAATTTTCGTGTTTAGTTTGTGAAGGAGCGGGAAAGAAAACGTTGAGGAGTTTTAAGACGTGCGTTCGCCTTCTCCGACATACAACTTATCCTGGGAAGAACAAAACAGGGAAAAAACGGGTTAAGCCTCACATTGCTAAGATGTTGAAAGTAAAGATGCTGGCTCATAGAGCATATAGTTTAGTTATATGCCAGGTTCTTGGTTGGGACATAGAAAAGCTTCCTGCAATCGTGTTAAAAGGCGAAGGCCATGGTTGTTCGTTAAAGAAGCTAGACGTGTTGAAAGACGACCCGGTTGGCAATGCAGGTGATAATACGAACGAAGTAGATGATCCTGTGAGAGATGACTCTACTGAGATCGACTAAGTTCACAACCAATCCGTCGGTGCAGTCAGGATGATACCGGAGAAGATGACTCGAAAAAGGTTTTGGCTCAGCCCCAAGCTCGTTATAACCAAGTCGGTCTCTAATGGCGACGGAAGATGAGAGGAGCAGGGCAAGGATGATTCACAACGTAATAGAGAGGAGAGTTTCGGTTTCTTTCATTTCTAATTTTCTTTTTAAGTTCATTTCATTTATATCGAGTCGAATCAAATTACATGAATTTCAAACAAATTTAGATTGAAATTCATCGACTCCCGTTAATTTCTGGTCAAAATTCATGAATCCCGACCTTTTTCGATGCATATTTATAAATTTTGATCGAGATTATGGTAGTGAACAAACATAGAACAGATATCTTGACGGAGACACCGACATTATTTTATTTTCAAATGGGTTTCAAATTTAAAATTTTGATTAGTGATTTTTTTTTTCATTTAATAATAATTTTTAAAATAACTTAATTATTACACAAAAAAATATTATTATTATTATTATTATTTATCATTTTTTCAAATACAACCTTGACTATAATTCTTATAAATAATTTTAAAATAAAAACTATTACTTTTCATCATGAAAATTATTAAACTAATTATTTAATTAAAAAAAATTATTTTCTAAAAAAAAAATAATAATAAAAATAAAGGAAGAATCATTTCTAAGTTTGGGAGCCAAAAACCGTTTTTTTAGAATCTGAAATAATATTATAATTTAGAGATTAAACGTCTCTGCTTTTGCCTCTGCCATTTAAAGATAAACTAAAACCCTGTTCCTGTGCAGCTGCCGATCAAGAACACAACCCATCCAACCCCATGATTATGGACTGACCTCCGCGGCACCTCAGGTGAACAAAAATGGATTTGAAACATAAGGGTATATCATGGGTTGGAAACATGTTCCAAAAATTTGAAGCAGTGTGCCAGGAAGTGGATAATATTATAAACCAGGATAAGGTTGAATATGTTGAAAATCGGGTTAGTTCAGCAAGTGTTAATGTGAAGAGATTAGATGTTGTTCAAGGTTTACTTCCTCCTACAGAGGGTTCTGTGAAATATGAAGCTAAAGCAGTGGCTCCGAGGGGACGTACATATTTCAAGTCACTGTCATACAATGAAGAAAAATCTGCACATAATGTTGCTGATAAATCATCTGTGGGGCATGGTACTATCAATCATCAAGCTTCTTGTAAAGTTCTCTTTGTAAATGAAGAAGTTGCTCGAGTTCCTAATCGTTCTTCTCTTCGGTTGAATGCTGGTTTACATGAGAACAAAAAAGAAAAACCTGTTAATGAACTACTTTCGGAGAAAAGTGATGGCTCATTGACTGATAAGTTTGCGTTCGTGGAGTCGGATGCTATTGATCCTTTGAATCGATCACTGAGAAATGTAAGTCGTGAAGTTAATGAAATTAATAAAAGTTGTTCTCCGGTTTTTGATGACTCCGATCTGCAATTGGTGGATAATGTACTCTTAGTAGGGAACAACAATGGGGCTTTGACAAATAATGATGCAAGTAAGAGTTCTAAAGAGGATACGACCATAGAGTTCAATGCTAGTGATCCGTTGAACCATACGGCTAATCATAAATCTTGTCAAGTTAAAGTTACAAATGGAGAAGAATTTTTTATTTTGGATAACTCTCATCTGCCAATGGAATCTTCCAGATTCTCGTCGAAGGACGACGACTTGTCAAATGAAAACACCAATGAGTTTGTAAAGAAGGTTGGGATCATGGAACCTAATGCTGCTGATCATTTGAACGACAAACATCTTAGTCATGTATGGAGCAGTACAAACTTCGTAAGTAAAGAAGCTGATAATTCTAATATGCTTTTGAAGTCCGAGGTACCTTCAAGCAGAATCGATCATGCCTTGATAGATAAAGATTTCAATGAGGGTCCTGTAAAGGATGCTATCTTTGAGGATGATCTTGAAAGTTATTTATTGAATCTTCCCAGTGAAGAAGCTATGATTTCTAATGGAAACCATCTGCAAATGGAGCCTGAACTACTTGCTAGAAACAATGATGATGCTTTGACAGATGCATACTCTAATGAAAGTTTAGAAAAGGATACCATTTTGGAGTTGGAGTATGATGCAAGTTATCCTTTAAAGAACCAGCCAAGACGTATATCAAGCAGCGTAAAATATAAAAATGAAGAAGTTTCTTCAGTTTCAATAGATAGGGCATCAGATGCAAGTTGTAAAGAACAAGACAATTTAGAATTATCAACTGAGTTAACTTTGCATTGTGATGAAGAGTCGATTAAGGGCAGTTCGTGCATTTATGGTAATGAACGTGACGGGGATATTGCGACCTCAACTCGAAATCCACAGGAAACTTCGGTTCATGGTGCTGATGTTGAATCCATCCATAAAGTAGGAGAACCTCCTAGCATCTCGTTGAACAATTTAGTTGACTTATCACCTAGGATGGAGACACATTTGAGGTACTTCGAAAATGTTCCACATGCTACTTCTTCTGAACTGGCTTCTGTAGTTTTAGCTAGTGGAGAAACTGTAAAAGAGACAAAGTCAGTCTCCTCTCTGAAACCGCTACCGAAGGGTCCGTTTTCTGCTTCCAGAAGTTCGGTCGACAACTTTTCTAGTACCACCGTTCATGAAAAACCAGTCGATCAGCGTGCATACATTGAGTGTAGATCTCATCCATCTTTCGAAGTGGTCACTCGTGCATCTAATGGAAACAAGGCTTCGGAGACGAGATTTAACTCCTCCAGAAGCTCCTTATCATCATTTGAATCGCTTGGAACTCATGCCAGTAGCCAGGTTGAGTTTTCCAAATCTACTGGTTCTGGGATTCTAAGTTTCTCTACTGAAGTAGGTTGTCTGTATGATTCGAGTGGCCATATTCTGGATTTTGAAATGGAAACAGTGGATTTGGGACATAAGGTGACCGTCGAAGACGAGTGTGGCGTTATTGACTATAAAGCTCTCCATGCTGTCTCTCGCCGAACCCAAAAGCTCCATTCTTACAAGAAGAGAATCCAGGATGCTTTTACTACCAAAAAGAGGTTGGCAAAGGATTATGAACAGCTAGCAATCTGGTATGGAGATACTGATCTGGACTCCATCACAGACAGTTCCCAGAAGTCGGACAAGAAGAACGCATCCGATTCCGAGTGGGAGCTCCTGTAAATAAGACAGCTAATTCACTTCGTCTCGGCAATCAAACTTGTTTCCAGGTGGAGGAGAATCTTATATGCTGGAGATGAAGAGGAAGCTCGTCTGTTAATACCTACTCAAGAATAAGGTTCCTCACTTTATCTATTGAAGTGCATAAGTTACCTTGGAAATTCCTAAATAAACGAGTTGCAAGAAATTTTGCACATATTGGCACTCTTTCTGGGCTATGAACCTTTGATACTTTAATTAATTTCATATCCTTGGTAAATAAAGCTGTCTTGTTTTATCTTTTTTGGAAAGATCTACTCTTTGTTTGACCTTTTGAACAGAACACTCGTGGGCTTGTCGAGTTCAACAATTAGAAGCGCTTTTGGTGTTCGTTCTTCCCATCCAGACACTTGGACGACATTGCATTCCACGTTCGACCACTCTCAAGCTTACAGGCTCCCATTCTTTGCTCATGATGTCAATGAAATAGAAAGTGACAAACAGAGCACGAGAATTACTGTAGGAGTTTGAAAAGGATTGTCTTTTGCCTATCTCCAAAGGCTGATA

Coding sequence (CDS)

ATGAATCCCTACTCCGAGGAAAGACTCACCGAAGAGGTTCTCTATCTTCACTCTCTGTGGCGGCGAGGTCCGCCGAGGGGCCCTAAGCCCACTCGCTATTATTTATCCACCGCCGTCGCCGCTGCTACGAATAAGAGACCCAGAGACACAAAGAATCGAAAGCAAAAGAAGAAGAAGCCACGCCTCGAGCCATTACAAGACACCGGCCCCGAATGGCCCTGCCCGGAGCCAGTGCAAAATCAGCCCTCGACGTCATCTGGGTGGCCGCCAATGCCCTGTGCTACTCCGGCGGCTCGGCTGGTGTCGTCTGAAGAGCGAGCAAATCGTGTGGCGTTGCAATTGCAGTACAAGGGTATCGAGGCTTGCCGGAGATTTCTCATTAGAAATGCCGATTCAGGGAGTGATGAAGAGGTGGAGGAGGAAGAGGGGAATGATGGGGAGATTATGGAAAGTGAAGAGTACAAATTCTTTTTGAATCTGTTTATGGAGAATGATGAACTTAGGGGCTATTACGAGAAGAATTCTGAAGATGGGTTGTTTTGTTGCTTGGTTTGTGATGGAATGGGGAAGAAGAAATCTGGGAAAAGGTTTAAGAACTGCATTGGGCTTGTTCATCATTCGAATTCGATATCTAGAACGAAGAAGAAGGTGGCTCATAGGGCTTTTGGACAGGCCGTATGCAGGGTTTTTGGTTGGGATATTGATCGACTTCCAACCATTGTGTTGAATGGCGAGCCTCTCAGTCGATCATTAGCCAATTCTGGAGATTTCAAGGATCAGCCAGAGGAAAATCAGGTGGCTGAAGAACATGATTCTTGGGTTCATAATGAAAATGTAGCCATTTTGAATGATGAAATTGATATGAAGAATGAACAGAAATGGGAGGAAGAAAAGACAGCTGAAGATTTGATTTCTGGCGAGAAAACGAAGAACAATGATTCCTCGGCAGTCGTAACCGAATGCCGAAAACATGTAGTTTCTGCTGATGAGCTGATACAGTTGAATGTGTTGCAGGTACCCGAGTCGATTATGGAAGCATGTGAAGAATTTTTTGCTGCCTCCTTGACATCTATGGCTGACGACGATGTTAGTGAAAACAACGCAATCGAGGAACGCGAAGAGTTCAAATTCTTTTTAAAGCTGTTCATTGAGAATGAAAGCTTGAGAAGATATTACAAGAACAAGTATGATGATGGAGAATTTTCGTGTTTAGTTTGTGAAGGAGCGGGAAAGAAAACGTTGAGGAGTTTTAAGACGTGCGTTCGCCTTCTCCGACATACAACTTATCCTGGGAAGAACAAAACAGGGAAAAAACGGGTTAAGCCTCACATTGCTAAGATGTTGAAAGTAAAGATGCTGGCTCATAGAGCATATAGTTTAGTTATATGCCAGGTTCTTGGTTGGGACATAGAAAAGCTTCCTGCAATCGTGTTAAAAGGCGAAGGCCATGGTTGTTCGTTAAAGAAGCTAGACGTGTTGAAAGACGACCCGGTTGGCAATGCAGGTGATAATACGAACGAAGTAGATGATCCTGTGAGAGATGACTCTACTGAGATCGACTAA

Protein sequence

MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLFCCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTAEDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID
Homology
BLAST of Cp4.1LG06g06470 vs. NCBI nr
Match: KAG6591921.1 (hypothetical protein SDJN03_14267, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 998 bits (2580), Expect = 0.0
Identity = 497/516 (96.32%), Postives = 502/516 (97.29%), Query Frame = 0

Query: 1   MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
           MNPYSEERLTEEVLYLHSLW RGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP
Sbjct: 1   MNPYSEERLTEEVLYLHSLWWRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60

Query: 61  RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
           RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEER NRVALQLQYKGIE
Sbjct: 61  RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERGNRVALQLQYKGIE 120

Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
           ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKN EDGLF
Sbjct: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLF 180

Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
           CCLVC GMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI
Sbjct: 181 CCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240

Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
           VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300

Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
           EDLISGEKTKN+DSS VVTECRKHVVS+DELIQL+VL VPESI EACEEFFAA LTSMAD
Sbjct: 301 EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLHVPESITEACEEFFAAFLTSMAD 360

Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
           DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420

Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
           VRLLRHTTYPGKNKTGKKRVKPHIAKMLK+KMLAHRAYSLVICQVLGWDIEKLPAIVLK 
Sbjct: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKD 480

Query: 481 EGHGCSLKKLDVLK---DDPVGNAGDNTNEVDDPVR 513
           EGHGCSL KLDVLK   DDPVGNAGDN NEVDDPV+
Sbjct: 481 EGHGCSLTKLDVLKELQDDPVGNAGDNMNEVDDPVK 516

BLAST of Cp4.1LG06g06470 vs. NCBI nr
Match: XP_023535254.1 (uncharacterized protein LOC111796743 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 972 bits (2513), Expect = 0.0
Identity = 489/520 (94.04%), Postives = 489/520 (94.04%), Query Frame = 0

Query: 1   MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
           MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP
Sbjct: 1   MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60

Query: 61  RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
           RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE
Sbjct: 61  RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120

Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
           ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF
Sbjct: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180

Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
           CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI
Sbjct: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240

Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
           VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300

Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
           EDLISGE                               VPESIMEACEEFFAASLTSMAD
Sbjct: 301 EDLISGE-------------------------------VPESIMEACEEFFAASLTSMAD 360

Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
           DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420

Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
           VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG
Sbjct: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480

Query: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 520
           EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID
Sbjct: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 489

BLAST of Cp4.1LG06g06470 vs. NCBI nr
Match: KAG7024795.1 (hypothetical protein SDJN02_13614, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 971 bits (2510), Expect = 0.0
Identity = 480/494 (97.17%), Postives = 485/494 (98.18%), Query Frame = 0

Query: 1   MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
           MNPYSEERLTEEVLYLHSLW RGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP
Sbjct: 1   MNPYSEERLTEEVLYLHSLWWRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60

Query: 61  RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
           RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEER NRVALQLQYKGIE
Sbjct: 61  RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERGNRVALQLQYKGIE 120

Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
           ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKN EDGLF
Sbjct: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLF 180

Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
           CCLVC GMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI
Sbjct: 181 CCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240

Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
           VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300

Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
           EDLISGEKTKN+DSS VVTECRKHVVS+DELIQL+VL VPESI EACEEFFAA LTSMAD
Sbjct: 301 EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLHVPESITEACEEFFAAFLTSMAD 360

Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
           DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420

Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
           VRLLRHTTYPGKNKTGKKRVKPHIAKMLK+KMLAHRAYSLVICQVLGWDIEKLPAI+LKG
Sbjct: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIMLKG 480

Query: 481 EGHGCSLKKLDVLK 494
           EGHGCSL KLDVLK
Sbjct: 481 EGHGCSLTKLDVLK 494

BLAST of Cp4.1LG06g06470 vs. NCBI nr
Match: XP_023535255.1 (uncharacterized protein LOC111796743 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 963 bits (2490), Expect = 0.0
Identity = 487/520 (93.65%), Postives = 487/520 (93.65%), Query Frame = 0

Query: 1   MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
           MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP
Sbjct: 1   MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60

Query: 61  RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
           RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE
Sbjct: 61  RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120

Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
           ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF
Sbjct: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180

Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
           CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI
Sbjct: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240

Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
           VLNGEPLSRSLANSGDFK  PEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLANSGDFK--PEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300

Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
           EDLISGE                               VPESIMEACEEFFAASLTSMAD
Sbjct: 301 EDLISGE-------------------------------VPESIMEACEEFFAASLTSMAD 360

Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
           DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420

Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
           VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG
Sbjct: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480

Query: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 520
           EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID
Sbjct: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 487

BLAST of Cp4.1LG06g06470 vs. NCBI nr
Match: XP_022937203.1 (uncharacterized protein LOC111443568 isoform X1 [Cucurbita moschata])

HSP 1 Score: 954 bits (2467), Expect = 0.0
Identity = 480/520 (92.31%), Postives = 483/520 (92.88%), Query Frame = 0

Query: 1   MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
           MNPYSEERLTEEVLYLHSLW+RGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP
Sbjct: 1   MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60

Query: 61  RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
           RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE
Sbjct: 61  RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120

Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
           ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKN EDGLF
Sbjct: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLF 180

Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
           CCLVC GMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI
Sbjct: 181 CCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240

Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
           VLNGEPLSRSLA SGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300

Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
           EDLISGE                               VPESI EACEEFFAA LTSMAD
Sbjct: 301 EDLISGE-------------------------------VPESITEACEEFFAAFLTSMAD 360

Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
           DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420

Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
           VRLLRHTTYPGKNKTGKKRVKPHIAKMLK+KMLAHRAYSLVICQVLGWDIEKLPAIVLKG
Sbjct: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480

Query: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 520
           EGHGCSL KLDVLKD+PVGNAGDNTNEVDDPVRDDSTEID
Sbjct: 481 EGHGCSLTKLDVLKDNPVGNAGDNTNEVDDPVRDDSTEID 489

BLAST of Cp4.1LG06g06470 vs. ExPASy TrEMBL
Match: A0A6J1FFD4 (uncharacterized protein LOC111443568 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111443568 PE=4 SV=1)

HSP 1 Score: 954 bits (2467), Expect = 0.0
Identity = 480/520 (92.31%), Postives = 483/520 (92.88%), Query Frame = 0

Query: 1   MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
           MNPYSEERLTEEVLYLHSLW+RGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP
Sbjct: 1   MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60

Query: 61  RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
           RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE
Sbjct: 61  RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120

Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
           ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKN EDGLF
Sbjct: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLF 180

Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
           CCLVC GMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI
Sbjct: 181 CCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240

Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
           VLNGEPLSRSLA SGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300

Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
           EDLISGE                               VPESI EACEEFFAA LTSMAD
Sbjct: 301 EDLISGE-------------------------------VPESITEACEEFFAAFLTSMAD 360

Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
           DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420

Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
           VRLLRHTTYPGKNKTGKKRVKPHIAKMLK+KMLAHRAYSLVICQVLGWDIEKLPAIVLKG
Sbjct: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480

Query: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 520
           EGHGCSL KLDVLKD+PVGNAGDNTNEVDDPVRDDSTEID
Sbjct: 481 EGHGCSLTKLDVLKDNPVGNAGDNTNEVDDPVRDDSTEID 489

BLAST of Cp4.1LG06g06470 vs. ExPASy TrEMBL
Match: A0A6J1FAI7 (uncharacterized protein LOC111443568 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111443568 PE=4 SV=1)

HSP 1 Score: 946 bits (2444), Expect = 0.0
Identity = 478/520 (91.92%), Postives = 481/520 (92.50%), Query Frame = 0

Query: 1   MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
           MNPYSEERLTEEVLYLHSLW+RGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP
Sbjct: 1   MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60

Query: 61  RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
           RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE
Sbjct: 61  RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120

Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
           ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKN EDGLF
Sbjct: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLF 180

Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
           CCLVC GMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI
Sbjct: 181 CCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240

Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
           VLNGEPLSRSLA SGDFK  PEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLATSGDFK--PEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300

Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
           EDLISGE                               VPESI EACEEFFAA LTSMAD
Sbjct: 301 EDLISGE-------------------------------VPESITEACEEFFAAFLTSMAD 360

Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
           DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420

Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
           VRLLRHTTYPGKNKTGKKRVKPHIAKMLK+KMLAHRAYSLVICQVLGWDIEKLPAIVLKG
Sbjct: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480

Query: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 520
           EGHGCSL KLDVLKD+PVGNAGDNTNEVDDPVRDDSTEID
Sbjct: 481 EGHGCSLTKLDVLKDNPVGNAGDNTNEVDDPVRDDSTEID 487

BLAST of Cp4.1LG06g06470 vs. ExPASy TrEMBL
Match: A0A6J1IMA4 (uncharacterized protein LOC111476868 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111476868 PE=4 SV=1)

HSP 1 Score: 916 bits (2367), Expect = 0.0
Identity = 463/520 (89.04%), Postives = 472/520 (90.77%), Query Frame = 0

Query: 1   MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
           MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRD KNR+QKKKK 
Sbjct: 1   MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDPKNRRQKKKKS 60

Query: 61  RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
           R EPLQDTGPEWP PEPVQNQP TSSGWPPMPCATPAARLVSSEERANRVALQLQY GIE
Sbjct: 61  RPEPLQDTGPEWPFPEPVQNQPLTSSGWPPMPCATPAARLVSSEERANRVALQLQYNGIE 120

Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
           ACRRFL RNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF
Sbjct: 121 ACRRFLTRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180

Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
           CCLVC GMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQA+CRVFGWDIDRLPTI
Sbjct: 181 CCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAICRVFGWDIDRLPTI 240

Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
           VLNGEPLSRSLA+SGDFKDQPEE+QVAEEHDSWV  ENVAI ND+IDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLAHSGDFKDQPEEDQVAEEHDSWVQIENVAISNDDIDMKNEQKWEEEKTA 300

Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
           E+ ISGE                               VPESIMEACEEFFAA LTSMAD
Sbjct: 301 EESISGE-------------------------------VPESIMEACEEFFAAFLTSMAD 360

Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
           DDVSENNAIEE EEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVC+GAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEECEEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCKGAGKKTLRSFKTC 420

Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
           VRLLRHTTY GKNKTG KRVKPHIAKMLK+KMLAHRAYSLVICQVLGWDIEKLPAIVLKG
Sbjct: 421 VRLLRHTTYTGKNKTGNKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480

Query: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 520
           EGHGCSL KLDVLKDDPVGNAGDNTNEVDDPV+DDSTEID
Sbjct: 481 EGHGCSLTKLDVLKDDPVGNAGDNTNEVDDPVKDDSTEID 489

BLAST of Cp4.1LG06g06470 vs. ExPASy TrEMBL
Match: A0A6J1INL5 (uncharacterized protein LOC111476868 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111476868 PE=4 SV=1)

HSP 1 Score: 907 bits (2344), Expect = 0.0
Identity = 461/520 (88.65%), Postives = 470/520 (90.38%), Query Frame = 0

Query: 1   MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
           MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRD KNR+QKKKK 
Sbjct: 1   MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDPKNRRQKKKKS 60

Query: 61  RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
           R EPLQDTGPEWP PEPVQNQP TSSGWPPMPCATPAARLVSSEERANRVALQLQY GIE
Sbjct: 61  RPEPLQDTGPEWPFPEPVQNQPLTSSGWPPMPCATPAARLVSSEERANRVALQLQYNGIE 120

Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
           ACRRFL RNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF
Sbjct: 121 ACRRFLTRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180

Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
           CCLVC GMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQA+CRVFGWDIDRLPTI
Sbjct: 181 CCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAICRVFGWDIDRLPTI 240

Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
           VLNGEPLSRSLA+SGDFK  PEE+QVAEEHDSWV  ENVAI ND+IDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLAHSGDFK--PEEDQVAEEHDSWVQIENVAISNDDIDMKNEQKWEEEKTA 300

Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
           E+ ISGE                               VPESIMEACEEFFAA LTSMAD
Sbjct: 301 EESISGE-------------------------------VPESIMEACEEFFAAFLTSMAD 360

Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
           DDVSENNAIEE EEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVC+GAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEECEEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCKGAGKKTLRSFKTC 420

Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
           VRLLRHTTY GKNKTG KRVKPHIAKMLK+KMLAHRAYSLVICQVLGWDIEKLPAIVLKG
Sbjct: 421 VRLLRHTTYTGKNKTGNKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480

Query: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 520
           EGHGCSL KLDVLKDDPVGNAGDNTNEVDDPV+DDSTEID
Sbjct: 481 EGHGCSLTKLDVLKDDPVGNAGDNTNEVDDPVKDDSTEID 487

BLAST of Cp4.1LG06g06470 vs. ExPASy TrEMBL
Match: A0A1S3CJZ0 (uncharacterized protein LOC103501816 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501816 PE=4 SV=1)

HSP 1 Score: 587 bits (1514), Expect = 2.63e-204
Identity = 336/543 (61.88%), Postives = 392/543 (72.19%), Query Frame = 0

Query: 1   MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAA--TNKRPRDT---KNRKQ 60
           M+PYS+ERLT+EVLYLHSLW RGPPR PKPT  + STAVA    +NKRP D    KN+ +
Sbjct: 1   MDPYSDERLTKEVLYLHSLWHRGPPRNPKPTHDHSSTAVADPNPSNKRPIDPDRRKNKNK 60

Query: 61  KKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPM-PCATPAARLVSSEERANRVALQL 120
           KKKKPR +P QD+GPEWPCPEPVQNQPSTSSGWPP+ P ATPAA+LVSSEER N  ALQL
Sbjct: 61  KKKKPRSDPPQDSGPEWPCPEPVQNQPSTSSGWPPIQPVATPAAQLVSSEERKNLAALQL 120

Query: 121 QYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKN 180
           QYKG +ACR+F  RNADSGSDEE EEEE +DGE+MES+EY FFL +F+EN+ELR YYEKN
Sbjct: 121 QYKGSDACRKFFARNADSGSDEEEEEEEEDDGEMMESKEYTFFLKMFVENEELRVYYEKN 180

Query: 181 SEDGLFCCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDI 240
            E GLFCCLVC GMGKKK GK+FKNC+ LV HS SIS TKKK AHRAFG  V RVFGWDI
Sbjct: 181 CESGLFCCLVCVGMGKKKFGKKFKNCLALVQHSISISGTKKKRAHRAFGHVVSRVFGWDI 240

Query: 241 DRLPTIVLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKW 300
           DRLPTIVL GEPLSRSLANSGD K QPEE  V  +      NE V++  +E    +EQK 
Sbjct: 241 DRLPTIVLKGEPLSRSLANSGDLKVQPEEIHVDNK------NEVVSVSVNE----DEQKL 300

Query: 301 EEEKTAED-------LISGEKTKNNDSSAVVTECRKHVVSADELI--------QLNVLQV 360
           EE KTAED       LISGE    ND +   T+ +  V +AD  I        +++ L V
Sbjct: 301 EEVKTAEDPTSNSKDLISGE----NDDAYKDTDVKLQVENADNSISGMGESNGEMDNLHV 360

Query: 361 PESIMEACEEFFAASLTSMADDDVSENNAI---EEREEFKFFLKLFIENESLRRYYKNKY 420
             +I+ AC+EF AA   SM DDDVSE  +    EEREEFKFFLKLF ENE+LRRYY+N Y
Sbjct: 361 --TILRACKEFQAAFFRSMNDDDVSEKESTDGAEEREEFKFFLKLFTENENLRRYYENHY 420

Query: 421 DDGEFSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHR 480
            DGEF+CL CE AG+K ++ FKTC RLL+H+T  GKN   K+  KP   K+LK+ MLAHR
Sbjct: 421 GDGEFTCLACEVAGRK-VKCFKTCSRLLQHSTQLGKNNIEKQGQKPQKTKVLKMGMLAHR 480

Query: 481 AYSLVICQVLGWDIEKLPAIVLKGEGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDS 519
           AY+ V+C+VLG DI+ LPAIVL GE  G SL K DV K     +    ++  DD V DDS
Sbjct: 481 AYTSVVCKVLGCDIKMLPAIVLNGEALGLSLTKSDVSKLQDKSDVQMQSSNADDIVEDDS 526

BLAST of Cp4.1LG06g06470 vs. TAIR 10
Match: AT1G78810.1 (unknown protein; Has 75 Blast hits to 52 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 2; Plants - 66; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). )

HSP 1 Score: 207.2 bits (526), Expect = 3.1e-53
Identity = 161/520 (30.96%), Postives = 241/520 (46.35%), Query Frame = 0

Query: 1   MNPYSEERLTEEVLYLHSLWRRGPP-RGPKPTRYY---------------------LSTA 60
           MN Y +E L +EV+YLHSLW +GPP R P P+  +                     L + 
Sbjct: 2   MNIYDDESLKQEVIYLHSLWHQGPPTRKPIPSPNFNLIHDPIQRPRPNYIPPSDLQLLSR 61

Query: 61  VAAAT----NKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPP-MPC 120
             A T    ++ P + +N     K+PR     D+G EWP  + V   PST SGWP   PC
Sbjct: 62  YGAVTPQIISRNPNNPQNLYNNNKRPR----PDSGREWPVND-VPQPPSTGSGWPEYRPC 121

Query: 121 ATPAARLVSSEERANRVALQLQYKGIEACRRFLIRNAD------SGSDEEVEEEEGNDGE 180
                R +S+EE+    A  LQ      CR F  R +       +G DE  E +EG++ +
Sbjct: 122 --KKTRPISAEEKEKLAANMLQRDIHRTCREFFGRKSGEEDSSVAGGDES-EIDEGDEDQ 181

Query: 181 IME------SEEYKFFLNLFMENDELRGYYEKNSEDGLFCCLVCDGMGKKKSGKRFKNCI 240
            +E      S+E++F   +F EN +L+ YYEKN+ +G F CLVC G+G +KS ++FK+C+
Sbjct: 182 SLEKEESSSSKEFQFLSRVFEENVKLKEYYEKNTGNGEFWCLVCGGIG-EKSCRKFKSCL 241

Query: 241 GLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLANSGDFKDQP 300
            L+ HS +I +T  K+ HRA  Q VC V GWD+                           
Sbjct: 242 ALIQHSLTIHKTDLKIQHRALAQVVCNVLGWDV--------------------------- 301

Query: 301 EENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTAEDLISGEKTKNNDSSAVVTEC 360
                                       N      +K ++ ++ G     +DS   + + 
Sbjct: 302 ----------------------------NNPVVSSQKDSQTVVEGASEPPSDSK--IPQE 361

Query: 361 RKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMADDDVSENNAIEEREEFKFFLKL 420
           ++ V+S +E  +  VLQ+ ++  EA ++ F    T  A D   EN      EE +   K+
Sbjct: 362 KQQVMSVEEHAKAAVLQMQQNASEALKDIFVKDGTGAA-DGTEENGDENLSEELELISKV 421

Query: 421 FIENESLRRYYKNKYDDGEFSCLVCEGA-GKKTLRSFKTCVRLLRHTTYPGKNKTGKKRV 480
           F EN  L+ YY+  Y+ G F CLVC  A  KK L+ FK C  +++H T            
Sbjct: 422 FSENVELKSYYEKNYEGGAFICLVCCAATDKKMLKRFKHCYGVVQHCT------------ 437

BLAST of Cp4.1LG06g06470 vs. TAIR 10
Match: AT1G78810.2 (unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 207.2 bits (526), Expect = 3.1e-53
Identity = 161/520 (30.96%), Postives = 241/520 (46.35%), Query Frame = 0

Query: 1   MNPYSEERLTEEVLYLHSLWRRGPP-RGPKPTRYY---------------------LSTA 60
           MN Y +E L +EV+YLHSLW +GPP R P P+  +                     L + 
Sbjct: 2   MNIYDDESLKQEVIYLHSLWHQGPPTRKPIPSPNFNLIHDPIQRPRPNYIPPSDLQLLSR 61

Query: 61  VAAAT----NKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPP-MPC 120
             A T    ++ P + +N     K+PR     D+G EWP  + V   PST SGWP   PC
Sbjct: 62  YGAVTPQIISRNPNNPQNLYNNNKRPR----PDSGREWPVND-VPQPPSTGSGWPEYRPC 121

Query: 121 ATPAARLVSSEERANRVALQLQYKGIEACRRFLIRNAD------SGSDEEVEEEEGNDGE 180
                R +S+EE+    A  LQ      CR F  R +       +G DE  E +EG++ +
Sbjct: 122 --KKTRPISAEEKEKLAANMLQRDIHRTCREFFGRKSGEEDSSVAGGDES-EIDEGDEDQ 181

Query: 181 IME------SEEYKFFLNLFMENDELRGYYEKNSEDGLFCCLVCDGMGKKKSGKRFKNCI 240
            +E      S+E++F   +F EN +L+ YYEKN+ +G F CLVC G+G +KS ++FK+C+
Sbjct: 182 SLEKEESSSSKEFQFLSRVFEENVKLKEYYEKNTGNGEFWCLVCGGIG-EKSCRKFKSCL 241

Query: 241 GLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLANSGDFKDQP 300
            L+ HS +I +T  K+ HRA  Q VC V GWD+                           
Sbjct: 242 ALIQHSLTIHKTDLKIQHRALAQVVCNVLGWDV--------------------------- 301

Query: 301 EENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTAEDLISGEKTKNNDSSAVVTEC 360
                                       N      +K ++ ++ G     +DS   + + 
Sbjct: 302 ----------------------------NNPVVSSQKDSQTVVEGASEPPSDSK--IPQE 361

Query: 361 RKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMADDDVSENNAIEEREEFKFFLKL 420
           ++ V+S +E  +  VLQ+ ++  EA ++ F    T  A D   EN      EE +   K+
Sbjct: 362 KQQVMSVEEHAKAAVLQMQQNASEALKDIFVKDGTGAA-DGTEENGDENLSEELELISKV 421

Query: 421 FIENESLRRYYKNKYDDGEFSCLVCEGA-GKKTLRSFKTCVRLLRHTTYPGKNKTGKKRV 480
           F EN  L+ YY+  Y+ G F CLVC  A  KK L+ FK C  +++H T            
Sbjct: 422 FSENVELKSYYEKNYEGGAFICLVCCAATDKKMLKRFKHCYGVVQHCT------------ 437

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
KAG6591921.10.096.32hypothetical protein SDJN03_14267, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_023535254.10.094.04uncharacterized protein LOC111796743 isoform X1 [Cucurbita pepo subsp. pepo][more]
KAG7024795.10.097.17hypothetical protein SDJN02_13614, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_023535255.10.093.65uncharacterized protein LOC111796743 isoform X2 [Cucurbita pepo subsp. pepo][more]
XP_022937203.10.092.31uncharacterized protein LOC111443568 isoform X1 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
A0A6J1FFD40.092.31uncharacterized protein LOC111443568 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1FAI70.091.92uncharacterized protein LOC111443568 isoform X2 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1IMA40.089.04uncharacterized protein LOC111476868 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1INL50.088.65uncharacterized protein LOC111476868 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A1S3CJZ02.63e-20461.88uncharacterized protein LOC103501816 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
Match NameE-valueIdentityDescription
AT1G78810.13.1e-5330.96unknown protein; Has 75 Blast hits to 52 proteins in 16 species: Archae - 0; Bac... [more]
AT1G78810.23.1e-5330.96unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 498..520
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 37..92
NoneNo IPR availablePANTHERPTHR34546OS06G0153600 PROTEINcoord: 324..507
coord: 1..311

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG06g06470.1Cp4.1LG06g06470.1mRNA