Cp4.1LG15g08100 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG15g08100
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Descriptiontranscription factor SPT20 homolog isoform X1
LocationCp4.1LG15: 8050221 .. 8058279 (+)
RNA-Seq ExpressionCp4.1LG15g08100
SyntenyCp4.1LG15g08100
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGTGGACAGAAAATTAGAAGATATTAAAAAAAAAATCAACTACCAATTAGTGCAAAACCACAGAATAATTAAGCTTGATTATTGATTAATCAACAATAATTTAAACAAAGGCAACAATTGGAAATGAACCAAAATAGAAGGACTGATCTGCAAAATCAGAAGATGGTCGGCGAGCCCATATCACTAATTTCATATCTGGCCGTACACTTCAAATCATCCAAAATATTATAACAAAGTCATCTCCCCATGCCTTTAACGTTTCTTCGATTTTTTCATTTTTATTTGTAATTTAATTTATTTTTAAAATCGAAATTCGTTTCTATAAGATTACTCTGAACCAAATCCATGGGGAGTTCATTCGCGGAAAGAGCCACAAGGAGAAACTTTCCTTCTCGATAATCGCAAAGAAAAAGAAAAAAAATTCATCTCTTTCTTTTGTCGTTCTTTGAATCGGTCCCCGTTTTACTTCCGTCTTTCCGATTTCCCACTGCGATCCATGGCGTCTGGTTCAGCTGGTCGCCCTAATTCGGGCTCTAAGGGGTTTGATTTTGGTACCGATGATGTTCTTTGTTCTTATGAGGACTACGGCAACCAGGAATCTTCCAACGGTAGCCATAGCGATCTCTCCGTCGCGAATTCTAGCAAGGTAGGTCGGAGTTAGTTGTTTTCATATTGCTGCGGTTGGTTTTTTGGTTGTGGTTATGTCTTGTTGGATGTTTGAAATTAAATTCTCGAGTTCTTATTGTTGAGTGTTCGATTAGGGTTTTGGATTGTTGAGTTGCGTTTTTATAGTCGACGGGAATAACTCGCTATTAAACAATGATTTGTATTGGGTTAGGAAATGAGTGATTGAGGCTGGGGATAAACATTCGTGCAGTGACCTTTCCGGATTGTAATGATGTATGCTTTCAATTATTGGTTTTAATTTGTGTATTACGGCTGAGTCGATCCATTTTGAAGTAACTTTGAATCTGATCCTGATTGTCAATAGTGTTGGTAGTACTGTCCAATCTGTAGAGTGGTTCAATCTTGCTTGAGCTGTGTAAAACTTGGTCCTGAGTGTAATGAGCATTTCCCCCTAAGATTTTGAAAATTTCATTGCGTGCTCAGAAATGTTCTTACGATCCTCCTTTGCGAGGAATTGAATCTTTTCTATAATAGTCGGTTCGTGAACAGACAATTGATATATCAGTGTTCGGTATATCTGTTCGTGAGCAGACATAGTGAGAAGTGAGCTGAACACTGTTATATTTTCTGTCTGATATTGACTTTCATTGTTGAAATTGAAATTACAGAGGAAGAACATCCATGAGATTATTTGGCTGCATTGATTATTATTTCTTTTTGATGTTTCTGGGTTCTATCCTCAGTTGTCTCATTGTATTAAGTTTTATAATTATCGGTTTCATTGAATTGATTAACCGAGTTTTGTGTCTTATCCAACAATAGTGGATTCATTGTGTATTATTGGATGAATAATTGAGTTTTGATGTTTGTTTGAGCAGGATTTTCACAAAAGTAGAGTATCTACTGTATATCCTGCTGCTGTTTATGGTCAGCCAGAAGATTCCATGAAACAAGATGTGATTTCTACTGTTGAGAACAGCATGAAAAAGTATTCTGATAACATTTTGCGTTTTCTCGAGGGAATAAGCTCGCGCCTATCACAACTTGAACTGAATTACTACAACCTTGATAAATCTGTTGGAGAAATGCGATCTGACTTGATTCGTGACCACGAAGAGGCAGATTTGAAGCTTAAATCTCTTGAGAAGCATCTGCAAGAAGTAAGCTATGCACTACTTCGAGCAATATGTAGAAATATTATTTACTTTGCAACTTAAACTCAACACTTATCTGTTGAATCTGGAAATTTGTGCACTTGTTGTGAACAATGACGGCCCCCAAGCTCTTTTTTTTGTTATTACCTTTTGTGAGATCTCACATTGGTTGGAGGAGGAACGAAACATTCCTTATAAGGATGTAGAAATCTCTCTCTCGTAAGACACGTTTTAAAACCGTGAGGCCGACAGCGATACGTAACGAGTCAAAGCGGGCAATATCTACTAGTGGTGGGCTTGTGCTTGGGCTGTTACAAATGACATCAGAGCTAGACACCGAGTGGTGTGCCAGCATTGGGGGTGGACTGTGAGATTCCACATTGGTTGGAGAGGGGAGCGAAACATTCCTTATAAAGGTGTAGAAACCTCTCCCTAGTAGACGCGTTTAAAACCGTGAGGCTGACGTTGATACGTAATGGGCTAAAGGGGACAATATCTAGTAGTAGTAGCGGTGGGCATGGGCTTTTACACCATTTTAGTTTTGTTTAAGTTGATACTGATTCTGCTGTATAGGGACGATCACTATATGAAATATGATTTGAAGTTGTCAGCCTCTGCTCTTATCAGTTTCATCTTAATTAAAATTTTCCTTGATGTCCTCCGTAGGTCCACAGGTCTGTGCAGATTATAAGAGACAAGCAAGAGCTCGCCGAGACTCAGAAAGACTTGGCCAAACTTCATCTTCTGCAGAAAGAGTCGTCTTTGTCGAGCCATTCGCATTCAAACGAGGAGAGAGCTTCACCTGGTGCCTTTGATCCTAAGAAGAATGAAATTCCGTCCAAGAATCACAATCAGCAACTAGCTCTTGCCCTGCCGCACCAGATTGTCCCACAGCAACATCCACCTCCTCCAGCAGCTTTGCCGGAGAATGTGCCCCAACAGCAATCTTATTACATCCAGCATCCTCAGAGCCAACATCAAATGACCAATGCCCATGCCCAGCTAAGTCAAACTCCACCACCACCACCACAACAGTTCAGTCAGTATCAACAACAATGGACGCAGCAGCCACCTCAACAGGCACAACCACCACAACAGCATCCTTCTATGCAACCTCAGATCAGGCTGCCGCCTACTTCAGTCTACTCTTCTTATTCGATGAATCAACCGACTTCTATGCCAGAGACTCCGCCTATGCAAGTGTCATTTTCACCTATTCCTCAACCGGGTTCGAGCCGCATGGACACCGTGCAATATGGATATGTTGGAAGTGCTGGTACTATGCCCCAGCAACCTCCTCAAGTAAAAAATGCTTTTGGTGGAGGACCACAAGCCGGAGAAGGATATTTACCTTCTGGACCACAATCTGGGCTTTCCTCGGGAGGTGCATATATGATATATGATAGGGAAAGTGGAAGACCACCGCACCATCCACCGCACCATCCTCCGCAACCTCAGCAACCACCACACCATCCTCCGCAACCACAACAACCGCCACACCATCCTCCTCAGCCCCAACAACCACACTTCAACCAAAGTGGATACCCTCCGGCCAATGTTCAGATTCCTCAGCATCCATCAGGTCCACACGTTATGGCCAGGAATCATCCGAATCAGTCGCATTTTATGCGTAACCAGAACCATCCTTATGGCGAAATAGTTGATAAACTGGTTGGGATGGGTTTCAGGAGTGACCACATTGGCAGTGTAATTCATAGGATGGAGGAGAGCGGGCAACCCATCGACTTCAACGCTGTTCTAGACGGGTTGAGTAATTCTGGAGGTCCTCAGCGGGCATGGTGACAGTAATTTCAGTAGCGGTCGCCTATTTGTATGTTGAAAGCTCATATCGGCCATGACCAGCCTAGCCATTGCGTCTTTTTAATGCATTGAATAAAAGACTGGTTTAGGATTCCACTGTGTTCTTCCTTCTCGTATATATATATATATGGTTTGCGAGATATAAAAGAAGTTGATTGTATCTTATGTTATGAATATTTCTACCCTCAGTCCCAAATCTTATGTTATGAATATTCCTTCGCTTTCCACCTAAAAACAAGGAAAAATGTGAAGGTTCATATGGTGTGTAATGCAATGCTATTTGTTTGCAATTATTCACACAAATATTTTGAGTCAAAGTTATTAACTCTCAAATTATACTGAGGTTGGACTAAGTTATTTGAGCTTGAGGAGAACAAAACACAATTTATAAGGGCGTGGAGACATTCCCTGGCAAACGCGTTTTAAAGCCTTGAGGGGAAGCTTGTAAGGAAAGCCCTTGAGGGAAAGCTCGTAAGCCCAAAGAGGACAAGGCGTGGAGACATTCCCCCCTAGCAGATGCGTTTTAAAGCCTTGAGGGGAAGCTCGCAAGGGAAGCCTCAAGCCCCTGGAGGGAAAGCTCGTAAGCCCAAAGAGCACAATATTTGCTAACGGACTTGGGTTGCTACATTTTTTGGTTTTAAACACCCCTATTTTATACTTTAATCTTAAACTCATTGATTATTGAATACTAATAAAGACCTAAGAATTAGATCAAGTATATAAGTTTACTAAAAACAACGAATTTTTTAATTTTTTCTGGTCATCTTGATCTTATATTATTGCTTTAATTTATATGCTTCATTTTGAAGCATATTAATCATTTATCATATTAAATAATTTTTCTAAAATAAAAAGAATATTAGACACGCATCTTTTGTATTTTTTTTTTTTTTTTTTTTTAAGAAAAAAAAAATATCGAAAACCTTATTCATGTCTATATAAACACAGTCTTGGATTAGAAGTCATCCCAAGATAATTTTGGTCTCGATTATCGGAAACCCCCATCTCTCACTATTTCCTAAAAAATCTCAGCCTTCACCCCCATTCCTTCGATTTCGCACTGCGATCTATGGCGTCTGGTTCCGCAGGTCGCCCTAATTCCGCCCCCAAATCCTTTGATTTTGGTTCTGATAATATCCTCTGCTCATTTGAGGACTACACTAAACAGGAACCTTCAAACGGTAGCCATAGCGATCCAGTCTCCGTTGCCAATTCTAGCAAGGTTCGTGGGGGGTATTACTTGTTCCTACGTTTCTTTGGCTAGTTTTCTGTCCTGGGTTATGGATGTCTGAAAGTAATTTTCGATTTTTCATTGTTGGGTGCGATTAGGGTTTGTGATTTTTGATATTAACATTCTAGTTTTGATCGTAGTTTAATTGGGCGGTGTTTATTTTTGGCTTAAGAAACGAGTACTCGACTGTTAAGGGTAGAGATGATGATCCTTGCCAATCCATTTTTGCATTAGCTGAGTTCTAGGCCTTATGCTTGTCTGGTTTATTGTTGAGCGAACGTTGCTGGTGCTTCCAGTCTCCAGAATGAAGTTAGTTGTTCAATTCAATTGGAGTTTGATCCAATTTTGAATAATCATCCTAGACGAAAGGAACATTTGTTTTTACGTGTTCTTGGGCTTTGTATCTTATCCGCCTTGCATCTTTTATTTGTTCTTGTAAATAATGGGTTCATGAACTGATAGAGTGGAACATTTAACGCAATTATTATTGTTAGTATTTTCCGTTCATAAGCAAGTTTCATTAATGGTTAAAGTTGGTATTACTGATGGAGAATCCCATAGGATTATGAAGCTGCAATAGCAGCTTGCAGTGGCAGCAAGTATTGTTGCTAGTCGTATTGTGGCTCGAGCATCCCCGACGATTTTATGTACTTTTTAATTGCCTTGGATTTATATAAACCATTCTATCAATTTTACTATACATGCAGCATCTCTCAGTTTATGTCTTCTGGAATCATATTAAATTAGGAATTCAATGTGTATTATTTTGTGAACGATTAAAGTTGTGCATATTTATATGGGCAGGATTTTCACAAGAGTAGAATGTCTACTGTATTCCCTGGTGCTGCATATGGTCAACCAGATGATTCCATTAATCAAGATGTGATTACTACTGTTGAGAACAGCATGAAAAAGCATTCCGATAACCTTTTGCGTTTTCTCGAGGGAATAAGTTCACGCCTATCACAGCTTGAACTATATTGCTACAACCTTGATAAATCTGTTGGAGAAATGCGGTCTGACTTAGCCCGTGACCATGAAGAGGCAGAATCCAAGCTTAAATCTATTGAGAAGCATGTACAAGAGGTGAGCCACGACTTTAGAACTTTGAAGAGTTTGTCAGTAGTATCTTATCAATAAGTGTTCGGAATTGGTACAATGAGGATCTTCTTTGATACTTTATGATCTTGAAAAAGTAACATTGTTTTCATTCATATTATACGATCATTTGTCGACACAATAAATGAATTGTTTCTAATGATTCGGTAGAAAGTAAGATTAGTTTGCGACGTAAACCTATCACTTGTTGACTCTTGAATCATGCACTTGATGTGAAAAGCGACTGTGCTTTCCAAGCTGAGTATTGTTATCGTCATTTTTATTTCTTAAATTTTGGTACTGACCAAGAGGATTAGTAACATTTGAATTGAATTTGTACTTGATGCATTTCTGTAGGTTCACAGATCTGTACAGATTATCAGAGACAAGCAAGAGCTTGCTGAGACTCAAAAAGACTTGGCTAAACTTCAGGTCCCACAGAAAGAGCCATCTTTGTCGAGCCATTCGCAGACAAATGAGGAGAGGGTTTCAACCGATCCTAAAAAGAACGAAAATCCATCTGAGATTCACAACCAGCAATTAGCTTTGGCCTTGCCACATCAGATCGTCCCGCAGCAAAATCCTATTACACCCCCTTCAGCAGCTTTGCCTCAGAATGTGCCTCAACAACAGCAATCTTACTACATTTCTTCATCCCAATTGCCTGGTCAACAACCATCCCATATCCAGCACGCTCAGAACCAGTATATCTCATCGGACTCCCAACACCGGGCATCACAACCTCAAGACGTTTCGCATATGACCAATCCCCAGCTAAGTCAAACTCCACAACCATTTAATCAGTATCAACAACAATGGGCGCAGCCACCATCTCAGCCGGCACAACCACCACAACAGGCTTCTATGCAACCTCAGATCAGACCACCGCCTACTTCAGTCTACCCTTCTCCTTACCCACCAAATCAACCAACTTCTATGCCAGAGACTCTGTCAAGCAGCATGCCTATGCAAATGTCTTTTGCATATATTCCTCAACCTGGTTCAAGCCGTGCGGACGCAGTGCCTTATGGGTATGCTGCTTCAAGTGGTGGTTCTGCTCCGCAGCAACCTCCTCAAGTGAAAAATGCTTATGGACCAGCAACAGGCGAGGGCTATATGCCTCCTGGACAACAGCCTGCGCTATCCTCTGGAGGAGCATATATGATGTATGATAGGGAAAGCGGAAGACCCCCACACCATCTCCCTCAACAGCCACATCATCCGTCTCAGCAATCCCACTTCAATCAAAGTGGATATCCTCCGGCCAATGCACCTCATCAGGTTCCTCCTCAAGCTCCAGCAGGCCCCCATGTCTCAGCCAGGAATCCAAGCCATTCACATTTAATCGAAAAACTGGTTGGCATGGGTTTCAGGGGCGACCATGTTGCCAGTATAATTCAGAGAATGGAGGACAGTGGGCAAACTGTTGACTTCAACGCAGTTCTAGACAGATTGAGTACTCCTGCAGGTCCAGGGCCCCAAAGAGCGTGGTGAGTAATTTAATCAACCCCTGTTTGCGGCCCATCCTGGGCATGACCAGCCTCGTAAATTGCATCGTTTTAATGCATTGAATAAATTTTATCCTCGTACATATATTGTTATGGTTTGTGAGATTTAAAAATGTCGGCTGTATGATTTAAACATTGTGTGAATATCTTCTACCTGCTATTTCAAATCTTTTCATATGACTCCCTCTTAATGTTTGTATGCACGTTTAGTAGTCAAAGACATCATTGCTGTTAGCTTGTTAGTGCTAGTTTCCTACTTCCCCAAATCCTTTTTTCCACTCGAGACTTGGGAATAGGTTTGGTTTGGAATGGACATACAACTATGGTAATAGGTAATAGGTTTGGAATGGACATGCGGCCCTGGTCATATGACCGACGATCATGACCTGACCACGACATCGTGTTTCCAAAGAACATAAAAGAAGAAAGAATGTATTCATGCCGACACCTAACACCACAAGGAAAGGAGAAAGAATGCAGTCTTGCCGACGCCCTGTGGGGGTGGGGGTGTCCTTTGCTACGATTCAATGAAACAGCAGTGTTGAAAATGTGCATCGCTATTGCCTGCTTGCTATGATTCAATTTACATTTACTTCCTGACAAACACGTCGTTGAGT

mRNA sequence

ATGGATTACTCTGAACCAAATCCATGGGGAGTTCATTCGCGGAAAGAGCCACAAGGAGAAACTTTCCTTCTCGATAATCGCAAAGAAAAAGAAAAAAAATTCATCTCTTTCTTTTGTCGTTCTTTGAATCGGTCCCCGTTTTACTTCCGTCTTTCCGATTTCCCACTGCGATCCATGGCGTCTGGTTCAGCTGGTCGCCCTAATTCGGGCTCTAAGGGGTTTGATTTTGGTACCGATGATGTTCTTTGTTCTTATGAGGACTACGGCAACCAGGAATCTTCCAACGGTAGCCATAGCGATCTCTCCGTCGCGAATTCTAGCAAGGATTTTCACAAAAGTAGAGTATCTACTGTATATCCTGCTGCTGTTTATGGTCAGCCAGAAGATTCCATGAAACAAGATGTGATTTCTACTGTTGAGAACAGCATGAAAAAGTATTCTGATAACATTTTGCGTTTTCTCGAGGGAATAAGCTCGCGCCTATCACAACTTGAACTGAATTACTACAACCTTGATAAATCTGTTGGAGAAATGCGATCTGACTTGATTCGTGACCACGAAGAGGCAGATTTGAAGCTTAAATCTCTTGAGAAGCATCTGCAAGAAGTCCACAGGTCTGTGCAGATTATAAGAGACAAGCAAGAGCTCGCCGAGACTCAGAAAGACTTGGCCAAACTTCATCTTCTGCAGAAAGAGTCGTCTTTGTCGAGCCATTCGCATTCAAACGAGGAGAGAGCTTCACCTGGTGCCTTTGATCCTAAGAAGAATGAAATTCCGTCCAAGAATCACAATCAGCAACTAGCTCTTGCCCTGCCGCACCAGATTGTCCCACAGCAACATCCACCTCCTCCAGCAGCTTTGCCGGAGAATGTGCCCCAACAGCAATCTTATTACATCCAGCATCCTCAGAGCCAACATCAAATGACCAATGCCCATGCCCAGCTAAGTCAAACTCCACCACCACCACCACAACAGTTCAGTCAGTATCAACAACAATGGACGCAGCAGCCACCTCAACAGGCACAACCACCACAACAGCATCCTTCTATGCAACCTCAGATCAGGCTGCCGCCTACTTCAGTCTACTCTTCTTATTCGATGAATCAACCGACTTCTATGCCAGAGACTCCGCCTATGCAAGTGTCATTTTCACCTATTCCTCAACCGGGTTCGAGCCGCATGGACACCGTGCAATATGGATATGTTGGAAGTGCTGGTACTATGCCCCAGCAACCTCCTCAAGTAAAAAATGCTTTTGGTGGAGGACCACAAGCCGGAGAAGGATATTTACCTTCTGGACCACAATCTGGGCTTTCCTCGGGAGGTGCATATATGATATATGATAGGGAAAGTGGAAGACCACCGCACCATCCACCGCACCATCCTCCGCAACCTCAGCAACCACCACACCATCCTCCGCAACCACAACAACCGCCACACCATCCTCCTCAGCCCCAACAACCACACTTCAACCAAAGTGGATACCCTCCGGCCAATGTTCAGATTCCTCAGCATCCATCAGGTCCACACGTTATGGCCAGGAATCATCCGAATCAGTCGCATTTTATGCGTAACCAGAACCATCCTTATGGCGAAATAGTTGATAAACTGGTTGGGATGGGTTTCAGGAGTGACCACATTGGCAGTGTAATTCATAGGATGGAGGAGAGCGGGCAACCCATCGACTTCAACGCTGTTCTAGACGGCCTTCACCCCCATTCCTTCGATTTCGCACTGCGATCTATGGCGTCTGGTTCCGCAGGTCGCCCTAATTCCGCCCCCAAATCCTTTGATTTTGGTTCTGATAATATCCTCTGCTCATTTGAGGACTACACTAAACAGGAACCTTCAAACGGTAGCCATAGCGATCCAGTCTCCGTTGCCAATTCTAGCAAGGATTTTCACAAGAGTAGAATGTCTACTGTATTCCCTGGTGCTGCATATGGTCAACCAGATGATTCCATTAATCAAGATGTGATTACTACTGTTGAGAACAGCATGAAAAAGCATTCCGATAACCTTTTGCGTTTTCTCGAGGGAATAAGTTCACGCCTATCACAGCTTGAACTATATTGCTACAACCTTGATAAATCTGTTGGAGAAATGCGGTCTGACTTAGCCCGTGACCATGAAGAGGCAGAATCCAAGCTTAAATCTATTGAGAAGCATGTACAAGAGGTTCACAGATCTGTACAGATTATCAGAGACAAGCAAGAGCTTGCTGAGACTCAAAAAGACTTGGCTAAACTTCAGGTCCCACAGAAAGAGCCATCTTTGTCGAGCCATTCGCAGACAAATGAGGAGAGGGTTTCAACCGATCCTAAAAAGAACGAAAATCCATCTGAGATTCACAACCAGCAATTAGCTTTGGCCTTGCCACATCAGATCGTCCCGCAGCAAAATCCTATTACACCCCCTTCAGCAGCTTTGCCTCAGAATGTGCCTCAACAACAGCAATCTTACTACATTTCTTCATCCCAATTGCCTGGTCAACAACCATCCCATATCCAGCACGCTCAGAACCAGTATATCTCATCGGACTCCCAACACCGGGCATCACAACCTCAAGACGTTTCGCATATGACCAATCCCCAGCTAAGTCAAACTCCACAACCATTTAATCAGTATCAACAACAATGGGCGCAGCCACCATCTCAGCCGGCACAACCACCACAACAGGCTTCTATGCAACCTCAGATCAGACCACCGCCTACTTCAGTCTACCCTTCTCCTTACCCACCAAATCAACCAACTTCTATGCCAGAGACTCTGTCAAGCAGCATGCCTATGCAAATGTCTTTTGCATATATTCCTCAACCTGGTTCAAGCCGTGCGGACGCAGTGCCTTATGGGTATGCTGCTTCAAGTGGTGGTTCTGCTCCGCAGCAACCTCCTCAAGTGAAAAATGCTTATGGACCAGCAACAGGCGAGGGCTATATGCCTCCTGGACAACAGCCTGCGCTATCCTCTGGAGGAGCATATATGATGTATGATAGGGAAAGCGGAAGACCCCCACACCATCTCCCTCAACAGCCACATCATCCGTCTCAGCAATCCCACTTCAATCAAAGTGGATATCCTCCGGCCAATGCACCTCATCAGGTTCCTCCTCAAGCTCCAGCAGGCCCCCATGTCTCAGCCAGGAATCCAAGCCATTCACATTTAATCGAAAAACTGGTTGGCATGGGTTTCAGGGGCGACCATGTTGCCAGTATAATTCAGAGAATGGAGGACAGTGGGCAAACTGTTGACTTCAACGCAGTTCTAGACAGATTGAGTACTCCTGCAGGTCCAGGGCCCCAAAGAGCGTGGTGAGTAATTTAATCAACCCCTGTTTGCGGCCCATCCTGGGCATGACCAGCCTCGTAAATTGCATCGTTTTAATGCATTGAATAAATTTTATCCTCGTACATATATTGTTATGGTTTGTGAGATTTAAAAATGTCGGCTGTATGATTTAAACATTGTGTGAATATCTTCTACCTGCTATTTCAAATCTTTTCATATGACTCCCTCTTAATGTTTGTATGCACGTTTAGTAGTCAAAGACATCATTGCTGTTAGCTTGTTAGTGCTAGTTTCCTACTTCCCCAAATCCTTTTTTCCACTCGAGACTTGGGAATAGGTTTGGTTTGGAATGGACATACAACTATGGTAATAGGTAATAGGTTTGGAATGGACATGCGGCCCTGGTCATATGACCGACGATCATGACCTGACCACGACATCGTGTTTCCAAAGAACATAAAAGAAGAAAGAATGTATTCATGCCGACACCTAACACCACAAGGAAAGGAGAAAGAATGCAGTCTTGCCGACGCCCTGTGGGGGTGGGGGTGTCCTTTGCTACGATTCAATGAAACAGCAGTGTTGAAAATGTGCATCGCTATTGCCTGCTTGCTATGATTCAATTTACATTTACTTCCTGACAAACACGTCGTTGAGT

Coding sequence (CDS)

ATGGATTACTCTGAACCAAATCCATGGGGAGTTCATTCGCGGAAAGAGCCACAAGGAGAAACTTTCCTTCTCGATAATCGCAAAGAAAAAGAAAAAAAATTCATCTCTTTCTTTTGTCGTTCTTTGAATCGGTCCCCGTTTTACTTCCGTCTTTCCGATTTCCCACTGCGATCCATGGCGTCTGGTTCAGCTGGTCGCCCTAATTCGGGCTCTAAGGGGTTTGATTTTGGTACCGATGATGTTCTTTGTTCTTATGAGGACTACGGCAACCAGGAATCTTCCAACGGTAGCCATAGCGATCTCTCCGTCGCGAATTCTAGCAAGGATTTTCACAAAAGTAGAGTATCTACTGTATATCCTGCTGCTGTTTATGGTCAGCCAGAAGATTCCATGAAACAAGATGTGATTTCTACTGTTGAGAACAGCATGAAAAAGTATTCTGATAACATTTTGCGTTTTCTCGAGGGAATAAGCTCGCGCCTATCACAACTTGAACTGAATTACTACAACCTTGATAAATCTGTTGGAGAAATGCGATCTGACTTGATTCGTGACCACGAAGAGGCAGATTTGAAGCTTAAATCTCTTGAGAAGCATCTGCAAGAAGTCCACAGGTCTGTGCAGATTATAAGAGACAAGCAAGAGCTCGCCGAGACTCAGAAAGACTTGGCCAAACTTCATCTTCTGCAGAAAGAGTCGTCTTTGTCGAGCCATTCGCATTCAAACGAGGAGAGAGCTTCACCTGGTGCCTTTGATCCTAAGAAGAATGAAATTCCGTCCAAGAATCACAATCAGCAACTAGCTCTTGCCCTGCCGCACCAGATTGTCCCACAGCAACATCCACCTCCTCCAGCAGCTTTGCCGGAGAATGTGCCCCAACAGCAATCTTATTACATCCAGCATCCTCAGAGCCAACATCAAATGACCAATGCCCATGCCCAGCTAAGTCAAACTCCACCACCACCACCACAACAGTTCAGTCAGTATCAACAACAATGGACGCAGCAGCCACCTCAACAGGCACAACCACCACAACAGCATCCTTCTATGCAACCTCAGATCAGGCTGCCGCCTACTTCAGTCTACTCTTCTTATTCGATGAATCAACCGACTTCTATGCCAGAGACTCCGCCTATGCAAGTGTCATTTTCACCTATTCCTCAACCGGGTTCGAGCCGCATGGACACCGTGCAATATGGATATGTTGGAAGTGCTGGTACTATGCCCCAGCAACCTCCTCAAGTAAAAAATGCTTTTGGTGGAGGACCACAAGCCGGAGAAGGATATTTACCTTCTGGACCACAATCTGGGCTTTCCTCGGGAGGTGCATATATGATATATGATAGGGAAAGTGGAAGACCACCGCACCATCCACCGCACCATCCTCCGCAACCTCAGCAACCACCACACCATCCTCCGCAACCACAACAACCGCCACACCATCCTCCTCAGCCCCAACAACCACACTTCAACCAAAGTGGATACCCTCCGGCCAATGTTCAGATTCCTCAGCATCCATCAGGTCCACACGTTATGGCCAGGAATCATCCGAATCAGTCGCATTTTATGCGTAACCAGAACCATCCTTATGGCGAAATAGTTGATAAACTGGTTGGGATGGGTTTCAGGAGTGACCACATTGGCAGTGTAATTCATAGGATGGAGGAGAGCGGGCAACCCATCGACTTCAACGCTGTTCTAGACGGCCTTCACCCCCATTCCTTCGATTTCGCACTGCGATCTATGGCGTCTGGTTCCGCAGGTCGCCCTAATTCCGCCCCCAAATCCTTTGATTTTGGTTCTGATAATATCCTCTGCTCATTTGAGGACTACACTAAACAGGAACCTTCAAACGGTAGCCATAGCGATCCAGTCTCCGTTGCCAATTCTAGCAAGGATTTTCACAAGAGTAGAATGTCTACTGTATTCCCTGGTGCTGCATATGGTCAACCAGATGATTCCATTAATCAAGATGTGATTACTACTGTTGAGAACAGCATGAAAAAGCATTCCGATAACCTTTTGCGTTTTCTCGAGGGAATAAGTTCACGCCTATCACAGCTTGAACTATATTGCTACAACCTTGATAAATCTGTTGGAGAAATGCGGTCTGACTTAGCCCGTGACCATGAAGAGGCAGAATCCAAGCTTAAATCTATTGAGAAGCATGTACAAGAGGTTCACAGATCTGTACAGATTATCAGAGACAAGCAAGAGCTTGCTGAGACTCAAAAAGACTTGGCTAAACTTCAGGTCCCACAGAAAGAGCCATCTTTGTCGAGCCATTCGCAGACAAATGAGGAGAGGGTTTCAACCGATCCTAAAAAGAACGAAAATCCATCTGAGATTCACAACCAGCAATTAGCTTTGGCCTTGCCACATCAGATCGTCCCGCAGCAAAATCCTATTACACCCCCTTCAGCAGCTTTGCCTCAGAATGTGCCTCAACAACAGCAATCTTACTACATTTCTTCATCCCAATTGCCTGGTCAACAACCATCCCATATCCAGCACGCTCAGAACCAGTATATCTCATCGGACTCCCAACACCGGGCATCACAACCTCAAGACGTTTCGCATATGACCAATCCCCAGCTAAGTCAAACTCCACAACCATTTAATCAGTATCAACAACAATGGGCGCAGCCACCATCTCAGCCGGCACAACCACCACAACAGGCTTCTATGCAACCTCAGATCAGACCACCGCCTACTTCAGTCTACCCTTCTCCTTACCCACCAAATCAACCAACTTCTATGCCAGAGACTCTGTCAAGCAGCATGCCTATGCAAATGTCTTTTGCATATATTCCTCAACCTGGTTCAAGCCGTGCGGACGCAGTGCCTTATGGGTATGCTGCTTCAAGTGGTGGTTCTGCTCCGCAGCAACCTCCTCAAGTGAAAAATGCTTATGGACCAGCAACAGGCGAGGGCTATATGCCTCCTGGACAACAGCCTGCGCTATCCTCTGGAGGAGCATATATGATGTATGATAGGGAAAGCGGAAGACCCCCACACCATCTCCCTCAACAGCCACATCATCCGTCTCAGCAATCCCACTTCAATCAAAGTGGATATCCTCCGGCCAATGCACCTCATCAGGTTCCTCCTCAAGCTCCAGCAGGCCCCCATGTCTCAGCCAGGAATCCAAGCCATTCACATTTAATCGAAAAACTGGTTGGCATGGGTTTCAGGGGCGACCATGTTGCCAGTATAATTCAGAGAATGGAGGACAGTGGGCAAACTGTTGACTTCAACGCAGTTCTAGACAGATTGAGTACTCCTGCAGGTCCAGGGCCCCAAAGAGCGTGGTGA

Protein sequence

MDYSEPNPWGVHSRKEPQGETFLLDNRKEKEKKFISFFCRSLNRSPFYFRLSDFPLRSMASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRVSTVYPAAVYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSSHSHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQSYYIQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWTQQPPQQAQPPQQHPSMQPQIRLPPTSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRESGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAVLDGLHPHSFDFALRSMASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSDPVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVITTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSHMTNPQLSQTPQPFNQYQQQWAQPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPNQPTSMPETLSSSMPMQMSFAYIPQPGSSRADAVPYGYAASSGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRESGRPPHHLPQQPHHPSQQSHFNQSGYPPANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQRAW
Homology
BLAST of Cp4.1LG15g08100 vs. NCBI nr
Match: KAG7011974.1 (hypothetical protein SDJN02_26882, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1977 bits (5123), Expect = 0.0
Identity = 1040/1087 (95.68%), Postives = 1052/1087 (96.78%), Query Frame = 0

Query: 21   TFLLDNRKEKEKKFISFFCRSLNRSPFYFRLSDFPLRSMASGSAGRPNSGSKGFDFGTDD 80
            +F +  +K+K+  + S FCRSLNRSPFYFRLSDFPLRSMASGSAGRPNSGSKGFDFGTDD
Sbjct: 29   SFSIIAKKKKKIIYFSIFCRSLNRSPFYFRLSDFPLRSMASGSAGRPNSGSKGFDFGTDD 88

Query: 81   VLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRVSTVYPAAVYGQPEDSMKQDVISTVE 140
            VLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSR+STVYPAA YGQPEDSMKQDVISTVE
Sbjct: 89   VLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVE 148

Query: 141  NSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHL 200
            NSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHL
Sbjct: 149  NSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHL 208

Query: 201  QEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSSHSHSNEERASPGAFDPKKNEIPS 260
            QEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLS HSHSNEERASPGAFDPKKNEIPS
Sbjct: 209  QEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEIPS 268

Query: 261  KNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQSYYIQHPQSQHQMTNAHAQLSQTPP 320
            KNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQ YYIQHPQSQHQMTNAHAQLSQTPP
Sbjct: 269  KNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQLSQTPP 328

Query: 321  PPPQQFSQYQQQWTQQPPQQAQPPQQHPSMQPQIRLPPTSVYSSYSMNQPTSMPETPPMQ 380
            PPPQQFSQYQQQW QQPPQQAQPPQQHPSMQPQIRLPPTSVYSSYSMNQPTSMPETPPMQ
Sbjct: 329  PPPQQFSQYQQQWPQQPPQQAQPPQQHPSMQPQIRLPPTSVYSSYSMNQPTSMPETPPMQ 388

Query: 381  VSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSS 440
            VSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSS
Sbjct: 389  VSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSS 448

Query: 441  GGAYMIYDRESGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANV 500
            GGAYMIYDRE+GRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANV
Sbjct: 449  GGAYMIYDRENGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANV 508

Query: 501  QIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESGQP 560
            QIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESGQP
Sbjct: 509  QIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESGQP 568

Query: 561  IDFNAVLDGLHPHSFDFALRSMASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGS 620
            IDFNAVLDGL             S S GRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGS
Sbjct: 569  IDFNAVLDGL-------------SNSGGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGS 628

Query: 621  HSDPVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVITTVENSMKKHSDNLLRFLEG 680
            HS+PVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVI  VENSMKKHSDNLLRFLEG
Sbjct: 629  HSNPVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVIAAVENSMKKHSDNLLRFLEG 688

Query: 681  ISSRLSQLELYCYNLDKSVGEMRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQEL 740
            ISSRLSQLELYCYNLDKSVGEMRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQEL
Sbjct: 689  ISSRLSQLELYCYNLDKSVGEMRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQEL 748

Query: 741  AETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQ 800
            AETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQ
Sbjct: 749  AETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQ 808

Query: 801  NPIT-PPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVS 860
            NPIT PPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVS
Sbjct: 809  NPITAPPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVS 868

Query: 861  HMTNPQLSQTPQPFNQYQQQWAQPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPP-NQPT 920
             MTNPQLSQTPQPFNQYQQQWAQPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPP NQPT
Sbjct: 869  QMTNPQLSQTPQPFNQYQQQWAQPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPPNQPT 928

Query: 921  SMPETLSSSMPMQMSFAYIPQPGSSRADAVPYGYAASSGGSAPQQPPQVKNAYGPATGEG 980
            SMPETLSSSMPMQMSFA IPQPGSSRADAVPYGYAA+SGGSAPQQPPQVKNAYGPATGEG
Sbjct: 929  SMPETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEG 988

Query: 981  YMPPGQQPALSSGGAYMMYDRESGRPPHHLPQQPHH-PSQQSHFNQSGYPPANAPHQVPP 1040
            YMPPGQQPALSSGGAYMMYDRESGRPPHHLPQQPHH PSQQSHF+QSGYPPANAPHQVPP
Sbjct: 989  YMPPGQQPALSSGGAYMMYDRESGRPPHHLPQQPHHHPSQQSHFSQSGYPPANAPHQVPP 1048

Query: 1041 QAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQTVDFNAVLDRLSTPAG 1100
            QAP GPHVSARNPSHSHLIEKLVGMGFRGDHV +IIQRMEDSGQTVDFNAVLDRLSTPAG
Sbjct: 1049 QAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVGNIIQRMEDSGQTVDFNAVLDRLSTPAG 1102

Query: 1101 PGPQRAW 1104
            PGPQRAW
Sbjct: 1109 PGPQRAW 1102

BLAST of Cp4.1LG15g08100 vs. NCBI nr
Match: KAG6589219.1 (hypothetical protein SDJN03_17784, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1452 bits (3758), Expect = 0.0
Identity = 839/1099 (76.34%), Postives = 898/1099 (81.71%), Query Frame = 0

Query: 59   MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRVSTV 118
            MASGSAGRPNSGSK FDFG++D+LCSYEDYGNQESSNG+H+DLSVANSSKDFHKSR+STV
Sbjct: 1    MASGSAGRPNSGSKAFDFGSNDILCSYEDYGNQESSNGTHTDLSVANSSKDFHKSRMSTV 60

Query: 119  YPAAVYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEM 178
            YPAA Y QPEDS+KQDVISTVENSMKKYSDNILRFLEGISSRLSQLELN YNLDKSVGEM
Sbjct: 61   YPAAAYAQPEDSIKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNCYNLDKSVGEM 120

Query: 179  RSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSSH 238
            RSD+IRDHEE DLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHL+QKES  SSH
Sbjct: 121  RSDVIRDHEEEDLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLVQKESPSSSH 180

Query: 239  SHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHP---PPPAALPENVPQQQ 298
            SHSNEERASP A DPK NE PS+NHNQQLALALPHQ++ QQ+P   PPPAALP+N+PQQQ
Sbjct: 181  SHSNEERASPVASDPK-NENPSENHNQQLALALPHQVLQQQNPLTPPPPAALPQNMPQQQ 240

Query: 299  SYYI------------QHPQSQHQM----TNAHAQLSQTPPPPPQQFSQYQQQWT----- 358
            +YYI            QH Q Q+Q      +   Q SQTPPP  QQF+QY QQWT     
Sbjct: 241  AYYISSTHLPNQLANIQHGQGQYQQQLQDVSRLPQPSQTPPP--QQFNQYPQQWTTQQQQ 300

Query: 359  -QQPPQQAQPPQQH-PSMQPQIRLPPTSVYSSYSMNQPTSMPET----PPMQVSFSPIPQ 418
             QQPPQ  QPPQQ  PSMQPQIR  P+SVY SYSMNQPTSMPET     PMQ +FSP+PQ
Sbjct: 301  QQQPPQPVQPPQQQQPSMQPQIRPQPSSVYLSYSMNQPTSMPETLPNSMPMQATFSPMPQ 360

Query: 419  PGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYD 478
            PGSSR+DTV YGY GS  T+PQQPPQVKNAFG  P AGEGYLPSGPQ  LSSGG+YM+YD
Sbjct: 361  PGSSRVDTVPYGYAGSGSTVPQQPPQVKNAFG--PPAGEGYLPSGPQPALSSGGSYMMYD 420

Query: 479  RESGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPAN--VQIPQHP 538
            RESGRP HHP               QPQQ         QPHFNQ  YPPAN  +QIPQ  
Sbjct: 421  RESGRPHHHP---------------QPQQ---------QPHFNQGVYPPANASLQIPQQ- 480

Query: 539  SGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAV 598
            SGPHV+ARN P+ +H MRNQ+HPYGEIV+KLVGMGFRSDHI SVIHRMEESGQPIDFNAV
Sbjct: 481  SGPHVVARN-PSHAHLMRNQSHPYGEIVEKLVGMGFRSDHIASVIHRMEESGQPIDFNAV 540

Query: 599  LDGL----------------HPHSFDFALRSMASGSAGRPNSAPKSFDFGSDNILCSFED 658
            LDGL                H  SFD ++R MASGSAGR NSAPK+FDFGSD+ILCS+ED
Sbjct: 541  LDGLSNPGGPQRASEENSGLHSCSFDHSVRFMASGSAGRSNSAPKAFDFGSDDILCSYED 600

Query: 659  YTKQEPSNGSHSDPVSVANSSKDFHKSRMSTVFPGAA-YGQPDDSINQDVITTVENSMKK 718
            Y KQ+ SNGSHSDPVSV NSSKDFHK RMST FP AA YGQPDDSI QDVI+ VENSMKK
Sbjct: 601  YGKQDTSNGSHSDPVSVTNSSKDFHKVRMSTAFPAAASYGQPDDSITQDVISAVENSMKK 660

Query: 719  HSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSDLARDHEEAESKLKSIEKHVQEVHR 778
            HSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSD+ARDHEE +SKLKS+EKH+QEVHR
Sbjct: 661  HSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSDVARDHEEVDSKLKSLEKHLQEVHR 720

Query: 779  SVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEER---VSTDPKKNENPSEIHNQ 838
            SVQIIRDKQELAETQKDLAKLQV QKEPS SSHSQ+NEER   V++DPKKNEN SEIH Q
Sbjct: 721  SVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERASSVASDPKKNENLSEIHGQ 780

Query: 839  QLALALPHQIVPQQNPITPPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISS 898
            QLALALPHQIVPQQNPI P SA LP NVPQQQ SYYIS +QL GQ P HIQHA  QYIS 
Sbjct: 781  QLALALPHQIVPQQNPIAPASATLPPNVPQQQ-SYYISPTQLSGQPP-HIQHAPGQYISP 840

Query: 899  DSQHRASQPQDVSHMTNPQLSQTP-QPFNQYQQQWAQPPSQPAQPPQQASMQPQIRPPPT 958
            D QHRA QPQDVS  TNPQLSQ+P QPFNQYQQQWAQ PSQ  QPPQQ+SMQPQIRPPPT
Sbjct: 841  DPQHRALQPQDVS--TNPQLSQSPPQPFNQYQQQWAQVPSQQPQPPQQSSMQPQIRPPPT 900

Query: 959  SVYPSPYPPNQPTSMPETLSSSMPMQMSFAYIPQPGSSRADAVPYGYAASSGGSAPQQPP 1018
            S YP  YPPNQP+S+PETLSS+M    SFA IP PGSSR D VPYGYAA+SGGS+PQQPP
Sbjct: 901  SGYP--YPPNQPSSVPETLSSTM----SFASIPNPGSSRPDPVPYGYAAASGGSSPQQPP 960

Query: 1019 QVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRESGRPPHHLPQQPHHPSQQSHFNQSG 1078
            QVKN YGPATGEGY+PPGQ        AYMMYDRESGRPPHH PQQPH       FNQSG
Sbjct: 961  QVKNTYGPATGEGYLPPGQP-------AYMMYDRESGRPPHHPPQQPH-------FNQSG 1020

Query: 1079 YPPANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQTVDF 1104
            YPPANAPHQ+P QA   P VS+RNPSHSHLIEKLVGMGFRGDHVASIIQRMED G+ VDF
Sbjct: 1021 YPPANAPHQIP-QAATVPPVSSRNPSHSHLIEKLVGMGFRGDHVASIIQRMEDRGEPVDF 1043

BLAST of Cp4.1LG15g08100 vs. NCBI nr
Match: KAG7022919.1 (hypothetical protein SDJN02_16655, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1402 bits (3628), Expect = 0.0
Identity = 820/1081 (75.86%), Postives = 873/1081 (80.76%), Query Frame = 0

Query: 59   MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRVSTV 118
            MASGSAGRPNSGSK FDFG++D+LCSYEDYGNQESSNG+H+DLSVANSSKDFHKSR+STV
Sbjct: 1    MASGSAGRPNSGSKAFDFGSNDILCSYEDYGNQESSNGTHTDLSVANSSKDFHKSRMSTV 60

Query: 119  YPAAVYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEM 178
            YPAA Y QPEDS+KQDVISTVENSMKKYSDNILRFLEGISSRLSQLELN YNLDKSVGEM
Sbjct: 61   YPAAAYAQPEDSIKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNCYNLDKSVGEM 120

Query: 179  RSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSSH 238
            RSD+IRDHEE DLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHL+QKES  SSH
Sbjct: 121  RSDVIRDHEEEDLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLVQKESPSSSH 180

Query: 239  SHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHP---PPPAALPENVPQQQ 298
            SHSNEERASP A DPK NE PS+NHNQQLALALPHQ++ QQ+P   PPPAALP+NVPQQQ
Sbjct: 181  SHSNEERASPVASDPK-NENPSENHNQQLALALPHQVLQQQNPLTPPPPAALPQNVPQQQ 240

Query: 299  SYYI------------QHPQSQHQM----TNAHAQLSQTPPPPPQQFSQYQQQWT----- 358
            +YYI            QH Q Q+Q      +   Q SQTPPP  QQF+QY QQWT     
Sbjct: 241  AYYISSTHLPNQLAHIQHGQGQYQQQLQDVSRLPQPSQTPPP--QQFNQYPQQWTTQQQQ 300

Query: 359  QQPPQQAQPPQQH-PSMQPQIRLPPTSVYSSYSMNQPTSMPET----PPMQVSFSPIPQP 418
            QQPPQ  QPPQQ  PSMQPQIR  P+SVY SYSMNQPTSMPET     PMQ +FSP+PQP
Sbjct: 301  QQPPQPVQPPQQQQPSMQPQIRPQPSSVYLSYSMNQPTSMPETLPNSMPMQATFSPMPQP 360

Query: 419  GSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDR 478
            GSSR+DTV YGY GS  T+PQQPPQVKNAFG  P AGEGYLPSGPQ  LSSGG+YM+YDR
Sbjct: 361  GSSRVDTVPYGYAGSGSTVPQQPPQVKNAFG--PPAGEGYLPSGPQPALSSGGSYMMYDR 420

Query: 479  ESGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPAN--VQIPQHPS 538
            ESGRP HHP               QPQQ         QPHFNQ  YPPAN  +QIPQ  S
Sbjct: 421  ESGRPHHHP---------------QPQQ---------QPHFNQGVYPPANASLQIPQQ-S 480

Query: 539  GPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAVL 598
            GPHV+ARN P+ +H MRNQ+HPYGEIV+KLVGMGFRSDHI SVIHRMEESGQPIDFNAVL
Sbjct: 481  GPHVVARN-PSHAHLMRNQSHPYGEIVEKLVGMGFRSDHIASVIHRMEESGQPIDFNAVL 540

Query: 599  DGLHPHSFDFALRSMASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSDPVSV 658
            DGL          S  S   GR NSAPK+FDFGSD+ILCS+EDY KQ+ SNGSHSDPVSV
Sbjct: 541  DGLSNPGGPQRC-SPTSYIHGRSNSAPKAFDFGSDDILCSYEDYGKQDTSNGSHSDPVSV 600

Query: 659  ANSSKDFHKSRMSTVFPGAA-YGQPDDSINQDVITTVENSMKKHSDNLLRFLEGISSRLS 718
             NSSKDFHK RMST FP AA YGQPDDSI QDVI+ VENSMKKHSDNLLRFLEGISSRLS
Sbjct: 601  TNSSKDFHKVRMSTAFPAAASYGQPDDSITQDVISAVENSMKKHSDNLLRFLEGISSRLS 660

Query: 719  QLELYCYNLDKSVGEMRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKD 778
            QLELYCYNLDKSVGEMRSD+ARDHEE              VHRSVQIIRDKQELAETQKD
Sbjct: 661  QLELYCYNLDKSVGEMRSDVARDHEE--------------VHRSVQIIRDKQELAETQKD 720

Query: 779  LAKLQVPQKEPSLSSHSQTNEER---VSTDPKKNENPSEIHNQQLALALPHQIVPQQNPI 838
            LAKLQV QKEPS SSHSQ+NEER   V++DPKKNEN SEIH QQLALALPHQIVPQQNPI
Sbjct: 721  LAKLQVSQKEPSSSSHSQSNEERASSVASDPKKNENLSEIHGQQLALALPHQIVPQQNPI 780

Query: 839  TPPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSHMTN 898
             P SA LP NVPQQQ SYYIS +QL GQ P HIQHA  QYIS D QHRA QPQDVS  TN
Sbjct: 781  APASATLPPNVPQQQ-SYYISPTQLSGQPP-HIQHAPGQYISPDPQHRALQPQDVS--TN 840

Query: 899  PQLSQTP-QPFNQYQQQWAQPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPNQPTSMPE 958
            PQLSQ+P QPFNQYQQQWAQ PSQ  QPPQQ+SMQPQIRPPPTS YP  YPPNQP+S+PE
Sbjct: 841  PQLSQSPPQPFNQYQQQWAQVPSQQPQPPQQSSMQPQIRPPPTSGYP--YPPNQPSSVPE 900

Query: 959  TLSSSMPMQMSFAYIPQPGSSRADAVPYGYAASSGGSAPQQPPQVKNAYGPATGEGYMPP 1018
            TLSS+M    SFA IP PGSSR D VPYGYAA+SGGS+PQQPPQVKN YGPATGEGY+PP
Sbjct: 901  TLSSTM----SFASIPNPGSSRPDPVPYGYAAASGGSSPQQPPQVKNTYGPATGEGYLPP 960

Query: 1019 GQQPALSSGGAYMMYDRESGRPPHHLPQQPHHPSQQSHFNQSGYPPANAPHQVPPQAPAG 1078
            GQ        AYMMYDRESGRPPHH PQQPH       FNQSGYPPANAPHQ+P QA   
Sbjct: 961  GQP-------AYMMYDRESGRPPHHPPQQPH-------FNQSGYPPANAPHQIP-QAATV 1010

Query: 1079 PHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQR 1103
            P VS+RNPSHSHLIEKLVGMGFRGDHVASIIQRMED G+ VDFN VLDRLS+ A PGPQR
Sbjct: 1021 PSVSSRNPSHSHLIEKLVGMGFRGDHVASIIQRMEDRGEPVDFNGVLDRLSSSASPGPQR 1010

BLAST of Cp4.1LG15g08100 vs. NCBI nr
Match: XP_023554446.1 (trithorax group protein osa-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1006 bits (2602), Expect = 0.0
Identity = 523/523 (100.00%), Postives = 523/523 (100.00%), Query Frame = 0

Query: 582  MASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSDPVSVANSSKDFHKSRMST 641
            MASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSDPVSVANSSKDFHKSRMST
Sbjct: 1    MASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSDPVSVANSSKDFHKSRMST 60

Query: 642  VFPGAAYGQPDDSINQDVITTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 701
            VFPGAAYGQPDDSINQDVITTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE
Sbjct: 61   VFPGAAYGQPDDSINQDVITTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120

Query: 702  MRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSS 761
            MRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSS
Sbjct: 121  MRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSS 180

Query: 762  HSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNVPQQQQSY 821
            HSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNVPQQQQSY
Sbjct: 181  HSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNVPQQQQSY 240

Query: 822  YISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSHMTNPQLSQTPQPFNQYQQQWA 881
            YISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSHMTNPQLSQTPQPFNQYQQQWA
Sbjct: 241  YISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSHMTNPQLSQTPQPFNQYQQQWA 300

Query: 882  QPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPNQPTSMPETLSSSMPMQMSFAYIPQPG 941
            QPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPNQPTSMPETLSSSMPMQMSFAYIPQPG
Sbjct: 301  QPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPNQPTSMPETLSSSMPMQMSFAYIPQPG 360

Query: 942  SSRADAVPYGYAASSGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRES 1001
            SSRADAVPYGYAASSGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRES
Sbjct: 361  SSRADAVPYGYAASSGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRES 420

Query: 1002 GRPPHHLPQQPHHPSQQSHFNQSGYPPANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVG 1061
            GRPPHHLPQQPHHPSQQSHFNQSGYPPANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVG
Sbjct: 421  GRPPHHLPQQPHHPSQQSHFNQSGYPPANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVG 480

Query: 1062 MGFRGDHVASIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQRAW 1104
            MGFRGDHVASIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQRAW
Sbjct: 481  MGFRGDHVASIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQRAW 523

BLAST of Cp4.1LG15g08100 vs. NCBI nr
Match: XP_022969058.1 (ataxin-2 homolog [Cucurbita maxima])

HSP 1 Score: 985 bits (2547), Expect = 0.0
Identity = 513/523 (98.09%), Postives = 516/523 (98.66%), Query Frame = 0

Query: 582  MASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSDPVSVANSSKDFHKSRMST 641
            MASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSDPVSVANSSKDFHKSRMST
Sbjct: 1    MASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSDPVSVANSSKDFHKSRMST 60

Query: 642  VFPGAAYGQPDDSINQDVITTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 701
            VFPGAAYGQPDDSINQDVI TVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE
Sbjct: 61   VFPGAAYGQPDDSINQDVIATVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120

Query: 702  MRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSS 761
            MRSDLARDHEEA+SKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSS
Sbjct: 121  MRSDLARDHEEADSKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSS 180

Query: 762  HSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNVPQQQQSY 821
            HSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQNP+TPPSAALPQNVPQQ QSY
Sbjct: 181  HSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQNPMTPPSAALPQNVPQQHQSY 240

Query: 822  YISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSHMTNPQLSQTPQPFNQYQQQWA 881
            YISSSQLPGQQPSHIQHAQNQYISSDS HRASQPQDVS MTNPQLSQTPQPFNQYQQQWA
Sbjct: 241  YISSSQLPGQQPSHIQHAQNQYISSDSHHRASQPQDVSQMTNPQLSQTPQPFNQYQQQWA 300

Query: 882  QPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPNQPTSMPETLSSSMPMQMSFAYIPQPG 941
            QPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPNQPTSMPETLSSSMPMQMSFA IPQPG
Sbjct: 301  QPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPG 360

Query: 942  SSRADAVPYGYAASSGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRES 1001
            SSRADAVPYGYAA+SGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRES
Sbjct: 361  SSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRES 420

Query: 1002 GRPPHHLPQQPHHPSQQSHFNQSGYPPANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVG 1061
            GRPPHHLPQQPHHPSQQSHFNQSGYPPANAP QVPPQAP GPHVSARNPSHSHLIEKLVG
Sbjct: 421  GRPPHHLPQQPHHPSQQSHFNQSGYPPANAPPQVPPQAPTGPHVSARNPSHSHLIEKLVG 480

Query: 1062 MGFRGDHVASIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQRAW 1104
            MGFRGDHVASIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQRAW
Sbjct: 481  MGFRGDHVASIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQRAW 523

BLAST of Cp4.1LG15g08100 vs. ExPASy TrEMBL
Match: A0A6J1HZW1 (ataxin-2 homolog OS=Cucurbita maxima OX=3661 GN=LOC111468169 PE=4 SV=1)

HSP 1 Score: 985 bits (2547), Expect = 0.0
Identity = 513/523 (98.09%), Postives = 516/523 (98.66%), Query Frame = 0

Query: 582  MASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSDPVSVANSSKDFHKSRMST 641
            MASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSDPVSVANSSKDFHKSRMST
Sbjct: 1    MASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSDPVSVANSSKDFHKSRMST 60

Query: 642  VFPGAAYGQPDDSINQDVITTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 701
            VFPGAAYGQPDDSINQDVI TVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE
Sbjct: 61   VFPGAAYGQPDDSINQDVIATVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120

Query: 702  MRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSS 761
            MRSDLARDHEEA+SKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSS
Sbjct: 121  MRSDLARDHEEADSKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSS 180

Query: 762  HSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNVPQQQQSY 821
            HSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQNP+TPPSAALPQNVPQQ QSY
Sbjct: 181  HSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQNPMTPPSAALPQNVPQQHQSY 240

Query: 822  YISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSHMTNPQLSQTPQPFNQYQQQWA 881
            YISSSQLPGQQPSHIQHAQNQYISSDS HRASQPQDVS MTNPQLSQTPQPFNQYQQQWA
Sbjct: 241  YISSSQLPGQQPSHIQHAQNQYISSDSHHRASQPQDVSQMTNPQLSQTPQPFNQYQQQWA 300

Query: 882  QPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPNQPTSMPETLSSSMPMQMSFAYIPQPG 941
            QPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPNQPTSMPETLSSSMPMQMSFA IPQPG
Sbjct: 301  QPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPG 360

Query: 942  SSRADAVPYGYAASSGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRES 1001
            SSRADAVPYGYAA+SGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRES
Sbjct: 361  SSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRES 420

Query: 1002 GRPPHHLPQQPHHPSQQSHFNQSGYPPANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVG 1061
            GRPPHHLPQQPHHPSQQSHFNQSGYPPANAP QVPPQAP GPHVSARNPSHSHLIEKLVG
Sbjct: 421  GRPPHHLPQQPHHPSQQSHFNQSGYPPANAPPQVPPQAPTGPHVSARNPSHSHLIEKLVG 480

Query: 1062 MGFRGDHVASIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQRAW 1104
            MGFRGDHVASIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQRAW
Sbjct: 481  MGFRGDHVASIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQRAW 523

BLAST of Cp4.1LG15g08100 vs. ExPASy TrEMBL
Match: A0A6J1GLD5 (class E vacuolar protein-sorting machinery protein hse1-like OS=Cucurbita moschata OX=3662 GN=LOC111455039 PE=4 SV=1)

HSP 1 Score: 981 bits (2536), Expect = 0.0
Identity = 515/525 (98.10%), Postives = 518/525 (98.67%), Query Frame = 0

Query: 582  MASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSDPVSVANSSKDFHKSRMST 641
            MASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSDPVSVANSSKDFHKSRMST
Sbjct: 1    MASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSDPVSVANSSKDFHKSRMST 60

Query: 642  VFPGAAYGQPDDSINQDVITTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 701
            VFPGAAYGQPDDSINQDVI  VENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE
Sbjct: 61   VFPGAAYGQPDDSINQDVIAAVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120

Query: 702  MRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSS 761
            MRSDLARDHEEA+SKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSS
Sbjct: 121  MRSDLARDHEEADSKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSS 180

Query: 762  HSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQNPIT-PPSAALPQNVPQQQQS 821
            HSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQNPIT PPSAALPQNVPQQQQS
Sbjct: 181  HSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQNPITAPPSAALPQNVPQQQQS 240

Query: 822  YYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSHMTNPQLSQTPQPFNQYQQQW 881
            YYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVS MTNPQLSQTPQPFNQYQQQW
Sbjct: 241  YYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQTPQPFNQYQQQW 300

Query: 882  AQPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPP-NQPTSMPETLSSSMPMQMSFAYIPQ 941
            AQPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPP NQPTSMPETLSSSMPMQMSFA IPQ
Sbjct: 301  AQPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPPNQPTSMPETLSSSMPMQMSFASIPQ 360

Query: 942  PGSSRADAVPYGYAASSGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDR 1001
            PGSSRADAVPYGYAA+SGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDR
Sbjct: 361  PGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDR 420

Query: 1002 ESGRPPHHLPQQPHHPSQQSHFNQSGYPPANAPHQVPPQAPAGPHVSARNPSHSHLIEKL 1061
            ESGRPPHHLPQQPHHPSQQSHF+QSGYPPANAPHQVPPQAP GPHVSARNPSHSHLIEKL
Sbjct: 421  ESGRPPHHLPQQPHHPSQQSHFSQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKL 480

Query: 1062 VGMGFRGDHVASIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQRAW 1104
            VGMGFRGDHVASIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQRAW
Sbjct: 481  VGMGFRGDHVASIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQRAW 525

BLAST of Cp4.1LG15g08100 vs. ExPASy TrEMBL
Match: A0A6J1GLG8 (trithorax group protein osa-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111455041 PE=4 SV=1)

HSP 1 Score: 980 bits (2533), Expect = 0.0
Identity = 508/512 (99.22%), Postives = 509/512 (99.41%), Query Frame = 0

Query: 59  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRVSTV 118
           MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSR+STV
Sbjct: 1   MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTV 60

Query: 119 YPAAVYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEM 178
           YPAA YGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEM
Sbjct: 61  YPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEM 120

Query: 179 RSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSSH 238
           RSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQEL ETQKDLAKLHLLQKESSLSSH
Sbjct: 121 RSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELTETQKDLAKLHLLQKESSLSSH 180

Query: 239 SHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQSYY 298
           SHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQ YY
Sbjct: 181 SHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYY 240

Query: 299 IQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWTQQPPQQAQPPQQHPSMQPQIRLPP 358
           IQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWTQQPPQQAQPPQQHPSMQPQIRLPP
Sbjct: 241 IQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWTQQPPQQAQPPQQHPSMQPQIRLPP 300

Query: 359 TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNA 418
           TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNA
Sbjct: 301 TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNA 360

Query: 419 FGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRESGRPPHHPPHHPPQPQQPPHHPPQPQQP 478
           FGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRESGRPPHHPPHHPPQPQQPPHHPPQPQQP
Sbjct: 361 FGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRESGRPPHHPPHHPPQPQQPPHHPPQPQQP 420

Query: 479 PHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLV 538
           PHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLV
Sbjct: 421 PHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLV 480

Query: 539 GMGFRSDHIGSVIHRMEESGQPIDFNAVLDGL 570
           GMGFRSDHIGSVIHRMEESGQPIDFNAVLDGL
Sbjct: 481 GMGFRSDHIGSVIHRMEESGQPIDFNAVLDGL 512

BLAST of Cp4.1LG15g08100 vs. ExPASy TrEMBL
Match: A0A6J1GK45 (trithorax group protein osa-like isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111455041 PE=4 SV=1)

HSP 1 Score: 966 bits (2497), Expect = 0.0
Identity = 504/512 (98.44%), Postives = 505/512 (98.63%), Query Frame = 0

Query: 59  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRVSTV 118
           MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSR+STV
Sbjct: 1   MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTV 60

Query: 119 YPAAVYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEM 178
           YPAA YGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEM
Sbjct: 61  YPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEM 120

Query: 179 RSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSSH 238
           RSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQEL ETQKDLAKLHLLQKESSLSSH
Sbjct: 121 RSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELTETQKDLAKLHLLQKESSLSSH 180

Query: 239 SHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQSYY 298
           SHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQ YY
Sbjct: 181 SHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYY 240

Query: 299 IQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWTQQPPQQAQPPQQHPSMQPQIRLPP 358
           IQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWTQQPPQQAQPPQQHPSMQPQIRLPP
Sbjct: 241 IQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWTQQPPQQAQPPQQHPSMQPQIRLPP 300

Query: 359 TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNA 418
           TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNA
Sbjct: 301 TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNA 360

Query: 419 FGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRESGRPPHHPPHHPPQPQQPPHHPPQPQQP 478
           FGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRESGRPPHHPP    QPQQPPHHPPQPQQP
Sbjct: 361 FGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRESGRPPHHPP----QPQQPPHHPPQPQQP 420

Query: 479 PHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLV 538
           PHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLV
Sbjct: 421 PHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLV 480

Query: 539 GMGFRSDHIGSVIHRMEESGQPIDFNAVLDGL 570
           GMGFRSDHIGSVIHRMEESGQPIDFNAVLDGL
Sbjct: 481 GMGFRSDHIGSVIHRMEESGQPIDFNAVLDGL 508

BLAST of Cp4.1LG15g08100 vs. ExPASy TrEMBL
Match: A0A6J1I1G6 (RNA-binding protein 33-like OS=Cucurbita maxima OX=3661 GN=LOC111468171 PE=4 SV=1)

HSP 1 Score: 933 bits (2411), Expect = 0.0
Identity = 494/512 (96.48%), Postives = 496/512 (96.88%), Query Frame = 0

Query: 59  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRVSTV 118
           MASGS GRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSR+STV
Sbjct: 1   MASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTV 60

Query: 119 YPAAVYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEM 178
           YPAA YGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEM
Sbjct: 61  YPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEM 120

Query: 179 RSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSSH 238
           RSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQEL ETQKDLAKLHLLQKESSLSSH
Sbjct: 121 RSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELVETQKDLAKLHLLQKESSLSSH 180

Query: 239 SHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQSYY 298
           SHSNEERASPGAFDPKKNEIPSKN NQQLALALPHQIVPQQHPPPPAALPENVPQQQ YY
Sbjct: 181 SHSNEERASPGAFDPKKNEIPSKNPNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYY 240

Query: 299 IQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWTQQPPQQAQPPQQHPSMQPQIRLPP 358
           IQHPQSQHQMTNA  QLSQTPPPP QQFSQYQQQWTQQPPQQAQPPQQHPSMQPQIRLPP
Sbjct: 241 IQHPQSQHQMTNA--QLSQTPPPP-QQFSQYQQQWTQQPPQQAQPPQQHPSMQPQIRLPP 300

Query: 359 TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNA 418
           TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSR+DTVQYGYVGSAGTMPQ PPQVKNA
Sbjct: 301 TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRIDTVQYGYVGSAGTMPQHPPQVKNA 360

Query: 419 FGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRESGRPPHHPPHHPPQPQQPPHHPPQPQQP 478
           FGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRESGRPPHHPP    QPQQPPHHPPQPQQP
Sbjct: 361 FGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRESGRPPHHPP----QPQQPPHHPPQPQQP 420

Query: 479 PHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLV 538
           PHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNH N SHFMRNQNHPYGEIVDKLV
Sbjct: 421 PHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHLNHSHFMRNQNHPYGEIVDKLV 480

Query: 539 GMGFRSDHIGSVIHRMEESGQPIDFNAVLDGL 570
           GMGFRSDHI SVIHRMEESGQPIDFNAVLDGL
Sbjct: 481 GMGFRSDHIASVIHRMEESGQPIDFNAVLDGL 505

BLAST of Cp4.1LG15g08100 vs. TAIR 10
Match: AT4G28300.1 (Protein of unknown function (DUF1421) )

HSP 1 Score: 364.0 bits (933), Expect = 4.3e-100
Identity = 266/545 (48.81%), Postives = 326/545 (59.82%), Query Frame = 0

Query: 59  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVA--NSSKDFHKSRV- 118
           MASGS+GR NSGSKGFDFG+DD+LCSY+DY NQ+SSNG HSD ++A  NS+K+FHK+R+ 
Sbjct: 1   MASGSSGRVNSGSKGFDFGSDDILCSYDDYTNQDSSNGPHSDPAIAASNSNKEFHKTRMA 60

Query: 119 -STVYPAAVYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKS 178
            S+V+P + Y  PEDS+ QD+  TVE +MK Y+DN++RFLEG+SSRLSQLEL  YNLDK+
Sbjct: 61  RSSVFPTSSYSPPEDSLSQDITDTVERTMKMYADNMMRFLEGLSSRLSQLELYCYNLDKT 120

Query: 179 VGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESS 238
           +GEMRS+L   HE+AD+KL+SL+KHLQEVHRSVQI+RDKQELA+TQK+LAKL L+QKESS
Sbjct: 121 IGEMRSELTHAHEDADVKLRSLDKHLQEVHRSVQILRDKQELADTQKELAKLQLVQKESS 180

Query: 239 LSSHSHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQ 298
            SSHS   E+R +    +PKK+E  S  HNQQLALALPHQI PQ     P   P+  PQQ
Sbjct: 181 SSSHSQHGEDRVATPVPEPKKSENTSDAHNQQLALALPHQIAPQ-----PQVQPQPQPQQ 240

Query: 299 QSYYIQHPQSQHQMTNAHAQLSQTP-------------PPPP----------QQFSQYQQ 358
             YY+  P +Q Q T A   +S  P             PPPP          Q F QYQQ
Sbjct: 241 HQYYMPPPPTQLQNTPAPVPVSTPPSQLQAPPAQSQFMPPPPAPSHPSSAQTQSFPQYQQ 300

Query: 359 QWTQQPPQQAQPPQQHPSMQPQIRLPPTSVYSSYSMNQP--TSMPETPPMQVSFSPIPQP 418
            W  QP  + Q    +P+  P    PP         NQP   S+P +  MQ  +S  PQ 
Sbjct: 301 NWPPQPQARPQSSGGYPTYSP---APPG--------NQPPVESLPSSMQMQSPYSGPPQQ 360

Query: 419 GSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDR 478
                    YGY   A   PQ PPQ +      PQ G+GYLPSGP     SG A  +Y  
Sbjct: 361 SMQ-----AYGY--GAAPPPQAPPQ-QTKMSYSPQTGDGYLPSGPPP--PSGYANAMY-- 420

Query: 479 ESGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGP 538
           E GR  + PP   PQ QQ   H  Q  Q   + PQP Q      G PP            
Sbjct: 421 EGGRMQYPPPQ--PQQQQQQAHYLQGPQGGGYSPQPHQAGGGNIGAPPV----------- 480

Query: 539 HVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAVLDG 575
                        +R++   YGE+++KLV MGFR DH+ +VI RMEESGQPIDFN +LD 
Sbjct: 481 -------------LRSK---YGELIEKLVSMGFRGDHVMAVIQRMEESGQPIDFNTLLDR 488

BLAST of Cp4.1LG15g08100 vs. TAIR 10
Match: AT4G28300.2 (Protein of unknown function (DUF1421) )

HSP 1 Score: 286.6 bits (732), Expect = 8.6e-77
Identity = 224/484 (46.28%), Postives = 273/484 (56.40%), Query Frame = 0

Query: 116 STVYPAAVYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSV 175
           S+V+P + Y  PEDS+ QD+  TVE +MK Y+DN++RFLEG+SSRLSQLEL  YNLDK++
Sbjct: 4   SSVFPTSSYSPPEDSLSQDITDTVERTMKMYADNMMRFLEGLSSRLSQLELYCYNLDKTI 63

Query: 176 GEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSL 235
           GEMRS+L   HE+AD+KL+SL+KHLQEVHRSVQI+RDKQELA+TQK+LAKL L+QKESS 
Sbjct: 64  GEMRSELTHAHEDADVKLRSLDKHLQEVHRSVQILRDKQELADTQKELAKLQLVQKESSS 123

Query: 236 SSHSHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQ 295
           SSHS   E+R +    +PKK+E  S  HNQQLALALPHQI PQ     P   P+  PQQ 
Sbjct: 124 SSHSQHGEDRVATPVPEPKKSENTSDAHNQQLALALPHQIAPQ-----PQVQPQPQPQQH 183

Query: 296 SYYIQHPQSQHQMTNAHAQLSQTP-------------PPPP----------QQFSQYQQQ 355
            YY+  P +Q Q T A   +S  P             PPPP          Q F QYQQ 
Sbjct: 184 QYYMPPPPTQLQNTPAPVPVSTPPSQLQAPPAQSQFMPPPPAPSHPSSAQTQSFPQYQQN 243

Query: 356 WTQQPPQQAQPPQQHPSMQPQIRLPPTSVYSSYSMNQP--TSMPETPPMQVSFSPIPQPG 415
           W  QP  + Q    +P+  P    PP         NQP   S+P +  MQ  +S  PQ  
Sbjct: 244 WPPQPQARPQSSGGYPTYSP---APPG--------NQPPVESLPSSMQMQSPYSGPPQQS 303

Query: 416 SSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRE 475
                   YGY   A   PQ PPQ +      PQ G+GYLPSGP     SG A  +Y  E
Sbjct: 304 MQ-----AYGY--GAAPPPQAPPQ-QTKMSYSPQTGDGYLPSGPPP--PSGYANAMY--E 363

Query: 476 SGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPH 535
            GR  + PP   PQ QQ   H  Q  Q   + PQP Q      G PP             
Sbjct: 364 GGRMQYPPPQ--PQQQQQQAHYLQGPQGGGYSPQPHQAGGGNIGAPPV------------ 423

Query: 536 VMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAVLDGL 575
                       +R++   YGE+++KLV MGFR DH+ +VI RMEESGQPIDFN +LD L
Sbjct: 424 ------------LRSK---YGELIEKLVSMGFRGDHVMAVIQRMEESGQPIDFNTLLDRL 430

BLAST of Cp4.1LG15g08100 vs. TAIR 10
Match: AT3G01560.1 (Protein of unknown function (DUF1421) )

HSP 1 Score: 105.5 bits (262), Expect = 2.7e-22
Identity = 136/498 (27.31%), Postives = 209/498 (41.97%), Query Frame = 0

Query: 618  NGSHSDPVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVITT--VENSMKKHSDNLL 677
            N S SD   V+ +S + +   + ++ P         ++    I +  ++ +MKKH+D LL
Sbjct: 87   NWSASDYKPVSTTSPNTNFGSLDSIEPSKLVPDKGQNVFNTTIMSEIIDRTMKKHTDTLL 146

Query: 678  RFLEGISSRLSQLELYCYNLDKSVGEMRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIR 737
              +EG+S+RLSQLE   +NL+  V +++  +   H   + K++ ++  + EV   VQ+++
Sbjct: 147  HVMEGVSARLSQLETRTHNLENLVDDLKVSVDNSHGSTDGKMRQLKNILVEVQSGVQLLK 206

Query: 738  DKQELAETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPKKNENPSEIHNQQLALALPHQ 797
            DKQE+ E Q  L+K QV       + H++T+   V  DP   ++P+ +  QQ  L     
Sbjct: 207  DKQEILEAQ--LSKHQVS------NQHAKTHSLHV--DPTA-QSPAPVPMQQFPLT---- 266

Query: 798  IVPQQNPITPPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQP 857
                  P  P S A P   P         SSQLP Q P+     Q  Y    S H    P
Sbjct: 267  ----SFPQPPSSTAAPSQPP---------SSQLPPQLPTQFSSQQEPYCPPPS-HPQPPP 326

Query: 858  QDVSHMTNPQLSQTPQPFNQYQQQWAQPPSQ---PAQPPQQASMQPQIRPPPTSVYPSPY 917
                  +NP   Q PQ    +Q  +  PP Q   P QPP  +   P+ +PP        Y
Sbjct: 327  ------SNPPPYQAPQTQTPHQPSYQSPPQQPQYPQQPPPSSGYNPEEQPP---YQMQSY 386

Query: 918  PPNQPTSMPETLSSSMPMQMSFAYIPQPGSSRADAVPYGYAASSGGSAPQQPPQVKNAYG 977
            PPN P   P   + S P Q  +                        + PQ  P + +  G
Sbjct: 387  PPNPPRQQPP--AGSTPSQQFY------------------------NPPQPQPSMYDGAG 446

Query: 978  PATGEGYMPPGQQPALSSGGAYMMYDRESGRPPHHLPQQPHHPSQQSHFNQSGYPPANAP 1037
              +  G+ P G    LS    Y      S +PPH               N +GYP  +  
Sbjct: 447  GRSNSGF-PSGY---LSEPYTYSGSPMSSAKPPH------------ISSNGTGYPQLSNS 504

Query: 1038 HQVPPQAPAGPHVSARNPSHS----------HLIEKLVGMGFRGDHVASIIQRMEDSGQT 1097
              +P   P    VS+   S S           +I+++  MGF  D V + ++++ ++GQ 
Sbjct: 507  RPLPHALPMVSAVSSGGGSSSPRSESRAPIDDVIDRVTTMGFPRDQVRATVRKLTENGQA 504

Query: 1098 VDFNAVLDRLSTPAGPGP 1101
            VD N VLD+L    G  P
Sbjct: 567  VDLNVVLDKLMNEGGAPP 504

BLAST of Cp4.1LG15g08100 vs. TAIR 10
Match: AT5G14540.1 (Protein of unknown function (DUF1421) )

HSP 1 Score: 92.0 bits (227), Expect = 3.1e-18
Identity = 136/491 (27.70%), Postives = 211/491 (42.97%), Query Frame = 0

Query: 99  SDLSVANSSKDFHKSRVSTVYPAAVYGQPE-DSMKQDVISTVENSMKKYSDNILRFLEGI 158
           SD    ++S       + ++ P+ ++ + + +S +  +IS ++ +MK ++D +L  +EG+
Sbjct: 89  SDPKPVSASSARSYGSMDSLEPSKLFAEKDRNSPESAIISAIDRTMKAHADKLLHVMEGV 148

Query: 159 SSRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELA 218
           S+RL+QLE    +L+  V +++  +   H + D KL+ LE  + EV   VQ+++DKQE+ 
Sbjct: 149 SARLTQLETRTRDLENLVDDVKVSVGNSHGKTDGKLRQLENIMLEVQNGVQLLKDKQEIV 208

Query: 219 ETQKDLAKLHLLQKESSLSSHSHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVP 278
           E Q  L+KL L +      +HS   E  A P A  P+                      P
Sbjct: 209 EAQLQLSKLQLSKVNQQPETHSTHVEPTAQPPASLPQP---------------------P 268

Query: 279 QQHPPPPAALPENVPQQQSYYIQHPQSQHQMTNAHAQLSQTPPP-PPQQFSQYQQQWTQQ 338
                PP+   + +P QQ  +IQ P SQH ++    QL Q P    PQQ   +      Q
Sbjct: 269 ASAAAPPSLTQQGLPPQQ--FIQPPASQHGLSPPSLQLPQLPNQFSPQQEPYFPPSGQSQ 328

Query: 339 PPQQAQPPQQHPSMQPQIRLPPTSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDT 398
           PP   QPP Q P        PPT      S++QP   P  PP Q  +   P P       
Sbjct: 329 PPPTIQPPYQPP--------PPTQ-----SLHQPPYQP--PPQQPQYPQQPPP------Q 388

Query: 399 VQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAY--------MIYD 458
           +Q+     +G  P++PP  + ++   P       PS P  G +    Y         +YD
Sbjct: 389 LQH----PSGYNPEEPPYPQQSYPPNPPRQP---PSHPPPGSAPSQQYYNAPPTPPSMYD 448

Query: 459 RESGR-----PPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYP--PANVQ 518
              GR     P  + P   P    P  +   P   P H     Q       YP  P    
Sbjct: 449 GPGGRSNSGFPSGYSPESYPYTGPPSQYGNTPSVKPTH-----QSGSGSGAYPQLPMARP 508

Query: 519 IPQH-PSGPHVMARNHPNQSHFMRNQNH-PYGEIVDKLVGMGFRSDHIGSVIHRMEESGQ 571
           +PQ  P    + +      S   R+ N  P  +++DK+V MGF  D +   +  + E+GQ
Sbjct: 509 LPQGLPMASAISSGGSGGGSDSPRSGNRAPVDDVIDKVVSMGFPRDQVRGTVRTLTENGQ 523

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
KAG7011974.10.095.68hypothetical protein SDJN02_26882, partial [Cucurbita argyrosperma subsp. argyro... [more]
KAG6589219.10.076.34hypothetical protein SDJN03_17784, partial [Cucurbita argyrosperma subsp. sorori... [more]
KAG7022919.10.075.86hypothetical protein SDJN02_16655, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_023554446.10.0100.00trithorax group protein osa-like [Cucurbita pepo subsp. pepo][more]
XP_022969058.10.098.09ataxin-2 homolog [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1HZW10.098.09ataxin-2 homolog OS=Cucurbita maxima OX=3661 GN=LOC111468169 PE=4 SV=1[more]
A0A6J1GLD50.098.10class E vacuolar protein-sorting machinery protein hse1-like OS=Cucurbita moscha... [more]
A0A6J1GLG80.099.22trithorax group protein osa-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC... [more]
A0A6J1GK450.098.44trithorax group protein osa-like isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC... [more]
A0A6J1I1G60.096.48RNA-binding protein 33-like OS=Cucurbita maxima OX=3661 GN=LOC111468171 PE=4 SV=... [more]
Match NameE-valueIdentityDescription
AT4G28300.14.3e-10048.81Protein of unknown function (DUF1421) [more]
AT4G28300.28.6e-7746.28Protein of unknown function (DUF1421) [more]
AT3G01560.12.7e-2227.31Protein of unknown function (DUF1421) [more]
AT5G14540.13.1e-1827.70Protein of unknown function (DUF1421) [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 179..206
NoneNo IPR availableCOILSCoilCoilcoord: 703..730
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 327..377
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 245..259
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 884..918
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 300..377
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..21
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 458..490
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 953..1050
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 405..526
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 300..315
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1012..1029
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 810..883
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 232..260
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 798..926
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 749..780
NoneNo IPR availablePANTHERPTHR31805RECEPTOR-LIKE KINASE, PUTATIVE (DUF1421)-RELATEDcoord: 59..574
NoneNo IPR availablePANTHERPTHR31805:SF16FORMIN-LIKE PROTEIN (DUF1421)coord: 59..574
NoneNo IPR availablePANTHERPTHR31805RECEPTOR-LIKE KINASE, PUTATIVE (DUF1421)-RELATEDcoord: 582..1104
NoneNo IPR availablePANTHERPTHR31805:SF16FORMIN-LIKE PROTEIN (DUF1421)coord: 582..1104
IPR010820UBA-like domain DUF1421PFAMPF07223DUF1421coord: 1055..1093
e-value: 1.3E-19
score: 69.7
coord: 530..573
e-value: 1.7E-19
score: 69.4

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG15g08100.1Cp4.1LG15g08100.1mRNA