Cp4.1LG01g08680 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG01g08680
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Plus3 domain-containing protein
Location: Cp4.1LG01: 4820522 .. 4830662 (-)
RNA-Seq Expression: Cp4.1LG01g08680
Synteny: Cp4.1LG01g08680
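
The Location field above can be turned into a span length with a few lines of Python. This is only a sketch: it assumes the coordinates are 1-based and inclusive (the record does not state this), and the helper names `parse_location` and `span_length` are made up for illustration.

```python
import re


def parse_location(text):
    """Split 'SEQID: START .. END (STRAND)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG01: 4820522 .. 4830662 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 10141 bases on the minus strand of Cp4.1LG01.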
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTTTTTTTTTTTTTTTTTTTTGTCGAGTTAAAAATTTGCCGTCAAAACCCTCTAAAGGAGAAGTCAGATTTTAAGGCTTTTAATCATTGCTTTGCAAAATGGGTTAGAGTAATCATCATTTCATAAAAGGGTCCTTCCTGATCCTCAGTCCTCACTGTGAAACTGTTCAGTGATTTCAAGCTTCGATTTACGAAAATGGTAAAGAAGAAATGCACAAGGAAAGAAGAAATCGGAGAGGATTTTTGCTTCATTTGTAAAGATGGAGGACTCCTCAGATTTTGCGACTTCAAGTGAGTTCCATTACCATTGAGAATGCCCATTCGTTGTTTTATTCATGCAAGGATTTTCTTAGGAGAAATGGTTATGCTAGGAGAAGCACTGAGATGTTCATGTTTAGATAGGAAGTTCATGCATGTAAAATCGGACGAATAGCGAAATGAAGTGGAAAATGGAAATGAACAAAAGTTCTGCTGGTCTGTTTGCTTGTTTGTTTGTTCTTTTTCTCCTTTTTCCTCACAATCGTGTTACAATTGGGTTCTTAATAGTTTAATTTATTTTTCCTTGACTCACCAGTGTTCACTTGTTTTCACATTGTAATTTTCAGGGAAAGTTTCTCCTAAAATTATTTTGAGAAAAGTACACACTTCGAAGCTTTTAGTATGACGATCAGGGTTATTTATTTAGTATGATAGTAAAAATTTATAGAATTCTCTACCGAGTTCAGCATGTTAATCCACATTATGTCCGGAGACGTTTCATTGATTCAAGGAAAGTTGTTTCTATCACTCTCTCTCTCTCTCCCTCTAAAAAAAAGAAAAAAAAAAAAAACCAAAAGATTTTATTATAGAGTCTGTCAGACCGCCATGCGACCATCCCAGCAGGATACCCTACCGAGAACGATCCTCCACAGCCGCAGGACTAATGTCCGTTGAGCCGTTAATCACGCTCTAATCGCAGGGCAGTGGTTCATCAACCAGACTTTAGTTCTTGTCTCCTTAGAACTCAAGGACACAGTTGTACAACCTATAGAACAGATCCTGTTCAAGTAAGGTCGTGCCCAATCCCTTGCATTGCTCCATATAACTACTCCAACTCTGCAGTCNAAAAAAAAAACGCACATGTATCAGTTTATGAGTATTCATGAGTGAACTTGTATGATTTGAAAACATTTTTTCTTCGTATATCGGTAAGGATTTTGTATGCATGTACTGTAGATGTGATGCATTTGGACTCTGTTGTTAAAATTTAAAATTACTATAAGATTCTGTGAAGATGCGATGGAGTAGATTGCTTCTTCAGTCTATTTTTGTAGTTATCAAAAGGGATTTGGGACACTCTTATGTATCTATCCTTCTTTGTCCAGATAAATATTTCTTCGATCATGGATTTTCTTTCTGTATAATGATTTCAGAGATTGCCTTAAAGCGTATCACCCTGAATGTGTTGGAAGGGAAGATTCTTCTGTGGAGTCTGAAGATCGTTGGACCTGTGGTAAGATGCTGAAAGCAATTATGATTTTAAGAGTGATCTATGTGTCATGTATGGATTTTAGTCTGTTTTAATTAAACAGTCGCTAGTTGCTTATTTAATGATATCTTCTTAATACGGGTTACAACTTTTGATAAGTACGAGAATACAAAGTTTGTCATCTGCTATTGTCTTATGAGTTCCAAGCCTCCCCCATTTTTGGTTTCACGACTGCCGGGAGAAAGAGCTTGGTATGATGTAGTCCCTGGACATGTTTATTATGGTGTCTGTCATTGACTACTTCAAATTTATGAAATTAACATATCTATAACATATGGATGCAAGTGAAGCATGAGTCATATCGTGCTATCTGCTGTGAATATTTCTCTGACTCTTGTACTATGCCATCTTTCTAATTTTGGAAAAGATTTATTAACGATTATGTGTTATTCCACAAACCTACTTGATGCCTTCCCCGTATATGGTATCCCATGTCTTCAACATAGTACATACACTCGGAGGCCTCAATAAGA
CCAGTTTATATAACTGGGAAGTGTAAGCTCACTTTTTGTATGTTGCTTTTTTTGATATCTAGTAGTTTTTTTTTGTCTGCCTGTGTCATTTGTATTTTTTCAGTTTCCTCAAATCATGATATATATTATGTATCAGACTGGCATTCATGCTTCCTCTGCCATAAAACCTCAAAGTTCCGTTGTGTTTGCTGCCCACAAGCTGTATGTGGACGCTGCATTTTTAATGCTGAGTTTGTGCGTGTTCGAGGCTGGAGAGGATTCTGCAATCACTGTTTGCAGCTCACATTACTTATAGAAGACGGTAAAGATGTTGACATTGATGGGGTATTCTCACTTGTCCTTTTTATGATTGTTTGATCCTACTCCTTTGCACAAGTGGAATATCCATGAACAAACCCCTCTCCTTTTTGGTTTTGTCCTTCCGTTTTCTTCATTCATTCCAACAAACAAGTCAAATTATTTCATGAGGAACCATACTGTTAAGCGCATGCATTCGGCCAAATAATCTTACCTAATCTGGTAAGGAAAAAAATGTTTCAAGACAATTTATCAAAAATTTCAACCAATTAAATTATTCTTAAATTTATTGATGGAAATATGTAAATTCGGCTCTTTGTTGAGATATTTGGTTTGGTGAATCAAAGATTATTGCTTCCTTTTGATTTCGTCTTACCTCTATGCTTGCTAATTTGCAGACAAAGGTTGACTTCAATGATCGTGAGACTTATGAATTTCTATTCAAGGAATACTGGGAACTGATGAAGAAAAAACAGGGTTTGACAGCAGAACTTGTTCATATGGCAAGTAACTTATTGAAGAAGGGAAGGAATTTTAGAAATGAAATTGAGGAATCAGAAGAAGACACTGATGAATATGAAATTCCATCAGACTATGAAGAGTTGGTGGATACAGAAGAAGGGCACAAACTAGTAAGAAAATGCAAAAGAAGCAAGGAGAAGCTATGCACTACGAGAAAAAAAATGAAGTCAAGTGACCAAAAGTTCATTGGATGGGGGTCAAAACCAGTTATAGAATTTCTTTCAACAATTGGCAAGGATACAAGAAAAAAGTTGTCACAGCATGATGTGACTTCTATAATTACAACCTATTGTAAAGAAAACAAGCTTTTCCATCCGCTGAAAAAGAAGAAGATTATTTGTGATGCTAAGCTACAAGCTGTTTTTGGAAGGAAATCAATGAATGTGAATACTGTACACAAGCACCTAACTGCTCATTTTGCTGAAAACATGGAGCAATCATCTGATGATGAGAGCACAAGTAGTATTGAAGAGAAGGATGATAATTCTTCTATGGCTTGTAACAAGCCAAGGAAGTTGATCTCAGATAGAAAACCTGCCGAACTGGAACTGTCAGATGTGTCACATACTTGTTCTGCTGCTATCATTTCAGCAAACATCAAACTGGTCTATCTGAAAAGAAGTTTAGTAGAGAGGCTTCTCGAGAATCCTGAATGTTTTGAAGGAAAAATGATTGGAAGTTTTATAAGAGTTAAATCCGATCCCAACGACTATTCACAGAAAAATTCTTACCAGTTGCTACAAGTTACAGGTAATTAACTATCTATGTTGCTTAGATTAGTGATTGCATTTTAGAAATATGTAAGCTCAATATTTGAAACATTTAAATATTTTGTCCAAGTTATCTGCAGGAGAAGCTACTTCTATGGTATGCTGCCTTTTTTCATGGTGTTTTCTTTTATTGTTTTTCTTAGCAAGAAACTAAGCTTTTCATTGAAATAATGAATAATGAAAAGTGGCTAAGACCCAAAAATAAAAACTCCACACGGGAGTGAGAATAAAAATAGTATAAAAATAAAAATAAAAACAAAGAAGACGAAATATAGCAAAAATATTCACAAAATAATAATAATAATAATGNTATAAAATAATCAGCATAGCATTTAGAAAGGTAGCACCATAATGAAGCTTTAAGACGAGCTGATTCAAAATAGTCAAAAGAGGAGAGACGCTTGTTTTGAA
AGATCCTCTGATTTCTTTCAAACCAAAGCTCCAAAAGAAAGACTTTCACTAAATTGCACCGAAGAGTCTTGGATTTGGCTGACAAAGGAGAACCAAGAAGCAACTGCAACACAATGTAGGCTGAAAACTGAATACAAACTAGCCCAAACCTACCGAGAATATCAGCAATTAAAGACCTTGTGTTGAAGAGAGTCCTCTGCTGCACAGCAAAAGCATACAAATAGATGGAGAAATGCAATCAAAAGAAGTGTCCGTTGGAGAACCTCTGAACAGTTTAAAGATCCGGTAAGCATAATCCAAATTAAAATATTAATCCCCTTGGGTCTTTTGCAATTCCATAAAGCAGAAAATAAATCCTTGTTCAAAGGAAAAGTCAAGGCAAGATGTCGAGTTAACAAATTAAGTGAAAACAAGCCCAAAGATTGAAGTGATCGCTTCCTTGAGTCACTTAAAAGCCCAACCTTAACTATAGAGACATTAGCCAACAGAAACTGGAGATCAGCAATGTACAAAGCTAACGATTTATTTTGTTCATTTAAAATTTTTCTCATATTTCTCTATTCACTGTATTCAAAATTATTGACTTGCGTTTTTAATCTCTAATATGTAAGCAAGGGAGTTTTGATATCCAAGTAGGTTGGACCGTAGTAGTTTTGTCTCATTCACCTCTCTTTCAAAGGTCATTTCATCTATCTTTTCTGAACCACAGCGTGCACACGAGTCCTTGCATAATTTAGTGAGTTAAGACACAATATAGATTTAGCAATTTTTTAATATATGAAAGAGATTGTTTAAAGAGAGAACAATTAAACATAGAACACTATTAAACAAATGAGGATTTTGTTTAAAGAGATAGGCATCCCTCACCCTAGACCTCTTAAAAAAAAATCACGAGCTTGAAGCCCCATGTGTCACAATCCTACTATCTTGGAGAGTGTGGTTCCATGATTCCAGTCTCAACCATGGTCGGTTGTAAGGCTTTTCTTGCCTTGCCACTCTTTTCAAGATATTCAAATCTGTGAGACTAAAAGGCAAGGTTTGTTCTTTTTTAAATTGAGCCTGTTGATCTTGCAGGGTTGCAGGCATCTTGAACATTTACCTAACTAACCACATGCACGAACATTCCCTTTGCTTTTTAGTAGCTTAGCCTTGAGCTCGTGTTCACAACTCACATGCAACTTGAATGACCATGCAAATGGCTTATATAGATTGGCGAGGACTTGAGCTTGAGAGGAACAGGGAGGGAATGAGATGAAAGAGAATCTGACAGAAAGAGGTGTAGAAAAGGAAAAATGTAAGAGAGAGAGAGAGCGAGGTGGAGGTGGTGGTAAAAAGTGATGGTAAAAAGTGATGGAAAAAGAGGTAGAGTATATGATTATAAACAGAAGCAACAGCCATCTGTTGGAAACATTAGTGCTTAGTTTATTGTTTGAATTTTTTAATCTAATTTCTTCTATGGTGAAATGAATTTGGTTAAATTATAGTATAGATTGTCCTTTCGTGCATCAGTTACTGTTCATGTTATTTGCTCTGCATACTAAAGTTGGCTCATCGTGGAACTTATCATTGAATTTTCAGACAATGACTAGTGAACAAAATTTGATAAATCAAGCTATGGTGTCGGATTTATTTATTTGTTCAGTTTGAAGGCACTAAGTGTAGCATTTTAAGACGAAAGTCCAGCTTTATTTATGTTATTCTTTTTTTTCCCTACTGGAAAAGAACTTAATCTTTACTTTTGGCGAGAATAACAAAATGTTCTTTTCAAAATTATTCCTAAGCACCTTATCTACCTTTAGAACTGTGTTTGCCGTACCTTTCTCTTGCTGAAACTGATTTATCTTTTGTAATTACATATTAAAGTGCTGCAGGTGATTTGTAAATTACTGTGCTTTGTTCTAAACCTTTTTCTGCACATCACAGGCATAATGATAGATTCAAGCAATACTGGGAAGCAGGAAATTCTCCTGCAAGTTACCTATCGACTAGACTATATACCA
ATCTACAATTTATCAGACGATGACTTCTGTGAGGTAGAAAATACATCTTTATATTAATTTCTATCGTCTGCTAATAAAATTCATATCCCACCGTCGGGAGAAAATTTTCAGTTCCCACTATGTTTTAATTATAAGATAATTGTGACATATAGAGTTTTTCAAGGATCTTTTAAATTTTAGCACTGTCTCTAATATCTTGTTTCGGGACAGGGAACTCTTGAAGTTGAGTTGCTCCTTGTGATGAAGATCTGGATTTATAATCCTTAGATAAGAAGGGAAAGACCTTCTCCTAATTTTAGCAAACCTCATGGTTTTTACTCTGAAGTTTTTCTAGATAAATTGCAAGTTTTCTTTAATTTTGTGCCACAGATGGTGATAAATTGTAAATCAGAATCTAAAACAATCGAATGCTGATTAGCATGCCTAGGTAAAGTTACGATCTGTTGAGTTGGGTGCATCACGGAATTGAACAGTTATACTGGAGTTGCATGAGAAGGATTTTCTCTGCCGGTAGAAACTTCCTTCTTTGTAACGTTCTGATCTCAAAAGAAGCCTCCGATGTTCTCTTAGGGATCCACCCTACTCATATTGAAATATAGATATTCTGGGGAAATTTTGACCAAGTGTCTAGTATTGCAGGAAGAATGTGAGGATTTGCGCCAGAGAATGAAAAATGGCCTGCTGAAGAATCCCACTGTGGTAAGTCTGTAGAATTTTGAGGCATCCTTGAACTTGTAGAGAATTATTTAACTGAAAAACTTTTGTGGGGATTTCTGTTCTTGAGAGAGTATATTCTCGGTTGAGCAAGTCACTGTTTTCATGGACTATTATAGTAGTAGCTTCGTGAACGTTCTTGCCAGGTGACTGACAGTGGCTTTCTGTAAATTTATGAAACAGATTGATCTAGAAAACGCCTGACACTTAGTTTTACCTAATTCTATTTGTTCGTCTTTTTTACAATGGTGAGCATAACAATGGTATAATGATCAAAGCCCGTAAGTAGTTGTGAATATGCATGTCTCTCTTAATTTCATGCATGTCTTATAATCTATTCTTCCTATAATGCCAAAGACTATATCTTTTTTGAAGAGTTTCTTTCCTTGGTATTGCTACGGTCAGAGGCATAATTCCAAATGTACAGTTTTACTCATCCTTCAATATAATTCTTTTTTTTTCCTCAGATGGAGCTTTACGAGAAGGCAAAAAGTCTGCATGAGGATATCACGAAACATGTAGGGTAATCTGCGATTAAATTTAAAGCAAATAATTGTTTTCAACTCTAAATACATTCAACTGGTTTATATTTCTTTCACTGACAGTGGATTACAGAGGAGCTGGCTAGATTGCAAACGTGTATTGATCATGCAAATGAGAAGGGATGGAGAAGAGAATATCCTATCTGATTCACTTTATATGATATATGTCGTTTTGGGGTGGCTAAAATTAAGACAATGACTTATTTTGGAAGACATAAAATTTAAATAGACTTTCTAATTGGTGCCATTTCATTTTATTATGGTTCCACCTTTACAATAAGGGTAACGATCAACAATAATTTAGTTTAAGCTTTTCTCATTGGACTTGTGTACTCATTGTATACCAAGTCGTCATTTAACATATGAATACTTACAAGTAGACAGAAGAAAGATAGATAGCGTACAATTGGGATGAGGGGATAATTGTTACAACCTCTAGACAGGACAGGACATCTAACATTATGTATCCGTGAGTTGGAATCACTTAGGATATTTGATACTAACAATTATATCCTTGTGTATTGAGTATACCTTTTCTTCAACTTCTACAGTTCCAGACATTCTTCTATATACTTTTAGGCTATAAAGTTGTGTTTCTTAGTTAAGATGATAACAATAACAACAATAATAATACTGTAATATGGAAATATACAGACATGGTTTTGCCTATCCATGTACTTCATTTTTCAGTTTGGGTCCATTTTATTCTGTTGGGGAGTTTCATGTGGATCTGACCAAGCCGGC
ATTTTTTCTTTCTAAACAAGAATAAAATAAGTTTCCATAACCAGTGCTTTGTGTGTACATTGTTCGAGTACATGGAAAAAAGACTTCTACTTCAAAAGTCATCAGAACAAGCACGGTTGATCCATGAACTGCCAGAAGTGATAGCAGATATTCTTGAACCTACATTTGATGATTTACTAAAGCAGAATGAACAAGAAAATCACATGTTGGTTGATGGGAGCGACGTCAGAAAAGTTGCCACAGGTTAGTGATCTTTCTCATAGTAATGATATTTTTTTGGTCAGGCTTGCTCTTTGCTTATTATTTTTTGAATAATGGATGCTCAGATGTTCACTTTTAACTTTGAAGAAGTGCTCCCTTTTGGATGTCTGTTCTATCCTAGTTGCTGCATTGAATTTTAGAGCCATTGATCCGTTGAATTTAGTTGGCCCTGTATCTTGTTCCATTGGATTTATGAACCAGACCAGAGCATATTGTTTTCTGCCACCCAAGTTGGACGTGCTGTTTCTATCCTAAGTGCTGCCTATGTTGGTTATTTCTTTGTTTATTGTTTCATTTGTATGTTTTTTTTTCAACCCCNTTAATTAATGAAAAGGTTTATAAATGTTTTACAATTCAAACCCAGAAGGGAAGAAAATAGCAGTAAAATAATCGAAAGACCTAGAGGTATGAATAAACATGTCCCAATTTGAACAAATGTCGCTAGAACAATAATGTTCCAATCAATTGGAAAACTAACACTATTGAAAAGTTTTGACATTCTATCAAAGCTTGCTCTTATCTTAAAAGTACTTCGATTCCACTCGAACCATAGTTAAAAAAAAAAAAACTTTGTAGGTAGTTCAAATCCTTTGAACTGGTAGATTGGAATTTGAAGATTTAATTTCTCTTGATAGGTTACTAACTCTGACTGGTAGAATACAAGGAGTTATATCAGTGCTTTCTTTTACTGTTGGGTTACTGTTGAAGGGCTCAGCCGCTGGTAGTTTTAGTTTGCCAATTGAGAGTTGGTTGGCTTTGATCTCATGTGAGCCACCTGGGATAGAGAAAGATTCTTATTCAATCGAATAATTTGATAAAAGAAACTTATCAACTACGACCATGTGAGTTACAAGGGTGTGCTTTAAAAGAGGTGACCAATCTCTATGGGTTGCGAGAATTGCATCCTATGGGGGTTGACTCGTTTAGCTAATTTTAAGTAAAATGGAATGAGACCTAAAGTTTAGGGACAAAATAGTAGTTTTAACCTGAGTCTCACATATACACATTTCCAAGGTTAGGCCATGTTAAATGATTGTCCATGAATAACAGAAGGCAGGCCTCAAATGCAACTTGGATCATGTTCTAATATTTTTTGAGATTCACTTATATAGAAAATTGATTCGAGTAGATACAAAAGGAGTGCGCAGCGTGATTGAGAACAAAAGTAGTTGCCCTTTTGTTTCAAATGCGTATTTACCATCTATATTATAATTAGCATCAAAGATCAATATTTTGACAAACTATGTTTTTATTTTCTGACTTGATCCTCAGCTGCCATGGTAGAAGAATGTTTGATTGGCATGCAAACTATTTCAGAGAAGCAGCAACACTTCGAGGTATCTACTTGTAAAGATTTTGCTCAAAAATCTTATATCTCAGCTGTAGAATTTCAAACTCATGAAGAGCAGCATCAACCTCTTCTGCCAAAGGAGAAAGCATGTAAAGGTTTTGCTACAAAATCATGCATCCCAGCAGCAGAATTTCAACCTCATAGAGAGCAGCATCAATCCATTCTACCAAAGAAACATGCATATTCCAAACCATTGCTTTCCAGCATCAAAAGGCAAAGTGAGTATATCAATATTCAGAAATCAAAGTTCAAAAGCAAAAGGGCTTCTGAAGTAGAGTTGATTGAGCTAAGCGATAACGAGGATTTGAAAGCTGAAGATAAAATGCAGACTTCAGAGAATCCAAATTTCTCCCTGTGGTATTGTGCAAGTCCTCAAGGGGAGACAAGG
GGACCCTTGCCACTGTCATTACTGAAGCAATGGAGGGACCGAAGCTCATTTGAGTTGAAATGTAAAGTTTGGAAGAATGGTCAGAGCTCGCAAGAGGGCATCCCTTTGAGCGATGCCATTAGGTTGTTTTTCCCTGAATAG

mRNA sequence

TTTTTTTTTTTTTTTTTTTTTGTCGAGTTAAAAATTTGCCGTCAAAACCCTCTAAAGGAGAAGTCAGATTTTAAGGCTTTTAATCATTGCTTTGCAAAATGGGTTAGAGTAATCATCATTTCATAAAAGGGTCCTTCCTGATCCTCAGTCCTCACTGTGAAACTGTTCAGTGATTTCAAGCTTCGATTTACGAAAATGGTAAAGAAGAAATGCACAAGGAAAGAAGAAATCGGAGAGGATTTTTGCTTCATTTGTAAAGATGGAGGACTCCTCAGATTTTGCGACTTCAAAGATTGCCTTAAAGCGTATCACCCTGAATGTGTTGGAAGGGAAGATTCTTCTGTGGAGTCTGAAGATCGTTGGACCTGTGACTGGCATTCATGCTTCCTCTGCCATAAAACCTCAAAGTTCCGTTGTGTTTGCTGCCCACAAGCTGTATGTGGACGCTGCATTTTTAATGCTGAGTTTGTGCGTGTTCGAGGCTGGAGAGGATTCTGCAATCACTGTTTGCAGCTCACATTACTTATAGAAGACGGTAAAGATGTTGACATTGATGGGACAAAGGTTGACTTCAATGATCGTGAGACTTATGAATTTCTATTCAAGGAATACTGGGAACTGATGAAGAAAAAACAGGGTTTGACAGCAGAACTTGTTCATATGGCAAGTAACTTATTGAAGAAGGGAAGGAATTTTAGAAATGAAATTGAGGAATCAGAAGAAGACACTGATGAATATGAAATTCCATCAGACTATGAAGAGTTGGTGGATACAGAAGAAGGGCACAAACTAGTAAGAAAATGCAAAAGAAGCAAGGAGAAGCTATGCACTACGAGAAAAAAAATGAAGTCAAGTGACCAAAAGTTCATTGGATGGGGGTCAAAACCAGTTATAGAATTTCTTTCAACAATTGGCAAGGATACAAGAAAAAAGTTGTCACAGCATGATGTGACTTCTATAATTACAACCTATTGTAAAGAAAACAAGCTTTTCCATCCGCTGAAAAAGAAGAAGATTATTTGTGATGCTAAGCTACAAGCTGTTTTTGGAAGGAAATCAATGAATGTGAATACTGTACACAAGCACCTAACTGCTCATTTTGCTGAAAACATGGAGCAATCATCTGATGATGAGAGCACAAGTAGTATTGAAGAGAAGGATGATAATTCTTCTATGGCTTGTAACAAGCCAAGGAAGTTGATCTCAGATAGAAAACCTGCCGAACTGGAACTGTCAGATGTGTCACATACTTGTTCTGCTGCTATCATTTCAGCAAACATCAAACTGGTCTATCTGAAAAGAAGTTTAGTAGAGAGGCTTCTCGAGAATCCTGAATGTTTTGAAGGAAAAATGATTGGAAGTTTTATAAGAGTTAAATCCGATCCCAACGACTATTCACAGAAAAATTCTTACCAGTTGCTACAAGTTACAGGCATAATGATAGATTCAAGCAATACTGGGAAGCAGGAAATTCTCCTGCAAGTTACCTATCGACTAGACTATATACCAATCTACAATTTATCAGACGATGACTTCTGTGAGGAAGAATGTGAGGATTTGCGCCAGAGAATGAAAAATGGCCTGCTGAAGAATCCCACTGTGATGGAGCTTTACGAGAAGGCAAAAAGTCTGCATGAGGATATCACGAAACATTGGATTACAGAGGAGCTGGCTAGATTGCAAACGTGTATTGATCATGCAAATGAGAAGGGATGGAGAAGAGAATTGTTCGAGTACATGGAAAAAAGACTTCTACTTCAAAAGTCATCAGAACAAGCACGGTTGATCCATGAACTGCCAGAAGTGATAGCAGATATTCTTGAACCTACATTTGATGATTTACTAAAGCAGAATGAACAAGAAAATCACATGTTGGTTGATGGGAGCGACGTCAGAAAAGTTGCCACAGCTGCCATGGTAGAAGAATGTTTGATTGGCATGCAAACTATTTCAGAGAAGCAGCAACACTTCGAGGTATCTACTTGTAAAGATTTTGCTCA
AAAATCTTATATCTCAGCTGTAGAATTTCAAACTCATGAAGAGCAGCATCAACCTCTTCTGCCAAAGGAGAAAGCATGTAAAGGTTTTGCTACAAAATCATGCATCCCAGCAGCAGAATTTCAACCTCATAGAGAGCAGCATCAATCCATTCTACCAAAGAAACATGCATATTCCAAACCATTGCTTTCCAGCATCAAAAGGCAAAGTGAGTATATCAATATTCAGAAATCAAAGTTCAAAAGCAAAAGGGCTTCTGAAGTAGAGTTGATTGAGCTAAGCGATAACGAGGATTTGAAAGCTGAAGATAAAATGCAGACTTCAGAGAATCCAAATTTCTCCCTGTGGTATTGTGCAAGTCCTCAAGGGGAGACAAGGGGACCCTTGCCACTGTCATTACTGAAGCAATGGAGGGACCGAAGCTCATTTGAGTTGAAATGTAAAGTTTGGAAGAATGGTCAGAGCTCGCAAGAGGGCATCCCTTTGAGCGATGCCATTAGGTTGTTTTTCCCTGAATAG

Coding sequence (CDS)

ATGGTAAAGAAGAAATGCACAAGGAAAGAAGAAATCGGAGAGGATTTTTGCTTCATTTGTAAAGATGGAGGACTCCTCAGATTTTGCGACTTCAAAGATTGCCTTAAAGCGTATCACCCTGAATGTGTTGGAAGGGAAGATTCTTCTGTGGAGTCTGAAGATCGTTGGACCTGTGACTGGCATTCATGCTTCCTCTGCCATAAAACCTCAAAGTTCCGTTGTGTTTGCTGCCCACAAGCTGTATGTGGACGCTGCATTTTTAATGCTGAGTTTGTGCGTGTTCGAGGCTGGAGAGGATTCTGCAATCACTGTTTGCAGCTCACATTACTTATAGAAGACGGTAAAGATGTTGACATTGATGGGACAAAGGTTGACTTCAATGATCGTGAGACTTATGAATTTCTATTCAAGGAATACTGGGAACTGATGAAGAAAAAACAGGGTTTGACAGCAGAACTTGTTCATATGGCAAGTAACTTATTGAAGAAGGGAAGGAATTTTAGAAATGAAATTGAGGAATCAGAAGAAGACACTGATGAATATGAAATTCCATCAGACTATGAAGAGTTGGTGGATACAGAAGAAGGGCACAAACTAGTAAGAAAATGCAAAAGAAGCAAGGAGAAGCTATGCACTACGAGAAAAAAAATGAAGTCAAGTGACCAAAAGTTCATTGGATGGGGGTCAAAACCAGTTATAGAATTTCTTTCAACAATTGGCAAGGATACAAGAAAAAAGTTGTCACAGCATGATGTGACTTCTATAATTACAACCTATTGTAAAGAAAACAAGCTTTTCCATCCGCTGAAAAAGAAGAAGATTATTTGTGATGCTAAGCTACAAGCTGTTTTTGGAAGGAAATCAATGAATGTGAATACTGTACACAAGCACCTAACTGCTCATTTTGCTGAAAACATGGAGCAATCATCTGATGATGAGAGCACAAGTAGTATTGAAGAGAAGGATGATAATTCTTCTATGGCTTGTAACAAGCCAAGGAAGTTGATCTCAGATAGAAAACCTGCCGAACTGGAACTGTCAGATGTGTCACATACTTGTTCTGCTGCTATCATTTCAGCAAACATCAAACTGGTCTATCTGAAAAGAAGTTTAGTAGAGAGGCTTCTCGAGAATCCTGAATGTTTTGAAGGAAAAATGATTGGAAGTTTTATAAGAGTTAAATCCGATCCCAACGACTATTCACAGAAAAATTCTTACCAGTTGCTACAAGTTACAGGCATAATGATAGATTCAAGCAATACTGGGAAGCAGGAAATTCTCCTGCAAGTTACCTATCGACTAGACTATATACCAATCTACAATTTATCAGACGATGACTTCTGTGAGGAAGAATGTGAGGATTTGCGCCAGAGAATGAAAAATGGCCTGCTGAAGAATCCCACTGTGATGGAGCTTTACGAGAAGGCAAAAAGTCTGCATGAGGATATCACGAAACATTGGATTACAGAGGAGCTGGCTAGATTGCAAACGTGTATTGATCATGCAAATGAGAAGGGATGGAGAAGAGAATTGTTCGAGTACATGGAAAAAAGACTTCTACTTCAAAAGTCATCAGAACAAGCACGGTTGATCCATGAACTGCCAGAAGTGATAGCAGATATTCTTGAACCTACATTTGATGATTTACTAAAGCAGAATGAACAAGAAAATCACATGTTGGTTGATGGGAGCGACGTCAGAAAAGTTGCCACAGCTGCCATGGTAGAAGAATGTTTGATTGGCATGCAAACTATTTCAGAGAAGCAGCAACACTTCGAGGTATCTACTTGTAAAGATTTTGCTCAAAAATCTTATATCTCAGCTGTAGAATTTCAAACTCATGAAGAGCAGCATCAACCTCTTCTGCCAAAGGAGAAAGCATGTAAAGGTTTTGCTACAAAATCATGCATCCCAGCAGCAGAATTTCAACCTCATAGAGAGCAGCATCAATCCATTCTACCAAAGAAACATGCATATTCCAAACCATTGCTTTCCAGCAT
CAAAAGGCAAAGTGAGTATATCAATATTCAGAAATCAAAGTTCAAAAGCAAAAGGGCTTCTGAAGTAGAGTTGATTGAGCTAAGCGATAACGAGGATTTGAAAGCTGAAGATAAAATGCAGACTTCAGAGAATCCAAATTTCTCCCTGTGGTATTGTGCAAGTCCTCAAGGGGAGACAAGGGGACCCTTGCCACTGTCATTACTGAAGCAATGGAGGGACCGAAGCTCATTTGAGTTGAAATGTAAAGTTTGGAAGAATGGTCAGAGCTCGCAAGAGGGCATCCCTTTGAGCGATGCCATTAGGTTGTTTTTCCCTGAATAG

Protein sequence

MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDWHSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDIDGTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFRNEIEESEEDTDEYEIPSDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIGKDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTAHFAENMEQSSDDESTSSIEEKDDNSSMACNKPRKLISDRKPAELELSDVSHTCSAAIISANIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIMIDSSNTGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMELYEKAKSLHEDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVIADILEPTFDDLLKQNEQENHMLVDGSDVRKVATAAMVEECLIGMQTISEKQQHFEVSTCKDFAQKSYISAVEFQTHEEQHQPLLPKEKACKGFATKSCIPAAEFQPHREQHQSILPKKHAYSKPLLSSIKRQSEYINIQKSKFKSKRASEVELIELSDNEDLKAEDKMQTSENPNFSLWYCASPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRLFFPE
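
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. The sketch below assumes the standard genetic code and uses a hypothetical helper named `translate`; it is not the pipeline that produced this record. Codons containing an ambiguous base such as N are reported as X.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 45 bases and last 18 bases of the CDS listed above.
print(translate("ATGGTAAAGAAGAAATGCACAAGGAAAGAAGAAATCGGAGAGGAT"))
print(translate("TTGTTTTTCCCTGAATAG"))
```

The two calls reproduce the start (MVKKKCTRKEEIGED) and the end (LFFPE) of the protein sequence listed above; passing the full CDS would be the natural way to check the whole record.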
Homology
BLAST of Cp4.1LG01g08680 vs. ExPASy Swiss-Prot
Match: Q9FT92 (Uncharacterized protein At5g08430 OS=Arabidopsis thaliana OX=3702 GN=At5g08430 PE=1 SV=2)

HSP 1 Score: 307.4 bits (786), Expect = 4.7e-82
Identity = 204/576 (35.42%), Positives = 303/576 (52.60%), Query Frame = 0

Query: 214 RKKMKSSDQKFIGWGSKPVIEFLSTIGKDTRKKLSQHDVTSIITTYCKENKLFHPLKKKK 273
           ++K +    +F+GWGS+ +IEFL ++GKDT + +S++DV+  I  Y  +  L  P  KKK
Sbjct: 19  KRKARPKRFEFVGWGSRQLIEFLHSLGKDTSEMISRYDVSDTIAKYISKEGLLDPSNKKK 78

Query: 274 IICDAKLQAVFGRKSMNVNTVHKHLTAHFAENMEQSSDDESTSSIEEKDDNSSMACNKPR 333
           ++CD +L  +FG +++    V+  L  H+ EN + S  D       +   +S     +  
Sbjct: 79  VVCDKRLVLLFGTRTIFRMKVYDLLEKHYKENQDDSDFDFLYEDEPQIICHSEKIAKRTS 138

Query: 334 KLISDRKPAELELSDVSHTCSAAIISANIKLVYLKRSLVERLLENPECFEGKMIGSFIRV 393
           K++  +KP             AAI+S NIKL+YL++SLV+ LL++P+ FEGKM+GSF+R+
Sbjct: 139 KVV--KKP---------RGTFAAIVSDNIKLLYLRKSLVQELLKSPDTFEGKMLGSFVRI 198

Query: 394 KSDPNDYSQKNSYQLLQVTGIMIDSSNTGKQEILLQVTYRLDYIPIYNLSDDDFCEEECE 453
           KSDPNDY QK  YQL+QVTG+       G  + LLQVT  +  + I  LSDD+F +EECE
Sbjct: 199 KSDPNDYLQKYPYQLVQVTGV---KKEHGTDDFLLQVTNYVKDVSISVLSDDNFSQEECE 258

Query: 454 DLRQRMKNGLLKNPTVMELYEKAKSLHEDITKHWITEELARLQTCIDHANEKGWRRELFE 513
           DL QR+KNGLLK PT++E+ EKAK LH+D TKHW+  E+  L+  ID ANEKGWRREL E
Sbjct: 259 DLHQRIKNGLLKKPTIVEMEEKAKKLHKDQTKHWLGREIELLKRLIDRANEKGWRRELSE 318

Query: 514 YMEKRLLLQKSSEQARLIHELPEVIADILEPTFDDLLKQNEQENHMLVDGSDVRKVATAA 573
           Y++KR LLQ   EQARL+ E+PEVI        ++L++  E  +       + ++++ + 
Sbjct: 319 YLDKRELLQNPDEQARLLREVPEVIG-------EELVQNPEVSSPEAHKSDNEQRLSESP 378

Query: 574 M--VEECLIGMQTISEKQQHFEVSTCKDFAQKSYISAVEFQTHEEQHQPLLPKEKAC--- 633
           +  + E          + Q F            Y+ +    T         P   +C   
Sbjct: 379 LSCIHETPEARNLFGGEDQQF---------NNGYVMSNPITT---------PGITSCATE 438

Query: 634 --KGFATKSCIPAAEFQPHREQHQ---SILPKKHAYSKPLLSSIKRQSEYINIQKSKFKS 693
             KG  T      AE+  H +  Q    I+  +    +  +S ++      N+       
Sbjct: 439 INKGLPTWIASAGAEYL-HVDVEQPANGIIGGETPTEESKVSQLQSSIPVNNVDNGSQVQ 498

Query: 694 KRASEVELIELSDNE----------DLKAEDKMQTSENPNFSLWYCASPQGETRGPLPLS 753
              SEV  IELSD++          D K ED    S +     W    PQG  +GP  L+
Sbjct: 499 PNPSEV--IELSDDDEDDNGDGETLDPKVEDVRVLSYDKEKLNWLYKDPQGLVQGPFSLT 552

Query: 754 LLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRL 770
            LK W D   F  + +VW  G+S +  + L+D +RL
Sbjct: 559 QLKAWSDAEYFTKQFRVWMTGESMESAVLLTDVLRL 552

BLAST of Cp4.1LG01g08680 vs. ExPASy Swiss-Prot
Match: Q9SIV5 (Zinc finger CCCH domain-containing protein 19 OS=Arabidopsis thaliana OX=3702 GN=NERD PE=1 SV=3)

HSP 1 Score: 261.5 bits (667), Expect = 2.9e-68
Identity = 215/805 (26.71%), Positives = 379/805 (47.08%), Query Frame = 0

Query: 14   EDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDWHSCFLCHKTSKFR 73
            ED CF+C DGG L  CD + C KAYHP CV R+++  +++ +W C WH C  C KT+ + 
Sbjct: 599  EDVCFMCFDGGDLVLCDRRGCTKAYHPSCVDRDEAFFQTKGKWNCGWHLCSKCEKTATYL 658

Query: 74   CVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDIDGTKVDFNDRETYE 133
            C  C  ++C  C  +A F  +RG +G C  C++   LIE  K  + +  ++DFND+ ++E
Sbjct: 659  CYTCMFSLCKGCAKDAVFFCIRGNKGLCETCMETVKLIE-RKQQEKEPAQLDFNDKTSWE 718

Query: 134  FLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFRNEIEESEEDTDEYEIPSDYEELVDT 193
            +LFK+YW  +K +  L+ E +  A   LK      +E   S++ T      +DY     +
Sbjct: 719  YLFKDYWIDLKTQLSLSPEELDQAKRPLK-----GHETNASKQGTAS---ETDYVTDGGS 778

Query: 194  EEGHKLVRKCKRSKEKLCTTRKKMKSSDQKF----IGWGSKPVIEFLSTIGKDTRKKLSQ 253
            +      ++  RS+ K  +  K + S D+      + W SK +++ +  + +  R  L  
Sbjct: 779  DSDSSPKKRKTRSRSKSGSAEKILSSGDKNLSDETMEWASKELLDLVVHMRRGDRSFLPM 838

Query: 254  HDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTAHFAENMEQS 313
             +V +++  Y K   L  P +K ++ICD++LQ +FG+  +    +   L +HF +  +  
Sbjct: 839  LEVQTLLLAYIKRYNLRDPRRKSQVICDSRLQNLFGKSHVGHFEMLNLLDSHFLKKEQNQ 898

Query: 314  SDDESTSSIEEKDDNS-----------SMACNKPRKLISD--RKPAELELSDVSHTCSAA 373
            +DD     ++ ++ N                +K RK      RK  +  L D      AA
Sbjct: 899  ADDIQGDIVDTEEPNHVDVDENLDHPVKSGKDKKRKTRKKNVRKGRQSNLDDF-----AA 958

Query: 374  IISANIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGI-- 433
            +   NI L+YL+RSLVE LLE+   FE K+  +F+R++   N   +++ Y+L+QV G   
Sbjct: 959  VDMHNINLIYLRRSLVEDLLEDSTAFEEKVASAFVRLRISGN--QKQDLYRLVQVVGTSK 1018

Query: 434  MIDSSNTGKQ--EILLQVTY--RLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVM 493
              +    GK+  + +L++    + + I I  +S+ DF E+EC+ L+Q +K GL+   TV 
Sbjct: 1019 APEPYKVGKKTTDYVLEILNLDKTEVISIDIISNQDFTEDECKRLKQSIKCGLINRLTVG 1078

Query: 494  ELYEKAKSLHEDITKHWITEELARLQTCIDHANEKGWRRE---------------LFEYM 553
            ++ EKA +L E   K+ +  E+ R     D A++ G R+E               L E +
Sbjct: 1079 DIQEKAIALQEVRVKNLLEAEILRFSHLRDRASDMGRRKEYPYLLKLSNSLTMLTLRECV 1138

Query: 554  EKRLLLQKSSEQARLIHELPEVIADILEPTFD-DLLKQNEQENHMLVDGSDVRKVATAAM 613
            EK  LL+   E+ R + E+PE+ AD   P  D D   ++E E         +R  +++  
Sbjct: 1139 EKLQLLKSPEERQRRLEEIPEIHAD---PKMDPDCESEDEDEKEEKEKEKQLRPRSSSFN 1198

Query: 614  VEECLIGMQTISEKQQHFEVSTCKDFAQKSYISAVEFQTHEEQHQPLLPKEKACKGFATK 673
                  G   IS ++  F              S+ E  T    +       +  + ++ +
Sbjct: 1199 RR----GRDPISPRKGGF--------------SSNESWTGTSNYSNTSANRELSRSYSGR 1258

Query: 674  SCIPAAEFQPHREQHQS---ILPKKHAYSKPLLSSIKRQSEYINIQKSKFKSKRA-SEVE 733
                  ++    +   S       +    +P L S K +S  ++I ++  +S RA +  E
Sbjct: 1259 GSTGRGDYLGSSDDKVSDSMWTSAREREVQPSLGSEKPRS--VSIPETPARSSRAIAPPE 1318

Query: 734  LIELSDNEDLKAEDKMQT----SENPNFSLWYCASPQGETRGPLPLSLLKQWRDRSSFEL 772
            L     +E   A   + +      N +  +W+   P G+ +GP  ++ L++W +   F  
Sbjct: 1319 LSPRIASEISMAPPAVVSQPVPKSNDSEKIWHYKDPSGKVQGPFSMAQLRKWNNTGYFPA 1364

BLAST of Cp4.1LG01g08680 vs. ExPASy Swiss-Prot
Match: Q9SD34 (Zinc finger CCCH domain-containing protein 44 OS=Arabidopsis thaliana OX=3702 GN=At3g51120 PE=2 SV=3)

HSP 1 Score: 241.1 bits (614), Expect = 4.1e-62
Identity = 170/573 (29.67%), Positives = 294/573 (51.31%), Query Frame = 0

Query: 8   RKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDWHSCFLCH 67
           +KE+  ED CFIC DGG L  CD ++C KAYHP C+ R+++   +  +W C WH C  C 
Sbjct: 104 KKEDKEEDVCFICFDGGDLVLCDRRNCPKAYHPACIKRDEAFFRTTAKWNCGWHICGTCQ 163

Query: 68  KTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDIDGTKVDFN 127
           K S + C  C  +VC RCI +A++V VRG  G C  C++  +LIE+    D +  KVDF+
Sbjct: 164 KASSYMCYTCTFSVCKRCIKDADYVIVRGNMGLCGTCIKPIMLIENIAQGDNEAVKVDFD 223

Query: 128 DRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFRNEIEESEEDTDE--YEIPS 187
           D+ ++E+LFK YW  +K++  LT + +  A+N  K+  N   ++E   + T+    ++  
Sbjct: 224 DKLSWEYLFKVYWLCLKEELSLTVDELTRANNPWKEVPNTAPKVESQNDHTNNRALDVAV 283

Query: 188 DYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIGKDTRK 247
           +  +   T +   L  K           +    +S      W +K ++EF+S +      
Sbjct: 284 NGTKRRRTSDSPTLPNKLDGKNPSNILKKAPGDTS------WATKELLEFVSFMKNGDTS 343

Query: 248 KLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMN----VNTVHKHLTAH 307
            LSQ DV  ++  Y K+  L  PL+K +++CD  L  +FG++ +     +  +  H+   
Sbjct: 344 VLSQFDVQGLLLDYIKKKNLRDPLQKSQVLCDQMLVKLFGKQRVGHFEMLKLLESHVLIQ 403

Query: 308 FAENMEQSSDDEST----SSIEEKDDNSSMACNKPRKLISDRKPAELELSDVSHTCSAAI 367
                 ++++ E+T    S IEE   +  M  ++ RK+   R+  +  + + +    AAI
Sbjct: 404 EKPKGAKTTNGETTHAVPSQIEEDSVHDPMVRDRRRKM---RRKTDGRVQNENLDAYAAI 463

Query: 368 ISANIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGI--M 427
              NI L+YL+R  +E LL++    + K++G+ +R+K   +D  + + ++L+QV G    
Sbjct: 464 DVHNINLIYLRRKFLESLLDDINKVDEKVVGTILRIKVSGSD-QKLDIHRLVQVVGTSKA 523

Query: 428 IDSSNTGKQ--EILLQVTY--RLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVME 487
           I S   G +  +++L++    + + I I  LSD +  E+EC+ LRQ +K GL K  TV++
Sbjct: 524 IASYQLGAKTTDVMLEILNLDKREVISIDQLSDQNITEDECKRLRQSIKCGLNKRLTVVD 583

Query: 488 LYEKAKSLHEDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLI 547
           + + A +L        +  E+ +L    D A             +K  LL+   E+ RL+
Sbjct: 584 ILKTAATLQAMRINEALEAEILKLNHLRDRA-------------KKLELLKSPEERQRLL 643

Query: 548 HELPEVIAD-ILEPTF----DDLLKQNEQENHM 560
            E+PEV  D  ++P+     D  L   +Q+NH+
Sbjct: 644 QEVPEVHTDPSMDPSHALSEDAGLGTRKQDNHV 653

BLAST of Cp4.1LG01g08680 vs. ExPASy Swiss-Prot
Match: O96028 (Histone-lysine N-methyltransferase NSD2 OS=Homo sapiens OX=9606 GN=NSD2 PE=1 SV=1)

HSP 1 Score: 70.5 bits (171), Expect = 9.6e-11
Identity = 39/105 (37.14%), Positives = 45/105 (42.86%), Query Frame = 0

Query: 3    KKKCTRKEEIG------EDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRW 62
            KKK  R+   G      ED CF C DGG L  CD K C KAYH  C+G          +W
Sbjct: 1222 KKKTRRRRAKGEGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLG---LGKRPFGKW 1281

Query: 63   TCDWHSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFC 102
             C WH C +C K S   C  CP + C        F      R +C
Sbjct: 1282 ECPWHHCDVCGKPSTSFCHLCPNSFCKEHQDGTAFSCTPDGRSYC 1323

BLAST of Cp4.1LG01g08680 vs. ExPASy Swiss-Prot
Match: Q8BVE8 (Histone-lysine N-methyltransferase NSD2 OS=Mus musculus OX=10090 GN=Nsd2 PE=1 SV=2)

HSP 1 Score: 70.5 bits (171), Expect = 9.6e-11
Identity = 38/105 (36.19%), Positives = 46/105 (43.81%), Query Frame = 0

Query: 3    KKKCTRKEEIG------EDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRW 62
            KKK  R+   G      ED CF C DGG L  CD K C KAYH  C+G          +W
Sbjct: 1222 KKKTRRRRAKGEGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLG---LGKRPFGKW 1281

Query: 63   TCDWHSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFC 102
             C WH C +C K S   C  CP + C        F   +  + +C
Sbjct: 1282 ECPWHHCDVCGKPSTSFCHLCPNSFCKEHQDGTAFRSTQDGQSYC 1323

BLAST of Cp4.1LG01g08680 vs. NCBI nr
Match: XP_023539109.1 (uncharacterized protein At5g08430-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1557 bits (4031), Expect = 0.0
Identity = 773/773 (100.00%), Positives = 773/773 (100.00%), Query Frame = 0

Query: 1   MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW 60
           MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW
Sbjct: 1   MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW 60

Query: 61  HSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID 120
           HSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID
Sbjct: 61  HSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID 120

Query: 121 GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFRNEIEESEEDTDE 180
           GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFRNEIEESEEDTDE
Sbjct: 121 GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFRNEIEESEEDTDE 180

Query: 181 YEIPSDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIG 240
           YEIPSDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIG
Sbjct: 181 YEIPSDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIG 240

Query: 241 KDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA 300
           KDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA
Sbjct: 241 KDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA 300

Query: 301 HFAENMEQSSDDESTSSIEEKDDNSSMACNKPRKLISDRKPAELELSDVSHTCSAAIISA 360
           HFAENMEQSSDDESTSSIEEKDDNSSMACNKPRKLISDRKPAELELSDVSHTCSAAIISA
Sbjct: 301 HFAENMEQSSDDESTSSIEEKDDNSSMACNKPRKLISDRKPAELELSDVSHTCSAAIISA 360

Query: 361 NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIMIDSSN 420
           NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIMIDSSN
Sbjct: 361 NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIMIDSSN 420

Query: 421 TGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMELYEKAKSLH 480
           TGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMELYEKAKSLH
Sbjct: 421 TGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMELYEKAKSLH 480

Query: 481 EDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVIAD 540
           EDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVIAD
Sbjct: 481 EDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVIAD 540

Query: 541 ILEPTFDDLLKQNEQENHMLVDGSDVRKVATAAMVEECLIGMQTISEKQQHFEVSTCKDF 600
           ILEPTFDDLLKQNEQENHMLVDGSDVRKVATAAMVEECLIGMQTISEKQQHFEVSTCKDF
Sbjct: 541 ILEPTFDDLLKQNEQENHMLVDGSDVRKVATAAMVEECLIGMQTISEKQQHFEVSTCKDF 600

Query: 601 AQKSYISAVEFQTHEEQHQPLLPKEKACKGFATKSCIPAAEFQPHREQHQSILPKKHAYS 660
           AQKSYISAVEFQTHEEQHQPLLPKEKACKGFATKSCIPAAEFQPHREQHQSILPKKHAYS
Sbjct: 601 AQKSYISAVEFQTHEEQHQPLLPKEKACKGFATKSCIPAAEFQPHREQHQSILPKKHAYS 660

Query: 661 KPLLSSIKRQSEYINIQKSKFKSKRASEVELIELSDNEDLKAEDKMQTSENPNFSLWYCA 720
           KPLLSSIKRQSEYINIQKSKFKSKRASEVELIELSDNEDLKAEDKMQTSENPNFSLWYCA
Sbjct: 661 KPLLSSIKRQSEYINIQKSKFKSKRASEVELIELSDNEDLKAEDKMQTSENPNFSLWYCA 720

Query: 721 SPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRLFFPE 773
           SPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRLFFPE
Sbjct: 721 SPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRLFFPE 773

BLAST of Cp4.1LG01g08680 vs. NCBI nr
Match: XP_022942556.1 (uncharacterized protein At5g08430-like [Cucurbita moschata])

HSP 1 Score: 1539 bits (3985), Expect = 0.0
Identity = 763/773 (98.71%), Positives = 768/773 (99.35%), Query Frame = 0

Query: 1   MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW 60
           MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW
Sbjct: 1   MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW 60

Query: 61  HSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID 120
           HSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID
Sbjct: 61  HSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID 120

Query: 121 GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFRNEIEESEEDTDE 180
           GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFRNEIEESEEDTDE
Sbjct: 121 GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFRNEIEESEEDTDE 180

Query: 181 YEIPSDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIG 240
           YEI SDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIG
Sbjct: 181 YEISSDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIG 240

Query: 241 KDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA 300
           KDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA
Sbjct: 241 KDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA 300

Query: 301 HFAENMEQSSDDESTSSIEEKDDNSSMACNKPRKLISDRKPAELELSDVSHTCSAAIISA 360
           HFAENMEQSSDDESTSSIEEKDDNSSMAC KPRKLISDRKPAELELSDVSHTCSAAIISA
Sbjct: 301 HFAENMEQSSDDESTSSIEEKDDNSSMACKKPRKLISDRKPAELELSDVSHTCSAAIISA 360

Query: 361 NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIMIDSSN 420
           NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGI+IDSSN
Sbjct: 361 NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIVIDSSN 420

Query: 421 TGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMELYEKAKSLH 480
           TGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMEL+EKAKSLH
Sbjct: 421 TGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMELFEKAKSLH 480

Query: 481 EDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVIAD 540
           EDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVIAD
Sbjct: 481 EDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVIAD 540

Query: 541 ILEPTFDDLLKQNEQENHMLVDGSDVRKVATAAMVEECLIGMQTISEKQQHFEVSTCKDF 600
           ILEPTFDDLLKQNEQENHMLVDG D RKVATAAMVEECLIGMQTISEKQQHFEVSTCKDF
Sbjct: 541 ILEPTFDDLLKQNEQENHMLVDGRDDRKVATAAMVEECLIGMQTISEKQQHFEVSTCKDF 600

Query: 601 AQKSYISAVEFQTHEEQHQPLLPKEKACKGFATKSCIPAAEFQPHREQHQSILPKKHAYS 660
           AQKSYISAVEFQTHE+QHQP+LPKEK CKGFATKSCIPAAEFQPH+EQHQSILPKKHAYS
Sbjct: 601 AQKSYISAVEFQTHEQQHQPILPKEKVCKGFATKSCIPAAEFQPHKEQHQSILPKKHAYS 660

Query: 661 KPLLSSIKRQSEYINIQKSKFKSKRASEVELIELSDNEDLKAEDKMQTSENPNFSLWYCA 720
           KPLLSSIKRQSEYINIQKSKFKSKRASEVELIELSDNEDLKAEDKMQTSENPNFSLWYCA
Sbjct: 661 KPLLSSIKRQSEYINIQKSKFKSKRASEVELIELSDNEDLKAEDKMQTSENPNFSLWYCA 720

Query: 721 SPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRLFFPE 773
           SPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRLFFPE
Sbjct: 721 SPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRLFFPE 773

BLAST of Cp4.1LG01g08680 vs. NCBI nr
Match: KAG7031352.1 (hypothetical protein SDJN02_05392, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1445 bits (3741), Expect = 0.0
Identity = 737/826 (89.23%), Positives = 752/826 (91.04%), Query Frame = 0

Query: 1   MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW 60
           MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW
Sbjct: 1   MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW 60

Query: 61  HSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID 120
           HSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID
Sbjct: 61  HSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID 120

Query: 121 GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFRNEIEESEEDTDE 180
           GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELV MASNLLKKGRNFRNEIEESEEDTDE
Sbjct: 121 GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVQMASNLLKKGRNFRNEIEESEEDTDE 180

Query: 181 YEIPSDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIG 240
           YEI SDYEELVDTEEGH LVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIG
Sbjct: 181 YEISSDYEELVDTEEGHNLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIG 240

Query: 241 KDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA 300
           KDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA
Sbjct: 241 KDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA 300

Query: 301 HFAENMEQSSDDESTSSIEEKDDNSSMACNKPRKLISDRKPAELELSDVSHTCSAAIISA 360
           HFAENMEQSSDDESTSSIEEKDDNSSMAC KPRKLISDRKPAELELSDVSHTCSAAIISA
Sbjct: 301 HFAENMEQSSDDESTSSIEEKDDNSSMACKKPRKLISDRKPAELELSDVSHTCSAAIISA 360

Query: 361 NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIMIDSSN 420
           NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGI+IDSSN
Sbjct: 361 NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIVIDSSN 420

Query: 421 TGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMELYEKAKSLH 480
           TGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMEL+EKAKSLH
Sbjct: 421 TGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMELFEKAKSLH 480

Query: 481 EDITKH-------WI-------------------TEELARLQTCID-------------- 540
           EDITKH       W+                   ++ L  +   +               
Sbjct: 481 EDITKHVGGLQRSWLDCKRVLIMQMRRDGEENILSDSLYMIYVVLGLGPFYSVGEFHVDL 540

Query: 541 --------HANEKGWRRE-----LFEYMEKRLLLQKSSEQARLIHELPEVIADILEPTFD 600
                   + ++  +  +     LFEYMEKRLLLQKSSEQARLIHELPEVIADILEPTFD
Sbjct: 541 TKPAFFFLNKDKISFHNQCFVCTLFEYMEKRLLLQKSSEQARLIHELPEVIADILEPTFD 600

Query: 601 DLLKQNEQENHMLVDGSDVRKVATAAMVEECLIGMQTISEKQQHFEVSTCKDFAQKSYIS 660
           DL KQNEQENHMLVDG D RKVATAAMVEECLIGMQTISEKQQHFEVSTCKDFAQKS+IS
Sbjct: 601 DLQKQNEQENHMLVDGRDDRKVATAAMVEECLIGMQTISEKQQHFEVSTCKDFAQKSFIS 660

Query: 661 AVEFQTHEEQHQPLLPKEKACKGFATKSCIPAAEFQPHREQHQSILPKKHAYSKPLLSSI 720
           AVEFQTHEEQHQP+LPKEKACK FATKSCI AAEFQPH+EQHQSILPKKHAYSKPLLSSI
Sbjct: 661 AVEFQTHEEQHQPILPKEKACKDFATKSCIAAAEFQPHKEQHQSILPKKHAYSKPLLSSI 720

Query: 721 KRQSEYINIQKSKFKSKRASEVELIELSDNEDLKAEDKMQTSENPNFSLWYCASPQGETR 773
           KRQSEYINIQKSKFKSKRASEVELIELSDNEDLKAEDKMQTSENPNFSLWYCASPQGETR
Sbjct: 721 KRQSEYINIQKSKFKSKRASEVELIELSDNEDLKAEDKMQTSENPNFSLWYCASPQGETR 780

BLAST of Cp4.1LG01g08680 vs. NCBI nr
Match: XP_022984438.1 (uncharacterized protein At5g08430-like [Cucurbita maxima])

HSP 1 Score: 1439 bits (3726), Expect = 0.0
Identity = 724/773 (93.66%), Positives = 731/773 (94.57%), Query Frame = 0

Query: 1   MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW 60
           MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW
Sbjct: 1   MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW 60

Query: 61  HSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID 120
           H+CFLCHKTSKFRCV CPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID
Sbjct: 61  HACFLCHKTSKFRCVGCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID 120

Query: 121 GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFRNEIEESEEDTDE 180
           GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELV+MASNLLKKGRNFRNEIEESEEDTDE
Sbjct: 121 GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVYMASNLLKKGRNFRNEIEESEEDTDE 180

Query: 181 YEIPSDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIG 240
           YEI SDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLS IG
Sbjct: 181 YEISSDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSKIG 240

Query: 241 KDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA 300
           KDTRKK+SQHDVTSIIT YCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA
Sbjct: 241 KDTRKKMSQHDVTSIITIYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA 300

Query: 301 HFAENMEQSSDDESTSSIEEKDDNSSMACNKPRKLISDRKPAELELSDVSHTCSAAIISA 360
           HFAENMEQSSDDESTSSIEEKDDNSSMAC KPRKLISDRKPAE ELSDVSHTCSAAIISA
Sbjct: 301 HFAENMEQSSDDESTSSIEEKDDNSSMACKKPRKLISDRKPAEQELSDVSHTCSAAIISA 360

Query: 361 NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIMIDSSN 420
           NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIMIDSSN
Sbjct: 361 NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIMIDSSN 420

Query: 421 TGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMELYEKAKSLH 480
           T KQEILLQVTYRLDYIPIYNLSDDDFCE+ECEDLRQRMKNGLLKNPTVMELYEKAKSLH
Sbjct: 421 TEKQEILLQVTYRLDYIPIYNLSDDDFCEDECEDLRQRMKNGLLKNPTVMELYEKAKSLH 480

Query: 481 EDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVIAD 540
           EDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVIAD
Sbjct: 481 EDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVIAD 540

Query: 541 ILEPTFDDLLKQNEQENHMLVDGSDVRKVATAAMVEECLIGMQTISEKQQHFEVSTCKDF 600
           ILEPTFDDLLKQNEQENHMLVDG D RKVATAAMVEECLIGMQTISEKQQHFEVSTCK  
Sbjct: 541 ILEPTFDDLLKQNEQENHMLVDGRDDRKVATAAMVEECLIGMQTISEKQQHFEVSTCK-- 600

Query: 601 AQKSYISAVEFQTHEEQHQPLLPKEKACKGFATKSCIPAAEFQPHREQHQSILPKKHAYS 660
                                        GFA KSC+ AAEFQPH+EQHQSILPKKHAYS
Sbjct: 601 -----------------------------GFAKKSCVSAAEFQPHKEQHQSILPKKHAYS 660

Query: 661 KPLLSSIKRQSEYINIQKSKFKSKRASEVELIELSDNEDLKAEDKMQTSENPNFSLWYCA 720
           KPLLSSIKRQSEYINIQKSKFKSKRAS+VELIELSDNEDLKAEDKMQTSENPNFSLWYCA
Sbjct: 661 KPLLSSIKRQSEYINIQKSKFKSKRASDVELIELSDNEDLKAEDKMQTSENPNFSLWYCA 720

Query: 721 SPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRLFFPE 773
           SPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRLFFPE
Sbjct: 721 SPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRLFFPE 742

BLAST of Cp4.1LG01g08680 vs. NCBI nr
Match: XP_038905176.1 (uncharacterized protein At5g08430-like isoform X2 [Benincasa hispida])

HSP 1 Score: 1155 bits (2989), Expect = 0.0
Identity = 604/796 (75.88%), Positives = 667/796 (83.79%), Query Frame = 0

Query: 4   KKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDWHSC 63
           KKC  KEEIG+DFCFICKDGGLLRFCDFKDCLKAYHPECVGRE+S VESEDRW CDWHSC
Sbjct: 7   KKCKTKEEIGDDFCFICKDGGLLRFCDFKDCLKAYHPECVGREESFVESEDRWICDWHSC 66

Query: 64  FLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDIDGTK 123
           FLC KTSKFRCV CPQAVCGRCIFNAEFV VRG RGFCNHCL+L LLIEDGKD DIDGTK
Sbjct: 67  FLCRKTSKFRCVGCPQAVCGRCIFNAEFVCVRGSRGFCNHCLKLALLIEDGKDADIDGTK 126

Query: 124 VDFNDRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFR-----NEIEESEEDT 183
           VDFNDRETYE LFKEYWELMKKK+GLTAE VH ASNLLKKGRN+R     NEIEESEEDT
Sbjct: 127 VDFNDRETYECLFKEYWELMKKKEGLTAEHVHTASNLLKKGRNYRCDFNSNEIEESEEDT 186

Query: 184 DEYEIPSDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLST 243
           DEYE+ SDYEELV TEEGH LV+KCKR KEKL +TRKKMKSS+++FIGWGSKPVI+FLS 
Sbjct: 187 DEYELSSDYEELVYTEEGHALVKKCKRRKEKLGSTRKKMKSSNKEFIGWGSKPVIDFLSK 246

Query: 244 IGKDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHL 303
           IGKDT KKL+QHDV SIIT YCKENKLFHP KKK+I+CDAKLQ+VFGRK MNVN+V+KHL
Sbjct: 247 IGKDTSKKLTQHDVASIITAYCKENKLFHPQKKKRILCDAKLQSVFGRKIMNVNSVNKHL 306

Query: 304 TAHFAENMEQSSDDESTSSIEEKDDNSSMACNKPRKLISDRKPAELELSDVSHTCSAAII 363
           TAHFAENME+SS+DESTSS+E KDDNS MAC + RKL SDRKPAE   SD+SH CSAAII
Sbjct: 307 TAHFAENMEESSEDESTSSME-KDDNSIMACKRQRKLGSDRKPAEQNPSDMSHNCSAAII 366

Query: 364 SANIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIMIDS 423
           +ANIKLVYLKRSLVERLLE+ ECFEGKM+GSF+R KSDPNDYSQKNSYQLLQVTGI IDS
Sbjct: 367 AANIKLVYLKRSLVERLLEDKECFEGKMMGSFVRAKSDPNDYSQKNSYQLLQVTGIKIDS 426

Query: 424 SNTGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMELYEKAKS 483
           SNTGKQ ILLQV  RLDYIPIYNLSDDDF EEECEDL QR++NGLL+ PT+ EL EKAKS
Sbjct: 427 SNTGKQGILLQVANRLDYIPIYNLSDDDFFEEECEDLHQRVRNGLLRQPTLEELCEKAKS 486

Query: 484 LHEDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVI 543
           LHEDI KHWI +ELARLQTCIDHANEKGWRRELFEYMEKR+LLQ+ SEQARLIHELP+VI
Sbjct: 487 LHEDIIKHWIPKELARLQTCIDHANEKGWRRELFEYMEKRILLQEPSEQARLIHELPKVI 546

Query: 544 ADILEPTFDDLLKQNEQENHMLVDGSDVRKVATAAMVEECLIGMQTISEKQQHFEVSTCK 603
           ADI EPTF+DLL+++E  NH+LVD  D RK ATAA VEECLIG++ ISEKQQ  EVSTCK
Sbjct: 547 ADIPEPTFEDLLEKDEV-NHVLVDRKDGRKAATAAEVEECLIGVRNISEKQQQSEVSTCK 606

Query: 604 DFAQKSYISAVEFQTHEEQHQPLLPKEKACKGFA----TKSCIP---AAEFQ-------- 663
           DFA+KS ISAVEFQT +EQHQ +LPKE  C   +     KS +    A+E Q        
Sbjct: 607 DFAKKSCISAVEFQTRDEQHQSILPKEHVCSNPSWNNIQKSKLKNKKASEVQLIESKLKN 666

Query: 664 ---PHREQHQSILPKKHAYSKPLLSSIKRQSEYINIQ--KSKFKSKRASEVELIELSDN- 723
                 +  QS L  K A    L+ S  +     ++Q  + K K+K ASEV+LIELSD+ 
Sbjct: 667 KNASEVQLMQSKLKNKIASDVQLVESKLKNKNASDVQMVEPKLKNKNASEVQLIELSDDD 726

Query: 724 EDLKAEDKMQTSENPNFSLWYCASPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSS 773
           EDL+ E+KMQ  ENPN S+WYCASPQGETRGPLP+SLLKQWRD SSFELKCKVWK+ QSS
Sbjct: 727 EDLRVEEKMQNLENPNVSMWYCASPQGETRGPLPMSLLKQWRDSSSFELKCKVWKSDQSS 786

BLAST of Cp4.1LG01g08680 vs. ExPASy TrEMBL
Match: A0A6J1FP67 (uncharacterized protein At5g08430-like OS=Cucurbita moschata OX=3662 GN=LOC111447554 PE=4 SV=1)

HSP 1 Score: 1539 bits (3985), Expect = 0.0
Identity = 763/773 (98.71%), Positives = 768/773 (99.35%), Query Frame = 0

Query: 1   MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW 60
           MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW
Sbjct: 1   MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW 60

Query: 61  HSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID 120
           HSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID
Sbjct: 61  HSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID 120

Query: 121 GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFRNEIEESEEDTDE 180
           GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFRNEIEESEEDTDE
Sbjct: 121 GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFRNEIEESEEDTDE 180

Query: 181 YEIPSDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIG 240
           YEI SDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIG
Sbjct: 181 YEISSDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIG 240

Query: 241 KDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA 300
           KDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA
Sbjct: 241 KDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA 300

Query: 301 HFAENMEQSSDDESTSSIEEKDDNSSMACNKPRKLISDRKPAELELSDVSHTCSAAIISA 360
           HFAENMEQSSDDESTSSIEEKDDNSSMAC KPRKLISDRKPAELELSDVSHTCSAAIISA
Sbjct: 301 HFAENMEQSSDDESTSSIEEKDDNSSMACKKPRKLISDRKPAELELSDVSHTCSAAIISA 360

Query: 361 NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIMIDSSN 420
           NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGI+IDSSN
Sbjct: 361 NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIVIDSSN 420

Query: 421 TGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMELYEKAKSLH 480
           TGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMEL+EKAKSLH
Sbjct: 421 TGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMELFEKAKSLH 480

Query: 481 EDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVIAD 540
           EDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVIAD
Sbjct: 481 EDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVIAD 540

Query: 541 ILEPTFDDLLKQNEQENHMLVDGSDVRKVATAAMVEECLIGMQTISEKQQHFEVSTCKDF 600
           ILEPTFDDLLKQNEQENHMLVDG D RKVATAAMVEECLIGMQTISEKQQHFEVSTCKDF
Sbjct: 541 ILEPTFDDLLKQNEQENHMLVDGRDDRKVATAAMVEECLIGMQTISEKQQHFEVSTCKDF 600

Query: 601 AQKSYISAVEFQTHEEQHQPLLPKEKACKGFATKSCIPAAEFQPHREQHQSILPKKHAYS 660
           AQKSYISAVEFQTHE+QHQP+LPKEK CKGFATKSCIPAAEFQPH+EQHQSILPKKHAYS
Sbjct: 601 AQKSYISAVEFQTHEQQHQPILPKEKVCKGFATKSCIPAAEFQPHKEQHQSILPKKHAYS 660

Query: 661 KPLLSSIKRQSEYINIQKSKFKSKRASEVELIELSDNEDLKAEDKMQTSENPNFSLWYCA 720
           KPLLSSIKRQSEYINIQKSKFKSKRASEVELIELSDNEDLKAEDKMQTSENPNFSLWYCA
Sbjct: 661 KPLLSSIKRQSEYINIQKSKFKSKRASEVELIELSDNEDLKAEDKMQTSENPNFSLWYCA 720

Query: 721 SPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRLFFPE 773
           SPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRLFFPE
Sbjct: 721 SPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRLFFPE 773

BLAST of Cp4.1LG01g08680 vs. ExPASy TrEMBL
Match: A0A6J1JAH4 (uncharacterized protein At5g08430-like OS=Cucurbita maxima OX=3661 GN=LOC111482736 PE=4 SV=1)

HSP 1 Score: 1439 bits (3726), Expect = 0.0
Identity = 724/773 (93.66%), Positives = 731/773 (94.57%), Query Frame = 0

Query: 1   MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW 60
           MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW
Sbjct: 1   MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW 60

Query: 61  HSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID 120
           H+CFLCHKTSKFRCV CPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID
Sbjct: 61  HACFLCHKTSKFRCVGCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID 120

Query: 121 GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFRNEIEESEEDTDE 180
           GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELV+MASNLLKKGRNFRNEIEESEEDTDE
Sbjct: 121 GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVYMASNLLKKGRNFRNEIEESEEDTDE 180

Query: 181 YEIPSDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIG 240
           YEI SDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLS IG
Sbjct: 181 YEISSDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSKIG 240

Query: 241 KDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA 300
           KDTRKK+SQHDVTSIIT YCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA
Sbjct: 241 KDTRKKMSQHDVTSIITIYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTA 300

Query: 301 HFAENMEQSSDDESTSSIEEKDDNSSMACNKPRKLISDRKPAELELSDVSHTCSAAIISA 360
           HFAENMEQSSDDESTSSIEEKDDNSSMAC KPRKLISDRKPAE ELSDVSHTCSAAIISA
Sbjct: 301 HFAENMEQSSDDESTSSIEEKDDNSSMACKKPRKLISDRKPAEQELSDVSHTCSAAIISA 360

Query: 361 NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIMIDSSN 420
           NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIMIDSSN
Sbjct: 361 NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIMIDSSN 420

Query: 421 TGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMELYEKAKSLH 480
           T KQEILLQVTYRLDYIPIYNLSDDDFCE+ECEDLRQRMKNGLLKNPTVMELYEKAKSLH
Sbjct: 421 TEKQEILLQVTYRLDYIPIYNLSDDDFCEDECEDLRQRMKNGLLKNPTVMELYEKAKSLH 480

Query: 481 EDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVIAD 540
           EDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVIAD
Sbjct: 481 EDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVIAD 540

Query: 541 ILEPTFDDLLKQNEQENHMLVDGSDVRKVATAAMVEECLIGMQTISEKQQHFEVSTCKDF 600
           ILEPTFDDLLKQNEQENHMLVDG D RKVATAAMVEECLIGMQTISEKQQHFEVSTCK  
Sbjct: 541 ILEPTFDDLLKQNEQENHMLVDGRDDRKVATAAMVEECLIGMQTISEKQQHFEVSTCK-- 600

Query: 601 AQKSYISAVEFQTHEEQHQPLLPKEKACKGFATKSCIPAAEFQPHREQHQSILPKKHAYS 660
                                        GFA KSC+ AAEFQPH+EQHQSILPKKHAYS
Sbjct: 601 -----------------------------GFAKKSCVSAAEFQPHKEQHQSILPKKHAYS 660

Query: 661 KPLLSSIKRQSEYINIQKSKFKSKRASEVELIELSDNEDLKAEDKMQTSENPNFSLWYCA 720
           KPLLSSIKRQSEYINIQKSKFKSKRAS+VELIELSDNEDLKAEDKMQTSENPNFSLWYCA
Sbjct: 661 KPLLSSIKRQSEYINIQKSKFKSKRASDVELIELSDNEDLKAEDKMQTSENPNFSLWYCA 720

Query: 721 SPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRLFFPE 773
           SPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRLFFPE
Sbjct: 721 SPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRLFFPE 742

BLAST of Cp4.1LG01g08680 vs. ExPASy TrEMBL
Match: A0A1S3BSR8 (uncharacterized protein At5g08430 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103493309 PE=4 SV=1)

HSP 1 Score: 1093 bits (2828), Expect = 0.0
Identity = 569/784 (72.58%), Positives = 637/784 (81.25%), Query Frame = 0

Query: 1   MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW 60
           M +KK    EEI +DFCF CKDGGLLRFCDFK CLKAYHPECVGRE+S  ESEDRW C  
Sbjct: 1   MGRKKSKTIEEIVDDFCFTCKDGGLLRFCDFKGCLKAYHPECVGREESFAESEDRWICGC 60

Query: 61  HSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID 120
           HSCFLCHKTSKFRCV CPQAVCGRCI++AEFV +RG RGFCNHCL+L LLIEDGKDVDID
Sbjct: 61  HSCFLCHKTSKFRCVGCPQAVCGRCIYSAEFVCIRGSRGFCNHCLKLALLIEDGKDVDID 120

Query: 121 GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFR-----NEIEESE 180
           GTKVDFNDR+TYE LFKEYWELMKK++GLTAE VH ASNLLKKGRN+      NEIE SE
Sbjct: 121 GTKVDFNDRDTYECLFKEYWELMKKREGLTAEHVHKASNLLKKGRNYNCGFNSNEIELSE 180

Query: 181 EDTDEYEIPSDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEF 240
           EDTDE EI SDYEELV TE+ H +VRKCKR K+KL +TRKKMKSS+++F GWGSKP+I+F
Sbjct: 181 EDTDEGEISSDYEELVYTEDEHAMVRKCKRRKQKLGSTRKKMKSSNKEFNGWGSKPLIDF 240

Query: 241 LSTIGKDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVH 300
           LS IGK T KKL+QHDV SIIT YCKENKLFHP KKK+I+CDAKLQ+VFGRK+MNVN+V+
Sbjct: 241 LSKIGKYTSKKLTQHDVASIITAYCKENKLFHPQKKKRILCDAKLQSVFGRKTMNVNSVN 300

Query: 301 KHLTAHFAENMEQSSDDESTSSIEEKDDNSSMACNKPRKLISDRKPAELELSDVSHTCSA 360
           KHLTAHFAENME+SS+DESTSSIE+ DDNS M    P KL S RKP E   SD+SH CSA
Sbjct: 301 KHLTAHFAENMEESSEDESTSSIEKNDDNSIMDYEGPSKLRSVRKPPEQNPSDMSHNCSA 360

Query: 361 AIISANIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIM 420
           AII ANIKLVYLKRS+VE  LE+ ECFE KM+GSF+R KSDPNDYSQKNSYQLL+VTGI 
Sbjct: 361 AIIVANIKLVYLKRSVVENFLEDEECFEAKMMGSFVRAKSDPNDYSQKNSYQLLRVTGIK 420

Query: 421 IDSS--NTGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMELY 480
           +DSS  NTGKQ ILLQV  RLDYIPIYNLSDDDF EEECEDL QRM+NGLL  PTV+ELY
Sbjct: 421 MDSSRSNTGKQGILLQVANRLDYIPIYNLSDDDFLEEECEDLHQRMRNGLLGKPTVVELY 480

Query: 481 EKAKSLHEDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHE 540
           EKAKSLHEDITKHWIT+ELARLQTCIDHANEKGWRRELFE+MEKR+LLQK SEQARLIHE
Sbjct: 481 EKAKSLHEDITKHWITKELARLQTCIDHANEKGWRRELFEFMEKRILLQKPSEQARLIHE 540

Query: 541 LPEVIADILEPTFDDLLKQNEQENHMLVDGSDVRKVATAAMVEECLIGMQTISEKQQHFE 600
           LP+VI DI EPTF+DLL+++E+ NH+LVD SD RKVAT A VEECLIG   ISEKQQHF+
Sbjct: 541 LPKVIPDIPEPTFEDLLEEDEEVNHVLVDRSDHRKVATVADVEECLIGEPNISEKQQHFK 600

Query: 601 VSTCKDFAQKSYISAVEFQTHEEQHQPLLPKEKACKGFATKSCIPAAEFQPHREQHQSIL 660
           VS+C+DFA++S ISA EFQ   EQHQ +LPKE  C                         
Sbjct: 601 VSSCEDFAKESCISATEFQADGEQHQSILPKENVC------------------------- 660

Query: 661 PKKHAYSKPLLSSIKRQSEYINIQKSKFKSKRASEVELIELSDNE----DLKAEDKMQTS 720
                 SK L SS     E I IQ+SK K+K A+EV+LIELSD++    DLK  +K +  
Sbjct: 661 ------SKTLPSSNNIPIESIKIQESKSKNKIATEVQLIELSDDDNEDGDLKVAEKKRNL 720

Query: 721 ENPNFSLWYCASPQGETRGPLPLSLLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRL 773
           ENPNFS+WYC SPQGETRGPLP+SLLKQWRD S+FELKCKVWK+ QSSQE + LSDAIRL
Sbjct: 721 ENPNFSMWYCTSPQGETRGPLPMSLLKQWRDSSAFELKCKVWKSDQSSQEAMLLSDAIRL 753

BLAST of Cp4.1LG01g08680 vs. ExPASy TrEMBL
Match: A0A6J1C4Q9 (uncharacterized protein At5g08430-like isoform X1 OS=Momordica charantia OX=3673 GN=LOC111008459 PE=4 SV=1)

HSP 1 Score: 1070 bits (2767), Expect = 0.0
Identity = 570/838 (68.02%), Positives = 635/838 (75.78%), Query Frame = 0

Query: 1   MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW 60
           M KKKC  KEEIGEDFCF CKDGG +RFCDF+DCLKAYH +CVG+E+S VESEDRW C+W
Sbjct: 1   MGKKKCKTKEEIGEDFCFHCKDGGQIRFCDFRDCLKAYHADCVGKEESFVESEDRWICEW 60

Query: 61  HSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID 120
           H C  C KTSKFRCVCCP+AVCGRCI  +EFV VRG+RGFC+HCL+L LLIE+G+DVD D
Sbjct: 61  HLCQHCPKTSKFRCVCCPKAVCGRCISISEFVHVRGYRGFCSHCLKLALLIENGEDVDSD 120

Query: 121 GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNF---RNEIEESEED 180
           GTK+DFND ETYEFLFKEYWELMK K+GLTA+ V  ASNLL  G       NEIEESEED
Sbjct: 121 GTKIDFNDSETYEFLFKEYWELMKVKEGLTAKDVRTASNLLMTGSRSDFNSNEIEESEED 180

Query: 181 TDEYEIPSDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLS 240
           TDEYEI SDYEE VDTEEGHKLVRK KRSKEKL T  KKMKSS+++FIGWGSKP+I+FLS
Sbjct: 181 TDEYEISSDYEEQVDTEEGHKLVRKGKRSKEKLGTM-KKMKSSNKEFIGWGSKPIIDFLS 240

Query: 241 TIGKDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKH 300
            IGKDT +KLSQ DVTSII  YCKENKLFHP KKKKI+CDAKL+AVFGRK++N+ +V+  
Sbjct: 241 KIGKDTSQKLSQDDVTSIIIAYCKENKLFHPQKKKKIVCDAKLRAVFGRKAINMISVYNQ 300

Query: 301 LTAHFAENMEQSSDDESTSSIEEKDDNSSMACNKPRKLISDRKPAELELSDVSHTCSAAI 360
           LTAHFAENMEQ SDDESTSSIEEKDD SSMAC +PRKL+ DRKPAE E S VSH CSAAI
Sbjct: 301 LTAHFAENMEQPSDDESTSSIEEKDDTSSMACKRPRKLVLDRKPAEQEPSHVSHNCSAAI 360

Query: 361 ISANIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIMID 420
           I+ N+KLVYLK+SLVERLLEN ECFEGKM+GSFIR KSDPNDYSQKNSYQLLQVTGI   
Sbjct: 361 IAENVKLVYLKKSLVERLLENHECFEGKMMGSFIRAKSDPNDYSQKNSYQLLQVTGIKTY 420

Query: 421 SSNTGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMELYEKAK 480
           SSNT KQ+ILLQVT RLDYIPI NLSDDDFCEEEC+DL QR++NGLLK PTV ELYEKAK
Sbjct: 421 SSNTEKQKILLQVTNRLDYIPINNLSDDDFCEEECKDLLQRVRNGLLKKPTVAELYEKAK 480

Query: 481 SLHEDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEV 540
           SLHEDITKHWIT EL RLQTCIDHANEKG RRELFEYMEKRLLLQKSSEQARLI+ELP+V
Sbjct: 481 SLHEDITKHWITRELTRLQTCIDHANEKGKRRELFEYMEKRLLLQKSSEQARLINELPKV 540

Query: 541 IADILEPTFDDLLKQNEQENHM-------------------------------------- 600
           IADI EPTFDDLL+++EQ +H                                       
Sbjct: 541 IADIPEPTFDDLLERDEQVSHKHEAQPPIWGLQHKALVDTRDDGKDVTGRSLFSTFDDLL 600

Query: 601 -----------------------LVDGSDVRKVATAAMVEECLIGMQTISEKQQHFEVST 660
                                  +VD  D RK   A  VEEC +G+ TISEKQQHF+V T
Sbjct: 601 ERDEQVSHKHEGQLPIWGLQHKAVVDTRDDRKDVRAVEVEECQVGVPTISEKQQHFDVPT 660

Query: 661 CKDFAQKSYISAVEFQTHEEQHQPLLPKEKACKGFATKSCIPAAEFQPHREQHQSILPKK 720
           CKDFA+KS                               CI AA+ Q H+EQHQSILPK+
Sbjct: 661 CKDFAKKS-------------------------------CISAAKSQTHQEQHQSILPKE 720

Query: 721 HAYSKPLLSSIKRQSEYINIQKSKFKSKRASEVELIELSDNED-LKAEDKMQTSENPNFS 773
           H  S+ L+S   +Q E   IQ+SK KS+  SEV+LIELSD++  L+ EDK Q SENPN  
Sbjct: 721 HPCSETLVSCTSKQDEATVIQESKLKSEGPSEVQLIELSDDDGHLRVEDKKQNSENPNCP 780

BLAST of Cp4.1LG01g08680 vs. ExPASy TrEMBL
Match: A0A6J1C6N4 (uncharacterized protein At5g08430-like isoform X2 OS=Momordica charantia OX=3673 GN=LOC111008459 PE=4 SV=1)

HSP 1 Score: 1022 bits (2643), Expect = 0.0
Identity = 555/838 (66.23%), Positives = 618/838 (73.75%), Query Frame = 0

Query: 1   MVKKKCTRKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDW 60
           M KKKC  KEEIGEDFCF CKDGG +RFCDF+DCLKAYH +CVG+E+S VESEDRW C  
Sbjct: 1   MGKKKCKTKEEIGEDFCFHCKDGGQIRFCDFRDCLKAYHADCVGKEESFVESEDRWIC-- 60

Query: 61  HSCFLCHKTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDID 120
                              AVCGRCI  +EFV VRG+RGFC+HCL+L LLIE+G+DVD D
Sbjct: 61  -------------------AVCGRCISISEFVHVRGYRGFCSHCLKLALLIENGEDVDSD 120

Query: 121 GTKVDFNDRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNF---RNEIEESEED 180
           GTK+DFND ETYEFLFKEYWELMK K+GLTA+ V  ASNLL  G       NEIEESEED
Sbjct: 121 GTKIDFNDSETYEFLFKEYWELMKVKEGLTAKDVRTASNLLMTGSRSDFNSNEIEESEED 180

Query: 181 TDEYEIPSDYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLS 240
           TDEYEI SDYEE VDTEEGHKLVRK KRSKEKL T  KKMKSS+++FIGWGSKP+I+FLS
Sbjct: 181 TDEYEISSDYEEQVDTEEGHKLVRKGKRSKEKLGTM-KKMKSSNKEFIGWGSKPIIDFLS 240

Query: 241 TIGKDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKH 300
            IGKDT +KLSQ DVTSII  YCKENKLFHP KKKKI+CDAKL+AVFGRK++N+ +V+  
Sbjct: 241 KIGKDTSQKLSQDDVTSIIIAYCKENKLFHPQKKKKIVCDAKLRAVFGRKAINMISVYNQ 300

Query: 301 LTAHFAENMEQSSDDESTSSIEEKDDNSSMACNKPRKLISDRKPAELELSDVSHTCSAAI 360
           LTAHFAENMEQ SDDESTSSIEEKDD SSMAC +PRKL+ DRKPAE E S VSH CSAAI
Sbjct: 301 LTAHFAENMEQPSDDESTSSIEEKDDTSSMACKRPRKLVLDRKPAEQEPSHVSHNCSAAI 360

Query: 361 ISANIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIMID 420
           I+ N+KLVYLK+SLVERLLEN ECFEGKM+GSFIR KSDPNDYSQKNSYQLLQVTGI   
Sbjct: 361 IAENVKLVYLKKSLVERLLENHECFEGKMMGSFIRAKSDPNDYSQKNSYQLLQVTGIKTY 420

Query: 421 SSNTGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVMELYEKAK 480
           SSNT KQ+ILLQVT RLDYIPI NLSDDDFCEEEC+DL QR++NGLLK PTV ELYEKAK
Sbjct: 421 SSNTEKQKILLQVTNRLDYIPINNLSDDDFCEEECKDLLQRVRNGLLKKPTVAELYEKAK 480

Query: 481 SLHEDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEV 540
           SLHEDITKHWIT EL RLQTCIDHANEKG RRELFEYMEKRLLLQKSSEQARLI+ELP+V
Sbjct: 481 SLHEDITKHWITRELTRLQTCIDHANEKGKRRELFEYMEKRLLLQKSSEQARLINELPKV 540

Query: 541 IADILEPTFDDLLKQNEQENHM-------------------------------------- 600
           IADI EPTFDDLL+++EQ +H                                       
Sbjct: 541 IADIPEPTFDDLLERDEQVSHKHEAQPPIWGLQHKALVDTRDDGKDVTGRSLFSTFDDLL 600

Query: 601 -----------------------LVDGSDVRKVATAAMVEECLIGMQTISEKQQHFEVST 660
                                  +VD  D RK   A  VEEC +G+ TISEKQQHF+V T
Sbjct: 601 ERDEQVSHKHEGQLPIWGLQHKAVVDTRDDRKDVRAVEVEECQVGVPTISEKQQHFDVPT 660

Query: 661 CKDFAQKSYISAVEFQTHEEQHQPLLPKEKACKGFATKSCIPAAEFQPHREQHQSILPKK 720
           CKDFA+KS                               CI AA+ Q H+EQHQSILPK+
Sbjct: 661 CKDFAKKS-------------------------------CISAAKSQTHQEQHQSILPKE 720

Query: 721 HAYSKPLLSSIKRQSEYINIQKSKFKSKRASEVELIELSDNED-LKAEDKMQTSENPNFS 773
           H  S+ L+S   +Q E   IQ+SK KS+  SEV+LIELSD++  L+ EDK Q SENPN  
Sbjct: 721 HPCSETLVSCTSKQDEATVIQESKLKSEGPSEVQLIELSDDDGHLRVEDKKQNSENPNCP 780

BLAST of Cp4.1LG01g08680 vs. TAIR 10
Match: AT5G63700.1 (zinc ion binding; DNA binding)

HSP 1 Score: 404.1 bits (1037), Expect = 2.6e-112
Identity = 229/567 (40.39%), Positives = 339/567 (59.79%), Query Frame = 0

Query: 14  EDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDWHSCFLCHKTSKFR 73
           ED+CFICKDGG L  CDFKDC K YH  CV ++ S+ ++ D + C WHSC+LC KT K  
Sbjct: 22  EDWCFICKDGGNLMLCDFKDCPKVYHESCVEKDSSASKNGDSYICMWHSCYLCKKTPKLC 81

Query: 74  CVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDIDGTKVDFNDRETYE 133
           C+CC  AVC  C+ +AEF++++G +G CN C +    +E+ ++ D  G K+D  DR T+E
Sbjct: 82  CLCCSHAVCEGCVTHAEFIQLKGDKGLCNQCQEYVFALEEIQEYDAAGDKLDLTDRNTFE 141

Query: 134 FLFKEYWELMKKKQGLTAELVH--MASNLLKKG--RNFRNE---------IEESEEDTDE 193
            LF EYWE+ KK++GLT + V    AS   KKG    ++++           +S++  D+
Sbjct: 142 CLFLEYWEIAKKQEGLTFDDVRKVCASKPQKKGVKSKYKDDPKFSLGDVHTSKSQKKGDK 201

Query: 194 YEIPSDYE----ELVDTEEGHKLVRKCKRSKEKLCTT----------RKKMKSSDQKFIG 253
            +   D +    +   ++ G K V+   +   K   +          +K  K+   +FI 
Sbjct: 202 LKNKDDPKFALGDAHTSKSGKKGVKLKNKDDPKFLVSDHAVEDAVDYKKVGKNKRMEFIR 261

Query: 254 WGSKPVIEFLSTIGKDTRKKLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGR 313
           WGSKP+I+FL++IG+DTR+ +SQH V S+I  Y +E  L    KKKK+ CD KL ++F +
Sbjct: 262 WGSKPLIDFLTSIGEDTREAMSQHSVESVIRRYIREKNLLDREKKKKVHCDEKLYSIFRK 321

Query: 314 KSMNVNTVHKHLTAHFAENMEQ---------SSDDESTSSIEEKDDNSSMACNKPRKLIS 373
           KS+N   ++  L  H  EN++Q            +++     EK+D   M C K +   S
Sbjct: 322 KSINQKRIYTLLNTHLKENLDQVEYFTPLELGFIEKNEKRFSEKNDKVMMPCKKQKTESS 381

Query: 374 DRKPAELELSDVSHTCSAAIISA-NIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSD 433
           D +  E E+         A I+A N+KLVYL++SLV  LL+  + F  K++GSF++VK+ 
Sbjct: 382 DDEICEKEVQPEMRATGFATINADNLKLVYLRKSLVLELLKQNDSFVDKVVGSFVKVKNG 441

Query: 434 PNDYSQKNSYQLLQVTGIMIDSSNTGKQEILLQVTYRLDYIPIYNLSDDDFCEEECEDLR 493
           P D+    +YQ+LQVTG  I +++   + +LL V+     + I  L D D  EEE +DL+
Sbjct: 442 PRDFM---AYQILQVTG--IKNADDQSEGVLLHVSGMASGVSISKLDDSDIREEEIKDLK 501

Query: 494 QRMKNGLLKNPTVMELYEKAKSLHEDITKHWITEELARLQTCIDHANEKGWRRELFEYME 544
           Q++ NGLL+  TV+E+ +KAK+LH DITKHWI  +L  LQ  I+ ANEKGWRREL EY+E
Sbjct: 502 QKVMNGLLRQTTVVEMEQKAKALHYDITKHWIARQLNILQKRINCANEKGWRRELEEYLE 561

BLAST of Cp4.1LG01g08680 vs. TAIR 10
Match: AT5G08430.1 (SWIB/MDM2 domain; Plus-3; GYF)

HSP 1 Score: 307.4 bits (786), Expect = 3.3e-83
Identity = 204/576 (35.42%), Positives = 303/576 (52.60%), Query Frame = 0

Query: 214 RKKMKSSDQKFIGWGSKPVIEFLSTIGKDTRKKLSQHDVTSIITTYCKENKLFHPLKKKK 273
           ++K +    +F+GWGS+ +IEFL ++GKDT + +S++DV+  I  Y  +  L  P  KKK
Sbjct: 19  KRKARPKRFEFVGWGSRQLIEFLHSLGKDTSEMISRYDVSDTIAKYISKEGLLDPSNKKK 78

Query: 274 IICDAKLQAVFGRKSMNVNTVHKHLTAHFAENMEQSSDDESTSSIEEKDDNSSMACNKPR 333
           ++CD +L  +FG +++    V+  L  H+ EN + S  D       +   +S     +  
Sbjct: 79  VVCDKRLVLLFGTRTIFRMKVYDLLEKHYKENQDDSDFDFLYEDEPQIICHSEKIAKRTS 138

Query: 334 KLISDRKPAELELSDVSHTCSAAIISANIKLVYLKRSLVERLLENPECFEGKMIGSFIRV 393
           K++  +KP             AAI+S NIKL+YL++SLV+ LL++P+ FEGKM+GSF+R+
Sbjct: 139 KVV--KKP---------RGTFAAIVSDNIKLLYLRKSLVQELLKSPDTFEGKMLGSFVRI 198

Query: 394 KSDPNDYSQKNSYQLLQVTGIMIDSSNTGKQEILLQVTYRLDYIPIYNLSDDDFCEEECE 453
           KSDPNDY QK  YQL+QVTG+       G  + LLQVT  +  + I  LSDD+F +EECE
Sbjct: 199 KSDPNDYLQKYPYQLVQVTGV---KKEHGTDDFLLQVTNYVKDVSISVLSDDNFSQEECE 258

Query: 454 DLRQRMKNGLLKNPTVMELYEKAKSLHEDITKHWITEELARLQTCIDHANEKGWRRELFE 513
           DL QR+KNGLLK PT++E+ EKAK LH+D TKHW+  E+  L+  ID ANEKGWRREL E
Sbjct: 259 DLHQRIKNGLLKKPTIVEMEEKAKKLHKDQTKHWLGREIELLKRLIDRANEKGWRRELSE 318

Query: 514 YMEKRLLLQKSSEQARLIHELPEVIADILEPTFDDLLKQNEQENHMLVDGSDVRKVATAA 573
           Y++KR LLQ   EQARL+ E+PEVI        ++L++  E  +       + ++++ + 
Sbjct: 319 YLDKRELLQNPDEQARLLREVPEVIG-------EELVQNPEVSSPEAHKSDNEQRLSESP 378

Query: 574 M--VEECLIGMQTISEKQQHFEVSTCKDFAQKSYISAVEFQTHEEQHQPLLPKEKAC--- 633
           +  + E          + Q F            Y+ +    T         P   +C   
Sbjct: 379 LSCIHETPEARNLFGGEDQQF---------NNGYVMSNPITT---------PGITSCATE 438

Query: 634 --KGFATKSCIPAAEFQPHREQHQ---SILPKKHAYSKPLLSSIKRQSEYINIQKSKFKS 693
             KG  T      AE+  H +  Q    I+  +    +  +S ++      N+       
Sbjct: 439 INKGLPTWIASAGAEYL-HVDVEQPANGIIGGETPTEESKVSQLQSSIPVNNVDNGSQVQ 498

Query: 694 KRASEVELIELSDNE----------DLKAEDKMQTSENPNFSLWYCASPQGETRGPLPLS 753
              SEV  IELSD++          D K ED    S +     W    PQG  +GP  L+
Sbjct: 499 PNPSEV--IELSDDDEDDNGDGETLDPKVEDVRVLSYDKEKLNWLYKDPQGLVQGPFSLT 552

Query: 754 LLKQWRDRSSFELKCKVWKNGQSSQEGIPLSDAIRL 770
            LK W D   F  + +VW  G+S +  + L+D +RL
Sbjct: 559 QLKAWSDAEYFTKQFRVWMTGESMESAVLLTDVLRL 552

BLAST of Cp4.1LG01g08680 vs. TAIR 10
Match: AT2G16485.1 (nucleic acid binding; zinc ion binding; DNA binding)

HSP 1 Score: 261.5 bits (667), Expect = 2.1e-69
Identity = 215/805 (26.71%), Positives = 379/805 (47.08%), Query Frame = 0

Query: 14   EDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDWHSCFLCHKTSKFR 73
            ED CF+C DGG L  CD + C KAYHP CV R+++  +++ +W C WH C  C KT+ + 
Sbjct: 599  EDVCFMCFDGGDLVLCDRRGCTKAYHPSCVDRDEAFFQTKGKWNCGWHLCSKCEKTATYL 658

Query: 74   CVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDIDGTKVDFNDRETYE 133
            C  C  ++C  C  +A F  +RG +G C  C++   LIE  K  + +  ++DFND+ ++E
Sbjct: 659  CYTCMFSLCKGCAKDAVFFCIRGNKGLCETCMETVKLIE-RKQQEKEPAQLDFNDKTSWE 718

Query: 134  FLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFRNEIEESEEDTDEYEIPSDYEELVDT 193
            +LFK+YW  +K +  L+ E +  A   LK      +E   S++ T      +DY     +
Sbjct: 719  YLFKDYWIDLKTQLSLSPEELDQAKRPLK-----GHETNASKQGTAS---ETDYVTDGGS 778

Query: 194  EEGHKLVRKCKRSKEKLCTTRKKMKSSDQKF----IGWGSKPVIEFLSTIGKDTRKKLSQ 253
            +      ++  RS+ K  +  K + S D+      + W SK +++ +  + +  R  L  
Sbjct: 779  DSDSSPKKRKTRSRSKSGSAEKILSSGDKNLSDETMEWASKELLDLVVHMRRGDRSFLPM 838

Query: 254  HDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMNVNTVHKHLTAHFAENMEQS 313
             +V +++  Y K   L  P +K ++ICD++LQ +FG+  +    +   L +HF +  +  
Sbjct: 839  LEVQTLLLAYIKRYNLRDPRRKSQVICDSRLQNLFGKSHVGHFEMLNLLDSHFLKKEQNQ 898

Query: 314  SDDESTSSIEEKDDNS-----------SMACNKPRKLISD--RKPAELELSDVSHTCSAA 373
            +DD     ++ ++ N                +K RK      RK  +  L D      AA
Sbjct: 899  ADDIQGDIVDTEEPNHVDVDENLDHPVKSGKDKKRKTRKKNVRKGRQSNLDDF-----AA 958

Query: 374  IISANIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGI-- 433
            +   NI L+YL+RSLVE LLE+   FE K+  +F+R++   N   +++ Y+L+QV G   
Sbjct: 959  VDMHNINLIYLRRSLVEDLLEDSTAFEEKVASAFVRLRISGN--QKQDLYRLVQVVGTSK 1018

Query: 434  MIDSSNTGKQ--EILLQVTY--RLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVM 493
              +    GK+  + +L++    + + I I  +S+ DF E+EC+ L+Q +K GL+   TV 
Sbjct: 1019 APEPYKVGKKTTDYVLEILNLDKTEVISIDIISNQDFTEDECKRLKQSIKCGLINRLTVG 1078

Query: 494  ELYEKAKSLHEDITKHWITEELARLQTCIDHANEKGWRRE---------------LFEYM 553
            ++ EKA +L E   K+ +  E+ R     D A++ G R+E               L E +
Sbjct: 1079 DIQEKAIALQEVRVKNLLEAEILRFSHLRDRASDMGRRKEYPYLLKLSNSLTMLTLRECV 1138

Query: 554  EKRLLLQKSSEQARLIHELPEVIADILEPTFD-DLLKQNEQENHMLVDGSDVRKVATAAM 613
            EK  LL+   E+ R + E+PE+ AD   P  D D   ++E E         +R  +++  
Sbjct: 1139 EKLQLLKSPEERQRRLEEIPEIHAD---PKMDPDCESEDEDEKEEKEKEKQLRPRSSSFN 1198

Query: 614  VEECLIGMQTISEKQQHFEVSTCKDFAQKSYISAVEFQTHEEQHQPLLPKEKACKGFATK 673
                  G   IS ++  F              S+ E  T    +       +  + ++ +
Sbjct: 1199 RR----GRDPISPRKGGF--------------SSNESWTGTSNYSNTSANRELSRSYSGR 1258

Query: 674  SCIPAAEFQPHREQHQS---ILPKKHAYSKPLLSSIKRQSEYINIQKSKFKSKRA-SEVE 733
                  ++    +   S       +    +P L S K +S  ++I ++  +S RA +  E
Sbjct: 1259 GSTGRGDYLGSSDDKVSDSMWTSAREREVQPSLGSEKPRS--VSIPETPARSSRAIAPPE 1318

Query: 734  LIELSDNEDLKAEDKMQT----SENPNFSLWYCASPQGETRGPLPLSLLKQWRDRSSFEL 772
            L     +E   A   + +      N +  +W+   P G+ +GP  ++ L++W +   F  
Sbjct: 1319 LSPRIASEISMAPPAVVSQPVPKSNDSEKIWHYKDPSGKVQGPFSMAQLRKWNNTGYFPA 1364

BLAST of Cp4.1LG01g08680 vs. TAIR 10
Match: AT3G51120.1 (DNA binding;zinc ion binding;nucleic acid binding;nucleic acid binding)

HSP 1 Score: 241.1 bits (614), Expect = 2.9e-63
Identity = 170/573 (29.67%), Positives = 294/573 (51.31%), Query Frame = 0

Query: 8   RKEEIGEDFCFICKDGGLLRFCDFKDCLKAYHPECVGREDSSVESEDRWTCDWHSCFLCH 67
           +KE+  ED CFIC DGG L  CD ++C KAYHP C+ R+++   +  +W C WH C  C 
Sbjct: 104 KKEDKEEDVCFICFDGGDLVLCDRRNCPKAYHPACIKRDEAFFRTTAKWNCGWHICGTCQ 163

Query: 68  KTSKFRCVCCPQAVCGRCIFNAEFVRVRGWRGFCNHCLQLTLLIEDGKDVDIDGTKVDFN 127
           K S + C  C  +VC RCI +A++V VRG  G C  C++  +LIE+    D +  KVDF+
Sbjct: 164 KASSYMCYTCTFSVCKRCIKDADYVIVRGNMGLCGTCIKPIMLIENIAQGDNEAVKVDFD 223

Query: 128 DRETYEFLFKEYWELMKKKQGLTAELVHMASNLLKKGRNFRNEIEESEEDTDE--YEIPS 187
           D+ ++E+LFK YW  +K++  LT + +  A+N  K+  N   ++E   + T+    ++  
Sbjct: 224 DKLSWEYLFKVYWLCLKEELSLTVDELTRANNPWKEVPNTAPKVESQNDHTNNRALDVAV 283

Query: 188 DYEELVDTEEGHKLVRKCKRSKEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIGKDTRK 247
           +  +   T +   L  K           +    +S      W +K ++EF+S +      
Sbjct: 284 NGTKRRRTSDSPTLPNKLDGKNPSNILKKAPGDTS------WATKELLEFVSFMKNGDTS 343

Query: 248 KLSQHDVTSIITTYCKENKLFHPLKKKKIICDAKLQAVFGRKSMN----VNTVHKHLTAH 307
            LSQ DV  ++  Y K+  L  PL+K +++CD  L  +FG++ +     +  +  H+   
Sbjct: 344 VLSQFDVQGLLLDYIKKKNLRDPLQKSQVLCDQMLVKLFGKQRVGHFEMLKLLESHVLIQ 403

Query: 308 FAENMEQSSDDEST----SSIEEKDDNSSMACNKPRKLISDRKPAELELSDVSHTCSAAI 367
                 ++++ E+T    S IEE   +  M  ++ RK+   R+  +  + + +    AAI
Sbjct: 404 EKPKGAKTTNGETTHAVPSQIEEDSVHDPMVRDRRRKM---RRKTDGRVQNENLDAYAAI 463

Query: 368 ISANIKLVYLKRSLVERLLENPECFEGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGI--M 427
              NI L+YL+R  +E LL++    + K++G+ +R+K   +D  + + ++L+QV G    
Sbjct: 464 DVHNINLIYLRRKFLESLLDDINKVDEKVVGTILRIKVSGSD-QKLDIHRLVQVVGTSKA 523

Query: 428 IDSSNTGKQ--EILLQVTY--RLDYIPIYNLSDDDFCEEECEDLRQRMKNGLLKNPTVME 487
           I S   G +  +++L++    + + I I  LSD +  E+EC+ LRQ +K GL K  TV++
Sbjct: 524 IASYQLGAKTTDVMLEILNLDKREVISIDQLSDQNITEDECKRLRQSIKCGLNKRLTVVD 583

Query: 488 LYEKAKSLHEDITKHWITEELARLQTCIDHANEKGWRRELFEYMEKRLLLQKSSEQARLI 547
           + + A +L        +  E+ +L    D A             +K  LL+   E+ RL+
Sbjct: 584 ILKTAATLQAMRINEALEAEILKLNHLRDRA-------------KKLELLKSPEERQRLL 643

Query: 548 HELPEVIAD-ILEPTF----DDLLKQNEQENHM 560
            E+PEV  D  ++P+     D  L   +Q+NH+
Sbjct: 644 QEVPEVHTDPSMDPSHALSEDAGLGTRKQDNHV 653

BLAST of Cp4.1LG01g08680 vs. TAIR 10
Match: AT5G23480.1 (SWIB/MDM2 domain;Plus-3;GYF )

HSP 1 Score: 206.5 bits (524), Expect = 8.0e-53
Identity = 178/625 (28.48%), Positives = 268/625 (42.88%), Query Frame = 0

Query: 207 KEKLCTTRKKMKSSDQKFIGWGSKPVIEFLSTIGKDTRKKLSQHDVTSIITTYCKENKLF 266
           K K  + ++  K    +F+GWGS+ +IEFL ++G+DT  K+S++DVT+II  Y +E    
Sbjct: 7   KVKGSSKKRLRKPKSLEFVGWGSRNLIEFLESLGRDTTNKISENDVTAIIMNYIREKSRE 66

Query: 267 HPLKKKK----IICDAKLQAVFGRKSMNVNTVHKHLTAHFAENMEQSSDDESTSSIEEKD 326
            PLK KK    + CD KL+ +FG   +NV  V   +  H+ EN E   +D     +   +
Sbjct: 67  TPLKSKKRRKTVACDEKLRLLFGAGKINVIKVPDLVEKHYVENQE---EDLFYDDLYASE 126

Query: 327 DNSSMACNKPRKLISDRKPAELELSDVSHTCSAAIISANIKLVYLKRSLVERLLENPECF 386
           D+     +   K+    K    ++        AAI+   +KL+YL++SLV+ L + PE F
Sbjct: 127 DDKQQRLSLSDKVAKQTK----QVVSKPRGTFAAIVRDTVKLLYLRKSLVQELAKTPETF 186

Query: 387 EGKMIGSFIRVKSDPNDYSQKNSYQLLQVTGIMIDSSNTGKQEILLQVTYRLDYIPIYNL 446
           E K++ +F+R+         KN  QL+ VTG+       G    +   +Y L  +   +L
Sbjct: 187 ESKVVRTFVRI---------KNPCQLVHVTGVKEGDPIDGNLFQVTNYSYYLKDVTTSSL 246

Query: 447 SDDDFCEEECEDLRQRMKNGLLKNPTVMELYEKAKSLHEDITKHWITEELARLQTCIDHA 506
           SDDDF +EECE+L QR+ NG  K  TV+++ EKA+SLHED                    
Sbjct: 247 SDDDFSQEECEELHQRINNGFAKRLTVVDMEEKARSLHED-------------------- 306

Query: 507 NEKGWRRELFEYMEKRLLLQKSSEQARLIHELPEVIADILEPTFDD-------------- 566
                      Y+EKR LLQ   EQ RL+ E+PE++A+ LEP  +D              
Sbjct: 307 -----------YLEKRELLQNPDEQKRLVDEVPEIVAEELEPECEDDDDDRTIEDSLIVP 366

Query: 567 ------------------------------LLKQNEQENHMLVDGSDV--RKVA------ 626
                                         LLK  E++  +L D  DV   K+       
Sbjct: 367 NPEAHQSDKEQRQRDLPVSSSVKKSQENSLLLKNPEEQLRLLCDVPDVVAEKLEPEFVDD 426

Query: 627 TAAMVEECLI-GMQTISEKQQHFEVSTCKDFAQKSYISAVEFQ-----------THEEQH 686
              +V +  +   +  +E  Q  E     D    S    +E Q            HE+ +
Sbjct: 427 DGKLVNDATVPNPEAFTEAHQSDEEIQPSDLPDSSIQKTLEDQPIWTASAGNKDLHEDVY 486

Query: 687 QPLLPKEKACKGFATKS-CIPAAEFQPHREQHQSILPKKHAYSKPLLSSIKRQSEYINIQ 746
           +P         G    +  I   E      QHQS  P                     I 
Sbjct: 487 EP------PANGITLNTDSITEGEMNTKVSQHQSSTPV--------------------ID 546

Query: 747 KSKFKSKRASEVELIELSDNEDLKAEDK---MQTSENPNFSLWYCASPQGETRGPLPLSL 758
            S      ++ +E+IELSD++D   +DK      S +P   +W+   P+G+T GP  L+ 
Sbjct: 547 LSNKTQAHSNPIEIIELSDDDDDDEKDKNDQAYQSYDPKKVMWFYEYPKGKTHGPFSLTD 558
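
The percentages in each HSP header follow from the counts beside them: identical (or positive-scoring) positions divided by the alignment length. The sketch below checks that arithmetic for one of the TAIR 10 hits above. It assumes the header layout shown on this page; `parse_hsp_stats` is a hypothetical helper written for illustration, not part of any BLAST distribution.

```python
import re

# Hypothetical helper for the "Identity = a/n (p%), Positives = b/n (q%)" header line.
# The pattern also accepts the "Postives" spelling that some exports use.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract the counts and recompute both percentages from them."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "length": int(ident_len),
        # Recomputed values, rounded to two decimals like the page.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the header, kept for comparison.
        "reported": (float(ident_pct), float(pos_pct)),
    }


stats = parse_hsp_stats(
    "Identity = 215/805 (26.71%), Positives = 379/805 (47.08%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # 26.71 47.08
```

Comparing the recomputed values with the `reported` tuple is a quick way to spot a header damaged during extraction.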

The following BLAST results are available for this feature:
Match Name  E-value  Identity  Description
Q9FT92      4.7e-82  35.42     Uncharacterized protein At5g08430 OS=Arabidopsis thaliana OX=3702 GN=At5g08430 P...
Q9SIV5      2.9e-68  26.71     Zinc finger CCCH domain-containing protein 19 OS=Arabidopsis thaliana OX=3702 GN...
Q9SD34      4.1e-62  29.67     Zinc finger CCCH domain-containing protein 44 OS=Arabidopsis thaliana OX=3702 GN...
O96028      9.6e-11  37.14     Histone-lysine N-methyltransferase NSD2 OS=Homo sapiens OX=9606 GN=NSD2 PE=1 SV=...
Q8BVE8      9.6e-11  36.19     Histone-lysine N-methyltransferase NSD2 OS=Mus musculus OX=10090 GN=Nsd2 PE=1 SV...

Match Name      E-value  Identity  Description
XP_023539109.1  0.0      100.00    uncharacterized protein At5g08430-like [Cucurbita pepo subsp. pepo]
XP_022942556.1  0.0      98.71     uncharacterized protein At5g08430-like [Cucurbita moschata]
KAG7031352.1    0.0      89.23     hypothetical protein SDJN02_05392, partial [Cucurbita argyrosperma subsp. argyro...
XP_022984438.1  0.0      93.66     uncharacterized protein At5g08430-like [Cucurbita maxima]
XP_038905176.1  0.0      75.88     uncharacterized protein At5g08430-like isoform X2 [Benincasa hispida]

Match Name  E-value  Identity  Description
A0A6J1FP67  0.0      98.71     uncharacterized protein At5g08430-like OS=Cucurbita moschata OX=3662 GN=LOC11144...
A0A6J1JAH4  0.0      93.66     uncharacterized protein At5g08430-like OS=Cucurbita maxima OX=3661 GN=LOC1114827...
A0A1S3BSR8  0.0      72.58     uncharacterized protein At5g08430 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10349...
A0A6J1C4Q9  0.0      68.02     uncharacterized protein At5g08430-like isoform X1 OS=Momordica charantia OX=3673...
A0A6J1C6N4  0.0      66.23     uncharacterized protein At5g08430-like isoform X2 OS=Momordica charantia OX=3673...

Match Name   E-value   Identity  Description
AT5G63700.1  2.6e-112  40.39     zinc ion binding;DNA binding
AT5G08430.1  3.3e-83   35.42     SWIB/MDM2 domain;Plus-3;GYF
AT2G16485.1  2.1e-69   26.71     nucleic acid binding;zinc ion binding;DNA binding
AT3G51120.1  2.9e-63   29.67     DNA binding;zinc ion binding;nucleic acid binding;nucleic acid binding
AT5G23480.1  8.0e-53   28.48     SWIB/MDM2 domain;Plus-3;GYF

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                  Source       Source Term    Source Description                            Alignment
None       No IPR available                 COILS        Coil           Coil                                          coord: 157..177
None       No IPR available                 MOBIDB_LITE  mobidb-lite    disorder_prediction                           coord: 304..332
None       No IPR available                 PANTHER      PTHR46851      OS01G0884500 PROTEIN                          coord: 1..739
None       No IPR available                 PANTHER      PTHR46851:SF2  GYF DOMAIN PROTEIN                            coord: 1..739
None       No IPR available                 CDD          cd10567        SWIB-MDM2_like                                coord: 227..302; e-value: 8.52664E-15; score: 67.5685
None       No IPR available                 CDD          cd15568        PHD5_NSD                                      coord: 16..61; e-value: 3.61728E-16; score: 70.8212
IPR001965  Zinc finger, PHD-type            SMART        SM00249        PHD_3                                         coord: 16..62; e-value: 2.4E-7; score: 40.4
IPR004343  Plus-3 domain                    SMART        SM00719        rtf1                                          coord: 356..461; e-value: 1.8E-26; score: 103.9
IPR004343  Plus-3 domain                    PFAM         PF03126        Plus-3                                        coord: 361..460; e-value: 7.4E-14; score: 52.2
IPR004343  Plus-3 domain                    PROSITE      PS51360        PLUS3                                         coord: 356..484; score: 27.352474
IPR003121  SWIB/MDM2 domain                 PFAM         PF02201        SWIB                                          coord: 244..302; e-value: 3.4E-14; score: 52.5
IPR003121  SWIB/MDM2 domain                 PROSITE      PS51925        SWIB_MDM2                                     coord: 224..304; score: 14.855769
IPR035445  GYF-like domain superfamily      GENE3D       3.30.1490.40                                                 coord: 716..773; e-value: 3.4E-16; score: 60.6
IPR035445  GYF-like domain superfamily      SUPERFAMILY  55277          GYF domain                                    coord: 707..769
IPR013083  Zinc finger, RING/FYVE/PHD-type  GENE3D       3.30.40.10     Zinc/RING finger domain, C3HC4 (zinc finger)  coord: 2..107; e-value: 6.1E-19; score: 70.1
IPR036885  SWIB/MDM2 domain superfamily     GENE3D       1.10.245.10    SWIB/MDM2 domain                              coord: 213..313; e-value: 1.8E-30; score: 106.6
IPR036885  SWIB/MDM2 domain superfamily     SUPERFAMILY  47592          SWIB/MDM2 domain                              coord: 215..307
IPR036128  Plus3-like superfamily           GENE3D       3.90.70.200                                                  coord: 358..481; e-value: 9.1E-25; score: 89.0
IPR036128  Plus3-like superfamily           SUPERFAMILY  159042         Plus3-like                                    coord: 357..482
IPR003169  GYF domain                       PROSITE      PS50829        GYF                                           coord: 714..768; score: 8.651098
IPR011011  Zinc finger, FYVE/PHD-type       SUPERFAMILY  57903          FYVE/PHD zinc finger                          coord: 13..71
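
Several member databases report the same InterPro entry with slightly different coordinates, so the domain order along the protein is easier to read after collapsing those hits. The sketch below does this for the SMART, PFAM and PROSITE rows above that carry an InterPro term. It assumes that hits sharing an InterPro description overlap, which holds for this record, and takes their outer envelope; `domain_spans` is an illustrative name, not an InterProScan function.

```python
# (InterPro description, start, end) copied from the SMART, PFAM and PROSITE rows above.
hits = [
    ("Zinc finger, PHD-type", 16, 62),
    ("Plus-3 domain", 356, 461),
    ("Plus-3 domain", 361, 460),
    ("Plus-3 domain", 356, 484),
    ("SWIB/MDM2 domain", 244, 302),
    ("SWIB/MDM2 domain", 224, 304),
    ("GYF domain", 714, 768),
]


def domain_spans(hits):
    """Collapse hits per domain to one outer span, ordered by start position.

    Assumption: hits with the same description overlap, so the envelope
    (smallest start, largest end) describes a single domain.
    """
    spans = {}
    for name, start, end in hits:
        low, high = spans.get(name, (start, end))
        spans[name] = (min(low, start), max(high, end))
    return sorted(
        ((name, low, high) for name, (low, high) in spans.items()),
        key=lambda span: span[1],
    )


architecture = domain_spans(hits)
print(" > ".join(f"{name} ({low}..{high})" for name, low, high in architecture))
```

For this record the spans come out in the order PHD-type zinc finger, SWIB/MDM2, Plus-3, GYF, matching the domain composition listed for the AT5G08430.1 and AT5G23480.1 hits (SWIB/MDM2 domain;Plus-3;GYF).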

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g08680.1  Cp4.1LG01g08680.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003677      DNA binding
molecular_function  GO:0046872      metal ion binding
molecular_function  GO:0005515      protein binding