Cp4.1LG04g01060 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG04g01060
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: zinc finger CCCH domain-containing protein 19
Location: Cp4.1LG04: 924782 .. 935958 (+)
RNA-Seq Expression: Cp4.1LG04g01060
Synteny: Cp4.1LG04g01060
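
The genomic span covered by the Location field can be worked out from its two coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state explicitly); the function name `span_length` is purely illustrative.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field: Cp4.1LG04: 924782 .. 935958 (+)
print(span_length(924782, 935958))  # 11177
```

Under that assumption the gene spans 11177 bp on the plus strand, introns included.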
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GAGGAGCTGAGCGAAACTCAATTTTTTTGTGCAAACGCTCTGTTTGTACCGTAAAAAATATAGTCAGAGGGGGGGGAAGTGGATCTTAAAACCCTAGGGTTTCAATTCCGGGCCATCCTAATTTTTAATCCTAATTTTGTTTCTAAATGGAAGCCGAAGAGAACGATTCCTCCAAACATGACCAACCATCATCACCTCTTCTCTCCGTTGATGATGGCAACGACTTGGATGTTAAGTGTCACACCCATCGGGAGCTTCACAGCAATGAAGAACAGCATTGTTTGTTCCAGTCCGCCATTAATGAACTGGAGTTTCCATCCAATTCTAGCGTTGAATCTTTGCAGCCGAGCGATGCAATTCGTGGGGATGAGAGTCTTGTTGCAGAGACTTGTTTGGAGGTGGAGGAGACAGAGATAGCCGGGGTTAAGGCTTGTCGTAACGGTATTGAGGATATGGGGGAAGATTCAGTGAAGTTGGAGGTTGAACCGGATATTGCGGCGATGGGGTTATTGGGGGAAACGGTTTTCAATGATGTGAAAGAGGAGGATGCAGGGGCGGAGGAGGTGAAGGCTGTTGCTGAATTTGGAGAAGGTGACTTGTTATGTGAAATGGATTTGGTTGGTGGTGCTGAAAATCAGGTTGAGGGCAACGTTTTGATGGTGAATCTTCCGGACAATACTGTTGGCTGCGGTGAGACAGACACATGCTTGAGTGATGTTTTGGCTGAGCTTGCAGAAACGACTCCTTTTGTGCATGGTGTAGATACTACTGATGTAGCCAATTTGGTGGAGAGGAAAGAGGTGGAAGAAAATGCCGATGATCCCAAAGATTCGAAGGATATAGAGGTGGCAAAGCAAGAAACTTTTTCTATGGAGGATGGGAAATTAGGCGTCCCGGTGCAGCTTGTGGAGAAGTCCGAGTTGAAACAAAGTTTGGTAGATGGGGCTGTGGTGGAGGAAGGAAGGACGGAGAATTTAGCTGACAGAACTGGTGAAACTTTGAAGATGGAGAATGATTCAAGCAAAACTGATGAGGTGGGGCTGGCGAATTTTGCTGGTGAGATTGATGGGGCGGTAACTATGGAGAATACTGAAGACAAGACTGTTGAGGTGGATGGAATGTGTTTGGAAGACAAGGCTGCTGATGCGACGACGAAGACGACGACGGGAAATTTGGCAGATGAGACCCCGAAGATCAAGGGAGTGCACGTAACAGACGACAACATTGAAGTGTTGAAGATTGAAAACGTTGAAGATAGGGAAGCAGGGGTGCAAGGATTGGGTGTGGCTGATGAGAGTGCCGAGGTTGGAAAGATTGAGAATTTGGTAGATGAGACTGCAGAGGCTGAGAATGTGACAAACTACACAGCCGAATCAATGGAGAATCTGGACGACAAGACTGCACAATTGGAGGAAATAGCTATGGAAGAAGAGACTGAGGAAGCAGATGACAGGGTTTATTTGGTGGATGAAGGGATTGGATCAGAGGAGAATGATGCAAACATGACATACTTGGTGGGGGAAACAGAAGCAGCGGAGGAGGTTGAGGAGATGGATGTTACAGAGGAGGTTGATGAGGCAAGTAAAGGTAGCAGTGGGGCTAAAAGGAAGCGTGGAAAGAATTCTAAAGCTCCTGCTAGAGTTCCTTCTAGGAAGAAGGTGGAAGAAGACGTTTGCTTTATTTGTTTTGATGGGGGTGACCTTGTACTCTGTGATCGCAGGTTTGTTTTTTCAGTGGCATCCTAAACCATCTTTTTATTAAGATTTTATGTTGATCTTTTTTCTCTTGATTCCAGAGGCTGTCCCAAGGCGTACCACCCTTCCTGTATTAATCGTGATGAGGCGTTCTTCCGAGCCAAGGGTCGATGGAATTGTGGTAATGTGCTGTCTTATATTACATATTCTGAGTAATTCGTTTTTCCTACATGGATATTAGCCTTTTTTAGCGTGTGTTCCTGATGTTTTCTTGTTCTTTTGGACCAAGATAAGAATGTTTTTTGG
GGTGAAAAAGTTATTTCCACTTTCCTGATAAATTGCTTGTACCTTACTTTTATCCGGCCAGAATCTTTTATGATCTCATTCAATTCCGTTGTGTTACATGGTGAAACCGAAAGCCACTATTAGGTCGTTGTGGGTGGAAAGGAATTAAGGGGTTTTTTCTTTTGTTTTTGTAACGGTCATTGATAATTATCTTATTTGTTAGAGGGTTTTGTTTCTCGACCTTCATCGTGGTGTTCCTTGGGGATTTGTCGTTCTTTATTATTTTCTTTCTTTTCTGTAATGATTCCCTTTCACAAATTGAGTAACTTGTGAGTATTTTTTCTAAAATGTTTATGAAAAGGAAGTCTTTGCAACCAGACTTCTTGTCCTTTGCTTTTACAAAGATTTATGTGCAGATGGCTGCAGATTATTATCATGTAGATTTTAACATCCAAATGACGTGCACTACGGTGTTTTGGAATCAGGAATCATATCGATTCAGTTTTCCTCGTTTGTTGGTGTCGGTCTGGAGTTTCAGCTGCAGCCGCTTCCCTCTGAATAAAAAAAGGATGGAGAGTGAAGTTTCAGTCGCTCTACATTGGACTATATATGCTTTTTGAAAAACGTTGGAGGCTTAGTTGTTTATAAAATTGTACTGTGGAAGTGGAAATAGTTGTCAAATATTTCAATGATAGGACACAAGAAAATAAGTTTCATTAAGAATAATGGGACCAAAAATTAAGGCAATATGATAAACAGGCATGTAACAAACAAAAGGAGTCTCAGAAGAAGATAGTACAAGAAGAGACTCCAGTTGTTAAGAATTAAAAAGAAAGGANTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAATTTTATTGAAATTATCTAGAGGGTTGAAATGTGCAAAACTAAACCAGCTTCAAAGATGACCCTTTGTGCCTTTAGAGAACGTCATGTTTCTCGCAAACCAAAAATTTCAACGAGTACCAGCCGAAGGATTCTTGCTCTTATCTTAAAATGCGTGTTCAATATAACTTCATCGCTGAAAATAGTGCAATAGAATTGGGAAGTATTCTACTTGACTGAAAATTGTAGCAACGGATGTTAGGACGTGTTTGTCTATCTTAACCAATAACAATTCTCCTTATTCTCAGTATAATGAGTGAGTGGGGCAGGGGAAGGGGAGAGGTATCTGTTGGTGTGTAGGTAACTCTTCATGGATAAACTCTTTTCAATTCTACACTAAAATTATCTTAGAAACTTGAGCTATTGATTTAAAAAATAAATTAGTATAGGGGTTTACTTGGTAAAGGTAGCCATTTATGTTGGAAATCAAGAAGGCTGTAGCCTGTGGAGATGAAAGTCCTGCATTATTTTCTTTCAAAGAAGACAGTTGAGTTTTAACGCAGTGGATCATTTTTGTTTGTTATGATGACTTATTTTAAGATTATAGATGTTCTTCTCCATCTTTTTATAATCAACTTCCAATGTAATTGCCAAATTGGTGTGGTTTTTTTGTTAATCCCTTTTGGGAGGCTATAGGTTGAACACTGTTTTTTGATTATGCATGAAGTGAGCTTCAAGAGTAGACTTTTTACTAAACTTTTCCATCTTTAAGTTAATGATTAAACTTTTTTCTTATCAATGGAAGAGAAATAAAAATTTACTGAACAATGACACGCATATTGTACATATAAAGCGTTTCCTTTTCTTTTAAGCCTTCACTTTGTATGATTCTACGGTCATTCCCTCTGTTCTTATAGACTTGGAAATAATATGGCCAATAATTTCGTACTTTTTCCCCCTCCTCTTTGCCTACTTTGTGAACTTTTTTTGATTTATGTTCTCATTCAAGGTTTTCTTTTATATTGATTTTAGGTTGGCATCTTTGTAGTAACTGTGAGAAGACTGCCCACTACATGTGTTACACATGTACATTTTCCTTGTGCAAGGGCTGCATAAAAAATGCTGTTATTTTGTGTGTTAGAGGTAACAAAGGCTTCTGCGAGGCGTGCATGA
GATTTGTTATGTTGATTGAAAAGAATGAGCAGGGAAGTACAGAAAAGGTATGTGTTACTCTGTTAGTGCCAAAAATGTATGAATACTGGTCAGGTGTACTCAATTGCTAATAAGTTCATAATATTTCTGGCTGTCTCCATTTTGCAGCATTACTTTTTGTCTCATTATCATTTTGGACATTTAAAGTTTAGAGAAACCTTTTCTTTTGCAGCAAGTATACAAACAGTAAGAATTAAGAACAGCATAGCTTTACATAGGTAACTGGTTAGACAAACACGATATAGCGTTTGGGAGACAGAATCACCTAACTTTGCTGTCCTTGTTAGATTATGCATGAAATAGTCTGAAGTTCATCATAAATCAGCCATGAGTTTTCATTTTTTTACTATTCATTAAGCGTCATTCTTATTTCATTTCAATTCGAATGTGTTCCCTTCTTCAAATCATTATTTGTGTTGTTTCTGTTGCATTTCCTTGTACCCTGTTACCTTAGGAGTAGCAAGCCCATATCTTCATGAACTCCAAAGTCATTCCGTGAAATATGGCTGGAGCGTCAAGGACTTATATTTTGTAAAAGTGTTAGGGAGGGAGAAAACGATATATTAGAAATTCATGGACTAGATGAAGTATGGAAAAGGTATATGGTAATGGATTCAAGGAAATGTTTGGAAGGTGGGTCTTGCTTCTAATAGTCATTACTTCCATGCATCAAATTTTCCGCAGAACTATTTTTGATAGATCTGAGTGTCAGCTGGCACTTGAGTGAATTTAATTGCAGACGCATACAGGCATTGTTCATTAGTTGACGTGCAACCTTCATGGCTATGAAACATAGCTTTGTGGTGAACACCAATGAAGGATTACGAGGAGATGATGTGGTTGGTTGGTTTTTAGCTCGTTAGGAAATAAAGAGCTGGGGATGGAAGGTCAAATGCACATTTAGTAATCCCCAGTGTAGATGGAGAAAAGTTTTACACACACACACACACACACTCAATTTTGGTGAATTTTCAAAATTTTCAATGTACTTCCAAACTTCAACTTTTGTTGGTTAATCAGTAACGAGTACAAGGTTGTTCTTGTTTGAAACATTCGCATTATAAGATGCAACTTATTCATGTCATGTGCATTTCTGGTTGATTTCAATAGATGCAAAAGTTGGAGACATATTAGCAAGAAAAGGACTGTAGGCTAAGGGCTTCTTTGATTGCAAAATGATTGAACTGTGAAATTAATTGTTCGTTGTTCCTTTTTATGTTTAGATCTTAAGAGAAACTTTTATATCTATGTATGTGAAAGAATTTTCCAGTGATGTGAAAATCAGATTGCATACATTCCATCCCTACCAGGGCAGATAACCATAATTGTCTTGGCTGTACTTCTGCAAATCACTTTATTTTCTCACTTTCTATTTGTTATCACTTCTCACTCATGCATATAAACAGTAAAAGCTACGTTTTAATAAAGCAGATTTTCGAATGCAGTATCCTAGATGTGTCTTACTGCAGCAGGCCATTTTGCTTATTTGGTCATTCTTGTGCTTGTACAGGGCCAAATTGATTTTAATGACAAAACTAGCTGGGAATATCTTTTCAAGGAATACTGGACTGACCTGAAAGGAAGCCTTTCTCTAACTTTTGATGAACTTGTTCATGCAAAAAACCCATGGAAAGGATCTGAAACACTAAATAGCAGACCCGATTCACCTGGCGAGCTATATGATGGTAATGTTGATGGAGGATCAGATTTGGATGTTTCTGAAAATGAAGAATCTGGTAGTTCTAAGAAAAGAAAAGCTAAGAGAAGGTCAAAATCCCAAGCAAAGGAAACCAATTCCCCTAGTATGCCAATAATACCTGATTCTCAAGGGCCATCCACGGATAACAATGTTGAGTGGGCATCTAAAGAGCTCTTGGAGTTTGTTATGCACATGAAGAATGGTGATAGAACTGTTTTATCTCAGTTTGACGTGCAGGCTCTCTTATTAGAATATATTAAAA
GAAATAAGCTTCGTGATCCTCGTAGGAAAAGTCAAATCATATGTGATTCAAGACTTGAGAATTTGTTTGGAAAGCCACGTGTAGGACATTTTGAAATGTTGAAGCTCCTAGAGTCACATTTCCTCATCAGAGAAGATGTGCAGATAAATGACCTCCAAGGGAGTGTTGCCGATACTGAATCAAGTCAGTTGGAAGGTGATGGGTACACCGATGCGTCGGGAAAAACTAGGAAAGAAAAGAAACGCCGGATGCGGAAAAAAGGTGATCAGAGAGGATTGCAGTCCAACCTTGATGACTATGCAGCCATTGATATTCACAACATTAATTTAATCTACCTGAGACGTAATTTGGTGGAATATCTGATTGAAGACGAGGAGAGTTTTCATGAAAAGGTTGTTGGTTCTTTTGTGAGGATAAGAATATCAGGCAATGCACAAAAACAAGATTTATACCGGTTGGTTCAAGTTGTAGGTATGAGGAGTCGTGCCTTTACCATTTGCCTCTTATAAGAACTTGATAGAACTGTAGAAGTAACGTAGTTCTTAACATGTTATCAGGTACAAGCAAAGCGTCTGAGCCTTATAAAGTTGGTAAAAAGATGACAGATATCTTGCTAGAGATCTTGAATTTGAACAAGACAGAAGTGATTTCAATTGATATTATCTCGAATCAAGAGTTCACAGAGGTAGTTTTAATACAAATATGGTGCTTGAATATCTTCTTAAAACTGGATTACCAGCAAAACAAGTGTAGCTATGATCTCTTCGGTACATATATTTTTGTTTAATCATTTTTTTTTGGGTATTTTGCCTTCTTTTTTTCCGGTGACGGTGGTGCTCAGGAGGAGTGCAAGCGTCTCAGACAGAGCATTAAGTGTGGAATCCTTAACCGTCTGACTGTGGTAATTTTACCTTTCTTTTTTACTTATCTTCTATTTAATTATTTATTCAAGCTTCTCATAAACAATAATATTGAGTATAAGTTCAAAGTATGTTAAAAAATCAATTCTCCCTTGTAGGGTGACCTTCAGGAGAGAGCAATGTCGCTTCAAGATGCTAGAGTTAAGGATGTAAGTATTTGATGTTAACAGTTCTGGATTTTTTTTTATACATGCTTCTGGAAATTATTTGATACATCTGATCCTGGAAGAATTTGACATGGATGCATGTTTTGAAAGGAAATATTTTTGAAAGTTTCCTTTGTCCTTTAATTGGCTGACCTGTTCATGTCGGCCTTCGTGTTTGCTGCAGATAACTAGTAACATTGTTTGTCATTGTAATGTTCTATCATTATTTACCTGCTAGTCGATATCTCTTGTCTGGTCCTCCCCTTGTTTTATTATTAGAGGAAAAATAAGATTCATCTAACTCTTGAAACATTTGGAGGTAATGCTTGTACTGAGATGCCCTCTTTAAGTATCCGCTACTTTTTGTCGAACAATTAACGGCTTCATTTGGATAAGGCTCTTAACTTTTTTCCCTTTCGTTCTGTATAAGAACTTTTGCAACCAAGATTTTAAGTAGATATAGATCAACGTTGAGTCACCATTTCAAACAATAAAAAAGAGCATTTAAGAATTATTACGAGGAACTAACCTAAAGAAATTAACGTCTCACTGTGAGATCCCACCTCGGTTGAGGAGGAGGACGAAACACCCTTTATAAAGGTGTGGAAACCTCTCCCTAGCATACGCATTTTAAAAACCTCGAGCAAAAGCTCGGAAGAGAAAGTTCAAAGAAGACAATATCTGGTAGTGGTGCGCGCAACTGATCTAAAGAAGTTAGGAAGATTTAGAGGGGACTGTGATGTTAGATTGTTCATTTGCACTATGTTATGAAATAATCAGTGCCAACTATGATCAGCAACTCCCCAAATGTCGTCATTATGTAATGAACATTAGTTTCTTTCTCTTGGCATCTTTTCCTTGTTAAAGATTGGTTGCATTTAAGTAGTTTATATGTTTAAGGCAAACTATTTGTATTTTAGATACCACTTGTTTG
ACTTATAGTTTACTATTTCTTTGTATGTCTCCCTTCCAGTGGATGGAAACCGAGATAGTTCGACTGAGTCATCTTCGTGATCGAGCAAGTGAAAAAGGGCGCAGAAAAGAATATCCTTTTTACAACATTATGTTGAATTATGGCTGAATTTTATTGGAAAATTGATGTTTTTTGGGGTAAAACCACTGCCCTCTAGGTACTTTGAAACAGATTATGGTGTGGGAAATTTCCATGTCCTATCTTGTTCCGTCTTAAAATAATTTGAAAATTCTTTCCTTCCACCACCTTCCATGAATCCAGACTAGATGATAACCTTAGTCCTGCATTTTTTGAAGGGATTACAGAACGTTAGCTGGAAGATGGAATCTATGGGAAAAGTTGAACTTTTATGTATTGGCCCCGTTCGATTACCATTTGGTTTTCAAACACTACTTACGATTCTTGATTTCTTTGTTTCGTTACCTACTTTTTAGGAAATGTTTTTGAAATCCAAGTCAAAGTTTTGAAACTAAAAATTATAGTTTGTTTTTTTAATGTGGAATGCAAGTGGCAGAGATCATATTCATATAGGAGATCACTCCCTATTTCTTTGAGTGCACAACTAGAATAAAGACTACAAGATTATTAACAATTCATCTTTATTTTATTTGTTTTGTTGCACGCTCTGCATGAAAATGAGATAGAGAATTCTATGATTATCTTCCTTGACTTCTCTCAACGCTTAGGGAATGTGTTGAGAAACTACAGCTTTTGAAGACACCCGAGGAGCGGCAGCGCAGACTGGAGGAGCTACCGGGAATACATACAGACCCAAATATGGATCCGAGTCATGAATCTGAAGATGAGGATGAAGCAGATGATAAGAGACAAGGTTTTTTGCTTTTTCTTTTTGTTGTCAATTCACATTTCTTTTTCATTAGCTAGACAGTATAACGTTATCGGTTTACGTGCAGAAACCTACACCTTGTCAAGAGGCTCAGGCTTTAGTAGGAGGACAAGGGAGCCAGTTTCTCCTGGAAAAGCAGGTTCAAATTTGAATGATTCCTGGAGTGGTACTAGAAACTTTTCGAGCACGAATCGGGACTTGAGCAGGAACTTGTCTGGAAAAGGCTTCTCTAACCAAGGTGAAGATGCCATTGGTTCTGGTGAAATAATAAATGAAAATTCTTGGAGCCATGGAAGGGAGGGAGATGTTAAAAAACCAAATAAGTGGGACAAGCAACAAGTTTCGCCTAGCTCAGAAATGACTGCCAGGAATGCCTTGTCCGGGGCAGCGTCTGAGTTGCCTTCTGCCGCTCGTTCGGTAAATTCAGCAGCATCTCCATCTGTTGGGACTACACAAAATGCTGCTACAGTTAACGAAACAGAGAAGATTTGGCGTTATCAGGATCCATCTGGGAAAGTGCAGGGACCGTTTTCGATGGTGCAACTTCGTAAGTGGAGTAACACAGGCTATTTTCCTGCAGACTTGAGAGTATGGAGAGCCTCAGACAAGCAAGACGACTCGCTGCTTCTGACCGATGTCTTAGCGGGAAAGATTCCGAAGGATACTTCATCCGTGGACAACAGTATTCAAGCACAAGCACATGCTTCTTCTTTCGTTGCAAAGCCTCAGGGAGCTACTGTGCAGTCAGGTATGGATGTTCAGAATACTGGTACTTCAAATCCACATACTAATCCAACTTCTTATGGCCAATCTGCTGGAGGAAGATGGAAATCTCAAACTGAAGTTAGCCCTACTGGTATACCCGCCTCAGCTTCGATAGAAGTCCCCAGGTACTCGGGAGACCGATGGTCGTCTGACCATGGTAATAAGGACTTTACGAGTCTTCCTTCTCCTACTCCCAGCTCAGGAGGAACGAAGGAGCAGCCATTTCAAATGGCTACACCGTTCGCCTCCTCAGCAGGTGGTGGCAGTTTGCACGGTTCTTCACTTATGCAAGGATCCGAAAACGATTCCTTGCGCTCACATTCTGGCCTGAACGCTGCAGAAAAGGGCA
CGGGTTTAGGTCCTATAAATGGACTTCAAAATCATCATTCGCTGCCAGTAAGGCCTTCATCTATCATTGATGATACTTTGGTGAATCCAGCTGCAGATATTAAAAGCATTAGTGCAAATCTTCATTCTCTAGTACAATCCATCAACAGTCGTAATCCTCCTATTGAAACTCAAACTGTTGAAACAAACATTTCTTCTAGCATGCCGCCAGGACAAACTCTTCACAGGCGTTGGGGGGAGATGTCACCCGCGCAAAATGCTGCGACAGCGAGTTTTTCCACGCCTGGTTTAACCAATTTTTCATCCTCTGAGCCTTGGCGATCGATGCCTCCTATTCCGAGTAACCCGCCACACATTCAGTCTTCAACTCCGCCTAATATACCGTGGGGAATGGGTGCTCCAGAAGGTCAAAGCACCGTTCCACGACCGGGGTTGGAGTCTCAGAACCATAGCTGGGGGCCAATGCCATCAGGAAATCCAAACATGACTTGGGCTCCATCAGCACCTCCGAATGCTACTGGTATGATGTGGGGGTCTTCAGCTCAAAGTTCTGCTTCTGTAGGTACAAACCCAGGTTGGAATGCCCCAGGTCAAGGGCCACCAGTCAGAAACAACATTCAAGGATGGCAAGCGCATAGCTCGATACCACCTCAGGTAAACGCAACCCCGGGTTGGGTTGCCCCCAACCTCGGACCGATGCCACCTATGAACATGAATCCCAATTGGCATGCCCCATCAGCCAATCAGGGCATGTGGAGTAACGAACATGGTAAGAATGGGGATAGATTCTCGAACCCGGACAGTGTCTCTCACGGCGGAGATCCAGGGAACGGAGGCAAATCTTGGGGGATGCCACCATCTTATGGCGGCGGCGGCGGAAGTTCTTCTAGGCTTCCTTACAACAATAAAGGGCAAAAATTGTGCAAATATCATGAAAGTGGACATTGCAAGAAAGGAGGTTCTTGTGATTACAGGCACAAGTAGAATTTAGTTGTTTGACAGTGATTAGAGAGCAATTAATTTTGTATAGTCCTATCATTATTCAGAGCTTTTCTTTGTATAAACGCCCTGTTATTAATGTTATACATGTGCTACTTTTGCAATTCTCTATTCTTCGAGGGTCTTCAGATGCAAATAATTTAAGTATTATTTTTTTTTTTTTTTTTTTAGTTTAAAAA

mRNA sequence

GAGGAGCTGAGCGAAACTCAATTTTTTTGTGCAAACGCTCTGTTTGTACCGTAAAAAATATAGTCAGAGGGGGGGGAAGTGGATCTTAAAACCCTAGGGTTTCAATTCCGGGCCATCCTAATTTTTAATCCTAATTTTGTTTCTAAATGGAAGCCGAAGAGAACGATTCCTCCAAACATGACCAACCATCATCACCTCTTCTCTCCGTTGATGATGGCAACGACTTGGATGTTAAGTGTCACACCCATCGGGAGCTTCACAGCAATGAAGAACAGCATTGTTTGTTCCAGTCCGCCATTAATGAACTGGAGTTTCCATCCAATTCTAGCGTTGAATCTTTGCAGCCGAGCGATGCAATTCGTGGGGATGAGAGTCTTGTTGCAGAGACTTGTTTGGAGGTGGAGGAGACAGAGATAGCCGGGGTTAAGGCTTGTCGTAACGGTATTGAGGATATGGGGGAAGATTCAGTGAAGTTGGAGGTTGAACCGGATATTGCGGCGATGGGGTTATTGGGGGAAACGGTTTTCAATGATGTGAAAGAGGAGGATGCAGGGGCGGAGGAGGTGAAGGCTGTTGCTGAATTTGGAGAAGGTGACTTGTTATGTGAAATGGATTTGGTTGGTGGTGCTGAAAATCAGGTTGAGGGCAACGTTTTGATGGTGAATCTTCCGGACAATACTGTTGGCTGCGGTGAGACAGACACATGCTTGAGTGATGTTTTGGCTGAGCTTGCAGAAACGACTCCTTTTGTGCATGGTGTAGATACTACTGATGTAGCCAATTTGGTGGAGAGGAAAGAGGTGGAAGAAAATGCCGATGATCCCAAAGATTCGAAGGATATAGAGGTGGCAAAGCAAGAAACTTTTTCTATGGAGGATGGGAAATTAGGCGTCCCGGTGCAGCTTGTGGAGAAGTCCGAGTTGAAACAAAGTTTGGTAGATGGGGCTGTGGTGGAGGAAGGAAGGACGGAGAATTTAGCTGACAGAACTGGTGAAACTTTGAAGATGGAGAATGATTCAAGCAAAACTGATGAGGTGGGGCTGGCGAATTTTGCTGGTGAGATTGATGGGGCGGTAACTATGGAGAATACTGAAGACAAGACTGTTGAGGTGGATGGAATGTGTTTGGAAGACAAGGCTGCTGATGCGACGACGAAGACGACGACGGGAAATTTGGCAGATGAGACCCCGAAGATCAAGGGAGTGCACGTAACAGACGACAACATTGAAGTGTTGAAGATTGAAAACGTTGAAGATAGGGAAGCAGGGGTGCAAGGATTGGGTGTGGCTGATGAGAGTGCCGAGGTTGGAAAGATTGAGAATTTGGTAGATGAGACTGCAGAGGCTGAGAATGTGACAAACTACACAGCCGAATCAATGGAGAATCTGGACGACAAGACTGCACAATTGGAGGAAATAGCTATGGAAGAAGAGACTGAGGAAGCAGATGACAGGGTTTATTTGGTGGATGAAGGGATTGGATCAGAGGAGAATGATGCAAACATGACATACTTGGTGGGGGAAACAGAAGCAGCGGAGGAGGTTGAGGAGATGGATGTTACAGAGGAGGTTGATGAGGCAAGTAAAGGTAGCAGTGGGGCTAAAAGGAAGCGTGGAAAGAATTCTAAAGCTCCTGCTAGAGTTCCTTCTAGGAAGAAGGTGGAAGAAGACGTTTGCTTTATTTGTTTTGATGGGGGTGACCTTGTACTCTGTGATCGCAGAGGCTGTCCCAAGGCGTACCACCCTTCCTGTATTAATCGTGATGAGGCGTTCTTCCGAGCCAAGGGTCGATGGAATTGTGGTTGGCATCTTTGTAGTAACTGTGAGAAGACTGCCCACTACATGTGTTACACATGTACATTTTCCTTGTGCAAGGGCTGCATAAAAAATGCTGTTATTTTGTGTGTTAGAGGTAACAAAGGCTTCTGCGAGGCGTGCATGAGATTTGTTATGTTGATTGAAAAGAATGAGCAGGGAAGTACAGAAAAGGGCCAAATTGAT
TTTAATGACAAAACTAGCTGGGAATATCTTTTCAAGGAATACTGGACTGACCTGAAAGGAAGCCTTTCTCTAACTTTTGATGAACTTGTTCATGCAAAAAACCCATGGAAAGGATCTGAAACACTAAATAGCAGACCCGATTCACCTGGCGAGCTATATGATGGTAATGTTGATGGAGGATCAGATTTGGATGTTTCTGAAAATGAAGAATCTGGTAGTTCTAAGAAAAGAAAAGCTAAGAGAAGGTCAAAATCCCAAGCAAAGGAAACCAATTCCCCTAGTATGCCAATAATACCTGATTCTCAAGGGCCATCCACGGATAACAATGTTGAGTGGGCATCTAAAGAGCTCTTGGAGTTTGTTATGCACATGAAGAATGGTGATAGAACTGTTTTATCTCAGTTTGACGTGCAGGCTCTCTTATTAGAATATATTAAAAGAAATAAGCTTCGTGATCCTCGTAGGAAAAGTCAAATCATATGTGATTCAAGACTTGAGAATTTGTTTGGAAAGCCACGTGTAGGACATTTTGAAATGTTGAAGCTCCTAGAGTCACATTTCCTCATCAGAGAAGATGTGCAGATAAATGACCTCCAAGGGAGTGTTGCCGATACTGAATCAAGTCAGTTGGAAGGTGATGGGTACACCGATGCGTCGGGAAAAACTAGGAAAGAAAAGAAACGCCGGATGCGGAAAAAAGGTGATCAGAGAGGATTGCAGTCCAACCTTGATGACTATGCAGCCATTGATATTCACAACATTAATTTAATCTACCTGAGACGTAATTTGGTGGAATATCTGATTGAAGACGAGGAGAGTTTTCATGAAAAGGTTGTTGGTTCTTTTGTGAGGATAAGAATATCAGGCAATGCACAAAAACAAGATTTATACCGGTTGGTTCAAGTTGTAGGTACAAGCAAAGCGTCTGAGCCTTATAAAGTTGGTAAAAAGATGACAGATATCTTGCTAGAGATCTTGAATTTGAACAAGACAGAAGTGATTTCAATTGATATTATCTCGAATCAAGAGTTCACAGAGGAGGAGTGCAAGCGTCTCAGACAGAGCATTAAGTGTGGAATCCTTAACCGTCTGACTGTGGGTGACCTTCAGGAGAGAGCAATGTCGCTTCAAGATGCTAGAGTTAAGGATTGGATGGAAACCGAGATAGTTCGACTGAGTCATCTTCGTGATCGAGCAAGTGAAAAAGGGCGCAGAAAAGAATATCCTTTTTACAACATTATGGAATGTGTTGAGAAACTACAGCTTTTGAAGACACCCGAGGAGCGGCAGCGCAGACTGGAGGAGCTACCGGGAATACATACAGACCCAAATATGGATCCGAGTCATGAATCTGAAGATGAGGATGAAGCAGATGATAAGAGACAAGAAACCTACACCTTGTCAAGAGGCTCAGGCTTTAGTAGGAGGACAAGGGAGCCAGTTTCTCCTGGAAAAGCAGGTTCAAATTTGAATGATTCCTGGAGTGGTACTAGAAACTTTTCGAGCACGAATCGGGACTTGAGCAGGAACTTGTCTGGAAAAGGCTTCTCTAACCAAGGTGAAGATGCCATTGGTTCTGGTGAAATAATAAATGAAAATTCTTGGAGCCATGGAAGGGAGGGAGATGTTAAAAAACCAAATAAGTGGGACAAGCAACAAGTTTCGCCTAGCTCAGAAATGACTGCCAGGAATGCCTTGTCCGGGGCAGCGTCTGAGTTGCCTTCTGCCGCTCGTTCGGTAAATTCAGCAGCATCTCCATCTGTTGGGACTACACAAAATGCTGCTACAGTTAACGAAACAGAGAAGATTTGGCGTTATCAGGATCCATCTGGGAAAGTGCAGGGACCGTTTTCGATGGTGCAACTTCGTAAGTGGAGTAACACAGGCTATTTTCCTGCAGACTTGAGAGTATGGAGAGCCTCAGACAAGCAAGACGACTCGCTGCTTCTGACCGATGTCTTAGCGGGAAAGATTCCGAAGGATACTTCATCCGTGGACAA
CAGTATTCAAGCACAAGCACATGCTTCTTCTTTCGTTGCAAAGCCTCAGGGAGCTACTGTGCAGTCAGGTATGGATGTTCAGAATACTGGTACTTCAAATCCACATACTAATCCAACTTCTTATGGCCAATCTGCTGGAGGAAGATGGAAATCTCAAACTGAAGTTAGCCCTACTGGTATACCCGCCTCAGCTTCGATAGAAGTCCCCAGGTACTCGGGAGACCGATGGTCGTCTGACCATGGTAATAAGGACTTTACGAGTCTTCCTTCTCCTACTCCCAGCTCAGGAGGAACGAAGGAGCAGCCATTTCAAATGGCTACACCGTTCGCCTCCTCAGCAGGTGGTGGCAGTTTGCACGGTTCTTCACTTATGCAAGGATCCGAAAACGATTCCTTGCGCTCACATTCTGGCCTGAACGCTGCAGAAAAGGGCACGGGTTTAGGTCCTATAAATGGACTTCAAAATCATCATTCGCTGCCAGTAAGGCCTTCATCTATCATTGATGATACTTTGGTGAATCCAGCTGCAGATATTAAAAGCATTAGTGCAAATCTTCATTCTCTAGTACAATCCATCAACAGTCGTAATCCTCCTATTGAAACTCAAACTGTTGAAACAAACATTTCTTCTAGCATGCCGCCAGGACAAACTCTTCACAGGCGTTGGGGGGAGATGTCACCCGCGCAAAATGCTGCGACAGCGAGTTTTTCCACGCCTGGTTTAACCAATTTTTCATCCTCTGAGCCTTGGCGATCGATGCCTCCTATTCCGAGTAACCCGCCACACATTCAGTCTTCAACTCCGCCTAATATACCGTGGGGAATGGGTGCTCCAGAAGGTCAAAGCACCGTTCCACGACCGGGGTTGGAGTCTCAGAACCATAGCTGGGGGCCAATGCCATCAGGAAATCCAAACATGACTTGGGCTCCATCAGCACCTCCGAATGCTACTGGTATGATGTGGGGGTCTTCAGCTCAAAGTTCTGCTTCTGTAGGTACAAACCCAGGTTGGAATGCCCCAGGTCAAGGGCCACCAGTCAGAAACAACATTCAAGGATGGCAAGCGCATAGCTCGATACCACCTCAGGTAAACGCAACCCCGGGTTGGGTTGCCCCCAACCTCGGACCGATGCCACCTATGAACATGAATCCCAATTGGCATGCCCCATCAGCCAATCAGGGCATGTGGAGTAACGAACATGGTAAGAATGGGGATAGATTCTCGAACCCGGACAGTGTCTCTCACGGCGGAGATCCAGGGAACGGAGGCAAATCTTGGGGGATGCCACCATCTTATGGCGGCGGCGGCGGAAGTTCTTCTAGGCTTCCTTACAACAATAAAGGGCAAAAATTGTGCAAATATCATGAAAGTGGACATTGCAAGAAAGGAGGTTCTTGTGATTACAGGCACAAGTAGAATTTAGTTGTTTGACAGTGATTAGAGAGCAATTAATTTTGTATAGTCCTATCATTATTCAGAGCTTTTCTTTGTATAAACGCCCTGTTATTAATGTTATACATGTGCTACTTTTGCAATTCTCTATTCTTCGAGGGTCTTCAGATGCAAATAATTTAAGTATTATTTTTTTTTTTTTTTTTTTAGTTTAAAAA

Coding sequence (CDS)

ATGGAAGCCGAAGAGAACGATTCCTCCAAACATGACCAACCATCATCACCTCTTCTCTCCGTTGATGATGGCAACGACTTGGATGTTAAGTGTCACACCCATCGGGAGCTTCACAGCAATGAAGAACAGCATTGTTTGTTCCAGTCCGCCATTAATGAACTGGAGTTTCCATCCAATTCTAGCGTTGAATCTTTGCAGCCGAGCGATGCAATTCGTGGGGATGAGAGTCTTGTTGCAGAGACTTGTTTGGAGGTGGAGGAGACAGAGATAGCCGGGGTTAAGGCTTGTCGTAACGGTATTGAGGATATGGGGGAAGATTCAGTGAAGTTGGAGGTTGAACCGGATATTGCGGCGATGGGGTTATTGGGGGAAACGGTTTTCAATGATGTGAAAGAGGAGGATGCAGGGGCGGAGGAGGTGAAGGCTGTTGCTGAATTTGGAGAAGGTGACTTGTTATGTGAAATGGATTTGGTTGGTGGTGCTGAAAATCAGGTTGAGGGCAACGTTTTGATGGTGAATCTTCCGGACAATACTGTTGGCTGCGGTGAGACAGACACATGCTTGAGTGATGTTTTGGCTGAGCTTGCAGAAACGACTCCTTTTGTGCATGGTGTAGATACTACTGATGTAGCCAATTTGGTGGAGAGGAAAGAGGTGGAAGAAAATGCCGATGATCCCAAAGATTCGAAGGATATAGAGGTGGCAAAGCAAGAAACTTTTTCTATGGAGGATGGGAAATTAGGCGTCCCGGTGCAGCTTGTGGAGAAGTCCGAGTTGAAACAAAGTTTGGTAGATGGGGCTGTGGTGGAGGAAGGAAGGACGGAGAATTTAGCTGACAGAACTGGTGAAACTTTGAAGATGGAGAATGATTCAAGCAAAACTGATGAGGTGGGGCTGGCGAATTTTGCTGGTGAGATTGATGGGGCGGTAACTATGGAGAATACTGAAGACAAGACTGTTGAGGTGGATGGAATGTGTTTGGAAGACAAGGCTGCTGATGCGACGACGAAGACGACGACGGGAAATTTGGCAGATGAGACCCCGAAGATCAAGGGAGTGCACGTAACAGACGACAACATTGAAGTGTTGAAGATTGAAAACGTTGAAGATAGGGAAGCAGGGGTGCAAGGATTGGGTGTGGCTGATGAGAGTGCCGAGGTTGGAAAGATTGAGAATTTGGTAGATGAGACTGCAGAGGCTGAGAATGTGACAAACTACACAGCCGAATCAATGGAGAATCTGGACGACAAGACTGCACAATTGGAGGAAATAGCTATGGAAGAAGAGACTGAGGAAGCAGATGACAGGGTTTATTTGGTGGATGAAGGGATTGGATCAGAGGAGAATGATGCAAACATGACATACTTGGTGGGGGAAACAGAAGCAGCGGAGGAGGTTGAGGAGATGGATGTTACAGAGGAGGTTGATGAGGCAAGTAAAGGTAGCAGTGGGGCTAAAAGGAAGCGTGGAAAGAATTCTAAAGCTCCTGCTAGAGTTCCTTCTAGGAAGAAGGTGGAAGAAGACGTTTGCTTTATTTGTTTTGATGGGGGTGACCTTGTACTCTGTGATCGCAGAGGCTGTCCCAAGGCGTACCACCCTTCCTGTATTAATCGTGATGAGGCGTTCTTCCGAGCCAAGGGTCGATGGAATTGTGGTTGGCATCTTTGTAGTAACTGTGAGAAGACTGCCCACTACATGTGTTACACATGTACATTTTCCTTGTGCAAGGGCTGCATAAAAAATGCTGTTATTTTGTGTGTTAGAGGTAACAAAGGCTTCTGCGAGGCGTGCATGAGATTTGTTATGTTGATTGAAAAGAATGAGCAGGGAAGTACAGAAAAGGGCCAAATTGATTTTAATGACAAAACTAGCTGGGAATATCTTTTCAAGGAATACTGGACTGACCTGAAAGGAAGCCTTTCTCTAACTTTTGATGAACTTGTTCATGCAAAAAACCCATGGAAAGGATCTGAAACACTAAATAGCAGACCCGATTCACC
TGGCGAGCTATATGATGGTAATGTTGATGGAGGATCAGATTTGGATGTTTCTGAAAATGAAGAATCTGGTAGTTCTAAGAAAAGAAAAGCTAAGAGAAGGTCAAAATCCCAAGCAAAGGAAACCAATTCCCCTAGTATGCCAATAATACCTGATTCTCAAGGGCCATCCACGGATAACAATGTTGAGTGGGCATCTAAAGAGCTCTTGGAGTTTGTTATGCACATGAAGAATGGTGATAGAACTGTTTTATCTCAGTTTGACGTGCAGGCTCTCTTATTAGAATATATTAAAAGAAATAAGCTTCGTGATCCTCGTAGGAAAAGTCAAATCATATGTGATTCAAGACTTGAGAATTTGTTTGGAAAGCCACGTGTAGGACATTTTGAAATGTTGAAGCTCCTAGAGTCACATTTCCTCATCAGAGAAGATGTGCAGATAAATGACCTCCAAGGGAGTGTTGCCGATACTGAATCAAGTCAGTTGGAAGGTGATGGGTACACCGATGCGTCGGGAAAAACTAGGAAAGAAAAGAAACGCCGGATGCGGAAAAAAGGTGATCAGAGAGGATTGCAGTCCAACCTTGATGACTATGCAGCCATTGATATTCACAACATTAATTTAATCTACCTGAGACGTAATTTGGTGGAATATCTGATTGAAGACGAGGAGAGTTTTCATGAAAAGGTTGTTGGTTCTTTTGTGAGGATAAGAATATCAGGCAATGCACAAAAACAAGATTTATACCGGTTGGTTCAAGTTGTAGGTACAAGCAAAGCGTCTGAGCCTTATAAAGTTGGTAAAAAGATGACAGATATCTTGCTAGAGATCTTGAATTTGAACAAGACAGAAGTGATTTCAATTGATATTATCTCGAATCAAGAGTTCACAGAGGAGGAGTGCAAGCGTCTCAGACAGAGCATTAAGTGTGGAATCCTTAACCGTCTGACTGTGGGTGACCTTCAGGAGAGAGCAATGTCGCTTCAAGATGCTAGAGTTAAGGATTGGATGGAAACCGAGATAGTTCGACTGAGTCATCTTCGTGATCGAGCAAGTGAAAAAGGGCGCAGAAAAGAATATCCTTTTTACAACATTATGGAATGTGTTGAGAAACTACAGCTTTTGAAGACACCCGAGGAGCGGCAGCGCAGACTGGAGGAGCTACCGGGAATACATACAGACCCAAATATGGATCCGAGTCATGAATCTGAAGATGAGGATGAAGCAGATGATAAGAGACAAGAAACCTACACCTTGTCAAGAGGCTCAGGCTTTAGTAGGAGGACAAGGGAGCCAGTTTCTCCTGGAAAAGCAGGTTCAAATTTGAATGATTCCTGGAGTGGTACTAGAAACTTTTCGAGCACGAATCGGGACTTGAGCAGGAACTTGTCTGGAAAAGGCTTCTCTAACCAAGGTGAAGATGCCATTGGTTCTGGTGAAATAATAAATGAAAATTCTTGGAGCCATGGAAGGGAGGGAGATGTTAAAAAACCAAATAAGTGGGACAAGCAACAAGTTTCGCCTAGCTCAGAAATGACTGCCAGGAATGCCTTGTCCGGGGCAGCGTCTGAGTTGCCTTCTGCCGCTCGTTCGGTAAATTCAGCAGCATCTCCATCTGTTGGGACTACACAAAATGCTGCTACAGTTAACGAAACAGAGAAGATTTGGCGTTATCAGGATCCATCTGGGAAAGTGCAGGGACCGTTTTCGATGGTGCAACTTCGTAAGTGGAGTAACACAGGCTATTTTCCTGCAGACTTGAGAGTATGGAGAGCCTCAGACAAGCAAGACGACTCGCTGCTTCTGACCGATGTCTTAGCGGGAAAGATTCCGAAGGATACTTCATCCGTGGACAACAGTATTCAAGCACAAGCACATGCTTCTTCTTTCGTTGCAAAGCCTCAGGGAGCTACTGTGCAGTCAGGTATGGATGTTCAGAATACTGGTACTTCAAATCCACATACTAATCCAACTTCTTATGGCCAATCTGCTGGAGGAAGAT
GGAAATCTCAAACTGAAGTTAGCCCTACTGGTATACCCGCCTCAGCTTCGATAGAAGTCCCCAGGTACTCGGGAGACCGATGGTCGTCTGACCATGGTAATAAGGACTTTACGAGTCTTCCTTCTCCTACTCCCAGCTCAGGAGGAACGAAGGAGCAGCCATTTCAAATGGCTACACCGTTCGCCTCCTCAGCAGGTGGTGGCAGTTTGCACGGTTCTTCACTTATGCAAGGATCCGAAAACGATTCCTTGCGCTCACATTCTGGCCTGAACGCTGCAGAAAAGGGCACGGGTTTAGGTCCTATAAATGGACTTCAAAATCATCATTCGCTGCCAGTAAGGCCTTCATCTATCATTGATGATACTTTGGTGAATCCAGCTGCAGATATTAAAAGCATTAGTGCAAATCTTCATTCTCTAGTACAATCCATCAACAGTCGTAATCCTCCTATTGAAACTCAAACTGTTGAAACAAACATTTCTTCTAGCATGCCGCCAGGACAAACTCTTCACAGGCGTTGGGGGGAGATGTCACCCGCGCAAAATGCTGCGACAGCGAGTTTTTCCACGCCTGGTTTAACCAATTTTTCATCCTCTGAGCCTTGGCGATCGATGCCTCCTATTCCGAGTAACCCGCCACACATTCAGTCTTCAACTCCGCCTAATATACCGTGGGGAATGGGTGCTCCAGAAGGTCAAAGCACCGTTCCACGACCGGGGTTGGAGTCTCAGAACCATAGCTGGGGGCCAATGCCATCAGGAAATCCAAACATGACTTGGGCTCCATCAGCACCTCCGAATGCTACTGGTATGATGTGGGGGTCTTCAGCTCAAAGTTCTGCTTCTGTAGGTACAAACCCAGGTTGGAATGCCCCAGGTCAAGGGCCACCAGTCAGAAACAACATTCAAGGATGGCAAGCGCATAGCTCGATACCACCTCAGGTAAACGCAACCCCGGGTTGGGTTGCCCCCAACCTCGGACCGATGCCACCTATGAACATGAATCCCAATTGGCATGCCCCATCAGCCAATCAGGGCATGTGGAGTAACGAACATGGTAAGAATGGGGATAGATTCTCGAACCCGGACAGTGTCTCTCACGGCGGAGATCCAGGGAACGGAGGCAAATCTTGGGGGATGCCACCATCTTATGGCGGCGGCGGCGGAAGTTCTTCTAGGCTTCCTTACAACAATAAAGGGCAAAAATTGTGCAAATATCATGAAAGTGGACATTGCAAGAAAGGAGGTTCTTGTGATTACAGGCACAAGTAG

Protein sequence

MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRELHSNEEQHCLFQSAINELEFPSNSSVESLQPSDAIRGDESLVAETCLEVEETEIAGVKACRNGIEDMGEDSVKLEVEPDIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNLPDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTDVANLVERKEVEENADDPKDSKDIEVAKQETFSMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGETLKMENDSSKTDEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLADETPKIKGVHVTDDNIEVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENVTNYTAESMENLDDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEEVDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSMPIIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSTNRDLSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAKPQGATVQSGMDVQNTGTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGDRWSSDHGNKDFTSLPSPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETQTVETNISSSMPPGQTLHRRWGEMSPAQNAATASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPPNIPWGMGAPEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSSASVGTNPGWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGMWSNEHGKNGDRFSNPDSVSHGGDPGNGGKSWGMPPSYGGGGGSSSRLPYNNKGQKLCKYHESGHCKKGGSCDYRHK
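
The protein sequence is consistent with a translation of the CDS in the first reading frame. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene, the following translates the first 60 nucleotides of the CDS listed above. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, listed in codon order with
# bases ranked T, C, A, G at each position ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing non-ACGT characters (for example N) become "X".
    Trailing bases that do not fill a codon are ignored.
    """
    protein = []
    usable = len(cds) - len(cds) % 3
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt of the CDS shown in the "Coding sequence (CDS)" section.
cds_start = "ATGGAAGCCGAAGAGAACGATTCCTCCAAACATGACCAACCATCATCACCTCTTCTCTCC"
print(translate(cds_start))  # MEAEENDSSKHDQPSSPLLS
```

The output matches the first 20 residues of the protein sequence above. Passing the full CDS string to `translate` would allow the same comparison over the whole protein.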
Homology
BLAST of Cp4.1LG04g01060 vs. ExPASy Swiss-Prot
Match: Q9SIV5 (Zinc finger CCCH domain-containing protein 19 OS=Arabidopsis thaliana OX=3702 GN=NERD PE=1 SV=3)

HSP 1 Score: 946.4 bits (2445), Expect = 4.5e-274
Identity = 707/1802 (39.23%), Positives = 962/1802 (53.39%), Query Frame = 0

Query: 64   SLQPSDAIRGDESLVAETCLE------------------VEETEIAGVKACRNGIEDMGE 123
            ++Q  D++ GD + V E  L+                   EE  +A      + +E+  E
Sbjct: 95   NIQEIDSVGGDAAAVEEVPLKSSSVVGEGREEEAGASIVKEEDFVAEANLSGDRLEENKE 154

Query: 124  DSVKLEVEP---DIAAMGLLGETVFNDVKEEDAGAEEV--KAVAEFGEGDLLCEMDLVGG 183
              V +E EP   +++   + G    ND +  + G + V      E  E DL  + + V  
Sbjct: 155  --VSMEEEPSSHELSVCEVNGVDSLNDEENREVGEQIVCGSMGGEEIESDLESKKEKVDV 214

Query: 184  AEN----QVEGNVLMVNLPDN-TVGCGETDTCLSDVLAELAET------TPFVHGVDTTD 243
             E     Q    V  + +PD+  V C    T +S     L E+         V  +   +
Sbjct: 215  IEEETTAQAASLVNAIEIPDDKEVACVAGFTEISSQDKGLDESGNGFLDEEPVKELQIGE 274

Query: 244  VANLVERKEVEENADDPKDSKDIEVAKQ--------ETFSMEDGKLGVPVQLVEKSELKQ 303
             A  +   + +E  D  +D  DI+V K+         T  +E   + + V  V      +
Sbjct: 275  GAKDLTDGDAKEGVDVTEDEMDIQVLKKSKEEEKVDSTTELEIETMRLEVHDVATEMSDK 334

Query: 304  SLVDGAVVEE--GRTEN----LADRTGETLKMENDSSKTDEVGLANFAGEID-------- 363
            +++  AVV +  G T N    + D   E +  ++++ K+ ++ +     E+D        
Sbjct: 335  TVISSAVVTQFTGETSNDKETVMDDVKEDVDKDSEAGKSLDIHVPEATEEVDTDVNYGVG 394

Query: 364  ---------GA------VTMENTEDKTVEVD-GMCLEDKAADATTKTTTGNLADETPKIK 423
                     GA      V +E   ++  E+   +   D+   +     T  +  +  + K
Sbjct: 395  IEKEGDGVGGAEEAGQTVDLEEIREENQELSKELAQVDETKISEMSEVTETMIKDEDQEK 454

Query: 424  GVHVTD--DNIEVLKIENVEDREAG---VQGLGVADESAE--VGKIENLVDETAEAENVT 483
              ++TD  +++E  +  +V D E G    + +GV +   E  +GK++         E  T
Sbjct: 455  DDNMTDLAEDVENHRDSSVADIEEGREDHEDMGVTETQKETVLGKVDRTKIAEVSEETDT 514

Query: 484  NYTAESMENLDDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAE 543
                E  E  D+ T   E++    ++  AD     ++EG  S+E    MT    ++  A+
Sbjct: 515  RIEDEDQEKDDEMTDVAEDVKTHGDSSVAD-----IEEGRESQE---EMTETQEDSVMAD 574

Query: 544  EVEEMDVTEEVDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDR 603
            E       EEV+E +K S+G KRKRG+N+K      + KK EEDVCF+CFDGGDLVLCDR
Sbjct: 575  E-----EPEEVEEENK-SAGGKRKRGRNTKTVK--GTGKKKEEDVCFMCFDGGDLVLCDR 634

Query: 604  RGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVI 663
            RGC KAYHPSC++RDEAFF+ KG+WNCGWHLCS CEKTA Y+CYTC FSLCKGC K+AV 
Sbjct: 635  RGCTKAYHPSCVDRDEAFFQTKGKWNCGWHLCSKCEKTATYLCYTCMFSLCKGCAKDAVF 694

Query: 664  LCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTF 723
             C+RGNKG CE CM  V LIE+ +Q   E  Q+DFNDKTSWEYLFK+YW DLK  LSL+ 
Sbjct: 695  FCIRGNKGLCETCMETVKLIERKQQ-EKEPAQLDFNDKTSWEYLFKDYWIDLKTQLSLSP 754

Query: 724  DELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQ 783
            +EL  AK P KG ET  S+  +  E  D   DGGSD D        S KKRK + RSKS 
Sbjct: 755  EELDQAKRPLKGHETNASKQGTASET-DYVTDGGSDSD-------SSPKKRKTRSRSKSG 814

Query: 784  AKETNSPSMPIIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIK 843
            + E       I+       +D  +EWASKELL+ V+HM+ GDR+ L   +VQ LLL YIK
Sbjct: 815  SAE------KILSSGDKNLSDETMEWASKELLDLVVHMRRGDRSFLPMLEVQTLLLAYIK 874

Query: 844  RNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTE 903
            R  LRDPRRKSQ+ICDSRL+NLFGK  VGHFEML LL+SHFL +E  Q +D+QG + DTE
Sbjct: 875  RYNLRDPRRKSQVICDSRLQNLFGKSHVGHFEMLNLLDSHFLKKEQNQADDIQGDIVDTE 934

Query: 904  S-SQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVE 963
              + ++ D   D   K+ K+KKR+ RKK  ++G QSNLDD+AA+D+HNINLIYLRR+LVE
Sbjct: 935  EPNHVDVDENLDHPVKSGKDKKRKTRKKNVRKGRQSNLDDFAAVDMHNINLIYLRRSLVE 994

Query: 964  YLIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEI 1023
             L+ED  +F EKV  +FVR+RISGN QKQDLYRLVQVVGTSKA EPYKVGKK TD +LEI
Sbjct: 995  DLLEDSTAFEEKVASAFVRLRISGN-QKQDLYRLVQVVGTSKAPEPYKVGKKTTDYVLEI 1054

Query: 1024 LNLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWM 1083
            LNL+KTEVISIDIISNQ+FTE+ECKRL+QSIKCG++NRLTVGD+QE+A++LQ+ RVK+ +
Sbjct: 1055 LNLDKTEVISIDIISNQDFTEDECKRLKQSIKCGLINRLTVGDIQEKAIALQEVRVKNLL 1114

Query: 1084 ETEIVRLSHLRDRASEKGRRKEYPF----------YNIMECVEKLQLLKTPEERQRRLEE 1143
            E EI+R SHLRDRAS+ GRRKEYP+            + ECVEKLQLLK+PEERQRRLEE
Sbjct: 1115 EAEILRFSHLRDRASDMGRRKEYPYLLKLSNSLTMLTLRECVEKLQLLKSPEERQRRLEE 1174

Query: 1144 LPGIHTDPNMDPSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWS 1203
            +P IH DP MDP  ESEDEDE ++K +E     R S F+RR R+P+SP K G + N+SW+
Sbjct: 1175 IPEIHADPKMDPDCESEDEDEKEEKEKEKQLRPRSSSFNRRGRDPISPRKGGFSSNESWT 1234

Query: 1204 GTRNFSST--NRDLSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQ 1263
            GT N+S+T  NR+LSR+ SG+G + +G+    S + ++++ W+  RE +V+     +K +
Sbjct: 1235 GTSNYSNTSANRELSRSYSGRGSTGRGDYLGSSDDKVSDSMWTSAREREVQPSLGSEKPR 1294

Query: 1264 VSPSSEMTARNALSGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKV 1323
                 E  AR++ + A  EL     S  S A P+V  +Q     N++EKIW Y+DPSGKV
Sbjct: 1295 SVSIPETPARSSRAIAPPELSPRIASEISMAPPAV-VSQPVPKSNDSEKIWHYKDPSGKV 1354

Query: 1324 QGPFSMVQLRKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQA 1383
            QGPFSM QLRKW+NTGYFPA L +W+A++   DS+LLTD LAG   K T +VDNS   +A
Sbjct: 1355 QGPFSMAQLRKWNNTGYFPAKLEIWKANESPLDSVLLTDALAGLFQKQTQAVDNSYM-KA 1414

Query: 1384 HASSFVAKPQGATVQSGMDVQNTGTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASI 1443
              ++F       + QS     N G +                      ++PT      +I
Sbjct: 1415 QVAAF-------SGQSSQSEPNLGFA--------------------ARIAPT------TI 1474

Query: 1444 EVPRYSGDRWSSDHGNKDFTSLPSPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQG 1503
            E+PR S D WS         SLPSPTP+         Q+ TP A      S    +    
Sbjct: 1475 EIPRNSQDTWSQG------GSLPSPTPN---------QITTPTAKRRNFESRWSPTKPSP 1534

Query: 1504 SENDSLRSHSGLNAAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLH 1563
               +   ++S   + +  T    I  + N         ++   T   P  D  ++S N  
Sbjct: 1535 QSANQSMNYSVAQSGQSQTSRIDIPVVVN------SAGALQPQTYPIPTPDPINVSVNHS 1594

Query: 1564 SLVQSINSRNPPIETQTVETNI-SSSMPPGQTLHRRWGEMSPAQNAATASFSTPGLTNFS 1623
            + + S           +++T+   S+ P  Q     +G  SP   +   S S PG   F 
Sbjct: 1595 ATLHSPTPAGGKQSWGSMQTDHGGSNTPSSQNNSTSYGTPSP---SVLPSQSQPG---FP 1654

Query: 1624 SSEPWRSMPPIPSNPPHIQSSTPPNIPWGMGAPEGQSTVPRP-GLESQNHSWGPMPSGNP 1683
             S+ W+    +PS P     +      WGM          +P    +QN SWG   + NP
Sbjct: 1655 PSDSWK--VAVPSQP-----NAQAQAQWGMNMVNNNQNSAQPQAPANQNSSWG-QGTVNP 1714

Query: 1684 NMTWAPSAPPNATGMMWGSSAQSSASVGTNPGWNAPGQGPPVRNNIQGW--------QAH 1743
            NM W   A         GSS  S+    T+ GW AP QG        GW        Q+ 
Sbjct: 1715 NMGWVGPAQTGVNVNWGGSSVPSTVQGITHSGWVAPVQGQTQAYPNPGWGPTGHPQSQSQ 1772

Query: 1744 SSIPPQVNAT-PGWVAPNLGPMPPMNMNPNWHAPSANQGMWSNEHGKNGDRFSNPDSVSH 1756
            S +  Q   T  GW+ P  G +   N N NW                      N  ++  
Sbjct: 1775 SQVQAQAGTTGSGWMQPGQG-IQSGNSNQNWGT-------------------QNQTAIPS 1772
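
The percentages in each HSP summary are simply the match counts divided by the alignment length. The sketch below recomputes them from the counts reported on this page for the two Swiss-Prot hits; the dictionary layout and the helper name `percent` are assumptions made for illustration, not a BLAST output format.

```python
def percent(matches: int, aligned: int) -> str:
    """Return matches/aligned as a percentage string with two decimals."""
    return f"{100 * matches / aligned:.2f}"


# Counts copied from the HSP summary lines of the two hits on this page.
hsps = {
    "Q9SIV5": {"identities": 707, "positives": 962, "aligned": 1802},
    "Q9SD34": {"identities": 483, "positives": 683, "aligned": 1442},
}

for accession, hsp in hsps.items():
    identity = percent(hsp["identities"], hsp["aligned"])
    positives = percent(hsp["positives"], hsp["aligned"])
    print(f"{accession}: identity {identity}%, positives {positives}%")
```

The alignment length in the denominator includes gap columns, which is why it can exceed the length of either aligned segment.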

BLAST of Cp4.1LG04g01060 vs. ExPASy Swiss-Prot
Match: Q9SD34 (Zinc finger CCCH domain-containing protein 44 OS=Arabidopsis thaliana OX=3702 GN=At3g51120 PE=2 SV=3)

HSP 1 Score: 600.1 bits (1546), Expect = 7.9e-170
Identity = 483/1442 (33.50%), Positives = 683/1442 (47.36%), Query Frame = 0

Query: 412  ENLDDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDV 471
            + L     +L  +A  EE+      +  VD+      N      +   T A      M  
Sbjct: 6    KQLQQGVPELASLAGREESSVRGIDLMRVDQCEEIGVNQVPALSVPASTVAGAVAVPMSN 65

Query: 472  TEEV---DEASKGSSGAKRKRGKNSKAPARV----------PSRKKVEEDVCFICFDGGD 531
             +EV   DEA+      KRKRG+  +A A            P ++  EEDVCFICFDGGD
Sbjct: 66   EQEVKVIDEAAP----IKRKRGRPPRAQANTPLHIRPPPPPPKKEDKEEDVCFICFDGGD 125

Query: 532  LVLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGC 591
            LVLCDRR CPKAYHP+CI RDEAFFR   +WNCGWH+C  C+K + YMCYTCTFS+CK C
Sbjct: 126  LVLCDRRNCPKAYHPACIKRDEAFFRTTAKWNCGWHICGTCQKASSYMCYTCTFSVCKRC 185

Query: 592  IKNAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKG 651
            IK+A  + VRGN G C  C++ +MLIE   QG  E  ++DF+DK SWEYLFK YW  LK 
Sbjct: 186  IKDADYVIVRGNMGLCGTCIKPIMLIENIAQGDNEAVKVDFDDKLSWEYLFKVYWLCLKE 245

Query: 652  SLSLTFDELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAK 711
             LSLT DEL  A NPWK  E  N+ P    +    +      LDV+ N   G+ ++R   
Sbjct: 246  ELSLTVDELTRANNPWK--EVPNTAPKVESQ---NDHTNNRALDVAVN---GTKRRR--- 305

Query: 712  RRSKSQAKETNSPSMPIIPDSQGPS-----TDNNVEWASKELLEFVMHMKNGDRTVLSQF 771
                     ++SP++P   D + PS        +  WA+KELLEFV  MKNGD +VLSQF
Sbjct: 306  --------TSDSPTLPNKLDGKNPSNILKKAPGDTSWATKELLEFVSFMKNGDTSVLSQF 365

Query: 772  DVQALLLEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQ- 831
            DVQ LLL+YIK+  LRDP +KSQ++CD  L  LFGK RVGHFEMLKLLESH LI+E  + 
Sbjct: 366  DVQGLLLDYIKKKNLRDPLQKSQVLCDQMLVKLFGKQRVGHFEMLKLLESHVLIQEKPKG 425

Query: 832  INDLQGSVADTESSQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNI 891
                 G       SQ+E D   D      ++++R+MR+K D R    NLD YAAID+HNI
Sbjct: 426  AKTTNGETTHAVPSQIEEDSVHD---PMVRDRRRKMRRKTDGRVQNENLDAYAAIDVHNI 485

Query: 892  NLIYLRRNLVEYLIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKV 951
            NLIYLRR  +E L++D     EKVVG+ +RI++SG+ QK D++RLVQVVGTSKA   Y++
Sbjct: 486  NLIYLRRKFLESLLDDINKVDEKVVGTILRIKVSGSDQKLDIHRLVQVVGTSKAIASYQL 545

Query: 952  GKKMTDILLEILNLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAM 1011
            G K TD++LEILNL+K EVISID +S+Q  TE+ECKRLRQSIKCG+  RLTV D+ + A 
Sbjct: 546  GAKTTDVMLEILNLDKREVISIDQLSDQNITEDECKRLRQSIKCGLNKRLTVVDILKTAA 605

Query: 1012 SLQDARVKDWMETEIVRLSHLRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLE 1071
            +LQ  R+ + +E EI++L+HLRDRA                  +KL+LLK+PEERQR L+
Sbjct: 606  TLQAMRINEALEAEILKLNHLRDRA------------------KKLELLKSPEERQRLLQ 665

Query: 1072 ELPGIHTDPNMDPSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSW 1131
            E+P +HTDP+MDPSH   ++     ++Q+ +  ++  G          P   G NLN   
Sbjct: 666  EVPEVHTDPSMDPSHALSEDAGLGTRKQDNHVKAQSKG----------PQNKGVNLN--- 725

Query: 1132 SGTRNFSSTNRDLSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQV 1191
                       ++  N+  K              I+   +  H  + D  K +       
Sbjct: 726  -----------NVGNNVQKK----------YDAPILRSRNNVHADKDDCSKVHN------ 785

Query: 1192 SPSSEMTARNALSGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQ 1251
                                      NS+        Q     +E  +IW Y+DP+GK Q
Sbjct: 786  --------------------------NSS------NIQETGKDDEESEIWHYRDPTGKTQ 845

Query: 1252 GPFSMVQLRKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDT-----SSVDNSI 1311
            GPFSMVQLR+W ++G+FP  LR+WRA + QD+S+LLTD LAG+  K T     SS+   +
Sbjct: 846  GPFSMVQLRRWKSSGHFPPYLRIWRAHENQDESVLLTDALAGRFDKATTLPSSSSLPQEL 905

Query: 1312 QAQAHASSFVAKPQGATVQSGMDVQNTGTSNPHTNPTSYGQ-----------SAGGRWKS 1371
            +   H S           ++ M V  + TS+  +  T++             +  G+ + 
Sbjct: 906  KPSPHDSGRTGADVNCLQKNQMPVNTSATSSSSSTVTAHSNDPKEKQVVALVACSGKVED 965

Query: 1372 QTEVSP---TGIPASASI-----------EVP---RYSGDRWSSDHG------------- 1431
               V P      PAS S+           E P   +Y+  R   +H              
Sbjct: 966  GNSVRPQPQVSCPASISVVPGHVVTPDVRETPGTDQYNTVRADGNHNTTKTLEDETNGGS 1025

Query: 1432 --------------NKDFTSLPSPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQGS 1491
                             F   PSPTP S   ++   Q A    S +    + G S +  S
Sbjct: 1026 VSINGSVHAPNLNQESHFLDFPSPTPKS-SPEDLEAQAAETIQSLSSCVLVKGPSGVTWS 1085

Query: 1492 ENDSLRSHSGLNAAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHS 1551
               +  + +    +      G +            P  I  +T+V  A  +K I      
Sbjct: 1086 TTTTSTTDAATTTSSVVVTGGQL------------PQVIQQNTVVLAAPSVKPIELAADH 1145

Query: 1552 LVQSINSRNPPI-----------ETQTVETNISSSMPPGQTLHRRWGEMSPAQNAATASF 1611
               +  S N  +           +    + ++S  +   + + +     SP     T++F
Sbjct: 1146 ATATQTSDNTQVAQASGWPAIVADPDECDESVSDLLAEVEAMEQNGLPSSP-----TSTF 1205

Query: 1612 STPGLTNFSSSE-------PWRSMPPIPSNPPHIQSSTPPNIPWGMGAPEGQSTVPRPGL 1671
                  +    E          S+ P        Q+S   N+  G  +   ++       
Sbjct: 1206 HCDDDDDLKGPEKDFFNPVARMSLTPETCRLDVSQTSILDNVSAGKSSMLTEA------- 1265

Query: 1672 ESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSSA-SVGTNPGWNAPGQGPPVRN 1731
               N  +    +  P +      PP  T +    +  ++A  +G+     A G    +  
Sbjct: 1266 -KDNTPFSHCGTAGPELLLFAPPPPPPTAISHDLTLTTTALRLGSETTVEA-GTVERLPK 1291

Query: 1732 NIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGMWSNEHGKNGDRFS 1756
            ++ G  +  S P  +++     A       P    P   +  +    W N +G N   F+
Sbjct: 1326 SVLGVSSEPS-PRSLSSHDSSSARGSTERSPRVSQPKRSSGHSRDRQWLN-NGHNSS-FN 1291

BLAST of Cp4.1LG04g01060 vs. ExPASy Swiss-Prot
Match: Q9FT92 (Uncharacterized protein At5g08430 OS=Arabidopsis thaliana OX=3702 GN=At5g08430 PE=1 SV=2)

HSP 1 Score: 184.1 bits (466), Expect = 1.4e-44
Identity = 157/562 (27.94%), Positives = 260/562 (46.26%), Query Frame = 0

Query: 728  VEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRKSQIICDSRLENLF 787
            V W S++L+EF+  +      ++S++DV   + +YI +  L DP  K +++CD RL  LF
Sbjct: 30   VGWGSRQLIEFLHSLGKDTSEMISRYDVSDTIAKYISKEGLLDPSNKKKVVCDKRLVLLF 89

Query: 788  GKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYTDASGKTRKEKKRR 847
            G   +   ++  LLE H+  +E+   +D      D        +     + K  K+ +  
Sbjct: 90   GTRTIFRMKVYDLLEKHY--KENQDDSDFDFLYEDEPQIICHSEKIAKRTSKVVKKPR-- 149

Query: 848  MRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHEKVVGSFVRIRISG 907
                            +AAI   NI L+YLR++LV+ L++  ++F  K++GSFVRI+   
Sbjct: 150  --------------GTFAAIVSDNIKLLYLRKSLVQELLKSPDTFEGKMLGSFVRIKSDP 209

Query: 908  NAQKQDL-YRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISIDIISNQEFTEEE 967
            N   Q   Y+LVQV G  K            D LL++ N  K   +SI ++S+  F++EE
Sbjct: 210  NDYLQKYPYQLVQVTGVKKE-------HGTDDFLLQVTNYVKD--VSISVLSDDNFSQEE 269

Query: 968  CKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLRDRASEKGRRKEY 1027
            C+ L Q IK G+L + T+ +++E+A  L   + K W+  EI  L  L DRA+EKG R+E 
Sbjct: 270  CEDLHQRIKNGLLKKPTIVEMEEKAKKLHKDQTKHWLGREIELLKRLIDRANEKGWRRE- 329

Query: 1028 PFYNIMECVEKLQLLKTPEERQRRLEELPGI-------HTDPNMDPSHESEDEDEADDKR 1087
                + E ++K +LL+ P+E+ R L E+P +       + + +   +H+S++E    +  
Sbjct: 330  ----LSEYLDKRELLQNPDEQARLLREVPEVIGEELVQNPEVSSPEAHKSDNEQRLSESP 389

Query: 1088 QE-TYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSTNRDLSRNLSGKGFS--- 1147
                +             +  + G   SN   +   T   +  N+ L   ++  G     
Sbjct: 390  LSCIHETPEARNLFGGEDQQFNNGYVMSNPITTPGITSCATEINKGLPTWIASAGAEYLH 449

Query: 1148 ---NQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAASELP 1207
                Q  + I  GE   E S     +  +   N  +  QV P+              EL 
Sbjct: 450  VDVEQPANGIIGGETPTEESKVSQLQSSIPVNNVDNGSQVQPN---------PSEVIELS 509

Query: 1208 SAARSVNSAASPSVGTTQNAATVN-ETEKI-WRYQDPSGKVQGPFSMVQLRKWSNTGYFP 1267
                  N          ++   ++ + EK+ W Y+DP G VQGPFS+ QL+ WS+  YF 
Sbjct: 510  DDDEDDNGDGETLDPKVEDVRVLSYDKEKLNWLYKDPQGLVQGPFSLTQLKAWSDAEYFT 550

Query: 1268 ADLRVWRASDKQDDSLLLTDVL 1273
               RVW   +  + ++LLTDVL
Sbjct: 570  KQFRVWMTGESMESAVLLTDVL 550

BLAST of Cp4.1LG04g01060 vs. ExPASy Swiss-Prot
Match: Q6P2L6 (Histone-lysine N-methyltransferase NSD3 OS=Mus musculus OX=10090 GN=Nsd3 PE=1 SV=2)

HSP 1 Score: 88.6 bits (218), Expect = 7.7e-16
Identity = 47/113 (41.59%), Positives = 60/113 (53.10%), Query Frame = 0

Query: 472  TEEVDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAY 531
            T  VDE +K    AK K+ +  KA A     K + ED CF C DGG+LV+CD++ CPKAY
Sbjct: 1296 TSAVDEKTK---NAKLKKRRKVKAEA-----KPIHEDYCFQCGDGGELVMCDKKDCPKAY 1355

Query: 532  HPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVI 585
            H  C+N  +      G+W C WH C  C   A   C  C  S CK   K A++
Sbjct: 1356 HLLCLNLTQP---PHGKWECPWHRCDECGSVAVSFCEFCPHSFCKAHGKGALV 1397

BLAST of Cp4.1LG04g01060 vs. ExPASy Swiss-Prot
Match: Q9BZ95 (Histone-lysine N-methyltransferase NSD3 OS=Homo sapiens OX=9606 GN=NSD3 PE=1 SV=1)

HSP 1 Score: 85.1 bits (209), Expect = 8.5e-15
Identity = 42/109 (38.53%), Positives = 58/109 (53.21%), Query Frame = 0

Query: 476  DEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPSC 535
            +E    ++  K+KR K    P      K++ ED CF C DGG+LV+CD++ CPKAYH  C
Sbjct: 1296 NEEKAKNAKLKQKRRKIKTEP------KQMHEDYCFQCGDGGELVMCDKKDCPKAYHLLC 1355

Query: 536  INRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVI 585
            +N  +  +   G+W C WH C  C   A   C  C  S CK   K A++
Sbjct: 1356 LNLTQPPY---GKWECPWHQCDECSSAAVSFCEFCPHSFCKDHEKGALV 1395

BLAST of Cp4.1LG04g01060 vs. NCBI nr
Match: XP_023531029.1 (zinc finger CCCH domain-containing protein 44-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 3412 bits (8848), Expect = 0.0
Identity = 1749/1756 (99.60%), Positives = 1750/1756 (99.66%), Query Frame = 0

Query: 1    MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRELHSNEEQHCLFQSAINELEFPSNS 60
            MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRELHSNEEQHCLFQSAINELEFPSNS
Sbjct: 1    MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRELHSNEEQHCLFQSAINELEFPSNS 60

Query: 61   SVESLQPSDAIRGDESLVAETCLEVEETEIAGVKACRNGIEDMGEDSVKLEVEPDIAAMG 120
            SVESLQPSDAIRGDESLVAETCLEVEETEIAGVKACRNGIEDMGEDSVKLEVEPDIAAMG
Sbjct: 61   SVESLQPSDAIRGDESLVAETCLEVEETEIAGVKACRNGIEDMGEDSVKLEVEPDIAAMG 120

Query: 121  LLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNLPDNTVG 180
            LLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNLPDNTVG
Sbjct: 121  LLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNLPDNTVG 180

Query: 181  CGETDTCLSDVLAELAETTPFVHGVDTTDVANLVERKEVEENADDPKDSKDIEVAKQETF 240
            CGETDTCLSDVLAELAETTPFVHGVDTTDVANLVERKEVEENADDPKDSKDIEVAKQETF
Sbjct: 181  CGETDTCLSDVLAELAETTPFVHGVDTTDVANLVERKEVEENADDPKDSKDIEVAKQETF 240

Query: 241  SMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGETLKMENDSSKTDEVGLA 300
            SMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGETLKMENDSSKTDEVGLA
Sbjct: 241  SMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGETLKMENDSSKTDEVGLA 300

Query: 301  NFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLADETPKIKGVHVTDDNI 360
            NFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLADETPKIKGVHVTDDNI
Sbjct: 301  NFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLADETPKIKGVHVTDDNI 360

Query: 361  EVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENVTNYTAESMENLDDKTAQ 420
            EVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENVTNYTAESMENLDDKTAQ
Sbjct: 361  EVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENVTNYTAESMENLDDKTAQ 420

Query: 421  LEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEEVDEASK 480
            LEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEEVDEASK
Sbjct: 421  LEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEEVDEASK 480

Query: 481  GSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPSCINRDE 540
            GSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPSCINRDE
Sbjct: 481  GSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPSCINRDE 540

Query: 541  AFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFCEACMRF 600
            AFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFCEACMRF
Sbjct: 541  AFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFCEACMRF 600

Query: 601  VMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPWKGSETL 660
            VMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPWKGSETL
Sbjct: 601  VMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPWKGSETL 660

Query: 661  NSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSMPIIPDSQ 720
            NSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSMPIIPDSQ
Sbjct: 661  NSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSMPIIPDSQ 720

Query: 721  GPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRKSQIICD 780
            GPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRKSQIICD
Sbjct: 721  GPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRKSQIICD 780

Query: 781  SRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYTDASGKT 840
            SRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYTDASGKT
Sbjct: 781  SRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYTDASGKT 840

Query: 841  RKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHEKVVGSF 900
            RKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHEKVVGSF
Sbjct: 841  RKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHEKVVGSF 900

Query: 901  VRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISIDIISNQ 960
            VRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISIDIISNQ
Sbjct: 901  VRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISIDIISNQ 960

Query: 961  EFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLRDRASEK 1020
            EFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLRDRASEK
Sbjct: 961  EFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLRDRASEK 1020

Query: 1021 GRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDEADDKRQ 1080
            GRRKE     + ECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDEADDKRQ
Sbjct: 1021 GRRKE-----LRECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDEADDKRQ 1080

Query: 1081 ETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSTNRDLSRNLSGKGFSNQGED 1140
            ETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSTNRDLSRNLSGKGFSNQGED
Sbjct: 1081 ETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSTNRDLSRNLSGKGFSNQGED 1140

Query: 1141 AIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAASELPSAARSVNS 1200
            AIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAASELPSAARSVNS
Sbjct: 1141 AIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAASELPSAARSVNS 1200

Query: 1201 AASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLRVWRASD 1260
            AASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLRVWRASD
Sbjct: 1201 AASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLRVWRASD 1260

Query: 1261 KQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAKPQGATVQSGMDVQNTGTSNPH 1320
            KQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAKPQGATVQSGMDVQNTGTSNPH
Sbjct: 1261 KQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAKPQGATVQSGMDVQNTGTSNPH 1320

Query: 1321 TNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGDRWSSDHGNKDFTSLPSPTPSS 1380
            TNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGDRWSSDHGNKDFTSLPSPTPSS
Sbjct: 1321 TNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGDRWSSDHGNKDFTSLPSPTPSS 1380

Query: 1381 GGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGPINGLQN 1440
            GGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGPINGLQN
Sbjct: 1381 GGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGPINGLQN 1440

Query: 1441 HHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETQTVETNISSSMPPG 1500
            HHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETQTVETNISSSMPPG
Sbjct: 1441 HHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETQTVETNISSSMPPG 1500

Query: 1501 QTLHRRWGEMSPAQNAATASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPPNIPWGM 1560
            QTLHRRWGEMSPAQNAATASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPPNIPWGM
Sbjct: 1501 QTLHRRWGEMSPAQNAATASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPPNIPWGM 1560

Query: 1561 GAPEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSSASVGTNP 1620
            GAPEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSSASVGTNP
Sbjct: 1561 GAPEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSSASVGTNP 1620

Query: 1621 GWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGM 1680
            GWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGM
Sbjct: 1621 GWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGM 1680

Query: 1681 WSNEHGKNGDRFSNPDSVSHGGDPGNGGKSWGMPPSYGGGGGSSSRLPYNNKGQKLCKYH 1740
            WSNEHGKNGDRFSNPDSVSHGGDPGNGGKSWGMPPSYGGGGGSSSRLPYNNKGQKLCKYH
Sbjct: 1681 WSNEHGKNGDRFSNPDSVSHGGDPGNGGKSWGMPPSYGGGGGSSSRLPYNNKGQKLCKYH 1740

Query: 1741 ESGHCKKGGSCDYRHK 1756
            ESGHCKKGGSCDYRHK
Sbjct: 1741 ESGHCKKGGSCDYRHK 1751

BLAST of Cp4.1LG04g01060 vs. NCBI nr
Match: XP_022928299.1 (zinc finger CCCH domain-containing protein 44-like [Cucurbita moschata])

HSP 1 Score: 3338 bits (8656), Expect = 0.0
Identity = 1720/1764 (97.51%), Positives = 1734/1764 (98.30%), Query Frame = 0

Query: 1    MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRELHSNEEQHCLFQSAINELEFPSNS 60
            MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHT+RELHSNEEQHCLFQSAINELEFPSNS
Sbjct: 1    MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTYRELHSNEEQHCLFQSAINELEFPSNS 60

Query: 61   SVESLQPSDAIRGDESLVAETCLEVE------ETEIAGVKACRNGIEDMGEDSVKLEVEP 120
            SVESLQ SDAIRGDESLVAETCLEVE      ETEIAGVKACRNGIEDMGEDSVKLEVEP
Sbjct: 61   SVESLQSSDAIRGDESLVAETCLEVEKKDMVEETEIAGVKACRNGIEDMGEDSVKLEVEP 120

Query: 121  DIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNL 180
            DIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNL
Sbjct: 121  DIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNL 180

Query: 181  PDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTDVANLVERKEVEENADDPKDSKDIEV 240
            PDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTD ANLVE+KEVEE+ADDPKDSKDIEV
Sbjct: 181  PDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTDAANLVEKKEVEEHADDPKDSKDIEV 240

Query: 241  AKQETFSMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGETLKMENDSSKT 300
            AKQETFSMEDGKL VPVQLVEKSELK+SLVDGAVVEEGRTENLADRTGETLKMEN+SS T
Sbjct: 241  AKQETFSMEDGKLRVPVQLVEKSELKESLVDGAVVEEGRTENLADRTGETLKMENESSNT 300

Query: 301  DEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLADETPKIKGVH 360
            DEVGLANF+GEIDGAVTMENTEDKTVEVDGMCLEDKAAD TT  T GNLADETP+IKGVH
Sbjct: 301  DEVGLANFSGEIDGAVTMENTEDKTVEVDGMCLEDKAADVTT--TMGNLADETPEIKGVH 360

Query: 361  VTDDNIEVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENVTNYTAESMENL 420
            VTDD+IE+LKIENVEDREAGVQGLGVADESAEVGKIENLVDETA AENVTNYTAESMENL
Sbjct: 361  VTDDSIEMLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAGAENVTNYTAESMENL 420

Query: 421  DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEE 480
            DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEE
Sbjct: 421  DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEE 480

Query: 481  VDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPS 540
            VDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPS
Sbjct: 481  VDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPS 540

Query: 541  CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFC 600
            CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFC
Sbjct: 541  CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFC 600

Query: 601  EACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPW 660
            EACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPW
Sbjct: 601  EACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPW 660

Query: 661  KGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSMP 720
            KGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSMP
Sbjct: 661  KGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSMP 720

Query: 721  IIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRK 780
            IIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRK
Sbjct: 721  IIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRK 780

Query: 781  SQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYT 840
            SQIICDSRLE+LFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYT
Sbjct: 781  SQIICDSRLESLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYT 840

Query: 841  DASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHE 900
            DASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHE
Sbjct: 841  DASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHE 900

Query: 901  KVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISI 960
            KVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISI
Sbjct: 901  KVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISI 960

Query: 961  DIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLR 1020
            DIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLR
Sbjct: 961  DIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLR 1020

Query: 1021 DRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDE 1080
            DRASEKGRRKE     + ECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDE
Sbjct: 1021 DRASEKGRRKE-----LRECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDE 1080

Query: 1081 ADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSTNRDLSRNLSGKGF 1140
            ADDKRQETYTLSRGSGFSRRTREPVSPGKAG+NLNDSWSGTRNFSSTNRDLSRNLSGKGF
Sbjct: 1081 ADDKRQETYTLSRGSGFSRRTREPVSPGKAGANLNDSWSGTRNFSSTNRDLSRNLSGKGF 1140

Query: 1141 SNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAASELPSA 1200
            SNQ EDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAA+ELPSA
Sbjct: 1141 SNQVEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAATELPSA 1200

Query: 1201 ARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLR 1260
            ARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLR
Sbjct: 1201 ARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLR 1260

Query: 1261 VWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAKPQGATVQSGMDVQNT 1320
            VWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQA A ASSFVAKPQGATVQSGMDVQNT
Sbjct: 1261 VWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAHARASSFVAKPQGATVQSGMDVQNT 1320

Query: 1321 GTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGDRWSSDHGNKDFTSLP 1380
            GTSNPHTNPTSYGQSAGGRWKS TEVSPTGIPASASIEVPRY+GDRWSSDHGNKDFTSLP
Sbjct: 1321 GTSNPHTNPTSYGQSAGGRWKSHTEVSPTGIPASASIEVPRYTGDRWSSDHGNKDFTSLP 1380

Query: 1381 SPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGP 1440
            SPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGP
Sbjct: 1381 SPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGP 1440

Query: 1441 INGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETQTVETNIS 1500
            INGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETQTVETNIS
Sbjct: 1441 INGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETQTVETNIS 1500

Query: 1501 SSMPPGQTLHRRWGEMSPAQNAATASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPP 1560
            SSMPPGQTLHRRWGEMSPAQNAATASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSST P
Sbjct: 1501 SSMPPGQTLHRRWGEMSPAQNAATASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTLP 1560

Query: 1561 NIPWGMGAPEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSSA 1620
            NIPWGMGAPEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQS A
Sbjct: 1561 NIPWGMGAPEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSPA 1620

Query: 1621 SVGTNPGWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAP 1680
            SVGTNPGWNAPGQGPPVRNNIQGWQAHSSIPPQVNATP WVAPNLGPMPPMNMNPNWHAP
Sbjct: 1621 SVGTNPGWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPSWVAPNLGPMPPMNMNPNWHAP 1680

Query: 1681 SANQGMWSNEHGKNGDRFSNPDSVSHGGDPGNGGKSWGMPPSYGGGGG--SSSRLPYNNK 1740
            SANQGMWSNEHGKNGDRFSNPDSVSHGGDPGNGGKSWGMPPSYGGGGG  SSSRLPYNNK
Sbjct: 1681 SANQGMWSNEHGKNGDRFSNPDSVSHGGDPGNGGKSWGMPPSYGGGGGGGSSSRLPYNNK 1740

Query: 1741 GQKLCKYHESGHCKKGGSCDYRHK 1756
            GQKLCKYHESGHCKKGGSCDYRHK
Sbjct: 1741 GQKLCKYHESGHCKKGGSCDYRHK 1757

BLAST of Cp4.1LG04g01060 vs. NCBI nr
Match: KAG6588420.1 (Zinc finger CCCH domain-containing protein 44, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 3337 bits (8652), Expect = 0.0
Identity = 1718/1764 (97.39%), Positives = 1731/1764 (98.13%), Query Frame = 0

Query: 1    MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRELHSNEEQHCLFQSAINELEFPSNS 60
            MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRELHSNEEQHCLFQSAINELEFPSNS
Sbjct: 1    MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRELHSNEEQHCLFQSAINELEFPSNS 60

Query: 61   SVESLQPSDAIRGDESLVAETCLEVE------ETEIAGVKACRNGIEDMGEDSVKLEVEP 120
            SVESLQ SDAIRGDESLVAETCLEVE      ETEIAGVKACRNGIEDMGEDSVKLEVEP
Sbjct: 61   SVESLQSSDAIRGDESLVAETCLEVEKKDMVEETEIAGVKACRNGIEDMGEDSVKLEVEP 120

Query: 121  DIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNL 180
            DIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNL
Sbjct: 121  DIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNL 180

Query: 181  PDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTDVANLVERKEVEENADDPKDSKDIEV 240
            PDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTD ANLVE+KEVEE+ADDPKDSKDIEV
Sbjct: 181  PDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTDAANLVEKKEVEEHADDPKDSKDIEV 240

Query: 241  AKQETFSMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGETLKMENDSSKT 300
            AKQETFSMEDGKL VPVQLVEKSELK+SLVDGAVVEEGRTENLADRTGETLKMEN+SS T
Sbjct: 241  AKQETFSMEDGKLRVPVQLVEKSELKESLVDGAVVEEGRTENLADRTGETLKMENESSNT 300

Query: 301  DEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLADETPKIKGVH 360
            DEVGLANF+GEIDGAVTMENTEDKTVEVDGMCLEDKAAD TT  T GNLADETP+IKGVH
Sbjct: 301  DEVGLANFSGEIDGAVTMENTEDKTVEVDGMCLEDKAADVTT--TMGNLADETPEIKGVH 360

Query: 361  VTDDNIEVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENVTNYTAESMENL 420
            VTDD+IE+LKIENVEDREAGVQGLGVADESAEVGKIENLVDETA AENVTNYTAESMENL
Sbjct: 361  VTDDSIEMLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAGAENVTNYTAESMENL 420

Query: 421  DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEE 480
            DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEE
Sbjct: 421  DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEE 480

Query: 481  VDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPS 540
            VDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPS
Sbjct: 481  VDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPS 540

Query: 541  CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFC 600
            CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFC
Sbjct: 541  CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFC 600

Query: 601  EACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPW 660
            EACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPW
Sbjct: 601  EACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPW 660

Query: 661  KGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSMP 720
            KGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSMP
Sbjct: 661  KGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSMP 720

Query: 721  IIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRK 780
            IIPDSQGP TDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRK
Sbjct: 721  IIPDSQGPCTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRK 780

Query: 781  SQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYT 840
            SQIICDSRLE+LFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYT
Sbjct: 781  SQIICDSRLESLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYT 840

Query: 841  DASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHE 900
            DASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHE
Sbjct: 841  DASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHE 900

Query: 901  KVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISI 960
            KVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISI
Sbjct: 901  KVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISI 960

Query: 961  DIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLR 1020
            DIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLR
Sbjct: 961  DIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLR 1020

Query: 1021 DRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDE 1080
            DRASEKGRRKE     + ECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDE
Sbjct: 1021 DRASEKGRRKE-----LRECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDE 1080

Query: 1081 ADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSTNRDLSRNLSGKGF 1140
            ADDKRQETYTLSRGSGFSRRTREPVSPGKAG+NLNDSWSGTRNFSSTNRDLSRNLSGKGF
Sbjct: 1081 ADDKRQETYTLSRGSGFSRRTREPVSPGKAGANLNDSWSGTRNFSSTNRDLSRNLSGKGF 1140

Query: 1141 SNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAASELPSA 1200
            SNQ EDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAA+ELPSA
Sbjct: 1141 SNQVEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAATELPSA 1200

Query: 1201 ARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLR 1260
             RSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLR
Sbjct: 1201 TRSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLR 1260

Query: 1261 VWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAKPQGATVQSGMDVQNT 1320
            VWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQA A ASS VAKPQGATVQSGMD QNT
Sbjct: 1261 VWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAHARASSLVAKPQGATVQSGMDFQNT 1320

Query: 1321 GTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGDRWSSDHGNKDFTSLP 1380
            GTSNPHTNPTSYGQSAGGRWKS TEVSPTGIPASASIEVPRY+GDRWSSDHGNKDFTSLP
Sbjct: 1321 GTSNPHTNPTSYGQSAGGRWKSHTEVSPTGIPASASIEVPRYTGDRWSSDHGNKDFTSLP 1380

Query: 1381 SPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGP 1440
            SPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGP
Sbjct: 1381 SPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGP 1440

Query: 1441 INGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETQTVETNIS 1500
            INGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETQTVETNIS
Sbjct: 1441 INGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETQTVETNIS 1500

Query: 1501 SSMPPGQTLHRRWGEMSPAQNAATASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPP 1560
            SSMPPGQTLHRRWGEMSPAQNAATASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPP
Sbjct: 1501 SSMPPGQTLHRRWGEMSPAQNAATASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPP 1560

Query: 1561 NIPWGMGAPEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSSA 1620
            NIPWGMGAPEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSSA
Sbjct: 1561 NIPWGMGAPEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSSA 1620

Query: 1621 SVGTNPGWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAP 1680
            SVGTNPGWNAPGQGPPVRNNIQGWQAHSSIPPQVN TP WVAPNLGPMPPMNMNPNWHAP
Sbjct: 1621 SVGTNPGWNAPGQGPPVRNNIQGWQAHSSIPPQVNPTPSWVAPNLGPMPPMNMNPNWHAP 1680

Query: 1681 SANQGMWSNEHGKNGDRFSNPDSVSHGGDPGNGGKSWGMPPSYGGGGG--SSSRLPYNNK 1740
            SANQGMWSNEHGKNGDRFSNPDSVSHGGDPGNGGKSWGMPPSYGGGGG  SSSRLPYNNK
Sbjct: 1681 SANQGMWSNEHGKNGDRFSNPDSVSHGGDPGNGGKSWGMPPSYGGGGGGGSSSRLPYNNK 1740

Query: 1741 GQKLCKYHESGHCKKGGSCDYRHK 1756
            GQKLCKYHESGHCKKGGSCDYRHK
Sbjct: 1741 GQKLCKYHESGHCKKGGSCDYRHK 1757

BLAST of Cp4.1LG04g01060 vs. NCBI nr
Match: XP_022970324.1 (zinc finger CCCH domain-containing protein 44-like [Cucurbita maxima])

HSP 1 Score: 3290 bits (8531), Expect = 0.0
Identity = 1699/1763 (96.37%), Positives = 1718/1763 (97.45%), Query Frame = 0

Query: 1    MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRELHSNEEQHCLFQSAINELEFPSNS 60
            MEAEENDSSKHDQPSSPLLSVDDGNDLDVKC THREL SNEEQHCLFQSAINE+EFPSNS
Sbjct: 1    MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCDTHRELRSNEEQHCLFQSAINEVEFPSNS 60

Query: 61   SVESLQPSDAIRGDESLVAETCLEVE------ETEIAGVKACRNGIEDMGEDSVKLEVEP 120
            SVESLQPSDAIRGDESLVAETCLEVE      ETEIAGVKACRNGIEDMGEDSVKLEVEP
Sbjct: 61   SVESLQPSDAIRGDESLVAETCLEVEKKDMVEETEIAGVKACRNGIEDMGEDSVKLEVEP 120

Query: 121  DIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNL 180
            DIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNL
Sbjct: 121  DIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNL 180

Query: 181  PDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTDVANLVERKEVEENADDPKDSKDIEV 240
            PDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTD ANLVE+KEVEENADDPKDSKDIEV
Sbjct: 181  PDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTDAANLVEKKEVEENADDPKDSKDIEV 240

Query: 241  AKQETFSMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGETLKMENDSSKT 300
            AKQE FSMED KLGVPVQLVEKSELK+SLVDGAVVEEGRTENLADRTGETLKMEN+SS T
Sbjct: 241  AKQENFSMEDEKLGVPVQLVEKSELKESLVDGAVVEEGRTENLADRTGETLKMENESSNT 300

Query: 301  DEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLADETPKIKGVH 360
            DEV LANFA EIDGAVTMENTEDKTVEVDGMCLEDKAADATT +  GNLADETP+IKGV 
Sbjct: 301  DEVELANFASEIDGAVTMENTEDKTVEVDGMCLEDKAADATTMS--GNLADETPEIKGVQ 360

Query: 361  VTDDNIEVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENVTNYTAESMENL 420
            VTDD+IE+LKIENVEDREAGVQ LGVADESAEVGKIENLVDETAEAENVTNYTAESMENL
Sbjct: 361  VTDDSIEMLKIENVEDREAGVQELGVADESAEVGKIENLVDETAEAENVTNYTAESMENL 420

Query: 421  DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEE 480
            DDKTAQ+EEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEE
Sbjct: 421  DDKTAQMEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEE 480

Query: 481  VDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPS 540
            VDEASKGSSGAKRKRGKN KAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPS
Sbjct: 481  VDEASKGSSGAKRKRGKNFKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPS 540

Query: 541  CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFC 600
            CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFC
Sbjct: 541  CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFC 600

Query: 601  EACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPW 660
            EACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLT DELVHAKNPW
Sbjct: 601  EACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTLDELVHAKNPW 660

Query: 661  KGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSMP 720
            KGSETLNSRPDSPGELYDGNVDGGSDL+VSENEESGSSKKRKAKRRSKSQAKETNSPSMP
Sbjct: 661  KGSETLNSRPDSPGELYDGNVDGGSDLEVSENEESGSSKKRKAKRRSKSQAKETNSPSMP 720

Query: 721  IIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRK 780
            IIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRK
Sbjct: 721  IIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRK 780

Query: 781  SQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYT 840
            SQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDV+INDLQGSVADTESSQLEGDGYT
Sbjct: 781  SQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVRINDLQGSVADTESSQLEGDGYT 840

Query: 841  DASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHE 900
            DASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHE
Sbjct: 841  DASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHE 900

Query: 901  KVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISI 960
            KVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISI
Sbjct: 901  KVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISI 960

Query: 961  DIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLR 1020
            DIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLR
Sbjct: 961  DIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLR 1020

Query: 1021 DRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDE 1080
            DRASEKGRRKE     + ECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDE
Sbjct: 1021 DRASEKGRRKE-----LRECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDE 1080

Query: 1081 ADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSTNRDLSRNLSGKGF 1140
            ADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSS NRDLSRNLSGKGF
Sbjct: 1081 ADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSMNRDLSRNLSGKGF 1140

Query: 1141 SNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAASELPSA 1200
            SNQGEDAIGSGEIINENSWSHGREGDVKK NKWDKQQVSPSSEMTA NA SGAASELPSA
Sbjct: 1141 SNQGEDAIGSGEIINENSWSHGREGDVKKTNKWDKQQVSPSSEMTAGNASSGAASELPSA 1200

Query: 1201 ARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLR 1260
            ARSVNSAA PSVGTTQNAA VNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLR
Sbjct: 1201 ARSVNSAA-PSVGTTQNAAIVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLR 1260

Query: 1261 VWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAKPQGATVQSGMDVQNT 1320
            VWRASDKQDDSLLLTDVLAGKIPKDTSSVDN+IQA AHASSF+AKPQG+TVQSGMDVQNT
Sbjct: 1261 VWRASDKQDDSLLLTDVLAGKIPKDTSSVDNNIQAHAHASSFIAKPQGSTVQSGMDVQNT 1320

Query: 1321 GTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGDRWSSDHGNKDFTSLP 1380
            GTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRY+GDRWSSDHGNKDFTSLP
Sbjct: 1321 GTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYTGDRWSSDHGNKDFTSLP 1380

Query: 1381 SPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGP 1440
            SPTPSSGGTKEQPFQM  PFASS GGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGP
Sbjct: 1381 SPTPSSGGTKEQPFQMPAPFASSGGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGP 1440

Query: 1441 INGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETQTVETNIS 1500
            INGLQNH SLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIET+TVETNIS
Sbjct: 1441 INGLQNHQSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETKTVETNIS 1500

Query: 1501 SSMPPGQTLHRRWGEMSPAQNAATASFSTPGLTNFSSSEPWRSMPPIPSNPP-HIQSSTP 1560
            SSMPPGQTLHRRWGEMSPAQNA+TASFSTPGLTNFSSSEPWRSMPPIPSNPP HIQSSTP
Sbjct: 1501 SSMPPGQTLHRRWGEMSPAQNASTASFSTPGLTNFSSSEPWRSMPPIPSNPPPHIQSSTP 1560

Query: 1561 PNIPWGMGAPEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSS 1620
            PNIPWGMG PEGQS VPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSS
Sbjct: 1561 PNIPWGMGPPEGQSNVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSS 1620

Query: 1621 ASVGTNPGWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHA 1680
            ASVGTNPGWNAPGQGPPVRNNIQGWQAHSSIPPQVNATP WVAPNLGPMPPMNMNPNWHA
Sbjct: 1621 ASVGTNPGWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPSWVAPNLGPMPPMNMNPNWHA 1680

Query: 1681 PSANQGMWSNEHGKNGDRFSNPDSVSHGGDPGNGGKSWGMPPSYGGGGGSSSRLPYNNKG 1740
            PSANQGMWSNEHGKNGDRFSNPDS SHGGDPGNGGKSWGMPPSYGGGG  SSRLPY+NKG
Sbjct: 1681 PSANQGMWSNEHGKNGDRFSNPDSGSHGGDPGNGGKSWGMPPSYGGGG--SSRLPYSNKG 1740

Query: 1741 QKLCKYHESGHCKKGGSCDYRHK 1756
            QKLCKYHESGHCKKGGSCDYRHK
Sbjct: 1741 QKLCKYHESGHCKKGGSCDYRHK 1753

BLAST of Cp4.1LG04g01060 vs. NCBI nr
Match: XP_038888157.1 (zinc finger CCCH domain-containing protein 19 [Benincasa hispida])

HSP 1 Score: 2630 bits (6817), Expect = 0.0
Identity = 1428/1817 (78.59%), Positives = 1522/1817 (83.76%), Query Frame = 0

Query: 1    MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRELHSNEEQHCLFQSAINELEFPSNS 60
            M+AEE+DSS HDQ SS  L VDDGN LDVKC T+REL SNE QHC+ +S+I E EFPSN+
Sbjct: 1    MDAEEDDSSYHDQKSS--LYVDDGN-LDVKCDTNRELQSNEVQHCVSKSSIIETEFPSNT 60

Query: 61   SVESLQPSDAIRGDESLVAETCLE------VEETEIAGVKACRNGIEDMGEDSVKLEVEP 120
             VESL P DAI GDE L  +T  E      +EETEIA  K  RN IEDM EDSVKLE+EP
Sbjct: 61   GVESLPPRDAILGDEILAVDTYSEMKKQDLIEETEIAEEKDFRNIIEDMAEDSVKLEIEP 120

Query: 121  DIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNL 180
            DIA +GLLG+ VF DVK     AEE KA+ EF EG+LL EM LVGGAE+QVEGNVLM N 
Sbjct: 121  DIAEVGLLGKRVFADVKNNTGVAEEEKALNEFAEGELLPEMVLVGGAEDQVEGNVLMANF 180

Query: 181  PDNTV-------GCGETD--TCLSDVLAE--LAETTPFVHGVDTTDVANLVERKEVEENA 240
             ++TV       GC ET   TCLSDVLAE  LAETT FV  VD TD  N+V++ EVEENA
Sbjct: 181  SEDTVVEGSATVGCAETTEKTCLSDVLAEETLAETTLFVQDVDVTDATNVVQKIEVEENA 240

Query: 241  DDPKDSKDIEVAKQETFSMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGE 300
            DD  DSKD EV KQE F++E+ +LGV VQL E SELK SLVD AV EEGR  NLA RTGE
Sbjct: 241  DDLNDSKDTEVPKQENFAVEERELGVLVQLAENSELKVSLVDEAVGEEGRMANLAGRTGE 300

Query: 301  TLKMENDSSKTDEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNL 360
            TLKMEN S+  DEVGL +FA      V + N EDKTVE+DGMC+EDKA D        NL
Sbjct: 301  TLKMENVSNTIDEVGLTHFA------VKIGNAEDKTVEMDGMCMEDKATDVAMME---NL 360

Query: 361  ADETPKIKGVHVTDDNIEVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENV 420
            ADETP+IKGV + D  IE LKIENVEDREAGVQGL VADES  VGK+EN  DE AEAE V
Sbjct: 361  ADETPEIKGVDLADYTIEELKIENVEDREAGVQGLAVADESPVVGKLENTADENAEAEGV 420

Query: 421  ---TNYTAESMENLD-DKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGE 480
               T+Y     EN++ DKTAQ EEIAM EE+ EAD  VYLVDEGIGSEE DA MTYLV E
Sbjct: 421  VQVTDYEVIKSENVEEDKTAQGEEIAMAEESAEADGMVYLVDEGIGSEETDATMTYLVEE 480

Query: 481  TEAAEEVEEMDVTEEVDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDL 540
            TEAAEEVEEMDVTEEVDE +KGSSG+KRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDL
Sbjct: 481  TEAAEEVEEMDVTEEVDEPNKGSSGSKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDL 540

Query: 541  VLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI 600
            VLCDRRGCPKAYHP+CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI
Sbjct: 541  VLCDRRGCPKAYHPACINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI 600

Query: 601  KNAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGS 660
            KNAVILCVRGNKGFCE CMRFVMLIEKNEQG+TEKGQIDFNDK SWEYLFKEYWTDLKGS
Sbjct: 601  KNAVILCVRGNKGFCETCMRFVMLIEKNEQGNTEKGQIDFNDKNSWEYLFKEYWTDLKGS 660

Query: 661  LSLTFDELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKR 720
            LSLTFDELVHAKNPWKGSETL S+PDSPGEL DGNVD GSDLDVSENEESGSSKKRKAK+
Sbjct: 661  LSLTFDELVHAKNPWKGSETLTSKPDSPGELCDGNVDRGSDLDVSENEESGSSKKRKAKK 720

Query: 721  RSKSQAKETNSPSMPIIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALL 780
            RSKSQAKE +SP++P  PDSQG STDNNVEWASKELLEFVMHMKNGD+TVLSQFDVQALL
Sbjct: 721  RSKSQAKEISSPTIPAKPDSQGLSTDNNVEWASKELLEFVMHMKNGDKTVLSQFDVQALL 780

Query: 781  LEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGS 840
            LEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLI+ED Q NDLQGS
Sbjct: 781  LEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIKEDAQTNDLQGS 840

Query: 841  VADTESSQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRR 900
            VADTESSQLE DG  D  GKT+KEKKRR RKKGD RGLQSNLDDYAAID HNINLIYL+R
Sbjct: 841  VADTESSQLEADG-ADGLGKTKKEKKRRTRKKGDDRGLQSNLDDYAAIDTHNINLIYLKR 900

Query: 901  NLVEYLIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDI 960
            NLVEYLIEDEESF  KVVGSFVRIRISG+AQKQDLYRLVQVVGTSKASEPYKVGKKMTDI
Sbjct: 901  NLVEYLIEDEESFLVKVVGSFVRIRISGSAQKQDLYRLVQVVGTSKASEPYKVGKKMTDI 960

Query: 961  LLEILNLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARV 1020
            LLEILNLNKTEVISID+ISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARV
Sbjct: 961  LLEILNLNKTEVISIDVISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARV 1020

Query: 1021 KDWMETEIVRLSHLRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHT 1080
            KDWMETEIVRLSHLRDRASEKGRRKE     + ECVEKLQLLKTPEERQRR+EE+P IHT
Sbjct: 1021 KDWMETEIVRLSHLRDRASEKGRRKE-----LRECVEKLQLLKTPEERQRRIEEIPEIHT 1080

Query: 1081 DPNMDPSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFS 1140
            DPNMDPSHESEDEDEADDK++ETYTLSR + F RR REPVSPGK GSNLND WSGTRNFS
Sbjct: 1081 DPNMDPSHESEDEDEADDKKRETYTLSRSTSFGRRMREPVSPGKGGSNLNDPWSGTRNFS 1140

Query: 1141 STNRDLSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMT 1200
            + NRD++RNLSGKGFSNQG+DAIGSGEIINE+SW+ GRE DVKK NKWDKQ VSPSSE+T
Sbjct: 1141 NMNRDMNRNLSGKGFSNQGDDAIGSGEIINEHSWAQGRERDVKKTNKWDKQ-VSPSSEIT 1200

Query: 1201 ARNALSGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQ 1260
            ARNALSGA SE  +AA SVN A SPSVGTTQNAATVNE+EKIWRYQDPSGKVQGPFSMVQ
Sbjct: 1201 ARNALSGATSESSAAAHSVNPAVSPSVGTTQNAATVNESEKIWRYQDPSGKVQGPFSMVQ 1260

Query: 1261 LRKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAK 1320
            LRKWSNTGYFPADLRVWR SD+Q+DSLLLTDVLAGKIPKD  S  NS QA   +SSFV K
Sbjct: 1261 LRKWSNTGYFPADLRVWRISDQQEDSLLLTDVLAGKIPKDAPSTSNSFQAHP-SSSFVGK 1320

Query: 1321 PQGATVQSGMDVQNTGTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGD 1380
            PQ AT+QSGMD QN GTSN H+N TSY QS+GGRWKSQ EVSPTG PASASIEVPRYSGD
Sbjct: 1321 PQVATLQSGMDGQNAGTSNSHSNQTSYDQSSGGRWKSQNEVSPTGRPASASIEVPRYSGD 1380

Query: 1381 RWSSDHGNKDFTSLPSPTPSSGGTKEQPFQMATPFA-----SSAGGGSLHGSSLMQGSEN 1440
            RWSSDHGNK+FTSLPSPTPSSGGTKEQPFQ+A PF      S  GGG LHGSSLMQGSEN
Sbjct: 1381 RWSSDHGNKNFTSLPSPTPSSGGTKEQPFQVAAPFKEAKSLSGGGGGGLHGSSLMQGSEN 1440

Query: 1441 DSLRSHSGLNAAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNPAA-DIKSISANLHSL 1500
            DSLR HSG N++EKG GLG IN  QNHHS PVR S  IDD  VNPAA DIKSISANLHSL
Sbjct: 1441 DSLRIHSGRNSSEKGMGLGLINAFQNHHSQPVRLSPTIDDASVNPAAADIKSISANLHSL 1500

Query: 1501 VQSINSRNPPIETQ------------------------TVETNISSSMPPGQTLHRRWGE 1560
            VQSINSRNPPIE Q                         VE+NISSSMPP QTLH RWGE
Sbjct: 1501 VQSINSRNPPIEAQGQGSGSVLKRETNASESWQNAQSLKVESNISSSMPPAQTLHSRWGE 1560

Query: 1561 MSPAQNAATA--------SFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPPNIPWGMG 1620
            MSPAQNAAT+        SFS+PGL+NF SS+PWRS PPIPSNP HIQSST PN+PWGMG
Sbjct: 1561 MSPAQNAATSFSAGSSTSSFSSPGLSNFPSSDPWRSTPPIPSNPQHIQSSTSPNLPWGMG 1620

Query: 1621 APEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPP-NATGMMWGSSAQSSASVGTNP 1680
            APEGQSTVPRPG E+QN +WGPMPSGNPNM W P+APP NATGMMWG++AQSS   GTNP
Sbjct: 1621 APEGQSTVPRPGSETQNQAWGPMPSGNPNMGWGPTAPPPNATGMMWGTAAQSSGPAGTNP 1680

Query: 1681 GWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGM 1740
            GW  PGQGPP  NNIQGW AH    P VNATPGWVAPN+GP+PPMNMNP+W  PS NQ M
Sbjct: 1681 GWIPPGQGPPTGNNIQGWPAH----PPVNATPGWVAPNVGPLPPMNMNPSWPVPSPNQNM 1740

Query: 1741 WSNEHGKNGDRFSNP-DSVSHGGDPGNGGKSWGMPPSYGGGGGSSSRLPYNNKGQKLCKY 1756
            W NEH KNGDR+SN  DS SHGGDPGNGGKSWGM PS+GGGGGSS R PYN   Q+LCKY
Sbjct: 1741 WGNEHSKNGDRYSNQKDSGSHGGDPGNGGKSWGMQPSFGGGGGSS-RSPYNRV-QRLCKY 1791

BLAST of Cp4.1LG04g01060 vs. ExPASy TrEMBL
Match: A0A6J1EKF5 (zinc finger CCCH domain-containing protein 44-like OS=Cucurbita moschata OX=3662 GN=LOC111435162 PE=4 SV=1)

HSP 1 Score: 3338 bits (8656), Expect = 0.0
Identity = 1720/1764 (97.51%), Positives = 1734/1764 (98.30%), Query Frame = 0

Query: 1    MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRELHSNEEQHCLFQSAINELEFPSNS 60
            MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHT+RELHSNEEQHCLFQSAINELEFPSNS
Sbjct: 1    MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTYRELHSNEEQHCLFQSAINELEFPSNS 60

Query: 61   SVESLQPSDAIRGDESLVAETCLEVE------ETEIAGVKACRNGIEDMGEDSVKLEVEP 120
            SVESLQ SDAIRGDESLVAETCLEVE      ETEIAGVKACRNGIEDMGEDSVKLEVEP
Sbjct: 61   SVESLQSSDAIRGDESLVAETCLEVEKKDMVEETEIAGVKACRNGIEDMGEDSVKLEVEP 120

Query: 121  DIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNL 180
            DIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNL
Sbjct: 121  DIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNL 180

Query: 181  PDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTDVANLVERKEVEENADDPKDSKDIEV 240
            PDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTD ANLVE+KEVEE+ADDPKDSKDIEV
Sbjct: 181  PDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTDAANLVEKKEVEEHADDPKDSKDIEV 240

Query: 241  AKQETFSMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGETLKMENDSSKT 300
            AKQETFSMEDGKL VPVQLVEKSELK+SLVDGAVVEEGRTENLADRTGETLKMEN+SS T
Sbjct: 241  AKQETFSMEDGKLRVPVQLVEKSELKESLVDGAVVEEGRTENLADRTGETLKMENESSNT 300

Query: 301  DEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLADETPKIKGVH 360
            DEVGLANF+GEIDGAVTMENTEDKTVEVDGMCLEDKAAD TT  T GNLADETP+IKGVH
Sbjct: 301  DEVGLANFSGEIDGAVTMENTEDKTVEVDGMCLEDKAADVTT--TMGNLADETPEIKGVH 360

Query: 361  VTDDNIEVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENVTNYTAESMENL 420
            VTDD+IE+LKIENVEDREAGVQGLGVADESAEVGKIENLVDETA AENVTNYTAESMENL
Sbjct: 361  VTDDSIEMLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAGAENVTNYTAESMENL 420

Query: 421  DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEE 480
            DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEE
Sbjct: 421  DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEE 480

Query: 481  VDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPS 540
            VDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPS
Sbjct: 481  VDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPS 540

Query: 541  CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFC 600
            CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFC
Sbjct: 541  CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFC 600

Query: 601  EACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPW 660
            EACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPW
Sbjct: 601  EACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPW 660

Query: 661  KGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSMP 720
            KGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSMP
Sbjct: 661  KGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSMP 720

Query: 721  IIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRK 780
            IIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRK
Sbjct: 721  IIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRK 780

Query: 781  SQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYT 840
            SQIICDSRLE+LFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYT
Sbjct: 781  SQIICDSRLESLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYT 840

Query: 841  DASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHE 900
            DASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHE
Sbjct: 841  DASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHE 900

Query: 901  KVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISI 960
            KVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISI
Sbjct: 901  KVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISI 960

Query: 961  DIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLR 1020
            DIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLR
Sbjct: 961  DIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLR 1020

Query: 1021 DRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDE 1080
            DRASEKGRRKE     + ECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDE
Sbjct: 1021 DRASEKGRRKE-----LRECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDE 1080

Query: 1081 ADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSTNRDLSRNLSGKGF 1140
            ADDKRQETYTLSRGSGFSRRTREPVSPGKAG+NLNDSWSGTRNFSSTNRDLSRNLSGKGF
Sbjct: 1081 ADDKRQETYTLSRGSGFSRRTREPVSPGKAGANLNDSWSGTRNFSSTNRDLSRNLSGKGF 1140

Query: 1141 SNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAASELPSA 1200
            SNQ EDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAA+ELPSA
Sbjct: 1141 SNQVEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAATELPSA 1200

Query: 1201 ARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLR 1260
            ARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLR
Sbjct: 1201 ARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLR 1260

Query: 1261 VWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAKPQGATVQSGMDVQNT 1320
            VWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQA A ASSFVAKPQGATVQSGMDVQNT
Sbjct: 1261 VWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAHARASSFVAKPQGATVQSGMDVQNT 1320

Query: 1321 GTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGDRWSSDHGNKDFTSLP 1380
            GTSNPHTNPTSYGQSAGGRWKS TEVSPTGIPASASIEVPRY+GDRWSSDHGNKDFTSLP
Sbjct: 1321 GTSNPHTNPTSYGQSAGGRWKSHTEVSPTGIPASASIEVPRYTGDRWSSDHGNKDFTSLP 1380

Query: 1381 SPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGP 1440
            SPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGP
Sbjct: 1381 SPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGP 1440

Query: 1441 INGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETQTVETNIS 1500
            INGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETQTVETNIS
Sbjct: 1441 INGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETQTVETNIS 1500

Query: 1501 SSMPPGQTLHRRWGEMSPAQNAATASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPP 1560
            SSMPPGQTLHRRWGEMSPAQNAATASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSST P
Sbjct: 1501 SSMPPGQTLHRRWGEMSPAQNAATASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTLP 1560

Query: 1561 NIPWGMGAPEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSSA 1620
            NIPWGMGAPEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQS A
Sbjct: 1561 NIPWGMGAPEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSPA 1620

Query: 1621 SVGTNPGWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAP 1680
            SVGTNPGWNAPGQGPPVRNNIQGWQAHSSIPPQVNATP WVAPNLGPMPPMNMNPNWHAP
Sbjct: 1621 SVGTNPGWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPSWVAPNLGPMPPMNMNPNWHAP 1680

Query: 1681 SANQGMWSNEHGKNGDRFSNPDSVSHGGDPGNGGKSWGMPPSYGGGGG--SSSRLPYNNK 1740
            SANQGMWSNEHGKNGDRFSNPDSVSHGGDPGNGGKSWGMPPSYGGGGG  SSSRLPYNNK
Sbjct: 1681 SANQGMWSNEHGKNGDRFSNPDSVSHGGDPGNGGKSWGMPPSYGGGGGGGSSSRLPYNNK 1740

Query: 1741 GQKLCKYHESGHCKKGGSCDYRHK 1756
            GQKLCKYHESGHCKKGGSCDYRHK
Sbjct: 1741 GQKLCKYHESGHCKKGGSCDYRHK 1757

BLAST of Cp4.1LG04g01060 vs. ExPASy TrEMBL
Match: A0A6J1I569 (zinc finger CCCH domain-containing protein 44-like OS=Cucurbita maxima OX=3661 GN=LOC111469319 PE=4 SV=1)

HSP 1 Score: 3290 bits (8531), Expect = 0.0
Identity = 1699/1763 (96.37%), Positives = 1718/1763 (97.45%), Query Frame = 0

Query: 1    MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRELHSNEEQHCLFQSAINELEFPSNS 60
            MEAEENDSSKHDQPSSPLLSVDDGNDLDVKC THREL SNEEQHCLFQSAINE+EFPSNS
Sbjct: 1    MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCDTHRELRSNEEQHCLFQSAINEVEFPSNS 60

Query: 61   SVESLQPSDAIRGDESLVAETCLEVE------ETEIAGVKACRNGIEDMGEDSVKLEVEP 120
            SVESLQPSDAIRGDESLVAETCLEVE      ETEIAGVKACRNGIEDMGEDSVKLEVEP
Sbjct: 61   SVESLQPSDAIRGDESLVAETCLEVEKKDMVEETEIAGVKACRNGIEDMGEDSVKLEVEP 120

Query: 121  DIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNL 180
            DIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNL
Sbjct: 121  DIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNL 180

Query: 181  PDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTDVANLVERKEVEENADDPKDSKDIEV 240
            PDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTD ANLVE+KEVEENADDPKDSKDIEV
Sbjct: 181  PDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTDAANLVEKKEVEENADDPKDSKDIEV 240

Query: 241  AKQETFSMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGETLKMENDSSKT 300
            AKQE FSMED KLGVPVQLVEKSELK+SLVDGAVVEEGRTENLADRTGETLKMEN+SS T
Sbjct: 241  AKQENFSMEDEKLGVPVQLVEKSELKESLVDGAVVEEGRTENLADRTGETLKMENESSNT 300

Query: 301  DEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLADETPKIKGVH 360
            DEV LANFA EIDGAVTMENTEDKTVEVDGMCLEDKAADATT +  GNLADETP+IKGV 
Sbjct: 301  DEVELANFASEIDGAVTMENTEDKTVEVDGMCLEDKAADATTMS--GNLADETPEIKGVQ 360

Query: 361  VTDDNIEVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENVTNYTAESMENL 420
            VTDD+IE+LKIENVEDREAGVQ LGVADESAEVGKIENLVDETAEAENVTNYTAESMENL
Sbjct: 361  VTDDSIEMLKIENVEDREAGVQELGVADESAEVGKIENLVDETAEAENVTNYTAESMENL 420

Query: 421  DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEE 480
            DDKTAQ+EEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEE
Sbjct: 421  DDKTAQMEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEE 480

Query: 481  VDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPS 540
            VDEASKGSSGAKRKRGKN KAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPS
Sbjct: 481  VDEASKGSSGAKRKRGKNFKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPS 540

Query: 541  CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFC 600
            CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFC
Sbjct: 541  CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFC 600

Query: 601  EACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPW 660
            EACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLT DELVHAKNPW
Sbjct: 601  EACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTLDELVHAKNPW 660

Query: 661  KGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSMP 720
            KGSETLNSRPDSPGELYDGNVDGGSDL+VSENEESGSSKKRKAKRRSKSQAKETNSPSMP
Sbjct: 661  KGSETLNSRPDSPGELYDGNVDGGSDLEVSENEESGSSKKRKAKRRSKSQAKETNSPSMP 720

Query: 721  IIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRK 780
            IIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRK
Sbjct: 721  IIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRK 780

Query: 781  SQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYT 840
            SQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDV+INDLQGSVADTESSQLEGDGYT
Sbjct: 781  SQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVRINDLQGSVADTESSQLEGDGYT 840

Query: 841  DASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHE 900
            DASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHE
Sbjct: 841  DASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHE 900

Query: 901  KVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISI 960
            KVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISI
Sbjct: 901  KVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISI 960

Query: 961  DIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLR 1020
            DIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLR
Sbjct: 961  DIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLR 1020

Query: 1021 DRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDE 1080
            DRASEKGRRKE     + ECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDE
Sbjct: 1021 DRASEKGRRKE-----LRECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDE 1080

Query: 1081 ADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSTNRDLSRNLSGKGF 1140
            ADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSS NRDLSRNLSGKGF
Sbjct: 1081 ADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSMNRDLSRNLSGKGF 1140

Query: 1141 SNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAASELPSA 1200
            SNQGEDAIGSGEIINENSWSHGREGDVKK NKWDKQQVSPSSEMTA NA SGAASELPSA
Sbjct: 1141 SNQGEDAIGSGEIINENSWSHGREGDVKKTNKWDKQQVSPSSEMTAGNASSGAASELPSA 1200

Query: 1201 ARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLR 1260
            ARSVNSAA PSVGTTQNAA VNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLR
Sbjct: 1201 ARSVNSAA-PSVGTTQNAAIVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLR 1260

Query: 1261 VWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAKPQGATVQSGMDVQNT 1320
            VWRASDKQDDSLLLTDVLAGKIPKDTSSVDN+IQA AHASSF+AKPQG+TVQSGMDVQNT
Sbjct: 1261 VWRASDKQDDSLLLTDVLAGKIPKDTSSVDNNIQAHAHASSFIAKPQGSTVQSGMDVQNT 1320

Query: 1321 GTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGDRWSSDHGNKDFTSLP 1380
            GTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRY+GDRWSSDHGNKDFTSLP
Sbjct: 1321 GTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYTGDRWSSDHGNKDFTSLP 1380

Query: 1381 SPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGP 1440
            SPTPSSGGTKEQPFQM  PFASS GGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGP
Sbjct: 1381 SPTPSSGGTKEQPFQMPAPFASSGGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGP 1440

Query: 1441 INGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETQTVETNIS 1500
            INGLQNH SLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIET+TVETNIS
Sbjct: 1441 INGLQNHQSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETKTVETNIS 1500

Query: 1501 SSMPPGQTLHRRWGEMSPAQNAATASFSTPGLTNFSSSEPWRSMPPIPSNPP-HIQSSTP 1560
            SSMPPGQTLHRRWGEMSPAQNA+TASFSTPGLTNFSSSEPWRSMPPIPSNPP HIQSSTP
Sbjct: 1501 SSMPPGQTLHRRWGEMSPAQNASTASFSTPGLTNFSSSEPWRSMPPIPSNPPPHIQSSTP 1560

Query: 1561 PNIPWGMGAPEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSS 1620
            PNIPWGMG PEGQS VPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSS
Sbjct: 1561 PNIPWGMGPPEGQSNVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSS 1620

Query: 1621 ASVGTNPGWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHA 1680
            ASVGTNPGWNAPGQGPPVRNNIQGWQAHSSIPPQVNATP WVAPNLGPMPPMNMNPNWHA
Sbjct: 1621 ASVGTNPGWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPSWVAPNLGPMPPMNMNPNWHA 1680

Query: 1681 PSANQGMWSNEHGKNGDRFSNPDSVSHGGDPGNGGKSWGMPPSYGGGGGSSSRLPYNNKG 1740
            PSANQGMWSNEHGKNGDRFSNPDS SHGGDPGNGGKSWGMPPSYGGGG  SSRLPY+NKG
Sbjct: 1681 PSANQGMWSNEHGKNGDRFSNPDSGSHGGDPGNGGKSWGMPPSYGGGG--SSRLPYSNKG 1740

Query: 1741 QKLCKYHESGHCKKGGSCDYRHK 1756
            QKLCKYHESGHCKKGGSCDYRHK
Sbjct: 1741 QKLCKYHESGHCKKGGSCDYRHK 1753

BLAST of Cp4.1LG04g01060 vs. ExPASy TrEMBL
Match: A0A0A0K4G1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G006220 PE=4 SV=1)

HSP 1 Score: 2509 bits (6504), Expect = 0.0
Identity = 1371/1819 (75.37%), Positives = 1488/1819 (81.80%), Query Frame = 0

Query: 1    MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRE-LHSNEEQHCLFQSAINELEFPSN 60
            MEAEE+DSS  DQ SS L  VDDG  LDVKC T+RE L SNE+QHC+ +S+I E  F  N
Sbjct: 1    MEAEEDDSSYQDQKSSSLY-VDDGK-LDVKCDTNREELLSNEQQHCVSKSSIIETGFSPN 60

Query: 61   SSVESLQPSDAIRGDESLVAETCLE------VEETEIAGVKACRNGIEDMGEDSVKLEVE 120
            + VESL P DAI GDE L   TC E      VEE E       RN I+DMGEDSVKLE+E
Sbjct: 61   TVVESLPPRDAILGDEILAVGTCSEMEKKDLVEERERVEENDFRNIIQDMGEDSVKLEIE 120

Query: 121  PDIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVN 180
            P IA  GLL +  F+DVK+     EE KA++EF EG+LL  M  VG AENQVEGNVLM N
Sbjct: 121  PGIAKAGLLEQRAFDDVKKNTGVTEEEKALSEFAEGELLPGMVFVGVAENQVEGNVLMAN 180

Query: 181  LPDNTV-----GCGETD--TCLSDVLAE--LAETTPFVHGVDTTDVANLVERKEVEENAD 240
              ++TV     GC ET   TCLS VLAE  LAETTPFV GVD T   NLV++ EVEE+AD
Sbjct: 181  FSEHTVVDGSAGCVETTETTCLSYVLAEERLAETTPFVQGVDVTVATNLVQKTEVEEHAD 240

Query: 241  DPKDSKDIEVAKQETFSMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGET 300
            D  DSKD EV KQE F++E  +LGV VQL E SELK SLVDG V  EGRTENLADRTGET
Sbjct: 241  DTNDSKDTEVPKQENFAVEKMELGVQVQLEEDSELKVSLVDGVV--EGRTENLADRTGET 300

Query: 301  LKMENDSSKTDEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLA 360
            LKMEN SS ++EVGL +FA EI   V + N EDKT+E DGMC+E+KA D        NLA
Sbjct: 301  LKMENASSTSNEVGLTHFAVEIKETVNIGNDEDKTMETDGMCVEEKATDVGMME---NLA 360

Query: 361  DETPKIKGVHVTDDNIEVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENV- 420
            DETP+IKGV V D +IE LKIE++EDREAGVQGLG+ADES  V K+EN+ DE AE E V 
Sbjct: 361  DETPEIKGVDVADYSIEELKIEDMEDREAGVQGLGLADESPVVEKLENVADENAEPEGVQ 420

Query: 421  -TNYTAESM--ENL-DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGE 480
             T+YTAE +  EN+ DDKTAQ EE+AM EE  E DD VYLVDEGIGSEE D NMTYLV E
Sbjct: 421  VTDYTAEEVKSENVEDDKTAQGEEVAMGEEIAEPDDMVYLVDEGIGSEETDVNMTYLVEE 480

Query: 481  TEAAEEVEEMDVTEEVDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDL 540
            TEAAEEVEEMD TEEVDE +  SSG+KRKRGKNSKAPARV SRKKVEEDVCFICFDGGDL
Sbjct: 481  TEAAEEVEEMDATEEVDEPNISSSGSKRKRGKNSKAPARVASRKKVEEDVCFICFDGGDL 540

Query: 541  VLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI 600
            VLCDRRGCPKAYHP+CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI
Sbjct: 541  VLCDRRGCPKAYHPACINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI 600

Query: 601  KNAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGS 660
            KNAVILCVRGNKGFCE CMRFV  IEKNEQG+ EKGQIDFNDK SWEYLFKEYWTDLKGS
Sbjct: 601  KNAVILCVRGNKGFCETCMRFVTSIEKNEQGNKEKGQIDFNDKNSWEYLFKEYWTDLKGS 660

Query: 661  LSLTFDELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKR 720
            LSLTFDELVHAKNPWKGSETL SRPDSPGEL DGNVDGGSDLDVSENEESGSSKKRKAK+
Sbjct: 661  LSLTFDELVHAKNPWKGSETLTSRPDSPGELCDGNVDGGSDLDVSENEESGSSKKRKAKK 720

Query: 721  RSKSQAKETNSPSMPIIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALL 780
            RS+SQAKE +SPSMP    SQG STD+NVEW SKELLEFVMHMKNGDRTVLSQFDVQALL
Sbjct: 721  RSRSQAKEMSSPSMPATA-SQGLSTDDNVEWGSKELLEFVMHMKNGDRTVLSQFDVQALL 780

Query: 781  LEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGS 840
            LEYIKRNKLRDPRRKSQIICDSRLE+LFGKPRVGHFEMLKLLESHFLI+ED QINDL  S
Sbjct: 781  LEYIKRNKLRDPRRKSQIICDSRLESLFGKPRVGHFEMLKLLESHFLIKEDAQINDLHVS 840

Query: 841  VADTESSQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRR 900
            VA+TESSQLE DG TD SGK +KEKKRR RKK D+RGLQSNLDDYAAIDIHNINLIYL+R
Sbjct: 841  VAETESSQLEADG-TDGSGKIKKEKKRRTRKK-DERGLQSNLDDYAAIDIHNINLIYLKR 900

Query: 901  NLVEYLIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDI 960
            NLVEYLIEDEESFH+KVVGSFVRIRISG+AQKQDLYRLVQVVGTSKASEPYKVGK+MTDI
Sbjct: 901  NLVEYLIEDEESFHDKVVGSFVRIRISGSAQKQDLYRLVQVVGTSKASEPYKVGKRMTDI 960

Query: 961  LLEILNLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARV 1020
            LLEILNLNKTEV+SIDIISNQEFTE+ECKRLRQS+KCGI+NRLTVGDLQERAMSLQDARV
Sbjct: 961  LLEILNLNKTEVVSIDIISNQEFTEDECKRLRQSMKCGIINRLTVGDLQERAMSLQDARV 1020

Query: 1021 KDWMETEIVRLSHLRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHT 1080
            KDWMETEIVRLSHL                 + ECVEKLQLLKTPEERQRR+EE+P IH 
Sbjct: 1021 KDWMETEIVRLSHLHSLL-------------LRECVEKLQLLKTPEERQRRIEEIPEIHA 1080

Query: 1081 DPNMDPSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFS 1140
            DPNMDPSHESEDEDEADDKR+ETYTLSR + F RRTREPVSPGK GS+LNDSWSGTRNFS
Sbjct: 1081 DPNMDPSHESEDEDEADDKRRETYTLSRSTSFGRRTREPVSPGKGGSHLNDSWSGTRNFS 1140

Query: 1141 STNRDLSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMT 1200
            +TNRD+SRNLSGKGF+NQG+DAIGSGEIINE SW HGRE DVKK +KWDKQ VSPSSE+T
Sbjct: 1141 NTNRDMSRNLSGKGFANQGDDAIGSGEIINETSWGHGRERDVKKTSKWDKQ-VSPSSEIT 1200

Query: 1201 ARNALSGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQ 1260
            ARNALSGAASE  SAA SVN AAS SVGTTQNAATVNE+EKIW YQDPSGKVQGPFSMVQ
Sbjct: 1201 ARNALSGAASE-SSAAHSVNPAASSSVGTTQNAATVNESEKIWHYQDPSGKVQGPFSMVQ 1260

Query: 1261 LRKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAK 1320
            LRKWSNTGYFP DLR+WR SD+Q+DSLLLTDVLAGKI KDT    NS+Q   ++S FV +
Sbjct: 1261 LRKWSNTGYFPTDLRIWRISDQQEDSLLLTDVLAGKISKDTPLTSNSLQVHPNSSPFVGR 1320

Query: 1321 PQGATVQSGMDVQNTGTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGD 1380
            PQG T+QSG+D QN  +SN HTNPTSY QS+GGRWKSQ EVSPTG P S SI+VPRYSGD
Sbjct: 1321 PQGGTLQSGVDGQNASSSNSHTNPTSYDQSSGGRWKSQNEVSPTGRPVSGSIKVPRYSGD 1380

Query: 1381 RWSSDHGNKDFTSLPSPTPSSGGTKEQPFQMATPFA-----SSAGGGSLHGSSLMQGSEN 1440
            RWSSDHGNK+FT+LPSPTPSSGG+KEQPFQ+A  F      S   GG LHGSS+MQGSEN
Sbjct: 1381 RWSSDHGNKNFTNLPSPTPSSGGSKEQPFQVAASFMEAKSLSGTAGGGLHGSSVMQGSEN 1440

Query: 1441 DSLRSHSGLNAAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLV 1500
            DSLRSH G N++EKG G GPIN LQNH S PVR S IIDD  +NPAADI+SISANL SLV
Sbjct: 1441 DSLRSHLGRNSSEKGLGSGPINALQNHQSQPVRQSPIIDDASLNPAADIRSISANLQSLV 1500

Query: 1501 QSINSRNPPIE-------------TQT-----------VETNISSSMPPGQTLHRRWGEM 1560
            QSINSRNPPIE             T T           VE+N+SSSMPP QTLH RWGEM
Sbjct: 1501 QSINSRNPPIEAHGHGSGSILKRETDTSEAWQNAHSLKVESNVSSSMPPAQTLHSRWGEM 1560

Query: 1561 SPAQNAA---------TASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPPNIPWGMG 1620
            SPAQNAA         T+SFS+ G+++F SS+PWRS  PI SNP HIQ STPPN+PWGMG
Sbjct: 1561 SPAQNAAVTSFSAGSSTSSFSSAGMSSFPSSDPWRSTAPISSNPQHIQCSTPPNLPWGMG 1620

Query: 1621 APEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPP-NATGMMWGSSAQSSASVGTNP 1680
            APEGQSTVPR G ESQN +WGPMPSGNPNM W P+ PP NAT MMWG++AQSS    TNP
Sbjct: 1621 APEGQSTVPRQGSESQNQTWGPMPSGNPNMGWGPTGPPPNATAMMWGATAQSSGPAATNP 1680

Query: 1681 GWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGM 1740
            GW APGQGP   NN+QGW AHS +PP VNATPGWV PN+ PMPPMNMNP+W  PS NQ M
Sbjct: 1681 GWIAPGQGPAAGNNLQGWPAHSPMPPPVNATPGWVGPNVAPMPPMNMNPSWLVPSVNQNM 1740

Query: 1741 WSNEHGKNGDRFSNP-DSVSHGGDPGNGGKSWGMPPSYGGGGGS--SSRLPYNNKGQKLC 1756
            W NEHGKNG+RFSN  D  SHGGDPGNG KSWGM PS+GGGGG   +SR PYN   QKLC
Sbjct: 1741 WGNEHGKNGNRFSNQKDGGSHGGDPGNGDKSWGMQPSFGGGGGGGGNSRSPYNRV-QKLC 1793

BLAST of Cp4.1LG04g01060 vs. ExPASy TrEMBL
Match: A0A1S3BIT9 (zinc finger CCCH domain-containing protein 19 OS=Cucumis melo OX=3656 GN=LOC103490349 PE=4 SV=1)

HSP 1 Score: 2497 bits (6472), Expect = 0.0
Identity = 1357/1791 (75.77%), Positives = 1471/1791 (82.13%), Query Frame = 0

Query: 1    MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRE-LHSNEEQHCLFQSAINELEFPSN 60
            MEAEE+DSS HDQ SS L         DVKC T+RE LHSNE+QHC  +S+I E EF  N
Sbjct: 1    MEAEEDDSSYHDQKSSSL---------DVKCDTNREELHSNEQQHCASKSSIIETEFSPN 60

Query: 61   SSVESLQPSDAIRGDESLVAETCLE------VEETEIAGVKACRNGIEDMGEDSVKLEVE 120
            + VESL P DAI GDE L  +TC E      VEE EI   K  RN I+DM EDSVKLE+E
Sbjct: 61   TVVESLPPRDAILGDEILAVDTCSEMEKKDLVEEKEIKEEKDSRNIIQDMAEDSVKLEIE 120

Query: 121  PDIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVN 180
            PDI   GL  +  F+DVKE     EE KA++EF +G+LL EM  VG AENQ EGNVLM N
Sbjct: 121  PDIEKTGLSEQRAFDDVKENTGVTEEEKALSEFAQGELLPEMVFVGVAENQAEGNVLMAN 180

Query: 181  LPDNTV-----GCGETD--TCLSDVLAE--LAETTPFVHGVDTTDVANLVERKEVEENAD 240
              ++TV     GC ET   TCLSDVLAE  LAETT FV  VD TD  NLV++ +VEE+AD
Sbjct: 181  FSEHTVVDGSAGCVETTETTCLSDVLAEETLAETTLFVQDVDVTDAINLVQKTKVEEHAD 240

Query: 241  DPKDSKDIEVAKQETFSMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGET 300
            D  DSKD EV KQE FS+E  +LGV VQL E SELK SLVDGAV  EGRTENLADR GET
Sbjct: 241  DANDSKDTEVPKQENFSVEKMELGVRVQLEENSELKGSLVDGAV--EGRTENLADRPGET 300

Query: 301  LKMENDSSKTDEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLA 360
            LK EN SS T+EVGL + A EI   V + N EDKT+E+DGMC+EDKA   T      NL 
Sbjct: 301  LKRENASSTTNEVGLTHIAVEIKETVNVGNAEDKTIEMDGMCMEDKA---TAVGMMENLT 360

Query: 361  DETPKIKGVHVTDDNIEVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENV- 420
            DETP+IKGV V D +IE LKIE++EDREAGVQGLG+AD+S  V K+EN+ DE AEAE V 
Sbjct: 361  DETPEIKGVDVADYSIEELKIEDMEDREAGVQGLGLADKSPVVEKLENVADENAEAEGVQ 420

Query: 421  -TNYTAESM--ENL-DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGE 480
             T+YTAE +  EN+ DDKTAQ EEIAM EE  E DD VYLVDEGIGSEE D NMTYLV E
Sbjct: 421  VTDYTAEEVKSENVEDDKTAQGEEIAMAEEIAEPDDMVYLVDEGIGSEETDVNMTYLVEE 480

Query: 481  TEAAEEVEEMDVTEEVDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDL 540
            TEAAEEVEEMDVTEE+DE +  SSG+KRKRGKNSKAPARV SRKKVEEDVCFICFDGGDL
Sbjct: 481  TEAAEEVEEMDVTEEMDEPNISSSGSKRKRGKNSKAPARVASRKKVEEDVCFICFDGGDL 540

Query: 541  VLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI 600
            VLCDRRGCPKAYHP+CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI
Sbjct: 541  VLCDRRGCPKAYHPACINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI 600

Query: 601  KNAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGS 660
            KNAVI CVRGNKGFCE CMRFV  IEKNEQGSTEKGQIDFNDK SWEYLFKEYW DLKGS
Sbjct: 601  KNAVIFCVRGNKGFCETCMRFVTSIEKNEQGSTEKGQIDFNDKNSWEYLFKEYWIDLKGS 660

Query: 661  LSLTFDELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKR 720
            LSLTFDELVHAKNPWKGSETL SRPDSPGEL DGNVDGGSDLDVSENEESGSSKKRKAK+
Sbjct: 661  LSLTFDELVHAKNPWKGSETLTSRPDSPGELCDGNVDGGSDLDVSENEESGSSKKRKAKK 720

Query: 721  RSKSQAKETNSPSMPIIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALL 780
            RS+SQAKE +SPSMP I DSQG S D+NVEWASKELLEFVMHMKNGDRTVLSQFDVQALL
Sbjct: 721  RSRSQAKEMSSPSMPAIADSQGLSADDNVEWASKELLEFVMHMKNGDRTVLSQFDVQALL 780

Query: 781  LEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGS 840
            LEYIKRNKLRDPRRKSQIICDSRLE+LFGKPRVGHFEMLKLLESHFLI+ED QINDL GS
Sbjct: 781  LEYIKRNKLRDPRRKSQIICDSRLESLFGKPRVGHFEMLKLLESHFLIKEDAQINDLHGS 840

Query: 841  VADTESSQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRR 900
            VA+TESSQLE DG TD SGK +KEKKRR RKK D+RGLQSNLDDYAAIDIHNINLIYL+R
Sbjct: 841  VAETESSQLEADG-TDGSGKIKKEKKRRTRKK-DERGLQSNLDDYAAIDIHNINLIYLKR 900

Query: 901  NLVEYLIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDI 960
            NLVEYLIEDEESFH+KVVGSFVRIRISG+AQKQDLYRLVQVVGTSKASEPYKVGK+MTDI
Sbjct: 901  NLVEYLIEDEESFHDKVVGSFVRIRISGSAQKQDLYRLVQVVGTSKASEPYKVGKRMTDI 960

Query: 961  LLEILNLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARV 1020
            LLEILNLNKTEV+SIDIISNQEFTE+ECKRLRQS+KCGI+NRLTVG+LQERAMSLQDARV
Sbjct: 961  LLEILNLNKTEVVSIDIISNQEFTEDECKRLRQSMKCGIINRLTVGELQERAMSLQDARV 1020

Query: 1021 KDWMETEIVRLSHLRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHT 1080
            KDWMETEIVRLSHLRDRASEKGRRKE     + ECVEKLQLLKTPEERQRR+EE+P IH 
Sbjct: 1021 KDWMETEIVRLSHLRDRASEKGRRKE-----LRECVEKLQLLKTPEERQRRIEEIPEIHA 1080

Query: 1081 DPNMDPSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFS 1140
            DPNMDPSHESEDEDEADDKR+ETYTLSR + F RRTREPVSPGK GS+LNDSWSGTRNFS
Sbjct: 1081 DPNMDPSHESEDEDEADDKRRETYTLSRSTSFGRRTREPVSPGKGGSHLNDSWSGTRNFS 1140

Query: 1141 STNRDLSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMT 1200
            +TNRD+SRNLSGKGFSNQG+DAIGSGEIINE SW HGRE DVKK +KWDKQ VSPSSEMT
Sbjct: 1141 NTNRDMSRNLSGKGFSNQGDDAIGSGEIINETSWGHGRERDVKKTSKWDKQ-VSPSSEMT 1200

Query: 1201 ARNALSGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQ 1260
            ARNALSGAASE  SAA SVN   S SVGTTQNAAT NE+EKIW YQDPSGKVQGPFSMVQ
Sbjct: 1201 ARNALSGAASE-SSAAHSVNPTVSSSVGTTQNAATANESEKIWHYQDPSGKVQGPFSMVQ 1260

Query: 1261 LRKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAK 1320
            LRKWSNTGYFP DLR+WR SD+Q+DSLLLTDVLAGKI KDT    NS+Q   ++S FV +
Sbjct: 1261 LRKWSNTGYFPTDLRIWRISDQQEDSLLLTDVLAGKISKDTPLTSNSLQVHPNSSPFVGR 1320

Query: 1321 PQGATVQSGMDVQNTGTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGD 1380
            PQG T+QSG+D QN  +SN HTNPTSY QS+GGRWKSQ EVSPTG P S SI+VPRYSG+
Sbjct: 1321 PQGGTLQSGVDGQNASSSNSHTNPTSYDQSSGGRWKSQNEVSPTGRPVSGSIKVPRYSGE 1380

Query: 1381 RWSSDHGNKDFTSLPSPTPSSGGTKEQPFQMATPFA-----SSAGGGSLHGSSLMQGSEN 1440
            RWSSDHGNK+FT+LPSPTPSSGGTKEQPFQ+A  F      S  GGG LHGSS+MQGSEN
Sbjct: 1381 RWSSDHGNKNFTNLPSPTPSSGGTKEQPFQVAASFMEAKSLSGTGGGGLHGSSVMQGSEN 1440

Query: 1441 DSLRSHSGLNAAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLV 1500
            D LRSH G N++EKG G GPIN LQNH S PVR S IIDD  +NPAADI+SISANL SLV
Sbjct: 1441 DPLRSHLGRNSSEKGMGSGPINALQNHQSQPVRQSPIIDDASLNPAADIRSISANLQSLV 1500

Query: 1501 QSINSRNPPIE------------------------TQTVETNISSSMPPGQTLHRRWGEM 1560
            QSINSRNPPIE                        +  VE+N+SSSMPP QTLH RWGEM
Sbjct: 1501 QSINSRNPPIEAHGRGSGSILKRETDTSEAWQNAQSHKVESNVSSSMPPAQTLHSRWGEM 1560

Query: 1561 SPAQNAA---------TASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPPNIPWGMG 1620
            SPAQNAA         T+SFS+ GL+NF SS+PWRS  PI +NP HIQ STPPN+ WGMG
Sbjct: 1561 SPAQNAAVTSFSAGSSTSSFSSAGLSNFPSSDPWRSTAPISNNPQHIQCSTPPNLAWGMG 1620

Query: 1621 APEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPP-NATGMMWGSSAQSSASVGTNP 1680
            APEGQSTVPRPG ESQN +WGPMPSGNPNM W P+APP NA+ MMWG++AQSS    TNP
Sbjct: 1621 APEGQSTVPRPGSESQNQTWGPMPSGNPNMGWGPTAPPPNASAMMWGTTAQSSGPAATNP 1680

Query: 1681 GWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGM 1730
            GW APGQGP   NNIQGW AHS +PP VNATPGWV  N+ PMPPMNMNP+W  PS NQ M
Sbjct: 1681 GWIAPGQGPAAGNNIQGWPAHSPMPPPVNATPGWVGSNVAPMPPMNMNPSWLVPSVNQNM 1740

BLAST of Cp4.1LG04g01060 vs. ExPASy TrEMBL
Match: A0A6J1C404 (zinc finger CCCH domain-containing protein 19 OS=Momordica charantia OX=3673 GN=LOC111007718 PE=4 SV=1)

HSP 1 Score: 2496 bits (6469), Expect = 0.0
Identity = 1383/1856 (74.52%), Positives = 1489/1856 (80.23%), Query Frame = 0

Query: 1    MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRELHSNEEQHCLFQSAINELEFPSNS 60
            MEAEE+DS+KHD+ SS  L VDD N   VKC   +EL SNE QHC+F+S+I E E P NS
Sbjct: 1    MEAEEDDSTKHDETSS--LYVDDAN-FGVKCDALQELQSNE-QHCVFKSSIIETELPPNS 60

Query: 61   SVESLQPSDAIRGDESLVAETCLEVE------ETEIAGVKACRNGIEDMGEDSVKLEVEP 120
            SVESL P DAI GD+ L A+   E+E      ET++A  KA  N +ED+ EDSVKLE+EP
Sbjct: 61   SVESLPPRDAILGDQGLAADAYSEMEKEGVMEETQMAEEKAFPNILEDLAEDSVKLEIEP 120

Query: 121  DIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNL 180
            DIA  GLLG+ VF+DV  EDAGAEE KAV  F +G+LL EM L G AEN+VEGN LM N 
Sbjct: 121  DIADTGLLGQRVFSDVT-EDAGAEE-KAVTGFSDGELLHEMVLAGVAENRVEGNALMDNF 180

Query: 181  PDNTV-------GCGET--DTCLSDVLA--ELAETTPFVHGVDTTDVANLVERKEVEENA 240
             +NTV       GC E    T  +DV+A   LAETT FV     TD  NL E+ EV  +A
Sbjct: 181  QENTVLEGAAALGCAEIIGKTRSTDVVAVETLAETTLFVQDAGVTDATNLAEKTEVAMDA 240

Query: 241  DDPKDSKDIEVAKQETFSM-EDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTG 300
            D P+D+ D+EV KQETF+  ED +LGVP  L E S+LK SLVDG  VEEGRT NLAD TG
Sbjct: 241  DGPEDADDMEVPKQETFATTEDRELGVP--LSENSKLKVSLVDGTAVEEGRTTNLADNTG 300

Query: 301  ETLKMENDSSKTDEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGN 360
            ETLKMEN S  T E+GL N AGEID AV MEN EDKTVE+DGMC+EDK A+        N
Sbjct: 301  ETLKMENFSRNTAEMGLTNLAGEIDEAVNMENAEDKTVELDGMCMEDKTAEVVMMME--N 360

Query: 361  LADETPKIKGVHVTDDNIEVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAEN 420
            LADET  IKGV + D  IEVLKIENVEDREAG+Q  GVADESAEVGKIE++V+E +EAE 
Sbjct: 361  LADETRDIKGVDIADYGIEVLKIENVEDREAGMQEFGVADESAEVGKIESMVEEASEAEG 420

Query: 421  ---VTNYTAE--SMENLDDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLV 480
               V +YTAE  +MEN  D   Q+EEIAM EETEEADD VYLVD+GIGSEE DANMTYL+
Sbjct: 421  GVQVADYTAEVETMENEVDNNEQVEEIAMAEETEEADDMVYLVDDGIGSEETDANMTYLL 480

Query: 481  GETEAAEEVEEMDVTEEVDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGG 540
             ETEAAEE EE      VDE SKG SG+KRKRGKN KAPAR PSRKKVEEDVCFICFDGG
Sbjct: 481  EETEAAEEAEE------VDELSKGGSGSKRKRGKNPKAPARAPSRKKVEEDVCFICFDGG 540

Query: 541  DLVLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKG 600
            +LVLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEK+AHYMCYTCTFSLCKG
Sbjct: 541  NLVLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKSAHYMCYTCTFSLCKG 600

Query: 601  CIKNAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLK 660
            CIKNAVILCVRGNKGFCEACMRFV LIEKNEQG+TEKGQIDFNDK SWEYLFKEYWTDLK
Sbjct: 601  CIKNAVILCVRGNKGFCEACMRFVTLIEKNEQGNTEKGQIDFNDKNSWEYLFKEYWTDLK 660

Query: 661  GSLSLTFDELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKA 720
            GSLSLTFDELVHAKNP KGSETL SRPDSPGEL DG VDGGSDLDVSENEES SSKKRKA
Sbjct: 661  GSLSLTFDELVHAKNPCKGSETLTSRPDSPGELCDGTVDGGSDLDVSENEESSSSKKRKA 720

Query: 721  KRRSKSQAKETNSPSMPIIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQA 780
            K+RSK+ AKE +SPSMPIIPDSQG STDN+VEWASKELLEFVMHM+NGD+TVLSQFDVQA
Sbjct: 721  KKRSKTHAKEMSSPSMPIIPDSQGLSTDNDVEWASKELLEFVMHMRNGDKTVLSQFDVQA 780

Query: 781  LLLEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQ 840
            LLLEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFL++ED Q NDLQ
Sbjct: 781  LLLEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLVKEDAQTNDLQ 840

Query: 841  GSVADTESSQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYL 900
            GSVADTESSQLE DG+TDA GKT+KEKKRR RKKGD+RGLQSNLDDYAAIDIHNINLIYL
Sbjct: 841  GSVADTESSQLEADGHTDALGKTKKEKKRRTRKKGDERGLQSNLDDYAAIDIHNINLIYL 900

Query: 901  RRNLVEYLIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMT 960
            RRNLVEYLIEDEESFH+KVVGSFVRIRISG+AQKQDLYRLVQV+GTSKASEPYKVGKKMT
Sbjct: 901  RRNLVEYLIEDEESFHDKVVGSFVRIRISGSAQKQDLYRLVQVMGTSKASEPYKVGKKMT 960

Query: 961  DILLEILNLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDA 1020
            +I+LEILNLNKTEVISIDIISNQEFTEEECKRLRQS+KCGILNRLTVGDLQERAMSLQDA
Sbjct: 961  NIMLEILNLNKTEVISIDIISNQEFTEEECKRLRQSMKCGILNRLTVGDLQERAMSLQDA 1020

Query: 1021 RVKDWMETEIVRLSHLRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGI 1080
            RVKDWMETEIVRLSHLRDRASEKGRRKE     + ECVEKLQLLKTPEERQRRLEE+P I
Sbjct: 1021 RVKDWMETEIVRLSHLRDRASEKGRRKE-----LRECVEKLQLLKTPEERQRRLEEIPEI 1080

Query: 1081 HTDPNMDPSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRN 1140
            HTDPNMDPSHES+DEDE DDKR ETY L R + F RRTR+PVSPGK GSNLNDSWSG RN
Sbjct: 1081 HTDPNMDPSHESDDEDETDDKRPETYALPRSTSFGRRTRDPVSPGKGGSNLNDSWSGMRN 1140

Query: 1141 FSSTNRDLSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSE 1200
            FS+TNRDLSR  SGKGFS QG+DAIGSGEIINENSWS GRE DVKK NKWDKQQVSPSSE
Sbjct: 1141 FSNTNRDLSR--SGKGFSAQGDDAIGSGEIINENSWSQGRERDVKKINKWDKQQVSPSSE 1200

Query: 1201 MTARNALSGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSM 1260
            +  RNALSGAASE  SAA SVN AASPSVGTTQNAA VNE+EKIWRYQDPSGKVQGPFSM
Sbjct: 1201 IIVRNALSGAASESSSAAHSVNPAASPSVGTTQNAAIVNESEKIWRYQDPSGKVQGPFSM 1260

Query: 1261 VQLRKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAH-ASSF 1320
            VQLRKWSNTGYFPADLRVWR SDKQDDSLLLTDVLAGKI KDTS  DN+ Q   H ASSF
Sbjct: 1261 VQLRKWSNTGYFPADLRVWRTSDKQDDSLLLTDVLAGKILKDTS--DNNFQMPTHHASSF 1320

Query: 1321 VA-KPQGATVQSGMDVQNTGTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIE--- 1380
            V  KPQGAT Q GMD QNTGTSNP  NP S+G S+GGRWKSQ EVSPTG PASASIE   
Sbjct: 1321 VGGKPQGATSQLGMDGQNTGTSNPLNNPISFGHSSGGRWKSQNEVSPTGRPASASIEAPR 1380

Query: 1381 ---------------------------VPRYSGDRWSSDHGNKDFTSLPSPTPSSGGTKE 1440
                                       VPRYSG+RWSSDHGNK+FTSLPSPTPS GGTKE
Sbjct: 1381 YSGERPASASIEAPRYSGERPASASIEVPRYSGERWSSDHGNKNFTSLPSPTPSPGGTKE 1440

Query: 1441 QPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGPINGLQNHHSLP 1500
            QPFQ+A  F  +       G+  +QG+E DSLRSHSG+N+AEKG GLG IN  QNH S P
Sbjct: 1441 QPFQVAASFLEARTLSLSGGNGGLQGAEKDSLRSHSGMNSAEKGMGLGSINTYQNH-SQP 1500

Query: 1501 VRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIETQ------------------ 1560
             RPS+I+DD  V  AADIKSISANL SLVQSI++RNP IETQ                  
Sbjct: 1501 ARPSTILDDASVKAAADIKSISANLQSLVQSISNRNPHIETQGHGSGSILKREMSNSVSM 1560

Query: 1561 --------------TVETNISSSMPPGQTLHRRWGEMSPAQNAA---------TASFSTP 1620
                           VE NISSSM P Q  H RWGEMSP QNAA         T SFSTP
Sbjct: 1561 LGNELQSWENAKSLKVEPNISSSMLPAQPPHSRWGEMSPVQNAAATSFSAGTPTGSFSTP 1620

Query: 1621 GLTNFSSSEPWRSMPPIPSNPPHIQSSTPPNIPWGMGAPEGQSTVPRPGLESQNHSWGPM 1680
            GLT + SS+ WRS PP+PSN PHIQSSTPPN+ WGMGAPEGQSTVPRPGLESQN  WGPM
Sbjct: 1621 GLTGYPSSDLWRSTPPMPSNQPHIQSSTPPNLSWGMGAPEGQSTVPRPGLESQNQGWGPM 1680

Query: 1681 PSGNPNMTWAPSAPPNA-TGMMWGSSAQSSASVGTNPGWNAPGQGPPVRNNIQGWQAHSS 1740
            PSGNPNM W  + PPNA  GMMW ++A SSA  GTNPGW+APGQGPP  N IQGW  H  
Sbjct: 1681 PSGNPNMGWGATPPPNAGAGMMWRAAAPSSAPAGTNPGWSAPGQGPPTENAIQGWPGHGP 1740

Query: 1741 IPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGMWSNEHGKNGDRFSNP-DSVSHGG 1755
            +PP VN TPGW AP+LGPMPPMN+NPNW APS+NQGMW NEHGKNGDRFSN  D  SHGG
Sbjct: 1741 MPPPVNPTPGWGAPSLGPMPPMNVNPNWSAPSSNQGMWGNEHGKNGDRFSNQKDGGSHGG 1800

BLAST of Cp4.1LG04g01060 vs. TAIR 10
Match: AT2G16485.1 (nucleic acid binding;zinc ion binding;DNA binding)

HSP 1 Score: 946.4 bits (2445), Expect = 3.2e-275
Identity = 707/1802 (39.23%), Positives = 962/1802 (53.39%), Query Frame = 0

Query: 64   SLQPSDAIRGDESLVAETCLE------------------VEETEIAGVKACRNGIEDMGE 123
            ++Q  D++ GD + V E  L+                   EE  +A      + +E+  E
Sbjct: 95   NIQEIDSVGGDAAAVEEVPLKSSSVVGEGREEEAGASIVKEEDFVAEANLSGDRLEENKE 154

Query: 124  DSVKLEVEP---DIAAMGLLGETVFNDVKEEDAGAEEV--KAVAEFGEGDLLCEMDLVGG 183
              V +E EP   +++   + G    ND +  + G + V      E  E DL  + + V  
Sbjct: 155  --VSMEEEPSSHELSVCEVNGVDSLNDEENREVGEQIVCGSMGGEEIESDLESKKEKVDV 214

Query: 184  AEN----QVEGNVLMVNLPDN-TVGCGETDTCLSDVLAELAET------TPFVHGVDTTD 243
             E     Q    V  + +PD+  V C    T +S     L E+         V  +   +
Sbjct: 215  IEEETTAQAASLVNAIEIPDDKEVACVAGFTEISSQDKGLDESGNGFLDEEPVKELQIGE 274

Query: 244  VANLVERKEVEENADDPKDSKDIEVAKQ--------ETFSMEDGKLGVPVQLVEKSELKQ 303
             A  +   + +E  D  +D  DI+V K+         T  +E   + + V  V      +
Sbjct: 275  GAKDLTDGDAKEGVDVTEDEMDIQVLKKSKEEEKVDSTTELEIETMRLEVHDVATEMSDK 334

Query: 304  SLVDGAVVEE--GRTEN----LADRTGETLKMENDSSKTDEVGLANFAGEID-------- 363
            +++  AVV +  G T N    + D   E +  ++++ K+ ++ +     E+D        
Sbjct: 335  TVISSAVVTQFTGETSNDKETVMDDVKEDVDKDSEAGKSLDIHVPEATEEVDTDVNYGVG 394

Query: 364  ---------GA------VTMENTEDKTVEVD-GMCLEDKAADATTKTTTGNLADETPKIK 423
                     GA      V +E   ++  E+   +   D+   +     T  +  +  + K
Sbjct: 395  IEKEGDGVGGAEEAGQTVDLEEIREENQELSKELAQVDETKISEMSEVTETMIKDEDQEK 454

Query: 424  GVHVTD--DNIEVLKIENVEDREAG---VQGLGVADESAE--VGKIENLVDETAEAENVT 483
              ++TD  +++E  +  +V D E G    + +GV +   E  +GK++         E  T
Sbjct: 455  DDNMTDLAEDVENHRDSSVADIEEGREDHEDMGVTETQKETVLGKVDRTKIAEVSEETDT 514

Query: 484  NYTAESMENLDDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAE 543
                E  E  D+ T   E++    ++  AD     ++EG  S+E    MT    ++  A+
Sbjct: 515  RIEDEDQEKDDEMTDVAEDVKTHGDSSVAD-----IEEGRESQE---EMTETQEDSVMAD 574

Query: 544  EVEEMDVTEEVDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDR 603
            E       EEV+E +K S+G KRKRG+N+K      + KK EEDVCF+CFDGGDLVLCDR
Sbjct: 575  E-----EPEEVEEENK-SAGGKRKRGRNTKTVK--GTGKKKEEDVCFMCFDGGDLVLCDR 634

Query: 604  RGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVI 663
            RGC KAYHPSC++RDEAFF+ KG+WNCGWHLCS CEKTA Y+CYTC FSLCKGC K+AV 
Sbjct: 635  RGCTKAYHPSCVDRDEAFFQTKGKWNCGWHLCSKCEKTATYLCYTCMFSLCKGCAKDAVF 694

Query: 664  LCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTF 723
             C+RGNKG CE CM  V LIE+ +Q   E  Q+DFNDKTSWEYLFK+YW DLK  LSL+ 
Sbjct: 695  FCIRGNKGLCETCMETVKLIERKQQ-EKEPAQLDFNDKTSWEYLFKDYWIDLKTQLSLSP 754

Query: 724  DELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQ 783
            +EL  AK P KG ET  S+  +  E  D   DGGSD D        S KKRK + RSKS 
Sbjct: 755  EELDQAKRPLKGHETNASKQGTASET-DYVTDGGSDSD-------SSPKKRKTRSRSKSG 814

Query: 784  AKETNSPSMPIIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIK 843
            + E       I+       +D  +EWASKELL+ V+HM+ GDR+ L   +VQ LLL YIK
Sbjct: 815  SAE------KILSSGDKNLSDETMEWASKELLDLVVHMRRGDRSFLPMLEVQTLLLAYIK 874

Query: 844  RNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTE 903
            R  LRDPRRKSQ+ICDSRL+NLFGK  VGHFEML LL+SHFL +E  Q +D+QG + DTE
Sbjct: 875  RYNLRDPRRKSQVICDSRLQNLFGKSHVGHFEMLNLLDSHFLKKEQNQADDIQGDIVDTE 934

Query: 904  S-SQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVE 963
              + ++ D   D   K+ K+KKR+ RKK  ++G QSNLDD+AA+D+HNINLIYLRR+LVE
Sbjct: 935  EPNHVDVDENLDHPVKSGKDKKRKTRKKNVRKGRQSNLDDFAAVDMHNINLIYLRRSLVE 994

Query: 964  YLIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEI 1023
             L+ED  +F EKV  +FVR+RISGN QKQDLYRLVQVVGTSKA EPYKVGKK TD +LEI
Sbjct: 995  DLLEDSTAFEEKVASAFVRLRISGN-QKQDLYRLVQVVGTSKAPEPYKVGKKTTDYVLEI 1054

Query: 1024 LNLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWM 1083
            LNL+KTEVISIDIISNQ+FTE+ECKRL+QSIKCG++NRLTVGD+QE+A++LQ+ RVK+ +
Sbjct: 1055 LNLDKTEVISIDIISNQDFTEDECKRLKQSIKCGLINRLTVGDIQEKAIALQEVRVKNLL 1114

Query: 1084 ETEIVRLSHLRDRASEKGRRKEYPF----------YNIMECVEKLQLLKTPEERQRRLEE 1143
            E EI+R SHLRDRAS+ GRRKEYP+            + ECVEKLQLLK+PEERQRRLEE
Sbjct: 1115 EAEILRFSHLRDRASDMGRRKEYPYLLKLSNSLTMLTLRECVEKLQLLKSPEERQRRLEE 1174

Query: 1144 LPGIHTDPNMDPSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWS 1203
            +P IH DP MDP  ESEDEDE ++K +E     R S F+RR R+P+SP K G + N+SW+
Sbjct: 1175 IPEIHADPKMDPDCESEDEDEKEEKEKEKQLRPRSSSFNRRGRDPISPRKGGFSSNESWT 1234

Query: 1204 GTRNFSST--NRDLSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQ 1263
            GT N+S+T  NR+LSR+ SG+G + +G+    S + ++++ W+  RE +V+     +K +
Sbjct: 1235 GTSNYSNTSANRELSRSYSGRGSTGRGDYLGSSDDKVSDSMWTSAREREVQPSLGSEKPR 1294

Query: 1264 VSPSSEMTARNALSGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKV 1323
                 E  AR++ + A  EL     S  S A P+V  +Q     N++EKIW Y+DPSGKV
Sbjct: 1295 SVSIPETPARSSRAIAPPELSPRIASEISMAPPAV-VSQPVPKSNDSEKIWHYKDPSGKV 1354

Query: 1324 QGPFSMVQLRKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQA 1383
            QGPFSM QLRKW+NTGYFPA L +W+A++   DS+LLTD LAG   K T +VDNS   +A
Sbjct: 1355 QGPFSMAQLRKWNNTGYFPAKLEIWKANESPLDSVLLTDALAGLFQKQTQAVDNSYM-KA 1414

Query: 1384 HASSFVAKPQGATVQSGMDVQNTGTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASI 1443
              ++F       + QS     N G +                      ++PT      +I
Sbjct: 1415 QVAAF-------SGQSSQSEPNLGFA--------------------ARIAPT------TI 1474

Query: 1444 EVPRYSGDRWSSDHGNKDFTSLPSPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQG 1503
            E+PR S D WS         SLPSPTP+         Q+ TP A      S    +    
Sbjct: 1475 EIPRNSQDTWSQG------GSLPSPTPN---------QITTPTAKRRNFESRWSPTKPSP 1534

Query: 1504 SENDSLRSHSGLNAAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLH 1563
               +   ++S   + +  T    I  + N         ++   T   P  D  ++S N  
Sbjct: 1535 QSANQSMNYSVAQSGQSQTSRIDIPVVVN------SAGALQPQTYPIPTPDPINVSVNHS 1594

Query: 1564 SLVQSINSRNPPIETQTVETNI-SSSMPPGQTLHRRWGEMSPAQNAATASFSTPGLTNFS 1623
            + + S           +++T+   S+ P  Q     +G  SP   +   S S PG   F 
Sbjct: 1595 ATLHSPTPAGGKQSWGSMQTDHGGSNTPSSQNNSTSYGTPSP---SVLPSQSQPG---FP 1654

Query: 1624 SSEPWRSMPPIPSNPPHIQSSTPPNIPWGMGAPEGQSTVPRP-GLESQNHSWGPMPSGNP 1683
             S+ W+    +PS P     +      WGM          +P    +QN SWG   + NP
Sbjct: 1655 PSDSWK--VAVPSQP-----NAQAQAQWGMNMVNNNQNSAQPQAPANQNSSWG-QGTVNP 1714

Query: 1684 NMTWAPSAPPNATGMMWGSSAQSSASVGTNPGWNAPGQGPPVRNNIQGW--------QAH 1743
            NM W   A         GSS  S+    T+ GW AP QG        GW        Q+ 
Sbjct: 1715 NMGWVGPAQTGVNVNWGGSSVPSTVQGITHSGWVAPVQGQTQAYPNPGWGPTGHPQSQSQ 1772

Query: 1744 SSIPPQVNAT-PGWVAPNLGPMPPMNMNPNWHAPSANQGMWSNEHGKNGDRFSNPDSVSH 1756
            S +  Q   T  GW+ P  G +   N N NW                      N  ++  
Sbjct: 1775 SQVQAQAGTTGSGWMQPGQG-IQSGNSNQNWGT-------------------QNQTAIPS 1772

BLAST of Cp4.1LG04g01060 vs. TAIR 10
Match: AT3G51120.1 (DNA binding;zinc ion binding;nucleic acid binding;nucleic acid binding)

HSP 1 Score: 600.1 bits (1546), Expect = 5.6e-171
Identity = 483/1442 (33.50%), Positives = 683/1442 (47.36%), Query Frame = 0

Query: 412  ENLDDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDV 471
            + L     +L  +A  EE+      +  VD+      N      +   T A      M  
Sbjct: 6    KQLQQGVPELASLAGREESSVRGIDLMRVDQCEEIGVNQVPALSVPASTVAGAVAVPMSN 65

Query: 472  TEEV---DEASKGSSGAKRKRGKNSKAPARV----------PSRKKVEEDVCFICFDGGD 531
             +EV   DEA+      KRKRG+  +A A            P ++  EEDVCFICFDGGD
Sbjct: 66   EQEVKVIDEAAP----IKRKRGRPPRAQANTPLHIRPPPPPPKKEDKEEDVCFICFDGGD 125

Query: 532  LVLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGC 591
            LVLCDRR CPKAYHP+CI RDEAFFR   +WNCGWH+C  C+K + YMCYTCTFS+CK C
Sbjct: 126  LVLCDRRNCPKAYHPACIKRDEAFFRTTAKWNCGWHICGTCQKASSYMCYTCTFSVCKRC 185

Query: 592  IKNAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKG 651
            IK+A  + VRGN G C  C++ +MLIE   QG  E  ++DF+DK SWEYLFK YW  LK 
Sbjct: 186  IKDADYVIVRGNMGLCGTCIKPIMLIENIAQGDNEAVKVDFDDKLSWEYLFKVYWLCLKE 245

Query: 652  SLSLTFDELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAK 711
             LSLT DEL  A NPWK  E  N+ P    +    +      LDV+ N   G+ ++R   
Sbjct: 246  ELSLTVDELTRANNPWK--EVPNTAPKVESQ---NDHTNNRALDVAVN---GTKRRR--- 305

Query: 712  RRSKSQAKETNSPSMPIIPDSQGPS-----TDNNVEWASKELLEFVMHMKNGDRTVLSQF 771
                     ++SP++P   D + PS        +  WA+KELLEFV  MKNGD +VLSQF
Sbjct: 306  --------TSDSPTLPNKLDGKNPSNILKKAPGDTSWATKELLEFVSFMKNGDTSVLSQF 365

Query: 772  DVQALLLEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQ- 831
            DVQ LLL+YIK+  LRDP +KSQ++CD  L  LFGK RVGHFEMLKLLESH LI+E  + 
Sbjct: 366  DVQGLLLDYIKKKNLRDPLQKSQVLCDQMLVKLFGKQRVGHFEMLKLLESHVLIQEKPKG 425

Query: 832  INDLQGSVADTESSQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNI 891
                 G       SQ+E D   D      ++++R+MR+K D R    NLD YAAID+HNI
Sbjct: 426  AKTTNGETTHAVPSQIEEDSVHD---PMVRDRRRKMRRKTDGRVQNENLDAYAAIDVHNI 485

Query: 892  NLIYLRRNLVEYLIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKV 951
            NLIYLRR  +E L++D     EKVVG+ +RI++SG+ QK D++RLVQVVGTSKA   Y++
Sbjct: 486  NLIYLRRKFLESLLDDINKVDEKVVGTILRIKVSGSDQKLDIHRLVQVVGTSKAIASYQL 545

Query: 952  GKKMTDILLEILNLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAM 1011
            G K TD++LEILNL+K EVISID +S+Q  TE+ECKRLRQSIKCG+  RLTV D+ + A 
Sbjct: 546  GAKTTDVMLEILNLDKREVISIDQLSDQNITEDECKRLRQSIKCGLNKRLTVVDILKTAA 605

Query: 1012 SLQDARVKDWMETEIVRLSHLRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLE 1071
            +LQ  R+ + +E EI++L+HLRDRA                  +KL+LLK+PEERQR L+
Sbjct: 606  TLQAMRINEALEAEILKLNHLRDRA------------------KKLELLKSPEERQRLLQ 665

Query: 1072 ELPGIHTDPNMDPSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSW 1131
            E+P +HTDP+MDPSH   ++     ++Q+ +  ++  G          P   G NLN   
Sbjct: 666  EVPEVHTDPSMDPSHALSEDAGLGTRKQDNHVKAQSKG----------PQNKGVNLN--- 725

Query: 1132 SGTRNFSSTNRDLSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQV 1191
                       ++  N+  K              I+   +  H  + D  K +       
Sbjct: 726  -----------NVGNNVQKK----------YDAPILRSRNNVHADKDDCSKVHN------ 785

Query: 1192 SPSSEMTARNALSGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQ 1251
                                      NS+        Q     +E  +IW Y+DP+GK Q
Sbjct: 786  --------------------------NSS------NIQETGKDDEESEIWHYRDPTGKTQ 845

Query: 1252 GPFSMVQLRKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDT-----SSVDNSI 1311
            GPFSMVQLR+W ++G+FP  LR+WRA + QD+S+LLTD LAG+  K T     SS+   +
Sbjct: 846  GPFSMVQLRRWKSSGHFPPYLRIWRAHENQDESVLLTDALAGRFDKATTLPSSSSLPQEL 905

Query: 1312 QAQAHASSFVAKPQGATVQSGMDVQNTGTSNPHTNPTSYGQ-----------SAGGRWKS 1371
            +   H S           ++ M V  + TS+  +  T++             +  G+ + 
Sbjct: 906  KPSPHDSGRTGADVNCLQKNQMPVNTSATSSSSSTVTAHSNDPKEKQVVALVACSGKVED 965

Query: 1372 QTEVSP---TGIPASASI-----------EVP---RYSGDRWSSDHG------------- 1431
               V P      PAS S+           E P   +Y+  R   +H              
Sbjct: 966  GNSVRPQPQVSCPASISVVPGHVVTPDVRETPGTDQYNTVRADGNHNTTKTLEDETNGGS 1025

Query: 1432 --------------NKDFTSLPSPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQGS 1491
                             F   PSPTP S   ++   Q A    S +    + G S +  S
Sbjct: 1026 VSINGSVHAPNLNQESHFLDFPSPTPKS-SPEDLEAQAAETIQSLSSCVLVKGPSGVTWS 1085

Query: 1492 ENDSLRSHSGLNAAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHS 1551
               +  + +    +      G +            P  I  +T+V  A  +K I      
Sbjct: 1086 TTTTSTTDAATTTSSVVVTGGQL------------PQVIQQNTVVLAAPSVKPIELAADH 1145

Query: 1552 LVQSINSRNPPI-----------ETQTVETNISSSMPPGQTLHRRWGEMSPAQNAATASF 1611
               +  S N  +           +    + ++S  +   + + +     SP     T++F
Sbjct: 1146 ATATQTSDNTQVAQASGWPAIVADPDECDESVSDLLAEVEAMEQNGLPSSP-----TSTF 1205

Query: 1612 STPGLTNFSSSE-------PWRSMPPIPSNPPHIQSSTPPNIPWGMGAPEGQSTVPRPGL 1671
                  +    E          S+ P        Q+S   N+  G  +   ++       
Sbjct: 1206 HCDDDDDLKGPEKDFFNPVARMSLTPETCRLDVSQTSILDNVSAGKSSMLTEA------- 1265

Query: 1672 ESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSSA-SVGTNPGWNAPGQGPPVRN 1731
               N  +    +  P +      PP  T +    +  ++A  +G+     A G    +  
Sbjct: 1266 -KDNTPFSHCGTAGPELLLFAPPPPPPTAISHDLTLTTTALRLGSETTVEA-GTVERLPK 1291

Query: 1732 NIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGMWSNEHGKNGDRFS 1756
            ++ G  +  S P  +++     A       P    P   +  +    W N +G N   F+
Sbjct: 1326 SVLGVSSEPS-PRSLSSHDSSSARGSTERSPRVSQPKRSSGHSRDRQWLN-NGHNSS-FN 1291

BLAST of Cp4.1LG04g01060 vs. TAIR 10
Match: AT2G18090.1 (PHD finger family protein / SWIB complex BAF60b domain-containing protein / GYF domain-containing protein)

HSP 1 Score: 322.4 bits (825), Expect = 2.3e-87
Identity = 169/380 (44.47%), Positives = 239/380 (62.89%), Query Frame = 0

Query: 469 MDVTEEVDEASKGSSGAKRKRGKNSKAPARVPS-----RKKVEEDVCFICFDGGDLVLCD 528
           +D   ++DE    S   + +RG+  +  A+  S     +++ +EDVCF+CFDGG LVLCD
Sbjct: 36  LDSDVKLDEEDSDSLKKRGRRGRPPRILAKASSPPISRKRREDEDVCFVCFDGGSLVLCD 95

Query: 529 RRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAV 588
           RRGCPKAYHP+C+ R EAFFR++ +WNCGWH+C+ C+K + YMCYTC +S+CK C++++ 
Sbjct: 96  RRGCPKAYHPACVKRTEAFFRSRSKWNCGWHICTTCQKDSFYMCYTCPYSVCKRCVRSSE 155

Query: 589 ILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLT 648
            + VR NKGFC  CM+ +MLIE   + + EK Q+DF+D+ SWEYLFK YW  LK  L L+
Sbjct: 156 YVVVRENKGFCGICMKTIMLIENAAEANKEKVQVDFDDQGSWEYLFKIYWVSLKEKLGLS 215

Query: 649 FDELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKS 708
            D+L  AKNPWK S +  ++  +   +++ + DG S          G  K R+AK R   
Sbjct: 216 LDDLTKAKNPWKSSSSTAAKRRTTSRVHEKD-DGNS---------PGVMKIRRAKVRKMD 275

Query: 709 QAKETNSPSMPIIPDSQGPSTDNN-------------VEWASKELLEFVMHMKNGDRTVL 768
               +N           GPS D+N               WA+ ELL+FV +MKNGD +VL
Sbjct: 276 AVSVSN----------LGPSLDSNCSLGDRLPQLTSAATWATNELLDFVGYMKNGDISVL 335

Query: 769 SQFDVQALLLEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFL--IR 828
           S++DVQ L+LEY++RN L++  + S+I+CDS+L  LFGK RV + EMLKLL+SHF+  +R
Sbjct: 336 SKYDVQTLVLEYVRRNNLQNSPQNSEIMCDSKLMRLFGKERVDNLEMLKLLDSHFIDQVR 394


HSP 2 Score: 76.3 bits (186), Expect = 2.8e-13
Identity = 47/142 (33.10%), Positives = 79/142 (55.63%), Query Frame = 0

Query: 1168 DKQQVSPSSEMTARNALSGAASEL-PSAARSVNSA-ASPSVGTTQNAATVN--ETEKIWR 1227
            D+   S   +      L+G + ++ PS++ S N A   P    T +   ++  +T  +W 
Sbjct: 401  DRLNTSEQHQEGESQQLNGHSIQVRPSSSDSRNHAVVKPDTSATLSNKPIDGLDTNMVWL 460

Query: 1228 YQDPSGKVQGPFSMVQLRKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDTSSV 1287
            Y DP GK+ GPFS+  LR+W+++G+FP +LR+WR  ++Q  S+LLTD L G+  K     
Sbjct: 461  YGDPDGKIHGPFSLYNLRQWNSSGHFPPELRIWRLGEQQHSSILLTDALNGQFHKTGLLQ 520

Query: 1288 DNSIQAQAHASSFVAKPQGATV 1306
            ++SI  Q   ++ +A  Q  +V
Sbjct: 521  NHSIPKQ-EVTATIANDQNRSV 541

BLAST of Cp4.1LG04g01060 vs. TAIR 10
Match: AT5G63700.1 (zinc ion binding;DNA binding)

HSP 1 Score: 235.0 bits (598), Expect = 4.7e-61
Identity = 162/588 (27.55%), Positives = 292/588 (49.66%), Query Frame = 0

Query: 507  EDVCFICFDGGDLVLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYM 566
            ED CFIC DGG+L+LCD + CPK YH SC+ +D +  +    + C WH C  C+KT    
Sbjct: 22   EDWCFICKDGGNLMLCDFKDCPKVYHESCVEKDSSASKNGDSYICMWHSCYLCKKTPKLC 81

Query: 567  CYTCTFSLCKGCIKNAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWE 626
            C  C+ ++C+GC+ +A  + ++G+KG C  C  +V  +E+ ++      ++D  D+ ++E
Sbjct: 82   CLCCSHAVCEGCVTHAEFIQLKGDKGLCNQCQEYVFALEEIQEYDAAGDKLDLTDRNTFE 141

Query: 627  YLFKEYWTDLKGSLSLTFDEL--VHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVS 686
             LF EYW   K    LTFD++  V A  P K       + D    L         D+  S
Sbjct: 142  CLFLEYWEIAKKQEGLTFDDVRKVCASKPQKKGVKSKYKDDPKFSL--------GDVHTS 201

Query: 687  ENEESGSSKKRKAKRR--------SKS-----QAKETNSPSMPIIPDSQGPSTDNN---- 746
            ++++ G   K K   +        SKS     + K  + P   +   +   + D      
Sbjct: 202  KSQKKGDKLKNKDDPKFALGDAHTSKSGKKGVKLKNKDDPKFLVSDHAVEDAVDYKKVGK 261

Query: 747  ------VEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRKSQIICDS 806
                  + W SK L++F+  +    R  +SQ  V++++  YI+   L D  +K ++ CD 
Sbjct: 262  NKRMEFIRWGSKPLIDFLTSIGEDTREAMSQHSVESVIRRYIREKNLLDREKKKKVHCDE 321

Query: 807  RLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGD--GYTDASGK 866
            +L ++F K  +    +  LL +H  ++E++   D        E   +E +   +++ + K
Sbjct: 322  KLYSIFRKKSINQKRIYTLLNTH--LKENL---DQVEYFTPLELGFIEKNEKRFSEKNDK 381

Query: 867  TRKEKKRRMRKKGDQRGLQSNLD------DYAAIDIHNINLIYLRRNLVEYLIEDEESFH 926
                 K++  +  D    +  +        +A I+  N+ L+YLR++LV  L++  +SF 
Sbjct: 382  VMMPCKKQKTESSDDEICEKEVQPEMRATGFATINADNLKLVYLRKSLVLELLKQNDSFV 441

Query: 927  EKVVGSFVRIRISGNAQKQDL-YRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVI 986
            +KVVGSFV+++   N  +  + Y+++QV G   A +      +   +LL +  +     +
Sbjct: 442  DKVVGSFVKVK---NGPRDFMAYQILQVTGIKNADD------QSEGVLLHVSGM--ASGV 501

Query: 987  SIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSH 1046
            SI  + + +  EEE K L+Q +  G+L + TV +++++A +L     K W+  ++  L  
Sbjct: 502  SISKLDDSDIREEEIKDLKQKVMNGLLRQTTVVEMEQKAKALHYDITKHWIARQLNILQK 561

Query: 1047 LRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHTD 1061
              + A+EKG R+E     + E +E+ +LL+ P E++R L+E+P I  D
Sbjct: 562  RINCANEKGWRRE-----LEEYLEQRELLEKPSEQERLLKEIPRIIED 580

BLAST of Cp4.1LG04g01060 vs. TAIR 10
Match: AT5G08430.1 (SWIB/MDM2 domain;Plus-3;GYF)

HSP 1 Score: 184.1 bits (466), Expect = 9.6e-46
Identity = 157/562 (27.94%), Positives = 260/562 (46.26%), Query Frame = 0

Query: 728  VEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRKSQIICDSRLENLF 787
            V W S++L+EF+  +      ++S++DV   + +YI +  L DP  K +++CD RL  LF
Sbjct: 30   VGWGSRQLIEFLHSLGKDTSEMISRYDVSDTIAKYISKEGLLDPSNKKKVVCDKRLVLLF 89

Query: 788  GKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYTDASGKTRKEKKRR 847
            G   +   ++  LLE H+  +E+   +D      D        +     + K  K+ +  
Sbjct: 90   GTRTIFRMKVYDLLEKHY--KENQDDSDFDFLYEDEPQIICHSEKIAKRTSKVVKKPR-- 149

Query: 848  MRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHEKVVGSFVRIRISG 907
                            +AAI   NI L+YLR++LV+ L++  ++F  K++GSFVRI+   
Sbjct: 150  --------------GTFAAIVSDNIKLLYLRKSLVQELLKSPDTFEGKMLGSFVRIKSDP 209

Query: 908  NAQKQDL-YRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISIDIISNQEFTEEE 967
            N   Q   Y+LVQV G  K            D LL++ N  K   +SI ++S+  F++EE
Sbjct: 210  NDYLQKYPYQLVQVTGVKKE-------HGTDDFLLQVTNYVKD--VSISVLSDDNFSQEE 269

Query: 968  CKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLRDRASEKGRRKEY 1027
            C+ L Q IK G+L + T+ +++E+A  L   + K W+  EI  L  L DRA+EKG R+E 
Sbjct: 270  CEDLHQRIKNGLLKKPTIVEMEEKAKKLHKDQTKHWLGREIELLKRLIDRANEKGWRRE- 329

Query: 1028 PFYNIMECVEKLQLLKTPEERQRRLEELPGI-------HTDPNMDPSHESEDEDEADDKR 1087
                + E ++K +LL+ P+E+ R L E+P +       + + +   +H+S++E    +  
Sbjct: 330  ----LSEYLDKRELLQNPDEQARLLREVPEVIGEELVQNPEVSSPEAHKSDNEQRLSESP 389

Query: 1088 QE-TYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSTNRDLSRNLSGKGFS--- 1147
                +             +  + G   SN   +   T   +  N+ L   ++  G     
Sbjct: 390  LSCIHETPEARNLFGGEDQQFNNGYVMSNPITTPGITSCATEINKGLPTWIASAGAEYLH 449

Query: 1148 ---NQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAASELP 1207
                Q  + I  GE   E S     +  +   N  +  QV P+              EL 
Sbjct: 450  VDVEQPANGIIGGETPTEESKVSQLQSSIPVNNVDNGSQVQPN---------PSEVIELS 509

Query: 1208 SAARSVNSAASPSVGTTQNAATVN-ETEKI-WRYQDPSGKVQGPFSMVQLRKWSNTGYFP 1267
                  N          ++   ++ + EK+ W Y+DP G VQGPFS+ QL+ WS+  YF 
Sbjct: 510  DDDEDDNGDGETLDPKVEDVRVLSYDKEKLNWLYKDPQGLVQGPFSLTQLKAWSDAEYFT 550

Query: 1268 ADLRVWRASDKQDDSLLLTDVL 1273
               RVW   +  + ++LLTDVL
Sbjct: 570  KQFRVWMTGESMESAVLLTDVL 550
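
The percentages in each HSP summary line follow directly from the reported counts (matches divided by alignment length). The sketch below, which is only an illustration and not part of the database's own tooling, parses a summary line and recomputes both values. The names `parse_hsp_summary` and `percent` are hypothetical, and the pattern assumes the summary line layout shown in the alignments above.

```python
import re

# Assumed layout of the summary line, as it appears in the alignments above.
# "Pos\w+" tolerates spelling variants of the "Positives" label.
SUMMARY_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return (count, length, reported_percent) for identity and positives."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def percent(count, length):
    """Recompute a percentage from a match count and an alignment length."""
    return round(100.0 * count / length, 2)


line = "Identity = 157/562 (27.94%), Positives = 260/562 (46.26%), Query Frame = 0"
summary = parse_hsp_summary(line)
for key, (count, length, reported) in summary.items():
    print(key, count, length, reported, percent(count, length))
```

Comparing the recomputed value with the reported one is a quick sanity check when alignment summaries are copied between tools.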

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
Q9SIV5	4.5e-274	39.23	Zinc finger CCCH domain-containing protein 19 OS=Arabidopsis thaliana OX=3702 GN...
Q9SD34	7.9e-170	33.50	Zinc finger CCCH domain-containing protein 44 OS=Arabidopsis thaliana OX=3702 GN...
Q9FT92	1.4e-44	27.94	Uncharacterized protein At5g08430 OS=Arabidopsis thaliana OX=3702 GN=At5g08430 P...
Q6P2L6	7.7e-16	41.59	Histone-lysine N-methyltransferase NSD3 OS=Mus musculus OX=10090 GN=Nsd3 PE=1 SV...
Q9BZ95	8.5e-15	38.53	Histone-lysine N-methyltransferase NSD3 OS=Homo sapiens OX=9606 GN=NSD3 PE=1 SV=...

Match Name	E-value	Identity (%)	Description
XP_023531029.1	0.0	99.60	zinc finger CCCH domain-containing protein 44-like [Cucurbita pepo subsp. pepo]
XP_022928299.1	0.0	97.51	zinc finger CCCH domain-containing protein 44-like [Cucurbita moschata]
KAG6588420.1	0.0	97.39	Zinc finger CCCH domain-containing protein 44, partial [Cucurbita argyrosperma s...
XP_022970324.1	0.0	96.37	zinc finger CCCH domain-containing protein 44-like [Cucurbita maxima]
XP_038888157.1	0.0	78.59	zinc finger CCCH domain-containing protein 19 [Benincasa hispida]

Match Name	E-value	Identity (%)	Description
A0A6J1EKF5	0.0	97.51	zinc finger CCCH domain-containing protein 44-like OS=Cucurbita moschata OX=3662...
A0A6J1I569	0.0	96.37	zinc finger CCCH domain-containing protein 44-like OS=Cucurbita maxima OX=3661 G...
A0A0A0K4G1	0.0	75.37	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G006220 PE=4 SV=1
A0A1S3BIT9	0.0	75.77	zinc finger CCCH domain-containing protein 19 OS=Cucumis melo OX=3656 GN=LOC1034...
A0A6J1C404	0.0	74.52	zinc finger CCCH domain-containing protein 19 OS=Momordica charantia OX=3673 GN=...

Match Name	E-value	Identity (%)	Description
AT2G16485.1	3.2e-275	39.23	nucleic acid binding;zinc ion binding;DNA binding
AT3G51120.1	5.6e-171	33.50	DNA binding;zinc ion binding;nucleic acid binding;nucleic acid binding
AT2G18090.1	2.3e-87	44.47	PHD finger family protein / SWIB complex BAF60b domain-containing protein / GYF ...
AT5G63700.1	4.7e-61	27.55	zinc ion binding;DNA binding
AT5G08430.1	9.6e-46	27.94	SWIB/MDM2 domain;Plus-3;GYF
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 411..431
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1294..1345
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1670..1695
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 706..727
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1168..1183
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1192..1212
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1153..1167
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1294..1441
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1367..1423
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1053..1212
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 823..855
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1670..1731
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 655..727
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1102..1137
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 462..500
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..27
None	No IPR available	PANTHER	PTHR13115	UNCHARACTERIZED	coord: 251..1755
None	No IPR available	PANTHER	PTHR13115:SF14	ZINC FINGER CCCH DOMAIN-CONTAINING PROTEIN 19	coord: 251..1755
None	No IPR available	CDD	cd10567	SWIB-MDM2_like	coord: 730..805; e-value: 1.34436E-19; score: 82.5913
None	No IPR available	CDD	cd19757	Bbox1	coord: 555..580; e-value: 0.00345647; score: 35.1643
None	No IPR available	CDD	cd15568	PHD5_NSD	coord: 509..554; e-value: 2.08629E-22; score: 89.696
IPR003169	GYF domain	SMART	SM00444	gyf_5	coord: 1220..1275; e-value: 1.4E-19; score: 81.1
IPR003169	GYF domain	PFAM	PF02213	GYF	coord: 1222..1263; e-value: 2.1E-13; score: 49.7
IPR003169	GYF domain	PROSITE	PS50829	GYF	coord: 1219..1273; score: 16.285046
IPR003169	GYF domain	CDD	cd00072	GYF	coord: 1219..1274; e-value: 1.23918E-18; score: 79.2732
IPR019835	SWIB domain	SMART	SM00151	swib_2	coord: 725..810; e-value: 2.1E-4; score: 27.5
IPR001965	Zinc finger, PHD-type	SMART	SM00249	PHD_3	coord: 509..555; e-value: 3.2E-8; score: 43.3
IPR004343	Plus-3 domain	SMART	SM00719	rtf1	coord: 866..976; e-value: 8.0E-53; score: 191.5
IPR004343	Plus-3 domain	PFAM	PF03126	Plus-3	coord: 871..974; e-value: 1.4E-24; score: 86.7
IPR004343	Plus-3 domain	PROSITE	PS51360	PLUS3	coord: 866..999; score: 31.169786
IPR036885	SWIB/MDM2 domain superfamily	GENE3D	1.10.245.10	SWIB/MDM2 domain	coord: 718..813; e-value: 9.2E-29; score: 101.2
IPR036885	SWIB/MDM2 domain superfamily	SUPERFAMILY	47592	SWIB/MDM2 domain	coord: 725..806
IPR003121	SWIB/MDM2 domain	PFAM	PF02201	SWIB	coord: 732..805; e-value: 4.6E-16; score: 58.4
IPR003121	SWIB/MDM2 domain	PROSITE	PS51925	SWIB_MDM2	coord: 724..807; score: 21.704117
IPR035445	GYF-like domain superfamily	GENE3D	3.30.1490.40		coord: 1221..1279; e-value: 8.5E-22; score: 78.6
IPR035445	GYF-like domain superfamily	SUPERFAMILY	55277	GYF domain	coord: 1210..1272
IPR013083	Zinc finger, RING/FYVE/PHD-type	GENE3D	3.30.40.10	Zinc/RING finger domain, C3HC4 (zinc finger)	coord: 497..600; e-value: 2.5E-23; score: 84.1
IPR036128	Plus3-like superfamily	GENE3D	3.90.70.200		coord: 867..997; e-value: 4.5E-36; score: 125.6
IPR036128	Plus3-like superfamily	SUPERFAMILY	159042	Plus3-like	coord: 867..997
IPR019786	Zinc finger, PHD-type, conserved site	PROSITE	PS01359	ZF_PHD_1	coord: 510..570
IPR019787	Zinc finger, PHD-finger	PROSITE	PS50016	ZF_PHD_2	coord: 507..573; score: 8.742399
IPR000571	Zinc finger, CCCH-type	PROSITE	PS50103	ZF_C3H1	coord: 1731..1756; score: 12.990384
IPR011011	Zinc finger, FYVE/PHD-type	SUPERFAMILY	57903	FYVE/PHD zinc finger	coord: 505..554
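
The alignment coordinates above give the position of each predicted domain along the protein. As a minimal sketch, assuming one representative row per domain is enough (the choice of the SMART and PROSITE rows below is an illustrative selection, and the names `parse_coord`, `domain_order`, and `overlaps` are hypothetical), the coordinates can be parsed and sorted to read off the N- to C-terminal domain order:

```python
import re

# Matches the "coord: start..end" cell used in the InterPro table above.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(cell):
    """Extract (start, end) residue positions from an alignment cell."""
    m = COORD_RE.search(cell)
    if m is None:
        raise ValueError("no coordinates in %r" % cell)
    return int(m.group(1)), int(m.group(2))


# One row per domain, copied from the table (illustrative selection only).
DOMAIN_ROWS = [
    ("GYF domain", "SMART", "coord: 1220..1275"),
    ("SWIB domain", "SMART", "coord: 725..810"),
    ("Zinc finger, PHD-type", "SMART", "coord: 509..555"),
    ("Plus-3 domain", "SMART", "coord: 866..976"),
    ("Zinc finger, CCCH-type", "PROSITE", "coord: 1731..1756"),
]


def domain_order(rows):
    """Return (name, start, end) tuples sorted by start position."""
    parsed = [(name,) + parse_coord(cell) for name, _source, cell in rows]
    return sorted(parsed, key=lambda row: row[1])


def overlaps(a, b):
    """True if two (name, start, end) intervals share at least one residue."""
    return a[1] <= b[2] and b[1] <= a[2]


for name, start, end in domain_order(DOMAIN_ROWS):
    print("%5d..%-5d %s" % (start, end, name))
```

With these rows the order is PHD-type zinc finger, SWIB, Plus-3, GYF, then the CCCH-type zinc finger, and none of the selected intervals overlap.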

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG04g01060.1	Cp4.1LG04g01060.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0003677	DNA binding
molecular_function	GO:0046872	metal ion binding
molecular_function	GO:0005515	protein binding