Cp4.1LG20g01730 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG20g01730
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: protein NRDE2 homolog isoform X2
Location: Cp4.1LG20: 979105 .. 987852 (+)
RNA-Seq Expression: Cp4.1LG20g01730
Synteny: Cp4.1LG20g01730
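
The Location field can be split into chromosome, start, end and strand, and the genomic span can be derived from it. The following is a minimal sketch, not part of the database record. The function names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its four parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Cp4.1LG20: 979105 .. 987852 (+)")
print(chrom, strand, span_length(start, end))  # Cp4.1LG20 + 8748
```

Under that assumption the gene spans 8748 bp on the plus strand. With 0-based, half-open coordinates the length would instead be end minus start.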
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTCAATTGGGCCTGTGTACATTTTGCCTCTTCCATTTCCTTCTTAGGCTTTCAGTTCATAAGCTCCGAATAGTTCGATCTCGCGGGCAATATTCTTCGTAGCCATGGAAGCTCCAGCAGAAGAGGAGTTGCCACCTGAAGAGCAAAAGCCTAAAACCTCCCTCTTCCCGCTCCCGTTCGTCGCTAACAATCCCCAGAGTCAGATAAGTCCGCCCAATTCAAGCGTTCCTCAGTGGCTTTGCAACTCCAGCTTCACCACTGACCTATCCGTCATCAACGATGCTCTTTCATCTCAAAACAATGTATATCCCTCCCTCTCCACCGATGGCGACCAGGAAGAGGCTGTGGAAGATGAAGGAGGTCCAAGTGTTAGACCTGAGGTGCAGAAGTCATCTCGATCATACGAATTGCTGGAATCTTCTGCTTCGGACGACGGCTCCGAGCATGAGAAGAGGAAAAAGAGGAAGAAGAAGAAGAGGAGGAGGCGAAATGAATATGAAGAAAAAAAGGGATTCGGCGAGTATGGTTCGAGAAAGTCCGATGTTCGGGCTTGGGCCGATGCCGATGGTAGACCTTCCAAGGATTATTACTTCGATTCTAATGGAGACCGGGATAACTTAGCATTCGGGTCTCTTTACAGGTATTTATGCTGGCTTAATTTAATTCTTTCAATCCAACTCTCTAGGTAGTTTCAGAGAAGATGTGGAAGATGAGTTTGAATGTAGTTGAATGCACTTGTTCAGAAATTTGAAAAGACATGCCGATAATTTAATCAGACAGAGTAGAATTCACCTAAGACGTACATGTTGTTCTAATGGCTTACCACTATTTTCATTTTTCTCTTAAATCAACTGCGTGGAAGTTCAATCAATCTCCAGAACTTGTTTAATGAAATCATTTGGTTCGTTCAGTAAATCTAATTCATATGCCCAAAGAAAATCATGAATATAATTAGTTATTTATTTTATTTATCTAAAGGCAATAAGATAACCTCGACATTATGCACAAAAAGAAAAAGAAGGGCAATCCCAAAAAAGAAAAAGAAGGGCAATCCCCGCGTTAATGATATGCTGAAAATTGTAACCCACAATCCCACAAGAATTCAGTACCTCTTTACTCGTTAGTACTTAAGTATCTGTAGGAAGCGGCAGAATTAAATTTGAATTTCAATTTAGAGGGAACAGCATTTTGTTTGGAACTATTGCCGAGACGTGTAGAATAAGTTTTTCCAATTTTTTTTCCACGGAAGATGAGCAGCCAGAACTGATTGCTTCAAATTTGAATTTTAAGTTCACTATGTTTTTATTACAATACAGGATGGATGTTGCACGCTACAGACCGCTCAACCATGGGGAAAGACCTGGACTAAATTTTAATGGATTTTCTCAGTGGAATAAAAGTAGTTCTGCCTTAGACAAAGATGCTGATGCTGAAGTGTTGGATAGTAAATTGAAATCAGGTGGACGCTATTGGTCTGCAAAGAATGCAGCAATAGAGCGACATAAGAACTTCAAACGTGTACGTATTGGTTTTTCTAGAAAAACTCCAGACAAATTATTGGATGATTTCATTCCTTTTTCGGATTCTCAAACATCAAATAATATTGAGGAATCTTGGGAGGATGAAGTGCTGCGTAAAACGCGGGAGTTTAACAAATTGACTAGGGAGCATCCTCATGACGAGAAGGCTTGGTTAGCTTTTGCTGAATTTCAAGACAAAGTTGCAGCTATGCAACCTCAGAAAGGTGCTCGCTTGCAAACTCTAGAGAAAAAAATTAGCATATTAGAGAAGGCTGCTGAGCTTAACCCAGAAAATGAGGAATTATTGCTATACCTTTTAAAGAATTACCAGAAAAGAGATACTATTGATGTGTTGATTAGTAGATGGGAAAAGATACTGATGCAGAATTCTGGGAGTTATAAGTTGTGGAGAGAGTTTTTGCATCTCATTCAAGGGGAGTTCTCTAGATTCAAGGTTTCAGACATGAGACAAATGTATGCA
CATGCAATCCAAGCTCTATCTGCTGCATGCAACCAGCACATTAGGCAGGTTCTCAAGTCAAATGACTATTTCTTCTCGGGGCATCACCTTTTTATATTCCTCTTTGTTTTATTTAACTGTTTGAAGTTTTCCCCCCCAAATACTATGCTATTGGGGTAGATCTTGCTCTTTATTGGTAGATTTCGTTAACTAATTTCATACCATCAACGAAATTGTTTCCTTTACAAGAAAAAAAATAAAATAAAGGTATATATCATTAGCCATATTTTTATATTACGTATGATGCACATAATTTTATTTGAAATTGTACATGAGGGGTCATCATTGCTTTGGAGGCATTTTATTTTCAATTTGCTGGTTTTATAGGGAACATGACTCGGTCTGTTGTGCTGTTACTTCTTGTTATACTTAATTTCTGTACTTAGAATTCCAAATAGTTGCTTTTACTACATTTGACTTCCTAGATATAATGAACCATGCATCTTAGGAGACTAAGAGTACCTCTTTACCATTTGATTTTTTATGTGACGAAGTTGGCTTATTGCCCTTGTTATGCCCACCATTCATTCAACCATTATTGTTACCGGAGACTGTGTCAATTTCTTATTAAATGCAAGTTGCTTGACGGGTGTGGTTTATAAAGATGTATATGTGTGTTGTTATACAAGCTTTACTCTTTTTTAGGTGGACTTCACTTGCTTGTTGCCGTAATATTTTCCTTTTCCCAAGTTCTGAAATCGTGTTTATAAACCTGTTTGTTTTGAATTTAAATAAACCCTTTTATCATTTATACATGGATAACTTAAATTTTATTTTCTGTATTTTTATTATGTTTCTGTACGATGCAGGCCAATCAAACTGCCAAACCTTCAGTGGAGCATGATCTCATTCAGCTAGAACTTGGTCTGGTTGATATTTTTCTGAGTTTGTGCCGATTTGAGTGGCAGGCTGGGTATCAGGAGTTGGCTACTGCTTTATTTCAGGCTGAAATTGAATTTAGCTTGTTTTGCCCTGCTTTGCATTTAAACGATCGAAGTAAACAAAGATTATTTGAACACTTTTGGAACACTAATGCTGAAAGAGTCGGTGAGGAAGGTGCCATCGGTTGGTCTACATGGCTAGAAAAAGAGGAGGAAAATAGGCAAAAGGTTATGAGAGAGGAGGAGGCCTTAGAGGCTGATGAAAAGGGTGGCTGGACTGGTTGGTCTGATCCAGCACCAAAAGAGAAGAAAAATAATGATGACGCAGAAACTACTGCAGAAGTGGGTGTAGCAGCAGAGGAGGCTATGGAGCAAGATGTGGAAGAAGAAGACACTGAAAGAGAAGATAGCACGGAAGCATTGCTCAAAATTCTTGGAATTAACGCTGATGCAGGGGTCGACGAGGAGGTTAAGGACACCTCAACCTGGGCTAGATGGTCAAAAGAAGAGTCATTAAGAGACTGTGAACAATGGATGCCTATCCGGGAAAAATCTGGTACCCTCCAGCTGCCTGCACACATCATTAAATTCTAACTCGACTCTTTTGGTTCACATTTATTCTGACATGTACGCTGAAAAGTCATTGATTTACCATGAATCTTCACATTGTTCTGAATATTAAATATGATTGAACTGCTTGAATCTGTTTTCTAATTAATTAATTTTATGTTTGTATTTATTTGAATTCATACTGTGAAACAGCAGATGTTATTCATGATGAAGGGATGCCTGATGGAGAAACAAATGAACAACTTCAAAGAGTTATATTATATGAAGATGTCAAGGAGTACCTGTTTTCATTGATTTCAAGTGAAGCCCGTTTATCCTTGATATATCAGCTAATTGAATTCTTTAGTGGAAAAATATATTCAAGGTATATACTCTTCTTGTGTAGAATCTTATTTACAGTCATTTTTCAGTTTGAAAACCAGTCGCATTATTCATTTATATAGCATGTAATCTACTTTTACTTGAACATGTGAGTGGCATCGGTGAAGTCTTTGTGGCGAGTGAACTAGT
ATCTTTCATGTAATATGTTGTCTGTCGTCTGAACATTATGCTAATATGTTGGCATGGACTCCCTACAAACTTATGTGCAATGAAAGAGCTTCTTGAAAAAAGAAATGGTCTTATGGGTGAACAATGGAAGAGTTGGATTGAGGCTTAGTTTCTAAGATTCTACCAAGACGAAAGGCAACACGGAAACCAAATTTTTGTTTTAATTTTGAAGTCTTGTAATGTAAGTTATTACTATGAGTTTTAATTTTCAAGTCATAGGTTTCACCTTCCTTTATTCCTCACCCCCTTCAACCTATGACTTGCCCTTTTGCGGTGACATAACATCTACTATCCTGGACTTGGGATTTGGGGAGGGTCTAATAGTTAAGAACTTCAGAAAGGCCTTGAGTGAAATATTACGCATTTGTTTGGTTGAAAATGGTGGAAATGTTAAGCTTATGGTTTCTGGAGTGACTGCTATGGATGTCTGTCAATTACAATTTTCTGATGAGACTATAATTTCAAAATGGTTGGCATGGTTTTACTCATAAGTATCTTTCTGGTGTTAGTACGACCGTCATTACTGAGATATTTATTTAGCACAAACGTTTTATTGCTGCTAACTCTTGTCTTCAGTGATCTTCATGTCTGGTTTCTTTTCAGAGGGCAAGTCCTTTTGTATTTTTGGTTGACAGACTATTGACATTTTTTTTTCTCTCCATCAGGGTGGCTTCAAATAGTTCAAGTTGGATGGAGAGAATCCTTAGTTTAGAGGTGTTGCCAGACGATATATTACATCATCTGAGAAGTGTTCACGATGTTCTTAATAAAAGGCAAAGCAGCTCAAGTAGCTTCACTTTGGAGGTCCTTGTGGGAGGTTCTGATAACCTAACTCAGATGTCTGACATGATGAAGTTTCTTCGCAATGTTATATTACTTTGTTTAACAGCTTTCCCACGTAATTTCATATTGGAAGAAGCTGCTTTAATTGCTGAAGAGTTATTTGTTACAAAAATGAATTCTTGTAGCTCCTCAGTTACTCCCTGCCGTTCCTTAGCAAAGAATCTCTTGAAAAGTGATCGTCAGGTGATCTATTTTCTATTCACTATCAATATGCATGCTTTTTTGAGGATTGAATGTCTAGGTCTACCACGTATGTTTTGATGTCAATGAGGGACTTTATAGTTAACCGGGGAAAAAAATAGTAGTGATAGTTCTGACAGAAGGTTGTTGGGTTCTAAGCGGAGCAAAAGCTTTCAAACTTTTGTGGAATAAGATATTGCTATCAGTATTTTTATTTATTAAGAAGTTTTAAGCAAAATGGAATAGAAATCAACTGCATGAAACAACTCCATTGGGACTGCATCTTCAACATTTCAAACTTACGAGTTAGGTAGTTACCTCCTTTCCAATTGAGGCTATTCTCCCCGTATTGACCTTAAATTATCTTAATTTAAAGTCTGAACTAGATTATAATTTAGTGACAATATCGTTATATCATTTAGTAGTGATGTTTTTTTTTTGTTGTGCATTGATGCTTCGTTCCTGATCATACGAGGGTGGTGCCTAAAGTGCTTTTGTGGATAATTGCCTTTGTTTTTTATTGTAATTCAAAATGCCTTTGTTCATGAAGCATAGAAATAATTGCACCTTCGGTATTCTAGAAAAATGGATACACGTTCTTTATGTAATGGGGGTCTATATCCTCATTTCATACTATTAGCACTGTATAATTCTATAACTGAAATAATCATGTCTGATATTATTGGTATTATCATGTGATAATTATAGGACATGTTACTTTGTGGAGTCTATGCACGAAGAGAGGCAACACATGGAAATATCGATCATGCTAGAAAAGTATTTGACATGTCATTGGCATCTGTGGAAAGCCTTCCCGTGGTATGTTACCGTATCCCTAAATGTAACTCTTGTCTGGGCCAAAGTATGATGCTAGTCCTTAAAAAAGAAAAAACAACTTCAATTACACCATTTTTGGGTTAATTTTTCTACCAAGGAAA
AAGGAACTGTCCAAACGATGGAATGTCCGGGCACTTTGACTTCTCTTGCTAGCTTTTCTTGAAGGGCCTTGTTCTAATAGGAGACTGGGTCTTGATTCCTTTGATATCTTGACTTGCAGCTCCATGTTGAAACTCCCTCGTTTATCTTTGACCTATTTGTCTACCGATGGCATGTGTGTTGGGTGTATAATATGTACGATATGTTTCTTGAAACGAACTTCTTTTTCTAGTCAAATTTGGCATTTTGGATGTTAAACTATAATAGTTGTAACTGAGTTTTGAGGTGCATGCTCTATGAGTGGATGGTATTTAATTTCAACTGCTGTGATTGCTGAAGCAGGATCAGAAGTCCAATGCTCCTCTCTTGTATTTCTGGTATGCTGAATTGGAGCTTGCGAAGGATCCTCACAATGGTCATGATTCTGTAAATCGTGCTGTTCACATTTTATCTTGCCTAGGAAGTGGTGATTCTTACAGTCCATTTAAATGTCAACCATCAAGTTTGCAACTGCTGAGAGCGCACCAAGGTTTTAAAGAAAAAATCAGGGCAGTACGATCTACGTGGCTCCATGGAGTTATAGATGACTCGTCTGTGGCTCTCATATCCTCTGCAGCTTTGTTTGAGGAGTTGACCACTGGATACAATGCCGGTCTTGAGGTTTTAGATCAGGCTTTCAACATGGTACTTCCAGGTTATTTACCAATTCTGTTCTCTCTCTCTCTCTCTCTCTGTCTGGTTCAAATATGTTTCCTCCTTCCTCCCCGGGTATATTCTATAGAAAAAAAAAAAGATTACTTATTTAGCACTGGCAAAAGATTTGGATACCCTATAGAAATTTCTTGTATTTTATTGTAGGGGAAACCTCCACTTGTGTAAGTATGGCCCTAGACATAAAGAGCTAGCGTTAGGCAAGTAATATGAGCTTATTGCCTCCCAGCATCAGATCATTGTATATGGGAGTGATAGCTTTCGGAACTAGAAGTGGTAAATTGTCAATAGTAGTGGGAAGGCGAATCAAATATTGAATATTTTCTGGCTCTATGTTCTGAAGAAAGTTTAGAGGACTATATTTTCTCACATTCTTTGATTTGCAACCCCTTCATGCACAGCTATAATCTTTTAACGACACAGATTTGTCCTATTTCCGAGAAGGAAGATTGTTTTCCCATTCGTTCTTCAACATATATATATATATATATATATATATATATATATATATATATATATATATAAGCTAGGACAGAATTTTTCCGCTTGAATTTTATTTGTTTTGTTGATTATTCGTTACTTACTATTGTTATTGATGTGCAGAAAGAAGAAAACAGAGCTATCAACTAGAATGTTTGTTCAACTACTATGTGAAGATGCTTCTGAGACATCATAAGCAATTAAGCCAACTAAAAGTCCGGGAGTCAATTTCTCAGGGATTGCAGTTCTATCCATTAAATCCTGAACTTTATACTGCTTTTCTGGAGATTAGCTACATTTATTCGGTACCCAGTAAACTGCGATGGACCTTTGATGACTACTGTCAGAAGTAAGATGATCTTTCTTGTGCTATCTCTCACATTGCTAATCACATTTTGGTGTTTAATTTCCCCTACATGGCCAGGCTGTGATATAAATTCACAGTCGAGTTTTAATTGGATCCATATTTGGCATCGATTAACTATAGTAATAACCTTCCTTGCATGTTTATGACCATGATAGATGCACTCATTTCAGATGATATCGCTTTCGTTACTTGATCTATTTGACAGTGGTTTTTCGGACTAAAATCCTTTCTGAATCCGACGCTTTATCTTTTGATATAGTATATTGTTGTTGATCTTTTTTTTTTTCCTGTTTAATTGTGAGAATTACACATACATTTGGGGTCCCTAGGCTTGACAAATTTGCATGTAATACTCTCAGGCAACCTTCTCTGATCCTTTGGATTTTTGCATTATCCTTTGAGATGGGTTATGCGGGTTCTCCTCATAGAATACGTAGGCTGTTT
GAAAAGGCATTGGAAAATGACAATTTGCGTCATTCTGTTCTTCTCTGGCGCTGCTACATTTCATATGAGCTGAACACAGCATGCGATCCTTCTTCAGCCAAGCGAGTTTTCTTCCGAGCCATCCATTCCTGCCCATGGTACGACCCGATTTCTTGTTATTTACACCAATTAAAGTATCATTCTGGCTCCAGTTCCTTCTGTTTTTTTAGGTCCTTCAGATAAACAGCAACCACTCTGTCTAACCGTTTTATGTTCTGAAGGTCAAAAAAGCTGTGGCTTGACGGTTTCATCAAACTGAACTCTATTTTGAGCGCGAAAGAGCTTTCGGATCTCCAAGAAGTTATGCGCGACAAAGAGCTCAATCTACGGACTGATATCTATGAGATTCTCTTGCAAGAAGAACTCATATCTTGAACTGTCTTCGACGACATATTTCTGCGCTTTTTCGTGGCTCAATTGTGTCGTCACCATGTTCTCTTCTCGCTCCTCCTAAAGTGATGGAACTTAGAAAGGTCTAAAAACGTTATACCAGAAGTCGCTTCATGTTTCCCTCCCGATATAGGTAACGTTGGTCAAAGGGGGTGATCGTTGTAAGTTGTACATTGGGCAAAAGGATTTAGAATAGGTTGGGTTCAATTTGAGCTGTTACTTGTATGTTTGTACCAAGTTAATTTCAGGTGAGTTGAAATATTTGAGTTGGTTTGTAATTAAAAAAAAAAAAAAACAGCAATCTACTAATTAAAAAGAT

mRNA sequence

TTCAATTGGGCCTGTGTACATTTTGCCTCTTCCATTTCCTTCTTAGGCTTTCAGTTCATAAGCTCCGAATAGTTCGATCTCGCGGGCAATATTCTTCGTAGCCATGGAAGCTCCAGCAGAAGAGGAGTTGCCACCTGAAGAGCAAAAGCCTAAAACCTCCCTCTTCCCGCTCCCGTTCGTCGCTAACAATCCCCAGAGTCAGATAAGTCCGCCCAATTCAAGCGTTCCTCAGTGGCTTTGCAACTCCAGCTTCACCACTGACCTATCCGTCATCAACGATGCTCTTTCATCTCAAAACAATGTATATCCCTCCCTCTCCACCGATGGCGACCAGGAAGAGGCTGTGGAAGATGAAGGAGGTCCAAGTGTTAGACCTGAGGTGCAGAAGTCATCTCGATCATACGAATTGCTGGAATCTTCTGCTTCGGACGACGGCTCCGAGCATGAGAAGAGGAAAAAGAGGAAGAAGAAGAAGAGGAGGAGGCGAAATGAATATGAAGAAAAAAAGGGATTCGGCGAGTATGGTTCGAGAAAGTCCGATGTTCGGGCTTGGGCCGATGCCGATGGTAGACCTTCCAAGGATTATTACTTCGATTCTAATGGAGACCGGGATAACTTAGCATTCGGGTCTCTTTACAGGATGGATGTTGCACGCTACAGACCGCTCAACCATGGGGAAAGACCTGGACTAAATTTTAATGGATTTTCTCAGTGGAATAAAAGTAGTTCTGCCTTAGACAAAGATGCTGATGCTGAAGTGTTGGATAGTAAATTGAAATCAGGTGGACGCTATTGGTCTGCAAAGAATGCAGCAATAGAGCGACATAAGAACTTCAAACGTGTACGTATTGGTTTTTCTAGAAAAACTCCAGACAAATTATTGGATGATTTCATTCCTTTTTCGGATTCTCAAACATCAAATAATATTGAGGAATCTTGGGAGGATGAAGTGCTGCGTAAAACGCGGGAGTTTAACAAATTGACTAGGGAGCATCCTCATGACGAGAAGGCTTGGTTAGCTTTTGCTGAATTTCAAGACAAAGTTGCAGCTATGCAACCTCAGAAAGGTGCTCGCTTGCAAACTCTAGAGAAAAAAATTAGCATATTAGAGAAGGCTGCTGAGCTTAACCCAGAAAATGAGGAATTATTGCTATACCTTTTAAAGAATTACCAGAAAAGAGATACTATTGATGTGTTGATTAGTAGATGGGAAAAGATACTGATGCAGAATTCTGGGAGTTATAAGTTGTGGAGAGAGTTTTTGCATCTCATTCAAGGGGAGTTCTCTAGATTCAAGGTTTCAGACATGAGACAAATGTATGCACATGCAATCCAAGCTCTATCTGCTGCATGCAACCAGCACATTAGGCAGGCCAATCAAACTGCCAAACCTTCAGTGGAGCATGATCTCATTCAGCTAGAACTTGGTCTGGTTGATATTTTTCTGAGTTTGTGCCGATTTGAGTGGCAGGCTGGGTATCAGGAGTTGGCTACTGCTTTATTTCAGGCTGAAATTGAATTTAGCTTGTTTTGCCCTGCTTTGCATTTAAACGATCGAAGTAAACAAAGATTATTTGAACACTTTTGGAACACTAATGCTGAAAGAGTCGGTGAGGAAGGTGCCATCGGTTGGTCTACATGGCTAGAAAAAGAGGAGGAAAATAGGCAAAAGGTTATGAGAGAGGAGGAGGCCTTAGAGGCTGATGAAAAGGGTGGCTGGACTGGTTGGTCTGATCCAGCACCAAAAGAGAAGAAAAATAATGATGACGCAGAAACTACTGCAGAAGTGGGTGTAGCAGCAGAGGAGGCTATGGAGCAAGATGTGGAAGAAGAAGACACTGAAAGAGAAGATAGCACGGAAGCATTGCTCAAAATTCTTGGAATTAACGCTGATGCAGGGGTCGACGAGGAGGTTAAGGACACCTCAACCTGGGCTAGATGGTCAAAAGAAGAGTCATTAAGAGACTGTGAACAATGGATGCCTATCCGGGAAAAATCTG
CAGATGTTATTCATGATGAAGGGATGCCTGATGGAGAAACAAATGAACAACTTCAAAGAGTTATATTATATGAAGATGTCAAGGAGTACCTGTTTTCATTGATTTCAAGTGAAGCCCGTTTATCCTTGATATATCAGCTAATTGAATTCTTTAGTGGAAAAATATATTCAAGGGTGGCTTCAAATAGTTCAAGTTGGATGGAGAGAATCCTTAGTTTAGAGGTGTTGCCAGACGATATATTACATCATCTGAGAAGTGTTCACGATGTTCTTAATAAAAGGCAAAGCAGCTCAAGTAGCTTCACTTTGGAGGTCCTTGTGGGAGGTTCTGATAACCTAACTCAGATGTCTGACATGATGAAGTTTCTTCGCAATGTTATATTACTTTGTTTAACAGCTTTCCCACGTAATTTCATATTGGAAGAAGCTGCTTTAATTGCTGAAGAGTTATTTGTTACAAAAATGAATTCTTGTAGCTCCTCAGTTACTCCCTGCCGTTCCTTAGCAAAGAATCTCTTGAAAAGTGATCGTCAGGACATGTTACTTTGTGGAGTCTATGCACGAAGAGAGGCAACACATGGAAATATCGATCATGCTAGAAAAGTATTTGACATGTCATTGGCATCTGTGGAAAGCCTTCCCGTGGATCAGAAGTCCAATGCTCCTCTCTTGTATTTCTGGTATGCTGAATTGGAGCTTGCGAAGGATCCTCACAATGGTCATGATTCTGTAAATCGTGCTGTTCACATTTTATCTTGCCTAGGAAGTGGTGATTCTTACAGTCCATTTAAATGTCAACCATCAAGTTTGCAACTGCTGAGAGCGCACCAAGGTTTTAAAGAAAAAATCAGGGCAGTACGATCTACGTGGCTCCATGGAGTTATAGATGACTCGTCTGTGGCTCTCATATCCTCTGCAGCTTTGTTTGAGGAGTTGACCACTGGATACAATGCCGGTCTTGAGGTTTTAGATCAGGCTTTCAACATGGTACTTCCAGAAAGAAGAAAACAGAGCTATCAACTAGAATGTTTGTTCAACTACTATGTGAAGATGCTTCTGAGACATCATAAGCAATTAAGCCAACTAAAAGTCCGGGAGTCAATTTCTCAGGGATTGCAGTTCTATCCATTAAATCCTGAACTTTATACTGCTTTTCTGGAGATTAGCTACATTTATTCGGTACCCAGTAAACTGCGATGGACCTTTGATGACTACTGTCAGAAGCAACCTTCTCTGATCCTTTGGATTTTTGCATTATCCTTTGAGATGGGTTATGCGGGTTCTCCTCATAGAATACGTAGGCTGTTTGAAAAGGCATTGGAAAATGACAATTTGCGTCATTCTGTTCTTCTCTGGCGCTGCTACATTTCATATGAGCTGAACACAGCATGCGATCCTTCTTCAGCCAAGCGAGTTTTCTTCCGAGCCATCCATTCCTGCCCATGGTCAAAAAAGCTGTGGCTTGACGGTTTCATCAAACTGAACTCTATTTTGAGCGCGAAAGAGCTTTCGGATCTCCAAGAAGTTATGCGCGACAAAGAGCTCAATCTACGGACTGATATCTATGAGATTCTCTTGCAAGAAGAACTCATATCTTGAACTGTCTTCGACGACATATTTCTGCGCTTTTTCGTGGCTCAATTGTGTCGTCACCATGTTCTCTTCTCGCTCCTCCTAAAGTGATGGAACTTAGAAAGGTCTAAAAACGTTATACCAGAAGTCGCTTCATGTTTCCCTCCCGATATAGGTAACGTTGGTCAAAGGGGGTGATCGTTGTAAGTTGTACATTGGGCAAAAGGATTTAGAATAGGTTGGGTTCAATTTGAGCTGTTACTTGTATGTTTGTACCAAGTTAATTTCAGGTGAGTTGAAATATTTGAGTTGGTTTGTAATTAAAAAAAAAAAAAAACAGCAATCTACTAATTAAAAAGAT

Coding sequence (CDS)

ATGGAAGCTCCAGCAGAAGAGGAGTTGCCACCTGAAGAGCAAAAGCCTAAAACCTCCCTCTTCCCGCTCCCGTTCGTCGCTAACAATCCCCAGAGTCAGATAAGTCCGCCCAATTCAAGCGTTCCTCAGTGGCTTTGCAACTCCAGCTTCACCACTGACCTATCCGTCATCAACGATGCTCTTTCATCTCAAAACAATGTATATCCCTCCCTCTCCACCGATGGCGACCAGGAAGAGGCTGTGGAAGATGAAGGAGGTCCAAGTGTTAGACCTGAGGTGCAGAAGTCATCTCGATCATACGAATTGCTGGAATCTTCTGCTTCGGACGACGGCTCCGAGCATGAGAAGAGGAAAAAGAGGAAGAAGAAGAAGAGGAGGAGGCGAAATGAATATGAAGAAAAAAAGGGATTCGGCGAGTATGGTTCGAGAAAGTCCGATGTTCGGGCTTGGGCCGATGCCGATGGTAGACCTTCCAAGGATTATTACTTCGATTCTAATGGAGACCGGGATAACTTAGCATTCGGGTCTCTTTACAGGATGGATGTTGCACGCTACAGACCGCTCAACCATGGGGAAAGACCTGGACTAAATTTTAATGGATTTTCTCAGTGGAATAAAAGTAGTTCTGCCTTAGACAAAGATGCTGATGCTGAAGTGTTGGATAGTAAATTGAAATCAGGTGGACGCTATTGGTCTGCAAAGAATGCAGCAATAGAGCGACATAAGAACTTCAAACGTGTACGTATTGGTTTTTCTAGAAAAACTCCAGACAAATTATTGGATGATTTCATTCCTTTTTCGGATTCTCAAACATCAAATAATATTGAGGAATCTTGGGAGGATGAAGTGCTGCGTAAAACGCGGGAGTTTAACAAATTGACTAGGGAGCATCCTCATGACGAGAAGGCTTGGTTAGCTTTTGCTGAATTTCAAGACAAAGTTGCAGCTATGCAACCTCAGAAAGGTGCTCGCTTGCAAACTCTAGAGAAAAAAATTAGCATATTAGAGAAGGCTGCTGAGCTTAACCCAGAAAATGAGGAATTATTGCTATACCTTTTAAAGAATTACCAGAAAAGAGATACTATTGATGTGTTGATTAGTAGATGGGAAAAGATACTGATGCAGAATTCTGGGAGTTATAAGTTGTGGAGAGAGTTTTTGCATCTCATTCAAGGGGAGTTCTCTAGATTCAAGGTTTCAGACATGAGACAAATGTATGCACATGCAATCCAAGCTCTATCTGCTGCATGCAACCAGCACATTAGGCAGGCCAATCAAACTGCCAAACCTTCAGTGGAGCATGATCTCATTCAGCTAGAACTTGGTCTGGTTGATATTTTTCTGAGTTTGTGCCGATTTGAGTGGCAGGCTGGGTATCAGGAGTTGGCTACTGCTTTATTTCAGGCTGAAATTGAATTTAGCTTGTTTTGCCCTGCTTTGCATTTAAACGATCGAAGTAAACAAAGATTATTTGAACACTTTTGGAACACTAATGCTGAAAGAGTCGGTGAGGAAGGTGCCATCGGTTGGTCTACATGGCTAGAAAAAGAGGAGGAAAATAGGCAAAAGGTTATGAGAGAGGAGGAGGCCTTAGAGGCTGATGAAAAGGGTGGCTGGACTGGTTGGTCTGATCCAGCACCAAAAGAGAAGAAAAATAATGATGACGCAGAAACTACTGCAGAAGTGGGTGTAGCAGCAGAGGAGGCTATGGAGCAAGATGTGGAAGAAGAAGACACTGAAAGAGAAGATAGCACGGAAGCATTGCTCAAAATTCTTGGAATTAACGCTGATGCAGGGGTCGACGAGGAGGTTAAGGACACCTCAACCTGGGCTAGATGGTCAAAAGAAGAGTCATTAAGAGACTGTGAACAATGGATGCCTATCCGGGAAAAATCTGCAGATGTTATTCATGATGAAGGGATGCCTGATGGAGAAACAAATGAACAACTTCAAAGAGTTATATTATATGAAGATGTCAAGGAGTACCTGTTTTCATTGAT
TTCAAGTGAAGCCCGTTTATCCTTGATATATCAGCTAATTGAATTCTTTAGTGGAAAAATATATTCAAGGGTGGCTTCAAATAGTTCAAGTTGGATGGAGAGAATCCTTAGTTTAGAGGTGTTGCCAGACGATATATTACATCATCTGAGAAGTGTTCACGATGTTCTTAATAAAAGGCAAAGCAGCTCAAGTAGCTTCACTTTGGAGGTCCTTGTGGGAGGTTCTGATAACCTAACTCAGATGTCTGACATGATGAAGTTTCTTCGCAATGTTATATTACTTTGTTTAACAGCTTTCCCACGTAATTTCATATTGGAAGAAGCTGCTTTAATTGCTGAAGAGTTATTTGTTACAAAAATGAATTCTTGTAGCTCCTCAGTTACTCCCTGCCGTTCCTTAGCAAAGAATCTCTTGAAAAGTGATCGTCAGGACATGTTACTTTGTGGAGTCTATGCACGAAGAGAGGCAACACATGGAAATATCGATCATGCTAGAAAAGTATTTGACATGTCATTGGCATCTGTGGAAAGCCTTCCCGTGGATCAGAAGTCCAATGCTCCTCTCTTGTATTTCTGGTATGCTGAATTGGAGCTTGCGAAGGATCCTCACAATGGTCATGATTCTGTAAATCGTGCTGTTCACATTTTATCTTGCCTAGGAAGTGGTGATTCTTACAGTCCATTTAAATGTCAACCATCAAGTTTGCAACTGCTGAGAGCGCACCAAGGTTTTAAAGAAAAAATCAGGGCAGTACGATCTACGTGGCTCCATGGAGTTATAGATGACTCGTCTGTGGCTCTCATATCCTCTGCAGCTTTGTTTGAGGAGTTGACCACTGGATACAATGCCGGTCTTGAGGTTTTAGATCAGGCTTTCAACATGGTACTTCCAGAAAGAAGAAAACAGAGCTATCAACTAGAATGTTTGTTCAACTACTATGTGAAGATGCTTCTGAGACATCATAAGCAATTAAGCCAACTAAAAGTCCGGGAGTCAATTTCTCAGGGATTGCAGTTCTATCCATTAAATCCTGAACTTTATACTGCTTTTCTGGAGATTAGCTACATTTATTCGGTACCCAGTAAACTGCGATGGACCTTTGATGACTACTGTCAGAAGCAACCTTCTCTGATCCTTTGGATTTTTGCATTATCCTTTGAGATGGGTTATGCGGGTTCTCCTCATAGAATACGTAGGCTGTTTGAAAAGGCATTGGAAAATGACAATTTGCGTCATTCTGTTCTTCTCTGGCGCTGCTACATTTCATATGAGCTGAACACAGCATGCGATCCTTCTTCAGCCAAGCGAGTTTTCTTCCGAGCCATCCATTCCTGCCCATGGTCAAAAAAGCTGTGGCTTGACGGTTTCATCAAACTGAACTCTATTTTGAGCGCGAAAGAGCTTTCGGATCTCCAAGAAGTTATGCGCGACAAAGAGCTCAATCTACGGACTGATATCTATGAGATTCTCTTGCAAGAAGAACTCATATCTTGA
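
The relationship between the coding sequence above and the protein sequence that follows can be checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly. The helper name `translate` is hypothetical, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last seven codons of the CDS listed above.
print(translate("ATGGAAGCTCCAGCAGAAGAGGAG"))  # MEAPAEEE
print(translate("CAAGAAGAACTCATATCTTGA"))     # QEELIS
```

Both fragments agree with the start (MEAPAEEE) and end (QEELIS) of the protein sequence given in the next section. Translating the full CDS the same way would be the natural consistency check.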

Protein sequence

MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDALSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKRKKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRMDVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIERHKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRDTIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQHIRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPALHLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWTGWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGVDEEVKDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDVKEYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVHDVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAEELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSLASVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQPSSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAFNMVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLEISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRHSVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQEVMRDKELNLRTDIYEILLQEELIS
Homology
BLAST of Cp4.1LG20g01730 vs. ExPASy Swiss-Prot
Match: Q80XC6 (Nuclear exosome regulator NRDE2 OS=Mus musculus OX=10090 GN=Nrde2 PE=1 SV=3)

HSP 1 Score: 260.4 bits (664), Expect = 9.8e-68
Identity = 309/1238 (24.96%), Positives = 496/1238 (40.06%), Query Frame = 0

Query: 70   SLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELL-ESSASDDGSEHEKRKKRKKKKRRRR 129
            SLS   ++  A+  EG P  R    +S    EL  ES+ S+  ++  ++KK++KKKRR+ 
Sbjct: 38   SLSRQTEEVTALASEGSPPPRYSFIRSPLKSELSGESNTSEKLAQTSRKKKKEKKKRRKH 97

Query: 130  NEYEEKKGFGEY---GSRKSDVRAWADADGRPSKD------------------------- 189
              + + K   E       +SD  A  D   R  +D                         
Sbjct: 98   QHHRKTKRRHEQLSSSGSESDTEAGKDRASRSIRDDQKEAEKPCQGSNAAAAVAAAAGHR 157

Query: 190  -------------YYFDSNGDRDNLAFGSLYRMDVARYRPLNHGERPGLNFNGFSQWNKS 249
                         +  D   D  N  + SLYR D+ARY+      R G +  G    N  
Sbjct: 158  SIWLEDIHDLTDVFRTDKKPDPANWEYKSLYRGDIARYK------RKGDSCLGI---NPK 217

Query: 250  SSALDKDADAEVLDSKLKSGGRYWSAKNAAIERHKNFKRVRIGFSRKTPDKLLDDFIPFS 309
               +  +  +       +   RY++ KN  + R +    + +  + +        FIP  
Sbjct: 218  KQCISWEGASAAKKHSHRHLERYFTKKNVGLMRTEG---IAVCSNPEPASSEPVTFIPVK 277

Query: 310  DSQTSNNIEESW----------------------------------EDEVLR-KTREFNK 369
            DS  +     SW                                  E+  L+ +  EFN+
Sbjct: 278  DSAEAATPVTSWLNPLGIYDQSTTQWLQGQGPAEQESKQPDSQQDRENAALKARVEEFNR 337

Query: 370  LTREHPHDEKAWLAFAEFQDKV-----------AAMQPQKGARLQTLEKKISILEKAAEL 429
              RE+P D + W+AF  FQD+V              +  + +    LEKK+++LE+A E 
Sbjct: 338  RVRENPWDTQLWMAFVAFQDEVMRSPGIYALGEGEQEKHRKSLKLLLEKKLAVLERAIES 397

Query: 430  NPENEELLLYLLKNYQKRDTIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSD 489
            NP + EL L  L+   +      L   W+K+L  +  +  LW+ +L   Q +F  F VS 
Sbjct: 398  NPGSVELKLAKLQLCSEFWEPSALAKEWQKLLFLHPNNTSLWQRYLSFCQSQFGTFSVSK 457

Query: 490  MRQMYAHAIQALSAACNQHIRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQE 549
            +  +Y   +  LSA     ++  +  + P     L   E  +  +FL  C F  QAG+ E
Sbjct: 458  LHSLYGKCLSTLSA-----VKDGSMLSHPV----LPGTEEAMFGLFLQQCHFLRQAGHSE 517

Query: 550  LATALFQAEIEFSLFCP--ALHLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEE 609
               +LFQA ++F+ F P     L  + +   FE FW++   RVGE+GA GW  W+ ++  
Sbjct: 518  KVISLFQAMVDFTFFKPDSVKELPTKVQVEFFEPFWDSGEPRVGEKGARGWRAWMHQQ-- 577

Query: 610  NRQKVMREEEALEADEKGGWTGWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDT 669
                           E+GGW                              +  D ++E+ 
Sbjct: 578  ---------------ERGGWV----------------------------LITPDEDDEEP 637

Query: 670  EREDSTEALLKILGINADAGVDEEVKDTS--TWARWSKEESLRDCEQWMPIREKSADVIH 729
            E E                  D+E+KD +   W  W   E  RD   W P R        
Sbjct: 638  EEE------------------DQEIKDKTLPRWQIWLAVERSRDQRHWRPWRPDKTKKQT 697

Query: 730  DEGMPDGETNEQLQRVILYEDVKEYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSS 789
            +E   D E      R +L++D+ + L  L S + +  LI   ++F             S 
Sbjct: 698  EEDCEDPE------RQVLFDDIGQSLIRLSSPDLQFQLIQAFLQFL---------GVPSG 757

Query: 790  WMERILSLEVLPDDILHHLRSVHDVLNKRQSSSSSFTLEVLVGGSDNLTQ-------MSD 849
            ++     L +  D+       ++D        + SF+    VG  + L +         +
Sbjct: 758  FLPPASCLYLAMDESSIFESELYDE-KPLTYFNPSFSGISCVGSMEQLGRPRWTKGHNRE 817

Query: 850  MMKFLRNV--ILLCLTAFPRNFILEEAAL---IAEELFV--TKMNSCSSSVTPCRSLAKN 909
              +F+RNV  ++L L A  +   L  + L   IA+ ++   TK     S    C+ LAKN
Sbjct: 818  GEEFVRNVFHLVLPLLAGKQKSQLSLSWLRYEIAKVIWCLHTKKKRLKSQGKSCKKLAKN 877

Query: 910  LLK--SDRQDMLLCGVYARREATHGNIDHARKVFDMSLASVESLPVDQKSNAPLLYFWYA 969
            LLK   +R +  L   YA  E   GN + ARKVFD +L+   S  +  +    L    YA
Sbjct: 878  LLKEPENRNNFCLWKQYAHLEWLLGNTEDARKVFDTALSMAGSSELKDRELCELSLL-YA 937

Query: 970  ELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQPSSLQLLRAHQGFKEKI-----R 1029
            ELE+   P +   +  RAVHIL+ L     Y P+  Q SS Q+L+A + ++  +     +
Sbjct: 938  ELEMELSPDSRGATTGRAVHILTRLTESSPYGPYTGQVSSTQVLKARKAYELALQDCLGQ 997

Query: 1030 AVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAFNMVL---------PERR 1089
            +  S+       D   +L+    LF+ LT G +A +++  + F  +          PE  
Sbjct: 998  SCASSPAPAEALDCLGSLVRCFMLFQYLTVGIDAAVQIYGRVFAKLKGSARLEDPGPEDS 1057

Query: 1090 KQSYQLECLFNYYVKM---LLRHHKQL---SQLKVRESISQGLQFYPLNPELYTAFLEIS 1149
              S  L  +      M   LLR H  +       +RE++S  L+ YP N  L+ A+++I 
Sbjct: 1058 TSSQSLTNVLEAVSMMHTSLLRFHMNVCVYPLAPLRETLSDALKLYPGNQVLWRAYVQIQ 1117

Query: 1150 YIYSVPSKLRWTFDDYCQKQPSLILWIFALSFE---------------------MGYAGS 1159
                  +K R  FD   +    L  W+FA+  E                     +   G 
Sbjct: 1118 NKSHSANKTRRFFDTVTRSAKHLEPWLFAIEAEKLRKKLVESVQRVGGREVHATIPETGL 1168
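
The percentages in the HSP summary line are the identical and positive-scoring column counts divided by the alignment length. A small sketch reproduces the figures reported for the Q80XC6 hit. The function name `percent` is hypothetical, and formatting to two decimals is assumed from the values shown in the report.

```python
def percent(count, alignment_length):
    """Share of alignment columns, as a percentage with two decimals."""
    return f"{100 * count / alignment_length:.2f}%"


# Counts taken from the HSP summary for Q80XC6 (alignment length 1238).
print("Identity: ", percent(309, 1238))  # 24.96%
print("Positives:", percent(496, 1238))  # 40.06%
```

The alignment length (1238) exceeds the 1164-residue query because it counts gap columns as well as aligned residues.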

BLAST of Cp4.1LG20g01730 vs. ExPASy Swiss-Prot
Match: Q9H7Z3 (Nuclear exosome regulator NRDE2 OS=Homo sapiens OX=9606 GN=NRDE2 PE=1 SV=3)

HSP 1 Score: 253.8 bits (647), Expect = 9.2e-66
Identity = 309/1241 (24.90%), Positives = 501/1241 (40.37%), Query Frame = 0

Query: 69   PSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSE-----HEKRKKRKKK 128
            PS         + + E  P+   E    +RS+   ESS   D ++       K+KK KKK
Sbjct: 29   PSFCVGSITSLSQQTEAAPAHVSEGLPLTRSHLKSESSDESDTNKKLKQTSRKKKKEKKK 88

Query: 129  KRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSK------------------------ 188
            KR+ ++  + K+  G   S +S+    ++ D +PS+                        
Sbjct: 89   KRKHQHHKKTKRKHGPSSSSRSETDTDSEKD-KPSRGVGGSKKESEEPNQGNNAAADTGH 148

Query: 189  --------------DYYFDSNGDRDNLAFGSLYRMDVARYRPLNHGERPGLNFNGFSQWN 248
                           +  D   D  N  + SLYR D+ARY+      R G +  G    N
Sbjct: 149  RFVWLEDIQAVTGETFRTDKKPDPANWEYKSLYRGDIARYK------RKGDSCLGI---N 208

Query: 249  KSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIERHKNFKRVRIGFSRKTPDKLLDDFIP 308
                 +  +  +       K   RY++ K+  +    N   V I    + P      FIP
Sbjct: 209  PKKQCISWEGTSTEKKHSRKQVERYFTKKSVGL---MNIDGVAISSKTEPPSSEPISFIP 268

Query: 309  FSDSQTSNNI-------------------------EESWEDE---------VLRKTREFN 368
              D + +  +                         +ES + +         +  K  EFN
Sbjct: 269  VKDLEDAAPVTTWLNPLGIYDQSTTHWLQGQGPPEQESKQPDAQPDSESAALKAKVEEFN 328

Query: 369  KLTREHPHDEKAWLAFAEFQDKV-----------AAMQPQKGARLQTLEKKISILEKAAE 428
            +  RE+P D + W+AF  FQD+V              + +K +    LEKK++ILE+A E
Sbjct: 329  RRVRENPRDTQLWMAFVAFQDEVMKSPGLYAIEEGEQEKRKRSLKLILEKKLAILERAIE 388

Query: 429  LNPENEELLLYLLKNYQKRDTIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVS 488
             N  + +L L  LK   +      L+  W+K++  +  +  LW+++L   Q +FS F +S
Sbjct: 389  SNQSSVDLKLAKLKLCTEFWEPSTLVKEWQKLIFLHPNNTALWQKYLLFCQSQFSTFSIS 448

Query: 489  DMRQMYAHAIQALSAACNQHIRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQ 548
             +  +Y   +  LSA  +  I   +  A P  E  +  L       FL  C F  QAG+ 
Sbjct: 449  KIHSLYGKCLSTLSAVKDGSI--LSHPALPGTEEAMFAL-------FLQQCHFLRQAGHS 508

Query: 549  ELATALFQAEIEFSLFCP--ALHLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEE 608
            E A +LFQA ++F+ F P     L  + +   FE FW++   R GE+GA GW  W+ ++ 
Sbjct: 509  EKAISLFQAMVDFTFFKPDSVKDLPTKGQVEFFEPFWDSGEPRAGEKGARGWKAWMHQQ- 568

Query: 609  ENRQKVMREEEALEADEKGGWTGWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEED 668
                            E+GGW                              +  + +E+D
Sbjct: 569  ----------------ERGGW------------------------------VVINPDEDD 628

Query: 669  TEREDSTEALLKILGINADAGVDEEVKDTS--TWARWSKEESLRDCEQWMPIREKSADVI 728
             E E+                 D+E+KD +   W  W   E  RD   W P R       
Sbjct: 629  DEPEED----------------DQEIKDKTLPRWQIWLAAERSRDQRHWRPWRPDKTKKQ 688

Query: 729  HDEGMPDGETNEQLQRVILYEDVKEYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSS 788
             +E   D E      R +L++D+ + L  L S + +  L+   ++F          +  +
Sbjct: 689  TEEDCEDPE------RQVLFDDIGQSLIRLSSHDLQFQLVEAFLQFLG---VPSGFTPPA 748

Query: 789  SWMERILSLEVLPDDILHHLRSVHDVLNKRQSSSSSFTLEVLVGGSDNL-------TQMS 848
            S +   +    + D+ L+  + +    N   S +S       VG  D L        Q  
Sbjct: 749  SCLYLAMDENSIFDNGLYDEKPL-TFFNPLFSGAS------CVGRMDRLGYPRWTRGQNR 808

Query: 849  DMMKFLRNVILLCLTAFPR--------NFILEEAALIAEELFVTKMNSCSSSVTPCRSLA 908
            +  +F+RNV  L +  F          +++  E A +   L         S    C+ LA
Sbjct: 809  EGEEFIRNVFHLVMPLFSGKEKSQLCFSWLQYEIAKVIWCLHTKNKKRLKSQGKNCKKLA 868

Query: 909  KNLLKSDR--QDMLLCGVYARREATHGNIDHARKVFDMSLASVESLPVDQKSNAPLLYFW 968
            KNLLK      +  L   YA  E   GN + ARKVFD +L    S  + + S+   L   
Sbjct: 869  KNLLKEPENCNNFCLWKQYAHLEWLLGNTEDARKVFDTALGMAGSREL-KDSDLCELSLL 928

Query: 969  YAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQPSSLQLLRAHQGFKEKIRAV- 1028
            YAELE+   P     +  RAVHIL+ L     Y P+  Q  ++ +L+A + ++  ++   
Sbjct: 929  YAELEVELSPEVRRAATARAVHILTKLTESSPYGPYTGQVLAVHILKARKAYEHALQDCL 988

Query: 1029 -RSTWLHGVIDDSSVALISSA---ALFEELTTGYNAGLEVLDQAF----NMVLPE----- 1088
              S   +    DS   LIS A    LF+ LT G +A +++ +Q F    + V PE     
Sbjct: 989  GDSCVSNPAPTDSCSRLISLAKCFMLFQYLTIGIDAAVQIYEQVFAKLNSSVFPEGSGEG 1048

Query: 1089 --RRKQSYQ--LECLFNYYVKMLLRHHKQLS---QLKVRESISQGLQFYPLNPELYTAFL 1148
                 QS+   LE +   +   LLR H ++S      +RE++SQ L+ YP N  L+ +++
Sbjct: 1049 DSASSQSWTSVLEAITLMHTS-LLRFHMKVSVYPLAPLREALSQALKLYPGNQVLWRSYV 1108

Query: 1149 EISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFE---------------------MGY 1159
            +I       SK R  FD   +    L  W+FA+  E                     +  
Sbjct: 1109 QIQNKSHSASKTRRFFDTITRSAKPLEPWLFAIEAEKLRKRLVETVQRLDGREIHATIPE 1160

BLAST of Cp4.1LG20g01730 vs. ExPASy Swiss-Prot
Match: Q54QP0 (Nuclear exosome regulator NRDE2 OS=Dictyostelium discoideum OX=44689 GN=nrde2 PE=3 SV=1)

HSP 1 Score: 153.3 bits (386), Expect = 1.7e-35
Identity = 267/1276 (20.92%), Positives = 513/1276 (40.20%), Query Frame = 0

Query: 27   ANNPQSQISPPNSSVPQWLCNSSFTTD--------LSVINDALSSQNNVYPSLSTDGDQE 86
            ++N  S   PP+SS P +        D         S I+ +    ++   + S+  DQ+
Sbjct: 77   SDNTFSSPPPPSSSPPLYSKTEKKNNDKVKIIMKKRSFIDSSSDDNSDDDDNDSSSSDQD 136

Query: 87   EAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKRKKKKRRRRNEYEEKKGFG 146
             + +D GG +     +K  +  +  + +  ++ +E + RKK KKKKR+ +    + K   
Sbjct: 137  SSDDDSGGFTYN---RKKYKKEQQQQENEENEENERKNRKKEKKKKRKDKKFKNDDKSMM 196

Query: 147  EYGSRKSDVRAWADADG---RPSKDYYFDSNGDRDNLAFGSLYRMDVARYRPLNHGERPG 206
               +  S+   ++D        + D  F S     N  + + + + ++ Y+ +   ++ G
Sbjct: 197  IISNENSE--NYSDNSSYFIEKTGDKVFSSRTSTPNYNYDNSFILGMSDYK-IGFSKKEG 256

Query: 207  -----LNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIERHKNFKRVRIG 266
                 ++   F++   ++    K + +    S+ +        K   IE  +  K +   
Sbjct: 257  YQIEPISLTSFNKQQINNRYFTKPSSSSSSSSQSQQQLITVITKRKEIEEIEKVKPIS-- 316

Query: 267  FSRKTPDKLLDDFIPF----------SDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD 326
             + K P K  DD I             D    ++  E+ E + L+K  E NKL  ++P++
Sbjct: 317  -NIKDPSKSNDDEIKLIVLNENNHDNDDDDNDDDDNETLERKTLKKNSELNKLVEQYPNN 376

Query: 327  EKAWLAFAEFQDKVAAMQPQ-KGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKR 386
             + W+   +FQ+           ++    EK++SI   +   NP++E L +  LK   K 
Sbjct: 377  IEYWIDLVKFQENFQQFSRNVNKSKTSMYEKQLSIYRNSLLHNPDSEILTIEYLKLASKL 436

Query: 387  DTIDVLISRWEKILMQNSG-------SYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQA 446
                 ++  W K+L  +S        S KLW+E++      F+ FK+  +++     I+ 
Sbjct: 437  WDQQKVLDLWNKVLSSSSSSSSSSIISEKLWKEYIEFCLSNFNDFKIEKIKETIITIIRK 496

Query: 447  LSAACNQHIRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIE 506
            +        R++ +    +   ++  LE  ++     L +   QAG+ E    ++Q+ IE
Sbjct: 497  MLVK-----RRSFKVKDYNFMENISNLEESILQFISQLSKLLNQAGFSERVIGIYQSLIE 556

Query: 507  FSLFCPALHLNDRSKQRL--FEHFWNT-NAERVGEEGAIGWSTWL------------EKE 566
            F+ F P    N+     L  F+ +W++ +  ++G   +IGWS                K 
Sbjct: 557  FNCFEPIQLSNETQATLLKEFKSYWSSLDYPKIGNPNSIGWSKSFTILLNNSINNNNNKN 616

Query: 567  EENRQKVMREEEALEADEKGGWTGWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEE 626
              N             +        ++       NN++ +      ++ EE  +   E+E
Sbjct: 617  NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNMDLDNLDNLSIEEIEKLLKEQE 676

Query: 627  DTEREDSTEALLKILGINADAGVD----------EEVKDTST-----------WARWSKE 686
            D E +D+ E +  I   + D   D          EE +D  +           +  W K+
Sbjct: 677  DQENQDN-ENIFNITHKSKDLNEDDDNENNNNNQEEQEDNDSNSNDNDNNNNKFNTWGKK 736

Query: 687  ESLRDCEQWMPIREKSADVIHDEGMPDGETNE-QLQRVILYEDVKEYLFSLISSEARLSL 746
            E   D  +W P+       I++    + E NE   +RV+L+ D  E LF  +  E +L L
Sbjct: 737  EIELDELKWKPLD------INNNLEVNKEVNENDTERVVLFNDFYELLFRFVKEENKLEL 796

Query: 747  IYQLIEFFSGKI----------YS-----RVASNSSSWMERILSL-------EVLPDDIL 806
            ++Q +EF    I          YS     R  S +S   E I+SL       +  P    
Sbjct: 797  VFQFLEFLGVPISLLDDKIQPRYSFYHPQRRDSINSIHNENIISLLFKDLKQQPSPPSPS 856

Query: 807  HHLRSVHDVLNKRQSSSSSFTLEVLVGGSDNLTQMS-DMMKFLRNVILLCLTAFPRNFIL 866
                +     +K  +++++         S NL  +S D +KF+ ++  L L         
Sbjct: 857  PEYPNWFKTFDKFSNNNNN---------SQNLLGLSDDKIKFIDSIYKLILEN------- 916

Query: 867  EEAALIAEELFVTK-MNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHA 926
                 + E+L+V+  M   S  +   +   K+L +  + +++   ++A  E   G    A
Sbjct: 917  SNGIKLKEKLYVSYIMFKASIDINDAKVYTKSLCEKFK-NLIYFDIFASLELKSGKTQQA 976

Query: 927  RKVFDMS------LASVESLPVDQKSNAPLLYFWYAELEL-------AKDP--------- 986
            R ++  +      L + ++    Q+    L+Y  Y  +EL        KDP         
Sbjct: 977  RTIYQTTCFYINQLINQQAQQQQQQLQIDLVYREYLFMELNLIYQTIEKDPQILKRFIKS 1036

Query: 987  -HNGHDSVNRAVHILSCLGSGD----SYSPFKCQPSSLQLLRAHQGFKEKIRAVRSTWLH 1046
             H   +     +HIL C   G+    S S F     +  L + +  F +K++  +     
Sbjct: 1037 NHKPIELFFTPLHILQCYLDGNYKQYSSSTFNLNTINQFLNQLNLKFLQKLQQQQQQQQQ 1096

Query: 1047 GVIDDSSVALISSAA---------------LFEELTTGYNAGLEVLDQAFNMVLPERRK- 1106
                 SS +  SS++               +FE L+ G++  L +  +  +    +  K 
Sbjct: 1097 NSSSSSSSSSSSSSSSSSSSSSVDFLLCYCIFELLSNGFDGFLILFKRITSSSTNDYLKI 1156

Query: 1107 QSYQLECLFNYYVKMLLRHHKQL--SQLKVRESISQGLQFYPLNPELYTAFLEISYIYSV 1151
             S Q E L    + M+ +    +     +++  I   L  Y  +P+L + FL       +
Sbjct: 1157 FSIQHELLTIRCIDMVTKIAPLIGTDPKRIKNLIIDSLNQYYDHPKLLSLFLNWESKNQL 1216

BLAST of Cp4.1LG20g01730 vs. ExPASy Swiss-Prot
Match: O42975 (Protein NRDE2 homolog OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=SPBC20F10.05 PE=1 SV=1)

HSP 1 Score: 72.4 bits (176), Expect = 3.8e-11
Identity = 76/354 (21.47%), Positives = 160/354 (45.20%), Query Frame = 0

Query: 195 GLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIERHKNFKRVRIGFSRK 254
           G+N     ++++SSS++   A    +  + K G      K+  I+    +   R    ++
Sbjct: 72  GINKRPVPKYHRSSSSVYGSAPLLRIVKESKEGITLNKKKSLEIK----YDEERSFDEKE 131

Query: 255 TPDKLLDD----FIPFSDSQTSNNIEES-WEDEVLRKTREFNKLTREHPHDEKAWLAFAE 314
             +   +D    FIP   ++ S+  E+S +   +L+  +E ++  +++P   + W+   E
Sbjct: 132 NDESEFEDGQQGFIPLLVNRNSDPSEKSTFSLNILKAIKETDEEIKKNPGKARLWIKMCE 191

Query: 315 FQDKVAAMQPQKG----------ARLQTLEKKISILEKAAE--LNPENEELLLYLLKNYQ 374
           +Q+++   + ++               +   K+SILEKA +     ++E L+ Y L+   
Sbjct: 192 YQERLLFDEFRRSNSDDIKGKLKIENNSRSVKLSILEKALKEVKGCDHEILVSYYLQLGS 251

Query: 375 KRDTIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAAC 434
           +  + +    ++E++L+++ G   LW ++     G  S F  +D   M++   + L    
Sbjct: 252 EEWSKEETNQKFEEVLIEHPGYLNLWMKYAEYFTG-ISEFTFNDCLNMFSKCFKFLKQKL 311

Query: 435 NQHIRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFC 494
           +   R++ +  + +      ++E  ++ + + LC F    GY ELA ++FQA +E   F 
Sbjct: 312 SD--RKSCKERESTDVTSNFEVEEAILHLLIRLCDFLKNCGYYELAWSIFQANMELCYFY 371

Query: 495 PALHLNDRSKQRLFE---HFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREE 529
           P  +L  +     FE    FWN++  +  EE A GW   L+ E   + +    E
Sbjct: 372 PR-YLEKKLDSTFFESFSKFWNSDTPKFSEENARGWCNVLDDESSQQNQNFSSE 417
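
The identity and positives percentages in each HSP summary line are the match counts divided by the alignment length (for the hit above, 76/354 and 160/354). A minimal Python sketch for reading such a line and recomputing the percentages follows. The function name `parse_hsp_stats` and the returned field names are hypothetical and are not part of any BLAST tool. The pattern assumes the line layout shown in this report and accepts both "Positives" and the "Postives" spelling.

```python
import re

# Matches the HSP summary line layout used in this report (an assumption:
# other BLAST front ends format this line differently).
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts, alignment length, and recomputed percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identities, ident_len, _, positives, pos_len, _ = match.groups()
    identities, ident_len = int(identities), int(ident_len)
    positives, pos_len = int(positives), int(pos_len)
    if ident_len != pos_len:
        raise ValueError("identity and positives lengths differ")
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": ident_len,
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": round(100.0 * identities / ident_len, 2),
        "positives_pct": round(100.0 * positives / pos_len, 2),
    }


if __name__ == "__main__":
    example = ("Identity = 76/354 (21.47%), "
               "Postives = 160/354 (45.20%), Query Frame = 0")
    print(parse_hsp_stats(example))
```

Recomputing the percentages from the counts, instead of trusting the printed values, gives a quick consistency check when many hits are collected into a table.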

BLAST of Cp4.1LG20g01730 vs. NCBI nr
Match: XP_023519328.1 (protein NRDE2 homolog isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 2283 bits (5915), Expect = 0.0
Identity = 1164/1164 (100.00%), Positives = 1164/1164 (100.00%), Query Frame = 0

Query: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60
            MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA
Sbjct: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60

Query: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKR 120
            LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKR
Sbjct: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKR 120

Query: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM 180
            KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM
Sbjct: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM 180

Query: 181  DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER 240
            DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER
Sbjct: 181  DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER 240

Query: 241  HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD 300
            HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD
Sbjct: 241  HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD 300

Query: 301  EKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD 360
            EKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD
Sbjct: 301  EKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD 360

Query: 361  TIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH 420
            TIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH
Sbjct: 361  TIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH 420

Query: 421  IRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPAL 480
            IRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPAL
Sbjct: 421  IRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPAL 480

Query: 481  HLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWT 540
            HLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWT
Sbjct: 481  HLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWT 540

Query: 541  GWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGV 600
            GWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGV
Sbjct: 541  GWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGV 600

Query: 601  DEEVKDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDVK 660
            DEEVKDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDVK
Sbjct: 601  DEEVKDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDVK 660

Query: 661  EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH 720
            EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH
Sbjct: 661  EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH 720

Query: 721  DVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAE 780
            DVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAE
Sbjct: 721  DVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAE 780

Query: 781  ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSLA 840
            ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSLA
Sbjct: 781  ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSLA 840

Query: 841  SVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQPS 900
            SVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQPS
Sbjct: 841  SVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQPS 900

Query: 901  SLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAFN 960
            SLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAFN
Sbjct: 901  SLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAFN 960

Query: 961  MVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLEI 1020
            MVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLEI
Sbjct: 961  MVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLEI 1020

Query: 1021 SYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRHS 1080
            SYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRHS
Sbjct: 1021 SYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRHS 1080

Query: 1081 VLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQE 1140
            VLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQE
Sbjct: 1081 VLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQE 1140

Query: 1141 VMRDKELNLRTDIYEILLQEELIS 1164
            VMRDKELNLRTDIYEILLQEELIS
Sbjct: 1141 VMRDKELNLRTDIYEILLQEELIS 1164

BLAST of Cp4.1LG20g01730 vs. NCBI nr
Match: XP_023519327.1 (protein NRDE2 homolog isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 2278 bits (5903), Expect = 0.0
Identity = 1164/1165 (99.91%), Positives = 1164/1165 (99.91%), Query Frame = 0

Query: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60
            MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA
Sbjct: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60

Query: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKR 120
            LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKR
Sbjct: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKR 120

Query: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM 180
            KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM
Sbjct: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM 180

Query: 181  DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER 240
            DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER
Sbjct: 181  DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER 240

Query: 241  HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD 300
            HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD
Sbjct: 241  HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD 300

Query: 301  EKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD 360
            EKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD
Sbjct: 301  EKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD 360

Query: 361  TIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH 420
            TIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH
Sbjct: 361  TIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH 420

Query: 421  IRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPAL 480
            IRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPAL
Sbjct: 421  IRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPAL 480

Query: 481  HLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWT 540
            HLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWT
Sbjct: 481  HLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWT 540

Query: 541  GWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGV 600
            GWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGV
Sbjct: 541  GWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGV 600

Query: 601  DEEVKDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDVK 660
            DEEVKDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDVK
Sbjct: 601  DEEVKDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDVK 660

Query: 661  EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH 720
            EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH
Sbjct: 661  EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH 720

Query: 721  DVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAE 780
            DVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAE
Sbjct: 721  DVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAE 780

Query: 781  ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSLA 840
            ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSLA
Sbjct: 781  ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSLA 840

Query: 841  SVESLPV-DQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQP 900
            SVESLPV DQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQP
Sbjct: 841  SVESLPVQDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQP 900

Query: 901  SSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAF 960
            SSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAF
Sbjct: 901  SSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAF 960

Query: 961  NMVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLE 1020
            NMVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLE
Sbjct: 961  NMVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLE 1020

Query: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRH 1080
            ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRH
Sbjct: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRH 1080

Query: 1081 SVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQ 1140
            SVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQ
Sbjct: 1081 SVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQ 1140

Query: 1141 EVMRDKELNLRTDIYEILLQEELIS 1164
            EVMRDKELNLRTDIYEILLQEELIS
Sbjct: 1141 EVMRDKELNLRTDIYEILLQEELIS 1165

BLAST of Cp4.1LG20g01730 vs. NCBI nr
Match: XP_023519329.1 (protein NRDE2 homolog isoform X3 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 2271 bits (5886), Expect = 0.0
Identity = 1163/1165 (99.83%), Positives = 1163/1165 (99.83%), Query Frame = 0

Query: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60
            MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA
Sbjct: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60

Query: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKR 120
            LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKR
Sbjct: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKR 120

Query: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM 180
            KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM
Sbjct: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM 180

Query: 181  DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER 240
            DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER
Sbjct: 181  DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER 240

Query: 241  HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD 300
            HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD
Sbjct: 241  HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD 300

Query: 301  EKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD 360
            EKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD
Sbjct: 301  EKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD 360

Query: 361  TIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH 420
            TIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH
Sbjct: 361  TIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH 420

Query: 421  IRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPAL 480
            IRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPAL
Sbjct: 421  IRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPAL 480

Query: 481  HLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWT 540
            HLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWT
Sbjct: 481  HLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWT 540

Query: 541  GWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGV 600
            GWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGV
Sbjct: 541  GWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGV 600

Query: 601  DEEVKDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDVK 660
            DEEVKDTSTWARWSKEESLRDCEQWMPIREKS DVIHDEGMPDGETNEQLQRVILYEDVK
Sbjct: 601  DEEVKDTSTWARWSKEESLRDCEQWMPIREKS-DVIHDEGMPDGETNEQLQRVILYEDVK 660

Query: 661  EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH 720
            EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH
Sbjct: 661  EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH 720

Query: 721  DVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAE 780
            DVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAE
Sbjct: 721  DVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAE 780

Query: 781  ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSLA 840
            ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSLA
Sbjct: 781  ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSLA 840

Query: 841  SVESLPV-DQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQP 900
            SVESLPV DQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQP
Sbjct: 841  SVESLPVQDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQP 900

Query: 901  SSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAF 960
            SSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAF
Sbjct: 901  SSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAF 960

Query: 961  NMVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLE 1020
            NMVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLE
Sbjct: 961  NMVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLE 1020

Query: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRH 1080
            ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRH
Sbjct: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRH 1080

Query: 1081 SVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQ 1140
            SVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQ
Sbjct: 1081 SVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQ 1140

Query: 1141 EVMRDKELNLRTDIYEILLQEELIS 1164
            EVMRDKELNLRTDIYEILLQEELIS
Sbjct: 1141 EVMRDKELNLRTDIYEILLQEELIS 1164

BLAST of Cp4.1LG20g01730 vs. NCBI nr
Match: KAG7019890.1 (Protein NRDE2-like protein [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 2254 bits (5842), Expect = 0.0
Identity = 1147/1164 (98.54%), Positives = 1156/1164 (99.31%), Query Frame = 0

Query: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60
            MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA
Sbjct: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60

Query: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKR 120
            LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDD S+HEKRKKR
Sbjct: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDDSDHEKRKKR 120

Query: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM 180
            KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM
Sbjct: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM 180

Query: 181  DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER 240
            DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER
Sbjct: 181  DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER 240

Query: 241  HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD 300
            HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTRE+PH+
Sbjct: 241  HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTRENPHE 300

Query: 301  EKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD 360
            EKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD
Sbjct: 301  EKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD 360

Query: 361  TIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH 420
            TIDV+ISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH
Sbjct: 361  TIDVVISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH 420

Query: 421  IRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPAL 480
            IRQANQT KPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELAT LFQAEIEFSLFCPAL
Sbjct: 421  IRQANQTTKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATGLFQAEIEFSLFCPAL 480

Query: 481  HLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWT 540
            HLNDRSKQRLFEHFWNTNAERVGEEGA+ WSTWLEKEEENRQKVMREEEALEADEKGGWT
Sbjct: 481  HLNDRSKQRLFEHFWNTNAERVGEEGALSWSTWLEKEEENRQKVMREEEALEADEKGGWT 540

Query: 541  GWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGV 600
            GWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGIN DAGV
Sbjct: 541  GWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINPDAGV 600

Query: 601  DEEVKDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDVK 660
            DEEVKDTSTWARWSKEESLRDCEQWMPIRE SADVIHDEGMPDGETNEQ QRVILYEDVK
Sbjct: 601  DEEVKDTSTWARWSKEESLRDCEQWMPIRENSADVIHDEGMPDGETNEQFQRVILYEDVK 660

Query: 661  EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH 720
            EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH
Sbjct: 661  EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH 720

Query: 721  DVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAE 780
            DVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAE
Sbjct: 721  DVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAE 780

Query: 781  ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSLA 840
            ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREAT+GNIDHARKVFDM+LA
Sbjct: 781  ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMALA 840

Query: 841  SVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQPS 900
            SVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLG+GDSYSPFKCQPS
Sbjct: 841  SVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGNGDSYSPFKCQPS 900

Query: 901  SLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAFN 960
            SLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAFN
Sbjct: 901  SLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAFN 960

Query: 961  MVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLEI 1020
            MVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAF+EI
Sbjct: 961  MVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFVEI 1020

Query: 1021 SYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRHS 1080
            SYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRHS
Sbjct: 1021 SYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRHS 1080

Query: 1081 VLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQE 1140
            VLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQE
Sbjct: 1081 VLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQE 1140

Query: 1141 VMRDKELNLRTDIYEILLQEELIS 1164
            VMRDKELNLRTDIYEILLQEE IS
Sbjct: 1141 VMRDKELNLRTDIYEILLQEEFIS 1164

BLAST of Cp4.1LG20g01730 vs. NCBI nr
Match: KAG6584297.1 (Nuclear exosome regulator NRDE2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 2251 bits (5832), Expect = 0.0
Identity = 1147/1164 (98.54%), Positives = 1154/1164 (99.14%), Query Frame = 0

Query: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60
            MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA
Sbjct: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60

Query: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKR 120
            LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDD S+HEKRKKR
Sbjct: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDDSDHEKRKKR 120

Query: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM 180
            KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM
Sbjct: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM 180

Query: 181  DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER 240
            DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER
Sbjct: 181  DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER 240

Query: 241  HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD 300
            HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD
Sbjct: 241  HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD 300

Query: 301  EKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD 360
            EKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD
Sbjct: 301  EKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD 360

Query: 361  TIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH 420
            TIDV+ISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH
Sbjct: 361  TIDVVISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH 420

Query: 421  IRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPAL 480
            IRQANQT KPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELAT LFQAEIEFSLFCPAL
Sbjct: 421  IRQANQTTKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATGLFQAEIEFSLFCPAL 480

Query: 481  HLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWT 540
            HLNDRSKQRLFEHFWNTNAERVGEEGA+ WSTWLEKEEENRQKVMREEEALEADEKGGWT
Sbjct: 481  HLNDRSKQRLFEHFWNTNAERVGEEGALSWSTWLEKEEENRQKVMREEEALEADEKGGWT 540

Query: 541  GWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGV 600
            GWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGIN DAGV
Sbjct: 541  GWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINPDAGV 600

Query: 601  DEEVKDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDVK 660
            DEEVKDTSTWARWSKEESLRDCEQWMPIRE S DVIHDEGMPDGETNEQ QRVILYEDVK
Sbjct: 601  DEEVKDTSTWARWSKEESLRDCEQWMPIRENS-DVIHDEGMPDGETNEQFQRVILYEDVK 660

Query: 661  EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH 720
            EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH
Sbjct: 661  EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH 720

Query: 721  DVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAE 780
            DVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAE
Sbjct: 721  DVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAE 780

Query: 781  ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSLA 840
            ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREAT+GNIDHARKVFDM+LA
Sbjct: 781  ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMALA 840

Query: 841  SVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQPS 900
            SVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLG+GDSYSPFKCQPS
Sbjct: 841  SVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGNGDSYSPFKCQPS 900

Query: 901  SLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAFN 960
            SLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSV LISSAALFEELTTGYNAGLEVLDQAFN
Sbjct: 901  SLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVVLISSAALFEELTTGYNAGLEVLDQAFN 960

Query: 961  MVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLEI 1020
            MVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAF+EI
Sbjct: 961  MVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFVEI 1020

Query: 1021 SYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRHS 1080
            SYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRHS
Sbjct: 1021 SYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRHS 1080

Query: 1081 VLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQE 1140
            VLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQE
Sbjct: 1081 VLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQE 1140

Query: 1141 VMRDKELNLRTDIYEILLQEELIS 1164
            VMRDKELNLRTDIYEILLQEE IS
Sbjct: 1141 VMRDKELNLRTDIYEILLQEEFIS 1163

BLAST of Cp4.1LG20g01730 vs. ExPASy TrEMBL
Match: A0A6J1E7N7 (protein NRDE2 homolog isoform X5 OS=Cucurbita moschata OX=3662 GN=LOC111431485 PE=3 SV=1)

HSP 1 Score: 2251 bits (5832), Expect = 0.0
Identity = 1149/1165 (98.63%), Positives = 1155/1165 (99.14%), Query Frame = 0

Query: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60
            MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA
Sbjct: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60

Query: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKR 120
            LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDD S+HEKRKKR
Sbjct: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDDSDHEKRKKR 120

Query: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADA-DGRPSKDYYFDSNGDRDNLAFGSLYR 180
            KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADA DGRPSKDYYFDSNGDRDNLAFGSLYR
Sbjct: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADAADGRPSKDYYFDSNGDRDNLAFGSLYR 180

Query: 181  MDVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIE 240
            MDVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIE
Sbjct: 181  MDVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIE 240

Query: 241  RHKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPH 300
            RHKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPH
Sbjct: 241  RHKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPH 300

Query: 301  DEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKR 360
            DEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKR
Sbjct: 301  DEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKR 360

Query: 361  DTIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQ 420
            DTIDV+IS WEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQ
Sbjct: 361  DTIDVVISTWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQ 420

Query: 421  HIRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPA 480
            HIRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPA
Sbjct: 421  HIRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPA 480

Query: 481  LHLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGW 540
            LHLNDRSKQRLFEHFWNT+AERVGEEGA+GWSTWLEKEEENRQKVMREEEALEADEKGGW
Sbjct: 481  LHLNDRSKQRLFEHFWNTDAERVGEEGALGWSTWLEKEEENRQKVMREEEALEADEKGGW 540

Query: 541  TGWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAG 600
            TGWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGIN DAG
Sbjct: 541  TGWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINPDAG 600

Query: 601  VDEEVKDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDV 660
            VDEEVKDTSTWARWSKEESLRDCEQWMPIRE  ADVIHDEGMPDGETNEQ QRVILYEDV
Sbjct: 601  VDEEVKDTSTWARWSKEESLRDCEQWMPIRENYADVIHDEGMPDGETNEQFQRVILYEDV 660

Query: 661  KEYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSV 720
            KEYLFSLISSEARLSLIYQLIEFFSGKI SRVASNSSSWMERILSLEVLPDDILHHLRSV
Sbjct: 661  KEYLFSLISSEARLSLIYQLIEFFSGKICSRVASNSSSWMERILSLEVLPDDILHHLRSV 720

Query: 721  HDVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIA 780
            HDVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIA
Sbjct: 721  HDVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIA 780

Query: 781  EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSL 840
            EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDM+L
Sbjct: 781  EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMAL 840

Query: 841  ASVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQP 900
            ASVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLG+GDSYSPFKCQP
Sbjct: 841  ASVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGNGDSYSPFKCQP 900

Query: 901  SSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAF 960
            SSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAF
Sbjct: 901  SSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAF 960

Query: 961  NMVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLE 1020
            NMVLPERRK SYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLE
Sbjct: 961  NMVLPERRKHSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLE 1020

Query: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRH 1080
            ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRH
Sbjct: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRH 1080

Query: 1081 SVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQ 1140
            SVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQ
Sbjct: 1081 SVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQ 1140

Query: 1141 EVMRDKELNLRTDIYEILLQEELIS 1164
            EVM DKELNLRTDIYEILLQEELIS
Sbjct: 1141 EVMHDKELNLRTDIYEILLQEELIS 1165

BLAST of Cp4.1LG20g01730 vs. ExPASy TrEMBL
Match: A0A6J1KM35 (protein NRDE2 homolog isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111495812 PE=3 SV=1)

HSP 1 Score: 2246 bits (5820), Expect = 0.0
Identity = 1143/1164 (98.20%), Positives = 1151/1164 (98.88%), Query Frame = 0

Query: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60
            MEAPAEEELPPEEQKPKTSLFPLPFVA+NPQSQISPPNSS PQWLCNSSFTTDLSVINDA
Sbjct: 1    MEAPAEEELPPEEQKPKTSLFPLPFVASNPQSQISPPNSSAPQWLCNSSFTTDLSVINDA 60

Query: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKR 120
            LSSQNNVYPSLSTDGDQEEAVE EGGPSVRPEVQKSSRSYELLESSASDD SEHEKRKKR
Sbjct: 61   LSSQNNVYPSLSTDGDQEEAVEGEGGPSVRPEVQKSSRSYELLESSASDDDSEHEKRKKR 120

Query: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM 180
            KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM
Sbjct: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM 180

Query: 181  DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER 240
            DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER
Sbjct: 181  DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER 240

Query: 241  HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD 300
            HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD
Sbjct: 241  HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD 300

Query: 301  EKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD 360
            EKAWL FAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD
Sbjct: 301  EKAWLTFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD 360

Query: 361  TIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH 420
            TIDV+ISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH
Sbjct: 361  TIDVVISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH 420

Query: 421  IRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPAL 480
            IRQANQT KPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELAT LFQAEIEFSLFCPAL
Sbjct: 421  IRQANQTTKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATGLFQAEIEFSLFCPAL 480

Query: 481  HLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWT 540
            HLNDRSKQRLFEHFWNTNAERVGEEGA+GWSTWLEKEEENRQKVMREEEALE DEKGGWT
Sbjct: 481  HLNDRSKQRLFEHFWNTNAERVGEEGALGWSTWLEKEEENRQKVMREEEALETDEKGGWT 540

Query: 541  GWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGV 600
            GWSDPAPKEKKNN+DAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGIN DAGV
Sbjct: 541  GWSDPAPKEKKNNEDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINPDAGV 600

Query: 601  DEEVKDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDVK 660
            DEEVKDTSTWARWSKEESLRDCEQWMPIRE SADVIHDEGMPDGETNEQ QRVILYEDVK
Sbjct: 601  DEEVKDTSTWARWSKEESLRDCEQWMPIRENSADVIHDEGMPDGETNEQFQRVILYEDVK 660

Query: 661  EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH 720
            EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH
Sbjct: 661  EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH 720

Query: 721  DVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAE 780
            DVLNKRQSSSSSFTLEVLVGGSDNLTQ SDMMKFLRNVILLCLTAFPRNFILEEAALIAE
Sbjct: 721  DVLNKRQSSSSSFTLEVLVGGSDNLTQTSDMMKFLRNVILLCLTAFPRNFILEEAALIAE 780

Query: 781  ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSLA 840
            ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDM+LA
Sbjct: 781  ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMALA 840

Query: 841  SVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQPS 900
            SVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLG+GDSYSPFKCQPS
Sbjct: 841  SVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGNGDSYSPFKCQPS 900

Query: 901  SLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAFN 960
            SLQLLRAHQGFKEKIRAVRS WLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAFN
Sbjct: 901  SLQLLRAHQGFKEKIRAVRSMWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAFN 960

Query: 961  MVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLEI 1020
            MVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLEI
Sbjct: 961  MVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLEI 1020

Query: 1021 SYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRHS 1080
            SYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMG+AGSPHRIRRLFEKALENDNLRHS
Sbjct: 1021 SYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGHAGSPHRIRRLFEKALENDNLRHS 1080

Query: 1081 VLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQE 1140
            V+LWRCYISYELNTACD SSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQE
Sbjct: 1081 VILWRCYISYELNTACDASSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQE 1140

Query: 1141 VMRDKELNLRTDIYEILLQEELIS 1164
            VMRDKELNLRTDIYEILLQEELIS
Sbjct: 1141 VMRDKELNLRTDIYEILLQEELIS 1164

BLAST of Cp4.1LG20g01730 vs. ExPASy TrEMBL
Match: A0A6J1EAU9 (protein NRDE2 homolog isoform X4 OS=Cucurbita moschata OX=3662 GN=LOC111431485 PE=3 SV=1)

HSP 1 Score: 2246 bits (5820), Expect = 0.0
Identity = 1149/1166 (98.54%), Positives = 1155/1166 (99.06%), Query Frame = 0

Query: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60
            MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA
Sbjct: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60

Query: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKR 120
            LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDD S+HEKRKKR
Sbjct: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDDSDHEKRKKR 120

Query: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADA-DGRPSKDYYFDSNGDRDNLAFGSLYR 180
            KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADA DGRPSKDYYFDSNGDRDNLAFGSLYR
Sbjct: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADAADGRPSKDYYFDSNGDRDNLAFGSLYR 180

Query: 181  MDVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIE 240
            MDVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIE
Sbjct: 181  MDVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIE 240

Query: 241  RHKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPH 300
            RHKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPH
Sbjct: 241  RHKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPH 300

Query: 301  DEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKR 360
            DEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKR
Sbjct: 301  DEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKR 360

Query: 361  DTIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQ 420
            DTIDV+IS WEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQ
Sbjct: 361  DTIDVVISTWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQ 420

Query: 421  HIRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPA 480
            HIRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPA
Sbjct: 421  HIRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPA 480

Query: 481  LHLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGW 540
            LHLNDRSKQRLFEHFWNT+AERVGEEGA+GWSTWLEKEEENRQKVMREEEALEADEKGGW
Sbjct: 481  LHLNDRSKQRLFEHFWNTDAERVGEEGALGWSTWLEKEEENRQKVMREEEALEADEKGGW 540

Query: 541  TGWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAG 600
            TGWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGIN DAG
Sbjct: 541  TGWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINPDAG 600

Query: 601  VDEEVKDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDV 660
            VDEEVKDTSTWARWSKEESLRDCEQWMPIRE  ADVIHDEGMPDGETNEQ QRVILYEDV
Sbjct: 601  VDEEVKDTSTWARWSKEESLRDCEQWMPIRENYADVIHDEGMPDGETNEQFQRVILYEDV 660

Query: 661  KEYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSV 720
            KEYLFSLISSEARLSLIYQLIEFFSGKI SRVASNSSSWMERILSLEVLPDDILHHLRSV
Sbjct: 661  KEYLFSLISSEARLSLIYQLIEFFSGKICSRVASNSSSWMERILSLEVLPDDILHHLRSV 720

Query: 721  HDVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIA 780
            HDVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIA
Sbjct: 721  HDVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIA 780

Query: 781  EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSL 840
            EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDM+L
Sbjct: 781  EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMAL 840

Query: 841  ASVESLPV-DQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQ 900
            ASVESLPV DQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLG+GDSYSPFKCQ
Sbjct: 841  ASVESLPVQDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGNGDSYSPFKCQ 900

Query: 901  PSSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQA 960
            PSSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQA
Sbjct: 901  PSSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQA 960

Query: 961  FNMVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFL 1020
            FNMVLPERRK SYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFL
Sbjct: 961  FNMVLPERRKHSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFL 1020

Query: 1021 EISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLR 1080
            EISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLR
Sbjct: 1021 EISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLR 1080

Query: 1081 HSVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDL 1140
            HSVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDL
Sbjct: 1081 HSVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDL 1140

Query: 1141 QEVMRDKELNLRTDIYEILLQEELIS 1164
            QEVM DKELNLRTDIYEILLQEELIS
Sbjct: 1141 QEVMHDKELNLRTDIYEILLQEELIS 1166
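
The percentages in each HSP summary line follow directly from the reported counts (matches divided by alignment length). A minimal sketch in Python, assuming a hypothetical helper named `parse_hsp_stats` that is not part of the database or of BLAST itself, recomputes them from the summary line of the alignment above:

```python
import re

# Matches summary lines such as
# "Identity = 1149/1166 (98.54%), Positives = 1155/1166 (99.06%), Query Frame = 0".
# The optional "i" also accepts the "Postives" spelling found in some report dumps.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: " + line)
    identical, ident_len, positive, pos_len = (int(g) for g in match.groups())
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": ident_len,
        # Percentages are count / alignment length, rounded to two decimals.
        "identity_pct": round(100 * identical / ident_len, 2),
        "positives_pct": round(100 * positive / pos_len, 2),
    }

stats = parse_hsp_stats(
    "Identity = 1149/1166 (98.54%), Positives = 1155/1166 (99.06%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # 98.54 99.06
```

The alignment length (1166) exceeds the last query position (1164) because gap columns in the query are counted as alignment positions.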

BLAST of Cp4.1LG20g01730 vs. ExPASy TrEMBL
Match: A0A6J1E7Z6 (protein NRDE2 homolog isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111431485 PE=3 SV=1)

HSP 1 Score: 2245 bits (5817), Expect = 0.0
Identity = 1145/1162 (98.54%), Positives = 1152/1162 (99.14%), Query Frame = 0

Query: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60
            MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA
Sbjct: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60

Query: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKR 120
            LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDD S+HEKRKKR
Sbjct: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDDSDHEKRKKR 120

Query: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADA-DGRPSKDYYFDSNGDRDNLAFGSLYR 180
            KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADA DGRPSKDYYFDSNGDRDNLAFGSLYR
Sbjct: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADAADGRPSKDYYFDSNGDRDNLAFGSLYR 180

Query: 181  MDVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIE 240
            MDVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIE
Sbjct: 181  MDVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIE 240

Query: 241  RHKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPH 300
            RHKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPH
Sbjct: 241  RHKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPH 300

Query: 301  DEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKR 360
            DEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKR
Sbjct: 301  DEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKR 360

Query: 361  DTIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQ 420
            DTIDV+IS WEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQ
Sbjct: 361  DTIDVVISTWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQ 420

Query: 421  HIRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPA 480
            HIRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPA
Sbjct: 421  HIRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPA 480

Query: 481  LHLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGW 540
            LHLNDRSKQRLFEHFWNT+AERVGEEGA+GWSTWLEKEEENRQKVMREEEALEADEKGGW
Sbjct: 481  LHLNDRSKQRLFEHFWNTDAERVGEEGALGWSTWLEKEEENRQKVMREEEALEADEKGGW 540

Query: 541  TGWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAG 600
            TGWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGIN DAG
Sbjct: 541  TGWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINPDAG 600

Query: 601  VDEEVKDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDV 660
            VDEEVKDTSTWARWSKEESLRDCEQWMPIRE  ADVIHDEGMPDGETNEQ QRVILYEDV
Sbjct: 601  VDEEVKDTSTWARWSKEESLRDCEQWMPIRENYADVIHDEGMPDGETNEQFQRVILYEDV 660

Query: 661  KEYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSV 720
            KEYLFSLISSEARLSLIYQLIEFFSGKI SRVASNSSSWMERILSLEVLPDDILHHLRSV
Sbjct: 661  KEYLFSLISSEARLSLIYQLIEFFSGKICSRVASNSSSWMERILSLEVLPDDILHHLRSV 720

Query: 721  HDVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIA 780
            HDVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIA
Sbjct: 721  HDVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIA 780

Query: 781  EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSL 840
            EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDM+L
Sbjct: 781  EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMAL 840

Query: 841  ASVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQP 900
            ASVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLG+GDSYSPFKCQP
Sbjct: 841  ASVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGNGDSYSPFKCQP 900

Query: 901  SSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAF 960
            SSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAF
Sbjct: 901  SSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAF 960

Query: 961  NMVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLE 1020
            NMVLPERRK SYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLE
Sbjct: 961  NMVLPERRKHSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLE 1020

Query: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRH 1080
            ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRH
Sbjct: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRH 1080

Query: 1081 SVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQ 1140
            SVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQ
Sbjct: 1081 SVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQ 1140

Query: 1141 EVMRDKELNLRTDIYEILLQEE 1161
            EVM DKELNLRTDIYEILLQE+
Sbjct: 1141 EVMHDKELNLRTDIYEILLQED 1162
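
In each alignment block the unlabeled middle line encodes the comparison: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch in Python (the function name `count_midline` is an assumption for illustration, not a BLAST utility), the identity and positive counts for one block can be tallied from that line:

```python
def count_midline(midline):
    """Count identities and positives in one BLAST alignment midline.

    Letters are identical residues, '+' is a conservative substitution,
    and spaces are mismatches or gap columns.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives

# Midline of the final block above (query positions 1141 to 1161).
identities, positives = count_midline("EVM DKELNLRTDIYEILLQE+")
print(identities, positives)  # 20 21
```

Summing these tallies over every block of an HSP should reproduce the counts in its "Identity" and "Positives" summary line.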

BLAST of Cp4.1LG20g01730 vs. ExPASy TrEMBL
Match: A0A6J1KNL6 (protein NRDE2 homolog isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111495812 PE=3 SV=1)

HSP 1 Score: 2241 bits (5808), Expect = 0.0
Identity = 1143/1165 (98.11%), Positives = 1151/1165 (98.80%), Query Frame = 0

Query: 1    MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60
            MEAPAEEELPPEEQKPKTSLFPLPFVA+NPQSQISPPNSS PQWLCNSSFTTDLSVINDA
Sbjct: 1    MEAPAEEELPPEEQKPKTSLFPLPFVASNPQSQISPPNSSAPQWLCNSSFTTDLSVINDA 60

Query: 61   LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDGSEHEKRKKR 120
            LSSQNNVYPSLSTDGDQEEAVE EGGPSVRPEVQKSSRSYELLESSASDD SEHEKRKKR
Sbjct: 61   LSSQNNVYPSLSTDGDQEEAVEGEGGPSVRPEVQKSSRSYELLESSASDDDSEHEKRKKR 120

Query: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM 180
            KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM
Sbjct: 121  KKKKRRRRNEYEEKKGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRM 180

Query: 181  DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER 240
            DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER
Sbjct: 181  DVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIER 240

Query: 241  HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD 300
            HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD
Sbjct: 241  HKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHPHD 300

Query: 301  EKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD 360
            EKAWL FAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD
Sbjct: 301  EKAWLTFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRD 360

Query: 361  TIDVLISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH 420
            TIDV+ISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH
Sbjct: 361  TIDVVISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQH 420

Query: 421  IRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPAL 480
            IRQANQT KPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELAT LFQAEIEFSLFCPAL
Sbjct: 421  IRQANQTTKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATGLFQAEIEFSLFCPAL 480

Query: 481  HLNDRSKQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWT 540
            HLNDRSKQRLFEHFWNTNAERVGEEGA+GWSTWLEKEEENRQKVMREEEALE DEKGGWT
Sbjct: 481  HLNDRSKQRLFEHFWNTNAERVGEEGALGWSTWLEKEEENRQKVMREEEALETDEKGGWT 540

Query: 541  GWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGV 600
            GWSDPAPKEKKNN+DAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGIN DAGV
Sbjct: 541  GWSDPAPKEKKNNEDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINPDAGV 600

Query: 601  DEEVKDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDVK 660
            DEEVKDTSTWARWSKEESLRDCEQWMPIRE SADVIHDEGMPDGETNEQ QRVILYEDVK
Sbjct: 601  DEEVKDTSTWARWSKEESLRDCEQWMPIRENSADVIHDEGMPDGETNEQFQRVILYEDVK 660

Query: 661  EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH 720
            EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH
Sbjct: 661  EYLFSLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVH 720

Query: 721  DVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAE 780
            DVLNKRQSSSSSFTLEVLVGGSDNLTQ SDMMKFLRNVILLCLTAFPRNFILEEAALIAE
Sbjct: 721  DVLNKRQSSSSSFTLEVLVGGSDNLTQTSDMMKFLRNVILLCLTAFPRNFILEEAALIAE 780

Query: 781  ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSLA 840
            ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDM+LA
Sbjct: 781  ELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMALA 840

Query: 841  SVESLPV-DQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGSGDSYSPFKCQP 900
            SVESLPV DQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLG+GDSYSPFKCQP
Sbjct: 841  SVESLPVQDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGNGDSYSPFKCQP 900

Query: 901  SSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAF 960
            SSLQLLRAHQGFKEKIRAVRS WLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAF
Sbjct: 901  SSLQLLRAHQGFKEKIRAVRSMWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAF 960

Query: 961  NMVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLE 1020
            NMVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLE
Sbjct: 961  NMVLPERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLE 1020

Query: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRH 1080
            ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMG+AGSPHRIRRLFEKALENDNLRH
Sbjct: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGHAGSPHRIRRLFEKALENDNLRH 1080

Query: 1081 SVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQ 1140
            SV+LWRCYISYELNTACD SSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQ
Sbjct: 1081 SVILWRCYISYELNTACDASSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQ 1140

Query: 1141 EVMRDKELNLRTDIYEILLQEELIS 1164
            EVMRDKELNLRTDIYEILLQEELIS
Sbjct: 1141 EVMRDKELNLRTDIYEILLQEELIS 1165

BLAST of Cp4.1LG20g01730 vs. TAIR 10
Match: AT3G17740.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1740 (InterPro:IPR013633); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G17712.1); Has 409 Blast hits to 335 proteins in 133 species: Archae - 1; Bacteria - 0; Metazoa - 140; Fungi - 188; Plants - 42; Viruses - 0; Other Eukaryotes - 38 (source: NCBI BLink). )

HSP 1 Score: 1167.9 bits (3020), Expect = 0.0e+00
Identity = 634/1156 (54.84%), Positives = 824/1156 (71.28%), Query Frame = 0

Query: 20   LFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDALSSQNNVYPSLSTDGDQEE 79
            LFP+   + N  S I    S+ PQWL N+SFTTDLSVIN A S+     PS S++ +  +
Sbjct: 18   LFPVFPTSANSISAI----SNAPQWLRNASFTTDLSVINAAASTA----PS-SSEVEAGD 77

Query: 80   AVEDEGGPSVRPEVQKSSRSYELLESSAS-DDGSEHEKRKKRKKKKRRRRNEYEEKKGFG 139
              ++EGG      +   +R Y L+E   S +   +  KRK+ KKKKR+  N  +E +   
Sbjct: 78   DEDEEGGADGNIGLANQARVYNLVEEEGSLESDDDKVKRKREKKKKRKSDNASDESR--- 137

Query: 140  EYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRMDVARYRPLNHGERPGLNF 199
               SRKSD     +   +P KDYY D+  D DNLA+GS+YRM+V RY+  N    PG   
Sbjct: 138  ---SRKSD-----EYYSKPVKDYYLDTRPDPDNLAYGSIYRMNVPRYKLDNSQRVPGSGS 197

Query: 200  NGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIERHKNFKRVRIGFSRKTPDK 259
              F   N+ SS LD + D + L+ + KS  RYW AK+AA+ER+KNFKR+R+  + +  D 
Sbjct: 198  LRFYLRNRRSSMLDTEIDIDSLEGRAKSDTRYWYAKHAAMERNKNFKRIRLSAASEAVDS 257

Query: 260  LLDDFIPFSDSQTSNNIEE------------SWEDEVLRKTREFNKLTREHPHDEKAWLA 319
              D+FIP  +  T    +E            SWEDEVL KTREFN++TRE PHD KAWLA
Sbjct: 258  SFDNFIPLEEDVTVPESDEEDVLSKDSMIGASWEDEVLNKTREFNRVTRERPHDAKAWLA 317

Query: 320  FAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRDTIDVLI 379
            FA+FQDKV++MQ QKG RLQTLEKKISILEKA ELNP++EELLL LLK Y+ RD  DVLI
Sbjct: 318  FADFQDKVSSMQSQKGVRLQTLEKKISILEKAFELNPDSEELLLALLKAYRSRDNADVLI 377

Query: 380  SRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQHIRQANQ 439
            SRWEK LMQNS SYKLWREFL ++QGEFSRFKVS++R++Y++AIQALS+AC++  RQ + 
Sbjct: 378  SRWEKALMQNSASYKLWREFLCVVQGEFSRFKVSEVRRLYSYAIQALSSACSKRHRQVDT 437

Query: 440  TAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPALHLNDRS 499
            T++P ++   IQ EL LVD+ +SLCRFEWQAGYQELATAL QAE+EFS+F P+L L ++S
Sbjct: 438  TSEP-LDSAAIQQELVLVDMLVSLCRFEWQAGYQELATALLQAEVEFSIFSPSLLLTEQS 497

Query: 500  KQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWTGWSDPA 559
            K RLFEHFW++N  RVGEEGA GW  WLEKEEENRQK+++EE + + +E GGWTGW++  
Sbjct: 498  KLRLFEHFWSSNGARVGEEGAFGWLLWLEKEEENRQKILKEESS-DDNEVGGWTGWTEQV 557

Query: 560  PKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGVDEEVKD 619
                 ++  +  T EV V   + +++++E+E+++ ED TEA+LK+LGI+ +    +EVKD
Sbjct: 558  SGRNGDDIASANTGEVDV-DRKGLDEEMEDENSKPEDDTEAMLKLLGIDVNTAASDEVKD 617

Query: 620  TSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDVKEYLFSL 679
            TSTW +W +EE  RD  QWMP R K+ +    EGM +GE  EQL  V+LYED+  YLFSL
Sbjct: 618  TSTWVKWFEEEVSRDHSQWMPTR-KAGEFSSVEGMGEGEDEEQLSSVVLYEDINGYLFSL 677

Query: 680  ISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVHDVLNKR 739
             S EARLSL+YQ I+FF   I    +SNS SW E+I SLE   D +L +LRSVH+ L+K 
Sbjct: 678  RSKEARLSLVYQFIDFFGAHISPWTSSNSLSWSEKISSLETFSDSMLENLRSVHECLSK- 737

Query: 740  QSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAEELFVTK 799
              S++ F+L  L+GGS +L+  ++MMKFLRN ILLCL  FPRN+ILEEA L+AEELFVT 
Sbjct: 738  SDSANCFSLGSLLGGSCDLSMRTEMMKFLRNAILLCLNVFPRNYILEEAVLVAEELFVTN 797

Query: 800  MNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSLASVESLP 859
            M +C  +  PC++LAK LLKSDRQD+LLCGVYA+REA  GN+ HAR+VFDM+L S+  LP
Sbjct: 798  MKTCEVATMPCQALAKRLLKSDRQDLLLCGVYAQREAASGNMKHARRVFDMALTSICGLP 857

Query: 860  VDQKSNAPLLYFWYAELELAKDPHNGHD--SVNRAVHILSCLGSGDSYSPFKCQPSSLQL 919
             + + N PLL  WYAE E+A    +G D  S +RA+HIL  LGSG +YSP+  Q SS+Q+
Sbjct: 858  KELQCNTPLLCLWYAESEVANSSGSGRDTESSSRAMHILCYLGSGLAYSPYTSQSSSMQI 917

Query: 920  LRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAFNMVLP 979
            LRA QGF+EK++ ++STW HGV DD S AL+ SAALFEELT      LE+L+  F+ VLP
Sbjct: 918  LRARQGFREKLKKIQSTWSHGVTDDQSAALVCSAALFEELTNDLPGALEILEHMFSSVLP 977

Query: 980  ERRKQSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFLEISYIY 1039
             R+ QS+QLE LFNYYV+ML RH   L+  ++ + IS+GLQ YPLNPELY A ++I    
Sbjct: 978  GRKSQSHQLELLFNYYVRMLQRHQDDLTLSQLWKPISEGLQLYPLNPELYRALVDICNHR 1037

Query: 1040 SVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLRHSVLLW 1099
                KLR  FDDY +K  S+++W+FALS+E+   GS HRIR LFE+AL  D   +SV+LW
Sbjct: 1038 MTSHKLRMMFDDYSRKNSSVVVWLFALSYELSKGGSSHRIRGLFERALAQDTQNNSVILW 1097

Query: 1100 RCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDLQEVMRD 1159
            RCYI+YE++ A +PS+A+R++FRAI++CPWSKKLWLDGF KL S+L+AKE+SDLQEVMRD
Sbjct: 1098 RCYIAYEIDIADNPSAARRIYFRAINACPWSKKLWLDGFGKLGSVLTAKEMSDLQEVMRD 1148

Query: 1160 KELNLRTDIYEILLQE 1161
            KELN+RTDIYEILL +
Sbjct: 1158 KELNIRTDIYEILLMQ 1148

BLAST of Cp4.1LG20g01730 vs. TAIR 10
Match: AT3G17712.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1740 (InterPro:IPR013633); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G17740.1); Has 265 Blast hits to 264 proteins in 123 species: Archae - 1; Bacteria - 0; Metazoa - 116; Fungi - 89; Plants - 33; Viruses - 0; Other Eukaryotes - 26 (source: NCBI BLink). )

HSP 1 Score: 709.9 bits (1831), Expect = 3.3e-204
Identity = 413/786 (52.54%), Positives = 538/786 (68.45%), Query Frame = 0

Query: 20  LFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDALSSQNNVYPSLSTDGDQEE 79
           LFP+   + N  S I    S+ PQWL N+SFTTDLSVIN A S+     PS S++ +  +
Sbjct: 18  LFPVFPTSANSISAI----SNAPQWLRNASFTTDLSVINAAASTA----PS-SSEVEAGD 77

Query: 80  AVEDEGGPSVRPEVQKSSRSYELLESSAS-DDGSEHEKRKKRKKKKRRRRNEYEEKKGFG 139
             ++EGG      +   +R Y L+E   S +   +  KRK+ KKKKR+  N  +E +   
Sbjct: 78  DEDEEGGADGNIGLANQARVYNLVEEEGSLESDDDKVKRKREKKKKRKSDNASDESR--- 137

Query: 140 EYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRMDVARYRPLNHGERPGLNF 199
              SRKSD     +   +P KDYY D+  D DNLA+GS+YRM+V RY+  N    PG   
Sbjct: 138 ---SRKSD-----EYYSKPVKDYYLDTRPDPDNLAYGSIYRMNVPRYKLDNSQRVPGSGS 197

Query: 200 NGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIERHKNFKRVRIGFSRKTPDK 259
             F   N+ SS LD + D + L+ + KS  RYW AK+AA+ER+KNFKR+R+  + +  D 
Sbjct: 198 LRFYLRNRRSSMLDTEIDIDSLEGRAKSDTRYWYAKHAAMERNKNFKRIRLSAASEAVDS 257

Query: 260 LLDDFIPFSDSQTSNNIEE------------SWEDEVLRKTREFNKLTREHPHDEKAWLA 319
             D+FIP  +  T    +E            SWEDEVL KTREFN++TRE PHD KAWLA
Sbjct: 258 SFDNFIPLEEDVTVPESDEEDVLSKDSMIGASWEDEVLNKTREFNRVTRERPHDAKAWLA 317

Query: 320 FAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRDTIDVLI 379
           FA+FQDKV++MQ QKG RLQTLEKKISILEKA ELNP++EELLL LLK Y+ RD  DVLI
Sbjct: 318 FADFQDKVSSMQSQKGVRLQTLEKKISILEKAFELNPDSEELLLALLKAYRSRDNADVLI 377

Query: 380 SRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQHIRQANQ 439
           SRWEK LMQNS SYKLWREFL ++QGEFSRFKVS++R++Y++AIQALS+AC++  RQ + 
Sbjct: 378 SRWEKALMQNSASYKLWREFLCVVQGEFSRFKVSEVRRLYSYAIQALSSACSKRHRQVDT 437

Query: 440 TAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPALHLNDRS 499
           T++P ++   IQ EL LVD+ +SLCRFEWQAGYQELATAL QAE+EFS+F P+L L ++S
Sbjct: 438 TSEP-LDSAAIQQELVLVDMLVSLCRFEWQAGYQELATALLQAEVEFSIFSPSLLLTEQS 497

Query: 500 KQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWTGWSDPA 559
           K RLFEHFW++N  RVGEEGA GW  WLEKEEENRQK+++EE + + +E GGWTGW++  
Sbjct: 498 KLRLFEHFWSSNGARVGEEGAFGWLLWLEKEEENRQKILKEESS-DDNEVGGWTGWTEQV 557

Query: 560 PKEKKNNDD--AETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGVDEEV 619
               +N DD  +  T EV V   + +++++E+E+++ ED TEA+LK+LGI+ +    +EV
Sbjct: 558 --SGRNGDDLASANTGEVDV-DRKGLDEEMEDENSKPEDDTEAMLKLLGIDVNTAASDEV 617

Query: 620 KDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDVKEYLF 679
           KDTSTW  W +EE  RD  QWMP R K+ +    EGM +GE  EQL  V+LYED+  YLF
Sbjct: 618 KDTSTWVEWFEEEVSRDHSQWMPTR-KAGEFSSVEGMGEGEDEEQLSSVVLYEDINGYLF 677

Query: 680 SLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVHDVLN 739
           SL S EARLSL+YQ I+FF   I         SW E+I SLE L D +L +LRSVH+ L+
Sbjct: 678 SLRSKEARLSLVYQFIDFFGAHISPMDFQQQLSWSEKISSLETLSDSMLENLRSVHECLS 737

Query: 740 KRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAEELFV 791
           K   S++ F+L  L+GGS +L+  ++MMKFLRN ILLCL  FP+N+I EEA L+ EELFV
Sbjct: 738 K-SDSANCFSLGSLLGGSCDLSMRTEMMKFLRNAILLCLNVFPQNYIPEEAVLVTEELFV 776

BLAST of Cp4.1LG20g01730 vs. TAIR 10
Match: AT3G17712.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1740 (InterPro:IPR013633); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G17740.1). )

HSP 1 Score: 692.2 bits (1785), Expect = 7.2e-199
Identity = 416/842 (49.41%), Positives = 546/842 (64.85%), Query Frame = 0

Query: 20  LFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDALSSQNNVYPSLSTDGDQEE 79
           LFP+   + N  S I    S+ PQWL N+SFTTDLSVIN A S+     PS S++ +  +
Sbjct: 49  LFPVFPTSANSISAI----SNAPQWLRNASFTTDLSVINAAASTA----PS-SSEVEAGD 108

Query: 80  AVEDEGGPSVRPEVQKSSRSYELLESSAS-DDGSEHEKRKKRKKKKRRRRNEYEEKKGFG 139
             ++EGG      +   +R Y L+E   S +   +  KRK+ KKKKR+  N  +E +   
Sbjct: 109 DEDEEGGADGNIGLANQARVYNLVEEEGSLESDDDKVKRKREKKKKRKSDNASDESR--- 168

Query: 140 EYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRMDVARYRPLNHGERPGLNF 199
              SRKSD     +   +P KDYY D+  D DNLA+GS+YRM+V RY+  N    PG   
Sbjct: 169 ---SRKSD-----EYYSKPVKDYYLDTRPDPDNLAYGSIYRMNVPRYKLDNSQRVPGSGS 228

Query: 200 NGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIERHKNFKRVRIGFSRKTPDK 259
             F   N+ SS LD + D + L+ + KS  RYW AK+AA+ER+KNFKR+R+  + +  D 
Sbjct: 229 LRFYLRNRRSSMLDTEIDIDSLEGRAKSDTRYWYAKHAAMERNKNFKRIRLSAASEAVDS 288

Query: 260 LLDDFIPFSDSQTSNNIEE------------SWEDEVLRKTREFNKLTREHPHDEKAWLA 319
             D+FIP  +  T    +E            SWEDEVL KTREFN++TRE PHD KAWLA
Sbjct: 289 SFDNFIPLEEDVTVPESDEEDVLSKDSMIGASWEDEVLNKTREFNRVTRERPHDAKAWLA 348

Query: 320 FAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRDTIDVLI 379
           FA+FQDKV++MQ QKG RLQTLEKKISILEKA ELNP++EELLL LLK Y+ RD  DVLI
Sbjct: 349 FADFQDKVSSMQSQKGVRLQTLEKKISILEKAFELNPDSEELLLALLKAYRSRDNADVLI 408

Query: 380 SRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQHIRQANQ 439
                                     EFSRFKVS++R++Y++AIQALS+AC++  RQ + 
Sbjct: 409 R-------------------------EFSRFKVSEVRRLYSYAIQALSSACSKRHRQVDT 468

Query: 440 TAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPALHLNDRS 499
           T++P ++   IQ EL LVD+ +SLCRFEWQAGYQELATAL QAE+EFS+F P+L L ++S
Sbjct: 469 TSEP-LDSAAIQQELVLVDMLVSLCRFEWQAGYQELATALLQAEVEFSIFSPSLLLTEQS 528

Query: 500 KQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWTGWSDPA 559
           K RLFEHFW++N  RVGEEGA GW  WLEKEEENRQK+++EE + + +E GGWTGW++  
Sbjct: 529 KLRLFEHFWSSNGARVGEEGAFGWLLWLEKEEENRQKILKEESS-DDNEVGGWTGWTEQV 588

Query: 560 PKEKKNNDD--AETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGVDEEV 619
               +N DD  +  T EV V   + +++++E+E+++ ED TEA+LK+LGI+ +    +EV
Sbjct: 589 --SGRNGDDLASANTGEVDV-DRKGLDEEMEDENSKPEDDTEAMLKLLGIDVNTAASDEV 648

Query: 620 KDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDVKEYLF 679
           KDTSTW  W +EE  RD  QWMP R K+ +    EGM +GE  EQL  V+LYED+  YLF
Sbjct: 649 KDTSTWVEWFEEEVSRDHSQWMPTR-KAGEFSSVEGMGEGEDEEQLSSVVLYEDINGYLF 708

Query: 680 SLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVHDVLN 739
           SL S EARLSL+YQ I+FF   I         SW E+I SLE L D +L +LRSVH+ L+
Sbjct: 709 SLRSKEARLSLVYQFIDFFGAHISPMDFQQQLSWSEKISSLETLSDSMLENLRSVHECLS 768

Query: 740 KRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAEELFV 799
           K   S++ F+L  L+GGS +L+  ++MMKFLRN ILLCL  FP+N+I EEA L+ EELFV
Sbjct: 769 K-SDSANCFSLGSLLGGSCDLSMRTEMMKFLRNAILLCLNVFPQNYIPEEAVLVTEELFV 819

Query: 800 TKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMSLASVES 847
           T M +C                   +D+LLCGVYA+REA  GN+ HAR+VFDM+L S+  
Sbjct: 829 TNMKTC-------------------EDLLLCGVYAQREAASGNMKHARRVFDMALTSICG 819

BLAST of Cp4.1LG20g01730 vs. TAIR 10
Match: AT3G17712.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1740 (InterPro:IPR013633); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G17740.1). )

HSP 1 Score: 651.0 bits (1678), Expect = 1.8e-186
Identity = 392/786 (49.87%), Positives = 515/786 (65.52%), Query Frame = 0

Query: 20  LFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDALSSQNNVYPSLSTDGDQEE 79
           LFP+   + N  S I    S+ PQWL N+SFTTDLSVIN A S+     PS S++ +  +
Sbjct: 18  LFPVFPTSANSISAI----SNAPQWLRNASFTTDLSVINAAASTA----PS-SSEVEAGD 77

Query: 80  AVEDEGGPSVRPEVQKSSRSYELLESSAS-DDGSEHEKRKKRKKKKRRRRNEYEEKKGFG 139
             ++EGG      +   +R Y L+E   S +   +  KRK+ KKKKR+  N  +E +   
Sbjct: 78  DEDEEGGADGNIGLANQARVYNLVEEEGSLESDDDKVKRKREKKKKRKSDNASDESR--- 137

Query: 140 EYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRMDVARYRPLNHGERPGLNF 199
              SRKSD     +   +P KDYY D+  D DNLA+GS+YRM+V RY+  N    PG   
Sbjct: 138 ---SRKSD-----EYYSKPVKDYYLDTRPDPDNLAYGSIYRMNVPRYKLDNSQRVPGSGS 197

Query: 200 NGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAIERHKNFKRVRIGFSRKTPDK 259
             F   N+ SS LD + D + L+ + KS  RYW AK+AA+ER+KNFKR+R+  + +  D 
Sbjct: 198 LRFYLRNRRSSMLDTEIDIDSLEGRAKSDTRYWYAKHAAMERNKNFKRIRLSAASEAVDS 257

Query: 260 LLDDFIPFSDSQTSNNIEE------------SWEDEVLRKTREFNKLTREHPHDEKAWLA 319
             D+FIP  +  T    +E            SWEDEVL KTREFN++TRE PHD KAWLA
Sbjct: 258 SFDNFIPLEEDVTVPESDEEDVLSKDSMIGASWEDEVLNKTREFNRVTRERPHDAKAWLA 317

Query: 320 FAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQKRDTIDVLI 379
           FA+FQDKV++MQ QKG RLQTLEKKISILEKA ELNP++EELLL LLK Y+ RD  DVLI
Sbjct: 318 FADFQDKVSSMQSQKGVRLQTLEKKISILEKAFELNPDSEELLLALLKAYRSRDNADVLI 377

Query: 380 SRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACNQHIRQANQ 439
                                     EFSRFKVS++R++Y++AIQALS+AC++  RQ + 
Sbjct: 378 R-------------------------EFSRFKVSEVRRLYSYAIQALSSACSKRHRQVDT 437

Query: 440 TAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCPALHLNDRS 499
           T++P ++   IQ EL LVD+ +SLCRFEWQAGYQELATAL QAE+EFS+F P+L L ++S
Sbjct: 438 TSEP-LDSAAIQQELVLVDMLVSLCRFEWQAGYQELATALLQAEVEFSIFSPSLLLTEQS 497

Query: 500 KQRLFEHFWNTNAERVGEEGAIGWSTWLEKEEENRQKVMREEEALEADEKGGWTGWSDPA 559
           K RLFEHFW++N  RVGEEGA GW  WLEKEEENRQK+++EE + + +E GGWTGW++  
Sbjct: 498 KLRLFEHFWSSNGARVGEEGAFGWLLWLEKEEENRQKILKEESS-DDNEVGGWTGWTEQV 557

Query: 560 PKEKKNNDD--AETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINADAGVDEEV 619
               +N DD  +  T EV V   + +++++E+E+++ ED TEA+LK+LGI+ +    +EV
Sbjct: 558 --SGRNGDDLASANTGEVDV-DRKGLDEEMEDENSKPEDDTEAMLKLLGIDVNTAASDEV 617

Query: 620 KDTSTWARWSKEESLRDCEQWMPIREKSADVIHDEGMPDGETNEQLQRVILYEDVKEYLF 679
           KDTSTW  W +EE  RD  QWMP R K+ +    EGM +GE  EQL  V+LYED+  YLF
Sbjct: 618 KDTSTWVEWFEEEVSRDHSQWMPTR-KAGEFSSVEGMGEGEDEEQLSSVVLYEDINGYLF 677

Query: 680 SLISSEARLSLIYQLIEFFSGKIYSRVASNSSSWMERILSLEVLPDDILHHLRSVHDVLN 739
           SL S EARLSL+YQ I+FF   I         SW E+I SLE L D +L +LRSVH+ L+
Sbjct: 678 SLRSKEARLSLVYQFIDFFGAHISPMDFQQQLSWSEKISSLETLSDSMLENLRSVHECLS 737

Query: 740 KRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALIAEELFV 791
           K   S++ F+L  L+GGS +L+  ++MMKFLRN ILLCL  FP+N+I EEA L+ EELFV
Sbjct: 738 K-SDSANCFSLGSLLGGSCDLSMRTEMMKFLRNAILLCLNVFPQNYIPEEAVLVTEELFV 751

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q80XC6 | 9.8e-68 | 24.96 | Nuclear exosome regulator NRDE2 OS=Mus musculus OX=10090 GN=Nrde2 PE=1 SV=3
Q9H7Z3 | 9.2e-66 | 24.90 | Nuclear exosome regulator NRDE2 OS=Homo sapiens OX=9606 GN=NRDE2 PE=1 SV=3
Q54QP0 | 1.7e-35 | 20.92 | Nuclear exosome regulator NRDE2 OS=Dictyostelium discoideum OX=44689 GN=nrde2 PE...
O42975 | 3.8e-11 | 21.47 | Protein NRDE2 homolog OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=...
Match Name | E-value | Identity | Description
XP_023519328.1 | 0.0 | 100.00 | protein NRDE2 homolog isoform X2 [Cucurbita pepo subsp. pepo]
XP_023519327.1 | 0.0 | 99.91 | protein NRDE2 homolog isoform X1 [Cucurbita pepo subsp. pepo]
XP_023519329.1 | 0.0 | 99.83 | protein NRDE2 homolog isoform X3 [Cucurbita pepo subsp. pepo]
KAG7019890.1 | 0.0 | 98.54 | Protein NRDE2-like protein [Cucurbita argyrosperma subsp. argyrosperma]
KAG6584297.1 | 0.0 | 98.54 | Nuclear exosome regulator NRDE2, partial [Cucurbita argyrosperma subsp. sororia]
Match Name | E-value | Identity | Description
A0A6J1E7N7 | 0.0 | 98.63 | protein NRDE2 homolog isoform X5 OS=Cucurbita moschata OX=3662 GN=LOC111431485 P...
A0A6J1KM35 | 0.0 | 98.20 | protein NRDE2 homolog isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111495812 PE=...
A0A6J1EAU9 | 0.0 | 98.54 | protein NRDE2 homolog isoform X4 OS=Cucurbita moschata OX=3662 GN=LOC111431485 P...
A0A6J1E7Z6 | 0.0 | 98.54 | protein NRDE2 homolog isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111431485 P...
A0A6J1KNL6 | 0.0 | 98.11 | protein NRDE2 homolog isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111495812 PE=...
Match Name | E-value | Identity | Description
AT3G17740.1 | 0.0e+00 | 54.84 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G17712.1 | 3.3e-204 | 52.54 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G17712.2 | 7.2e-199 | 49.41 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G17712.3 | 1.8e-186 | 49.87 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 114..134
None | No IPR available | COILS | Coil | Coil | coord: 514..534
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 65..133
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 24..44
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 92..106
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..44
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 526..559
IPR003107 | HAT (Half-A-TPR) repeat | SMART | SM00386 | hat_new_1 | coord: 1059..1093, e-value: 8.2E-5, score: 32.0
IPR003107 | HAT (Half-A-TPR) repeat | SMART | SM00386 | hat_new_1 | coord: 913..944, e-value: 1200.0, score: 0.3
IPR003107 | HAT (Half-A-TPR) repeat | SMART | SM00386 | hat_new_1 | coord: 360..392, e-value: 940.0, score: 1.0
IPR003107 | HAT (Half-A-TPR) repeat | SMART | SM00386 | hat_new_1 | coord: 826..866, e-value: 94.0, score: 8.3
IPR003107 | HAT (Half-A-TPR) repeat | SMART | SM00386 | hat_new_1 | coord: 281..313, e-value: 74.0, score: 9.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 291..435, e-value: 4.0E-7, score: 32.1
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 967..1121, e-value: 5.6E-9, score: 38.1
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 984..1113
IPR013633 | siRNA-mediated silencing protein NRDE-2 | PFAM | PF08424 | NRDE-2 | coord: 286..684, e-value: 6.0E-87, score: 291.8
IPR013633 | siRNA-mediated silencing protein NRDE-2 | PANTHER | PTHR13471 | TETRATRICOPEPTIDE-LIKE HELICAL | coord: 26..1155
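
The SMART SM00386 (HAT repeat) hits listed in the InterPro annotations above have e-values ranging from 8.2E-5 to 1200.0, so they are not equally well supported. As a minimal sketch only, the following Python snippet keeps the hits at or below an e-value cutoff. The cutoff of 1e-3, the `HAT_HITS` list, and the `significant_hits` helper are illustrative assumptions and are not part of the database record; the coordinates, e-values, and scores are copied from the annotations above.

```python
# SMART SM00386 (HAT repeat) hits copied from the InterPro annotations:
# (start, end, e-value, score). The tuple layout is an assumption of this sketch.
HAT_HITS = [
    (1059, 1093, 8.2e-5, 32.0),
    (913, 944, 1200.0, 0.3),
    (360, 392, 940.0, 1.0),
    (826, 866, 94.0, 8.3),
    (281, 313, 74.0, 9.0),
]


def significant_hits(hits, max_evalue=1e-3):
    """Return hits with e-value <= max_evalue, sorted by start coordinate.

    max_evalue is a hypothetical cutoff chosen for illustration only.
    """
    return sorted(hit for hit in hits if hit[2] <= max_evalue)


for start, end, evalue, score in significant_hits(HAT_HITS):
    print(f"coord: {start}..{end}\te-value: {evalue:g}\tscore: {score}")
```

With this cutoff only the 1059..1093 repeat is retained; a looser cutoff would admit more of the listed repeats, so the threshold should be chosen to suit the analysis.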

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG20g01730.1 | Cp4.1LG20g01730.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006396 | RNA processing
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0005515 | protein binding