Sgr026353 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr026353
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Spatacsin_C domain-containing protein
Location: tig00153031: 4422013 .. 4458526 (-)
RNA-Seq Expression: Sgr026353
Synteny: Sgr026353
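
The Location value above packs a contig name, a start and end coordinate, and a strand into one string. As a minimal sketch of how that string could be split into fields, the Python snippet below parses it and computes the span length. The function name `parse_location` and the returned dictionary keys are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF-style annotations), which this page does not state.

```python
import re

# Pattern for strings such as "tig00153031: 4422013 .. 4458526 (-)".
# Illustrative only; not an official format definition for this database.
_LOCATION_RE = re.compile(
    r"\s*([^\s:]+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
)


def parse_location(location: str) -> dict:
    """Split a 'contig: start .. end (strand)' string into its fields.

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    match = _LOCATION_RE.fullmatch(location)
    if match is None:
        raise ValueError(f"Unrecognised location string: {location!r}")
    contig, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "contig": contig,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,  # assumption: inclusive coordinates
    }


if __name__ == "__main__":
    loc = parse_location("tig00153031: 4422013 .. 4458526 (-)")
    print(loc["contig"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene span on tig00153031 is 36,514 bp, and the `(-)` marks the feature as annotated on the reverse strand.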
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATAGTTATTAAGGTATAATCTACTTAATATAGAGGATACACCTCTCATGCTTAGATTTTACATAAAATATTTCTTAATGAAGGATGATTCACCCTAAAGCTTCATACTTTGCGCTTTTGTTAAAATATTTATTTCAATACCCTCCACTAAAAAATAAAAAGATTACATAAAACTTGTATAATTAACAAAAAGTTATCTTTGAATGTAAAAAAACAATACACAAGGTGCATGGATCAACTGCTCATCAATAAGCGTAAACAAAAGTTGAACTGACTCAATAAGCTAGAACCAACTCCATAGGTTTAGTCAAGTCAAGTTACAGTAATTCAAGTAGTTCAACTAGCTCAAATAATTCAAATACATTAAAAAGATTTATTCAATTCAAATTAGTTAACAAGTTATCAAACACAAATGAACCTTCTTTAAAGATTTTCACTTGTTGATAACGTCACCAATTAAACTGTTATACCGCACTAACATTTCTTCTACCATGATAACAAAAGTACAAGTGTCCTAAATAGATTAATCTCAAAGCATATATATATTTAATTCATTTAAGAAGAGGTTCACTGTGCTCATCTCAACAAAGTAGCACTTATGTAAAAATTCCATACGAGAACTTGAACGTAAACACTTCTAATTTGGAGATTTGAAGTGATTTGTATCGATCATCATATTTACTTTAAGGGAATAAAAAATAGGTGTAAGATGTAGAGGAACATACTTGTGGTATCAAGGTAACTCATAGTTGAAGGAGAGAAAATACTTGAGTATGTTAGACAGTTAAATAATGTCTTGACTAGTCACTCCATGTAAAGCTATTTTCTTTTTCTTAAATTATTGATTAATAAGCTAAGATGGGTTTTTTTAGTTAAGGTTCCAATGATAAAAAAATTGGTGTTGGAAACTTAGACATTTTATTCTCTCATGTGTAATCAATCACTATGTTCAATCAATATGATTACAAATTCCACAATAATCTTCTATAAATAAAATTTGGAGACCTAACAAGTCATTTGTTGATTTTAATTATAGCAATTAATTAAATTTCTAAAAACTTTTTGCAGTGGTCATTTGTTGATTTTGTAATGATTTTTTTGAGAGTAACTATATTTATCTTATCAAACAATTATTAACTATATTTTATAGAAATATCTTTAATAAACATATCCAAACAAACAAGATAAATAGCGAATCTGAATATAGTTTAATGGTTAAGACATCCATAATATTTTCTATAGATTGAATGTGAGAATTTTTGACTCCAAAATTGTAATGTTATATTTATATTAAAAAAAAAACAATTCAGATAAATAAAACATATATTTTAAAAGGAATTATCTAAAATTTAATTTATCAAATTACTTTGTATGAGCGTCGAGTGAAAAGAACCGAAGAAATTCCAGGAGAGACGCTGCACGGGACCGGACAACGACCGGAAGAATATGAATGTAAATTGACGAAAAAGCCCATAATATCTGCAACTTTGGTCGGTAAGACTGTAACGATTCCCTTGCGAAATCCTGTTGTCTTCCCCGTTGCTATCGATTGGAGAATTCCGATAACACAGCTTAAGAGAGAGAGAGAGAGCGAGAGAGAGAGCTCCGATTCTGAAAATGGCTTCCTTCGTCGACTAGGTTATCTTTATGAGCTCCAGGTTTTTCTTCTGGACAATTTCGTTTGATTTTCTCTGGTCCTGATTTTCCCTGTTATTGTCTTACTTCTCAATTCTTTTTGGGATACAATTGTTATGGGTGTGCTTTTAATTCATCTACCATCTTGTATATATTTCATTTCGTTGTAGTGATTCCGTTTGGTAGGACTTGATTGGTAACCATTGCAGTGGTGGATGCTGACTTTCTGTTTGTTTCGGTTGCCATGCTGTTTCGTCCATTCCGTTGTGCTAATTTGAAAACAAGTTTCTTCAGCTGATATATGCGCCAATTCGGTTATAGAACTTCTTTATACTTTATGGGAGCTGAAGGATTCAATTCTTC
GTTTTGAGCTTGTCCCTTCTCGACCATTGTTTCTGAATGTCTAAGATTTAATGTTTTGGAGGTAGCCTAGTATCACGATGTTAATTGACTTCTTCATCTAAGATAGTGAAAGTTCCATGAAATGGAGTGGTTCTAAAACCAAAATCATTCATTCAACAATGTTTGGAGATCCCACTATGATGTGAGTTGGTGCGTCGGCCTTTTAACTGTCATTTAGTTATTATTTATCAAGCCCCTGTTGTATAGGAGTATAAGGTTGAATCTCATCTCATGGTCTCACCACCATCGTACTGTTGCTCTCTTCCTGTCACTCTCACCATCACCGTGCTGTTGCTCTCTCTGTCTCTCTTAGAGTCAGCATTTGGTCTGTTCAGATTCTAACATTTGACTGTGCCCTGTTCACTGCTTAGGGCAACATGATGGACTCGGTTTCAGGTGGTGGAGGTCCTGCCATACTGCAGCTGCATAAGTGGAATCCTTCACAGCCTCAACTCAACCTCTCAGAGTATCGTGAAGCTTTTATATCTCCTGCAAGGCAAATATTATTATTGCATTCATACAAACATGAAGCGTTGCTTCTTCCTCTAAATACAGGTAAACCGCCTTTCCCCCTCTCTCTAGATTTCCTCGCTTGTTTTGATATTTAATTATACACACACAATAGAAAGGTACAAGGACAAAAATCCACATCTTAAAAGCATCATGCAAAGCATCAACTTTTGTGGTTACTTTTGGTATTATCTTACGAGTTTCTTGAAACCAAATATTACTGAGTTAGGCACTATCTGGTAGGATTAGGCAAGAGATGCATGTAAAATGGCCTAGACACCCATAGGTATTAAAAAAATTTCAAATATTATGTCACATCCACACAAAGAATTTGGAGACTTAGTATCCTTAAAAATAATGCTCTTAAAGATGCTAAAACTTTAATTCTATTGAATATCATGAGGTTAAAAACTTTTTGCAAATGTAGCTCCCCAACCAACAATCGACCTCGTTCTTGTTTTGCTTCTTCACATGATGTTTAATTATAAGTTATTTTTTTTGTTTGGATCAGGGGACGTCAGGTGTGGTAATGATCTCCCAAACGGATATGATATCAACTTAAAAGATTTGGGGTCGTTAGCTTTCTCAGAAGTAGTATCAACAGCACCTAGGTCAGAAGATGCAGAAGGCAATGTACGATGCTCTAACAAATCAGCTGTTGATATTGATAATGATTCTCCTACAGGAAACAAATCTTCAAGGTCTAGTTGTAACAACTTCCTTGGTGATGTAAGCTCACTTGCTTGGGGGCTTTGTGGAGATACCTATAAGAAGCGCAAAGATTCTTCTTTTAAGGAAATTTTATTTGTATCTGGAAATCATGGTGTCACTGCTCATGCTTTTTGTCAACCCAACAAGACCAATGAAGAGGCTAAAAATATGGTGCAGTCTGAGTTTTGGAAAGGAAGGTGGATGGAATGGGGACCCTATCCTACATTAGTTCAAAACTTGGAGGTCCAAGAACTTTCTGATTCTTGTGTAACCTCCGGAAATGTTGACAAAAACAGGATAAACCAGAATGGGGAAATTTTGCGAAGTTCTTGCTATGAGTTTGAGGATGATGCATTGTTGTTGGGAAATAGTGCACCTAAGAGATATTTACAATCTTTTCTTGCTAAGGTTAAGACTATTGAATATGAAGATGACATTTGGACGGTATACCCAAAAAAAACATCAGTTCCTTGCTTTGCAGAGGTGGTTTCATTTAACATTTTTAATTACAACCTGCCGCACCAACTTCCAGTGATGATTCTTATGTTAATGAACAGAGCTGGCATGAAATAATCCTTGATGCACCCAGTAATATAAGTCCTACTTCATCGGACACACATTTTCTATCTGACATTTTATCCAATGTATTTGGCGTTGGCATGAATGAATCATACAAATGTTCTAGAATATTTTCTAGCAACTCACATTTTTTAATTGGATTTGTTTTGAAGATAGTGGAT
TCAGTATCTGCTGATAGAGTTGATGAAACTGGAAGCAGAAATGATACCTTAATTCTTGTAGCTAGAGTTGGCAATTTGGGAATTAAGTGGGTTTCTTCTGTGAAATTTGAGAAAAGTCTATATATTTCACCGTTGATGGAGTGGGCAGATTTCTGCTTTTCAAATGATTTTCTTCTTTGTCTAAGCGACTCTGGTTTTATCTTTGTACACTCTGCTTTGTCTGGCAAGCATGTTACCTGTATAGATGTTTTACAGGCTTGTGGACTCAATCCTAAGTACTTACATTTGAAACAAGATTTGCAAATGAATCAAGTAGATCAAGTCCAGGATGATGTATCCTGTAGTCGTGATAGTTTTTATGACAGAAGAAAGTTTAGAAGGTTGTTATCTGATTCTCATTCCTCACATTTTGCCGTGATTGATGCATTTGGTATAATGTATGTCGTTTCTGCTGTTGACCATATGTTAGAGCACTATCATGGATCTGAAAATCTGTTTCCACATCCTCACAATTTTGAACTTGGGAGGGCTCCAGTTAGTTGGGAGGTTGGTGGTTATGACATAGGCTGCCAGAGGAACTATTCAGAGTCATTGGGGTCTCATTCATGTAGGGTTTTTTCCATGAAAAATGAAGGTGTTTCATTTTGGGGTAATACTAGATTTGATGTGCTTCAGAATACTCAGGACTCAAAGGTTTGTACGGGGAGAAAATATAAGTGCTCGTGTTTAACTGCTTCTGCTTCAATTTTACAAAATCAGAAGTTCCAGGGTGGTGAATTACAGTCTTGCACTATGCGAAAGATGTTTCTTTCCACTTGGAAAACTAATGAAGATGATTGCTTCTGCTTCTCTCCTATGGGACTTACTCAATTCATTAAAAGATGCAATATAAGTGGCCAAAAGTGCTCTCAAGTTGTCCATTTTGATCTGCATCTCAAGTCTGAAGTCCATGATGATAGCTGCTTAAAATCCCAAATGATTTTTGTTGATGGTAGGAAAGAAGAACTTGTTGGAGAAGCAGTTGGCTGCACTTCACAAGGATCTCTTTATTTGGTGACAAATAATGGTCTTTCCGTGGTTTTGCCTTCTGTTACCATTGCATCAGATTCTTTACCATCTGAGTCTGTTGCTAGATTACAACCTGGTGTTCTTCTTGGCACTCCTAATCAAGTAAAAGGTTTGGAACTGAAAGAATCTAATTGTTCATGGTCACCCTGGCAAGTTGAAGTTTTGGATAGGGTTCTTCTATATGAAAGCATAGATGAGGCAGATCGCTTGTGTTCTGAGAATGGTGAGAATAATGGACAATACATTCTGTACCTTATAGCAATATTTCAGTTGATTTAATGAAAATGCTCTTCTTGATTTTCTTTGCATTTAACTCTCTGATGAAGCTGTTTTTATTTTTCTAAAATGTCATCATATGGACATCCATTGGCATAACTCTAATAGTTGTGAATCTGACAATATGTCTGAAGACGATTAGAAGATACTGTATCATGCTCAGAATGTTTTCCACCCGTTAGTTAGAAATGTAAAATTTGAAGTACAGGCAACCTTAAGTTCTTTGGGTTTTGGGGCGAATTGTTCTTATGAAAAGTAGAATATGATTAAATGCCATAGCTTAAAATTTTAAGTCAAAATGCTCTCCTTTTATTAACGTGCCCTTCTCAAATATACCACCAAATCTGAAACTAATAGTTCTCGTTTTAGTGTATTGATGTATGATTGTGAGTAGGATGAATGACATGCACATGTGGATGAGTAGGTTTGTTATTGTATGTAAAAGAGTGGACCTCTTTCACTTTCTCGTTAGGTGGTGGTGATCAGAAAGGGGGAGCAAGGAGCAACATATATTTTAATTACGAAAAGTTAAAAAATACACTAGTGATCTTGTCTTAGATATTTTACAGTGTATATATATACAAGAAAACAAAACTTTTCATTGAAATTATGAAACATTACAAGCGAGATTACAAAGAAAATTACAAAACCAAATT
TCCTTAAGGAGTACAATAAATATCCCCCAAAAACTGAAAAGTAAAATCAGTACAACCAAACTCAAAATCAACATAATAAGAGACCCATCAAAAAAGAAAGTAAGAAAGAAATCCCAGAGCCACAGAAAGATAGATGGCATATATTAGCCTCCGAACAAGTCTTAGACCTTCTAATCAAGCCAAAAACTTGGAGTTATGACCATGTAACCGAAAGAACTCTTACATCTCTTAGACTCTAAACATCAGCAACTGTAAACCTTCCTAGTGTAAGCTTTTCCAACTCTTCTAATAATGGTTTTTCCTCATCTACAAAATGACTTTCTACAAAATTTGAAAAGGATTTAAGGCATCCTCATCTCTTGAAGCATCAATTTCTTTGTTTTCCATATCATTATGAACATTTATAACGAAGACTAATCTCTTTGGCTGTCCATAAGGAGATTACTATTAAAGTGGCTTATGACCCCATCTCTAGTGGCAGGAATATCTCTCTTTGCCTCCATAGTTTGTGGGATAGAGAAGTGAGTGATTGGTGATGTAACCAAGCATCAAAGGAGAAAATCTCCAAGAATCCTCACAGGAATATTTATTATGATTCAAAGTGTAAGCTAACGACAAGAGATTAACCCTCTATTTATAGGAGATTAATGGGTTAACCTACAACCTACTAACAAGGTAAATCTAACTAACTACAAACAAAAATAATTAACCATTCAATTGAAAATTTCTAAACAATTACTAATTATGCTACATCAATTGGCTTGTCTCTTTCAATTGGAAATGTCAGTAAAAGATGGCTTCTAGACAACTCAAGCTTTTATCCAGTCAAGTCTTTTTCTCCCATCTCATGAAGAGAGAAGTACACCGCATTCCTTCCATCCTCTTCTTGTAAGAAACTTTGGAATTCAACTCTCCCCAAAAAGTGTGAATTTTCATTTGGTTGCTGAGCCTTGGTGTTTGAATACAAATGATAAAATCGATAAAAGAAATCCTAGCTTCTACCTTAATCCAAATTGGGGTTGCCTTAGTAGAAAGGATTTGGAGACAGCTGGTCTCTCCTCGTCTATCCGTTCAAGTTTAGGCTTTGGACCCAAAGTTTGGACATCTTCCAAGTAAGGCGTAAGTCTAAAAATCAAGAGATTAATGTTGATGGAAAGAACCAAGTAAGGCGTTAGTCAATTCTTCATGTTTCAGTATGCTTAGTGTTCTGAAATGTGCGCTTTCTTAAAAAGCACCATGAGGCCGCCTCACATTTTTATATAAATAAATTTATTTAAAGAAATTAAGAAATTAAGAAATTAAGAAATATCCTCCAAAAACCATCATTTCTCTAAAAAATCTCCTACTAAAGCTCTAATTTCTTAAAATTTTTATTATCCTACTATTCACAATACAATATTTTTTTCCATGTTTAAAAAATTAGTTTACATTTTTCTATCGTGTGCCTAAAATTTTTTTTTTTCTTTTGTGCTTTATGCTTTAAGTCCTAGAGGGCTATTGCTCTTTAGTATGCCTCACGCGTTAGAAAACACTGACTATACTATGGATGCAAGCTACTTAACTTCCATCATCATTACAATTGGTTCTTTATTCCTACGAGGAACTCTGTAGATATGGCAATGTTAAAAAGATTAGTAAACTTGGAGGTGAAAAGAATTGTATCATCTGCAAATTAATGGTGACTAGTAAAGAAATTACTTTTGTTGACTTTGAAGCCTTCCACTAAGCCCAAATGGAGCCCTTACAAAAGGAGTCTATTGAAATCGTCTAAATAAGACTAAAAAAAAGTAGGAAGATGTAAATCCATATGTCGACAGTTTTGGAGCTGTTGTTTTGGCCTTGTCATTGGGTATGATGTTGTTGATTTTTCTATAATGGTTCTTTAGGGGTTTGGAGCTGTTTGTTAGTTTTGTTCTTCATGAGCTCTCATTGCAGTAGAGTTATGATATCACTTTGAGAAAAGGCTTGATTAATACTTCAATTGGTCATTGGTATTTTCC
AAGTTCCATATTTGAAAAGTATCCATATGTCGGCTTTGTTGTGGTTCTTCTAAGCTGGTTCAGATGGCATTTTCTAGTTGTTCTTTGGTTGGTTGTATTGTCACATCAGCTTCAAGTTTTGGGTTGTATTATTCTCTTATGGTCTTTAGTTGGGGTTTTAGGTTATCTTGTACTCACTAGATTTTGGATTGGGTTTTTTCTTTGAGTTTTGGACTCTCATGGTAACTTTTCATTCAATCAATTAAAAATTTGTTTCCTTAAAAAAAAAAAAGGAAGATGGGTTCCCCTTGTCTAATTCCTTTAGAAGCAATGATCTCCCCCCACTGGTTTTCCATAGGTGGCATGAGAGACAAGCATCAAACGAGCTCTCCATCTAGGGCCAAAGCCTTCGGCTTCAATAATAGCATCTAAGAAATTCCAATCAAATTTATCAAAAGCTTTTTCAAGGTCTAGTATGATGACACACTTTTCATTCTTTTTCTTCTTCAATCATCAATCAATTCATTGGCAAGATGTAGGAGTTAATGATTTGTCTTCCCTCTATAAAGCTGATTGAAACTTGTAATCGTGGGTCTAGAAGAACTTCTGAAGTCTCTTAGAAAGAACTTTGGTAAGGATCTTGTAAGGATAGTGGTAAGGCTGATGGGTCTAACTTAGAGTTTTTTTTTTTAGCACCACACTTTTCGAAAATAAGGCAAATGTAAGTCTTCTGTTAACACTTGCATTGATATTGGTATTCTAGGAAAAAATCCTTAGCTCAAGATTTTCCCACTTTTTTGAGGAACTCATTTATAAAATAATTTGGCCCAAGAGCCTACACAACCCCAAGCACCAAATAGCTTGGTAAATCTCAGCCTCAATGATTTACCTTCCAACAAGAGCATTTCATCCTCAGAAACAGGGCACAAATCAAGATTAGAGGGGATTTGGGGCTGTCATGAGAAAGCTTATGACAAGGTAGATTGGTCTTACCTAGATGTTATTCTTAACCTTAAAGGCTTTGGGCAAAGATGGAGGAAGTGGATTTGGGGCTGTATTTCTTCGGCAAATTTCTCCATTTTTGTCAACGAAAGACCTAGGGAAAAGATTCACACTAAAAGGGGCTTTAGACAAGGGCATCCTTTGTCTCCTTTTTTATTTACCTTATTGGGGGACTCGCTTAGCTGTCTAGTTCATTATTGGCTTATTGCTGTGAAAAAAGAGTGCTCAGTGGCCTTCAAGTGGGAGACCAGTCAGTTGAGGTTACGTATCTTCAATATGCGACGATACGCTCTTGTTTTGCCCTGGATCAACTGAGATGGTCAAGCAGTGGTGGTTCGTCCTGAACCTTGTTTTTCCCTGGATCAACCTAGATGGTTAGGCAGTGGTGGTCGTCCTGAACCTTTTTTTGCTTGGGTCTGGTCTAACTTTGAATTTCTTTAAACAGCCTTGATTGGGATAAATGTTAGTTCCTCTGAGGTTGAATGTATAGCTCTGTCCTTTGGCTGTAGGATGGAAGAGTTGCTTTTGCCTTATCTTGAATTTCCTTTTGGGGGGATATCACAGAGCAGTGTCTTAGTGTCTTTTGTGATCTGATTGTTGACAAATTCAAAGCTAAGTTGGACAAATGGAGAAGTCTTCTTGTATCTAAAGGGGGTAGACTTACCCTTGCTCAATCGGTCCTCACTAGCCTCCCTATTTATTACTTCTCCCTACTGACAGCGCCTCGCAAGATTATTAACTCCCTAGAAACGATGGTTAGAGATTTCATCTGGTATGATGGTATCTACTGTCTTGGTTGTAGTGGAGTTGGACGTCCCTCCCTATCTAGCATGGGGGGTGGGAGTCGGTTCCTTTCAACAAAAAAGTAAAGCCCTCCTCTTCAAGTGGTTGTGGAGATTTTCAGTTGAACAAGATGCCTTATGGAGAAAGGTGATTGCTGCCATTTATGGGTAGATTCTTAGGGGTGGACTATTAAGCCAGATGTGGGAAGTTCCAAAGGAAGACCCTGGGTTGACATA
GACAAGAATAGGGAAGACTTCCTCAAGTCTGCAGTCTTCAAAGCTGAAAATGGGCACAAAGTACGGTTTTGGGAAGACAGTTGGGCTGGTGACATCCCCTTTTCTACTCTTTTCTTTGATATTTATACGATTTCTATGAAGAAAGGTGCCGCGATTGCAGATTGCTGGAACATAACCACCCAACTTGGGACTTGGTGGTGCGAAGGGGAATTTTTTATAGGGAGTTGAGTAGTTGGATAGCTTTTACAGAGAAACTTAATTTTACGTGTAAATCCACCCTCCATCTTTCTTCCCCAAACAACAAATTAAAATCCCCGTTGGTGAATTTGATTTGGAAGTTCAAGGTCCCAAAAAGGTTAAATTCTTCCTTTGGTTGGTTTGTTATAGAACCATCAACATGGCAGACAAGCTTCAAAGGAAGTTTCCTAGGTGGACTCTCCCCCCCACAGTTTGTAGCCTCTGCTTTAAGAGTGAAGAATCATTAGATCATATCTTTCTTCATTGTCCCTTTGCAGTGAATGGCTGGAATCGGCTATTGAACGAGTTTGGTATCTTGATTAGACTCTCTAACCAAATAGATGACTGGATTTTGGAGTGCTTCGCCGGAGTGAGGTTTAGAAACAAGGCCAATGTTCTGTAGAACTGTGCAGTTAGAGCTTTTTTATGGCTTATTTGGAAAGAAAGAAATCAAAGGTTTTTTTTAGGATATTTATTCCATAGAGGATATTTTTGGAACAATGTACAACATACTTCTTCTTGGTGGTGTTCCAACCACAATAAATTCTTTTGTAATTATAGCCTCCTCATGATTATGTATGATTGGAAGGCCTTGATGGAGCTTTTCTTGGGAGTGGGTTTTCTCTACCCGAGCCCTTAGGCTGTTCCTTTGTTTTTTTGAATCTTAACATTCTCTGTTTCTTATAAAAAAAAATCAAGATTAGAGGGGAAAATCCTGTATCAATTCAAGAGAACTGTAGAAAGAAAAGAAAGCTTCTTCAATTTCACCACAATTCACTAACCTTTTACCTTTGGAGTTCACAATCTCTAAATTAAAGATCTCTCCTTCCGACTATTAGCCACATGGTGAAAGAGGGAGGTATTCTCATCTCCTTCACGAAGCCTTCTAGATTTACGTTTCTGACTCCCAACTCCAACAAATTTTTTCTTTGAGCGTGAAATCAAAAGGTTGAGAGTTAAGGTGAAGCTTGAATCCCCTTTGAGAACTAAAAAGGGGAAACGAATCCTCTAAACCATCCAATGCCTTAATCTCATAGATAAGATTGTTTCCAATGCGGTGACATTGCTAGAGTTCCAAATCCTCAAGGGAGGATTTTAAGCTGTAAGCTTGTCATAAAACCGTGGTTGGGCTGCCAACCTTACTATTGAGAGTAATTCTTGGTAGCCTCCAACATCTTCAAAGAGTTTCCATAGTCAAGTTACACAGTTTCAAATTTGATGGACCCCATTTAATACCTCCAGCCCCAAGAAAGATGGGGAAGTAATCATAAGTAGTTCTCTCTAGCATAGAAACCCTAGCATTGGCAAACTTGGAACACAAACTTGCATGCCATTCCCATAGGTTGGGCCAAGTATATTGGGCATTGTTGAGAGGGTGATCCACAAGAGAGGTCATTAATTAACTTATTGTACTTATCATATGAAGGAGGCTGGGAAAAGAAACCTAATCAAAACACCTTGCAGCTTTGGGAGTTCATTGGAGTAAAGATTTGACCCAGTCAAACACATTGAAGCTTCTTGGTGTGCTTTTTCCGAAGACTTTAGCAATTGCTCTTCTTTTGATATTTATTGATGTTGGAGTAATTTTTTATTTTAGATCGTTAGTTGGGTTTGTGCTATTACTTCATGTAACCTTGTCGATGTAGAATCATGTATCATTCATTCAATAAAAAATTCTGGAGCTTTTGTTTTTTGTTCTCCTGAAGGGAGTCATTATTGAAGTTTTCTGTGTAACTTAAATATAATGAGAAATTCTATTTTCC
CTTTGAAAAATATATGGAGGAGGCTGGCACCCCATTATTTCAGTGTGTATGTGGATTGTTGTGCTCAAGCATGTGGTTGGTCCTTCTGTGTTATAATTCTTAGGTTTAGGAAGTTAAGTTGGCTTTATTAAAGTATAAATTAGCTAACTTTTCTAGTGAGAACTTAAGTCCTTGTAATACTCAGTTCCCCTGGTCTTGGTTGTTCCATTTTTAAACTGTTCTCAGAATTTAATTAGCTGTCTGTGGATTCCGTTGTGGAGACCATGTATTCATTCTACAAGCAGTTATAAAGAGAATACAATTACAAAATATAATAACAGTCATATTTTGTCTTTTTTTTTTTTTTTTAAATTTGATATGTAATAAATGTGTTATATAGTTTTTAATAGCAATGTTAAGACATGTCAAATATTGGTTTCAAAGTTTATTATCCATTTCAGGATGGGACTTGAAAGTCGTGCGAATGCGTCGGTTTCAAATGACATTGCATTATTTGAGATTTGATGAACTAGAGCGGTAAGATCCAAAATTTCATGTTTCAATGTATTTTTGAATTAATAATTGAAACAGTGCTAGTGCTACTGTTATTACGTCTTGTAGATTCTTCAGAGGTACTCGATTGGATTGATAAATGAAGTTTGAAAATGGTTTCCCTAAATTATTTATTGCCAAATAAGGAAAAGGAACCAAGGCTGCAATACTAAAAGCATTCAGGATTGAAGATACAAGGATCCTGGAATTTGTACTACCTTTGATGTTTTACTAGTTATTGATTTTGCTGTCTTGCTGCCCATTATTATTATGGCTTGGGTTTGAACCCCAACAAGGAGAAATACTTTGACATATATCTGGCACATTTTTAATTTTCACAGTTCAGGCCACTATTTTCAATATCTTATAGGGATTTTATCATTTGCTGAAGTTTTATTGTAGAAGTCATTATAGTTTTCCTTTCAATTTATTGATGGACAGATCACTAGAAATGCTTGTGGATGTTGATTTGGAAGAAGTAGGAATTTTGAGACTGCTCTTTGCTGCCGTACATCTGATGTTTCAAAAAGCTGGTACTGATAATGATATTTCAGCCGCTTCAAGGTAGAAATCTGTATACATTTTTTTTTCTCTCATCCTCTTAAGTACTGTGGCCAACGTATTATTTTTCAAGCAATTTGTGCATTAAATCACAACATCTAATGAGAAATTGCTGAAATCGGCATATCCTCTGCCTGGAATGTGAAGCTTATGCATTCTGCACATACTACTTAGAGAGAGAGAAGTATCAGAAGGGAAAGTATTTTCAAGGGGTGGGGGGTTTAGGCTGAAAGTTGAAAGTGATCTCTTGGCCTCTAAAGTCTGGGAACTTGTAACTATGTTCTCTTGATTGTAATAAGACTTCAGTAAGTGTTTTGATTCCTACAAAATGGTATTAGAACATTTTTTGGGTATGTCAGGACGGGATGGATATTGGAATGGAAGTTGTGGAATCAAATTTGGGAGTGGTAGCAAAGGCGGTTCAAAAGGTCCTAAGGATGAAGCAAAACATTGTGGTGATGTTCGAGGGAGTGAGCCACTTGCTGCACAAATAAGAGACAGGGTAAATGAACAATGAACTGGTAGACGGGAATTTGGGAATAGAGAAGGTGCTTGTTGATCAGATATCTGATATCAAAGTAGGTAAGTCCAGTGCAAAAGAACGCAATAGAGCAACATGATGGAGGTGGGAGGACGAAGGCTGAAATTACCTACATTTGACAGGATTGATCCAAACGGATGGATTTTTCAAGCAATGTGCCGTTTTGAGATTAATTGTATATCAAAGGTGGAGAGAATAGAGGTGATTGTTGTTTGTTTGACTGGAGAGGCCTTGGCATGGTTTCATTGGATGGATGGATGAAGCCTTTATGCTCTTGGTGTGGGAATTCAAGTTGGCTGTAGCAAATAGGGTTCGTTTATCTGTGGATGCCTTGCTGTATGAATACCTATAGGCTCTGAAATAGGAG
GGCAGTTGAGGGACTACTAGAAGGTATTTAAATTCTCGCCGCACTGTGCTTTGTCCAATCTGTTTATAGGAGATTGTTGGAAGGAAATTTCCTTATTGGGTTGAAACCAAAAACAAGGACTAAGGTTCGGATGCTGAGGAGGCTTGTTGGGTTGGAAGACATTATGGAAGGGCCTAACATATGGATGTTCAAATCAGGTGGTGAAACCAATGTTTTCTGAAGCGCGAGGTGTTCTAAAGTGCAGTAGCCCTCTGAAGCTTAGTTGAAGGGCAAAAAAAAAGCGAGGATTTCTTTTCTTTTTTTTTTTTTTATTTTTTCTTAGAGGTGTACAATAGAAAAATGTAAATAAATATTTTTTATATGAAAAAAATAGTATTATGAATATTAAGACAATGAAAAAGAATTAATGGTGATTGTCCTAGTAGTCTAGAAACGGAGGCTCCACCTCTTGAGGGGTACTTTTTCTAATCCGTATCAACTAGAAAAGCCTAAATTTTTTTTTGGGGCAATAGTTAGTAAGCCTCGATCACCAAAGGTGGCTAACTAAGTTGTTAGGATATGACTTTGAGATTTAGTACTGCCTGGGTTTGGAGAATGATAAGATATCAATGAAAATCAAATTTAACTAATCCCGAAGAGCACAAAACGAAAGGGGAAAAGAGGCAAGTTCTAGGCAATTTGAGAAGTGTGGGTCTCTCCCAGAAAACACAGCAAAACACTTCATACCTTTTTTTGTGCTACCACACCTACTATAAAGCATTAACCGTATTACCCAAAGTGCCCCACAGCCCTCACGTACAAATTATACCTCCCCTTAGCTGTGTAACAAACTCTACTCCTCCTCGTGTACCTGCACGTGCTCATTTTTCATGCTTCTCCAATAAGTGTATTTGATCGGGGGTTTCTCAATTCTATCAATACCCCCTCCCCAACATTCACCTTGTCCTCAAGGTGGAAAGAAGGAAATTGTTGTTTGATCGAAGTAAAGAGTTCCCATGTAGCCTTGTTAGTCGATAATCCCTCCCACTGAATTAAAACATTAAAATCCCTTAAAGTAGGGTCCTTTGTGGGCGGGATGCTCAGTACAGCAGAAGGATATGGCTTTGTGTGGTCCTCGTGGCCTAGCTGATCCGGTAGTGGGCTGGTTCAAAATTGGCCCCTTTAGCTAGTTTCAGCTGTGACAAAGCAACTAATGACTTGTCTTGCTTCCCCTCCAACGTGACCCTTTTATTGTGGTTCTTGATGAGGATTATTTGGAAACTATCAAACAACAAGTGGAGGAAGACCCATTTCTATCACAAATTAAGAGGGTGGTTGCTAGAGGGAGGTCACTGCCCACATTTTTACGTGGAAGGAGAACTTGAAATGGAAGTTGATCTACAATAGGAATTTCCTTTTATTGCCACCATGCTTCAAGATTATCATTCCAGTTTGGTGGGTGCACTCTGGGGAGTTCAAGACCTATAAAAGAATTCTGGCTGACAATTATTGAGAAGGGATGTGTAAATTAGTATGGGGCTTTGTGGCTTATAGTGATACTTGCCAATGCAACAAGTATTTCATCTGATCGTGTGCAAGGTTGTTGCAACCCTTACCTATTCCTTAGTTACCATGGGATGATGTGATAAGGATTTCATTGAGGGCCTGCCCCAATTTAAAGGGAAGGATACAATTCTAGTGGTAGTGGACCACTTGACTAATTATGTGTGCTTCTTACATCTAAAACATCCTTCAATTGTTTTTTCAGTGGTTTTTTGGCAATCACCAAGGAAACTTCCGAACTCCATGGAGTACTAAAGTCCATTGTTTCAGTAAGACAAGGTCTTTTAGAGCACATTGTGGAGAGAAATCTTTAAGCTTTAAGGACGATCCTTAAATGTAGGTGAACATATCGCCTAAAGACTGACAGGCAATAAAAAGCGGTTAACTGTGCCTTGAGATCTATCTTTGTTGTTTTGCCTTCTCCTAGACCCAACACTTGGGCTAAATGGTTATCTTGG
GCAGAATACTAGTATAACACTTATTTTTTTTTGTTTTTAAGGTAACACCTTTCCAAATATTGTATGGTAGACCCACCTCCATCAGTTCGCTATGGGCAGGCTACTTCTTCAATCTCTATGGTGAATCAATTCTTGCAAGACTGTAATGCCTTCTTCGATGATCTTAAACTTTAGCTAATTCGAGCCCAATTGATTGTGAACATACTGCTGATAAGCATCAAATGGATGTGAAATTTCAAATTGGGGATTTGGTTTTTATAGAAGTACAACCCTCTAGGTAGCAGTCGTTGGTTGTGAGAAAGAGTAAAAGGCTTGCACTTGCTCCACAATTTTATGGTCATTTTCCGGTTATCAAGCATATTGGCACAGCTGCTTGTCAATTGGAGCTGCCAGTTACTGCTACTATTCATTTGGTATTCCATGTGTCTCAGTTGCGGTTGGCCCAGGTGGCAGGCTGTAACCTTCAACCCTGTCCTTTACAAATTACAACCACACTTGAGCTCTGAGCTCGAACTCAACATCCATCCAGCTGCTTTTTTGGGTGTATGGCCATCTAATCCACCATCTTCAGAAGATTATGAAGTCCTTATTTAGTGGGATGTTTAGCAAGTTGGGAGTCATTTACTCGGATAAATCAACATTCTCCTATGCTCCACATGAGGGTGCAGTGAAAGTTTGGTTGGAGGTTTCGATAGAATTGCTAAGACTCTAATATGTTTCACTTATATGCTTTGAGGAAGGTGGGAAGGCTCGCCAGGGGAATCTTTCAGTTTGTGTTGTTATAGTTATGTAAGTAGGTAGTGTATTGAGTGGGTGGTAGGGAGGTCAGTTGTAACAGAGACAAGAATAGAGTCTATTAGGGAGAGCGAAGTATTAGTAGGGAAAGTTTAAGAGGGAGGGGGACTAGACTGTTAGTTGGAAATGAACTCGAGTGATCTCTTGGAGGATAGGGAGAGCACCAGGCCTCTCAAAAGTCTGGGAACGTGTAACTATTTTCTCTTGCTTGCAACAAGACTCCAGTAAATGTTTTGATTCCCATCACAGCTTGCTATTGCATTTGAAGTTAGTTGCATTTTGTGGTTTGCAAATTAACTTGATTATATTATCCTAAATAGATGAACGTTCATGTCTGTTTTGTGCCTTCACACACTTGTTATATTCTAATGAAATTTCTACTTTTTCGAGTTTTATCTTTATTGTGTATTCTTTTCATTGTTTTTATCTAGTGTAAATCATATTTTCTATTTTGTATAAGCTAATATTAGTAGGTCTACTCAATGAAATGCTTGATGATCCATGGTTGTTGTGTTTTCCCATCAAATGGGGGATGCTAAAGATTGATTGTATGCATGGTTGTAAGTTGTAACTTAGTTTAATATCCCAATTTGGGTGTAAAACTGAATTTATTTGTTCTATCATTTCCTCAATTGTCCTTTTATGCTGTTTAAAACCATTTACTTTATAAGGGTTCTTGTCTGATCAAAATTTGAAGTTGTTTTAAAAATGCTCAACATTATATGTACTGACACATACACAACTACGTAAGCATTTTAATTTTATGTTGCAGTAATCATCATCCTACAAACTTTTTAAGTTATACAATCTCTTTAAAGAAAGAAAAATAAAAATACTTCCGAACTTACTTGAAATGAATTTCATTCATAGGCTTCTAGCACTGGGCACGCGCTTTGCAACTAGAATGACTCATCGATATGGGATGGCCGAGTTCAAGAGAAATGCTACTATGTTTAATGACTTTAGTAGCAGCCAAGAAATTTCCATTCTCCCTCATTTTCCATTTCGAAAGCAAAACGAGTTGGAGTATTCAAGAAAACTTCATGAGATGTCTCACTTTTTGGAGATAATAAGAAATCTGCATTGCCATCTTAGTTCAAAATTTAAGAGGCCATGTCAGGAATTGGTATGTCTATATTGGTATCTATGTGAATTTATTCATTGTAGTAGTAATAATTTATTTGAACATTGTATTTTATAGGTAG
CTGGGGAGGCACCAATTTCGGATGAAAACAATCTATTGCTGGATGAACCTCAGCTTGTTTCTACAGATATAATACCATTGGGGAGTACAAGTCAATATGAACTTTCATTTCCTTCAAATGATTTGAGCTCTACCGTTGTAGATGGTCTTGTTATGATGCCCATGGTTTCTGAATCCCAGTTGGATTCAGAAGATTTAAATGGAGACTCTGCTGTTGTACCACAAGGAGTCTTGGAAAAGAAAGTTGTTCCATTGGAGAATCCCAAGCAGATGATTGCACGTTGGAAGTCAGATAAGTTGCCACTTAAAAATGTTGTTAAAGACGCTCTTCTCTCTGGCCGTCTTCCTTTGGCTGTTCTTCAACTACACATTAATCATTTAAGAGAATTAATTGGCGAGAAAGAACCTCATGATACATTTTCTGAAATTCGTGACATTGGAAGAGCTATTGCTTATGATCTCTTCTTAAAGGTAACGGGTCTATTTTTTCTGCGGATCGTGTATCCATGTCTTCTGAACAGCATGTTTATGTGTTGTACAGGGTGAGATTGGGCTTGCCATTGCTACGCTGCAGAGGCTTGGAGATGACATTGAAGTTAGCCTCAAACAATTGTTGTATGGCACAATTAACAGATCTTTTCGAGTGGAAATTGCTGCGGAGATGAAAAAATATGGTTATCTGGGGCCATTTGACCAGAGGATGATGGATAGAATAGTACATATTGAGGTATCAATAACTTCAGTTGCATCCATTTTGAGTCTATTCAAGTTAAAATCACATTCTTTTTATCCATTTGAAACTAAAATATGCTCTTTTCTGCACTTCTTTATAAATGGGGGAACGGATTAGAGGCTCTACCCAAGCAGTAATTTCTGGAAAACATTTCTGAGCAGGCAGAGAGCAAATATGGGATCCCCATCAAGTTCTGCCACACCTGGAGAAAATGATTTGAGGACATTACGTTTCCATTTAATCAACAATACTATCATTGATTGTGGTGAGGTTGATGGTGTCGTTTTAGGTTCGTGGCCCAATGCCAATGAGAACTCTTTTGTTCTGGAGATCACTGAAGATAATGCTCATGTGGGATATTGGGCTGCCGCTGCCATTTGGACAAACACATGGGATCAACGAACAACTGATCGTGTAAGCATGTTTATTTTCTCTGGTCCAAGTACCTGTTTAATGTGGACCTGATTCCTTTGAAACAATTAATCTGCATTAATATTTACTTGCTTGGAAAGCTAGTGCAAGAATTAATGGATATGCTGATCGCGTGTATTGGTGCCTGTCTTTCTTTCCTGTGACTAATGCCCTAATATAATATTGTTTGCACATGATTCAGCTCAGAGTTATTGGGTGAATAATTCTGGGAGGTTAAGTCTCCTGGATATGCTTTGTGCGAGCTTCATTTTTTTTCATGTATCAAACGTATCAATGGAAAGACTTAATGGATTACATGGTTGCTTCCAAACATATTCCATGTAGTTACTTCTTTAGGGATAGGAAACTCAATTTAAAAAAAAGGGAAAGGAAGATAATATATCCACTCTAATCTACAGCCGAAAGAGATTACAAAATCTCTCCAAATTTTATCAAACAAAAGGCAGACTAATGAGAAAGGGTTGTTGATGTAGTATTGAATGATATTTACATGGTTGATATTTACATCTGGAGCATAGTATGGGTATAAGTCTAAAACATTGAATATAGGGCTGATGTTTAATGTTGGTGGTAACTCAACAGTATAGCATTGGAGTCAATCTTCTGCAAGATATGGCAGGGCCCAATCTTCTTTGGTTTCGTTTGTTGTATATGTATTCTATTATTGGAAGTTCTGGCTTTCTTACATGCACCATGACAAGATCACTAATGCTAAACTCTTTTCCTCTTCGCTTTTTTATCTGCTGAAGCTTTGTATTCTGAATTAGCATTTCAAGTGTTCTGTCACATGAATTTGGTGAACACACTAAGCCATATCAATAGCGTCATTACTTGAG
GTAATGGCAATAGGTATGTTGCAAGGTCAAGAGCGAGTTGTGGAAGCATCATATAAACCACTTTAAAAGGAGATTTCCTTATTGAACGATTTTCTATGTGATTAGATGCAAATTCTGCTTGGGCCAGAAATAAATTCCCTTGTCTAGGCTTTTGTCCACTTATATAACATATAAGATTCACCAAGGTGCAGTTTGTAACTTCCAGTTGTTATTGGCTTGGGGGTGGCTAGTAGTGTTGGACCGTAAGTTGGTATCAAACATCTCTCACAATCTTCCAAATGTAGCTCAAATGTTTAGCTAGGGTCTCCATGCAATTGTACAATTTTCCTAAAAAATGACTACATGTATGGCATCCAAAGTCTTTCTACTAGGAAGAAAATGTCTCATTTTGTTGTAACGATCCACCATGACAAGTACCGAGTCATTGATAAACAACCATGAATAAAAATTTCATTGGCAATCCTCTTAAATAGAAATTGGAATATGCAATGGAGAGTATAAGCCCACTATTGTGGGAAGTACCTATTGCTGTTTGACATGTATTTGGCAAAGCGTGTGATAGGTGTTTATTTAGATTTCACTCCCAGTTGGAGATTTTCATTACATAAGATAATACAAAAGATTTTCTGGCCAATTCGATGGCAGAATCTTCTGCTGCTTTGACCCCTTCCAAAACAACCTAGTCTTATCTAGTTACAGTTCTCACACTCTACGTAACCACCTATGGAGGTTACAATCCCAAATAACTTAACTTCTGGAAGTTACATGAGTTGCATTCCTCTTTTTACCCCTCCTAGAACAAACTTTTACAATTGGTGGTCTAACAGACAGCTAATCTAAAATCTATCACTAGTAAATGAAGACATCAAAGGACAGCAAAAGAAAGAGAAAAGAACAATAGAAAGTTGAAGATTTTCACCCAAGAACTTGCCATAGAGACTCCCTTGCGTCCCCCTTTATCCTAAGATCCAAAAGTTGTGTAAAAGAAAGAACCCTTAAGTGCACCTAGAGCTTCGTTTTAGTGCCTAATGTTGATAGTGGAGTGCCTAGGTGATGAAGCTAATAGAAATTGGAGAAGTTCTTGTCATAGCACAGGCTCTAATGCCAGTGCCTGGGTCTGGTATTCTTTTTAATTTTTACTTCTTCAACTGCAAATCCACGATTTTGACTTTCAGAATGATATTTCTTTATTGACCTACAAAAATAAATGAAAACTCATAATAAGATTATAAAATCCTCAAAGGATAGTTAAAGATCTAAGTAATAGAGATGCAACTAGTGAATATGGAACTCCCCTACCAGTTTCTGAGAGTATTTTTCCCTTCTTCCTGCCATTCTTTCTGTCCCCAGAGCAAAAACATCATGATACCAATTTGATGTAATATCATTGATTCATTACCTTCGGTTCTGAAAAGGATTAGACTGATAAAAAAAGAGTTGCGAAATGGCAGCTTGACAATTACAGTAGTTAGAAATAACTAACATCTGATCGAATTACCTGGTCCAATTAGTTAGTTAACAACTAACAATATACAACTAACTGCTGTTAGCTTTTGTATAAATACGTCAGTTGTTGGGAAAGAACTTCATGAGGCAAGAAAACAAGCTACTCATAATAATTCAATAATGTTTTTATCAATACCCTAGTTGTCCTATTTTCTCCAACCAGATCTTCCAAAGTAAAGCTTTGATAGCGTTAATTTGTGAGATCTGATTTTGAATGAAGAACGATGTGGTATATGGTTTGTACTGTATTATGTAGGTCCATATTACATATCTTTGATAAAAATTCTATTATGATTCAGGCAAAAAAAAATTTCATTCCAGATATTATTTCTGTTCTTTGCTTCATTGCTTCTAAATTATGTAGTGCATCTCCTTTGTCCACTTTTTTGTTTTTGTTGTCCAGTTGTGTTTGATGGGGGATATTAATTGGTCAATACATCCTTCTTTGTACAGATACTACTTGATCGATCTTTGGGTATTGGTATCCCTGTGGC
GTGGGAATCTCAACTTGATTACCACATATGCCATAATAACTGGGATGAAGTATCAAGACTTCTTGATATGATTCCTGTTTCTAATTTGCTAGATGGAAGCCTCCAAGTAAGCTTAGATGGTCTACAGTCAGCTTCAGCAGTTGGGTGCAATCGAGAGTCTTCTTTTTACAGCAATTACTTATACCCTCTTGAAGAATTAGATGCTGTTTGCTTGTATATTCCCAAAGCCAAAATTTTCAGGTTCTCAGCTAATATTATGTGCTCCAAATGGTTGGGTATGCTCTTGGAGGAGAAGCTTGCAAGACAGTTTATATTTCTGAAGGAATACTGGGAAGGCACAATGGAGTTGGTACCTCTTCTTGCACGTTCTGGCTTCATTACAAACAGACTTGATGAGATTGCTTCCGTGGATGATCACATCAGCAGTTCAGTTGATCAAAGATCCACAAACAATGGTGGAGCATTTTATGTTGATTCTGTGCAAGCATTATATAAAGTTTTCATACATCACTGTTCACAGTATAACTTGCCATTTCTTCTCGACCTTTATCTGGACCATCACAAATTGGTTGTTGATAATAATTCAGTTCGTTCACTACTGGAAGCTGCAGTAAGTACCAACACATTCAGTCAGTGTGACTTATTATATATCCTGATTATATTTAATAATCATCATTTATCTTTCTTCTGTTGCTTCTTACGTATCTTAAGTGTCAGTTTAAGGGATTAGTTAGTAGCAACTTAGCTGTGGGTTGAAGGGATTTTGTTTGGGTTAGCAATTATAAGTAGAAGAGGGCCAGGGAGACAAATGAGGGAAGTTCTATTGGGATTTATATTGATCTTGGCTCAAGCTAGGCAAGCTCAATATTACTTTATTGTATTTTTCATTTACTTTATTTTATATTTTCCCAACAAGTTACTCTTTCTGCTTTAGAGGCTTCTGTTTTGAGTTCTTTTTGACAAAAGAGCTTGGAGTCTGGGTAATTGACAAAAGTAGCCCAAAATAAAGTAAAGAATAGACTTTCTACCTATTGTCATAAAACAAGAAGAATGACCTTCTGTTGATATCACAAATAGGTAATTATTTTAAAATTCTATATTTGTAATTGTTTTTATAAAAATACGCATTTGCAAATATATTAGAAAATGAGGCTACCGACTCAAATTTCCCTTCATTTTAGCCTAGTGCTTAGTCTTGCCAAGATATGATAGGAGTTCAATTTCTGCTTACACAAAGCTGGAGTGCTGGCATTTGAACCGTTTTGGTGGGACCAACACTTTTCTTATATTCTTGTACAACTTCTACACAACCCATTTAATTTATTTTACATTTGTAGTTAGGTATGGTGAGTTATGTTAGGTTACTTGTGAGACAATGTGCCTCATTCACAGAGGCAAACTTATTACAATTTTCCCATGACCTTAATATAAATAAACATTCCTTTGACTCTTCGAGTATCTCTTTGATATTGCCTAATCATGCACTTAAAAAATTAGTTTATATGCTCATCTTTGCTCTCTATTCTTGTGGTGTGCTGGTGTAGGGAGATTGTCAATGGGCAAGATGGTTACTTCTGTCGAGGATCAGGGGCTGTGAATATGATGCATCATTTTCTAACGCTCGCTCAATAATGTCACTGAATTTAGTTCATGATCCTAACCTTGGTGTTCGGGATATTGATGAGATTATTTGTACTGTTGGTGACATTGCTGAAGGAGGAGGAGAAATGGCAGCCCTAGCAACTCTGATGTATGCTCCTTCCCCAATACAAGATTGTTTGAGTAGCAGTGGTGTGAACAGACATAGTAGCTCGTCAGCCCAATGTACTCTTGAAAACCTCAGGCCAGCCCTGCAACGATTCCCTACATTGTGCCGTGCTCTAGTTACATCGGCTTTCCAGCAAGATACAACTTGCAATTTTTTGGGTCCAAAATTGAAGAATGGTTGGCTTCTCCTCCTCCCTTAAGAACTTCTCATTCTGAAGCTTTATACTGCCAGC
TTTATTATCATTTTTGGAATCTCTCTCTCTCTCTCTCTCTATATATATATATATATGTAAAAGAATTAGGGGGAATCCTTTTTCAGAGATAAAAGTTTTTTGGACTTCTCTAATCACCCATTAGAATTTGTGATGTAATTGATTGTAATCAGTTTACAATTCATTATTTCTGTATTTCAGCATTGTCAGAATATCTACATTGGCGTAGCAGCATCTTTTTTTCTGCTGGACGTGACACTTCACTTCTACATATGCTGCCATGCTGGTTTCCCAAGGCAGTTAGGAGATTGCTTCAGCTCTATGTCCAGGTGTCCTTCCAATCCATGCATTTTGTTTTCATTCTGAATTAGATGATTCATTGAATTAACTGACAATAGATTAGAAATGGATCCAATTTTTTAAATTTTTACTGGAGACTTGGAATGGATGGACATGCTGTAGGACCCGTTTTACTAATGGAGTTTAATGCTTCCATACACTATAACTTCCTATCCACTGTAACCAGTTGGAGATCCTTCTTATAGCTCCACACTATTCAAAATAAGAAGCATAAGTTCACTTGCTATCCACACTATCCACAGAAAGGTTGTACAAGCGTTGAACCATAGAATCTTCTTATGCTGATTGAATATTCCTTTCTATCAACACTATCCACCCAAGTGAACCATAGAATCGTCTTTTATTTGAATTCAATCATTAGCCGTTTTTCATCGGGCTAGCTCCTCTCCTATACACTTGGGATTAGCCAGTCGTTGGAGTTAACAATATGGTCATAACAAATATAAATAGTGTTACAGTTCGGCTAGCATCCCACATCAGAATTACCTAAGTTTTAGGGATGTTACTTTTAAAAGTCATACTGCTCATATTTTTGTATAAGCTTTATGTTTGTGCAAATTCTATTCCAGGGTCCTCTTGGATGGCAATCACTCTCAGCTTTGCCAACGGGGCAGACGTTATGGGAGAGGGATGTTCATTTTTTTATGAATGATTATGAACATTCTGAAATCAGTCCAATCTCTTGGGAAGCAACCATACAGAAGCACATAGAAGATGAGTTATATGATTCTTCTCTTAAGGTATGTCGAGCTTTATGATATATGTATATTTTTTGCATTCTCACATCAGCTACTTCAGTAAATGGCTTATTTTTCCCTGTTTGATGAATCCATCTGCATGTCCGTAGCTATTCTCTGAACTTGTACTCGATTGGGTTAACCTTATATGGTTCTTATTCTCTAAAGTTCCTTAGTTTTCCATTCATTATCTACTCGCTATTGTGCATAGAAAAAAGAGGTCTTTTATTGATGGAAACATTTACAATACCTTAAGAAAGTGTTCCCTTATCTCCCATGGCTATATGATAAATTTTGAATATGTTGTAAAGAAAGTACATTCCTAATTCACAACTGCACTGCATAGTATGGTCAGGCAGTTGTGAATTAGAAATCTACTTGCTTTACAATATATTTCAAACTTATCATATAGCCAGGGTTGATAAGGGATCACTTTCATAAGGTATTGTTTCCTGCTGGTGATTTAAATAGATTAGCATTATTGGCTGTTTATATAAAGATAACATATATGTTGTGTTATCTGGCAGTTGGAAATTTTATCTGCTAAATTTATTACATTAATGTTCTGTTTTTATCCAAAAACGTTACTAGTGATTGTAAAAAGTTCGAGAGAGTCTGAGGGTATAACACCTTTCTAAAATTTTCTGAACATTTAACTATTATCAATAAAGGGTGAAACAGATTTAGATGCTTGATTTTCGATTTCTTTTCTGTTTACATAAAGCATTTGTCTCAGATAGAGACTCTCAGTCTTTAAATTTGTGCATGATTATTAATAATTAAAAATCAATTGATGATGTTTATGGGTGCAAATCTCTTGATAAAGCATTATATATTGCATTGCAGGTGCCATGATGCTTTTACATTGACCAAAGAAATAAACTGCCTAGCTCATTTGTCAAAATCATACTAAATGTCTTAGTT
ATCTTGACAGATCCTCAATCTCGCCAATCTTGTTTATTATTATTTCTCTTTTAAGTTTTACTATCTCTAGTAATATATGGTGCTACATTTTGTCATGCGGAGTCTGCACTATTTCATTACTCTGTCATAATTGGTTGTCCGTTAATCTGTACTCTCAGCTCAGCATTTCTATTGTCTTTAGGAAACTGGAGTTGGGCTTGAGCACAATTTGCATCGTGGACGTGCATTTTCAGCTTTTAACCATCTTCTTGCTGCTAGAGTTCAGAAACTAAAATCAGAGATTCAGCCGGGTTCAGCAACTGGACCATCAAATACACAGTTCGATCTACAGGCACTTTTTGCTCCTCTGACATTAAGGGAACAGTCTCTTCTTTCTTCTGTAAGGGTTTATTTATTTATTATTATTTCCTTTATAGGTAATTAACACAATAACAAAGCACTAATAGGATAAGTTAAATGTCAAAACTACAAAAAAGAACATCATGGAGGACATAAAAAAGCAAAGAAATGTCAAAATCTAAAGATATGAAATTGTTGGATCAACAAATCAGTGTAATATTTAATTTTTGCCAACTTTTTCCTCAAATATTCTCATTTTCCTTTTCAATTATTAAATAACCCAACAATTATGAAAATTATACATTCCAGAAAATTCCTCTTTTTTCGGCGTCCTTCCATAAAAAAAACGTTCAACTTAATGATTAGTAAAATCTTTATAATATGTTGCACACTGAATGATAATTTAAGCTGCAACTTGTGAAAGTTGTGAAAAGGAAACAAATGGTTCACCGATTCTCTACTCATGCAACAAATGACTCACTTGGGGGGGAAGAGAGAACTAAATGGGTTCTCTTTGTTCTGAATGGCACTTCTACGCAGGTATCTTGGCTGTAGGAGCACTCTTGCTCAATTTGAGCACCTTGTGCACCAAGCCTTACTGAGCTTCATTTTCATTTATTTAATAATGATCCAACATCATTTTTCTTCATTTTCTTTGAAAACCATCTTACTCTAAGCTATTTCTTTCGCTCCTTTCTTTTCTTTTCTTTTTTATTAAACAAGTGACTCTATCCATGATTTCTTTTGTCTTTTTGTAGCCCTCCCAAAAGTTCCCGTATCATTGAGAGTTATTCTAAAATCAAGGATCTATTCTCCCTGGGATAAAGCAAAGATTGATAACTTATTCAACGTCCATAGAATTTTTGAGCTTGCTTTAGGTCTGTATATGAACCAATCAAAATCTATGCTGGTTTGAATCATATGTTCCTAGGGGACCAGAAGGATTTAACCAGCAGATTTTCAATCTCCTTGGGAAGTTGGCCTTTAAATTATTGGGTCTTTCTCTTGGCGGTAACCCTAGATCCATTCATTTTTGGACATTGATGGTTGCCAAAACTGCCAAAAAGCTTACCCACCTCTGACAACTTTAGGATCCTTACTGTTATTCTAAAATTCCTATTTTTTTTTAATTATACTATTAATTTTTTTCTTGAAAAATGGTATTTTATTATTTTTGTTTAATTTCAAATCATTATTACTGATTGGTCCATACATTGTTCAATCTTCTAGTAATATGAGTTATTCTGGTTAACTTGTATTTTTCAATTAATTATGCAGAGTTTAACTGTTTTACTTGAGCGTGTAAAAATTTCTACCAACTTGATTTTCAGATTATTCCACTTGCCATTACACATTTTGAGAACTCTGTGTTAGTTGCTTCATGTGCTTTTCTCCTGGAGCTTTGTGGATTATCTGCCAGTATGCTCCGTGTAGATGTAGCAGCTTTAAGACGAATATCTACCTTTAACAAGTCTGGGCAATCCTTTGAGAATTTCAGGCAACTTTCACCTAAGGGCTCTGCTTTTCATCCAGTACCCCTAGAATCTGATAAAGTAGAGACTCTTGCTCGAGCTCTGGCTGATGAGTATCTGCACCAGGAAAGTTCAAGTGTTAATAAGCCAAAGGGCACTTCTAATTCAGCACCTTCAAAACGTTGTCCA
CAGGTGCTTTTCGTATTACAGCATTTGGAAGAGGTCAGTCTTCCCCAAGTGGTCGATGGAAATTCATGTGGATCATGGCTATTAAGTGGTAAAGGCGATGGGACTGAGCTAAGAAATCAACAAAAAGCTGCAAGCCATTACTGGAACTTAGTTACAGTCTTTTGTCGGATGCATAGGCTCCCTCCAAGTTCAAAGTATCTTGCTTTGTTAGCAAGAGACAATGACTGGGTAATCTCCTTCGGTCTATTTTTAATACTGAACTAATTTTATCAACTGTGATGATTCACATTCTGATTTTATATCTCCTGGTGCTATTATTATTTGAAAGGTTGGATTTTTAACTGAGGCTCACGTTGGCGGGTACCCTTTTGACACAGTTATCCAAGTAGTAAGTCCTGTACATTACTTAATTTGCGGTTACTTACCCAAATCTCTTATCCTTTTGTTACATGTTTTCTTCCTTTTTTCTTTGTTACTTTAGGCATCAAAAGAGTTCAGTGATCCACGTCTCAAAATCCATATATTAACTGTATTGAAGGCTGTACAGTCAAGGAAAAACCCTGGCCCTTCATCATACTCTGACACTGAAGATAAAAAAAGTCAAACTTCCTTTTTGGATGGAAGTACGTATATTCCAGTTGAGCTCTTTACAATTTTAGCCGAATGTGAGAAGAAGAAAAACCCTGGAAAAGCTCTCTTGATAAAGGCAGAGGAGTTATCCTGGTCTATTTTGGCAATGATTGCTTCTTGTTTCCCAGATGTGTCTCCATTATCCTGTCTTACTGTTTGGCTAGAAATTACTGCAGCAAGGTAACGATCAATTATTCCTTTTAAAAATGAAACATATAGATAGTTATTTCCAAACTTGAATGTAGACCTTTAATTCTAAGTAGTTATTTCATTGATTTTATGCAAGTTTTATAATTATATATATTCAGTTCTTATTGTCGGACTGAAGATCTTTTCTGATTCTGTGTTATATACAGGGAAACTACATCCATTAAGGTAAATGATATTGCTTCCCAGATTGCAGAAAATGTTGGGGCAGCTGTAGAAGCTACCAATACCTTGCCAGCTGGGTGTAGATCACCTGCATTTCATTACTGCCGGAAAAATCCCAAACGAAGGCGAACCATGGATTCCATTTCTAAAGATCCATCAGTTGGAGTGATCTCTGATACTTTCAGTGCTTCAACAGGTGCATCAACTAATGTTTCAGGTGGCTTTATTGTCAAGGAAGAAGGAAAGATAGTTCAGGAACGTCGACCTATTTCTGTTTCATATGATTCAGATGAAGCACCATCATCTCTGTCCAAGATGGTTTCTGTGCTTTGTGAACAGAAATTATTCTTGCCTCTGTTAAGGGCTTTTGAAATGTTCCTTCCTTCGTGTTCCCTGCTACCGTTCATCCGCGCTCTTCAGGTTTTTGAGCTTACCCATAATTGAAGCTTCTCATCTCATATTATCCCCACTCCTTTCTCTTCATGGTAAATTTTTTTACTGGTTTCAATATTCTGTGGACAACATGGGTAAGATTTAAGAGATTTTGTTCAATCGTAACAGCAATGTGATTGGTTGAATGTTTTTAATTTCAGTGGCATGGCTTTGAATTTATTTTAACCGAATCATCTTCTGATGTAAAATTAATTGTTCATTAGGCGTTTTCACAAATGCGTTTATCTGAAGCTTCAGCCCATTTAGGTTCTTTTTCTGTACGAGTTAAGGATGAAGCTAGCTTTTCTCATGCAAATGTTGAGGGAGAAGAACACACTGGGACATCATGGACCGGGTCCACTGCTGTTAAGGCTGCTAATGCTGTACTGTCTGTTTGCCCATCTCCATATGAAAGAAAATGTCTACTGAAACTGCTAGCCGCAACTGATTTTGGTGATGGAGGATTTGCTGCCGCTTATTATCAACGGCTTTATTGGAAAATCAATTTAGCAGAGCCTTCAATACGTATAGATGATGGCCTGCACCTTGGAAATGAGGCTCTGGAC
GATGCATCACTTTTAACAGCGCTAGAAAATAATGGACATTGGGAGCAAGCACGCAATTGGGCAAAGCAACTGGAAGCTAGTGGGGGTTCTTGGAAATCTGCTAGTCATCATGTCACAGAAACTCAGGTATTTTTGCATAAAGTTCCTGGTTGACTTTTGCACTTCTCAGTTGCGAATTATTCTTCTTGCAGGTAGTTTAAATAAATAGATTATTTCTAATTTCTATATATTCTTATTACAACACTATATATTTGAAATCAGCCTACAGTTTTTGGTTTGTATTAATTGGGGTAACATTTTTTTTAAGCATTCCTAAGACTTGGATATATTTTTGCTAAGACGAGAATGAAACCTTATTTGGTATCGCTGCTTCAATACCTGGAAGTACCTTTGGTGCTGTAAGCTGTACTGGATGCTATGATTGGTATTAGATGTATTCTTTGCAGAGAAAACTTGAAAAGAACACTTTTCTTCTTATTAGATGCTGTGAGTGATTGTAGATTTAAGTGAAGTTCACCATTATGTTTAAGTTTGTGTTGTGTCGATCTCACTGCATTTGGTATTCCTTTTAAACGTCCCAAATTCATAGTCACTTATACAATTTCCTTGAGTCTTTTATATTTCTGTTGTGAAGGAAGACTCAAGTTAAAAAATTGCCGGTAATTACTGGCTTAATGAAGTAGTGGAACATGCTGAACTATTTTTTATGTTTTGTTCATGAAACTAACCAGTTAATGTTTGGAAGATTTAAAATATTTGTTGTCTTCATGTTATTATCCCAAAAATAGGGTAAAAAGGAAAAAGAAAGTAAGATTTTGAGGTTTTCGGATTACTGTTGGGTTAATGGTTCTCTCCTTGCTTAGGCTGAATCTATGGTGGCAGAATGGAAGGAATTTTTATGGGATGTTCAAGAAGAGAGAGTTGCATTGTGGGGTCACTGCCAGGCACTCTTCATTAGATATTCCTTTCCTGCTTTACAGGTAAATTACAATGATATTTGATGTGGAATTGAAAATGTATCATCTGCTTAATCTTTGGATGAAAAATGCTTACCTTGTGAATAATCTAGACATAATTGCTGTGTTATGTTTGTAGGCTGGATTATTTTTCCTTAAACATGCAGAAGCTGTGGAGAAAGATCTTCCAGCCAAGGAGCTTCATGAACTATTATTACTTTCCTTGCAATGGTTAAGTGGGATGTTTACCATGTCTTATCCGTAAGAGATTTTTCTTAAATTTTAACTTATGCTTTTACTTTGCATTTAAAAATCATTGTCACTGACTTGTGTGATCCAATCTATTAGAGTTTATCCATTGCATCTTCTACGAGAAATTGAGACCAAGGTTTGGCTCCTGGCAGTAGAGTCAGAAGCTGAGCTGAAGAATGAACGGGATTTGAACATTAACAACTCCAGCCGGGAATGTATATCTAGGAATAGCTCAAGTATTATCGACTGGACTGCAAGTATAATATCAAAAATGGATAAACATATTAGTACAATGAAGAATAAAAGTATGGATAAACATGAGGTGAGAGAAAACAGCCAGACTCATCATAAAAGTCACGTATTAGATGCTGGCCTTTCAACTGCAGGAGGGGGGAATACAAAGGCTAAAAGGAGGACCAAAGGTTCCGTGCTAATTCGACGGCCATTAGTGGACTCTACAGACATGAACACTAACCCTGAAGATGGATGTGTTCCTTCCAATTTTAAAAATGACTTGCACTTGCAAGATGAGAACTTAAAAATGGATACATCATTATCGGGGTGGGAAGAAAGAATTGGACCTGCAGAGGTGGATAGAGCTGTTCTTTCATTATTAGAGTTTGGACAAATTACGGCTGCCAAGCAGCTTCAACAAAAGCTGTCTCCTGGGCAAGTACCTTCAGAATTCCTTCTTGTGGATGCTGCTTTTACGCTTGCAGCTATATCAACCCCTAATCGTGAAGTTTCCATGTCCATGCTTGATGAGGATTTATGTTCAGTTATTCTTGCATA
TGATATTCCGGTTGATCAGTATCTCAATCCGTTGCAGGTTTTTGTCAATCTCTCTCTCTCTTTCTTGCCTTCTTTACATTAAGCTTAGTAGCAATTGTTAAAAGAAAAATATCTTCTCGTTTCCACTAGTCTCTTTCAATTTTCTGAATTTGTTACCATCCAATGATAACTCGTGTTTTTATTTTGCTTGATGTGCATAATCCGTGTGTTCACCTCTATAGGTTTTGGAGATTTTAGCAACAATATTTGCCGAAGGAGGTGGACGTGGACTTTGTAGAAGAGTGATTGCAGTTGTAAAAGCTGCAAATGTCTTGGGACTTCCATTTTCAGAGGCATATAACAAACAGCCAATTGAACTATTACAGCTGCTCTCTCTCAAGGCACAAGAGTCATTTGAGGAGGCAAATTTCCTTGTGCAGACTCACTCTATGCCTGCTGCTAGTATTGCTCAAATTCTTGCAGAATCCTTCCTAAAGGTTAATTTTCTATCAAGTGCTAGTAGATCTCCAGAATTGTGAATTTAGATTTCAGTTAAACTATCTCCGACGTTTAATTATTGAACCATTTTGGTTTCAGTCTCTATTTGATGGCTTGAATTTAAACATATTGTGTATGAATTGGATAACGTTCCTTAGTTTGAATAAATCCATGTGATAAGAATCAGTAATTAATCTAAACTGACCTCTTATTATTTTGACAAAATCAGGGCTTGTTGGCTGCACATCGTGGAGGTTATATGGATTCCCAGAAAGATGAAGGACCTGCTCCTCTACTATGGAGATTCTCTGACTTCTTGAAGTGGTCAGAACTTTGTCCTTCTGAACCAGAGATTGGGCATGCGTTAATGCGTTTAGTTATTACTGGACAAGAGATACCACATGCCTGTGAGGTACTACTGTGACTATGATATATATTTTGCTATTAGGTGGCCAAAGATGCACATTACGGAAGAGAATAGGTCTGAGAGTGCAAATAGAAAAAGAAGAAGAAAATATTGAAATTTCCTTTTGGAGATATGTATTGTTAGTTATGAGTAATATTGTGGATATACTGTGAATCGTAATCATTATTTTATTATAGACTGCTGTTAGGGATAAAAGATTTGTCTTCTTAGTGTGATTCCTTGCGGTAATATTGTTTTTTTTTTTTTAATGAAAAGTTGACCTTTCATTGAGAAAAATGAAAAAAATACAAAGGGGCATCAAAAAAACCAGCCCTGCGAAAAGAGCCAAACCCAAACTCTACAAAAAGGGGCTCCAATATAACAAGATAAGACCTAAAGGATAATTACAACAAATTTTAGAAACTGAAGCCCACAGGGAGGCGTGGAACTTCGCAAGATTCCACACCTCCTCACTAGAAAGCTTAATCCCCCTAAAAATTCTGTTATTTCTCTCCAACCACAAGCCCAACAAGATAGCCAAAGAGGCTGCATGCCACAAGAAACGATCCCGATCCCTAAAATGCGAATGAAGAAGGATTTCCTCCACCATAGAACTACATCCTCTATGATGTGCCAAGCAAACCCTAAAAGTCGAGAAGCACGAGGTCCACACTGAACAAGCAAACTCACATCTCCAAAATAGATGATCGATGTCCTCAACTGCTCTTTTACAAATAATACAACATTGCGGCCCTGGAAGAAAGGATGAGTGCCCCATAACACGATCGAGGGTATTAACTCTGCCGTGTAAAACCTGCCAAACAAAAAATTTGACCTTGGGAATTTTAACCTTCCATAACGAGGAAAAGGTAGGGGACAAGGAGGAAGCGGGATCCAAAAGCTGTAAGAAAAAGGACTTTCAAGAAAAATCCTTGGAAGGATTAGGATCCTATAGACGCAAATCCCTTCTATCCAGATTGGATAACAAAGTTTTCCAACAAGGATAAAAGGGTCGTAATATCCGTTGTCTCCCTATCGGTTGAAGGACGACGAAAACCAAAAGAAGGGGATAAAGAGGACGTGTCATCGGGAGAGGATAAAACATAAGCCACTGAAC
TGAACCTCAAAGAAGTAAGAGTGTACAAATGAGGAAACAGATAACAGGGAGGTTTATCCCCCAACCACTTGTCTTCCCAAATATAAGTATCCATCCCACTATCCACCACACACCACAAACTGAGAAAATGGGGGAGGCCAGAGGAGATAGCCACTCACGGGTTTCAAAATGTGTCTTTCAAACTCTGACTCGAGACCCAATCGAGAGTATGGGGACCATATTTGCTTACAATAACCCTGTGCCACAAAGCCTTAGACACTAAAGAGAACCACCACAACCATTTGGCCAAAAGAGCCTCATTGCGAAGTCTCACATTACCAATACCAAAAGAACCGCACAATCCAAAGGCTTAGAGACCACCCCCCACTTGATAAGATGTCTCCCTCCCCCCTCTTCGATCCCTTCCCAAAGAAAGTCTCTCAAACACAATAGGAATATTATTAGCTTTTATTTATATTATTAAACACAAAAGGAATTGTGCAAGTTTTAGTCAGTTCAGTGTGATATGAAAATGCTGTTCTTTTTCACTCCATCTATAGCAAAATGTCCACATCTTTCAATGTTTATCAAGATGGTTTAAGAGTGATGTTGTTCCATTTAATGAGTTAACTTTTTCTTATTAAAAATTTTATGACTCAACTTTTTAACTTTTCAAAAAATGTTCTGTTTTAGGTGGAGCTTTTAATTTTGTCTCACCACTTCTACAAATCATCGGCTTGCCTTGATGGGGTGGATGTTCTCGTGGCTCTTGCTGCCACAAGAGTTGAGGCTTATGTAGCTGAGGGTGATTTTCCATGTTTAGCTCGCCTGATAACTGGAGTTGGAAACTTCTATGCCCTTAGTTTTATTCTTGGCATTCTTATAGAGAACGGTCAGCTAGAACTTCTTCTTCAAAAGTTCTCAGCTGCTGCAGATACAAGTGCAGGGAGTGCTGAGGCTGTCAGGGGATTTCGCATGGCTGTTCTCACATCCCTCAAGCATTTTAACCCCACTGATCTTGATGCATTTGCTAAGGTTTGTAGAAATGTTCACAATGTCCAAAGGTGTCATATTCCTGACAATACGATAAATTCATAGGATGTCATATTTAATGAGATCTTCTGTGGCTTGGTAGGAACTCAGTTATATCACACTGTGTAGCACACGCTTGCAACTTATCAAAGACAACAACAATTAGGTTGATGTGAATATGTTATGGTTTTGAACAGTTAGCTATTTTGTCAGTTATTTCATCTTACTAGGCCAACATCAGCCTTTGTTTTGTCCTCTTCATGTTTTATATTATGAATTTTCAAATCTATATTCTAATCATAAATCTACTGAGATATTTTTGTTAACTCCGTATGCTATATTTTTGGTTTATCTACCTTCTGGATCTTAGTTATTTTTGTAATTTATTTGGATAGTTATAAGTATCAGAAATATTGGCATTTTAAAGATTTGGAGTACTATAGAATAAGATTTAAGATTTGAGATTAAATGACAAAAGGTTTCTTTTCATATTGTCTAACCTTAGATTTACTATACAAATTATTTCTGTGAACTATGTGGTCCTTTTTTCTCTTATTTCTCCATTCTCTTCTTCTAAGATATGCATGATCTGATTGGACTATTGTACTAATTTAGTAATTTTCATCTCGCTGAATGTTAGGCTGCGGAAAATTTCTGAAGTACTTGGTCTGATCATGATGTATACTTAGATGCAGGTTAACCTGAGTGAAATGTTGATGGTGTTTGATTTGGTGATTTACATCCTGTATACAGGTCTACAGCCATTTTGACATGAAACATGAAACAGCTGCTCTTCTGGAGTCACAAGCGGAGCAGTCGTGTGAGATGTGGTTCCGCCGCTATTACAAGGACCAGAATGCAGACCTTTTAGATGCAATGCATTACTACATCGCAGCTGCTGAAGTTCATTCTTCCATTGATGCTGGCAACAAAACCCGCAGATCCTGTGCACAGGCTTCTCTAGTGTCCCTTCAGATTAGGATGCCCGA
CTTTAAGTGGCTCTTTCAGTCGGAAACCAACGCCAGAAGAGCTCTTGTCGAGCAATCAAGATTCCAAGAGGCACTAATTGTTGCTGAAGCTTATGATCTTGACCAGCCAAGCGAGTGGGCTTTAGTCATTTGGAATCAGATGCTTAAACCAGAGATTCTAGAAGAATTTGTGGCTGAATTTGTGAGTGTGCTTCCACTCCATCCTTCAATGTTAGCTGACATTGCAAGATTTTATAGGTCTGAAGTGGCTGCCCGCGGGGACCAGTCCCAATTCTCCGTCTGGCTAACTGGGGGAGGGTTGCCTGCAGAGTGGGCAAAATATTTGGGAAGATCATTTAGGTGCTTGTTGAAAAGGACTCGAGATTTGAGGCTCCGTTTGCAACTAGCTCAAGTTGCCACTGGTTTCGTGGATGTCATGGATGCTTGCACAAAAGCACTTGATAAGGTACCCGAGAATGCCGGGCCTCTTGTGCTTAGGAAAGGGCATGGTGGTACATATCTTCCACTGATGTGA

mRNA sequence

ATGATAGTTATTAAGCGTCGAGTGAAAAGAACCGAAGAAATTCCAGGAGAGACGCTGCACGGGACCGGACAACGACCGGAAGAATATGAATGTAAATTGACGAAAAAGCCCATAATATCTGCAACTTTGGTCGGTAAGACTGTAACGATTCCCTTGCGAAATCCTGTTGTCTTCCCCGTTGCTATCGATTGGAGAATTCCGATAACACAGCTTAAGAGAGAGAGAGAGAGCGAGAGAGAGAGCTCCGATTCTGAAAATGGCTTCCTTCGTCGACTAGGTTATCTTTATGAGCTCCAGGGCAACATGATGGACTCGGTTTCAGGTGGTGGAGGTCCTGCCATACTGCAGCTGCATAAGTGGAATCCTTCACAGCCTCAACTCAACCTCTCAGAGTATCGTGAAGCTTTTATATCTCCTGCAAGGCAAATATTATTATTGCATTCATACAAACATGAAGCGTTGCTTCTTCCTCTAAATACAGGGGACGTCAGGTGTGGTAATGATCTCCCAAACGGATATGATATCAACTTAAAAGATTTGGGGTCGTTAGCTTTCTCAGAAGTAGTATCAACAGCACCTAGGTCAGAAGATGCAGAAGGCAATGTACGATGCTCTAACAAATCAGCTGTTGATATTGATAATGATTCTCCTACAGGAAACAAATCTTCAAGGTCTAGTTGTAACAACTTCCTTGGTGATGTAAGCTCACTTGCTTGGGGGCTTTGTGGAGATACCTATAAGAAGCGCAAAGATTCTTCTTTTAAGGAAATTTTATTTGTATCTGGAAATCATGGTGTCACTGCTCATGCTTTTTGTCAACCCAACAAGACCAATGAAGAGGCTAAAAATATGGTGCAGTCTGAGTTTTGGAAAGGAAGGTGGATGGAATGGGGACCCTATCCTACATTAGTTCAAAACTTGGAGGTCCAAGAACTTTCTGATTCTTGTGTAACCTCCGGAAATGTTGACAAAAACAGGATAAACCAGAATGGGGAAATTTTGCGAAGTTCTTGCTATGAGTTTGAGGATGATGCATTGTTGTTGGGAAATAGTGCACCTAAGAGATATTTACAATCTTTTCTTGCTAAGGTTAAGACTATTGAATATGAAGATGACATTTGGACGATAGTGGATTCAGTATCTGCTGATAGAGTTGATGAAACTGGAAGCAGAAATGATACCTTAATTCTTGTAGCTAGAGTTGGCAATTTGGGAATTAAGTGGGTTTCTTCTGTGAAATTTGAGAAAAGTCTATATATTTCACCGTTGATGGAGTGGGCAGATTTCTGCTTTTCAAATGATTTTCTTCTTTGTCTAAGCGACTCTGGTTTTATCTTTGTACACTCTGCTTTGTCTGGCAAGCATGTTACCTGTATAGATGTTTTACAGGCTTGTGGACTCAATCCTAAGTACTTACATTTGAAACAAGATTTGCAAATGAATCAAGTAGATCAAGTCCAGGATGATGTATCCTGTAGTCGTGATAGTTTTTATGACAGAAGAAAGTTTAGAAGGTTGTTATCTGATTCTCATTCCTCACATTTTGCCGTGATTGATGCATTTGGTATAATGTATGTCGTTTCTGCTGTTGACCATATGTTAGAGCACTATCATGGATCTGAAAATCTGTTTCCACATCCTCACAATTTTGAACTTGGGAGGGCTCCAGTTAGTTGGGAGGTTGGTGGTTATGACATAGGCTGCCAGAGGAACTATTCAGAGTCATTGGGGTCTCATTCATGTAGGGTTTTTTCCATGAAAAATGAAGGTGTTTCATTTTGGGGTAATACTAGATTTGATGTGCTTCAGAATACTCAGGACTCAAAGGTTTGTACGGGGAGAAAATATAAGTGCTCGTGTTTAACTGCTTCTGCTTCAATTTTACAAAATCAGAAGTTCCAGGGTGGTGAATTACAGTCTTGCACTATGCGAAAGATGTTTCTTTCCACTTGGAAAACTAATGAAGATGATTGCTTCTGCTTCTCTCCTATGGGACTTAC
TCAATTCATTAAAAGATGCAATATAAGTGGCCAAAAGTGCTCTCAAGTTGTCCATTTTGATCTGCATCTCAAGTCTGAAGTCCATGATGATAGCTGCTTAAAATCCCAAATGATTTTTGTTGATGGTAGGAAAGAAGAACTTGTTGGAGAAGCAGTTGGCTGCACTTCACAAGGATCTCTTTATTTGGTGACAAATAATGGTCTTTCCGTGGTTTTGCCTTCTGTTACCATTGCATCAGATTCTTTACCATCTGAGTCTGTTGCTAGATTACAACCTGGTGTTCTTCTTGGCACTCCTAATCAAGTAAAAGGTTTGGAACTGAAAGAATCTAATTGTTCATGGTCACCCTGGCAAGTTGAAGTTTTGGATAGGGTTCTTCTATATGAAAGCATAGATGAGGCAGATCGCTTGTGTTCTGAGAATGGATGGGACTTGAAAGTCGTGCGAATGCGTCGGTTTCAAATGACATTGCATTATTTGAGATTTGATGAACTAGAGCGATCACTAGAAATGCTTGTGGATGTTGATTTGGAAGAAGTAGGAATTTTGAGACTGCTCTTTGCTGCCGTACATCTGATGTTTCAAAAAGCTGGTACTGATAATGATATTTCAGCCGCTTCAAGGCTTCTAGCACTGGGCACGCGCTTTGCAACTAGAATGACTCATCGATATGGGATGGCCGAGTTCAAGAGAAATGCTACTATGTTTAATGACTTTAGTAGCAGCCAAGAAATTTCCATTCTCCCTCATTTTCCATTTCGAAAGCAAAACGAGTTGGAGTATTCAAGAAAACTTCATGAGATGTCTCACTTTTTGGAGATAATAAGAAATCTGCATTGCCATCTTAGTTCAAAATTTAAGAGGCCATGTCAGGAATTGGTAGCTGGGGAGGCACCAATTTCGGATGAAAACAATCTATTGCTGGATGAACCTCAGCTTGTTTCTACAGATATAATACCATTGGGGAGTACAAGTCAATATGAACTTTCATTTCCTTCAAATGATTTGAGCTCTACCGTTGTAGATGGTCTTGTTATGATGCCCATGGTTTCTGAATCCCAGTTGGATTCAGAAGATTTAAATGGAGACTCTGCTGTTGTACCACAAGGAGTCTTGGAAAAGAAAGTTGTTCCATTGGAGAATCCCAAGCAGATGATTGCACGTTGGAAGTCAGATAAGTTGCCACTTAAAAATGTTGTTAAAGACGCTCTTCTCTCTGGCCGTCTTCCTTTGGCTGTTCTTCAACTACACATTAATCATTTAAGAGAATTAATTGGCGAGAAAGAACCTCATGATACATTTTCTGAAATTCGTGACATTGGAAGAGCTATTGCTTATGATCTCTTCTTAAAGGGTGAGATTGGGCTTGCCATTGCTACGCTGCAGAGGCTTGGAGATGACATTGAAGTTAGCCTCAAACAATTGTTGTATGGCACAATTAACAGATCTTTTCGAGTGGAAATTGCTGCGGAGATGAAAAAATATGGTTATCTGGGGCCATTTGACCAGAGGATGATGGATAGAATAGTACATATTGAGAGGCTCTACCCAAGCAGTAATTTCTGGAAAACATTTCTGAGCAGGCAGAGAGCAAATATGGGATCCCCATCAAGTTCTGCCACACCTGGAGAAAATGATTTGAGGACATTACGTTTCCATTTAATCAACAATACTATCATTGATTGTGGTGAGGTTGATGGTGTCGTTTTAGGTTCGTGGCCCAATGCCAATGAGAACTCTTTTGTTCTGGAGATCACTGAAGATAATGCTCATGTGGGATATTGGGCTGCCGCTGCCATTTGGACAAACACATGGGATCAACGAACAACTGATCGTATACTACTTGATCGATCTTTGGGTATTGGTATCCCTGTGGCGTGGGAATCTCAACTTGATTACCACATATGCCATAATAACTGGGATGAAGTATCAAGACTTCTTGATATGATTCCTGTTTCTAATTTGCTAGATGGAAGCCTCCAAGTAAGCTTAGATGGTC
TACAGTCAGCTTCAGCAGTTGGGTGCAATCGAGAGTCTTCTTTTTACAGCAATTACTTATACCCTCTTGAAGAATTAGATGCTGTTTGCTTGTATATTCCCAAAGCCAAAATTTTCAGGTTCTCAGCTAATATTATGTGCTCCAAATGGTTGGGTATGCTCTTGGAGGAGAAGCTTGCAAGACAGTTTATATTTCTGAAGGAATACTGGGAAGGCACAATGGAGTTGGTACCTCTTCTTGCACGTTCTGGCTTCATTACAAACAGACTTGATGAGATTGCTTCCGTGGATGATCACATCAGCAGTTCAGTTGATCAAAGATCCACAAACAATGGTGGAGCATTTTATGTTGATTCTGTGCAAGCATTATATAAAGTTTTCATACATCACTGTTCACAGTATAACTTGCCATTTCTTCTCGACCTTTATCTGGACCATCACAAATTGGTTGTTGATAATAATTCAGTTCGTTCACTACTGGAAGCTGCAGGAGATTGTCAATGGGCAAGATGGTTACTTCTGTCGAGGATCAGGGGCTGTGAATATGATGCATCATTTTCTAACGCTCGCTCAATAATGTCACTGAATTTAGTTCATGATCCTAACCTTGGTGTTCGGGATATTGATGAGATTATTTGTACTGTTGGTGACATTGCTGAAGGAGGAGGAGAAATGGCAGCCCTAGCAACTCTGATGTATGCTCCTTCCCCAATACAAGATTGTTTGAGTAGCAGTGGTGTGAACAGACATAGTAGCTCGTCAGCCCAATGTACTCTTGAAAACCTCAGGCCAGCCCTGCAACGATTCCCTACATTGTGCCGTGCTCTAGTTACATCGGCTTTCCAGCAAGATACAACTTGCAATTTTTTGGGTCCAAAATTGAAGAATGCATTGTCAGAATATCTACATTGGCGTAGCAGCATCTTTTTTTCTGCTGGACGTGACACTTCACTTCTACATATGCTGCCATGCTGGTTTCCCAAGGCAGTTAGGAGATTGCTTCAGCTCTATGTCCAGGGTCCTCTTGGATGGCAATCACTCTCAGCTTTGCCAACGGGGCAGACGTTATGGGAGAGGGATGTTCATTTTTTTATGAATGATTATGAACATTCTGAAATCAGTCCAATCTCTTGGGAAGCAACCATACAGAAGCACATAGAAGATGAGTTATATGATTCTTCTCTTAAGGAAACTGGAGTTGGGCTTGAGCACAATTTGCATCGTGGACGTGCATTTTCAGCTTTTAACCATCTTCTTGCTGCTAGAGTTCAGAAACTAAAATCAGAGATTCAGCCGGGTTCAGCAACTGGACCATCAAATACACAGTTCGATCTACAGGCACTTTTTGCTCCTCTGACATTAAGGGAACAGTCTCTTCTTTCTTCTATTATTCCACTTGCCATTACACATTTTGAGAACTCTGTGTTAGTTGCTTCATGTGCTTTTCTCCTGGAGCTTTGTGGATTATCTGCCAGTATGCTCCGTGTAGATGTAGCAGCTTTAAGACGAATATCTACCTTTAACAAGTCTGGGCAATCCTTTGAGAATTTCAGGCAACTTTCACCTAAGGGCTCTGCTTTTCATCCAGTACCCCTAGAATCTGATAAAGTAGAGACTCTTGCTCGAGCTCTGGCTGATGAGTATCTGCACCAGGAAAGTTCAAGTGTTAATAAGCCAAAGGGCACTTCTAATTCAGCACCTTCAAAACGTTGTCCACAGGTGCTTTTCGTATTACAGCATTTGGAAGAGGTCAGTCTTCCCCAAGTGGTCGATGGAAATTCATGTGGATCATGGCTATTAAGTGGTAAAGGCGATGGGACTGAGCTAAGAAATCAACAAAAAGCTGCAAGCCATTACTGGAACTTAGTTACAGTCTTTTGTCGGATGCATAGGCTCCCTCCAAGTTCAAAGTATCTTGCTTTGTTAGCAAGAGACAATGACTGGGTTGGATTTTTAACTGAGGCTCACGTTGGCGGGTACCCTTTTGACACAGTTATCCAA
GTAGCATCAAAAGAGTTCAGTGATCCACGTCTCAAAATCCATATATTAACTGTATTGAAGGCTGTACAGTCAAGGAAAAACCCTGGCCCTTCATCATACTCTGACACTGAAGATAAAAAAAGTCAAACTTCCTTTTTGGATGGAAGTACGTATATTCCAGTTGAGCTCTTTACAATTTTAGCCGAATGTGAGAAGAAGAAAAACCCTGGAAAAGCTCTCTTGATAAAGGCAGAGGAGTTATCCTGGTCTATTTTGGCAATGATTGCTTCTTGTTTCCCAGATGTGTCTCCATTATCCTGTCTTACTGTTTGGCTAGAAATTACTGCAGCAAGGGAAACTACATCCATTAAGGTAAATGATATTGCTTCCCAGATTGCAGAAAATGTTGGGGCAGCTGTAGAAGCTACCAATACCTTGCCAGCTGGGTGTAGATCACCTGCATTTCATTACTGCCGGAAAAATCCCAAACGAAGGCGAACCATGGATTCCATTTCTAAAGATCCATCAGTTGGAGTGATCTCTGATACTTTCAGTGCTTCAACAGGTGCATCAACTAATGTTTCAGGTGGCTTTATTGTCAAGGAAGAAGGAAAGATAGTTCAGGAACGTCGACCTATTTCTGTTTCATATGATTCAGATGAAGCACCATCATCTCTGTCCAAGATGGTTTCTGTGCTTTGTGAACAGAAATTATTCTTGCCTCTGTTAAGGGCTTTTGAAATGTTCCTTCCTTCGTGTTCCCTGCTACCGTTCATCCGCGCTCTTCAGGCGTTTTCACAAATGCGTTTATCTGAAGCTTCAGCCCATTTAGGTTCTTTTTCTGTACGAGTTAAGGATGAAGCTAGCTTTTCTCATGCAAATGTTGAGGGAGAAGAACACACTGGGACATCATGGACCGGGTCCACTGCTGTTAAGGCTGCTAATGCTGTACTGTCTGTTTGCCCATCTCCATATGAAAGAAAATGTCTACTGAAACTGCTAGCCGCAACTGATTTTGGTGATGGAGGATTTGCTGCCGCTTATTATCAACGGCTTTATTGGAAAATCAATTTAGCAGAGCCTTCAATACGTATAGATGATGGCCTGCACCTTGGAAATGAGGCTCTGGACGATGCATCACTTTTAACAGCGCTAGAAAATAATGGACATTGGGAGCAAGCACGCAATTGGGCAAAGCAACTGGAAGCTAGTGGGGGTTCTTGGAAATCTGCTAGTCATCATGTCACAGAAACTCAGGCTGAATCTATGGTGGCAGAATGGAAGGAATTTTTATGGGATGTTCAAGAAGAGAGAGTTGCATTGTGGGGTCACTGCCAGGCACTCTTCATTAGATATTCCTTTCCTGCTTTACAGGCTGGATTATTTTTCCTTAAACATGCAGAAGCTGTGGAGAAAGATCTTCCAGCCAAGGAGCTTCATGAACTATTATTACTTTCCTTGCAATGGTTAAGTGGGATGTTTACCATGTCTTATCCAGTTTATCCATTGCATCTTCTACGAGAAATTGAGACCAAGGTTTGGCTCCTGGCAGTAGAGTCAGAAGCTGAGCTGAAGAATGAACGGGATTTGAACATTAACAACTCCAGCCGGGAATGTATATCTAGGAATAGCTCAAGTATTATCGACTGGACTGCAAGTATAATATCAAAAATGGATAAACATATTAGTACAATGAAGAATAAAAGTATGGATAAACATGAGGTGAGAGAAAACAGCCAGACTCATCATAAAAGTCACGTATTAGATGCTGGCCTTTCAACTGCAGGAGGGGGGAATACAAAGGCTAAAAGGAGGACCAAAGGTTCCGTGCTAATTCGACGGCCATTAGTGGACTCTACAGACATGAACACTAACCCTGAAGATGGATGTGTTCCTTCCAATTTTAAAAATGACTTGCACTTGCAAGATGAGAACTTAAAAATGGATACATCATTATCGGGGTGGGAAGAAAGAATTGGACCTGCAGAGGTGGATAGAGCTGTTCTTTCATTATTAGAGTT
TGGACAAATTACGGCTGCCAAGCAGCTTCAACAAAAGCTGTCTCCTGGGCAAGTACCTTCAGAATTCCTTCTTGTGGATGCTGCTTTTACGCTTGCAGCTATATCAACCCCTAATCGTGAAGTTTCCATGTCCATGCTTGATGAGGATTTATGTTCAGTTATTCTTGCATATGATATTCCGGTTGATCAGTATCTCAATCCGTTGCAGGTTTTGGAGATTTTAGCAACAATATTTGCCGAAGGAGGTGGACGTGGACTTTGTAGAAGAGTGATTGCAGTTGTAAAAGCTGCAAATGTCTTGGGACTTCCATTTTCAGAGGCATATAACAAACAGCCAATTGAACTATTACAGCTGCTCTCTCTCAAGGCACAAGAGTCATTTGAGGAGGCAAATTTCCTTGTGCAGACTCACTCTATGCCTGCTGCTAGTATTGCTCAAATTCTTGCAGAATCCTTCCTAAAGGGCTTGTTGGCTGCACATCGTGGAGGTTATATGGATTCCCAGAAAGATGAAGGACCTGCTCCTCTACTATGGAGATTCTCTGACTTCTTGAAGTGGTCAGAACTTTGTCCTTCTGAACCAGAGATTGGGCATGCGTTAATGCGTTTAGTTATTACTGGACAAGAGATACCACATGCCTGTGAGGTGGAGCTTTTAATTTTGTCTCACCACTTCTACAAATCATCGGCTTGCCTTGATGGGGTGGATGTTCTCGTGGCTCTTGCTGCCACAAGAGTTGAGGCTTATGTAGCTGAGGGTGATTTTCCATGTTTAGCTCGCCTGATAACTGGAGTTGGAAACTTCTATGCCCTTAGTTTTATTCTTGGCATTCTTATAGAGAACGGTCAGCTAGAACTTCTTCTTCAAAAGTTCTCAGCTGCTGCAGATACAAGTGCAGGGAGTGCTGAGGCTGTCAGGGGATTTCGCATGGCTGTTCTCACATCCCTCAAGCATTTTAACCCCACTGATCTTGATGCATTTGCTAAGGTCTACAGCCATTTTGACATGAAACATGAAACAGCTGCTCTTCTGGAGTCACAAGCGGAGCAGTCGTGTGAGATGTGGTTCCGCCGCTATTACAAGGACCAGAATGCAGACCTTTTAGATGCAATGCATTACTACATCGCAGCTGCTGAAGTTCATTCTTCCATTGATGCTGGCAACAAAACCCGCAGATCCTGTGCACAGGCTTCTCTAGTGTCCCTTCAGATTAGGATGCCCGACTTTAAGTGGCTCTTTCAGTCGGAAACCAACGCCAGAAGAGCTCTTGTCGAGCAATCAAGATTCCAAGAGGCACTAATTGTTGCTGAAGCTTATGATCTTGACCAGCCAAGCGAGTGGGCTTTAGTCATTTGGAATCAGATGCTTAAACCAGAGATTCTAGAAGAATTTGTGGCTGAATTTGTGAGTGTGCTTCCACTCCATCCTTCAATGTTAGCTGACATTGCAAGATTTTATAGGTCTGAAGTGGCTGCCCGCGGGGACCAGTCCCAATTCTCCGTCTGGCTAACTGGGGGAGGGTTGCCTGCAGAGTGGGCAAAATATTTGGGAAGATCATTTAGGTGCTTGTTGAAAAGGACTCGAGATTTGAGGCTCCGTTTGCAACTAGCTCAAGTTGCCACTGGTTTCGTGGATGTCATGGATGCTTGCACAAAAGCACTTGATAAGGTACCCGAGAATGCCGGGCCTCTTGTGCTTAGGAAAGGGCATGGTGGTACATATCTTCCACTGATGTGA

Coding sequence (CDS)

ATGATAGTTATTAAGCGTCGAGTGAAAAGAACCGAAGAAATTCCAGGAGAGACGCTGCACGGGACCGGACAACGACCGGAAGAATATGAATGTAAATTGACGAAAAAGCCCATAATATCTGCAACTTTGGTCGGTAAGACTGTAACGATTCCCTTGCGAAATCCTGTTGTCTTCCCCGTTGCTATCGATTGGAGAATTCCGATAACACAGCTTAAGAGAGAGAGAGAGAGCGAGAGAGAGAGCTCCGATTCTGAAAATGGCTTCCTTCGTCGACTAGGTTATCTTTATGAGCTCCAGGGCAACATGATGGACTCGGTTTCAGGTGGTGGAGGTCCTGCCATACTGCAGCTGCATAAGTGGAATCCTTCACAGCCTCAACTCAACCTCTCAGAGTATCGTGAAGCTTTTATATCTCCTGCAAGGCAAATATTATTATTGCATTCATACAAACATGAAGCGTTGCTTCTTCCTCTAAATACAGGGGACGTCAGGTGTGGTAATGATCTCCCAAACGGATATGATATCAACTTAAAAGATTTGGGGTCGTTAGCTTTCTCAGAAGTAGTATCAACAGCACCTAGGTCAGAAGATGCAGAAGGCAATGTACGATGCTCTAACAAATCAGCTGTTGATATTGATAATGATTCTCCTACAGGAAACAAATCTTCAAGGTCTAGTTGTAACAACTTCCTTGGTGATGTAAGCTCACTTGCTTGGGGGCTTTGTGGAGATACCTATAAGAAGCGCAAAGATTCTTCTTTTAAGGAAATTTTATTTGTATCTGGAAATCATGGTGTCACTGCTCATGCTTTTTGTCAACCCAACAAGACCAATGAAGAGGCTAAAAATATGGTGCAGTCTGAGTTTTGGAAAGGAAGGTGGATGGAATGGGGACCCTATCCTACATTAGTTCAAAACTTGGAGGTCCAAGAACTTTCTGATTCTTGTGTAACCTCCGGAAATGTTGACAAAAACAGGATAAACCAGAATGGGGAAATTTTGCGAAGTTCTTGCTATGAGTTTGAGGATGATGCATTGTTGTTGGGAAATAGTGCACCTAAGAGATATTTACAATCTTTTCTTGCTAAGGTTAAGACTATTGAATATGAAGATGACATTTGGACGATAGTGGATTCAGTATCTGCTGATAGAGTTGATGAAACTGGAAGCAGAAATGATACCTTAATTCTTGTAGCTAGAGTTGGCAATTTGGGAATTAAGTGGGTTTCTTCTGTGAAATTTGAGAAAAGTCTATATATTTCACCGTTGATGGAGTGGGCAGATTTCTGCTTTTCAAATGATTTTCTTCTTTGTCTAAGCGACTCTGGTTTTATCTTTGTACACTCTGCTTTGTCTGGCAAGCATGTTACCTGTATAGATGTTTTACAGGCTTGTGGACTCAATCCTAAGTACTTACATTTGAAACAAGATTTGCAAATGAATCAAGTAGATCAAGTCCAGGATGATGTATCCTGTAGTCGTGATAGTTTTTATGACAGAAGAAAGTTTAGAAGGTTGTTATCTGATTCTCATTCCTCACATTTTGCCGTGATTGATGCATTTGGTATAATGTATGTCGTTTCTGCTGTTGACCATATGTTAGAGCACTATCATGGATCTGAAAATCTGTTTCCACATCCTCACAATTTTGAACTTGGGAGGGCTCCAGTTAGTTGGGAGGTTGGTGGTTATGACATAGGCTGCCAGAGGAACTATTCAGAGTCATTGGGGTCTCATTCATGTAGGGTTTTTTCCATGAAAAATGAAGGTGTTTCATTTTGGGGTAATACTAGATTTGATGTGCTTCAGAATACTCAGGACTCAAAGGTTTGTACGGGGAGAAAATATAAGTGCTCGTGTTTAACTGCTTCTGCTTCAATTTTACAAAATCAGAAGTTCCAGGGTGGTGAATTACAGTCTTGCACTATGCGAAAGATGTTTCTTTCCACTTGGAAAACTAATGAAGATGATTGCTTCTGCTTCTCTCCTATGGGACTTAC
TCAATTCATTAAAAGATGCAATATAAGTGGCCAAAAGTGCTCTCAAGTTGTCCATTTTGATCTGCATCTCAAGTCTGAAGTCCATGATGATAGCTGCTTAAAATCCCAAATGATTTTTGTTGATGGTAGGAAAGAAGAACTTGTTGGAGAAGCAGTTGGCTGCACTTCACAAGGATCTCTTTATTTGGTGACAAATAATGGTCTTTCCGTGGTTTTGCCTTCTGTTACCATTGCATCAGATTCTTTACCATCTGAGTCTGTTGCTAGATTACAACCTGGTGTTCTTCTTGGCACTCCTAATCAAGTAAAAGGTTTGGAACTGAAAGAATCTAATTGTTCATGGTCACCCTGGCAAGTTGAAGTTTTGGATAGGGTTCTTCTATATGAAAGCATAGATGAGGCAGATCGCTTGTGTTCTGAGAATGGATGGGACTTGAAAGTCGTGCGAATGCGTCGGTTTCAAATGACATTGCATTATTTGAGATTTGATGAACTAGAGCGATCACTAGAAATGCTTGTGGATGTTGATTTGGAAGAAGTAGGAATTTTGAGACTGCTCTTTGCTGCCGTACATCTGATGTTTCAAAAAGCTGGTACTGATAATGATATTTCAGCCGCTTCAAGGCTTCTAGCACTGGGCACGCGCTTTGCAACTAGAATGACTCATCGATATGGGATGGCCGAGTTCAAGAGAAATGCTACTATGTTTAATGACTTTAGTAGCAGCCAAGAAATTTCCATTCTCCCTCATTTTCCATTTCGAAAGCAAAACGAGTTGGAGTATTCAAGAAAACTTCATGAGATGTCTCACTTTTTGGAGATAATAAGAAATCTGCATTGCCATCTTAGTTCAAAATTTAAGAGGCCATGTCAGGAATTGGTAGCTGGGGAGGCACCAATTTCGGATGAAAACAATCTATTGCTGGATGAACCTCAGCTTGTTTCTACAGATATAATACCATTGGGGAGTACAAGTCAATATGAACTTTCATTTCCTTCAAATGATTTGAGCTCTACCGTTGTAGATGGTCTTGTTATGATGCCCATGGTTTCTGAATCCCAGTTGGATTCAGAAGATTTAAATGGAGACTCTGCTGTTGTACCACAAGGAGTCTTGGAAAAGAAAGTTGTTCCATTGGAGAATCCCAAGCAGATGATTGCACGTTGGAAGTCAGATAAGTTGCCACTTAAAAATGTTGTTAAAGACGCTCTTCTCTCTGGCCGTCTTCCTTTGGCTGTTCTTCAACTACACATTAATCATTTAAGAGAATTAATTGGCGAGAAAGAACCTCATGATACATTTTCTGAAATTCGTGACATTGGAAGAGCTATTGCTTATGATCTCTTCTTAAAGGGTGAGATTGGGCTTGCCATTGCTACGCTGCAGAGGCTTGGAGATGACATTGAAGTTAGCCTCAAACAATTGTTGTATGGCACAATTAACAGATCTTTTCGAGTGGAAATTGCTGCGGAGATGAAAAAATATGGTTATCTGGGGCCATTTGACCAGAGGATGATGGATAGAATAGTACATATTGAGAGGCTCTACCCAAGCAGTAATTTCTGGAAAACATTTCTGAGCAGGCAGAGAGCAAATATGGGATCCCCATCAAGTTCTGCCACACCTGGAGAAAATGATTTGAGGACATTACGTTTCCATTTAATCAACAATACTATCATTGATTGTGGTGAGGTTGATGGTGTCGTTTTAGGTTCGTGGCCCAATGCCAATGAGAACTCTTTTGTTCTGGAGATCACTGAAGATAATGCTCATGTGGGATATTGGGCTGCCGCTGCCATTTGGACAAACACATGGGATCAACGAACAACTGATCGTATACTACTTGATCGATCTTTGGGTATTGGTATCCCTGTGGCGTGGGAATCTCAACTTGATTACCACATATGCCATAATAACTGGGATGAAGTATCAAGACTTCTTGATATGATTCCTGTTTCTAATTTGCTAGATGGAAGCCTCCAAGTAAGCTTAGATGGTC
TACAGTCAGCTTCAGCAGTTGGGTGCAATCGAGAGTCTTCTTTTTACAGCAATTACTTATACCCTCTTGAAGAATTAGATGCTGTTTGCTTGTATATTCCCAAAGCCAAAATTTTCAGGTTCTCAGCTAATATTATGTGCTCCAAATGGTTGGGTATGCTCTTGGAGGAGAAGCTTGCAAGACAGTTTATATTTCTGAAGGAATACTGGGAAGGCACAATGGAGTTGGTACCTCTTCTTGCACGTTCTGGCTTCATTACAAACAGACTTGATGAGATTGCTTCCGTGGATGATCACATCAGCAGTTCAGTTGATCAAAGATCCACAAACAATGGTGGAGCATTTTATGTTGATTCTGTGCAAGCATTATATAAAGTTTTCATACATCACTGTTCACAGTATAACTTGCCATTTCTTCTCGACCTTTATCTGGACCATCACAAATTGGTTGTTGATAATAATTCAGTTCGTTCACTACTGGAAGCTGCAGGAGATTGTCAATGGGCAAGATGGTTACTTCTGTCGAGGATCAGGGGCTGTGAATATGATGCATCATTTTCTAACGCTCGCTCAATAATGTCACTGAATTTAGTTCATGATCCTAACCTTGGTGTTCGGGATATTGATGAGATTATTTGTACTGTTGGTGACATTGCTGAAGGAGGAGGAGAAATGGCAGCCCTAGCAACTCTGATGTATGCTCCTTCCCCAATACAAGATTGTTTGAGTAGCAGTGGTGTGAACAGACATAGTAGCTCGTCAGCCCAATGTACTCTTGAAAACCTCAGGCCAGCCCTGCAACGATTCCCTACATTGTGCCGTGCTCTAGTTACATCGGCTTTCCAGCAAGATACAACTTGCAATTTTTTGGGTCCAAAATTGAAGAATGCATTGTCAGAATATCTACATTGGCGTAGCAGCATCTTTTTTTCTGCTGGACGTGACACTTCACTTCTACATATGCTGCCATGCTGGTTTCCCAAGGCAGTTAGGAGATTGCTTCAGCTCTATGTCCAGGGTCCTCTTGGATGGCAATCACTCTCAGCTTTGCCAACGGGGCAGACGTTATGGGAGAGGGATGTTCATTTTTTTATGAATGATTATGAACATTCTGAAATCAGTCCAATCTCTTGGGAAGCAACCATACAGAAGCACATAGAAGATGAGTTATATGATTCTTCTCTTAAGGAAACTGGAGTTGGGCTTGAGCACAATTTGCATCGTGGACGTGCATTTTCAGCTTTTAACCATCTTCTTGCTGCTAGAGTTCAGAAACTAAAATCAGAGATTCAGCCGGGTTCAGCAACTGGACCATCAAATACACAGTTCGATCTACAGGCACTTTTTGCTCCTCTGACATTAAGGGAACAGTCTCTTCTTTCTTCTATTATTCCACTTGCCATTACACATTTTGAGAACTCTGTGTTAGTTGCTTCATGTGCTTTTCTCCTGGAGCTTTGTGGATTATCTGCCAGTATGCTCCGTGTAGATGTAGCAGCTTTAAGACGAATATCTACCTTTAACAAGTCTGGGCAATCCTTTGAGAATTTCAGGCAACTTTCACCTAAGGGCTCTGCTTTTCATCCAGTACCCCTAGAATCTGATAAAGTAGAGACTCTTGCTCGAGCTCTGGCTGATGAGTATCTGCACCAGGAAAGTTCAAGTGTTAATAAGCCAAAGGGCACTTCTAATTCAGCACCTTCAAAACGTTGTCCACAGGTGCTTTTCGTATTACAGCATTTGGAAGAGGTCAGTCTTCCCCAAGTGGTCGATGGAAATTCATGTGGATCATGGCTATTAAGTGGTAAAGGCGATGGGACTGAGCTAAGAAATCAACAAAAAGCTGCAAGCCATTACTGGAACTTAGTTACAGTCTTTTGTCGGATGCATAGGCTCCCTCCAAGTTCAAAGTATCTTGCTTTGTTAGCAAGAGACAATGACTGGGTTGGATTTTTAACTGAGGCTCACGTTGGCGGGTACCCTTTTGACACAGTTATCCAA
GTAGCATCAAAAGAGTTCAGTGATCCACGTCTCAAAATCCATATATTAACTGTATTGAAGGCTGTACAGTCAAGGAAAAACCCTGGCCCTTCATCATACTCTGACACTGAAGATAAAAAAAGTCAAACTTCCTTTTTGGATGGAAGTACGTATATTCCAGTTGAGCTCTTTACAATTTTAGCCGAATGTGAGAAGAAGAAAAACCCTGGAAAAGCTCTCTTGATAAAGGCAGAGGAGTTATCCTGGTCTATTTTGGCAATGATTGCTTCTTGTTTCCCAGATGTGTCTCCATTATCCTGTCTTACTGTTTGGCTAGAAATTACTGCAGCAAGGGAAACTACATCCATTAAGGTAAATGATATTGCTTCCCAGATTGCAGAAAATGTTGGGGCAGCTGTAGAAGCTACCAATACCTTGCCAGCTGGGTGTAGATCACCTGCATTTCATTACTGCCGGAAAAATCCCAAACGAAGGCGAACCATGGATTCCATTTCTAAAGATCCATCAGTTGGAGTGATCTCTGATACTTTCAGTGCTTCAACAGGTGCATCAACTAATGTTTCAGGTGGCTTTATTGTCAAGGAAGAAGGAAAGATAGTTCAGGAACGTCGACCTATTTCTGTTTCATATGATTCAGATGAAGCACCATCATCTCTGTCCAAGATGGTTTCTGTGCTTTGTGAACAGAAATTATTCTTGCCTCTGTTAAGGGCTTTTGAAATGTTCCTTCCTTCGTGTTCCCTGCTACCGTTCATCCGCGCTCTTCAGGCGTTTTCACAAATGCGTTTATCTGAAGCTTCAGCCCATTTAGGTTCTTTTTCTGTACGAGTTAAGGATGAAGCTAGCTTTTCTCATGCAAATGTTGAGGGAGAAGAACACACTGGGACATCATGGACCGGGTCCACTGCTGTTAAGGCTGCTAATGCTGTACTGTCTGTTTGCCCATCTCCATATGAAAGAAAATGTCTACTGAAACTGCTAGCCGCAACTGATTTTGGTGATGGAGGATTTGCTGCCGCTTATTATCAACGGCTTTATTGGAAAATCAATTTAGCAGAGCCTTCAATACGTATAGATGATGGCCTGCACCTTGGAAATGAGGCTCTGGACGATGCATCACTTTTAACAGCGCTAGAAAATAATGGACATTGGGAGCAAGCACGCAATTGGGCAAAGCAACTGGAAGCTAGTGGGGGTTCTTGGAAATCTGCTAGTCATCATGTCACAGAAACTCAGGCTGAATCTATGGTGGCAGAATGGAAGGAATTTTTATGGGATGTTCAAGAAGAGAGAGTTGCATTGTGGGGTCACTGCCAGGCACTCTTCATTAGATATTCCTTTCCTGCTTTACAGGCTGGATTATTTTTCCTTAAACATGCAGAAGCTGTGGAGAAAGATCTTCCAGCCAAGGAGCTTCATGAACTATTATTACTTTCCTTGCAATGGTTAAGTGGGATGTTTACCATGTCTTATCCAGTTTATCCATTGCATCTTCTACGAGAAATTGAGACCAAGGTTTGGCTCCTGGCAGTAGAGTCAGAAGCTGAGCTGAAGAATGAACGGGATTTGAACATTAACAACTCCAGCCGGGAATGTATATCTAGGAATAGCTCAAGTATTATCGACTGGACTGCAAGTATAATATCAAAAATGGATAAACATATTAGTACAATGAAGAATAAAAGTATGGATAAACATGAGGTGAGAGAAAACAGCCAGACTCATCATAAAAGTCACGTATTAGATGCTGGCCTTTCAACTGCAGGAGGGGGGAATACAAAGGCTAAAAGGAGGACCAAAGGTTCCGTGCTAATTCGACGGCCATTAGTGGACTCTACAGACATGAACACTAACCCTGAAGATGGATGTGTTCCTTCCAATTTTAAAAATGACTTGCACTTGCAAGATGAGAACTTAAAAATGGATACATCATTATCGGGGTGGGAAGAAAGAATTGGACCTGCAGAGGTGGATAGAGCTGTTCTTTCATTATTAGAGTT
TGGACAAATTACGGCTGCCAAGCAGCTTCAACAAAAGCTGTCTCCTGGGCAAGTACCTTCAGAATTCCTTCTTGTGGATGCTGCTTTTACGCTTGCAGCTATATCAACCCCTAATCGTGAAGTTTCCATGTCCATGCTTGATGAGGATTTATGTTCAGTTATTCTTGCATATGATATTCCGGTTGATCAGTATCTCAATCCGTTGCAGGTTTTGGAGATTTTAGCAACAATATTTGCCGAAGGAGGTGGACGTGGACTTTGTAGAAGAGTGATTGCAGTTGTAAAAGCTGCAAATGTCTTGGGACTTCCATTTTCAGAGGCATATAACAAACAGCCAATTGAACTATTACAGCTGCTCTCTCTCAAGGCACAAGAGTCATTTGAGGAGGCAAATTTCCTTGTGCAGACTCACTCTATGCCTGCTGCTAGTATTGCTCAAATTCTTGCAGAATCCTTCCTAAAGGGCTTGTTGGCTGCACATCGTGGAGGTTATATGGATTCCCAGAAAGATGAAGGACCTGCTCCTCTACTATGGAGATTCTCTGACTTCTTGAAGTGGTCAGAACTTTGTCCTTCTGAACCAGAGATTGGGCATGCGTTAATGCGTTTAGTTATTACTGGACAAGAGATACCACATGCCTGTGAGGTGGAGCTTTTAATTTTGTCTCACCACTTCTACAAATCATCGGCTTGCCTTGATGGGGTGGATGTTCTCGTGGCTCTTGCTGCCACAAGAGTTGAGGCTTATGTAGCTGAGGGTGATTTTCCATGTTTAGCTCGCCTGATAACTGGAGTTGGAAACTTCTATGCCCTTAGTTTTATTCTTGGCATTCTTATAGAGAACGGTCAGCTAGAACTTCTTCTTCAAAAGTTCTCAGCTGCTGCAGATACAAGTGCAGGGAGTGCTGAGGCTGTCAGGGGATTTCGCATGGCTGTTCTCACATCCCTCAAGCATTTTAACCCCACTGATCTTGATGCATTTGCTAAGGTCTACAGCCATTTTGACATGAAACATGAAACAGCTGCTCTTCTGGAGTCACAAGCGGAGCAGTCGTGTGAGATGTGGTTCCGCCGCTATTACAAGGACCAGAATGCAGACCTTTTAGATGCAATGCATTACTACATCGCAGCTGCTGAAGTTCATTCTTCCATTGATGCTGGCAACAAAACCCGCAGATCCTGTGCACAGGCTTCTCTAGTGTCCCTTCAGATTAGGATGCCCGACTTTAAGTGGCTCTTTCAGTCGGAAACCAACGCCAGAAGAGCTCTTGTCGAGCAATCAAGATTCCAAGAGGCACTAATTGTTGCTGAAGCTTATGATCTTGACCAGCCAAGCGAGTGGGCTTTAGTCATTTGGAATCAGATGCTTAAACCAGAGATTCTAGAAGAATTTGTGGCTGAATTTGTGAGTGTGCTTCCACTCCATCCTTCAATGTTAGCTGACATTGCAAGATTTTATAGGTCTGAAGTGGCTGCCCGCGGGGACCAGTCCCAATTCTCCGTCTGGCTAACTGGGGGAGGGTTGCCTGCAGAGTGGGCAAAATATTTGGGAAGATCATTTAGGTGCTTGTTGAAAAGGACTCGAGATTTGAGGCTCCGTTTGCAACTAGCTCAAGTTGCCACTGGTTTCGTGGATGTCATGGATGCTTGCACAAAAGCACTTGATAAGGTACCCGAGAATGCCGGGCCTCTTGTGCTTAGGAAAGGGCATGGTGGTACATATCTTCCACTGATGTGA

Protein sequence

MIVIKRRVKRTEEIPGETLHGTGQRPEEYECKLTKKPIISATLVGKTVTIPLRNPVVFPVAIDWRIPITQLKRERESERESSDSENGFLRRLGYLYELQGNMMDSVSGGGGPAILQLHKWNPSQPQLNLSEYREAFISPARQILLLHSYKHEALLLPLNTGDVRCGNDLPNGYDINLKDLGSLAFSEVVSTAPRSEDAEGNVRCSNKSAVDIDNDSPTGNKSSRSSCNNFLGDVSSLAWGLCGDTYKKRKDSSFKEILFVSGNHGVTAHAFCQPNKTNEEAKNMVQSEFWKGRWMEWGPYPTLVQNLEVQELSDSCVTSGNVDKNRINQNGEILRSSCYEFEDDALLLGNSAPKRYLQSFLAKVKTIEYEDDIWTIVDSVSADRVDETGSRNDTLILVARVGNLGIKWVSSVKFEKSLYISPLMEWADFCFSNDFLLCLSDSGFIFVHSALSGKHVTCIDVLQACGLNPKYLHLKQDLQMNQVDQVQDDVSCSRDSFYDRRKFRRLLSDSHSSHFAVIDAFGIMYVVSAVDHMLEHYHGSENLFPHPHNFELGRAPVSWEVGGYDIGCQRNYSESLGSHSCRVFSMKNEGVSFWGNTRFDVLQNTQDSKVCTGRKYKCSCLTASASILQNQKFQGGELQSCTMRKMFLSTWKTNEDDCFCFSPMGLTQFIKRCNISGQKCSQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKEELVGEAVGCTSQGSLYLVTNNGLSVVLPSVTIASDSLPSESVARLQPGVLLGTPNQVKGLELKESNCSWSPWQVEVLDRVLLYESIDEADRLCSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEEVGILRLLFAAVHLMFQKAGTDNDISAASRLLALGTRFATRMTHRYGMAEFKRNATMFNDFSSSQEISILPHFPFRKQNELEYSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEAPISDENNLLLDEPQLVSTDIIPLGSTSQYELSFPSNDLSSTVVDGLVMMPMVSESQLDSEDLNGDSAVVPQGVLEKKVVPLENPKQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINHLRELIGEKEPHDTFSEIRDIGRAIAYDLFLKGEIGLAIATLQRLGDDIEVSLKQLLYGTINRSFRVEIAAEMKKYGYLGPFDQRMMDRIVHIERLYPSSNFWKTFLSRQRANMGSPSSSATPGENDLRTLRFHLINNTIIDCGEVDGVVLGSWPNANENSFVLEITEDNAHVGYWAAAAIWTNTWDQRTTDRILLDRSLGIGIPVAWESQLDYHICHNNWDEVSRLLDMIPVSNLLDGSLQVSLDGLQSASAVGCNRESSFYSNYLYPLEELDAVCLYIPKAKIFRFSANIMCSKWLGMLLEEKLARQFIFLKEYWEGTMELVPLLARSGFITNRLDEIASVDDHISSSVDQRSTNNGGAFYVDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLVVDNNSVRSLLEAAGDCQWARWLLLSRIRGCEYDASFSNARSIMSLNLVHDPNLGVRDIDEIICTVGDIAEGGGEMAALATLMYAPSPIQDCLSSSGVNRHSSSSAQCTLENLRPALQRFPTLCRALVTSAFQQDTTCNFLGPKLKNALSEYLHWRSSIFFSAGRDTSLLHMLPCWFPKAVRRLLQLYVQGPLGWQSLSALPTGQTLWERDVHFFMNDYEHSEISPISWEATIQKHIEDELYDSSLKETGVGLEHNLHRGRAFSAFNHLLAARVQKLKSEIQPGSATGPSNTQFDLQALFAPLTLREQSLLSSIIPLAITHFENSVLVASCAFLLELCGLSASMLRVDVAALRRISTFNKSGQSFENFRQLSPKGSAFHPVPLESDKVETLARALADEYLHQESSSVNKPKGTSNSAPSKRCPQVLFVLQHLEEVSLPQVVDGNSCGSWLLSGKGDGTELRNQQKAASHYWNLVTVFCRMHRLPPSSKYLALLARDNDWVGFLTEAHVGGYPFDTVIQ
VASKEFSDPRLKIHILTVLKAVQSRKNPGPSSYSDTEDKKSQTSFLDGSTYIPVELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLEITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRSPAFHYCRKNPKRRRTMDSISKDPSVGVISDTFSASTGASTNVSGGFIVKEEGKIVQERRPISVSYDSDEAPSSLSKMVSVLCEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSVRVKDEASFSHANVEGEEHTGTSWTGSTAVKAANAVLSVCPSPYERKCLLKLLAATDFGDGGFAAAYYQRLYWKINLAEPSIRIDDGLHLGNEALDDASLLTALENNGHWEQARNWAKQLEASGGSWKSASHHVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKDLPAKELHELLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKNERDLNINNSSRECISRNSSSIIDWTASIISKMDKHISTMKNKSMDKHEVRENSQTHHKSHVLDAGLSTAGGGNTKAKRRTKGSVLIRRPLVDSTDMNTNPEDGCVPSNFKNDLHLQDENLKMDTSLSGWEERIGPAEVDRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDAAFTLAAISTPNREVSMSMLDEDLCSVILAYDIPVDQYLNPLQVLEILATIFAEGGGRGLCRRVIAVVKAANVLGLPFSEAYNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAAHRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELLILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILIENGQLELLLQKFSAAADTSAGSAEAVRGFRMAVLTSLKHFNPTDLDAFAKVYSHFDMKHETAALLESQAEQSCEMWFRRYYKDQNADLLDAMHYYIAAAEVHSSIDAGNKTRRSCAQASLVSLQIRMPDFKWLFQSETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKPEILEEFVAEFVSVLPLHPSMLADIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRSFRCLLKRTRDLRLRLQLAQVATGFVDVMDACTKALDKVPENAGPLVLRKGHGGTYLPLM
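
The protein sequence above corresponds to the CDS read codon by codon. As a minimal sketch (the names `CODON_TABLE` and `translate` are illustrative and not part of this record, and the standard nuclear genetic code is assumed), the following translates the first 36 bases of the CDS and reproduces the first 12 residues of the protein:

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 36 bases of the CDS listed in this record.
cds_start = "ATGATAGTTATTAAGCGTCGAGTGAAAAGAACCGAA"
print(translate(cds_start))  # MIVIKRRVKRTE
```

The same function can be applied to the full CDS once its wrapped lines are joined into a single string.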
Homology
BLAST of Sgr026353 vs. NCBI nr
Match: XP_038881148.1 (uncharacterized protein LOC120072742 [Benincasa hispida])

HSP 1 Score: 5549.2 bits (14394), Expect = 0.0e+00
Identity = 2802/3240 (86.48%), Positives = 2945/3240 (90.90%), Query Frame = 0
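
The percentages in the HSP statistics line are the matched counts divided by the alignment length (3240 columns here). A small sketch of that arithmetic, assuming a hypothetical helper `hsp_percentages` that reads every `label = matched/aligned` pair from such a line:

```python
import re


def hsp_percentages(stats_line):
    """Return {label: percent} for each 'label = matched/aligned' pair in the line."""
    result = {}
    for label, matched, aligned in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        result[label] = round(100 * int(matched) / int(aligned), 2)
    return result


# Counts taken from the HSP 1 statistics of this record.
line = "Identity = 2802/3240 (86.48%), Positives = 2945/3240 (90.90%), Query Frame = 0"
print(hsp_percentages(line))  # {'Identity': 86.48, 'Positives': 90.9}
```

Fields without a `matched/aligned` fraction, such as the query frame, are skipped by the pattern.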

Query: 103  MDSVSGGGGPAILQLHKWNPSQPQLNLSEYREAFISPARQILLLHSYKHEALLLPLNTGD 162
            MDSVSG  GP ILQL KW+PSQ QLNLSEYREAFISP RQ LLLHSYK+EALLLPLNTGD
Sbjct: 1    MDSVSGCEGPVILQLQKWSPSQSQLNLSEYREAFISPTRQNLLLHSYKYEALLLPLNTGD 60

Query: 163  VRCGNDLPNGYDINLKDLGSLAFSEVVSTAPRSEDAEGNVRCSNKSAVDIDNDSPTGNKS 222
            +RC N+ PN YDINLKDLGSLAFSE VSTA RSEDAEG+V+CSN+S +DID +SPTGNKS
Sbjct: 61   IRCSNNFPNEYDINLKDLGSLAFSEEVSTASRSEDAEGDVQCSNRSVIDIDKESPTGNKS 120

Query: 223  SRSSCNNFLGDVSSLAWGLCGDTYKKRKDSSFKEILFVSGNHGVTAHAFCQPNKTNEEAK 282
            SR++CNNFLGDVSSLAWGLCGD YKK ++S F EILFVSGNHGVTAHAFCQP     EAK
Sbjct: 121  SRANCNNFLGDVSSLAWGLCGDNYKKHENSYFMEILFVSGNHGVTAHAFCQPKNIVAEAK 180

Query: 283  NMVQSEFWKGRWMEWGPYPTLVQNLEVQELSDSCVTSGNVDKNRINQNGEILRSSCYEFE 342
            NMVQSEFWKGRW+EWG  PTL Q LEVQE S SC T G+VD+N  NQNGE+LRSS  E E
Sbjct: 181  NMVQSEFWKGRWVEWGADPTLPQILEVQERSGSCETFGHVDENGRNQNGEMLRSSYSECE 240

Query: 343  DDALLLGNSAPKRYLQSFLAKVKTIEYEDDIWT--------------------------- 402
            +DALL GNSA KRYLQSFLAKVKTIEYEDDIWT                           
Sbjct: 241  NDALLSGNSASKRYLQSFLAKVKTIEYEDDIWTMYPEKTSVPCFTKVVSFNIFNYNLPPP 300

Query: 403  ------------------------------------------------------------ 462
                                                                        
Sbjct: 301  ISVDDSSVNEQSWHEIILGTPSIIRSTSSDTRFLSDILSNVFDIGTNKSYKCSRIFASNS 360

Query: 463  ---------IVDSVSADRVDETGSRNDTLILVARVGNLGIKWVSSVKFEKSLYISPLMEW 522
                     IV+SVSAD+ DE  SRNDTLILVARVGNLGIKWVSSV+FEKS Y+SP MEW
Sbjct: 361  HILIGFVLKIVESVSADKGDEIASRNDTLILVARVGNLGIKWVSSVEFEKSQYVSPTMEW 420

Query: 523  ADFCFSNDFLLCLSDSGFIFVHSALSGKHVTCIDVLQACGLNPKYLHLKQDLQMNQVDQV 582
            ADFCFSNDF++CLSDSGFIF+HSALSGKHVTCIDVLQACGL+PKYL  KQDLQM QVDQV
Sbjct: 421  ADFCFSNDFIVCLSDSGFIFIHSALSGKHVTCIDVLQACGLDPKYLQEKQDLQMKQVDQV 480

Query: 583  QDDVSCSRDSFYDRRKFRRLLSDSHSSHFAVIDAFGIMYVVSAVDHMLEHYHGSENLFPH 642
            Q+ VSC R S Y  RKFRRLL DS SS FAVID FG+MYVVS+VD MLEH HGSENL P 
Sbjct: 481  QEVVSCRRGSLYHTRKFRRLLLDSLSSRFAVIDTFGVMYVVSSVDRMLEHCHGSENLLPP 540

Query: 643  PHNFELGRAPVSWEVGGYDIGCQRNYSESLGSHSCRVFSMKNEGVSFWGNTRFDVLQNTQ 702
             HNFELG+APVSWE GGYDIGCQRNYSESLG HSC   SMKNEG S WGNT+ +VLQN Q
Sbjct: 541  YHNFELGKAPVSWEGGGYDIGCQRNYSESLGPHSCGNCSMKNEGASLWGNTKSNVLQNIQ 600

Query: 703  DSKVCTGRKYKCSCLTASASILQNQKFQGGELQSCTMRKMFLSTWKTNEDDCFCFSPMGL 762
            DSKV TG++YKCSCLTAS+ ILQ+QK QGGELQSC MRK+FLS WKTNE+DCFCFSPMGL
Sbjct: 601  DSKVYTGKRYKCSCLTASSPILQDQKSQGGELQSCMMRKIFLSGWKTNENDCFCFSPMGL 660

Query: 763  TQFIKRCNISGQKCSQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKEELVGEAVGCTSQGS 822
            TQ+I+RCNI+G  C QVVHFDLHLKSEVHDDSCLKSQMIF+DGRK++LVGEA+GCTSQGS
Sbjct: 661  TQYIRRCNINGPNCYQVVHFDLHLKSEVHDDSCLKSQMIFIDGRKKDLVGEAIGCTSQGS 720

Query: 823  LYLVTNNGLSVVLPSVTIASDSLPSESVARLQPGVLLGTPNQVKGLELKESNCSWSPWQV 882
            LYLVTNNGLSVVLPSVT++S+SLP ESVARLQP  LLGT NQVK L+LKES C WSPWQV
Sbjct: 721  LYLVTNNGLSVVLPSVTVSSNSLPYESVARLQPDSLLGTANQVKDLDLKESVCPWSPWQV 780

Query: 883  EVLDRVLLYESIDEADRLCSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEE 942
            EVLDR+LLYESIDEADRLCSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEE
Sbjct: 781  EVLDRILLYESIDEADRLCSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEE 840

Query: 943  VGILRLLFAAVHLMFQKAGTDNDISAASRLLALGTRFATRMTHRYGMAEFKRNATMFNDF 1002
             GILRLLFAAVHLMFQKA  DNDISAASRLLALGT FATRM H+YG+AEFKRNAT FNDF
Sbjct: 841  EGILRLLFAAVHLMFQKARNDNDISAASRLLALGTHFATRMIHQYGLAEFKRNATTFNDF 900

Query: 1003 SSSQEISILPHFPFRKQNELEYSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEAP 1062
            SSSQEISILPHFPFR QNELEYSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEA 
Sbjct: 901  SSSQEISILPHFPFRMQNELEYSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEAS 960

Query: 1063 ISDENNLLLDEPQLVSTDIIPLGSTSQYELSFPSNDLSSTVVDGLVMMPMVSESQLDSED 1122
            ISD+ + +LDEPQ VSTDIIP GSTSQYELS PSNDL+S V+DGLVMMPM+SESQ+DSED
Sbjct: 961  ISDQTSQVLDEPQFVSTDIIPSGSTSQYELSLPSNDLNSNVMDGLVMMPMISESQMDSED 1020

Query: 1123 LNGDSAVVPQGVLEKKVVPLENPKQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHIN 1182
            ++GDSAVVPQGV EKKVVPLENP QMIARWKSDKLPLK VVKDALLSGRLPLAVLQLHIN
Sbjct: 1021 VDGDSAVVPQGVFEKKVVPLENPNQMIARWKSDKLPLKTVVKDALLSGRLPLAVLQLHIN 1080

Query: 1183 HLRELIGEKEPHDTFSEIRDIGRAIAYDLFLKGEIGLAIATLQRLGDDIEVSLKQLLYGT 1242
            H+RELIG+ EPHDTFSEIRDIGRAIAYDLFLKGE G+AIATLQRLGDDIEVSLKQLLYGT
Sbjct: 1081 HVRELIGDDEPHDTFSEIRDIGRAIAYDLFLKGETGVAIATLQRLGDDIEVSLKQLLYGT 1140

Query: 1243 INRSFRVEIAAEMKKYGYLGPFDQRMMDRIVHIERLYPSSNFWKTFLSRQRANMGSPSSS 1302
            INR+FRVEIA EMKKYGYLGPFDQRMMD I+HIERLYPSSNFWKTFLSRQ+ANMG PS S
Sbjct: 1141 INRTFRVEIATEMKKYGYLGPFDQRMMDIILHIERLYPSSNFWKTFLSRQKANMGFPSRS 1200

Query: 1303 ATPGENDLRTLRFHLINNTIIDCGEVDGVVLGSWPNANENSFVLEITEDNAHVGYWAAAA 1362
             +PGENDL+TLRFHLINNTIIDCGEVDGV+LGSWPNANE+S VLEI EDN H+GYWAAAA
Sbjct: 1201 NSPGENDLKTLRFHLINNTIIDCGEVDGVILGSWPNANESSPVLEINEDNVHMGYWAAAA 1260

Query: 1363 IWTNTWDQRTTDRILLDRSLGIGIPVAWESQLDYHICHNNWDEVSRLLDMIPVSNLLDGS 1422
            IWTNTWDQRTTDRILLD+SL IG  V WESQLDYHICH+NWD VSRLLD+IPV+NLLDGS
Sbjct: 1261 IWTNTWDQRTTDRILLDQSLDIGTHVTWESQLDYHICHDNWDGVSRLLDVIPVANLLDGS 1320

Query: 1423 LQVSLDGLQSASAVGCNRESSFYSNYLYPLEELDAVCLYIPKAKIFRFSANIMCSKWLGM 1482
            LQ+SLDGLQ+A+AVGCNRESSFYSNYLY LEELDA+CLYIP AKIFRFS NIMCSKWLGM
Sbjct: 1321 LQISLDGLQTATAVGCNRESSFYSNYLYSLEELDAICLYIPNAKIFRFSTNIMCSKWLGM 1380

Query: 1483 LLEEKLARQFIFLKEYWEGTMELVPLLARSGFITNRLDEIASVDDHISSSVDQRSTNNGG 1542
            LLEEKLA  FIFLKEYWEGTMELVPLLARSGFIT+RLDEIASVDDHISS VDQ  +N GG
Sbjct: 1381 LLEEKLATHFIFLKEYWEGTMELVPLLARSGFITHRLDEIASVDDHISSLVDQSFSNKGG 1440

Query: 1543 AFYVDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLVVDNNSVRSLLEAAGDCQWARWLL 1602
            AF VDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLV+DNNSVRSLLEAAGDCQWARWLL
Sbjct: 1441 AFSVDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLVIDNNSVRSLLEAAGDCQWARWLL 1500

Query: 1603 LSRIRGCEYDASFSNARSIMSLNLVHDPNLGVRDIDEIICTVGDIAEGGGEMAALATLMY 1662
            LSR+RGCEYDASFSNARSIMSLNLVHDPNL VR+IDEII TV DIAEG GEMAALATLMY
Sbjct: 1501 LSRVRGCEYDASFSNARSIMSLNLVHDPNLSVRNIDEIISTVADIAEGAGEMAALATLMY 1560

Query: 1663 APSPIQDCLSSSGVNRHSSSSAQCTLENLRPALQRFPTLCRALVTSAFQQDTTCNFLGPK 1722
            APSPIQDCL+SSGVNRHSSSSAQCTLENLRP LQRFPTLCRALVTSAFQQDTTCNFLGPK
Sbjct: 1561 APSPIQDCLNSSGVNRHSSSSAQCTLENLRPVLQRFPTLCRALVTSAFQQDTTCNFLGPK 1620

Query: 1723 LKNALSEYLHWRSSIFFSAGRDTSLLHMLPCWFPKAVRRLLQLYVQGPLGWQSLSALPTG 1782
             KNALSEYLHWR+ IFFSAGRDTSLLHMLPCWFPKAVRRLLQLYVQGPLGWQSLSALPTG
Sbjct: 1621 SKNALSEYLHWRNIIFFSAGRDTSLLHMLPCWFPKAVRRLLQLYVQGPLGWQSLSALPTG 1680

Query: 1783 QTLWERDVHFFMNDYEHSEISPISWEATIQKHIEDELYDSSLKETGVGLEHNLHRGRAFS 1842
            QTLWER+V+FFMND EHSEISPISWEATIQKHIEDELYDSSLKETG+GLEHNLHRGRA S
Sbjct: 1681 QTLWEREVYFFMNDDEHSEISPISWEATIQKHIEDELYDSSLKETGLGLEHNLHRGRALS 1740

Query: 1843 AFNHLLAARVQKLKSEIQPGSATGPSNTQFDLQALFAPLTLREQSLLSSIIPLAITHFEN 1902
            AFNHLLAARVQKLKSEIQ  SATG SN Q DLQ LFAPLT REQSLLSSIIPLAITHFEN
Sbjct: 1741 AFNHLLAARVQKLKSEIQSSSATGQSNIQLDLQTLFAPLTPREQSLLSSIIPLAITHFEN 1800

Query: 1903 SVLVASCAFLLELCGLSASMLRVDVAALRRISTFNKSGQSFENFRQLSPKGSAFHPVPLE 1962
            SVLVASCAFLLEL GLSASMLRVDVA LRRISTF KSGQSFENFRQLSPKGSAFHPVPLE
Sbjct: 1801 SVLVASCAFLLELGGLSASMLRVDVATLRRISTFYKSGQSFENFRQLSPKGSAFHPVPLE 1860

Query: 1963 SDKVETLARALADEYLHQESSSVNKPKGTSNSAPSKRCPQVLF-VLQHLEEVSLPQVVDG 2022
            SDK+ETLARALADEYLHQESSS+ KPKGTS+S P KRCPQVL  VLQHLEEVSLPQVVDG
Sbjct: 1861 SDKIETLARALADEYLHQESSSIKKPKGTSDSEPPKRCPQVLLVVLQHLEEVSLPQVVDG 1920

Query: 2023 NSCGSWLLSGKGDGTELRNQQKAASHYWNLVTVFCRMHRLPPSSKYLALLARDNDWVGFL 2082
            NSCGSWLLSGKGDGTELRNQQKAASHYWNLV VFCRMH LP SSKYLALLARDNDWVGFL
Sbjct: 1921 NSCGSWLLSGKGDGTELRNQQKAASHYWNLVIVFCRMHSLPLSSKYLALLARDNDWVGFL 1980

Query: 2083 TEAHVGGYPFDTVIQVASKEFSDPRLKIHILTVLKAVQSRKNPGPSSYSDTEDKKSQTSF 2142
            TEAHVGGYPFDTVIQVASKEFSDPRLKIHILTVLKAVQ RK+PGPSS+SDTE+KK +T+F
Sbjct: 1981 TEAHVGGYPFDTVIQVASKEFSDPRLKIHILTVLKAVQLRKSPGPSSHSDTEEKKGRTTF 2040

Query: 2143 LDGSTYIPVELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWL 2202
            LDG+ YIPVELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCF DVSPLSCLTVWL
Sbjct: 2041 LDGNMYIPVELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFSDVSPLSCLTVWL 2100

Query: 2203 EITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRSPAFHYCRKNPKRRRTMDSIS 2262
            EITAARETTSIKVNDIASQIAENVGAAVEATNTLP GCRSPAFHYCRKNPKRRRTMD IS
Sbjct: 2101 EITAARETTSIKVNDIASQIAENVGAAVEATNTLPVGCRSPAFHYCRKNPKRRRTMDFIS 2160

Query: 2263 KDPSVGVISDTFSASTGASTNVSGGFIVKEEGKIVQERRPISVSYDSDEAPSSLSKMVSV 2322
            ++ SVGV+SD   ASTGASTNVSG  IVKEEGK+VQE + ISVSYDSDEA SSLSKMVSV
Sbjct: 2161 EEQSVGVMSDNIGASTGASTNVSGDCIVKEEGKMVQECKRISVSYDSDEAASSLSKMVSV 2220

Query: 2323 LCEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSVRVKDEASFSH 2382
            LCEQ+L+LPLLRAFEMFLPSCSLL FIRALQAFSQMRL+EASAHLGSFSVRVKDEAS+SH
Sbjct: 2221 LCEQQLYLPLLRAFEMFLPSCSLLSFIRALQAFSQMRLAEASAHLGSFSVRVKDEASYSH 2280

Query: 2383 ANVEGEEHTGTSWTGSTAVKAANAVLSVCPSPYERKCLLKLLAATDFGDGGFAAAYYQRL 2442
            +NVEGEE+ GTSWTGSTA+KAA+AVLSVCPSPYER+CLLKLLAATDFGDGGFAA YY+RL
Sbjct: 2281 SNVEGEENIGTSWTGSTAIKAADAVLSVCPSPYERRCLLKLLAATDFGDGGFAATYYRRL 2340

Query: 2443 YWKINLAEPSIRIDDGLHLGNEALDDASLLTALENNGHWEQARNWAKQLEASGGSWKSAS 2502
            YWKINLAEPS+RIDDGLHLGNEALDDASLLTALENNGHWEQARNWAKQLEASGGSWKSAS
Sbjct: 2341 YWKINLAEPSLRIDDGLHLGNEALDDASLLTALENNGHWEQARNWAKQLEASGGSWKSAS 2400

Query: 2503 HHVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEK 2562
            HHVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEK
Sbjct: 2401 HHVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEK 2460

Query: 2563 DLPAKELHELLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKNERDLNI 2622
            DLPAKELHELLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELK+ER+LNI
Sbjct: 2461 DLPAKELHELLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKSERELNI 2520

Query: 2623 NNSSRECISRNSSSIIDWTASIISKMDKHISTMKNKSMDKHEVRENSQTHHKSHVLDAGL 2682
            + SSREC +RNSSSIID TAS+ISKMDKHISTMKNK+MDKHEVRENSQTHHK  VLDAGL
Sbjct: 2521 SGSSRECTTRNSSSIIDSTASMISKMDKHISTMKNKNMDKHEVRENSQTHHKIQVLDAGL 2580

Query: 2683 STAGGGNTKAKRRTKGSVLIRRPLVDSTDMNTNPEDGCVPSNFKNDLHLQDENLKMDTSL 2742
            STAGGGNTKAKRRTKGS+L+RR LVDSTD+NTNPEDG + SNFKNDL  QDEN KMDTS 
Sbjct: 2581 STAGGGNTKAKRRTKGSMLLRRSLVDSTDLNTNPEDGYISSNFKNDLQSQDENSKMDTSF 2640

Query: 2743 SGWEERIGPAEVDRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDAAFTLAAISTPN 2802
            SGWEER+GPAE DRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDA+F LAA+STPN
Sbjct: 2641 SGWEERVGPAEADRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDASFKLAALSTPN 2700

Query: 2803 REVSMSMLDEDLCSVILAYDIPVDQYLNPLQVLEILATIFAEGGGRGLCRRVIAVVKAAN 2862
            REVSMSMLD DL SVIL+YDIPVD+YLNPLQVLE LATIFAEG GRGLCRRVIAV KAAN
Sbjct: 2701 REVSMSMLDGDLSSVILSYDIPVDRYLNPLQVLETLATIFAEGSGRGLCRRVIAVAKAAN 2760

Query: 2863 VLGLPFSEAYNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLA 2922
            VLGL FSEAYNKQPIELLQLLSLKAQESFEEAN LVQTHSMPAASIAQILAESFLKGLLA
Sbjct: 2761 VLGLSFSEAYNKQPIELLQLLSLKAQESFEEANSLVQTHSMPAASIAQILAESFLKGLLA 2820

Query: 2923 AHRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVEL 2982
            AHRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPE+GHALMRLVITGQEIPHACEVEL
Sbjct: 2821 AHRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEVGHALMRLVITGQEIPHACEVEL 2880

Query: 2983 LILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGIL 3042
            LILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDF CLARLITGVGNFYALSFILGIL
Sbjct: 2881 LILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFACLARLITGVGNFYALSFILGIL 2940

Query: 3043 IENGQLELLLQKFSAAADTSAGSAEAVRGFRMAVLTSLKHFNPTDLDAFAKVYSHFDMKH 3102
            IENGQLELLLQKFSAA +TSAGSAEAVRGFR+AVLTSLKHFNP DLDAFAKVYSHFDMKH
Sbjct: 2941 IENGQLELLLQKFSAAVNTSAGSAEAVRGFRIAVLTSLKHFNPNDLDAFAKVYSHFDMKH 3000

Query: 3103 ETAALLESQAEQSCEMWFRRYYKDQNADLLDAMHYYIAAAEVHSSIDAGNKTRRSCAQAS 3162
            ETAALLESQAEQSCEMWFRRY KDQN DLLDAMHYYI AAEV+SSIDAGNKTRRSCAQAS
Sbjct: 3001 ETAALLESQAEQSCEMWFRRYDKDQNEDLLDAMHYYIKAAEVYSSIDAGNKTRRSCAQAS 3060

Query: 3163 LVSLQIRMPDFKWLFQSETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKP 3222
            LVSLQIRMPDFKWLFQ+ETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKP
Sbjct: 3061 LVSLQIRMPDFKWLFQTETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKP 3120

Query: 3223 EILEEFVAEFVSVLPLHPSMLADIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGR 3246
            EILEEFVAEFV+VLPLHPSMLADIARFYRSE+AARGDQSQFSVWLTGGGLPAEWAKYLGR
Sbjct: 3121 EILEEFVAEFVTVLPLHPSMLADIARFYRSEIAARGDQSQFSVWLTGGGLPAEWAKYLGR 3180

BLAST of Sgr026353 vs. NCBI nr
Match: XP_023518580.1 (uncharacterized protein LOC111782046 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 5508.0 bits (14287), Expect = 0.0e+00
Identity = 2796/3239 (86.32%), Positives = 2932/3239 (90.52%), Query Frame = 0

Query: 103  MDSVSGGGGPAILQLHKWNPSQPQLNLSEYREAFISPARQILLLHSYKHEALLLPLNTGD 162
            MDSVSG  GPAILQL  WNPSQPQLNLSEYREAFISP R ILLLHSYKHEALLLPL+TGD
Sbjct: 1    MDSVSGCEGPAILQLQNWNPSQPQLNLSEYREAFISPTRSILLLHSYKHEALLLPLDTGD 60

Query: 163  VRCGNDLPNGYDINLKDLGSLAFSEVVSTAPRSEDAEGNVRCSNKSAVDIDNDSPTGNKS 222
             RC +D PN YDI+LKDLGS AFSE  ST  R EDAEG+V+CSN+ AVD+D DSPT N+ 
Sbjct: 61   DRCSDDFPNKYDIDLKDLGSSAFSEEASTTSRQEDAEGDVQCSNRLAVDVDKDSPTKNRF 120

Query: 223  SRSSCNNFLGDVSSLAWGLCGDTYKKRKDSSFKEILFVSGNHGVTAHAFCQPNKTNEEAK 282
            SRSSCNNFLGDVSSLAWGLCGD+Y K +DSSFKEILFVSGNHGVTAHAF QP K   E K
Sbjct: 121  SRSSCNNFLGDVSSLAWGLCGDSYTKHEDSSFKEILFVSGNHGVTAHAFYQPKKVVVEGK 180

Query: 283  NMVQSEFWKGRWMEWGPYPTLVQNLEVQELSDSCVTSGNVDKNRINQNGEILRSSCYEFE 342
            NMVQSEFWKGRW+EWGP P L QNLE++E S  C TSGNVD+N  NQNGE+LRSSC EFE
Sbjct: 181  NMVQSEFWKGRWVEWGPSPRLPQNLEIEERSGFCETSGNVDENGTNQNGEMLRSSCSEFE 240

Query: 343  DDALLLGNSAPKRYLQSFLAKVKTIEYEDDIWTI-------------------------- 402
            +DALL GNSA KRYLQSFLAKVKT+E+ED+IWT+                          
Sbjct: 241  NDALLSGNSASKRYLQSFLAKVKTVEFEDNIWTMYPEKTSVPCFTKVVSFNIFNYNLPPP 300

Query: 403  ------------------------------------------------------------ 462
                                                                        
Sbjct: 301  NSDDSFVNEQSWHEIILGTHSNMSPTSFDTHFLSDILSNVLGIGMNKSYKCSRIFSSDSH 360

Query: 463  ---------VDSVSADRVDETGSRNDTLILVARVGNLGIKWVSSVKFEKSLYISPLMEWA 522
                     +D+VS +   ET SRN TLILVARVGNLGIKWVSSVKFEKSLYI+P+MEWA
Sbjct: 361  FLIGFVLKRMDTVSVEEGAETESRNGTLILVARVGNLGIKWVSSVKFEKSLYITPVMEWA 420

Query: 523  DFCFSNDFLLCLSDSGFIFVHSALSGKHVTCIDVLQACGLNPKYLHLKQDLQMNQVDQVQ 582
            DFCFSNDF++CLSDSGFIF+HSALSGKHV CIDVLQACGLNP+YLH KQDLQ N VDQVQ
Sbjct: 421  DFCFSNDFIVCLSDSGFIFLHSALSGKHVACIDVLQACGLNPQYLHEKQDLQRNLVDQVQ 480

Query: 583  DDVSCSRDSFYDRRKFRRLLSDSHSSHFAVIDAFGIMYVVSAVDHMLEHYHGSENLFPHP 642
            DD+S  R SF++RRKFRRLLSDSHSS FAVIDA G++YVVSA++HMLEH HG ENLFPH 
Sbjct: 481  DDLSYRR-SFHERRKFRRLLSDSHSSRFAVIDASGVIYVVSAIEHMLEHCHGYENLFPHS 540

Query: 643  HNFELGRAPVSWEVGGYDIGCQRNYSESLGSHSCRVFSMKNEGVSFWGNTRFDVLQNTQD 702
            H+FELGR+ VSWEVGGYDIGCQRNYSESLG+HSCR FS KNEG S WGNT+ +VLQN +D
Sbjct: 541  HDFELGRSLVSWEVGGYDIGCQRNYSESLGNHSCRDFSKKNEGASHWGNTKSNVLQNIKD 600

Query: 703  SKVCTGRKYKCSCLTASASILQNQKFQGGELQSCTMRKMFLSTWKTNEDDCFCFSPMGLT 762
            SKV  GR  KCSCLTASAS+L++QK +GGELQSCTMRKMFLSTWKTNEDDCF FSPMGLT
Sbjct: 601  SKVYRGRGDKCSCLTASASLLKDQKSEGGELQSCTMRKMFLSTWKTNEDDCFGFSPMGLT 660

Query: 763  QFIKRCNISGQKCSQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKEELVGEAVGCTSQGSL 822
            Q+IKRCN+SGQ  SQVVHFDLHLKSEVHDDSCLKSQMIFVDGRK+++VGEAVGCTSQGSL
Sbjct: 661  QYIKRCNMSGQNISQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKKDIVGEAVGCTSQGSL 720

Query: 823  YLVTNNGLSVVLPSVTIASDSLPSESVARLQPGVLLGTPNQVKGLELKESNCSWSPWQVE 882
            YLVTNNGLSVVLPSVTI S+SLPSE VAR QP ++LGT NQVK LELKES C WSPWQVE
Sbjct: 721  YLVTNNGLSVVLPSVTIPSNSLPSEYVARSQPDIILGTANQVKDLELKESKCPWSPWQVE 780

Query: 883  VLDRVLLYESIDEADRLCSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEEV 942
            VLDRVLLYESIDEADRLCSENGWDLKVVRMR FQM LHYLRFDELERSLEMLVDVDLEE 
Sbjct: 781  VLDRVLLYESIDEADRLCSENGWDLKVVRMRCFQMALHYLRFDELERSLEMLVDVDLEEE 840

Query: 943  GILRLLFAAVHLMFQKAGTDNDISAASRLLALGTRFATRMTHRYGMAEFKRNATMFNDFS 1002
            GILRLLFAAVHLMFQKAG DNDISAASRLLALGT FATRM HRYGMAEFKRNAT FNDFS
Sbjct: 841  GILRLLFAAVHLMFQKAGNDNDISAASRLLALGTHFATRMIHRYGMAEFKRNATTFNDFS 900

Query: 1003 SSQEISILPHFPFRKQNELEYSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEAPI 1062
            S QEISILPH PF+KQN  E+SRKLHEMSHFLEIIRNLH HLSSKFKRP QELV GE  +
Sbjct: 901  SGQEISILPHLPFQKQNVSEHSRKLHEMSHFLEIIRNLHGHLSSKFKRPSQELVVGE--V 960

Query: 1063 SDENNLLLDEPQLVSTDIIPLGSTSQYELSFPSNDLSSTVVDGLVMMPMVSESQLDSEDL 1122
            SD+ +LLLDEPQLVSTDIIPLGSTSQYELSFPSNDL+S VVDGL +MPMVS SQ +SEDL
Sbjct: 961  SDQTSLLLDEPQLVSTDIIPLGSTSQYELSFPSNDLNSNVVDGLAIMPMVSGSQFNSEDL 1020

Query: 1123 NGDSAVVPQGVLEKKVVPLENPKQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINH 1182
            + DSAVVPQGVLEKKVVPLENP QMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINH
Sbjct: 1021 DEDSAVVPQGVLEKKVVPLENPNQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINH 1080

Query: 1183 LRELIGEKEPHDTFSEIRDIGRAIAYDLFLKGEIGLAIATLQRLGDDIEVSLKQLLYGTI 1242
            LRELI E EPHDTFSEIRDIGRAIAYDLFLKGE GLAIATLQRLGDDIEVSLKQLLYGTI
Sbjct: 1081 LRELIEENEPHDTFSEIRDIGRAIAYDLFLKGETGLAIATLQRLGDDIEVSLKQLLYGTI 1140

Query: 1243 NRSFRVEIAAEMKKYGYLGPFDQRMMDRIVHIERLYPSSNFWKTFLSRQRANMGSPSSSA 1302
            NRSFRVEIAAEMKKYGYLGPFDQRMMDRI+HIERLYPSSNFWKTFLSRQ+ANMG PSSS 
Sbjct: 1141 NRSFRVEIAAEMKKYGYLGPFDQRMMDRILHIERLYPSSNFWKTFLSRQKANMGFPSSSN 1200

Query: 1303 TPGENDLRTLRFHLINNTIIDCGEVDGVVLGSWPNANENSFVLEITEDNAHVGYWAAAAI 1362
            +PGEN+LRTLRFHLINNT IDCGEVDGVVLGSWPNANE+S V+E TEDNAH+GYWAAAAI
Sbjct: 1201 SPGENELRTLRFHLINNTFIDCGEVDGVVLGSWPNANESSSVVETTEDNAHIGYWAAAAI 1260

Query: 1363 WTNTWDQRTTDRILLDRSLGIGIPVAWESQLDYHICHNNWDEVSRLLDMIPVSNLLDGSL 1422
            WTNTWDQRTTDRILLD+SLG GI VAWESQLDYHICHNNWD VSRLLDMIP +N+LDGSL
Sbjct: 1261 WTNTWDQRTTDRILLDQSLGNGIHVAWESQLDYHICHNNWDGVSRLLDMIPDANILDGSL 1320

Query: 1423 QVSLDGLQSASAVGCNRESSFYSNYLYPLEELDAVCLYIPKAKIFRFSANIMCSKWLGML 1482
            QVSLDGLQSASAVGCNRES+FYSNYLYPLEELDAVCLYIP  KIF+FSANIMCSK LGML
Sbjct: 1321 QVSLDGLQSASAVGCNRESTFYSNYLYPLEELDAVCLYIPNVKIFKFSANIMCSKLLGML 1380

Query: 1483 LEEKLARQFIFLKEYWEGTMELVPLLARSGFITNRLDEIASVDDHISSSVDQRSTNNGGA 1542
            LEEKLAR FIFLKEYWEG+MELVPLLARSGFI +RLDEIAS+DDHISSSVDQRS+N GGA
Sbjct: 1381 LEEKLARHFIFLKEYWEGSMELVPLLARSGFIIHRLDEIASMDDHISSSVDQRSSNKGGA 1440

Query: 1543 FYVDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLVVDNNSVRSLLEAAGDCQWARWLLL 1602
            + VDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLV+DNNSV SLLEAAGDC WARWLLL
Sbjct: 1441 YSVDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLVIDNNSVHSLLEAAGDCHWARWLLL 1500

Query: 1603 SRIRGCEYDASFSNARSIMSLNLVHDPNLGVRDIDEIICTVGDIAEGGGEMAALATLMYA 1662
            SRIRGCEYDASFSNARSIM LNLVHDPNL VR+I+EII TV DIAEGGGEMAALATLMYA
Sbjct: 1501 SRIRGCEYDASFSNARSIMPLNLVHDPNLSVRNIEEIISTVADIAEGGGEMAALATLMYA 1560

Query: 1663 PSPIQDCLSSSGVNRHSSSSAQCTLENLRPALQRFPTLCRALVTSAFQQDTTCNFLGPKL 1722
            PSPIQDCL+SSGVNRHSSSSAQCTLENLRP LQRFPTLCRALVTSAFQQDTTCNFLGPK 
Sbjct: 1561 PSPIQDCLNSSGVNRHSSSSAQCTLENLRPVLQRFPTLCRALVTSAFQQDTTCNFLGPKW 1620

Query: 1723 KNALSEYLHWRSSIFFSAGRDTSLLHMLPCWFPKAVRRLLQLYVQGPLGWQSLSALPTGQ 1782
            KNALSEYLHWR+S  FSAGRDTSLLHMLPCWFPKAVRRLL LYVQGPLGWQS+S LPTGQ
Sbjct: 1621 KNALSEYLHWRNSTIFSAGRDTSLLHMLPCWFPKAVRRLLHLYVQGPLGWQSISGLPTGQ 1680

Query: 1783 TLWERDVHFFMNDYEHSEISPISWEATIQKHIEDELYDSSLKETGVGLEHNLHRGRAFSA 1842
             LWERDV+FFMND EHSEISPISWEA IQKHIEDELYDSSLKETG+GLEHNLHRGRA SA
Sbjct: 1681 ALWERDVYFFMNDDEHSEISPISWEAAIQKHIEDELYDSSLKETGLGLEHNLHRGRALSA 1740

Query: 1843 FNHLLAARVQKLKSEIQPGSATGPSNTQFDLQALFAPLTLREQSLLSSIIPLAITHFENS 1902
            FNHLL ARVQKLKSEIQ GSA G SN Q DLQ LFAPLT  EQSLLSS+IPLAITHFENS
Sbjct: 1741 FNHLLVARVQKLKSEIQSGSAIGQSNIQLDLQTLFAPLTPMEQSLLSSVIPLAITHFENS 1800

Query: 1903 VLVASCAFLLELCGLSASMLRVDVAALRRISTFNKSGQSFENFRQLSPKGSAFHPVPLES 1962
            VLVASCAFLLEL GLSASMLRVDVAALRRISTF KSGQSFENFRQLSPKGSAFHPVPLES
Sbjct: 1801 VLVASCAFLLELGGLSASMLRVDVAALRRISTFYKSGQSFENFRQLSPKGSAFHPVPLES 1860

Query: 1963 DKVETLARALADEYLHQESSSVNKPKGTSNSAPSKRCPQV-LFVLQHLEEVSLPQVVDGN 2022
            DK+ETLARALADEYLHQESSSVN+PKGTS+ AP KRC QV LFVLQHLEEVSLP +VDGN
Sbjct: 1861 DKIETLARALADEYLHQESSSVNEPKGTSDPAPPKRCSQVLLFVLQHLEEVSLPHMVDGN 1920

Query: 2023 SCGSWLLSGKGDGTELRNQQKAASHYWNLVTVFCRMHRLPPSSKYLALLARDNDWVGFLT 2082
            SCGSWLL GKGDGTELRNQQKAASH+WNLV VFCRMHR+P SSKYLALLARDNDWVGFLT
Sbjct: 1921 SCGSWLLCGKGDGTELRNQQKAASHHWNLVRVFCRMHRIPLSSKYLALLARDNDWVGFLT 1980

Query: 2083 EAHVGGYPFDTVIQVASKEFSDPRLKIHILTVLKAVQSRKNPGPSSYSDTEDKKSQTSFL 2142
            EAHVGGYPFDTVIQVASKEFSDPRLKIHILTVLK+VQSRK+PGPSSYSDTE+KK QT+ L
Sbjct: 1981 EAHVGGYPFDTVIQVASKEFSDPRLKIHILTVLKSVQSRKSPGPSSYSDTEEKKGQTTIL 2040

Query: 2143 DGSTYIPVELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLE 2202
            DGS YIPVELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLE
Sbjct: 2041 DGSMYIPVELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLE 2100

Query: 2203 ITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRSPAFHYCRKNPKRRRTMDSISK 2262
            ITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRS AFHYCRKNPKRRRTMDS+S 
Sbjct: 2101 ITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRSSAFHYCRKNPKRRRTMDSVSD 2160

Query: 2263 DPSVGVISDTFSASTGASTNVSGGFIVKEEGKIVQERRPISVSYDSDEAPSSLSKMVSVL 2322
            DPSV  ISD FSAS  ASTNV G  IVKEEGKIVQE + ISVSYDSDEAPSSLSKMVSVL
Sbjct: 2161 DPSVIAISDNFSASR-ASTNVPGDSIVKEEGKIVQEPQRISVSYDSDEAPSSLSKMVSVL 2220

Query: 2323 CEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSVRVKDEASFSHA 2382
            CEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQM L+EASAHLGSFS RVKDEA +SHA
Sbjct: 2221 CEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMCLAEASAHLGSFSARVKDEAIYSHA 2280

Query: 2383 NVEGEEHTGTSWTGSTAVKAANAVLSVCPSPYERKCLLKLLAATDFGDGGFAAAYYQRLY 2442
            NVEGEE+TGTSWTGSTAVKAANAVLSVCPSPYER+CLLKLLAATDFGDGGF+A+YY+RLY
Sbjct: 2281 NVEGEENTGTSWTGSTAVKAANAVLSVCPSPYERRCLLKLLAATDFGDGGFSASYYRRLY 2340

Query: 2443 WKINLAEPSIRIDDGLHLGNEALDDASLLTALENNGHWEQARNWAKQLEASGGSWKSASH 2502
            +KINLAEP +RIDDGLHLGNEALDDASLL+ALENN HWEQARNWAKQLEASGGSWKSASH
Sbjct: 2341 YKINLAEPLLRIDDGLHLGNEALDDASLLSALENNRHWEQARNWAKQLEASGGSWKSASH 2400

Query: 2503 HVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKD 2562
            HVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKD
Sbjct: 2401 HVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKD 2460

Query: 2563 LPAKELHELLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKNERDLNIN 2622
            LPAKELHELLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKNERDLNI+
Sbjct: 2461 LPAKELHELLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKNERDLNIS 2520

Query: 2623 NSSRECISRNSSSIIDWTASIISKMDKHISTMKNKSMDKHEVRENSQTHHKSHVLDAGLS 2682
             S REC SRNSSSIID TAS+ISKMDKHISTM +K+MDKHEVRENSQTHHKS VLDAGLS
Sbjct: 2521 GSIRECKSRNSSSIIDLTASMISKMDKHISTMTSKNMDKHEVRENSQTHHKSQVLDAGLS 2580

Query: 2683 TAGGGNTKAKRRTKGSVLIRRPLVDSTDMNTNPEDGCVPSNFKNDLHLQDENLKMDTSLS 2742
            TAGGGNTKAKRRTKGS+L+RRPL DS DMNTN EDG + SNFKNDLH+QDENLKMDTS S
Sbjct: 2581 TAGGGNTKAKRRTKGSMLLRRPLADSADMNTNSEDGYISSNFKNDLHMQDENLKMDTSFS 2640

Query: 2743 GWEERIGPAEVDRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDAAFTLAAISTPNR 2802
            GWEERIGPAEV+RA+LSLLEFGQITAAKQLQQKLSP QVPSEFLLVDA+F LAAIST NR
Sbjct: 2641 GWEERIGPAEVERAILSLLEFGQITAAKQLQQKLSPEQVPSEFLLVDASFKLAAISTSNR 2700

Query: 2803 EVSMSMLDEDLCSVILAYDIPVDQYLNPLQVLEILATIFAEGGGRGLCRRVIAVVKAANV 2862
            EV MSMLD DLCSVIL+  I VDQYLNPLQVLE LAT+FAEGGGRGLCRRVIAVVKAANV
Sbjct: 2701 EVPMSMLDGDLCSVILSSGIQVDQYLNPLQVLETLATVFAEGGGRGLCRRVIAVVKAANV 2760

Query: 2863 LGLPFSEAYNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAA 2922
            LGLPFSEAYNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAA
Sbjct: 2761 LGLPFSEAYNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAA 2820

Query: 2923 HRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELL 2982
            HRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELL
Sbjct: 2821 HRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELL 2880

Query: 2983 ILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILI 3042
            ILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILI
Sbjct: 2881 ILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILI 2940

Query: 3043 ENGQLELLLQKFSAAADTSAGSAEAVRGFRMAVLTSLKHFNPTDLDAFAKVYSHFDMKHE 3102
            EN QLE LLQKFSAA  TS GSAEAVRGFR+AVLTSLKH  P DLDAFAKVYSHFDMKHE
Sbjct: 2941 ENNQLEFLLQKFSAAISTSTGSAEAVRGFRIAVLTSLKHLIPNDLDAFAKVYSHFDMKHE 3000

Query: 3103 TAALLESQAEQSCEMWFRRYYKDQNADLLDAMHYYIAAAEVHSSIDAGNKTRRSCAQASL 3162
            TAALLE+QAEQSCEMWFRRY KDQN DLLDAM YYI AAEV+SSIDAGNKTRRSCAQASL
Sbjct: 3001 TAALLETQAEQSCEMWFRRYDKDQNEDLLDAMLYYIKAAEVYSSIDAGNKTRRSCAQASL 3060

Query: 3163 VSLQIRMPDFKWLFQSETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKPE 3222
            VSLQIRMPDFKWLFQ+ETNARRALV+QSRFQEALIVAEAYDLDQPSEWALVIWNQMLKP+
Sbjct: 3061 VSLQIRMPDFKWLFQTETNARRALVDQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKPD 3120

Query: 3223 ILEEFVAEFVSVLPLHPSMLADIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRS 3246
            ILE+FVAEFV+VLPLHPSML DIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRS
Sbjct: 3121 ILEDFVAEFVTVLPLHPSMLGDIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRS 3180

BLAST of Sgr026353 vs. NCBI nr
Match: XP_023518582.1 (uncharacterized protein LOC111782046 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 5502.2 bits (14272), Expect = 0.0e+00
Identity = 2795/3239 (86.29%), Positives = 2931/3239 (90.49%), Query Frame = 0

Query: 103  MDSVSGGGGPAILQLHKWNPSQPQLNLSEYREAFISPARQILLLHSYKHEALLLPLNTGD 162
            MDSVSG  GPAILQL  WNPSQPQLNLSEYREAFISP R ILLLHSYKHEALLLPL+TGD
Sbjct: 1    MDSVSGCEGPAILQLQNWNPSQPQLNLSEYREAFISPTRSILLLHSYKHEALLLPLDTGD 60

Query: 163  VRCGNDLPNGYDINLKDLGSLAFSEVVSTAPRSEDAEGNVRCSNKSAVDIDNDSPTGNKS 222
             RC +D PN YDI+LKDLGS AFSE  ST  R EDAEG+V+CSN+ AVD+D DSPT N+ 
Sbjct: 61   DRCSDDFPNKYDIDLKDLGSSAFSEEASTTSRQEDAEGDVQCSNRLAVDVDKDSPTKNRF 120

Query: 223  SRSSCNNFLGDVSSLAWGLCGDTYKKRKDSSFKEILFVSGNHGVTAHAFCQPNKTNEEAK 282
            SRSSCNNFLGDVSSLAWGLCGD+Y K +DSSFKEILFVSGNHGVTAHAF QP K   E K
Sbjct: 121  SRSSCNNFLGDVSSLAWGLCGDSYTKHEDSSFKEILFVSGNHGVTAHAFYQPKKVVVEGK 180

Query: 283  NMVQSEFWKGRWMEWGPYPTLVQNLEVQELSDSCVTSGNVDKNRINQNGEILRSSCYEFE 342
            NMVQSEFWKGRW+EWGP P L QNLE++E S  C TSGNVD+N  NQNGE+LRSSC EFE
Sbjct: 181  NMVQSEFWKGRWVEWGPSPRLPQNLEIEERSGFCETSGNVDENGTNQNGEMLRSSCSEFE 240

Query: 343  DDALLLGNSAPKRYLQSFLAKVKTIEYEDDIWTI-------------------------- 402
            +DALL GNSA KRYLQSFLAKVKT+E+ED+IWT+                          
Sbjct: 241  NDALLSGNSASKRYLQSFLAKVKTVEFEDNIWTMYPEKTSVPCFTKVVSFNIFNYNLPPP 300

Query: 403  ------------------------------------------------------------ 462
                                                                        
Sbjct: 301  NSDDSFVNEQSWHEIILGTHSNMSPTSFDTHFLSDILSNVLGIGMNKSYKCSRIFSSDSH 360

Query: 463  ---------VDSVSADRVDETGSRNDTLILVARVGNLGIKWVSSVKFEKSLYISPLMEWA 522
                     +D+VS +   ET SRN TLILVARVGNLGIKWVSSVKFEKSLYI+P+MEWA
Sbjct: 361  FLIGFVLKRMDTVSVEEGAETESRNGTLILVARVGNLGIKWVSSVKFEKSLYITPVMEWA 420

Query: 523  DFCFSNDFLLCLSDSGFIFVHSALSGKHVTCIDVLQACGLNPKYLHLKQDLQMNQVDQVQ 582
            DFCFSNDF++CLSDSGFIF+HSALSGKHV CIDVLQACGLNP+YLH KQDLQ N VDQVQ
Sbjct: 421  DFCFSNDFIVCLSDSGFIFLHSALSGKHVACIDVLQACGLNPQYLHEKQDLQRNLVDQVQ 480

Query: 583  DDVSCSRDSFYDRRKFRRLLSDSHSSHFAVIDAFGIMYVVSAVDHMLEHYHGSENLFPHP 642
            DD+S  R SF++RRKFRRLLSDSHSS FAVIDA G++YVVSA++HMLEH HG ENLFPH 
Sbjct: 481  DDLSYRR-SFHERRKFRRLLSDSHSSRFAVIDASGVIYVVSAIEHMLEHCHGYENLFPHS 540

Query: 643  HNFELGRAPVSWEVGGYDIGCQRNYSESLGSHSCRVFSMKNEGVSFWGNTRFDVLQNTQD 702
            H+FELGR+ VSWEVGGYDIGCQRNYSESLG+HSCR FS KNEG S WGNT+ +VLQN +D
Sbjct: 541  HDFELGRSLVSWEVGGYDIGCQRNYSESLGNHSCRDFSKKNEGASHWGNTKSNVLQNIKD 600

Query: 703  SKVCTGRKYKCSCLTASASILQNQKFQGGELQSCTMRKMFLSTWKTNEDDCFCFSPMGLT 762
            SKV  GR  KCSCLTASAS+L++QK +GGELQSCTMRKMFLSTWKTNEDDCF FSPMGLT
Sbjct: 601  SKVYRGRGDKCSCLTASASLLKDQKSEGGELQSCTMRKMFLSTWKTNEDDCFGFSPMGLT 660

Query: 763  QFIKRCNISGQKCSQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKEELVGEAVGCTSQGSL 822
            Q+IKRCN+SGQ  SQVVHFDLHLKSEVHDDSCLKSQMIFVDGRK+++VGEAVGCTSQGSL
Sbjct: 661  QYIKRCNMSGQNISQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKKDIVGEAVGCTSQGSL 720

Query: 823  YLVTNNGLSVVLPSVTIASDSLPSESVARLQPGVLLGTPNQVKGLELKESNCSWSPWQVE 882
            YLVTNNGLSVVLPSVTI S+SLPSE VAR QP ++LGT NQVK LELKES C WSPWQVE
Sbjct: 721  YLVTNNGLSVVLPSVTIPSNSLPSEYVARSQPDIILGTANQVKDLELKESKCPWSPWQVE 780

Query: 883  VLDRVLLYESIDEADRLCSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEEV 942
            VLDRVLLYESIDEADRLCSENGWDLKVVRMR FQM LHYLRFDELERSLEMLVDVDLEE 
Sbjct: 781  VLDRVLLYESIDEADRLCSENGWDLKVVRMRCFQMALHYLRFDELERSLEMLVDVDLEEE 840

Query: 943  GILRLLFAAVHLMFQKAGTDNDISAASRLLALGTRFATRMTHRYGMAEFKRNATMFNDFS 1002
            GILRLLFAAVHLMFQKAG DNDISAASRLLALGT FATRM HRYGMAEFKRNAT FNDFS
Sbjct: 841  GILRLLFAAVHLMFQKAGNDNDISAASRLLALGTHFATRMIHRYGMAEFKRNATTFNDFS 900

Query: 1003 SSQEISILPHFPFRKQNELEYSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEAPI 1062
            S QEISILPH PF+KQN  E+SRKLHEMSHFLEIIRNLH HLSSKFKRP QELV GE  +
Sbjct: 901  SGQEISILPHLPFQKQNVSEHSRKLHEMSHFLEIIRNLHGHLSSKFKRPSQELVVGE--V 960

Query: 1063 SDENNLLLDEPQLVSTDIIPLGSTSQYELSFPSNDLSSTVVDGLVMMPMVSESQLDSEDL 1122
            SD+ +LLLDEPQLVSTDIIPLGSTSQYELSFPSNDL+S VVDGL +MPMVS SQ +SEDL
Sbjct: 961  SDQTSLLLDEPQLVSTDIIPLGSTSQYELSFPSNDLNSNVVDGLAIMPMVSGSQFNSEDL 1020

Query: 1123 NGDSAVVPQGVLEKKVVPLENPKQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINH 1182
            + DSAVVPQGVLEKKVVPLENP QMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINH
Sbjct: 1021 DEDSAVVPQGVLEKKVVPLENPNQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINH 1080

Query: 1183 LRELIGEKEPHDTFSEIRDIGRAIAYDLFLKGEIGLAIATLQRLGDDIEVSLKQLLYGTI 1242
            LRELI E EPHDTFSEIRDIGRAIAYDLFLKGE GLAIATLQRLGDDIEVSLKQLLYGTI
Sbjct: 1081 LRELIEENEPHDTFSEIRDIGRAIAYDLFLKGETGLAIATLQRLGDDIEVSLKQLLYGTI 1140

Query: 1243 NRSFRVEIAAEMKKYGYLGPFDQRMMDRIVHIERLYPSSNFWKTFLSRQRANMGSPSSSA 1302
            NRSFRVEIAAEMKKYGYLGPFDQRMMDRI+HIERLYPSSNFWKTFLSRQ+ANMG PSSS 
Sbjct: 1141 NRSFRVEIAAEMKKYGYLGPFDQRMMDRILHIERLYPSSNFWKTFLSRQKANMGFPSSSN 1200

Query: 1303 TPGENDLRTLRFHLINNTIIDCGEVDGVVLGSWPNANENSFVLEITEDNAHVGYWAAAAI 1362
            +PGEN+LRTLRFHLINNT IDCGEVDGVVLGSWPNANE+S V+E TEDNAH+GYWAAAAI
Sbjct: 1201 SPGENELRTLRFHLINNTFIDCGEVDGVVLGSWPNANESSSVVETTEDNAHIGYWAAAAI 1260

Query: 1363 WTNTWDQRTTDRILLDRSLGIGIPVAWESQLDYHICHNNWDEVSRLLDMIPVSNLLDGSL 1422
            WTNTWDQRTTDRILLD+SLG GI VAWESQLDYHICHNNWD VSRLLDMIP +N+LDGSL
Sbjct: 1261 WTNTWDQRTTDRILLDQSLGNGIHVAWESQLDYHICHNNWDGVSRLLDMIPDANILDGSL 1320

Query: 1423 QVSLDGLQSASAVGCNRESSFYSNYLYPLEELDAVCLYIPKAKIFRFSANIMCSKWLGML 1482
            QVSLDGLQSASAVGCNRES+FYSNYLYPLEELDAVCLYIP  KIF+FSANIMCSK LGML
Sbjct: 1321 QVSLDGLQSASAVGCNRESTFYSNYLYPLEELDAVCLYIPNVKIFKFSANIMCSKLLGML 1380

Query: 1483 LEEKLARQFIFLKEYWEGTMELVPLLARSGFITNRLDEIASVDDHISSSVDQRSTNNGGA 1542
            LEEKLAR FIFLKEYWEG+MELVPLLARSGFI +RLDEIAS+DDHISSSVDQRS+N GGA
Sbjct: 1381 LEEKLARHFIFLKEYWEGSMELVPLLARSGFIIHRLDEIASMDDHISSSVDQRSSNKGGA 1440

Query: 1543 FYVDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLVVDNNSVRSLLEAAGDCQWARWLLL 1602
            + VDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLV+DNNSV SLLEAAGDC WARWLLL
Sbjct: 1441 YSVDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLVIDNNSVHSLLEAAGDCHWARWLLL 1500

Query: 1603 SRIRGCEYDASFSNARSIMSLNLVHDPNLGVRDIDEIICTVGDIAEGGGEMAALATLMYA 1662
            SRIRGCEYDASFSNARSIM LNLVHDPNL VR+I+EII TV DIAEGGGEMAALATLMYA
Sbjct: 1501 SRIRGCEYDASFSNARSIMPLNLVHDPNLSVRNIEEIISTVADIAEGGGEMAALATLMYA 1560

Query: 1663 PSPIQDCLSSSGVNRHSSSSAQCTLENLRPALQRFPTLCRALVTSAFQQDTTCNFLGPKL 1722
            PSPIQDCL+SSGVNRHSSSSAQCTLENLRP LQRFPTLCRALVTSAFQQDTTCNFLGPK 
Sbjct: 1561 PSPIQDCLNSSGVNRHSSSSAQCTLENLRPVLQRFPTLCRALVTSAFQQDTTCNFLGPKW 1620

Query: 1723 KNALSEYLHWRSSIFFSAGRDTSLLHMLPCWFPKAVRRLLQLYVQGPLGWQSLSALPTGQ 1782
            KNALSEYLHWR+S  FSAGRDTSLLHMLPCWFPKAVRRLL LYVQGPLGWQS+S LPTGQ
Sbjct: 1621 KNALSEYLHWRNSTIFSAGRDTSLLHMLPCWFPKAVRRLLHLYVQGPLGWQSISGLPTGQ 1680

Query: 1783 TLWERDVHFFMNDYEHSEISPISWEATIQKHIEDELYDSSLKETGVGLEHNLHRGRAFSA 1842
             LWERDV+FFMND EHSEISPISWEA IQKHIEDELYDSSLKETG+GLEHNLHRGRA SA
Sbjct: 1681 ALWERDVYFFMNDDEHSEISPISWEAAIQKHIEDELYDSSLKETGLGLEHNLHRGRALSA 1740

Query: 1843 FNHLLAARVQKLKSEIQPGSATGPSNTQFDLQALFAPLTLREQSLLSSIIPLAITHFENS 1902
            FNHLL ARVQKLKSEIQ GSA G SN Q DLQ LFAPLT  EQSLLSS+IPLAITHFENS
Sbjct: 1741 FNHLLVARVQKLKSEIQSGSAIGQSNIQLDLQTLFAPLTPMEQSLLSSVIPLAITHFENS 1800

Query: 1903 VLVASCAFLLELCGLSASMLRVDVAALRRISTFNKSGQSFENFRQLSPKGSAFHPVPLES 1962
            VLVASCAFLLEL GLSASMLRVDVAALRRISTF KSGQSFENFRQLSPKGSAFHPVPLES
Sbjct: 1801 VLVASCAFLLELGGLSASMLRVDVAALRRISTFYKSGQSFENFRQLSPKGSAFHPVPLES 1860

Query: 1963 DKVETLARALADEYLHQESSSVNKPKGTSNSAPSKRCPQV-LFVLQHLEEVSLPQVVDGN 2022
            DK+ETLARALADEYLHQESSSVN+PKGTS+ AP KRC QV LFVLQHLEEVSLP +VDGN
Sbjct: 1861 DKIETLARALADEYLHQESSSVNEPKGTSDPAPPKRCSQVLLFVLQHLEEVSLPHMVDGN 1920

Query: 2023 SCGSWLLSGKGDGTELRNQQKAASHYWNLVTVFCRMHRLPPSSKYLALLARDNDWVGFLT 2082
            SCGSWLL GKGDGTELRNQQKAASH+WNLV VFCRMHR+P SSKYLALLARDNDWVGFLT
Sbjct: 1921 SCGSWLLCGKGDGTELRNQQKAASHHWNLVRVFCRMHRIPLSSKYLALLARDNDWVGFLT 1980

Query: 2083 EAHVGGYPFDTVIQVASKEFSDPRLKIHILTVLKAVQSRKNPGPSSYSDTEDKKSQTSFL 2142
            EAHVGGYPFDTVIQ ASKEFSDPRLKIHILTVLK+VQSRK+PGPSSYSDTE+KK QT+ L
Sbjct: 1981 EAHVGGYPFDTVIQ-ASKEFSDPRLKIHILTVLKSVQSRKSPGPSSYSDTEEKKGQTTIL 2040

Query: 2143 DGSTYIPVELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLE 2202
            DGS YIPVELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLE
Sbjct: 2041 DGSMYIPVELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLE 2100

Query: 2203 ITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRSPAFHYCRKNPKRRRTMDSISK 2262
            ITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRS AFHYCRKNPKRRRTMDS+S 
Sbjct: 2101 ITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRSSAFHYCRKNPKRRRTMDSVSD 2160

Query: 2263 DPSVGVISDTFSASTGASTNVSGGFIVKEEGKIVQERRPISVSYDSDEAPSSLSKMVSVL 2322
            DPSV  ISD FSAS  ASTNV G  IVKEEGKIVQE + ISVSYDSDEAPSSLSKMVSVL
Sbjct: 2161 DPSVIAISDNFSASR-ASTNVPGDSIVKEEGKIVQEPQRISVSYDSDEAPSSLSKMVSVL 2220

Query: 2323 CEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSVRVKDEASFSHA 2382
            CEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQM L+EASAHLGSFS RVKDEA +SHA
Sbjct: 2221 CEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMCLAEASAHLGSFSARVKDEAIYSHA 2280

Query: 2383 NVEGEEHTGTSWTGSTAVKAANAVLSVCPSPYERKCLLKLLAATDFGDGGFAAAYYQRLY 2442
            NVEGEE+TGTSWTGSTAVKAANAVLSVCPSPYER+CLLKLLAATDFGDGGF+A+YY+RLY
Sbjct: 2281 NVEGEENTGTSWTGSTAVKAANAVLSVCPSPYERRCLLKLLAATDFGDGGFSASYYRRLY 2340

Query: 2443 WKINLAEPSIRIDDGLHLGNEALDDASLLTALENNGHWEQARNWAKQLEASGGSWKSASH 2502
            +KINLAEP +RIDDGLHLGNEALDDASLL+ALENN HWEQARNWAKQLEASGGSWKSASH
Sbjct: 2341 YKINLAEPLLRIDDGLHLGNEALDDASLLSALENNRHWEQARNWAKQLEASGGSWKSASH 2400

Query: 2503 HVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKD 2562
            HVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKD
Sbjct: 2401 HVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKD 2460

Query: 2563 LPAKELHELLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKNERDLNIN 2622
            LPAKELHELLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKNERDLNI+
Sbjct: 2461 LPAKELHELLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKNERDLNIS 2520

Query: 2623 NSSRECISRNSSSIIDWTASIISKMDKHISTMKNKSMDKHEVRENSQTHHKSHVLDAGLS 2682
             S REC SRNSSSIID TAS+ISKMDKHISTM +K+MDKHEVRENSQTHHKS VLDAGLS
Sbjct: 2521 GSIRECKSRNSSSIIDLTASMISKMDKHISTMTSKNMDKHEVRENSQTHHKSQVLDAGLS 2580

Query: 2683 TAGGGNTKAKRRTKGSVLIRRPLVDSTDMNTNPEDGCVPSNFKNDLHLQDENLKMDTSLS 2742
            TAGGGNTKAKRRTKGS+L+RRPL DS DMNTN EDG + SNFKNDLH+QDENLKMDTS S
Sbjct: 2581 TAGGGNTKAKRRTKGSMLLRRPLADSADMNTNSEDGYISSNFKNDLHMQDENLKMDTSFS 2640

Query: 2743 GWEERIGPAEVDRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDAAFTLAAISTPNR 2802
            GWEERIGPAEV+RA+LSLLEFGQITAAKQLQQKLSP QVPSEFLLVDA+F LAAIST NR
Sbjct: 2641 GWEERIGPAEVERAILSLLEFGQITAAKQLQQKLSPEQVPSEFLLVDASFKLAAISTSNR 2700

Query: 2803 EVSMSMLDEDLCSVILAYDIPVDQYLNPLQVLEILATIFAEGGGRGLCRRVIAVVKAANV 2862
            EV MSMLD DLCSVIL+  I VDQYLNPLQVLE LAT+FAEGGGRGLCRRVIAVVKAANV
Sbjct: 2701 EVPMSMLDGDLCSVILSSGIQVDQYLNPLQVLETLATVFAEGGGRGLCRRVIAVVKAANV 2760

Query: 2863 LGLPFSEAYNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAA 2922
            LGLPFSEAYNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAA
Sbjct: 2761 LGLPFSEAYNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAA 2820

Query: 2923 HRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELL 2982
            HRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELL
Sbjct: 2821 HRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELL 2880

Query: 2983 ILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILI 3042
            ILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILI
Sbjct: 2881 ILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILI 2940

Query: 3043 ENGQLELLLQKFSAAADTSAGSAEAVRGFRMAVLTSLKHFNPTDLDAFAKVYSHFDMKHE 3102
            EN QLE LLQKFSAA  TS GSAEAVRGFR+AVLTSLKH  P DLDAFAKVYSHFDMKHE
Sbjct: 2941 ENNQLEFLLQKFSAAISTSTGSAEAVRGFRIAVLTSLKHLIPNDLDAFAKVYSHFDMKHE 3000

Query: 3103 TAALLESQAEQSCEMWFRRYYKDQNADLLDAMHYYIAAAEVHSSIDAGNKTRRSCAQASL 3162
            TAALLE+QAEQSCEMWFRRY KDQN DLLDAM YYI AAEV+SSIDAGNKTRRSCAQASL
Sbjct: 3001 TAALLETQAEQSCEMWFRRYDKDQNEDLLDAMLYYIKAAEVYSSIDAGNKTRRSCAQASL 3060

Query: 3163 VSLQIRMPDFKWLFQSETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKPE 3222
            VSLQIRMPDFKWLFQ+ETNARRALV+QSRFQEALIVAEAYDLDQPSEWALVIWNQMLKP+
Sbjct: 3061 VSLQIRMPDFKWLFQTETNARRALVDQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKPD 3120

Query: 3223 ILEEFVAEFVSVLPLHPSMLADIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRS 3246
            ILE+FVAEFV+VLPLHPSML DIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRS
Sbjct: 3121 ILEDFVAEFVTVLPLHPSMLGDIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRS 3180

BLAST of Sgr026353 vs. NCBI nr
Match: XP_023004088.1 (uncharacterized protein LOC111497504 isoform X1 [Cucurbita maxima])

HSP 1 Score: 5459.8 bits (14162), Expect = 0.0e+00
Identity = 2774/3239 (85.64%), Positives = 2915/3239 (90.00%), Query Frame = 0

Query: 103  MDSVSGGGGPAILQLHKWNPSQPQLNLSEYREAFISPARQILLLHSYKHEALLLPLNTGD 162
            MDSVSG  GPAILQL  WNPSQPQLNLSEYREAFISP R ILLLHSYKHEALLLPL+TGD
Sbjct: 1    MDSVSGCEGPAILQLQNWNPSQPQLNLSEYREAFISPTRSILLLHSYKHEALLLPLDTGD 60

Query: 163  VRCGNDLPNGYDINLKDLGSLAFSEVVSTAPRSEDAEGNVRCSNKSAVDIDNDSPTGNKS 222
             RC +D PN YDI+LKDLGS AFSE  ST    ED EG+V+CSN+ AVD+D DSPT N+ 
Sbjct: 61   NRCSDDFPNKYDIDLKDLGSSAFSEEASTTCWREDDEGDVQCSNRLAVDVDKDSPTKNRF 120

Query: 223  SRSSCNNFLGDVSSLAWGLCGDTYKKRKDSSFKEILFVSGNHGVTAHAFCQPNKTNEEAK 282
            SRSSCNNFLGDVSSLAWGLCG++Y K +DSSFKEILFVSGNHGVTAHAFCQP K   E K
Sbjct: 121  SRSSCNNFLGDVSSLAWGLCGESYTKHEDSSFKEILFVSGNHGVTAHAFCQPKKVVVEGK 180

Query: 283  NMVQSEFWKGRWMEWGPYPTLVQNLEVQELSDSCVTSGNVDKNRINQNGEILRSSCYEFE 342
            NMVQSEFWKGRW+EWGPYP L QNLE++E S  C TSGNVD+N  NQNGE+LRSSC EFE
Sbjct: 181  NMVQSEFWKGRWVEWGPYPRLPQNLEIEERSGFCETSGNVDENGTNQNGEMLRSSCSEFE 240

Query: 343  DDALLLGNSAPKRYLQSFLAKVKTIEYEDDIWTI-------------------------- 402
            +DALL G+SA KRYLQSFLAKVKT+E+ED+IWT+                          
Sbjct: 241  NDALLSGDSASKRYLQSFLAKVKTVEFEDNIWTMYPEKTSVPCFTKVVSFNIFNYNLPPP 300

Query: 403  ------------------------------------------------------------ 462
                                                                        
Sbjct: 301  NSDDSFVNEQSWHEIILGTRSNMSPTSFDTHFLSDILSNVLGIGMNKSYKCSRIFSSDSH 360

Query: 463  ---------VDSVSADRVDETGSRNDTLILVARVGNLGIKWVSSVKFEKSLYISPLMEWA 522
                     +D+VS +   ET SRN TLILVARVGNLGIKWVSSVKFEKSLYI+P+MEWA
Sbjct: 361  FLIGFVLKRMDTVSVEEGAETESRNGTLILVARVGNLGIKWVSSVKFEKSLYITPVMEWA 420

Query: 523  DFCFSNDFLLCLSDSGFIFVHSALSGKHVTCIDVLQACGLNPKYLHLKQDLQMNQVDQVQ 582
            DFCFSNDF++CLSDSGFIF+HSALSGKHV CIDVLQACGLN +YLH KQDLQ N VDQVQ
Sbjct: 421  DFCFSNDFIVCLSDSGFIFLHSALSGKHVACIDVLQACGLNSQYLHEKQDLQRNIVDQVQ 480

Query: 583  DDVSCSRDSFYDRRKFRRLLSDSHSSHFAVIDAFGIMYVVSAVDHMLEHYHGSENLFPHP 642
            DD+S  R SF++RRKFRRLLSDSHSSHFAVIDA G++YVVSA++HMLEH HG ENLFPH 
Sbjct: 481  DDLSYRR-SFHERRKFRRLLSDSHSSHFAVIDASGVIYVVSAIEHMLEHCHGYENLFPHS 540

Query: 643  HNFELGRAPVSWEVGGYDIGCQRNYSESLGSHSCRVFSMKNEGVSFWGNTRFDVLQNTQD 702
            H+F+LGR+ VSWEVGGYDIGCQRNYSESLG+HSCR FS +NEG S WGNT+ +VLQN +D
Sbjct: 541  HDFKLGRSLVSWEVGGYDIGCQRNYSESLGNHSCRDFSKQNEGASHWGNTKSNVLQNIKD 600

Query: 703  SKVCTGRKYKCSCLTASASILQNQKFQGGELQSCTMRKMFLSTWKTNEDDCFCFSPMGLT 762
            SKV  GR  KCSCLTASAS L++QK  GGELQSC MRKMFLSTWKTNEDDCF FSPMG+T
Sbjct: 601  SKVYRGRGDKCSCLTASASFLKDQKSVGGELQSCIMRKMFLSTWKTNEDDCFGFSPMGIT 660

Query: 763  QFIKRCNISGQKCSQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKEELVGEAVGCTSQGSL 822
            Q+IKRCN+SGQ  SQVVHFDLHLKSEVHDDSCLKSQMIFVDGRK+++VGEAVGCTSQGSL
Sbjct: 661  QYIKRCNMSGQNISQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKKDIVGEAVGCTSQGSL 720

Query: 823  YLVTNNGLSVVLPSVTIASDSLPSESVARLQPGVLLGTPNQVKGLELKESNCSWSPWQVE 882
            YLVTNNGLSVVLPSVTI S+SLP E VAR QP ++LGT NQVK LELKES C WSPWQVE
Sbjct: 721  YLVTNNGLSVVLPSVTIPSNSLPPEYVARSQPDIILGTANQVKDLELKESKCPWSPWQVE 780

Query: 883  VLDRVLLYESIDEADRLCSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEEV 942
            VLDRVLLYESIDEADRLCSENGWDLKVVRMR FQM LHYLRFDELERSLEMLV+VDLEE 
Sbjct: 781  VLDRVLLYESIDEADRLCSENGWDLKVVRMRCFQMALHYLRFDELERSLEMLVEVDLEEE 840

Query: 943  GILRLLFAAVHLMFQKAGTDNDISAASRLLALGTRFATRMTHRYGMAEFKRNATMFNDFS 1002
            GILRLLFAAVHLMFQKAG DNDISAASRLLALGT FATRM HRYGMAEFKRNAT FNDFS
Sbjct: 841  GILRLLFAAVHLMFQKAGNDNDISAASRLLALGTHFATRMIHRYGMAEFKRNATTFNDFS 900

Query: 1003 SSQEISILPHFPFRKQNELEYSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEAPI 1062
            S QEISILPH PF+KQN  E+SRKLHEMSHFLEIIRNLH HLSSKFKRP QELV GE  +
Sbjct: 901  SGQEISILPHLPFQKQNVSEHSRKLHEMSHFLEIIRNLHGHLSSKFKRPSQELVVGE--V 960

Query: 1063 SDENNLLLDEPQLVSTDIIPLGSTSQYELSFPSNDLSSTVVDGLVMMPMVSESQLDSEDL 1122
            SD+ +LLLDEPQLVSTDII LG+TSQYELSFPSNDL+S VVDGL +MPMVS SQ +SEDL
Sbjct: 961  SDQTSLLLDEPQLVSTDIISLGNTSQYELSFPSNDLNSNVVDGLAIMPMVSGSQFNSEDL 1020

Query: 1123 NGDSAVVPQGVLEKKVVPLENPKQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINH 1182
            + DSAVVPQGVLEKKVVPLENP QMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINH
Sbjct: 1021 DEDSAVVPQGVLEKKVVPLENPNQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINH 1080

Query: 1183 LRELIGEKEPHDTFSEIRDIGRAIAYDLFLKGEIGLAIATLQRLGDDIEVSLKQLLYGTI 1242
            LRELI E EPHDTFSEIRDIGRAIAYDLFLKGE GLAIATLQRLGDDIEVSLKQLLYGTI
Sbjct: 1081 LRELIEENEPHDTFSEIRDIGRAIAYDLFLKGETGLAIATLQRLGDDIEVSLKQLLYGTI 1140

Query: 1243 NRSFRVEIAAEMKKYGYLGPFDQRMMDRIVHIERLYPSSNFWKTFLSRQRANMGSPSSSA 1302
            NRSFRVEIAAEMKKYGYLGPFDQRMMDRI+HIERLYPSSNFWKTFLSRQ+ANMG PSSS 
Sbjct: 1141 NRSFRVEIAAEMKKYGYLGPFDQRMMDRILHIERLYPSSNFWKTFLSRQKANMGFPSSSN 1200

Query: 1303 TPGENDLRTLRFHLINNTIIDCGEVDGVVLGSWPNANENSFVLEITEDNAHVGYWAAAAI 1362
            +PGEN+LRTLRFHLINNT IDCGEVDGVVLGSWPNANE+S V+E TEDNAH+GYWAAAAI
Sbjct: 1201 SPGENELRTLRFHLINNTFIDCGEVDGVVLGSWPNANESSSVVETTEDNAHIGYWAAAAI 1260

Query: 1363 WTNTWDQRTTDRILLDRSLGIGIPVAWESQLDYHICHNNWDEVSRLLDMIPVSNLLDGSL 1422
            WTNTWDQRTTDRILLD+SLG GI VAWESQLDYHICHNNWD VSRLLDMIP +N+LDGSL
Sbjct: 1261 WTNTWDQRTTDRILLDQSLGNGIHVAWESQLDYHICHNNWDGVSRLLDMIPDANILDGSL 1320

Query: 1423 QVSLDGLQSASAVGCNRESSFYSNYLYPLEELDAVCLYIPKAKIFRFSANIMCSKWLGML 1482
            QVSLDGLQSASAVGCNRES+FYSNYLYPLEELDAVCLYIP  KIF+FSANIMCSK LGML
Sbjct: 1321 QVSLDGLQSASAVGCNRESTFYSNYLYPLEELDAVCLYIPNVKIFKFSANIMCSKLLGML 1380

Query: 1483 LEEKLARQFIFLKEYWEGTMELVPLLARSGFITNRLDEIASVDDHISSSVDQRSTNNGGA 1542
            LEEKLAR FIFLKEYWEG+MELVPLLARSGFI +RLDEIAS+DDHISSSVDQRS+N GGA
Sbjct: 1381 LEEKLARHFIFLKEYWEGSMELVPLLARSGFIIHRLDEIASMDDHISSSVDQRSSNKGGA 1440

Query: 1543 FYVDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLVVDNNSVRSLLEAAGDCQWARWLLL 1602
            + VDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLV+DNNSV SLLEAAGDC WARWLLL
Sbjct: 1441 YSVDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLVIDNNSVHSLLEAAGDCHWARWLLL 1500

Query: 1603 SRIRGCEYDASFSNARSIMSLNLVHDPNLGVRDIDEIICTVGDIAEGGGEMAALATLMYA 1662
            SRIRGCEYDASFSNARSIM LNLVHDPNL VR+I+EII TV DIAEGGGEMAALATLMYA
Sbjct: 1501 SRIRGCEYDASFSNARSIMPLNLVHDPNLSVRNIEEIISTVADIAEGGGEMAALATLMYA 1560

Query: 1663 PSPIQDCLSSSGVNRHSSSSAQCTLENLRPALQRFPTLCRALVTSAFQQDTTCNFLGPKL 1722
            PSPIQDCLSS GVNRHSSSSAQCTLENLRP LQRFPTLCRALVTSAFQQDTTCNFLGPK 
Sbjct: 1561 PSPIQDCLSSCGVNRHSSSSAQCTLENLRPVLQRFPTLCRALVTSAFQQDTTCNFLGPKW 1620

Query: 1723 KNALSEYLHWRSSIFFSAGRDTSLLHMLPCWFPKAVRRLLQLYVQGPLGWQSLSALPTGQ 1782
            KNALSEYLHWR+S  FSAGRDTSLLHMLPCWFPKAVRRLL LYVQGPLGWQS+S LPTGQ
Sbjct: 1621 KNALSEYLHWRNSTIFSAGRDTSLLHMLPCWFPKAVRRLLHLYVQGPLGWQSISGLPTGQ 1680

Query: 1783 TLWERDVHFFMNDYEHSEISPISWEATIQKHIEDELYDSSLKETGVGLEHNLHRGRAFSA 1842
             LWERDV+F MND EHSEISPISWEA IQKHIEDELYDSSLKETG+GLEHNLHRGRA SA
Sbjct: 1681 ALWERDVYFVMNDDEHSEISPISWEAAIQKHIEDELYDSSLKETGLGLEHNLHRGRALSA 1740

Query: 1843 FNHLLAARVQKLKSEIQPGSATGPSNTQFDLQALFAPLTLREQSLLSSIIPLAITHFENS 1902
            FNHLL ARVQKLKSEIQ GSA G SN Q DLQ LFAPLT  EQSLLSS+IPLAITHFENS
Sbjct: 1741 FNHLLVARVQKLKSEIQSGSAIGQSNIQLDLQTLFAPLTPMEQSLLSSVIPLAITHFENS 1800

Query: 1903 VLVASCAFLLELCGLSASMLRVDVAALRRISTFNKSGQSFENFRQLSPKGSAFHPVPLES 1962
            VLVASCAFLLEL GLSASML VDVAALRRISTF KSGQSFENFRQLSPKGSAFHPVPLES
Sbjct: 1801 VLVASCAFLLELGGLSASMLHVDVAALRRISTFYKSGQSFENFRQLSPKGSAFHPVPLES 1860

Query: 1963 DKVETLARALADEYLHQESSSVNKPKGTSNSAPSKRCPQV-LFVLQHLEEVSLPQVVDGN 2022
            DK+ETLARALADEYLHQESSSVN+PKGTS+SAP KRC QV LFVLQHLEEVSLP +VDGN
Sbjct: 1861 DKIETLARALADEYLHQESSSVNEPKGTSDSAPPKRCSQVLLFVLQHLEEVSLPHMVDGN 1920

Query: 2023 SCGSWLLSGKGDGTELRNQQKAASHYWNLVTVFCRMHRLPPSSKYLALLARDNDWVGFLT 2082
            SCGSWLL GKGDGTELRNQQK AS +WNLV VFCRMHR+P SSKYLALLARDNDWVGFLT
Sbjct: 1921 SCGSWLLCGKGDGTELRNQQKTASQHWNLVRVFCRMHRIPLSSKYLALLARDNDWVGFLT 1980

Query: 2083 EAHVGGYPFDTVIQVASKEFSDPRLKIHILTVLKAVQSRKNPGPSSYSDTEDKKSQTSFL 2142
            EAHVGGYPFDTVIQVASKEFSDPRLKIHILTVLK+VQSRK+PGPSSYSDTE+KK QT+ L
Sbjct: 1981 EAHVGGYPFDTVIQVASKEFSDPRLKIHILTVLKSVQSRKSPGPSSYSDTEEKKGQTTIL 2040

Query: 2143 DGSTYIPVELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLE 2202
            DGS YIPVELFTILAECE KKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLE
Sbjct: 2041 DGSMYIPVELFTILAECENKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLE 2100

Query: 2203 ITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRSPAFHYCRKNPKRRRTMDSISK 2262
            ITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRS AFHYCRKNPKRRRTMDS+S 
Sbjct: 2101 ITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRSSAFHYCRKNPKRRRTMDSVSD 2160

Query: 2263 DPSVGVISDTFSASTGASTNVSGGFIVKEEGKIVQERRPISVSYDSDEAPSSLSKMVSVL 2322
            DPSV  ISD FSAS  AST V G  IV EEGKIVQE + ISVSYDSDEAPSSLSKMVSVL
Sbjct: 2161 DPSVIAISDNFSASR-ASTKVPGDSIVMEEGKIVQEPQRISVSYDSDEAPSSLSKMVSVL 2220

Query: 2323 CEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSVRVKDEASFSHA 2382
            CEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQM L+EASAHLGSFS RVKDEA +SHA
Sbjct: 2221 CEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMCLAEASAHLGSFSARVKDEAIYSHA 2280

Query: 2383 NVEGEEHTGTSWTGSTAVKAANAVLSVCPSPYERKCLLKLLAATDFGDGGFAAAYYQRLY 2442
            NVEGEE+TGTSWTGSTAVKAANAVLSVCPSPYER+CLLKLLAATDFGDGGF+A+YY+RLY
Sbjct: 2281 NVEGEENTGTSWTGSTAVKAANAVLSVCPSPYERRCLLKLLAATDFGDGGFSASYYRRLY 2340

Query: 2443 WKINLAEPSIRIDDGLHLGNEALDDASLLTALENNGHWEQARNWAKQLEASGGSWKSASH 2502
            +KINLAEP +RIDDGLHLGNEALDDASLL+ALENN HWEQARNWAKQLEASGGSWKSASH
Sbjct: 2341 YKINLAEPLLRIDDGLHLGNEALDDASLLSALENNRHWEQARNWAKQLEASGGSWKSASH 2400

Query: 2503 HVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKD 2562
            HVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKD
Sbjct: 2401 HVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKD 2460

Query: 2563 LPAKELHELLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKNERDLNIN 2622
            LPAKELHELLLLSLQWLSGMFT+SYPVYPL+LLREIETKVWLLAVESEAELKNERDLNI+
Sbjct: 2461 LPAKELHELLLLSLQWLSGMFTVSYPVYPLNLLREIETKVWLLAVESEAELKNERDLNIS 2520

Query: 2623 NSSRECISRNSSSIIDWTASIISKMDKHISTMKNKSMDKHEVRENSQTHHKSHVLDAGLS 2682
             S REC SRNSSSIID TAS+ISKMDKHISTM NK+MDKHEVRENSQTHHKS VLDAGLS
Sbjct: 2521 GSIRECKSRNSSSIIDLTASMISKMDKHISTMTNKNMDKHEVRENSQTHHKSQVLDAGLS 2580

Query: 2683 TAGGGNTKAKRRTKGSVLIRRPLVDSTDMNTNPEDGCVPSNFKNDLHLQDENLKMDTSLS 2742
            T GGGNTK KRRTKGS+L+RRPL DS DMNTN EDG + SN KNDLH+QDENLKMDTS S
Sbjct: 2581 TTGGGNTKTKRRTKGSMLLRRPLADSADMNTNSEDGYISSNVKNDLHMQDENLKMDTSFS 2640

Query: 2743 GWEERIGPAEVDRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDAAFTLAAISTPNR 2802
            GWEERIGPAEV+RA+LSLLEFGQITAAKQLQQKLSP QVPSEFLLVDA+F LAAIST NR
Sbjct: 2641 GWEERIGPAEVERAILSLLEFGQITAAKQLQQKLSPEQVPSEFLLVDASFKLAAISTSNR 2700

Query: 2803 EVSMSMLDEDLCSVILAYDIPVDQYLNPLQVLEILATIFAEGGGRGLCRRVIAVVKAANV 2862
            EV M+MLD DLCSVIL+  I VDQYLNPLQVLE LATIFAEGGGRGLCRRVIAVVKAANV
Sbjct: 2701 EVPMAMLDGDLCSVILSSGIQVDQYLNPLQVLETLATIFAEGGGRGLCRRVIAVVKAANV 2760

Query: 2863 LGLPFSEAYNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAA 2922
            LGL FSEAYNKQPIELLQLLSLKAQESF EANFLVQTHSMPAASIAQILAESFLKGLLAA
Sbjct: 2761 LGLSFSEAYNKQPIELLQLLSLKAQESFAEANFLVQTHSMPAASIAQILAESFLKGLLAA 2820

Query: 2923 HRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELL 2982
            HRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELL
Sbjct: 2821 HRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELL 2880

Query: 2983 ILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILI 3042
            ILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILI
Sbjct: 2881 ILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILI 2940

Query: 3043 ENGQLELLLQKFSAAADTSAGSAEAVRGFRMAVLTSLKHFNPTDLDAFAKVYSHFDMKHE 3102
            EN QLE LLQKFSAA  TS GSAEAVRGFR+AVLTSLKH  P DLDAFAKVYSHFDMKHE
Sbjct: 2941 ENNQLEFLLQKFSAAISTSTGSAEAVRGFRIAVLTSLKHLIPNDLDAFAKVYSHFDMKHE 3000

Query: 3103 TAALLESQAEQSCEMWFRRYYKDQNADLLDAMHYYIAAAEVHSSIDAGNKTRRSCAQASL 3162
            TAALLE QAEQSCEMWFRRY KDQN DLLDAM YYI AAEV+SSIDAGNKTRRSCAQASL
Sbjct: 3001 TAALLERQAEQSCEMWFRRYDKDQNEDLLDAMLYYIKAAEVYSSIDAGNKTRRSCAQASL 3060

Query: 3163 VSLQIRMPDFKWLFQSETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKPE 3222
            VSLQIRMPDFKWLFQ+ETNARRALV+QSRFQEALIVAEAYDLDQPSEWALVIWNQMLKP+
Sbjct: 3061 VSLQIRMPDFKWLFQTETNARRALVDQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKPD 3120

Query: 3223 ILEEFVAEFVSVLPLHPSMLADIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRS 3246
            ILE+FVAEFV+VLPLHPSML+DIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRS
Sbjct: 3121 ILEDFVAEFVTVLPLHPSMLSDIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRS 3180

BLAST of Sgr026353 vs. NCBI nr
Match: XP_011657786.1 (uncharacterized protein LOC101206379 [Cucumis sativus] >KGN48416.1 hypothetical protein Csa_003922 [Cucumis sativus])

HSP 1 Score: 5459.0 bits (14160), Expect = 0.0e+00
Identity = 2760/3240 (85.19%), Positives = 2917/3240 (90.03%), Query Frame = 0

Query: 103  MDSVSGGGGPAILQLHKWNPSQPQLNLSEYREAFISPARQILLLHSYKHEALLLPLNTGD 162
            MDSVSG  GPAILQL KWNPSQPQLNL+EYREAFISP RQ LLLHSYKHEALLLPLNTGD
Sbjct: 1    MDSVSGCEGPAILQLQKWNPSQPQLNLAEYREAFISPTRQNLLLHSYKHEALLLPLNTGD 60

Query: 163  VRCGNDLPNGYDINLKDLGSLAFSEVVSTAPRSEDAEGNVRCSNKSAVDIDNDSPTGNKS 222
            +RC ++ P  YD +LKD GSL FSE VSTA RSEDAEG+V+CSN+S VDID  SPT ++S
Sbjct: 61   IRCSDNFPKEYDTHLKDSGSLTFSE-VSTAFRSEDAEGDVQCSNQSVVDIDTHSPTRDES 120

Query: 223  SRSSCNNFLGDVSSLAWGLCGDTYKKRKDSSFKEILFVSGNHGVTAHAFCQPNKTNEEAK 282
            S +SCNNFLGDVSSLAWGLCGD YKK +D  F EILFVSG+HGVTAHAFC+P KT  EAK
Sbjct: 121  SGASCNNFLGDVSSLAWGLCGDNYKKHEDYFFMEILFVSGSHGVTAHAFCEPKKTVAEAK 180

Query: 283  NMVQSEFWKGRWMEWGPYPTLVQNLEVQELSDSCVTSGNVDKNRINQNGEILRSSCYEFE 342
            NMVQSEF KGRW+EWGPYPTL Q L  QE S S  T GNVD+N  NQNGE+L SS  + E
Sbjct: 181  NMVQSEFRKGRWVEWGPYPTLPQILGAQESSGSSETCGNVDENGRNQNGEMLPSSNSKCE 240

Query: 343  DDALLLGNSAPKRYLQSFLAKVKTIEYEDDIWT--------------------------- 402
            +DALL GNS  KRYL+SFLAKVKTIEYEDDIWT                           
Sbjct: 241  NDALLSGNSTSKRYLRSFLAKVKTIEYEDDIWTMYPEKSSVPCFTKVVSFNIFNYNLPPP 300

Query: 403  ------------------------------------------------------------ 462
                                                                        
Sbjct: 301  NSVDNSSVNEQNWHEIILGTPGNTRSTSSDTRVLSDILSNVFGIGMNKSYKCSRVFASNS 360

Query: 463  ---------IVDSVSADRVDETGSRNDTLILVARVGNLGIKWVSSVKFEKSLYISPLMEW 522
                     +V+SVSAD   ET SRNDTLILVAR G+LGIKWVSSV+FEKS Y+SP MEW
Sbjct: 361  HILIGFVLKMVESVSADEDAETESRNDTLILVARAGSLGIKWVSSVEFEKSQYVSPRMEW 420

Query: 523  ADFCFSNDFLLCLSDSGFIFVHSALSGKHVTCIDVLQACGLNPKYLHLKQDLQMNQVDQV 582
            ADFCFSNDF++CLSDSGFIF+HSALSGKHVT IDVLQACGL+PKYLH KQDLQM QVD V
Sbjct: 421  ADFCFSNDFIVCLSDSGFIFIHSALSGKHVTRIDVLQACGLDPKYLHEKQDLQMKQVDHV 480

Query: 583  QDDVSCSRDSFYDRRKFRRLLSDSHSSHFAVIDAFGIMYVVSAVDHMLEHYHGSENLFPH 642
            QD VSC R SFY  RKFRRLLSDS SS FAVID FG+MYVVSAVDHML+HY+GSENL  H
Sbjct: 481  QDVVSCRRGSFYGTRKFRRLLSDSLSSRFAVIDTFGVMYVVSAVDHMLDHYYGSENLLGH 540

Query: 643  PHNFELGRAPVSWEVGGYDIGCQRNYSESLGSHSCRVFSMKNEGVSFWGNTRFDVLQNTQ 702
             HN EL + P SWE GGYDIGCQRNYSESLGSHSC   SMKNEG S WGN++++VLQN Q
Sbjct: 541  SHNLELVKVPASWEGGGYDIGCQRNYSESLGSHSCGNGSMKNEGASLWGNSKYNVLQNIQ 600

Query: 703  DSKVCTGRKYKCSCLTASASILQNQKFQGGELQSCTMRKMFLSTWKTNEDDCFCFSPMGL 762
            DSKV TG++YKCSCLTASA ILQ+Q+ QGGELQSC MRK+F+S  KTNE+DCFCFSPMGL
Sbjct: 601  DSKVYTGKRYKCSCLTASAPILQDQESQGGELQSCMMRKIFVSACKTNENDCFCFSPMGL 660

Query: 763  TQFIKRCNISGQKCSQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKEELVGEAVGCTSQGS 822
            TQ+I+RCN SGQ   QVVHFDLHLKSEVHDDSCLKSQM F+DGRK++LVGEAVGCTSQGS
Sbjct: 661  TQYIRRCNTSGQNSFQVVHFDLHLKSEVHDDSCLKSQMTFIDGRKKDLVGEAVGCTSQGS 720

Query: 823  LYLVTNNGLSVVLPSVTIASDSLPSESVARLQPGVLLGTPNQVKGLELKESNCSWSPWQV 882
            LYLVTN+GLSVVLPS+T++S+SLP ESVARLQPG LLGT NQVK LELKES C WSPWQV
Sbjct: 721  LYLVTNDGLSVVLPSITVSSNSLPYESVARLQPGSLLGTTNQVKDLELKESKCPWSPWQV 780

Query: 883  EVLDRVLLYESIDEADRLCSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEE 942
            EVLDRVLLYESIDEADRLCSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEE
Sbjct: 781  EVLDRVLLYESIDEADRLCSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEE 840

Query: 943  VGILRLLFAAVHLMFQKAGTDNDISAASRLLALGTRFATRMTHRYGMAEFKRNATMFNDF 1002
             GILRLLFAAVHLMFQKAG DNDISAASRLLALGT FATRM H+YGMAE KRNAT FNDF
Sbjct: 841  EGILRLLFAAVHLMFQKAGNDNDISAASRLLALGTHFATRMIHQYGMAELKRNATTFNDF 900

Query: 1003 SSSQEISILPHFPFRKQNELEYSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEAP 1062
            SSSQEISI P FPFR QNEL+YSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEA 
Sbjct: 901  SSSQEISIFPDFPFRMQNELDYSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEAL 960

Query: 1063 ISDENNLLLDEPQLVSTDIIPLGSTSQYELSFPSNDLSSTVVDGLVMMPMVSESQLDSED 1122
            ISD+ + LLDEPQ VSTD+IP GSTSQYELSFPSNDL+S V+DGLVMMPM+S SQ+DSED
Sbjct: 961  ISDQTSQLLDEPQFVSTDVIPSGSTSQYELSFPSNDLNSNVIDGLVMMPMISGSQMDSED 1020

Query: 1123 LNGDSAVVPQGVLEKKVVPLENPKQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHIN 1182
            L+GDSAVVPQGV EKKV+PLENP QMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHIN
Sbjct: 1021 LDGDSAVVPQGVFEKKVLPLENPNQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHIN 1080

Query: 1183 HLRELIGEKEPHDTFSEIRDIGRAIAYDLFLKGEIGLAIATLQRLGDDIEVSLKQLLYGT 1242
            H+RELIGE EPHDTFSEIRDIGRAIAYDLFLKGE G+AIATLQRLGDDIEVSLKQLLYGT
Sbjct: 1081 HVRELIGENEPHDTFSEIRDIGRAIAYDLFLKGETGVAIATLQRLGDDIEVSLKQLLYGT 1140

Query: 1243 INRSFRVEIAAEMKKYGYLGPFDQRMMDRIVHIERLYPSSNFWKTFLSRQRANMGSPSSS 1302
            INR+FRVEIAAEM+KYGYLGPFDQRMMD I+HIERLYPSSNFWKTFLSRQ+ANMG PSSS
Sbjct: 1141 INRTFRVEIAAEMEKYGYLGPFDQRMMDIILHIERLYPSSNFWKTFLSRQKANMGFPSSS 1200

Query: 1303 ATPGENDLRTLRFHLINNTIIDCGEVDGVVLGSWPNANENSFVLEITEDNAHVGYWAAAA 1362
             +PGENDL+TL FH+INNTIIDCGEVDGVVLGSWP+ANENS VLEI EDN H+GYWAAAA
Sbjct: 1201 NSPGENDLKTLHFHVINNTIIDCGEVDGVVLGSWPDANENSPVLEINEDNVHMGYWAAAA 1260

Query: 1363 IWTNTWDQRTTDRILLDRSLGIGIPVAWESQLDYHICHNNWDEVSRLLDMIPVSNLLDGS 1422
            IWTNTWDQRTTDRILLD+SL IGI V WESQLDYHICHNNWD VSRLLDMIPV+NLLDGS
Sbjct: 1261 IWTNTWDQRTTDRILLDQSLDIGIHVTWESQLDYHICHNNWDGVSRLLDMIPVANLLDGS 1320

Query: 1423 LQVSLDGLQSASAVGCNRESSFYSNYLYPLEELDAVCLYIPKAKIFRFSANIMCSKWLGM 1482
            LQVSLDGLQ+A+AVGCNRESSFY NYLYPLEELDA+CLYIP AKIFRFS NIMCSKWLG 
Sbjct: 1321 LQVSLDGLQTATAVGCNRESSFYGNYLYPLEELDAICLYIPNAKIFRFSTNIMCSKWLGA 1380

Query: 1483 LLEEKLARQFIFLKEYWEGTMELVPLLARSGFITNRLDEIASVDDHISSSVDQRSTNNGG 1542
            LLEEKLAR FIFLKEYWEGTMELVPLLAR+GFIT RLDEI  +DDHI+SSV Q ++N GG
Sbjct: 1381 LLEEKLARYFIFLKEYWEGTMELVPLLARAGFITPRLDEIDFMDDHINSSVGQSTSNKGG 1440

Query: 1543 AFYVDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLVVDNNSVRSLLEAAGDCQWARWLL 1602
            +F VDS+QALYKVFIHHCSQYNLPFLLDLYLDHHKL VDNNSVRSLLEAAGDCQWARWLL
Sbjct: 1441 SFSVDSMQALYKVFIHHCSQYNLPFLLDLYLDHHKLAVDNNSVRSLLEAAGDCQWARWLL 1500

Query: 1603 LSRIRGCEYDASFSNARSIMSLNLVHDPNLGVRDIDEIICTVGDIAEGGGEMAALATLMY 1662
            LSR RGCEYDASF+NARSIMS NLVHDPNL VR+IDEII TV DIAEG GEMAALATLMY
Sbjct: 1501 LSRTRGCEYDASFANARSIMSPNLVHDPNLSVRNIDEIISTVADIAEGAGEMAALATLMY 1560

Query: 1663 APSPIQDCLSSSGVNRHSSSSAQCTLENLRPALQRFPTLCRALVTSAFQQDTTCNFLGPK 1722
            APSPIQDCL+ SGVNRHSSSSAQCTLENLRP LQRFPTLCRAL TSAFQQDT CNFLGPK
Sbjct: 1561 APSPIQDCLNCSGVNRHSSSSAQCTLENLRPVLQRFPTLCRALFTSAFQQDTACNFLGPK 1620

Query: 1723 LKNALSEYLHWRSSIFFSAGRDTSLLHMLPCWFPKAVRRLLQLYVQGPLGWQSLSALPTG 1782
             KNALSEYLHWR+ IF SAGRDTSLLHMLPCWFPK VRRLLQLYVQGPLGWQS+S LPTG
Sbjct: 1621 SKNALSEYLHWRNIIFLSAGRDTSLLHMLPCWFPKTVRRLLQLYVQGPLGWQSVSGLPTG 1680

Query: 1783 QTLWERDVHFFMNDYEHSEISPISWEATIQKHIEDELYDSSLKETGVGLEHNLHRGRAFS 1842
            QT+WERDV+FFMND EHSEISPISWEATIQKHIEDELYDSSLKETG+GLEHNLHRGRA S
Sbjct: 1681 QTIWERDVYFFMNDDEHSEISPISWEATIQKHIEDELYDSSLKETGLGLEHNLHRGRALS 1740

Query: 1843 AFNHLLAARVQKLKSEIQPGSATGPSNTQFDLQALFAPLTLREQSLLSSIIPLAITHFEN 1902
            AFNHLLAARVQKLKSE+Q  SA G SN Q DLQ LFAPLT  EQSLLSSIIPLAITHFEN
Sbjct: 1741 AFNHLLAARVQKLKSEVQSSSAPGHSNVQLDLQTLFAPLTPGEQSLLSSIIPLAITHFEN 1800

Query: 1903 SVLVASCAFLLELCGLSASMLRVDVAALRRISTFNKSGQSFENFRQLSPKGSAFHPVPLE 1962
            SVLVASCAFLLEL GLSASMLRVDVAALRRISTF KSGQSFENFRQLSPKGSAFHPVPLE
Sbjct: 1801 SVLVASCAFLLELGGLSASMLRVDVAALRRISTFYKSGQSFENFRQLSPKGSAFHPVPLE 1860

Query: 1963 SDKVETLARALADEYLHQESSSVNKPKGTSNSAPSKRCPQV-LFVLQHLEEVSLPQVVDG 2022
            SDK+E LARALADEYLHQESS V + KG+S+S P KRCP V LFVLQHLEEVSLPQVVDG
Sbjct: 1861 SDKIENLARALADEYLHQESSGVKRSKGSSDSEPPKRCPHVLLFVLQHLEEVSLPQVVDG 1920

Query: 2023 NSCGSWLLSGKGDGTELRNQQKAASHYWNLVTVFCRMHRLPPSSKYLALLARDNDWVGFL 2082
            NSCGSWL SGKGDGTELRNQQKAASHYWNLVTVFCRMH LP SSKYLALLARDNDWVGFL
Sbjct: 1921 NSCGSWLSSGKGDGTELRNQQKAASHYWNLVTVFCRMHSLPLSSKYLALLARDNDWVGFL 1980

Query: 2083 TEAHVGGYPFDTVIQVASKEFSDPRLKIHILTVLKAVQSRKNPGPSSYSDTEDKKSQTSF 2142
            TEAHVGGYPFDTVIQVAS+EFSDPRLKIHILTVLKAVQ RK+ GPSS+ DTE+KK QT+F
Sbjct: 1981 TEAHVGGYPFDTVIQVASREFSDPRLKIHILTVLKAVQLRKSSGPSSHYDTEEKKGQTTF 2040

Query: 2143 LDGSTYIPVELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWL 2202
            LDG  Y+PVELFTILAECEKKKNPGKALLI+AEELSWSILAMIASCF DVSPLSCLTVWL
Sbjct: 2041 LDGKMYVPVELFTILAECEKKKNPGKALLIRAEELSWSILAMIASCFSDVSPLSCLTVWL 2100

Query: 2203 EITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRSPAFHYCRKNPKRRRTMDSIS 2262
            EITAARETTSIKVNDIASQIAENVGAAVEATNTLP GCRSPAFHYCRKNPKRRRT+  IS
Sbjct: 2101 EITAARETTSIKVNDIASQIAENVGAAVEATNTLPVGCRSPAFHYCRKNPKRRRTVVFIS 2160

Query: 2263 KDPSVGVISDTFSASTGASTNVSGGFIVKEEGKIVQERRPISVSYDSDEAPSSLSKMVSV 2322
            ++ SVGV+SD  SAS G STNVSG  IVKEEGK+VQER+PISVSYDSDEA SSLSKMVSV
Sbjct: 2161 EEQSVGVMSDNSSASAGVSTNVSGDCIVKEEGKVVQERQPISVSYDSDEAASSLSKMVSV 2220

Query: 2323 LCEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSVRVKDEASFSH 2382
            LCEQ+L+LPLLRAFEMFLPSCSLL FIRALQAFSQMRL+EASAHLGSFSVRVKDEAS+SH
Sbjct: 2221 LCEQQLYLPLLRAFEMFLPSCSLLSFIRALQAFSQMRLAEASAHLGSFSVRVKDEASYSH 2280

Query: 2383 ANVEGEEHTGTSWTGSTAVKAANAVLSVCPSPYERKCLLKLLAATDFGDGGFAAAYYQRL 2442
            +NVEGEE+ GTSWTGSTAVKAANAVLSVCPSPYER+CLLKLLAA+DFGDGGFAA YY+RL
Sbjct: 2281 SNVEGEENIGTSWTGSTAVKAANAVLSVCPSPYERRCLLKLLAASDFGDGGFAATYYRRL 2340

Query: 2443 YWKINLAEPSIRIDDGLHLGNEALDDASLLTALENNGHWEQARNWAKQLEASGGSWKSAS 2502
            YWKI+LAEP +RIDDGLHLGNEALDD+SLLTALENNGHWEQARNWAKQLEASGGSWKSAS
Sbjct: 2341 YWKIDLAEPLLRIDDGLHLGNEALDDSSLLTALENNGHWEQARNWAKQLEASGGSWKSAS 2400

Query: 2503 HHVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEK 2562
            HHVTETQAESMVAEWKEFLWDVQEERVALWGHCQALF+RYSFPALQAGLFFLKHAEAVEK
Sbjct: 2401 HHVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFVRYSFPALQAGLFFLKHAEAVEK 2460

Query: 2563 DLPAKELHELLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKNERDLNI 2622
            DLPAKELHELLLLSLQWLSGMFTMS PVYPLHLLREIETKVWLLAVESEAELKNERDLNI
Sbjct: 2461 DLPAKELHELLLLSLQWLSGMFTMSNPVYPLHLLREIETKVWLLAVESEAELKNERDLNI 2520

Query: 2623 NNSSRECISRNSSSIIDWTASIISKMDKHISTMKNKSMDKHEVRENSQTHHKSHVLDAGL 2682
            + SSRECISRNSSSIID TA++ISKMDKHISTMKNK++DKHE RENSQTHHK  +LDAG+
Sbjct: 2521 SGSSRECISRNSSSIIDSTANMISKMDKHISTMKNKNIDKHEARENSQTHHKGQILDAGI 2580

Query: 2683 STAGGGNTKAKRRTKGSVLIRRPLVDSTDMNTNPEDGCVPSNFKNDLHLQDENLKMDTSL 2742
            STAGGGNTKAKRRTKGS+L+RR +VDSTDMNTNPEDG + SNFKNDL  QDEN KMDTS 
Sbjct: 2581 STAGGGNTKAKRRTKGSMLLRRSVVDSTDMNTNPEDGYISSNFKNDLQSQDENSKMDTSF 2640

Query: 2743 SGWEERIGPAEVDRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDAAFTLAAISTPN 2802
            SGWEER+GPAE DRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDA+F LAA+STPN
Sbjct: 2641 SGWEERVGPAEADRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDASFKLAALSTPN 2700

Query: 2803 REVSMSMLDEDLCSVILAYDIPVDQYLNPLQVLEILATIFAEGGGRGLCRRVIAVVKAAN 2862
            REVSMSM+D+DL SVIL+ +IPVD+YLNPLQVLEILATIFAEG GRGLC+RVIAVVKAAN
Sbjct: 2701 REVSMSMVDDDLSSVILSNNIPVDRYLNPLQVLEILATIFAEGSGRGLCKRVIAVVKAAN 2760

Query: 2863 VLGLPFSEAYNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLA 2922
            VLGL FSEAYNKQPIELLQLLSLKAQESFEEAN LVQTHSMPAASIAQILAESFLKGLLA
Sbjct: 2761 VLGLSFSEAYNKQPIELLQLLSLKAQESFEEANLLVQTHSMPAASIAQILAESFLKGLLA 2820

Query: 2923 AHRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVEL 2982
            AHRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVEL
Sbjct: 2821 AHRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVEL 2880

Query: 2983 LILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGIL 3042
            LILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGIL
Sbjct: 2881 LILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGIL 2940

Query: 3043 IENGQLELLLQKFSAAADTSAGSAEAVRGFRMAVLTSLKHFNPTDLDAFAKVYSHFDMKH 3102
            IENGQLELLLQKFSAA +TSAGSAEAVRGFR+AVLTSLKHFNP DLDAFAKVYSHFDMKH
Sbjct: 2941 IENGQLELLLQKFSAAVNTSAGSAEAVRGFRIAVLTSLKHFNPNDLDAFAKVYSHFDMKH 3000

Query: 3103 ETAALLESQAEQSCEMWFRRYYKDQNADLLDAMHYYIAAAEVHSSIDAGNKTRRSCAQAS 3162
            ETAALLESQAEQSCEMWFRRY KDQN DLLDAMHYYI AAEV+SSIDAGNKTRRSCAQ+S
Sbjct: 3001 ETAALLESQAEQSCEMWFRRYDKDQNEDLLDAMHYYIKAAEVYSSIDAGNKTRRSCAQSS 3060

Query: 3163 LVSLQIRMPDFKWLFQSETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKP 3222
            LVSLQIRMPDFKWLFQ+ETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKP
Sbjct: 3061 LVSLQIRMPDFKWLFQTETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKP 3120

Query: 3223 EILEEFVAEFVSVLPLHPSMLADIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGR 3246
            EILEEFVAEFV+VLPLHPSML DIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGR
Sbjct: 3121 EILEEFVAEFVTVLPLHPSMLTDIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGR 3180

BLAST of Sgr026353 vs. ExPASy Swiss-Prot
Match: Q55GD2 (Protein DDB_G0268328 OS=Dictyostelium discoideum OX=44689 GN=DDB_G0268328 PE=4 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 4.4e-25
Identity = 248/1450 (17.10%), Positives = 516/1450 (35.59%), Query Frame = 0

Query: 1953 WNLVTVFCRMHRLPPSSKYLALLARDNDWVGFLTEAHVGGYPFDTVIQVASKEFSDPRLK 2012
            W L++ FC+ H +   ++ L  LA   +W+ F+ +A +  +P   + ++  ++ +   +K
Sbjct: 2425 WYLLSKFCQCHHVAKLTRQLEHLASSGNWLEFIYQAQIQDFPLQQIKEIIYEKVNSIGIK 2484

Query: 2013 IHILTVLKAV-------------QSRKNPGPSSYSDTEDKK----SQTSFLDGSTYIPVE 2072
             H+L VL+ +             Q   N   +   + E+++    +    ++   Y P+E
Sbjct: 2485 SHLLLVLEQLSNDRKRQYINHHHQINNNNNNNEKEEEEEREEGINNSELLINSMNYYPIE 2544

Query: 2073 -------LFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFP----DVSPLSCLTVW 2132
                   +   L    +  NP   LL  A +    +LA+IA+C      D+S + CL  W
Sbjct: 2545 ESKLSNDIIGYLLSSYRSNNPRNYLLYNATKDKRPLLAVIANCIDRNQNDLSTIECLVTW 2604

Query: 2133 LEITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRSPAFHYCRKNPKRRRTMDSI 2192
            L +     +  +  ND  + I ++    ++A N           H+   N K+    D I
Sbjct: 2605 LCVCTTNLSKFLSGNDNVT-IDDSFDFDLDALNI-------SLDHHVVPNIKKYNYQDLI 2664

Query: 2193 SKDPSVGVISDTFSASTGASTNVSGGFIVKEEGKIVQERRPISVSYDSDEAPSSLSKMVS 2252
            +                                         SV Y              
Sbjct: 2665 N-----------------------------------------SVQY-------------- 2724

Query: 2253 VLCEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSVRVKD----- 2312
            ++  +K F+ LL+ F++FLP+  LL +++ +  F Q R  ++   L  F   + D     
Sbjct: 2725 IIQNRKSFI-LLQGFKIFLPNNILLNYLKFINQFHQYRFEDSEESLRLFINDLFDFKGGD 2784

Query: 2313 ----------EASFSHANVEGEEHTGTSWTGSTAVKAANAVLSVCP------SPYERKCL 2372
                      + + +  N   + +  +S    +       V+ +C       S YER+  
Sbjct: 2785 GNNNNNNNNNQNNNNQNNNNNQNNNDSSIYFKSKDSVKKMVVEICENLLEMFSSYEREHF 2844

Query: 2373 LKLLAATDFGDGGFAAAYYQRLYWKINLAEPSIRIDDGLHLGNEALDDASLLTALENNGH 2432
            + +L  +        +  +  LY  +NL + +   D  +H     +D   ++  L   G 
Sbjct: 2845 IGILHRSGI------SYTFSTLYSTLNLLKRTQMQDKSIH-----MDPKLIVQHLIEKGL 2904

Query: 2433 WEQARNWAKQLEASGGSWKSASHHVTETQAESMVAEWKE-FLWDVQEERVALWGHCQALF 2492
            ++ AR ++ +              VT  + ++++  +++  LW++++ER+ LW  CQ  F
Sbjct: 2905 FKDARTYSLENHLD-------KDLVTVAEVDALINHYQQGCLWEIEQERINLWKKCQQYF 2964

Query: 2493 IRYSFPALQAGLFFLKHAEAVEKDLPAKELHELLLLSLQWL-----------------SG 2552
            I++      AG  F      ++   P++E   LL ++++W                  SG
Sbjct: 2965 IQHQSKPDIAGELFYNRGNNIQ---PSREKVFLLSIAVEWFEKSYFENINTDNIVGSGSG 3024

Query: 2553 MFTMSYPVYPLHL----LREIETKVWLLAVESEAELKNERDL------------------ 2612
              T   P     +    + +++ ++ LL+V    +  N+R                    
Sbjct: 3025 NTTQLTPTKSSSITTTFIEDLKKQILLLSVGLSNQESNDRPFDDDGSYEDFSPSTSPARS 3084

Query: 2613 --------------------NINNSSRECISRNSSSIIDWTASIISKMDKHISTMKNKSM 2672
                                N NNS+ + I  +S +++ +     S M    S++ +  +
Sbjct: 3085 ITFPRGTNRGSGGFKSVNNNNNNNSNSDSIKNSSQNLLSY---FTSPMASPSSSLSSSPI 3144

Query: 2673 DKHE--VRENSQTHHKSHVLDAGLSTAGGGNTKAKRRTKGSVLIRRPLVDSTDMNTNPED 2732
            D H+  +++ +Q + K  ++ +   T    +T   ++      I+     +T   T+   
Sbjct: 3145 DHHDSPIKDYAQRNRKFQLISSS-GTNSSFSTSPLQQPSFVSSIKDNKTTTTTTTTSVSS 3204

Query: 2733 GCVPSNFKNDLHLQDENLKMDTSLSGWEERIGPAEVDRAVLSLLEFGQITAAKQLQQKLS 2792
               P                         ++ P  +D  +  LL   Q+  A+Q+ Q+ +
Sbjct: 3205 VISPIK---------------------SIQLEPKALDNVLSKLLNSNQLFEAEQIVQQFN 3264

Query: 2793 PGQVPSEFLLVDAAFTLAAISTPNREVSMSMLDEDLCSVILAYDIPVDQYLNPLQVLEIL 2852
               +  + +          IS    +    +++E      L+ D    ++L   Q  +  
Sbjct: 3265 YKSIDYDLISTMLKIVNRTISPNPNQFPQELINE------LSRDYNTSRWLKASQQSQFQ 3324

Query: 2853 AT---IFAEGGGRG----------------------------LCRRVIAVVKAANVLGLP 2912
            ++     A G G G                              + +I     +  L + 
Sbjct: 3325 SSSTFTGASGNGNGGNSGFEISVENILSTLENLSSCCTLAKQAAKAIINKFNVSEKLSMG 3384

Query: 2913 FSEAYNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAAHRGG 2972
            +SE     P +++  L L  ++ F      + T+ +    I   LA+ F + ++  +   
Sbjct: 3385 YSELLISNPYDIINQLLLLGKDCFRLIKSYIFTNHLDIDKINDQLADLFSETIINQYNKS 3444

Query: 2973 YMDSQKDEGPAPLL-------------------------WRFSDFLKWSELCPSEPEIGH 3032
            +  +      +P +                         W   +F ++  +       G 
Sbjct: 3445 HQSTSSSGSTSPNMSILSLSLDDQPGRVGEGKSNGIDPNWTSEEFHEYIRIGRDPFTFGM 3504

Query: 3033 ALMRLVITGQE------------------------------------------------- 3092
             L+       E                                                 
Sbjct: 3505 KLIEATRINYESYFSVDNSPGIFGNTGMGSYVSPSLKQQSNTLNGTGGGGGNGGGNNGSG 3564

Query: 3093 -----IPHACEVELLILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITG 3152
                 I    EVE+ + +H  +  +  +DG  +++ +  +RV  Y   G +  L RLITG
Sbjct: 3565 KLSSPIGMEAEVEMFVRAHFCFVIACSVDGTILVLNMVKSRVNYYADAGKYKLLVRLITG 3624

Query: 3153 VGNFYALSFILGILIENGQLELLLQKFSAAADTSAGSAEAVRGFRMAVLTSLKHFNPTDL 3162
            +  +  L  I  IL+++ Q ELLL+K            E   G ++A+ + L    P   
Sbjct: 3625 MQCYNELQSIFDILLQHNQFELLLRK-------KIHQHEDQNGLKLALHSYLMKKQPLYQ 3684

BLAST of Sgr026353 vs. ExPASy Swiss-Prot
Match: Q3UHA3 (Spatacsin OS=Mus musculus OX=10090 GN=Spg11 PE=1 SV=3)

HSP 1 Score: 96.7 bits (239), Expect = 5.2e-18
Identity = 98/424 (23.11%), Positives = 190/424 (44.81%), Query Frame = 0

Query: 2750 GRGLCRRVIAVVKAANVLGLPFSEAYNKQPIELLQ-LLSLKAQESFEEANFLVQTHSMPA 2809
            G+  CR+V+ + + A  LG  + +   +    +L+ +L+ +  +   +A   + T  + A
Sbjct: 1974 GKNYCRQVLCLYELAKDLGCSYGDVAARDSEAMLRAILASQRPDRCRQAQVFINTQGLEA 2033

Query: 2810 ASIAQILAESFLKGLLAAHRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALM 2869
             ++A+++AE   + LL    G     ++   PA        FL+ + LC     +G   M
Sbjct: 2034 DTVAELVAEEVTRELLTPSEG--TGEKQPFNPAE---ESQTFLQLTALCQDRTLVG---M 2093

Query: 2870 RLVITGQEIPH---ACEVELLILSHHFYKSSACLDGVDVLVALAATRVEAYVAEG-DFPC 2929
            +L+     +PH   +C  ELLIL+HH +  +  ++G+  ++  A    + ++A   ++  
Sbjct: 2094 KLLDKIPSVPHGELSCTTELLILAHHCFTFTCHMEGITRVLQAARMLTDNHLAPNEEYGL 2153

Query: 2930 LARLITGVGNFYALSFILGILIENGQLELLLQKFSAAADTSAGSAEAVRGFRMAVLTSLK 2989
            + RL+TG+G +  +++I  +L +    E+L++K            +     + A+L  +K
Sbjct: 2154 VVRLLTGIGRYNEMTYIFDLLHQKHYFEVLMRK----------KLDPTGTLKTALLDYIK 2213

Query: 2990 HFNPTDLDAFAKVYSHFDM------KHETAALLESQAEQSCEMWFRRYYKD---QNADLL 3049
               P D +    +   F M       HE AA ++ +  +S + W     KD       LL
Sbjct: 2214 RCRPGDSEKHNMIALCFSMCREIGENHEAAACIQLKLIES-QPW-EESLKDGAQLKQLLL 2273

Query: 3050 DAMHYYIAAAEVHSSIDAGNKTRRSCAQASLVSLQIRM----PDFKWLFQSETNARRALV 3109
             A+   + AAE ++      +         L++LQI       +   +          ++
Sbjct: 2274 KALTLMLDAAESYAKDSCVRQALHCNRLTKLITLQIHFLNSGQNTMLINLGHQKLMDCIM 2333

Query: 3110 EQSRFQEALIVAEAYDLDQPSEWALVIWNQ-MLKPEILEEFVAEFVSVLPLHPSMLADIA 3155
               RF +A IVAEAYD     +WA V++ Q +LK +    ++ EF     L P++  DI+
Sbjct: 2334 TLPRFYQASIVAEAYDF--VPDWAEVLYQQVILKGDF--SYLEEFKQQKLLRPNIFEDIS 2373

BLAST of Sgr026353 vs. ExPASy Swiss-Prot
Match: Q96JI7 (Spatacsin OS=Homo sapiens OX=9606 GN=SPG11 PE=1 SV=3)

HSP 1 Score: 75.9 bits (185), Expect = 9.6e-12
Identity = 49/176 (27.84%), Positives = 88/176 (50.00%), Query Frame = 0

Query: 1941 ELRNQQKAASHYWNLVTVFCRMHRLPPSSKYLALLARDNDWVGFLTEAHVGGYPFDTVIQ 2000
            E++     +S  W LV  FCR+H +  S  YL   A+ NDW+ F+  + +  Y    V  
Sbjct: 1331 EIKRLSSESSSQWALVVQFCRLHNMKLSISYLRECAKANDWLQFIIHSQLHNYHPAEVKS 1390

Query: 2001 VASKEFSDPRLKIHILTVLKAVQSRKNPGPSSYSDTEDKKSQ-TSFLDGSTYIPVELFTI 2060
            +   ++  P ++ H+    + + S     P+S  D++   ++    L GS     +LF I
Sbjct: 1391 LI--QYFSPVIQDHLRLAFENLPS----VPTSKMDSDQVCNKCPQELQGSKQEMTDLFEI 1450

Query: 2061 LAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLEITAARETTS 2116
            L +C ++ +    LL++A +    IL+++ASC    S +SCL VW+ IT+  +  +
Sbjct: 1451 LLQCSEEPDSWHWLLVEAVKQQAPILSVLASCLQGASAISCLCVWI-ITSVEDNVA 1499

BLAST of Sgr026353 vs. ExPASy TrEMBL
Match: A0A6J1KYH3 (uncharacterized protein LOC111497504 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111497504 PE=4 SV=1)

HSP 1 Score: 5459.8 bits (14162), Expect = 0.0e+00
Identity = 2774/3239 (85.64%), Positives = 2915/3239 (90.00%), Query Frame = 0

Query: 103  MDSVSGGGGPAILQLHKWNPSQPQLNLSEYREAFISPARQILLLHSYKHEALLLPLNTGD 162
            MDSVSG  GPAILQL  WNPSQPQLNLSEYREAFISP R ILLLHSYKHEALLLPL+TGD
Sbjct: 1    MDSVSGCEGPAILQLQNWNPSQPQLNLSEYREAFISPTRSILLLHSYKHEALLLPLDTGD 60

Query: 163  VRCGNDLPNGYDINLKDLGSLAFSEVVSTAPRSEDAEGNVRCSNKSAVDIDNDSPTGNKS 222
             RC +D PN YDI+LKDLGS AFSE  ST    ED EG+V+CSN+ AVD+D DSPT N+ 
Sbjct: 61   NRCSDDFPNKYDIDLKDLGSSAFSEEASTTCWREDDEGDVQCSNRLAVDVDKDSPTKNRF 120

Query: 223  SRSSCNNFLGDVSSLAWGLCGDTYKKRKDSSFKEILFVSGNHGVTAHAFCQPNKTNEEAK 282
            SRSSCNNFLGDVSSLAWGLCG++Y K +DSSFKEILFVSGNHGVTAHAFCQP K   E K
Sbjct: 121  SRSSCNNFLGDVSSLAWGLCGESYTKHEDSSFKEILFVSGNHGVTAHAFCQPKKVVVEGK 180

Query: 283  NMVQSEFWKGRWMEWGPYPTLVQNLEVQELSDSCVTSGNVDKNRINQNGEILRSSCYEFE 342
            NMVQSEFWKGRW+EWGPYP L QNLE++E S  C TSGNVD+N  NQNGE+LRSSC EFE
Sbjct: 181  NMVQSEFWKGRWVEWGPYPRLPQNLEIEERSGFCETSGNVDENGTNQNGEMLRSSCSEFE 240

Query: 343  DDALLLGNSAPKRYLQSFLAKVKTIEYEDDIWTI-------------------------- 402
            +DALL G+SA KRYLQSFLAKVKT+E+ED+IWT+                          
Sbjct: 241  NDALLSGDSASKRYLQSFLAKVKTVEFEDNIWTMYPEKTSVPCFTKVVSFNIFNYNLPPP 300

Query: 403  ------------------------------------------------------------ 462
                                                                        
Sbjct: 301  NSDDSFVNEQSWHEIILGTRSNMSPTSFDTHFLSDILSNVLGIGMNKSYKCSRIFSSDSH 360

Query: 463  ---------VDSVSADRVDETGSRNDTLILVARVGNLGIKWVSSVKFEKSLYISPLMEWA 522
                     +D+VS +   ET SRN TLILVARVGNLGIKWVSSVKFEKSLYI+P+MEWA
Sbjct: 361  FLIGFVLKRMDTVSVEEGAETESRNGTLILVARVGNLGIKWVSSVKFEKSLYITPVMEWA 420

Query: 523  DFCFSNDFLLCLSDSGFIFVHSALSGKHVTCIDVLQACGLNPKYLHLKQDLQMNQVDQVQ 582
            DFCFSNDF++CLSDSGFIF+HSALSGKHV CIDVLQACGLN +YLH KQDLQ N VDQVQ
Sbjct: 421  DFCFSNDFIVCLSDSGFIFLHSALSGKHVACIDVLQACGLNSQYLHEKQDLQRNIVDQVQ 480

Query: 583  DDVSCSRDSFYDRRKFRRLLSDSHSSHFAVIDAFGIMYVVSAVDHMLEHYHGSENLFPHP 642
            DD+S  R SF++RRKFRRLLSDSHSSHFAVIDA G++YVVSA++HMLEH HG ENLFPH 
Sbjct: 481  DDLSYRR-SFHERRKFRRLLSDSHSSHFAVIDASGVIYVVSAIEHMLEHCHGYENLFPHS 540

Query: 643  HNFELGRAPVSWEVGGYDIGCQRNYSESLGSHSCRVFSMKNEGVSFWGNTRFDVLQNTQD 702
            H+F+LGR+ VSWEVGGYDIGCQRNYSESLG+HSCR FS +NEG S WGNT+ +VLQN +D
Sbjct: 541  HDFKLGRSLVSWEVGGYDIGCQRNYSESLGNHSCRDFSKQNEGASHWGNTKSNVLQNIKD 600

Query: 703  SKVCTGRKYKCSCLTASASILQNQKFQGGELQSCTMRKMFLSTWKTNEDDCFCFSPMGLT 762
            SKV  GR  KCSCLTASAS L++QK  GGELQSC MRKMFLSTWKTNEDDCF FSPMG+T
Sbjct: 601  SKVYRGRGDKCSCLTASASFLKDQKSVGGELQSCIMRKMFLSTWKTNEDDCFGFSPMGIT 660

Query: 763  QFIKRCNISGQKCSQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKEELVGEAVGCTSQGSL 822
            Q+IKRCN+SGQ  SQVVHFDLHLKSEVHDDSCLKSQMIFVDGRK+++VGEAVGCTSQGSL
Sbjct: 661  QYIKRCNMSGQNISQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKKDIVGEAVGCTSQGSL 720

Query: 823  YLVTNNGLSVVLPSVTIASDSLPSESVARLQPGVLLGTPNQVKGLELKESNCSWSPWQVE 882
            YLVTNNGLSVVLPSVTI S+SLP E VAR QP ++LGT NQVK LELKES C WSPWQVE
Sbjct: 721  YLVTNNGLSVVLPSVTIPSNSLPPEYVARSQPDIILGTANQVKDLELKESKCPWSPWQVE 780

Query: 883  VLDRVLLYESIDEADRLCSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEEV 942
            VLDRVLLYESIDEADRLCSENGWDLKVVRMR FQM LHYLRFDELERSLEMLV+VDLEE 
Sbjct: 781  VLDRVLLYESIDEADRLCSENGWDLKVVRMRCFQMALHYLRFDELERSLEMLVEVDLEEE 840

Query: 943  GILRLLFAAVHLMFQKAGTDNDISAASRLLALGTRFATRMTHRYGMAEFKRNATMFNDFS 1002
            GILRLLFAAVHLMFQKAG DNDISAASRLLALGT FATRM HRYGMAEFKRNAT FNDFS
Sbjct: 841  GILRLLFAAVHLMFQKAGNDNDISAASRLLALGTHFATRMIHRYGMAEFKRNATTFNDFS 900

Query: 1003 SSQEISILPHFPFRKQNELEYSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEAPI 1062
            S QEISILPH PF+KQN  E+SRKLHEMSHFLEIIRNLH HLSSKFKRP QELV GE  +
Sbjct: 901  SGQEISILPHLPFQKQNVSEHSRKLHEMSHFLEIIRNLHGHLSSKFKRPSQELVVGE--V 960

Query: 1063 SDENNLLLDEPQLVSTDIIPLGSTSQYELSFPSNDLSSTVVDGLVMMPMVSESQLDSEDL 1122
            SD+ +LLLDEPQLVSTDII LG+TSQYELSFPSNDL+S VVDGL +MPMVS SQ +SEDL
Sbjct: 961  SDQTSLLLDEPQLVSTDIISLGNTSQYELSFPSNDLNSNVVDGLAIMPMVSGSQFNSEDL 1020

Query: 1123 NGDSAVVPQGVLEKKVVPLENPKQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINH 1182
            + DSAVVPQGVLEKKVVPLENP QMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINH
Sbjct: 1021 DEDSAVVPQGVLEKKVVPLENPNQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINH 1080

Query: 1183 LRELIGEKEPHDTFSEIRDIGRAIAYDLFLKGEIGLAIATLQRLGDDIEVSLKQLLYGTI 1242
            LRELI E EPHDTFSEIRDIGRAIAYDLFLKGE GLAIATLQRLGDDIEVSLKQLLYGTI
Sbjct: 1081 LRELIEENEPHDTFSEIRDIGRAIAYDLFLKGETGLAIATLQRLGDDIEVSLKQLLYGTI 1140

Query: 1243 NRSFRVEIAAEMKKYGYLGPFDQRMMDRIVHIERLYPSSNFWKTFLSRQRANMGSPSSSA 1302
            NRSFRVEIAAEMKKYGYLGPFDQRMMDRI+HIERLYPSSNFWKTFLSRQ+ANMG PSSS 
Sbjct: 1141 NRSFRVEIAAEMKKYGYLGPFDQRMMDRILHIERLYPSSNFWKTFLSRQKANMGFPSSSN 1200

Query: 1303 TPGENDLRTLRFHLINNTIIDCGEVDGVVLGSWPNANENSFVLEITEDNAHVGYWAAAAI 1362
            +PGEN+LRTLRFHLINNT IDCGEVDGVVLGSWPNANE+S V+E TEDNAH+GYWAAAAI
Sbjct: 1201 SPGENELRTLRFHLINNTFIDCGEVDGVVLGSWPNANESSSVVETTEDNAHIGYWAAAAI 1260

Query: 1363 WTNTWDQRTTDRILLDRSLGIGIPVAWESQLDYHICHNNWDEVSRLLDMIPVSNLLDGSL 1422
            WTNTWDQRTTDRILLD+SLG GI VAWESQLDYHICHNNWD VSRLLDMIP +N+LDGSL
Sbjct: 1261 WTNTWDQRTTDRILLDQSLGNGIHVAWESQLDYHICHNNWDGVSRLLDMIPDANILDGSL 1320

Query: 1423 QVSLDGLQSASAVGCNRESSFYSNYLYPLEELDAVCLYIPKAKIFRFSANIMCSKWLGML 1482
            QVSLDGLQSASAVGCNRES+FYSNYLYPLEELDAVCLYIP  KIF+FSANIMCSK LGML
Sbjct: 1321 QVSLDGLQSASAVGCNRESTFYSNYLYPLEELDAVCLYIPNVKIFKFSANIMCSKLLGML 1380

Query: 1483 LEEKLARQFIFLKEYWEGTMELVPLLARSGFITNRLDEIASVDDHISSSVDQRSTNNGGA 1542
            LEEKLAR FIFLKEYWEG+MELVPLLARSGFI +RLDEIAS+DDHISSSVDQRS+N GGA
Sbjct: 1381 LEEKLARHFIFLKEYWEGSMELVPLLARSGFIIHRLDEIASMDDHISSSVDQRSSNKGGA 1440

Query: 1543 FYVDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLVVDNNSVRSLLEAAGDCQWARWLLL 1602
            + VDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLV+DNNSV SLLEAAGDC WARWLLL
Sbjct: 1441 YSVDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLVIDNNSVHSLLEAAGDCHWARWLLL 1500

Query: 1603 SRIRGCEYDASFSNARSIMSLNLVHDPNLGVRDIDEIICTVGDIAEGGGEMAALATLMYA 1662
            SRIRGCEYDASFSNARSIM LNLVHDPNL VR+I+EII TV DIAEGGGEMAALATLMYA
Sbjct: 1501 SRIRGCEYDASFSNARSIMPLNLVHDPNLSVRNIEEIISTVADIAEGGGEMAALATLMYA 1560

Query: 1663 PSPIQDCLSSSGVNRHSSSSAQCTLENLRPALQRFPTLCRALVTSAFQQDTTCNFLGPKL 1722
            PSPIQDCLSS GVNRHSSSSAQCTLENLRP LQRFPTLCRALVTSAFQQDTTCNFLGPK 
Sbjct: 1561 PSPIQDCLSSCGVNRHSSSSAQCTLENLRPVLQRFPTLCRALVTSAFQQDTTCNFLGPKW 1620

Query: 1723 KNALSEYLHWRSSIFFSAGRDTSLLHMLPCWFPKAVRRLLQLYVQGPLGWQSLSALPTGQ 1782
            KNALSEYLHWR+S  FSAGRDTSLLHMLPCWFPKAVRRLL LYVQGPLGWQS+S LPTGQ
Sbjct: 1621 KNALSEYLHWRNSTIFSAGRDTSLLHMLPCWFPKAVRRLLHLYVQGPLGWQSISGLPTGQ 1680

Query: 1783 TLWERDVHFFMNDYEHSEISPISWEATIQKHIEDELYDSSLKETGVGLEHNLHRGRAFSA 1842
             LWERDV+F MND EHSEISPISWEA IQKHIEDELYDSSLKETG+GLEHNLHRGRA SA
Sbjct: 1681 ALWERDVYFVMNDDEHSEISPISWEAAIQKHIEDELYDSSLKETGLGLEHNLHRGRALSA 1740

Query: 1843 FNHLLAARVQKLKSEIQPGSATGPSNTQFDLQALFAPLTLREQSLLSSIIPLAITHFENS 1902
            FNHLL ARVQKLKSEIQ GSA G SN Q DLQ LFAPLT  EQSLLSS+IPLAITHFENS
Sbjct: 1741 FNHLLVARVQKLKSEIQSGSAIGQSNIQLDLQTLFAPLTPMEQSLLSSVIPLAITHFENS 1800

Query: 1903 VLVASCAFLLELCGLSASMLRVDVAALRRISTFNKSGQSFENFRQLSPKGSAFHPVPLES 1962
            VLVASCAFLLEL GLSASML VDVAALRRISTF KSGQSFENFRQLSPKGSAFHPVPLES
Sbjct: 1801 VLVASCAFLLELGGLSASMLHVDVAALRRISTFYKSGQSFENFRQLSPKGSAFHPVPLES 1860

Query: 1963 DKVETLARALADEYLHQESSSVNKPKGTSNSAPSKRCPQV-LFVLQHLEEVSLPQVVDGN 2022
            DK+ETLARALADEYLHQESSSVN+PKGTS+SAP KRC QV LFVLQHLEEVSLP +VDGN
Sbjct: 1861 DKIETLARALADEYLHQESSSVNEPKGTSDSAPPKRCSQVLLFVLQHLEEVSLPHMVDGN 1920

Query: 2023 SCGSWLLSGKGDGTELRNQQKAASHYWNLVTVFCRMHRLPPSSKYLALLARDNDWVGFLT 2082
            SCGSWLL GKGDGTELRNQQK AS +WNLV VFCRMHR+P SSKYLALLARDNDWVGFLT
Sbjct: 1921 SCGSWLLCGKGDGTELRNQQKTASQHWNLVRVFCRMHRIPLSSKYLALLARDNDWVGFLT 1980

Query: 2083 EAHVGGYPFDTVIQVASKEFSDPRLKIHILTVLKAVQSRKNPGPSSYSDTEDKKSQTSFL 2142
            EAHVGGYPFDTVIQVASKEFSDPRLKIHILTVLK+VQSRK+PGPSSYSDTE+KK QT+ L
Sbjct: 1981 EAHVGGYPFDTVIQVASKEFSDPRLKIHILTVLKSVQSRKSPGPSSYSDTEEKKGQTTIL 2040

Query: 2143 DGSTYIPVELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLE 2202
            DGS YIPVELFTILAECE KKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLE
Sbjct: 2041 DGSMYIPVELFTILAECENKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLE 2100

Query: 2203 ITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRSPAFHYCRKNPKRRRTMDSISK 2262
            ITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRS AFHYCRKNPKRRRTMDS+S 
Sbjct: 2101 ITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRSSAFHYCRKNPKRRRTMDSVSD 2160

Query: 2263 DPSVGVISDTFSASTGASTNVSGGFIVKEEGKIVQERRPISVSYDSDEAPSSLSKMVSVL 2322
            DPSV  ISD FSAS  AST V G  IV EEGKIVQE + ISVSYDSDEAPSSLSKMVSVL
Sbjct: 2161 DPSVIAISDNFSASR-ASTKVPGDSIVMEEGKIVQEPQRISVSYDSDEAPSSLSKMVSVL 2220

Query: 2323 CEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSVRVKDEASFSHA 2382
            CEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQM L+EASAHLGSFS RVKDEA +SHA
Sbjct: 2221 CEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMCLAEASAHLGSFSARVKDEAIYSHA 2280

Query: 2383 NVEGEEHTGTSWTGSTAVKAANAVLSVCPSPYERKCLLKLLAATDFGDGGFAAAYYQRLY 2442
            NVEGEE+TGTSWTGSTAVKAANAVLSVCPSPYER+CLLKLLAATDFGDGGF+A+YY+RLY
Sbjct: 2281 NVEGEENTGTSWTGSTAVKAANAVLSVCPSPYERRCLLKLLAATDFGDGGFSASYYRRLY 2340

Query: 2443 WKINLAEPSIRIDDGLHLGNEALDDASLLTALENNGHWEQARNWAKQLEASGGSWKSASH 2502
            +KINLAEP +RIDDGLHLGNEALDDASLL+ALENN HWEQARNWAKQLEASGGSWKSASH
Sbjct: 2341 YKINLAEPLLRIDDGLHLGNEALDDASLLSALENNRHWEQARNWAKQLEASGGSWKSASH 2400

Query: 2503 HVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKD 2562
            HVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKD
Sbjct: 2401 HVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKD 2460

Query: 2563 LPAKELHELLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKNERDLNIN 2622
            LPAKELHELLLLSLQWLSGMFT+SYPVYPL+LLREIETKVWLLAVESEAELKNERDLNI+
Sbjct: 2461 LPAKELHELLLLSLQWLSGMFTVSYPVYPLNLLREIETKVWLLAVESEAELKNERDLNIS 2520

Query: 2623 NSSRECISRNSSSIIDWTASIISKMDKHISTMKNKSMDKHEVRENSQTHHKSHVLDAGLS 2682
             S REC SRNSSSIID TAS+ISKMDKHISTM NK+MDKHEVRENSQTHHKS VLDAGLS
Sbjct: 2521 GSIRECKSRNSSSIIDLTASMISKMDKHISTMTNKNMDKHEVRENSQTHHKSQVLDAGLS 2580

Query: 2683 TAGGGNTKAKRRTKGSVLIRRPLVDSTDMNTNPEDGCVPSNFKNDLHLQDENLKMDTSLS 2742
            T GGGNTK KRRTKGS+L+RRPL DS DMNTN EDG + SN KNDLH+QDENLKMDTS S
Sbjct: 2581 TTGGGNTKTKRRTKGSMLLRRPLADSADMNTNSEDGYISSNVKNDLHMQDENLKMDTSFS 2640

Query: 2743 GWEERIGPAEVDRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDAAFTLAAISTPNR 2802
            GWEERIGPAEV+RA+LSLLEFGQITAAKQLQQKLSP QVPSEFLLVDA+F LAAIST NR
Sbjct: 2641 GWEERIGPAEVERAILSLLEFGQITAAKQLQQKLSPEQVPSEFLLVDASFKLAAISTSNR 2700

Query: 2803 EVSMSMLDEDLCSVILAYDIPVDQYLNPLQVLEILATIFAEGGGRGLCRRVIAVVKAANV 2862
            EV M+MLD DLCSVIL+  I VDQYLNPLQVLE LATIFAEGGGRGLCRRVIAVVKAANV
Sbjct: 2701 EVPMAMLDGDLCSVILSSGIQVDQYLNPLQVLETLATIFAEGGGRGLCRRVIAVVKAANV 2760

Query: 2863 LGLPFSEAYNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAA 2922
            LGL FSEAYNKQPIELLQLLSLKAQESF EANFLVQTHSMPAASIAQILAESFLKGLLAA
Sbjct: 2761 LGLSFSEAYNKQPIELLQLLSLKAQESFAEANFLVQTHSMPAASIAQILAESFLKGLLAA 2820

Query: 2923 HRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELL 2982
            HRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELL
Sbjct: 2821 HRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELL 2880

Query: 2983 ILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILI 3042
            ILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILI
Sbjct: 2881 ILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILI 2940

Query: 3043 ENGQLELLLQKFSAAADTSAGSAEAVRGFRMAVLTSLKHFNPTDLDAFAKVYSHFDMKHE 3102
            EN QLE LLQKFSAA  TS GSAEAVRGFR+AVLTSLKH  P DLDAFAKVYSHFDMKHE
Sbjct: 2941 ENNQLEFLLQKFSAAISTSTGSAEAVRGFRIAVLTSLKHLIPNDLDAFAKVYSHFDMKHE 3000

Query: 3103 TAALLESQAEQSCEMWFRRYYKDQNADLLDAMHYYIAAAEVHSSIDAGNKTRRSCAQASL 3162
            TAALLE QAEQSCEMWFRRY KDQN DLLDAM YYI AAEV+SSIDAGNKTRRSCAQASL
Sbjct: 3001 TAALLERQAEQSCEMWFRRYDKDQNEDLLDAMLYYIKAAEVYSSIDAGNKTRRSCAQASL 3060

Query: 3163 VSLQIRMPDFKWLFQSETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKPE 3222
            VSLQIRMPDFKWLFQ+ETNARRALV+QSRFQEALIVAEAYDLDQPSEWALVIWNQMLKP+
Sbjct: 3061 VSLQIRMPDFKWLFQTETNARRALVDQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKPD 3120

Query: 3223 ILEEFVAEFVSVLPLHPSMLADIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRS 3246
            ILE+FVAEFV+VLPLHPSML+DIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRS
Sbjct: 3121 ILEDFVAEFVTVLPLHPSMLSDIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRS 3180

BLAST of Sgr026353 vs. ExPASy TrEMBL
Match: A0A0A0KKY4 (Spatacsin_C domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G486890 PE=4 SV=1)

HSP 1 Score: 5459.0 bits (14160), Expect = 0.0e+00
Identity = 2760/3240 (85.19%), Positives = 2917/3240 (90.03%), Query Frame = 0

Query: 103  MDSVSGGGGPAILQLHKWNPSQPQLNLSEYREAFISPARQILLLHSYKHEALLLPLNTGD 162
            MDSVSG  GPAILQL KWNPSQPQLNL+EYREAFISP RQ LLLHSYKHEALLLPLNTGD
Sbjct: 1    MDSVSGCEGPAILQLQKWNPSQPQLNLAEYREAFISPTRQNLLLHSYKHEALLLPLNTGD 60

Query: 163  VRCGNDLPNGYDINLKDLGSLAFSEVVSTAPRSEDAEGNVRCSNKSAVDIDNDSPTGNKS 222
            +RC ++ P  YD +LKD GSL FSE VSTA RSEDAEG+V+CSN+S VDID  SPT ++S
Sbjct: 61   IRCSDNFPKEYDTHLKDSGSLTFSE-VSTAFRSEDAEGDVQCSNQSVVDIDTHSPTRDES 120

Query: 223  SRSSCNNFLGDVSSLAWGLCGDTYKKRKDSSFKEILFVSGNHGVTAHAFCQPNKTNEEAK 282
            S +SCNNFLGDVSSLAWGLCGD YKK +D  F EILFVSG+HGVTAHAFC+P KT  EAK
Sbjct: 121  SGASCNNFLGDVSSLAWGLCGDNYKKHEDYFFMEILFVSGSHGVTAHAFCEPKKTVAEAK 180

Query: 283  NMVQSEFWKGRWMEWGPYPTLVQNLEVQELSDSCVTSGNVDKNRINQNGEILRSSCYEFE 342
            NMVQSEF KGRW+EWGPYPTL Q L  QE S S  T GNVD+N  NQNGE+L SS  + E
Sbjct: 181  NMVQSEFRKGRWVEWGPYPTLPQILGAQESSGSSETCGNVDENGRNQNGEMLPSSNSKCE 240

Query: 343  DDALLLGNSAPKRYLQSFLAKVKTIEYEDDIWT--------------------------- 402
            +DALL GNS  KRYL+SFLAKVKTIEYEDDIWT                           
Sbjct: 241  NDALLSGNSTSKRYLRSFLAKVKTIEYEDDIWTMYPEKSSVPCFTKVVSFNIFNYNLPPP 300

Query: 403  ------------------------------------------------------------ 462
                                                                        
Sbjct: 301  NSVDNSSVNEQNWHEIILGTPGNTRSTSSDTRVLSDILSNVFGIGMNKSYKCSRVFASNS 360

Query: 463  ---------IVDSVSADRVDETGSRNDTLILVARVGNLGIKWVSSVKFEKSLYISPLMEW 522
                     +V+SVSAD   ET SRNDTLILVAR G+LGIKWVSSV+FEKS Y+SP MEW
Sbjct: 361  HILIGFVLKMVESVSADEDAETESRNDTLILVARAGSLGIKWVSSVEFEKSQYVSPRMEW 420

Query: 523  ADFCFSNDFLLCLSDSGFIFVHSALSGKHVTCIDVLQACGLNPKYLHLKQDLQMNQVDQV 582
            ADFCFSNDF++CLSDSGFIF+HSALSGKHVT IDVLQACGL+PKYLH KQDLQM QVD V
Sbjct: 421  ADFCFSNDFIVCLSDSGFIFIHSALSGKHVTRIDVLQACGLDPKYLHEKQDLQMKQVDHV 480

Query: 583  QDDVSCSRDSFYDRRKFRRLLSDSHSSHFAVIDAFGIMYVVSAVDHMLEHYHGSENLFPH 642
            QD VSC R SFY  RKFRRLLSDS SS FAVID FG+MYVVSAVDHML+HY+GSENL  H
Sbjct: 481  QDVVSCRRGSFYGTRKFRRLLSDSLSSRFAVIDTFGVMYVVSAVDHMLDHYYGSENLLGH 540

Query: 643  PHNFELGRAPVSWEVGGYDIGCQRNYSESLGSHSCRVFSMKNEGVSFWGNTRFDVLQNTQ 702
             HN EL + P SWE GGYDIGCQRNYSESLGSHSC   SMKNEG S WGN++++VLQN Q
Sbjct: 541  SHNLELVKVPASWEGGGYDIGCQRNYSESLGSHSCGNGSMKNEGASLWGNSKYNVLQNIQ 600

Query: 703  DSKVCTGRKYKCSCLTASASILQNQKFQGGELQSCTMRKMFLSTWKTNEDDCFCFSPMGL 762
            DSKV TG++YKCSCLTASA ILQ+Q+ QGGELQSC MRK+F+S  KTNE+DCFCFSPMGL
Sbjct: 601  DSKVYTGKRYKCSCLTASAPILQDQESQGGELQSCMMRKIFVSACKTNENDCFCFSPMGL 660

Query: 763  TQFIKRCNISGQKCSQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKEELVGEAVGCTSQGS 822
            TQ+I+RCN SGQ   QVVHFDLHLKSEVHDDSCLKSQM F+DGRK++LVGEAVGCTSQGS
Sbjct: 661  TQYIRRCNTSGQNSFQVVHFDLHLKSEVHDDSCLKSQMTFIDGRKKDLVGEAVGCTSQGS 720

Query: 823  LYLVTNNGLSVVLPSVTIASDSLPSESVARLQPGVLLGTPNQVKGLELKESNCSWSPWQV 882
            LYLVTN+GLSVVLPS+T++S+SLP ESVARLQPG LLGT NQVK LELKES C WSPWQV
Sbjct: 721  LYLVTNDGLSVVLPSITVSSNSLPYESVARLQPGSLLGTTNQVKDLELKESKCPWSPWQV 780

Query: 883  EVLDRVLLYESIDEADRLCSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEE 942
            EVLDRVLLYESIDEADRLCSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEE
Sbjct: 781  EVLDRVLLYESIDEADRLCSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEE 840

Query: 943  VGILRLLFAAVHLMFQKAGTDNDISAASRLLALGTRFATRMTHRYGMAEFKRNATMFNDF 1002
             GILRLLFAAVHLMFQKAG DNDISAASRLLALGT FATRM H+YGMAE KRNAT FNDF
Sbjct: 841  EGILRLLFAAVHLMFQKAGNDNDISAASRLLALGTHFATRMIHQYGMAELKRNATTFNDF 900

Query: 1003 SSSQEISILPHFPFRKQNELEYSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEAP 1062
            SSSQEISI P FPFR QNEL+YSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEA 
Sbjct: 901  SSSQEISIFPDFPFRMQNELDYSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEAL 960

Query: 1063 ISDENNLLLDEPQLVSTDIIPLGSTSQYELSFPSNDLSSTVVDGLVMMPMVSESQLDSED 1122
            ISD+ + LLDEPQ VSTD+IP GSTSQYELSFPSNDL+S V+DGLVMMPM+S SQ+DSED
Sbjct: 961  ISDQTSQLLDEPQFVSTDVIPSGSTSQYELSFPSNDLNSNVIDGLVMMPMISGSQMDSED 1020

Query: 1123 LNGDSAVVPQGVLEKKVVPLENPKQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHIN 1182
            L+GDSAVVPQGV EKKV+PLENP QMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHIN
Sbjct: 1021 LDGDSAVVPQGVFEKKVLPLENPNQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHIN 1080

Query: 1183 HLRELIGEKEPHDTFSEIRDIGRAIAYDLFLKGEIGLAIATLQRLGDDIEVSLKQLLYGT 1242
            H+RELIGE EPHDTFSEIRDIGRAIAYDLFLKGE G+AIATLQRLGDDIEVSLKQLLYGT
Sbjct: 1081 HVRELIGENEPHDTFSEIRDIGRAIAYDLFLKGETGVAIATLQRLGDDIEVSLKQLLYGT 1140

Query: 1243 INRSFRVEIAAEMKKYGYLGPFDQRMMDRIVHIERLYPSSNFWKTFLSRQRANMGSPSSS 1302
            INR+FRVEIAAEM+KYGYLGPFDQRMMD I+HIERLYPSSNFWKTFLSRQ+ANMG PSSS
Sbjct: 1141 INRTFRVEIAAEMEKYGYLGPFDQRMMDIILHIERLYPSSNFWKTFLSRQKANMGFPSSS 1200

Query: 1303 ATPGENDLRTLRFHLINNTIIDCGEVDGVVLGSWPNANENSFVLEITEDNAHVGYWAAAA 1362
             +PGENDL+TL FH+INNTIIDCGEVDGVVLGSWP+ANENS VLEI EDN H+GYWAAAA
Sbjct: 1201 NSPGENDLKTLHFHVINNTIIDCGEVDGVVLGSWPDANENSPVLEINEDNVHMGYWAAAA 1260

Query: 1363 IWTNTWDQRTTDRILLDRSLGIGIPVAWESQLDYHICHNNWDEVSRLLDMIPVSNLLDGS 1422
            IWTNTWDQRTTDRILLD+SL IGI V WESQLDYHICHNNWD VSRLLDMIPV+NLLDGS
Sbjct: 1261 IWTNTWDQRTTDRILLDQSLDIGIHVTWESQLDYHICHNNWDGVSRLLDMIPVANLLDGS 1320

Query: 1423 LQVSLDGLQSASAVGCNRESSFYSNYLYPLEELDAVCLYIPKAKIFRFSANIMCSKWLGM 1482
            LQVSLDGLQ+A+AVGCNRESSFY NYLYPLEELDA+CLYIP AKIFRFS NIMCSKWLG 
Sbjct: 1321 LQVSLDGLQTATAVGCNRESSFYGNYLYPLEELDAICLYIPNAKIFRFSTNIMCSKWLGA 1380

Query: 1483 LLEEKLARQFIFLKEYWEGTMELVPLLARSGFITNRLDEIASVDDHISSSVDQRSTNNGG 1542
            LLEEKLAR FIFLKEYWEGTMELVPLLAR+GFIT RLDEI  +DDHI+SSV Q ++N GG
Sbjct: 1381 LLEEKLARYFIFLKEYWEGTMELVPLLARAGFITPRLDEIDFMDDHINSSVGQSTSNKGG 1440

Query: 1543 AFYVDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLVVDNNSVRSLLEAAGDCQWARWLL 1602
            +F VDS+QALYKVFIHHCSQYNLPFLLDLYLDHHKL VDNNSVRSLLEAAGDCQWARWLL
Sbjct: 1441 SFSVDSMQALYKVFIHHCSQYNLPFLLDLYLDHHKLAVDNNSVRSLLEAAGDCQWARWLL 1500

Query: 1603 LSRIRGCEYDASFSNARSIMSLNLVHDPNLGVRDIDEIICTVGDIAEGGGEMAALATLMY 1662
            LSR RGCEYDASF+NARSIMS NLVHDPNL VR+IDEII TV DIAEG GEMAALATLMY
Sbjct: 1501 LSRTRGCEYDASFANARSIMSPNLVHDPNLSVRNIDEIISTVADIAEGAGEMAALATLMY 1560

Query: 1663 APSPIQDCLSSSGVNRHSSSSAQCTLENLRPALQRFPTLCRALVTSAFQQDTTCNFLGPK 1722
            APSPIQDCL+ SGVNRHSSSSAQCTLENLRP LQRFPTLCRAL TSAFQQDT CNFLGPK
Sbjct: 1561 APSPIQDCLNCSGVNRHSSSSAQCTLENLRPVLQRFPTLCRALFTSAFQQDTACNFLGPK 1620

Query: 1723 LKNALSEYLHWRSSIFFSAGRDTSLLHMLPCWFPKAVRRLLQLYVQGPLGWQSLSALPTG 1782
             KNALSEYLHWR+ IF SAGRDTSLLHMLPCWFPK VRRLLQLYVQGPLGWQS+S LPTG
Sbjct: 1621 SKNALSEYLHWRNIIFLSAGRDTSLLHMLPCWFPKTVRRLLQLYVQGPLGWQSVSGLPTG 1680

Query: 1783 QTLWERDVHFFMNDYEHSEISPISWEATIQKHIEDELYDSSLKETGVGLEHNLHRGRAFS 1842
            QT+WERDV+FFMND EHSEISPISWEATIQKHIEDELYDSSLKETG+GLEHNLHRGRA S
Sbjct: 1681 QTIWERDVYFFMNDDEHSEISPISWEATIQKHIEDELYDSSLKETGLGLEHNLHRGRALS 1740

Query: 1843 AFNHLLAARVQKLKSEIQPGSATGPSNTQFDLQALFAPLTLREQSLLSSIIPLAITHFEN 1902
            AFNHLLAARVQKLKSE+Q  SA G SN Q DLQ LFAPLT  EQSLLSSIIPLAITHFEN
Sbjct: 1741 AFNHLLAARVQKLKSEVQSSSAPGHSNVQLDLQTLFAPLTPGEQSLLSSIIPLAITHFEN 1800

Query: 1903 SVLVASCAFLLELCGLSASMLRVDVAALRRISTFNKSGQSFENFRQLSPKGSAFHPVPLE 1962
            SVLVASCAFLLEL GLSASMLRVDVAALRRISTF KSGQSFENFRQLSPKGSAFHPVPLE
Sbjct: 1801 SVLVASCAFLLELGGLSASMLRVDVAALRRISTFYKSGQSFENFRQLSPKGSAFHPVPLE 1860

Query: 1963 SDKVETLARALADEYLHQESSSVNKPKGTSNSAPSKRCPQV-LFVLQHLEEVSLPQVVDG 2022
            SDK+E LARALADEYLHQESS V + KG+S+S P KRCP V LFVLQHLEEVSLPQVVDG
Sbjct: 1861 SDKIENLARALADEYLHQESSGVKRSKGSSDSEPPKRCPHVLLFVLQHLEEVSLPQVVDG 1920

Query: 2023 NSCGSWLLSGKGDGTELRNQQKAASHYWNLVTVFCRMHRLPPSSKYLALLARDNDWVGFL 2082
            NSCGSWL SGKGDGTELRNQQKAASHYWNLVTVFCRMH LP SSKYLALLARDNDWVGFL
Sbjct: 1921 NSCGSWLSSGKGDGTELRNQQKAASHYWNLVTVFCRMHSLPLSSKYLALLARDNDWVGFL 1980

Query: 2083 TEAHVGGYPFDTVIQVASKEFSDPRLKIHILTVLKAVQSRKNPGPSSYSDTEDKKSQTSF 2142
            TEAHVGGYPFDTVIQVAS+EFSDPRLKIHILTVLKAVQ RK+ GPSS+ DTE+KK QT+F
Sbjct: 1981 TEAHVGGYPFDTVIQVASREFSDPRLKIHILTVLKAVQLRKSSGPSSHYDTEEKKGQTTF 2040

Query: 2143 LDGSTYIPVELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWL 2202
            LDG  Y+PVELFTILAECEKKKNPGKALLI+AEELSWSILAMIASCF DVSPLSCLTVWL
Sbjct: 2041 LDGKMYVPVELFTILAECEKKKNPGKALLIRAEELSWSILAMIASCFSDVSPLSCLTVWL 2100

Query: 2203 EITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRSPAFHYCRKNPKRRRTMDSIS 2262
            EITAARETTSIKVNDIASQIAENVGAAVEATNTLP GCRSPAFHYCRKNPKRRRT+  IS
Sbjct: 2101 EITAARETTSIKVNDIASQIAENVGAAVEATNTLPVGCRSPAFHYCRKNPKRRRTVVFIS 2160

Query: 2263 KDPSVGVISDTFSASTGASTNVSGGFIVKEEGKIVQERRPISVSYDSDEAPSSLSKMVSV 2322
            ++ SVGV+SD  SAS G STNVSG  IVKEEGK+VQER+PISVSYDSDEA SSLSKMVSV
Sbjct: 2161 EEQSVGVMSDNSSASAGVSTNVSGDCIVKEEGKVVQERQPISVSYDSDEAASSLSKMVSV 2220

Query: 2323 LCEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSVRVKDEASFSH 2382
            LCEQ+L+LPLLRAFEMFLPSCSLL FIRALQAFSQMRL+EASAHLGSFSVRVKDEAS+SH
Sbjct: 2221 LCEQQLYLPLLRAFEMFLPSCSLLSFIRALQAFSQMRLAEASAHLGSFSVRVKDEASYSH 2280

Query: 2383 ANVEGEEHTGTSWTGSTAVKAANAVLSVCPSPYERKCLLKLLAATDFGDGGFAAAYYQRL 2442
            +NVEGEE+ GTSWTGSTAVKAANAVLSVCPSPYER+CLLKLLAA+DFGDGGFAA YY+RL
Sbjct: 2281 SNVEGEENIGTSWTGSTAVKAANAVLSVCPSPYERRCLLKLLAASDFGDGGFAATYYRRL 2340

Query: 2443 YWKINLAEPSIRIDDGLHLGNEALDDASLLTALENNGHWEQARNWAKQLEASGGSWKSAS 2502
            YWKI+LAEP +RIDDGLHLGNEALDD+SLLTALENNGHWEQARNWAKQLEASGGSWKSAS
Sbjct: 2341 YWKIDLAEPLLRIDDGLHLGNEALDDSSLLTALENNGHWEQARNWAKQLEASGGSWKSAS 2400

Query: 2503 HHVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEK 2562
            HHVTETQAESMVAEWKEFLWDVQEERVALWGHCQALF+RYSFPALQAGLFFLKHAEAVEK
Sbjct: 2401 HHVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFVRYSFPALQAGLFFLKHAEAVEK 2460

Query: 2563 DLPAKELHELLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKNERDLNI 2622
            DLPAKELHELLLLSLQWLSGMFTMS PVYPLHLLREIETKVWLLAVESEAELKNERDLNI
Sbjct: 2461 DLPAKELHELLLLSLQWLSGMFTMSNPVYPLHLLREIETKVWLLAVESEAELKNERDLNI 2520

Query: 2623 NNSSRECISRNSSSIIDWTASIISKMDKHISTMKNKSMDKHEVRENSQTHHKSHVLDAGL 2682
            + SSRECISRNSSSIID TA++ISKMDKHISTMKNK++DKHE RENSQTHHK  +LDAG+
Sbjct: 2521 SGSSRECISRNSSSIIDSTANMISKMDKHISTMKNKNIDKHEARENSQTHHKGQILDAGI 2580

Query: 2683 STAGGGNTKAKRRTKGSVLIRRPLVDSTDMNTNPEDGCVPSNFKNDLHLQDENLKMDTSL 2742
            STAGGGNTKAKRRTKGS+L+RR +VDSTDMNTNPEDG + SNFKNDL  QDEN KMDTS 
Sbjct: 2581 STAGGGNTKAKRRTKGSMLLRRSVVDSTDMNTNPEDGYISSNFKNDLQSQDENSKMDTSF 2640

Query: 2743 SGWEERIGPAEVDRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDAAFTLAAISTPN 2802
            SGWEER+GPAE DRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDA+F LAA+STPN
Sbjct: 2641 SGWEERVGPAEADRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDASFKLAALSTPN 2700

Query: 2803 REVSMSMLDEDLCSVILAYDIPVDQYLNPLQVLEILATIFAEGGGRGLCRRVIAVVKAAN 2862
            REVSMSM+D+DL SVIL+ +IPVD+YLNPLQVLEILATIFAEG GRGLC+RVIAVVKAAN
Sbjct: 2701 REVSMSMVDDDLSSVILSNNIPVDRYLNPLQVLEILATIFAEGSGRGLCKRVIAVVKAAN 2760

Query: 2863 VLGLPFSEAYNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLA 2922
            VLGL FSEAYNKQPIELLQLLSLKAQESFEEAN LVQTHSMPAASIAQILAESFLKGLLA
Sbjct: 2761 VLGLSFSEAYNKQPIELLQLLSLKAQESFEEANLLVQTHSMPAASIAQILAESFLKGLLA 2820

Query: 2923 AHRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVEL 2982
            AHRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVEL
Sbjct: 2821 AHRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVEL 2880

Query: 2983 LILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGIL 3042
            LILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGIL
Sbjct: 2881 LILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGIL 2940

Query: 3043 IENGQLELLLQKFSAAADTSAGSAEAVRGFRMAVLTSLKHFNPTDLDAFAKVYSHFDMKH 3102
            IENGQLELLLQKFSAA +TSAGSAEAVRGFR+AVLTSLKHFNP DLDAFAKVYSHFDMKH
Sbjct: 2941 IENGQLELLLQKFSAAVNTSAGSAEAVRGFRIAVLTSLKHFNPNDLDAFAKVYSHFDMKH 3000

Query: 3103 ETAALLESQAEQSCEMWFRRYYKDQNADLLDAMHYYIAAAEVHSSIDAGNKTRRSCAQAS 3162
            ETAALLESQAEQSCEMWFRRY KDQN DLLDAMHYYI AAEV+SSIDAGNKTRRSCAQ+S
Sbjct: 3001 ETAALLESQAEQSCEMWFRRYDKDQNEDLLDAMHYYIKAAEVYSSIDAGNKTRRSCAQSS 3060

Query: 3163 LVSLQIRMPDFKWLFQSETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKP 3222
            LVSLQIRMPDFKWLFQ+ETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKP
Sbjct: 3061 LVSLQIRMPDFKWLFQTETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKP 3120

Query: 3223 EILEEFVAEFVSVLPLHPSMLADIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGR 3246
            EILEEFVAEFV+VLPLHPSML DIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGR
Sbjct: 3121 EILEEFVAEFVTVLPLHPSMLTDIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGR 3180

BLAST of Sgr026353 vs. ExPASy TrEMBL
Match: A0A6J1KV83 (uncharacterized protein LOC111497504 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111497504 PE=4 SV=1)

HSP 1 Score: 5454.0 bits (14147), Expect = 0.0e+00
Identity = 2773/3239 (85.61%), Positives = 2914/3239 (89.97%), Query Frame = 0

Query: 103  MDSVSGGGGPAILQLHKWNPSQPQLNLSEYREAFISPARQILLLHSYKHEALLLPLNTGD 162
            MDSVSG  GPAILQL  WNPSQPQLNLSEYREAFISP R ILLLHSYKHEALLLPL+TGD
Sbjct: 1    MDSVSGCEGPAILQLQNWNPSQPQLNLSEYREAFISPTRSILLLHSYKHEALLLPLDTGD 60

Query: 163  VRCGNDLPNGYDINLKDLGSLAFSEVVSTAPRSEDAEGNVRCSNKSAVDIDNDSPTGNKS 222
             RC +D PN YDI+LKDLGS AFSE  ST    ED EG+V+CSN+ AVD+D DSPT N+ 
Sbjct: 61   NRCSDDFPNKYDIDLKDLGSSAFSEEASTTCWREDDEGDVQCSNRLAVDVDKDSPTKNRF 120

Query: 223  SRSSCNNFLGDVSSLAWGLCGDTYKKRKDSSFKEILFVSGNHGVTAHAFCQPNKTNEEAK 282
            SRSSCNNFLGDVSSLAWGLCG++Y K +DSSFKEILFVSGNHGVTAHAFCQP K   E K
Sbjct: 121  SRSSCNNFLGDVSSLAWGLCGESYTKHEDSSFKEILFVSGNHGVTAHAFCQPKKVVVEGK 180

Query: 283  NMVQSEFWKGRWMEWGPYPTLVQNLEVQELSDSCVTSGNVDKNRINQNGEILRSSCYEFE 342
            NMVQSEFWKGRW+EWGPYP L QNLE++E S  C TSGNVD+N  NQNGE+LRSSC EFE
Sbjct: 181  NMVQSEFWKGRWVEWGPYPRLPQNLEIEERSGFCETSGNVDENGTNQNGEMLRSSCSEFE 240

Query: 343  DDALLLGNSAPKRYLQSFLAKVKTIEYEDDIWTI-------------------------- 402
            +DALL G+SA KRYLQSFLAKVKT+E+ED+IWT+                          
Sbjct: 241  NDALLSGDSASKRYLQSFLAKVKTVEFEDNIWTMYPEKTSVPCFTKVVSFNIFNYNLPPP 300

Query: 403  ------------------------------------------------------------ 462
                                                                        
Sbjct: 301  NSDDSFVNEQSWHEIILGTRSNMSPTSFDTHFLSDILSNVLGIGMNKSYKCSRIFSSDSH 360

Query: 463  ---------VDSVSADRVDETGSRNDTLILVARVGNLGIKWVSSVKFEKSLYISPLMEWA 522
                     +D+VS +   ET SRN TLILVARVGNLGIKWVSSVKFEKSLYI+P+MEWA
Sbjct: 361  FLIGFVLKRMDTVSVEEGAETESRNGTLILVARVGNLGIKWVSSVKFEKSLYITPVMEWA 420

Query: 523  DFCFSNDFLLCLSDSGFIFVHSALSGKHVTCIDVLQACGLNPKYLHLKQDLQMNQVDQVQ 582
            DFCFSNDF++CLSDSGFIF+HSALSGKHV CIDVLQACGLN +YLH KQDLQ N VDQVQ
Sbjct: 421  DFCFSNDFIVCLSDSGFIFLHSALSGKHVACIDVLQACGLNSQYLHEKQDLQRNIVDQVQ 480

Query: 583  DDVSCSRDSFYDRRKFRRLLSDSHSSHFAVIDAFGIMYVVSAVDHMLEHYHGSENLFPHP 642
            DD+S  R SF++RRKFRRLLSDSHSSHFAVIDA G++YVVSA++HMLEH HG ENLFPH 
Sbjct: 481  DDLSYRR-SFHERRKFRRLLSDSHSSHFAVIDASGVIYVVSAIEHMLEHCHGYENLFPHS 540

Query: 643  HNFELGRAPVSWEVGGYDIGCQRNYSESLGSHSCRVFSMKNEGVSFWGNTRFDVLQNTQD 702
            H+F+LGR+ VSWEVGGYDIGCQRNYSESLG+HSCR FS +NEG S WGNT+ +VLQN +D
Sbjct: 541  HDFKLGRSLVSWEVGGYDIGCQRNYSESLGNHSCRDFSKQNEGASHWGNTKSNVLQNIKD 600

Query: 703  SKVCTGRKYKCSCLTASASILQNQKFQGGELQSCTMRKMFLSTWKTNEDDCFCFSPMGLT 762
            SKV  GR  KCSCLTASAS L++QK  GGELQSC MRKMFLSTWKTNEDDCF FSPMG+T
Sbjct: 601  SKVYRGRGDKCSCLTASASFLKDQKSVGGELQSCIMRKMFLSTWKTNEDDCFGFSPMGIT 660

Query: 763  QFIKRCNISGQKCSQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKEELVGEAVGCTSQGSL 822
            Q+IKRCN+SGQ  SQVVHFDLHLKSEVHDDSCLKSQMIFVDGRK+++VGEAVGCTSQGSL
Sbjct: 661  QYIKRCNMSGQNISQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKKDIVGEAVGCTSQGSL 720

Query: 823  YLVTNNGLSVVLPSVTIASDSLPSESVARLQPGVLLGTPNQVKGLELKESNCSWSPWQVE 882
            YLVTNNGLSVVLPSVTI S+SLP E VAR QP ++LGT NQVK LELKES C WSPWQVE
Sbjct: 721  YLVTNNGLSVVLPSVTIPSNSLPPEYVARSQPDIILGTANQVKDLELKESKCPWSPWQVE 780

Query: 883  VLDRVLLYESIDEADRLCSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEEV 942
            VLDRVLLYESIDEADRLCSENGWDLKVVRMR FQM LHYLRFDELERSLEMLV+VDLEE 
Sbjct: 781  VLDRVLLYESIDEADRLCSENGWDLKVVRMRCFQMALHYLRFDELERSLEMLVEVDLEEE 840

Query: 943  GILRLLFAAVHLMFQKAGTDNDISAASRLLALGTRFATRMTHRYGMAEFKRNATMFNDFS 1002
            GILRLLFAAVHLMFQKAG DNDISAASRLLALGT FATRM HRYGMAEFKRNAT FNDFS
Sbjct: 841  GILRLLFAAVHLMFQKAGNDNDISAASRLLALGTHFATRMIHRYGMAEFKRNATTFNDFS 900

Query: 1003 SSQEISILPHFPFRKQNELEYSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEAPI 1062
            S QEISILPH PF+KQN  E+SRKLHEMSHFLEIIRNLH HLSSKFKRP QELV GE  +
Sbjct: 901  SGQEISILPHLPFQKQNVSEHSRKLHEMSHFLEIIRNLHGHLSSKFKRPSQELVVGE--V 960

Query: 1063 SDENNLLLDEPQLVSTDIIPLGSTSQYELSFPSNDLSSTVVDGLVMMPMVSESQLDSEDL 1122
            SD+ +LLLDEPQLVSTDII LG+TSQYELSFPSNDL+S VVDGL +MPMVS SQ +SEDL
Sbjct: 961  SDQTSLLLDEPQLVSTDIISLGNTSQYELSFPSNDLNSNVVDGLAIMPMVSGSQFNSEDL 1020

Query: 1123 NGDSAVVPQGVLEKKVVPLENPKQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINH 1182
            + DSAVVPQGVLEKKVVPLENP QMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINH
Sbjct: 1021 DEDSAVVPQGVLEKKVVPLENPNQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINH 1080

Query: 1183 LRELIGEKEPHDTFSEIRDIGRAIAYDLFLKGEIGLAIATLQRLGDDIEVSLKQLLYGTI 1242
            LRELI E EPHDTFSEIRDIGRAIAYDLFLKGE GLAIATLQRLGDDIEVSLKQLLYGTI
Sbjct: 1081 LRELIEENEPHDTFSEIRDIGRAIAYDLFLKGETGLAIATLQRLGDDIEVSLKQLLYGTI 1140

Query: 1243 NRSFRVEIAAEMKKYGYLGPFDQRMMDRIVHIERLYPSSNFWKTFLSRQRANMGSPSSSA 1302
            NRSFRVEIAAEMKKYGYLGPFDQRMMDRI+HIERLYPSSNFWKTFLSRQ+ANMG PSSS 
Sbjct: 1141 NRSFRVEIAAEMKKYGYLGPFDQRMMDRILHIERLYPSSNFWKTFLSRQKANMGFPSSSN 1200

Query: 1303 TPGENDLRTLRFHLINNTIIDCGEVDGVVLGSWPNANENSFVLEITEDNAHVGYWAAAAI 1362
            +PGEN+LRTLRFHLINNT IDCGEVDGVVLGSWPNANE+S V+E TEDNAH+GYWAAAAI
Sbjct: 1201 SPGENELRTLRFHLINNTFIDCGEVDGVVLGSWPNANESSSVVETTEDNAHIGYWAAAAI 1260

Query: 1363 WTNTWDQRTTDRILLDRSLGIGIPVAWESQLDYHICHNNWDEVSRLLDMIPVSNLLDGSL 1422
            WTNTWDQRTTDRILLD+SLG GI VAWESQLDYHICHNNWD VSRLLDMIP +N+LDGSL
Sbjct: 1261 WTNTWDQRTTDRILLDQSLGNGIHVAWESQLDYHICHNNWDGVSRLLDMIPDANILDGSL 1320

Query: 1423 QVSLDGLQSASAVGCNRESSFYSNYLYPLEELDAVCLYIPKAKIFRFSANIMCSKWLGML 1482
            QVSLDGLQSASAVGCNRES+FYSNYLYPLEELDAVCLYIP  KIF+FSANIMCSK LGML
Sbjct: 1321 QVSLDGLQSASAVGCNRESTFYSNYLYPLEELDAVCLYIPNVKIFKFSANIMCSKLLGML 1380

Query: 1483 LEEKLARQFIFLKEYWEGTMELVPLLARSGFITNRLDEIASVDDHISSSVDQRSTNNGGA 1542
            LEEKLAR FIFLKEYWEG+MELVPLLARSGFI +RLDEIAS+DDHISSSVDQRS+N GGA
Sbjct: 1381 LEEKLARHFIFLKEYWEGSMELVPLLARSGFIIHRLDEIASMDDHISSSVDQRSSNKGGA 1440

Query: 1543 FYVDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLVVDNNSVRSLLEAAGDCQWARWLLL 1602
            + VDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLV+DNNSV SLLEAAGDC WARWLLL
Sbjct: 1441 YSVDSVQALYKVFIHHCSQYNLPFLLDLYLDHHKLVIDNNSVHSLLEAAGDCHWARWLLL 1500

Query: 1603 SRIRGCEYDASFSNARSIMSLNLVHDPNLGVRDIDEIICTVGDIAEGGGEMAALATLMYA 1662
            SRIRGCEYDASFSNARSIM LNLVHDPNL VR+I+EII TV DIAEGGGEMAALATLMYA
Sbjct: 1501 SRIRGCEYDASFSNARSIMPLNLVHDPNLSVRNIEEIISTVADIAEGGGEMAALATLMYA 1560

Query: 1663 PSPIQDCLSSSGVNRHSSSSAQCTLENLRPALQRFPTLCRALVTSAFQQDTTCNFLGPKL 1722
            PSPIQDCLSS GVNRHSSSSAQCTLENLRP LQRFPTLCRALVTSAFQQDTTCNFLGPK 
Sbjct: 1561 PSPIQDCLSSCGVNRHSSSSAQCTLENLRPVLQRFPTLCRALVTSAFQQDTTCNFLGPKW 1620

Query: 1723 KNALSEYLHWRSSIFFSAGRDTSLLHMLPCWFPKAVRRLLQLYVQGPLGWQSLSALPTGQ 1782
            KNALSEYLHWR+S  FSAGRDTSLLHMLPCWFPKAVRRLL LYVQGPLGWQS+S LPTGQ
Sbjct: 1621 KNALSEYLHWRNSTIFSAGRDTSLLHMLPCWFPKAVRRLLHLYVQGPLGWQSISGLPTGQ 1680

Query: 1783 TLWERDVHFFMNDYEHSEISPISWEATIQKHIEDELYDSSLKETGVGLEHNLHRGRAFSA 1842
             LWERDV+F MND EHSEISPISWEA IQKHIEDELYDSSLKETG+GLEHNLHRGRA SA
Sbjct: 1681 ALWERDVYFVMNDDEHSEISPISWEAAIQKHIEDELYDSSLKETGLGLEHNLHRGRALSA 1740

Query: 1843 FNHLLAARVQKLKSEIQPGSATGPSNTQFDLQALFAPLTLREQSLLSSIIPLAITHFENS 1902
            FNHLL ARVQKLKSEIQ GSA G SN Q DLQ LFAPLT  EQSLLSS+IPLAITHFENS
Sbjct: 1741 FNHLLVARVQKLKSEIQSGSAIGQSNIQLDLQTLFAPLTPMEQSLLSSVIPLAITHFENS 1800

Query: 1903 VLVASCAFLLELCGLSASMLRVDVAALRRISTFNKSGQSFENFRQLSPKGSAFHPVPLES 1962
            VLVASCAFLLEL GLSASML VDVAALRRISTF KSGQSFENFRQLSPKGSAFHPVPLES
Sbjct: 1801 VLVASCAFLLELGGLSASMLHVDVAALRRISTFYKSGQSFENFRQLSPKGSAFHPVPLES 1860

Query: 1963 DKVETLARALADEYLHQESSSVNKPKGTSNSAPSKRCPQV-LFVLQHLEEVSLPQVVDGN 2022
            DK+ETLARALADEYLHQESSSVN+PKGTS+SAP KRC QV LFVLQHLEEVSLP +VDGN
Sbjct: 1861 DKIETLARALADEYLHQESSSVNEPKGTSDSAPPKRCSQVLLFVLQHLEEVSLPHMVDGN 1920

Query: 2023 SCGSWLLSGKGDGTELRNQQKAASHYWNLVTVFCRMHRLPPSSKYLALLARDNDWVGFLT 2082
            SCGSWLL GKGDGTELRNQQK AS +WNLV VFCRMHR+P SSKYLALLARDNDWVGFLT
Sbjct: 1921 SCGSWLLCGKGDGTELRNQQKTASQHWNLVRVFCRMHRIPLSSKYLALLARDNDWVGFLT 1980

Query: 2083 EAHVGGYPFDTVIQVASKEFSDPRLKIHILTVLKAVQSRKNPGPSSYSDTEDKKSQTSFL 2142
            EAHVGGYPFDTVIQ ASKEFSDPRLKIHILTVLK+VQSRK+PGPSSYSDTE+KK QT+ L
Sbjct: 1981 EAHVGGYPFDTVIQ-ASKEFSDPRLKIHILTVLKSVQSRKSPGPSSYSDTEEKKGQTTIL 2040

Query: 2143 DGSTYIPVELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLE 2202
            DGS YIPVELFTILAECE KKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLE
Sbjct: 2041 DGSMYIPVELFTILAECENKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLE 2100

Query: 2203 ITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRSPAFHYCRKNPKRRRTMDSISK 2262
            ITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRS AFHYCRKNPKRRRTMDS+S 
Sbjct: 2101 ITAARETTSIKVNDIASQIAENVGAAVEATNTLPAGCRSSAFHYCRKNPKRRRTMDSVSD 2160

Query: 2263 DPSVGVISDTFSASTGASTNVSGGFIVKEEGKIVQERRPISVSYDSDEAPSSLSKMVSVL 2322
            DPSV  ISD FSAS  AST V G  IV EEGKIVQE + ISVSYDSDEAPSSLSKMVSVL
Sbjct: 2161 DPSVIAISDNFSASR-ASTKVPGDSIVMEEGKIVQEPQRISVSYDSDEAPSSLSKMVSVL 2220

Query: 2323 CEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSVRVKDEASFSHA 2382
            CEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQM L+EASAHLGSFS RVKDEA +SHA
Sbjct: 2221 CEQKLFLPLLRAFEMFLPSCSLLPFIRALQAFSQMCLAEASAHLGSFSARVKDEAIYSHA 2280

Query: 2383 NVEGEEHTGTSWTGSTAVKAANAVLSVCPSPYERKCLLKLLAATDFGDGGFAAAYYQRLY 2442
            NVEGEE+TGTSWTGSTAVKAANAVLSVCPSPYER+CLLKLLAATDFGDGGF+A+YY+RLY
Sbjct: 2281 NVEGEENTGTSWTGSTAVKAANAVLSVCPSPYERRCLLKLLAATDFGDGGFSASYYRRLY 2340

Query: 2443 WKINLAEPSIRIDDGLHLGNEALDDASLLTALENNGHWEQARNWAKQLEASGGSWKSASH 2502
            +KINLAEP +RIDDGLHLGNEALDDASLL+ALENN HWEQARNWAKQLEASGGSWKSASH
Sbjct: 2341 YKINLAEPLLRIDDGLHLGNEALDDASLLSALENNRHWEQARNWAKQLEASGGSWKSASH 2400

Query: 2503 HVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKD 2562
            HVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKD
Sbjct: 2401 HVTETQAESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKD 2460

Query: 2563 LPAKELHELLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKNERDLNIN 2622
            LPAKELHELLLLSLQWLSGMFT+SYPVYPL+LLREIETKVWLLAVESEAELKNERDLNI+
Sbjct: 2461 LPAKELHELLLLSLQWLSGMFTVSYPVYPLNLLREIETKVWLLAVESEAELKNERDLNIS 2520

Query: 2623 NSSRECISRNSSSIIDWTASIISKMDKHISTMKNKSMDKHEVRENSQTHHKSHVLDAGLS 2682
             S REC SRNSSSIID TAS+ISKMDKHISTM NK+MDKHEVRENSQTHHKS VLDAGLS
Sbjct: 2521 GSIRECKSRNSSSIIDLTASMISKMDKHISTMTNKNMDKHEVRENSQTHHKSQVLDAGLS 2580

Query: 2683 TAGGGNTKAKRRTKGSVLIRRPLVDSTDMNTNPEDGCVPSNFKNDLHLQDENLKMDTSLS 2742
            T GGGNTK KRRTKGS+L+RRPL DS DMNTN EDG + SN KNDLH+QDENLKMDTS S
Sbjct: 2581 TTGGGNTKTKRRTKGSMLLRRPLADSADMNTNSEDGYISSNVKNDLHMQDENLKMDTSFS 2640

Query: 2743 GWEERIGPAEVDRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDAAFTLAAISTPNR 2802
            GWEERIGPAEV+RA+LSLLEFGQITAAKQLQQKLSP QVPSEFLLVDA+F LAAIST NR
Sbjct: 2641 GWEERIGPAEVERAILSLLEFGQITAAKQLQQKLSPEQVPSEFLLVDASFKLAAISTSNR 2700

Query: 2803 EVSMSMLDEDLCSVILAYDIPVDQYLNPLQVLEILATIFAEGGGRGLCRRVIAVVKAANV 2862
            EV M+MLD DLCSVIL+  I VDQYLNPLQVLE LATIFAEGGGRGLCRRVIAVVKAANV
Sbjct: 2701 EVPMAMLDGDLCSVILSSGIQVDQYLNPLQVLETLATIFAEGGGRGLCRRVIAVVKAANV 2760

Query: 2863 LGLPFSEAYNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAA 2922
            LGL FSEAYNKQPIELLQLLSLKAQESF EANFLVQTHSMPAASIAQILAESFLKGLLAA
Sbjct: 2761 LGLSFSEAYNKQPIELLQLLSLKAQESFAEANFLVQTHSMPAASIAQILAESFLKGLLAA 2820

Query: 2923 HRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELL 2982
            HRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELL
Sbjct: 2821 HRGGYMDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELL 2880

Query: 2983 ILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILI 3042
            ILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILI
Sbjct: 2881 ILSHHFYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILI 2940

Query: 3043 ENGQLELLLQKFSAAADTSAGSAEAVRGFRMAVLTSLKHFNPTDLDAFAKVYSHFDMKHE 3102
            EN QLE LLQKFSAA  TS GSAEAVRGFR+AVLTSLKH  P DLDAFAKVYSHFDMKHE
Sbjct: 2941 ENNQLEFLLQKFSAAISTSTGSAEAVRGFRIAVLTSLKHLIPNDLDAFAKVYSHFDMKHE 3000

Query: 3103 TAALLESQAEQSCEMWFRRYYKDQNADLLDAMHYYIAAAEVHSSIDAGNKTRRSCAQASL 3162
            TAALLE QAEQSCEMWFRRY KDQN DLLDAM YYI AAEV+SSIDAGNKTRRSCAQASL
Sbjct: 3001 TAALLERQAEQSCEMWFRRYDKDQNEDLLDAMLYYIKAAEVYSSIDAGNKTRRSCAQASL 3060

Query: 3163 VSLQIRMPDFKWLFQSETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKPE 3222
            VSLQIRMPDFKWLFQ+ETNARRALV+QSRFQEALIVAEAYDLDQPSEWALVIWNQMLKP+
Sbjct: 3061 VSLQIRMPDFKWLFQTETNARRALVDQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKPD 3120

Query: 3223 ILEEFVAEFVSVLPLHPSMLADIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRS 3246
            ILE+FVAEFV+VLPLHPSML+DIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRS
Sbjct: 3121 ILEDFVAEFVTVLPLHPSMLSDIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRS 3180

BLAST of Sgr026353 vs. ExPASy TrEMBL
Match: A0A6J1HEZ7 (uncharacterized protein LOC111463367 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111463367 PE=4 SV=1)

HSP 1 Score: 5448.2 bits (14132), Expect = 0.0e+00
Identity = 2774/3231 (85.86%), Positives = 2915/3231 (90.22%), Query Frame = 0

Query: 103  MDSVSGGGGPAILQLHKWNPSQPQLNLSEYREAFISPARQILLLHSYKHEALLLPLNTGD 162
            MDSVSG  GPAILQL  WNPSQPQLNLSEYREAFISP R ILLLHSYKHEALLLPL+TG 
Sbjct: 1    MDSVSGCEGPAILQLQNWNPSQPQLNLSEYREAFISPTRSILLLHSYKHEALLLPLDTGG 60

Query: 163  VRCGNDLPNGYDINLKDLGSLAFSEVVSTAPRSEDAEGNVRCSNKSAVDIDNDSPTGNKS 222
             RC +D PN YDI+LKDLGS AFSE  ST    EDAEG+V+CSN+ AVD+D DSPT N+ 
Sbjct: 61   NRCSDDFPNKYDIDLKDLGSSAFSEEASTTSWREDAEGDVQCSNRLAVDVDKDSPTKNRF 120

Query: 223  SRSSCNNFLGDVSSLAWGLCGDTYKKRKDSSFKEILFVSGNHGVTAHAFCQPNKTNEEAK 282
            SRSSCNNFLGDVSSLAWGLCGD+Y K++DSSFKEILFVSGNHGVTAHAF QP K   E K
Sbjct: 121  SRSSCNNFLGDVSSLAWGLCGDSYTKQEDSSFKEILFVSGNHGVTAHAFYQPKKFVVEGK 180

Query: 283  NMVQSEFWKGRWMEWGPYPTLVQNLEVQELSDSCVTSGNVDKNRINQNGEILRSSCYEFE 342
            NMVQSEFWKGRW+EWGP P L QNLE++E S  C TSGNVD+N  NQNGE+LRSSC E E
Sbjct: 181  NMVQSEFWKGRWVEWGPSPRLPQNLEIEECSGFCETSGNVDENGTNQNGEMLRSSCSESE 240

Query: 343  DDALLLGNSAPKRYLQSFLAKVKTIEYEDDIWTI-------------------------- 402
            +DALL GNSA KRYLQSFLAKVKT+E+ED+IWT+                          
Sbjct: 241  NDALLSGNSASKRYLQSFLAKVKTVEFEDNIWTMYPEKTSVPCFTKVVSFNIFNYNLPPP 300

Query: 403  ------------------------------------------------------------ 462
                                                                        
Sbjct: 301  NSDDSFVNEQSWHEIILGTRSNMSPTSFDTHFLGIGMNKSYKCSKIFSSDSHFLIGFVLK 360

Query: 463  -VDSVSADRVDETGSRNDTLILVARVGNLGIKWVSSVKFEKSLYISPLMEWADFCFSNDF 522
             +D+VS +   ET SRN TLILVARVGNLGIKWVSSVKFEKSLYI+P+MEWADFCFSNDF
Sbjct: 361  RMDTVSVEEGAETESRNGTLILVARVGNLGIKWVSSVKFEKSLYITPVMEWADFCFSNDF 420

Query: 523  LLCLSDSGFIFVHSALSGKHVTCIDVLQACGLNPKYLHLKQDLQMNQVDQVQDDVSCSRD 582
            ++CLSDSGFIF+HSALSGKHV C+DVLQACGLNP+YLH KQDLQ N VDQVQDD+S  R 
Sbjct: 421  IVCLSDSGFIFLHSALSGKHVACVDVLQACGLNPQYLHEKQDLQRNLVDQVQDDLSYRR- 480

Query: 583  SFYDRRKFRRLLSDSHSSHFAVIDAFGIMYVVSAVDHMLEHYHGSENLFPHPHNFELGRA 642
            SF++ RKFRRLLSDSHSSHFAVIDA G++YVVSA++HMLEH HG ENLFPH H+FELGR+
Sbjct: 481  SFHE-RKFRRLLSDSHSSHFAVIDASGVIYVVSAIEHMLEHCHGYENLFPHSHDFELGRS 540

Query: 643  PVSWEVGGYDIGCQRNYSESLGSHSCRVFSMKNEGVSFWGNTRFDVLQNTQDSKVCTGRK 702
             VSWEVGGYDIGCQRNY       SCR FS ++EG S WGNT+ +VLQN +DSKV  GR 
Sbjct: 541  LVSWEVGGYDIGCQRNY-------SCRDFSKQSEGGSHWGNTKSNVLQNIKDSKVYRGRG 600

Query: 703  YKCSCLTASASILQNQKFQGGELQSCTMRKMFLSTWKTNEDDCFCFSPMGLTQFIKRCNI 762
             KCSCLTA+AS+L+ QK +GGELQS TMRKMFLSTWKTNEDDCF FSPMGLTQ+IKRCN+
Sbjct: 601  DKCSCLTATASLLKEQKSEGGELQSGTMRKMFLSTWKTNEDDCFGFSPMGLTQYIKRCNM 660

Query: 763  SGQKCSQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKEELVGEAVGCTSQGSLYLVTNNGL 822
            SGQ  SQVVHFDLHLKSEVHDDSCLKSQMIFVDGRK+++VGEAVGCTSQGSLYLVTNNGL
Sbjct: 661  SGQNISQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKKDIVGEAVGCTSQGSLYLVTNNGL 720

Query: 823  SVVLPSVTIASDSLPSESVARLQPGVLLGTPNQVKGLELKESNCSWSPWQVEVLDRVLLY 882
            SVVLPSVTI S+SLPSE V+R QP ++LGT NQVK LELKES C WSPWQVEVLDRVLLY
Sbjct: 721  SVVLPSVTIPSNSLPSEYVSRSQPDIILGTANQVKDLELKESKCPWSPWQVEVLDRVLLY 780

Query: 883  ESIDEADRLCSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEEVGILRLLFA 942
            ESIDEADRLCSENGWDLKVVRMR FQM LHYLRFDELERSLEMLVDVDLEE GILRLLFA
Sbjct: 781  ESIDEADRLCSENGWDLKVVRMRCFQMALHYLRFDELERSLEMLVDVDLEEEGILRLLFA 840

Query: 943  AVHLMFQKAGTDNDISAASRLLALGTRFATRMTHRYGMAEFKRNATMFNDFSSSQEISIL 1002
            AVHLMFQKAG DNDISAASRLLALGT FATRM HRYGMAEFKRNAT FNDFSS QEISIL
Sbjct: 841  AVHLMFQKAGNDNDISAASRLLALGTHFATRMIHRYGMAEFKRNATTFNDFSSGQEISIL 900

Query: 1003 PHFPFRKQNELEYSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEAPISDENNLLL 1062
            PH PF+KQN  E+SRKLHEMSHFLEIIRNLH HLSSKFKRP QELV GE   SD+ +LLL
Sbjct: 901  PHLPFQKQNVSEHSRKLHEMSHFLEIIRNLHGHLSSKFKRPSQELVVGEE--SDQTSLLL 960

Query: 1063 DEPQLVSTDIIPLGSTSQYELSFPSNDLSSTVVDGLVMMPMVSESQLDSEDLNGDSAVVP 1122
            DEPQLVSTDIIPLGSTSQYELSFPSNDL+S VVDGL +MPMVS SQ +SEDL+ DSAVVP
Sbjct: 961  DEPQLVSTDIIPLGSTSQYELSFPSNDLNSNVVDGLAIMPMVSGSQFNSEDLDEDSAVVP 1020

Query: 1123 QGVLEKKVVPLENPKQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINHLRELIGEK 1182
            QGVLEKKVVPLENP QMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINHLRELI E 
Sbjct: 1021 QGVLEKKVVPLENPNQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINHLRELIEEN 1080

Query: 1183 EPHDTFSEIRDIGRAIAYDLFLKGEIGLAIATLQRLGDDIEVSLKQLLYGTINRSFRVEI 1242
            EPHDTFSEIRDIGRAIAYDLFLKGE GLAIATLQRLGDDIEVSLKQLLYGTINRSFRVEI
Sbjct: 1081 EPHDTFSEIRDIGRAIAYDLFLKGETGLAIATLQRLGDDIEVSLKQLLYGTINRSFRVEI 1140

Query: 1243 AAEMKKYGYLGPFDQRMMDRIVHIERLYPSSNFWKTFLSRQRANMGSPSSSATPGENDLR 1302
            AAEMKKYGYLGPFDQRMMDRI+HIERLYPSSNFWKTFLSRQ+ANMG PSSS +PGEN+LR
Sbjct: 1141 AAEMKKYGYLGPFDQRMMDRILHIERLYPSSNFWKTFLSRQKANMGFPSSSNSPGENELR 1200

Query: 1303 TLRFHLINNTIIDCGEVDGVVLGSWPNANENSFVLEITEDNAHVGYWAAAAIWTNTWDQR 1362
            TLRFHLINNT IDCGEVDGVVLGSWPNANE+S V+E TEDNAH+GYWAAAAIWTNTWDQR
Sbjct: 1201 TLRFHLINNTFIDCGEVDGVVLGSWPNANESSSVVETTEDNAHIGYWAAAAIWTNTWDQR 1260

Query: 1363 TTDRILLDRSLGIGIPVAWESQLDYHICHNNWDEVSRLLDMIPVSNLLDGSLQVSLDGLQ 1422
            TTDRILLD+SLG GI VAWESQLDYHICHNNWD VSRLLDMIP +N+LDGSLQVSLDGLQ
Sbjct: 1261 TTDRILLDQSLGNGIHVAWESQLDYHICHNNWDGVSRLLDMIPDANILDGSLQVSLDGLQ 1320

Query: 1423 SASAVGCNRESSFYSNYLYPLEELDAVCLYIPKAKIFRFSANIMCSKWLGMLLEEKLARQ 1482
            SASAVGCNRES+FYSNYLYPLEELDAVCLYIP  KIF+FSANIMCSK LGMLLEEKLAR 
Sbjct: 1321 SASAVGCNRESTFYSNYLYPLEELDAVCLYIPNVKIFKFSANIMCSKLLGMLLEEKLARH 1380

Query: 1483 FIFLKEYWEGTMELVPLLARSGFITNRLDEIASVDDHISSSVDQRSTNNGGAFYVDSVQA 1542
            FIFLKEYWEG+MELVPLLARSGFI +RLDEIAS+DDHISSSVDQRS+N GGA+ VDSVQA
Sbjct: 1381 FIFLKEYWEGSMELVPLLARSGFIIHRLDEIASMDDHISSSVDQRSSNKGGAYSVDSVQA 1440

Query: 1543 LYKVFIHHCSQYNLPFLLDLYLDHHKLVVDNNSVRSLLEAAGDCQWARWLLLSRIRGCEY 1602
            LYKVFIHHCSQYNLPFLLDLYLDH KLV+DNNSV SLLEAAGDC WARWLLLSRIRGCEY
Sbjct: 1441 LYKVFIHHCSQYNLPFLLDLYLDHQKLVIDNNSVHSLLEAAGDCHWARWLLLSRIRGCEY 1500

Query: 1603 DASFSNARSIMSLNLVHDPNLGVRDIDEIICTVGDIAEGGGEMAALATLMYAPSPIQDCL 1662
            DASFSNARSIM LNLVHDPNL VR+I+EII TV DIAEGGGEMAALATLMYAPSPIQDCL
Sbjct: 1501 DASFSNARSIMPLNLVHDPNLSVRNIEEIISTVADIAEGGGEMAALATLMYAPSPIQDCL 1560

Query: 1663 SSSGVNRHSSSSAQCTLENLRPALQRFPTLCRALVTSAFQQDTTCNFLGPKLKNALSEYL 1722
            +SSGVNRHSSSSAQCTLENLRP LQRFPTLCRALVTSAFQQDTTCNFLGPK KNALSEYL
Sbjct: 1561 NSSGVNRHSSSSAQCTLENLRPVLQRFPTLCRALVTSAFQQDTTCNFLGPKWKNALSEYL 1620

Query: 1723 HWRSSIFFSAGRDTSLLHMLPCWFPKAVRRLLQLYVQGPLGWQSLSALPTGQTLWERDVH 1782
            HWR+S  FSAGRDTSLLHMLPCWFP AVRRLL LYVQGPLGWQS+S LPTGQ LWERDV+
Sbjct: 1621 HWRNSTIFSAGRDTSLLHMLPCWFPNAVRRLLHLYVQGPLGWQSISGLPTGQALWERDVY 1680

Query: 1783 FFMNDYEHSEISPISWEATIQKHIEDELYDSSLKETGVGLEHNLHRGRAFSAFNHLLAAR 1842
            FFMND EHSEISPISWEA IQKHIEDELYDSSLKETG+GLEHNLHRGRA SAFNHLL AR
Sbjct: 1681 FFMNDDEHSEISPISWEAAIQKHIEDELYDSSLKETGLGLEHNLHRGRALSAFNHLLVAR 1740

Query: 1843 VQKLKSEIQPGSATGPSNTQFDLQALFAPLTLREQSLLSSIIPLAITHFENSVLVASCAF 1902
            VQKLKSEIQ GSA G SN Q DLQ LFAPLT  EQSLLSS+IPLAITHFENSVLVASCAF
Sbjct: 1741 VQKLKSEIQSGSAIGQSNIQLDLQTLFAPLTPTEQSLLSSVIPLAITHFENSVLVASCAF 1800

Query: 1903 LLELCGLSASMLRVDVAALRRISTFNKSGQSFENFRQLSPKGSAFHPVPLESDKVETLAR 1962
            LLEL GLSASMLRVDVAALRRISTF KSGQSFENFRQLSPKGSAFHPVPLESDK+ETLAR
Sbjct: 1801 LLELGGLSASMLRVDVAALRRISTFYKSGQSFENFRQLSPKGSAFHPVPLESDKIETLAR 1860

Query: 1963 ALADEYLHQESSSVNKPKGTSNSAPSKRCPQV-LFVLQHLEEVSLPQVVDGNSCGSWLLS 2022
            ALADEYLHQESSSVN+P GTS+SAP KRC QV LFVLQHLEEVSLP +VDGNSCGSWLL 
Sbjct: 1861 ALADEYLHQESSSVNEPTGTSDSAPPKRCSQVLLFVLQHLEEVSLPHMVDGNSCGSWLLC 1920

Query: 2023 GKGDGTELRNQQKAASHYWNLVTVFCRMHRLPPSSKYLALLARDNDWVGFLTEAHVGGYP 2082
            GKGDGTELRNQQKAASH+WNLV VFCRMHR+P SSKYLALLARDNDWVGFLTEAHVGGYP
Sbjct: 1921 GKGDGTELRNQQKAASHHWNLVRVFCRMHRIPLSSKYLALLARDNDWVGFLTEAHVGGYP 1980

Query: 2083 FDTVIQVASKEFSDPRLKIHILTVLKAVQSRKNPGPSSYSDTEDKKSQTSFLDGSTYIPV 2142
            FDTVIQVAS+EFSDPRLKIHILTVLK+VQSRK+PG SSYSDTE+KK QT+ LDGS YIPV
Sbjct: 1981 FDTVIQVASREFSDPRLKIHILTVLKSVQSRKSPGTSSYSDTEEKKGQTTILDGSMYIPV 2040

Query: 2143 ELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLEITAARETT 2202
            ELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLEITAARETT
Sbjct: 2041 ELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLEITAARETT 2100

Query: 2203 SIKVNDIASQIAENVGAAVEATNTLPAGCRSPAFHYCRKNPKRRRTMDSISKDPSVGVIS 2262
            SIKVNDIASQIAENVGAAVEATNTLPAGCRS AFHYCRKNPKRRRTMDS+S DPSV  IS
Sbjct: 2101 SIKVNDIASQIAENVGAAVEATNTLPAGCRSSAFHYCRKNPKRRRTMDSVSDDPSVIAIS 2160

Query: 2263 DTFSASTGASTNVSGGFIVKEEGKIVQERRPISVSYDSDEAPSSLSKMVSVLCEQKLFLP 2322
            D FSAS  ASTNV G  IVKEEGKIVQE + ISVSYDSDEAPSSLSKMVSVLCEQKLFLP
Sbjct: 2161 DNFSASR-ASTNVPGDSIVKEEGKIVQEPQRISVSYDSDEAPSSLSKMVSVLCEQKLFLP 2220

Query: 2323 LLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSVRVKDEASFSHANVEGEEHT 2382
            LLRAFEMFLPSCSLLPFIRALQAFSQM L+EASAHLGSFS RVKDEA +SHANVEGEE+T
Sbjct: 2221 LLRAFEMFLPSCSLLPFIRALQAFSQMCLAEASAHLGSFSARVKDEAIYSHANVEGEENT 2280

Query: 2383 GTSWTGSTAVKAANAVLSVCPSPYERKCLLKLLAATDFGDGGFAAAYYQRLYWKINLAEP 2442
            GTSWTGSTAVKAANAVLSVCPSPYER+CLLKLLAATDFGDGGF+A+YY+RLY+KINLAEP
Sbjct: 2281 GTSWTGSTAVKAANAVLSVCPSPYERRCLLKLLAATDFGDGGFSASYYRRLYYKINLAEP 2340

Query: 2443 SIRIDDGLHLGNEALDDASLLTALENNGHWEQARNWAKQLEASGGSWKSASHHVTETQAE 2502
             +RIDD LHLGNEALDDASLL+ALENN HWEQARNWAKQLEASGGSWKSASHHVTETQAE
Sbjct: 2341 LLRIDDALHLGNEALDDASLLSALENNRHWEQARNWAKQLEASGGSWKSASHHVTETQAE 2400

Query: 2503 SMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKDLPAKELHE 2562
            SMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKDLPAKELHE
Sbjct: 2401 SMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKDLPAKELHE 2460

Query: 2563 LLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKNERDLNINNSSRECIS 2622
            LLLLSLQWLSGMFT+SYPVYPL+LLREIETKVWLLAVESEAELKNERDLNI+ S REC S
Sbjct: 2461 LLLLSLQWLSGMFTVSYPVYPLNLLREIETKVWLLAVESEAELKNERDLNISGSIRECKS 2520

Query: 2623 RNSSSIIDWTASIISKMDKHISTMKNKSMDKHEVRENSQTHHKSHVLDAGLSTAGGGNTK 2682
            RNSSSIID TAS+ISKMDKHISTM NK+MDKHEVRENSQTHHKS VLDAGLSTAGGGNTK
Sbjct: 2521 RNSSSIIDLTASMISKMDKHISTMTNKNMDKHEVRENSQTHHKSQVLDAGLSTAGGGNTK 2580

Query: 2683 AKRRTKGSVLIRRPLVDSTDMNTNPEDGCVPSNFKNDLHLQDENLKMDTSLSGWEERIGP 2742
            AKRRTKGS+L+RRPL DS DMNTN EDG + SNFKNDLH+QDENLKMDTS SGWEERIGP
Sbjct: 2581 AKRRTKGSMLLRRPLADSADMNTNSEDGYISSNFKNDLHMQDENLKMDTSFSGWEERIGP 2640

Query: 2743 AEVDRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDAAFTLAAISTPNREVSMSMLD 2802
            AEV+RA+LSLLEFGQITAAKQLQQKLSP QVPSEFLLVDA+F LAAIST N EV MSMLD
Sbjct: 2641 AEVERAILSLLEFGQITAAKQLQQKLSPEQVPSEFLLVDASFKLAAISTSNCEVPMSMLD 2700

Query: 2803 EDLCSVILAYDIPVDQYLNPLQVLEILATIFAEGGGRGLCRRVIAVVKAANVLGLPFSEA 2862
             DLCSVIL+  I VDQYLNPLQVLE LAT+FAEGGGRGLCRRVIAVVKAANVLGLPFSEA
Sbjct: 2701 GDLCSVILSSGIQVDQYLNPLQVLETLATVFAEGGGRGLCRRVIAVVKAANVLGLPFSEA 2760

Query: 2863 YNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAAHRGGYMDS 2922
            YNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAAHRGGYMDS
Sbjct: 2761 YNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAAHRGGYMDS 2820

Query: 2923 QKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELLILSHHFYK 2982
            QKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELLILSHHFYK
Sbjct: 2821 QKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELLILSHHFYK 2880

Query: 2983 SSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILIENGQLELL 3042
            SSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILIEN QLE L
Sbjct: 2881 SSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILIENNQLEFL 2940

Query: 3043 LQKFSAAADTSAGSAEAVRGFRMAVLTSLKHFNPTDLDAFAKVYSHFDMKHETAALLESQ 3102
            LQKFSAA  TS GSAEAVRGFR+AVLTSLKH  P DLDAFAKVYSHFDMKHETAALLE+Q
Sbjct: 2941 LQKFSAAISTSTGSAEAVRGFRIAVLTSLKHLIPNDLDAFAKVYSHFDMKHETAALLETQ 3000

Query: 3103 AEQSCEMWFRRYYKDQNADLLDAMHYYIAAAEVHSSIDAGNKTRRSCAQASLVSLQIRMP 3162
            AEQSCEMWFRRY KDQN DLLDAM YYI AAEV+SSIDAGNKTRRSCAQASLVSLQIRMP
Sbjct: 3001 AEQSCEMWFRRYDKDQNEDLLDAMLYYIKAAEVYSSIDAGNKTRRSCAQASLVSLQIRMP 3060

Query: 3163 DFKWLFQSETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKPEILEEFVAE 3222
            DFKWLFQ+ETNARRALV+QSRFQEALIVAEAYDLDQPSEWALVIWNQMLKP+ILE+FVAE
Sbjct: 3061 DFKWLFQTETNARRALVDQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKPDILEDFVAE 3120

Query: 3223 FVSVLPLHPSMLADIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRSFRCLLKRT 3246
            FV+VLPLHPSML DIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRSFRCLLKRT
Sbjct: 3121 FVTVLPLHPSMLGDIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRSFRCLLKRT 3180

BLAST of Sgr026353 vs. ExPASy TrEMBL
Match: A0A6J1HE64 (uncharacterized protein LOC111463367 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111463367 PE=4 SV=1)

HSP 1 Score: 5442.1 bits (14116), Expect = 0.0e+00
Identity = 2773/3231 (85.82%), Positives = 2914/3231 (90.19%), Query Frame = 0

Query: 103  MDSVSGGGGPAILQLHKWNPSQPQLNLSEYREAFISPARQILLLHSYKHEALLLPLNTGD 162
            MDSVSG  GPAILQL  WNPSQPQLNLSEYREAFISP R ILLLHSYKHEALLLPL+TG 
Sbjct: 1    MDSVSGCEGPAILQLQNWNPSQPQLNLSEYREAFISPTRSILLLHSYKHEALLLPLDTGG 60

Query: 163  VRCGNDLPNGYDINLKDLGSLAFSEVVSTAPRSEDAEGNVRCSNKSAVDIDNDSPTGNKS 222
             RC +D PN YDI+LKDLGS AFSE  ST    EDAEG+V+CSN+ AVD+D DSPT N+ 
Sbjct: 61   NRCSDDFPNKYDIDLKDLGSSAFSEEASTTSWREDAEGDVQCSNRLAVDVDKDSPTKNRF 120

Query: 223  SRSSCNNFLGDVSSLAWGLCGDTYKKRKDSSFKEILFVSGNHGVTAHAFCQPNKTNEEAK 282
            SRSSCNNFLGDVSSLAWGLCGD+Y K++DSSFKEILFVSGNHGVTAHAF QP K   E K
Sbjct: 121  SRSSCNNFLGDVSSLAWGLCGDSYTKQEDSSFKEILFVSGNHGVTAHAFYQPKKFVVEGK 180

Query: 283  NMVQSEFWKGRWMEWGPYPTLVQNLEVQELSDSCVTSGNVDKNRINQNGEILRSSCYEFE 342
            NMVQSEFWKGRW+EWGP P L QNLE++E S  C TSGNVD+N  NQNGE+LRSSC E E
Sbjct: 181  NMVQSEFWKGRWVEWGPSPRLPQNLEIEECSGFCETSGNVDENGTNQNGEMLRSSCSESE 240

Query: 343  DDALLLGNSAPKRYLQSFLAKVKTIEYEDDIWTI-------------------------- 402
            +DALL GNSA KRYLQSFLAKVKT+E+ED+IWT+                          
Sbjct: 241  NDALLSGNSASKRYLQSFLAKVKTVEFEDNIWTMYPEKTSVPCFTKVVSFNIFNYNLPPP 300

Query: 403  ------------------------------------------------------------ 462
                                                                        
Sbjct: 301  NSDDSFVNEQSWHEIILGTRSNMSPTSFDTHFLGIGMNKSYKCSKIFSSDSHFLIGFVLK 360

Query: 463  -VDSVSADRVDETGSRNDTLILVARVGNLGIKWVSSVKFEKSLYISPLMEWADFCFSNDF 522
             +D+VS +   ET SRN TLILVARVGNLGIKWVSSVKFEKSLYI+P+MEWADFCFSNDF
Sbjct: 361  RMDTVSVEEGAETESRNGTLILVARVGNLGIKWVSSVKFEKSLYITPVMEWADFCFSNDF 420

Query: 523  LLCLSDSGFIFVHSALSGKHVTCIDVLQACGLNPKYLHLKQDLQMNQVDQVQDDVSCSRD 582
            ++CLSDSGFIF+HSALSGKHV C+DVLQACGLNP+YLH KQDLQ N VDQVQDD+S  R 
Sbjct: 421  IVCLSDSGFIFLHSALSGKHVACVDVLQACGLNPQYLHEKQDLQRNLVDQVQDDLSYRR- 480

Query: 583  SFYDRRKFRRLLSDSHSSHFAVIDAFGIMYVVSAVDHMLEHYHGSENLFPHPHNFELGRA 642
            SF++ RKFRRLLSDSHSSHFAVIDA G++YVVSA++HMLEH HG ENLFPH H+FELGR+
Sbjct: 481  SFHE-RKFRRLLSDSHSSHFAVIDASGVIYVVSAIEHMLEHCHGYENLFPHSHDFELGRS 540

Query: 643  PVSWEVGGYDIGCQRNYSESLGSHSCRVFSMKNEGVSFWGNTRFDVLQNTQDSKVCTGRK 702
             VSWEVGGYDIGCQRNY       SCR FS ++EG S WGNT+ +VLQN +DSKV  GR 
Sbjct: 541  LVSWEVGGYDIGCQRNY-------SCRDFSKQSEGGSHWGNTKSNVLQNIKDSKVYRGRG 600

Query: 703  YKCSCLTASASILQNQKFQGGELQSCTMRKMFLSTWKTNEDDCFCFSPMGLTQFIKRCNI 762
             KCSCLTA+AS+L+ QK +GGELQS TMRKMFLSTWKTNEDDCF FSPMGLTQ+IKRCN+
Sbjct: 601  DKCSCLTATASLLKEQKSEGGELQSGTMRKMFLSTWKTNEDDCFGFSPMGLTQYIKRCNM 660

Query: 763  SGQKCSQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKEELVGEAVGCTSQGSLYLVTNNGL 822
            SGQ  SQVVHFDLHLKSEVHDDSCLKSQMIFVDGRK+++VGEAVGCTSQGSLYLVTNNGL
Sbjct: 661  SGQNISQVVHFDLHLKSEVHDDSCLKSQMIFVDGRKKDIVGEAVGCTSQGSLYLVTNNGL 720

Query: 823  SVVLPSVTIASDSLPSESVARLQPGVLLGTPNQVKGLELKESNCSWSPWQVEVLDRVLLY 882
            SVVLPSVTI S+SLPSE V+R QP ++LGT NQVK LELKES C WSPWQVEVLDRVLLY
Sbjct: 721  SVVLPSVTIPSNSLPSEYVSRSQPDIILGTANQVKDLELKESKCPWSPWQVEVLDRVLLY 780

Query: 883  ESIDEADRLCSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEEVGILRLLFA 942
            ESIDEADRLCSENGWDLKVVRMR FQM LHYLRFDELERSLEMLVDVDLEE GILRLLFA
Sbjct: 781  ESIDEADRLCSENGWDLKVVRMRCFQMALHYLRFDELERSLEMLVDVDLEEEGILRLLFA 840

Query: 943  AVHLMFQKAGTDNDISAASRLLALGTRFATRMTHRYGMAEFKRNATMFNDFSSSQEISIL 1002
            AVHLMFQKAG DNDISAASRLLALGT FATRM HRYGMAEFKRNAT FNDFSS QEISIL
Sbjct: 841  AVHLMFQKAGNDNDISAASRLLALGTHFATRMIHRYGMAEFKRNATTFNDFSSGQEISIL 900

Query: 1003 PHFPFRKQNELEYSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEAPISDENNLLL 1062
            PH PF+KQN  E+SRKLHEMSHFLEIIRNLH HLSSKFKRP QELV GE   SD+ +LLL
Sbjct: 901  PHLPFQKQNVSEHSRKLHEMSHFLEIIRNLHGHLSSKFKRPSQELVVGEE--SDQTSLLL 960

Query: 1063 DEPQLVSTDIIPLGSTSQYELSFPSNDLSSTVVDGLVMMPMVSESQLDSEDLNGDSAVVP 1122
            DEPQLVSTDIIPLGSTSQYELSFPSNDL+S VVDGL +MPMVS SQ +SEDL+ DSAVVP
Sbjct: 961  DEPQLVSTDIIPLGSTSQYELSFPSNDLNSNVVDGLAIMPMVSGSQFNSEDLDEDSAVVP 1020

Query: 1123 QGVLEKKVVPLENPKQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINHLRELIGEK 1182
            QGVLEKKVVPLENP QMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINHLRELI E 
Sbjct: 1021 QGVLEKKVVPLENPNQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINHLRELIEEN 1080

Query: 1183 EPHDTFSEIRDIGRAIAYDLFLKGEIGLAIATLQRLGDDIEVSLKQLLYGTINRSFRVEI 1242
            EPHDTFSEIRDIGRAIAYDLFLKGE GLAIATLQRLGDDIEVSLKQLLYGTINRSFRVEI
Sbjct: 1081 EPHDTFSEIRDIGRAIAYDLFLKGETGLAIATLQRLGDDIEVSLKQLLYGTINRSFRVEI 1140

Query: 1243 AAEMKKYGYLGPFDQRMMDRIVHIERLYPSSNFWKTFLSRQRANMGSPSSSATPGENDLR 1302
            AAEMKKYGYLGPFDQRMMDRI+HIERLYPSSNFWKTFLSRQ+ANMG PSSS +PGEN+LR
Sbjct: 1141 AAEMKKYGYLGPFDQRMMDRILHIERLYPSSNFWKTFLSRQKANMGFPSSSNSPGENELR 1200

Query: 1303 TLRFHLINNTIIDCGEVDGVVLGSWPNANENSFVLEITEDNAHVGYWAAAAIWTNTWDQR 1362
            TLRFHLINNT IDCGEVDGVVLGSWPNANE+S V+E TEDNAH+GYWAAAAIWTNTWDQR
Sbjct: 1201 TLRFHLINNTFIDCGEVDGVVLGSWPNANESSSVVETTEDNAHIGYWAAAAIWTNTWDQR 1260

Query: 1363 TTDRILLDRSLGIGIPVAWESQLDYHICHNNWDEVSRLLDMIPVSNLLDGSLQVSLDGLQ 1422
            TTDRILLD+SLG GI VAWESQLDYHICHNNWD VSRLLDMIP +N+LDGSLQVSLDGLQ
Sbjct: 1261 TTDRILLDQSLGNGIHVAWESQLDYHICHNNWDGVSRLLDMIPDANILDGSLQVSLDGLQ 1320

Query: 1423 SASAVGCNRESSFYSNYLYPLEELDAVCLYIPKAKIFRFSANIMCSKWLGMLLEEKLARQ 1482
            SASAVGCNRES+FYSNYLYPLEELDAVCLYIP  KIF+FSANIMCSK LGMLLEEKLAR 
Sbjct: 1321 SASAVGCNRESTFYSNYLYPLEELDAVCLYIPNVKIFKFSANIMCSKLLGMLLEEKLARH 1380

Query: 1483 FIFLKEYWEGTMELVPLLARSGFITNRLDEIASVDDHISSSVDQRSTNNGGAFYVDSVQA 1542
            FIFLKEYWEG+MELVPLLARSGFI +RLDEIAS+DDHISSSVDQRS+N GGA+ VDSVQA
Sbjct: 1381 FIFLKEYWEGSMELVPLLARSGFIIHRLDEIASMDDHISSSVDQRSSNKGGAYSVDSVQA 1440

Query: 1543 LYKVFIHHCSQYNLPFLLDLYLDHHKLVVDNNSVRSLLEAAGDCQWARWLLLSRIRGCEY 1602
            LYKVFIHHCSQYNLPFLLDLYLDH KLV+DNNSV SLLEAAGDC WARWLLLSRIRGCEY
Sbjct: 1441 LYKVFIHHCSQYNLPFLLDLYLDHQKLVIDNNSVHSLLEAAGDCHWARWLLLSRIRGCEY 1500

Query: 1603 DASFSNARSIMSLNLVHDPNLGVRDIDEIICTVGDIAEGGGEMAALATLMYAPSPIQDCL 1662
            DASFSNARSIM LNLVHDPNL VR+I+EII TV DIAEGGGEMAALATLMYAPSPIQDCL
Sbjct: 1501 DASFSNARSIMPLNLVHDPNLSVRNIEEIISTVADIAEGGGEMAALATLMYAPSPIQDCL 1560

Query: 1663 SSSGVNRHSSSSAQCTLENLRPALQRFPTLCRALVTSAFQQDTTCNFLGPKLKNALSEYL 1722
            +SSGVNRHSSSSAQCTLENLRP LQRFPTLCRALVTSAFQQDTTCNFLGPK KNALSEYL
Sbjct: 1561 NSSGVNRHSSSSAQCTLENLRPVLQRFPTLCRALVTSAFQQDTTCNFLGPKWKNALSEYL 1620

Query: 1723 HWRSSIFFSAGRDTSLLHMLPCWFPKAVRRLLQLYVQGPLGWQSLSALPTGQTLWERDVH 1782
            HWR+S  FSAGRDTSLLHMLPCWFP AVRRLL LYVQGPLGWQS+S LPTGQ LWERDV+
Sbjct: 1621 HWRNSTIFSAGRDTSLLHMLPCWFPNAVRRLLHLYVQGPLGWQSISGLPTGQALWERDVY 1680

Query: 1783 FFMNDYEHSEISPISWEATIQKHIEDELYDSSLKETGVGLEHNLHRGRAFSAFNHLLAAR 1842
            FFMND EHSEISPISWEA IQKHIEDELYDSSLKETG+GLEHNLHRGRA SAFNHLL AR
Sbjct: 1681 FFMNDDEHSEISPISWEAAIQKHIEDELYDSSLKETGLGLEHNLHRGRALSAFNHLLVAR 1740

Query: 1843 VQKLKSEIQPGSATGPSNTQFDLQALFAPLTLREQSLLSSIIPLAITHFENSVLVASCAF 1902
            VQKLKSEIQ GSA G SN Q DLQ LFAPLT  EQSLLSS+IPLAITHFENSVLVASCAF
Sbjct: 1741 VQKLKSEIQSGSAIGQSNIQLDLQTLFAPLTPTEQSLLSSVIPLAITHFENSVLVASCAF 1800

Query: 1903 LLELCGLSASMLRVDVAALRRISTFNKSGQSFENFRQLSPKGSAFHPVPLESDKVETLAR 1962
            LLEL GLSASMLRVDVAALRRISTF KSGQSFENFRQLSPKGSAFHPVPLESDK+ETLAR
Sbjct: 1801 LLELGGLSASMLRVDVAALRRISTFYKSGQSFENFRQLSPKGSAFHPVPLESDKIETLAR 1860

Query: 1963 ALADEYLHQESSSVNKPKGTSNSAPSKRCPQV-LFVLQHLEEVSLPQVVDGNSCGSWLLS 2022
            ALADEYLHQESSSVN+P GTS+SAP KRC QV LFVLQHLEEVSLP +VDGNSCGSWLL 
Sbjct: 1861 ALADEYLHQESSSVNEPTGTSDSAPPKRCSQVLLFVLQHLEEVSLPHMVDGNSCGSWLLC 1920

Query: 2023 GKGDGTELRNQQKAASHYWNLVTVFCRMHRLPPSSKYLALLARDNDWVGFLTEAHVGGYP 2082
            GKGDGTELRNQQKAASH+WNLV VFCRMHR+P SSKYLALLARDNDWVGFLTEAHVGGYP
Sbjct: 1921 GKGDGTELRNQQKAASHHWNLVRVFCRMHRIPLSSKYLALLARDNDWVGFLTEAHVGGYP 1980

Query: 2083 FDTVIQVASKEFSDPRLKIHILTVLKAVQSRKNPGPSSYSDTEDKKSQTSFLDGSTYIPV 2142
            FDTVIQ AS+EFSDPRLKIHILTVLK+VQSRK+PG SSYSDTE+KK QT+ LDGS YIPV
Sbjct: 1981 FDTVIQ-ASREFSDPRLKIHILTVLKSVQSRKSPGTSSYSDTEEKKGQTTILDGSMYIPV 2040

Query: 2143 ELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLEITAARETT 2202
            ELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLEITAARETT
Sbjct: 2041 ELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLEITAARETT 2100

Query: 2203 SIKVNDIASQIAENVGAAVEATNTLPAGCRSPAFHYCRKNPKRRRTMDSISKDPSVGVIS 2262
            SIKVNDIASQIAENVGAAVEATNTLPAGCRS AFHYCRKNPKRRRTMDS+S DPSV  IS
Sbjct: 2101 SIKVNDIASQIAENVGAAVEATNTLPAGCRSSAFHYCRKNPKRRRTMDSVSDDPSVIAIS 2160

Query: 2263 DTFSASTGASTNVSGGFIVKEEGKIVQERRPISVSYDSDEAPSSLSKMVSVLCEQKLFLP 2322
            D FSAS  ASTNV G  IVKEEGKIVQE + ISVSYDSDEAPSSLSKMVSVLCEQKLFLP
Sbjct: 2161 DNFSASR-ASTNVPGDSIVKEEGKIVQEPQRISVSYDSDEAPSSLSKMVSVLCEQKLFLP 2220

Query: 2323 LLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSVRVKDEASFSHANVEGEEHT 2382
            LLRAFEMFLPSCSLLPFIRALQAFSQM L+EASAHLGSFS RVKDEA +SHANVEGEE+T
Sbjct: 2221 LLRAFEMFLPSCSLLPFIRALQAFSQMCLAEASAHLGSFSARVKDEAIYSHANVEGEENT 2280

Query: 2383 GTSWTGSTAVKAANAVLSVCPSPYERKCLLKLLAATDFGDGGFAAAYYQRLYWKINLAEP 2442
            GTSWTGSTAVKAANAVLSVCPSPYER+CLLKLLAATDFGDGGF+A+YY+RLY+KINLAEP
Sbjct: 2281 GTSWTGSTAVKAANAVLSVCPSPYERRCLLKLLAATDFGDGGFSASYYRRLYYKINLAEP 2340

Query: 2443 SIRIDDGLHLGNEALDDASLLTALENNGHWEQARNWAKQLEASGGSWKSASHHVTETQAE 2502
             +RIDD LHLGNEALDDASLL+ALENN HWEQARNWAKQLEASGGSWKSASHHVTETQAE
Sbjct: 2341 LLRIDDALHLGNEALDDASLLSALENNRHWEQARNWAKQLEASGGSWKSASHHVTETQAE 2400

Query: 2503 SMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKDLPAKELHE 2562
            SMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKDLPAKELHE
Sbjct: 2401 SMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKDLPAKELHE 2460

Query: 2563 LLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKNERDLNINNSSRECIS 2622
            LLLLSLQWLSGMFT+SYPVYPL+LLREIETKVWLLAVESEAELKNERDLNI+ S REC S
Sbjct: 2461 LLLLSLQWLSGMFTVSYPVYPLNLLREIETKVWLLAVESEAELKNERDLNISGSIRECKS 2520

Query: 2623 RNSSSIIDWTASIISKMDKHISTMKNKSMDKHEVRENSQTHHKSHVLDAGLSTAGGGNTK 2682
            RNSSSIID TAS+ISKMDKHISTM NK+MDKHEVRENSQTHHKS VLDAGLSTAGGGNTK
Sbjct: 2521 RNSSSIIDLTASMISKMDKHISTMTNKNMDKHEVRENSQTHHKSQVLDAGLSTAGGGNTK 2580

Query: 2683 AKRRTKGSVLIRRPLVDSTDMNTNPEDGCVPSNFKNDLHLQDENLKMDTSLSGWEERIGP 2742
            AKRRTKGS+L+RRPL DS DMNTN EDG + SNFKNDLH+QDENLKMDTS SGWEERIGP
Sbjct: 2581 AKRRTKGSMLLRRPLADSADMNTNSEDGYISSNFKNDLHMQDENLKMDTSFSGWEERIGP 2640

Query: 2743 AEVDRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDAAFTLAAISTPNREVSMSMLD 2802
            AEV+RA+LSLLEFGQITAAKQLQQKLSP QVPSEFLLVDA+F LAAIST N EV MSMLD
Sbjct: 2641 AEVERAILSLLEFGQITAAKQLQQKLSPEQVPSEFLLVDASFKLAAISTSNCEVPMSMLD 2700

Query: 2803 EDLCSVILAYDIPVDQYLNPLQVLEILATIFAEGGGRGLCRRVIAVVKAANVLGLPFSEA 2862
             DLCSVIL+  I VDQYLNPLQVLE LAT+FAEGGGRGLCRRVIAVVKAANVLGLPFSEA
Sbjct: 2701 GDLCSVILSSGIQVDQYLNPLQVLETLATVFAEGGGRGLCRRVIAVVKAANVLGLPFSEA 2760

Query: 2863 YNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAAHRGGYMDS 2922
            YNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAAHRGGYMDS
Sbjct: 2761 YNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAAHRGGYMDS 2820

Query: 2923 QKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELLILSHHFYK 2982
            QKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELLILSHHFYK
Sbjct: 2821 QKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELLILSHHFYK 2880

Query: 2983 SSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILIENGQLELL 3042
            SSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILIEN QLE L
Sbjct: 2881 SSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILIENNQLEFL 2940

Query: 3043 LQKFSAAADTSAGSAEAVRGFRMAVLTSLKHFNPTDLDAFAKVYSHFDMKHETAALLESQ 3102
            LQKFSAA  TS GSAEAVRGFR+AVLTSLKH  P DLDAFAKVYSHFDMKHETAALLE+Q
Sbjct: 2941 LQKFSAAISTSTGSAEAVRGFRIAVLTSLKHLIPNDLDAFAKVYSHFDMKHETAALLETQ 3000

Query: 3103 AEQSCEMWFRRYYKDQNADLLDAMHYYIAAAEVHSSIDAGNKTRRSCAQASLVSLQIRMP 3162
            AEQSCEMWFRRY KDQN DLLDAM YYI AAEV+SSIDAGNKTRRSCAQASLVSLQIRMP
Sbjct: 3001 AEQSCEMWFRRYDKDQNEDLLDAMLYYIKAAEVYSSIDAGNKTRRSCAQASLVSLQIRMP 3060

Query: 3163 DFKWLFQSETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKPEILEEFVAE 3222
            DFKWLFQ+ETNARRALV+QSRFQEALIVAEAYDLDQPSEWALVIWNQMLKP+ILE+FVAE
Sbjct: 3061 DFKWLFQTETNARRALVDQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKPDILEDFVAE 3120

Query: 3223 FVSVLPLHPSMLADIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRSFRCLLKRT 3246
            FV+VLPLHPSML DIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRSFRCLLKRT
Sbjct: 3121 FVTVLPLHPSMLGDIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRSFRCLLKRT 3180

BLAST of Sgr026353 vs. TAIR 10
Match: AT4G39420.2 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: leaf; EXPRESSED DURING: LP.04 four leaves visible, LP.02 two leaves visible; Has 20 Blast hits to 19 proteins in 8 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 20; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 3204.1 bits (8306), Expect = 0.0e+00
Identity = 1770/3234 (54.73%), Positives = 2249/3234 (69.54%), Query Frame = 0

Query: 111  GPAILQLHKWNPSQPQLNLSEYREAFISPARQILLLHSYKHEALLLPLNTGDVRCGNDLP 170
            GP +LQLHKW PSQ QL LSE+REAFISP+RQ+LLL SY  EALLLPL  G    G+++ 
Sbjct: 8    GPTLLQLHKWEPSQFQLKLSEFREAFISPSRQLLLLLSYHSEALLLPLVAGR-SIGSEV- 67

Query: 171  NGYDINLKDLGSLAFSEVVSTAPRSEDAEGNVRCSNKSAVDIDNDSPTGNKSSRSSCNN- 230
                       SL+       +P         +  +     + +  P    +  SSCN+ 
Sbjct: 68   -----------SLSGDNEELNSPSCSGGSDPEKIESPCGSGVGSGEPGFVDNCSSSCNSF 127

Query: 231  -FLGDVSSLAWGLCGDTYKKRKDSSFKEILFVSGNHGVTAHAFCQPNKTNEEAKNMVQSE 290
             F+ D  S+AWG CGDTY + KD  F+E+LFVSGNHGVT HAFC     +++AK     E
Sbjct: 128  PFIFDAKSVAWGSCGDTYNRHKDPLFRELLFVSGNHGVTVHAFCCTKDLSDKAKGKPNGE 187

Query: 291  FWKGRWMEWGPYPTLVQNLEVQELSDS----------------CVTSGN-----VDKNRI 350
               G W+EWGP   L Q  E + +S S                 V  G       +K+  
Sbjct: 188  LRHGEWVEWGP-SRLSQKSEPERVSSSDGSKQWMQSFLIDLETTVIDGTRQSRFPEKSAF 247

Query: 351  NQNGEILRSSCYE---------FEDDALLLGNSAPKR----------------------- 410
              + E++  S            F+D+++L  ++ P+                        
Sbjct: 248  PGSAEVVSFSILNTDLPFSNLLFQDNSILPKDNMPEDGNVNDNNFLVASDPTALDEKSRA 307

Query: 411  -------YLQSFLAKVKTIEYEDD-----IWTIVDSVSADRVDET----GSRNDTLILVA 470
                    + S    +K    +       +  + D  S  R +E     G RN   I VA
Sbjct: 308  DMPVNNVSVNSLYRCIKVFSSDAHSLIGFVMELSDCASTPRRNENERSKGKRN---IFVA 367

Query: 471  RVGNLGIKWVSSVKFEKSLYISPLMEWADFCFSNDFLLCLSDSGFIFVHSALSGKHVTCI 530
            ++ + GI+WVS VKF +S  I P  EWADF  S++F++CLS SG IF++   SG  ++  
Sbjct: 368  KLFSWGIEWVSLVKFGES-SIGPTNEWADFRLSDNFVICLSVSGLIFLYDVNSGDFISHG 427

Query: 531  DVLQACGLNPKYLHLKQDLQ--MNQVDQVQD--------DVSCSRDSFYDRRKFRRLLSD 590
            D+LQ CG   + LH   D Q    + DQ+ D          +C   S  DRRKFR+L+  
Sbjct: 428  DILQTCG---RGLHSSSDRQEATAEADQLSDFQNRAPSMSKTCIVGS-TDRRKFRKLIVA 487

Query: 591  SHSSHFAVIDAFGIMYVVSAVDHMLEHYHGSENLFPHPHNFELGRAPVSWEVGGYDIGCQ 650
            SH+   A +D  G++YV+   D + + YH +    P   +  LG + V W++GG DIG +
Sbjct: 488  SHTPLIAAVDENGLVYVLCVNDFVSKEYHMAAEPIPDLLHLGLG-SLVGWKIGGMDIGQK 547

Query: 651  R-NYSESLGSHSCRVFSMKNEGVSFWGNTRFDVLQNTQDSKVCTGRKYKCSCLTA-SASI 710
            + ++  S GS     FS ++   S    +  D     Q +       Y  S L+  SA  
Sbjct: 548  KVHHPSSSGSRGEDAFSRRDLSFSASEISMSDPCLERQQNNFDRRAGYSGSWLSGFSAQP 607

Query: 711  LQN-QKFQGGELQSCTMRKMFLSTWKTNEDDCFCFSPMGLTQFIKRCNISGQKCSQVVHF 770
              N  K +     S   RKMFLS  K   DD  CFSP G T F ++      +  ++ H+
Sbjct: 608  KTNGLKLEKFRRDSHVTRKMFLSAEKLGLDDNICFSPYGFTHFSRKYTNKDDRSCKIFHY 667

Query: 771  DLHLKSEVHDDSCLKSQM--IFVDGRKEELVGEAVGCTSQGSLYLVTNNGLSVVLPSVTI 830
             L       DDS L   +    + G +E  +GE+VGC+ QG L+LVT +GLSV LPS++I
Sbjct: 668  SLQTHMTARDDSYLNYDVNKNSIQGAEENFIGESVGCSFQGFLFLVTCDGLSVFLPSISI 727

Query: 831  ASDSLPSESVARLQPGVLLGTPNQVKGLELKESNCSWSPWQVEVLDRVLLYESIDEADRL 890
             S+    E++  LQP     T    +G +   +  S  PWQVEV+DRV+L+E  + AD L
Sbjct: 728  TSNYPTIEAIEYLQP--FQTTVMGYRGRDDLAAGESRFPWQVEVIDRVILFEGPEVADHL 787

Query: 891  CSENGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEEVGILRLLFAAVHLMFQKA 950
            C ENGWDLK+VR+RR QM L YL++D++  SL+ML +V L E G+LR+LF+AV+L+ +K 
Sbjct: 788  CLENGWDLKIVRLRRLQMALDYLKYDDINESLKMLGNVKLAEEGMLRVLFSAVYLLSRKD 847

Query: 951  GTDNDISAASRLLALGTRFATRMTHRYGMAEFKRNATMFNDFSSSQEISILPHFPFRKQN 1010
              DN+ISA SRLL L T FAT M  RYG+ E++++  MF+    +Q +S LP       +
Sbjct: 848  RNDNEISAVSRLLGLATMFATEMIRRYGLLEYRKDVYMFDSKPRTQILS-LPAVSL-NID 907

Query: 1011 ELEYSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEAPISDENNLLLDEPQLVSTD 1070
             +E SR+L EM + LEI RN+   ++ KFK+  +        + D N+ L D+ QL   +
Sbjct: 908  VMENSRRLSEMGYLLEITRNIQSRITRKFKKLGKGNNEKSLNLVDPNS-LQDDSQL---E 967

Query: 1071 IIPLGSTSQYELSFPSNDLSSTVVD-----GLVMMPMVSESQLDSEDLNGDSAVVPQGVL 1130
            I+P  ++++      S  L +++ D      L  M M++  Q+  ++ +  S +VPQG++
Sbjct: 968  IVPDPASAE------SRQLDTSLFDTNEELALTPMGMMTAGQI-IDERSYASGLVPQGIV 1027

Query: 1131 -EKKVVPLENPKQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINHLRELIGEKEPH 1190
             EKKV+PLENPK+M+ARWK++ L LK VVKDALLSGRLPLAVLQLH+ H ++++ + E H
Sbjct: 1028 EEKKVLPLENPKEMMARWKANNLDLKTVVKDALLSGRLPLAVLQLHLQHSKDVVEDGEHH 1087

Query: 1191 DTFSEIRDIGRAIAYDLFLKGEIGLAIATLQRLGDDIEVSLKQLLYGTINRSFRVEIAAE 1250
            DTF+E+RDIGRAIAYDLFLKGE G+AIATLQRLG+D+E  L QL++GT+ RS R +IA E
Sbjct: 1088 DTFTEVRDIGRAIAYDLFLKGEPGVAIATLQRLGEDVEACLNQLVFGTVRRSLRYQIAEE 1147

Query: 1251 MKKYGYLGPFDQRMMDRIVHIERLYPSSNFWKTFLSRQRANMGSPSSSATPGENDLRTLR 1310
            M+K G+L P++  +++RI  IERLYPSS+FW+T+L+R++  +     +A P ++   +L 
Sbjct: 1148 MRKLGFLRPYEDNVLERISLIERLYPSSHFWETYLARRKELL----KAALPFDSSEISLH 1207

Query: 1311 F---HLINNTIIDCGEVDGVVLGSWPNANENSFVLEITEDNAHVGYWAAAAIWTNTWDQR 1370
                 L  +  I+CGEVDGVVLGSW   NE++      E +A  GYWAAAA+W+N WDQR
Sbjct: 1208 LGGSSLFQHLKIECGEVDGVVLGSWTKINESASEHAPDETDAVAGYWAAAAVWSNAWDQR 1267

Query: 1371 TTDRILLDRSLGIGIPVAWESQLDYHICHNNWDEVSRLLDMIPVSNLLDGSLQVSLDGLQ 1430
            T D I+LD+ L +G+ V W+SQL+Y++CHN+WDEV +LLD+IP   L DGSLQ++LDG +
Sbjct: 1268 TFDHIVLDQPLVMGVHVPWDSQLEYYMCHNDWDEVLKLLDLIPEDVLYDGSLQIALDGPK 1327

Query: 1431 SASAVGCNRESSFYSNYLYPLEELDAVCLYIPKAKIFRFSANIMCSKWLGMLLEEKLARQ 1490
             +S  G N   S  S Y+  +EE+DAV + +P  KIFR   +I CS WL  L+E++LAR+
Sbjct: 1328 QSS--GVNYSVSSRSEYICSIEEVDAVLMDVPYIKIFRLPGDIRCSLWLTTLMEQELARK 1387

Query: 1491 FIFLKEYWEGTMELVPLLARSGFITNRLDEIASVDDHISSSVDQRSTNNGGAFYVDSVQA 1550
             IFLKEYWE  +++V LLAR+G I     E++  ++  + S+D   +   G   VD++ A
Sbjct: 1388 LIFLKEYWENALDVVYLLARAGVILGNC-EVSFKEETCTPSLDLCLSIKKGGANVDTLNA 1447

Query: 1551 LYKVFIHHCSQYNLPFLLDLYLDHHKLVVDNNSVRSLLEAAGDCQWARWLLLSRIRGCEY 1610
            ++K+FIH+C+QYNLP LLDLYLDHH+LV+DN+S+ SL EA GD  WA+WLLLSRI+G EY
Sbjct: 1448 VHKLFIHYCTQYNLPNLLDLYLDHHELVLDNDSLSSLQEAVGDSHWAKWLLLSRIKGREY 1507

Query: 1611 DASFSNARSIMSLNLVHDPNLGVRDIDEIICTVGDIAEGGGEMAALATLMYAPSPIQDCL 1670
            DASFSNARSIMS N   +    V +IDE++CTV DIA+G GEMAALAT+M AP PIQ  L
Sbjct: 1508 DASFSNARSIMSRNGAPNSEPSVPEIDEMVCTVDDIADGAGEMAALATMMCAPVPIQKSL 1567

Query: 1671 SSSGVNRHSSSSAQCTLENLRPALQRFPTLCRALVTSAFQQDTTCNFLGPKLKNALSEYL 1730
            S+  VNRH++SSAQCTLENLR  LQRFPTL   LV++   +D + N L  K KN LSEYL
Sbjct: 1568 STGSVNRHTNSSAQCTLENLRSFLQRFPTLWSKLVSACLGEDISGNLLRTKTKNVLSEYL 1627

Query: 1731 HWRSSIFFSAGRDTSLLHMLPCWFPKAVRRLLQLYVQGPLGWQSLSALPTGQTLWERDVH 1790
            +WR  +FFS  RDTSLL MLPCWFPKAVRRL+QLY+QGPLGW S S  PTG+ L  R V 
Sbjct: 1628 NWRDGVFFSTARDTSLLQMLPCWFPKAVRRLVQLYIQGPLGWLSFSGYPTGEYLLHRGVE 1687

Query: 1791 FFMNDYEHSEISPISWEATIQKHIEDELYDSSLKETGVGLEHNLHRGRAFSAFNHLLAAR 1850
            FF+N  + +EIS ISWEA IQKHIE+EL+ +  + T +GLEH LHRGR  +AFN  L  R
Sbjct: 1688 FFINVDDPTEISAISWEAIIQKHIEEELHHTKTEGTELGLEHFLHRGRPLAAFNAFLEHR 1747

Query: 1851 VQKLKSEIQPGSAT-GPSNTQFDLQALFAPLTLREQSLLSSIIPLAITHFENSVLVASCA 1910
            V+KLK E Q GS+  G  N Q D+  L APLT  ++SLLSS+IPLAITHF +SVLVASCA
Sbjct: 1748 VEKLKLEDQSGSSIHGQRNMQSDVPMLLAPLTQSDESLLSSVIPLAITHFGDSVLVASCA 1807

Query: 1911 FLLELCGLSASMLRVDVAALRRISTFNKSGQSFENFRQLSPKGSAFHPVPLESDKVETLA 1970
            FLLELCGLSASMLR+DVA+LRRIS+F KS  + +   Q S K S FH V  E D + +LA
Sbjct: 1808 FLLELCGLSASMLRIDVASLRRISSFYKSNGNADMAHQKSLKRSMFHSVSSEDDLMGSLA 1867

Query: 1971 RALADEYLHQESSSVNKPKGTSNSAPSKRCPQVLFVLQHLEEVSLPQV-VDGNSCGSWLL 2030
            RALA+EY + + SSV K K   + + S+    ++ VL HLE+ SLP++ V   + G WLL
Sbjct: 1868 RALANEYAYPDISSVPKQKQNPSISGSQPGLPLMLVLHHLEQASLPEIGVGRKTSGYWLL 1927

Query: 2031 SGKGDGTELRNQQKAASHYWNLVTVFCRMHRLPPSSKYLALLARDNDWVGFLTEAHVGGY 2090
            +G GDG+ELR+QQ +AS +W+LVT+FC+MH++P S+KYLA+LARDNDWVGFL+EA +GGY
Sbjct: 1928 TGDGDGSELRSQQTSASLHWSLVTLFCQMHKIPLSTKYLAMLARDNDWVGFLSEAQLGGY 1987

Query: 2091 PFDTVIQVASKEFSDPRLKIHILTVLKAVQSRKNPGPSSYSDTEDKKSQTSFLDGSTYIP 2150
            PFDTV+ VASKEF D RLK HILTVL+   S+K    +S+SD   +    S  +G  Y+ 
Sbjct: 1988 PFDTVLNVASKEFGDQRLKAHILTVLRYANSKKK-ATTSFSDDPSRGLSCSPSEGGAYVS 2047

Query: 2151 VELFTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLEITAARET 2210
             ELF +LA  EK KNPG+ LL KA+E SWSILA+IASCFPDVSPLSCLT+WLEITAARET
Sbjct: 2048 AELFRVLAYSEKLKNPGEYLLSKAKEFSWSILALIASCFPDVSPLSCLTIWLEITAARET 2107

Query: 2211 TSIKVNDIASQIAENVGAAVEATNTLPAGCRSPAFHYCRKNPKRRRTMDSISKDPSVGVI 2270
            +SIKVNDI ++IAEN+GAAV +TN+LP   R   FHY R+NPKRRR    ++   SV ++
Sbjct: 2108 SSIKVNDITTKIAENIGAAVVSTNSLPTDARGVQFHYNRRNPKRRR----LTAHTSVDLL 2167

Query: 2271 SDTFSASTGASTNVSGGFIVKEEGKIVQERRPISVSYDSDEAPSSLSKMVSVLCEQKLFL 2330
            +   S +  A          + E    ++    SV  DS +  +SLSKMV+VLCEQ+LFL
Sbjct: 2168 ASANSLNISAGKTFCSH---RTEAAEDEKAEDSSVIDDSSDEHASLSKMVAVLCEQRLFL 2227

Query: 2331 PLLRAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSVRVKDEASFSHANVEGEEH 2390
            PLL+AF++FLPSCSLLPF RALQAFSQMRLSEASAHLGSF  RVK+E+    +N   + +
Sbjct: 2228 PLLKAFDLFLPSCSLLPFFRALQAFSQMRLSEASAHLGSFWGRVKEESMHFQSNTAKDVN 2287

Query: 2391 TGTSWTGSTAVKAANAVLSVCPSPYERKCLLKLLAATDFGDGGFAAAYYQRLYWKINLAE 2450
             G SW   TAVKAA+AVLS CPSPYE++CLL+LLAATDFGDGG AA YY+RLYWK+NLAE
Sbjct: 2288 FGASWISRTAVKAADAVLSACPSPYEKRCLLQLLAATDFGDGGSAATYYRRLYWKVNLAE 2347

Query: 2451 PSIRIDDGLHLGNEALDDASLLTALENNGHWEQARNWAKQLEASGGSWKSASHHVTETQA 2510
            PS+R +D L LGNE+LDD SLLTALE N  WEQARNWAKQLE  G +W S+ HHVTETQA
Sbjct: 2348 PSLREND-LDLGNESLDDGSLLTALEKNRQWEQARNWAKQLETIGATWTSSVHHVTETQA 2407

Query: 2511 ESMVAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKDLPAKELH 2570
            ESMVAEWKEFLWDV EER+ALWGHCQ LFIRYSFPALQAGLFFL+HAE VEKDLPA+E++
Sbjct: 2408 ESMVAEWKEFLWDVPEERIALWGHCQTLFIRYSFPALQAGLFFLRHAEVVEKDLPAREIY 2467

Query: 2571 ELLLLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKNERDLNINNSSRECI 2630
            ELLLLSLQWLSG+ T+S+PVYPLHLLREIET+VWLLAVE+E+ +KN    + ++  ++ +
Sbjct: 2468 ELLLLSLQWLSGLTTLSHPVYPLHLLREIETRVWLLAVEAESHVKNVGAFSPSSIGKDMV 2527

Query: 2631 SRNSSSIIDWTASIISKMDKHIST-MKNKSMDKHEVRENSQTHHKSHVLDAGLSTAGGGN 2690
            +  SS++ID TASII+KMD HIS+  KN+  +KH+ R   Q + ++      +    G +
Sbjct: 2528 NGYSSNLIDRTASIITKMDSHISSATKNRIGEKHDARAAGQGNQRNQDTSTSIF---GAS 2587

Query: 2691 TKAKRRTKGSVLIRRPLVDSTDMNTNPEDGCVPSNFKNDLHLQDENLKMDTSLSGWEERI 2750
            TK KRR KG+V   R  VDS+D NT+ ED     N K++  LQ+E+  ++ SLS WEE I
Sbjct: 2588 TKPKRRAKGNVPQIRHFVDSSDRNTDFEDSSSLINIKSEFQLQEESTGLEISLSKWEESI 2647

Query: 2751 GPAEVDRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDAAFTLAAISTPNREVSMSM 2810
             PAE++RAVLSLLEFGQ+TAAKQLQ KL+PG +PSE +++DA   LA +STP R+V +SM
Sbjct: 2648 EPAELERAVLSLLEFGQVTAAKQLQLKLAPGNLPSELIILDAVMKLAMLSTPCRQVLLSM 2707

Query: 2811 LDEDLCSVILAYDIPVDQ-YLNPLQVLEILATIFAEGGGRGLCRRVIAVVKAANVLGLPF 2870
            LD+++ SVI ++ + +DQ  + PLQ+LE L+TI  EG GRGL R++IAV+KAAN+LGL F
Sbjct: 2708 LDDEVRSVIQSHSLKIDQPMIEPLQILENLSTILNEGSGRGLARKIIAVIKAANILGLTF 2767

Query: 2871 SEAYNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAAHRGGY 2930
            +EAY KQPIELL+LLSLKAQ+SFEEA  LVQTHSMPAASIAQILAESFLKGLLAAHRGGY
Sbjct: 2768 TEAYQKQPIELLRLLSLKAQDSFEEACLLVQTHSMPAASIAQILAESFLKGLLAAHRGGY 2827

Query: 2931 MDSQKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELLILSHH 2990
            +DSQK+EGPAPLLWRFSDFLKW+ELCPSE EIGHALMRLVITGQEIPHACEVELLILSHH
Sbjct: 2828 IDSQKEEGPAPLLWRFSDFLKWAELCPSEQEIGHALMRLVITGQEIPHACEVELLILSHH 2887

Query: 2991 FYKSSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILIENGQL 3050
            FYKSS CLDGVDVLVALAATRVEAYVAEGDF CLARLITGVGNF+AL+FIL ILIENGQL
Sbjct: 2888 FYKSSTCLDGVDVLVALAATRVEAYVAEGDFSCLARLITGVGNFHALNFILNILIENGQL 2947

Query: 3051 ELLLQKFSAAADTSAGSAEAVRGFRMAVLTSLKHFNPTDLDAFAKVYSHFDMKHETAALL 3110
            +LLLQKFSAAAD + G+A+AVR FRMAVLTSL  +NP D DAFA VY HFDMKHETA LL
Sbjct: 2948 DLLLQKFSAAADANTGTAQAVRSFRMAVLTSLNLYNPNDHDAFAMVYKHFDMKHETATLL 3007

Query: 3111 ESQAEQSCEMWFRRYYKDQNADLLDAMHYYIAAAEVHSSIDAGNKTRRSCAQASLVSLQI 3170
            E++A+Q+ + WF RY KDQN DLLD+M YYI AAEVH+SIDAGNK R++C QASLVSLQI
Sbjct: 3008 EARADQAAQQWFLRYDKDQNEDLLDSMRYYIEAAEVHTSIDAGNKARKACGQASLVSLQI 3067

Query: 3171 RMPDFKWLFQSETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKPEILEEF 3230
            RMPD KWL  SETNARRALV+QSRFQEALIVAEAY L+QPSEWALV+WN MLKPE+ E+F
Sbjct: 3068 RMPDSKWLCLSETNARRALVDQSRFQEALIVAEAYGLNQPSEWALVLWNLMLKPELAEDF 3127

Query: 3231 VAEFVSVLPLHPSMLADIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRSFRCLL 3246
            VAEFV+VLPL  SML ++ARFYR+E+AARGDQSQFSVWLTGGGLPAEWAKY+ RSFRCLL
Sbjct: 3128 VAEFVAVLPLQASMLLELARFYRAEMAARGDQSQFSVWLTGGGLPAEWAKYMWRSFRCLL 3184

BLAST of Sgr026353 vs. TAIR 10
Match: AT4G39420.1 (unknown protein; Has 46 Blast hits to 40 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 44; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 2855.9 bits (7402), Expect = 0.0e+00
Identity = 1505/2451 (61.40%), Positives = 1873/2451 (76.42%), Query Frame = 0

Query: 808  NGWDLKVVRMRRFQMTLHYLRFDELERSLEMLVDVDLEEVGILRLLFAAVHLMFQKAGTD 867
            +GWDLK+VR+RR QM L YL++D++  SL+ML +V L E G+LR+LF+AV+L+ +K   D
Sbjct: 98   SGWDLKIVRLRRLQMALDYLKYDDINESLKMLGNVKLAEEGMLRVLFSAVYLLSRKDRND 157

Query: 868  NDISAASRLLALGTRFATRMTHRYGMAEFKRNATMFNDFSSSQEISILPHFPFRKQNELE 927
            N+ISA SRLL L T FAT M  RYG+ E++++  MF+    +Q +S LP       + +E
Sbjct: 158  NEISAVSRLLGLATMFATEMIRRYGLLEYRKDVYMFDSKPRTQILS-LPAVSL-NIDVME 217

Query: 928  YSRKLHEMSHFLEIIRNLHCHLSSKFKRPCQELVAGEAPISDENNLLLDEPQLVSTDIIP 987
             SR+L EM + LEI RN+   ++ KFK+  +        + D N+ L D+ QL   +I+P
Sbjct: 218  NSRRLSEMGYLLEITRNIQSRITRKFKKLGKGNNEKSLNLVDPNS-LQDDSQL---EIVP 277

Query: 988  LGSTSQYELSFPSNDLSSTVVD-----GLVMMPMVSESQLDSEDLNGDSAVVPQGVL-EK 1047
              ++++      S  L +++ D      L  M M++  Q+  ++ +  S +VPQG++ EK
Sbjct: 278  DPASAE------SRQLDTSLFDTNEELALTPMGMMTAGQI-IDERSYASGLVPQGIVEEK 337

Query: 1048 KVVPLENPKQMIARWKSDKLPLKNVVKDALLSGRLPLAVLQLHINHLRELIGEKEPHDTF 1107
            KV+PLENPK+M+ARWK++ L LK VVKDALLSGRLPLAVLQLH+ H ++++ + E HDTF
Sbjct: 338  KVLPLENPKEMMARWKANNLDLKTVVKDALLSGRLPLAVLQLHLQHSKDVVEDGEHHDTF 397

Query: 1108 SEIRDIGRAIAYDLFLKGEIGLAIATLQRLGDDIEVSLKQLLYGTINRSFRVEIAAEMKK 1167
            +E+RDIGRAIAYDLFLKGE G+AIATLQRLG+D+E  L QL++GT+ RS R +IA EM+K
Sbjct: 398  TEVRDIGRAIAYDLFLKGEPGVAIATLQRLGEDVEACLNQLVFGTVRRSLRYQIAEEMRK 457

Query: 1168 YGYLGPFDQRMMDRIVHIERLYPSSNFWKTFLSRQRANMGSPSSSATPGENDLRTLRF-- 1227
             G+L P++  +++RI  IERLYPSS+FW+T+L+R++  +     +A P ++   +L    
Sbjct: 458  LGFLRPYEDNVLERISLIERLYPSSHFWETYLARRKELL----KAALPFDSSEISLHLGG 517

Query: 1228 -HLINNTIIDCGEVDGVVLGSWPNANENSFVLEITEDNAHVGYWAAAAIWTNTWDQRTTD 1287
              L  +  I+CGEVDGVVLGSW   NE++      E +A  GYWAAAA+W+N WDQRT D
Sbjct: 518  SSLFQHLKIECGEVDGVVLGSWTKINESASEHAPDETDAVAGYWAAAAVWSNAWDQRTFD 577

Query: 1288 RILLDRSLGIGIPVAWESQLDYHICHNNWDEVSRLLDMIPVSNLLDGSLQVSLDGLQSAS 1347
             I+LD+ L +G+ V W+SQL+Y++CHN+WDEV +LLD+IP   L DGSLQ++LDG + +S
Sbjct: 578  HIVLDQPLVMGVHVPWDSQLEYYMCHNDWDEVLKLLDLIPEDVLYDGSLQIALDGPKQSS 637

Query: 1348 AVGCNRESSFYSNYLYPLEELDAVCLYIPKAKIFRFSANIMCSKWLGMLLEEKLARQFIF 1407
              G N   S  S Y+  +EE+DAV + +P  KIFR   +I CS WL  L+E++LAR+ IF
Sbjct: 638  --GVNYSVSSRSEYICSIEEVDAVLMDVPYIKIFRLPGDIRCSLWLTTLMEQELARKLIF 697

Query: 1408 LKEYWEGTMELVPLLARSGFITNRLDEIASVDDHISSSVDQRSTNNGGAFYVDSVQALYK 1467
            LKEYWE  +++V LLAR+G I     E++  ++  + S+D   +   G   VD++ A++K
Sbjct: 698  LKEYWENALDVVYLLARAGVILGNC-EVSFKEETCTPSLDLCLSIKKGGANVDTLNAVHK 757

Query: 1468 VFIHHCSQYNLPFLLDLYLDHHKLVVDNNSVRSLLEAAGDCQWARWLLLSRIRGCEYDAS 1527
            +FIH+C+QYNLP LLDLYLDHH+LV+DN+S+ SL EA GD  WA+WLLLSRI+G EYDAS
Sbjct: 758  LFIHYCTQYNLPNLLDLYLDHHELVLDNDSLSSLQEAVGDSHWAKWLLLSRIKGREYDAS 817

Query: 1528 FSNARSIMSLNLVHDPNLGVRDIDEIICTVGDIAEGGGEMAALATLMYAPSPIQDCLSSS 1587
            FSNARSIMS N   +    V +IDE++CTV DIA+G GEMAALAT+M AP PIQ  LS+ 
Sbjct: 818  FSNARSIMSRNGAPNSEPSVPEIDEMVCTVDDIADGAGEMAALATMMCAPVPIQKSLSTG 877

Query: 1588 GVNRHSSSSAQCTLENLRPALQRFPTLCRALVTSAFQQDTTCNFLGPKLKNALSEYLHWR 1647
             VNRH++SSAQCTLENLR  LQRFPTL   LV++   +D + N L  K KN   EYL+WR
Sbjct: 878  SVNRHTNSSAQCTLENLRSFLQRFPTLWSKLVSACLGEDISGNLLRTKTKN---EYLNWR 937

Query: 1648 SSIFFSAGRDTSLLHMLPCWFPKAVRRLLQLYVQGPLGWQSLSALPTGQTLWERDVHFFM 1707
              +FFS  RDTSLL MLPCWFPKAVRRL+QLY+QGPLGW S S  PTG+ L  R V FF+
Sbjct: 938  DGVFFSTARDTSLLQMLPCWFPKAVRRLVQLYIQGPLGWLSFSGYPTGEYLLHRGVEFFI 997

Query: 1708 NDYEHSEISPISWEATIQKHIEDELYDSSLKETGVGLEHNLHRGRAFSAFNHLLAARVQK 1767
            N  + +EIS ISWEA IQKHIE+EL+ +  + T +GLEH LHRGR  +AFN  L  RV+K
Sbjct: 998  NVDDPTEISAISWEAIIQKHIEEELHHTKTEGTELGLEHFLHRGRPLAAFNAFLEHRVEK 1057

Query: 1768 LKSEIQPGSAT-GPSNTQFDLQALFAPLTLREQSLLSSIIPLAITHFENSVLVASCAFLL 1827
            LK E Q GS+  G  N Q D+  L APLT  ++SLLSS+IPLAITHF +SVLVASCAFLL
Sbjct: 1058 LKLEDQSGSSIHGQRNMQSDVPMLLAPLTQSDESLLSSVIPLAITHFGDSVLVASCAFLL 1117

Query: 1828 ELCGLSASMLRVDVAALRRISTFNKSGQSFENFRQLSPKGSAFHPVPLESDKVETLARAL 1887
            ELCGLSASMLR+DVA+LRRIS+F KS  + +   Q S K S FH V  E D + +LARAL
Sbjct: 1118 ELCGLSASMLRIDVASLRRISSFYKSNGNADMAHQKSLKRSMFHSVSSEDDLMGSLARAL 1177

Query: 1888 ADEYLHQESSSVNKPKGTSNSAPSKRCPQVLFVLQHLEEVSLPQV-VDGNSCGSWLLSGK 1947
            A+EY + + SSV K K   + + S+    ++ VL HLE+ SLP++ V   + G WLL+G 
Sbjct: 1178 ANEYAYPDISSVPKQKQNPSISGSQPGLPLMLVLHHLEQASLPEIGVGRKTSGYWLLTGD 1237

Query: 1948 GDGTELRNQQKAASHYWNLVTVFCRMHRLPPSSKYLALLARDNDWVGFLTEAHVGGYPFD 2007
            GDG+ELR+QQ +AS +W+LVT+FC+MH++P S+KYLA+LARDNDWVGFL+EA +GGYPFD
Sbjct: 1238 GDGSELRSQQTSASLHWSLVTLFCQMHKIPLSTKYLAMLARDNDWVGFLSEAQLGGYPFD 1297

Query: 2008 TVIQVASKEFSDPRLKIHILTVLKAVQSRKNPGPSSYSDTEDKKSQTSFLDGSTYIPVEL 2067
            TV+ VASKEF D RLK HILTVL+   S+K    +S+SD   +    S  +G  Y+  EL
Sbjct: 1298 TVLNVASKEFGDQRLKAHILTVLRYANSKKK-ATTSFSDDPSRGLSCSPSEGGAYVSAEL 1357

Query: 2068 FTILAECEKKKNPGKALLIKAEELSWSILAMIASCFPDVSPLSCLTVWLEITAARETTSI 2127
            F +LA  EK KNPG+ LL KA+E SWSILA+IASCFPDVSPLSCLT+WLEITAARET+SI
Sbjct: 1358 FRVLAYSEKLKNPGEYLLSKAKEFSWSILALIASCFPDVSPLSCLTIWLEITAARETSSI 1417

Query: 2128 KVNDIASQIAENVGAAVEATNTLPAGCRSPAFHYCRKNPKRRRTMDSISKDPSVGVISDT 2187
            KVNDI ++IAEN+GAAV +TN+LP   R   FHY R+NPKRRR    ++   SV +++  
Sbjct: 1418 KVNDITTKIAENIGAAVVSTNSLPTDARGVQFHYNRRNPKRRR----LTAHTSVDLLASA 1477

Query: 2188 FSASTGASTNVSGGFIVKEEGKIVQERRPISVSYDSDEAPSSLSKMVSVLCEQKLFLPLL 2247
             S +  A          + E    ++    SV  DS +  +SLSKMV+VLCEQ+LFLPLL
Sbjct: 1478 NSLNISAGKTFCSH---RTEAAEDEKAEDSSVIDDSSDEHASLSKMVAVLCEQRLFLPLL 1537

Query: 2248 RAFEMFLPSCSLLPFIRALQAFSQMRLSEASAHLGSFSVRVKDEASFSHANVEGEEHTGT 2307
            +AF++FLPSCSLLPF RALQAFSQMRLSEASAHLGSF  RVK+E+    +N   + + G 
Sbjct: 1538 KAFDLFLPSCSLLPFFRALQAFSQMRLSEASAHLGSFWGRVKEESMHFQSNTAKDVNFGA 1597

Query: 2308 SWTGSTAVKAANAVLSVCPSPYERKCLLKLLAATDFGDGGFAAAYYQRLYWKINLAEPSI 2367
            SW   TAVKAA+AVLS CPSPYE++CLL+LLAATDFGDGG AA YY+RLYWK+NLAEPS+
Sbjct: 1598 SWISRTAVKAADAVLSACPSPYEKRCLLQLLAATDFGDGGSAATYYRRLYWKVNLAEPSL 1657

Query: 2368 RIDDGLHLGNEALDDASLLTALENNGHWEQARNWAKQLEASGGSWKSASHHVTETQAESM 2427
            R +D L LGNE+LDD SLLTALE N  WEQARNWAKQLE  G +W S+ HHVTETQAESM
Sbjct: 1658 REND-LDLGNESLDDGSLLTALEKNRQWEQARNWAKQLETIGATWTSSVHHVTETQAESM 1717

Query: 2428 VAEWKEFLWDVQEERVALWGHCQALFIRYSFPALQAGLFFLKHAEAVEKDLPAKELHELL 2487
            VAEWKEFLWDV EER+ALWGHCQ LFIRYSFPALQAGLFFL+HAE VEKDLPA+E++ELL
Sbjct: 1718 VAEWKEFLWDVPEERIALWGHCQTLFIRYSFPALQAGLFFLRHAEVVEKDLPAREIYELL 1777

Query: 2488 LLSLQWLSGMFTMSYPVYPLHLLREIETKVWLLAVESEAELKNERDLNINNSSRECISRN 2547
            LLSLQWLSG+ T+S+PVYPLHLLREIET+VWLLAVE+E+ +KN    + ++  ++ ++  
Sbjct: 1778 LLSLQWLSGLTTLSHPVYPLHLLREIETRVWLLAVEAESHVKNVGAFSPSSIGKDMVNGY 1837

Query: 2548 SSSIIDWTASIISKMDKHIST-MKNKSMDKHEVRENSQTHHKSHVLDAGLSTAGGGNTKA 2607
            SS++ID TASII+KMD HIS+  KN+  +KH+ R   Q + ++      +    G +TK 
Sbjct: 1838 SSNLIDRTASIITKMDSHISSATKNRIGEKHDARAAGQGNQRNQDTSTSIF---GASTKP 1897

Query: 2608 KRRTKGSVLIRRPLVDSTDMNTNPEDGCVPSNFKNDLHLQDENLKMDTSLSGWEERIGPA 2667
            KRR KG+V   R  VDS+D NT+ ED     N K++  LQ+E+  ++ SLS WEE I PA
Sbjct: 1898 KRRAKGNVPQIRHFVDSSDRNTDFEDSSSLINIKSEFQLQEESTGLEISLSKWEESIEPA 1957

Query: 2668 EVDRAVLSLLEFGQITAAKQLQQKLSPGQVPSEFLLVDAAFTLAAISTPNREVSMSMLDE 2727
            E++RAVLSLLEFGQ+TAAKQLQ KL+PG +PSE +++DA   LA +STP R+V +SMLD+
Sbjct: 1958 ELERAVLSLLEFGQVTAAKQLQLKLAPGNLPSELIILDAVMKLAMLSTPCRQVLLSMLDD 2017

Query: 2728 DLCSVILAYDIPVDQ-YLNPLQVLEILATIFAEGGGRGLCRRVIAVVKAANVLGLPFSEA 2787
            ++ SVI ++ + +DQ  + PLQ+LE L+TI  EG GRGL R++IAV+KAAN+LGL F+EA
Sbjct: 2018 EVRSVIQSHSLKIDQPMIEPLQILENLSTILNEGSGRGLARKIIAVIKAANILGLTFTEA 2077

Query: 2788 YNKQPIELLQLLSLKAQESFEEANFLVQTHSMPAASIAQILAESFLKGLLAAHRGGYMDS 2847
            Y KQPIELL+LLSLKAQ+SFEEA  LVQTHSMPAASIAQILAESFLKGLLAAHRGGY+DS
Sbjct: 2078 YQKQPIELLRLLSLKAQDSFEEACLLVQTHSMPAASIAQILAESFLKGLLAAHRGGYIDS 2137

Query: 2848 QKDEGPAPLLWRFSDFLKWSELCPSEPEIGHALMRLVITGQEIPHACEVELLILSHHFYK 2907
            QK+EGPAPLLWRFSDFLKW+ELCPSE EIGHALMRLVITGQEIPHACEVELLILSHHFYK
Sbjct: 2138 QKEEGPAPLLWRFSDFLKWAELCPSEQEIGHALMRLVITGQEIPHACEVELLILSHHFYK 2197

Query: 2908 SSACLDGVDVLVALAATRVEAYVAEGDFPCLARLITGVGNFYALSFILGILIENGQLELL 2967
            SS CLDGVDVLVALAATRVEAYVAEGDF CLARLITGVGNF+AL+FIL ILIENGQL+LL
Sbjct: 2198 SSTCLDGVDVLVALAATRVEAYVAEGDFSCLARLITGVGNFHALNFILNILIENGQLDLL 2257

Query: 2968 LQKFSAAADTSAGSAEAVRGFRMAVLTSLKHFNPTDLDAFAKVYSHFDMKHETAALLESQ 3027
            LQKFSAAAD + G+A+AVR FRMAVLTSL  +NP D DAFA VY HFDMKHETA LLE++
Sbjct: 2258 LQKFSAAADANTGTAQAVRSFRMAVLTSLNLYNPNDHDAFAMVYKHFDMKHETATLLEAR 2317

Query: 3028 AEQSCEMWFRRYYKDQNADLLDAMHYYIAAAEVHSSIDAGNKTRRSCAQASLVSLQIRMP 3087
            A+Q+ + WF RY KDQN DLLD+M YYI AAEVH+SIDAGNK R++C QASLVSLQIRMP
Sbjct: 2318 ADQAAQQWFLRYDKDQNEDLLDSMRYYIEAAEVHTSIDAGNKARKACGQASLVSLQIRMP 2377

Query: 3088 DFKWLFQSETNARRALVEQSRFQEALIVAEAYDLDQPSEWALVIWNQMLKPEILEEFVAE 3147
            D KWL  SETNARRALV+QSRFQEALIVAEAY L+QPSEWALV+WN MLKPE+ E+FVAE
Sbjct: 2378 DSKWLCLSETNARRALVDQSRFQEALIVAEAYGLNQPSEWALVLWNLMLKPELAEDFVAE 2437

Query: 3148 FVSVLPLHPSMLADIARFYRSEVAARGDQSQFSVWLTGGGLPAEWAKYLGRSFRCLLKRT 3207
            FV+VLPL  SML ++ARFYR+E+AARGDQSQFSVWLTGGGLPAEWAKY+ RSFRCLLKRT
Sbjct: 2438 FVAVLPLQASMLLELARFYRAEMAARGDQSQFSVWLTGGGLPAEWAKYMWRSFRCLLKRT 2497

Query: 3208 RDLRLRLQLAQVATGFVDVMDACTKALDKVPENAGPLVLRKGHGGTYLPLM 3246
            RDLRLRLQLA  ATGF D++D C  ALDKVPENAGPLVL+KGHGG YLPLM
Sbjct: 2498 RDLRLRLQLATTATGFADMVDVCMNALDKVPENAGPLVLKKGHGGGYLPLM 2513


HSP 2 Score: 78.6 bits (192), Expect = 1.0e-13
Identity = 38/51 (74.51%), Positives = 42/51 (82.35%), Query Frame = 0

Query: 111 GPAILQLHKWNPSQPQLNLSEYREAFISPARQILLLHSYKHEALLLPLNTG 162
           GP +LQLHKW PSQ QL LSE+REAFISP+RQ+LLL SY  EALLLPL  G
Sbjct: 8   GPTLLQLHKWEPSQFQLKLSEFREAFISPSRQLLLLLSYHSEALLLPLVAG 58
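
The Identity and Positives figures in each HSP summary line are ratios of matching positions over the alignment length. A minimal Python sketch, assuming the summary-line layout shown in this report, parses one such line and recomputes both percentages. The function name `parse_hsp_stats` is illustrative and not part of any BLAST tool.

```python
import re

# Summary-line pattern as it appears in this report. "Posi?tives" also accepts
# the "Postives" spelling that some report generators emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line and recompute its percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Recomputed from the raw counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 38/51 (74.51%), Positives = 42/51 (82.35%), Query Frame = 0"
    )
    print(stats["identity_pct"], stats["positive_pct"])
```

Comparing the recomputed and reported values is a quick sanity check when alignment blocks are copied out of a page like this one.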

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038881148.1  0.0e+00   86.48         uncharacterized protein LOC120072742 [Benincasa hispida]
XP_023518580.1  0.0e+00   86.32         uncharacterized protein LOC111782046 isoform X1 [Cucurbita pepo subsp. pepo]
XP_023518582.1  0.0e+00   86.29         uncharacterized protein LOC111782046 isoform X2 [Cucurbita pepo subsp. pepo]
XP_023004088.1  0.0e+00   85.64         uncharacterized protein LOC111497504 isoform X1 [Cucurbita maxima]
XP_011657786.1  0.0e+00   85.19         uncharacterized protein LOC101206379 [Cucumis sativus] >KGN48416.1 hypothetical ...

Match Name      E-value   Identity (%)  Description
Q55GD2          4.4e-25   17.10         Protein DDB_G0268328 OS=Dictyostelium discoideum OX=44689 GN=DDB_G0268328 PE=4 S...
Q3UHA3          5.2e-18   23.11         Spatacsin OS=Mus musculus OX=10090 GN=Spg11 PE=1 SV=3
Q96JI7          9.6e-12   27.84         Spatacsin OS=Homo sapiens OX=9606 GN=SPG11 PE=1 SV=3

Match Name      E-value   Identity (%)  Description
A0A6J1KYH3      0.0e+00   85.64         uncharacterized protein LOC111497504 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A0A0KKY4      0.0e+00   85.19         Spatacsin_C domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G486890...
A0A6J1KV83      0.0e+00   85.61         uncharacterized protein LOC111497504 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1HEZ7      0.0e+00   85.86         uncharacterized protein LOC111463367 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1HE64      0.0e+00   85.82         uncharacterized protein LOC111463367 isoform X2 OS=Cucurbita moschata OX=3662 GN...

Match Name      E-value   Identity (%)  Description
AT4G39420.2     0.0e+00   54.73         unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_c...
AT4G39420.1     0.0e+00   61.40         unknown protein; Has 46 Blast hits to 40 proteins in 10 species: Archae - 0; Bac...

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description               Source       Source Term    Source Description   Alignment
IPR028107  Spatacsin, C-terminal domain  PFAM         PF14649        Spatacsin_C          coord: 2878..3154; e-value: 1.0E-67; score: 228.4
None       No IPR available              MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 207..225
None       No IPR available              MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 196..225
None       No IPR available              PANTHER      PTHR13650:SF0  SPATACSIN            coord: 231..3245
IPR028103  Spatacsin                     PANTHER      PTHR13650      UNCHARACTERIZED      coord: 231..3245
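
The Alignment field of each InterPro row gives the matched span on the protein. As a rough sketch, assuming the coordinates are 1-based and inclusive (this page does not state the convention), the length of a span can be derived as follows. `span_length` is a hypothetical helper name, not an InterProScan function.

```python
import re

# Extracts "start..end" from an Alignment field such as "coord: 2878..3154".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(alignment_field):
    """Return the number of residues covered by a 'coord: start..end' field.

    Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
    """
    m = COORD.search(alignment_field)
    if m is None:
        raise ValueError("no 'coord: start..end' found")
    start, end = int(m.group(1)), int(m.group(2))
    if end < start:
        raise ValueError("end coordinate precedes start")
    return end - start + 1


if __name__ == "__main__":
    # Pfam Spatacsin_C match and the PANTHER family match from the table above.
    print(span_length("coord: 2878..3154"))
    print(span_length("coord: 231..3245"))
```

Under that assumption the Pfam Spatacsin_C match covers a short C-terminal stretch, while the PANTHER match spans most of the protein.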

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr026353.1   Sgr026353.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0005737      cytoplasm
molecular_function  GO:0016787      hydrolase activity