Moc09g05800 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc09g05800
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: sacsin isoform X1
Location: chr9: 4431001 .. 4459761 (-)
RNA-Seq Expression: Moc09g05800
Synteny: Moc09g05800
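
The Location field above can be read programmatically. The following is a minimal sketch, not part of the database's own tooling: it assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state), and the helper names `parse_location`, `span_length`, and `reverse_complement` are hypothetical. Since the gene is annotated on the minus strand, a reverse-complement helper is included for converting between the reference strand and the gene's own orientation.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # chromosome or scaffold name, e.g. "chr9"
    start: int   # assumed 1-based, inclusive
    end: int     # assumed 1-based, inclusive
    strand: str  # "+" or "-"


def parse_location(text: str) -> Location:
    """Parse a string such as 'chr9: 4431001 .. 4459761 (-)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length in bp of the genomic span, assuming inclusive coordinates."""
    return loc.end - loc.start + 1


def reverse_complement(seq: str) -> str:
    """Reverse-complement a DNA string (A/C/G/T only, case preserved)."""
    table = str.maketrans("ACGTacgt", "TGCAtgca")
    return seq.translate(table)[::-1]


if __name__ == "__main__":
    loc = parse_location("chr9: 4431001 .. 4459761 (-)")
    print(loc.seqid, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption, the span covers 4459761 - 4431001 + 1 = 28761 bp, which can be compared against the length of the gene sequence (with intron) listed below.
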
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGTCTGAATCGACGTCGCTAGACTCGATTTTTCTGGAGGATTTTGGCCAGAAGGTTGACCTGACTCGGCGGATTCGTGAGGTGCTCCTTAACTATCCTGAGGGGACTACTGTTCTCAAGGAGTTGGTGCAGAACGCCGATGATGCAGGCGCCACCAAGGTTTGCCTCTGTCTCGACCGCAGGGTCCACGGCAGCGAGTCGTTGCTGTCGGAGTCACTGGCACCGTTTCAAGGGCCTGCGCTTTTGGCTTATAACAATGCGGTATTTACTGAAGAAGATTTTGTCAGCATTTCGAGAATTGGTGGTAGTAATAAGCACGGGCAAGCCTGGAAGACTGGTCGATTTGGGTAAATATCTCCGTTTCAATATCTCTAGACGTGTGTTTAATCTGACTCGTTTTGCTTATATGATGGAATTATGGATAGTATACATTTCACGTGTTATTTTGCCAAATGAATTGTACATATTTATCTTGTCGTTGAACCTCATTTCATTCCCTCCGGCAAGTGAAAATGTTGCATTATAGTGAACCAATCTATTTTTGCTTCTGCTTTAAGAACCTTTAGCTAGCGGTGTTAATGAGACAGGATCTTCATGCACTAAAATTTCGAATTTCAGGCTTGTATGTAGAGTTTCATTTTTTACTCTTTTCATTTAATTTAAGTAAAACAAGCTCCCTCAATGTTTTTATGGGCCTCTCCTTCAAGTCAGGTGGTCTCTGACACTTGTCTTTGGTGACAGTTTTTATAAGCTCCAGTTATGTGCATAATTTATTGTTCCAAACTGGCTTACTGGATTTAACTTTTTTCGATATCAGAAAGTAGTAGGCTATTGGTAATGGTTACCATATATTTGGAAGTAATTGGCAACGCTCAAGTTTGCATGATCATCATTCATCTCATAAACAATTCCATAGGCAACCATGAATTTTGGCAGGATCACGTGTTTCCATAAACTAGTGCCGTGCTTTCTCCTTCCAAAATATTATGGACAAACGAATAGATAACCTACAATTAAATGTTATATATGGATGCCTATGATCTAATAGATCTGTCCAGCGACTCGGGTATAATTGTTGCAACTGCAGTTGATGGTTTTGATAACTTGAATTCATCTTCCTTTGGCTTCTTAACTATATTTTCTTACTTCCAATATCACAACTGAAGTAATCTTAGTGAAGTGCAAGCAATTTCATTAAAAAAAAGCACAATGCAAAATTAGATGTTGACAACTTGACATGCTTGAAGCAAGCAGATAAGTTGTTCAAATTTTATAACATTTTCGGAAAGCACTGTCTGCGGTTATTTTCCTTTTGGTAGTTGTTACCATATTTTACAAGAGTTGCAAAGGTTCTATTATTGCTGTTTAAAATTTATCATTGGATGTGATAAAAGCCGAGTCAGTAGTGAAGATAATAGTGCCCTTCTCCATTCTCCTATCAACTTTCATTCCATGAATTCAATATTTATCGCATTATCAAAATAAATGGAAATACAATTAAGTTTCTGTAATATTTTGCTTCCAACATTGACTTCATAATAACTATTTTTTGGTTTAATTTTATAGGGTTGGCTTCAACTCTGTGTACCATTTAACTGAGCTGCCTTCATTTGTTAGTGGCAAATATGTTGTAATGTTTGATCCTCAGGGCATTTACCTTCCAAAGGTTTCTGCAGCAAATCCTGGAAAGCGAATTGACTTCATTCGTTGCTCTGCCATCTCACAATATAGAGATCAGTTTCTTCCTTATTGCGCTTTTGATTGTGACATGGAAAGTTCTTTTGATGGAACTTTATTCCGGTTGCCATTAAGAAATGCAGATCAAGCAGCCAGAAGTAAAATTTCAAGGCAAGCTTATACAGAAGAAGATATTTCTTCCATGTTTGCTGAACTTTATGAAGAAGGAGTCTTGACGTTACTCTTTTTGAAAAGTGTTTCATGTATTGAAATGTTTGTATGGAATGATGGTGAGGCAGAACCCCAGAAGCTTTATTCAT
TCTCTGTGCGTTCAGCAAGCAGTGATATTATTTGGCACAGGCAGATGTTACTGCGGTTATCAAAATCGACAACTTCCACTCAGAGTGAGGTAGATAGCTTTTCACTGGACTTCTTGAGTCAAGCGACAACTGGGACCCAAATAGAGGAAAGGATTGATAGCTTTTTTATAGTTCAAACCATGGCATCAGCTACCAGTAGGATTAGCTCTTTTGCTGCTACTGCATCGAAAGAATATGATATTCACTTGCTACCTTGGGCATCGCTTGCTGTTTGCACATCCGATGACTCCTCAAAGGTAACCTCGGTAATTTCAAATTTTCTCTCTTATTTTGTGCAATCTCTCTGTCACCCTCACACCTGCACCTGCACCCCACTGTTCCCATACCCACATGTATATGTACACCAACTTGCACCAAAATACTCTCTGTTCATGGATGACTTCTTGAATTTCCTGATCATTTGGAATTTCTCTTTTTTACAAGAGAATAAATTGACTGAATAGTGAATGAATAGACGATGTTGTGTAGCGTAGTTAGTTTTGTTTTCTTCTATGATATAGTTGGGAGGCTAAGAGCTCCTTATATCCAACAGTTATGGCTGTCATATGAGTGCTTGTATTGGCATTTTATTGTTGAATTTAGGCAGCTTAATTCTATTCAAACACTGAATAAATTTCCATAATTTTTTTATGAGAAATGGGGAAATATATATTAACATCAAAATTAAATTTCCATCATTTTGGTATTGGTAGTTTTGTTTTTGTGCCAAATATTATGGTTGGGAGCTGTGCTTTCTCTTGTAAATTATTTTCATGAATGAAATTGAGTTTCATATAGAAAACAGTTTCCTGCTCCTGCTAGAACAGATAACAATCTAAAGCAAATAAAGGTTTTTCTGCATTTTCAAGAAAGTCAATTTGAATGCCTACACCATTGTACCACCTCTATATTTGAACATCCATCCCTTCATTTAATGTTTTCTCATTTGCATGAGTTTGCCTCTGATGAATTTTCTTTGTCTGCTTGACTCGTGGGTGAACTGATTGTCTATTTATGTATTTTGAATGGAATCATTCTCTCTCAATGAAACCTCAGTTTCTAATAAGAAAAAGAAAATTATTTTGAATGGAATGAATTAATGGTAGTAAATTTTACTGTTTCAGCATCTTTTGAAAAAAAAGTTTGTTATTTGTTGTTTTAGCAGTCTAGTCGGTGGGCTATAATCTTATAATAAACTTGGAATTATCGACATGGTTCTTGAATTTTTGTTAAAATGGAGAACAGAGATGGATACTGGATTGCATTGATAGATAGAAATTGGGGGAAATTGGATTTCTTTTTGCTGTTGATGATACAGGTCCTTAGCTGCTGTAACTGCAACAACTATCAAATATGGCAGCCTGTATATATACTGCTTGGACCAATTTTGGGAGAAATAATTCAATTCAATAAAACAGGAAAAAGTATTCTGTCAATAGAAGCCTTAAGGGTGAACTTCTAGATGCTTCAGAAACAGCAGATTTTGTGGGGGGTTTGGCTTGAGAGGAACAGCAGAAATTTGAGGGGTCGAGAGACCTTGGGAGGAAGTGTGGGAAATATCTAGATTTAAATCCTATCTTGTGAGGCACTTTCTTATAGGCTGGTCCCCATTCATTGAGCTCGTTTTTTCTTGTTGGCCTTTTGTATATTATTTCACATTTTCATTTTGTTTCGATAAAAGCTCCATTTCTTATTAAAAAAAGAAAAAAAGAAAAAAAGAATAAAGTTCTATGCAACTAGTTTCTTCCCTCATAGCTACTAGTTTTTCTGCATTTTGATCTTGTTGTACTTATTTAATTTTTGCATGTATTGTTTTCAAAACTTGAATCCTTTCCAAATATATTTATAAAATAGTAAAGGTTACAAAAAAAGGAAGACAGAGGGAACCTCCTCCCCATTAGGCAAAGGAGTTAAGAGAAGTCCCCAATTGAATTGCAATAACTTTTGCATACTGGTATTGA
TTCATTCAAACTTGTAAGTTTAATTTATTTCCAGGCATGTATTTTACATGTACCATTATCTTGTTAAATTGGTTATCTTCTCATATAAAATTTATTATATATAAGATTTATTGGTGGAGCTCTTTGATCTTTTGTTACTTTCTACAAGAGTATGTTGTCATAGACTCCTAGTAAATAATTGTTAATTGTATTCCTTTATGTTGCATGGTTTTTTTTGTTTGCTTGCGGATCTTAGTTATTTTACCTCCCTTTTATGGAAACACCAGATCTTATGTACAATTGCAATGTAATATTTAACAGACTAATGTTCTCAAACTTGGTCGGGCCTTCTGTTTCCTCCCTTTGCCAGTAAAAACTGGTCTAAATGTTCAAGTAAATGGATTCTTTGAGGTCTCATCAAATCGTCGTGGCATTTGGTATGGGGCTGACATGGACAGGAGTGGGAAGATTCGTTCAATCTGGAACAGGCTCCTTTTAGAGGATATAATTGCTCCTTCTTTTATAGAACTGCTGATTGGCGTGCAAGTGTTTCTTGGTCCAACGGATACTTACTTTTCCTTGTGGCCTAGTGGTTCATTTGAGGAGCCATGGAACATATTAGTTGAGCAAGTGTACAAAAATATCAGCAATGCTCTTGTTTTGTATTCAAACGTTGAAGGTGGAAAATGGGTCTCACCAATTCAAGCTTTTCTTCACGATGATAAATTCACAAGAAGTAAGGAACTTGGTGAGGCTCTGGTGCTTTTAGGAATGCCTATTGTGCATCTTCCCGAAAATCTGTCCAACATGCTTTTGAAGTTCTGCCGTGCTTTCCAACAGAAAGTGGTTACACCCTGTACAGTTCGTCATTTTCTTAGGGAATGTAAACATGTTGCTACACTGAACAGACCTTACAAACTTGTATTGTTAGAATATTGCATTGAAGATTTGATTGATGCTGATGTTTGCGCACAGGCATTTGACTTACCCTTGATTCCATTGGCAAATGGTGATTTTGGGTTATTTTCAGAAGCCTCCAAAGGGATCTCTTATTTTATTTGTGATGAGTTGGAATATACCCTTCTGCACCAGATTTCTGATAGAGTAATAGATCGGAATATACCTCTCAATATATCAACTAGACTTTCAAATATTGCCAGGTCTTCAAAATCAAATATTTTCATTTTTAATGTTCATTATTTTCTTCAGCTGTTTCCAAAATTTGTGCCTGCTGATTGGAAGTATAAAAATGAAGTGTTGTGGGACCCTGAGTCTTGTAGCAACCATCCTACTTCATCATGGTTTTCACTATTTTGGCGGTATCTTCATGATCGGTGTGAAAAGTTGTCATTATTCAGTGACTGGCCCATTTTGCCATCCAAATCAAGGTATCTTTATAGGGCGACTAAAAAATCAAAATTGATTAACGTACAGATGCTTTCTAATGAGATGCAAAAGATACTCAGTAAACTTGGATGCAAGTTGCTCGATCCCTACTATAAGGTTGAACACCGAGACCTAATTCACTATGTAAATGATGGTAACTGTACAGGCGTTTTGGACTCTATATATGATGCGATTTCTTCAACTGGCGGTTTAATGCTGACTTCACTCTACAGTTTAGAGGTGGAAGAGAAGGATGGGCTGCGTAGGTTTCTTCTTGATCCTAAGTGGTATCTTGGAGGCTGTATGAATGATAGTGATTTAGAGAAGTGCAAGAGACTCCCAATTTATAAAGTCTATAATGGAGGATCTGCTCAAGATTTTGGTTTCTCAGATCTTGAAAACCCTCAAAAATACTTACCACCCTCGGATGTTGGGGAGTTCTTTCTTGGAGTAGAGTTTATTGTCAGCTCCTTCGATAGCGAAGTGGAGATTTTGTTAAAATACTATGGGATCAAAAAAATGGGAAAGGCATCTTTCTACAGAAAGCATGTTTTAAACCAAGTTGAGCAGTTGCAGCCCGAACTTCGTGACAGTACCATGCTTTCCGTTTTGAAAAATCTTCCACAATTATGTGT
TGAAGATGTAGCTTTCAGAGAATGTTTGTCAAATTTGGCTTTTGTTCCAACTTCAAGTGGCACTCTTAAGTGCCCGACTGTTTTATATGATCCTCGTTATGAAGAATTGTGTGCTTTACTAGATGACTTCGATAGTTTCCCTTCCACTCCTTTCAATGAATCTTACATTCTGGACATTCTACAAGGTTTGGGCCTTAGAACATGTGTAGCTCCTGAGACCATTGTGCAAAGTGCTCAACATGTAGAAAGATTAATGCATAAGGATCACAACAAGGCTCACTCAAGGGGTAAGGTTCTTCTATCATACTTAGAAGTTAATGCAATTAAGTGGCTTCTCAATCCAATGAATGAAGATCAAGGGATGGTGAACCGATTATTTTCTACAGCTGCAACTGCATTTAGACCTCGCAACTTCAATTCTGATCTTGAGAAGTTCTGGAATGATCTTTGTAAAATTTCCTGGTGCCCAGTATTACTTTCCCCTCCTTTTGAAACCTTACCCTGGCCTGTTGTCTCATCCATGGTAGCTCCCCCAAAACTTGTAAGATTGCCCAAGGACTTGTGGCTTGTTTCAGCTAGCATGCGAATACTGGATGGTGAATGTTCTTCTTCAGCTCTTGCACACAGCCTTGGTTGGTCTTCTCCCCCTGGTGGTAGTATTATCGCTGCTCAACTTCTTGAGCTTGGAAAAAACAATGAGATTGTATATGATCAGGTGCTTAGGAAAGAACTGGCCTTAGCAATGCCAAGAATATATGCATTGCTGACAGGCTTGATAGGTTCAGATGAGATGGATGTTGTAAAAGCTGTGCTTGAAGGTTGTCGGTGGATTTGGGTTGGAGATGGGTTTGCAACATCAGAAGAGGTTGTCCTTGATGGTCCTCTTCACTTAGCCCCTTATATTCGTGTTATACCCATTGATTTGGCAGTTTTTAAAGATTTATTTTTAGAACTTGGAATTCGGGAATTTCTGAAGCCCAATGATTATGCTGATATTTTGTCTAGAATGGCAATAAAAAAAGGTTCCTCTCCTCTCAACGCACAGGAAGTGAGGGCAGCCATTCTCATTGTACAACATCTGGCTGAAGCTCAACTTCCAAAGCAACAGATCAATATATATTTACCCGATATTTCGGGTAGGCTATTGCCGGCCAGCAACTTAGTTTACAATGATGCTCCCTGGTTACTAGGGACAGATGATACCGATGTTCCATTTGATGGGGAATCAACTGTTGTCTTGAATGCTAGGAAAACAGTTCAAAAATTTGTCCATGGGAACATATCCAATGATGTTGCAGAGAAACTTGGTGTCTGCTCACTTCGTAGGATTTTATTAGCTGAGAGTGCTGATTCTATGAATTTGAGTTTATCTGGAGCTGCTGAAGCTTTTGGGCAGCATGAAGCCTTGACAAACAGACTTAGACATATACTTGAAATGTATGCAGATGGTCCTGGGATTCTATTTGAGTTGATTCAAAATGCAGAGGATGCAGGCGCCTCTGAGGTGGTATTTCTTTTGGACAAAACTCATTATGGAACTTCTTCCATTTTATCACCTGAAATGGCTGATTGGCAAGGACCAGCATTGTACTGTTATAATGACTCTGTTTTTAGTTCTCAAGATCTTTATGCAATTTCACGTGTTGGCCAAGAAAGTAAGCTGCAAAAACCATTATCAATTGGAAGATTTGGTTTGGGTTTTAATTGTGTGTATCATTTTACGGATATCCCCACGTTTGTTTCTGGGGAGAACATTGTTATGTTTGATCCTCATGCCTGTAATTTGCCTGGGATTTCTCCTTCTCATCCGGGTCTCCGAATAAAGTATGCTGGAAGAAGAATTTTGGAGCAATTTCCTGATCAATTTTCTCCATACTTGCATTTTGGTTGTGACATGCAGAAGCCATTTCCTGGTACGCTATTTCGTTTCCCCCTTAGGAGTCCGGCTCTTGCTTCTCGAAGTGAAATTAAGAAAGAAGGTTATGCACCTGAAGATGTTA
CTTCACTTTTTTATTCTTTTTCCGAAGTTGCTTCTGATGCTTTGCTTTTCCTCACCAATGTCAAAAAGATCTCAATATTTATAAAGGATGATATAGAACATGACATGCAATGCCTATATCGTGTGCATAAGAATACAGTTAGTGAACCTTCTACTGAATCCAGTGCAAAGCAGGATATTATTAGCTTTATATATGGAAACCGCCAAGGTGAAATGGATAGAGAACAGTTCCTAATGAAATTGAGTAAATCCATCAACAGAGATCTCCCATATAAATGTCAGAAACTTATTATCACGGAGAAAAGTTCAAGTGGTGATATATTACAGCACTATTGGATAACCTCTGGATGTTTAGGTGGTGGGCTCCCGAGAAACAACTCAGGCTTGGGTGACAAGTCCTATAATTTCATTCCTTGGGCTTGTGTTGCTGCACTTCTACATTCTGTACAGGTAGATGGGGAAATGAACTATGACCCTGAGACTGAAAATAATTGGCTAGTTGCTTCTGATTTAGTTCAAGTTTCTTCTGCTTCTATAGAAGGCAGGAAACCTTTTGAAGGACGTGCTTTCTGCTTTCTACCGTTACCTGTCAGAACTGGTCTCCCTGTGCATGTCAATGCGTATTTTGAGCTTTCGTCAAATCGAAGGGACATATGGTATGGTGATGACATGGCAGGGGGCGGAAAAAAACGTTCAGAATGGAATTCTTATCTCCTTGAAGATGTTGTTGCCCCTGCTTATGGTCGCTTGCTTGAAAAAATTGTGTCAGAGATTGGTCACTCTGGTTTATTCTCCTCATTTTGGCCGACAACAGCAGGATTAGAACCTTGGGGTTCAGTAGTTCGAAAACTCTATAGCTTTATTGGTGATTTTGGTCTTCTTGTTCTGTATACAAATGCTCGAGGAGGTCAGTGGATTTCCACAAAACAAGCTATTTTTCCTGATTTTTCTTTTGACAAAGTATATGAACTTATTGAAGCATTAGCTGATTCCGGCCTGCCAGTCATCGCTATTTCAAAGTCAATTGTTGACAGATTCATGGAGGTACGTCCCTCATTACATTTCCTGACTCCCCATTTGTTAAGAACTCTGTTGATTAAAAGGAAGCGTGCGTTTAAAGACAGGAAAGCAACCATCTTGACCCTTGAATATTGTTTAGTCGATTTGAAACTACCTTTTCAATCCGACAGTCTGTGTGGATTGCCTTTACTACCACTTGCTGATGGTTCATTTACCTCATTTCACAAGAATGGGATGGGAGAAAGAACTTACATTGCAAGGGGAGATGAATATGGCCTTCTCAAGGATTCAGTTCCCGGCCAACTTGTGGATCCTGGAATACCAGAAGTAGTTCATGCAAAGCTCTGTGAGGTAGCCCAAACTGAGGATTTAAATATTTGTTTTCTTTCGTGCCAGTTGCTCGAGAAACTTTTCCTGAGATTTCTTCCGGCAGAGTGGCAAAATGCTAGACAGGTGAACTGGAATCCTGGTCATCAGGGTCAGCCTAGCTTGGAATGGATAAGATTGATTTGGTGCTACCTCAAGTCACATTGTGATGATCTATCTCAGTTTTCTAAGTGGCCTATACTGCCCGTTGGGCAGAATTCTCTCCTGCAACTTGTTGAGAATTCAAATGTTCTTAAAGCTGATGGTTGGAGTGAGAACATGTTTTCTTTGTTGCTGAAAGTTGGGTGCTTGTTTTTGAGGCGTGATATGCCTATAGAACATCCACAACTGGAAAACTATGTGCATCCTTCAACAGCAATCGGTATTCTAAATGCTTTTCTGTCTATTGCTGGTGATATTGAGAATGTTGAAGGGCTATTTCGTGATGCATCTGAAAGTGAACTGCATGAGCTCCGGAGCTTTATTCTTCAGTCAAAATGGTATCTGGAAGGAAAAATGGAAGCCATTCATGTTGATATTATTAAATGCATCCCCATGTTTGAGTCGTATAAATGTCGAAAATTAGTAAGTTTGAGTAAACCTATAAGATGGATT
AAACCCACTGGTTTATGTGAGGATTTTTTGAATGATGATTTTGTGCGCATGGAATCAGAAAAAGAGAGAATCATTTTGAAAAGATACTTTGGGATCGGAGAACCATCAAGAGTAGAATTCTACAAAGATTATGTTCTTAATCATATGTCGGAATTCCTTTCAGAAAGGGGAGCTCTTTCAACCATTCTGCATGATGTGAAGCTTTTAATTGAAGAGGATGTCTCCCTCAAATCTTCAGTTTCTATGATACCATTTGTGTTGGCTAGCAATGGATCCTGGCAACCACCATCAAGGTAATATCAAATTTTAGCTTCCCTCATAAGTTATTTCAATTTCTCTTACATAAAAAATTAAATTATTGCTATATGATAAAGTAAAGTGCTACAATCCATGTGGATGTTATCTAATAATTGAACTGTGTCATAGTTACAAGATTTGTAGGACTTTCCATGTATTTTCCTTTTACGTACAAGATTTGTTGGTATTTACAGTCCGCATGAAAATGTGGAGGTAATCTCTGAAAATAAAAGAAGGAAGGAATCAAGATATTATTCCTGCCCTCAAAGAGTGCAAAAACAAGATGCCATAAATCCTTCCTCCCACCAGAAGAAAAAAAGAATCTAGATTGTTGTTATTATCTTTTTATTATTTTTGATATCCGGGAGTATCCAAGTCAGCTTACGCGCACCTCTATTAATCTCATTGGGCAACCGCCTGGTCCTACAACATTTGGTGTCGGAAAACACATAGGAATTATTAAATCCTAAGGTAGGTGGCCATCATGAGGTTTGAACCCATTCCCTCTTAATTAACCCTAGCCCCTTTGTTTGTTGTTGTTATTATTATTATTATTTTTTTTCAATAGGACCATAAGTATGCCAGATCTAGAAGCTAAAAGTTACTGTATTTGATTTTTACCAGGCTGTATGATCCTCGGGTACTTGAGTTAAATAATATGCTGCATGAAGAAACTTTTTTTCCATCTGAAAATTTTTCAGATGATGACATTTTAGATGCTTTAGTCAGCCTTGGACTTAGAAGATCCCTTGATTTGACTGGTTTGCTAGATTGTGCTAGATCAGTTTCATTGTTGAATGATTCTAAGAATTCTGAATCACAGAGTTACGCTAGGAGATTGTTTGTGTGTTTAGATGCTCTTGCACACAAGCTCTCAATCAAAGTGGAAGGAAGTGGTTATGAACTACAGAATTCTATGCTCATTAAGAGCAATTATGTTGATGATGATGCTTCTATGGAAGTTGGCTCTCTCAATATAGAGGATACCTCTGATATGGGCACTGATTCCTTAATAGGCAACCTGACTGGTGATGAATCAGAGGAAGAATTTTGGTCTGAAATGAATACTATTGCTTGGTGCCCCATTTGTGCCGATTCACCTTTAAAAGTACTTCCATGGTTGAAAACCAATAATCAGGTAGCTCCACCGAGCATTGTGAGACCTAAATCACAGATGTGGATGGTCTCCTCTTCAATGCATATTCTAGATGGTGTGCCTCCTTCAGAATACTTGCAACATAAACTTGGTTGGACTGATTGCCCGAGGGTTGAAGTTTTATGTGCACAGTTGACGGACATATCCAAGCTTTATGGTGAGCTTAGGTTGCATTCTTCACTAGAACCTGATATCAACACTGCATTGCAAGAAGGGATTCCCATTCTTTACTCAAAACTGCAAGAATATATAGGAACTGATGAGTCCGTGCTGTTAAAATCTGCTTTAAATGGTGTATCCTGGGTGTGGGTGGGGGATGATTTTGTACCCCCAAGCGCTCTTGCCTTTGACTCGCCAGTAAAATTCTCTCCTTATCTTTATGTCGTTCCATCCGAATTATCAGAATTTAGAGATCTGCTTTCAGAATTAGGTGTTAGGCTTAGTTTTAATGTTGAGGGCTACTTGGATGTTCTTCAACGCTTACACAGTGATGTTGAAGGGTCCCCTCTATCCACAGATCAGATGGATTTTGTGATCTGTATGCTTG
AAGCTATTTCAGACTGTTGTGTGGACAAGCCAGAATTCACTGCTACCAGTACTTCTCTTTTAATTCCCAACTCTTCTCAGGTTCTGATGCAAGCAAATGATCTTGTTTACAATGATGCACCATGGATGGAAGACAACAATATTCTTGTTGGGAAACACTTTGTGCATCCAAGTATCAGCAATGATTTGGCAAGCAGGTTGGGCGTGCAATCCATTCGTTGTCTCTCATTAGTTGATGAAGAGATGACTAAAGATCTACCATGCATGGACTATGCTAAAATCAGTGAGCTTCTGATGTTGTATGGCAATGACTATTTGTTCTTTGACCTTCTGGAATTGGCAGACTGCTGCAGAGCTAAAAAGCTGCGCCTAATATTTGACAAAAGAGAACATCCTCGCCAATCATTACTGCAACATAATCTAGGTAACTGGTATACATTGTACTATAATGTCTTAACTGACCTTAAAACTTCACTACTATGCTTCCATGAAATGAACAAGTGAAAGAACTTAAAAAATGAATGTGCTGTAGAGTATATCCTTCGTCTTTTCTGATATTGAAATATTGCGTTTTTATCTGGTACTCTGCTTCAACCTAGGGGAATTTACCCATGCTGGTTTTCATTAATTTAGGTTTCATTTTCTTCTGCATTTTTTTGTGTGATGAATTGTGTATTGTAGTTGTTTTATTACTTTGTTACCTGCATAGGTGAATTTCAAGGTCCTGCACTGGTTGCCATCTTTGAAGGCTCTAGCTTAAATACAGAGGAAATCAGTAGCCTGCAATTTCGTCCTCCTTGGAAATTAAGGGGCGATACTCTTAACTATGGTTTGGGACTGCTCAGCTGTTATTATGTTTGTGATCTCCTTTCAATTGTTTCTGGTGGCTACTTTTATATATTTGATCCCCGTGGAATAGCTCTTTCTGTAGCTCCCAAATCTGCCCCAGGGGCAAAAGTGTTTTCTCTGATAGGTATCTCTTTGACTACAGATGAATTTTTTTATGCTCTGCTCATGTAGATATTTTATTTCAACCATTTCTTTAATCTCAGACGATATCTTCAGCATGTTTACAGCATAATGTGCATGTTGAGAGAATGATGCTCAACCATTTATTTCAATCTCGTACAGGTTAACAGATACAGGAAAAAAGTGCATGTTGAGAGAATGATGCATCCTTTTTTTCTAAAAAAAGAAATTATTTATTAAACCTTATTGTAAAATAAAATGCAGAAAGTAGATGCATGTTTTTTGTGAACAAGAAACATCAAAGAAAAGAGAAAAAAGAAACGAGACAAAGAAAAGAAAATTTCAACAAAGAGCATTACAGCCAAAGATAAGAAAGTAGATACTTATATGCATGTTTAACAACAAAAAAATTCTCTGAAGTGCAGAAGCATGTGTTAATCAATAATATATATTTTTTCTTTTTTGACAAAGAGAAGAAACTTTTCATTGATGTTGTAAATAGAGAGATTTTGCGATCAATTATTAAGCAATGAGCAAACTCTTCATTGATGTTTCATTCATTAATCAATAATCTGTCTTCTCCTCTAGGTAGTAATTTGATAGAGAAATTTAACGATCAATTTCATCCTATGTTGGGGGGTCAAAATATGTCATGGCCATCAGATTCCACCATAGTCCGCATGCCCCTATCTCCAGCATGCTTAAAAGATGGACTTGAGCCTGGAATAAGAAAGATAAAGGAGATAAGTAGCAAATTTTTGGATCATGCTTCAAGGTCTCTTTTATTCTTAAAATCTGTCGTGCAGGTACGCTACCATCATGAACATTATTTTACCATGCAGCATATGGACATATATATTTTGAGATATCTTTACACTACAGTAGCCTATATCTGAAACGCTGTGGTGCTAAGTTATCAATTAAAGATAAATTGTAGAATGTTTCTGATTTTAAGCATCTGTAGTTATGCCTGAGCATTCTTCTGTTTTATGTACATATTTGTATATTTATTTTGACCAGCCTTTTAACACA
TGCCATGCCAGTAATATTTCAATTTTATTTTATCATATACTAAAAGTAATTTTCTTTAAAATATGGAGTACTATCATATACTATGATCATTTTACGAGGAAATAAATGATCTTTTAAGTAGTATGTGATATTTTTATAACTATCATCCTTAAGTTTAACCATAGTTATTAAAATAAATAGCATCTATTTTACTAAATCAAATTCAAGTGGTAGGTTCACGTGTCCAAGTATGTTAATAAAACAAAAGTTTAATGATTCATAACCTTTTTGTTAACAAAATTTGACAATTTTTTTACGGAAGTAGATGAGAATTTTTAATATACACACTTTTAGATGTATTATAGATATAGATATATATATATTTTTTTAATATTTAACTGACATCTAAATGAAAGATTAGCTTCATGACTGATTTACAGTAGTGATGACTTGTGATTTTTCAAGGCAATGTGCCTGTACATGTCTTGATTCAGAAACTTCTATAATTTTAATGAAAGGTCACTGGTCATATTCAAGAGTAACCTTTCACAAATGATCAGGTTTCATTCTCGACCTGGGACCAAGGAGGGCTTCACCCATACCAAGATTATTCAGTTTGCATCAATTTATCATCTGCTATAGCAAGGAACCCATTTTCAGAAAAGAAGTGGAAGAAGTTCCAATTATCAAGGTTATTTAGCAGCTCAAATGCTGCCACAAAATTGCATACAATAGATATAATTGTATTCCAGGGAGAAACTCAATTCGTTGACCGGTGGCTTGTGGTGCTTAGCTTGGGTTCTGGGCAAACTAGAAATATGGCTCTTGATAGGTAAGTTTGCTAACTTTGTGCAGCACATTGCAAAGTGGTTTTTGTGGACAGTATAGAAGTTCAAGCATTATATTATGAACCTTGTGAATGGTGTGTTTATTAGAATGATTGCCAAACGCAGGCATTCAAGAGGCATGTTCTCTTTTTAAGTTTTCCACCAATCTCTCCTCTTTTCTCCTTATCTAAGCCATTAAATTGGTCATTTGATCAGTTGGAAAGGACATATTCCATTGGAGTTTTTTTTAAATCATCGGAAGAAGAATCTTTAGAACAAAACTCTTGACTTCATCATTTTTAACTCAAGGACGAGTAATTTTGGGGTGGAGGAAATCTGATGTAATTTAATTAAACTGGTTGTTTATGATTCAGTTGGCATAAACCAGGCAATAGGTTAGTTAGGATTCCGTTACCATAACCTAGCTGTTAGGTCAGTTTGGATTTGATTGGCATAAATTAGTATAAGTACTAGTGTTCCAGTGGTAAACATTTCTTTAATAAAGTTGCTTTCCTTGGTTTACACCATTCTCACTCAGTGTCTACATTTTCTACCCTTCCATTTTTCCAAAATAACTTTTTAATCTTTTTTGGAATTAGAAGAGAGTGAAATAGCATCTTATGGAGTTTCCTATTTACAGGCGATATCTTGCATACAACTTGACACCTGTTGCTGGAGTTGCAGCTCATATATCTCGCAATGGCCTTCCTGCTGATATATGTCAAAAGAGCCCTTTGATGGCTCCCTGTCCTTTATCTGGTGATATAACATTACCTGTCACCGTTCTAGGATGTTTCCTTGTTTGTCACAGTGGTGGCCGTTATCTATTCAAGAATCAAGTTCTCGAGGCTGTAGCTGCACCCCTTGATGCTGGAAATAAGTTAGTCGAAGCCTGGAACAGGGAATTGATGTCCTGCGTATGTGATTCTTACATTTACATGATATTGGAAATTCATAAACAACGAAAAGAATCTTCAAGTTCTGCGTTGGAGTCAAATGTGAGTCATTCCATAAGTTCATCTTTGAAGGCATATGGAAATCAAGTTTATTCGTTTTGGCCTAGGTCTGAACCTGCAAATGGCAGTGATTCTGATCTGGACAGAGGGTTGAAAGCAGATTGGGAATGTCTGGTTGAACAAGTAATCAGGCCATTTTATACTCGTGCTATTGATCTCCCTGTGTGGCAGCTTTACT
CTGGAAATTTAGTTAAGGCTGAAGAGGGTATGTTTCTTGCACAACCTGGGAGTCCCGTGGGTGGTAACTTGCTGCCAGCAACAGTTTGTGGTTTTGTGAAGGAGCATTATCCTGTGTTTTCGGTGCCATGGGAGTTGATTAAGGAGATTCAAGCTGTGGGAATTACAGTACGCCAAATTAGACCTAAAATGGTTCGGGATCTCCTCAGGGTTTCTTCAGCATCTATAGTTCTTCAATCAATTGATACATATTTGGACGTTCTTGAATACTGCTTGTCGGATATTCTGTTGGCTGCATTATCTAATCATGCCGAGGATAGTATGGGAGCTGACTCTGTTAACACTAATCCTGGTGGTAGATCAACTAATACTTCAGAAGGCAGCTCAACTTCTGTTTCCGTCTCTAGCATGAATAGTTTTGCCAGGTTATCTAACCAGAATGCAGCCAGTTCAGGTGATGCTCTTGAAATGATGACAAGTCTGGGCAGGGCTTTATTAGATTTTGGTCGAGGAGTTGTTGAAGATATTGGTAGAAGTGGGGATTCCTTGTCTCATAGTAATACATTTACTGGCAGAAATAACAGCAGCTACAGAAATGTGGACCAGAATTTTCTACAAATGGTATCCGAGATCAAAGGCTTACCATTTCCAAGTGCATCCAACAATTTAGTAAGGTTGGGGAGCATGGAACTTTGGCTTGGTAGTAAAGATCAACAGGAACTGATGATCCCCTTGGCAGCAAGGTTTGTCCACCCTAAAGTATTTGATAGATCAATTTTAGGCAATATCTTGACCAATGATGCCCTGCATAAATTTTTGAAACTACAAAAGTTCTCTCTTAGCTTACTGGCGACCAATATGAGGTCAGTGTTCCATGCGAACTGGGTGAATCATGTAATGAACTCAAATATGTCTCCGTGGTTTTCATGGGAGAATAAATCATGCTCTGGGGTTGAGGAGGGACCATCCTCTGAATGGATAAGACTCTTTTGGAAGAATACGGGTGATTCATCACAGGACCTTTTGCTCTTTTCTGATTGGCCACTTGTTCCTGCTTTTCTTGGTAGACCAATCTTATGTCGTGTGAGAGAGCGCCATCTTGTCTTTCTCCCTCCTGTCACATACCCCGTTTTACCAAATGCTATTTTAGAGATTGGTGCAGGGGGCAGTGATGTGGCGGAGACATCTACGAGTGTGATTTCTAAACCTGAATCCATTCAGCCTTACACTTCAGCTTTTCAAAAATTCCAGGACACGTATCCTTGGCTATTCCCTCTTTTAAACCACTGCAACATTCCAATATTTGATGTAGCTTTCATGGACTGTGCATCCCTATGCAACTGTCTCTCTAATTCTGGCCAATCATTAGGACAAATAATTGCCTCTATGTTTGTGGCGGCTAATAATGCAGGTTACTTTCCGGAACTTGCATCACTTTCAGATTCAAATAGTAATGAGCTCCTCAACCTTTTTGCCAATGACTTTGTTTCAAATGGAACTAACTATGGGCGAGATGAGCTTGAAATATTACGGAAGTTACCCATATATAGAACTGTTGTCGGATCATATACACAATTGCGCGACAATGATCAATGTATGATCTCTTCAAATTCATTCCTTAAACCATATAATGATTGTTGTCTATCTTATTCATCAAATTCAATGGAATATTCATTACTTAGAGCCCTCAGGGTCCCTGAATTGGACAATCAACAAATTTTGATTAGGTTTGGGTTGCCTGCATTTGATTGTAAACCTCAGTCGGAACAGGAAGATATCTTAATATATCTATTTACAAATTGGCAAGATCTTCAAGCCGATGCTCATTTAGTTGAATGCTTGAGCGAGACTAATTTTGTGAGGAGTGCTGATGAGTTTTGCACGGATTTGTTTAAATCAAAGGAATTGTATGATCCAAGTGATGCTTTGTTAACATCCGTCTTCTCTGGTGAAAGGAAAAAATTTCCTGGAGAAAGGTTTGCTGCTGATGGTTGGCTTCGA
ATTTTAAGGAAAATTGGCCTCAGAACCACAACAGAAGCCAATGTCATTCTTGAATGTGCCAAAAAAGTAGAGACTCTAGGAAGTGAATGGAGGAAGTCGGAGGAGGATGGTTCTGAGTTTGACTTGATAAATGGTCAAAATGAAGTGCCTATGGAAGTATGGACTTTAGCTGGATCTGTTGTTGAAGCTGTTTTTTCAAATTTTGCTGTCTTTTATAGCAACAATTTTTGTAATGCTCTTGGCAATATTGCTTTTGTTCCGGCTGATTTAGGCTTTCCAAATCTTGGTGGCAATAAAGGTGGCAAAAGAGTTCTCACTTCATACGGTGATGGAATTGTATCAAAAGATTGGCCTCTGGCTTGGAGTTGTGCTCCAATTCTTTCCAAGCACAGTGTTATACCTCCTGACTACTCTTGGGGAGCACTTAATTTGAGAAGCCCTCCAGCCTTCCCCACAGTACTAAAACATTTACAGGTAGCCACACTACGCCGGTCTTAGTTATTATTTTAGTGACAGTGGCCTATTCAGAATATAGCTAATGTACTTTTGACACAGGTTATTGGAAGGAATGGTGGTGAAGACACTCTTGCTCATTGGCCAATATCCATGGGCGTAATGTCAATTAATGAAGCTTCTTGTGAGGTTTTAAAGTATCTTGAAAGGATTTGGAGTACCTTATCTTCTTTGGGTAAATATTTGCCTCATTCTCAAGTAGGACCCCACCCCCTCCCCCGTAGCACTTTGATGTGGTGTGTCAAGCAGTTGACATTTGTTTGTGTCTAATTTGAAATAAATTGATAGCCTGTGAAATAAGTATTTTTGATTTCATAAATTGTGAAATGGGTAAAGTTATAATGTGAGTGGCTGCAAGTACACATATGTGTCCGAATCATATACTAATTAAATGTAATCATATGATGATATAATACACTAATGGTTGGGGGAAATATCATTTGGACTTTTTTGAAATACTTTTTTGTGCAATGCATGTGAGAAGACTACGTAAGAACATAGTCAAGCATATTCAGGTGGAGGAAAAATAAGAGAAAATGGAGGACTAAAACAAACTCTGGTATTTGTCCCTCGCAATAAATGATATCGGCAGGGCAATGCATGTAGAAAAATGGAATTAAGAATTCATATAGCTGCATGTTATGGTTCTTAATTTATGAATGTAGACGATATGATTGCTACAACTTTAAATGCTACTACAATTTTATTAATTGTAGTCTTCAAGAGATGCAAGTAGTTGTCGTGGATTGGAATGTACTTTCTTGAGCCAGATTTTTGTATTTTGATTAGTTGTAGCCTTGGGAGCAAGAAAAATTCATAAACTTACATGATCATATTCCTATGCCACGTTTTTACATGGTATGAAAACCATATTTTTTTAACAAAATACGAACTTTTCATTGCGTAATGAAAAGAAACAAAATTGTTCAAGGATAAAACTCCTTGTTCAAGGATAAAAACTCCCTAAGGAGTGAAAAGAGAAAAGAGAGTAGAACTAAAATAAGAGAATCAAGGAGAGTTATAAAGTCTCTCAATTCATACAAATCATACTAGGAGAATATGAAGAAAAGGAATCGGAAACCTGTATTGTTTTCATGAGTTGGGTTGGTCTCATGCTGTTATTTGAAAAGTACAATCACGGGTAGTTATTAACTTTTTCGTGAGTTACATGGCCATTATTTTTCTCAAGGCAAACTTGGTTGTTTTGGCACATTGGCATAAAAAAAAGGTTTTTAACTGATCTATTACCTGCTTAATTATTATTTTTTTCTTATAAGATCTGAAATTGTAATGAAAACAAACATATACCTAAATAAACTAGGAAAGACGGATATATCTCTAACCCAAAAAACCAATGGAGAATATAAAAAGATTCTCCATGTGTACCAAGTCGAAAAGGTTCACTAGTTTCTGGACAGTTATGAGGGAGGTTTATGGGAGATCATATATGTATATTTAAATTTTTAATATCACCGGTCTTAATTATA
ATTATAATTATAATTATAATTTATTATTTATTATTATTTTTGTATGTTTTAATATCGGTTAGATGAATTGTCAAAATATATATATATATCCGTTAGATGAGAGTTAGTTTCCTTAGGATCTTATCTTGTTCACGTACCTTGTTTGTTTAGTGTATCTGTGAAAATCTATTGTATGGCATACTTCGTGAGTAGTTTCTTAAGAGAGTTTTTTTCTTTTTTGAAACTTAATTTCTGTATAAGCCTTGGAAAGCTTTTTAGTCAAAGCAGGGAAAGATTCTGGATTTCGGGGAAACAAAAATTGTCAGCCCCTAACTTTTATATGTATGCCTTCAGCGCATGTTTTTCTATGGGCACTTTCAGTGTTGGTTTTATGAGACTTGTTATTTGAGTGTAGTAAATGATCTGTTCATCCATGCAGATGTTTTGGAATTGCAGAGAGTGGCATTCATTCCTGTAGCTAATGCAACACGCTTGGTCAAAGCTAATGCTCTATTTGCTAGATTGACTATTAATTTATCTCCTTTTGCATTTGAACTTCCAAGTGGATATCTTCCATTCGTGAAGATCCTCAAAGATTTGGGGCTTCAGGACGTACTGTCAGTTGCTTCTGCAAAGGATCTTCTATCAAGTCTTCAAGTAGCTTGTGGATATCAACGCCTAAATCCTAATGAACTTCGATCTGTAATGGAAATCTTACATTATATCTGTGATGAGGCTATGGAAGCAAAGATGTTTGATGGTCGGGAACCTGAAATTATAGTCCCAGATGATGGCTGCAGGCTTGTTCATGCAACATCCTGTGCATATATTGATACTTATGGTTCCCGATATATAAAATGCATCGACACTTCAAGGCTGAGATTTGTTCACTCAGATCTTCCTGAGAGGATTTGTAGAATGTTGGGCATTAAGAAACTATCCGATTTAGTTATTGAGGTTAACATGCCATCTTGATGTTTCAATTCTTGTGATTCCTGTCTTTTTCCCCTTTTGGTTTTGTCAGTTGACAAATACTATTTCTTACTTATTAACCAACTTGTGTGGCAGGAGTTGGATCATGAAGATAGTATAGAACCCTTGGAACGTATTGGAGCAGTGTCTCTAGAATTCATCAGAAAGAAGTTATTGAGCAGGTCGTTTCAGAATGCTGTGTGGAATGTTGTCAATAGTATGGTTAATTACATTCATGCAAATAAAAATCTAGATCTGAAAGCTGTAGAGAAATTACTAAAATCTATTGCAGAGAGGCTTCAGTTTGTTAAATCTCTCCATACTCGGTTTTTACTTCTTCCAAATTCTATAGACATCACACGTCCTGCTAAAGATTCCATTATTCCAGAATGGAAGGACGGAATCCATCATAGGGCTCTTTACTTTGTTAATCACTCAAAAACCTGTATTTTAGTTGCTGAGCCCCCTGCTTATATATCAATCTTTGATGTCATTGCCATTGTTGTGAGCCAGATTTTAGGATCACCTATTCCCTTGCCTGTTGGCTCTTTGCTTTTCTGCCCTGAAGGTACTGAAATTGCCATTATCAATATATTAAAACTTTGTTCTGAGAAGGAGAACGAACAATTTACTGGAATTAGTAGTTTGCTTGGAAAAGAGATACTACCCCAAGATGCTCTTCAATTACAGCTTCACCCATTAAGACCGTTTTATGCGGCAGAAGTAGTGGCTTGGCGGTCTCAAAGTGGAGAAAAGCTGAAATATGGTAGGGTCCCAGAGGATGTTAGACCATCAGCTGGCCAAGCACTCTACAAATTCAGGGTCGAAACAGCACCAGGCATCACTCAGTCTCTTATTTCTTCACAAGTTTTATCATTCAGAAGCATTTCCATTGATGGTAGCCATTCCTCTACAAACTTGCAAGATAGTGGTCACATGATAATTGATAGTGGTGCTTCCGTCGAAATGCCAGAGAACTCTGAAAGAGGCAAAATACGATCCCAGGTTTGGGTCTTTATCCTTTGGATTTCAAATATTATGCTATATAT
CCCCTTTTTTTTTTATTCTTGATTTGGGGAACTTGGATGCGGCATTCATTTAGGGACTGGGGAAAAATAAGTGTCACGTTGTGACCTGTTTGCTTTCTGCATCTGCAAAATAAACACTTCGATTGTGTAACAAAGATAATTGACCCTGTCATCGTGTGTATGCGCATTTTTGTGGAATATTGAGGAAGTTTATTAAATATGATTATGATTCCACTAGATGTAGGTAATAAGTGGACCAGAGTCCAACCTACTAAATAAATGACATAAATGTGAAAACCTAATGAGGTCATTTATTAAATATTGCTACTTGATCCAACTCAACTACCAGAGTCTATGAATAAATACTTGAAATTGTATCAAGGCTTCCTTCATGCTTCATGAAAAATCATTGTTATTTCTTCATAAACTTTCCATTTCCTTTATTGGCTGCCGCAGCCTGTTGCAGAGCTCCAATATGGCAGAGTATCTGCTGAAGAATTGGTGCAGGCAGTTCATGAAATGCTTTCTACTGCTGGAATCAATGTGGATATTGAGCGACAATCCCTCTTGCAAAAGACCGTAGTCCTGCAAGAACAACTGAAGGATTCGCAGGCAGCTCTTCTGTTAGAACAGGTGTGCATTTGTTGAATTCTTTGTGTTTGGATTTGTGTGCTCTGTATTGTGTAAAACTATCAAGTTCTACAAACAACTCCGATATCGAAACGATGCAACCTATCATCTAAGGGATAGAAATCCAAGAATACTCAACAAGAGTCTTTATGATTATCCAGAAGTATTTCAACTCTTTTCCGTAACTCTTTTGAACCCACACCCACTTGGTTGCATCTGTAACTGTAGAATCCGTTTGTGCAGCAACAGTTTTTGCAACTTCGTCAAAAATTCCGAAATCGAGATTCATGGTCTGATTAGACTACCAATTTTGGAACATAGGATTGTAGTTATAGGAGGTGTAGATTACCAATGTCTTAACCATTAAGCTATGCTCGCTCTTGGTAGATTGATATTGATATTAAAATGAGGAAAAAATTATGATACTAGAGAAGAAATTAGGGATCCATCCCATTAACTCCCTCCTCCCTTTCTCCTAAGCAAAGTACCTTCACTCACAATCCTAGCTATTATGTTCTGGTTTGATATTGCTATTTAAGTTTTTTTTCTTTTTTAAAACTCAGTGACCTTCCAGTTACAAGTCACAAACTTTCTAAATATCTCAAGAGTCATTCATGGTTTCTGGATCTATTTTACAGGAAAGATCTGATGCGGCAGCCAAAGAAGCCGATACAGCGAAAGCAGCTTGGCTTTGTCGGGTTTGTTTGACTTCCGAGGTAGAAATTACCATAGTTCCCTGTGGCCATGTTCTGTGTCGAAAATGTTCTGCTGCTGTTTCAAAGTGTCCATTTTGTCGACTTAAAGTCTCAAAAATCATGAGAATTTTTCGACCATGATCTCTTCTGCTTCAAGCCACGAGAACACAACATGGCAAAATGGCAATTTTGACCTCGGATATCACTTGTCTCAACTACAAGGACAACATGGTTTTGATTTTGCACAGAATTATTGGTTATATTTGTACAGCAGAGTCGGAAATGTAAATGAAAAGTCAGTTTAACGTTAATTTCCCTTTCTCTTTAAACTTATTCTCATTTATTGATGTACATAAAGTATATTGGCAATGCAAGTAAATATTTGATATGTTCAATAAGAATTATTGTTTATCGGGAAAACTCATTGGATGTATTTAAAGTGGATTGTCCACAAGTAAATATTTTTATATGTTCAGGTTGGATATCATGTGAGAGAGGTGAATCAACATTCTTATAAATTAATTCGAGTTCAAGCACTGATAGGAGATAAAAGATAAAAGAGTCGTGAGTTTGAATCTCCATCTTCAGATCTTGCATAAAATTTTTTTTTATACAATTACATACAAAAATGGTAAAATATAAGAAATATTTATAAAATATCACTGTATGAACATAATATTAACATTAATCCTTTTG
AATCTATTTACAAAGAGGATATTGTTTATTTGTTAATTAATATATTTGTTATTAATCTTTTATTTTTATAATTTTTTCAAAAACATCCATCAATATTAATATGTTTTTGATATTTTAATGACCATTCACGTAAAATTGATATATCCATATTTTCATTAAAAAATTCTTTTTAGTATAAAAAATGCTGATAGTACGAAGAATAAATGGAGATGCAGGGTATCGATCCCCGTACCTCTCATGCTAAGCGAGCGCTCTACCATCTGAGCTACATCCCCAGGCGGGATTCAAATGGAAATTAATTTAATTTAATATTTTATACTTTTCATAGTCATATCCGATTGCATTTTTGTAGAAGTAGACGCTTCAAAATCACCCGCGAAGCCGTTGAGAATTGCGTTGTGGATAATGTGAGAGATAAGAGGTCGGAATCGGAACCTCCAACAATTCCTCTGCTCCTTTCCTTCTCTTCAATGGCGTCTCTCCAACAACTCCTCAAACCTTCCTGGACTTCTCTCTTCGCCCCTTCCCTCTCTCAGCCTCAGCCTCAGCCTCAGCCTCGAACTTTTACTACTTCCACTCCTAGAGCCTCTCTCCAGAATTCTTCCATCAATCGCCGGCAGTTCGTAGCTGAGACGGCAGCAGCGGTTTCTCTGTCGCTTTCTCCGCTTATTGCTCCCGTGCAACCGGCGAAGTCCGAAGAAGCTCTCTCGGAGTGGGAGAGGCTTTTCCTTCCTATAGATCCGGGTGTTGTCCTTCTCGACATTGCTTTTGTGCCCGATGATTTGGACCATGGTTAGTTTCGAAAGTTTTGTTTCCCTTTTGCTCCAGATTATCAGAGAATTTTGGGTTTGTTTTTGTTTTTCTGTCACGGATTTTTCGATTTGCGATTAGTAGAGAGCGCTTTGAGAGCTCAATAGTCCCTCGCATGATTAAATTTCGGAAACAACTGAAGTTTGTGAGGATTTGTGAACAGATATTAGGCGGTTTAATTGTTAGTTACATTGTCGTCTGTTAAGTCTCGAACTGCATTTTAAATACTTACTTAGTGTACTTGGCTAGTGTGTGCAGTATTAATATCATTAATGTGTGTGGAATGTTTTTCCCTCTGTCGATTTTAATAACAACTTCAATTTTTAGGTGCTTAGAGAGTTAATTTTTAACCATTTTAATCTCCTCGTGTTCTGTTTCGACATAGATTTTTGACTTTCCGAGGATGTAATTTGTGCAAAGCCTGATATAGGATGAATTACCTCTATTTATTAAATGAATATAAAAGATATTGACAAATTGGAAGATTTAAAACCAAACACACACGCATGTGAAGCTTATGCTTGTAAATCCTTCCTCATTTCCAATTATTTTTTAAGCAGTTTCTTCATCAATAATTTTATTTTAGGGGTGGGGAGTTGGGGCTTTGGTGTCTAATTGAATCCGTGGTAGGAATTTTGGATTATCGCTCTTTAAATAATTGTCTCTACCTTGTTGATACTTAACGAATTTTGTCAAGGCTCCTGACTATTGTAATTTCTCGTTCCTACAGTTATGGTTTCAAGTTCCTTCCGCCAATATAATGCCATTAAATCATTTGACACTTTCAGCCCACTGCAGGCTTCCTTTTGGGGACCAGGCAAACCATTTTAGAGACAAAAGATGGTGGAAGAACTTGGGCTCCACGTACAATACCCTCGGCTGAAGAAGAAGATTTTAACTACAGATTTAATTCTATTAGCTTCAAAGGAAAGGAGGGATGGATTGTTGGCAAACCTGCAATTCTGTTGTACACTTCGGATGCTGGAGAAAGCTGGGAAAGGATACCTCTCAGTGCTCAGCTTCCTGGAGATATGGTAAACTAATATTTTATGTCTTCAATGAGGAATGGGCATATATTATAGGGCTTGATTTTGAGTTTATTTTATTTTAGACTATCCTCTACATGGTTAAATGGTAAATTATAGATTTTTATTTATCAGTTGTAGACTGTAGTGTAATGCTGATTAAAT
ATAGCTAAACAAGGTAAGAGTGTAATAGCACTAAGATATTCGCAGAAACTCAAGTGCACTCATCTACAAATTAAGATGTAAATTTTGGTTTTTAATAATTATACAGCTATAAATTTATTTGTTCTTTTCTGCTAGAAAGTGCTTATATTTAACTCTGGAGTGGGTGATTCTAATATAAAACCTTCAGGTCTACATTAAAGCAACTGGAGAAAAAAGTGCAGAGATGGTTACTGATGAAGGTGCGATATACGTTACATCAAACAAGGGATATAATTGGAAGGCTGCAGTTCAGGAGACTGTTTCTGCCACCCTTAATAGGTATGCTTAGCAGGGGAAAAAAATTAATTCTATATGCGTTTATAATAAAAAAATATATTGTTTAGAATTTAACCTTGAACCATTGTGGATTAACTAATAGACTGATAAGATTTGACTGGCAGCTTGTGGCAGTAAATCATGGAACACAGTTCATTATTTTAGCACATTAAACAAAAAAATTCAATCCCAATTATATGATGCTAATCTGGTTTCTTTACCCTTTTTCAGAACAGTTTCAAGTGGGATAAGTGGTGCAAGCTATTATACTGGAACCTTTAACACAGTAAATCGTTCTCCTGATGGGCGTTATGTTGCTGTTTCGAGTCGTGGTAACTTCTATCTCACCTGGGAGCCTGGGCAGGTTTAACTTCTATGTTTCTTTTACTCTCTGTCTCTCTTTTTCTAACTGTATTATTCTGAACTGGTAGCTTCTTGTAGCTTTGATATCTAATTCATAGTTTTTTGAACTTGGCTTTATTTTCAGCCATTCTGGCAGCCACATAATAGAGCTATTGCTAGGAGGATTCAGAACATGGGGTGGAGAGCTGATGGTGGTCTTTGGCTTCTTGTTCGTGGAGGAGGACTTTTTCTGAGTAAAGGCACAGGGGTAAGTGAAATTGAGGACGTTCCTTTGCATGTCTAGTTACTGGCAATGCATTTCGTTCAAACCACCCTACTGGAAAATATGAACAATGGTTTTGAGTTGCAATTTGGTTTGATTTTAACCTTTTAACTAAAATTATTTCTCCCCTTCTTCTACTAGTTACTCAAGCATAGAATATTGTATTTATCAGCTTTTTATACCTTTATATTTATGGCTGCATAATAACTCAATAGCTGGAAATGCTATACATGGATTGCTTTTCAGATATCATTAGAAACTTTTCAAGTTTTCGATCAATATCATGGAGACGTTGTAGTAATTTTTCATCATATTCATGATTTTCTTGAAAACAGAGTGACAAGTAAGTTCTAGATTGTAAAAAAAAAAAGTAAATTCTAGATTGTCATTATGTAGGTTAAATATGTGTTTTGATTGTTCCCATGTTTTAATAAACCTGTTGTTCTATGTGGTTTCTTACCCAAAATTTCTGCTAGACCTCTCAGCCAACTTCTGTTAAGTATTAACTGCTTGTTGAAACCAATATGTTTACTATAACAGATAAGCGAGGAGTTTGAAGAAGTTCCAGTTCAAAGCCGAGGTTTTGGCATATTAGATGTTGGTTATCGTTCAACGGTAATTTTTTAACATCATTTCAATGTTCATGCATTTGCGCATCCTAATTCATTCATAAAGACTCCTGCTGATATTGTGATTGAGATAGAAATATCATATTTTTTGTCGTAAATAGCTAAAACAGACCTTCTTGGAGGCTGTACTTGATGGATCTGGTTGATTTGTGTGGCAGATTACAGAAGGGAAGGGCCAGACAGAAGTGTCAATATGATTATTTATGTTTATTTTGTGATTATACATGCTATTTGTTGAACTGAAAGATACAGTACTAATCGTGCATTTGGCTGCATTAGTTAGTTATTAGCTTTCTATAAACAAAACAAATGTGTGTGAGAGAGAGAAAGCACAAATGATTAGATGAATGGAAGAGTGGTAGAACTTCTACAATCTCATCACTCATGTTCGTATTAAAATTTGCGTTGTCCAAGATACCTTGAGCTTGAAG
TCGTGTTTGTATTGTGCAAAGTAGCACATATCTCATCATCCCTTGGCAATGAATATATAGTCTGTGTGCTAAATCTGAAGATCCATATCCTTTCGGATTTATTGTGAGCACCGGATGAAAATGAAGCCATCCGTTTAACCACAGTTGATCATTGCCTTGATGGAATATTTTTTAATGTCTGGTCTTCGTACAGGAAGAGGCTTGGGCAGCTGGGGGAAGTGGAATACTTCTGAAAACTACCAATGGTGGCAGGACATGGTCCCGTGATAAAGCAGCTGACAACATTGCAGCCAACCTATACTCTGTAAAGTGGGTATCACCATACTGTAGCCTTTAGTTAGTTTGCAATTTTTGAAATCTGAAGTTACCTCTTTTAGAAAGGAAAAAAAAAAAAAAACTATCCCTTTATCTTAAGCTCTCTTATATGGCTGTTTACCTTGGATGAGTAGAGAGTAAGAAGAAGAATCAAAGCTCTGTGTCTTGCCTCTTTAATGTAAAAATAATGTCTGGTTTTGTTGCCAAGAAAAGCCAACACTGCCAACCATAAACATGGCTGCAAATACAATGCAGTAATTTTGTAGAAGAAATATAGTGCATTGAGTTTAGTGGTGGAGCTTGAGTTCTTTTCTTTCCATTCACATTAAAATTGCTGTTTTCTTTTAATCTATTTACAAGCTTTTATACATCTCCGCAGGTTTATAAACGACAAGAAAGGTTTTGTGCTGGGAAATGACGGTGTATTGCTTCAATATCTTGGTTGA

mRNA sequence

ATGGCGTCTGAATCGACGTCGCTAGACTCGATTTTTCTGGAGGATTTTGGCCAGAAGGTTGACCTGACTCGGCGGATTCGTGAGGTGCTCCTTAACTATCCTGAGGGGACTACTGTTCTCAAGGAGTTGGTGCAGAACGCCGATGATGCAGGCGCCACCAAGGTTTGCCTCTGTCTCGACCGCAGGGTCCACGGCAGCGAGTCGTTGCTGTCGGAGTCACTGGCACCGTTTCAAGGGCCTGCGCTTTTGGCTTATAACAATGCGGTATTTACTGAAGAAGATTTTGTCAGCATTTCGAGAATTGGTGGTAGTAATAAGCACGGGCAAGCCTGGAAGACTGGTCGATTTGGGGTTGGCTTCAACTCTGTGTACCATTTAACTGAGCTGCCTTCATTTGTTAGTGGCAAATATGTTGTAATGTTTGATCCTCAGGGCATTTACCTTCCAAAGGTTTCTGCAGCAAATCCTGGAAAGCGAATTGACTTCATTCGTTGCTCTGCCATCTCACAATATAGAGATCAGTTTCTTCCTTATTGCGCTTTTGATTGTGACATGGAAAGTTCTTTTGATGGAACTTTATTCCGGTTGCCATTAAGAAATGCAGATCAAGCAGCCAGAAGTAAAATTTCAAGGCAAGCTTATACAGAAGAAGATATTTCTTCCATGTTTGCTGAACTTTATGAAGAAGGAGTCTTGACGTTACTCTTTTTGAAAAGTGTTTCATGTATTGAAATGTTTGTATGGAATGATGGTGAGGCAGAACCCCAGAAGCTTTATTCATTCTCTGTGCGTTCAGCAAGCAGTGATATTATTTGGCACAGGCAGATGTTACTGCGGTTATCAAAATCGACAACTTCCACTCAGAGTGAGGTAGATAGCTTTTCACTGGACTTCTTGAGTCAAGCGACAACTGGGACCCAAATAGAGGAAAGGATTGATAGCTTTTTTATAGTTCAAACCATGGCATCAGCTACCAGTAGGATTAGCTCTTTTGCTGCTACTGCATCGAAAGAATATGATATTCACTTGCTACCTTGGGCATCGCTTGCTGTTTGCACATCCGATGACTCCTCAAAGACTAATGTTCTCAAACTTGGTCGGGCCTTCTGTTTCCTCCCTTTGCCAGTAAAAACTGGTCTAAATGTTCAAGTAAATGGATTCTTTGAGGTCTCATCAAATCGTCGTGGCATTTGGTATGGGGCTGACATGGACAGGAGTGGGAAGATTCGTTCAATCTGGAACAGGCTCCTTTTAGAGGATATAATTGCTCCTTCTTTTATAGAACTGCTGATTGGCGTGCAAGTGTTTCTTGGTCCAACGGATACTTACTTTTCCTTGTGGCCTAGTGGTTCATTTGAGGAGCCATGGAACATATTAGTTGAGCAAGTGTACAAAAATATCAGCAATGCTCTTGTTTTGTATTCAAACGTTGAAGGTGGAAAATGGGTCTCACCAATTCAAGCTTTTCTTCACGATGATAAATTCACAAGAAGTAAGGAACTTGGTGAGGCTCTGGTGCTTTTAGGAATGCCTATTGTGCATCTTCCCGAAAATCTGTCCAACATGCTTTTGAAGTTCTGCCGTGCTTTCCAACAGAAAGTGGTTACACCCTGTACAGTTCGTCATTTTCTTAGGGAATGTAAACATGTTGCTACACTGAACAGACCTTACAAACTTGTATTGTTAGAATATTGCATTGAAGATTTGATTGATGCTGATGTTTGCGCACAGGCATTTGACTTACCCTTGATTCCATTGGCAAATGGTGATTTTGGGTTATTTTCAGAAGCCTCCAAAGGGATCTCTTATTTTATTTGTGATGAGTTGGAATATACCCTTCTGCACCAGATTTCTGATAGAGTAATAGATCGGAATATACCTCTCAATATATCAACTAGACTTTCAAATATTGCCAGGTCTTCAAAATCAAATATTTTCATTTTTAATGTTCATTATTTTCTTCAGCTGTTTCCAAAATTTGTGCCTGCTGATTGGAAGTA
TAAAAATGAAGTGTTGTGGGACCCTGAGTCTTGTAGCAACCATCCTACTTCATCATGGTTTTCACTATTTTGGCGGTATCTTCATGATCGGTGTGAAAAGTTGTCATTATTCAGTGACTGGCCCATTTTGCCATCCAAATCAAGGTATCTTTATAGGGCGACTAAAAAATCAAAATTGATTAACGTACAGATGCTTTCTAATGAGATGCAAAAGATACTCAGTAAACTTGGATGCAAGTTGCTCGATCCCTACTATAAGGTTGAACACCGAGACCTAATTCACTATGTAAATGATGGTAACTGTACAGGCGTTTTGGACTCTATATATGATGCGATTTCTTCAACTGGCGGTTTAATGCTGACTTCACTCTACAGTTTAGAGGTGGAAGAGAAGGATGGGCTGCGTAGGTTTCTTCTTGATCCTAAGTGGTATCTTGGAGGCTGTATGAATGATAGTGATTTAGAGAAGTGCAAGAGACTCCCAATTTATAAAGTCTATAATGGAGGATCTGCTCAAGATTTTGGTTTCTCAGATCTTGAAAACCCTCAAAAATACTTACCACCCTCGGATGTTGGGGAGTTCTTTCTTGGAGTAGAGTTTATTGTCAGCTCCTTCGATAGCGAAGTGGAGATTTTGTTAAAATACTATGGGATCAAAAAAATGGGAAAGGCATCTTTCTACAGAAAGCATGTTTTAAACCAAGTTGAGCAGTTGCAGCCCGAACTTCGTGACAGTACCATGCTTTCCGTTTTGAAAAATCTTCCACAATTATGTGTTGAAGATGTAGCTTTCAGAGAATGTTTGTCAAATTTGGCTTTTGTTCCAACTTCAAGTGGCACTCTTAAGTGCCCGACTGTTTTATATGATCCTCGTTATGAAGAATTGTGTGCTTTACTAGATGACTTCGATAGTTTCCCTTCCACTCCTTTCAATGAATCTTACATTCTGGACATTCTACAAGGTTTGGGCCTTAGAACATGTGTAGCTCCTGAGACCATTGTGCAAAGTGCTCAACATGTAGAAAGATTAATGCATAAGGATCACAACAAGGCTCACTCAAGGGGTAAGGTTCTTCTATCATACTTAGAAGTTAATGCAATTAAGTGGCTTCTCAATCCAATGAATGAAGATCAAGGGATGGTGAACCGATTATTTTCTACAGCTGCAACTGCATTTAGACCTCGCAACTTCAATTCTGATCTTGAGAAGTTCTGGAATGATCTTTGTAAAATTTCCTGGTGCCCAGTATTACTTTCCCCTCCTTTTGAAACCTTACCCTGGCCTGTTGTCTCATCCATGGTAGCTCCCCCAAAACTTGTAAGATTGCCCAAGGACTTGTGGCTTGTTTCAGCTAGCATGCGAATACTGGATGGTGAATGTTCTTCTTCAGCTCTTGCACACAGCCTTGGTTGGTCTTCTCCCCCTGGTGGTAGTATTATCGCTGCTCAACTTCTTGAGCTTGGAAAAAACAATGAGATTGTATATGATCAGGTGCTTAGGAAAGAACTGGCCTTAGCAATGCCAAGAATATATGCATTGCTGACAGGCTTGATAGGTTCAGATGAGATGGATGTTGTAAAAGCTGTGCTTGAAGGTTGTCGGTGGATTTGGGTTGGAGATGGGTTTGCAACATCAGAAGAGGTTGTCCTTGATGGTCCTCTTCACTTAGCCCCTTATATTCGTGTTATACCCATTGATTTGGCAGTTTTTAAAGATTTATTTTTAGAACTTGGAATTCGGGAATTTCTGAAGCCCAATGATTATGCTGATATTTTGTCTAGAATGGCAATAAAAAAAGGTTCCTCTCCTCTCAACGCACAGGAAGTGAGGGCAGCCATTCTCATTGTACAACATCTGGCTGAAGCTCAACTTCCAAAGCAACAGATCAATATATATTTACCCGATATTTCGGGTAGGCTATTGCCGGCCAGCAACTTAGTTTACAATGATGCTCCCTGGTTACTAGGGACAGATGATACCGATGTTCCATTTGATGGGG
AATCAACTGTTGTCTTGAATGCTAGGAAAACAGTTCAAAAATTTGTCCATGGGAACATATCCAATGATGTTGCAGAGAAACTTGGTGTCTGCTCACTTCGTAGGATTTTATTAGCTGAGAGTGCTGATTCTATGAATTTGAGTTTATCTGGAGCTGCTGAAGCTTTTGGGCAGCATGAAGCCTTGACAAACAGACTTAGACATATACTTGAAATGTATGCAGATGGTCCTGGGATTCTATTTGAGTTGATTCAAAATGCAGAGGATGCAGGCGCCTCTGAGGTGGTATTTCTTTTGGACAAAACTCATTATGGAACTTCTTCCATTTTATCACCTGAAATGGCTGATTGGCAAGGACCAGCATTGTACTGTTATAATGACTCTGTTTTTAGTTCTCAAGATCTTTATGCAATTTCACGTGTTGGCCAAGAAAGTAAGCTGCAAAAACCATTATCAATTGGAAGATTTGGTTTGGGTTTTAATTGTGTGTATCATTTTACGGATATCCCCACGTTTGTTTCTGGGGAGAACATTGTTATGTTTGATCCTCATGCCTGTAATTTGCCTGGGATTTCTCCTTCTCATCCGGGTCTCCGAATAAAGTATGCTGGAAGAAGAATTTTGGAGCAATTTCCTGATCAATTTTCTCCATACTTGCATTTTGGTTGTGACATGCAGAAGCCATTTCCTGGTACGCTATTTCGTTTCCCCCTTAGGAGTCCGGCTCTTGCTTCTCGAAGTGAAATTAAGAAAGAAGGTTATGCACCTGAAGATGTTACTTCACTTTTTTATTCTTTTTCCGAAGTTGCTTCTGATGCTTTGCTTTTCCTCACCAATGTCAAAAAGATCTCAATATTTATAAAGGATGATATAGAACATGACATGCAATGCCTATATCGTGTGCATAAGAATACAGTTAGTGAACCTTCTACTGAATCCAGTGCAAAGCAGGATATTATTAGCTTTATATATGGAAACCGCCAAGGTGAAATGGATAGAGAACAGTTCCTAATGAAATTGAGTAAATCCATCAACAGAGATCTCCCATATAAATGTCAGAAACTTATTATCACGGAGAAAAGTTCAAGTGGTGATATATTACAGCACTATTGGATAACCTCTGGATGTTTAGGTGGTGGGCTCCCGAGAAACAACTCAGGCTTGGGTGACAAGTCCTATAATTTCATTCCTTGGGCTTGTGTTGCTGCACTTCTACATTCTGTACAGGTAGATGGGGAAATGAACTATGACCCTGAGACTGAAAATAATTGGCTAGTTGCTTCTGATTTAGTTCAAGTTTCTTCTGCTTCTATAGAAGGCAGGAAACCTTTTGAAGGACGTGCTTTCTGCTTTCTACCGTTACCTGTCAGAACTGGTCTCCCTGTGCATGTCAATGCGTATTTTGAGCTTTCGTCAAATCGAAGGGACATATGGTATGGTGATGACATGGCAGGGGGCGGAAAAAAACGTTCAGAATGGAATTCTTATCTCCTTGAAGATGTTGTTGCCCCTGCTTATGGTCGCTTGCTTGAAAAAATTGTGTCAGAGATTGGTCACTCTGGTTTATTCTCCTCATTTTGGCCGACAACAGCAGGATTAGAACCTTGGGGTTCAGTAGTTCGAAAACTCTATAGCTTTATTGGTGATTTTGGTCTTCTTGTTCTGTATACAAATGCTCGAGGAGGTCAGTGGATTTCCACAAAACAAGCTATTTTTCCTGATTTTTCTTTTGACAAAGTATATGAACTTATTGAAGCATTAGCTGATTCCGGCCTGCCAGTCATCGCTATTTCAAAGTCAATTGTTGACAGATTCATGGAGGTACGTCCCTCATTACATTTCCTGACTCCCCATTTGTTAAGAACTCTGTTGATTAAAAGGAAGCGTGCGTTTAAAGACAGGAAAGCAACCATCTTGACCCTTGAATATTGTTTAGTCGATTTGAAACTACCTTTTCAATCCGACAGTCTGTGTGGATTGCCTTTACTACCACTTGCTGAT
GGTTCATTTACCTCATTTCACAAGAATGGGATGGGAGAAAGAACTTACATTGCAAGGGGAGATGAATATGGCCTTCTCAAGGATTCAGTTCCCGGCCAACTTGTGGATCCTGGAATACCAGAAGTAGTTCATGCAAAGCTCTGTGAGGTAGCCCAAACTGAGGATTTAAATATTTGTTTTCTTTCGTGCCAGTTGCTCGAGAAACTTTTCCTGAGATTTCTTCCGGCAGAGTGGCAAAATGCTAGACAGGTGAACTGGAATCCTGGTCATCAGGGTCAGCCTAGCTTGGAATGGATAAGATTGATTTGGTGCTACCTCAAGTCACATTGTGATGATCTATCTCAGTTTTCTAAGTGGCCTATACTGCCCGTTGGGCAGAATTCTCTCCTGCAACTTGTTGAGAATTCAAATGTTCTTAAAGCTGATGGTTGGAGTGAGAACATGTTTTCTTTGTTGCTGAAAGTTGGGTGCTTGTTTTTGAGGCGTGATATGCCTATAGAACATCCACAACTGGAAAACTATGTGCATCCTTCAACAGCAATCGGTATTCTAAATGCTTTTCTGTCTATTGCTGGTGATATTGAGAATGTTGAAGGGCTATTTCGTGATGCATCTGAAAGTGAACTGCATGAGCTCCGGAGCTTTATTCTTCAGTCAAAATGGTATCTGGAAGGAAAAATGGAAGCCATTCATGTTGATATTATTAAATGCATCCCCATGTTTGAGTCGTATAAATGTCGAAAATTAGTAAGTTTGAGTAAACCTATAAGATGGATTAAACCCACTGGTTTATGTGAGGATTTTTTGAATGATGATTTTGTGCGCATGGAATCAGAAAAAGAGAGAATCATTTTGAAAAGATACTTTGGGATCGGAGAACCATCAAGAGTAGAATTCTACAAAGATTATGTTCTTAATCATATGTCGGAATTCCTTTCAGAAAGGGGAGCTCTTTCAACCATTCTGCATGATGTGAAGCTTTTAATTGAAGAGGATGTCTCCCTCAAATCTTCAGTTTCTATGATACCATTTGTGTTGGCTAGCAATGGATCCTGGCAACCACCATCAAGGCTGTATGATCCTCGGGTACTTGAGTTAAATAATATGCTGCATGAAGAAACTTTTTTTCCATCTGAAAATTTTTCAGATGATGACATTTTAGATGCTTTAGTCAGCCTTGGACTTAGAAGATCCCTTGATTTGACTGGTTTGCTAGATTGTGCTAGATCAGTTTCATTGTTGAATGATTCTAAGAATTCTGAATCACAGAGTTACGCTAGGAGATTGTTTGTGTGTTTAGATGCTCTTGCACACAAGCTCTCAATCAAAGTGGAAGGAAGTGGTTATGAACTACAGAATTCTATGCTCATTAAGAGCAATTATGTTGATGATGATGCTTCTATGGAAGTTGGCTCTCTCAATATAGAGGATACCTCTGATATGGGCACTGATTCCTTAATAGGCAACCTGACTGGTGATGAATCAGAGGAAGAATTTTGGTCTGAAATGAATACTATTGCTTGGTGCCCCATTTGTGCCGATTCACCTTTAAAAGTACTTCCATGGTTGAAAACCAATAATCAGGTAGCTCCACCGAGCATTGTGAGACCTAAATCACAGATGTGGATGGTCTCCTCTTCAATGCATATTCTAGATGGTGTGCCTCCTTCAGAATACTTGCAACATAAACTTGGTTGGACTGATTGCCCGAGGGTTGAAGTTTTATGTGCACAGTTGACGGACATATCCAAGCTTTATGGTGAGCTTAGGTTGCATTCTTCACTAGAACCTGATATCAACACTGCATTGCAAGAAGGGATTCCCATTCTTTACTCAAAACTGCAAGAATATATAGGAACTGATGAGTCCGTGCTGTTAAAATCTGCTTTAAATGGTGTATCCTGGGTGTGGGTGGGGGATGATTTTGTACCCCCAAGCGCTCTTGCCTTTGACTCGCCAGTAAAATTCTCTCCTTATCTTTATGTCGTTCCATCCGAATT
ATCAGAATTTAGAGATCTGCTTTCAGAATTAGGTGTTAGGCTTAGTTTTAATGTTGAGGGCTACTTGGATGTTCTTCAACGCTTACACAGTGATGTTGAAGGGTCCCCTCTATCCACAGATCAGATGGATTTTGTGATCTGTATGCTTGAAGCTATTTCAGACTGTTGTGTGGACAAGCCAGAATTCACTGCTACCAGTACTTCTCTTTTAATTCCCAACTCTTCTCAGGTTCTGATGCAAGCAAATGATCTTGTTTACAATGATGCACCATGGATGGAAGACAACAATATTCTTGTTGGGAAACACTTTGTGCATCCAAGTATCAGCAATGATTTGGCAAGCAGGTTGGGCGTGCAATCCATTCGTTGTCTCTCATTAGTTGATGAAGAGATGACTAAAGATCTACCATGCATGGACTATGCTAAAATCAGTGAGCTTCTGATGTTGTATGGCAATGACTATTTGTTCTTTGACCTTCTGGAATTGGCAGACTGCTGCAGAGCTAAAAAGCTGCGCCTAATATTTGACAAAAGAGAACATCCTCGCCAATCATTACTGCAACATAATCTAGGTGAATTTCAAGGTCCTGCACTGGTTGCCATCTTTGAAGGCTCTAGCTTAAATACAGAGGAAATCAGTAGCCTGCAATTTCGTCCTCCTTGGAAATTAAGGGGCGATACTCTTAACTATGGTTTGGGACTGCTCAGCTGTTATTATGTTTGTGATCTCCTTTCAATTGTTTCTGGTGGCTACTTTTATATATTTGATCCCCGTGGAATAGCTCTTTCTGTAGCTCCCAAATCTGCCCCAGGGGCAAAAGTGTTTTCTCTGATAGGTAGTAATTTGATAGAGAAATTTAACGATCAATTTCATCCTATGTTGGGGGGTCAAAATATGTCATGGCCATCAGATTCCACCATAGTCCGCATGCCCCTATCTCCAGCATGCTTAAAAGATGGACTTGAGCCTGGAATAAGAAAGATAAAGGAGATAAGTAGCAAATTTTTGGATCATGCTTCAAGGTCTCTTTTATTCTTAAAATCTGTCGTGCAGGTTTCATTCTCGACCTGGGACCAAGGAGGGCTTCACCCATACCAAGATTATTCAGTTTGCATCAATTTATCATCTGCTATAGCAAGGAACCCATTTTCAGAAAAGAAGTGGAAGAAGTTCCAATTATCAAGGTTATTTAGCAGCTCAAATGCTGCCACAAAATTGCATACAATAGATATAATTGTATTCCAGGGAGAAACTCAATTCGTTGACCGGTGGCTTGTGGTGCTTAGCTTGGGTTCTGGGCAAACTAGAAATATGGCTCTTGATAGGCGATATCTTGCATACAACTTGACACCTGTTGCTGGAGTTGCAGCTCATATATCTCGCAATGGCCTTCCTGCTGATATATGTCAAAAGAGCCCTTTGATGGCTCCCTGTCCTTTATCTGGTGATATAACATTACCTGTCACCGTTCTAGGATGTTTCCTTGTTTGTCACAGTGGTGGCCGTTATCTATTCAAGAATCAAGTTCTCGAGGCTGTAGCTGCACCCCTTGATGCTGGAAATAAGTTAGTCGAAGCCTGGAACAGGGAATTGATGTCCTGCGTATGTGATTCTTACATTTACATGATATTGGAAATTCATAAACAACGAAAAGAATCTTCAAGTTCTGCGTTGGAGTCAAATGTGAGTCATTCCATAAGTTCATCTTTGAAGGCATATGGAAATCAAGTTTATTCGTTTTGGCCTAGGTCTGAACCTGCAAATGGCAGTGATTCTGATCTGGACAGAGGGTTGAAAGCAGATTGGGAATGTCTGGTTGAACAAGTAATCAGGCCATTTTATACTCGTGCTATTGATCTCCCTGTGTGGCAGCTTTACTCTGGAAATTTAGTTAAGGCTGAAGAGGGTATGTTTCTTGCACAACCTGGGAGTCCCGTGGGTGGTAACTTGCTGCCAGCAACAGTTTGTGGTTTTGTGAAGGAGCATTATCCTGTGTTTT
CGGTGCCATGGGAGTTGATTAAGGAGATTCAAGCTGTGGGAATTACAGTACGCCAAATTAGACCTAAAATGGTTCGGGATCTCCTCAGGGTTTCTTCAGCATCTATAGTTCTTCAATCAATTGATACATATTTGGACGTTCTTGAATACTGCTTGTCGGATATTCTGTTGGCTGCATTATCTAATCATGCCGAGGATAGTATGGGAGCTGACTCTGTTAACACTAATCCTGGTGGTAGATCAACTAATACTTCAGAAGGCAGCTCAACTTCTGTTTCCGTCTCTAGCATGAATAGTTTTGCCAGGTTATCTAACCAGAATGCAGCCAGTTCAGGTGATGCTCTTGAAATGATGACAAGTCTGGGCAGGGCTTTATTAGATTTTGGTCGAGGAGTTGTTGAAGATATTGGTAGAAGTGGGGATTCCTTGTCTCATAGTAATACATTTACTGGCAGAAATAACAGCAGCTACAGAAATGTGGACCAGAATTTTCTACAAATGGTATCCGAGATCAAAGGCTTACCATTTCCAAGTGCATCCAACAATTTAGTAAGGTTGGGGAGCATGGAACTTTGGCTTGGTAGTAAAGATCAACAGGAACTGATGATCCCCTTGGCAGCAAGGTTTGTCCACCCTAAAGTATTTGATAGATCAATTTTAGGCAATATCTTGACCAATGATGCCCTGCATAAATTTTTGAAACTACAAAAGTTCTCTCTTAGCTTACTGGCGACCAATATGAGGTCAGTGTTCCATGCGAACTGGGTGAATCATGTAATGAACTCAAATATGTCTCCGTGGTTTTCATGGGAGAATAAATCATGCTCTGGGGTTGAGGAGGGACCATCCTCTGAATGGATAAGACTCTTTTGGAAGAATACGGGTGATTCATCACAGGACCTTTTGCTCTTTTCTGATTGGCCACTTGTTCCTGCTTTTCTTGGTAGACCAATCTTATGTCGTGTGAGAGAGCGCCATCTTGTCTTTCTCCCTCCTGTCACATACCCCGTTTTACCAAATGCTATTTTAGAGATTGGTGCAGGGGGCAGTGATGTGGCGGAGACATCTACGAGTGTGATTTCTAAACCTGAATCCATTCAGCCTTACACTTCAGCTTTTCAAAAATTCCAGGACACGTATCCTTGGCTATTCCCTCTTTTAAACCACTGCAACATTCCAATATTTGATGTAGCTTTCATGGACTGTGCATCCCTATGCAACTGTCTCTCTAATTCTGGCCAATCATTAGGACAAATAATTGCCTCTATGTTTGTGGCGGCTAATAATGCAGGTTACTTTCCGGAACTTGCATCACTTTCAGATTCAAATAGTAATGAGCTCCTCAACCTTTTTGCCAATGACTTTGTTTCAAATGGAACTAACTATGGGCGAGATGAGCTTGAAATATTACGGAAGTTACCCATATATAGAACTGTTGTCGGATCATATACACAATTGCGCGACAATGATCAATGTATGATCTCTTCAAATTCATTCCTTAAACCATATAATGATTGTTGTCTATCTTATTCATCAAATTCAATGGAATATTCATTACTTAGAGCCCTCAGGGTCCCTGAATTGGACAATCAACAAATTTTGATTAGGTTTGGGTTGCCTGCATTTGATTGTAAACCTCAGTCGGAACAGGAAGATATCTTAATATATCTATTTACAAATTGGCAAGATCTTCAAGCCGATGCTCATTTAGTTGAATGCTTGAGCGAGACTAATTTTGTGAGGAGTGCTGATGAGTTTTGCACGGATTTGTTTAAATCAAAGGAATTGTATGATCCAAGTGATGCTTTGTTAACATCCGTCTTCTCTGGTGAAAGGAAAAAATTTCCTGGAGAAAGGTTTGCTGCTGATGGTTGGCTTCGAATTTTAAGGAAAATTGGCCTCAGAACCACAACAGAAGCCAATGTCATTCTTGAATGTGCCAAAAAAGTAGAGACTCTAGGAAGTGAATGGAGGAAGTCGGAGGAGGATGGTTCTGAGTTT
GACTTGATAAATGGTCAAAATGAAGTGCCTATGGAAGTATGGACTTTAGCTGGATCTGTTGTTGAAGCTGTTTTTTCAAATTTTGCTGTCTTTTATAGCAACAATTTTTGTAATGCTCTTGGCAATATTGCTTTTGTTCCGGCTGATTTAGGCTTTCCAAATCTTGGTGGCAATAAAGGTGGCAAAAGAGTTCTCACTTCATACGGTGATGGAATTGTATCAAAAGATTGGCCTCTGGCTTGGAGTTGTGCTCCAATTCTTTCCAAGCACAGTGTTATACCTCCTGACTACTCTTGGGGAGCACTTAATTTGAGAAGCCCTCCAGCCTTCCCCACAGTACTAAAACATTTACAGGTTATTGGAAGGAATGGTGGTGAAGACACTCTTGCTCATTGGCCAATATCCATGGGCGTAATGTCAATTAATGAAGCTTCTTGTGAGGTTTTAAAGTATCTTGAAAGGATTTGGAGTACCTTATCTTCTTTGGATGTTTTGGAATTGCAGAGAGTGGCATTCATTCCTGTAGCTAATGCAACACGCTTGGTCAAAGCTAATGCTCTATTTGCTAGATTGACTATTAATTTATCTCCTTTTGCATTTGAACTTCCAAGTGGATATCTTCCATTCGTGAAGATCCTCAAAGATTTGGGGCTTCAGGACGTACTGTCAGTTGCTTCTGCAAAGGATCTTCTATCAAGTCTTCAAGTAGCTTGTGGATATCAACGCCTAAATCCTAATGAACTTCGATCTGTAATGGAAATCTTACATTATATCTGTGATGAGGCTATGGAAGCAAAGATGTTTGATGGTCGGGAACCTGAAATTATAGTCCCAGATGATGGCTGCAGGCTTGTTCATGCAACATCCTGTGCATATATTGATACTTATGGTTCCCGATATATAAAATGCATCGACACTTCAAGGCTGAGATTTGTTCACTCAGATCTTCCTGAGAGGATTTGTAGAATGTTGGGCATTAAGAAACTATCCGATTTAGTTATTGAGGAGTTGGATCATGAAGATAGTATAGAACCCTTGGAACGTATTGGAGCAGTGTCTCTAGAATTCATCAGAAAGAAGTTATTGAGCAGGTCGTTTCAGAATGCTGTGTGGAATGTTGTCAATAGTATGGTTAATTACATTCATGCAAATAAAAATCTAGATCTGAAAGCTGTAGAGAAATTACTAAAATCTATTGCAGAGAGGCTTCAGTTTGTTAAATCTCTCCATACTCGGTTTTTACTTCTTCCAAATTCTATAGACATCACACGTCCTGCTAAAGATTCCATTATTCCAGAATGGAAGGACGGAATCCATCATAGGGCTCTTTACTTTGTTAATCACTCAAAAACCTGTATTTTAGTTGCTGAGCCCCCTGCTTATATATCAATCTTTGATGTCATTGCCATTGTTGTGAGCCAGATTTTAGGATCACCTATTCCCTTGCCTGTTGGCTCTTTGCTTTTCTGCCCTGAAGGTACTGAAATTGCCATTATCAATATATTAAAACTTTGTTCTGAGAAGGAGAACGAACAATTTACTGGAATTAGTAGTTTGCTTGGAAAAGAGATACTACCCCAAGATGCTCTTCAATTACAGCTTCACCCATTAAGACCGTTTTATGCGGCAGAAGTAGTGGCTTGGCGGTCTCAAAGTGGAGAAAAGCTGAAATATGGTAGGGTCCCAGAGGATGTTAGACCATCAGCTGGCCAAGCACTCTACAAATTCAGGGTCGAAACAGCACCAGGCATCACTCAGTCTCTTATTTCTTCACAAGTTTTATCATTCAGAAGCATTTCCATTGATGGTAGCCATTCCTCTACAAACTTGCAAGATAGTGGTCACATGATAATTGATAGTGGTGCTTCCGTCGAAATGCCAGAGAACTCTGAAAGAGGCAAAATACGATCCCAGCCTGTTGCAGAGCTCCAATATGGCAGAGTATCTGCTGAAGAATTGGTGCAGGCAGTTCATGAAATGCTTTCTACTGCTGGAATCAA
TGTGGATATTGAGCGACAATCCCTCTTGCAAAAGACCGTAGTCCTGCAAGAACAACTGAAGGATTCGCAGGCAGCTCTTCTGTTAGAACAGGAAAGATCTGATGCGGCAGCCAAAGAAGCCGATACAGCGAAAGCAGCTTGGCTTTGTCGGGTTTGTTTGACTTCCGAGGTAGAAATTACCATAGTTCCCTGTGGCCATGTTCTGTGTCGAAAATGTTCTGCTGCTGTTTCAAAAAGTAGACGCTTCAAAATCACCCGCGAAGCCGTTGAGAATTGCGTTGTGGATAATGTGAGAGATAAGAGGTCGGAATCGGAACCTCCAACAATTCCTCTGCTCCTTTCCTTCTCTTCAATGGCGTCTCTCCAACAACTCCTCAAACCTTCCTGGACTTCTCTCTTCGCCCCTTCCCTCTCTCAGCCTCAGCCTCAGCCTCAGCCTCGAACTTTTACTACTTCCACTCCTAGAGCCTCTCTCCAGAATTCTTCCATCAATCGCCGGCAGTTCGTAGCTGAGACGGCAGCAGCGGTTTCTCTGTCGCTTTCTCCGCTTATTGCTCCCGTGCAACCGGCGAAGTCCGAAGAAGCTCTCTCGGAGTGGGAGAGGCTTTTCCTTCCTATAGATCCGGGTGTTGTCCTTCTCGACATTGCTTTTGTGCCCGATGATTTGGACCATGGCTTCCTTTTGGGGACCAGGCAAACCATTTTAGAGACAAAAGATGGTGGAAGAACTTGGGCTCCACGTACAATACCCTCGGCTGAAGAAGAAGATTTTAACTACAGATTTAATTCTATTAGCTTCAAAGGAAAGGAGGGATGGATTGTTGGCAAACCTGCAATTCTGTTGTACACTTCGGATGCTGGAGAAAGCTGGGAAAGGATACCTCTCAGTGCTCAGCTTCCTGGAGATATGGTCTACATTAAAGCAACTGGAGAAAAAAGTGCAGAGATGGTTACTGATGAAGGTGCGATATACGTTACATCAAACAAGGGATATAATTGGAAGGCTGCAGTTCAGGAGACTGTTTCTGCCACCCTTAATAGAACAGTTTCAAGTGGGATAAGTGGTGCAAGCTATTATACTGGAACCTTTAACACAGTAAATCGTTCTCCTGATGGGCGTTATGTTGCTGTTTCGAGTCGTGGTAACTTCTATCTCACCTGGGAGCCTGGGCAGCCATTCTGGCAGCCACATAATAGAGCTATTGCTAGGAGGATTCAGAACATGGGGTGGAGAGCTGATGGTGGTCTTTGGCTTCTTGTTCGTGGAGGAGGACTTTTTCTGAGTAAAGGCACAGGGATAAGCGAGGAGTTTGAAGAAGTTCCAGTTCAAAGCCGAGGTTTTGGCATATTAGATGTTGGTTATCGTTCAACGGAAGAGGCTTGGGCAGCTGGGGGAAGTGGAATACTTCTGAAAACTACCAATGGTGGCAGGACATGGTCCCGTGATAAAGCAGCTGACAACATTGCAGCCAACCTATACTCTGTAAAGTTTATAAACGACAAGAAAGGTTTTGTGCTGGGAAATGACGGTGTATTGCTTCAATATCTTGGTTGA

Coding sequence (CDS)

ATGGCGTCTGAATCGACGTCGCTAGACTCGATTTTTCTGGAGGATTTTGGCCAGAAGGTTGACCTGACTCGGCGGATTCGTGAGGTGCTCCTTAACTATCCTGAGGGGACTACTGTTCTCAAGGAGTTGGTGCAGAACGCCGATGATGCAGGCGCCACCAAGGTTTGCCTCTGTCTCGACCGCAGGGTCCACGGCAGCGAGTCGTTGCTGTCGGAGTCACTGGCACCGTTTCAAGGGCCTGCGCTTTTGGCTTATAACAATGCGGTATTTACTGAAGAAGATTTTGTCAGCATTTCGAGAATTGGTGGTAGTAATAAGCACGGGCAAGCCTGGAAGACTGGTCGATTTGGGGTTGGCTTCAACTCTGTGTACCATTTAACTGAGCTGCCTTCATTTGTTAGTGGCAAATATGTTGTAATGTTTGATCCTCAGGGCATTTACCTTCCAAAGGTTTCTGCAGCAAATCCTGGAAAGCGAATTGACTTCATTCGTTGCTCTGCCATCTCACAATATAGAGATCAGTTTCTTCCTTATTGCGCTTTTGATTGTGACATGGAAAGTTCTTTTGATGGAACTTTATTCCGGTTGCCATTAAGAAATGCAGATCAAGCAGCCAGAAGTAAAATTTCAAGGCAAGCTTATACAGAAGAAGATATTTCTTCCATGTTTGCTGAACTTTATGAAGAAGGAGTCTTGACGTTACTCTTTTTGAAAAGTGTTTCATGTATTGAAATGTTTGTATGGAATGATGGTGAGGCAGAACCCCAGAAGCTTTATTCATTCTCTGTGCGTTCAGCAAGCAGTGATATTATTTGGCACAGGCAGATGTTACTGCGGTTATCAAAATCGACAACTTCCACTCAGAGTGAGGTAGATAGCTTTTCACTGGACTTCTTGAGTCAAGCGACAACTGGGACCCAAATAGAGGAAAGGATTGATAGCTTTTTTATAGTTCAAACCATGGCATCAGCTACCAGTAGGATTAGCTCTTTTGCTGCTACTGCATCGAAAGAATATGATATTCACTTGCTACCTTGGGCATCGCTTGCTGTTTGCACATCCGATGACTCCTCAAAGACTAATGTTCTCAAACTTGGTCGGGCCTTCTGTTTCCTCCCTTTGCCAGTAAAAACTGGTCTAAATGTTCAAGTAAATGGATTCTTTGAGGTCTCATCAAATCGTCGTGGCATTTGGTATGGGGCTGACATGGACAGGAGTGGGAAGATTCGTTCAATCTGGAACAGGCTCCTTTTAGAGGATATAATTGCTCCTTCTTTTATAGAACTGCTGATTGGCGTGCAAGTGTTTCTTGGTCCAACGGATACTTACTTTTCCTTGTGGCCTAGTGGTTCATTTGAGGAGCCATGGAACATATTAGTTGAGCAAGTGTACAAAAATATCAGCAATGCTCTTGTTTTGTATTCAAACGTTGAAGGTGGAAAATGGGTCTCACCAATTCAAGCTTTTCTTCACGATGATAAATTCACAAGAAGTAAGGAACTTGGTGAGGCTCTGGTGCTTTTAGGAATGCCTATTGTGCATCTTCCCGAAAATCTGTCCAACATGCTTTTGAAGTTCTGCCGTGCTTTCCAACAGAAAGTGGTTACACCCTGTACAGTTCGTCATTTTCTTAGGGAATGTAAACATGTTGCTACACTGAACAGACCTTACAAACTTGTATTGTTAGAATATTGCATTGAAGATTTGATTGATGCTGATGTTTGCGCACAGGCATTTGACTTACCCTTGATTCCATTGGCAAATGGTGATTTTGGGTTATTTTCAGAAGCCTCCAAAGGGATCTCTTATTTTATTTGTGATGAGTTGGAATATACCCTTCTGCACCAGATTTCTGATAGAGTAATAGATCGGAATATACCTCTCAATATATCAACTAGACTTTCAAATATTGCCAGGTCTTCAAAATCAAATATTTTCATTTTTAATGTTCATTATTTTCTTCAGCTGTTTCCAAAATTTGTGCCTGCTGATTGGAAGTA
TAAAAATGAAGTGTTGTGGGACCCTGAGTCTTGTAGCAACCATCCTACTTCATCATGGTTTTCACTATTTTGGCGGTATCTTCATGATCGGTGTGAAAAGTTGTCATTATTCAGTGACTGGCCCATTTTGCCATCCAAATCAAGGTATCTTTATAGGGCGACTAAAAAATCAAAATTGATTAACGTACAGATGCTTTCTAATGAGATGCAAAAGATACTCAGTAAACTTGGATGCAAGTTGCTCGATCCCTACTATAAGGTTGAACACCGAGACCTAATTCACTATGTAAATGATGGTAACTGTACAGGCGTTTTGGACTCTATATATGATGCGATTTCTTCAACTGGCGGTTTAATGCTGACTTCACTCTACAGTTTAGAGGTGGAAGAGAAGGATGGGCTGCGTAGGTTTCTTCTTGATCCTAAGTGGTATCTTGGAGGCTGTATGAATGATAGTGATTTAGAGAAGTGCAAGAGACTCCCAATTTATAAAGTCTATAATGGAGGATCTGCTCAAGATTTTGGTTTCTCAGATCTTGAAAACCCTCAAAAATACTTACCACCCTCGGATGTTGGGGAGTTCTTTCTTGGAGTAGAGTTTATTGTCAGCTCCTTCGATAGCGAAGTGGAGATTTTGTTAAAATACTATGGGATCAAAAAAATGGGAAAGGCATCTTTCTACAGAAAGCATGTTTTAAACCAAGTTGAGCAGTTGCAGCCCGAACTTCGTGACAGTACCATGCTTTCCGTTTTGAAAAATCTTCCACAATTATGTGTTGAAGATGTAGCTTTCAGAGAATGTTTGTCAAATTTGGCTTTTGTTCCAACTTCAAGTGGCACTCTTAAGTGCCCGACTGTTTTATATGATCCTCGTTATGAAGAATTGTGTGCTTTACTAGATGACTTCGATAGTTTCCCTTCCACTCCTTTCAATGAATCTTACATTCTGGACATTCTACAAGGTTTGGGCCTTAGAACATGTGTAGCTCCTGAGACCATTGTGCAAAGTGCTCAACATGTAGAAAGATTAATGCATAAGGATCACAACAAGGCTCACTCAAGGGGTAAGGTTCTTCTATCATACTTAGAAGTTAATGCAATTAAGTGGCTTCTCAATCCAATGAATGAAGATCAAGGGATGGTGAACCGATTATTTTCTACAGCTGCAACTGCATTTAGACCTCGCAACTTCAATTCTGATCTTGAGAAGTTCTGGAATGATCTTTGTAAAATTTCCTGGTGCCCAGTATTACTTTCCCCTCCTTTTGAAACCTTACCCTGGCCTGTTGTCTCATCCATGGTAGCTCCCCCAAAACTTGTAAGATTGCCCAAGGACTTGTGGCTTGTTTCAGCTAGCATGCGAATACTGGATGGTGAATGTTCTTCTTCAGCTCTTGCACACAGCCTTGGTTGGTCTTCTCCCCCTGGTGGTAGTATTATCGCTGCTCAACTTCTTGAGCTTGGAAAAAACAATGAGATTGTATATGATCAGGTGCTTAGGAAAGAACTGGCCTTAGCAATGCCAAGAATATATGCATTGCTGACAGGCTTGATAGGTTCAGATGAGATGGATGTTGTAAAAGCTGTGCTTGAAGGTTGTCGGTGGATTTGGGTTGGAGATGGGTTTGCAACATCAGAAGAGGTTGTCCTTGATGGTCCTCTTCACTTAGCCCCTTATATTCGTGTTATACCCATTGATTTGGCAGTTTTTAAAGATTTATTTTTAGAACTTGGAATTCGGGAATTTCTGAAGCCCAATGATTATGCTGATATTTTGTCTAGAATGGCAATAAAAAAAGGTTCCTCTCCTCTCAACGCACAGGAAGTGAGGGCAGCCATTCTCATTGTACAACATCTGGCTGAAGCTCAACTTCCAAAGCAACAGATCAATATATATTTACCCGATATTTCGGGTAGGCTATTGCCGGCCAGCAACTTAGTTTACAATGATGCTCCCTGGTTACTAGGGACAGATGATACCGATGTTCCATTTGATGGGG
AATCAACTGTTGTCTTGAATGCTAGGAAAACAGTTCAAAAATTTGTCCATGGGAACATATCCAATGATGTTGCAGAGAAACTTGGTGTCTGCTCACTTCGTAGGATTTTATTAGCTGAGAGTGCTGATTCTATGAATTTGAGTTTATCTGGAGCTGCTGAAGCTTTTGGGCAGCATGAAGCCTTGACAAACAGACTTAGACATATACTTGAAATGTATGCAGATGGTCCTGGGATTCTATTTGAGTTGATTCAAAATGCAGAGGATGCAGGCGCCTCTGAGGTGGTATTTCTTTTGGACAAAACTCATTATGGAACTTCTTCCATTTTATCACCTGAAATGGCTGATTGGCAAGGACCAGCATTGTACTGTTATAATGACTCTGTTTTTAGTTCTCAAGATCTTTATGCAATTTCACGTGTTGGCCAAGAAAGTAAGCTGCAAAAACCATTATCAATTGGAAGATTTGGTTTGGGTTTTAATTGTGTGTATCATTTTACGGATATCCCCACGTTTGTTTCTGGGGAGAACATTGTTATGTTTGATCCTCATGCCTGTAATTTGCCTGGGATTTCTCCTTCTCATCCGGGTCTCCGAATAAAGTATGCTGGAAGAAGAATTTTGGAGCAATTTCCTGATCAATTTTCTCCATACTTGCATTTTGGTTGTGACATGCAGAAGCCATTTCCTGGTACGCTATTTCGTTTCCCCCTTAGGAGTCCGGCTCTTGCTTCTCGAAGTGAAATTAAGAAAGAAGGTTATGCACCTGAAGATGTTACTTCACTTTTTTATTCTTTTTCCGAAGTTGCTTCTGATGCTTTGCTTTTCCTCACCAATGTCAAAAAGATCTCAATATTTATAAAGGATGATATAGAACATGACATGCAATGCCTATATCGTGTGCATAAGAATACAGTTAGTGAACCTTCTACTGAATCCAGTGCAAAGCAGGATATTATTAGCTTTATATATGGAAACCGCCAAGGTGAAATGGATAGAGAACAGTTCCTAATGAAATTGAGTAAATCCATCAACAGAGATCTCCCATATAAATGTCAGAAACTTATTATCACGGAGAAAAGTTCAAGTGGTGATATATTACAGCACTATTGGATAACCTCTGGATGTTTAGGTGGTGGGCTCCCGAGAAACAACTCAGGCTTGGGTGACAAGTCCTATAATTTCATTCCTTGGGCTTGTGTTGCTGCACTTCTACATTCTGTACAGGTAGATGGGGAAATGAACTATGACCCTGAGACTGAAAATAATTGGCTAGTTGCTTCTGATTTAGTTCAAGTTTCTTCTGCTTCTATAGAAGGCAGGAAACCTTTTGAAGGACGTGCTTTCTGCTTTCTACCGTTACCTGTCAGAACTGGTCTCCCTGTGCATGTCAATGCGTATTTTGAGCTTTCGTCAAATCGAAGGGACATATGGTATGGTGATGACATGGCAGGGGGCGGAAAAAAACGTTCAGAATGGAATTCTTATCTCCTTGAAGATGTTGTTGCCCCTGCTTATGGTCGCTTGCTTGAAAAAATTGTGTCAGAGATTGGTCACTCTGGTTTATTCTCCTCATTTTGGCCGACAACAGCAGGATTAGAACCTTGGGGTTCAGTAGTTCGAAAACTCTATAGCTTTATTGGTGATTTTGGTCTTCTTGTTCTGTATACAAATGCTCGAGGAGGTCAGTGGATTTCCACAAAACAAGCTATTTTTCCTGATTTTTCTTTTGACAAAGTATATGAACTTATTGAAGCATTAGCTGATTCCGGCCTGCCAGTCATCGCTATTTCAAAGTCAATTGTTGACAGATTCATGGAGGTACGTCCCTCATTACATTTCCTGACTCCCCATTTGTTAAGAACTCTGTTGATTAAAAGGAAGCGTGCGTTTAAAGACAGGAAAGCAACCATCTTGACCCTTGAATATTGTTTAGTCGATTTGAAACTACCTTTTCAATCCGACAGTCTGTGTGGATTGCCTTTACTACCACTTGCTGAT
GGTTCATTTACCTCATTTCACAAGAATGGGATGGGAGAAAGAACTTACATTGCAAGGGGAGATGAATATGGCCTTCTCAAGGATTCAGTTCCCGGCCAACTTGTGGATCCTGGAATACCAGAAGTAGTTCATGCAAAGCTCTGTGAGGTAGCCCAAACTGAGGATTTAAATATTTGTTTTCTTTCGTGCCAGTTGCTCGAGAAACTTTTCCTGAGATTTCTTCCGGCAGAGTGGCAAAATGCTAGACAGGTGAACTGGAATCCTGGTCATCAGGGTCAGCCTAGCTTGGAATGGATAAGATTGATTTGGTGCTACCTCAAGTCACATTGTGATGATCTATCTCAGTTTTCTAAGTGGCCTATACTGCCCGTTGGGCAGAATTCTCTCCTGCAACTTGTTGAGAATTCAAATGTTCTTAAAGCTGATGGTTGGAGTGAGAACATGTTTTCTTTGTTGCTGAAAGTTGGGTGCTTGTTTTTGAGGCGTGATATGCCTATAGAACATCCACAACTGGAAAACTATGTGCATCCTTCAACAGCAATCGGTATTCTAAATGCTTTTCTGTCTATTGCTGGTGATATTGAGAATGTTGAAGGGCTATTTCGTGATGCATCTGAAAGTGAACTGCATGAGCTCCGGAGCTTTATTCTTCAGTCAAAATGGTATCTGGAAGGAAAAATGGAAGCCATTCATGTTGATATTATTAAATGCATCCCCATGTTTGAGTCGTATAAATGTCGAAAATTAGTAAGTTTGAGTAAACCTATAAGATGGATTAAACCCACTGGTTTATGTGAGGATTTTTTGAATGATGATTTTGTGCGCATGGAATCAGAAAAAGAGAGAATCATTTTGAAAAGATACTTTGGGATCGGAGAACCATCAAGAGTAGAATTCTACAAAGATTATGTTCTTAATCATATGTCGGAATTCCTTTCAGAAAGGGGAGCTCTTTCAACCATTCTGCATGATGTGAAGCTTTTAATTGAAGAGGATGTCTCCCTCAAATCTTCAGTTTCTATGATACCATTTGTGTTGGCTAGCAATGGATCCTGGCAACCACCATCAAGGCTGTATGATCCTCGGGTACTTGAGTTAAATAATATGCTGCATGAAGAAACTTTTTTTCCATCTGAAAATTTTTCAGATGATGACATTTTAGATGCTTTAGTCAGCCTTGGACTTAGAAGATCCCTTGATTTGACTGGTTTGCTAGATTGTGCTAGATCAGTTTCATTGTTGAATGATTCTAAGAATTCTGAATCACAGAGTTACGCTAGGAGATTGTTTGTGTGTTTAGATGCTCTTGCACACAAGCTCTCAATCAAAGTGGAAGGAAGTGGTTATGAACTACAGAATTCTATGCTCATTAAGAGCAATTATGTTGATGATGATGCTTCTATGGAAGTTGGCTCTCTCAATATAGAGGATACCTCTGATATGGGCACTGATTCCTTAATAGGCAACCTGACTGGTGATGAATCAGAGGAAGAATTTTGGTCTGAAATGAATACTATTGCTTGGTGCCCCATTTGTGCCGATTCACCTTTAAAAGTACTTCCATGGTTGAAAACCAATAATCAGGTAGCTCCACCGAGCATTGTGAGACCTAAATCACAGATGTGGATGGTCTCCTCTTCAATGCATATTCTAGATGGTGTGCCTCCTTCAGAATACTTGCAACATAAACTTGGTTGGACTGATTGCCCGAGGGTTGAAGTTTTATGTGCACAGTTGACGGACATATCCAAGCTTTATGGTGAGCTTAGGTTGCATTCTTCACTAGAACCTGATATCAACACTGCATTGCAAGAAGGGATTCCCATTCTTTACTCAAAACTGCAAGAATATATAGGAACTGATGAGTCCGTGCTGTTAAAATCTGCTTTAAATGGTGTATCCTGGGTGTGGGTGGGGGATGATTTTGTACCCCCAAGCGCTCTTGCCTTTGACTCGCCAGTAAAATTCTCTCCTTATCTTTATGTCGTTCCATCCGAATT
ATCAGAATTTAGAGATCTGCTTTCAGAATTAGGTGTTAGGCTTAGTTTTAATGTTGAGGGCTACTTGGATGTTCTTCAACGCTTACACAGTGATGTTGAAGGGTCCCCTCTATCCACAGATCAGATGGATTTTGTGATCTGTATGCTTGAAGCTATTTCAGACTGTTGTGTGGACAAGCCAGAATTCACTGCTACCAGTACTTCTCTTTTAATTCCCAACTCTTCTCAGGTTCTGATGCAAGCAAATGATCTTGTTTACAATGATGCACCATGGATGGAAGACAACAATATTCTTGTTGGGAAACACTTTGTGCATCCAAGTATCAGCAATGATTTGGCAAGCAGGTTGGGCGTGCAATCCATTCGTTGTCTCTCATTAGTTGATGAAGAGATGACTAAAGATCTACCATGCATGGACTATGCTAAAATCAGTGAGCTTCTGATGTTGTATGGCAATGACTATTTGTTCTTTGACCTTCTGGAATTGGCAGACTGCTGCAGAGCTAAAAAGCTGCGCCTAATATTTGACAAAAGAGAACATCCTCGCCAATCATTACTGCAACATAATCTAGGTGAATTTCAAGGTCCTGCACTGGTTGCCATCTTTGAAGGCTCTAGCTTAAATACAGAGGAAATCAGTAGCCTGCAATTTCGTCCTCCTTGGAAATTAAGGGGCGATACTCTTAACTATGGTTTGGGACTGCTCAGCTGTTATTATGTTTGTGATCTCCTTTCAATTGTTTCTGGTGGCTACTTTTATATATTTGATCCCCGTGGAATAGCTCTTTCTGTAGCTCCCAAATCTGCCCCAGGGGCAAAAGTGTTTTCTCTGATAGGTAGTAATTTGATAGAGAAATTTAACGATCAATTTCATCCTATGTTGGGGGGTCAAAATATGTCATGGCCATCAGATTCCACCATAGTCCGCATGCCCCTATCTCCAGCATGCTTAAAAGATGGACTTGAGCCTGGAATAAGAAAGATAAAGGAGATAAGTAGCAAATTTTTGGATCATGCTTCAAGGTCTCTTTTATTCTTAAAATCTGTCGTGCAGGTTTCATTCTCGACCTGGGACCAAGGAGGGCTTCACCCATACCAAGATTATTCAGTTTGCATCAATTTATCATCTGCTATAGCAAGGAACCCATTTTCAGAAAAGAAGTGGAAGAAGTTCCAATTATCAAGGTTATTTAGCAGCTCAAATGCTGCCACAAAATTGCATACAATAGATATAATTGTATTCCAGGGAGAAACTCAATTCGTTGACCGGTGGCTTGTGGTGCTTAGCTTGGGTTCTGGGCAAACTAGAAATATGGCTCTTGATAGGCGATATCTTGCATACAACTTGACACCTGTTGCTGGAGTTGCAGCTCATATATCTCGCAATGGCCTTCCTGCTGATATATGTCAAAAGAGCCCTTTGATGGCTCCCTGTCCTTTATCTGGTGATATAACATTACCTGTCACCGTTCTAGGATGTTTCCTTGTTTGTCACAGTGGTGGCCGTTATCTATTCAAGAATCAAGTTCTCGAGGCTGTAGCTGCACCCCTTGATGCTGGAAATAAGTTAGTCGAAGCCTGGAACAGGGAATTGATGTCCTGCGTATGTGATTCTTACATTTACATGATATTGGAAATTCATAAACAACGAAAAGAATCTTCAAGTTCTGCGTTGGAGTCAAATGTGAGTCATTCCATAAGTTCATCTTTGAAGGCATATGGAAATCAAGTTTATTCGTTTTGGCCTAGGTCTGAACCTGCAAATGGCAGTGATTCTGATCTGGACAGAGGGTTGAAAGCAGATTGGGAATGTCTGGTTGAACAAGTAATCAGGCCATTTTATACTCGTGCTATTGATCTCCCTGTGTGGCAGCTTTACTCTGGAAATTTAGTTAAGGCTGAAGAGGGTATGTTTCTTGCACAACCTGGGAGTCCCGTGGGTGGTAACTTGCTGCCAGCAACAGTTTGTGGTTTTGTGAAGGAGCATTATCCTGTGTTTT
CGGTGCCATGGGAGTTGATTAAGGAGATTCAAGCTGTGGGAATTACAGTACGCCAAATTAGACCTAAAATGGTTCGGGATCTCCTCAGGGTTTCTTCAGCATCTATAGTTCTTCAATCAATTGATACATATTTGGACGTTCTTGAATACTGCTTGTCGGATATTCTGTTGGCTGCATTATCTAATCATGCCGAGGATAGTATGGGAGCTGACTCTGTTAACACTAATCCTGGTGGTAGATCAACTAATACTTCAGAAGGCAGCTCAACTTCTGTTTCCGTCTCTAGCATGAATAGTTTTGCCAGGTTATCTAACCAGAATGCAGCCAGTTCAGGTGATGCTCTTGAAATGATGACAAGTCTGGGCAGGGCTTTATTAGATTTTGGTCGAGGAGTTGTTGAAGATATTGGTAGAAGTGGGGATTCCTTGTCTCATAGTAATACATTTACTGGCAGAAATAACAGCAGCTACAGAAATGTGGACCAGAATTTTCTACAAATGGTATCCGAGATCAAAGGCTTACCATTTCCAAGTGCATCCAACAATTTAGTAAGGTTGGGGAGCATGGAACTTTGGCTTGGTAGTAAAGATCAACAGGAACTGATGATCCCCTTGGCAGCAAGGTTTGTCCACCCTAAAGTATTTGATAGATCAATTTTAGGCAATATCTTGACCAATGATGCCCTGCATAAATTTTTGAAACTACAAAAGTTCTCTCTTAGCTTACTGGCGACCAATATGAGGTCAGTGTTCCATGCGAACTGGGTGAATCATGTAATGAACTCAAATATGTCTCCGTGGTTTTCATGGGAGAATAAATCATGCTCTGGGGTTGAGGAGGGACCATCCTCTGAATGGATAAGACTCTTTTGGAAGAATACGGGTGATTCATCACAGGACCTTTTGCTCTTTTCTGATTGGCCACTTGTTCCTGCTTTTCTTGGTAGACCAATCTTATGTCGTGTGAGAGAGCGCCATCTTGTCTTTCTCCCTCCTGTCACATACCCCGTTTTACCAAATGCTATTTTAGAGATTGGTGCAGGGGGCAGTGATGTGGCGGAGACATCTACGAGTGTGATTTCTAAACCTGAATCCATTCAGCCTTACACTTCAGCTTTTCAAAAATTCCAGGACACGTATCCTTGGCTATTCCCTCTTTTAAACCACTGCAACATTCCAATATTTGATGTAGCTTTCATGGACTGTGCATCCCTATGCAACTGTCTCTCTAATTCTGGCCAATCATTAGGACAAATAATTGCCTCTATGTTTGTGGCGGCTAATAATGCAGGTTACTTTCCGGAACTTGCATCACTTTCAGATTCAAATAGTAATGAGCTCCTCAACCTTTTTGCCAATGACTTTGTTTCAAATGGAACTAACTATGGGCGAGATGAGCTTGAAATATTACGGAAGTTACCCATATATAGAACTGTTGTCGGATCATATACACAATTGCGCGACAATGATCAATGTATGATCTCTTCAAATTCATTCCTTAAACCATATAATGATTGTTGTCTATCTTATTCATCAAATTCAATGGAATATTCATTACTTAGAGCCCTCAGGGTCCCTGAATTGGACAATCAACAAATTTTGATTAGGTTTGGGTTGCCTGCATTTGATTGTAAACCTCAGTCGGAACAGGAAGATATCTTAATATATCTATTTACAAATTGGCAAGATCTTCAAGCCGATGCTCATTTAGTTGAATGCTTGAGCGAGACTAATTTTGTGAGGAGTGCTGATGAGTTTTGCACGGATTTGTTTAAATCAAAGGAATTGTATGATCCAAGTGATGCTTTGTTAACATCCGTCTTCTCTGGTGAAAGGAAAAAATTTCCTGGAGAAAGGTTTGCTGCTGATGGTTGGCTTCGAATTTTAAGGAAAATTGGCCTCAGAACCACAACAGAAGCCAATGTCATTCTTGAATGTGCCAAAAAAGTAGAGACTCTAGGAAGTGAATGGAGGAAGTCGGAGGAGGATGGTTCTGAGTTT
GACTTGATAAATGGTCAAAATGAAGTGCCTATGGAAGTATGGACTTTAGCTGGATCTGTTGTTGAAGCTGTTTTTTCAAATTTTGCTGTCTTTTATAGCAACAATTTTTGTAATGCTCTTGGCAATATTGCTTTTGTTCCGGCTGATTTAGGCTTTCCAAATCTTGGTGGCAATAAAGGTGGCAAAAGAGTTCTCACTTCATACGGTGATGGAATTGTATCAAAAGATTGGCCTCTGGCTTGGAGTTGTGCTCCAATTCTTTCCAAGCACAGTGTTATACCTCCTGACTACTCTTGGGGAGCACTTAATTTGAGAAGCCCTCCAGCCTTCCCCACAGTACTAAAACATTTACAGGTTATTGGAAGGAATGGTGGTGAAGACACTCTTGCTCATTGGCCAATATCCATGGGCGTAATGTCAATTAATGAAGCTTCTTGTGAGGTTTTAAAGTATCTTGAAAGGATTTGGAGTACCTTATCTTCTTTGGATGTTTTGGAATTGCAGAGAGTGGCATTCATTCCTGTAGCTAATGCAACACGCTTGGTCAAAGCTAATGCTCTATTTGCTAGATTGACTATTAATTTATCTCCTTTTGCATTTGAACTTCCAAGTGGATATCTTCCATTCGTGAAGATCCTCAAAGATTTGGGGCTTCAGGACGTACTGTCAGTTGCTTCTGCAAAGGATCTTCTATCAAGTCTTCAAGTAGCTTGTGGATATCAACGCCTAAATCCTAATGAACTTCGATCTGTAATGGAAATCTTACATTATATCTGTGATGAGGCTATGGAAGCAAAGATGTTTGATGGTCGGGAACCTGAAATTATAGTCCCAGATGATGGCTGCAGGCTTGTTCATGCAACATCCTGTGCATATATTGATACTTATGGTTCCCGATATATAAAATGCATCGACACTTCAAGGCTGAGATTTGTTCACTCAGATCTTCCTGAGAGGATTTGTAGAATGTTGGGCATTAAGAAACTATCCGATTTAGTTATTGAGGAGTTGGATCATGAAGATAGTATAGAACCCTTGGAACGTATTGGAGCAGTGTCTCTAGAATTCATCAGAAAGAAGTTATTGAGCAGGTCGTTTCAGAATGCTGTGTGGAATGTTGTCAATAGTATGGTTAATTACATTCATGCAAATAAAAATCTAGATCTGAAAGCTGTAGAGAAATTACTAAAATCTATTGCAGAGAGGCTTCAGTTTGTTAAATCTCTCCATACTCGGTTTTTACTTCTTCCAAATTCTATAGACATCACACGTCCTGCTAAAGATTCCATTATTCCAGAATGGAAGGACGGAATCCATCATAGGGCTCTTTACTTTGTTAATCACTCAAAAACCTGTATTTTAGTTGCTGAGCCCCCTGCTTATATATCAATCTTTGATGTCATTGCCATTGTTGTGAGCCAGATTTTAGGATCACCTATTCCCTTGCCTGTTGGCTCTTTGCTTTTCTGCCCTGAAGGTACTGAAATTGCCATTATCAATATATTAAAACTTTGTTCTGAGAAGGAGAACGAACAATTTACTGGAATTAGTAGTTTGCTTGGAAAAGAGATACTACCCCAAGATGCTCTTCAATTACAGCTTCACCCATTAAGACCGTTTTATGCGGCAGAAGTAGTGGCTTGGCGGTCTCAAAGTGGAGAAAAGCTGAAATATGGTAGGGTCCCAGAGGATGTTAGACCATCAGCTGGCCAAGCACTCTACAAATTCAGGGTCGAAACAGCACCAGGCATCACTCAGTCTCTTATTTCTTCACAAGTTTTATCATTCAGAAGCATTTCCATTGATGGTAGCCATTCCTCTACAAACTTGCAAGATAGTGGTCACATGATAATTGATAGTGGTGCTTCCGTCGAAATGCCAGAGAACTCTGAAAGAGGCAAAATACGATCCCAGCCTGTTGCAGAGCTCCAATATGGCAGAGTATCTGCTGAAGAATTGGTGCAGGCAGTTCATGAAATGCTTTCTACTGCTGGAATCAA
TGTGGATATTGAGCGACAATCCCTCTTGCAAAAGACCGTAGTCCTGCAAGAACAACTGAAGGATTCGCAGGCAGCTCTTCTGTTAGAACAGGAAAGATCTGATGCGGCAGCCAAAGAAGCCGATACAGCGAAAGCAGCTTGGCTTTGTCGGGTTTGTTTGACTTCCGAGGTAGAAATTACCATAGTTCCCTGTGGCCATGTTCTGTGTCGAAAATGTTCTGCTGCTGTTTCAAAAAGTAGACGCTTCAAAATCACCCGCGAAGCCGTTGAGAATTGCGTTGTGGATAATGTGAGAGATAAGAGGTCGGAATCGGAACCTCCAACAATTCCTCTGCTCCTTTCCTTCTCTTCAATGGCGTCTCTCCAACAACTCCTCAAACCTTCCTGGACTTCTCTCTTCGCCCCTTCCCTCTCTCAGCCTCAGCCTCAGCCTCAGCCTCGAACTTTTACTACTTCCACTCCTAGAGCCTCTCTCCAGAATTCTTCCATCAATCGCCGGCAGTTCGTAGCTGAGACGGCAGCAGCGGTTTCTCTGTCGCTTTCTCCGCTTATTGCTCCCGTGCAACCGGCGAAGTCCGAAGAAGCTCTCTCGGAGTGGGAGAGGCTTTTCCTTCCTATAGATCCGGGTGTTGTCCTTCTCGACATTGCTTTTGTGCCCGATGATTTGGACCATGGCTTCCTTTTGGGGACCAGGCAAACCATTTTAGAGACAAAAGATGGTGGAAGAACTTGGGCTCCACGTACAATACCCTCGGCTGAAGAAGAAGATTTTAACTACAGATTTAATTCTATTAGCTTCAAAGGAAAGGAGGGATGGATTGTTGGCAAACCTGCAATTCTGTTGTACACTTCGGATGCTGGAGAAAGCTGGGAAAGGATACCTCTCAGTGCTCAGCTTCCTGGAGATATGGTCTACATTAAAGCAACTGGAGAAAAAAGTGCAGAGATGGTTACTGATGAAGGTGCGATATACGTTACATCAAACAAGGGATATAATTGGAAGGCTGCAGTTCAGGAGACTGTTTCTGCCACCCTTAATAGAACAGTTTCAAGTGGGATAAGTGGTGCAAGCTATTATACTGGAACCTTTAACACAGTAAATCGTTCTCCTGATGGGCGTTATGTTGCTGTTTCGAGTCGTGGTAACTTCTATCTCACCTGGGAGCCTGGGCAGCCATTCTGGCAGCCACATAATAGAGCTATTGCTAGGAGGATTCAGAACATGGGGTGGAGAGCTGATGGTGGTCTTTGGCTTCTTGTTCGTGGAGGAGGACTTTTTCTGAGTAAAGGCACAGGGATAAGCGAGGAGTTTGAAGAAGTTCCAGTTCAAAGCCGAGGTTTTGGCATATTAGATGTTGGTTATCGTTCAACGGAAGAGGCTTGGGCAGCTGGGGGAAGTGGAATACTTCTGAAAACTACCAATGGTGGCAGGACATGGTCCCGTGATAAAGCAGCTGACAACATTGCAGCCAACCTATACTCTGTAAAGTTTATAAACGACAAGAAAGGTTTTGTGCTGGGAAATGACGGTGTATTGCTTCAATATCTTGGTTGA

Protein sequence

MASESTSLDSIFLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLDRRVHGSESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGFNSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIRCSAISQYRDQFLPYCAFDCDMESSFDGTLFRLPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSVSCIEMFVWNDGEAEPQKLYSFSVRSASSDIIWHRQMLLRLSKSTTSTQSEVDSFSLDFLSQATTGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSSKTNVLKLGRAFCFLPLPVKTGLNVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLLLEDIIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQVYKNISNALVLYSNVEGGKWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNMLLKFCRAFQQKVVTPCTVRHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLPLIPLANGDFGLFSEASKGISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSKSNIFIFNVHYFLQLFPKFVPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLSLFSDWPILPSKSRYLYRATKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAISSTGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLEKCKRLPIYKVYNGGSAQDFGFSDLENPQKYLPPSDVGEFFLGVEFIVSSFDSEVEILLKYYGIKKMGKASFYRKHVLNQVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVPTSSGTLKCPTVLYDPRYEELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQSAQHVERLMHKDHNKAHSRGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPRNFNSDLEKFWNDLCKISWCPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWSSPPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLTGLIGSDEMDVVKAVLEGCRWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFLELGIREFLKPNDYADILSRMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYNDAPWLLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADSMNLSLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAEDAGASEVVFLLDKTHYGTSSILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHFTDIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQKPFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEVASDALLFLTNVKKISIFIKDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQGEMDREQFLMKLSKSINRDLPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLGDKSYNFIPWACVAALLHSVQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAFCFLPLPVRTGLPVHVNAYFELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRLLEKIVSEIGHSGLFSSFWPTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQAIFPDFSFDKVYELIEALADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLVDLKLPFQSDSLCGLPLLPLAD
GSFTSFHKNGMGERTYIARGDEYGLLKDSVPGQLVDPGIPEVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNARQVNWNPGHQGQPSLEWIRLIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKADGWSENMFSLLLKVGCLFLRRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFRDASESELHELRSFILQSKWYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPTGLCEDFLNDDFVRMESEKERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTILHDVKLLIEEDVSLKSSVSMIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFSDDDILDALVSLGLRRSLDLTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSIKVEGSGYELQNSMLIKSNYVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSEMNTIAWCPICADSPLKVLPWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQHKLGWTDCPRVEVLCAQLTDISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDESVLLKSALNGVSWVWVGDDFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVEGYLDVLQRLHSDVEGSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSSQVLMQANDLVYNDAPWMEDNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDLPCMDYAKISELLMLYGNDYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLNTEEISSLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIFDPRGIALSVAPKSAPGAKVFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPACLKDGLEPGIRKIKEISSKFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLSSAIARNPFSEKKWKKFQLSRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPVAGVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFLVCHSGGRYLFKNQVLEAVAAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSSSALESNVSHSISSSLKAYGNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNLVKAEEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVRDLLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMGADSVNTNPGGRSTNTSEGSSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGRSGDSLSHSNTFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSMELWLGSKDQQELMIPLAARFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRSVFHANWVNHVMNSNMSPWFSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPLVPAFLGRPILCRVRERHLVFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPYTSAFQKFQDTYPWLFPLLNHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANNAGYFPELASLSDSNSNELLNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDNDQCMISSNSFLKPYNDCCLSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSEQEDILIYLFTNWQDLQADAHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFSGERKKFPGERFAADGWLRILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEF
DLINGQNEVPMEVWTLAGSVVEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGKRVLTSYGDGIVSKDWPLAWSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTLAHWPISMGVMSINEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSPFAFELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQRLNPNELRSVMEILHYICDEAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVHSDLPERICRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLLSRSFQNAVWNVVNSMVNYIHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDITRPAKDSIIPEWKDGIHHRALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPLPVGSLLFCPEGTEIAIINILKLCSEKENEQFTGISSLLGKEILPQDALQLQLHPLRPFYAAEVVAWRSQSGEKLKYGRVPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISIDGSHSSTNLQDSGHMIIDSGASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLSTAGINVDIERQSLLQKTVVLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLCRKCSAAVSKSRRFKITREAVENCVVDNVRDKRSESEPPTIPLLLSFSSMASLQQLLKPSWTSLFAPSLSQPQPQPQPRTFTTSTPRASLQNSSINRRQFVAETAAAVSLSLSPLIAPVQPAKSEEALSEWERLFLPIDPGVVLLDIAFVPDDLDHGFLLGTRQTILETKDGGRTWAPRTIPSAEEEDFNYRFNSISFKGKEGWIVGKPAILLYTSDAGESWERIPLSAQLPGDMVYIKATGEKSAEMVTDEGAIYVTSNKGYNWKAAVQETVSATLNRTVSSGISGASYYTGTFNTVNRSPDGRYVAVSSRGNFYLTWEPGQPFWQPHNRAIARRIQNMGWRADGGLWLLVRGGGLFLSKGTGISEEFEEVPVQSRGFGILDVGYRSTEEAWAAGGSGILLKTTNGGRTWSRDKAADNIAANLYSVKFINDKKGFVLGNDGVLLQYLG
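
The protein sequence above should correspond to a translation of the coding sequence (CDS). The following is a minimal sketch of how that relationship could be checked, assuming the standard genetic code (NCBI translation table 1); the page itself does not state which table was used. The function name `translate` and the variable `cds_start` are illustrative only and are not part of any database tool. The example translates only the first 60 nucleotides of the CDS.

```python
# Standard genetic code (NCBI translation table 1, assumed here),
# with bases ordered T, C, A, G at each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 0 into one-letter amino acids.

    Whitespace and line breaks are ignored, so a sequence copied from a
    page that wraps it across several lines can be passed in directly.
    Codons containing anything other than A, C, G, T become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    # Walk the sequence codon by codon, ignoring a trailing partial codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*" and to_stop:
            break  # stop codon reached: end of the polypeptide
        protein.append(amino_acid)
    return "".join(protein)


# First 60 nucleotides of the CDS listed on this page.
cds_start = "ATGGCGTCTGAATCGACGTCGCTAGACTCGATTTTTCTGGAGGATTTTGGCCAGAAGGTT"
print(translate(cds_start))
```

The printed peptide can be compared with the first 20 residues of the protein sequence. Passing the complete CDS and comparing the result with the complete protein sequence would work the same way.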
Homology
BLAST of Moc09g05800 vs. NCBI nr
Match: XP_022155038.1 (sacsin isoform X1 [Momordica charantia])

HSP 1 Score: 9549.9 bits (24780), Expect = 0.0e+00
Identity = 4745/4745 (100.00%), Positives = 4745/4745 (100.00%), Query Frame = 0

Query: 1    MASESTSLDSIFLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD 60
            MASESTSLDSIFLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD
Sbjct: 1    MASESTSLDSIFLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD 60

Query: 61   RRVHGSESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF 120
            RRVHGSESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF
Sbjct: 61   RRVHGSESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF 120

Query: 121  NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIRCSAISQYRDQFLPYCA 180
            NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIRCSAISQYRDQFLPYCA
Sbjct: 121  NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIRCSAISQYRDQFLPYCA 180

Query: 181  FDCDMESSFDGTLFRLPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV 240
            FDCDMESSFDGTLFRLPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV
Sbjct: 181  FDCDMESSFDGTLFRLPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV 240

Query: 241  SCIEMFVWNDGEAEPQKLYSFSVRSASSDIIWHRQMLLRLSKSTTSTQSEVDSFSLDFLS 300
            SCIEMFVWNDGEAEPQKLYSFSVRSASSDIIWHRQMLLRLSKSTTSTQSEVDSFSLDFLS
Sbjct: 241  SCIEMFVWNDGEAEPQKLYSFSVRSASSDIIWHRQMLLRLSKSTTSTQSEVDSFSLDFLS 300

Query: 301  QATTGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSSKT 360
            QATTGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSSKT
Sbjct: 301  QATTGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSSKT 360

Query: 361  NVLKLGRAFCFLPLPVKTGLNVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLLLED 420
            NVLKLGRAFCFLPLPVKTGLNVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLLLED
Sbjct: 361  NVLKLGRAFCFLPLPVKTGLNVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLLLED 420

Query: 421  IIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQVYKNISNALVLYSNVEGG 480
            IIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQVYKNISNALVLYSNVEGG
Sbjct: 421  IIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQVYKNISNALVLYSNVEGG 480

Query: 481  KWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNMLLKFCRAFQQKVVTPCTV 540
            KWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNMLLKFCRAFQQKVVTPCTV
Sbjct: 481  KWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNMLLKFCRAFQQKVVTPCTV 540

Query: 541  RHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLPLIPLANGDFGLFSEASKG 600
            RHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLPLIPLANGDFGLFSEASKG
Sbjct: 541  RHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLPLIPLANGDFGLFSEASKG 600

Query: 601  ISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSKSNIFIFNVHYFLQLFPKF 660
            ISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSKSNIFIFNVHYFLQLFPKF
Sbjct: 601  ISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSKSNIFIFNVHYFLQLFPKF 660

Query: 661  VPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLSLFSDWPILPSKSRYLYRA 720
            VPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLSLFSDWPILPSKSRYLYRA
Sbjct: 661  VPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLSLFSDWPILPSKSRYLYRA 720

Query: 721  TKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAIS 780
            TKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAIS
Sbjct: 721  TKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAIS 780

Query: 781  STGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLEKCKRLPIYKVYNGGSAQD 840
            STGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLEKCKRLPIYKVYNGGSAQD
Sbjct: 781  STGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLEKCKRLPIYKVYNGGSAQD 840

Query: 841  FGFSDLENPQKYLPPSDVGEFFLGVEFIVSSFDSEVEILLKYYGIKKMGKASFYRKHVLN 900
            FGFSDLENPQKYLPPSDVGEFFLGVEFIVSSFDSEVEILLKYYGIKKMGKASFYRKHVLN
Sbjct: 841  FGFSDLENPQKYLPPSDVGEFFLGVEFIVSSFDSEVEILLKYYGIKKMGKASFYRKHVLN 900

Query: 901  QVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVPTSSGTLKCPTVLYDPRYE 960
            QVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVPTSSGTLKCPTVLYDPRYE
Sbjct: 901  QVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVPTSSGTLKCPTVLYDPRYE 960

Query: 961  ELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQSAQHVERLMHKDHNKAHS 1020
            ELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQSAQHVERLMHKDHNKAHS
Sbjct: 961  ELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQSAQHVERLMHKDHNKAHS 1020

Query: 1021 RGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPRNFNSDLEKFWNDLCKISW 1080
            RGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPRNFNSDLEKFWNDLCKISW
Sbjct: 1021 RGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPRNFNSDLEKFWNDLCKISW 1080

Query: 1081 CPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWSS 1140
            CPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWSS
Sbjct: 1081 CPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWSS 1140

Query: 1141 PPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLTGLIGSDEMDVVKAVLEGC 1200
            PPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLTGLIGSDEMDVVKAVLEGC
Sbjct: 1141 PPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLTGLIGSDEMDVVKAVLEGC 1200

Query: 1201 RWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFLELGIREFLKPNDYADILS 1260
            RWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFLELGIREFLKPNDYADILS
Sbjct: 1201 RWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFLELGIREFLKPNDYADILS 1260

Query: 1261 RMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYNDAPW 1320
            RMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYNDAPW
Sbjct: 1261 RMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYNDAPW 1320

Query: 1321 LLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADSMNL 1380
            LLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADSMNL
Sbjct: 1321 LLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADSMNL 1380

Query: 1381 SLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAEDAGASEVVFLLDKTHYGTS 1440
            SLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAEDAGASEVVFLLDKTHYGTS
Sbjct: 1381 SLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAEDAGASEVVFLLDKTHYGTS 1440

Query: 1441 SILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHFT 1500
            SILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHFT
Sbjct: 1441 SILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHFT 1500

Query: 1501 DIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQK 1560
            DIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQK
Sbjct: 1501 DIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQK 1560

Query: 1561 PFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEVASDALLFLTNVKKISIFI 1620
            PFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEVASDALLFLTNVKKISIFI
Sbjct: 1561 PFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEVASDALLFLTNVKKISIFI 1620

Query: 1621 KDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQGEMDREQFLMKLSKSINRD 1680
            KDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQGEMDREQFLMKLSKSINRD
Sbjct: 1621 KDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQGEMDREQFLMKLSKSINRD 1680

Query: 1681 LPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLGDKSYNFIPWACVAALLHS 1740
            LPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLGDKSYNFIPWACVAALLHS
Sbjct: 1681 LPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLGDKSYNFIPWACVAALLHS 1740

Query: 1741 VQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAFCFLPLPVRTGLPVHVNAY 1800
            VQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAFCFLPLPVRTGLPVHVNAY
Sbjct: 1741 VQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAFCFLPLPVRTGLPVHVNAY 1800

Query: 1801 FELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRLLEKIVSEIGHSGLFSSFW 1860
            FELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRLLEKIVSEIGHSGLFSSFW
Sbjct: 1801 FELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRLLEKIVSEIGHSGLFSSFW 1860

Query: 1861 PTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQAIFPDFSFDKVYELIEAL 1920
            PTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQAIFPDFSFDKVYELIEAL
Sbjct: 1861 PTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQAIFPDFSFDKVYELIEAL 1920

Query: 1921 ADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLVD 1980
            ADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLVD
Sbjct: 1921 ADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLVD 1980

Query: 1981 LKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDEYGLLKDSVPGQLVDPGIP 2040
            LKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDEYGLLKDSVPGQLVDPGIP
Sbjct: 1981 LKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDEYGLLKDSVPGQLVDPGIP 2040

Query: 2041 EVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNARQVNWNPGHQGQPSLEWIR 2100
            EVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNARQVNWNPGHQGQPSLEWIR
Sbjct: 2041 EVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNARQVNWNPGHQGQPSLEWIR 2100

Query: 2101 LIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKADGWSENMFSLLLKVGCLFL 2160
            LIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKADGWSENMFSLLLKVGCLFL
Sbjct: 2101 LIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKADGWSENMFSLLLKVGCLFL 2160

Query: 2161 RRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFRDASESELHELRSFILQSK 2220
            RRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFRDASESELHELRSFILQSK
Sbjct: 2161 RRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFRDASESELHELRSFILQSK 2220

Query: 2221 WYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPTGLCEDFLNDDFVRMESEK 2280
            WYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPTGLCEDFLNDDFVRMESEK
Sbjct: 2221 WYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPTGLCEDFLNDDFVRMESEK 2280

Query: 2281 ERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTILHDVKLLIEEDVSLKSSVS 2340
            ERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTILHDVKLLIEEDVSLKSSVS
Sbjct: 2281 ERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTILHDVKLLIEEDVSLKSSVS 2340

Query: 2341 MIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFSDDDILDALVSLGLRRSLD 2400
            MIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFSDDDILDALVSLGLRRSLD
Sbjct: 2341 MIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFSDDDILDALVSLGLRRSLD 2400

Query: 2401 LTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSIKVEGSGYELQNSMLIKSN 2460
            LTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSIKVEGSGYELQNSMLIKSN
Sbjct: 2401 LTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSIKVEGSGYELQNSMLIKSN 2460

Query: 2461 YVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSEMNTIAWCPICADSPLKVL 2520
            YVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSEMNTIAWCPICADSPLKVL
Sbjct: 2461 YVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSEMNTIAWCPICADSPLKVL 2520

Query: 2521 PWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQHKLGWTDCPRVEVLCAQLT 2580
            PWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQHKLGWTDCPRVEVLCAQLT
Sbjct: 2521 PWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQHKLGWTDCPRVEVLCAQLT 2580

Query: 2581 DISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDESVLLKSALNGVSWVWVGD 2640
            DISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDESVLLKSALNGVSWVWVGD
Sbjct: 2581 DISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDESVLLKSALNGVSWVWVGD 2640

Query: 2641 DFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVEGYLDVLQRLHSDVE 2700
            DFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVEGYLDVLQRLHSDVE
Sbjct: 2641 DFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVEGYLDVLQRLHSDVE 2700

Query: 2701 GSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSSQVLMQANDLVYNDAPWME 2760
            GSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSSQVLMQANDLVYNDAPWME
Sbjct: 2701 GSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSSQVLMQANDLVYNDAPWME 2760

Query: 2761 DNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDLPCMDYAKISELLMLYGND 2820
            DNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDLPCMDYAKISELLMLYGND
Sbjct: 2761 DNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDLPCMDYAKISELLMLYGND 2820

Query: 2821 YLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLNTEEIS 2880
            YLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLNTEEIS
Sbjct: 2821 YLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLNTEEIS 2880

Query: 2881 SLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIFDPRGIALSVAPKSAPGAK 2940
            SLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIFDPRGIALSVAPKSAPGAK
Sbjct: 2881 SLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIFDPRGIALSVAPKSAPGAK 2940

Query: 2941 VFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPACLKDGLEPGIRKIKEISS 3000
            VFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPACLKDGLEPGIRKIKEISS
Sbjct: 2941 VFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPACLKDGLEPGIRKIKEISS 3000

Query: 3001 KFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLSSAIARNPFSEKKWKKFQL 3060
            KFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLSSAIARNPFSEKKWKKFQL
Sbjct: 3001 KFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLSSAIARNPFSEKKWKKFQL 3060

Query: 3061 SRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPVA 3120
            SRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPVA
Sbjct: 3061 SRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPVA 3120

Query: 3121 GVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFLVCHSGGRYLFKNQVLEAV 3180
            GVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFLVCHSGGRYLFKNQVLEAV
Sbjct: 3121 GVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFLVCHSGGRYLFKNQVLEAV 3180

Query: 3181 AAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSSSALESNVSHSISSSLKAY 3240
            AAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSSSALESNVSHSISSSLKAY
Sbjct: 3181 AAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSSSALESNVSHSISSSLKAY 3240

Query: 3241 GNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNLVKA 3300
            GNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNLVKA
Sbjct: 3241 GNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNLVKA 3300

Query: 3301 EEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVRD 3360
            EEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVRD
Sbjct: 3301 EEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVRD 3360

Query: 3361 LLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMGADSVNTNPGGRSTNTSEG 3420
            LLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMGADSVNTNPGGRSTNTSEG
Sbjct: 3361 LLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMGADSVNTNPGGRSTNTSEG 3420

Query: 3421 SSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGRSGDSLSHSN 3480
            SSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGRSGDSLSHSN
Sbjct: 3421 SSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGRSGDSLSHSN 3480

Query: 3481 TFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSMELWLGSKDQQELMIPLAA 3540
            TFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSMELWLGSKDQQELMIPLAA
Sbjct: 3481 TFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSMELWLGSKDQQELMIPLAA 3540

Query: 3541 RFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRSVFHANWVNHVMNSNMSPW 3600
            RFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRSVFHANWVNHVMNSNMSPW
Sbjct: 3541 RFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRSVFHANWVNHVMNSNMSPW 3600

Query: 3601 FSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPLVPAFLGRPILCRVRERHL 3660
            FSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPLVPAFLGRPILCRVRERHL
Sbjct: 3601 FSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPLVPAFLGRPILCRVRERHL 3660

Query: 3661 VFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPYTSAFQKFQDTYPWLFPLL 3720
            VFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPYTSAFQKFQDTYPWLFPLL
Sbjct: 3661 VFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPYTSAFQKFQDTYPWLFPLL 3720

Query: 3721 NHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANNAGYFPELASLSDSNSNEL 3780
            NHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANNAGYFPELASLSDSNSNEL
Sbjct: 3721 NHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANNAGYFPELASLSDSNSNEL 3780

Query: 3781 LNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDNDQCMISSNSFLKPYNDCC 3840
            LNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDNDQCMISSNSFLKPYNDCC
Sbjct: 3781 LNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDNDQCMISSNSFLKPYNDCC 3840

Query: 3841 LSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSEQEDILIYLFTNWQDLQAD 3900
            LSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSEQEDILIYLFTNWQDLQAD
Sbjct: 3841 LSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSEQEDILIYLFTNWQDLQAD 3900

Query: 3901 AHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFSGERKKFPGERFAADGWLR 3960
            AHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFSGERKKFPGERFAADGWLR
Sbjct: 3901 AHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFSGERKKFPGERFAADGWLR 3960

Query: 3961 ILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDLINGQNEVPMEVWTLAGSV 4020
            ILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDLINGQNEVPMEVWTLAGSV
Sbjct: 3961 ILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDLINGQNEVPMEVWTLAGSV 4020

Query: 4021 VEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGKRVLTSYGDGIVSKDWPLA 4080
            VEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGKRVLTSYGDGIVSKDWPLA
Sbjct: 4021 VEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGKRVLTSYGDGIVSKDWPLA 4080

Query: 4081 WSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTLAHWPISMGVMS 4140
            WSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTLAHWPISMGVMS
Sbjct: 4081 WSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTLAHWPISMGVMS 4140

Query: 4141 INEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSPFAF 4200
            INEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSPFAF
Sbjct: 4141 INEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSPFAF 4200

Query: 4201 ELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQRLNPNELRSVMEILHYICD 4260
            ELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQRLNPNELRSVMEILHYICD
Sbjct: 4201 ELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQRLNPNELRSVMEILHYICD 4260

Query: 4261 EAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVHSDLPERI 4320
            EAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVHSDLPERI
Sbjct: 4261 EAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVHSDLPERI 4320

Query: 4321 CRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLLSRSFQNAVWNVVNSMVNY 4380
            CRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLLSRSFQNAVWNVVNSMVNY
Sbjct: 4321 CRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLLSRSFQNAVWNVVNSMVNY 4380

Query: 4381 IHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDITRPAKDSIIPEWKDGIHH 4440
            IHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDITRPAKDSIIPEWKDGIHH
Sbjct: 4381 IHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDITRPAKDSIIPEWKDGIHH 4440

Query: 4441 RALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPLPVGSLLFCPEGTEIAIIN 4500
            RALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPLPVGSLLFCPEGTEIAIIN
Sbjct: 4441 RALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPLPVGSLLFCPEGTEIAIIN 4500

Query: 4501 ILKLCSEKENEQFTGISSLLGKEILPQDALQLQLHPLRPFYAAEVVAWRSQSGEKLKYGR 4560
            ILKLCSEKENEQFTGISSLLGKEILPQDALQLQLHPLRPFYAAEVVAWRSQSGEKLKYGR
Sbjct: 4501 ILKLCSEKENEQFTGISSLLGKEILPQDALQLQLHPLRPFYAAEVVAWRSQSGEKLKYGR 4560

Query: 4561 VPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISIDGSHSSTNLQDSGHMIIDS 4620
            VPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISIDGSHSSTNLQDSGHMIIDS
Sbjct: 4561 VPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISIDGSHSSTNLQDSGHMIIDS 4620

Query: 4621 GASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLSTAGINVDIERQSLLQKTV 4680
            GASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLSTAGINVDIERQSLLQKTV
Sbjct: 4621 GASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLSTAGINVDIERQSLLQKTV 4680

Query: 4681 VLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLCRKCS 4740
            VLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLCRKCS
Sbjct: 4681 VLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLCRKCS 4740

Query: 4741 AAVSK 4745
            AAVSK
Sbjct: 4741 AAVSK 4745
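
The identity and positives percentages in an HSP summary line can be recomputed from the raw counts. The following is a minimal Python sketch, assuming the summary line keeps the "Identity = a/b (p%), ... = c/d (q%)" layout shown on this page. The helper name `parse_hsp_stats` and the returned dictionary keys are hypothetical and are not part of BLAST or of this database.

```python
import re

# Pattern for the HSP summary line as it appears on this page.
# The optional "i" tolerates both spellings of the positives label.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Hypothetical helper: extract counts and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identities, ident_len, positives, pos_len = (int(g) for g in match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "align_len": ident_len,
        # Percentages are derived from the counts, not copied from the text.
        "pct_identity": round(100.0 * identities / ident_len, 2),
        "pct_positives": round(100.0 * positives / pos_len, 2),
    }


if __name__ == "__main__":
    summary = ("Identity = 4745/4745 (100.00%), "
               "Positives = 4745/4745 (100.00%), Query Frame = 0")
    print(parse_hsp_stats(summary))
```

Recomputing the percentages from the counts, rather than trusting the printed values, gives a quick consistency check when many hits are collected from pages like this one.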

BLAST of Moc09g05800 vs. NCBI nr
Match: XP_022155046.1 (uncharacterized protein LOC111022177 isoform X2 [Momordica charantia])

HSP 1 Score: 8754.8 bits (22716), Expect = 0.0e+00
Identity = 4343/4343 (100.00%), Positives = 4343/4343 (100.00%), Query Frame = 0

Query: 403  MDRSGKIRSIWNRLLLEDIIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQ 462
            MDRSGKIRSIWNRLLLEDIIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQ
Sbjct: 1    MDRSGKIRSIWNRLLLEDIIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQ 60

Query: 463  VYKNISNALVLYSNVEGGKWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNM 522
            VYKNISNALVLYSNVEGGKWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNM
Sbjct: 61   VYKNISNALVLYSNVEGGKWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNM 120

Query: 523  LLKFCRAFQQKVVTPCTVRHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLP 582
            LLKFCRAFQQKVVTPCTVRHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLP
Sbjct: 121  LLKFCRAFQQKVVTPCTVRHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLP 180

Query: 583  LIPLANGDFGLFSEASKGISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSK 642
            LIPLANGDFGLFSEASKGISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSK
Sbjct: 181  LIPLANGDFGLFSEASKGISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSK 240

Query: 643  SNIFIFNVHYFLQLFPKFVPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLS 702
            SNIFIFNVHYFLQLFPKFVPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLS
Sbjct: 241  SNIFIFNVHYFLQLFPKFVPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLS 300

Query: 703  LFSDWPILPSKSRYLYRATKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHY 762
            LFSDWPILPSKSRYLYRATKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHY
Sbjct: 301  LFSDWPILPSKSRYLYRATKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHY 360

Query: 763  VNDGNCTGVLDSIYDAISSTGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLE 822
            VNDGNCTGVLDSIYDAISSTGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLE
Sbjct: 361  VNDGNCTGVLDSIYDAISSTGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLE 420

Query: 823  KCKRLPIYKVYNGGSAQDFGFSDLENPQKYLPPSDVGEFFLGVEFIVSSFDSEVEILLKY 882
            KCKRLPIYKVYNGGSAQDFGFSDLENPQKYLPPSDVGEFFLGVEFIVSSFDSEVEILLKY
Sbjct: 421  KCKRLPIYKVYNGGSAQDFGFSDLENPQKYLPPSDVGEFFLGVEFIVSSFDSEVEILLKY 480

Query: 883  YGIKKMGKASFYRKHVLNQVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVP 942
            YGIKKMGKASFYRKHVLNQVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVP
Sbjct: 481  YGIKKMGKASFYRKHVLNQVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVP 540

Query: 943  TSSGTLKCPTVLYDPRYEELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQ 1002
            TSSGTLKCPTVLYDPRYEELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQ
Sbjct: 541  TSSGTLKCPTVLYDPRYEELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQ 600

Query: 1003 SAQHVERLMHKDHNKAHSRGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPR 1062
            SAQHVERLMHKDHNKAHSRGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPR
Sbjct: 601  SAQHVERLMHKDHNKAHSRGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPR 660

Query: 1063 NFNSDLEKFWNDLCKISWCPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRI 1122
            NFNSDLEKFWNDLCKISWCPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRI
Sbjct: 661  NFNSDLEKFWNDLCKISWCPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRI 720

Query: 1123 LDGECSSSALAHSLGWSSPPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLT 1182
            LDGECSSSALAHSLGWSSPPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLT
Sbjct: 721  LDGECSSSALAHSLGWSSPPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLT 780

Query: 1183 GLIGSDEMDVVKAVLEGCRWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFL 1242
            GLIGSDEMDVVKAVLEGCRWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFL
Sbjct: 781  GLIGSDEMDVVKAVLEGCRWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFL 840

Query: 1243 ELGIREFLKPNDYADILSRMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPD 1302
            ELGIREFLKPNDYADILSRMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPD
Sbjct: 841  ELGIREFLKPNDYADILSRMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPD 900

Query: 1303 ISGRLLPASNLVYNDAPWLLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLG 1362
            ISGRLLPASNLVYNDAPWLLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLG
Sbjct: 901  ISGRLLPASNLVYNDAPWLLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLG 960

Query: 1363 VCSLRRILLAESADSMNLSLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAED 1422
            VCSLRRILLAESADSMNLSLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAED
Sbjct: 961  VCSLRRILLAESADSMNLSLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAED 1020

Query: 1423 AGASEVVFLLDKTHYGTSSILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQK 1482
            AGASEVVFLLDKTHYGTSSILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQK
Sbjct: 1021 AGASEVVFLLDKTHYGTSSILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQK 1080

Query: 1483 PLSIGRFGLGFNCVYHFTDIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILE 1542
            PLSIGRFGLGFNCVYHFTDIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILE
Sbjct: 1081 PLSIGRFGLGFNCVYHFTDIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILE 1140

Query: 1543 QFPDQFSPYLHFGCDMQKPFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEV 1602
            QFPDQFSPYLHFGCDMQKPFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEV
Sbjct: 1141 QFPDQFSPYLHFGCDMQKPFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEV 1200

Query: 1603 ASDALLFLTNVKKISIFIKDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQG 1662
            ASDALLFLTNVKKISIFIKDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQG
Sbjct: 1201 ASDALLFLTNVKKISIFIKDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQG 1260

Query: 1663 EMDREQFLMKLSKSINRDLPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLG 1722
            EMDREQFLMKLSKSINRDLPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLG
Sbjct: 1261 EMDREQFLMKLSKSINRDLPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLG 1320

Query: 1723 DKSYNFIPWACVAALLHSVQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAF 1782
            DKSYNFIPWACVAALLHSVQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAF
Sbjct: 1321 DKSYNFIPWACVAALLHSVQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAF 1380

Query: 1783 CFLPLPVRTGLPVHVNAYFELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRL 1842
            CFLPLPVRTGLPVHVNAYFELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRL
Sbjct: 1381 CFLPLPVRTGLPVHVNAYFELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRL 1440

Query: 1843 LEKIVSEIGHSGLFSSFWPTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQ 1902
            LEKIVSEIGHSGLFSSFWPTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQ
Sbjct: 1441 LEKIVSEIGHSGLFSSFWPTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQ 1500

Query: 1903 AIFPDFSFDKVYELIEALADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKR 1962
            AIFPDFSFDKVYELIEALADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKR
Sbjct: 1501 AIFPDFSFDKVYELIEALADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKR 1560

Query: 1963 AFKDRKATILTLEYCLVDLKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDE 2022
            AFKDRKATILTLEYCLVDLKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDE
Sbjct: 1561 AFKDRKATILTLEYCLVDLKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDE 1620

Query: 2023 YGLLKDSVPGQLVDPGIPEVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNAR 2082
            YGLLKDSVPGQLVDPGIPEVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNAR
Sbjct: 1621 YGLLKDSVPGQLVDPGIPEVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNAR 1680

Query: 2083 QVNWNPGHQGQPSLEWIRLIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKAD 2142
            QVNWNPGHQGQPSLEWIRLIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKAD
Sbjct: 1681 QVNWNPGHQGQPSLEWIRLIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKAD 1740

Query: 2143 GWSENMFSLLLKVGCLFLRRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFR 2202
            GWSENMFSLLLKVGCLFLRRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFR
Sbjct: 1741 GWSENMFSLLLKVGCLFLRRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFR 1800

Query: 2203 DASESELHELRSFILQSKWYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPT 2262
            DASESELHELRSFILQSKWYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPT
Sbjct: 1801 DASESELHELRSFILQSKWYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPT 1860

Query: 2263 GLCEDFLNDDFVRMESEKERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTIL 2322
            GLCEDFLNDDFVRMESEKERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTIL
Sbjct: 1861 GLCEDFLNDDFVRMESEKERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTIL 1920

Query: 2323 HDVKLLIEEDVSLKSSVSMIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFS 2382
            HDVKLLIEEDVSLKSSVSMIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFS
Sbjct: 1921 HDVKLLIEEDVSLKSSVSMIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFS 1980

Query: 2383 DDDILDALVSLGLRRSLDLTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSI 2442
            DDDILDALVSLGLRRSLDLTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSI
Sbjct: 1981 DDDILDALVSLGLRRSLDLTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSI 2040

Query: 2443 KVEGSGYELQNSMLIKSNYVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSE 2502
            KVEGSGYELQNSMLIKSNYVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSE
Sbjct: 2041 KVEGSGYELQNSMLIKSNYVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSE 2100

Query: 2503 MNTIAWCPICADSPLKVLPWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQH 2562
            MNTIAWCPICADSPLKVLPWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQH
Sbjct: 2101 MNTIAWCPICADSPLKVLPWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQH 2160

Query: 2563 KLGWTDCPRVEVLCAQLTDISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDE 2622
            KLGWTDCPRVEVLCAQLTDISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDE
Sbjct: 2161 KLGWTDCPRVEVLCAQLTDISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDE 2220

Query: 2623 SVLLKSALNGVSWVWVGDDFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLS 2682
            SVLLKSALNGVSWVWVGDDFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLS
Sbjct: 2221 SVLLKSALNGVSWVWVGDDFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLS 2280

Query: 2683 FNVEGYLDVLQRLHSDVEGSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSS 2742
            FNVEGYLDVLQRLHSDVEGSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSS
Sbjct: 2281 FNVEGYLDVLQRLHSDVEGSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSS 2340

Query: 2743 QVLMQANDLVYNDAPWMEDNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDL 2802
            QVLMQANDLVYNDAPWMEDNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDL
Sbjct: 2341 QVLMQANDLVYNDAPWMEDNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDL 2400

Query: 2803 PCMDYAKISELLMLYGNDYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQG 2862
            PCMDYAKISELLMLYGNDYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQG
Sbjct: 2401 PCMDYAKISELLMLYGNDYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQG 2460

Query: 2863 PALVAIFEGSSLNTEEISSLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIF 2922
            PALVAIFEGSSLNTEEISSLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIF
Sbjct: 2461 PALVAIFEGSSLNTEEISSLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIF 2520

Query: 2923 DPRGIALSVAPKSAPGAKVFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPA 2982
            DPRGIALSVAPKSAPGAKVFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPA
Sbjct: 2521 DPRGIALSVAPKSAPGAKVFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPA 2580

Query: 2983 CLKDGLEPGIRKIKEISSKFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLS 3042
            CLKDGLEPGIRKIKEISSKFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLS
Sbjct: 2581 CLKDGLEPGIRKIKEISSKFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLS 2640

Query: 3043 SAIARNPFSEKKWKKFQLSRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQT 3102
            SAIARNPFSEKKWKKFQLSRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQT
Sbjct: 2641 SAIARNPFSEKKWKKFQLSRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQT 2700

Query: 3103 RNMALDRRYLAYNLTPVAGVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFL 3162
            RNMALDRRYLAYNLTPVAGVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFL
Sbjct: 2701 RNMALDRRYLAYNLTPVAGVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFL 2760

Query: 3163 VCHSGGRYLFKNQVLEAVAAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSS 3222
            VCHSGGRYLFKNQVLEAVAAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSS
Sbjct: 2761 VCHSGGRYLFKNQVLEAVAAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSS 2820

Query: 3223 SALESNVSHSISSSLKAYGNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYT 3282
            SALESNVSHSISSSLKAYGNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYT
Sbjct: 2821 SALESNVSHSISSSLKAYGNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYT 2880

Query: 3283 RAIDLPVWQLYSGNLVKAEEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKE 3342
            RAIDLPVWQLYSGNLVKAEEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKE
Sbjct: 2881 RAIDLPVWQLYSGNLVKAEEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKE 2940

Query: 3343 IQAVGITVRQIRPKMVRDLLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMG 3402
            IQAVGITVRQIRPKMVRDLLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMG
Sbjct: 2941 IQAVGITVRQIRPKMVRDLLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMG 3000

Query: 3403 ADSVNTNPGGRSTNTSEGSSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFG 3462
            ADSVNTNPGGRSTNTSEGSSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFG
Sbjct: 3001 ADSVNTNPGGRSTNTSEGSSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFG 3060

Query: 3463 RGVVEDIGRSGDSLSHSNTFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSM 3522
            RGVVEDIGRSGDSLSHSNTFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSM
Sbjct: 3061 RGVVEDIGRSGDSLSHSNTFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSM 3120

Query: 3523 ELWLGSKDQQELMIPLAARFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRS 3582
            ELWLGSKDQQELMIPLAARFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRS
Sbjct: 3121 ELWLGSKDQQELMIPLAARFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRS 3180

Query: 3583 VFHANWVNHVMNSNMSPWFSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPL 3642
            VFHANWVNHVMNSNMSPWFSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPL
Sbjct: 3181 VFHANWVNHVMNSNMSPWFSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPL 3240

Query: 3643 VPAFLGRPILCRVRERHLVFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPY 3702
            VPAFLGRPILCRVRERHLVFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPY
Sbjct: 3241 VPAFLGRPILCRVRERHLVFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPY 3300

Query: 3703 TSAFQKFQDTYPWLFPLLNHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANN 3762
            TSAFQKFQDTYPWLFPLLNHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANN
Sbjct: 3301 TSAFQKFQDTYPWLFPLLNHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANN 3360

Query: 3763 AGYFPELASLSDSNSNELLNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDN 3822
            AGYFPELASLSDSNSNELLNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDN
Sbjct: 3361 AGYFPELASLSDSNSNELLNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDN 3420

Query: 3823 DQCMISSNSFLKPYNDCCLSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSE 3882
            DQCMISSNSFLKPYNDCCLSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSE
Sbjct: 3421 DQCMISSNSFLKPYNDCCLSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSE 3480

Query: 3883 QEDILIYLFTNWQDLQADAHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFS 3942
            QEDILIYLFTNWQDLQADAHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFS
Sbjct: 3481 QEDILIYLFTNWQDLQADAHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFS 3540

Query: 3943 GERKKFPGERFAADGWLRILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDL 4002
            GERKKFPGERFAADGWLRILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDL
Sbjct: 3541 GERKKFPGERFAADGWLRILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDL 3600

Query: 4003 INGQNEVPMEVWTLAGSVVEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGK 4062
            INGQNEVPMEVWTLAGSVVEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGK
Sbjct: 3601 INGQNEVPMEVWTLAGSVVEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGK 3660

Query: 4063 RVLTSYGDGIVSKDWPLAWSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGR 4122
            RVLTSYGDGIVSKDWPLAWSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGR
Sbjct: 3661 RVLTSYGDGIVSKDWPLAWSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGR 3720

Query: 4123 NGGEDTLAHWPISMGVMSINEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLV 4182
            NGGEDTLAHWPISMGVMSINEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLV
Sbjct: 3721 NGGEDTLAHWPISMGVMSINEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLV 3780

Query: 4183 KANALFARLTINLSPFAFELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQR 4242
            KANALFARLTINLSPFAFELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQR
Sbjct: 3781 KANALFARLTINLSPFAFELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQR 3840

Query: 4243 LNPNELRSVMEILHYICDEAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIK 4302
            LNPNELRSVMEILHYICDEAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIK
Sbjct: 3841 LNPNELRSVMEILHYICDEAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIK 3900

Query: 4303 CIDTSRLRFVHSDLPERICRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLL 4362
            CIDTSRLRFVHSDLPERICRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLL
Sbjct: 3901 CIDTSRLRFVHSDLPERICRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLL 3960

Query: 4363 SRSFQNAVWNVVNSMVNYIHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDI 4422
            SRSFQNAVWNVVNSMVNYIHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDI
Sbjct: 3961 SRSFQNAVWNVVNSMVNYIHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDI 4020

Query: 4423 TRPAKDSIIPEWKDGIHHRALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPL 4482
            TRPAKDSIIPEWKDGIHHRALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPL
Sbjct: 4021 TRPAKDSIIPEWKDGIHHRALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPL 4080

Query: 4483 PVGSLLFCPEGTEIAIINILKLCSEKENEQFTGISSLLGKEILPQDALQLQLHPLRPFYA 4542
            PVGSLLFCPEGTEIAIINILKLCSEKENEQFTGISSLLGKEILPQDALQLQLHPLRPFYA
Sbjct: 4081 PVGSLLFCPEGTEIAIINILKLCSEKENEQFTGISSLLGKEILPQDALQLQLHPLRPFYA 4140

Query: 4543 AEVVAWRSQSGEKLKYGRVPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISID 4602
            AEVVAWRSQSGEKLKYGRVPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISID
Sbjct: 4141 AEVVAWRSQSGEKLKYGRVPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISID 4200

Query: 4603 GSHSSTNLQDSGHMIIDSGASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLS 4662
            GSHSSTNLQDSGHMIIDSGASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLS
Sbjct: 4201 GSHSSTNLQDSGHMIIDSGASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLS 4260

Query: 4663 TAGINVDIERQSLLQKTVVLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTS 4722
            TAGINVDIERQSLLQKTVVLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTS
Sbjct: 4261 TAGINVDIERQSLLQKTVVLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTS 4320

Query: 4723 EVEITIVPCGHVLCRKCSAAVSK 4746
            EVEITIVPCGHVLCRKCSAAVSK
Sbjct: 4321 EVEITIVPCGHVLCRKCSAAVSK 4343

BLAST of Moc09g05800 vs. NCBI nr
Match: XP_038897839.1 (sacsin isoform X2 [Benincasa hispida])

HSP 1 Score: 8714.0 bits (22610), Expect = 0.0e+00
Identity = 4300/4746 (90.60%), Positives = 4513/4746 (95.09%), Query Frame = 0

Query: 1    MASESTSLDSIFLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD 60
            MASESTSLDSI LEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD
Sbjct: 1    MASESTSLDSILLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD 60

Query: 61   RRVHGSESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF 120
            RRVHG ESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF
Sbjct: 61   RRVHGRESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF 120

Query: 121  NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIRCSAISQYRDQFLPYCA 180
            NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSA+NPGKRIDFIR SAIS+YRDQFLPYCA
Sbjct: 121  NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSASNPGKRIDFIRSSAISKYRDQFLPYCA 180

Query: 181  FDCDMESSFDGTLFRLPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV 240
            FDCDMESSF GTLFR PLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV
Sbjct: 181  FDCDMESSFAGTLFRFPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV 240

Query: 241  SCIEMFVWNDGEAEPQKLYSFSVRSASSDIIWHRQMLLRLSKSTTSTQSEVDSFSLDFLS 300
             CIEMFVWNDGE EPQKLYS+ VRSA+SDIIWHRQMLLRLSKSTTSTQ EVD FSL+F S
Sbjct: 241  LCIEMFVWNDGETEPQKLYSYFVRSANSDIIWHRQMLLRLSKSTTSTQHEVDRFSLEFSS 300

Query: 301  QATTGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSSKT 360
            QA TGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSS  
Sbjct: 301  QAMTGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSSND 360

Query: 361  NVLKLGRAFCFLPLPVKTGLNVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLLLED 420
            +VLKLGRAFCFLPLPVKTGL VQVNGFFEVSSNRRGIWYGADMDR+GKIRSIWNRLLLED
Sbjct: 361  SVLKLGRAFCFLPLPVKTGLTVQVNGFFEVSSNRRGIWYGADMDRTGKIRSIWNRLLLED 420

Query: 421  IIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQVYKNISNALVLYSNVEGG 480
            IIAP+FIELLIGVQV LGPTDTYFSLWPSGSFEEPWNILVEQVYK+ISNA VL+S+VEGG
Sbjct: 421  IIAPAFIELLIGVQVLLGPTDTYFSLWPSGSFEEPWNILVEQVYKSISNAFVLHSDVEGG 480

Query: 481  KWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNMLLKFCRAFQQKVVTPCTV 540
            KWVSP +AFLHDDKF RS ELGE LV LGMPIVHLPENLSNMLLKFC  FQQKVVTPCTV
Sbjct: 481  KWVSPNEAFLHDDKFARSMELGETLVHLGMPIVHLPENLSNMLLKFCCTFQQKVVTPCTV 540

Query: 541  RHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLPLIPLANGDFGLFSEASKG 600
            RHFLRECKHV TLNRPY+LVLLEYCIEDLIDADVC  AF LPL+PLANGDFG FSEASKG
Sbjct: 541  RHFLRECKHVFTLNRPYRLVLLEYCIEDLIDADVCTHAFGLPLLPLANGDFGSFSEASKG 600

Query: 601  ISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSKSNIFIFNVHYFLQLFPKF 660
            ISYFICDELEY LLHQISDRVIDRN PL ISTRLSNIARSSKSN+FI NVHYFLQLFPKF
Sbjct: 601  ISYFICDELEYKLLHQISDRVIDRNTPLTISTRLSNIARSSKSNLFILNVHYFLQLFPKF 660

Query: 661  VPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLSLFSDWPILPSKSRYLYRA 720
            VPADWKY+NEV WDPESCSNHPTSSWF LFW YL D CEKLSLFSDWPILPSKSRYLYRA
Sbjct: 661  VPADWKYRNEVFWDPESCSNHPTSSWFLLFWEYLRDHCEKLSLFSDWPILPSKSRYLYRA 720

Query: 721  TKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAIS 780
            TK+SK+INVQ LS+EMQ IL KLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAIS
Sbjct: 721  TKQSKVINVQKLSHEMQNILGKLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAIS 780

Query: 781  STGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLEKCKRLPIYKVYNGGSAQD 840
            STGGLMLTSLY+LEVEEKDGLRRFLLDPKWYLGGCMND+DL KC+RLPI+KVYNGGS+QD
Sbjct: 781  STGGLMLTSLYNLEVEEKDGLRRFLLDPKWYLGGCMNDNDLGKCRRLPIFKVYNGGSSQD 840

Query: 841  FGFSDLENPQKYLPPSDVGEFFLGVEFIVSSFDSEVEILLKYYGIKKMGKASFYRKHVLN 900
            F FSDLE+P+KYLPPSDVGE FLGVEFI+SS DSE EILLKYYGIKKMGKASFYRK+VLN
Sbjct: 841  FCFSDLEDPRKYLPPSDVGECFLGVEFIISSSDSEEEILLKYYGIKKMGKASFYRKYVLN 900

Query: 901  QVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVPTSSGTLKCPTVLYDPRYE 960
            QV QLQPELRDSTMLS+L NLPQLC EDV FRECLSNL F+PTSSGTLKCP VLYDPRYE
Sbjct: 901  QVGQLQPELRDSTMLSLLLNLPQLCTEDVTFRECLSNLDFIPTSSGTLKCPAVLYDPRYE 960

Query: 961  ELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQSAQHVERLMHKDHNKAHS 1020
            ELCALLDDFDSFPSTPFNESYILDILQGLGLRTCV+PETIVQSA HVER MHKD NKAHS
Sbjct: 961  ELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVSPETIVQSALHVERFMHKDQNKAHS 1020

Query: 1021 RGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPRNFNSDLEKFWNDLCKISW 1080
            RGKVLLSYLEVNAIKWLLNP NEDQGMVNRLFSTAATAF+PRNF SDLEKFWNDL KISW
Sbjct: 1021 RGKVLLSYLEVNAIKWLLNPTNEDQGMVNRLFSTAATAFKPRNFTSDLEKFWNDLRKISW 1080

Query: 1081 CPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWSS 1140
            CPVLLSPPFETLPWPVVSSMVAPPKLVRLP+DLWLVSASMRILDGECSSSALAHSLGWSS
Sbjct: 1081 CPVLLSPPFETLPWPVVSSMVAPPKLVRLPRDLWLVSASMRILDGECSSSALAHSLGWSS 1140

Query: 1141 PPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLTGLIGSDEMDVVKAVLEGC 1200
            PP GSIIAAQLLELGKNNEIVYDQ+LRKELALAMPRIYALLT LIGSD+MDVVKAVLEGC
Sbjct: 1141 PPTGSIIAAQLLELGKNNEIVYDQMLRKELALAMPRIYALLTSLIGSDDMDVVKAVLEGC 1200

Query: 1201 RWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFLELGIREFLKPNDYADILS 1260
            RWIWVGDGFATS+EVVL+GPLHLAPYIRVIPIDLAVFKDLFLELGIREFL PNDYADILS
Sbjct: 1201 RWIWVGDGFATSKEVVLEGPLHLAPYIRVIPIDLAVFKDLFLELGIREFLNPNDYADILS 1260

Query: 1261 RMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYNDAPW 1320
            RMA +KGSSPLN QE+RAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYNDAPW
Sbjct: 1261 RMATRKGSSPLNTQELRAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYNDAPW 1320

Query: 1321 LLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADSMNL 1380
            LLGTD+TDV +DGEST  L+ARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADSMNL
Sbjct: 1321 LLGTDNTDVSYDGESTAFLSARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADSMNL 1380

Query: 1381 SLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAEDAGASEVVFLLDKTHYGTS 1440
            SLSGAAEAFGQHEALTNRLRHILEMYADG GILFELIQNAEDAGASEV+FLLDKTHYGTS
Sbjct: 1381 SLSGAAEAFGQHEALTNRLRHILEMYADGSGILFELIQNAEDAGASEVIFLLDKTHYGTS 1440

Query: 1441 SILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHFT 1500
            S+LSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHFT
Sbjct: 1441 SVLSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHFT 1500

Query: 1501 DIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQK 1560
            DIPTFVSGEN+VMFDPHACNLPGISPSHPGLRIKYAGR+ILEQFPDQFSPYLHFGCDMQK
Sbjct: 1501 DIPTFVSGENVVMFDPHACNLPGISPSHPGLRIKYAGRKILEQFPDQFSPYLHFGCDMQK 1560

Query: 1561 PFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEVASDALLFLTNVKKISIFI 1620
            PFPGTLFRFPLRSPALASRSEIKKEGYAPEDV SLFYSFSEVASDAL+FLTNVK ISIFI
Sbjct: 1561 PFPGTLFRFPLRSPALASRSEIKKEGYAPEDVISLFYSFSEVASDALVFLTNVKTISIFI 1620

Query: 1621 KDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQGEMDREQFLMKLSKSINRD 1680
            KDDI H+MQCLYRVHKNT+SEP+T+SSA+QDI+SFIYGNR+GEMDREQFLMKL+KSINRD
Sbjct: 1621 KDDIGHEMQCLYRVHKNTISEPTTKSSAQQDIMSFIYGNRRGEMDREQFLMKLNKSINRD 1680

Query: 1681 LPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLGDKSYNFIPWACVAALLHS 1740
            LP+ CQKLIITEKSS GDILQH+WI+SGCLGGGLPRNNSG GDKSYNFIPWA VAALLHS
Sbjct: 1681 LPHVCQKLIITEKSSGGDILQHFWISSGCLGGGLPRNNSGGGDKSYNFIPWASVAALLHS 1740

Query: 1741 VQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAFCFLPLPVRTGLPVHVNAY 1800
            V+V+ EMN+DPETENNWL+ASDLVQVSSAS + RKP EGRAFCFLPLP++TGLPVHVNAY
Sbjct: 1741 VKVNVEMNHDPETENNWLIASDLVQVSSASEQDRKPLEGRAFCFLPLPIKTGLPVHVNAY 1800

Query: 1801 FELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRLLEKIVSEIGHSGLFSSFW 1860
            FELSSNRRDIWYGDDMAG G+KRSEWNSYLLEDVVAPAYGRLLEK+ SEIGH GL+SSFW
Sbjct: 1801 FELSSNRRDIWYGDDMAGCGRKRSEWNSYLLEDVVAPAYGRLLEKVASEIGHFGLYSSFW 1860

Query: 1861 PTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQAIFPDFSFDKVYELIEAL 1920
            P  AGLEPWG VVRKLYSFIGDFGLLVLYTNARGGQWIS +QAIFPDFSFDKV+ELIEAL
Sbjct: 1861 PAAAGLEPWGLVVRKLYSFIGDFGLLVLYTNARGGQWISARQAIFPDFSFDKVHELIEAL 1920

Query: 1921 ADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLVD 1980
            +DSGLPVI+ISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLVD
Sbjct: 1921 SDSGLPVISISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLVD 1980

Query: 1981 LKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDEYGLLKDSVPGQLVDPGIP 2040
            LKLP QSDSLCGLPLLPL DGSFTSFHKN +GER YIA+GDEYGLLKDSVP QLVDP IP
Sbjct: 1981 LKLPIQSDSLCGLPLLPLVDGSFTSFHKNRIGERIYIAKGDEYGLLKDSVPSQLVDPDIP 2040

Query: 2041 EVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNARQVNWNPGHQGQPSLEWIR 2100
            EVVHAKLCEVAQTEDLN+CFLSC LLEK+FLRFLP EWQNARQVNWNPGHQGQPSLEWIR
Sbjct: 2041 EVVHAKLCEVAQTEDLNLCFLSCHLLEKIFLRFLPTEWQNARQVNWNPGHQGQPSLEWIR 2100

Query: 2101 LIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKADGWSENMFSLLLKVGCLFL 2160
            L+WCYLKSHC+DLSQFSKWPILPVG+NSLLQLVENSNVL+ADGWSENMFSLLLKVGCLFL
Sbjct: 2101 LMWCYLKSHCNDLSQFSKWPILPVGENSLLQLVENSNVLRADGWSENMFSLLLKVGCLFL 2160

Query: 2161 RRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFRDASESELHELRSFILQSK 2220
            RRDMP+EHPQLEN+VHPSTAIGILNAFLSIAGDI NVE LF DASE ELHE RSFILQSK
Sbjct: 2161 RRDMPVEHPQLENFVHPSTAIGILNAFLSIAGDIGNVERLFHDASEGELHEFRSFILQSK 2220

Query: 2221 WYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPTGLCEDFLNDDFVRMESEK 2280
            W+LE KMEAIHVDIIK IPMFESYKCRKLVSLSKP+RWIKPTGLCEDFLNDDFVR+ESEK
Sbjct: 2221 WFLEEKMEAIHVDIIKRIPMFESYKCRKLVSLSKPVRWIKPTGLCEDFLNDDFVRVESEK 2280

Query: 2281 ERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTILHDVKLLIEEDVSLKSSVS 2340
            ERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSER A+STIL DVKLLIEEDVSLKSSVS
Sbjct: 2281 ERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSEREAISTILLDVKLLIEEDVSLKSSVS 2340

Query: 2341 MIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFSDDDILDALVSLGLRRSLD 2400
             IPFVL  NGSW+PPSRLYDPRV EL NMLHEE FFPSE FSDD ILDALVSLGL+RSL 
Sbjct: 2341 TIPFVLTGNGSWRPPSRLYDPRVHELKNMLHEEAFFPSEKFSDDVILDALVSLGLKRSLG 2400

Query: 2401 LTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSIKVEGSGYELQNSMLIKSN 2460
            L+GLLDCARS+SLLNDSKNSES+SY RR FVCLDALAHKLSI VEG+ YELQNSML KS+
Sbjct: 2401 LSGLLDCARSISLLNDSKNSESKSYGRRFFVCLDALAHKLSINVEGNCYELQNSMLFKSD 2460

Query: 2461 YVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSEMNTIAWCPICADSPLKVL 2520
            +VDDDAS++V SLN EDTS MG DS+IGNLTGD +EEEFWSEM TIAWCP+ A+SP+KVL
Sbjct: 2461 HVDDDASVQVSSLNREDTSGMGIDSIIGNLTGDGTEEEFWSEMKTIAWCPVFANSPVKVL 2520

Query: 2521 PWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQHKLGWTDCPRVEVLCAQLT 2580
            PWLKT+NQVAPPSIVRPKSQMWMVSSSMHILDGVPPS YLQ KLGWTDCP VEVLCAQLT
Sbjct: 2521 PWLKTSNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSVYLQDKLGWTDCPSVEVLCAQLT 2580

Query: 2581 DISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDESVLLKSALNGVSWVWVGD 2640
            DISKLYGELRLHSS   D+NTALQ+GIPILYSKLQEY GTD+ +LLKSALNGVSWVWVGD
Sbjct: 2581 DISKLYGELRLHSSTGSDVNTALQDGIPILYSKLQEYRGTDDFMLLKSALNGVSWVWVGD 2640

Query: 2641 DFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVEGYLDVLQRLHSDVE 2700
            DFV P+ALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNV+ YLDVLQRLH DV 
Sbjct: 2641 DFVSPNALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVKDYLDVLQRLHRDVR 2700

Query: 2701 GSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSSQVLMQANDLVYNDAPWME 2760
            GSPLSTDQM+FVIC+LEAISDC VD PEFTATS  LLIPNSSQVLM A+DLVYNDAPWME
Sbjct: 2701 GSPLSTDQMNFVICVLEAISDCWVDMPEFTATSIPLLIPNSSQVLMLASDLVYNDAPWME 2760

Query: 2761 DNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDLPCMDYAKISELLMLYGND 2820
            DNNILVGKHFVHPSISNDLA RLGVQSIRCLSLVDEEMTKDLPCMDYAKIS+LL LYGND
Sbjct: 2761 DNNILVGKHFVHPSISNDLAGRLGVQSIRCLSLVDEEMTKDLPCMDYAKISDLLKLYGND 2820

Query: 2821 YLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLNTEEIS 2880
            YLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSL+TEEIS
Sbjct: 2821 YLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLSTEEIS 2880

Query: 2881 SLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIFDPRGIALSVAPKSAPGAK 2940
            SLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSI+SGGYFYIFDPRGIALSVAPKS+PGAK
Sbjct: 2881 SLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIISGGYFYIFDPRGIALSVAPKSSPGAK 2940

Query: 2941 VFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPACLKDGLEPGIRKIKEISS 3000
            VFSLIGSNLIE+FNDQFHP+LGG NMSWPSDSTI+RMPLS ACLKDGLE GI +IKEISS
Sbjct: 2941 VFSLIGSNLIERFNDQFHPLLGGHNMSWPSDSTIIRMPLSSACLKDGLESGIIRIKEISS 3000

Query: 3001 KFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLSSAIARNPFSEKKWKKFQL 3060
            KFLDHASRSLLFLKSVVQVSFSTWDQ GLHP+QDYSVC+NLSSAIARNPFSEKKWKKFQL
Sbjct: 3001 KFLDHASRSLLFLKSVVQVSFSTWDQDGLHPHQDYSVCVNLSSAIARNPFSEKKWKKFQL 3060

Query: 3061 SRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPVA 3120
            SRLFSSSNAATK+H ID+I+ QGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPVA
Sbjct: 3061 SRLFSSSNAATKVHAIDVILLQGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPVA 3120

Query: 3121 GVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFLVCHSGGRYLFKNQVLEAV 3180
            GVAAHISRNGLPADI +KSPLMAP PLSGDI LPVTVLGCFLVCHSGGRYLFKNQVLEA+
Sbjct: 3121 GVAAHISRNGLPADIYRKSPLMAPFPLSGDIMLPVTVLGCFLVCHSGGRYLFKNQVLEAL 3180

Query: 3181 AAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSSSALESNVSHSISSSLKAY 3240
              PLDAGNKLVEAWNRELMSCVCDSYI+M+LEIHKQRKESSSSALESNVSHSI+SSLKAY
Sbjct: 3181 VEPLDAGNKLVEAWNRELMSCVCDSYIFMVLEIHKQRKESSSSALESNVSHSITSSLKAY 3240

Query: 3241 GNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNLVKA 3300
            GNQVYSFWPRSEPAN SDSD+DRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNLVKA
Sbjct: 3241 GNQVYSFWPRSEPANDSDSDVDRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNLVKA 3300

Query: 3301 EEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVRD 3360
            EEGMFLAQPGSPVGGNLLPATVC FVKEH+PVFSVPWELIKEIQAVGITVRQIRPKMVRD
Sbjct: 3301 EEGMFLAQPGSPVGGNLLPATVCSFVKEHHPVFSVPWELIKEIQAVGITVRQIRPKMVRD 3360

Query: 3361 LLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMGADSVNTNPGGRSTNTSEG 3420
            LLR  SASIVLQSIDTYLDVLEYCLSDI+LAA S+HA D+MG DSVNTNPGGRSTNT+EG
Sbjct: 3361 LLRAPSASIVLQSIDTYLDVLEYCLSDIVLAASSSHAVDNMGGDSVNTNPGGRSTNTTEG 3420

Query: 3421 SSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGRSGDSLSHSN 3480
            SSTSV VSSM+SF R SNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGR+G+SLSH N
Sbjct: 3421 SSTSVPVSSMHSFGRSSNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGRNGESLSHGN 3480

Query: 3481 TFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSMELWLGSKDQQELMIPLAA 3540
            TFTGRNNSSY NVD++FLQMVSE+KGLPFP+ASNN+VRLGSMELWLGSKDQQELMIPLAA
Sbjct: 3481 TFTGRNNSSY-NVDKHFLQMVSELKGLPFPTASNNVVRLGSMELWLGSKDQQELMIPLAA 3540

Query: 3541 RFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRSVFHANWVNHVMNSNMSPW 3600
            +FVHPK+FDR ILGNILTNDALHKFLKL++FSLSLLAT+MRSVFHANWVNHVMNSNM+PW
Sbjct: 3541 KFVHPKIFDRPILGNILTNDALHKFLKLREFSLSLLATHMRSVFHANWVNHVMNSNMAPW 3600

Query: 3601 FSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPLVPAFLGRPILCRVRERHL 3660
            FSW+NKS +GVEEGPSSEWIRLFWK +  SSQ+LLLFSDWPLVPAFLGRPILCRV+ERHL
Sbjct: 3601 FSWDNKSNAGVEEGPSSEWIRLFWKISSGSSQNLLLFSDWPLVPAFLGRPILCRVKERHL 3660

Query: 3661 VFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPYTSAFQKFQDTYPWLFPLL 3720
            VFLPPVT+P  PNAI EI AGGSDVAETS+S ISKPESIQPYTSAFQ+FQDTYPWLFPLL
Sbjct: 3661 VFLPPVTHPPSPNAISEIVAGGSDVAETSSSEISKPESIQPYTSAFQRFQDTYPWLFPLL 3720

Query: 3721 NHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANNAGYFPELASLSDSNSNEL 3780
            NHCNIPIFDVAFMDCA+LCNCL NS QSLGQ IAS FVAA NAGYFPELASLSDSNS+EL
Sbjct: 3721 NHCNIPIFDVAFMDCAALCNCLPNSSQSLGQAIASKFVAAKNAGYFPELASLSDSNSDEL 3780

Query: 3781 LNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDNDQCMISSNSFLKPYNDCC 3840
            LNLFA DFVSNGTNY R+ELEILR LPIYRTV+GSYTQLR+ +QCMISSNSFLKPYN  C
Sbjct: 3781 LNLFAKDFVSNGTNYRREELEILRTLPIYRTVIGSYTQLREYEQCMISSNSFLKPYNKSC 3840

Query: 3841 LSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSEQEDILIYLFTNWQDLQAD 3900
            LSYSSNSMEYSLLRAL VPEL++QQIL+RFGLP FD KPQSEQED+LIYL+TNW+DLQ+D
Sbjct: 3841 LSYSSNSMEYSLLRALGVPELNDQQILVRFGLPGFDFKPQSEQEDVLIYLYTNWKDLQSD 3900

Query: 3901 AHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFSGERKKFPGERFAADGWLR 3960
            A LVECLSET FVRSADEFCTDLFKSKELYDPSDALLTSVFSGER+KFPGERF ADGWL+
Sbjct: 3901 AQLVECLSETKFVRSADEFCTDLFKSKELYDPSDALLTSVFSGERRKFPGERFGADGWLQ 3960

Query: 3961 ILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDLINGQNEVPMEVWTLAGSV 4020
            ILRKIGLRT  EANVILECAKKVETLGSEWRKSEED  EFDL N QNEVPME+WTLAGSV
Sbjct: 3961 ILRKIGLRTAAEANVILECAKKVETLGSEWRKSEEDSLEFDLTNAQNEVPMEIWTLAGSV 4020

Query: 4021 VEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGKRVLTSYGDGIVSKDWPLA 4080
            VE+VFSNFAVFYSN+FCNALGNI FVPA+LGFPNLGGNKGGKRVLTSY D IVSKDWPLA
Sbjct: 4021 VESVFSNFAVFYSNSFCNALGNIVFVPAELGFPNLGGNKGGKRVLTSYSDAIVSKDWPLA 4080

Query: 4081 WSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTLAHWPISMGVMS 4140
            WSCAPILSKHSVIPP+YSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTL+HWPISMG+MS
Sbjct: 4081 WSCAPILSKHSVIPPEYSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTLSHWPISMGIMS 4140

Query: 4141 INEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSPFAF 4200
            INEASCEVLKYLERIWS L+SLD+LELQRVAFIPVANATRLVKANALFARLTINLSPFAF
Sbjct: 4141 INEASCEVLKYLERIWSNLASLDILELQRVAFIPVANATRLVKANALFARLTINLSPFAF 4200

Query: 4201 ELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQRLNPNELRSVMEILHYICD 4260
            ELPSGYLPFVKILKDLGLQDVLSVASAK LLSSLQVACGYQRLNPNELRSVMEILH+ICD
Sbjct: 4201 ELPSGYLPFVKILKDLGLQDVLSVASAKYLLSSLQVACGYQRLNPNELRSVMEILHFICD 4260

Query: 4261 EAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVHSDLPERI 4320
            EA EAKM+DGREPEIIVPDDGCRLVHATS  YIDTYGSRYIKCIDTSRLRFVH DLPERI
Sbjct: 4261 EASEAKMYDGREPEIIVPDDGCRLVHATSSVYIDTYGSRYIKCIDTSRLRFVHPDLPERI 4320

Query: 4321 CRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLLSRSFQNAVWNVVNSMVNY 4380
            CRMLGIKKLSDLVIEELDHEDSI+ LE IGAVSL FI+ KLLS+SFQNAVWNVVNSMVNY
Sbjct: 4321 CRMLGIKKLSDLVIEELDHEDSIDRLEYIGAVSLGFIKVKLLSKSFQNAVWNVVNSMVNY 4380

Query: 4381 IHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDITRPAKDSIIPEWKDGIHH 4440
            IH NKNLDL+AVE+LLKS+AERLQFVK LHTRFLLLPNSI+ITRPAKDSIIPEW+DG HH
Sbjct: 4381 IHPNKNLDLEAVEELLKSVAERLQFVKCLHTRFLLLPNSINITRPAKDSIIPEWEDGSHH 4440

Query: 4441 RALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPLPVGSLLFCPEGTEIAIIN 4500
            RALYF+  SKTCILVAEPPAYIS+FDVIAIVVSQILGS IPLP+GSLLFCPEGTE AII+
Sbjct: 4441 RALYFIKQSKTCILVAEPPAYISVFDVIAIVVSQILGSSIPLPIGSLLFCPEGTETAIID 4500

Query: 4501 ILKLCSEK-ENEQFTGISSLLGKEILPQDALQLQLHPLRPFYAAEVVAWRSQSGEKLKYG 4560
            IL LCSE+ ENE++TG +SL+GKEILPQDALQ+QLHPLRPFYA EVVAWRS+SGEKLKYG
Sbjct: 4501 ILNLCSEQMENEKYTGSTSLVGKEILPQDALQVQLHPLRPFYAGEVVAWRSKSGEKLKYG 4560

Query: 4561 RVPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISIDGSHSSTNLQDSGHMIID 4620
            RV EDVRPSAGQALY+FRVETA GITQSL+SSQVLSFRSISIDG  SSTNLQD  H++ID
Sbjct: 4561 RVLEDVRPSAGQALYRFRVETAAGITQSLLSSQVLSFRSISIDGGPSSTNLQDKSHIVID 4620

Query: 4621 SGASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLSTAGINVDIERQSLLQKT 4680
            SGAS E+ ENS  GKIRSQPVAELQYG+VSAEELVQAVHEML+TAGINVDIERQSLLQKT
Sbjct: 4621 SGASAEIRENSGGGKIRSQPVAELQYGKVSAEELVQAVHEMLTTAGINVDIERQSLLQKT 4680

Query: 4681 VVLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLCRKC 4740
            ++LQEQLKDSQAALLLEQE+SDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLCRKC
Sbjct: 4681 IILQEQLKDSQAALLLEQEKSDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLCRKC 4740

Query: 4741 SAAVSK 4746
            S+AVSK
Sbjct: 4741 SSAVSK 4745

BLAST of Moc09g05800 vs. NCBI nr
Match: XP_038897838.1 (sacsin isoform X1 [Benincasa hispida])

HSP 1 Score: 8709.0 bits (22597), Expect = 0.0e+00
Identity = 4300/4749 (90.55%), Positives = 4513/4749 (95.03%), Query Frame = 0

Query: 1    MASESTSLDSIFLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD 60
            MASESTSLDSI LEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD
Sbjct: 1    MASESTSLDSILLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD 60

Query: 61   RRVHGSESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF 120
            RRVHG ESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF
Sbjct: 61   RRVHGRESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF 120

Query: 121  NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIRCSAISQYRDQFLPYCA 180
            NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSA+NPGKRIDFIR SAIS+YRDQFLPYCA
Sbjct: 121  NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSASNPGKRIDFIRSSAISKYRDQFLPYCA 180

Query: 181  FDCDMESSFDGTLFRLPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV 240
            FDCDMESSF GTLFR PLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV
Sbjct: 181  FDCDMESSFAGTLFRFPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV 240

Query: 241  SCIEMFVWNDGEAEPQKLYSFSVRSASSDIIWHRQMLLRLSKSTTSTQSEVDSFSLDFLS 300
             CIEMFVWNDGE EPQKLYS+ VRSA+SDIIWHRQMLLRLSKSTTSTQ EVD FSL+F S
Sbjct: 241  LCIEMFVWNDGETEPQKLYSYFVRSANSDIIWHRQMLLRLSKSTTSTQHEVDRFSLEFSS 300

Query: 301  QATTGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSSKT 360
            QA TGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSS  
Sbjct: 301  QAMTGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSSNV 360

Query: 361  ---NVLKLGRAFCFLPLPVKTGLNVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLL 420
               +VLKLGRAFCFLPLPVKTGL VQVNGFFEVSSNRRGIWYGADMDR+GKIRSIWNRLL
Sbjct: 361  TSDSVLKLGRAFCFLPLPVKTGLTVQVNGFFEVSSNRRGIWYGADMDRTGKIRSIWNRLL 420

Query: 421  LEDIIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQVYKNISNALVLYSNV 480
            LEDIIAP+FIELLIGVQV LGPTDTYFSLWPSGSFEEPWNILVEQVYK+ISNA VL+S+V
Sbjct: 421  LEDIIAPAFIELLIGVQVLLGPTDTYFSLWPSGSFEEPWNILVEQVYKSISNAFVLHSDV 480

Query: 481  EGGKWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNMLLKFCRAFQQKVVTP 540
            EGGKWVSP +AFLHDDKF RS ELGE LV LGMPIVHLPENLSNMLLKFC  FQQKVVTP
Sbjct: 481  EGGKWVSPNEAFLHDDKFARSMELGETLVHLGMPIVHLPENLSNMLLKFCCTFQQKVVTP 540

Query: 541  CTVRHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLPLIPLANGDFGLFSEA 600
            CTVRHFLRECKHV TLNRPY+LVLLEYCIEDLIDADVC  AF LPL+PLANGDFG FSEA
Sbjct: 541  CTVRHFLRECKHVFTLNRPYRLVLLEYCIEDLIDADVCTHAFGLPLLPLANGDFGSFSEA 600

Query: 601  SKGISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSKSNIFIFNVHYFLQLF 660
            SKGISYFICDELEY LLHQISDRVIDRN PL ISTRLSNIARSSKSN+FI NVHYFLQLF
Sbjct: 601  SKGISYFICDELEYKLLHQISDRVIDRNTPLTISTRLSNIARSSKSNLFILNVHYFLQLF 660

Query: 661  PKFVPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLSLFSDWPILPSKSRYL 720
            PKFVPADWKY+NEV WDPESCSNHPTSSWF LFW YL D CEKLSLFSDWPILPSKSRYL
Sbjct: 661  PKFVPADWKYRNEVFWDPESCSNHPTSSWFLLFWEYLRDHCEKLSLFSDWPILPSKSRYL 720

Query: 721  YRATKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYD 780
            YRATK+SK+INVQ LS+EMQ IL KLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYD
Sbjct: 721  YRATKQSKVINVQKLSHEMQNILGKLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYD 780

Query: 781  AISSTGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLEKCKRLPIYKVYNGGS 840
            AISSTGGLMLTSLY+LEVEEKDGLRRFLLDPKWYLGGCMND+DL KC+RLPI+KVYNGGS
Sbjct: 781  AISSTGGLMLTSLYNLEVEEKDGLRRFLLDPKWYLGGCMNDNDLGKCRRLPIFKVYNGGS 840

Query: 841  AQDFGFSDLENPQKYLPPSDVGEFFLGVEFIVSSFDSEVEILLKYYGIKKMGKASFYRKH 900
            +QDF FSDLE+P+KYLPPSDVGE FLGVEFI+SS DSE EILLKYYGIKKMGKASFYRK+
Sbjct: 841  SQDFCFSDLEDPRKYLPPSDVGECFLGVEFIISSSDSEEEILLKYYGIKKMGKASFYRKY 900

Query: 901  VLNQVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVPTSSGTLKCPTVLYDP 960
            VLNQV QLQPELRDSTMLS+L NLPQLC EDV FRECLSNL F+PTSSGTLKCP VLYDP
Sbjct: 901  VLNQVGQLQPELRDSTMLSLLLNLPQLCTEDVTFRECLSNLDFIPTSSGTLKCPAVLYDP 960

Query: 961  RYEELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQSAQHVERLMHKDHNK 1020
            RYEELCALLDDFDSFPSTPFNESYILDILQGLGLRTCV+PETIVQSA HVER MHKD NK
Sbjct: 961  RYEELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVSPETIVQSALHVERFMHKDQNK 1020

Query: 1021 AHSRGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPRNFNSDLEKFWNDLCK 1080
            AHSRGKVLLSYLEVNAIKWLLNP NEDQGMVNRLFSTAATAF+PRNF SDLEKFWNDL K
Sbjct: 1021 AHSRGKVLLSYLEVNAIKWLLNPTNEDQGMVNRLFSTAATAFKPRNFTSDLEKFWNDLRK 1080

Query: 1081 ISWCPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLG 1140
            ISWCPVLLSPPFETLPWPVVSSMVAPPKLVRLP+DLWLVSASMRILDGECSSSALAHSLG
Sbjct: 1081 ISWCPVLLSPPFETLPWPVVSSMVAPPKLVRLPRDLWLVSASMRILDGECSSSALAHSLG 1140

Query: 1141 WSSPPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLTGLIGSDEMDVVKAVL 1200
            WSSPP GSIIAAQLLELGKNNEIVYDQ+LRKELALAMPRIYALLT LIGSD+MDVVKAVL
Sbjct: 1141 WSSPPTGSIIAAQLLELGKNNEIVYDQMLRKELALAMPRIYALLTSLIGSDDMDVVKAVL 1200

Query: 1201 EGCRWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFLELGIREFLKPNDYAD 1260
            EGCRWIWVGDGFATS+EVVL+GPLHLAPYIRVIPIDLAVFKDLFLELGIREFL PNDYAD
Sbjct: 1201 EGCRWIWVGDGFATSKEVVLEGPLHLAPYIRVIPIDLAVFKDLFLELGIREFLNPNDYAD 1260

Query: 1261 ILSRMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYND 1320
            ILSRMA +KGSSPLN QE+RAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYND
Sbjct: 1261 ILSRMATRKGSSPLNTQELRAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYND 1320

Query: 1321 APWLLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADS 1380
            APWLLGTD+TDV +DGEST  L+ARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADS
Sbjct: 1321 APWLLGTDNTDVSYDGESTAFLSARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADS 1380

Query: 1381 MNLSLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAEDAGASEVVFLLDKTHY 1440
            MNLSLSGAAEAFGQHEALTNRLRHILEMYADG GILFELIQNAEDAGASEV+FLLDKTHY
Sbjct: 1381 MNLSLSGAAEAFGQHEALTNRLRHILEMYADGSGILFELIQNAEDAGASEVIFLLDKTHY 1440

Query: 1441 GTSSILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVY 1500
            GTSS+LSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVY
Sbjct: 1441 GTSSVLSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVY 1500

Query: 1501 HFTDIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCD 1560
            HFTDIPTFVSGEN+VMFDPHACNLPGISPSHPGLRIKYAGR+ILEQFPDQFSPYLHFGCD
Sbjct: 1501 HFTDIPTFVSGENVVMFDPHACNLPGISPSHPGLRIKYAGRKILEQFPDQFSPYLHFGCD 1560

Query: 1561 MQKPFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEVASDALLFLTNVKKIS 1620
            MQKPFPGTLFRFPLRSPALASRSEIKKEGYAPEDV SLFYSFSEVASDAL+FLTNVK IS
Sbjct: 1561 MQKPFPGTLFRFPLRSPALASRSEIKKEGYAPEDVISLFYSFSEVASDALVFLTNVKTIS 1620

Query: 1621 IFIKDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQGEMDREQFLMKLSKSI 1680
            IFIKDDI H+MQCLYRVHKNT+SEP+T+SSA+QDI+SFIYGNR+GEMDREQFLMKL+KSI
Sbjct: 1621 IFIKDDIGHEMQCLYRVHKNTISEPTTKSSAQQDIMSFIYGNRRGEMDREQFLMKLNKSI 1680

Query: 1681 NRDLPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLGDKSYNFIPWACVAAL 1740
            NRDLP+ CQKLIITEKSS GDILQH+WI+SGCLGGGLPRNNSG GDKSYNFIPWA VAAL
Sbjct: 1681 NRDLPHVCQKLIITEKSSGGDILQHFWISSGCLGGGLPRNNSGGGDKSYNFIPWASVAAL 1740

Query: 1741 LHSVQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAFCFLPLPVRTGLPVHV 1800
            LHSV+V+ EMN+DPETENNWL+ASDLVQVSSAS + RKP EGRAFCFLPLP++TGLPVHV
Sbjct: 1741 LHSVKVNVEMNHDPETENNWLIASDLVQVSSASEQDRKPLEGRAFCFLPLPIKTGLPVHV 1800

Query: 1801 NAYFELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRLLEKIVSEIGHSGLFS 1860
            NAYFELSSNRRDIWYGDDMAG G+KRSEWNSYLLEDVVAPAYGRLLEK+ SEIGH GL+S
Sbjct: 1801 NAYFELSSNRRDIWYGDDMAGCGRKRSEWNSYLLEDVVAPAYGRLLEKVASEIGHFGLYS 1860

Query: 1861 SFWPTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQAIFPDFSFDKVYELI 1920
            SFWP  AGLEPWG VVRKLYSFIGDFGLLVLYTNARGGQWIS +QAIFPDFSFDKV+ELI
Sbjct: 1861 SFWPAAAGLEPWGLVVRKLYSFIGDFGLLVLYTNARGGQWISARQAIFPDFSFDKVHELI 1920

Query: 1921 EALADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYC 1980
            EAL+DSGLPVI+ISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYC
Sbjct: 1921 EALSDSGLPVISISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYC 1980

Query: 1981 LVDLKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDEYGLLKDSVPGQLVDP 2040
            LVDLKLP QSDSLCGLPLLPL DGSFTSFHKN +GER YIA+GDEYGLLKDSVP QLVDP
Sbjct: 1981 LVDLKLPIQSDSLCGLPLLPLVDGSFTSFHKNRIGERIYIAKGDEYGLLKDSVPSQLVDP 2040

Query: 2041 GIPEVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNARQVNWNPGHQGQPSLE 2100
             IPEVVHAKLCEVAQTEDLN+CFLSC LLEK+FLRFLP EWQNARQVNWNPGHQGQPSLE
Sbjct: 2041 DIPEVVHAKLCEVAQTEDLNLCFLSCHLLEKIFLRFLPTEWQNARQVNWNPGHQGQPSLE 2100

Query: 2101 WIRLIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKADGWSENMFSLLLKVGC 2160
            WIRL+WCYLKSHC+DLSQFSKWPILPVG+NSLLQLVENSNVL+ADGWSENMFSLLLKVGC
Sbjct: 2101 WIRLMWCYLKSHCNDLSQFSKWPILPVGENSLLQLVENSNVLRADGWSENMFSLLLKVGC 2160

Query: 2161 LFLRRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFRDASESELHELRSFIL 2220
            LFLRRDMP+EHPQLEN+VHPSTAIGILNAFLSIAGDI NVE LF DASE ELHE RSFIL
Sbjct: 2161 LFLRRDMPVEHPQLENFVHPSTAIGILNAFLSIAGDIGNVERLFHDASEGELHEFRSFIL 2220

Query: 2221 QSKWYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPTGLCEDFLNDDFVRME 2280
            QSKW+LE KMEAIHVDIIK IPMFESYKCRKLVSLSKP+RWIKPTGLCEDFLNDDFVR+E
Sbjct: 2221 QSKWFLEEKMEAIHVDIIKRIPMFESYKCRKLVSLSKPVRWIKPTGLCEDFLNDDFVRVE 2280

Query: 2281 SEKERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTILHDVKLLIEEDVSLKS 2340
            SEKERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSER A+STIL DVKLLIEEDVSLKS
Sbjct: 2281 SEKERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSEREAISTILLDVKLLIEEDVSLKS 2340

Query: 2341 SVSMIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFSDDDILDALVSLGLRR 2400
            SVS IPFVL  NGSW+PPSRLYDPRV EL NMLHEE FFPSE FSDD ILDALVSLGL+R
Sbjct: 2341 SVSTIPFVLTGNGSWRPPSRLYDPRVHELKNMLHEEAFFPSEKFSDDVILDALVSLGLKR 2400

Query: 2401 SLDLTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSIKVEGSGYELQNSMLI 2460
            SL L+GLLDCARS+SLLNDSKNSES+SY RR FVCLDALAHKLSI VEG+ YELQNSML 
Sbjct: 2401 SLGLSGLLDCARSISLLNDSKNSESKSYGRRFFVCLDALAHKLSINVEGNCYELQNSMLF 2460

Query: 2461 KSNYVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSEMNTIAWCPICADSPL 2520
            KS++VDDDAS++V SLN EDTS MG DS+IGNLTGD +EEEFWSEM TIAWCP+ A+SP+
Sbjct: 2461 KSDHVDDDASVQVSSLNREDTSGMGIDSIIGNLTGDGTEEEFWSEMKTIAWCPVFANSPV 2520

Query: 2521 KVLPWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQHKLGWTDCPRVEVLCA 2580
            KVLPWLKT+NQVAPPSIVRPKSQMWMVSSSMHILDGVPPS YLQ KLGWTDCP VEVLCA
Sbjct: 2521 KVLPWLKTSNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSVYLQDKLGWTDCPSVEVLCA 2580

Query: 2581 QLTDISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDESVLLKSALNGVSWVW 2640
            QLTDISKLYGELRLHSS   D+NTALQ+GIPILYSKLQEY GTD+ +LLKSALNGVSWVW
Sbjct: 2581 QLTDISKLYGELRLHSSTGSDVNTALQDGIPILYSKLQEYRGTDDFMLLKSALNGVSWVW 2640

Query: 2641 VGDDFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVEGYLDVLQRLHS 2700
            VGDDFV P+ALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNV+ YLDVLQRLH 
Sbjct: 2641 VGDDFVSPNALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVKDYLDVLQRLHR 2700

Query: 2701 DVEGSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSSQVLMQANDLVYNDAP 2760
            DV GSPLSTDQM+FVIC+LEAISDC VD PEFTATS  LLIPNSSQVLM A+DLVYNDAP
Sbjct: 2701 DVRGSPLSTDQMNFVICVLEAISDCWVDMPEFTATSIPLLIPNSSQVLMLASDLVYNDAP 2760

Query: 2761 WMEDNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDLPCMDYAKISELLMLY 2820
            WMEDNNILVGKHFVHPSISNDLA RLGVQSIRCLSLVDEEMTKDLPCMDYAKIS+LL LY
Sbjct: 2761 WMEDNNILVGKHFVHPSISNDLAGRLGVQSIRCLSLVDEEMTKDLPCMDYAKISDLLKLY 2820

Query: 2821 GNDYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLNTE 2880
            GNDYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSL+TE
Sbjct: 2821 GNDYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLSTE 2880

Query: 2881 EISSLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIFDPRGIALSVAPKSAP 2940
            EISSLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSI+SGGYFYIFDPRGIALSVAPKS+P
Sbjct: 2881 EISSLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIISGGYFYIFDPRGIALSVAPKSSP 2940

Query: 2941 GAKVFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPACLKDGLEPGIRKIKE 3000
            GAKVFSLIGSNLIE+FNDQFHP+LGG NMSWPSDSTI+RMPLS ACLKDGLE GI +IKE
Sbjct: 2941 GAKVFSLIGSNLIERFNDQFHPLLGGHNMSWPSDSTIIRMPLSSACLKDGLESGIIRIKE 3000

Query: 3001 ISSKFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLSSAIARNPFSEKKWKK 3060
            ISSKFLDHASRSLLFLKSVVQVSFSTWDQ GLHP+QDYSVC+NLSSAIARNPFSEKKWKK
Sbjct: 3001 ISSKFLDHASRSLLFLKSVVQVSFSTWDQDGLHPHQDYSVCVNLSSAIARNPFSEKKWKK 3060

Query: 3061 FQLSRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLT 3120
            FQLSRLFSSSNAATK+H ID+I+ QGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLT
Sbjct: 3061 FQLSRLFSSSNAATKVHAIDVILLQGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLT 3120

Query: 3121 PVAGVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFLVCHSGGRYLFKNQVL 3180
            PVAGVAAHISRNGLPADI +KSPLMAP PLSGDI LPVTVLGCFLVCHSGGRYLFKNQVL
Sbjct: 3121 PVAGVAAHISRNGLPADIYRKSPLMAPFPLSGDIMLPVTVLGCFLVCHSGGRYLFKNQVL 3180

Query: 3181 EAVAAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSSSALESNVSHSISSSL 3240
            EA+  PLDAGNKLVEAWNRELMSCVCDSYI+M+LEIHKQRKESSSSALESNVSHSI+SSL
Sbjct: 3181 EALVEPLDAGNKLVEAWNRELMSCVCDSYIFMVLEIHKQRKESSSSALESNVSHSITSSL 3240

Query: 3241 KAYGNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNL 3300
            KAYGNQVYSFWPRSEPAN SDSD+DRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNL
Sbjct: 3241 KAYGNQVYSFWPRSEPANDSDSDVDRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNL 3300

Query: 3301 VKAEEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKM 3360
            VKAEEGMFLAQPGSPVGGNLLPATVC FVKEH+PVFSVPWELIKEIQAVGITVRQIRPKM
Sbjct: 3301 VKAEEGMFLAQPGSPVGGNLLPATVCSFVKEHHPVFSVPWELIKEIQAVGITVRQIRPKM 3360

Query: 3361 VRDLLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMGADSVNTNPGGRSTNT 3420
            VRDLLR  SASIVLQSIDTYLDVLEYCLSDI+LAA S+HA D+MG DSVNTNPGGRSTNT
Sbjct: 3361 VRDLLRAPSASIVLQSIDTYLDVLEYCLSDIVLAASSSHAVDNMGGDSVNTNPGGRSTNT 3420

Query: 3421 SEGSSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGRSGDSLS 3480
            +EGSSTSV VSSM+SF R SNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGR+G+SLS
Sbjct: 3421 TEGSSTSVPVSSMHSFGRSSNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGRNGESLS 3480

Query: 3481 HSNTFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSMELWLGSKDQQELMIP 3540
            H NTFTGRNNSSY NVD++FLQMVSE+KGLPFP+ASNN+VRLGSMELWLGSKDQQELMIP
Sbjct: 3481 HGNTFTGRNNSSY-NVDKHFLQMVSELKGLPFPTASNNVVRLGSMELWLGSKDQQELMIP 3540

Query: 3541 LAARFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRSVFHANWVNHVMNSNM 3600
            LAA+FVHPK+FDR ILGNILTNDALHKFLKL++FSLSLLAT+MRSVFHANWVNHVMNSNM
Sbjct: 3541 LAAKFVHPKIFDRPILGNILTNDALHKFLKLREFSLSLLATHMRSVFHANWVNHVMNSNM 3600

Query: 3601 SPWFSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPLVPAFLGRPILCRVRE 3660
            +PWFSW+NKS +GVEEGPSSEWIRLFWK +  SSQ+LLLFSDWPLVPAFLGRPILCRV+E
Sbjct: 3601 APWFSWDNKSNAGVEEGPSSEWIRLFWKISSGSSQNLLLFSDWPLVPAFLGRPILCRVKE 3660

Query: 3661 RHLVFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPYTSAFQKFQDTYPWLF 3720
            RHLVFLPPVT+P  PNAI EI AGGSDVAETS+S ISKPESIQPYTSAFQ+FQDTYPWLF
Sbjct: 3661 RHLVFLPPVTHPPSPNAISEIVAGGSDVAETSSSEISKPESIQPYTSAFQRFQDTYPWLF 3720

Query: 3721 PLLNHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANNAGYFPELASLSDSNS 3780
            PLLNHCNIPIFDVAFMDCA+LCNCL NS QSLGQ IAS FVAA NAGYFPELASLSDSNS
Sbjct: 3721 PLLNHCNIPIFDVAFMDCAALCNCLPNSSQSLGQAIASKFVAAKNAGYFPELASLSDSNS 3780

Query: 3781 NELLNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDNDQCMISSNSFLKPYN 3840
            +ELLNLFA DFVSNGTNY R+ELEILR LPIYRTV+GSYTQLR+ +QCMISSNSFLKPYN
Sbjct: 3781 DELLNLFAKDFVSNGTNYRREELEILRTLPIYRTVIGSYTQLREYEQCMISSNSFLKPYN 3840

Query: 3841 DCCLSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSEQEDILIYLFTNWQDL 3900
              CLSYSSNSMEYSLLRAL VPEL++QQIL+RFGLP FD KPQSEQED+LIYL+TNW+DL
Sbjct: 3841 KSCLSYSSNSMEYSLLRALGVPELNDQQILVRFGLPGFDFKPQSEQEDVLIYLYTNWKDL 3900

Query: 3901 QADAHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFSGERKKFPGERFAADG 3960
            Q+DA LVECLSET FVRSADEFCTDLFKSKELYDPSDALLTSVFSGER+KFPGERF ADG
Sbjct: 3901 QSDAQLVECLSETKFVRSADEFCTDLFKSKELYDPSDALLTSVFSGERRKFPGERFGADG 3960

Query: 3961 WLRILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDLINGQNEVPMEVWTLA 4020
            WL+ILRKIGLRT  EANVILECAKKVETLGSEWRKSEED  EFDL N QNEVPME+WTLA
Sbjct: 3961 WLQILRKIGLRTAAEANVILECAKKVETLGSEWRKSEEDSLEFDLTNAQNEVPMEIWTLA 4020

Query: 4021 GSVVEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGKRVLTSYGDGIVSKDW 4080
            GSVVE+VFSNFAVFYSN+FCNALGNI FVPA+LGFPNLGGNKGGKRVLTSY D IVSKDW
Sbjct: 4021 GSVVESVFSNFAVFYSNSFCNALGNIVFVPAELGFPNLGGNKGGKRVLTSYSDAIVSKDW 4080

Query: 4081 PLAWSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTLAHWPISMG 4140
            PLAWSCAPILSKHSVIPP+YSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTL+HWPISMG
Sbjct: 4081 PLAWSCAPILSKHSVIPPEYSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTLSHWPISMG 4140

Query: 4141 VMSINEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSP 4200
            +MSINEASCEVLKYLERIWS L+SLD+LELQRVAFIPVANATRLVKANALFARLTINLSP
Sbjct: 4141 IMSINEASCEVLKYLERIWSNLASLDILELQRVAFIPVANATRLVKANALFARLTINLSP 4200

Query: 4201 FAFELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQRLNPNELRSVMEILHY 4260
            FAFELPSGYLPFVKILKDLGLQDVLSVASAK LLSSLQVACGYQRLNPNELRSVMEILH+
Sbjct: 4201 FAFELPSGYLPFVKILKDLGLQDVLSVASAKYLLSSLQVACGYQRLNPNELRSVMEILHF 4260

Query: 4261 ICDEAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVHSDLP 4320
            ICDEA EAKM+DGREPEIIVPDDGCRLVHATS  YIDTYGSRYIKCIDTSRLRFVH DLP
Sbjct: 4261 ICDEASEAKMYDGREPEIIVPDDGCRLVHATSSVYIDTYGSRYIKCIDTSRLRFVHPDLP 4320

Query: 4321 ERICRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLLSRSFQNAVWNVVNSM 4380
            ERICRMLGIKKLSDLVIEELDHEDSI+ LE IGAVSL FI+ KLLS+SFQNAVWNVVNSM
Sbjct: 4321 ERICRMLGIKKLSDLVIEELDHEDSIDRLEYIGAVSLGFIKVKLLSKSFQNAVWNVVNSM 4380

Query: 4381 VNYIHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDITRPAKDSIIPEWKDG 4440
            VNYIH NKNLDL+AVE+LLKS+AERLQFVK LHTRFLLLPNSI+ITRPAKDSIIPEW+DG
Sbjct: 4381 VNYIHPNKNLDLEAVEELLKSVAERLQFVKCLHTRFLLLPNSINITRPAKDSIIPEWEDG 4440

Query: 4441 IHHRALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPLPVGSLLFCPEGTEIA 4500
             HHRALYF+  SKTCILVAEPPAYIS+FDVIAIVVSQILGS IPLP+GSLLFCPEGTE A
Sbjct: 4441 SHHRALYFIKQSKTCILVAEPPAYISVFDVIAIVVSQILGSSIPLPIGSLLFCPEGTETA 4500

Query: 4501 IINILKLCSEK-ENEQFTGISSLLGKEILPQDALQLQLHPLRPFYAAEVVAWRSQSGEKL 4560
            II+IL LCSE+ ENE++TG +SL+GKEILPQDALQ+QLHPLRPFYA EVVAWRS+SGEKL
Sbjct: 4501 IIDILNLCSEQMENEKYTGSTSLVGKEILPQDALQVQLHPLRPFYAGEVVAWRSKSGEKL 4560

Query: 4561 KYGRVPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISIDGSHSSTNLQDSGHM 4620
            KYGRV EDVRPSAGQALY+FRVETA GITQSL+SSQVLSFRSISIDG  SSTNLQD  H+
Sbjct: 4561 KYGRVLEDVRPSAGQALYRFRVETAAGITQSLLSSQVLSFRSISIDGGPSSTNLQDKSHI 4620

Query: 4621 IIDSGASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLSTAGINVDIERQSLL 4680
            +IDSGAS E+ ENS  GKIRSQPVAELQYG+VSAEELVQAVHEML+TAGINVDIERQSLL
Sbjct: 4621 VIDSGASAEIRENSGGGKIRSQPVAELQYGKVSAEELVQAVHEMLTTAGINVDIERQSLL 4680

Query: 4681 QKTVVLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLC 4740
            QKT++LQEQLKDSQAALLLEQE+SDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLC
Sbjct: 4681 QKTIILQEQLKDSQAALLLEQEKSDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLC 4740

Query: 4741 RKCSAAVSK 4746
            RKCS+AVSK
Sbjct: 4741 RKCSSAVSK 4748

BLAST of Moc09g05800 vs. NCBI nr
Match: XP_023513522.1 (sacsin isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 8672.4 bits (22502), Expect = 0.0e+00
Identity = 4281/4746 (90.20%), Positives = 4486/4746 (94.52%), Query Frame = 0

Query: 1    MASESTSLDSIFLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD 60
            MASES SLDS+FLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD
Sbjct: 1    MASESASLDSVFLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD 60

Query: 61   RRVHGSESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF 120
            RRVHG ESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF
Sbjct: 61   RRVHGRESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF 120

Query: 121  NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIRCSAISQYRDQFLPYCA 180
            NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIR SAISQYRDQFLPYC 
Sbjct: 121  NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIRSSAISQYRDQFLPYCV 180

Query: 181  FDCDMESSFDGTLFRLPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV 240
            F+CDM+SSF GTLFRLPLRNAD AARS ISRQAYTEEDISSMFAELYEEGVLTLLFLKSV
Sbjct: 181  FNCDMKSSFAGTLFRLPLRNADLAARSNISRQAYTEEDISSMFAELYEEGVLTLLFLKSV 240

Query: 241  SCIEMFVWNDGEAEPQKLYSFSVRSASSDIIWHRQMLLRLSKSTTSTQSEVDSFSLDFLS 300
             CIEMFVWNDGE EPQKLYSFSVRSASSDIIWHRQMLLRLSKST ST S+ DS+SL+FLS
Sbjct: 241  LCIEMFVWNDGETEPQKLYSFSVRSASSDIIWHRQMLLRLSKSTASTMSDTDSYSLEFLS 300

Query: 301  QATTGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSSKT 360
            +A +GTQIEERIDSFFIVQTMASATSRI SFAATASKEYDIHLLPWASLAVCTSDDSS  
Sbjct: 301  RAMSGTQIEERIDSFFIVQTMASATSRIGSFAATASKEYDIHLLPWASLAVCTSDDSSNN 360

Query: 361  NVLKLGRAFCFLPLPVKTGLNVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLLLED 420
            +VLKLGRAFCFLPLPVKTGL VQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLLLED
Sbjct: 361  SVLKLGRAFCFLPLPVKTGLTVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLLLED 420

Query: 421  IIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQVYKNISNALVLYSNVEGG 480
            IIAPSFIELLIGVQVFLGPTD YFSLWPSGSFEEPWNILVEQVYKNI NALVLYSNVEGG
Sbjct: 421  IIAPSFIELLIGVQVFLGPTDAYFSLWPSGSFEEPWNILVEQVYKNIGNALVLYSNVEGG 480

Query: 481  KWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNMLLKFCRAFQQKVVTPCTV 540
            KWVSP +AFLHDDKF RS ELGEALV LGMPIVHLPENLSNMLLKFC  FQQKVVTPCTV
Sbjct: 481  KWVSPNEAFLHDDKFARSTELGEALVRLGMPIVHLPENLSNMLLKFCCTFQQKVVTPCTV 540

Query: 541  RHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLPLIPLANGDFGLFSEASKG 600
            R FLR+CKHV+TLNRPYKLVLLEYCIEDLIDADVC  AF LPL+PLANGDFGLFSEASKG
Sbjct: 541  RQFLRDCKHVSTLNRPYKLVLLEYCIEDLIDADVCTHAFGLPLLPLANGDFGLFSEASKG 600

Query: 601  ISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSKSNIFIFNVHYFLQLFPKF 660
            ISYFICDELEY LLHQISDRVID NIPL IS RLSNIARSS SN+F FNVHYFLQLFPKF
Sbjct: 601  ISYFICDELEYKLLHQISDRVIDWNIPLAISARLSNIARSSTSNLFAFNVHYFLQLFPKF 660

Query: 661  VPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLSLFSDWPILPSKSRYLYRA 720
            VPADWKYKNEV WDPESCSNHPTSSWF LFW+YL D CEKLSLFSDWPILP KSRYLYRA
Sbjct: 661  VPADWKYKNEVFWDPESCSNHPTSSWFLLFWQYLRDHCEKLSLFSDWPILPCKSRYLYRA 720

Query: 721  TKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAIS 780
            +K+SK+INVQMLSNEMQ ILSKLGCKLLDPYYKVEHRDL HYVNDGNCTG+LDSIYDAIS
Sbjct: 721  SKQSKVINVQMLSNEMQNILSKLGCKLLDPYYKVEHRDLFHYVNDGNCTGILDSIYDAIS 780

Query: 781  STGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLEKCKRLPIYKVYNGGSAQD 840
            STGGL+LTSL++LEVEEKDGLRRFLLDPKWYLGG M D++LEKCKRLPI+KVYNGGSAQD
Sbjct: 781  STGGLLLTSLHNLEVEEKDGLRRFLLDPKWYLGGSMKDNELEKCKRLPIFKVYNGGSAQD 840

Query: 841  FGFSDLENPQKYLPPSDVGEFFLGVEFIVSSF-DSEVEILLKYYGIKKMGKASFYRKHVL 900
            FGFSDLE+P+KY PPSDVGE FLGVEFI SS  D E EILLKYYGIKKMGKASFYRKHVL
Sbjct: 841  FGFSDLESPRKYFPPSDVGECFLGVEFIFSSSDDGEEEILLKYYGIKKMGKASFYRKHVL 900

Query: 901  NQVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVPTSSGTLKCPTVLYDPRY 960
            NQV QLQP+LRD+TMLSVL NLPQLCVEDV FRECLSNL F+PTS GTLKCP  LYDPRY
Sbjct: 901  NQVGQLQPKLRDNTMLSVLLNLPQLCVEDVTFRECLSNLDFIPTSRGTLKCPAALYDPRY 960

Query: 961  EELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQSAQHVERLMHKDHNKAH 1020
            EELCALLDDFDSFPSTPFNESYILDILQGLGLRTCV+PETIVQSA HVER MH D NKAH
Sbjct: 961  EELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVSPETIVQSALHVERFMHMDQNKAH 1020

Query: 1021 SRGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPRNFNSDLEKFWNDLCKIS 1080
            SRGKVLLSYLEVNAIKWLLNP NE+ GMVNRLFSTAATAFRPRNF SDLEKFWNDL KIS
Sbjct: 1021 SRGKVLLSYLEVNAIKWLLNPTNEEHGMVNRLFSTAATAFRPRNFTSDLEKFWNDLRKIS 1080

Query: 1081 WCPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWS 1140
            WCPVLL+PPFETLPWP+VSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWS
Sbjct: 1081 WCPVLLTPPFETLPWPIVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWS 1140

Query: 1141 SPPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLTGLIGSDEMDVVKAVLEG 1200
            S P GSIIAAQLLELGKNNEIV+D VLRKELA AMPRIYALLTGLIGSDEMDVVKAVLEG
Sbjct: 1141 SSPSGSIIAAQLLELGKNNEIVHDLVLRKELAQAMPRIYALLTGLIGSDEMDVVKAVLEG 1200

Query: 1201 CRWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFLELGIREFLKPNDYADIL 1260
            CRWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFK+LFL+LGIREFLKPNDYADIL
Sbjct: 1201 CRWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKELFLKLGIREFLKPNDYADIL 1260

Query: 1261 SRMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYNDAP 1320
            SRMAI+KGSS LN QEVRAAILIVQHLAEAQLPKQQIN+YLPDIS RLLPASNLVYNDAP
Sbjct: 1261 SRMAIRKGSSSLNTQEVRAAILIVQHLAEAQLPKQQINLYLPDISCRLLPASNLVYNDAP 1320

Query: 1321 WLLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADSMN 1380
            WLLGTDDTDV F+GEST VLNARKTVQ FVHGNISN+VAEKLGVCSLRRILLAESADSMN
Sbjct: 1321 WLLGTDDTDVSFNGESTFVLNARKTVQNFVHGNISNEVAEKLGVCSLRRILLAESADSMN 1380

Query: 1381 LSLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAEDAGASEVVFLLDKTHYGT 1440
            LSLSGAAEAFGQHEALTNRLRHILEMYADG GILFELIQNAEDAGASEVVFLLDKTHYGT
Sbjct: 1381 LSLSGAAEAFGQHEALTNRLRHILEMYADGSGILFELIQNAEDAGASEVVFLLDKTHYGT 1440

Query: 1441 SSILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHF 1500
            SSILSPEMADWQGPALYCYNDSVFSS DLYAISRVGQESKLQKPLSIGRFGLGFNCVYHF
Sbjct: 1441 SSILSPEMADWQGPALYCYNDSVFSSHDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHF 1500

Query: 1501 TDIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQ 1560
            TDIPTFVSGEN+VMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQ
Sbjct: 1501 TDIPTFVSGENVVMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQ 1560

Query: 1561 KPFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEVASDALLFLTNVKKISIF 1620
            KPFPGTLFRFPLRSPALASRSEIKKE YAPEDV SLFYSFSEVASDALLFLTNVK ISIF
Sbjct: 1561 KPFPGTLFRFPLRSPALASRSEIKKEAYAPEDVLSLFYSFSEVASDALLFLTNVKTISIF 1620

Query: 1621 IKDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQGEMDREQFLMKLSKSINR 1680
             KDDI H+MQCLYRVHKNT+SEPSTESSA+QDII+FI GNRQGE+DREQFL KL+KSI++
Sbjct: 1621 TKDDIGHEMQCLYRVHKNTISEPSTESSAQQDIINFICGNRQGELDREQFLRKLNKSISK 1680

Query: 1681 DLPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLGDKSYNFIPWACVAALLH 1740
            DLPYKCQK IITEKSS GDILQHYWITSGCLGGGLPRNNSG+GDKSYNFIPWACVAALLH
Sbjct: 1681 DLPYKCQKHIITEKSSGGDILQHYWITSGCLGGGLPRNNSGVGDKSYNFIPWACVAALLH 1740

Query: 1741 SVQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAFCFLPLPVRTGLPVHVNA 1800
            SV+VD EMNYD E ENNWL+ASD VQVSSASI+GRKP EGRAFCFLPLP++TGLPVHVNA
Sbjct: 1741 SVKVDEEMNYDQEAENNWLIASDSVQVSSASIQGRKPLEGRAFCFLPLPIKTGLPVHVNA 1800

Query: 1801 YFELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRLLEKIVSEIGHSGLFSSF 1860
            YFELSSNRRDIWYGDDMAGGGKKRSEWNSYLLE+VVAPAYG LLEKI SEIGHSGLFSSF
Sbjct: 1801 YFELSSNRRDIWYGDDMAGGGKKRSEWNSYLLENVVAPAYGHLLEKIASEIGHSGLFSSF 1860

Query: 1861 WPTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQAIFPDFSFDKVYELIEA 1920
            WP+TAGLEPWGSVVR LYSFIGDFG+LVLYTNARGGQWIS +QAIFPDFSFDKV+ELIEA
Sbjct: 1861 WPSTAGLEPWGSVVRNLYSFIGDFGILVLYTNARGGQWISARQAIFPDFSFDKVHELIEA 1920

Query: 1921 LADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLV 1980
            L+DSGLP+I+ SKSIVDRFMEV PSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLV
Sbjct: 1921 LSDSGLPLISTSKSIVDRFMEVCPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLV 1980

Query: 1981 DLKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDEYGLLKDSVPGQLVDPGI 2040
            DLKLPFQS+SLC LPLLPLADG+FT+F+KNGMGERTYIARGDEYG+LK+SVP QLVDP I
Sbjct: 1981 DLKLPFQSESLCRLPLLPLADGTFTTFNKNGMGERTYIARGDEYGILKESVPSQLVDPDI 2040

Query: 2041 PEVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNARQVNWNPGHQGQPSLEWI 2100
            PE VHAKLCEVAQTEDLNICFLSC LLEKLFLRFLP EWQNARQVNWNPGHQ  PSLEWI
Sbjct: 2041 PEAVHAKLCEVAQTEDLNICFLSCHLLEKLFLRFLPTEWQNARQVNWNPGHQSHPSLEWI 2100

Query: 2101 RLIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKADGWSENMFSLLLKVGCLF 2160
            RL+WCYLK HC+DLSQFSKWPILPVG+NSLLQLVENSNVL+ADGWSENMFSLLLKVGCLF
Sbjct: 2101 RLVWCYLKLHCNDLSQFSKWPILPVGENSLLQLVENSNVLRADGWSENMFSLLLKVGCLF 2160

Query: 2161 LRRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFRDASESELHELRSFILQS 2220
            LRRDMPIEHPQLEN+VHPSTA GILNAFL+IAGDIENVEGLF DA E ELHELRSFILQS
Sbjct: 2161 LRRDMPIEHPQLENFVHPSTATGILNAFLAIAGDIENVEGLFHDACEGELHELRSFILQS 2220

Query: 2221 KWYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPTGLCEDFLNDDFVRMESE 2280
            KW+LEG MEA HVDIIK IPMFESYK RKLVSLS+P+RWIKPTGL EDFLNDDFVR+ESE
Sbjct: 2221 KWFLEGNMEATHVDIIKRIPMFESYKSRKLVSLSQPVRWIKPTGLYEDFLNDDFVRIESE 2280

Query: 2281 KERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTILHDVKLLIEEDVSLKSSV 2340
            KERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSER ALSTIL DVKLLIEED SLKSSV
Sbjct: 2281 KERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSEREALSTILLDVKLLIEEDASLKSSV 2340

Query: 2341 SMIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFSDDDILDALVSLGLRRSL 2400
            SMIPFVL  NGSWQPPSRLYDPRV EL NMLHEETFFPSE FSDDDILDALVSLGL RSL
Sbjct: 2341 SMIPFVLTGNGSWQPPSRLYDPRVHELKNMLHEETFFPSEIFSDDDILDALVSLGLNRSL 2400

Query: 2401 DLTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSIKVEGSGYELQNSMLIKS 2460
             LTG LDCARSV LLNDS+N    SYARRLFVCLDALAHKLSI VEG+ YELQ S L++S
Sbjct: 2401 GLTGFLDCARSVPLLNDSEN----SYARRLFVCLDALAHKLSINVEGNCYELQTSTLVES 2460

Query: 2461 NYVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSEMNTIAWCPICADSPLKV 2520
            +YVDDD SMEVGSLN EDTSDMG DSLIGNLTGDESEEEFWSEM TIAWCP+CADSP+KV
Sbjct: 2461 DYVDDDTSMEVGSLNREDTSDMGIDSLIGNLTGDESEEEFWSEMKTIAWCPVCADSPVKV 2520

Query: 2521 LPWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQHKLGWTDCPRVEVLCAQL 2580
            LPWLKT N+VAPPSIVRPKSQM MVSSSMHILDGV PS YLQHKLGWTDCPRVEVLCAQL
Sbjct: 2521 LPWLKTCNKVAPPSIVRPKSQMCMVSSSMHILDGVLPSVYLQHKLGWTDCPRVEVLCAQL 2580

Query: 2581 TDISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDESVLLKSALNGVSWVWVG 2640
            TDISKLYGELRLHSSLEPDINTALQEGI ILYSKLQEY GTD+ VLLKSALNGVSWVWVG
Sbjct: 2581 TDISKLYGELRLHSSLEPDINTALQEGITILYSKLQEYRGTDDFVLLKSALNGVSWVWVG 2640

Query: 2641 DDFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVEGYLDVLQRLHSDV 2700
            DDFV PSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNV+ YLDVLQRLH DV
Sbjct: 2641 DDFVSPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVKDYLDVLQRLHKDV 2700

Query: 2701 EGSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSSQVLMQANDLVYNDAPWM 2760
            +GSPLSTDQM+FVIC+LEAI DCC+DKPEFTATS  LLIPNSSQVLM ANDLVYNDAPWM
Sbjct: 2701 KGSPLSTDQMNFVICVLEAILDCCMDKPEFTATSIPLLIPNSSQVLMLANDLVYNDAPWM 2760

Query: 2761 EDNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDLPCMDYAKISELLMLYGN 2820
            E+NNILVGKHFVHPSIS+DLASRLGVQSIRCLSLVDEEMTKDLPCM+YAKISELL LYG+
Sbjct: 2761 EENNILVGKHFVHPSISHDLASRLGVQSIRCLSLVDEEMTKDLPCMEYAKISELLKLYGD 2820

Query: 2821 DYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLNTEEI 2880
            DYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSL+TEEI
Sbjct: 2821 DYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLSTEEI 2880

Query: 2881 SSLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIFDPRGIALSVAPKSAPGA 2940
            S LQFRPPWKLRGDTLNYGLGLLSCYYVCDLL I+SGGYFYIFDPRGIALSVAPKSAPGA
Sbjct: 2881 SGLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLYIISGGYFYIFDPRGIALSVAPKSAPGA 2940

Query: 2941 KVFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPACLKDGLEPGIRKIKEIS 3000
            K+FSLIGSNLIE+F DQFHP+LGGQNMSWPSDSTI+RMPLS ACLKDGLE GI +IKEIS
Sbjct: 2941 KMFSLIGSNLIERFKDQFHPLLGGQNMSWPSDSTIIRMPLSSACLKDGLESGIERIKEIS 3000

Query: 3001 SKFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLSSAIARNPFSEKKWKKFQ 3060
            SKFLDHASRSLLFLKSV+QVSFSTWDQGGL+PYQDYS C+NLSSAIARNPFSEKKWKKFQ
Sbjct: 3001 SKFLDHASRSLLFLKSVLQVSFSTWDQGGLNPYQDYSACVNLSSAIARNPFSEKKWKKFQ 3060

Query: 3061 LSRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPV 3120
            LSRLFSSSNAATK+H ID+I+FQG+ QFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPV
Sbjct: 3061 LSRLFSSSNAATKVHAIDVILFQGDAQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPV 3120

Query: 3121 AGVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFLVCHSGGRYLFKNQVLEA 3180
            AG+AAHISRNGLPADI  KSPLMAP PLSGDI LPVTVLGCFLVCH+GGRYLFKNQVLEA
Sbjct: 3121 AGLAAHISRNGLPADIYLKSPLMAPFPLSGDIILPVTVLGCFLVCHNGGRYLFKNQVLEA 3180

Query: 3181 VAAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSSSALESNVSHSISSSLKA 3240
               PLDAGNKLVEAWNRELMSCVCDSYI+M+LEIHKQRKESSSSALESNV+HSISSSLKA
Sbjct: 3181 FVEPLDAGNKLVEAWNRELMSCVCDSYIFMVLEIHKQRKESSSSALESNVTHSISSSLKA 3240

Query: 3241 YGNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNLVK 3300
            YGNQVYSFWPRS   NGSDSDLDRGLKADWECLVEQVIRPFY RAIDLPVWQLYSGNLVK
Sbjct: 3241 YGNQVYSFWPRS--GNGSDSDLDRGLKADWECLVEQVIRPFYARAIDLPVWQLYSGNLVK 3300

Query: 3301 AEEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVR 3360
            AEEGMFLAQPGSPVG NLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVR
Sbjct: 3301 AEEGMFLAQPGSPVGDNLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVR 3360

Query: 3361 DLLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMGADSVNTNPGGRSTNTSE 3420
            DLLRVSSASIVLQSIDTYLDVLEYCLSDI+LA  SNHAEDS+GADS+NTNPGGRSTNT+E
Sbjct: 3361 DLLRVSSASIVLQSIDTYLDVLEYCLSDIVLATSSNHAEDSIGADSINTNPGGRSTNTTE 3420

Query: 3421 GSSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGRSGDSLSHS 3480
               TS+ VSS++SFAR SNQN ASSGDALEMMTSLGRALLDFGRGVVEDIGRSG+S  HS
Sbjct: 3421 DGPTSIPVSSVHSFARSSNQNGASSGDALEMMTSLGRALLDFGRGVVEDIGRSGESSFHS 3480

Query: 3481 NTFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSMELWLGSKDQQELMIPLA 3540
            NTF GRNN SYRNVDQ+FLQMVSE+KGLPFP+ASNNLVRLGSMELWLGSKDQQELMIPLA
Sbjct: 3481 NTFNGRNN-SYRNVDQHFLQMVSELKGLPFPTASNNLVRLGSMELWLGSKDQQELMIPLA 3540

Query: 3541 ARFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRSVFHANWVNHVMNSNMSP 3600
            A+F+HPK+FDRSILGNILTNDALHKFLKLQKFSL+LLAT+MRSVFHANWVNHVMNSNM+P
Sbjct: 3541 AKFLHPKIFDRSILGNILTNDALHKFLKLQKFSLNLLATHMRSVFHANWVNHVMNSNMAP 3600

Query: 3601 WFSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPLVPAFLGRPILCRVRERH 3660
            WFSW+NK  SGVEEGP+SEWIR+FWKN   +SQDLLLFSDWPL+PAFLGRPILCRVRERH
Sbjct: 3601 WFSWDNKLSSGVEEGPTSEWIRIFWKNFSGTSQDLLLFSDWPLIPAFLGRPILCRVRERH 3660

Query: 3661 LVFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPYTSAFQKFQDTYPWLFPL 3720
            LVFLPPV + V PN+ILEIGAGGSDVAETS S ISKPES+QPYT AFQ+FQD YPWLFPL
Sbjct: 3661 LVFLPPVAHSVSPNSILEIGAGGSDVAETSLSDISKPESLQPYTLAFQRFQDMYPWLFPL 3720

Query: 3721 LNHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANNAGYFPELASLSDSNSNE 3780
            LNHCNIPIFDVAFMDCA+LCNC+ NSGQ LGQ+IAS FVAA NAGYFPELASLSDSNS+E
Sbjct: 3721 LNHCNIPIFDVAFMDCAALCNCVPNSGQPLGQVIASKFVAAKNAGYFPELASLSDSNSDE 3780

Query: 3781 LLNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDNDQCMISSNSFLKPYNDC 3840
            LL LFA DF SNGTNYGR+ELEILR LPIYRTVVGSYTQLR+NDQCMISSNSFLKP N+C
Sbjct: 3781 LLKLFAKDFASNGTNYGREELEILRTLPIYRTVVGSYTQLRENDQCMISSNSFLKPNNEC 3840

Query: 3841 CLSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSEQEDILIYLFTNWQDLQA 3900
            CLSYSSNSMEYSLLRALRVPELD++QILI+FG P +DCKPQSEQEDILIYL+TNW+DLQA
Sbjct: 3841 CLSYSSNSMEYSLLRALRVPELDDEQILIKFGFPGYDCKPQSEQEDILIYLYTNWRDLQA 3900

Query: 3901 DAHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFSGERKKFPGERFAADGWL 3960
            +A LVECLSET FVRSADEFCTDLFKSKELYDPSDALLTSVFS ERKKFPGERFAADGWL
Sbjct: 3901 NARLVECLSETKFVRSADEFCTDLFKSKELYDPSDALLTSVFSDERKKFPGERFAADGWL 3960

Query: 3961 RILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDLINGQNEVPMEVWTLAGS 4020
             ILRK+GLRTT EANVILECAKKVETLGSEWRKSEEDG EFDL N Q+EVPME+WTLAGS
Sbjct: 3961 HILRKVGLRTTAEANVILECAKKVETLGSEWRKSEEDGFEFDLTNAQSEVPMEIWTLAGS 4020

Query: 4021 VVEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGKRVLTSYGDGIVSKDWPL 4080
            VVEAVFSNFAVFYSN+FCNALGNI FVPA+LGFPNLGGNKGGKRVL SY D I SKDWPL
Sbjct: 4021 VVEAVFSNFAVFYSNSFCNALGNIVFVPAELGFPNLGGNKGGKRVLASYSDAIASKDWPL 4080

Query: 4081 AWSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTLAHWPISMGVM 4140
            AWSCAPILSKH VIPP+YSWGALNL+SPPAFP VLKHLQVIGRNGGEDTL+HWPIS+G+M
Sbjct: 4081 AWSCAPILSKHCVIPPEYSWGALNLKSPPAFPKVLKHLQVIGRNGGEDTLSHWPISVGIM 4140

Query: 4141 SINEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSPFA 4200
            SINEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSP A
Sbjct: 4141 SINEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSPIA 4200

Query: 4201 FELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQRLNPNELRSVMEILHYIC 4260
            F+LPSGYLPFVKIL+DLGLQD LSVASAKDLLSSLQVACGYQRLNPNELRSVMEILH+IC
Sbjct: 4201 FQLPSGYLPFVKILRDLGLQDGLSVASAKDLLSSLQVACGYQRLNPNELRSVMEILHFIC 4260

Query: 4261 DEAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVHSDLPER 4320
            DEA EAK+F GREPE+IVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVH DLPE+
Sbjct: 4261 DEATEAKIFYGREPEVIVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVHPDLPEK 4320

Query: 4321 ICRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLLSRSFQNAVWNVVNSMVN 4380
            ICR+LGIKKLS+LVIEELDHEDSIEPLE IGAVSL  I++KLLSRSFQ+AVWNV NSMV+
Sbjct: 4321 ICRILGIKKLSELVIEELDHEDSIEPLECIGAVSLGLIKEKLLSRSFQSAVWNVANSMVS 4380

Query: 4381 YIHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDITRPAKDSIIPEWKDGIH 4440
            Y H NKNLDL+AVE+LLKS AERLQFVK LHTRFLLLPNSI+ITRP+KDSIIPEW+DG H
Sbjct: 4381 YTHTNKNLDLEAVEELLKSFAERLQFVKCLHTRFLLLPNSINITRPSKDSIIPEWEDGRH 4440

Query: 4441 HRALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPLPVGSLLFCPEGTEIAII 4500
            HRALYFV  SKTCILVAEPPA ISIFDV+AIVVSQILGSPIPLP+GSL FCPEG E AI+
Sbjct: 4441 HRALYFVKQSKTCILVAEPPACISIFDVLAIVVSQILGSPIPLPIGSLFFCPEGIETAIV 4500

Query: 4501 NILKLCSEKENEQFTGISSLLGKEILPQDALQLQLHPLRPFYAAEVVAWRSQSGEKLKYG 4560
            +ILKLCSEK NE+FTGISSL+GKEILPQDALQ+QLHPLRPFYA EVVAWRSQSGEKLKYG
Sbjct: 4501 DILKLCSEKTNEKFTGISSLIGKEILPQDALQIQLHPLRPFYAGEVVAWRSQSGEKLKYG 4560

Query: 4561 RVPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISIDGSHSSTNLQDSGHMIID 4620
             V EDVRPSAGQALY+FRVET  GI QSL+SSQVLSFRSISIDG  SS+NL DS HM+ID
Sbjct: 4561 MVLEDVRPSAGQALYRFRVETTSGIIQSLLSSQVLSFRSISIDGGPSSSNLLDSSHMVID 4620

Query: 4621 SGASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLSTAGINVDIERQSLLQKT 4680
            +G+SV+MPENSE GKIRSQP AELQYG+VSAEELVQAVHEML+TAGINVDIERQSLLQKT
Sbjct: 4621 NGSSVQMPENSESGKIRSQPAAELQYGKVSAEELVQAVHEMLTTAGINVDIERQSLLQKT 4680

Query: 4681 VVLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLCRKC 4740
            + LQEQLKDSQAALLLEQE+SDAAAKEADTAKAAW+CRVCLTSEVEITIVPCGHVLCR+C
Sbjct: 4681 ITLQEQLKDSQAALLLEQEKSDAAAKEADTAKAAWVCRVCLTSEVEITIVPCGHVLCRRC 4739

Query: 4741 SAAVSK 4746
            S+AVSK
Sbjct: 4741 SSAVSK 4739

BLAST of Moc09g05800 vs. ExPASy Swiss-Prot
Match: O82660 (Photosystem II stability/assembly factor HCF136, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=HCF136 PE=1 SV=1)

HSP 1 Score: 628.2 bits (1619), Expect = 8.0e-178
Identity = 306/386 (79.27%), Positives = 348/386 (90.16%), Query Frame = 0

Query: 4799 LFAPSLSQPQPQPQPRTFTTSTPRASLQNSSINRRQFVAETAAAVSLSLSPLIAPVQPAK 4858
            L   + S P P P P + ++S         S +RR+ + + +AAVSLSLS ++    PA+
Sbjct: 30   LIPKASSSPPPSPSPSSSSSSL--------SFSRRELLYQ-SAAVSLSLSSIVG---PAR 89

Query: 4859 SEEALSEWERLFLPIDPGVVLLDIAFVPDDLDHGFLLGTRQTILETKDGGRTWAPRTIPS 4918
            ++E LSEWER+FLPIDPGVVLLDIAFVPD+   GFLLGTRQT+LETKDGG TW PR+IPS
Sbjct: 90   ADEQLSEWERVFLPIDPGVVLLDIAFVPDEPSRGFLLGTRQTLLETKDGGSTWNPRSIPS 149

Query: 4919 AEEEDFNYRFNSISFKGKEGWIVGKPAILLYTSDAGESWERIPLSAQLPGDMVYIKATGE 4978
            AEEEDFNYRFNSISFKGKEGWI+GKPAILLYT+DAGE+W+RIPLS+QLPGDMV+IKAT +
Sbjct: 150  AEEEDFNYRFNSISFKGKEGWIIGKPAILLYTADAGENWDRIPLSSQLPGDMVFIKATED 209

Query: 4979 KSAEMVTDEGAIYVTSNKGYNWKAAVQETVSATLNRTVSSGISGASYYTGTFNTVNRSPD 5038
            KSAEMVTDEGAIYVTSN+GYNWKAA+QETVSATLNRTVSSGISGASYYTGTF+ VNRSPD
Sbjct: 210  KSAEMVTDEGAIYVTSNRGYNWKAAIQETVSATLNRTVSSGISGASYYTGTFSAVNRSPD 269

Query: 5039 GRYVAVSSRGNFYLTWEPGQPFWQPHNRAIARRIQNMGWRADGGLWLLVRGGGLFLSKGT 5098
            GRYVAVSSRGNF+LTWEPGQP+WQPHNRA+ARRIQNMGWRADGGLWLLVRGGGL+LSKGT
Sbjct: 270  GRYVAVSSRGNFFLTWEPGQPYWQPHNRAVARRIQNMGWRADGGLWLLVRGGGLYLSKGT 329

Query: 5099 GISEEFEEVPVQSRGFGILDVGYRSTEEAWAAGGSGILLKTTNGGRTWSRDKAADNIAAN 5158
            GI+EEFEEVPVQSRGFGILDVGYRS EEAWAAGGSGILL+T NGG++W+RDKAADNIAAN
Sbjct: 330  GITEEFEEVPVQSRGFGILDVGYRSEEEAWAAGGSGILLRTRNGGKSWNRDKAADNIAAN 389

Query: 5159 LYSVKFINDKKGFVLGNDGVLLQYLG 5185
            LY+VKF++DKKGFVLGNDGVLL+Y+G
Sbjct: 390  LYAVKFVDDKKGFVLGNDGVLLRYVG 403

BLAST of Moc09g05800 vs. ExPASy Swiss-Prot
Match: Q5Z5A8 (Photosystem II stability/assembly factor HCF136, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=HCF136 PE=3 SV=1)

HSP 1 Score: 616.7 bits (1589), Expect = 2.4e-174
Identity = 304/390 (77.95%), Positives = 336/390 (86.15%), Query Frame = 0

Query: 4821 PRASLQNSSINRRQFVAETA-AAVSLSLSPLIAPVQP---AKSEEALSEWERLFLPIDPG 4880
            PRA   + S  RR+F+A+TA A+ + ++ PL+ P  P   A    +LSEWER+ LPIDPG
Sbjct: 27   PRAHTDSISTGRRRFIADTATASAAAAVGPLVLPRTPLARADQPPSLSEWERVLLPIDPG 86

Query: 4881 VVLLDIAFVPDDLDHGFLLGTRQTILETKDGGRTWAPRTIPSAEEEDFNYRFNSISFKGK 4940
            VVLLDIAFVPDD  HGFLLGTRQTILETK+GG TW PR+IPSAE+EDFNYRFNS+SF GK
Sbjct: 87   VVLLDIAFVPDDPSHGFLLGTRQTILETKNGGNTWFPRSIPSAEDEDFNYRFNSVSFMGK 146

Query: 4941 EGWIVGKPAILLYTSDAGESWERIPLSAQLPGDMVYIKATGEKSAEMVTDEGAIYVTSNK 5000
            EGWI+GKPAILL+TSDAG+SWERIPLSAQLPG+MVYIKATGE+SAEMVTDEGAIYVTSN+
Sbjct: 147  EGWIIGKPAILLHTSDAGDSWERIPLSAQLPGNMVYIKATGEQSAEMVTDEGAIYVTSNR 206

Query: 5001 GYNWKAAVQETVSATLNRTVSSGISGASYYTGTFNTVNRSPDGRYVAVSSRGNFYLTWEP 5060
            GYNWKAAVQETVSATLNRTVSSGISGASYYTGTFNTVNRSPDGRYVAVSSRGNFYLTWEP
Sbjct: 207  GYNWKAAVQETVSATLNRTVSSGISGASYYTGTFNTVNRSPDGRYVAVSSRGNFYLTWEP 266

Query: 5061 GQPFWQPHNRAIARRIQNMGWRADGGLWLLVRGGGLFLSKGTG----------------- 5120
            GQPFWQPHNRA+ARRIQNMGWRADGGLWLLVRGGGLFLSKG+G                 
Sbjct: 267  GQPFWQPHNRAVARRIQNMGWRADGGLWLLVRGGGLFLSKGSGFQFFYRGLNDAHAISYL 326

Query: 5121 -----ISEEFEEVPVQSRGFGILDVGYRSTEEAWAAGGSGILLKTTNGGRTWSRDKAADN 5180
                 I+E+FEE  VQSRGFGILDVGYRS +EAWAAGGSG+LLKTTNGG+TW RDKAADN
Sbjct: 327  HPPNQITEDFEEASVQSRGFGILDVGYRSKDEAWAAGGSGVLLKTTNGGKTWVRDKAADN 386

Query: 5181 IAANLYSVKFINDKKGFVLGNDGVLLQYLG 5185
            IAANLYSVKF+ D KG+VLGNDGVLL+Y+G
Sbjct: 387  IAANLYSVKFLGDNKGYVLGNDGVLLRYVG 416

BLAST of Moc09g05800 vs. ExPASy Swiss-Prot
Match: Q9NZJ4 (Sacsin OS=Homo sapiens OX=9606 GN=SACS PE=1 SV=2)

HSP 1 Score: 520.8 bits (1340), Expect = 1.8e-145
Identity = 548/2043 (26.82%), Positives = 880/2043 (43.07%), Query Frame = 0

Query: 16   FGQKV-DLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLDRRVHGSESLLSESL 75
            FGQ    L   ++++L  YPEG  +LKEL+QNA+DAGAT+V    D   +G+E+L S+ +
Sbjct: 89   FGQTTPPLVDFLKDILRRYPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDM 148

Query: 76   APFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGFNSVYHLTELPSFVS 135
            AP+QGPAL  YNNAVFT ED+  I  I  S K     K GRFG+GFNSVYH+T++P   S
Sbjct: 149  APYQGPALYVYNNAVFTPEDWHGIQEIARSRKKDDPLKVGRFGIGFNSVYHITDVPCIFS 208

Query: 136  GKYVVMFDP-QGIYLPKVSAANPGKRIDFIRCSAISQYRDQFLPYCAF-----DCDMESS 195
            G  + M DP Q ++ P  S      + D      IS+  DQF P+        +  +  +
Sbjct: 209  GDQIGMLDPHQTLFGPHESGQCWNLKDD---SKEISELSDQFAPFVGIFGSTKETFINGN 268

Query: 196  FDGTLFRLPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSVSCIEMFVW 255
            F GT FR PLR       S++S   Y ++ +  +F     +    LLFLKSV  + ++V 
Sbjct: 269  FPGTFFRFPLR----LQPSQLSSNLYNKQKVLELFESFRADADTVLLFLKSVQDVSLYV- 328

Query: 256  NDGEAEPQKLYSFSVRSASSDIIWHRQ---------MLLRLSKSTTSTQSEVDSFSLDFL 315
               EA+  +   F V S+ S  + H +          +    K T S      ++ ++ +
Sbjct: 329  --READGTEKLVFRVTSSESKALKHERPNSIKILGTAISNYCKKTPSNNITCVTYHVNIV 388

Query: 316  SQATTGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSSK 375
             +  +    ++   S+ +  ++      ISS   + + E     +   ++ + + DD +K
Sbjct: 389  LEEESTKDAQK--TSWLVCNSVGG--RGISSKLDSLADELKFVPIIGIAMPLSSRDDEAK 448

Query: 376  TNVLKL-GRAFCFLPLP----VKTGLNVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWN 435
                   G+AFCFLPLP      TGL V ++GFF ++ NRR I +  ++D+     ++WN
Sbjct: 449  GATSDFSGKAFCFLPLPPGEESSTGLPVHISGFFGLTDNRRSIKW-RELDQWRDPAALWN 508

Query: 436  RLLLEDIIAPSFIELLIG---------VQVFLGPTDTYFSLWPSGS-FEEPWNILVEQVY 495
              L+ +++  ++  L++             F    D  + LWP  S  +  W  ++E ++
Sbjct: 509  EFLVMNVVPKAYATLILDSIKRLEMEKSSDFPLSVDVIYKLWPEASKVKVHWQPVLEPLF 568

Query: 496  KNISNALVLYSNVEGGKWVSPIQAFLH--DDKFTRSKELGEALVLLGMPIVHLPENL--S 555
              +    V+YS      WV   Q +    D+    +K +   L   G  I  +P N+  +
Sbjct: 569  SELLQNAVIYS--ISCDWVRLEQVYFSELDENLEYTKTVLNYLQSSGKQIAKVPGNVDAA 628

Query: 556  NMLLKFCRAFQQKVVTPCTVRHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFD 615
              L         + VTP  VR  LR+C H+       KL LLE+ + D       ++   
Sbjct: 629  VQLTAASGTTPVRKVTPAWVRQVLRKCAHLGCAEE--KLHLLEFVLSD----QAYSELLG 688

Query: 616  LPLIPLANGDFGLFSE--ASKGISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIA 675
            L L+PL NG+F  FS   + + + Y    E   +L   +  R I  N+  ++   L   A
Sbjct: 689  LELLPLQNGNFVPFSSSVSDQDVIYITSAEYPRSLFPSLEGRFILDNLKPHLVAALKEAA 748

Query: 676  RSSK---SNIFIFNVHYFLQLFPKFVPADWKYKNEVL-WDP-ESCSNHPTSSWFSLFWR- 735
            ++     + + + N   F +L  + +   W  +  ++ W P +   NHP+ SW  + W+ 
Sbjct: 749  QTRGRPCTQLQLLNPERFARLIKEVMNTFWPGRELIVQWYPFDENRNHPSVSWLKMVWKN 808

Query: 736  -YLHDRCEKLSLFSDWPILP-------SKSRYLYRATKKSKLI----NVQMLSNEMQKIL 795
             Y+H   E L+LF + P++P            L R    S +I    +   L   +  I+
Sbjct: 809  LYIH-FSEDLTLFDEMPLIPRTILEEGQTCVELIRLRIPSLVILDDESEAQLPEFLADIV 868

Query: 796  SKLG---CKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAISSTGGLMLTSLYSLEVEE 855
             KLG    K LD    ++H  +  Y++    + VL  +          +   + SL    
Sbjct: 869  QKLGGFVLKKLDA--SIQHPLIKKYIHSPLPSAVLQIMEKMPLQK---LCNQITSLLPTH 928

Query: 856  KDGLRRFLLDPKWYLGGCMNDSDLEK--CKRLPIYKVYNGGSAQDFGFSDLENPQKYLPP 915
            KD LR+F       L    + S+ EK   + L I+K  N  S  D G S     +     
Sbjct: 929  KDALRKF-------LASLTDSSEKEKRIIQELAIFKRINHSS--DQGISSYTKLKGCKVL 988

Query: 916  SDVGEFFLGVEFIVSSFDSEVEILLKYYGIKKMG--KASFYRKHVLNQVEQLQPELRDST 975
                +    +   +S  DS  E  ++   + K+   K +   K VL  +E       + T
Sbjct: 989  HHTAKLPADLRLSISVIDSSDEATIRLANMLKIEQLKTTSCLKLVLKDIENAFYSHEEVT 1048

Query: 976  --MLSVLKNLPQLCVEDVAFRECLSNLAFVPTSSGTLKCPTVLYDPRYEELCALL--DDF 1035
              ML VL+NL  L  E+    E L+ L F+  S   +     L+DP  E L  L   ++ 
Sbjct: 1049 QLMLWVLENLSSLKNENPNVLEWLTPLKFIQISQEQMVSAGELFDPDIEVLKDLFCNEEG 1108

Query: 1036 DSFPSTPFNESYILDILQGLGLR--TCVAPETIVQSAQHVERLM---HKDHNKAHSRGKV 1095
              FP + F    IL  L+ +GL+    +  + +VQ A+ +E L      D +    + K 
Sbjct: 1109 TYFPPSVFTSPDILHSLRQIGLKNEASLKEKDVVQVAKKIEALQVGACPDQDVLLKKAKT 1168

Query: 1096 LLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPRNFNSDLEKFW-NDLCKISWCPV 1155
            LL  L  N     L   +E +  + ++    A   RP N+   L   W  DLC       
Sbjct: 1169 LLLVLNKN---HTLLQSSEGKMTLKKIKWVPACKERPPNYPGSL--VWKGDLC------- 1228

Query: 1156 LLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWSSPPG 1215
                           ++ APP +  +   + L+ +S+ ++  E     L  +LG  + P 
Sbjct: 1229 ---------------NLCAPPDMCDVGHAI-LIGSSLPLV--ESIHVNLEKALGIFTKPS 1288

Query: 1216 GSIIAAQLLELGKNNEIVYDQVLRK--------ELALAMPRIYALLTGLI--GSDEMDVV 1275
             S +        K+ +IV D    K        +    +  IY  +   +  G D    +
Sbjct: 1289 LSAVL-------KHFKIVVDWYSSKTFSDEDYYQFQHILLEIYGFMHDHLNEGKDSFRAL 1348

Query: 1276 KAVLEGCRWIWVGDGFATSEEVVLDGPLH---LAPYIRVIPIDLAVFKDLFLELGIREFL 1335
            K       W+W G  F    + V+  P+H   L PY+  +P  +A F  LF   G  E L
Sbjct: 1349 K-----FPWVWTGKKFCPLAQAVIK-PIHDLDLQPYLHNVPKTMAKFHQLFKVCGSIEEL 1408

Query: 1336 KPNDYADILSRMAIKKG---SSPLNAQEVRAAILIVQHLAEAQL---PKQQINIYLPDIS 1395
              +  + ++ ++ +K     S   + Q +   + I++ L   Q+   P   + I+     
Sbjct: 1409 TSDHISMVIQKIYLKSDQDLSEQESKQNLHLMLNIIRWLYSNQIPASPNTPVPIHHSKNP 1468

Query: 1396 GRLL--PASNLVYNDAPWLLGTDD-TDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKL 1455
             +L+  P     Y D    +  DD  D+  D    ++L         VH +I    AE L
Sbjct: 1469 SKLIMKPIHECCYCD----IKVDDLNDLLEDSVEPIIL---------VHEDIPMKTAEWL 1528

Query: 1456 GV-CSLRRILLAESADSMNLSLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNA 1515
             V C   R++  E+            E  GQ E LT R+++ILE Y     I  EL+QNA
Sbjct: 1529 KVPCLSTRLINPENM---------GFEQSGQREPLTVRIKNILEEYPSVSDIFKELLQNA 1588

Query: 1516 EDAGASEVVFLLD--KTHYGTSSILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQES 1575
            +DA A+E  FL+D  +      ++L P MA   GPAL+ +N+S FS  D   I+R+G+  
Sbjct: 1589 DDANATECSFLIDMRRNMDIRENLLDPGMAACHGPALWSFNNSQFSDSDFVNITRLGESL 1648

Query: 1576 KLQKPLSIGRFGLGFNCVYHFTDIPTFVSGENIVMFDPHACNLPGIS-----PSHPGLRI 1635
            K  +   +G+FGLGFN VYH TDIP  +S E ++MFDP   N+  IS      S+PG++I
Sbjct: 1649 KRGEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDP---NINHISKHIKDKSNPGIKI 1708

Query: 1636 KYA-GRRILEQFPDQFSPYLH-FGCDM----QKPFP--GTLFRFPLRSPALASRSEIKKE 1695
             ++  ++ L +FP+QF P++  FGC +    + P+   GTLFR   R+   A  SE+   
Sbjct: 1709 NWSKQQKRLRKFPNQFKPFIDVFGCQLPLTVEAPYSYNGTLFRLSFRTQQEAKVSEVSST 1768

Query: 1696 GYAPEDVTSLFYSFSEVASDALLFLTNVKKISIFIKDDIEHDMQCLYRVHKNTVSEPSTE 1755
             Y   D+ SL   FS      ++F  +VK  S+++K     +           + + S  
Sbjct: 1769 CYNTADIYSLVDEFSLCGHRLIIFTQSVK--SMYLKYLKIEETNPSLAQDTVIIKKKSCS 1828

Query: 1756 SSAKQDIISFIYGNRQGEMDREQFLMKLSKSINRDLPYKCQKLIITEKSSSGDILQ---- 1815
            S A    +  +       +     LMK   S N+ LP        +++  S  ILQ    
Sbjct: 1829 SKALNTPVLSV-------LKEAAKLMKTCSSSNKKLP--------SDEPKSSCILQITVE 1888

Query: 1816 ---HYWITSGCLGGGLPRN------------NSGLGDKSYNFIPW---ACVAALLHSVQV 1875
               H +     L   L R              SG   K  + +      C   LL +   
Sbjct: 1889 EFHHVFRRIADLQSPLFRGPDDDPAALFEMAKSGQSKKPSDELSQKTVECTTWLLCTCMD 1948

Query: 1876 DGE---MNYDPETENNWLVASDLVQVSSASIEGR----KPFEGRAFCFLPLPVRTGLPVH 1913
             GE    +         LV    V V  + I+ +    KP  G  FC+LPL ++TGLPVH
Sbjct: 1949 TGEALKFSLSESGRRLGLVPCGAVGVQLSEIQDQKWTVKPHIGEVFCYLPLRIKTGLPVH 2000

BLAST of Moc09g05800 vs. ExPASy Swiss-Prot
Match: Q9JLC8 (Sacsin OS=Mus musculus OX=10090 GN=Sacs PE=1 SV=2)

HSP 1 Score: 508.1 bits (1307), Expect = 1.2e-141
Identity = 546/2068 (26.40%), Positives = 889/2068 (42.99%), Query Frame = 0

Query: 16   FGQKV-DLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLDRRVHGSESLLSESL 75
            FGQ    L   ++++L  YPEG  +LKEL+QNA+DAGAT+V    D   +G+E+L S+ +
Sbjct: 89   FGQTTPPLVDFLKDILRRYPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDM 148

Query: 76   APFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGFNSVYHLTELPSFVS 135
            A +QG AL  YNNAVFT ED+  I  I  S K     K GRFG+GFNSVYH+T++P   S
Sbjct: 149  AQYQGSALYVYNNAVFTPEDWHGIQEIARSRKKDDPLKVGRFGIGFNSVYHITDVPCIFS 208

Query: 136  GKYVVMFDP-QGIYLPKVSAANPGKRIDFIRCSAISQYRDQFLPYC-AFDCDMES----S 195
            G  + M DP Q ++ P  S      + D      I++  DQF P+   F    E+    S
Sbjct: 209  GDQIGMLDPHQTLFGPHESGQCWNLKDDI---KEINELPDQFAPFIGVFGSTKETFTNGS 268

Query: 196  FDGTLFRLPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSVSCIEMFVW 255
            F GT FR PLR       S++S   YT++ +  +F     +    LLFLKSV  + + V 
Sbjct: 269  FPGTFFRFPLR----LQPSQLSSNLYTKQKVLELFDSFRADADTVLLFLKSVQAVSLHV- 328

Query: 256  NDGEAEPQKLYSFSVRSASSDIIWHRQ---------MLLRLSKSTTSTQSEVDSFSLDFL 315
               EA+  +   F V ++ +  + H +          +    K   S      ++ ++ +
Sbjct: 329  --READGTEKLVFRVTASENKALKHERPNSIKILGTAISNYCKKIPSNSVTCVTYHINIV 388

Query: 316  SQATTGTQIEERIDSFFIVQTMA--SATSRISSFAATASKEYDIHLLPWASLAVCTSDDS 375
             +  +    ++   S+ +  ++     +S++ S A       ++  +P   LA+  S   
Sbjct: 389  LEDESTKDAQK--TSWLVCNSVGGRGISSKLDSLAD------ELKFVPIIGLAMPLSGKD 448

Query: 376  SKTNVLK--LGRAFCFLPLP----VKTGLNVQVNGFFEVSSNRRGIWYGADMDRSGKIRS 435
             +   +    G+AFCFLPLP     +TGL V ++GFF ++ NRR I +  ++D+     +
Sbjct: 449  EENGAISDFSGKAFCFLPLPPGEESRTGLPVHISGFFGLTDNRRSIKW-RELDQWRDPAA 508

Query: 436  IWNRLLLEDIIAPSFIELLIG---------VQVFLGPTDTYFSLWPSGS-FEEPWNILVE 495
            +WN  L+ +++  ++  L++             F    DT + LWP  S  +  W+ ++ 
Sbjct: 509  LWNEYLIVNVVPKTYATLILDSIKRLETEKSSDFPLSVDTIYKLWPEASKVKAHWHPVLG 568

Query: 496  QVYKNISNALVLYSNVEGGKWVSPIQAFLH--DDKFTRSKELGEALVLLGMPIVHLPENL 555
             ++  +    V+YS   GG+WV   Q      D     ++ +   L   G  I  +P NL
Sbjct: 569  PLFSELFQHAVIYS--IGGEWVKLEQVHFSELDGSLESTRSVLNYLQSSGKQIAKVPGNL 628

Query: 556  S-----NMLLKFCRAFQQKVVTPCTVRHFLRECKHVATLNRPYKLVLLEYCIEDLIDADV 615
            +     +       A   + VTP  VR  LR+C H+ +     KL LLE+ + D      
Sbjct: 629  AAAVQLSAASATSSASPVRKVTPAWVRQVLRKCAHLGSAEE--KLHLLEFVLSD----QA 688

Query: 616  CAQAFDLPLIPLANGDFGLFSE--ASKGISYFICDELEYTLLHQISDRVIDRNIPLNIST 675
             ++   L L+PL +G F  FS   + + + Y   +E   +L   +  R+I  N+  ++  
Sbjct: 689  YSELLGLELLPLQSGAFVPFSSSVSDQDVVYITSEEFPRSLFPGLEARLILENLKPHLLA 748

Query: 676  RLSNIARSSK---SNIFIFNVHYFLQLFPKFVPADWKYKNEVL-WDPES-CSNHPTSSWF 735
             L   A++     + + + N   F +L  + +   W  +  V+ W P S    HP+ SW 
Sbjct: 749  ALKEAAQTRGRPCTQLQLLNPERFARLIKEVMNTFWPGRELVVQWYPFSEDKRHPSLSWL 808

Query: 736  SLFWR--YLHDRCEKLSLFSDWPILPSKSRYLYRATK------KSKLINVQMLSNE---- 795
             + W+  Y+H   E L+LF + P++P   R L    +      + ++ +V +L +E    
Sbjct: 809  KMVWKNLYIH-FSEDLTLFDEMPLIP---RTLLNEDQTCVELIRLRIPSVVILDDETEAQ 868

Query: 796  ----MQKILSKLG---CKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAISSTGGLMLT 855
                +  I+ KLG    K LD    ++H  +  Y++    + +L  I + I      +  
Sbjct: 869  LPEFLADIVQKLGGIVLKRLDT--SIQHPLVKKYIHSPLPSAIL-QIMEKIPLQ--KLCN 928

Query: 856  SLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLEK--CKRLPIYKVYNGGSAQDF----- 915
             + SL    KD LR+F       L    + S+ EK   + L I+K  N  S Q       
Sbjct: 929  QIASLLPTHKDALRKF-------LASLTDTSEKEKRIIQELTIFKRINHSSDQGISSYTK 988

Query: 916  --GFSDLENPQKYLPPSDVGEFFLGVEFIVSSFDSEVEI--LLKYYGIKKMGKASFYRKH 975
              G   L++  K   P+D+    L V  I SS ++ + +  +LK   +K      F  K 
Sbjct: 989  LKGCKVLDHTAKL--PTDLR---LSVSVIDSSDEATIRLANMLKIEKLKTTSCLKFVLKD 1048

Query: 976  VLNQVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVPTSSGTLKCPTVLYDP 1035
            + N     Q E+    ML +L+NL  L  E+    + L  L F+  S G +     L+DP
Sbjct: 1049 IGNAF-YTQEEV-TQLMLWILENLSSLKNENSNVLDWLMPLKFIHMSQGHVVAAGDLFDP 1108

Query: 1036 RYEELCALL--DDFDSFPSTPFNESYILDILQGLGLR--TCVAPETIVQSAQHVERLM-- 1095
              E L  L   ++   FP T F    IL  L+ +GL+  + +  + +VQ A+ +E L   
Sbjct: 1109 DIEVLRDLFYNEEEACFPPTIFTSPDILHSLRQIGLKNESSLKEKDVVQVARKIEALQVS 1168

Query: 1096 -HKDHNKAHSRGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPRNFNSDLEK 1155
              ++ +    + K LL  L  N     L   +E +  + ++    A   RP N+   L  
Sbjct: 1169 SCQNQDVLMKKAKTLLLVLNKNQ---TLLQSSEGKMALKKIKWVPACKERPPNYPGSL-- 1228

Query: 1156 FW-NDLCKISWCPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRILDGECSS 1215
             W  DLC                      ++ APP +      + LV +S+ +++    +
Sbjct: 1229 VWKGDLC----------------------NLCAPPDMCDAAHAV-LVGSSLPLVESVHVN 1288

Query: 1216 SALAHSLGWSSPPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLTGLIGSDE 1275
               A S+ ++ P   +++      +       +      +    +  IY  +   + S+ 
Sbjct: 1289 LEQALSI-FTKPTINAVLKHFKTVVDWYTSKTFSDEDYYQFQHILLEIYGFMHDHL-SEG 1348

Query: 1276 MDVVKAVLEGCRWIWVGDGFATSEEVVLDGPLH---LAPYIRVIPIDLAVFKDLFLELGI 1335
             D  KA+     W+W G  F    + V+  P H   L PY+  +P  +A F  LF   G 
Sbjct: 1349 KDSFKAL--KFPWVWTGKNFCPLAQAVIK-PTHDLDLQPYLYNVPKTMAKFHQLFKACGS 1408

Query: 1336 REFLKPNDYADILSRMAIKKG---SSPLNAQEVRAAILIVQHLAEAQL---PKQQINIYL 1395
             E L  +  + ++ ++ +K     S   + Q +   + I++ L   Q+   P   + IY 
Sbjct: 1409 IEELTSDHISMVIQKVYLKSDQELSEEESKQNLHLMLNIMRWLYSNQIPASPNTPVPIYH 1468

Query: 1396 PDISGRLL--PASNLVYNDAPWLLGTDD-TDVPFDGESTVVLNARKTVQKFVHGNISNDV 1455
                 +L+  P     Y D    +  DD  D+  D    ++L         VH +I    
Sbjct: 1469 SRNPSKLVMKPIHECCYCD----IKVDDLNDLLEDSVEPIIL---------VHEDIPMKT 1528

Query: 1456 AEKLGV-CSLRRILLAESADSMNLSLSGAAEAFGQHEALTNRLRHILEMYADGPGILFEL 1515
            AE L V C   R++  E+            E  GQ E LT R+++ILE Y     I  EL
Sbjct: 1529 AEWLKVPCLSTRLINPENM---------GFEQSGQREPLTVRIKNILEEYPSVSDIFKEL 1588

Query: 1516 IQNAEDAGASEVVFLLD--KTHYGTSSILSPEMADWQGPALYCYNDSVFSSQDLYAISRV 1575
            +QNA+DA A+E  F++D  +      ++L P MA   GPAL+ +N+S FS  D   I+R+
Sbjct: 1589 LQNADDANATECSFMIDMRRNMDIRENLLDPGMAACHGPALWSFNNSEFSDSDFLNITRL 1648

Query: 1576 GQESKLQKPLSIGRFGLGFNCVYHFTDIPTFVSGENIVMFDPHACNLPGISP-----SHP 1635
            G+  K  +   +G+FGLGFN VYH TDIP  +S E ++MFDP   N+  IS      S+P
Sbjct: 1649 GESLKRGEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDP---NINHISKHIKDRSNP 1708

Query: 1636 GLRIKYA-GRRILEQFPDQFSPYLH-FGCDM----QKPFP--GTLFRFPLRSPALASRSE 1695
            G++I ++  ++ L +FP+QF P++  FGC +    + P+   GTLFR   R+   A  SE
Sbjct: 1709 GIKINWSKQQKRLRKFPNQFKPFIDVFGCQLPLAVEAPYSYNGTLFRLSFRTQQEAKVSE 1768

Query: 1696 IKKEGYAPEDVTSLFYSFSEVASDALLFLTNVKKISIFIKD-DIEHDMQCLYRVHKNTVS 1755
            +    Y   D+ SL   FS      ++F  +V   S+++K   IE     L +       
Sbjct: 1769 VSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVN--SMYLKYLKIEETNPSLAQDTIIIKK 1828

Query: 1756 EPSTESSAKQDIISFIYGNRQGEMDREQFLMKLSKSINRDLPYKCQK----LIIT----- 1815
            +     +    ++S         +     LMK   S N+ LP    K    L IT     
Sbjct: 1829 KVCPSKALNAPVLSV--------LKEAAKLMKTCSSSNKKLPTDVPKSSCILQITVEEFH 1888

Query: 1816 --------------------------------EKSSSGDILQH-----YWITSGCLGGG- 1875
                                             K  S ++ Q       W+   C+  G 
Sbjct: 1889 HVFRRIADLQSPLFRGPDDDPATLFEMAKSGQSKKPSDELPQKTVDCTTWLICTCMDTGE 1948

Query: 1876 ---LPRNNSGLGDKSYNFIPWACVAALLHSVQVDGEMNYDPETENNWLVASDLVQVSSAS 1910
                  N SG   +    +P   V  LLH  Q           E  W V           
Sbjct: 1949 ALKFSLNESG---RRLGLVPCGAVGVLLHETQ-----------EQKWTV----------- 1999

BLAST of Moc09g05800 vs. ExPASy Swiss-Prot
Match: Q9AW48 (Photosystem II stability/assembly factor HCF136, chloroplastic OS=Guillardia theta OX=55529 GN=hcf136 PE=3 SV=1)

HSP 1 Score: 349.0 bits (894), Expect = 9.4e-94
Identity = 167/362 (46.13%), Positives = 244/362 (67.40%), Query Frame = 0

Query: 4830 INRRQFVAETAAAVSLSLSPLIAPVQPAKSEEALSEWERLFLPIDPGVVLLDIAFVPDDL 4889
            INR +F       ++  L P +  + P  +    + W ++ LP+D   VL DI F   + 
Sbjct: 42   INRTKF-------INYLLYPTLTSLYPKFTHANTTSWTKVDLPVDS--VLFDIEFTDPEC 101

Query: 4890 DHGFLLGTRQTILETKDGGRTWAPRTIPSAE-EEDFNYRFNSISFKGKEGWIVGKPAILL 4949
             HG+L+G++ T LET DGG TW PRT  + + +E+  YRF +ISF+G+EGW++GKPAI+L
Sbjct: 102  KHGWLVGSKGTFLETDDGGNTWVPRTFANLDPDEELTYRFENISFEGQEGWVIGKPAIIL 161

Query: 4950 YTSDAGESWERIPLSAQLPGDMVYIKATGEKSAEMVTDEGAIYVTSNKGYNWKAAVQETV 5009
            YT D G++W R+P+S +LPG+   IKA G +SAE+ T  GAIYVT+N G NWKA V+ET+
Sbjct: 162  YTKDGGKTWFRVPVSPKLPGEPCLIKALGSESAELTTTSGAIYVTNNAGRNWKAQVKETI 221

Query: 5010 SATLNRTVSSGISGASYYTGTFNTVNRSPDGRYVAVSSRGNFYLTWEPGQPFWQPHNRAI 5069
             +TLNRT+SSG+SGASY+TG    V R+ +G+Y+A+SSRGNFYLTWEPGQ FW P  R  
Sbjct: 222  DSTLNRTISSGVSGASYFTGNVINVIRNSEGKYLAISSRGNFYLTWEPGQDFWIPRARET 281

Query: 5070 ARRIQNMGWRADG---GLWLLVRGGGLFLSKGT----GISE-EFEEVPVQSRGFGILDVG 5129
            +RRIQ+MG+  +    G+W+  RGGGL +S        IS   FE + +++ G+GILD  
Sbjct: 282  SRRIQSMGFIQNDNQKGIWMSTRGGGLSVSTKNFDFESISSFNFENIDIKTGGYGILDAA 341

Query: 5130 YRSTEEAWAAGGSGILLKTTNGGRTWSRDKAADNIAANLYSVKFINDKKGFVLGNDGVLL 5183
            + + ++ W   G GI+  +T+ G+ W++    D ++ NLY +KF+N+ KGF+LG++G+LL
Sbjct: 342  FVNDKDIWIICGGGIVYNSTDKGKNWTKVDGIDKLSGNLYKIKFVNNNKGFILGSNGLLL 394

BLAST of Moc09g05800 vs. ExPASy TrEMBL
Match: A0A6J1DLW4 (sacsin isoform X1 OS=Momordica charantia OX=3673 GN=LOC111022177 PE=4 SV=1)

HSP 1 Score: 9549.9 bits (24780), Expect = 0.0e+00
Identity = 4745/4745 (100.00%), Positives = 4745/4745 (100.00%), Query Frame = 0

Query: 1    MASESTSLDSIFLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD 60
            MASESTSLDSIFLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD
Sbjct: 1    MASESTSLDSIFLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD 60

Query: 61   RRVHGSESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF 120
            RRVHGSESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF
Sbjct: 61   RRVHGSESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF 120

Query: 121  NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIRCSAISQYRDQFLPYCA 180
            NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIRCSAISQYRDQFLPYCA
Sbjct: 121  NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIRCSAISQYRDQFLPYCA 180

Query: 181  FDCDMESSFDGTLFRLPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV 240
            FDCDMESSFDGTLFRLPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV
Sbjct: 181  FDCDMESSFDGTLFRLPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV 240

Query: 241  SCIEMFVWNDGEAEPQKLYSFSVRSASSDIIWHRQMLLRLSKSTTSTQSEVDSFSLDFLS 300
            SCIEMFVWNDGEAEPQKLYSFSVRSASSDIIWHRQMLLRLSKSTTSTQSEVDSFSLDFLS
Sbjct: 241  SCIEMFVWNDGEAEPQKLYSFSVRSASSDIIWHRQMLLRLSKSTTSTQSEVDSFSLDFLS 300

Query: 301  QATTGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSSKT 360
            QATTGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSSKT
Sbjct: 301  QATTGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSSKT 360

Query: 361  NVLKLGRAFCFLPLPVKTGLNVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLLLED 420
            NVLKLGRAFCFLPLPVKTGLNVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLLLED
Sbjct: 361  NVLKLGRAFCFLPLPVKTGLNVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLLLED 420

Query: 421  IIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQVYKNISNALVLYSNVEGG 480
            IIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQVYKNISNALVLYSNVEGG
Sbjct: 421  IIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQVYKNISNALVLYSNVEGG 480

Query: 481  KWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNMLLKFCRAFQQKVVTPCTV 540
            KWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNMLLKFCRAFQQKVVTPCTV
Sbjct: 481  KWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNMLLKFCRAFQQKVVTPCTV 540

Query: 541  RHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLPLIPLANGDFGLFSEASKG 600
            RHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLPLIPLANGDFGLFSEASKG
Sbjct: 541  RHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLPLIPLANGDFGLFSEASKG 600

Query: 601  ISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSKSNIFIFNVHYFLQLFPKF 660
            ISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSKSNIFIFNVHYFLQLFPKF
Sbjct: 601  ISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSKSNIFIFNVHYFLQLFPKF 660

Query: 661  VPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLSLFSDWPILPSKSRYLYRA 720
            VPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLSLFSDWPILPSKSRYLYRA
Sbjct: 661  VPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLSLFSDWPILPSKSRYLYRA 720

Query: 721  TKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAIS 780
            TKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAIS
Sbjct: 721  TKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAIS 780

Query: 781  STGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLEKCKRLPIYKVYNGGSAQD 840
            STGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLEKCKRLPIYKVYNGGSAQD
Sbjct: 781  STGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLEKCKRLPIYKVYNGGSAQD 840

Query: 841  FGFSDLENPQKYLPPSDVGEFFLGVEFIVSSFDSEVEILLKYYGIKKMGKASFYRKHVLN 900
            FGFSDLENPQKYLPPSDVGEFFLGVEFIVSSFDSEVEILLKYYGIKKMGKASFYRKHVLN
Sbjct: 841  FGFSDLENPQKYLPPSDVGEFFLGVEFIVSSFDSEVEILLKYYGIKKMGKASFYRKHVLN 900

Query: 901  QVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVPTSSGTLKCPTVLYDPRYE 960
            QVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVPTSSGTLKCPTVLYDPRYE
Sbjct: 901  QVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVPTSSGTLKCPTVLYDPRYE 960

Query: 961  ELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQSAQHVERLMHKDHNKAHS 1020
            ELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQSAQHVERLMHKDHNKAHS
Sbjct: 961  ELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQSAQHVERLMHKDHNKAHS 1020

Query: 1021 RGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPRNFNSDLEKFWNDLCKISW 1080
            RGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPRNFNSDLEKFWNDLCKISW
Sbjct: 1021 RGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPRNFNSDLEKFWNDLCKISW 1080

Query: 1081 CPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWSS 1140
            CPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWSS
Sbjct: 1081 CPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWSS 1140

Query: 1141 PPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLTGLIGSDEMDVVKAVLEGC 1200
            PPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLTGLIGSDEMDVVKAVLEGC
Sbjct: 1141 PPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLTGLIGSDEMDVVKAVLEGC 1200

Query: 1201 RWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFLELGIREFLKPNDYADILS 1260
            RWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFLELGIREFLKPNDYADILS
Sbjct: 1201 RWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFLELGIREFLKPNDYADILS 1260

Query: 1261 RMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYNDAPW 1320
            RMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYNDAPW
Sbjct: 1261 RMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYNDAPW 1320

Query: 1321 LLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADSMNL 1380
            LLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADSMNL
Sbjct: 1321 LLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADSMNL 1380

Query: 1381 SLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAEDAGASEVVFLLDKTHYGTS 1440
            SLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAEDAGASEVVFLLDKTHYGTS
Sbjct: 1381 SLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAEDAGASEVVFLLDKTHYGTS 1440

Query: 1441 SILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHFT 1500
            SILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHFT
Sbjct: 1441 SILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHFT 1500

Query: 1501 DIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQK 1560
            DIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQK
Sbjct: 1501 DIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQK 1560

Query: 1561 PFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEVASDALLFLTNVKKISIFI 1620
            PFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEVASDALLFLTNVKKISIFI
Sbjct: 1561 PFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEVASDALLFLTNVKKISIFI 1620

Query: 1621 KDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQGEMDREQFLMKLSKSINRD 1680
            KDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQGEMDREQFLMKLSKSINRD
Sbjct: 1621 KDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQGEMDREQFLMKLSKSINRD 1680

Query: 1681 LPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLGDKSYNFIPWACVAALLHS 1740
            LPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLGDKSYNFIPWACVAALLHS
Sbjct: 1681 LPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLGDKSYNFIPWACVAALLHS 1740

Query: 1741 VQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAFCFLPLPVRTGLPVHVNAY 1800
            VQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAFCFLPLPVRTGLPVHVNAY
Sbjct: 1741 VQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAFCFLPLPVRTGLPVHVNAY 1800

Query: 1801 FELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRLLEKIVSEIGHSGLFSSFW 1860
            FELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRLLEKIVSEIGHSGLFSSFW
Sbjct: 1801 FELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRLLEKIVSEIGHSGLFSSFW 1860

Query: 1861 PTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQAIFPDFSFDKVYELIEAL 1920
            PTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQAIFPDFSFDKVYELIEAL
Sbjct: 1861 PTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQAIFPDFSFDKVYELIEAL 1920

Query: 1921 ADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLVD 1980
            ADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLVD
Sbjct: 1921 ADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLVD 1980

Query: 1981 LKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDEYGLLKDSVPGQLVDPGIP 2040
            LKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDEYGLLKDSVPGQLVDPGIP
Sbjct: 1981 LKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDEYGLLKDSVPGQLVDPGIP 2040

Query: 2041 EVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNARQVNWNPGHQGQPSLEWIR 2100
            EVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNARQVNWNPGHQGQPSLEWIR
Sbjct: 2041 EVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNARQVNWNPGHQGQPSLEWIR 2100

Query: 2101 LIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKADGWSENMFSLLLKVGCLFL 2160
            LIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKADGWSENMFSLLLKVGCLFL
Sbjct: 2101 LIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKADGWSENMFSLLLKVGCLFL 2160

Query: 2161 RRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFRDASESELHELRSFILQSK 2220
            RRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFRDASESELHELRSFILQSK
Sbjct: 2161 RRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFRDASESELHELRSFILQSK 2220

Query: 2221 WYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPTGLCEDFLNDDFVRMESEK 2280
            WYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPTGLCEDFLNDDFVRMESEK
Sbjct: 2221 WYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPTGLCEDFLNDDFVRMESEK 2280

Query: 2281 ERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTILHDVKLLIEEDVSLKSSVS 2340
            ERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTILHDVKLLIEEDVSLKSSVS
Sbjct: 2281 ERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTILHDVKLLIEEDVSLKSSVS 2340

Query: 2341 MIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFSDDDILDALVSLGLRRSLD 2400
            MIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFSDDDILDALVSLGLRRSLD
Sbjct: 2341 MIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFSDDDILDALVSLGLRRSLD 2400

Query: 2401 LTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSIKVEGSGYELQNSMLIKSN 2460
            LTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSIKVEGSGYELQNSMLIKSN
Sbjct: 2401 LTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSIKVEGSGYELQNSMLIKSN 2460

Query: 2461 YVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSEMNTIAWCPICADSPLKVL 2520
            YVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSEMNTIAWCPICADSPLKVL
Sbjct: 2461 YVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSEMNTIAWCPICADSPLKVL 2520

Query: 2521 PWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQHKLGWTDCPRVEVLCAQLT 2580
            PWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQHKLGWTDCPRVEVLCAQLT
Sbjct: 2521 PWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQHKLGWTDCPRVEVLCAQLT 2580

Query: 2581 DISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDESVLLKSALNGVSWVWVGD 2640
            DISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDESVLLKSALNGVSWVWVGD
Sbjct: 2581 DISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDESVLLKSALNGVSWVWVGD 2640

Query: 2641 DFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVEGYLDVLQRLHSDVE 2700
            DFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVEGYLDVLQRLHSDVE
Sbjct: 2641 DFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVEGYLDVLQRLHSDVE 2700

Query: 2701 GSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSSQVLMQANDLVYNDAPWME 2760
            GSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSSQVLMQANDLVYNDAPWME
Sbjct: 2701 GSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSSQVLMQANDLVYNDAPWME 2760

Query: 2761 DNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDLPCMDYAKISELLMLYGND 2820
            DNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDLPCMDYAKISELLMLYGND
Sbjct: 2761 DNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDLPCMDYAKISELLMLYGND 2820

Query: 2821 YLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLNTEEIS 2880
            YLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLNTEEIS
Sbjct: 2821 YLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLNTEEIS 2880

Query: 2881 SLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIFDPRGIALSVAPKSAPGAK 2940
            SLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIFDPRGIALSVAPKSAPGAK
Sbjct: 2881 SLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIFDPRGIALSVAPKSAPGAK 2940

Query: 2941 VFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPACLKDGLEPGIRKIKEISS 3000
            VFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPACLKDGLEPGIRKIKEISS
Sbjct: 2941 VFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPACLKDGLEPGIRKIKEISS 3000

Query: 3001 KFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLSSAIARNPFSEKKWKKFQL 3060
            KFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLSSAIARNPFSEKKWKKFQL
Sbjct: 3001 KFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLSSAIARNPFSEKKWKKFQL 3060

Query: 3061 SRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPVA 3120
            SRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPVA
Sbjct: 3061 SRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPVA 3120

Query: 3121 GVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFLVCHSGGRYLFKNQVLEAV 3180
            GVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFLVCHSGGRYLFKNQVLEAV
Sbjct: 3121 GVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFLVCHSGGRYLFKNQVLEAV 3180

Query: 3181 AAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSSSALESNVSHSISSSLKAY 3240
            AAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSSSALESNVSHSISSSLKAY
Sbjct: 3181 AAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSSSALESNVSHSISSSLKAY 3240

Query: 3241 GNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNLVKA 3300
            GNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNLVKA
Sbjct: 3241 GNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNLVKA 3300

Query: 3301 EEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVRD 3360
            EEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVRD
Sbjct: 3301 EEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVRD 3360

Query: 3361 LLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMGADSVNTNPGGRSTNTSEG 3420
            LLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMGADSVNTNPGGRSTNTSEG
Sbjct: 3361 LLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMGADSVNTNPGGRSTNTSEG 3420

Query: 3421 SSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGRSGDSLSHSN 3480
            SSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGRSGDSLSHSN
Sbjct: 3421 SSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGRSGDSLSHSN 3480

Query: 3481 TFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSMELWLGSKDQQELMIPLAA 3540
            TFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSMELWLGSKDQQELMIPLAA
Sbjct: 3481 TFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSMELWLGSKDQQELMIPLAA 3540

Query: 3541 RFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRSVFHANWVNHVMNSNMSPW 3600
            RFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRSVFHANWVNHVMNSNMSPW
Sbjct: 3541 RFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRSVFHANWVNHVMNSNMSPW 3600

Query: 3601 FSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPLVPAFLGRPILCRVRERHL 3660
            FSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPLVPAFLGRPILCRVRERHL
Sbjct: 3601 FSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPLVPAFLGRPILCRVRERHL 3660

Query: 3661 VFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPYTSAFQKFQDTYPWLFPLL 3720
            VFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPYTSAFQKFQDTYPWLFPLL
Sbjct: 3661 VFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPYTSAFQKFQDTYPWLFPLL 3720

Query: 3721 NHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANNAGYFPELASLSDSNSNEL 3780
            NHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANNAGYFPELASLSDSNSNEL
Sbjct: 3721 NHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANNAGYFPELASLSDSNSNEL 3780

Query: 3781 LNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDNDQCMISSNSFLKPYNDCC 3840
            LNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDNDQCMISSNSFLKPYNDCC
Sbjct: 3781 LNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDNDQCMISSNSFLKPYNDCC 3840

Query: 3841 LSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSEQEDILIYLFTNWQDLQAD 3900
            LSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSEQEDILIYLFTNWQDLQAD
Sbjct: 3841 LSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSEQEDILIYLFTNWQDLQAD 3900

Query: 3901 AHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFSGERKKFPGERFAADGWLR 3960
            AHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFSGERKKFPGERFAADGWLR
Sbjct: 3901 AHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFSGERKKFPGERFAADGWLR 3960

Query: 3961 ILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDLINGQNEVPMEVWTLAGSV 4020
            ILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDLINGQNEVPMEVWTLAGSV
Sbjct: 3961 ILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDLINGQNEVPMEVWTLAGSV 4020

Query: 4021 VEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGKRVLTSYGDGIVSKDWPLA 4080
            VEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGKRVLTSYGDGIVSKDWPLA
Sbjct: 4021 VEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGKRVLTSYGDGIVSKDWPLA 4080

Query: 4081 WSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTLAHWPISMGVMS 4140
            WSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTLAHWPISMGVMS
Sbjct: 4081 WSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTLAHWPISMGVMS 4140

Query: 4141 INEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSPFAF 4200
            INEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSPFAF
Sbjct: 4141 INEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSPFAF 4200

Query: 4201 ELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQRLNPNELRSVMEILHYICD 4260
            ELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQRLNPNELRSVMEILHYICD
Sbjct: 4201 ELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQRLNPNELRSVMEILHYICD 4260

Query: 4261 EAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVHSDLPERI 4320
            EAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVHSDLPERI
Sbjct: 4261 EAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVHSDLPERI 4320

Query: 4321 CRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLLSRSFQNAVWNVVNSMVNY 4380
            CRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLLSRSFQNAVWNVVNSMVNY
Sbjct: 4321 CRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLLSRSFQNAVWNVVNSMVNY 4380

Query: 4381 IHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDITRPAKDSIIPEWKDGIHH 4440
            IHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDITRPAKDSIIPEWKDGIHH
Sbjct: 4381 IHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDITRPAKDSIIPEWKDGIHH 4440

Query: 4441 RALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPLPVGSLLFCPEGTEIAIIN 4500
            RALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPLPVGSLLFCPEGTEIAIIN
Sbjct: 4441 RALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPLPVGSLLFCPEGTEIAIIN 4500

Query: 4501 ILKLCSEKENEQFTGISSLLGKEILPQDALQLQLHPLRPFYAAEVVAWRSQSGEKLKYGR 4560
            ILKLCSEKENEQFTGISSLLGKEILPQDALQLQLHPLRPFYAAEVVAWRSQSGEKLKYGR
Sbjct: 4501 ILKLCSEKENEQFTGISSLLGKEILPQDALQLQLHPLRPFYAAEVVAWRSQSGEKLKYGR 4560

Query: 4561 VPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISIDGSHSSTNLQDSGHMIIDS 4620
            VPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISIDGSHSSTNLQDSGHMIIDS
Sbjct: 4561 VPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISIDGSHSSTNLQDSGHMIIDS 4620

Query: 4621 GASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLSTAGINVDIERQSLLQKTV 4680
            GASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLSTAGINVDIERQSLLQKTV
Sbjct: 4621 GASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLSTAGINVDIERQSLLQKTV 4680

Query: 4681 VLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLCRKCS 4740
            VLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLCRKCS
Sbjct: 4681 VLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLCRKCS 4740

Query: 4741 AAVSK 4746
            AAVSK
Sbjct: 4741 AAVSK 4745

BLAST of Moc09g05800 vs. ExPASy TrEMBL
Match: A0A6J1DQI4 (uncharacterized protein LOC111022177 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111022177 PE=4 SV=1)

HSP 1 Score: 8754.8 bits (22716), Expect = 0.0e+00
Identity = 4343/4343 (100.00%), Positives = 4343/4343 (100.00%), Query Frame = 0

Query: 403  MDRSGKIRSIWNRLLLEDIIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQ 462
            MDRSGKIRSIWNRLLLEDIIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQ
Sbjct: 1    MDRSGKIRSIWNRLLLEDIIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQ 60

Query: 463  VYKNISNALVLYSNVEGGKWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNM 522
            VYKNISNALVLYSNVEGGKWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNM
Sbjct: 61   VYKNISNALVLYSNVEGGKWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNM 120

Query: 523  LLKFCRAFQQKVVTPCTVRHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLP 582
            LLKFCRAFQQKVVTPCTVRHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLP
Sbjct: 121  LLKFCRAFQQKVVTPCTVRHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLP 180

Query: 583  LIPLANGDFGLFSEASKGISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSK 642
            LIPLANGDFGLFSEASKGISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSK
Sbjct: 181  LIPLANGDFGLFSEASKGISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSK 240

Query: 643  SNIFIFNVHYFLQLFPKFVPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLS 702
            SNIFIFNVHYFLQLFPKFVPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLS
Sbjct: 241  SNIFIFNVHYFLQLFPKFVPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLS 300

Query: 703  LFSDWPILPSKSRYLYRATKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHY 762
            LFSDWPILPSKSRYLYRATKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHY
Sbjct: 301  LFSDWPILPSKSRYLYRATKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHY 360

Query: 763  VNDGNCTGVLDSIYDAISSTGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLE 822
            VNDGNCTGVLDSIYDAISSTGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLE
Sbjct: 361  VNDGNCTGVLDSIYDAISSTGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLE 420

Query: 823  KCKRLPIYKVYNGGSAQDFGFSDLENPQKYLPPSDVGEFFLGVEFIVSSFDSEVEILLKY 882
            KCKRLPIYKVYNGGSAQDFGFSDLENPQKYLPPSDVGEFFLGVEFIVSSFDSEVEILLKY
Sbjct: 421  KCKRLPIYKVYNGGSAQDFGFSDLENPQKYLPPSDVGEFFLGVEFIVSSFDSEVEILLKY 480

Query: 883  YGIKKMGKASFYRKHVLNQVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVP 942
            YGIKKMGKASFYRKHVLNQVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVP
Sbjct: 481  YGIKKMGKASFYRKHVLNQVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVP 540

Query: 943  TSSGTLKCPTVLYDPRYEELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQ 1002
            TSSGTLKCPTVLYDPRYEELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQ
Sbjct: 541  TSSGTLKCPTVLYDPRYEELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQ 600

Query: 1003 SAQHVERLMHKDHNKAHSRGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPR 1062
            SAQHVERLMHKDHNKAHSRGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPR
Sbjct: 601  SAQHVERLMHKDHNKAHSRGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPR 660

Query: 1063 NFNSDLEKFWNDLCKISWCPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRI 1122
            NFNSDLEKFWNDLCKISWCPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRI
Sbjct: 661  NFNSDLEKFWNDLCKISWCPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRI 720

Query: 1123 LDGECSSSALAHSLGWSSPPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLT 1182
            LDGECSSSALAHSLGWSSPPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLT
Sbjct: 721  LDGECSSSALAHSLGWSSPPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLT 780

Query: 1183 GLIGSDEMDVVKAVLEGCRWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFL 1242
            GLIGSDEMDVVKAVLEGCRWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFL
Sbjct: 781  GLIGSDEMDVVKAVLEGCRWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFL 840

Query: 1243 ELGIREFLKPNDYADILSRMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPD 1302
            ELGIREFLKPNDYADILSRMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPD
Sbjct: 841  ELGIREFLKPNDYADILSRMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPD 900

Query: 1303 ISGRLLPASNLVYNDAPWLLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLG 1362
            ISGRLLPASNLVYNDAPWLLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLG
Sbjct: 901  ISGRLLPASNLVYNDAPWLLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLG 960

Query: 1363 VCSLRRILLAESADSMNLSLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAED 1422
            VCSLRRILLAESADSMNLSLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAED
Sbjct: 961  VCSLRRILLAESADSMNLSLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAED 1020

Query: 1423 AGASEVVFLLDKTHYGTSSILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQK 1482
            AGASEVVFLLDKTHYGTSSILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQK
Sbjct: 1021 AGASEVVFLLDKTHYGTSSILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQK 1080

Query: 1483 PLSIGRFGLGFNCVYHFTDIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILE 1542
            PLSIGRFGLGFNCVYHFTDIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILE
Sbjct: 1081 PLSIGRFGLGFNCVYHFTDIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILE 1140

Query: 1543 QFPDQFSPYLHFGCDMQKPFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEV 1602
            QFPDQFSPYLHFGCDMQKPFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEV
Sbjct: 1141 QFPDQFSPYLHFGCDMQKPFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEV 1200

Query: 1603 ASDALLFLTNVKKISIFIKDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQG 1662
            ASDALLFLTNVKKISIFIKDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQG
Sbjct: 1201 ASDALLFLTNVKKISIFIKDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQG 1260

Query: 1663 EMDREQFLMKLSKSINRDLPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLG 1722
            EMDREQFLMKLSKSINRDLPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLG
Sbjct: 1261 EMDREQFLMKLSKSINRDLPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLG 1320

Query: 1723 DKSYNFIPWACVAALLHSVQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAF 1782
            DKSYNFIPWACVAALLHSVQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAF
Sbjct: 1321 DKSYNFIPWACVAALLHSVQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAF 1380

Query: 1783 CFLPLPVRTGLPVHVNAYFELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRL 1842
            CFLPLPVRTGLPVHVNAYFELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRL
Sbjct: 1381 CFLPLPVRTGLPVHVNAYFELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRL 1440

Query: 1843 LEKIVSEIGHSGLFSSFWPTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQ 1902
            LEKIVSEIGHSGLFSSFWPTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQ
Sbjct: 1441 LEKIVSEIGHSGLFSSFWPTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQ 1500

Query: 1903 AIFPDFSFDKVYELIEALADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKR 1962
            AIFPDFSFDKVYELIEALADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKR
Sbjct: 1501 AIFPDFSFDKVYELIEALADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKR 1560

Query: 1963 AFKDRKATILTLEYCLVDLKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDE 2022
            AFKDRKATILTLEYCLVDLKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDE
Sbjct: 1561 AFKDRKATILTLEYCLVDLKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDE 1620

Query: 2023 YGLLKDSVPGQLVDPGIPEVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNAR 2082
            YGLLKDSVPGQLVDPGIPEVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNAR
Sbjct: 1621 YGLLKDSVPGQLVDPGIPEVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNAR 1680

Query: 2083 QVNWNPGHQGQPSLEWIRLIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKAD 2142
            QVNWNPGHQGQPSLEWIRLIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKAD
Sbjct: 1681 QVNWNPGHQGQPSLEWIRLIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKAD 1740

Query: 2143 GWSENMFSLLLKVGCLFLRRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFR 2202
            GWSENMFSLLLKVGCLFLRRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFR
Sbjct: 1741 GWSENMFSLLLKVGCLFLRRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFR 1800

Query: 2203 DASESELHELRSFILQSKWYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPT 2262
            DASESELHELRSFILQSKWYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPT
Sbjct: 1801 DASESELHELRSFILQSKWYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPT 1860

Query: 2263 GLCEDFLNDDFVRMESEKERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTIL 2322
            GLCEDFLNDDFVRMESEKERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTIL
Sbjct: 1861 GLCEDFLNDDFVRMESEKERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTIL 1920

Query: 2323 HDVKLLIEEDVSLKSSVSMIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFS 2382
            HDVKLLIEEDVSLKSSVSMIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFS
Sbjct: 1921 HDVKLLIEEDVSLKSSVSMIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFS 1980

Query: 2383 DDDILDALVSLGLRRSLDLTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSI 2442
            DDDILDALVSLGLRRSLDLTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSI
Sbjct: 1981 DDDILDALVSLGLRRSLDLTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSI 2040

Query: 2443 KVEGSGYELQNSMLIKSNYVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSE 2502
            KVEGSGYELQNSMLIKSNYVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSE
Sbjct: 2041 KVEGSGYELQNSMLIKSNYVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSE 2100

Query: 2503 MNTIAWCPICADSPLKVLPWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQH 2562
            MNTIAWCPICADSPLKVLPWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQH
Sbjct: 2101 MNTIAWCPICADSPLKVLPWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQH 2160

Query: 2563 KLGWTDCPRVEVLCAQLTDISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDE 2622
            KLGWTDCPRVEVLCAQLTDISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDE
Sbjct: 2161 KLGWTDCPRVEVLCAQLTDISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDE 2220

Query: 2623 SVLLKSALNGVSWVWVGDDFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLS 2682
            SVLLKSALNGVSWVWVGDDFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLS
Sbjct: 2221 SVLLKSALNGVSWVWVGDDFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLS 2280

Query: 2683 FNVEGYLDVLQRLHSDVEGSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSS 2742
            FNVEGYLDVLQRLHSDVEGSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSS
Sbjct: 2281 FNVEGYLDVLQRLHSDVEGSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSS 2340

Query: 2743 QVLMQANDLVYNDAPWMEDNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDL 2802
            QVLMQANDLVYNDAPWMEDNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDL
Sbjct: 2341 QVLMQANDLVYNDAPWMEDNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDL 2400

Query: 2803 PCMDYAKISELLMLYGNDYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQG 2862
            PCMDYAKISELLMLYGNDYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQG
Sbjct: 2401 PCMDYAKISELLMLYGNDYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQG 2460

Query: 2863 PALVAIFEGSSLNTEEISSLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIF 2922
            PALVAIFEGSSLNTEEISSLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIF
Sbjct: 2461 PALVAIFEGSSLNTEEISSLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIF 2520

Query: 2923 DPRGIALSVAPKSAPGAKVFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPA 2982
            DPRGIALSVAPKSAPGAKVFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPA
Sbjct: 2521 DPRGIALSVAPKSAPGAKVFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPA 2580

Query: 2983 CLKDGLEPGIRKIKEISSKFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLS 3042
            CLKDGLEPGIRKIKEISSKFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLS
Sbjct: 2581 CLKDGLEPGIRKIKEISSKFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLS 2640

Query: 3043 SAIARNPFSEKKWKKFQLSRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQT 3102
            SAIARNPFSEKKWKKFQLSRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQT
Sbjct: 2641 SAIARNPFSEKKWKKFQLSRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQT 2700

Query: 3103 RNMALDRRYLAYNLTPVAGVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFL 3162
            RNMALDRRYLAYNLTPVAGVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFL
Sbjct: 2701 RNMALDRRYLAYNLTPVAGVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFL 2760

Query: 3163 VCHSGGRYLFKNQVLEAVAAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSS 3222
            VCHSGGRYLFKNQVLEAVAAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSS
Sbjct: 2761 VCHSGGRYLFKNQVLEAVAAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSS 2820

Query: 3223 SALESNVSHSISSSLKAYGNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYT 3282
            SALESNVSHSISSSLKAYGNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYT
Sbjct: 2821 SALESNVSHSISSSLKAYGNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYT 2880

Query: 3283 RAIDLPVWQLYSGNLVKAEEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKE 3342
            RAIDLPVWQLYSGNLVKAEEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKE
Sbjct: 2881 RAIDLPVWQLYSGNLVKAEEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKE 2940

Query: 3343 IQAVGITVRQIRPKMVRDLLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMG 3402
            IQAVGITVRQIRPKMVRDLLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMG
Sbjct: 2941 IQAVGITVRQIRPKMVRDLLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMG 3000

Query: 3403 ADSVNTNPGGRSTNTSEGSSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFG 3462
            ADSVNTNPGGRSTNTSEGSSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFG
Sbjct: 3001 ADSVNTNPGGRSTNTSEGSSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFG 3060

Query: 3463 RGVVEDIGRSGDSLSHSNTFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSM 3522
            RGVVEDIGRSGDSLSHSNTFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSM
Sbjct: 3061 RGVVEDIGRSGDSLSHSNTFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSM 3120

Query: 3523 ELWLGSKDQQELMIPLAARFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRS 3582
            ELWLGSKDQQELMIPLAARFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRS
Sbjct: 3121 ELWLGSKDQQELMIPLAARFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRS 3180

Query: 3583 VFHANWVNHVMNSNMSPWFSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPL 3642
            VFHANWVNHVMNSNMSPWFSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPL
Sbjct: 3181 VFHANWVNHVMNSNMSPWFSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPL 3240

Query: 3643 VPAFLGRPILCRVRERHLVFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPY 3702
            VPAFLGRPILCRVRERHLVFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPY
Sbjct: 3241 VPAFLGRPILCRVRERHLVFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPY 3300

Query: 3703 TSAFQKFQDTYPWLFPLLNHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANN 3762
            TSAFQKFQDTYPWLFPLLNHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANN
Sbjct: 3301 TSAFQKFQDTYPWLFPLLNHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANN 3360

Query: 3763 AGYFPELASLSDSNSNELLNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDN 3822
            AGYFPELASLSDSNSNELLNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDN
Sbjct: 3361 AGYFPELASLSDSNSNELLNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDN 3420

Query: 3823 DQCMISSNSFLKPYNDCCLSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSE 3882
            DQCMISSNSFLKPYNDCCLSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSE
Sbjct: 3421 DQCMISSNSFLKPYNDCCLSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSE 3480

Query: 3883 QEDILIYLFTNWQDLQADAHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFS 3942
            QEDILIYLFTNWQDLQADAHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFS
Sbjct: 3481 QEDILIYLFTNWQDLQADAHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFS 3540

Query: 3943 GERKKFPGERFAADGWLRILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDL 4002
            GERKKFPGERFAADGWLRILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDL
Sbjct: 3541 GERKKFPGERFAADGWLRILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDL 3600

Query: 4003 INGQNEVPMEVWTLAGSVVEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGK 4062
            INGQNEVPMEVWTLAGSVVEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGK
Sbjct: 3601 INGQNEVPMEVWTLAGSVVEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGK 3660

Query: 4063 RVLTSYGDGIVSKDWPLAWSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGR 4122
            RVLTSYGDGIVSKDWPLAWSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGR
Sbjct: 3661 RVLTSYGDGIVSKDWPLAWSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGR 3720

Query: 4123 NGGEDTLAHWPISMGVMSINEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLV 4182
            NGGEDTLAHWPISMGVMSINEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLV
Sbjct: 3721 NGGEDTLAHWPISMGVMSINEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLV 3780

Query: 4183 KANALFARLTINLSPFAFELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQR 4242
            KANALFARLTINLSPFAFELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQR
Sbjct: 3781 KANALFARLTINLSPFAFELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQR 3840

Query: 4243 LNPNELRSVMEILHYICDEAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIK 4302
            LNPNELRSVMEILHYICDEAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIK
Sbjct: 3841 LNPNELRSVMEILHYICDEAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIK 3900

Query: 4303 CIDTSRLRFVHSDLPERICRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLL 4362
            CIDTSRLRFVHSDLPERICRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLL
Sbjct: 3901 CIDTSRLRFVHSDLPERICRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLL 3960

Query: 4363 SRSFQNAVWNVVNSMVNYIHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDI 4422
            SRSFQNAVWNVVNSMVNYIHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDI
Sbjct: 3961 SRSFQNAVWNVVNSMVNYIHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDI 4020

Query: 4423 TRPAKDSIIPEWKDGIHHRALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPL 4482
            TRPAKDSIIPEWKDGIHHRALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPL
Sbjct: 4021 TRPAKDSIIPEWKDGIHHRALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPL 4080

Query: 4483 PVGSLLFCPEGTEIAIINILKLCSEKENEQFTGISSLLGKEILPQDALQLQLHPLRPFYA 4542
            PVGSLLFCPEGTEIAIINILKLCSEKENEQFTGISSLLGKEILPQDALQLQLHPLRPFYA
Sbjct: 4081 PVGSLLFCPEGTEIAIINILKLCSEKENEQFTGISSLLGKEILPQDALQLQLHPLRPFYA 4140

Query: 4543 AEVVAWRSQSGEKLKYGRVPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISID 4602
            AEVVAWRSQSGEKLKYGRVPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISID
Sbjct: 4141 AEVVAWRSQSGEKLKYGRVPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISID 4200

Query: 4603 GSHSSTNLQDSGHMIIDSGASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLS 4662
            GSHSSTNLQDSGHMIIDSGASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLS
Sbjct: 4201 GSHSSTNLQDSGHMIIDSGASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLS 4260

Query: 4663 TAGINVDIERQSLLQKTVVLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTS 4722
            TAGINVDIERQSLLQKTVVLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTS
Sbjct: 4261 TAGINVDIERQSLLQKTVVLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTS 4320

Query: 4723 EVEITIVPCGHVLCRKCSAAVSK 4746
            EVEITIVPCGHVLCRKCSAAVSK
Sbjct: 4321 EVEITIVPCGHVLCRKCSAAVSK 4343

BLAST of Moc09g05800 vs. ExPASy TrEMBL
Match: A0A6J1H4Y1 (sacsin isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111460612 PE=4 SV=1)

HSP 1 Score: 8666.6 bits (22487), Expect = 0.0e+00
Identity = 4279/4746 (90.16%), Positives = 4485/4746 (94.50%), Query Frame = 0

Query: 1    MASESTSLDSIFLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD 60
            MASES SLDS+FLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD
Sbjct: 1    MASESASLDSVFLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD 60

Query: 61   RRVHGSESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF 120
            RRVHG ESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF
Sbjct: 61   RRVHGRESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF 120

Query: 121  NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIRCSAISQYRDQFLPYCA 180
            NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIR SAISQYRDQFLPYC 
Sbjct: 121  NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIRSSAISQYRDQFLPYCV 180

Query: 181  FDCDMESSFDGTLFRLPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV 240
            F+CDM+SSF GTLFRLPLRNAD AARS ISRQAYTEEDISSMFAELYEEGVLTLLFLKSV
Sbjct: 181  FNCDMKSSFAGTLFRLPLRNADLAARSNISRQAYTEEDISSMFAELYEEGVLTLLFLKSV 240

Query: 241  SCIEMFVWNDGEAEPQKLYSFSVRSASSDIIWHRQMLLRLSKSTTSTQSEVDSFSLDFLS 300
             CIEMFVWNDGE EPQKLYSFSVRSASSDIIWHRQMLLRLSKSTTST S+ DS+SL+FLS
Sbjct: 241  LCIEMFVWNDGETEPQKLYSFSVRSASSDIIWHRQMLLRLSKSTTSTMSDTDSYSLEFLS 300

Query: 301  QATTGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSSKT 360
            +A +GTQIEERIDSFFIVQTMASATSRI SFAATASKEYDIHLLPWASLAVCTSDDSS  
Sbjct: 301  RAMSGTQIEERIDSFFIVQTMASATSRIGSFAATASKEYDIHLLPWASLAVCTSDDSSNN 360

Query: 361  NVLKLGRAFCFLPLPVKTGLNVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLLLED 420
            +VLKLGRAFCFLPLPVKTGL VQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLLLED
Sbjct: 361  SVLKLGRAFCFLPLPVKTGLTVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLLLED 420

Query: 421  IIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQVYKNISNALVLYSNVEGG 480
            IIAPSFIELLIGVQVFLGPTD YFSLWPSGSFEEPWNILVEQVYKNI NALVLYSNVEGG
Sbjct: 421  IIAPSFIELLIGVQVFLGPTDAYFSLWPSGSFEEPWNILVEQVYKNIGNALVLYSNVEGG 480

Query: 481  KWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNMLLKFCRAFQQKVVTPCTV 540
            KWVSP +AFLHDDKF RS ELGEALV LGMPIVHLPENLSNMLLKFC  FQQKVVTPCTV
Sbjct: 481  KWVSPNEAFLHDDKFARSTELGEALVRLGMPIVHLPENLSNMLLKFCCTFQQKVVTPCTV 540

Query: 541  RHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLPLIPLANGDFGLFSEASKG 600
            R FLR+CKHV+TLNRPYKLVLLEYCIEDLIDADVC  AF LPL+PLANGDFGLFSEASKG
Sbjct: 541  RQFLRDCKHVSTLNRPYKLVLLEYCIEDLIDADVCTHAFGLPLLPLANGDFGLFSEASKG 600

Query: 601  ISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSKSNIFIFNVHYFLQLFPKF 660
            ISYFICDELEY LLHQISDRVID NIPL IS RLSNIARSS SN+F FNVHYFLQLFPKF
Sbjct: 601  ISYFICDELEYKLLHQISDRVIDWNIPLAISARLSNIARSSTSNLFAFNVHYFLQLFPKF 660

Query: 661  VPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLSLFSDWPILPSKSRYLYRA 720
            VPADWKYKNEV WDPESCSNHPTSSWF LFW+YL D CEKLSLFSDWPILP KSRYLYRA
Sbjct: 661  VPADWKYKNEVFWDPESCSNHPTSSWFLLFWQYLRDHCEKLSLFSDWPILPCKSRYLYRA 720

Query: 721  TKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAIS 780
            +K+SK+INVQMLSNEMQ ILSKLGCKLLDPYYKVEHRDL HYVNDGNCTG+LDSIYDAIS
Sbjct: 721  SKQSKVINVQMLSNEMQNILSKLGCKLLDPYYKVEHRDLFHYVNDGNCTGILDSIYDAIS 780

Query: 781  STGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLEKCKRLPIYKVYNGGSAQD 840
            STGGL+LTSL++LEVEEKDGLRRFLLDPKWYLGG M D++LEKCKRLPI+KVYNGGSAQD
Sbjct: 781  STGGLLLTSLHNLEVEEKDGLRRFLLDPKWYLGGSMKDNELEKCKRLPIFKVYNGGSAQD 840

Query: 841  FGFSDLENPQKYLPPSDVGEFFLGVEFIVSSF-DSEVEILLKYYGIKKMGKASFYRKHVL 900
            FGFSDLE+P+KY PPSDVGE FLGVEFI SS  D E EILLKYYGIKKMGKASFYRKHVL
Sbjct: 841  FGFSDLESPRKYFPPSDVGECFLGVEFIFSSSDDGEEEILLKYYGIKKMGKASFYRKHVL 900

Query: 901  NQVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVPTSSGTLKCPTVLYDPRY 960
            NQV QLQP+LRD+TMLSVL NLPQLCVEDV FRECLSNL F+PTS GTLKCP  LYDPRY
Sbjct: 901  NQVGQLQPKLRDNTMLSVLLNLPQLCVEDVTFRECLSNLDFIPTSRGTLKCPAALYDPRY 960

Query: 961  EELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQSAQHVERLMHKDHNKAH 1020
            EELCALLDDFDSFPSTPFNESYILDILQGLGLRTCV+PETIVQSA HVER MH D NKAH
Sbjct: 961  EELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVSPETIVQSALHVERFMHMDQNKAH 1020

Query: 1021 SRGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPRNFNSDLEKFWNDLCKIS 1080
            SRGKVLLSYLEVNAIKWLLNP NE+ GMVNRLFSTAATAFRPRNF SDLEKFWNDL KIS
Sbjct: 1021 SRGKVLLSYLEVNAIKWLLNPTNEEHGMVNRLFSTAATAFRPRNFTSDLEKFWNDLRKIS 1080

Query: 1081 WCPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWS 1140
            WCPVLL+PPFETLPWP+VSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWS
Sbjct: 1081 WCPVLLTPPFETLPWPIVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWS 1140

Query: 1141 SPPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLTGLIGSDEMDVVKAVLEG 1200
            SPP GSIIAAQLLELGKNNEIV+D VLRKELA AMPRIYALLTGLIGSDEMDVVKAVLEG
Sbjct: 1141 SPPSGSIIAAQLLELGKNNEIVHDLVLRKELAQAMPRIYALLTGLIGSDEMDVVKAVLEG 1200

Query: 1201 CRWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFLELGIREFLKPNDYADIL 1260
            CRWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFK+LFL+LGIREFLKPNDYADIL
Sbjct: 1201 CRWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKELFLKLGIREFLKPNDYADIL 1260

Query: 1261 SRMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYNDAP 1320
            SRMAI+KGSS LN QEVRAAILIVQHLAEAQLPKQQIN+YLPDIS RLLPASNLVYNDAP
Sbjct: 1261 SRMAIRKGSSSLNTQEVRAAILIVQHLAEAQLPKQQINLYLPDISCRLLPASNLVYNDAP 1320

Query: 1321 WLLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADSMN 1380
            WLLGTDDTDV F+GEST VLNARKTVQ FVHGNISN+VAEKLGVCSLRRILLAESADSMN
Sbjct: 1321 WLLGTDDTDVSFNGESTFVLNARKTVQNFVHGNISNEVAEKLGVCSLRRILLAESADSMN 1380

Query: 1381 LSLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAEDAGASEVVFLLDKTHYGT 1440
            LSLSGAAEAFGQHEALTNRLRHILEMYADG GILFELIQNAEDAGASEVVFLLDKTHYGT
Sbjct: 1381 LSLSGAAEAFGQHEALTNRLRHILEMYADGSGILFELIQNAEDAGASEVVFLLDKTHYGT 1440

Query: 1441 SSILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHF 1500
            SSILSPEMADWQGPALYCYNDSVFSS DLYAISRVGQESKLQKPLSIGRFGLGFNCVYHF
Sbjct: 1441 SSILSPEMADWQGPALYCYNDSVFSSHDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHF 1500

Query: 1501 TDIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQ 1560
            TDIPTFVSGEN+VMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQ
Sbjct: 1501 TDIPTFVSGENVVMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQ 1560

Query: 1561 KPFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEVASDALLFLTNVKKISIF 1620
            KPFPGTLFRFPLRSPALASRSEIKKE YAPEDV SLFYSFSEVASDALLFLTNVK ISIF
Sbjct: 1561 KPFPGTLFRFPLRSPALASRSEIKKEAYAPEDVLSLFYSFSEVASDALLFLTNVKTISIF 1620

Query: 1621 IKDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQGEMDREQFLMKLSKSINR 1680
             KDDI H+MQCLYRVHKNT+SEPSTESSA+QDII+FI GNRQGE+DREQFL KL+KSI++
Sbjct: 1621 TKDDIGHEMQCLYRVHKNTISEPSTESSAQQDIINFICGNRQGELDREQFLRKLNKSISK 1680

Query: 1681 DLPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLGDKSYNFIPWACVAALLH 1740
            DLPYKCQK IITEKSS GDILQHYWITSGCLGGGLPRNNSG+GDKSYNFIPWACVAALLH
Sbjct: 1681 DLPYKCQKHIITEKSSGGDILQHYWITSGCLGGGLPRNNSGVGDKSYNFIPWACVAALLH 1740

Query: 1741 SVQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAFCFLPLPVRTGLPVHVNA 1800
            SV+VD EMNYD E ENNWL+ASD VQVSSASI+GRKP EGRAFCFLPLP++TGLPVHVNA
Sbjct: 1741 SVKVDEEMNYDQEAENNWLIASDSVQVSSASIQGRKPLEGRAFCFLPLPIKTGLPVHVNA 1800

Query: 1801 YFELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRLLEKIVSEIGHSGLFSSF 1860
            YFELSSNRRDIWYGDDMAGGGKKRSEWNSYLLE+VVAPAYG LLEKI SEIGHSGLFSSF
Sbjct: 1801 YFELSSNRRDIWYGDDMAGGGKKRSEWNSYLLENVVAPAYGHLLEKIASEIGHSGLFSSF 1860

Query: 1861 WPTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQAIFPDFSFDKVYELIEA 1920
            WP+TAGLEPWGSVVR LYSFIGDFG+LVLYTNARGGQWIS +QAIFPDFSFDKV+ELIEA
Sbjct: 1861 WPSTAGLEPWGSVVRNLYSFIGDFGILVLYTNARGGQWISARQAIFPDFSFDKVHELIEA 1920

Query: 1921 LADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLV 1980
            L+DSGLP+I+ SKSIVDRFMEV PSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLV
Sbjct: 1921 LSDSGLPLISTSKSIVDRFMEVCPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLV 1980

Query: 1981 DLKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDEYGLLKDSVPGQLVDPGI 2040
            DLKLPFQS+SLC LPLLPLADG+FT+F+KNGMGERTYIARGDEYG+LK+SVP QLVDP I
Sbjct: 1981 DLKLPFQSESLCRLPLLPLADGTFTTFNKNGMGERTYIARGDEYGILKESVPSQLVDPDI 2040

Query: 2041 PEVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNARQVNWNPGHQGQPSLEWI 2100
            PE VHAKLCEVAQTEDLNICFLSC LLEKLFLRFLP EWQNARQVNWNPGHQ  PSLEWI
Sbjct: 2041 PEAVHAKLCEVAQTEDLNICFLSCHLLEKLFLRFLPTEWQNARQVNWNPGHQSHPSLEWI 2100

Query: 2101 RLIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKADGWSENMFSLLLKVGCLF 2160
            RL+WCYLK HC+DLSQFSKWPILPVG+NSLLQLVENSNVL+ADGWSENMFSLLLKVGCLF
Sbjct: 2101 RLVWCYLKLHCNDLSQFSKWPILPVGENSLLQLVENSNVLRADGWSENMFSLLLKVGCLF 2160

Query: 2161 LRRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFRDASESELHELRSFILQS 2220
            LRRDMPIEHPQLEN+VHPSTA GILNAFL+IAGDIENVEGLF DA E ELHELRSFILQS
Sbjct: 2161 LRRDMPIEHPQLENFVHPSTATGILNAFLAIAGDIENVEGLFHDACEGELHELRSFILQS 2220

Query: 2221 KWYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPTGLCEDFLNDDFVRMESE 2280
            KW+LEG MEA HVDIIK IPMFESYK RKLVSLS+P+RWIKPTGL EDFLNDDFVR+ESE
Sbjct: 2221 KWFLEGNMEATHVDIIKRIPMFESYKSRKLVSLSQPVRWIKPTGLYEDFLNDDFVRIESE 2280

Query: 2281 KERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTILHDVKLLIEEDVSLKSSV 2340
            KERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSER ALSTIL DVKLLIEED SLKSSV
Sbjct: 2281 KERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSEREALSTILLDVKLLIEEDASLKSSV 2340

Query: 2341 SMIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFSDDDILDALVSLGLRRSL 2400
            SMIPFVL  NGSWQPPSRLYDPRV EL NMLHEETFFPSE FSDDDILDALVSLGL RSL
Sbjct: 2341 SMIPFVLTGNGSWQPPSRLYDPRVHELKNMLHEETFFPSEIFSDDDILDALVSLGLNRSL 2400

Query: 2401 DLTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSIKVEGSGYELQNSMLIKS 2460
             LTG LDCARSV LLNDS+N    SYARRLFVCLDALAHKLSI VEG+  ELQ SML++S
Sbjct: 2401 GLTGFLDCARSVPLLNDSEN----SYARRLFVCLDALAHKLSINVEGNCCELQTSMLVES 2460

Query: 2461 NYVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSEMNTIAWCPICADSPLKV 2520
            +YVDDD SMEVGSL+ EDTSDMG DSLIGNL GDESEEEFWSEM TIAWCP+CADSP+KV
Sbjct: 2461 DYVDDDTSMEVGSLDREDTSDMGIDSLIGNLAGDESEEEFWSEMKTIAWCPVCADSPVKV 2520

Query: 2521 LPWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQHKLGWTDCPRVEVLCAQL 2580
            LPWLKT N+VAPPS+VRPKSQMWMVSSSMHILDG+ PS YLQHKLGWTDCPRVEVLCAQL
Sbjct: 2521 LPWLKTCNKVAPPSVVRPKSQMWMVSSSMHILDGMLPSVYLQHKLGWTDCPRVEVLCAQL 2580

Query: 2581 TDISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDESVLLKSALNGVSWVWVG 2640
            TDIS LYGELRLHSSLEPDINTALQ+GI ILYSKLQEY GTD+ VLLKSALNGVSWVWVG
Sbjct: 2581 TDISTLYGELRLHSSLEPDINTALQDGITILYSKLQEYRGTDDFVLLKSALNGVSWVWVG 2640

Query: 2641 DDFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVEGYLDVLQRLHSDV 2700
            DDFV PSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNV+ YLDVLQRLH DV
Sbjct: 2641 DDFVSPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVKDYLDVLQRLHKDV 2700

Query: 2701 EGSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSSQVLMQANDLVYNDAPWM 2760
            +GSPLSTDQM+FVIC+LEAI DCC+DKPEFTATS  LLIPNSSQVLM ANDLVYNDAPWM
Sbjct: 2701 KGSPLSTDQMNFVICVLEAILDCCMDKPEFTATSIPLLIPNSSQVLMLANDLVYNDAPWM 2760

Query: 2761 EDNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDLPCMDYAKISELLMLYGN 2820
            E+NNILVGKHFVHPSIS+DLASRLGVQSIRCLSLVDEEMTKDLPCM+YAKISELL LYGN
Sbjct: 2761 EENNILVGKHFVHPSISHDLASRLGVQSIRCLSLVDEEMTKDLPCMEYAKISELLKLYGN 2820

Query: 2821 DYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLNTEEI 2880
            DYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVA+FEGSSL+TEEI
Sbjct: 2821 DYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAVFEGSSLSTEEI 2880

Query: 2881 SSLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIFDPRGIALSVAPKSAPGA 2940
            S LQFRPPWKLRGDTLNYGLGLLSCYYVCDLL IVSGGYFYIFDPRGIALSVAPKSAPGA
Sbjct: 2881 SGLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLYIVSGGYFYIFDPRGIALSVAPKSAPGA 2940

Query: 2941 KVFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPACLKDGLEPGIRKIKEIS 3000
            K+FSLIGSNLIE+F DQFHP+LGGQNMSWPSDSTI+RMPLS ACLKDGLE GI +IKEIS
Sbjct: 2941 KMFSLIGSNLIERFKDQFHPLLGGQNMSWPSDSTIIRMPLSSACLKDGLESGIERIKEIS 3000

Query: 3001 SKFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLSSAIARNPFSEKKWKKFQ 3060
            SKFLDHASRSLLFLKSV+QVSFSTWDQG L+PYQDYSVC+NLSSAIARNPFSEKKWKKFQ
Sbjct: 3001 SKFLDHASRSLLFLKSVLQVSFSTWDQGELNPYQDYSVCVNLSSAIARNPFSEKKWKKFQ 3060

Query: 3061 LSRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPV 3120
            LSRLFSSSNAATK+H ID+I+FQG+ QFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPV
Sbjct: 3061 LSRLFSSSNAATKVHAIDVILFQGDAQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPV 3120

Query: 3121 AGVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFLVCHSGGRYLFKNQVLEA 3180
            AG+AAHISRNGLPADI  KSPLMAP PLSGDI LPVTVLGCFLVCH+GGRYLFKNQVLEA
Sbjct: 3121 AGLAAHISRNGLPADIYLKSPLMAPFPLSGDIILPVTVLGCFLVCHNGGRYLFKNQVLEA 3180

Query: 3181 VAAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSSSALESNVSHSISSSLKA 3240
               PLDAGNKLVEAWNRELMSCVCDSYI+M+LEIHKQRKESSSSALESNVSHSISSSLKA
Sbjct: 3181 FVEPLDAGNKLVEAWNRELMSCVCDSYIFMVLEIHKQRKESSSSALESNVSHSISSSLKA 3240

Query: 3241 YGNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNLVK 3300
            YGNQVYSFWPRS   NGSDSDLDRGLKADWECLVEQVIRPFY RAIDLPVWQLYSGNLVK
Sbjct: 3241 YGNQVYSFWPRS--GNGSDSDLDRGLKADWECLVEQVIRPFYARAIDLPVWQLYSGNLVK 3300

Query: 3301 AEEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVR 3360
            AEEGMFLAQPGSPVG NLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVR
Sbjct: 3301 AEEGMFLAQPGSPVGDNLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVR 3360

Query: 3361 DLLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMGADSVNTNPGGRSTNTSE 3420
            DLLRVSSASIVLQSIDTYLDVLEYCLSDI+LA  SNHAEDS+GADS+NTNPGGRSTNT+E
Sbjct: 3361 DLLRVSSASIVLQSIDTYLDVLEYCLSDIVLATSSNHAEDSIGADSINTNPGGRSTNTTE 3420

Query: 3421 GSSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGRSGDSLSHS 3480
             S TS+ VSS++SFAR SNQN ASSGDALEMMTSLGRALLDFGRGVVEDIGRSG+S  HS
Sbjct: 3421 DSPTSIPVSSVHSFARSSNQNGASSGDALEMMTSLGRALLDFGRGVVEDIGRSGESSFHS 3480

Query: 3481 NTFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSMELWLGSKDQQELMIPLA 3540
            NTF GRNN SYRNVDQ+FLQMVSE+KGLPFP+ASNNLVRLGSMELWLGSKDQQELMIPLA
Sbjct: 3481 NTFNGRNN-SYRNVDQHFLQMVSELKGLPFPTASNNLVRLGSMELWLGSKDQQELMIPLA 3540

Query: 3541 ARFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRSVFHANWVNHVMNSNMSP 3600
            A+F+HPK+FDRSILGNILTNDALHKFLKLQKFSL+LLAT+MRSVFHANWVNHVMNSNM+P
Sbjct: 3541 AKFLHPKIFDRSILGNILTNDALHKFLKLQKFSLNLLATHMRSVFHANWVNHVMNSNMAP 3600

Query: 3601 WFSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPLVPAFLGRPILCRVRERH 3660
            WFSW+NK  SGVEEGP+SEWIR+FWKN   +SQDLLLFSDWPL+PAFLGRPILCRVRERH
Sbjct: 3601 WFSWDNKLSSGVEEGPTSEWIRIFWKNFSGTSQDLLLFSDWPLIPAFLGRPILCRVRERH 3660

Query: 3661 LVFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPYTSAFQKFQDTYPWLFPL 3720
            LVFLPPV + V PN+ILEIGAGGSDVAETS S ISKPES+QPYT AFQ+FQD YPWLFPL
Sbjct: 3661 LVFLPPVAHSVSPNSILEIGAGGSDVAETSLSDISKPESLQPYTLAFQRFQDMYPWLFPL 3720

Query: 3721 LNHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANNAGYFPELASLSDSNSNE 3780
            LN CNIPIFDVAFMDCA+LCNC+ NSGQ LGQ+IAS FVAA NAGYFPELASLSDSNS+E
Sbjct: 3721 LNQCNIPIFDVAFMDCAALCNCVPNSGQPLGQVIASKFVAAKNAGYFPELASLSDSNSDE 3780

Query: 3781 LLNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDNDQCMISSNSFLKPYNDC 3840
            LL LFA DF SNGTNYGR+ELEILR LPIYRTVVGSYTQLR+NDQCMISSNSFLKP N+C
Sbjct: 3781 LLKLFAKDFASNGTNYGREELEILRTLPIYRTVVGSYTQLRENDQCMISSNSFLKPNNEC 3840

Query: 3841 CLSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSEQEDILIYLFTNWQDLQA 3900
            CLSYSSNSMEYSLLRALRVPELD++QILI+FG P FDCKPQSEQEDILIYL+TNW+DLQA
Sbjct: 3841 CLSYSSNSMEYSLLRALRVPELDDEQILIKFGFPGFDCKPQSEQEDILIYLYTNWRDLQA 3900

Query: 3901 DAHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFSGERKKFPGERFAADGWL 3960
            +A LVECLSET FVRSADEFCTDLFKSKELYDPSDALLTSVFS ERKKFPGERFAADGWL
Sbjct: 3901 NARLVECLSETKFVRSADEFCTDLFKSKELYDPSDALLTSVFSDERKKFPGERFAADGWL 3960

Query: 3961 RILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDLINGQNEVPMEVWTLAGS 4020
             ILRK+GLRTT EANVILECAKKVETLGSEWRKSEEDG EFDL N Q+EVPME+WTLAGS
Sbjct: 3961 HILRKVGLRTTAEANVILECAKKVETLGSEWRKSEEDGFEFDLTNAQSEVPMEIWTLAGS 4020

Query: 4021 VVEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGKRVLTSYGDGIVSKDWPL 4080
            VVEAVFSNFAVFYSN+FCNALGNI FVPA+LGFPNLGGNKGGKRVL SY D I SKDWPL
Sbjct: 4021 VVEAVFSNFAVFYSNSFCNALGNIVFVPAELGFPNLGGNKGGKRVLASYSDAIASKDWPL 4080

Query: 4081 AWSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTLAHWPISMGVM 4140
            AWSCAPILSKH VIPP+YSWGALNL+SPPAFP VLKHLQVIGRNGGEDTL+HWPIS+G+M
Sbjct: 4081 AWSCAPILSKHCVIPPEYSWGALNLKSPPAFPKVLKHLQVIGRNGGEDTLSHWPISVGIM 4140

Query: 4141 SINEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSPFA 4200
            SINEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSP A
Sbjct: 4141 SINEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSPIA 4200

Query: 4201 FELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQRLNPNELRSVMEILHYIC 4260
            F+LPSGYLPFVKIL DLGLQD LSVASAKDLLSSLQVACGYQRLNPNELRSVMEILH+IC
Sbjct: 4201 FQLPSGYLPFVKILGDLGLQDGLSVASAKDLLSSLQVACGYQRLNPNELRSVMEILHFIC 4260

Query: 4261 DEAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVHSDLPER 4320
            DEA EAK+F GREPE+IVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVH DLPE+
Sbjct: 4261 DEATEAKIFCGREPEVIVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVHPDLPEK 4320

Query: 4321 ICRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLLSRSFQNAVWNVVNSMVN 4380
            ICR+LGIKKLS+LVIEELDHEDSIEPLE IGAVSL  I++KLLSRSFQ+AVWNV NSMV+
Sbjct: 4321 ICRILGIKKLSELVIEELDHEDSIEPLECIGAVSLGRIKEKLLSRSFQSAVWNVANSMVS 4380

Query: 4381 YIHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDITRPAKDSIIPEWKDGIH 4440
            Y H NKNLDL+AVE+LLKS AERLQFVK LHTRFLLLPNSI+ITRP+KDSIIPEW+D  H
Sbjct: 4381 YTHTNKNLDLEAVEELLKSFAERLQFVKCLHTRFLLLPNSINITRPSKDSIIPEWEDVRH 4440

Query: 4441 HRALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPLPVGSLLFCPEGTEIAII 4500
            HRALYFV  SKTCILVAEPPA ISIFDV+AIVVSQILGSPIPLP+GSL FCPEG E AI+
Sbjct: 4441 HRALYFVKQSKTCILVAEPPACISIFDVLAIVVSQILGSPIPLPIGSLFFCPEGIETAIV 4500

Query: 4501 NILKLCSEKENEQFTGISSLLGKEILPQDALQLQLHPLRPFYAAEVVAWRSQSGEKLKYG 4560
            +ILKLCSEK NE+FTGISSL+GKEILPQDALQ+QLHPLRPFYA EVVAWRSQSGEKLKYG
Sbjct: 4501 DILKLCSEKTNEKFTGISSLIGKEILPQDALQIQLHPLRPFYAGEVVAWRSQSGEKLKYG 4560

Query: 4561 RVPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISIDGSHSSTNLQDSGHMIID 4620
             V EDVRPSAGQALY+FRVET  GI QSL+SSQVLSFRSISIDG  SS+NL DS HM+ID
Sbjct: 4561 MVLEDVRPSAGQALYRFRVETTSGIIQSLLSSQVLSFRSISIDGGPSSSNLLDSSHMVID 4620

Query: 4621 SGASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLSTAGINVDIERQSLLQKT 4680
            +G+SV+MPENSE GK+RSQP AELQYG+VSAEELVQAVHEML+TAGINVDIERQSLLQKT
Sbjct: 4621 NGSSVQMPENSESGKLRSQPAAELQYGKVSAEELVQAVHEMLTTAGINVDIERQSLLQKT 4680

Query: 4681 VVLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLCRKC 4740
            + LQEQLKDSQAALLLEQE+SDAAAKEADTAKAAW+CRVCLTSEVEITIVPCGHVLCR+C
Sbjct: 4681 ITLQEQLKDSQAALLLEQEKSDAAAKEADTAKAAWVCRVCLTSEVEITIVPCGHVLCRRC 4739

Query: 4741 SAAVSK 4746
            S+AVSK
Sbjct: 4741 SSAVSK 4739

BLAST of Moc09g05800 vs. ExPASy TrEMBL
Match: A0A6J1KUI4 (sacsin isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111497740 PE=4 SV=1)

HSP 1 Score: 8643.5 bits (22427), Expect = 0.0e+00
Identity = 4266/4746 (89.89%), Positives = 4481/4746 (94.42%), Query Frame = 0

Query: 1    MASESTSLDSIFLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD 60
            MASES SLDS+FLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD
Sbjct: 1    MASESASLDSVFLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD 60

Query: 61   RRVHGSESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF 120
            RRVHG ESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF
Sbjct: 61   RRVHGRESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF 120

Query: 121  NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIRCSAISQYRDQFLPYCA 180
            NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIR SAISQYRDQFLPYC 
Sbjct: 121  NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIRSSAISQYRDQFLPYCV 180

Query: 181  FDCDMESSFDGTLFRLPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV 240
            F+CDM+SSF GTLFRLPLRNAD AARS ISRQAYTEEDISSMFAELYEEGVLTLLFLKSV
Sbjct: 181  FNCDMKSSFAGTLFRLPLRNADLAARSNISRQAYTEEDISSMFAELYEEGVLTLLFLKSV 240

Query: 241  SCIEMFVWNDGEAEPQKLYSFSVRSASSDIIWHRQMLLRLSKSTTSTQSEVDSFSLDFLS 300
             CIEMFVWNDGE EPQKLYSFSVRSASSDIIWHRQMLLRLSKST ST S+ DS+SL+FLS
Sbjct: 241  LCIEMFVWNDGETEPQKLYSFSVRSASSDIIWHRQMLLRLSKSTASTMSDTDSYSLEFLS 300

Query: 301  QATTGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSSKT 360
            +A +GTQIEERIDSFFIVQTMASATSRI SFAATASKEYDIHLLPWASLAVCTSDDSS  
Sbjct: 301  RAMSGTQIEERIDSFFIVQTMASATSRIGSFAATASKEYDIHLLPWASLAVCTSDDSSNN 360

Query: 361  NVLKLGRAFCFLPLPVKTGLNVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLLLED 420
            +VLKLGRAFCFLPLPVKTGL VQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWN LLLED
Sbjct: 361  SVLKLGRAFCFLPLPVKTGLTVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNMLLLED 420

Query: 421  IIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQVYKNISNALVLYSNVEGG 480
            +IAPSFIELLIGVQVFLGPTD YFSLWPSGSFEEPWN LVEQVYK ISNALVLYSNVEGG
Sbjct: 421  VIAPSFIELLIGVQVFLGPTDAYFSLWPSGSFEEPWNKLVEQVYKTISNALVLYSNVEGG 480

Query: 481  KWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNMLLKFCRAFQQKVVTPCTV 540
            KWVSP +AFLHDDKF RS ELGEALVLLGMPIVHLPENLSNMLLKFC  FQQKVVTPCTV
Sbjct: 481  KWVSPNEAFLHDDKFARSTELGEALVLLGMPIVHLPENLSNMLLKFCCTFQQKVVTPCTV 540

Query: 541  RHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLPLIPLANGDFGLFSEASKG 600
            R FLR+CKH++TLNRPYKLVLLEYCIEDLIDADVC  AF LPL+PLANGDFGLFSEASKG
Sbjct: 541  RQFLRDCKHISTLNRPYKLVLLEYCIEDLIDADVCTHAFGLPLLPLANGDFGLFSEASKG 600

Query: 601  ISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSKSNIFIFNVHYFLQLFPKF 660
            ISYFICDELEY LLHQISDRVID NIPL IS RLSNIARSS SN+F FNVHYFLQLFPKF
Sbjct: 601  ISYFICDELEYKLLHQISDRVIDWNIPLAISARLSNIARSSTSNLFAFNVHYFLQLFPKF 660

Query: 661  VPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLSLFSDWPILPSKSRYLYRA 720
            VPADWKYKNEV WDPESCSNHPTSSWF LFW+YL D CEKLSLFSDWPILP KSRYLYRA
Sbjct: 661  VPADWKYKNEVFWDPESCSNHPTSSWFLLFWQYLRDHCEKLSLFSDWPILPCKSRYLYRA 720

Query: 721  TKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAIS 780
            +K+SK+INVQMLSNEMQ ILSKLGCKLLDPYYKVEHRDL HYVNDGNCTG+LDSIYDAIS
Sbjct: 721  SKQSKVINVQMLSNEMQNILSKLGCKLLDPYYKVEHRDLFHYVNDGNCTGILDSIYDAIS 780

Query: 781  STGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLEKCKRLPIYKVYNGGSAQD 840
            STGGL+LTSL++LEVEEKDGLRRFLLDPKWYLGG M D++LEKCKRLPI+KVYNGGSAQD
Sbjct: 781  STGGLLLTSLHNLEVEEKDGLRRFLLDPKWYLGGSMKDNELEKCKRLPIFKVYNGGSAQD 840

Query: 841  FGFSDLENPQKYLPPSDVGEFFLGVEFIVSSF-DSEVEILLKYYGIKKMGKASFYRKHVL 900
            FGFSDLE+P+KY PPS+VGE FLGVEFI SS  D E EILLKYYGIKKMGKASFYRKHVL
Sbjct: 841  FGFSDLESPRKYFPPSEVGECFLGVEFIFSSSDDGEEEILLKYYGIKKMGKASFYRKHVL 900

Query: 901  NQVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVPTSSGTLKCPTVLYDPRY 960
            NQV QLQP+LRD+TMLSVL NLPQLCVEDV FRECLSNL F+PTS GTLKCP  LYDPRY
Sbjct: 901  NQVGQLQPKLRDNTMLSVLLNLPQLCVEDVTFRECLSNLDFIPTSRGTLKCPAALYDPRY 960

Query: 961  EELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQSAQHVERLMHKDHNKAH 1020
            EELCALLDDFDSFPSTPFNESYILDILQGLGLRTCV+PETIVQSA HVER MH D NKAH
Sbjct: 961  EELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVSPETIVQSALHVERFMHMDQNKAH 1020

Query: 1021 SRGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPRNFNSDLEKFWNDLCKIS 1080
            SRGKVLLSYLEVNAIKWLLNP NE+ GMVNRLFSTAATAFRPRNF SDLEKFWNDL KIS
Sbjct: 1021 SRGKVLLSYLEVNAIKWLLNPTNEEHGMVNRLFSTAATAFRPRNFTSDLEKFWNDLRKIS 1080

Query: 1081 WCPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWS 1140
            WCPVLL+PPFETLPWP+VSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWS
Sbjct: 1081 WCPVLLTPPFETLPWPIVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWS 1140

Query: 1141 SPPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLTGLIGSDEMDVVKAVLEG 1200
            SPP GSIIAAQLLELGKNNEIV+D VLRKELA AMPRIYALLTGLIGSDEMDVVKAVLEG
Sbjct: 1141 SPPSGSIIAAQLLELGKNNEIVHDLVLRKELAQAMPRIYALLTGLIGSDEMDVVKAVLEG 1200

Query: 1201 CRWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFLELGIREFLKPNDYADIL 1260
            CRWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFK+LFL+LGIREFLKPNDYADIL
Sbjct: 1201 CRWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKELFLKLGIREFLKPNDYADIL 1260

Query: 1261 SRMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYNDAP 1320
            SRMAI+KGSS LN QEVRAAILIVQHLAEAQLPKQQIN+YLPDIS RLLPASNLVYNDAP
Sbjct: 1261 SRMAIRKGSSSLNTQEVRAAILIVQHLAEAQLPKQQINLYLPDISCRLLPASNLVYNDAP 1320

Query: 1321 WLLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADSMN 1380
            WLLGTDDTDV F+GES  VLNARKTVQ FVHGNISN+VAEKLGVCSLRRILLAESADSMN
Sbjct: 1321 WLLGTDDTDVSFNGESNFVLNARKTVQNFVHGNISNEVAEKLGVCSLRRILLAESADSMN 1380

Query: 1381 LSLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAEDAGASEVVFLLDKTHYGT 1440
            LSLSGAAEAFGQHEALTNRLRHILEMYADG GILFELIQNAEDAGASEVVFLLDKTHYGT
Sbjct: 1381 LSLSGAAEAFGQHEALTNRLRHILEMYADGSGILFELIQNAEDAGASEVVFLLDKTHYGT 1440

Query: 1441 SSILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHF 1500
            SSILSPEMADWQGPALYCYNDSVFSS DLYAISRVGQESKLQKPLSIGRFGLGFNCVYHF
Sbjct: 1441 SSILSPEMADWQGPALYCYNDSVFSSHDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHF 1500

Query: 1501 TDIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQ 1560
            TDIPTFVSGEN+VMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQ
Sbjct: 1501 TDIPTFVSGENVVMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQ 1560

Query: 1561 KPFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEVASDALLFLTNVKKISIF 1620
            KPFPGTLFRFPLRSPALASRSEIKKE YAPEDV SLFYSFSEVASDALLFLTNVK ISIF
Sbjct: 1561 KPFPGTLFRFPLRSPALASRSEIKKEAYAPEDVLSLFYSFSEVASDALLFLTNVKTISIF 1620

Query: 1621 IKDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQGEMDREQFLMKLSKSINR 1680
             KDDI H+MQCLYRVHKNT++EPSTESSA+QDII+FI GNRQGE+DREQFL KL+KSI++
Sbjct: 1621 TKDDIGHEMQCLYRVHKNTITEPSTESSAQQDIINFICGNRQGELDREQFLRKLNKSISK 1680

Query: 1681 DLPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLGDKSYNFIPWACVAALLH 1740
            DLPYKCQKLIITEKSS GDILQHYWITSGCLGGGLPRNNSG+GDKSYNFIPWACVAALLH
Sbjct: 1681 DLPYKCQKLIITEKSSGGDILQHYWITSGCLGGGLPRNNSGVGDKSYNFIPWACVAALLH 1740

Query: 1741 SVQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAFCFLPLPVRTGLPVHVNA 1800
            SV+VD EMNYD E ENNWL+ASD VQVSSASI+GRKP EGRAFCFLPLP++TGLPVHVNA
Sbjct: 1741 SVKVDEEMNYDQEAENNWLIASDSVQVSSASIQGRKPLEGRAFCFLPLPIKTGLPVHVNA 1800

Query: 1801 YFELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRLLEKIVSEIGHSGLFSSF 1860
            YFELSSNRRDIWYGDDMAGGGKKRSEWNSYLLE+VVAPAYG LLEKI SEIGHSGLFSSF
Sbjct: 1801 YFELSSNRRDIWYGDDMAGGGKKRSEWNSYLLENVVAPAYGHLLEKIASEIGHSGLFSSF 1860

Query: 1861 WPTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQAIFPDFSFDKVYELIEA 1920
            WP+TAGLEPWGSVVR LYSFIGDFG+LVLYTNARGGQWIS +QAIFPDFSFDKV+ELIEA
Sbjct: 1861 WPSTAGLEPWGSVVRNLYSFIGDFGILVLYTNARGGQWISARQAIFPDFSFDKVHELIEA 1920

Query: 1921 LADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLV 1980
            L+DSGLP+I+ SKSIVDRFMEV PSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLV
Sbjct: 1921 LSDSGLPLISTSKSIVDRFMEVCPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLV 1980

Query: 1981 DLKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDEYGLLKDSVPGQLVDPGI 2040
            DLKLPFQS+SLC LPLLPLADG+FT+F+KNG+GERTYIARGDEYG+LK+SVP QLVDP I
Sbjct: 1981 DLKLPFQSESLCRLPLLPLADGTFTTFNKNGIGERTYIARGDEYGILKESVPSQLVDPDI 2040

Query: 2041 PEVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNARQVNWNPGHQGQPSLEWI 2100
            PE VHAKLCEVAQTEDLNICFLSC LLEKLFLRFLP EWQNARQVNWNPGHQ  PSLEWI
Sbjct: 2041 PEAVHAKLCEVAQTEDLNICFLSCHLLEKLFLRFLPTEWQNARQVNWNPGHQSHPSLEWI 2100

Query: 2101 RLIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKADGWSENMFSLLLKVGCLF 2160
            RL+WCYLK HC+DLSQFSKWPILPVG+NSLLQLVENSNVL+ADGWSENMFSLLLKVGCLF
Sbjct: 2101 RLVWCYLKLHCNDLSQFSKWPILPVGENSLLQLVENSNVLRADGWSENMFSLLLKVGCLF 2160

Query: 2161 LRRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFRDASESELHELRSFILQS 2220
            LRRDMPIEHPQLEN+VHPSTA GILNAFL+IAGDIENVEGLF DA E ELHELRSFILQS
Sbjct: 2161 LRRDMPIEHPQLENFVHPSTATGILNAFLAIAGDIENVEGLFHDACEGELHELRSFILQS 2220

Query: 2221 KWYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPTGLCEDFLNDDFVRMESE 2280
            KW+LEG MEA HVDIIK IPMFESYK RKLVSLS+P+RWIKPTGL EDFLNDDFVR+ESE
Sbjct: 2221 KWFLEGNMEATHVDIIKRIPMFESYKSRKLVSLSQPVRWIKPTGLYEDFLNDDFVRIESE 2280

Query: 2281 KERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTILHDVKLLIEEDVSLKSSV 2340
            KERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSER ALSTIL DVKLLIEED SLKSSV
Sbjct: 2281 KERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSEREALSTILLDVKLLIEEDASLKSSV 2340

Query: 2341 SMIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFSDDDILDALVSLGLRRSL 2400
            SMIPFVL  NGSWQPPSRLYDPRV EL NMLHEETFFPSE FSDDDILDALVSLGL RSL
Sbjct: 2341 SMIPFVLTGNGSWQPPSRLYDPRVHELKNMLHEETFFPSEIFSDDDILDALVSLGLNRSL 2400

Query: 2401 DLTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSIKVEGSGYELQNSMLIKS 2460
             LTG LDCARSV LLNDS+N    SYARRLFVCLDALAHKLSI VEG+ YELQ SML++S
Sbjct: 2401 GLTGFLDCARSVPLLNDSEN----SYARRLFVCLDALAHKLSINVEGNCYELQTSMLVES 2460

Query: 2461 NYVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSEMNTIAWCPICADSPLKV 2520
            +YVDDD SMEVGSL+ EDTSDMG DSLIGNLTGDESEEEFWSEM TIAWCP+CADSP+KV
Sbjct: 2461 DYVDDDTSMEVGSLDREDTSDMGIDSLIGNLTGDESEEEFWSEMKTIAWCPVCADSPVKV 2520

Query: 2521 LPWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQHKLGWTDCPRVEVLCAQL 2580
            LPWLKT N+VAPPSIVRPKSQMWMVSSSMHILDGV PS YLQHKLGWTDCPRVEVLCAQL
Sbjct: 2521 LPWLKTCNKVAPPSIVRPKSQMWMVSSSMHILDGVLPSVYLQHKLGWTDCPRVEVLCAQL 2580

Query: 2581 TDISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDESVLLKSALNGVSWVWVG 2640
            TDISKLYGELRLHSS+EPDINTALQ+GI ILYSKLQEY GTD+ VLLKSALNGVSWVWVG
Sbjct: 2581 TDISKLYGELRLHSSIEPDINTALQDGITILYSKLQEYRGTDDLVLLKSALNGVSWVWVG 2640

Query: 2641 DDFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVEGYLDVLQRLHSDV 2700
            DDFV PSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNV+ YLDVLQRLH DV
Sbjct: 2641 DDFVSPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVKDYLDVLQRLHKDV 2700

Query: 2701 EGSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSSQVLMQANDLVYNDAPWM 2760
            +GSPLSTDQM+FVIC+LEAI DCC+DKPEFTATS  LLIPNSSQVLM ANDLVYNDAPWM
Sbjct: 2701 KGSPLSTDQMNFVICVLEAILDCCMDKPEFTATSIPLLIPNSSQVLMLANDLVYNDAPWM 2760

Query: 2761 EDNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDLPCMDYAKISELLMLYGN 2820
            E+NNILVGKHFVHPSIS+DLASRLGVQSIRCLSLVDEEMTKDLPCM+YAKISELL LYGN
Sbjct: 2761 EENNILVGKHFVHPSISHDLASRLGVQSIRCLSLVDEEMTKDLPCMEYAKISELLKLYGN 2820

Query: 2821 DYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLNTEEI 2880
            DYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVA+FEGSSL+TEEI
Sbjct: 2821 DYLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAVFEGSSLSTEEI 2880

Query: 2881 SSLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIFDPRGIALSVAPKSAPGA 2940
            S LQFRPPWKLRGDTLNYGLGLLSCYYVCDLL IVSGGYFYIFDPRGIALSVAPKSAPGA
Sbjct: 2881 SGLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLYIVSGGYFYIFDPRGIALSVAPKSAPGA 2940

Query: 2941 KVFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPACLKDGLEPGIRKIKEIS 3000
            K+FSLIGSNLIE+F DQFHP+LGGQNMSWPSDSTI+RMPLS ACLKDGLE GI +IK+IS
Sbjct: 2941 KMFSLIGSNLIERFKDQFHPLLGGQNMSWPSDSTIIRMPLSSACLKDGLESGIERIKKIS 3000

Query: 3001 SKFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLSSAIARNPFSEKKWKKFQ 3060
            SKFLDHASRSLLFLKSV+QVSFSTWDQG L+PYQDYSVC+NLSSAIARNPFSEKKWKKFQ
Sbjct: 3001 SKFLDHASRSLLFLKSVLQVSFSTWDQGELNPYQDYSVCVNLSSAIARNPFSEKKWKKFQ 3060

Query: 3061 LSRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPV 3120
            LSRLFSSSNAATK+H ID+I+FQG+ QFVDRWLVVL+LGSGQTRNMALDRRYLAYNLTPV
Sbjct: 3061 LSRLFSSSNAATKVHAIDVILFQGDAQFVDRWLVVLTLGSGQTRNMALDRRYLAYNLTPV 3120

Query: 3121 AGVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFLVCHSGGRYLFKNQVLEA 3180
            AG+AAHISRNGLPA+I  KSPLMAP PLSGDI LPVTVLGCFLVCH+ GRYLFKNQVLEA
Sbjct: 3121 AGLAAHISRNGLPANIYLKSPLMAPFPLSGDIILPVTVLGCFLVCHNAGRYLFKNQVLEA 3180

Query: 3181 VAAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSSSALESNVSHSISSSLKA 3240
               PLDAGNKLVEAWNRELMSCVCDSYI+M+LEIHKQRKESSSSALESNVSHSISSSLKA
Sbjct: 3181 FVEPLDAGNKLVEAWNRELMSCVCDSYIFMVLEIHKQRKESSSSALESNVSHSISSSLKA 3240

Query: 3241 YGNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNLVK 3300
            YGNQVYSFWPRS   NGSDSDLDRGLKADWECLVEQVIRPFY RAIDLPVWQLYSGNLVK
Sbjct: 3241 YGNQVYSFWPRS--GNGSDSDLDRGLKADWECLVEQVIRPFYARAIDLPVWQLYSGNLVK 3300

Query: 3301 AEEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVR 3360
            AEEGMFLAQPGSPVG NLLPATVC FVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVR
Sbjct: 3301 AEEGMFLAQPGSPVGDNLLPATVCSFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVR 3360

Query: 3361 DLLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMGADSVNTNPGGRSTNTSE 3420
            DLLRVSSASIVLQSIDTYLDVLEYCLSDI+LA  SNHAEDS+GA S+NTNPGGRSTNT+E
Sbjct: 3361 DLLRVSSASIVLQSIDTYLDVLEYCLSDIVLATSSNHAEDSIGAGSINTNPGGRSTNTTE 3420

Query: 3421 GSSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGRSGDSLSHS 3480
             S TS+ VSS++SFAR SNQN ASSGDALEMMTSLGRALLDFGRGVVEDIGRSG+S  HS
Sbjct: 3421 DSPTSIPVSSVHSFARSSNQNGASSGDALEMMTSLGRALLDFGRGVVEDIGRSGESSFHS 3480

Query: 3481 NTFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSMELWLGSKDQQELMIPLA 3540
            NTF GRNN SYRNVDQ+FLQMVSE+KGLPFP+ASNNLVRLGSMELWLGSKDQQELMIPLA
Sbjct: 3481 NTFNGRNN-SYRNVDQHFLQMVSELKGLPFPTASNNLVRLGSMELWLGSKDQQELMIPLA 3540

Query: 3541 ARFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRSVFHANWVNHVMNSNMSP 3600
            A+F+HPK+FDRSILGNILTNDALHKFLKLQKFSL+LLAT+MRSVFHANWVNHVMNSNM+P
Sbjct: 3541 AKFLHPKIFDRSILGNILTNDALHKFLKLQKFSLNLLATHMRSVFHANWVNHVMNSNMAP 3600

Query: 3601 WFSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPLVPAFLGRPILCRVRERH 3660
            WFSW+NK  SGVEEGP+SEWIR+FWKN   +SQDLLLFSDWPL+PAFLGRPILCRVRERH
Sbjct: 3601 WFSWDNKLSSGVEEGPTSEWIRIFWKNFSGTSQDLLLFSDWPLIPAFLGRPILCRVRERH 3660

Query: 3661 LVFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPYTSAFQKFQDTYPWLFPL 3720
            LVFLPPV + V PN+ILEIGAG SDVAETS S ISK ES+QPYT AFQ+FQD YPWLFPL
Sbjct: 3661 LVFLPPVAHSVSPNSILEIGAGDSDVAETSLSDISKLESLQPYTLAFQRFQDMYPWLFPL 3720

Query: 3721 LNHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANNAGYFPELASLSDSNSNE 3780
            LN CNIPIFDVAFMDCA+LCNC+ NSGQ LGQIIAS FVAA NAGYFPELASLSDSNS+E
Sbjct: 3721 LNQCNIPIFDVAFMDCAALCNCVPNSGQPLGQIIASKFVAAKNAGYFPELASLSDSNSDE 3780

Query: 3781 LLNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDNDQCMISSNSFLKPYNDC 3840
            LL LFA DF SNGTNYGR+ELEILR LPIYRTVVGSYTQLR+NDQCMISSNSFLKP N+C
Sbjct: 3781 LLKLFAKDFASNGTNYGREELEILRTLPIYRTVVGSYTQLRENDQCMISSNSFLKPNNEC 3840

Query: 3841 CLSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSEQEDILIYLFTNWQDLQA 3900
            CLSYSSNSMEYSLLRALRVPELD++QILI+FG P FDCKPQSEQEDILIYL+TNW+DLQA
Sbjct: 3841 CLSYSSNSMEYSLLRALRVPELDDEQILIKFGFPGFDCKPQSEQEDILIYLYTNWRDLQA 3900

Query: 3901 DAHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFSGERKKFPGERFAADGWL 3960
            +A LVECLSET FVRSADEFCTDLFKSKELYDPSDALLTSVFS ERKKFPGERFAADGWL
Sbjct: 3901 NARLVECLSETKFVRSADEFCTDLFKSKELYDPSDALLTSVFSDERKKFPGERFAADGWL 3960

Query: 3961 RILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDLINGQNEVPMEVWTLAGS 4020
             ILRK+GLRTT EANVILECAKKVETLGSEWRKSEEDG EFDL N Q+EVPME+WTLAGS
Sbjct: 3961 HILRKVGLRTTAEANVILECAKKVETLGSEWRKSEEDGFEFDLTNAQSEVPMEIWTLAGS 4020

Query: 4021 VVEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGKRVLTSYGDGIVSKDWPL 4080
            VVEAVFSNFAVFYSN+FCNALGNI FVPA+LGFPNLGGNKGGKRVL SY D I SKDWPL
Sbjct: 4021 VVEAVFSNFAVFYSNSFCNALGNIVFVPAELGFPNLGGNKGGKRVLASYSDAIASKDWPL 4080

Query: 4081 AWSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTLAHWPISMGVM 4140
            AWSCAPILSKH VIPP+Y+WGALNL+SPPAFP VLKHLQVIGRNGGEDTL+HWPIS+G+M
Sbjct: 4081 AWSCAPILSKHCVIPPEYAWGALNLKSPPAFPKVLKHLQVIGRNGGEDTLSHWPISVGIM 4140

Query: 4141 SINEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSPFA 4200
            SINEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSP A
Sbjct: 4141 SINEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSPIA 4200

Query: 4201 FELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQRLNPNELRSVMEILHYIC 4260
            F+LPSGYLPFVKIL+DLGLQD LSVASAKDLLSSLQVACGYQRLNPNELRSVMEILH+IC
Sbjct: 4201 FQLPSGYLPFVKILRDLGLQDGLSVASAKDLLSSLQVACGYQRLNPNELRSVMEILHFIC 4260

Query: 4261 DEAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVHSDLPER 4320
            DEA EAK+F GREPE+IVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVH DLPE+
Sbjct: 4261 DEATEAKIFYGREPEVIVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVHPDLPEK 4320

Query: 4321 ICRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLLSRSFQNAVWNVVNSMVN 4380
            ICR+LGIKKLS+LVIEELDHEDSIEPLE IGAVSL  I++KLLSRSFQ+AVWNV NSMV+
Sbjct: 4321 ICRILGIKKLSELVIEELDHEDSIEPLECIGAVSLGRIKEKLLSRSFQSAVWNVANSMVS 4380

Query: 4381 YIHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDITRPAKDSIIPEWKDGIH 4440
            Y H NKNLDL+AVE+LLKS AERLQFVK LHTRFLLLPNSI+ITRP+KDSIIPEW+DG H
Sbjct: 4381 YTHTNKNLDLEAVEELLKSFAERLQFVKCLHTRFLLLPNSINITRPSKDSIIPEWEDGRH 4440

Query: 4441 HRALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPLPVGSLLFCPEGTEIAII 4500
            HRALYFV  SKTCILVAEPPA ISIFDV+AIVVSQILGSPIPLP+ SL FCPEG E AI+
Sbjct: 4441 HRALYFVKQSKTCILVAEPPACISIFDVLAIVVSQILGSPIPLPICSLFFCPEGIETAIV 4500

Query: 4501 NILKLCSEKENEQFTGISSLLGKEILPQDALQLQLHPLRPFYAAEVVAWRSQSGEKLKYG 4560
            +ILKLCSEK NE+FTGISSL+GKEILPQDALQ+QLHPLRPFYA EVVAWRSQSGEKLKYG
Sbjct: 4501 DILKLCSEKTNEKFTGISSLIGKEILPQDALQIQLHPLRPFYAGEVVAWRSQSGEKLKYG 4560

Query: 4561 RVPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISIDGSHSSTNLQDSGHMIID 4620
             V EDVRPSAGQALY+FRVET  GI QSL+SSQVLSFRSISIDG  SS+NL D+ HM+ID
Sbjct: 4561 MVLEDVRPSAGQALYRFRVETTSGIIQSLLSSQVLSFRSISIDGGPSSSNLLDTSHMVID 4620

Query: 4621 SGASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLSTAGINVDIERQSLLQKT 4680
            +G+SV+MPEN E GKI+SQP AELQYGRVSAEEL+QAVHEML+TAGINVDIERQSLLQKT
Sbjct: 4621 NGSSVQMPENLESGKIQSQPAAELQYGRVSAEELMQAVHEMLTTAGINVDIERQSLLQKT 4680

Query: 4681 VVLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLCRKC 4740
            + LQEQLKDSQAALLLEQE+SDAAAKEADTAKAAW+CRVCLTSEVEITIVPCGHVLCR+C
Sbjct: 4681 ITLQEQLKDSQAALLLEQEKSDAAAKEADTAKAAWVCRVCLTSEVEITIVPCGHVLCRRC 4739

Query: 4741 SAAVSK 4746
            S+AVSK
Sbjct: 4741 SSAVSK 4739

BLAST of Moc09g05800 vs. ExPASy TrEMBL
Match: A0A5A7URT0 (Sacsin isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold339G00400 PE=4 SV=1)

HSP 1 Score: 8593.8 bits (22298), Expect = 0.0e+00
Identity = 4233/4746 (89.19%), Positives = 4472/4746 (94.23%), Query Frame = 0

Query: 1    MASESTSLDSIFLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD 60
            MASESTSLDSI LEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD
Sbjct: 1    MASESTSLDSILLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLD 60

Query: 61   RRVHGSESLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF 120
            RRVHG ESLLS SLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF
Sbjct: 61   RRVHGRESLLSASLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGF 120

Query: 121  NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIRCSAISQYRDQFLPYCA 180
            NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSA+NPGKRIDFIR SAISQYRDQFLPYCA
Sbjct: 121  NSVYHLTELPSFVSGKYVVMFDPQGIYLPKVSASNPGKRIDFIRSSAISQYRDQFLPYCA 180

Query: 181  FDCDMESSFDGTLFRLPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV 240
            FDC+MESSF GTLFR PLRN DQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV
Sbjct: 181  FDCNMESSFAGTLFRFPLRNTDQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSV 240

Query: 241  SCIEMFVWNDGEAEPQKLYSFSVRSASSDIIWHRQMLLRLSKSTTSTQSEVDSFSLDFLS 300
             CIEMFVWNDGE EPQKLYSFS+RSA+SD IWHRQMLLRLSKSTT TQSEVDSFSL+FLS
Sbjct: 241  LCIEMFVWNDGETEPQKLYSFSLRSANSDTIWHRQMLLRLSKSTTFTQSEVDSFSLEFLS 300

Query: 301  QATTGTQIEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSSKT 360
            QA TGTQ EERIDSFFIVQTMASATSRI SFAATASKEYDIHLLPWASLAVCT+  SS  
Sbjct: 301  QAMTGTQTEERIDSFFIVQTMASATSRIGSFAATASKEYDIHLLPWASLAVCTT-ASSND 360

Query: 361  NVLKLGRAFCFLPLPVKTGLNVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLLLED 420
            +VLKLGRAFCFLPLPVKTGL VQVNGFFEVSSNRRGIWYG DMDRSGKIRSIWNRLLLED
Sbjct: 361  SVLKLGRAFCFLPLPVKTGLTVQVNGFFEVSSNRRGIWYGGDMDRSGKIRSIWNRLLLED 420

Query: 421  IIAPSFIELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQVYKNISNALVLYSNVEGG 480
            IIAP+FIELL+GVQ+ LGPTDTYFSLWPSGSFEEPWNILVEQVYK ISNALVLYSNV+GG
Sbjct: 421  IIAPAFIELLLGVQILLGPTDTYFSLWPSGSFEEPWNILVEQVYKIISNALVLYSNVDGG 480

Query: 481  KWVSPIQAFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNMLLKFCRAFQQKVVTPCTV 540
            KWVSP  AFLHDDKFTRS EL EALVLLGMPIVHLPE LSNMLLKFC  FQQKVVTPCTV
Sbjct: 481  KWVSPNDAFLHDDKFTRSTELSEALVLLGMPIVHLPETLSNMLLKFCSTFQQKVVTPCTV 540

Query: 541  RHFLRECKHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLPLIPLANGDFGLFSEASKG 600
            RHFLRECKHV TLNRPY+LVLLEYCIEDLIDADVC   F LPL+PLANGDFGLFSEASKG
Sbjct: 541  RHFLRECKHVFTLNRPYRLVLLEYCIEDLIDADVCTNLFGLPLLPLANGDFGLFSEASKG 600

Query: 601  ISYFICDELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSKSNIFIFNVHYFLQLFPKF 660
            ISYFICDELEY LLHQISDR IDRNIPL ISTRLSNIA+SSKSN+FI NVHYFLQLFPKF
Sbjct: 601  ISYFICDELEYKLLHQISDRAIDRNIPLTISTRLSNIAKSSKSNLFILNVHYFLQLFPKF 660

Query: 661  VPADWKYKNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLSLFSDWPILPSKSRYLYRA 720
            VPADWKYK+EV WDPESCSNHPTSSWF LFW YL + CE LSLFSDWPILPSKSRYLYRA
Sbjct: 661  VPADWKYKSEVFWDPESCSNHPTSSWFLLFWEYLREHCENLSLFSDWPILPSKSRYLYRA 720

Query: 721  TKKSKLINVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAIS 780
            TK+SK+INVQMLS+EMQ IL KLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAIS
Sbjct: 721  TKQSKMINVQMLSHEMQNILGKLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAIS 780

Query: 781  STGGLMLTSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLEKCKRLPIYKVYNGGSAQD 840
            STGGLMLTSLY+LEVEEKDGLRRFLLDPKWYLGGCM+D+DL+KC+RLPI+KVYNGGSAQD
Sbjct: 781  STGGLMLTSLYNLEVEEKDGLRRFLLDPKWYLGGCMDDNDLDKCRRLPIFKVYNGGSAQD 840

Query: 841  FGFSDLENPQKYLPPSDVGEFFLGVEFIVSSFDSEVEILLKYYGIKKMGKASFYRKHVLN 900
            F FSDLE+P KYLPP DV E FLGVEFI+SS DSE EILLKYYGIK+MGK SFYRK+VLN
Sbjct: 841  FCFSDLEDPPKYLPPLDVEECFLGVEFIISSSDSEEEILLKYYGIKRMGKTSFYRKYVLN 900

Query: 901  QVEQLQPELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVPTSSGTLKCPTVLYDPRYE 960
            +V QLQPELRDSTMLS+L +LPQLC EDV FRECLSNL F+PTSSGTLKCP VLYDPRYE
Sbjct: 901  KVGQLQPELRDSTMLSLLVSLPQLCTEDVTFRECLSNLDFIPTSSGTLKCPAVLYDPRYE 960

Query: 961  ELCALLDDFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQSAQHVERLMHKDHNKAHS 1020
            ELCALLDDFDSFPSTPF+ESYILDILQGLGLR CV+PETIVQSA HVER MHKD NKAHS
Sbjct: 961  ELCALLDDFDSFPSTPFSESYILDILQGLGLRRCVSPETIVQSALHVERFMHKDQNKAHS 1020

Query: 1021 RGKVLLSYLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPRNFNSDLEKFWNDLCKISW 1080
            RGKVLLSYLEVNAIKWLLNP +EDQGMVNRLFSTAATAFRPRNF SDLEKFWNDL KISW
Sbjct: 1021 RGKVLLSYLEVNAIKWLLNPTSEDQGMVNRLFSTAATAFRPRNFTSDLEKFWNDLRKISW 1080

Query: 1081 CPVLLSPPFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWSS 1140
            CPVLLSPPFET+PWPVVSS+VAPPKLVRLPKDLWLVSASMRILDGEC+SSALAHSLGWSS
Sbjct: 1081 CPVLLSPPFETVPWPVVSSVVAPPKLVRLPKDLWLVSASMRILDGECASSALAHSLGWSS 1140

Query: 1141 PPGGSIIAAQLLELGKNNEIVYDQVLRKELALAMPRIYALLTGLIGSDEMDVVKAVLEGC 1200
            PP GSIIAAQLLELGKNNEIVYDQ+LRKELALAMPRIYALLT LIGSDEMDVVKAVLEGC
Sbjct: 1141 PPSGSIIAAQLLELGKNNEIVYDQMLRKELALAMPRIYALLTSLIGSDEMDVVKAVLEGC 1200

Query: 1201 RWIWVGDGFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFLELGIREFLKPNDYADILS 1260
            RWIWVGDGFATSEEVVL+GPLHLAPYIRVIPIDLAVFKDLFLELGIREFLKPNDYA+ILS
Sbjct: 1201 RWIWVGDGFATSEEVVLEGPLHLAPYIRVIPIDLAVFKDLFLELGIREFLKPNDYANILS 1260

Query: 1261 RMAIKKGSSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYNDAPW 1320
            RMA +KGSSPLN QEVRAAILIVQHLAEAQLPKQQI+IYLPDIS RL PA NLVYNDAPW
Sbjct: 1261 RMATRKGSSPLNTQEVRAAILIVQHLAEAQLPKQQIDIYLPDISCRLFPAKNLVYNDAPW 1320

Query: 1321 LLGTDDTDVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADSMNL 1380
            LLGTD+ DV FDGES   L+ARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADSMNL
Sbjct: 1321 LLGTDNNDVSFDGESAAFLSARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADSMNL 1380

Query: 1381 SLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAEDAGASEVVFLLDKTHYGTS 1440
            SLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAEDAG+SEV+FLLDKTHYGTS
Sbjct: 1381 SLSGAAEAFGQHEALTNRLRHILEMYADGPGILFELIQNAEDAGSSEVIFLLDKTHYGTS 1440

Query: 1441 SILSPEMADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHFT 1500
            SILSPEMADWQGPALYCYNDSVFS QDLYAISRVGQESKLQKPL+IGRFGLGFNCVYHFT
Sbjct: 1441 SILSPEMADWQGPALYCYNDSVFSPQDLYAISRVGQESKLQKPLAIGRFGLGFNCVYHFT 1500

Query: 1501 DIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQK 1560
            DIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGR+ILEQFPDQFSPYLHFGCDM+K
Sbjct: 1501 DIPTFVSGENIVMFDPHACNLPGISPSHPGLRIKYAGRKILEQFPDQFSPYLHFGCDMEK 1560

Query: 1561 PFPGTLFRFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEVASDALLFLTNVKKISIFI 1620
            PFPGTLFRFPLRSPALASRSEIKKEGYAPEDV SLFYSFSEVASDAL+FLTNVK ISIF+
Sbjct: 1561 PFPGTLFRFPLRSPALASRSEIKKEGYAPEDVISLFYSFSEVASDALVFLTNVKTISIFV 1620

Query: 1621 KDDIEHDMQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQGEMDREQFLMKLSKSINRD 1680
            KDDI H+MQCLYRVHKNT+SEP+T+S+A+QDI+SFIYGN +GEMDREQFL KL+KSIN+D
Sbjct: 1621 KDDIGHEMQCLYRVHKNTISEPTTKSTAQQDIMSFIYGNHRGEMDREQFLTKLNKSINKD 1680

Query: 1681 LPYKCQKLIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLGDKSYNFIPWACVAALLHS 1740
            LPY CQKLIITEK S GDILQH+WI+SGCLGGGLPRNNSG+GDKSYNFIPWACVAALLHS
Sbjct: 1681 LPYVCQKLIITEKGSGGDILQHFWISSGCLGGGLPRNNSGVGDKSYNFIPWACVAALLHS 1740

Query: 1741 VQVDGEMNYDPETENNWLVASDLVQVSSASIEGRKPFEGRAFCFLPLPVRTGLPVHVNAY 1800
            V+VD EMN+DPETENNWL+ASDLVQVSSAS++ RKP EGRAFCFLPLP++TGLPVHVNAY
Sbjct: 1741 VKVDEEMNHDPETENNWLIASDLVQVSSASVQDRKPLEGRAFCFLPLPIKTGLPVHVNAY 1800

Query: 1801 FELSSNRRDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRLLEKIVSEIGHSGLFSSFW 1860
            FELSSNRRDIWYGDDMAGGG+KRSEWNSYLLE+VVAPAYG LLEK+ SE+GH GLFSSFW
Sbjct: 1801 FELSSNRRDIWYGDDMAGGGRKRSEWNSYLLEEVVAPAYGHLLEKVASEVGHFGLFSSFW 1860

Query: 1861 PTTAGLEPWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQAIFPDFSFDKVYELIEAL 1920
            P  AG+EPWG VVRKLYSFIGDFGLLVLYTNARGGQWIS KQAIFPDFSFDKV+ELIEAL
Sbjct: 1861 PAAAGVEPWGLVVRKLYSFIGDFGLLVLYTNARGGQWISAKQAIFPDFSFDKVHELIEAL 1920

Query: 1921 ADSGLPVIAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLVD 1980
            +DSGLPVI+ISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLVD
Sbjct: 1921 SDSGLPVISISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLVD 1980

Query: 1981 LKLPFQSDSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDEYGLLKDSVPGQLVDPGIP 2040
            LKLP QSDSLCGLPLLPL DGSFTSFHKNG+GER YIARGDEYGLLKDSVP QLVDP IP
Sbjct: 1981 LKLPLQSDSLCGLPLLPLVDGSFTSFHKNGIGERIYIARGDEYGLLKDSVPSQLVDPDIP 2040

Query: 2041 EVVHAKLCEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNARQVNWNPGHQGQPSLEWIR 2100
            EVVHAKLCEVAQ E LNICFLSC LLEKLFLRFLP EWQNA+QVNW PG+QGQPSLEWIR
Sbjct: 2041 EVVHAKLCEVAQAEGLNICFLSCDLLEKLFLRFLPTEWQNAKQVNWKPGYQGQPSLEWIR 2100

Query: 2101 LIWCYLKSHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKADGWSENMFSLLLKVGCLFL 2160
            LIWCYLKSHC+DLSQFSKWPILPVG+NSL+QLV+NSNVL+ADGWSENMFSLLLKVGCLFL
Sbjct: 2101 LIWCYLKSHCNDLSQFSKWPILPVGENSLMQLVQNSNVLRADGWSENMFSLLLKVGCLFL 2160

Query: 2161 RRDMPIEHPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFRDASESELHELRSFILQSK 2220
            RRDMPIEHPQLEN+VHPSTAIGILNAFLSIAG IENVE LF +ASE ELHE RSFILQSK
Sbjct: 2161 RRDMPIEHPQLENFVHPSTAIGILNAFLSIAGGIENVEKLFHNASEGELHEFRSFILQSK 2220

Query: 2221 WYLEGKMEAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPTGLCEDFLNDDFVRMESEK 2280
            W+LEG+MEA HVDIIKCIPMFESYKCRKLVSLSKP+RWIKPTG+CEDFLNDDFVR+ESEK
Sbjct: 2221 WFLEGQMEANHVDIIKCIPMFESYKCRKLVSLSKPVRWIKPTGICEDFLNDDFVRVESEK 2280

Query: 2281 ERIILKRYFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTILHDVKLLIEEDVSLKSSVS 2340
            ERIILK+YFGIGEPSRVEFYKDYVL+HMSEFLSER ALSTIL DVKLLIE+DVSLKSSVS
Sbjct: 2281 ERIILKKYFGIGEPSRVEFYKDYVLSHMSEFLSEREALSTILLDVKLLIEDDVSLKSSVS 2340

Query: 2341 MIPFVLASNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFSDDDILDALVSLGLRRSLD 2400
            MIPFVL  NGSWQPPSRLYDPRV EL NMLHEE FFPSE F DD+ILDALVSLGL+ SL 
Sbjct: 2341 MIPFVLTGNGSWQPPSRLYDPRVHELKNMLHEEAFFPSEKFLDDNILDALVSLGLKTSLG 2400

Query: 2401 LTGLLDCARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSIKVEGSGYELQNSMLIKSN 2460
            LTGLLDCARSVSLLNDS NSESQS  RRLFVCLDALAHKLSI VEG  +E QNSML KS+
Sbjct: 2401 LTGLLDCARSVSLLNDSNNSESQSLGRRLFVCLDALAHKLSINVEGICHEPQNSMLFKSD 2460

Query: 2461 YVDDDASMEVGSLNIEDTSDMGTDSLIGNLTGDESEEEFWSEMNTIAWCPICADSPLKVL 2520
            +VDDDASM+VGSLN EDTSDMG DS+IGNLT D SEEEFWSEM TIAWCP+CADSP+KVL
Sbjct: 2461 HVDDDASMQVGSLNREDTSDMGIDSIIGNLTSDGSEEEFWSEMKTIAWCPVCADSPVKVL 2520

Query: 2521 PWLKTNNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQHKLGWTDCPRVEVLCAQLT 2580
            PWLKT++QVAPP+ VRPKSQMWMVSSSMHILDG  PS YLQ KLGWT+CP VEVLC QLT
Sbjct: 2521 PWLKTSSQVAPPNNVRPKSQMWMVSSSMHILDGASPSVYLQQKLGWTECPSVEVLCGQLT 2580

Query: 2581 DISKLYGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDESVLLKSALNGVSWVWVGD 2640
            DISKLYGEL+LHSS   DINTALQ+GIPILYSKLQEY GTD+ + +KSALNGVSWVWVGD
Sbjct: 2581 DISKLYGELKLHSSTGSDINTALQDGIPILYSKLQEYRGTDDFLFIKSALNGVSWVWVGD 2640

Query: 2641 DFVPPSALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVEGYLDVLQRLHSDVE 2700
            DFV P+ALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNV+ YL VLQRLH DV 
Sbjct: 2641 DFVSPNALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVKEYLGVLQRLHRDVR 2700

Query: 2701 GSPLSTDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSSQVLMQANDLVYNDAPWME 2760
            GSPLSTDQM+FVIC+LEA+SDCCVD PEFTATS SLLIPNSSQVLM ANDLVYNDAPWME
Sbjct: 2701 GSPLSTDQMNFVICVLEAVSDCCVDMPEFTATSMSLLIPNSSQVLMLANDLVYNDAPWME 2760

Query: 2761 DNNILVGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDLPCMDYAKISELLMLYGND 2820
            DNNILVGKHFVHPSISNDLA RLGVQSIRCLSLVDEEMTKDLPCMDY+KIS+LL LYGND
Sbjct: 2761 DNNILVGKHFVHPSISNDLAGRLGVQSIRCLSLVDEEMTKDLPCMDYSKISDLLKLYGND 2820

Query: 2821 YLFFDLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLNTEEIS 2880
            YLFFDLLELADCC+AK LRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSL+TEEIS
Sbjct: 2821 YLFFDLLELADCCKAKNLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLSTEEIS 2880

Query: 2881 SLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIFDPRGIALSVAPKSAPGAK 2940
            SLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSI+SGGYFYIFDPRGIALSVA KSAPGAK
Sbjct: 2881 SLQFRPPWKLRGDTLNYGLGLLSCYYVCDLLSIISGGYFYIFDPRGIALSVAAKSAPGAK 2940

Query: 2941 VFSLIGSNLIEKFNDQFHPMLGGQNMSWPSDSTIVRMPLSPACLKDGLEPGIRKIKEISS 3000
            VFSLIGSNLIE+FNDQF+P+LGGQNMSWPSDSTI+RMPLSPACLKDGLE GI +IKE+SS
Sbjct: 2941 VFSLIGSNLIERFNDQFYPLLGGQNMSWPSDSTIIRMPLSPACLKDGLESGIIRIKELSS 3000

Query: 3001 KFLDHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLSSAIARNPFSEKKWKKFQL 3060
            KFLDHASRSLLFLKSVVQVSFSTWDQ GLH  QDYSV +NLSSAIARNPFSEKKWKKFQL
Sbjct: 3001 KFLDHASRSLLFLKSVVQVSFSTWDQDGLHLDQDYSVSVNLSSAIARNPFSEKKWKKFQL 3060

Query: 3061 SRLFSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPVA 3120
            SRLFSSSNAATK+H ID+I+ QGET+FVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPVA
Sbjct: 3061 SRLFSSSNAATKVHAIDVILLQGETRFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPVA 3120

Query: 3121 GVAAHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFLVCHSGGRYLFKNQVLEAV 3180
            GVAAHISRNGLPADI +KSPLMAP PLSGDI LPVTVLGCFLVCHSGGRYLFKNQVLE +
Sbjct: 3121 GVAAHISRNGLPADIYRKSPLMAPFPLSGDIILPVTVLGCFLVCHSGGRYLFKNQVLEGL 3180

Query: 3181 AAPLDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSSSALESNVSHSISSSLKAY 3240
              PLDAGNKLVEAWNRELMSCVCDSYI+MILE+HKQRKESSSSALES+VSHSIS SLKAY
Sbjct: 3181 VEPLDAGNKLVEAWNRELMSCVCDSYIFMILEVHKQRKESSSSALESSVSHSISLSLKAY 3240

Query: 3241 GNQVYSFWPRSEPANGSDSDLDRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNLVKA 3300
            GNQVYSFWPRSEPAN SDSDLDRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNLVKA
Sbjct: 3241 GNQVYSFWPRSEPANFSDSDLDRGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNLVKA 3300

Query: 3301 EEGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVRD 3360
            EEGMFLAQPGSPVGGNLLPATVC FVKEH+PVFSVPWELIKEIQAVGITVRQIRPKMVRD
Sbjct: 3301 EEGMFLAQPGSPVGGNLLPATVCSFVKEHHPVFSVPWELIKEIQAVGITVRQIRPKMVRD 3360

Query: 3361 LLRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMGADSVNTNPGGRSTNTSEG 3420
            LLR  SASIVLQSIDTYLDVLEYCLSDI+LAA  NHA D++G+D+VNT PGGRSTN++EG
Sbjct: 3361 LLRAPSASIVLQSIDTYLDVLEYCLSDIVLAASPNHAVDNVGSDTVNTIPGGRSTNSTEG 3420

Query: 3421 SSTSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGRSGDSLSHSN 3480
            SSTSV VSSM+SF R SNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGR+G+S SH N
Sbjct: 3421 SSTSVPVSSMHSFGRSSNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGRNGESSSHGN 3480

Query: 3481 TFTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSMELWLGSKDQQELMIPLAA 3540
            TFTGR NSSYRNVDQ+FLQMVSE+KGLPFP+ASN++VRLGSMELWLGSKDQQELMIPLAA
Sbjct: 3481 TFTGRINSSYRNVDQHFLQMVSELKGLPFPTASNSVVRLGSMELWLGSKDQQELMIPLAA 3540

Query: 3541 RFVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRSVFHANWVNHVMNSNMSPW 3600
            +FVHPK+FDRSILGNILTNDALHKFLKLQKFSL+LLAT+MRSVFHANWVNHVM+SNM+PW
Sbjct: 3541 KFVHPKIFDRSILGNILTNDALHKFLKLQKFSLNLLATHMRSVFHANWVNHVMSSNMAPW 3600

Query: 3601 FSWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPLVPAFLGRPILCRVRERHL 3660
            FSW+NKS +GVEEGPSSEWIRLFWKN+  SSQ+LL+FS+WPLVPAFLGRPILCRVRERHL
Sbjct: 3601 FSWDNKSNAGVEEGPSSEWIRLFWKNSSGSSQNLLVFSEWPLVPAFLGRPILCRVRERHL 3660

Query: 3661 VFLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPYTSAFQKFQDTYPWLFPLL 3720
            VFLPPVT+P  PN+I E+ AGGSDVAETS+S ISKPESIQ YTSAFQ+FQDTYPWLFPLL
Sbjct: 3661 VFLPPVTHPTSPNSISEVVAGGSDVAETSSSEISKPESIQHYTSAFQRFQDTYPWLFPLL 3720

Query: 3721 NHCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANNAGYFPELASLSDSNSNEL 3780
            NHCNIPIFDVAF+DCA+LCN L NS QSLGQ IAS FVAA NAGYFPELASLSDSNS+EL
Sbjct: 3721 NHCNIPIFDVAFVDCAALCNSLPNSNQSLGQAIASKFVAAKNAGYFPELASLSDSNSHEL 3780

Query: 3781 LNLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDNDQCMISSNSFLKPYNDCC 3840
            LNLFA DFVSN TNY R+ELEILR LPIYRTV+GSYTQL +N+QCMISSNSFLKPYN  C
Sbjct: 3781 LNLFAKDFVSNQTNYRREELEILRTLPIYRTVIGSYTQLSENEQCMISSNSFLKPYNKSC 3840

Query: 3841 LSYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSEQEDILIYLFTNWQDLQAD 3900
            LSYSSNSMEYSLLRAL VPELD+QQIL++FGLP F  KPQSEQEDILIYL+TNW+DLQ+D
Sbjct: 3841 LSYSSNSMEYSLLRALGVPELDDQQILVKFGLPGFHSKPQSEQEDILIYLYTNWKDLQSD 3900

Query: 3901 AHLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFSGERKKFPGERFAADGWLR 3960
            A LVECL ET FVRSADEFCTDLFKSKELYDPSDALL SVFSGER+KFPGERF ADGWL+
Sbjct: 3901 AQLVECLRETKFVRSADEFCTDLFKSKELYDPSDALLMSVFSGERRKFPGERFGADGWLQ 3960

Query: 3961 ILRKIGLRTTTEANVILECAKKVETLGSEWRKSEEDGSEFDLINGQNEVPMEVWTLAGSV 4020
            ILRKIGLRT  EANVILECAKKVETLGSEWRKSEE+  +FDL N QNEVPME+WTL GSV
Sbjct: 3961 ILRKIGLRTAGEANVILECAKKVETLGSEWRKSEENSFDFDLTNAQNEVPMEIWTLGGSV 4020

Query: 4021 VEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGKRVLTSYGDGIVSKDWPLA 4080
            VEAVFSNFAVFYSN+FCNALGNI FVPA+LGFPNLGGNKGGKRVLTSY D IVSKDW LA
Sbjct: 4021 VEAVFSNFAVFYSNSFCNALGNIIFVPAELGFPNLGGNKGGKRVLTSYSDAIVSKDWHLA 4080

Query: 4081 WSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTLAHWPISMGVMS 4140
            WSCAPILSKHSVIPP+YSWGALNLRSPPAFPTVLKHLQV GRNGGEDTL+HWPIS+GVMS
Sbjct: 4081 WSCAPILSKHSVIPPEYSWGALNLRSPPAFPTVLKHLQVTGRNGGEDTLSHWPISVGVMS 4140

Query: 4141 INEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSPFAF 4200
            INEASCEVLKYLERIWS+LSSLD+LELQ+VAFIPVANATRLVKAN LFARLTINLSPFAF
Sbjct: 4141 INEASCEVLKYLERIWSSLSSLDILELQKVAFIPVANATRLVKANVLFARLTINLSPFAF 4200

Query: 4201 ELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQRLNPNELRSVMEILHYICD 4260
            ELPSGYLPFVKIL+DLGLQDVLS ASAKDLLSSLQVACGYQRLNPNELRSVMEILH+ICD
Sbjct: 4201 ELPSGYLPFVKILQDLGLQDVLSAASAKDLLSSLQVACGYQRLNPNELRSVMEILHFICD 4260

Query: 4261 EAMEAKMFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVHSDLPERI 4320
            EA E KMFDGRE EIIVPDDGCRLVHA SC YIDTYGSRYIKC+DTSRLRFVH DLPERI
Sbjct: 4261 EATEEKMFDGRELEIIVPDDGCRLVHAASCVYIDTYGSRYIKCVDTSRLRFVHPDLPERI 4320

Query: 4321 CRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLLSRSFQNAVWNVVNSMVNY 4380
            CRMLGIKKLSDLVIEELDHEDSI+PLE IGAVSL FI+ KLLS+SFQNAVW + NSMVNY
Sbjct: 4321 CRMLGIKKLSDLVIEELDHEDSIDPLEHIGAVSLGFIKTKLLSKSFQNAVWKIANSMVNY 4380

Query: 4381 IHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDITRPAKDSIIPEWKDGIHH 4440
            IH NKNLDL+ VE+LLKS+AERLQFVK LHT+FLLLPNSI+ITRPAKDSIIPEW+DG HH
Sbjct: 4381 IHPNKNLDLEVVEELLKSVAERLQFVKCLHTQFLLLPNSINITRPAKDSIIPEWEDGSHH 4440

Query: 4441 RALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPLPVGSLLFCPEGTEIAIIN 4500
            RALYF+  SKT ILVAEPPAYIS+FDVIAIVVSQILGSPIPLP+GSLLFCPEGTE  II+
Sbjct: 4441 RALYFIKQSKTYILVAEPPAYISVFDVIAIVVSQILGSPIPLPIGSLLFCPEGTENPIID 4500

Query: 4501 ILKLCSE-KENEQFTGISSLLGKEILPQDALQLQLHPLRPFYAAEVVAWRSQSGEKLKYG 4560
            IL LCSE KENE++ GIS+L+GKEILPQDALQ+QLHPLRPFYA EVVAWRS+SGEKLKYG
Sbjct: 4501 ILNLCSEKKENEKYAGISNLVGKEILPQDALQVQLHPLRPFYAGEVVAWRSKSGEKLKYG 4560

Query: 4561 RVPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISIDGSHSSTNLQDSGHMIID 4620
            RV EDVRPSAGQALY+FRVETA GI QSL+SSQVLSFRSI IDG  SSTNLQD   M+ D
Sbjct: 4561 RVLEDVRPSAGQALYRFRVETAAGIIQSLLSSQVLSFRSIPIDGGSSSTNLQDKSWMVTD 4620

Query: 4621 SGASVEMPENSERGKIRSQPVAELQYGRVSAEELVQAVHEMLSTAGINVDIERQSLLQKT 4680
            SGAS++MPE SE GKIR+QPVAELQYG+VSAEELVQAV+EML+TAGINVDIERQSLLQK 
Sbjct: 4621 SGASIKMPEISEGGKIRAQPVAELQYGKVSAEELVQAVNEMLTTAGINVDIERQSLLQKA 4680

Query: 4681 VVLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLCRKC 4740
            ++LQEQLKDSQAALLLEQE+SDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLCRKC
Sbjct: 4681 LILQEQLKDSQAALLLEQEKSDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLCRKC 4740

Query: 4741 SAAVSK 4746
            S+AVSK
Sbjct: 4741 SSAVSK 4745

BLAST of Moc09g05800 vs. TAIR 10
Match: AT5G23110.1 (Zinc finger, C3HC4 type (RING finger) family protein )

HSP 1 Score: 5857.3 bits (15194), Expect = 0.0e+00
Identity = 2898/4748 (61.04%), Positives = 3646/4748 (76.79%), Query Frame = 0

Query: 8    LDSIFLEDFGQKVDLTRRIREVLLNYPEGTTVLKELVQNADDAGATKVCLCLDRRVHGSE 67
            +DS+ LEDFGQKVDLTRRIREVLLNYPEGTTVLKEL+QNADDAGATKV LCLDRRVHGS 
Sbjct: 1    MDSLLLEDFGQKVDLTRRIREVLLNYPEGTTVLKELIQNADDAGATKVRLCLDRRVHGSG 60

Query: 68   SLLSESLAPFQGPALLAYNNAVFTEEDFVSISRIGGSNKHGQAWKTGRFGVGFNSVYHLT 127
            SLLS+SLA +QGP+LLAYN+AVFTEEDFVSISRIGGS KHGQAWKTGRFGVGFNSVYHLT
Sbjct: 61   SLLSDSLAQWQGPSLLAYNDAVFTEEDFVSISRIGGSGKHGQAWKTGRFGVGFNSVYHLT 120

Query: 128  ELPSFVSGKYVVMFDPQGIYLPKVSAANPGKRIDFIRCSAISQYRDQFLPYCAFDCDMES 187
            ++PSFVSGKYVV+FDPQG YLP +SAANPGKRID++  SA+SQY+DQFLPYCAF CDM S
Sbjct: 121  DIPSFVSGKYVVLFDPQGAYLPNISAANPGKRIDYVGSSALSQYKDQFLPYCAFGCDMRS 180

Query: 188  SFDGTLFRLPLRNADQAARSKISRQAYTEEDISSMFAELYEEGVLTLLFLKSVSCIEMFV 247
             F+GTLFR PLRN +QAA S++SRQAY E+DIS MF +L+EEGV +LLFLK V  IEM+ 
Sbjct: 181  PFNGTLFRFPLRNTEQAASSRLSRQAYFEDDISLMFDQLFEEGVFSLLFLKCVLSIEMYT 240

Query: 248  WNDGEAEPQKLYSFSVRSASSDIIWHRQMLLRLSKSTTSTQSEVDSFSLDFLSQATTGTQ 307
            W+DG++EP+KLYS SV S ++D +WHRQ +LRLSK++ S   E+D+F+L+FLS++  G Q
Sbjct: 241  WDDGDSEPKKLYSCSVSSPNNDTVWHRQAVLRLSKTSISGDREMDAFTLEFLSESEKGNQ 300

Query: 308  IEERIDSFFIVQTMASATSRISSFAATASKEYDIHLLPWASLAVCTSDDSSKTNVLKLGR 367
             + R D F+IVQTMASA+S+I  FAATASKEYDIHLLPWAS+A C SDDSS+ N+LKLG 
Sbjct: 301  TKRRTDRFYIVQTMASASSKIGLFAATASKEYDIHLLPWASVAACISDDSSENNILKLGH 360

Query: 368  AFCFLPLPVKTGLNVQVNGFFEVSSNRRGIWYGADMDRSGKIRSIWNRLLLEDIIAPSFI 427
            AFCFLPLPV+TGL VQVNG+FEVSSNRRGIWYG DMDRSGK+RS WNRLLLED++APSF 
Sbjct: 361  AFCFLPLPVRTGLTVQVNGYFEVSSNRRGIWYGEDMDRSGKVRSAWNRLLLEDVVAPSFA 420

Query: 428  ELLIGVQVFLGPTDTYFSLWPSGSFEEPWNILVEQVYKNISNALVLYSNVEGGKWVSPIQ 487
             LL+ ++  L   D+YFSLWPSGSFE PW+ILVEQ+YKNI NA VL+S+++GGKWVSP  
Sbjct: 421  RLLLCLREVLDSRDSYFSLWPSGSFEAPWSILVEQIYKNICNAPVLFSDLDGGKWVSPAD 480

Query: 488  AFLHDDKFTRSKELGEALVLLGMPIVHLPENLSNMLLKFCRAFQQKVVTPCTVRHFLREC 547
            A+LHD++F+ SK+LG+AL+ L MPIV LP  + +MLLK       KVVTP  VR+FL+EC
Sbjct: 481  AYLHDEEFSGSKDLGDALLQLEMPIVCLPRLVFDMLLKHPSFLLPKVVTPDRVRNFLKEC 540

Query: 548  KHVATLNRPYKLVLLEYCIEDLIDADVCAQAFDLPLIPLANGDFGLFSEASKGISYFICD 607
            K ++ L +  KLVLLEYC++DL D  VC QA +L L+PLANGDFG FS  +  +SYFICD
Sbjct: 541  KTLSALKKSLKLVLLEYCLDDLTDDSVCTQASNLKLLPLANGDFGFFSGRTGSVSYFICD 600

Query: 608  ELEYTLLHQISDRVIDRNIPLNISTRLSNIARSSKSNIFIFNVHYFLQLFPKFVPADWKY 667
            ELE+ LL ++ DRVID+NIP  + TRL  IA S  +N+ IF++H  LQLFP+ VPA+WK+
Sbjct: 601  ELEHMLLQKVYDRVIDKNIPPPLYTRLFAIAESRTANVAIFSIHNLLQLFPRLVPAEWKH 660

Query: 668  KNEVLWDPESCSNHPTSSWFSLFWRYLHDRCEKLSLFSDWPILPSKSRYLYRATKKSKLI 727
            ++++ W PES  +HP+SSWF LFW+YL  RC+ LSLF DWPILPS S YLY A+ +SKLI
Sbjct: 661  RSKISWHPESNRDHPSSSWFVLFWQYLDKRCQSLSLFCDWPILPSTSGYLYIASPQSKLI 720

Query: 728  NVQMLSNEMQKILSKLGCKLLDPYYKVEHRDLIHYVNDGNCTGVLDSIYDAISSTGGLML 787
            N + L   ++ +L K+G K+L+   KVEH DL  +V+D + TGVL+SI+DA SS    + 
Sbjct: 721  NAEKLPAAVRNVLEKIGGKILNNNIKVEHSDLSSFVSDASYTGVLESIFDAASSDLDGVQ 780

Query: 788  TSLYSLEVEEKDGLRRFLLDPKWYLGGCMNDSDLEKCKRLPIYKVYNGGSAQDFGFSDLE 847
              +Y L  +EKD LR FLLDPKW++G  + D  L  CK LPI+++Y   SAQ+  +SDL 
Sbjct: 781  NLIYDLNAQEKDELRSFLLDPKWHIGHQIGDLYLRICKILPIHRIYGETSAQESKYSDLV 840

Query: 848  NPQKYLPPSDVGEFFLGVEFIVSSFDSEVEILLKYYGIKKMGKASFYRKHVLNQVEQLQP 907
            NP K+LPP DV    LG EFI+    SE ++L +YYGI++M K++FYR++V N++E LQP
Sbjct: 841  NPPKHLPPLDVPACLLGCEFILCCQGSEEDVLSRYYGIERMRKSNFYRQNVFNRIEVLQP 900

Query: 908  ELRDSTMLSVLKNLPQLCVEDVAFRECLSNLAFVPTSSGTLKCPTVLYDPRYEELCALLD 967
            E+RD  M+S+L++LPQLC+ED   RE L NL FVPT +G LK P+VL+DPR EEL ALL+
Sbjct: 901  EIRDQVMISILQDLPQLCLEDRLLREELQNLEFVPTVNGPLKRPSVLHDPRNEELYALLE 960

Query: 968  DFDSFPSTPFNESYILDILQGLGLRTCVAPETIVQSAQHVERLMHKDHNKAHSRGKVLLS 1027
            D D FP++ F  S ILD+LQGLGL+T V+PETI++SA+ VERLMHKD  KAHSRGKVL S
Sbjct: 961  DSDCFPASGFQGSAILDMLQGLGLKTTVSPETILESARLVERLMHKDLEKAHSRGKVLFS 1020

Query: 1028 YLEVNAIKWLLNPMNEDQGMVNRLFSTAATAFRPRNFNSDLEKFWNDLCKISWCPVLLSP 1087
            +LEVNA+KWL +  +ED G +NR+FS AATAFRPRN   +L KFW++L  I WCPVL+S 
Sbjct: 1021 FLEVNAVKWLPDQSSEDDGAINRIFSRAATAFRPRNLTCNLVKFWSELKMICWCPVLVSA 1080

Query: 1088 PFETLPWPVVSSMVAPPKLVRLPKDLWLVSASMRILDGECSSSALAHSLGWSSPPGGSII 1147
            PF+TLPWPVV+S VAPPKLVR   D+WLVSASMRILDGECSS+ALA++LGW S PGGS I
Sbjct: 1081 PFQTLPWPVVTSTVAPPKLVRPKTDMWLVSASMRILDGECSSTALAYNLGWLSHPGGSAI 1140

Query: 1148 AAQLLELGKNNEIVYDQVLRKELALAMPRIYALLTGLIGSDEMDVVKAVLEGCRWIWVGD 1207
            AAQLLELGKNNEI+ DQVLR+ELALAMP+IY++L  L+GSDEMD+VKAVLEG RWIWVGD
Sbjct: 1141 AAQLLELGKNNEILIDQVLRQELALAMPKIYSILARLLGSDEMDIVKAVLEGSRWIWVGD 1200

Query: 1208 GFATSEEVVLDGPLHLAPYIRVIPIDLAVFKDLFLELGIREFLKPNDYADILSRMAIKKG 1267
            GFAT  EVVLDGPL L PYIRVIP DLAVF+ LF+ELG+REFL P+DYAD+L R+A++KG
Sbjct: 1201 GFATLSEVVLDGPLQLVPYIRVIPTDLAVFRGLFVELGVREFLTPSDYADVLCRIAVRKG 1260

Query: 1268 SSPLNAQEVRAAILIVQHLAEAQLPKQQINIYLPDISGRLLPASNLVYNDAPWLLGTDDT 1327
            +SPL+ QE+RAA+LI Q LAEAQ    ++ IYLPD+SGRL P+S+LVYNDAPWL  +D+ 
Sbjct: 1261 TSPLDPQEIRAAVLIAQQLAEAQF-LDKVTIYLPDVSGRLFPSSDLVYNDAPWLTASDNL 1320

Query: 1328 DVPFDGESTVVLNARKTVQKFVHGNISNDVAEKLGVCSLRRILLAESADSMNLSLSGAAE 1387
            +  F  EST++LNA++T+QKFVHGNISN+VAEKLGV SLRR+LLAESADSMN SLSGAAE
Sbjct: 1321 NSSFSAESTMLLNAKRTMQKFVHGNISNEVAEKLGVRSLRRVLLAESADSMNFSLSGAAE 1380

Query: 1388 AFGQHEALTNRLRHILEMYADGPGILFELIQNAEDAGASEVVFLLDKTHYGTSSILSPEM 1447
            AFGQHEALT RL+HILEMYADGPGILFEL+QNAEDAGASEV FLLDKTHYGTSS+LSPEM
Sbjct: 1381 AFGQHEALTTRLKHILEMYADGPGILFELVQNAEDAGASEVTFLLDKTHYGTSSLLSPEM 1440

Query: 1448 ADWQGPALYCYNDSVFSSQDLYAISRVGQESKLQKPLSIGRFGLGFNCVYHFTDIPTFVS 1507
            ADWQGPALYC+N+SVF+ QD+YAISR+GQ SKL+KP +IGRFGLGFNCVYHFTDIP FVS
Sbjct: 1441 ADWQGPALYCFNNSVFTQQDMYAISRIGQASKLEKPFAIGRFGLGFNCVYHFTDIPGFVS 1500

Query: 1508 GENIVMFDPHACNLPGISPSHPGLRIKYAGRRILEQFPDQFSPYLHFGCDMQKPFPGTLF 1567
            GENIVMFDPHA +LPGISP+HPGLRIK+AGR IL+QFPDQF+P+LHFGCD++  FPGTLF
Sbjct: 1501 GENIVMFDPHANHLPGISPTHPGLRIKFAGRYILDQFPDQFAPFLHFGCDLEHTFPGTLF 1560

Query: 1568 RFPLRSPALASRSEIKKEGYAPEDVTSLFYSFSEVASDALLFLTNVKKISIFIKDDIEHD 1627
            RFPLR+ ++A RS IKKE YAPEDV SLF SFS V S+AL+FL NVK +SIF K+   H+
Sbjct: 1561 RFPLRNASVAPRSHIKKETYAPEDVLSLFTSFSGVVSEALIFLRNVKTVSIFTKEGAGHE 1620

Query: 1628 MQCLYRVHKNTVSEPSTESSAKQDIISFIYGNRQGEMDREQFLMKLSKSINRDLPYKCQK 1687
            MQ L+RV K+      TE      + S +  N    M+++Q L KLS ++ +DLPYKCQK
Sbjct: 1621 MQLLHRVCKDHNVGQDTEPKPSSQVFSLLDENIFAGMNKDQLLKKLSNTVVKDLPYKCQK 1680

Query: 1688 LIITEKSSSGDILQHYWITSGCLGGGLPRNNSGLGDKSYNFIPWACVAALLHSVQVDGEM 1747
            +++TE+ SSG IL H WIT  CL  G+ + N  L + S+  IPWA VA  ++SV+     
Sbjct: 1681 IVVTEQDSSGCIL-HGWITGECLNAGVSKKNLNLPEMSHKLIPWASVAVHINSVK----- 1740

Query: 1748 NYDPETENNWLVASDLVQVSSASIEGRKPFEGRAFCFLPLPVRTGLPVHVNAYFELSSNR 1807
              +   E+     S++   S+ SI+ R+ F GRAFCFLPLP+ TGLP H+NAYFELSSNR
Sbjct: 1741 --NENVEDLAASISNIFGPSTISIQNRRNFGGRAFCFLPLPITTGLPAHINAYFELSSNR 1800

Query: 1808 RDIWYGDDMAGGGKKRSEWNSYLLEDVVAPAYGRLLEKIVSEIGHSGLFSSFWPTTAGLE 1867
            RD+W+G+DMAG GK RS+WN YL+E+VV PAYG LLEKI SE+G   LF S WP T G E
Sbjct: 1801 RDLWFGNDMAGDGKVRSDWNLYLIEEVVVPAYGHLLEKIASELGPCDLFFSVWPVTLGTE 1860

Query: 1868 PWGSVVRKLYSFIGDFGLLVLYTNARGGQWISTKQAIFPDFSFDKVYELIEALADSGLPV 1927
            PW S+VRKLYSFI + GL VLYT ARGGQWISTKQAI+PDFSF K  EL++ LAD+GLPV
Sbjct: 1861 PWASLVRKLYSFIANNGLRVLYTKARGGQWISTKQAIYPDFSFPKADELVDVLADAGLPV 1920

Query: 1928 IAISKSIVDRFMEVRPSLHFLTPHLLRTLLIKRKRAFKDRKATILTLEYCLVDLKLPFQS 1987
            I ISK++ +RF E   SLH +TP LLRTLL +RKR F+DR    L LEYCL+DLK+PF +
Sbjct: 1921 INISKTVAERFGEACSSLHLMTPQLLRTLLTRRKREFRDRNGLALALEYCLLDLKVPFLA 1980

Query: 1988 DSLCGLPLLPLADGSFTSFHKNGMGERTYIARGDEYGLLKDSVPGQLVDPGIPEVVHAKL 2047
            D L GLPLLPLADGSFT+F+KNG  ER + A    Y LLKDS+P QLVD  +PE V++KL
Sbjct: 1981 DLLYGLPLLPLADGSFTTFNKNGTAERIFFAEEIGYELLKDSLPHQLVDREVPEGVYSKL 2040

Query: 2048 CEVAQTEDLNICFLSCQLLEKLFLRFLPAEWQNARQVNWNPGHQGQPSLEWIRLIWCYLK 2107
              VAQ+ +  IC LSC LLEKLF + LPA+W  + ++ W PG +G P++EWIR++W YLK
Sbjct: 2041 LAVAQSGESCICLLSCNLLEKLFFKLLPADWHLSEKILWTPGQRGHPTVEWIRVLWSYLK 2100

Query: 2108 SHCDDLSQFSKWPILPVGQNSLLQLVENSNVLKADGWSENMFSLLLKVGCLFLRRDMPIE 2167
              CDDLS FSKWPILPV    L+QL+ NSNV++ DGWSENM SLLLK GC FL R++P+E
Sbjct: 2101 LSCDDLSVFSKWPILPVEDGCLMQLILNSNVIRDDGWSENMSSLLLKCGCRFLNRELPVE 2160

Query: 2168 HPQLENYVHPSTAIGILNAFLSIAGDIENVEGLFRDASESELHELRSFILQSKWYLEGKM 2227
            HPQLE +V P TA GILNA L+I+G  EN++G+F + SE ELHELR+FILQSKW+  G M
Sbjct: 2161 HPQLETFVQPPTATGILNALLAISGGHENIKGIFLNVSEGELHELRNFILQSKWFSGGHM 2220

Query: 2228 EAIHVDIIKCIPMFESYKCRKLVSLSKPIRWIKPTGLCEDFLNDDFVRMESEKERIILKR 2287
              +H + IK +P+FESY+ RKLVSL+ P++W+KP G+ ED L+DDFVR++SE+ER I KR
Sbjct: 2221 NEVHFETIKHLPIFESYRSRKLVSLNCPVKWLKPDGIREDLLDDDFVRLDSERERTIFKR 2280

Query: 2288 YFGIGEPSRVEFYKDYVLNHMSEFLSERGALSTILHDVKLLIEEDVSLKSSVSMIPFVLA 2347
            Y  I EPS++EFYK  VLN MSEFLS++ AL  ILHD+  L+  DVSL+ ++S  PFVLA
Sbjct: 2281 YLQIKEPSKMEFYKACVLNRMSEFLSQQEALLAILHDLNDLVVADVSLQCAISTTPFVLA 2340

Query: 2348 SNGSWQPPSRLYDPRVLELNNMLHEETFFPSENFSDDDILDALVSLGLRRSLDLTGLLDC 2407
            +NG WQ PSRLYDPRV  L  +LH+E +FPSE FSD  ILDALV LGLR +LD +  LD 
Sbjct: 2341 ANGLWQQPSRLYDPRVPALQELLHKEVYFPSEKFSDSKILDALVGLGLRTTLDCSTYLDA 2400

Query: 2408 ARSVSLLNDSKNSESQSYARRLFVCLDALAHKLSIKV-EGSGYELQNSMLIKSNYVDDDA 2467
            ARSVS+L+D  + E+  Y RRL   +  L+ KLS K  E +  E QN M I S       
Sbjct: 2401 ARSVSMLHDLGDLEASRYGRRLLFHIKTLSIKLSSKTGEANHDESQNIMSITSE------ 2460

Query: 2468 SMEVGSLNIEDTSDMGTD-SLIGNLTGDESEEEFWSEMNTIAWCPICADSPLKVLPWLKT 2527
                 S + E   +  T+ S +G+L   +SE+EFW ++ +I WCPIC D P++ +PWL++
Sbjct: 2461 ----DSFDGETYPEYETETSYLGSLLTQQSEDEFWCQLRSIPWCPICLDPPIEGIPWLES 2520

Query: 2528 NNQVAPPSIVRPKSQMWMVSSSMHILDGVPPSEYLQHKLGWTDCPRVEVLCAQLTDISKL 2587
            +N VA P  VRPKSQM++VS++MH+LDG   S YL  KLGW DC  +++LC QL +ISK 
Sbjct: 2521 SNLVASPDRVRPKSQMFLVSATMHLLDGECQSSYLHQKLGWMDCLTIDILCRQLIEISKS 2580

Query: 2588 YGELRLHSSLEPDINTALQEGIPILYSKLQEYIGTDESVLLKSALNGVSWVWVGDDFVPP 2647
            Y E +  SS+ P+  + LQ  IP+LY++LQE    ++ + LKSAL+GV WVW+GDDFV  
Sbjct: 2581 YKEQKSRSSVNPEFESMLQSQIPLLYTRLQELSRENDFLALKSALSGVPWVWLGDDFVSA 2640

Query: 2648 SALAFDSPVKFSPYLYVVPSELSEFRDLLSELGVRLSFNVEGYLDVLQRLHSDVEGSPLS 2707
              L+FDSPVKF+PYLYVVPSELS+F++LL ELGVRLSF+   Y++ LQ L +D++GS L+
Sbjct: 2641 DVLSFDSPVKFTPYLYVVPSELSDFKELLLELGVRLSFDAADYMNTLQHLQNDIKGSQLT 2700

Query: 2708 TDQMDFVICMLEAISDCCVDKPEFTATSTSLLIPNSSQVLMQANDLVYNDAPWMEDNNIL 2767
             +Q++FV+C+LEA++D C  +    + + S+L+P+S+  L+   DLVYNDAPW+ D++ L
Sbjct: 2701 DEQINFVLCVLEAVAD-CFSEVSSDSDNNSVLVPDSAGFLVPLEDLVYNDAPWV-DSSSL 2760

Query: 2768 VGKHFVHPSISNDLASRLGVQSIRCLSLVDEEMTKDLPCMDYAKISELLMLYGN-DYLFF 2827
             GK FVHPSI++D+A+RLG+QS+RC+SLVD ++T+DLPCMD+ K+ ELL LY + D+L F
Sbjct: 2761 SGKRFVHPSINSDMANRLGIQSLRCISLVDNDITQDLPCMDFTKLKELLSLYASKDFLLF 2820

Query: 2828 DLLELADCCRAKKLRLIFDKREHPRQSLLQHNLGEFQGPALVAIFEGSSLNTEEISSLQF 2887
            DLLELADCC+ KKL +IFDKREHPR++LLQHNLGEFQGPA+VAI EG +L  EEI SLQ 
Sbjct: 2821 DLLELADCCKVKKLHIIFDKREHPRKTLLQHNLGEFQGPAIVAILEGVTLTREEICSLQL 2880

Query: 2888 RPPWKLRGDTLNYGLGLLSCYYVCDLLSIVSGGYFYIFDPRGIALSVAPKSAPGAKVFSL 2947
               W+++G+TLNYGLGLLSCY++CDLLSIVSGGYFY+FDP+G  LS +   AP  K+FSL
Sbjct: 2881 LSQWRIKGETLNYGLGLLSCYFMCDLLSIVSGGYFYMFDPQGATLSASTTQAPAGKMFSL 2940

Query: 2948 IGSNLIEKFNDQFHPMLGGQNMSWP-SDSTIVRMPLSPACLKDGLEPGIRKIKEISSKFL 3007
            IG+NL+E+F+DQF+PML GQ+ +W  +DSTI+RMPLS   LKDG E G+ ++K+IS +FL
Sbjct: 2941 IGTNLVERFSDQFNPMLIGQDKAWSLTDSTIIRMPLSTEILKDGFEAGLDRVKQISDQFL 3000

Query: 3008 DHASRSLLFLKSVVQVSFSTWDQGGLHPYQDYSVCINLSSAIARNPFSEKKWKKFQLSRL 3067
            ++ASR L+FLKSV QVSFSTW+QG   P+QDY++ I+ +SAI RNPF+EK  K  +LSR+
Sbjct: 3001 ENASRILIFLKSVSQVSFSTWEQGNAQPHQDYTLHIDSASAIMRNPFAEKNLKTSKLSRI 3060

Query: 3068 FSSSNAATKLHTIDIIVFQGETQFVDRWLVVLSLGSGQTRNMALDRRYLAYNLTPVAGVA 3127
            F SSN+  K   I++ +  GE + +DRWLVVLS GSGQ++NMA  R+YLAYNLTPVAGVA
Sbjct: 3061 FGSSNSGVKSRIIEVNLHIGENKLLDRWLVVLSKGSGQSQNMARGRKYLAYNLTPVAGVA 3120

Query: 3128 AHISRNGLPADICQKSPLMAPCPLSGDITLPVTVLGCFLVCHSGGRYLFKNQVLEAVAAP 3187
            AH+SRNG P D+   SP+M+P PLSG + LPVT+LGCFL+ ++ GR+LFKN+   A++ P
Sbjct: 3121 AHVSRNGRPVDVHAASPIMSPLPLSGSVNLPVTILGCFLIRNNCGRFLFKNKNERAMSEP 3180

Query: 3188 -LDAGNKLVEAWNRELMSCVCDSYIYMILEIHKQRKESSSSALESNVSHSISSSLKAYGN 3247
             LDAG+ L++AWN+ELMSCV DSYI +++E+ +  +E SSS+ ES+ +  ++ SLKAYG+
Sbjct: 3181 QLDAGDILIDAWNKELMSCVRDSYIEIVVEMERLSREHSSSSTESSTARQLALSLKAYGH 3240

Query: 3248 QVYSFWPRSEPANGSDSDLD-RGLKADWECLVEQVIRPFYTRAIDLPVWQLYSGNLVKAE 3307
            Q+YSFWPRS   N  D  ++   LK +WECLVEQVIRPFY R  DLP+WQLYSG+LVKAE
Sbjct: 3241 QLYSFWPRS---NQHDDAIEAEVLKPEWECLVEQVIRPFYARVADLPLWQLYSGSLVKAE 3300

Query: 3308 EGMFLAQPGSPVGGNLLPATVCGFVKEHYPVFSVPWELIKEIQAVGITVRQIRPKMVRDL 3367
            EGMFL QPGS V  NLLP TVC FVKEHYPVFSVPWEL+ E+QAVGI VR+++PKMVR L
Sbjct: 3301 EGMFLTQPGSEVAVNLLPLTVCSFVKEHYPVFSVPWELLAEVQAVGIPVREVKPKMVRVL 3360

Query: 3368 LRVSSASIVLQSIDTYLDVLEYCLSDILLAALSNHAEDSMGADSVNTNPGGRSTNTSEGS 3427
            LR SSASI L+S+DT++DVLEYCLSDI      N  E                 N  EG+
Sbjct: 3361 LRKSSASIDLRSVDTFIDVLEYCLSDIQFIEALNPEE----------------ANMDEGN 3420

Query: 3428 STSVSVSSMNSFARLSNQNAASSGDALEMMTSLGRALLDFGRGVVEDIGRSGDSLSHSNT 3487
            STS S S       +S Q  A S DA EMMTSLG+AL DFGR VVEDIGR+GDS+     
Sbjct: 3421 STSTSSS-------MSTQAQAGSSDAFEMMTSLGKALFDFGRVVVEDIGRTGDSIGQR-- 3480

Query: 3488 FTGRNNSSYRNVDQNFLQMVSEIKGLPFPSASNNLVRLGSMELWLGSKDQQELMIPLAAR 3547
                +N+ Y N D  FL  V+E+KGLP P+A+N+L RLG  ELWLG+K+QQ LM+P++AR
Sbjct: 3481 ---ISNNRYSNADPRFLSAVNELKGLPCPTATNHLARLGISELWLGNKEQQALMLPVSAR 3540

Query: 3548 FVHPKVFDRSILGNILTNDALHKFLKLQKFSLSLLATNMRSVFHANWVNHVMNSNMSPWF 3607
            F+HPKVF+RS L +I    ++  FLKL+ +SL LLA+NM+ +FH +WV+++  SN  PWF
Sbjct: 3541 FIHPKVFERSSLADIFLKSSVQAFLKLRSWSLPLLASNMKYLFHDHWVSYISESNSVPWF 3600

Query: 3608 SWENKSCSGVEEGPSSEWIRLFWKNTGDSSQDLLLFSDWPLVPAFLGRPILCRVRERHLV 3667
            SWE+ S S  + GPS EWI+LFWKN   S+ +L LFSDWPL+PAFLGRPILCRVRERHL+
Sbjct: 3601 SWESTSSSSDDSGPSPEWIQLFWKNFNGSADELSLFSDWPLIPAFLGRPILCRVRERHLI 3660

Query: 3668 FLPPVTYPVLPNAILEIGAGGSDVAETSTSVISKPESIQPYTSAFQKFQDTYPWLFPLLN 3727
            F PP     +  +  ++    SD++ TS S     E  Q Y S F   Q  +PWL  LLN
Sbjct: 3661 FFPPPALQPVSRSGTDMHQTDSDISTTSVSGGPLSELTQRYVSGFDLAQSKHPWLILLLN 3720

Query: 3728 HCNIPIFDVAFMDCASLCNCLSNSGQSLGQIIASMFVAANNAGYFPELASLSDSNSNELL 3787
             CNIP+ D A++DCA  C CL +   SLGQ IAS       AGY  ++AS      +EL 
Sbjct: 3721 QCNIPVCDTAYIDCAERCKCLPSPSVSLGQAIASKLAEGKRAGYIADIASFPTFGRDELF 3780

Query: 3788 NLFANDFVSNGTNYGRDELEILRKLPIYRTVVGSYTQLRDNDQCMISSNSFLKPYNDCCL 3847
             L ANDF S+G++Y   ELE+L  LPI++TV GSYT L+ +  C+IS +SFLKPY++CC 
Sbjct: 3781 TLLANDFSSSGSSYQAYELEVLSSLPIFKTVTGSYTHLQRHGLCIISGDSFLKPYDECCF 3840

Query: 3848 SYSSNSMEYSLLRALRVPELDNQQILIRFGLPAFDCKPQSEQEDILIYLFTNWQDLQADA 3907
             Y  +S+E   L+AL V  L N Q L+RFGL  F+ + QSE+EDILIY++ NW DL+ D+
Sbjct: 3841 CYLPDSVECHFLQALGVTVLHNHQTLVRFGLAEFESRSQSEREDILIYVYGNWLDLEVDS 3900

Query: 3908 HLVECLSETNFVRSADEFCTDLFKSKELYDPSDALLTSVFSGERKKFPGERFAADGWLRI 3967
             ++E L E  FVR++DEF ++L KSK+L+DPSD LL SVF GERK+FPGERF+++GWLRI
Sbjct: 3901 DVIEALREAKFVRNSDEFSSELSKSKDLFDPSDTLLVSVFFGERKRFPGERFSSEGWLRI 3960

Query: 3968 LRKIGLRTTTEANVILECAKKVETLGSEW-RKSEEDGSEFDLINGQNEVPMEVWTLAGSV 4027
            LRK GLRT  EA+VILECAK+VE LG+E  R SEED  E DL++ + ++ +E+ TLAGSV
Sbjct: 3961 LRKAGLRTAAEADVILECAKRVEFLGNERNRSSEEDDFETDLVHSEKDISVELSTLAGSV 4020

Query: 4028 VEAVFSNFAVFYSNNFCNALGNIAFVPADLGFPNLGGNKGGKRVLTSYGDGIVSKDWPLA 4087
            +EA+  NFA FYS  FCN LG IA VPA+ GFP+LGG KGGKRVLT Y + ++ +DWPLA
Sbjct: 4021 IEAILLNFAGFYSTAFCNTLGQIACVPAESGFPSLGGRKGGKRVLTRYSEAVLLRDWPLA 4080

Query: 4088 WSCAPILSKHSVIPPDYSWGALNLRSPPAFPTVLKHLQVIGRNGGEDTLAHWPISMGVMS 4147
            WS  PILS    IPP +SW AL L+SPP F TVLKHLQVIGRNGGEDTLAHWP    VM+
Sbjct: 4081 WSSVPILSTQRFIPPGFSWTALRLKSPPIFSTVLKHLQVIGRNGGEDTLAHWPNDPNVMT 4140

Query: 4148 INEASCEVLKYLERIWSTLSSLDVLELQRVAFIPVANATRLVKANALFARLTINLSPFAF 4207
            I+  SCEVLKYLE +W +L++ D+LELQ+VAF+P AN TRLV A++LF RL INLSPFAF
Sbjct: 4141 IDVTSCEVLKYLEIVWDSLTTSDILELQKVAFLPAANGTRLVGASSLFVRLPINLSPFAF 4200

Query: 4208 ELPSGYLPFVKILKDLGLQDVLSVASAKDLLSSLQVACGYQRLNPNELRSVMEILHYICD 4267
            ELPS YLPF+ ILKDLGL DVLSVA+AKD+LS LQ  CGY+RLNPNELR+VMEILH++CD
Sbjct: 4201 ELPSLYLPFLNILKDLGLNDVLSVAAAKDILSKLQKLCGYRRLNPNELRAVMEILHFLCD 4260

Query: 4268 EAMEAK--MFDGREPEIIVPDDGCRLVHATSCAYIDTYGSRYIKCIDTSRLRFVHSDLPE 4327
            E    K    +  + ++IVPDDGCRLVHA SC Y+D++GSRY++ IDT+RLR VH  LPE
Sbjct: 4261 EINTTKPPEINTIKSDVIVPDDGCRLVHALSCVYVDSFGSRYVRYIDTARLRLVHPLLPE 4320

Query: 4328 RICRMLGIKKLSDLVIEELDHEDSIEPLERIGAVSLEFIRKKLLSRSFQNAVWNVVNSMV 4387
            RIC  LG++KLSD+VIEEL++ + IE L+ IG++SL+ +R+KL S +FQ A+W V     
Sbjct: 4321 RICLDLGVRKLSDVVIEELENAEHIETLDNIGSISLKAVRRKLQSETFQAALWTVSRQAT 4380

Query: 4388 NYIHANKNLDLKAVEKLLKSIAERLQFVKSLHTRFLLLPNSIDITRPAKDSIIPEWKDGI 4447
                   +L  + ++  L+S AE++ FV++++TRFLLLPNS+D+T  AK+S+IPEW++  
Sbjct: 4381 TV----DDLSFEVMQHSLQSAAEKIGFVRNIYTRFLLLPNSVDVTFVAKESMIPEWENES 4440

Query: 4448 HHRALYFVNHSKTCILVAEPPAYISIFDVIAIVVSQILGSPIPLPVGSLLFCPEGTEIAI 4507
            HHR +YF+N  +T ILV+EPP YIS  DV+A VVS++LG P  LP+GSL  CPEG+E  I
Sbjct: 4441 HHRTMYFINRHRTSILVSEPPGYISFLDVMATVVSEVLGFPTSLPIGSLFSCPEGSETEI 4500

Query: 4508 INILKLCSEKENEQFTGISSLLGKEILPQDALQLQLHPLRPFYAAEVVAWRSQSGEKLKY 4567
               L+LCS       T  SS +G+EI+PQDA+Q+QLHPLRPFY  E+VAW+ + G+KL+Y
Sbjct: 4501 TAYLRLCSYSLTNTGTADSS-VGQEIMPQDAVQVQLHPLRPFYKGEIVAWKIKQGDKLRY 4560

Query: 4568 GRVPEDVRPSAGQALYKFRVETAPGITQSLISSQVLSFRSISIDGSHSSTNLQDSGHMII 4627
            GRVPEDVRPSAGQALY+ +VE  PG T  L+SSQV SFR  SI+    ST L +    + 
Sbjct: 4561 GRVPEDVRPSAGQALYRLKVEMTPGETGLLLSSQVFSFRGTSIENEGPST-LPEVLPAVS 4620

Query: 4628 DSGASVEMPENSERGKI-RSQPVAELQYGRVSAEELVQAVHEMLSTAGINVDIERQSLLQ 4687
            D   S E+ E+S   K   SQPV E+Q GRV+A+ELV+AVHEMLS AGIN+++E QSLLQ
Sbjct: 4621 DK-KSQEISESSRTNKTSSSQPVNEMQLGRVTAKELVEAVHEMLSAAGINMELENQSLLQ 4680

Query: 4688 KTVVLQEQLKDSQAALLLEQERSDAAAKEADTAKAAWLCRVCLTSEVEITIVPCGHVLCR 4746
            +T+ LQE+LKDS+ A LLEQER++A+ KEA+TAK+ WLC++C T EVE+TIVPCGHVLCR
Sbjct: 4681 RTLTLQEELKDSKVAFLLEQERAEASMKEAETAKSQWLCQICQTKEVEVTIVPCGHVLCR 4689

BLAST of Moc09g05800 vs. TAIR 10
Match: AT5G23120.1 (photosystem II stability/assembly factor, chloroplast (HCF136) )

HSP 1 Score: 628.2 bits (1619), Expect = 5.7e-179
Identity = 306/386 (79.27%), Positives = 348/386 (90.16%), Query Frame = 0

Query: 4799 LFAPSLSQPQPQPQPRTFTTSTPRASLQNSSINRRQFVAETAAAVSLSLSPLIAPVQPAK 4858
            L   + S P P P P + ++S         S +RR+ + + +AAVSLSLS ++    PA+
Sbjct: 30   LIPKASSSPPPSPSPSSSSSSL--------SFSRRELLYQ-SAAVSLSLSSIVG---PAR 89

Query: 4859 SEEALSEWERLFLPIDPGVVLLDIAFVPDDLDHGFLLGTRQTILETKDGGRTWAPRTIPS 4918
            ++E LSEWER+FLPIDPGVVLLDIAFVPD+   GFLLGTRQT+LETKDGG TW PR+IPS
Sbjct: 90   ADEQLSEWERVFLPIDPGVVLLDIAFVPDEPSRGFLLGTRQTLLETKDGGSTWNPRSIPS 149

Query: 4919 AEEEDFNYRFNSISFKGKEGWIVGKPAILLYTSDAGESWERIPLSAQLPGDMVYIKATGE 4978
            AEEEDFNYRFNSISFKGKEGWI+GKPAILLYT+DAGE+W+RIPLS+QLPGDMV+IKAT +
Sbjct: 150  AEEEDFNYRFNSISFKGKEGWIIGKPAILLYTADAGENWDRIPLSSQLPGDMVFIKATED 209

Query: 4979 KSAEMVTDEGAIYVTSNKGYNWKAAVQETVSATLNRTVSSGISGASYYTGTFNTVNRSPD 5038
            KSAEMVTDEGAIYVTSN+GYNWKAA+QETVSATLNRTVSSGISGASYYTGTF+ VNRSPD
Sbjct: 210  KSAEMVTDEGAIYVTSNRGYNWKAAIQETVSATLNRTVSSGISGASYYTGTFSAVNRSPD 269

Query: 5039 GRYVAVSSRGNFYLTWEPGQPFWQPHNRAIARRIQNMGWRADGGLWLLVRGGGLFLSKGT 5098
            GRYVAVSSRGNF+LTWEPGQP+WQPHNRA+ARRIQNMGWRADGGLWLLVRGGGL+LSKGT
Sbjct: 270  GRYVAVSSRGNFFLTWEPGQPYWQPHNRAVARRIQNMGWRADGGLWLLVRGGGLYLSKGT 329

Query: 5099 GISEEFEEVPVQSRGFGILDVGYRSTEEAWAAGGSGILLKTTNGGRTWSRDKAADNIAAN 5158
            GI+EEFEEVPVQSRGFGILDVGYRS EEAWAAGGSGILL+T NGG++W+RDKAADNIAAN
Sbjct: 330  GITEEFEEVPVQSRGFGILDVGYRSEEEAWAAGGSGILLRTRNGGKSWNRDKAADNIAAN 389

Query: 5159 LYSVKFINDKKGFVLGNDGVLLQYLG 5185
            LY+VKF++DKKGFVLGNDGVLL+Y+G
Sbjct: 390  LYAVKFVDDKKGFVLGNDGVLLRYVG 403

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022155038.1 | 0.0e+00 | 100.00 | sacsin isoform X1 [Momordica charantia]
XP_022155046.1 | 0.0e+00 | 100.00 | uncharacterized protein LOC111022177 isoform X2 [Momordica charantia]
XP_038897839.1 | 0.0e+00 | 90.60 | sacsin isoform X2 [Benincasa hispida]
XP_038897838.1 | 0.0e+00 | 90.55 | sacsin isoform X1 [Benincasa hispida]
XP_023513522.1 | 0.0e+00 | 90.20 | sacsin isoform X1 [Cucurbita pepo subsp. pepo]

Match Name | E-value | Identity | Description
O82660 | 8.0e-178 | 79.27 | Photosystem II stability/assembly factor HCF136, chloroplastic OS=Arabidopsis th...
Q5Z5A8 | 2.4e-174 | 77.95 | Photosystem II stability/assembly factor HCF136, chloroplastic OS=Oryza sativa s...
Q9NZJ4 | 1.8e-145 | 26.82 | Sacsin OS=Homo sapiens OX=9606 GN=SACS PE=1 SV=2
Q9JLC8 | 1.2e-141 | 26.40 | Sacsin OS=Mus musculus OX=10090 GN=Sacs PE=1 SV=2
Q9AW48 | 9.4e-94 | 46.13 | Photosystem II stability/assembly factor HCF136, chloroplastic OS=Guillardia the...

Match Name | E-value | Identity | Description
A0A6J1DLW4 | 0.0e+00 | 100.00 | sacsin isoform X1 OS=Momordica charantia OX=3673 GN=LOC111022177 PE=4 SV=1
A0A6J1DQI4 | 0.0e+00 | 100.00 | uncharacterized protein LOC111022177 isoform X2 OS=Momordica charantia OX=3673 G...
A0A6J1H4Y1 | 0.0e+00 | 90.16 | sacsin isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111460612 PE=4 SV=1
A0A6J1KUI4 | 0.0e+00 | 89.89 | sacsin isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111497740 PE=4 SV=1
A0A5A7URT0 | 0.0e+00 | 89.19 | Sacsin isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold339G00...

Match Name | E-value | Identity | Description
AT5G23110.1 | 0.0e+00 | 61.04 | Zinc finger, C3HC4 type (RING finger) family protein
AT5G23120.1 | 5.7e-179 | 79.27 | photosystem II stability/assembly factor, chloroplast (HCF136)
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001841 | Zinc finger, RING-type | SMART | SM00184 | ring_2 | coord: 4716..4762; e-value: 0.009; score: 25.2
IPR001841 | Zinc finger, RING-type | PROSITE | PS50089 | ZF_RING_2 | coord: 4716..4763; score: 8.71251
IPR028203 | Photosynthesis system II assembly factor Ycf48/Hcf136-like domain | PFAM | PF14870 | PSII_BNR | coord: 4861..5182; e-value: 7.2E-122; score: 406.4
IPR013083 | Zinc finger, RING/FYVE/PHD-type | GENE3D | 3.30.40.10 | Zinc/RING finger domain, C3HC4 (zinc finger) | coord: 4692..4765; e-value: 1.7E-7; score: 32.5
IPR015943 | WD40/YVTN repeat-like-containing domain superfamily | GENE3D | 2.130.10.10 | - | coord: 4926..5162; e-value: 1.7E-7; score: 32.3
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 3400..3425
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 4803..4823
None | No IPR available | PANTHER | PTHR46919 | FAMILY NOT NAMED | coord: 6..4748
None | No IPR available | SUPERFAMILY | 110296 | Oligoxyloglucan reducing end-specific cellobiohydrolase | coord: 4871..5183
None | No IPR available | SUPERFAMILY | 57850 | RING/U-box | coord: 4714..4746
IPR036890 | Histidine kinase/HSP90-like ATPase superfamily | SUPERFAMILY | 55874 | ATPase domain of HSP90 chaperone/DNA topoisomerase II/histidine kinase | coord: 1389..1574
IPR036890 | Histidine kinase/HSP90-like ATPase superfamily | SUPERFAMILY | 55874 | ATPase domain of HSP90 chaperone/DNA topoisomerase II/histidine kinase | coord: 33..137

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Moc09g05800.1 | Moc09g05800.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0051087 | chaperone binding
molecular_function | GO:0030544 | Hsp70 protein binding
molecular_function | GO:0005515 | protein binding