Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
TAACAGTTCCTTTCTCCTCTCTCCACCCAGCCTAATTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCGTCTTCTCTCTCTTTCTCTCTCTAGCGAGCAATATAAACGTAGCAGCCGAATCTCATCGTCATCAACCAGAAACCCCTACATCGGTTTTCCCTCTTTTTGAATCTTCTATTTGATGCCCCTTCAATTCCGTTTATATTATGCTATTTTAATCTAGTCCAATCGCATCCAATCCAATTCGATATAATCTATCGTCTCGTAGTCTCGATCAGATTTGCTTCTGGACGGAGTTATTTCGATCCGCCGACGATCTGATATTTTGTTCTAATTACCTATTGCGCTTGCAATTCTTAGCTACTATCCATTTTGGCTTGGTTTTTTTGGGCTGTCGATCAGATTGGGTAGATTTCGGGTTGCGAGGTAATGGAAGGATTGAGAAATGCAACTATGAGAACTTGAATGATGCCTGCGTTAACACAAAAAAATGACCATTTAAATGGTGGGTCATCGGCTATATACTCGCTCTCGGCCAATGGATTTTGGTCCCAGCATCGCGACGATGTTAGCTACAATCAGCTCCAGAAGGTATTCATTTCTTTCGTTGCCAGAAGATTTAATTGGAGTTTTCCTCGTTTGTTCTTATTTTGGGAATAATTGCCACTATTATAGATCACTTTCCTGCGCTGGTTTTTGTCACTTTCCTGCTCTGTGTTTTTGCCTGAATTATACTTCATAAAAGTCTGGATTTAATACTGCCGCCAATGTAATCTACCAAAGAAGTGTCTAGCATTTTAGATGTGTGAGTTTGACACTGTAGACCTCTCATTTTTATTTTTTTTACCCAGAATAGAGTTACTGATTTTTCATATTTCCTCAATATGTTGCTCTATGAACAACAAATCTTCCTTGTAGGAGCATCACATTGTTTAATTATGACCTTAAATCATTAGTTGGGATAAGTATGATCTAATTCCAGCATTACTTGGAACTAATGTGGTGAACTGACCATTAACTAATTCAGTTGTGTTTACTCTCAATGTAGTTTTGGAGTGACCTGCTGCCCCAAGCTAGGCAGAAACTCCTGAGAATTGACAAGCAAACCCTCTTTGAGCAAGCTCGTAAGAATATGTACTGCTCTAGATGTAATGGTTTGCTGCTTGAAGGATTTTTGCAGATTGTCATATATGGGAAGTCTTTACAACAAGGAAAAACATGTGCAAATCATTCCTGCAACAGATTAGGGGTTTCAAAAAATCATGCATGTGATGGATCATTATCGGTTAATGGGTTTCAAGATGAAATTCAAGATCCGTCCGTCCATCCTTGGGGTGGTTTGACCACAACACGTGATGGGGTGCTGACACTTTTGGATTGCTATTTGTATTCGAAGTCTTTCCTGGGTCTCCAAAATGTACGTGCTTATTTTTGGTTTTCTTTGCTATTAGGCACTCATATTATTAGCCCATCCTTTTAATGGCTAAGGTTTCCCTTGAAGCAGGTCTTTGATAGCGCACGTGCTAGGGAGCGAGAACGAGAGTTGCTTTATCCAGATGCCTGTGGTGGAGGAGGTCGAGGTTGGATAAGTCAGGGAACAGCAAGTTATGGCAGGGGACATGGAACAAGGGAGACATGTGCCTTGCATACTGCTAGGCTTTCTTGTGATACATTGGTTGATTTCTGGTCAGCATTAGGAGAAGAGACTCGGCAATCTCTTCTAAGGATGAAAGAAGAAGATTTTATTGAGAGGCTAATGTACAGGTGTCTAGCCGATGCTCATCCTTTGTTTCACGCATTGCATTTAAATTAATGAAAGGAAAATGTGATAGTCAATGCTCGATGGAGATTTGATTACATAATATAATCTTTAGGGATAGAAAGTATCATGAACACATTTTGAGTAACACTAACATGAACAAATAGAAGCTCCACGTCTTAGTAAGGAATACATCTCTTTTCTTTTTATTTTTTTTATTTTTAAATTACTGCT
TTCTTTTAATGCATTTTTAAGATTTAAGCTAGAGTAGAATACTGCCCTAGTGTGTATGTATATATATATATTCACTTTTCTCCATGGAAGCATGATTGCTTATGAAAAAACAAAGAAATCTAGAAATACAGCTGTCTTGATTTATTGTATAGATATAATTTTAGTGAAGGAACTTATAACTTAAGTAGATTGTAGTTTACAAGCCAAGAAAATTCTTCTGTAAGATGGTATGTATGCCCCTTTTAGGTGTTGTGCAATTTATAATCAGATTGAAATGAACAAATAAGGGACAGCTTCTAATCCTTTGAGATGAGTAGATAATAAATGTATTGGAAGAATTGGAAGGTGCCTTTTCTCTTTCAATTATTGTGGAGTTGACTTATGGGAAGTATTATGAAATACTTTTGAACAGCTCATACTCTTTGGCCTAAAATTGGGATGAGTTGATAATAAATGTATTGGAAGCATTGGAAGGTACTTTTTCTCCTTTAATTATTGTGGGTTGGCCTATGGGAAATATTATTAAATATTCATGGGGAGCTCATACTCTTTGGCCTAAAATTAAAGGGATTAAATTGTAGGATTTTCTGGTTTTTGTTGTCAGGCATTGCATTGTTTGGAGGGGATTTTGGTGTTGTTTGTTCTAAATGAGCTAAACACCCTGTGGCCAAGGTTATAGTGAGCAAATATGACCCTCACCTTTTCAAATGGATTGTGGCTAGGGTCTTGAAAGGCACATTTATAAATTCTTGGAAGTATATTGCTTCTAGTGGTTCTTTGTTCTCTAATTTCGTTTACAACTCTATTTGAGATGGTTGAAACACTTATTTCCAGAAAGGCTGATGGTTGCATGATACTCCCATTTGTAACTCGTTTCCTCATTTGTACTAGCTGTCTCTCTTGACTCTAGTCATTTCTTATTCAAGTAATGGGCTTATGTTTAACTTTGGTGTCTAATGTCCTTTGTCGGATAGGGAAGCCTTGGATGTGTTGATTCTCTCTGCTCTGTTGGGAGAGGTCCACCTTCGTCTGGGCAATAGAGCTATATCTTGTCGGTCTTCTAATCTCTCCAAAGGGTTTTCTTGTCACCCTTTTCTCCTCCATTTAGGTGCCAACAACATTCTTATACAAAACCTTTATTTTCCTCGTTGTGGAAGATAAAATGTTTCCAAGAATGTTAAGCTCACTGCATGACAAGTTTTAGTGGGAGAGTCAACACCATGGATTGCATTCAGAGAACTCTTCTTTTCTTTTTTGATTAGGCGCAATGTGGTACTTCTTGGTGGCTGAAGATCTGGGTCATATCCTTGAGAGATGCAAGTTTGCTCATTTGTTGGATGGGTTTATGGAGATGTTTGGTGTGAGTGTGGCTTGTATGATTATAGTTTCATGATTGAAGAGGTCTTATTCCATCCCCAATTTCGAGATAAGGGTCTTACCTTGCGACAATTTTTTGGGTCTTTCCTTGTTGCATTATCTCTCAAGAAAAATGCATCCATCGCTGACCTTTGGTTTTTCAATGTAGCATCTTGAAACCTCTCCCTCAGACGTCATCTTGTGGATGATGAGATTGTGGAATGGATTCTTTGTCAAATCAGGCTTTTAGACCCAATGGTTGTGATGACTAATGGTGCTGGCTTTTAGAAAATGATGGGGTTTCTCTGCAAGTCCCTCACAAGAAAATTAAAATTGGCTTGGGCTGATTTAAGGACTTTTGATGGTTTTTATACACATATTGGGAGGGTTACCTATCCAAATATTAAAAAGTTTTTCCTCTGGGAGCACAGTTATTCCAGCATAAACACTCAAGATGTCCTACATGTCTTACAGAAGAAATCCCCATGGATGTACGTGTCTCCGAACTGCTGTCCTATTTGTATGAAGCGTCTGGAATCACAATTCTATATTTTCAGCAACTGCTGGTTTGCAAATGAATTTTGGATCTTTATTCTCTCTAAATTTGGATGGAACTTGAGTATTGACTCGCCCAGGTGAGCATT
AGTCTCATTTCATTATATCAATGGAGAGACTTGTTTCCTTTTCAAAAAAAAAAAAAAAAACTTGAGTATTGACTCGCCCAGGTGACATATTCTCCCTTCTATCATGTGTATTAATTGGTCATCCCTTTAAAGAGGAGAAACAGACCCTTTGGTTAAACTTCATGAGAGCTCTTTTTTGGAGCCTTTAGATGGAACGAAATGGCCGCATCTTCAACGATAAGAAACGTGGGATTACTGACATTATTGATTCTATTTCTTTTCTTAGCTTTATCTTGGTGGAAGTTAAATCAACCTTTCTGTAATTATTGTTTATTTACTCTCCTAGAGTACTGGACTAGCTTCATGTAAGTATCTCTTATAGATTCTTCTGTACACATTTCACCATATCAATGAAGTAGTTTTGCTTCCTTTCCTTTTTTTTCTTTTTTCTTTTTTTTTTCTCCTCATTATTATTATTTTCTTATAGGCCATTTAGTTAGAGAGGGATGGATGGTAGAATGTTTAGAGGGGTTGAGAGGTTATGGAAAGAGCTGTAGGTTCTTGCGAAGTTTAATACCCTCTCTTTGGGTGTCTGCTTCGAGAGACTTTTGTAACTACCATCTATCTTATTATTTTGGATTGGAGCATTTTTTTTAATCTTGTTATGTTTCTTTGTGATCTTTCATTTTTTCATTTTATTCTATACTATTTTGTCATTAAGGTTTTTTAAAGCCTTAGTAAAACTTAAAAGTAGTAAGGGGCTAAAGTATCGTCTTGTATTATATTGACATGTAATACGTAAGTAATGTTTGTTCTCTTGTTAGATTTCACGTTTCTCCTTGGGCTTCGATTTCGAAGATCTTTTGTAACTATTCCATAGGTACTATTTTGTACCATTGGCGCTATTTTCTTTAACAGGGTGTCCTTTTGAGGGCCTGGTTTTTCGTATGTTCATGTATTCTTTCATTTATTATCAATGAAAGTTGTTTTCATTAAAGGGGGAAAAACACGCCAGTGTACTTTTATATTTTTTAAGAATGTAGCAGATTAATATTTTTTTTTCCAGACATTCTAGTCTTTACGTCTGTTCAGTGCATCACATGGAGTCATAAACCTTTTCTTTAATAGACGTGAGCCTCTTTTGATTCGAAGAGTCCTAAGACTTGAGCCTAAAAATTTGAGCCTGCTCTTGAAGTGTGCCTCGAGGTGTAAGCCTCAAAAGTCTCTTAAAAAATTGGTTCTACTTGTTATGTTTTTGCGTAGGCTGCTGACACTTATTTGCACAAATTCAAATAGCATTTAGGATTGTTATAGTTAATGAGGAATATAGGAAACAAGCCTTGTTTATTCTAATAGTATTTATGTTTTCTTGTCTGCATTTTTCTTGCCTCGTAATTATTATTCTTTTTTGCTTCTTACAATATGTTGATTATCTCACAGGTTTGACAGCAAGAGGTTTTGTAGAGATTGCAGAAGAAATGTGATCCGTGAGTTCAAGGAACTAAAGGAGTTGAAGCGCATAAGGAGAGAGCCTTGCTGCACTAGTTGGTTTTGTGTTGCAGATATGGCATTTAATTATGAGGTGCATCCTTGCACTTTTCTTCAACTATAAGTGCTCTTCTACCATCATGGTTATAAAGTAAACTTCATAATTATTATGAATGATAATTATAATGACTATGTAGATCATGTAAGGTTTCTCTAGATTGTGTGCGTTTTGCAATCAGCATGAAAGAATTTATAATTTCTATTGTCTCAGGTATCGGATGACACAATCCAGGCCGATTGGCATCAAACTTTTGCTGACTCTGTGGAGACATATCATTATTATGAGTGGGCCGTTGGAACAGGAGAAGGAAAATCTGACATTCTGGAATTTGAAAATGTTGGCATGAATGGAAGTGTCAAAATAAATGGCCTAGATCTTGGTGCTTTGAGTTCATGTTTTATCACCCTCAGAGCTTGGAAATTGGATGGACGCTGCACTGAGCTCTCAGTGAAAGCTCATGCATTGAAAGGTC
AACAATGTGTTCATCGCAGGCTTACAGTTGGTGATGGATTTGTTACAATCACGAGAGGGGAAAATATTAGGAGGTTTTTTGAGCATGCTGAAGAGGCCGAGGAGGAGGAGGTTGTTCTCTTTCTTGTAATTTTGTTTCTTATGATTTTGTCTACCAAAAACTTTTGTATATCATTTGTATTTTCAATAATGAATGGGTTTATTGTTTCTGCTATGACAAAAACTATCTGCAACATTATATATTTGATGAACTTGAAATTCTTGTGGGTGGCAAATGTTTTGTCTTCTAAATCTTGGCAACTCTAATTGATTTGCTGAATTGTGTCTTTTAGGAGGACGATTCTCTTGATAAAGATGCAAATGATTTGGATGGAGATTGTTCTCGTCCTCAAAAGCATGCAAAGAGTCCTGAACTTGCTCGGGAGTTTCTTTTGGATGCTGCAACCGTCATCTTTAAAGAACAGGCATGTTATCTGCAATCTTCTTGGCTTGTGCGTTCTTGCCAATTGGCATTTGTGATATTATTCTTTCACGAAGACAGATCACATGTTAAAGTCTCTTGTTTTGAGAAATTTTATTAAGCACCATGGATATAATTAGCAGTTAGGCACATTGGTACAAGGAAAAACAAATTACAAGTTAAATTACTACATCTATGGAATCAGGTGAAAATGTTATGCTGTATTTTCAGATGTAATTCTTGGTTAATATATGTGTACATTTTTTTTCTTCCTTATTTTTTAGTATCATTTAATGAGATGCAGGAAGGTCTAATATATCCAGCCAAGGTGGCTACAACTTCCAGTTAATTTCTCCCAAGACTTTAGTGAAGATTATGTCAGTCCAAATAGTTCTGGTTTTACCCATTTAGGCTAGTGACTAAGGATAAAGTGGATGTTATAAATGAGGCAATATAAGATAATGGAAAATTCGAGAGAGCTAGGGAGAAGGAAAATAGAAGAAATTCAACTTTTAAACATTTCGATGTTGAAGTCTAGTTACTGCAGTTTATTAATAGGTGAACCAAATGGATTTACAAAAAGGATTTCCACCTGGCCTAAAGTCTCATCAGTATAATAAGAAAAAAGGCAGGAATGTTTGCACCAATATATGTCCCTTAGGACGTTATCCATAATGGAATTAAAAGAAGTCTCAAACCCTTTGAAAAATATGCAATTTCTTTCTAGCCAAAGGTTTCCAAAGAAGCTTCCACCTGGCACAACCAAAGGGTTTGTTGCTCGTTGTGGAAGGGGTACCCTAGTAGGATGCCAGATAAGAGGTCATAGATGGATGTAGTGTTTACGTTTCAAAAAAAGGATGGATGTAGGCAAGGCCGTGGCCCATTTGAAATTGATGTGAATAAAGCCTTAGAAGCTATGGACTATGGAACAATGGATGGATATATGGGCTTGAAACTCATAATCCTTTCTGCAAATATGGAGGAGAGAGAATGGTAAGGAGCCTTCCTTTGAATCAAATCATGAGAGTTATGAACCCATGAGCTAGCTTCCATTTGAAGAACTTGACCTTTTTTTTATAGAAGCATTCCCAGATTGCCAAATATAGGGGAGATGAGCCTTTATTTAAAAAACAAGCATATAAGAAATGCACTGTGAAGCTATTGTCCAAAGCTCCTTTCCCTCTGTCTTTTCTGTCTTGTGGATGAAATGCCATCAGTCTCTTCATCTTTCAGATTTCTGATGAGGTTGAGAATCTGTGAGGGTACTGGTAGGGGATATGTTAGGAACATTAGTAGGATAGTAGTAAGGGTACATTAGTAATTTGGTAGGGAGGTGGTTATGAAGTTTAGCTATAAATAGAGGGAGTGGGGAATGAGAGGTTTAGGCATTTTGTGAGTGAATTAGGGCTTGGGAGAGATCTCAAGAGGGAGGGTGCAAGTACCTCGAATTACTTGTTTATATGGTAATTTCTCTAGATATTGCGATATATTTATTTTGTTGTTTTCATATATTTTCTTGTTAGGAGGTACCCTAACAAATT
CCAAGAGTTAGGAATCCATAGCTCGTTTTACACATTTGGATGTCTAGAAATATTTGGCTTGTTGTATTTGATTTGTTAGATTTATATGTGGATATGTCTGATGATCACTAAAATATCTTTGTTATACTATATGTCTCTGTTGTTTAATTTCTTTTTGAAAAGGAAATAATATTTTTCATTAATATAATGAAAAGAGACTAATGCTCAAAGATACAAATAACCCTTAAGAGGGGAGAACAACTAAATAAACCAAAAATAGACAAATGATAGTTAGGTTTTTGGATGAATGACCAGTACACTTCCATGGAACTAGTGACAAAAAGGCAGCTCTTGAATGAAGACAACCAAAAGTCTACTTGAAAAAGGAAATGCCTTAAAAAACTTCAATTCAATGCTTCACTCCTCTAGCATAGTGATGCAACCTTGAAGCTGCATGGATAAATGCTTCTACTTGGTGTTCTCTTTCTAAGCTTTTTGTTAATTTCTCTATTCAAGATATATGTTTAAATTGGGGTGCTTTTATTCTCTAACTAGCCTCTTCTCTAGTGATGGATAGTCTTGTTCTTGTTGATTTTATGTGCTATGCATTTTTTTATATTGTTTCCTTGTATTTTTGAGCAATAGCTTCTTTTCATTATATCAATGAAAAGTTATGTTTCCTTTTAAGAAAAAACCTTGAACAAATAGAATTTTAATAAAGGAGGAGAATTAAAGCTTCAACTTCATGCACAATGTTGAAAACTGAACCAATCTTGGAACTTGGGCTGTTGGAGAATAAAAGACAGATTCGAATTTAATGCTGCCATTCAAGAACTTAGTGAAAAAACGACACTTGAAGAAGACAATCGAAAGTCTTCTATTCTGGGATGGAAAACACCTTAAAAGCTTTGAAATTAATGCTTTGCTCCTCAAGCAAATTAATACATCCTTGAACAAAGAGACTTTATTTAATAAACGAGGAGAATAAAAGTTTCTTCTTCATGCAGAAAGTTGAAAACCGAACCAACTTGAAACTTGGGCTGCCTGGAGAAGCATAGAATTCTGCCATAATATTTCATAAAGGCAACCTTCAAAAAAGAGGCCAGCCTTAATTAAACCAATAATAAGAAGCCATGCAAAGGGTCAGCCTTAATTAAACCAATAAGAAGAAGCCATGGAGTTCAATTAATTTTATATTTCAGTATAATATTAATTACATCAAGTTTAAAGTCAAGTTTTCGATATTTGTTTAGAAACTTATGTCATACATCAATGTAAATTGAGATATTATACATTTTAATCAAAACTTATTTTGTTATAAATTTAATATATAAATATCAATTACTTGAAGACTGAAGCATGGTCAACGTTGTGTCTTGGTTAGACATTTATTGAACACTCTTAACACATCGACACTTGTTGAACATGGAACAAATGTTTGGTAGCACAATGGACATGTGGAACACACTAATTTCATAAATTGAAATTGATTTTTTTTTTTTTTTAATTTTTAATTTTTATACAATATGTGAGATGAGGGAATCAAATCTATGACCTCAAGGTCGATGGTACAAACTTGATGTCAATTTTTCTATGCACATGTTTGCGACTTTTGACACTTTTGCTTGTTTTTTTTTTTTCTTGGTGAAAAACTGACCTTTCATTGAGAAAAAAAAAAGAAAGAAAGAATACACTGGCTTGCAAAAGGAAGTCAATCTACACAAAAGGACTCTAGTCAAGCAAAATTAGACTAAGAGTATAATTACAAAAAAAGCTTTAACACCAACGCCTTAAAAGAAAGGTAGAATTTGATAAGTGACGAAATATCATCTAAAGACCTCTCCCAAGCCCTCAAAGATTTTACAATTTCTCTTGCCATATAGACCTTGCAAAATAGCACACTCTAGCCTGCCACAAAAAAAAAAAAAAATCCCTTTCTCAAAAAAAAGGTGCATGGAGAAAGAGAACCTTGATCGTCTGGCAACTGAGTGCTAGTAAAACAAAAATTGAACACCTCAAAG
AAGTGAATCCATACAGCACACGTCCAAAAAGATGATCAAGGTTCTTTGACGCTCTTTGGGTAGAATACAACACAATGGTCTGGTCACCATGAACCCTCTAGCCAACATTCGAATTTCGACCTATGGTGTTAGCTCTTCCATACAAAACTTGCCAAACAAAGAACTTGACCTTCTTGGAAATATTAATCTTCCACTCAGAAAAAACGGAAATGCTCGATGGGAAGAGGGGGTTGACCAAACACAAAAAAAAAAAGAAAACCTTTGAATGGATTGAGACTCCAAAAGCAAAAATCCATCCTTATAGGGCTAAGAAGTTCCCTAGCAAAGATAACAAAGCTAGAACATCTGACGTTTCCCTATTGGTCTACGGATGATGAAAACTTGGAGGAGAAGGCAAGTTCGCAAAGTGAGAGAATCGAAGCAACCTAATGATACTTCCACAAGGAACAACTGATGTAACCGAAGATGTAAGGAGCAAAGGGGTCTATCAACAAATCACTTATCCTCCTAAAAGTAAGTATCCCTCCCATCGCCCACCATATTATGAACAAATTGGGAAAAGAATCGAAGTCTATCTGGAATATTTTTCCACAGGTGTTTGGTATGCCATTAACCATTGTCTATTCAAAAGGATGAGGATCATGCTTGCTAAAAATAACCTTGTGCCATAGGATGTCAGAATCATGATGAAATCGCCAAAACCATTTATCCAGAAGGGCTTTTCCTATGGGTGTCTAGATTTGTCTCTGCTTTCCAGAGACTTCACATGGGTTAGGAGGGATCTTTGTTTTTGGACCCATCCTTCTTAGGGTTTTCCTTCTAGTTCTCTTTTTCATCTTCTGAGTACTCCATTGATTCTCCTCGACACCCCTTTTTTTCTTTTATTTTGAAGATCAAAATCCCTACAATAGTTAAGTTATTTGCCTAGCAGGTCCGAGTCCGACATGAGAGAGTTAATACTTGAGACTGCATGCTGAAACATTTTCCCATTTCTGGGGCCTTAGTGGTTTTCCCTTTGTGAGAAGCGGTGGAAAATCTTAACCATATTATTTGGAGATTCTGTTTGTTCCTTCAATTTAGTCCCATTCTTGGGGCTTTTGGAATTCCATCCCTTTAGGGAGTGGGACAATTTTCTACACAAGGCTTGCTTTTTGGCAGTTGTTTGGGGGATTTGGCTCAAGAGAAATAAAAAAGCTTTAGAGGGAATGAGAAATTTGTAGAGGAGGTGGGGGAGGTCATTAGGTTTAATCCCTCGTTTGAGCCTTGGTGGCAAAGATCTTTTGTAATTAGCGATTAGGAAAATGCTTTATTTTCTTGGATCGGAGTCCTTTGTTATAGTTCTTTGGGCTCCTATTCTTGGTTTTTTTTTTTTTTTTTTAGGAGACTCAAGAGGTTTACTTTGCAGATCGCTTTGATACTTTGTTTGAGAAGGTGAAAATGTTCAAGCAGAATTGTCTGCCATAATTCTTGCCAAGTTCTCTACTTTGATTGAAAAATGTGATTTTTAAATGAAGTCAATCCCCCCTTGTTCGCCTTTAGTTAAAGCATAGCTTGGTGGTTTTGTTCAGGAATTGAACTAATTTTGTGGATTTCTCGTGTTTATGTGCCTAAGCCCTACCGTTTTGTCCTTACAAAGTTGCGTTTTTTCTCCTAGAAGTTGTAATCTTGGAAGCGGCAGTTTAGAAGTTAGTTTGATTTGTGGTTTGATCCTCGCTCTTTCAGCCTTATTGGCCTTCTTATCTCGTTTGCTCATTGGACTTCTGTTGGCTTTGGCATTATAGTTTATGTGCTCTCTAGTTTTGGGTTCCTTTTTGGAGCTAGTTTGTTAGGTTTTAAGTGTTTGTTTTTTGATTTCTTCTCCTCTTTGTATATCAATAGTATTTCTTCTCCTTTTGTTTTTTCACAATTTCTTTATCCTCTTGAAAGTTATTGTATCAGTGAGCATTAGTTTCTTTTCATTATATCAATGAAAAGTTTTGTTTCCTTTTTTTTTAAAAAAA
ATTATTTTAATGCCTTGTAATTTATATTTCTCATTGGTATCTCATCACATCAGGAGAATCGCTGTATCTATTGATATTGATTCTATAGTTAACAATGCATATCTTTTGAATATATATAAACTAGCTTTAAACAATTGAGCTGGGTGTAATTAGGTTAGTCCATTAGGAAAATAGAGTGCAATTGCATGATTACAAAGTATCCCTTTATCCATCTACAAGAATTAGGAGATGAGTATTGCCATGGAGGCAATACAACTGCTCTCTATTTTAAAGTAATGTCAAATTTTCAAAGTATCTCTATTCTCAAAACCATCCTTGGTAGGATTTATGGCTTCCTCGAAGGAATCTAGATCCATTGAGGAGCTTCTATTTGTATGAGTGGAAATTTCCCAAATGATTTTTATTCTAAGATTTTATTTTGTGGTTTGAGAGTTCTAATGTAAGCATGAGGGTTCATTTAAGCATTAGTCTCTTTTCATTACATTAATGAAAAGTTTTGTTTTTGGTTCAAAACTTAAACATGACGGTTGGGCCAGGAAAGCCGATGGTAATTGATAGTCATAGAGATGGATGAAGTTGGGATCCAATAGCTATGTGTTTTCAGGTAGAAGGAAAATGGGACGGGACCCAACCATCATGGGTTGGCCTAGTGGTAAAAAAGGAGACATAGTCTCAATAAATGCCTAAGAGGTCATGGGTTCAATCCATGGTGGCCACCTACCTAAAAATTAATATCCTATGAGTTTCCTTGACACCAAAATGTTGTAGGGGTTAGACGGGTTGTCCTATGAGATTAGTCAAGGTGCGTGTAATTTGGCTTGGACACTCACGGATATCAAAAAAAGAAAAAAAAAAAAGGGACAGGACCCCATTGAAGATTATGTGTTTCCCAGAAATCTTTGACGGGGAGTCCTTTTTGTAATGCATTTTGTGGGAAGAAAAATATGTTACTTTGTTTCTTAAATAACTTCGTCATGTAATTTGTTTTGGGTTGATTAGTTTCATTAATTTGATAATTAGTGTTTAAGGCTTAACTAGTTATTTTCATTTCTTGTAAAGTAGCACTAAATAGTTGGACATTATTCATAAAAAGGGTTCTCAATTTGGGAAGGTTTCTCTCTTGTTCAAGATGGGACTATTGGGTTGTTAATGACCATTAACGTCATTCATCTAAATCTACTGAATCACCTTGGCTCGGTGGCATATCTCAACTTCCTTTTTCTGTACATCCTCAATTCTTGAAGAAAGTTTTAGTGACCTTCAACTTAAATTTATTTATATCAGATTACAATTTCTTTATTTTGAGTAAAAAATTATATCCAATTGTTGACTTGGCAATACAAGTGGATATTATCCTGTCATTTCCATAATGATGATTGATACTTCATTTGTCGTAAAGAAATTTCTCTTGAATGTTTTGCTTGTTGGATTAGCTCATCCGCTTAAACTGATTGAAACTTCAAAGTTGTTGTGTCTTTCATTATTCACTACTTTCCAATTAATCAAGGGAATTTTTTAATGTGGTCTCTTCTTCTAGCTTCTTTGACTGGAAATCCAATACTGGTACTGATCTAAGTTTTGCCTAGGAATTATAGATCGCTTTTATTCTTATACCTTGTGATCTTTTCCAGGTTGAAAAAGCCTTCAGAGAAGGAACAGCACGCCAAAATGCGCATAGCATCTTTGTGTGTCTTGCACTAAAATTACTGGAAGAACGAGTTCACATAGCATGCAAAGAAATCATTACTCTGGAAAAGCAGGTTTCATTGCGAGCTATATCATTACTCAGAAGTTTATTTACAAGCATTATGATCTCTATTTACTGCTAACATTTGGGAATATTGCTGCAGATGAAACTTCTTGAAGAAGAAGAGAAGGAAAAGCGTGAAGAACAAGAACGCAAAGAGCGGAAAAGGACAAAAGAAAGAGAGAAGAAGCTCCGGAGAAAAGAAAGATTAAAAGGAAAGGATAAAGATAAGATAAGTTCTGAATCAGCTGAA
GCATGTACTCATTCTGATGTCTTGGAGGACTTATCCCCATGTGTTTTGGAGCCAAATCCCAATGCAGTCGGTGAAGTATGTGATGCCAGTGTGCCTGAATCTTCTGATATTCTGGATGAGCTGTTTTTAAATGAATCCATCATTTCAGAAGGGCAAAATTCATATGATGATAGTTTTGATGGGAGACTTACCGATGGAAATGAGTCTTTCATAGGTGATCAATCTAAAGTTTCTCGCTGGAGATTAAAATTTCCAAAGGAAGTTCAAGATCATTCTTTCAAGTGGTCTGAGAGGCGCCGATTTATGGTTGTTTCAGAAAATGGGGTGCTGGTTAACAAATCTGAGCAAAGATATTATGCCGATAGTTTGGAAAATCCTTCCAGGAGTATGAATGGATCAAACAGGAAATTAAGAACAAACTCATTAAAGGCCTATGGTCGACATGTCTCTAAGTTTAATGAGAAGTTGCACTCTTCCAACAACCGGGTATCTTATGACTACCGTTCCTGCATCTGTAACCAAACTAATGAATTTAACAAAAAGGTGGAACCATTTGTTTCTTCAGTTAGAGTTAACAGAGATGTCAAATCTGTGAGCAAGTCAGAATCTTCATTTGATATGTCCAAGCAAAGTTTTCGTTCTAACAAGTACAGTTATGGAGATCATTCTCGTGATAGCGGAAGACTGAAAAACAAAGCTGCTTTATTAAACAATTCTCCTAGTAAAGATTTTGTTTATTCAAAGAAAGTTTGGGAGCCCATGGAATCACAGAAGAAATATCCTAGAAGTAACTCAGACTCAAATGTTGCATTGAAGTCTTCAACTTTCAAGTTTGGTGTGGAACCTGATTATGACCCTTTGAAGTCGAGGCATGAATTTTGCAGTGGCGAAGTTAGTGTAACTTCTGGTACAGTTGATCCAGAGGAGAGTAATTCCACTGAATCAACTTCTGGTATTGAATCAGATGAAGTCTTCCAAAATGGACTTTCTATCGAATCGAAGGATCATAAAAACGTAGAAGAAGATGCATGTGAGGAGGTAACACAGTGTTCTGTAAATTCAACCATGGACATGACATTGACATCCAGTGGGACCAGTAACCCAGTAGGAACTAGCTCTTTAAATTCTGATAACTGCTCATCATGCCTGAGTGAAGGAGACAGTAATACTATCTGCTCGAACCATGGAAATTTAGAATCCTCCTCCACATCAGACTCAGAATATGCTAGCCATCAATCAGAAGGAAAAGAATCTTCAGCATCCATTCAGAATGGCTTCTCTGAGCATCATGAGATAAGGATAGATAAAGTAATTGGAGGTGATGCCATGGGGAGCACGAACCATTCCGGACTTTCTCATGATAATGGGGGATGTAAAGTTCAAGGGAATGCATCCAAAAATGTTCCTCAGAACTTCGAAGCTGGATTCTCTGCTGTTAGTCTGGACTCCCCATGTCAAGTGACTCTTCCTTCGATTCAGAACCAAAATATTCACTTTCCAGTGTTTCAAGTTCCACCATCAATGAGTTATTACCATCAAAACTCAGTTTCATGGCCAGCAGCTGCCCATGCAAATGGAATGATGCCTTTCTCCTATTCAAATCACTGTCTATATGCCAATCCTCTTGGGTATGGTTTGAACGGCAACCCACGCTTCTGCATGCAATATGGCCATTTACATCATCTTTCTAATCCTGTATTCAACCCTAGCCCGGTTCCTATTTATCACCCGGCTTCCAAAGCCAGCAATGGTATCTATGCCGAGGATCGAAGTCAGGTCTCCAAATCAGGTGCATTAGCAGAAAGTTCTGTAGCTCATTCAGACGTCGTTGTTACCACTGGACATCCATATGCACTGAGTTCACCACGAAGCGGAGATTGTAAACAAAGTGATACTTCTTCAAAATTGCAAAAGGATAGCTCAAGCTTTTCATTGTTTCATTTTGGAGGGCCTGTTGCACTTTCAACAGGAGGTAAATTAAACCTCACGCCTTCTAAGGA
AGACAATGTTGGGGATTTTTCAAGAAATAATGAGGTGGAAGTTGTTGACAATGGTCACGCTTTCAATAAGAAGGAAACTGCCATTGAAGAATACAATTTGTTTGCAGCAAGCAATGGCATGAGGTTCTCATTCTTCTGAATGAGGACAAGAAGATATGAGGGGGCTGTTTTCGATTTACTAGTCCCTATCTTCACATATATTTTATTTTTTAATGAAATTTTACAGTACTTGCTACATAATAAGTTTTCTTGAAATCTCAATTTTACCTATAAGTTTTTCAGTAGTTGATTGCTCTCAAAATTTGTCTTCATTATTTTTCCAGATGTAGAGAACACAAAAACAAAAGGAGAAAAGAAAAAAAGTAAAAAGAAAAAAAAGAAAGAGTGCTATGTGTTTGAAGGTTTCCGAAACGAGTTCGAGTTTTTTCTATCGACAGAAAATATGTTGATATGAAATTTTTGCTGCCTATTATTTGCTTGAAACCGTGATTCATTTTTAATGTTTTCTTGCAGTTTTTAGCAGCAACTATCTGATTCATTTTTGTAGTTTGAGTTGGA
mRNA sequence
TAACAGTTCCTTTCTCCTCTCTCCACCCAGCCTAATTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCGTCTTCTCTCTCTTTCTCTCTCTAGCGAGCAATATAAACGTAGCAGCCGAATCTCATCGTCATCAACCAGAAACCCCTACATCGGTTTTCCCTCTTTTTGAATCTTCTATTTGATGCCCCTTCAATTCCGTTTATATTATGCTATTTTAATCTAGTCCAATCGCATCCAATCCAATTCGATATAATCTATCGTCTCGTAGTCTCGATCAGATTTGCTTCTGGACGGAGTTATTTCGATCCGCCGACGATCTGATATTTTGTTCTAATTACCTATTGCGCTTGCAATTCTTAGCTACTATCCATTTTGGCTTGGTTTTTTTGGGCTGTCGATCAGATTGGGTAGATTTCGGGTTGCGAGGTAATGGAAGGATTGAGAAATGCAACTATGAGAACTTGAATGATGCCTGCGTTAACACAAAAAAATGACCATTTAAATGGTGGGTCATCGGCTATATACTCGCTCTCGGCCAATGGATTTTGGTCCCAGCATCGCGACGATGTTAGCTACAATCAGCTCCAGAAGTTTTGGAGTGACCTGCTGCCCCAAGCTAGGCAGAAACTCCTGAGAATTGACAAGCAAACCCTCTTTGAGCAAGCTCGTAAGAATATGTACTGCTCTAGATGTAATGGTTTGCTGCTTGAAGGATTTTTGCAGATTGTCATATATGGGAAGTCTTTACAACAAGGAAAAACATGTGCAAATCATTCCTGCAACAGATTAGGGGTTTCAAAAAATCATGCATGTGATGGATCATTATCGGTTAATGGGTTTCAAGATGAAATTCAAGATCCGTCCGTCCATCCTTGGGGTGGTTTGACCACAACACGTGATGGGGTGCTGACACTTTTGGATTGCTATTTGTATTCGAAGTCTTTCCTGGGTCTCCAAAATGTCTTTGATAGCGCACGTGCTAGGGAGCGAGAACGAGAGTTGCTTTATCCAGATGCCTGTGGTGGAGGAGGTCGAGGTTGGATAAGTCAGGGAACAGCAAGTTATGGCAGGGGACATGGAACAAGGGAGACATGTGCCTTGCATACTGCTAGGCTTTCTTGTGATACATTGGTTGATTTCTGGTCAGCATTAGGAGAAGAGACTCGGCAATCTCTTCTAAGGATGAAAGAAGAAGATTTTATTGAGAGGCTAATGTACAGGTTTGACAGCAAGAGGTTTTGTAGAGATTGCAGAAGAAATGTGATCCGTGAGTTCAAGGAACTAAAGGAGTTGAAGCGCATAAGGAGAGAGCCTTGCTGCACTAGTTGGTTTTGTGTTGCAGATATGGCATTTAATTATGAGGTATCGGATGACACAATCCAGGCCGATTGGCATCAAACTTTTGCTGACTCTGTGGAGACATATCATTATTATGAGTGGGCCGTTGGAACAGGAGAAGGAAAATCTGACATTCTGGAATTTGAAAATGTTGGCATGAATGGAAGTGTCAAAATAAATGGCCTAGATCTTGGTGCTTTGAGTTCATGTTTTATCACCCTCAGAGCTTGGAAATTGGATGGACGCTGCACTGAGCTCTCAGTGAAAGCTCATGCATTGAAAGGTCAACAATGTGTTCATCGCAGGCTTACAGTTGGTGATGGATTTGTTACAATCACGAGAGGGGAAAATATTAGGAGGTTTTTTGAGCATGCTGAAGAGGCCGAGGAGGAGGAGGAGGACGATTCTCTTGATAAAGATGCAAATGATTTGGATGGAGATTGTTCTCGTCCTCAAAAGCATGCAAAGAGTCCTGAACTTGCTCGGGAGTTTCTTTTGGATGCTGCAACCGTCATCTTTAAAGAACAGGCATGTTATCTGCAATCTTCTTGGCTTGTTGAAAAAGCCTTCAGAGAAGGAACAGCACGCCAAAATGCGCATAGCATCTTTGTGTGTCTTGCACTAAAATTACTGGAAGAACGAGTTCACATAGCAT
GCAAAGAAATCATTACTCTGGAAAAGCAGATGAAACTTCTTGAAGAAGAAGAGAAGGAAAAGCGTGAAGAACAAGAACGCAAAGAGCGGAAAAGGACAAAAGAAAGAGAGAAGAAGCTCCGGAGAAAAGAAAGATTAAAAGGAAAGGATAAAGATAAGATAAGTTCTGAATCAGCTGAAGCATGTACTCATTCTGATGTCTTGGAGGACTTATCCCCATGTGTTTTGGAGCCAAATCCCAATGCAGTCGGTGAAGTATGTGATGCCAGTGTGCCTGAATCTTCTGATATTCTGGATGAGCTGTTTTTAAATGAATCCATCATTTCAGAAGGGCAAAATTCATATGATGATAGTTTTGATGGGAGACTTACCGATGGAAATGAGTCTTTCATAGGTGATCAATCTAAAGTTTCTCGCTGGAGATTAAAATTTCCAAAGGAAGTTCAAGATCATTCTTTCAAGTGGTCTGAGAGGCGCCGATTTATGGTTGTTTCAGAAAATGGGGTGCTGGTTAACAAATCTGAGCAAAGATATTATGCCGATAGTTTGGAAAATCCTTCCAGGAGTATGAATGGATCAAACAGGAAATTAAGAACAAACTCATTAAAGGCCTATGGTCGACATGTCTCTAAGTTTAATGAGAAGTTGCACTCTTCCAACAACCGGGTATCTTATGACTACCGTTCCTGCATCTGTAACCAAACTAATGAATTTAACAAAAAGGTGGAACCATTTGTTTCTTCAGTTAGAGTTAACAGAGATGTCAAATCTGTGAGCAAGTCAGAATCTTCATTTGATATGTCCAAGCAAAGTTTTCGTTCTAACAAGTACAGTTATGGAGATCATTCTCGTGATAGCGGAAGACTGAAAAACAAAGCTGCTTTATTAAACAATTCTCCTAGTAAAGATTTTGTTTATTCAAAGAAAGTTTGGGAGCCCATGGAATCACAGAAGAAATATCCTAGAAGTAACTCAGACTCAAATGTTGCATTGAAGTCTTCAACTTTCAAGTTTGGTGTGGAACCTGATTATGACCCTTTGAAGTCGAGGCATGAATTTTGCAGTGGCGAAGTTAGTGTAACTTCTGGTACAGTTGATCCAGAGGAGAGTAATTCCACTGAATCAACTTCTGGTATTGAATCAGATGAAGTCTTCCAAAATGGACTTTCTATCGAATCGAAGGATCATAAAAACGTAGAAGAAGATGCATGTGAGGAGGTAACACAGTGTTCTGTAAATTCAACCATGGACATGACATTGACATCCAGTGGGACCAGTAACCCAGTAGGAACTAGCTCTTTAAATTCTGATAACTGCTCATCATGCCTGAGTGAAGGAGACAGTAATACTATCTGCTCGAACCATGGAAATTTAGAATCCTCCTCCACATCAGACTCAGAATATGCTAGCCATCAATCAGAAGGAAAAGAATCTTCAGCATCCATTCAGAATGGCTTCTCTGAGCATCATGAGATAAGGATAGATAAAGTAATTGGAGGTGATGCCATGGGGAGCACGAACCATTCCGGACTTTCTCATGATAATGGGGGATGTAAAGTTCAAGGGAATGCATCCAAAAATGTTCCTCAGAACTTCGAAGCTGGATTCTCTGCTGTTAGTCTGGACTCCCCATGTCAAGTGACTCTTCCTTCGATTCAGAACCAAAATATTCACTTTCCAGTGTTTCAAGTTCCACCATCAATGAGTTATTACCATCAAAACTCAGTTTCATGGCCAGCAGCTGCCCATGCAAATGGAATGATGCCTTTCTCCTATTCAAATCACTGTCTATATGCCAATCCTCTTGGGTATGGTTTGAACGGCAACCCACGCTTCTGCATGCAATATGGCCATTTACATCATCTTTCTAATCCTGTATTCAACCCTAGCCCGGTTCCTATTTATCACCCGGCTTCCAAAGCCAGCAATGGTATCTATGCCGAGGATCGAAGTCAGGTCTCCAAATCAGGTGCATTAGCAGAAAGTTCTGTAGCTCATTCA
GACGTCGTTGTTACCACTGGACATCCATATGCACTGAGTTCACCACGAAGCGGAGATTGTAAACAAAGTGATACTTCTTCAAAATTGCAAAAGGATAGCTCAAGCTTTTCATTGTTTCATTTTGGAGGGCCTGTTGCACTTTCAACAGGAGGTAAATTAAACCTCACGCCTTCTAAGGAAGACAATGTTGGGGATTTTTCAAGAAATAATGAGGTGGAAGTTGTTGACAATGGTCACGCTTTCAATAAGAAGGAAACTGCCATTGAAGAATACAATTTGTTTGCAGCAAGCAATGGCATGAGGTTCTCATTCTTCTGAATGAGGACAAGAAGATATGAGGGGGCTGTTTTCGATTTACTAGTCCCTATCTTCACATATATTTTATTTTTTAATGAAATTTTACAGTACTTGCTACATAATAAGTTTTCTTGAAATCTCAATTTTACCTATAAGTTTTTCAGTAGTTGATTGCTCTCAAAATTTGTCTTCATTATTTTTCCAGATGTAGAGAACACAAAAACAAAAGGAGAAAAGAAAAAAAGTAAAAAGAAAAAAAAGAAAGAGTGCTATGTGTTTGAAGGTTTCCGAAACGAGTTCGAGTTTTTTCTATCGACAGAAAATATGTTGATATGAAATTTTTGCTGCCTATTATTTGCTTGAAACCGTGATTCATTTTTAATGTTTTCTTGCAGTTTTTAGCAGCAACTATCTGATTCATTTTTGTAGTTTGAGTTGGA
Coding sequence (CDS)
ATGATGCCTGCGTTAACACAAAAAAATGACCATTTAAATGGTGGGTCATCGGCTATATACTCGCTCTCGGCCAATGGATTTTGGTCCCAGCATCGCGACGATGTTAGCTACAATCAGCTCCAGAAGTTTTGGAGTGACCTGCTGCCCCAAGCTAGGCAGAAACTCCTGAGAATTGACAAGCAAACCCTCTTTGAGCAAGCTCGTAAGAATATGTACTGCTCTAGATGTAATGGTTTGCTGCTTGAAGGATTTTTGCAGATTGTCATATATGGGAAGTCTTTACAACAAGGAAAAACATGTGCAAATCATTCCTGCAACAGATTAGGGGTTTCAAAAAATCATGCATGTGATGGATCATTATCGGTTAATGGGTTTCAAGATGAAATTCAAGATCCGTCCGTCCATCCTTGGGGTGGTTTGACCACAACACGTGATGGGGTGCTGACACTTTTGGATTGCTATTTGTATTCGAAGTCTTTCCTGGGTCTCCAAAATGTCTTTGATAGCGCACGTGCTAGGGAGCGAGAACGAGAGTTGCTTTATCCAGATGCCTGTGGTGGAGGAGGTCGAGGTTGGATAAGTCAGGGAACAGCAAGTTATGGCAGGGGACATGGAACAAGGGAGACATGTGCCTTGCATACTGCTAGGCTTTCTTGTGATACATTGGTTGATTTCTGGTCAGCATTAGGAGAAGAGACTCGGCAATCTCTTCTAAGGATGAAAGAAGAAGATTTTATTGAGAGGCTAATGTACAGGTTTGACAGCAAGAGGTTTTGTAGAGATTGCAGAAGAAATGTGATCCGTGAGTTCAAGGAACTAAAGGAGTTGAAGCGCATAAGGAGAGAGCCTTGCTGCACTAGTTGGTTTTGTGTTGCAGATATGGCATTTAATTATGAGGTATCGGATGACACAATCCAGGCCGATTGGCATCAAACTTTTGCTGACTCTGTGGAGACATATCATTATTATGAGTGGGCCGTTGGAACAGGAGAAGGAAAATCTGACATTCTGGAATTTGAAAATGTTGGCATGAATGGAAGTGTCAAAATAAATGGCCTAGATCTTGGTGCTTTGAGTTCATGTTTTATCACCCTCAGAGCTTGGAAATTGGATGGACGCTGCACTGAGCTCTCAGTGAAAGCTCATGCATTGAAAGGTCAACAATGTGTTCATCGCAGGCTTACAGTTGGTGATGGATTTGTTACAATCACGAGAGGGGAAAATATTAGGAGGTTTTTTGAGCATGCTGAAGAGGCCGAGGAGGAGGAGGAGGACGATTCTCTTGATAAAGATGCAAATGATTTGGATGGAGATTGTTCTCGTCCTCAAAAGCATGCAAAGAGTCCTGAACTTGCTCGGGAGTTTCTTTTGGATGCTGCAACCGTCATCTTTAAAGAACAGGCATGTTATCTGCAATCTTCTTGGCTTGTTGAAAAAGCCTTCAGAGAAGGAACAGCACGCCAAAATGCGCATAGCATCTTTGTGTGTCTTGCACTAAAATTACTGGAAGAACGAGTTCACATAGCATGCAAAGAAATCATTACTCTGGAAAAGCAGATGAAACTTCTTGAAGAAGAAGAGAAGGAAAAGCGTGAAGAACAAGAACGCAAAGAGCGGAAAAGGACAAAAGAAAGAGAGAAGAAGCTCCGGAGAAAAGAAAGATTAAAAGGAAAGGATAAAGATAAGATAAGTTCTGAATCAGCTGAAGCATGTACTCATTCTGATGTCTTGGAGGACTTATCCCCATGTGTTTTGGAGCCAAATCCCAATGCAGTCGGTGAAGTATGTGATGCCAGTGTGCCTGAATCTTCTGATATTCTGGATGAGCTGTTTTTAAATGAATCCATCATTTCAGAAGGGCAAAATTCATATGATGATAGTTTTGATGGGAGACTTACCGATGGAAATGAGTCTTTCATAGGTGATCAATCTAAAGTTTCTCGCTGGAGATTAAAATTTCCAAAGGAAGTTCAAGATCATTCTTTCAAGTGGTCTGAGAG
GCGCCGATTTATGGTTGTTTCAGAAAATGGGGTGCTGGTTAACAAATCTGAGCAAAGATATTATGCCGATAGTTTGGAAAATCCTTCCAGGAGTATGAATGGATCAAACAGGAAATTAAGAACAAACTCATTAAAGGCCTATGGTCGACATGTCTCTAAGTTTAATGAGAAGTTGCACTCTTCCAACAACCGGGTATCTTATGACTACCGTTCCTGCATCTGTAACCAAACTAATGAATTTAACAAAAAGGTGGAACCATTTGTTTCTTCAGTTAGAGTTAACAGAGATGTCAAATCTGTGAGCAAGTCAGAATCTTCATTTGATATGTCCAAGCAAAGTTTTCGTTCTAACAAGTACAGTTATGGAGATCATTCTCGTGATAGCGGAAGACTGAAAAACAAAGCTGCTTTATTAAACAATTCTCCTAGTAAAGATTTTGTTTATTCAAAGAAAGTTTGGGAGCCCATGGAATCACAGAAGAAATATCCTAGAAGTAACTCAGACTCAAATGTTGCATTGAAGTCTTCAACTTTCAAGTTTGGTGTGGAACCTGATTATGACCCTTTGAAGTCGAGGCATGAATTTTGCAGTGGCGAAGTTAGTGTAACTTCTGGTACAGTTGATCCAGAGGAGAGTAATTCCACTGAATCAACTTCTGGTATTGAATCAGATGAAGTCTTCCAAAATGGACTTTCTATCGAATCGAAGGATCATAAAAACGTAGAAGAAGATGCATGTGAGGAGGTAACACAGTGTTCTGTAAATTCAACCATGGACATGACATTGACATCCAGTGGGACCAGTAACCCAGTAGGAACTAGCTCTTTAAATTCTGATAACTGCTCATCATGCCTGAGTGAAGGAGACAGTAATACTATCTGCTCGAACCATGGAAATTTAGAATCCTCCTCCACATCAGACTCAGAATATGCTAGCCATCAATCAGAAGGAAAAGAATCTTCAGCATCCATTCAGAATGGCTTCTCTGAGCATCATGAGATAAGGATAGATAAAGTAATTGGAGGTGATGCCATGGGGAGCACGAACCATTCCGGACTTTCTCATGATAATGGGGGATGTAAAGTTCAAGGGAATGCATCCAAAAATGTTCCTCAGAACTTCGAAGCTGGATTCTCTGCTGTTAGTCTGGACTCCCCATGTCAAGTGACTCTTCCTTCGATTCAGAACCAAAATATTCACTTTCCAGTGTTTCAAGTTCCACCATCAATGAGTTATTACCATCAAAACTCAGTTTCATGGCCAGCAGCTGCCCATGCAAATGGAATGATGCCTTTCTCCTATTCAAATCACTGTCTATATGCCAATCCTCTTGGGTATGGTTTGAACGGCAACCCACGCTTCTGCATGCAATATGGCCATTTACATCATCTTTCTAATCCTGTATTCAACCCTAGCCCGGTTCCTATTTATCACCCGGCTTCCAAAGCCAGCAATGGTATCTATGCCGAGGATCGAAGTCAGGTCTCCAAATCAGGTGCATTAGCAGAAAGTTCTGTAGCTCATTCAGACGTCGTTGTTACCACTGGACATCCATATGCACTGAGTTCACCACGAAGCGGAGATTGTAAACAAAGTGATACTTCTTCAAAATTGCAAAAGGATAGCTCAAGCTTTTCATTGTTTCATTTTGGAGGGCCTGTTGCACTTTCAACAGGAGGTAAATTAAACCTCACGCCTTCTAAGGAAGACAATGTTGGGGATTTTTCAAGAAATAATGAGGTGGAAGTTGTTGACAATGGTCACGCTTTCAATAAGAAGGAAACTGCCATTGAAGAATACAATTTGTTTGCAGCAAGCAATGGCATGAGGTTCTCATTCTTCTGA
Protein sequence
MMPALTQKNDHLNGGSSAIYSLSANGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQTLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCANHSCNRLGVSKNHACDGSLSVNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELLYPDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMKEEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVSDDTIQADWHQTFADSVETYHYYEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGALSSCFITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEEEEEDDSLDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQACYLQSSWLVEKAFREGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLRRKERLKGKDKDKISSESAEACTHSDVLEDLSPCVLEPNPNAVGEVCDASVPESSDILDELFLNESIISEGQNSYDDSFDGRLTDGNESFIGDQSKVSRWRLKFPKEVQDHSFKWSERRRFMVVSENGVLVNKSEQRYYADSLENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKLHSSNNRVSYDYRSCICNQTNEFNKKVEPFVSSVRVNRDVKSVSKSESSFDMSKQSFRSNKYSYGDHSRDSGRLKNKAALLNNSPSKDFVYSKKVWEPMESQKKYPRSNSDSNVALKSSTFKFGVEPDYDPLKSRHEFCSGEVSVTSGTVDPEESNSTESTSGIESDEVFQNGLSIESKDHKNVEEDACEEVTQCSVNSTMDMTLTSSGTSNPVGTSSLNSDNCSSCLSEGDSNTICSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKVIGGDAMGSTNHSGLSHDNGGCKVQGNASKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMSYYHQNSVSWPAAAHANGMMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPSPVPIYHPASKASNGIYAEDRSQVSKSGALAESSVAHSDVVVTTGHPYALSSPRSGDCKQSDTSSKLQKDSSSFSLFHFGGPVALSTGGKLNLTPSKEDNVGDFSRNNEVEVVDNGHAFNKKETAIEEYNLFAASNGMRFSFF
Homology
BLAST of Lsi01G019330 vs. ExPASy TrEMBL
Match:
A0A1S3B599 (uncharacterized protein LOC103486163 OS=Cucumis melo OX=3656 GN=LOC103486163 PE=4 SV=1)
HSP 1 Score: 2303.5 bits (5968), Expect = 0.0e+00
Identity = 1189/1290 (92.17%), Positives = 1218/1290 (94.42%), Query Frame = 0
Query: 2 MPALTQKNDHLNGGSSAIYSLSANGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 61
MP LTQKNDHLNGGSSAIYSLSA+GFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
Query: 62 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCANHSCNRLGVSKNHACDGSLS 121
TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTC NHSCNRLGVSKN ACDGSLS
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
Query: 122 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELLY 181
VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYL+SKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLHSKSFLGLQNVFDSARARERERELLY 180
Query: 182 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 241
PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 242 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 301
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 302 DDTIQADWHQTFADSVETYHYYEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGALSSC 361
DDTIQADWHQTFADSVETYHY+EW+VGTGEGKSDILEFENVGMNGSVKINGLDLG L+SC
Sbjct: 301 DDTIQADWHQTFADSVETYHYFEWSVGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSC 360
Query: 362 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 421
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 422 EEEDDSLDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQACYLQSSWLVEKAF 481
EEEDDS+DKD+NDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ VEKAF
Sbjct: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ---------VEKAF 480
Query: 482 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR 541
REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR
Sbjct: 481 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR 540
Query: 542 TKEREKKLRRKERLKGKDKDKISSESAEACTHSDVLEDLSPCVLEPNPNAVGEVCDASVP 601
TKEREKKLRRKERLKGKDKDK+SSESAE C SDVLEDLSPCVLEP NAVGEVCD SVP
Sbjct: 541 TKEREKKLRRKERLKGKDKDKLSSESAEVCARSDVLEDLSPCVLEPTSNAVGEVCDTSVP 600
Query: 602 ESSDILDELFLNESIISEGQNSYDDSFDGRLT---DGNESFIGDQSKVSRWRLKFPKEVQ 661
ESSDILDELFLNESIISEGQNS+DDS DG+ T DGNESFI DQSKVSRWRLKFPKEVQ
Sbjct: 601 ESSDILDELFLNESIISEGQNSFDDSLDGKFTDGNDGNESFISDQSKVSRWRLKFPKEVQ 660
Query: 662 DHSFKWSERRRFMVVSENGVLVNKSEQRYYADSLENPSRSMNGSNRKLRTNSLKAYGRHV 721
DH FKWSERRRFMVVSENG+LVNKSEQRY+ DS ENPSRSMNGSNRKLRTNSLKAYGRHV
Sbjct: 661 DHPFKWSERRRFMVVSENGMLVNKSEQRYHPDSSENPSRSMNGSNRKLRTNSLKAYGRHV 720
Query: 722 SKFNEKLHSSNNRVSYDYRSCICNQTNEFNKKVEPFVSSVRVNRDVKSVSKSESSFDMSK 781
SKFNEKLHSSNNRVSYDYRSCICNQTNEFNKK EPFVSSVRVNRDVKSVSKSESSFDMSK
Sbjct: 721 SKFNEKLHSSNNRVSYDYRSCICNQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSK 780
Query: 782 QSFRSNKYSYGDHSRDSGRLKNKAALLNNSPSKDFVYSKKVWEPMESQKKYPRSNSDSNV 841
QS+RSNKYSYGDHSRD+GRLK KAALLNNSP KDFVYSKKVWEPMESQKKYPRSNSDSNV
Sbjct: 781 QSYRSNKYSYGDHSRDNGRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDSNV 840
Query: 842 ALKSSTFKFGVEPDYD-------PLKSRHEFCSGEVSVTSGTVDPEESNSTESTSGIESD 901
ALKSSTFKF EPDYD +KSR FCSGEVSVTSG VD EESNSTESTSGIESD
Sbjct: 841 ALKSSTFKFDAEPDYDVVKSRDGVVKSRDGFCSGEVSVTSGAVDQEESNSTESTSGIESD 900
Query: 902 EVFQNGLSIESKDHKNVEEDACEEVTQCSVNSTMDMTLTSSGTSNPVGTSSLNSDNCSSC 961
+V QN SIESKDHKNVEED C EV QCS NS +D TLTSSGTSN VGTSSLNSDNCSSC
Sbjct: 901 DVSQNENSIESKDHKNVEEDVC-EVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSC 960
Query: 962 LSEGDSNTICSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKVIGGDA 1021
LSEGDSNTI SNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDK IGG+A
Sbjct: 961 LSEGDSNTIGSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKGIGGEA 1020
Query: 1022 MGSTNHSGLSHDNGGCKVQGNASKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVF 1081
GS ++SGL DN GC VQ NA KNVP NFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVF
Sbjct: 1021 RGSRSYSGLPQDNEGCNVQVNAPKNVPHNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVF 1080
Query: 1082 QVPPSMSYYHQNSVSWPAAAHANGMMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHL 1141
QVPPSM+YYHQNSVSWPAAAHANG+MPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHL
Sbjct: 1081 QVPPSMNYYHQNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHL 1140
Query: 1142 SNPVFNPSPVPIYHPASKASNGIYAEDRSQVSKSGALAESSVAHSDVVVTTGHPYALSSP 1201
SNPVFNPSPVPIYHPASKASNGIYAEDR+QVSKSGA++ESSVA+SDV VTTGH YALSSP
Sbjct: 1141 SNPVFNPSPVPIYHPASKASNGIYAEDRTQVSKSGAISESSVANSDVAVTTGHQYALSSP 1200
Query: 1202 RSGDCKQSDTSSKLQKDSSSFSLFHFGGPVALSTGGKLNLTPSKEDNVGDFSRNNEVEVV 1261
SGD KQ+DT SKLQ+DSSSFSLFHFGGPVALSTGGKLNLTPSKED+VGDFSRNNEVEVV
Sbjct: 1201 PSGDLKQNDT-SKLQQDSSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFSRNNEVEVV 1260
Query: 1262 DNGHAFNKKETAIEEYNLFAASNGMRFSFF 1282
DNGHAFN KETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 DNGHAFNMKETAIEEYNLFAASNGMRFSFF 1279
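The percentages on the summary line of an HSP are the match counts divided by the alignment length (which includes gap columns). A small sketch of how those figures could be parsed and recomputed is shown below; the function name `hsp_percentages` and the regular expression are assumptions made for illustration, not part of any BLAST tool.

```python
import re

# Matches "<label> = <matches>/<alignment length>" for the two labels of
# interest; the pattern accepts spelling variants of the second label.
STAT_PATTERN = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(summary_line: str) -> dict:
    """Return {"identity": pct, "positives": pct} rounded to two decimals."""
    result = {}
    for label, matches, length in STAT_PATTERN.findall(summary_line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100.0 * int(matches) / int(length), 2)
    return result


line = "Identity = 1189/1290 (92.17%), Positives = 1218/1290 (94.42%), Query Frame = 0"
print(hsp_percentages(line))  # {'identity': 92.17, 'positives': 94.42}
```

The recomputed values match the percentages reported for the Cucumis melo hit A0A1S3B599 above.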
BLAST of Lsi01G019330 vs. ExPASy TrEMBL
Match:
A0A0A0KZE9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G563700 PE=4 SV=1)
HSP 1 Score: 2300.0 bits (5959), Expect = 0.0e+00
Identity = 1183/1281 (92.35%), Positives = 1213/1281 (94.69%), Query Frame = 0
Query: 2 MPALTQKNDHLNGGSSAIYSLSANGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 61
MP LTQKNDHLNGGSSAIYSLSA+GFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
Query: 62 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCANHSCNRLGVSKNHACDGSLS 121
TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSL QGKTC NHSCNRLGVSKN ACDGSLS
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLHQGKTCVNHSCNRLGVSKNQACDGSLS 120
Query: 122 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELLY 181
VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELLY 180
Query: 182 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 241
PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 242 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 301
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 302 DDTIQADWHQTFADSVETYHYYEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGALSSC 361
DDTIQADW QTFADSVETYHY+EWAVGTGEGKSDILEF+NVGMNGSVKINGLDLG L+SC
Sbjct: 301 DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFDNVGMNGSVKINGLDLGGLNSC 360
Query: 362 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 421
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 422 EEEDDSLDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQACYLQSSWLVEKAF 481
EEEDDS+DKD+NDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ VEKAF
Sbjct: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ---------VEKAF 480
Query: 482 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR 541
REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR
Sbjct: 481 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR 540
Query: 542 TKEREKKLRRKERLKGKDKDKISSESAEACTHSDVLEDLSPCVLEPNPNAVGEVCDASVP 601
TKEREKKLRRKERLKGKDKDK+SSESAE C SDVLEDLS CVLEPN NAVGEVCD+SVP
Sbjct: 541 TKEREKKLRRKERLKGKDKDKLSSESAEVCARSDVLEDLSSCVLEPNSNAVGEVCDSSVP 600
Query: 602 ESSDILDELFLNESIISEGQNSYDDSFDGRLTDGNESFIGDQSKVSRWRLKFPKEVQDHS 661
ESSDILDELFLNESIISEGQNSYDDSFDG+L DGNESFI DQSKVSRWRLKFPKEVQDH
Sbjct: 601 ESSDILDELFLNESIISEGQNSYDDSFDGKLADGNESFISDQSKVSRWRLKFPKEVQDHP 660
Query: 662 FKWSERRRFMVVSENGVLVNKSEQRYYADSLENPSRSMNGSNRKLRTNSLKAYGRHVSKF 721
FKWSERRRFMVVSENG LVNKSEQRY+ADSLENPSRSMNGSNRKLRTNSLKAYGRHVSKF
Sbjct: 661 FKWSERRRFMVVSENGALVNKSEQRYHADSLENPSRSMNGSNRKLRTNSLKAYGRHVSKF 720
Query: 722 NEKLHSSNNRVSYDYRSCICNQTNEFNKKVEPFVSSVRVNRDVKSVSKSESSFDMSKQSF 781
NEKLHSSNNR+SYDYRSCICNQ NEFNKK EPFVSSVRVNRDVKSVSKSESSFDMSKQS+
Sbjct: 721 NEKLHSSNNRMSYDYRSCICNQANEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSY 780
Query: 782 RSNKYSYGDHSRDSGRLKNKAALLNNSPSKDFVYSKKVWEPMESQKKYPRSNSDSNVALK 841
RSNKYSYGDHSRD+GRLK K ALLNNSP KDFVYSKKVWEPMESQKKYPRSNSD+NVALK
Sbjct: 781 RSNKYSYGDHSRDNGRLKTKPALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDTNVALK 840
Query: 842 SSTFKFGVEPDYDPLKSR-HEFCSGEVSVTSGTVDPEESNSTESTSGIESDEVFQNGLSI 901
SSTFKF EPDYD +KSR EFCSGEVSVTSG VD EESNSTESTSGIESD+V QN +SI
Sbjct: 841 SSTFKFDAEPDYDVVKSRDEEFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNEISI 900
Query: 902 ESKDHKNVEEDACEEVTQCSVNSTMDMTLTSSGTSNPVGTSSLNSDNCSSCLSEGDSNTI 961
E KDHKNVEED C EV Q S NS +D TLTSSGTSN VGTSSLNSDNCSSCLSEGDSNTI
Sbjct: 901 ELKDHKNVEEDVC-EVKQFSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSNTI 960
Query: 962 CSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKVIGGDAMGSTNHSGL 1021
SNHGNLESSSTSDSEYASHQSEGKES ASIQNGFSEHHEIRIDK IGG+AMGS ++SG
Sbjct: 961 GSNHGNLESSSTSDSEYASHQSEGKESLASIQNGFSEHHEIRIDKGIGGEAMGSRSYSGF 1020
Query: 1022 SHDNGGCKVQGNASKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMSYY 1081
DN GCKVQ NA KNVPQNFEAGFSAVSLDSPCQVTLP IQNQNIHFPVFQVPPSM+YY
Sbjct: 1021 PQDNEGCKVQVNAPKNVPQNFEAGFSAVSLDSPCQVTLP-IQNQNIHFPVFQVPPSMNYY 1080
Query: 1082 HQNSVSWPAAAHANGMMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPSP 1141
HQNSVSWPA AHANG+MPFSYSNHC YANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPSP
Sbjct: 1081 HQNSVSWPAPAHANGIMPFSYSNHCPYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPSP 1140
Query: 1142 VPIYHPASKASNGIYAEDRSQVSKSGALAESSVAHSDVVVTTGHPYALSSPRSGDCKQSD 1201
VP+YHPASK SN IYAEDR+QVSKSGA+AESSV +SDV VTTGHPY LSSP SGD KQ+D
Sbjct: 1141 VPLYHPASKTSNCIYAEDRTQVSKSGAIAESSVVNSDVAVTTGHPYVLSSPPSGDLKQND 1200
Query: 1202 TSSKLQKDSSSFSLFHFGGPVALSTGGKLNLTPSKEDNVGDFSRNNEVEVVDNGHAFNKK 1261
TSSKLQ+DSSSFSLFHFGGPVALSTGGKLNLTPSKED+VGDFSRNNEVEVVDNGHAFN K
Sbjct: 1201 TSSKLQQDSSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFSRNNEVEVVDNGHAFNMK 1260
Query: 1262 ETAIEEYNLFAASNGMRFSFF 1282
ETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 ETAIEEYNLFAASNGMRFSFF 1270
BLAST of Lsi01G019330 vs. ExPASy TrEMBL
Match:
A0A6J1EPP9 (uncharacterized protein LOC111435513 OS=Cucurbita moschata OX=3662 GN=LOC111435513 PE=4 SV=1)
HSP 1 Score: 2246.9 bits (5821), Expect = 0.0e+00
Identity = 1158/1283 (90.26%), Positives = 1200/1283 (93.53%), Query Frame = 0
Query: 2 MPALTQKNDHLNGGSSAIYSLSANGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 61
MP LTQKN HLN GSSAIYSLSANGFWSQHRDDVSYNQLQKFW +LLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNYHLNCGSSAIYSLSANGFWSQHRDDVSYNQLQKFWIELLPQARQKLLRIDKQ 60
Query: 62 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCANHSCNRLGVSKNHACDGSLS 121
TLFEQARKNMYCSRCNGLLLEGFLQIV+YGKSLQQGKT NH+CNRLGVSKN A DG+L+
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVLYGKSLQQGKTRVNHACNRLGVSKNQAGDGALT 120
Query: 122 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELLY 181
VNGF+DEIQDPSVHPWGGLTTTRDG+LTLLDCYL SKSFL LQNVFDSARARERERELLY
Sbjct: 121 VNGFEDEIQDPSVHPWGGLTTTRDGLLTLLDCYLCSKSFLDLQNVFDSARARERERELLY 180
Query: 182 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 241
PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTL+DFWSALGEETR SLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLIDFWSALGEETRLSLLRMK 240
Query: 242 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 301
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 302 DDTIQADWHQTFADSVETYHYYEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGALSSC 361
DDTIQADWHQTFADSVETYHY+EWAVGTGEGKSDILEFENVGMNGSVK+NGLDLG L+SC
Sbjct: 301 DDTIQADWHQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 362 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 421
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGE+IRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGESIRRFFEHAEEAEE 420
Query: 422 EEEDDSLDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQACYLQSSWLVEKAF 481
EEEDDS+DKDAN LDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ VEKAF
Sbjct: 421 EEEDDSMDKDANGLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ---------VEKAF 480
Query: 482 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR 541
REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREE+ERKERKR
Sbjct: 481 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKR 540
Query: 542 TKEREKKLRRKERLKG--KDKDKISSESAEACTHSDVLEDLSPCVLEPNPNAVGEVCDAS 601
TKEREKKLRRKERLKG KDKDKISSESAEAC HSDVLEDLSPC LEPN +AVGEVCDAS
Sbjct: 541 TKEREKKLRRKERLKGKEKDKDKISSESAEACAHSDVLEDLSPCDLEPNSDAVGEVCDAS 600
Query: 602 VPESSDILDELFLNESIISEGQNSYDDSFDGRLTDGNESFIGDQSKVSRWRLKFPKEVQD 661
VPESSD +ELFLNESIISEGQNSYDDSFDG+L DGNESFIGDQSKVSRWRLKFPKEVQD
Sbjct: 601 VPESSDTFNELFLNESIISEGQNSYDDSFDGKLGDGNESFIGDQSKVSRWRLKFPKEVQD 660
Query: 662 HSFKWSERRRFMVVSENGVLVNKSEQRYYADSLENPSRSMNGSNRKLRTNSLKAYGRHVS 721
HSFKWSERRR M VSENG LVN+SEQRYYADS ENPSRSMN SNRKLRTNSLKAYGRHVS
Sbjct: 661 HSFKWSERRRSM-VSENGALVNRSEQRYYADSSENPSRSMNASNRKLRTNSLKAYGRHVS 720
Query: 722 KFNEKLHSSNNRVSYDYRSCICNQTNEFNKKVEPFVSSVRVNRDVKSVSKSESSFDMSKQ 781
KFNEK+HSSNN VSYDYRSC+CNQ NEFNKK EPFVSSVR NRDVKS SKSES FDMSKQ
Sbjct: 721 KFNEKMHSSNNWVSYDYRSCVCNQNNEFNKKAEPFVSSVRFNRDVKSASKSESLFDMSKQ 780
Query: 782 SFRSNKYSYGDHSRDSGRLKNKAALLNNSPSKDFVYSKKVWEPMESQKKYPRSNSDSNVA 841
S+RSNK+SYGD+SRDSGRLKNKAALLNNSP KDFVYSKKVWEPMESQKKYPRSNSDSNVA
Sbjct: 781 SYRSNKFSYGDYSRDSGRLKNKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDSNVA 840
Query: 842 LKSSTFKFGVEPDYDPLKSRHEFCSGEVSVTSGTVDPEESNSTESTSGIESDEVFQNGLS 901
LKSSTFKFGVEPDYD +KSRHE CSGEVSV SGTVD EESNSTESTS IESD+VFQNGL
Sbjct: 841 LKSSTFKFGVEPDYDLVKSRHECCSGEVSVASGTVDQEESNSTESTSVIESDDVFQNGLP 900
Query: 902 IESKDHKNVEEDACEEVTQCSVNSTMDMTLTSSGTSNPVGTSSLNSDNCSSCLSEGDSNT 961
IE KDHKNVEEDACEEVT CSVNST+DM +TS GTSN GTSSLNSDNCSSC SEGDSNT
Sbjct: 901 IELKDHKNVEEDACEEVTPCSVNSTVDMKMTSCGTSNQAGTSSLNSDNCSSCPSEGDSNT 960
Query: 962 ICSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKVIGGDAMGSTNHSG 1021
ICSNHGNLESSSTSDSEYASHQSEGKESSASIQ GFSEHHEIR+DK IGGDAMGSTN SG
Sbjct: 961 ICSNHGNLESSSTSDSEYASHQSEGKESSASIQYGFSEHHEIRMDKAIGGDAMGSTNCSG 1020
Query: 1022 LSHDNGGCKVQGNASKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMSY 1081
LS DN GCKVQG A KNVPQNFEAGFSAV+LDSPC VTLPS+QNQN+HFPVFQVPPSM Y
Sbjct: 1021 LSQDNEGCKVQGKAPKNVPQNFEAGFSAVNLDSPCHVTLPSVQNQNVHFPVFQVPPSMGY 1080
Query: 1082 YHQNSVSWPAAAHANGMMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPS 1141
YHQNSVSWPAA HANG+MPFSYSNHC+YANPLGYGLNGNPRFCM+YGHLHHL+NPVFNPS
Sbjct: 1081 YHQNSVSWPAAVHANGIMPFSYSNHCVYANPLGYGLNGNPRFCMRYGHLHHLANPVFNPS 1140
Query: 1142 PVPIYHPASKASNGIYAEDRSQVSKSGALAESSVAHSDVVVTTGHPYALSSPRSGDCKQS 1201
PVPIY PA+KASNGI+ EDR+QVSKSGA+ ESS A+ DVVVT+G PYALSSP SGDCKQ+
Sbjct: 1141 PVPIYQPAAKASNGIFVEDRTQVSKSGAITESSAANPDVVVTSGLPYALSSPPSGDCKQN 1200
Query: 1202 DTSSKLQKDSSSFSLFHFGGPVALST-GGKLNLTPSKEDNVGDFSRNNEVEVVDNGHAFN 1261
DTSSKLQKDSSSFSLFHFGGPVALST GGKLNL PSKED NNEVEVV NGH FN
Sbjct: 1201 DTSSKLQKDSSSFSLFHFGGPVALSTGGGKLNLMPSKED-------NNEVEVVGNGHGFN 1260
Query: 1262 KKETAIEEYNLFAASNGMRFSFF 1282
KKETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 KKETAIEEYNLFAASNGMRFSFF 1266
BLAST of Lsi01G019330 vs. ExPASy TrEMBL
Match:
A0A6J1HVV4 (uncharacterized protein LOC111467149 OS=Cucurbita maxima OX=3661 GN=LOC111467149 PE=4 SV=1)
HSP 1 Score: 2242.2 bits (5809), Expect = 0.0e+00
Identity = 1157/1294 (89.41%), Positives = 1200/1294 (92.74%), Query Frame = 0
Query: 2 MPALTQKNDHLNGGSSAIYSLSANGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 61
MP LTQKN HLN GSSAIYSLSANGFWSQHRDDVSYNQLQKFW +LLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNYHLNCGSSAIYSLSANGFWSQHRDDVSYNQLQKFWIELLPQARQKLLRIDKQ 60
Query: 62 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCANHSCNRLGVSKNHACDGSLS 121
TLFEQARKNMYCSRCNGLLLEGFLQIV+YGKSLQQGKT NH+CNRLGVSKN A DG+L+
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVLYGKSLQQGKTRVNHACNRLGVSKNQAGDGALT 120
Query: 122 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELLY 181
VNGF+DEIQDPSVHPWGGLTTTRDG+LTLLDCYL SKSFL LQNVFDSARARERERELLY
Sbjct: 121 VNGFEDEIQDPSVHPWGGLTTTRDGLLTLLDCYLCSKSFLDLQNVFDSARARERERELLY 180
Query: 182 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 241
PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETR SLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRLSLLRMK 240
Query: 242 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 301
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 302 DDTIQADWHQTFADSVETYHYYEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGALSSC 361
DDTIQADWH TFADSVETYHY+EWAVGTGEGKSDILEFENVGMNGSVK+NGLDLG L+SC
Sbjct: 301 DDTIQADWHHTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 362 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 421
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGE+IRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGESIRRFFEHAEEAEE 420
Query: 422 EEEDDSLDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQACYLQSSWLVEKAF 481
EEEDDS+DKDAN LDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ VEKAF
Sbjct: 421 EEEDDSMDKDANGLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ---------VEKAF 480
Query: 482 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR 541
REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQ+KLLEEEEKEKREE+ERKERKR
Sbjct: 481 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQVKLLEEEEKEKREEKERKERKR 540
Query: 542 TKEREKKLRRKERLKG--KDKDKISSESAEACTHSDVLEDLSPCVLEPNPNAVGEVCDAS 601
TKEREKKLRRKERLKG KDKDKISSESAEAC HSDVLEDLSPC LEPN +AVGEVCDAS
Sbjct: 541 TKEREKKLRRKERLKGKEKDKDKISSESAEACAHSDVLEDLSPCDLEPNSDAVGEVCDAS 600
Query: 602 VPESSDILDELFLNESIISEGQNSYDDSFDGRL------------TDGNESFIGDQSKVS 661
VPESSD +ELFLN+SIISEGQNSYDDSFDG+L DGNESFIGDQSKVS
Sbjct: 601 VPESSDTFNELFLNQSIISEGQNSYDDSFDGKLGDGNDGNDGNDGNDGNESFIGDQSKVS 660
Query: 662 RWRLKFPKEVQDHSFKWSERRRFMVVSENGVLVNKSEQRYYADSLENPSRSMNGSNRKLR 721
RWRLKFPKEVQDHSFKWSERRR M VSENG L N+SEQRYYADSLENPSRSMN SNRKLR
Sbjct: 661 RWRLKFPKEVQDHSFKWSERRRSM-VSENGALANRSEQRYYADSLENPSRSMNASNRKLR 720
Query: 722 TNSLKAYGRHVSKFNEKLHSSNNRVSYDYRSCICNQTNEFNKKVEPFVSSVRVNRDVKSV 781
TNSLKAYGRHVSKFNEK+HSSNN VSYDYRSC+CNQ NEFNKK EPFVSSVRVNRD KS
Sbjct: 721 TNSLKAYGRHVSKFNEKMHSSNNWVSYDYRSCVCNQNNEFNKKAEPFVSSVRVNRDAKSA 780
Query: 782 SKSESSFDMSKQSFRSNKYSYGDHSRDSGRLKNKAALLNNSPSKDFVYSKKVWEPMESQK 841
SKSES FDMSKQS+R NK+SYGD+SRDSGRLKNKAALLNNSP KDFVYSKKVWEPMESQK
Sbjct: 781 SKSESLFDMSKQSYRPNKFSYGDYSRDSGRLKNKAALLNNSPGKDFVYSKKVWEPMESQK 840
Query: 842 KYPRSNSDSNVALKSSTFKFGVEPDYDPLKSRHEFCSGEVSVTSGTVDPEESNSTESTSG 901
KYPRSNSDSNVALKSSTFKFGVEPDY+ +KSRHE CSGEVSV SGTVD EESNSTESTS
Sbjct: 841 KYPRSNSDSNVALKSSTFKFGVEPDYELVKSRHECCSGEVSVASGTVDQEESNSTESTSV 900
Query: 902 IESDEVFQNGLSIESKDHKNVEEDACEEVTQCSVNSTMDMTLTSSGTSNPVGTSSLNSDN 961
IESDEVFQNGL IESKDHKNVE+DACEEVT CSVN T+DM +TSSGTSN GTSSLNSDN
Sbjct: 901 IESDEVFQNGLPIESKDHKNVEDDACEEVTPCSVNLTVDMKMTSSGTSNQAGTSSLNSDN 960
Query: 962 CSSCLSEGDSNTICSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKVI 1021
CSSC SEGDSNTICSNHGNLESSSTSDSEYASHQSEGKESSASIQ GFSEHHEIR+DK I
Sbjct: 961 CSSCPSEGDSNTICSNHGNLESSSTSDSEYASHQSEGKESSASIQYGFSEHHEIRMDKAI 1020
Query: 1022 GGDAMGSTNHSGLSHDNGGCKVQGNASKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIH 1081
GGDA+GSTN SGLS DN GCKVQGNA KNVPQNFEAGFSAV+LDSPC VTLPS+QNQN+H
Sbjct: 1021 GGDALGSTNSSGLSQDNEGCKVQGNAPKNVPQNFEAGFSAVNLDSPCHVTLPSVQNQNVH 1080
Query: 1082 FPVFQVPPSMSYYHQNSVSWPAAAHANGMMPFSYSNHCLYANPLGYGLNGNPRFCMQYGH 1141
FPVFQVPPSM YYHQNSVSWPAA HANG+MPFSYSNHCLYANPLGYGLNGNPRFCM+YGH
Sbjct: 1081 FPVFQVPPSMGYYHQNSVSWPAAVHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMRYGH 1140
Query: 1142 LHHLSNPVFNPSPVPIYHPASKASNGIYAEDRSQVSKSGALAESSVAHSDVVVTTGHPYA 1201
LHHL+NPVFNPSPVPIY PA+KASNGI+ EDR+QVSKSGA+ ESSVA+ DVVVTTG PYA
Sbjct: 1141 LHHLANPVFNPSPVPIYQPAAKASNGIFVEDRTQVSKSGAITESSVANPDVVVTTGLPYA 1200
Query: 1202 LSSPRSGDCKQSDTSSKLQKDSSSFSLFHFGGPVALSTGGKLNLTPSKEDNVGDFSRNNE 1261
LSSP SGDCKQ+DTSSKLQKDSSSFSLFHFGGPVALSTGGKLN PSKED NNE
Sbjct: 1201 LSSPPSGDCKQNDTSSKLQKDSSSFSLFHFGGPVALSTGGKLNPMPSKED-------NNE 1260
Query: 1262 VEVVDNGHAFNKKETAIEEYNLFAASNGMRFSFF 1282
VEVV NGH FNKKETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 VEVVGNGHGFNKKETAIEEYNLFAASNGMRFSFF 1277
BLAST of Lsi01G019330 vs. ExPASy TrEMBL
Match:
A0A6J1DQ45 (uncharacterized protein LOC111022059 OS=Momordica charantia OX=3673 GN=LOC111022059 PE=4 SV=1)
HSP 1 Score: 2207.2 bits (5718), Expect = 0.0e+00
Identity = 1141/1287 (88.66%), Positives = 1188/1287 (92.31%), Query Frame = 0
Query: 2 MPALTQKNDHLNGGSSAIYSLSANGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 61
MP LTQKND LNGGSSAIYSLS NGFWSQ RDDVSYNQLQKFWS+L P RQKLLRIDKQ
Sbjct: 1 MPGLTQKNDQLNGGSSAIYSLSPNGFWSQQRDDVSYNQLQKFWSELPPHTRQKLLRIDKQ 60
Query: 62 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCANHSCNRLGVSKNHACDGSLS 121
TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTC NHSCNRLGVSKNH CDGSLS
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNHTCDGSLS 120
Query: 122 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELLY 181
VNGFQDEIQDPSVHPWGGLTTTRDG+LTLLDCYL SKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFQDEIQDPSVHPWGGLTTTRDGLLTLLDCYLCSKSFLGLQNVFDSARARERERELLY 180
Query: 182 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 241
PDACGGGGRGWISQGTA +GRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTAGFGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 242 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 301
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAF+YEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFHYEVS 300
Query: 302 DDTIQADWHQTFADSVETYHYYEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGALSSC 361
DDT+QADW QTFADSVETYHY+EWAVGTGEGKSDILEFENVGMNGSVK+NGLDLG L+SC
Sbjct: 301 DDTVQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 362 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 421
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRL+VGDGFVTITRGENIRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLSVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 422 EEEDDSLDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQACYLQSSWLVEKAF 481
EEEDDS+DKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ VEKAF
Sbjct: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ---------VEKAF 480
Query: 482 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR 541
REGTARQNAHSIFVCLALKLLEER+HIACKEIITLEKQMKLLEEEEKEKREE+ERKERKR
Sbjct: 481 REGTARQNAHSIFVCLALKLLEERIHIACKEIITLEKQMKLLEEEEKEKREEKERKERKR 540
Query: 542 TKEREKKLRRKERLKG--KDKDKISSESAEACTHSDVLEDLSPCVLEPNPNAVGEVCDAS 601
TKEREKKLRRKERLKG KDKDK SESAE C HSDVLEDLSPCVLEPN ++VG+ CDAS
Sbjct: 541 TKEREKKLRRKERLKGKEKDKDKTCSESAEVCAHSDVLEDLSPCVLEPNSDSVGDACDAS 600
Query: 602 VPESSDILDELFLNESIISEGQNSYDDSFDGRLT---DGNESFIGDQSKVSRWRLKFPKE 661
+PESSD+LDE FL+ESIISE QNSYDDSFDG+ T DGNESFI DQSK SRWRLKFPKE
Sbjct: 601 MPESSDMLDEQFLDESIISEVQNSYDDSFDGKPTDGNDGNESFIVDQSKFSRWRLKFPKE 660
Query: 662 VQDHSFKWSERRRFMVVSENGVLVNKSEQRYYADSLENPSRSMNGSNRKLRTNSLKAYGR 721
VQD SFKWSERRRF VVSENG LVN+SEQRYY DSLENPSRSMNG+NRKLR+NS+KAYGR
Sbjct: 661 VQDQSFKWSERRRFTVVSENGALVNRSEQRYYGDSLENPSRSMNGTNRKLRSNSIKAYGR 720
Query: 722 HVSKFNEKLHSSNNRVSYDYRSCICNQTNEFNKKVEPFVSSVRVNRDVKSVSKSESSFDM 781
H SKFNEKLHSSNNRVS DYRSCIC+Q NEFNKKVE FVSSVRVNRD KSVSKSESSFDM
Sbjct: 721 HGSKFNEKLHSSNNRVSXDYRSCICSQNNEFNKKVEXFVSSVRVNRDAKSVSKSESSFDM 780
Query: 782 SKQSFRSNKYSYGDHSRDSGRLKNKAALLNNSPSKDFVYSKKVWEPMESQKKYPRSNSDS 841
SKQS+RSNKY YGD SRDSGRLKNKAAL NNSP KDFVYSKKVWEPMESQKKYPRSNSD
Sbjct: 781 SKQSYRSNKYGYGDQSRDSGRLKNKAALSNNSPGKDFVYSKKVWEPMESQKKYPRSNSDP 840
Query: 842 NVALKSSTFKFGVEPDYDPLKSRHEFCSGEVSVTSGTVDPEESNSTESTSGIESDEVFQN 901
NVA+KSSTFKFGVEPDYD KSRH+ CSGEVSV SG VD EESNSTESTSGIESDEVFQN
Sbjct: 841 NVAMKSSTFKFGVEPDYDLAKSRHDVCSGEVSVASGKVDQEESNSTESTSGIESDEVFQN 900
Query: 902 GLSIESKDHKNVEEDACEEVTQCSVNSTMDMTLTSSGTSNPVGTSSLNSDNCSSCLSEGD 961
GL E KDHKNVEEDACEE TQCS+NST++ TL SSG +N VGTSSL+SDNCSSCLSEGD
Sbjct: 901 GLPTEPKDHKNVEEDACEEATQCSINSTINSTLRSSGKNNHVGTSSLSSDNCSSCLSEGD 960
Query: 962 SNTICSNHGNLESSSTSDSEYASHQ-SEGKESSASIQNGFSEHHEIRIDKVIGGDAMGST 1021
SN ICSNHGNLESSSTSDSE ASHQ SEGKESSASIQNGFSE HEIR+DKV GG++MG+
Sbjct: 961 SNXICSNHGNLESSSTSDSEDASHQSSEGKESSASIQNGFSERHEIRMDKVNGGESMGTR 1020
Query: 1022 NHSGLSHDNGGCKVQGNASKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPP 1081
H GL DN GCKV GNA NVP NFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPP
Sbjct: 1021 IHFGLPQDNEGCKVLGNAPMNVPHNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPP 1080
Query: 1082 SMSYYHQNSVSWPAAAHANGMMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPV 1141
SM YYHQNSVSWP AAHANGMMPFSYSNHCLYANPLGYGL+GNPRFCMQYGHLHHL+ PV
Sbjct: 1081 SMGYYHQNSVSWP-AAHANGMMPFSYSNHCLYANPLGYGLDGNPRFCMQYGHLHHLATPV 1140
Query: 1142 FNPSPVPIYHPASKASNGIYAEDRSQVSKSGALAESS-VAHSDVVVTTGHPYALSSPRSG 1201
FNPSPVPIY PA+KASNGIY EDRSQVSK+GA+AESS VA+ DVVVT G PYAL SP SG
Sbjct: 1141 FNPSPVPIYQPAAKASNGIYVEDRSQVSKAGAIAESSDVANPDVVVTAGLPYALGSPPSG 1200
Query: 1202 DCKQSDTSSKLQKDSSSFSLFHFGGPVALSTGGKLNLTPSKEDNVGDFSRNNEVEVVDNG 1261
DCKQ+DT SKLQK SSSFSLFHFGGPVALSTGGKLNL PSKED+ G F RN+E +VVDNG
Sbjct: 1201 DCKQNDT-SKLQKGSSSFSLFHFGGPVALSTGGKLNLMPSKEDDTGVFPRNSEADVVDNG 1260
Query: 1262 HAFNKKETAIEEYNLFAASNGMRFSFF 1282
HAFNKK+TAIEEYNLFAASNGMRFSFF
Sbjct: 1261 HAFNKKDTAIEEYNLFAASNGMRFSFF 1276
BLAST of Lsi01G019330 vs. NCBI nr
Match:
XP_008442254.1 (PREDICTED: uncharacterized protein LOC103486163 [Cucumis melo])
HSP 1 Score: 2303.5 bits (5968), Expect = 0.0e+00
Identity = 1189/1290 (92.17%), Positives = 1218/1290 (94.42%), Query Frame = 0
Query: 2 MPALTQKNDHLNGGSSAIYSLSANGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 61
MP LTQKNDHLNGGSSAIYSLSA+GFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
Query: 62 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCANHSCNRLGVSKNHACDGSLS 121
TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTC NHSCNRLGVSKN ACDGSLS
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
Query: 122 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELLY 181
VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYL+SKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLHSKSFLGLQNVFDSARARERERELLY 180
Query: 182 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 241
PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 242 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 301
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 302 DDTIQADWHQTFADSVETYHYYEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGALSSC 361
DDTIQADWHQTFADSVETYHY+EW+VGTGEGKSDILEFENVGMNGSVKINGLDLG L+SC
Sbjct: 301 DDTIQADWHQTFADSVETYHYFEWSVGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSC 360
Query: 362 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 421
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 422 EEEDDSLDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQACYLQSSWLVEKAF 481
EEEDDS+DKD+NDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ VEKAF
Sbjct: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ---------VEKAF 480
Query: 482 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR 541
REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR
Sbjct: 481 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR 540
Query: 542 TKEREKKLRRKERLKGKDKDKISSESAEACTHSDVLEDLSPCVLEPNPNAVGEVCDASVP 601
TKEREKKLRRKERLKGKDKDK+SSESAE C SDVLEDLSPCVLEP NAVGEVCD SVP
Sbjct: 541 TKEREKKLRRKERLKGKDKDKLSSESAEVCARSDVLEDLSPCVLEPTSNAVGEVCDTSVP 600
Query: 602 ESSDILDELFLNESIISEGQNSYDDSFDGRLT---DGNESFIGDQSKVSRWRLKFPKEVQ 661
ESSDILDELFLNESIISEGQNS+DDS DG+ T DGNESFI DQSKVSRWRLKFPKEVQ
Sbjct: 601 ESSDILDELFLNESIISEGQNSFDDSLDGKFTDGNDGNESFISDQSKVSRWRLKFPKEVQ 660
Query: 662 DHSFKWSERRRFMVVSENGVLVNKSEQRYYADSLENPSRSMNGSNRKLRTNSLKAYGRHV 721
DH FKWSERRRFMVVSENG+LVNKSEQRY+ DS ENPSRSMNGSNRKLRTNSLKAYGRHV
Sbjct: 661 DHPFKWSERRRFMVVSENGMLVNKSEQRYHPDSSENPSRSMNGSNRKLRTNSLKAYGRHV 720
Query: 722 SKFNEKLHSSNNRVSYDYRSCICNQTNEFNKKVEPFVSSVRVNRDVKSVSKSESSFDMSK 781
SKFNEKLHSSNNRVSYDYRSCICNQTNEFNKK EPFVSSVRVNRDVKSVSKSESSFDMSK
Sbjct: 721 SKFNEKLHSSNNRVSYDYRSCICNQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSK 780
Query: 782 QSFRSNKYSYGDHSRDSGRLKNKAALLNNSPSKDFVYSKKVWEPMESQKKYPRSNSDSNV 841
QS+RSNKYSYGDHSRD+GRLK KAALLNNSP KDFVYSKKVWEPMESQKKYPRSNSDSNV
Sbjct: 781 QSYRSNKYSYGDHSRDNGRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDSNV 840
Query: 842 ALKSSTFKFGVEPDYD-------PLKSRHEFCSGEVSVTSGTVDPEESNSTESTSGIESD 901
ALKSSTFKF EPDYD +KSR FCSGEVSVTSG VD EESNSTESTSGIESD
Sbjct: 841 ALKSSTFKFDAEPDYDVVKSRDGVVKSRDGFCSGEVSVTSGAVDQEESNSTESTSGIESD 900
Query: 902 EVFQNGLSIESKDHKNVEEDACEEVTQCSVNSTMDMTLTSSGTSNPVGTSSLNSDNCSSC 961
+V QN SIESKDHKNVEED C EV QCS NS +D TLTSSGTSN VGTSSLNSDNCSSC
Sbjct: 901 DVSQNENSIESKDHKNVEEDVC-EVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSC 960
Query: 962 LSEGDSNTICSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKVIGGDA 1021
LSEGDSNTI SNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDK IGG+A
Sbjct: 961 LSEGDSNTIGSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKGIGGEA 1020
Query: 1022 MGSTNHSGLSHDNGGCKVQGNASKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVF 1081
GS ++SGL DN GC VQ NA KNVP NFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVF
Sbjct: 1021 RGSRSYSGLPQDNEGCNVQVNAPKNVPHNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVF 1080
Query: 1082 QVPPSMSYYHQNSVSWPAAAHANGMMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHL 1141
QVPPSM+YYHQNSVSWPAAAHANG+MPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHL
Sbjct: 1081 QVPPSMNYYHQNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHL 1140
Query: 1142 SNPVFNPSPVPIYHPASKASNGIYAEDRSQVSKSGALAESSVAHSDVVVTTGHPYALSSP 1201
SNPVFNPSPVPIYHPASKASNGIYAEDR+QVSKSGA++ESSVA+SDV VTTGH YALSSP
Sbjct: 1141 SNPVFNPSPVPIYHPASKASNGIYAEDRTQVSKSGAISESSVANSDVAVTTGHQYALSSP 1200
Query: 1202 RSGDCKQSDTSSKLQKDSSSFSLFHFGGPVALSTGGKLNLTPSKEDNVGDFSRNNEVEVV 1261
SGD KQ+DT SKLQ+DSSSFSLFHFGGPVALSTGGKLNLTPSKED+VGDFSRNNEVEVV
Sbjct: 1201 PSGDLKQNDT-SKLQQDSSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFSRNNEVEVV 1260
Query: 1262 DNGHAFNKKETAIEEYNLFAASNGMRFSFF 1282
DNGHAFN KETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 DNGHAFNMKETAIEEYNLFAASNGMRFSFF 1279
BLAST of Lsi01G019330 vs. NCBI nr
Match:
XP_038881990.1 (uncharacterized protein LOC120073308 [Benincasa hispida])
HSP 1 Score: 2303.1 bits (5967), Expect = 0.0e+00
Identity = 1190/1287 (92.46%), Positives = 1217/1287 (94.56%), Query Frame = 0
Query: 2 MPALTQKNDHLNGGSSAIYSLSANGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 61
MP LTQKNDHLNGGSSAIYSLSANGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAIYSLSANGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
Query: 62 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCANHSCNRLGVSKNHACDGSLS 121
TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTC NHSCNRLGVSKN ACDGSLS
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
Query: 122 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELLY 181
VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYL S SFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLCSHSFLGLQNVFDSARARERERELLY 180
Query: 182 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 241
PDACGGGGRGWISQGT SYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTTSYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 242 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 301
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 302 DDTIQADWHQTFADSVETYHYYEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGALSSC 361
DDTIQADW QTFADSVETYHY+EWAVGTGEGKSDILEFENVGMNGSVKINGLDLG LSSC
Sbjct: 301 DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGGLSSC 360
Query: 362 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 421
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 422 EEEDDSLDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQACYLQSSWLVEKAF 481
EEEDDS+DKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ VEKAF
Sbjct: 421 EEEDDSIDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ---------VEKAF 480
Query: 482 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR 541
REGTARQNAHSIFVCL+LKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR
Sbjct: 481 REGTARQNAHSIFVCLSLKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR 540
Query: 542 TKEREKKLRRKERLKGKDKDKISSESAEACTHSDVLEDLSPCVLEPNPNAVGEVCDASVP 601
TKEREKKLRRKERLKGKDKDK+SSESAEAC SDVLEDLSPCVLEP N V EVCDASVP
Sbjct: 541 TKEREKKLRRKERLKGKDKDKLSSESAEACAPSDVLEDLSPCVLEPESNTVSEVCDASVP 600
Query: 602 ESSDILDELFLNESIISEGQNSYDDSFDGRLT---DGNESFIGDQSKVSRWRLKFPKEVQ 661
ESSDILDE+FLNESIISEGQNSYDDSFDG+LT DGNESF DQ KVSRWRLKFPKEVQ
Sbjct: 601 ESSDILDEMFLNESIISEGQNSYDDSFDGKLTEGNDGNESFTSDQFKVSRWRLKFPKEVQ 660
Query: 662 DHSFKWSERRRFMVVSENGVLVNKSEQRYYADSLENPSRSMNGSNRKLRTNSLKAYGRHV 721
DH FKWSERRRFMVVSENGV +NKSEQR Y DSLENPSRS+NGSNRKLRTNSLKAYGRHV
Sbjct: 661 DHPFKWSERRRFMVVSENGV-INKSEQRCYVDSLENPSRSINGSNRKLRTNSLKAYGRHV 720
Query: 722 SKFNEKLHSSNNRVSYDYRSCICNQTNEF-NKKVEPFVSSVRVNRDVKSVSKSESSFDMS 781
SKFNEKLHSSNNRVSYDYRSCICNQTNEF NKK EPFVSSVRVNRDVKS+SKSESSFDMS
Sbjct: 721 SKFNEKLHSSNNRVSYDYRSCICNQTNEFNNKKAEPFVSSVRVNRDVKSLSKSESSFDMS 780
Query: 782 KQSFRSNKYSYGDHSRDSGRLKNKAALLNNSPSKDFVYSKKVWEPMESQKKYPRSNSDSN 841
KQS+RSNKYSYGDHSRD+GRL+NKAA+LNNSP KDFVYSKKVWEPMESQKKYPRSNSDSN
Sbjct: 781 KQSYRSNKYSYGDHSRDNGRLRNKAAILNNSPCKDFVYSKKVWEPMESQKKYPRSNSDSN 840
Query: 842 VALKSSTFKFGVEPDYDPLKSRHEFCSGEVSVTSGTVDPEESNSTESTSGIESDEVFQNG 901
VA KSSTFK G EPDYD +KSRHE SGEVSVTSGTVD EE NSTESTSGIESDEVFQNG
Sbjct: 841 VASKSSTFKCGAEPDYDVMKSRHELSSGEVSVTSGTVDQEEINSTESTSGIESDEVFQNG 900
Query: 902 LSIESKDHKNVEEDAC-EEVTQCSVNSTMDMTLTSSGTSNPVGTSSLNSDNCSSCLSEGD 961
LS ESKDHKN+E+DAC EEVTQCS NSTMDMTL SSGT N VGTSSLNSDNCSSC SEGD
Sbjct: 901 LSNESKDHKNIEDDACEEEVTQCSANSTMDMTLASSGTCNQVGTSSLNSDNCSSCPSEGD 960
Query: 962 SNTICSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKVIGGDAMGSTN 1021
NTICSNHGN+ESSSTSDSEYASHQSEGKESSASIQNGFSE+HEIRIDKVIGGDAMGS N
Sbjct: 961 CNTICSNHGNVESSSTSDSEYASHQSEGKESSASIQNGFSENHEIRIDKVIGGDAMGSKN 1020
Query: 1022 HSGLSHDNGGCKVQGNASKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPS 1081
HSGLS DN GCKVQGNA KNVPQNFEAGFSAVSLDSPCQ+TLPSIQNQNIHFPVFQVPPS
Sbjct: 1021 HSGLSQDNEGCKVQGNAPKNVPQNFEAGFSAVSLDSPCQMTLPSIQNQNIHFPVFQVPPS 1080
Query: 1082 MSYYHQNSVSWPAA--AHANGMMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNP 1141
MSYYHQNSVSWPA AHANG+MPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLH LSNP
Sbjct: 1081 MSYYHQNSVSWPATAHAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHQLSNP 1140
Query: 1142 VFNPSPVPIYHPASKASNGIYAEDRSQVSKSGALAESSVAHSDVVVTTGHPYALSSPRSG 1201
VFNP+PVPIYHPASKASNG YAEDR+QV+KSGA+AESSVA+SDV T GHPYALSSP
Sbjct: 1141 VFNPTPVPIYHPASKASNGTYAEDRNQVTKSGAMAESSVANSDVTGTAGHPYALSSPPGS 1200
Query: 1202 DCKQSDTSSKLQKDSSSFSLFHFGGPVALSTGGKLNLTPSKEDNVGDFSRNNEVEVVDNG 1261
KQ+DTSSKLQKDSSSFSLFHFGGPVA STGGKLNLTPSKED+VGDFSRNNEVEVVDNG
Sbjct: 1201 --KQNDTSSKLQKDSSSFSLFHFGGPVAFSTGGKLNLTPSKEDDVGDFSRNNEVEVVDNG 1260
Query: 1262 HAFNKKETAIEEYNLFAASNGMRFSFF 1282
HAFNKKETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 HAFNKKETAIEEYNLFAASNGMRFSFF 1275
BLAST of Lsi01G019330 vs. NCBI nr
Match:
XP_011653932.2 (uncharacterized protein LOC101210448 [Cucumis sativus] >KAE8649763.1 hypothetical protein Csa_012708 [Cucumis sativus])
HSP 1 Score: 2297.3 bits (5952), Expect = 0.0e+00
Identity = 1183/1288 (91.85%), Positives = 1213/1288 (94.18%), Query Frame = 0
Query: 2 MPALTQKNDHLNGGSSAIYSLSANGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 61
MP LTQKNDHLNGGSSAIYSLSA+GFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
Query: 62 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCANHSCNRLGVSKNHACDGSLS 121
TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSL QGKTC NHSCNRLGVSKN ACDGSLS
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLHQGKTCVNHSCNRLGVSKNQACDGSLS 120
Query: 122 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELLY 181
VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELLY 180
Query: 182 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 241
PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 242 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 301
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 302 DDTIQADWHQTFADSVETYHYYEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGALSSC 361
DDTIQADW QTFADSVETYHY+EWAVGTGEGKSDILEF+NVGMNGSVKINGLDLG L+SC
Sbjct: 301 DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFDNVGMNGSVKINGLDLGGLNSC 360
Query: 362 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 421
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 422 EEEDDSLDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQACYLQSSWLVEKAF 481
EEEDDS+DKD+NDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ VEKAF
Sbjct: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ---------VEKAF 480
Query: 482 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR 541
REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR
Sbjct: 481 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR 540
Query: 542 TKEREKKLRRKERLKGKDKDKISSESAEACTHSDVLEDLSPCVLEPNPNAVGEVCDASVP 601
TKEREKKLRRKERLKGKDKDK+SSESAE C SDVLEDLS CVLEPN NAVGEVCD+SVP
Sbjct: 541 TKEREKKLRRKERLKGKDKDKLSSESAEVCARSDVLEDLSSCVLEPNSNAVGEVCDSSVP 600
Query: 602 ESSDILDELFLNESIISEGQNSYDDSFDGRLTDGNESFIGDQSKVSRWRLKFPKEVQDHS 661
ESSDILDELFLNESIISEGQNSYDDSFDG+L DGNESFI DQSKVSRWRLKFPKEVQDH
Sbjct: 601 ESSDILDELFLNESIISEGQNSYDDSFDGKLADGNESFISDQSKVSRWRLKFPKEVQDHP 660
Query: 662 FKWSERRRFMVVSENGVLVNKSEQRYYADSLENPSRSMNGSNRKLRTNSLKAYGRHVSKF 721
FKWSERRRFMVVSENG LVNKSEQRY+ADSLENPSRSMNGSNRKLRTNSLKAYGRHVSKF
Sbjct: 661 FKWSERRRFMVVSENGALVNKSEQRYHADSLENPSRSMNGSNRKLRTNSLKAYGRHVSKF 720
Query: 722 NEKLHSSNNRVSYDYRSCICNQTNEFNKKVEPFVSSVRVNRDVKSVSKSESSFDMSKQSF 781
NEKLHSSNNR+SYDYRSCICNQ NEFNKK EPFVSSVRVNRDVKSVSKSESSFDMSKQS+
Sbjct: 721 NEKLHSSNNRMSYDYRSCICNQANEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSY 780
Query: 782 RSNKYSYGDHSRDSGRLKNKAALLNNSPSKDFVYSKKVWEPMESQKKYPRSNSDSNVALK 841
RSNKYSYGDHSRD+GRLK K ALLNNSP KDFVYSKKVWEPMESQKKYPRSNSD+NVALK
Sbjct: 781 RSNKYSYGDHSRDNGRLKTKPALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDTNVALK 840
Query: 842 SSTFKFGVEPDYDPLKSR--------HEFCSGEVSVTSGTVDPEESNSTESTSGIESDEV 901
SSTFKF EPDYD +KSR EFCSGEVSVTSG VD EESNSTESTSGIESD+V
Sbjct: 841 SSTFKFDAEPDYDVVKSRDDVVKSRDEEFCSGEVSVTSGAVDQEESNSTESTSGIESDDV 900
Query: 902 FQNGLSIESKDHKNVEEDACEEVTQCSVNSTMDMTLTSSGTSNPVGTSSLNSDNCSSCLS 961
QN +SIE KDHKNVEED C EV Q S NS +D TLTSSGTSN VGTSSLNSDNCSSCLS
Sbjct: 901 SQNEISIELKDHKNVEEDVC-EVKQFSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLS 960
Query: 962 EGDSNTICSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKVIGGDAMG 1021
EGDSNTI SNHGNLESSSTSDSEYASHQSEGKES ASIQNGFSEHHEIRIDK IGG+AMG
Sbjct: 961 EGDSNTIGSNHGNLESSSTSDSEYASHQSEGKESLASIQNGFSEHHEIRIDKGIGGEAMG 1020
Query: 1022 STNHSGLSHDNGGCKVQGNASKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQV 1081
S ++SG DN GCKVQ NA KNVPQNFEAGFSAVSLDSPCQVTLP IQNQNIHFPVFQV
Sbjct: 1021 SRSYSGFPQDNEGCKVQVNAPKNVPQNFEAGFSAVSLDSPCQVTLP-IQNQNIHFPVFQV 1080
Query: 1082 PPSMSYYHQNSVSWPAAAHANGMMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSN 1141
PPSM+YYHQNSVSWPA AHANG+MPFSYSNHC YANPLGYGLNGNPRFCMQYGHLHHLSN
Sbjct: 1081 PPSMNYYHQNSVSWPAPAHANGIMPFSYSNHCPYANPLGYGLNGNPRFCMQYGHLHHLSN 1140
Query: 1142 PVFNPSPVPIYHPASKASNGIYAEDRSQVSKSGALAESSVAHSDVVVTTGHPYALSSPRS 1201
PVFNPSPVP+YHPASK SN IYAEDR+QVSKSGA+AESSV +SDV VTTGHPY LSSP S
Sbjct: 1141 PVFNPSPVPLYHPASKTSNCIYAEDRTQVSKSGAIAESSVVNSDVAVTTGHPYVLSSPPS 1200
Query: 1202 GDCKQSDTSSKLQKDSSSFSLFHFGGPVALSTGGKLNLTPSKEDNVGDFSRNNEVEVVDN 1261
GD KQ+DTSSKLQ+DSSSFSLFHFGGPVALSTGGKLNLTPSKED+VGDFSRNNEVEVVDN
Sbjct: 1201 GDLKQNDTSSKLQQDSSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFSRNNEVEVVDN 1260
Query: 1262 GHAFNKKETAIEEYNLFAASNGMRFSFF 1282
GHAFN KETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 GHAFNMKETAIEEYNLFAASNGMRFSFF 1277
BLAST of Lsi01G019330 vs. NCBI nr
Match:
KAG6603257.1 (hypothetical protein SDJN03_03866, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 2247.2 bits (5822), Expect = 0.0e+00
Identity = 1159/1283 (90.34%), Positives = 1200/1283 (93.53%), Query Frame = 0
Query: 2 MPALTQKNDHLNGGSSAIYSLSANGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 61
MP LTQKN HLN GSSAIYSLSANGFWSQHRDDVSYNQLQKFW +LLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNYHLNCGSSAIYSLSANGFWSQHRDDVSYNQLQKFWIELLPQARQKLLRIDKQ 60
Query: 62 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCANHSCNRLGVSKNHACDGSLS 121
TLFEQARKNMYCSRCNGLLLEGFLQIV+YGKSLQQGKT NH+CNRLGVSKN A DG+L+
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVLYGKSLQQGKTRVNHACNRLGVSKNQAGDGALT 120
Query: 122 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELLY 181
VNGF+DEIQDPSVHPWGGLTTTRDG+LTLLDCYL SKSFL LQNVFDSARARERERELLY
Sbjct: 121 VNGFEDEIQDPSVHPWGGLTTTRDGLLTLLDCYLCSKSFLDLQNVFDSARARERERELLY 180
Query: 182 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 241
PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETR SLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRLSLLRMK 240
Query: 242 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 301
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 302 DDTIQADWHQTFADSVETYHYYEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGALSSC 361
DDTIQADWHQTFADSVETYHY+EWAVGTGEGKSDILEFENVGMNGSVK+NGLDLG L+SC
Sbjct: 301 DDTIQADWHQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 362 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 421
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGE+IRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGESIRRFFEHAEEAEE 420
Query: 422 EEEDDSLDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQACYLQSSWLVEKAF 481
EEEDDS+DKDAN LDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ VEKAF
Sbjct: 421 EEEDDSMDKDANGLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ---------VEKAF 480
Query: 482 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR 541
REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREE+ERKERKR
Sbjct: 481 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKR 540
Query: 542 TKEREKKLRRKERLKG--KDKDKISSESAEACTHSDVLEDLSPCVLEPNPNAVGEVCDAS 601
TKEREKKLRRKERLKG KDKDKISSESAEAC HSDVLEDLSPC LEPN +AVGEVCDAS
Sbjct: 541 TKEREKKLRRKERLKGKEKDKDKISSESAEACAHSDVLEDLSPCDLEPNSDAVGEVCDAS 600
Query: 602 VPESSDILDELFLNESIISEGQNSYDDSFDGRLTDGNESFIGDQSKVSRWRLKFPKEVQD 661
VPESSD +ELFLNESIISEGQNSYDDSFDG+L DGNESFIGDQSKVSRWRLKFPKEVQD
Sbjct: 601 VPESSDTFNELFLNESIISEGQNSYDDSFDGKLGDGNESFIGDQSKVSRWRLKFPKEVQD 660
Query: 662 HSFKWSERRRFMVVSENGVLVNKSEQRYYADSLENPSRSMNGSNRKLRTNSLKAYGRHVS 721
HSFKWSERRR M VSENG LVN+SEQRYYADS ENPSRSMN SNRKLRTNSLKAYGRHVS
Sbjct: 661 HSFKWSERRRSM-VSENGALVNRSEQRYYADSSENPSRSMNASNRKLRTNSLKAYGRHVS 720
Query: 722 KFNEKLHSSNNRVSYDYRSCICNQTNEFNKKVEPFVSSVRVNRDVKSVSKSESSFDMSKQ 781
KFNEK+HSSNN VSYDYRSC+CNQ NEFNKK EPFVSSVR NRDVKS SKSES FDMSKQ
Sbjct: 721 KFNEKMHSSNNWVSYDYRSCVCNQNNEFNKKAEPFVSSVRFNRDVKSASKSESLFDMSKQ 780
Query: 782 SFRSNKYSYGDHSRDSGRLKNKAALLNNSPSKDFVYSKKVWEPMESQKKYPRSNSDSNVA 841
S+RSNK+SYGD+SRDSGRLKNKAALLNNSP KDFVYSKKVWEPMESQKKYPRSNSDSNVA
Sbjct: 781 SYRSNKFSYGDYSRDSGRLKNKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDSNVA 840
Query: 842 LKSSTFKFGVEPDYDPLKSRHEFCSGEVSVTSGTVDPEESNSTESTSGIESDEVFQNGLS 901
LKSSTFKFGVEPDYD +KSRHE CSGEVSV SGTVD EESNSTESTS IESD+VFQNGL
Sbjct: 841 LKSSTFKFGVEPDYDLVKSRHECCSGEVSVASGTVDQEESNSTESTSVIESDDVFQNGLP 900
Query: 902 IESKDHKNVEEDACEEVTQCSVNSTMDMTLTSSGTSNPVGTSSLNSDNCSSCLSEGDSNT 961
IE KDHKNVEEDACEEVT CSVNST+DM +TS GTSN GTSSLNSDNCSSC SEGDSNT
Sbjct: 901 IELKDHKNVEEDACEEVTPCSVNSTVDMKMTSCGTSNQAGTSSLNSDNCSSCPSEGDSNT 960
Query: 962 ICSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKVIGGDAMGSTNHSG 1021
ICSNHGNLESSSTSDSEYASHQSEGKESSASIQ GFSEHHEIR+DK IGGDAMGSTN SG
Sbjct: 961 ICSNHGNLESSSTSDSEYASHQSEGKESSASIQYGFSEHHEIRMDKAIGGDAMGSTNCSG 1020
Query: 1022 LSHDNGGCKVQGNASKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMSY 1081
LS DN GCKVQG A KNVPQNFEAGFSAV+LDSPC VTLPS+QNQN+HFPVFQVPPSM Y
Sbjct: 1021 LSQDNEGCKVQGKAPKNVPQNFEAGFSAVNLDSPCHVTLPSVQNQNVHFPVFQVPPSMGY 1080
Query: 1082 YHQNSVSWPAAAHANGMMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPS 1141
YHQNSVSWPAA HANG+MPFSYSNHC+YANPLGYGLNGNPRFCM+YGHLHHL+NPVFNPS
Sbjct: 1081 YHQNSVSWPAAVHANGIMPFSYSNHCVYANPLGYGLNGNPRFCMRYGHLHHLANPVFNPS 1140
Query: 1142 PVPIYHPASKASNGIYAEDRSQVSKSGALAESSVAHSDVVVTTGHPYALSSPRSGDCKQS 1201
PVPIY PA+KASNGI+ EDR+QVSKSGA+ ESS A+ DVVVT+G PYALSSP SGDCKQ+
Sbjct: 1141 PVPIYQPAAKASNGIFVEDRTQVSKSGAITESSAANPDVVVTSGLPYALSSPPSGDCKQN 1200
Query: 1202 DTSSKLQKDSSSFSLFHFGGPVALST-GGKLNLTPSKEDNVGDFSRNNEVEVVDNGHAFN 1261
DTSSKLQKDSSSFSLFHFGGPVALST GGKLNL PSKED NNEVEVV NGH FN
Sbjct: 1201 DTSSKLQKDSSSFSLFHFGGPVALSTGGGKLNLMPSKED-------NNEVEVVGNGHGFN 1260
Query: 1262 KKETAIEEYNLFAASNGMRFSFF 1282
KKETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 KKETAIEEYNLFAASNGMRFSFF 1266
BLAST of Lsi01G019330 vs. NCBI nr
Match:
XP_022928663.1 (uncharacterized protein LOC111435513 [Cucurbita moschata])
HSP 1 Score: 2246.9 bits (5821), Expect = 0.0e+00
Identity = 1158/1283 (90.26%), Positives = 1200/1283 (93.53%), Query Frame = 0
Query: 2 MPALTQKNDHLNGGSSAIYSLSANGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 61
MP LTQKN HLN GSSAIYSLSANGFWSQHRDDVSYNQLQKFW +LLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNYHLNCGSSAIYSLSANGFWSQHRDDVSYNQLQKFWIELLPQARQKLLRIDKQ 60
Query: 62 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCANHSCNRLGVSKNHACDGSLS 121
TLFEQARKNMYCSRCNGLLLEGFLQIV+YGKSLQQGKT NH+CNRLGVSKN A DG+L+
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVLYGKSLQQGKTRVNHACNRLGVSKNQAGDGALT 120
Query: 122 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELLY 181
VNGF+DEIQDPSVHPWGGLTTTRDG+LTLLDCYL SKSFL LQNVFDSARARERERELLY
Sbjct: 121 VNGFEDEIQDPSVHPWGGLTTTRDGLLTLLDCYLCSKSFLDLQNVFDSARARERERELLY 180
Query: 182 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 241
PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTL+DFWSALGEETR SLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLIDFWSALGEETRLSLLRMK 240
Query: 242 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 301
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 302 DDTIQADWHQTFADSVETYHYYEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGALSSC 361
DDTIQADWHQTFADSVETYHY+EWAVGTGEGKSDILEFENVGMNGSVK+NGLDLG L+SC
Sbjct: 301 DDTIQADWHQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 362 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 421
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGE+IRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGESIRRFFEHAEEAEE 420
Query: 422 EEEDDSLDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQACYLQSSWLVEKAF 481
EEEDDS+DKDAN LDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ VEKAF
Sbjct: 421 EEEDDSMDKDANGLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQ---------VEKAF 480
Query: 482 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKR 541
REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREE+ERKERKR
Sbjct: 481 REGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKR 540
Query: 542 TKEREKKLRRKERLKG--KDKDKISSESAEACTHSDVLEDLSPCVLEPNPNAVGEVCDAS 601
TKEREKKLRRKERLKG KDKDKISSESAEAC HSDVLEDLSPC LEPN +AVGEVCDAS
Sbjct: 541 TKEREKKLRRKERLKGKEKDKDKISSESAEACAHSDVLEDLSPCDLEPNSDAVGEVCDAS 600
Query: 602 VPESSDILDELFLNESIISEGQNSYDDSFDGRLTDGNESFIGDQSKVSRWRLKFPKEVQD 661
VPESSD +ELFLNESIISEGQNSYDDSFDG+L DGNESFIGDQSKVSRWRLKFPKEVQD
Sbjct: 601 VPESSDTFNELFLNESIISEGQNSYDDSFDGKLGDGNESFIGDQSKVSRWRLKFPKEVQD 660
Query: 662 HSFKWSERRRFMVVSENGVLVNKSEQRYYADSLENPSRSMNGSNRKLRTNSLKAYGRHVS 721
HSFKWSERRR M VSENG LVN+SEQRYYADS ENPSRSMN SNRKLRTNSLKAYGRHVS
Sbjct: 661 HSFKWSERRRSM-VSENGALVNRSEQRYYADSSENPSRSMNASNRKLRTNSLKAYGRHVS 720
Query: 722 KFNEKLHSSNNRVSYDYRSCICNQTNEFNKKVEPFVSSVRVNRDVKSVSKSESSFDMSKQ 781
KFNEK+HSSNN VSYDYRSC+CNQ NEFNKK EPFVSSVR NRDVKS SKSES FDMSKQ
Sbjct: 721 KFNEKMHSSNNWVSYDYRSCVCNQNNEFNKKAEPFVSSVRFNRDVKSASKSESLFDMSKQ 780
Query: 782 SFRSNKYSYGDHSRDSGRLKNKAALLNNSPSKDFVYSKKVWEPMESQKKYPRSNSDSNVA 841
S+RSNK+SYGD+SRDSGRLKNKAALLNNSP KDFVYSKKVWEPMESQKKYPRSNSDSNVA
Sbjct: 781 SYRSNKFSYGDYSRDSGRLKNKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDSNVA 840
Query: 842 LKSSTFKFGVEPDYDPLKSRHEFCSGEVSVTSGTVDPEESNSTESTSGIESDEVFQNGLS 901
LKSSTFKFGVEPDYD +KSRHE CSGEVSV SGTVD EESNSTESTS IESD+VFQNGL
Sbjct: 841 LKSSTFKFGVEPDYDLVKSRHECCSGEVSVASGTVDQEESNSTESTSVIESDDVFQNGLP 900
Query: 902 IESKDHKNVEEDACEEVTQCSVNSTMDMTLTSSGTSNPVGTSSLNSDNCSSCLSEGDSNT 961
IE KDHKNVEEDACEEVT CSVNST+DM +TS GTSN GTSSLNSDNCSSC SEGDSNT
Sbjct: 901 IELKDHKNVEEDACEEVTPCSVNSTVDMKMTSCGTSNQAGTSSLNSDNCSSCPSEGDSNT 960
Query: 962 ICSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKVIGGDAMGSTNHSG 1021
ICSNHGNLESSSTSDSEYASHQSEGKESSASIQ GFSEHHEIR+DK IGGDAMGSTN SG
Sbjct: 961 ICSNHGNLESSSTSDSEYASHQSEGKESSASIQYGFSEHHEIRMDKAIGGDAMGSTNCSG 1020
Query: 1022 LSHDNGGCKVQGNASKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMSY 1081
LS DN GCKVQG A KNVPQNFEAGFSAV+LDSPC VTLPS+QNQN+HFPVFQVPPSM Y
Sbjct: 1021 LSQDNEGCKVQGKAPKNVPQNFEAGFSAVNLDSPCHVTLPSVQNQNVHFPVFQVPPSMGY 1080
Query: 1082 YHQNSVSWPAAAHANGMMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPS 1141
YHQNSVSWPAA HANG+MPFSYSNHC+YANPLGYGLNGNPRFCM+YGHLHHL+NPVFNPS
Sbjct: 1081 YHQNSVSWPAAVHANGIMPFSYSNHCVYANPLGYGLNGNPRFCMRYGHLHHLANPVFNPS 1140
Query: 1142 PVPIYHPASKASNGIYAEDRSQVSKSGALAESSVAHSDVVVTTGHPYALSSPRSGDCKQS 1201
PVPIY PA+KASNGI+ EDR+QVSKSGA+ ESS A+ DVVVT+G PYALSSP SGDCKQ+
Sbjct: 1141 PVPIYQPAAKASNGIFVEDRTQVSKSGAITESSAANPDVVVTSGLPYALSSPPSGDCKQN 1200
Query: 1202 DTSSKLQKDSSSFSLFHFGGPVALST-GGKLNLTPSKEDNVGDFSRNNEVEVVDNGHAFN 1261
DTSSKLQKDSSSFSLFHFGGPVALST GGKLNL PSKED NNEVEVV NGH FN
Sbjct: 1201 DTSSKLQKDSSSFSLFHFGGPVALSTGGGKLNLMPSKED-------NNEVEVVGNGHGFN 1260
Query: 1262 KKETAIEEYNLFAASNGMRFSFF 1282
KKETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 KKETAIEEYNLFAASNGMRFSFF 1266
BLAST of Lsi01G019330 vs. TAIR 10
Match:
AT3G58050.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G41960.1); Has 13384 Blast hits to 8116 proteins in 546 species: Archae - 41; Bacteria - 766; Metazoa - 5596; Fungi - 1431; Plants - 589; Viruses - 46; Other Eukaryotes - 4915 (source: NCBI BLink). )
HSP 1 Score: 1114.4 bits (2881), Expect = 0.0e+00
Identity = 692/1342 (51.56%), Positives = 851/1342 (63.41%), Query Frame = 0
Query: 2 MPALTQKNDHLNGGSSAIYSLSANGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 61
MP L Q+N+ YS GFWS+ D VSYNQLQKFWS+L P+ARQ+LL+IDKQ
Sbjct: 1 MPGLAQRNNDQ-------YSF---GFWSKEIDGVSYNQLQKFWSELSPKARQELLKIDKQ 60
Query: 62 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCANHSCNRLGVSK-NHACDGSL 121
TLFEQARKNMYCSRCNGLLLEGFLQIV++GKSL + N CN+ G SK + C+ +
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMHGKSLHPEGSLGNSPCNKSGGSKYQYDCNAVV 120
Query: 122 SVNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELL 181
S NG DE+QDPSVHPWGGLTTTRDG LTLLDCYLY+KS GLQNVFDSA ARERERELL
Sbjct: 121 S-NGCADEMQDPSVHPWGGLTTTRDGSLTLLDCYLYAKSLKGLQNVFDSAPARERERELL 180
Query: 182 YPDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRM 241
YPDACGGGGRGWISQG AS+GRGHGTRETCALHTARLSCDTLVDFWSAL E+TRQSLLRM
Sbjct: 181 YPDACGGGGRGWISQGIASFGRGHGTRETCALHTARLSCDTLVDFWSALSEDTRQSLLRM 240
Query: 242 KEEDFIERLMYR-----------------------------FDSKRFCRDCRRNVIREFK 301
KEEDF+ERL YR FDSKRFCRDCRRNVIREFK
Sbjct: 241 KEEDFMERLRYRICYHSSYHILNCKMNRHFVVWTIQDVLTKFDSKRFCRDCRRNVIREFK 300
Query: 302 ELKELKRIRREPCCTSWFCVADMAFNYEVSDDTIQADWHQTFADSVETYHYYEWAVGTGE 361
ELKELKR+RREP CT+WFCVA+ F YEVS D+++ADW +TF+++ YH++EWA+G+GE
Sbjct: 301 ELKELKRMRREPRCTTWFCVANTTFQYEVSIDSVKADWRETFSENAGKYHHFEWAIGSGE 360
Query: 362 GKSDILEFENVGMNGSVKINGLDLGALSSCFITLRAWKLDGRCTELSVKAHALKGQQCVH 421
GK DIL+FENVGMNG V++NGL+L L+SC+ITLRA+KLDGR +E+S KAHALKGQ CVH
Sbjct: 361 GKCDILKFENVGMNGRVQVNGLNLRGLNSCYITLRAYKLDGRWSEVSAKAHALKGQNCVH 420
Query: 422 RRLTVGDGFVTITRGENIRRFFEHAEEAEEEEEDDSLDKDANDLDGDCSRPQKHAKSPEL 481
RL VGDGFV+I RGE+IRRFFEHAEEAEEEE++D +DKD N+LDG+CSRPQKHAKSPEL
Sbjct: 421 GRLVVGDGFVSIKRGESIRRFFEHAEEAEEEEDEDMMDKDGNELDGECSRPQKHAKSPEL 480
Query: 482 AREFLLDAATVIFKEQACYLQSSWLVEKAFREGTARQNAHSIFVCLALKLLEERVHIACK 541
AREFLLDAATVIFKEQ VEKAFREGTARQNAHSIFVCL LKLLE+ +H+ACK
Sbjct: 481 AREFLLDAATVIFKEQ---------VEKAFREGTARQNAHSIFVCLTLKLLEQHLHVACK 540
Query: 542 EIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLRRKERLKGKDKDKISSESAEAC 601
EIITLEKQ+KLLEEEEKEKREE+ERKE+KR+KEREKKLR+KERLK KDK K + C
Sbjct: 541 EIITLEKQVKLLEEEEKEKREEEERKEKKRSKEREKKLRKKERLKEKDKGK--EKKNPEC 600
Query: 602 THSDVL-------EDLSPCVLEPNPNAVGE-------VCDASVPESSDILDELFLNESII 661
+ D+L EDL E N E D S P S D+ + L+
Sbjct: 601 SDKDMLLNSSREEEDLPNLYDETNNTINSEESEIETGYADLSPPGSPDVQERQCLDGCPS 660
Query: 662 SEGQNSYDDSFD---GRLTDGNESFIGDQSKVSRWRLKFPKEVQ-DHSFKWSERRRFMVV 721
+N Y D D L D N F D K ++ KEVQ D++ +WS++RR+
Sbjct: 661 PRAENHYCDRPDRDIKDLEDENVYFTNDHQKPVHQNARYWKEVQSDNALRWSDKRRY--- 720
Query: 722 SENGVLVNKSEQRYYADSLENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKLHSSNNRVS 781
S+N V++SE RY D LE PSR NGSNR+LR N+ K G + K +EK +NR+S
Sbjct: 721 SDNASFVSRSEARYRNDRLEVPSRGFNGSNRQLRVNASKTGGLNGIKSHEKFQCCDNRIS 780
Query: 782 --YDYRSCICNQTNEFNKKVEPFVSSVRVNRDVKSVSKSESSFDMSKQSFRSNKYSYGDH 841
+D+ SC C + E+ KVEP + R R+ K++S S+S+ D SK F+ N+Y+ D+
Sbjct: 781 ERFDFSSCSCKPSCEYRAKVEPKTAGSRSTREPKTISNSDSALDASKPVFQGNRYTQPDY 840
Query: 842 SRDSGRLKNKAAL-LNNSPSKDFVYSKKVWEPMESQKKYPRSNSDSNVALKSSTFKFGVE 901
+R+ RLK+K + N S ++D ++SK+VWEPME KKYPRSNS S V ++ STFK E
Sbjct: 841 TREL-RLKSKVGVGPNPSTTRDSLHSKQVWEPME-PKKYPRSNSYSEVTVRCSTFK--AE 900
Query: 902 PDYDPLKSRHEFCSGEVSVTSGTVDPEESNSTESTSGIESDEVFQNGLSIESKDHKNVEE 961
D + + NS++ S + E N I+ KD ++E
Sbjct: 901 EIEDAIVA--------------------ENSSDLLSQCKVTEKLDN---IKLKDENSMES 960
Query: 962 DACEEVTQCSVNSTMDMTLTSSGTSNPVGTSSLNSDNCSSCLSEGDSNTICSNHGNLESS 1021
T +P+ +S+ +SDNCSSCLSEG+SNT+ SN+GN ESS
Sbjct: 961 GE---------------TKNGWHLKDPMMSSTSSSDNCSSCLSEGESNTVSSNNGNTESS 1020
Query: 1022 STSDSEYASHQSEGKES-SASIQN--------GFSEHHEIRIDKVIGGDAMGSTNHSGLS 1081
STSDSE AS QSEG+ES QN G S+ E I V+ G+ M + +++ +
Sbjct: 1021 STSDSEDASQQSEGRESIVVGTQNDILIPDTTGKSKIPETPI--VVTGNNMDNNSNNNMV 1080
Query: 1082 HDNGGCKVQGNASKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMSYYH 1141
H + QG P + QN+ +PVFQ M Y+H
Sbjct: 1081 HGLVDVQPQGG------------------------MFPHLLTQNLQYPVFQTASPMGYFH 1140
Query: 1142 Q-NSVSWPAAAHANGMMPFSYSNHCLYANPLGYGLNGNPRFCMQYGH-LHHLSNPVFNPS 1201
Q VSWP ANG++PF + N LY PLGY +NG+P C+QYG L+H + P FNP
Sbjct: 1141 QAPPVSWPTGP-ANGLIPFPHPNPYLYTGPLGYSMNGDPPLCLQYGSPLNHAATPFFNPG 1200
Query: 1202 PVPIYHPASKASNGIYAEDRSQVSKSGALAESSVAHSDVVVTTGHPYALSSPRSGDCKQS 1261
PVP++HP SK + ED++Q L P +C
Sbjct: 1201 PVPVFHPFSKTN----TEDQAQ-------------------------NLEPPLELNCLAP 1209
Query: 1262 DTSSKLQKDSSSFSLFHFGGPVALSTGGKLNLTPSKEDNVGDFSRNNEVEVVDNGHAFNK 1282
+ + +D SFSLFHF GPV LSTG K SK+ + D VV N + K
Sbjct: 1261 PETQTVNED--SFSLFHFSGPVGLSTGSKSKPAHSKDGILRD--------VVGNIYTKAK 1209
BLAST of Lsi01G019330 vs. TAIR 10
Match:
AT2G41960.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G58050.1); Has 11991 Blast hits to 7260 proteins in 458 species: Archae - 17; Bacteria - 481; Metazoa - 5028; Fungi - 1325; Plants - 615; Viruses - 38; Other Eukaryotes - 4487 (source: NCBI BLink). )
HSP 1 Score: 944.9 bits (2441), Expect = 6.8e-275
Identity = 629/1313 (47.91%), Positives = 798/1313 (60.78%), Query Frame = 0
Query: 2 MPAL-TQKNDHLNGGSSAIYSLSANGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDK 61
MP L T N+H S++GFWS+ D ++Y+QL +FWS+L +AR +LLRIDK
Sbjct: 9 MPGLTTHMNEH----------YSSSGFWSEDDDGLTYDQLDQFWSELSSKARHELLRIDK 68
Query: 62 QTLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCANHSCNRLGVSKNHACDGSL 121
QTLFEQARKNM CSRC GLLLEGF QI+ G+ A + +G SK++ S
Sbjct: 69 QTLFEQARKNMCCSRCLGLLLEGFAQILSAGR--------AAYEKRMMGPSKDNC--KSN 128
Query: 122 SVNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELL 181
Q P VH WGGLTTTR G +TLLDC+L +K+F GLQNVF+S RARERERELL
Sbjct: 129 GTRKCTVAYQSPPVHRWGGLTTTRSGCITLLDCFLTAKTFKGLQNVFESNRARERERELL 188
Query: 182 YPDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRM 241
YPDACGGGGR W+SQG A +G+GHGTRETC LHT RLSCDTLVDFWSAL E +RQSLLRM
Sbjct: 189 YPDACGGGGRVWLSQGIAGFGKGHGTRETCNLHTTRLSCDTLVDFWSALEEHSRQSLLRM 248
Query: 242 KEEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEV 301
KEEDF+ERL YRFD K+FCRDCRRNVIREFKELKELKRI+R+P CT WFCVAD AF YEV
Sbjct: 249 KEEDFVERLTYRFDCKKFCRDCRRNVIREFKELKELKRIQRDPRCTDWFCVADTAFQYEV 308
Query: 302 SDDTIQADWHQTFADSVETYHYYEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGALSS 361
D+++ADW Q F ++ YH++EWA+GTGEG+SDILEF+ VG + S ++NGLDL L
Sbjct: 309 DIDSVRADWSQYFTENA-GYHHFEWAIGTGEGESDILEFKYVGNDRSARVNGLDLRGLHE 368
Query: 362 CFITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAE 421
C+ITLRA+K +GR +E+SVKAHAL+GQQCVH RL VGDGFV+I RGE IR FFEHAEEAE
Sbjct: 369 CYITLRAFKKNGRPSEISVKAHALRGQQCVHSRLVVGDGFVSIKRGECIRMFFEHAEEAE 428
Query: 422 EEEEDDSLDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQACYLQSSWLVEKA 481
EEE++ +DKD N+LDG+C RPQKHAKSPELAREFLLDAATVIFKEQ VEKA
Sbjct: 429 EEEDEVLIDKDGNELDGECLRPQKHAKSPELAREFLLDAATVIFKEQ---------VEKA 488
Query: 482 FREGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERK 541
FR+GTARQNAHSIFVCL+ +LLE+RVHIACKEI+TLEKQ KLLEEEEKEKREE+ERKERK
Sbjct: 489 FRDGTARQNAHSIFVCLSSELLEQRVHIACKEIVTLEKQNKLLEEEEKEKREEEERKERK 548
Query: 542 RTKEREKKLRRKERLKGKDKDK--------------ISSESAEACTHSDVLEDLSPCVLE 601
R KEREKKLRRKERLK K+++K I S E + D ED + +
Sbjct: 549 RIKEREKKLRRKERLKEKEREKEQKNPKFSDKAILPIMSREEEGSRNLD--EDTNNTIRC 608
Query: 602 PNPNAVGEVCDASVPESSDILDELFLNESIISEGQNSYDDSFDGRLTDGNESFIGDQSKV 661
D S P S D DE L+ I + DS D + D + +
Sbjct: 609 EESGIENGDVDLSSPGSPDDQDEECLDGCISPRVETHSCDSTDKEIIDHEDENGCFTPRP 668
Query: 662 SRWRLKFPKEVQ-DHSFKWSERRRFMVVSENGVLVNKSEQRYYADSLENPSRSMNGSNRK 721
+ + KEVQ DHS + SE+RRF +E V+ SE Y D LE S NGS++
Sbjct: 669 AHKTARLWKEVQTDHSLRLSEKRRF---TEKTSFVSSSEAGYCNDRLEMSSGHFNGSDKN 728
Query: 722 LRTNSLKAYGR-HVSKFNEKLHSSNNRVS--YDYRSCICNQTNEFNKKVEPFVSSVRVNR 781
+R + KA G + S+ +E+ S+ R YDY SC C N + +KVE S+ R R
Sbjct: 729 VRVKASKAGGSPNSSRSHEEFQCSDGRTGERYDYHSCSCKPINGYREKVESNTSATRGMR 788
Query: 782 DVKSVSKSESSFDMSKQSFRSNKYSYGDHSRDSGRLKNKAALLNNSPSKDFVYSKKVWEP 841
+ KSV KS+S D+SK + R+N+Y+ + R+ +++K N+ D V +KV +
Sbjct: 789 EPKSVFKSDSDLDVSKLN-RANRYTQSGYRRE---IRSKMNNSRNACKMDPVNVRKVLDS 848
Query: 842 MESQKKYPRSNSDSNVALKSSTFKFGVEPDYDPLKSRHEFCSGEVSVTSGTVDPEESNST 901
+E K+ R++S S+V L +T+K + E+ S TV P + S
Sbjct: 849 VE--PKHSRNSSTSDV-LSLTTYK-----------------AEEIKDVSPTVKPAGTPSL 908
Query: 902 ESTSGIESDEVFQNGLSIESKDHKNVEEDACEEVTQCSVNSTMDMTLTSSGTSNPVGTSS 961
+ + F N ++ K M++ +T
Sbjct: 909 CKATDKLGNGSFNNSTEVDKK---------------------MEVHIT------------ 968
Query: 962 LNSDNCSSCLSEGDSNTICSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIR 1021
L +D S S + SN+GN+ESSS SDSE AS QSEG+E+ QN + HE
Sbjct: 969 LKNDYLYS-KDPMMSRSSSSNNGNIESSSMSDSEVASQQSEGRENLVDTQNDMPDCHEKM 1028
Query: 1022 IDKVI-----GGDAMGSTNHSGLSHDNGGCKVQGNASKNVPQNFE---AGFSAVS-LDSP 1081
++KV D + N S L DNG K+ G QN E G + S L P
Sbjct: 1029 VEKVTEMSMDERDVLKIKNISNLPADNGESKLSGTPFMVPSQNMENMVPGLNTGSYLSQP 1088
Query: 1082 CQVTLPSIQNQNIHFPVFQVPPSMSYYHQNSVSWPAAAHANGMMPFSYSNHCLYANPLGY 1141
+ LP + NQ+I PVFQ P +M YYHQ VSW ++A NG+M F + NH +Y PLGY
Sbjct: 1089 QNMILPQMLNQSIPLPVFQAPSTMGYYHQAPVSW-SSASTNGLMQFPHPNHYVYTGPLGY 1148
Query: 1142 GLNGNPRFCMQYG-HLHHLSNPVFNPSPVPIYHPASKASNGIYAEDRSQVSKSGALAESS 1201
LNG CMQYG L+H + P FN PVPI+HP ++ +N + D++Q + L S
Sbjct: 1149 SLNGESPLCMQYGTPLNHSAAPFFNSGPVPIFHPFAE-TNTMNTVDQAQPLE--PLEHSF 1208
Query: 1202 VAHSDVVVTTGHPYALSSPRSGDCKQSDTSSKLQKDSSSFSLFHFGGPVALSTGGKLNLT 1261
+ ++ P + +PR C Q+D+ +FSLFHFGGPVALSTG K N
Sbjct: 1209 LKEANERRFNEMP-LMETPRK-RCPQTDS-------DENFSLFHFGGPVALSTGSKANPA 1215
Query: 1262 PSKEDNVGDFS---RNNEVEVVDNGHAFNKKETAI-EEYNLFAASNGMRFSFF 1282
SK+ + DFS + V G++ +KE + EEYNLFA SN +RFS F
Sbjct: 1269 RSKDGILEDFSLQFSGDHVFGDPTGNSKKEKENTVGEEYNLFATSNSLRFSIF 1215
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A1S3B599 | 0.0e+00 | 92.17 | uncharacterized protein LOC103486163 OS=Cucumis melo OX=3656 GN=LOC103486163 PE=...
A0A0A0KZE9 | 0.0e+00 | 92.35 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G563700 PE=4 SV=1
A0A6J1EPP9 | 0.0e+00 | 90.26 | uncharacterized protein LOC111435513 OS=Cucurbita moschata OX=3662 GN=LOC1114355...
A0A6J1HVV4 | 0.0e+00 | 89.41 | uncharacterized protein LOC111467149 OS=Cucurbita maxima OX=3661 GN=LOC111467149...
A0A6J1DQ45 | 0.0e+00 | 88.66 | uncharacterized protein LOC111022059 OS=Momordica charantia OX=3673 GN=LOC111022...
Match Name | E-value | Identity (%) | Description
XP_008442254.1 | 0.0e+00 | 92.17 | PREDICTED: uncharacterized protein LOC103486163 [Cucumis melo]
XP_038881990.1 | 0.0e+00 | 92.46 | uncharacterized protein LOC120073308 [Benincasa hispida]
XP_011653932.2 | 0.0e+00 | 91.85 | uncharacterized protein LOC101210448 [Cucumis sativus] >KAE8649763.1 hypothetica...
KAG6603257.1 | 0.0e+00 | 90.34 | hypothetical protein SDJN03_03866, partial [Cucurbita argyrosperma subsp. sorori...
XP_022928663.1 | 0.0e+00 | 90.26 | uncharacterized protein LOC111435513 [Cucurbita moschata]
Match Name | E-value | Identity (%) | Description
AT3G58050.1 | 0.0e+00 | 51.56 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G41960.1 | 6.8e-275 | 47.91 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
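The Identity column in the tables above appears to be the identity percentage from each hit's HSP summary line, that is, the number of identical residues divided by the alignment length. The following is a minimal Python sketch for reading that summary line and recomputing the percentages, assuming the line layout shown in this record. The function name `parse_hsp_stats` and the returned keys are hypothetical and not part of any BLAST tool.

```python
import re

# Matches the HSP summary line as printed in this record, e.g.
# "Identity = 692/1342 (51.56%), Positives = 851/1342 (63.41%), Query Frame = 0".
# Assumption: some renderings spell the second label "Postives", so the "i" is optional.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identities, length, positives, _ = (int(g) for g in match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        # Percentages are recomputed from the counts rather than copied from the text.
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / length, 2),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 692/1342 (51.56%), Positives = 851/1342 (63.41%), Query Frame = 0"
    )
    print(stats["identity_pct"], stats["positive_pct"])
```

Recomputing the percentages from the counts gives a quick consistency check between an alignment block and the value reported for the same hit in the summary table.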