Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGAATAAAAAGAGGAAGCTAAAGAGTCCTCAAGAAACCGAGAGGCCTTCCAAGTCTGCTCGTCTTCTTATCTCACCGGAAGAGGAAGTTATAGACGAACCAGAACGGGTTGAGAAGTCGGAACAGAGAGGATTGTGTCAAGAGTCTGAAGAAGGTTGTCCGTGGAGGAACTTGGAGTTGATTTTCTTGATTCAGAACAAAGAGCTCGACCAACAAAAGTTTGTTTCTTATCTTTGCATATGTGTTATATTTTAATTCTTTGGTTTTGGAAGCAGCGTAAGCTTTTCTCCAATATTCTTTGGTATTTCAATGCAGGAAGGTCGAAGCAGTTTTCAGCTTTGTCAACTCAAAGTTGGAAGAGAGAGATAAATATTATGATACGGTAAAAGTGTCGCGCTTGATAGTTTTTCTTAGTGATTGGGTACAGTCATGCTGATTTCATCTGAAAAGAAAGCAAAGAATGATGGTGGAAAACATTGTAACATGGCTATAGAGCCATGCTTAGATTATAGATTCTGGGAGGTTTTTAGATTTTGTTTGGAAGAATCAGTAAAAACACACATTACTTTGAGTCTCTCAAGAAACCTCTTACATGCCTTTTGTTTTGTTACAAGAAATGCGATATCCCTACTAGTTGCATCGTCGAGTTCAAAAGAAGAGTTATTGCTGGGGATTGTTTCAAGTTGTACAATTTTGTGTTTGATTGTGTTTCTTTAGTATTCTCGTCCCATTTTGGTCTATGCAATGAGAATTTAGACGCTTGGATTTCTACCATTGACCAGTGCTTGAGTTTCTTCACAAAATTCATGTAAATAACCTTGAAGGTGGAGATGTAGGCATTTTTGCAATGAAATTTTCCTGTAAGATGCTTGATCCTTTTGCTAAATTTTTGTGGATTCACCCAACCAAAAAAACTGGATTTCACAATTTTGTCAACAAAATTCTGGAACCATTACTGCAATTATTGTGTGACTTAAATCTTAAAGCTGATGGATGCAATCATGGTTGGACAAGAACATGATGAAGTTACTTGAAGAAGTTCTATCTCATGCCTTGTTTCACACAGTGCATATTGATGGATTTCTGTGCTTACATGGTTCAGAGAAGGTTATAAAATCCCATGATGAGAAATTAGAAGAGTCTAAAGCACATATTAAGAGCTATCATAGGCATTTATTTGATAAAGTGCAAAGATTAGTTGCAGGAAAAAAGTTCTTGGCATTGGGTGCAATTGGAGAATTGTTTCGCGTGCTTGTTGTCAGAGTTAAAACGGTGAAAGGAGGTTCAAAGTTTGAGGACACCAAACTGAATAACAAAATGGGTTGTGTGGGACCTTTGAGGGATGATGTATCAAGTGATGCATCTTATGTGCTTCAAGGAAGTGCTGATGTACCTTCAGAAAAAAGCAATTATCTGAGCAGCTTGAGTACAGAAATAAGGAAGTCTCTTTTTGAGTTCTTTGTACAGATTTTGGATCCTTTACTTCATACTATTGACCACATTAGCGCTGAAATTAAACAGGGACCAGCCTTGTCGGATGTTCATTGCCTACTCAAATCAATAAATAATCTACTTGCAAGTTTTATGAAGGAAAAGGTGTACTTGAGAACAGAGGATAACTCTGAAGGGGCTTACTTTAATTTCTTAAAAAAGGTTTATGATACAGTAATGTTGGTGTCCTCTAATTTGCTTTTATTATTAAGACATGAGATAGAAAACAATGTAGACCTGGAAGTTTTTGTCTTAGCAGCCAATGAGATACTAGTGACTGTTGGTTATCTTCTGGAAATTGAATATGATGTGATTGGGAATGACTTGGTGAGCTTATGGCTTGTGATCCTTTCTTACTCTGCTATTAATCTTTCTTTCACAAGCATACCTAAGCAATACTTGTTAACTTCCAAGATTCAAGAACTTGGATGTCAGCTGGTTGTTTTATATGGTCAGCTACGGCAGGTTAGTATTAGTCTTGTATTTCCATCTGAAATTTGTTTCGTTTCTGTCGTGTTATTTCTCTGCCATAATATTTTTTGTTTTATAACTTGTTTGCTATTTTTTTTCTCAATCTGAAATTTAATGTTCAAAACTATGATCACTCCTTTTCTATATGTTTGCTATTGTAGGTCAACATTAGTGTTGTTGCTTTATGTAAAGCCATGAGGACAGTACTATCAAATGAACATGAAACTGAAAAAAGTTATGCTAGCGTTATGACTTCTTTAGGTTATGAAGCTTATGGAAAATCAGTTGGGATGCTACTTTCTTCACAAGAAATTAAATTTGCAATTCATAAAGCTATTAAATATGTACCAGAAGGGCAAGCTAGTGGAATTATTCAGCAGTTAACTGAGGATGTGACAGAGACTTTAGGATGGCTGAGACTTTGTAATTTGAACTTGAATATAAGAAATAGCACAAGCGACTCGACTCTGCAAACTATGTTTCTTGGAAGAGGGTTGTCCGAAATGTATGATCTGATGTTGGATTCACTGATGATCACATCTGGCAATGCTTTCCAAATTGGAACTTCAATTGAGAACTTGATTTCAGTCATACGTCCTTGCATGAGTTGTTTGGTTGGACTGCATTCAGATGGTGCCAAAGCATTCCTTGTTGCTATCATGGGGAAAACACGTGATGGTATGATTGCAGATGAAGATAACTGTCTTGGATTTGGAGTGACCAGTCATTGGGTCTTTGTTTTCTTCTTTCGCTTGTACATGTCCTGCCGGAGCCTGTATAGGCAAGCAATTAGCCTGATGCCCCCTAGTTCATCTAGAAAGATGTCAACAGCAATTGGAGATTCATTCATAGCCTATTCTGCTTGTGATTGGATGCAGAGGACTGATTGGAGTGATGAAGGTTATTTTTCATGGATAATTGAACCTTCAACTTCTGTTCTTGCTGTTGCCCAATCTGTTTCTAGCCTTTACCATCAAGGTACGGATGTAGATTGGTGCCCTTTAGTTTATGTATTGCATACCATGACTCTTCAGAGGTTAGTGGATTTGAACAGGCAGATAGACTCTCTCGAGTATGTGCACCAGAGAAATGAGAATCTGATGCAGGTTGAGGTGCTTGGTGATGATGGCTTGTCAGTGTTACGGAAAAAGAGCAAAAAGTTTGGAAGGCTTCTCTCAGATTTGCGAAAAGAAGCAGCAGATCTTACTGATTTCATGATGAAGCATTTATCTTTAGTAGCTAAACACCACATTTTGAATTCCACTAAAAGAAATGCAACTTCAAACGATAAATCTACTGAAACACTTAGTGAGATTGATGATTGGGACTTTAGTATTTGTAATGTGAATAAAAGGTCATTGCCCACAGCTATTTGGTGGATTGTTTGCCAAAATGTTGATATTTGGGCCAATCACGCTGCTAAAAAGAGGTTAAAATGTTCCTTTCTTTTTTAATACGTACATCCCATCTTTTCTTAACAAGCAATGATACGAAGATTGGAACACAACAAACCGATGGATGCCAGCAGCTAAAAAAGGTGTCTCTGCAACAAATTTCATCAGCTGTTCTAAGTGACCCCATCTTTTATGAACAAAGAGTAAGACACTGAAAAAAATGCTACTCACCCTCTCTCTGGTTCTCAAACGAAACTTTTTATTAGTTAGTTTTGGTTTGCACGTGATAGTTAATGTCTTTTGAAGTTTTCATTGTTGTCAATACCTTTCTGTCTAATTTCTTTTTCAAATTTCCTGGTGTTTATGGAGTTTATTTTATCAGATGATGGCAATACTATCTATTCTATTAGTTTGTCTGCAGGCTTATGCCATCAAGGTTTTGTCGTGAGTTGAAGGCTTCAATATTATCATCTTTTCATGATATTAATATAAGTTCAGCTGATTGGATGGAGAAGATAGCCAAACTTGAGCCCGTTTGCAGTAGCAAACGTATTCCTGACGAGAATTTTCCCCTCACTAAAGCAGTTAGTCACTCATCTGATTTGTTGCAAACAGAAGATTGCAAGTTGAAGAGAGATTCACCTCCAAGCAATGTGAGATCCAGAGCTTGTCAGCATTTGATTGACCTTTTATGTTGGATGCCAAAAGGGAATATTAGTTCAAGATCTTTTTCCCTTTATACGACTTATGTTCTCAACCTTGAAAGGTATACTTTATTTGTATATTTTCCTCGCACTCTCTTCTTATTTATACGACTTATGTTCTCAACCTTGAAAGGCACATCTTGTTATGTGTCATGGCTTCTCAGATCTATGTTATTAATGAGACAAAAAAATTATAGAAAAACAAACAATTATCACTCTAGAGAATCTTGGAAGTTGGTTTGTTTTTTAGCTTCTTCTTGGAGTGTCGTAGTCAATAGCTTTGAAATTTATTTCCTTTTTTTGATTAAATACCAATTGGAGAGTACTTTTATAATTTCCTTTGTTGGCTGAAGAGGTGTTTATTCCCCTCTTTTATATATGTTTCTCTCTTTAACCGTATTGTTGTTTCTAAGCAGGAAATATTTTAGAATCTAGTTTGCAAAAGCAGTTATTGCTTTTTTTAATGTAAAATTTTCCTTCAATTTAATATATATTTTTGTTTTAATAGTATAGTAAGTTTTTTCAGAAACTAGTTGCACGATTACTTTAAAAACAATGGATTCTTATTATTGCATACTTTCAGTAGTGTTATGGGTGGACGATAAACAATCCACGTTCTCCCTTGGGCATTTGGTTAGGCATTATCTCACGCTACTCAGCCTTTTTTTATCAAGTGGCTCACAATATTACTTTTCCTTTGTTATCCAGGCAATTGGTGAATGCTATATTAGATAATCAGACTACTTTATGTTCTCCTAATCAATCTGAGCTCCTCAAATTATTTGTATCTTGCCGGAAGGCTTTGAAATATATATTTATGGCTTCCTATGAGGCTGGCGATAGACAAAGCTCATCTATTCCGGTTCCATCCGAAAATCAATTTCCTGTTTCATGGCTTTTCAAGTCTGTATCAATTGTTAATCAACTTCAAGAAGCTTCTCCAGAAGGTAGCAACAGACAGATTAAGGATATCATCTTTTCATTGATGGATCATACATCATATTTGTTTTTAACAACTAGCAAATATCAATTTAAGAATGCTCTCCGCTTAATTGTAATTGACAACAAACCCTGCAAGGAGCAATGCCAGAACGTTTTCCATGAACTAGTTAATGGAGATGATCTATGTTTAGATCCCACTCACTGTGTGGAAGCTTACAATAGTGCAAGTCAAATGACTATCAGTCTGAAGGAACAGGTGGAGAGTGAATTTATCTCTCTGAAGAAGTCTAATGTTACAGTTGGAGATGGTAAAAACAGTGCCAATATGTGTAAAGTTTATTCTCTAGCTTCTTGCCTTAATGGATTTTTGTGGGGCCTAGCATCTGCCGTGGACCACACTGATTTGAGGAATGGTAGCCGTCACATGAGATCAATGAAGTTGAAATGTGAATATAGTTCCCAACTGAACCTTTGTATAAATGCAATTTCAGAACTATTAGGTCTTCTCTTGGAAATGTTTCTTGATAGAGACAGTGATCGGCCGACAAAATTGTGTGATTATCATACTTCTCAAGACCTTTTGGTTGTGAATGAGGTCTCTGTTAAGGGCTCTAGCTCTGAGGTTGATACTTCATTTAGTAAACACCACAAATTTGAATCTTCTCATAATGATGATGGCACTGAAAATGTTAGCTTTAATAGAAAAAGGTTGAAATTGGAAAATAAAAGCAGTGTTGCCTCTATTCTGAATGAGGCTGACTCAATTGAGCTGCAATCCTTGAACCAACCTGTCTTGCGAGGTCTGCTGAAAGGTTCCTATCCTGAAGTAGCATTTGCACTTAAACAGCTATTTCTTGCTGCTTCTGTCATTTTGAGATTGCATATGAAATATGATACTGTTCCTTTGTCATCAAGTTTTATGGCCATCCTAATTGGCATTTCACGATTATTGTTGCTAGAATTTGTGGACATGGTTGAAGTGCCACAGCCATTTTTGCTTGCTTGCTTGGATGGTGTTCTAAAGTATCTGGAGGAATTAGGCCATCTGTTTCCTTTTTCTGATCCTATGCAATCCAGAAATCTATATTCTAGGCTTATAGACATACAGTTGAAAGCTATAGGAAAGTGCATATCTCTACTGGGAAAAAGAGCTACTTTAGCATCCCACGAGACAGAGTCCACTACAAAGACTCTTGATGGTCACTTAGGTCTATTCGAAGAATCATCTTTTCCTGGAGTCTACCGTATAGATGAATTTAAAACTTCATTGAGAATGTCATTCAAAGTGTTCATAAGGAAAGCCTCGGAGCTGCATCTCTTATCTGCAATTCAGGCTATTGAGAGAGCTCTAGTTGGAGTGCAGGAAGGTTGTACGGCGATATATGAATTATATTCCGGAAGTGAAGATGGGGGAAGGTGTTCCTCTATTGTTGCGGCTGGTGTCGAATGCTTGGATTTGGTTCTTGAATTTGTTTCAGGTAATTTTAAATCAATTTGCTGTTCAAGAGCAAATCTGAACCTGTACCAAGTGCTGTGTTTTTTTGACGCATTTTTTAAGGAGTCAACACAAATAATTCTTCTTCTCTTCTTTTCCAGTGTTTGAAATGGACTATCTTATAAGCTGTTTTCAGCATGTAGCACAGTGTTTCTGCATTTTTAGTTGTATGTTAGTTTACTGTGGTCCATATGTTGCATTACCTGTATGTGAGAGCAATGAAGTCTAAACATTATCATTTCAAATTGATGTTTTTTTTATTTGGTCTCTGTTTTGAAATAATCATTTAGATGCATCTTTTAATCAATATTTTCTTCAAAATAATTGTGTTTGCAACTTAAGCTTCTATTTTTGGGGGCTAATTAACCAATCTTAATAGAATTTCATTTTCCTTAATAGAATTTCATTTCCTTTTCTCACTCTCTCTTTTTTAGGACGTAAATGCTTGGGTGTGGTTAAAAGACACATTCAGAGCTTAATTGCTGGTCTGTTTAGCATAGTTCTGCACTTGCAGAGTCCACAAGTTTTCTATACAAGAATGATTGACACGAAAAACAAGAGTGGTCCAGATCCGGGATCAGTCATTCTAATGTCTGTTGAAGTGCTTACAAGAGTTTCGGGAAAGCATGCTCTTTTCCAAATGAATGCTTGGCATGTAGCACAGTGTTTACGCATTCCTGCAGCACTTTTTGAAGAATTTTCTCTTAAGCTCCTAGGAATTCCCATTCAATCAGATAATTCCCTCATCTCAGCTCAGGAAGTCCCCAATCGAGTTGTAACCACAACTAAATCCATTATAGACAGACAATTCTTAATAGATCTTTTTGCTGCCAGCTGCCGGTTATTATATACCGTTCTCAAGCATCATAAAAGGTATGATTTAGCATCGAGAATTTATTTGTGGAAGGAAATTATTGTTTTCTGTCTCTTCAGCGTGGTTAATTTTTTTCTTTTGTTTGATCATTCTCCATTGGTGGTTTATCCTTGAATCTTGCCTGATGATTGGATACAGTACTGGTTAGTTGATGTTTCACTGCAAGTGATCCTTGAAATTATGTTGTAGTCCATTTAACTTCATAAAGCATCACTTATTCCAGTTTGAATGGTATAACTACATAAAGCATCACTTATTCCAGTTTGAATGGTTTTTACTTCTAATATTGTCTTGTCAAGTTAGAATTATGAAAAGAGTTGCTTTCTGATGTTTAAGTTATTGTTAGTGTAAATAAATTCAGAACCTTGATATTACTCTTCCTCGAGAGGGCTTGGCTTGGTTGATCCATGATTTATGTTTCATCTTTCGTTTCCTTTTCTTTGTTTTGAGAAATATATCGATCAATTTCGATTCTTTCTATTTCTATTGATTCTTTTCTAATCGAGAGGTATGGATCGATCATCCATGGGCCTATGTTGTAATTCGGGGTGATCTCTGCACATTGCTATAGCCGTTTGTTGTTATCTTTCATGTTACAAAGCACGCTAGTTACCTTTTCCTCTACTGTACGTGAGTTTTATGTTTCTTGAATCAAAGTGATATTTGTTGTGGGAGTCAATTTTGATTGTTGTACTGTGCCTTCCCTTCAAACCTTTTAAGGCTTTAATACTACATGATCGAACTAATTTCTTCATTATTCATAGTGAGTGCAAGCGTTGCATTGCTCAACTACAAGCATCAGTGTCTGTTCTTCTTCATTCCCTTGAAAGAGTAGGTCCAGATCCGGAATTTATGGGTGGTTACTTTTCGTGGAAAGTAGATGAGGGAGTAAAGTGTGCTTGTTTCCTCCGAAGGATTTATGAAGAGGTTTGATGTTTCCGTCAATATTTCTTAATAATTTTGTACTACTTGAATTCTGTAGCCTGAATAGCATCTGCATTTATGTAATTCTTTTCTGTTTCATTGCCAAGACTAATTCTTTTCTGTTCCATAGCGAGTTGGGCAATCAAGTCTTCATATGTTTAACTCCTTTTGAACAAATTCTTTCATGTGATTTTATGAGTGTACAGGTTTGTCTGGCTGCTGTAGTTTTATTTTTACCATCATGTATTAATAGGATTAGAGCCATTGGACCCTCTATAATACAATTTTCTTAGCAACAATGGGTATCCGAGATAGAGGATTTTCTTTTTTACCCCGAAGTCAAATGGACTAGTATTTTATTGCAACAACTATATTAGATGGGAAATGTTTATGGAAGGCCTCTTGTGAAGTCCTTGTATTCTTTCCCCAAAAATGTTATTTGTGAACTTTCGACCCTTTAAACCTGCTGGAATTCTATTAATAATAAATGAACTTGTGGTCTGTTACACCATTAAAAACTTTCCTATCAGTTTAGTTATAAATAGACTCTTGGTAATTAGATCAATAATAAAAATTTTCCCTTTATGATGGGATTTGTTAATAGCACATACTATTCAAACTTTCACTCTGATATTAAATAAAGTTGTATTCATCCATATCATCTTCTTTTCTTTATCCATCCAAGTGACATACCCTTTTTCTGCTTCCATGACACAGATAAGACAGCAACGGGATATCGTAGGGCGACACTGCTCTCTGTTTCTTTCAAATTACATATGGGTTTATTCAGGATACGGTCCCCTTAAATTCGGGATTAAAAGGTTTGAGTCCTTGAACTTTTATGACAGAAAATTTTCAGCAAATTAGTTCATCAGTCTCACAAATATGATGTATTATGTATCCATTTGTCATGACATAAAACAGCTGTTGAGTTATTAAAATGTAAGCCTATTGAGGTAGTCAAGGTTATATCCCTGGTTATTTAACTTGATATGATTCTGATGAGCTTTTTCTTGTGCACATTGCAAAGGTGCGCAAGTGAAAACAAAACTCGATTGAAAATTTGACGATTCTGCAATGTTCCCTCTGCAGGGAAATAGACGAGGCGTTAAGGCCTGGTATCTATGCTCTAATAGATGCTTGTTCAGCAGAGGATCTTCAATATCTTCACACTGTATTTGGCGGTAAGTATCGATGAGTCGATACGGTTTTGTTAGAGATGAAGATGCAAAAACTTCACATGCTCATTTTGGTGCTGTTTCTGCAGAAGGTCCATGTAGAAACACTCTAGCAACCTTGCAGCAGGATTACAAACAATTTTTCCAATATGAAGGAAAAGTCTGA
mRNA sequence
ATGAAGAATAAAAAGAGGAAGCTAAAGAGTCCTCAAGAAACCGAGAGGCCTTCCAAGTCTGCTCGTCTTCTTATCTCACCGGAAGAGGAAGTTATAGACGAACCAGAACGGGTTGAGAAGTCGGAACAGAGAGGATTGTGTCAAGAGTCTGAAGAAGGTTGTCCGTGGAGGAACTTGGAGTTGATTTTCTTGATTCAGAACAAAGAGCTCGACCAACAAAAGAAGGTCGAAGCAGTTTTCAGCTTTGTCAACTCAAAGTTGGAAGAGAGAGATAAATATTATGATACGAACATGATGAAGTTACTTGAAGAAGTTCTATCTCATGCCTTGTTTCACACAGTGCATATTGATGGATTTCTGTGCTTACATGGTTCAGAGAAGGTTATAAAATCCCATGATGAGAAATTAGAAGAGTCTAAAGCACATATTAAGAGCTATCATAGGCATTTATTTGATAAAGTGCAAAGATTAGTTGCAGGAAAAAAGTTCTTGGCATTGGGTGCAATTGGAGAATTGTTTCGCGTGCTTGTTGTCAGAGTTAAAACGGTGAAAGGAGGTTCAAAGTTTGAGGACACCAAACTGAATAACAAAATGGGTTGTGTGGGACCTTTGAGGGATGATGTATCAAGTGATGCATCTTATGTGCTTCAAGGAAGTGCTGATGTACCTTCAGAAAAAAGCAATTATCTGAGCAGCTTGAGTACAGAAATAAGGAAGTCTCTTTTTGAGTTCTTTGTACAGATTTTGGATCCTTTACTTCATACTATTGACCACATTAGCGCTGAAATTAAACAGGGACCAGCCTTGTCGGATGTTCATTGCCTACTCAAATCAATAAATAATCTACTTGCAAGTTTTATGAAGGAAAAGGTGTACTTGAGAACAGAGGATAACTCTGAAGGGGCTTACTTTAATTTCTTAAAAAAGGTTTATGATACAGTAATGTTGGTGTCCTCTAATTTGCTTTTATTATTAAGACATGAGATAGAAAACAATGTAGACCTGGAAGTTTTTGTCTTAGCAGCCAATGAGATACTAGTGACTGTTGGTTATCTTCTGGAAATTGAATATGATGTGATTGGGAATGACTTGGTGAGCTTATGGCTTGTGATCCTTTCTTACTCTGCTATTAATCTTTCTTTCACAAGCATACCTAAGCAATACTTGTTAACTTCCAAGATTCAAGAACTTGGATGTCAGCTGGTTGTTTTATATGGTCAGCTACGGCAGGTCAACATTAGTGTTGTTGCTTTATGTAAAGCCATGAGGACAGTACTATCAAATGAACATGAAACTGAAAAAAGTTATGCTAGCGTTATGACTTCTTTAGGTTATGAAGCTTATGGAAAATCAGTTGGGATGCTACTTTCTTCACAAGAAATTAAATTTGCAATTCATAAAGCTATTAAATATGTACCAGAAGGGCAAGCTAGTGGAATTATTCAGCAGTTAACTGAGGATGTGACAGAGACTTTAGGATGGCTGAGACTTTGTAATTTGAACTTGAATATAAGAAATAGCACAAGCGACTCGACTCTGCAAACTATGTTTCTTGGAAGAGGGTTGTCCGAAATGTATGATCTGATGTTGGATTCACTGATGATCACATCTGGCAATGCTTTCCAAATTGGAACTTCAATTGAGAACTTGATTTCAGTCATACGTCCTTGCATGAGTTGTTTGGTTGGACTGCATTCAGATGGTGCCAAAGCATTCCTTGTTGCTATCATGGGGAAAACACGTGATGGTATGATTGCAGATGAAGATAACTGTCTTGGATTTGGAGTGACCAGTCATTGGGTCTTTGTTTTCTTCTTTCGCTTGTACATGTCCTGCCGGAGCCTGTATAGGCAAGCAATTAGCCTGATGCCCCCTAGTTCATCTAGAAAGATGTCAACAGCAATTGGAGATTCATTCATAGCCTATTCTGCTTGTGATTGGATGCAGAGGACTGATTGGAGTGATGAAGGTTATTTTTCATGGATAATTGAACCTTCAACTTCTGTTCTTGCTGTTGCCCAATCTGTTTCTAGCCTTTACCATCAAGGTACGGATGTAGATTGGTGCCCTTTAGTTTATGTATTGCATACCATGACTCTTCAGAGGTTAGTGGATTTGAACAGGCAGATAGACTCTCTCGAGTATGTGCACCAGAGAAATGAGAATCTGATGCAGGTTGAGGTGCTTGGTGATGATGGCTTGTCAGTGTTACGGAAAAAGAGCAAAAAGTTTGGAAGGCTTCTCTCAGATTTGCGAAAAGAAGCAGCAGATCTTACTGATTTCATGATGAAGCATTTATCTTTAGTAGCTAAACACCACATTTTGAATTCCACTAAAAGAAATGCAACTTCAAACGATAAATCTACTGAAACACTTAGTGAGATTGATGATTGGGACTTTAGTATTTGTAATGTGAATAAAAGGTCATTGCCCACAGCTATTTGGTGGATTGTTTGCCAAAATGTTGATATTTGGGCCAATCACGCTGCTAAAAAGAGGCTTATGCCATCAAGGTTTTGTCGTGAGTTGAAGGCTTCAATATTATCATCTTTTCATGATATTAATATAAGTTCAGCTGATTGGATGGAGAAGATAGCCAAACTTGAGCCCGTTTGCAGTAGCAAACGTATTCCTGACGAGAATTTTCCCCTCACTAAAGCAGTTAGTCACTCATCTGATTTGTTGCAAACAGAAGATTGCAAGTTGAAGAGAGATTCACCTCCAAGCAATGTGAGATCCAGAGCTTGTCAGCATTTGATTGACCTTTTATGTTGGATGCCAAAAGGGAATATTAGTTCAAGATCTTTTTCCCTTTATACGACTTATGTTCTCAACCTTGAAAGGCAATTGGTGAATGCTATATTAGATAATCAGACTACTTTATGTTCTCCTAATCAATCTGAGCTCCTCAAATTATTTGTATCTTGCCGGAAGGCTTTGAAATATATATTTATGGCTTCCTATGAGGCTGGCGATAGACAAAGCTCATCTATTCCGGTTCCATCCGAAAATCAATTTCCTGTTTCATGGCTTTTCAAGTCTGTATCAATTGTTAATCAACTTCAAGAAGCTTCTCCAGAAGGTAGCAACAGACAGATTAAGGATATCATCTTTTCATTGATGGATCATACATCATATTTGTTTTTAACAACTAGCAAATATCAATTTAAGAATGCTCTCCGCTTAATTGTAATTGACAACAAACCCTGCAAGGAGCAATGCCAGAACGTTTTCCATGAACTAGTTAATGGAGATGATCTATGTTTAGATCCCACTCACTGTGTGGAAGCTTACAATAGTGCAAGTCAAATGACTATCAGTCTGAAGGAACAGGTGGAGAGTGAATTTATCTCTCTGAAGAAGTCTAATGTTACAGTTGGAGATGGTAAAAACAGTGCCAATATGTGTAAAGTTTATTCTCTAGCTTCTTGCCTTAATGGATTTTTGTGGGGCCTAGCATCTGCCGTGGACCACACTGATTTGAGGAATGGTAGCCGTCACATGAGATCAATGAAGTTGAAATGTGAATATAGTTCCCAACTGAACCTTTGTATAAATGCAATTTCAGAACTATTAGGTCTTCTCTTGGAAATGTTTCTTGATAGAGACAGTGATCGGCCGACAAAATTGTGTGATTATCATACTTCTCAAGACCTTTTGGTTGTGAATGAGGTCTCTGTTAAGGGCTCTAGCTCTGAGGTTGATACTTCATTTAGTAAACACCACAAATTTGAATCTTCTCATAATGATGATGGCACTGAAAATGTTAGCTTTAATAGAAAAAGGTTGAAATTGGAAAATAAAAGCAGTGTTGCCTCTATTCTGAATGAGGCTGACTCAATTGAGCTGCAATCCTTGAACCAACCTGTCTTGCGAGGTCTGCTGAAAGGTTCCTATCCTGAAGTAGCATTTGCACTTAAACAGCTATTTCTTGCTGCTTCTGTCATTTTGAGATTGCATATGAAATATGATACTGTTCCTTTGTCATCAAGTTTTATGGCCATCCTAATTGGCATTTCACGATTATTGTTGCTAGAATTTGTGGACATGGTTGAAGTGCCACAGCCATTTTTGCTTGCTTGCTTGGATGGTGTTCTAAAGTATCTGGAGGAATTAGGCCATCTGTTTCCTTTTTCTGATCCTATGCAATCCAGAAATCTATATTCTAGGCTTATAGACATACAGTTGAAAGCTATAGGAAAGTGCATATCTCTACTGGGAAAAAGAGCTACTTTAGCATCCCACGAGACAGAGTCCACTACAAAGACTCTTGATGGTCACTTAGGTCTATTCGAAGAATCATCTTTTCCTGGAGTCTACCGTATAGATGAATTTAAAACTTCATTGAGAATGTCATTCAAAGTGTTCATAAGGAAAGCCTCGGAGCTGCATCTCTTATCTGCAATTCAGGCTATTGAGAGAGCTCTAGTTGGAGTGCAGGAAGGTTGTACGGCGATATATGAATTATATTCCGGAAGTGAAGATGGGGGAAGGTGTTCCTCTATTGTTGCGGCTGGTGTCGAATGCTTGGATTTGGTTCTTGAATTTGTTTCAGGACGTAAATGCTTGGGTGTGGTTAAAAGACACATTCAGAGCTTAATTGCTGGTCTGTTTAGCATAGTTCTGCACTTGCAGAGTCCACAAGTTTTCTATACAAGAATGATTGACACGAAAAACAAGAGTGGTCCAGATCCGGGATCAGTCATTCTAATGTCTGTTGAAGTGCTTACAAGAGTTTCGGGAAAGCATGCTCTTTTCCAAATGAATGCTTGGCATGTAGCACAGTGTTTACGCATTCCTGCAGCACTTTTTGAAGAATTTTCTCTTAAGCTCCTAGGAATTCCCATTCAATCAGATAATTCCCTCATCTCAGCTCAGGAAGTCCCCAATCGAGTTGTAACCACAACTAAATCCATTATAGACAGACAATTCTTAATAGATCTTTTTGCTGCCAGCTGCCGGTTATTATATACCGTTCTCAAGCATCATAAAAGTGAGTGCAAGCGTTGCATTGCTCAACTACAAGCATCAGTGTCTGTTCTTCTTCATTCCCTTGAAAGAGTAGGTCCAGATCCGGAATTTATGGGTGGTTACTTTTCGTGGAAAGTAGATGAGGGAGTAAAGTGTGCTTGTTTCCTCCGAAGGATTTATGAAGAGATAAGACAGCAACGGGATATCGTAGGGCGACACTGCTCTCTGTTTCTTTCAAATTACATATGGGTTTATTCAGGATACGGTCCCCTTAAATTCGGGATTAAAAGGGAAATAGACGAGGCGTTAAGGCCTGGTATCTATGCTCTAATAGATGCTTGTTCAGCAGAGGATCTTCAATATCTTCACACTGTATTTGGCGAAGGTCCATGTAGAAACACTCTAGCAACCTTGCAGCAGGATTACAAACAATTTTTCCAATATGAAGGAAAAGTCTGA
Coding sequence (CDS)
ATGAAGAATAAAAAGAGGAAGCTAAAGAGTCCTCAAGAAACCGAGAGGCCTTCCAAGTCTGCTCGTCTTCTTATCTCACCGGAAGAGGAAGTTATAGACGAACCAGAACGGGTTGAGAAGTCGGAACAGAGAGGATTGTGTCAAGAGTCTGAAGAAGGTTGTCCGTGGAGGAACTTGGAGTTGATTTTCTTGATTCAGAACAAAGAGCTCGACCAACAAAAGAAGGTCGAAGCAGTTTTCAGCTTTGTCAACTCAAAGTTGGAAGAGAGAGATAAATATTATGATACGAACATGATGAAGTTACTTGAAGAAGTTCTATCTCATGCCTTGTTTCACACAGTGCATATTGATGGATTTCTGTGCTTACATGGTTCAGAGAAGGTTATAAAATCCCATGATGAGAAATTAGAAGAGTCTAAAGCACATATTAAGAGCTATCATAGGCATTTATTTGATAAAGTGCAAAGATTAGTTGCAGGAAAAAAGTTCTTGGCATTGGGTGCAATTGGAGAATTGTTTCGCGTGCTTGTTGTCAGAGTTAAAACGGTGAAAGGAGGTTCAAAGTTTGAGGACACCAAACTGAATAACAAAATGGGTTGTGTGGGACCTTTGAGGGATGATGTATCAAGTGATGCATCTTATGTGCTTCAAGGAAGTGCTGATGTACCTTCAGAAAAAAGCAATTATCTGAGCAGCTTGAGTACAGAAATAAGGAAGTCTCTTTTTGAGTTCTTTGTACAGATTTTGGATCCTTTACTTCATACTATTGACCACATTAGCGCTGAAATTAAACAGGGACCAGCCTTGTCGGATGTTCATTGCCTACTCAAATCAATAAATAATCTACTTGCAAGTTTTATGAAGGAAAAGGTGTACTTGAGAACAGAGGATAACTCTGAAGGGGCTTACTTTAATTTCTTAAAAAAGGTTTATGATACAGTAATGTTGGTGTCCTCTAATTTGCTTTTATTATTAAGACATGAGATAGAAAACAATGTAGACCTGGAAGTTTTTGTCTTAGCAGCCAATGAGATACTAGTGACTGTTGGTTATCTTCTGGAAATTGAATATGATGTGATTGGGAATGACTTGGTGAGCTTATGGCTTGTGATCCTTTCTTACTCTGCTATTAATCTTTCTTTCACAAGCATACCTAAGCAATACTTGTTAACTTCCAAGATTCAAGAACTTGGATGTCAGCTGGTTGTTTTATATGGTCAGCTACGGCAGGTCAACATTAGTGTTGTTGCTTTATGTAAAGCCATGAGGACAGTACTATCAAATGAACATGAAACTGAAAAAAGTTATGCTAGCGTTATGACTTCTTTAGGTTATGAAGCTTATGGAAAATCAGTTGGGATGCTACTTTCTTCACAAGAAATTAAATTTGCAATTCATAAAGCTATTAAATATGTACCAGAAGGGCAAGCTAGTGGAATTATTCAGCAGTTAACTGAGGATGTGACAGAGACTTTAGGATGGCTGAGACTTTGTAATTTGAACTTGAATATAAGAAATAGCACAAGCGACTCGACTCTGCAAACTATGTTTCTTGGAAGAGGGTTGTCCGAAATGTATGATCTGATGTTGGATTCACTGATGATCACATCTGGCAATGCTTTCCAAATTGGAACTTCAATTGAGAACTTGATTTCAGTCATACGTCCTTGCATGAGTTGTTTGGTTGGACTGCATTCAGATGGTGCCAAAGCATTCCTTGTTGCTATCATGGGGAAAACACGTGATGGTATGATTGCAGATGAAGATAACTGTCTTGGATTTGGAGTGACCAGTCATTGGGTCTTTGTTTTCTTCTTTCGCTTGTACATGTCCTGCCGGAGCCTGTATAGGCAAGCAATTAGCCTGATGCCCCCTAGTTCATCTAGAAAGATGTCAACAGCAATTGGAGATTCATTCATAGCCTATTCTGCTTGTGATTGGATGCAGAGGACTGATTGGAGTGATGAAGGTTATTTTTCATGGATAATTGAACCTTCAACTTCTGTTCTTGCTGTTGCCCAATCTGTTTCTAGCCTTTACCATCAAGGTACGGATGTAGATTGGTGCCCTTTAGTTTATGTATTGCATACCATGACTCTTCAGAGGTTAGTGGATTTGAACAGGCAGATAGACTCTCTCGAGTATGTGCACCAGAGAAATGAGAATCTGATGCAGGTTGAGGTGCTTGGTGATGATGGCTTGTCAGTGTTACGGAAAAAGAGCAAAAAGTTTGGAAGGCTTCTCTCAGATTTGCGAAAAGAAGCAGCAGATCTTACTGATTTCATGATGAAGCATTTATCTTTAGTAGCTAAACACCACATTTTGAATTCCACTAAAAGAAATGCAACTTCAAACGATAAATCTACTGAAACACTTAGTGAGATTGATGATTGGGACTTTAGTATTTGTAATGTGAATAAAAGGTCATTGCCCACAGCTATTTGGTGGATTGTTTGCCAAAATGTTGATATTTGGGCCAATCACGCTGCTAAAAAGAGGCTTATGCCATCAAGGTTTTGTCGTGAGTTGAAGGCTTCAATATTATCATCTTTTCATGATATTAATATAAGTTCAGCTGATTGGATGGAGAAGATAGCCAAACTTGAGCCCGTTTGCAGTAGCAAACGTATTCCTGACGAGAATTTTCCCCTCACTAAAGCAGTTAGTCACTCATCTGATTTGTTGCAAACAGAAGATTGCAAGTTGAAGAGAGATTCACCTCCAAGCAATGTGAGATCCAGAGCTTGTCAGCATTTGATTGACCTTTTATGTTGGATGCCAAAAGGGAATATTAGTTCAAGATCTTTTTCCCTTTATACGACTTATGTTCTCAACCTTGAAAGGCAATTGGTGAATGCTATATTAGATAATCAGACTACTTTATGTTCTCCTAATCAATCTGAGCTCCTCAAATTATTTGTATCTTGCCGGAAGGCTTTGAAATATATATTTATGGCTTCCTATGAGGCTGGCGATAGACAAAGCTCATCTATTCCGGTTCCATCCGAAAATCAATTTCCTGTTTCATGGCTTTTCAAGTCTGTATCAATTGTTAATCAACTTCAAGAAGCTTCTCCAGAAGGTAGCAACAGACAGATTAAGGATATCATCTTTTCATTGATGGATCATACATCATATTTGTTTTTAACAACTAGCAAATATCAATTTAAGAATGCTCTCCGCTTAATTGTAATTGACAACAAACCCTGCAAGGAGCAATGCCAGAACGTTTTCCATGAACTAGTTAATGGAGATGATCTATGTTTAGATCCCACTCACTGTGTGGAAGCTTACAATAGTGCAAGTCAAATGACTATCAGTCTGAAGGAACAGGTGGAGAGTGAATTTATCTCTCTGAAGAAGTCTAATGTTACAGTTGGAGATGGTAAAAACAGTGCCAATATGTGTAAAGTTTATTCTCTAGCTTCTTGCCTTAATGGATTTTTGTGGGGCCTAGCATCTGCCGTGGACCACACTGATTTGAGGAATGGTAGCCGTCACATGAGATCAATGAAGTTGAAATGTGAATATAGTTCCCAACTGAACCTTTGTATAAATGCAATTTCAGAACTATTAGGTCTTCTCTTGGAAATGTTTCTTGATAGAGACAGTGATCGGCCGACAAAATTGTGTGATTATCATACTTCTCAAGACCTTTTGGTTGTGAATGAGGTCTCTGTTAAGGGCTCTAGCTCTGAGGTTGATACTTCATTTAGTAAACACCACAAATTTGAATCTTCTCATAATGATGATGGCACTGAAAATGTTAGCTTTAATAGAAAAAGGTTGAAATTGGAAAATAAAAGCAGTGTTGCCTCTATTCTGAATGAGGCTGACTCAATTGAGCTGCAATCCTTGAACCAACCTGTCTTGCGAGGTCTGCTGAAAGGTTCCTATCCTGAAGTAGCATTTGCACTTAAACAGCTATTTCTTGCTGCTTCTGTCATTTTGAGATTGCATATGAAATATGATACTGTTCCTTTGTCATCAAGTTTTATGGCCATCCTAATTGGCATTTCACGATTATTGTTGCTAGAATTTGTGGACATGGTTGAAGTGCCACAGCCATTTTTGCTTGCTTGCTTGGATGGTGTTCTAAAGTATCTGGAGGAATTAGGCCATCTGTTTCCTTTTTCTGATCCTATGCAATCCAGAAATCTATATTCTAGGCTTATAGACATACAGTTGAAAGCTATAGGAAAGTGCATATCTCTACTGGGAAAAAGAGCTACTTTAGCATCCCACGAGACAGAGTCCACTACAAAGACTCTTGATGGTCACTTAGGTCTATTCGAAGAATCATCTTTTCCTGGAGTCTACCGTATAGATGAATTTAAAACTTCATTGAGAATGTCATTCAAAGTGTTCATAAGGAAAGCCTCGGAGCTGCATCTCTTATCTGCAATTCAGGCTATTGAGAGAGCTCTAGTTGGAGTGCAGGAAGGTTGTACGGCGATATATGAATTATATTCCGGAAGTGAAGATGGGGGAAGGTGTTCCTCTATTGTTGCGGCTGGTGTCGAATGCTTGGATTTGGTTCTTGAATTTGTTTCAGGACGTAAATGCTTGGGTGTGGTTAAAAGACACATTCAGAGCTTAATTGCTGGTCTGTTTAGCATAGTTCTGCACTTGCAGAGTCCACAAGTTTTCTATACAAGAATGATTGACACGAAAAACAAGAGTGGTCCAGATCCGGGATCAGTCATTCTAATGTCTGTTGAAGTGCTTACAAGAGTTTCGGGAAAGCATGCTCTTTTCCAAATGAATGCTTGGCATGTAGCACAGTGTTTACGCATTCCTGCAGCACTTTTTGAAGAATTTTCTCTTAAGCTCCTAGGAATTCCCATTCAATCAGATAATTCCCTCATCTCAGCTCAGGAAGTCCCCAATCGAGTTGTAACCACAACTAAATCCATTATAGACAGACAATTCTTAATAGATCTTTTTGCTGCCAGCTGCCGGTTATTATATACCGTTCTCAAGCATCATAAAAGTGAGTGCAAGCGTTGCATTGCTCAACTACAAGCATCAGTGTCTGTTCTTCTTCATTCCCTTGAAAGAGTAGGTCCAGATCCGGAATTTATGGGTGGTTACTTTTCGTGGAAAGTAGATGAGGGAGTAAAGTGTGCTTGTTTCCTCCGAAGGATTTATGAAGAGATAAGACAGCAACGGGATATCGTAGGGCGACACTGCTCTCTGTTTCTTTCAAATTACATATGGGTTTATTCAGGATACGGTCCCCTTAAATTCGGGATTAAAAGGGAAATAGACGAGGCGTTAAGGCCTGGTATCTATGCTCTAATAGATGCTTGTTCAGCAGAGGATCTTCAATATCTTCACACTGTATTTGGCGAAGGTCCATGTAGAAACACTCTAGCAACCTTGCAGCAGGATTACAAACAATTTTTCCAATATGAAGGAAAAGTCTGA
Protein sequence
MKNKKRKLKSPQETERPSKSARLLISPEEEVIDEPERVEKSEQRGLCQESEEGCPWRNLELIFLIQNKELDQQKKVEAVFSFVNSKLEERDKYYDTNMMKLLEEVLSHALFHTVHIDGFLCLHGSEKVIKSHDEKLEESKAHIKSYHRHLFDKVQRLVAGKKFLALGAIGELFRVLVVRVKTVKGGSKFEDTKLNNKMGCVGPLRDDVSSDASYVLQGSADVPSEKSNYLSSLSTEIRKSLFEFFVQILDPLLHTIDHISAEIKQGPALSDVHCLLKSINNLLASFMKEKVYLRTEDNSEGAYFNFLKKVYDTVMLVSSNLLLLLRHEIENNVDLEVFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVILSYSAINLSFTSIPKQYLLTSKIQELGCQLVVLYGQLRQVNISVVALCKAMRTVLSNEHETEKSYASVMTSLGYEAYGKSVGMLLSSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLNIRNSTSDSTLQTMFLGRGLSEMYDLMLDSLMITSGNAFQIGTSIENLISVIRPCMSCLVGLHSDGAKAFLVAIMGKTRDGMIADEDNCLGFGVTSHWVFVFFFRLYMSCRSLYRQAISLMPPSSSRKMSTAIGDSFIAYSACDWMQRTDWSDEGYFSWIIEPSTSVLAVAQSVSSLYHQGTDVDWCPLVYVLHTMTLQRLVDLNRQIDSLEYVHQRNENLMQVEVLGDDGLSVLRKKSKKFGRLLSDLRKEAADLTDFMMKHLSLVAKHHILNSTKRNATSNDKSTETLSEIDDWDFSICNVNKRSLPTAIWWIVCQNVDIWANHAAKKRLMPSRFCRELKASILSSFHDINISSADWMEKIAKLEPVCSSKRIPDENFPLTKAVSHSSDLLQTEDCKLKRDSPPSNVRSRACQHLIDLLCWMPKGNISSRSFSLYTTYVLNLERQLVNAILDNQTTLCSPNQSELLKLFVSCRKALKYIFMASYEAGDRQSSSIPVPSENQFPVSWLFKSVSIVNQLQEASPEGSNRQIKDIIFSLMDHTSYLFLTTSKYQFKNALRLIVIDNKPCKEQCQNVFHELVNGDDLCLDPTHCVEAYNSASQMTISLKEQVESEFISLKKSNVTVGDGKNSANMCKVYSLASCLNGFLWGLASAVDHTDLRNGSRHMRSMKLKCEYSSQLNLCINAISELLGLLLEMFLDRDSDRPTKLCDYHTSQDLLVVNEVSVKGSSSEVDTSFSKHHKFESSHNDDGTENVSFNRKRLKLENKSSVASILNEADSIELQSLNQPVLRGLLKGSYPEVAFALKQLFLAASVILRLHMKYDTVPLSSSFMAILIGISRLLLLEFVDMVEVPQPFLLACLDGVLKYLEELGHLFPFSDPMQSRNLYSRLIDIQLKAIGKCISLLGKRATLASHETESTTKTLDGHLGLFEESSFPGVYRIDEFKTSLRMSFKVFIRKASELHLLSAIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEFVSGRKCLGVVKRHIQSLIAGLFSIVLHLQSPQVFYTRMIDTKNKSGPDPGSVILMSVEVLTRVSGKHALFQMNAWHVAQCLRIPAALFEEFSLKLLGIPIQSDNSLISAQEVPNRVVTTTKSIIDRQFLIDLFAASCRLLYTVLKHHKSECKRCIAQLQASVSVLLHSLERVGPDPEFMGGYFSWKVDEGVKCACFLRRIYEEIRQQRDIVGRHCSLFLSNYIWVYSGYGPLKFGIKREIDEALRPGIYALIDACSAEDLQYLHTVFGEGPCRNTLATLQQDYKQFFQYEGKV
Homology
BLAST of Sgr023809 vs. NCBI nr
Match:
XP_022146288.1 (uncharacterized protein LOC111015534 isoform X1 [Momordica charantia])
HSP 1 Score: 2789.6 bits (7230), Expect = 0.0e+00
Identity = 1512/2082 (72.62%), Postives = 1605/2082 (77.09%), Query Frame = 0
Query: 1 MKNKKRKLKSPQETERPSKSARLLISPEEEVIDEPERVEKSEQRGLCQESEEGCPWRNLE 60
MKNKKRKLKSPQ+ ERP KSAR++I EEEV+DEP RVEKSEQR L + SEEG PWRNLE
Sbjct: 21 MKNKKRKLKSPQKAERPPKSARVIIPLEEEVVDEPGRVEKSEQRELFRGSEEGRPWRNLE 80
Query: 61 LIFLIQNKELDQQKKVEAVFSFVNSKLEERDKYYDT------------------------ 120
LIFLIQNKELDQQKKVEAVFSFV+SKL+E DK YDT
Sbjct: 81 LIFLIQNKELDQQKKVEAVFSFVDSKLKEEDKCYDTVKISRLIVFLSDWVQSLLISFEKK 140
Query: 121 ------------------------------------------------------------ 180
Sbjct: 141 AKNDGGKHLKMAIEPCLDYRCWEVFKFCLEESVKMNITLNLSRNLLHAFCFVTRNAISLL 200
Query: 181 ------------------------------------------------------------ 240
Sbjct: 201 DVSSSSKEELFAGDCLKLYNCVQDCVSLVFSSHLGLSNDNLDAWISTIDAVLEFLHKIHV 260
Query: 241 ------------------------------------------------------------ 300
Sbjct: 261 NSLEGEDVGIFATLFSRMMLEPFAKFLWIHPTKKTGFHNFVNKLLEPLLQLLRDLSLKAD 320
Query: 301 --------NMMKLLEEVLSHALFHTVHIDGFLCLHGSEKVIKSHDEKLEESKAHIKSYHR 360
+MKLLE+VLSHALFHTVHIDGFLCLHGSEKV KSHDEKLEESKAH+KSYHR
Sbjct: 321 GCNHGRTRTLMKLLEDVLSHALFHTVHIDGFLCLHGSEKVTKSHDEKLEESKAHMKSYHR 380
Query: 361 HLFDKVQRLVAGKKFLALGAIGELFRVLVVRVKTVKGGSK-FEDTKLNNKMGCVGPLRDD 420
HLFDKVQ+LVA KKFLALGA+GELF VLVVRV VKG S FEDTKLNNKMGC G LRDD
Sbjct: 381 HLFDKVQKLVAEKKFLALGAVGELFDVLVVRVNKVKGASMLFEDTKLNNKMGCFGHLRDD 440
Query: 421 VSSDASYVLQGSADVPSEKSNYLSSLSTEIRKSLFEFFVQILDPLLHTIDHISAEIKQGP 480
SS AS LQGSAD SEKSNY SSLSTEIRKSLFEFFVQILDPLL TIDHISAEIK GP
Sbjct: 441 TSSHASRALQGSADGLSEKSNYSSSLSTEIRKSLFEFFVQILDPLLLTIDHISAEIKLGP 500
Query: 481 ALSDVHCLLKSINNLLASFMKEKVYLRTEDNSEGAYFNFLKKVYDTVMLVSSNLLLLLRH 540
ALSDV LLKSINNLLASFMK KVYLRTEDNSEGAY NFLKKVYD VM VSSNLL L RH
Sbjct: 501 ALSDVCYLLKSINNLLASFMKGKVYLRTEDNSEGAYLNFLKKVYDKVMFVSSNLLSLSRH 560
Query: 541 EIENNVDLEVFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVILSYSAINLSFTSIPKQ 600
E+ENN+D VFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVI+SYSAINLSFTSIP+Q
Sbjct: 561 ELENNIDQGVFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVIISYSAINLSFTSIPEQ 620
Query: 601 YLLTSKIQELGCQLVVLYGQLRQVNISVVALCKAMRTVLSNEHETEKSYASVMTSLGYEA 660
+LLTS+IQELGCQLVVLYGQLRQVNI + ALCKAMRTV+SNE E EKSYAS MTSLG+EA
Sbjct: 621 HLLTSRIQELGCQLVVLYGQLRQVNIIIFALCKAMRTVISNEGENEKSYASFMTSLGHEA 680
Query: 661 YGKSVGMLLSSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLNIRNS 720
YGKSVG L+SSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLN RNS
Sbjct: 681 YGKSVGTLVSSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLNTRNS 740
Query: 721 TSDSTLQTMFLGRGLSEMYDLMLDSLMITSGNAFQIGTSIENLISVIRPCMSCLVGLHSD 780
S L+T+ LGRGLSEMY LMLDSLMITSGNA QIGTSI+NLISV+RPCMS LVGL SD
Sbjct: 741 KSCLNLKTLLLGRGLSEMYALMLDSLMITSGNALQIGTSIDNLISVLRPCMSILVGLQSD 800
Query: 781 GAKAFLVAIMGKTRDGMIADEDNCLGFGVTSHWVFVFFFRLYMSCRSLYRQAISLMPPSS 840
GAK F+VAIM K D ++ADEDNC GFGV SHWVFVFFFRLYMSCRSLYRQAISLMPP S
Sbjct: 801 GAKEFVVAIMEKKCDDVVADEDNCQGFGVISHWVFVFFFRLYMSCRSLYRQAISLMPPGS 860
Query: 841 SRKMSTAIGDSFIAYSACDWMQRTDWSDEGYFSWIIEPSTSVLAVAQSVSSLYHQGTDVD 900
SRKMS AIGDS +AYSACDWMQRTDWSDEGYFSWII+PS SVLAVAQS+ SLYHQGTD D
Sbjct: 861 SRKMSAAIGDSMVAYSACDWMQRTDWSDEGYFSWIIQPSASVLAVAQSICSLYHQGTDED 920
Query: 901 WCPLVYVLHTMTLQRLVDLNRQIDSLEYVHQRNENLMQVEVLGDDGLSVLRKKSKKFGRL 960
W PL+YVL TM LQRLVDLNRQIDSLEY+HQRNENLMQVEVLGDD LSVLRKKSKKFGRL
Sbjct: 921 WYPLIYVLLTMALQRLVDLNRQIDSLEYLHQRNENLMQVEVLGDDDLSVLRKKSKKFGRL 980
Query: 961 LSDLRKEAADLTDFMMKHLSLVAKHHILNSTKRNATSNDKSTETLSEIDDWDFSICNVNK 1020
+S L+KEAADLTDFMM HLSL+AK ILN K ATSN+K ETL EIDDWDFSIC++NK
Sbjct: 981 VSVLQKEAADLTDFMMSHLSLIAKRQILNPAK-IATSNEKCIETLDEIDDWDFSICSMNK 1040
Query: 1021 RSLPTAIWWIVCQNVDIWANHAAKKRL--------------------------------- 1080
RS PTA+WW+VCQNVDIWA HAAKK+L
Sbjct: 1041 RSFPTAVWWVVCQNVDIWAIHAAKKKLKMFLSFLIRTSHPFLTSNDMKIESQQNDGCQQL 1100
Query: 1081 ---------------------------MPSRFCRELKASILSSFHDINISSADWMEKIAK 1140
MPSRFC ELKA++L SFHDI+ SSADWME IA
Sbjct: 1101 NKVSLQQISSSVLSDPIFYEQRFVCRFMPSRFCHELKATVLPSFHDISTSSADWMEVIAT 1160
Query: 1141 LEPVCSSKRIPDENFPLTKAVSHSSDLLQTEDCKLKRDSPPSNVRSRACQHLIDLLCWMP 1200
LE L TEDC+ K DSPPSNVR RACQH I+LLCWMP
Sbjct: 1161 LE------------------------LSTTEDCQSKSDSPPSNVRFRACQHFINLLCWMP 1220
Query: 1201 KGNISSRSFSLYTTYVLNLERQLVNAILDNQTTLCSPNQSELLKLFVSCRKALKYIFMAS 1260
KGNISSRSFSLYTT VL LERQLV LD+QTTLCS NQ ELLKLF SCRKALKYIF A
Sbjct: 1221 KGNISSRSFSLYTTNVLELERQLV---LDSQTTLCSENQFELLKLFASCRKALKYIFTAY 1280
Query: 1261 YEAGDRQSSSIPVPSENQFPVSWLFKSVSIVNQLQEASPEGSNRQIKDIIFSLMDHTSYL 1320
YEAGDRQSSS PVPSENQFPVSWLFKSVSIVNQLQ+AS GS+RQIKDIIFSLMDHTSYL
Sbjct: 1281 YEAGDRQSSSTPVPSENQFPVSWLFKSVSIVNQLQDASSGGSDRQIKDIIFSLMDHTSYL 1340
Query: 1321 FLTTSKYQFKNALRLIVIDNKPCKEQCQNVFHELVNGDDLCLDPTHCVEAYNSASQMTIS 1380
FLTTSKYQFKNALRLIVIDNKPC EQ +NV HEL +GDDL L C+EA NSA QMTIS
Sbjct: 1341 FLTTSKYQFKNALRLIVIDNKPCMEQPENVSHELNDGDDLFLGSNRCLEACNSAIQMTIS 1400
Query: 1381 LKEQVESEFISLKKSNVTVGDGKNSANMCKVYSLASCLNGFLWGLASAVDHTDLRNGSRH 1440
LKEQVESE I LKKSNVTVGDGKN NM KVYSLASCLNGFLWGLASA D TDLRN +RH
Sbjct: 1401 LKEQVESELIYLKKSNVTVGDGKNRGNMYKVYSLASCLNGFLWGLASAEDDTDLRNSNRH 1460
Query: 1441 MRSMKLKCEYSSQLNLCINAISELLGLLLEMFLDRDSDRPTKLCDYHTSQDLLVVNEVSV 1500
RSMKLKCE+SSQLNLCINAISELLGL+LEMFLDRDS RP KLCDY TSQD L VNE S
Sbjct: 1461 TRSMKLKCEFSSQLNLCINAISELLGLILEMFLDRDSQRPQKLCDYQTSQDFLGVNEPSG 1520
Query: 1501 KGSSSEVDTSFSKHHKFESSHNDDGTENVSFNRKRLKLENKSSVASILNEADSIELQSLN 1560
KG SSEVDTS SK+ K ESS +DD +N S RKRLKL NKSSVASIL+EA+ IE+QSLN
Sbjct: 1521 KGPSSEVDTSCSKYQKLESSQSDDDNKNTSLKRKRLKLGNKSSVASILSEANLIEMQSLN 1580
Query: 1561 QPVLRGLLKGSYPEVAFALKQLFLAASVILRLHMKYDTVPLSSSFMAILIGISRLLLLEF 1620
+P LRGLLKGSYPE AFALKQLFLAASVILRLHMKYD++PLSSS MAILI ISR LLL+F
Sbjct: 1581 KPFLRGLLKGSYPEAAFALKQLFLAASVILRLHMKYDSIPLSSSSMAILISISRFLLLKF 1640
Query: 1621 VDMVEVPQPFLLACLDGVLKYLEELGHLFPFSDPMQSRNLYSRLIDIQLKAIGKCISLLG 1680
VDMVEVPQPFLL CLDGVLKYLE LGHLFPF+DPMQSRNLYS LI++ L+AIGKCISL G
Sbjct: 1641 VDMVEVPQPFLLTCLDGVLKYLEGLGHLFPFADPMQSRNLYSNLINLHLQAIGKCISLQG 1700
Query: 1681 KRATLASHETESTTKTLDGHLGLFEESSFPGVYRIDEFKTSLRMSFKVFIRKASELHLLS 1740
KRATL SH+TESTTKTLDGHL LFEESSFP +Y ID+FK+SLRMSFKVFIRKASELHLLS
Sbjct: 1701 KRATLTSHDTESTTKTLDGHLCLFEESSFPRIYYIDQFKSSLRMSFKVFIRKASELHLLS 1760
Query: 1741 AIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEFVSGRKCLGVVK 1800
AIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEF SGRKCL VVK
Sbjct: 1761 AIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEFASGRKCLSVVK 1820
Query: 1801 RHIQSLIAGLFSIVLHLQSPQVFYTRMIDTKNKSGPDPGSVILMSVEVLTRVSGKHALFQ 1810
RHIQSLIAGLFSIVLHLQ+PQ+FY+RMIDTKNKS PDPGSVILMSVEVLTRVSGKHALFQ
Sbjct: 1821 RHIQSLIAGLFSIVLHLQTPQIFYSRMIDTKNKSDPDPGSVILMSVEVLTRVSGKHALFQ 1880
BLAST of Sgr023809 vs. NCBI nr
Match:
XP_022146289.1 (uncharacterized protein LOC111015534 isoform X2 [Momordica charantia])
HSP 1 Score: 2788.1 bits (7226), Expect = 0.0e+00
Identity = 1511/2082 (72.57%), Postives = 1604/2082 (77.04%), Query Frame = 0
Query: 1 MKNKKRKLKSPQETERPSKSARLLISPEEEVIDEPERVEKSEQRGLCQESEEGCPWRNLE 60
MKNKKRKLKSPQ+ ERP KSAR++I EEEV+DEP RVEKSEQR L + SEEG PWRNLE
Sbjct: 21 MKNKKRKLKSPQKAERPPKSARVIIPLEEEVVDEPGRVEKSEQRELFRGSEEGRPWRNLE 80
Query: 61 LIFLIQNKELDQQKKVEAVFSFVNSKLEERDKYYDT------------------------ 120
LIFLIQNKELDQQKKVEAVFSFV+SKL+E DK YDT
Sbjct: 81 LIFLIQNKELDQQKKVEAVFSFVDSKLKEEDKCYDTVKISRLIVFLSDWVQSLLISFEKK 140
Query: 121 ------------------------------------------------------------ 180
Sbjct: 141 AKNDGGKHLKMAIEPCLDYRCWEVFKFCLEESVKMNITLNLSRNLLHAFCFVTRNAISLL 200
Query: 181 ------------------------------------------------------------ 240
Sbjct: 201 DVSSSSKEELFAGDCLKLYNCVQDCVSLVFSSHLGLSNDNLDAWISTIDAVLEFLHKIHV 260
Query: 241 ------------------------------------------------------------ 300
Sbjct: 261 NSLEGEDVGIFATLFSRMMLEPFAKFLWIHPTKKTGFHNFVNKLLEPLLQLLRDLSLKAD 320
Query: 301 --------NMMKLLEEVLSHALFHTVHIDGFLCLHGSEKVIKSHDEKLEESKAHIKSYHR 360
+MKLLE+VLSHALFHTVHIDGFLCLHGSEKV KSHDEKLEESKAH+KSYHR
Sbjct: 321 GCNHGRTRTLMKLLEDVLSHALFHTVHIDGFLCLHGSEKVTKSHDEKLEESKAHMKSYHR 380
Query: 361 HLFDKVQRLVAGKKFLALGAIGELFRVLVVRVKTVKGGSK-FEDTKLNNKMGCVGPLRDD 420
HLFDKVQ+LVA KKFLALGA+GELF VLVVRV VKG S FEDTKLNNKMGC G LRDD
Sbjct: 381 HLFDKVQKLVAEKKFLALGAVGELFDVLVVRVNKVKGASMLFEDTKLNNKMGCFGHLRDD 440
Query: 421 VSSDASYVLQGSADVPSEKSNYLSSLSTEIRKSLFEFFVQILDPLLHTIDHISAEIKQGP 480
SS AS LQGSAD SEKSNY SSLSTEIRKSLFEFFVQILDPLL TIDHISAEIK GP
Sbjct: 441 TSSHASRALQGSADGLSEKSNYSSSLSTEIRKSLFEFFVQILDPLLLTIDHISAEIKLGP 500
Query: 481 ALSDVHCLLKSINNLLASFMKEKVYLRTEDNSEGAYFNFLKKVYDTVMLVSSNLLLLLRH 540
ALSDV LLKSINNLLASFMK KVYLRTEDNSEGAY NFLKKVYD VM VSSNLL L RH
Sbjct: 501 ALSDVCYLLKSINNLLASFMKGKVYLRTEDNSEGAYLNFLKKVYDKVMFVSSNLLSLSRH 560
Query: 541 EIENNVDLEVFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVILSYSAINLSFTSIPKQ 600
E+ENN+D VFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVI+SYSAINLSFTSIP+Q
Sbjct: 561 ELENNIDQGVFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVIISYSAINLSFTSIPEQ 620
Query: 601 YLLTSKIQELGCQLVVLYGQLRQVNISVVALCKAMRTVLSNEHETEKSYASVMTSLGYEA 660
+LLTS+IQELGCQLVVLYGQLRQVNI + ALCKAMRTV+SNE E EKSYAS MTSLG+EA
Sbjct: 621 HLLTSRIQELGCQLVVLYGQLRQVNIIIFALCKAMRTVISNEGENEKSYASFMTSLGHEA 680
Query: 661 YGKSVGMLLSSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLNIRNS 720
YGKSVG L+SSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLN RNS
Sbjct: 681 YGKSVGTLVSSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLNTRNS 740
Query: 721 TSDSTLQTMFLGRGLSEMYDLMLDSLMITSGNAFQIGTSIENLISVIRPCMSCLVGLHSD 780
S L+T+ LGRGLSEMY LMLDSLMITSGNA QIGTSI+NLISV+RPCMS LVGL SD
Sbjct: 741 KSCLNLKTLLLGRGLSEMYALMLDSLMITSGNALQIGTSIDNLISVLRPCMSILVGLQSD 800
Query: 781 GAKAFLVAIMGKTRDGMIADEDNCLGFGVTSHWVFVFFFRLYMSCRSLYRQAISLMPPSS 840
GAK F+VAIM K D ++ADEDNC GFGV SHWVFVFFFRLYMSCRSLYRQAISLMPP S
Sbjct: 801 GAKEFVVAIMEKKCDDVVADEDNCQGFGVISHWVFVFFFRLYMSCRSLYRQAISLMPPGS 860
Query: 841 SRKMSTAIGDSFIAYSACDWMQRTDWSDEGYFSWIIEPSTSVLAVAQSVSSLYHQGTDVD 900
SRKMS AIGDS +AYSACDWMQRTDWSDEGYFSWII+PS SVLAVAQS+ SLYHQGTD D
Sbjct: 861 SRKMSAAIGDSMVAYSACDWMQRTDWSDEGYFSWIIQPSASVLAVAQSICSLYHQGTDED 920
Query: 901 WCPLVYVLHTMTLQRLVDLNRQIDSLEYVHQRNENLMQVEVLGDDGLSVLRKKSKKFGRL 960
W PL+YVL TM LQRLVDLNRQIDSLEY+HQRNENLMQVEVLGDD LSVLRKKSKKFGRL
Sbjct: 921 WYPLIYVLLTMALQRLVDLNRQIDSLEYLHQRNENLMQVEVLGDDDLSVLRKKSKKFGRL 980
Query: 961 LSDLRKEAADLTDFMMKHLSLVAKHHILNSTKRNATSNDKSTETLSEIDDWDFSICNVNK 1020
+S L+KEAADLTDFMM HLSL+AK ILN K ATSN+K ETL EIDDWDFSIC++NK
Sbjct: 981 VSVLQKEAADLTDFMMSHLSLIAKRQILNPAK-IATSNEKCIETLDEIDDWDFSICSMNK 1040
Query: 1021 RSLPTAIWWIVCQNVDIWANHAAKKRL--------------------------------- 1080
RS PTA+WW+VCQNVDIWA HAAKK+L
Sbjct: 1041 RSFPTAVWWVVCQNVDIWAIHAAKKKLKMFLSFLIRTSHPFLTSNDMKIESQQNDGCQQL 1100
Query: 1081 ---------------------------MPSRFCRELKASILSSFHDINISSADWMEKIAK 1140
MPSRFC ELKA++L SFHDI+ SSADWME IA
Sbjct: 1101 NKVSLQQISSSVLSDPIFYEQRFVCRFMPSRFCHELKATVLPSFHDISTSSADWMEVIAT 1160
Query: 1141 LEPVCSSKRIPDENFPLTKAVSHSSDLLQTEDCKLKRDSPPSNVRSRACQHLIDLLCWMP 1200
LE L T DC+ K DSPPSNVR RACQH I+LLCWMP
Sbjct: 1161 LE-------------------------LSTTDCQSKSDSPPSNVRFRACQHFINLLCWMP 1220
Query: 1201 KGNISSRSFSLYTTYVLNLERQLVNAILDNQTTLCSPNQSELLKLFVSCRKALKYIFMAS 1260
KGNISSRSFSLYTT VL LERQLV LD+QTTLCS NQ ELLKLF SCRKALKYIF A
Sbjct: 1221 KGNISSRSFSLYTTNVLELERQLV---LDSQTTLCSENQFELLKLFASCRKALKYIFTAY 1280
Query: 1261 YEAGDRQSSSIPVPSENQFPVSWLFKSVSIVNQLQEASPEGSNRQIKDIIFSLMDHTSYL 1320
YEAGDRQSSS PVPSENQFPVSWLFKSVSIVNQLQ+AS GS+RQIKDIIFSLMDHTSYL
Sbjct: 1281 YEAGDRQSSSTPVPSENQFPVSWLFKSVSIVNQLQDASSGGSDRQIKDIIFSLMDHTSYL 1340
Query: 1321 FLTTSKYQFKNALRLIVIDNKPCKEQCQNVFHELVNGDDLCLDPTHCVEAYNSASQMTIS 1380
FLTTSKYQFKNALRLIVIDNKPC EQ +NV HEL +GDDL L C+EA NSA QMTIS
Sbjct: 1341 FLTTSKYQFKNALRLIVIDNKPCMEQPENVSHELNDGDDLFLGSNRCLEACNSAIQMTIS 1400
Query: 1381 LKEQVESEFISLKKSNVTVGDGKNSANMCKVYSLASCLNGFLWGLASAVDHTDLRNGSRH 1440
LKEQVESE I LKKSNVTVGDGKN NM KVYSLASCLNGFLWGLASA D TDLRN +RH
Sbjct: 1401 LKEQVESELIYLKKSNVTVGDGKNRGNMYKVYSLASCLNGFLWGLASAEDDTDLRNSNRH 1460
Query: 1441 MRSMKLKCEYSSQLNLCINAISELLGLLLEMFLDRDSDRPTKLCDYHTSQDLLVVNEVSV 1500
RSMKLKCE+SSQLNLCINAISELLGL+LEMFLDRDS RP KLCDY TSQD L VNE S
Sbjct: 1461 TRSMKLKCEFSSQLNLCINAISELLGLILEMFLDRDSQRPQKLCDYQTSQDFLGVNEPSG 1520
Query: 1501 KGSSSEVDTSFSKHHKFESSHNDDGTENVSFNRKRLKLENKSSVASILNEADSIELQSLN 1560
KG SSEVDTS SK+ K ESS +DD +N S RKRLKL NKSSVASIL+EA+ IE+QSLN
Sbjct: 1521 KGPSSEVDTSCSKYQKLESSQSDDDNKNTSLKRKRLKLGNKSSVASILSEANLIEMQSLN 1580
Query: 1561 QPVLRGLLKGSYPEVAFALKQLFLAASVILRLHMKYDTVPLSSSFMAILIGISRLLLLEF 1620
+P LRGLLKGSYPE AFALKQLFLAASVILRLHMKYD++PLSSS MAILI ISR LLL+F
Sbjct: 1581 KPFLRGLLKGSYPEAAFALKQLFLAASVILRLHMKYDSIPLSSSSMAILISISRFLLLKF 1640
Query: 1621 VDMVEVPQPFLLACLDGVLKYLEELGHLFPFSDPMQSRNLYSRLIDIQLKAIGKCISLLG 1680
VDMVEVPQPFLL CLDGVLKYLE LGHLFPF+DPMQSRNLYS LI++ L+AIGKCISL G
Sbjct: 1641 VDMVEVPQPFLLTCLDGVLKYLEGLGHLFPFADPMQSRNLYSNLINLHLQAIGKCISLQG 1700
Query: 1681 KRATLASHETESTTKTLDGHLGLFEESSFPGVYRIDEFKTSLRMSFKVFIRKASELHLLS 1740
KRATL SH+TESTTKTLDGHL LFEESSFP +Y ID+FK+SLRMSFKVFIRKASELHLLS
Sbjct: 1701 KRATLTSHDTESTTKTLDGHLCLFEESSFPRIYYIDQFKSSLRMSFKVFIRKASELHLLS 1760
Query: 1741 AIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEFVSGRKCLGVVK 1800
AIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEF SGRKCL VVK
Sbjct: 1761 AIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEFASGRKCLSVVK 1820
Query: 1801 RHIQSLIAGLFSIVLHLQSPQVFYTRMIDTKNKSGPDPGSVILMSVEVLTRVSGKHALFQ 1810
RHIQSLIAGLFSIVLHLQ+PQ+FY+RMIDTKNKS PDPGSVILMSVEVLTRVSGKHALFQ
Sbjct: 1821 RHIQSLIAGLFSIVLHLQTPQIFYSRMIDTKNKSDPDPGSVILMSVEVLTRVSGKHALFQ 1880
BLAST of Sgr023809 vs. NCBI nr
Match:
XP_023533222.1 (uncharacterized protein LOC111795175 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 2746.5 bits (7118), Expect = 0.0e+00
Identity = 1477/2086 (70.81%), Postives = 1606/2086 (76.99%), Query Frame = 0
Query: 1 MKNKKRKLKSPQETERPSKSARLLISPEEEVIDEPERVEKSEQRGLCQESEEGCPWRNLE 60
MK+KKRKLKSPQ +ERPSKSAR L+SPE EV+D E+VEK EQ + QE +E CPWRNLE
Sbjct: 13 MKSKKRKLKSPQTSERPSKSARPLVSPEVEVVDGTEQVEKMEQGEVSQEFDESCPWRNLE 72
Query: 61 LIFLIQNKELDQQKKVEAVFSFVNSKLEERDKYYD------------------------- 120
LIFLIQNKE +QQKKV+AVFSFVNSK E+DKY+D
Sbjct: 73 LIFLIQNKEFNQQKKVDAVFSFVNSKWNEKDKYHDKVKMSRLIVFLSDWVQSLLISSEKK 132
Query: 121 ------------------------------------------------------------ 180
Sbjct: 133 AKNDGGKHHNMGIEPCLDHRCWEVFKFCLEESVKTLIPLNLSKNLLHAFCFVTRSAISLL 192
Query: 181 ------------------------------------------------------------ 240
Sbjct: 193 GDFSSSREELFSGDCFKLYNVVLDCVSLVFSPHLGLSNENLDAWISTIDAMLEFLHKIYV 252
Query: 241 ------------------------------------------------------------ 300
Sbjct: 253 SSLEDKDVGIFAIKFSCMMLKPFAKFLWTHPTKKAGFHNFVNKLLEPLLQLLLDISLKAD 312
Query: 301 -------TNMMKLLEEVLSHALFHTVHIDGFLCLHGSEKVIKSHDEKLEESKAHIKSYHR 360
+MKLLEEVLSHALFHTVHIDGFLCLHGSEKVIKS DEK EESKAHIKSYHR
Sbjct: 313 GCDHCWTRTLMKLLEEVLSHALFHTVHIDGFLCLHGSEKVIKSPDEKSEESKAHIKSYHR 372
Query: 361 HLFDKVQRLVAGKKFLALGAIGELFRVLVVRVKTVKGGS-KFEDTKLNNKMGCVGPLRDD 420
HLFDK+Q+LVAGKKF ALGA+GELF VLVVRVK VKG S EDTKLNNKM +DD
Sbjct: 373 HLFDKMQKLVAGKKFSALGAVGELFHVLVVRVKKVKGVSISSEDTKLNNKM------KDD 432
Query: 421 VSSDASYVLQGSADVPSEKSNYLSSLSTEIRKSLFEFFVQILDPLLHTIDHISAEIKQGP 480
+SS AS L SEKSN SSLSTEIRK LFEFFVQILDPLL TI+HISAEIK G
Sbjct: 433 ISSHASSGL-------SEKSNNQSSLSTEIRKPLFEFFVQILDPLLQTIEHISAEIKLGT 492
Query: 481 ALSDVHCLLKSINNLLASFMKEKVYLRTEDNSEGAYFNFLKKVYDTVMLVSSNLLLLLRH 540
ALSDVHCLLKSINNLLASFMKEKVYLRTEDNSEGAY NFLKKVYDTVMLVSS+LLLL R
Sbjct: 493 ALSDVHCLLKSINNLLASFMKEKVYLRTEDNSEGAYHNFLKKVYDTVMLVSSHLLLLSRR 552
Query: 541 EIENNVDLEVFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVILSYSAINLSFTSIPKQ 600
EI+N+VDLEV+VLA NEILVT+ YLLEIEYDVIGNDLVSLWLVILSYSAINLSFTS+PKQ
Sbjct: 553 EIKNDVDLEVYVLAGNEILVTLSYLLEIEYDVIGNDLVSLWLVILSYSAINLSFTSVPKQ 612
Query: 601 YLLTSKIQELGCQLVVLYGQLRQVNISVVALCKAMRTVLSNEHETEKSYASVMTSLGYEA 660
+LLTSKIQELGCQLV LYGQLRQVNIS+ ALCKAMRT +SNE ETEK YAS MTSLG+EA
Sbjct: 613 HLLTSKIQELGCQLVALYGQLRQVNISIFALCKAMRTAISNEGETEKDYASFMTSLGHEA 672
Query: 661 YGKSVGMLLSSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLNIRNS 720
YGKSVGMLLSSQEIKFAIHKAIKY+PEGQASG+IQQLTED+TETLGWLR CN+N+N RN+
Sbjct: 673 YGKSVGMLLSSQEIKFAIHKAIKYIPEGQASGLIQQLTEDMTETLGWLRQCNMNMNTRNN 732
Query: 721 TSDSTLQTMFLGRGLSEMYDLMLDSLMITSGNAFQIGTSIENLISVIRPCMSCLVGLHSD 780
T D +QT+ LGRGLSE+Y LMLDSLMITSGNAFQ+GTSIENL+SVIRPCMS LVG +D
Sbjct: 733 TEDLNMQTVLLGRGLSEVYALMLDSLMITSGNAFQVGTSIENLVSVIRPCMSNLVGPQAD 792
Query: 781 GAKAFLVAIMGKTRDGMIADEDNCLGFGVTSHWVFVFFFRLYMSCRSLYRQAISLMPPSS 840
GAKAF A+MG+T D M+ADEDNCLGFGVTSHWVFVFF LYMSCR+LYRQAISLMPPSS
Sbjct: 793 GAKAFFAAVMGETCDAMVADEDNCLGFGVTSHWVFVFFLCLYMSCRNLYRQAISLMPPSS 852
Query: 841 SRKMSTAIGDSFIAYSACDWMQRTDWSDEGYFSWIIEPSTSVLAVAQSVSSLYHQGTDVD 900
SRKMS AIGDSF+AYSACDWMQRTDWSDEGYFSWII PS SVL VAQSV SLYHQ T V
Sbjct: 853 SRKMSAAIGDSFVAYSACDWMQRTDWSDEGYFSWIIRPSASVLVVAQSVCSLYHQDTSVG 912
Query: 901 WCPLVYVLHTMTLQRLVDLNRQIDSLEYVHQRNENLMQVEVLGDDGLSVLRKKSKKFGRL 960
W PL+YVL TM LQRLVDLN+QI SLEY++QRNENLMQVEVLGDDGLSVL+KKSKK+ RL
Sbjct: 913 WYPLIYVLLTMALQRLVDLNKQIGSLEYLYQRNENLMQVEVLGDDGLSVLQKKSKKYSRL 972
Query: 961 LSDLRKEAADLTDFMMKHLSLVAKHHILNSTKRNATSNDKSTETLSEIDDWDFSICNVNK 1020
+S LRKEA DLTDFMM+H SLVAK +LNSTK ATSNDKST LSEIDDWDFSICNVNK
Sbjct: 973 VSVLRKEAEDLTDFMMRHFSLVAKRQVLNSTKEVATSNDKSTVMLSEIDDWDFSICNVNK 1032
Query: 1021 RSLPTAIWWIVCQNVDIWANHAAKKRL--------------------------------- 1080
RS PTA+WWIVCQNVDIW NHAAKK+L
Sbjct: 1033 RSFPTAVWWIVCQNVDIWVNHAAKKKLKMFLSFLIRTSHQFLVSSDTKIGRQQTNGFRQL 1092
Query: 1081 ---------------------------MPSRFCRELKASILSSFHDINISSADWMEKIAK 1140
+PSRFCREL AS+LSSFHDIN SS DWME I
Sbjct: 1093 KKVSLQQISSAALSDPIFYEHRFVCRFLPSRFCRELSASLLSSFHDINTSSTDWMEVICT 1152
Query: 1141 LE----PVCSSKRIPDENFPLTKAVSHSSDLLQTEDCKLKRDSPPSNVRSRACQHLIDLL 1200
LE VCS KR PD++ PL K V+HSSD+L TEDCK K DS SN+ RACQHLIDLL
Sbjct: 1153 LERLTNSVCSGKRTPDDSAPLAKTVNHSSDMLYTEDCKWKGDSSRSNLSFRACQHLIDLL 1212
Query: 1201 CWMPKGNISSRSFSLYTTYVLNLERQLVNAILDNQTTLCSPNQSELLKLFVSCRKALKYI 1260
CWMPKGN SSRSFSLYTT+VL LERQLV+A+LDNQT LCS NQ ELLKLF SCRKALKYI
Sbjct: 1213 CWMPKGNFSSRSFSLYTTHVLKLERQLVSALLDNQTVLCS-NQFELLKLFASCRKALKYI 1272
Query: 1261 FMASYEAGDRQSSSIPVPSENQFPVSWLFKSVSIVNQLQEASPEGSNRQIKDIIFSLMDH 1320
FMA YEAG+ QSSSIP+PSENQFPVSWLFKS+SIVN++QEAS G+ +IKDIIFSLMDH
Sbjct: 1273 FMAYYEAGNEQSSSIPLPSENQFPVSWLFKSISIVNRIQEASAGGTATKIKDIIFSLMDH 1332
Query: 1321 TSYLFLTTSKYQFKNALRLIVIDNKPCKEQCQNVFHELVNGDDLCLDPTHCVEAYNSASQ 1380
TSYLFLTTSKYQFKNALRL+VIDNKPCKE+ Q+V HEL +GD + LD THCVE NSA Q
Sbjct: 1333 TSYLFLTTSKYQFKNALRLMVIDNKPCKEEHQDVCHELNDGDGVSLDSTHCVEVCNSAIQ 1392
Query: 1381 MTISLKEQVESEFISLKKSNVTVGDGKNSANMCKVYSLASCLNGFLWGLASAVDHTDLRN 1440
M+ISLKEQVESE ISL+KSNV+VGDGKNSA MCKV SLASCLNGFLWGLAS+VDHTDLRN
Sbjct: 1393 MSISLKEQVESELISLRKSNVSVGDGKNSAQMCKVNSLASCLNGFLWGLASSVDHTDLRN 1452
Query: 1441 GSRHMRSMKLKCEYSSQLNLCINAISELLGLLLEMFLDRDSDRPTKLCDYHTSQDLLVVN 1500
G+R MRSMKLK EYSS+LNLC+NA SELLGL+LEMFLDR+S PTKLCD SQDLLVV+
Sbjct: 1453 GNRRMRSMKLKFEYSSKLNLCMNATSELLGLILEMFLDRNSQWPTKLCDNQPSQDLLVVD 1512
Query: 1501 EVSVKGSSSEVDTSFSKHHKFESSHNDDGTENVSFNRKRLKLENKSSVASILNEADSIEL 1560
E+ VK S SE DTSFSK+ + ESSH DDG+E+ S N+KRLKLENKSSVASILNEA++IE+
Sbjct: 1513 ELPVKHSGSEADTSFSKNRELESSHCDDGSESGSTNKKRLKLENKSSVASILNEANTIEM 1572
Query: 1561 QSLNQPVLRGLLKGSYPEVAFALKQLFLAASVILRLHMKYDTVPLSSSFMAILIGISRLL 1620
QSLNQ L+GLLKGS P+VAFALKQLFLAASVILRLH +Y TVPLSSSFMAILIG SR L
Sbjct: 1573 QSLNQSFLQGLLKGSCPDVAFALKQLFLAASVILRLHKQYGTVPLSSSFMAILIGFSRFL 1632
Query: 1621 LLEFVDMVEVPQPFLLACLDGVLKYLEELGHLFPFSDPMQSRNLYSRLIDIQLKAIGKCI 1680
LLEF +MVEVP+PFL CLDGVLKYLEELGHLFP +DPM+SR+LYSRL+++ LKA+GKCI
Sbjct: 1633 LLEFENMVEVPEPFLFTCLDGVLKYLEELGHLFPSADPMKSRDLYSRLVNLHLKAMGKCI 1692
Query: 1681 SLLGKRATLASHETESTTKTLDGHLGLFEESSFPGVYRIDEFKTSLRMSFKVFIRKASEL 1740
SL KRATLASHETESTTKTLDG GLFEESSFP +Y +DEFK SLRMSFKVFIR+ASEL
Sbjct: 1693 SLQRKRATLASHETESTTKTLDG--GLFEESSFPVIYCMDEFKASLRMSFKVFIREASEL 1752
Query: 1741 HLLSAIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEFVSGRKCL 1800
HLLSAIQAIERALVGVQEGCTA YEL SGSEDGG CSSIVAAGVECLDLVLEFVSGRKCL
Sbjct: 1753 HLLSAIQAIERALVGVQEGCTATYELCSGSEDGGSCSSIVAAGVECLDLVLEFVSGRKCL 1812
Query: 1801 GVVKRHIQSLIAGLFSIVLHLQSPQVFYTRMIDTKNKSGPDPGSVILMSVEVLTRVSGKH 1810
GVVKRHIQSLIAGLFSIVLHLQSP +FY R IDTK +S PDPG+VILMSVEVL RVSGKH
Sbjct: 1813 GVVKRHIQSLIAGLFSIVLHLQSPHIFYVRTIDTKGRSDPDPGAVILMSVEVLARVSGKH 1872
BLAST of Sgr023809 vs. NCBI nr
Match:
KAG6605008.1 (hypothetical protein SDJN03_02325, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 2734.9 bits (7088), Expect = 0.0e+00
Identity = 1472/2086 (70.57%), Postives = 1601/2086 (76.75%), Query Frame = 0
Query: 1 MKNKKRKLKSPQETERPSKSARLLISPEEEVIDEPERVEKSEQRGLCQESEEGCPWRNLE 60
MK+KKRKLKSPQ +ERPSKSAR LISPE EV+D E+VEK EQ + QE +E CPWRNLE
Sbjct: 36 MKSKKRKLKSPQTSERPSKSARHLISPEVEVVDGTEQVEKMEQGEVSQEFDESCPWRNLE 95
Query: 61 LIFLIQNKELDQQKKVEAVFSFVNSKLEERDKYYD------------------------- 120
LIFLIQNKE +QQKKV+AVFSFVNSK E+DKY+D
Sbjct: 96 LIFLIQNKEFNQQKKVDAVFSFVNSKWNEKDKYHDKVKMSRLIVFLSDWVQSLLISSEKK 155
Query: 121 ------------------------------------------------------------ 180
Sbjct: 156 AKNDGGKHHNMAIEPCLDYRCWEVFKFCLEESVKTLIPLNLSKNLLHAFCFVTRSAISLL 215
Query: 181 ------------------------------------------------------------ 240
Sbjct: 216 GDLSSSREELFSGDCFKLYNVVLDCVSLVFSPHLGLSNENLDAWISTIDAMLEFLHKIYV 275
Query: 241 ------------------------------------------------------------ 300
Sbjct: 276 SSLEDKDVGIFAIKFSSMMLKPFAKFLWTHPTKKAGFHNFVNKLLEPLLQLLLDISLKAD 335
Query: 301 -------TNMMKLLEEVLSHALFHTVHIDGFLCLHGSEKVIKSHDEKLEESKAHIKSYHR 360
+MKLLEEVLSHALFHTVHIDGFLCLHGS+KVIKS DEK EESKAHIKSYHR
Sbjct: 336 GCDHCWTRTLMKLLEEVLSHALFHTVHIDGFLCLHGSDKVIKSPDEKSEESKAHIKSYHR 395
Query: 361 HLFDKVQRLVAGKKFLALGAIGELFRVLVVRVKTVKGGS-KFEDTKLNNKMGCVGPLRDD 420
HLFDK+Q+LVAGKKF ALGA+GELF VLVVRVK VKG S EDTKLNNKM RDD
Sbjct: 396 HLFDKMQKLVAGKKFSALGAVGELFHVLVVRVKKVKGVSISSEDTKLNNKM------RDD 455
Query: 421 VSSDASYVLQGSADVPSEKSNYLSSLSTEIRKSLFEFFVQILDPLLHTIDHISAEIKQGP 480
+SS AS L SEKSN SSLSTEIRK LFEFFVQILDPLL TI+ ISAEIK G
Sbjct: 456 ISSHASSGL-------SEKSNNQSSLSTEIRKPLFEFFVQILDPLLQTIEQISAEIKLGT 515
Query: 481 ALSDVHCLLKSINNLLASFMKEKVYLRTEDNSEGAYFNFLKKVYDTVMLVSSNLLLLLRH 540
ALSDVHCLLKSINNLLASFM+EKVYLRTEDNSEGAY NFLKKVYDTVMLVSS+LLLL R
Sbjct: 516 ALSDVHCLLKSINNLLASFMEEKVYLRTEDNSEGAYHNFLKKVYDTVMLVSSHLLLLSRL 575
Query: 541 EIENNVDLEVFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVILSYSAINLSFTSIPKQ 600
EIENN+DLEV+VLA NEILVT+ YLLEIEYDVIGNDLVSLWLVILSYSAINLSFTS+PKQ
Sbjct: 576 EIENNIDLEVYVLAGNEILVTLSYLLEIEYDVIGNDLVSLWLVILSYSAINLSFTSVPKQ 635
Query: 601 YLLTSKIQELGCQLVVLYGQLRQVNISVVALCKAMRTVLSNEHETEKSYASVMTSLGYEA 660
+LLTSKIQELGCQLV LYGQLRQVN+S+ ALCKAMRT +SNE ETEK YAS MTSLG+EA
Sbjct: 636 HLLTSKIQELGCQLVALYGQLRQVNVSIFALCKAMRTAISNEGETEKDYASFMTSLGHEA 695
Query: 661 YGKSVGMLLSSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLNIRNS 720
YGKSVGMLLSSQEIKFAIHKAIKY+PEGQASGIIQQLTED+TETLGWLR CN+N+N RN+
Sbjct: 696 YGKSVGMLLSSQEIKFAIHKAIKYIPEGQASGIIQQLTEDMTETLGWLRQCNMNMNTRNN 755
Query: 721 TSDSTLQTMFLGRGLSEMYDLMLDSLMITSGNAFQIGTSIENLISVIRPCMSCLVGLHSD 780
T D +QT+ LGRGLSE+Y LMLDSLMITSGNAFQ+GTSIENL+SVIRPCMS LVG +D
Sbjct: 756 TEDLNMQTVLLGRGLSEVYALMLDSLMITSGNAFQVGTSIENLVSVIRPCMSNLVGPQAD 815
Query: 781 GAKAFLVAIMGKTRDGMIADEDNCLGFGVTSHWVFVFFFRLYMSCRSLYRQAISLMPPSS 840
GAKAF A++GKT D M+ADEDNCLGFGVTSHWVFVFF LYMSCR+LYRQAISLMPPSS
Sbjct: 816 GAKAFFAAVIGKTCDDMVADEDNCLGFGVTSHWVFVFFLCLYMSCRNLYRQAISLMPPSS 875
Query: 841 SRKMSTAIGDSFIAYSACDWMQRTDWSDEGYFSWIIEPSTSVLAVAQSVSSLYHQGTDVD 900
SRKMS AIGDSF+AYSACDWMQRTDWSDEGYFSWII PS SVL VAQSV SLYHQ T V
Sbjct: 876 SRKMSAAIGDSFVAYSACDWMQRTDWSDEGYFSWIIRPSASVLVVAQSVCSLYHQDTSVG 935
Query: 901 WCPLVYVLHTMTLQRLVDLNRQIDSLEYVHQRNENLMQVEVLGDDGLSVLRKKSKKFGRL 960
W PL+YVL TM LQRLVDLN+QI SLEY++QRNENLMQVEVLGDDGLSVL+KKSKK+ RL
Sbjct: 936 WYPLIYVLLTMALQRLVDLNKQIGSLEYLYQRNENLMQVEVLGDDGLSVLQKKSKKYSRL 995
Query: 961 LSDLRKEAADLTDFMMKHLSLVAKHHILNSTKRNATSNDKSTETLSEIDDWDFSICNVNK 1020
+S LRKEA DLTDFMM+H SLVAK +LNSTK ATSNDKST LSEIDDWDFSICNVNK
Sbjct: 996 VSVLRKEAEDLTDFMMRHFSLVAKRQVLNSTKEVATSNDKSTVMLSEIDDWDFSICNVNK 1055
Query: 1021 RSLPTAIWWIVCQNVDIWANHAAKKRL--------------------------------- 1080
RS PTA+WWIVCQNVDIW NHAAKK+L
Sbjct: 1056 RSFPTAVWWIVCQNVDIWVNHAAKKKLKMFLSFLIRTSHQFLVSSDTKIGRQQTNGFRQL 1115
Query: 1081 ---------------------------MPSRFCRELKASILSSFHDINISSADWMEKIAK 1140
+PSRFCREL AS+LSSFHDIN SS DWME +
Sbjct: 1116 KKVSLQQISSAALSDPIFYEHGFVCRFLPSRFCRELSASLLSSFHDINTSSTDWMEVLCT 1175
Query: 1141 LE----PVCSSKRIPDENFPLTKAVSHSSDLLQTEDCKLKRDSPPSNVRSRACQHLIDLL 1200
LE VCS KR PD++ PL K V+HSSD+L TEDCK K DS SN+ RACQHLIDLL
Sbjct: 1176 LERLTTSVCSGKRTPDDSSPLAKTVNHSSDMLYTEDCKWKGDSSQSNLSFRACQHLIDLL 1235
Query: 1201 CWMPKGNISSRSFSLYTTYVLNLERQLVNAILDNQTTLCSPNQSELLKLFVSCRKALKYI 1260
CWMPKGN SSRSFSLYTT+VL LERQLV+A+LDNQT LCS NQ ELLKLF SCRKALKYI
Sbjct: 1236 CWMPKGNFSSRSFSLYTTHVLKLERQLVSALLDNQTVLCS-NQFELLKLFASCRKALKYI 1295
Query: 1261 FMASYEAGDRQSSSIPVPSENQFPVSWLFKSVSIVNQLQEASPEGSNRQIKDIIFSLMDH 1320
FMA YEA + QSSSIP+PSE+QFPVSWLFKS+SIVN++QEAS G+ +IKDIIFSLMDH
Sbjct: 1296 FMAYYEARNEQSSSIPLPSESQFPVSWLFKSISIVNRIQEASAGGTATKIKDIIFSLMDH 1355
Query: 1321 TSYLFLTTSKYQFKNALRLIVIDNKPCKEQCQNVFHELVNGDDLCLDPTHCVEAYNSASQ 1380
TSYLFLTTSKYQFKNAL+L+VIDNK C+E Q+V HEL +GD + LD THCVE NSA Q
Sbjct: 1356 TSYLFLTTSKYQFKNALQLMVIDNKTCEEH-QDVCHELNDGDGVSLDSTHCVEVCNSAIQ 1415
Query: 1381 MTISLKEQVESEFISLKKSNVTVGDGKNSANMCKVYSLASCLNGFLWGLASAVDHTDLRN 1440
M+ISLKEQVESE ISL+KSNV+VGDGKNSA MCKV SLASCLNGFLWGLASAVDHTDLRN
Sbjct: 1416 MSISLKEQVESELISLRKSNVSVGDGKNSAQMCKVNSLASCLNGFLWGLASAVDHTDLRN 1475
Query: 1441 GSRHMRSMKLKCEYSSQLNLCINAISELLGLLLEMFLDRDSDRPTKLCDYHTSQDLLVVN 1500
G+R RSMKLK EYSS+LNLC+NA SELL L+LEMFLDRDS PTKLCDY SQDLLVV+
Sbjct: 1476 GNRRTRSMKLKFEYSSKLNLCMNATSELLDLILEMFLDRDSQWPTKLCDYQPSQDLLVVD 1535
Query: 1501 EVSVKGSSSEVDTSFSKHHKFESSHNDDGTENVSFNRKRLKLENKSSVASILNEADSIEL 1560
E+ VK S SE DTSFSKH + ESSH DDG+E+ N+KRLKLENKSSVASILNEA++IE+
Sbjct: 1536 ELPVKHSGSEADTSFSKHRELESSHCDDGSESGGTNKKRLKLENKSSVASILNEANTIEM 1595
Query: 1561 QSLNQPVLRGLLKGSYPEVAFALKQLFLAASVILRLHMKYDTVPLSSSFMAILIGISRLL 1620
QS NQ L+GLLKGSYP+VAFALKQLFLAASVILRLH +Y T+PLSSSFMAILIG SR L
Sbjct: 1596 QSFNQSFLQGLLKGSYPDVAFALKQLFLAASVILRLHKQYGTIPLSSSFMAILIGFSRFL 1655
Query: 1621 LLEFVDMVEVPQPFLLACLDGVLKYLEELGHLFPFSDPMQSRNLYSRLIDIQLKAIGKCI 1680
LLEF +MVEVP+PFL ACLDGVLKYLEELGHLFP +DPM+SR+LYSRL+++ LKA+GKCI
Sbjct: 1656 LLEFENMVEVPEPFLFACLDGVLKYLEELGHLFPSADPMKSRDLYSRLVNLHLKAMGKCI 1715
Query: 1681 SLLGKRATLASHETESTTKTLDGHLGLFEESSFPGVYRIDEFKTSLRMSFKVFIRKASEL 1740
SL KRATLASHETESTTKTLDG GLFEESSFP +Y +DEFK SLRMSFKVFIR+ASEL
Sbjct: 1716 SLQRKRATLASHETESTTKTLDG--GLFEESSFPVIYCMDEFKASLRMSFKVFIREASEL 1775
Query: 1741 HLLSAIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEFVSGRKCL 1800
HLLSAIQAIERALVGVQEGCTA YEL SGSEDGG CSSIVAAG+ECLDLVLEFVSGRKCL
Sbjct: 1776 HLLSAIQAIERALVGVQEGCTATYELCSGSEDGGSCSSIVAAGIECLDLVLEFVSGRKCL 1835
Query: 1801 GVVKRHIQSLIAGLFSIVLHLQSPQVFYTRMIDTKNKSGPDPGSVILMSVEVLTRVSGKH 1810
GVVKRHIQSLIAGLFSIVLHLQSP +FY R IDTK +S PDPG+VILMSVEVL RVSGKH
Sbjct: 1836 GVVKRHIQSLIAGLFSIVLHLQSPHIFYVRTIDTKGRSDPDPGAVILMSVEVLARVSGKH 1895
BLAST of Sgr023809 vs. NCBI nr
Match:
XP_022947378.1 (uncharacterized protein LOC111451261 isoform X1 [Cucurbita moschata])
HSP 1 Score: 2731.8 bits (7080), Expect = 0.0e+00
Identity = 1471/2086 (70.52%), Postives = 1600/2086 (76.70%), Query Frame = 0
Query: 1 MKNKKRKLKSPQETERPSKSARLLISPEEEVIDEPERVEKSEQRGLCQESEEGCPWRNLE 60
MK+KKRKLKSPQ +ERPSKSAR LISPE EV+D E+VEK EQ + QE +E CPWRNLE
Sbjct: 13 MKSKKRKLKSPQTSERPSKSARHLISPEVEVVDGTEQVEKMEQGEVSQEFDESCPWRNLE 72
Query: 61 LIFLIQNKELDQQKKVEAVFSFVNSKLEERDKYYD------------------------- 120
LIFLIQNKE +QQKKV+AVFSFVNSK E+DKY+D
Sbjct: 73 LIFLIQNKEFNQQKKVDAVFSFVNSKWNEKDKYHDKVKMSRLIVFLSDWVQSLLISSEKK 132
Query: 121 ------------------------------------------------------------ 180
Sbjct: 133 AKNDGGKHHNMAIEPCLDYRCWEVFKFCLEESVKTLIPLNLSKNLLHAFCFVTRSAISLL 192
Query: 181 ------------------------------------------------------------ 240
Sbjct: 193 GDLSSSREELFSGDCFKLYNVVLDCVSLVFSPHLGLSNENLDAWISTIDAMLEFLHKIYV 252
Query: 241 ------------------------------------------------------------ 300
Sbjct: 253 SSLEDKDVGIFAIKFSSMMLKPFAKFLWTHPTKKAGFHNFVNKLLEPLLQLLLDISLKAD 312
Query: 301 -------TNMMKLLEEVLSHALFHTVHIDGFLCLHGSEKVIKSHDEKLEESKAHIKSYHR 360
+MKLLEEVLSHALFHTVHIDGFLCLHGS+KVIKS DEK EESKAHIKSYHR
Sbjct: 313 GCDHCWTRTLMKLLEEVLSHALFHTVHIDGFLCLHGSDKVIKSPDEKSEESKAHIKSYHR 372
Query: 361 HLFDKVQRLVAGKKFLALGAIGELFRVLVVRVKTVKGGS-KFEDTKLNNKMGCVGPLRDD 420
HLFDK+Q+LVAGKKF ALGA+GELF VLVVRVK VKG S EDTKLNNKM RDD
Sbjct: 373 HLFDKMQKLVAGKKFSALGAVGELFHVLVVRVKKVKGVSISSEDTKLNNKM------RDD 432
Query: 421 VSSDASYVLQGSADVPSEKSNYLSSLSTEIRKSLFEFFVQILDPLLHTIDHISAEIKQGP 480
+SS AS L SEKSN SSLSTEIRK LFEFFVQILDPLL TI+ ISAEIK G
Sbjct: 433 ISSHASSGL-------SEKSNNQSSLSTEIRKPLFEFFVQILDPLLQTIEQISAEIKLGT 492
Query: 481 ALSDVHCLLKSINNLLASFMKEKVYLRTEDNSEGAYFNFLKKVYDTVMLVSSNLLLLLRH 540
ALSDVHCLLKSINNLLASFM+EKVYLRTEDNSEGAY NFLKKVYDTVMLVSS+LLLL R
Sbjct: 493 ALSDVHCLLKSINNLLASFMEEKVYLRTEDNSEGAYHNFLKKVYDTVMLVSSHLLLLSRL 552
Query: 541 EIENNVDLEVFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVILSYSAINLSFTSIPKQ 600
EIENN+DLEV+VLA NEILVT+ YLLEIEYDVIGNDLVSLWLVILSYSAINLSFTS+PKQ
Sbjct: 553 EIENNIDLEVYVLAGNEILVTLSYLLEIEYDVIGNDLVSLWLVILSYSAINLSFTSVPKQ 612
Query: 601 YLLTSKIQELGCQLVVLYGQLRQVNISVVALCKAMRTVLSNEHETEKSYASVMTSLGYEA 660
+LLTSKIQELGCQLV LYGQLRQVN+S+ ALCKAMRT +SNE E+EK YAS MTSLG+EA
Sbjct: 613 HLLTSKIQELGCQLVALYGQLRQVNVSIFALCKAMRTAISNEGESEKDYASFMTSLGHEA 672
Query: 661 YGKSVGMLLSSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLNIRNS 720
YGKSVGMLLSSQEIKFAIHKAIKY+PEGQASGIIQQLTED+TETLGWLR CN+N+N RN+
Sbjct: 673 YGKSVGMLLSSQEIKFAIHKAIKYIPEGQASGIIQQLTEDMTETLGWLRQCNMNMNTRNN 732
Query: 721 TSDSTLQTMFLGRGLSEMYDLMLDSLMITSGNAFQIGTSIENLISVIRPCMSCLVGLHSD 780
T D +QT+ LGRGLSE+Y LMLDSLMITSGNAFQ+GTSIENL+SVIRPCMS LVG +D
Sbjct: 733 TEDLNMQTVLLGRGLSEVYALMLDSLMITSGNAFQVGTSIENLVSVIRPCMSNLVGPQAD 792
Query: 781 GAKAFLVAIMGKTRDGMIADEDNCLGFGVTSHWVFVFFFRLYMSCRSLYRQAISLMPPSS 840
GAKAF A++GKT D M+ADEDNCLGFGVTSHWVFVFF LYMSCR+LYRQAISLMPPSS
Sbjct: 793 GAKAFFAAVIGKTCDDMVADEDNCLGFGVTSHWVFVFFLCLYMSCRNLYRQAISLMPPSS 852
Query: 841 SRKMSTAIGDSFIAYSACDWMQRTDWSDEGYFSWIIEPSTSVLAVAQSVSSLYHQGTDVD 900
SRKMS AIGDSF+AYSACDWMQRTDWSDEGYFSWII PS SVL VAQSV SLYHQ T V
Sbjct: 853 SRKMSAAIGDSFVAYSACDWMQRTDWSDEGYFSWIIRPSASVLVVAQSVCSLYHQDTSVG 912
Query: 901 WCPLVYVLHTMTLQRLVDLNRQIDSLEYVHQRNENLMQVEVLGDDGLSVLRKKSKKFGRL 960
W PL+YVL TM LQRLVDLN+QI SLEY++QRNENLMQVEVLGDDGLSVL+KKSKK+ RL
Sbjct: 913 WYPLIYVLLTMALQRLVDLNKQIGSLEYLYQRNENLMQVEVLGDDGLSVLQKKSKKYSRL 972
Query: 961 LSDLRKEAADLTDFMMKHLSLVAKHHILNSTKRNATSNDKSTETLSEIDDWDFSICNVNK 1020
+S LRKEA DLTDFMM+H SLVAK +LNSTK ATSNDKST LSEIDDWDFSICNVNK
Sbjct: 973 VSVLRKEAEDLTDFMMRHFSLVAKRQVLNSTKEVATSNDKSTVMLSEIDDWDFSICNVNK 1032
Query: 1021 RSLPTAIWWIVCQNVDIWANHAAKKRL--------------------------------- 1080
RS PTA+WWIVCQNVDIW NHAAKK+L
Sbjct: 1033 RSFPTAVWWIVCQNVDIWVNHAAKKKLKMFLSFLIRTSHQFLVSSDTKIGRQQTNGFRQL 1092
Query: 1081 ---------------------------MPSRFCRELKASILSSFHDINISSADWMEKIAK 1140
+PSRFCREL AS+LSSFHDIN SS DWME +
Sbjct: 1093 KKVSLQQISSAALSDPIFYEHGFVCRFLPSRFCRELSASLLSSFHDINTSSTDWMEVLCT 1152
Query: 1141 LE----PVCSSKRIPDENFPLTKAVSHSSDLLQTEDCKLKRDSPPSNVRSRACQHLIDLL 1200
LE VCS KR PD++ PL K V+HSSD+L TEDCK K DS SN+ RACQHLIDLL
Sbjct: 1153 LERLTTSVCSGKRTPDDSSPLAKTVNHSSDMLYTEDCKWKGDSSQSNLSFRACQHLIDLL 1212
Query: 1201 CWMPKGNISSRSFSLYTTYVLNLERQLVNAILDNQTTLCSPNQSELLKLFVSCRKALKYI 1260
CWMPKGN SSRSFSLYTT+VL LERQLV+A+LDNQT LCS NQ ELLKLF SCRKALKYI
Sbjct: 1213 CWMPKGNFSSRSFSLYTTHVLKLERQLVSALLDNQTVLCS-NQFELLKLFASCRKALKYI 1272
Query: 1261 FMASYEAGDRQSSSIPVPSENQFPVSWLFKSVSIVNQLQEASPEGSNRQIKDIIFSLMDH 1320
FMA YEA + QSSSIP+PSE+QFPVSWLFKS+SIVN++QEAS G+ +IKDIIFSLMDH
Sbjct: 1273 FMAYYEARNEQSSSIPLPSESQFPVSWLFKSISIVNRIQEASAGGTATKIKDIIFSLMDH 1332
Query: 1321 TSYLFLTTSKYQFKNALRLIVIDNKPCKEQCQNVFHELVNGDDLCLDPTHCVEAYNSASQ 1380
TSYLFLTTSKYQFKNALRL+VIDNK C+E Q+V HEL +GD + LD THCVE NSA Q
Sbjct: 1333 TSYLFLTTSKYQFKNALRLMVIDNKTCEEH-QDVCHELNDGDGVSLDSTHCVEVCNSAIQ 1392
Query: 1381 MTISLKEQVESEFISLKKSNVTVGDGKNSANMCKVYSLASCLNGFLWGLASAVDHTDLRN 1440
M+ISLKEQVESE ISL+KSNV+VGDGKNSA MCKV SLASCLNGFLWGLASAVDHTDLRN
Sbjct: 1393 MSISLKEQVESELISLRKSNVSVGDGKNSAQMCKVNSLASCLNGFLWGLASAVDHTDLRN 1452
Query: 1441 GSRHMRSMKLKCEYSSQLNLCINAISELLGLLLEMFLDRDSDRPTKLCDYHTSQDLLVVN 1500
G+R RSMKLK EYSS+LNLC+NA SELL L+LEMFLDRDS PTKLCDY SQDLLVV+
Sbjct: 1453 GNRRTRSMKLKFEYSSKLNLCMNATSELLDLILEMFLDRDSQWPTKLCDYQPSQDLLVVD 1512
Query: 1501 EVSVKGSSSEVDTSFSKHHKFESSHNDDGTENVSFNRKRLKLENKSSVASILNEADSIEL 1560
E+ VK S SE DTSFSKH + ESSH DDG+E+ N+KRLKLENKSSVASILNEA++IE+
Sbjct: 1513 ELPVKHSGSEADTSFSKHRELESSHCDDGSESGGTNKKRLKLENKSSVASILNEANTIEM 1572
Query: 1561 QSLNQPVLRGLLKGSYPEVAFALKQLFLAASVILRLHMKYDTVPLSSSFMAILIGISRLL 1620
QS NQ L+GLLKGSYP+VAFALKQLFLAASVILRLH +Y T+PLSSSFMAILIG SR L
Sbjct: 1573 QSFNQSFLQGLLKGSYPDVAFALKQLFLAASVILRLHKQYGTIPLSSSFMAILIGFSRFL 1632
Query: 1621 LLEFVDMVEVPQPFLLACLDGVLKYLEELGHLFPFSDPMQSRNLYSRLIDIQLKAIGKCI 1680
LLEF +MVEVP+PFL ACLDGVLKYLEELGHLFP +DPM+SR+LYSRL+++ LKA+GKCI
Sbjct: 1633 LLEFENMVEVPEPFLFACLDGVLKYLEELGHLFPSADPMKSRDLYSRLVNLHLKAMGKCI 1692
Query: 1681 SLLGKRATLASHETESTTKTLDGHLGLFEESSFPGVYRIDEFKTSLRMSFKVFIRKASEL 1740
SL KRATLASHETESTTKTLDG GLFEESSFP +Y +DEFK SLRMSFKVFIR+ASEL
Sbjct: 1693 SLQRKRATLASHETESTTKTLDG--GLFEESSFPVIYCMDEFKASLRMSFKVFIREASEL 1752
Query: 1741 HLLSAIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEFVSGRKCL 1800
HLLSAIQAIERALVGVQEGCTA YEL SGSEDGG CSSIVAAG+ECLDLVLEFVSGRKCL
Sbjct: 1753 HLLSAIQAIERALVGVQEGCTATYELCSGSEDGGSCSSIVAAGIECLDLVLEFVSGRKCL 1812
Query: 1801 GVVKRHIQSLIAGLFSIVLHLQSPQVFYTRMIDTKNKSGPDPGSVILMSVEVLTRVSGKH 1810
GVVKRHIQSLIAGLFSIVLHLQSP +FY R IDTK +S PDPG+VILMSVEVL RVSGKH
Sbjct: 1813 GVVKRHIQSLIAGLFSIVLHLQSPHIFYVRTIDTKGRSDPDPGAVILMSVEVLARVSGKH 1872
BLAST of Sgr023809 vs. ExPASy TrEMBL
Match:
A0A6J1CY73 (uncharacterized protein LOC111015534 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111015534 PE=4 SV=1)
HSP 1 Score: 2789.6 bits (7230), Expect = 0.0e+00
Identity = 1512/2082 (72.62%), Postives = 1605/2082 (77.09%), Query Frame = 0
Query: 1 MKNKKRKLKSPQETERPSKSARLLISPEEEVIDEPERVEKSEQRGLCQESEEGCPWRNLE 60
MKNKKRKLKSPQ+ ERP KSAR++I EEEV+DEP RVEKSEQR L + SEEG PWRNLE
Sbjct: 21 MKNKKRKLKSPQKAERPPKSARVIIPLEEEVVDEPGRVEKSEQRELFRGSEEGRPWRNLE 80
Query: 61 LIFLIQNKELDQQKKVEAVFSFVNSKLEERDKYYDT------------------------ 120
LIFLIQNKELDQQKKVEAVFSFV+SKL+E DK YDT
Sbjct: 81 LIFLIQNKELDQQKKVEAVFSFVDSKLKEEDKCYDTVKISRLIVFLSDWVQSLLISFEKK 140
Query: 121 ------------------------------------------------------------ 180
Sbjct: 141 AKNDGGKHLKMAIEPCLDYRCWEVFKFCLEESVKMNITLNLSRNLLHAFCFVTRNAISLL 200
Query: 181 ------------------------------------------------------------ 240
Sbjct: 201 DVSSSSKEELFAGDCLKLYNCVQDCVSLVFSSHLGLSNDNLDAWISTIDAVLEFLHKIHV 260
Query: 241 ------------------------------------------------------------ 300
Sbjct: 261 NSLEGEDVGIFATLFSRMMLEPFAKFLWIHPTKKTGFHNFVNKLLEPLLQLLRDLSLKAD 320
Query: 301 --------NMMKLLEEVLSHALFHTVHIDGFLCLHGSEKVIKSHDEKLEESKAHIKSYHR 360
+MKLLE+VLSHALFHTVHIDGFLCLHGSEKV KSHDEKLEESKAH+KSYHR
Sbjct: 321 GCNHGRTRTLMKLLEDVLSHALFHTVHIDGFLCLHGSEKVTKSHDEKLEESKAHMKSYHR 380
Query: 361 HLFDKVQRLVAGKKFLALGAIGELFRVLVVRVKTVKGGSK-FEDTKLNNKMGCVGPLRDD 420
HLFDKVQ+LVA KKFLALGA+GELF VLVVRV VKG S FEDTKLNNKMGC G LRDD
Sbjct: 381 HLFDKVQKLVAEKKFLALGAVGELFDVLVVRVNKVKGASMLFEDTKLNNKMGCFGHLRDD 440
Query: 421 VSSDASYVLQGSADVPSEKSNYLSSLSTEIRKSLFEFFVQILDPLLHTIDHISAEIKQGP 480
SS AS LQGSAD SEKSNY SSLSTEIRKSLFEFFVQILDPLL TIDHISAEIK GP
Sbjct: 441 TSSHASRALQGSADGLSEKSNYSSSLSTEIRKSLFEFFVQILDPLLLTIDHISAEIKLGP 500
Query: 481 ALSDVHCLLKSINNLLASFMKEKVYLRTEDNSEGAYFNFLKKVYDTVMLVSSNLLLLLRH 540
ALSDV LLKSINNLLASFMK KVYLRTEDNSEGAY NFLKKVYD VM VSSNLL L RH
Sbjct: 501 ALSDVCYLLKSINNLLASFMKGKVYLRTEDNSEGAYLNFLKKVYDKVMFVSSNLLSLSRH 560
Query: 541 EIENNVDLEVFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVILSYSAINLSFTSIPKQ 600
E+ENN+D VFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVI+SYSAINLSFTSIP+Q
Sbjct: 561 ELENNIDQGVFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVIISYSAINLSFTSIPEQ 620
Query: 601 YLLTSKIQELGCQLVVLYGQLRQVNISVVALCKAMRTVLSNEHETEKSYASVMTSLGYEA 660
+LLTS+IQELGCQLVVLYGQLRQVNI + ALCKAMRTV+SNE E EKSYAS MTSLG+EA
Sbjct: 621 HLLTSRIQELGCQLVVLYGQLRQVNIIIFALCKAMRTVISNEGENEKSYASFMTSLGHEA 680
Query: 661 YGKSVGMLLSSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLNIRNS 720
YGKSVG L+SSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLN RNS
Sbjct: 681 YGKSVGTLVSSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLNTRNS 740
Query: 721 TSDSTLQTMFLGRGLSEMYDLMLDSLMITSGNAFQIGTSIENLISVIRPCMSCLVGLHSD 780
S L+T+ LGRGLSEMY LMLDSLMITSGNA QIGTSI+NLISV+RPCMS LVGL SD
Sbjct: 741 KSCLNLKTLLLGRGLSEMYALMLDSLMITSGNALQIGTSIDNLISVLRPCMSILVGLQSD 800
Query: 781 GAKAFLVAIMGKTRDGMIADEDNCLGFGVTSHWVFVFFFRLYMSCRSLYRQAISLMPPSS 840
GAK F+VAIM K D ++ADEDNC GFGV SHWVFVFFFRLYMSCRSLYRQAISLMPP S
Sbjct: 801 GAKEFVVAIMEKKCDDVVADEDNCQGFGVISHWVFVFFFRLYMSCRSLYRQAISLMPPGS 860
Query: 841 SRKMSTAIGDSFIAYSACDWMQRTDWSDEGYFSWIIEPSTSVLAVAQSVSSLYHQGTDVD 900
SRKMS AIGDS +AYSACDWMQRTDWSDEGYFSWII+PS SVLAVAQS+ SLYHQGTD D
Sbjct: 861 SRKMSAAIGDSMVAYSACDWMQRTDWSDEGYFSWIIQPSASVLAVAQSICSLYHQGTDED 920
Query: 901 WCPLVYVLHTMTLQRLVDLNRQIDSLEYVHQRNENLMQVEVLGDDGLSVLRKKSKKFGRL 960
W PL+YVL TM LQRLVDLNRQIDSLEY+HQRNENLMQVEVLGDD LSVLRKKSKKFGRL
Sbjct: 921 WYPLIYVLLTMALQRLVDLNRQIDSLEYLHQRNENLMQVEVLGDDDLSVLRKKSKKFGRL 980
Query: 961 LSDLRKEAADLTDFMMKHLSLVAKHHILNSTKRNATSNDKSTETLSEIDDWDFSICNVNK 1020
+S L+KEAADLTDFMM HLSL+AK ILN K ATSN+K ETL EIDDWDFSIC++NK
Sbjct: 981 VSVLQKEAADLTDFMMSHLSLIAKRQILNPAK-IATSNEKCIETLDEIDDWDFSICSMNK 1040
Query: 1021 RSLPTAIWWIVCQNVDIWANHAAKKRL--------------------------------- 1080
RS PTA+WW+VCQNVDIWA HAAKK+L
Sbjct: 1041 RSFPTAVWWVVCQNVDIWAIHAAKKKLKMFLSFLIRTSHPFLTSNDMKIESQQNDGCQQL 1100
Query: 1081 ---------------------------MPSRFCRELKASILSSFHDINISSADWMEKIAK 1140
MPSRFC ELKA++L SFHDI+ SSADWME IA
Sbjct: 1101 NKVSLQQISSSVLSDPIFYEQRFVCRFMPSRFCHELKATVLPSFHDISTSSADWMEVIAT 1160
Query: 1141 LEPVCSSKRIPDENFPLTKAVSHSSDLLQTEDCKLKRDSPPSNVRSRACQHLIDLLCWMP 1200
LE L TEDC+ K DSPPSNVR RACQH I+LLCWMP
Sbjct: 1161 LE------------------------LSTTEDCQSKSDSPPSNVRFRACQHFINLLCWMP 1220
Query: 1201 KGNISSRSFSLYTTYVLNLERQLVNAILDNQTTLCSPNQSELLKLFVSCRKALKYIFMAS 1260
KGNISSRSFSLYTT VL LERQLV LD+QTTLCS NQ ELLKLF SCRKALKYIF A
Sbjct: 1221 KGNISSRSFSLYTTNVLELERQLV---LDSQTTLCSENQFELLKLFASCRKALKYIFTAY 1280
Query: 1261 YEAGDRQSSSIPVPSENQFPVSWLFKSVSIVNQLQEASPEGSNRQIKDIIFSLMDHTSYL 1320
YEAGDRQSSS PVPSENQFPVSWLFKSVSIVNQLQ+AS GS+RQIKDIIFSLMDHTSYL
Sbjct: 1281 YEAGDRQSSSTPVPSENQFPVSWLFKSVSIVNQLQDASSGGSDRQIKDIIFSLMDHTSYL 1340
Query: 1321 FLTTSKYQFKNALRLIVIDNKPCKEQCQNVFHELVNGDDLCLDPTHCVEAYNSASQMTIS 1380
FLTTSKYQFKNALRLIVIDNKPC EQ +NV HEL +GDDL L C+EA NSA QMTIS
Sbjct: 1341 FLTTSKYQFKNALRLIVIDNKPCMEQPENVSHELNDGDDLFLGSNRCLEACNSAIQMTIS 1400
Query: 1381 LKEQVESEFISLKKSNVTVGDGKNSANMCKVYSLASCLNGFLWGLASAVDHTDLRNGSRH 1440
LKEQVESE I LKKSNVTVGDGKN NM KVYSLASCLNGFLWGLASA D TDLRN +RH
Sbjct: 1401 LKEQVESELIYLKKSNVTVGDGKNRGNMYKVYSLASCLNGFLWGLASAEDDTDLRNSNRH 1460
Query: 1441 MRSMKLKCEYSSQLNLCINAISELLGLLLEMFLDRDSDRPTKLCDYHTSQDLLVVNEVSV 1500
RSMKLKCE+SSQLNLCINAISELLGL+LEMFLDRDS RP KLCDY TSQD L VNE S
Sbjct: 1461 TRSMKLKCEFSSQLNLCINAISELLGLILEMFLDRDSQRPQKLCDYQTSQDFLGVNEPSG 1520
Query: 1501 KGSSSEVDTSFSKHHKFESSHNDDGTENVSFNRKRLKLENKSSVASILNEADSIELQSLN 1560
KG SSEVDTS SK+ K ESS +DD +N S RKRLKL NKSSVASIL+EA+ IE+QSLN
Sbjct: 1521 KGPSSEVDTSCSKYQKLESSQSDDDNKNTSLKRKRLKLGNKSSVASILSEANLIEMQSLN 1580
Query: 1561 QPVLRGLLKGSYPEVAFALKQLFLAASVILRLHMKYDTVPLSSSFMAILIGISRLLLLEF 1620
+P LRGLLKGSYPE AFALKQLFLAASVILRLHMKYD++PLSSS MAILI ISR LLL+F
Sbjct: 1581 KPFLRGLLKGSYPEAAFALKQLFLAASVILRLHMKYDSIPLSSSSMAILISISRFLLLKF 1640
Query: 1621 VDMVEVPQPFLLACLDGVLKYLEELGHLFPFSDPMQSRNLYSRLIDIQLKAIGKCISLLG 1680
VDMVEVPQPFLL CLDGVLKYLE LGHLFPF+DPMQSRNLYS LI++ L+AIGKCISL G
Sbjct: 1641 VDMVEVPQPFLLTCLDGVLKYLEGLGHLFPFADPMQSRNLYSNLINLHLQAIGKCISLQG 1700
Query: 1681 KRATLASHETESTTKTLDGHLGLFEESSFPGVYRIDEFKTSLRMSFKVFIRKASELHLLS 1740
KRATL SH+TESTTKTLDGHL LFEESSFP +Y ID+FK+SLRMSFKVFIRKASELHLLS
Sbjct: 1701 KRATLTSHDTESTTKTLDGHLCLFEESSFPRIYYIDQFKSSLRMSFKVFIRKASELHLLS 1760
Query: 1741 AIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEFVSGRKCLGVVK 1800
AIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEF SGRKCL VVK
Sbjct: 1761 AIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEFASGRKCLSVVK 1820
Query: 1801 RHIQSLIAGLFSIVLHLQSPQVFYTRMIDTKNKSGPDPGSVILMSVEVLTRVSGKHALFQ 1810
RHIQSLIAGLFSIVLHLQ+PQ+FY+RMIDTKNKS PDPGSVILMSVEVLTRVSGKHALFQ
Sbjct: 1821 RHIQSLIAGLFSIVLHLQTPQIFYSRMIDTKNKSDPDPGSVILMSVEVLTRVSGKHALFQ 1880
BLAST of Sgr023809 vs. ExPASy TrEMBL
Match:
A0A6J1CXQ2 (uncharacterized protein LOC111015534 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111015534 PE=4 SV=1)
HSP 1 Score: 2788.1 bits (7226), Expect = 0.0e+00
Identity = 1511/2082 (72.57%), Postives = 1604/2082 (77.04%), Query Frame = 0
Query: 1 MKNKKRKLKSPQETERPSKSARLLISPEEEVIDEPERVEKSEQRGLCQESEEGCPWRNLE 60
MKNKKRKLKSPQ+ ERP KSAR++I EEEV+DEP RVEKSEQR L + SEEG PWRNLE
Sbjct: 21 MKNKKRKLKSPQKAERPPKSARVIIPLEEEVVDEPGRVEKSEQRELFRGSEEGRPWRNLE 80
Query: 61 LIFLIQNKELDQQKKVEAVFSFVNSKLEERDKYYDT------------------------ 120
LIFLIQNKELDQQKKVEAVFSFV+SKL+E DK YDT
Sbjct: 81 LIFLIQNKELDQQKKVEAVFSFVDSKLKEEDKCYDTVKISRLIVFLSDWVQSLLISFEKK 140
Query: 121 ------------------------------------------------------------ 180
Sbjct: 141 AKNDGGKHLKMAIEPCLDYRCWEVFKFCLEESVKMNITLNLSRNLLHAFCFVTRNAISLL 200
Query: 181 ------------------------------------------------------------ 240
Sbjct: 201 DVSSSSKEELFAGDCLKLYNCVQDCVSLVFSSHLGLSNDNLDAWISTIDAVLEFLHKIHV 260
Query: 241 ------------------------------------------------------------ 300
Sbjct: 261 NSLEGEDVGIFATLFSRMMLEPFAKFLWIHPTKKTGFHNFVNKLLEPLLQLLRDLSLKAD 320
Query: 301 --------NMMKLLEEVLSHALFHTVHIDGFLCLHGSEKVIKSHDEKLEESKAHIKSYHR 360
+MKLLE+VLSHALFHTVHIDGFLCLHGSEKV KSHDEKLEESKAH+KSYHR
Sbjct: 321 GCNHGRTRTLMKLLEDVLSHALFHTVHIDGFLCLHGSEKVTKSHDEKLEESKAHMKSYHR 380
Query: 361 HLFDKVQRLVAGKKFLALGAIGELFRVLVVRVKTVKGGSK-FEDTKLNNKMGCVGPLRDD 420
HLFDKVQ+LVA KKFLALGA+GELF VLVVRV VKG S FEDTKLNNKMGC G LRDD
Sbjct: 381 HLFDKVQKLVAEKKFLALGAVGELFDVLVVRVNKVKGASMLFEDTKLNNKMGCFGHLRDD 440
Query: 421 VSSDASYVLQGSADVPSEKSNYLSSLSTEIRKSLFEFFVQILDPLLHTIDHISAEIKQGP 480
SS AS LQGSAD SEKSNY SSLSTEIRKSLFEFFVQILDPLL TIDHISAEIK GP
Sbjct: 441 TSSHASRALQGSADGLSEKSNYSSSLSTEIRKSLFEFFVQILDPLLLTIDHISAEIKLGP 500
Query: 481 ALSDVHCLLKSINNLLASFMKEKVYLRTEDNSEGAYFNFLKKVYDTVMLVSSNLLLLLRH 540
ALSDV LLKSINNLLASFMK KVYLRTEDNSEGAY NFLKKVYD VM VSSNLL L RH
Sbjct: 501 ALSDVCYLLKSINNLLASFMKGKVYLRTEDNSEGAYLNFLKKVYDKVMFVSSNLLSLSRH 560
Query: 541 EIENNVDLEVFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVILSYSAINLSFTSIPKQ 600
E+ENN+D VFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVI+SYSAINLSFTSIP+Q
Sbjct: 561 ELENNIDQGVFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVIISYSAINLSFTSIPEQ 620
Query: 601 YLLTSKIQELGCQLVVLYGQLRQVNISVVALCKAMRTVLSNEHETEKSYASVMTSLGYEA 660
+LLTS+IQELGCQLVVLYGQLRQVNI + ALCKAMRTV+SNE E EKSYAS MTSLG+EA
Sbjct: 621 HLLTSRIQELGCQLVVLYGQLRQVNIIIFALCKAMRTVISNEGENEKSYASFMTSLGHEA 680
Query: 661 YGKSVGMLLSSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLNIRNS 720
YGKSVG L+SSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLN RNS
Sbjct: 681 YGKSVGTLVSSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLNTRNS 740
Query: 721 TSDSTLQTMFLGRGLSEMYDLMLDSLMITSGNAFQIGTSIENLISVIRPCMSCLVGLHSD 780
S L+T+ LGRGLSEMY LMLDSLMITSGNA QIGTSI+NLISV+RPCMS LVGL SD
Sbjct: 741 KSCLNLKTLLLGRGLSEMYALMLDSLMITSGNALQIGTSIDNLISVLRPCMSILVGLQSD 800
Query: 781 GAKAFLVAIMGKTRDGMIADEDNCLGFGVTSHWVFVFFFRLYMSCRSLYRQAISLMPPSS 840
GAK F+VAIM K D ++ADEDNC GFGV SHWVFVFFFRLYMSCRSLYRQAISLMPP S
Sbjct: 801 GAKEFVVAIMEKKCDDVVADEDNCQGFGVISHWVFVFFFRLYMSCRSLYRQAISLMPPGS 860
Query: 841 SRKMSTAIGDSFIAYSACDWMQRTDWSDEGYFSWIIEPSTSVLAVAQSVSSLYHQGTDVD 900
SRKMS AIGDS +AYSACDWMQRTDWSDEGYFSWII+PS SVLAVAQS+ SLYHQGTD D
Sbjct: 861 SRKMSAAIGDSMVAYSACDWMQRTDWSDEGYFSWIIQPSASVLAVAQSICSLYHQGTDED 920
Query: 901 WCPLVYVLHTMTLQRLVDLNRQIDSLEYVHQRNENLMQVEVLGDDGLSVLRKKSKKFGRL 960
W PL+YVL TM LQRLVDLNRQIDSLEY+HQRNENLMQVEVLGDD LSVLRKKSKKFGRL
Sbjct: 921 WYPLIYVLLTMALQRLVDLNRQIDSLEYLHQRNENLMQVEVLGDDDLSVLRKKSKKFGRL 980
Query: 961 LSDLRKEAADLTDFMMKHLSLVAKHHILNSTKRNATSNDKSTETLSEIDDWDFSICNVNK 1020
+S L+KEAADLTDFMM HLSL+AK ILN K ATSN+K ETL EIDDWDFSIC++NK
Sbjct: 981 VSVLQKEAADLTDFMMSHLSLIAKRQILNPAK-IATSNEKCIETLDEIDDWDFSICSMNK 1040
Query: 1021 RSLPTAIWWIVCQNVDIWANHAAKKRL--------------------------------- 1080
RS PTA+WW+VCQNVDIWA HAAKK+L
Sbjct: 1041 RSFPTAVWWVVCQNVDIWAIHAAKKKLKMFLSFLIRTSHPFLTSNDMKIESQQNDGCQQL 1100
Query: 1081 ---------------------------MPSRFCRELKASILSSFHDINISSADWMEKIAK 1140
MPSRFC ELKA++L SFHDI+ SSADWME IA
Sbjct: 1101 NKVSLQQISSSVLSDPIFYEQRFVCRFMPSRFCHELKATVLPSFHDISTSSADWMEVIAT 1160
Query: 1141 LEPVCSSKRIPDENFPLTKAVSHSSDLLQTEDCKLKRDSPPSNVRSRACQHLIDLLCWMP 1200
LE L T DC+ K DSPPSNVR RACQH I+LLCWMP
Sbjct: 1161 LE-------------------------LSTTDCQSKSDSPPSNVRFRACQHFINLLCWMP 1220
Query: 1201 KGNISSRSFSLYTTYVLNLERQLVNAILDNQTTLCSPNQSELLKLFVSCRKALKYIFMAS 1260
KGNISSRSFSLYTT VL LERQLV LD+QTTLCS NQ ELLKLF SCRKALKYIF A
Sbjct: 1221 KGNISSRSFSLYTTNVLELERQLV---LDSQTTLCSENQFELLKLFASCRKALKYIFTAY 1280
Query: 1261 YEAGDRQSSSIPVPSENQFPVSWLFKSVSIVNQLQEASPEGSNRQIKDIIFSLMDHTSYL 1320
YEAGDRQSSS PVPSENQFPVSWLFKSVSIVNQLQ+AS GS+RQIKDIIFSLMDHTSYL
Sbjct: 1281 YEAGDRQSSSTPVPSENQFPVSWLFKSVSIVNQLQDASSGGSDRQIKDIIFSLMDHTSYL 1340
Query: 1321 FLTTSKYQFKNALRLIVIDNKPCKEQCQNVFHELVNGDDLCLDPTHCVEAYNSASQMTIS 1380
FLTTSKYQFKNALRLIVIDNKPC EQ +NV HEL +GDDL L C+EA NSA QMTIS
Sbjct: 1341 FLTTSKYQFKNALRLIVIDNKPCMEQPENVSHELNDGDDLFLGSNRCLEACNSAIQMTIS 1400
Query: 1381 LKEQVESEFISLKKSNVTVGDGKNSANMCKVYSLASCLNGFLWGLASAVDHTDLRNGSRH 1440
LKEQVESE I LKKSNVTVGDGKN NM KVYSLASCLNGFLWGLASA D TDLRN +RH
Sbjct: 1401 LKEQVESELIYLKKSNVTVGDGKNRGNMYKVYSLASCLNGFLWGLASAEDDTDLRNSNRH 1460
Query: 1441 MRSMKLKCEYSSQLNLCINAISELLGLLLEMFLDRDSDRPTKLCDYHTSQDLLVVNEVSV 1500
RSMKLKCE+SSQLNLCINAISELLGL+LEMFLDRDS RP KLCDY TSQD L VNE S
Sbjct: 1461 TRSMKLKCEFSSQLNLCINAISELLGLILEMFLDRDSQRPQKLCDYQTSQDFLGVNEPSG 1520
Query: 1501 KGSSSEVDTSFSKHHKFESSHNDDGTENVSFNRKRLKLENKSSVASILNEADSIELQSLN 1560
KG SSEVDTS SK+ K ESS +DD +N S RKRLKL NKSSVASIL+EA+ IE+QSLN
Sbjct: 1521 KGPSSEVDTSCSKYQKLESSQSDDDNKNTSLKRKRLKLGNKSSVASILSEANLIEMQSLN 1580
Query: 1561 QPVLRGLLKGSYPEVAFALKQLFLAASVILRLHMKYDTVPLSSSFMAILIGISRLLLLEF 1620
+P LRGLLKGSYPE AFALKQLFLAASVILRLHMKYD++PLSSS MAILI ISR LLL+F
Sbjct: 1581 KPFLRGLLKGSYPEAAFALKQLFLAASVILRLHMKYDSIPLSSSSMAILISISRFLLLKF 1640
Query: 1621 VDMVEVPQPFLLACLDGVLKYLEELGHLFPFSDPMQSRNLYSRLIDIQLKAIGKCISLLG 1680
VDMVEVPQPFLL CLDGVLKYLE LGHLFPF+DPMQSRNLYS LI++ L+AIGKCISL G
Sbjct: 1641 VDMVEVPQPFLLTCLDGVLKYLEGLGHLFPFADPMQSRNLYSNLINLHLQAIGKCISLQG 1700
Query: 1681 KRATLASHETESTTKTLDGHLGLFEESSFPGVYRIDEFKTSLRMSFKVFIRKASELHLLS 1740
KRATL SH+TESTTKTLDGHL LFEESSFP +Y ID+FK+SLRMSFKVFIRKASELHLLS
Sbjct: 1701 KRATLTSHDTESTTKTLDGHLCLFEESSFPRIYYIDQFKSSLRMSFKVFIRKASELHLLS 1760
Query: 1741 AIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEFVSGRKCLGVVK 1800
AIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEF SGRKCL VVK
Sbjct: 1761 AIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEFASGRKCLSVVK 1820
Query: 1801 RHIQSLIAGLFSIVLHLQSPQVFYTRMIDTKNKSGPDPGSVILMSVEVLTRVSGKHALFQ 1810
RHIQSLIAGLFSIVLHLQ+PQ+FY+RMIDTKNKS PDPGSVILMSVEVLTRVSGKHALFQ
Sbjct: 1821 RHIQSLIAGLFSIVLHLQTPQIFYSRMIDTKNKSDPDPGSVILMSVEVLTRVSGKHALFQ 1880
BLAST of Sgr023809 vs. ExPASy TrEMBL
Match:
A0A6J1G6F1 (uncharacterized protein LOC111451261 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111451261 PE=4 SV=1)
HSP 1 Score: 2731.8 bits (7080), Expect = 0.0e+00
Identity = 1471/2086 (70.52%), Postives = 1600/2086 (76.70%), Query Frame = 0
Query: 1 MKNKKRKLKSPQETERPSKSARLLISPEEEVIDEPERVEKSEQRGLCQESEEGCPWRNLE 60
MK+KKRKLKSPQ +ERPSKSAR LISPE EV+D E+VEK EQ + QE +E CPWRNLE
Sbjct: 13 MKSKKRKLKSPQTSERPSKSARHLISPEVEVVDGTEQVEKMEQGEVSQEFDESCPWRNLE 72
Query: 61 LIFLIQNKELDQQKKVEAVFSFVNSKLEERDKYYD------------------------- 120
LIFLIQNKE +QQKKV+AVFSFVNSK E+DKY+D
Sbjct: 73 LIFLIQNKEFNQQKKVDAVFSFVNSKWNEKDKYHDKVKMSRLIVFLSDWVQSLLISSEKK 132
Query: 121 ------------------------------------------------------------ 180
Sbjct: 133 AKNDGGKHHNMAIEPCLDYRCWEVFKFCLEESVKTLIPLNLSKNLLHAFCFVTRSAISLL 192
Query: 181 ------------------------------------------------------------ 240
Sbjct: 193 GDLSSSREELFSGDCFKLYNVVLDCVSLVFSPHLGLSNENLDAWISTIDAMLEFLHKIYV 252
Query: 241 ------------------------------------------------------------ 300
Sbjct: 253 SSLEDKDVGIFAIKFSSMMLKPFAKFLWTHPTKKAGFHNFVNKLLEPLLQLLLDISLKAD 312
Query: 301 -------TNMMKLLEEVLSHALFHTVHIDGFLCLHGSEKVIKSHDEKLEESKAHIKSYHR 360
+MKLLEEVLSHALFHTVHIDGFLCLHGS+KVIKS DEK EESKAHIKSYHR
Sbjct: 313 GCDHCWTRTLMKLLEEVLSHALFHTVHIDGFLCLHGSDKVIKSPDEKSEESKAHIKSYHR 372
Query: 361 HLFDKVQRLVAGKKFLALGAIGELFRVLVVRVKTVKGGS-KFEDTKLNNKMGCVGPLRDD 420
HLFDK+Q+LVAGKKF ALGA+GELF VLVVRVK VKG S EDTKLNNKM RDD
Sbjct: 373 HLFDKMQKLVAGKKFSALGAVGELFHVLVVRVKKVKGVSISSEDTKLNNKM------RDD 432
Query: 421 VSSDASYVLQGSADVPSEKSNYLSSLSTEIRKSLFEFFVQILDPLLHTIDHISAEIKQGP 480
+SS AS L SEKSN SSLSTEIRK LFEFFVQILDPLL TI+ ISAEIK G
Sbjct: 433 ISSHASSGL-------SEKSNNQSSLSTEIRKPLFEFFVQILDPLLQTIEQISAEIKLGT 492
Query: 481 ALSDVHCLLKSINNLLASFMKEKVYLRTEDNSEGAYFNFLKKVYDTVMLVSSNLLLLLRH 540
ALSDVHCLLKSINNLLASFM+EKVYLRTEDNSEGAY NFLKKVYDTVMLVSS+LLLL R
Sbjct: 493 ALSDVHCLLKSINNLLASFMEEKVYLRTEDNSEGAYHNFLKKVYDTVMLVSSHLLLLSRL 552
Query: 541 EIENNVDLEVFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVILSYSAINLSFTSIPKQ 600
EIENN+DLEV+VLA NEILVT+ YLLEIEYDVIGNDLVSLWLVILSYSAINLSFTS+PKQ
Sbjct: 553 EIENNIDLEVYVLAGNEILVTLSYLLEIEYDVIGNDLVSLWLVILSYSAINLSFTSVPKQ 612
Query: 601 YLLTSKIQELGCQLVVLYGQLRQVNISVVALCKAMRTVLSNEHETEKSYASVMTSLGYEA 660
+LLTSKIQELGCQLV LYGQLRQVN+S+ ALCKAMRT +SNE E+EK YAS MTSLG+EA
Sbjct: 613 HLLTSKIQELGCQLVALYGQLRQVNVSIFALCKAMRTAISNEGESEKDYASFMTSLGHEA 672
Query: 661 YGKSVGMLLSSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLNIRNS 720
YGKSVGMLLSSQEIKFAIHKAIKY+PEGQASGIIQQLTED+TETLGWLR CN+N+N RN+
Sbjct: 673 YGKSVGMLLSSQEIKFAIHKAIKYIPEGQASGIIQQLTEDMTETLGWLRQCNMNMNTRNN 732
Query: 721 TSDSTLQTMFLGRGLSEMYDLMLDSLMITSGNAFQIGTSIENLISVIRPCMSCLVGLHSD 780
T D +QT+ LGRGLSE+Y LMLDSLMITSGNAFQ+GTSIENL+SVIRPCMS LVG +D
Sbjct: 733 TEDLNMQTVLLGRGLSEVYALMLDSLMITSGNAFQVGTSIENLVSVIRPCMSNLVGPQAD 792
Query: 781 GAKAFLVAIMGKTRDGMIADEDNCLGFGVTSHWVFVFFFRLYMSCRSLYRQAISLMPPSS 840
GAKAF A++GKT D M+ADEDNCLGFGVTSHWVFVFF LYMSCR+LYRQAISLMPPSS
Sbjct: 793 GAKAFFAAVIGKTCDDMVADEDNCLGFGVTSHWVFVFFLCLYMSCRNLYRQAISLMPPSS 852
Query: 841 SRKMSTAIGDSFIAYSACDWMQRTDWSDEGYFSWIIEPSTSVLAVAQSVSSLYHQGTDVD 900
SRKMS AIGDSF+AYSACDWMQRTDWSDEGYFSWII PS SVL VAQSV SLYHQ T V
Sbjct: 853 SRKMSAAIGDSFVAYSACDWMQRTDWSDEGYFSWIIRPSASVLVVAQSVCSLYHQDTSVG 912
Query: 901 WCPLVYVLHTMTLQRLVDLNRQIDSLEYVHQRNENLMQVEVLGDDGLSVLRKKSKKFGRL 960
W PL+YVL TM LQRLVDLN+QI SLEY++QRNENLMQVEVLGDDGLSVL+KKSKK+ RL
Sbjct: 913 WYPLIYVLLTMALQRLVDLNKQIGSLEYLYQRNENLMQVEVLGDDGLSVLQKKSKKYSRL 972
Query: 961 LSDLRKEAADLTDFMMKHLSLVAKHHILNSTKRNATSNDKSTETLSEIDDWDFSICNVNK 1020
+S LRKEA DLTDFMM+H SLVAK +LNSTK ATSNDKST LSEIDDWDFSICNVNK
Sbjct: 973 VSVLRKEAEDLTDFMMRHFSLVAKRQVLNSTKEVATSNDKSTVMLSEIDDWDFSICNVNK 1032
Query: 1021 RSLPTAIWWIVCQNVDIWANHAAKKRL--------------------------------- 1080
RS PTA+WWIVCQNVDIW NHAAKK+L
Sbjct: 1033 RSFPTAVWWIVCQNVDIWVNHAAKKKLKMFLSFLIRTSHQFLVSSDTKIGRQQTNGFRQL 1092
Query: 1081 ---------------------------MPSRFCRELKASILSSFHDINISSADWMEKIAK 1140
+PSRFCREL AS+LSSFHDIN SS DWME +
Sbjct: 1093 KKVSLQQISSAALSDPIFYEHGFVCRFLPSRFCRELSASLLSSFHDINTSSTDWMEVLCT 1152
Query: 1141 LE----PVCSSKRIPDENFPLTKAVSHSSDLLQTEDCKLKRDSPPSNVRSRACQHLIDLL 1200
LE VCS KR PD++ PL K V+HSSD+L TEDCK K DS SN+ RACQHLIDLL
Sbjct: 1153 LERLTTSVCSGKRTPDDSSPLAKTVNHSSDMLYTEDCKWKGDSSQSNLSFRACQHLIDLL 1212
Query: 1201 CWMPKGNISSRSFSLYTTYVLNLERQLVNAILDNQTTLCSPNQSELLKLFVSCRKALKYI 1260
CWMPKGN SSRSFSLYTT+VL LERQLV+A+LDNQT LCS NQ ELLKLF SCRKALKYI
Sbjct: 1213 CWMPKGNFSSRSFSLYTTHVLKLERQLVSALLDNQTVLCS-NQFELLKLFASCRKALKYI 1272
Query: 1261 FMASYEAGDRQSSSIPVPSENQFPVSWLFKSVSIVNQLQEASPEGSNRQIKDIIFSLMDH 1320
FMA YEA + QSSSIP+PSE+QFPVSWLFKS+SIVN++QEAS G+ +IKDIIFSLMDH
Sbjct: 1273 FMAYYEARNEQSSSIPLPSESQFPVSWLFKSISIVNRIQEASAGGTATKIKDIIFSLMDH 1332
Query: 1321 TSYLFLTTSKYQFKNALRLIVIDNKPCKEQCQNVFHELVNGDDLCLDPTHCVEAYNSASQ 1380
TSYLFLTTSKYQFKNALRL+VIDNK C+E Q+V HEL +GD + LD THCVE NSA Q
Sbjct: 1333 TSYLFLTTSKYQFKNALRLMVIDNKTCEEH-QDVCHELNDGDGVSLDSTHCVEVCNSAIQ 1392
Query: 1381 MTISLKEQVESEFISLKKSNVTVGDGKNSANMCKVYSLASCLNGFLWGLASAVDHTDLRN 1440
M+ISLKEQVESE ISL+KSNV+VGDGKNSA MCKV SLASCLNGFLWGLASAVDHTDLRN
Sbjct: 1393 MSISLKEQVESELISLRKSNVSVGDGKNSAQMCKVNSLASCLNGFLWGLASAVDHTDLRN 1452
Query: 1441 GSRHMRSMKLKCEYSSQLNLCINAISELLGLLLEMFLDRDSDRPTKLCDYHTSQDLLVVN 1500
G+R RSMKLK EYSS+LNLC+NA SELL L+LEMFLDRDS PTKLCDY SQDLLVV+
Sbjct: 1453 GNRRTRSMKLKFEYSSKLNLCMNATSELLDLILEMFLDRDSQWPTKLCDYQPSQDLLVVD 1512
Query: 1501 EVSVKGSSSEVDTSFSKHHKFESSHNDDGTENVSFNRKRLKLENKSSVASILNEADSIEL 1560
E+ VK S SE DTSFSKH + ESSH DDG+E+ N+KRLKLENKSSVASILNEA++IE+
Sbjct: 1513 ELPVKHSGSEADTSFSKHRELESSHCDDGSESGGTNKKRLKLENKSSVASILNEANTIEM 1572
Query: 1561 QSLNQPVLRGLLKGSYPEVAFALKQLFLAASVILRLHMKYDTVPLSSSFMAILIGISRLL 1620
QS NQ L+GLLKGSYP+VAFALKQLFLAASVILRLH +Y T+PLSSSFMAILIG SR L
Sbjct: 1573 QSFNQSFLQGLLKGSYPDVAFALKQLFLAASVILRLHKQYGTIPLSSSFMAILIGFSRFL 1632
Query: 1621 LLEFVDMVEVPQPFLLACLDGVLKYLEELGHLFPFSDPMQSRNLYSRLIDIQLKAIGKCI 1680
LLEF +MVEVP+PFL ACLDGVLKYLEELGHLFP +DPM+SR+LYSRL+++ LKA+GKCI
Sbjct: 1633 LLEFENMVEVPEPFLFACLDGVLKYLEELGHLFPSADPMKSRDLYSRLVNLHLKAMGKCI 1692
Query: 1681 SLLGKRATLASHETESTTKTLDGHLGLFEESSFPGVYRIDEFKTSLRMSFKVFIRKASEL 1740
SL KRATLASHETESTTKTLDG GLFEESSFP +Y +DEFK SLRMSFKVFIR+ASEL
Sbjct: 1693 SLQRKRATLASHETESTTKTLDG--GLFEESSFPVIYCMDEFKASLRMSFKVFIREASEL 1752
Query: 1741 HLLSAIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEFVSGRKCL 1800
HLLSAIQAIERALVGVQEGCTA YEL SGSEDGG CSSIVAAG+ECLDLVLEFVSGRKCL
Sbjct: 1753 HLLSAIQAIERALVGVQEGCTATYELCSGSEDGGSCSSIVAAGIECLDLVLEFVSGRKCL 1812
Query: 1801 GVVKRHIQSLIAGLFSIVLHLQSPQVFYTRMIDTKNKSGPDPGSVILMSVEVLTRVSGKH 1810
GVVKRHIQSLIAGLFSIVLHLQSP +FY R IDTK +S PDPG+VILMSVEVL RVSGKH
Sbjct: 1813 GVVKRHIQSLIAGLFSIVLHLQSPHIFYVRTIDTKGRSDPDPGAVILMSVEVLARVSGKH 1872
BLAST of Sgr023809 vs. ExPASy TrEMBL
Match:
A0A6J1L579 (uncharacterized protein LOC111500003 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111500003 PE=4 SV=1)
HSP 1 Score: 2722.2 bits (7055), Expect = 0.0e+00
Identity = 1471/2086 (70.52%), Postives = 1598/2086 (76.61%), Query Frame = 0
Query: 1 MKNKKRKLKSPQETERPSKSARLLISPEEEVIDEPERVEKSEQRGLCQESEEGCPWRNLE 60
MK+KKRKLKSPQ + RPSKSAR LISPE EV+D E+VEK EQ + QE +E CPWR+LE
Sbjct: 13 MKSKKRKLKSPQTSARPSKSARPLISPEVEVVDGTEQVEKMEQGEVSQEFDESCPWRSLE 72
Query: 61 LIFLIQNKELDQQKKVEAVFSFVNSKLEERDKYYDT------------------------ 120
LIFLIQNKE DQQKKVEAVFSFVNSK E+DKY+D
Sbjct: 73 LIFLIQNKEFDQQKKVEAVFSFVNSKWNEKDKYHDKVKMSRLIVFLSDWVQSLLISSEKK 132
Query: 121 ---------NM------------------------------------------------- 180
NM
Sbjct: 133 AKNDGGKHHNMAIEPCLDYRCWEVFKFCLEESVKTLIPLNLSKKILHAFCFVTRSAISLL 192
Query: 181 ------------------------------------------------------------ 240
Sbjct: 193 GDLSSSREELFSGDCFKLYNVVLDCVSLVFSPHLGLSNENLDAWISTIDAMLQFLHKIYI 252
Query: 241 ------------------------------------------------------------ 300
Sbjct: 253 SSLEDKDVGVFAIKFSCMMLKPFAKFLWTHPTKKAGFHNFVNKLLEPLLQLLLDISLKAD 312
Query: 301 ----------MKLLEEVLSHALFHTVHIDGFLCLHGSEKVIKSHDEKLEESKAHIKSYHR 360
MKLLEEVLSH LFHTVHIDGFLCLHGS+KVIKS DEK EESKAHIKSYHR
Sbjct: 313 GCDHCWTRTSMKLLEEVLSHGLFHTVHIDGFLCLHGSDKVIKSPDEKSEESKAHIKSYHR 372
Query: 361 HLFDKVQRLVAGKKFLALGAIGELFRVLVVRVKTVKGGS-KFEDTKLNNKMGCVGPLRDD 420
HLFDK+Q+LVAGKKF ALGA+GELF VLVVRVK VKG S EDTKLNNKM RD+
Sbjct: 373 HLFDKMQKLVAGKKFSALGAVGELFHVLVVRVKKVKGVSMSSEDTKLNNKM------RDE 432
Query: 421 VSSDASYVLQGSADVPSEKSNYLSSLSTEIRKSLFEFFVQILDPLLHTIDHISAEIKQGP 480
+SS AS L SEKSN SSLSTEIRK LFEFFVQILDPLL TI+HISAEIK G
Sbjct: 433 ISSHASSGL-------SEKSNSQSSLSTEIRKPLFEFFVQILDPLLQTIEHISAEIKLGT 492
Query: 481 ALSDVHCLLKSINNLLASFMKEKVYLRTEDNSEGAYFNFLKKVYDTVMLVSSNLLLLLRH 540
+LSDVHCLLKSINNLLASFMKEKVYLRTEDNSEGAY NFLKKVYDTVMLVSS+LLLL R
Sbjct: 493 SLSDVHCLLKSINNLLASFMKEKVYLRTEDNSEGAYHNFLKKVYDTVMLVSSHLLLLSRR 552
Query: 541 EIENNVDLEVFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVILSYSAINLSFTSIPKQ 600
EIEN++DLEV+VLA NEILVT+ YLLEIEYDVIGNDLVSLWLVILSYSAINLSFTS+PKQ
Sbjct: 553 EIENDIDLEVYVLAGNEILVTLSYLLEIEYDVIGNDLVSLWLVILSYSAINLSFTSVPKQ 612
Query: 601 YLLTSKIQELGCQLVVLYGQLRQVNISVVALCKAMRTVLSNEHETEKSYASVMTSLGYEA 660
+LLTSKIQELGCQLV LYGQLRQVNIS+ ALCKAMRT +SNE +TEK YAS MTSLG+EA
Sbjct: 613 HLLTSKIQELGCQLVALYGQLRQVNISIFALCKAMRTAISNEGDTEKDYASFMTSLGHEA 672
Query: 661 YGKSVGMLLSSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLNLNIRNS 720
YGKSVGMLLSSQEIKFAIHKAIKY+PEGQASGIIQQLTED+TETLGWLR CN+N+N RN+
Sbjct: 673 YGKSVGMLLSSQEIKFAIHKAIKYIPEGQASGIIQQLTEDMTETLGWLRQCNMNMNTRNN 732
Query: 721 TSDSTLQTMFLGRGLSEMYDLMLDSLMITSGNAFQIGTSIENLISVIRPCMSCLVGLHSD 780
T +QT+ LGRGLSE+Y LMLDSLMITSGNAFQ+GTSIENL+SVIRPCMS LVGL +D
Sbjct: 733 TEGLNMQTVLLGRGLSEIYALMLDSLMITSGNAFQVGTSIENLVSVIRPCMSNLVGLQAD 792
Query: 781 GAKAFLVAIMGKTRDGMIADEDNCLGFGVTSHWVFVFFFRLYMSCRSLYRQAISLMPPSS 840
GAKAF A+MG+T D M+ADED CLGFGVTSHWVFVFF LYMSCR+LYRQAISLMPPSS
Sbjct: 793 GAKAFFAAVMGETCDDMVADEDICLGFGVTSHWVFVFFLCLYMSCRNLYRQAISLMPPSS 852
Query: 841 SRKMSTAIGDSFIAYSACDWMQRTDWSDEGYFSWIIEPSTSVLAVAQSVSSLYHQGTDVD 900
SRKMS AIGDSF+AYSACDWMQRTDWSDEGYFSWII PS SVL VAQSV SLYHQ T+V
Sbjct: 853 SRKMSAAIGDSFVAYSACDWMQRTDWSDEGYFSWIIRPSASVLVVAQSVCSLYHQDTNVG 912
Query: 901 WCPLVYVLHTMTLQRLVDLNRQIDSLEYVHQRNENLMQVEVLGDDGLSVLRKKSKKFGRL 960
W PL+YVL TM LQRLVDLN+QI SLEY++ RN+NLMQVEVLGDDGLSVL+KKSKK+ RL
Sbjct: 913 WYPLIYVLLTMALQRLVDLNKQIGSLEYLYHRNKNLMQVEVLGDDGLSVLQKKSKKYSRL 972
Query: 961 LSDLRKEAADLTDFMMKHLSLVAKHHILNSTKRNATSNDKSTETLSEIDDWDFSICNVNK 1020
+S LRKEA DLTDFMM+H S V K +LNSTK ATSNDKST LSEIDDWDFSICNVNK
Sbjct: 973 VSVLRKEAEDLTDFMMRHFSSVVKRQVLNSTKEVATSNDKSTVMLSEIDDWDFSICNVNK 1032
Query: 1021 RSLPTAIWWIVCQNVDIWANHAAKKRL--------------------------------- 1080
RS PTA+WWIVCQNVDIW NHAAKK+L
Sbjct: 1033 RSFPTAVWWIVCQNVDIWVNHAAKKKLKMFLSFLIRTSHQFLVSSDTKIGRQQTNGFRQL 1092
Query: 1081 ---------------------------MPSRFCRELKASILSSFHDINISSADWMEKIAK 1140
+PSRFCREL S+LSSFHDIN SS DWME I
Sbjct: 1093 KKVSLQQISSAALSDPIFYEHRFVCRFLPSRFCRELSVSLLSSFHDINTSSTDWMEVICT 1152
Query: 1141 LE----PVCSSKRIPDENFPLTKAVSHSSDLLQTEDCKLKRDSPPSNVRSRACQHLIDLL 1200
LE VCS R PD++ PL K V+HSSD+L TEDCK K DS SN+ RACQHLIDLL
Sbjct: 1153 LERLTTSVCSGTRTPDDSAPLAKIVNHSSDMLYTEDCKWKGDSSQSNLSFRACQHLIDLL 1212
Query: 1201 CWMPKGNISSRSFSLYTTYVLNLERQLVNAILDNQTTLCSPNQSELLKLFVSCRKALKYI 1260
CWMPKGN SSRSFSLYTT+VL LERQLV+A+LDNQT LCS NQ ELLKLF SCRKALKYI
Sbjct: 1213 CWMPKGNFSSRSFSLYTTHVLKLERQLVSALLDNQTVLCS-NQFELLKLFASCRKALKYI 1272
Query: 1261 FMASYEAGDRQSSSIPVPSENQFPVSWLFKSVSIVNQLQEASPEGSNRQIKDIIFSLMDH 1320
FMA YEAG+ QSSSIP+PSENQFPVSWLFKS+SIVN++QEAS + +IKDIIFSLMDH
Sbjct: 1273 FMAYYEAGNEQSSSIPLPSENQFPVSWLFKSISIVNRIQEASGGSTATKIKDIIFSLMDH 1332
Query: 1321 TSYLFLTTSKYQFKNALRLIVIDNKPCKEQCQNVFHELVNGDDLCLDPTHCVEAYNSASQ 1380
TSYLFLTTSKYQFKNALRL+VIDNKPCKE+ Q+V HEL +GD LD THCVE NSA Q
Sbjct: 1333 TSYLFLTTSKYQFKNALRLMVIDNKPCKEEHQDVCHELNDGDGGSLDSTHCVEECNSAIQ 1392
Query: 1381 MTISLKEQVESEFISLKKSNVTVGDGKNSANMCKVYSLASCLNGFLWGLASAVDHTDLRN 1440
M+ISLKEQVESE ISL+KSNV+VGDGKNSA MCKV SLASCLNGFLWGLASAVDHTDLRN
Sbjct: 1393 MSISLKEQVESELISLRKSNVSVGDGKNSAQMCKVNSLASCLNGFLWGLASAVDHTDLRN 1452
Query: 1441 GSRHMRSMKLKCEYSSQLNLCINAISELLGLLLEMFLDRDSDRPTKLCDYHTSQDLLVVN 1500
G+R MRSMKLK EYSS+LNLC+NA SELLGL+LEMFLDRDS PTKLCD SQDLLVV+
Sbjct: 1453 GNRRMRSMKLKFEYSSKLNLCMNATSELLGLILEMFLDRDSQWPTKLCDNQPSQDLLVVD 1512
Query: 1501 EVSVKGSSSEVDTSFSKHHKFESSHNDDGTENVSFNRKRLKLENKSSVASILNEADSIEL 1560
EV VK S SE D SFSK+ + ESSH DDG+E+ S N+KRLKLENKSSVASILNEA++IE+
Sbjct: 1513 EVKVKHSGSEADISFSKNRELESSHCDDGSESGSTNKKRLKLENKSSVASILNEANTIEM 1572
Query: 1561 QSLNQPVLRGLLKGSYPEVAFALKQLFLAASVILRLHMKYDTVPLSSSFMAILIGISRLL 1620
QSLNQ L+GLLKGS P+VAFALKQLFLAASVILRLH +Y TVPLSSSFMAI+IG SR L
Sbjct: 1573 QSLNQSFLQGLLKGSCPDVAFALKQLFLAASVILRLHKQYGTVPLSSSFMAIVIGFSRFL 1632
Query: 1621 LLEFVDMVEVPQPFLLACLDGVLKYLEELGHLFPFSDPMQSRNLYSRLIDIQLKAIGKCI 1680
LLEF +MVEVP+PFL ACLDGVLKYLEELGHLFP +DPM+SR+LYSRL+++ LKA+GKCI
Sbjct: 1633 LLEFENMVEVPEPFLFACLDGVLKYLEELGHLFPSADPMKSRDLYSRLVNLHLKAMGKCI 1692
Query: 1681 SLLGKRATLASHETESTTKTLDGHLGLFEESSFPGVYRIDEFKTSLRMSFKVFIRKASEL 1740
SL KRATLASHETESTTKTLDG GLFEESSFP VY +DEFK+SLRMSFKVFIR+ASEL
Sbjct: 1693 SLQRKRATLASHETESTTKTLDG--GLFEESSFPVVYCMDEFKSSLRMSFKVFIREASEL 1752
Query: 1741 HLLSAIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEFVSGRKCL 1800
HLLSAIQAIERALVGVQEGCTA YEL SGSEDGG CSSIVAAGVECLDLVLEFVSGRK L
Sbjct: 1753 HLLSAIQAIERALVGVQEGCTATYELCSGSEDGGSCSSIVAAGVECLDLVLEFVSGRKGL 1812
Query: 1801 GVVKRHIQSLIAGLFSIVLHLQSPQVFYTRMIDTKNKSGPDPGSVILMSVEVLTRVSGKH 1810
GVVKRHIQSLIAGLFSIVLHLQSP +FY R +DTK +S PDPG+VILMSVEVL RVSGKH
Sbjct: 1813 GVVKRHIQSLIAGLFSIVLHLQSPHIFYVRTVDTKGRSDPDPGAVILMSVEVLARVSGKH 1872
BLAST of Sgr023809 vs. ExPASy TrEMBL
Match:
A0A6J1G6M2 (uncharacterized protein LOC111451261 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111451261 PE=4 SV=1)
HSP 1 Score: 2690.6 bits (6973), Expect = 0.0e+00
Identity = 1402/1792 (78.24%), Postives = 1525/1792 (85.10%), Query Frame = 0
Query: 83 VNSKLEERDKYYDTNMMKLLEEVLSHALFHTVHIDGFLCLHGSEKVIKSHDEKLEESKAH 142
++ K + D + +MKLLEEVLSHALFHTVHIDGFLCLHGS+KVIKS DEK EESKAH
Sbjct: 239 ISLKADGCDHCWTRTLMKLLEEVLSHALFHTVHIDGFLCLHGSDKVIKSPDEKSEESKAH 298
Query: 143 IKSYHRHLFDKVQRLVAGKKFLALGAIGELFRVLVVRVKTVKGGS-KFEDTKLNNKMGCV 202
IKSYHRHLFDK+Q+LVAGKKF ALGA+GELF VLVVRVK VKG S EDTKLNNKM
Sbjct: 299 IKSYHRHLFDKMQKLVAGKKFSALGAVGELFHVLVVRVKKVKGVSISSEDTKLNNKM--- 358
Query: 203 GPLRDDVSSDASYVLQGSADVPSEKSNYLSSLSTEIRKSLFEFFVQILDPLLHTIDHISA 262
RDD+SS AS L SEKSN SSLSTEIRK LFEFFVQILDPLL TI+ ISA
Sbjct: 359 ---RDDISSHASSGL-------SEKSNNQSSLSTEIRKPLFEFFVQILDPLLQTIEQISA 418
Query: 263 EIKQGPALSDVHCLLKSINNLLASFMKEKVYLRTEDNSEGAYFNFLKKVYDTVMLVSSNL 322
EIK G ALSDVHCLLKSINNLLASFM+EKVYLRTEDNSEGAY NFLKKVYDTVMLVSS+L
Sbjct: 419 EIKLGTALSDVHCLLKSINNLLASFMEEKVYLRTEDNSEGAYHNFLKKVYDTVMLVSSHL 478
Query: 323 LLLLRHEIENNVDLEVFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVILSYSAINLSF 382
LLL R EIENN+DLEV+VLA NEILVT+ YLLEIEYDVIGNDLVSLWLVILSYSAINLSF
Sbjct: 479 LLLSRLEIENNIDLEVYVLAGNEILVTLSYLLEIEYDVIGNDLVSLWLVILSYSAINLSF 538
Query: 383 TSIPKQYLLTSKIQELGCQLVVLYGQLRQVNISVVALCKAMRTVLSNEHETEKSYASVMT 442
TS+PKQ+LLTSKIQELGCQLV LYGQLRQVN+S+ ALCKAMRT +SNE E+EK YAS MT
Sbjct: 539 TSVPKQHLLTSKIQELGCQLVALYGQLRQVNVSIFALCKAMRTAISNEGESEKDYASFMT 598
Query: 443 SLGYEAYGKSVGMLLSSQEIKFAIHKAIKYVPEGQASGIIQQLTEDVTETLGWLRLCNLN 502
SLG+EAYGKSVGMLLSSQEIKFAIHKAIKY+PEGQASGIIQQLTED+TETLGWLR CN+N
Sbjct: 599 SLGHEAYGKSVGMLLSSQEIKFAIHKAIKYIPEGQASGIIQQLTEDMTETLGWLRQCNMN 658
Query: 503 LNIRNSTSDSTLQTMFLGRGLSEMYDLMLDSLMITSGNAFQIGTSIENLISVIRPCMSCL 562
+N RN+T D +QT+ LGRGLSE+Y LMLDSLMITSGNAFQ+GTSIENL+SVIRPCMS L
Sbjct: 659 MNTRNNTEDLNMQTVLLGRGLSEVYALMLDSLMITSGNAFQVGTSIENLVSVIRPCMSNL 718
Query: 563 VGLHSDGAKAFLVAIMGKTRDGMIADEDNCLGFGVTSHWVFVFFFRLYMSCRSLYRQAIS 622
VG +DGAKAF A++GKT D M+ADEDNCLGFGVTSHWVFVFF LYMSCR+LYRQAIS
Sbjct: 719 VGPQADGAKAFFAAVIGKTCDDMVADEDNCLGFGVTSHWVFVFFLCLYMSCRNLYRQAIS 778
Query: 623 LMPPSSSRKMSTAIGDSFIAYSACDWMQRTDWSDEGYFSWIIEPSTSVLAVAQSVSSLYH 682
LMPPSSSRKMS AIGDSF+AYSACDWMQRTDWSDEGYFSWII PS SVL VAQSV SLYH
Sbjct: 779 LMPPSSSRKMSAAIGDSFVAYSACDWMQRTDWSDEGYFSWIIRPSASVLVVAQSVCSLYH 838
Query: 683 QGTDVDWCPLVYVLHTMTLQRLVDLNRQIDSLEYVHQRNENLMQVEVLGDDGLSVLRKKS 742
Q T V W PL+YVL TM LQRLVDLN+QI SLEY++QRNENLMQVEVLGDDGLSVL+KKS
Sbjct: 839 QDTSVGWYPLIYVLLTMALQRLVDLNKQIGSLEYLYQRNENLMQVEVLGDDGLSVLQKKS 898
Query: 743 KKFGRLLSDLRKEAADLTDFMMKHLSLVAKHHILNSTKRNATSNDKSTETLSEIDDWDFS 802
KK+ RL+S LRKEA DLTDFMM+H SLVAK +LNSTK ATSNDKST LSEIDDWDFS
Sbjct: 899 KKYSRLVSVLRKEAEDLTDFMMRHFSLVAKRQVLNSTKEVATSNDKSTVMLSEIDDWDFS 958
Query: 803 ICNVNKRSLPTAIWWIVCQNVDIWANHAAKKRL--------------------------- 862
ICNVNKRS PTA+WWIVCQNVDIW NHAAKK+L
Sbjct: 959 ICNVNKRSFPTAVWWIVCQNVDIWVNHAAKKKLKMFLSFLIRTSHQFLVSSDTKIGRQQT 1018
Query: 863 ---------------------------------MPSRFCRELKASILSSFHDINISSADW 922
+PSRFCREL AS+LSSFHDIN SS DW
Sbjct: 1019 NGFRQLKKVSLQQISSAALSDPIFYEHGFVCRFLPSRFCRELSASLLSSFHDINTSSTDW 1078
Query: 923 MEKIAKLE----PVCSSKRIPDENFPLTKAVSHSSDLLQTEDCKLKRDSPPSNVRSRACQ 982
ME + LE VCS KR PD++ PL K V+HSSD+L TEDCK K DS SN+ RACQ
Sbjct: 1079 MEVLCTLERLTTSVCSGKRTPDDSSPLAKTVNHSSDMLYTEDCKWKGDSSQSNLSFRACQ 1138
Query: 983 HLIDLLCWMPKGNISSRSFSLYTTYVLNLERQLVNAILDNQTTLCSPNQSELLKLFVSCR 1042
HLIDLLCWMPKGN SSRSFSLYTT+VL LERQLV+A+LDNQT LCS NQ ELLKLF SCR
Sbjct: 1139 HLIDLLCWMPKGNFSSRSFSLYTTHVLKLERQLVSALLDNQTVLCS-NQFELLKLFASCR 1198
Query: 1043 KALKYIFMASYEAGDRQSSSIPVPSENQFPVSWLFKSVSIVNQLQEASPEGSNRQIKDII 1102
KALKYIFMA YEA + QSSSIP+PSE+QFPVSWLFKS+SIVN++QEAS G+ +IKDII
Sbjct: 1199 KALKYIFMAYYEARNEQSSSIPLPSESQFPVSWLFKSISIVNRIQEASAGGTATKIKDII 1258
Query: 1103 FSLMDHTSYLFLTTSKYQFKNALRLIVIDNKPCKEQCQNVFHELVNGDDLCLDPTHCVEA 1162
FSLMDHTSYLFLTTSKYQFKNALRL+VIDNK C+E Q+V HEL +GD + LD THCVE
Sbjct: 1259 FSLMDHTSYLFLTTSKYQFKNALRLMVIDNKTCEEH-QDVCHELNDGDGVSLDSTHCVEV 1318
Query: 1163 YNSASQMTISLKEQVESEFISLKKSNVTVGDGKNSANMCKVYSLASCLNGFLWGLASAVD 1222
NSA QM+ISLKEQVESE ISL+KSNV+VGDGKNSA MCKV SLASCLNGFLWGLASAVD
Sbjct: 1319 CNSAIQMSISLKEQVESELISLRKSNVSVGDGKNSAQMCKVNSLASCLNGFLWGLASAVD 1378
Query: 1223 HTDLRNGSRHMRSMKLKCEYSSQLNLCINAISELLGLLLEMFLDRDSDRPTKLCDYHTSQ 1282
HTDLRNG+R RSMKLK EYSS+LNLC+NA SELL L+LEMFLDRDS PTKLCDY SQ
Sbjct: 1379 HTDLRNGNRRTRSMKLKFEYSSKLNLCMNATSELLDLILEMFLDRDSQWPTKLCDYQPSQ 1438
Query: 1283 DLLVVNEVSVKGSSSEVDTSFSKHHKFESSHNDDGTENVSFNRKRLKLENKSSVASILNE 1342
DLLVV+E+ VK S SE DTSFSKH + ESSH DDG+E+ N+KRLKLENKSSVASILNE
Sbjct: 1439 DLLVVDELPVKHSGSEADTSFSKHRELESSHCDDGSESGGTNKKRLKLENKSSVASILNE 1498
Query: 1343 ADSIELQSLNQPVLRGLLKGSYPEVAFALKQLFLAASVILRLHMKYDTVPLSSSFMAILI 1402
A++IE+QS NQ L+GLLKGSYP+VAFALKQLFLAASVILRLH +Y T+PLSSSFMAILI
Sbjct: 1499 ANTIEMQSFNQSFLQGLLKGSYPDVAFALKQLFLAASVILRLHKQYGTIPLSSSFMAILI 1558
Query: 1403 GISRLLLLEFVDMVEVPQPFLLACLDGVLKYLEELGHLFPFSDPMQSRNLYSRLIDIQLK 1462
G SR LLLEF +MVEVP+PFL ACLDGVLKYLEELGHLFP +DPM+SR+LYSRL+++ LK
Sbjct: 1559 GFSRFLLLEFENMVEVPEPFLFACLDGVLKYLEELGHLFPSADPMKSRDLYSRLVNLHLK 1618
Query: 1463 AIGKCISLLGKRATLASHETESTTKTLDGHLGLFEESSFPGVYRIDEFKTSLRMSFKVFI 1522
A+GKCISL KRATLASHETESTTKTLDG GLFEESSFP +Y +DEFK SLRMSFKVFI
Sbjct: 1619 AMGKCISLQRKRATLASHETESTTKTLDG--GLFEESSFPVIYCMDEFKASLRMSFKVFI 1678
Query: 1523 RKASELHLLSAIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSIVAAGVECLDLVLEFV 1582
R+ASELHLLSAIQAIERALVGVQEGCTA YEL SGSEDGG CSSIVAAG+ECLDLVLEFV
Sbjct: 1679 REASELHLLSAIQAIERALVGVQEGCTATYELCSGSEDGGSCSSIVAAGIECLDLVLEFV 1738
Query: 1583 SGRKCLGVVKRHIQSLIAGLFSIVLHLQSPQVFYTRMIDTKNKSGPDPGSVILMSVEVLT 1642
SGRKCLGVVKRHIQSLIAGLFSIVLHLQSP +FY R IDTK +S PDPG+VILMSVEVL
Sbjct: 1739 SGRKCLGVVKRHIQSLIAGLFSIVLHLQSPHIFYVRTIDTKGRSDPDPGAVILMSVEVLA 1798
Query: 1643 RVSGKHALFQMNAWHVAQCLRIPAALFEEFSLKLLGIPIQSDNSLISAQEVPNRVVTTTK 1702
RVSGKHA++QMNAW+VAQCLRIPAALFE+FSLKL GIP+QS+NSLIS E N VV T
Sbjct: 1799 RVSGKHAIYQMNAWYVAQCLRIPAALFEDFSLKLPGIPVQSENSLISTPEASNTVVATRN 1858
Query: 1703 SIIDRQFLIDLFAASCRLLYTVLKHHKSECKRCIAQLQASVSVLLHSLERVGPDPEFMGG 1762
SIIDRQFLIDLFAA CRLL+TVLKHHKSECK+ IAQLQASVSVLLHSLERV PDPE +GG
Sbjct: 1859 SIIDRQFLIDLFAACCRLLFTVLKHHKSECKQSIAQLQASVSVLLHSLERVDPDPELVGG 1918
Query: 1763 YFSWKVDEGVKCACFLRRIYEEIRQQRDIVGRHCSLFLSNYIWVYSGYGPLKFGIKREID 1810
YFSW VDEGVKCACFLRRIYEEIRQQR+ VGRHCSLFLSNYI VYSG GPLK GI+REID
Sbjct: 1919 YFSWNVDEGVKCACFLRRIYEEIRQQREFVGRHCSLFLSNYISVYSGLGPLKSGIRREID 1978
BLAST of Sgr023809 vs. TAIR 10
Match:
AT4G30150.1 (CONTAINS InterPro DOMAIN/s: Nucleolar 27S pre-rRNA processing, Urb2/Npa2 (InterPro:IPR018849); Has 58 Blast hits to 49 proteins in 21 species: Archae - 0; Bacteria - 2; Metazoa - 2; Fungi - 0; Plants - 44; Viruses - 3; Other Eukaryotes - 7 (source: NCBI BLink). )
HSP 1 Score: 1065.8 bits (2755), Expect = 3.7e-311
Identity = 686/1810 (37.90%), Postives = 1015/1810 (56.08%), Query Frame = 0
Query: 87 LEERDKYYDTNMMKLLEEVLSHALFHTVHIDGFLCLHGSEKVIKSHDEKLEESKAHIKSY 146
+E+++K + ++++L+E++LS ALFH+ HIDGFL L G++K + + +E+K +KSY
Sbjct: 337 IEDKNKDLEISLLRLIEDILSLALFHSAHIDGFLGLGGAKKYL----PESKENKTILKSY 396
Query: 147 HRHLFDKVQRLVAGKKFLALGAIGELFRVLVVRV-KTVKGGSKFED---TKLNNKMGC-- 206
HRH F K + ++ KK L L +G LF+V + RV K + ++ ++ TK +N
Sbjct: 397 HRHFFTKFKNMLLMKKELELSCMGSLFKVFIYRVMKQQRDPNQLQEGMMTKASNARQAEE 456
Query: 207 -VGPLRDDVSSDASYVLQGSADVPSEKSNYLSSLSTEIRKSLFEFFVQILDPLLHTID-H 266
L D ++D GS S KS+Y SSL E RKS+F+FF+ +++P+L I+ +
Sbjct: 457 RPWKLADTATND-----NGS----STKSHYSSSLRLETRKSIFDFFLHLMEPILLEINGY 516
Query: 267 ISAEIKQGPALSDVHCLLKSINNLLASFMKEKVYLRTEDNSEGAYFNFLKKVYDTVMLVS 326
+ + P L D C++KS N+LL +F E++Y++TED SEGA FL+ ++ T++ V+
Sbjct: 517 NQSGSEMAPLLGDFCCVIKSANSLLFNFAHERMYVKTEDASEGACSCFLRTIFKTIVSVA 576
Query: 327 SNLLLLLRHEIENNVDLEVFVLAANEILVTVGYLLEIEYDVIGNDLVSLWLVILSYSAIN 386
S L+ + E+ VL A E++ +GYLL IEY++I +DLV+LWL+ILS+
Sbjct: 577 SE----LKKHCPYDNGSEMHVLLAKELVTAIGYLLHIEYEIIESDLVTLWLIILSF---- 636
Query: 387 LSFTSIPKQYL-----LTSKIQELGCQLVVLYGQLRQVNISVVALCKAMRTVL------- 446
L F+S+ + LTS + LGCQL+ LY LRQV+++V +L KA+R V+
Sbjct: 637 LEFSSLSPENSEGDCPLTSLLVGLGCQLITLYSDLRQVSVAVFSLFKAVRLVMPVVTPAD 696
Query: 447 SNEHE---TEKSYASVMTSLGYEAYGKSVGMLLSSQEIKFAIHKAIKYVPEGQASGIIQQ 506
++ E TE+ S + E KSV LLSSQ ++ AIHKAIK +PEGQASG I+
Sbjct: 697 GDDDEMIATEELPLSTVFPFRLERSEKSVEKLLSSQALRLAIHKAIKVIPEGQASGCIKS 756
Query: 507 LTEDVTETLGWLRLCNLNLNIRNSTSDSTLQTMFLGRGLSEMYDLMLDSLMITSGNAFQI 566
LT DV++T+ W++ + +T FL LS++Y L+LDS+ IT+GN+ +
Sbjct: 757 LTADVSKTMKWIKQVCCS---TGATEQDGQVAAFLAGSLSDIYSLILDSITITTGNSNLV 816
Query: 567 GTSIENLISVIRPCMSCLVGLHSDGAKAFLVAIMGKTRDGMIADEDNCLGFGVTSHWVFV 626
G S+++L+ +I PC++ LV SD + FL A+ GK + ++A E + + +
Sbjct: 817 GQSMKDLLDLISPCLTHLVSSDSDCIENFLSALTGKDLEIVMA-EKKIETYRKSVRLFVI 876
Query: 627 FFFRLYMSCRSLYRQAISLMPPSSSRKMSTAIGDSFIAYSACDWMQRTDWSDEGYFSWII 686
F R+YMS RSLYRQ ISLMPP ++ M+ GDS DW++ W+ EGYFSWI
Sbjct: 877 FVLRIYMSSRSLYRQVISLMPPKKTKDMAGIKGDSVAVRCGSDWIKEKSWNYEGYFSWIS 936
Query: 687 EPSTSVLAVAQSVSSLYHQGTDVDWCPLVYVLHTMTLQRLVDLNRQIDSLEYVHQRNENL 746
+PS S++ + +S+ Y + D L+Y+L+ + LQRLVDLN I SL+YV Q ++N
Sbjct: 937 QPSASIVDTIKHISAFYLKDDSADCSLLIYILYGVALQRLVDLNSHIKSLDYVSQISDNQ 996
Query: 747 MQVEVLGDDGLSVLRKKSKKFGRLLSDLRKEAADLTDFMMKHLSLVAKHHILNSTKRNAT 806
+ +L + +S L++E +LTDF +L + +
Sbjct: 997 INDTML----------------KHVSVLKREGEELTDF------------LLGNNIISGF 1056
Query: 807 SNDKSTETLSEIDDWDFSICNVNKRSLPTAIWWIVCQNVDIWANHAAKKRLMPSRFCREL 866
+D + ET+ + D W + +N + LPT W++ Q++D+W HA KK+L F +L
Sbjct: 1057 VDDGTFETIKDTDQWVLRVSGINGKCLPTMRLWVLSQHIDLWCPHAGKKKL--KNFLSQL 1116
Query: 867 -KASILSSFHDINISSADWMEKIAK-------------------------------LEPV 926
+S+ + + +S+ W + K L P
Sbjct: 1117 IGSSVPCILNGVGMSTLGWENNVDKGSQKKKIGLEQFSFGLLFDSVLYEHEFVRRYLAPS 1176
Query: 927 CSS----------KRIPDE-NFP-----------LTKAVSHSSDLLQTEDCKLKRDSPPS 986
S K I +E NF L ++++ S L++E S
Sbjct: 1177 FSHVLKMTAETFFKDITEEVNFDSPSDWSEVLILLESSIANLSGKLKSEAFLEAHVSLLD 1236
Query: 987 NVRSRACQHLIDLLCWMPKGNISSRSFSLYTTYVLNLERQLVNAILDNQTTLCSPNQSEL 1046
N + ACQ+L++LL MPK + +SF LY +YVL+LER +V ++L L + L
Sbjct: 1237 NRKFTACQNLLNLLGVMPKEYTNKKSFQLYASYVLDLERFIVFSMLRCLNKLSCGDMQNL 1296
Query: 1047 LKLFVSCRKALKYIFMASYEAGDRQSSSIPVP-SENQFPVSWLFKSVSIVNQLQEASPEG 1106
LF +CRK LK I M S D+ + +P S++ SWLFKS Q
Sbjct: 1297 FSLFSTCRKTLKSIAMISC---DKVLGATKLPLSDSSLLASWLFKSAQAAT-CQVRFRND 1356
Query: 1107 SNRQIKDIIFSLMDHTSYLFLTTSKYQFKNALRLIVIDNKPCKEQCQNVFHELVNGDDLC 1166
+ +D +FSLMDHTSY+FLT SKYQF AL D
Sbjct: 1357 VTGKARDALFSLMDHTSYMFLTVSKYQFSKALPF----------------------SDEK 1416
Query: 1167 LDPTHCVEAYNSASQMTISLKEQVESEFISLKKSNVTVGDGK-----NSANMCKVYSLAS 1226
L + E A+ + +L EQ E+ +L+ T D K S + K+ + S
Sbjct: 1417 LISSEISEGTGQANLIIENLTEQAETLLNALR---ATFRDEKTAFKCESLILNKLTPIFS 1476
Query: 1227 CLNGFLWGLASAVDHTDLRNGSRHMRSMKLKCEYSSQLNLCINAISELLGLLLE-MFLDR 1286
C +G LWGLASAV + D++ ++ + ++ K E S+L+ I+ +S + + +FL
Sbjct: 1477 CFSGLLWGLASAVSNRDMQKNHQNAK-LRWKSEQFSKLSRIIHVLSNFFEVFAQCLFLSG 1536
Query: 1287 DSDRPTKLCDYHTSQDLLVVNEVSVKGSSSEVDTSFSKHHKFESSHNDDGTENVSFNRKR 1346
D R E+ T+ + + + +G
Sbjct: 1537 DVQR--------------------------EIQTNINWTRLLDGTEGSNG---------- 1596
Query: 1347 LKLENKSSVASILNEADSIELQSLNQPVLRGLLKGSYPEVAFALKQLFLAASVILRLHMK 1406
L D +E + + ++ L+KG EV ALK L +A++ ILRL+++
Sbjct: 1597 ------------LVCGDVVETSDVKKKIIESLIKGDSSEVVLALKHLLIASAAILRLNLQ 1656
Query: 1407 YDTVPLSSSFMAILIGISRLLLLEFVDMVEVPQPFLLACLDGVLKYLEELGHLFPFSDPM 1466
D + S +F+++L IS LL F DM E P F LDG +K +EELG F S+P
Sbjct: 1657 IDGITFSPTFVSVLTNISNDLLSVFADMSEAPLEFSFIWLDGAVKVVEELGSQFCLSNPT 1716
Query: 1467 QSRNLYSRLIDIQLKAIGKCISLLGKRATLASHETESTTKTLDGHLGLFEESSFPGVYRI 1526
+ +LYS+LI++ LK IGKCISL GK ATL SHET T + L L E+ ++ +
Sbjct: 1717 LNIDLYSKLIELHLKVIGKCISLQGKEATLESHETGFGTNAIHAKLVLTEKKRSHRLHWL 1776
Query: 1527 DEFKTSLRMSFKVFIRKASELHLLSAIQAIERALVGVQEGCTAIYELYSGSEDGGRCSSI 1586
DE K LRMSFKVFI +SELHLLS +QAIERALVGV E C AIY + +G+ DGGR S
Sbjct: 1777 DELKQRLRMSFKVFIHSSSELHLLSGVQAIERALVGVWEVCPAIYCIQTGNRDGGRISET 1836
Query: 1587 VAAGVECLDLVLEFVSGRKCLGVVKRHIQSLIAGLFSIVLHLQSPQVFYTR-MIDTKNKS 1646
VAAG++CLDL+LE +GRK L VVKRHIQ L++ +F I+ H+QSP +F++ ++ + +
Sbjct: 1837 VAAGLDCLDLILEHATGRKRLNVVKRHIQGLMSAVFGIMAHMQSPFIFFSNAVVGNQGSN 1896
Query: 1647 GPDPGSVILMSVEVLTRVSGKHALFQMNAWHVAQCLRIPAALFEEF-SLKLLGIPIQSDN 1706
PD G+VILM V VL R++GKHALF+M++ HV+Q + IP A+F ++ +G + D
Sbjct: 1897 SPDSGAVILMCVGVLIRIAGKHALFRMDSSHVSQSIHIPGAIFLDYLHATRVGFSV-LDG 1956
Query: 1707 SLISAQEVPNRVVTTTKSI-IDRQFLIDLFAASCRLLYTVLKHHKSECKRCIAQLQASVS 1766
+L+S + ++ +K + +DR+F + L+AA CRLLYT +KHHKS+ + IA LQ SVS
Sbjct: 1957 NLLSKDDQQQDLLGCSKELQVDRKFSVSLYAACCRLLYTAVKHHKSQTEGSIATLQESVS 2009
Query: 1767 VLLHSLERVGPDPEFMGGYFSWKVDEGVKCACFLRRIYEEIRQQRDIVGRHCSLFLSNYI 1810
LLH LE G + +G SW+V+EG++CACFLRRIYEE+RQQ+++ G+HC FLS YI
Sbjct: 2017 ALLHCLETAGKN---LGNCVSWEVEEGIRCACFLRRIYEELRQQKEVFGQHCFKFLSTYI 2009
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022146288.1 | 0.0e+00 | 72.62 | uncharacterized protein LOC111015534 isoform X1 [Momordica charantia] | [more] |
XP_022146289.1 | 0.0e+00 | 72.57 | uncharacterized protein LOC111015534 isoform X2 [Momordica charantia] | [more] |
XP_023533222.1 | 0.0e+00 | 70.81 | uncharacterized protein LOC111795175 isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
KAG6605008.1 | 0.0e+00 | 70.57 | hypothetical protein SDJN03_02325, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022947378.1 | 0.0e+00 | 70.52 | uncharacterized protein LOC111451261 isoform X1 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1CY73 | 0.0e+00 | 72.62 | uncharacterized protein LOC111015534 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1CXQ2 | 0.0e+00 | 72.57 | uncharacterized protein LOC111015534 isoform X2 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1G6F1 | 0.0e+00 | 70.52 | uncharacterized protein LOC111451261 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1L579 | 0.0e+00 | 70.52 | uncharacterized protein LOC111500003 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1G6M2 | 0.0e+00 | 78.24 | uncharacterized protein LOC111451261 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |
AT4G30150.1 | 3.7e-311 | 37.90 | CONTAINS InterPro DOMAIN/s: Nucleolar 27S pre-rRNA processing, Urb2/Npa2 (InterP... | [more] |