Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, CDS, polypeptide, three_prime_UTR
ATGGCTAATCCTGGCGTTGGGTCCAAGTTTGTGTCCGTGAATCTCAATAAATCCTATGGACAGGCTCATCATCATTCCAACTCTTATGGATCAAATCGAACCCGGCCTGGTACTCATGGGGCCGGAGGAGGAATGGTGGTCCTTTCGAGGCCTCGAAGTTCCCAGAAACCTGGGCCGAAGCTTTCTGTTCCACCCCCCTTGAATCTGCCTTCCTTGCGAAAGGAGCATCAGAGACTTGATTCTTTAGGTTCATCTGCTGGCCCAACGGGTGGAGGGGTTATGGGAAACGGGCAGAGGCCAACTTCAGCTGGCATCGGTTGGACAAAGCCAAGGACCAACGATTTTCCGGACAAAGAAGCACTTAATGGTAATGTAGTCGACAGAATTGATCCATCTCTGCGAAGTGTCGATGGGGTGAGTGGTGGGAGCAGTGTGTATATGCCTCCTTCGGCCCGTGCTGGTATAACACAGCCCGATTGCGTCTCCAGTTGCTTTCTCTCGGTGCATACTACACTTGAAAAGACCCCAATTTTGGGAGGCGAGGACTTCCCTTCTTTGCAAGCAACGTTACCATCTGCCGTTGCGCCTCCTCAGAAACAAAAAGATGGGATGAATTCTAAATTGAAGCATGCATCTGAAGGTTCATATGAAGAACGGAGGGATACTTCTCATTTAAGTTCAAGTATAGATGCCCGCACAAAATATCGGTCATCACTTAAAAGTCCTCCTAGGGAAAATTCAAAAAATGGAGATTTTTTCAGTTCACCAGAGTTTTCTCGGAAACAGGAAGATATTTTCCCCGATCCTTTACCACTCGTCTCAGTGAATCCAAGATCAGACTGGGCTGATGATGAACGTGATACAAGTCAAGGTTTGGCTGGCATGGGAAGGGACCGAGGCCACCCTAAGAGTGAGGCTTATTGGGAGAGGGACTTTGATATGCCCCGGGTTAGTTCTCTTCCCCACAAGCCCATTCCTAATTTTTCTCAGAAATGGAATATGCGGGATAATGAATCTGGAAAGTTTCGCTCCAGTGACATTCGTAAAGTGGACCCTTATGGTCGAGATGCAAGAACACCTAGTAGAGAAGGCTGGGAAGGAAGCTTCCAAAAAAACAATCCTATTCCTAAAGATGGATTTCATTCAGATAGTGGAAATCCTAGAAATGATATTGCAGCGAGGCCCACTAGCATTGATCGGGAAGCAAATGCCGGTAACATGCATGTTTCACATTTTCAAGAACATGCTCATAAAGATGGGAGGAAGGATACTGGATTTGGACAGACAGGGCGCCAAACATGGAATCGTGCAACAGAATCCTATAGCTCACAGGAACCAGATCGAACTACAAGAGACAAGTATGGTAGTGAGCAACATAATAGGTACAAGGGTGAAATACACAATACTTCAGTTGCAAATTCATCATACTATTCAGGTTCAAAACAAATTCCTACAGATGAACCATTGCTGAATTTTGGTAAGGACAGACGTTCTTTTACAAAGAATGAGAAACCCTTCATGGAAGATCCTTTTATGAAAGATTTTGGTGCTGATCCTTTTACTGCTGGTCTTGTTGGGATGTTTAAGAAGAAGGATGTGATTAAGCATACTGATTTTCATGACCCTGTTAGGGAGTCCTTTGAGGCTGAGCTTGAGAGAGTTCAACAGATGCAAGAACAGGAACGACAAAGAATTATTGAGGAGCAAGAAAGAGCTTTGGAACTGTCTAGAAGAGAAGAAGAAGAGAGAAAGAGGATAGCAAGGGAACATGAAGAAAGGCAGAGAAGAGCTGAAGAAGAAGCTAGAGAAGTAGCATGGAGAGCTGAACAAGAACGACTGGAGAATATCCAAAAGGCTGAAGAACTTCGAATAGCTAGAGAGAGAGATAAACAAAGGATTATTTTGGAGGAAGAGAGAAGAAAGCAAGCTGCTAAGCTAAAGCTTTTAGAAGTAGAGGAAAGGATGGCCAAGAGGCAGGCTGAAGCTGTGAAATCAAGCAG
TTTGACTTCGGATATTCCTGAAAAGAATATTTCCAATGTTGCGAAGGATGTTTCCAGGGTGGCGGACACTATTGATTGGGAAGATGGTGAAAAAATGGTGGAGCGAATCACTACATCAGCTTCTTCTGAGTCATCTAGTATGGTTAGGTCCTCTGAGGTAGGCCCTAGATCTCAATTTTCTAGAGATGCTTCTTCTGCCTTTGTGGACAGAGGGAAATTGGTTAATTCATATAGAAGAGATCTTTATGAGAGAAGAAGTGGCTCTCAATTCGTTCTACAAGGCCAGAGTATTGGCTACAACAGTCAAAGGCAAGAGCCATTTGTTGGTGGGCAATTATCTTCTAGAAAAGAGTTTTATGGGGGAGCTGGGTTTACAACTTCTAGGATATCTCATGGAAGAGGTATTACAAAACCGCAATCTGATGATTATTCTGAGCTAAGAGTGCAGAGACCCAACCTTTCTGGGAGTGGTGATCATTATAACAAGAGCCAAGAGTTTGACTCAGAATTACAGGATAGTTTCGAGAATTTCGGTGATCATGGATGGAGGCAAGAGGGTGGTCACAACAACGTCTATTTTCCTTACCCTGAACGAGTAAATTCGATTTCTGAGCCTGATGGGTCCTACTATGTTGGAAGGACACGCTATTCCCAGAAGCAACCTCGAGTTCTTCCTCCTCCATCTGTAGCTTTGATGCAGAAATCTTCTATCAGGGGTGAATATGAATCTGTTACTCGGGATACTCATCCGGTAAGGAATGTTTCTACTGCTCAGGCAAGGTATATTCATCATGAAAATTGTACCCTATCAAAGATAATTGACGTTAATTTTGAAAATGTTGAGAATGAAGAGCAGAAACCAGATGGTGGCATAACTCTGCGGTGTGACTCACAGTCAACTCTGTCTGTGTTTAGCCCCCCAATCTCTCCAACTCATCTATCTCACGAGGACTTGGATGATTCTGGAGATTCACCTGTTTTATCTGCTAGCAGAGAAGGCACATTGTCGATAGAGGATAATGAATCTGCAGTACCTGCCAAGGCTAGTAAAGTGATCATGATGACCTCTACTAGAGTATCTACAGGTGATGAAGATGATTGGGTTGTTGCAGACGAGCATGTTCAAGAACAAGAAGAATATGATGAAGATGATGATGGATATCAGGAAGAAGACGAAGTTCATGAAGGGGAGGATGAGAATATTGACCTTGAACAAGATTTTGATGATTTGCAGTTAAATGATACTGATAGAGGGTCACCCCACATGTTAGATAACTTGGTATTAGGTTTTAATGAAGGCGTTGAAGTTGTGATGCCAAATGATGAGTTTGAAAGAATTCCTGGAGATGAAGAAAATATGTATGTTGCAACAGAAATTTCAAGTTGCATCAAAGAAGAACAGGGGTCTTCTGAAGGATTGCAAGTTGATGGTAAAGTCTGTCAATATGTGGATGCTTCTTCTCAAGTAAGGATTTCTGATCTTGAGGAGATGCAGGACAAGGTTATGCAATCGAAAATTGCCCAAGCATTACCAGAACTTGAAATTACCGAGCAAGGAAATTCCTGTAGATCTAGTTTGTCTGTTCAACAACCAATCTCATGTTCAGTTACAATGGGCTCACAATCTTCATCTGGTCAAGTTATTGTGCCGAATGCTGTCCTTTCAGGTCAAGCCGAGCCTCCTTTTAAGCTTCAGTTTGGGTTGTTCTCAGGTCCTTCTCTCATACCATCTCATGTACCATCTATACAGATAGGCTCTATACAGATGCCTCTTCATTTGCATCCCCAGATTACTCCATCTATAATTCAGATGCATTCATCACAACCCCCTTTATTCCAGTTTGGGCAGCTTAGGTACCCGTCTTCTGTCCCCCAAGGTGTACTGCCTTTGCCTCCTCAACCGCCAACATTTATTCTGCCCATTGTTCAGACTGGTTTTCCTTCAAATGAAAATCCAAGGGAAGCTCTGTCTGTTCAAACTCCTGAGGAAACTTGTTTTA
ACAAGTCCCGTAAACATAATGTGTTTCCTTTTATGAAGGATAATCAGGTCCTTGTGTCAAGATCTCTGAATGTGAATTCATCAGGCGATTCAAAGTCATTACCTTTAACAGAAAGTATAGAAGCCAAAGTTATGAATCAGCAAGATCAAACTTCAAGTTCTTGCATAGACGAGAGCAATTCCAGATCTCAACCAGGTTTTCAAGCGGAACATCAGAGGCACCATGTTCCAACTTCAGGCAATCATTACATGGTATCAAGGGGAAAAGAATCTAAAGGTCGAGCTCAGGATGGGATGTGGCCATTTGATTCTGTTTCAAGAGATAAGGGTTTGAGGAGGTTTAAAACGCGTGGCCTATTTCCTGGTGGAAGAGGGAAAAAATTTATCTTTGCAGTAAAAAATTCTGGATCTAGATTGCCCTTTGCAGGTTCTGAATCTACTCGCTTAGATAATGGTGGACTTCAGAGGCAGCCTAGGCGCAATACTCCACGTACTGAGTTTCGTGTTAGGGAAACTGTGGATAAAAAATTGCCGAACAGTCAAGTTTCTTCAAACTACGTAGAGGTAGATGATAAGCCAACTGATAGTGGAATAAGTGCGGTCAATTCGGGCAGAAATGGAACTAGGAAGGTTGTCATATCTAATAAGCCGTCAAAAAGAGCGCTAGAGTCTGAAGGATTTAGCTCTGTTGTGAGTACTTCTCTAGAGCTTGATTTTGGTAATAGATCTGAAAAGGGAGTGAAAAAAGATTATTTGGGCAAGAGCCAGGGAAACCAATATTCGGGAGAGGGTATCTTCAGAAAGAATATTGGTTCTTGGGAGGATGTTGACGCTCCTTTGCAGAGTGGGATCATACGTGTATTTGAGCAACCTGGCATAGAGGCCCCCAGTGATGAGGACGATTTCATTGAGGTACGATCTAAAAGGCAGATGCTGAATGATAGGCGTGAACAAAGAGAGAAAGAGATCAAGGCAAAGTCCCACAATTCTAAGGTTTATACGCTATTCTTCATTTCAATTCTTAATTAATTATTTGTTTGTTTGTAACTGCAAGTTTATAGTTTAACTAATTTGATGAGTACTTTTTTAGATCCCACCAAAAAGTCGATCTACTTTGAAAAGTGCATTATCCTCCGTCAATTCAAGGAAGGTTTATGCCGCTAAGGAAGCAGAAATGGTGAAGAGAACTCGATTGGATTTTGCTACTCGTGATGGACGTGGTTCAGGAAGTATTATGGTGTCAAGTGCATTTAGTTCTCCATTAGTCTCTCAACCATTGGCCCCAATCGGTACTCCTGCTCTGAAGTCTGATTCCCAGATGGAAAGGTTGCATACTGCTAGGTTGGTATACATATCATAAAGCAACTTACATGCATCAATCTTTCAGATCTAACATTTTATATTATCTTTTTAGGCCTATACTGACAAGTACCCCAGCTTTGGCAACCAGTAACGGAAGAAATCTCGAGTCAGGCTTGATGTTTGGCAAGAAGAACGATATTTTGGATAATGTACAAACATCTTTGACTTCTTGGGGCAATTCTCATAGAAATCAACAGGTACAAAGATGGAAGTATTGACCTTATTGAGTTTTATATTTGTTCATTATCTATTTTGCTGGATGCTAAAATGAGATTTTGATATTTGCCTTGTTGACCTTTATTCACTGCTTACTTCTTCTGGATTTTAATAGGCAATTTTAATATTTTTGTTTTGTTTTACTTTGGTAGAAGACCCTTGTTCTAATGTTTCATGATTTCAATTTCAGCTCAAGGAGGGTTAAATCTTAATCTAAGTTCAAAATGATTGTACTTAGATTTATATTTGTAATGTGCTTTTACTTGTATATGTTTTATAATCTGATTGAATCTTAAACTGATTAGACACATTGGCAGGTTATGGCCCTGACACAAACCCAGCTTGATGAGGCTATGAAGCCTGCTCAGTTTGATTTACATCCCCCGGTAGATCATTCTAGTTTAACTGGTGATCCTTTTATTG
TGTCATCACCATCGATTTTGGCAAGCGATAGGTCATTTTCTTCTGCTGCTAATCCAATCAGTTCTCTGCTGGCTGGGGAGAAAATTCAGTTTGGTGAGTATTGGAAGAACCGCAGGCTTCAAAACCTGATATTAAACTGATTTTAAGAGTTCAAAATTTATTAATTCTAATTCTCTTTTCAATTATGCATGTAGGTGCAGTCACATCTCCAACAGTTCTTCCACCTGGTAGCTGTGCCACTTTTCTTGGTATTGGCCCCACAGATCTCTGTCACTCCGACATCCAAATTCCTCACAAACTTTCCGGTGCTGAGAATGATTGTAATCTATTCTTTGATAAAGAGAAACATTACTCTGAATCTTGTTCTCCTATTGAAAATAGTGAAGCTGAAGCTGAGGCAGCTGCTTCTGCCGTTGCTGTTGCAGCTATCAGTAGTGATGAGATACTAGTTACGAATGGGCTTGACACATGCTCTGTTTCTGTTACTGATACCAATAATATTGGTGGTGGGGATGTTGACATTGTAACAGCAGGTAATAGAAGCTGTGGTCTGTTAATTCGATTAGTTGACCTCTTATCTTGGTTGCAAATTTTGTTATACAACTTTGATGACGCTTACTGTATTACAGGTACAGCTGGTGATCAGCAATTAGCTAGCAAAACAAGGGCCGATGACTCCCTAACTGTCACACTGCCTGCAGATTTGTCTGTTGAGACTCCCCCGATTTCCCTGTGGCCAACTTTGCCAAGTCCACAGAATTCTTCAAGCCAGATGCTTTCACATTTCCCCGGAGGTTCACCTTCTCAATTTCCCTTTTACGAGATAAATCCTATGCTGGGAGGCCCTGTCTTTACTTTTGGACCCCATGACGAGTCTGTATCCACCACCCAAGCTCAAACTCAAAAATGTAGTGCACCGGCACCTGGCCCTCTTGGATCCTGGAAACAGTGCCATTCTGGTGTAGATTCATTCTATGGGCCTCCTGCTAGTTTTACCAGTCCATTTGTAAGTCCTGGAGGCATCCCGGGGGTTCAAGGTCCTCCGCACATGGTTGTGTACAATCACTTCGCTCCTGTTGGACAGTTTGGGCAGGTTGGCTTGAGTTTCATGGGGGCGACGTATATTCCTTCTGGAAAACAGCATGACTGGAAGCACAGCCCTGGACCTTCTTTGTGCGTTGAAGATCAGAAAAATTTAAATATGGTTTCGGCTCAACGCATGCCCACCAACTTACCTCCAATTCAGCATCTTGCTCCTGGATCACCCCTGCTGCCGATGGCTTCTCCATTAGCTATGTTTGATGTTTCTCCATTCCAGGTACAGTTCGTTTATTTTATTCCCCTTTCACCTTGTTGTGTAGCTCGCGTGGGAGAGGATTTTTTGTGTGTTACGGACTCTAGACATAGTATTTAGAAAAAATGTACTACTATTAGTCCAAGAATGATAGGATTTGTGGTAGGGGTATTCTAGTAAATAGTGAAGAGTTGCTAGGCATTGTTGGATTATAAATAGAGGTAGTGGGGAGGAGAAAGTTATAAAAAATAGTGGCCATTTCGGTATTTGGTTTGAGAGAGTATTCTCAAGAATGGGAGGTTTAATTACCTACTTGGTTAGATTGAAATTTCTTTATCCTTTATATTTCAATGCATAGTTCGGTTTCTAGTTCGGGCCTTAACAATTGGTATTAGAGCTGTTTGATTCTGGCCATAATAATGTATGATGAAGACGTTTTTCGTGAGGTGATGAAATGTATGGTGTCGAGGCTAGATTCATGTCAAGCTGTAGTGAAGGAGAGTATGGATTCAATTCATCCAAAATTGTGATAGTAAACAATTTTAGAGAAATGATAAGTTAGTTTACGGGACTTGAAAGACAAGCGCAGAGAGTTGTGGAAGGAAATCGCAAAAGAAAATTAAAGCAAAGATGAGACAAATAGTGCTTATGTTCTACAGATTTGTTCCAAATTAGAAAAACATTTAAAAAAGAGTGTACAGGGAGC
TCTTAAAGAAGGATGAAGTTCAAAAAGAAAGTTAGTGGCAAGATGAGGTCGTTGTAAATGGGTATCGCTGCCGACAATGGTTGGAGATGAAGAGGTGGCAGATCCGTAGACACAAGTGACAGTGGAAAAGGAAGATGATGAAGCTGAAGCAAGTGCTGAAATGGATGGTGGTGGTCGCTGGATAAGGGAAGCTCCACCGGAGAGACATGGTTGCCAATCGTCCATTGCAAAAAATTATATGAACACGCAAACGAGGAAGATGTGTGATGTGCATTGGAAGCCGAAGAAGAAGATACATTGGATATGGGAGGAAGAATATGATGAAAAAGTGGTAGAAGTTGAAAGGGGGAGAAAAGGAAGCGGTAAAGAAGAAAGAGCCGATGTCAGTGTGGTGATGTGGTAGGAAAAACATAGAAATTTTGGCTTTTCGCCTGGGTTCTCAGGGAAGTAAGGAAAATAAATAAAAGAATAAAAGAATAAGGATATTTAGTTGGTGGCTAGGGTTTTTAATGGATTTTGGTTACAAAAGCTATAGGGATTGGACTATTTTTTGCAAGCCAGTAATGGGTTTTGTTGTTTTGGGTTGTTGGGCTGTTTTTTCTTTTGAATGGGTTGTTTTTAGATTTTACATGGGTTAAAAAAAAGTTTTGTTTACCCCCACTTTGAGGACGAGGTGTTTTTTAGGTGTCGGTAATGATAGGATTTGTGGTAGGAGTATTTAAGTAAGAGATCAATTTAGGTAAAAAAAAAAATCAATTTAGGTCATGCCTTGACACGAGCTATCTTTTCATACCCACATGTTATTCTTTATTCCCAAAAAATAGGAAAAAAAAGTGGTCCAAATTTGAGTGAAGGGTTTATTAGGCATTTTGGATTATAAATGGAGGTAGTGGGGAGAGAGTTAGAAAAAGTATTGGGGCCTTTTTGCTATTGGGCTTGAGAGAGTATTGTAAAGAGTCAAGAATAGGTGAAATTTCTTTATCCTTTATATTTCAATGCGAGCTTCGAGAATACATGGGATCTTTATAAAGTTGATCATTATGAGTGTTGTGACGAATTTGATTAGGAAGTTCAGTGGCAAAAAGGAGAAGATTCATTAGCTCATTATTTAGTCGGCCTTGGCTAAAAAAGGGACAAAGTGTCAGTTGCATTTCAATATTTCAATACATTGTTCGGTTTCTAGTTTGGGTCCTATTTTATATGCATGAACTGCGTCTCCGGTGTTACATAATAAAGTAGATCATAGATGAATAGATTGGCAGTTGCACTAACCAATTTCTTTTTTTCTGCTACAGAAAGTTCTGATGTTTGCTGTTAAATATTTGCAGGCCTCTCCGGAAATTTCTGCTCAAACTCGTTGGCCCTCTTCAGCATCAACTGTTCAGGCTGTGCCTCTTTCCATGCCTTTGCAGCATCAGGCTGGGGGCGTACTTCCTTCTCATTTTAGTCATCCGTCATCTGCTGACCCATCATTTACAGTCAACAGGTTTCCTGGATCACAACCCGCAGACCAGAAGCGTAATTTTCCTGTGGGGGCTGATGCATCTGTCACCCAACTTCCAGATGAACTTGAAATAGTTGATGAATCTAGTTGCCTTAGTTCTGGGGCTGGAGTGCCTAATGTTGACACTAACAGCTTAGTGGTTAACTCGGCTACTGATGCTGGCAAGACTAGTGTTCGTCCGAATTGCAGTAGCAACAACAGTGGCCAGAATGCAAGCACCAATTTAAAATCTCAGTATCCTCATAAGGGCATATCTGCCCATCAATACACTCATTCTTCTGGTTACAATTATCAGAGAGGTGGTGCTTCTCAAAAGAATAGTTCAGGTGGCAGCGAATGGGCCCACCGGCGAACACGGTTCATGGGAAGGAACCAGTCTGGAGCTGAAAAGAACTTTTCCTCTGGAAAGGTGAAGCAAATTTATGTTGCCAAGCAATCATCAAGCAGAAATCTCGGGAGTTTAGAAAAGGGAGCTGCCTAGATTTCTGATTTGCTT
CCATTGAACGGATTGTTTCGGATCCAGAAAACAAAGTTTTGGTTGGCTTTTCTTTTTGTGTTTACTTTACATTAGTAAAGTGTTGGAATTAGTCGATCTGTAGAGACCATGAACGCTGGTGGCGATGATGCCATTCCAGACGAAAGGGATTATTAAGTTTCATTCTTCCAAGCGATGTGCCAGGACAGTGGGAGTCGTCAATCATAGTATGAAGTTTTTGTCGACCTCGAAAAGTTTTATTATTTATTTGGACTGATGTCACATTTGCAGTGCAATCAGATGATCAAATTGGGGGCCTGTCGCCTAATTCTAATGCCATTTGATGTCTTTCTATTTTTATTTTCTGTGCATCTAATTCTGTGTTTCCAAGAATGGATGGCAAAGCCAAGTCTGTTTAGCATATTGGAAACTCCTCAAAAACTATGAACATACTGTTTTCCTCTCATCTCCCAGAAGTTACGGTAAATTTAACGTAAACGTTATTGTTGTTGCCCATATTTTCTGATTTAAAATTTTGATGATTTGATATGGGATGTTTTAGTAGAGGTGGTTTCTTTTCCCTTTTGCCTTTACCGTGTTGAAATACTCTGTACCGAGGGACAGCCTTATTACTTCGTTTATGGCGTATCCAGGAAAGTAGTCTTGTTTTGTTATTGATTTGCCTAGCTAACGAAAAAGTGATGAAAC
mRNA sequence
ATGGCTAATCCTGGCGTTGGGTCCAAGTTTGTGTCCGTGAATCTCAATAAATCCTATGGACAGGCTCATCATCATTCCAACTCTTATGGATCAAATCGAACCCGGCCTGGTACTCATGGGGCCGGAGGAGGAATGGTGGTCCTTTCGAGGCCTCGAAGTTCCCAGAAACCTGGGCCGAAGCTTTCTGTTCCACCCCCCTTGAATCTGCCTTCCTTGCGAAAGGAGCATCAGAGACTTGATTCTTTAGGTTCATCTGCTGGCCCAACGGGTGGAGGGGTTATGGGAAACGGGCAGAGGCCAACTTCAGCTGGCATCGGTTGGACAAAGCCAAGGACCAACGATTTTCCGGACAAAGAAGCACTTAATGGTAATGTAGTCGACAGAATTGATCCATCTCTGCGAAGTGTCGATGGGGTGAGTGGTGGGAGCAGTGTGTATATGCCTCCTTCGGCCCGTGCTGGTATAACACAGCCCGATTGCGTCTCCAGTTGCTTTCTCTCGGTGCATACTACACTTGAAAAGACCCCAATTTTGGGAGGCGAGGACTTCCCTTCTTTGCAAGCAACGTTACCATCTGCCGTTGCGCCTCCTCAGAAACAAAAAGATGGGATGAATTCTAAATTGAAGCATGCATCTGAAGGTTCATATGAAGAACGGAGGGATACTTCTCATTTAAGTTCAAGTATAGATGCCCGCACAAAATATCGGTCATCACTTAAAAGTCCTCCTAGGGAAAATTCAAAAAATGGAGATTTTTTCAGTTCACCAGAGTTTTCTCGGAAACAGGAAGATATTTTCCCCGATCCTTTACCACTCGTCTCAGTGAATCCAAGATCAGACTGGGCTGATGATGAACGTGATACAAGTCAAGGTTTGGCTGGCATGGGAAGGGACCGAGGCCACCCTAAGAGTGAGGCTTATTGGGAGAGGGACTTTGATATGCCCCGGGTTAGTTCTCTTCCCCACAAGCCCATTCCTAATTTTTCTCAGAAATGGAATATGCGGGATAATGAATCTGGAAAGTTTCGCTCCAGTGACATTCGTAAAGTGGACCCTTATGGTCGAGATGCAAGAACACCTAGTAGAGAAGGCTGGGAAGGAAGCTTCCAAAAAAACAATCCTATTCCTAAAGATGGATTTCATTCAGATAGTGGAAATCCTAGAAATGATATTGCAGCGAGGCCCACTAGCATTGATCGGGAAGCAAATGCCGGTAACATGCATGTTTCACATTTTCAAGAACATGCTCATAAAGATGGGAGGAAGGATACTGGATTTGGACAGACAGGGCGCCAAACATGGAATCGTGCAACAGAATCCTATAGCTCACAGGAACCAGATCGAACTACAAGAGACAAGTATGGTAGTGAGCAACATAATAGGTACAAGGGTGAAATACACAATACTTCAGTTGCAAATTCATCATACTATTCAGGTTCAAAACAAATTCCTACAGATGAACCATTGCTGAATTTTGGTAAGGACAGACGTTCTTTTACAAAGAATGAGAAACCCTTCATGGAAGATCCTTTTATGAAAGATTTTGGTGCTGATCCTTTTACTGCTGGTCTTGTTGGGATGTTTAAGAAGAAGGATGTGATTAAGCATACTGATTTTCATGACCCTGTTAGGGAGTCCTTTGAGGCTGAGCTTGAGAGAGTTCAACAGATGCAAGAACAGGAACGACAAAGAATTATTGAGGAGCAAGAAAGAGCTTTGGAACTGTCTAGAAGAGAAGAAGAAGAGAGAAAGAGGATAGCAAGGGAACATGAAGAAAGGCAGAGAAGAGCTGAAGAAGAAGCTAGAGAAGTAGCATGGAGAGCTGAACAAGAACGACTGGAGAATATCCAAAAGGCTGAAGAACTTCGAATAGCTAGAGAGAGAGATAAACAAAGGATTATTTTGGAGGAAGAGAGAAGAAAGCAAGCTGCTAAGCTAAAGCTTTTAGAAGTAGAGGAAAGGATGGCCAAGAGGCAGGCTGAAGCTGTGAAATCAAGCAG
TTTGACTTCGGATATTCCTGAAAAGAATATTTCCAATGTTGCGAAGGATGTTTCCAGGGTGGCGGACACTATTGATTGGGAAGATGGTGAAAAAATGGTGGAGCGAATCACTACATCAGCTTCTTCTGAGTCATCTAGTATGGTTAGGTCCTCTGAGGTAGGCCCTAGATCTCAATTTTCTAGAGATGCTTCTTCTGCCTTTGTGGACAGAGGGAAATTGGTTAATTCATATAGAAGAGATCTTTATGAGAGAAGAAGTGGCTCTCAATTCGTTCTACAAGGCCAGAGTATTGGCTACAACAGTCAAAGGCAAGAGCCATTTGTTGGTGGGCAATTATCTTCTAGAAAAGAGTTTTATGGGGGAGCTGGGTTTACAACTTCTAGGATATCTCATGGAAGAGGTATTACAAAACCGCAATCTGATGATTATTCTGAGCTAAGAGTGCAGAGACCCAACCTTTCTGGGAGTGGTGATCATTATAACAAGAGCCAAGAGTTTGACTCAGAATTACAGGATAGTTTCGAGAATTTCGGTGATCATGGATGGAGGCAAGAGGGTGGTCACAACAACGTCTATTTTCCTTACCCTGAACGAGTAAATTCGATTTCTGAGCCTGATGGGTCCTACTATGTTGGAAGGACACGCTATTCCCAGAAGCAACCTCGAGTTCTTCCTCCTCCATCTGTAGCTTTGATGCAGAAATCTTCTATCAGGGGTGAATATGAATCTGTTACTCGGGATACTCATCCGGTAAGGAATGTTTCTACTGCTCAGGCAAGGTATATTCATCATGAAAATTGTACCCTATCAAAGATAATTGACGTTAATTTTGAAAATGTTGAGAATGAAGAGCAGAAACCAGATGGTGGCATAACTCTGCGGTGTGACTCACAGTCAACTCTGTCTGTGTTTAGCCCCCCAATCTCTCCAACTCATCTATCTCACGAGGACTTGGATGATTCTGGAGATTCACCTGTTTTATCTGCTAGCAGAGAAGGCACATTGTCGATAGAGGATAATGAATCTGCAGTACCTGCCAAGGCTAGTAAAGTGATCATGATGACCTCTACTAGAGTATCTACAGGTGATGAAGATGATTGGGTTGTTGCAGACGAGCATGTTCAAGAACAAGAAGAATATGATGAAGATGATGATGGATATCAGGAAGAAGACGAAGTTCATGAAGGGGAGGATGAGAATATTGACCTTGAACAAGATTTTGATGATTTGCAGTTAAATGATACTGATAGAGGGTCACCCCACATGTTAGATAACTTGGTATTAGGTTTTAATGAAGGCGTTGAAGTTGTGATGCCAAATGATGAGTTTGAAAGAATTCCTGGAGATGAAGAAAATATGTATGTTGCAACAGAAATTTCAAGTTGCATCAAAGAAGAACAGGGGTCTTCTGAAGGATTGCAAGTTGATGGTAAAGTCTGTCAATATGTGGATGCTTCTTCTCAAGTAAGGATTTCTGATCTTGAGGAGATGCAGGACAAGGTTATGCAATCGAAAATTGCCCAAGCATTACCAGAACTTGAAATTACCGAGCAAGGAAATTCCTGTAGATCTAGTTTGTCTGTTCAACAACCAATCTCATGTTCAGTTACAATGGGCTCACAATCTTCATCTGGTCAAGTTATTGTGCCGAATGCTGTCCTTTCAGGTCAAGCCGAGCCTCCTTTTAAGCTTCAGTTTGGGTTGTTCTCAGGTCCTTCTCTCATACCATCTCATGTACCATCTATACAGATAGGCTCTATACAGATGCCTCTTCATTTGCATCCCCAGATTACTCCATCTATAATTCAGATGCATTCATCACAACCCCCTTTATTCCAGTTTGGGCAGCTTAGGTACCCGTCTTCTGTCCCCCAAGGTGTACTGCCTTTGCCTCCTCAACCGCCAACATTTATTCTGCCCATTGTTCAGACTGGTTTTCCTTCAAATGAAAATCCAAGGGAAGCTCTGTCTGTTCAAACTCCTGAGGAAACTTGTTTTA
ACAAGTCCCGTAAACATAATGTGTTTCCTTTTATGAAGGATAATCAGGTCCTTGTGTCAAGATCTCTGAATGTGAATTCATCAGGCGATTCAAAGTCATTACCTTTAACAGAAAGTATAGAAGCCAAAGTTATGAATCAGCAAGATCAAACTTCAAGTTCTTGCATAGACGAGAGCAATTCCAGATCTCAACCAGGTTTTCAAGCGGAACATCAGAGGCACCATGTTCCAACTTCAGGCAATCATTACATGGTATCAAGGGGAAAAGAATCTAAAGGTCGAGCTCAGGATGGGATGTGGCCATTTGATTCTGTTTCAAGAGATAAGGGTTTGAGGAGGTTTAAAACGCGTGGCCTATTTCCTGGTGGAAGAGGGAAAAAATTTATCTTTGCAGTAAAAAATTCTGGATCTAGATTGCCCTTTGCAGGTTCTGAATCTACTCGCTTAGATAATGGTGGACTTCAGAGGCAGCCTAGGCGCAATACTCCACGTACTGAGTTTCGTGTTAGGGAAACTGTGGATAAAAAATTGCCGAACAGTCAAGTTTCTTCAAACTACGTAGAGGTAGATGATAAGCCAACTGATAGTGGAATAAGTGCGGTCAATTCGGGCAGAAATGGAACTAGGAAGGTTGTCATATCTAATAAGCCGTCAAAAAGAGCGCTAGAGTCTGAAGGATTTAGCTCTGTTGTGAGTACTTCTCTAGAGCTTGATTTTGGTAATAGATCTGAAAAGGGAGTGAAAAAAGATTATTTGGGCAAGAGCCAGGGAAACCAATATTCGGGAGAGGGTATCTTCAGAAAGAATATTGGTTCTTGGGAGGATGTTGACGCTCCTTTGCAGAGTGGGATCATACGTGTATTTGAGCAACCTGGCATAGAGGCCCCCAGTGATGAGGACGATTTCATTGAGGTACGATCTAAAAGGCAGATGCTGAATGATAGGCGTGAACAAAGAGAGAAAGAGATCAAGGCAAAGTCCCACAATTCTAAGATCCCACCAAAAAGTCGATCTACTTTGAAAAGTGCATTATCCTCCGTCAATTCAAGGAAGGTTTATGCCGCTAAGGAAGCAGAAATGGTGAAGAGAACTCGATTGGATTTTGCTACTCGTGATGGACGTGGTTCAGGAAGTATTATGGTGTCAAGTGCATTTAGTTCTCCATTAGTCTCTCAACCATTGGCCCCAATCGGTACTCCTGCTCTGAAGTCTGATTCCCAGATGGAAAGGTTGCATACTGCTAGGCCTATACTGACAAGTACCCCAGCTTTGGCAACCAGTAACGGAAGAAATCTCGAGTCAGGCTTGATGTTTGGCAAGAAGAACGATATTTTGGATAATGTACAAACATCTTTGACTTCTTGGGGCAATTCTCATAGAAATCAACAGGTTATGGCCCTGACACAAACCCAGCTTGATGAGGCTATGAAGCCTGCTCAGTTTGATTTACATCCCCCGGTAGATCATTCTAGTTTAACTGGTGATCCTTTTATTGTGTCATCACCATCGATTTTGGCAAGCGATAGGTCATTTTCTTCTGCTGCTAATCCAATCAGTTCTCTGCTGGCTGGGGAGAAAATTCAGTTTGGTGCAGTCACATCTCCAACAGTTCTTCCACCTGGTAGCTGTGCCACTTTTCTTGGTATTGGCCCCACAGATCTCTGTCACTCCGACATCCAAATTCCTCACAAACTTTCCGGTGCTGAGAATGATTGTAATCTATTCTTTGATAAAGAGAAACATTACTCTGAATCTTGTTCTCCTATTGAAAATAGTGAAGCTGAAGCTGAGGCAGCTGCTTCTGCCGTTGCTGTTGCAGCTATCAGTAGTGATGAGATACTAGTTACGAATGGGCTTGACACATGCTCTGTTTCTGTTACTGATACCAATAATATTGGTGGTGGGGATGTTGACATTGTAACAGCAGGTACAGCTGGTGATCAGCAATTAGCTAGCAAAACAAGGGCCGATGACTCCCTAACTGTCACACTGCCTGCAGAT
TTGTCTGTTGAGACTCCCCCGATTTCCCTGTGGCCAACTTTGCCAAGTCCACAGAATTCTTCAAGCCAGATGCTTTCACATTTCCCCGGAGGTTCACCTTCTCAATTTCCCTTTTACGAGATAAATCCTATGCTGGGAGGCCCTGTCTTTACTTTTGGACCCCATGACGAGTCTGTATCCACCACCCAAGCTCAAACTCAAAAATGTAGTGCACCGGCACCTGGCCCTCTTGGATCCTGGAAACAGTGCCATTCTGGTGTAGATTCATTCTATGGGCCTCCTGCTAGTTTTACCAGTCCATTTGTAAGTCCTGGAGGCATCCCGGGGGTTCAAGGTCCTCCGCACATGGTTGTGTACAATCACTTCGCTCCTGTTGGACAGTTTGGGCAGGTTGGCTTGAGTTTCATGGGGGCGACGTATATTCCTTCTGGAAAACAGCATGACTGGAAGCACAGCCCTGGACCTTCTTTGTGCGTTGAAGATCAGAAAAATTTAAATATGGTTTCGGCTCAACGCATGCCCACCAACTTACCTCCAATTCAGCATCTTGCTCCTGGATCACCCCTGCTGCCGATGGCTTCTCCATTAGCTATGTTTGATGTTTCTCCATTCCAGGCCTCTCCGGAAATTTCTGCTCAAACTCGTTGGCCCTCTTCAGCATCAACTGTTCAGGCTGTGCCTCTTTCCATGCCTTTGCAGCATCAGGCTGGGGGCGTACTTCCTTCTCATTTTAGTCATCCGTCATCTGCTGACCCATCATTTACAGTCAACAGGTTTCCTGGATCACAACCCGCAGACCAGAAGCGTAATTTTCCTGTGGGGGCTGATGCATCTGTCACCCAACTTCCAGATGAACTTGAAATAGTTGATGAATCTAGTTGCCTTAGTTCTGGGGCTGGAGTGCCTAATGTTGACACTAACAGCTTAGTGGTTAACTCGGCTACTGATGCTGGCAAGACTAGTGTTCGTCCGAATTGCAGTAGCAACAACAGTGGCCAGAATGCAAGCACCAATTTAAAATCTCAGTATCCTCATAAGGGCATATCTGCCCATCAATACACTCATTCTTCTGGTTACAATTATCAGAGAGGTGGTGCTTCTCAAAAGAATAGTTCAGGTGGCAGCGAATGGGCCCACCGGCGAACACGGTTCATGGGAAGGAACCAGTCTGGAGCTGAAAAGAACTTTTCCTCTGGAAAGGTGAAGCAAATTTATGTTGCCAAGCAATCATCAAGCAGAAATCTCGGGAGTTTAGAAAAGGGAGCTGCCTAGATTTCTGATTTGCTTCCATTGAACGGATTGTTTCGGATCCAGAAAACAAAGTTTTGGTTGGCTTTTCTTTTTGTGTTTACTTTACATTAGTAAAGTGTTGGAATTAGTCGATCTGTAGAGACCATGAACGCTGGTGGCGATGATGCCATTCCAGACGAAAGGGATTATTAAGTTTCATTCTTCCAAGCGATGTGCCAGGACAGTGGGAGTCGTCAATCATAGTATGAAGTTTTTGTCGACCTCGAAAAGTTTTATTATTTATTTGGACTGATGTCACATTTGCAGTGCAATCAGATGATCAAATTGGGGGCCTGTCGCCTAATTCTAATGCCATTTGATGTCTTTCTATTTTTATTTTCTGTGCATCTAATTCTGTGTTTCCAAGAATGGATGGCAAAGCCAAGTCTGTTTAGCATATTGGAAACTCCTCAAAAACTATGAACATACTGTTTTCCTCTCATCTCCCAGAAGTTACGGTAAATTTAACGTAAACGTTATTGTTGTTGCCCATATTTTCTGATTTAAAATTTTGATGATTTGATATGGGATGTTTTAGTAGAGGTGGTTTCTTTTCCCTTTTGCCTTTACCGTGTTGAAATACTCTGTACCGAGGGACAGCCTTATTACTTCGTTTATGGCGTATCCAGGAAAGTAGTCTTGTTTTGTTATTGATTTGCCTAGCTAACGAAAAAGTGATGAAAC
Coding sequence (CDS)
ATGGCTAATCCTGGCGTTGGGTCCAAGTTTGTGTCCGTGAATCTCAATAAATCCTATGGACAGGCTCATCATCATTCCAACTCTTATGGATCAAATCGAACCCGGCCTGGTACTCATGGGGCCGGAGGAGGAATGGTGGTCCTTTCGAGGCCTCGAAGTTCCCAGAAACCTGGGCCGAAGCTTTCTGTTCCACCCCCCTTGAATCTGCCTTCCTTGCGAAAGGAGCATCAGAGACTTGATTCTTTAGGTTCATCTGCTGGCCCAACGGGTGGAGGGGTTATGGGAAACGGGCAGAGGCCAACTTCAGCTGGCATCGGTTGGACAAAGCCAAGGACCAACGATTTTCCGGACAAAGAAGCACTTAATGGTAATGTAGTCGACAGAATTGATCCATCTCTGCGAAGTGTCGATGGGGTGAGTGGTGGGAGCAGTGTGTATATGCCTCCTTCGGCCCGTGCTGGTATAACACAGCCCGATTGCGTCTCCAGTTGCTTTCTCTCGGTGCATACTACACTTGAAAAGACCCCAATTTTGGGAGGCGAGGACTTCCCTTCTTTGCAAGCAACGTTACCATCTGCCGTTGCGCCTCCTCAGAAACAAAAAGATGGGATGAATTCTAAATTGAAGCATGCATCTGAAGGTTCATATGAAGAACGGAGGGATACTTCTCATTTAAGTTCAAGTATAGATGCCCGCACAAAATATCGGTCATCACTTAAAAGTCCTCCTAGGGAAAATTCAAAAAATGGAGATTTTTTCAGTTCACCAGAGTTTTCTCGGAAACAGGAAGATATTTTCCCCGATCCTTTACCACTCGTCTCAGTGAATCCAAGATCAGACTGGGCTGATGATGAACGTGATACAAGTCAAGGTTTGGCTGGCATGGGAAGGGACCGAGGCCACCCTAAGAGTGAGGCTTATTGGGAGAGGGACTTTGATATGCCCCGGGTTAGTTCTCTTCCCCACAAGCCCATTCCTAATTTTTCTCAGAAATGGAATATGCGGGATAATGAATCTGGAAAGTTTCGCTCCAGTGACATTCGTAAAGTGGACCCTTATGGTCGAGATGCAAGAACACCTAGTAGAGAAGGCTGGGAAGGAAGCTTCCAAAAAAACAATCCTATTCCTAAAGATGGATTTCATTCAGATAGTGGAAATCCTAGAAATGATATTGCAGCGAGGCCCACTAGCATTGATCGGGAAGCAAATGCCGGTAACATGCATGTTTCACATTTTCAAGAACATGCTCATAAAGATGGGAGGAAGGATACTGGATTTGGACAGACAGGGCGCCAAACATGGAATCGTGCAACAGAATCCTATAGCTCACAGGAACCAGATCGAACTACAAGAGACAAGTATGGTAGTGAGCAACATAATAGGTACAAGGGTGAAATACACAATACTTCAGTTGCAAATTCATCATACTATTCAGGTTCAAAACAAATTCCTACAGATGAACCATTGCTGAATTTTGGTAAGGACAGACGTTCTTTTACAAAGAATGAGAAACCCTTCATGGAAGATCCTTTTATGAAAGATTTTGGTGCTGATCCTTTTACTGCTGGTCTTGTTGGGATGTTTAAGAAGAAGGATGTGATTAAGCATACTGATTTTCATGACCCTGTTAGGGAGTCCTTTGAGGCTGAGCTTGAGAGAGTTCAACAGATGCAAGAACAGGAACGACAAAGAATTATTGAGGAGCAAGAAAGAGCTTTGGAACTGTCTAGAAGAGAAGAAGAAGAGAGAAAGAGGATAGCAAGGGAACATGAAGAAAGGCAGAGAAGAGCTGAAGAAGAAGCTAGAGAAGTAGCATGGAGAGCTGAACAAGAACGACTGGAGAATATCCAAAAGGCTGAAGAACTTCGAATAGCTAGAGAGAGAGATAAACAAAGGATTATTTTGGAGGAAGAGAGAAGAAAGCAAGCTGCTAAGCTAAAGCTTTTAGAAGTAGAGGAAAGGATGGCCAAGAGGCAGGCTGAAGCTGTGAAATCAAGCAG
TTTGACTTCGGATATTCCTGAAAAGAATATTTCCAATGTTGCGAAGGATGTTTCCAGGGTGGCGGACACTATTGATTGGGAAGATGGTGAAAAAATGGTGGAGCGAATCACTACATCAGCTTCTTCTGAGTCATCTAGTATGGTTAGGTCCTCTGAGGTAGGCCCTAGATCTCAATTTTCTAGAGATGCTTCTTCTGCCTTTGTGGACAGAGGGAAATTGGTTAATTCATATAGAAGAGATCTTTATGAGAGAAGAAGTGGCTCTCAATTCGTTCTACAAGGCCAGAGTATTGGCTACAACAGTCAAAGGCAAGAGCCATTTGTTGGTGGGCAATTATCTTCTAGAAAAGAGTTTTATGGGGGAGCTGGGTTTACAACTTCTAGGATATCTCATGGAAGAGGTATTACAAAACCGCAATCTGATGATTATTCTGAGCTAAGAGTGCAGAGACCCAACCTTTCTGGGAGTGGTGATCATTATAACAAGAGCCAAGAGTTTGACTCAGAATTACAGGATAGTTTCGAGAATTTCGGTGATCATGGATGGAGGCAAGAGGGTGGTCACAACAACGTCTATTTTCCTTACCCTGAACGAGTAAATTCGATTTCTGAGCCTGATGGGTCCTACTATGTTGGAAGGACACGCTATTCCCAGAAGCAACCTCGAGTTCTTCCTCCTCCATCTGTAGCTTTGATGCAGAAATCTTCTATCAGGGGTGAATATGAATCTGTTACTCGGGATACTCATCCGGTAAGGAATGTTTCTACTGCTCAGGCAAGGTATATTCATCATGAAAATTGTACCCTATCAAAGATAATTGACGTTAATTTTGAAAATGTTGAGAATGAAGAGCAGAAACCAGATGGTGGCATAACTCTGCGGTGTGACTCACAGTCAACTCTGTCTGTGTTTAGCCCCCCAATCTCTCCAACTCATCTATCTCACGAGGACTTGGATGATTCTGGAGATTCACCTGTTTTATCTGCTAGCAGAGAAGGCACATTGTCGATAGAGGATAATGAATCTGCAGTACCTGCCAAGGCTAGTAAAGTGATCATGATGACCTCTACTAGAGTATCTACAGGTGATGAAGATGATTGGGTTGTTGCAGACGAGCATGTTCAAGAACAAGAAGAATATGATGAAGATGATGATGGATATCAGGAAGAAGACGAAGTTCATGAAGGGGAGGATGAGAATATTGACCTTGAACAAGATTTTGATGATTTGCAGTTAAATGATACTGATAGAGGGTCACCCCACATGTTAGATAACTTGGTATTAGGTTTTAATGAAGGCGTTGAAGTTGTGATGCCAAATGATGAGTTTGAAAGAATTCCTGGAGATGAAGAAAATATGTATGTTGCAACAGAAATTTCAAGTTGCATCAAAGAAGAACAGGGGTCTTCTGAAGGATTGCAAGTTGATGGTAAAGTCTGTCAATATGTGGATGCTTCTTCTCAAGTAAGGATTTCTGATCTTGAGGAGATGCAGGACAAGGTTATGCAATCGAAAATTGCCCAAGCATTACCAGAACTTGAAATTACCGAGCAAGGAAATTCCTGTAGATCTAGTTTGTCTGTTCAACAACCAATCTCATGTTCAGTTACAATGGGCTCACAATCTTCATCTGGTCAAGTTATTGTGCCGAATGCTGTCCTTTCAGGTCAAGCCGAGCCTCCTTTTAAGCTTCAGTTTGGGTTGTTCTCAGGTCCTTCTCTCATACCATCTCATGTACCATCTATACAGATAGGCTCTATACAGATGCCTCTTCATTTGCATCCCCAGATTACTCCATCTATAATTCAGATGCATTCATCACAACCCCCTTTATTCCAGTTTGGGCAGCTTAGGTACCCGTCTTCTGTCCCCCAAGGTGTACTGCCTTTGCCTCCTCAACCGCCAACATTTATTCTGCCCATTGTTCAGACTGGTTTTCCTTCAAATGAAAATCCAAGGGAAGCTCTGTCTGTTCAAACTCCTGAGGAAACTTGTTTTA
ACAAGTCCCGTAAACATAATGTGTTTCCTTTTATGAAGGATAATCAGGTCCTTGTGTCAAGATCTCTGAATGTGAATTCATCAGGCGATTCAAAGTCATTACCTTTAACAGAAAGTATAGAAGCCAAAGTTATGAATCAGCAAGATCAAACTTCAAGTTCTTGCATAGACGAGAGCAATTCCAGATCTCAACCAGGTTTTCAAGCGGAACATCAGAGGCACCATGTTCCAACTTCAGGCAATCATTACATGGTATCAAGGGGAAAAGAATCTAAAGGTCGAGCTCAGGATGGGATGTGGCCATTTGATTCTGTTTCAAGAGATAAGGGTTTGAGGAGGTTTAAAACGCGTGGCCTATTTCCTGGTGGAAGAGGGAAAAAATTTATCTTTGCAGTAAAAAATTCTGGATCTAGATTGCCCTTTGCAGGTTCTGAATCTACTCGCTTAGATAATGGTGGACTTCAGAGGCAGCCTAGGCGCAATACTCCACGTACTGAGTTTCGTGTTAGGGAAACTGTGGATAAAAAATTGCCGAACAGTCAAGTTTCTTCAAACTACGTAGAGGTAGATGATAAGCCAACTGATAGTGGAATAAGTGCGGTCAATTCGGGCAGAAATGGAACTAGGAAGGTTGTCATATCTAATAAGCCGTCAAAAAGAGCGCTAGAGTCTGAAGGATTTAGCTCTGTTGTGAGTACTTCTCTAGAGCTTGATTTTGGTAATAGATCTGAAAAGGGAGTGAAAAAAGATTATTTGGGCAAGAGCCAGGGAAACCAATATTCGGGAGAGGGTATCTTCAGAAAGAATATTGGTTCTTGGGAGGATGTTGACGCTCCTTTGCAGAGTGGGATCATACGTGTATTTGAGCAACCTGGCATAGAGGCCCCCAGTGATGAGGACGATTTCATTGAGGTACGATCTAAAAGGCAGATGCTGAATGATAGGCGTGAACAAAGAGAGAAAGAGATCAAGGCAAAGTCCCACAATTCTAAGATCCCACCAAAAAGTCGATCTACTTTGAAAAGTGCATTATCCTCCGTCAATTCAAGGAAGGTTTATGCCGCTAAGGAAGCAGAAATGGTGAAGAGAACTCGATTGGATTTTGCTACTCGTGATGGACGTGGTTCAGGAAGTATTATGGTGTCAAGTGCATTTAGTTCTCCATTAGTCTCTCAACCATTGGCCCCAATCGGTACTCCTGCTCTGAAGTCTGATTCCCAGATGGAAAGGTTGCATACTGCTAGGCCTATACTGACAAGTACCCCAGCTTTGGCAACCAGTAACGGAAGAAATCTCGAGTCAGGCTTGATGTTTGGCAAGAAGAACGATATTTTGGATAATGTACAAACATCTTTGACTTCTTGGGGCAATTCTCATAGAAATCAACAGGTTATGGCCCTGACACAAACCCAGCTTGATGAGGCTATGAAGCCTGCTCAGTTTGATTTACATCCCCCGGTAGATCATTCTAGTTTAACTGGTGATCCTTTTATTGTGTCATCACCATCGATTTTGGCAAGCGATAGGTCATTTTCTTCTGCTGCTAATCCAATCAGTTCTCTGCTGGCTGGGGAGAAAATTCAGTTTGGTGCAGTCACATCTCCAACAGTTCTTCCACCTGGTAGCTGTGCCACTTTTCTTGGTATTGGCCCCACAGATCTCTGTCACTCCGACATCCAAATTCCTCACAAACTTTCCGGTGCTGAGAATGATTGTAATCTATTCTTTGATAAAGAGAAACATTACTCTGAATCTTGTTCTCCTATTGAAAATAGTGAAGCTGAAGCTGAGGCAGCTGCTTCTGCCGTTGCTGTTGCAGCTATCAGTAGTGATGAGATACTAGTTACGAATGGGCTTGACACATGCTCTGTTTCTGTTACTGATACCAATAATATTGGTGGTGGGGATGTTGACATTGTAACAGCAGGTACAGCTGGTGATCAGCAATTAGCTAGCAAAACAAGGGCCGATGACTCCCTAACTGTCACACTGCCTGCAGAT
TTGTCTGTTGAGACTCCCCCGATTTCCCTGTGGCCAACTTTGCCAAGTCCACAGAATTCTTCAAGCCAGATGCTTTCACATTTCCCCGGAGGTTCACCTTCTCAATTTCCCTTTTACGAGATAAATCCTATGCTGGGAGGCCCTGTCTTTACTTTTGGACCCCATGACGAGTCTGTATCCACCACCCAAGCTCAAACTCAAAAATGTAGTGCACCGGCACCTGGCCCTCTTGGATCCTGGAAACAGTGCCATTCTGGTGTAGATTCATTCTATGGGCCTCCTGCTAGTTTTACCAGTCCATTTGTAAGTCCTGGAGGCATCCCGGGGGTTCAAGGTCCTCCGCACATGGTTGTGTACAATCACTTCGCTCCTGTTGGACAGTTTGGGCAGGTTGGCTTGAGTTTCATGGGGGCGACGTATATTCCTTCTGGAAAACAGCATGACTGGAAGCACAGCCCTGGACCTTCTTTGTGCGTTGAAGATCAGAAAAATTTAAATATGGTTTCGGCTCAACGCATGCCCACCAACTTACCTCCAATTCAGCATCTTGCTCCTGGATCACCCCTGCTGCCGATGGCTTCTCCATTAGCTATGTTTGATGTTTCTCCATTCCAGGCCTCTCCGGAAATTTCTGCTCAAACTCGTTGGCCCTCTTCAGCATCAACTGTTCAGGCTGTGCCTCTTTCCATGCCTTTGCAGCATCAGGCTGGGGGCGTACTTCCTTCTCATTTTAGTCATCCGTCATCTGCTGACCCATCATTTACAGTCAACAGGTTTCCTGGATCACAACCCGCAGACCAGAAGCGTAATTTTCCTGTGGGGGCTGATGCATCTGTCACCCAACTTCCAGATGAACTTGAAATAGTTGATGAATCTAGTTGCCTTAGTTCTGGGGCTGGAGTGCCTAATGTTGACACTAACAGCTTAGTGGTTAACTCGGCTACTGATGCTGGCAAGACTAGTGTTCGTCCGAATTGCAGTAGCAACAACAGTGGCCAGAATGCAAGCACCAATTTAAAATCTCAGTATCCTCATAAGGGCATATCTGCCCATCAATACACTCATTCTTCTGGTTACAATTATCAGAGAGGTGGTGCTTCTCAAAAGAATAGTTCAGGTGGCAGCGAATGGGCCCACCGGCGAACACGGTTCATGGGAAGGAACCAGTCTGGAGCTGAAAAGAACTTTTCCTCTGGAAAGGTGAAGCAAATTTATGTTGCCAAGCAATCATCAAGCAGAAATCTCGGGAGTTTAGAAAAGGGAGCTGCCTAG
Protein sequence
MANPGVGSKFVSVNLNKSYGQAHHHSNSYGSNRTRPGTHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHQRLDSLGSSAGPTGGGVMGNGQRPTSAGIGWTKPRTNDFPDKEALNGNVVDRIDPSLRSVDGVSGGSSVYMPPSARAGITQPDCVSSCFLSVHTTLEKTPILGGEDFPSLQATLPSAVAPPQKQKDGMNSKLKHASEGSYEERRDTSHLSSSIDARTKYRSSLKSPPRENSKNGDFFSSPEFSRKQEDIFPDPLPLVSVNPRSDWADDERDTSQGLAGMGRDRGHPKSEAYWERDFDMPRVSSLPHKPIPNFSQKWNMRDNESGKFRSSDIRKVDPYGRDARTPSREGWEGSFQKNNPIPKDGFHSDSGNPRNDIAARPTSIDREANAGNMHVSHFQEHAHKDGRKDTGFGQTGRQTWNRATESYSSQEPDRTTRDKYGSEQHNRYKGEIHNTSVANSSYYSGSKQIPTDEPLLNFGKDRRSFTKNEKPFMEDPFMKDFGADPFTAGLVGMFKKKDVIKHTDFHDPVRESFEAELERVQQMQEQERQRIIEEQERALELSRREEEERKRIAREHEERQRRAEEEAREVAWRAEQERLENIQKAEELRIARERDKQRIILEEERRKQAAKLKLLEVEERMAKRQAEAVKSSSLTSDIPEKNISNVAKDVSRVADTIDWEDGEKMVERITTSASSESSSMVRSSEVGPRSQFSRDASSAFVDRGKLVNSYRRDLYERRSGSQFVLQGQSIGYNSQRQEPFVGGQLSSRKEFYGGAGFTTSRISHGRGITKPQSDDYSELRVQRPNLSGSGDHYNKSQEFDSELQDSFENFGDHGWRQEGGHNNVYFPYPERVNSISEPDGSYYVGRTRYSQKQPRVLPPPSVALMQKSSIRGEYESVTRDTHPVRNVSTAQARYIHHENCTLSKIIDVNFENVENEEQKPDGGITLRCDSQSTLSVFSPPISPTHLSHEDLDDSGDSPVLSASREGTLSIEDNESAVPAKASKVIMMTSTRVSTGDEDDWVVADEHVQEQEEYDEDDDGYQEEDEVHEGEDENIDLEQDFDDLQLNDTDRGSPHMLDNLVLGFNEGVEVVMPNDEFERIPGDEENMYVATEISSCIKEEQGSSEGLQVDGKVCQYVDASSQVRISDLEEMQDKVMQSKIAQALPELEITEQGNSCRSSLSVQQPISCSVTMGSQSSSGQVIVPNAVLSGQAEPPFKLQFGLFSGPSLIPSHVPSIQIGSIQMPLHLHPQITPSIIQMHSSQPPLFQFGQLRYPSSVPQGVLPLPPQPPTFILPIVQTGFPSNENPREALSVQTPEETCFNKSRKHNVFPFMKDNQVLVSRSLNVNSSGDSKSLPLTESIEAKVMNQQDQTSSSCIDESNSRSQPGFQAEHQRHHVPTSGNHYMVSRGKESKGRAQDGMWPFDSVSRDKGLRRFKTRGLFPGGRGKKFIFAVKNSGSRLPFAGSESTRLDNGGLQRQPRRNTPRTEFRVRETVDKKLPNSQVSSNYVEVDDKPTDSGISAVNSGRNGTRKVVISNKPSKRALESEGFSSVVSTSLELDFGNRSEKGVKKDYLGKSQGNQYSGEGIFRKNIGSWEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREKEIKAKSHNSKIPPKSRSTLKSALSSVNSRKVYAAKEAEMVKRTRLDFATRDGRGSGSIMVSSAFSSPLVSQPLAPIGTPALKSDSQMERLHTARPILTSTPALATSNGRNLESGLMFGKKNDILDNVQTSLTSWGNSHRNQQVMALTQTQLDEAMKPAQFDLHPPVDHSSLTGDPFIVSSPSILASDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCATFLGIGPTDLCHSDIQIPHKLSGAENDCNLFFDKEKHYSESCSPIENSEAEAEAAASAVAVAAISSDEILVTNGLDTCSVSVTDTNNIGGGDVDIVTAGTAGDQQLASKTRADDSLTVTLPAD
LSVETPPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQKCSAPAPGPLGSWKQCHSGVDSFYGPPASFTSPFVSPGGIPGVQGPPHMVVYNHFAPVGQFGQVGLSFMGATYIPSGKQHDWKHSPGPSLCVEDQKNLNMVSAQRMPTNLPPIQHLAPGSPLLPMASPLAMFDVSPFQASPEISAQTRWPSSASTVQAVPLSMPLQHQAGGVLPSHFSHPSSADPSFTVNRFPGSQPADQKRNFPVGADASVTQLPDELEIVDESSCLSSGAGVPNVDTNSLVVNSATDAGKTSVRPNCSSNNSGQNASTNLKSQYPHKGISAHQYTHSSGYNYQRGGASQKNSSGGSEWAHRRTRFMGRNQSGAEKNFSSGKVKQIYVAKQSSSRNLGSLEKGAA
Homology
BLAST of Sed0025145 vs. NCBI nr
Match:
XP_038883483.1 (uncharacterized protein LOC120074436 [Benincasa hispida])
HSP 1 Score: 3696.4 bits (9584), Expect = 0.0e+00
Identity = 1989/2453 (81.08%), Positives = 2138/2453 (87.16%), Query Frame = 0
Query: 1 MANPGVGSKFVSVNLNKSYGQAHH-----HSNSYGSNRTRPGTHGAGGGMVVLSRPRSSQ 60
MANPGVG+KFVSVNLNKSYGQAHH HSNSYGSNRTRPG HGAGGGMVVLSRPRSSQ
Sbjct: 1 MANPGVGTKFVSVNLNKSYGQAHHHHHSSHSNSYGSNRTRPGGHGAGGGMVVLSRPRSSQ 60
Query: 61 KPGPKLSVPPPLNLPSLRKEHQRLDSLGSSAGPTGGGVMGNGQRPTSAGIGWTKPRTNDF 120
KPGPKLSVPPPLNLPSLRKEH+RLDSLGS AGPTGGGV+GNGQRPTSAG+GWTKPRTND
Sbjct: 61 KPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPTGGGVLGNGQRPTSAGMGWTKPRTNDL 120
Query: 121 PDKEALNGNVVDRIDPSLRSVDGVSGGSSVYMPPSARAGITQPDCVSSCFLSVHTTLEKT 180
P+KE L+ N+VD+IDPSLRSVDGVSGGSSVYMPPSARAG+T P +S V T +EK
Sbjct: 121 PEKEGLSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGMTGPVVSTSASSQVLTAVEKA 180
Query: 181 PILGGEDFPSLQATLPSAVAPPQKQKDGMNSKLKHASEGSYEERRDTSHLSSSIDARTKY 240
P+L GEDFPSLQATLPSA AP QKQ+DG++SKLKHA EG YEE+RDTSHLSS IDA +K+
Sbjct: 181 PVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAPEGLYEEQRDTSHLSSRIDAHSKF 240
Query: 241 RSSLKSPPRENSKNGDFF-----SSPEFSRKQEDIFPDPLPLVSVNPRSDWADDERDTSQ 300
+SS +S P EN+KNG+ F SPE S KQ+DIFP PLPLVS+NPRSDWADDERDTS
Sbjct: 241 QSSQESIPSENAKNGNSFGSGSLQSPELSWKQDDIFPGPLPLVSMNPRSDWADDERDTSH 300
Query: 301 GLAGMGRDRGHPKSEAYWERDFDMPRVSSLPHKPIPNFSQKWNMRDNESGKFRSSDIRKV 360
GL RDRGHPKSEAYWERDFDMPRVSSLPHK NFSQ+WN+RD+ESGKF SSDI K+
Sbjct: 301 GLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKHTHNFSQRWNLRDDESGKFHSSDIHKL 360
Query: 361 DPYGRDARTPSREGWEGSFQKNNPIPKDGFHSDSGNPRNDIAARPTSIDREANAGNMHVS 420
DPYGRDART SREGWEG+F++NNPIPKDGF SDSGN RNDIA RPTSIDRE NA NMHVS
Sbjct: 361 DPYGRDARTASREGWEGNFRRNNPIPKDGFGSDSGNDRNDIAGRPTSIDRETNADNMHVS 420
Query: 421 HFQEHAHKDGRKDTGFGQTGRQTWNRATESYSSQEPDRTTRDKYGSEQHNRYKGEIHNTS 480
HF+EH +KDGR+DTGFGQ GRQTWN ATESYSSQEPDRT RDKY SEQHNRY+GE HNTS
Sbjct: 421 HFREHVNKDGRRDTGFGQNGRQTWNSATESYSSQEPDRTVRDKYVSEQHNRYRGETHNTS 480
Query: 481 VANSSYYSGSKQIPTDEPLLNFGKDRRSFTKNEKPFMEDPFMKDFGA------DPFTAGL 540
VANSSY + K+IP DEPLLNFG+DRRSF K EKP+MEDPFMKDFGA DPFTAGL
Sbjct: 481 VANSSYSTSLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDPFTAGL 540
Query: 541 VGMFK-KKDVIKHTDFHDPVRESFEAELERVQQMQEQERQRIIEEQERALELSRREEEER 600
VG+ K KKDVIK TDFHDPVRESFEAELERVQQ+QEQERQRIIEEQERALEL+RREEEER
Sbjct: 541 VGVVKRKKDVIKQTDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREEEER 600
Query: 601 KRIAREHEERQRRAEEEAREVAWRAEQERLENIQKAEELRIARERDKQRIILEEERRKQA 660
+R+AREHEERQRRAEEEARE AWRAEQERLE IQKAEELR+ARE +KQRI+LEEERRKQA
Sbjct: 601 QRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRMAREEEKQRILLEEERRKQA 660
Query: 661 AKLKLLEVEERMAKRQAEAVKSSSLTSDIPEKNISNVAKDVSRVADTIDWEDGEKMVERI 720
AKLKLLE+EERMAKRQAE VKSS+LTSDIPEK I +V KDVSR+ADT+DWEDGEKMVERI
Sbjct: 661 AKLKLLELEERMAKRQAEVVKSSTLTSDIPEKKIPSVVKDVSRLADTVDWEDGEKMVERI 720
Query: 721 TTSASSESSSMVRSSEVGPRSQFSRDASSAFVDRGKLVNSYRRDLYERRSGSQFVLQGQS 780
TTSASSESSS+ RSSEVG RSQFS D S +FVDRGK +NS+RRD YER SGSQFVLQ QS
Sbjct: 721 TTSASSESSSINRSSEVGFRSQFSTDGSPSFVDRGKSINSWRRDFYERGSGSQFVLQDQS 780
Query: 781 IGYNS-QRQEPFVGGQLSSRKEFYGGAGFTTSRISHGRGITKPQSDDYSELRVQRPNLSG 840
GYN+ R+E GG++SSRKEFYGGAGFTTSR SH RGIT+PQSD+YS+LR QRPNLSG
Sbjct: 781 TGYNNGPRREASTGGRVSSRKEFYGGAGFTTSRTSHRRGITEPQSDEYSQLRGQRPNLSG 840
Query: 841 SGDHYNKSQEFDSELQDSFENFGDHGWRQEGGHNNVYFPYPERVNSISEPDGSYYVGRTR 900
GDHYN+SQEFDSE QD+ EN+GDHGWRQE G NN YFPYPERVN ISE DGSY VGR+R
Sbjct: 841 GGDHYNRSQEFDSEFQDNVENYGDHGWRQESGRNNFYFPYPERVNPISEADGSYSVGRSR 900
Query: 901 YSQKQPRVLPPPSVALMQKSSIRGEYESVTRD--------THPVRNVSTAQARYIHHENC 960
YSQ+QPRVLPPPSVA +QKSS+RGEYESV RD HP N+ST+Q RYIHH+N
Sbjct: 901 YSQRQPRVLPPPSVASVQKSSVRGEYESVPRDIVESEIQYDHPAHNISTSQTRYIHHDNR 960
Query: 961 TLSKIIDVNFENVENEEQKPDGGITLRCDSQSTLSVFSPPISPTHLSHEDLDDSGDSPVL 1020
L +IIDVN EN ENEEQKPDG TLRCDSQSTLSVFSPP SPTHLSHEDLDDSGDSPVL
Sbjct: 961 ALPEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVL 1020
Query: 1021 SASREGTLSIEDNESAVPAKASKVIMMTSTRVSTGDEDDWVVADEHVQEQEEYDEDDDGY 1080
SASREGTLSIEDNESAVPAK+ K IM+TSTRVSTGDED+W V DEHVQEQEEYDEDDDGY
Sbjct: 1021 SASREGTLSIEDNESAVPAKSGKEIMITSTRVSTGDEDEWGVVDEHVQEQEEYDEDDDGY 1080
Query: 1081 QEEDEVHEGEDENIDLEQDFDDLQLNDTDRGSPHMLDNLVLGFNEGVEVVMPNDEFERIP 1140
QEEDEVHEGEDENIDL +DFDDL L+ D+GSPHMLDNLVLGFNEGVEV MPNDEFERIP
Sbjct: 1081 QEEDEVHEGEDENIDLVEDFDDLHLD--DKGSPHMLDNLVLGFNEGVEVGMPNDEFERIP 1140
Query: 1141 GDEENMYVATEISSCIKEEQGSSEGLQVDGKVCQYVDASSQVRISDLEEMQDKVMQSKIA 1200
G++ENMYVA EIS+ IKEEQGSSEGL VDGKVCQY DASSQ+RI D EEMQD VMQ A
Sbjct: 1141 GNDENMYVAPEISNGIKEEQGSSEGLPVDGKVCQYADASSQIRI-DPEEMQDLVMQPITA 1200
Query: 1201 QALPELEITEQGN-SCRSSLSVQQPISCSVTMGSQSSSGQVIVPNAVLSGQAEPPFKLQF 1260
QALPE EITEQGN SCRSS SVQQP M SQS SGQVIVPN +SGQAEPP KLQF
Sbjct: 1201 QALPESEITEQGNSSCRSSASVQQP------MASQSISGQVIVPNTAVSGQAEPPVKLQF 1260
Query: 1261 GLFSGPSLIPSHVPSIQIGSIQMPLHLHPQITPSIIQMHSSQPPLFQFGQLRYPSSVPQG 1320
GLFSGPSLIPS VP+IQIGSIQMPLHLHPQIT S+ MHSSQ PLFQFGQLRY SSV QG
Sbjct: 1261 GLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQTPLFQFGQLRYTSSVSQG 1320
Query: 1321 VLPLPPQPPTFILPIVQTGFPSNENPREALSVQTPEETCFNKSRKHNVFPFMKDNQV-LV 1380
VLPL PQP TF+ P VQTGFP N+NP +ALS+ +ETC + SRK++V PF+ DNQ LV
Sbjct: 1321 VLPLAPQPLTFVPPTVQTGFPLNKNPGDALSIHPSQETCVHNSRKNDVLPFLMDNQQGLV 1380
Query: 1381 SRSLNVNSSGDSKSLPLTESIEAKVMNQQDQTSSSCIDESNSRSQPGFQAEHQRHHVPTS 1440
SRSLNVN S +SKSLPLTES E+K+M QDQT+ SCIDESNSRS+PGFQAEHQRH V TS
Sbjct: 1381 SRSLNVNPSMESKSLPLTESTESKLMTPQDQTAGSCIDESNSRSEPGFQAEHQRHRVSTS 1440
Query: 1441 GNHYMVSRGKESKGRAQDGMWPFDSVSRDKGLRRFKTRGLFPGGRGKKFIFAVKNSGSRL 1500
N Y+VSRGKES+G+ QDGM FDSVSRDKGL K RG F GGRGKK+IF VKNSGSRL
Sbjct: 1441 DNQYVVSRGKESEGQGQDGMGSFDSVSRDKGLSGLKARGQFHGGRGKKYIFTVKNSGSRL 1500
Query: 1501 PFAGSESTRLDNGGLQRQPRRNTPRTEFRVRETVDKKLPNSQVSSNYVEVDDKPTDSGIS 1560
PF GSESTRLD GG QR+ RRN PRTEFRVRETVDKKL NSQVSSN+V VDDKPT SG +
Sbjct: 1501 PFPGSESTRLDTGGFQRRTRRNIPRTEFRVRETVDKKLSNSQVSSNHVGVDDKPTVSGRT 1560
Query: 1561 AVNSGRNGTRKVVISNKPSKRALESEGFSSVVSTSLELDFGNRSEKGVKKDYLGKSQGNQ 1620
V+S RNGTRKVVISNKPSKRALESEG SS STSLELD GNRS KGVKK+YLGKSQG+Q
Sbjct: 1561 VVHSARNGTRKVVISNKPSKRALESEGLSSGASTSLELDAGNRSAKGVKKEYLGKSQGSQ 1620
Query: 1621 YSGEGIFRKNIGSWEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQR 1680
Y GEG FRKNI S EDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQR
Sbjct: 1621 YPGEGNFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQR 1680
Query: 1681 EKEIKAKSHNSKIPPKSRSTLKSALSSVNSRKVYAAKEAEMVKRTRLDFATRD--GRGSG 1740
EKEIKAKSHNSKIP KSRST K+ALSSVNS KVYAAKEAE VKRTR DF D GRGSG
Sbjct: 1681 EKEIKAKSHNSKIPRKSRSTSKNALSSVNSSKVYAAKEAEPVKRTRSDFVAADGGGRGSG 1740
Query: 1741 SIMVSSAFSSPLVSQPLAPIGTPALKSDSQMERLHTARPILTSTPALATSNGRNLESGLM 1800
+I+VS+AFSSP+VSQPLAPIGTPALKSDSQ ER H AR I TS PALATS GRNL+S +M
Sbjct: 1741 NIVVSTAFSSPVVSQPLAPIGTPALKSDSQSERSHAARSIQTSGPALATSEGRNLDSSMM 1800
Query: 1801 FGKKNDILDNVQTSLTSWGNSHRNQQVMALTQTQLDEAMKPAQFDLHPPV-DHSSLTGDP 1860
F KK+DIL+NV +S TSWG S NQQVMALTQTQLDEAMKPAQFDLHPPV DHSSL GDP
Sbjct: 1801 FDKKDDILENVHSSFTSWGTSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSSLAGDP 1860
Query: 1861 FIVSSPSILASDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCATFLGIG-PTDLC 1920
V SPSILA DRSFSSAANPISSLLAGEKIQFGAVTSPTVL PGSC+T LGIG P+ LC
Sbjct: 1861 -NVPSPSILALDRSFSSAANPISSLLAGEKIQFGAVTSPTVLSPGSCSTLLGIGAPSSLC 1920
Query: 1921 HSDIQIPHKLSGAENDCNLFFDKEKHYSESCSPIENSEAEAEAAASAVAVAAISSDEILV 1980
HSDI IPHKLSGAENDC+LFF+KEKH+SESC+ IE+SEAEAEAAASAVAVAAISSDEI V
Sbjct: 1921 HSDIPIPHKLSGAENDCHLFFEKEKHHSESCTHIEDSEAEAEAAASAVAVAAISSDEI-V 1980
Query: 1981 TNGLDTCSVSVTDTNNIGGGDVDIVTAGTAGDQQLASKTRADDSLTVTLPADLSVETPPI 2040
NG+ TCSVSVTDTNN GGGD++++TAG+ GDQQLASKTRADDSLTV LPADLSVETPPI
Sbjct: 1981 ANGIGTCSVSVTDTNNFGGGDINVITAGSVGDQQLASKTRADDSLTVALPADLSVETPPI 2040
Query: 2041 SLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQK 2100
SLWPTLPSPQNSSSQ+LSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESV TTQAQTQK
Sbjct: 2041 SLWPTLPSPQNSSSQVLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQAQTQK 2100
Query: 2101 CSAPAPGPLGSWKQCHSGVDSFYGPPASFTSPFVSPGGIPGVQGPPHMVVYNHFAPVGQF 2160
SAPAPGPLGSWKQCHSGVDSFYGPP FT PF+SPGGIPGVQGPPHMVVYNHFAPVGQF
Sbjct: 2101 SSAPAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQF 2160
Query: 2161 GQVGLSFMGATYIPSGKQHDWKHSPGP-SLCVE-DQKNLNMVSAQRMPTNLPPIQHLAPG 2220
GQVGLSFMG TYIPSGKQHDWKHSPGP SL VE DQKNLNMVSAQRMPTNLPPIQHLAPG
Sbjct: 2161 GQVGLSFMGTTYIPSGKQHDWKHSPGPSSLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPG 2220
Query: 2221 SPLLPMASPLAMFDVSPFQASPEISAQTRWPSSASTVQAVPLSMPLQHQAGGVLPSHFSH 2280
SPLLPMASPLAMFDVSPFQASPE+S Q RWPSSAS+ Q VPLSMP+Q QA G+LPSHFSH
Sbjct: 2221 SPLLPMASPLAMFDVSPFQASPEMSVQARWPSSASSGQPVPLSMPMQQQAEGILPSHFSH 2280
Query: 2281 PSSADPSFTVNRFPGSQP---ADQKRNFPVGADASVTQLPDELEIVDESSCLSSGAGVPN 2340
SS+DP+FTVNRFPGSQP +D KRNFPV ADA+VTQLPDEL IVD SSC+SSGA VPN
Sbjct: 2281 ASSSDPTFTVNRFPGSQPSVASDHKRNFPVAADATVTQLPDELGIVDASSCVSSGASVPN 2340
Query: 2341 VDTNSLVVNSATDAGKTSVRPNCSSNNSGQNASTNLKSQYP-HKGISAHQYTHSSGYNYQ 2400
D N L VN TDAGKT V+ NCSS+NSGQNA TNLKSQ HKGISA QY HSSGYNYQ
Sbjct: 2341 ADINGLSVNLVTDAGKTGVQ-NCSSSNSGQNAGTNLKSQSSHHKGISAQQYGHSSGYNYQ 2400
Query: 2401 RGGASQKNSSGGSEWAHRRTRFMGRNQSGAEKNFSSGKVKQIYVAKQSSSRNL 2416
RGGASQKN SGGSEW HRRT FMGRNQSGAEKNFSS K+KQIYVAKQ S+ NL
Sbjct: 2401 RGGASQKNGSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSNGNL 2441
BLAST of Sed0025145 vs. NCBI nr
Match:
XP_022950041.1 (uncharacterized protein LOC111453246 [Cucurbita moschata] >XP_022950042.1 uncharacterized protein LOC111453246 [Cucurbita moschata] >XP_022950043.1 uncharacterized protein LOC111453246 [Cucurbita moschata])
HSP 1 Score: 3684.8 bits (9554), Expect = 0.0e+00
Identity = 1985/2449 (81.05%), Positives = 2138/2449 (87.30%), Query Frame = 0
Query: 1 MANPGVGSKFVSVNLNKSYGQAHH----HSNSYGSNRTRPGTHGAGGGMVVLSRPRSSQK 60
MANPGVG+KFVSVNLNKSYGQAHH HSNSYGSNRTRPG+HGAGGGMVVLSRPRSSQK
Sbjct: 1 MANPGVGAKFVSVNLNKSYGQAHHHHSSHSNSYGSNRTRPGSHGAGGGMVVLSRPRSSQK 60
Query: 61 PGPKLSVPPPLNLPSLRKEHQRLDSLGSSAGPTGGGVMGNGQRPTSAGIGWTKPRTNDFP 120
PGPKLSVPPPLNLPSLRKEH+RLDSLGS AG TGGGV+GN QRPTSAG+GWTKP TND P
Sbjct: 61 PGPKLSVPPPLNLPSLRKEHERLDSLGSGAGQTGGGVLGNRQRPTSAGLGWTKPHTNDLP 120
Query: 121 DKEALNGNVVDRIDPSLRSVDGVSGGSSVYMPPSARAGITQPDCVSSCFLSVHTTLEKTP 180
+KE L+GN+VD+IDPSLRSVDGV+GGSSVYMPPSARA P +S VHT +EK P
Sbjct: 121 EKEGLSGNIVDKIDPSLRSVDGVNGGSSVYMPPSARASTAGPVVSTSASSQVHTAVEKAP 180
Query: 181 ILGGEDFPSLQATLPSAVAPPQKQKDGMNSKLKHASEGSYEERRDTSHLSSSIDARTKYR 240
+L GEDFPSLQATLPSA AP QKQ+DG++SKLKHA+E SYEE+RDTSHLSSSIDAR+K++
Sbjct: 181 VLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEVSYEEQRDTSHLSSSIDARSKFQ 240
Query: 241 SSLKSPPRENSKNGDFFS-----SPEFSRKQEDIFPDPLPLVSVNPRSDWADDERDTSQG 300
SS KS P EN+KNG+ FS SPE SRKQEDIFP PLPLVS+NPRSDWADDERDTS G
Sbjct: 241 SSKKSIPSENAKNGNSFSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERDTSHG 300
Query: 301 LAGMGRDRGHPKSEAYWERDFDMPRVSSLPHKPIPNFSQKWNMRDNESGKFRSSDIRKVD 360
L RD GHPKSEAYWERDFDMP VSSLPHKPI NFSQ+W+ RD+ESGKF SSDI KVD
Sbjct: 301 LIDRVRDHGHPKSEAYWERDFDMPWVSSLPHKPIHNFSQRWHPRDDESGKFHSSDIHKVD 360
Query: 361 PYGRDARTPSREGWEGSFQKNNPIPKDGFHSDSGNPRNDIAARPTSIDREANAGNMHVSH 420
PYGRD RTPSREGWEG+FQKNNPIPKD F SDSGN RNDIA RPTSIDRE NA NMHVS
Sbjct: 361 PYGRDTRTPSREGWEGNFQKNNPIPKDRFGSDSGNDRNDIAGRPTSIDRETNADNMHVSQ 420
Query: 421 FQEHAHKDGRKDTGFGQTGRQTWNRATESYSSQEPDRTTRDKYGSEQHNRYKGEIHNTSV 480
F+EHA K GR+DTGF GRQTWN A+ESY+SQ+PD T +DK+GSEQHN+++G+ HNTSV
Sbjct: 421 FREHAPKVGRRDTGF---GRQTWNSASESYNSQDPDWTVKDKHGSEQHNKFRGQTHNTSV 480
Query: 481 ANSSYYSGSKQIPTDEPLLNFGKDRRSFTKNEKPFMEDPFMKDFGA------DPFTAGLV 540
+NSSY G K+IP D+ LLNFG+DRRSF K EKP+MEDPFMKDFG DP+T GLV
Sbjct: 481 SNSSYSPGLKRIPADDLLLNFGRDRRSFAKIEKPYMEDPFMKDFGGSSFDGRDPYTGGLV 540
Query: 541 GMFK-KKDVIKHTDFHDPVRESFEAELERVQQMQEQERQRIIEEQERALELSRREEEERK 600
G+ K KKDVIK TDFHDPVR+SFEAELERVQQ+QEQERQRIIEEQERALEL+RREEEER+
Sbjct: 541 GVVKRKKDVIKQTDFHDPVRDSFEAELERVQQIQEQERQRIIEEQERALELARREEEERQ 600
Query: 601 RIAREHEERQRRAEEEAREVAWRAEQERLENIQKAEELRIARERDKQRIILEEERRKQAA 660
R+ARE EERQRRAEE ARE AWRAEQERLE IQKAEELRIARE +KQRI +EEERRKQAA
Sbjct: 601 RLAREQEERQRRAEEIAREAAWRAEQERLEAIQKAEELRIAREEEKQRIFVEEERRKQAA 660
Query: 661 KLKLLEVEERMAKRQAEAVKSSSLTSDIPEKNISNVAKDVSRVADTIDWEDGEKMVERIT 720
KLKLLE+EERMAKRQAEAVKSS+LTSDIPEK IS+V KD SR+ADT+DWEDGEKMVERIT
Sbjct: 661 KLKLLELEERMAKRQAEAVKSSTLTSDIPEKKISSVVKDASRLADTVDWEDGEKMVERIT 720
Query: 721 TSASSESSSMVRSSEVGPRSQFSRDASSAFVDRGKLVNSYRRDLYERRSGSQFVLQGQSI 780
TSASSESSS+ R SEVG R+Q SRD S +FVDRGK VNS+RRD Y+R SGSQFVLQ QS
Sbjct: 721 TSASSESSSINRPSEVGLRTQVSRDGSPSFVDRGKSVNSWRRDFYDRGSGSQFVLQDQST 780
Query: 781 GYNSQRQEPFVGGQLSSRKEFYGGAGFTTSRISHGRGITKPQSDDYSELRVQRPNLSGSG 840
GY R+E GG++SSRKEFYGGAG TSRI + RG+T+PQSDDYS+LR QRPNLSG G
Sbjct: 781 GYTGPRREATTGGRVSSRKEFYGGAGLATSRIYNRRGMTEPQSDDYSQLRGQRPNLSGGG 840
Query: 841 DHYNKSQEFDSELQDSFENFGDHGWRQEGGHNNVYFPYPERVNSISEPDGSYYVGRTRYS 900
D YN+SQEFDSE QD+ ENFGDHGWRQEGG NN YFPYPERVN ISE DGSY VGR+RYS
Sbjct: 841 DQYNRSQEFDSEFQDNVENFGDHGWRQEGGRNNFYFPYPERVNPISEADGSYSVGRSRYS 900
Query: 901 QKQPRVLPPPSVALMQKSSIRGEYESVTRD--------THPVRNVSTAQARYIHHENCTL 960
Q+QPRVLPPPSVA +QKSS+RGE+ SVTRD H RNVSTAQ RYIHHEN TL
Sbjct: 901 QRQPRVLPPPSVASIQKSSVRGEFTSVTRDIAESEIQYDHLARNVSTAQTRYIHHENRTL 960
Query: 961 SKIIDVNFENVENEEQKPDGGITLRCDSQSTLSVFSPPISPTHLSHEDLDDSGDSPVLSA 1020
+IIDVN EN ENEEQKPDG TLRCDSQSTLSVFSPP SPTHLSHEDLDDSGDSPVLSA
Sbjct: 961 PEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSA 1020
Query: 1021 SREGTLSIEDNESAVPAKASKVIMMTSTRVSTGDEDDWVVADEHVQEQEEYDEDDDGYQE 1080
SREGTLSIEDNESAVPAKA K IM+TSTR STGDED+W V DEHVQEQEEYDEDDDGY+E
Sbjct: 1021 SREGTLSIEDNESAVPAKAGKEIMITSTRASTGDEDEWGVVDEHVQEQEEYDEDDDGYRE 1080
Query: 1081 EDEVHEGEDENIDLEQDFDDLQLNDTDRGSPHMLDNLVLGFNEGVEVVMPNDEFERIPGD 1140
EDEVHEGEDENIDL Q+FDDL L+ D+GSPHMLDNLVLGFNEGVEV MPNDEFERI G+
Sbjct: 1081 EDEVHEGEDENIDLAQNFDDLHLD--DKGSPHMLDNLVLGFNEGVEVGMPNDEFERILGN 1140
Query: 1141 EENMYVATEISSCIKEEQGSSEGLQVDGKVCQYVDASSQVRISDLEEMQDKVMQSKIAQA 1200
EENM+VA E+S+CI+EEQGSSEGLQVDGKVCQY DASSQ+RI D EEMQD VMQS+ AQA
Sbjct: 1141 EENMFVAPEVSNCIREEQGSSEGLQVDGKVCQYEDASSQIRI-DPEEMQDLVMQSETAQA 1200
Query: 1201 LPELEITEQGN-SCRSSLSVQQPISCSVTMGSQSSSGQVIVPNAVLSGQAEPPFKLQFGL 1260
LPE EI EQGN SCRSS+SVQQPIS SV+ SQSSSGQVIVPNA SGQAEPP KLQFGL
Sbjct: 1201 LPEPEINEQGNSSCRSSVSVQQPISSSVSTASQSSSGQVIVPNAAGSGQAEPPVKLQFGL 1260
Query: 1261 FSGPSLIPSHVPSIQIGSIQMPLHLHPQITPSIIQMHSSQPPLFQFGQLRYPSSVPQGVL 1320
FSGPSLIPS VP+IQIGSIQMPLHLHPQ+TPS+ MHSSQPPLFQFGQLRY SSV QGVL
Sbjct: 1261 FSGPSLIPSPVPAIQIGSIQMPLHLHPQMTPSMTHMHSSQPPLFQFGQLRYTSSVSQGVL 1320
Query: 1321 PLPPQPPTFILPIVQTGFPSNENPREALSVQTPEETCFNKSRKHNVFPFMKDNQV-LVSR 1380
PL PQP TF+ P VQTGFP N+NP +AL +QT +ETC + SRK++V P + DNQ LVSR
Sbjct: 1321 PLAPQPLTFVPPAVQTGFPLNKNPGDALPIQTSQETCAHNSRKNDVLPLLMDNQQGLVSR 1380
Query: 1381 SLNVNSSGDSKSLPLTESIEAKVMNQQDQTSSSCIDESNSRSQPGFQAEHQRHHVPTSGN 1440
SLNVNSSG+SKSLPLTESIE++VM QQ QT+ SCIDESNSRS+PGFQAEHQRHHV TS N
Sbjct: 1381 SLNVNSSGESKSLPLTESIESQVMAQQYQTAGSCIDESNSRSEPGFQAEHQRHHVSTSDN 1440
Query: 1441 HYMVSRGKESKGRAQDGMWPFDSVSRDKGLRRFKTRGLFPGGRGKKFIFAVKNSGSRLPF 1500
HY+VSRGKES+GRAQDGM DSVSRDKGL K RG FPGGRGKK++F VKNSGSRLPF
Sbjct: 1441 HYVVSRGKESEGRAQDGMGSLDSVSRDKGLSGLKARGQFPGGRGKKYVFTVKNSGSRLPF 1500
Query: 1501 AGSESTRLDNGGLQRQPRRNTPRTEFRVRETVDKKLPNSQVSSNYVEVDDKPTDSGISAV 1560
GSESTRLD GG QR+PRRN PRTEFRVRETVDKKL +SQVSSN+VEVDDKPT SG +AV
Sbjct: 1501 PGSESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSSSQVSSNHVEVDDKPTVSGRTAV 1560
Query: 1561 NSGRNGTRKVVISNKPSKRALESEGFSSVVSTSLELDFGNRSEKGVKKDYLGKSQGNQYS 1620
NS RNGTRKV +SNKPSKRALE EG SS STSLELD GNRSEKGVKK+YLGKSQG+QY
Sbjct: 1561 NSARNGTRKVFVSNKPSKRALEPEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQGSQYY 1620
Query: 1621 GEGIFRKNIGSWEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREK 1680
GE FRKNI S EDVDAP+QSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREK
Sbjct: 1621 GESNFRKNICSGEDVDAPMQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREK 1680
Query: 1681 EIKAKSHNSKIPPKSRSTLKSALSSVNSRKVYAAKEAEMVKRTRLDFATRD--GRGSGSI 1740
EIKAKSHNSKIP KSRST K ALSSVNS KVYAAK AE VKRTR DF D GRGSG+I
Sbjct: 1681 EIKAKSHNSKIPRKSRSTSKIALSSVNSSKVYAAKVAETVKRTRSDFVAADGGGRGSGNI 1740
Query: 1741 MVSSAFSSPLVSQPLAPIGTPALKSDSQMERLHTARPILTSTPALATSNGRNLESGLMFG 1800
+VSSA SS +VSQPLAPIGTPALKSDSQ ER HTAR I TS PALATS+GRNLES LMF
Sbjct: 1741 VVSSALSSSIVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLESSLMFD 1800
Query: 1801 KKNDILDNVQTSLTSWGNSHRNQQVMALTQTQLDEAMKPAQFDLHPPV-DHSSLTGDPFI 1860
KKNDILDNV +S SWGNS NQQVMALTQTQLDEAMKPAQFDLHPPV DHSSL GDP +
Sbjct: 1801 KKNDILDNVTSSFPSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSSLAGDPNV 1860
Query: 1861 VSSPSILASDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCATFLGIGPTDLCHSD 1920
SS SILA DRSFSSAANPISSLLAGEKIQFGAVTSPTVLPP SC+T LGIGPT LCHSD
Sbjct: 1861 PSS-SILAIDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPDSCSTLLGIGPTGLCHSD 1920
Query: 1921 IQIPHKLSGAENDCNLFFDKEKHYSESCSPIENSEAEAEAAASAVAVAAISSDEILVTNG 1980
+QIPHKLSGAENDC+LFF+KEKH+SES + IE+SEAEAEAAASAVAVAAISSDEI VTNG
Sbjct: 1921 MQIPHKLSGAENDCHLFFEKEKHHSESRTRIEDSEAEAEAAASAVAVAAISSDEI-VTNG 1980
Query: 1981 LDTCSVSVTDTNNIGGGDVDIVTAGTAGDQQLASKTRADDSLTVTLPADLSVETPPISLW 2040
L T SV VTDTNN GGGD++++ AG+AG+QQ ASKTRADDSLTV LPADLSVETPPISLW
Sbjct: 1981 LGTSSVPVTDTNNFGGGDINVIIAGSAGNQQFASKTRADDSLTVALPADLSVETPPISLW 2040
Query: 2041 PTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQKCSA 2100
P+LPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQK SA
Sbjct: 2041 PSLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQKSSA 2100
Query: 2101 PAPGPLGSWKQCHSGVDSFYGPPASFTSPFVSPGGIPGVQGPPHMVVYNHFAPVGQFGQV 2160
PAPGPLGSWKQCHSGVDSFYGPPA FT PF+SPGGIPGVQGPPHMVVYNHFAPVGQFGQV
Sbjct: 2101 PAPGPLGSWKQCHSGVDSFYGPPAGFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQFGQV 2160
Query: 2161 GLSFMGATYIPSGKQHDWKHSPGPSLCVE-DQKNLNMVSAQRMPTNLPPIQHLAPGSPLL 2220
GLSFMGATYIPSGKQ DWKHSPGPSL VE DQKNLNMVSAQRMPTNLPPIQHLAPGSPLL
Sbjct: 2161 GLSFMGATYIPSGKQPDWKHSPGPSLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPGSPLL 2220
Query: 2221 PMASPLAMFDVSPFQASPEISAQTRWPSSASTVQAVPLSMPLQHQAGGVLPSHFSHPSSA 2280
PMASPLAMFDVSPFQASPE+S Q RWPSSAS+VQ VPLSMPLQ QA G+LPSHFSH SSA
Sbjct: 2221 PMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPLQQQAEGILPSHFSHASSA 2280
Query: 2281 DPSFTVNRFPGSQP---ADQKRNFPVGADASVTQLPDELEIVDESSCLSSGAGVPNVDTN 2340
DPSFTVNRFPGSQP +D KRN+ V ADA+VTQLPDEL IVD SSC+SSG VPNVD
Sbjct: 2281 DPSFTVNRFPGSQPSVASDHKRNYTVAADATVTQLPDELGIVDASSCVSSGGSVPNVDIK 2340
Query: 2341 SLVVNSATDAGKTSVRPNCSSNNSGQNASTNLKSQYP-HKGISAHQYTHSSGYNYQRGGA 2400
SL VNS TDAGKT NCSS+NS NA TNLKSQ P HKGI A QY+HSSGYNYQRGGA
Sbjct: 2341 SLSVNSVTDAGKTV--QNCSSSNSSLNAGTNLKSQSPQHKGIPAQQYSHSSGYNYQRGGA 2400
Query: 2401 SQKNSSGGSEWAHRRTRFMGRNQSGAEKNFSSGKVKQIYVAKQSSSRNL 2416
SQKNSSGGSEW HRRT FMGRNQSGAEKNFSS K+KQIYVAKQ SS NL
Sbjct: 2401 SQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSSGNL 2439
BLAST of Sed0025145 vs. NCBI nr
Match:
XP_022133325.1 (uncharacterized protein LOC111005936 isoform X1 [Momordica charantia])
HSP 1 Score: 3682.9 bits (9549), Expect = 0.0e+00
Identity = 1981/2452 (80.79%), Positives = 2135/2452 (87.07%), Query Frame = 0
Query: 1 MANPGVGSKFVSVNLNKSYGQAHH----HSNSYGSNRTRPGTHGAGGGMVVLSRPRSSQK 60
MANPGVG+KFVSVNLNKSYGQ HH H NSYGSNRTRPG+HGAGGGMVVLSRPRSSQK
Sbjct: 1 MANPGVGTKFVSVNLNKSYGQPHHHHSSHPNSYGSNRTRPGSHGAGGGMVVLSRPRSSQK 60
Query: 61 PGPKLSVPPPLNLPSLRKEHQRLDSLGSSAGPTGGGVMGNGQRPTSAGIGWTKPRTNDFP 120
PGPKLSVPPPLNLPSLRKEH+RLDSLGS AGP GGGV+GNGQRPTSAG+GWTKPRTND P
Sbjct: 61 PGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLGNGQRPTSAGMGWTKPRTNDLP 120
Query: 121 DKEALNGNVVDRIDPSLRSVDGVSGGSSVYMPPSARAGITQPDCVSSCFLSVHTTLEKTP 180
+KE L+ N+ DRIDPSLR+VDG SGGSSVYMPPSARAG+T P +S V+ +EK P
Sbjct: 121 EKEGLSSNMADRIDPSLRTVDGASGGSSVYMPPSARAGMTGPVVTTSASSQVYAAVEKAP 180
Query: 181 ILGGEDFPSLQATLPSAVAPPQKQKDGMNSKLKHASEGSYEERRDTSHLSSSIDARTKYR 240
+L GEDFPSLQATLPSA P QK KDG +SKLK A+EGSYEE+RDTSHLSSSIDAR K++
Sbjct: 181 VLRGEDFPSLQATLPSAAGPSQKLKDGPSSKLKRAAEGSYEEQRDTSHLSSSIDARPKFQ 240
Query: 241 SSLKSPPRENSKNGDFFS-----SPEFSRKQEDIFPDPLPLVSVNPRSDWADDERDTSQG 300
S+ K P EN+K GD FS S E SRKQED+FP PLPLVS+NPRSDWADDERDTS G
Sbjct: 241 SAQKGLPSENAKKGDTFSLGSFQSSESSRKQEDLFPGPLPLVSMNPRSDWADDERDTSHG 300
Query: 301 LAGMGRDRGHPKSEAYWERDFDMPRVSSLPHKPIPNFSQKWNMRDNESGKFRSSDIRKVD 360
L GRDRGHPKSEAYWERDFDMPRVS+LPHKPIPNFSQ+WN+RD+ESGKF S+DI KVD
Sbjct: 301 LIDRGRDRGHPKSEAYWERDFDMPRVSALPHKPIPNFSQRWNLRDDESGKFHSNDIHKVD 360
Query: 361 PYGRDARTPSREGWEGSFQKNNPIPKDGFHSDSGNPRNDIAARPTSIDREANAGNMHVSH 420
PYGRDARTPSREGWEG+F++N PIPKDGF SDS N RNDIAARPT+IDRE NA +MHVSH
Sbjct: 361 PYGRDARTPSREGWEGNFRRNTPIPKDGFGSDSRNDRNDIAARPTNIDRETNANSMHVSH 420
Query: 421 FQEHAHKD-GRKDTGFGQTGRQTWNRATESYSSQEPDRTTRDKYGSEQHNRYKGEIHNTS 480
F+EHAHKD GR+DTG+GQ GRQTWN A ESYSSQEPDR RDKYGSEQHNRY+GE HNTS
Sbjct: 421 FREHAHKDPGRRDTGYGQMGRQTWNSAAESYSSQEPDRVIRDKYGSEQHNRYRGETHNTS 480
Query: 481 VANSSYYSGSKQIPTDEPLLNFGKDRRSFTKNEKPFMEDPFMKDFGA------DPFTAGL 540
VANSSY SG K+IP DEPLLNFG++RRSF K EKP+MEDPFMKDFGA DPF G+
Sbjct: 481 VANSSYSSGLKRIPADEPLLNFGRERRSFAKIEKPYMEDPFMKDFGASGFDGRDPFATGI 540
Query: 541 VGMFK-KKDVIKHTDFHDPVRESFEAELERVQQMQEQERQRIIEEQERALELSRREEEER 600
VG+ K KKDVIK TDFHDPVRESFEAELERVQQ+QEQERQRIIEEQERALEL+RREEEER
Sbjct: 541 VGVVKRKKDVIKQTDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREEEER 600
Query: 601 KRIAREHEERQRRAEEEAREVAWRAEQERLENIQKAEELRIARERDKQRIILEEERRKQA 660
K +AREHEERQRRAEEEARE AWRAEQERLE IQKAEELRIARE +KQRIILEEERRKQA
Sbjct: 601 KTLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRIILEEERRKQA 660
Query: 661 AKLKLLEVEERMAKRQAEAVKSSSLTSDIPEKNISNVAKDVSRVADTIDWEDGEKMVERI 720
AKL LLE+EERMAKRQAE VKSS+ TSDIPEK I V KDVSR+AD +DWEDGEKMVERI
Sbjct: 661 AKLMLLELEERMAKRQAETVKSSTSTSDIPEKKIPGVVKDVSRLADAVDWEDGEKMVERI 720
Query: 721 TTSASSESSSMVRSSEVGPRSQFSRDASSAFVDRGKLVNSYRRDLYERRSGSQFVLQGQS 780
TTSASSESSS+ RSSEVG RSQFSRDAS AFVDRGK VNS+RRD YER SGSQFV+Q QS
Sbjct: 721 TTSASSESSSINRSSEVGLRSQFSRDASPAFVDRGKSVNSWRRDFYERGSGSQFVVQDQS 780
Query: 781 IGYNSQRQEPFVGGQLSSRKEFYGGAGFTTSRISHGRGITKPQSDDYSELRVQRPNLSGS 840
GYN R+E +GG+ +SRKEFYGGAGFTTSRISH RGIT+PQSDDYS+LR RPNLSG
Sbjct: 781 TGYNGPRREAPIGGRATSRKEFYGGAGFTTSRISHRRGITEPQSDDYSQLRGHRPNLSGG 840
Query: 841 GDHYNKSQEFDSELQDSFENFGDHGWRQEGGHNNVYFPYPERVNSISEPDGSYYVGRTRY 900
GDHY++S +FDSE QD+ ENFGDHGWRQE G NN YFPYPERVN ISE DGSY VGR+RY
Sbjct: 841 GDHYSQSPDFDSEFQDNVENFGDHGWRQETGRNNFYFPYPERVNPISETDGSYSVGRSRY 900
Query: 901 SQKQPRVLPPPSVALMQKSSIRGEYESVTRD--------THPVRNVSTAQARYIHHENCT 960
SQ+QPRVLPPPSVA +QKSS+RGEYESV RD HP RNVSTAQ YIHHEN +
Sbjct: 901 SQRQPRVLPPPSVASIQKSSVRGEYESVPRDIVESEIQYDHPARNVSTAQTGYIHHENRS 960
Query: 961 LSKIIDVNFENVENEEQKPDGGITLRCDSQSTLSVFSPPISPTHLSHEDLDDSGDSPVLS 1020
+IIDVN +N ENEEQKPD TLRCDSQSTLSVFSPP SPTHLSHEDLDDSGDSPVLS
Sbjct: 961 FPEIIDVNLDNAENEEQKPDCDTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLS 1020
Query: 1021 ASREGTLSIEDNESAVPAKASKVIMMTSTRVSTGDEDDWVVADEHVQEQEEYDEDDDGYQ 1080
ASREGTLSIED ESAVP K K IM++STRVSTGDED+W V +EHVQEQEEYDEDDDGY
Sbjct: 1021 ASREGTLSIEDTESAVPTKCGKEIMISSTRVSTGDEDEWAVGNEHVQEQEEYDEDDDGYD 1080
Query: 1081 EEDEVHEGEDENIDLEQDFDDLQLNDTDRGSPHMLDNLVLGFNEGVEVVMPNDEFERIPG 1140
EEDEVHE EDENIDL QDFDDL L+ D+GSPHMLDNLVLGFNEGVEV MPNDEFERI G
Sbjct: 1081 EEDEVHEVEDENIDLAQDFDDLHLD--DKGSPHMLDNLVLGFNEGVEVGMPNDEFERISG 1140
Query: 1141 DEENMYVATEISSCIKEEQGSSEGLQVDGKVCQYVDASSQVRISDLEEMQDKVMQSKIAQ 1200
+EENM+V EISSCI+EEQGSSE LQVD +CQY DASSQVRI D EEM+D V+ SK AQ
Sbjct: 1141 NEENMFVTPEISSCIREEQGSSEVLQVDSNICQYPDASSQVRIPDTEEMKDLVILSKPAQ 1200
Query: 1201 ALPELEITEQGN-SCRSSLSVQQPISCSVTMGSQSSSGQVIVPNAVLSGQAEPPFKLQFG 1260
ALP E+TEQG SCRS +SVQ PIS SV+M SQS GQVIVPNA +SGQAEPP KLQFG
Sbjct: 1201 ALPGSELTEQGKFSCRSGVSVQLPISSSVSMASQSPPGQVIVPNAAISGQAEPPVKLQFG 1260
Query: 1261 LFSGPSLIPSHVPSIQIGSIQMPLHLHPQITPSIIQMHSSQPPLFQFGQLRYPSSVPQGV 1320
LFSGPSLIPS VP+IQIGSIQMPLHLH QITPS+ MHSSQPPLFQFGQLRY SSV QGV
Sbjct: 1261 LFSGPSLIPSPVPAIQIGSIQMPLHLHAQITPSMTHMHSSQPPLFQFGQLRYTSSVSQGV 1320
Query: 1321 LPLPPQPPTFILPIVQTGFPSNENPREALSVQTPEETCFNKSRKHNVFPFMKDNQV-LVS 1380
LPL PQP TF+ P VQTGFP N+NP +A S+QT +ETC + SRK++V PF+ DNQ L S
Sbjct: 1321 LPLAPQPLTFVPPTVQTGFPLNKNPGDAPSIQTSQETCAHNSRKNDVLPFLMDNQQGLAS 1380
Query: 1381 RSLNVNSSGDSKSLPLTESIEAKVMNQQDQTSSSCIDESNSRSQPGFQAEHQRHH--VPT 1440
RSL NSSG+SKSLPLT++ E++V+ QQDQT+ SCIDESNSRS+ GFQAEHQRHH V T
Sbjct: 1381 RSL--NSSGESKSLPLTDNKESEVITQQDQTAGSCIDESNSRSESGFQAEHQRHHHNVST 1440
Query: 1441 SGNHYMVSRGKESKGRAQDGMWPFDSVSRDKGLRRFKTRGLFPGGRGKKFIFAVKNSGSR 1500
S +HY+V+RGKES+GRAQDGM PFDSVSRDKGL K RG F GGRGKK+IF VKNSGSR
Sbjct: 1441 SDDHYVVTRGKESEGRAQDGMGPFDSVSRDKGLGGSKARGQFHGGRGKKYIFTVKNSGSR 1500
Query: 1501 LPFAGSESTRLDNGGLQRQPRRNTPRTEFRVRETVDKKLPNSQVSSNYVEVDDKPTDSGI 1560
L F SESTRLD+ G QR+PRRN PRTEFRVRETVDKK NSQVSS+ VEVDDKPT SG
Sbjct: 1501 LSFPASESTRLDSSGFQRRPRRNAPRTEFRVRETVDKKSSNSQVSSSNVEVDDKPTVSGR 1560
Query: 1561 SAVNSGRNGTRKVVISNKPSKRALESEGFSSVVSTSLELDFGNRSEKGVKKDYLGKSQGN 1620
SA +S RNGTRKVVIS KPSKRALESEG SS VS+SLELD GNR+EKGVKK+YLGKSQG+
Sbjct: 1561 SAASSARNGTRKVVISTKPSKRALESEGLSSGVSSSLELDAGNRTEKGVKKEYLGKSQGS 1620
Query: 1621 QYSGEGIFRKNIGSWEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQ 1680
QYSGEG FRKNI S EDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQ
Sbjct: 1621 QYSGEGNFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQ 1680
Query: 1681 REKEIKAKSHNSKIPPKSRSTLKSALSSVNSRKVYAAKEAEMVKRTRLDFATRDGRGSGS 1740
REKEIKAKSHNSKIP +SRS K A SSVNS KVYAAKEAE VKRTR DF DGRGSG+
Sbjct: 1681 REKEIKAKSHNSKIPRRSRSNSKIASSSVNSSKVYAAKEAETVKRTRSDFVATDGRGSGN 1740
Query: 1741 IMVSSAFSSPLVSQPLAPIGTPALKSDSQMERLHTARPILTSTPALATSNGRNLESGLMF 1800
I+VSSAFSSP+VSQPLAPIGTPALKSDSQ ER HTAR I +S PALAT +GRNLES MF
Sbjct: 1741 IVVSSAFSSPIVSQPLAPIGTPALKSDSQTERSHTARSIQSSAPALATGDGRNLESSSMF 1800
Query: 1801 GKKNDILDNVQTSLTSWGNSHRNQQVMALTQTQLDEAMKPAQFDLHPPV-DHSSLTGDPF 1860
KKNDILDNVQTS SWG S NQQVMALTQTQLDEAMKPAQFDLHPPV DHSSL GDP
Sbjct: 1801 DKKNDILDNVQTSFASWGGSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSSLAGDP- 1860
Query: 1861 IVSSPSILASDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCATFLGIGPTDLCHS 1920
V SPSILA DRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCAT LGIGPT LCHS
Sbjct: 1861 NVPSPSILARDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCATLLGIGPTGLCHS 1920
Query: 1921 DIQIPHKLSGAENDCNLFFDKEKHYSESCSPIENSEAEAEAAASAVAVAAISSDEILVTN 1980
DIQIPHKLSGAENDC++FF+KEKH+SESC+ IE+SEAEAEAAASAVAVAAISSDEI VTN
Sbjct: 1921 DIQIPHKLSGAENDCHIFFEKEKHHSESCTHIEDSEAEAEAAASAVAVAAISSDEI-VTN 1980
Query: 1981 GLDTCSVSVTDTNNIGGGDVDIVTAGTAGDQQLASKTRADDSLTVTLPADLSVETPPISL 2040
GL TCSVSVTDTNN G GD++++TAG+AGDQQLASKTRADDSLTV LPADLSVETPPISL
Sbjct: 1981 GLGTCSVSVTDTNNFGSGDINVITAGSAGDQQLASKTRADDSLTVALPADLSVETPPISL 2040
Query: 2041 WPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQKCS 2100
WPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQ+QTQK S
Sbjct: 2041 WPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQSQTQKSS 2100
Query: 2101 APAPGPLGSWKQCHSGVDSFYGPPASFTSPFVSPGGIPGVQGPPHMVVYNHFAPVGQFGQ 2160
APAPGPLGSWKQCHSGVDSFYGPP F+ PF+SPGGIPGVQGPPHMVVYNHFAPVGQFGQ
Sbjct: 2101 APAPGPLGSWKQCHSGVDSFYGPP-GFSGPFISPGGIPGVQGPPHMVVYNHFAPVGQFGQ 2160
Query: 2161 VGLSFMGATYIPSGKQHDWKHSPGP-SLCVE-DQKNLNMVSAQRMPTNLPPIQHLAPGSP 2220
VGLSFMGATYIPSGKQHDWKHSPGP SL VE DQK LNMVSAQRMPTNLPPIQHLAPGSP
Sbjct: 2161 VGLSFMGATYIPSGKQHDWKHSPGPSSLGVEGDQKTLNMVSAQRMPTNLPPIQHLAPGSP 2220
Query: 2221 LLPMASPLAMFDVSPFQASPEISAQTRWPSSASTVQAVPLSMPL-QHQAGGVLPSHFSHP 2280
LLPMASPLAMFDVSPFQASPE+S Q RWPSSAS+VQ VPLSMPL Q QA GVLPSHFSH
Sbjct: 2221 LLPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPLQQQQAEGVLPSHFSHS 2280
Query: 2281 SSADPSFTVNRFPGSQP---ADQKRNFPVGADASVTQLPDELEIVDESSCLSSGAGVPNV 2340
SSADPSFTVNRFPGSQP +D KRNF V DA+VTQLPDEL IVD SSC+SSGA VPNV
Sbjct: 2281 SSADPSFTVNRFPGSQPSVASDHKRNFSVATDATVTQLPDELGIVDASSCVSSGASVPNV 2340
Query: 2341 DTNSLVVNSATDAGKTSVRPNCSSNNSGQNASTNLKSQYP-HKGISAHQYTHSSGYNYQR 2400
D NSL V+S TDAGKT V+ NCSSNNSGQN+ TNLKSQ P HKG+S QY+HSSGYN+QR
Sbjct: 2341 DINSLSVSSVTDAGKTGVQ-NCSSNNSGQNSGTNLKSQSPQHKGVSTQQYSHSSGYNFQR 2400
Query: 2401 GGASQKNSSGGSEWAHRRTRFMGRNQSGAEKNFSSGKVKQIYVAKQSSSRNL 2416
GGASQK+SSGG EW+HRRT FMGRNQSGAEKNFSS K+KQIYVAKQ+SS NL
Sbjct: 2401 GGASQKHSSGGGEWSHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQTSSGNL 2444
BLAST of Sed0025145 vs. NCBI nr
Match:
KAG7034343.1 (hypothetical protein SDJN02_04070, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 3678.6 bits (9538), Expect = 0.0e+00
Identity = 1985/2453 (80.92%), Positives = 2138/2453 (87.16%), Query Frame = 0
Query: 1 MANPGVGSKFVSVNLNKSYGQAHH----HSNSYGSNRTRPGTHGAGGGMVVLSRPRSSQK 60
MANPGVG+KFVSVNLNKSYGQAHH HSNSYGSNRTRPG+HGAGGGMVVLSRPRSSQK
Sbjct: 1 MANPGVGAKFVSVNLNKSYGQAHHHHSSHSNSYGSNRTRPGSHGAGGGMVVLSRPRSSQK 60
Query: 61 PGPKLSVPPPLNLPSLRKEHQRLDSLGSSAGPTGGGVMGNGQRPTSAGIGWTKPRTNDFP 120
PGPKLSVPPPLNLPSLRKEH+RLDSLGS AG TGGGV+GN QRPTSAG+GWTKP TND P
Sbjct: 61 PGPKLSVPPPLNLPSLRKEHERLDSLGSGAGQTGGGVLGNRQRPTSAGLGWTKPHTNDLP 120
Query: 121 DKEALNGNVVDRIDPSLRSVDGVSGGSSVYMPPSARAGITQPDCVSSCFLSVHTTLEKTP 180
+KE L+GN+VD+IDPSLRSVDGV+GGSSVYMPPSARA P +S VHT +EK P
Sbjct: 121 EKEGLSGNIVDKIDPSLRSVDGVNGGSSVYMPPSARASTAGPVVSTSASSQVHTAVEKAP 180
Query: 181 ILGGEDFPSLQATLPSAVAPPQKQKDGMNSKLKHASEGSYEERRDTSHLSSSIDARTKYR 240
+L GEDFPSLQATLPSA AP QKQ+DG++SKLKH +E SYEE+RDTSHLSSSIDAR+K++
Sbjct: 181 VLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHVAEVSYEEQRDTSHLSSSIDARSKFQ 240
Query: 241 SSLKSPPRENSKNGDFFS-----SPEFSRKQEDIFPDPLPLVSVNPRSDWADDERDTSQG 300
SS KS P EN+KNGD FS SPE SRKQEDIFP PLPLVS+NPRSDWADDERDTS G
Sbjct: 241 SSKKSIPSENAKNGDSFSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERDTSHG 300
Query: 301 LAGMGRDRGHPKSEAYWERDFDMPRVSSLPHKPIPNFSQKWNMRDNESGKFRSSDIRKVD 360
L RDRGHPKSEAYWERDFDMP VSSLPHKPI NFSQ+W+ RD+ESGKF SSDI KVD
Sbjct: 301 LIDRVRDRGHPKSEAYWERDFDMPWVSSLPHKPIHNFSQRWHPRDDESGKFHSSDIHKVD 360
Query: 361 PYGRDARTPSREGWEGSFQKNNPIPKDGFHSDSGNPRNDIAARPTSIDREANAGNMHVSH 420
PYGRDAR PSREGWEG+FQKN PIPKD F SDSGN RNDIA RPTSIDRE NA NMHVS
Sbjct: 361 PYGRDARAPSREGWEGNFQKNIPIPKDRFGSDSGNDRNDIAGRPTSIDRETNADNMHVSQ 420
Query: 421 FQEHAHKDGRKDTGFGQTGRQTWNRATESYSSQEPDRTTRDKYGSEQHNRYKGEIHNTSV 480
F+EHA K GR+DTGF GRQTWN A+ESY+SQ+PD T +DK+GSEQHN+++G+ HNTSV
Sbjct: 421 FREHAPKVGRRDTGF---GRQTWNSASESYNSQDPDWTVKDKHGSEQHNKFRGQTHNTSV 480
Query: 481 ANSSYYSGSKQIPTDEPLLNFGKDRRSFTKNEKPFMEDPFMKDFGA------DPFTAGLV 540
+NSSY G K+IP D+ LLNFG+DRRSF K EKP+MEDPFMKDFG DP+T GLV
Sbjct: 481 SNSSYSPGLKRIPADDLLLNFGRDRRSFAKIEKPYMEDPFMKDFGGSSFDGRDPYTGGLV 540
Query: 541 GMFK-KKDVIKHTDFHDPVRESFEAELERVQQMQEQERQRIIEEQERALELSRREEEERK 600
G+ K KKDVIK TDFHDPVR+SFEAELERVQQ+QEQERQRIIEEQERALEL+RREEEER+
Sbjct: 541 GVVKRKKDVIKQTDFHDPVRDSFEAELERVQQIQEQERQRIIEEQERALELARREEEERQ 600
Query: 601 RIAREHEERQRRAEEEAREVAWRAEQERLENIQKAEELRIARERDKQRIILEEERRKQAA 660
R+ARE EERQRRAEE ARE AWRAEQERLE IQKAEELRIARE +KQRI +EEERRKQAA
Sbjct: 601 RLAREQEERQRRAEEIAREAAWRAEQERLEAIQKAEELRIAREEEKQRIFVEEERRKQAA 660
Query: 661 KLKLLEVEERMAKRQAEAVKSSSLTSDIPEKNISNVAKDVSRVADTIDWEDGEKMVERIT 720
KLKLLE+EERMAKRQAEAVKSS+LT DIPEK IS+V KD SR+ADT+DWEDGEKMVERIT
Sbjct: 661 KLKLLELEERMAKRQAEAVKSSTLTQDIPEKKISSVVKDASRLADTVDWEDGEKMVERIT 720
Query: 721 TSASSESSSMVRSSEVGPRSQFSRDASSAFVDRGKLVNSYRRDLYERRSGSQFVLQGQSI 780
TSASSESSS+ R SEVG R+Q SRD S +FVDRGK VNS+RRD Y+R SGSQFVLQ QS
Sbjct: 721 TSASSESSSINRPSEVGLRTQVSRDGSPSFVDRGKSVNSWRRDFYDRGSGSQFVLQDQST 780
Query: 781 GYNSQRQEPFVGGQLSSRKEFYGGAGFTTSRISHGRGITKPQSDDYSELRVQRPNLSGSG 840
GY R+E GG++SSRKEFYGGAG TSRI + RG+T+PQSDDYS+LR QRPNLSG G
Sbjct: 781 GYTGPRREATTGGRVSSRKEFYGGAGLATSRIYNRRGMTEPQSDDYSQLRGQRPNLSGGG 840
Query: 841 DHYNKSQEFDSELQDSFENFGDHGWRQEGGHNNVYFPYPERVNSISEPDGSYYVGRTRYS 900
D YN+SQEFDSE QD+ ENFGDHGWRQEGG NN YFPYPERVN ISE DGSY VGR+RYS
Sbjct: 841 DQYNRSQEFDSEFQDNVENFGDHGWRQEGGRNNFYFPYPERVNPISEADGSYSVGRSRYS 900
Query: 901 QKQPRVLPPPSVALMQKSSIRGEYESVTRD--------THPVRNVSTAQARYIHHENCTL 960
Q+QPRVLPPPSVA +QKSS+RGE+ SVTRD H RNVSTAQ RYIHHEN TL
Sbjct: 901 QRQPRVLPPPSVASIQKSSVRGEFTSVTRDIAESEIQYDHLARNVSTAQTRYIHHENRTL 960
Query: 961 SKIIDVNFENVENEEQKPDGGITLRCDSQSTLSVFSPPISPTHLSHEDLDDSGDSPVLSA 1020
+IIDVN EN ENEEQKPDG TLRCDSQSTLSVFSPP SPTHLSHEDLDDSGDSPVLSA
Sbjct: 961 PEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSA 1020
Query: 1021 SREGTLSIEDNESAVPAKASKVIMMTSTRVSTGDEDDWVVADEHVQEQEEYDEDDDGYQE 1080
SREGTLSIEDNESAVPAKA K IM++STR STGDED+W V DEHVQEQEEYDEDDDGY+E
Sbjct: 1021 SREGTLSIEDNESAVPAKAGKEIMISSTRASTGDEDEWGVVDEHVQEQEEYDEDDDGYRE 1080
Query: 1081 EDEVHEGEDENIDLEQDFDDLQLNDTDRGSPHMLDNLVLGFNEGVEVVMPNDEFERIPGD 1140
EDEVHEGEDENIDL Q+FDDL L+ D+GSPHMLDNLVLGFNEGVEV MPNDEFERI G+
Sbjct: 1081 EDEVHEGEDENIDLAQNFDDLHLD--DKGSPHMLDNLVLGFNEGVEVGMPNDEFERILGN 1140
Query: 1141 EENMYVATEISSCIKEEQGSSEGLQVDGKVCQYVDASSQVRISDLEEMQDKVMQSKIAQA 1200
EENM+VA EIS+CI+EEQGSSEGLQVDGKVCQY DASSQ+RI D EEMQD VMQS+ AQA
Sbjct: 1141 EENMFVAPEISNCIREEQGSSEGLQVDGKVCQYEDASSQIRI-DPEEMQDLVMQSETAQA 1200
Query: 1201 LPELEITEQGN-SCRSSLSVQQPISCSVTMGSQSSSGQVIVPNAVLSGQAEPPFKLQFGL 1260
LPE EI EQGN SCRSS+SVQQPIS SV+ SQSSSGQVIVPNA SGQAEPP KLQFGL
Sbjct: 1201 LPEPEINEQGNSSCRSSVSVQQPISSSVSTASQSSSGQVIVPNAAGSGQAEPPVKLQFGL 1260
Query: 1261 FSGPSLIPSHVPSIQIGSIQMPLHLHPQITPSIIQMHSSQPPLFQFGQLRYPSSVPQGVL 1320
FSGPSLIPS VP+IQIGSIQMPLHLHPQ+TPS+ MHSSQPPLFQFGQLRY SSV QGVL
Sbjct: 1261 FSGPSLIPSPVPAIQIGSIQMPLHLHPQMTPSMTHMHSSQPPLFQFGQLRYTSSVSQGVL 1320
Query: 1321 PLPPQPPTFILPIVQTGFPSNENPREALSVQTPEETCFNKSRKHNVFPFMKDNQV-LVSR 1380
PL PQP TF+ P VQTGFP N+NP +AL +QT +ETC + SRK++V P + DNQ LVSR
Sbjct: 1321 PLAPQPLTFVPPAVQTGFPLNKNPGDALPIQTSQETCAHNSRKNDVLPLLMDNQQGLVSR 1380
Query: 1381 SLNVNSSGDSKSLPLTESIEAKVMNQQDQTSSSCIDESNSRSQPGFQAEHQRHHVPTSGN 1440
SLNVNSSG+SKSLPLTESIE++VM QQ QT+ SCIDESNSRS+PGFQ+EHQRHHV TS N
Sbjct: 1381 SLNVNSSGESKSLPLTESIESQVMAQQYQTAGSCIDESNSRSEPGFQSEHQRHHVSTSDN 1440
Query: 1441 HYMVSRGKESKGRAQDGMWPFDSVSRDKGLRRFKTRGLFPGGRGKKFIFAVKNSGSRLPF 1500
HY+VSRGKES+GRAQDGM DSVSRDKGL K RG FPGGRGKK++F VKNSGSRLPF
Sbjct: 1441 HYVVSRGKESEGRAQDGMGSLDSVSRDKGLSGLKARGQFPGGRGKKYVFTVKNSGSRLPF 1500
Query: 1501 AGSESTRLDNGGLQRQPRRNTPRTEFRVRETVDKKLPNSQVSSNYVEVDDKPTDSGISAV 1560
GSESTRLD GG QRQPRRN PRTEFRVRETVDKKL +SQVSSN+VEVDDKPT SG +AV
Sbjct: 1501 PGSESTRLDTGGFQRQPRRNIPRTEFRVRETVDKKLSSSQVSSNHVEVDDKPTVSGRTAV 1560
Query: 1561 NSGRNGTRKVVISNKPSKRALESEGFSSVVSTSLELDFGNRSEKGVKKDYLGKSQGNQYS 1620
NS RNGTRKV +SNKPSKRALE EG SS STSLELD GNRSEKGVKK+YLGKSQG+QY
Sbjct: 1561 NSARNGTRKVFVSNKPSKRALEPEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQGSQYY 1620
Query: 1621 GEGIFRKNIGSWEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREK 1680
GE FRKNI S EDVDAP+QSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREK
Sbjct: 1621 GESNFRKNICSGEDVDAPMQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREK 1680
Query: 1681 EIKAKSHNSKIPPKSRSTLKSALSSVNSRKVYAAKEAEMVKRTRLDFATRD--GRGSGSI 1740
EIKAKSHNSKIP KSRST K ALSSVNS KVYAAK AE VKRTR DF D GRGSG+I
Sbjct: 1681 EIKAKSHNSKIPRKSRSTSKIALSSVNSSKVYAAKVAETVKRTRSDFVAADGGGRGSGNI 1740
Query: 1741 MVSSAFSSPLVSQPLAPIGTPALKSDSQMERLHTARPILTSTPALATSNGRNLESGLMFG 1800
+VSSA SS +VSQPLAPIGTPALKSDSQ ER HTAR I TS PALATS+GRNLES LMF
Sbjct: 1741 VVSSALSSSIVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLESSLMFD 1800
Query: 1801 KKNDILDNVQTSLTSWGNSHRNQ----QVMALTQTQLDEAMKPAQFDLHPPV-DHSSLTG 1860
KKNDILDNV +S SWGNS NQ QVMALTQTQLDEAMKPAQFDLHPPV DHSSL G
Sbjct: 1801 KKNDILDNVPSSFPSWGNSRINQQIHWQVMALTQTQLDEAMKPAQFDLHPPVGDHSSLAG 1860
Query: 1861 DPFIVSSPSILASDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCATFLGIGPTDL 1920
DP + SS SILA DRSFSSAANPISSLLAGEKIQFGAVTSPTVLPP SC+T LGIGPT L
Sbjct: 1861 DPNVPSS-SILAIDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPDSCSTLLGIGPTGL 1920
Query: 1921 CHSDIQIPHKLSGAENDCNLFFDKEKHYSESCSPIENSEAEAEAAASAVAVAAISSDEIL 1980
CHSD+QIPHKLSGAENDC+LFF+KEKH+SES + IE+SEAEAEAAASAVAVAAISSDEI
Sbjct: 1921 CHSDMQIPHKLSGAENDCHLFFEKEKHHSESRTRIEDSEAEAEAAASAVAVAAISSDEI- 1980
Query: 1981 VTNGLDTCSVSVTDTNNIGGGDVDIVTAGTAGDQQLASKTRADDSLTVTLPADLSVETPP 2040
VTNGL T SV VTDTNN GGGD++++ AG+AG+QQ ASKTRADDSLTV LPADLSVETPP
Sbjct: 1981 VTNGLGTSSVPVTDTNNFGGGDINVIIAGSAGNQQFASKTRADDSLTVALPADLSVETPP 2040
Query: 2041 ISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQ 2100
ISLWP+LPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQ
Sbjct: 2041 ISLWPSLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQ 2100
Query: 2101 KCSAPAPGPLGSWKQCHSGVDSFYGPPASFTSPFVSPGGIPGVQGPPHMVVYNHFAPVGQ 2160
K SAPAPGPLGSWKQCHSGVDSFYGPPA FT PF+SPGGIPGVQGPPHMVVYNHFAPVGQ
Sbjct: 2101 KSSAPAPGPLGSWKQCHSGVDSFYGPPAGFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQ 2160
Query: 2161 FGQVGLSFMGATYIPSGKQHDWKHSPGPSLCVE-DQKNLNMVSAQRMPTNLPPIQHLAPG 2220
FGQVGLSFMGATYIPSGKQ DWKHSPGPSL VE DQKNLNMVSAQRMPTNLPPIQHLAPG
Sbjct: 2161 FGQVGLSFMGATYIPSGKQPDWKHSPGPSLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPG 2220
Query: 2221 SPLLPMASPLAMFDVSPFQASPEISAQTRWPSSASTVQAVPLSMPLQHQAGGVLPSHFSH 2280
SPLLPMASPLAMFDVSPFQASPE+S Q RWPSSAS+VQ VPLSMPLQ QA G+LPSHFSH
Sbjct: 2221 SPLLPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPLQQQAEGILPSHFSH 2280
Query: 2281 PSSADPSFTVNRFPGSQP---ADQKRNFPVGADASVTQLPDELEIVDESSCLSSGAGVPN 2340
SSADPSFTVNRFPGSQP +D KRN+ V ADA+VTQLPDEL IVD SSC+SSG VPN
Sbjct: 2281 ASSADPSFTVNRFPGSQPSVASDHKRNYTVAADATVTQLPDELGIVDASSCVSSGGSVPN 2340
Query: 2341 VDTNSLVVNSATDAGKTSVRPNCSSNNSGQNASTNLKSQYP-HKGISAHQYTHSSGYNYQ 2400
VD SL VNS TDAGKT V+ NCSS+NS NA TNLKSQ P HKGI A QY+HSSGYNYQ
Sbjct: 2341 VDIKSLSVNSVTDAGKTGVQ-NCSSSNSSLNAGTNLKSQSPQHKGIPAQQYSHSSGYNYQ 2400
Query: 2401 RGGASQKNSSGGSEWAHRRTRFMGRNQSGAEKNFSSGKVKQIYVAKQSSSRNL 2416
RGGASQKNSSGGSEW HRRT FMGRNQSGAEKNFSS K+KQIYVAKQ SS NL
Sbjct: 2401 RGGASQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSSGNL 2444
BLAST of Sed0025145 vs. NCBI nr
Match:
XP_023543957.1 (uncharacterized protein LOC111803678 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023543958.1 uncharacterized protein LOC111803678 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023543959.1 uncharacterized protein LOC111803678 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 3677.9 bits (9536), Expect = 0.0e+00
Identity = 1982/2449 (80.93%), Positives = 2137/2449 (87.26%), Query Frame = 0
Query: 1 MANPGVGSKFVSVNLNKSYGQAHH----HSNSYGSNRTRPGTHGAGGGMVVLSRPRSSQK 60
MANPGVG+KFVSVNLNKSYGQAHH HSNSYGSNRTRPG+HGAGGGMVVLSRPRSSQK
Sbjct: 1 MANPGVGAKFVSVNLNKSYGQAHHHHSSHSNSYGSNRTRPGSHGAGGGMVVLSRPRSSQK 60
Query: 61 PGPKLSVPPPLNLPSLRKEHQRLDSLGSSAGPTGGGVMGNGQRPTSAGIGWTKPRTNDFP 120
PGPKLSVPPPLNLPSLRKEH+RLDSLGS AG TGGGV+GN QRPTSAG+GWTKP TND P
Sbjct: 61 PGPKLSVPPPLNLPSLRKEHERLDSLGSGAGQTGGGVLGNRQRPTSAGLGWTKPHTNDLP 120
Query: 121 DKEALNGNVVDRIDPSLRSVDGVSGGSSVYMPPSARAGITQPDCVSSCFLSVHTTLEKTP 180
+KE L+GN+VD+IDPSLRSVDGV+GGSSVYMPPSARA P +S VHT +EK P
Sbjct: 121 EKEGLSGNIVDKIDPSLRSVDGVNGGSSVYMPPSARASTAGPVVSTSASSQVHTAVEKAP 180
Query: 181 ILGGEDFPSLQATLPSAVAPPQKQKDGMNSKLKHASEGSYEERRDTSHLSSSIDARTKYR 240
+L GEDFPSLQATLPSA AP QKQ+DG++SKLKHA+E SYEE+RDTSHLSSSIDAR+K++
Sbjct: 181 VLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEVSYEEQRDTSHLSSSIDARSKFQ 240
Query: 241 SSLKSPPRENSKNGDFFSS-----PEFSRKQEDIFPDPLPLVSVNPRSDWADDERDTSQG 300
SS KS P EN+KNG+ FSS PE SRKQEDIFP PLPLVS+NPRSDWADDERDTS G
Sbjct: 241 SSKKSIPSENAKNGNSFSSGSFQPPELSRKQEDIFPGPLPLVSMNPRSDWADDERDTSHG 300
Query: 301 LAGMGRDRGHPKSEAYWERDFDMPRVSSLPHKPIPNFSQKWNMRDNESGKFRSSDIRKVD 360
L RDRGHPKSEAYWERDFDMP VSSLPHKPI NFSQ+W+ RD+ESGKF SSDI KVD
Sbjct: 301 LIDRVRDRGHPKSEAYWERDFDMPWVSSLPHKPIHNFSQRWHPRDDESGKFHSSDIHKVD 360
Query: 361 PYGRDARTPSREGWEGSFQKNNPIPKDGFHSDSGNPRNDIAARPTSIDREANAGNMHVSH 420
PYGRDARTPSREGWEG+FQKNNPIPKD F SDSGN RNDIA RPTSIDRE NA NMHVS
Sbjct: 361 PYGRDARTPSREGWEGNFQKNNPIPKDRFGSDSGNDRNDIAGRPTSIDRETNADNMHVSQ 420
Query: 421 FQEHAHKDGRKDTGFGQTGRQTWNRATESYSSQEPDRTTRDKYGSEQHNRYKGEIHNTSV 480
F+EHA K GR+DTGF GRQTWN A+ESY+SQ+PD T +DK+GSEQHN+++G+ HNTSV
Sbjct: 421 FREHAPKVGRRDTGF---GRQTWNSASESYNSQDPDWTVKDKHGSEQHNKFRGQTHNTSV 480
Query: 481 ANSSYYSGSKQIPTDEPLLNFGKDRRSFTKNEKPFMEDPFMKDFGA------DPFTAGLV 540
+NSSY G K+IP D+ LLNFG+DRRSF K EKP+MEDPFMKDFG DP+T GLV
Sbjct: 481 SNSSYSPGLKRIPADDLLLNFGRDRRSFAKIEKPYMEDPFMKDFGGSSFDGRDPYTGGLV 540
Query: 541 GMFK-KKDVIKHTDFHDPVRESFEAELERVQQMQEQERQRIIEEQERALELSRREEEERK 600
G+ K KKDVIK TDFHDPVR+SFEAELERVQQ+QEQERQRIIEEQERALEL+RREEEER+
Sbjct: 541 GVVKRKKDVIKQTDFHDPVRDSFEAELERVQQIQEQERQRIIEEQERALELARREEEERQ 600
Query: 601 RIAREHEERQRRAEEEAREVAWRAEQERLENIQKAEELRIARERDKQRIILEEERRKQAA 660
R+ARE EERQRRAEE ARE AWRAEQERLE +QKAEELRIARE +KQRI +EEERRKQAA
Sbjct: 601 RLAREQEERQRRAEEIAREAAWRAEQERLEAVQKAEELRIAREEEKQRIFVEEERRKQAA 660
Query: 661 KLKLLEVEERMAKRQAEAVKSSSLTSDIPEKNISNVAKDVSRVADTIDWEDGEKMVERIT 720
KLKLLE+EERMAKRQAEAVKSS+LT+DIPEK IS+V KD SR+ADT+DWEDGEKMVERIT
Sbjct: 661 KLKLLELEERMAKRQAEAVKSSTLTTDIPEKKISSVVKDASRLADTVDWEDGEKMVERIT 720
Query: 721 TSASSESSSMVRSSEVGPRSQFSRDASSAFVDRGKLVNSYRRDLYERRSGSQFVLQGQSI 780
TSASSESSS+ R SEVG R+Q S D S +F DRGK VNS+RRD Y+R SGSQFVLQ QS
Sbjct: 721 TSASSESSSINRPSEVGLRTQVSSDGSPSFGDRGKSVNSWRRDFYDRGSGSQFVLQDQST 780
Query: 781 GYNSQRQEPFVGGQLSSRKEFYGGAGFTTSRISHGRGITKPQSDDYSELRVQRPNLSGSG 840
GY R+E GG++SSRKEFYGGAG TTSRI + RG+ +PQSDDYS+LR QRPNLSG G
Sbjct: 781 GYPGPRREATTGGRVSSRKEFYGGAGLTTSRIYNRRGMAEPQSDDYSQLRGQRPNLSGGG 840
Query: 841 DHYNKSQEFDSELQDSFENFGDHGWRQEGGHNNVYFPYPERVNSISEPDGSYYVGRTRYS 900
D YN+SQEFDSE QD+ ENFGDHGWRQEGG NN YFPYPERVN ISE DGSY VGR+RYS
Sbjct: 841 DQYNRSQEFDSEFQDNVENFGDHGWRQEGGRNNFYFPYPERVNPISEADGSYSVGRSRYS 900
Query: 901 QKQPRVLPPPSVALMQKSSIRGEYESVTRD--------THPVRNVSTAQARYIHHENCTL 960
Q+QPRVLPPPSVA +QKSS+RGE+ SVTRD H RNVST+Q RYIHHEN TL
Sbjct: 901 QRQPRVLPPPSVASIQKSSVRGEFTSVTRDIAESEIQYDHLARNVSTSQTRYIHHENRTL 960
Query: 961 SKIIDVNFENVENEEQKPDGGITLRCDSQSTLSVFSPPISPTHLSHEDLDDSGDSPVLSA 1020
+IIDVN EN ENEEQKPDG TLRCDSQSTLSVFSPP SPTHLSHEDLDDSGDSPVLSA
Sbjct: 961 PEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSA 1020
Query: 1021 SREGTLSIEDNESAVPAKASKVIMMTSTRVSTGDEDDWVVADEHVQEQEEYDEDDDGYQE 1080
SREGTLSIEDNESAVPAKA K IM+TSTR STGDED+W V DEHVQEQEEYDEDDDGY+E
Sbjct: 1021 SREGTLSIEDNESAVPAKAGKEIMITSTRASTGDEDEWGVVDEHVQEQEEYDEDDDGYRE 1080
Query: 1081 EDEVHEGEDENIDLEQDFDDLQLNDTDRGSPHMLDNLVLGFNEGVEVVMPNDEFERIPGD 1140
EDEVHEGEDENIDL Q+FDDL L+ D+GSPHMLDNLVLGFNEGVEV MPNDEFERI G+
Sbjct: 1081 EDEVHEGEDENIDLAQNFDDLHLD--DKGSPHMLDNLVLGFNEGVEVGMPNDEFERILGN 1140
Query: 1141 EENMYVATEISSCIKEEQGSSEGLQVDGKVCQYVDASSQVRISDLEEMQDKVMQSKIAQA 1200
EENM+VA EIS+CI+EE GSSEGLQVDGKVCQY DASSQ+RI D EEMQD VMQS+ AQA
Sbjct: 1141 EENMFVAPEISNCIREELGSSEGLQVDGKVCQYEDASSQIRI-DPEEMQDLVMQSETAQA 1200
Query: 1201 LPELEITEQGN-SCRSSLSVQQPISCSVTMGSQSSSGQVIVPNAVLSGQAEPPFKLQFGL 1260
LPE EI EQGN SCRSS+SVQQPIS SV SQSSSGQVIVPNA SGQAEPP KLQFGL
Sbjct: 1201 LPEPEINEQGNSSCRSSVSVQQPISSSVLTASQSSSGQVIVPNAAGSGQAEPPVKLQFGL 1260
Query: 1261 FSGPSLIPSHVPSIQIGSIQMPLHLHPQITPSIIQMHSSQPPLFQFGQLRYPSSVPQGVL 1320
FSGPSLIPS VP+IQIGSIQMPLHLHPQ+TPS+ MHSSQPPLFQFGQLRY SSV QGVL
Sbjct: 1261 FSGPSLIPSPVPAIQIGSIQMPLHLHPQMTPSMTHMHSSQPPLFQFGQLRYTSSVSQGVL 1320
Query: 1321 PLPPQPPTFILPIVQTGFPSNENPREALSVQTPEETCFNKSRKHNVFPFMKDNQV-LVSR 1380
PL PQP TF+ P VQTGFP N+NP +AL +QT +ETC + SRK++V P + DNQ LVSR
Sbjct: 1321 PLAPQPLTFVPPAVQTGFPLNKNPGDALPIQTSQETCAHNSRKNDVLPLLMDNQQGLVSR 1380
Query: 1381 SLNVNSSGDSKSLPLTESIEAKVMNQQDQTSSSCIDESNSRSQPGFQAEHQRHHVPTSGN 1440
SLNVNSSG+SKSLPLTESIE++VM QQ QT+ SCIDESNSRS+PGFQAEHQRHHV TS N
Sbjct: 1381 SLNVNSSGESKSLPLTESIESQVMAQQYQTAGSCIDESNSRSEPGFQAEHQRHHVSTSDN 1440
Query: 1441 HYMVSRGKESKGRAQDGMWPFDSVSRDKGLRRFKTRGLFPGGRGKKFIFAVKNSGSRLPF 1500
HY+VSRGKES+GRAQDGM DSVSRDKGL K RG FPGGRGKK++F VKNSGSRLPF
Sbjct: 1441 HYVVSRGKESEGRAQDGMGSLDSVSRDKGLSGLKARGQFPGGRGKKYVFTVKNSGSRLPF 1500
Query: 1501 AGSESTRLDNGGLQRQPRRNTPRTEFRVRETVDKKLPNSQVSSNYVEVDDKPTDSGISAV 1560
GSESTRLD GG QR+PRRN PRTEFRVRETVDKKL +SQVSSN+VEVDDKPT SG +AV
Sbjct: 1501 PGSESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSSSQVSSNHVEVDDKPTVSGRTAV 1560
Query: 1561 NSGRNGTRKVVISNKPSKRALESEGFSSVVSTSLELDFGNRSEKGVKKDYLGKSQGNQYS 1620
NS RNGTRKV +SNKPSKRALE EG SS STSLELD GNRSEKGVKK+YLGKSQG+QY
Sbjct: 1561 NSARNGTRKVFVSNKPSKRALEPEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQGSQYY 1620
Query: 1621 GEGIFRKNIGSWEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREK 1680
GE FRKNI S EDVDAP+QSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREK
Sbjct: 1621 GESNFRKNICSGEDVDAPMQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREK 1680
Query: 1681 EIKAKSHNSKIPPKSRSTLKSALSSVNSRKVYAAKEAEMVKRTRLDFATRD--GRGSGSI 1740
EIKAKSHNSKIP KSRST K ALSSVNS KVYAAK AE VKRTR DF D GRGSG+I
Sbjct: 1681 EIKAKSHNSKIPRKSRSTSKIALSSVNSSKVYAAKVAETVKRTRSDFVAADGGGRGSGNI 1740
Query: 1741 MVSSAFSSPLVSQPLAPIGTPALKSDSQMERLHTARPILTSTPALATSNGRNLESGLMFG 1800
+VSSA SS +VSQPLAPIGTPALKSDSQ ER HTAR I TS PALATS+GRNLES LMF
Sbjct: 1741 VVSSALSSSIVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLESSLMFD 1800
Query: 1801 KKNDILDNVQTSLTSWGNSHRNQQVMALTQTQLDEAMKPAQFDLHPPV-DHSSLTGDPFI 1860
KKNDILDNV +S SWGNS NQQVMALTQTQLDEAMKPAQFDLHPPV DHSSL GDP +
Sbjct: 1801 KKNDILDNVPSSFPSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSSLAGDPTV 1860
Query: 1861 VSSPSILASDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCATFLGIGPTDLCHSD 1920
SS SILA DRSFSSAANPISSLLAGEKIQFGAVTSPTVLPP SC+T LGIGPT LCHSD
Sbjct: 1861 PSS-SILAIDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPDSCSTLLGIGPTGLCHSD 1920
Query: 1921 IQIPHKLSGAENDCNLFFDKEKHYSESCSPIENSEAEAEAAASAVAVAAISSDEILVTNG 1980
+QIPHKLSGAENDC+LFF+KEKH+SES + IE+SEAEAEAAASAVAVAAISSDEI VTNG
Sbjct: 1921 MQIPHKLSGAENDCHLFFEKEKHHSESRTRIEDSEAEAEAAASAVAVAAISSDEI-VTNG 1980
Query: 1981 LDTCSVSVTDTNNIGGGDVDIVTAGTAGDQQLASKTRADDSLTVTLPADLSVETPPISLW 2040
L T S VTDTNN GGGD++++ AG+AG+QQ ASKTRADDSLTV LPADLSVETPPISLW
Sbjct: 1981 LGTSSGPVTDTNNFGGGDINVIIAGSAGNQQFASKTRADDSLTVALPADLSVETPPISLW 2040
Query: 2041 PTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQKCSA 2100
P+LPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQK SA
Sbjct: 2041 PSLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQKSSA 2100
Query: 2101 PAPGPLGSWKQCHSGVDSFYGPPASFTSPFVSPGGIPGVQGPPHMVVYNHFAPVGQFGQV 2160
PAPGPLGSWKQCHSGVDSFYGPPA FT PF+SPGGIPGVQGPPHMVVYNHFAPVGQFGQV
Sbjct: 2101 PAPGPLGSWKQCHSGVDSFYGPPAGFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQFGQV 2160
Query: 2161 GLSFMGATYIPSGKQHDWKHSPGPSLCVE-DQKNLNMVSAQRMPTNLPPIQHLAPGSPLL 2220
GLSFMGATYIPSGKQ DWKHSPGPSL VE DQKNLNMVSAQRMPTNLPPIQHLAPGSPLL
Sbjct: 2161 GLSFMGATYIPSGKQPDWKHSPGPSLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPGSPLL 2220
Query: 2221 PMASPLAMFDVSPFQASPEISAQTRWPSSASTVQAVPLSMPLQHQAGGVLPSHFSHPSSA 2280
PMASPLAMFDVSPFQASPE+S Q RWPSSAS+VQ VPLSMPLQ QA G+LPSHFSH SSA
Sbjct: 2221 PMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPLQQQAEGILPSHFSHASSA 2280
Query: 2281 DPSFTVNRFPGSQP---ADQKRNFPVGADASVTQLPDELEIVDESSCLSSGAGVPNVDTN 2340
DPSFTVNRFPGSQP +D KRN+ V ADA+VTQLPDEL IVD SSC+SSG VPNVD
Sbjct: 2281 DPSFTVNRFPGSQPSVASDHKRNYTVAADATVTQLPDELGIVDASSCVSSGGSVPNVDIK 2340
Query: 2341 SLVVNSATDAGKTSVRPNCSSNNSGQNASTNLKSQYP-HKGISAHQYTHSSGYNYQRGGA 2400
SL VNS TDAGKT V+ NCSS+NS NA TNLKSQ P HKGI A QY+HSSGYNYQRGGA
Sbjct: 2341 SLSVNSVTDAGKTGVQ-NCSSSNSSLNAGTNLKSQSPQHKGIPAQQYSHSSGYNYQRGGA 2400
Query: 2401 SQKNSSGGSEWAHRRTRFMGRNQSGAEKNFSSGKVKQIYVAKQSSSRNL 2416
SQKNSSGGSEW HRRT FMGRNQSGAEKNFSS K+KQIYVAKQ SS NL
Sbjct: 2401 SQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSSGNL 2440
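The percentages on each HSP statistics line are the identical (or positive-scoring) positions divided by the alignment length, which includes gap columns. The following is a minimal sketch, not part of the database output, that parses such a line and recomputes the percentages. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the regular expression assumes the line layout shown in this report (it accepts either spelling of "Positives").

```python
import re

# Pattern for the per-HSP statistics line as laid out in this report.
# "Posi?tives" also accepts the misspelled form "Postives".
STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one statistics line."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identities, length, _, positives, pos_length, _ = m.groups()
    identities, length = int(identities), int(length)
    positives, pos_length = int(positives), int(pos_length)
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": round(100.0 * identities / length, 2),
        "positive_pct": round(100.0 * positives / pos_length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 1982/2449 (80.93%), Positives = 2137/2449 (87.26%), Query Frame = 0"
)
print(stats)
```

Recomputing from the counts rather than trusting the printed percentage gives a quick consistency check when results are copied between tools.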
BLAST of Sed0025145 vs. ExPASy TrEMBL
Match:
A0A6J1GDR0 (uncharacterized protein LOC111453246 OS=Cucurbita moschata OX=3662 GN=LOC111453246 PE=4 SV=1)
HSP 1 Score: 3684.8 bits (9554), Expect = 0.0e+00
Identity = 1985/2449 (81.05%), Positives = 2138/2449 (87.30%), Query Frame = 0
Query: 1 MANPGVGSKFVSVNLNKSYGQAHH----HSNSYGSNRTRPGTHGAGGGMVVLSRPRSSQK 60
MANPGVG+KFVSVNLNKSYGQAHH HSNSYGSNRTRPG+HGAGGGMVVLSRPRSSQK
Sbjct: 1 MANPGVGAKFVSVNLNKSYGQAHHHHSSHSNSYGSNRTRPGSHGAGGGMVVLSRPRSSQK 60
Query: 61 PGPKLSVPPPLNLPSLRKEHQRLDSLGSSAGPTGGGVMGNGQRPTSAGIGWTKPRTNDFP 120
PGPKLSVPPPLNLPSLRKEH+RLDSLGS AG TGGGV+GN QRPTSAG+GWTKP TND P
Sbjct: 61 PGPKLSVPPPLNLPSLRKEHERLDSLGSGAGQTGGGVLGNRQRPTSAGLGWTKPHTNDLP 120
Query: 121 DKEALNGNVVDRIDPSLRSVDGVSGGSSVYMPPSARAGITQPDCVSSCFLSVHTTLEKTP 180
+KE L+GN+VD+IDPSLRSVDGV+GGSSVYMPPSARA P +S VHT +EK P
Sbjct: 121 EKEGLSGNIVDKIDPSLRSVDGVNGGSSVYMPPSARASTAGPVVSTSASSQVHTAVEKAP 180
Query: 181 ILGGEDFPSLQATLPSAVAPPQKQKDGMNSKLKHASEGSYEERRDTSHLSSSIDARTKYR 240
+L GEDFPSLQATLPSA AP QKQ+DG++SKLKHA+E SYEE+RDTSHLSSSIDAR+K++
Sbjct: 181 VLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEVSYEEQRDTSHLSSSIDARSKFQ 240
Query: 241 SSLKSPPRENSKNGDFFS-----SPEFSRKQEDIFPDPLPLVSVNPRSDWADDERDTSQG 300
SS KS P EN+KNG+ FS SPE SRKQEDIFP PLPLVS+NPRSDWADDERDTS G
Sbjct: 241 SSKKSIPSENAKNGNSFSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERDTSHG 300
Query: 301 LAGMGRDRGHPKSEAYWERDFDMPRVSSLPHKPIPNFSQKWNMRDNESGKFRSSDIRKVD 360
L RD GHPKSEAYWERDFDMP VSSLPHKPI NFSQ+W+ RD+ESGKF SSDI KVD
Sbjct: 301 LIDRVRDHGHPKSEAYWERDFDMPWVSSLPHKPIHNFSQRWHPRDDESGKFHSSDIHKVD 360
Query: 361 PYGRDARTPSREGWEGSFQKNNPIPKDGFHSDSGNPRNDIAARPTSIDREANAGNMHVSH 420
PYGRD RTPSREGWEG+FQKNNPIPKD F SDSGN RNDIA RPTSIDRE NA NMHVS
Sbjct: 361 PYGRDTRTPSREGWEGNFQKNNPIPKDRFGSDSGNDRNDIAGRPTSIDRETNADNMHVSQ 420
Query: 421 FQEHAHKDGRKDTGFGQTGRQTWNRATESYSSQEPDRTTRDKYGSEQHNRYKGEIHNTSV 480
F+EHA K GR+DTGF GRQTWN A+ESY+SQ+PD T +DK+GSEQHN+++G+ HNTSV
Sbjct: 421 FREHAPKVGRRDTGF---GRQTWNSASESYNSQDPDWTVKDKHGSEQHNKFRGQTHNTSV 480
Query: 481 ANSSYYSGSKQIPTDEPLLNFGKDRRSFTKNEKPFMEDPFMKDFGA------DPFTAGLV 540
+NSSY G K+IP D+ LLNFG+DRRSF K EKP+MEDPFMKDFG DP+T GLV
Sbjct: 481 SNSSYSPGLKRIPADDLLLNFGRDRRSFAKIEKPYMEDPFMKDFGGSSFDGRDPYTGGLV 540
Query: 541 GMFK-KKDVIKHTDFHDPVRESFEAELERVQQMQEQERQRIIEEQERALELSRREEEERK 600
G+ K KKDVIK TDFHDPVR+SFEAELERVQQ+QEQERQRIIEEQERALEL+RREEEER+
Sbjct: 541 GVVKRKKDVIKQTDFHDPVRDSFEAELERVQQIQEQERQRIIEEQERALELARREEEERQ 600
Query: 601 RIAREHEERQRRAEEEAREVAWRAEQERLENIQKAEELRIARERDKQRIILEEERRKQAA 660
R+ARE EERQRRAEE ARE AWRAEQERLE IQKAEELRIARE +KQRI +EEERRKQAA
Sbjct: 601 RLAREQEERQRRAEEIAREAAWRAEQERLEAIQKAEELRIAREEEKQRIFVEEERRKQAA 660
Query: 661 KLKLLEVEERMAKRQAEAVKSSSLTSDIPEKNISNVAKDVSRVADTIDWEDGEKMVERIT 720
KLKLLE+EERMAKRQAEAVKSS+LTSDIPEK IS+V KD SR+ADT+DWEDGEKMVERIT
Sbjct: 661 KLKLLELEERMAKRQAEAVKSSTLTSDIPEKKISSVVKDASRLADTVDWEDGEKMVERIT 720
Query: 721 TSASSESSSMVRSSEVGPRSQFSRDASSAFVDRGKLVNSYRRDLYERRSGSQFVLQGQSI 780
TSASSESSS+ R SEVG R+Q SRD S +FVDRGK VNS+RRD Y+R SGSQFVLQ QS
Sbjct: 721 TSASSESSSINRPSEVGLRTQVSRDGSPSFVDRGKSVNSWRRDFYDRGSGSQFVLQDQST 780
Query: 781 GYNSQRQEPFVGGQLSSRKEFYGGAGFTTSRISHGRGITKPQSDDYSELRVQRPNLSGSG 840
GY R+E GG++SSRKEFYGGAG TSRI + RG+T+PQSDDYS+LR QRPNLSG G
Sbjct: 781 GYTGPRREATTGGRVSSRKEFYGGAGLATSRIYNRRGMTEPQSDDYSQLRGQRPNLSGGG 840
Query: 841 DHYNKSQEFDSELQDSFENFGDHGWRQEGGHNNVYFPYPERVNSISEPDGSYYVGRTRYS 900
D YN+SQEFDSE QD+ ENFGDHGWRQEGG NN YFPYPERVN ISE DGSY VGR+RYS
Sbjct: 841 DQYNRSQEFDSEFQDNVENFGDHGWRQEGGRNNFYFPYPERVNPISEADGSYSVGRSRYS 900
Query: 901 QKQPRVLPPPSVALMQKSSIRGEYESVTRD--------THPVRNVSTAQARYIHHENCTL 960
Q+QPRVLPPPSVA +QKSS+RGE+ SVTRD H RNVSTAQ RYIHHEN TL
Sbjct: 901 QRQPRVLPPPSVASIQKSSVRGEFTSVTRDIAESEIQYDHLARNVSTAQTRYIHHENRTL 960
Query: 961 SKIIDVNFENVENEEQKPDGGITLRCDSQSTLSVFSPPISPTHLSHEDLDDSGDSPVLSA 1020
+IIDVN EN ENEEQKPDG TLRCDSQSTLSVFSPP SPTHLSHEDLDDSGDSPVLSA
Sbjct: 961 PEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSA 1020
Query: 1021 SREGTLSIEDNESAVPAKASKVIMMTSTRVSTGDEDDWVVADEHVQEQEEYDEDDDGYQE 1080
SREGTLSIEDNESAVPAKA K IM+TSTR STGDED+W V DEHVQEQEEYDEDDDGY+E
Sbjct: 1021 SREGTLSIEDNESAVPAKAGKEIMITSTRASTGDEDEWGVVDEHVQEQEEYDEDDDGYRE 1080
Query: 1081 EDEVHEGEDENIDLEQDFDDLQLNDTDRGSPHMLDNLVLGFNEGVEVVMPNDEFERIPGD 1140
EDEVHEGEDENIDL Q+FDDL L+ D+GSPHMLDNLVLGFNEGVEV MPNDEFERI G+
Sbjct: 1081 EDEVHEGEDENIDLAQNFDDLHLD--DKGSPHMLDNLVLGFNEGVEVGMPNDEFERILGN 1140
Query: 1141 EENMYVATEISSCIKEEQGSSEGLQVDGKVCQYVDASSQVRISDLEEMQDKVMQSKIAQA 1200
EENM+VA E+S+CI+EEQGSSEGLQVDGKVCQY DASSQ+RI D EEMQD VMQS+ AQA
Sbjct: 1141 EENMFVAPEVSNCIREEQGSSEGLQVDGKVCQYEDASSQIRI-DPEEMQDLVMQSETAQA 1200
Query: 1201 LPELEITEQGN-SCRSSLSVQQPISCSVTMGSQSSSGQVIVPNAVLSGQAEPPFKLQFGL 1260
LPE EI EQGN SCRSS+SVQQPIS SV+ SQSSSGQVIVPNA SGQAEPP KLQFGL
Sbjct: 1201 LPEPEINEQGNSSCRSSVSVQQPISSSVSTASQSSSGQVIVPNAAGSGQAEPPVKLQFGL 1260
Query: 1261 FSGPSLIPSHVPSIQIGSIQMPLHLHPQITPSIIQMHSSQPPLFQFGQLRYPSSVPQGVL 1320
FSGPSLIPS VP+IQIGSIQMPLHLHPQ+TPS+ MHSSQPPLFQFGQLRY SSV QGVL
Sbjct: 1261 FSGPSLIPSPVPAIQIGSIQMPLHLHPQMTPSMTHMHSSQPPLFQFGQLRYTSSVSQGVL 1320
Query: 1321 PLPPQPPTFILPIVQTGFPSNENPREALSVQTPEETCFNKSRKHNVFPFMKDNQV-LVSR 1380
PL PQP TF+ P VQTGFP N+NP +AL +QT +ETC + SRK++V P + DNQ LVSR
Sbjct: 1321 PLAPQPLTFVPPAVQTGFPLNKNPGDALPIQTSQETCAHNSRKNDVLPLLMDNQQGLVSR 1380
Query: 1381 SLNVNSSGDSKSLPLTESIEAKVMNQQDQTSSSCIDESNSRSQPGFQAEHQRHHVPTSGN 1440
SLNVNSSG+SKSLPLTESIE++VM QQ QT+ SCIDESNSRS+PGFQAEHQRHHV TS N
Sbjct: 1381 SLNVNSSGESKSLPLTESIESQVMAQQYQTAGSCIDESNSRSEPGFQAEHQRHHVSTSDN 1440
Query: 1441 HYMVSRGKESKGRAQDGMWPFDSVSRDKGLRRFKTRGLFPGGRGKKFIFAVKNSGSRLPF 1500
HY+VSRGKES+GRAQDGM DSVSRDKGL K RG FPGGRGKK++F VKNSGSRLPF
Sbjct: 1441 HYVVSRGKESEGRAQDGMGSLDSVSRDKGLSGLKARGQFPGGRGKKYVFTVKNSGSRLPF 1500
Query: 1501 AGSESTRLDNGGLQRQPRRNTPRTEFRVRETVDKKLPNSQVSSNYVEVDDKPTDSGISAV 1560
GSESTRLD GG QR+PRRN PRTEFRVRETVDKKL +SQVSSN+VEVDDKPT SG +AV
Sbjct: 1501 PGSESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSSSQVSSNHVEVDDKPTVSGRTAV 1560
Query: 1561 NSGRNGTRKVVISNKPSKRALESEGFSSVVSTSLELDFGNRSEKGVKKDYLGKSQGNQYS 1620
NS RNGTRKV +SNKPSKRALE EG SS STSLELD GNRSEKGVKK+YLGKSQG+QY
Sbjct: 1561 NSARNGTRKVFVSNKPSKRALEPEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQGSQYY 1620
Query: 1621 GEGIFRKNIGSWEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREK 1680
GE FRKNI S EDVDAP+QSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREK
Sbjct: 1621 GESNFRKNICSGEDVDAPMQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREK 1680
Query: 1681 EIKAKSHNSKIPPKSRSTLKSALSSVNSRKVYAAKEAEMVKRTRLDFATRD--GRGSGSI 1740
EIKAKSHNSKIP KSRST K ALSSVNS KVYAAK AE VKRTR DF D GRGSG+I
Sbjct: 1681 EIKAKSHNSKIPRKSRSTSKIALSSVNSSKVYAAKVAETVKRTRSDFVAADGGGRGSGNI 1740
Query: 1741 MVSSAFSSPLVSQPLAPIGTPALKSDSQMERLHTARPILTSTPALATSNGRNLESGLMFG 1800
+VSSA SS +VSQPLAPIGTPALKSDSQ ER HTAR I TS PALATS+GRNLES LMF
Sbjct: 1741 VVSSALSSSIVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLESSLMFD 1800
Query: 1801 KKNDILDNVQTSLTSWGNSHRNQQVMALTQTQLDEAMKPAQFDLHPPV-DHSSLTGDPFI 1860
KKNDILDNV +S SWGNS NQQVMALTQTQLDEAMKPAQFDLHPPV DHSSL GDP +
Sbjct: 1801 KKNDILDNVTSSFPSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSSLAGDPNV 1860
Query: 1861 VSSPSILASDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCATFLGIGPTDLCHSD 1920
SS SILA DRSFSSAANPISSLLAGEKIQFGAVTSPTVLPP SC+T LGIGPT LCHSD
Sbjct: 1861 PSS-SILAIDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPDSCSTLLGIGPTGLCHSD 1920
Query: 1921 IQIPHKLSGAENDCNLFFDKEKHYSESCSPIENSEAEAEAAASAVAVAAISSDEILVTNG 1980
+QIPHKLSGAENDC+LFF+KEKH+SES + IE+SEAEAEAAASAVAVAAISSDEI VTNG
Sbjct: 1921 MQIPHKLSGAENDCHLFFEKEKHHSESRTRIEDSEAEAEAAASAVAVAAISSDEI-VTNG 1980
Query: 1981 LDTCSVSVTDTNNIGGGDVDIVTAGTAGDQQLASKTRADDSLTVTLPADLSVETPPISLW 2040
L T SV VTDTNN GGGD++++ AG+AG+QQ ASKTRADDSLTV LPADLSVETPPISLW
Sbjct: 1981 LGTSSVPVTDTNNFGGGDINVIIAGSAGNQQFASKTRADDSLTVALPADLSVETPPISLW 2040
Query: 2041 PTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQKCSA 2100
P+LPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQK SA
Sbjct: 2041 PSLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQKSSA 2100
Query: 2101 PAPGPLGSWKQCHSGVDSFYGPPASFTSPFVSPGGIPGVQGPPHMVVYNHFAPVGQFGQV 2160
PAPGPLGSWKQCHSGVDSFYGPPA FT PF+SPGGIPGVQGPPHMVVYNHFAPVGQFGQV
Sbjct: 2101 PAPGPLGSWKQCHSGVDSFYGPPAGFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQFGQV 2160
Query: 2161 GLSFMGATYIPSGKQHDWKHSPGPSLCVE-DQKNLNMVSAQRMPTNLPPIQHLAPGSPLL 2220
GLSFMGATYIPSGKQ DWKHSPGPSL VE DQKNLNMVSAQRMPTNLPPIQHLAPGSPLL
Sbjct: 2161 GLSFMGATYIPSGKQPDWKHSPGPSLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPGSPLL 2220
Query: 2221 PMASPLAMFDVSPFQASPEISAQTRWPSSASTVQAVPLSMPLQHQAGGVLPSHFSHPSSA 2280
PMASPLAMFDVSPFQASPE+S Q RWPSSAS+VQ VPLSMPLQ QA G+LPSHFSH SSA
Sbjct: 2221 PMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPLQQQAEGILPSHFSHASSA 2280
Query: 2281 DPSFTVNRFPGSQP---ADQKRNFPVGADASVTQLPDELEIVDESSCLSSGAGVPNVDTN 2340
DPSFTVNRFPGSQP +D KRN+ V ADA+VTQLPDEL IVD SSC+SSG VPNVD
Sbjct: 2281 DPSFTVNRFPGSQPSVASDHKRNYTVAADATVTQLPDELGIVDASSCVSSGGSVPNVDIK 2340
Query: 2341 SLVVNSATDAGKTSVRPNCSSNNSGQNASTNLKSQYP-HKGISAHQYTHSSGYNYQRGGA 2400
SL VNS TDAGKT NCSS+NS NA TNLKSQ P HKGI A QY+HSSGYNYQRGGA
Sbjct: 2341 SLSVNSVTDAGKTV--QNCSSSNSSLNAGTNLKSQSPQHKGIPAQQYSHSSGYNYQRGGA 2400
Query: 2401 SQKNSSGGSEWAHRRTRFMGRNQSGAEKNFSSGKVKQIYVAKQSSSRNL 2416
SQKNSSGGSEW HRRT FMGRNQSGAEKNFSS K+KQIYVAKQ SS NL
Sbjct: 2401 SQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSSGNL 2439
BLAST of Sed0025145 vs. ExPASy TrEMBL
Match:
A0A6J1BUX9 (uncharacterized protein LOC111005936 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111005936 PE=4 SV=1)
HSP 1 Score: 3682.9 bits (9549), Expect = 0.0e+00
Identity = 1981/2452 (80.79%), Positives = 2135/2452 (87.07%), Query Frame = 0
Query: 1 MANPGVGSKFVSVNLNKSYGQAHH----HSNSYGSNRTRPGTHGAGGGMVVLSRPRSSQK 60
MANPGVG+KFVSVNLNKSYGQ HH H NSYGSNRTRPG+HGAGGGMVVLSRPRSSQK
Sbjct: 1 MANPGVGTKFVSVNLNKSYGQPHHHHSSHPNSYGSNRTRPGSHGAGGGMVVLSRPRSSQK 60
Query: 61 PGPKLSVPPPLNLPSLRKEHQRLDSLGSSAGPTGGGVMGNGQRPTSAGIGWTKPRTNDFP 120
PGPKLSVPPPLNLPSLRKEH+RLDSLGS AGP GGGV+GNGQRPTSAG+GWTKPRTND P
Sbjct: 61 PGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLGNGQRPTSAGMGWTKPRTNDLP 120
Query: 121 DKEALNGNVVDRIDPSLRSVDGVSGGSSVYMPPSARAGITQPDCVSSCFLSVHTTLEKTP 180
+KE L+ N+ DRIDPSLR+VDG SGGSSVYMPPSARAG+T P +S V+ +EK P
Sbjct: 121 EKEGLSSNMADRIDPSLRTVDGASGGSSVYMPPSARAGMTGPVVTTSASSQVYAAVEKAP 180
Query: 181 ILGGEDFPSLQATLPSAVAPPQKQKDGMNSKLKHASEGSYEERRDTSHLSSSIDARTKYR 240
+L GEDFPSLQATLPSA P QK KDG +SKLK A+EGSYEE+RDTSHLSSSIDAR K++
Sbjct: 181 VLRGEDFPSLQATLPSAAGPSQKLKDGPSSKLKRAAEGSYEEQRDTSHLSSSIDARPKFQ 240
Query: 241 SSLKSPPRENSKNGDFFS-----SPEFSRKQEDIFPDPLPLVSVNPRSDWADDERDTSQG 300
S+ K P EN+K GD FS S E SRKQED+FP PLPLVS+NPRSDWADDERDTS G
Sbjct: 241 SAQKGLPSENAKKGDTFSLGSFQSSESSRKQEDLFPGPLPLVSMNPRSDWADDERDTSHG 300
Query: 301 LAGMGRDRGHPKSEAYWERDFDMPRVSSLPHKPIPNFSQKWNMRDNESGKFRSSDIRKVD 360
L GRDRGHPKSEAYWERDFDMPRVS+LPHKPIPNFSQ+WN+RD+ESGKF S+DI KVD
Sbjct: 301 LIDRGRDRGHPKSEAYWERDFDMPRVSALPHKPIPNFSQRWNLRDDESGKFHSNDIHKVD 360
Query: 361 PYGRDARTPSREGWEGSFQKNNPIPKDGFHSDSGNPRNDIAARPTSIDREANAGNMHVSH 420
PYGRDARTPSREGWEG+F++N PIPKDGF SDS N RNDIAARPT+IDRE NA +MHVSH
Sbjct: 361 PYGRDARTPSREGWEGNFRRNTPIPKDGFGSDSRNDRNDIAARPTNIDRETNANSMHVSH 420
Query: 421 FQEHAHKD-GRKDTGFGQTGRQTWNRATESYSSQEPDRTTRDKYGSEQHNRYKGEIHNTS 480
F+EHAHKD GR+DTG+GQ GRQTWN A ESYSSQEPDR RDKYGSEQHNRY+GE HNTS
Sbjct: 421 FREHAHKDPGRRDTGYGQMGRQTWNSAAESYSSQEPDRVIRDKYGSEQHNRYRGETHNTS 480
Query: 481 VANSSYYSGSKQIPTDEPLLNFGKDRRSFTKNEKPFMEDPFMKDFGA------DPFTAGL 540
VANSSY SG K+IP DEPLLNFG++RRSF K EKP+MEDPFMKDFGA DPF G+
Sbjct: 481 VANSSYSSGLKRIPADEPLLNFGRERRSFAKIEKPYMEDPFMKDFGASGFDGRDPFATGI 540
Query: 541 VGMFK-KKDVIKHTDFHDPVRESFEAELERVQQMQEQERQRIIEEQERALELSRREEEER 600
VG+ K KKDVIK TDFHDPVRESFEAELERVQQ+QEQERQRIIEEQERALEL+RREEEER
Sbjct: 541 VGVVKRKKDVIKQTDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREEEER 600
Query: 601 KRIAREHEERQRRAEEEAREVAWRAEQERLENIQKAEELRIARERDKQRIILEEERRKQA 660
K +AREHEERQRRAEEEARE AWRAEQERLE IQKAEELRIARE +KQRIILEEERRKQA
Sbjct: 601 KTLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRIILEEERRKQA 660
Query: 661 AKLKLLEVEERMAKRQAEAVKSSSLTSDIPEKNISNVAKDVSRVADTIDWEDGEKMVERI 720
AKL LLE+EERMAKRQAE VKSS+ TSDIPEK I V KDVSR+AD +DWEDGEKMVERI
Sbjct: 661 AKLMLLELEERMAKRQAETVKSSTSTSDIPEKKIPGVVKDVSRLADAVDWEDGEKMVERI 720
Query: 721 TTSASSESSSMVRSSEVGPRSQFSRDASSAFVDRGKLVNSYRRDLYERRSGSQFVLQGQS 780
TTSASSESSS+ RSSEVG RSQFSRDAS AFVDRGK VNS+RRD YER SGSQFV+Q QS
Sbjct: 721 TTSASSESSSINRSSEVGLRSQFSRDASPAFVDRGKSVNSWRRDFYERGSGSQFVVQDQS 780
Query: 781 IGYNSQRQEPFVGGQLSSRKEFYGGAGFTTSRISHGRGITKPQSDDYSELRVQRPNLSGS 840
GYN R+E +GG+ +SRKEFYGGAGFTTSRISH RGIT+PQSDDYS+LR RPNLSG
Sbjct: 781 TGYNGPRREAPIGGRATSRKEFYGGAGFTTSRISHRRGITEPQSDDYSQLRGHRPNLSGG 840
Query: 841 GDHYNKSQEFDSELQDSFENFGDHGWRQEGGHNNVYFPYPERVNSISEPDGSYYVGRTRY 900
GDHY++S +FDSE QD+ ENFGDHGWRQE G NN YFPYPERVN ISE DGSY VGR+RY
Sbjct: 841 GDHYSQSPDFDSEFQDNVENFGDHGWRQETGRNNFYFPYPERVNPISETDGSYSVGRSRY 900
Query: 901 SQKQPRVLPPPSVALMQKSSIRGEYESVTRD--------THPVRNVSTAQARYIHHENCT 960
SQ+QPRVLPPPSVA +QKSS+RGEYESV RD HP RNVSTAQ YIHHEN +
Sbjct: 901 SQRQPRVLPPPSVASIQKSSVRGEYESVPRDIVESEIQYDHPARNVSTAQTGYIHHENRS 960
Query: 961 LSKIIDVNFENVENEEQKPDGGITLRCDSQSTLSVFSPPISPTHLSHEDLDDSGDSPVLS 1020
+IIDVN +N ENEEQKPD TLRCDSQSTLSVFSPP SPTHLSHEDLDDSGDSPVLS
Sbjct: 961 FPEIIDVNLDNAENEEQKPDCDTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLS 1020
Query: 1021 ASREGTLSIEDNESAVPAKASKVIMMTSTRVSTGDEDDWVVADEHVQEQEEYDEDDDGYQ 1080
ASREGTLSIED ESAVP K K IM++STRVSTGDED+W V +EHVQEQEEYDEDDDGY
Sbjct: 1021 ASREGTLSIEDTESAVPTKCGKEIMISSTRVSTGDEDEWAVGNEHVQEQEEYDEDDDGYD 1080
Query: 1081 EEDEVHEGEDENIDLEQDFDDLQLNDTDRGSPHMLDNLVLGFNEGVEVVMPNDEFERIPG 1140
EEDEVHE EDENIDL QDFDDL L+ D+GSPHMLDNLVLGFNEGVEV MPNDEFERI G
Sbjct: 1081 EEDEVHEVEDENIDLAQDFDDLHLD--DKGSPHMLDNLVLGFNEGVEVGMPNDEFERISG 1140
Query: 1141 DEENMYVATEISSCIKEEQGSSEGLQVDGKVCQYVDASSQVRISDLEEMQDKVMQSKIAQ 1200
+EENM+V EISSCI+EEQGSSE LQVD +CQY DASSQVRI D EEM+D V+ SK AQ
Sbjct: 1141 NEENMFVTPEISSCIREEQGSSEVLQVDSNICQYPDASSQVRIPDTEEMKDLVILSKPAQ 1200
Query: 1201 ALPELEITEQGN-SCRSSLSVQQPISCSVTMGSQSSSGQVIVPNAVLSGQAEPPFKLQFG 1260
ALP E+TEQG SCRS +SVQ PIS SV+M SQS GQVIVPNA +SGQAEPP KLQFG
Sbjct: 1201 ALPGSELTEQGKFSCRSGVSVQLPISSSVSMASQSPPGQVIVPNAAISGQAEPPVKLQFG 1260
Query: 1261 LFSGPSLIPSHVPSIQIGSIQMPLHLHPQITPSIIQMHSSQPPLFQFGQLRYPSSVPQGV 1320
LFSGPSLIPS VP+IQIGSIQMPLHLH QITPS+ MHSSQPPLFQFGQLRY SSV QGV
Sbjct: 1261 LFSGPSLIPSPVPAIQIGSIQMPLHLHAQITPSMTHMHSSQPPLFQFGQLRYTSSVSQGV 1320
Query: 1321 LPLPPQPPTFILPIVQTGFPSNENPREALSVQTPEETCFNKSRKHNVFPFMKDNQV-LVS 1380
LPL PQP TF+ P VQTGFP N+NP +A S+QT +ETC + SRK++V PF+ DNQ L S
Sbjct: 1321 LPLAPQPLTFVPPTVQTGFPLNKNPGDAPSIQTSQETCAHNSRKNDVLPFLMDNQQGLAS 1380
Query: 1381 RSLNVNSSGDSKSLPLTESIEAKVMNQQDQTSSSCIDESNSRSQPGFQAEHQRHH--VPT 1440
RSL NSSG+SKSLPLT++ E++V+ QQDQT+ SCIDESNSRS+ GFQAEHQRHH V T
Sbjct: 1381 RSL--NSSGESKSLPLTDNKESEVITQQDQTAGSCIDESNSRSESGFQAEHQRHHHNVST 1440
Query: 1441 SGNHYMVSRGKESKGRAQDGMWPFDSVSRDKGLRRFKTRGLFPGGRGKKFIFAVKNSGSR 1500
S +HY+V+RGKES+GRAQDGM PFDSVSRDKGL K RG F GGRGKK+IF VKNSGSR
Sbjct: 1441 SDDHYVVTRGKESEGRAQDGMGPFDSVSRDKGLGGSKARGQFHGGRGKKYIFTVKNSGSR 1500
Query: 1501 LPFAGSESTRLDNGGLQRQPRRNTPRTEFRVRETVDKKLPNSQVSSNYVEVDDKPTDSGI 1560
L F SESTRLD+ G QR+PRRN PRTEFRVRETVDKK NSQVSS+ VEVDDKPT SG
Sbjct: 1501 LSFPASESTRLDSSGFQRRPRRNAPRTEFRVRETVDKKSSNSQVSSSNVEVDDKPTVSGR 1560
Query: 1561 SAVNSGRNGTRKVVISNKPSKRALESEGFSSVVSTSLELDFGNRSEKGVKKDYLGKSQGN 1620
SA +S RNGTRKVVIS KPSKRALESEG SS VS+SLELD GNR+EKGVKK+YLGKSQG+
Sbjct: 1561 SAASSARNGTRKVVISTKPSKRALESEGLSSGVSSSLELDAGNRTEKGVKKEYLGKSQGS 1620
Query: 1621 QYSGEGIFRKNIGSWEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQ 1680
QYSGEG FRKNI S EDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQ
Sbjct: 1621 QYSGEGNFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQ 1680
Query: 1681 REKEIKAKSHNSKIPPKSRSTLKSALSSVNSRKVYAAKEAEMVKRTRLDFATRDGRGSGS 1740
REKEIKAKSHNSKIP +SRS K A SSVNS KVYAAKEAE VKRTR DF DGRGSG+
Sbjct: 1681 REKEIKAKSHNSKIPRRSRSNSKIASSSVNSSKVYAAKEAETVKRTRSDFVATDGRGSGN 1740
Query: 1741 IMVSSAFSSPLVSQPLAPIGTPALKSDSQMERLHTARPILTSTPALATSNGRNLESGLMF 1800
I+VSSAFSSP+VSQPLAPIGTPALKSDSQ ER HTAR I +S PALAT +GRNLES MF
Sbjct: 1741 IVVSSAFSSPIVSQPLAPIGTPALKSDSQTERSHTARSIQSSAPALATGDGRNLESSSMF 1800
Query: 1801 GKKNDILDNVQTSLTSWGNSHRNQQVMALTQTQLDEAMKPAQFDLHPPV-DHSSLTGDPF 1860
KKNDILDNVQTS SWG S NQQVMALTQTQLDEAMKPAQFDLHPPV DHSSL GDP
Sbjct: 1801 DKKNDILDNVQTSFASWGGSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSSLAGDP- 1860
Query: 1861 IVSSPSILASDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCATFLGIGPTDLCHS 1920
V SPSILA DRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCAT LGIGPT LCHS
Sbjct: 1861 NVPSPSILARDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCATLLGIGPTGLCHS 1920
Query: 1921 DIQIPHKLSGAENDCNLFFDKEKHYSESCSPIENSEAEAEAAASAVAVAAISSDEILVTN 1980
DIQIPHKLSGAENDC++FF+KEKH+SESC+ IE+SEAEAEAAASAVAVAAISSDEI VTN
Sbjct: 1921 DIQIPHKLSGAENDCHIFFEKEKHHSESCTHIEDSEAEAEAAASAVAVAAISSDEI-VTN 1980
Query: 1981 GLDTCSVSVTDTNNIGGGDVDIVTAGTAGDQQLASKTRADDSLTVTLPADLSVETPPISL 2040
GL TCSVSVTDTNN G GD++++TAG+AGDQQLASKTRADDSLTV LPADLSVETPPISL
Sbjct: 1981 GLGTCSVSVTDTNNFGSGDINVITAGSAGDQQLASKTRADDSLTVALPADLSVETPPISL 2040
Query: 2041 WPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQKCS 2100
WPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQ+QTQK S
Sbjct: 2041 WPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQSQTQKSS 2100
Query: 2101 APAPGPLGSWKQCHSGVDSFYGPPASFTSPFVSPGGIPGVQGPPHMVVYNHFAPVGQFGQ 2160
APAPGPLGSWKQCHSGVDSFYGPP F+ PF+SPGGIPGVQGPPHMVVYNHFAPVGQFGQ
Sbjct: 2101 APAPGPLGSWKQCHSGVDSFYGPP-GFSGPFISPGGIPGVQGPPHMVVYNHFAPVGQFGQ 2160
Query: 2161 VGLSFMGATYIPSGKQHDWKHSPGP-SLCVE-DQKNLNMVSAQRMPTNLPPIQHLAPGSP 2220
VGLSFMGATYIPSGKQHDWKHSPGP SL VE DQK LNMVSAQRMPTNLPPIQHLAPGSP
Sbjct: 2161 VGLSFMGATYIPSGKQHDWKHSPGPSSLGVEGDQKTLNMVSAQRMPTNLPPIQHLAPGSP 2220
Query: 2221 LLPMASPLAMFDVSPFQASPEISAQTRWPSSASTVQAVPLSMPL-QHQAGGVLPSHFSHP 2280
LLPMASPLAMFDVSPFQASPE+S Q RWPSSAS+VQ VPLSMPL Q QA GVLPSHFSH
Sbjct: 2221 LLPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPLQQQQAEGVLPSHFSHS 2280
Query: 2281 SSADPSFTVNRFPGSQP---ADQKRNFPVGADASVTQLPDELEIVDESSCLSSGAGVPNV 2340
SSADPSFTVNRFPGSQP +D KRNF V DA+VTQLPDEL IVD SSC+SSGA VPNV
Sbjct: 2281 SSADPSFTVNRFPGSQPSVASDHKRNFSVATDATVTQLPDELGIVDASSCVSSGASVPNV 2340
Query: 2341 DTNSLVVNSATDAGKTSVRPNCSSNNSGQNASTNLKSQYP-HKGISAHQYTHSSGYNYQR 2400
D NSL V+S TDAGKT V+ NCSSNNSGQN+ TNLKSQ P HKG+S QY+HSSGYN+QR
Sbjct: 2341 DINSLSVSSVTDAGKTGVQ-NCSSNNSGQNSGTNLKSQSPQHKGVSTQQYSHSSGYNFQR 2400
Query: 2401 GGASQKNSSGGSEWAHRRTRFMGRNQSGAEKNFSSGKVKQIYVAKQSSSRNL 2416
GGASQK+SSGG EW+HRRT FMGRNQSGAEKNFSS K+KQIYVAKQ+SS NL
Sbjct: 2401 GGASQKHSSGGGEWSHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQTSSGNL 2444
BLAST of Sed0025145 vs. ExPASy TrEMBL
Match:
A0A6J1IST3 (uncharacterized protein LOC111478360 OS=Cucurbita maxima OX=3661 GN=LOC111478360 PE=4 SV=1)
HSP 1 Score: 3664.0 bits (9500), Expect = 0.0e+00
Identity = 1976/2448 (80.72%), Positives = 2133/2448 (87.13%), Query Frame = 0
Query: 1 MANPGVGSKFVSVNLNKSYGQAHH----HSNSYGSNRTRPGTHGAGGGMVVLSRPRSSQK 60
MANPGVG+KFVSVNLNKSYGQAHH HSNSYGSNRTRPG+HGAGGGMVVLSRPRSSQK
Sbjct: 1 MANPGVGAKFVSVNLNKSYGQAHHHHSSHSNSYGSNRTRPGSHGAGGGMVVLSRPRSSQK 60
Query: 61 PGPKLSVPPPLNLPSLRKEHQRLDSLGSSAGPTGGGVMGNGQRPTSAGIGWTKPRTNDFP 120
PGPKLSVPPPLNLPSLRKEH+RLDSLGS AG TGGGV+GN QRPTSAG+GWTKP TND P
Sbjct: 61 PGPKLSVPPPLNLPSLRKEHERLDSLGSGAGQTGGGVLGNRQRPTSAGLGWTKPHTNDLP 120
Query: 121 DKEALNGNVVDRIDPSLRSVDGVSGGSSVYMPPSARAGITQPDCVSSCFLSVHTTLEKTP 180
+KE L+GN+VD+IDPSLRSVDGV+GGSSVYMPPSARA P +S VHT +EK P
Sbjct: 121 EKEGLSGNIVDKIDPSLRSVDGVNGGSSVYMPPSARASTAGPVVSTSALSQVHTAVEKAP 180
Query: 181 ILGGEDFPSLQATLPSAVAPPQKQKDGMNSKLKHASEGSYEERRDTSHLSSSIDARTKYR 240
+L GEDFPSLQATLPSA AP QKQ+DG++SKLKHA+E SYEE+RDTSHLSSSIDAR+K++
Sbjct: 181 VLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEVSYEEQRDTSHLSSSIDARSKFQ 240
Query: 241 SSLKSPPRENSKNGDFFS-----SPEFSRKQEDIFPDPLPLVSVNPRSDWADDERDTSQG 300
SS KS P EN+KNG+ FS SPE SRKQEDIFP PLPLVS+NPRSDWADDERDTS G
Sbjct: 241 SSKKSIPSENAKNGNSFSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERDTSHG 300
Query: 301 LAGMGRDRGHPKSEAYWERDFDMPRVSSLPHKPIPNFSQKWNMRDNESGKFRSSDIRKVD 360
L RDRGHPKSEAYWERDFDMP VSSLPHKPI NFSQ+W+ RD+ESGKF SSDI KVD
Sbjct: 301 LIDRVRDRGHPKSEAYWERDFDMPWVSSLPHKPIHNFSQRWHPRDDESGKFHSSDIHKVD 360
Query: 361 PYGRDARTPSREGWEGSFQKNNPIPKDGFHSDSGNPRNDIAARPTSIDREANAGNMHVSH 420
PYGRDARTPSREGWEG+FQKNNPIPKD F SDSGN RNDIA RPTSIDRE NA NMHVS
Sbjct: 361 PYGRDARTPSREGWEGNFQKNNPIPKDRFGSDSGNDRNDIAGRPTSIDRETNADNMHVSQ 420
Query: 421 FQEHAHKDGRKDTGFGQTGRQTWNRATESYSSQEPDRTTRDKYGSEQHNRYKGEIHNTSV 480
F+EHA K GR+D GF GRQTWN A+ESY+SQ+PD T +DK+GSEQHN+++G+ HNTSV
Sbjct: 421 FREHAPKVGRRDAGF---GRQTWNSASESYNSQDPDWTAKDKHGSEQHNKFRGQTHNTSV 480
Query: 481 ANSSYYSGSKQIPTDEPLLNFGKDRRSFTKNEKPFMEDPFMKDFGA------DPFTAGLV 540
+NSSY G K+IP D+ LLNFG+DRRSF K EKP+MEDPFMKDFG DP+T GLV
Sbjct: 481 SNSSYSPGLKRIPADDLLLNFGRDRRSFAKIEKPYMEDPFMKDFGGSSFDGRDPYTGGLV 540
Query: 541 GMFK-KKDVIKHTDFHDPVRESFEAELERVQQMQEQERQRIIEEQERALELSRREEEERK 600
G+ K KKDVIK TDFHDPVR+SFEAELERVQQ+QEQERQRIIEEQERALEL+RREEEER+
Sbjct: 541 GVVKRKKDVIKQTDFHDPVRDSFEAELERVQQIQEQERQRIIEEQERALELARREEEERQ 600
Query: 601 RIAREHEERQRRAEEEAREVAWRAEQERLENIQKAEELRIARERDKQRIILEEERRKQAA 660
R+ARE EERQRRAEE ARE AWRAEQERLE +QKAEELRIARE +KQRI +EEERRKQAA
Sbjct: 601 RLAREQEERQRRAEEIAREAAWRAEQERLEAVQKAEELRIAREEEKQRIFVEEERRKQAA 660
Query: 661 KLKLLEVEERMAKRQAEAVKSSSLTSDIPEKNISNVAKDVSRVADTIDWEDGEKMVERIT 720
KLKLLE+EERMAKRQAEAVKSS+LTSDIPEK IS+V KDVSR+AD++DWEDGEKMVERIT
Sbjct: 661 KLKLLELEERMAKRQAEAVKSSTLTSDIPEKKISSVVKDVSRLADSVDWEDGEKMVERIT 720
Query: 721 TSASSESSSMVRSSEVGPRSQFSRDASSAFVDRGKLVNSYRRDLYERRSGSQFVLQGQSI 780
TSASSESS + R SEVG R+Q SRD S +FVDRGK VNS+RRD Y+R SGSQFVLQ QS
Sbjct: 721 TSASSESSCINRPSEVGLRTQVSRDGSPSFVDRGKSVNSWRRDFYDRGSGSQFVLQDQST 780
Query: 781 GYNSQRQEPFVGGQLSSRKEFYGGAGFTTSRISHGRGITKPQSDDYSELRVQRPNLSGSG 840
GY +E GG++SSRKEFYGGAG TTSRI + RG+T+PQSDDYS+LR QRPNLSG G
Sbjct: 781 GYTGPWREATTGGRVSSRKEFYGGAGLTTSRIYNRRGMTEPQSDDYSQLRGQRPNLSGGG 840
Query: 841 DHYNKSQEFDSELQDSFENFGDHGWRQEGGHNNVYFPYPERVNSISEPDGSYYVGRTRYS 900
D YN+SQEFDSE QD+ ENFGDH WRQEG NN YFPYPERVN ISE DGSY VGR+RYS
Sbjct: 841 DQYNRSQEFDSEFQDNVENFGDHAWRQEGSRNNFYFPYPERVNPISEADGSYSVGRSRYS 900
Query: 901 QKQPRVLPPPSVALMQKSSIRGEYESVTRD--------THPVRNVSTAQARYIHHENCTL 960
Q+QPRVLPPPSVA +QKSS+RGE+ SVTRD H RNVSTAQ RYIHHEN TL
Sbjct: 901 QRQPRVLPPPSVASIQKSSVRGEFTSVTRDIAESEIQYDHLARNVSTAQTRYIHHENRTL 960
Query: 961 SKIIDVNFENVENEEQKPDGGITLRCDSQSTLSVFSPPISPTHLSHEDLDDSGDSPVLSA 1020
+IIDVN EN ENEEQKPDG TLRCDSQSTLSVFSPP SPTHLSHEDLDDSGDSPVLSA
Sbjct: 961 PEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSA 1020
Query: 1021 SREGTLSIEDNESAVPAKASKVIMMTSTRVSTGDEDDWVVADEHVQEQEEYDEDDDGYQE 1080
SREGTLSIEDNESAVPAKA K IM++STR STGDED+W V DEHVQEQEEYDEDDDGY+E
Sbjct: 1021 SREGTLSIEDNESAVPAKAGKEIMISSTRASTGDEDEWGVVDEHVQEQEEYDEDDDGYRE 1080
Query: 1081 EDEVHEGEDENIDLEQDFDDLQLNDTDRGSPHMLDNLVLGFNEGVEVVMPNDEFERIPGD 1140
EDEVHEGEDENIDL Q+FDDL L+ D+GSPHMLDNLVLGFNEGVEV MPNDEFERI G+
Sbjct: 1081 EDEVHEGEDENIDLAQNFDDLHLD--DKGSPHMLDNLVLGFNEGVEVGMPNDEFERILGN 1140
Query: 1141 EENMYVATEISSCIKEEQGSSEGLQVDGKVCQYVDASSQVRISDLEEMQDKVMQSKIAQA 1200
EENM+ EIS+CI+EEQGSSEGLQVDGKVCQY DASSQ+RI D EEMQD VMQS+ AQA
Sbjct: 1141 EENMFATPEISNCIREEQGSSEGLQVDGKVCQYEDASSQIRI-DPEEMQDLVMQSETAQA 1200
Query: 1201 LPELEITEQGN-SCRSSLSVQQPISCSVTMGSQSSSGQVIVPNAVLSGQAEPPFKLQFGL 1260
LPE EI EQGN SCRSS+SVQQPIS SV+M SQSSSGQVIVPNA SGQAEPP KLQFGL
Sbjct: 1201 LPEPEINEQGNSSCRSSVSVQQPISSSVSMASQSSSGQVIVPNAAGSGQAEPPVKLQFGL 1260
Query: 1261 FSGPSLIPSHVPSIQIGSIQMPLHLHPQITPSIIQMHSSQPPLFQFGQLRYPSSVPQGVL 1320
FSGPSLIPS VP+IQIGSIQMPLHLHPQ+TPS+ MHSSQPPLFQFGQLRY SSV QGVL
Sbjct: 1261 FSGPSLIPSPVPAIQIGSIQMPLHLHPQMTPSMTHMHSSQPPLFQFGQLRYTSSVSQGVL 1320
Query: 1321 PLPPQPPTFILPIVQTGFPSNENPREALSVQTPEETCFNKSRKHNVFPFMKDNQV-LVSR 1380
PL PQP TF+ P VQTGFP N+NP +AL +QT +ETC + SRK++V P + DNQ LVSR
Sbjct: 1321 PLAPQPLTFVPPAVQTGFPLNKNPGDALLIQTSQETCAHNSRKNDVLPLLMDNQQGLVSR 1380
Query: 1381 SLNVNSSGDSKSLPLTESIEAKVMNQQDQTSSSCIDESNSRSQPGFQAEHQRHHVPTSGN 1440
S NVNSSG+SKSLPLTESIE++VM QQ QT+ SCIDE+NSRS+ GFQAEHQR HV TS N
Sbjct: 1381 SSNVNSSGESKSLPLTESIESQVMAQQYQTAGSCIDENNSRSELGFQAEHQRQHVSTSDN 1440
Query: 1441 HYMVSRGKESKGRAQDGMWPFDSVSRDKGLRRFKTRGLFPGGRGKKFIFAVKNSGSRLPF 1500
HY+VSRGKES+GRAQDGM DSVSRDKGL K RG FPGGRGKK+IF VKNSGSRLPF
Sbjct: 1441 HYVVSRGKESEGRAQDGMGSLDSVSRDKGLSGLKARGQFPGGRGKKYIFTVKNSGSRLPF 1500
Query: 1501 AGSESTRLDNGGLQRQPRRNTPRTEFRVRETVDKKLPNSQVSSNYVEVDDKPTDSGISAV 1560
GSESTRLD GG QR+PRRN PRTEFRVRETVDKKL +SQVSSN+VEVDDKPT SG +AV
Sbjct: 1501 PGSESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSSSQVSSNHVEVDDKPTVSGRTAV 1560
Query: 1561 NSGRNGTRKVVISNKPSKRALESEGFSSVVSTSLELDFGNRSEKGVKKDYLGKSQGNQYS 1620
NS RNGTRKV +SNKPSKRALE EG SS STSLELD GNRSEK VKK+YLGKSQG+QY
Sbjct: 1561 NSARNGTRKVFVSNKPSKRALEPEGLSSRASTSLELDAGNRSEKEVKKEYLGKSQGSQYY 1620
Query: 1621 GEGIFRKNIGSWEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREK 1680
GE FRKNI S EDVDAP+QSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREK
Sbjct: 1621 GESNFRKNICSGEDVDAPMQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREK 1680
Query: 1681 EIKAKSHNSKIPPKSRSTLKSALSSVNSRKVYAAKEAEMVKRTRLDFATRD-GRGSGSIM 1740
EIKAKSHNSKIP KSRST K ALSSVNS KVYAAK AE VKRTR +F D GRGSG+I+
Sbjct: 1681 EIKAKSHNSKIPRKSRSTSKIALSSVNSSKVYAAKVAETVKRTRSEFIAADGGRGSGNIV 1740
Query: 1741 VSSAFSSPLVSQPLAPIGTPALKSDSQMERLHTARPILTSTPALATSNGRNLESGLMFGK 1800
VSSA SS +VSQPLAPIGTPALKSDSQ ER HTAR I TS PALATS+GRNLES LMF K
Sbjct: 1741 VSSALSSSIVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLESSLMFDK 1800
Query: 1801 KNDILDNVQTSLTSWGNSHRNQQVMALTQTQLDEAMKPAQFDLHPPV-DHSSLTGDPFIV 1860
KNDILDNV +S SWGNS NQQVMALTQTQLDEAMKPAQFDLHPPV DHSSL GDP +
Sbjct: 1801 KNDILDNVPSSFPSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSSLAGDPNVP 1860
Query: 1861 SSPSILASDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCATFLGIGPTDLCHSDI 1920
SS SILA DRSFSSAANPISSLLAGEKIQFGAVTSPTVLPP SC+T LGIGPT LCHSD+
Sbjct: 1861 SS-SILAIDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPDSCSTLLGIGPTGLCHSDM 1920
Query: 1921 QIPHKLSGAENDCNLFFDKEKHYSESCSPIENSEAEAEAAASAVAVAAISSDEILVTNGL 1980
QIPHKLSGAENDC+LFF+KEKH+SES + IE+SEAEAEAAASAVAVAAISSDEI VTNGL
Sbjct: 1921 QIPHKLSGAENDCHLFFEKEKHHSESRTRIEDSEAEAEAAASAVAVAAISSDEI-VTNGL 1980
Query: 1981 DTCSVSVTDTNNIGGGDVDIVTAGTAGDQQLASKTRADDSLTVTLPADLSVETPPISLWP 2040
T SV VTDTNN GGGD++++ AG+AG+QQ ASKTRADDSLTV LPADLSVETPPISLWP
Sbjct: 1981 GTSSVPVTDTNNFGGGDINVIIAGSAGNQQFASKTRADDSLTVALPADLSVETPPISLWP 2040
Query: 2041 TLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQKCSAP 2100
+LPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQK SAP
Sbjct: 2041 SLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQKSSAP 2100
Query: 2101 APGPLGSWKQCHSGVDSFYGPPASFTSPFVSPGGIPGVQGPPHMVVYNHFAPVGQFGQVG 2160
APGPLGSWKQCHSGVDSFYGPPA FT PF+SPGGIPGVQGPPHMVVYNHFAPVGQFGQVG
Sbjct: 2101 APGPLGSWKQCHSGVDSFYGPPAGFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQFGQVG 2160
Query: 2161 LSFMGATYIPSGKQHDWKHSPGPSLCVE-DQKNLNMVSAQRMPTNLPPIQHLAPGSPLLP 2220
LSFMGATYIPSGKQ DWKHSPGPSL VE DQKNLNMVSAQRMPTNLPPIQHLAPGSPLLP
Sbjct: 2161 LSFMGATYIPSGKQPDWKHSPGPSLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPGSPLLP 2220
Query: 2221 MASPLAMFDVSPFQASPEISAQTRWPSSASTVQAVPLSMPLQHQAGGVLPSHFSHPSSAD 2280
MASPLAMFDVSPFQASPE+S Q RWPSSAS+VQ VPLSMPLQ QA G+LPSHFSH SSAD
Sbjct: 2221 MASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPLQQQAEGILPSHFSHASSAD 2280
Query: 2281 PSFTVNRFPGSQP---ADQKRNFPVGADASVTQLPDELEIVDESSCLSSGAGVPNVDTNS 2340
PSFTVNRFPGSQP +D KRN+ V ADA+VTQLPDEL IVD SSC+SSG VPNVD S
Sbjct: 2281 PSFTVNRFPGSQPSVASDHKRNYTVAADATVTQLPDELGIVDASSCVSSGGSVPNVDIKS 2340
Query: 2341 LVVNSATDAGKTSVRPNCSSNNSGQNASTNLKSQYP-HKGISAHQYTHSSGYNYQRGGAS 2400
L VNS TDAGKT V+ NCSS+NS NA TNLKSQ P HKGI QY+HSSGYNYQRGGAS
Sbjct: 2341 LSVNSVTDAGKTGVQ-NCSSSNSSLNAGTNLKSQSPQHKGIPVQQYSHSSGYNYQRGGAS 2400
Query: 2401 QKNSSGGSEWAHRRTRFMGRNQSGAEKNFSSGKVKQIYVAKQSSSRNL 2416
QKNSSGGSEW HRRT FMGRNQSGAEKNFSS K+KQIYVAKQ SS NL
Sbjct: 2401 QKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSSGNL 2439
BLAST of Sed0025145 vs. ExPASy TrEMBL
Match:
A0A5D3CNG4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G004860 PE=4 SV=1)
HSP 1 Score: 3614.3 bits (9371), Expect = 0.0e+00
Identity = 1953/2458 (79.45%), Positives = 2125/2458 (86.45%), Query Frame = 0
Query: 1 MANPGVGSKFVSVNLNKSYGQAHH---------HSNSYGSNRTRPGTHGAGGGMVVLSRP 60
MANPGVG+KFVSVNLNKSYGQ HH HSNSYGSNRTRPG HG GGGMVVLSRP
Sbjct: 1 MANPGVGTKFVSVNLNKSYGQTHHHHHHHHHSSHSNSYGSNRTRPGGHGVGGGMVVLSRP 60
Query: 61 RSSQKPGPKLSVPPPLNLPSLRKEHQRLDSLGSSAGPTGGGVMGNGQRPTSAGIGWTKPR 120
RSSQKPGPKLSVPPPLNLPSLRKEH+RLDSLGS GPTGGGV+GNGQRPTSAG+GWTKPR
Sbjct: 61 RSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPR 120
Query: 121 TNDFPDKEALNGNVVDRIDPSLRSVDGVSGGSSVYMPPSARAGITQPDCVSSCFLSVHTT 180
TND P+KE + N+VD+IDPSLRSVDGVSGGSSVYMPPSARAG+T P +S VH
Sbjct: 121 TNDLPEKEGPSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGMTGPVVSTSASSQVHAA 180
Query: 181 LEKTPILGGEDFPSLQATLPSAVAPPQKQKDGMNSKLKHASEGSYEERRDTSHLSSSIDA 240
+EK+P+L GEDFPSLQATLPSA AP QKQ+DG++SKLKH SEGS EE+RD++HLSS IDA
Sbjct: 181 VEKSPVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHVSEGSCEEQRDSAHLSSRIDA 240
Query: 241 RTKYRSSLKSPPRENSKNGDFFS-----SPEFSRKQEDIFPDPLPLVSVNPRSDWADDER 300
R+ Y+SS KS EN+KNG+ FS SPE SRKQEDIFP PLPLVS+NPRSDWADDER
Sbjct: 241 RSNYQSSQKSVRSENAKNGNSFSSGTFQSPESSRKQEDIFPGPLPLVSMNPRSDWADDER 300
Query: 301 DTSQGLAGMGRDRGHPKSEAYWERDFDMPRVSSLPHKPIPNFSQKWNMRDNESGKFRSSD 360
DTS GL RDRGHPKSEAYWERDFDMPRVSSLPHKP NFSQ+WN+RD+ESGKF SSD
Sbjct: 301 DTSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSD 360
Query: 361 IRKVDPYGRDARTPSREGWE-GSFQKNNPIPKDGFHSDSGNPRNDIAARPTSIDREANAG 420
I KVDPYGRD+R SR+GWE G+F+KNNP+PKDGF SD+GN RN IA R TS+DRE NA
Sbjct: 361 IHKVDPYGRDSRMASRDGWEGGNFRKNNPVPKDGFGSDNGNDRNAIAGRLTSVDRETNAD 420
Query: 421 NMHVSHFQEHAHKDGRKDTGFGQTGRQTWNRATESYSSQEPDRTTRDKYGSEQHNRYKGE 480
NMHVSHF+EHA+KDGR+D GFGQ GRQTWN ATESYSSQEPDRT +DKYGSEQH++++GE
Sbjct: 421 NMHVSHFREHANKDGRRDAGFGQNGRQTWNSATESYSSQEPDRTVKDKYGSEQHSKFRGE 480
Query: 481 IHNTSVANSSYYSGSKQIPTDEPLLNFGKDRRSFTKNEKPFMEDPFMKDFGA------DP 540
HNTSVANSSY SG K+IP DEPLLNFG+DRRSF K EKP+MEDPFMKDFGA DP
Sbjct: 481 THNTSVANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDP 540
Query: 541 FTAGLVGMFK-KKDVIKHTDFHDPVRESFEAELERVQQMQEQERQRIIEEQERALELSRR 600
FTAGLVG+ K KKDVIK TDFHDPVRESFEAELERVQQ+QEQERQRIIEEQERALEL+RR
Sbjct: 541 FTAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARR 600
Query: 601 EEEERKRIAREHEERQRRAEEEAREVAWRAEQERLENIQKAEELRIARERDKQRIILEEE 660
EEEER+R+AREHEERQRRAEEEARE AWRAEQERLE IQKAEELRIARE +KQRI LEEE
Sbjct: 601 EEEERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRIFLEEE 660
Query: 661 RRKQAAKLKLLEVEERMAKRQAEAVKSSSLTSDIPEKNISNVAKDVSRVADTIDWEDGEK 720
RRKQAAKLKLLE+EE++AKRQAEAVKSS+ SDIPEK I +V KDVSR+ DT+DWEDGEK
Sbjct: 661 RRKQAAKLKLLELEEKIAKRQAEAVKSSTSNSDIPEKKIPSVVKDVSRLVDTVDWEDGEK 720
Query: 721 MVERITTSASSESSSMVRSSEVGPRSQFSRDASSAFVDRGKLVNSYRRDLYERRSGSQFV 780
MVERITTSASSESSS+ RSSEVG RSQFSRD S +FVDRGK VNS+RRD YER SGSQFV
Sbjct: 721 MVERITTSASSESSSINRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQFV 780
Query: 781 LQGQSIGYNSQRQEPFVGGQLSSRKEFYGGAGFTTSRISHGRGITKPQSDDYSELRVQRP 840
LQ QS GYN R+E GG++SSRKEFYGGA FTTS+ SH RGIT+PQSD+YS+LR QRP
Sbjct: 781 LQDQSTGYNGPRREVSTGGRVSSRKEFYGGAAFTTSKTSHRRGITEPQSDEYSQLRGQRP 840
Query: 841 NLSGSGDHYNKSQEFDSELQDSFENFGDHGWRQEGGHNNVYFPYPERVNSISEPDGSYYV 900
NLSG DHYN++QEFDS+ QD+ ENFGDHGWRQE GHNN YFPYPERVN ISE DGSY V
Sbjct: 841 NLSGGVDHYNRTQEFDSDFQDNVENFGDHGWRQESGHNNFYFPYPERVNPISETDGSYSV 900
Query: 901 GRTRYSQKQPRVLPPPSVALMQKSSIRGEYESVTRD-------THPVRNVSTAQARYIHH 960
GR+RYSQ+QPRVLPPPSVA MQKSS+R EYESV RD HP N+STAQ YIHH
Sbjct: 901 GRSRYSQRQPRVLPPPSVASMQKSSVRNEYESVPRDIESEIQYDHPASNISTAQTMYIHH 960
Query: 961 ENCTLSKIIDVNFENVENEEQKPDGGITLRCDSQSTLSVFSPPISPTHLSHEDLDDSGDS 1020
EN L +IIDVN EN ENEEQK DG TLRCDSQSTLSVFSPP SPTHLSHEDLDDSGDS
Sbjct: 961 ENRALPEIIDVNLENGENEEQKTDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDS 1020
Query: 1021 PVLSASREGTLSIEDNESAVPAKASKVIMMTSTRVSTGDEDDWVVADEHVQEQEEYDEDD 1080
PVLSASREGTLSIEDN+SAVPAKA K IM+TSTRVSTGDED+W DEHVQEQEEYDEDD
Sbjct: 1021 PVLSASREGTLSIEDNDSAVPAKAGKEIMITSTRVSTGDEDEWGAVDEHVQEQEEYDEDD 1080
Query: 1081 DGYQEEDEVHEGEDENIDLEQDFDDLQLNDTDRGSPHMLDNLVLGFNEGVEVVMPNDEFE 1140
DGYQEEDEVHEGEDENIDL DFDDL L+ D+GSPHMLDNLVLGFNEGVEV MPNDEFE
Sbjct: 1081 DGYQEEDEVHEGEDENIDLVPDFDDLHLD--DKGSPHMLDNLVLGFNEGVEVGMPNDEFE 1140
Query: 1141 RIPGDEENMYVATEISSCIKEEQGSSEGLQVDG-KVCQYVDASSQVRISDLEEMQDKVMQ 1200
RIPG+EEN+YVA+EIS+ I+EE+GSSEGLQVDG KVCQYVDASSQ+RI D EEMQD VMQ
Sbjct: 1141 RIPGNEENLYVASEISNDIREERGSSEGLQVDGNKVCQYVDASSQIRI-DPEEMQDLVMQ 1200
Query: 1201 SKIAQALPELEITEQGN-SCRSSLSVQQPISCSVTMGSQSSSGQVIVPNAVLSGQAEPPF 1260
SKIAQALP+ EITEQGN SCRSS+SV+QPIS SV+M SQS SGQVIVP+AV SGQAEPP
Sbjct: 1201 SKIAQALPDSEITEQGNASCRSSVSVRQPISSSVSMASQSISGQVIVPSAV-SGQAEPPV 1260
Query: 1261 KLQFGLFSGPSLIPSHVPSIQIGSIQMPLHLHPQITPSIIQMHSSQPPLFQFGQLRYPSS 1320
KLQFGLFSGPSLIPS VP+IQIGSIQMPLHLHPQIT S+ MHSSQPPLFQFGQLRY SS
Sbjct: 1261 KLQFGLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSS 1320
Query: 1321 VPQGVLPLPPQPPTFILPIVQTGFPSNENPREALSVQTPEETCFNKSRKHNVFPFMKDNQ 1380
V GVLPL PQP TF P VQTGF N+NP + LS+ +ETC + SRK++ PF DNQ
Sbjct: 1321 VSPGVLPLAPQPLTF-APTVQTGFSLNKNPGDGLSIHPSQETCAHSSRKNDSSPFSMDNQ 1380
Query: 1381 V-LVSRSLNVNSSGDSKSLPLTESIEAKVMNQQDQTSSSCIDESNSRSQPGFQAEHQRHH 1440
LVSRSLNVN SG+SKSLPLTES+E+KV++ QDQ + SCIDESNSRS+PGFQAEH R H
Sbjct: 1381 QGLVSRSLNVNPSGESKSLPLTESMESKVVSPQDQAAVSCIDESNSRSEPGFQAEHHRLH 1440
Query: 1441 VPTSGNHYMVSRGKESKGRAQDGMWPFDSVSRDKGLRRFKTRGLFPGGRGKKFIFAVKNS 1500
V TS NHY+VSRGKES+GRAQDGM FDS SR+KG K RG FPGGRGKK+IF VKNS
Sbjct: 1441 VSTSDNHYVVSRGKESEGRAQDGMGSFDSASRNKGSSGLKGRGQFPGGRGKKYIFTVKNS 1500
Query: 1501 GSRLPFAGSESTRLDNGGLQRQPRRNTPRTEFRVRETVDKKLPNSQVSSNYVEVDDKPTD 1560
GSRLPF SESTRL+ GG QR+PRRN RTEFRVRET DKKL NSQVSSN+V VDDKPT
Sbjct: 1501 GSRLPFPVSESTRLETGGFQRRPRRNITRTEFRVRETADKKLSNSQVSSNHVGVDDKPTV 1560
Query: 1561 SGISAVNSGRNGTRKVVISNKPSKRALESEGFSSVVSTSLELDFGNRSEKGVKKDYLGKS 1620
SG +AV+S RNGTRKV++SNK SKRALESEG SS VSTS+ELD GNRSEKGVKK+YLGKS
Sbjct: 1561 SGRTAVSSARNGTRKVIMSNKSSKRALESEGLSSGVSTSVELDAGNRSEKGVKKEYLGKS 1620
Query: 1621 QGNQYSGEGIFRKNIGSWEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDR 1680
QG+QYSGEG FR+NI S ED D PLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDR
Sbjct: 1621 QGSQYSGEGSFRRNICSGEDADTPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDR 1680
Query: 1681 REQREKEIKAKSHNSKIPPKSRSTLKSALSSVNSRKVYAAKEAEMVKRTRLDFATRDG-- 1740
REQREKEIKAKSHN+KIP K RSTLKSALSSV+S KVYA KEAE VKRTR DF DG
Sbjct: 1681 REQREKEIKAKSHNTKIPRKGRSTLKSALSSVSSSKVYAPKEAETVKRTRSDFVAADGGV 1740
Query: 1741 RGSGSIMVSSAFSSPLVSQPLAPIGTPALKSDSQMERLHTARPILTSTPALATSNGRNLE 1800
RGSG+++VSSAFS P+VSQPLAPIGTPALKSDSQ ER HTAR I TS PALAT++GRNL+
Sbjct: 1741 RGSGNVVVSSAFSPPVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATNDGRNLD 1800
Query: 1801 SGLMFGKKNDILDNVQTSLTSWGNSHRNQQVMALTQTQLDEAMKPAQFDLHPPVDHSSLT 1860
S LMF KK+DILDNVQ+S SWGNS NQQV+ALTQTQLDEAMKPAQFDLHPP ++
Sbjct: 1801 SSLMFDKKDDILDNVQSSFASWGNSRINQQVIALTQTQLDEAMKPAQFDLHPPAGDTN-- 1860
Query: 1861 GDPFIVSSPSILASDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCATFLGIG-PT 1920
V SPSILA DRS+SSAANPISSLLAGEKIQFGAVTSPTVLPPGSC+T LGIG PT
Sbjct: 1861 -----VPSPSILAMDRSYSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGTPT 1920
Query: 1921 DLCHSDIQIPHKLSGAENDCNLFFDKEKHYSESCSPIENSEAEAEAAASAVAVAAISSDE 1980
LCHSDI IPHKLSGAENDC+LFF+KEKH ESC+ IE+SEAEAEAAASAVAVAAISSDE
Sbjct: 1921 GLCHSDISIPHKLSGAENDCHLFFEKEKHRPESCTHIEDSEAEAEAAASAVAVAAISSDE 1980
Query: 1981 ILVTNGLDTCSVSVTDTNNIGGGDVDIVTAGTAGDQQLASKTRADDSLTVTLPADLSVET 2040
+VTNG+ TCSVSV+DTNN G GD++++ G+ GDQQLASKTRADDSLTV LPADLSVET
Sbjct: 1981 -MVTNGIGTCSVSVSDTNNFGSGDINVIATGSTGDQQLASKTRADDSLTVALPADLSVET 2040
Query: 2041 PPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQ 2100
PPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESV TTQAQ
Sbjct: 2041 PPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQAQ 2100
Query: 2101 TQKCSAPAPGPLGSWKQCHSGVDSFYGPPASFTSPFVSPGGIPGVQGPPHMVVYNHFAPV 2160
TQK SAPAPGPLGSWK CHSGVDSFYGPP FT PF+SPGGIPGVQGPPHMVVYNHFAPV
Sbjct: 2101 TQKSSAPAPGPLGSWKHCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAPV 2160
Query: 2161 GQFGQVGLSFMGATYIPSGKQHDWKHSPGP-SLCVE-DQKNLNMVSAQRMPTNLPPIQHL 2220
GQFGQVGLSFMGATYIPSGKQHDWKHSPGP SL V+ DQKNLNMVSAQRMP NLPPIQHL
Sbjct: 2161 GQFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGVDGDQKNLNMVSAQRMPANLPPIQHL 2220
Query: 2221 APGSPLLPMASPLAMFDVSPFQASPEISAQTRWPSSASTVQAVPLSMPL-QHQAGGVLPS 2280
APGSPLLPMASPLAMFDVSPFQASPE+S QTRWPSSAS Q VPLSMP+ Q QA G+LPS
Sbjct: 2221 APGSPLLPMASPLAMFDVSPFQASPEMSVQTRWPSSASPAQPVPLSMPMQQQQAEGILPS 2280
Query: 2281 HFSHPSSADPSFTVNRFPGSQ---PADQKRNFPVGADASVTQLPDELEIVDESSCLSSGA 2340
HFSH SS+DP+F+VNRFPGSQ +D KRNF V ADA+VTQLPDEL IVD SSC+SSGA
Sbjct: 2281 HFSHASSSDPTFSVNRFPGSQASVASDHKRNFTVSADATVTQLPDELGIVDSSSCVSSGA 2340
Query: 2341 GVPNVDTNSLVVNSATDAGKTSVRPNCSSNNSGQ-NASTNLKSQYPHKGI-SAHQYTHSS 2400
VPNVD NSL S TDAG+T V+ SS+NSGQ NA TNLKS HKGI SA QY+HSS
Sbjct: 2341 SVPNVDINSL---SVTDAGQTGVKNCSSSSNSGQNNAGTNLKSSLHHKGISSAQQYSHSS 2400
Query: 2401 GYNYQRGGASQKNSSGGSEWAHRRTRFMGRNQSGAEKNFSSGKVKQIYVAKQSSSRNL 2416
GYNYQRGGASQKNSSGGSEW+HRRT F+GRNQSGAEKNFSS K+KQIYVAKQ S+ NL
Sbjct: 2401 GYNYQRGGASQKNSSGGSEWSHRRTGFVGRNQSGAEKNFSSAKMKQIYVAKQPSNGNL 2442
BLAST of Sed0025145 vs. ExPASy TrEMBL
Match:
A0A6J1BUR5 (uncharacterized protein LOC111005936 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111005936 PE=4 SV=1)
HSP 1 Score: 3612.8 bits (9367), Expect = 0.0e+00
Identity = 1955/2452 (79.73%), Positives = 2106/2452 (85.89%), Query Frame = 0
Query: 1 MANPGVGSKFVSVNLNKSYGQAHH----HSNSYGSNRTRPGTHGAGGGMVVLSRPRSSQK 60
MANPGVG+KFVSVNLNKSYGQ HH H NSYGSNRTRPG+HGAGGGMVVLSRPRSSQK
Sbjct: 1 MANPGVGTKFVSVNLNKSYGQPHHHHSSHPNSYGSNRTRPGSHGAGGGMVVLSRPRSSQK 60
Query: 61 PGPKLSVPPPLNLPSLRKEHQRLDSLGSSAGPTGGGVMGNGQRPTSAGIGWTKPRTNDFP 120
PGPKLSVPPPLNLPSLRKEH+RLDSLGS AGP GGGV+GNGQRPTSAG+GWTKPRTND P
Sbjct: 61 PGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLGNGQRPTSAGMGWTKPRTNDLP 120
Query: 121 DKEALNGNVVDRIDPSLRSVDGVSGGSSVYMPPSARAGITQPDCVSSCFLSVHTTLEKTP 180
+KE L+ N+ DRIDPSLR+VDG SGGSSVYMPPSARAG+T P +S V+ +EK P
Sbjct: 121 EKEGLSSNMADRIDPSLRTVDGASGGSSVYMPPSARAGMTGPVVTTSASSQVYAAVEKAP 180
Query: 181 ILGGEDFPSLQATLPSAVAPPQKQKDGMNSKLKHASEGSYEERRDTSHLSSSIDARTKYR 240
+L GEDFPSLQATLPSA P QK KDG +SKLK A+EGSYEE+RDTSHLSSSIDAR K++
Sbjct: 181 VLRGEDFPSLQATLPSAAGPSQKLKDGPSSKLKRAAEGSYEEQRDTSHLSSSIDARPKFQ 240
Query: 241 SSLKSPPRENSKNGDFFS-----SPEFSRKQEDIFPDPLPLVSVNPRSDWADDERDTSQG 300
S+ K P EN+K GD FS S E SRKQED+FP PLPLVS+NPRSDWADDERDTS G
Sbjct: 241 SAQKGLPSENAKKGDTFSLGSFQSSESSRKQEDLFPGPLPLVSMNPRSDWADDERDTSHG 300
Query: 301 LAGMGRDRGHPKSEAYWERDFDMPRVSSLPHKPIPNFSQKWNMRDNESGKFRSSDIRKVD 360
L GRDRGHPKSEAYWERDFDMPRVS+LPHKPIPNFSQ+WN+RD+ESGKF S+DI KVD
Sbjct: 301 LIDRGRDRGHPKSEAYWERDFDMPRVSALPHKPIPNFSQRWNLRDDESGKFHSNDIHKVD 360
Query: 361 PYGRDARTPSREGWEGSFQKNNPIPKDGFHSDSGNPRNDIAARPTSIDREANAGNMHVSH 420
PYGRDARTPSREGWEG+F++N PIPKDGF SDS N RNDIAARPT+IDRE NA +MHVSH
Sbjct: 361 PYGRDARTPSREGWEGNFRRNTPIPKDGFGSDSRNDRNDIAARPTNIDRETNANSMHVSH 420
Query: 421 FQEHAHKD-GRKDTGFGQTGRQTWNRATESYSSQEPDRTTRDKYGSEQHNRYKGEIHNTS 480
F+EHAHKD GR+DTG+GQ GRQTWN A ESYSSQEPDR RDKYGSEQHNR
Sbjct: 421 FREHAHKDPGRRDTGYGQMGRQTWNSAAESYSSQEPDRVIRDKYGSEQHNR--------- 480
Query: 481 VANSSYYSGSKQIPTDEPLLNFGKDRRSFTKNEKPFMEDPFMKDFGA------DPFTAGL 540
+RRSF K EKP+MEDPFMKDFGA DPF G+
Sbjct: 481 ------------------------ERRSFAKIEKPYMEDPFMKDFGASGFDGRDPFATGI 540
Query: 541 VGMFK-KKDVIKHTDFHDPVRESFEAELERVQQMQEQERQRIIEEQERALELSRREEEER 600
VG+ K KKDVIK TDFHDPVRESFEAELERVQQ+QEQERQRIIEEQERALEL+RREEEER
Sbjct: 541 VGVVKRKKDVIKQTDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREEEER 600
Query: 601 KRIAREHEERQRRAEEEAREVAWRAEQERLENIQKAEELRIARERDKQRIILEEERRKQA 660
K +AREHEERQRRAEEEARE AWRAEQERLE IQKAEELRIARE +KQRIILEEERRKQA
Sbjct: 601 KTLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRIILEEERRKQA 660
Query: 661 AKLKLLEVEERMAKRQAEAVKSSSLTSDIPEKNISNVAKDVSRVADTIDWEDGEKMVERI 720
AKL LLE+EERMAKRQAE VKSS+ TSDIPEK I V KDVSR+AD +DWEDGEKMVERI
Sbjct: 661 AKLMLLELEERMAKRQAETVKSSTSTSDIPEKKIPGVVKDVSRLADAVDWEDGEKMVERI 720
Query: 721 TTSASSESSSMVRSSEVGPRSQFSRDASSAFVDRGKLVNSYRRDLYERRSGSQFVLQGQS 780
TTSASSESSS+ RSSEVG RSQFSRDAS AFVDRGK VNS+RRD YER SGSQFV+Q QS
Sbjct: 721 TTSASSESSSINRSSEVGLRSQFSRDASPAFVDRGKSVNSWRRDFYERGSGSQFVVQDQS 780
Query: 781 IGYNSQRQEPFVGGQLSSRKEFYGGAGFTTSRISHGRGITKPQSDDYSELRVQRPNLSGS 840
GYN R+E +GG+ +SRKEFYGGAGFTTSRISH RGIT+PQSDDYS+LR RPNLSG
Sbjct: 781 TGYNGPRREAPIGGRATSRKEFYGGAGFTTSRISHRRGITEPQSDDYSQLRGHRPNLSGG 840
Query: 841 GDHYNKSQEFDSELQDSFENFGDHGWRQEGGHNNVYFPYPERVNSISEPDGSYYVGRTRY 900
GDHY++S +FDSE QD+ ENFGDHGWRQE G NN YFPYPERVN ISE DGSY VGR+RY
Sbjct: 841 GDHYSQSPDFDSEFQDNVENFGDHGWRQETGRNNFYFPYPERVNPISETDGSYSVGRSRY 900
Query: 901 SQKQPRVLPPPSVALMQKSSIRGEYESVTRD--------THPVRNVSTAQARYIHHENCT 960
SQ+QPRVLPPPSVA +QKSS+RGEYESV RD HP RNVSTAQ YIHHEN +
Sbjct: 901 SQRQPRVLPPPSVASIQKSSVRGEYESVPRDIVESEIQYDHPARNVSTAQTGYIHHENRS 960
Query: 961 LSKIIDVNFENVENEEQKPDGGITLRCDSQSTLSVFSPPISPTHLSHEDLDDSGDSPVLS 1020
+IIDVN +N ENEEQKPD TLRCDSQSTLSVFSPP SPTHLSHEDLDDSGDSPVLS
Sbjct: 961 FPEIIDVNLDNAENEEQKPDCDTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLS 1020
Query: 1021 ASREGTLSIEDNESAVPAKASKVIMMTSTRVSTGDEDDWVVADEHVQEQEEYDEDDDGYQ 1080
ASREGTLSIED ESAVP K K IM++STRVSTGDED+W V +EHVQEQEEYDEDDDGY
Sbjct: 1021 ASREGTLSIEDTESAVPTKCGKEIMISSTRVSTGDEDEWAVGNEHVQEQEEYDEDDDGYD 1080
Query: 1081 EEDEVHEGEDENIDLEQDFDDLQLNDTDRGSPHMLDNLVLGFNEGVEVVMPNDEFERIPG 1140
EEDEVHE EDENIDL QDFDDL L+ D+GSPHMLDNLVLGFNEGVEV MPNDEFERI G
Sbjct: 1081 EEDEVHEVEDENIDLAQDFDDLHLD--DKGSPHMLDNLVLGFNEGVEVGMPNDEFERISG 1140
Query: 1141 DEENMYVATEISSCIKEEQGSSEGLQVDGKVCQYVDASSQVRISDLEEMQDKVMQSKIAQ 1200
+EENM+V EISSCI+EEQGSSE LQVD +CQY DASSQVRI D EEM+D V+ SK AQ
Sbjct: 1141 NEENMFVTPEISSCIREEQGSSEVLQVDSNICQYPDASSQVRIPDTEEMKDLVILSKPAQ 1200
Query: 1201 ALPELEITEQGN-SCRSSLSVQQPISCSVTMGSQSSSGQVIVPNAVLSGQAEPPFKLQFG 1260
ALP E+TEQG SCRS +SVQ PIS SV+M SQS GQVIVPNA +SGQAEPP KLQFG
Sbjct: 1201 ALPGSELTEQGKFSCRSGVSVQLPISSSVSMASQSPPGQVIVPNAAISGQAEPPVKLQFG 1260
Query: 1261 LFSGPSLIPSHVPSIQIGSIQMPLHLHPQITPSIIQMHSSQPPLFQFGQLRYPSSVPQGV 1320
LFSGPSLIPS VP+IQIGSIQMPLHLH QITPS+ MHSSQPPLFQFGQLRY SSV QGV
Sbjct: 1261 LFSGPSLIPSPVPAIQIGSIQMPLHLHAQITPSMTHMHSSQPPLFQFGQLRYTSSVSQGV 1320
Query: 1321 LPLPPQPPTFILPIVQTGFPSNENPREALSVQTPEETCFNKSRKHNVFPFMKDNQV-LVS 1380
LPL PQP TF+ P VQTGFP N+NP +A S+QT +ETC + SRK++V PF+ DNQ L S
Sbjct: 1321 LPLAPQPLTFVPPTVQTGFPLNKNPGDAPSIQTSQETCAHNSRKNDVLPFLMDNQQGLAS 1380
Query: 1381 RSLNVNSSGDSKSLPLTESIEAKVMNQQDQTSSSCIDESNSRSQPGFQAEHQRHH--VPT 1440
RSL NSSG+SKSLPLT++ E++V+ QQDQT+ SCIDESNSRS+ GFQAEHQRHH V T
Sbjct: 1381 RSL--NSSGESKSLPLTDNKESEVITQQDQTAGSCIDESNSRSESGFQAEHQRHHHNVST 1440
Query: 1441 SGNHYMVSRGKESKGRAQDGMWPFDSVSRDKGLRRFKTRGLFPGGRGKKFIFAVKNSGSR 1500
S +HY+V+RGKES+GRAQDGM PFDSVSRDKGL K RG F GGRGKK+IF VKNSGSR
Sbjct: 1441 SDDHYVVTRGKESEGRAQDGMGPFDSVSRDKGLGGSKARGQFHGGRGKKYIFTVKNSGSR 1500
Query: 1501 LPFAGSESTRLDNGGLQRQPRRNTPRTEFRVRETVDKKLPNSQVSSNYVEVDDKPTDSGI 1560
L F SESTRLD+ G QR+PRRN PRTEFRVRETVDKK NSQVSS+ VEVDDKPT SG
Sbjct: 1501 LSFPASESTRLDSSGFQRRPRRNAPRTEFRVRETVDKKSSNSQVSSSNVEVDDKPTVSGR 1560
Query: 1561 SAVNSGRNGTRKVVISNKPSKRALESEGFSSVVSTSLELDFGNRSEKGVKKDYLGKSQGN 1620
SA +S RNGTRKVVIS KPSKRALESEG SS VS+SLELD GNR+EKGVKK+YLGKSQG+
Sbjct: 1561 SAASSARNGTRKVVISTKPSKRALESEGLSSGVSSSLELDAGNRTEKGVKKEYLGKSQGS 1620
Query: 1621 QYSGEGIFRKNIGSWEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQ 1680
QYSGEG FRKNI S EDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQ
Sbjct: 1621 QYSGEGNFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQ 1680
Query: 1681 REKEIKAKSHNSKIPPKSRSTLKSALSSVNSRKVYAAKEAEMVKRTRLDFATRDGRGSGS 1740
REKEIKAKSHNSKIP +SRS K A SSVNS KVYAAKEAE VKRTR DF DGRGSG+
Sbjct: 1681 REKEIKAKSHNSKIPRRSRSNSKIASSSVNSSKVYAAKEAETVKRTRSDFVATDGRGSGN 1740
Query: 1741 IMVSSAFSSPLVSQPLAPIGTPALKSDSQMERLHTARPILTSTPALATSNGRNLESGLMF 1800
I+VSSAFSSP+VSQPLAPIGTPALKSDSQ ER HTAR I +S PALAT +GRNLES MF
Sbjct: 1741 IVVSSAFSSPIVSQPLAPIGTPALKSDSQTERSHTARSIQSSAPALATGDGRNLESSSMF 1800
Query: 1801 GKKNDILDNVQTSLTSWGNSHRNQQVMALTQTQLDEAMKPAQFDLHPPV-DHSSLTGDPF 1860
KKNDILDNVQTS SWG S NQQVMALTQTQLDEAMKPAQFDLHPPV DHSSL GDP
Sbjct: 1801 DKKNDILDNVQTSFASWGGSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSSLAGDP- 1860
Query: 1861 IVSSPSILASDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCATFLGIGPTDLCHS 1920
V SPSILA DRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCAT LGIGPT LCHS
Sbjct: 1861 NVPSPSILARDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCATLLGIGPTGLCHS 1920
Query: 1921 DIQIPHKLSGAENDCNLFFDKEKHYSESCSPIENSEAEAEAAASAVAVAAISSDEILVTN 1980
DIQIPHKLSGAENDC++FF+KEKH+SESC+ IE+SEAEAEAAASAVAVAAISSDEI VTN
Sbjct: 1921 DIQIPHKLSGAENDCHIFFEKEKHHSESCTHIEDSEAEAEAAASAVAVAAISSDEI-VTN 1980
Query: 1981 GLDTCSVSVTDTNNIGGGDVDIVTAGTAGDQQLASKTRADDSLTVTLPADLSVETPPISL 2040
GL TCSVSVTDTNN G GD++++TAG+AGDQQLASKTRADDSLTV LPADLSVETPPISL
Sbjct: 1981 GLGTCSVSVTDTNNFGSGDINVITAGSAGDQQLASKTRADDSLTVALPADLSVETPPISL 2040
Query: 2041 WPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQKCS 2100
WPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQ+QTQK S
Sbjct: 2041 WPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQSQTQKSS 2100
Query: 2101 APAPGPLGSWKQCHSGVDSFYGPPASFTSPFVSPGGIPGVQGPPHMVVYNHFAPVGQFGQ 2160
APAPGPLGSWKQCHSGVDSFYGPP F+ PF+SPGGIPGVQGPPHMVVYNHFAPVGQFGQ
Sbjct: 2101 APAPGPLGSWKQCHSGVDSFYGPP-GFSGPFISPGGIPGVQGPPHMVVYNHFAPVGQFGQ 2160
Query: 2161 VGLSFMGATYIPSGKQHDWKHSPGP-SLCVE-DQKNLNMVSAQRMPTNLPPIQHLAPGSP 2220
VGLSFMGATYIPSGKQHDWKHSPGP SL VE DQK LNMVSAQRMPTNLPPIQHLAPGSP
Sbjct: 2161 VGLSFMGATYIPSGKQHDWKHSPGPSSLGVEGDQKTLNMVSAQRMPTNLPPIQHLAPGSP 2220
Query: 2221 LLPMASPLAMFDVSPFQASPEISAQTRWPSSASTVQAVPLSMPL-QHQAGGVLPSHFSHP 2280
LLPMASPLAMFDVSPFQASPE+S Q RWPSSAS+VQ VPLSMPL Q QA GVLPSHFSH
Sbjct: 2221 LLPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPLQQQQAEGVLPSHFSHS 2280
Query: 2281 SSADPSFTVNRFPGSQP---ADQKRNFPVGADASVTQLPDELEIVDESSCLSSGAGVPNV 2340
SSADPSFTVNRFPGSQP +D KRNF V DA+VTQLPDEL IVD SSC+SSGA VPNV
Sbjct: 2281 SSADPSFTVNRFPGSQPSVASDHKRNFSVATDATVTQLPDELGIVDASSCVSSGASVPNV 2340
Query: 2341 DTNSLVVNSATDAGKTSVRPNCSSNNSGQNASTNLKSQYP-HKGISAHQYTHSSGYNYQR 2400
D NSL V+S TDAGKT V+ NCSSNNSGQN+ TNLKSQ P HKG+S QY+HSSGYN+QR
Sbjct: 2341 DINSLSVSSVTDAGKTGVQ-NCSSNNSGQNSGTNLKSQSPQHKGVSTQQYSHSSGYNFQR 2400
Query: 2401 GGASQKNSSGGSEWAHRRTRFMGRNQSGAEKNFSSGKVKQIYVAKQSSSRNL 2416
GGASQK+SSGG EW+HRRT FMGRNQSGAEKNFSS K+KQIYVAKQ+SS NL
Sbjct: 2401 GGASQKHSSGGGEWSHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQTSSGNL 2411
BLAST of Sed0025145 vs. TAIR 10
Match:
AT3G50370.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 26 plant structures; EXPRESSED DURING: 15 growth stages; Has 27734 Blast hits to 16708 proteins in 1259 species: Archae - 81; Bacteria - 3434; Metazoa - 10876; Fungi - 2514; Plants - 987; Viruses - 212; Other Eukaryotes - 9630 (source: NCBI BLink). )
HSP 1 Score: 1136.3 bits (2938), Expect = 0.0e+00
Identity = 944/2395 (39.42%), Positives = 1283/2395 (53.57%), Query Frame = 0
Query: 75 EHQRLDSLGSSAGPTGGGVMGNGQRPTSAGIGWTKPRTNDFPDKEALNGNVVDRIDPSLR 134
EH+R+DS GSS +GGG+ G+G RP S+GIGW+KP A +G++ +
Sbjct: 27 EHERVDSSGSSF-HSGGGIAGSGTRPASSGIGWSKPAAT------ATDGDIGN------H 86
Query: 135 SVDGVSGGSS-VYMPPSARAGITQPDCVSSCFLSVHTTLEKTPILGGEDFPSLQATLPSA 194
+ +GV+ GS+ + ++R G +P + F V EK L GEDFPSL+A+LPSA
Sbjct: 87 TGEGVTRGSNGLNTSLASRVGAAEP--MERAFHHV----EKVATLRGEDFPSLKASLPSA 146
Query: 195 VAPPQKQKDGMNSKLKHAS-EGSYEERRDTSHLSSS-IDARTKY---RSSLKSPPRENSK 254
QKQK+G+N K K A+ E +E R S +SSS +D R + RS L + E+
Sbjct: 147 SVSGQKQKEGLNQKQKQAAGEDFSKEPRGVSGMSSSLVDMRPQNQSGRSRLGNELSESPS 206
Query: 255 NGDFFSSPEFSRKQEDIFPDPLPLVSVNPRSDWADDERDTSQGLAGMGRDRGHPKSEAYW 314
D S E RK+E F PLPLV + PRSDWADDERDTS GL RD G+ K+E +W
Sbjct: 207 FSDGLHSSEHVRKKE-YFAGPLPLVRLAPRSDWADDERDTSHGLRDRDRDHGYSKNEPFW 266
Query: 315 ERDFDMPRVSSLPHK-PIPNFSQKWNMRDNESGKFRSSDIRKVDPYGRDARTPSREGWEG 374
+R FD+ R LP K N K R+NE K + +R V GR+A
Sbjct: 267 DRGFDL-RPHVLPQKHAASNVFDKPGQRENEIAKSSLTQVRPVSGGGREA---------N 326
Query: 375 SFQKNNPIPKDGFHSDSGNPRNDIAARPTSIDRE-ANAGNMHVSHFQEHA-HKDGRKDTG 434
+++ ++P+ +G + N+ ARP+S RE A N +S +E+ + G ++
Sbjct: 327 AWRVSSPLQNEGAN------HNNYGARPSSRGREAAKKSNYVLSSSRENVWNNSGAREAP 386
Query: 435 FGQTGRQTWNRATESYSSQEPDRTTRDKYGSEQHNRYKGEIHNTSVANSSYYSGSKQIPT 494
+ GRQ WN +S S++ RD YG E NR
Sbjct: 387 YQHGGRQPWNNNMDSSSNR--GTYNRDGYGIEHQNR------------------------ 446
Query: 495 DEPLLNFGKDRRSFTKNEKPFMEDPFMKDFG------ADPFTAGLVGMFKKKDVIKHTDF 554
D+RSF K++KP +EDPFMKDFG DPF L KKK+ +K T+F
Sbjct: 447 ---------DKRSFFKSDKPHVEDPFMKDFGDSGFDVHDPFPV-LGVTKKKKEALKQTEF 506
Query: 555 HDPVRESFEAELERVQQMQEQERQRIIEEQERALELSRREEEERKRIAREHEERQRRAEE 614
HDPVRESFEAELERVQ+MQE+ER+RIIEEQER +EL+R EEEER R+ARE +ERQRR EE
Sbjct: 507 HDPVRESFEAELERVQKMQEEERRRIIEEQERVIELARTEEEERLRLAREQDERQRRLEE 566
Query: 615 EAREVAWRAEQERLENIQKAEELRIARERDKQRIILEEERRKQAAKLKLLEVEERMAKRQ 674
EARE A+R EQERLE ++AEELR ++E +K R+ +EEERRKQAAK KLLE+EE++++RQ
Sbjct: 567 EAREAAFRNEQERLEATRRAEELRKSKEEEKHRLFMEEERRKQAAKQKLLELEEKISRRQ 626
Query: 675 AEAVKSSSLTSDIPEKNISNVAKDVSRVADTIDWEDGEKMVERITTSASSESSSMVRSSE 734
AEA K S +S I E ++ K+ AD +DWED E+MV+RITTS++ + S +RS E
Sbjct: 627 AEAAKGCSSSSTISEDKFLDIVKEKDS-ADVVDWEDSERMVDRITTSSTLDLSVPMRSFE 686
Query: 735 VGPRSQFSRDASSAFVDRGKLVNSYRRDLYERRSGSQFVLQG-QSIGYNSQRQEPFVGGQ 794
SQFSRD S F DR K ++R++ E S S+F+ Q +++ ++ Q
Sbjct: 687 SNATSQFSRDGSFGFPDRQK--PTWRKEDIESGSNSRFIPQNLENVPHSPQ--------- 746
Query: 795 LSSRKEFYGGAGFTTSRISHGRGITKPQSDDYSELRVQRPNLSGSGDHYNKSQEFDSELQ 854
+EF+G AG+ ++ G + D Q + G G + ++ +SE +
Sbjct: 747 ----EEFFGTAGYLSAPSYFKPGFPEHSID-------QSWRIPGDGRTHGRNYGMESESR 806
Query: 855 DSF-ENFGDHGWRQEGG--HNNVYFPYPERVNSISEPDGSYYVGRTRYSQKQPRVLPPPS 914
++F E +GD GW Q G + Y PYPE++ E D Y GR RYS +QPRVLPPP
Sbjct: 807 ENFGEQYGDPGWGQSQGRPRHGPYSPYPEKLYQNPEGDDYYPFGRPRYSVRQPRVLPPPQ 866
Query: 915 VALMQKSSIRGEYESVTRDT--------HPVRNVSTAQARYIHHENCTLSKIIDVNFENV 974
+ QK+S R E E T H R ST A YI + ID
Sbjct: 867 ES-RQKTSFRSEVEHPGPSTSIGGINYSHKGRTNSTVLANYIEDHHVLPGSGID------ 926
Query: 975 ENEEQKPDGGITLRCDSQSTLSVFSPPISPTHLSHEDLDDSGDSPVLSASREGTLSIEDN 1034
E ++ D +T RCDSQS+LSV SPP SP HLSH+DLD+S DS VL SR G +
Sbjct: 927 --EHRRFDTKLTGRCDSQSSLSVTSPPDSPVHLSHDDLDESADSTVLPTSRMGEDAGLLE 986
Query: 1035 ESAVPAKASKV----IMMTSTRVSTGDEDDWVV-ADEHVQEQEEYDEDDDGYQEEDEVHE 1094
+ P +S + +MM + VS D ++W + ++E +QEQEEYDED+DGYQEED++H
Sbjct: 987 KGGAPIISSDIGKDSLMMATGSVSCWDNEEWTLDSNERLQEQEEYDEDEDGYQEEDKIH- 1046
Query: 1095 GEDENIDLEQDFDDLQLNDTDRGSPHMLDNLVLGFNEGVEVVMPNDEFERIPGDEENMYV 1154
G DENIDL Q+ +++ L+D D NLVLGFNEGVEV +P+D+FE+ + E+ +
Sbjct: 1047 GVDENIDLAQELEEMHLDDKD-------SNLVLGFNEGVEVEIPSDDFEKCQRNSESTFP 1106
Query: 1155 ATEISSCIKEEQGSSEGLQVDGKVCQYVDASSQVRISDLEE--------MQDKVMQSKIA 1214
+ + +++ S + Q S + + + MQ+ + I
Sbjct: 1107 LHQHTVDSLDDERPSIETSRGEQAAQPAVVSDPLGMHNASRTFQGAETTMQNLTVHPNIG 1166
Query: 1215 QALPELEITEQGNSCRSSLSVQQPISCSVTMGSQSSSGQVIVPN-AVLSGQAEPPFKLQF 1274
+ E+ + +S +S P+ + SS Q +P + S Q E P K QF
Sbjct: 1167 R--QSFEVASKVDSTSNSTVSTHPVIPLHSAALHPSSLQTAIPPVSTTSAQMEEPVKFQF 1226
Query: 1275 GLFSGPSLIPSHVPSIQIGSIQMPLHLHPQITPSIIQMHSSQPPLFQFGQLRYPSSVPQG 1334
GLFSGPSLIPS P+IQIGSIQMPL LHPQ S+ M QPPL QFGQL Y S + QG
Sbjct: 1227 GLFSGPSLIPSPFPAIQIGSIQMPLPLHPQFGSSLTHMQQPQPPLIQFGQLPYTSPISQG 1286
Query: 1335 VLPLPPQPPTFILPIVQTGFPSNENPREALSVQTPEETCFNKSRKHNVFPFMKDNQVLVS 1394
VLP PP + T + N+NP ++VQ + N ++ ++
Sbjct: 1287 VLP-PPHHSVVQANGLST-YALNQNPGSLVTVQLGQGNSANLLARNAATSVSHPQLSVLR 1346
Query: 1395 RSLNVNSSGDSKSL---PLTESIEAKVMNQQDQTSSSCIDESNSRSQPGFQAEHQRHHVP 1454
R NV+ G K+ P SIEA V Q+ QP Q +P
Sbjct: 1347 RPTNVSDEGTLKNANLPPARASIEAAVSPQK---------------QPELSGNSQ---LP 1406
Query: 1455 TSGNHYMVSRGKESKGRAQDGMWPFDSVSRDKGLRRFKTRGLFPGGRGKKFIFAVKNSGS 1514
+ +S GK + Q G V D AV+NSG
Sbjct: 1407 SR----KMSHGKSNFAERQSGY----QVQTDTS--------------------AVRNSGL 1466
Query: 1515 RLPFAGSESTRLDNGGLQRQPRRNTPRTEFRVRETVDKKLPNSQVSSNYVEVDDKPTDSG 1574
R +E +R+D+GG +R R+ R EFRVRE SN+ D+ +G
Sbjct: 1467 R-SSGTAEVSRVDSGGNRRYRRQ---RVEFRVRE------------SNWPSSDENRNGNG 1526
Query: 1575 ISAVNSGRNGTRKVVISNKPSKRALESEGFSSVVSTSLELDFGNRSEKGVKKDYLGKSQG 1634
A S + G+RK V+SNK K+AL+S +S ++ + G E + KD + K+
Sbjct: 1527 -RAQTSTKIGSRKYVVSNKSQKQALDSS--ASGLNAMQKTVSGGSFENRLGKDAVVKNPL 1586
Query: 1635 NQYSGEGIFRKNIGSWEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRRE 1694
+ SG+ ++N+ S +++DAPLQ GI+RVFEQ GIEAPSD+DDFIEVRSKRQMLNDRRE
Sbjct: 1587 SPNSGQANLKRNMVSEKEIDAPLQIGIVRVFEQQGIEAPSDDDDFIEVRSKRQMLNDRRE 1646
Query: 1695 QREKEIKAKSHNSKIPPKSRSTLKSALSSVNSRKVYAAKEAEMVKRTRLDFATRDGRGSG 1754
QREKEIK KS +K K RST ++ ++ S + A A K+
Sbjct: 1647 QREKEIKEKSQAAKAFRKPRSTFQNNTTAARSNRSPPASRAANNKQ-------------- 1706
Query: 1755 SIMVSSAFSSPLVSQPLAPIGTPALKSDSQMERLHTARPILTSTPALAT--SNGRNLESG 1814
F+ Q LAPIGTP+ K DS ++ + + AL N +N SG
Sbjct: 1707 -------FNPVSNRQTLAPIGTPSPKIDSHVDEKSGSNKSTQESSALPVIPKNDQNPASG 1766
Query: 1815 LMFGKKNDILDNVQTSLTSWGNSHRNQQVMALTQTQLDEAMKPAQFDLHPPVDHSSLTGD 1874
+F KN +LDN T + +WGN Q VMALTQ+QLDEAMKP V++ +
Sbjct: 1767 FVFSNKNKVLDNSHTPVGTWGNQLTYQPVMALTQSQLDEAMKPVSHLSCVSVENGANRIS 1826
Query: 1875 PFIVSSPSILASDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCATFLGIGPTDLC 1934
+S S++ + +FSS+ +PI+SLLA KIQFGAVTS TV+PP T
Sbjct: 1827 ESNSTSTSVVPKNNTFSSSTSPINSLLAEGKIQFGAVTSSTVIPPCGGRT---------- 1886
Query: 1935 HSDIQIPHKLSGAENDCNLFFDKE-KHYSESCSPIENSEAEAEAAASAVAVAAISSDEIL 1994
E D +L+F+K+ KH + S + IE EAEAEAAASA+AVAAI++DE
Sbjct: 1887 -------------EKDSSLYFEKDNKHRNPSSTGIEICEAEAEAAASAIAVAAITNDE-T 1946
Query: 1995 VTNGLDTCSVSVTDTNNIGGGDVDI-VTAGTAGDQQLASKTRADDSLTVTLPADLSVETP 2054
N L T SV +T GG ++D +GT G Q S+++A++SL V+LPADLSV+T
Sbjct: 1947 SGNALSTGSVLPVETKIYGGTELDDGAASGTVGGQ--TSRSKAEESLIVSLPADLSVDT- 2006
Query: 2055 PISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQT 2114
PISLWP LPSP N S+QM++HFP G P +PFY++NPML GP+F FGPH ++ TQ+Q+
Sbjct: 2007 PISLWPQLPSPHN-SNQMITHFPPG-PPHYPFYDVNPMLRGPIFAFGPHHDA-GATQSQS 2066
Query: 2115 QKCSAPAPGPLGSW-KQCHSGVDSFYGPPASFTSPFVS-PGGIPGVQGPPHMVVYNHFAP 2174
QK GP +W +Q HSGVDSFY PPA FT PF++ PG IPGVQGPPHM VYNHFAP
Sbjct: 2067 QKGPVTVSGPPTTWQQQGHSGVDSFYAPPAGFTGPFLTPPGAIPGVQGPPHMFVYNHFAP 2126
Query: 2175 VGQFGQVGLSFMGATYIPSGKQHDWKHSPGPSLC-VEDQKNLNMVSAQRMPTNLPP--IQ 2234
VGQFG GLSFMG TYIPSGKQ DWKH+P S V ++N + M N+ P +Q
Sbjct: 2127 VGQFG--GLSFMGTTYIPSGKQPDWKHNPNVSSSPVGGDGDVNNPNVASMQCNIVPASLQ 2140
Query: 2235 HLAPGSPLLPMASPLAMFDVSPFQ-ASPEISAQTRWPSSASTVQAVPLSMPLQHQAGGVL 2294
HL P+ MFD SPFQ +S E+ + RWP + P +M +Q Q G
Sbjct: 2187 HL-----------PMPMFDPSPFQSSSQEMPIRARWPYMPF---SGPPTMQMQKQQEGTD 2140
Query: 2295 PSHFSHPSSADPSFTVNRFPGSQPADQKRNFPVGADASVTQLPDELEIVDESSCLSSGAG 2354
S+ PS P F N P P +P ++V +VD S+ SS G
Sbjct: 2247 GSNL--PS---PQFNNNMLPPPPP----NRYPNVQASTVVD-----AMVDSSNAYSSTTG 2140
Query: 2355 VPNVDTNSLVVNSATDAGKTSVRPNCSSNNSGQNASTNLK-SQYPHKGISAHQYTHSSGY 2413
P S + + ++ + P Q S+ K +Q H G +H + H
Sbjct: 2307 APPAKPTSTLSDPNSNNTQNPNGPGFKPPQQQQQQSSQEKNTQSQHVGGPSHHHQHQHHQ 2140
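Each HSP header above reports identities and positives as a count over the aligned length, followed by a percentage. The percentages can be re-derived from the counts, which is a quick sanity check when these reports are copied or post-processed. The following is a minimal Python sketch, not part of the database's own tooling: the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the regular expression assumes the header layout shown on this page.

```python
import re

# Matches the "Identity = a/b (p%), Positives = c/d (q%)" part of an HSP header.
# The optional "i" also accepts the "Postives" spelling that some exports use.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP header line and recompute its percentages (hypothetical helper)."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Percentages recomputed from the raw counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages exactly as printed in the report, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    header = "Identity = 944/2395 (39.42%), Positives = 1283/2395 (53.57%), Query Frame = 0"
    stats = parse_hsp_stats(header)
    print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed values with the reported ones flags any header whose counts and percentages have drifted apart.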
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038883483.1 | 0.0e+00 | 81.08 | uncharacterized protein LOC120074436 [Benincasa hispida]
XP_022950041.1 | 0.0e+00 | 81.05 | uncharacterized protein LOC111453246 [Cucurbita moschata] >XP_022950042.1 unchar...
XP_022133325.1 | 0.0e+00 | 80.79 | uncharacterized protein LOC111005936 isoform X1 [Momordica charantia]
KAG7034343.1 | 0.0e+00 | 80.92 | hypothetical protein SDJN02_04070, partial [Cucurbita argyrosperma subsp. argyro...
XP_023543957.1 | 0.0e+00 | 80.93 | uncharacterized protein LOC111803678 isoform X1 [Cucurbita pepo subsp. pepo] >XP...
Match Name | E-value | Identity (%) | Description
A0A6J1GDR0 | 0.0e+00 | 81.05 | uncharacterized protein LOC111453246 OS=Cucurbita moschata OX=3662 GN=LOC1114532...
A0A6J1BUX9 | 0.0e+00 | 80.79 | uncharacterized protein LOC111005936 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1IST3 | 0.0e+00 | 80.72 | uncharacterized protein LOC111478360 OS=Cucurbita maxima OX=3661 GN=LOC111478360...
A0A5D3CNG4 | 0.0e+00 | 79.45 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A6J1BUR5 | 0.0e+00 | 79.73 | uncharacterized protein LOC111005936 isoform X2 OS=Momordica charantia OX=3673 G...
Match Name | E-value | Identity (%) | Description
AT3G50370.1 | 0.0e+00 | 39.42 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...