Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGTACGGAAATGCCATGGCTGAAATGAAGATAATCACTCGAACTGTGCAGAGAAAAAGGTCGACGGTGCGTTGCTTAGCGAAGGTGCTTCTTCCTCGGTAGATATTTATAGCAAAATAGTCTCAATCGGGTCTTTCGGAGTTCGGATACCCGGGCCAGCTGAAATGTCGTTGCTCCTTCTGTCCCTTTTAAATTTTGGAAATTTTTTGCCCGTGGTTGCTATGTCATTCTCAAAATGTGTATAAATTTTTTTTTTCGTATTTACCAACAATGGCTAATAATCGAAATATCGATCTTTAAGATAATAATAGTTGTAAATTATGCTTGGATTGACTTTTTCATATTTATTATATCATGTTAAATCAATCAAATCAACTGATGTTTATATTAAACCTAACTCGATGATTGAGCATTTGTACCTCTGTTTAAAAATTATAGGTTTAAACCCACCCCTTATTTGTAGTGTAATATTTAAAAAAGAAATACTAAATTTTGTCTAATAGATATGTCATGGTGGGAAGTATATAAAATCACAATTCAATAAGTTTTTTAAAGTTTAAAAACTAGCCAACCTAAGCATGACTTAAATTGGTCAAGACATCAGTGATTTTCTTAAATAAATAATGTAGGTTCGAATCTGTACTAAAGAATAAAAGTTCGAATCTTTACTCCCATGTTTGCATTGTAATATTCTAAAAAAATTTGAAAACTAAAACAATACATCATCAAACTTTATAGACTAATTTTGCAATTTAATCAAATATTTATATGTGTCATTTTTTAACTAACCATTGTTGGATGAAATAAATTGATAATATATGTTTGTATTCTTCACCTTATTAACACAATTTCGTTATTAATATTTTAAAAGTTTGAATGTCACTCTCGAAGTATCAATTTTGTTGGAATAGTGTGTTTATTCAACAAACATTATTGATTTTTTTTAAACAGAGATGTGACATAAAACACTATTTCATACTACCTAAAATTTTTTAATATAAAATTCAGATAGTGTTAATTTATATTTTTCAACTAGATAAAACTAATTGGTAAGTTTGGATAAAAATTTACCTTCTGAAGATGCCGGTTCATAAAAAATAATAGTTTTGAAGATTACATCACTATTCATTTATTCCATTTTAATATTGTTATCGAGTAAATACTTAATCGAAAATTTTTAACATTCAAAAGTTAAAATGAAAATTTTCAAAATTGAAAACTGAAAAATCGTATAACCTTAACGATCAAAAGATATTTTCATCCATATATAATCATTATCGTGATATGATATAATGATATATGAAGAGGGGAATTGGTAAAGTACAAAACACAAAGGATTGGTCGATCCGATATATAGGAATCAAAATTCACAAGGCAATGGATTTTATCAACAACACACTTGTCAAATTTATACAGGAAATGGGAGCTTTTTGAAGATTCTCATCTATCTACTTCATAGTTTCTGAATAGTGTTTGATGGGACTGAACTATATAATTTATGACACATTTCTTTCAGAACATAAAAATATTCACTGCCAGATGATATGGCTGAATTTTTCCCTATTTTCAGAAAATATGACTACAAATTCACACGTTATAATTTATAACCAGTGATTCAATCTACAAGTTTGCTTTAAATCTGGTTTTACTGTTTCCTCCAAATTTTCAAATAAACTTTCATTTTCTTTAGGTGTTCTCTCTCTGTAACTATGCAGAGGAAGGCCTCAGATTTGAGCAATGGCCATGGGGTTTACTGCTCAAACGCAGCACCCGGTGCTAGAAATCCATTTAGGCCTGCAAATCAAGGATCGACCTTCCAAGCTATAGGGACTAATCTCACAAATAAACCTCTATAACTTCATCCAATTGGCATACAAGTACAAGCCAGCTAAATTTGCTAGAGTGCAGTTTTTTTCTTGTAAAGATACAACAAATGAATTGAGGAAATGTAAAAGCCCTGTGCCTAAAATGCATTCATTACCTCCGGGCTCTTCCCTTGG
ATTTGGAATCAAAATGGTGGGCCTTTAGCTTCTTTTTCCGCATCACAGTTGCAGCACCACCACTCTCTCTGAAAGGCATTCCCATGAGAGCCAAGAGTCGCTGAGCTTCTTGATCTGTTTGAGCTGTTGTTGTGATGCAAATGTCCATTCCCCTCGGTTTGCCAAGTGTGTCGAACTTTATTTCTGGGAATACAATTTGTTCGCGGATACCGATGGAGTAATTGCCATGTCCATCAAAGCTACTTGAGTTCAGTCCTTGAAAGTCCCTTGTCCTAGGCAGCCCTAAGTTGATAAGACGATCCAGGAAGGAGAACATCACCTGCGAATGCAGTCAACAAGCAAGCAGCATAAGAATTGAATCGCCTTAAGATTTACGACACGAAAGCCATCAGCCACACCCATAACTTCTTCAATGTAATGATGCTAGCTTCCATTATTACAACTTCAGATCTTTAAAAACCTTTAGGTTTTTTGTCCTTCCACAAACTCCATAATAATTGGCACAAAAATCTTGTTCACTTAATGAGTATCACGGTGATAAGTGTATCGAGCCCATGAAGGAACAACTCCCTCCTCCAAGGTCATTCCTTTGCACTTGTTTTAGGCAATTATCTCAAAGAATCGAATTCTTAACATCATCATACAGGAGAAATCCACCACACTAAACATATTAAAGCATATACAGATCATCAAAGTAGTATAACTAATGCCCGCAGCCACACCCATAACTTCTTCCATTTAATGATGCTAGATACCTTCCATTATTACAACACATCAGAGAATCCCGCGTCTATTTTGTCCTTCCACGTACTCATTGATAATTGGCACAAAAATCATGTTCACTAAACGAGTATCATGGTAATGTATATCTAGCCCATGAAGGAGCAAGTCCCTCCTCCGAGGTCATTCCTTTGCACTTGTTTTAGGCAACAATTATCTTGAAGAATCGAATTCCTAACATCATACAAGAGAAATCCACTACACTGAACACATTAAAGCATATACAGATCATCGAAGTAATATAACTATTCAAGACTGGAAAGATAGACATGAAAAGGAACAAAAAAATGTCTATGAATACGAATATTGAAAATCAGGACCAAATACCAAAACGTATTTTCCAGTTCTAGATATGCTGAACTAACTACTGAGCAGCACTGGGGACTAGAACAGAAGCCGAAAGATCAAAAGAAACAGTGACTAACTGAAAATTTTAAGAAGAACAAATAAAACTCACATTTCCCCGAAGAGTTGCAGCAATCCCAAGTGGTTGACCTTCCCTGATCTTGAAGGTTGCAATGGACTTCTTTGCACGTGTTTTCACAGGTCGCTGTCCTGTAATGGACGCTAGTTCATTCATTGCAGCTTCCAAACCCTTCGCATTCTGTTGTGCATCTCCAATACCACAGTTCACAACAATCTTTTCAATTTTTGGGACCTAAAAGAAGAGGCACAAAAATTACGTTAAGAAAATCATGTCTATTTGCATAATGAGCTGCCAAATGTTTGAGATGGAGGAATCACAGCCACAAAATTCGAACTTAAATCTAAGATCAAACAAGTATAGTAGCACGCTACTGCTTCCTTCAAAAGAAATTCACGGGTATGTATATGAATTTGATCCAAGTTTGGCGGGGGCATTTAATTCATAAGGAATCCTTAGCTTCCATGTATGTGTAAATATTTGCATACAAACATAGAAGTTCACGTGTCTGGTTATACGAATGTCCATTCCCTCCATTAAAGGAACTTTAAAGCTTAGCTAAGCCAAGCGGTTGTATGCATGGTCGAATTCCATGAATTCAAAAAGAAAATCAACAATTTCCAATTCCTTGAGCAATTAAACCAAACAGAACACAAAAATTATCCTTCTCGGAATAATCAATACCACTGCATAGTCAATCCAACCCCATCTCCGCCAAACTTAAAACCCGACGAACTCCGGCGCCCAATTGAGCAAATAAATTAGAATTGAATCGACAACTTCTAATTTACGAAAA
TATTCGGTCGCTACCTGGTGGATATTGGTGTAGGAGAACTCGTCCATAAGCAGCGGCATGATTTTCTCAAGGTAATTGGATTTGAGCCGGTTGACCTTCTCCGCCTCTGCCTTCTCCACAAGCACTATAGAACCACCAGCCGCGGCCTTCACTCTCACCGCCAATGGAGCTACATTGCTAGAGTTTCCGCACGAAACCGGTGCAGCAAAACGGACCGAGAAGAACGGGCAATGGCCATGAAACGACGACGTGGCAGAGCTCAACAGCGAAGAAGAGGCCATCGGAGTTCAGAGACGTTTCTCCTTCGCTCAGTGAGAGTGGAATTAAAAGAGCACTTCAGCCAAAGGAGTTTTTTTTATTCTAATTTCCAAATCCCATGTATAGCCTCGTATTTCCACGTTATTACTAATACGCCATACCAAATAACGCGTTTGCGACTTTGTAAAATTGGATAATAATAGGGGTAAATTCGTCATTTCAGCTTTCTGTGAATGAAATTGGATAAGGGAAAGCTTCACTTTGGACGCCATGCTCTCGTCTGCTAAGAGGATAAAACCTCTTTTATTCAACTATAAATAATATTTTATTTACTATACTTTTTTTTACTAAATAAACTAAAATAATTTTTTTACTATTCATTTTTTATATATAAATATTTTATTTACTATTCACCGTTTTTTTCAATTGTAATAATCATGTGTAAATTATACCATATGACATCAAAAATCACCGAGTTTATTTGGATAGGTGTAAGTTAAGACGACAAAAAAAATAAAGAATAAAAATCCCAATAAATACTTTTTAAGCAAACAACAGAAAGTGGCTTAAAGAGTGATTTGAAATAGAGCAAAATTAAAATGGTTTATTTAAATTGTAATACAACATATACTAAGTCAGTCATCAAATTTATTGGGTGAGTTTTCCTTTTTTTTTCCTTTTTTTCTTTTTGGACATTGAGGATATTACGTATATTAACAAATGTTTTAATTATTACGGAAATTATGCTCAAATGCTTCACTATTTGAATTCTCACACTTCAAAATTTAGCGCTAAAATATTTGTATTCATGAAATGAGTCGGTTGAGTTATACATATTTTGAACTCGATCAATCATTATATTTAAAAGTTTATTTGTAAGATCGTTTGTTAAATGATAGATCCAACTAAATATTGAAATTTAATTTAATCAAGCTTTTTGAGTTTTTTGTCTAAGAATTTGTAAAATAATTGGTATAAATTATTGTATCTAAAGATTTTATTTAGTTTTTGTATCACGTTAAATTAAGACACAATAGTTCAATATTCTATTTTATTTATTAAAAATCATTACTCCAGGGAGTTAAACCAAATATTCGTTAGAAGAACATAAGTCTAAAATAGTTATTTGATAAAATAAAATTCACAAGAGGCATGCGTTTAATTATTTTTTCAAATTAGTATATATATTTTAAATTAATAAAAATAAATTGATATAGTTCAACTCGAGCTAATTTCATAATTATTTATGAACTCATATGACTATGGAGACTCGACATATTTAAATCGAGACTTACTAAACTAAACTCCTAACCTGAGTGACATAACACGATATAAGTCAGGGCAATAAAATTGTCATCGTGAATTTCTAAATATCCTTCTTACATACTTTAAACTTTTGATTTAATTACATATATATATATATATATATTTTGAGTTCAACATTGTAGAGTGATGATTGAACCATCGATCTTTAGAATGATAATAGTGTTTATTGCTCGAATTGACAATTTAATTATATATTTATTTGTGTTGTTTATGACTGTGAATATCAATATTAAAAAAACTATAAAGTTTATTACCACACCAATTAAATTTTTTGACACGAGCATACTTCAAGTGATCAGACATATTTATTTATTACACAAAATATATATATATATTTAGGGTTATCACACAATATTCTAAAAAAAAAGAATACAGCCGACTTAAATTTGGAAAACGAATTATAATTGCTCAATGCT
AAGGAACCAGAGTCGGCGCACTAAAAATATAGATTTCGGAGGCTCACATATATTTAAATCCCAATTTTATTTTTTCCGAGGGTAAAATTTCCCTCTACGACCACGCTCCAAAACGCCAAGTGAAGACAACGTTTAGAAAAATGGACGCGAAGTCTACGGCGCGTTGTTTACGGGGAGCAACATGCGCGTGGGATTATCTTAAAGCCGTAACTAGTTGAATTTTTCAGTTTATATTTTGTTTCTCTTTCTGATGGATGTGGTTTAGGGTGATGAACAGAGACCAGAAAGCCAGCACTCAATTTCTCTCTCGCGTCGATTACTCGGGAAATCCATAGATCAAAACCCTAAAGCTTCCAATCCGCAAGAAGTTCGTGTCCATGGGGATTTCTTTTTGAATAATTTGTGTTTTGGATCCAGGTGTTTGCTTTTTTCTATGTACCAAAACCCTAGTTTCAATTTGGGGCTCTCGGATTGAACAAAACCCTAGAATTTGTTCTGGTGCAACATCTCCGATCACGAGTAATAACGAGAATTTTGTTCGTGTTGGATGGATTATGAGGAAATTGGATTTATGATTCGAGGAAAGCGCGATTCTACGTAATAATCTGTTTTATGGGCTTGGGATTTTATTGAAAAAAGTGGTAAACGGGAGTTGGAGATTGATATATTTTCGGCGAGTGAGCTTGGGTGGATGGAGTTTCACTTTTTGCTGGACGTTGTTGCGCATTCAGCTGGATCAAACATAGGATTGCCTCTGAGTCCGCGTTTTTAATTCTTCCTGACTTGAATTAGCTTCGCTTGAAAAATAATTCGAGGACGAGGTGACGGGAGAAGTTTTAAGCGTGTTGCCTGCATTGAATTAATTGGTTTTCTTTAATGAATGAGTCGGTATAGCATCCAATAAGCGTAGAATTAGCTTTGCGGCAATTCTAAGGCGGCTTTGTTGATAAACCTCTGGTTGTCGTTCGGTAAACTGGTTCTGGTGCTGTTGAACACTTCCGCATATTGATCGACTGCACTGATAATCTTTACTAAGCATCGCAGAATGAGCCTTAAGAAGGACGATTCGAACTCACACGATCAGACCGCTACAATAAAGCATGAGTTGCAAAAGTAATTTTTCTACCATTTGCAGTTCCGATTACGATACCTGGGTTCTTTTCTCATTTCTTCCTGATCCTTTATTATTTGTCTTCCACTGTAAATGCTGGCTGTATTGACTATATTTTCCCGGTCGTTTAGGAAATCAAAGATTTCTTACACGAGAGATTTCCTCTTATCCCTGAGCGAATTGGATATTTGCAAAAAGTTGCCAAGCGGTTTTGACCAATCAATTATCACGTAAGATGCTCTTTCGTGCATCTGTTCTCTTTTCCCTGTCTACAGTTAAATTGTTATTTATTCTTTATATGCGCAACTGAAGTCCGTATCCTACAAAGTAAAGAACCGAAAATAAGCCATGTAGTTATATTCTTATTTTCATTGTTGAATTATTTATCTAAGGATACGTCCCCCCTTGTGTAGTGAATTTGAAGAAGCTTCCTGTGATAGGCAAAGAATTTCTGGAGGTTTGTCTTTGAATAGTTCTAGGCGTAACGAGTATGGTTCATCACCACCCAGTAGGGCAGAAACGAATAATTATTCTCGTCGAATACACGGAAAGAGGGAAGTTCAATCTTCTGGACGAAGTGATAAAGATAGTGATTCACAATCCGATAGGGATTCAGGTATGGTTTCAGCTTAATCTATTCTTAAGTCTAAATAATTACGTTGTTCGATGCAGTTTAATCTGAATATTGTATCCTATCCGGCTTGGTTAACTGGTTCCTGCTGTAGTTGATTGAATTAATGACAATCATACCACCGTCTTGGAAATAATTTCATTCATGCATGTGTGTATGTATATATTTATAATCTTAATTCTGCCTGCTCCAATGCTGCTGACAAATGGAGCATTGTCAGCATTACCCCATTTCATAGCAGAGCACCACAAGAATATG
TCATTTGTATTGGGTCGATATCCTTATAATTGTAGATTTGAGTTCGTTAAAACTAGTCAATTCAGGATTTATAATGTGAAATTGCTATAATACTGACCCTTGAATTATTCATTTTAACAGTGGATTCTGGGTGGCGTTATGGGGATCATTCTAGGAGGTCTTTGCAGGGTCCTGAACATGATGGACTTCTTGGTAGTGGCTCTTTTCCAAGACCATCTGGATATACTACGGGATTTTCAGCACCAAAGGTTCGAGCTAATGATCACTATCAGCTTAACAAAAGCAATGAGCCATATCATCCACCTCGTCCTTATAAGGTTCACATTTGCCCGATGTCTCTAGAAAGTACTTGCAAACAACTTTTGGAATTAAATTTCTATCCTTGGATTATGGTTTGTCCTTATAAGGTTCACATTACATTGCCTGATGTCTCTAGAAACTATTTGCAAACGATGTTTTTAATTAAATTTCTATCCTTGGATTTATGGCTTTACATTCAATTATTTGTCTTTAAACATAATCAGAATCATTTTGTTGACTTGGTATATGAATTGCTTAGCAAGAATTTGAAATTATGTTGATTAAGTGCTCCTTATGCATGTTATTCTTTTATTGATCAGGCTGTATCTCACTCACGAAGGGAAACTAATGATTCATACAATCACGAAACTTTTGGTTCTTTTGAGTACACAAGTGAGGATAGGGTTGAAGAGGAAAAAAAGAGAAGAGGTAACCACTCTTTTCTCATGATGCTTGATGTACACATTGGTTTAACGGTATATCTATAGAACCTGTTCAAATCCTTTTGTGAACAATAAGTACCTTTTTATCTGACAAACTTATTTGCAGCTTCATTTGAGTCGATGAGGAAAGAACAACACAGGGCATTTCAAGAAAGTCACAAGCTAAATCCTTTGAAGCAGAGAGACGAGTTTGGCATCCTAATGCAGTTGGACGAGTCTAAAGATGAGAAGAAGTTATTGAATACAAGCAGTGGTTTCGATGAATCTACCACCTTACAAGCTTCAAAGAATGATCGAGAAAAATCTTTTCCACTACAGACAACTGTCTCTAGGCCACTTGTGCCTCCTGGATTCACGAGCACTGTATTGGAGAAAAACTTTGGAACAAGGTCTTCAGTTAATCCTCATTTGCTGGAGGTGATTATTTGTTTTCATGTATTGTTGTGGTTTTAATAGGTCTGACTAAAATGTTGTTTCCCATTTTAAAGCAATCAATGTTGAATGACTTCAAGCTTCTATAGTTGGTTGGTGCCTGTACAGTTTGTTTCTGTCTTCAGCATATATCCAGTCATATATCATTAGAATAAATAGCGATTTTTCTTGTCGATATCTTTTGGCACAAGTTAACTCCTTTCTCCTTCACTAACTGCACATATGTTTGTCAATTTTGTAGGGGAAAGATGATGTTGTTGACAAGTGTTTGCAAACTAAAGATGAGCATTTGCACAATGGAATCTCTGAAGATTTGTTGGAAAAAAATTCATCAGAGCAAATGGGTTGCCCCGGACAGTACGGAAAAACAAGCATTAATGCTTCTACTAACAACACTAGTGAAAAGATTATTGATCTGTTTTCAGCTCTTGATATGTCTAATAAAACAACTGGAATTGATGTTCAATCCCATGAAAATTCTTTGGAAGTTTTTGAAGCTTCTGAAAACAGTGCAGTTGCTGGTTGTAAGACTGAAAGGAGGGTCCTAGAGAATACAGCCATTGGTGAACCAAGTCAAGTACATTCATCTTCAATCTTAGAAAAACTTTTTGGCAGTGCCATGAAGTTAGATGGTAGTGCTACTAATTTTATCGAGGTACTAGTGACTTCTATCAGCATCCCTCGGTACCTACTCTAAAAGCATTTCAATAATTTTCTTTGATGGGCAGCAATTCTTTAAATGGACTATATTATTATTCAGATGATTGTTGAAATATTAAGTAAATAAATCCTTCAGCCATGGTATTAATTTTCTGGTTTTGATCT
TTAAGTTCGATATGAACTACATAAATCAACATCCTATCTGTGGTATTATTATTAGAAGGTACACGATCCTAGGCATTTTTCCTAGCTTAGATTTCTCCAATGTCATAAGTTATGAAAAACAGGCCGAAAAGCATAAAAATTATGCAGAATTAACTCCCCGCATTTGAATATCTTTTCTGAATTGCAATCTTATCATAGGAATATCATTTTTTATTGACGAATTATTTGTTCCCAATTGAATGTGGGATGAAATTGATATATAATTTAAAACTATACCTTTTTTGTTGCATATCTTTTCCTTGAATTTGTATCTAGTTGTTTTCTGCTCGAGGAAACATAAGTACCTCCTTCTCTCCTTGTTGGATTTTCATTTAAGATTAGAAGTAACAAAAGCAGCTCAAATTATGATGCATCTATCACTGATGTGTGTTTTTCCTGGTCTTATGTAGCAGCAGCACGACAATGAGATGGAGGATGCATGTAGCCCTCAAAATGCTCAATCTTCTAAATTTGCTCACTGGTTCGTGGACAATGGTATGTTGATAAAATTTCTGTATGCAACAAATTCCCAATATCAGTGGTTTATACACTGATGAGCTTTGAGTTTTTAATGCTCATAGCTCATTTTTTTCACCCTGCAGATAGGAAACAGGAGGAGGACCTTTCACCTAAAAGGTCAAATGACTTGCTTACTTTGATTGTAGGTGGAGAAAAGGGTGGAGACGTGTCTGATGTGAAGCATTCTGAGCTGTGTCTGCCTACTGTTACCTTTCATGGTTATGAATCTGCAGAAAATTATATCACATCAAGTGCAACATCATCCAATGTTGCGAAGCCTGAGCCATTTTATGATAAGAGTAAGCCAGAGGCTGTTTCTGCAATCCTTACCTGTGAAGCCGTTGAACAGACACTGCTGTCAAAAATTAGTGAAAATGACCCAGCTTTGCAGCCGTCTGATCAAAGATGGCGTGATTCTGATGATGATATCAAACATCCAACTGTAAAAAGGGATGATCATGCGTCACAGCACCTTCTCTCATTGTTACAGAAGGGTACGAGTCCAGTGATCGTGGGATATTGTGATGATGGTGCAAATAAAAAGGAGGAAAGTACCCACAACATTTCAAATCCGGGGAAGACATTAACTCTTGAAACACTTTTTGGGTCTGCTTTTATGAAGGAGCTTCAGTCAGTTGGAGCTCCAGTTTCTGCACAAAGGGGTTCGTCAGGATCTGTCAAAATTGATGTTTCAGAGTCTCAAGGTCCGATCACAGATGATGGTCTCTTGTCCAATAATGAAATTCGGCCCAGTATAATGAATCATGATCATGGTGATCAAAGACAGCAAAACCAACCAGATATAGTTCGTGGCCAGTGGTTTAATCTGAATGGCCCTCGACCTGAATTGGATTCTTCTCATCCTCGTGCTAAGTTAGGACATAAGATTGGCGGTTACGATGGACCACCTGAAATACCCCTTCCTGAAGAGGACAGTTTAATCATAAGCGATTCTATGAAATTTCAGAACCTCATTTCTATTGGGAATTCAACTAAACCTCAACCACTGTTCTCACACCACACACAAGACAATAATGCTGCAATCTTTAACCCTGCCTTTAAAGATGAAAGGCCAAACATGGGAGGTCTAGAAGGGCTACCTTTTTCAGCCAGTCCCTATGATAGGAGGGAGACAGAAATGCCACATCGGAAAGCTCCTGTTCATTCCTCATTTTCCCAGCTTCATCCTCCACAAACGAATAATGTCAAGTTGTTTCATCAATTTGAATCTCATCCTCCTAACATGAATTCTCAGGGAGAGTTACTGTTGCCAGAAGGAATGATTCATCACGACTCACCATCTAATCACCAATTTGTAGCAAATATGCTTCGTCCTCCTACCACTGGATTATCTGGATTTGATCATTCGATTCATCACCCGATGATGCAGCAGATGCAAACTTCGGTCAATCTTCCGCCTCAGCATCTACTACAAGGGTTAT
CTAGAGGTGCACCTCCACCCATGACAAGCAGAAGTGTTCCTTTACATCCTCACTCTGTCAGAGGTAGTGCAGCACCTCCCCAACCAAACAATCCGGTTACTGGTTTAGTGCAGGAACTCAATTCTATCCAAGGTTTTCATATCGGTCAGCGTGTGCCTAATATTGGTGGTTCCAGAATGCCCTCGCCAGGTAACCTCTCATGACATTTTGCTTGCACTTCAGGTGCTATTTTGAAAATTTTATATTTTACTGATAAATTAGTTCACATATCCTGCTTATCTGGCCGTTATTTGTTTATGTCTTGCTAAATTGATATGGCCTCTATTAAATGTTACTGAATCGCTGATTAATAAATATCTTTTGCCCTTTAGTGAAATGAAACAAAATTAAGATTTTTGACTCATTTGGATTGATTCCGCATAAGACACTCTTTACCTTTCCCAAACTTTGACTTATTTGAGCTTATTAGAAAACCCGTGTCTGTGCATGCCTGCTGCAGAGTGCTTGTATTGCTGTATGTATCAAGCAACATAAAATAAAGGCAGTTGAATCTGTTGCTCATTGGTGTCTTTTTAGTGCGTTGCTCTCCTTTTTTATTTTATTTCTGGTTTAAACTTTCAGCTCCTGGTATTGGTGGGACAGGTAACCAACCAGACGCAATTCAGAGGCTCATCCAAATGGGTCACAGATCAAACTCAAAGCAAATTCATCCTCTTTCTGCCAGTGGCCATGGTCAGGGGATGCATGGTCACGAGTTGAACATGGGTTATGGGTTCAGGTAGTTGTAAACACTGTACTTCGAGAATACTTGCCCAAATCCTCAATTCCTTGGATAGAGAAATGGACCCAATATGCATGCTCCTCTTAGTAGGATTGAGTGAGAAATGTAAGTAATGAATCCTTCTTATGTTTGTTTTGTTGATTTGTCATTCACTTATTTCTCGTGCGGTTTCTAACTCGAAATGAAGTCTTAAGTAACCCGACTAATAGAAATCAGTAGGTGAAGGAAAGCGCAGTGTTTCACTCTGATTAGGCATTTTAAGAAATAGATTTGCCCACTACAGGGTTTTATGATGGGGATACATTTTTATTAGTTAAACCATTCTGTTGTGTCTCGGAGCAATGCAATAAGCTTCGAGGCAGAGGACGAGGAGATAGGGTCCATCTTTGCCATATTTGGCTTGACAGAAATAATCTATGTTTCACGTCATATGCTGGATTGGGCCCCTCCATCGACAGAGTTTTAA
mRNA sequence
ATGTACGGAAATGCCATGGCTGAAATGAAGATAATCACTCGAACTGTGCAGAGAAAAAGGTCGACGGTGCGTTGCTTAGCGAAGAGGAAGGCCTCAGATTTGAGCAATGGCCATGGGGTTTACTGCTCAAACGCAGCACCCGGAGAACTCGTCCATAAGCAGCGGCATGATTTTCTCAAGGTAATTGGATTTGAGCCGGTTGACCTTCTCCGCCTCTGCCTTCTCCACAAGCACTATAGAACCACCAGCCGCGGCCTTCACTCTCACCGCCAATGGAGCTACATTGCTAGAGTTTCCGCACGAAACCGGTGCAGCAAAACGGACCGAGAAGAACGGGCAATGGCCATGAAACGACGACGTGGCAGAGCTCAACAGCGAAGAAGAGGCCATCGGAGTTCAGAGACGTTTCTCCTTCGCTCAGGTGATGAACAGAGACCAGAAAGCCAGCACTCAATTTCTCTCTCGCGTCGATTACTCGGGAAATCCATAGATCAAAACCCTAAAGCTTCCAATCCGCAAGAAGTTCGTGTCCATGGGGATTTCTTTTTGAATAATTTGTGTTTTGGATCCAGGAAATCAAAGATTTCTTACACGAGAGATTTCCTCTTATCCCTGAGCGAATTGGATATTTGCAAAAAGTTGCCAAGCGGTTTTGACCAATCAATTATCACTGAATTTGAAGAAGCTTCCTGTGATAGGCAAAGAATTTCTGGAGGTTTGTCTTTGAATAGTTCTAGGCGTAACGAGTATGGTTCATCACCACCCAGTAGGGCAGAAACGAATAATTATTCTCGTCGAATACACGGAAAGAGGGAAGTTCAATCTTCTGGACGAAGTGATAAAGATAGTGATTCACAATCCGATAGGGATTCAGTGGATTCTGGGTGGCGTTATGGGGATCATTCTAGGAGGTCTTTGCAGGGTCCTGAACATGATGGACTTCTTGGTAGTGGCTCTTTTCCAAGACCATCTGGATATACTACGGGATTTTCAGCACCAAAGGTTCGAGCTAATGATCACTATCAGCTTAACAAAAGCAATGAGCCATATCATCCACCTCGTCCTTATAAGGCTGTATCTCACTCACGAAGGGAAACTAATGATTCATACAATCACGAAACTTTTGGTTCTTTTGAGTACACAAGTGAGGATAGGGTTGAAGAGGAAAAAAAGAGAAGAGCTTCATTTGAGTCGATGAGGAAAGAACAACACAGGGCATTTCAAGAAAGTCACAAGCTAAATCCTTTGAAGCAGAGAGACGAGTTTGGCATCCTAATGCAGTTGGACGAGTCTAAAGATGAGAAGAAGTTATTGAATACAAGCAGTGGTTTCGATGAATCTACCACCTTACAAGCTTCAAAGAATGATCGAGAAAAATCTTTTCCACTACAGACAACTGTCTCTAGGCCACTTGTGCCTCCTGGATTCACGAGCACTGTATTGGAGAAAAACTTTGGAACAAGGTCTTCAGTTAATCCTCATTTGCTGGAGGGGAAAGATGATGTTGTTGACAAGTGTTTGCAAACTAAAGATGAGCATTTGCACAATGGAATCTCTGAAGATTTGTTGGAAAAAAATTCATCAGAGCAAATGGGTTGCCCCGGACAGTACGGAAAAACAAGCATTAATGCTTCTACTAACAACACTAGTGAAAAGATTATTGATCTGTTTTCAGCTCTTGATATGTCTAATAAAACAACTGGAATTGATGTTCAATCCCATGAAAATTCTTTGGAAGTTTTTGAAGCTTCTGAAAACAGTGCAGTTGCTGGTTGTAAGACTGAAAGGAGGGTCCTAGAGAATACAGCCATTGGTGAACCAAGTCAAGTACATTCATCTTCAATCTTAGAAAAACTTTTTGGCAGTGCCATGAAGTTAGATGGTAGTGCTACTAATTTTATCGAGCAGCACGACAATGAGATGGAGGATGCATGTAGCCCTCAAAATGCTCAATCTTCTAAATTTGCTCACTGGTTCGTGGACAATGATAGGAAACAGGA
GGAGGACCTTTCACCTAAAAGGTCAAATGACTTGCTTACTTTGATTGTAGGTGGAGAAAAGGGTGGAGACGTGTCTGATGTGAAGCATTCTGAGCTGTGTCTGCCTACTGTTACCTTTCATGGTTATGAATCTGCAGAAAATTATATCACATCAAGTGCAACATCATCCAATGTTGCGAAGCCTGAGCCATTTTATGATAAGAGTAAGCCAGAGGCTGTTTCTGCAATCCTTACCTGTGAAGCCGTTGAACAGACACTGCTGTCAAAAATTAGTGAAAATGACCCAGCTTTGCAGCCGTCTGATCAAAGATGGCGTGATTCTGATGATGATATCAAACATCCAACTGTAAAAAGGGATGATCATGCGTCACAGCACCTTCTCTCATTGTTACAGAAGGGTACGAGTCCAGTGATCGTGGGATATTGTGATGATGGTGCAAATAAAAAGGAGGAAAGTACCCACAACATTTCAAATCCGGGGAAGACATTAACTCTTGAAACACTTTTTGGGTCTGCTTTTATGAAGGAGCTTCAGTCAGTTGGAGCTCCAGTTTCTGCACAAAGGGGTTCGTCAGGATCTGTCAAAATTGATGTTTCAGAGTCTCAAGGTCCGATCACAGATGATGGTCTCTTGTCCAATAATGAAATTCGGCCCAGTATAATGAATCATGATCATGGTGATCAAAGACAGCAAAACCAACCAGATATAGTTCGTGGCCAGTGGTTTAATCTGAATGGCCCTCGACCTGAATTGGATTCTTCTCATCCTCGTGCTAAGTTAGGACATAAGATTGGCGGTTACGATGGACCACCTGAAATACCCCTTCCTGAAGAGGACAGTTTAATCATAAGCGATTCTATGAAATTTCAGAACCTCATTTCTATTGGGAATTCAACTAAACCTCAACCACTGTTCTCACACCACACACAAGACAATAATGCTGCAATCTTTAACCCTGCCTTTAAAGATGAAAGGCCAAACATGGGAGGTCTAGAAGGGCTACCTTTTTCAGCCAGTCCCTATGATAGGAGGGAGACAGAAATGCCACATCGGAAAGCTCCTGTTCATTCCTCATTTTCCCAGCTTCATCCTCCACAAACGAATAATGTCAAGTTGTTTCATCAATTTGAATCTCATCCTCCTAACATGAATTCTCAGGGAGAGTTACTGTTGCCAGAAGGAATGATTCATCACGACTCACCATCTAATCACCAATTTGTAGCAAATATGCTTCGTCCTCCTACCACTGGATTATCTGGATTTGATCATTCGATTCATCACCCGATGATGCAGCAGATGCAAACTTCGGTCAATCTTCCGCCTCAGCATCTACTACAAGGGTTATCTAGAGGTGCACCTCCACCCATGACAAGCAGAAGTGTTCCTTTACATCCTCACTCTGTCAGAGGTAGTGCAGCACCTCCCCAACCAAACAATCCGGTTACTGGTTTAGTGCAGGAACTCAATTCTATCCAAGGTTTTCATATCGGTCAGCGTGTGCCTAATATTGGTGGTTCCAGAATGCCCTCGCCAGCTCCTGGTATTGGTGGGACAGGTAACCAACCAGACGCAATTCAGAGGCTCATCCAAATGGGTCACAGATCAAACTCAAAGCAAATTCATCCTCTTTCTGCCAGTGGCCATGGTCAGGGGATGCATGGTCACGAGTTGAACATGGGTTATGGGTTCAGCTTCGAGGCAGAGGACGAGGAGATAGGGTCCATCTTTGCCATATTTGGCTTGACAGAAATAATCTATGTTTCACGTCATATGCTGGATTGGGCCCCTCCATCGACAGAGTTTTAA
Coding sequence (CDS)
ATGTACGGAAATGCCATGGCTGAAATGAAGATAATCACTCGAACTGTGCAGAGAAAAAGGTCGACGGTGCGTTGCTTAGCGAAGAGGAAGGCCTCAGATTTGAGCAATGGCCATGGGGTTTACTGCTCAAACGCAGCACCCGGAGAACTCGTCCATAAGCAGCGGCATGATTTTCTCAAGGTAATTGGATTTGAGCCGGTTGACCTTCTCCGCCTCTGCCTTCTCCACAAGCACTATAGAACCACCAGCCGCGGCCTTCACTCTCACCGCCAATGGAGCTACATTGCTAGAGTTTCCGCACGAAACCGGTGCAGCAAAACGGACCGAGAAGAACGGGCAATGGCCATGAAACGACGACGTGGCAGAGCTCAACAGCGAAGAAGAGGCCATCGGAGTTCAGAGACGTTTCTCCTTCGCTCAGGTGATGAACAGAGACCAGAAAGCCAGCACTCAATTTCTCTCTCGCGTCGATTACTCGGGAAATCCATAGATCAAAACCCTAAAGCTTCCAATCCGCAAGAAGTTCGTGTCCATGGGGATTTCTTTTTGAATAATTTGTGTTTTGGATCCAGGAAATCAAAGATTTCTTACACGAGAGATTTCCTCTTATCCCTGAGCGAATTGGATATTTGCAAAAAGTTGCCAAGCGGTTTTGACCAATCAATTATCACTGAATTTGAAGAAGCTTCCTGTGATAGGCAAAGAATTTCTGGAGGTTTGTCTTTGAATAGTTCTAGGCGTAACGAGTATGGTTCATCACCACCCAGTAGGGCAGAAACGAATAATTATTCTCGTCGAATACACGGAAAGAGGGAAGTTCAATCTTCTGGACGAAGTGATAAAGATAGTGATTCACAATCCGATAGGGATTCAGTGGATTCTGGGTGGCGTTATGGGGATCATTCTAGGAGGTCTTTGCAGGGTCCTGAACATGATGGACTTCTTGGTAGTGGCTCTTTTCCAAGACCATCTGGATATACTACGGGATTTTCAGCACCAAAGGTTCGAGCTAATGATCACTATCAGCTTAACAAAAGCAATGAGCCATATCATCCACCTCGTCCTTATAAGGCTGTATCTCACTCACGAAGGGAAACTAATGATTCATACAATCACGAAACTTTTGGTTCTTTTGAGTACACAAGTGAGGATAGGGTTGAAGAGGAAAAAAAGAGAAGAGCTTCATTTGAGTCGATGAGGAAAGAACAACACAGGGCATTTCAAGAAAGTCACAAGCTAAATCCTTTGAAGCAGAGAGACGAGTTTGGCATCCTAATGCAGTTGGACGAGTCTAAAGATGAGAAGAAGTTATTGAATACAAGCAGTGGTTTCGATGAATCTACCACCTTACAAGCTTCAAAGAATGATCGAGAAAAATCTTTTCCACTACAGACAACTGTCTCTAGGCCACTTGTGCCTCCTGGATTCACGAGCACTGTATTGGAGAAAAACTTTGGAACAAGGTCTTCAGTTAATCCTCATTTGCTGGAGGGGAAAGATGATGTTGTTGACAAGTGTTTGCAAACTAAAGATGAGCATTTGCACAATGGAATCTCTGAAGATTTGTTGGAAAAAAATTCATCAGAGCAAATGGGTTGCCCCGGACAGTACGGAAAAACAAGCATTAATGCTTCTACTAACAACACTAGTGAAAAGATTATTGATCTGTTTTCAGCTCTTGATATGTCTAATAAAACAACTGGAATTGATGTTCAATCCCATGAAAATTCTTTGGAAGTTTTTGAAGCTTCTGAAAACAGTGCAGTTGCTGGTTGTAAGACTGAAAGGAGGGTCCTAGAGAATACAGCCATTGGTGAACCAAGTCAAGTACATTCATCTTCAATCTTAGAAAAACTTTTTGGCAGTGCCATGAAGTTAGATGGTAGTGCTACTAATTTTATCGAGCAGCACGACAATGAGATGGAGGATGCATGTAGCCCTCAAAATGCTCAATCTTCTAAATTTGCTCACTGGTTCGTGGACAATGATAGGAAACAGGA
GGAGGACCTTTCACCTAAAAGGTCAAATGACTTGCTTACTTTGATTGTAGGTGGAGAAAAGGGTGGAGACGTGTCTGATGTGAAGCATTCTGAGCTGTGTCTGCCTACTGTTACCTTTCATGGTTATGAATCTGCAGAAAATTATATCACATCAAGTGCAACATCATCCAATGTTGCGAAGCCTGAGCCATTTTATGATAAGAGTAAGCCAGAGGCTGTTTCTGCAATCCTTACCTGTGAAGCCGTTGAACAGACACTGCTGTCAAAAATTAGTGAAAATGACCCAGCTTTGCAGCCGTCTGATCAAAGATGGCGTGATTCTGATGATGATATCAAACATCCAACTGTAAAAAGGGATGATCATGCGTCACAGCACCTTCTCTCATTGTTACAGAAGGGTACGAGTCCAGTGATCGTGGGATATTGTGATGATGGTGCAAATAAAAAGGAGGAAAGTACCCACAACATTTCAAATCCGGGGAAGACATTAACTCTTGAAACACTTTTTGGGTCTGCTTTTATGAAGGAGCTTCAGTCAGTTGGAGCTCCAGTTTCTGCACAAAGGGGTTCGTCAGGATCTGTCAAAATTGATGTTTCAGAGTCTCAAGGTCCGATCACAGATGATGGTCTCTTGTCCAATAATGAAATTCGGCCCAGTATAATGAATCATGATCATGGTGATCAAAGACAGCAAAACCAACCAGATATAGTTCGTGGCCAGTGGTTTAATCTGAATGGCCCTCGACCTGAATTGGATTCTTCTCATCCTCGTGCTAAGTTAGGACATAAGATTGGCGGTTACGATGGACCACCTGAAATACCCCTTCCTGAAGAGGACAGTTTAATCATAAGCGATTCTATGAAATTTCAGAACCTCATTTCTATTGGGAATTCAACTAAACCTCAACCACTGTTCTCACACCACACACAAGACAATAATGCTGCAATCTTTAACCCTGCCTTTAAAGATGAAAGGCCAAACATGGGAGGTCTAGAAGGGCTACCTTTTTCAGCCAGTCCCTATGATAGGAGGGAGACAGAAATGCCACATCGGAAAGCTCCTGTTCATTCCTCATTTTCCCAGCTTCATCCTCCACAAACGAATAATGTCAAGTTGTTTCATCAATTTGAATCTCATCCTCCTAACATGAATTCTCAGGGAGAGTTACTGTTGCCAGAAGGAATGATTCATCACGACTCACCATCTAATCACCAATTTGTAGCAAATATGCTTCGTCCTCCTACCACTGGATTATCTGGATTTGATCATTCGATTCATCACCCGATGATGCAGCAGATGCAAACTTCGGTCAATCTTCCGCCTCAGCATCTACTACAAGGGTTATCTAGAGGTGCACCTCCACCCATGACAAGCAGAAGTGTTCCTTTACATCCTCACTCTGTCAGAGGTAGTGCAGCACCTCCCCAACCAAACAATCCGGTTACTGGTTTAGTGCAGGAACTCAATTCTATCCAAGGTTTTCATATCGGTCAGCGTGTGCCTAATATTGGTGGTTCCAGAATGCCCTCGCCAGCTCCTGGTATTGGTGGGACAGGTAACCAACCAGACGCAATTCAGAGGCTCATCCAAATGGGTCACAGATCAAACTCAAAGCAAATTCATCCTCTTTCTGCCAGTGGCCATGGTCAGGGGATGCATGGTCACGAGTTGAACATGGGTTATGGGTTCAGCTTCGAGGCAGAGGACGAGGAGATAGGGTCCATCTTTGCCATATTTGGCTTGACAGAAATAATCTATGTTTCACGTCATATGCTGGATTGGGCCCCTCCATCGACAGAGTTTTAA
Protein sequence
MYGNAMAEMKIITRTVQRKRSTVRCLAKRKASDLSNGHGVYCSNAAPGELVHKQRHDFLKVIGFEPVDLLRLCLLHKHYRTTSRGLHSHRQWSYIARVSARNRCSKTDREERAMAMKRRRGRAQQRRRGHRSSETFLLRSGDEQRPESQHSISLSRRLLGKSIDQNPKASNPQEVRVHGDFFLNNLCFGSRKSKISYTRDFLLSLSELDICKKLPSGFDQSIITEFEEASCDRQRISGGLSLNSSRRNEYGSSPPSRAETNNYSRRIHGKREVQSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPEHDGLLGSGSFPRPSGYTTGFSAPKVRANDHYQLNKSNEPYHPPRPYKAVSHSRRETNDSYNHETFGSFEYTSEDRVEEEKKRRASFESMRKEQHRAFQESHKLNPLKQRDEFGILMQLDESKDEKKLLNTSSGFDESTTLQASKNDREKSFPLQTTVSRPLVPPGFTSTVLEKNFGTRSSVNPHLLEGKDDVVDKCLQTKDEHLHNGISEDLLEKNSSEQMGCPGQYGKTSINASTNNTSEKIIDLFSALDMSNKTTGIDVQSHENSLEVFEASENSAVAGCKTERRVLENTAIGEPSQVHSSSILEKLFGSAMKLDGSATNFIEQHDNEMEDACSPQNAQSSKFAHWFVDNDRKQEEDLSPKRSNDLLTLIVGGEKGGDVSDVKHSELCLPTVTFHGYESAENYITSSATSSNVAKPEPFYDKSKPEAVSAILTCEAVEQTLLSKISENDPALQPSDQRWRDSDDDIKHPTVKRDDHASQHLLSLLQKGTSPVIVGYCDDGANKKEESTHNISNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGSSGSVKIDVSESQGPITDDGLLSNNEIRPSIMNHDHGDQRQQNQPDIVRGQWFNLNGPRPELDSSHPRAKLGHKIGGYDGPPEIPLPEEDSLIISDSMKFQNLISIGNSTKPQPLFSHHTQDNNAAIFNPAFKDERPNMGGLEGLPFSASPYDRRETEMPHRKAPVHSSFSQLHPPQTNNVKLFHQFESHPPNMNSQGELLLPEGMIHHDSPSNHQFVANMLRPPTTGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPPPMTSRSVPLHPHSVRGSAAPPQPNNPVTGLVQELNSIQGFHIGQRVPNIGGSRMPSPAPGIGGTGNQPDAIQRLIQMGHRSNSKQIHPLSASGHGQGMHGHELNMGYGFSFEAEDEEIGSIFAIFGLTEIIYVSRHMLDWAPPSTEF
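The protein sequence above is the frame-0 translation of the CDS. The following is a minimal sketch of that relationship, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` and the short `cds_prefix` excerpt (the first 42 nucleotides of the CDS) are illustrative choices and not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), assumed here.
# Codons are enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 nucleotides of the CDS listed above (hypothetical variable name).
cds_prefix = "ATGTACGGAAATGCCATGGCTGAAATGAAGATAATCACTCGA"
print(translate(cds_prefix))
```

The printed peptide can be compared with the first 14 residues of the protein sequence; passing the full CDS as a single string with line breaks removed should reproduce the whole polypeptide under the same assumption.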
Homology
BLAST of Sgr016019 vs. NCBI nr
Match: XP_022147303.1 (uncharacterized protein LOC111016288 isoform X1 [Momordica charantia])
HSP 1 Score: 1789.6 bits (4634), Expect = 0.0e+00
Identity = 917/1050 (87.33%), Positives = 966/1050 (92.00%), Query Frame = 0
Query: 191 RKSKISYTRDFLLSLSELDICKKLPSGFDQSIITEFEEASCDRQRISGGLSLNSSRRNEY 250
+KSKISYTRDFLLSLSELDICKKLPSGFDQSII+EFE+AS DRQRISGGLSLNS RRNEY
Sbjct: 23 KKSKISYTRDFLLSLSELDICKKLPSGFDQSIISEFEDASYDRQRISGGLSLNSFRRNEY 82
Query: 251 GSSPPSRAETNNYSRRIHGKREVQSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE 310
GSSPPSRAE NNYSRRIHGKREV SSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE
Sbjct: 83 GSSPPSRAEANNYSRRIHGKREVHSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE 142
Query: 311 HDGLLGSGSFPRPSGYTTGFSAPKVRANDHYQLNKSNEPYHPPRPYKAVSHSRRETNDSY 370
HDGLLGSGSFPRPSGY TGFSAPKVRAN+ YQLN+SNEPYHPPRPYKAV+H R NDSY
Sbjct: 143 HDGLLGSGSFPRPSGYATGFSAPKVRANEQYQLNRSNEPYHPPRPYKAVAHPRGNINDSY 202
Query: 371 NHETFGSFEYTSEDRVEEEKKRRASFESMRKEQHRAFQESHKLNPLKQRDEFGILMQLDE 430
NHETFGS E TSEDRVEEEKKRRA FESMRKEQHRAFQES K NP+KQRDEFGI+MQLDE
Sbjct: 203 NHETFGSSEDTSEDRVEEEKKRRALFESMRKEQHRAFQESQKSNPVKQRDEFGIMMQLDE 262
Query: 431 SKDEKKLLNTSSGFDESTTLQASKNDREKSFPLQTTVSRPLVPPGFTSTVLEKNFGTRSS 490
SKD+KKLLNTSSGFDES LQASKNDREK FP TTVSRPLVPPGFTS VLEK+FGT+SS
Sbjct: 263 SKDDKKLLNTSSGFDESIILQASKNDREKPFPSHTTVSRPLVPPGFTSNVLEKSFGTKSS 322
Query: 491 VNPHLLEGKDDVVDKCLQTKDEHLHNGISEDLLEKNSSEQMGCPGQYGKTSINASTNNTS 550
VNPH LE KDDVVDK LQTKDEHLHNGISEDL+EKNSSEQMGCP QYGKTSINAS NNTS
Sbjct: 323 VNPHFLEVKDDVVDKSLQTKDEHLHNGISEDLVEKNSSEQMGCPEQYGKTSINASANNTS 382
Query: 551 EKIIDLFSALDMSNKTTGIDVQSHENSLEVFEASENSAVAGCKTERRVLENTAIGEPSQV 610
EKIIDLFSA+DMSNKTTGIDV+S E+SL+ +ASEN AVA CKTE +VL NTAIGE SQV
Sbjct: 383 EKIIDLFSAVDMSNKTTGIDVESLESSLQALQASENRAVADCKTE-KVLANTAIGETSQV 442
Query: 611 HSSSILEKLFGSAMKLDGSATNFIEQHDNEMEDACSPQNAQSSKFAHWFVDNDRKQEEDL 670
HSSSILEKLF SA+KLDG ATNFIEQH+NEMEDACSPQN QSSKFAHWFVDND KQE+ +
Sbjct: 443 HSSSILEKLFCSAIKLDGGATNFIEQHENEMEDACSPQNTQSSKFAHWFVDNDGKQEDGV 502
Query: 671 SPKRSNDLLTLIVGGEKGG-DVSDVKHSELCLPTVTFHGYESAENYITSSATSSNVAKPE 730
SPKRSNDLLTLIVGGEKGG D+SDV SE LPTV FHGYESAE+YITSS TSSN K E
Sbjct: 503 SPKRSNDLLTLIVGGEKGGYDISDVA-SEQSLPTVAFHGYESAESYITSSETSSNAQKTE 562
Query: 731 PFYDKSKPEAVSAILTCEAVEQTLLSKISENDPALQPSDQRWRDSDDDIKHPTVKRDDHA 790
PFYDKSKPEAVS+ILTCEAVEQTLLSK+SEND ALQPSDQRW SD + KHPT K DDHA
Sbjct: 563 PFYDKSKPEAVSSILTCEAVEQTLLSKMSENDSALQPSDQRWSHSDANNKHPTGKSDDHA 622
Query: 791 SQHLLSLLQKGTSPVIVGY-CDDG-------ANKKEESTHNISNPGKTLTLETLFGSAFM 850
SQHLLSLLQKGTSP+IVGY DDG NKKEES+HNISNPGKTLTLETLFGSAFM
Sbjct: 623 SQHLLSLLQKGTSPMIVGYGSDDGWNMGTGIHNKKEESSHNISNPGKTLTLETLFGSAFM 682
Query: 851 KELQSVGAPVSAQRGSSGSVKIDVSESQGPITDDGLLSNNEIRPSIMNHDHGDQRQQNQP 910
KELQSVGAPVSAQRGSSGS K+DVSES GPI DDGLLSNNEIRPS++NHDHGDQRQQNQP
Sbjct: 683 KELQSVGAPVSAQRGSSGSGKVDVSESHGPIMDDGLLSNNEIRPSMINHDHGDQRQQNQP 742
Query: 911 DIVRGQWFNLNGPRPELDSSHPRAKLGHKIGGYDGPPEIPLPEEDSLIISDSMKFQNLIS 970
D+VRGQW NLNGPRPELDSSHP+AKLGHKIGGYDGP E+P PEEDSLIISDSM FQNLIS
Sbjct: 743 DLVRGQWLNLNGPRPELDSSHPQAKLGHKIGGYDGPAEMPFPEEDSLIISDSMNFQNLIS 802
Query: 971 IGNSTKPQPLFSHHTQDNNAAIFNPAFKDERPNMGGLEGLPFSASPYDRRETEMPHRKAP 1030
IGNS KPQPLFSHHTQDNN+AIFN AFKDERP+MGGLEGLPFSASP+DRRETEMPHRKAP
Sbjct: 803 IGNSIKPQPLFSHHTQDNNSAIFNSAFKDERPSMGGLEGLPFSASPFDRRETEMPHRKAP 862
Query: 1031 VHSSFSQLHPPQTNNVKLFHQFESHPPNMNSQGELLLPEGMIHHDSPSNHQFVANMLRPP 1090
VHSSF QLHP Q NNVKLFHQFESHPPNMNSQGELLLPEGM+HHDSPSNHQFVANMLRPP
Sbjct: 863 VHSSFPQLHPSQANNVKLFHQFESHPPNMNSQGELLLPEGMVHHDSPSNHQFVANMLRPP 922
Query: 1091 TTGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPPPMTSRSVPLHPHSVRGSAAP 1150
T+GLSGFDHSIHHPM+QQ+QTSVNLPPQHLLQGLSRGAPPPMT+RSVPLHPHSVRGSAAP
Sbjct: 923 TSGLSGFDHSIHHPMLQQIQTSVNLPPQHLLQGLSRGAPPPMTNRSVPLHPHSVRGSAAP 982
Query: 1151 PQPNNPVTGLVQELNSIQGFHIGQRVPNIGGSRMPSPAPGIGGTGNQPDAIQRLIQMGHR 1210
PQPNN V+GLVQELNSIQGFHIGQRVPN+GG R+PSPAPGIG GNQPDAIQRLIQMGHR
Sbjct: 983 PQPNNQVSGLVQELNSIQGFHIGQRVPNMGGPRIPSPAPGIG--GNQPDAIQRLIQMGHR 1042
Query: 1211 SN-SKQIHPLSASGHGQGMHGHELNMGYGF 1231
SN KQIHPLSASGHGQG++GHELNMGYG+
Sbjct: 1043 SNPPKQIHPLSASGHGQGIYGHELNMGYGY 1068
BLAST of Sgr016019 vs. NCBI nr
Match: XP_022147304.1 (uncharacterized protein LOC111016288 isoform X2 [Momordica charantia])
HSP 1 Score: 1783.1 bits (4617), Expect = 0.0e+00
Identity = 916/1050 (87.24%), Positives = 965/1050 (91.90%), Query Frame = 0
Query: 191 RKSKISYTRDFLLSLSELDICKKLPSGFDQSIITEFEEASCDRQRISGGLSLNSSRRNEY 250
+KSKISYTRDFLLSLSELDICKKLPSGFDQSII+EFE+AS DRQRISGGLSLNS RRNEY
Sbjct: 23 KKSKISYTRDFLLSLSELDICKKLPSGFDQSIISEFEDASYDRQRISGGLSLNSFRRNEY 82
Query: 251 GSSPPSRAETNNYSRRIHGKREVQSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE 310
GSSPPSRAE NNYSRRIHGKREV SSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE
Sbjct: 83 GSSPPSRAEANNYSRRIHGKREVHSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE 142
Query: 311 HDGLLGSGSFPRPSGYTTGFSAPKVRANDHYQLNKSNEPYHPPRPYKAVSHSRRETNDSY 370
HDGLLGSGSFPRPSGY TGFSAPKVRAN+ YQLN+SNEPYHPPRPYKAV+H R NDSY
Sbjct: 143 HDGLLGSGSFPRPSGYATGFSAPKVRANEQYQLNRSNEPYHPPRPYKAVAHPRGNINDSY 202
Query: 371 NHETFGSFEYTSEDRVEEEKKRRASFESMRKEQHRAFQESHKLNPLKQRDEFGILMQLDE 430
NHETFGS E TSEDRVEEEKKRRA FESMRKEQHRAFQES K NP+KQRDEFGI+MQLDE
Sbjct: 203 NHETFGSSEDTSEDRVEEEKKRRALFESMRKEQHRAFQESQKSNPVKQRDEFGIMMQLDE 262
Query: 431 SKDEKKLLNTSSGFDESTTLQASKNDREKSFPLQTTVSRPLVPPGFTSTVLEKNFGTRSS 490
SKD+KKLLNTSSGFDES LQASKNDREK FP TTVSRPLVPPGFTS VLEK+FGT+SS
Sbjct: 263 SKDDKKLLNTSSGFDESIILQASKNDREKPFPSHTTVSRPLVPPGFTSNVLEKSFGTKSS 322
Query: 491 VNPHLLEGKDDVVDKCLQTKDEHLHNGISEDLLEKNSSEQMGCPGQYGKTSINASTNNTS 550
VNPH LE KDDVVDK LQTKDEHLHNGISEDL+EKNSSEQMGCP QYGKTSINAS NNTS
Sbjct: 323 VNPHFLEVKDDVVDKSLQTKDEHLHNGISEDLVEKNSSEQMGCPEQYGKTSINASANNTS 382
Query: 551 EKIIDLFSALDMSNKTTGIDVQSHENSLEVFEASENSAVAGCKTERRVLENTAIGEPSQV 610
EKIIDLFSA+DMSNKTTGIDV+S E+SL+ +ASEN AVA CKTE +VL NTAIGE SQV
Sbjct: 383 EKIIDLFSAVDMSNKTTGIDVESLESSLQALQASENRAVADCKTE-KVLANTAIGETSQV 442
Query: 611 HSSSILEKLFGSAMKLDGSATNFIEQHDNEMEDACSPQNAQSSKFAHWFVDNDRKQEEDL 670
HSSSILEKLF SA+KLDG ATNFIE H+NEMEDACSPQN QSSKFAHWFVDND KQE+ +
Sbjct: 443 HSSSILEKLFCSAIKLDGGATNFIE-HENEMEDACSPQNTQSSKFAHWFVDNDGKQEDGV 502
Query: 671 SPKRSNDLLTLIVGGEKGG-DVSDVKHSELCLPTVTFHGYESAENYITSSATSSNVAKPE 730
SPKRSNDLLTLIVGGEKGG D+SDV SE LPTV FHGYESAE+YITSS TSSN K E
Sbjct: 503 SPKRSNDLLTLIVGGEKGGYDISDVA-SEQSLPTVAFHGYESAESYITSSETSSNAQKTE 562
Query: 731 PFYDKSKPEAVSAILTCEAVEQTLLSKISENDPALQPSDQRWRDSDDDIKHPTVKRDDHA 790
PFYDKSKPEAVS+ILTCEAVEQTLLSK+SEND ALQPSDQRW SD + KHPT K DDHA
Sbjct: 563 PFYDKSKPEAVSSILTCEAVEQTLLSKMSENDSALQPSDQRWSHSDANNKHPTGKSDDHA 622
Query: 791 SQHLLSLLQKGTSPVIVGY-CDDG-------ANKKEESTHNISNPGKTLTLETLFGSAFM 850
SQHLLSLLQKGTSP+IVGY DDG NKKEES+HNISNPGKTLTLETLFGSAFM
Sbjct: 623 SQHLLSLLQKGTSPMIVGYGSDDGWNMGTGIHNKKEESSHNISNPGKTLTLETLFGSAFM 682
Query: 851 KELQSVGAPVSAQRGSSGSVKIDVSESQGPITDDGLLSNNEIRPSIMNHDHGDQRQQNQP 910
KELQSVGAPVSAQRGSSGS K+DVSES GPI DDGLLSNNEIRPS++NHDHGDQRQQNQP
Sbjct: 683 KELQSVGAPVSAQRGSSGSGKVDVSESHGPIMDDGLLSNNEIRPSMINHDHGDQRQQNQP 742
Query: 911 DIVRGQWFNLNGPRPELDSSHPRAKLGHKIGGYDGPPEIPLPEEDSLIISDSMKFQNLIS 970
D+VRGQW NLNGPRPELDSSHP+AKLGHKIGGYDGP E+P PEEDSLIISDSM FQNLIS
Sbjct: 743 DLVRGQWLNLNGPRPELDSSHPQAKLGHKIGGYDGPAEMPFPEEDSLIISDSMNFQNLIS 802
Query: 971 IGNSTKPQPLFSHHTQDNNAAIFNPAFKDERPNMGGLEGLPFSASPYDRRETEMPHRKAP 1030
IGNS KPQPLFSHHTQDNN+AIFN AFKDERP+MGGLEGLPFSASP+DRRETEMPHRKAP
Sbjct: 803 IGNSIKPQPLFSHHTQDNNSAIFNSAFKDERPSMGGLEGLPFSASPFDRRETEMPHRKAP 862
Query: 1031 VHSSFSQLHPPQTNNVKLFHQFESHPPNMNSQGELLLPEGMIHHDSPSNHQFVANMLRPP 1090
VHSSF QLHP Q NNVKLFHQFESHPPNMNSQGELLLPEGM+HHDSPSNHQFVANMLRPP
Sbjct: 863 VHSSFPQLHPSQANNVKLFHQFESHPPNMNSQGELLLPEGMVHHDSPSNHQFVANMLRPP 922
Query: 1091 TTGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPPPMTSRSVPLHPHSVRGSAAP 1150
T+GLSGFDHSIHHPM+QQ+QTSVNLPPQHLLQGLSRGAPPPMT+RSVPLHPHSVRGSAAP
Sbjct: 923 TSGLSGFDHSIHHPMLQQIQTSVNLPPQHLLQGLSRGAPPPMTNRSVPLHPHSVRGSAAP 982
Query: 1151 PQPNNPVTGLVQELNSIQGFHIGQRVPNIGGSRMPSPAPGIGGTGNQPDAIQRLIQMGHR 1210
PQPNN V+GLVQELNSIQGFHIGQRVPN+GG R+PSPAPGIG GNQPDAIQRLIQMGHR
Sbjct: 983 PQPNNQVSGLVQELNSIQGFHIGQRVPNMGGPRIPSPAPGIG--GNQPDAIQRLIQMGHR 1042
Query: 1211 SN-SKQIHPLSASGHGQGMHGHELNMGYGF 1231
SN KQIHPLSASGHGQG++GHELNMGYG+
Sbjct: 1043 SNPPKQIHPLSASGHGQGIYGHELNMGYGY 1067
BLAST of Sgr016019 vs. NCBI nr
Match: XP_022147305.1 (uncharacterized protein LOC111016288 isoform X3 [Momordica charantia])
HSP 1 Score: 1744.9 bits (4518), Expect = 0.0e+00
Identity = 900/1050 (85.71%), Positives = 947/1050 (90.19%), Query Frame = 0
Query: 191 RKSKISYTRDFLLSLSELDICKKLPSGFDQSIITEFEEASCDRQRISGGLSLNSSRRNEY 250
+KSKISYTRDFLLSLSELDICKKLPSGFDQSII+EFE+AS DRQRISGGLSLNS RRNEY
Sbjct: 23 KKSKISYTRDFLLSLSELDICKKLPSGFDQSIISEFEDASYDRQRISGGLSLNSFRRNEY 82
Query: 251 GSSPPSRAETNNYSRRIHGKREVQSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE 310
GSSPPSRAE NNYSRRIHGKREV SSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE
Sbjct: 83 GSSPPSRAEANNYSRRIHGKREVHSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE 142
Query: 311 HDGLLGSGSFPRPSGYTTGFSAPKVRANDHYQLNKSNEPYHPPRPYKAVSHSRRETNDSY 370
HDGLLGSGSFPRPSGY TGFSAPKVRAN+ YQLN+SNEPYHPPRPYKAV+H R NDSY
Sbjct: 143 HDGLLGSGSFPRPSGYATGFSAPKVRANEQYQLNRSNEPYHPPRPYKAVAHPRGNINDSY 202
Query: 371 NHETFGSFEYTSEDRVEEEKKRRASFESMRKEQHRAFQESHKLNPLKQRDEFGILMQLDE 430
NHETFGS E TSEDRVEEEKKRRA FESMRKEQHRAFQES K NP+KQRDEFGI+MQLDE
Sbjct: 203 NHETFGSSEDTSEDRVEEEKKRRALFESMRKEQHRAFQESQKSNPVKQRDEFGIMMQLDE 262
Query: 431 SKDEKKLLNTSSGFDESTTLQASKNDREKSFPLQTTVSRPLVPPGFTSTVLEKNFGTRSS 490
SKD+KKLLNTSSGFDES LQASKNDREK FP TTVSRPLVPPGFTS VLEK+FGT+SS
Sbjct: 263 SKDDKKLLNTSSGFDESIILQASKNDREKPFPSHTTVSRPLVPPGFTSNVLEKSFGTKSS 322
Query: 491 VNPHLLEGKDDVVDKCLQTKDEHLHNGISEDLLEKNSSEQMGCPGQYGKTSINASTNNTS 550
VNPH LE KDDVVDK LQTKDEHLHNGISEDL+EKNSSEQMGCP QYGKTSINAS NNTS
Sbjct: 323 VNPHFLEVKDDVVDKSLQTKDEHLHNGISEDLVEKNSSEQMGCPEQYGKTSINASANNTS 382
Query: 551 EKIIDLFSALDMSNKTTGIDVQSHENSLEVFEASENSAVAGCKTERRVLENTAIGEPSQV 610
EKIIDLFSA+DMSNKTTGIDV+S E+SL+ +ASEN AVA CKTE +VL NTAIGE SQV
Sbjct: 383 EKIIDLFSAVDMSNKTTGIDVESLESSLQALQASENRAVADCKTE-KVLANTAIGETSQV 442
Query: 611 HSSSILEKLFGSAMKLDGSATNFIEQHDNEMEDACSPQNAQSSKFAHWFVDNDRKQEEDL 670
HSSSILEKLF SA+KLDG ATNFIEQH+NEMEDACSPQN QSSKFAHWFVDN
Sbjct: 443 HSSSILEKLFCSAIKLDGGATNFIEQHENEMEDACSPQNTQSSKFAHWFVDN-------- 502
Query: 671 SPKRSNDLLTLIVGGEKGG-DVSDVKHSELCLPTVTFHGYESAENYITSSATSSNVAKPE 730
GGEKGG D+SDV SE LPTV FHGYESAE+YITSS TSSN K E
Sbjct: 503 -------------GGEKGGYDISDVA-SEQSLPTVAFHGYESAESYITSSETSSNAQKTE 562
Query: 731 PFYDKSKPEAVSAILTCEAVEQTLLSKISENDPALQPSDQRWRDSDDDIKHPTVKRDDHA 790
PFYDKSKPEAVS+ILTCEAVEQTLLSK+SEND ALQPSDQRW SD + KHPT K DDHA
Sbjct: 563 PFYDKSKPEAVSSILTCEAVEQTLLSKMSENDSALQPSDQRWSHSDANNKHPTGKSDDHA 622
Query: 791 SQHLLSLLQKGTSPVIVGY-CDDG-------ANKKEESTHNISNPGKTLTLETLFGSAFM 850
SQHLLSLLQKGTSP+IVGY DDG NKKEES+HNISNPGKTLTLETLFGSAFM
Sbjct: 623 SQHLLSLLQKGTSPMIVGYGSDDGWNMGTGIHNKKEESSHNISNPGKTLTLETLFGSAFM 682
Query: 851 KELQSVGAPVSAQRGSSGSVKIDVSESQGPITDDGLLSNNEIRPSIMNHDHGDQRQQNQP 910
KELQSVGAPVSAQRGSSGS K+DVSES GPI DDGLLSNNEIRPS++NHDHGDQRQQNQP
Sbjct: 683 KELQSVGAPVSAQRGSSGSGKVDVSESHGPIMDDGLLSNNEIRPSMINHDHGDQRQQNQP 742
Query: 911 DIVRGQWFNLNGPRPELDSSHPRAKLGHKIGGYDGPPEIPLPEEDSLIISDSMKFQNLIS 970
D+VRGQW NLNGPRPELDSSHP+AKLGHKIGGYDGP E+P PEEDSLIISDSM FQNLIS
Sbjct: 743 DLVRGQWLNLNGPRPELDSSHPQAKLGHKIGGYDGPAEMPFPEEDSLIISDSMNFQNLIS 802
Query: 971 IGNSTKPQPLFSHHTQDNNAAIFNPAFKDERPNMGGLEGLPFSASPYDRRETEMPHRKAP 1030
IGNS KPQPLFSHHTQDNN+AIFN AFKDERP+MGGLEGLPFSASP+DRRETEMPHRKAP
Sbjct: 803 IGNSIKPQPLFSHHTQDNNSAIFNSAFKDERPSMGGLEGLPFSASPFDRRETEMPHRKAP 862
Query: 1031 VHSSFSQLHPPQTNNVKLFHQFESHPPNMNSQGELLLPEGMIHHDSPSNHQFVANMLRPP 1090
VHSSF QLHP Q NNVKLFHQFESHPPNMNSQGELLLPEGM+HHDSPSNHQFVANMLRPP
Sbjct: 863 VHSSFPQLHPSQANNVKLFHQFESHPPNMNSQGELLLPEGMVHHDSPSNHQFVANMLRPP 922
Query: 1091 TTGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPPPMTSRSVPLHPHSVRGSAAP 1150
T+GLSGFDHSIHHPM+QQ+QTSVNLPPQHLLQGLSRGAPPPMT+RSVPLHPHSVRGSAAP
Sbjct: 923 TSGLSGFDHSIHHPMLQQIQTSVNLPPQHLLQGLSRGAPPPMTNRSVPLHPHSVRGSAAP 982
Query: 1151 PQPNNPVTGLVQELNSIQGFHIGQRVPNIGGSRMPSPAPGIGGTGNQPDAIQRLIQMGHR 1210
PQPNN V+GLVQELNSIQGFHIGQRVPN+GG R+PSPAPGIG GNQPDAIQRLIQMGHR
Sbjct: 983 PQPNNQVSGLVQELNSIQGFHIGQRVPNMGGPRIPSPAPGIG--GNQPDAIQRLIQMGHR 1042
Query: 1211 SN-SKQIHPLSASGHGQGMHGHELNMGYGF 1231
SN KQIHPLSASGHGQG++GHELNMGYG+
Sbjct: 1043 SNPPKQIHPLSASGHGQGIYGHELNMGYGY 1047
BLAST of Sgr016019 vs. NCBI nr
Match:
XP_038882196.1 (uncharacterized protein LOC120073406 isoform X3 [Benincasa hispida])
HSP 1 Score: 1694.5 bits (4387), Expect = 0.0e+00
Identity = 870/1049 (82.94%), Positives = 938/1049 (89.42%), Query Frame = 0
Query: 191 RKSKISYTRDFLLSLSELDICKKLPSGFDQSIITEFEEASCDRQRISGGLSLNSSRRNEY 250
+K K SYTRDFLLSLS+LD+CKKLPSGFDQSI+ EFEEAS DRQR+SG LSLNS RRNEY
Sbjct: 23 KKPKFSYTRDFLLSLSDLDVCKKLPSGFDQSIMAEFEEASYDRQRVSGALSLNSFRRNEY 82
Query: 251 GSSPPSRAETNNYSRRIHGKREVQSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE 310
GSSPPSRAET+NYSRRIHGKRE+ SSGRSDKDSDSQSDRDSVDSGWRYGD SRR QGPE
Sbjct: 83 GSSPPSRAETSNYSRRIHGKREIHSSGRSDKDSDSQSDRDSVDSGWRYGDQSRRPSQGPE 142
Query: 311 HDGLLGSGSFPRPSGYTTGFSAPKVRANDHYQLNKSNEPYHPPRPYKAVSHSRRETNDSY 370
HDGLLGSGSFPRPSGY F APKVRAND YQLN+SNEPYHPPRPYKAV+H R TNDSY
Sbjct: 143 HDGLLGSGSFPRPSGYVPAFLAPKVRANDQYQLNRSNEPYHPPRPYKAVAHQRGNTNDSY 202
Query: 371 NHETFGSFEYTSEDRVEEEKKRRASFESMRKEQHRAFQESHKLNPLKQRDEFGILMQLDE 430
NHETFGS EYTSEDRVEEEKKRRASFESMRKEQH+AFQESHK NP+KQRDEF ILM+LDE
Sbjct: 203 NHETFGSSEYTSEDRVEEEKKRRASFESMRKEQHKAFQESHKSNPVKQRDEFAILMELDE 262
Query: 431 SKDEKKLLNTSSGFDESTTL-QASKNDREKSFPLQTTVSRPLVPPGFTSTVLEKNFGTRS 490
SKD++K LNT SG DES +L Q SKNDREKSF Q+TVSRPLVPPGFTSTVLEKNF TRS
Sbjct: 263 SKDDEKSLNTISGVDESISLKQTSKNDREKSFTSQSTVSRPLVPPGFTSTVLEKNFATRS 322
Query: 491 SVNPHLLEGKDDVVDKCLQTKDEHLHNGISEDLLEKNSSEQMGCPGQYGKTSINASTNNT 550
SVNPHLLEGKDD +DKCLQTK+E LHNGI+EDL K+SSEQMG QY KTSIN STNNT
Sbjct: 323 SVNPHLLEGKDD-IDKCLQTKEEQLHNGIAEDLEGKSSSEQMGRAEQYRKTSINVSTNNT 382
Query: 551 SEKIIDLFSALDMSNKTTGIDVQSHENSLEVFEASENSAVAGCKTERRVLENTAIGEPSQ 610
EKI+DLFSA+DMSNKTT ID QSH+ SLEVFEAS+NS V CKTE ++ NTAIGEPSQ
Sbjct: 383 GEKILDLFSAVDMSNKTTEIDNQSHKKSLEVFEASDNSTVVDCKTE-KLPANTAIGEPSQ 442
Query: 611 VHSSSILEKLFGSAMKLDGSATNFIEQHDNEMEDACSPQNAQSSKFAHWFVDNDRKQEED 670
VHSSSILEKLFGSAMKLDG ATNFIEQHDNEM+D CSPQNAQSSKFAHWF+D+DRKQE+D
Sbjct: 443 VHSSSILEKLFGSAMKLDGDATNFIEQHDNEMDDVCSPQNAQSSKFAHWFMDSDRKQEDD 502
Query: 671 LSPKRSNDLLTLIVGGEKGG-DVSDVKHSELCLPTVTFHGYESAENYITSSATSSNVAKP 730
LSPKRS DLLT+IVGGEKGG DV+DVKHSE LPTV FHGYESAENYITSS+TSSNVAKP
Sbjct: 503 LSPKRSIDLLTMIVGGEKGGYDVADVKHSEQSLPTVAFHGYESAENYITSSSTSSNVAKP 562
Query: 731 EPFYDKSKPEAVSAILTCEAVEQTLLSKISENDPALQPSDQRWRDSDDDIKHPTVKRDDH 790
EPFY+KSKPEAVSAILTCEAVEQTLLSK+SEND AL PSDQR D+KHP+VK DDH
Sbjct: 563 EPFYNKSKPEAVSAILTCEAVEQTLLSKVSENDSALHPSDQRCSHPVADVKHPSVKSDDH 622
Query: 791 ASQHLLSLLQKGTSPVIVGYCDDGA------NKKEESTHNISNPGKTLTLETLFGSAFMK 850
ASQHLLSLLQKG+SP+I Y DDG + EESTHNISNPGKTLTLETLFGSAFMK
Sbjct: 623 ASQHLLSLLQKGSSPLISEYGDDGGYMGPVFHNNEESTHNISNPGKTLTLETLFGSAFMK 682
Query: 851 ELQSVGAPVSAQRGSSGSVKIDVSESQGPITDDGLLSNNEIRPSIMNHDHGDQRQQNQPD 910
ELQSVGAPVSAQRGSSGSVK D SES GPITDDG LSNNE+R S++NHDHGDQRQQNQPD
Sbjct: 683 ELQSVGAPVSAQRGSSGSVKSDASESHGPITDDGPLSNNEVRSSMLNHDHGDQRQQNQPD 742
Query: 911 IVRGQWFNLNGPRPELDSSHPRAKLGHKIGGYDGPPEIPLPEEDSLIISDSMKFQNLISI 970
IVRG W NLNGPRPE DSSHP AKLGHKIG GP E+P PEEDSLIISDSM FQNLIS+
Sbjct: 743 IVRGNWLNLNGPRPESDSSHPLAKLGHKIG---GPAEMPFPEEDSLIISDSMNFQNLISM 802
Query: 971 GNSTKPQPLFSHHTQDNNAAIFNPAFKDERPNMGGLEGLPFSASPYDRRETEMPHRKAPV 1030
GNS KPQPLFSH+TQDNN A+ +PAFKDER ++GG++GLPFSA+PYDRRETEMPHRKAPV
Sbjct: 803 GNSAKPQPLFSHNTQDNN-AMLSPAFKDERQSIGGVDGLPFSANPYDRRETEMPHRKAPV 862
Query: 1031 HSSFSQLHPPQTNNVKLFHQFESHPPNMNSQGELLLPEGMIHHDSPSNHQFVANMLRPPT 1090
HS+FSQLHPPQTNNVKLFHQFE PPNMNSQG+L+LPEG++HHDSPSNHQFVANMLRPPT
Sbjct: 863 HSAFSQLHPPQTNNVKLFHQFEPRPPNMNSQGDLMLPEGIVHHDSPSNHQFVANMLRPPT 922
Query: 1091 TGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPPPMTSRSVPLHPHSVRGSAAPP 1150
+GLSGFDHSIHHPM+QQMQTSVNLPPQHLLQGLSRG PPM++R++PLHPHS RGSAAP
Sbjct: 923 SGLSGFDHSIHHPMLQQMQTSVNLPPQHLLQGLSRGVAPPMSNRNLPLHPHSARGSAAPS 982
Query: 1151 QPNNPVTGLVQELNSIQGFHIGQRVPNIGGSRMPSPAPGIGGTGNQPDAIQRLIQMGHRS 1210
QPN+ VTGL QELNSIQGFHIGQRVPNIGG R+PSPAP GNQPDAIQRLIQMGHRS
Sbjct: 983 QPNHQVTGLPQELNSIQGFHIGQRVPNIGGPRLPSPAP-----GNQPDAIQRLIQMGHRS 1042
Query: 1211 NSKQIHPLSAS-GHGQGMHGHELNMGYGF 1231
NSKQIHPLSAS GHGQGM+GHELNMGYG+
Sbjct: 1043 NSKQIHPLSASGGHGQGMYGHELNMGYGY 1060
BLAST of Sgr016019 vs. NCBI nr
Match:
XP_038882201.1 (uncharacterized protein LOC120073406 isoform X4 [Benincasa hispida])
HSP 1 Score: 1688.3 bits (4371), Expect = 0.0e+00
Identity = 869/1049 (82.84%), Positives = 937/1049 (89.32%), Query Frame = 0
Query: 191 RKSKISYTRDFLLSLSELDICKKLPSGFDQSIITEFEEASCDRQRISGGLSLNSSRRNEY 250
+K K SYTRDFLLSLS+LD+CKKLPSGFDQSI+ EFEEAS DRQR+SG LSLNS RRNEY
Sbjct: 23 KKPKFSYTRDFLLSLSDLDVCKKLPSGFDQSIMAEFEEASYDRQRVSGALSLNSFRRNEY 82
Query: 251 GSSPPSRAETNNYSRRIHGKREVQSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE 310
GSSPPSRAET+NYSRRIHGKRE+ SSGRSDKDSDSQSDRDSVDSGWRYGD SRR QGPE
Sbjct: 83 GSSPPSRAETSNYSRRIHGKREIHSSGRSDKDSDSQSDRDSVDSGWRYGDQSRRPSQGPE 142
Query: 311 HDGLLGSGSFPRPSGYTTGFSAPKVRANDHYQLNKSNEPYHPPRPYKAVSHSRRETNDSY 370
HDGLLGSGSFPRPSGY F APKVRAND YQLN+SNEPYHPPRPYKAV+H R TNDSY
Sbjct: 143 HDGLLGSGSFPRPSGYVPAFLAPKVRANDQYQLNRSNEPYHPPRPYKAVAHQRGNTNDSY 202
Query: 371 NHETFGSFEYTSEDRVEEEKKRRASFESMRKEQHRAFQESHKLNPLKQRDEFGILMQLDE 430
NHETFGS EYTSEDRVEEEKKRRASFESMRKEQH+AFQESHK NP+KQRDEF ILM+LDE
Sbjct: 203 NHETFGSSEYTSEDRVEEEKKRRASFESMRKEQHKAFQESHKSNPVKQRDEFAILMELDE 262
Query: 431 SKDEKKLLNTSSGFDESTTL-QASKNDREKSFPLQTTVSRPLVPPGFTSTVLEKNFGTRS 490
SKD++K LNT SG DES +L Q SKNDREKSF Q+TVSRPLVPPGFTSTVLEKNF TRS
Sbjct: 263 SKDDEKSLNTISGVDESISLKQTSKNDREKSFTSQSTVSRPLVPPGFTSTVLEKNFATRS 322
Query: 491 SVNPHLLEGKDDVVDKCLQTKDEHLHNGISEDLLEKNSSEQMGCPGQYGKTSINASTNNT 550
SVNPHLLEGKDD +DKCLQTK+E LHNGI+EDL K+SSEQMG QY KTSIN STNNT
Sbjct: 323 SVNPHLLEGKDD-IDKCLQTKEEQLHNGIAEDLEGKSSSEQMGRAEQYRKTSINVSTNNT 382
Query: 551 SEKIIDLFSALDMSNKTTGIDVQSHENSLEVFEASENSAVAGCKTERRVLENTAIGEPSQ 610
EKI+DLFSA+DMSNKTT ID QSH+ SLEVFEAS+NS V CKTE ++ NTAIGEPSQ
Sbjct: 383 GEKILDLFSAVDMSNKTTEIDNQSHKKSLEVFEASDNSTVVDCKTE-KLPANTAIGEPSQ 442
Query: 611 VHSSSILEKLFGSAMKLDGSATNFIEQHDNEMEDACSPQNAQSSKFAHWFVDNDRKQEED 670
VHSSSILEKLFGSAMKLDG ATNFIE HDNEM+D CSPQNAQSSKFAHWF+D+DRKQE+D
Sbjct: 443 VHSSSILEKLFGSAMKLDGDATNFIE-HDNEMDDVCSPQNAQSSKFAHWFMDSDRKQEDD 502
Query: 671 LSPKRSNDLLTLIVGGEKGG-DVSDVKHSELCLPTVTFHGYESAENYITSSATSSNVAKP 730
LSPKRS DLLT+IVGGEKGG DV+DVKHSE LPTV FHGYESAENYITSS+TSSNVAKP
Sbjct: 503 LSPKRSIDLLTMIVGGEKGGYDVADVKHSEQSLPTVAFHGYESAENYITSSSTSSNVAKP 562
Query: 731 EPFYDKSKPEAVSAILTCEAVEQTLLSKISENDPALQPSDQRWRDSDDDIKHPTVKRDDH 790
EPFY+KSKPEAVSAILTCEAVEQTLLSK+SEND AL PSDQR D+KHP+VK DDH
Sbjct: 563 EPFYNKSKPEAVSAILTCEAVEQTLLSKVSENDSALHPSDQRCSHPVADVKHPSVKSDDH 622
Query: 791 ASQHLLSLLQKGTSPVIVGYCDDGA------NKKEESTHNISNPGKTLTLETLFGSAFMK 850
ASQHLLSLLQKG+SP+I Y DDG + EESTHNISNPGKTLTLETLFGSAFMK
Sbjct: 623 ASQHLLSLLQKGSSPLISEYGDDGGYMGPVFHNNEESTHNISNPGKTLTLETLFGSAFMK 682
Query: 851 ELQSVGAPVSAQRGSSGSVKIDVSESQGPITDDGLLSNNEIRPSIMNHDHGDQRQQNQPD 910
ELQSVGAPVSAQRGSSGSVK D SES GPITDDG LSNNE+R S++NHDHGDQRQQNQPD
Sbjct: 683 ELQSVGAPVSAQRGSSGSVKSDASESHGPITDDGPLSNNEVRSSMLNHDHGDQRQQNQPD 742
Query: 911 IVRGQWFNLNGPRPELDSSHPRAKLGHKIGGYDGPPEIPLPEEDSLIISDSMKFQNLISI 970
IVRG W NLNGPRPE DSSHP AKLGHKIG GP E+P PEEDSLIISDSM FQNLIS+
Sbjct: 743 IVRGNWLNLNGPRPESDSSHPLAKLGHKIG---GPAEMPFPEEDSLIISDSMNFQNLISM 802
Query: 971 GNSTKPQPLFSHHTQDNNAAIFNPAFKDERPNMGGLEGLPFSASPYDRRETEMPHRKAPV 1030
GNS KPQPLFSH+TQDNN A+ +PAFKDER ++GG++GLPFSA+PYDRRETEMPHRKAPV
Sbjct: 803 GNSAKPQPLFSHNTQDNN-AMLSPAFKDERQSIGGVDGLPFSANPYDRRETEMPHRKAPV 862
Query: 1031 HSSFSQLHPPQTNNVKLFHQFESHPPNMNSQGELLLPEGMIHHDSPSNHQFVANMLRPPT 1090
HS+FSQLHPPQTNNVKLFHQFE PPNMNSQG+L+LPEG++HHDSPSNHQFVANMLRPPT
Sbjct: 863 HSAFSQLHPPQTNNVKLFHQFEPRPPNMNSQGDLMLPEGIVHHDSPSNHQFVANMLRPPT 922
Query: 1091 TGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPPPMTSRSVPLHPHSVRGSAAPP 1150
+GLSGFDHSIHHPM+QQMQTSVNLPPQHLLQGLSRG PPM++R++PLHPHS RGSAAP
Sbjct: 923 SGLSGFDHSIHHPMLQQMQTSVNLPPQHLLQGLSRGVAPPMSNRNLPLHPHSARGSAAPS 982
Query: 1151 QPNNPVTGLVQELNSIQGFHIGQRVPNIGGSRMPSPAPGIGGTGNQPDAIQRLIQMGHRS 1210
QPN+ VTGL QELNSIQGFHIGQRVPNIGG R+PSPAP GNQPDAIQRLIQMGHRS
Sbjct: 983 QPNHQVTGLPQELNSIQGFHIGQRVPNIGGPRLPSPAP-----GNQPDAIQRLIQMGHRS 1042
Query: 1211 NSKQIHPLSAS-GHGQGMHGHELNMGYGF 1231
NSKQIHPLSAS GHGQGM+GHELNMGYG+
Sbjct: 1043 NSKQIHPLSASGGHGQGMYGHELNMGYGY 1059
BLAST of Sgr016019 vs. ExPASy TrEMBL
Match:
A0A6J1D0Y7 (uncharacterized protein LOC111016288 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111016288 PE=4 SV=1)
HSP 1 Score: 1789.6 bits (4634), Expect = 0.0e+00
Identity = 917/1050 (87.33%), Positives = 966/1050 (92.00%), Query Frame = 0
Query: 191 RKSKISYTRDFLLSLSELDICKKLPSGFDQSIITEFEEASCDRQRISGGLSLNSSRRNEY 250
+KSKISYTRDFLLSLSELDICKKLPSGFDQSII+EFE+AS DRQRISGGLSLNS RRNEY
Sbjct: 23 KKSKISYTRDFLLSLSELDICKKLPSGFDQSIISEFEDASYDRQRISGGLSLNSFRRNEY 82
Query: 251 GSSPPSRAETNNYSRRIHGKREVQSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE 310
GSSPPSRAE NNYSRRIHGKREV SSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE
Sbjct: 83 GSSPPSRAEANNYSRRIHGKREVHSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE 142
Query: 311 HDGLLGSGSFPRPSGYTTGFSAPKVRANDHYQLNKSNEPYHPPRPYKAVSHSRRETNDSY 370
HDGLLGSGSFPRPSGY TGFSAPKVRAN+ YQLN+SNEPYHPPRPYKAV+H R NDSY
Sbjct: 143 HDGLLGSGSFPRPSGYATGFSAPKVRANEQYQLNRSNEPYHPPRPYKAVAHPRGNINDSY 202
Query: 371 NHETFGSFEYTSEDRVEEEKKRRASFESMRKEQHRAFQESHKLNPLKQRDEFGILMQLDE 430
NHETFGS E TSEDRVEEEKKRRA FESMRKEQHRAFQES K NP+KQRDEFGI+MQLDE
Sbjct: 203 NHETFGSSEDTSEDRVEEEKKRRALFESMRKEQHRAFQESQKSNPVKQRDEFGIMMQLDE 262
Query: 431 SKDEKKLLNTSSGFDESTTLQASKNDREKSFPLQTTVSRPLVPPGFTSTVLEKNFGTRSS 490
SKD+KKLLNTSSGFDES LQASKNDREK FP TTVSRPLVPPGFTS VLEK+FGT+SS
Sbjct: 263 SKDDKKLLNTSSGFDESIILQASKNDREKPFPSHTTVSRPLVPPGFTSNVLEKSFGTKSS 322
Query: 491 VNPHLLEGKDDVVDKCLQTKDEHLHNGISEDLLEKNSSEQMGCPGQYGKTSINASTNNTS 550
VNPH LE KDDVVDK LQTKDEHLHNGISEDL+EKNSSEQMGCP QYGKTSINAS NNTS
Sbjct: 323 VNPHFLEVKDDVVDKSLQTKDEHLHNGISEDLVEKNSSEQMGCPEQYGKTSINASANNTS 382
Query: 551 EKIIDLFSALDMSNKTTGIDVQSHENSLEVFEASENSAVAGCKTERRVLENTAIGEPSQV 610
EKIIDLFSA+DMSNKTTGIDV+S E+SL+ +ASEN AVA CKTE +VL NTAIGE SQV
Sbjct: 383 EKIIDLFSAVDMSNKTTGIDVESLESSLQALQASENRAVADCKTE-KVLANTAIGETSQV 442
Query: 611 HSSSILEKLFGSAMKLDGSATNFIEQHDNEMEDACSPQNAQSSKFAHWFVDNDRKQEEDL 670
HSSSILEKLF SA+KLDG ATNFIEQH+NEMEDACSPQN QSSKFAHWFVDND KQE+ +
Sbjct: 443 HSSSILEKLFCSAIKLDGGATNFIEQHENEMEDACSPQNTQSSKFAHWFVDNDGKQEDGV 502
Query: 671 SPKRSNDLLTLIVGGEKGG-DVSDVKHSELCLPTVTFHGYESAENYITSSATSSNVAKPE 730
SPKRSNDLLTLIVGGEKGG D+SDV SE LPTV FHGYESAE+YITSS TSSN K E
Sbjct: 503 SPKRSNDLLTLIVGGEKGGYDISDVA-SEQSLPTVAFHGYESAESYITSSETSSNAQKTE 562
Query: 731 PFYDKSKPEAVSAILTCEAVEQTLLSKISENDPALQPSDQRWRDSDDDIKHPTVKRDDHA 790
PFYDKSKPEAVS+ILTCEAVEQTLLSK+SEND ALQPSDQRW SD + KHPT K DDHA
Sbjct: 563 PFYDKSKPEAVSSILTCEAVEQTLLSKMSENDSALQPSDQRWSHSDANNKHPTGKSDDHA 622
Query: 791 SQHLLSLLQKGTSPVIVGY-CDDG-------ANKKEESTHNISNPGKTLTLETLFGSAFM 850
SQHLLSLLQKGTSP+IVGY DDG NKKEES+HNISNPGKTLTLETLFGSAFM
Sbjct: 623 SQHLLSLLQKGTSPMIVGYGSDDGWNMGTGIHNKKEESSHNISNPGKTLTLETLFGSAFM 682
Query: 851 KELQSVGAPVSAQRGSSGSVKIDVSESQGPITDDGLLSNNEIRPSIMNHDHGDQRQQNQP 910
KELQSVGAPVSAQRGSSGS K+DVSES GPI DDGLLSNNEIRPS++NHDHGDQRQQNQP
Sbjct: 683 KELQSVGAPVSAQRGSSGSGKVDVSESHGPIMDDGLLSNNEIRPSMINHDHGDQRQQNQP 742
Query: 911 DIVRGQWFNLNGPRPELDSSHPRAKLGHKIGGYDGPPEIPLPEEDSLIISDSMKFQNLIS 970
D+VRGQW NLNGPRPELDSSHP+AKLGHKIGGYDGP E+P PEEDSLIISDSM FQNLIS
Sbjct: 743 DLVRGQWLNLNGPRPELDSSHPQAKLGHKIGGYDGPAEMPFPEEDSLIISDSMNFQNLIS 802
Query: 971 IGNSTKPQPLFSHHTQDNNAAIFNPAFKDERPNMGGLEGLPFSASPYDRRETEMPHRKAP 1030
IGNS KPQPLFSHHTQDNN+AIFN AFKDERP+MGGLEGLPFSASP+DRRETEMPHRKAP
Sbjct: 803 IGNSIKPQPLFSHHTQDNNSAIFNSAFKDERPSMGGLEGLPFSASPFDRRETEMPHRKAP 862
Query: 1031 VHSSFSQLHPPQTNNVKLFHQFESHPPNMNSQGELLLPEGMIHHDSPSNHQFVANMLRPP 1090
VHSSF QLHP Q NNVKLFHQFESHPPNMNSQGELLLPEGM+HHDSPSNHQFVANMLRPP
Sbjct: 863 VHSSFPQLHPSQANNVKLFHQFESHPPNMNSQGELLLPEGMVHHDSPSNHQFVANMLRPP 922
Query: 1091 TTGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPPPMTSRSVPLHPHSVRGSAAP 1150
T+GLSGFDHSIHHPM+QQ+QTSVNLPPQHLLQGLSRGAPPPMT+RSVPLHPHSVRGSAAP
Sbjct: 923 TSGLSGFDHSIHHPMLQQIQTSVNLPPQHLLQGLSRGAPPPMTNRSVPLHPHSVRGSAAP 982
Query: 1151 PQPNNPVTGLVQELNSIQGFHIGQRVPNIGGSRMPSPAPGIGGTGNQPDAIQRLIQMGHR 1210
PQPNN V+GLVQELNSIQGFHIGQRVPN+GG R+PSPAPGIG GNQPDAIQRLIQMGHR
Sbjct: 983 PQPNNQVSGLVQELNSIQGFHIGQRVPNMGGPRIPSPAPGIG--GNQPDAIQRLIQMGHR 1042
Query: 1211 SN-SKQIHPLSASGHGQGMHGHELNMGYGF 1231
SN KQIHPLSASGHGQG++GHELNMGYG+
Sbjct: 1043 SNPPKQIHPLSASGHGQGIYGHELNMGYGY 1068
BLAST of Sgr016019 vs. ExPASy TrEMBL
Match:
A0A6J1D0L9 (uncharacterized protein LOC111016288 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111016288 PE=4 SV=1)
HSP 1 Score: 1783.1 bits (4617), Expect = 0.0e+00
Identity = 916/1050 (87.24%), Positives = 965/1050 (91.90%), Query Frame = 0
Query: 191 RKSKISYTRDFLLSLSELDICKKLPSGFDQSIITEFEEASCDRQRISGGLSLNSSRRNEY 250
+KSKISYTRDFLLSLSELDICKKLPSGFDQSII+EFE+AS DRQRISGGLSLNS RRNEY
Sbjct: 23 KKSKISYTRDFLLSLSELDICKKLPSGFDQSIISEFEDASYDRQRISGGLSLNSFRRNEY 82
Query: 251 GSSPPSRAETNNYSRRIHGKREVQSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE 310
GSSPPSRAE NNYSRRIHGKREV SSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE
Sbjct: 83 GSSPPSRAEANNYSRRIHGKREVHSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE 142
Query: 311 HDGLLGSGSFPRPSGYTTGFSAPKVRANDHYQLNKSNEPYHPPRPYKAVSHSRRETNDSY 370
HDGLLGSGSFPRPSGY TGFSAPKVRAN+ YQLN+SNEPYHPPRPYKAV+H R NDSY
Sbjct: 143 HDGLLGSGSFPRPSGYATGFSAPKVRANEQYQLNRSNEPYHPPRPYKAVAHPRGNINDSY 202
Query: 371 NHETFGSFEYTSEDRVEEEKKRRASFESMRKEQHRAFQESHKLNPLKQRDEFGILMQLDE 430
NHETFGS E TSEDRVEEEKKRRA FESMRKEQHRAFQES K NP+KQRDEFGI+MQLDE
Sbjct: 203 NHETFGSSEDTSEDRVEEEKKRRALFESMRKEQHRAFQESQKSNPVKQRDEFGIMMQLDE 262
Query: 431 SKDEKKLLNTSSGFDESTTLQASKNDREKSFPLQTTVSRPLVPPGFTSTVLEKNFGTRSS 490
SKD+KKLLNTSSGFDES LQASKNDREK FP TTVSRPLVPPGFTS VLEK+FGT+SS
Sbjct: 263 SKDDKKLLNTSSGFDESIILQASKNDREKPFPSHTTVSRPLVPPGFTSNVLEKSFGTKSS 322
Query: 491 VNPHLLEGKDDVVDKCLQTKDEHLHNGISEDLLEKNSSEQMGCPGQYGKTSINASTNNTS 550
VNPH LE KDDVVDK LQTKDEHLHNGISEDL+EKNSSEQMGCP QYGKTSINAS NNTS
Sbjct: 323 VNPHFLEVKDDVVDKSLQTKDEHLHNGISEDLVEKNSSEQMGCPEQYGKTSINASANNTS 382
Query: 551 EKIIDLFSALDMSNKTTGIDVQSHENSLEVFEASENSAVAGCKTERRVLENTAIGEPSQV 610
EKIIDLFSA+DMSNKTTGIDV+S E+SL+ +ASEN AVA CKTE +VL NTAIGE SQV
Sbjct: 383 EKIIDLFSAVDMSNKTTGIDVESLESSLQALQASENRAVADCKTE-KVLANTAIGETSQV 442
Query: 611 HSSSILEKLFGSAMKLDGSATNFIEQHDNEMEDACSPQNAQSSKFAHWFVDNDRKQEEDL 670
HSSSILEKLF SA+KLDG ATNFIE H+NEMEDACSPQN QSSKFAHWFVDND KQE+ +
Sbjct: 443 HSSSILEKLFCSAIKLDGGATNFIE-HENEMEDACSPQNTQSSKFAHWFVDNDGKQEDGV 502
Query: 671 SPKRSNDLLTLIVGGEKGG-DVSDVKHSELCLPTVTFHGYESAENYITSSATSSNVAKPE 730
SPKRSNDLLTLIVGGEKGG D+SDV SE LPTV FHGYESAE+YITSS TSSN K E
Sbjct: 503 SPKRSNDLLTLIVGGEKGGYDISDVA-SEQSLPTVAFHGYESAESYITSSETSSNAQKTE 562
Query: 731 PFYDKSKPEAVSAILTCEAVEQTLLSKISENDPALQPSDQRWRDSDDDIKHPTVKRDDHA 790
PFYDKSKPEAVS+ILTCEAVEQTLLSK+SEND ALQPSDQRW SD + KHPT K DDHA
Sbjct: 563 PFYDKSKPEAVSSILTCEAVEQTLLSKMSENDSALQPSDQRWSHSDANNKHPTGKSDDHA 622
Query: 791 SQHLLSLLQKGTSPVIVGY-CDDG-------ANKKEESTHNISNPGKTLTLETLFGSAFM 850
SQHLLSLLQKGTSP+IVGY DDG NKKEES+HNISNPGKTLTLETLFGSAFM
Sbjct: 623 SQHLLSLLQKGTSPMIVGYGSDDGWNMGTGIHNKKEESSHNISNPGKTLTLETLFGSAFM 682
Query: 851 KELQSVGAPVSAQRGSSGSVKIDVSESQGPITDDGLLSNNEIRPSIMNHDHGDQRQQNQP 910
KELQSVGAPVSAQRGSSGS K+DVSES GPI DDGLLSNNEIRPS++NHDHGDQRQQNQP
Sbjct: 683 KELQSVGAPVSAQRGSSGSGKVDVSESHGPIMDDGLLSNNEIRPSMINHDHGDQRQQNQP 742
Query: 911 DIVRGQWFNLNGPRPELDSSHPRAKLGHKIGGYDGPPEIPLPEEDSLIISDSMKFQNLIS 970
D+VRGQW NLNGPRPELDSSHP+AKLGHKIGGYDGP E+P PEEDSLIISDSM FQNLIS
Sbjct: 743 DLVRGQWLNLNGPRPELDSSHPQAKLGHKIGGYDGPAEMPFPEEDSLIISDSMNFQNLIS 802
Query: 971 IGNSTKPQPLFSHHTQDNNAAIFNPAFKDERPNMGGLEGLPFSASPYDRRETEMPHRKAP 1030
IGNS KPQPLFSHHTQDNN+AIFN AFKDERP+MGGLEGLPFSASP+DRRETEMPHRKAP
Sbjct: 803 IGNSIKPQPLFSHHTQDNNSAIFNSAFKDERPSMGGLEGLPFSASPFDRRETEMPHRKAP 862
Query: 1031 VHSSFSQLHPPQTNNVKLFHQFESHPPNMNSQGELLLPEGMIHHDSPSNHQFVANMLRPP 1090
VHSSF QLHP Q NNVKLFHQFESHPPNMNSQGELLLPEGM+HHDSPSNHQFVANMLRPP
Sbjct: 863 VHSSFPQLHPSQANNVKLFHQFESHPPNMNSQGELLLPEGMVHHDSPSNHQFVANMLRPP 922
Query: 1091 TTGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPPPMTSRSVPLHPHSVRGSAAP 1150
T+GLSGFDHSIHHPM+QQ+QTSVNLPPQHLLQGLSRGAPPPMT+RSVPLHPHSVRGSAAP
Sbjct: 923 TSGLSGFDHSIHHPMLQQIQTSVNLPPQHLLQGLSRGAPPPMTNRSVPLHPHSVRGSAAP 982
Query: 1151 PQPNNPVTGLVQELNSIQGFHIGQRVPNIGGSRMPSPAPGIGGTGNQPDAIQRLIQMGHR 1210
PQPNN V+GLVQELNSIQGFHIGQRVPN+GG R+PSPAPGIG GNQPDAIQRLIQMGHR
Sbjct: 983 PQPNNQVSGLVQELNSIQGFHIGQRVPNMGGPRIPSPAPGIG--GNQPDAIQRLIQMGHR 1042
Query: 1211 SN-SKQIHPLSASGHGQGMHGHELNMGYGF 1231
SN KQIHPLSASGHGQG++GHELNMGYG+
Sbjct: 1043 SNPPKQIHPLSASGHGQGIYGHELNMGYGY 1067
BLAST of Sgr016019 vs. ExPASy TrEMBL
Match:
A0A6J1CZT1 (uncharacterized protein LOC111016288 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111016288 PE=4 SV=1)
HSP 1 Score: 1744.9 bits (4518), Expect = 0.0e+00
Identity = 900/1050 (85.71%), Positives = 947/1050 (90.19%), Query Frame = 0
Query: 191 RKSKISYTRDFLLSLSELDICKKLPSGFDQSIITEFEEASCDRQRISGGLSLNSSRRNEY 250
+KSKISYTRDFLLSLSELDICKKLPSGFDQSII+EFE+AS DRQRISGGLSLNS RRNEY
Sbjct: 23 KKSKISYTRDFLLSLSELDICKKLPSGFDQSIISEFEDASYDRQRISGGLSLNSFRRNEY 82
Query: 251 GSSPPSRAETNNYSRRIHGKREVQSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE 310
GSSPPSRAE NNYSRRIHGKREV SSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE
Sbjct: 83 GSSPPSRAEANNYSRRIHGKREVHSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE 142
Query: 311 HDGLLGSGSFPRPSGYTTGFSAPKVRANDHYQLNKSNEPYHPPRPYKAVSHSRRETNDSY 370
HDGLLGSGSFPRPSGY TGFSAPKVRAN+ YQLN+SNEPYHPPRPYKAV+H R NDSY
Sbjct: 143 HDGLLGSGSFPRPSGYATGFSAPKVRANEQYQLNRSNEPYHPPRPYKAVAHPRGNINDSY 202
Query: 371 NHETFGSFEYTSEDRVEEEKKRRASFESMRKEQHRAFQESHKLNPLKQRDEFGILMQLDE 430
NHETFGS E TSEDRVEEEKKRRA FESMRKEQHRAFQES K NP+KQRDEFGI+MQLDE
Sbjct: 203 NHETFGSSEDTSEDRVEEEKKRRALFESMRKEQHRAFQESQKSNPVKQRDEFGIMMQLDE 262
Query: 431 SKDEKKLLNTSSGFDESTTLQASKNDREKSFPLQTTVSRPLVPPGFTSTVLEKNFGTRSS 490
SKD+KKLLNTSSGFDES LQASKNDREK FP TTVSRPLVPPGFTS VLEK+FGT+SS
Sbjct: 263 SKDDKKLLNTSSGFDESIILQASKNDREKPFPSHTTVSRPLVPPGFTSNVLEKSFGTKSS 322
Query: 491 VNPHLLEGKDDVVDKCLQTKDEHLHNGISEDLLEKNSSEQMGCPGQYGKTSINASTNNTS 550
VNPH LE KDDVVDK LQTKDEHLHNGISEDL+EKNSSEQMGCP QYGKTSINAS NNTS
Sbjct: 323 VNPHFLEVKDDVVDKSLQTKDEHLHNGISEDLVEKNSSEQMGCPEQYGKTSINASANNTS 382
Query: 551 EKIIDLFSALDMSNKTTGIDVQSHENSLEVFEASENSAVAGCKTERRVLENTAIGEPSQV 610
EKIIDLFSA+DMSNKTTGIDV+S E+SL+ +ASEN AVA CKTE +VL NTAIGE SQV
Sbjct: 383 EKIIDLFSAVDMSNKTTGIDVESLESSLQALQASENRAVADCKTE-KVLANTAIGETSQV 442
Query: 611 HSSSILEKLFGSAMKLDGSATNFIEQHDNEMEDACSPQNAQSSKFAHWFVDNDRKQEEDL 670
HSSSILEKLF SA+KLDG ATNFIEQH+NEMEDACSPQN QSSKFAHWFVDN
Sbjct: 443 HSSSILEKLFCSAIKLDGGATNFIEQHENEMEDACSPQNTQSSKFAHWFVDN-------- 502
Query: 671 SPKRSNDLLTLIVGGEKGG-DVSDVKHSELCLPTVTFHGYESAENYITSSATSSNVAKPE 730
GGEKGG D+SDV SE LPTV FHGYESAE+YITSS TSSN K E
Sbjct: 503 -------------GGEKGGYDISDVA-SEQSLPTVAFHGYESAESYITSSETSSNAQKTE 562
Query: 731 PFYDKSKPEAVSAILTCEAVEQTLLSKISENDPALQPSDQRWRDSDDDIKHPTVKRDDHA 790
PFYDKSKPEAVS+ILTCEAVEQTLLSK+SEND ALQPSDQRW SD + KHPT K DDHA
Sbjct: 563 PFYDKSKPEAVSSILTCEAVEQTLLSKMSENDSALQPSDQRWSHSDANNKHPTGKSDDHA 622
Query: 791 SQHLLSLLQKGTSPVIVGY-CDDG-------ANKKEESTHNISNPGKTLTLETLFGSAFM 850
SQHLLSLLQKGTSP+IVGY DDG NKKEES+HNISNPGKTLTLETLFGSAFM
Sbjct: 623 SQHLLSLLQKGTSPMIVGYGSDDGWNMGTGIHNKKEESSHNISNPGKTLTLETLFGSAFM 682
Query: 851 KELQSVGAPVSAQRGSSGSVKIDVSESQGPITDDGLLSNNEIRPSIMNHDHGDQRQQNQP 910
KELQSVGAPVSAQRGSSGS K+DVSES GPI DDGLLSNNEIRPS++NHDHGDQRQQNQP
Sbjct: 683 KELQSVGAPVSAQRGSSGSGKVDVSESHGPIMDDGLLSNNEIRPSMINHDHGDQRQQNQP 742
Query: 911 DIVRGQWFNLNGPRPELDSSHPRAKLGHKIGGYDGPPEIPLPEEDSLIISDSMKFQNLIS 970
D+VRGQW NLNGPRPELDSSHP+AKLGHKIGGYDGP E+P PEEDSLIISDSM FQNLIS
Sbjct: 743 DLVRGQWLNLNGPRPELDSSHPQAKLGHKIGGYDGPAEMPFPEEDSLIISDSMNFQNLIS 802
Query: 971 IGNSTKPQPLFSHHTQDNNAAIFNPAFKDERPNMGGLEGLPFSASPYDRRETEMPHRKAP 1030
IGNS KPQPLFSHHTQDNN+AIFN AFKDERP+MGGLEGLPFSASP+DRRETEMPHRKAP
Sbjct: 803 IGNSIKPQPLFSHHTQDNNSAIFNSAFKDERPSMGGLEGLPFSASPFDRRETEMPHRKAP 862
Query: 1031 VHSSFSQLHPPQTNNVKLFHQFESHPPNMNSQGELLLPEGMIHHDSPSNHQFVANMLRPP 1090
VHSSF QLHP Q NNVKLFHQFESHPPNMNSQGELLLPEGM+HHDSPSNHQFVANMLRPP
Sbjct: 863 VHSSFPQLHPSQANNVKLFHQFESHPPNMNSQGELLLPEGMVHHDSPSNHQFVANMLRPP 922
Query: 1091 TTGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPPPMTSRSVPLHPHSVRGSAAP 1150
T+GLSGFDHSIHHPM+QQ+QTSVNLPPQHLLQGLSRGAPPPMT+RSVPLHPHSVRGSAAP
Sbjct: 923 TSGLSGFDHSIHHPMLQQIQTSVNLPPQHLLQGLSRGAPPPMTNRSVPLHPHSVRGSAAP 982
Query: 1151 PQPNNPVTGLVQELNSIQGFHIGQRVPNIGGSRMPSPAPGIGGTGNQPDAIQRLIQMGHR 1210
PQPNN V+GLVQELNSIQGFHIGQRVPN+GG R+PSPAPGIG GNQPDAIQRLIQMGHR
Sbjct: 983 PQPNNQVSGLVQELNSIQGFHIGQRVPNMGGPRIPSPAPGIG--GNQPDAIQRLIQMGHR 1042
Query: 1211 SN-SKQIHPLSASGHGQGMHGHELNMGYGF 1231
SN KQIHPLSASGHGQG++GHELNMGYG+
Sbjct: 1043 SNPPKQIHPLSASGHGQGIYGHELNMGYGY 1047
BLAST of Sgr016019 vs. ExPASy TrEMBL
Match:
A0A0A0L649 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G121650 PE=4 SV=1)
HSP 1 Score: 1660.2 bits (4298), Expect = 0.0e+00
Identity = 850/1049 (81.03%), Positives = 926/1049 (88.27%), Query Frame = 0
Query: 191 RKSKISYTRDFLLSLSELDICKKLPSGFDQSIITEFEEASCDRQRISGGLSLNSSRRNEY 250
+K K SYTRDFLLSLS+LD+CKKLPS FD+SII EFEEAS DRQR+SG LSLNS RRNEY
Sbjct: 23 KKPKFSYTRDFLLSLSDLDVCKKLPSSFDKSIIAEFEEASYDRQRVSGALSLNSFRRNEY 82
Query: 251 GSSPPSRAETNNYSRRIHGKREVQSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE 310
GSSPPS+AE +NYSRRIHGKREV SSGRSDKDSDSQSDRDSVDSGWRYGD SRRS QGPE
Sbjct: 83 GSSPPSKAEPSNYSRRIHGKREVHSSGRSDKDSDSQSDRDSVDSGWRYGDQSRRSSQGPE 142
Query: 311 HDGLLGSGSFPRPSGYTTGFSAPKVRANDHYQLNKSNEPYHPPRPYKAVSHSRRETNDSY 370
HDGLLGSGSFPRPSG+ T FSAPKVR ND YQLN+SNEPYHPPRPYKA +H R NDSY
Sbjct: 143 HDGLLGSGSFPRPSGFATAFSAPKVRGNDQYQLNRSNEPYHPPRPYKAAAHQRGNANDSY 202
Query: 371 NHETFGSFEYTSEDRVEEEKKRRASFESMRKEQHRAFQESHKLNPLKQRDEFGILMQLDE 430
NHETFGS E+TSEDRVEEEKKRRASFESMRKEQH+AFQESHK NP+KQ+DEF ILM++DE
Sbjct: 203 NHETFGSSEFTSEDRVEEEKKRRASFESMRKEQHKAFQESHKSNPVKQKDEFAILMEMDE 262
Query: 431 SKDEKKLLNTSSGFDESTTLQASKNDREKSFPLQTTVSRPLVPPGFTSTVLEKNFGTRSS 490
SKD++KLL TSSGFDES ++Q SKNDREKSF Q+TVSRPLVPPGF +TVLEKNF TRSS
Sbjct: 263 SKDDEKLLKTSSGFDESISIQTSKNDREKSFTSQSTVSRPLVPPGFATTVLEKNFATRSS 322
Query: 491 VNPHLLEGKDDVVDKCLQTKDEHLHNGISEDLLEKNSSEQMGCPGQYGKTSINASTNNTS 550
VNPHLLEGKDD VDKCLQTK+E +HNGI E+L K SSEQM QYGK+SINASTNNT
Sbjct: 323 VNPHLLEGKDD-VDKCLQTKEEQMHNGIVENLEGKGSSEQMDRTEQYGKSSINASTNNTG 382
Query: 551 EKIIDLFSALDMSNKTTGIDVQSHENSLEVFEASENSAVAGCKTERRVLENTAIGEPSQV 610
EKIIDLFSA+D SNKTTGID+QSH+ SLEVFEASE SA KTE ++ NTAIGEPSQV
Sbjct: 383 EKIIDLFSAVDSSNKTTGIDIQSHKKSLEVFEASEKSAAVDFKTE-KLPANTAIGEPSQV 442
Query: 611 HSSSILEKLFGSAMKLDGSATNFIEQHDNEMEDACSPQNAQSSKFAHWFVDNDRKQEEDL 670
HSSSILEKLFGSA+KLDG A NFIEQHDNEM+DACSPQN+QSSKFA WFVDNDRKQE++L
Sbjct: 443 HSSSILEKLFGSAIKLDGGAPNFIEQHDNEMDDACSPQNSQSSKFARWFVDNDRKQEDNL 502
Query: 671 SPKRSNDLLTLIVGGEKGG-DVSDVKHSELCLPTVTFHGYESAENYITSSATSSNVAKPE 730
SPKRS DLLT+IVGGEKGG DVSDV+HSE LPTV FHGYES ENYITSSATSSNVAKPE
Sbjct: 503 SPKRSIDLLTMIVGGEKGGYDVSDVEHSEQSLPTVAFHGYESTENYITSSATSSNVAKPE 562
Query: 731 PFYDKSKPEAVSAILTCEAVEQTLLSKISENDPALQPSDQRWRDSDDDIKHPTVKRDDHA 790
PFY+KSKPEAVSAILTCEAVEQTLLS +S ND ALQP+DQ S D+KHP+VK DDHA
Sbjct: 563 PFYNKSKPEAVSAILTCEAVEQTLLSTVSGNDSALQPADQTCIHSVADVKHPSVKSDDHA 622
Query: 791 SQHLLSLLQKGTSPVIVGYCDDGA-------NKKEESTHNISNPGKTLTLETLFGSAFMK 850
S HLLSLLQKG+SP++ Y DDGA N KEESTHN+SNPGKTLTLETLFGSAFMK
Sbjct: 623 SHHLLSLLQKGSSPLVSEYGDDGAYMSTAFHNNKEESTHNVSNPGKTLTLETLFGSAFMK 682
Query: 851 ELQSVGAPVSAQRGSSGSVKIDVSESQGPITDDGLLSNNEIRPSIMNHDHGDQRQQNQPD 910
ELQSVGAPVSAQRGSSGSVK D SES GP DDGLLSNNEIR S++NHDHGDQRQQNQPD
Sbjct: 683 ELQSVGAPVSAQRGSSGSVKSDASESHGPTPDDGLLSNNEIRSSMINHDHGDQRQQNQPD 742
Query: 911 IVRGQWFNLNGPRPELDSSHPRAKLGHKIGGYDGPPEIPLPEEDSLIISDSMKFQNLISI 970
IVRG W NLNGPRPE +SSHP AKLGH+IG GP E+P PEEDSLIISDSM FQNLIS+
Sbjct: 743 IVRGHWLNLNGPRPESESSHPLAKLGHRIG---GPAEMPFPEEDSLIISDSMNFQNLISM 802
Query: 971 GNSTKPQPLFSHHTQDNNAAIFNPAFKDERPNMGGLEGLPFSASPYDRRETEMPHRKAPV 1030
GNS KPQP FSH+TQDNNAA+ NPAFKDER +MGGL+GLPFSA+ YDRRETEMPHRKAPV
Sbjct: 803 GNSAKPQPPFSHNTQDNNAAMLNPAFKDERQSMGGLDGLPFSANAYDRRETEMPHRKAPV 862
Query: 1031 HSSFSQLHPPQTNNVKLFHQFESHPPNMNSQGELLLPEGMIHHDSPSNHQFVANMLRPPT 1090
HSSFSQLHPPQTNN+KLFHQFESHPPNMNSQG+++L EG++HHDSPSNHQF+ANMLRPPT
Sbjct: 863 HSSFSQLHPPQTNNIKLFHQFESHPPNMNSQGDVMLAEGIVHHDSPSNHQFIANMLRPPT 922
Query: 1091 TGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPPPMTSRSVPLHPHSVRGSAAPP 1150
+GLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRG PPM SR++PLH HS+R SAAPP
Sbjct: 923 SGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGVAPPMASRTLPLHHHSIRASAAPP 982
Query: 1151 QPNNPVTGLVQELNSIQGFHIGQRVPNIGGSRMPSPAPGIGGTGNQPDAIQRLIQMGHRS 1210
QPN+ VT LV ELNS+QGFHIGQRVPNI G R+ SPAP GNQPDAIQRLIQMGHRS
Sbjct: 983 QPNHQVTSLVDELNSMQGFHIGQRVPNIVGPRISSPAP-----GNQPDAIQRLIQMGHRS 1042
Query: 1211 NSKQIHPLSA-SGHGQGMHGHELNMGYGF 1231
NSKQI+ LSA GHGQG++GHELNMGYG+
Sbjct: 1043 NSKQINHLSAGGGHGQGIYGHELNMGYGY 1061
BLAST of Sgr016019 vs. ExPASy TrEMBL
Match:
A0A6J1F449 (uncharacterized protein LOC111442216 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111442216 PE=4 SV=1)
HSP 1 Score: 1644.8 bits (4258), Expect = 0.0e+00
Identity = 856/1053 (81.29%), Positives = 917/1053 (87.08%), Query Frame = 0
Query: 191 RKSKISYTRDFLLSLSELDICKKLPSGFDQSIITEFEEASCDRQRISGGLSLNSSRRNEY 250
+K K SYTRDFLLSLS LD+CKKLPSGFDQS+I E EEAS DRQR+SGGLSLNS RRNEY
Sbjct: 23 KKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEEASYDRQRVSGGLSLNSFRRNEY 82
Query: 251 GSSPPSRAETNNYSRRIHGKREVQSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQGPE 310
GSSPP+RAET NY+RRIHGK+++ SSGRSDKDSDSQSDRDSVDSGWR DHSRR QGPE
Sbjct: 83 GSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDSQSDRDSVDSGWRLSDHSRRPSQGPE 142
Query: 311 HDGLLGSGSFPRPSGYTTGFSAPKVRANDHYQLNKSNEPYHPPRPYKAVSHSRRETNDSY 370
DGLLGSGSFPRP GY T FSAPKVRA+D YQLN+SNEPYHPPRPYKAV+H R T+DSY
Sbjct: 143 QDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNEPYHPPRPYKAVAHQRGNTHDSY 202
Query: 371 NHETFGSFEYTSEDRVEEEKKRRASFESMRKEQHRAFQESHKLNPLKQRDEFGILMQLDE 430
NHETFGS E TSEDRVEEE+KRRASFESMRKEQHRAFQE HK NP+KQRD F ILMQLDE
Sbjct: 203 NHETFGSSELTSEDRVEEERKRRASFESMRKEQHRAFQEGHKSNPVKQRDGFDILMQLDE 262
Query: 431 SKDEKKLLNTSSGFDESTTLQASKNDREKSFPLQTTVSRPLVPPGFTSTVLEKNFGTRSS 490
+KD+KKLLNTSSGFDE +LQ+SKNDRE FP QTTVSRPLVPPGFTSTVLEKNFGTRSS
Sbjct: 263 AKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVSRPLVPPGFTSTVLEKNFGTRSS 322
Query: 491 VNPHLLEGKDDVVDKCLQTKDEHLHNGISEDLLEKNSSEQMGCPGQYGKTSINASTNNTS 550
VNP LLEGKDD VDK LQTKD+ LHNG SEDL K+S EQMG P YGKTS NASTNNT
Sbjct: 323 VNPRLLEGKDD-VDKSLQTKDKQLHNGFSEDLEGKSSLEQMGRPEHYGKTSTNASTNNTG 382
Query: 551 EKIIDLFSALDMSNKTTGIDVQSHENSLEVFEASENSAVAGCKTERRVLENTAIGEPSQV 610
E II L SA+DMSN+TTG DVQS ENSLEVFEA ENSAV CKTE V NTA+GE SQ
Sbjct: 383 ENIIHLLSAVDMSNQTTGTDVQSRENSLEVFEAIENSAVDNCKTE-MVPANTAVGEASQG 442
Query: 611 HSSSILEKLFGSAMKLDGSATNFIEQHDNEMEDACSPQNAQSSKFAHWFVDNDRKQEEDL 670
HSSSILEKLFGS +KLDG ATNFIEQ D+E +DACSPQNAQSS+FAHWF+DNDRKQ +DL
Sbjct: 443 HSSSILEKLFGSTIKLDGGATNFIEQQDSEKDDACSPQNAQSSRFAHWFMDNDRKQGDDL 502
Query: 671 SPKRSNDLLTLIVGGEKGG--DVSDVKHSELCLPTVTFHGYESAENYITSSATSSNVAKP 730
SPKRS DLLT+I GEKGG VSDVKHSE LPTV F GYESAE+YITSSATSSNVAK
Sbjct: 503 SPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQSLPTVVFQGYESAESYITSSATSSNVAKT 562
Query: 731 EPFYDKSKPEAVSAILTCEAVEQTLLSKISENDPALQPSDQRWRDSDDDIKHPTVKRDDH 790
EPFYDKSKPEAVSAILTCEAVEQTLLSK+ END ALQPSDQRW SDDD+KHPTVK DD
Sbjct: 563 EPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSDQRWSHSDDDVKHPTVKNDDL 622
Query: 791 ASQHLLSLLQKGTSPVIVGYCDDGA-------NKKEESTHNISNPGKTLTLETLFGSAFM 850
AS HLLSLLQKG+SPVI GY DDG NKKEESTHN+SNPGKTLTLETLFGSAFM
Sbjct: 623 ASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTHNVSNPGKTLTLETLFGSAFM 682
Query: 851 KELQSVGAPVSAQR-GSSGSVKIDVSESQGPITDDGLLSNNEIRPSIMNHDHGDQRQQNQ 910
KELQSVGAPVSAQR GSSGSVK DV E PITDDGLLSNNEIRPS++NHDHG QRQQNQ
Sbjct: 683 KELQSVGAPVSAQRGGSSGSVKSDVPEPCDPITDDGLLSNNEIRPSMINHDHGVQRQQNQ 742
Query: 911 PDIVRGQWFNLNGPRPELDSSHPRAKLGHKIGGYDGPPEIPLPEEDSLIISDSMKFQNLI 970
PDIVRGQW NLNGP P +DSSHP AKLGHK+GGYDG E+P P+EDSLIISDSM QNL+
Sbjct: 743 PDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGAAEMPFPQEDSLIISDSMNLQNLM 802
Query: 971 SIGNSTKPQPLFSHHTQDNNAAIFNPAFKDERPNMGGLEGLPFSASPYDRRETEMPHRKA 1030
SIGNS +PQPLFSH++QD+NAAIFNPAFKDERP+MGGLEGLPFSAS YDRRETEMP KA
Sbjct: 803 SIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEGLPFSASLYDRRETEMPQWKA 862
Query: 1031 PVHSSFSQLHPPQTNNVKLFHQFESHPPNMNSQGELLLPEGMIHHDSPSNHQFVANMLRP 1090
PVHS+FSQLHP QTNNVK FHQFESHPPNMNSQG++ LPEGM+HH SPSNHQFV+NMLRP
Sbjct: 863 PVHSNFSQLHPQQTNNVK-FHQFESHPPNMNSQGDIALPEGMVHHGSPSNHQFVSNMLRP 922
Query: 1091 PTTGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPPPMTSRSVPLHPHSVRGSAA 1150
PT+GLSGFDH IHHPM+QQMQTS NLPPQHLLQ LSRGAP PMT+RSVPLHPHS+RGSAA
Sbjct: 923 PTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGAPLPMTNRSVPLHPHSIRGSAA 982
Query: 1151 PPQPNNPVTGLVQELNSIQGFHIGQRVPNIGGSRMPSPAPGIGGTGNQPDAIQRLIQMGH 1210
QPNN V GL+QE NSIQGFH GQRVPN GG R+PSPAP GNQPDAIQRLIQMGH
Sbjct: 983 TLQPNNQVPGLMQEQNSIQGFHTGQRVPNTGGPRIPSPAP-----GNQPDAIQRLIQMGH 1042
Query: 1211 RSN--SKQIHPLSAS-GHGQGMHGHELNMGYGF 1231
RSN SKQIHPLSAS GHGQGM+GHELNMGYG+
Sbjct: 1043 RSNSTSKQIHPLSASGGHGQGMYGHELNMGYGY 1067
BLAST of Sgr016019 vs. TAIR 10
Match:
AT4G01290.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; Has 1744 Blast hits to 1308 proteins in 219 species: Archae - 0; Bacteria - 241; Metazoa - 793; Fungi - 253; Plants - 108; Viruses - 0; Other Eukaryotes - 349 (source: NCBI BLink).)
HSP 1 Score: 473.8 bits (1218), Expect = 4.4e-133
Identity = 385/1093 (35.22%), Positives = 552/1093 (50.50%), Query Frame = 0
Query: 191 RKSKISYTRDFLLSLSELDICKKLPS---GFDQSIITEFEEASCDRQRISGGLSLNSSRR 250
+K +I+YTR FL+SLSE D+CKKLP+ FD++++ +FE+ S +R RISG S + RR
Sbjct: 24 KKPRITYTRKFLISLSEKDVCKKLPNLPGEFDEALLLDFEDPSPERARISGDFSSHGFRR 83
Query: 251 NEYGSSPPSRAETNNYSRRIHGKREVQSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQ 310
N+Y SSPP+R E SR HG+ E +S G +DKDSDSQSDRDS + G R G SRRS Q
Sbjct: 84 NDYSSSPPTRGELGTNSRGTHGRWEGRSGGWNDKDSDSQSDRDSGEPGRRSGMPSRRSWQ 143
Query: 311 GPEHDGLLGSGSFPRPSGYTTGFSAPKVRANDHYQLNKSNEPYHPPRPYKAVSHSRRETN 370
PEHDGLLG GSFP+PSG+ G SAP+ ++ND +QL+++NEPYHPPRPYKA +RR+
Sbjct: 144 APEHDGLLGKGSFPKPSGFGAGTSAPRPQSNDSHQLSRTNEPYHPPRPYKAPPFTRRDAR 203
Query: 371 DSYNHETFGSFEYTSEDRVEEEKKRRASFESMRKEQHRAFQESHKLNPLKQRDEFGILMQ 430
DS+N ETFGS + TSEDR EEE+KRRASFE +RKE +AFQE K NP ++++F
Sbjct: 204 DSFNDETFGSSDSTSEDRAEEERKRRASFELLRKEHQKAFQERQKSNPDLRKNDFDFTEL 263
Query: 431 LDESKDEKKLLNTSSGFDESTTLQASKNDREKSFPLQTTVSRPLVPPGFTSTVLEKNFGT 490
L ESKD+K + S + + T+ S N S P Q+ RPLVPPGF ST+LEK G
Sbjct: 264 LGESKDDKGRPSRSDEVNHAPTIPGSSN---TSLPSQSNAPRPLVPPGFASTILEKKQGE 323
Query: 491 RSSVNPHLLEGKDDVVDKCLQTKDEHLHNGIS-----EDLLEKNSSEQMGCPGQYGKTSI 550
+ E L +K ++ NG S + L K S +M G+ +
Sbjct: 324 KPQTETSQYERSP------LNSKGINVVNGTSVNNGGKPLGIKIGSSEMLIEGE----DV 383
Query: 551 NASTNNTSEKIIDLFSALDMSNKTTGIDVQSHENSLEVFEASENSAVAGCKTERRVLENT 610
S+ + +E+ +++ S L +S T D +S E + +E + G + T
Sbjct: 384 RVSSTDANERAVNISSLLGISTDTVNKD-KSFEKLSSISTPTE---IQGYPIKSEKATMT 443
Query: 611 AIGEPSQVHSS--SILEKLFGSAMKLD-GSATNFIEQHDNEMEDACSPQNA-QSSKFAHW 670
+ S HS SIL+K+F +A+ L+ G ++N +++ ++E+ SPQ +SSKFAH
Sbjct: 444 LGKKKSLEHSDGPSILDKIFNTAINLNSGDSSNMNKKNVEKVEEIRSPQTINKSSKFAHL 503
Query: 671 FVDNDRKQEEDL-SPKRSNDLLTLIVGGEKGGDVSDVKHSELCLPTVTFHGYESAE-NYI 730
F++ D K E L S + LL+L+ G +K D K + F G+ + + +
Sbjct: 504 FLEEDNKPVEVLPSSEPPRGLLSLLQGADK-LQTFDTKANPDLSTDFPFQGHATKRTDQL 563
Query: 731 TSSATSSNVAKPEPFYDKSKPEAVSAILTCEAVEQTLLSKISEN-DPALQPSDQRWRDSD 790
+S++T+ +V AV +LTCE +EQ++LS++ ++ P P DQ
Sbjct: 564 SSTSTTKSVT------------AVPPVLTCEDLEQSILSEVGDSYHPPPPPVDQ------ 623
Query: 791 DDIKHPTVKR--------DDHASQHLLSLLQKGTSPVIVGYCDDGANKK----------- 850
D P+VK DD ASQHLLSLLQ+ + P A ++
Sbjct: 624 -DTSVPSVKMTKQRKTSVDDQASQHLLSLLQRSSDPKSQDTQLLSATERRPPPPSMKTTT 683
Query: 851 -----EESTHNISNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGSSGSVKIDVSESQGP 910
+ +T ++PGK+LTLE LFGSAFM ELQS+G PVS + S + + + S+
Sbjct: 684 PPPSVKSTTAGEADPGKSLTLENLFGSAFMNELQSIGEPVSGRAMVSDAPGVPL-RSERS 743
Query: 911 ITDDGLLSNNEIRPSIMNHDHGDQRQQNQPDIVRGQWFNLNGPRPELDSSHPRAKLGHKI 970
I + L N+IRP
Sbjct: 744 IGE--LSQRNQIRP---------------------------------------------- 803
Query: 971 GGYDGPP--EIPLPEEDSLI-ISDSMKFQNLISIGNSTKPQPLFSHHTQDNNAAIFNPAF 1030
DGPP + LPE+ +L+ + +S S +P + + D AA+ N
Sbjct: 804 ---DGPPGGVLALPEDGNLLAVGGHANPSKYMSFPGSHNQEPEVAFNISDKLAAL-NSGP 863
Query: 1031 KDERPNMGGLEGLPFSASPYDRRETEMPHRKAPVHSSFSQLHPPQTNNVKLFHQFESHPP 1090
++ERP MGG +GL P H + +FH F+S
Sbjct: 864 RNERPTMGGQDGLFLHQHPQQYVTNPSSH---------------LNGSGPVFHPFDSQHA 923
Query: 1091 NMNSQGELLLPEGMI--HHDSPSNHQFVANML-RP-----PTTGLSGFDHSIHHPMMQQM 1150
++ Q + + P + HHD P NH+F NM+ RP PT+G FD H MMQ+M
Sbjct: 924 HVKPQLDFMGPGSTMSQHHDPPPNHRFPPNMIHRPPFHHTPTSGHPEFDRLPPH-MMQKM 983
Query: 1151 QTSVNLPPQHLLQGLSRGAPPPMTSRSVPLHPHSVRGSAAPPQPNNPVTGLVQELNSIQG 1210
NL HL+QG P P S P NN + GL+ ELN QG
Sbjct: 984 HMQDNLQHHHLMQGFPGSGPQPHHS----------------PHVNNQMPGLIPELNPSQG 990
Query: 1211 FHIGQRVPNIGGSRMPSPAPGIGGTGNQPDAIQRLIQMGHRSN-SKQIHPLSASG--HGQ 1231
F R PN G MP P + G P ++Q L+ + R + +KQI + +G + Q
Sbjct: 1044 FPFAHRQPNYG---MPPPGSQV-NRGEHPASLQTLLGIQQRMDPAKQIPAVGQAGGPNRQ 990
BLAST of Sgr016019 vs. TAIR 10
Match:
AT4G01290.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; Has 1797 Blast hits to 1352 proteins in 216 species: Archae - 0; Bacteria - 202; Metazoa - 850; Fungi - 267; Plants - 109; Viruses - 0; Other Eukaryotes - 369 (source: NCBI BLink). )
HSP 1 Score: 469.5 bits (1207), Expect = 8.3e-132
Identity = 385/1093 (35.22%), Positives = 552/1093 (50.50%), Query Frame = 0
Query: 191 RKSKISYTRDFLLSLSELDICKKLPS---GFDQSIITEFEEASCDRQRISGGLSLNSSRR 250
+K +I+YTR FL+SLSE D+CKKLP+ FD++++ +FE+ S +R RISG S + RR
Sbjct: 24 KKPRITYTRKFLISLSEKDVCKKLPNLPGEFDEALLLDFEDPSPERARISGDFSSHGFRR 83
Query: 251 NEYGSSPPSRAETNNYSRRIHGKREVQSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSLQ 310
N+Y SSPP+R E SR HG+ E +S G +DKDSDSQSDRDS + G R G SRRS Q
Sbjct: 84 NDYSSSPPTRGELGTNSRGTHGRWEGRSGGWNDKDSDSQSDRDS-EPGRRSGMPSRRSWQ 143
Query: 311 GPEHDGLLGSGSFPRPSGYTTGFSAPKVRANDHYQLNKSNEPYHPPRPYKAVSHSRRETN 370
PEHDGLLG GSFP+PSG+ G SAP+ ++ND +QL+++NEPYHPPRPYKA +RR+
Sbjct: 144 APEHDGLLGKGSFPKPSGFGAGTSAPRPQSNDSHQLSRTNEPYHPPRPYKAPPFTRRDAR 203
Query: 371 DSYNHETFGSFEYTSEDRVEEEKKRRASFESMRKEQHRAFQESHKLNPLKQRDEFGILMQ 430
DS+N ETFGS + TSEDR EEE+KRRASFE +RKE +AFQE K NP ++++F
Sbjct: 204 DSFNDETFGSSDSTSEDRAEEERKRRASFELLRKEHQKAFQERQKSNPDLRKNDFDFTEL 263
Query: 431 LDESKDEKKLLNTSSGFDESTTLQASKNDREKSFPLQTTVSRPLVPPGFTSTVLEKNFGT 490
L ESKD+K + S + + T+ S N S P Q+ RPLVPPGF ST+LEK G
Sbjct: 264 LGESKDDKGRPSRSDEVNHAPTIPGSSN---TSLPSQSNAPRPLVPPGFASTILEKKQGE 323
Query: 491 RSSVNPHLLEGKDDVVDKCLQTKDEHLHNGIS-----EDLLEKNSSEQMGCPGQYGKTSI 550
+ E L +K ++ NG S + L K S +M G+ +
Sbjct: 324 KPQTETSQYERSP------LNSKGINVVNGTSVNNGGKPLGIKIGSSEMLIEGE----DV 383
Query: 551 NASTNNTSEKIIDLFSALDMSNKTTGIDVQSHENSLEVFEASENSAVAGCKTERRVLENT 610
S+ + +E+ +++ S L +S T D +S E + +E + G + T
Sbjct: 384 RVSSTDANERAVNISSLLGISTDTVNKD-KSFEKLSSISTPTE---IQGYPIKSEKATMT 443
Query: 611 AIGEPSQVHSS--SILEKLFGSAMKLD-GSATNFIEQHDNEMEDACSPQNA-QSSKFAHW 670
+ S HS SIL+K+F +A+ L+ G ++N +++ ++E+ SPQ +SSKFAH
Sbjct: 444 LGKKKSLEHSDGPSILDKIFNTAINLNSGDSSNMNKKNVEKVEEIRSPQTINKSSKFAHL 503
Query: 671 FVDNDRKQEEDL-SPKRSNDLLTLIVGGEKGGDVSDVKHSELCLPTVTFHGYESAE-NYI 730
F++ D K E L S + LL+L+ G +K D K + F G+ + + +
Sbjct: 504 FLEEDNKPVEVLPSSEPPRGLLSLLQGADK-LQTFDTKANPDLSTDFPFQGHATKRTDQL 563
Query: 731 TSSATSSNVAKPEPFYDKSKPEAVSAILTCEAVEQTLLSKISEN-DPALQPSDQRWRDSD 790
+S++T+ +V AV +LTCE +EQ++LS++ ++ P P DQ
Sbjct: 564 SSTSTTKSVT------------AVPPVLTCEDLEQSILSEVGDSYHPPPPPVDQ------ 623
Query: 791 DDIKHPTVKR--------DDHASQHLLSLLQKGTSPVIVGYCDDGANKK----------- 850
D P+VK DD ASQHLLSLLQ+ + P A ++
Sbjct: 624 -DTSVPSVKMTKQRKTSVDDQASQHLLSLLQRSSDPKSQDTQLLSATERRPPPPSMKTTT 683
Query: 851 -----EESTHNISNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGSSGSVKIDVSESQGP 910
+ +T ++PGK+LTLE LFGSAFM ELQS+G PVS + S + + + S+
Sbjct: 684 PPPSVKSTTAGEADPGKSLTLENLFGSAFMNELQSIGEPVSGRAMVSDAPGVPL-RSERS 743
Query: 911 ITDDGLLSNNEIRPSIMNHDHGDQRQQNQPDIVRGQWFNLNGPRPELDSSHPRAKLGHKI 970
I + L N+IRP
Sbjct: 744 IGE--LSQRNQIRP---------------------------------------------- 803
Query: 971 GGYDGPP--EIPLPEEDSLI-ISDSMKFQNLISIGNSTKPQPLFSHHTQDNNAAIFNPAF 1030
DGPP + LPE+ +L+ + +S S +P + + D AA+ N
Sbjct: 804 ---DGPPGGVLALPEDGNLLAVGGHANPSKYMSFPGSHNQEPEVAFNISDKLAAL-NSGP 863
Query: 1031 KDERPNMGGLEGLPFSASPYDRRETEMPHRKAPVHSSFSQLHPPQTNNVKLFHQFESHPP 1090
++ERP MGG +GL P H + +FH F+S
Sbjct: 864 RNERPTMGGQDGLFLHQHPQQYVTNPSSH---------------LNGSGPVFHPFDSQHA 923
Query: 1091 NMNSQGELLLPEGMI--HHDSPSNHQFVANML-RP-----PTTGLSGFDHSIHHPMMQQM 1150
++ Q + + P + HHD P NH+F NM+ RP PT+G FD H MMQ+M
Sbjct: 924 HVKPQLDFMGPGSTMSQHHDPPPNHRFPPNMIHRPPFHHTPTSGHPEFDRLPPH-MMQKM 983
Query: 1151 QTSVNLPPQHLLQGLSRGAPPPMTSRSVPLHPHSVRGSAAPPQPNNPVTGLVQELNSIQG 1210
NL HL+QG P P S P NN + GL+ ELN QG
Sbjct: 984 HMQDNLQHHHLMQGFPGSGPQPHHS----------------PHVNNQMPGLIPELNPSQG 989
Query: 1211 FHIGQRVPNIGGSRMPSPAPGIGGTGNQPDAIQRLIQMGHRSN-SKQIHPLSASG--HGQ 1231
F R PN G MP P + G P ++Q L+ + R + +KQI + +G + Q
Sbjct: 1044 FPFAHRQPNYG---MPPPGSQV-NRGEHPASLQTLLGIQQRMDPAKQIPAVGQAGGPNRQ 989
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022147303.1 | 0.0e+00 | 87.33 | uncharacterized protein LOC111016288 isoform X1 [Momordica charantia]
XP_022147304.1 | 0.0e+00 | 87.24 | uncharacterized protein LOC111016288 isoform X2 [Momordica charantia]
XP_022147305.1 | 0.0e+00 | 85.71 | uncharacterized protein LOC111016288 isoform X3 [Momordica charantia]
XP_038882196.1 | 0.0e+00 | 82.94 | uncharacterized protein LOC120073406 isoform X3 [Benincasa hispida]
XP_038882201.1 | 0.0e+00 | 82.84 | uncharacterized protein LOC120073406 isoform X4 [Benincasa hispida]
Match Name | E-value | Identity (%) | Description
A0A6J1D0Y7 | 0.0e+00 | 87.33 | uncharacterized protein LOC111016288 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1D0L9 | 0.0e+00 | 87.24 | uncharacterized protein LOC111016288 isoform X2 OS=Momordica charantia OX=3673 G...
A0A6J1CZT1 | 0.0e+00 | 85.71 | uncharacterized protein LOC111016288 isoform X3 OS=Momordica charantia OX=3673 G...
A0A0A0L649 | 0.0e+00 | 81.03 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G121650 PE=4 SV=1
A0A6J1F449 | 0.0e+00 | 81.29 | uncharacterized protein LOC111442216 isoform X1 OS=Cucurbita moschata OX=3662 GN...
Match Name | E-value | Identity (%) | Description
AT4G01290.1 | 4.4e-133 | 35.22 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G01290.2 | 8.3e-132 | 35.22 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...