Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCAAATGGGGGTCCGCTTAGGATCCGGTCACGTGGTCGCTTTCTAACCGATCAAGCACTTCCCGCTCGCATATCCTTTCAAATCCATATCGCCGAACCGAGGGAGCGAAATCGAAGATCCAAACCCAGCCGTGAGAAACCTCCTTCCTCAATCTCATTCACCGTCATTTCTAGTTCCTTCGGTCGTAGTGATTGACGTCGCAAGCATTATGGAGTCCGATATCTCTCTCATTGAAGTCGCCGGAGAAGATGATTCTCTACTGCATCCGATTCCAGAAAACGAGCTTTTAAACCTCGACAAGGATACGGACGGCGTAACAGCAGGAAACAGCGGCTTCTTCTTGTGCTCGCCTCTCTTAACTGACAGATCCAGTGGCACCGTTGCTGGTTCTTCTGCTGCTTCTTCTACAGGTCAGGTTGATTATGTACATTTTCTTTGTTTTTAACCTTCGGCCTAAGGTGCTGCACTTTCATCTGTGTTTCAGATATCGTTTGTAGGTGTAGGTACTTCTTCTTGTCTAATTTGAGTTCCTAATCAGTATTTTTAAAAATTATCTTCCAATTTGATGAATATGTATATTTAGATTGATGTTGAAGCACCGGAGAAGTTGTTTTTGTGATCTATTGAGAGTTTTACTTTCTCTGATTAAAACCTGCATTTCAGGCGGTTGCGATCTCTCTTTCTAAGAGTCAAACTAATTACGTTATTAATATTTTCATTCGTTTTAAATTAGGATGACTTAAATTATAAATATGTTTTACTATTGTTCAGTAGCTTACGATTAAATGACCGGTCCTTTCTTCTGTACAGTTATATCGATAAAGAAAATGTTAATGCGAACAATTTAGAAGTGCCAAAGCTTAGTATTATGCCGCAACAAATGAAGAGGAAGAAGAAGGGAGGAGGATATAATTTGCGGAAAAGCTTAGCATGGAACAAAGCTTTCTTCACCGAAGAAGGTTTCTACATGTGGTTGCTAGATTTATTTATTTCTCATTTTTTTCTGCAATTCAAGCTGTAAAACATTGTTTGTATCATATTCACTCTTTCGTCAGGTGTTTTGGATTCCGTGGAGCTATCAATGATCACTGGTAGGACTAGTACATCTTGCGGTGAAGCATTGGCAGCAATCGACGAAGAATTACCGGCGATGCCTGGTGGTGGCTGTATTGATGATATATCGTCGCATGATAAATTTTTCAAGGATACGTCAACGAGTACTCCCGGTGGCGATAGAAAGAATGGCCGTTACTTGTTGCCAAAGCGTGGTTCATCAACTAAAGATAATGTATCCCAGCTTCTGCAGTCAGTATATGTCCAAGTTTAATTGGAAATCGTTATCTTTGTGGTTATACTACTATGGATATGCTCCACAGCTTTGGATATAGAAAGATTAAAAGATTTTTGATATCTTTTTTTTTAGGCCGACTATTACAAACTTGTCTTTTGGTGTTTAAACAATATTTCAGGTCAAGTCGAAGGAGCTATCAACCAAAGATGTCAATCGGAGTGGATCAAAACGTGGAAGCTGTCCACGACCTGTGGCCTCATCGTCATATCCTTTTAGTTGTTTCTTAAGCTTTGTTTTAGTTTTGCAGCGTTATCGAGTTTGCTCCAGCTGGTCCACCCTGAACTCACTTATAGTAATAGATTAAGTTAGATTCCTTAACCGCTTTTATACTGCCAAGAGGCCTACAACCTCGAATGGAACAAAAACGGTGATCAAAGATGAGAGAATTTCCAGAATTCCAGTTTCAAAACGTGATCCTACTGTTACCTCTAGGACTACAAGGAGTACTACTATACGTGCAAGTGACGCAAAGAGTAATCAGGTTGCTCAAAGAGGAGGTAATCATTGTAGAATCGCACAATATTTCTTTCTTTTAGTCATCGTAGAAAATATTTGTCTTACGAACTTTAGGCTATCTAAATCGATATTGTTACATTCAAATCATAATCTATTTTGTTTTCGATGACTATTACTTCAATTTATTATTCTGTCTTAAAACAAATTTTGCAATCGTGAAATGATACTTTGTTCTGATATGTTGAATTCTCTAATACAGTTGGAAACAATCCAAAACAGACTACGTTAAAGAGCTCGTCCATCAATGCTAAAAGTGCATTGAATAAGGATGTTAATGCAAGCAAAACTTTGAAGGCTAAAACCTCAATTCAACAACCGAGAAGAAAATTGGTGAGTACTATAAGCTTTGATGTGGGGGTCTTTATCATCTATGTATTAAAATTGGTAGATGAGAAGATGGTTAAAAGTTCTAACAGACATCGAGGACTCGTACTAATTGGGAAGCTCGTACGTTGTACTGCTTATTTCTTATTTACAGGCCAGCCCAGTATTAAAAATGAATTCCTCTCATTTACATAACGAGTCTACTGATTCAAATGAGGGGTTAAAGGCGGCCACAAATTCATTGATTTCTAAAGTTCTTCCTTTGAATGATGATGGCACCAGAAAAGTTTCTGCTTCCGTTACTCAAAATGCTCTGCCTGATGGTGGCAGAATGCTTAATCAAACCCAAATGCCAAAGCCGTCAGGTTTGCGAATGCCATCACCGTCAATGGGATTCTTTGGTCAGGTATAATCATCAGGCTTCATTAAGTGCTATTAGATATATTCTAGATAAATGTTTTTTTCTTCAATTTTTTTGTATTTCTTATTTTCTGTTTATTTGTGAATTCTGTCCCAGAAAAAGGTTTCTTCATTCCAAAGCCTGCCACCGGCTACTTCTGAAATGCACAGCCTTTCCAAATCTAACCTTCCCAATTTGAGAACAGCTGGTCCTTCAAACCCTATTTGTCAGTTGGCAACAACACTTGTGCCTAGGAACTTTGTGAAAGCAAGCAGCCGTGAGGCTTCTGGGGAAACTAACGTAGTCTCATGCTTAAGTTCGGGTAGTTCAGTAGAACCTATTTCCCATGATAGAGCGAAGTCCGTCTTGAAAGCAGCTGAAATCCATTTAGGGAAAGTGAATGTTGGCGGCGCTTCTACAACGAACAAGGATTTGAGCGCCCATGGACTGGAGAAGCCCGATGTTTTGTCCCTTTCTAGTCGTGGAGATGTTTCTAGATACATGATGAGATTAACGACCAATTTGCAGAATGCAAACACCGTCATGTGTCATTTGACAATTTTGCTGACTCTACCAAATCATATCTTGGTGAAACGAACGACTTGGGCTCACAAGGTATGCCAAGAGCTCTTAATGATCAGTTAAATGGTGCACAAGATTGCAATGAACAATCTCCGATGCAAGTTGAGCTTCCAAACTCTTCTAATTGTATGCTTGAACGAATTACTCCTGATCATAAGAGGTTAGGGGTCGGCACTTGTAATTCACTCAAGAGAAGTAGAACTTCAAAAGATTTTGAACATGGTAGATTTGAAGAAAATGTTGGTAATGATTCTAACGGTCAGGAGTGCTATGCATTTGATAAAGACATGAATGAGGGACAGGATGAAGCCTCGGAAACTCATAAAATGAGACTATTAGGGACAGAAGAACCAGAAGGATCCGACGTCGATCACTGTATTTCAAATGAATGTAACAAGACTATGCAAATCACTTCTGCGTTGCTAATTCAGACTCAATGCATATTGATGATGAAAAGCCAACGGTTCCAATGTCGAATAACACATCGGGCTTTTCTCTAACCTCCCAGCATGATTATGCTTCCAAGAATGATATTTTGGCAGAAGGAAGCAATGGAGTCTGTGAAAATATATTGAAAAATGAGATGATCTTTCTTTGATTCCAAATTCAACAGCAGATGCTTGCTTAGACAGTGGCCCAGTTGTCAGAAATTGCAATAATCATACAGATAAGATGGTAGACATCAGATCTGATATTGAGCAGAATGACACTTCCTTGGAAACGGAAAGGAATCAGGATGGTCATGGTGATGTAGCAATTGCCCACAACGCTGACGCTGCCGAAACTTTTCTAATAACCCGAGACTTGTGTTCAAGTGATACAGAAAATCAACTGTATGAAGCGCAAATGTGTATTGGATCAGAGCATGTACAAAACGAGGACAAACAAAATTCTCATGTTCCATCGTCGGTTAATGAATTTAATCGACTGCCTGAATTTGCTGAACTTCAAAATTGTTGCATCGACCTGGTAGAGGTTTCCCAGAACAACAACCAAGGGAAGTGCTCAAATGATGACTTGTTGCACACAAGCAATTCTGAAGGGTACAAAGAAGAAACTATCACTGATAGTCTCATTGCTAGTTCTGATGTATGTACATCAGAATGTCTTAGTAATCATAATCAGATCGCATTACCCAAGGATTATAGCAGTCTACATTTGGAAACACACGACACGAGGACGGGTGAAAACATTTTAGTGTCTCAGGAGATTGGGAATTTCCAAGTGTTCAACTGCTATATCATCCGAGTGTGAAAAAGTACGAAGTGGTGAAGGTGCATATGAAACAATGAGTATAGAAATAACTTCTGAAGCAAGAACGACTTGCAATGATCTGGCATTTTGTTGGTTGACTAAAGAGCTGGATTCATCAGTACCCAATGATCACATATTGTCTAGGGAAAATGTTCAACAGTGTATGGAAACTAAAGAGCTGGAGAATCATAAGTCTCCACAGATGAATGGAAACATACTGTGTCAAAATGAAAACGAGCTCAACAGTGAATGGAGCATTTACCAGGCACTGAAACGTGCAGTACATACGACGCCGGTGCACAATCATTGCAGCTGTAAGTGCCTTGGTTTTGTGAATGAACTTTTTGCTTCCTTGCCTTTTCTTTCTTTTATGCTTGAATGAAACTCTCCATGCTATTGGCAGAAGAAAGTCCGAGGATGTCAGAAGCAAAATGCTCTTGGAATTAAAACTTCAACAAATGCTGTTCCGTTTTCTGAAGAATGGTTGGCTGCAATTGAAGCTGCTGGAGAGGTGATATTACCTTAAATTCTATTACTCTTTTTTTAAACTCAGATGTTTTAAAAGTTCAACATTGTCGTATTGGTATATTATATCTATATATTGTCCTTTCATGAATGTTGAAACTGATGAATAACGGCCAAGGTTGGGGACATCGGCATATATGATATATGGTAACGCATGCAAAATTTACACGAGATACCATAAAAAGTAAAACTTGAAAACAAATCATTTGAAGCCAAATGTGAAGTATGGTAGAGCCCGAAATATTTGTCTTTTCTTATTTTCTTTCCAAAATTAATTTTGAACTTTATAATTATTAATGTATAGGTCACTAAAAGAATGTCAATTTCTAATGTTCCAGTACTTGGGTAGTTTTTACTTTTCCTTATTCTTCTTGATTTTAAGTTTTGCGTTTATTGCATATGGTAGCTACCACTTTTAACTGTTTCCCTTTTTCCTTTGCAAAGAAATTATTAGACTTTCGTTTTTACTTTCTACTTGGAATACTTGAGTACTCTCATATATGGTTCCTGCCTCCCGGCTGGATTTGACGAATTTGTCCTTTTATTTTTCCTTAGGCTACTGATTATTGCACTCGCATGAGTCCAATTTCTTTTCGCTTGCTATGTATGTGCAATTCTCGTTTCTTACCTCGGACTCTTGGGGTTTGATCGAGGGTACCTAACTTAAGTCGTTGGAGCTACGAATAATTTGTTACTGTTTGTGTGATAATCAGGAAATTCTGACCATGAAAACCGGCGCTGTACAAAACTCACCTCCCGACAAGTCTCAACCCGAACCAGGTCCATGGTCCCCGGTATGCTCTCTCTCTCTCTCTCTCTCTCTATCTCATAACATTGTGGTTAGGCACCCGTGCCTGTGACCGTGCCTTACACTCCTCTTCCCAAACATGGGAGAAAATTCGGTTTCAATGTCTGGATATTAAGATTTACTGAATTTATTTTTTAAAAAATGAAATTTTGTTTTTTACAGTAAAAATGAACTTTCCAAATCATATTTGTGAATCAAGAATAAGAATGCAGGAAGAAAACTATTTCGTTGTTCTCTTTCCAATGTTTCATGTATTGAAGAAATGATTTGGAGTTATATGGCTGTTATTTAAAATATATATATATATAATTTTTGTTTTTATAATGTTTGGATAATTAGTAAAAGTAAAGTCAATTTGCATTTGAACCATTATTTATAATCTCGTAATTTAGTCAGGAAAAATCGTTTCAATTATTTTAAAAATAAGTCAACAGTAAAAACAATTTATGAGAAAACAATATAACTCAACTTCAAAAAAGTGATTATTTAAGTGTTGGAAAATGTAAGGTTGATTTTTTTATTTTGCAAGTCTGACTTTCTTTTTTGTTTTTAAACCTATTTTCTTTCTCTATGTTATTGGGAATCTAGTAACTTCTTGGCTTTTATTTTGAAGGTTTTTAAAATCAATAAAAACAAATATACTTCAATTAGGATTTGGAAATAATTCTAGGAAAATAAATTAGATTAAACTATCTTTTACCTTAAATTTTCGTATTTATTTTATTTGAGTCATCATATTTTTAAAATTAGGTCAAATTTTTTATTAATATTAGTTTTGGTCCCAACATTAATTTTTGTACTGTTAGCATATATTAGTACAAAATTCAATAGAGAGATTTTGAAATGAAATTAACATAAAAATCCAATAACTAAAATAAAATCTAAACAAAAAAAATGTTGGGAAAATTAATTTTTGAATGTTTCTTACATGAAATCACCTTTAAAAAAATCATCAAAAGCGTGCATTGTAGTGGCCAATTTTATGGATATGATATTGAACACTCATGAGTTTCGGTTTTATATTATTTTATGTTAATTGACAGGTGAAACGGAAAACCAATCAAGGGATCGGACCATTTGATTGTACAAAATGCATCAACACGAGCCAACCCCATGAATCCTGTTAACCAGACGATTTCCTTTCATTCTTTGCCACTAACGGTCTGACTTTTTGTTCATTTTTATCAGAAAAATGTACACGAGGCCTAGATGCGTAGGTTAGATTAGACCAAAATGCCCTGGTCCATTCTTCATGTTCAACTGTCTATTAAAGCCATTAAACAAGGAAATTCTTTGTTTAGAATTCAATCAAATATTAGTTATGACAATTAATAGTTTTCAGTTTAAACAATTTAACTCATTTAGCATTAGAATTTCTCTATTCTAAAAAAAAAAAAAAGCATTAAAATTTTTCTAGCAATTACAGCCAACTCTAAAAAAAACCCTCCAAAAATAGGAATTCTAAAATTAAAAAAAAAAAAAAAAAAAAAAAAAAAACTCCCAAAATTAGTCTTTGCAACTTAACCGTGTAAGGAGATTAGTTTGATTCAAGTAAGTTTTAGGCTAATACCAACATCAAGCTAACATGTCGGTGTAAGAAAAAAAAAATTGATGCCAAAACCAAACATCTTGACAATTGAACCAATCAAAGCTGAAAAAATTTACCCAAACCAAACTAGCTAACTATTTTTCAATTTTATGCATTTTTTTTTTTCAATTTTATACTTTTAGAAACTTGAACCAACAATTGATTCAATTAGTTTCATCAATTTTTTCAATTTTTGGATCTAAGCCTGCACACCTAAACCTACCTATCCTACACAAAATAGAATCAAATCTTCATAGGTATAACATAAAATATAGTTATGGGTTTCTTTTATATATATATATATGTATGTATGTATAGTTATAGGTTTCTACGTCATCCTTCAAGGTCTCCTAATATTAATTGTCGACTATTAGCAGGTAGATAGGCCCCATGTCCGTCCAGGAAAAATAAAGAGAAAAAAAGGTCCCGTGTCTTAATAATTAATTTGATAAACTAAGTTCAATTCTTGTATTATATTGTGTTGGGTAAAATGTTCAAATATACCTTATATTTTCTATACAAAAAAGGGTATGATTAATTATAATTTCATTTGAATCATTCAAGTTTTTCTATGTCTTAAGGTTTGGACAATTTAGATGGATCCTAGATGACTAAAAGATTGCACTTTTTTTACCTATCTTTGGCCAATACCACAAACATCTCATAGAAAATTGATCATCGAAATATTGCACTACTTTGGCAACGCCTTTCCTAAAAATAGTTAATCTAGCTAGTATTATGTGCAAATTCAATGCGTTCTCATAACCAATCCCTCTAACAAAAGATAAACATTATGCATTATTTCTTATCTTATATTTATGTTTAATAAAAGGTTAAATTATTTAATGCACAAAAGGGAAAAAAAATTACCAAATGAGTCTAATTACGAGTTAAAATCATAAATAAAGTACATACTTGTAATAATTCAACTAAAGAAATCTAACTAAACTTACTAATAACAACTCAACTAAAAAAATTATGTAGATTTGAAGTTTATAAATAAAGTACCTACTCGTGATGACTCAACTAAAAAAATCTAACTACACCCACTAATAACAACTCATCTAGAAAAATGCAATAAATTCAAAGTTTATAAATAAAGTACCTACTCGTAATGACTCAATTAAAGAAGTCTAACTAAACTCACTAATAATAACTCGATTAGAAAAATTCAGTTGATTCAAAGTTTCTATTCAAGGTTCAAATCTTTGACTTCATATATACTATAGTTAAATTATATATATACATAAAAAATTTAAACAATGATTTCCATCCATACATAGTTCATATTCCCTCCACCTTACTATATCCTCATAAAAAGTAAATAAATGATATTAAAATTCAACAATCGAAATATAGTTAACTTATGCTTATATCTCCTTCATAGCGTTAGCTAAGGGAAGGAAGCCACAACTCTTATAAACAACTTTTCTATAAATAATAACAACAACAATAATAAATCTCTTATGAAACAACTACATAATGGCCCCCACCTCGCATGTAAAACAGTGTCCTATTAGCGGCTTGTGTGAGGAACGATGCTTCAGTAACTAATTCAAAGGCTTTTGCAGACCTAAACTTCTAAAAGGGCGGGCTAATTGGTTGCAACTTGCAACGTGGGAAACGCTACCTAACTGCCAGCACCTCCTCCCCACTAAATAAAACAGTGATAAGATTAAATTTTACGAATTCAGTCCTAAACTTTAAAAATTATCTAATAAGTTCTTAAATTTTAATTTTGTGTCTAATAAATTTTTTAATTTAAATTTTTAAAAATATCTAATAAATCCTTAAATTTTAATTTTATGTCCAATAAATTTTTTAATTTTTAAAAATATCTAATAAGTCTTTTAACTTCTAATTTTGTGTCTAATAAGTACCTAAATTTTTAATTTTGTGTCTAATAGATTCCTAACTTATTCAATATTTTTAAAAATTTATAAACTTATTAGTCACAAGGTTGAATTTTGTGTCTAATAAAATCCTAACTTTTCAATTTAGTGTTTAATACGTCTATAAATTTTAAAAAATGTCTAATAAGTGAAATATATATTTGACACAAAATTAAAAATTCAAGAAATTATTAGCTACAAAATTGAAAGTTTAAAGATTCATTAGACACAAAATTAAAACTTTATGAACCTATTAAATACTTTTAAAGTTCAGAACCTATTAAACACAAACTTAAAAGTTTCAAAACTAAACTTCTAATTAAATCTAGTGATAATTAATTATTTTAAAGTTACTAATAATTAGCAAACCCCCAAAGTCCTCCGTGTTAACCACCCGCAACTTCATTCCGGCCACTGAGCATAACAATTTGGACCGATTTAAAAAATGAACTTTTTTCAAAAAAATTCTAATAAAAAAAAATTACTGAAGTAACAAACATTATTTTGTAGGTTTGATAATTATTTAAATCTTTTTGATAATATACTTTTAGTATTTTTTAAATAACATTCAATAATTCAAACGTAATATGCTAACATGAGCGTAGTTACAGCATCTCTACTTATTTTAAGAAGTCATAATATATGTTACATCTATTTAATTGTTTTTGTAATTTTCACTATAAAAAATGTCCATTTACCTAGTAATTTTGAAAATCCTATCAGTGAAATAAGATGCTTTTAGTCATTTTATTTTTTATAATTCAATAATTATTAAATGTCATATTTCTAAAAAGTTATAATAGTTGCTATTGTTTAATATTATAGCATTAAAATATGTGGTTTACTGTTCATCTAGTTATATTTGCAAGGGAGTAGTAACTTGAAGGGTACACGAAAAATAATATTAAAAAAGTAAGGGTGTGTACCTCTTTCTTTCTCTTTCCTCTGATGTTCAGGTTGATGAATGAGTGAAAAATTAGTTCCTTTCATGTATCATGCAGCAGGAAAGAATCAGAAAGAAGGCAGAATTTCACAATAATGCATTTACGTATAAAATGGAAGTAACACTTTTCCGAGTTCGAGTAAAAATTTTCCAAAGTCGTGATTTCGCGTGATTAATTGAGCAAATATACATAGAAAATGCAGACATGATAGATAGTAGAGTACTTTTAAAATTATTGAATATAAAAATGTTTAGAGGAGAAAAATGGGCTACTATCAGCGAATTTGTCTTAGAATTTGAAATATTGTACATATGAAGGCTTGGCTTCGGATGAGGTTCTTTGATTTGCTCTCAAGTTTCAGTTCTCACCATAAGGTTCAAAGATCGAACTCGAAAAATAGTTGAAGAAGAAGTGCATCAAGAATTTGCCATTAACTTCCTTCGTAAGTAAGGAGAGTAAAAATGAAATGGTGGAAAAAGCCTTCGATAAAACTCGGACAAAAAATATGAAGAAACAATTCATGCAAACAAATGAAAAACTGCTCGATATGGTATGGTATTAAAGGGAACAAAAAGAGCGAAGGAAAATTTCGCTCGCATAGACGACCGGCAAATTCACCAGAAAAAAAACAGTCAATGCAAACAAAACGAAATTTCAGCCCGCAGACGTCGTGTGGATTAACAAAATGAGCAAAAACAAAGCAAAATTCACCGGGAGAGGGACGATTAAACCAAATTACTTTTGCGCATGCTGTTGAATTTGAGAACAAAAGAAAATGAACCGATAGTGCAGGTAATGAGAGTAGAAAAGCTGAACAAAAACAAGTACTAGTTTAATTTAATAGGAGATTCGGAACTCACATAATTCACAATCAGGATGCCGCGAGCCAAACTCGATCTCATCAGAAGTTTGTTGGATCTTCTAAGCCGCGACTATGCCGGTCTCCGAATCAGAAACAACAAAAAATCAAAGTGCAAGAAGTCAAAAATCAAACCAAACGAAACGACTTCTCATTTCAGTTCAAATGCATTATAGACGGCCAGAAAAGCGCACAAATCAAACGATTCGAACATAAAAAATTACGCACCAGGAGATATCGCCTAAATCGGAGGGGAAATTTGAGGATAGTCATCGCCTCGAAGAGCTCTGCGAGAATATCGAAAGTTGAACTTTAA
mRNA sequence
ATGTCAAATGGGGGTCCGCTTAGGATCCGGTCACGTGGTCGCTTTCTAACCGATCAAGCACTTCCCGCTCGCATATCCTTTCAAATCCATATCGCCGAACCGAGGGAGCGAAATCGAAGATCCAAACCCAGCCTTCCTTCGGTCGTAGTGATTGACGTCGCAAGCATTATGGAGTCCGATATCTCTCTCATTGAAGTCGCCGGAGAAGATGATTCTCTACTGCATCCGATTCCAGAAAACGAGCTTTTAAACCTCGACAAGGATACGGACGGCGTAACAGCAGGAAACAGCGGCTTCTTCTTGTGCTCGCCTCTCTTAACTGACAGATCCAGTGGCACCGTTGCTGGTTCTTCTGCTGCTTCTTCTACAGATATCGTTTGTAGGTGTAGTTATATCGATAAAGAAAATGTTAATGCGAACAATTTAGAAGTGCCAAAGCTTAGTATTATGCCGCAACAAATGAAGAGGAAGAAGAAGGGAGGAGGATATAATTTGCGGAAAAGCTTAGCATGGAACAAAGCTTTCTTCACCGAAGAAGGTGTTTTGGATTCCGTGGAGCTATCAATGATCACTGGTAGGACTAGTACATCTTGCGGTGAAGCATTGGCAGCAATCGACGAAGAATTACCGGCGATGCCTGGTGGTGGCTGTATTGATGATATATCGTCGCATGATAAATTTTTCAAGGATACGTCAACGAGTACTCCCGGTGGCGATAGAAAGAATGGCCGTTACTTGTTGCCAAAGCGTGGTTCATCAACTAAAGATAATGTCAAGTCGAAGGAGCTATCAACCAAAGATGTCAATCGGAGTGGATCAAAACGTGGAAGCTGTCCACGACCTGTGGCCTCATCGCCTACAACCTCGAATGGAACAAAAACGGTGATCAAAGATGAGAGAATTTCCAGAATTCCAGTTTCAAAACGTGATCCTACTGTTACCTCTAGGACTACAAGGAGTACTACTATACGTGCAAGTGACGCAAAGAGTAATCAGGTTGCTCAAAGAGGAGTTGGAAACAATCCAAAACAGACTACGTTAAAGAGCTCGTCCATCAATGCTAAAAGTGCATTGAATAAGGATGTTAATGCAAGCAAAACTTTGAAGGCTAAAACCTCAATTCAACAACCGAGAAGAAAATTGGCCAGCCCAGTATTAAAAATGAATTCCTCTCATTTACATAACGAGTCTACTGATTCAAATGAGGGGTTAAAGGCGGCCACAAATTCATTGATTTCTAAAGTTCTTCCTTTGAATGATGATGGCACCAGAAAAGTTTCTGCTTCCGTTACTCAAAATGCTCTGCCTGATGGTGGCAGAATGCTTAATCAAACCCAAATGCCAAAGCCGTCAGGTTTGCGAATGCCATCACCGTCAATGGGATTCTTTGGTCAGAAAAAGGTTTCTTCATTCCAAAGCCTGCCACCGGCTACTTCTGAAATGCACAGCCTTTCCAAATCTAACCTTCCCAATTTGAGAACAGCTGGTCCTTCAAACCCTATTTGTCAGTTGGCAACAACACTTGTGCCTAGGAACTTTGTGAAAGCAAGCAGCCGTGAGGCTTCTGGGGAAACTAACGTAGTCTCATGCTTAAGTTCGGGTAGTTCAGTAGAACCTATTTCCCATGATAGAGCGAAGTCCGTCTTGAAAGCAGCTGAAATCCATTTAGGGAAAGTGAATGTTGGCGGCGCTTCTACAACGAACAAGGATTTGAGCGCCCATGGACTGGAGAAGCCCGATGTTTTGTCCCTTTCTAGTCGTGGAGATGTTTCTAGATACATGATGAGATTAACGACCAATTTGCAGAATGCAAACACCGTCATGTGTCATTTGACAATTTTGCTGACTCTACCAAATCATATCTTGGTGAAACGAACGACTTGGGCTCACAAGAGGTTAGGGGTCGGCACTTGTAATTCACTCAAGAGAAGTAGAACTTCAAAAGATTTTGAACATGGTAGATTTGAAGAAAATGTTGGTAATGATTCTAACGGTCAGGAGTGCTATGCATTTGATAAAGACATGAATGAGGGACAGGATGAAGCCTCGGAAACTCATAAAATGAGACTATTAGGGACAGAAGAACCAGAAGGATCCGACGTCGATCACTGTATTTCAAATGAATACTCAATGCATATTGATGATGAAAAGCCAACGGTTCCAATGTCGAATAACACATCGGGCTTTTCTCTAACCTCCCAGCATGATTATGCTTCCAAGAATGATATTTTGGCAGAAGGAAGCAATGGAGTCTCAGATGCTTGCTTAGACAGTGGCCCAGTTGTCAGAAATTGCAATAATCATACAGATAAGATGGTAGACATCAGATCTGATATTGAGCAGAATGACACTTCCTTGGAAACGGAAAGGAATCAGGATGGTCATGGTGATGTAGCAATTGCCCACAACGCTGACGCTGCCGAAACTTTTCTAATAACCCGAGACTTGTGTTCAAGTGATACAGAAAATCAACTGTATGAAGCGCAAATGTGTATTGGATCAGAGCATGTACAAAACGAGGACAAACAAAATTCTCATGTTCCATCGTCGGTTAATGAATTTAATCGACTGCCTGAATTTGCTGAACTTCAAAATTGTTGCATCGACCTGGTAGAGGTTTCCCAGAACAACAACCAAGGGAAGTGCTCAAATGATGACTTGTTGCACACAAGCAATTCTGAAGGGTACAAAGAAGAAACTATCACTGATAGTCTCATTGCTAGTTCTGATTGTCTCAGGAGATTGGGAATTTCCAAGTGTTCAACTGCTATATCATCCGAGTGTGAAAAAGTACGAAGTGGTGAAGGTGCATATGAAACAATGAGTATAGAAATAACTTCTGAAGCAAGAACGACTTGCAATGATCTGGCATTTTGTTGGTTGACTAAAGAGCTGGATTCATCAGTACCCAATGATCACATATTGTCTAGGGAAAATGTTCAACAGTGTATGGAAACTAAAGAGCTGGAGAATCATAAGTCTCCACAGATGAATGGAAACATACTGTGTCAAAATGAAAACGAGCTCAACAGTGAATGGAGCATTTACCAGGCACTGAAACGTGCAAAGAAAGTCCGAGGATGTCAGAAGCAAAATGCTCTTGGAATTAAAACTTCAACAAATGCTGTTCCGTTTTCTGAAGAATGGTTGGCTGCAATTGAAGCTGCTGGAGAGGAAATTCTGACCATGAAAACCGGCGCTGTACAAAACTCACCTCCCGACAAGTCTCAACCCGAACCAGGTCCATGGTCCCCGGATGCCGCGAGCCAAACTCGATCTCATCAGAAGTTTGTTGGATCTTCTAAGCCGCGACTATGCCGGTCTCCGAATCAGAAACAACAAAAAATCAAAGTGCAAGAAGTCAAAAATCAAACCAAACGAAACGACTTCTCATTTCAGTTCAAATGCATTATAGACGGCCAGAAAAGCGCACAAATCAAACGATTCGAACATAAAAAATTACGCACCAGGAGATATCGCCTAAATCGGAGGGGAAATTTGAGGATAGTCATCGCCTCGAAGAGCTCTGCGAGAATATCGAAAGTTGAACTTTAA
Coding sequence (CDS)
ATGTCAAATGGGGGTCCGCTTAGGATCCGGTCACGTGGTCGCTTTCTAACCGATCAAGCACTTCCCGCTCGCATATCCTTTCAAATCCATATCGCCGAACCGAGGGAGCGAAATCGAAGATCCAAACCCAGCCTTCCTTCGGTCGTAGTGATTGACGTCGCAAGCATTATGGAGTCCGATATCTCTCTCATTGAAGTCGCCGGAGAAGATGATTCTCTACTGCATCCGATTCCAGAAAACGAGCTTTTAAACCTCGACAAGGATACGGACGGCGTAACAGCAGGAAACAGCGGCTTCTTCTTGTGCTCGCCTCTCTTAACTGACAGATCCAGTGGCACCGTTGCTGGTTCTTCTGCTGCTTCTTCTACAGATATCGTTTGTAGGTGTAGTTATATCGATAAAGAAAATGTTAATGCGAACAATTTAGAAGTGCCAAAGCTTAGTATTATGCCGCAACAAATGAAGAGGAAGAAGAAGGGAGGAGGATATAATTTGCGGAAAAGCTTAGCATGGAACAAAGCTTTCTTCACCGAAGAAGGTGTTTTGGATTCCGTGGAGCTATCAATGATCACTGGTAGGACTAGTACATCTTGCGGTGAAGCATTGGCAGCAATCGACGAAGAATTACCGGCGATGCCTGGTGGTGGCTGTATTGATGATATATCGTCGCATGATAAATTTTTCAAGGATACGTCAACGAGTACTCCCGGTGGCGATAGAAAGAATGGCCGTTACTTGTTGCCAAAGCGTGGTTCATCAACTAAAGATAATGTCAAGTCGAAGGAGCTATCAACCAAAGATGTCAATCGGAGTGGATCAAAACGTGGAAGCTGTCCACGACCTGTGGCCTCATCGCCTACAACCTCGAATGGAACAAAAACGGTGATCAAAGATGAGAGAATTTCCAGAATTCCAGTTTCAAAACGTGATCCTACTGTTACCTCTAGGACTACAAGGAGTACTACTATACGTGCAAGTGACGCAAAGAGTAATCAGGTTGCTCAAAGAGGAGTTGGAAACAATCCAAAACAGACTACGTTAAAGAGCTCGTCCATCAATGCTAAAAGTGCATTGAATAAGGATGTTAATGCAAGCAAAACTTTGAAGGCTAAAACCTCAATTCAACAACCGAGAAGAAAATTGGCCAGCCCAGTATTAAAAATGAATTCCTCTCATTTACATAACGAGTCTACTGATTCAAATGAGGGGTTAAAGGCGGCCACAAATTCATTGATTTCTAAAGTTCTTCCTTTGAATGATGATGGCACCAGAAAAGTTTCTGCTTCCGTTACTCAAAATGCTCTGCCTGATGGTGGCAGAATGCTTAATCAAACCCAAATGCCAAAGCCGTCAGGTTTGCGAATGCCATCACCGTCAATGGGATTCTTTGGTCAGAAAAAGGTTTCTTCATTCCAAAGCCTGCCACCGGCTACTTCTGAAATGCACAGCCTTTCCAAATCTAACCTTCCCAATTTGAGAACAGCTGGTCCTTCAAACCCTATTTGTCAGTTGGCAACAACACTTGTGCCTAGGAACTTTGTGAAAGCAAGCAGCCGTGAGGCTTCTGGGGAAACTAACGTAGTCTCATGCTTAAGTTCGGGTAGTTCAGTAGAACCTATTTCCCATGATAGAGCGAAGTCCGTCTTGAAAGCAGCTGAAATCCATTTAGGGAAAGTGAATGTTGGCGGCGCTTCTACAACGAACAAGGATTTGAGCGCCCATGGACTGGAGAAGCCCGATGTTTTGTCCCTTTCTAGTCGTGGAGATGTTTCTAGATACATGATGAGATTAACGACCAATTTGCAGAATGCAAACACCGTCATGTGTCATTTGACAATTTTGCTGACTCTACCAAATCATATCTTGGTGAAACGAACGACTTGGGCTCACAAGAGGTTAGGGGTCGGCACTTGTAATTCACTCAAGAGAAGTAGAACTTCAAAAGATTTTGAACATGGTAGATTTGAAGAAAATGTTGGTAATGATTCTAACGGTCAGGAGTGCTATGCATTTGATAAAGACATGAATGAGGGACAGGATGAAGCCTCGGAAACTCATAAAATGAGACTATTAGGGACAGAAGAACCAGAAGGATCCGACGTCGATCACTGTATTTCAAATGAATACTCAATGCATATTGATGATGAAAAGCCAACGGTTCCAATGTCGAATAACACATCGGGCTTTTCTCTAACCTCCCAGCATGATTATGCTTCCAAGAATGATATTTTGGCAGAAGGAAGCAATGGAGTCTCAGATGCTTGCTTAGACAGTGGCCCAGTTGTCAGAAATTGCAATAATCATACAGATAAGATGGTAGACATCAGATCTGATATTGAGCAGAATGACACTTCCTTGGAAACGGAAAGGAATCAGGATGGTCATGGTGATGTAGCAATTGCCCACAACGCTGACGCTGCCGAAACTTTTCTAATAACCCGAGACTTGTGTTCAAGTGATACAGAAAATCAACTGTATGAAGCGCAAATGTGTATTGGATCAGAGCATGTACAAAACGAGGACAAACAAAATTCTCATGTTCCATCGTCGGTTAATGAATTTAATCGACTGCCTGAATTTGCTGAACTTCAAAATTGTTGCATCGACCTGGTAGAGGTTTCCCAGAACAACAACCAAGGGAAGTGCTCAAATGATGACTTGTTGCACACAAGCAATTCTGAAGGGTACAAAGAAGAAACTATCACTGATAGTCTCATTGCTAGTTCTGATTGTCTCAGGAGATTGGGAATTTCCAAGTGTTCAACTGCTATATCATCCGAGTGTGAAAAAGTACGAAGTGGTGAAGGTGCATATGAAACAATGAGTATAGAAATAACTTCTGAAGCAAGAACGACTTGCAATGATCTGGCATTTTGTTGGTTGACTAAAGAGCTGGATTCATCAGTACCCAATGATCACATATTGTCTAGGGAAAATGTTCAACAGTGTATGGAAACTAAAGAGCTGGAGAATCATAAGTCTCCACAGATGAATGGAAACATACTGTGTCAAAATGAAAACGAGCTCAACAGTGAATGGAGCATTTACCAGGCACTGAAACGTGCAAAGAAAGTCCGAGGATGTCAGAAGCAAAATGCTCTTGGAATTAAAACTTCAACAAATGCTGTTCCGTTTTCTGAAGAATGGTTGGCTGCAATTGAAGCTGCTGGAGAGGAAATTCTGACCATGAAAACCGGCGCTGTACAAAACTCACCTCCCGACAAGTCTCAACCCGAACCAGGTCCATGGTCCCCGGATGCCGCGAGCCAAACTCGATCTCATCAGAAGTTTGTTGGATCTTCTAAGCCGCGACTATGCCGGTCTCCGAATCAGAAACAACAAAAAATCAAAGTGCAAGAAGTCAAAAATCAAACCAAACGAAACGACTTCTCATTTCAGTTCAAATGCATTATAGACGGCCAGAAAAGCGCACAAATCAAACGATTCGAACATAAAAAATTACGCACCAGGAGATATCGCCTAAATCGGAGGGGAAATTTGAGGATAGTCATCGCCTCGAAGAGCTCTGCGAGAATATCGAAAGTTGAACTTTAA
Protein sequence
MSNGGPLRIRSRGRFLTDQALPARISFQIHIAEPRERNRRSKPSLPSVVVIDVASIMESDISLIEVAGEDDSLLHPIPENELLNLDKDTDGVTAGNSGFFLCSPLLTDRSSGTVAGSSAASSTDIVCRCSYIDKENVNANNLEVPKLSIMPQQMKRKKKGGGYNLRKSLAWNKAFFTEEGVLDSVELSMITGRTSTSCGEALAAIDEELPAMPGGGCIDDISSHDKFFKDTSTSTPGGDRKNGRYLLPKRGSSTKDNVKSKELSTKDVNRSGSKRGSCPRPVASSPTTSNGTKTVIKDERISRIPVSKRDPTVTSRTTRSTTIRASDAKSNQVAQRGVGNNPKQTTLKSSSINAKSALNKDVNASKTLKAKTSIQQPRRKLASPVLKMNSSHLHNESTDSNEGLKAATNSLISKVLPLNDDGTRKVSASVTQNALPDGGRMLNQTQMPKPSGLRMPSPSMGFFGQKKVSSFQSLPPATSEMHSLSKSNLPNLRTAGPSNPICQLATTLVPRNFVKASSREASGETNVVSCLSSGSSVEPISHDRAKSVLKAAEIHLGKVNVGGASTTNKDLSAHGLEKPDVLSLSSRGDVSRYMMRLTTNLQNANTVMCHLTILLTLPNHILVKRTTWAHKRLGVGTCNSLKRSRTSKDFEHGRFEENVGNDSNGQECYAFDKDMNEGQDEASETHKMRLLGTEEPEGSDVDHCISNEYSMHIDDEKPTVPMSNNTSGFSLTSQHDYASKNDILAEGSNGVSDACLDSGPVVRNCNNHTDKMVDIRSDIEQNDTSLETERNQDGHGDVAIAHNADAAETFLITRDLCSSDTENQLYEAQMCIGSEHVQNEDKQNSHVPSSVNEFNRLPEFAELQNCCIDLVEVSQNNNQGKCSNDDLLHTSNSEGYKEETITDSLIASSDCLRRLGISKCSTAISSECEKVRSGEGAYETMSIEITSEARTTCNDLAFCWLTKELDSSVPNDHILSRENVQQCMETKELENHKSPQMNGNILCQNENELNSEWSIYQALKRAKKVRGCQKQNALGIKTSTNAVPFSEEWLAAIEAAGEEILTMKTGAVQNSPPDKSQPEPGPWSPDAASQTRSHQKFVGSSKPRLCRSPNQKQQKIKVQEVKNQTKRNDFSFQFKCIIDGQKSAQIKRFEHKKLRTRRYRLNRRGNLRIVIASKSSARISKVEL
Homology
BLAST of Sgr023386 vs. NCBI nr
Match:
XP_022146221.1 (uncharacterized protein LOC111015485 [Momordica charantia])
HSP 1 Score: 1132.9 bits (2929), Expect = 0.0e+00
Identity = 684/1117 (61.24%), Postives = 784/1117 (70.19%), Query Frame = 0
Query: 57 MESDISLIEVAGEDDSLLHPIPENELLNLDKDTDGVTAGNSGFFLCSPLLTDRSSGTVAG 116
MES+ISL+EVAGEDD LL IPE++LLNLDKDTD +TAGNSGFFLCSPLLTDRS+GTVAG
Sbjct: 1 MESEISLLEVAGEDDFLLQQIPEDDLLNLDKDTDRITAGNSGFFLCSPLLTDRSNGTVAG 60
Query: 117 SSAASSTDIVCRCSYIDKENVNANNLEVPKLSIMPQQMKRKKKGGGYNLRKSLAWNKAFF 176
SS ASSTD C+ DKEN+N NNLEVPKLSIM QQMK+KKK GGYNLRKSLAWNKAFF
Sbjct: 61 SSTASSTD----CT--DKENINVNNLEVPKLSIMSQQMKKKKKAGGYNLRKSLAWNKAFF 120
Query: 177 TEEGVLDSVELSMITGRTSTSCGEALAAIDEELPAMPGGGCIDDISSHDKFFKDTSTSTP 236
TEEGVLDS+ELSMITG TSTSCGEALAAIDEE+PAMPGGGC +D+SS DK F D S STP
Sbjct: 121 TEEGVLDSMELSMITGSTSTSCGEALAAIDEEIPAMPGGGCYNDLSSQDKLFTDASISTP 180
Query: 237 GGDRKNGRYLLPKRGSSTKDNVKSKELSTKDVNRSGSKRGSCPRPVASS----PTTSNGT 296
DRKNGR LLPKRGS+TKDNVKSKELSTK+ NRS SKRGSCPRPVASS PTTS GT
Sbjct: 181 SADRKNGRCLLPKRGSTTKDNVKSKELSTKNSNRSVSKRGSCPRPVASSSTKRPTTSKGT 240
Query: 297 KTVIKDERISRIPVSKRDPTVTSRTTRSTTIRASDAKSNQVAQRGVG--NNPKQTTLKSS 356
KTV K+ERISR+ V KRDP VTS T R+ T AS+AK+NQVAQRGVG NNPK+TTLK S
Sbjct: 241 KTVNKEERISRVLVPKRDPAVTSWTPRNATTHASNAKNNQVAQRGVGVVNNPKRTTLKGS 300
Query: 357 SINAKSALNKDVNASKTLKAKTSIQQPRRKLASPVLKMNSSHLHNESTDSNEGLKAATNS 416
+IN KSAL+KDVNA+K+LKAKTSIQQPRRKLASPV+KM+SSHL NESTDSNE LKAA+ S
Sbjct: 301 AINTKSALHKDVNANKSLKAKTSIQQPRRKLASPVIKMSSSHLQNESTDSNEELKAASKS 360
Query: 417 LISKVLPLNDDGTRKVSASVTQNALPDGGRMLNQTQMPKPSGLRMPSPSMGFFGQKKVSS 476
LISK +P NDDGTRKVSAS+TQNA P GG ML+QTQMPKPSGLRMPSPSMGFFGQKKVSS
Sbjct: 361 LISKFIPFNDDGTRKVSASITQNAPPTGGSMLSQTQMPKPSGLRMPSPSMGFFGQKKVSS 420
Query: 477 FQSLPPATSEMHSLSKSNLPNLRTAGPSNPICQLATTLVPRNFVKASSREASGETNVVSC 536
FQ+L PATSE H LSKS +PN+RTAGPSN CQLA L P NFVKAS+ EASGETNVVSC
Sbjct: 421 FQNLAPATSETHDLSKSYIPNVRTAGPSNYTCQLA-ALAPTNFVKASNGEASGETNVVSC 480
Query: 537 LSSGSSVEPISHDRAKSVLKAAEIHLGKVNVGGASTTNKDLSAHGLEKPDVLSLSSR--- 596
SSG SV+P+SH RAKS+ +AA IHLGKVNV GAST N DLSAHGLEKPD+LSLS+
Sbjct: 481 FSSGISVQPVSHYRAKSISEAANIHLGKVNVVGASTMNNDLSAHGLEKPDLLSLSNPVLQ 540
Query: 597 --GDVSRYMMRLTTNLQN-----------ANTVMCHL----------------------- 656
GDVSR + L + HL
Sbjct: 541 NFGDVSRIHDEIKVQLAECKPRHLSFNNFGESTKSHLGEKNGLGSQGLPRALSNGSQDFN 600
Query: 657 ---TILLTLPN--HILVKRTTWAHKRLGVGTCNSLKRSRTSKDFEH-GRFEENVGNDSNG 716
+ + PN +++R + +R G+G+CNSLKRSR+S D EH G FEEN+GN S
Sbjct: 601 EPSSEQVEFPNSPKCILERISSDSQRSGIGSCNSLKRSRSSIDLEHGGSFEENIGNGS-- 660
Query: 717 QECYAFDKDMNEGQDEASETHKMRLLGTEEPEGSDVDHCISNEY--------------SM 776
AF+KD+NEGQDE T +MR+L T+E E S +DHCISNE SM
Sbjct: 661 ---CAFEKDVNEGQDETLGTRRMRILRTKEVESSGIDHCISNECKNTIQSTSAWSNSDSM 720
Query: 777 HIDDEKPTVPMSNNTS----GFSLTSQHDYAS-KNDILAEGSNGV--------------- 836
HIDDEKPT+PMSN+ S G SLTS HD+ S +N+ILA SNGV
Sbjct: 721 HIDDEKPTIPMSNSKSLQREGSSLTSHHDFNSHENEILAGVSNGVGENNLKSEIDLCLIQ 780
Query: 837 ---SDACLDSGPVVRNCNNHTDKMVDIRSDIEQNDTSLETERNQDGHGDVAIAHNADAAE 896
+DACLD GPVVRNCN+HTD+M DI S+IEQN+T LE +RN D +GDVAIA N DAAE
Sbjct: 781 HSTADACLDIGPVVRNCNDHTDEMADISSNIEQNNTDLEMKRNLDDYGDVAIARNTDAAE 840
Query: 897 TFLITRDLCSSDTENQLYEAQMCIGSEHVQNEDKQNSHVPSSVNEFNRLPEFAELQNCCI 956
T + DLC+SDTENQLYEAQ+ IGSE V NED QNSHV SSVN+FN+LP LQN C
Sbjct: 841 TLPLNPDLCTSDTENQLYEAQIQIGSEQVPNEDPQNSHVQSSVNDFNQLP---GLQNRCT 900
Query: 957 DLVEVSQNNNQGKCSNDDLLHTSNSEGYKEETITDSLIASSDCLRRLGISKCSTAISSEC 1016
DLV+VSQNN+QGKCSNDD LHTSNSE YK+E IT+ +I ++ A +
Sbjct: 901 DLVDVSQNNHQGKCSNDD-LHTSNSEEYKDEIITNGVIDYTN----------QVASHNNL 960
Query: 1017 EKVRSGEGAYETMSIEITSEARTTCNDLAFCWLTKELDSSVPNDHILSRENVQQCMETKE 1076
E ++ +S S +T N S E+ + C
Sbjct: 961 E--------FQEISASARSSKHSTAN---------------------SSEHTETCS---- 1020
Query: 1077 LENHKSPQMNGNILCQNENELNSEWSIYQALKRAKKVRGCQKQNALGIKTSTNAVPFSEE 1086
C + Q++++ K +KQ G +T TNAVPFSEE
Sbjct: 1021 -------------TCDDN---------AQSMQQLTKSEDVRKQ---GGRTETNAVPFSEE 1033
BLAST of Sgr023386 vs. NCBI nr
Match:
XP_022986753.1 (uncharacterized protein LOC111484416 isoform X3 [Cucurbita maxima])
HSP 1 Score: 1101.7 bits (2848), Expect = 0.0e+00
Identity = 694/1210 (57.36%), Postives = 798/1210 (65.95%), Query Frame = 0
Query: 57 MESDISLIEVAGEDDSLLHPIPENELLNLDKDTDGVTAGNSGFFLCSPLLTDRSSGTVAG 116
MES+ISLIEVAGEDDSLL IPE++LL+L+++ DG TA NSGFFLCSPLLT+RS+GT++
Sbjct: 1 MESNISLIEVAGEDDSLLQQIPEDDLLDLEREMDGATARNSGFFLCSPLLTNRSNGTISS 60
Query: 117 SSAASSTDIVCRCSYIDKENVNANNLEVPKLSIMPQQMKRKKKGGGYNLRKSLAWNKAFF 176
SS AS+ D + DKEN+NAN++E P LSIMPQQMKRKKK GGYNLRKSLAWNKAFF
Sbjct: 61 SSTASAAD------HTDKENINANDIEGPNLSIMPQQMKRKKKAGGYNLRKSLAWNKAFF 120
Query: 177 TEEGVLDSVELSMITGRTSTSCGEALAAIDEELPAMPGGGCIDDISSHDKFFKDTSTSTP 236
TEEGVLDS+ELSMITG T+T L A+DEE+PA+P GG +D+S DK FK+TSTSTP
Sbjct: 121 TEEGVLDSMELSMITGSTNT-----LGAVDEEIPAVPSGGSYNDLSFQDKLFKNTSTSTP 180
Query: 237 GGDRKNGRYLLPKRGSSTKDNVKSKELSTKDVNRSGSKRGSCPRPVASS----PTTSNGT 296
GDRKNGR LLPKRGSSTKDNVK KELS KD+NRSGSKRGSCPRPVASS PTTSN T
Sbjct: 181 IGDRKNGRCLLPKRGSSTKDNVKLKELSAKDLNRSGSKRGSCPRPVASSSVKRPTTSNAT 240
Query: 297 KTVIKDERISRIPVSKRDPTVTSRTTR-STTIRASDAKSNQVAQRG------VGNNPKQT 356
KT+ K+ER SRIPV KRD TV R R + TIRASDAKSNQVAQRG V +NPK T
Sbjct: 241 KTMNKEERTSRIPVPKRDSTVIPRAPRNAATIRASDAKSNQVAQRGHQVAQRVVSNPKMT 300
Query: 357 TLKSSSINAKSALNKDVNASKTLKAKTSIQQPRRKLASPVLKMNSSHLHNESTDSNEGLK 416
T SS+NA ALNKDVNASK+LKAK SIQQP KLA+PVLK+NSS +ES D NEGLK
Sbjct: 301 TPNGSSMNAIRALNKDVNASKSLKAKNSIQQP-GKLANPVLKVNSSRCQHESVDPNEGLK 360
Query: 417 AATNSLISKVLPLNDDGTRKVSASVTQNALPDGGRMLNQTQMPKPSGLRMPSPSMGFFGQ 476
A NSLISK LP NDDGT+KVSAS+TQNA +G MLN TQMPKPSGLRMPSPSMGFFGQ
Sbjct: 361 ARENSLISKPLPSNDDGTKKVSASITQNAPHNGRSMLNPTQMPKPSGLRMPSPSMGFFGQ 420
Query: 477 KKVSSFQSLPPATSEMHSLSKSNLPNLRTAGPSNPICQLATTLVPRNFVKASSREASGET 536
KKVSSFQS+PP TSE+H LSKSN+PN+R GPSNPICQLA TLVPRN VKA+ EASGET
Sbjct: 421 KKVSSFQSVPPDTSELHDLSKSNIPNVRIRGPSNPICQLA-TLVPRNVVKANHGEASGET 480
Query: 537 NVVSCLSSGSSVEPISHDRAKSVLKAAEIHLGKVNVGGASTTNKDLSAHGLEKPDVLSLS 596
VVSCLSSGSS+EP+SHDRAKS LK A IHLGKV+V GAST K LSAHGL KPDVLSLS
Sbjct: 481 KVVSCLSSGSSIEPLSHDRAKSALKVANIHLGKVDVIGASTMKKALSAHGLVKPDVLSLS 540
Query: 597 -----SRGDVSRYMMRLTTNLQNANT--------VMCHLTILLTLPNHI----------- 656
+ GDVSR + L +T C + L + +
Sbjct: 541 NPVLENLGDVSRKHHEIKDQLAGDSTKSYLDETNESCSQGMQNALDDQLRGAKDCNEQSS 600
Query: 657 -----------LVKRTTWAHKRLGVGTCNSLKRSRTSKDFEHGRFEENVGNDSNGQECYA 716
++RT+ H+R+G+GTCNSLKRSRTS +F+HG FE+ V NDSNG E +
Sbjct: 601 EQVEVANSFNCKIERTSSDHQRVGIGTCNSLKRSRTSIEFDHGTFED-VSNDSNGWESCS 660
Query: 717 FDKDMNEGQDEASETHKMRLLGTEEPEGSDVDHCISNEY--------------SMHIDDE 776
FD DE ET KMR+L T + E SDVDH ISNE SMHIDDE
Sbjct: 661 FD------LDEDLETRKMRILRTRKAEPSDVDHFISNECNNTMQTATELSNSDSMHIDDE 720
Query: 777 KPTVPMSNNTS----GFSLTSQHDYASKNDILAEGSNGVSDACLDSGPVVRN-------- 836
PT P+ NN S G SL AS+N++L + SN + + +DS + +
Sbjct: 721 NPTGPILNNKSLQGNGCSL------ASQNEVLTKESNDICENKVDSDYELSSIPHSTADA 780
Query: 837 CNNHTDKMVDIRSDIEQNDTSLETERNQDGHGDVAIAHNADAAETFLITRDLCSSDTENQ 896
CN +TD++VDI SD++QN TSLETER Q+ GDV IA NAD AET LI+RD SSDTENQ
Sbjct: 781 CNKYTDEIVDIVSDMQQNHTSLETERYQNDRGDVEIACNADVAETLLISRDFHSSDTENQ 840
Query: 897 LYEAQMCIGSEHVQNEDKQNSHVPSSVNEFNRLPEFAELQNCCIDLVEVSQNNNQGKCSN 956
LYEA + I SEHVQNEDKQNS V SSVN+ CCIDLV+VS NNQGKCS
Sbjct: 841 LYEAHIRIESEHVQNEDKQNSFVQSSVND------------CCIDLVDVSPKNNQGKCSI 900
Query: 957 DDLLHTSNSE-------------------------------GYKEET----ITDSLIASS 1016
D LLH S+SE +EET D+++ S
Sbjct: 901 DHLLHRSSSEEIITGSVDNSDVCTSEGLSNCNQMASPKDNSSIQEETHDTRTGDNILESQ 960
Query: 1017 DCLRRLGISKCSTAISSECEK-VRSGEGAYETMSIEITSEARTTCND------------- 1076
+ + S CSTA SSE +K + SGEG ETMS +ITSE+R TCND
Sbjct: 961 EIEASISSSSCSTAKSSEYDKIILSGEGTSETMS-KITSESRMTCNDQSFCSPTKDLGSS 1020
Query: 1077 -------------------------------------------------LAFCWLTKELD 1086
+FC TK+L
Sbjct: 1021 IPIDDTLSSENVQQYVGTKELDNHKSPEMNEMCSTYNDNAQSEATTCNVSSFCSPTKDLG 1080
BLAST of Sgr023386 vs. NCBI nr
Match:
XP_022986754.1 (uncharacterized protein LOC111484416 isoform X4 [Cucurbita maxima])
HSP 1 Score: 1100.9 bits (2846), Expect = 0.0e+00
Identity = 694/1209 (57.40%), Postives = 796/1209 (65.84%), Query Frame = 0
Query: 57 MESDISLIEVAGEDDSLLHPIPENELLNLDKDTDGVTAGNSGFFLCSPLLTDRSSGTVAG 116
MES+ISLIEVAGEDDSLL IPE++LL+L+++ DG TA NSGFFLCSPLLT+RS+GT++
Sbjct: 1 MESNISLIEVAGEDDSLLQQIPEDDLLDLEREMDGATARNSGFFLCSPLLTNRSNGTISS 60
Query: 117 SSAASSTDIVCRCSYIDKENVNANNLEVPKLSIMPQQMKRKKKGGGYNLRKSLAWNKAFF 176
SS AS+ D + DKEN+NAN++E P LSIMPQQMKRKKK GGYNLRKSLAWNKAFF
Sbjct: 61 SSTASAAD------HTDKENINANDIEGPNLSIMPQQMKRKKKAGGYNLRKSLAWNKAFF 120
Query: 177 TEEGVLDSVELSMITGRTSTSCGEALAAIDEELPAMPGGGCIDDISSHDKFFKDTSTSTP 236
TEEGVLDS+ELSMITG T+T L A+DEE+PA+P GG +D+S DK FK+TSTSTP
Sbjct: 121 TEEGVLDSMELSMITGSTNT-----LGAVDEEIPAVPSGGSYNDLSFQDKLFKNTSTSTP 180
Query: 237 GGDRKNGRYLLPKRGSSTKDNVKSKELSTKDVNRSGSKRGSCPRPVASS----PTTSNGT 296
GDRKNGR LLPKRGSSTKDNVK KELS KD+NRSGSKRGSCPRPVASS PTTSN T
Sbjct: 181 IGDRKNGRCLLPKRGSSTKDNVKLKELSAKDLNRSGSKRGSCPRPVASSSVKRPTTSNAT 240
Query: 297 KTVIKDERISRIPVSKRDPTVTSRTTR-STTIRASDAKSNQVAQRG------VGNNPKQT 356
KT+ K+ER SRIPV KRD TV R R + TIRASDAKSNQVAQRG V +NPK T
Sbjct: 241 KTMNKEERTSRIPVPKRDSTVIPRAPRNAATIRASDAKSNQVAQRGHQVAQRVVSNPKMT 300
Query: 357 TLKSSSINAKSALNKDVNASKTLKAKTSIQQPRRKLASPVLKMNSSHLHNESTDSNEGLK 416
T SS+NA ALNKDVNASK+LKAK SIQQP KLA+PVLK+NSS +ES D NEGLK
Sbjct: 301 TPNGSSMNAIRALNKDVNASKSLKAKNSIQQP-GKLANPVLKVNSSRCQHESVDPNEGLK 360
Query: 417 AATNSLISKVLPLNDDGTRKVSASVTQNALPDGGRMLNQTQMPKPSGLRMPSPSMGFFGQ 476
A NSLISK LP NDDGT+KVSAS+TQNA +G MLN TQMPKPSGLRMPSPSMGFFGQ
Sbjct: 361 ARENSLISKPLPSNDDGTKKVSASITQNAPHNGRSMLNPTQMPKPSGLRMPSPSMGFFGQ 420
Query: 477 KKVSSFQSLPPATSEMHSLSKSNLPNLRTAGPSNPICQLATTLVPRNFVKASSREASGET 536
KKVSSFQS+PP TSE+H LSKSN+PN+R GPSNPICQLA TLVPRN VKA+ EASGET
Sbjct: 421 KKVSSFQSVPPDTSELHDLSKSNIPNVRIRGPSNPICQLA-TLVPRNVVKANHGEASGET 480
Query: 537 NVVSCLSSGSSVEPISHDRAKSVLKAAEIHLGKVNVGGASTTNKDLSAHGLEKPDVLSLS 596
VVSCLSSGSS+EP+SHDRAKS LK A IHLGKV+V GAST K LSAHGL KPDVLSLS
Sbjct: 481 KVVSCLSSGSSIEPLSHDRAKSALKVANIHLGKVDVIGASTMKKALSAHGLVKPDVLSLS 540
Query: 597 -----SRGDVSRYMMRLTTNLQNANT--------VMCHLTILLTLPNHI----------- 656
+ GDVSR + L +T C + L + +
Sbjct: 541 NPVLENLGDVSRKHHEIKDQLAGDSTKSYLDETNESCSQGMQNALDDQLRGAKDCNEQSS 600
Query: 657 -----------LVKRTTWAHKRLGVGTCNSLKRSRTSKDFEHGRFEENVGNDSNGQECYA 716
++RT+ H+R+G+GTCNSLKRSRTS +F+HG FE+ V NDSNG E +
Sbjct: 601 EQVEVANSFNCKIERTSSDHQRVGIGTCNSLKRSRTSIEFDHGTFED-VSNDSNGWESCS 660
Query: 717 FDKDMNEGQDEASETHKMRLLGTEEPEGSDVDHCISNEY--------------SMHIDDE 776
FD DE ET KMR+L T + E SDVDH ISNE SMHIDDE
Sbjct: 661 FD------LDEDLETRKMRILRTRKAEPSDVDHFISNECNNTMQTATELSNSDSMHIDDE 720
Query: 777 KPTVPMSNNTS----GFSLTSQHDYASKNDILAEGSNGVSDACLDSGPVVRN-------- 836
PT P+ NN S G SL AS+N++L + SN + + +DS + +
Sbjct: 721 NPTGPILNNKSLQGNGCSL------ASQNEVLTKESNDICENKVDSDYELSSIPHSTADA 780
Query: 837 CNNHTDKMVDIRSDIEQNDTSLETERNQDGHGDVAIAHNADAAETFLITRDLCSSDTENQ 896
CN +TD++VDI SD++QN TSLETER Q+ GDV IA NAD AET LI+RD SSDTENQ
Sbjct: 781 CNKYTDEIVDIVSDMQQNHTSLETERYQNDRGDVEIACNADVAETLLISRDFHSSDTENQ 840
Query: 897 LYEAQMCIGSEHVQNEDKQNSHVPSSVNEFNRLPEFAELQNCCIDLVEVSQNNNQGKCSN 956
LYEA + I SEHVQNEDKQNS V SSVN+ CCIDLV+VS NNQGKCS
Sbjct: 841 LYEAHIRIESEHVQNEDKQNSFVQSSVND------------CCIDLVDVSPKNNQGKCSI 900
Query: 957 DDLLHTSNSE-------------------------------GYKEET----ITDSLIASS 1016
D LLH S+SE +EET D+++ S
Sbjct: 901 DHLLHRSSSEEIITGSVDNSDVCTSEGLSNCNQMASPKDNSSIQEETHDTRTGDNILESQ 960
Query: 1017 DCLRRLGISKCSTAISSECEK-VRSGEGAYETMSIEITSEARTTCND------------- 1076
+ + S CSTA SSE +K + SGEG ETMS +ITSE+R TCND
Sbjct: 961 EIEASISSSSCSTAKSSEYDKIILSGEGTSETMS-KITSESRMTCNDQSFCSPTKDLGSS 1020
Query: 1077 -------------------------------------------------LAFCWLTKELD 1086
+FC TK+L
Sbjct: 1021 IPIDDTLSSENVQQYVGTKELDNHKSPEMNEMCSTYNDNAQSEATTCNVSSFCSPTKDLG 1080
BLAST of Sgr023386 vs. NCBI nr
Match:
XP_022943291.1 (uncharacterized protein LOC111448106 isoform X3 [Cucurbita moschata])
HSP 1 Score: 1089.3 bits (2816), Expect = 0.0e+00
Identity = 703/1265 (55.57%), Postives = 793/1265 (62.69%), Query Frame = 0
Query: 57 MESDISLIEVAGEDDSLLHPIPENELLNLDKDTDGVTAGNSGFFLCSPLLTDRSSGTVAG 116
MES+ISLIEVAGEDDSLL IPE+ LL+L+++ DG TA NSGFFLCSPLLT+RS G ++
Sbjct: 1 MESNISLIEVAGEDDSLLQQIPEDGLLDLEREMDGATARNSGFFLCSPLLTNRSIGAISS 60
Query: 117 SSAASSTDIVCRCSYIDKENVNANNLEVPKLSIMPQQMKRKKKGGGYNLRKSLAWNKAFF 176
SS AS+ D + DKEN+NAN++E P LSIMPQQMKRKKK GGYNLRKSLAWNKAFF
Sbjct: 61 SSTASAAD------HTDKENINANDIEGPNLSIMPQQMKRKKKAGGYNLRKSLAWNKAFF 120
Query: 177 TEEGVLDSVELSMITGRTSTSCGEALAAIDEELPAMPGGGCIDDISSHDKFFKDTSTSTP 236
TEEGVLDS+ELSMITG T+T L AIDEE+PAMPGG +D+S DK FKDTST TP
Sbjct: 121 TEEGVLDSMELSMITGSTNT-----LGAIDEEIPAMPGGSSYNDLSFQDKLFKDTSTGTP 180
Query: 237 GGDRKNGRYLLPKRGSSTKDNVKSKELSTKDVNRSGSKRGSCPRPVASS----PTTSNGT 296
GDRKNGR LLPKRGSSTKDNVK KELS KD+NRSGSKRGSCPRPVASS PTTSN T
Sbjct: 181 IGDRKNGRCLLPKRGSSTKDNVKLKELSAKDLNRSGSKRGSCPRPVASSSVKRPTTSNAT 240
Query: 297 KTVIKDERISRIPVSKRDPTVTSRTTR-STTIRASDAKSNQVAQRGVGNNPKQTTLKSSS 356
KT+ K+ER SRIPV KRD TV R R + TIRASDAKSNQVAQR V +NPK +T KSSS
Sbjct: 241 KTMNKEERTSRIPVPKRDSTVIPRAPRNAATIRASDAKSNQVAQR-VVSNPKMSTSKSSS 300
Query: 357 INAKSALNKDVNASKTLKAKTSIQQPRRKLASPVLKMNSSHLHNESTDSNEGLKAATNSL 416
+N+ ALNKDVNASK+LKAK SIQQP KLA+PVLK+NSS +ES D NEGLKA NSL
Sbjct: 301 MNSLRALNKDVNASKSLKAKNSIQQP-GKLANPVLKVNSSRCQHESVDPNEGLKAGANSL 360
Query: 417 ISKVLPLNDDGTRKVSASVTQNALPDGGRMLNQTQMPKPSGLRMPSPSMGFFGQKKVSSF 476
ISK LP NDDGT+K SAS+TQNA PDG MLN TQMPKPSGLRMPSPSMGFFGQKKVSSF
Sbjct: 361 ISKPLPSNDDGTKKASASITQNAPPDGRSMLNPTQMPKPSGLRMPSPSMGFFGQKKVSSF 420
Query: 477 QSLPPATSEMHSLSKSNLPNLRTAGPSNPICQLATTLVPRNFVKASSREASGETNVVSCL 536
QS+PP TSE+H LSKS++PN+R GPSNPICQLA TLVPRN VKA+ EASGET VVSCL
Sbjct: 421 QSVPPDTSELHDLSKSSIPNVRIRGPSNPICQLA-TLVPRNVVKANHGEASGETKVVSCL 480
Query: 537 SSGSSVEPISHDRAKSVLKAAEIHLGKVNVGGASTTNKDLSAHGLEKPDVLSLSS----- 596
SSGSS+EP+SH RAKS LK A IHLGKV+V GAST NK LSAHGL KPDVLSLS+
Sbjct: 481 SSGSSIEPVSHYRAKSALKLANIHLGKVDVIGASTMNKALSAHGLVKPDVLSLSNPVLEH 540
Query: 597 RGDVSRYMMRLTTNLQNANT--------VMCHLTILLTLPNHI----------------- 656
GDVSR + L +T C + L + +
Sbjct: 541 LGDVSRTHHEIKDQLAGDSTKSYLDETNECCSQGMQNALDDQLRGAKDCNEQSSEQVEVA 600
Query: 657 -----LVKRTTWAHKRLGVGTCNSLKRSRTSKDFEHGRFEENVGNDSNGQECYAFDKDMN 716
++RT+ H+R+G+GTCNSLKRSRTS +F+HG FE+ V NDSNG E +FD
Sbjct: 601 NSSYCKIERTSSDHQRVGIGTCNSLKRSRTSIEFDHGIFED-VSNDSNGWESCSFD---- 660
Query: 717 EGQDEASETHKMRLLGTEEPEGSDVDHCISNEY--------------SMHIDDEKPTVPM 776
QDE ET KMR+L T + E SDVDH ISNE SMHIDDE PT P+
Sbjct: 661 --QDEDLETRKMRILRTRKAEPSDVDHFISNECNNTMQTATELSNSDSMHIDDENPTGPI 720
Query: 777 SNNTS----GFSLTSQHDYASK--NDILAEGSNGVSDACLDSGPVVRNCNNHTDKMVDIR 836
NN S G+SL SQ+++ +K NDI + D CNN TD+MVDI
Sbjct: 721 LNNQSLQGNGYSLASQNEFLTKESNDICENKVDSDYDLSSIPHSTADACNNCTDEMVDIV 780
Query: 837 SDIEQNDTSLETERNQDGHGDVAIAHNADAAETFLITRDLCSSDTENQLYEAQMCIGSEH 896
SD++QN TS ETER Q+ GDV IA NAD AET LI+RD SSDTENQLYEA + I SEH
Sbjct: 781 SDMQQNHTSFETERYQNDRGDVEIACNADVAETLLISRDFHSSDTENQLYEAHIRIESEH 840
Query: 897 VQNEDKQNSHVPSSVNEFNRLPEFAELQNCCIDLVEVSQNNNQGKCSNDDLLHTSNSE-- 956
VQNEDKQNS V SSVN+ CCIDLV+VS NNQGKCS DDLLH SNSE
Sbjct: 841 VQNEDKQNSLVQSSVND------------CCIDLVDVSPKNNQGKCSIDDLLHRSNSEEI 900
Query: 957 ---------------------------GYKEET----ITDSLIASSDCLRRLGISKCSTA 1016
EET D+++ S + L S+CSTA
Sbjct: 901 ISVDNYDVCTSEGLSNCNQMASPKDNNSIHEETHDTRTGDNILESRELEASLRSSRCSTA 960
Query: 1017 ISSECEKVRSGEGAYETMSIEITSEARTTCNDLAFC------------------------ 1076
SSE +K SGEG ETMS +ITSE+R TCND +FC
Sbjct: 961 KSSEDDK--SGEGTSETMS-KITSESRMTCNDQSFCSPTKDLGSSIPNDDKLSSANVQQY 1020
Query: 1077 --------------------------------------WLTKELDSSVPNDHILSRENVQ 1086
TK+L SSVPND ILSRENVQ
Sbjct: 1021 AGTKELENHKSPEMNEMCSTYNDNAQSEATTCNDSSFRSPTKDLGSSVPNDDILSRENVQ 1080
BLAST of Sgr023386 vs. NCBI nr
Match:
XP_022943290.1 (uncharacterized protein LOC111448106 isoform X2 [Cucurbita moschata])
HSP 1 Score: 1088.9 bits (2815), Expect = 0.0e+00
Identity = 704/1272 (55.35%), Postives = 794/1272 (62.42%), Query Frame = 0
Query: 57 MESDISLIEVAGEDDSLLHPIPENELLNLDKDTDGVTAGNSGFFLCSPLLTDRSSGTVAG 116
MES+ISLIEVAGEDDSLL IPE+ LL+L+++ DG TA NSGFFLCSPLLT+RS G ++
Sbjct: 1 MESNISLIEVAGEDDSLLQQIPEDGLLDLEREMDGATARNSGFFLCSPLLTNRSIGAISS 60
Query: 117 SSAASSTDIVCRCSYIDKENVNANNLEVPKLSIMPQQMKRKKKGGGYNLRKSLAWNKAFF 176
SS AS+ D + DKEN+NAN++E P LSIMPQQMKRKKK GGYNLRKSLAWNKAFF
Sbjct: 61 SSTASAAD------HTDKENINANDIEGPNLSIMPQQMKRKKKAGGYNLRKSLAWNKAFF 120
Query: 177 TEEGVLDSVELSMITGRTSTSCGEALAAIDEELPAMPGGGCIDDISSHDKFFKDTSTSTP 236
TEEGVLDS+ELSMITG T+T L AIDEE+PAMPGG +D+S DK FKDTST TP
Sbjct: 121 TEEGVLDSMELSMITGSTNT-----LGAIDEEIPAMPGGSSYNDLSFQDKLFKDTSTGTP 180
Query: 237 GGDRKNGRYLLPKRGSSTKDNVKSKELSTKDVNRSGSKRGSCPRPVASS----PTTSNGT 296
GDRKNGR LLPKRGSSTKDNVK KELS KD+NRSGSKRGSCPRPVASS PTTSN T
Sbjct: 181 IGDRKNGRCLLPKRGSSTKDNVKLKELSAKDLNRSGSKRGSCPRPVASSSVKRPTTSNAT 240
Query: 297 KTVIKDERISRIPVSKRDPTVTSRTTR-STTIRASDAKSNQVAQRG--------VGNNPK 356
KT+ K+ER SRIPV KRD TV R R + TIRASDAKSNQVAQRG V +NPK
Sbjct: 241 KTMNKEERTSRIPVPKRDSTVIPRAPRNAATIRASDAKSNQVAQRGNHCKIAQTVVSNPK 300
Query: 357 QTTLKSSSINAKSALNKDVNASKTLKAKTSIQQPRRKLASPVLKMNSSHLHNESTDSNEG 416
+T KSSS+N+ ALNKDVNASK+LKAK SIQQP KLA+PVLK+NSS +ES D NEG
Sbjct: 301 MSTSKSSSMNSLRALNKDVNASKSLKAKNSIQQP-GKLANPVLKVNSSRCQHESVDPNEG 360
Query: 417 LKAATNSLISKVLPLNDDGTRKVSASVTQNALPDGGRMLNQTQMPKPSGLRMPSPSMGFF 476
LKA NSLISK LP NDDGT+K SAS+TQNA PDG MLN TQMPKPSGLRMPSPSMGFF
Sbjct: 361 LKAGANSLISKPLPSNDDGTKKASASITQNAPPDGRSMLNPTQMPKPSGLRMPSPSMGFF 420
Query: 477 GQKKVSSFQSLPPATSEMHSLSKSNLPNLRTAGPSNPICQLATTLVPRNFVKASSREASG 536
GQKKVSSFQS+PP TSE+H LSKS++PN+R GPSNPICQLA TLVPRN VKA+ EASG
Sbjct: 421 GQKKVSSFQSVPPDTSELHDLSKSSIPNVRIRGPSNPICQLA-TLVPRNVVKANHGEASG 480
Query: 537 ETNVVSCLSSGSSVEPISHDRAKSVLKAAEIHLGKVNVGGASTTNKDLSAHGLEKPDVLS 596
ET VVSCLSSGSS+EP+SH RAKS LK A IHLGKV+V GAST NK LSAHGL KPDVLS
Sbjct: 481 ETKVVSCLSSGSSIEPVSHYRAKSALKLANIHLGKVDVIGASTMNKALSAHGLVKPDVLS 540
Query: 597 LSS-----RGDVSRYMMRLTTNLQNANT--------VMCHLTILLTLPNHI--------- 656
LS+ GDVSR + L +T C + L + +
Sbjct: 541 LSNPVLEHLGDVSRTHHEIKDQLAGDSTKSYLDETNECCSQGMQNALDDQLRGAKDCNEQ 600
Query: 657 -------------LVKRTTWAHKRLGVGTCNSLKRSRTSKDFEHGRFEENVGNDSNGQEC 716
++RT+ H+R+G+GTCNSLKRSRTS +F+HG FE+ V NDSNG E
Sbjct: 601 SSEQVEVANSSYCKIERTSSDHQRVGIGTCNSLKRSRTSIEFDHGIFED-VSNDSNGWES 660
Query: 717 YAFDKDMNEGQDEASETHKMRLLGTEEPEGSDVDHCISNEY--------------SMHID 776
+FD QDE ET KMR+L T + E SDVDH ISNE SMHID
Sbjct: 661 CSFD------QDEDLETRKMRILRTRKAEPSDVDHFISNECNNTMQTATELSNSDSMHID 720
Query: 777 DEKPTVPMSNNTS----GFSLTSQHDYASK--NDILAEGSNGVSDACLDSGPVVRNCNNH 836
DE PT P+ NN S G+SL SQ+++ +K NDI + D CNN
Sbjct: 721 DENPTGPILNNQSLQGNGYSLASQNEFLTKESNDICENKVDSDYDLSSIPHSTADACNNC 780
Query: 837 TDKMVDIRSDIEQNDTSLETERNQDGHGDVAIAHNADAAETFLITRDLCSSDTENQLYEA 896
TD+MVDI SD++QN TS ETER Q+ GDV IA NAD AET LI+RD SSDTENQLYEA
Sbjct: 781 TDEMVDIVSDMQQNHTSFETERYQNDRGDVEIACNADVAETLLISRDFHSSDTENQLYEA 840
Query: 897 QMCIGSEHVQNEDKQNSHVPSSVNEFNRLPEFAELQNCCIDLVEVSQNNNQGKCSNDDLL 956
+ I SEHVQNEDKQNS V SSVN+ CCIDLV+VS NNQGKCS DDLL
Sbjct: 841 HIRIESEHVQNEDKQNSLVQSSVND------------CCIDLVDVSPKNNQGKCSIDDLL 900
Query: 957 HTSNSE-----------------------------GYKEET----ITDSLIASSDCLRRL 1016
H SNSE EET D+++ S + L
Sbjct: 901 HRSNSEEIISVDNYDVCTSEGLSNCNQMASPKDNNSIHEETHDTRTGDNILESRELEASL 960
Query: 1017 GISKCSTAISSECEKVRSGEGAYETMSIEITSEARTTCNDLAFC---------------- 1076
S+CSTA SSE +K SGEG ETMS +ITSE+R TCND +FC
Sbjct: 961 RSSRCSTAKSSEDDK--SGEGTSETMS-KITSESRMTCNDQSFCSPTKDLGSSIPNDDKL 1020
Query: 1077 ----------------------------------------------WLTKELDSSVPNDH 1086
TK+L SSVPND
Sbjct: 1021 SSANVQQYAGTKELENHKSPEMNEMCSTYNDNAQSEATTCNDSSFRSPTKDLGSSVPNDD 1080
BLAST of Sgr023386 vs. ExPASy TrEMBL
Match:
A0A6J1CYM8 (uncharacterized protein LOC111015485 OS=Momordica charantia OX=3673 GN=LOC111015485 PE=4 SV=1)
HSP 1 Score: 1132.9 bits (2929), Expect = 0.0e+00
Identity = 684/1117 (61.24%), Postives = 784/1117 (70.19%), Query Frame = 0
Query: 57 MESDISLIEVAGEDDSLLHPIPENELLNLDKDTDGVTAGNSGFFLCSPLLTDRSSGTVAG 116
MES+ISL+EVAGEDD LL IPE++LLNLDKDTD +TAGNSGFFLCSPLLTDRS+GTVAG
Sbjct: 1 MESEISLLEVAGEDDFLLQQIPEDDLLNLDKDTDRITAGNSGFFLCSPLLTDRSNGTVAG 60
Query: 117 SSAASSTDIVCRCSYIDKENVNANNLEVPKLSIMPQQMKRKKKGGGYNLRKSLAWNKAFF 176
SS ASSTD C+ DKEN+N NNLEVPKLSIM QQMK+KKK GGYNLRKSLAWNKAFF
Sbjct: 61 SSTASSTD----CT--DKENINVNNLEVPKLSIMSQQMKKKKKAGGYNLRKSLAWNKAFF 120
Query: 177 TEEGVLDSVELSMITGRTSTSCGEALAAIDEELPAMPGGGCIDDISSHDKFFKDTSTSTP 236
TEEGVLDS+ELSMITG TSTSCGEALAAIDEE+PAMPGGGC +D+SS DK F D S STP
Sbjct: 121 TEEGVLDSMELSMITGSTSTSCGEALAAIDEEIPAMPGGGCYNDLSSQDKLFTDASISTP 180
Query: 237 GGDRKNGRYLLPKRGSSTKDNVKSKELSTKDVNRSGSKRGSCPRPVASS----PTTSNGT 296
DRKNGR LLPKRGS+TKDNVKSKELSTK+ NRS SKRGSCPRPVASS PTTS GT
Sbjct: 181 SADRKNGRCLLPKRGSTTKDNVKSKELSTKNSNRSVSKRGSCPRPVASSSTKRPTTSKGT 240
Query: 297 KTVIKDERISRIPVSKRDPTVTSRTTRSTTIRASDAKSNQVAQRGVG--NNPKQTTLKSS 356
KTV K+ERISR+ V KRDP VTS T R+ T AS+AK+NQVAQRGVG NNPK+TTLK S
Sbjct: 241 KTVNKEERISRVLVPKRDPAVTSWTPRNATTHASNAKNNQVAQRGVGVVNNPKRTTLKGS 300
Query: 357 SINAKSALNKDVNASKTLKAKTSIQQPRRKLASPVLKMNSSHLHNESTDSNEGLKAATNS 416
+IN KSAL+KDVNA+K+LKAKTSIQQPRRKLASPV+KM+SSHL NESTDSNE LKAA+ S
Sbjct: 301 AINTKSALHKDVNANKSLKAKTSIQQPRRKLASPVIKMSSSHLQNESTDSNEELKAASKS 360
Query: 417 LISKVLPLNDDGTRKVSASVTQNALPDGGRMLNQTQMPKPSGLRMPSPSMGFFGQKKVSS 476
LISK +P NDDGTRKVSAS+TQNA P GG ML+QTQMPKPSGLRMPSPSMGFFGQKKVSS
Sbjct: 361 LISKFIPFNDDGTRKVSASITQNAPPTGGSMLSQTQMPKPSGLRMPSPSMGFFGQKKVSS 420
Query: 477 FQSLPPATSEMHSLSKSNLPNLRTAGPSNPICQLATTLVPRNFVKASSREASGETNVVSC 536
FQ+L PATSE H LSKS +PN+RTAGPSN CQLA L P NFVKAS+ EASGETNVVSC
Sbjct: 421 FQNLAPATSETHDLSKSYIPNVRTAGPSNYTCQLA-ALAPTNFVKASNGEASGETNVVSC 480
Query: 537 LSSGSSVEPISHDRAKSVLKAAEIHLGKVNVGGASTTNKDLSAHGLEKPDVLSLSSR--- 596
SSG SV+P+SH RAKS+ +AA IHLGKVNV GAST N DLSAHGLEKPD+LSLS+
Sbjct: 481 FSSGISVQPVSHYRAKSISEAANIHLGKVNVVGASTMNNDLSAHGLEKPDLLSLSNPVLQ 540
Query: 597 --GDVSRYMMRLTTNLQN-----------ANTVMCHL----------------------- 656
GDVSR + L + HL
Sbjct: 541 NFGDVSRIHDEIKVQLAECKPRHLSFNNFGESTKSHLGEKNGLGSQGLPRALSNGSQDFN 600
Query: 657 ---TILLTLPN--HILVKRTTWAHKRLGVGTCNSLKRSRTSKDFEH-GRFEENVGNDSNG 716
+ + PN +++R + +R G+G+CNSLKRSR+S D EH G FEEN+GN S
Sbjct: 601 EPSSEQVEFPNSPKCILERISSDSQRSGIGSCNSLKRSRSSIDLEHGGSFEENIGNGS-- 660
Query: 717 QECYAFDKDMNEGQDEASETHKMRLLGTEEPEGSDVDHCISNEY--------------SM 776
AF+KD+NEGQDE T +MR+L T+E E S +DHCISNE SM
Sbjct: 661 ---CAFEKDVNEGQDETLGTRRMRILRTKEVESSGIDHCISNECKNTIQSTSAWSNSDSM 720
Query: 777 HIDDEKPTVPMSNNTS----GFSLTSQHDYAS-KNDILAEGSNGV--------------- 836
HIDDEKPT+PMSN+ S G SLTS HD+ S +N+ILA SNGV
Sbjct: 721 HIDDEKPTIPMSNSKSLQREGSSLTSHHDFNSHENEILAGVSNGVGENNLKSEIDLCLIQ 780
Query: 837 ---SDACLDSGPVVRNCNNHTDKMVDIRSDIEQNDTSLETERNQDGHGDVAIAHNADAAE 896
+DACLD GPVVRNCN+HTD+M DI S+IEQN+T LE +RN D +GDVAIA N DAAE
Sbjct: 781 HSTADACLDIGPVVRNCNDHTDEMADISSNIEQNNTDLEMKRNLDDYGDVAIARNTDAAE 840
Query: 897 TFLITRDLCSSDTENQLYEAQMCIGSEHVQNEDKQNSHVPSSVNEFNRLPEFAELQNCCI 956
T + DLC+SDTENQLYEAQ+ IGSE V NED QNSHV SSVN+FN+LP LQN C
Sbjct: 841 TLPLNPDLCTSDTENQLYEAQIQIGSEQVPNEDPQNSHVQSSVNDFNQLP---GLQNRCT 900
Query: 957 DLVEVSQNNNQGKCSNDDLLHTSNSEGYKEETITDSLIASSDCLRRLGISKCSTAISSEC 1016
DLV+VSQNN+QGKCSNDD LHTSNSE YK+E IT+ +I ++ A +
Sbjct: 901 DLVDVSQNNHQGKCSNDD-LHTSNSEEYKDEIITNGVIDYTN----------QVASHNNL 960
Query: 1017 EKVRSGEGAYETMSIEITSEARTTCNDLAFCWLTKELDSSVPNDHILSRENVQQCMETKE 1076
E ++ +S S +T N S E+ + C
Sbjct: 961 E--------FQEISASARSSKHSTAN---------------------SSEHTETCS---- 1020
Query: 1077 LENHKSPQMNGNILCQNENELNSEWSIYQALKRAKKVRGCQKQNALGIKTSTNAVPFSEE 1086
C + Q++++ K +KQ G +T TNAVPFSEE
Sbjct: 1021 -------------TCDDN---------AQSMQQLTKSEDVRKQ---GGRTETNAVPFSEE 1033
BLAST of Sgr023386 vs. ExPASy TrEMBL
Match:
A0A6J1JHG3 (uncharacterized protein LOC111484416 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111484416 PE=4 SV=1)
HSP 1 Score: 1101.7 bits (2848), Expect = 0.0e+00
Identity = 694/1210 (57.36%), Postives = 798/1210 (65.95%), Query Frame = 0
Query: 57 MESDISLIEVAGEDDSLLHPIPENELLNLDKDTDGVTAGNSGFFLCSPLLTDRSSGTVAG 116
MES+ISLIEVAGEDDSLL IPE++LL+L+++ DG TA NSGFFLCSPLLT+RS+GT++
Sbjct: 1 MESNISLIEVAGEDDSLLQQIPEDDLLDLEREMDGATARNSGFFLCSPLLTNRSNGTISS 60
Query: 117 SSAASSTDIVCRCSYIDKENVNANNLEVPKLSIMPQQMKRKKKGGGYNLRKSLAWNKAFF 176
SS AS+ D + DKEN+NAN++E P LSIMPQQMKRKKK GGYNLRKSLAWNKAFF
Sbjct: 61 SSTASAAD------HTDKENINANDIEGPNLSIMPQQMKRKKKAGGYNLRKSLAWNKAFF 120
Query: 177 TEEGVLDSVELSMITGRTSTSCGEALAAIDEELPAMPGGGCIDDISSHDKFFKDTSTSTP 236
TEEGVLDS+ELSMITG T+T L A+DEE+PA+P GG +D+S DK FK+TSTSTP
Sbjct: 121 TEEGVLDSMELSMITGSTNT-----LGAVDEEIPAVPSGGSYNDLSFQDKLFKNTSTSTP 180
Query: 237 GGDRKNGRYLLPKRGSSTKDNVKSKELSTKDVNRSGSKRGSCPRPVASS----PTTSNGT 296
GDRKNGR LLPKRGSSTKDNVK KELS KD+NRSGSKRGSCPRPVASS PTTSN T
Sbjct: 181 IGDRKNGRCLLPKRGSSTKDNVKLKELSAKDLNRSGSKRGSCPRPVASSSVKRPTTSNAT 240
Query: 297 KTVIKDERISRIPVSKRDPTVTSRTTR-STTIRASDAKSNQVAQRG------VGNNPKQT 356
KT+ K+ER SRIPV KRD TV R R + TIRASDAKSNQVAQRG V +NPK T
Sbjct: 241 KTMNKEERTSRIPVPKRDSTVIPRAPRNAATIRASDAKSNQVAQRGHQVAQRVVSNPKMT 300
Query: 357 TLKSSSINAKSALNKDVNASKTLKAKTSIQQPRRKLASPVLKMNSSHLHNESTDSNEGLK 416
T SS+NA ALNKDVNASK+LKAK SIQQP KLA+PVLK+NSS +ES D NEGLK
Sbjct: 301 TPNGSSMNAIRALNKDVNASKSLKAKNSIQQP-GKLANPVLKVNSSRCQHESVDPNEGLK 360
Query: 417 AATNSLISKVLPLNDDGTRKVSASVTQNALPDGGRMLNQTQMPKPSGLRMPSPSMGFFGQ 476
A NSLISK LP NDDGT+KVSAS+TQNA +G MLN TQMPKPSGLRMPSPSMGFFGQ
Sbjct: 361 ARENSLISKPLPSNDDGTKKVSASITQNAPHNGRSMLNPTQMPKPSGLRMPSPSMGFFGQ 420
Query: 477 KKVSSFQSLPPATSEMHSLSKSNLPNLRTAGPSNPICQLATTLVPRNFVKASSREASGET 536
KKVSSFQS+PP TSE+H LSKSN+PN+R GPSNPICQLA TLVPRN VKA+ EASGET
Sbjct: 421 KKVSSFQSVPPDTSELHDLSKSNIPNVRIRGPSNPICQLA-TLVPRNVVKANHGEASGET 480
Query: 537 NVVSCLSSGSSVEPISHDRAKSVLKAAEIHLGKVNVGGASTTNKDLSAHGLEKPDVLSLS 596
VVSCLSSGSS+EP+SHDRAKS LK A IHLGKV+V GAST K LSAHGL KPDVLSLS
Sbjct: 481 KVVSCLSSGSSIEPLSHDRAKSALKVANIHLGKVDVIGASTMKKALSAHGLVKPDVLSLS 540
Query: 597 -----SRGDVSRYMMRLTTNLQNANT--------VMCHLTILLTLPNHI----------- 656
+ GDVSR + L +T C + L + +
Sbjct: 541 NPVLENLGDVSRKHHEIKDQLAGDSTKSYLDETNESCSQGMQNALDDQLRGAKDCNEQSS 600
Query: 657 -----------LVKRTTWAHKRLGVGTCNSLKRSRTSKDFEHGRFEENVGNDSNGQECYA 716
++RT+ H+R+G+GTCNSLKRSRTS +F+HG FE+ V NDSNG E +
Sbjct: 601 EQVEVANSFNCKIERTSSDHQRVGIGTCNSLKRSRTSIEFDHGTFED-VSNDSNGWESCS 660
Query: 717 FDKDMNEGQDEASETHKMRLLGTEEPEGSDVDHCISNEY--------------SMHIDDE 776
FD DE ET KMR+L T + E SDVDH ISNE SMHIDDE
Sbjct: 661 FD------LDEDLETRKMRILRTRKAEPSDVDHFISNECNNTMQTATELSNSDSMHIDDE 720
Query: 777 KPTVPMSNNTS----GFSLTSQHDYASKNDILAEGSNGVSDACLDSGPVVRN-------- 836
PT P+ NN S G SL AS+N++L + SN + + +DS + +
Sbjct: 721 NPTGPILNNKSLQGNGCSL------ASQNEVLTKESNDICENKVDSDYELSSIPHSTADA 780
Query: 837 CNNHTDKMVDIRSDIEQNDTSLETERNQDGHGDVAIAHNADAAETFLITRDLCSSDTENQ 896
CN +TD++VDI SD++QN TSLETER Q+ GDV IA NAD AET LI+RD SSDTENQ
Sbjct: 781 CNKYTDEIVDIVSDMQQNHTSLETERYQNDRGDVEIACNADVAETLLISRDFHSSDTENQ 840
Query: 897 LYEAQMCIGSEHVQNEDKQNSHVPSSVNEFNRLPEFAELQNCCIDLVEVSQNNNQGKCSN 956
LYEA + I SEHVQNEDKQNS V SSVN+ CCIDLV+VS NNQGKCS
Sbjct: 841 LYEAHIRIESEHVQNEDKQNSFVQSSVND------------CCIDLVDVSPKNNQGKCSI 900
Query: 957 DDLLHTSNSE-------------------------------GYKEET----ITDSLIASS 1016
D LLH S+SE +EET D+++ S
Sbjct: 901 DHLLHRSSSEEIITGSVDNSDVCTSEGLSNCNQMASPKDNSSIQEETHDTRTGDNILESQ 960
Query: 1017 DCLRRLGISKCSTAISSECEK-VRSGEGAYETMSIEITSEARTTCND------------- 1076
+ + S CSTA SSE +K + SGEG ETMS +ITSE+R TCND
Sbjct: 961 EIEASISSSSCSTAKSSEYDKIILSGEGTSETMS-KITSESRMTCNDQSFCSPTKDLGSS 1020
Query: 1077 -------------------------------------------------LAFCWLTKELD 1086
+FC TK+L
Sbjct: 1021 IPIDDTLSSENVQQYVGTKELDNHKSPEMNEMCSTYNDNAQSEATTCNVSSFCSPTKDLG 1080
BLAST of Sgr023386 vs. ExPASy TrEMBL
Match:
A0A6J1JGY6 (uncharacterized protein LOC111484416 isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111484416 PE=4 SV=1)
HSP 1 Score: 1100.9 bits (2846), Expect = 0.0e+00
Identity = 694/1209 (57.40%), Postives = 796/1209 (65.84%), Query Frame = 0
Query: 57 MESDISLIEVAGEDDSLLHPIPENELLNLDKDTDGVTAGNSGFFLCSPLLTDRSSGTVAG 116
MES+ISLIEVAGEDDSLL IPE++LL+L+++ DG TA NSGFFLCSPLLT+RS+GT++
Sbjct: 1 MESNISLIEVAGEDDSLLQQIPEDDLLDLEREMDGATARNSGFFLCSPLLTNRSNGTISS 60
Query: 117 SSAASSTDIVCRCSYIDKENVNANNLEVPKLSIMPQQMKRKKKGGGYNLRKSLAWNKAFF 176
SS AS+ D + DKEN+NAN++E P LSIMPQQMKRKKK GGYNLRKSLAWNKAFF
Sbjct: 61 SSTASAAD------HTDKENINANDIEGPNLSIMPQQMKRKKKAGGYNLRKSLAWNKAFF 120
Query: 177 TEEGVLDSVELSMITGRTSTSCGEALAAIDEELPAMPGGGCIDDISSHDKFFKDTSTSTP 236
TEEGVLDS+ELSMITG T+T L A+DEE+PA+P GG +D+S DK FK+TSTSTP
Sbjct: 121 TEEGVLDSMELSMITGSTNT-----LGAVDEEIPAVPSGGSYNDLSFQDKLFKNTSTSTP 180
Query: 237 GGDRKNGRYLLPKRGSSTKDNVKSKELSTKDVNRSGSKRGSCPRPVASS----PTTSNGT 296
GDRKNGR LLPKRGSSTKDNVK KELS KD+NRSGSKRGSCPRPVASS PTTSN T
Sbjct: 181 IGDRKNGRCLLPKRGSSTKDNVKLKELSAKDLNRSGSKRGSCPRPVASSSVKRPTTSNAT 240
Query: 297 KTVIKDERISRIPVSKRDPTVTSRTTR-STTIRASDAKSNQVAQRG------VGNNPKQT 356
KT+ K+ER SRIPV KRD TV R R + TIRASDAKSNQVAQRG V +NPK T
Sbjct: 241 KTMNKEERTSRIPVPKRDSTVIPRAPRNAATIRASDAKSNQVAQRGHQVAQRVVSNPKMT 300
Query: 357 TLKSSSINAKSALNKDVNASKTLKAKTSIQQPRRKLASPVLKMNSSHLHNESTDSNEGLK 416
T SS+NA ALNKDVNASK+LKAK SIQQP KLA+PVLK+NSS +ES D NEGLK
Sbjct: 301 TPNGSSMNAIRALNKDVNASKSLKAKNSIQQP-GKLANPVLKVNSSRCQHESVDPNEGLK 360
Query: 417 AATNSLISKVLPLNDDGTRKVSASVTQNALPDGGRMLNQTQMPKPSGLRMPSPSMGFFGQ 476
A NSLISK LP NDDGT+KVSAS+TQNA +G MLN TQMPKPSGLRMPSPSMGFFGQ
Sbjct: 361 ARENSLISKPLPSNDDGTKKVSASITQNAPHNGRSMLNPTQMPKPSGLRMPSPSMGFFGQ 420
Query: 477 KKVSSFQSLPPATSEMHSLSKSNLPNLRTAGPSNPICQLATTLVPRNFVKASSREASGET 536
KKVSSFQS+PP TSE+H LSKSN+PN+R GPSNPICQLA TLVPRN VKA+ EASGET
Sbjct: 421 KKVSSFQSVPPDTSELHDLSKSNIPNVRIRGPSNPICQLA-TLVPRNVVKANHGEASGET 480
Query: 537 NVVSCLSSGSSVEPISHDRAKSVLKAAEIHLGKVNVGGASTTNKDLSAHGLEKPDVLSLS 596
VVSCLSSGSS+EP+SHDRAKS LK A IHLGKV+V GAST K LSAHGL KPDVLSLS
Sbjct: 481 KVVSCLSSGSSIEPLSHDRAKSALKVANIHLGKVDVIGASTMKKALSAHGLVKPDVLSLS 540
Query: 597 -----SRGDVSRYMMRLTTNLQNANT--------VMCHLTILLTLPNHI----------- 656
+ GDVSR + L +T C + L + +
Sbjct: 541 NPVLENLGDVSRKHHEIKDQLAGDSTKSYLDETNESCSQGMQNALDDQLRGAKDCNEQSS 600
Query: 657 -----------LVKRTTWAHKRLGVGTCNSLKRSRTSKDFEHGRFEENVGNDSNGQECYA 716
++RT+ H+R+G+GTCNSLKRSRTS +F+HG FE+ V NDSNG E +
Sbjct: 601 EQVEVANSFNCKIERTSSDHQRVGIGTCNSLKRSRTSIEFDHGTFED-VSNDSNGWESCS 660
Query: 717 FDKDMNEGQDEASETHKMRLLGTEEPEGSDVDHCISNEY--------------SMHIDDE 776
FD DE ET KMR+L T + E SDVDH ISNE SMHIDDE
Sbjct: 661 FD------LDEDLETRKMRILRTRKAEPSDVDHFISNECNNTMQTATELSNSDSMHIDDE 720
Query: 777 KPTVPMSNNTS----GFSLTSQHDYASKNDILAEGSNGVSDACLDSGPVVRN-------- 836
PT P+ NN S G SL AS+N++L + SN + + +DS + +
Sbjct: 721 NPTGPILNNKSLQGNGCSL------ASQNEVLTKESNDICENKVDSDYELSSIPHSTADA 780
Query: 837 CNNHTDKMVDIRSDIEQNDTSLETERNQDGHGDVAIAHNADAAETFLITRDLCSSDTENQ 896
CN +TD++VDI SD++QN TSLETER Q+ GDV IA NAD AET LI+RD SSDTENQ
Sbjct: 781 CNKYTDEIVDIVSDMQQNHTSLETERYQNDRGDVEIACNADVAETLLISRDFHSSDTENQ 840
Query: 897 LYEAQMCIGSEHVQNEDKQNSHVPSSVNEFNRLPEFAELQNCCIDLVEVSQNNNQGKCSN 956
LYEA + I SEHVQNEDKQNS V SSVN+ CCIDLV+VS NNQGKCS
Sbjct: 841 LYEAHIRIESEHVQNEDKQNSFVQSSVND------------CCIDLVDVSPKNNQGKCSI 900
Query: 957 DDLLHTSNSE-------------------------------GYKEET----ITDSLIASS 1016
D LLH S+SE +EET D+++ S
Sbjct: 901 DHLLHRSSSEEIITGSVDNSDVCTSEGLSNCNQMASPKDNSSIQEETHDTRTGDNILESQ 960
Query: 1017 DCLRRLGISKCSTAISSECEK-VRSGEGAYETMSIEITSEARTTCND------------- 1076
+ + S CSTA SSE +K + SGEG ETMS +ITSE+R TCND
Sbjct: 961 EIEASISSSSCSTAKSSEYDKIILSGEGTSETMS-KITSESRMTCNDQSFCSPTKDLGSS 1020
Query: 1077 -------------------------------------------------LAFCWLTKELD 1086
+FC TK+L
Sbjct: 1021 IPIDDTLSSENVQQYVGTKELDNHKSPEMNEMCSTYNDNAQSEATTCNVSSFCSPTKDLG 1080
BLAST of Sgr023386 vs. ExPASy TrEMBL
Match:
A0A6J1FRC2 (uncharacterized protein LOC111448106 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111448106 PE=4 SV=1)
HSP 1 Score: 1089.3 bits (2816), Expect = 0.0e+00
Identity = 703/1265 (55.57%), Postives = 793/1265 (62.69%), Query Frame = 0
Query: 57 MESDISLIEVAGEDDSLLHPIPENELLNLDKDTDGVTAGNSGFFLCSPLLTDRSSGTVAG 116
MES+ISLIEVAGEDDSLL IPE+ LL+L+++ DG TA NSGFFLCSPLLT+RS G ++
Sbjct: 1 MESNISLIEVAGEDDSLLQQIPEDGLLDLEREMDGATARNSGFFLCSPLLTNRSIGAISS 60
Query: 117 SSAASSTDIVCRCSYIDKENVNANNLEVPKLSIMPQQMKRKKKGGGYNLRKSLAWNKAFF 176
SS AS+ D + DKEN+NAN++E P LSIMPQQMKRKKK GGYNLRKSLAWNKAFF
Sbjct: 61 SSTASAAD------HTDKENINANDIEGPNLSIMPQQMKRKKKAGGYNLRKSLAWNKAFF 120
Query: 177 TEEGVLDSVELSMITGRTSTSCGEALAAIDEELPAMPGGGCIDDISSHDKFFKDTSTSTP 236
TEEGVLDS+ELSMITG T+T L AIDEE+PAMPGG +D+S DK FKDTST TP
Sbjct: 121 TEEGVLDSMELSMITGSTNT-----LGAIDEEIPAMPGGSSYNDLSFQDKLFKDTSTGTP 180
Query: 237 GGDRKNGRYLLPKRGSSTKDNVKSKELSTKDVNRSGSKRGSCPRPVASS----PTTSNGT 296
GDRKNGR LLPKRGSSTKDNVK KELS KD+NRSGSKRGSCPRPVASS PTTSN T
Sbjct: 181 IGDRKNGRCLLPKRGSSTKDNVKLKELSAKDLNRSGSKRGSCPRPVASSSVKRPTTSNAT 240
Query: 297 KTVIKDERISRIPVSKRDPTVTSRTTR-STTIRASDAKSNQVAQRGVGNNPKQTTLKSSS 356
KT+ K+ER SRIPV KRD TV R R + TIRASDAKSNQVAQR V +NPK +T KSSS
Sbjct: 241 KTMNKEERTSRIPVPKRDSTVIPRAPRNAATIRASDAKSNQVAQR-VVSNPKMSTSKSSS 300
Query: 357 INAKSALNKDVNASKTLKAKTSIQQPRRKLASPVLKMNSSHLHNESTDSNEGLKAATNSL 416
+N+ ALNKDVNASK+LKAK SIQQP KLA+PVLK+NSS +ES D NEGLKA NSL
Sbjct: 301 MNSLRALNKDVNASKSLKAKNSIQQP-GKLANPVLKVNSSRCQHESVDPNEGLKAGANSL 360
Query: 417 ISKVLPLNDDGTRKVSASVTQNALPDGGRMLNQTQMPKPSGLRMPSPSMGFFGQKKVSSF 476
ISK LP NDDGT+K SAS+TQNA PDG MLN TQMPKPSGLRMPSPSMGFFGQKKVSSF
Sbjct: 361 ISKPLPSNDDGTKKASASITQNAPPDGRSMLNPTQMPKPSGLRMPSPSMGFFGQKKVSSF 420
Query: 477 QSLPPATSEMHSLSKSNLPNLRTAGPSNPICQLATTLVPRNFVKASSREASGETNVVSCL 536
QS+PP TSE+H LSKS++PN+R GPSNPICQLA TLVPRN VKA+ EASGET VVSCL
Sbjct: 421 QSVPPDTSELHDLSKSSIPNVRIRGPSNPICQLA-TLVPRNVVKANHGEASGETKVVSCL 480
Query: 537 SSGSSVEPISHDRAKSVLKAAEIHLGKVNVGGASTTNKDLSAHGLEKPDVLSLSS----- 596
SSGSS+EP+SH RAKS LK A IHLGKV+V GAST NK LSAHGL KPDVLSLS+
Sbjct: 481 SSGSSIEPVSHYRAKSALKLANIHLGKVDVIGASTMNKALSAHGLVKPDVLSLSNPVLEH 540
Query: 597 RGDVSRYMMRLTTNLQNANT--------VMCHLTILLTLPNHI----------------- 656
GDVSR + L +T C + L + +
Sbjct: 541 LGDVSRTHHEIKDQLAGDSTKSYLDETNECCSQGMQNALDDQLRGAKDCNEQSSEQVEVA 600
Query: 657 -----LVKRTTWAHKRLGVGTCNSLKRSRTSKDFEHGRFEENVGNDSNGQECYAFDKDMN 716
++RT+ H+R+G+GTCNSLKRSRTS +F+HG FE+ V NDSNG E +FD
Sbjct: 601 NSSYCKIERTSSDHQRVGIGTCNSLKRSRTSIEFDHGIFED-VSNDSNGWESCSFD---- 660
Query: 717 EGQDEASETHKMRLLGTEEPEGSDVDHCISNEY--------------SMHIDDEKPTVPM 776
QDE ET KMR+L T + E SDVDH ISNE SMHIDDE PT P+
Sbjct: 661 --QDEDLETRKMRILRTRKAEPSDVDHFISNECNNTMQTATELSNSDSMHIDDENPTGPI 720
Query: 777 SNNTS----GFSLTSQHDYASK--NDILAEGSNGVSDACLDSGPVVRNCNNHTDKMVDIR 836
NN S G+SL SQ+++ +K NDI + D CNN TD+MVDI
Sbjct: 721 LNNQSLQGNGYSLASQNEFLTKESNDICENKVDSDYDLSSIPHSTADACNNCTDEMVDIV 780
Query: 837 SDIEQNDTSLETERNQDGHGDVAIAHNADAAETFLITRDLCSSDTENQLYEAQMCIGSEH 896
SD++QN TS ETER Q+ GDV IA NAD AET LI+RD SSDTENQLYEA + I SEH
Sbjct: 781 SDMQQNHTSFETERYQNDRGDVEIACNADVAETLLISRDFHSSDTENQLYEAHIRIESEH 840
Query: 897 VQNEDKQNSHVPSSVNEFNRLPEFAELQNCCIDLVEVSQNNNQGKCSNDDLLHTSNSE-- 956
VQNEDKQNS V SSVN+ CCIDLV+VS NNQGKCS DDLLH SNSE
Sbjct: 841 VQNEDKQNSLVQSSVND------------CCIDLVDVSPKNNQGKCSIDDLLHRSNSEEI 900
Query: 957 ---------------------------GYKEET----ITDSLIASSDCLRRLGISKCSTA 1016
EET D+++ S + L S+CSTA
Sbjct: 901 ISVDNYDVCTSEGLSNCNQMASPKDNNSIHEETHDTRTGDNILESRELEASLRSSRCSTA 960
Query: 1017 ISSECEKVRSGEGAYETMSIEITSEARTTCNDLAFC------------------------ 1076
SSE +K SGEG ETMS +ITSE+R TCND +FC
Sbjct: 961 KSSEDDK--SGEGTSETMS-KITSESRMTCNDQSFCSPTKDLGSSIPNDDKLSSANVQQY 1020
Query: 1077 --------------------------------------WLTKELDSSVPNDHILSRENVQ 1086
TK+L SSVPND ILSRENVQ
Sbjct: 1021 AGTKELENHKSPEMNEMCSTYNDNAQSEATTCNDSSFRSPTKDLGSSVPNDDILSRENVQ 1080
BLAST of Sgr023386 vs. ExPASy TrEMBL
Match:
A0A6J1FTV2 (uncharacterized protein LOC111448106 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111448106 PE=4 SV=1)
HSP 1 Score: 1088.9 bits (2815), Expect = 0.0e+00
Identity = 704/1273 (55.30%), Postives = 794/1273 (62.37%), Query Frame = 0
Query: 57 MESDISLIEVAGEDDSLLHPIPENELLNLDKDTDGVTAGNSGFFLCSPLLTDRSSGTVAG 116
MES+ISLIEVAGEDDSLL IPE+ LL+L+++ DG TA NSGFFLCSPLLT+RS G ++
Sbjct: 1 MESNISLIEVAGEDDSLLQQIPEDGLLDLEREMDGATARNSGFFLCSPLLTNRSIGAISS 60
Query: 117 SSAASSTDIVCRCSYIDKENVNANNLEVPKLSIMPQQMKRKKKGGGYNLRKSLAWNKAFF 176
SS AS+ D + DKEN+NAN++E P LSIMPQQMKRKKK GGYNLRKSLAWNKAFF
Sbjct: 61 SSTASAAD------HTDKENINANDIEGPNLSIMPQQMKRKKKAGGYNLRKSLAWNKAFF 120
Query: 177 TEEGVLDSVELSMITGRTSTSCGEALAAIDEELPAMPGGGCIDDISSHDKFFKDTSTSTP 236
TEEGVLDS+ELSMITG T+T L AIDEE+PAMPGG +D+S DK FKDTST TP
Sbjct: 121 TEEGVLDSMELSMITGSTNT-----LGAIDEEIPAMPGGSSYNDLSFQDKLFKDTSTGTP 180
Query: 237 GGDRKNGRYLLPKRGSSTKDNVKSKELSTKDVNRSGSKRGSCPRPVASS----PTTSNGT 296
GDRKNGR LLPKRGSSTKDNVK KELS KD+NRSGSKRGSCPRPVASS PTTSN T
Sbjct: 181 IGDRKNGRCLLPKRGSSTKDNVKLKELSAKDLNRSGSKRGSCPRPVASSSVKRPTTSNAT 240
Query: 297 KTVIKDERISRIPVSKRDPTVTSRTTR-STTIRASDAKSNQVAQRG--------VGNNPK 356
KT+ K+ER SRIPV KRD TV R R + TIRASDAKSNQVAQRG V +NPK
Sbjct: 241 KTMNKEERTSRIPVPKRDSTVIPRAPRNAATIRASDAKSNQVAQRGNHCKIAQTVVSNPK 300
Query: 357 QTTLKSSSINAKSALNKDVNASKTLKAKTSIQQPRRKLASPVLKMNSSHLHNESTDSNEG 416
+T KSSS+N+ ALNKDVNASK+LKAK SIQQP KLA+PVLK+NSS +ES D NEG
Sbjct: 301 MSTSKSSSMNSLRALNKDVNASKSLKAKNSIQQP-GKLANPVLKVNSSRCQHESVDPNEG 360
Query: 417 LKAATNSLISKVLPLNDDGTRKVSASVTQNALPDGGRMLNQTQMPKPSGLRMPSPSMGFF 476
LKA NSLISK LP NDDGT+K SAS+TQNA PDG MLN TQMPKPSGLRMPSPSMGFF
Sbjct: 361 LKAGANSLISKPLPSNDDGTKKASASITQNAPPDGRSMLNPTQMPKPSGLRMPSPSMGFF 420
Query: 477 GQKKVSSFQSLPPATSEMHSLSKSNLPNLRTAGPSNPICQLATTLVPRNFVKASSREASG 536
GQKKVSSFQS+PP TSE+H LSKS++PN+R GPSNPICQLA TLVPRN VKA+ EASG
Sbjct: 421 GQKKVSSFQSVPPDTSELHDLSKSSIPNVRIRGPSNPICQLA-TLVPRNVVKANHGEASG 480
Query: 537 ETNVVSCLSSGSSVEPISHDRAKSVLKAAEIHLGKVNVGGASTTNKDLSAHGLEKPDVLS 596
ET VVSCLSSGSS+EP+SH RAKS LK A IHLGKV+V GAST NK LSAHGL KPDVLS
Sbjct: 481 ETKVVSCLSSGSSIEPVSHYRAKSALKLANIHLGKVDVIGASTMNKALSAHGLVKPDVLS 540
Query: 597 LSS-----RGDVSRYMMRLTTNLQNANT--------VMCHLTILLTLPNHI--------- 656
LS+ GDVSR + L +T C + L + +
Sbjct: 541 LSNPVLEHLGDVSRTHHEIKDQLAGDSTKSYLDETNECCSQGMQNALDDQLRGAKDCNEQ 600
Query: 657 -------------LVKRTTWAHKRLGVGTCNSLKRSRTSKDFEHGRFEENVGNDSNGQEC 716
++RT+ H+R+G+GTCNSLKRSRTS +F+HG FE+ V NDSNG E
Sbjct: 601 SSEQVEVANSSYCKIERTSSDHQRVGIGTCNSLKRSRTSIEFDHGIFED-VSNDSNGWES 660
Query: 717 YAFDKDMNEGQDEASETHKMRLLGTEEPEGSDVDHCISNEY--------------SMHID 776
+FD QDE ET KMR+L T + E SDVDH ISNE SMHID
Sbjct: 661 CSFD------QDEDLETRKMRILRTRKAEPSDVDHFISNECNNTMQTATELSNSDSMHID 720
Query: 777 DEKPTVPMSNNTS----GFSLTSQHDYASK--NDILAEGSNGVSDACLDSGPVVRNCNNH 836
DE PT P+ NN S G+SL SQ+++ +K NDI + D CNN
Sbjct: 721 DENPTGPILNNQSLQGNGYSLASQNEFLTKESNDICENKVDSDYDLSSIPHSTADACNNC 780
Query: 837 TDKMVDIRSDIEQNDTSLETERNQDGHGDVAIAHNADAAETFLITRDLCSSDTENQLYEA 896
TD+MVDI SD++QN TS ETER Q+ GDV IA NAD AET LI+RD SSDTENQLYEA
Sbjct: 781 TDEMVDIVSDMQQNHTSFETERYQNDRGDVEIACNADVAETLLISRDFHSSDTENQLYEA 840
Query: 897 QMCIGSEHVQNEDKQNSHVPSSVNEFNRLPEFAELQNCCIDLVEVSQNNNQGKCSNDDLL 956
+ I SEHVQNEDKQNS V SSVN+ CCIDLV+VS NNQGKCS DDLL
Sbjct: 841 HIRIESEHVQNEDKQNSLVQSSVND------------CCIDLVDVSPKNNQGKCSIDDLL 900
Query: 957 HTSNSE-----------------------------GYKEET----ITDSLIASSDCLRRL 1016
H SNSE EET D+++ S + L
Sbjct: 901 HRSNSEEIISVDNYDVCTSEGLSNCNQMASPKDNNSIHEETHDTRTGDNILESRELEASL 960
Query: 1017 GISKCSTAISSECEKVRSGEGAYETMSIEITSEARTTCNDLAFC---------------- 1076
S+CSTA SSE +K SGEG ETMS +ITSE+R TCND +FC
Sbjct: 961 RSSRCSTAKSSEDDK--SGEGTSETMS-KITSESRMTCNDQSFCSPTKDLGSSIPNDDKL 1020
Query: 1077 ----------------------------------------------WLTKELDSSVPNDH 1086
TK+L SSVPND
Sbjct: 1021 SSANVQQYAGTKELENHKSPEMNEMCSTYNDNAQSEATTCNDSSFRSPTKDLGSSVPNDD 1080
BLAST of Sgr023386 vs. TAIR 10
Match:
AT5G60150.1 (unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 122.9 bits (307), Expect = 1.8e-27
Identity = 133/448 (29.69%), Postives = 205/448 (45.76%), Query Frame = 0
Query: 60 DISLIEVAGEDDSLLHPIPENELL-NLDKDTDGVTAGNSGFFLCSPLLTDRSSGTV---- 119
D L++++GEDD +N LL N K T+ ++ + CSPL RSS V
Sbjct: 2 DKDLLDISGEDDE------DNWLLKNTPKKTN--SSRGKSYLKCSPLQIPRSSRIVPTRP 61
Query: 120 ----AGSSAASSTDIVCRCSYIDKENVNANN--LEVPKLSIMPQQMKRKKKGGGYNLRKS 179
G +S + C+ +D ++V N +E+PKLS+ QQMK+KKK G+NLRKS
Sbjct: 62 PFSPIGRVTGTSNNREQPCASVDTDSVGKENAKVELPKLSVERQQMKKKKKNAGFNLRKS 121
Query: 180 LAWNKAFFTEEGVLDSVELSMITGRTSTSCGEALAAIDEEL-PAMPGGGCIDDISS---- 239
LAW++AF TEEGVLDS ELS ITG G+ LAAI EE +M C ++S
Sbjct: 122 LAWDRAFSTEEGVLDSSELSKITGTACHLGGDRLAAIQEEYRESMSASKC--NVSPGLQA 181
Query: 240 -HDKFFKDTSTSTPGGDRKNGRYLLPKRGSSTKDNVKSKELSTKDVNRSGSKRGSCPRPV 299
+ F D ++ ++K ++P KELS
Sbjct: 182 LEENLFNDLPVNSKNREKKLVSGIMP------------KELS------------------ 241
Query: 300 ASSPTTSNGTKTVIKDERISRIPVSKRDPTVTSRTTRSTT---IRASDAKSNQV--AQRG 359
IS++P +K DP + TT I+A +++ Q+ +QR
Sbjct: 242 ------------------ISKVPTTKSDPVTVGNNMKRTTQSPIKAKNSQPTQLKNSQRS 301
Query: 360 VGNNPKQTTLKSSSINAKSALNKDVNASKTLKAKTSIQQPRRKLASPVLKMN--SSHLHN 419
+G+ S+ KS+L ASK+ K S++Q RR + S ++ S H+
Sbjct: 302 LGSESFSKNTSSTKSKTKSSL-----ASKSSIPKPSLKQARRNVISKSSEIPTVSYSQHS 361
Query: 420 ESTDSNEG-LKAATNSLISKVLPLNDDGTRKVSASVTQNALPDGGRMLNQ-TQMPKPSGL 479
SN G + A+ +++ + D + S+ Q++ G + +++ KPSGL
Sbjct: 362 VVAKSNVGPMTASDVAMLGHASVIPDSNVITLGTSLAQSSCNKAGSTQSAVSRLGKPSGL 386
Query: 480 RMPSPSMGFFGQKKVSSFQSLPPATSEM 482
R P PS+G+F Q QS S++
Sbjct: 422 RAPKPSIGYFSQSDSQPSQSAGDKHSQL 386
HSP 2 Score: 80.1 bits (196), Expect = 1.3e-14
Identity = 37/45 (82.22%), Postives = 42/45 (93.33%), Query Frame = 0
Query: 1041 NAVPFSEEWLAAIEAAGEEILTMKTGAVQNSPPDKSQPEPGPWSP 1086
NAVPFS+EWLAAIEAAGEEILT+K+G VQ+SP +KS PEPGPWSP
Sbjct: 1125 NAVPFSDEWLAAIEAAGEEILTLKSGRVQHSPTEKSAPEPGPWSP 1169
BLAST of Sgr023386 vs. TAIR 10
Match:
AT3G53320.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G37070.1); Has 11044 Blast hits to 5993 proteins in 551 species: Archae - 8; Bacteria - 1486; Metazoa - 4078; Fungi - 1814; Plants - 348; Viruses - 112; Other Eukaryotes - 3198 (source: NCBI BLink). )
HSP 1 Score: 45.1 bits (105), Expect = 4.7e-04
Identity = 127/516 (24.61%), Postives = 202/516 (39.15%), Query Frame = 0
Query: 61 ISLIEVAGEDDSLLHPIPENELLNLDKDTDGVTAGNSGFFLCSPLLTDRSSGTVAGSSAA 120
+ LI+VA EDDSLL +E DKD + F+ D
Sbjct: 16 LGLIDVAVEDDSLLF----SEFSETDKDDKCLKEDKDLNFMRDTQYCD------------ 75
Query: 121 SSTDIVCRCSYIDKENVNANNLEVPKLSIMPQQMKRKKKGGGYNLRKSLAWNKAFFTEEG 180
D + S +KE V P S P+++ +K G YNLRKSLAW+ FFT G
Sbjct: 76 ---DEILASSVEEKEEVLQ-----PHESPEPEKVMKK---GKYNLRKSLAWDNEFFTSAG 135
Query: 181 VLDSVELSMITGRTSTSCGEALAAIDEEL------------------------------- 240
VL+ ELS + S +AL I E++
Sbjct: 136 VLEPEELSSMMESNHKSGKKALPTILEDINRSTESISTFQSDCTVENSQEFVLFEDVRAS 195
Query: 241 ---------PAMPGGGCIDDISSHDKFFKDTS-----TSTPGGDRKNGRYLLPKR----G 300
A PG + + + D TS T+T G + G P R G
Sbjct: 196 IQRSAKTSDVATPGKSNV--LRATDVAISPTSSTVDVTATQGKTKSKGSPRNPSRVQGPG 255
Query: 301 SSTKDNVKSKELSTKDVNRSGSKRG-SCPRPVASSPTTSNGTK-TVIKDERISRIPVSKR 360
+TK V ++ LST + S G S RP++++ T + + + E+ S++P K
Sbjct: 256 KATKQPVATRGLST---SISKPPNGLSKVRPLSTTSTNRSSLDISKTQQEKNSKLPAGKE 315
Query: 361 --DPTVT-SRTTR----------STTIRASDAKSNQV------------AQRGVGNNPKQ 420
P ++ SR + ++ R+SDA N++ A + P
Sbjct: 316 PLGPRISMSRRAKPVLPKPGVPFKSSSRSSDASKNEMTSSCSSLESCASASSSASHKPSI 375
Query: 421 TTLK-----SSSINAKSALNKDVNASKTLKAKTSIQQPRR----KLASPVLKMNS-SHLH 474
++K SS ++++ N+ + + + QQ + KL+S V S S
Sbjct: 376 DSIKKKNDSSSRLSSQPLANRSTSRGIMGQPRIPPQQTNKTSKPKLSSSVPTAGSISDYS 435
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022146221.1 | 0.0e+00 | 61.24 | uncharacterized protein LOC111015485 [Momordica charantia] | [more] |
XP_022986753.1 | 0.0e+00 | 57.36 | uncharacterized protein LOC111484416 isoform X3 [Cucurbita maxima] | [more] |
XP_022986754.1 | 0.0e+00 | 57.40 | uncharacterized protein LOC111484416 isoform X4 [Cucurbita maxima] | [more] |
XP_022943291.1 | 0.0e+00 | 55.57 | uncharacterized protein LOC111448106 isoform X3 [Cucurbita moschata] | [more] |
XP_022943290.1 | 0.0e+00 | 55.35 | uncharacterized protein LOC111448106 isoform X2 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1CYM8 | 0.0e+00 | 61.24 | uncharacterized protein LOC111015485 OS=Momordica charantia OX=3673 GN=LOC111015... | [more] |
A0A6J1JHG3 | 0.0e+00 | 57.36 | uncharacterized protein LOC111484416 isoform X3 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1JGY6 | 0.0e+00 | 57.40 | uncharacterized protein LOC111484416 isoform X4 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1FRC2 | 0.0e+00 | 55.57 | uncharacterized protein LOC111448106 isoform X3 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1FTV2 | 0.0e+00 | 55.30 | uncharacterized protein LOC111448106 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |
AT5G60150.1 | 1.8e-27 | 29.69 | unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0... | [more] |
AT3G53320.1 | 4.7e-04 | 24.61 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |