Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGAGTGCACCAGATGTATGCCCAACCGAGGATGCCATACATGCATTATTAGATTATTTAGTTGAACCTATGCTTCCTGCAAAGTCATCTTCGAGAGACAATCCACCACAATCTCTACTGCAATCAGTTGCAAAACAGGTACTTCTAAATTTGACTTTCGTGGGTTTAATATTTTTATTCCATTCACTTTCAAGCATGATATTGTTCGAACAATCATAATCAGACTCATAATTTTTTGGTTTTGGACTGGATGTCTGAAGTTGAATTACTGGTAACATAAAAGTTATACAATCAGCAATCCGTGTTTTAATATCCTTCATCTTTTGAAAGTACCAACAGTTTATAAGGATACTATTGCAGTATTACTTATGTGAAAAATGTAATTTCTCCAGCGCTCTCTCTCCCTCAGCTATCCATGTGCTTCAAAAGTACACGAGTTTTATAAGCATAAAATTACTTCTGTAAAAAATATAATCTTCTTCAGAAAACTCTCTCTCTCACACTCGCACACACAAAGGAAGAAACAAATCACTGATAGAAAGAAGGCACATAGAGGTAGGAGGGAGAAGGAGGACTATGTATCCAGCTCTACTCCTTGATAAGGGGAATTACAAGAAATGTCCATCCAGAAAACGTCATCTGAGAAATTGGTTCATTCATTTTCCAGTATTTGAAACATCCGCAAATAAAAGGTAATTCATGTAGTCCAAAAATACCTCGACACTTGTGCTGCTGCTGCTAGTTTATGATTTACTTTTATCTGTCTCATTTAGACTTTTTAATTTAGCACCATTGTTCTTGCAGAAGTTAGGATCCTCCCTTTTGGATAACTAGCAGCATAAATTGACTACTGCTAATCATATGATTAAGATAGAATAGGTTTCGAAGTAGGGTAGTTTTAACTGCATGTGGCAGCTTATTACTCACCACTCCTGCCCAAAAGAAGAAAATTGTTAAGAGAGAGAATAAAAGAGTCATCTAACTTTGTGTTTTGCCCTTGAATCTACTCTGTAAGGTGCATGCCGCTGTTATATTGTACAACTACTACCATCGGAAACAACATCCACACCTTGAATTTTTGAGTTTTGAGGCATTTTGCAAGTTGGCTGTGGTCATAAAACCAGCTTTGTTGTCTCACATGAAACTCATGCAAAGATCAGATGATACTGAATTGGAAAACCCTGAGAAGCAGCTTTCTCCAGCAGAAAAAGCAATTATGGATGCTTGTGATATAGCCACTTGTCTAGAAGCATCAAAAGATGAAAACGTAGAGGGCTGGTCTCGTTCCAAGGTTGCTGTTCTTTTAATTGACTCCAAGAAGGAGCATTGCCATTTGCTATTTAGTTTCATCACTCAAGGAGTTTGGTCTGTGATTGAACAAGATTTGGATACCTCTGAATGCCAACCAGAAACGGTGGAAGAGGAAAAACATGTAAACAAAAAGAAAAGAGTGATTAAGAAACCTTCAAAAGAGGGGCTAGTTGTTGATGAAGCTAAGACACAGCAGCTTGCATATTCAGCAGTTAAGGAAGCAACTGGTGTGTGTCTTACTGCATTAACTCATTAGATAGTTCAATAGTTATAACTTGTTCTTATGGTAAAATGGTTATATGTATGCCAAGGAGAATTGCTTTTATTGCCTACTTTTCTTTTATTCCATTTGAGATGCACCCCGTATTGGTTTTTGATTCGAAGGTACCTTTTAGAATACTAATGATTTATTGCGGAGATAAATATCATATATATCTCTTCTTTCCCAATTAAGCATCATGTCTGGTTACTGGTTAGGGAATGAAGATACTGATCATCCCTATTCCAATTTTTTTTTTGTTTTTCTGAATTAGATTATTATCTAATTATGATATTGGTTGGATGGTCTGTTACTGCCTCATCATATTGGTTATGTGTTTGGGTACCCTTTCTATTCTCTTTTGTAAGTATCTCAAGAAAAGGGGAAGAACTTGGTTCTATCATGACATCGTTTGGAGTACATGTACAAA
AATAAACAAGACTTATTAAGAGGAACATAACATTTAGTTTAGGGTATTTATATGTAAAGAATGCCTTCTTCAATGATGAGTTAGAAGAAGAGGCTTATATGAGATTTCCCTTGGGTTGAATGACAATTTGGGGAGGATGAGGCATACGAAGTTAAAAAAATCTGAATGAGTTTTAATTACCATCATTGTCCTTTCAAATAAAATTAAGTGTATTTAAAGGCAATCCATTGGGCAAGATGATCTTAATGAACATGGAAGTAACAGTGAAACAGTAAATATGGCTTTACTAACTGCAACGGTGGGGAACTAAATAAATCTCAGTAACTAATTACAAGATCAGAACCATATTTACTAAGATAAAATGGCTAAAGGAAGGTAATAAAAATTCATGTTTTTTCCATAAAATGCTAGCAGGTAGATGAAGAAAATTGTTTATTTCAGAATTAATTTCAGCTCAAGGTGAAAGTTTAGTGGAATATAAAGAGATAGAAAAGGAGATTTTGAGCTTCTTTGAATCCTTATATTTGAAAGATGAGGTATATTCCTGTTGGCTTAAGCTGGATCCCATCAGTGGATCAAAGGTGGTTGGAAAGGCCGTTTGAGGAAGAAGAGGTGTGTGCAGCTATTAAGAGCTTGGGTTAGGTGAAATATCGGGGTTGGAATGGTTTTTCTATTGAATTTTTCTGTAAATTCTGGAAATTATTAAAAGCAGATTTGATGGAGGTCTTCAATGAATTTTTTGAAATGGTCATTTGAATGCTTGCATCAAGGAGAACTTCATTTGTTAAATTCAGAAGTAGGAAAGAGCCAGACCTATAAAGGATTTCGGACCTATTAGCTTGGTAACAAGTGTGTATAAGATTCTATCTAAGGTGCTGGGAGAGAGATTGAAGCAAAATTTATCCTCTACCATTTCTTCTAGTCAAAATGCTTTTATTAAGGGGAGACGAATCTTAGATCCAGTGCTGGTGGCAAATGAAGCTATGGAGGACTATATAAAAAGAAAGAAAAAAGCATGGCTATTTAAAATTAGATCTGGATAAGCCAAATGATTGAGTGGATTGGGAGTTTTCAATAGAAATTCTCAAGAAGAAAGGGTTTGGTGAATGATGGATTAAGTGGATTAGGGGGTGTATTATTGATACAAAATTCTCAATATTTCTTAATGGAAGGCCGTGAGGAAGAATCCTAGCAACTAGAGGATTGAGGCAAGGGAATCTATTTTTGTTGGTGGGAGATGTGTTGAGCAGATCGTTAGAAAAAAGGAGTTGCTCATGGGGATTTTGAAGTTTTTTGGTTGGTAAGGATGCTGTACAGATTTCTTGCTTACAGTTTGCGGAAGATACAGTTATTTTTTGTAAGGATGATGATCATATGGTGTTGCATTTATCAAAGTTTTTGAGATTTCTTCTGGTTTAAGGTGAATTGGGAAAAGTCTACAGTTAGTGGAATGAATATATGTTCATCCAAAGTGCTAAATTTAGCGCACAAGTTGGGATGTATCTCCGAAAGCCTTCCTTTTCAGTATTAGGGGTTGCTGTTGGGTGGAAATCCAAAAAATATGCAATTTGGGTAGCTACTTTGGAGGTGTTGAATAAAATTAGTAGATGGAAGTGTTTTCTCTTATCTAGAGGCCGAAGAGTCACATTGAGTAACGCTGTTCTTAACAACCTTCCTACATATTATATGTCTTTATTCTCAATGCCAAGAAAGGTTCTTTTGGATATTGAGAGGCATATTAGGGATTTTTTTGGGGAGGGAAAGGAGGAGGAGAACATTTCTCATTGGGTTAGATGGAATAAGGTTGTCTTACCCATAGAAAAAGGTGGGTTGGGTATAGGGAATTTAAAAAGGAAGAATGAGGCTTTGTTGATGAAATGACTATGGACATTTGCAAATGAACCAAATGAGTTGTGGCATAAAGTAGTGGCTAGTGTGTATGGGGATGGACAGAATGGCTGGTTCACTAAAGAAAAGAAAGTTGGTAGTCCCAAAAGTCC
TTGGTGGAATATTCTAAAACTAAAAGCTTTTTTTGAATCCTATTATTCGATTATGATCGGTAATGGGGTTAGAACTTCTTTTGGAAGGACCAATGGCTGAATTCTCATCAATTATGTAATTCTTTCTTGAAGGTGTTTGAGCTTGCATGAATAAAGAAGCTAGGGTTAGTGAAGTTTGAGATTCTTCTTCCAATTGTTGGGTGGTGGAGGTTAGAAGAAATTTGAAGGAAGATGAGATGCTGGAATATTGTAACCTGATGAAATATTGTGTTTCTCCTTTGTCTAAAGGAATTGTATCTTCGTTGTGGAAGGTAAAAAGTCCTAAAAAAGTGAATATTTTGCTGTGGTTGGTACTTCTTGGTAGTTTGAATACGGCAGAAAAATTGCAGAAGGAATGTCCTCAATGGTGTTTGAGTCCTAGTATGTGTGTTCTTTGCAAGAAGGCGGGTAGAAGATCTCAATCATGTGTTATATAGCAGAAAATTGTGAACAGCATTGCTTCAGGACATTGTTTTATCTTGGGTTTTCACTTTAAAGCAGGAAATAATTTGCTCTCCCTCTTGTATGGTAGTAAATTCGGTAGACAAGCTAAGATAATGTGGACTAATTCAGTTAAAGCCTTGATATGGTGTTTGTGGTTTGAAAGAAGGTCTAGAATCTTTGAAGGCAAAAGTCTCATTTGGGAAGAAAGGATGGACAATAAAGGGATGCGGCTACTTGGTGTGTTGTTTCTAAGTTTTTTTAGTAATAGTTCCCCTTTTGATGTATACTCGTGCTAGAATTCTTTTTTATGTTCTTCAGATTGATATGAGTATGGTCCAGTTTGGGTGTCTTTATGGTTGTGATTTTTTAATCTTTGAGAGTTTCCCTTGTTTATTACGTGTTACTATAATTGTAATTTTTCAATTTATCAATGAGAAATTCTGTCTCTTTGTCAAAAAAAAAAAAGTTCGTTCAGTAAAATATATCATTGTGTTTTTTTAAAAATTTTCAATTATTCAAAAGCCCCCAGAACTGTTCTTGGAATTTGTGTAGTTATTACATGTCTTCCTTATGCAGCAATGGAGTGGAAGCAAAATTCATTAAGGCTATCCTTTTTGCTTATCTTTGGTACTTATGGATACTTGCTAATCCTATTTATTTCATGCTAAACAGGGATTAATCAAAGCTATCTCAAAATTTTGGAAAGTCATGTTGTATACTCTCTTAGTAAAGAGAAATCTGCAGTCTGCTTTTATATGATTCAGTGCACCCGATCAGCGACTGAAGATGTAATTCAAGTTCCCATAAAAGATGCCATTGACAGGTTTTACTATTTTCCTTCACACCCATCCCCATCTTCAAATTTGTGTCTTGGCATATTTTTTACCGTCCCACCAGCTGAAAAGCTAATATGTTTAGGTTAGTAATTGGACACAGTTAAAAGAAAAAAAAAAACTGGAATTGGGTATTTGGTTTGAATGCATGTCCCAGGACCTCATATGCAATTAGCATGGATGAAGGTCAGTCCTCAATTGCATAGATACCTTTTTTGCTCATGAATTTTGTCTTTTCCATGATATTAGTTTGCAGGGTTCGTTGTTTAGAAAAAATGGTAGGAGATGGAGCATTACCTCAAAAGTTGAGTACTTCCACATTCTTCCTTATGCTAAGATGGTGCAAATCTGGTTTCATAGGTATAGACTTAATTGCCATCGACATTAGTATTACTGTTATTATTTATTTATCGTCTACATAACTTGATACTTTATAGCTTGATTGTTGTGACCTTGCTTATTTCTGCTTTCTGTTGCTGTATCCAACCTCTTGGCTCTTGCTAGTTGCCAGTGATGTGAGGGTTAACTATTATTGGATACGTGTTGGAAATTGAGCGTCCTGAATCATTCATATAGCTTTTACTTCATTCATTATTATTTCTTTTCTTTCACTTGATTCCGCTTTCTGTTGCTGTATCCAACCTCCTGGCTCTTGCTAGTTACATGTGATGTGAGGGTTAACTA
TGATTGGATACTTGCCGGAAATTGAGCCTCCTGAATCATTCATGTAGCTTTTACTTAATTCATTATTTTCTTTCACTTGATTCCACCAGAGAAACATTCGCTTTACCCCGTACTTAAAAATTAATAAAAAAAACACATTCATGTTGTTGTTGTTGTTGTTGTGTGTGTGTTTGTTTTTAATTTTAATTTGTAGGTCATTCTTTTAATTCCTCATCAGCTATTTTATTCTGACCTTAATTTTGTTTCATATAAAGTAATTTGGATGGAGAAACTCCCAAAAGGTTTGAGGTATTTGCTTCAATGTTGAATGTAAGAGAATCTACTAATGCATGTTAAGATGTTTTCCTTCTGTTTCTTGTCCATTATTTTTGTACACAATGAATGCTTATGATTTGTTGCAGAAAAGGAGGCCTTTTCTTGTTTCTCTTTCACCACATTTGTGCTTCCTTTGTCATAAGACACATGAGCCAGTTAATCACATGTTTCCTTTTTGTGATTTCTAAATTAAGGGTTAAGCTATTTTCTCAAATCATTTCAATGTTTGTCGGGTAGTTCCAATTCACTATATTGTTTGGATATTTATTGTGGCATTGCTAGTAAAAGATTGAAGACATTTTAAACATGTGAAGTTTCTGCATTTCCTTGAGCAATATGGTTGGAAAGGAATCTTTGAAAATACACATGATGAAGGAGAAATTATGATAATTTTGTTTTTTACTTTTTCTTGGTCATATAATGTAACTTGTAAATTTATTTGGTATGCACTCTAGCGAATAAGCTTCCTTTTTTATTAGAGACAAAAGAACTTTTAGTTGAAAATATTAAAGTACGAAAGAGGAGGGCAAAGCCTATTGGATGGAGCCTGCAAAAAAGAGAGGAAAGGAACTCCAATTGGCATTGATCATAAAAAATTCATAACTGCATAAGTGTTTGGATAGGATACACCTGAGTATGAAGATAAGAATAAAGATTAAGATATATATAAAGATAAAGATTAATCGAGAGACAGACGAATTTGCAAACCATAAGATAAATAAATAGACAAACAATATATCTGTAAACATTTTGTGGGTGGTGTGCTAAGTAGGTTGTTCGAACATGGGACCAGCAGGAGTTTACTAGAGGTGTACCTTGTAGGGAAGGACAAGGTACATATCTCCCATAATACAATTTGTGGATGATACATTCCTTTCATGTAAGGATGATAAAGAAAGCTATACTACCTATTCTCGGTGCTTAAAGTCTTTGAGGCTTTGTCCGGCTTAAAGATCAATTGTGAGAAGATTCAAATTTGTGATATAAACATCCCAAGCAACATGGTTTAATAGTTTGCTGCAAGGGTGAAGTGGAAAGCTTGTGTGCTACGCTTGTCTTATTTGGGCATGCCTTTGGGAGACAGCCAAAGAAAGCTTGGGTTTTGGAAACTTGTGGAGGAAAAAGTCTTATGTAAGTTTGATAAATGGAGGAAAATTTTCTTATGCAAGGGTGAAAGGTTGACTTTGTCTTGAGCGATCTTAGCAAACTTATCAAACTAAAAACCAACTGTCTTTCAATACATTCTATCTTGCTTGAGGATGGGTAGTGTGGAAATTGACTTGCAAGGTTGAGATCTTGACTGCAAATTTCCTTGATTTCCTGCCATTGATTCTCAGTCATTTATCCTAAAGCTCCCATTTTAATAGTTTGCATGTGTGGAGAATGCCCCCTTTTGTAATTTAGCATTTTCTTATGATTGAAATATCTATATTCTTTTTTAGAGACCACATTAGATCCGACCCAAACAGGGTGAGACCCTGTACCATCACTTAAACAGCCCCACTCGGGTCACAAGCTTGCGCAAGCTCGGCGATTGCAATATCTATATTAGGGAGTAATCACATAATTGAGAATAAAGGAAGTTATTTTTGCTAGATAATGAGAAGCGTTGTTATTCTTTATGACTTGTATATGAAAAGGTGATTACATACTAATATCTCCTATCCTCTCTTTCACGTAGGGAA
ACTTCAACAGATAGTTTGCGAGTCATAGGTGGAGAAAAGATAGATGAAAACTTGAACAAGCTTGAGAGAATAGATGCACCCAGGAAGCTTGAAATTCAAAACAACCAAGATGGTGCTAGTGCAAAGAATTTGAATAAAGGGACTAGTATTTATGGTGAAGGATTGGAGAGACTGCCAGATAAAACTAACTGTGCGAGTAGTTTGCATGATGCGATCTGCAGGCCCCAGAGTACTAATGTGGATGACTTCGTTCCCTCCTATCCAGTGGAGAAGAAAAAGGATGTACCCAATACTAGCCAAGTCATCTTTTCCTATACAAAGAAAAGAAATGCTAGGCAAGTTGACAATCGCCATGAAGTGATGATCCCATGTATGGTGAATGAATCGAATGCCTCAGAAAGTGGTATCAAAGTCAAGGTAAGACGTTGAAATCTTTTCAGATACTTATCTAGTCTTTATTTCATATTTATCATGAGAGATCTTAAGATACATATCTCGTCTTCATTTTGTATTTATCATTAGGACTCCTGATATGTTTCTTATCCAAAAAAAATTAGGACTCCCGATGCTTATGGGGACATTGACTCAGATGCCATTCAGCTCTCCCACTGTTTTAAAAAGCGCGCCTAGGCGTGCTTTCCAGAAAAGCGAGGTAAGGATGCCCGCTTCAAAGAAGCAAGGCGTCCCTAATAAAGCACCTTGAGGCGCGCGCCTCTCTGCATGTTGGCGCGCGCCTCTGCGCTTTTGTTCCTTGTTTTAAATATTTTTTGTTTTTTAATTCTTTTTATAAATAAATTCTTCAATACTTTAAAAAAATGTTAGTTTTACACATAAATCTCTTAAAAAACCTTTTTAAATTTTCTTTATCTTATCATTTATAATACTATTTTCTTTTATATATAAAAAAATAAGTTACATTTGCCTATTGTGCGCCTCACATAAAAAAGCCCTCGCTTTTTTTTGCACCTTGCGCTTAAGTTTTGGAAGATGATTGCGCTTTAGTGTGCTTCACGCTTTTAAAAAGACTGACCTCTCCTATAAATTCTGGCATAAACCCATATCTATTCTATATTTATAGGAATTATTATGGATTCATCCCTTGAAGGAACTGGACATGATGCTTTTGGGAGCAGGATCTTTTGAATGATTTTTTTTTCTTTGGTTATTTTGTATGCATGATCTAATCAAATCAAAATAAGGTTCAATGTCGTGACCTTTGGGCCAAGGGACAGTCTTGAGGATGGATTAATGTGGTTCCTCCATGCTTTAGACATCAGAAAAGATACTATTTTGTTTTTGGTTTCATTCTTTTAATGGATTAATTGAAGAACCTGCACATTTTAAATCCCTGACTTCTTTGGTGACGTAAAGAACTACATCAATTGTAGCAAGGTAAATGATGAAAAAAAAAAGAAATTTTGGAAGGAACTGTTAATCCTTTGAAAAATTATTAATTACTTCTCCAGTAAAATACTGTATACTTGGTGTGTAACTTTTTTAAATACTCAAAGCAAGGAAGCTGGAAAAAAACCTAGAATACATGGTGATCCCTCTAAACTTAGCGCACAATCAGATACTTTTTTAAGTGCCAGTAAATTAGAAGATAATCACGGAATCCTTTGTTATAATTATAAAAACTATCTTAAACTGCAGCAACTCTGATACAACATTAAATAGCAAAACTCTGTTACAGAACTTGTGGAAAAAATTAGACCGTTATCAATGCATTGAAATGAAGGATGGTGAAGATGTAATCATGCTCAAGATGTTTTTTTAAGAAAGAAAGAACTTGAAGTTTCTCCCTGGATTTAATGTGATGCAATCTAGCATAAAAAGTCTAGAACTATAAGAACTCTCAACCAAGAATCTTTATTATTATTCATTTAAGTCAATTAAAAATTTATTATAAATGATTTTAAACGAATAAATTAATTCTAACTATTAGTTTAAGTTTTTGGATTTAGGATTCTCTTTACTTGGAACCAAAGTAAGAGTTCTTG
AATTCAAATTCTTGCATGGCCACATCTTTACTTCAAGATTAATTATCTGTGTGATGAGCTTGTTAATCGAAGGAAAATTTCAATCTGCATGTGACAAAATGCTAGTTATTAATGTATTGAATAAATAATCTTGACCTAATAACTAAAGCCTTTATTTTCTTTTATTTATAAGTCAGCTGATAGCATAATTTACTTTCCAGGATGGGATATTAGCAACAAACCCGTGTATTGCTGAATGCAGTGGTGAAAAGATTGCTTCTGGAAATCTCTCTGACAATGTTTCATTTGATCGAAATAGGAACGGTGATCATGCTCTTATCACCTGTCAATCGAACTCAGAGCATCTTTCCAAGCTACATGCAATTATAGTTTCGAAAGAAACAGCACTGTCACAAGCTGCAATTAGAGCTCTAATCAGAAAGAGGGATAAACTGGTACACACATACATCATATATTGTGGTTAAGTTTAGGGTTTATCATTGTCCCATGAATTTAATTTGAGCCAGAAACTTTCTTTCAAAGTTCAGCACAATGCAAATGATGAGAACACGTAATATGTGTGATAAAGAACTGAAAGCTTCCACCATAATTGTAAAAGAGAAATGGGTGGTGAAGTGATTTAAATCTTGAGTTCTCCTTGCCCAACCTTGGCTGATGAGATGGTAGGAAAATAAATAAGCTTGCTCTGTATCTTGATATATGTATCACCTTACTATGGTATTCAGCTGTCCCTTTTTCTTTCTCTCTGACCAAGGGTTTCATATAGATTGAGCCCTTTTGAATATCGATGTTACATAATCGTTACTTTGACAAAATGTAACCCATTCCTTTTGTTTGCAGTCTCAGCAGCAGCGCATCATTGAAGATGAGATAGCTCAGTGTGATAAAAATATGCAGACAATATTAAGGGGTATGGTTTTGATGCTCTGTTTTCCCTTTTCTTCCATTTGTTCTTCCTGTTTTGTTTTGTTTCCGCAGTTTTGTTTATTTGCATGTTTCAAATACAAAGCCTCCAGAAATTCAGACAATTTCCTTCTTCAAATACCAGCAGTCCTCCGTCCTCTTTTTTCCTTAAATTTTACTTGATGAACCACTACCCCCAACTATCTACGATTTATGGTCTTATAAGCCCGTGGAGCATTCAAGATAAAAGTATTTTTCAGTATTGCATTGAGACTAAGATTTGAATGATCGTCATGATTTCTAATTTAAGGATTATACAGATCCTGAGCTCCAAATGGCAGTAAAGTTTCTCTCTCCTGGTTCAATGAGCATTATTTGCAGGCTCTTTCTATTTCCTTTGCAAAAGAAGACCATGCTTTGTGTAAAGTGTAAAGAAACAGCATTTGTTTTCTTTCTTGTTAACATAACTGTGGAAAAGCTTCTGTTGAAACTCGAAGGATTAGTGTGTAGAAAGTCTATAAATTTTTTACTTAGGTTGAGTATCAGAGCATGCTTGAACTCTTGTGTCTAAGTGTTTTATATAACAACACTTTCTGCCAAAAGTACGTTTCTAAATAATACATTACTTAGTGTTTTGCGGTTTCCTTTTATAAAGTAATATTGGAACTTTAAAATGTGATTATGATAAAACTATAAATGATTTTTCTTTAATTTGCTATTGACCAACCTGATTCATAATATTTTTTCTTACAAACTCCATGATAATACAATACAAGTTATATCGAAATAGTGATTTTTTTTCTTTTTTTTTTCCCTCCCGCCCTCACATTTGAGTATTAAGTTTTTGTTATTGTTGTTGAGTTGGAGTCTAAAACTTGAAAAGAAATATAAAAATTTAAGAATAGCACTGTATCAAAATATTTGATATTTGATATTGCTTTACTAAATCGCTAGATGTTTAGATTGGAGCTAAATTTTCAATATTATATGTAAATTGACGAATTGTGTATAAATAGAACAAGTGATATTAGAAATTCACAATTTAATTAAAATATGTATACAAAGACATTCAAAATTTTTATTAGTGCATCCTAGTA
TTTTTATAAAAGAAAAATATTTGATAGTGCTGTTTGCAAAAGCATGTACTGGGTGCTTCTAATCTAAGGCCTCATCATAGTGTTTTTAATTTTTAAATAACACCAACGTATACGTAGCTAACACCATTGGAGGTATTTTTTAAGCATTTAGAAAAAAAAATGCCTTGAAAACACTTCCAAAGAGACTTTTACTGTAAATTTGAATTTTGAGTTTTACTTCGTGTTCCATGGTTTTGCTTTCTTATTCTGTTCTGATCCATAATTCTTTTTAGATTTTATTTTGACCTTCTTTCATCAGATCGAGTTAATCAATAAGTCGTCATAAGTAGCAAAGAAAATATCTGAATTACATTCATATTTTGGTTGCTAAAGGTGATGAAGATGATTTGGTTATAAAGCTGGATTCTGTGATTGATTGTTGTAATGATGTATGTCTAAGAATACTGCCGAAGATAGATCTAATCAGTGCTTTGAAGAAAACTGCTCATCTCAATATGTCTCAAGGAAGAGATTGTCAGAAGCAGTTCTCTGTGTACAAAATCCATGTCAGGTGGGTTAACCATAGAAATATATTCACATTTCTTTTATATGCAAGGGTATTTTCATGGTTGTATTTCATGGTATATTTATGTCTTAATACTGTCCGTGACATATATTCATTTTTAATAATGGGAGACAAGACTTTCATTACAAGAGATGAAAGTAAAAGAGAGGCTACAAGAAGACCAATAGGGCATAAAAGGAACTCTTCAAAAGTACAGCCAAGCTAAAATTATACAGCACAAAAACATGAAAGACACCGATCTATAAATGATAAATCATAAATTGAACAACCGAATTAGAAGAATAGAGTGCTGTTAAATAGCTTTGTGATGGCTAAGTATACAAGTTGGATATAGTGGACTCAAACAACTTACACCGGTCATAGGGCAACCCACCGTTTCTTAGGCCCCCTACTATTTTACCATAATAGGGATATAGCCACTGTTATCAATGTCGCCACTATCATCAATATTGTCTATCATCAATAGGAAAATCTTTAGGCCTGTCATTTAAGATCTTATTAAAAAAAAATACTGTAATGAAAGGGAGATGGGAATTAACCATAGGTCAAGTTAAAAAATTAAAGGGACTTTGGGATCATGGGTTCAAACTTCATTGTGCAAGTTGGTCTGGATACTTAAGGTATATAAAAAAAAAAAAGGAAGGAATTTGTTGCTAGGTATTTGACCAAATAGGATTGTTCAATAACTGTAGAGCCTCACTAAAATGCCCCTAGAGGGGAATAAGGATAAGCGGAGCCAAAAGGGCTTACCATTAGTTGGGAAATGAAAACTATTAGCAACTGGGTTTTGGGAGTTTGTAGTCATCCGTTTTTGGTAAAATAGGGTTTTTTTTGGTTAATATAACAATTTATCAGGAAGATAACACTGTTACCCCTTAAGGTGTGAAAGTTTATGCAGCTGTTTGCTACATTTGGTTAGTGAGTGTCATCTAGTTAAGGATATTCCTCAAGTTTAGTTCTATCTGATGCCATACACAGGCTAAAAAATATGAACATGGACACTGTGACGTCATTTTTCAAAAATAGACATAGGCACAAGGACATATCATTAAACGTTGAAGTATGACTCATTAAAGACAGTGGGGAAGTTCTTTCTTCTTCTCTTCTCTTCTTTTATTATTATTTTTTTTTGGGGGGGGGGGGGTTGGAAGTGAATTTTCAAGGTCCTAGTTTTGTCTACTTTATATAAGTGACTCCTAATTTATGTGAGCAACAAGCGGCCTCAATATGCTTTGCATAGATGATATTGGGATTTATTTATTTTATGTTTTAAACTTCTTTATGGTCCCTCTCATATGAAAGAGATTTCTAACCTTTTTCTTTTTCTGATTTCTTTTACTGTAGAGCTTTCTCTTAATGTCTTTTGTAACTGAGAACGGACAGGGACACCTATTATCTTTTCATAGTTTTATTTTTTGTTTTGTGGATAGAATGC
GTTCCTCATAAAAGAAAATTGAAAACTTAGTATATGAGATTCGTTATAAATACAAATTTTATCTTTCTATTTCAGGATGATGTTAGGCCTCCAATAGTAAGAGTGTACGAGAGGAGGGGTAAAAAGTGAAATGTATCCTTTGAAAGTTAGTTAGGATAGACCTCACTTGTATATGTAGGGGAATGATGTAAGCGTGAGTAGGGGAATGATGTGAGTGTGAGGGAAGAGGAATCTTGTGTGACGCATGTATAGGGAGAGAGCTGGCCCTCAAGTTTTGTAAGGTGTTTGGTTCACCTCTTTCTTTGTGAATAGTAATGCTATCAGTGTTGATACTTAATTGCTTTAAAACATGCTTTCTTGTTTCTGGTAATTGTAAAAAAACCAAAACAAAACTTAAGCGTGACTTCAACTTGGTCTCAGGTTGCCTGTAATTTGGACTTCAACTTTTGAAATTGCCTTAATTGATTCCTCTGTCTACTAGGTTCTTTTAAATTAAACAGTAATATCCATGTGGATGCCACATGGACCTTTTTTTTTTTCCTTAAGAGACTTTCCTTTCTTGGATACGGACGCTAGTATAAATGTCAAGTTGCAATGGCCTCAAAATAACAAATGTTTACATTTTATCAAGGATGCAGTGAATAAATCATAGAAACTTGTAAGTTGTAATGCCATTGTTGAGATATTCAGTTCAGAGCTGCTAATAAGGTATTTGTTATGTATGGCCAACTTAGGTTAATAAAAAGTATAGTGTATTGACCTCTTTAATGTGTTGATTAGAATTGTATCTTGCATGTCAAATTTATTGGGACAAGTTATCAATGTGGACTTGCTAATGCAAAGAAACATTGAACTTCAAAAATTTTGAAAAGCAAGTAGCATAAGCTTTCAGAGACAGTCTAAAATGGTGCTGTTGGGATGCACATTTGATCTTAAATATTGCATACAGGCATTTAAGAGAAAAGAAATGGTAGTCTTCTATACATTTTTCTTTCATTCTTTCCACATTTGTTTACATTTGCGATTATAAATGTAATAATGCTCTATTGACTTGGCAGGAACTGGATGGTATATGTCATAAAAATAATTGGATATTGCCAATTTATGGTGTTTCTTCATCAGATGGTAAGATCTTTGTAGATTCAGTCGGTCAATTTTTTATAGAGAAACATTAAATTCTTTTCTTCTAATGTTCACTATGCAGTCTTTCAAGTAAATTATGTGAAGCGAATGATTAATAAATATATTTTTCTTTTCCCCTGAGTACAATGTGCCTCGCAAATTAATTGGAGATCATAGTTGTCTTTGCCTTCAGTTTTCCATCTGATTTTGTGGATTCTCCATCTTCCATCTCAAAATATTCTACAGATATATTGGTAGATCTTGAAGAAGATGATCAAATTTTGATAGTGCCTCATTCCTCTCTCTCTCTCTCTCTCTCTCTCCTTTTTTTCTGCTTCTGCCATTTGAGGCTGGCCGGGTTTTGTTTATGCGTGCATGTCTGATGCATTGTTCTGGTGTACTTGCAATTTGAAATCATAGTCAGCACCTATTATAGATTAGTCGTGCTAAAATGTGTTGCTTGTTCTTTTGAAGGTGGATTCCAAGCCAATGTATTTCTAAAAGGGATGGATTTTGAGTATTCAAGCTGCGGTGAGCAGTGTTCAAATCCTCGTGAAGCGAGGGAATCAGCTGCAACAAAGATGTTGGGTCAACTATGGAGTATGGCAAGCCAGGCCAAGTAG
mRNA sequence
ATGAGTGCACCAGATGTATGCCCAACCGAGGATGCCATACATGCATTATTAGATTATTTAGTTGAACCTATGCTTCCTGCAAAGTCATCTTCGAGAGACAATCCACCACAATCTCTACTGCAATCAGTTGCAAAACAGGTGCATGCCGCTGTTATATTGTACAACTACTACCATCGGAAACAACATCCACACCTTGAATTTTTGAGTTTTGAGGCATTTTGCAAGTTGGCTGTGGTCATAAAACCAGCTTTGTTGTCTCACATGAAACTCATGCAAAGATCAGATGATACTGAATTGGAAAACCCTGAGAAGCAGCTTTCTCCAGCAGAAAAAGCAATTATGGATGCTTGTGATATAGCCACTTGTCTAGAAGCATCAAAAGATGAAAACGTAGAGGGCTGGTCTCGTTCCAAGGTTGCTGTTCTTTTAATTGACTCCAAGAAGGAGCATTGCCATTTGCTATTTAGTTTCATCACTCAAGGAGTTTGGTCTGTGATTGAACAAGATTTGGATACCTCTGAATGCCAACCAGAAACGGTGGAAGAGGAAAAACATGTAAACAAAAAGAAAAGAGTGATTAAGAAACCTTCAAAAGAGGGGCTAGTTGTTGATGAAGCTAAGACACAGCAGCTTGCATATTCAGCAGTTAAGGAAGCAACTGGGATTAATCAAAGCTATCTCAAAATTTTGGAAAGTCATGTTGTATACTCTCTTAGTAAAGAGAAATCTGCAGTCTGCTTTTATATGATTCAGTGCACCCGATCAGCGACTGAAGATGTAATTCAAGTTCCCATAAAAGATGCCATTGACAGTTTGCAGGGTTCGTTGTTTAGAAAAAATGGTAGGAGATGGAGCATTACCTCAAAAGTTGAGTACTTCCACATTCTTCCTTATGCTAAGATGGTGCAAATCTGGTTTCATAGGGAAACTTCAACAGATAGTTTGCGAGTCATAGGTGGAGAAAAGATAGATGAAAACTTGAACAAGCTTGAGAGAATAGATGCACCCAGGAAGCTTGAAATTCAAAACAACCAAGATGGTGCTAGTGCAAAGAATTTGAATAAAGGGACTAGTATTTATGGTGAAGGATTGGAGAGACTGCCAGATAAAACTAACTGTGCGAGTAGTTTGCATGATGCGATCTGCAGGCCCCAGAGTACTAATGTGGATGACTTCGTTCCCTCCTATCCAGTGGAGAAGAAAAAGGATGTACCCAATACTAGCCAAGTCATCTTTTCCTATACAAAGAAAAGAAATGCTAGGCAAGTTGACAATCGCCATGAAGTGATGATCCCATGTATGGTGAATGAATCGAATGCCTCAGAAAGTGGTATCAAAGTCAAGGATGGGATATTAGCAACAAACCCGTGTATTGCTGAATGCAGTGGTGAAAAGATTGCTTCTGGAAATCTCTCTGACAATGTTTCATTTGATCGAAATAGGAACGGTGATCATGCTCTTATCACCTGTCAATCGAACTCAGAGCATCTTTCCAAGCTACATGCAATTATAGTTTCGAAAGAAACAGCACTGTCACAAGCTGCAATTAGAGCTCTAATCAGAAAGAGGGATAAACTGTCTCAGCAGCAGCGCATCATTGAAGATGAGATAGCTCAGTGTGATAAAAATATGCAGACAATATTAAGGGGTGATGAAGATGATTTGGTTATAAAGCTGGATTCTGTGATTGATTGTTGTAATGATGTATGTCTAAGAATACTGCCGAAGATAGATCTAATCAGTGCTTTGAAGAAAACTGCTCATCTCAATATGTCTCAAGGAAGAGATTGTCAGAAGCAGTTCTCTGTGTACAAAATCCATGAACTGGATGGTATATGTCATAAAAATAATTGGATATTGCCAATTTATGGTGTTTCTTCATCAGATGGTGGATTCCAAGCCAATGTATTTCTAAAAGGGATGGATTTTGAGTATTCAAGCTGCGGTGAGCAGTGTTCAAATCCTCGTGAAGCGAGGGAATCAGCTGCAACAAAGATGTT
GGGTCAACTATGGAGTATGGCAAGCCAGGCCAAGTAG
Coding sequence (CDS)
ATGAGTGCACCAGATGTATGCCCAACCGAGGATGCCATACATGCATTATTAGATTATTTAGTTGAACCTATGCTTCCTGCAAAGTCATCTTCGAGAGACAATCCACCACAATCTCTACTGCAATCAGTTGCAAAACAGGTGCATGCCGCTGTTATATTGTACAACTACTACCATCGGAAACAACATCCACACCTTGAATTTTTGAGTTTTGAGGCATTTTGCAAGTTGGCTGTGGTCATAAAACCAGCTTTGTTGTCTCACATGAAACTCATGCAAAGATCAGATGATACTGAATTGGAAAACCCTGAGAAGCAGCTTTCTCCAGCAGAAAAAGCAATTATGGATGCTTGTGATATAGCCACTTGTCTAGAAGCATCAAAAGATGAAAACGTAGAGGGCTGGTCTCGTTCCAAGGTTGCTGTTCTTTTAATTGACTCCAAGAAGGAGCATTGCCATTTGCTATTTAGTTTCATCACTCAAGGAGTTTGGTCTGTGATTGAACAAGATTTGGATACCTCTGAATGCCAACCAGAAACGGTGGAAGAGGAAAAACATGTAAACAAAAAGAAAAGAGTGATTAAGAAACCTTCAAAAGAGGGGCTAGTTGTTGATGAAGCTAAGACACAGCAGCTTGCATATTCAGCAGTTAAGGAAGCAACTGGGATTAATCAAAGCTATCTCAAAATTTTGGAAAGTCATGTTGTATACTCTCTTAGTAAAGAGAAATCTGCAGTCTGCTTTTATATGATTCAGTGCACCCGATCAGCGACTGAAGATGTAATTCAAGTTCCCATAAAAGATGCCATTGACAGTTTGCAGGGTTCGTTGTTTAGAAAAAATGGTAGGAGATGGAGCATTACCTCAAAAGTTGAGTACTTCCACATTCTTCCTTATGCTAAGATGGTGCAAATCTGGTTTCATAGGGAAACTTCAACAGATAGTTTGCGAGTCATAGGTGGAGAAAAGATAGATGAAAACTTGAACAAGCTTGAGAGAATAGATGCACCCAGGAAGCTTGAAATTCAAAACAACCAAGATGGTGCTAGTGCAAAGAATTTGAATAAAGGGACTAGTATTTATGGTGAAGGATTGGAGAGACTGCCAGATAAAACTAACTGTGCGAGTAGTTTGCATGATGCGATCTGCAGGCCCCAGAGTACTAATGTGGATGACTTCGTTCCCTCCTATCCAGTGGAGAAGAAAAAGGATGTACCCAATACTAGCCAAGTCATCTTTTCCTATACAAAGAAAAGAAATGCTAGGCAAGTTGACAATCGCCATGAAGTGATGATCCCATGTATGGTGAATGAATCGAATGCCTCAGAAAGTGGTATCAAAGTCAAGGATGGGATATTAGCAACAAACCCGTGTATTGCTGAATGCAGTGGTGAAAAGATTGCTTCTGGAAATCTCTCTGACAATGTTTCATTTGATCGAAATAGGAACGGTGATCATGCTCTTATCACCTGTCAATCGAACTCAGAGCATCTTTCCAAGCTACATGCAATTATAGTTTCGAAAGAAACAGCACTGTCACAAGCTGCAATTAGAGCTCTAATCAGAAAGAGGGATAAACTGTCTCAGCAGCAGCGCATCATTGAAGATGAGATAGCTCAGTGTGATAAAAATATGCAGACAATATTAAGGGGTGATGAAGATGATTTGGTTATAAAGCTGGATTCTGTGATTGATTGTTGTAATGATGTATGTCTAAGAATACTGCCGAAGATAGATCTAATCAGTGCTTTGAAGAAAACTGCTCATCTCAATATGTCTCAAGGAAGAGATTGTCAGAAGCAGTTCTCTGTGTACAAAATCCATGAACTGGATGGTATATGTCATAAAAATAATTGGATATTGCCAATTTATGGTGTTTCTTCATCAGATGGTGGATTCCAAGCCAATGTATTTCTAAAAGGGATGGATTTTGAGTATTCAAGCTGCGGTGAGCAGTGTTCAAATCCTCGTGAAGCGAGGGAATCAGCTGCAACAAAGATGTT
GGGTCAACTATGGAGTATGGCAAGCCAGGCCAAGTAG
Protein sequence
MSAPDVCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSLLQSVAKQVHAAVILYNYYHRKQHPHLEFLSFEAFCKLAVVIKPALLSHMKLMQRSDDTELENPEKQLSPAEKAIMDACDIATCLEASKDENVEGWSRSKVAVLLIDSKKEHCHLLFSFITQGVWSVIEQDLDTSECQPETVEEEKHVNKKKRVIKKPSKEGLVVDEAKTQQLAYSAVKEATGINQSYLKILESHVVYSLSKEKSAVCFYMIQCTRSATEDVIQVPIKDAIDSLQGSLFRKNGRRWSITSKVEYFHILPYAKMVQIWFHRETSTDSLRVIGGEKIDENLNKLERIDAPRKLEIQNNQDGASAKNLNKGTSIYGEGLERLPDKTNCASSLHDAICRPQSTNVDDFVPSYPVEKKKDVPNTSQVIFSYTKKRNARQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNVSFDRNRNGDHALITCQSNSEHLSKLHAIIVSKETALSQAAIRALIRKRDKLSQQQRIIEDEIAQCDKNMQTILRGDEDDLVIKLDSVIDCCNDVCLRILPKIDLISALKKTAHLNMSQGRDCQKQFSVYKIHELDGICHKNNWILPIYGVSSSDGGFQANVFLKGMDFEYSSCGEQCSNPREARESAATKMLGQLWSMASQAK
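The protein sequence above appears to be the conceptual translation of the CDS under the standard genetic code. The following is a minimal Python sketch of that relationship, not the pipeline that produced this record. The names `CODON_TABLE` and `translate` are hypothetical, and the choice of the standard (table 1) code is an assumption.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGAGTGCACCAGATGTATGCCCAACCGAG"))
```

Passing the full CDS (both wrapped lines joined) to `translate` should reproduce the listed protein, which gives a quick consistency check between the two records.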
Homology
BLAST of Sgr021853 vs. NCBI nr
Match:
XP_022150346.1 (uncharacterized protein LOC111018541 isoform X1 [Momordica charantia] >XP_022150347.1 uncharacterized protein LOC111018541 isoform X1 [Momordica charantia] >XP_022150348.1 uncharacterized protein LOC111018541 isoform X1 [Momordica charantia] >XP_022150349.1 uncharacterized protein LOC111018541 isoform X1 [Momordica charantia])
HSP 1 Score: 1113.6 bits (2879), Expect = 0.0e+00
Identity = 576/684 (84.21%), Positives = 609/684 (89.04%), Query Frame = 0
Query: 1 MSAPDVCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSLLQSVAKQVHAAVILYNYYHRK 60
MSA VCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSL QSVAKQVHA VILYNYYHRK
Sbjct: 1 MSALGVCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSLQQSVAKQVHAVVILYNYYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVIKPALLSHMKLMQRSDDTELENPEKQLSPAEKAIMDACDIA 120
QHPHLE LSFEAFCKLAVV+KPALLSHMKLMQ SDDTELENPEKQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLELLSFEAFCKLAVVVKPALLSHMKLMQSSDDTELENPEKQLSPAEKAIMDACDIA 120
Query: 121 TCLEASKDENVEGWSRSKVAVLLIDSKKEHCHLLFSFITQGVWSVIEQDLDTSECQPETV 180
TCLEASKDENVEGW SKVAVLLIDS+KE CHLLFSFITQGVWSVIEQDLDTSECQPETV
Sbjct: 121 TCLEASKDENVEGWPLSKVAVLLIDSRKECCHLLFSFITQGVWSVIEQDLDTSECQPETV 180
Query: 181 EEEKHVNKKKRVIKKPSKEGLVVDEAKTQQLAYSAVKEATGINQSYLKILESHVVYSLSK 240
EEEKHVNKK+RVIKKPSKE VVDEAKTQQLAYSAVKEATGINQ LKIL+ HVVYSLSK
Sbjct: 181 EEEKHVNKKRRVIKKPSKEVSVVDEAKTQQLAYSAVKEATGINQRDLKILDGHVVYSLSK 240
Query: 241 EKSAVCFYMIQCTRSATEDVIQVPIKDAIDSLQGSLFRKNGRRWSITSKVEYFHILPYAK 300
EKSAV FYMIQCT+SATEDVIQVPIKDA+DSLQGSLFRK+GRRWSITSKVE+FHILPYAK
Sbjct: 241 EKSAVRFYMIQCTQSATEDVIQVPIKDAMDSLQGSLFRKDGRRWSITSKVEHFHILPYAK 300
Query: 301 MVQIWFHRETSTDSLRVIGGEKIDENLNKLERIDAPRKLEIQNNQDGASAKNLNKGTSIY 360
MV W RETS DSLRV+ GEK+DENL+KLERIDAPRKLEIQN+QDG SA +L+KGTSIY
Sbjct: 301 MVLTWLQRETSRDSLRVVSGEKMDENLSKLERIDAPRKLEIQNDQDGDSANDLSKGTSIY 360
Query: 361 GEGLERLPDKTNCASSLHDAICRPQSTNVDDFVPSYPVEKKKDVPNTSQVIFSYTKKRNA 420
GEGLE+L +KTN SLHDAICRPQ TNVDD VPSYPV+KKKDVPNTSQVI SYTKKRNA
Sbjct: 361 GEGLEKLHNKTNHVGSLHDAICRPQITNVDDLVPSYPVDKKKDVPNTSQVIVSYTKKRNA 420
Query: 421 RQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNVSFDRN 480
RQVDN HEVMIPC NESNASESGIK+KDG+LATNPCIAECSGEKIASGN SDNVSFD+N
Sbjct: 421 RQVDNGHEVMIPCTGNESNASESGIKIKDGVLATNPCIAECSGEKIASGNFSDNVSFDQN 480
Query: 481 RNGDHALITCQSNSEHLSKLHAIIVSKETALSQAAIRALIRKRDKLSQQQRIIEDEIAQC 540
RNGDHALITCQSN EHLSKL AI+VSKETALSQAAIRALIRKRDKLS QQRIIEDEIAQC
Sbjct: 481 RNGDHALITCQSNIEHLSKLQAILVSKETALSQAAIRALIRKRDKLSHQQRIIEDEIAQC 540
Query: 541 DKNMQTILRGDEDDLVIKLDSVIDCCNDVCLRILPKIDLISALKKTAHLNMSQGRDCQKQ 600
DK +QTILRGDEDDLVIKLDSVI+CCNDVCLR + K+ N S +K+
Sbjct: 541 DKKVQTILRGDEDDLVIKLDSVIECCNDVCLRNTAEDGSYQCFKE----NCSSQYVTRKR 600
Query: 601 FSVYKI------HELDGICHKNNWILPIYGVSSSDGGFQANVFLKGMDFEYSSCGEQCSN 660
S + ELD ICHKNNWILP+Y +SSSDGGFQANVF+KG+DFEYSSC E CSN
Sbjct: 601 LSEAVLCVRSPCQELDAICHKNNWILPVYSISSSDGGFQANVFVKGLDFEYSSCSETCSN 660
Query: 661 PREARESAATKMLGQLWSMASQAK 679
PREAR SAATKMLGQLWS+ASQ K
Sbjct: 661 PREARASAATKMLGQLWSIASQRK 680
BLAST of Sgr021853 vs. NCBI nr
Match:
XP_008445716.1 (PREDICTED: uncharacterized protein LOC103488666 isoform X1 [Cucumis melo] >XP_008445717.1 PREDICTED: uncharacterized protein LOC103488666 isoform X1 [Cucumis melo])
HSP 1 Score: 1088.2 bits (2813), Expect = 0.0e+00
Identity = 556/696 (79.89%), Positives = 606/696 (87.07%), Query Frame = 0
Query: 1 MSAPDVCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSLLQSVAKQVHAAVILYNYYHRK 60
MSAP VCPTEDAIHALLDYLVEPMLPAKSSSR+NPP++LLQSVAKQ+HA V+LYN+YHRK
Sbjct: 1 MSAPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPEALLQSVAKQMHAVVLLYNFYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVIKPALLSHMKLMQRSDDTELENPEKQLSPAEKAIMDACDIA 120
QHPHLEFLSFEAFCKLAV++KPALLSHMKLMQ SDD ELENPEKQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEAFCKLAVIVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACDIA 120
Query: 121 TCLEASKDENVEGWSRSKVAVLLIDSKKEHCHLLFSFITQGVWSVIEQDLDTSECQPETV 180
TCLEAS DEN+EGW SKVAV L+DSKKEHC+LLFSFITQGVWSVIEQD+D+SE QPETV
Sbjct: 121 TCLEASPDENIEGWPLSKVAVFLVDSKKEHCYLLFSFITQGVWSVIEQDIDSSEWQPETV 180
Query: 181 EEEKHVNKKKRVIKKPSKEGLVVDEAKTQQLAYSAVKEATGINQSYLKILESHVVYSLSK 240
+EE+HVNKKKRVIKKPSKEGLVVDE KTQQ+AY+AVKEATGINQS LKILESHVVYSLSK
Sbjct: 181 DEERHVNKKKRVIKKPSKEGLVVDETKTQQVAYTAVKEATGINQSDLKILESHVVYSLSK 240
Query: 241 EKSAVCFYMIQCTRSATEDVIQVPIKDAIDSLQGSLFRKNGRRWSITSKVEYFHILPYAK 300
EKSAVCFYMIQCTRSATEDVIQVPI+D ++SLQ SLFRK+GRRWSITSKVEYFHILPYAK
Sbjct: 241 EKSAVCFYMIQCTRSATEDVIQVPIRDVVNSLQDSLFRKSGRRWSITSKVEYFHILPYAK 300
Query: 301 MVQIWFHRETSTDSLRVIGGEKIDENLNKLERIDAPRKLEIQNNQDGASAKNLNKGTSIY 360
M WFHRE+S+D L VIG EK+DENLN+ ERID R+L++QNNQ+GASA NLN +IY
Sbjct: 301 MALTWFHRESSSDKLGVIGEEKVDENLNRPERIDVIRRLKVQNNQNGASANNLNIRANIY 360
Query: 361 GEGLERLPDKTNCASSLHDAICRPQSTNVDDFVPSYPVEKKKDVPNTSQVIFS----YTK 420
G+G ERLPDKTNC SLHDAI RPQST+VDD VPSYPVEKKKDVPNTSQ I S YTK
Sbjct: 361 GKGFERLPDKTNCVGSLHDAIYRPQSTSVDDLVPSYPVEKKKDVPNTSQAIVSYTKTYTK 420
Query: 421 KRNARQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNVS 480
K RQVDN +E+MIPCMVNES+ASESGIK KDGILATNPCIAECSGEKIASGNLSDN+S
Sbjct: 421 KITDRQVDNSYELMIPCMVNESDASESGIKAKDGILATNPCIAECSGEKIASGNLSDNIS 480
Query: 481 FDRNRNGDHALITCQSNSEHLSKLHAIIVSKETALSQAAIRALIRKRDKLSQQQRIIEDE 540
FD+NRNGDHALITCQSN+EHLSKL AIIVSKETALSQAAI+ALIRKRDKLS QQR+IEDE
Sbjct: 481 FDQNRNGDHALITCQSNAEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDE 540
Query: 541 IAQCDKNMQTILRGDEDDLVIKLDSVIDCCNDVCLRILPKIDLISALKKTAHLNMSQ--G 600
IAQCDKNMQTILRGDEDDLV+KLDSVIDCCND+C + TA Q
Sbjct: 541 IAQCDKNMQTILRGDEDDLVLKLDSVIDCCNDLC-------------QSTAEDKSYQYFE 600
Query: 601 RDCQKQFSVYK------------IHELDGICHKNNWILPIYGVSSSDGGFQANVFLKGMD 660
+C Q+ K ELDGICHKNNWILP+YGVSS DGGFQANVF+KGMD
Sbjct: 601 ENCSSQYVTRKRLSEAILCIQNPCQELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMD 660
Query: 661 FEYSSCGEQCSNPREARESAATKMLGQLWSMASQAK 679
FEYSSCGE CS+PR+ARESAA KMLGQLW MA+QAK
Sbjct: 661 FEYSSCGELCSDPRDARESAAMKMLGQLWRMANQAK 683
BLAST of Sgr021853 vs. NCBI nr
Match:
XP_038884896.1 (uncharacterized protein LOC120075512 isoform X2 [Benincasa hispida])
HSP 1 Score: 1076.6 bits (2783), Expect = 0.0e+00
Identity = 552/690 (80.00%), Positives = 597/690 (86.52%), Query Frame = 0
Query: 1 MSAPDVCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSLLQSVAKQVHAAVILYNYYHRK 60
MS PDVCPTEDAIHALLDYLVEPMLPAKSSSR+NPP++LLQSVAKQ+HA ++LYNYYHRK
Sbjct: 1 MSTPDVCPTEDAIHALLDYLVEPMLPAKSSSRENPPEALLQSVAKQMHAVILLYNYYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVIKPALLSHMKLMQRSDDTELENPEKQLSPAEKAIMDACDIA 120
QHPHLEFLSFEAFCKLAV++KPALLSHMKLMQ SDD ELENPEKQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEAFCKLAVIVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACDIA 120
Query: 121 TCLEASKDENVEGWSRSKVAVLLIDSKKEHCHLLFSFITQGVWSVIEQDLDTSECQPETV 180
TCLEAS +ENVEGW SKVAV LIDSK+EHC+LLFSFITQGVWSVIEQD+DTSECQPETV
Sbjct: 121 TCLEASTNENVEGWPLSKVAVFLIDSKREHCYLLFSFITQGVWSVIEQDIDTSECQPETV 180
Query: 181 EEEKHVNKKKRVIKKPSKEGLVVDEAKTQQLAYSAVKEATGINQSYLKILESHVVYSLSK 240
+EEKHVNKKKRVIKK SKEGLVVDEAKTQQLAY AVKEATGINQS LKILESHVVYSLSK
Sbjct: 181 DEEKHVNKKKRVIKKASKEGLVVDEAKTQQLAYKAVKEATGINQSDLKILESHVVYSLSK 240
Query: 241 EKSAVCFYMIQCTRSATEDVIQVPIKDAIDSLQGSLFRKNGRRWSITSKVEYFHILPYAK 300
EKSAVCFY+IQCTRSATEDVIQVPI+DA++SLQ LF+++GRRW ITSKVEYFHILPYAK
Sbjct: 241 EKSAVCFYIIQCTRSATEDVIQVPIRDAVNSLQDLLFKRSGRRWGITSKVEYFHILPYAK 300
Query: 301 MVQIWFHRETSTDSLRVIGGEKIDENLNKLERIDAPRKLEIQNNQDGASAKNLNKGTSIY 360
MV WFHRETS D+L IG EKIDENLN+ ERID RKL+IQN+Q+GASA ++ S
Sbjct: 301 MVLTWFHRETSLDNLGGIGEEKIDENLNRPERIDVTRKLKIQNDQNGASANHMYTEASTC 360
Query: 361 GEGLERLPDKTNCASSLHDAICRPQSTNVDDFVPSYPVEKKKDVPNTSQVIFSYTKKRNA 420
GEGLERL D TNC LHDAICRPQS NVDD VPSY EKKKDVPNTSQVI SYTKKRNA
Sbjct: 361 GEGLERLSDNTNCVGGLHDAICRPQSANVDDIVPSYTAEKKKDVPNTSQVIISYTKKRNA 420
Query: 421 RQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNVSFDRN 480
RQ DN +EVM PCM+NESNA ES IKVKDGILATNPCIAECSGEKIASGNLSDN+SFD+N
Sbjct: 421 RQADNHYEVMTPCMINESNALES-IKVKDGILATNPCIAECSGEKIASGNLSDNISFDQN 480
Query: 481 RNGDHALITCQSNSEHLSKLHAIIVSKETALSQAAIRALIRKRDKLSQQQRIIEDEIAQC 540
RN DHALITCQSN+EHLSKL AIIVSKETALSQAAI+ALIRKRDKLS QQ +IEDEIAQC
Sbjct: 481 RNDDHALITCQSNTEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQHLIEDEIAQC 540
Query: 541 DKNMQTILRGDEDDLVIKLDSVIDCCNDVCLRILPKIDLISALKKTAHLNMSQGRDCQKQ 600
DKNMQTIL+GDEDDLVIKLDSVI+CCNDVCLR S + ++ + +C Q
Sbjct: 541 DKNMQTILKGDEDDLVIKLDSVIECCNDVCLR--------STAEDKSYQYFEE--NCSSQ 600
Query: 601 FSVYK------------IHELDGICHKNNWILPIYGVSSSDGGFQANVFLKGMDFEYSSC 660
+ K ELDGICHKN WILP+YGVSS DGGFQANVF+KGMDFEYSSC
Sbjct: 601 YVTRKRLSEAILCVQNPCKELDGICHKNYWILPVYGVSSIDGGFQANVFVKGMDFEYSSC 660
Query: 661 GEQCSNPREARESAATKMLGQLWSMASQAK 679
GE CS+PREARESAA KMLGQLW MAS K
Sbjct: 661 GELCSDPREARESAAMKMLGQLWRMASVGK 679
BLAST of Sgr021853 vs. NCBI nr
Match:
XP_011656540.1 (uncharacterized protein LOC101206764 isoform X1 [Cucumis sativus] >XP_011656541.1 uncharacterized protein LOC101206764 isoform X1 [Cucumis sativus])
HSP 1 Score: 1074.3 bits (2777), Expect = 5.2e-310
Identity = 550/692 (79.48%), Positives = 602/692 (86.99%), Query Frame = 0
Query: 1 MSAPDVCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSLLQSVAKQVHAAVILYNYYHRK 60
MSAP VCPTEDAIHALLDYLVEPMLPAKSSSR+NPP++LLQSVAKQ+HA V+LYN+YH+K
Sbjct: 1 MSAPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPEALLQSVAKQMHAVVLLYNFYHQK 60
Query: 61 QHPHLEFLSFEAFCKLAVVIKPALLSHMKLMQRSDDTELENPEKQLSPAEKAIMDACDIA 120
QHPHLEFLSFE FCKLAV+IKPALLSHMKLMQ SDD ELENPEKQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFETFCKLAVIIKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACDIA 120
Query: 121 TCLEASKDENVEGWSRSKVAVLLIDSKKEHCHLLFSFITQGVWSVIEQDLDTSECQPETV 180
TCLEAS DENVEGW SKVAV L+DSKKEHC+LLFSFITQGVWSVIEQD+D+SE QPETV
Sbjct: 121 TCLEASTDENVEGWPLSKVAVFLVDSKKEHCYLLFSFITQGVWSVIEQDIDSSEWQPETV 180
Query: 181 EEEKHVNKKKRVIKKPSKEGLVVDEAKTQQLAYSAVKEATGINQSYLKILESHVVYSLSK 240
+ E+HVNKKKRVIKKPSKEGLVVDEAKTQQLAY+AVKEATGINQS LKILESHVVYSLSK
Sbjct: 181 DVERHVNKKKRVIKKPSKEGLVVDEAKTQQLAYTAVKEATGINQSDLKILESHVVYSLSK 240
Query: 241 EKSAVCFYMIQCTRSATEDVIQVPIKDAIDSLQGSLFRKNGRRWSITSKVEYFHILPYAK 300
EKSAVCFYMIQCTRSATEDVIQVPI+D +SLQ SLFRK+GRRWSITSKVEYFHILPYAK
Sbjct: 241 EKSAVCFYMIQCTRSATEDVIQVPIRDVANSLQDSLFRKSGRRWSITSKVEYFHILPYAK 300
Query: 301 MVQIWFHRETSTDSLRVIGGEKIDENLNKLERIDAPRKLEIQNNQDGASAKNLNKGTSIY 360
M WFHRE+S+D L VIG EK+DENLN+ ERID RKL+++NNQ+GASA NLNK +IY
Sbjct: 301 MALTWFHRESSSDKLGVIGEEKVDENLNRRERIDVTRKLKVENNQNGASANNLNKSANIY 360
Query: 361 GEGLERLPDKTNCASSLHDAICRPQSTNVDDFVPSYPVEKKKDVPNTSQVIFSYTKKRNA 420
G+GLERLPDKTNC SLHDAI RPQST+ D VP YPVEKKKDVPNTSQ I SYT K
Sbjct: 361 GKGLERLPDKTNCVGSLHDAIYRPQSTSAVDLVPFYPVEKKKDVPNTSQDIISYTSKITD 420
Query: 421 RQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNVSFDRN 480
R+VDN +E+MIPC+VNESNASESGIKV+DGILATNPCIAECSGEK+ASGNLSDN+SFD+N
Sbjct: 421 RKVDNSYELMIPCIVNESNASESGIKVEDGILATNPCIAECSGEKLASGNLSDNISFDQN 480
Query: 481 RNGDHALITCQSN--SEHLSKLHAIIVSKETALSQAAIRALIRKRDKLSQQQRIIEDEIA 540
RNGDHALITCQSN SEHLSKL AIIVSKE ALSQAAIRALIRKRDKLS QQR+IEDEIA
Sbjct: 481 RNGDHALITCQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIA 540
Query: 541 QCDKNMQTILRGDEDDLVIKLDSVIDCCNDVCLRILPKIDLISALKKTAHLNMSQGRDCQ 600
QCDKNMQTILRGDEDDLV+KLDSVI+CCND+C R S + ++ + +C
Sbjct: 541 QCDKNMQTILRGDEDDLVLKLDSVIECCNDICPR--------STAEDKSYQYFEE--NCS 600
Query: 601 KQFSVYK------------IHELDGICHKNNWILPIYGVSSSDGGFQANVFLKGMDFEYS 660
Q+ K ELDGICHKNNWILP+YGVSS DGGFQANVF+KGMDFEYS
Sbjct: 601 SQYVTRKRLSEAILCIQNPCLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYS 660
Query: 661 SCGEQCSNPREARESAATKMLGQLWSMASQAK 679
SC E CS+PR+ARESAA KMLGQLW MA+ AK
Sbjct: 661 SCSELCSDPRDARESAAMKMLGQLWRMANLAK 682
BLAST of Sgr021853 vs. NCBI nr
Match:
XP_038884894.1 (uncharacterized protein LOC120075512 isoform X1 [Benincasa hispida] >XP_038884895.1 uncharacterized protein LOC120075512 isoform X1 [Benincasa hispida])
HSP 1 Score: 1069.3 bits (2764), Expect = 1.4e-308
Identity = 552/698 (79.08%), Positives = 597/698 (85.53%), Query Frame = 0
Query: 1 MSAPDVCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSLLQSVAKQVHAAVILYNYYHRK 60
MS PDVCPTEDAIHALLDYLVEPMLPAKSSSR+NPP++LLQSVAKQ+HA ++LYNYYHRK
Sbjct: 1 MSTPDVCPTEDAIHALLDYLVEPMLPAKSSSRENPPEALLQSVAKQMHAVILLYNYYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVIKPALLSHMKLMQRSDDTELENPEKQLSPAEKAIMDACDIA 120
QHPHLEFLSFEAFCKLAV++KPALLSHMKLMQ SDD ELENPEKQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEAFCKLAVIVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACDIA 120
Query: 121 TCLEASKDENVEGWSRSKVAVLLIDSKKEHCHLLFSFITQGVWSVIEQDLDTSECQPETV 180
TCLEAS +ENVEGW SKVAV LIDSK+EHC+LLFSFITQGVWSVIEQD+DTSECQPETV
Sbjct: 121 TCLEASTNENVEGWPLSKVAVFLIDSKREHCYLLFSFITQGVWSVIEQDIDTSECQPETV 180
Query: 181 EEEKHVNKKKRVIKKPSKEGLVVDEAKTQQLAYSAVKEATGINQSYLKILESHVVYSLSK 240
+EEKHVNKKKRVIKK SKEGLVVDEAKTQQLAY AVKEATGINQS LKILESHVVYSLSK
Sbjct: 181 DEEKHVNKKKRVIKKASKEGLVVDEAKTQQLAYKAVKEATGINQSDLKILESHVVYSLSK 240
Query: 241 EKSAVCFYMIQCTRSATEDVIQVPIKDAIDSLQGSLFRKNGRRWSITSKVEYFHILPYAK 300
EKSAVCFY+IQCTRSATEDVIQVPI+DA++SLQ LF+++GRRW ITSKVEYFHILPYAK
Sbjct: 241 EKSAVCFYIIQCTRSATEDVIQVPIRDAVNSLQDLLFKRSGRRWGITSKVEYFHILPYAK 300
Query: 301 MVQIWFHRETSTDSLRVIGGEKIDENLNKLERIDAPRKLEIQNNQDGASAKNLNKGTSIY 360
MV WFHRETS D+L IG EKIDENLN+ ERID RKL+IQN+Q+GASA ++ S
Sbjct: 301 MVLTWFHRETSLDNLGGIGEEKIDENLNRPERIDVTRKLKIQNDQNGASANHMYTEASTC 360
Query: 361 GEGLERLPDKTNCASSLHDAICRPQSTNVDDFVPSYPVEKKKDVPNTSQVIFSYTKKRNA 420
GEGLERL D TNC LHDAICRPQS NVDD VPSY EKKKDVPNTSQVI SYTKKRNA
Sbjct: 361 GEGLERLSDNTNCVGGLHDAICRPQSANVDDIVPSYTAEKKKDVPNTSQVIISYTKKRNA 420
Query: 421 RQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNVSFDRN 480
RQ DN +EVM PCM+NESNA ES IKVKDGILATNPCIAECSGEKIASGNLSDN+SFD+N
Sbjct: 421 RQADNHYEVMTPCMINESNALES-IKVKDGILATNPCIAECSGEKIASGNLSDNISFDQN 480
Query: 481 RNGDHALITCQSNSEHLSKLHAIIVSKETALSQAAIRALIRKRDKL--------SQQQRI 540
RN DHALITCQSN+EHLSKL AIIVSKETALSQAAI+ALIRKRDKL S QQ +
Sbjct: 481 RNDDHALITCQSNTEHLSKLQAIIVSKETALSQAAIKALIRKRDKLCNPFILSQSHQQHL 540
Query: 541 IEDEIAQCDKNMQTILRGDEDDLVIKLDSVIDCCNDVCLRILPKIDLISALKKTAHLNMS 600
IEDEIAQCDKNMQTIL+GDEDDLVIKLDSVI+CCNDVCLR S + ++
Sbjct: 541 IEDEIAQCDKNMQTILKGDEDDLVIKLDSVIECCNDVCLR--------STAEDKSYQYFE 600
Query: 601 QGRDCQKQFSVYK------------IHELDGICHKNNWILPIYGVSSSDGGFQANVFLKG 660
+ +C Q+ K ELDGICHKN WILP+YGVSS DGGFQANVF+KG
Sbjct: 601 E--NCSSQYVTRKRLSEAILCVQNPCKELDGICHKNYWILPVYGVSSIDGGFQANVFVKG 660
Query: 661 MDFEYSSCGEQCSNPREARESAATKMLGQLWSMASQAK 679
MDFEYSSCGE CS+PREARESAA KMLGQLW MAS K
Sbjct: 661 MDFEYSSCGELCSDPREARESAAMKMLGQLWRMASVGK 687
BLAST of Sgr021853 vs. ExPASy TrEMBL
Match:
A0A6J1DAH9 (uncharacterized protein LOC111018541 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111018541 PE=4 SV=1)
HSP 1 Score: 1113.6 bits (2879), Expect = 0.0e+00
Identity = 576/684 (84.21%), Positives = 609/684 (89.04%), Query Frame = 0
Query: 1 MSAPDVCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSLLQSVAKQVHAAVILYNYYHRK 60
MSA VCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSL QSVAKQVHA VILYNYYHRK
Sbjct: 1 MSALGVCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSLQQSVAKQVHAVVILYNYYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVIKPALLSHMKLMQRSDDTELENPEKQLSPAEKAIMDACDIA 120
QHPHLE LSFEAFCKLAVV+KPALLSHMKLMQ SDDTELENPEKQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLELLSFEAFCKLAVVVKPALLSHMKLMQSSDDTELENPEKQLSPAEKAIMDACDIA 120
Query: 121 TCLEASKDENVEGWSRSKVAVLLIDSKKEHCHLLFSFITQGVWSVIEQDLDTSECQPETV 180
TCLEASKDENVEGW SKVAVLLIDS+KE CHLLFSFITQGVWSVIEQDLDTSECQPETV
Sbjct: 121 TCLEASKDENVEGWPLSKVAVLLIDSRKECCHLLFSFITQGVWSVIEQDLDTSECQPETV 180
Query: 181 EEEKHVNKKKRVIKKPSKEGLVVDEAKTQQLAYSAVKEATGINQSYLKILESHVVYSLSK 240
EEEKHVNKK+RVIKKPSKE VVDEAKTQQLAYSAVKEATGINQ LKIL+ HVVYSLSK
Sbjct: 181 EEEKHVNKKRRVIKKPSKEVSVVDEAKTQQLAYSAVKEATGINQRDLKILDGHVVYSLSK 240
Query: 241 EKSAVCFYMIQCTRSATEDVIQVPIKDAIDSLQGSLFRKNGRRWSITSKVEYFHILPYAK 300
EKSAV FYMIQCT+SATEDVIQVPIKDA+DSLQGSLFRK+GRRWSITSKVE+FHILPYAK
Sbjct: 241 EKSAVRFYMIQCTQSATEDVIQVPIKDAMDSLQGSLFRKDGRRWSITSKVEHFHILPYAK 300
Query: 301 MVQIWFHRETSTDSLRVIGGEKIDENLNKLERIDAPRKLEIQNNQDGASAKNLNKGTSIY 360
MV W RETS DSLRV+ GEK+DENL+KLERIDAPRKLEIQN+QDG SA +L+KGTSIY
Sbjct: 301 MVLTWLQRETSRDSLRVVSGEKMDENLSKLERIDAPRKLEIQNDQDGDSANDLSKGTSIY 360
Query: 361 GEGLERLPDKTNCASSLHDAICRPQSTNVDDFVPSYPVEKKKDVPNTSQVIFSYTKKRNA 420
GEGLE+L +KTN SLHDAICRPQ TNVDD VPSYPV+KKKDVPNTSQVI SYTKKRNA
Sbjct: 361 GEGLEKLHNKTNHVGSLHDAICRPQITNVDDLVPSYPVDKKKDVPNTSQVIVSYTKKRNA 420
Query: 421 RQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNVSFDRN 480
RQVDN HEVMIPC NESNASESGIK+KDG+LATNPCIAECSGEKIASGN SDNVSFD+N
Sbjct: 421 RQVDNGHEVMIPCTGNESNASESGIKIKDGVLATNPCIAECSGEKIASGNFSDNVSFDQN 480
Query: 481 RNGDHALITCQSNSEHLSKLHAIIVSKETALSQAAIRALIRKRDKLSQQQRIIEDEIAQC 540
RNGDHALITCQSN EHLSKL AI+VSKETALSQAAIRALIRKRDKLS QQRIIEDEIAQC
Sbjct: 481 RNGDHALITCQSNIEHLSKLQAILVSKETALSQAAIRALIRKRDKLSHQQRIIEDEIAQC 540
Query: 541 DKNMQTILRGDEDDLVIKLDSVIDCCNDVCLRILPKIDLISALKKTAHLNMSQGRDCQKQ 600
DK +QTILRGDEDDLVIKLDSVI+CCNDVCLR + K+ N S +K+
Sbjct: 541 DKKVQTILRGDEDDLVIKLDSVIECCNDVCLRNTAEDGSYQCFKE----NCSSQYVTRKR 600
Query: 601 FSVYKI------HELDGICHKNNWILPIYGVSSSDGGFQANVFLKGMDFEYSSCGEQCSN 660
S + ELD ICHKNNWILP+Y +SSSDGGFQANVF+KG+DFEYSSC E CSN
Sbjct: 601 LSEAVLCVRSPCQELDAICHKNNWILPVYSISSSDGGFQANVFVKGLDFEYSSCSETCSN 660
Query: 661 PREARESAATKMLGQLWSMASQAK 679
PREAR SAATKMLGQLWS+ASQ K
Sbjct: 661 PREARASAATKMLGQLWSIASQRK 680
BLAST of Sgr021853 vs. ExPASy TrEMBL
Match:
A0A1S3BE29 (uncharacterized protein LOC103488666 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103488666 PE=4 SV=1)
HSP 1 Score: 1088.2 bits (2813), Expect = 0.0e+00
Identity = 556/696 (79.89%), Positives = 606/696 (87.07%), Query Frame = 0
Query: 1 MSAPDVCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSLLQSVAKQVHAAVILYNYYHRK 60
MSAP VCPTEDAIHALLDYLVEPMLPAKSSSR+NPP++LLQSVAKQ+HA V+LYN+YHRK
Sbjct: 1 MSAPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPEALLQSVAKQMHAVVLLYNFYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVIKPALLSHMKLMQRSDDTELENPEKQLSPAEKAIMDACDIA 120
QHPHLEFLSFEAFCKLAV++KPALLSHMKLMQ SDD ELENPEKQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEAFCKLAVIVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACDIA 120
Query: 121 TCLEASKDENVEGWSRSKVAVLLIDSKKEHCHLLFSFITQGVWSVIEQDLDTSECQPETV 180
TCLEAS DEN+EGW SKVAV L+DSKKEHC+LLFSFITQGVWSVIEQD+D+SE QPETV
Sbjct: 121 TCLEASPDENIEGWPLSKVAVFLVDSKKEHCYLLFSFITQGVWSVIEQDIDSSEWQPETV 180
Query: 181 EEEKHVNKKKRVIKKPSKEGLVVDEAKTQQLAYSAVKEATGINQSYLKILESHVVYSLSK 240
+EE+HVNKKKRVIKKPSKEGLVVDE KTQQ+AY+AVKEATGINQS LKILESHVVYSLSK
Sbjct: 181 DEERHVNKKKRVIKKPSKEGLVVDETKTQQVAYTAVKEATGINQSDLKILESHVVYSLSK 240
Query: 241 EKSAVCFYMIQCTRSATEDVIQVPIKDAIDSLQGSLFRKNGRRWSITSKVEYFHILPYAK 300
EKSAVCFYMIQCTRSATEDVIQVPI+D ++SLQ SLFRK+GRRWSITSKVEYFHILPYAK
Sbjct: 241 EKSAVCFYMIQCTRSATEDVIQVPIRDVVNSLQDSLFRKSGRRWSITSKVEYFHILPYAK 300
Query: 301 MVQIWFHRETSTDSLRVIGGEKIDENLNKLERIDAPRKLEIQNNQDGASAKNLNKGTSIY 360
M WFHRE+S+D L VIG EK+DENLN+ ERID R+L++QNNQ+GASA NLN +IY
Sbjct: 301 MALTWFHRESSSDKLGVIGEEKVDENLNRPERIDVIRRLKVQNNQNGASANNLNIRANIY 360
Query: 361 GEGLERLPDKTNCASSLHDAICRPQSTNVDDFVPSYPVEKKKDVPNTSQVIFS----YTK 420
G+G ERLPDKTNC SLHDAI RPQST+VDD VPSYPVEKKKDVPNTSQ I S YTK
Sbjct: 361 GKGFERLPDKTNCVGSLHDAIYRPQSTSVDDLVPSYPVEKKKDVPNTSQAIVSYTKTYTK 420
Query: 421 KRNARQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNVS 480
K RQVDN +E+MIPCMVNES+ASESGIK KDGILATNPCIAECSGEKIASGNLSDN+S
Sbjct: 421 KITDRQVDNSYELMIPCMVNESDASESGIKAKDGILATNPCIAECSGEKIASGNLSDNIS 480
Query: 481 FDRNRNGDHALITCQSNSEHLSKLHAIIVSKETALSQAAIRALIRKRDKLSQQQRIIEDE 540
FD+NRNGDHALITCQSN+EHLSKL AIIVSKETALSQAAI+ALIRKRDKLS QQR+IEDE
Sbjct: 481 FDQNRNGDHALITCQSNAEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDE 540
Query: 541 IAQCDKNMQTILRGDEDDLVIKLDSVIDCCNDVCLRILPKIDLISALKKTAHLNMSQ--G 600
IAQCDKNMQTILRGDEDDLV+KLDSVIDCCND+C + TA Q
Sbjct: 541 IAQCDKNMQTILRGDEDDLVLKLDSVIDCCNDLC-------------QSTAEDKSYQYFE 600
Query: 601 RDCQKQFSVYK------------IHELDGICHKNNWILPIYGVSSSDGGFQANVFLKGMD 660
+C Q+ K ELDGICHKNNWILP+YGVSS DGGFQANVF+KGMD
Sbjct: 601 ENCSSQYVTRKRLSEAILCIQNPCQELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMD 660
Query: 661 FEYSSCGEQCSNPREARESAATKMLGQLWSMASQAK 679
FEYSSCGE CS+PR+ARESAA KMLGQLW MA+QAK
Sbjct: 661 FEYSSCGELCSDPRDARESAAMKMLGQLWRMANQAK 683
BLAST of Sgr021853 vs. ExPASy TrEMBL
Match:
A0A6J1KZE5 (uncharacterized protein LOC111497732 OS=Cucurbita maxima OX=3661 GN=LOC111497732 PE=4 SV=1)
HSP 1 Score: 1057.7 bits (2734), Expect = 2.0e-305
Identity = 541/680 (79.56%), Positives = 590/680 (86.76%), Query Frame = 0
Query: 1 MSAPDVCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSLLQSVAKQVHAAVILYNYYHRK 60
MSAP VCPTEDAI LLDYLVEPMLPAKS SR+NPPQSLLQSVAKQVHA V+LYNYYHRK
Sbjct: 1 MSAPGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVIKPALLSHMKLMQRSDDTELENPEKQLSPAEKAIMDACDIA 120
QHPHLEFLSFE FCKLAVV+KPALLSHMKLMQ SDD ELENPE QLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEEFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
Query: 121 TCLEASKDENVEGWSRSKVAVLLIDSKKEHCHLLFSFITQGVWSVIEQDLDTSECQPETV 180
TCL+ASKD++VEGW SKVAVLLIDSK+E CHLLFS ITQGVWSVIEQDLDTSECQPET+
Sbjct: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETM 180
Query: 181 EEEKHVNKKKRVIKKPSKEGLVVDEAKTQQLAYSAVKEATGINQSYLKILESHVVYSLSK 240
+EEKHVNKKKRVIKKPSKEG VDE KTQQLAYS V++ATGINQS LKILESHVVYS SK
Sbjct: 181 DEEKHVNKKKRVIKKPSKEG-PVDEIKTQQLAYSTVRKATGINQSDLKILESHVVYSHSK 240
Query: 241 EKSAVCFYMIQCTRSATEDVIQVPIKDAIDSLQGSLFRKNGRRWSITSKVEYFHILPYAK 300
KSAVCFY+IQCTRSATEDVIQVPIKD IDSLQ SLF+ NGRRWSITSKVEYFHILPYA+
Sbjct: 241 AKSAVCFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYAR 300
Query: 301 MVQIWFHRETSTDSLRVIGGEKIDENLNKLERIDAPRKLEIQNNQDGASAKNLNKGTSIY 360
M+ IWFH TST+SLRVIGG K+DENLNK ERID R LEIQ+NQDGA+A NLNKGTS Y
Sbjct: 301 MMLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVTRTLEIQDNQDGANAYNLNKGTSTY 360
Query: 361 GEGLERLPDKTNCASSLHDAICRPQSTNVDDFVPSYPVEKKKDVPNTSQVIFSYTKKRNA 420
GEGLERLPDKTN SSL+D +CRPQ++NVDD VPSYPVEKKKDVPNTSQV FS TKK+NA
Sbjct: 361 GEGLERLPDKTNYISSLNDVMCRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSCTKKKNA 420
Query: 421 RQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNVSFDRN 480
RQVDN + VMIPCMVNESNASESGIKVKD ILA NPC+AECSGEKIASGNLSDN+S D+
Sbjct: 421 RQVDNSYAVMIPCMVNESNASESGIKVKDRILAANPCLAECSGEKIASGNLSDNISLDQY 480
Query: 481 RNGDHALITCQSNSEHLSKLHAIIVSKETALSQAAIRALIRKRDKLSQQQRIIEDEIAQC 540
RNGDHAL+TCQSN+EHL+KL II+SKETALSQAAI+AL RKRDKLS QQRIIED+IAQC
Sbjct: 481 RNGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIAQC 540
Query: 541 DKNMQTILRGDEDDLVIKLDSVIDCCNDVCLRILPKIDLISALKKTAHLNMSQGRDCQKQ 600
DKNMQTILRGDED LVIKLDSVI+CC DVC+R + + ++ + +
Sbjct: 541 DKNMQTILRGDEDGLVIKLDSVIECCYDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEA 600
Query: 601 FSVYK--IHELDGICHKNNWILPIYGVSSSDGGFQANVFLKGMDFEYSSCGEQCSNPREA 660
+ ELD IC KNNWILP+YGVS+SDGGFQANV +KGMDF YSSC E C +P EA
Sbjct: 601 ILCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVLVKGMDFAYSSCSELCPDPCEA 660
Query: 661 RESAATKMLGQLWSMASQAK 679
R+SAATKMLGQLW+MASQ K
Sbjct: 661 RKSAATKMLGQLWTMASQTK 679
BLAST of Sgr021853 vs. ExPASy TrEMBL
Match:
A0A6J1HAN9 (uncharacterized protein LOC111461089 OS=Cucurbita moschata OX=3662 GN=LOC111461089 PE=4 SV=1)
HSP 1 Score: 1052.0 bits (2719), Expect = 1.1e-303
Identity = 540/680 (79.41%), Positives = 588/680 (86.47%), Query Frame = 0
Query: 1 MSAPDVCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSLLQSVAKQVHAAVILYNYYHRK 60
MSA VCPTEDAI LLDYLVEPMLPAKS SR+NPPQSLLQSVAKQVHA V+LYNYYHRK
Sbjct: 1 MSATGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVIKPALLSHMKLMQRSDDTELENPEKQLSPAEKAIMDACDIA 120
QHPHLEFLSFEAFCKLAVV+KPALLSHMKLMQ SDD ELENPE QLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
Query: 121 TCLEASKDENVEGWSRSKVAVLLIDSKKEHCHLLFSFITQGVWSVIEQDLDTSECQPETV 180
TCL+ASKD++VEGW SKVAVLLIDSK+E CHLLFS ITQGVWSVIEQDLDTSECQPETV
Sbjct: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
Query: 181 EEEKHVNKKKRVIKKPSKEGLVVDEAKTQQLAYSAVKEATGINQSYLKILESHVVYSLSK 240
+EEKHVNKKKRVIKKPSKEG VDE KTQQLAYS V++ATGINQ+ LKILESHVVYS SK
Sbjct: 181 DEEKHVNKKKRVIKKPSKEG-PVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSK 240
Query: 241 EKSAVCFYMIQCTRSATEDVIQVPIKDAIDSLQGSLFRKNGRRWSITSKVEYFHILPYAK 300
KSAV FY+IQCTRSATEDVIQVPIKD IDSLQ SLF+ NGRRWSITSKVEYFHILPYA+
Sbjct: 241 AKSAVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYAR 300
Query: 301 MVQIWFHRETSTDSLRVIGGEKIDENLNKLERIDAPRKLEIQNNQDGASAKNLNKGTSIY 360
M+ IWFH TST+SLRVIGG K+DENLNK ERID R LEIQ+NQDGASA NLNKGTS Y
Sbjct: 301 MMLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTY 360
Query: 361 GEGLERLPDKTNCASSLHDAICRPQSTNVDDFVPSYPVEKKKDVPNTSQVIFSYTKKRNA 420
GEGLERLPDKTN SSL+D + RPQ++NVDD VPSYPVEKKKDVPNTSQV FSY KK+NA
Sbjct: 361 GEGLERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNA 420
Query: 421 RQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNVSFDRN 480
RQ DNR VMIPCMVNE NASESGIKVKD ILATNPC AECSGEKIASGNLSDN+S D+
Sbjct: 421 RQADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSGEKIASGNLSDNISLDQY 480
Query: 481 RNGDHALITCQSNSEHLSKLHAIIVSKETALSQAAIRALIRKRDKLSQQQRIIEDEIAQC 540
RNGDHAL+TCQSN+EHL+KL II+SKETALSQAAI+AL RKRDKLS QQRIIED+IA+C
Sbjct: 481 RNGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARC 540
Query: 541 DKNMQTILRGDEDDLVIKLDSVIDCCNDVCLRILPKIDLISALKKTAHLNMSQGRDCQKQ 600
DKNMQTILRGDED LVIKLDSVI+CCNDVC+R + + ++ + +
Sbjct: 541 DKNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEA 600
Query: 601 FSVYK--IHELDGICHKNNWILPIYGVSSSDGGFQANVFLKGMDFEYSSCGEQCSNPREA 660
+ ELD IC KNNWILP+YGVS+SDGGFQANV++KGMDF YSSC E C +P EA
Sbjct: 601 ILCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGMDFAYSSCSELCPDPCEA 660
Query: 661 RESAATKMLGQLWSMASQAK 679
R+SAATKMLGQLW+MASQ K
Sbjct: 661 RKSAATKMLGQLWTMASQTK 679
BLAST of Sgr021853 vs. ExPASy TrEMBL
Match:
A0A6J1D888 (uncharacterized protein LOC111018541 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111018541 PE=4 SV=1)
HSP 1 Score: 1032.3 bits (2668), Expect = 8.9e-298
Identity = 533/638 (83.54%), Positives = 566/638 (88.71%), Query Frame = 0
Query: 47 VHAAVILYNYYHRKQHPHLEFLSFEAFCKLAVVIKPALLSHMKLMQRSDDTELENPEKQL 106
VHA VILYNYYHRKQHPHLE LSFEAFCKLAVV+KPALLSHMKLMQ SDDTELENPEKQL
Sbjct: 8 VHAVVILYNYYHRKQHPHLELLSFEAFCKLAVVVKPALLSHMKLMQSSDDTELENPEKQL 67
Query: 107 SPAEKAIMDACDIATCLEASKDENVEGWSRSKVAVLLIDSKKEHCHLLFSFITQGVWSVI 166
SPAEKAIMDACDIATCLEASKDENVEGW SKVAVLLIDS+KE CHLLFSFITQGVWSVI
Sbjct: 68 SPAEKAIMDACDIATCLEASKDENVEGWPLSKVAVLLIDSRKECCHLLFSFITQGVWSVI 127
Query: 167 EQDLDTSECQPETVEEEKHVNKKKRVIKKPSKEGLVVDEAKTQQLAYSAVKEATGINQSY 226
EQDLDTSECQPETVEEEKHVNKK+RVIKKPSKE VVDEAKTQQLAYSAVKEATGINQ
Sbjct: 128 EQDLDTSECQPETVEEEKHVNKKRRVIKKPSKEVSVVDEAKTQQLAYSAVKEATGINQRD 187
Query: 227 LKILESHVVYSLSKEKSAVCFYMIQCTRSATEDVIQVPIKDAIDSLQGSLFRKNGRRWSI 286
LKIL+ HVVYSLSKEKSAV FYMIQCT+SATEDVIQVPIKDA+DSLQGSLFRK+GRRWSI
Sbjct: 188 LKILDGHVVYSLSKEKSAVRFYMIQCTQSATEDVIQVPIKDAMDSLQGSLFRKDGRRWSI 247
Query: 287 TSKVEYFHILPYAKMVQIWFHRETSTDSLRVIGGEKIDENLNKLERIDAPRKLEIQNNQD 346
TSKVE+FHILPYAKMV W RETS DSLRV+ GEK+DENL+KLERIDAPRKLEIQN+QD
Sbjct: 248 TSKVEHFHILPYAKMVLTWLQRETSRDSLRVVSGEKMDENLSKLERIDAPRKLEIQNDQD 307
Query: 347 GASAKNLNKGTSIYGEGLERLPDKTNCASSLHDAICRPQSTNVDDFVPSYPVEKKKDVPN 406
G SA +L+KGTSIYGEGLE+L +KTN SLHDAICRPQ TNVDD VPSYPV+KKKDVPN
Sbjct: 308 GDSANDLSKGTSIYGEGLEKLHNKTNHVGSLHDAICRPQITNVDDLVPSYPVDKKKDVPN 367
Query: 407 TSQVIFSYTKKRNARQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKI 466
TSQVI SYTKKRNARQVDN HEVMIPC NESNASESGIK+KDG+LATNPCIAECSGEKI
Sbjct: 368 TSQVIVSYTKKRNARQVDNGHEVMIPCTGNESNASESGIKIKDGVLATNPCIAECSGEKI 427
Query: 467 ASGNLSDNVSFDRNRNGDHALITCQSNSEHLSKLHAIIVSKETALSQAAIRALIRKRDKL 526
ASGN SDNVSFD+NRNGDHALITCQSN EHLSKL AI+VSKETALSQAAIRALIRKRDKL
Sbjct: 428 ASGNFSDNVSFDQNRNGDHALITCQSNIEHLSKLQAILVSKETALSQAAIRALIRKRDKL 487
Query: 527 SQQQRIIEDEIAQCDKNMQTILRGDEDDLVIKLDSVIDCCNDVCLRILPKIDLISALKKT 586
S QQRIIEDEIAQCDK +QTILRGDEDDLVIKLDSVI+CCNDVCLR + K+
Sbjct: 488 SHQQRIIEDEIAQCDKKVQTILRGDEDDLVIKLDSVIECCNDVCLRNTAEDGSYQCFKE- 547
Query: 587 AHLNMSQGRDCQKQFSVYKI------HELDGICHKNNWILPIYGVSSSDGGFQANVFLKG 646
N S +K+ S + ELD ICHKNNWILP+Y +SSSDGGFQANVF+KG
Sbjct: 548 ---NCSSQYVTRKRLSEAVLCVRSPCQELDAICHKNNWILPVYSISSSDGGFQANVFVKG 607
Query: 647 MDFEYSSCGEQCSNPREARESAATKMLGQLWSMASQAK 679
+DFEYSSC E CSNPREAR SAATKMLGQLWS+ASQ K
Sbjct: 608 LDFEYSSCSETCSNPREARASAATKMLGQLWSIASQRK 641
BLAST of Sgr021853 vs. TAIR 10
Match:
AT1G05950.1 (unknown protein; Has 50 Blast hits to 45 proteins in 14 species: Archae - 5; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )
HSP 1 Score: 337.0 bits (863), Expect = 3.4e-92
Identity = 240/676 (35.50%), Positives = 367/676 (54.29%), Query Frame = 0
Query: 5 DVCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSLLQSVAKQVHAAVILYNYYHRKQHPH 64
D CPTEDAI ALL+ LV+P+LP+K + D P S+ +SVAKQVHA V+LYNYYHRK +PH
Sbjct: 15 DSCPTEDAIRALLESLVDPLLPSKPTD-DLPSTSIRESVAKQVHAVVLLYNYYHRKDNPH 74
Query: 65 LEFLSFEAFCKLAVVIKPALLSHMKLMQRSDDTELENPEKQLSPAEKAIMDACDIATCLE 124
LE LSFE+F LA V+KPALL H+K E Q EK I+DAC ++ L+
Sbjct: 75 LECLSFESFRSLATVMKPALLQHLK--------EDGGVSGQTVLLEKVIVDACSLSMSLD 134
Query: 125 ASKDENV-EGWSRSKVAVLLIDSKKEHCHLLFSFITQGVWSVIEQDLDTSECQPETVEEE 184
AS D + +VAVLL+DS+K+ C+L S ITQGVWS++ E
Sbjct: 135 ASSDLFILNKCPIRRVAVLLVDSEKKSCYLQHSSITQGVWSLL----------------E 194
Query: 185 KHVNKKKRVIKKPSKEGLVVDEAKTQQLAYSAVKEATGINQSYLKILESHVVYSLSKEKS 244
K + K+K + +EG+ Q++A++ VKEATG+N + ILE H+V SLS+EK+
Sbjct: 195 KPIEKEKAARENQKEEGVF------QKVAFAVVKEATGVNHKDIVILERHLVCSLSEEKT 254
Query: 245 AVCFYMIQCTRSATEDVIQVPIKDAIDSLQGSLFRKNGRRWSITSKVEYFHILPYAKMVQ 304
AV FY+++CT S + + P+++ + +QG LF K+ W++ S VEYFH+LPYA +++
Sbjct: 255 AVRFYIMKCT-SQDKFSGENPVEEVLSCMQGPLFEKSFSDWTMNSIVEYFHVLPYATLIE 314
Query: 305 IWFHRETSTDSLRVIGGEKIDENLNKLERIDAPRKLEIQNNQDGASAKNLNKGTSIYGEG 364
WF R T+ + E + +++ ++DA ++ E+ + + L + I
Sbjct: 315 DWFSRRGDTEFVIEKEPEAVCDDIES-NKVDATKESEVSDIFERREKAALKRRYEIKA-- 374
Query: 365 LERLPDKTNCASSLHDAICRPQSTNVDDFVPSYPVEKKKDVPNTSQVIFSYTKKRNARQV 424
K A H +T + + + K+ S+ + + A+ V
Sbjct: 375 ------KKVAALLSHPGARGKATTRLQNRYLKGSMSGAKEPNVHSETVVAL----KAKNV 434
Query: 425 DNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNVSFDRNRNG 484
N M PC N SN + G +V A++P +++ L + N
Sbjct: 435 GNE---MSPCKDNYSNGEKGGFEV-----ASDP-------KELKERGLQRKKAVPDRLNS 494
Query: 485 DHAL----ITCQSNSEHLSKLHAIIVSKETALSQAAIRALIRKRDKLSQQQRIIEDEIAQ 544
H L + +++ +L +L ++SK T+LS+ A++ L+ KRDKL++QQR IEDEIA+
Sbjct: 495 IHKLNSTPASAHNSNPNLEELQTSLLSKATSLSETALKVLLCKRDKLTRQQRNIEDEIAK 554
Query: 545 CDKNMQTILRGDEDDLVIKLDSVIDCCNDVCLRILPKIDLISALKKTA-----HLNMSQG 604
CDK +Q I + D ++L++V++CCN+ P+ +L +L K+A L +S+
Sbjct: 555 CDKCIQNI----KGDWELQLETVLECCNET----YPRRNLQESLDKSACQSNKRLKLSET 614
Query: 605 RDCQKQFSVYKIHELDGICHKNNWILPIYGVSSSDGGFQANVFLKGMDFEYSSCGEQCSN 664
K LD IC NNW+LP Y V+ SDGG++A V + G + GE+ S+
Sbjct: 615 LPSTKSL----CQRLDDICLMNNWVLPNYRVAPSDGGYEAEVRITGNHVACTIHGEEKSD 618
Query: 665 PREARESAATKMLGQL 671
EARESAA +L +L
Sbjct: 675 AEEARESAAACLLTKL 618
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022150346.1 | 0.0e+00 | 84.21 | uncharacterized protein LOC111018541 isoform X1 [Momordica charantia] >XP_022150... | [more] |
XP_008445716.1 | 0.0e+00 | 79.89 | PREDICTED: uncharacterized protein LOC103488666 isoform X1 [Cucumis melo] >XP_00... | [more] |
XP_038884896.1 | 0.0e+00 | 80.00 | uncharacterized protein LOC120075512 isoform X2 [Benincasa hispida] | [more] |
XP_011656540.1 | 5.2e-310 | 79.48 | uncharacterized protein LOC101206764 isoform X1 [Cucumis sativus] >XP_011656541.... | [more] |
XP_038884894.1 | 1.4e-308 | 79.08 | uncharacterized protein LOC120075512 isoform X1 [Benincasa hispida] >XP_03888489... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1DAH9 | 0.0e+00 | 84.21 | uncharacterized protein LOC111018541 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
A0A1S3BE29 | 0.0e+00 | 79.89 | uncharacterized protein LOC103488666 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A6J1KZE5 | 2.0e-305 | 79.56 | uncharacterized protein LOC111497732 OS=Cucurbita maxima OX=3661 GN=LOC111497732... | [more] |
A0A6J1HAN9 | 1.1e-303 | 79.41 | uncharacterized protein LOC111461089 OS=Cucurbita moschata OX=3662 GN=LOC1114610... | [more] |
A0A6J1D888 | 8.9e-298 | 83.54 | uncharacterized protein LOC111018541 isoform X2 OS=Momordica charantia OX=3673 G... | [more] |
Match Name | E-value | Identity | Description | |
AT1G05950.1 | 3.4e-92 | 35.50 | unknown protein; Has 50 Blast hits to 45 proteins in 14 species: Archae - 5; Bac... | [more] |
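The Identity column in the tables above matches the percentage printed on each HSP summary line, which is the count of identical positions divided by the alignment length. As a minimal sketch (the function names `parse_hsp_summary` and `percent` are hypothetical, not part of any BLAST tool), the summary lines on this page could be parsed and the percentages recomputed like this:

```python
import re

# Matches the HSP summary lines shown on this page, e.g.
# "Identity = 552/698 (79.08%), Positives = 597/698 (85.53%), Query Frame = 0".
# The optional "i" also accepts the "Postives" spelling some page dumps contain.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_summary(line):
    """Extract counts and reported percentages from one HSP summary line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
        # Recomputed from the raw counts, for cross-checking the report.
        "identity_pct": percent(int(ident), int(ident_len)),
        "positives_pct": percent(int(pos), int(pos_len)),
    }


if __name__ == "__main__":
    line = "Identity = 552/698 (79.08%), Positives = 597/698 (85.53%), Query Frame = 0"
    hsp = parse_hsp_summary(line)
    print(hsp["identity_pct"], hsp["positives_pct"])
```

Comparing the recomputed values with the reported ones is a quick way to catch transcription errors when alignment summaries are copied out of a page like this one.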