Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGATCATTTGCAGAGCTTTGAGGCTCAACTTGGGGACGCCATTGCCGCCGCTAACGTCCGGCGTCTTCGCTAGACAAGCACAATATTGCCAGACGTCTTCTCTTCGGCTGCGCAACAAATGGGTCTCCCTTTCCGCTGCCGAAGGCTTCGACTGGAACTCGAGCGACTACTTTGCGAAGAACTGTAATTTGAAGAGGGGGAGTGGTTTTTATGGTGACCGGGATGATGTTGAAGAAGGGGATGGAGAGAGAGAGAGAGCTGTGCGTTGTGAAGTGGAGGTTATATCGTGGAGGGAGCGGCGAATTCGGGCTGATATATTTGTTAATGCTGGGATTGAATCCGTTTGGAACGCTCTTACTGATTACGAGCGCCTCGCGGATTTTATACCCAATCTTGTTTCCAGGTAGTGTGGGTGACTTCTTTTGTAAAAATGGTTATTCATTCCATGTTATTCGGGAAAATGCGCGTGAGAGGTATTCTATTCCTAGTTGTATTGCAGAAAATTGCATGTCTGTTGTTATTGCAAATTTTCTTCATGTGGTCTACTTGGGGAGATTACTGTTTGATATTTGTGTCAGTTAATCTCTTTTTCCCAGAGAGTCTAGGGACTTGTGATGTTTGCAACGGGAAAATGTGATGGAGGAGACTTCTGGGGTTGGAGTGTCATTTCTAAGGTTGAGATGACAGTTTCCATGATAAAAAGTGCAAAAACTTGGGTTGATTGCATAAAACTCTCTTAGAATTGTGCTGATTGTGGGAATCAGTTTGGACTAGGCTAGTTGATTGACAACTTTCTTAACTACTTGGCCCAGGAGCATCAGTCATGGTAGTAGTTCTAGCTCAATTAAGCTCCTGTCCAGGACATGGAAATATGAGTTAAAAGGTTTTGGTGGTGTGCAAGTGTGGCATTCAATTGAGATGGTTCTGGCAGTGGCTGCACTCGAGTAGAGACATGGTTTGTTATATTGTTTGAGCAACATGTCACCTTGGTTGCCATATGAGGGTTGCGCGAATTGTTAAGGTGGTTGTCTTAGTTGTCGATGGGTGTATTGAGTCGAGGGGTAGATTGAGCCCAGACAGTTGTGACCTTCAAAATTTTTAGAATTCTATGAAATTTTTTGTACTACATCGTCTATGATTTATATTTGTACCTTCTAGAACTGTTACATAATGAATCAAACGTTCATGTTATGATGATCATTTTTAATGACAAAAAAAAAGAAATAATATTTACACTTGTATGTGTCATTGTCATATACCTTAAAATGTATATGCCTCAGACCCAACTACTCTGTAGTGTATTGCTTCCAAACCATTTTACAAGATAATTGACTTTCAATTGCTCACATATTCCCTAAAGATCTTGCTCTAGCTATTGCACGTTCTTTGTGATGTTTTCTTTATCTTTACCATGTAAGTGGGCTGGTCATTATGTTATGTCCAGTGGGAGAATACCTTGTCCACATACTGGTCGGATATGGTTGGAACAGAGAGGTTTGCAAAGGGCATTGTATTGGCATATTGAAGCTCGAGTTGTCTTGGATCTTCAAGAGCTTCTAAATTCTGTAAGAAAAGCCTTATTTATCTGTTTATGAAATCTGTCTAATCCTGTCTGGTTTCTTTCTGAACTGTTTCAATCTTTTTAGCATTTAGTTATGTAATTAGCAACATAGGTATCTCTAAAAAACCTATTTTCTTGTACCTCTCTTGGTATTGACTATTGAGTTAAGACCATATTAATTTTAAGGCTAGTCAATTGTTGGTCTTAGGATGGTAGTCGTGAACTCCATTTTTCCATGGTTGATGGGGACTTCAAAAAGTTTGAAGGCAAATGGTCATTAAAAGCTGGTACAAGGTAAAATTTTGTTTATTTGTTCTTTAACCATAATTTTAAGAATTTAAGAATATAACAATACAATATTTTCTATATCATTCATGGGCATTGGTTTATAAGGTTAAATTACTATTTTAGTTTCATGGTTTGAGCTTCATTTCAATT
TGGTCTCTATGGTCGTAAAAGTTACAGCTTATTTGTCTAAATTATTATTTTAGTTCCTATGGTTTGATCTTGATTTCAATTTGGTGTCTATAGTTTTAAAAGTTTAGTTGTAGCCCTTATTTATGCTTGGTTAAATCTCCCGAAACATTCTTCCCATTACTAGATGTTAACAAGTGTCTTATGTGGCAGGTTTTTTGGACATGATAGGGAGAGAAAAAGGGATTGGTTTTTGGGATTAAGGTTTTTGAAGATGTCTAGCAAGACCTACTTTTCTACACAAACCTGTTTCCTCTTTACTCAAAATCCCAAATGATTTCTTCTATTATTAAACAACTCAAACCATAGTTAGCTTAATATCTTCAACCAATTTGATCCTTATGCTCAATGTTTGCTCTATCACCTCTTTGATATGTCCGAAGAAAACATGCCACATAAGATTTGGTAATGACAGAACTGTTTTGGAAAATTTAACCAAACTATAAGAGCTATGAAAATTTTGAAGTCATAGGGACATATTGAAACCAAACTCAAAGCATAGGGGCTAAAATGGTAATTTAACCGGCTTATTTATATATGATAATAGATGATATATAATATATGACGTGAAATTTCCATGGTTTATACTTGGTAAATACAATGTCGTAACTTCATTTTTATTTTTCTGCCAAAAGTTTATTGTTATCTGGTATTTTTCTACCATAATACTAGTTTCATCCGATATGATCACAGCTTGCTTCACGTGCAGTTTATGAGATTGTTTACTTGTTGAAATCTTCTCAAGTTTTCATGTTCAGATATTTTCCTTTTTACTTATTAATATGCATGGTTATCACTGCTATTTTCTGTAGGTCACTCCCTACAATGTTGTCATATGAAGTTAATGTAATACCAAGATTCAATTTTCCTGCCATTCTTCTAGAGCGAATAATCAGATCAGACCTTCCTGTGAATCTTCGAGCCTTGGCTTGTAGAGCCGAAGAGAATTCTGAAGGGGGTCAAAGAGTAGGAACCATTGTAGATTCCAAGTCCATGGTTCAAACTAATACAGTTAATGGTGCTTCATGTGAAAAGGATGAATTATTACAGGAAACTTCCAGAGGGGGTAATTCTATTTCCAATTTAGGACCCTTGCCCCCATTATTTAATGAATTGAATAGCAACTGGGGAGTTTTTGGAAAAGTTTGCAGACTTGACAAACGATGCATGGTAGATGAAGTTCATCTTCGCAGATTTGATGGTTTGTTGGTATGTGAATCAATCACCCTAGCATGTCTGACTTACAGGGATAATTTGTGATCCTCTCAGCAGAAATATTTGCTTCTGAATCCCTTATTCTTTATAATGGTTCTACTCTAACATGTTATGGTATTTATTGATGACATCCTTTTAGACCCATCATCAAAACCATTAGAGTAGTAGAAAGAAAAAAGTGGACAGTTTTTCAATGGAGATTATCCAATTTCGTATGTCAGGCTCTTTAATTGTCAAAATTTCTTATAAGATAAATTTCACTGAAGGAAAATGGAGGTGTTCATCGCTGTGTGGTCGCTAGCATAACAGTGAAAGCTCCTGTTCGTGAAGTCTGGAATGTCCTGACTGCTTATGAAAGTCTTCCCGAGTAAGTAATCTGTGCCTCTTTTCTTCAAATATTTTTAATTTTCTCTTCTATTAAAAAAAAATCTTATTTTCATTGCATGATAATTGGAGAGAGTTGTTTTGTATAGGTTACTACATTAATTTTATCTTTGCAAGAGTTGCTCATGCTCTTGTTCAAACTTTGTTGAATTAATTTTAGTGACTTCTCAAATGAAAATGTAAAACATATTACAACTGTGGGCATTAATTGAACAAAACAGCTAGAGTTTGGAACAAGAGTTTGATGGAAATATTATATTTTTCATGATAAAGCAATTGACTTCCTGAGTTATCAGTTGAAGAAAGTGGCATGGAATCAAACGTATAGAGGATGAAAGACACATAATCAGGCAATGGAAATGCTT
AACGTTCATAAAAATAAAATGAATTATGGATTCAAAAGTTTTCTTCATTGGCTAGAAGTTTTCACAGGACAGGTTTAAAAGAGATATGGGGTCAAAATTGGGATGACAATATTTACACCACATTTTTTTAATCTTCATCCAAATTTAATGTCTTAAAATTTGTTTAGTTAATCAATTTTAGAACACCCTTATCTCAAATTCACCAATAACTGTTTTGCCATTATGGTGCCTTCTCTTTCTCCTCCCCTTTTATGGGTCTATGTTCCTTCCCCCATCATGCAAAGAAATAGAGAGGTTGTGAGGTGGGCAACCATTGTTGGAGGAATGTGGATGTAAAATTAAATTCAAGTTAGGATTAAGTGTATCATTGTGATACAAAAATTGCATATAGTGAGCAGAAAGGTTGATGTATATCTCTGATGGGAGATATATTAGGATTTTTGTATACTCCTTATGGGTGATTTAACTCAATGTAATATTTATTAAATCAACAAGTGTTTCTTCTTCTTCTGCTTCTTCTTATTCCCCAAAGAAAGGAGTCACAGTCTACCCCCATCAAAAGAGGACTCAGTCTCCTAACCGAGGTCTCCGTCAGAATCTACCGGCATAAACATGATTGAGTCGATGATAGCATATCCAAGAGATAACATTTTAAGGATTTGGAGATTATTAGAGAGGAGCCTACAATTTGGGTAATATTTTAGGAGTGCTTGTAGATGAGTTAGGAACTTTGTTGTTTGGAATGGCATCATGTTGATTACGTGAGTTGCAATTAGAACAATTTCACCCCATAAATGATTAGGGGCATTAGTAGACAGTGCTAGGCATCTTGCCACTTCTAAAAGTTGTCTATTTTTCCCCCAAAAATGTTTAGGAAACTCTATAGCTAAGTGTTATTAGCTTGGAGCAATTCTAAGCAGGGGGCGTAAATTTTTTTCTTTCTAGAAAGCATGTGCACGTGGACAAAGCATGCAAGTAGTCTTCAATCTCATCTCAAAAACAGCTTAAGACATTATGTTTAGGTTGCAGGGATGCAAGGAAAGCTAAGGTTGTTGCGGTTCGAATCTGATTTTTTTTTTTTTTTTTTTTTTGATCAGAAACAATTCCATTGATTGAATGACATTACCAAAAAAAGGGGAGAACCCCATTCCAAGGGAGTTACAAAAAATCTCTCCAATTGGATAAGAGAGTAGAAAAACTATAAGAATGAAAGAGAGGGTTACATTTACACCAAGTGATAACTAAAAAAACTATATTGTCGTAAAAGTAGGGAAAGTGTAGCTATTTTTCCTTGAAGACCCTATGATTCCTTTGTTGCCGGGTAGACCAAAATAAAGCTCTCATGAAGTGTAACCAAAGGAGCTTTTTCTCCTTCTTAAGCGTGCCCCAACAAAACAAAGGAGAGAAGAGCTAAAGGATCGATGGGTAGAACCGTAGACCATCCAAAGGCTGACAGCGTGTTCCTCCAGAAGTCCTCAGCATAGTCACAAGTTAAAAAAAGATGGCTTTGTGTCTCAGAGCTTCTTCTACACATAGAACATCAAAAAGGTGAGATGTTAATGTAAGGCATTCTCCTTTGAAGCTTGTCCTGAGTATTAATTGCTTTGTGGCTCAACTCCCATAGGAAAAATTTCGCTTTCTTTGGGTAGTGCTCTTTCCATATTGCCTCATATAAGAGTGGAGACGCCTCAGATTTATAGTGGGGAAGATCTTTTAGCAGGGAACTAACCGTATATACCCATTCGATTCTAGTTTCCACAACATCTTATCCTTCGCTTGAGACAAATGAACCGTTGATAGGAGTTGTAAGAGTGTTGCCCATTCTTCTATTTCAGCTTCCTTTAGATTTCTTCTCATGCCAATATTCCACATTCCAGATTCCACTAGACAAAATTTTTTTACAGCAGCAGTTTTCTTTGTAGAGAGACCAAATAGAAGTGGAAATCTTCTGTTTAAAGGCCTGTCCAGCAGCCAAGCGTCATACCAAAAGGAGGTAGCTG
CACCGTTCCCAACCTTGAGAGTGACCCAGTCGAATAATAAGTTCTGAAATTTCAAAATGGATCTCCCTGGACCCCTAGTTCGGATCTGAATTTTTATTCCTAGTTTGAGTCCTAGGTTTGAGATGTTACACAGTTTCTTGAGGAGCATTGCCTTTCATAACTAAAACGTGATATGCCTTGGCCTTTTTGTCGAGTTGAGTTCATGAATATCTCCCATCTGCAGGTTTTCCATGAATGTGCCAACACCTTTCTTTTGCATTCTTATTCTTTTACAGTGGTCACACCACATTTTTCCACATCTGTTACGATTATCACCAGATTGTTGATTGGGTTTGTAAACTGTTGTAACTCAATATTGAGAGAATACACATCTATTTCGTATACTAAAGTCACGTATTTATAAGTATACAAGAGAGCCCTAGATTAAAAGAATGTAAAATTACAATAAAGGACAAATATACTAATAAGCATCTAAACTAAGGTATTTATACTAATACACATAATAGTGTATATCATAACACTCCCCCTCAAGCTGGAGCAAATATGTCAATTATGCCCAGCTTGTTGCAAAGGTAGTCTATTCTTGCTCCATTTAAGGCTTTTGTAAAGATATCTCCCAATTGTTCATCGGTCTTCACATATCCTGTAGACACCTCACCTTGTTGTATTTTCTCACGAATAAAATGACAATCAACTTCAATATGTTTAGTTCGTTCATGAAATACTGGATTAGATGCAATGTGAAGGGCAGCTTGATTGTCACACCACAACTTAGTTGGCATAGTGATACTGAAGCCTAACTCAGTTAAAAGTTGATGTATCCACGTTATTTCACAAACAGACTGTGCCATAGCTCTATATTCTGATTCAGCACTCGAACGTGAAACTACATTTTGCTTCTTACTCTTCCATGAAACCAAATTACCTCCTACAAAAATACAATATCCAAAAGTTGATCTTCTATCTTCTCTTGATCCAGCCCAATCAGCATCTGAAAAACACTCGACTTTCGTATGACCATGATCTTTGTATAAGATTCCATGTCCAGGTGCAGCTTTTAAATAACATAATATTCGTTCTACTGCAGCCCAATGATCCACTGTAGGCGAAGACATATACTGACTCATAATGCTTACTGGATAAGCAATGTCTGGTCGTGTCACTGTCAGATAATTCAACTTTCCAACTAATCTTCTATATCTCTCAGGGTTTTTAAATAATTCACCATCTTTTGTAAGTTGCAAATTAGGCACCATTGGCGTATTACATGGTTTGGCTCCCAGTTTTCCTGTCTCAGATAACAGATCAAGCACATACTTCCTTTGTGATAAAAAGATACCCTTCTTGCTTCTCATCACTTCAATACCCAAGAAGTATTTCAACATTCCTAGATCCTTAGTATGAAACTGACTGTGAAGGAAGTTTTTGAGAGAAGAAATACCTGATATATCATTTCCAGTGATAATAATATCATCAACATATACAACAAGAAGAGTAATGCCACTCTCAGACCGTCGATAGAATACAGAATGATCAGACTTACTGTTCTTCATTCCAAACTGCTCAAGTGCTTGACTAAATCTTCCAAACCACGCTCGTGGACTTTGTTTCAATCCGTACAAAGATTTTCGAAGATGGCAAACTTTATCATTCTCCCCCTGAGCAACAAAACCAGGTGGTTGCTCCATATAAACTTCTTCCTGAAGATCTCCATGTAGAAAAGCATTCTTTATGTCCAGCTGATGCAAAGACCAACGTTGGGTAGCAGCCATGGAAATAAATAACCTGACAGAAGTTAGTTTAGCAACGGGAGAAAATGTGTCAGAGTAGTCAATCCCATAAGTCTGAGCATAGCCTTTGGCAACCAAGCGTGCTTTCAATCGGGCAACTGATCCATCTGGGTTGACCTTAATTGCAAATACCCACTTACACCCAATTGCTTTCTTTCCTACAGGACGAGGCACTAAATCCCACGTACCATTATCATTTAAAGCATTCAT
CTCTTCAATCATTGCATCACGCCAGCCAGGATGAGATAATGCTTCATGAATAGTGTTAGGAATAGAAACAGAATCAAGGGACGCAATAAAGGAATATGTGGGTGATGACAAATGGTTATATGAAACAAAGGAAGAAATAGGATAAGTGCATTTGCGTTTACCTTTACGAAGTGCAATAGGAAGCTCATCACTTGGTCCTGGATCCAATGTCGAAGAATCTACTAGTATAGGATATGTACCTGGAGGTTGTTGTTGTTGTCGTCGCCTTGAGTAGACCTGAGTAAAGGGTGGACGGGGAGGGACAGATACCACTGGAGGTGGAGCAAGGGAGGAATCTGGAGAAACAATGGTATAAATAAAGAGATCATCCTCCTCTCCCTTACGCGTACTCGAAGGTGCTGGATTAAAAGGTATATCCTCAAAGAAAGTAACGTCAGGAGAAACAAGGTATCTGTTAAGACTAGGACAATAACAACGATATCCTTTTTGAACACGAGAATATCCTAGAAAAATGCACTTTAATGATTTTGAATCTAACTTTGTGAGGTGAGGACGGACGTCTCGAACAAAACACGTACAACCAAATATTTTTAGATCGATAGAAAATAATGGTTGTGTTGGAAATAGAACCCGATACAGAATCTCACCATTAAGAACAGATGAAGGCATTCGATTTATTAGGAAGCAGGCTGTGGAGACAGCATCAACCCAAAAATGCTTTGGAACATGCATTTGAAATGATAATGCTCTTGCTGTTTCAAGGAGATGTCTATTCTTTCGTTCTGCAACTCCATTTTGGGATGGAGTGTCAGCACATGAAGATTGATGAATAATGCCATTTTGACATAAATAAGATCCAAGCATATTAGAAAAGTATTCTCCTGCATTATCCGTTCGCAAAACTTTAAGAGAAACGTTAAATTGAGTTTGGATTTCAGCATGAAAATTGCAAAAGTGGGAGAGCAACTCAGAACGACTTTTCATTAAATATAACCAAGTCAAACGAGAATAATCGTCAACAAAAGTAATAAAATATCTAAATCCCGTTTTCGACACTACTGGACAGGGACCCCAAATATCAGAATGAACTAATTCAAAAGGAGCATTTGCTCGTTTATTGACTCTAGGACATGAACTAAGACGATGAAATTTGGCAAACTGACAAGAATCACAATTTAAAGAAGACAAAGAATGAAACTGTGGATAAAGTTTCTTCAACACGGACAAAGACGGATGGCCCAAACGACAATGGACTTCAAATGCAGATGCAACTCCAGAACATGCTACAGCCTTTGGTTCTTGGTGATAAAAAATGTAAAGACCTCCAGATTCATATCCTTTACCAATAATCTTCTTCGTCGCAAGATCCTGAAACAAGCAATAGCCAGGAAAGAATGAAACAGAACAGTTAAGATCGCGAGTGAGTTTACTAACAGAGATTAAATTAAAGGAAAGTTTAGGCAAATTTAAGACAGAGGACAAATGTAGAGATGGTGTGAGATTAATTGTGCCAGATCAAAGTACAGAGGATGTTGATCCATCTGCTAAAGTAACAATAGGAGAAGATGCAGGTGACAAAGGAGTAGAAAATAAGCGGGAATTACCTGTCATATGGTCTGTCGCACCAGAATCTATGACCCATTTGGTGGAGGATGAAAGAAGACAATGATGCATATTACCTGACTCAGCGATGGCTGTAATAGGGGTAGATGATGATGATGATGCTTGTAATGATTCCTGGTACTGCTGAAATTTAGCAAAATCATCTGCAGAAATGGTTACTGACTTTTCAGGTGTATCATGGGTGGAAGCAACCTGAGCGGATCGAGGTCTTTGGCCCTTATTCAACAATTTCTTGCATTCGCGTTTCATATGACCTGGCTTACGACAGTAGTGGCATTGTTATGATATACACTATTATGTGTATTAGTATAAATACATTAGTTTAGATGCTTATTAGTATAATTGTCCTTTATTGTAATTTTACATTCTTTTAATCT
AGGACTCTCTTGTGTACTTATAAATACGTGACTTTAGTATACGAAATAGATGAATTCTCTCAATATTGAGTTACAACAGTCTACAATAATCTCCTGACCATCTTGTCTGCGATTATCAAAACCAGTTTTGGGAGTATTGGTGCCCATCCCTTTATCACCTCTGTGATTATTGTTCATACCAATGAGAGCACTGTTTGATTGAGCAGGAGAGAAACTTGATTGAGAATTCTCCGTACGAAGAACTCTACTAAAAGCTTCTTGTAAAGAAGAGATGTCAGGACTAGAAAGAATTTGTGCTTTAGCCATTTCAAATTCAGATGGAAGTCCAGTTAAAAAACTCATGACAGCCATCTGTTCTCGCTGAGCTTGTTGAACCTTCATATCTGTACTAAATGGTAGTAACATATTGAGTTCTGCATACGTTTTCTTAAATTCCATAAAGTAACTTGTAAGTGATTGTTCCTTCTTTTCTGGACGATAGAACGCCTTGCATACCTCATACATTCTATTGACTTGCCCTTTTCCTGAGTACAAAAATTCCAAGTACTCAAGTAGTTCTTTCACAAATTCACAATGATTAATTAAGCCAACTACCTCACTATCAATGGAATTCTTTATCTGAAGAAACAAACGGGCATCATCTCGTAGCCATGTCTTCTTTGTGTCATCATCTGGTGGATCATCAGTGATGTGGTTATCCATCTCTATGCTTCGTAAATAAAGTCGAATAGTCCTACTCCAATCATAGTAATTTGAACCATTTAGCTTATGATCTGTGATCTTTGACGTTAGAGGAACCACATCAGATATCACTATGGGTTTTTTTCTCAGCCATAGTACCAAATAAAGGTTGTAAAAACCCAAAATCTGATAATTATTGACCAAAAACAGAATTGAACTATGTGGCAAAACCAACTATTTTCCAAACTGCCTTGCACCAAAGTAGATCTTGCTTAATCCAAGCACACAGATAGGACGAACAGCATGAAACACGACAAGAACAGGCGACGGAAGACTGATTGGCTGTCCCACGCGCCGGCGCGTGGAGTCGGAGGCGATTTTGTCGGTGGCGCATGTAGCCCACGCGCGGGTGTTTCCGACGATCGGCGAAGACGATCCGGTCCTCTGGACGGCGGCGCTCCTTTTGGGGTGGGTGGTGTCAACAAACGGCCACTCCAATGAGATGACCCAAACTTCAAACCCTAACCTTACTTAGAAGAAAAACCCTAATAGCCCCAAAAGAGATATCTAAAACCCTAGACAAGGCTCTGATACCATGTAAACTGTTGTAACTCAATATTGAGAGAATACACATCTATTTCGTATACTAAAGTCACGTATTTATAAGTATACAAGAGAGCCCTAGATTAAAATAATGTAAAATTACAATAAAGGACAAATATACTAATAAGCATCTAAACTAAGGTATTTATACTAATACACATAATAGTGTATATCATAACAGGGTTGTCATACAACTAAAGCTGAATCTCAGCATTTTATCGAGTAATGGTTGTTCCCAACATGAGTTTTCTTCTACTCTCCTCATGATGAACCTCAGGGAACACTTAGACTCGGAAATAATGGAAACTACTCTTCCCTTAACTTTATCGAGACTTTTATTGAGGCCCAACAAAAATTTATAAATTCTCTTTTGTTGAACAAATTTCATGGACTGATTTTCATCTTTTGTACACTCATAGTTGTAAGATTCAGAACATATCCAGTTGTTGCCAATATTGGTTAAGAAGAGAACTATTGGGTTATAGGAGGTCACCTTGTCTGAAAATCTGAATAATTCCTTCTATTTAACAACTCTGATGTGTTTCCATGTTTGAATGTGTTTTCTTGGCATAGTCCCATAACTCTTTGACGGTTTGGGATAAAAGGAATTTTTCACTGATCTCTGGAAGTGTTGAGCTAATCAATCAGCACAATACCATATATTTGTTGATTTGCCATTCTCAACACTTAGGATTGCCGATAGAGGGTTGTTTGAT
GCTTCGAATTAAGTATTGTATTTGTCGTTCCCTCTACCATAGGTAGACATTTGCATTGAAGATGACCATTGGAGGTAGTTCCTGCCAATTAATTTGTGGTTGGTGATGGTAGGAGAAAAAGTTGGCTAGAGTAGAGGTCCTGCTGACCTCGACACCAACCGATCAGTTGGAGAGTAAAGTAGAAACCAATTGATTTATTTTGTATAATGCAATGATTGGTCGGTTGAAGATGATCGGAGCTGGTGATTGGTGGTGGGGCACATGAGGGAGGATTGAGAGAGTGGTTGGCAGAGTCAAAAGGTAGGAAGCACTGCATTTCCACTGTGTTAACTAATGATTAGAGAAGGGAGGTTTTTTTTTGGAGAGGGGGGGGGGGGGGGGGGGAATTCTCCTGTCTGTAAATCTATTCATGCAGCCTCTTTAAGAGTTAGTATGGCCAGCCACCATCTCAGTTATTTTACTTTTAATTTTTTTATATGTATTTGGAAAAAATCGGACAGTCAGTCTGAGGGGCTGAAGATGGGAAAAAAAAAAACAATGCTCTGATACCATATTGAATGAGTGTTGAGCAAGAGAGGAAATAATGATGTTCATATGACATCCCACGATATACAACCTTCGTACAAGTATTCTGAAAAAGTAAAAAGGAAAGAAATGTGTACAAGGGAAGAATAAAATGGTAAACTAGCTGTACAAAACCAAGAATACTCTTACGGTAATTAAAATAAATAAAGGGCCTATTATGCTTAACGAATGGAAGGCCCGTAAACCAAAAACTTCAACAGGTAGCAAAAAGGATCGTACAGAATGCTTCACCCTGATATTAGTAACGAACCAAGCCTTTGGATTTGAAAATCTTCTTACTATTGAAATTATTGATTGCAGAGTAGTTCCAAATCTAGCTATCAGCAAGATACTGTCAAGAGAAAGCAACAAAGTTCGCATTCTTCAGGTAAAATCAGAATATTAATTGCAAATTAGATGAACCTCCTATTAGCCGTTCTTATCTATCTTAGAAATATATATGACGGGAGGTTTGAAATTTATTAATTCATGTGGCATTTGTGTAATAAGTCTTGAGTTGGTTCGATTCTTGTAGAGAGGTAATCATACAGCTTTCTGGCTAGGAAATTTGTATGAAAAGATTTAAAATTCTAACGGGAAGCTAATAATGTTGCAATACAGAATGAGAATAATGAGAATGTCTAGAGTCACTGTTGCAGTGTGCATAACATGCAATGACATTTTTCTTATGTAAGAGAATACTAAGGTGATTATTTCATTATCTAGCTATATAGCAGATGTGGTACTCTGGAACTGAAAACATCTGATGTGATTTTACATTTACTTGATGGGATTACTACCTCTTTTGGATTTTACTTGTGCTATGCATGTAGGGAGGAATGCTAAGCAGTTAGCTTATTGAGGGTATTATTATGATTCCATAACCAATGGCTACTAACAGTATAGTTTAGTTAGCTAAGCAGTTCGTTCTTTGTAATCAGTTAGGATAGCAGGTCAGTTGATGTTTGGAGCGGCTAGGAATATAGTGAGAATGAGTAGAGTCAGGCCTCTCGTCAAGAATTTTACTCGATAACTATCAATGGAACGTGTTCTTAAAACCCAAATCCTAACAGTTACTTGTTCTGTTAATGGCTCAATCATGCTAATTTTCGAAAAGGATTAATATGTGCAGGAAGGATGCAAGGGTCTGCTGTATATGGTTCTGCACGCCCGTGTAGTTCTGGACTTGTGTGAACTGCTTGAACAAGAGATTAGCTTTGAACAGGTTGAAGGAGACTTTGACTCTCTTAGTGGAAAATGGCATTTTGAGCAGTTAGGAAGTCATCATACCCTGTTGAAATACTCCGTGGAGTCGAGAATGCACAAAGATACCTTTCTTTCTGAGGCTCTAATGGAAGAGGTTTCATTTTTTTTTATTTTTTTGTGCTTCTGTGACTAATCTTGCTTCTATTTTGTAGTTCCTGTTTCGCTTGAAA
CATTATAAATAATTTTAGCTGTTCACTCTCTTTATGTGGAAGCAAAACATGTTTCTCTAGTTATTTAATTCTCTTTTTTACAAGTAATATTGCATAAGTGAAATAGTTACCACCTGCAGTTCTTTAAAAATAATCATGTGTTATTCAGGTTGTATATGAAGATCTTCCTTCAAACTTATGTGCAATTCGGGACTCCATTGAGAAAAGGGGTTTGAACAATTCTTTTGAAGCATTTGATGAAGGTAGAGATTCAGACGAGAAAAGTGCCTCATATTGTAACGATCAATTCAATGGTTATACGACGACAGTTGAGGGAGTTTCAGATGTCAATGGGAGAAATTCGTGCAGACCGAGGCCCAAAGTTCCAGGCTTACAAAGAGATATCGAAGTTCTCAAAGCAGAGGTGCTCAAGTTTATTTCAGAACATGGGCAGGAAGGATTTATGCCAATGAGAAAGCAACTTCGCATGCATGGAAGAGTAGATATTGAGAAGGCCATCACACGCATGGGTGGATTCAGAAGGATTGCATCACTTATGAATCTTTCTCTCGCTTATAAGCACCGCAAGCCGAAGGGTTACTGGGACAAATTTGACAATTTGCAGGAAGAGGTATGCTTCAACGTGGGATTTTAACTTGTTGGAAACCTTTATCTTTTAATTTCTCATTCTTTCTGTTGGTTAAAATTTTTATGTTTAGCATGCAAACTATATTTTGGGTTTACATGTAAATCAAAATCTGAAACTTGCATACCGTTGTGGATAACTTAACCGTGAGTCTTAGAGTGAGTTTGAGATTGACATTTGAGAAGCACCTTTTTATACAGATATTTTTTTTTTGAAAAAGCACTTTAGAGGTGCTTTCAAGTGTTTTTAGAATCTAAAAACTCTTTTGACTAAGTAACCAAATACTATGAAATTTGAAAAATACTTCTAATTAGATATAAAACAATTTTCACTATTATTAAAGTTATTCCAAACTCATCCTTATCCTAGCTACTCATCCGCCCTTAGAATGAACGAAAATGTATGCTTCATGCTTCATGTGTTAGAAAAAGACTTGCAGCTTCACAGAATTCTATTGCTAAATAAACAGATAAATCGGTTTCAGAAGAGTTGGGGAATGGATCCATCATACATGCCCAGTAGGAAGTCCTTTGAACGTGCAGGTACAAAGTCTACTGCTTTATATATTATAAATTATCATTATTATTCTTCTTTCTTCCCTTTTTTTTTTCTTTTTTTTTTTTTTTTTGGAAGGGCTTTTTTGTGTTTGGGGGAAGGCGGGGGGGGGGGGGGGGCACGGACTTTGAAGAGGGAGGTTGATTGACCTCACCAAAAGTTACGAAAAGCATTATTTTGATGCTCATAGAACATGAATTTGTAAGCTTTGATCACCCTCAGTTCGTAATGTAATGACTCATCTCACATCATTGATTGGTTTCCGACACAACTGAGTCTTTTGAATTTGAGATACCGTGTCGACAATAGATTTTAACATCTAGTTGAGACGTATACATAAACTTCTAGGATTTCAATATTGTCTGAGACGATCCGGGAGGGTCATGTCAGGTTTAGCTCTCACTTCAATGTTATTAGATTCAACTCCAAATGTTGTTTCTCAAATAATGATATTTTCCTAATAAGCTCTATGTTGCTCAAAAAATGGCTGCCATTACTCTCTATGAAGCTTCATGCACATTCCATTCAGCTCTATATATTTATGAAGAAAAATCATTGATTGACAAAGGCTTGTTGCAGGGCGGTACGACATCGCACGGGCACTCGAGAAATGGGGCGGCCTACACGAAGTGTCTCGTCTTTTGTCACTAAAAGTGAGACATCCTAATAGACAACCAAGCTTTGCCAAAGATCGAAAGAATGATTATGTAGCTCTAAATGATGTTGATGCTGAAAGTAAGACCCCATCTAAGCCCTATATTTCTCAGGATACAGAAAAATGGCTTAGAGGACTCAAACATTTGGATATTAATTGGGTTGA
GTAG
mRNA sequence
ATGATCATTTGCAGAGCTTTGAGGCTCAACTTGGGGACGCCATTGCCGCCGCTAACGTCCGGCGTCTTCGCTAGACAAGCACAATATTGCCAGACGTCTTCTCTTCGGCTGCGCAACAAATGGGTCTCCCTTTCCGCTGCCGAAGGCTTCGACTGGAACTCGAGCGACTACTTTGCGAAGAACTGTAATTTGAAGAGGGGGAGTGGTTTTTATGGTGACCGGGATGATGTTGAAGAAGGGGATGGAGAGAGAGAGAGAGCTGTGCGTTGTGAAGTGGAGGTTATATCGTGGAGGGAGCGGCGAATTCGGGCTGATATATTTGTTAATGCTGGGATTGAATCCGTTTGGAACGCTCTTACTGATTACGAGCGCCTCGCGGATTTTATACCCAATCTTGTTTCCAGTGGGAGAATACCTTGTCCACATACTGGTCGGATATGGTTGGAACAGAGAGGTTTGCAAAGGGCATTGTATTGGCATATTGAAGCTCGAGTTGTCTTGGATCTTCAAGAGCTTCTAAATTCTGATGGTAGTCGTGAACTCCATTTTTCCATGGTTGATGGGGACTTCAAAAAGTTTGAAGGCAAATGGTCATTAAAAGCTGGTACAAGGTCACTCCCTACAATGTTGTCATATGAAGTTAATGTAATACCAAGATTCAATTTTCCTGCCATTCTTCTAGAGCGAATAATCAGATCAGACCTTCCTGTGAATCTTCGAGCCTTGGCTTGTAGAGCCGAAGAGAATTCTGAAGGGGGTCAAAGAGTAGGAACCATTGTAGATTCCAAGTCCATGGTTCAAACTAATACAGTTAATGGTGCTTCATGTGAAAAGGATGAATTATTACAGGAAACTTCCAGAGGGGGTAATTCTATTTCCAATTTAGGACCCTTGCCCCCATTATTTAATGAATTGAATAGCAACTGGGGAGTTTTTGGAAAAGTTTGCAGACTTGACAAACGATGCATGGTAGATGAAGTTCATCTTCGCAGATTTGATGGTTTGTTGGAAAATGGAGGTGTTCATCGCTGTGTGGTCGCTAGCATAACAGTGAAAGCTCCTGTTCGTGAAGTCTGGAATGTCCTGACTGCTTATGAAAGTCTTCCCGAAGTAGTTCCAAATCTAGCTATCAGCAAGATACTGTCAAGAGAAAGCAACAAAGTTCGCATTCTTCAGGAAGGATGCAAGGGTCTGCTGTATATGGTTCTGCACGCCCGTGTAGTTCTGGACTTGTGTGAACTGCTTGAACAAGAGATTAGCTTTGAACAGGTTGAAGGAGACTTTGACTCTCTTAGTGGAAAATGGCATTTTGAGCAGTTAGGAAGTCATCATACCCTGTTGAAATACTCCGTGGAGTCGAGAATGCACAAAGATACCTTTCTTTCTGAGGCTCTAATGGAAGAGGTTGTATATGAAGATCTTCCTTCAAACTTATGTGCAATTCGGGACTCCATTGAGAAAAGGGGTTTGAACAATTCTTTTGAAGCATTTGATGAAGGTAGAGATTCAGACGAGAAAAGTGCCTCATATTGTAACGATCAATTCAATGGTTATACGACGACAGTTGAGGGAGTTTCAGATGTCAATGGGAGAAATTCGTGCAGACCGAGGCCCAAAGTTCCAGGCTTACAAAGAGATATCGAAGTTCTCAAAGCAGAGGTGCTCAAGTTTATTTCAGAACATGGGCAGGAAGGATTTATGCCAATGAGAAAGCAACTTCGCATGCATGGAAGAGTAGATATTGAGAAGGCCATCACACGCATGGGTGGATTCAGAAGGATTGCATCACTTATGAATCTTTCTCTCGCTTATAAGCACCGCAAGCCGAAGGGTTACTGGGACAAATTTGACAATTTGCAGGAAGAGATAAATCGGTTTCAGAAGAGTTGGGGAATGGATCCATCATACATGCCCAGTAGGAAGTCCTTTGAACGTGCAGGGCGGTACGACATCGCACGGGCACTCGAGAAATGGGGCGGCCTACACGAAGTGTCTCGTCT
TTTGTCACTAAAAGTGAGACATCCTAATAGACAACCAAGCTTTGCCAAAGATCGAAAGAATGATTATGTAGCTCTAAATGATGTTGATGCTGAAAGTAAGACCCCATCTAAGCCCTATATTTCTCAGGATACAGAAAAATGGCTTAGAGGACTCAAACATTTGGATATTAATTGGGTTGAGTAG
Coding sequence (CDS)
ATGATCATTTGCAGAGCTTTGAGGCTCAACTTGGGGACGCCATTGCCGCCGCTAACGTCCGGCGTCTTCGCTAGACAAGCACAATATTGCCAGACGTCTTCTCTTCGGCTGCGCAACAAATGGGTCTCCCTTTCCGCTGCCGAAGGCTTCGACTGGAACTCGAGCGACTACTTTGCGAAGAACTGTAATTTGAAGAGGGGGAGTGGTTTTTATGGTGACCGGGATGATGTTGAAGAAGGGGATGGAGAGAGAGAGAGAGCTGTGCGTTGTGAAGTGGAGGTTATATCGTGGAGGGAGCGGCGAATTCGGGCTGATATATTTGTTAATGCTGGGATTGAATCCGTTTGGAACGCTCTTACTGATTACGAGCGCCTCGCGGATTTTATACCCAATCTTGTTTCCAGTGGGAGAATACCTTGTCCACATACTGGTCGGATATGGTTGGAACAGAGAGGTTTGCAAAGGGCATTGTATTGGCATATTGAAGCTCGAGTTGTCTTGGATCTTCAAGAGCTTCTAAATTCTGATGGTAGTCGTGAACTCCATTTTTCCATGGTTGATGGGGACTTCAAAAAGTTTGAAGGCAAATGGTCATTAAAAGCTGGTACAAGGTCACTCCCTACAATGTTGTCATATGAAGTTAATGTAATACCAAGATTCAATTTTCCTGCCATTCTTCTAGAGCGAATAATCAGATCAGACCTTCCTGTGAATCTTCGAGCCTTGGCTTGTAGAGCCGAAGAGAATTCTGAAGGGGGTCAAAGAGTAGGAACCATTGTAGATTCCAAGTCCATGGTTCAAACTAATACAGTTAATGGTGCTTCATGTGAAAAGGATGAATTATTACAGGAAACTTCCAGAGGGGGTAATTCTATTTCCAATTTAGGACCCTTGCCCCCATTATTTAATGAATTGAATAGCAACTGGGGAGTTTTTGGAAAAGTTTGCAGACTTGACAAACGATGCATGGTAGATGAAGTTCATCTTCGCAGATTTGATGGTTTGTTGGAAAATGGAGGTGTTCATCGCTGTGTGGTCGCTAGCATAACAGTGAAAGCTCCTGTTCGTGAAGTCTGGAATGTCCTGACTGCTTATGAAAGTCTTCCCGAAGTAGTTCCAAATCTAGCTATCAGCAAGATACTGTCAAGAGAAAGCAACAAAGTTCGCATTCTTCAGGAAGGATGCAAGGGTCTGCTGTATATGGTTCTGCACGCCCGTGTAGTTCTGGACTTGTGTGAACTGCTTGAACAAGAGATTAGCTTTGAACAGGTTGAAGGAGACTTTGACTCTCTTAGTGGAAAATGGCATTTTGAGCAGTTAGGAAGTCATCATACCCTGTTGAAATACTCCGTGGAGTCGAGAATGCACAAAGATACCTTTCTTTCTGAGGCTCTAATGGAAGAGGTTGTATATGAAGATCTTCCTTCAAACTTATGTGCAATTCGGGACTCCATTGAGAAAAGGGGTTTGAACAATTCTTTTGAAGCATTTGATGAAGGTAGAGATTCAGACGAGAAAAGTGCCTCATATTGTAACGATCAATTCAATGGTTATACGACGACAGTTGAGGGAGTTTCAGATGTCAATGGGAGAAATTCGTGCAGACCGAGGCCCAAAGTTCCAGGCTTACAAAGAGATATCGAAGTTCTCAAAGCAGAGGTGCTCAAGTTTATTTCAGAACATGGGCAGGAAGGATTTATGCCAATGAGAAAGCAACTTCGCATGCATGGAAGAGTAGATATTGAGAAGGCCATCACACGCATGGGTGGATTCAGAAGGATTGCATCACTTATGAATCTTTCTCTCGCTTATAAGCACCGCAAGCCGAAGGGTTACTGGGACAAATTTGACAATTTGCAGGAAGAGATAAATCGGTTTCAGAAGAGTTGGGGAATGGATCCATCATACATGCCCAGTAGGAAGTCCTTTGAACGTGCAGGGCGGTACGACATCGCACGGGCACTCGAGAAATGGGGCGGCCTACACGAAGTGTCTCGTCT
TTTGTCACTAAAAGTGAGACATCCTAATAGACAACCAAGCTTTGCCAAAGATCGAAAGAATGATTATGTAGCTCTAAATGATGTTGATGCTGAAAGTAAGACCCCATCTAAGCCCTATATTTCTCAGGATACAGAAAAATGGCTTAGAGGACTCAAACATTTGGATATTAATTGGGTTGAGTAG
Protein sequence
MIICRALRLNLGTPLPPLTSGVFARQAQYCQTSSLRLRNKWVSLSAAEGFDWNSSDYFAKNCNLKRGSGFYGDRDDVEEGDGERERAVRCEVEVISWRERRIRADIFVNAGIESVWNALTDYERLADFIPNLVSSGRIPCPHTGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSLKAGTRSLPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGGQRVGTIVDSKSMVQTNTVNGASCEKDELLQETSRGGNSISNLGPLPPLFNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCELLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLNNSFEAFDEGRDSDEKSASYCNDQFNGYTTTVEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYVALNDVDAESKTPSKPYISQDTEKWLRGLKHLDINWVE
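The protein sequence above should follow from the CDS by translation with the standard genetic code. The sketch below shows one way to check this. It assumes the standard code (NCBI translation table 1) applies to this gene, and the helper name `translate_cds` is illustrative, not part of the database.

```python
# Standard genetic code (assumed: NCBI translation table 1).
# Codons are enumerated with each position in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace (including line breaks inside the pasted sequence) is
    ignored. Codons containing non-ACGT symbols become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":  # stop codon: end of the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate_cds("ATGATCATTTGCAGAGCTTTGAGG"))  # MIICRALR
```

Passing the full CDS, with its line break left in place, should reproduce the protein sequence listed above if the assumption about the genetic code holds.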
Homology
BLAST of Sgr019019 vs. NCBI nr
Match:
XP_022154935.1 (uncharacterized protein LOC111022083 isoform X1 [Momordica charantia])
HSP 1 Score: 1309.7 bits (3388), Expect = 0.0e+00
Identity = 661/737 (89.69%), Positives = 681/737 (92.40%), Query Frame = 0
Query: 1 MIICRALRLNLGTPLP---------PLTSGVFARQAQYCQT-SSLRLRNKWVSLSAAEGF 60
MI+CRALR NLGTP P PLTSGV+ARQA+YCQT SSL LR+K VSLSAAEGF
Sbjct: 1 MIVCRALRFNLGTPSPLPLPLPLPSPLTSGVYARQAEYCQTSSSLPLRSKCVSLSAAEGF 60
Query: 61 DWNSSDYFAKNCNLKRGSGFYGDRDDVEEGDGERERAVRCEVEVISWRERRIRADIFVNA 120
DW+SS+YFAKNCNLK SG + +D EG G+ ERAV CEV+VISWRERRIRADI VNA
Sbjct: 61 DWDSSEYFAKNCNLKSRSGGW---EDGGEGVGDGERAVHCEVKVISWRERRIRADILVNA 120
Query: 121 GIESVWNALTDYERLADFIPNLVSSGRIPCPHTGRIWLEQRGLQRALYWHIEARVVLDLQ 180
IESVWNALTDYERLADFIPNLVSSGRIPCPH GRIWLEQRGLQRALYWHIEARVVLDLQ
Sbjct: 121 AIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQ 180
Query: 181 ELLNSDGSRELHFSMVDGDFKKFEGKWSLKAGTRSLPTMLSYEVNVIPRFNFPAILLERI 240
ELLNSDGSRELHFSMVDGDFKKFEGKWS+KAGTRS PT LSYEVNVIPRFNFPAILLERI
Sbjct: 181 ELLNSDGSRELHFSMVDGDFKKFEGKWSIKAGTRSSPTTLSYEVNVIPRFNFPAILLERI 240
Query: 241 IRSDLPVNLRALACRAEENSEGGQRVGTIVDSKSMVQTNTVNGASCEKDELLQETSRGGN 300
IRSDLPVNLRALACRAEENSEGG+RVGT DSKSMV TNTVNGASCE DE LQETSR N
Sbjct: 241 IRSDLPVNLRALACRAEENSEGGRRVGTTEDSKSMVLTNTVNGASCENDE-LQETSRRSN 300
Query: 301 SISNLGPLPPLFNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASIT 360
S SNLGPLPPL NELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASIT
Sbjct: 301 SNSNLGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASIT 360
Query: 361 VKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLD 420
VKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLD
Sbjct: 361 VKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLD 420
Query: 421 LCELLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVV 480
LCE LEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVV
Sbjct: 421 LCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVV 480
Query: 481 YEDLPSNLCAIRDSIEKRGLNNSFEAFDEGRDSDEKSASYCNDQFNGYTTTVEGVSDVNG 540
YEDLPSNLCAIRDSIEKRG NNSFEAFDEGR S+EKSASY NDQ NGYT EGVSD NG
Sbjct: 481 YEDLPSNLCAIRDSIEKRGSNNSFEAFDEGRHSEEKSASYHNDQINGYTMKGEGVSDDNG 540
Query: 541 RNSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGG 600
+NSCRP+PKV GLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGG
Sbjct: 541 KNSCRPKPKVAGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGG 600
Query: 601 FRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYD 660
FRRIAS+MNLSLAYKHRKPKGYWDKFDNLQEEINRFQ SWGMDPSYMPSRKSFERAGRYD
Sbjct: 601 FRRIASIMNLSLAYKHRKPKGYWDKFDNLQEEINRFQTSWGMDPSYMPSRKSFERAGRYD 660
Query: 661 IARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYVALNDVDAESKTPSKPYISQD 720
IARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKND +A N DAE+KT S+PYISQD
Sbjct: 661 IARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDSLAFNGHDAENKTASRPYISQD 720
Query: 721 TEKWLRGLKHLDINWVE 728
TEKWL GLK+LDINWVE
Sbjct: 721 TEKWLSGLKYLDINWVE 733
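The percentages in an HSP summary line are the matched counts divided by the alignment length, gaps included. As a minimal sketch, the summary line can be parsed and the percentages recomputed as follows. The function name `hsp_percentages` is hypothetical, and the pattern assumes the line layout shown in this report.

```python
import re

# Matches "Identity = a/b ... Positives = c/d". The optional "i" also
# accepts the misspelling "Postives" that appears in some reports.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives) for one HSP line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("no Identity/Positives counts found")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 661/737 (89.69%), Positives = 681/737 (92.40%), Query Frame = 0"
print(hsp_percentages(line))  # (89.69, 92.4)
```

The denominator (737 here) is the alignment length rather than the query length, which is why it exceeds the 728 residues of the query when the alignment contains gaps.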
BLAST of Sgr019019 vs. NCBI nr
Match:
XP_038882723.1 (uncharacterized protein LOC120073881 [Benincasa hispida])
HSP 1 Score: 1307.7 bits (3383), Expect = 0.0e+00
Identity = 652/729 (89.44%), Positives = 678/729 (93.00%), Query Frame = 0
Query: 1 MIICRALRLNLGTPLPPLTSGVFARQAQYCQT--SSLRLRNKWVSLSAAEGFDWNSSDYF 60
MI+CRAL LG P PLTSGV+A Q +Y QT SSL R K VSLSAAEGF+WNS+ YF
Sbjct: 4 MIVCRALSFTLGPPF-PLTSGVYATQTEYYQTSFSSLPFRTKCVSLSAAEGFEWNSTQYF 63
Query: 61 AKNCNLKRGSGFYGDRDDVEEGDGERERAVRCEVEVISWRERRIRADIFVNAGIESVWNA 120
K CNLKRG+ YG R+D EEG+GERER VRCEVEV+SWRERRIRADIFV +GIESVWNA
Sbjct: 64 TKGCNLKRGNEVYGGREDGEEGEGERERDVRCEVEVVSWRERRIRADIFVQSGIESVWNA 123
Query: 121 LTDYERLADFIPNLVSSGRIPCPHTGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGS 180
LTDYERLADFIPNLVSSGRIPCPH GRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGS
Sbjct: 124 LTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGS 183
Query: 181 RELHFSMVDGDFKKFEGKWSLKAGTRSLPTMLSYEVNVIPRFNFPAILLERIIRSDLPVN 240
REL FSMVDGDFKKFEGKWS+KAGTRS PTMLSYEVNVIPRFNFPAILLERIIRSDLPVN
Sbjct: 184 RELLFSMVDGDFKKFEGKWSIKAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVN 243
Query: 241 LRALACRAEENSEGGQRVGTIVDSKSMVQTNTVNGASCEKDELLQETSRGGNSISNLGPL 300
LRALACRAEE SEGGQRVG DSKS+V +NTV GA+CEKDE++QE SRGGNS SNLGPL
Sbjct: 244 LRALACRAEEKSEGGQRVGNTKDSKSVVLSNTVKGATCEKDEMVQENSRGGNSNSNLGPL 303
Query: 301 PPLFNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREV 360
PPL NELN+NWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREV
Sbjct: 304 PPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREV 363
Query: 361 WNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCELLEQE 420
WNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCE LEQE
Sbjct: 364 WNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQE 423
Query: 421 ISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNL 480
ISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNL
Sbjct: 424 ISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNL 483
Query: 481 CAIRDSIEKRGLNNSFEAFDEGRDSDEKSASYCNDQFNGYTTTVEGVSDVNGRNSCRPRP 540
CAIRDSIEKRGL NSF AFDEG DS+E S+ N+Q NGY TT GVS+V+GR+SCRPRP
Sbjct: 484 CAIRDSIEKRGLKNSFGAFDEG-DSEETGVSHRNNQSNGYKTTAGGVSNVSGRDSCRPRP 543
Query: 541 KVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLM 600
KVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLM
Sbjct: 544 KVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLM 603
Query: 601 NLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKW 660
NLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKW
Sbjct: 604 NLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKW 663
Query: 661 GGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYVALNDVDAESKTPSKPYISQDTEKWLRGL 720
GGLHEVS LLSLKVRHPNRQPSFA DRKNDY+A+NDVDAESKTPSKPYISQDTEKWL GL
Sbjct: 664 GGLHEVSCLLSLKVRHPNRQPSFATDRKNDYLAVNDVDAESKTPSKPYISQDTEKWLTGL 723
Query: 721 KHLDINWVE 728
K+LDINWVE
Sbjct: 724 KYLDINWVE 730
BLAST of Sgr019019 vs. NCBI nr
Match:
XP_011654397.2 (uncharacterized protein LOC101212159 [Cucumis sativus] >KAE8649758.1 hypothetical protein Csa_012453 [Cucumis sativus])
HSP 1 Score: 1292.7 bits (3344), Expect = 0.0e+00
Identity = 646/729 (88.61%), Positives = 673/729 (92.32%), Query Frame = 0
Query: 1 MIICRALRLNLGTPLPPLTSGVFARQAQYCQT--SSLRLRNKWVSLSAAEGFDWNSSDYF 60
MI+CRAL LG PL PLTSGV A Q +Y QT SSL LR K VSLSAA+GF+WN + YF
Sbjct: 1 MIVCRALSFTLGPPL-PLTSGVCATQTEYSQTSSSSLPLRTKCVSLSAADGFEWNPTQYF 60
Query: 61 AKNCNLKRGSGFYGDRDDVEEGDGERERAVRCEVEVISWRERRIRADIFVNAGIESVWNA 120
AK NLKR SG YG R+D EEG+ ERER VRCEVEV+SWRERRIRAD+FV++GIESVWN
Sbjct: 61 AKGSNLKRRSGVYGGREDGEEGEAERERDVRCEVEVVSWRERRIRADVFVHSGIESVWNV 120
Query: 121 LTDYERLADFIPNLVSSGRIPCPHTGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGS 180
LTDYERLADFIPNLVSSGRIPCPH GRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGS
Sbjct: 121 LTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGS 180
Query: 181 RELHFSMVDGDFKKFEGKWSLKAGTRSLPTMLSYEVNVIPRFNFPAILLERIIRSDLPVN 240
REL FSMVDGDFKKFEGKWS+ AGTRS PTMLSYEVNVIPRFNFPAILLERIIRSDLPVN
Sbjct: 181 RELLFSMVDGDFKKFEGKWSINAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVN 240
Query: 241 LRALACRAEENSEGGQRVGTIVDSKSMVQTNTVNGASCEKDELLQETSRGGNSISNLGPL 300
LRALACRAEE SEGGQRVG I DSK +V +NT+NGA+C KDE++QE SRGGNS SNLG +
Sbjct: 241 LRALACRAEEKSEGGQRVGNIKDSKDVVLSNTLNGATCVKDEIVQENSRGGNSNSNLGSV 300
Query: 301 PPLFNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREV 360
PPL NELN+NWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREV
Sbjct: 301 PPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREV 360
Query: 361 WNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCELLEQE 420
WNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCE LEQE
Sbjct: 361 WNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQE 420
Query: 421 ISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNL 480
ISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNL
Sbjct: 421 ISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNL 480
Query: 481 CAIRDSIEKRGLNNSFEAFDEGRDSDEKSASYCNDQFNGYTTTVEGVSDVNGRNSCRPRP 540
CAIRDSIEKR L NSFEA D+G DS+EKS S N+Q NGYTTT EGVSD+NGR S RPRP
Sbjct: 481 CAIRDSIEKRVLKNSFEALDQG-DSEEKSVSRRNNQSNGYTTTAEGVSDINGRASFRPRP 540
Query: 541 KVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLM 600
KVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLM
Sbjct: 541 KVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLM 600
Query: 601 NLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKW 660
NLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKW
Sbjct: 601 NLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKW 660
Query: 661 GGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYVALNDVDAESKTPSKPYISQDTEKWLRGL 720
GGLHEVSRLLSLKVRHPNRQPSFAKDRK+DYV +ND D ESK PSKPYISQDTEKWL GL
Sbjct: 661 GGLHEVSRLLSLKVRHPNRQPSFAKDRKSDYVVVNDFDGESKAPSKPYISQDTEKWLTGL 720
Query: 721 KHLDINWVE 728
K+LDINWVE
Sbjct: 721 KYLDINWVE 727
BLAST of Sgr019019 vs. NCBI nr
Match:
XP_008442209.1 (PREDICTED: uncharacterized protein LOC103486131 [Cucumis melo])
HSP 1 Score: 1286.6 bits (3328), Expect = 0.0e+00
Identity = 645/730 (88.36%), Positives = 670/730 (91.78%), Query Frame = 0
Query: 1 MIICRALRLNLGTPLPPLTSGVFARQAQYCQT--SSLRLRNKWVSLSAAEGFDWNSSDYF 60
MI+CRAL LG PL PLTSGV+A Q +YCQT SSL LR K VSLSAA+GF+WNSS YF
Sbjct: 4 MIVCRALSFTLGPPL-PLTSGVYATQTEYCQTSSSSLPLRTKCVSLSAADGFEWNSSQYF 63
Query: 61 AKNCNLKRGSGFYGDRDDVEEGDGERERAVRCEVEVISWRERRIRADIFVNAGIESVWNA 120
AK NLKR SG YG R D EEG+ ERER VRCEVEV+SWRERRIRADIFV++GIESVWN
Sbjct: 64 AKGSNLKRQSGVYGGRRDGEEGEAERERDVRCEVEVVSWRERRIRADIFVHSGIESVWNV 123
Query: 121 LTDYERLADFIPNLVSSGRIPCPHTGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGS 180
LTDYERLADFIPNLVSSGRIPCPH GRIWLEQRGLQRALYWHIEARVVLDLQE LNSDGS
Sbjct: 124 LTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQEHLNSDGS 183
Query: 181 RELHFSMVDGDFKKFEGKWSLKAGTR-SLPTMLSYEVNVIPRFNFPAILLERIIRSDLPV 240
REL FSMVDGDFKKFEGKWS+KAGTR S PTMLSYEVNVIPRFNFPAILLERIIRSDLPV
Sbjct: 184 RELLFSMVDGDFKKFEGKWSIKAGTRSSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPV 243
Query: 241 NLRALACRAEENSEGGQRVGTIVDSKSMVQTNTVNGASCEKDELLQETSRGGNSISNLGP 300
NLRALACRAEE SEGGQRVG I DSK++V +NT+NGA+C KDE++QE SRGGNS SNLGP
Sbjct: 244 NLRALACRAEEKSEGGQRVGNIKDSKAVVLSNTLNGATCAKDEIVQENSRGGNSNSNLGP 303
Query: 301 LPPLFNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVRE 360
+PPL NELN+NWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVRE
Sbjct: 304 VPPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVRE 363
Query: 361 VWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCELLEQ 420
VWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCE LEQ
Sbjct: 364 VWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQ 423
Query: 421 EISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSN 480
EISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSN
Sbjct: 424 EISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSN 483
Query: 481 LCAIRDSIEKRGLNNSFEAFDEGRDSDEKSASYCNDQFNGYTTTVEGVSDVNGRNSCRPR 540
LCAIRDSIEKRGL NSFE +G ++ CN Q NGYTTT EGVS +NGR S RPR
Sbjct: 484 LCAIRDSIEKRGLKNSFEVLYQGNLEEKSVPRQCN-QSNGYTTTAEGVSAINGRASFRPR 543
Query: 541 PKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASL 600
PKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASL
Sbjct: 544 PKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASL 603
Query: 601 MNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEK 660
MNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEK
Sbjct: 604 MNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEK 663
Query: 661 WGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYVALNDVDAESKTPSKPYISQDTEKWLRG 720
WGGLHEVSRLLSLKVRHPNRQPSFAKDRK+DYV NDVD ESK PSKPYISQDTEKWL G
Sbjct: 664 WGGLHEVSRLLSLKVRHPNRQPSFAKDRKSDYVVANDVDGESKAPSKPYISQDTEKWLTG 723
Query: 721 LKHLDINWVE 728
LK+LDINWVE
Sbjct: 724 LKYLDINWVE 731
BLAST of Sgr019019 vs. NCBI nr
Match:
XP_023517467.1 (uncharacterized protein LOC111781223 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1286.6 bits (3328), Expect = 0.0e+00
Identity = 644/729 (88.34%), Positives = 679/729 (93.14%), Query Frame = 0
Query: 1 MIICRALRLNLGTPLPPLTSGVFARQAQYCQTSS--LRLRNKWVSLSAAEGFDWNSSDYF 60
MI+ LR NLG LPP TSGV+ARQ +YC TSS L LR K VS+SAAEGFDWNSS+YF
Sbjct: 1 MIVGGPLRFNLGPSLPP-TSGVYARQPEYCLTSSSFLSLRTKCVSVSAAEGFDWNSSEYF 60
Query: 61 AKNCNLKRGSGFYGDRDDVEEGDGERERAVRCEVEVISWRERRIRADIFVNAGIESVWNA 120
K+ +LKRGSG YG RD EG+GERER V CEVEV+SWRER+IRA+IFVN+GIESVWNA
Sbjct: 61 TKSFSLKRGSGVYGGRDGNGEGEGERERDVYCEVEVVSWRERQIRANIFVNSGIESVWNA 120
Query: 121 LTDYERLADFIPNLVSSGRIPCPHTGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGS 180
LTDYERLADFIPNLVSSGRIPCPH GRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGS
Sbjct: 121 LTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGS 180
Query: 181 RELHFSMVDGDFKKFEGKWSLKAGTRSLPTMLSYEVNVIPRFNFPAILLERIIRSDLPVN 240
RELHFSMVDGDFKKFEGKWSLKAGTRS PT+LSYEVNVIPRFNFPAILLERIIRSDLPVN
Sbjct: 181 RELHFSMVDGDFKKFEGKWSLKAGTRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVN 240
Query: 241 LRALACRAEENSEGGQRVGTIVDSKSMVQTNTVNGASCEKDELLQETSRGGNSISNLGPL 300
LRALACRAE +SEGGQRVG DSKSM+ +NT+NGA+CEKDELLQE NS SNLG L
Sbjct: 241 LRALACRAEGSSEGGQRVGNSEDSKSMILSNTINGAACEKDELLQE-----NSSSNLGTL 300
Query: 301 PPLFNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREV 360
PPL NELNSNWGVFGKVC+LDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREV
Sbjct: 301 PPLSNELNSNWGVFGKVCKLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREV 360
Query: 361 WNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCELLEQE 420
WNVLTAYESLPEVVPNLAISKILSRESNKVRI+QEGCKGLLYMVLHARVVLDLCE LEQE
Sbjct: 361 WNVLTAYESLPEVVPNLAISKILSRESNKVRIVQEGCKGLLYMVLHARVVLDLCEQLEQE 420
Query: 421 ISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNL 480
ISFEQVEGDFDSL+GKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNL
Sbjct: 421 ISFEQVEGDFDSLTGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNL 480
Query: 481 CAIRDSIEKRGLNNSFEAFDEGRDSDEKSASYCNDQFNGYTTTVEGVSDVNGRNSCRPRP 540
CAIRDSIEKRGL NSFE+F++G DS+EKS+S N+Q NG+TTT E VSD+NGR+S RPRP
Sbjct: 481 CAIRDSIEKRGLKNSFESFEKG-DSEEKSSSNQNNQVNGHTTTGERVSDINGRSSRRPRP 540
Query: 541 KVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLM 600
K+PGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLM
Sbjct: 541 KIPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLM 600
Query: 601 NLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKW 660
NLSLAYKHRKPKGYWDK DNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKW
Sbjct: 601 NLSLAYKHRKPKGYWDKLDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKW 660
Query: 661 GGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYVALNDVDAESKTPSKPYISQDTEKWLRGL 720
GGLHEVSRLLSLKVRHPNRQPSFAKDRK+DY+ +NDVDAESKTPSKPYISQDTEKWL GL
Sbjct: 661 GGLHEVSRLLSLKVRHPNRQPSFAKDRKHDYLGVNDVDAESKTPSKPYISQDTEKWLAGL 720
Query: 721 KHLDINWVE 728
K+LDINWVE
Sbjct: 721 KYLDINWVE 722
BLAST of Sgr019019 vs. ExPASy TrEMBL
Match:
A0A6J1DL18 (uncharacterized protein LOC111022083 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111022083 PE=3 SV=1)
HSP 1 Score: 1309.7 bits (3388), Expect = 0.0e+00
Identity = 661/737 (89.69%), Positives = 681/737 (92.40%), Query Frame = 0
Query: 1 MIICRALRLNLGTPLP---------PLTSGVFARQAQYCQT-SSLRLRNKWVSLSAAEGF 60
MI+CRALR NLGTP P PLTSGV+ARQA+YCQT SSL LR+K VSLSAAEGF
Sbjct: 1 MIVCRALRFNLGTPSPLPLPLPLPSPLTSGVYARQAEYCQTSSSLPLRSKCVSLSAAEGF 60
Query: 61 DWNSSDYFAKNCNLKRGSGFYGDRDDVEEGDGERERAVRCEVEVISWRERRIRADIFVNA 120
DW+SS+YFAKNCNLK SG + +D EG G+ ERAV CEV+VISWRERRIRADI VNA
Sbjct: 61 DWDSSEYFAKNCNLKSRSGGW---EDGGEGVGDGERAVHCEVKVISWRERRIRADILVNA 120
Query: 121 GIESVWNALTDYERLADFIPNLVSSGRIPCPHTGRIWLEQRGLQRALYWHIEARVVLDLQ 180
IESVWNALTDYERLADFIPNLVSSGRIPCPH GRIWLEQRGLQRALYWHIEARVVLDLQ
Sbjct: 121 AIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQ 180
Query: 181 ELLNSDGSRELHFSMVDGDFKKFEGKWSLKAGTRSLPTMLSYEVNVIPRFNFPAILLERI 240
ELLNSDGSRELHFSMVDGDFKKFEGKWS+KAGTRS PT LSYEVNVIPRFNFPAILLERI
Sbjct: 181 ELLNSDGSRELHFSMVDGDFKKFEGKWSIKAGTRSSPTTLSYEVNVIPRFNFPAILLERI 240
Query: 241 IRSDLPVNLRALACRAEENSEGGQRVGTIVDSKSMVQTNTVNGASCEKDELLQETSRGGN 300
IRSDLPVNLRALACRAEENSEGG+RVGT DSKSMV TNTVNGASCE DE LQETSR N
Sbjct: 241 IRSDLPVNLRALACRAEENSEGGRRVGTTEDSKSMVLTNTVNGASCENDE-LQETSRRSN 300
Query: 301 SISNLGPLPPLFNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASIT 360
S SNLGPLPPL NELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASIT
Sbjct: 301 SNSNLGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASIT 360
Query: 361 VKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLD 420
VKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLD
Sbjct: 361 VKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLD 420
Query: 421 LCELLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVV 480
LCE LEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVV
Sbjct: 421 LCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVV 480
Query: 481 YEDLPSNLCAIRDSIEKRGLNNSFEAFDEGRDSDEKSASYCNDQFNGYTTTVEGVSDVNG 540
YEDLPSNLCAIRDSIEKRG NNSFEAFDEGR S+EKSASY NDQ NGYT EGVSD NG
Sbjct: 481 YEDLPSNLCAIRDSIEKRGSNNSFEAFDEGRHSEEKSASYHNDQINGYTMKGEGVSDDNG 540
Query: 541 RNSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGG 600
+NSCRP+PKV GLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGG
Sbjct: 541 KNSCRPKPKVAGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGG 600
Query: 601 FRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYD 660
FRRIAS+MNLSLAYKHRKPKGYWDKFDNLQEEINRFQ SWGMDPSYMPSRKSFERAGRYD
Sbjct: 601 FRRIASIMNLSLAYKHRKPKGYWDKFDNLQEEINRFQTSWGMDPSYMPSRKSFERAGRYD 660
Query: 661 IARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYVALNDVDAESKTPSKPYISQD 720
IARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKND +A N DAE+KT S+PYISQD
Sbjct: 661 IARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDSLAFNGHDAENKTASRPYISQD 720
Query: 721 TEKWLRGLKHLDINWVE 728
TEKWL GLK+LDINWVE
Sbjct: 721 TEKWLSGLKYLDINWVE 733
BLAST of Sgr019019 vs. ExPASy TrEMBL
Match:
A0A0A0KYT4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G552160 PE=3 SV=1)
HSP 1 Score: 1287.3 bits (3330), Expect = 0.0e+00
Identity = 644/729 (88.34%), Positives = 672/729 (92.18%), Query Frame = 0
Query: 1 MIICRALRLNLGTPLPPLTSGVFARQAQYCQT--SSLRLRNKWVSLSAAEGFDWNSSDYF 60
MI+CRAL LG PL PLTSGV A Q +Y QT SSL LR K VSLSAA+GF+WN + YF
Sbjct: 1 MIVCRALSFTLGPPL-PLTSGVCATQTEYSQTSSSSLPLRTKCVSLSAADGFEWNPTQYF 60
Query: 61 AKNCNLKRGSGFYGDRDDVEEGDGERERAVRCEVEVISWRERRIRADIFVNAGIESVWNA 120
AK NLKR SG YG R+D EEG+ ERER VRCEVEV+SWRERRIRAD+FV++GIESVWN
Sbjct: 61 AKGSNLKRRSGVYGGREDGEEGEAERERDVRCEVEVVSWRERRIRADVFVHSGIESVWNV 120
Query: 121 LTDYERLADFIPNLVSSGRIPCPHTGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGS 180
LTDYERLADFIPNLVSSGRIPCPH GRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGS
Sbjct: 121 LTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGS 180
Query: 181 RELHFSMVDGDFKKFEGKWSLKAGTRSLPTMLSYEVNVIPRFNFPAILLERIIRSDLPVN 240
REL FSMVDGDFKKFEGKWS+ AGTRS PTMLSYEVNVIPRFNFPAILLE+IIRSDLPVN
Sbjct: 181 RELLFSMVDGDFKKFEGKWSINAGTRSSPTMLSYEVNVIPRFNFPAILLEKIIRSDLPVN 240
Query: 241 LRALACRAEENSEGGQRVGTIVDSKSMVQTNTVNGASCEKDELLQETSRGGNSISNLGPL 300
LRALA RAEE SEGGQRVG I DSK +V +NT+NGA+C KDE++QE SRGGNS SNLG +
Sbjct: 241 LRALAFRAEEKSEGGQRVGNIKDSKDVVLSNTLNGATCVKDEIVQENSRGGNSNSNLGSV 300
Query: 301 PPLFNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREV 360
PPL NELN+NWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREV
Sbjct: 301 PPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREV 360
Query: 361 WNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCELLEQE 420
WNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCE LEQE
Sbjct: 361 WNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQE 420
Query: 421 ISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNL 480
ISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNL
Sbjct: 421 ISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNL 480
Query: 481 CAIRDSIEKRGLNNSFEAFDEGRDSDEKSASYCNDQFNGYTTTVEGVSDVNGRNSCRPRP 540
CAIRDSIEKR L NSFEA D+G DS+EKS S N+Q NGYTTT EGVSD+NGR S RPRP
Sbjct: 481 CAIRDSIEKRVLKNSFEALDQG-DSEEKSVSRRNNQSNGYTTTAEGVSDINGRASFRPRP 540
Query: 541 KVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLM 600
KVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLM
Sbjct: 541 KVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLM 600
Query: 601 NLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKW 660
NLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKW
Sbjct: 601 NLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKW 660
Query: 661 GGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYVALNDVDAESKTPSKPYISQDTEKWLRGL 720
GGLHEVSRLLSLKVRHPNRQPSFAKDRK+DYV +ND D ESK PSKPYISQDTEKWL GL
Sbjct: 661 GGLHEVSRLLSLKVRHPNRQPSFAKDRKSDYVVVNDFDGESKAPSKPYISQDTEKWLTGL 720
Query: 721 KHLDINWVE 728
K+LDINWVE
Sbjct: 721 KYLDINWVE 727
BLAST of Sgr019019 vs. ExPASy TrEMBL
Match:
A0A1S3B5Y3 (uncharacterized protein LOC103486131 OS=Cucumis melo OX=3656 GN=LOC103486131 PE=3 SV=1)
HSP 1 Score: 1286.6 bits (3328), Expect = 0.0e+00
Identity = 645/730 (88.36%), Positives = 670/730 (91.78%), Query Frame = 0
Query: 1 MIICRALRLNLGTPLPPLTSGVFARQAQYCQT--SSLRLRNKWVSLSAAEGFDWNSSDYF 60
MI+CRAL LG PL PLTSGV+A Q +YCQT SSL LR K VSLSAA+GF+WNSS YF
Sbjct: 4 MIVCRALSFTLGPPL-PLTSGVYATQTEYCQTSSSSLPLRTKCVSLSAADGFEWNSSQYF 63
Query: 61 AKNCNLKRGSGFYGDRDDVEEGDGERERAVRCEVEVISWRERRIRADIFVNAGIESVWNA 120
AK NLKR SG YG R D EEG+ ERER VRCEVEV+SWRERRIRADIFV++GIESVWN
Sbjct: 64 AKGSNLKRQSGVYGGRRDGEEGEAERERDVRCEVEVVSWRERRIRADIFVHSGIESVWNV 123
Query: 121 LTDYERLADFIPNLVSSGRIPCPHTGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGS 180
LTDYERLADFIPNLVSSGRIPCPH GRIWLEQRGLQRALYWHIEARVVLDLQE LNSDGS
Sbjct: 124 LTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQEHLNSDGS 183
Query: 181 RELHFSMVDGDFKKFEGKWSLKAGTR-SLPTMLSYEVNVIPRFNFPAILLERIIRSDLPV 240
REL FSMVDGDFKKFEGKWS+KAGTR S PTMLSYEVNVIPRFNFPAILLERIIRSDLPV
Sbjct: 184 RELLFSMVDGDFKKFEGKWSIKAGTRSSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPV 243
Query: 241 NLRALACRAEENSEGGQRVGTIVDSKSMVQTNTVNGASCEKDELLQETSRGGNSISNLGP 300
NLRALACRAEE SEGGQRVG I DSK++V +NT+NGA+C KDE++QE SRGGNS SNLGP
Sbjct: 244 NLRALACRAEEKSEGGQRVGNIKDSKAVVLSNTLNGATCAKDEIVQENSRGGNSNSNLGP 303
Query: 301 LPPLFNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVRE 360
+PPL NELN+NWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVRE
Sbjct: 304 VPPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVRE 363
Query: 361 VWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCELLEQ 420
VWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCE LEQ
Sbjct: 364 VWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQ 423
Query: 421 EISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSN 480
EISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSN
Sbjct: 424 EISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSN 483
Query: 481 LCAIRDSIEKRGLNNSFEAFDEGRDSDEKSASYCNDQFNGYTTTVEGVSDVNGRNSCRPR 540
LCAIRDSIEKRGL NSFE +G ++ CN Q NGYTTT EGVS +NGR S RPR
Sbjct: 484 LCAIRDSIEKRGLKNSFEVLYQGNLEEKSVPRQCN-QSNGYTTTAEGVSAINGRASFRPR 543
Query: 541 PKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASL 600
PKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASL
Sbjct: 544 PKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASL 603
Query: 601 MNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEK 660
MNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEK
Sbjct: 604 MNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEK 663
Query: 661 WGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYVALNDVDAESKTPSKPYISQDTEKWLRG 720
WGGLHEVSRLLSLKVRHPNRQPSFAKDRK+DYV NDVD ESK PSKPYISQDTEKWL G
Sbjct: 664 WGGLHEVSRLLSLKVRHPNRQPSFAKDRKSDYVVANDVDGESKAPSKPYISQDTEKWLTG 723
Query: 721 LKHLDINWVE 728
LK+LDINWVE
Sbjct: 724 LKYLDINWVE 731
BLAST of Sgr019019 vs. ExPASy TrEMBL
Match:
A0A6J1HQY2 (uncharacterized protein LOC111465941 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111465941 PE=3 SV=1)
HSP 1 Score: 1286.2 bits (3327), Expect = 0.0e+00
Identity = 644/730 (88.22%), Positives = 677/730 (92.74%), Query Frame = 0
Query: 1 MIICRALRLNLGTPLPPLTSGVFARQAQYCQT---SSLRLRNKWVSLSAAEGFDWNSSDY 60
MI+CR LR NLG LPP SGV+ARQ +YC T SSL LR K VS+SAAEGFDWNSS+Y
Sbjct: 1 MIVCRPLRFNLGPSLPP-ASGVYARQPEYCLTSSSSSLSLRTKCVSVSAAEGFDWNSSEY 60
Query: 61 FAKNCNLKRGSGFYGDRDDVEEGDGERERAVRCEVEVISWRERRIRADIFVNAGIESVWN 120
F K+ +LKRGSG YG RD EG+GERER V CEVEV+SWRER+IRA IFVN+GIESVWN
Sbjct: 61 FTKSFSLKRGSGVYGGRDGNGEGEGERERDVYCEVEVVSWRERQIRASIFVNSGIESVWN 120
Query: 121 ALTDYERLADFIPNLVSSGRIPCPHTGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDG 180
ALTDYERLADFIPNLVSSGRIPCPH GRIWLEQRGLQRALYWHIEARVVLDLQELLNSDG
Sbjct: 121 ALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDG 180
Query: 181 SRELHFSMVDGDFKKFEGKWSLKAGTRSLPTMLSYEVNVIPRFNFPAILLERIIRSDLPV 240
SRELHFSMVDGDFKKFEGKWSLKAGTRS PT+LSYEVNVIPRFNFPAILLERIIRSDLPV
Sbjct: 181 SRELHFSMVDGDFKKFEGKWSLKAGTRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPV 240
Query: 241 NLRALACRAEENSEGGQRVGTIVDSKSMVQTNTVNGASCEKDELLQETSRGGNSISNLGP 300
NLRALACRAE +SEGGQRVG DSKSM+ +NT+NGA+CEKDELL E NS SNLG
Sbjct: 241 NLRALACRAEGSSEGGQRVGNSEDSKSMILSNTINGAACEKDELLLE-----NSSSNLGT 300
Query: 301 LPPLFNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVRE 360
LPPL NELNSNWGVFGKVC+LDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVRE
Sbjct: 301 LPPLSNELNSNWGVFGKVCKLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVRE 360
Query: 361 VWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCELLEQ 420
VWNVLTAYESLPEVVPNLAISKILSRESNKVRI+QEGCKGLLYMVLHARVVLDLCE LEQ
Sbjct: 361 VWNVLTAYESLPEVVPNLAISKILSRESNKVRIVQEGCKGLLYMVLHARVVLDLCEQLEQ 420
Query: 421 EISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSN 480
EISFEQVEGDFDSL+GKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSN
Sbjct: 421 EISFEQVEGDFDSLTGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSN 480
Query: 481 LCAIRDSIEKRGLNNSFEAFDEGRDSDEKSASYCNDQFNGYTTTVEGVSDVNGRNSCRPR 540
LCAIRDSIEKRGL NSFE+F++G DS+EKS+S N+QF G+TTT E VSD+NGR+S RPR
Sbjct: 481 LCAIRDSIEKRGLKNSFESFEKG-DSEEKSSSNQNNQFYGHTTTGERVSDINGRSSHRPR 540
Query: 541 PKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASL 600
K+PGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASL
Sbjct: 541 TKIPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASL 600
Query: 601 MNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEK 660
MNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEK
Sbjct: 601 MNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEK 660
Query: 661 WGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYVALNDVDAESKTPSKPYISQDTEKWLRG 720
WGGLHEVSRLLSLKVRHPNRQPSFAKDRK DY+ +NDVDAESKTPSKPYISQDTEKWL G
Sbjct: 661 WGGLHEVSRLLSLKVRHPNRQPSFAKDRKYDYLGVNDVDAESKTPSKPYISQDTEKWLAG 720
Query: 721 LKHLDINWVE 728
LK+LDINWVE
Sbjct: 721 LKYLDINWVE 723
BLAST of Sgr019019 vs. ExPASy TrEMBL
Match:
A0A6J1EAX7 (uncharacterized protein LOC111432394 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111432394 PE=3 SV=1)
HSP 1 Score: 1285.0 bits (3324), Expect = 0.0e+00
Identity = 643/730 (88.08%), Positives = 678/730 (92.88%), Query Frame = 0
Query: 1 MIICRALRLNLGTPLPPLTSGVFARQAQYCQT---SSLRLRNKWVSLSAAEGFDWNSSDY 60
MI+CR LR NLG LPP SGV+ARQ +YC T SSL LR K VS+SAAEGFDWNSS+Y
Sbjct: 1 MIVCRPLRFNLGPSLPP-ASGVYARQPEYCPTSSSSSLSLRTKCVSVSAAEGFDWNSSEY 60
Query: 61 FAKNCNLKRGSGFYGDRDDVEEGDGERERAVRCEVEVISWRERRIRADIFVNAGIESVWN 120
F K+ +LKRGSG YG RD EG+ ERER V CEVEV+SWRER+IRA+IFVN+GIESVWN
Sbjct: 61 FTKSFSLKRGSGVYGGRDGNGEGEVERERDVYCEVEVVSWRERQIRANIFVNSGIESVWN 120
Query: 121 ALTDYERLADFIPNLVSSGRIPCPHTGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDG 180
ALTDYERLADFIPNLVSSGRIPCPH GRIWLEQRGLQRALYWHIEARVVLDLQELLNSDG
Sbjct: 121 ALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDG 180
Query: 181 SRELHFSMVDGDFKKFEGKWSLKAGTRSLPTMLSYEVNVIPRFNFPAILLERIIRSDLPV 240
SRELHFSMVDGDFKKFEGKWSLKAGTRS PT+LSYEVNVIPRFNFPAILLERIIRSDLPV
Sbjct: 181 SRELHFSMVDGDFKKFEGKWSLKAGTRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPV 240
Query: 241 NLRALACRAEENSEGGQRVGTIVDSKSMVQTNTVNGASCEKDELLQETSRGGNSISNLGP 300
NLRALACRAE +SEGGQRVG DSKSM+ +NT+NGA+CEKDELLQE NS SNLG
Sbjct: 241 NLRALACRAEGSSEGGQRVGNSEDSKSMILSNTINGAACEKDELLQE-----NSSSNLGT 300
Query: 301 LPPLFNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVRE 360
LPPL NELNSNWGVFGKVC+LDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVRE
Sbjct: 301 LPPLSNELNSNWGVFGKVCKLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVRE 360
Query: 361 VWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCELLEQ 420
VWNVLTAYESLPEVVPNLAISKILSRESNKVRI+QEGCKGLLYMVLHARVVLDLCE LEQ
Sbjct: 361 VWNVLTAYESLPEVVPNLAISKILSRESNKVRIVQEGCKGLLYMVLHARVVLDLCEQLEQ 420
Query: 421 EISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSN 480
EISFEQVEGDFDSL+GKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSN
Sbjct: 421 EISFEQVEGDFDSLTGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSN 480
Query: 481 LCAIRDSIEKRGLNNSFEAFDEGRDSDEKSASYCNDQFNGYTTTVEGVSDVNGRNSCRPR 540
LCAIRDSIEKRGL NSFE+F++G DS+EKS+S N+QFN +TTT E VSDVNGR+S R R
Sbjct: 481 LCAIRDSIEKRGLKNSFESFEKG-DSEEKSSSNQNNQFNDHTTTGERVSDVNGRSSPRSR 540
Query: 541 PKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASL 600
PK+PGLQRD+EVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASL
Sbjct: 541 PKIPGLQRDVEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASL 600
Query: 601 MNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEK 660
MNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEK
Sbjct: 601 MNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEK 660
Query: 661 WGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYVALNDVDAESKTPSKPYISQDTEKWLRG 720
WGGLHEVSRLLSLKVRH NRQPSFAKDRKNDY+ +NDVD+ESKTPSKPYISQDTEKWL G
Sbjct: 661 WGGLHEVSRLLSLKVRHRNRQPSFAKDRKNDYLGVNDVDSESKTPSKPYISQDTEKWLAG 720
Query: 721 LKHLDINWVE 728
LK+LDINWVE
Sbjct: 721 LKYLDINWVE 723
BLAST of Sgr019019 vs. TAIR 10
Match:
AT5G08720.1 (CONTAINS InterPro DOMAIN/s: Streptomyces cyclase/dehydrase (InterPro:IPR005031); BEST Arabidopsis thaliana protein match is: Polyketide cyclase / dehydrase and lipid transport protein (TAIR:AT4G01650.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 904.8 bits (2337), Expect = 4.4e-263
Identity = 469/667 (70.31%), Positives = 531/667 (79.61%), Query Frame = 0
Query: 67 GSGFYGDRDDVEEGDGER-ERAVRCEVEVISWRERRIRADIFVNAGIESVWNALTDYERL 126
G G G R D G ER ER VRCEV+VISWRERRIR +I+V++ +SVWN LTDYERL
Sbjct: 63 GRGDNGLRRDSGLGFDERGERKVRCEVDVISWRERRIRGEIWVDSDSQSVWNVLTDYERL 122
Query: 127 ADFIPNLVSSGRIPCPHTGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSM 186
ADFIPNLV SGRIPCPH GRIWLEQRGLQRALYWHIEARVVLDL E L+S RELHFSM
Sbjct: 123 ADFIPNLVWSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLHECLDSPNGRELHFSM 182
Query: 187 VDGDFKKFEGKWSLKAGTRSLPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACR 246
VDGDFKKFEGKWS+K+G RS+ T+LSYEVNVIPRFNFPAI LERIIRSDLPVNLRA+A +
Sbjct: 183 VDGDFKKFEGKWSVKSGIRSVGTVLSYEVNVIPRFNFPAIFLERIIRSDLPVNLRAVARQ 242
Query: 247 AEENSEGGQRVGTIVDSKSMVQTNTVNGASCEKDELLQETSRGGNSISNLGPLPPLFNEL 306
AE+ + + I D ++ + E D L E S S++G L NEL
Sbjct: 243 AEKIYKDCGKPSIIEDLLGIISSQPAPSNGIEFDSLATERS----VASSVGSLAH-SNEL 302
Query: 307 NSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAY 366
N+NWGV+GK C+LDK C VDEVHLRRFDGLLENGGVHRC VASITVKAPV EVW VLT+Y
Sbjct: 303 NNNWGVYGKACKLDKPCTVDEVHLRRFDGLLENGGVHRCAVASITVKAPVCEVWKVLTSY 362
Query: 367 ESLPEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCELLEQEISFEQVE 426
ESLPE+VPNLAISKILSR++NKVRILQEGCKGLLYMVLHAR VLDL E+ EQEI FEQVE
Sbjct: 363 ESLPEIVPNLAISKILSRDNNKVRILQEGCKGLLYMVLHARAVLDLHEIREQEIRFEQVE 422
Query: 427 GDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSI 486
GDFDSL GKW FEQLGSHHTLLKY+VES+M KD+FLSEA+MEEV+YEDLPSNLCAIRD I
Sbjct: 423 GDFDSLEGKWIFEQLGSHHTLLKYTVESKMRKDSFLSEAIMEEVIYEDLPSNLCAIRDYI 482
Query: 487 EKRGLNNSFEAFDEGRDSDEKSASYCNDQFNGYTTTVEGV-SDVNGRNSCRPRPKVPGLQ 546
EKRG +S E E++ S + +VE V ++ +G + + R ++PGLQ
Sbjct: 483 EKRGEKSSESCKLETCQVSEETCS------SSRAKSVETVYNNDDGSDQTKQRRRIPGLQ 542
Query: 547 RDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNLSLAY 606
RDIEVLK+E+LKFISEHGQEGFMPMRKQLR+HGRVDIEKAITRMGGFRRIA +MNLSLAY
Sbjct: 543 RDIEVLKSEILKFISEHGQEGFMPMRKQLRLHGRVDIEKAITRMGGFRRIALMMNLSLAY 602
Query: 607 KHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEV 666
KHRKPKGYWD +NLQEEI RFQ+SWGMDPS+MPSRKSFERAGRYDIARALEKWGGLHEV
Sbjct: 603 KHRKPKGYWDNLENLQEEIGRFQQSWGMDPSFMPSRKSFERAGRYDIARALEKWGGLHEV 662
Query: 667 SRLLSLKVRHPNRQPSFAKDRKNDYVALNDVDAESKTP----SKPYISQDTEKWLRGLKH 726
SRLL+L VRHPNRQ + KD N + +A+ + +KPY+SQDTEKWL LK
Sbjct: 663 SRLLALNVRHPNRQLNSRKDNGNTILRTESTEADLNSTVNKNNKPYVSQDTEKWLYNLKD 718
Query: 727 LDINWVE 728
LDINWV+
Sbjct: 723 LDINWVQ 718
BLAST of Sgr019019 vs. TAIR 10
Match:
AT4G01650.1 (Polyketide cyclase / dehydrase and lipid transport protein )
HSP 1 Score: 99.0 bits (245), Expect = 1.7e-20
Identity = 71/203 (34.98%), Positives = 104/203 (51.23%), Query Frame = 0
Query: 73 DRDDVEEGDGERER------AVRCEVEVISWRERRIRADIFVNAGIESVWNALTDYERLA 132
D DD DG+ E V E++ + RRIR+ I + A ++SVW+ LTDYE+L+
Sbjct: 82 DEDDYCLTDGKTEELVVGDDGVLIELKKLEKSSRRIRSKIGMEASLDSVWSVLTDYEKLS 141
Query: 133 DFIPNLVSSGRIPCPHTGRIWLEQRGLQR-ALYWHIEARVVLDLQ----ELLNSDGSREL 192
DFIP LV S + R+ L Q G Q AL A+ VLD E+L RE+
Sbjct: 142 DFIPGLVVSELVE-KEGNRVRLFQMGQQNLALGLKFNAKAVLDCYEKELEVLPHGRRREI 201
Query: 193 HFSMVDGDFKKFEGKWSLKAGTRSL------------PTMLSYEVNVIPRFNFPAILLER 252
F MV+GDF+ FEGKWS++ + + T L+Y V+V P+ P L+E
Sbjct: 202 DFKMVEGDFQLFEGKWSIEQLDKGIHGEALDLQFKDFRTTLAYTVDVKPKMWLPVRLVEG 261
BLAST of Sgr019019 vs. TAIR 10
Match:
AT4G01650.2 (Polyketide cyclase / dehydrase and lipid transport protein )
HSP 1 Score: 97.1 bits (240), Expect = 6.4e-20
Identity = 68/197 (34.52%), Positives = 103/197 (52.28%), Query Frame = 0
Query: 77 VEEGDGER----ERAVRCEVEVISWRERRIRADIFVNAGIESVWNALTDYERLADFIPNL 136
+E+G E + V E++ + RRIR+ I + A ++SVW+ LTDYE+L+DFIP L
Sbjct: 11 LEDGKTEELVVGDDGVLIELKKLEKSSRRIRSKIGMEASLDSVWSVLTDYEKLSDFIPGL 70
Query: 137 VSSGRIPCPHTGRIWLEQRGLQR-ALYWHIEARVVLDLQ----ELLNSDGSRELHFSMVD 196
V S + R+ L Q G Q AL A+ VLD E+L RE+ F MV+
Sbjct: 71 VVSELVE-KEGNRVRLFQMGQQNLALGLKFNAKAVLDCYEKELEVLPHGRRREIDFKMVE 130
Query: 197 GDFKKFEGKWSLKAGTRSL------------PTMLSYEVNVIPRFNFPAILLERIIRSDL 253
GDF+ FEGKWS++ + + T L+Y V+V P+ P L+E + ++
Sbjct: 131 GDFQLFEGKWSIEQLDKGIHGEALDLQFKDFRTTLAYTVDVKPKMWLPVRLVEGRLCKEI 190
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022154935.1 | 0.0e+00 | 89.69 | uncharacterized protein LOC111022083 isoform X1 [Momordica charantia]
XP_038882723.1 | 0.0e+00 | 89.44 | uncharacterized protein LOC120073881 [Benincasa hispida]
XP_011654397.2 | 0.0e+00 | 88.61 | uncharacterized protein LOC101212159 [Cucumis sativus] >KAE8649758.1 hypothetica...
XP_008442209.1 | 0.0e+00 | 88.36 | PREDICTED: uncharacterized protein LOC103486131 [Cucumis melo]
XP_023517467.1 | 0.0e+00 | 88.34 | uncharacterized protein LOC111781223 [Cucurbita pepo subsp. pepo]
Match Name | E-value | Identity (%) | Description
A0A6J1DL18 | 0.0e+00 | 89.69 | uncharacterized protein LOC111022083 isoform X1 OS=Momordica charantia OX=3673 G...
A0A0A0KYT4 | 0.0e+00 | 88.34 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G552160 PE=3 SV=1
A0A1S3B5Y3 | 0.0e+00 | 88.36 | uncharacterized protein LOC103486131 OS=Cucumis melo OX=3656 GN=LOC103486131 PE=...
A0A6J1HQY2 | 0.0e+00 | 88.22 | uncharacterized protein LOC111465941 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1EAX7 | 0.0e+00 | 88.08 | uncharacterized protein LOC111432394 isoform X1 OS=Cucurbita moschata OX=3662 GN...
Match Name | E-value | Identity (%) | Description
AT5G08720.1 | 4.4e-263 | 70.31 | CONTAINS InterPro DOMAIN/s: Streptomyces cyclase/dehydrase (InterPro:IPR005031);...
AT4G01650.1 | 1.7e-20 | 34.98 | Polyketide cyclase / dehydrase and lipid transport protein
AT4G01650.2 | 6.4e-20 | 34.52 | Polyketide cyclase / dehydrase and lipid transport protein
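The Identity value in each summary row is the identical-residue count divided by the alignment length, as reported on the statistics line of the corresponding HSP (for example, 645/730 gives 88.36). The Python sketch below recomputes both percentages from such a line. The function name `parse_hsp_stats` and the returned keys are hypothetical and are not part of the database's own tooling; the pattern matches the positives label loosely so that spelling variants of it are accepted.

```python
import re

# Hypothetical helper for illustration; assumes the statistics-line layout
# used in the HSP records on this page.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract the counts from an HSP statistics line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identities, ident_len, _, positives, pos_len, _ = match.groups()
    identities, ident_len = int(identities), int(ident_len)
    positives, pos_len = int(positives), int(pos_len)
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": ident_len,
        # Percent identity = identical residues / aligned columns (gaps included).
        "identity_pct": round(100 * identities / ident_len, 2),
        # Percent positives also counts conservative substitutions.
        "positive_pct": round(100 * positives / pos_len, 2),
    }


if __name__ == "__main__":
    example = "Identity = 469/667 (70.31%), Positives = 531/667 (79.61%), Query Frame = 0"
    print(parse_hsp_stats(example))
```

Comparing the recomputed `identity_pct` with the Identity column is a quick consistency check when these records are re-exported or merged.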