Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCGATTCCAGAGGCATATCTGTACGTTGTCGTTTAGATTGTCGCAAAAGGTATCTATCAATCTAACTATTAACGCTTTTGGGAACTGAAAAAAATGGAACCCAGAAATGAATTGCTGGCATTATATTATTCCGCACATATAAAAATTAAATGAATAGTGTCTAGTCAGACAGACTCAGAACAGGGGCTGGGGCTGGGTATTATCATTAATTAATTAACTAAATTCATATATATATATATATATATATTTTATAAATATTATTAACATTCCATTCGACAACAGAGGGGAGAGCCGTAGCGACGCTCCGGGAGTAGCGCGTGAGGTACAAGCGCGGAAGGAAGGAATCGACTTTGCTTCTTTGGTACCCGGTTAATAAGGCGCCAAATGGAAATGGGAGCGTCGTGCGTAAAAGCAGCTGCGTGTGTGAGCTGTTTCTTACGTGCTCCAACAAATCATTTCACGTTTTGCCACCCCATCTCGTCAAACCAACTGTATTTCGACCCATTTTTCTGTTTTTTTTTCCCTTGGTTATTATTTATAAAAAAAAAAAAATAATAGCTCCTTTTCTTTCCTTCTTAAAAGTAGAATACATAAGGAAAATTTATTTTTGCATTCGTTTTGTTAAATCAATCTTATTTCACTTTGGGTTTGATTTAATATAAATTTTAAATATTTTTATATATAAAAATGAGTTCAAAAAAGTAATTGGAAAAATTGTCCGAGAAAATTCTCTCCCCACGACTTCATTTATGTAATAAAAGTTAAATCATAGATGTTGAAAATTTTGAAATCTCAATTTTATGGAAATGTCTAATAATTTATCTTTTTAAAGAAACCATCAAAACGAATCGACTGATAAAAATGAGTTTAATAAGTGAATAAAAATTTCACCTTCGTAAACAAGTAATAAATTATAACTCTTTTGTTGTTAGTAAATCAATATTAGCTCAATTTATTGAAACATTAGTAACTTATTTTATAGGTTGAAGGTTTGAATCGATATCATTTTGTTTGTATTGTAATATTCATTAAAAAAAATTGCATTCATATCAGTTATACTTTTTATCAATATTTTAAATGCTTCTAAACATTTTTTATATATAAATTGCCATGTTTATGATATTTGAAAAAATTCTTCAAGTTTTTATTTTTTTATTTTTTTTATGAGTTCAACAACTGTGCAGTGAAAGATCGAACCATCAACCTTTAAGATGAAAATAAGTGCATTATCCACTGAGCTATGTTCGAATTGACAAACTCGTTGAGGTTATATCAATGTTGAACCCCCGTCAATTTTTAAAAACATCCATGGATAAAAATTTACGTTGCGCATGTATTTAACACCAATGAATTTTAAGAATATATATAAAAGATTATTAAATATACTATATGTTTAAGTTTTTTGATTTAAAGATAATTTAATATGGTAGAATGGACGGTCTTGAGTTCAAATCCATAAACACCATTAACAATACAATGTCATCTAGTTGGTCATCGTAGTTAAGTTTTTTTACTGCAATTCAAAAAATTAGGTTATTCCTGAACCTGAAACCAAACTTCGTATATATATATAGCTTAAAATACAAGTCTACGTAATGAGATGCAAAATCTTTGTTACATGCAATACAAACTTTTAACAAAAAATGTTAAAATATAATTGACTAAATTTATGTTTTAGTCTCTAATATTTGTATATTTTTTCAATTTAGTTTATATTGTTTAAAAAGTTTCAAATTAATCTCTTATATTTTAATATTTTTCAATTATGTCTTTCACGTCGATAAGTGTTAAAATTGGCTAATGATAATCTTATATGATATGATATTTAGTGAATTGACAAAAATTTAGAGTAGAAATTAGACCACCTATAAGAAGAAAAATTGTAATTTTTGCCAACTTCTAAGAAATGTCAATGAGCAATTTGAAAGTTTCCTCTTTTAATTGACCTAATCCCCTCTTTAAAAAAACAATTTTTTTCTTAAAAGTAGAATACACAGGAAAAATTATTTTTGCATTAGTTTTAATTTAAATCAATTTTATTTTCCTTTGGGTTTGATTTAATAAAAATATTAAAGAATTTTTTATTTATATAAAAATGAAAAAAATAATTGAGAAAATTAACCGAGAAAATTCTCTCCTCTCACAACCTCATTTATGTAATTAAAGTTAAAACATAGATGTTGAAAATTTTGAAATCACGCTTTTATGGAAATGTCTCACAATTTGTTTTTTTAAAGAAACCATTAAAACGAATGAACTGATAAAAATTAATTTAATCAATGAATAAAAACTTCACCTTCATAATCAAGTCATAGGTTATAAATGTTTGTTATTAGTAAATCAACATAGTTCAATTGATTATGACATTAGTAATTTTTTTTTATAAAATTTTTTTAAGTTCAATAATTATAAGAGTGGGGGACGGAACCATGGACCTCTAAGATGTCAATAAATGTCTTATCTATTAAACTATGCTCGAATTGGCATTTTTTACAGGTTAAAGGTTTGAATCTTTACTTCTTTTTTTGAATTGTTAATACTCATTAAAAAATTACATTCATATCAATGATAATTTTTATCAATATTTAAATGTTTAAAACATTTCTTTTTATATAAATCGATATTTTGCGATATTTTGAAACAATTCATTGAGGTTATATCAATGTTGAACCCCCTTCAATTTTTTAAAACATCAATGAATAAAATTCACCTTGTGCATGTTTTAGCTAATATGAACATATGATGTGATATGATATCTTCAATATTGTCAATAGCGTTGCATAGCTCAGCGATTAAGACATATCTCTACTTCCTCTGAAGACTTGTAGGTTTGAATCTCCAACTCCACAATTGTGATGTGATGTTCTCAAAAAAATACTGGTGTATTTTATATATATATATAAAAGATTAAATATACTATAACTTATATATTTAAATTATTTTGATTTAAAGATAATTTAATATGATATAAGAATAGGAGGTATTGAGTTCGAATACCTACTATTGACAATACAATGTCTTCTATATTTGTCATTGTAGTTAAGTTTTTTCAGTGCAATTAAAAAAAATTAAGTTATTCCTAAACCTGAAAGCAAACTTTGAATATATATATAACTTAAAATACAAGTCTACGTGATGAGATGCAAAATCTTTTTTACATGCAATACAAAATTTTAACAAAAATGCTAAAATATAATTTTATGTAAAGTTTAACAAAAAAAATAAAACATCTAAAGCTATAAAGATAAAATTAAAAATTCAGAAAAAAAAACTTCATGAAATTATTACGAGGAAAAAAGTTTATTGACGGGAGGGTGGGTGGGACCGGGTCTTGAAAAGGCATAATATGCCATGTACTTTACCGCGCTTTATTACTCCTATTAATATTAATCGTTGCCGTTTCCTTTATTTTATTTCTTATTTTTAAAAATAGTTTTGCCATTTAATAATCATTTTACTATTATTATTTTTAAATTATTACGAGAAAATTGATGGCTTTCAATTCCTACTTTTTGCGTCGACACATTCCTATTCATGATTGCATCCAACGAACGACGCCATTTCCAATTTCTTCCACATTACTAAATTTTCCAGTAAACACCTTGTTTGTTTGTTTAACTGCACTCCCTGACTCGCCGCCCCACTTCTCGCCGGCGCTCTCCTCCGTTCAACTTCCGTCGCCATTTCCGTCTCTCTTCTCAATGGTGCGATCTCCATTCCACAGCTTCCTCTCTCCGTGTTTTGTTTTCCGGATTCAACTTTTAGTAGCTTTTCCGATGTAATGCTGTGCAATTTCGTTGCTCTTCTGTTCGATTACTTCGGAAATTTTCAATTGAGCTAATTGGTTGACGAGGCAGGTGCTGATGCTGAGACTGCGTTTTGTTTTCTTGTTTGAGCGTGAATTTTGCGTGCTGATCAAGAACTTGAATGATTGAACTACGATATATGCAAATTTCACCACTCTCAATAACCATTTTTTCCTGACAATTGTTTATGATCGTGTGAACTCGTTGCTTGTCAAATTATGTTGTAGGTGGCTGGTTTTGACGAAATAAAGTTATTAATGAGTCCTGTTTTGCGGTAGACTTCTTGTTTGCTATGCTCTGTACCACGAATTTCCTTGAAGACTTTTAATATAATATTTGACCTGCTCTTCCTTCTATGGATCTTGCTGTGGTCTCTCTTGTCTCTCTCCGATTTCTGTAACAGTTGTGCTTGTTATGTTTCCTCTTGATCTTTTTGTACATTACTGAGCAATTGACTTTCTCCGCTTACTGATTCCGGAAAACCAAAATATCAAGAAAATGAGAAGAGATTGTGTTTTGTATATGAAGAATTTATTTTTCGCAATTTCTATGTAGGGTGCAGTGTAGATTCCGTTGCTCATATTCATACACGATATTAATTTCTCCAGGATATGGTTTGAATGCTTGTCGCCTCTTGTTGTTTCTCGGAAGGATGTTTCAAACAAATTATCGCTATTAAACAGCTAACGTAACCAGAGTTCCATTACCGCAACAAACTGTATTGGTTAGCAATTAGCATGATATATTAAGTGGTTGTAAATTTAAATACTGTTTGCGTACTCGGATTGTATGGTTCTGTCGATGATATATATTTTTGTTGGTTGCAGTTTGCCAGTTTATAAACTTCGTACCTCATTTTTTTTCCTTTTTTTTTTTTTTTCTTTTTCAAGGGGAAACAAGGACATTCAGATCGATTTGTACAGAGTATGAAAGAACCATAACAATGATAGCCTCATGGTAGTTCCACAAGCTGCATGGGAGGCTTGTTGAATCCTCTAGACTTTGACCACAGAAGCATGGCCAAGAAAGTCTTTAATCAAAAGAGTCGTAATGGTGGTATGATTGATGTTATTATGCTCCTTGGTTATAAGAAGAACATGTGCGAGTTCTGCTCAAAGAGCCATTTTCGTTTTAAATGTATCTGGCATTAAGTTGAGATTTCAAAATATCAGAATATGTAGTGTTGTGCTATCTGGAGAGTTAGTTTAGCAATAATCATGACTAGAAAAATTGTTCTTGGATGACTGAGCCTACAGTAATATTGGTTTGCATATGGAACCGTGTTTCTTTTTCTTTCATAAACCAGGTTTTCTAAAATTAATAATTCATACGGTTTTTGGAAATTCTGGTGTAGTGAATGTTAAAACTTTCTGTCAGTCCATCTAGATATAAGGACACTGACTTGAGTAGAAATTAATCATAGGATAAATCATCTTTCTACAAAAAATTTAATGTTAACAACAGTTGTTATTTGAGGCGGTTTTATCCAAATTTAATTTGCATTTTTCTGCAGGCCTGGAAACCCCTCGAAATAGCCTGGAGCTGCAGATAGAGAGTTCCCATAACTATTGTGCTGCAGAAGAAATACCGGTAAATCTGCAGTTATAACTCTTCACGTCAAATTATCTTGTTAAGGTTTTTTGTTACTTTTGCGAATATACTACTAGAAATAACTTGCAGCTTTCGCCCCAATGCAAACTAGGTCTTAGAAACGAGTCTGACTATAGAATTGGACCATCTAAGTTTAAGATATCAAATTTTGATTTGTCTGGCAAAGGTGTTCTGTTTGGTATTGTAGCTCCACTCAACATCATTGCAGAATCTTTTTGAAAATTTGAATCGGTATTTTCCAGCCTTAAATTTATTCTAAAATAATGGTTAGTATACAAGTTTAAAAAAAAAAAAAAGGCTTCATGTTCCTTATTTTCTGTCGGTAACCAAGTTGGATTTTGGACTGACCTATCAGCTGTTGTGTATTAGTAGTTAGACTGCCACCAAGAAATTGTGTCGGATGGTATTACCTATGGCCTGTACTCTCTTACTTAATAAATTCAATACCATAGGTTCCTAAAACTAATTGACTTGCAACTATAATAGTTCATATACACTCGCCTAGGGACTCAACTGCCTATGTCACAAAAATTATATGAAGAAAATGGAAAAGTCAACCCTGCCCACCTGTTAATTACTAAGTTAAAGAACAAAAAGGAAATATATGCACAGGCGAATCTTGTTACCAAACCACATCCTAATTGGAGTATTTTCTAAAAATAGTTTCACTCTCATTCTCATATCAGCATTCTATTTCTGAAAATGATCTCCTTTTCAAGTGTCTATCCTCTAATATTCTCTTATTCTTTTCTAATGATCTTATTCCCCAGTACCAATATACTTTCCATGAGTGCTTTTCCATATCGTTTCCTTCGCTATCTCTCTAATCAACATATTTATGTGATTGTTGCTGAATTTTCAGTACTTCTACCAAATTGATGAAGTGTTTTCTGACAAGGACTATTTTAAAAATGAGGCTTCAATGAAGAAATTAATTGATAAGGAAATGTCCACGCGCATAAATGCCAGACATAATGGACCAAGCATTGTTGCTCGACTCATGGGGATGGAAATGTTGCCCTTGGATGCAAAAGATGAAGTTCAGCTACGTGACAAAAGGCGTAATAGCAAGGGAGTCAAGACTTTAAATAAAGAAAGTACTGGCAGGGGATTGCATTCTCAGGCATCCTCCAAATCGAATTCTTCGAAGCAGATGGACCTGCACTTGTCTTATCATGATAATGACACGGATGCTGATCGATGGAGCAGCAGTCAGAAGATGGGAAAACCACGCCGTCGGGAACATCCTCAAGAGGAGGAGTTACAAAAGTTTAAGAAGGAATTTGAAGCATGGCAGGCTGCAAGGTTTAGGGAGTGTTCAAGGGTTATTGAAGTTAGTAGCATCAACAGACAGTCACTTGCTCAGGAAGGCCTTGCCAAGGAAACGATGGCACTTAGTGCAAACACGAGGAAAATATCGAGTCAGAAGCTCTCAGCAGAACCTAAAGGTTCGACAGTGGAGATAAAATCTTATAGAAGTGTTGGTGTGGATGATGGTACTAGGGGGGAAACATTCCCAGCTGAGCAGAGGGGATCTTTTTCTTTGAGAAGCAAATTCATGGATGCAGATTTTGAGCACCCTTGCCTGATAAGTTGTGATCAGAAGACAGACAAATCACGTGGCCCAACAAAGATAGTGATCTTGAAGCCTGGTCCTGATAAGATGTGCCTCCATGAAGAGCACTGGACAAATTCCTCAGGGACCTTAGGAGAAAGAGTTAGTATTGAAGCTTTTCTTGAAGAGGTCAAGGAGCGGCTGAAATGCGAATTACAAGGGAAAACTTTTAAAAAGGGTTCTGCTGTTCGTGGAAGTGGAATAGAGACACCATATAGTGAGAAACCATCTCACTCAAGACAAATAGCTCGGAACATAGCAACACAGGTCAGAGATAGTGTCACCAGAGACGTTGAAATGAATTTACTTCGTTCAGAATCCACGAGATCATACAAAAGCGAAATTCAGTTTAATGGGTTAGGTTCCCCTGAATTCATACATAAAGATACCAGAAGATTCTTGTCAGAGAGACTGAGAAATGTTCAAAGGAAAGATTCAGACCTGGATAGTGGCAGCTCTAGGTCATCTGTATATGATCATGAAAGAGCTACGAAGCAAGTAGAAACTACTTCGACCAGTGGAAAACATACAAACTACTGGGAATTACTTAGAGATGAAGAAGAAACACAAACTAGATCTTTCAGGCATGAAGCAGACGAAAATGAGGTTCTTCCCAAAGAATTGTCTCCTAGGAATCTCACCAGGTCGTTATCAGCTCCAGTGTCAGGAACATCATTTGGGAAGCTTCTTCTGGAGGACCGCCACATTTTAACCGGTGTCCACATTCAGAGAAAACATGAAGCAAGTGATCATGCGGCGGTGAATATTAAAAAGCAGAAGAAAGAGAGGTTTAATTTTAAAGAAAAAGTATCCAATTTCAGATATAATTTCACTCTAAGAGGGAAGCTGTTTGGCAGAAAGACTCAATCGATTAGTGGATTGCATACTTCCGACCTATACTCTACCAAAGACATCTTGAGTGGACCAACTGTTGTAATGAACTCTGGAGAGCGCCACGAAAGGGTATAATAATTTAGTTATTTCAGTTCTGGTTCCACACTTTATCCTCACTAATCATTTTTCTTTCTTATCCCAGGAGAATTTCACTGAGGTGCCTCCTAGTCCTGCTTCTGTGTGCAGCAGTGTCCAAGAAGAGTTCTGGAAGTTAACTGATCACCACAGCCCAATATCCACTTCAGATGTCACTCCTAGAGATGAGAACTGTGTTTCCCAGGTCTTTAGGGAAATCAGCTCTAATTTGAAAGGTATGTGGAATTGCGTTTAAGCATAATCAATTTTACTGTGTGTTAACTGTTATCAATGTCATATGTAAAAAAGTCTTTCTCCTTTATTTCTGGATTTGTTTATTTTACAAGATCATATCTCTCGGGATATTATTCAGTGATAAACTGTTTCTTCTTGTGTTATCATGCCTTTAAATACAGTGGTTAATTGCCATTTTGATTACTTGCATCTTAAGTCTGTTCTCCTCAAATACTGCCCTCAAAAGTTTGTCTGCTTCTGCTTATGTATCTTGGTTGATTTACAGATTTCTAAAAAATCCTAGTAAACGTATTAACATGATGTATAATTTGTCTAGCATGAAAATTGTCGTACATAAAGAATTACGGATATAGTAAACGTCAAATTGGCATTACTAGCTAAAACATGTCATTCTTTGACTGACATTTATCAAATAAATACAAGTTAGGGGATTGATAGAGTTGAAATATTTGATTCCATTTTTTTTTTTACACTACAGAACTCCGAAGACAGCTGAATCAACTTGAGTCGGATGATTTTGAGGACAAAGTGGTACAGCAGCAGCCCGTTGAGTCTGAAATCACAAAACTTGAAGATCCAGCAGAAGCTTACATACGAGACCTTCTTATTGTTTCTGGTTTGTATGATGGATCAACTGATAACAACTTTTCACGCAATAACACAGCTGCAAAGCCTATCAACAACGCGATTTTTGAGGAAGTGGAAGAAGCTTATAGAAAATCTGAGACGAAAAATGAAATCATCGAGAAGGAGCCGAACGAAAACAGTGTAGATCACAAATTATTATTTGATCTGTTGAACGAAGCACTTCCAATCGTACTTGCACCACGTTTGACAATGTCCAGATTTAGAAGAAACATTACTAACTCCTCTATGCCGCCGCCTTTGTTTGGAAAAAGATTATTGGATTCTGTATGGGATATCATCCTCCAGTTTACACACCCTCCAACTGACAGATCTTACTACTTGCTTGATGGAGTGATGGCACGAGATTTAAATTCGACACCGTGGTCGTCATTAATGGATGATGAGATTAACACGACTGGAAGGGAGGTGGAAGGTCTGATCATCAAGGATTTGTTTGAAGAAGTTGTGAAGGATTTGCGAAAATGA
mRNA sequence
ATGCCGATTCCAGAGGCATATCTAGGGGAGAGCCGTAGCGACGCTCCGGGAGTAGCGCGTGAGGTACAAGCGCGGAAGGAAGGAATCGACTTTGCTTCTTTGGTACCCGGCCTGGAAACCCCTCGAAATAGCCTGGAGCTGCAGATAGAGAGTTCCCATAACTATTGTGCTGCAGAAGAAATACCGTACTTCTACCAAATTGATGAAGTGTTTTCTGACAAGGACTATTTTAAAAATGAGGCTTCAATGAAGAAATTAATTGATAAGGAAATGTCCACGCGCATAAATGCCAGACATAATGGACCAAGCATTGTTGCTCGACTCATGGGGATGGAAATGTTGCCCTTGGATGCAAAAGATGAAGTTCAGCTACGTGACAAAAGGCGTAATAGCAAGGGAGTCAAGACTTTAAATAAAGAAAGTACTGGCAGGGGATTGCATTCTCAGGCATCCTCCAAATCGAATTCTTCGAAGCAGATGGACCTGCACTTGTCTTATCATGATAATGACACGGATGCTGATCGATGGAGCAGCAGTCAGAAGATGGGAAAACCACGCCGTCGGGAACATCCTCAAGAGGAGGAGTTACAAAAGTTTAAGAAGGAATTTGAAGCATGGCAGGCTGCAAGGTTTAGGGAGTGTTCAAGGGTTATTGAAGTTAGTAGCATCAACAGACAGTCACTTGCTCAGGAAGGCCTTGCCAAGGAAACGATGGCACTTAGTGCAAACACGAGGAAAATATCGAGTCAGAAGCTCTCAGCAGAACCTAAAGGTTCGACAGTGGAGATAAAATCTTATAGAAGTGTTGGTGTGGATGATGGTACTAGGGGGGAAACATTCCCAGCTGAGCAGAGGGGATCTTTTTCTTTGAGAAGCAAATTCATGGATGCAGATTTTGAGCACCCTTGCCTGATAAGTTGTGATCAGAAGACAGACAAATCACGTGGCCCAACAAAGATAGTGATCTTGAAGCCTGGTCCTGATAAGATGTGCCTCCATGAAGAGCACTGGACAAATTCCTCAGGGACCTTAGGAGAAAGAGTTAGTATTGAAGCTTTTCTTGAAGAGGTCAAGGAGCGGCTGAAATGCGAATTACAAGGGAAAACTTTTAAAAAGGGTTCTGCTGTTCGTGGAAGTGGAATAGAGACACCATATAGTGAGAAACCATCTCACTCAAGACAAATAGCTCGGAACATAGCAACACAGGTCAGAGATAGTGTCACCAGAGACGTTGAAATGAATTTACTTCGTTCAGAATCCACGAGATCATACAAAAGCGAAATTCAGTTTAATGGGTTAGGTTCCCCTGAATTCATACATAAAGATACCAGAAGATTCTTGTCAGAGAGACTGAGAAATGTTCAAAGGAAAGATTCAGACCTGGATAGTGGCAGCTCTAGGTCATCTGTATATGATCATGAAAGAGCTACGAAGCAAGTAGAAACTACTTCGACCAGTGGAAAACATACAAACTACTGGGAATTACTTAGAGATGAAGAAGAAACACAAACTAGATCTTTCAGGCATGAAGCAGACGAAAATGAGGTTCTTCCCAAAGAATTGTCTCCTAGGAATCTCACCAGGTCGTTATCAGCTCCAGTGTCAGGAACATCATTTGGGAAGCTTCTTCTGGAGGACCGCCACATTTTAACCGGTGTCCACATTCAGAGAAAACATGAAGCAAGTGATCATGCGGCGGTGAATATTAAAAAGCAGAAGAAAGAGAGGTTTAATTTTAAAGAAAAAGTATCCAATTTCAGATATAATTTCACTCTAAGAGGGAAGCTGTTTGGCAGAAAGACTCAATCGATTAGTGGATTGCATACTTCCGACCTATACTCTACCAAAGACATCTTGAGTGGACCAACTGTTGTAATGAACTCTGGAGAGCGCCACGAAAGGGAGAATTTCACTGAGGTGCCTCCTAGTCCTGCTTCTGTGTGCAGCAGTGTCCAAGAAGAGTTCTGGAAGTTAACTGATCACCACAGCCCAATATCCACTTCAGATGTCACTCCTAGAGATGAGAACTGTGTTTCCCAGGTCTTTAGGGAAATCAGCTCTAATTTGAAAGAACTCCGAAGACAGCTGAATCAACTTGAGTCGGATGATTTTGAGGACAAAGTGGTACAGCAGCAGCCCGTTGAGTCTGAAATCACAAAACTTGAAGATCCAGCAGAAGCTTACATACGAGACCTTCTTATTGTTTCTGGTTTGTATGATGGATCAACTGATAACAACTTTTCACGCAATAACACAGCTGCAAAGCCTATCAACAACGCGATTTTTGAGGAAGTGGAAGAAGCTTATAGAAAATCTGAGACGAAAAATGAAATCATCGAGAAGGAGCCGAACGAAAACAGTGTAGATCACAAATTATTATTTGATCTGTTGAACGAAGCACTTCCAATCGTACTTGCACCACGTTTGACAATGTCCAGATTTAGAAGAAACATTACTAACTCCTCTATGCCGCCGCCTTTGTTTGGAAAAAGATTATTGGATTCTGTATGGGATATCATCCTCCAGTTTACACACCCTCCAACTGACAGATCTTACTACTTGCTTGATGGAGTGATGGCACGAGATTTAAATTCGACACCGTGGTCGTCATTAATGGATGATGAGATTAACACGACTGGAAGGGAGGTGGAAGGTCTGATCATCAAGGATTTGTTTGAAGAAGTTGTGAAGGATTTGCGAAAATGA
Coding sequence (CDS)
ATGCCGATTCCAGAGGCATATCTAGGGGAGAGCCGTAGCGACGCTCCGGGAGTAGCGCGTGAGGTACAAGCGCGGAAGGAAGGAATCGACTTTGCTTCTTTGGTACCCGGCCTGGAAACCCCTCGAAATAGCCTGGAGCTGCAGATAGAGAGTTCCCATAACTATTGTGCTGCAGAAGAAATACCGTACTTCTACCAAATTGATGAAGTGTTTTCTGACAAGGACTATTTTAAAAATGAGGCTTCAATGAAGAAATTAATTGATAAGGAAATGTCCACGCGCATAAATGCCAGACATAATGGACCAAGCATTGTTGCTCGACTCATGGGGATGGAAATGTTGCCCTTGGATGCAAAAGATGAAGTTCAGCTACGTGACAAAAGGCGTAATAGCAAGGGAGTCAAGACTTTAAATAAAGAAAGTACTGGCAGGGGATTGCATTCTCAGGCATCCTCCAAATCGAATTCTTCGAAGCAGATGGACCTGCACTTGTCTTATCATGATAATGACACGGATGCTGATCGATGGAGCAGCAGTCAGAAGATGGGAAAACCACGCCGTCGGGAACATCCTCAAGAGGAGGAGTTACAAAAGTTTAAGAAGGAATTTGAAGCATGGCAGGCTGCAAGGTTTAGGGAGTGTTCAAGGGTTATTGAAGTTAGTAGCATCAACAGACAGTCACTTGCTCAGGAAGGCCTTGCCAAGGAAACGATGGCACTTAGTGCAAACACGAGGAAAATATCGAGTCAGAAGCTCTCAGCAGAACCTAAAGGTTCGACAGTGGAGATAAAATCTTATAGAAGTGTTGGTGTGGATGATGGTACTAGGGGGGAAACATTCCCAGCTGAGCAGAGGGGATCTTTTTCTTTGAGAAGCAAATTCATGGATGCAGATTTTGAGCACCCTTGCCTGATAAGTTGTGATCAGAAGACAGACAAATCACGTGGCCCAACAAAGATAGTGATCTTGAAGCCTGGTCCTGATAAGATGTGCCTCCATGAAGAGCACTGGACAAATTCCTCAGGGACCTTAGGAGAAAGAGTTAGTATTGAAGCTTTTCTTGAAGAGGTCAAGGAGCGGCTGAAATGCGAATTACAAGGGAAAACTTTTAAAAAGGGTTCTGCTGTTCGTGGAAGTGGAATAGAGACACCATATAGTGAGAAACCATCTCACTCAAGACAAATAGCTCGGAACATAGCAACACAGGTCAGAGATAGTGTCACCAGAGACGTTGAAATGAATTTACTTCGTTCAGAATCCACGAGATCATACAAAAGCGAAATTCAGTTTAATGGGTTAGGTTCCCCTGAATTCATACATAAAGATACCAGAAGATTCTTGTCAGAGAGACTGAGAAATGTTCAAAGGAAAGATTCAGACCTGGATAGTGGCAGCTCTAGGTCATCTGTATATGATCATGAAAGAGCTACGAAGCAAGTAGAAACTACTTCGACCAGTGGAAAACATACAAACTACTGGGAATTACTTAGAGATGAAGAAGAAACACAAACTAGATCTTTCAGGCATGAAGCAGACGAAAATGAGGTTCTTCCCAAAGAATTGTCTCCTAGGAATCTCACCAGGTCGTTATCAGCTCCAGTGTCAGGAACATCATTTGGGAAGCTTCTTCTGGAGGACCGCCACATTTTAACCGGTGTCCACATTCAGAGAAAACATGAAGCAAGTGATCATGCGGCGGTGAATATTAAAAAGCAGAAGAAAGAGAGGTTTAATTTTAAAGAAAAAGTATCCAATTTCAGATATAATTTCACTCTAAGAGGGAAGCTGTTTGGCAGAAAGACTCAATCGATTAGTGGATTGCATACTTCCGACCTATACTCTACCAAAGACATCTTGAGTGGACCAACTGTTGTAATGAACTCTGGAGAGCGCCACGAAAGGGAGAATTTCACTGAGGTGCCTCCTAGTCCTGCTTCTGTGTGCAGCAGTGTCCAAGAAGAGTTCTGGAAGTTAACTGATCACCACAGCCCAATATCCACTTCAGATGTCACTCCTAGAGATGAGAACTGTGTTTCCCAGGTCTTTAGGGAAATCAGCTCTAATTTGAAAGAACTCCGAAGACAGCTGAATCAACTTGAGTCGGATGATTTTGAGGACAAAGTGGTACAGCAGCAGCCCGTTGAGTCTGAAATCACAAAACTTGAAGATCCAGCAGAAGCTTACATACGAGACCTTCTTATTGTTTCTGGTTTGTATGATGGATCAACTGATAACAACTTTTCACGCAATAACACAGCTGCAAAGCCTATCAACAACGCGATTTTTGAGGAAGTGGAAGAAGCTTATAGAAAATCTGAGACGAAAAATGAAATCATCGAGAAGGAGCCGAACGAAAACAGTGTAGATCACAAATTATTATTTGATCTGTTGAACGAAGCACTTCCAATCGTACTTGCACCACGTTTGACAATGTCCAGATTTAGAAGAAACATTACTAACTCCTCTATGCCGCCGCCTTTGTTTGGAAAAAGATTATTGGATTCTGTATGGGATATCATCCTCCAGTTTACACACCCTCCAACTGACAGATCTTACTACTTGCTTGATGGAGTGATGGCACGAGATTTAAATTCGACACCGTGGTCGTCATTAATGGATGATGAGATTAACACGACTGGAAGGGAGGTGGAAGGTCTGATCATCAAGGATTTGTTTGAAGAAGTTGTGAAGGATTTGCGAAAATGA
Protein sequence
MPIPEAYLGESRSDAPGVAREVQARKEGIDFASLVPGLETPRNSLELQIESSHNYCAAEEIPYFYQIDEVFSDKDYFKNEASMKKLIDKEMSTRINARHNGPSIVARLMGMEMLPLDAKDEVQLRDKRRNSKGVKTLNKESTGRGLHSQASSKSNSSKQMDLHLSYHDNDTDADRWSSSQKMGKPRRREHPQEEELQKFKKEFEAWQAARFRECSRVIEVSSINRQSLAQEGLAKETMALSANTRKISSQKLSAEPKGSTVEIKSYRSVGVDDGTRGETFPAEQRGSFSLRSKFMDADFEHPCLISCDQKTDKSRGPTKIVILKPGPDKMCLHEEHWTNSSGTLGERVSIEAFLEEVKERLKCELQGKTFKKGSAVRGSGIETPYSEKPSHSRQIARNIATQVRDSVTRDVEMNLLRSESTRSYKSEIQFNGLGSPEFIHKDTRRFLSERLRNVQRKDSDLDSGSSRSSVYDHERATKQVETTSTSGKHTNYWELLRDEEETQTRSFRHEADENEVLPKELSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDHAAVNIKKQKKERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHTSDLYSTKDILSGPTVVMNSGERHERENFTEVPPSPASVCSSVQEEFWKLTDHHSPISTSDVTPRDENCVSQVFREISSNLKELRRQLNQLESDDFEDKVVQQQPVESEITKLEDPAEAYIRDLLIVSGLYDGSTDNNFSRNNTAAKPINNAIFEEVEEAYRKSETKNEIIEKEPNENSVDHKLLFDLLNEALPIVLAPRLTMSRFRRNITNSSMPPPLFGKRLLDSVWDIILQFTHPPTDRSYYLLDGVMARDLNSTPWSSLMDDEINTTGREVEGLIIKDLFEEVVKDLRK
Homology
BLAST of Sgr029742 vs. NCBI nr
Match:
XP_022145277.1 (uncharacterized protein LOC111014768 isoform X1 [Momordica charantia] >XP_022145278.1 uncharacterized protein LOC111014768 isoform X1 [Momordica charantia])
HSP 1 Score: 1442.9 bits (3734), Expect = 0.0e+00
Identity = 756/869 (87.00%), Postives = 796/869 (91.60%), Query Frame = 0
Query: 37 GLETPRNSLELQIESSHNYCAAEEIPYFYQIDEVFSDKDYFKNEASMKKLIDKEMSTRIN 96
GLETPRNSLEL +ESS NYCAA+EI Y YQIDEVF DKDYFKNE+SMKKLIDKEMSTR N
Sbjct: 28 GLETPRNSLELHLESSQNYCAAKEISYSYQIDEVFCDKDYFKNESSMKKLIDKEMSTRTN 87
Query: 97 ARHNGPSIVARLMGMEMLPLDAKDEVQLRDKRRNSKGVKTLNKESTGRGLHSQASSKSNS 156
RHNGPSIVARLMGM+MLPLDAKDEV+L DKR NSKGVKTLNKESTGRGL S SSKSN
Sbjct: 88 PRHNGPSIVARLMGMDMLPLDAKDEVELSDKRHNSKGVKTLNKESTGRGLPSHVSSKSNY 147
Query: 157 SKQMDLHLSYHDNDTDADRWSSSQKMGKPRRREHPQEEELQKFKKEFEAWQAARFRECSR 216
SKQMDLH SYHDND DAD+WSSSQKMGKP RREHPQEEELQKFKKEFEAWQA+RFR CSR
Sbjct: 148 SKQMDLHSSYHDNDQDADQWSSSQKMGKPCRREHPQEEELQKFKKEFEAWQASRFRHCSR 207
Query: 217 VIEVSSINRQSLAQEGLAKETMALSANTRKISSQKLSAEPKGSTVEIKSYRSVGVDDGTR 276
VIEVSSINR+S+AQ E MAL+ NT KISSQKL AE +G VE+KS RSVG+DDGT+
Sbjct: 208 VIEVSSINRRSMAQ-----EEMALNGNTGKISSQKLPAESEG-PVEMKSRRSVGLDDGTK 267
Query: 277 GETFPAE--QRGSFSLRSKFMDADFEHPCLISCDQKTDKSRGPTKIVILKPGPDKMCLHE 336
ETF AE QRGSFSLRSK MDADFEHPCLISCD+KTDK GPTKIVILKPGPDKMCLHE
Sbjct: 268 RETFRAEQTQRGSFSLRSKSMDADFEHPCLISCDRKTDKLLGPTKIVILKPGPDKMCLHE 327
Query: 337 EHWTNSSGTLGERVSIEAFLEEVKERLKCELQGKTFKKGSAVRGSGIETPYSEKPSHSRQ 396
EHWTNSSGTLGERVSIE FLEEVKERL+CELQGKTFKKG+A RGSGIETPYSEKPSHSRQ
Sbjct: 328 EHWTNSSGTLGERVSIEDFLEEVKERLRCELQGKTFKKGTAARGSGIETPYSEKPSHSRQ 387
Query: 397 IARNIATQVRDSVTRDVEMNLLRSESTRSYKSEIQFNGLGSPEFIHKDTRRFLSERLR-N 456
IARNIATQVRDS+TRD ++LLRSESTRS KSEIQFN L SPEF++KDTRRFLSER+R N
Sbjct: 388 IARNIATQVRDSITRDTGISLLRSESTRSCKSEIQFNALDSPEFLNKDTRRFLSERMRNN 447
Query: 457 VQRKDSDLDSGSSRSSVYDHERATKQVETTSTSGKHTNYWELLRDEEETQTRSFRHEADE 516
VQ KDSDLDSGSSRSSVYD ER TKQVETT TS KHTNYWE+LRD EE QTRSFRHEAD
Sbjct: 448 VQSKDSDLDSGSSRSSVYDQERVTKQVETTLTSEKHTNYWEILRDSEEMQTRSFRHEADV 507
Query: 517 NEVLPKELSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDHAAVNIKKQK 576
NEVLPKELSPRNLTRS+SAPV+GTSFGKLLLEDRHILTGVHIQRKHEASDH A NIKKQK
Sbjct: 508 NEVLPKELSPRNLTRSVSAPVAGTSFGKLLLEDRHILTGVHIQRKHEASDHVA-NIKKQK 567
Query: 577 KERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHTSDLYSTKDILSGPTVVMNSGERHE 636
KERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHT+DLYST+DILSGPTVVMNSGERHE
Sbjct: 568 KERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHTTDLYSTRDILSGPTVVMNSGERHE 627
Query: 637 RENFTEVPPSPASVCSSVQEEFWKLTDHHSPISTSDVTPRDENCVSQVFREISSNLKELR 696
RENFTEVPPSPASVCSSVQEEFWK +DHHSPISTSDVTPRDENCVSQVFR+ISSNLKELR
Sbjct: 628 RENFTEVPPSPASVCSSVQEEFWKFSDHHSPISTSDVTPRDENCVSQVFRDISSNLKELR 687
Query: 697 RQLNQLESDDFEDKVVQQQPVESEITKLEDPAEAYIRDLLIVSGLYDGSTDNNFSRNNTA 756
RQLNQLESDDFEDK V+QQPVESEITKLEDPAEAY+RDLLIVSG+YDGST NNFSRNNTA
Sbjct: 688 RQLNQLESDDFEDK-VEQQPVESEITKLEDPAEAYVRDLLIVSGMYDGSTGNNFSRNNTA 747
Query: 757 AKPINNAIFEEVEEAYRKSETKNEIIEKEPNENSVDHKLLFDLLNEALPIVLAPRLTMSR 816
AKPI+NAIFEEVEEAYRKSE KNE IEKE NE SVDHKLLFDLLNEALP+ LAP LTMSR
Sbjct: 748 AKPISNAIFEEVEEAYRKSERKNETIEKEQNEYSVDHKLLFDLLNEALPLALAPCLTMSR 807
Query: 817 FRRNITNSSM-PPPLFGKRLLDSVWDIILQFTHPPTDRSYYLLDGVMARDLNSTPWSSLM 876
FR + NSS PPPLFGK+LLDSVWDII +FTHPPTDRSYYLLDGVMARDLNSTPWSSLM
Sbjct: 808 FRTKVINSSTPPPPLFGKKLLDSVWDIIHKFTHPPTDRSYYLLDGVMARDLNSTPWSSLM 867
Query: 877 DDEINTTGREVEGLIIKDLFEEVVKDLRK 902
DDE+NTTGREVEGLII DL EE+VKD RK
Sbjct: 868 DDEVNTTGREVEGLIINDLVEEIVKDFRK 888
BLAST of Sgr029742 vs. NCBI nr
Match:
XP_038904709.1 (uncharacterized protein LOC120091008 isoform X1 [Benincasa hispida])
HSP 1 Score: 1425.6 bits (3689), Expect = 0.0e+00
Identity = 745/868 (85.83%), Postives = 796/868 (91.71%), Query Frame = 0
Query: 37 GLETPRNSLELQIESSHNYCAAEEIPYFYQIDEVFSDKDYFKNEASMKKLIDKEMSTRIN 96
GLETPRNSLELQ+ESS NYCAAEEIPY YQIDEVFSDKDY KNEASMKKLIDKE+STR N
Sbjct: 28 GLETPRNSLELQMESSQNYCAAEEIPYSYQIDEVFSDKDYLKNEASMKKLIDKEISTRTN 87
Query: 97 ARHNGPSIVARLMGMEMLPLDAKDEVQLRDKRRNSKGVKTLNKESTGRGLHSQASSKSNS 156
RHNGPSIVARLMGM+MLPLDAKD V+L DKRRN+KGVKT N+ES GR +S ASSKSNS
Sbjct: 88 VRHNGPSIVARLMGMDMLPLDAKDVVELSDKRRNTKGVKTSNRESNGRS-NSHASSKSNS 147
Query: 157 SKQMDLHLSYHDNDTDADRWSSSQKMGKPRRREHPQEEELQKFKKEFEAWQAARFRECSR 216
SKQMDL+ SY DND DRWSSSQKMGK RREHPQEEELQKFKKEFEAWQAARFRECSR
Sbjct: 148 SKQMDLNSSYQDNDKGDDRWSSSQKMGKSHRREHPQEEELQKFKKEFEAWQAARFRECSR 207
Query: 217 VIEVSSINRQSLAQEGLAKETMALSANTRKISSQKLSAEPKGSTVEIKSYRSVGVDDGTR 276
VIEVSSINR+SLAQ+ LAKE MAL+ANTR+I SQK+SAEPKGSTVE+KSYR++ +DDG +
Sbjct: 208 VIEVSSINRRSLAQDDLAKEKMALNANTRRILSQKVSAEPKGSTVEMKSYRNIDLDDGVK 267
Query: 277 GETFPAEQRGSFSLRSKFMDADFEHPCLISCDQKTDKSRGPTKIVILKPGPDKMCLHEEH 336
ETFPAEQRGSFSLRSK MDADFEHPC+ISCDQK DKSRGPTKIVILKPGPDKM LHEEH
Sbjct: 268 RETFPAEQRGSFSLRSKSMDADFEHPCMISCDQK-DKSRGPTKIVILKPGPDKMYLHEEH 327
Query: 337 WTNSSGTLGERVSIEAFLEEVKERLKCELQGKTFKKGSAVRGSGIETPYSEKPSHSRQIA 396
W NSSGTLGERVSIE FL+EVKERL+CELQGKT KKG A RGSGIETPYSE+PSH+RQIA
Sbjct: 328 WKNSSGTLGERVSIEDFLDEVKERLRCELQGKTLKKGYAARGSGIETPYSERPSHTRQIA 387
Query: 397 RNIATQVRDSVTRDVEMNLLRSESTRSYKSEIQFNGLGSPEFIHKDTRRFLSERLRNVQR 456
+NIATQVRD+VTRD+ +NLLRSESTRSY SEIQFNGL SPEFI+KDTRR LSERLRNVQR
Sbjct: 388 QNIATQVRDNVTRDIGINLLRSESTRSYNSEIQFNGLDSPEFINKDTRRLLSERLRNVQR 447
Query: 457 KD--SDLDSGSSRSSVYDHERATKQVETTSTSGKHTNYWELLRDEEETQTRSFRHEADEN 516
KD SDLDSGSSRSSV DHER QVETT +GK ++YWE LRD E QTRSFRHEAD+N
Sbjct: 448 KDSNSDLDSGSSRSSVCDHERVVNQVETTLKNGKRSSYWEALRDTEVIQTRSFRHEADQN 507
Query: 517 EVLPKELSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDHAAVNIKKQKK 576
E LPKELSPRNLTRSLSAPVSGTSFGKLLLEDRHILTG HIQRKHEASD AV++KKQKK
Sbjct: 508 EALPKELSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGAHIQRKHEASD-VAVSVKKQKK 567
Query: 577 ERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHTSDLYSTKDILSGPTVVMNSGERHER 636
ERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLH+++LYS+KDILSGPTVVMNSGERHER
Sbjct: 568 ERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHSANLYSSKDILSGPTVVMNSGERHER 627
Query: 637 ENFTEVPPSPASVCSSVQEEFWKLTDHHSPISTSDVTPRDENCVSQVFREISSNLKELRR 696
ENFTEVPPSPASVCSSVQEEFWKL+DHHSPISTSDVTPR+ENCVSQVFREISSNLKELRR
Sbjct: 628 ENFTEVPPSPASVCSSVQEEFWKLSDHHSPISTSDVTPREENCVSQVFREISSNLKELRR 687
Query: 697 QLNQLESDDFEDKVVQQQPVESEITKLEDPAEAYIRDLLIVSGLYDGSTDNNFSRNNTAA 756
QLNQL+SDD EDK V+QQPVESEI KLEDPAEAYIRDLLIVSG+YDGSTDNNFSRNN A
Sbjct: 688 QLNQLDSDDIEDK-VEQQPVESEIAKLEDPAEAYIRDLLIVSGMYDGSTDNNFSRNNAAT 747
Query: 757 KPINNAIFEEVEEAYRKSETKNEIIEKEPNENSVDHKLLFDLLNEALPIVLAPRLTMSRF 816
KPI+NAIFEEVEEAYRKSETKNEII KE NENSV H++LFDLLNEALPIVLAP LTMSRF
Sbjct: 748 KPISNAIFEEVEEAYRKSETKNEIIGKEQNENSVGHQMLFDLLNEALPIVLAPCLTMSRF 807
Query: 817 RRNITNSSMP-PPLFGKRLLDSVWDIILQFTHPPTDRSYYLLDGVMARDLNSTPWSSLMD 876
RR +TNSSMP PLFGK+LLDSVWD+I +F HP TDRSYYLLDGVMARDLNS PWSSLMD
Sbjct: 808 RRKVTNSSMPLRPLFGKKLLDSVWDVIRKFVHPSTDRSYYLLDGVMARDLNSIPWSSLMD 867
Query: 877 DEINTTGREVEGLIIKDLFEEVVKDLRK 902
DE+NTTGREVEGLIIKDL EEVVKDL K
Sbjct: 868 DEVNTTGREVEGLIIKDLVEEVVKDLLK 891
BLAST of Sgr029742 vs. NCBI nr
Match:
XP_008448479.1 (PREDICTED: uncharacterized protein LOC103490651 [Cucumis melo] >XP_008448480.1 PREDICTED: uncharacterized protein LOC103490651 [Cucumis melo] >XP_008448481.1 PREDICTED: uncharacterized protein LOC103490651 [Cucumis melo])
HSP 1 Score: 1423.3 bits (3683), Expect = 0.0e+00
Identity = 733/868 (84.45%), Postives = 793/868 (91.36%), Query Frame = 0
Query: 37 GLETPRNSLELQIESSHNYCAAEEIPYFYQIDEVFSDKDYFKNEASMKKLIDKEMSTRIN 96
GLETPRNSLELQ+ESS NYCA EEIPY YQIDEVFSDKDY KNEASMKKLID+E+STR N
Sbjct: 28 GLETPRNSLELQMESSQNYCAVEEIPYSYQIDEVFSDKDYLKNEASMKKLIDREISTRTN 87
Query: 97 ARHNGPSIVARLMGMEMLPLDAKDEVQLRDKRRNSKGVKTLNKESTGRGLHSQASSKSNS 156
+HNGPSIVARLMGM+MLPLDAKD V+L DKR NSKGVKT NKES GRGLH ASSKSN
Sbjct: 88 VKHNGPSIVARLMGMDMLPLDAKDVVELSDKRHNSKGVKTSNKESNGRGLHFLASSKSNH 147
Query: 157 SKQMDLHLSYHDNDTDADR--WSSSQKMGKPRRREHPQEEELQKFKKEFEAWQAARFREC 216
SKQMDLH SYHDND DADR WSS QKMGK RREHPQEEELQKFKKEFEAWQAARFREC
Sbjct: 148 SKQMDLHSSYHDNDKDADRDDWSSDQKMGKSHRREHPQEEELQKFKKEFEAWQAARFREC 207
Query: 217 SRVIEVSSINRQSLAQEGLAKETMALSANTRKISSQKLSAEPKGSTVEIKSYRSVGVDDG 276
SRVIEVSSINR+SL QE LAKE + ++ANTR+ SSQK+SAEPKGSTVE+KSYRS+G+DD
Sbjct: 208 SRVIEVSSINRRSLKQEDLAKEKITINANTRRTSSQKVSAEPKGSTVEMKSYRSIGLDDC 267
Query: 277 TRGETFPAEQRGSFSLRSKFMDADFEHPCLISCDQKTDKSRGPTKIVILKPGPDKMCLHE 336
+ ETFPAEQRG+FSLRSK MDADFEHPCLIS DQK DKS GPTKIVILKPGPDKMC+HE
Sbjct: 268 VKRETFPAEQRGTFSLRSKSMDADFEHPCLISYDQK-DKSHGPTKIVILKPGPDKMCVHE 327
Query: 337 EHWTNSSGTLGERVSIEAFLEEVKERLKCELQGKTFKKGSAVRGSGIETPYSEKPSHSRQ 396
EHW NSSG LGERVSIE FL+EVKERL+CELQGKTFKKG VRGSGIETPYSE+PSH RQ
Sbjct: 328 EHWKNSSGNLGERVSIEDFLDEVKERLRCELQGKTFKKGYTVRGSGIETPYSERPSHRRQ 387
Query: 397 IARNIATQVRDSVTRDVEMNLLRSESTRSYKSEIQFNGLGSPEFIHKDTRRFLSERLRNV 456
IA+NIATQVRDSVTRD+ +NLLRSESTRSY SE+QF GL SPEF++KDTRR LSERLRNV
Sbjct: 388 IAQNIATQVRDSVTRDIGINLLRSESTRSYNSEVQFIGLDSPEFVNKDTRRLLSERLRNV 447
Query: 457 QRKDSDLDSGSSRSSVYDHERATKQVETTSTSGKHTNYWELLRDEEETQTRSFRHEADEN 516
+ KD DLDSGSSRSSV DHER QVETT T+GKHT+YWE+LRD EE QTRSFRHEA++N
Sbjct: 448 RSKDPDLDSGSSRSSVCDHERVMNQVETTLTNGKHTDYWEVLRDAEEIQTRSFRHEANQN 507
Query: 517 EVLPKELSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDHAAVNIKKQKK 576
EVLPKELSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEA DH A++ KKQKK
Sbjct: 508 EVLPKELSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEAGDHVAMSSKKQKK 567
Query: 577 ERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHTSDLYSTKDILSGPTVVMNSGERHER 636
ERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLH+++LYS+KDILSGPTVVMNSGERHER
Sbjct: 568 ERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHSANLYSSKDILSGPTVVMNSGERHER 627
Query: 637 ENFTEVPPSPASVCSSVQEEFWKLTDHHSPISTSDVTPRDENCVSQVFREISSNLKELRR 696
ENFTEVPPSPASVCSSVQEEFWKL+DH SPISTSDVTPR+E CVSQVFREISSNLKELRR
Sbjct: 628 ENFTEVPPSPASVCSSVQEEFWKLSDHQSPISTSDVTPREEKCVSQVFREISSNLKELRR 687
Query: 697 QLNQLESDDFEDKVVQQQPVESEITKLEDPAEAYIRDLLIVSGLYDGSTDNNFSRNNTAA 756
QLNQL+SDD EDK V+QQPVESEITKLEDPAEAYIRDLLIVSG+YDGSTDNNF+RNN A
Sbjct: 688 QLNQLDSDDIEDK-VEQQPVESEITKLEDPAEAYIRDLLIVSGMYDGSTDNNFTRNNAAT 747
Query: 757 KPINNAIFEEVEEAYRKSETKNEIIEKEPNENSVDHKLLFDLLNEALPIVLAPRLTMSRF 816
KPI++AIFEEVEEAYRKSETKNEII KE +ENSVDHK+LFDLLNEALPIVLAP LT+S+F
Sbjct: 748 KPISDAIFEEVEEAYRKSETKNEIIGKEQSENSVDHKMLFDLLNEALPIVLAPCLTLSKF 807
Query: 817 RRNITNSSMPP-PLFGKRLLDSVWDIILQFTHPPTDRSYYLLDGVMARDLNSTPWSSLMD 876
+R + NSSMPP PLFGK+LLD VWD+I +F HP TDRSYYLLDGVMARDLNSTPWSSL+D
Sbjct: 808 KRKVINSSMPPRPLFGKKLLDPVWDVIRKFIHPSTDRSYYLLDGVMARDLNSTPWSSLVD 867
Query: 877 DEINTTGREVEGLIIKDLFEEVVKDLRK 902
DE+NTTGREVE LI+KDL EE+VKDL K
Sbjct: 868 DEVNTTGREVEALIMKDLVEEIVKDLLK 893
BLAST of Sgr029742 vs. NCBI nr
Match:
XP_011650257.1 (uncharacterized protein LOC101212814 isoform X1 [Cucumis sativus] >XP_031738431.1 uncharacterized protein LOC101212814 isoform X1 [Cucumis sativus] >XP_031738432.1 uncharacterized protein LOC101212814 isoform X1 [Cucumis sativus] >KGN55611.1 hypothetical protein Csa_011398 [Cucumis sativus])
HSP 1 Score: 1411.4 bits (3652), Expect = 0.0e+00
Identity = 728/868 (83.87%), Postives = 788/868 (90.78%), Query Frame = 0
Query: 37 GLETPRNSLELQIESSHNYCAAEEIPYFYQIDEVFSDKDYFKNEASMKKLIDKEMSTRIN 96
GLETPRNSLELQ+ESS NYCA EEIPY YQIDEVFSDKDY KNEASMKKLID+E+STR N
Sbjct: 28 GLETPRNSLELQMESSQNYCAVEEIPYSYQIDEVFSDKDYLKNEASMKKLIDREISTRTN 87
Query: 97 ARHNGPSIVARLMGMEMLPLDAKDEVQLRDKRRNSKGVKTLNKESTGRGLHSQASSKSNS 156
+HNGPSIVARLMGM+MLPLDAKD V+L DKR NSKGVKT NKES GRGLHS ASSKSN
Sbjct: 88 VKHNGPSIVARLMGMDMLPLDAKDVVELSDKRHNSKGVKTSNKESNGRGLHSLASSKSNY 147
Query: 157 SKQMDLHLSYHDNDTDA--DRWSSSQKMGKPRRREHPQEEELQKFKKEFEAWQAARFREC 216
SKQMDLH SYHDND DA DRW SSQKMG R+EHPQEEELQKFKKEFEAWQAARFREC
Sbjct: 148 SKQMDLHSSYHDNDKDADRDRWGSSQKMGVSHRQEHPQEEELQKFKKEFEAWQAARFREC 207
Query: 217 SRVIEVSSINRQSLAQEGLAKETMALSANTRKISSQKLSAEPKGSTVEIKSYRSVGVDDG 276
SRVIEVSSINR+S+AQE LAKE +A++ANTR+ SSQK+SAEPKGSTVE+KSY+S+G+DD
Sbjct: 208 SRVIEVSSINRRSVAQENLAKEKIAINANTRRTSSQKVSAEPKGSTVEMKSYKSIGLDDC 267
Query: 277 TRGETFPAEQRGSFSLRSKFMDADFEHPCLISCDQKTDKSRGPTKIVILKPGPDKMCLHE 336
+ ETFPAEQRG+FSLRSK MDADFEHPCLISCDQK DKS GPTKIVILKPGPDKMC+HE
Sbjct: 268 VKRETFPAEQRGTFSLRSKAMDADFEHPCLISCDQK-DKSHGPTKIVILKPGPDKMCVHE 327
Query: 337 EHWTNSSGTLGERVSIEAFLEEVKERLKCELQGKTFKKGSAVRGSGIETPYSEKPSHSRQ 396
EHW NSSG LGERVSIE FL+EVKERL+CELQGK+FKKG RGSGIETPYSE+PSH RQ
Sbjct: 328 EHWKNSSGNLGERVSIEDFLDEVKERLRCELQGKSFKKGYTARGSGIETPYSERPSHRRQ 387
Query: 397 IARNIATQVRDSVTRDVEMNLLRSESTRSYKSEIQFNGLGSPEFIHKDTRRFLSERLRNV 456
IA+NIATQVRDSVTRD+ +NLLRSESTRSY SE+QF GL SPEF+ KDTRR L+ERLRNV
Sbjct: 388 IAQNIATQVRDSVTRDIGINLLRSESTRSYNSEVQFIGLDSPEFVSKDTRRLLAERLRNV 447
Query: 457 QRKDSDLDSGSSRSSVYDHERATKQVETTSTSGKHTNYWELLRDEEETQTRSFRHEADEN 516
+ KDSDLDSGSSRSSV DHER QVETT T+GKH +YWE+LRD EE QTRSFRHEA++N
Sbjct: 448 RSKDSDLDSGSSRSSVCDHERVMNQVETTLTNGKHRDYWEVLRDAEEIQTRSFRHEANQN 507
Query: 517 EVLPKELSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDHAAVNIKKQKK 576
EVLPKELSP NLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDH A++ KKQKK
Sbjct: 508 EVLPKELSPMNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDHVAMSCKKQKK 567
Query: 577 ERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHTSDLYSTKDILSGPTVVMNSGERHER 636
ERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLH+++LYS+KDILSGPTVVMNSGERHER
Sbjct: 568 ERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHSANLYSSKDILSGPTVVMNSGERHER 627
Query: 637 ENFTEVPPSPASVCSSVQEEFWKLTDHHSPISTSDVTPRDENCVSQVFREISSNLKELRR 696
ENFTEVPPSPASVCSSVQEEFWKL+DHHSPISTSDVTPR+EN VSQVFREISSNLKELRR
Sbjct: 628 ENFTEVPPSPASVCSSVQEEFWKLSDHHSPISTSDVTPREENSVSQVFREISSNLKELRR 687
Query: 697 QLNQLESDDFEDKVVQQQPVESEITKLEDPAEAYIRDLLIVSGLYDGSTDNNFSRNNTAA 756
QLNQL+SDD EDK V+QQPVESEITKLEDPAEAYIRDLLIVSG+YDGSTDNNF+RNN
Sbjct: 688 QLNQLDSDDIEDK-VEQQPVESEITKLEDPAEAYIRDLLIVSGMYDGSTDNNFTRNNADT 747
Query: 757 KPINNAIFEEVEEAYRKSETKNEIIEKEPNENSVDHKLLFDLLNEALPIVLAPRLTMSRF 816
K I+NAIFEEVEEAYRKSE KNEII KE +ENSVDHK+LFDLLNE LPIVLAP LT+S+F
Sbjct: 748 KSISNAIFEEVEEAYRKSEIKNEIIGKEQSENSVDHKMLFDLLNEVLPIVLAPCLTLSKF 807
Query: 817 RRNITNSSMPP-PLFGKRLLDSVWDIILQFTHPPTDRSYYLLDGVMARDLNSTPWSSLMD 876
RR + NSSMPP PL GK+LLD VWD+I +F HP TDRSYYLLDGVMARDLNSTPWSSL D
Sbjct: 808 RRKVINSSMPPRPLLGKKLLDPVWDVIRKFIHPSTDRSYYLLDGVMARDLNSTPWSSLRD 867
Query: 877 DEINTTGREVEGLIIKDLFEEVVKDLRK 902
DEINT GREVE LI+KDL EE+VKDL K
Sbjct: 868 DEINTIGREVEALIMKDLVEEIVKDLLK 893
BLAST of Sgr029742 vs. NCBI nr
Match:
XP_023539226.1 (uncharacterized protein LOC111799930 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1402.5 bits (3629), Expect = 0.0e+00
Identity = 736/867 (84.89%), Postives = 784/867 (90.43%), Query Frame = 0
Query: 37 GLETPRNSLELQIESSHNYCAAEEIPYFYQIDEVFSDKDYFKNEASMKKLIDKEMSTRIN 96
GLETPRNS+ELQ+ESS +YC AEEIPY YQIDEVFSDKDY KNE SMKKLIDKEMSTR +
Sbjct: 28 GLETPRNSMELQMESSRSYCTAEEIPYSYQIDEVFSDKDYLKNETSMKKLIDKEMSTRTS 87
Query: 97 ARHNGPSIVARLMGMEMLPLDAKDEVQLRDKRRNSKGVKTLNKESTGRGLHSQASSKSNS 156
A+H+GPSIVARLMGM+MLPLDAKDEV+L DKR NSKGVKT +KE GRGLHS ASSKSNS
Sbjct: 88 AKHHGPSIVARLMGMDMLPLDAKDEVELSDKRHNSKGVKTSSKEINGRGLHSYASSKSNS 147
Query: 157 SKQMDLHLSYHDNDTDADRW-SSSQKMGKPRRREHPQEEELQKFKKEFEAWQAARFRECS 216
KQMD+H SYHDND DADRW S+SQKMG+P RREHPQEEELQKFKKEFEAWQAARFRECS
Sbjct: 148 YKQMDVHSSYHDNDKDADRWRSTSQKMGRPHRREHPQEEELQKFKKEFEAWQAARFRECS 207
Query: 217 RVIEVSSINRQSLAQEGLAKETMALSANTRKISSQKLSAEPKGSTVEIKSYRSVGVDDGT 276
RVIE SSINRQSLAQ+ AKE M L+ N RKISS KLSAE KG TV +KSY+ V +D G
Sbjct: 208 RVIEASSINRQSLAQDD-AKE-MELNVNRRKISSPKLSAESKGPTVGMKSYKRVDLDGGI 267
Query: 277 RGETFPAEQRGSFSLRSKFMDADFEHPCLISCDQKTDKSRGPTKIVILKPGPDKMCLHEE 336
+ ETFP EQRG FSLRSK MDADFEHPCLIS DQK DK GPTKIVILKPGPDKMCLHEE
Sbjct: 268 KRETFPGEQRGPFSLRSKSMDADFEHPCLISSDQK-DKLLGPTKIVILKPGPDKMCLHEE 327
Query: 337 HWTNSSGTLGERVSIEAFLEEVKERLKCELQGKTFKKGSAVRGSGIETPYSEKPSHSRQI 396
HWTNSSGTLGERVSIE FLEEVKERL+CELQGKT KKGSA RGSGIETPYSEK SHSRQI
Sbjct: 328 HWTNSSGTLGERVSIEDFLEEVKERLRCELQGKTTKKGSAARGSGIETPYSEKSSHSRQI 387
Query: 397 ARNIATQVRDSVTRDVEMNLLRSESTRSYKSEIQFNGLGSPEFIHKDTRRFLSERLRNVQ 456
A+NIATQVRDSVTRD+ NLLRSESTRSY S +QFNGLGSPEF++KDTRRFLS RLRNV+
Sbjct: 388 AQNIATQVRDSVTRDIGFNLLRSESTRSYNSGVQFNGLGSPEFMNKDTRRFLSGRLRNVR 447
Query: 457 RKDSDLDSGSSRSSVYDHERATKQVETTSTSGKHTNYWELLRDEEETQTRSFRHEADENE 516
RKDSDLDSGSSRSS DHER +KQVET T+GKHTNYWE+LRD EE Q+RSFRHEAD E
Sbjct: 448 RKDSDLDSGSSRSSASDHERVSKQVETILTNGKHTNYWEVLRDAEEIQSRSFRHEAD--E 507
Query: 517 VLPKELSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDHAAVNIKKQKKE 576
VLPKELSPRNL+RSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDH AVN+KKQKKE
Sbjct: 508 VLPKELSPRNLSRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDHVAVNLKKQKKE 567
Query: 577 RFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHTSDLYSTKDILSGPTVVMNSGERHERE 636
RFNFKEKVSNFRYNFTLRGKLFGRKTQSISGL T+DLYSTKDILSGPTVVMNSGERHERE
Sbjct: 568 RFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLDTADLYSTKDILSGPTVVMNSGERHERE 627
Query: 637 NFTEVPPSPASVCSSVQEEFWKLTDHHSPISTSDVTPRDENCVSQVFREISSNLKELRRQ 696
NFTEVPPSPASVCSS QEEFWKL+DHHSPISTSDVTPRDENCVSQVFREISSNLKELRRQ
Sbjct: 628 NFTEVPPSPASVCSSAQEEFWKLSDHHSPISTSDVTPRDENCVSQVFREISSNLKELRRQ 687
Query: 697 LNQLESDDFEDKVVQQQPVESEITKLEDPAEAYIRDLLIVSGLYDGSTDNNFSRNNTAAK 756
L+QL+SDD EDK V+QQPVE EITKLEDPAE YIRDLLIVSG+YDGSTD+NFSRNN A K
Sbjct: 688 LSQLDSDDIEDK-VEQQPVEFEITKLEDPAEVYIRDLLIVSGMYDGSTDHNFSRNNAATK 747
Query: 757 PINNAIFEEVEEAYRKSETKNEIIEKEPNENSVDHKLLFDLLNEALPIVLAPRLTMSRFR 816
PI+NAIF+EVEEAYRKSETKNEII KE NE++VDHKLLFDLLNEALPIVL P LT SRFR
Sbjct: 748 PISNAIFDEVEEAYRKSETKNEIIGKEQNESNVDHKLLFDLLNEALPIVLGPCLTTSRFR 807
Query: 817 RNITNSSMP-PPLFGKRLLDSVWDIILQFTHPPTDRSYYLLDGVMARDLNSTPWSSLMDD 876
+ +SS P PPLFGK LLDSVWDII +F HPPTDRSYYLL+GVMARDLNSTPW+SLMD
Sbjct: 808 TKVIDSSTPLPPLFGKNLLDSVWDIIRKFIHPPTDRSYYLLEGVMARDLNSTPWASLMDV 867
Query: 877 EINTTGREVEGLIIKDLFEEVVKDLRK 902
EIN TGREVEGLIIKDL +EVVKDLRK
Sbjct: 868 EINMTGREVEGLIIKDLIDEVVKDLRK 888
BLAST of Sgr029742 vs. ExPASy TrEMBL
Match:
A0A6J1CW53 (uncharacterized protein LOC111014768 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111014768 PE=4 SV=1)
HSP 1 Score: 1442.9 bits (3734), Expect = 0.0e+00
Identity = 756/869 (87.00%), Postives = 796/869 (91.60%), Query Frame = 0
Query: 37 GLETPRNSLELQIESSHNYCAAEEIPYFYQIDEVFSDKDYFKNEASMKKLIDKEMSTRIN 96
GLETPRNSLEL +ESS NYCAA+EI Y YQIDEVF DKDYFKNE+SMKKLIDKEMSTR N
Sbjct: 28 GLETPRNSLELHLESSQNYCAAKEISYSYQIDEVFCDKDYFKNESSMKKLIDKEMSTRTN 87
Query: 97 ARHNGPSIVARLMGMEMLPLDAKDEVQLRDKRRNSKGVKTLNKESTGRGLHSQASSKSNS 156
RHNGPSIVARLMGM+MLPLDAKDEV+L DKR NSKGVKTLNKESTGRGL S SSKSN
Sbjct: 88 PRHNGPSIVARLMGMDMLPLDAKDEVELSDKRHNSKGVKTLNKESTGRGLPSHVSSKSNY 147
Query: 157 SKQMDLHLSYHDNDTDADRWSSSQKMGKPRRREHPQEEELQKFKKEFEAWQAARFRECSR 216
SKQMDLH SYHDND DAD+WSSSQKMGKP RREHPQEEELQKFKKEFEAWQA+RFR CSR
Sbjct: 148 SKQMDLHSSYHDNDQDADQWSSSQKMGKPCRREHPQEEELQKFKKEFEAWQASRFRHCSR 207
Query: 217 VIEVSSINRQSLAQEGLAKETMALSANTRKISSQKLSAEPKGSTVEIKSYRSVGVDDGTR 276
VIEVSSINR+S+AQ E MAL+ NT KISSQKL AE +G VE+KS RSVG+DDGT+
Sbjct: 208 VIEVSSINRRSMAQ-----EEMALNGNTGKISSQKLPAESEG-PVEMKSRRSVGLDDGTK 267
Query: 277 GETFPAE--QRGSFSLRSKFMDADFEHPCLISCDQKTDKSRGPTKIVILKPGPDKMCLHE 336
ETF AE QRGSFSLRSK MDADFEHPCLISCD+KTDK GPTKIVILKPGPDKMCLHE
Sbjct: 268 RETFRAEQTQRGSFSLRSKSMDADFEHPCLISCDRKTDKLLGPTKIVILKPGPDKMCLHE 327
Query: 337 EHWTNSSGTLGERVSIEAFLEEVKERLKCELQGKTFKKGSAVRGSGIETPYSEKPSHSRQ 396
EHWTNSSGTLGERVSIE FLEEVKERL+CELQGKTFKKG+A RGSGIETPYSEKPSHSRQ
Sbjct: 328 EHWTNSSGTLGERVSIEDFLEEVKERLRCELQGKTFKKGTAARGSGIETPYSEKPSHSRQ 387
Query: 397 IARNIATQVRDSVTRDVEMNLLRSESTRSYKSEIQFNGLGSPEFIHKDTRRFLSERLR-N 456
IARNIATQVRDS+TRD ++LLRSESTRS KSEIQFN L SPEF++KDTRRFLSER+R N
Sbjct: 388 IARNIATQVRDSITRDTGISLLRSESTRSCKSEIQFNALDSPEFLNKDTRRFLSERMRNN 447
Query: 457 VQRKDSDLDSGSSRSSVYDHERATKQVETTSTSGKHTNYWELLRDEEETQTRSFRHEADE 516
VQ KDSDLDSGSSRSSVYD ER TKQVETT TS KHTNYWE+LRD EE QTRSFRHEAD
Sbjct: 448 VQSKDSDLDSGSSRSSVYDQERVTKQVETTLTSEKHTNYWEILRDSEEMQTRSFRHEADV 507
Query: 517 NEVLPKELSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDHAAVNIKKQK 576
NEVLPKELSPRNLTRS+SAPV+GTSFGKLLLEDRHILTGVHIQRKHEASDH A NIKKQK
Sbjct: 508 NEVLPKELSPRNLTRSVSAPVAGTSFGKLLLEDRHILTGVHIQRKHEASDHVA-NIKKQK 567
Query: 577 KERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHTSDLYSTKDILSGPTVVMNSGERHE 636
KERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHT+DLYST+DILSGPTVVMNSGERHE
Sbjct: 568 KERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHTTDLYSTRDILSGPTVVMNSGERHE 627
Query: 637 RENFTEVPPSPASVCSSVQEEFWKLTDHHSPISTSDVTPRDENCVSQVFREISSNLKELR 696
RENFTEVPPSPASVCSSVQEEFWK +DHHSPISTSDVTPRDENCVSQVFR+ISSNLKELR
Sbjct: 628 RENFTEVPPSPASVCSSVQEEFWKFSDHHSPISTSDVTPRDENCVSQVFRDISSNLKELR 687
Query: 697 RQLNQLESDDFEDKVVQQQPVESEITKLEDPAEAYIRDLLIVSGLYDGSTDNNFSRNNTA 756
RQLNQLESDDFEDK V+QQPVESEITKLEDPAEAY+RDLLIVSG+YDGST NNFSRNNTA
Sbjct: 688 RQLNQLESDDFEDK-VEQQPVESEITKLEDPAEAYVRDLLIVSGMYDGSTGNNFSRNNTA 747
Query: 757 AKPINNAIFEEVEEAYRKSETKNEIIEKEPNENSVDHKLLFDLLNEALPIVLAPRLTMSR 816
AKPI+NAIFEEVEEAYRKSE KNE IEKE NE SVDHKLLFDLLNEALP+ LAP LTMSR
Sbjct: 748 AKPISNAIFEEVEEAYRKSERKNETIEKEQNEYSVDHKLLFDLLNEALPLALAPCLTMSR 807
Query: 817 FRRNITNSSM-PPPLFGKRLLDSVWDIILQFTHPPTDRSYYLLDGVMARDLNSTPWSSLM 876
FR + NSS PPPLFGK+LLDSVWDII +FTHPPTDRSYYLLDGVMARDLNSTPWSSLM
Sbjct: 808 FRTKVINSSTPPPPLFGKKLLDSVWDIIHKFTHPPTDRSYYLLDGVMARDLNSTPWSSLM 867
Query: 877 DDEINTTGREVEGLIIKDLFEEVVKDLRK 902
DDE+NTTGREVEGLII DL EE+VKD RK
Sbjct: 868 DDEVNTTGREVEGLIINDLVEEIVKDFRK 888
BLAST of Sgr029742 vs. ExPASy TrEMBL
Match:
A0A1S3BKM8 (uncharacterized protein LOC103490651 OS=Cucumis melo OX=3656 GN=LOC103490651 PE=4 SV=1)
HSP 1 Score: 1423.3 bits (3683), Expect = 0.0e+00
Identity = 733/868 (84.45%), Postives = 793/868 (91.36%), Query Frame = 0
Query: 37 GLETPRNSLELQIESSHNYCAAEEIPYFYQIDEVFSDKDYFKNEASMKKLIDKEMSTRIN 96
GLETPRNSLELQ+ESS NYCA EEIPY YQIDEVFSDKDY KNEASMKKLID+E+STR N
Sbjct: 28 GLETPRNSLELQMESSQNYCAVEEIPYSYQIDEVFSDKDYLKNEASMKKLIDREISTRTN 87
Query: 97 ARHNGPSIVARLMGMEMLPLDAKDEVQLRDKRRNSKGVKTLNKESTGRGLHSQASSKSNS 156
+HNGPSIVARLMGM+MLPLDAKD V+L DKR NSKGVKT NKES GRGLH ASSKSN
Sbjct: 88 VKHNGPSIVARLMGMDMLPLDAKDVVELSDKRHNSKGVKTSNKESNGRGLHFLASSKSNH 147
Query: 157 SKQMDLHLSYHDNDTDADR--WSSSQKMGKPRRREHPQEEELQKFKKEFEAWQAARFREC 216
SKQMDLH SYHDND DADR WSS QKMGK RREHPQEEELQKFKKEFEAWQAARFREC
Sbjct: 148 SKQMDLHSSYHDNDKDADRDDWSSDQKMGKSHRREHPQEEELQKFKKEFEAWQAARFREC 207
Query: 217 SRVIEVSSINRQSLAQEGLAKETMALSANTRKISSQKLSAEPKGSTVEIKSYRSVGVDDG 276
SRVIEVSSINR+SL QE LAKE + ++ANTR+ SSQK+SAEPKGSTVE+KSYRS+G+DD
Sbjct: 208 SRVIEVSSINRRSLKQEDLAKEKITINANTRRTSSQKVSAEPKGSTVEMKSYRSIGLDDC 267
Query: 277 TRGETFPAEQRGSFSLRSKFMDADFEHPCLISCDQKTDKSRGPTKIVILKPGPDKMCLHE 336
+ ETFPAEQRG+FSLRSK MDADFEHPCLIS DQK DKS GPTKIVILKPGPDKMC+HE
Sbjct: 268 VKRETFPAEQRGTFSLRSKSMDADFEHPCLISYDQK-DKSHGPTKIVILKPGPDKMCVHE 327
Query: 337 EHWTNSSGTLGERVSIEAFLEEVKERLKCELQGKTFKKGSAVRGSGIETPYSEKPSHSRQ 396
EHW NSSG LGERVSIE FL+EVKERL+CELQGKTFKKG VRGSGIETPYSE+PSH RQ
Sbjct: 328 EHWKNSSGNLGERVSIEDFLDEVKERLRCELQGKTFKKGYTVRGSGIETPYSERPSHRRQ 387
Query: 397 IARNIATQVRDSVTRDVEMNLLRSESTRSYKSEIQFNGLGSPEFIHKDTRRFLSERLRNV 456
IA+NIATQVRDSVTRD+ +NLLRSESTRSY SE+QF GL SPEF++KDTRR LSERLRNV
Sbjct: 388 IAQNIATQVRDSVTRDIGINLLRSESTRSYNSEVQFIGLDSPEFVNKDTRRLLSERLRNV 447
Query: 457 QRKDSDLDSGSSRSSVYDHERATKQVETTSTSGKHTNYWELLRDEEETQTRSFRHEADEN 516
+ KD DLDSGSSRSSV DHER QVETT T+GKHT+YWE+LRD EE QTRSFRHEA++N
Sbjct: 448 RSKDPDLDSGSSRSSVCDHERVMNQVETTLTNGKHTDYWEVLRDAEEIQTRSFRHEANQN 507
Query: 517 EVLPKELSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDHAAVNIKKQKK 576
EVLPKELSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEA DH A++ KKQKK
Sbjct: 508 EVLPKELSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEAGDHVAMSSKKQKK 567
Query: 577 ERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHTSDLYSTKDILSGPTVVMNSGERHER 636
ERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLH+++LYS+KDILSGPTVVMNSGERHER
Sbjct: 568 ERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHSANLYSSKDILSGPTVVMNSGERHER 627
Query: 637 ENFTEVPPSPASVCSSVQEEFWKLTDHHSPISTSDVTPRDENCVSQVFREISSNLKELRR 696
ENFTEVPPSPASVCSSVQEEFWKL+DH SPISTSDVTPR+E CVSQVFREISSNLKELRR
Sbjct: 628 ENFTEVPPSPASVCSSVQEEFWKLSDHQSPISTSDVTPREEKCVSQVFREISSNLKELRR 687
Query: 697 QLNQLESDDFEDKVVQQQPVESEITKLEDPAEAYIRDLLIVSGLYDGSTDNNFSRNNTAA 756
QLNQL+SDD EDK V+QQPVESEITKLEDPAEAYIRDLLIVSG+YDGSTDNNF+RNN A
Sbjct: 688 QLNQLDSDDIEDK-VEQQPVESEITKLEDPAEAYIRDLLIVSGMYDGSTDNNFTRNNAAT 747
Query: 757 KPINNAIFEEVEEAYRKSETKNEIIEKEPNENSVDHKLLFDLLNEALPIVLAPRLTMSRF 816
KPI++AIFEEVEEAYRKSETKNEII KE +ENSVDHK+LFDLLNEALPIVLAP LT+S+F
Sbjct: 748 KPISDAIFEEVEEAYRKSETKNEIIGKEQSENSVDHKMLFDLLNEALPIVLAPCLTLSKF 807
Query: 817 RRNITNSSMPP-PLFGKRLLDSVWDIILQFTHPPTDRSYYLLDGVMARDLNSTPWSSLMD 876
+R + NSSMPP PLFGK+LLD VWD+I +F HP TDRSYYLLDGVMARDLNSTPWSSL+D
Sbjct: 808 KRKVINSSMPPRPLFGKKLLDPVWDVIRKFIHPSTDRSYYLLDGVMARDLNSTPWSSLVD 867
Query: 877 DEINTTGREVEGLIIKDLFEEVVKDLRK 902
DE+NTTGREVE LI+KDL EE+VKDL K
Sbjct: 868 DEVNTTGREVEALIMKDLVEEIVKDLLK 893
BLAST of Sgr029742 vs. ExPASy TrEMBL
Match:
A0A0A0L638 (DUF4378 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G002320 PE=4 SV=1)
HSP 1 Score: 1411.4 bits (3652), Expect = 0.0e+00
Identity = 728/868 (83.87%), Postives = 788/868 (90.78%), Query Frame = 0
Query: 37 GLETPRNSLELQIESSHNYCAAEEIPYFYQIDEVFSDKDYFKNEASMKKLIDKEMSTRIN 96
GLETPRNSLELQ+ESS NYCA EEIPY YQIDEVFSDKDY KNEASMKKLID+E+STR N
Sbjct: 28 GLETPRNSLELQMESSQNYCAVEEIPYSYQIDEVFSDKDYLKNEASMKKLIDREISTRTN 87
Query: 97 ARHNGPSIVARLMGMEMLPLDAKDEVQLRDKRRNSKGVKTLNKESTGRGLHSQASSKSNS 156
+HNGPSIVARLMGM+MLPLDAKD V+L DKR NSKGVKT NKES GRGLHS ASSKSN
Sbjct: 88 VKHNGPSIVARLMGMDMLPLDAKDVVELSDKRHNSKGVKTSNKESNGRGLHSLASSKSNY 147
Query: 157 SKQMDLHLSYHDNDTDA--DRWSSSQKMGKPRRREHPQEEELQKFKKEFEAWQAARFREC 216
SKQMDLH SYHDND DA DRW SSQKMG R+EHPQEEELQKFKKEFEAWQAARFREC
Sbjct: 148 SKQMDLHSSYHDNDKDADRDRWGSSQKMGVSHRQEHPQEEELQKFKKEFEAWQAARFREC 207
Query: 217 SRVIEVSSINRQSLAQEGLAKETMALSANTRKISSQKLSAEPKGSTVEIKSYRSVGVDDG 276
SRVIEVSSINR+S+AQE LAKE +A++ANTR+ SSQK+SAEPKGSTVE+KSY+S+G+DD
Sbjct: 208 SRVIEVSSINRRSVAQENLAKEKIAINANTRRTSSQKVSAEPKGSTVEMKSYKSIGLDDC 267
Query: 277 TRGETFPAEQRGSFSLRSKFMDADFEHPCLISCDQKTDKSRGPTKIVILKPGPDKMCLHE 336
+ ETFPAEQRG+FSLRSK MDADFEHPCLISCDQK DKS GPTKIVILKPGPDKMC+HE
Sbjct: 268 VKRETFPAEQRGTFSLRSKAMDADFEHPCLISCDQK-DKSHGPTKIVILKPGPDKMCVHE 327
Query: 337 EHWTNSSGTLGERVSIEAFLEEVKERLKCELQGKTFKKGSAVRGSGIETPYSEKPSHSRQ 396
EHW NSSG LGERVSIE FL+EVKERL+CELQGK+FKKG RGSGIETPYSE+PSH RQ
Sbjct: 328 EHWKNSSGNLGERVSIEDFLDEVKERLRCELQGKSFKKGYTARGSGIETPYSERPSHRRQ 387
Query: 397 IARNIATQVRDSVTRDVEMNLLRSESTRSYKSEIQFNGLGSPEFIHKDTRRFLSERLRNV 456
IA+NIATQVRDSVTRD+ +NLLRSESTRSY SE+QF GL SPEF+ KDTRR L+ERLRNV
Sbjct: 388 IAQNIATQVRDSVTRDIGINLLRSESTRSYNSEVQFIGLDSPEFVSKDTRRLLAERLRNV 447
Query: 457 QRKDSDLDSGSSRSSVYDHERATKQVETTSTSGKHTNYWELLRDEEETQTRSFRHEADEN 516
+ KDSDLDSGSSRSSV DHER QVETT T+GKH +YWE+LRD EE QTRSFRHEA++N
Sbjct: 448 RSKDSDLDSGSSRSSVCDHERVMNQVETTLTNGKHRDYWEVLRDAEEIQTRSFRHEANQN 507
Query: 517 EVLPKELSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDHAAVNIKKQKK 576
EVLPKELSP NLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDH A++ KKQKK
Sbjct: 508 EVLPKELSPMNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDHVAMSCKKQKK 567
Query: 577 ERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHTSDLYSTKDILSGPTVVMNSGERHER 636
ERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLH+++LYS+KDILSGPTVVMNSGERHER
Sbjct: 568 ERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHSANLYSSKDILSGPTVVMNSGERHER 627
Query: 637 ENFTEVPPSPASVCSSVQEEFWKLTDHHSPISTSDVTPRDENCVSQVFREISSNLKELRR 696
ENFTEVPPSPASVCSSVQEEFWKL+DHHSPISTSDVTPR+EN VSQVFREISSNLKELRR
Sbjct: 628 ENFTEVPPSPASVCSSVQEEFWKLSDHHSPISTSDVTPREENSVSQVFREISSNLKELRR 687
Query: 697 QLNQLESDDFEDKVVQQQPVESEITKLEDPAEAYIRDLLIVSGLYDGSTDNNFSRNNTAA 756
QLNQL+SDD EDK V+QQPVESEITKLEDPAEAYIRDLLIVSG+YDGSTDNNF+RNN
Sbjct: 688 QLNQLDSDDIEDK-VEQQPVESEITKLEDPAEAYIRDLLIVSGMYDGSTDNNFTRNNADT 747
Query: 757 KPINNAIFEEVEEAYRKSETKNEIIEKEPNENSVDHKLLFDLLNEALPIVLAPRLTMSRF 816
K I+NAIFEEVEEAYRKSE KNEII KE +ENSVDHK+LFDLLNE LPIVLAP LT+S+F
Sbjct: 748 KSISNAIFEEVEEAYRKSEIKNEIIGKEQSENSVDHKMLFDLLNEVLPIVLAPCLTLSKF 807
Query: 817 RRNITNSSMPP-PLFGKRLLDSVWDIILQFTHPPTDRSYYLLDGVMARDLNSTPWSSLMD 876
RR + NSSMPP PL GK+LLD VWD+I +F HP TDRSYYLLDGVMARDLNSTPWSSL D
Sbjct: 808 RRKVINSSMPPRPLLGKKLLDPVWDVIRKFIHPSTDRSYYLLDGVMARDLNSTPWSSLRD 867
Query: 877 DEINTTGREVEGLIIKDLFEEVVKDLRK 902
DEINT GREVE LI+KDL EE+VKDL K
Sbjct: 868 DEINTIGREVEALIMKDLVEEIVKDLLK 893
BLAST of Sgr029742 vs. ExPASy TrEMBL
Match:
A0A6J1F7W2 (uncharacterized protein LOC111442905 OS=Cucurbita moschata OX=3662 GN=LOC111442905 PE=4 SV=1)
HSP 1 Score: 1401.7 bits (3627), Expect = 0.0e+00
Identity = 737/867 (85.01%), Postives = 783/867 (90.31%), Query Frame = 0
Query: 37 GLETPRNSLELQIESSHNYCAAEEIPYFYQIDEVFSDKDYFKNEASMKKLIDKEMSTRIN 96
GLETPRNSLELQ+ESS +YC AEEIPY YQIDEVFSDKDY KNE SMKKLIDKEMSTR +
Sbjct: 28 GLETPRNSLELQMESSQSYCTAEEIPYSYQIDEVFSDKDYLKNETSMKKLIDKEMSTRTS 87
Query: 97 ARHNGPSIVARLMGMEMLPLDAKDEVQLRDKRRNSKGVKTLNKESTGRGLHSQASSKSNS 156
A+H+GPSIVARLMGM+MLPLDAK+EV+L DKR NSKGVKT + E GRGLHS ASSKSNS
Sbjct: 88 AKHHGPSIVARLMGMDMLPLDAKNEVELSDKRHNSKGVKTSSNEINGRGLHSYASSKSNS 147
Query: 157 SKQMDLHLSYHDNDTDADRW-SSSQKMGKPRRREHPQEEELQKFKKEFEAWQAARFRECS 216
KQMD+H SYHDND DADRW S+SQKMG P RREHPQEEELQKFKKEFEAWQAARFRECS
Sbjct: 148 CKQMDVHSSYHDNDKDADRWRSTSQKMGGPHRREHPQEEELQKFKKEFEAWQAARFRECS 207
Query: 217 RVIEVSSINRQSLAQEGLAKETMALSANTRKISSQKLSAEPKGSTVEIKSYRSVGVDDGT 276
RVIE SSINRQSLAQ G AKE M L+ N RKISS KLSAEPKG TV +KSYR V +D G
Sbjct: 208 RVIEASSINRQSLAQ-GDAKE-MELNVNRRKISSPKLSAEPKGPTVGMKSYRRVDLDGGI 267
Query: 277 RGETFPAEQRGSFSLRSKFMDADFEHPCLISCDQKTDKSRGPTKIVILKPGPDKMCLHEE 336
+ ETFP EQRG FSLRSK MDADFEHPCLIS DQK DK GPTKIVILKPGPDKMCLHEE
Sbjct: 268 KRETFPGEQRGPFSLRSKSMDADFEHPCLISSDQK-DKLLGPTKIVILKPGPDKMCLHEE 327
Query: 337 HWTNSSGTLGERVSIEAFLEEVKERLKCELQGKTFKKGSAVRGSGIETPYSEKPSHSRQI 396
HWTNSSGTLGERVSIE FLEEVKERL+CELQGKT KKGSA RGSGIETPYSEK SHSRQI
Sbjct: 328 HWTNSSGTLGERVSIEDFLEEVKERLRCELQGKTTKKGSAARGSGIETPYSEKSSHSRQI 387
Query: 397 ARNIATQVRDSVTRDVEMNLLRSESTRSYKSEIQFNGLGSPEFIHKDTRRFLSERLRNVQ 456
A+NIATQVRDSVTRD+ NLLRSESTRSY S +QFNGLGSPEF++KDTRRFLS RLRNV+
Sbjct: 388 AQNIATQVRDSVTRDIGFNLLRSESTRSYNSGVQFNGLGSPEFMNKDTRRFLSGRLRNVR 447
Query: 457 RKDSDLDSGSSRSSVYDHERATKQVETTSTSGKHTNYWELLRDEEETQTRSFRHEADENE 516
RKDSDLDSGSSRSS DHER TKQVET T+GKHTNYWE+LRD EE +RSFRHEAD E
Sbjct: 448 RKDSDLDSGSSRSSASDHERVTKQVETILTNGKHTNYWEVLRDAEEIHSRSFRHEAD--E 507
Query: 517 VLPKELSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDHAAVNIKKQKKE 576
VLPKELSPRNL+RSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDH AVN+KKQKKE
Sbjct: 508 VLPKELSPRNLSRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDHVAVNLKKQKKE 567
Query: 577 RFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHTSDLYSTKDILSGPTVVMNSGERHERE 636
RFNFKEKVSNFRYNFTLRG+LFGRKTQSISGL T+DLYSTKDILSGPTVVMNSGERHERE
Sbjct: 568 RFNFKEKVSNFRYNFTLRGRLFGRKTQSISGLDTADLYSTKDILSGPTVVMNSGERHERE 627
Query: 637 NFTEVPPSPASVCSSVQEEFWKLTDHHSPISTSDVTPRDENCVSQVFREISSNLKELRRQ 696
NFTEVPPSPASVCSS QEEFWKL+DHHSPISTSDVTPRDENCVSQVFREISSNLKELRRQ
Sbjct: 628 NFTEVPPSPASVCSSAQEEFWKLSDHHSPISTSDVTPRDENCVSQVFREISSNLKELRRQ 687
Query: 697 LNQLESDDFEDKVVQQQPVESEITKLEDPAEAYIRDLLIVSGLYDGSTDNNFSRNNTAAK 756
L+QL+SDD EDK V+QQPVE EITKLEDPAE YIRDLLIVSG+YDGSTDNNFSRNN A K
Sbjct: 688 LSQLDSDDIEDK-VEQQPVEFEITKLEDPAEVYIRDLLIVSGMYDGSTDNNFSRNNAATK 747
Query: 757 PINNAIFEEVEEAYRKSETKNEIIEKEPNENSVDHKLLFDLLNEALPIVLAPRLTMSRFR 816
I+NAIF+EVEEAYRKSETKNEII KE NE++VDHKLLFDLLNEALPIVL P LT SRFR
Sbjct: 748 AISNAIFDEVEEAYRKSETKNEIIGKEQNESNVDHKLLFDLLNEALPIVLGPCLTTSRFR 807
Query: 817 RNITNSSMP-PPLFGKRLLDSVWDIILQFTHPPTDRSYYLLDGVMARDLNSTPWSSLMDD 876
+ +SS P PPLFGK+LLDSVWDII +F HPPTDRSY+LL+GVMARDLNSTPW+SLMD
Sbjct: 808 TKVIDSSTPLPPLFGKKLLDSVWDIIRKFIHPPTDRSYFLLEGVMARDLNSTPWASLMDV 867
Query: 877 EINTTGREVEGLIIKDLFEEVVKDLRK 902
EINTTGREVEGLIIKDL +EVVKDLRK
Sbjct: 868 EINTTGREVEGLIIKDLIDEVVKDLRK 888
BLAST of Sgr029742 vs. ExPASy TrEMBL
Match:
A0A6J1I298 (uncharacterized protein LOC111470236 OS=Cucurbita maxima OX=3661 GN=LOC111470236 PE=4 SV=1)
HSP 1 Score: 1393.6 bits (3606), Expect = 0.0e+00
Identity = 732/867 (84.43%), Postives = 784/867 (90.43%), Query Frame = 0
Query: 37 GLETPRNSLELQIESSHNYCAAEEIPYFYQIDEVFSDKDYFKNEASMKKLIDKEMSTRIN 96
GLETPRNSLELQ+ESS +YC AEEIPY YQIDEVFSDKDY KNE SMKKLIDKEMS+R +
Sbjct: 28 GLETPRNSLELQMESSQSYCTAEEIPYSYQIDEVFSDKDYLKNETSMKKLIDKEMSSRTS 87
Query: 97 ARHNGPSIVARLMGMEMLPLDAKDEVQLRDKRRNSKGVKTLNKESTGRGLHSQASSKSNS 156
A+H+GPSIVARLMGM+MLPLDAKDEV+L DKR NSKGVKT +KE GRGLHS ASSKSNS
Sbjct: 88 AKHHGPSIVARLMGMDMLPLDAKDEVELSDKRHNSKGVKTSSKEINGRGLHSDASSKSNS 147
Query: 157 SKQMDLHLSYHDNDTDADRW-SSSQKMGKPRRREHPQEEELQKFKKEFEAWQAARFRECS 216
K+MD+H SYHDND DADRW S+SQKMG+P RREHPQEEELQKFKKEFEAWQAARFRECS
Sbjct: 148 YKKMDVHSSYHDNDKDADRWRSTSQKMGRPHRREHPQEEELQKFKKEFEAWQAARFRECS 207
Query: 217 RVIEVSSINRQSLAQEGLAKETMALSANTRKISSQKLSAEPKGSTVEIKSYRSVGVDDGT 276
RVIE SSINRQSLAQ+ A+E M L+ NTRKISS KLSAE K TV +KSYR V +D G
Sbjct: 208 RVIETSSINRQSLAQDD-ARE-MELNVNTRKISSPKLSAELKYPTVGMKSYRRVDLDGGI 267
Query: 277 RGETFPAEQRGSFSLRSKFMDADFEHPCLISCDQKTDKSRGPTKIVILKPGPDKMCLHEE 336
+ ETFP EQRG FSLRS+ MDADFEHPCLIS DQK DK GPTKIVILKPGPDKMCLHEE
Sbjct: 268 KRETFPGEQRGPFSLRSESMDADFEHPCLISSDQK-DKLLGPTKIVILKPGPDKMCLHEE 327
Query: 337 HWTNSSGTLGERVSIEAFLEEVKERLKCELQGKTFKKGSAVRGSGIETPYSEKPSHSRQI 396
HWTNSSGTLGERVSIE FLEEVKERL+CELQGKT KKGSA RGSGIETPYSEK SHSRQI
Sbjct: 328 HWTNSSGTLGERVSIEDFLEEVKERLRCELQGKTTKKGSAARGSGIETPYSEKSSHSRQI 387
Query: 397 ARNIATQVRDSVTRDVEMNLLRSESTRSYKSEIQFNGLGSPEFIHKDTRRFLSERLRNVQ 456
A+NIATQVRDSVTRD+ NLLRSESTRSY S +QFNGLGSPEF++KDTRRFLS RLRNV+
Sbjct: 388 AQNIATQVRDSVTRDIGFNLLRSESTRSYNSGVQFNGLGSPEFMNKDTRRFLSGRLRNVR 447
Query: 457 RKDSDLDSGSSRSSVYDHERATKQVETTSTSGKHTNYWELLRDEEETQTRSFRHEADENE 516
RKDSDLDSGSSRSS DHER +KQVET T+GKHTNYWE+LRD EE +RSFRHEAD E
Sbjct: 448 RKDSDLDSGSSRSSASDHERVSKQVETILTNGKHTNYWEVLRDAEEIHSRSFRHEAD--E 507
Query: 517 VLPKELSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDHAAVNIKKQKKE 576
VLPKELSPRNL+RSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDH AVN+KKQKKE
Sbjct: 508 VLPKELSPRNLSRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEASDHVAVNLKKQKKE 567
Query: 577 RFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHTSDLYSTKDILSGPTVVMNSGERHERE 636
RFNFKEKVSNFRYNFTLRGKLFGRKTQSISGL T+DLYSTKDILSGPTVVMNSGERHERE
Sbjct: 568 RFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLDTADLYSTKDILSGPTVVMNSGERHERE 627
Query: 637 NFTEVPPSPASVCSSVQEEFWKLTDHHSPISTSDVTPRDENCVSQVFREISSNLKELRRQ 696
NFTEVPPSPASVCSS QEEFWKL+DHHSPISTSDVTPRDENCVSQVFREISSNLKELRRQ
Sbjct: 628 NFTEVPPSPASVCSSGQEEFWKLSDHHSPISTSDVTPRDENCVSQVFREISSNLKELRRQ 687
Query: 697 LNQLESDDFEDKVVQQQPVESEITKLEDPAEAYIRDLLIVSGLYDGSTDNNFSRNNTAAK 756
L+QL+SDD ED+ V+QQPVE EITKLEDPAE YIRDLLIVSG+YDGSTD+NFSRNN A K
Sbjct: 688 LSQLDSDDIEDR-VEQQPVEFEITKLEDPAEVYIRDLLIVSGMYDGSTDHNFSRNNAATK 747
Query: 757 PINNAIFEEVEEAYRKSETKNEIIEKEPNENSVDHKLLFDLLNEALPIVLAPRLTMSRFR 816
PI+NAIF+EVEEAYRKSETKNEII KE NE++VDHKLLFDLLNEALPIVL P LT SRFR
Sbjct: 748 PISNAIFDEVEEAYRKSETKNEIIGKEQNESNVDHKLLFDLLNEALPIVLGPCLTTSRFR 807
Query: 817 RNITNSSMP-PPLFGKRLLDSVWDIILQFTHPPTDRSYYLLDGVMARDLNSTPWSSLMDD 876
+ +SS P PPLFGK+L DSVWDII +F HPPTDRSYYLL+GVMARDLNSTPW+SLMD
Sbjct: 808 TKVIDSSTPLPPLFGKKLWDSVWDIIRKFIHPPTDRSYYLLEGVMARDLNSTPWTSLMDV 867
Query: 877 EINTTGREVEGLIIKDLFEEVVKDLRK 902
EINTTGREVEGLIIKDL +EVVKDLRK
Sbjct: 868 EINTTGREVEGLIIKDLIDEVVKDLRK 888
BLAST of Sgr029742 vs. TAIR 10
Match:
AT2G17550.1 (unknown protein; Has 264 Blast hits to 258 proteins in 65 species: Archae - 5; Bacteria - 5; Metazoa - 66; Fungi - 16; Plants - 107; Viruses - 0; Other Eukaryotes - 65 (source: NCBI BLink). )
HSP 1 Score: 478.0 bits (1229), Expect = 1.7e-134
Identity = 350/888 (39.41%), Postives = 491/888 (55.29%), Query Frame = 0
Query: 38 LETPRNSLELQIESSHNYCAAEEIPYFYQIDEVFSDKDYFKNEASMKKLIDKEMSTRINA 97
LE PRNS ELQ+++ H Y ++ P +E + ++ + E SMKK I +E+S R N
Sbjct: 30 LEAPRNSFELQVDNFHTYHNGKDKPSNGFEEEEWYERSCYPIEESMKKKIIEELSKRSND 89
Query: 98 RHNGPSIVARLMGMEMLPLDAKDEVQLRDKRRNSKGVKTLNKESTGRGLHSQASSKSNSS 157
+ N PS+VA+LMGM+ LPL++ R SK V + E GR S+ S++
Sbjct: 90 KQNTPSLVAKLMGMDALPLESVKSSSAWIYPRQSK-VNRFDDEKGGR--RSRKGRLSSAV 149
Query: 158 KQMDLHLSYHDNDTDADRWSSSQKMGKPRRREHPQEEELQKFKKEFEAWQA-ARFRECSR 217
+D+ M P RREHPQEEELQ+F++EFEAWQA RF++CSR
Sbjct: 150 TALDV-------------------METPMRREHPQEEELQRFRREFEAWQADKRFKDCSR 209
Query: 218 VIEVSSINRQSLAQEGLAKETMALSANTRKISSQKLSAEPKGSTVEIKSYRSVGVDDGTR 277
+++ + + +E L T RS G D
Sbjct: 210 IVDSGCVVARDENKERLFTRT-----------------------------RSFGRD---- 269
Query: 278 GETFPAEQRGSFSLRSKFMDADFEHPCLISCDQKTDKSRGPTKIVILKPGPDKMCLHEEH 337
F+L K+D++ PT+IV+L+PG + +E+
Sbjct: 270 -----------FTL-------------------KSDRT-APTRIVVLRPGLQRAYDYEDS 329
Query: 338 WTNSSGTLGE---RVSIEAFLEEVKERLKCELQGK-TFKKGSAVRGSGIETPYSEKPSHS 397
T SSGT E SIE FLEEVKERLK ELQGK K+ S+VRGSGIETP+SE+PS
Sbjct: 330 LTTSSGTTMEGSRGSSIEEFLEEVKERLKGELQGKAALKRSSSVRGSGIETPFSERPSP- 389
Query: 398 RQIARNIATQVRDSVTRDVEMNLLRSESTRSYK-SEIQFNGLGSP-EFIHKDTRRFLSER 457
RSES RSY SE+Q N SP EFI +DTR+ L+ER
Sbjct: 390 ------------------------RSESMRSYAVSEVQCNAPDSPTEFISRDTRKLLAER 449
Query: 458 LRNVQRKDSDLDSGSSRSSVYDHERATKQVETTSTSGKHTNYWELLRDEEETQTRSFRHE 517
L+NV RK+ S S +++ T S + K E
Sbjct: 450 LKNVLRKEMTPSHDSVTKS------SSRLRPTVSDAAKQA------------------EE 509
Query: 518 ADENEVLPKE-LSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGVHIQRKHEAS------- 577
++ +V KE LSPRNL RSLSAPVSGTSFGKLLLEDRH+LTG I RKHEA+
Sbjct: 510 INQEDVSKKESLSPRNLKRSLSAPVSGTSFGKLLLEDRHVLTGAQIMRKHEATITEREET 569
Query: 578 ---DHAAVNIKKQKKERFNFKEKVSNFRYNFTLRGKLFGRKTQSISGLHTSDLYSTKDIL 637
V ++KERFN ++KVS+FR TLRG++FG+K +S+ ++ + S KD +
Sbjct: 570 ESETEPVVVDPIRRKERFNLRKKVSSFR--STLRGRIFGKKIRSMIESNSFEDESIKDFV 629
Query: 638 SGPTVVMNSGERHERENFTEVPPSPASVCSSVQEEFWKLTDHHSPISTSDVTPRDENCVS 697
+G + N +R+ EN TEVPPSPASVCSS EEFW+ D+ S +ST DVT DEN +
Sbjct: 630 TG-SKFNNFYDRN--ENSTEVPPSPASVCSSTPEEFWRNVDYLSQVSTPDVTVSDENGMP 689
Query: 698 QVFREISSNLKELRRQLNQLESDDFEDKVVQQQPVE--SEITKLEDPAEAYIRDLLIVSG 757
QVFR+ISSNL ELRRQ+N+LES+ V+++P++ I L +P + ++RDLL+ SG
Sbjct: 690 QVFRDISSNLSELRRQINELESEVQVRTPVEEEPIQEIETIVDLGNPDKVFVRDLLVASG 749
Query: 758 LYDGSTDNNFSRNNTAAKPINNAIFEEVEEAYRKSETKNEIIEK--EPNENSVDHKLLFD 817
LY+G++D + SR + AK I ++ EE +E +K +N+ + E + +H +LFD
Sbjct: 750 LYEGTSDISLSRWDPLAKLIKKSVLEETKENLKKRSNQNQEDDDTGETTISEENHNILFD 776
Query: 818 LLNEALPIVLAPRLTMSRFRRNITNSSM--PPPLFGKRLLDSVWDIILQFTHPPTDRSYY 877
LLNE L +VL P LT S F+ + +SS+ + GK LL+S W I+ ++ + +R +
Sbjct: 810 LLNEVLTVVLGP-LTKSGFKNKLLSSSVSESTTIRGKYLLESTWKIMSEYLYSQPERPFC 776
Query: 878 LLDGVMARDLNSTPWSSLMDDEINTTGREVEGLIIKDLFEEVVKDLRK 902
LDG++ D++ PWS+L+ +E+N G+EVEG+I+ DL EE+VKDLR+
Sbjct: 870 SLDGIIGWDMDRFPWSALIGEEVNVLGKEVEGMIMADLVEELVKDLRR 776
BLAST of Sgr029742 vs. TAIR 10
Match:
AT2G17550.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 215 Blast hits to 205 proteins in 55 species: Archae - 5; Bacteria - 0; Metazoa - 50; Fungi - 10; Plants - 99; Viruses - 0; Other Eukaryotes - 51 (source: NCBI BLink). )
HSP 1 Score: 453.8 bits (1166), Expect = 3.3e-127
Identity = 335/843 (39.74%), Postives = 466/843 (55.28%), Query Frame = 0
Query: 83 MKKLIDKEMSTRINARHNGPSIVARLMGMEMLPLDAKDEVQLRDKRRNSKGVKTLNKEST 142
MKK I +E+S R N + N PS+VA+LMGM+ LPL++ R SK V + E
Sbjct: 1 MKKKIIEELSKRSNDKQNTPSLVAKLMGMDALPLESVKSSSAWIYPRQSK-VNRFDDEKG 60
Query: 143 GRGLHSQASSKSNSSKQMDLHLSYHDNDTDADRWSSSQKMGKPRRREHPQEEELQKFKKE 202
GR S+ S++ +D+ M P RREHPQEEELQ+F++E
Sbjct: 61 GR--RSRKGRLSSAVTALDV-------------------METPMRREHPQEEELQRFRRE 120
Query: 203 FEAWQA-ARFRECSRVIEVSSINRQSLAQEGLAKETMALSANTRKISSQKLSAEPKGSTV 262
FEAWQA RF++CSR+++ + + +E L T
Sbjct: 121 FEAWQADKRFKDCSRIVDSGCVVARDENKERLFTRT------------------------ 180
Query: 263 EIKSYRSVGVDDGTRGETFPAEQRGSFSLRSKFMDADFEHPCLISCDQKTDKSRGPTKIV 322
RS G D F+L K+D++ PT+IV
Sbjct: 181 -----RSFGRD---------------FTL-------------------KSDRT-APTRIV 240
Query: 323 ILKPGPDKMCLHEEHWTNSSGTLGE---RVSIEAFLEEVKERLKCELQGK-TFKKGSAVR 382
+L+PG + +E+ T SSGT E SIE FLEEVKERLK ELQGK K+ S+VR
Sbjct: 241 VLRPGLQRAYDYEDSLTTSSGTTMEGSRGSSIEEFLEEVKERLKGELQGKAALKRSSSVR 300
Query: 383 GSGIETPYSEKPSHSRQIARNIATQVRDSVTRDVEMNLLRSESTRSYK-SEIQFNGLGSP 442
GSGIETP+SE+PS RSES RSY SE+Q N SP
Sbjct: 301 GSGIETPFSERPSP-------------------------RSESMRSYAVSEVQCNAPDSP 360
Query: 443 -EFIHKDTRRFLSERLRNVQRKDSDLDSGSSRSSVYDHERATKQVETTSTSGKHTNYWEL 502
EFI +DTR+ L+ERL+NV RK+ S S +++ T S + K
Sbjct: 361 TEFISRDTRKLLAERLKNVLRKEMTPSHDSVTKS------SSRLRPTVSDAAKQA----- 420
Query: 503 LRDEEETQTRSFRHEADENEVLPKE-LSPRNLTRSLSAPVSGTSFGKLLLEDRHILTGVH 562
E ++ +V KE LSPRNL RSLSAPVSGTSFGKLLLEDRH+LTG
Sbjct: 421 -------------EEINQEDVSKKESLSPRNLKRSLSAPVSGTSFGKLLLEDRHVLTGAQ 480
Query: 563 IQRKHEAS----------DHAAVNIKKQKKERFNFKEKVSNFRYNFTLRGKLFGRKTQSI 622
I RKHEA+ V ++KERFN ++KVS+FR TLRG++FG+K +S+
Sbjct: 481 IMRKHEATITEREETESETEPVVVDPIRRKERFNLRKKVSSFR--STLRGRIFGKKIRSM 540
Query: 623 SGLHTSDLYSTKDILSGPTVVMNSGERHERENFTEVPPSPASVCSSVQEEFWKLTDHHSP 682
++ + S KD ++G + N +R+ EN TEVPPSPASVCSS EEFW+ D+ S
Sbjct: 541 IESNSFEDESIKDFVTG-SKFNNFYDRN--ENSTEVPPSPASVCSSTPEEFWRNVDYLSQ 600
Query: 683 ISTSDVTPRDENCVSQVFREISSNLKELRRQLNQLESDDFEDKVVQQQPVE--SEITKLE 742
+ST DVT DEN + QVFR+ISSNL ELRRQ+N+LES+ V+++P++ I L
Sbjct: 601 VSTPDVTVSDENGMPQVFRDISSNLSELRRQINELESEVQVRTPVEEEPIQEIETIVDLG 660
Query: 743 DPAEAYIRDLLIVSGLYDGSTDNNFSRNNTAAKPINNAIFEEVEEAYRKSETKNEIIEK- 802
+P + ++RDLL+ SGLY+G++D + SR + AK I ++ EE +E +K +N+ +
Sbjct: 661 NPDKVFVRDLLVASGLYEGTSDISLSRWDPLAKLIKKSVLEETKENLKKRSNQNQEDDDT 702
Query: 803 -EPNENSVDHKLLFDLLNEALPIVLAPRLTMSRFRRNITNSSM--PPPLFGKRLLDSVWD 862
E + +H +LFDLLNE L +VL P LT S F+ + +SS+ + GK LL+S W
Sbjct: 721 GETTISEENHNILFDLLNEVLTVVLGP-LTKSGFKNKLLSSSVSESTTIRGKYLLESTWK 702
Query: 863 IILQFTHPPTDRSYYLLDGVMARDLNSTPWSSLMDDEINTTGREVEGLIIKDLFEEVVKD 902
I+ ++ + +R + LDG++ D++ PWS+L+ +E+N G+EVEG+I+ DL EE+VKD
Sbjct: 781 IMSEYLYSQPERPFCSLDGIIGWDMDRFPWSALIGEEVNVLGKEVEGMIMADLVEELVKD 702
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022145277.1 | 0.0e+00 | 87.00 | uncharacterized protein LOC111014768 isoform X1 [Momordica charantia] >XP_022145... | [more] |
XP_038904709.1 | 0.0e+00 | 85.83 | uncharacterized protein LOC120091008 isoform X1 [Benincasa hispida] | [more] |
XP_008448479.1 | 0.0e+00 | 84.45 | PREDICTED: uncharacterized protein LOC103490651 [Cucumis melo] >XP_008448480.1 P... | [more] |
XP_011650257.1 | 0.0e+00 | 83.87 | uncharacterized protein LOC101212814 isoform X1 [Cucumis sativus] >XP_031738431.... | [more] |
XP_023539226.1 | 0.0e+00 | 84.89 | uncharacterized protein LOC111799930 [Cucurbita pepo subsp. pepo] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1CW53 | 0.0e+00 | 87.00 | uncharacterized protein LOC111014768 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
A0A1S3BKM8 | 0.0e+00 | 84.45 | uncharacterized protein LOC103490651 OS=Cucumis melo OX=3656 GN=LOC103490651 PE=... | [more] |
A0A0A0L638 | 0.0e+00 | 83.87 | DUF4378 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G002320 PE=... | [more] |
A0A6J1F7W2 | 0.0e+00 | 85.01 | uncharacterized protein LOC111442905 OS=Cucurbita moschata OX=3662 GN=LOC1114429... | [more] |
A0A6J1I298 | 0.0e+00 | 84.43 | uncharacterized protein LOC111470236 OS=Cucurbita maxima OX=3661 GN=LOC111470236... | [more] |
Match Name | E-value | Identity | Description | |
AT2G17550.1 | 1.7e-134 | 39.41 | unknown protein; Has 264 Blast hits to 258 proteins in 65 species: Archae - 5; B... | [more] |
AT2G17550.2 | 3.3e-127 | 39.74 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |