Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTTCTCCTCCTCCTCCTCTCTCTCTCTTTCTCTCTCTAGCGAGCAATAAACGTAGCAGCCGAATCTCACCGTCATCAATCCGAAACCCCTACATCCGTTTTCTCTCTTTTCTAATCTTCAATTTGATGCCCCTTCAATTTCATTTATATTATGCTATTATTCCAATCTACTCCAATCGCATCCAATCCAATTCCATATAATCCATTGTCACATAGTCTCGATCAGATTTGCTTCTGGACGGAGTTATTTTGATCCGCCGACGATCTGATATTTTGTTCTAATTACGTATTGCGATTGTAATTCTTAGCTACTATCCATTTTGGCTTTTTTTTTTTTTTTTTTTTTTTTGGTTGTCGATCAGATTGGGTAGATTTGGGTTGTGAGGTAATGGAAGGATTGAGAAATGCAACTATGAGAACTTGAAAGATGCCTGGATTAACACAGAAAAATGACCATTTAAATGGTGGGTCATCGGCTATATACTCGCTCTCCGCCCATGGATTTTGGTCCCAGCATCGCGACGACGTTAGCTACAATCAGCTCCAGAAGGTATTCATTTCTTTCGTTGCTTATTTTGGAATAGTTGCCACTATTATAGATCACTTTTCTGCTCTGCATTTATGCCTGCCTTATACTGAATAAAAGTCTGGATTAATACTGCGCCAATGTAATCTATCGAAGGAGTGTCTAGCATTTTAGATGTGTGAGTTTGACACTGTAGACCTCTCGTTTTTCTCTTTACCTAGAATATAGCTACTGATTTTTCCTATTTCTTCAACATGTTGCTCTATGAACAACAAATCTTCTTTGGGAAACTAAAGTATCACATTGTTTAATTAATTACCTTAAATCATTAGTTCGGATGTGCGTGATCTAATTCCAGGATTACTTGGAACTAATGTGGTGAACTGGCCATTAATTAATTAAGTTTTGTTTACTCTATATATAGTTTTGGAGTGACCTGCTGCCCCAAGCTAGGCAGAAACTCCTGAGAATTGACAAGCAAACCCTCTTTGAGCAAGCTCGTAAGAATATGTACTGCTCTAGATGTAATGGTTTGCTGCTTGAAGGATTTTTGCAAATTGTCATATATGGGAAGTCTTTACAACAAGGAAAAACATGTGTGAACCATTCCTGCAACAGATTAGGGGTTTCAAAAAATCAAGCATGTGATGGTTCATTATCGGTTAATGGGCTTCAAGATGAAATTCAAGATCCTTCTGTACATCCTTGGGGTGGTTTGACCACAACACGTGACGGGGTGCTGACACTTTTGGATTGTTATTTGTTTTCCAAGTCTTTCCTGGGTCTCCAAAATGTATGTACTTGTTTTTGGTTTTCTTTGCTATTGGGCACTCCATATTAGCTCATCCTTTTTAATGGCCAAGGTTTTCCTTGAAGCAGGTCTTTGATAGCGCACGTGCTAGGGAGCGAGAACGAGAGTTGCTTTATCCAGATGCCTGTGGTGGGGGAGGTCGAGGTTGGATAAGTCAGGGAACAGCAAGTTATGGCAGGGGACATGGAACAAGGGAAACATGTGCCTTGCATACTGCTAGGCTTTCTTGTGATACATTGGTTGATTTCTGGTCAGCATTAGGAGAAGAGACTCGACAATCTCTTCTAAGGATGAAAGAAGAAGATTTTATTGAGAGGCTAATGTACAGGTGTCTAGCCGATGCTCATCCTCTTTTTCATACATTGCATTTACATTAATGAAAGGAAAATGTGATAGTTGATGCTGATGGAGATATGATTAGATAATATAACCTTTAGGGATACAAAGTGTCATGATCACATTTTGAGGAATACTAACATGAACAAATAGAAGCTCGACCTCTTATTAAGGATTAAATCTGTTTATTTTTATTTCTATTTTTATTATATATGTTTTTTTTTACCGATACCGATTTCTTTTTATGGATTTTGAAGATTTAAGCTAGTGAAGAATGTTGCCCAAGTGTGTGTGTGTTTGTGTATTCACTTTTCTGCTTGGAAGCATGATTTCTTATAAAGAAAACAAAGAAATCTAGAATTACAGCTGTCTTGATTGATTGTATAGATACAGTTTTAGTTAAGGAACTTGTAACTTAAGTTGGTTGTAGTTTACAAGCCAAGAAAATTCTTCTGTTATTAAGTTCTTTTGATTGTTATATATATTTAAGATGGTATGTATGCCCTTTTTAGGTCTAAAAATTTGTGCAATATACAATCAGATTGAAATTAACAAATGGGGACAGCTTCTAATCCTATGAGACGAGTAGATAATAAATGCACTGGAAGGTGCCTTTTTCTCTTTCAATAATTGCGAAGTTGACTTAAAGGAAGTATTATGAAATGTTTTTGAAGGGCTCATACTCTTTGGCTAAAAGTTGGATGAGTTGATATAAATGTACTGGAAGCATTGGAAGTTGGTTTTGATCTTTCAATTATTGTGAGTTGATCTATGGTAAGTATTATGAAATATTCGTGGGGAGATCATGCTCTTTGGCCTAATATTAAAGAGGATTTTTTGGATTAAATTGTAGGATTTTCTGGTTTCTGTTGGCAGACATTGCGTTGTTTGGAGGGGATTCTGGTGCTGTTTGTTCTAAATAAGCCAAAGACTCTGTGGCACGAGATTATAGTGAGCAAGTACGACCCTCACCTTTTGAATGGATTGTGGCTATGATCTTGAAAGGCACATTTAGAAACTCTTGGAAGTATATTGCTTCTAGTGGTTCTTTATTCTCTAATTTTGTTTACAACTCTATTTGAGATGGTTGAAACATGTATTTCCAAAAAGACTTTGTGGCACAAGATTATTGTGGTGTGAGCAAGTATGAACCTCACCTTTTCGAATGGATTGTGGCTATGGTCTTGAAAGACGCATTAAGAAACTGTTGGAAGTTTTGTCATTTCTAGTGGTTCTTTATTCTCTAATTTCGTTTACAACTCTATTTGGGATGGTTGAAACACTTATTTCCAGAAAGGCTGATGGTTGCATGATACTCCCATTTGTAACACGTTTCTTATTTCTCTTGTTTCGTGTCATTTCTTATTCAAGTAATTGGCATACGTCTAATTTTGGTGTCTAACGTGCTTTGTCGGATAGGGAAGCCTTAGATGTTTTGACTCTCTTTGCTCTTTTGGTAGAGGTCCATCTTCATCTAGATAATAGAGTTATATCTTGGTGGTCTTCTAATCTCTCCAAAGAGTTTTCTTGTCACTCAGCTTTCCTTCGTTTGGGTGCCACAACATTCTTATAGAAAACTGTATTTTCCTCGTTGAGGAAGATAAAAGGTTTCCAAGAAGGTTAAGCTCACTACATGACCAGTTTTATGTGGGAGATTCAACACCCTGGACTGCATTCAGGGAACTCTTCTTTTCTGATTAGGTGAAATGAGGCCCTTCTTCTAGGGAAGAGGCTGAAGATCTTGGTCATATCCTCGAGTGATGCAAGTTTGCTCATTTTTTTCCTGGAGATGTTTGGTGTGCATGTGGCTTGTATCCATGATTATAGTTTCATGATTAAAAAGGTCTTGTTCCATCCCTATTTTGAGATGTGGGTCTTACCTTGTGGCATTATCTCTCAAGAAAAATGCACCCATAGCTGGCCTTTAACTTTTCAATGTAGCATCTTGGAACCTCATGTGTCATCTTATGGATGATGTTATTGTGGAATGAACGTCTTTGTCAAATCTTATTGAGGCTTTTACACCTCATGGTTGTGATGAAAATTGGTGCTGGCTTTTAGAAAATGATGGGCTTTTTCTGCAAGTCTCTCACAAGAAAATTGGCTTCCACTGATTAGAGGACTTTTGACGGTTTTTATACACACGTTTCGAGGGGTACATATCCAGAAATTGCTAAAAAGTTTTTCCTCAAGAAGCTCAGTTATTCCAGCATAAACACTCAAGATTCCCTACATTTCTTACTCAAGAAATTCCCATGGATATACTTGTCTCCAAACTGCTGTCCTGTATGAAGAATCTGGAATCACAAATGAGAAATCAATATTTACAGCAACTTCTGGTTTGCAAATAATTTTGGATCTTTATTCTCTCTAAATTTGGATTGAACTGGAATATAGACTTGCCCAGGGGATCTTTTCTCCCTTCTATAATGTGTAGTAAAGCGTCATTCCTTTAAAGATGAGAAAAAGACCCTTTGGTTAAACTTCATGAGAACCTTTTTTTGGAGCCTTAAGATGGACGTAATTGCCACATCTTCAACGATAAGAAATGTGGGATTATAACTTATCCTTTCAAAAAGGAGAAAAAGACATTATGAATCTATTTCTTTCCTACCTTTATCTTGGCGGAGGTTAAATCAACCCTTCTGTAATTATAGTTTATCTACTCTCCTGGGTGTTGGACTAGCTTGAGTGGTTGGAACGGCTTGAAGTGGCTTGCTTGAATGCCTCATCTTGGTGCACATTATCCGAGCTGTTTGAAGACTTTTCCATCCAAGATTTTTGTTTAAATTGACATGATTTTATTTTCCTTCCTAAGTTTGTAAAAAGACTTGTCTTGATGTTGAGTTTGTTTCATTTATTGTTGTATTACTGATTGTTCCTTGAACATTTTTACAATATTTGATTATGGATATGAGAGGTTGCTAGGGTGTCTCAATCTAGTTGAGATTCCCAGGTGCAACTTCTGATCCTTTTTGTTTCTTTTGATTTTGTTCTTGATTTTTTTACATTAGTCTCACTTCATTATACCAATGATAGAGACTTGTTTTCGTTTAAAAAAAAAAAGATTATTGGACTAGTTTCATGTAATTATCTCTTGATAGGTTCTTTTGTACACATTTCACCATATCAATGAAATAGTTTTCCTTCCTTTTCTTATTATTTTCTTTTCATTTGGTTGTTATTATTATTATTATTATTATAGACCATTTAGCTAGAGAGGGATGGATGATAGAATGTTTAGAGTGTTGAGAAGTCATGGAAACAGGTGTGGATCCTTGCCAAGTTTAATACCTCTCTTTGGGTGTCTGCTTTAAGAAATTTCTGTAACTACCCTCTGTCTTATTATTTTGAATTGGAGTTTTTTTTTTTAAATCTTGTTATTTTTTTTTTTTGTGATCTCTCTCTCTCTCTCTCTCTCTCTCTCTTCCAATGAAAGCTCAGTTTCTATCCTTAAGAAAAAAAAAGGAAAAGAAAAGGAAAACCAACTTATTCTATCAAATTGGTTCTATGTGCTATGTTTTTGCATAGTCTGCTGAGACTTATCTGCACAAATTCAAATAGTATTTAGGATTGTTATATTTAGTGAGGAATATAGAAAACAAGCCTTGTTTATTCTAATAATATATATGTTTTCTTGTCTGCATTTTTCTTGCCTCGTTCTTTTGTGCCTCTTACAATATGTTGATTATCTCACAGGTTTGACAGCAAGAGGTTTTGTAGAGATTGCAGAAGAAATGTTATCCGTGAGTTTAAGGAACTAAAGGAGCTGAAGCGCATAAGGAGAGAGCCTTGCTGCACTAGTTGGTTTTGTGTTGCAGATATGGCCTTTAATTATGAGGTGGATCCTTGCACTTTGCTTCAACTTTAAGTGCTCTTCTACCATCATAATTATAAAGTAAACTTTACAATTAATATGAATGATAATTATAATGACTACATAGATCATGTAATGTTTCTCTAGATTGTGCGCGTTCTGCAATCAGCATCAAAGAATTTATAATTGCTATTGTTTCAGGTATCGGATGACACAATACAAGCCGATTGGCGTCAAACTTTTGCTGACTCTGTGGAGACATATCATTATTTTGAGTGGGCTGTTGGAACGGGAGAAGGAAAATCTGATATTCTGGAATTTGAAAATGTTGGCATGAATGGAAGTGTTAAAATAAACGGCCTAGATCTTGGTGGTTTGAATTCATGTTTTATCACCCTCAGAGCTTGGAAATTGGATGGACGCTGCACTGAGCTCTCAGTGAAAGCTCATGCATTGAAAGGTCAACAATGTGTTCATCGCAGACTTACAGTTGGTGATGGATTTGTTACAATTACGAGAGGGGAAAATATTAGGATGTTTTTTGAGCATGCTGAAGAGGCCGAGGAAGAGGAGGTTGTTCTCTTTCTTGTAATTTTGTCTCTCTAGGATTTTACCTACCGAAAACTTTTTGTATATCATTCATAGTTTCAATAATGAATGGGTTTCTTGTATCTGCTATGAAAGAACTATCTGAAACATGATATGTTTCATGAACTTGAAATTCTTGTGGGTGACAAATGGTTTTTCTTCTAAATCATGGCAACTCTAATTGCTTTGCTGAATTGTGTCTTTTAGGAGGACGATTCCATTGATAAAGATTCAAATGATTTGGATGGAGACTGTTCTCGTCCACAAAAGCATGCAAAGAGTCCTGAACTTGCTCGGGAGTTTCTTTTGGATGCTGCAACTGTCATCTTTAAAGAACAGGTATGTTATCTGCAATCTTCTTGGCTTGTGCTTTCTTGCCAATTGGCATTTATGACATTATTCTCTCTCAAAGACAAATCAAATGTTAAAGTCTTGTTTTGAGAAATTTTATTAAGCACCAAGGATATAATTAGCAGTTAGGCACATGCTTACGAGGAAAAACATTTATTTTATTTTTATTTTTTTGAGAAGAAAACAGGGTTTTTCATTGAAATGATGATAAGAGACAAATGCTCAAGATACAAAAATATAAATAAGAAGACAAATTCCTTTACAACAAAGCTAAGGTCACCTAGAACTGAAGCAAACCAAATAGAACAGGAGAAAGCAGAACTTTAGGCAAACTCAAACTAAAAGACCAAGAAAGTAATTAATCTTGAAAGGCTCGAAGAATTTTTCCAGGGTGAAACTTCTCAAGGAAAACTATAACCCCGCAAGTGAACAAAAGAACTAAGAAATGTCTTCTAAATTACCCAACCAAATTGGAAACCATTGACCTTAGGTCCTTAGGTGAGCTGTCCAAAATCTGCTAAAAGTGAGAAAGTAAACCAACACCCAAAAATAATGTACCCACTGAAAGACTCACAAAAAGGCTTTAAAGTAAGCTTCAACTGAAAAGCCTTTTGATGACAAAAAACTGAAGAAGATTAACACAACAACTTAAAATTGTATTCCAAATCCCGCGTTAAATCTTAATTTTTTTTTTTGAAAAGGAAACAAACCTCTTTATTAATAATAAGAAAAGAGACTTTGTATGCACATCACAAGGAAAAAAAAAAAAACTAATGCTCAAAGTACAACAGAATTACAATTTTAATTTTAAATTAGAAGTTAAGTTACTCATTTATGGAGTCAGTTGAAAATGCAGCACATTTTTAGTTGCGATTGTTGGTTCAGATATGTATATATATTCTTCCTTTATCTTTAAGTATCATTTAATGAGATGTAAAAAGAAGGTCTAATATCCAGCCAAGTTGGCTGCATCTTCCTTTTAATTTCTCCCAAGACTCAAGAGATCCGTTTCTCGTTTCAAAAAGACTCAATAGTGAAATATTATGCCAGTGGACATAATTCTAGTTTTACCCATTTAGGCTAGTGATAGAGGATAAAGTAGATGCTATAAATGAGACAATATAAGATAAGGGAAAATTGGAGATAACTAGGGAGAAGAAAAATAGGAGACATTCTCGAAACATTTTCATTTCGACGTTGAAGTTTAGTTACTGCAGTTTACTAATGGGTCAATTAAATGAATTTACAAAAAAGATTTCCACTTGGCCAAAAGTCTCATCACTATAATAAGAGAAAAGGTAGAAACGTTTGCACCAATATATATCCCTTATGACATTATCCATAATGGAATGCAAAGAGGTTTCTATCGCAAGAGAGCTCTGTTGGCCAAACGGCTTTGGCGCTTTTGCTTTGAGTCTGAATCTTTGTGGCACATTCTTATTGTTATTTTTCTTTTAAACGGAAACAAGCCTCTTCATTGTGAGACTAATGATCAAAGTACAATTGAAAGATACTAAGAGCAACAAGGCCTAGGTGCACTTAGGCATCTCAACTAGGTTGATGCCCCCTTAGCACCCTCATCATATCCATATAAACTATCCAAGTAGACTAAAAATAGGGAACAACCCAAGAAAACAAAAAGCAAAGCGTCCAAGTACAACATGATACTAGTGAAAGATACATCATAAAACAACAGCCACAGGGGCATAAAGCAAAAAAAAAAAAAAAAAAAAACAGCAAAATCCAACTAATACAAATTGGATCACTGAAAAGAACATAAACTAAAAGGCTAGCTGTGTGATTACTCTCCAATTTAGACTTATATCTTGTACCTAGAAGTTTGCAAAGGATTTGGATAGGGGGCACCAAGGGGAAAAATTAAGTCTTGTTGTCTCAAAACAATCCAACCAAGGATTTGATTTATCATGGAAGGCTCCTTGGTTTCTTTCAAACCCTAAATCAGCTAATAATGCTTTGCATTATTCCATAGTAATTTAGGGCCAAATTTCAGCGATGGACCAACCAGAACTTGGAAAATGTTATGATGAAATGTGGTGGTGTAGGATTATTGTTAGTAAGCATGACCCCATCCCTTCAAGTGGTTGTCCAATGGGGTTGAAGGTACATTTCATAATTTGTGGAAAGATATTTCTCTTTAGCTCCCTTCATTCTCCCTTCATGTTCTATGTGTCATGGGGAAGGTGAGGGTACTTATTTCTTGTAAGATCAGTGAGTGGGGATTGATCCCTCTGCTTTGCATTTCTTTATTTATATAAACTTTTTTCCTATAAAAATCATCATGTTTTTGATTTCGTGGTTACCAAAACATTTCGTGGTTTGCTGGCTTAGTGGACAACTAAGTTAGACTTCTTGGTTTGGTTAGTGATGTAGTAGGAATAGGGTATTATGATCAAATCCTTAATCTATTAGGATTAAGATTTGTTTCTTTTATTAATTAGGATTAAGATTTGTTTCTTTTATTAATTAGGATTACAATTAGTTTCTTTTATTAATTAGGATTAGGATTAGTTTGCTTTTCATATATAAATAGAGTACTTGTCTTCTTGTATTCACAACTTTTAAACATGAATAAAATTCTCTCATTTGGTATTCATCATTTGGTATCAGAGTAGCCATGTGGGCCAACATTTTCCGCCATCGGTGGAAAACTGAGAAAGGGAAACCCTAACAATTCCTAAAAGAAAAAACCAAAGATTTATCAAGATTTTGCAAGAATGGAATTTTATAGACGACTACAAATTGAGACAGAATGATCAATTACTTTGGTGAATAAATAGAGACGAAAATACTAATGAAGGAAAAATACAAATATGAAAAATATTAAATAAGAATAAGCCCAATAAACCCTAATTTCCTTTAACACTCCCAAACTCAAGGTGGTTGAATTGCAAACAACTTGAGTTTGTGAAGAAAACATCCGAATAAAATCAGCAGATCTGTGATAGAATCAGAAACCAACGAAGAACCAAACACAAAGTATTTCTAGGCTGAAAACCAAAACTCACGAAGAACGAAACAAAAAACTTCTGGGCTAGAAACATAGATCTTCATATAGAGATCTATAACAAAACGTACGAAGAACCAAACAAAGAACTTTTGGGTCGGAACAAAGATCCTCATAGAGAGGTCTATAACAAAGCCCATGAACGATTGATTTGAAAACGAAACAGAAATAGAAAATAGAACATAACAATTGGGCTAATTGAACCAGACTCAATGACTAAACCAACCAAAACTGGACCAGACTGGTGAACTAAGACTGAACCAAACCGGTTGTATTTAAAACATTGAACCATGTCAAGAAAAACCTCTTTTCTTCATAGTTTTTGGCATATGAAGGAGGCGGTGAATCGCAAGGTGATCGACAGATCTATGTTATGTATGGTGCGGCAAAAAGGAGATGAGGTGTGACTGGCGATTGATCTATGGCACGAATGACGGCGAAATTAAAGGTTGTAGGTGGATCTCTGGGCGGCGATAGATGTGGATTTTCGACGGGAGGCCGTGGTGTGGGCGAATCTAGGCGGTGGTATGAATCTGATGTGGGCATGAGCAACTAGGCTAGGAAAATCTAATCTGATGGAGGAGAAGGTGGGCATCGAAGATGGATTTGAATGGCGTGGATCTAGGTGGTCTAGGTGCACGATCTAGACAAAGGCTAATGAGGGGTTAGATCTACACGTGGATTTGCCTCCGTTGATGGCTAGTCAAATCGACGGAGGCAGAGGTCGAATAAGGTTGTGCCGACTAGGGTTTAATGAAAATGAGATAAAACCTAATGGTTTGATACCATAGAAGTACATAATAACCTATTACATTGGTGAATAAATAGAGACCAAAATAGTATAAATGAAAACTACAAATATGGAAAATATTAAATAAGAATAAACCCAATAAACCCTAATTTCTAACACTAAGAAAGTTAAATTCTTTACCACATGTTTCCAAAGTGATTGAGGGATTTTTTTGAAATGCACCCCAACTTAAAGTTGCTGGATATTTCCTTGAAGGTGAGCACCAAACACCCAAGAAAGATTAAAGAAAGAGAACAACTTCTCCCAATATTGTGCTGTGTACGAAAAGAAGAAGAAGAGATGTAAAGGATCTTCTCCATTCCAATGACATAAAGGGCAAATGGAAGGAGAAAGACAGCTGTTAGGAAGTTTATAATGCAATACAGAGGATACGTTAAGATGACTGATCATGATGATCCAAAAAAGAATGTTAACCCTCCTTGGGCTCTTAGACTTCCATAGTGCCGAGTATAGCAACTTGTCCATTGGGGAAGAAGGGCATAAGTGAACTGTGAGAGACTTTACTGAGAAGACCCCATTTCCCAACAAAGACCATTGGGTACTGTTAACTGTTATGTTAGGAATATTAGTAGCATAGTACTACTATCATAGTAATTATGGTATATTAGTAATTTAGTAGGGAGGTGATTATGTAGTCTGGTAAAAAACTAGAGGAAGTGAGGAATGAGGGAGTTAGACATTTTGTTAGGGAATTATTGAGAGAAATCTCCAAGAGAGAGGGTGCAAGTTGATAATTTTATATACTATTTTGAATATTCAGAATAATTGAGAGTATATAATGACCTTTTACATAGGGAATGAATAGACTTCAAAGAAACCACTAAAGGAAAATAAATAAAGGGAAAATATCAAAAAGGAAAATAATAATAAGCCATAAACCCTAATTTCCTTTTAACACAAGTAACTCGAATTAAACAATGTAGTTTCTCCATATTGCAATATAGTTATTTTACTGTTTTCTTATATTTTCTTGTTAGGAGAGGTACCTTAACCAATTCCACGTGTAATTCCATGAGTAAGGAATTCATTATTCGATTTACACATTTGGATGTCCAGAAGTATTTGGGCTTGTTGCAATCGATTTGCTAGATTTATATGTGGATATGTGTTATGGTTACTGAAATATCTTTATATACTATATCAGACAACACTCTGGATTTGATTTGTTTAGAACGGGAAATGAAACTTTTCGTTGATATAATGAGAAAAGACTAATAGTCAAAGATACAAATACACTTAAGAAGGGAGAACAAGAAAATAAACTAAAAATAAACAAATCATAGTTCGATCTTTGGAGTTTGGTTTTTTTCTATTTTTTCCATTTAGGGCATTTGTTTGAATCGGCATGATTTTTTCTCGGTTGAAGTGTTGTTTATCATTATTTGTATTACTTGTATTTTGTATCCTTTAGTATTAGTTTCTTTTCATTATACTAATGAAAAATCTTGTTTCCTTTTAAAAAAAGAAGAAAAATAAGAGAAGCCTCAACTTCATGCACAATGTTGAAAACTGAACGAATCTTGAAACTTGGGCTACTTGGAGAATGAAAGTCAGCTCGAATTTAATTGTGCCATTCGAGAACTTAGTGAAAAAACAATGACACTTGAAAGAAGAAAACCCAAATTCTTCTATGGATGAGAAGCACCTTAAAAACTTTAAAATTAATGCTTTGCTCCTCAAAGAAATTACTACACCCTTGAACAAAGAGAATTTATTTAAAAACGAGTAGAATAGAAGTTTCTTCTTCATGCAGAAGGTTAAAAATCGAACCATCTTGGAACATGTGCCGCCTGCAGAAGAATAATCGAAGTCTAGTATGCTTTCTTGATACTTCATAAGGGCAAGCTCGAAAAAGGAAGCCATCCTTAATTGTACCTATAAAAAGAAGCCATGCAAAGGGTCATCCTTAATTAAACCAATAAGAAGCTATGGAGTAATCAAGTTTATATTGAAGACTGAAGCACAAACATCACTATGAATAAGGGTGAAGGGCTGAAAAGGTTTATTAGGTTGAGACGGAAAGGTGACTATGTTGTTTCGAATGCACACATCACAAGACAGAGAAGAGACATCAACTTTATTAAATAAATGAGGAAATAAATATTTCATATATTGGAAATTTGGGTGACCAAGGCGAAAATGCCACAACATACAATCTTTTTCTATAGTTGAAAAATAAGAAGACTACAAATTTGTCCTGTAACAATTCCTAGAGGGAGCATCATTGTCAAGGAGATAGTCCCCTATTGTGTCGGGCAGTGCCAACCGTGTTTCTCGAGCTCAAGTCTTGAAAGAAAACATTATCCGGTGAGAAGGCAACTTGACAATTTAGATCTCTAGTTATCGCACTAATAGATAGTAAATTGTAAGATATTTTTGGTACATGCAACACATTTTGTAAAGTTATTTAGTACTCTAGAGAATAACGGGGTTGCAAGCAGTGATAAATATGACCTTGCCCGCAATAGGAGAAAGGAGTCATCTACAATCCGAATCTTTTCATTACCAGCACTTGGAAGATAGGATAGAAAATTATCATAGGATCCATTCAAATGATTTGTAGCTCCTAAATCAAGTATCCATGGTGTCATACCATTTATGCTGAGGAGACTTAAGGATTGAGAGGTACCTGATTGAACAATGGCCCTAAGGTAAGAGACACCAAAGTCACTTTGACAATTCACCTGAGGTTGTGACTGGGGGGCACTCGTCAATTCTCTCATAAGAGCTTGACTAGGGTTGGGCTTGTCATTTGGAGGATGTCTTCTCTGCCATTTGGTGGTCTACCATGTAACTTCCAACATTGTTCTTTTGTATGCCAGGGTTTCTTGCAATGTTCACACACCAGTGTTTGTTTGCCATTTTGCTTGTCATTACTAGAACTAAATAATTTTGCACTAAAATCAGCAAAATCAGTAGCAGATACAATCTTGATGTTCATAGCACTGACTCTGTCTTCCTCTAGACGCACCTACGAACACACTTCCATAAGAGGAGGTATAAGCGTTTGTCCCAATATACGACCTCGCACTACATTGGACTTGGAGTTCAAACCAGGTAGGAAATCGTAAACACGATCAACTTCCTCAATCTTGGAATATTGGACACCTCCACACGGACAATCCAGATAATTTCATAACATAGATCCATCTCTTGCTAGATTAGGGACAATTTGTTAAAGTATGAAGTGACATACATCGCCCCTATTTGCACTCACGAGCCTGTTTACACAAAGTGTAAAGACGAGATTTTTGAATTGCATCCTAGATATCTCAAGCAGTAGCAGCATACAACAACGGTTTCCCTATCTAAGGTTCCATACTACTAATCAGTAATGATCGAAGTAAGGAATCTTCTCCTTTCCAAATGCGCACTTAAGGATCCCTTGGTCTAGGTCTTAGGTATTTCACTAGTTAGATATCCAAACTTGTGACGCCCCTCAAGGACCATTTTAATGGACTGAGACTAGGAGAAATAATTTTGACCATGCAACTTTTCTCCTGCAATTAACCCTGTGGAATTTCCCATAGAGCCAGACAAATAGTTTGTAGTAGTTAAAGTAGAGAAAGAGGTTACTGGATTCTCTGGATACATTGATAAACGAGAAGGAATGTCATAATTCACTGTACTAAAATTTAGGTTTGTGGAAGCACCTAGGGCTGCCCCAAAAGTAGCGATTTGTTGCTGAAAACTTGTCCGAAGATTAGCCAATTGTTGTACTATTATCTGGGGATAAACCAATTGAACCTTGATGTTCATAACTAGGTGACACATGATGAGGAAGACCCGGTTGCTCGAGAAGCTTGTTGCCGTGATAGGGAATTTCCCTCTGTGGTAGGCAGACGACTCACTAATCTTCGTTTCGTGGCGTAATGTTGCGGCTAAGGTTTCTGTCAAAGTATTGTGGCGGTGCTAGAATTGGACGGATATGCAAGGACCAAAGATAACGATCAACAACGACTTGGATATTAGTGGCAGTGATGACAACGGAAAAATCGGCGGCGGTTCCGGCGGTTGTCGGTGGTGGCTGTAGCTGAAGCAAGACTAGTGCAGAACTAGAGTTATCTTGCATCAGTTGGTTTTCGCCAATGACGGCTAGGGTTTCGTCACCACTCTCTAATACCATGTTAAAAGGTAGAAAATAGAACACAAGGTTTACGTGGAAAACTCTAGAACAGGGAGAAAAACCACGATAGAAAGTCTCTTTTATTATTTTCACATAGTGAAAGTTTCAAAAGAAGACAACTTTATAGGCTCTAGCAAGCCCTAATACAAAAAAGGAGATAAACTTAGGGTAAAGTAAAATATTAAAAATCCCTCGGGGCTATAAACTATCAACAAAGCCCCTTATTTTCAACACTTGCTAACAATAACTTTGTGCCATAGGATGTTAGAATCAGGATTAAATCACCAAAACCATTTAGCCAGAAGTACTTATCCCATGGGTGTCTAGATTTGTTGCTGGTTTCCAAGGACTTCACATGACTTGGGTTAGGAGGGATCTTCGTTTTTGGACCCATTCTTCTTAGAGTTTTCCTTCTGGCTCTCTTTTTCATCTTCTTGAGTAGTACCCCATTGATTCTCCTTGACACCCCTTTTTTCCCTTTACGGAAAGTCAAAATCCCTAAACTAGTTAAGTTTTTGCCTAGCCAGTCTGACATGAGAGATTTAATACCTGAGACTGCATGCAAAGACATTTTCCCTTTTGGGGGGGCCTTAGTGGTTTCCCTTTTGAAGAGCGGTGGAAGATCTGAACAAAATTTGGAGATCCTATTTGTCCTTTCAATTTAGTCGCCTTCTTGTGGGGGGAGGGGGGTTTGGAGTTTGTTTGGCATGAGAAAAGGACTATAATAGGGCTATGATGGACCAAGTTTGGCTTTTTTATCCCTATAGGGAGTGTGACAATTTTCTATGACAGGCTTGCTTCTTCTCGCAGTTGTTTGGGGGATTCTGGCTCGTTGGAAATACTAGAGCTTTGGAGGCAATGAGAGATTTGTAGAGGAAATGTGGGAGGCCATTAGGTTTAATGCCTCATCTAGGCCTTAGTAGCTAAGATCTTTTGTAATTAGCGATTAGGAAAAGGCTTCATACTGTTGGATTGGAGTCCTTTGTTATCTTTGGACTCCTTTTGTTGGTTTTACTTCGTTTTATTTTATTTTAATTCATTATTCATTTGTTCATATTTCTCATCAGTATCGAATCAAGGGGAAGTAAAATCTCTGCATCAATTGATATTGATACTATAGTTAGTAATGCATATGTTCTTAAAATATATACACTAGCTTTCGAAGCCGGAATTTTCCTTGTACTTTGAGCACTAGACGCATTTTGTTAATCCAATGAAAAAGTCTTGTTTCCATTTTAAAAAAGGTAACACTATAAGTAGTTGGACATTATTCATAAAGGTTCTCAATTTTGGAGAAACTGGTTTCTCTCTTGTTCTAGATGGATTATTGGGTTAATGACCATCAACGTCATCCATCTAAATCTACTGGATCATCTTGGCTCGGTGGGATATCTCATCTTCCATCTTCTGTATACCCTTGATTCTTAATGAAAGTTTGAAAGATGTTCAACTTAAAATTTATTTATATCAGATTACAATTTCTTTATCTTGAGTAAAAAATTATATCCAATTATTCGACTTGCCATGGATGTAAGTAGAATATTATCCTTTTGTTTCATAATGTGATAGAACCCAGATATATTGAATATAAAAGATAATAAAAATACACGATAAACCAAATAATTCGAGGTACTTGGACCCTCGACCTCCCTTCTTGAGATCATTCTCAAGCCTTAAGTCACTCACCCTCTACTTATAACCAACTTCCCTTGCAAACTCCCTATCTAATTATTAATATACCCGCAATAATCCCTTAATACCAATCTTATCATTATGATGGTCTAATTGATACTTCACTTGTTTTAAGGAAATTCCTCTCGAAGGTTTTGCTCGTCCCCTTAATCCAATTGAAACTTCAAAGTCATAATGACTTTCTGTTACTCACCAGTTTCCAATTAATCTAGGGAATTTTCTAATGTGATCTCTTCTTCTAGCTTCTTTGACTGGAAATTCAATGCTGGCACTGATCTAACTTTTGCCTTGTGGTTTTTTCCAGGTTGAAAAAGCTTTCAGAGAAGGAACAGCACGCCAAAATGCGCATAGCATCTTTGTGTGTCTTGCACTAAAATTACTGGAAGAACGAGTTCACATAGCATGCAAAGAAATCATTACTCTGGAAAAGCAGGTTTCACTGCAATCTATATCAGTACTCAGAAGATTATTTACAAGAAATGAGATCTCGATTTACTGCTAACCTTTGAGAATTTGCTGCAGATGAAACTTCTTGAAGAGGAAGAGAAGGAAAAGCGTGAAGAGCAAGAACGCAAAGAGCGAAAAAGGACTAAAGAAAGAGAGAAGAAGCTCCGGAGAAAAGAAAGATTAAAAGGAAAGGATAAAGATAAGTTGAGTTCTGAATCAGCTGAAGTGTGTCCTCATTCTGATGTCTTGGAGGATTTGTCCCCATGTGTTTTGGAGCCAAATTCCAATGCAGTCGGTGAAGTATGCGATTCCAGCGTGCCTGAATCTTCTGACATTCTGGATGAGCTATTTTTAAATGAATCCATCATTTCAGAAGGGCAAAATTCGTATGATGATAGCTTTGATGGAAAACTTACCGACGGAAATGATGGAAATGAGTCTTTCATAAGTGATCAATCTAAGGTTTCTCGATGGAGATTAAAATTTCCAAAGGAAGTTCAAGATCATCCTTTCAAGTGGTCTGAGAGGCGCCGATTTATGGTTGTTTCAGAAAATGGGGCGCTGGTTAACAAATCTGAGCAAAGATATCATGTCGATAGTTTGGAAAATCCTTCCAGGAGTATGAATGGATCAAACAGGAAGTTAAGAACAAATTCATTAAAGGCCTATGGTCGACATGTCTCTAAGTTTAATGAAAAGTTGCACTCTTCCAACAACCGGGTGTCTTATGACTACCGTTCCTGCGTCTGTAACCAAACTAATGAATTTAACAAAAAGGCGGAGCCATTTGTTTCTTCAGTTAGAGTTAACCGAGATGTCAAATCTGTGAGCAAGTCAGAATCTTCATTTGATATGTCCAAGCAAAGTTATCGTTCTAACAAGTACAGTTATGGAGATCATTCTCGTGATAGCGGGAGACTGAAAACGAAAGCTGCTTTATTAAACAATTCTCCTGGTAAAGATTTTGTATATTCAAAGAAAGTTTGGGAGCCCATGGAATCACAGAAGAAATATCCTAGAAGTAACTCAGACACAAATGTTGCATTGAAGTCTTCAACTTTCAAGTTTGATGCGGAACCTGATTATGATGTTGTGAAGTCAAGGGATGATGTTGTGAAGTCGAGGGATGAATTTTGCAGTGGTGAAGTTAGTGTAACTTCTGGTGCAGTTGATCAAGAGGAGAGTAATTCTACTGAATCAACTTCTGGCATTGAATCAGATGATGTCTCCCAAAATGAAATTTCTATCGAATCGAAGAATCATAAAAACGTAGAAGATGTATGTGAGGTAAAACAGTGTTCTGCAAATTCAGCCATAGACACGACATTGACATCTAGTGGGACCAGTAACCAAGTAGGGACTAGCTCATTAAATTCTGATAACTGCTCATCGTGCCTGAGTGAAGGAGACAGTAATACTATTGGCTCGAACCATGGAAATTTAGAATCCTCCTCCACGTCCGACTCAGAATATGCTAGCCATCAATCAGAAGGAAAAGAATCTTCAGCATCCATTCAGAATGGCTTCTCTGAGCATCATGAGATACGGATAGATAAAGGAATTGGAGGTGAAGCCATGGGGAGCAGGAGCTATTCCGGTCTACCTCAAGAGAATGAGGGATGTAAAGTTCAAGTGAATGCACCCAAAAATGTTCCTCAGAACTTCGAAGCAGGATTCTCTGCCGTTAGTCTGGATTCCCCATGTCAAGTGACTCTTCCTTCGATTCAGAACCAAAATATTCATTTCCCAGTGTTTCAGGTTCCTCCATCAATGAATTATTATCATCAAAACTCAGTTTCATGGCCAGCAGCTGCCCATGCAAATGGAATAATGCCTTTCTCCTATTCAAACCACTGTCTATATGCCAATCCTCTCGGGTATGGTTTAAATGGTAACCCACGCTTCTGCATGCAATATGGCCATTTGCATCATCTTTCTAATCCGGTTTTCAACCCTAGCCCGGTTCCTATTTATCATCCGGCTTCGAAAACGAGCAATGGTATCTATACCGAAGATAGAACTCAGGTCTCCAAATCAGGTGCAATAGCAGAAAGCTCTGTGGCGAATTCAGACATCGCCCTTACAACTGGACATCCATATGCACTCAGTTCACCACCGAGTGGAGATCTTAACCAAAATGATACTTCTTCCAAATTGCAACAGGATAGTAGCTCAAGCTTTTCATTGTTTCATTTCGGAGGGCCTGTTGCACTTTCAACAGGAGGTAAATTAAATCTCACGCCGTCTAAGGAAGACGATGTTGGGGATTTTTCGACAAATAATGAGGTGGAAGTTATTGACAATGGTCACGCTTTCAATAAGAAGGAAACTGCCATTGAAGAATACAACTTGTTTGCGGCAAGCAATGGCATGAGGTTCTCGTTCTTCTGAATGAGAACAAGAAGATATGAGGGGGCTGTTTTCGATTTACGAGTCCCTATCTTCATATATTTTTTTTAAATGAAATTTTACAGTACCTTGCTACATAATAAGTTTTCTTTAAATCTCAATTTTACTTATAAGTTTTTCAGTAGTTCATTGCTCTCAAAATTTGTCTCATTATATTTCCAGATGTAGAGAACAC
mRNA sequence
CTTCTCCTCCTCCTCCTCTCTCTCTCTTTCTCTCTCTAGCGAGCAATAAACGTAGCAGCCGAATCTCACCGTCATCAATCCGAAACCCCTACATCCGTTTTCTCTCTTTTCTAATCTTCAATTTGATGCCCCTTCAATTTCATTTATATTATGCTATTATTCCAATCTACTCCAATCGCATCCAATCCAATTCCATATAATCCATTGTCACATAGTCTCGATCAGATTTGCTTCTGGACGGAGTTATTTTGATCCGCCGACGATCTGATATTTTGTTCTAATTACGTATTGCGATTGTAATTCTTAGCTACTATCCATTTTGGCTTTTTTTTTTTTTTTTTTTTTTTTGGTTGTCGATCAGATTGGGTAGATTTGGGTTGTGAGGTAATGGAAGGATTGAGAAATGCAACTATGAGAACTTGAAAGATGCCTGGATTAACACAGAAAAATGACCATTTAAATGGTGGGTCATCGGCTATATACTCGCTCTCCGCCCATGGATTTTGGTCCCAGCATCGCGACGACGTTAGCTACAATCAGCTCCAGAAGTTTTGGAGTGACCTGCTGCCCCAAGCTAGGCAGAAACTCCTGAGAATTGACAAGCAAACCCTCTTTGAGCAAGCTCGTAAGAATATGTACTGCTCTAGATGTAATGGTTTGCTGCTTGAAGGATTTTTGCAAATTGTCATATATGGGAAGTCTTTACAACAAGGAAAAACATGTGTGAACCATTCCTGCAACAGATTAGGGGTTTCAAAAAATCAAGCATGTGATGGTTCATTATCGGTTAATGGGCTTCAAGATGAAATTCAAGATCCTTCTGTACATCCTTGGGGTGGTTTGACCACAACACGTGACGGGGTGCTGACACTTTTGGATTGTTATTTGTTTTCCAAGTCTTTCCTGGGTCTCCAAAATGTCTTTGATAGCGCACGTGCTAGGGAGCGAGAACGAGAGTTGCTTTATCCAGATGCCTGTGGTGGGGGAGGTCGAGGTTGGATAAGTCAGGGAACAGCAAGTTATGGCAGGGGACATGGAACAAGGGAAACATGTGCCTTGCATACTGCTAGGCTTTCTTGTGATACATTGGTTGATTTCTGGTCAGCATTAGGAGAAGAGACTCGACAATCTCTTCTAAGGATGAAAGAAGAAGATTTTATTGAGAGGCTAATGTACAGGTTTGACAGCAAGAGGTTTTGTAGAGATTGCAGAAGAAATGTTATCCGTGAGTTTAAGGAACTAAAGGAGCTGAAGCGCATAAGGAGAGAGCCTTGCTGCACTAGTTGGTTTTGTGTTGCAGATATGGCCTTTAATTATGAGGTATCGGATGACACAATACAAGCCGATTGGCGTCAAACTTTTGCTGACTCTGTGGAGACATATCATTATTTTGAGTGGGCTGTTGGAACGGGAGAAGGAAAATCTGATATTCTGGAATTTGAAAATGTTGGCATGAATGGAAGTGTTAAAATAAACGGCCTAGATCTTGGTGGTTTGAATTCATGTTTTATCACCCTCAGAGCTTGGAAATTGGATGGACGCTGCACTGAGCTCTCAGTGAAAGCTCATGCATTGAAAGGTCAACAATGTGTTCATCGCAGACTTACAGTTGGTGATGGATTTGTTACAATTACGAGAGGGGAAAATATTAGGATGTTTTTTGAGCATGCTGAAGAGGCCGAGGAAGAGGAGGAGGACGATTCCATTGATAAAGATTCAAATGATTTGGATGGAGACTGTTCTCGTCCACAAAAGCATGCAAAGAGTCCTGAACTTGCTCGGGAGTTTCTTTTGGATGCTGCAACTGTCATCTTTAAAGAACAGGTTGAAAAAGCTTTCAGAGAAGGAACAGCACGCCAAAATGCGCATAGCATCTTTGTGTGTCTTGCACTAAAATTACTGGAAGAACGAGTTCACATAGCATGCAAAGAAATCATTACTCTGGAAAAGCAGATGAAACTTCTTGAAGAGGAAGAGAAGGAAAAGCGTGAAGAGCAAGAACGCAAAGAGCGAAAAAGGACTAAAGAAAGAGAGAAGAAGCTCCGGAGAAAAGAAAGATTAAAAGGAAAGGATAAAGATAAGTTGAGTTCTGAATCAGCTGAAGTGTGTCCTCATTCTGATGTCTTGGAGGATTTGTCCCCATGTGTTTTGGAGCCAAATTCCAATGCAGTCGGTGAAGTATGCGATTCCAGCGTGCCTGAATCTTCTGACATTCTGGATGAGCTATTTTTAAATGAATCCATCATTTCAGAAGGGCAAAATTCGTATGATGATAGCTTTGATGGAAAACTTACCGACGGAAATGATGGAAATGAGTCTTTCATAAGTGATCAATCTAAGGTTTCTCGATGGAGATTAAAATTTCCAAAGGAAGTTCAAGATCATCCTTTCAAGTGGTCTGAGAGGCGCCGATTTATGGTTGTTTCAGAAAATGGGGCGCTGGTTAACAAATCTGAGCAAAGATATCATGTCGATAGTTTGGAAAATCCTTCCAGGAGTATGAATGGATCAAACAGGAAGTTAAGAACAAATTCATTAAAGGCCTATGGTCGACATGTCTCTAAGTTTAATGAAAAGTTGCACTCTTCCAACAACCGGGTGTCTTATGACTACCGTTCCTGCGTCTGTAACCAAACTAATGAATTTAACAAAAAGGCGGAGCCATTTGTTTCTTCAGTTAGAGTTAACCGAGATGTCAAATCTGTGAGCAAGTCAGAATCTTCATTTGATATGTCCAAGCAAAGTTATCGTTCTAACAAGTACAGTTATGGAGATCATTCTCGTGATAGCGGGAGACTGAAAACGAAAGCTGCTTTATTAAACAATTCTCCTGGTAAAGATTTTGTATATTCAAAGAAAGTTTGGGAGCCCATGGAATCACAGAAGAAATATCCTAGAAGTAACTCAGACACAAATGTTGCATTGAAGTCTTCAACTTTCAAGTTTGATGCGGAACCTGATTATGATGTTGTGAAGTCAAGGGATGATGTTGTGAAGTCGAGGGATGAATTTTGCAGTGGTGAAGTTAGTGTAACTTCTGGTGCAGTTGATCAAGAGGAGAGTAATTCTACTGAATCAACTTCTGGCATTGAATCAGATGATGTCTCCCAAAATGAAATTTCTATCGAATCGAAGAATCATAAAAACGTAGAAGATGTATGTGAGGTAAAACAGTGTTCTGCAAATTCAGCCATAGACACGACATTGACATCTAGTGGGACCAGTAACCAAGTAGGGACTAGCTCATTAAATTCTGATAACTGCTCATCGTGCCTGAGTGAAGGAGACAGTAATACTATTGGCTCGAACCATGGAAATTTAGAATCCTCCTCCACGTCCGACTCAGAATATGCTAGCCATCAATCAGAAGGAAAAGAATCTTCAGCATCCATTCAGAATGGCTTCTCTGAGCATCATGAGATACGGATAGATAAAGGAATTGGAGGTGAAGCCATGGGGAGCAGGAGCTATTCCGGTCTACCTCAAGAGAATGAGGGATGTAAAGTTCAAGTGAATGCACCCAAAAATGTTCCTCAGAACTTCGAAGCAGGATTCTCTGCCGTTAGTCTGGATTCCCCATGTCAAGTGACTCTTCCTTCGATTCAGAACCAAAATATTCATTTCCCAGTGTTTCAGGTTCCTCCATCAATGAATTATTATCATCAAAACTCAGTTTCATGGCCAGCAGCTGCCCATGCAAATGGAATAATGCCTTTCTCCTATTCAAACCACTGTCTATATGCCAATCCTCTCGGGTATGGTTTAAATGGTAACCCACGCTTCTGCATGCAATATGGCCATTTGCATCATCTTTCTAATCCGGTTTTCAACCCTAGCCCGGTTCCTATTTATCATCCGGCTTCGAAAACGAGCAATGGTATCTATACCGAAGATAGAACTCAGGTCTCCAAATCAGGTGCAATAGCAGAAAGCTCTGTGGCGAATTCAGACATCGCCCTTACAACTGGACATCCATATGCACTCAGTTCACCACCGAGTGGAGATCTTAACCAAAATGATACTTCTTCCAAATTGCAACAGGATAGTAGCTCAAGCTTTTCATTGTTTCATTTCGGAGGGCCTGTTGCACTTTCAACAGGAGGTAAATTAAATCTCACGCCGTCTAAGGAAGACGATGTTGGGGATTTTTCGACAAATAATGAGGTGGAAGTTATTGACAATGGTCACGCTTTCAATAAGAAGGAAACTGCCATTGAAGAATACAACTTGTTTGCGGCAAGCAATGGCATGAGGTTCTCGTTCTTCTGAATGAGAACAAGAAGATATGAGGGGGCTGTTTTCGATTTACGAGTCCCTATCTTCATATATTTTTTTTAAATGAAATTTTACAGTACCTTGCTACATAATAAGTTTTCTTTAAATCTCAATTTTACTTATAAGTTTTTCAGTAGTTCATTGCTCTCAAAATTTGTCTCATTATATTTCCAGATGTAGAGAACAC
Coding sequence (CDS)
ATGCCTGGATTAACACAGAAAAATGACCATTTAAATGGTGGGTCATCGGCTATATACTCGCTCTCCGCCCATGGATTTTGGTCCCAGCATCGCGACGACGTTAGCTACAATCAGCTCCAGAAGTTTTGGAGTGACCTGCTGCCCCAAGCTAGGCAGAAACTCCTGAGAATTGACAAGCAAACCCTCTTTGAGCAAGCTCGTAAGAATATGTACTGCTCTAGATGTAATGGTTTGCTGCTTGAAGGATTTTTGCAAATTGTCATATATGGGAAGTCTTTACAACAAGGAAAAACATGTGTGAACCATTCCTGCAACAGATTAGGGGTTTCAAAAAATCAAGCATGTGATGGTTCATTATCGGTTAATGGGCTTCAAGATGAAATTCAAGATCCTTCTGTACATCCTTGGGGTGGTTTGACCACAACACGTGACGGGGTGCTGACACTTTTGGATTGTTATTTGTTTTCCAAGTCTTTCCTGGGTCTCCAAAATGTCTTTGATAGCGCACGTGCTAGGGAGCGAGAACGAGAGTTGCTTTATCCAGATGCCTGTGGTGGGGGAGGTCGAGGTTGGATAAGTCAGGGAACAGCAAGTTATGGCAGGGGACATGGAACAAGGGAAACATGTGCCTTGCATACTGCTAGGCTTTCTTGTGATACATTGGTTGATTTCTGGTCAGCATTAGGAGAAGAGACTCGACAATCTCTTCTAAGGATGAAAGAAGAAGATTTTATTGAGAGGCTAATGTACAGGTTTGACAGCAAGAGGTTTTGTAGAGATTGCAGAAGAAATGTTATCCGTGAGTTTAAGGAACTAAAGGAGCTGAAGCGCATAAGGAGAGAGCCTTGCTGCACTAGTTGGTTTTGTGTTGCAGATATGGCCTTTAATTATGAGGTATCGGATGACACAATACAAGCCGATTGGCGTCAAACTTTTGCTGACTCTGTGGAGACATATCATTATTTTGAGTGGGCTGTTGGAACGGGAGAAGGAAAATCTGATATTCTGGAATTTGAAAATGTTGGCATGAATGGAAGTGTTAAAATAAACGGCCTAGATCTTGGTGGTTTGAATTCATGTTTTATCACCCTCAGAGCTTGGAAATTGGATGGACGCTGCACTGAGCTCTCAGTGAAAGCTCATGCATTGAAAGGTCAACAATGTGTTCATCGCAGACTTACAGTTGGTGATGGATTTGTTACAATTACGAGAGGGGAAAATATTAGGATGTTTTTTGAGCATGCTGAAGAGGCCGAGGAAGAGGAGGAGGACGATTCCATTGATAAAGATTCAAATGATTTGGATGGAGACTGTTCTCGTCCACAAAAGCATGCAAAGAGTCCTGAACTTGCTCGGGAGTTTCTTTTGGATGCTGCAACTGTCATCTTTAAAGAACAGGTTGAAAAAGCTTTCAGAGAAGGAACAGCACGCCAAAATGCGCATAGCATCTTTGTGTGTCTTGCACTAAAATTACTGGAAGAACGAGTTCACATAGCATGCAAAGAAATCATTACTCTGGAAAAGCAGATGAAACTTCTTGAAGAGGAAGAGAAGGAAAAGCGTGAAGAGCAAGAACGCAAAGAGCGAAAAAGGACTAAAGAAAGAGAGAAGAAGCTCCGGAGAAAAGAAAGATTAAAAGGAAAGGATAAAGATAAGTTGAGTTCTGAATCAGCTGAAGTGTGTCCTCATTCTGATGTCTTGGAGGATTTGTCCCCATGTGTTTTGGAGCCAAATTCCAATGCAGTCGGTGAAGTATGCGATTCCAGCGTGCCTGAATCTTCTGACATTCTGGATGAGCTATTTTTAAATGAATCCATCATTTCAGAAGGGCAAAATTCGTATGATGATAGCTTTGATGGAAAACTTACCGACGGAAATGATGGAAATGAGTCTTTCATAAGTGATCAATCTAAGGTTTCTCGATGGAGATTAAAATTTCCAAAGGAAGTTCAAGATCATCCTTTCAAGTGGTCTGAGAGGCGCCGATTTATGGTTGTTTCAGAAAATGGGGCGCTGGTTAACAAATCTGAGCAAAGATATCATGTCGATAGTTTGGAAAATCCTTCCAGGAGTATGAATGGATCAAACAGGAAGTTAAGAACAAATTCATTAAAGGCCTATGGTCGACATGTCTCTAAGTTTAATGAAAAGTTGCACTCTTCCAACAACCGGGTGTCTTATGACTACCGTTCCTGCGTCTGTAACCAAACTAATGAATTTAACAAAAAGGCGGAGCCATTTGTTTCTTCAGTTAGAGTTAACCGAGATGTCAAATCTGTGAGCAAGTCAGAATCTTCATTTGATATGTCCAAGCAAAGTTATCGTTCTAACAAGTACAGTTATGGAGATCATTCTCGTGATAGCGGGAGACTGAAAACGAAAGCTGCTTTATTAAACAATTCTCCTGGTAAAGATTTTGTATATTCAAAGAAAGTTTGGGAGCCCATGGAATCACAGAAGAAATATCCTAGAAGTAACTCAGACACAAATGTTGCATTGAAGTCTTCAACTTTCAAGTTTGATGCGGAACCTGATTATGATGTTGTGAAGTCAAGGGATGATGTTGTGAAGTCGAGGGATGAATTTTGCAGTGGTGAAGTTAGTGTAACTTCTGGTGCAGTTGATCAAGAGGAGAGTAATTCTACTGAATCAACTTCTGGCATTGAATCAGATGATGTCTCCCAAAATGAAATTTCTATCGAATCGAAGAATCATAAAAACGTAGAAGATGTATGTGAGGTAAAACAGTGTTCTGCAAATTCAGCCATAGACACGACATTGACATCTAGTGGGACCAGTAACCAAGTAGGGACTAGCTCATTAAATTCTGATAACTGCTCATCGTGCCTGAGTGAAGGAGACAGTAATACTATTGGCTCGAACCATGGAAATTTAGAATCCTCCTCCACGTCCGACTCAGAATATGCTAGCCATCAATCAGAAGGAAAAGAATCTTCAGCATCCATTCAGAATGGCTTCTCTGAGCATCATGAGATACGGATAGATAAAGGAATTGGAGGTGAAGCCATGGGGAGCAGGAGCTATTCCGGTCTACCTCAAGAGAATGAGGGATGTAAAGTTCAAGTGAATGCACCCAAAAATGTTCCTCAGAACTTCGAAGCAGGATTCTCTGCCGTTAGTCTGGATTCCCCATGTCAAGTGACTCTTCCTTCGATTCAGAACCAAAATATTCATTTCCCAGTGTTTCAGGTTCCTCCATCAATGAATTATTATCATCAAAACTCAGTTTCATGGCCAGCAGCTGCCCATGCAAATGGAATAATGCCTTTCTCCTATTCAAACCACTGTCTATATGCCAATCCTCTCGGGTATGGTTTAAATGGTAACCCACGCTTCTGCATGCAATATGGCCATTTGCATCATCTTTCTAATCCGGTTTTCAACCCTAGCCCGGTTCCTATTTATCATCCGGCTTCGAAAACGAGCAATGGTATCTATACCGAAGATAGAACTCAGGTCTCCAAATCAGGTGCAATAGCAGAAAGCTCTGTGGCGAATTCAGACATCGCCCTTACAACTGGACATCCATATGCACTCAGTTCACCACCGAGTGGAGATCTTAACCAAAATGATACTTCTTCCAAATTGCAACAGGATAGTAGCTCAAGCTTTTCATTGTTTCATTTCGGAGGGCCTGTTGCACTTTCAACAGGAGGTAAATTAAATCTCACGCCGTCTAAGGAAGACGATGTTGGGGATTTTTCGACAAATAATGAGGTGGAAGTTATTGACAATGGTCACGCTTTCAATAAGAAGGAAACTGCCATTGAAGAATACAACTTGTTTGCGGCAAGCAATGGCATGAGGTTCTCGTTCTTCTGA
Protein sequence
MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQTLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLSVNGLQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLFSKSFLGLQNVFDSARARERERELLYPDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMKEEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVSDDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSCFITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRMFFEHAEEAEEEEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLRRKERLKGKDKDKLSSESAEVCPHSDVLEDLSPCVLEPNSNAVGEVCDSSVPESSDILDELFLNESIISEGQNSYDDSFDGKLTDGNDGNESFISDQSKVSRWRLKFPKEVQDHPFKWSERRRFMVVSENGALVNKSEQRYHVDSLENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKLHSSNNRVSYDYRSCVCNQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNKYSYGDHSRDSGRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDTNVALKSSTFKFDAEPDYDVVKSRDDVVKSRDEFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNEISIESKNHKNVEDVCEVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSNTIGSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKGIGGEAMGSRSYSGLPQENEGCKVQVNAPKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMNYYHQNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPSPVPIYHPASKTSNGIYTEDRTQVSKSGAIAESSVANSDIALTTGHPYALSSPPSGDLNQNDTSSKLQQDSSSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFSTNNEVEVIDNGHAFNKKETAIEEYNLFAASNGMRFSFF
Homology
BLAST of PI0002685 vs. ExPASy TrEMBL
Match:
A0A1S3B599 (uncharacterized protein LOC103486163 OS=Cucumis melo OX=3656 GN=LOC103486163 PE=4 SV=1)
HSP 1 Score: 2421.0 bits (6273), Expect = 0.0e+00
Identity = 1242/1281 (96.96%), Postives = 1254/1281 (97.89%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
Query: 121 VNGLQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLFSKSFLGLQNVFDSARARERERELLY 180
VNG QDEIQDPSVHPWGGLTTTRDGVLTLLDCYL SKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLHSKSFLGLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 301 DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSC 360
DDTIQADW QTFADSVETYHYFEW+VGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSC
Sbjct: 301 DDTIQADWHQTFADSVETYHYFEWSVGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRMFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIR FFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR 540
HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR 540
Query: 541 RKERLKGKDKDKLSSESAEVCPHSDVLEDLSPCVLEPNSNAVGEVCDSSVPESSDILDEL 600
RKERLKGKDKDKLSSESAEVC SDVLEDLSPCVLEP SNAVGEVCD+SVPESSDILDEL
Sbjct: 541 RKERLKGKDKDKLSSESAEVCARSDVLEDLSPCVLEPTSNAVGEVCDTSVPESSDILDEL 600
Query: 601 FLNESIISEGQNSYDDSFDGKLTDGNDGNESFISDQSKVSRWRLKFPKEVQDHPFKWSER 660
FLNESIISEGQNS+DDS DGK TDGNDGNESFISDQSKVSRWRLKFPKEVQDHPFKWSER
Sbjct: 601 FLNESIISEGQNSFDDSLDGKFTDGNDGNESFISDQSKVSRWRLKFPKEVQDHPFKWSER 660
Query: 661 RRFMVVSENGALVNKSEQRYHVDSLENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKLHS 720
RRFMVVSENG LVNKSEQRYH DS ENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKLHS
Sbjct: 661 RRFMVVSENGMLVNKSEQRYHPDSSENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKLHS 720
Query: 721 SNNRVSYDYRSCVCNQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNKYS 780
SNNRVSYDYRSC+CNQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNKYS
Sbjct: 721 SNNRVSYDYRSCICNQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNKYS 780
Query: 781 YGDHSRDSGRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDTNVALKSSTFKF 840
YGDHSRD+GRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSD+NVALKSSTFKF
Sbjct: 781 YGDHSRDNGRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDSNVALKSSTFKF 840
Query: 841 DAEPDYDVVKSRDDVVKSRDEFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNEISI 900
DAEPDYDVVKSRD VVKSRD FCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNE SI
Sbjct: 841 DAEPDYDVVKSRDGVVKSRDGFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNENSI 900
Query: 901 ESKNHKNV-EDVCEVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSNTIG 960
ESK+HKNV EDVCEVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSNTIG
Sbjct: 901 ESKDHKNVEEDVCEVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSNTIG 960
Query: 961 SNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKGIGGEAMGSRSYSGLP 1020
SNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKGIGGEA GSRSYSGLP
Sbjct: 961 SNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKGIGGEARGSRSYSGLP 1020
Query: 1021 QENEGCKVQVNAPKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMNYYH 1080
Q+NEGC VQVNAPKNVP NFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMNYYH
Sbjct: 1021 QDNEGCNVQVNAPKNVPHNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMNYYH 1080
Query: 1081 QNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPSPV 1140
QNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPSPV
Sbjct: 1081 QNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPSPV 1140
Query: 1141 PIYHPASKTSNGIYTEDRTQVSKSGAIAESSVANSDIALTTGHPYALSSPPSGDLNQNDT 1200
PIYHPASK SNGIY EDRTQVSKSGAI+ESSVANSD+A+TTGH YALSSPPSGDL QNDT
Sbjct: 1141 PIYHPASKASNGIYAEDRTQVSKSGAISESSVANSDVAVTTGHQYALSSPPSGDLKQNDT 1200
Query: 1201 SSKLQQDSSSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFSTNNEVEVIDNGHAFNKK 1260
SKLQQD SSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFS NNEVEV+DNGHAFN K
Sbjct: 1201 -SKLQQD-SSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFSRNNEVEVVDNGHAFNMK 1260
Query: 1261 ETAIEEYNLFAASNGMRFSFF 1281
ETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 ETAIEEYNLFAASNGMRFSFF 1279
BLAST of PI0002685 vs. ExPASy TrEMBL
Match:
A0A0A0KZE9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G563700 PE=4 SV=1)
HSP 1 Score: 2395.2 bits (6206), Expect = 0.0e+00
Identity = 1234/1281 (96.33%), Postives = 1246/1281 (97.27%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSL QGKTCVNHSCNRLGVSKNQACDGSLS
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLHQGKTCVNHSCNRLGVSKNQACDGSLS 120
Query: 121 VNGLQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLFSKSFLGLQNVFDSARARERERELLY 180
VNG QDEIQDPSVHPWGGLTTTRDGVLTLLDCYL+SKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 301 DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSC 360
DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEF+NVGMNGSVKINGLDLGGLNSC
Sbjct: 301 DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFDNVGMNGSVKINGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRMFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIR FFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR 540
HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR 540
Query: 541 RKERLKGKDKDKLSSESAEVCPHSDVLEDLSPCVLEPNSNAVGEVCDSSVPESSDILDEL 600
RKERLKGKDKDKLSSESAEVC SDVLEDLS CVLEPNSNAVGEVCDSSVPESSDILDEL
Sbjct: 541 RKERLKGKDKDKLSSESAEVCARSDVLEDLSSCVLEPNSNAVGEVCDSSVPESSDILDEL 600
Query: 601 FLNESIISEGQNSYDDSFDGKLTDGNDGNESFISDQSKVSRWRLKFPKEVQDHPFKWSER 660
FLNESIISEGQNSYDDSFDGKL DGNESFISDQSKVSRWRLKFPKEVQDHPFKWSER
Sbjct: 601 FLNESIISEGQNSYDDSFDGKLA---DGNESFISDQSKVSRWRLKFPKEVQDHPFKWSER 660
Query: 661 RRFMVVSENGALVNKSEQRYHVDSLENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKLHS 720
RRFMVVSENGALVNKSEQRYH DSLENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKLHS
Sbjct: 661 RRFMVVSENGALVNKSEQRYHADSLENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKLHS 720
Query: 721 SNNRVSYDYRSCVCNQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNKYS 780
SNNR+SYDYRSC+CNQ NEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNKYS
Sbjct: 721 SNNRMSYDYRSCICNQANEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNKYS 780
Query: 781 YGDHSRDSGRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDTNVALKSSTFKF 840
YGDHSRD+GRLKTK ALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDTNVALKSSTFKF
Sbjct: 781 YGDHSRDNGRLKTKPALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDTNVALKSSTFKF 840
Query: 841 DAEPDYDVVKSRDDVVKSRDEFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNEISI 900
DAEPDYDVVKSRD +EFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNEISI
Sbjct: 841 DAEPDYDVVKSRD------EEFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNEISI 900
Query: 901 ESKNHKNV-EDVCEVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSNTIG 960
E K+HKNV EDVCEVKQ SANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSNTIG
Sbjct: 901 ELKDHKNVEEDVCEVKQFSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSNTIG 960
Query: 961 SNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKGIGGEAMGSRSYSGLP 1020
SNHGNLESSSTSDSEYASHQSEGKES ASIQNGFSEHHEIRIDKGIGGEAMGSRSYSG P
Sbjct: 961 SNHGNLESSSTSDSEYASHQSEGKESLASIQNGFSEHHEIRIDKGIGGEAMGSRSYSGFP 1020
Query: 1021 QENEGCKVQVNAPKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMNYYH 1080
Q+NEGCKVQVNAPKNVPQNFEAGFSAVSLDSPCQVTLP IQNQNIHFPVFQVPPSMNYYH
Sbjct: 1021 QDNEGCKVQVNAPKNVPQNFEAGFSAVSLDSPCQVTLP-IQNQNIHFPVFQVPPSMNYYH 1080
Query: 1081 QNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPSPV 1140
QNSVSWPA AHANGIMPFSYSNHC YANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPSPV
Sbjct: 1081 QNSVSWPAPAHANGIMPFSYSNHCPYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPSPV 1140
Query: 1141 PIYHPASKTSNGIYTEDRTQVSKSGAIAESSVANSDIALTTGHPYALSSPPSGDLNQNDT 1200
P+YHPASKTSN IY EDRTQVSKSGAIAESSV NSD+A+TTGHPY LSSPPSGDL QNDT
Sbjct: 1141 PLYHPASKTSNCIYAEDRTQVSKSGAIAESSVVNSDVAVTTGHPYVLSSPPSGDLKQNDT 1200
Query: 1201 SSKLQQDSSSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFSTNNEVEVIDNGHAFNKK 1260
SSKLQQD SSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFS NNEVEV+DNGHAFN K
Sbjct: 1201 SSKLQQD-SSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFSRNNEVEVVDNGHAFNMK 1260
Query: 1261 ETAIEEYNLFAASNGMRFSFF 1281
ETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 ETAIEEYNLFAASNGMRFSFF 1270
BLAST of PI0002685 vs. ExPASy TrEMBL
Match:
A0A6J1HVV4 (uncharacterized protein LOC111467149 OS=Cucurbita maxima OX=3661 GN=LOC111467149 PE=4 SV=1)
HSP 1 Score: 2193.3 bits (5682), Expect = 0.0e+00
Identity = 1141/1293 (88.24%), Postives = 1189/1293 (91.96%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
MPGLTQKN HLN GSSAIYSLSA+GFWSQHRDDVSYNQLQKFW +LLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNYHLNCGSSAIYSLSANGFWSQHRDDVSYNQLQKFWIELLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
TLFEQARKNMYCSRCNGLLLEGFLQIV+YGKSLQQGKT VNH+CNRLGVSKNQA DG+L+
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVLYGKSLQQGKTRVNHACNRLGVSKNQAGDGALT 120
Query: 121 VNGLQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLFSKSFLGLQNVFDSARARERERELLY 180
VNG +DEIQDPSVHPWGGLTTTRDG+LTLLDCYL SKSFL LQNVFDSARARERERELLY
Sbjct: 121 VNGFEDEIQDPSVHPWGGLTTTRDGLLTLLDCYLCSKSFLDLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETR SLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRLSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 301 DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSC 360
DDTIQADW TFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVK+NGLDLGGLNSC
Sbjct: 301 DDTIQADWHHTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRMFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGE+IR FFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGESIRRFFEHAEEAEE 420
Query: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDS+DKD+N LDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSMDKDANGLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR 540
HSIFVCLALKLLEERVHIACKEIITLEKQ+KLLEEEEKEKREE+ERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERVHIACKEIITLEKQVKLLEEEEKEKREEKERKERKRTKEREKKLR 540
Query: 541 RKERLKG--KDKDKLSSESAEVCPHSDVLEDLSPCVLEPNSNAVGEVCDSSVPESSDILD 600
RKERLKG KDKDK+SSESAE C HSDVLEDLSPC LEPNS+AVGEVCD+SVPESSD +
Sbjct: 541 RKERLKGKEKDKDKISSESAEACAHSDVLEDLSPCDLEPNSDAVGEVCDASVPESSDTFN 600
Query: 601 ELFLNESIISEGQNSYDDSFDGKL---------TDGNDGNESFISDQSKVSRWRLKFPKE 660
ELFLN+SIISEGQNSYDDSFDGKL DGNDGNESFI DQSKVSRWRLKFPKE
Sbjct: 601 ELFLNQSIISEGQNSYDDSFDGKLGDGNDGNDGNDGNDGNESFIGDQSKVSRWRLKFPKE 660
Query: 661 VQDHPFKWSERRRFMVVSENGALVNKSEQRYHVDSLENPSRSMNGSNRKLRTNSLKAYGR 720
VQDH FKWSERRR M VSENGAL N+SEQRY+ DSLENPSRSMN SNRKLRTNSLKAYGR
Sbjct: 661 VQDHSFKWSERRRSM-VSENGALANRSEQRYYADSLENPSRSMNASNRKLRTNSLKAYGR 720
Query: 721 HVSKFNEKLHSSNNRVSYDYRSCVCNQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDM 780
HVSKFNEK+HSSNN VSYDYRSCVCNQ NEFNKKAEPFVSSVRVNRD KS SKSES FDM
Sbjct: 721 HVSKFNEKMHSSNNWVSYDYRSCVCNQNNEFNKKAEPFVSSVRVNRDAKSASKSESLFDM 780
Query: 781 SKQSYRSNKYSYGDHSRDSGRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDT 840
SKQSYR NK+SYGD+SRDSGRLK KAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSD+
Sbjct: 781 SKQSYRPNKFSYGDYSRDSGRLKNKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDS 840
Query: 841 NVALKSSTFKFDAEPDYDVVKSRDDVVKSRDEFCSGEVSVTSGAVDQEESNSTESTSGIE 900
NVALKSSTFKF EPDY ++VKSR E CSGEVSV SG VDQEESNSTESTS IE
Sbjct: 841 NVALKSSTFKFGVEPDY-------ELVKSRHECCSGEVSVASGTVDQEESNSTESTSVIE 900
Query: 901 SDDVSQNEISIESKNHKNVE-DVC-EVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCS 960
SD+V QN + IESK+HKNVE D C EV CS N +D +TSSGTSNQ GTSSLNSDNCS
Sbjct: 901 SDEVFQNGLPIESKDHKNVEDDACEEVTPCSVNLTVDMKMTSSGTSNQAGTSSLNSDNCS 960
Query: 961 SCLSEGDSNTIGSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKGIGG 1020
SC SEGDSNTI SNHGNLESSSTSDSEYASHQSEGKESSASIQ GFSEHHEIR+DK IGG
Sbjct: 961 SCPSEGDSNTICSNHGNLESSSTSDSEYASHQSEGKESSASIQYGFSEHHEIRMDKAIGG 1020
Query: 1021 EAMGSRSYSGLPQENEGCKVQVNAPKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFP 1080
+A+GS + SGL Q+NEGCKVQ NAPKNVPQNFEAGFSAV+LDSPC VTLPS+QNQN+HFP
Sbjct: 1021 DALGSTNSSGLSQDNEGCKVQGNAPKNVPQNFEAGFSAVNLDSPCHVTLPSVQNQNVHFP 1080
Query: 1081 VFQVPPSMNYYHQNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLH 1140
VFQVPPSM YYHQNSVSWPAA HANGIMPFSYSNHCLYANPLGYGLNGNPRFCM+YGHLH
Sbjct: 1081 VFQVPPSMGYYHQNSVSWPAAVHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMRYGHLH 1140
Query: 1141 HLSNPVFNPSPVPIYHPASKTSNGIYTEDRTQVSKSGAIAESSVANSDIALTTGHPYALS 1200
HL+NPVFNPSPVPIY PA+K SNGI+ EDRTQVSKSGAI ESSVAN D+ +TTG PYALS
Sbjct: 1141 HLANPVFNPSPVPIYQPAAKASNGIFVEDRTQVSKSGAITESSVANPDVVVTTGLPYALS 1200
Query: 1201 SPPSGDLNQNDTSSKLQQDSSSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFSTNNEV 1260
SPPSGD QNDTSSKLQ+D SSSFSLFHFGGPVALSTGGKLN PSKED NNEV
Sbjct: 1201 SPPSGDCKQNDTSSKLQKD-SSSFSLFHFGGPVALSTGGKLNPMPSKED-------NNEV 1260
Query: 1261 EVIDNGHAFNKKETAIEEYNLFAASNGMRFSFF 1281
EV+ NGH FNKKETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 EVVGNGHGFNKKETAIEEYNLFAASNGMRFSFF 1277
BLAST of PI0002685 vs. ExPASy TrEMBL
Match:
A0A6J1EPP9 (uncharacterized protein LOC111435513 OS=Cucurbita moschata OX=3662 GN=LOC111435513 PE=4 SV=1)
HSP 1 Score: 2187.9 bits (5668), Expect = 0.0e+00
Identity = 1140/1285 (88.72%), Postives = 1186/1285 (92.30%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
MPGLTQKN HLN GSSAIYSLSA+GFWSQHRDDVSYNQLQKFW +LLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNYHLNCGSSAIYSLSANGFWSQHRDDVSYNQLQKFWIELLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
TLFEQARKNMYCSRCNGLLLEGFLQIV+YGKSLQQGKT VNH+CNRLGVSKNQA DG+L+
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVLYGKSLQQGKTRVNHACNRLGVSKNQAGDGALT 120
Query: 121 VNGLQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLFSKSFLGLQNVFDSARARERERELLY 180
VNG +DEIQDPSVHPWGGLTTTRDG+LTLLDCYL SKSFL LQNVFDSARARERERELLY
Sbjct: 121 VNGFEDEIQDPSVHPWGGLTTTRDGLLTLLDCYLCSKSFLDLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTL+DFWSALGEETR SLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLIDFWSALGEETRLSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 301 DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSC 360
DDTIQADW QTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVK+NGLDLGGLNSC
Sbjct: 301 DDTIQADWHQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRMFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGE+IR FFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGESIRRFFEHAEEAEE 420
Query: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDS+DKD+N LDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSMDKDANGLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR 540
HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREE+ERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
Query: 541 RKERLKG--KDKDKLSSESAEVCPHSDVLEDLSPCVLEPNSNAVGEVCDSSVPESSDILD 600
RKERLKG KDKDK+SSESAE C HSDVLEDLSPC LEPNS+AVGEVCD+SVPESSD +
Sbjct: 541 RKERLKGKEKDKDKISSESAEACAHSDVLEDLSPCDLEPNSDAVGEVCDASVPESSDTFN 600
Query: 601 ELFLNESIISEGQNSYDDSFDGKLTDGNDGNESFISDQSKVSRWRLKFPKEVQDHPFKWS 660
ELFLNESIISEGQNSYDDSFDGKL DGNESFI DQSKVSRWRLKFPKEVQDH FKWS
Sbjct: 601 ELFLNESIISEGQNSYDDSFDGKL---GDGNESFIGDQSKVSRWRLKFPKEVQDHSFKWS 660
Query: 661 ERRRFMVVSENGALVNKSEQRYHVDSLENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKL 720
ERRR M VSENGALVN+SEQRY+ DS ENPSRSMN SNRKLRTNSLKAYGRHVSKFNEK+
Sbjct: 661 ERRRSM-VSENGALVNRSEQRYYADSSENPSRSMNASNRKLRTNSLKAYGRHVSKFNEKM 720
Query: 721 HSSNNRVSYDYRSCVCNQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNK 780
HSSNN VSYDYRSCVCNQ NEFNKKAEPFVSSVR NRDVKS SKSES FDMSKQSYRSNK
Sbjct: 721 HSSNNWVSYDYRSCVCNQNNEFNKKAEPFVSSVRFNRDVKSASKSESLFDMSKQSYRSNK 780
Query: 781 YSYGDHSRDSGRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDTNVALKSSTF 840
+SYGD+SRDSGRLK KAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSD+NVALKSSTF
Sbjct: 781 FSYGDYSRDSGRLKNKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDSNVALKSSTF 840
Query: 841 KFDAEPDYDVVKSRDDVVKSRDEFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNEI 900
KF EPDY D+VKSR E CSGEVSV SG VDQEESNSTESTS IESDDV QN +
Sbjct: 841 KFGVEPDY-------DLVKSRHECCSGEVSVASGTVDQEESNSTESTSVIESDDVFQNGL 900
Query: 901 SIESKNHKNV-EDVC-EVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSN 960
IE K+HKNV ED C EV CS NS +D +TS GTSNQ GTSSLNSDNCSSC SEGDSN
Sbjct: 901 PIELKDHKNVEEDACEEVTPCSVNSTVDMKMTSCGTSNQAGTSSLNSDNCSSCPSEGDSN 960
Query: 961 TIGSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKGIGGEAMGSRSYS 1020
TI SNHGNLESSSTSDSEYASHQSEGKESSASIQ GFSEHHEIR+DK IGG+AMGS + S
Sbjct: 961 TICSNHGNLESSSTSDSEYASHQSEGKESSASIQYGFSEHHEIRMDKAIGGDAMGSTNCS 1020
Query: 1021 GLPQENEGCKVQVNAPKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMN 1080
GL Q+NEGCKVQ APKNVPQNFEAGFSAV+LDSPC VTLPS+QNQN+HFPVFQVPPSM
Sbjct: 1021 GLSQDNEGCKVQGKAPKNVPQNFEAGFSAVNLDSPCHVTLPSVQNQNVHFPVFQVPPSMG 1080
Query: 1081 YYHQNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNP 1140
YYHQNSVSWPAA HANGIMPFSYSNHC+YANPLGYGLNGNPRFCM+YGHLHHL+NPVFNP
Sbjct: 1081 YYHQNSVSWPAAVHANGIMPFSYSNHCVYANPLGYGLNGNPRFCMRYGHLHHLANPVFNP 1140
Query: 1141 SPVPIYHPASKTSNGIYTEDRTQVSKSGAIAESSVANSDIALTTGHPYALSSPPSGDLNQ 1200
SPVPIY PA+K SNGI+ EDRTQVSKSGAI ESS AN D+ +T+G PYALSSPPSGD Q
Sbjct: 1141 SPVPIYQPAAKASNGIFVEDRTQVSKSGAITESSAANPDVVVTSGLPYALSSPPSGDCKQ 1200
Query: 1201 NDTSSKLQQDSSSSFSLFHFGGPVALST-GGKLNLTPSKEDDVGDFSTNNEVEVIDNGHA 1260
NDTSSKLQ+D SSSFSLFHFGGPVALST GGKLNL PSKED NNEVEV+ NGH
Sbjct: 1201 NDTSSKLQKD-SSSFSLFHFGGPVALSTGGGKLNLMPSKED-------NNEVEVVGNGHG 1260
Query: 1261 FNKKETAIEEYNLFAASNGMRFSFF 1281
FNKKETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 FNKKETAIEEYNLFAASNGMRFSFF 1266
BLAST of PI0002685 vs. ExPASy TrEMBL
Match:
A0A6J1DQ45 (uncharacterized protein LOC111022059 OS=Momordica charantia OX=3673 GN=LOC111022059 PE=4 SV=1)
HSP 1 Score: 2176.4 bits (5638), Expect = 0.0e+00
Identity = 1131/1286 (87.95%), Postives = 1184/1286 (92.07%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
MPGLTQKND LNGGSSAIYSLS +GFWSQ RDDVSYNQLQKFWS+L P RQKLLRIDKQ
Sbjct: 1 MPGLTQKNDQLNGGSSAIYSLSPNGFWSQQRDDVSYNQLQKFWSELPPHTRQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKN CDGSLS
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNHTCDGSLS 120
Query: 121 VNGLQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLFSKSFLGLQNVFDSARARERERELLY 180
VNG QDEIQDPSVHPWGGLTTTRDG+LTLLDCYL SKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFQDEIQDPSVHPWGGLTTTRDGLLTLLDCYLCSKSFLGLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGTA +GRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTAGFGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAF+YEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFHYEVS 300
Query: 301 DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSC 360
DDT+QADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVK+NGLDLGGLNSC
Sbjct: 301 DDTVQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRMFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRL+VGDGFVTITRGENIR FFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLSVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDS+DKD+NDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR 540
HSIFVCLALKLLEER+HIACKEIITLEKQMKLLEEEEKEKREE+ERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERIHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
Query: 541 RKERLKG--KDKDKLSSESAEVCPHSDVLEDLSPCVLEPNSNAVGEVCDSSVPESSDILD 600
RKERLKG KDKDK SESAEVC HSDVLEDLSPCVLEPNS++VG+ CD+S+PESSD+LD
Sbjct: 541 RKERLKGKEKDKDKTCSESAEVCAHSDVLEDLSPCVLEPNSDSVGDACDASMPESSDMLD 600
Query: 601 ELFLNESIISEGQNSYDDSFDGKLTDGNDGNESFISDQSKVSRWRLKFPKEVQDHPFKWS 660
E FL+ESIISE QNSYDDSFDGK TDGNDGNESFI DQSK SRWRLKFPKEVQD FKWS
Sbjct: 601 EQFLDESIISEVQNSYDDSFDGKPTDGNDGNESFIVDQSKFSRWRLKFPKEVQDQSFKWS 660
Query: 661 ERRRFMVVSENGALVNKSEQRYHVDSLENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKL 720
ERRRF VVSENGALVN+SEQRY+ DSLENPSRSMNG+NRKLR+NS+KAYGRH SKFNEKL
Sbjct: 661 ERRRFTVVSENGALVNRSEQRYYGDSLENPSRSMNGTNRKLRSNSIKAYGRHGSKFNEKL 720
Query: 721 HSSNNRVSYDYRSCVCNQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNK 780
HSSNNRVS DYRSC+C+Q NEFNKK E FVSSVRVNRD KSVSKSESSFDMSKQSYRSNK
Sbjct: 721 HSSNNRVSXDYRSCICSQNNEFNKKVEXFVSSVRVNRDAKSVSKSESSFDMSKQSYRSNK 780
Query: 781 YSYGDHSRDSGRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDTNVALKSSTF 840
Y YGD SRDSGRLK KAAL NNSPGKDFVYSKKVWEPMESQKKYPRSNSD NVA+KSSTF
Sbjct: 781 YGYGDQSRDSGRLKNKAALSNNSPGKDFVYSKKVWEPMESQKKYPRSNSDPNVAMKSSTF 840
Query: 841 KFDAEPDYDVVKSRDDVVKSRDEFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNEI 900
KF EPDYD+ KSR DV CSGEVSV SG VDQEESNSTESTSGIESD+V QN +
Sbjct: 841 KFGVEPDYDLAKSRHDV-------CSGEVSVASGKVDQEESNSTESTSGIESDEVFQNGL 900
Query: 901 SIESKNHKNV-EDVC-EVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSN 960
E K+HKNV ED C E QCS NS I++TL SSG +N VGTSSL+SDNCSSCLSEGDSN
Sbjct: 901 PTEPKDHKNVEEDACEEATQCSINSTINSTLRSSGKNNHVGTSSLSSDNCSSCLSEGDSN 960
Query: 961 TIGSNHGNLESSSTSDSEYASHQ-SEGKESSASIQNGFSEHHEIRIDKGIGGEAMGSRSY 1020
I SNHGNLESSSTSDSE ASHQ SEGKESSASIQNGFSE HEIR+DK GGE+MG+R +
Sbjct: 961 XICSNHGNLESSSTSDSEDASHQSSEGKESSASIQNGFSERHEIRMDKVNGGESMGTRIH 1020
Query: 1021 SGLPQENEGCKVQVNAPKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSM 1080
GLPQ+NEGCKV NAP NVP NFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSM
Sbjct: 1021 FGLPQDNEGCKVLGNAPMNVPHNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSM 1080
Query: 1081 NYYHQNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVFN 1140
YYHQNSVSWP AAHANG+MPFSYSNHCLYANPLGYGL+GNPRFCMQYGHLHHL+ PVFN
Sbjct: 1081 GYYHQNSVSWP-AAHANGMMPFSYSNHCLYANPLGYGLDGNPRFCMQYGHLHHLATPVFN 1140
Query: 1141 PSPVPIYHPASKTSNGIYTEDRTQVSKSGAIAESS-VANSDIALTTGHPYALSSPPSGDL 1200
PSPVPIY PA+K SNGIY EDR+QVSK+GAIAESS VAN D+ +T G PYAL SPPSGD
Sbjct: 1141 PSPVPIYQPAAKASNGIYVEDRSQVSKAGAIAESSDVANPDVVVTAGLPYALGSPPSGDC 1200
Query: 1201 NQNDTSSKLQQDSSSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFSTNNEVEVIDNGH 1260
QNDT SKLQ+ SSSFSLFHFGGPVALSTGGKLNL PSKEDD G F N+E +V+DNGH
Sbjct: 1201 KQNDT-SKLQK-GSSSFSLFHFGGPVALSTGGKLNLMPSKEDDTGVFPRNSEADVVDNGH 1260
Query: 1261 AFNKKETAIEEYNLFAASNGMRFSFF 1281
AFNKK+TAIEEYNLFAASNGMRFSFF
Sbjct: 1261 AFNKKDTAIEEYNLFAASNGMRFSFF 1276
BLAST of PI0002685 vs. NCBI nr
Match:
XP_008442254.1 (PREDICTED: uncharacterized protein LOC103486163 [Cucumis melo])
HSP 1 Score: 2421.0 bits (6273), Expect = 0.0e+00
Identity = 1242/1281 (96.96%), Postives = 1254/1281 (97.89%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
Query: 121 VNGLQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLFSKSFLGLQNVFDSARARERERELLY 180
VNG QDEIQDPSVHPWGGLTTTRDGVLTLLDCYL SKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLHSKSFLGLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 301 DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSC 360
DDTIQADW QTFADSVETYHYFEW+VGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSC
Sbjct: 301 DDTIQADWHQTFADSVETYHYFEWSVGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRMFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIR FFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR 540
HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR 540
Query: 541 RKERLKGKDKDKLSSESAEVCPHSDVLEDLSPCVLEPNSNAVGEVCDSSVPESSDILDEL 600
RKERLKGKDKDKLSSESAEVC SDVLEDLSPCVLEP SNAVGEVCD+SVPESSDILDEL
Sbjct: 541 RKERLKGKDKDKLSSESAEVCARSDVLEDLSPCVLEPTSNAVGEVCDTSVPESSDILDEL 600
Query: 601 FLNESIISEGQNSYDDSFDGKLTDGNDGNESFISDQSKVSRWRLKFPKEVQDHPFKWSER 660
FLNESIISEGQNS+DDS DGK TDGNDGNESFISDQSKVSRWRLKFPKEVQDHPFKWSER
Sbjct: 601 FLNESIISEGQNSFDDSLDGKFTDGNDGNESFISDQSKVSRWRLKFPKEVQDHPFKWSER 660
Query: 661 RRFMVVSENGALVNKSEQRYHVDSLENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKLHS 720
RRFMVVSENG LVNKSEQRYH DS ENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKLHS
Sbjct: 661 RRFMVVSENGMLVNKSEQRYHPDSSENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKLHS 720
Query: 721 SNNRVSYDYRSCVCNQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNKYS 780
SNNRVSYDYRSC+CNQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNKYS
Sbjct: 721 SNNRVSYDYRSCICNQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNKYS 780
Query: 781 YGDHSRDSGRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDTNVALKSSTFKF 840
YGDHSRD+GRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSD+NVALKSSTFKF
Sbjct: 781 YGDHSRDNGRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDSNVALKSSTFKF 840
Query: 841 DAEPDYDVVKSRDDVVKSRDEFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNEISI 900
DAEPDYDVVKSRD VVKSRD FCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNE SI
Sbjct: 841 DAEPDYDVVKSRDGVVKSRDGFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNENSI 900
Query: 901 ESKNHKNV-EDVCEVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSNTIG 960
ESK+HKNV EDVCEVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSNTIG
Sbjct: 901 ESKDHKNVEEDVCEVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSNTIG 960
Query: 961 SNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKGIGGEAMGSRSYSGLP 1020
SNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKGIGGEA GSRSYSGLP
Sbjct: 961 SNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKGIGGEARGSRSYSGLP 1020
Query: 1021 QENEGCKVQVNAPKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMNYYH 1080
Q+NEGC VQVNAPKNVP NFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMNYYH
Sbjct: 1021 QDNEGCNVQVNAPKNVPHNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMNYYH 1080
Query: 1081 QNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPSPV 1140
QNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPSPV
Sbjct: 1081 QNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPSPV 1140
Query: 1141 PIYHPASKTSNGIYTEDRTQVSKSGAIAESSVANSDIALTTGHPYALSSPPSGDLNQNDT 1200
PIYHPASK SNGIY EDRTQVSKSGAI+ESSVANSD+A+TTGH YALSSPPSGDL QNDT
Sbjct: 1141 PIYHPASKASNGIYAEDRTQVSKSGAISESSVANSDVAVTTGHQYALSSPPSGDLKQNDT 1200
Query: 1201 SSKLQQDSSSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFSTNNEVEVIDNGHAFNKK 1260
SKLQQD SSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFS NNEVEV+DNGHAFN K
Sbjct: 1201 -SKLQQD-SSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFSRNNEVEVVDNGHAFNMK 1260
Query: 1261 ETAIEEYNLFAASNGMRFSFF 1281
ETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 ETAIEEYNLFAASNGMRFSFF 1279
BLAST of PI0002685 vs. NCBI nr
Match:
XP_011653932.2 (uncharacterized protein LOC101210448 [Cucumis sativus] >KAE8649763.1 hypothetical protein Csa_012708 [Cucumis sativus])
HSP 1 Score: 2409.4 bits (6243), Expect = 0.0e+00
Identity = 1241/1282 (96.80%), Postives = 1252/1282 (97.66%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSL QGKTCVNHSCNRLGVSKNQACDGSLS
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLHQGKTCVNHSCNRLGVSKNQACDGSLS 120
Query: 121 VNGLQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLFSKSFLGLQNVFDSARARERERELLY 180
VNG QDEIQDPSVHPWGGLTTTRDGVLTLLDCYL+SKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLYSKSFLGLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 301 DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSC 360
DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEF+NVGMNGSVKINGLDLGGLNSC
Sbjct: 301 DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFDNVGMNGSVKINGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRMFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIR FFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR 540
HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR 540
Query: 541 RKERLKGKDKDKLSSESAEVCPHSDVLEDLSPCVLEPNSNAVGEVCDSSVPESSDILDEL 600
RKERLKGKDKDKLSSESAEVC SDVLEDLS CVLEPNSNAVGEVCDSSVPESSDILDEL
Sbjct: 541 RKERLKGKDKDKLSSESAEVCARSDVLEDLSSCVLEPNSNAVGEVCDSSVPESSDILDEL 600
Query: 601 FLNESIISEGQNSYDDSFDGKLTDGNDGNESFISDQSKVSRWRLKFPKEVQDHPFKWSER 660
FLNESIISEGQNSYDDSFDGKL DGNESFISDQSKVSRWRLKFPKEVQDHPFKWSER
Sbjct: 601 FLNESIISEGQNSYDDSFDGKLA---DGNESFISDQSKVSRWRLKFPKEVQDHPFKWSER 660
Query: 661 RRFMVVSENGALVNKSEQRYHVDSLENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKLHS 720
RRFMVVSENGALVNKSEQRYH DSLENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKLHS
Sbjct: 661 RRFMVVSENGALVNKSEQRYHADSLENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKLHS 720
Query: 721 SNNRVSYDYRSCVCNQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNKYS 780
SNNR+SYDYRSC+CNQ NEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNKYS
Sbjct: 721 SNNRMSYDYRSCICNQANEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNKYS 780
Query: 781 YGDHSRDSGRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDTNVALKSSTFKF 840
YGDHSRD+GRLKTK ALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDTNVALKSSTFKF
Sbjct: 781 YGDHSRDNGRLKTKPALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDTNVALKSSTFKF 840
Query: 841 DAEPDYDVVKSRDDVVKSRD-EFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNEIS 900
DAEPDYDVVKSRDDVVKSRD EFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNEIS
Sbjct: 841 DAEPDYDVVKSRDDVVKSRDEEFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNEIS 900
Query: 901 IESKNHKNV-EDVCEVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSNTI 960
IE K+HKNV EDVCEVKQ SANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSNTI
Sbjct: 901 IELKDHKNVEEDVCEVKQFSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSNTI 960
Query: 961 GSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKGIGGEAMGSRSYSGL 1020
GSNHGNLESSSTSDSEYASHQSEGKES ASIQNGFSEHHEIRIDKGIGGEAMGSRSYSG
Sbjct: 961 GSNHGNLESSSTSDSEYASHQSEGKESLASIQNGFSEHHEIRIDKGIGGEAMGSRSYSGF 1020
Query: 1021 PQENEGCKVQVNAPKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMNYY 1080
PQ+NEGCKVQVNAPKNVPQNFEAGFSAVSLDSPCQVTLP IQNQNIHFPVFQVPPSMNYY
Sbjct: 1021 PQDNEGCKVQVNAPKNVPQNFEAGFSAVSLDSPCQVTLP-IQNQNIHFPVFQVPPSMNYY 1080
Query: 1081 HQNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPSP 1140
HQNSVSWPA AHANGIMPFSYSNHC YANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPSP
Sbjct: 1081 HQNSVSWPAPAHANGIMPFSYSNHCPYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNPSP 1140
Query: 1141 VPIYHPASKTSNGIYTEDRTQVSKSGAIAESSVANSDIALTTGHPYALSSPPSGDLNQND 1200
VP+YHPASKTSN IY EDRTQVSKSGAIAESSV NSD+A+TTGHPY LSSPPSGDL QND
Sbjct: 1141 VPLYHPASKTSNCIYAEDRTQVSKSGAIAESSVVNSDVAVTTGHPYVLSSPPSGDLKQND 1200
Query: 1201 TSSKLQQDSSSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFSTNNEVEVIDNGHAFNK 1260
TSSKLQQD SSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFS NNEVEV+DNGHAFN
Sbjct: 1201 TSSKLQQD-SSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFSRNNEVEVVDNGHAFNM 1260
Query: 1261 KETAIEEYNLFAASNGMRFSFF 1281
KETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 KETAIEEYNLFAASNGMRFSFF 1277
BLAST of PI0002685 vs. NCBI nr
Match:
XP_038881990.1 (uncharacterized protein LOC120073308 [Benincasa hispida])
HSP 1 Score: 2273.0 bits (5889), Expect = 0.0e+00
Identity = 1178/1286 (91.60%), Postives = 1216/1286 (94.56%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
MPGLTQKNDHLNGGSSAIYSLSA+GFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAIYSLSANGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
Query: 121 VNGLQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLFSKSFLGLQNVFDSARARERERELLY 180
VNG QDEIQDPSVHPWGGLTTTRDGVLTLLDCYL S SFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLCSHSFLGLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGT SYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTTSYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 301 DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSC 360
DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGGL+SC
Sbjct: 301 DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGGLSSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRMFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIR FFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDSIDKD+NDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSIDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR 540
HSIFVCL+LKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLSLKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR 540
Query: 541 RKERLKGKDKDKLSSESAEVCPHSDVLEDLSPCVLEPNSNAVGEVCDSSVPESSDILDEL 600
RKERLKGKDKDKLSSESAE C SDVLEDLSPCVLEP SN V EVCD+SVPESSDILDE+
Sbjct: 541 RKERLKGKDKDKLSSESAEACAPSDVLEDLSPCVLEPESNTVSEVCDASVPESSDILDEM 600
Query: 601 FLNESIISEGQNSYDDSFDGKLTDGNDGNESFISDQSKVSRWRLKFPKEVQDHPFKWSER 660
FLNESIISEGQNSYDDSFDGKLT+GNDGNESF SDQ KVSRWRLKFPKEVQDHPFKWSER
Sbjct: 601 FLNESIISEGQNSYDDSFDGKLTEGNDGNESFTSDQFKVSRWRLKFPKEVQDHPFKWSER 660
Query: 661 RRFMVVSENGALVNKSEQRYHVDSLENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKLHS 720
RRFMVVSENG ++NKSEQR +VDSLENPSRS+NGSNRKLRTNSLKAYGRHVSKFNEKLHS
Sbjct: 661 RRFMVVSENG-VINKSEQRCYVDSLENPSRSINGSNRKLRTNSLKAYGRHVSKFNEKLHS 720
Query: 721 SNNRVSYDYRSCVCNQTNEF-NKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNKY 780
SNNRVSYDYRSC+CNQTNEF NKKAEPFVSSVRVNRDVKS+SKSESSFDMSKQSYRSNKY
Sbjct: 721 SNNRVSYDYRSCICNQTNEFNNKKAEPFVSSVRVNRDVKSLSKSESSFDMSKQSYRSNKY 780
Query: 781 SYGDHSRDSGRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDTNVALKSSTFK 840
SYGDHSRD+GRL+ KAA+LNNSP KDFVYSKKVWEPMESQKKYPRSNSD+NVA KSSTFK
Sbjct: 781 SYGDHSRDNGRLRNKAAILNNSPCKDFVYSKKVWEPMESQKKYPRSNSDSNVASKSSTFK 840
Query: 841 FDAEPDYDVVKSRDDVVKSRDEFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNEIS 900
AEPDY DV+KSR E SGEVSVTSG VDQEE NSTESTSGIESD+V QN +S
Sbjct: 841 CGAEPDY-------DVMKSRHELSSGEVSVTSGTVDQEEINSTESTSGIESDEVFQNGLS 900
Query: 901 IESKNHKNVE-DVC--EVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSN 960
ESK+HKN+E D C EV QCSANS +D TL SSGT NQVGTSSLNSDNCSSC SEGD N
Sbjct: 901 NESKDHKNIEDDACEEEVTQCSANSTMDMTLASSGTCNQVGTSSLNSDNCSSCPSEGDCN 960
Query: 961 TIGSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKGIGGEAMGSRSYS 1020
TI SNHGN+ESSSTSDSEYASHQSEGKESSASIQNGFSE+HEIRIDK IGG+AMGS+++S
Sbjct: 961 TICSNHGNVESSSTSDSEYASHQSEGKESSASIQNGFSENHEIRIDKVIGGDAMGSKNHS 1020
Query: 1021 GLPQENEGCKVQVNAPKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMN 1080
GL Q+NEGCKVQ NAPKNVPQNFEAGFSAVSLDSPCQ+TLPSIQNQNIHFPVFQVPPSM+
Sbjct: 1021 GLSQDNEGCKVQGNAPKNVPQNFEAGFSAVSLDSPCQMTLPSIQNQNIHFPVFQVPPSMS 1080
Query: 1081 YYHQNSVSWPAA--AHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVF 1140
YYHQNSVSWPA AHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLH LSNPVF
Sbjct: 1081 YYHQNSVSWPATAHAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHQLSNPVF 1140
Query: 1141 NPSPVPIYHPASKTSNGIYTEDRTQVSKSGAIAESSVANSDIALTTGHPYALSSPPSGDL 1200
NP+PVPIYHPASK SNG Y EDR QV+KSGA+AESSVANSD+ T GHPYALSSPP
Sbjct: 1141 NPTPVPIYHPASKASNGTYAEDRNQVTKSGAMAESSVANSDVTGTAGHPYALSSPPGS-- 1200
Query: 1201 NQNDTSSKLQQDSSSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFSTNNEVEVIDNGH 1260
QNDTSSKLQ+D SSSFSLFHFGGPVA STGGKLNLTPSKEDDVGDFS NNEVEV+DNGH
Sbjct: 1201 KQNDTSSKLQKD-SSSFSLFHFGGPVAFSTGGKLNLTPSKEDDVGDFSRNNEVEVVDNGH 1260
Query: 1261 AFNKKETAIEEYNLFAASNGMRFSFF 1281
AFNKKETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 AFNKKETAIEEYNLFAASNGMRFSFF 1275
BLAST of PI0002685 vs. NCBI nr
Match:
KAG7033556.1 (hypothetical protein SDJN02_03278 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 2196.8 bits (5691), Expect = 0.0e+00
Identity = 1142/1285 (88.87%), Postives = 1188/1285 (92.45%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
MPGLTQKN HLN GSS IYSLSA+GFWSQHRDDVSYNQLQKFW +LLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNYHLNCGSSTIYSLSANGFWSQHRDDVSYNQLQKFWIELLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
TLFEQARKNMYCSRCNGLLLEGFLQIV+YGKSLQQGKT VNH+CNRLGVSKNQA DG+L+
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVLYGKSLQQGKTRVNHACNRLGVSKNQAGDGALT 120
Query: 121 VNGLQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLFSKSFLGLQNVFDSARARERERELLY 180
VNG +DEIQDPSVHPWGGLTTTRDG+LTLLDCYL SKSFL LQNVFDSARARERERELLY
Sbjct: 121 VNGFEDEIQDPSVHPWGGLTTTRDGLLTLLDCYLCSKSFLDLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETR SLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRLSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 301 DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSC 360
DDTIQADW QTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVK+NGLDLGGLNSC
Sbjct: 301 DDTIQADWHQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRMFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGE+IR FFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGESIRRFFEHAEEAEE 420
Query: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDS+DKD+N LDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSMDKDANGLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR 540
HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREE+ERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
Query: 541 RKERLKG--KDKDKLSSESAEVCPHSDVLEDLSPCVLEPNSNAVGEVCDSSVPESSDILD 600
RKERLKG KDKDK+SSESAE C HSDVLEDLSPC LEPNS+AVGEVCD+SVPESSD +
Sbjct: 541 RKERLKGKEKDKDKISSESAEACAHSDVLEDLSPCDLEPNSDAVGEVCDASVPESSDTFN 600
Query: 601 ELFLNESIISEGQNSYDDSFDGKLTDGNDGNESFISDQSKVSRWRLKFPKEVQDHPFKWS 660
ELFLNESIISEGQNSYDDSFDGKL DGNDGNESFI DQSKVSRWRLKFPKEVQDH FKWS
Sbjct: 601 ELFLNESIISEGQNSYDDSFDGKLGDGNDGNESFIGDQSKVSRWRLKFPKEVQDHSFKWS 660
Query: 661 ERRRFMVVSENGALVNKSEQRYHVDSLENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKL 720
ERRR M VSENGALVN+SEQRY+ DS ENPSRSMN SNRKLRTNSLKAYGRHVSKFNEK+
Sbjct: 661 ERRRSM-VSENGALVNRSEQRYYADSSENPSRSMNASNRKLRTNSLKAYGRHVSKFNEKM 720
Query: 721 HSSNNRVSYDYRSCVCNQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNK 780
HSSNN VSYDYRSCVCNQ NEFNKKAEPFVSSVR NRDVKS SKSES FDMSKQSYRSNK
Sbjct: 721 HSSNNWVSYDYRSCVCNQNNEFNKKAEPFVSSVRFNRDVKSASKSESLFDMSKQSYRSNK 780
Query: 781 YSYGDHSRDSGRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDTNVALKSSTF 840
+SYGD+SRDSGRLK KAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSD+NVALKSSTF
Sbjct: 781 FSYGDYSRDSGRLKNKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDSNVALKSSTF 840
Query: 841 KFDAEPDYDVVKSRDDVVKSRDEFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNEI 900
KF EPDY D+VKSR E CSGEVSV SG VDQEESNSTESTS IESDDV QN +
Sbjct: 841 KFGVEPDY-------DLVKSRHECCSGEVSVASGTVDQEESNSTESTSVIESDDVFQNGL 900
Query: 901 SIESKNHKNV-EDVC-EVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSN 960
IE K+HKNV ED C E+ CS NS +D +TS GTSNQ GTSSLNSDNCSSC SEGDSN
Sbjct: 901 PIELKDHKNVEEDACEELTPCSVNSTVDMKMTSCGTSNQAGTSSLNSDNCSSCPSEGDSN 960
Query: 961 TIGSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKGIGGEAMGSRSYS 1020
TI SNHGNLESSSTSDSEYASHQSEGKESSASIQ GFSEHHEIR+DK IGG+AMGS + S
Sbjct: 961 TICSNHGNLESSSTSDSEYASHQSEGKESSASIQYGFSEHHEIRMDKAIGGDAMGSTNCS 1020
Query: 1021 GLPQENEGCKVQVNAPKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMN 1080
GL Q+NEGCKVQ APKNVPQNFEAGFSAV+LDSPC VTLPS+QNQN+HFPVFQVPPSM
Sbjct: 1021 GLSQDNEGCKVQGKAPKNVPQNFEAGFSAVNLDSPCHVTLPSVQNQNVHFPVFQVPPSMG 1080
Query: 1081 YYHQNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNPVFNP 1140
YYHQNSVSWPAA HANGIMPFSYSNHC+YANPLGYGLNGNPRFCM+YGHLHHL+NPVFNP
Sbjct: 1081 YYHQNSVSWPAAVHANGIMPFSYSNHCVYANPLGYGLNGNPRFCMRYGHLHHLANPVFNP 1140
Query: 1141 SPVPIYHPASKTSNGIYTEDRTQVSKSGAIAESSVANSDIALTTGHPYALSSPPSGDLNQ 1200
SPVPIY PA+K SNGI+ EDRTQVSKSGAI ESS AN D+ +T+G PYALSSPPSGD Q
Sbjct: 1141 SPVPIYQPAAKASNGIFVEDRTQVSKSGAITESSAANPDVVVTSGLPYALSSPPSGDCKQ 1200
Query: 1201 NDTSSKLQQDSSSSFSLFHFGGPVALST-GGKLNLTPSKEDDVGDFSTNNEVEVIDNGHA 1260
NDTSSKLQ+D SSSFSLFHFGGPVALST GGKLNL PSKED NNEVEV+ NGH
Sbjct: 1201 NDTSSKLQKD-SSSFSLFHFGGPVALSTGGGKLNLMPSKED-------NNEVEVVGNGHG 1260
Query: 1261 FNKKETAIEEYNLFAASNGMRFSFF 1281
FNKKETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 FNKKETAIEEYNLFAASNGMRFSFF 1269
BLAST of PI0002685 vs. NCBI nr
Match:
XP_022967698.1 (uncharacterized protein LOC111467149 [Cucurbita maxima])
HSP 1 Score: 2193.3 bits (5682), Expect = 0.0e+00
Identity = 1141/1293 (88.24%), Postives = 1189/1293 (91.96%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
MPGLTQKN HLN GSSAIYSLSA+GFWSQHRDDVSYNQLQKFW +LLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNYHLNCGSSAIYSLSANGFWSQHRDDVSYNQLQKFWIELLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
TLFEQARKNMYCSRCNGLLLEGFLQIV+YGKSLQQGKT VNH+CNRLGVSKNQA DG+L+
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVLYGKSLQQGKTRVNHACNRLGVSKNQAGDGALT 120
Query: 121 VNGLQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLFSKSFLGLQNVFDSARARERERELLY 180
VNG +DEIQDPSVHPWGGLTTTRDG+LTLLDCYL SKSFL LQNVFDSARARERERELLY
Sbjct: 121 VNGFEDEIQDPSVHPWGGLTTTRDGLLTLLDCYLCSKSFLDLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETR SLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRLSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 301 DDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSC 360
DDTIQADW TFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVK+NGLDLGGLNSC
Sbjct: 301 DDTIQADWHHTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRMFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGE+IR FFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGESIRRFFEHAEEAEE 420
Query: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDS+DKD+N LDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSMDKDANGLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR 540
HSIFVCLALKLLEERVHIACKEIITLEKQ+KLLEEEEKEKREE+ERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERVHIACKEIITLEKQVKLLEEEEKEKREEKERKERKRTKEREKKLR 540
Query: 541 RKERLKG--KDKDKLSSESAEVCPHSDVLEDLSPCVLEPNSNAVGEVCDSSVPESSDILD 600
RKERLKG KDKDK+SSESAE C HSDVLEDLSPC LEPNS+AVGEVCD+SVPESSD +
Sbjct: 541 RKERLKGKEKDKDKISSESAEACAHSDVLEDLSPCDLEPNSDAVGEVCDASVPESSDTFN 600
Query: 601 ELFLNESIISEGQNSYDDSFDGKL---------TDGNDGNESFISDQSKVSRWRLKFPKE 660
ELFLN+SIISEGQNSYDDSFDGKL DGNDGNESFI DQSKVSRWRLKFPKE
Sbjct: 601 ELFLNQSIISEGQNSYDDSFDGKLGDGNDGNDGNDGNDGNESFIGDQSKVSRWRLKFPKE 660
Query: 661 VQDHPFKWSERRRFMVVSENGALVNKSEQRYHVDSLENPSRSMNGSNRKLRTNSLKAYGR 720
VQDH FKWSERRR M VSENGAL N+SEQRY+ DSLENPSRSMN SNRKLRTNSLKAYGR
Sbjct: 661 VQDHSFKWSERRRSM-VSENGALANRSEQRYYADSLENPSRSMNASNRKLRTNSLKAYGR 720
Query: 721 HVSKFNEKLHSSNNRVSYDYRSCVCNQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDM 780
HVSKFNEK+HSSNN VSYDYRSCVCNQ NEFNKKAEPFVSSVRVNRD KS SKSES FDM
Sbjct: 721 HVSKFNEKMHSSNNWVSYDYRSCVCNQNNEFNKKAEPFVSSVRVNRDAKSASKSESLFDM 780
Query: 781 SKQSYRSNKYSYGDHSRDSGRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDT 840
SKQSYR NK+SYGD+SRDSGRLK KAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSD+
Sbjct: 781 SKQSYRPNKFSYGDYSRDSGRLKNKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDS 840
Query: 841 NVALKSSTFKFDAEPDYDVVKSRDDVVKSRDEFCSGEVSVTSGAVDQEESNSTESTSGIE 900
NVALKSSTFKF EPDY ++VKSR E CSGEVSV SG VDQEESNSTESTS IE
Sbjct: 841 NVALKSSTFKFGVEPDY-------ELVKSRHECCSGEVSVASGTVDQEESNSTESTSVIE 900
Query: 901 SDDVSQNEISIESKNHKNVE-DVC-EVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCS 960
SD+V QN + IESK+HKNVE D C EV CS N +D +TSSGTSNQ GTSSLNSDNCS
Sbjct: 901 SDEVFQNGLPIESKDHKNVEDDACEEVTPCSVNLTVDMKMTSSGTSNQAGTSSLNSDNCS 960
Query: 961 SCLSEGDSNTIGSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKGIGG 1020
SC SEGDSNTI SNHGNLESSSTSDSEYASHQSEGKESSASIQ GFSEHHEIR+DK IGG
Sbjct: 961 SCPSEGDSNTICSNHGNLESSSTSDSEYASHQSEGKESSASIQYGFSEHHEIRMDKAIGG 1020
Query: 1021 EAMGSRSYSGLPQENEGCKVQVNAPKNVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFP 1080
+A+GS + SGL Q+NEGCKVQ NAPKNVPQNFEAGFSAV+LDSPC VTLPS+QNQN+HFP
Sbjct: 1021 DALGSTNSSGLSQDNEGCKVQGNAPKNVPQNFEAGFSAVNLDSPCHVTLPSVQNQNVHFP 1080
Query: 1081 VFQVPPSMNYYHQNSVSWPAAAHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLH 1140
VFQVPPSM YYHQNSVSWPAA HANGIMPFSYSNHCLYANPLGYGLNGNPRFCM+YGHLH
Sbjct: 1081 VFQVPPSMGYYHQNSVSWPAAVHANGIMPFSYSNHCLYANPLGYGLNGNPRFCMRYGHLH 1140
Query: 1141 HLSNPVFNPSPVPIYHPASKTSNGIYTEDRTQVSKSGAIAESSVANSDIALTTGHPYALS 1200
HL+NPVFNPSPVPIY PA+K SNGI+ EDRTQVSKSGAI ESSVAN D+ +TTG PYALS
Sbjct: 1141 HLANPVFNPSPVPIYQPAAKASNGIFVEDRTQVSKSGAITESSVANPDVVVTTGLPYALS 1200
Query: 1201 SPPSGDLNQNDTSSKLQQDSSSSFSLFHFGGPVALSTGGKLNLTPSKEDDVGDFSTNNEV 1260
SPPSGD QNDTSSKLQ+D SSSFSLFHFGGPVALSTGGKLN PSKED NNEV
Sbjct: 1201 SPPSGDCKQNDTSSKLQKD-SSSFSLFHFGGPVALSTGGKLNPMPSKED-------NNEV 1260
Query: 1261 EVIDNGHAFNKKETAIEEYNLFAASNGMRFSFF 1281
EV+ NGH FNKKETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 EVVGNGHGFNKKETAIEEYNLFAASNGMRFSFF 1277
BLAST of PI0002685 vs. TAIR 10
Match:
AT3G58050.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G41960.1); Has 13384 Blast hits to 8116 proteins in 546 species: Archae - 41; Bacteria - 766; Metazoa - 5596; Fungi - 1431; Plants - 589; Viruses - 46; Other Eukaryotes - 4915 (source: NCBI BLink). )
HSP 1 Score: 1131.3 bits (2925), Expect = 0.0e+00
Identity = 693/1329 (52.14%), Postives = 852/1329 (64.11%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
MPGL Q+N+ YS GFWS+ D VSYNQLQKFWS+L P+ARQ+LL+IDKQ
Sbjct: 1 MPGLAQRNNDQ-------YSF---GFWSKEIDGVSYNQLQKFWSELSPKARQELLKIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
TLFEQARKNMYCSRCNGLLLEGFLQIV++GKSL + N CN+ G SK Q ++
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMHGKSLHPEGSLGNSPCNKSGGSKYQYDCNAVV 120
Query: 121 VNGLQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLFSKSFLGLQNVFDSARARERERELLY 180
NG DE+QDPSVHPWGGLTTTRDG LTLLDCYL++KS GLQNVFDSA ARERERELLY
Sbjct: 121 SNGCADEMQDPSVHPWGGLTTTRDGSLTLLDCYLYAKSLKGLQNVFDSAPARERERELLY 180
Query: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQG AS+GRGHGTRETCALHTARLSCDTLVDFWSAL E+TRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGIASFGRGHGTRETCALHTARLSCDTLVDFWSALSEDTRQSLLRMK 240
Query: 241 EEDFIERLMYR-----------------------------FDSKRFCRDCRRNVIREFKE 300
EEDF+ERL YR FDSKRFCRDCRRNVIREFKE
Sbjct: 241 EEDFMERLRYRICYHSSYHILNCKMNRHFVVWTIQDVLTKFDSKRFCRDCRRNVIREFKE 300
Query: 301 LKELKRIRREPCCTSWFCVADMAFNYEVSDDTIQADWRQTFADSVETYHYFEWAVGTGEG 360
LKELKR+RREP CT+WFCVA+ F YEVS D+++ADWR+TF+++ YH+FEWA+G+GEG
Sbjct: 301 LKELKRMRREPRCTTWFCVANTTFQYEVSIDSVKADWRETFSENAGKYHHFEWAIGSGEG 360
Query: 361 KSDILEFENVGMNGSVKINGLDLGGLNSCFITLRAWKLDGRCTELSVKAHALKGQQCVHR 420
K DIL+FENVGMNG V++NGL+L GLNSC+ITLRA+KLDGR +E+S KAHALKGQ CVH
Sbjct: 361 KCDILKFENVGMNGRVQVNGLNLRGLNSCYITLRAYKLDGRWSEVSAKAHALKGQNCVHG 420
Query: 421 RLTVGDGFVTITRGENIRMFFEHAEEAEEEEEDDSIDKDSNDLDGDCSRPQKHAKSPELA 480
RL VGDGFV+I RGE+IR FFEHAEEAEEEE++D +DKD N+LDG+CSRPQKHAKSPELA
Sbjct: 421 RLVVGDGFVSIKRGESIRRFFEHAEEAEEEEDEDMMDKDGNELDGECSRPQKHAKSPELA 480
Query: 481 REFLLDAATVIFKEQVEKAFREGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMK 540
REFLLDAATVIFKEQVEKAFREGTARQNAHSIFVCL LKLLE+ +H+ACKEIITLEKQ+K
Sbjct: 481 REFLLDAATVIFKEQVEKAFREGTARQNAHSIFVCLTLKLLEQHLHVACKEIITLEKQVK 540
Query: 541 LLEEEEKEKREEQERKERKRTKEREKKLRRKERLKGKDKDKLSSESAEVCPHSDVL---- 600
LLEEEEKEKREE+ERKE+KR+KEREKKLR+KERLK KDK K + C D+L
Sbjct: 541 LLEEEEKEKREEEERKEKKRSKEREKKLRKKERLKEKDKGK--EKKNPECSDKDMLLNSS 600
Query: 601 ---EDLSPCVLEPNSNAVGE-------VCDSSVPESSDILDELFLNESIISEGQNSYDDS 660
EDL E N+ E D S P S D+ + L+ +N Y D
Sbjct: 601 REEEDLPNLYDETNNTINSEESEIETGYADLSPPGSPDVQERQCLDGCPSPRAENHYCDR 660
Query: 661 FDGKLTDGNDGNESFISDQSKVSRWRLKFPKEVQ-DHPFKWSERRRFMVVSENGALVNKS 720
D + D D N F +D K ++ KEVQ D+ +WS++RR+ S+N + V++S
Sbjct: 661 PDRDIKDLEDENVYFTNDHQKPVHQNARYWKEVQSDNALRWSDKRRY---SDNASFVSRS 720
Query: 721 EQRYHVDSLENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKLHSSNNRVS--YDYRSCVC 780
E RY D LE PSR NGSNR+LR N+ K G + K +EK +NR+S +D+ SC C
Sbjct: 721 EARYRNDRLEVPSRGFNGSNRQLRVNASKTGGLNGIKSHEKFQCCDNRISERFDFSSCSC 780
Query: 781 NQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNKYSYGDHSRDSGRLKTK 840
+ E+ K EP + R R+ K++S S+S+ D SK ++ N+Y+ D++R+ RLK+K
Sbjct: 781 KPSCEYRAKVEPKTAGSRSTREPKTISNSDSALDASKPVFQGNRYTQPDYTREL-RLKSK 840
Query: 841 AAL-LNNSPGKDFVYSKKVWEPMESQKKYPRSNSDTNVALKSSTFKFDAEPDYDVVKSRD 900
+ N S +D ++SK+VWEPME KKYPRSNS + V ++ STFK + D V ++
Sbjct: 841 VGVGPNPSTTRDSLHSKQVWEPME-PKKYPRSNSYSEVTVRCSTFKAEEIEDAIVAENSS 900
Query: 901 DVVKSRDEFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNEISIESKNHKNVEDVCE 960
D++ + TE I+ D + E S E+KN +++D
Sbjct: 901 DLL--------------------SQCKVTEKLDNIKLKDENSME-SGETKNGWHLKD--- 960
Query: 961 VKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSNTIGSNHGNLESSSTSDS 1020
+ +S+ +SDNCSSCLSEG+SNT+ SN+GN ESSSTSDS
Sbjct: 961 ---------------------PMMSSTSSSDNCSSCLSEGESNTVSSNNGNTESSSTSDS 1020
Query: 1021 EYASHQSEGKESSASIQNGFSEHHEIRIDKGIGGEAMGSRSYSGLPQENEGCKVQVNAPK 1080
E AS QSEG+ES ++I I G S +P E N
Sbjct: 1021 EDASQQSEGRESIV-----VGTQNDILIPDTTG--------KSKIP-ETPIVVTGNNMDN 1080
Query: 1081 NVPQNFEAGFSAVSLDSPCQVTLPSIQNQNIHFPVFQVPPSMNYYHQ-NSVSWPAAAHAN 1140
N N G V P P + QN+ +PVFQ M Y+HQ VSWP AN
Sbjct: 1081 NSNNNMVHGLVDV---QPQGGMFPHLLTQNLQYPVFQTASPMGYFHQAPPVSWPTGP-AN 1140
Query: 1141 GIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGH-LHHLSNPVFNPSPVPIYHPASKTSNG 1200
G++PF + N LY PLGY +NG+P C+QYG L+H + P FNP PVP++HP SKT+
Sbjct: 1141 GLIPFPHPNPYLYTGPLGYSMNGDPPLCLQYGSPLNHAATPFFNPGPVPVFHPFSKTN-- 1200
Query: 1201 IYTEDRTQVSKSGAIAESSVANSDIALTTGHPYALSSPPSGDLNQNDTSSKLQQDSSSSF 1260
TED+ Q N + L L+ P + +N++ SF
Sbjct: 1201 --TEDQAQ-------------NLEPPLELN---CLAPPETQTVNED------------SF 1209
Query: 1261 SLFHFGGPVALSTGGKLNLTPSKEDDVGDFSTNNEVEVIDNGHAFNKKETAIEEYNLFAA 1281
SLFHF GPV LSTG K SK+ + D V+ N + K+ +EEYNLFA
Sbjct: 1261 SLFHFSGPVGLSTGSKSKPAHSKDGILRD--------VVGNIYTKAKESKEVEEYNLFAT 1209
BLAST of PI0002685 vs. TAIR 10
Match:
AT2G41960.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G58050.1); Has 11991 Blast hits to 7260 proteins in 458 species: Archae - 17; Bacteria - 481; Metazoa - 5028; Fungi - 1325; Plants - 615; Viruses - 38; Other Eukaryotes - 4487 (source: NCBI BLink). )
HSP 1 Score: 955.3 bits (2468), Expect = 5.0e-278
Identity = 633/1315 (48.14%), Postives = 804/1315 (61.14%), Query Frame = 0
Query: 1 MPGL-TQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDK 60
MPGL T N+H S+ GFWS+ D ++Y+QL +FWS+L +AR +LLRIDK
Sbjct: 9 MPGLTTHMNEH----------YSSSGFWSEDDDGLTYDQLDQFWSELSSKARHELLRIDK 68
Query: 61 QTLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSL 120
QTLFEQARKNM CSRC GLLLEGF QI+ G++ + + +G SK+ S
Sbjct: 69 QTLFEQARKNMCCSRCLGLLLEGFAQILSAGRAAYEKR--------MMGPSKDNC--KSN 128
Query: 121 SVNGLQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLFSKSFLGLQNVFDSARARERERELL 180
Q P VH WGGLTTTR G +TLLDC+L +K+F GLQNVF+S RARERERELL
Sbjct: 129 GTRKCTVAYQSPPVHRWGGLTTTRSGCITLLDCFLTAKTFKGLQNVFESNRARERERELL 188
Query: 181 YPDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRM 240
YPDACGGGGR W+SQG A +G+GHGTRETC LHT RLSCDTLVDFWSAL E +RQSLLRM
Sbjct: 189 YPDACGGGGRVWLSQGIAGFGKGHGTRETCNLHTTRLSCDTLVDFWSALEEHSRQSLLRM 248
Query: 241 KEEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEV 300
KEEDF+ERL YRFD K+FCRDCRRNVIREFKELKELKRI+R+P CT WFCVAD AF YEV
Sbjct: 249 KEEDFVERLTYRFDCKKFCRDCRRNVIREFKELKELKRIQRDPRCTDWFCVADTAFQYEV 308
Query: 301 SDDTIQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKINGLDLGGLNS 360
D+++ADW Q F ++ YH+FEWA+GTGEG+SDILEF+ VG + S ++NGLDL GL+
Sbjct: 309 DIDSVRADWSQYFTENA-GYHHFEWAIGTGEGESDILEFKYVGNDRSARVNGLDLRGLHE 368
Query: 361 CFITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRMFFEHAEEAE 420
C+ITLRA+K +GR +E+SVKAHAL+GQQCVH RL VGDGFV+I RGE IRMFFEHAEEAE
Sbjct: 369 CYITLRAFKKNGRPSEISVKAHALRGQQCVHSRLVVGDGFVSIKRGECIRMFFEHAEEAE 428
Query: 421 EEEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQN 480
EEE++ IDKD N+LDG+C RPQKHAKSPELAREFLLDAATVIFKEQVEKAFR+GTARQN
Sbjct: 429 EEEDEVLIDKDGNELDGECLRPQKHAKSPELAREFLLDAATVIFKEQVEKAFRDGTARQN 488
Query: 481 AHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKL 540
AHSIFVCL+ +LLE+RVHIACKEI+TLEKQ KLLEEEEKEKREE+ERKERKR KEREKKL
Sbjct: 489 AHSIFVCLSSELLEQRVHIACKEIVTLEKQNKLLEEEEKEKREEEERKERKRIKEREKKL 548
Query: 541 RRKERLKGKDKDKLSS----ESAEVCPHSDVLEDLSPCVLEPNSNAV---------GEVC 600
RRKERLK K+++K + P E+ S + E +N + G+V
Sbjct: 549 RRKERLKEKEREKEQKNPKFSDKAILPIMSREEEGSRNLDEDTNNTIRCEESGIENGDV- 608
Query: 601 DSSVPESSDILDELFLNESIISEGQNSYDDSFDGKLTDGNDGNESFISDQSKVSRWRLKF 660
D S P S D DE L+ I + DS D ++ D D N F + + +
Sbjct: 609 DLSSPGSPDDQDEECLDGCISPRVETHSCDSTDKEIIDHEDENGCF---TPRPAHKTARL 668
Query: 661 PKEVQ-DHPFKWSERRRFMVVSENGALVNKSEQRYHVDSLENPSRSMNGSNRKLRTNSLK 720
KEVQ DH + SE+RRF +E + V+ SE Y D LE S NGS++ +R + K
Sbjct: 669 WKEVQTDHSLRLSEKRRF---TEKTSFVSSSEAGYCNDRLEMSSGHFNGSDKNVRVKASK 728
Query: 721 AYGR-HVSKFNEKLHSSNNRVS--YDYRSCVCNQTNEFNKKAEPFVSSVRVNRDVKSVSK 780
A G + S+ +E+ S+ R YDY SC C N + +K E S+ R R+ KSV K
Sbjct: 729 AGGSPNSSRSHEEFQCSDGRTGERYDYHSCSCKPINGYREKVESNTSATRGMREPKSVFK 788
Query: 781 SESSFDMSKQSYRSNKYSYGDHSRDSGRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKY 840
S+S D+SK + R+N+Y+ + R+ +++K N+ D V +KV + +E K+
Sbjct: 789 SDSDLDVSKLN-RANRYTQSGYRRE---IRSKMNNSRNACKMDPVNVRKVLDSVE--PKH 848
Query: 841 PRSNSDTNVALKSSTFKFDAEPDYDVVKSRDDVVK--SRDEFCSGEVSVTSGAVDQEESN 900
R NS T+ L +T+K + +K VK C + +G+ +N
Sbjct: 849 SR-NSSTSDVLSLTTYKAEE------IKDVSPTVKPAGTPSLCKATDKLGNGSF----NN 908
Query: 901 STESTSGIESDDVSQNEISIESKNHKNVEDVCEVKQCSANSAIDTTLTSSGTSNQVGTSS 960
STE +E +N+ + SK D ++ S +
Sbjct: 909 STEVDKKMEVHITLKNDY-LYSK--------------------DPMMSRSSS-------- 968
Query: 961 LNSDNCSSCLSEGDSNTIGSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIR 1020
SN+GN+ESSS SDSE AS QSEG+E+ QN + HE
Sbjct: 969 -------------------SNNGNIESSSMSDSEVASQQSEGRENLVDTQNDMPDCHEKM 1028
Query: 1021 IDKGI-----GGEAMGSRSYSGLPQENEGCKVQVNAPKNVP----QNFEAGFSAVS-LDS 1080
++K + + ++ S LP +N K+ P VP +N G + S L
Sbjct: 1029 VEKVTEMSMDERDVLKIKNISNLPADNGESKLS-GTPFMVPSQNMENMVPGLNTGSYLSQ 1088
Query: 1081 PCQVTLPSIQNQNIHFPVFQVPPSMNYYHQNSVSWPAAAHANGIMPFSYSNHCLYANPLG 1140
P + LP + NQ+I PVFQ P +M YYHQ VSW ++A NG+M F + NH +Y PLG
Sbjct: 1089 PQNMILPQMLNQSIPLPVFQAPSTMGYYHQAPVSW-SSASTNGLMQFPHPNHYVYTGPLG 1148
Query: 1141 YGLNGNPRFCMQYG-HLHHLSNPVFNPSPVPIYHPASKTSNGIYTEDRTQVSKSGAIAES 1200
Y LNG CMQYG L+H + P FN PVPI+HP ++T N + T D+ Q + E
Sbjct: 1149 YSLNGESPLCMQYGTPLNHSAAPFFNSGPVPIFHPFAET-NTMNTVDQAQPLEP---LEH 1208
Query: 1201 SVANSDIALTTGHPYALSSPPSGDLNQNDTSSKLQQDSSSSFSLFHFGGPVALSTGGKLN 1260
S L + + P L + Q DS +FSLFHFGGPVALSTG K N
Sbjct: 1209 SF------LKEANERRFNEMP---LMETPRKRCPQTDSDENFSLFHFGGPVALSTGSKAN 1215
Query: 1261 LTPSKEDDVGDFS---TNNEVEVIDNGHAFNKKETAI-EEYNLFAASNGMRFSFF 1281
SK+ + DFS + + V G++ +KE + EEYNLFA SN +RFS F
Sbjct: 1269 PARSKDGILEDFSLQFSGDHVFGDPTGNSKKEKENTVGEEYNLFATSNSLRFSIF 1215
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A1S3B599 | 0.0e+00 | 96.96 | uncharacterized protein LOC103486163 OS=Cucumis melo OX=3656 GN=LOC103486163 PE=... | [more] |
A0A0A0KZE9 | 0.0e+00 | 96.33 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G563700 PE=4 SV=1 | [more] |
A0A6J1HVV4 | 0.0e+00 | 88.24 | uncharacterized protein LOC111467149 OS=Cucurbita maxima OX=3661 GN=LOC111467149... | [more] |
A0A6J1EPP9 | 0.0e+00 | 88.72 | uncharacterized protein LOC111435513 OS=Cucurbita moschata OX=3662 GN=LOC1114355... | [more] |
A0A6J1DQ45 | 0.0e+00 | 87.95 | uncharacterized protein LOC111022059 OS=Momordica charantia OX=3673 GN=LOC111022... | [more] |
Match Name | E-value | Identity | Description | |
XP_008442254.1 | 0.0e+00 | 96.96 | PREDICTED: uncharacterized protein LOC103486163 [Cucumis melo] | [more] |
XP_011653932.2 | 0.0e+00 | 96.80 | uncharacterized protein LOC101210448 [Cucumis sativus] >KAE8649763.1 hypothetica... | [more] |
XP_038881990.1 | 0.0e+00 | 91.60 | uncharacterized protein LOC120073308 [Benincasa hispida] | [more] |
KAG7033556.1 | 0.0e+00 | 88.87 | hypothetical protein SDJN02_03278 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
XP_022967698.1 | 0.0e+00 | 88.24 | uncharacterized protein LOC111467149 [Cucurbita maxima] | [more] |
Match Name | E-value | Identity | Description | |
AT3G58050.1 | 0.0e+00 | 52.14 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT2G41960.1 | 5.0e-278 | 48.14 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |