Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAAGAAAAAAGAAGAAGAAGAAAACGAGAAAACGAGAAAACGAAACTTTATCGACTAAACGCTTGAAGTTTCTTAATACACTCCACCCGCGCTCTCAAAAATTGGATTCGCGGCTCACAGTGACGTAATCATTCTCTCTCTACAAGCAAAGACAAAATCGTTGCCGAATGAGCTCATACTTGTCCAAAATAATTTCCAATTCTTCCGTTTCCGATTCAAATCGCTATAGCTTCTTCCTTCATCCCAATTTGTATGTGTTTTTGTTTCTAAGCCATGCATTTAGGCAGATATGTTGAACTGGGAAGTGTTGGCGTCTCGTGCTGATTGATTTTGCTTTACCGTTTGTTTCTTTTGGAATGGGTGTTGGGAAGTGTTGTGTTGCGTGTGATTGAGAGGTTGTTATTTGTCTACTAACGTGGGGTTTAGGATTTCGAGCTACCAGGGTTATCTGATTCTCATATTGGGTGTAATTCTTTTTGTTGCTAATTTATGATTAGGGATTTTTGTTTGTTAGATGAAACGTAAGTAGCTCGTTTGATGTTGAAATTGGTAGCTGATCGATTCTGTTATATGCCTGAGTTCTTGCTGAGAGTTTGAATTGTCAGGGGACTAAGACCTGCTTCTTGGTCCCCCACAGATGAATCGTTTTTGGTTTAAGCGAGACTAAGAGGATAATGAATGTGAAACTTGATTCTTCTTTCTTGGGGACTCAACTTCATAGCTCATTGCATTGTGTAACGAATGGAAAATTTGTATATTTGGGTCGACGCCGATTGTCGAAAGGGGACTCTAAGAAGTATGTATGTGCAAAACATAACGAATGGAATGCTCGAGTAGAGAGGTTTTCGCGTTTTTTTGGGCAGCATTTGAAGTCATTGAGCATAAAGCTTAAGCCAAGACACGAGTCTTTGATGAAATGTGCCAATGAGCCTTTTGTTCAAACAAAATCTCTATCAAGTTTACTACGTCCTGTCTGGAATGAGGGGTTGTTTTTGATTAGATGTTCTGCATTCGCTGCTGTTGTATCTGGTATATGCTTACTGGTTTGGTATGGGCAGACAAAAGCCAAGGGCTTTGTTGAAGCTAAGCTTCTTCCTTCTGTTTGCAAAGCAGTCAGTGACTGCATTCAGCGTGATCTTGATTTTGGAAAGGTTAGAAGTATTTCACCCTTGAGCATCACATTAGAGTCGTGTTCCGTTGGTCCCGATGGTGAAGAATTCTCTTGCGGCGAAGTTCCCACCATGAAACTTCGTGTTTTACCTTTCACCAGTTTGAGGAGAGGGAGGGTAATAATTGATGTGGTATTGTCTCATCCAAGTGTGGTAGTTGTGCAGAAGAGGGATTATACGTGGTTGGGGCTGCCCTTTCCATCTGAAGGGACCTTGGAGAGGCATTCATCTTCAGAAGAAGGCATTGACAATCGTACAAAAATCAGGAGAATTGCCAGGGAAAATGCAGCTGCTCTCTGGTCCAAGGATAGGGATGATGCGGCTAGGGAAGCAGCTGAGATGGGTTTTGTTGTTTTTGACAGGAGCTCAGGTTTGTATGATACTAGTGATTATAAGGAGGTTGTAGGTCCTACAGTAGATATTGGAAACTCTAAAACATTTTTTTTCAAGGATGAGAATGTTCATTCGAGGGAACATCACTGCATGGATACGGATGTGGACTATAAAATAAGGCATGCAAAGTCAGAGAAGTATTTTGATGTAAAAAGCCCTGATACAAGGCTTAAATTTTCATCCAGAGCAATGAAAACGCTTATAAAAGGACAATCAAAGAGGAATGCAAGTGGAGACGATGTGTATGTAAATAGTTTTGCTGCAAAAAAGAGAATTCTTAGGCGTAGTACATTGGCAGCTCAGGATTATTTCAAGGGTGCATCTGAGGGGAAGTTTGGGGAGCCTTCACAATTACACAAGAGTTTTAATAATGCGAACCTTGACTCCTACTTGATCAAAAGTGGGAATGAAACTAATGCTGACTCCTCCATCACGGATACAGATGTCCAGTATGGGAAACAAAGTTTAGATGCCAGATTGAATTCTCTTAAAGAGAAAAGGGATATCGATATTCCGAACCATATAGATGATCAGACTTCTACAGTTACAGGCTTGGGAAATAAAGACAGAAGGTCTTTTTCAGCTACACCCAGCATTGATGAGTCCAATATGAGAAAGGAAGATGTTATAGGATCTGATCATGTTCCTGATGGAATATCTGATCACATGCGTAACACATCTCAAACACCAACTTCAACAGGTCATGAACATCAGCATGGAACATCTTGGCCAAATTCTTTCTGGGGACTAAGCCCAGAATCGGCTTTATCTTATTTTCCTAAAGATGTGGCCAAGAAGCTGTTGTACCATATTAGCATGTACGTTCAAAATCTCAAATCTGGCTTTGTTCAACACGCTAGAGGTGTCATAGATGGTGGAGATGTGATGAAGAATAAAGGAACCAATACAATGCTTCCAGTAACGATTGATTCTGTTCATTTCAAAGGCGGGACACTTATGTTACTTGCATACGGTGACAGAGAACCAAGGTAGTGTCTACTGTCTTCTCAAAAGTAGTTTATCTAAAACGGTTTTTTGTTTTTGTTTTTTCACTTTTCTTTTTTAAATCAAAACATCTTTACTGCGCGAGACAGAACAAAGAAGCAATATTTTCTATGCACACATATATTTTGAGTTAAACTTCGAAGCTGGCCATCATGTGAGTTGAGTTATTATTTGGGAATTGTATATTGTATTGAGAGGTTATAGATGAATTGAGACACCACTAATGTCTACCTTAGGAATGCTCATATACGGTTTTATATGCTGAGGAATAGGAAAAAATGCGTAATGATGCTGCCTGCTAATAGTTTGATAATGTAACTGATTGAAGCCATCTATATGAAGTGATTAGCTGGCAGGGAGCTTTTACAAAAAAGTAGTAGAATGCTTTATATGGGGAATCAAGATCAGTGTGGTTAAAATGGTGTTTTCGATAATGTAAATAAATGCAAGCTACATATGGACAAAGTGTTTACAATTTTTAAAATATGTATCCAGAGAGATGGAGAATGTTAATGGGCATGTGAAATTTCAAAATCATTATGGCAACGTGCACGTTCATTTGAGTGGTAATTGTAAGTCATGGAGATCAGAATTTGTCTCTGGAGATGGTGGCTGGTTATCTGCGGATGTTTTTGTTGACATCTTCGAGCAGGAATGGCATTCAAATCTGAAAATTACCAACATATTTGTTCCGGTATGCTTTATTAGCAACAATTATCTATATTTAAATACTGAGTTGAGATTTGGAAGTTCCCATCTACTTATATGAAGTGCTTACTCTTCATGGGTTCCCCTATCATTTCTATGCAGCTTTTTGAGAGAATTTTAGATATTCCAATCACGTGGTCCAAAGGAAGAGCCACTGGTGAGGTGCTTCTCTAACCTTTTGTCCTGCAGATTGAGCTTTAACACTTTCTAGAAGGATGTTAAAATTTTAAAGCTGTGACTAAACGTTATATGTTAACCTACTGATTTTATTTTATTTGTTATTATTTGTTTTAGCAAGAATTAAATTTTAAATTGAAGTAGTGAAAACAGACTAATGTTAAAGAATGAATGCCCAAAAAGAGAAAAATAGAGACAAAGCAAAAGAATTTAGTAATAGGTCACTAACCTGTTGATGTTTTACTAGCATGAAAAATTTAAATCAATATTTTAACAAGAGTTGAATCTGTAAAATTGCATTTATTTATTGTTGACTGCTATTCTTTATTTTATTTTCTTTTGGAAATTTATTTTGTGAATCTGTTGTCAGGTTCACTTGTGTATGTCAAGAGGAGATACATTTCCTAATTTTCAAGGGCAACTTGATGTGACCGGTTTAGCCTTTAAGATCTTTGATGCCCCATCAAGCTTTACTGTAAGTCTGACATTGTCAGTGTAACACGATTCCTTTCATTTCACGTATTAATTCTCTGCTATACCAGTAGAAAATTAAGTCCGTTGCAAGATTACTGCCTTCTGTGGTTTATAGGGAAAAATGCTGGTCATGTGTAGATCTTGTTCTTTGATTCAAAATGGATACTTTGTGGTAGGAATTCTTAGGTGCTTGGTGAAATTTTAATCCAATATTTTATAAAATTAAAGAATGGGCGTCATGTTTTGACGGTAAGTGTGACTGAGGAATTGCTTCAGAGAACAACCCTTAGAACTCCATCCTAAATACTATATTTAAAATTATTAAACTGTCTCCAAGGTAAAAAGAACACACACACACAATCATTCATTTTTTATGAACATTTTAAACACAGTTAAATTGCTCTGTTATCTGGAGGACCGGAGTTTGGAAGTTCAATAAGAACCTACTCTTGACAGTCAAATTGCTTTTTGTTACCTTCAAGGAAAGGGAGAACCTTCCCTTGTTGGTTCTGCAAGTCTCTATAGACGTTGAAGCTGCCTCAGAAATGTGAGGTGCTTCTTTTGTCTCTATTCTTAGGTGGTGTCAATATGAAAAAGAAAAGTTTCTGAAATACAATTCTTCTTTGATGCTCCACCGAAACTGGTGATCCTAATTTTTTCTTCTTAGGAGGATCCAACCATTCTACTTAGGAGGACCCAACCATCTTTTTTGTTCTTCTCTGACTGTAGTTCCTCGTTGGCTAAGGCTGATGGGGGCTTTTGTGGCCTGGGTTCATATGTCATATTAACTTGCAAATTTAAGGTTTTCAAGTTATTTTGGGTTAGAGTTTAAGCGTCCAAAAAACAGTTCAATGATTAATGCTTGCCCAATCTTTTTGTGGAATATGTGGTTAGACTTGCACAATCTTTTTAAGAATTAAATACTTTTTTTCTTTTACCAATAATAGGTTTAAATAATATTGACCTATTATTTTATCCAATGCTCCTTTCTTCGCTCAGGTGACACCTTGTCGCCTCTCGCCTTAAGGCAATGAGGAGACTTGTCGCCTTAGAGTGCACCTTGCACTTTAAAAACACTGGCTGACATAGAAAATCATTAAGGAAGGGAGAAATCAAGCGATCAGCAGCGGGAAGAAATAGTTTTCCCATGTCTGTTTGGGATCTTTTAGGCCTTAATACTTTTTTGTTACTCTCTCATAATTTTTCTAAGTAGTTGAAAATCTTTTCTTCGTAACTCTGGCTCTGAGGTTAAGGGTTTTATTCATTTATTTATTATTTCTTTCTGTTCTCAAACCAGTTTCTTATATGAAAAGAACAATAGTCCTGGGCTGGTTTTGTTTACTGAAATTTATTATAGTATAGTATTATAATATTATGTTTTTGAGCATGTTGAATACCCTGTCTTCTTTCCAAGTTTCTTTCTTTATCACTGTTGAAGCATGAATTGCTGCTTTCAATGTAGATTTTATTTAATTTTTATTTTAAATAATTTCTTAACATACTACCTAGTGATTGGATTGAACATTCGGCTCATTGTACTTTTAGCTATTTGCTAGTGCTCATTTAGAACTACAGTATCATGGTAATTCAAGCACTTATGCAGGAAATAGCGGCAACTTTATGTTTCCGTGGTCAGAGAATATTTGTACAGAATGCAAGTGGCTGGTTTGGCTGCGCTCCATTAGAGGCATCTGGTGACTTTGGCATTAATCCGGATGAAGGAGAGTTTCATTTGATGTGTCAGGTATGTTCGTATTTAGGGGAAACTGTCGACAATTATTTTTCCCTGATCTCATCAAATGTATATTACTCACTGTTTAGATGTATTTTGCCTTGTCATCTAATGAAAAAGGTAAGATTCTTCAAATACAAAGTCCTCACTTTCTTCCCTTCGACAAAAAAAGAACATTGAGAGGAGGACAAAGTCATTTATGGACATCTAAGTTGCAGTCTTTCTTTTGTTTTTAGTCTTAGGTAATCTGTTCACACCATCATTATAACCTTCTTGGGGGATTTTACTTTGAACATCTTCTCCGAAAGTTTCATTTCACAATATATTTGTGTACGTGTATAAATATAATACCAATCAAAAAATATTTCCCAATCTTTATCTCAACTGTTTGGGGTAGCGTCTATATAGAGGTCAGAGAAAGACCTAATTATCTTTTTAGTTACATTTTTCTTCGTACCTCCAGGAATTTTGATTGGATTCTTTTAACCCAATGATGTTTTTGCTGGACTATGTTGTTCTTGCTGTATTATGGATGGGCTTTTGGTTTGATCCGTAAAGCAAGTTCAAGAGCTTTTATTTAAGATTCTAAACGAGAGATATATGAGGAGCCTTGAAGACAAGCGCGGTGTGGTGATATTTTGGAGATTTGGGAGCAGATTTGTTTGGCCTCCCTACGGTTTCAATCACACCAGTTTTGCTACTTCATTTTCTGCTTGGACGCTTAGTCAATTGGAATCGAGTAATATATCTGGAAATCCCTATTGATTCTTTATTAACACCGTTTCAAACACAACGAGTACATTCCAAAATACTGTTACTTCGTCATCTTTACCCTTGACCATGACCTCCCTTGCATTTTTTATTCACACAACTCTTCCATTTTCAGATGGAAGACGAAGAAAAAAAGAAAAATTGGTAAAAGGAAAGAAAATTATGGAAGATTTCGTCACAATCAAATATGAAAATATAGCAATACAATTGATTTCTAAATTCAGTTATTGCAGTACAGTTTTTCATTTAAGTATTTTGTGTGATATTGTTGCCCTAGCCATCACATTTTCATTTTCGTTCTCACTCTTTTATTAGTTCAATTTGCACCCGGCGCGGAGTTTGACTGAGGTTGAATTCAATTTGGGTGTAGATTTGTTTGGCCTCACTGTGGTTTCAATCACACCAGTTTGCTACTTCATTTTCCGGTTGGAGAAGTGGATTTTTTTTCTTTTAGATATATATATTTTTTATAATCCTCGTTATAGCTTGTGTTAGATTTAAACCTCAAGCCGACCTTTGTTTCAAAGGTTGGGATTGGATAGGTTAGGCCCTAAGCCCAACCCTTGTTTGGTACAAATGCTCTAGAAGCCTTGGCTTCCTACAAACGTTTTCCATGGGTTTGAATGGTAAATATTGTTCAAACCCATGGTTACCTCTTTGTTTCTTTTTTCCTAATTTAATTTAGAAAGTTTGTCGGCCAACTTTCGGACACCACCACCGGTGACAGTCGTCGGTCAACCTCCAAGCGTCATTTCCAGTGACTCTATCGGCAAACTTCCGGCCACCAGCTGTGGTGACCTCCGTGGACCTGCCTCAGATTGCCACCGCTTGACCTTGAAAATATTAATAAAAATACTTTTAAGTATATAACAAACCAAATGAAACAAAACTTATATATAAAAGTACTTTTAAAAAAACACTTGTCAAACACATTTGTGTTTTTATTTTAAGAAGTTTTTATCAATAGTGTTTAATCATTCTAATATTTAAAAAAACACTTTCTAATCCTTTTTTGGAGCTCTTTTTAACATTATTATTATTAGATTATATTTTTTAATTATTTCACTTCTCTTCTTTGTTTATGATTAATGTTTTTTTCATTTTTGGTGATTTTGTTTATTTTCTTCTATTTTTCCTAGGTAAAAATTGACTACTTATTTATTGCAATCTCAATTATAAATCAAATATATTGTTAAAATAGTAGAAAATGTAAATATATATTGCATATACTATTGTTTATGTTTTAGGTTAATAATATTTTTGTTTTTTTCGTGAGTCTATTTAACTAGAAGACTTCTAACGTTTTTAAAACTTAAAAATCAAAATGTTGGAAACAATTTGGTTGCCAAATAAAAATAAAAATTGAAACTTCATGATTACTTTTATTCCTCAAATATAATTTGGTACAAAAATTTACAATTTTCCAATAACTGAAGTTAATATTAAAATGCTAATTTATTTCAATTTGAAAGTAGTAAATTATGGTTAAAAAACCCGAACTTTTTAGAATTAATTAAAGTAACAAAAGTTTGTGAAATCTTATTTCATAACCTAGTAAAATCAAAATATCAAATTAATAAACAATAAGTCAAATAAAATTATAATTAGTCAAAATTCTATTTCAAAATAATAATAAAAAAAATTCTTAGAAATTTTTTAGTTTATTTAAATTCAACTATATTAAAACTATTAGGATGAAAAATTTATACTAATTTAAAATCTAATCAATGATTAAATACAAGAAAATGTTTCACAAACTAAAGATTTAAAAATCTGCACTTCATCGCAGGCTTCCTAAGCTTGCACTCCAAACACAGTTTTCTTAAACTTAGACAACCTAAAACCTGCACTCTAAACATAGTTTTCACATTCCTAGGCTACCTAAGCCTCTCAGCCTAACCCGCCGACTTAACATTCCAAACCTGTGTACCAAGTGCTGCCTAAATGGTAAAAGTCTACTTAAGACTAATGGGCAAAAAGAGAAATTTAGCAATGGTTAAATCACCGGTTGACCCAAAAGTTTAAGCTAATGAGGGGAAGGTAAATTTAATATTAGTAACCCAAAAGTTTAAGCTAATTTAGTATATCAACAATTTTACACTTCCCCTCACATGTAAACTTGAAATATGTAGAAAACCCAGAAGTAGAAATCGATATTAGTTGGGGAGGAAACGAGACATTGTATGGGTTTGAACACAGGTTCTCTTGGATCTCATTTTCTAATACCGTGTTATATCACTAATTAACTCAAAAGTTTAAGCTTATAAGGTGAAGGTAAACTTAATGTTATATCGACACTCTAACCGCCACAAATAATGAGGAGAAGCAGCAGTTTCTCTTTAGTGGTTATTTATCATTTTGTTAAATGGTTTCAACCTGTAACCTTGTATTTGTTATATTCTCAGGTTCCCGGTGTTGAAGTAAATGCCCTGATGAAAACTTTCAAGATGAAACCTTTCTTATTCCCAGTAAGTTTAAACAATTTTCTCCTGGAAAATCAAATGGGTTAAGAGGGAAGTCTTTATTATCACGATCAACCCTCTTTGGTTAATGTGCTATTTTCTTCCTTTAAAGTTTAAAGGCACACACACATATGTCTGAATGAAATAAAATTTCGAGAAGAATCTTTTAAAGAAACATTTCAGGCTTGTTGATTATTCTCATAATATTGATCACAGTTAGCTGGTTCGGTAACTGCTGTGTTCAACTGTCAAGGTCCACTGGATTCACCCATCTTTGTAGGAAGTGGAATGGTTTCCAGGAAGATGAATAATTTATTCTCGGATCTTCCTGCATCCTGTGCTTCAGAAGCAATTGTGAAAAGTAAAGAAGGTGGTGCAATAGCAGCAGTTGACCGTATTCCATTTTCCTATGTCTCAGCAAATTTCACTTTCAGCATCGACAATTGTGTATGTTTTTGAGATAGCAATTAGCTAATTTAGAGTTTTTATGGGTCAAAGCATTAACAAATTCTATTGTATTTTTGCTTACTTGGTTAATGATCATTAAAAATTGTATTTGAGTAGGTTGCTGACTTATATGGAATTAGAGCCAACCTTGTGGATGGTGGTGAAATTCGAGGCGCCGGGAATGCATGGATATGCCCAGAGGTGAACCACAGGCTATAAACATTTGTTGTTGGCGTGTGTGCGTTTTTTCCCTTTTTAAGACAGCTGTTTTAAGCTAATTTTGTTCTTTCCTAGAAAAGGGTGAATGCTTAACCACTCATCCATCCAAAACTTGAACTTTTCTCATTTTGCAGTTATAAACTTAGAAATGAAGTATGAAGTTGCTGGAGACAAGGGCAGTAACCTTACTGCCTTTTTGGATCTTGGTCAGGCATTAACTTGAATCCCATATTTAATCGGTCCTCCTTTTCAGGGAATTTCCCTCATAAATAAACCTCCATTAGTCACTTGACAAGTACAGCGTCGTTTCTGGCTTTTAGGATTCTATTCCAAGTCTGTTTAAGGGATTGAGGAAGAGTAACCCATTTCCAGCTCACAAGGTGGCATCTTTGTACGGAGCCAATGCATTCCCTTATAAAGTCCCTTTACGTTTTCCTCAAGGTTTGGCCACTTTGACAGGAGTTTTGAAAGGGAAGAAAAAAAATAGCGAGATACTCCTAAGAATAAATTGTGTCAAAGTAATCCTTCCACCTTTGGGGATGGGTGTATTTCTTCATTTGGCCAACGTAGTCTTTAGCTTATCAGCAGTAGGGTCCCAAAATGAATTCTACAATGGATTGCCTCAGAAGGAGAAGGCAAGTCATAGTGTCTACATTGTTGCCCAAATAACCAGCTTTGATTGCAACATCTTCATTAAGGACGCTGTGCCCTCTGATTGCAGATTATCTCTTTGTTGCATGAAGATTCCTGAATAACTCCTAGTGTATAATAATTCTCAAACGGCACCTTTATGCCCTAAAGAGATGTATTCATTTATTATAGTTTCATTGACAGTTTTTAAGTTAGTGGTAGGTTGACTGAAGAGTAAATGCCATCATTCGGTACTTTTGAATGACTGTCATGAACATAGACTTGAGGAATTATTTTTTTCATTCATATCTTTAGATCTAAAGTTTTCGTTATTCTTTTTTTTCTATTTTTCTGGAAGTTAATTTATGTCCTATTCTGACGTCGAATTTATTTCTTTTCAGGGTGAGCTGGATGATACGGCAATGGATTTAAATTTTTCAGGAAATATATCATTGGATAAAATCATGCATCGATATATGCCTGGCTATTCAGATTGGATGCCACTTAAATTGGGACTTCTAAATGGGGAAACCAAAGTTTCCGGATCCCTCTTGAGACCGTAATACTGTTTCTTTTCTTTTCAATTGTTAATTGAAAGCTGTCTCTGTCTTCCTCTTTGGATCTGTCAGGCATTATGTCATGATCATGTTTTCTAGAATCTCCTGGATTTTCATCATTCTTTGGGCTGGTTACATCTTTCCCTCTCTTTCTTTCGGCAGGAGGTTTAACATTAATTGGACTGCACCACTTGCTGAAGGATCGTTCAGGGATGCTCGAGGAGATATCAATATTTCCCACGATTATATTATTGTTAATTCCTCCTCTGTTGCTTTTGAACTCTTCTCAAAAGTGCAAACATCGTATTCTGACAAGATTATGCTTGATGAAGAAGTGTTTGATACAAAGAGGACTCCATCATTTACTATTGATGGAGTGGAGTTGGACTTACATATGCGTGGTTTTGAGTTCTTGAGCTTAGTCTCTTATATTTTTGAATCTCCAAGGCCTATGCATCTAAAAGCAACCGGAAGGGTTAAGTTTGTGGGGAAAGTTTTGAGACCATCTAGTAAAGATTTTAGTAATGAGAAGAGTAAGCATCAGGTGCAGCCAATTGATGAAGAGAATAAAAATGGTCTTGCTGGCGAGGTTTCAATTTCTGGTCTTAAGCTGAATCAATTGGTTCTGGCCCCTAAGCTTGCTGGGCTATTGAGCATGACTCGTGAGTCAATCAAGGTTCTATCTGAATCCCTCTTCTGAAAAGAATTAGTCTTAACTTTGAAACTGACATATGAATTTATTTGTTTTACTTTGTACACATTTCCTGTGTAAGGTATTTCCTCAAGTTGTGTCTTTGGTTTATGGTTTAGCTTCAATTGACTTGTCATTTTTTTTTTTGACTTTTTAATTGAAGGAAATCAAACTTAAGTTTATTTGGTGAAAAAGTCAAACAAAAAAGATGCACTGTTGGAGAACTGAGTTATTTATATAACATCATAACGAACAATAGAATTTAGTCAAATAAAAAACTTTTTTCCATTGTTGTTCATTCCATTAATTTATCAATAGTTCCTCTCCAACTATAATCCCCAGCAAAAGGCCTCCACTGCTTTTATCCCAAGTCTTGCTGTCCCTCTAAATCTATGACAACTGACAAATGCTGTCTTTTGCATCTGTATCAAAGACACAATAGATCTAAAATTCCTGCATCAAAGATGTCCAACAGTTCTTGCCAAGGGCACATTTAACAGAAAGTCATGCACAACACTTTTGTCTGTCCCTTACACACTACACACTAAGATGGACTAAAACTCATGCTTGAGCACCTCTTTTGAAATCCTTCTGCAACAATGCTTGGTCTCCAAGGCAACATGTACAATGAACACCTTCTTAAATAATTTTTTTTAATGGAGACAAGCTTCTTTATTAATAGAAAAACTCAAAGTACAAGAGAATTATTCAATGAGGATAATACAGAAGTCTAAAAAAAGGAGAGGGAGAGAAAGAGGATCAAGAGGCGCACTCGGACATCTCAACTATGTTATTAATAAAGAAGAGAGAACAAACAAATTAGTTTGTTATTAGGGATGCAGTTTGAGAGATATCATAACTGTCTGTTTTGCTTCTTGTTTGCCTTAATATGTTCGTTAGAGTAAGATTGTTTTTTGATCTCTAAAATGCCCAGATATTTTGAGTTCAGATATTTATGAAGTTATTATAAGCATACCAAGAATTAACATCCAATCTTGTTCTTATAAAATCTATTTATCAGTGGTTCTTTACTTTGTGCCCTTACTTCTCTCTGTTTTTGCAGTTGGATACCACAGGTAGGCCAGATGAAAGTCTTTCAGTAGAAATTGTGGGGTCATTGAAGCCTAACTCAGATAATTCTAGAAAGTCGAAGTTGTTCTCTTTTAATCTTCAACGGGGGCAATTAAGAGCCAATGCACGTTACCAACCATCTAGATCTGCACATTTGGAGGTATGCATATGGTTTTGATATTTCAAAGAAGGGATCTTTTGTACTTTGTGTGACAAGCTTACCAGTAATCACACTGCACACCGTTGAGCTGATACTCTAGGAACATGATAGGCTGATTATGATCTCCATTCTTTATGTCTGTTGTTAATAGTTACGTCATTTGCCGCTAGATGACTTGGAGCTGGCTTCACTTAGGGGAGCAATACAAAGGGTAAGGCCAATCTTGTTCTTGATACAATTGTTTCAAGGCTGCATTTTCATATCTATAATTTAAGCAATGGTAGATCAAGGGGGAAGATAACTGCAGCAGGAGGCTTGTGACAATATCTTTCTTCTTTCTTTTTGTGTAGGTTGGTGATGTTTTAAATGGATCGATCAAACATCACATTCAGTTTCACATATAAATTTTCTGTTGATACTGAACTTATTTTGTAAAGATGACAAATAATTAATTAAGTTTATGATTTATTTTAGCTAATTCAACGTGTTTCGGTGATATATTTTATTGAAGTTTTGAAACTAGTCCTAAATGGAAATTGATTGTTTTCCATCTCTTTTTGATGTGATTTCTTCTTTCTCCTCTTCTTCCTTTAATGTTTCTTCTGCTAGTTTATATAGTTGAAATCTTTTTTTTTTGAACTGAATGTTAATTTCTATTTCAATTATTTTTTTGAACTGAATGTTAATTTCTTCTGCTAGTTTCCGTTGCTATGTATATTTGAAGCTTGTACTTAACAATGACCCGTTTACTTTCAGGCAGAAATTGAACTTAATCTTCAGAAAAGAAGAGGTCATGGGGTTTTATCAGTACTTGACCCAAAATTTAGCGGCGTGTTGGGAGAAGCTTTAGATATAGCTGCTAGGTGGAGCGGAGACGTGGTAAGTCACATATATTTACTTGCGTTTGTTATTCCATTAACTTTTGTTATTTATAAACTGCTCCTGTTATCTCCAATTTTTTGGGTCTGGGGGGGGGGGGGGGGGGGGACACATTAAAGATGTTAACAACATTGGTAGACAACATTGACGTTCTGTCTTACTAAAGATTATCTCTCCATACCTTTATTATCTTTTAGCCCAGTTCTAGTGGCCAAGATCCATACAATGGAAAGAGGAGAATGTGGGTGAAGTAAGTGGGGTGGTTTTATTATTATTATCATATCACCATCGGGAAGGTTGAAGAACCTCCCATTTTAACTTGAAGTTCTTGATGGTGGGAATGGAGCTTAATAAGTTTGTTCTAACAATTGATATTGTTTGATGAAGATATGTTGTAATTAATTATTTGTGCAGATCAGATTAAAGTTTGTTGGATAAAAATTATACCATAGTCAGGAAATTTAATTTTGATTAATTACAAAATTTAAACTGATTCAATTAATTTTTGAACGACTTTCAATTTAATGGATAAAATCATATTCTGCTTACTATAATTAATTTTGAATTCTTTTAAATTGAAAATATTATGTTAATATTCTTACAAAAGAATTATATTAATATAATATGAATAAGTAACTAATAGAGGCAATTAAAATTGATTGGTTAATTCTGAAGAGGTAAATTGATTTAGTATGATTAATAAGCTTTAGCTATAAGAATTTTGAAGTTGTCACACAAACATGTATAACAAGAAAAAGCAAAGACATTTCACTTGATCGTAGACTTTTTATGCCTAGTTATTTGGTTATCAAGCATCCTAGTTCTATCAAAAATTTGAACCAAACGGCTAATAGCTCGATGATCTCATATGGACGCAAAAACAACCTTTTTGGACCCTTTCCTTATATGTTTCAATTTCTCAATTATTTTAGGGTTGTTTTTAAATACAGAACAAAAATATTTACAAATATAGCAAAATATCCTAGTCTATATGTACGATAGACTATTGTTTGTATCTACCTATCAGATATTTTTAATTACTTTCAGCAGTTTGATCATCTAAAATAATTTCTCATTATTTTATACGAGCATTTGTGCTTCCATACTTTGTAAATTGTACCAGCTATTAAATATTTAACGATATTCAGATCACCATCGAGAAAACTATTCTAGAGCAAAGCAATAGCCGTTATGAGCTTCAAGGTGAATATGTGCTTCCTGGAAGCCGTGATCGAAATGTTACCGATAAGGAAAGTACTGGATTTTTGAAAAAGGCAATGGCTTCCCATCTGAGCAGTGTCATATCCTCCATGGGAAGATGGAGGATGAGACTGGAAGTTCCTAGGGCTGAGGTTGCTGAGATGCTTCCACTGGCCCGCCTTCTATCGCGAAGTGCAGATCCTTCAGTTCACTCAAGATCCAAGGTTATCTCCTCTTTACTGCATGAAAATTGACCTCTCTTGGGTATGCGTATTTATTCTTAGATTGACTTTTCTTATTATTATCTTCTAAAATGTGGTCAAGAAACTTATATCTGCTTATAACGTCAACCATAGCTCTTCCTTAAAGCATGTCCTTTTATTTATTTAGTTTTTAACTTTTATTTTATTTTTTTCAATTTTTTTTGTAAATTTTATTTTAATATGAATATTACTCTTTTAATGGTGTGAACTTGCAAGGGGTAAAGCAATGCCATTTTTTTTCCCTCAAATCTACGTCTAGCATTTTGATGAAAATGACCATAACAATTGTGGAAGGCTTTATTTTTTTTATTTTTGACTTCCTACGCCATAGTAGCGCATTCAAATTTGATGTACGAGAAATATTTTATATCCATCATTTCTTTCTTTTGGATAAGAGATCATTTCATGGATGAATAACATAATCCAAAGAGGCACAAAATGAGGTCCCCGTAGGGAATACAAAAGGTGTTTCCAATTTACAACAATTAAAACAATGATCTTTAAGGGGCTGTAGCCTGTAGATTCTTACACATATGGGGCTTGCGCTATATCTTAATTGATGCATTTTACACAAGGACAGGAGCAAGACCTTAGCACCTCTTACTTGATAGATGGGTGCAGGCTGGATCAAGTTGACTTTTAGAGTTTGGATTACATGTTGTTCCTTGTCAATGGTCTTTTTGGTCATTCTCTTTTGCGTAAGAGGAAATTTTATGAAGGAACGTGTATAGGTTTTATAGCTGGACAAAAATAGTAGGATCTTAGGACGGGTAGATAAATATATTCGTTTGTGGGATTTAGTCATTGCTCTGTTGATATAGGTGGTATGCTCTCCAAAGAAACTTTTATATTTGTAGCATTCTTTATTAGTTACAATTTGATGACTTCTTTCTAACCTCTATGGACGCTCTCCTTTTTTTTTTTTTTTTTTTTTTTAACGGAAGCAAGCCACATTCAGTGAATAAGAGAAATGAGACTAATACTCAAGTTACGATATAAAAATACAAAGTATAAAAGAACAAAGGATCGGGAGGAGCACTAGAAAATCTCAACTAGGTTGATACAACCTTAGCACCATCCTAATATCCAAACAAACTATCTAACTGACAAAGCTAGACAATTACAACAAATGATCTCCACAATCCAAGAGGACAATACAACTCAAATGCTACCCTAACTGTAAGATGAGGTTCAAAATGAATATCTAAAGCAACAAGACGAAAATGTCAAGACTAAAACAAAAAGCAAAATACCCAAGGCGAGCAACATTAGCCTGTAACATAAAAGCTCTCCAATTGAGGCAAAAACTCTCCAGTTGAGGCACAAGTCTTGAATGGAAAAATCTGCAAAAGCCTTAGCTTGGCAGCACCAAGAAGAGGCAAGACAAGCAACTTCAAAGGGAACCATCCAATCAGAGAATATATCATGGAAGACATGTTGATTTATTTTAAACCAAACTTCTGCTAATAGTGCCCTAACAACATTGGACCATATCAAATGTGGCTTGGTGTGCAAGGCTGGACCCACCAAAATTTGTGATACATTATCACGGTTAACATTGACAAATACCCCACTAATGTTAAATATGTACAATAATCGTTGCCAACACTGTTTAGAATAGTGACAAACAAAGAACTTGTGTAGTAAATCTTCATTTCCAGCCATACAAAGGGGTCAAATGGAAGGAGACAAACAATGTAAAGGGAGTTTATGTTGCATAGTCGATGCACAATTCAAGCTGCCAAAGATTAATCCAATCAAGAATATAACGTTTCTTGGACTTTTAGACTTCCATAGGGCTTTACCCAACTGATTATCCAATGGAGATGCCAAGGACAAGTGAATAGAGAGGGATTTAACTGAAAAAATCCCATTAGCTTCCTATCCATATTCTCATTTATCCTTATCTGAATTCAACTGAAGGAGATTCTGGAAATCAATTATTTCGTCCTCCTTCAATAGCCGATGGAAGAAAATAGACCAAGACGAAGATGATTTACCCAAGTGATCGAAACTGAGCCATTAGGATCTAAAGCAATTCAAAATAAAACAGGAAAGCGTAACCTGTGAGGAACCAAATTAATCCATGGATCGAGCCAAAACCCAATTCTTCCACCATTACCGACGACCAGAGACTCTACCTTCAACCATGTAAGAGTTATACTAATTCATGGACTTTGCAAACTTAGACTGTCCTTATCAGCAGTATTCCAATTGAATTTGTGTTTTCCATGAATACTTCTGACAACTTGGCACCAATGAGTTTTTCTCATTAAATATCTCCAACCCTAATCACCCAACATGTCCATGCTAGCCATAGTTTCTTTGAAAACTCTCAAACTTCATTCAAACCAAAATTTCCAAAAAGAAGCCACAACTGCATTAGCCCGGAGATTTTTGCTTGCCCACAACCAGAGGATTCAGCAGTTTTGTTTTACTTTGTTTTCTTCTTGCTATTTGATGGGGATGGAATAGCAAAATTTTATAATTCTGTGGGGTGGAGGGGTCCGAGGATCTTTGGTACACAACTAAATAACATTTCTCTTTGAAGACATTTGTTAATATGACTTTTTGTAATTATCAACTTAGCCTATTCGTTTGGTTTAGACCCCTTGGCGTGGGTCTGTTTCGTTCTTGGCCTTTTTATTGACATGTTTATTTTATTTTTTGATGTCTCATCCGGAATCCACCTAAAACGGTTTAAGAACCTCACAAGCATAAAAGTAGTAGAGCAATGAAGAAAACCTTGCCTATGAACACATCTTTTTGCGTTTGATCTCATCCCCTGCAGTGGAACTCTTTTATTTTTGTGTGGTTATTAATGAAGTGTTTGTTCTATATCTAGGGGAAAAAAGTGTAAAATTGAAGCGAACAGTGAAAATACCTCCTTTCATGAGTATCATGCAAACATACCAATAAAAAGTAAAGTTGATAAGAGTAAAAGTTTTACCTTTCAAGAAATTGGAAAATAAGGGAGTTTTTAATTTGTGGGTATTCTGTATGGGAAGTGTGTTATCCTCACTTGCTAAGCTTTTAATTTTTCTATCAAATGCAGGATCTTTTTATACAAAATTTGCAAGCTGTTGGATTATACACCGAAAGTGTTCAAGATCTGATTGAGGTACTTTTTGATAGAATAGTTGAAGCAGCTATGTTTTAACTGTGACTTGCATTGATCAGTCTGTATTCTGTATTACCTTCCCGGAAATGTCTTTTTTTCTTTTCTTGATCAATGATCTTGTCTTCAGCTGCACTGAGGACATACATTCAAAACCCCACGTTTTGTTTTTATCTTACTTTTCCAGGTGATACGGAGACAATTTATTCTATCAGATGAAATTGTTCTTGAAGATTTATCTCTTCCAGGCTTATCTGAACTCAGAGGCTGCTGGCGAGGCTCTCTTGATGCAAGTGGTGGAGGCAATGGGGATACAATGGTAGATCTTGAATTATAATTTTAGGCCATACAAAGTTTATTTCTCTTTCTTTCTGTTTTTGATGATTAGCAGTGGTGCATATTGAAAGCAATAAACACGAAACCGTCTTACGTGGAAACCCAAGAACCGGAAGAAAAACCACGATTTTTTAATTTTATTATTTTCTGATAATCATACAATAGGTACGAGAGGGAATAAATAGAAAAGTACAAAGAGATAAAAAAGGAAAGAATATTTAGGGTAAATCTCTCTAATGGGCTAAGCCCACTAATTTTAACACTCCCTCTCTAGTTGGGACGTAAATATCAATGAGGCCTAATGTGCTAACACAGAAGTCTAAGTTTTGTCTGAGAAGCCCCTTGGTGAGGACATCAGCAACCTGTTGGCTCGAATGGATGTACGGAATGCATATGCTTCCACTGCCAAGTCTTTCTTTGATGCCGATCAATCTCAACATGTTTAGTTCTATCATGCTGAACTGGGTTGTTAGCAATACTAATAGTGTCTTTATTATCACAAATAGCTTTAATGGAGTCTCACATTCCTGATGAAGATCAGATAGGACATTCTGAAGCCAAATTTCCTTACATATTTCCAAACTCATAGCTCTATATTCGACCTCAGTACTGCTCCTGGCCACAACACTTTGCTTCTTACTCCTCCAAGTTACAAGATTGCCCCAAACAAAGGTACAATAACCGGAGGTAGATTTTTTGTCAACAACAGATCCTGCCCAATCGGAGTCAGTATATGCCTCAATGGTCTTTCTGTCTGTCATTCCGAACATCAACCCTTTACCAGGTGTTATTTTTTAAGTATCTCAGAATTCTTTTGACAGCTTCCATGTGTTCCTCATAGGGAGCCTGCATAAACTGGTTGACAACACTCGCAGCAAAGGAAATATCAGGAAGAGTATGAGATAAGTAAATCAATTTACCCACAAGGAGCCGATATTGTTCTTTATCAACTGGAACTTGATCATCAGAGTTTCCTAGTTACAGTTGAATTCAATAGGAGTGTTAGCTGGACGACATCCCAACATACCTGTCTTGGTTAGCAAATCAAAGGTATATTTTCTCTGAGACACGGAGATGCCTTCTTTAGATCTGACCACCTCCATTCCAAGGAAATATTTTAGATTTCCCAAATCCTTGATTTCAAATTTATTACCCATTCTCTGTTTAAGTTGACTGATTTCTGCCTGATCATCTTCAGACAAAACGGTGTCATCCACATAAACTATCAGAATTGCAATCTTTTCTGTCTTTGAAACTTTTGTAAATAAAGTATGATTAGAGTGTCCCTGATTGTACCATTGGGACTTGACAAAAGTAGTGAATATGTTAAACCATGCTCTGGGTGACTGTTTCAGACCATATAAGGCTCTCTGGAGTTTACAAACCTACTGACCAAATTGGGCTTCAAAGCTAGGCGGGACTCATGTAGACCTCCTCTACTAGATCTCCATTCAGAAAAGCATTTTTAACATCCAGCTGATATAGAGGTCAATCTTTGTTTACAACAACAAAAAGGACTCCGACAGTATTCAATTTAGCAATAGGAGAAAAAGTTTCTAAATAATCAACACCATAGGTCTGAGTAAACCCTTTTGCAACTAACCTTGCCTGGTGTCTGTCAAGTGTTCCATCTGCTTTGTATTTGAGAGTGAACACCCATTTGCGTCTCACAGGCTTGTGTCCCTTGAGTAGAGCACAAATCTCTCGAGTGTTATTCTTTTCAAGAGCCTTTATCTCTTCCATGACAACATTTTTCCATGTAGGGCACTTTAAAGCAATGTTAATATTTTTTAGTATTGTGGTAGAGTCAAGGTTGGCAGTAAAAGCTCTAAACTGTGGTGAGAGATTCTCGTATGACACATAATTAATATGGAGTGCTTTGTACAGGACCTAGTACCTTTTCTCAATGCAATAGGAACATCAAAAGAAGGATCATACTTGTCAAGTTTCCCTGTATGACCCTGTTCAGCTTCATTGCTACTGGTTTCTGTTCTGACCTCACCACTGTTCTTTTCTTCCAAATTTTCAAGAACAACAACATCAGACCTGTCATTCTTATTCATTGTATTATTAGTACAAGGTTTAGTAGGGTTTTCCATACTTTGATCTCGAGGAGGTTTGAAGTCTTGGATTGGAGCCAACGGTTGACTACTAGAGGACCTGACTTGAGATTCCTCCTGTAATACGCTTTCTAAGGAACTTGGTTTGTGGGTAGGATTATAGGATGAGGATCAATGTCAGACATGGTACTAGGAGTAGGTTTGATAAATTCAAAGGTGCTGTAAAGACTCTTCACTCACATTCTCCCTCTAAAGATGGCTAATGGAAAAATAAGGTTGGTCCTCATAGAAAGTAACATCCATAGTGACAAAGTATTTCCTGGACGGCGGGTGAAAACATTTATACTCGTGCTGGTGAAGGGGATACCTAACAAATACACAAGCCTGAGCCCAAGGGGTAAATTTGGTCTGATTAGGGCCAAAATAATGGACATATGCGGTACACCCAAACACACGAAGAGGAACCTCTGAAACAAGACGAGTAGAGGGGTAGGACCCCTTAATACATTTTATTGATTAAATGAGCTGATGTAAGAATAGCATCTCCCCATGGGTATGAAGGAAGGGAAGTGGAAAGCATAGGGGAAAAGGTACTTCCAAAAGGTGACGGTTTTTTCGCTTGACCACTCCATTTTGTTGAGTGTAGGTGCATGAGTTTTGGTGAACAATCCCCTTGAAGCTAGAAATTCACTAAGGTTATGGTTTTGGAATTCCTGACTATTATCACTTCGAAGAATAACAATTTTTTTATGGCATTGTGTTTCAATGATGTGATAGAAGTTTTGGACAATAGAGGAAACCTCAGATTTATCGGTGATAAGGTATGCCCAGGTAAGACGGGTATGATCATCAATGAAAGTTACAAACTACCTTTTCCTAGATGAGGTGGTGACCTTGGAGGGACCCCAAACGTCACTATGGATAAGGGTAAACGGTTGTGTGGGATTATATGGTTGTGAAGGAAAAGAAACTCGATGTTTGTTTTGCCCGAAGACACATCAAAAGATAACGAGGAGACATCTATTTCAGGAAAGAGATGGGGAAACAAATATTTCATATGGTGACCCAACCGAAAATGCCACAACATAAAGTCAAGTTCAGAAGTGCTAAAATTGGAAGACAGTAAACTAGTCGTAGAGATACTACTATCAGAGGTATCATCATCAAGGATGTAAAGTCCCCTGCTATGCCGGGTAGTGCCAATCGTCCTTTCTCGAACTCAAGTCCTAAAAGCAAATAGATTTAGATAAGAAAGTAGCCTTACAGTGCAGCTCATGGGTGATCTTACTTGTATAACAAATTGTAAGAAAGCGTAGGCACATGCAAAACATTCTAGAGGGAGAAACCGTCAAAGAGAACTATTTGTCCTTTTCCAGCAATTAGGGCTAAAGAGCCATCGGCTATCCGGATTTTCTAATTACCGGCACAAAGGTATAAGAGACAAAGTGCTTCGAGAAACCTGTCAAGTGATATGTGGCCCCTGAGTCCAAAATCCAAGGATTCTTCCCATCAACACTAATAAGGCCAAGGGACTGAGACATCCCTGACTGAGCAATGGCGCTTAGAGTAGTAGAGCTAGCCTGGCTAGCAGTAGGGCCAGTTGGCTGAGAGGTGCTGGCAGTCTCCCTAACATGTATGCCTTGAGTTCTGTTGCTCGTTGGAGGAACGTTTGTTACCTCTTGGGGGCTGATCGTGGAGTTTCCAACACTAATTCTTGGTATGCTGTTGTTTCTTGCACTGCTTACAAACAAAAATTGGTTTCCCATTATTCTTTTCATTATCATGGGTCAAGGATCGAGCACTAAAGACAGCAGAGTTAGTTATATAATTTGAAGGCACATTAGGAAGTGAAATTATCGGGTTCTTGGAATACATCAGCAGATCGGTCTGTGTCGAGGATGCACCAACTTCAAAAATAGCTCTCTTATAGGTTTGGTTAATCCTGAGACCACAAACATATGGTTGTCCATAGCCGGAGGGTGCCGATGATTCGCCCGCTTCAATCCCCGATCTGTTATGGCGTTGGTCAGCCCCGTTCCTATAGAAGAAAGGTTGCTGTAAAGGGTCAACGGACAGATTTGCATTGCATTCCATGGTGGTGTCCACGGCGGCAGCAGTAGCTTCTGTTTTGCTCTGGGTGTTTCCTAAATTTTTTTTCTAGGGTTTCGTTGTTGCTTCGCTCTGATACCATATTGAAAGCAATAAACACAAAAATCGGCTTATGTGGAAACCCGAGAACTGGAAGAAAAACCACGATGTTTTTAGTTTTATCATTTTCTGATAATCATAGAATAGGTACAAGAGGGAATAAATAAAAAAGTACAAAGAGATAAAAAAGGAAAGAATATTTAGGGTAAATCTCTCCAATGGGCTAAGTCCATTAATTCTAACCGTGCATATACATTGAACAATGTGGCTTTAAACATTTTGGTTGAAAGTTAAGTTGATGAACTGAAAGTATGTTACTATCTGACTAAGAGGTCATGGGTTCAGTCCACGGTGGTCACTTACCTAGTAATTAATTTCCTACGGGTCTCATAACACTCAAATGTTGTAGGGTCAGGTGGGTTGTCACGTGACATCAGTCGAGGTGTGCGTAAGTTAGCTCGGATACTCACGGATATAAAAAAAAAAAAATTGAAAGTATGTTTACTGTCTGAACCATGTACTAGTTGACTAAATAAGCAGCTGCACACTTACCAAAACAAATGATGTCAGAAGTTCAATATTTTATTGAGTTTAATTGTGGAAACTGGAGGCCCTTTTTTAGATCAATAATAGAAATGCATATCATTGCTGCAAACAATTAATTGTGTTGAACTTGGATCCATTGGCTTCTAAGAATGATCATCATTTCTTGGTGATTTTCTCAGGCGGAGTTTGACTTCCATGGAGAGGACTGGGAGTGGGGAGTCTACAAAACGCAGCGTGTTTTGGCTGTTGGTGCATACAGCAACAATGATGGTCTGCGGCTGGAAAAAATTTTTATCCAGAAGGATAATGCCACAGTGCATGCAGATGGAACTTTGTTTGGCCCAATAACTAATCTTCACTTTGCTGTTCTAAACTTTCCTGTCAGTCTAGTTCCTGCTGCGGTTCAGGTAATTGAATCTTCAGCAAAAGATCTTGTTCACTCATTGAGGCAACTTGTAGCTCCTATTAGAGGTATACTGCACATGGAGGGAGATCTCAGAGGCAATCTTGCAAAACCAGAATGTGATGTGCAAGTAAGGCTCCTTGATGGTGCCATTGGTGGTGTTGATCTCGGTCGTGCTGAAGTTGTTGCTTCTCTTACCTCAGGCAGCCGTTTTTTGTTCAATGCAAAATTTGAACCAATTATCCAAAATGGCCATGTACATGTTCAAGGAAGCATTCCTGTGATGTTTGTCCAAAATAATATGGGAGAGGTGGAAGAAGTGGAAACAGATACAAGTAGGGGAACTTTGGTTCATGCTTGGGGGAAGGAAAAGGTACGGGAGAAGTTCAATGACAGAAAAAGTTCTAGAGATAGAAATGAAGAAGGTTGGAATACTCAATTGGCTGAAGGGCTTAAAGGATTAAACTGGAGTCTTTTGGATGTAGGAGAAGTTAGGATTGATGCAGATATTAAAGATGGAGGTATGTTGCTACTTACAGCTTTGTCTCCTCATGTCAACTGGCTTCATGGAAATGCTGACATCTTACTTCAGGTGAGCACACTCCAAACTACATCATGGGTTTATATCATCTAATTTCTTACAGACTGTAGTTCAATTACATTTACTCTAAAAAATAGGTTAGAGGGACCATTGAGGAACCAATACTCGACGGATCCGCATCTTTCCACAGGGCTTCAATATCTTCTCCTGTGCTCCCTAAACCACTCACCAACTTCGGTGGTACACTCCATGTAAGATCAAATAGGCTTTGCATCAATTCCTTGGAAAGTCGTGTAAGCCGAAGAGGGAAGTTGATTCTGAAAGGGAATCTGCCCCTTAGATCCAGTGAAGCATGCCTCGATGACAAGATTGATTTAAAATGTGAAGTTCTGGAAGTACGAGCAAAGAATATTTTCAGGTTCCCTGTCTAGTTATTTATGAATTTCAAGTACTCTGATGTTTGAAAGGTTGTGGAAGCTTGGTATTGATTTTGTACATCTGTTTTTTTCTTGATGTTTTGGATAGATTTTTACGACGTGGCCTTACTTTTTATTTTTTAATCCGATCTTGAGATTCATAAGAATAATGGATAAAAATAATTGATGAGGTTTTTATATTGGTTCGTGTTTCTGATTAAATCGTGATTGTTTTCAGAAATAGAAAAAAAGGATAAAAATAATTGAAGCAGCCCTGTTTCATTTTTTGAGAGCATGTTAAGAAGAACAATGCTCTTTCTTGCTCAGCATAAGAAGTTTGGACTTCTATGTGACATGAATCTCAAGATTTATGCACTTTTATTATGTTGTTACATAGAGTTTGCCTTCCCTAACTATAGTATTTCCTAACATGTTCTCTCTAGCCTCTAGCACTCTTTCTAGTAAATCTCTTCTTACTTGTTGGTTGATCCATCCGCCACTTCTAATTCCTCTTTATTTGGTTCAAATTTCAAAGAAGTTTAGCTTTTCTCTTCATTTTTTCCAAAGATCTTCCTCTATAGTTTTGAGACCCTGCAGTGCGTGTACAAGAAGGAACTGGGAGATTAAGCCTGGATAACGTGTTGTGCTGATTTTATATTACTATGAAAATTGGCTCTGATTTTTCGAAATCTTCAGCCTATGTTTAGCTCCTAACTAAGATTGCCTAGACGTGATGAGGAAGCTCATTGATCTTCCTCTGTAACAGTGGGCAGATTCTTCGGTGATGGGCCTTTTTTTGATATTATTTGGAGCATCTGGTTAGAAACATGAGTAGGACAATATATATATGTATATTATTTTGCCAAAGAATGTAATATTTCAAGGAGAAACCCTAAACTTAAAAAATTATTTTGCCAAAACTTAAAACTTAAAAAAATATTTCAAGGAGAAATCCTAAACAAAAGAGCACTTGCAAACAAAAGCTTCAAAATGGATGGATGGAGAGGAAACACCAAGAAAAGACAATCAACCAAGCTGCTTCAAGTTTGACATGATGAAAAAAAATTCTTTTGTCCAGGAGCCTTGCAAAAAACTTCAAATAAACCCCATCAAATCTCCAAGCTGCCTAAGTCAAAGAACTACCGAACAATATAAACTTTTTTTAAAGTGATTCAGAGTTAAAATTTAAAACCTTAGTTGGCATTCGATTGATCAGGTATGTAGTTGTGAGAACTGTGCTACCCCACAAATATTTTGGGAAATTCGTAGAGAACGGCGGTTTCAAGTAAATGTCTATTCTTTTCGCTCAACAATTTCATTTTGTTGAGGAGTATCCCGACTTGTGGCTTGATGAAAAATACCTTTTTCGTGTAAGAATCTGGTTAATTGTTTGTTGAAATATTTAGTACCATTATTAGAATGAAGGATGCGAATTTTAGTTTGGAACCGAGTCTCAATTGTATTGTATAGTCTAACAAAAACTCTTTTACCTCCAACTTGTTAGTTAACAAATAAAACTAGGTTAAATGAGTATGATCATCTATTAAGGTGACAATCCAACTCTTTCCACTATAAGTTAAAACTTTAATTGAACCCCAAACATCAGTGTGAATTAAGGAAAAAGGTGAAGAAGCCTCGTAAGGTTTAGGCAAATAAGTGGATCAATGATGTTTGGCAAAATTGCAACTTTCACATTGAAAAAATGAACAATCAAGACTTTTAAATAAATAATTCTGGAAACAAATATTTTTAGTAAAAGAAATTGGGATGCCTTAATCTGCGATGCCAAAACATTGTAGTTTCTTTAACGGAAGGAGAACATACACTACTGAACCCCTCAACTTGTTTATAACTAGCTGAAACTTCATCAAAGTAATAGACATCACCAATTATCCTAGCACATCCAATCGTCTCCCTTGAGTCTTTATCCCTAAAAGTACAATGAGTGTCACAGAAGACAACACGACAGTTAGCATCTTTAGAGATTTCACCAATGGATAACAAATTGCAAACTAATTTTGAAACACGAAGAACATAATGCAATGCAAAATTTGTAGTCAGAGGAATAGTTTCTTTGCCAACAATAGGAGAAACTATCATCTGCGATGCAGATTTTTTCATTGCAATATATAGGAGAGTAGGATGCAAAAAAGGAAGAGGAATGAGTCATCTGATCAGAGGCTCCAGAATCTATAATTCATGGAGATGAGTTTATGCAAGAGAGGGCTTGAGGAAACTTACCTGATTGTGCCAAGGAAATGCTAGGATTATTAGATGAGGAAGTAGTCTTTGGTACTTTCAAGATTTGATCAATCTGCTCCTTACTAAATAGGTTGAAATCACCAACATTTGCATTTGTGGTGCATAGGTGTGAATTTCTCGCCCCTTACTTAGAACTTCTCCAATTCGCAGGTTTTTCATGAAGCTTTCAACATTTTCACGTGTTTGACAGGGTTTATTGTAATGGTCACACCAGACTCAAGGCTTTTCGTGTGTCTTGCTGGAGTGATCAAAGGCTTTTATATTTATTCGGTCACCAACACAGAACTTTCTACTGAATCAATAGGTTTTTTTCCAATCATAACATTTCTATAGCTTTCTTCCTTGTGAACTTTAGAAGAAACGTCATTAATAGATGGAAGAGTGGTTTTCCCAAGTATTCGACCTCTAACCTCATCAAATTCAACATTGAGGCTGATGAGGAACTTTTAAATACGACCGTCTTCTACAGTTCTTCTATAATGTTTTTGATCATTTGTAGACTTCCACAGTTCCACTCGTATGAATCAAAGAGATCCGAGTCTTGCCAAATCATTTTTAGGGAGTGAAAGTATTCTGTCACCAAGTTTCCTCCTTGGCGTATATCTCCTAATTTGAGATTCAACTCAAATACCTATGATTGATTGCCTAAATCAAAGTACATTTGTGTCGCACTGTCCCACAATTCCTTGGCCGTAGAACAACACATGTAATTACAGCTGATTTTTTCAATCATGGAGTTTACAAACCAAGTCATAACCATGGAGCATCCCATGTAGCAATCGAAGGGTTGTCTGGGGCAGTTTTCTCTACGGTTGTTTGGTTTTGGGGCAGTTTTCTCTACGGCAATGTAGCCAATTATCCCTTGTCCACGAATATATCTAACACTTTGTGACCAACTAAGAAAATTCTCCATTAAGTCAGATCATGGTGATTTGGATAGTGGGGGCTATTGGCATGGATACAATTATCGACTTTAGCCATAGGTAACTTAGTATCCGACATGGCGGAAGCAAGAAAGGTAAAAGAACAAAGATAACAGATCTGAAAAGACCAACCAAACTCGTCACAAGTGGCGGAATCAGTGGTGGACGGCTCAAAAGGGGCGACTAGTGAGGACCACGACAAACAACGGCGTAGACAAAAAGAACACCAACTTACAGCAAACAAAATGCAAGACAATATGGCTCCAGCGGTGTCCAGTGATTGGTGGTCGGCGGCAGCGATCGGTTGGAGTAGAATGATTGGCGGCAGCAACCTTCAAGGCAGCAGCAGTCTGTTAGAGTTTTAGGGTTTTGTAGGTTTGAATAAATTAGGGTTTTTTTGGGTTTCAAAAACCTTATGTGATACCATGTTAAAGAATATAATTTTCTATATATTTCACAAAGAGAGAAGCGGTTTCCTAAATATAAGAAGAAACCATTACAAATAAGGAATTTTTTTGCGAATATAATACAACCAACACAATAAAATATAATTCAAAAAAGAAAATAATCAACATATACAAATGGAGGAAGAGATGAGGTATCCCCAAACAACCAATTCTTAAGTCTCATTTATTAATAATAAATAAAGAGACTTGTCTCCGTCTCCAAAAAAAATTGGGTTAGGCCATAATTACAAAAGAAATTAGAAGTGGAGGATCACGTAGAAGTAGAAAGAGTAACCAAATTCCGAAACAAATCATTTAAGATAATATTTCGAGATCTGATGTTCTAGTATTGGGAGGGGATTTATAATATAGCACACTTCAATTTTCTTTATTTTATATGAGAATAAAGGATTTGGTATAAGACAATCCCTCTTCCCAGAGGAAGGTTGATGAAAGCCTTCCATCCTTTAGGTTGATGGTAGCAGTAGAGTTGGTAGCGCACGTCAATGTTCTCTTCAAACTGTAATTTACTAAATCCCTTTTTTTTTTGGTAGTTTGCTTTATGCTTTGCGGCCTCCTACGCGCAGCTTTTTTCTTTTGTACATTAATTCATCTTTATAATGAATCATAAAAAGCTTTCTTCAGCTATCTATTTTGCTCCACTCTTCCATTAAATTAAAATTATTTATCTCTCTCTTCAATATTGCAGTGGCCAAGTTGACTCTCAAATGCAAATTACTGGGTCGATATTGCAACCGAACATCTCTGGAAATATTCAATTGAGTCGTGGAGAAGCATATCTACCACATGATAAGGGTAGTGGAGCTGCTTCATTTAATAAAGTGGTATCAGACCAGTTCAGTCTTCCTCCTGGCAGTTCGAACCAAGTAGTTGCTTCTAAATATGCTTCGTTTTTCAATTCGGAATCTACAGCATTGAAAACTAGATTCCGTGTACCCCGAGGTGATATTTGTATTTTCAATCTATGTACTTTCACGTTGCATTTGTACATAAAAAAGAACAATGTTCTGATCTTTTATCTACAAAAATGCCCCAACTTTGACGTGGGTTTGCAACAAAAAGTTTTTTTTTTTTTTTTAAACTATAAAAATCTGTTTATTAACTCCTGAAAGGGAGAAACCAAAACAGAAAAAATTTCGTCTTACAGAAACAAACTAACCAACTACTAACTAAATAAAAACCGAAGAAAACCCCAAAACAAACTAAAGAAAACCAAAAGAGATCCTCTCCCGGGAATCTGATTGCAAGAACCTCTGAACAAAACAGCAAACCAGCTTCAAAAATGTCTAGAAAGGAGAAGCTGCACACCCATTTGGAACTTGAGTTTGCAAAATGGGTTTTTTGGTTTAATAGGAGCACGAGCTGTCGGCTCTGCGAAGTCCATGATGGTAAGGAAAAAAGACCTTTAGGTCTCCTAGACAACCTGTGCACATCACAAACTTTGTCAGGAATGTACAAACGATTCCAGTTAAATTTTCACATTTATCAAGCTTTCATCCAAGTATTTATCAAATGATATATAATTAAATTCGCCTTCATCCACGAACTTAAGCTTTTAGGTCAACCGGTTATTAACGATGGTATCAAAACAAGTGGTCCTGGAGGGTTCTGTACTCAAGTCCCTGCCATGTTATTCCTCCTATTTCACTTGTTGGGTTTTCCACATATTTCGACCCCACAAGTGAGGTGGAGTGTTAGATGATTTATAATTAAATAAGCCTTCATCTATCAACTTAAGCCTTTTGGTCAATCGGTGATTTAAAAGTATCAAATAATTCAGCCAAAAATTATTGACAGCATGAAGTTCTATTTTAGTATAAATTCTTCTCTGGAGCAGTTTTAAATAGTTTGATCCGTCAATTACATGACTTCATATGAAGATTTACATAAACTGTGTCCAGGAGTTTGTAATTATAACTGGTGAAAATACAAACAAGCTATTATGTCTTCTTTTCTTTCTAATGCTGTATTGTTTATTTTTCTCCTTTTCAGATAAAGCTGTTGACATTGAAAAGGAGTCCAGAAATGTAAACGTCAAACCTAGTGTTGACGTCAGCCTCAGTGATTTGAAGCTTGTTCTTGGACCAGAATTAAGGATTCTTTATCCTTTGATTCTAAATTTTGCTGTTAGTGGAGAGCTTGAGCTTAACGGTTTTGCTCACGCTAAAAGCATTAAACCAAAAGGAACTTTAACTTTCGATAATGGTGACGTGAATCTTCTTGCGACCCAGGTATCATCTAATCTTTAACATACTGGTTTTTTTACTAAACAAAAGAAGCTGATATTAACCTTGACCATGCAGGTAAGACTGAAACGGGAGCATCTAAATATAGCAACTTTCGAACCTGAGAATGGGTTAGATCCTATGCTAGATTTAGCTTTAGTAGGTTCCGAGTGGCAGATAAGAATACAAAGTCGGGCAAGCAAATGGCAAGAAAAATTAGTGGTGACGTCAACCCGTTCTGTGGAACAAGATGCTCTCTCACCCACTGAGGTATTTTATTTTTGGTTTTATATATATAAATTTGTGAGTGTTTGAGGTGTCTTGACTAATATCACGAGACAACTTGTTTGATCCTACAACAATTGGGTGGTAAGGAAAGTTGTAGGATATTAAATCTATGGCCAACTAAGATTATTCCTCGATAAATTTCCTATCCTACAAAAGAAAATAACTTCAACTAAAATTGGAAAAAAATGTTTCCTTTTGTAGTTTGTCTAATAAAAAAAGGAATAACTTCAACTAAAAAAAAGAAAAAAATGTTAAAAATATATATATTATAAAAAATAGTGTATCCCAACGTGTCCGTGCTCTAGTATTTTAGAAATTGGCGTATCACCGTGTCCTATCGTGTTCGTGCCCATGTTTCTTAGATGCCAAGTGTGGAGAGTTCGGGGAAAACACTACAGGCTGACTTGACAAGTAGAAAACCTGAACTACGGGTTTATACTAGAAGAGACTTTGACTTGAAGGAATCGGGAACAAATAGTTGATCTATCACATAACCAACCTGTTTGTTCCAAGGAATATACAGGAGGCCTTAAATGATTTAAATTGGAAATTAGCAATAATGGAAGAGATGATTGCACCGAAACAAAATTGCATAAGGGACATAGTTGAACTACCAAAAGAAAAGTGAACAGTTGGATGCAATTAGGTGTTCACGGTAAAATGTAAAGCTGATGGTAGTGTTGAAGGTACAAGGCCAGATTGGTTGCTAAGGGGTTTACTCAGACGTATAGAATTGATTATCGAGAAACATTTGCTCCACTTGCTACATTCAGTTATATTAGAATTTTGTTGTCTGTTGTTGTTAACTTTGATTGGCCTCTTTATCAACTAGATGTTAAGAATGTCTCTCTCATTGGGGAACTGGAAAAAGAGGTACTTATGGACTTACCACCTGGTTTTGAGGTAGATCTCGAGATTAATGAACTGTGCAAGTTAAAGAGACCTTATATGGTCTTAAACAGTCTAGAGCTTGATTTGAACGCTTCAAACAAGCAGTCATGAGCTATGGATCTAGTCAAAGTCAAGCCGATTATGCAATGTTCTACAAACATACTGGAAATGACAATGTTGTGGTTTTGATAGTTTATGTTGATGATATCATTCTTACAGGCAATGATAAGACAAAACTGACTTTCGTGAAGAAAAAATTAGCCGATGACTTCCAAATCAAAGATTTAGGAACTTTAAAGTACTTTCTAGGCATGGAGTTTGTTAGGTCCAAAAGTACCATTTTTGTCAACCAAAGGAAGTATAGTCTTGATCTGCTAAAGGAGACAGATTTACTTGGTTGCAAGATAGCAAAAACTCTATTGAGTAGAATCTGAAATTGGAAGCTGCAACCAAGAATGAGATAAAGGAAAGAGAAAAGTACTAGAGACTTGTGGGGAGACTCACATATCTCTCGCACACGTGTCTTGACATCGCTTTTGCAATTTAAGTCAATTCATGCATGCTCAATGTTGGAAAAGCATATATTGATGTTAATTGGGAAAGTCGCATGACTCATACCTAAGATATTTGAAAAGTACCCCGGAAAAAGCATATTGTTTCAAAGACATTACCATCTCAATGTTGAAGTTTATATTGATGCTAATTGGGCAGGTCGTAAGACTGATAGAGAATCCACTTCTGGATACTGCTCCTTTGTTGGAGGAAATGTTGTTATTTGGCAAAGTAAAAAACAGTGTGTGGTTGCAAGAAGTGTAGAAGTAGAATTTAGAGTTTTAGCACGTGGTATTTGTGAGGGTATATGGATAAGAAGATTGTTGGAAGAATTGAGATTCACTTAGACAATGCCTATGCGCATTTACTGTGATAACAAAGCAGCAATTTCCATTGCCCACAATCTAGTCTTTCACGAGAGAACGAAACATATTGAAGTCGATAAACACTTTATAAAGGAAAGATCAATGCGGGAGTAATATGCATGCCCTATCTTCTGACAACAGAACAAATTGCCAATATGTTAACTAAAGGCCTTCTGAAGTGGCAATTCAACAATTTGATTGACAAGTTGGTAATGACTAATGTCTTCAAACCAGCTTGAGGGAGAGTGTTGATTGTTTCCTATTTGTATTATATTTTATTGCATTAGTTATATTATATTTGTCAAATTTATTTTTTTCTTTGTAATGGGTTGTTCTATTTAAGAAAACTCTTCTCTTTGTGAAGTAAAAGAAAAATACTATTTTGGCAGCTTGTTTCCATTTTTTAAAAAATAACGGACCAAACCTTACTAAACAGCAACTAAAAATTACTACATTGTAAAATATTAAAACACCTAACAAGAAAACATCACAGTTTATTCTAACATCCTGTAATATCGTTGGCTCAGGCTACTCGGGCATTTGAGAACCAATTGGCAGAATCAATATTGGAGTCAGGTGGGCAATTAGCCTTGGAGAAGCTTGCAACTGCTACGCTCGAGAAACTAATGCCTAGAATAGAAGGAAAGGGTGAATTTGGTCAGGCTAGGTGGAGACTAGTGTATGCTCCACAAATCCCAACTTTACTCTCCTTTCCAACTACTGATCCTCTCCTATCGCTCACCAGTAATATCTCAATTGGTACTGTTGTTGAAGTCCAGCTTGGAAAGCGCATTCAGGTTCTCTTTGCACCTTTTTATTATTGGGCTAATGAGCCTGAAGACTAATCCTTTCTATTTCAGAATCTTTATTGGGTTAAAATCAGCTTTAGTTCTGTTTTAAGAAATCAATCTTGTTATTTTTTGTTCTAAATAGAGAAACGAAATATTTATTTGATTAACAAGTTATGGTAAGTTTTTCTTAATATTTTTCAAATATTAGGAGGTTTTCAAAAATTTCAAACAGGTCTATCAAAAAATATTTTTCTTTTTGAAATTTTGGTTATAATCTTAAGAATGCTTAATAATGTAGAATGTATGATGAGAAAAACTGTTGATAAGACTGAATGAGGAAACTCTAATATGAAATTTCAGATTTTGTTCTATTGTTATCTTCTTAATTTCTTTCCCTCCTCCGCACTCTTCTAGGCCTCCATGATTCGCCAAATGAAGGAGACAGAGATGGCCATGCAATGGATGATCACGTACAAGCTTACGAGCCGCTTGCGGATGGTCCTTCAATCAGCTCCAGCTCAACGGACACTTTTGCTCGTTGAGTATTCTGCTACATCACTGGATTAATTTGTTTCTCCACACCAGAATAAATCTTCAGTTA
mRNA sequence
AAAAAGAAAAAAGAAGAAGAAGAAAACGAGAAAACGAGAAAACGAAACTTTATCGACTAAACGCTTGAAGTTTCTTAATACACTCCACCCGCGCTCTCAAAAATTGGATTCGCGGCTCACAGTGACGTAATCATTCTCTCTCTACAAGCAAAGACAAAATCGTTGCCGAATGAGCTCATACTTGTCCAAAATAATTTCCAATTCTTCCGTTTCCGATTCAAATCGCTATAGCTTCTTCCTTCATCCCAATTTGTATGTGTTTTTGTTTCTAAGCCATGCATTTAGGCAGATATGTTGAACTGGGAAGTGTTGGCGTCTCGTGCTGATTGATTTTGCTTTACCGTTTGTTTCTTTTGGAATGGGTGTTGGGAAGTGTTGTGTTGCGTGTGATTGAGAGGTTGTTATTTGTCTACTAACGTGGGGTTTAGGATTTCGAGCTACCAGGGTTATCTGATTCTCATATTGGGTGTAATTCTTTTTGTTGCTAATTTATGATTAGGGATTTTTGTTTGTTAGATGAAACGTAAGTAGCTCGTTTGATGTTGAAATTGGTAGCTGATCGATTCTGTTATATGCCTGAGTTCTTGCTGAGAGTTTGAATTGTCAGGGGACTAAGACCTGCTTCTTGGTCCCCCACAGATGAATCGTTTTTGGTTTAAGCGAGACTAAGAGGATAATGAATGTGAAACTTGATTCTTCTTTCTTGGGGACTCAACTTCATAGCTCATTGCATTGTGTAACGAATGGAAAATTTGTATATTTGGGTCGACGCCGATTGTCGAAAGGGGACTCTAAGAAGTATGTATGTGCAAAACATAACGAATGGAATGCTCGAGTAGAGAGGTTTTCGCGTTTTTTTGGGCAGCATTTGAAGTCATTGAGCATAAAGCTTAAGCCAAGACACGAGTCTTTGATGAAATGTGCCAATGAGCCTTTTGTTCAAACAAAATCTCTATCAAGTTTACTACGTCCTGTCTGGAATGAGGGGTTGTTTTTGATTAGATGTTCTGCATTCGCTGCTGTTGTATCTGGTATATGCTTACTGGTTTGGTATGGGCAGACAAAAGCCAAGGGCTTTGTTGAAGCTAAGCTTCTTCCTTCTGTTTGCAAAGCAGTCAGTGACTGCATTCAGCGTGATCTTGATTTTGGAAAGGTTAGAAGTATTTCACCCTTGAGCATCACATTAGAGTCGTGTTCCGTTGGTCCCGATGGTGAAGAATTCTCTTGCGGCGAAGTTCCCACCATGAAACTTCGTGTTTTACCTTTCACCAGTTTGAGGAGAGGGAGGGTAATAATTGATGTGGTATTGTCTCATCCAAGTGTGGTAGTTGTGCAGAAGAGGGATTATACGTGGTTGGGGCTGCCCTTTCCATCTGAAGGGACCTTGGAGAGGCATTCATCTTCAGAAGAAGGCATTGACAATCGTACAAAAATCAGGAGAATTGCCAGGGAAAATGCAGCTGCTCTCTGGTCCAAGGATAGGGATGATGCGGCTAGGGAAGCAGCTGAGATGGGTTTTGTTGTTTTTGACAGGAGCTCAGGTTTGTATGATACTAGTGATTATAAGGAGGTTGTAGGTCCTACAGTAGATATTGGAAACTCTAAAACATTTTTTTTCAAGGATGAGAATGTTCATTCGAGGGAACATCACTGCATGGATACGGATGTGGACTATAAAATAAGGCATGCAAAGTCAGAGAAGTATTTTGATGTAAAAAGCCCTGATACAAGGCTTAAATTTTCATCCAGAGCAATGAAAACGCTTATAAAAGGACAATCAAAGAGGAATGCAAGTGGAGACGATGTGTATGTAAATAGTTTTGCTGCAAAAAAGAGAATTCTTAGGCGTAGTACATTGGCAGCTCAGGATTATTTCAAGGGTGCATCTGAGGGGAAGTTTGGGGAGCCTTCACAATTACACAAGAGTTTTAATAATGCGAACCTTGACTCCTACTTGATCAAAAGTGGGAATGAAACTAATGCTGACTCCTCCATCACGGATACAGATGTCCAGTATGGGAAACAAAGTTTAGATGCCAGATTGAATTCTCTTAAAGAGAAAAGGGATATCGATATTCCGAACCATATAGATGATCAGACTTCTACAGTTACAGGCTTGGGAAATAAAGACAGAAGGTCTTTTTCAGCTACACCCAGCATTGATGAGTCCAATATGAGAAAGGAAGATGTTATAGGATCTGATCATGTTCCTGATGGAATATCTGATCACATGCGTAACACATCTCAAACACCAACTTCAACAGGTCATGAACATCAGCATGGAACATCTTGGCCAAATTCTTTCTGGGGACTAAGCCCAGAATCGGCTTTATCTTATTTTCCTAAAGATGTGGCCAAGAAGCTGTTGTACCATATTAGCATGTACGTTCAAAATCTCAAATCTGGCTTTGTTCAACACGCTAGAGGTGTCATAGATGGTGGAGATGTGATGAAGAATAAAGGAACCAATACAATGCTTCCAGTAACGATTGATTCTGTTCATTTCAAAGGCGGGACACTTATGTTACTTGCATACGGTGACAGAGAACCAAGAGAGATGGAGAATGTTAATGGGCATGTGAAATTTCAAAATCATTATGGCAACGTGCACGTTCATTTGAGTGGTAATTGTAAGTCATGGAGATCAGAATTTGTCTCTGGAGATGGTGGCTGGTTATCTGCGGATGTTTTTGTTGACATCTTCGAGCAGGAATGGCATTCAAATCTGAAAATTACCAACATATTTGTTCCGCTTTTTGAGAGAATTTTAGATATTCCAATCACGTGGTCCAAAGGAAGAGCCACTGGTGAGGTTCACTTGTGTATGTCAAGAGGAGATACATTTCCTAATTTTCAAGGGCAACTTGATGTGACCGGTTTAGCCTTTAAGATCTTTGATGCCCCATCAAGCTTTACTGAAATAGCGGCAACTTTATGTTTCCGTGGTCAGAGAATATTTGTACAGAATGCAAGTGGCTGGTTTGGCTGCGCTCCATTAGAGGCATCTGGTGACTTTGGCATTAATCCGGATGAAGGAGAGTTTCATTTGATGTGTCAGGTTCCCGGTGTTGAAGTAAATGCCCTGATGAAAACTTTCAAGATGAAACCTTTCTTATTCCCATTAGCTGGTTCGGTAACTGCTGTGTTCAACTGTCAAGGTCCACTGGATTCACCCATCTTTGTAGGAAGTGGAATGGTTTCCAGGAAGATGAATAATTTATTCTCGGATCTTCCTGCATCCTGTGCTTCAGAAGCAATTGTGAAAAGTAAAGAAGGTGGTGCAATAGCAGCAGTTGACCGTATTCCATTTTCCTATGTCTCAGCAAATTTCACTTTCAGCATCGACAATTGTGTTGCTGACTTATATGGAATTAGAGCCAACCTTGTGGATGGTGGTGAAATTCGAGGCGCCGGGAATGCATGGATATGCCCAGAGGGTGAGCTGGATGATACGGCAATGGATTTAAATTTTTCAGGAAATATATCATTGGATAAAATCATGCATCGATATATGCCTGGCTATTCAGATTGGATGCCACTTAAATTGGGACTTCTAAATGGGGAAACCAAAGTTTCCGGATCCCTCTTGAGACCGAGGTTTAACATTAATTGGACTGCACCACTTGCTGAAGGATCGTTCAGGGATGCTCGAGGAGATATCAATATTTCCCACGATTATATTATTGTTAATTCCTCCTCTGTTGCTTTTGAACTCTTCTCAAAAGTGCAAACATCGTATTCTGACAAGATTATGCTTGATGAAGAAGTGTTTGATACAAAGAGGACTCCATCATTTACTATTGATGGAGTGGAGTTGGACTTACATATGCGTGGTTTTGAGTTCTTGAGCTTAGTCTCTTATATTTTTGAATCTCCAAGGCCTATGCATCTAAAAGCAACCGGAAGGGTTAAGTTTGTGGGGAAAGTTTTGAGACCATCTAGTAAAGATTTTAGTAATGAGAAGAGTAAGCATCAGGTGCAGCCAATTGATGAAGAGAATAAAAATGGTCTTGCTGGCGAGGTTTCAATTTCTGGTCTTAAGCTGAATCAATTGGTTCTGGCCCCTAAGCTTGCTGGGCTATTGAGCATGACTCGTGAGTCAATCAAGTTGGATACCACAGGTAGGCCAGATGAAAGTCTTTCAGTAGAAATTGTGGGGTCATTGAAGCCTAACTCAGATAATTCTAGAAAGTCGAAGTTGTTCTCTTTTAATCTTCAACGGGGGCAATTAAGAGCCAATGCACGTTACCAACCATCTAGATCTGCACATTTGGAGTTACGTCATTTGCCGCTAGATGACTTGGAGCTGGCTTCACTTAGGGGAGCAATACAAAGGGCAGAAATTGAACTTAATCTTCAGAAAAGAAGAGGTCATGGGGTTTTATCAGTACTTGACCCAAAATTTAGCGGCGTGTTGGGAGAAGCTTTAGATATAGCTGCTAGGTGGAGCGGAGACGTGATCACCATCGAGAAAACTATTCTAGAGCAAAGCAATAGCCGTTATGAGCTTCAAGGTGAATATGTGCTTCCTGGAAGCCGTGATCGAAATGTTACCGATAAGGAAAGTACTGGATTTTTGAAAAAGGCAATGGCTTCCCATCTGAGCAGTGTCATATCCTCCATGGGAAGATGGAGGATGAGACTGGAAGTTCCTAGGGCTGAGGTTGCTGAGATGCTTCCACTGGCCCGCCTTCTATCGCGAAGTGCAGATCCTTCAGTTCACTCAAGATCCAAGGATCTTTTTATACAAAATTTGCAAGCTGTTGGATTATACACCGAAAGTGTTCAAGATCTGATTGAGGTGATACGGAGACAATTTATTCTATCAGATGAAATTGTTCTTGAAGATTTATCTCTTCCAGGCTTATCTGAACTCAGAGGCTGCTGGCGAGGCTCTCTTGATGCAAGTGGTGGAGGCAATGGGGATACAATGGCGGAGTTTGACTTCCATGGAGAGGACTGGGAGTGGGGAGTCTACAAAACGCAGCGTGTTTTGGCTGTTGGTGCATACAGCAACAATGATGGTCTGCGGCTGGAAAAAATTTTTATCCAGAAGGATAATGCCACAGTGCATGCAGATGGAACTTTGTTTGGCCCAATAACTAATCTTCACTTTGCTGTTCTAAACTTTCCTGTCAGTCTAGTTCCTGCTGCGGTTCAGGTAATTGAATCTTCAGCAAAAGATCTTGTTCACTCATTGAGGCAACTTGTAGCTCCTATTAGAGGTATACTGCACATGGAGGGAGATCTCAGAGGCAATCTTGCAAAACCAGAATGTGATGTGCAAGTAAGGCTCCTTGATGGTGCCATTGGTGGTGTTGATCTCGGTCGTGCTGAAGTTGTTGCTTCTCTTACCTCAGGCAGCCGTTTTTTGTTCAATGCAAAATTTGAACCAATTATCCAAAATGGCCATGTACATGTTCAAGGAAGCATTCCTGTGATGTTTGTCCAAAATAATATGGGAGAGGTGGAAGAAGTGGAAACAGATACAAGTAGGGGAACTTTGGTTCATGCTTGGGGGAAGGAAAAGGTACGGGAGAAGTTCAATGACAGAAAAAGTTCTAGAGATAGAAATGAAGAAGGTTGGAATACTCAATTGGCTGAAGGGCTTAAAGGATTAAACTGGAGTCTTTTGGATGTAGGAGAAGTTAGGATTGATGCAGATATTAAAGATGGAGGTATGTTGCTACTTACAGCTTTGTCTCCTCATGTCAACTGGCTTCATGGAAATGCTGACATCTTACTTCAGGTTAGAGGGACCATTGAGGAACCAATACTCGACGGATCCGCATCTTTCCACAGGGCTTCAATATCTTCTCCTGTGCTCCCTAAACCACTCACCAACTTCGGTGGTACACTCCATGTAAGATCAAATAGGCTTTGCATCAATTCCTTGGAAAGTCGTGTAAGCCGAAGAGGGAAGTTGATTCTGAAAGGGAATCTGCCCCTTAGATCCAGTGAAGCATGCCTCGATGACAAGATTGATTTAAAATGTGAAGTTCTGGAAGTACGAGCAAAGAATATTTTCAGTGGCCAAGTTGACTCTCAAATGCAAATTACTGGGTCGATATTGCAACCGAACATCTCTGGAAATATTCAATTGAGTCGTGGAGAAGCATATCTACCACATGATAAGGGTAGTGGAGCTGCTTCATTTAATAAAGTGGTATCAGACCAGTTCAGTCTTCCTCCTGGCAGTTCGAACCAAGTAGTTGCTTCTAAATATGCTTCGTTTTTCAATTCGGAATCTACAGCATTGAAAACTAGATTCCGTGTACCCCGAGATAAAGCTGTTGACATTGAAAAGGAGTCCAGAAATGTAAACGTCAAACCTAGTGTTGACGTCAGCCTCAGTGATTTGAAGCTTGTTCTTGGACCAGAATTAAGGATTCTTTATCCTTTGATTCTAAATTTTGCTGTTAGTGGAGAGCTTGAGCTTAACGGTTTTGCTCACGCTAAAAGCATTAAACCAAAAGGAACTTTAACTTTCGATAATGGTGACGTGAATCTTCTTGCGACCCAGGTAAGACTGAAACGGGAGCATCTAAATATAGCAACTTTCGAACCTGAGAATGGGTTAGATCCTATGCTAGATTTAGCTTTAGTAGGTTCCGAGTGGCAGATAAGAATACAAAGTCGGGCAAGCAAATGGCAAGAAAAATTAGTGGTGACGTCAACCCGTTCTGTGGAACAAGATGCTCTCTCACCCACTGAGATGCCAAGTGTGGAGAGTTCGGGGAAAACACTACAGGCTGACTTGACAAGTAGAAAACCTGAACTACGGGCTACTCGGGCATTTGAGAACCAATTGGCAGAATCAATATTGGAGTCAGGTGGGCAATTAGCCTTGGAGAAGCTTGCAACTGCTACGCTCGAGAAACTAATGCCTAGAATAGAAGGAAAGGGTGAATTTGGTCAGGCTAGGTGGAGACTAGTGTATGCTCCACAAATCCCAACTTTACTCTCCTTTCCAACTACTGATCCTCTCCTATCGCTCACCAGTAATATCTCAATTGGTACTGTTGTTGAAGTCCAGCTTGGAAAGCGCATTCAGGAGGTTTTCAAAAATTTCAAACAGGCCTCCATGATTCGCCAAATGAAGGAGACAGAGATGGCCATGCAATGGATGATCACGTACAAGCTTACGAGCCGCTTGCGGATGGTCCTTCAATCAGCTCCAGCTCAACGGACACTTTTGCTCGTTGAGTATTCTGCTACATCACTGGATTAATTTGTTTCTCCACACCAGAATAAATCTTCAGTTA
Coding sequence (CDS)
ATGAATGTGAAACTTGATTCTTCTTTCTTGGGGACTCAACTTCATAGCTCATTGCATTGTGTAACGAATGGAAAATTTGTATATTTGGGTCGACGCCGATTGTCGAAAGGGGACTCTAAGAAGTATGTATGTGCAAAACATAACGAATGGAATGCTCGAGTAGAGAGGTTTTCGCGTTTTTTTGGGCAGCATTTGAAGTCATTGAGCATAAAGCTTAAGCCAAGACACGAGTCTTTGATGAAATGTGCCAATGAGCCTTTTGTTCAAACAAAATCTCTATCAAGTTTACTACGTCCTGTCTGGAATGAGGGGTTGTTTTTGATTAGATGTTCTGCATTCGCTGCTGTTGTATCTGGTATATGCTTACTGGTTTGGTATGGGCAGACAAAAGCCAAGGGCTTTGTTGAAGCTAAGCTTCTTCCTTCTGTTTGCAAAGCAGTCAGTGACTGCATTCAGCGTGATCTTGATTTTGGAAAGGTTAGAAGTATTTCACCCTTGAGCATCACATTAGAGTCGTGTTCCGTTGGTCCCGATGGTGAAGAATTCTCTTGCGGCGAAGTTCCCACCATGAAACTTCGTGTTTTACCTTTCACCAGTTTGAGGAGAGGGAGGGTAATAATTGATGTGGTATTGTCTCATCCAAGTGTGGTAGTTGTGCAGAAGAGGGATTATACGTGGTTGGGGCTGCCCTTTCCATCTGAAGGGACCTTGGAGAGGCATTCATCTTCAGAAGAAGGCATTGACAATCGTACAAAAATCAGGAGAATTGCCAGGGAAAATGCAGCTGCTCTCTGGTCCAAGGATAGGGATGATGCGGCTAGGGAAGCAGCTGAGATGGGTTTTGTTGTTTTTGACAGGAGCTCAGGTTTGTATGATACTAGTGATTATAAGGAGGTTGTAGGTCCTACAGTAGATATTGGAAACTCTAAAACATTTTTTTTCAAGGATGAGAATGTTCATTCGAGGGAACATCACTGCATGGATACGGATGTGGACTATAAAATAAGGCATGCAAAGTCAGAGAAGTATTTTGATGTAAAAAGCCCTGATACAAGGCTTAAATTTTCATCCAGAGCAATGAAAACGCTTATAAAAGGACAATCAAAGAGGAATGCAAGTGGAGACGATGTGTATGTAAATAGTTTTGCTGCAAAAAAGAGAATTCTTAGGCGTAGTACATTGGCAGCTCAGGATTATTTCAAGGGTGCATCTGAGGGGAAGTTTGGGGAGCCTTCACAATTACACAAGAGTTTTAATAATGCGAACCTTGACTCCTACTTGATCAAAAGTGGGAATGAAACTAATGCTGACTCCTCCATCACGGATACAGATGTCCAGTATGGGAAACAAAGTTTAGATGCCAGATTGAATTCTCTTAAAGAGAAAAGGGATATCGATATTCCGAACCATATAGATGATCAGACTTCTACAGTTACAGGCTTGGGAAATAAAGACAGAAGGTCTTTTTCAGCTACACCCAGCATTGATGAGTCCAATATGAGAAAGGAAGATGTTATAGGATCTGATCATGTTCCTGATGGAATATCTGATCACATGCGTAACACATCTCAAACACCAACTTCAACAGGTCATGAACATCAGCATGGAACATCTTGGCCAAATTCTTTCTGGGGACTAAGCCCAGAATCGGCTTTATCTTATTTTCCTAAAGATGTGGCCAAGAAGCTGTTGTACCATATTAGCATGTACGTTCAAAATCTCAAATCTGGCTTTGTTCAACACGCTAGAGGTGTCATAGATGGTGGAGATGTGATGAAGAATAAAGGAACCAATACAATGCTTCCAGTAACGATTGATTCTGTTCATTTCAAAGGCGGGACACTTATGTTACTTGCATACGGTGACAGAGAACCAAGAGAGATGGAGAATGTTAATGGGCATGTGAAATTTCAAAATCATTATGGCAACGTGCACGTTCATTTGAGTGGTAATTGTAAGTCATGGAGATCAGAATTTGTCTCTGGAGATGGTGGCTGGTTATCTGCGGATGTTTTTGTTGACATCTTCGAGCAGGAATGGCATTCAAATCTGAAAATTACCAACATATTTGTTCCGCTTTTTGAGAGAATTTTAGATATTCCAATCACGTGGTCCAAAGGAAGAGCCACTGGTGAGGTTCACTTGTGTATGTCAAGAGGAGATACATTTCCTAATTTTCAAGGGCAACTTGATGTGACCGGTTTAGCCTTTAAGATCTTTGATGCCCCATCAAGCTTTACTGAAATAGCGGCAACTTTATGTTTCCGTGGTCAGAGAATATTTGTACAGAATGCAAGTGGCTGGTTTGGCTGCGCTCCATTAGAGGCATCTGGTGACTTTGGCATTAATCCGGATGAAGGAGAGTTTCATTTGATGTGTCAGGTTCCCGGTGTTGAAGTAAATGCCCTGATGAAAACTTTCAAGATGAAACCTTTCTTATTCCCATTAGCTGGTTCGGTAACTGCTGTGTTCAACTGTCAAGGTCCACTGGATTCACCCATCTTTGTAGGAAGTGGAATGGTTTCCAGGAAGATGAATAATTTATTCTCGGATCTTCCTGCATCCTGTGCTTCAGAAGCAATTGTGAAAAGTAAAGAAGGTGGTGCAATAGCAGCAGTTGACCGTATTCCATTTTCCTATGTCTCAGCAAATTTCACTTTCAGCATCGACAATTGTGTTGCTGACTTATATGGAATTAGAGCCAACCTTGTGGATGGTGGTGAAATTCGAGGCGCCGGGAATGCATGGATATGCCCAGAGGGTGAGCTGGATGATACGGCAATGGATTTAAATTTTTCAGGAAATATATCATTGGATAAAATCATGCATCGATATATGCCTGGCTATTCAGATTGGATGCCACTTAAATTGGGACTTCTAAATGGGGAAACCAAAGTTTCCGGATCCCTCTTGAGACCGAGGTTTAACATTAATTGGACTGCACCACTTGCTGAAGGATCGTTCAGGGATGCTCGAGGAGATATCAATATTTCCCACGATTATATTATTGTTAATTCCTCCTCTGTTGCTTTTGAACTCTTCTCAAAAGTGCAAACATCGTATTCTGACAAGATTATGCTTGATGAAGAAGTGTTTGATACAAAGAGGACTCCATCATTTACTATTGATGGAGTGGAGTTGGACTTACATATGCGTGGTTTTGAGTTCTTGAGCTTAGTCTCTTATATTTTTGAATCTCCAAGGCCTATGCATCTAAAAGCAACCGGAAGGGTTAAGTTTGTGGGGAAAGTTTTGAGACCATCTAGTAAAGATTTTAGTAATGAGAAGAGTAAGCATCAGGTGCAGCCAATTGATGAAGAGAATAAAAATGGTCTTGCTGGCGAGGTTTCAATTTCTGGTCTTAAGCTGAATCAATTGGTTCTGGCCCCTAAGCTTGCTGGGCTATTGAGCATGACTCGTGAGTCAATCAAGTTGGATACCACAGGTAGGCCAGATGAAAGTCTTTCAGTAGAAATTGTGGGGTCATTGAAGCCTAACTCAGATAATTCTAGAAAGTCGAAGTTGTTCTCTTTTAATCTTCAACGGGGGCAATTAAGAGCCAATGCACGTTACCAACCATCTAGATCTGCACATTTGGAGTTACGTCATTTGCCGCTAGATGACTTGGAGCTGGCTTCACTTAGGGGAGCAATACAAAGGGCAGAAATTGAACTTAATCTTCAGAAAAGAAGAGGTCATGGGGTTTTATCAGTACTTGACCCAAAATTTAGCGGCGTGTTGGGAGAAGCTTTAGATATAGCTGCTAGGTGGAGCGGAGACGTGATCACCATCGAGAAAACTATTCTAGAGCAAAGCAATAGCCGTTATGAGCTTCAAGGTGAATATGTGCTTCCTGGAAGCCGTGATCGAAATGTTACCGATAAGGAAAGTACTGGATTTTTGAAAAAGGCAATGGCTTCCCATCTGAGCAGTGTCATATCCTCCATGGGAAGATGGAGGATGAGACTGGAAGTTCCTAGGGCTGAGGTTGCTGAGATGCTTCCACTGGCCCGCCTTCTATCGCGAAGTGCAGATCCTTCAGTTCACTCAAGATCCAAGGATCTTTTTATACAAAATTTGCAAGCTGTTGGATTATACACCGAAAGTGTTCAAGATCTGATTGAGGTGATACGGAGACAATTTATTCTATCAGATGAAATTGTTCTTGAAGATTTATCTCTTCCAGGCTTATCTGAACTCAGAGGCTGCTGGCGAGGCTCTCTTGATGCAAGTGGTGGAGGCAATGGGGATACAATGGCGGAGTTTGACTTCCATGGAGAGGACTGGGAGTGGGGAGTCTACAAAACGCAGCGTGTTTTGGCTGTTGGTGCATACAGCAACAATGATGGTCTGCGGCTGGAAAAAATTTTTATCCAGAAGGATAATGCCACAGTGCATGCAGATGGAACTTTGTTTGGCCCAATAACTAATCTTCACTTTGCTGTTCTAAACTTTCCTGTCAGTCTAGTTCCTGCTGCGGTTCAGGTAATTGAATCTTCAGCAAAAGATCTTGTTCACTCATTGAGGCAACTTGTAGCTCCTATTAGAGGTATACTGCACATGGAGGGAGATCTCAGAGGCAATCTTGCAAAACCAGAATGTGATGTGCAAGTAAGGCTCCTTGATGGTGCCATTGGTGGTGTTGATCTCGGTCGTGCTGAAGTTGTTGCTTCTCTTACCTCAGGCAGCCGTTTTTTGTTCAATGCAAAATTTGAACCAATTATCCAAAATGGCCATGTACATGTTCAAGGAAGCATTCCTGTGATGTTTGTCCAAAATAATATGGGAGAGGTGGAAGAAGTGGAAACAGATACAAGTAGGGGAACTTTGGTTCATGCTTGGGGGAAGGAAAAGGTACGGGAGAAGTTCAATGACAGAAAAAGTTCTAGAGATAGAAATGAAGAAGGTTGGAATACTCAATTGGCTGAAGGGCTTAAAGGATTAAACTGGAGTCTTTTGGATGTAGGAGAAGTTAGGATTGATGCAGATATTAAAGATGGAGGTATGTTGCTACTTACAGCTTTGTCTCCTCATGTCAACTGGCTTCATGGAAATGCTGACATCTTACTTCAGGTTAGAGGGACCATTGAGGAACCAATACTCGACGGATCCGCATCTTTCCACAGGGCTTCAATATCTTCTCCTGTGCTCCCTAAACCACTCACCAACTTCGGTGGTACACTCCATGTAAGATCAAATAGGCTTTGCATCAATTCCTTGGAAAGTCGTGTAAGCCGAAGAGGGAAGTTGATTCTGAAAGGGAATCTGCCCCTTAGATCCAGTGAAGCATGCCTCGATGACAAGATTGATTTAAAATGTGAAGTTCTGGAAGTACGAGCAAAGAATATTTTCAGTGGCCAAGTTGACTCTCAAATGCAAATTACTGGGTCGATATTGCAACCGAACATCTCTGGAAATATTCAATTGAGTCGTGGAGAAGCATATCTACCACATGATAAGGGTAGTGGAGCTGCTTCATTTAATAAAGTGGTATCAGACCAGTTCAGTCTTCCTCCTGGCAGTTCGAACCAAGTAGTTGCTTCTAAATATGCTTCGTTTTTCAATTCGGAATCTACAGCATTGAAAACTAGATTCCGTGTACCCCGAGATAAAGCTGTTGACATTGAAAAGGAGTCCAGAAATGTAAACGTCAAACCTAGTGTTGACGTCAGCCTCAGTGATTTGAAGCTTGTTCTTGGACCAGAATTAAGGATTCTTTATCCTTTGATTCTAAATTTTGCTGTTAGTGGAGAGCTTGAGCTTAACGGTTTTGCTCACGCTAAAAGCATTAAACCAAAAGGAACTTTAACTTTCGATAATGGTGACGTGAATCTTCTTGCGACCCAGGTAAGACTGAAACGGGAGCATCTAAATATAGCAACTTTCGAACCTGAGAATGGGTTAGATCCTATGCTAGATTTAGCTTTAGTAGGTTCCGAGTGGCAGATAAGAATACAAAGTCGGGCAAGCAAATGGCAAGAAAAATTAGTGGTGACGTCAACCCGTTCTGTGGAACAAGATGCTCTCTCACCCACTGAGATGCCAAGTGTGGAGAGTTCGGGGAAAACACTACAGGCTGACTTGACAAGTAGAAAACCTGAACTACGGGCTACTCGGGCATTTGAGAACCAATTGGCAGAATCAATATTGGAGTCAGGTGGGCAATTAGCCTTGGAGAAGCTTGCAACTGCTACGCTCGAGAAACTAATGCCTAGAATAGAAGGAAAGGGTGAATTTGGTCAGGCTAGGTGGAGACTAGTGTATGCTCCACAAATCCCAACTTTACTCTCCTTTCCAACTACTGATCCTCTCCTATCGCTCACCAGTAATATCTCAATTGGTACTGTTGTTGAAGTCCAGCTTGGAAAGCGCATTCAGGAGGTTTTCAAAAATTTCAAACAGGCCTCCATGATTCGCCAAATGAAGGAGACAGAGATGGCCATGCAATGGATGATCACGTACAAGCTTACGAGCCGCTTGCGGATGGTCCTTCAATCAGCTCCAGCTCAACGGACACTTTTGCTCGTTGAGTATTCTGCTACATCACTGGATTAA
Protein sequence
MNVKLDSSFLGTQLHSSLHCVTNGKFVYLGRRRLSKGDSKKYVCAKHNEWNARVERFSRFFGQHLKSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFAAVVSGICLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDGEEFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLERHSSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVVGPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKIRHAKSEKYFDVKSPDTRLKFSSRAMKTLIKGQSKRNASGDDVYVNSFAAKKRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNNANLDSYLIKSGNETNADSSITDTDVQYGKQSLDARLNSLKEKRDIDIPNHIDDQTSTVTGLGNKDRRSFSATPSIDESNMRKEDVIGSDHVPDGISDHMRNTSQTPTSTGHEHQHGTSWPNSFWGLSPESALSYFPKDVAKKLLYHISMYVQNLKSGFVQHARGVIDGGDVMKNKGTNTMLPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFVSGDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRGDTFPNFQGQLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFGCAPLEASGDFGINPDEGEFHLMCQVPGVEVNALMKTFKMKPFLFPLAGSVTAVFNCQGPLDSPIFVGSGMVSRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFSIDNCVADLYGIRANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHRYMPGYSDWMPLKLGLLNGETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTSYSDKIMLDEEVFDTKRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVKFVGKVLRPSSKDFSNEKSKHQVQPIDEENKNGLAGEVSISGLKLNQLVLAPKLAGLLSMTRESIKLDTTGRPDESLSVEIVGSLKPNSDNSRKSKLFSFNLQRGQLRANARYQPSRSAHLELRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARWSGDVITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMGRWRMRLEVPRAEVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTESVQDLIEVIRRQFILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQRVLAVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVIESSAKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVASLTSGSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGTLVHAWGKEKVREKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSPHVNWLHGNADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLHVRSNRLCINSLESRVSRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGSILQPNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKYASFFNSESTALKTRFRVPRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYPLILNFAVSGELELNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLALVGSEWQIRIQSRASKWQEKLVVTSTRSVEQDALSPTEMPSVESSGKTLQADLTSRKPELRATRAFENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTLLSFPTTDPLLSLTSNISIGTVVEVQLGKRIQEVFKNFKQASMIRQMKETEMAMQWMITYKLTSRLRMVLQSAPAQRTLLLVEYSATSLD
Homology
BLAST of MELO3C003666 vs. NCBI nr
Match:
XP_008466365.1 (PREDICTED: uncharacterized protein LOC103503795 [Cucumis melo])
HSP 1 Score: 4242.6 bits (11002), Expect = 0.0e+00
Identity = 2153/2184 (98.58%), Postives = 2153/2184 (98.58%), Query Frame = 0
Query: 1 MNVKLDSSFLGTQLHSSLHCVTNGKFVYLGRRRLSKGDSKKYVCAKHNEWNARVERFSRF 60
MNVKLDSSFLGTQLHSSLHCVTNGKFVYLGRRRLSKGDSKKYVCAKHNEWNARVERFSRF
Sbjct: 1 MNVKLDSSFLGTQLHSSLHCVTNGKFVYLGRRRLSKGDSKKYVCAKHNEWNARVERFSRF 60
Query: 61 FGQHLKSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFAAVVSGI 120
FGQHLKSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFAAVVSGI
Sbjct: 61 FGQHLKSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFAAVVSGI 120
Query: 121 CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDGE 180
CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDGE
Sbjct: 121 CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDGE 180
Query: 181 EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLERH 240
EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLERH
Sbjct: 181 EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLERH 240
Query: 241 SSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVV 300
SSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVV
Sbjct: 241 SSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVV 300
Query: 301 GPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKIRHAKSEKYFDVKSPDTRLKFSSRAM 360
GPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKIRHAKSEKYFDVKSPDTRLKFSSRAM
Sbjct: 301 GPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKIRHAKSEKYFDVKSPDTRLKFSSRAM 360
Query: 361 KTLIKGQSKRNASGDDVYVNSFAAKKRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNN 420
KTLIKGQSKRNASGDDVYVNSFAAKKRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNN
Sbjct: 361 KTLIKGQSKRNASGDDVYVNSFAAKKRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNN 420
Query: 421 ANLDSYLIKSGNETNADSSITDTDVQYGKQSLDARLNSLKEKRDIDIPNHIDDQTSTVTG 480
ANLDSYLIKSGNETNADSSITDTDVQYGKQSLDARLNSLKEKRDIDIPNHIDDQTSTVTG
Sbjct: 421 ANLDSYLIKSGNETNADSSITDTDVQYGKQSLDARLNSLKEKRDIDIPNHIDDQTSTVTG 480
Query: 481 LGNKDRRSFSATPSIDESNMRKEDVIGSDHVPDGISDHMRNTSQTPTSTGHEHQHGTSWP 540
LGNKDRRSFSATPSIDESNMRKEDVIGSDHVPDGISDHMRNTSQTPTSTGHEHQHGTSWP
Sbjct: 481 LGNKDRRSFSATPSIDESNMRKEDVIGSDHVPDGISDHMRNTSQTPTSTGHEHQHGTSWP 540
Query: 541 NSFWGLSPESALSYFPKDVAKKLLYHISMYVQNLKSGFVQHARGVIDGGDVMKNKGTNTM 600
NSFWGLSPESALSYFPKDVAKKLLYHISMYVQNLKSGFVQHARGVIDGGDVMKNKGTNTM
Sbjct: 541 NSFWGLSPESALSYFPKDVAKKLLYHISMYVQNLKSGFVQHARGVIDGGDVMKNKGTNTM 600
Query: 601 LPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFV 660
LPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFV
Sbjct: 601 LPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFV 660
Query: 661 SGDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRG 720
SGDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRG
Sbjct: 661 SGDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRG 720
Query: 721 DTFPNFQGQLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFGCAPLEASGDF 780
DTFPNFQGQLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFGCAPLEASGDF
Sbjct: 721 DTFPNFQGQLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFGCAPLEASGDF 780
Query: 781 GINPDEGEFHLMCQVPGVEVNALMKTFKMKPFLFPLAGSVTAVFNCQGPLDSPIFVGSGM 840
GINPDEGEFHLMCQVPGVEVNALMKTFKMKPFLFPLAGSVTAVFNCQGPLDSPIFVGSGM
Sbjct: 781 GINPDEGEFHLMCQVPGVEVNALMKTFKMKPFLFPLAGSVTAVFNCQGPLDSPIFVGSGM 840
Query: 841 VSRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFSIDNCVADLYGIR 900
VSRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFSIDNCVADLYGIR
Sbjct: 841 VSRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFSIDNCVADLYGIR 900
Query: 901 ANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHRYMPGYSDWMPLKLGLL 960
ANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHRYMPGYSDWMPLKLGLL
Sbjct: 901 ANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHRYMPGYSDWMPLKLGLL 960
Query: 961 NGETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTS 1020
NGETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTS
Sbjct: 961 NGETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTS 1020
Query: 1021 YSDKIMLDEEVFDTKRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVKF 1080
YSDKIMLDEEVFDTKRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVKF
Sbjct: 1021 YSDKIMLDEEVFDTKRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVKF 1080
Query: 1081 VGKVLRPSSKDFSNEKSKHQVQPIDEENKNGLAGEVSISGLKLNQLVLAPKLAGLLSMTR 1140
VGKVLRPSSKDFSNEKSKHQVQPIDEENKNGLAGEVSISGLKLNQLVLAPKLAGLLSMTR
Sbjct: 1081 VGKVLRPSSKDFSNEKSKHQVQPIDEENKNGLAGEVSISGLKLNQLVLAPKLAGLLSMTR 1140
Query: 1141 ESIKLDTTGRPDESLSVEIVGSLKPNSDNSRKSKLFSFNLQRGQLRANARYQPSRSAHLE 1200
ESIKLDTTGRPDESLSVEIVGSLKPNSDNSRKSKLFSFNLQRGQLRANARYQPSRSAHLE
Sbjct: 1141 ESIKLDTTGRPDESLSVEIVGSLKPNSDNSRKSKLFSFNLQRGQLRANARYQPSRSAHLE 1200
Query: 1201 LRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARWSGD 1260
LRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARWSGD
Sbjct: 1201 LRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARWSGD 1260
Query: 1261 VITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMGRWR 1320
VITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMGRWR
Sbjct: 1261 VITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMGRWR 1320
Query: 1321 MRLEVPRAEVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTESVQDLIEVIRRQ 1380
MRLEVPRAEVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTESVQDLIEVIRRQ
Sbjct: 1321 MRLEVPRAEVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTESVQDLIEVIRRQ 1380
Query: 1381 FILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQRVL 1440
FILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQRVL
Sbjct: 1381 FILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQRVL 1440
Query: 1441 AVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVIESS 1500
AVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVIESS
Sbjct: 1441 AVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVIESS 1500
Query: 1501 AKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVASLT 1560
AKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVASLT
Sbjct: 1501 AKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVASLT 1560
Query: 1561 SGSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGTLVHAWGKEKVR 1620
SGSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGTLVHAWGKEKVR
Sbjct: 1561 SGSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGTLVHAWGKEKVR 1620
Query: 1621 EKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSPHVN 1680
EKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSPHVN
Sbjct: 1621 EKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSPHVN 1680
Query: 1681 WLHGNADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLHVRSNRLCINSL 1740
WLHGNADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLHVRSNRLCINSL
Sbjct: 1681 WLHGNADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLHVRSNRLCINSL 1740
Query: 1741 ESRVSRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGSILQ 1800
ESRVSRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGSILQ
Sbjct: 1741 ESRVSRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGSILQ 1800
Query: 1801 PNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKYASFFNSESTA 1860
PNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKYASFFNSESTA
Sbjct: 1801 PNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKYASFFNSESTA 1860
Query: 1861 LKTRFRVPRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYPLILNFAVSGELE 1920
LKTRFRVPRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYPLILNFAVSGELE
Sbjct: 1861 LKTRFRVPRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYPLILNFAVSGELE 1920
Query: 1921 LNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLALVGSE 1980
LNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLALVGSE
Sbjct: 1921 LNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLALVGSE 1980
Query: 1981 WQIRIQSRASKWQEKLVVTSTRSVEQDALSPTEMPSVESSGKTLQADLTSRKPELRATRA 2040
WQIRIQSRASKWQEKLVVTSTRSVEQDALSPTE ATRA
Sbjct: 1981 WQIRIQSRASKWQEKLVVTSTRSVEQDALSPTE-----------------------ATRA 2040
Query: 2041 FENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTLLSFPT 2100
FENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTLLSFPT
Sbjct: 2041 FENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTLLSFPT 2100
Query: 2101 TDPLLSLTSNISIGTVVEVQLGKRIQEVFKNFKQASMIRQMKETEMAMQWMITYKLTSRL 2160
TDPLLSLTSNISIGTVVEVQLGKRI QASMIRQMKETEMAMQWMITYKLTSRL
Sbjct: 2101 TDPLLSLTSNISIGTVVEVQLGKRI--------QASMIRQMKETEMAMQWMITYKLTSRL 2153
Query: 2161 RMVLQSAPAQRTLLLVEYSATSLD 2185
RMVLQSAPAQRTLLLVEYSATSLD
Sbjct: 2161 RMVLQSAPAQRTLLLVEYSATSLD 2153
BLAST of MELO3C003666 vs. NCBI nr
Match:
XP_011652500.1 (protein TIC236, chloroplastic [Cucumis sativus])
HSP 1 Score: 4123.2 bits (10692), Expect = 0.0e+00
Identity = 2086/2184 (95.51%), Postives = 2112/2184 (96.70%), Query Frame = 0
Query: 1 MNVKLDSSFLGTQLHSSLHCVTNGKFVYLGRRRLSKGDSKKYVCAKHNEWNARVERFSRF 60
MNVKLDSSF GTQLHSSLHC+ NGKFVYLG+ RLSK DSKKYVCA+HN+WNARV+RFSRF
Sbjct: 1 MNVKLDSSFFGTQLHSSLHCIKNGKFVYLGQGRLSKRDSKKYVCAEHNDWNARVDRFSRF 60
Query: 61 FGQHLKSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFAAVVSGI 120
FGQHL+SLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAF AVVSGI
Sbjct: 61 FGQHLRSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFVAVVSGI 120
Query: 121 CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDGE 180
CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSV PDGE
Sbjct: 121 CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVSPDGE 180
Query: 181 EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLERH 240
EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTL+RH
Sbjct: 181 EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLQRH 240
Query: 241 SSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVV 300
SSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVV
Sbjct: 241 SSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVV 300
Query: 301 GPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKIRHAKSEKYFDVKSPDTRLKFSSRAM 360
GPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYK RHAKSEKYFDVKSPDTRLKFSSRAM
Sbjct: 301 GPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKTRHAKSEKYFDVKSPDTRLKFSSRAM 360
Query: 361 KTLIKGQSKRNASGDDVYVNSFAAKKRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNN 420
KT IKGQSKRNASGDDVYVNSFAAK+RILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNN
Sbjct: 361 KTPIKGQSKRNASGDDVYVNSFAAKRRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNN 420
Query: 421 ANLDSYLIKSGNETNADSSITDTDVQYGKQSLDARLNSLKEKRDIDIPNHIDDQTSTVTG 480
NLDSYLIK GNETNADSSITDTDVQYGKQSLDARLNSL+EKRDIDIPNHIDDQTSTVTG
Sbjct: 421 VNLDSYLIKRGNETNADSSITDTDVQYGKQSLDARLNSLREKRDIDIPNHIDDQTSTVTG 480
Query: 481 LGNKDRRSFSATPSIDESNMRKEDVIGSDHVPDGISDHMRNTSQTPTSTGHEHQHGTSWP 540
LGNKDRRSFS TPSIDESN+RKEDV+GSDH+PDGISD M NTSQTPTSTGHEHQHGTSWP
Sbjct: 481 LGNKDRRSFSVTPSIDESNVRKEDVVGSDHIPDGISDQMLNTSQTPTSTGHEHQHGTSWP 540
Query: 541 NSFWGLSPESALSYFPKDVAKKLLYHISMYVQNLKSGFVQHARGVIDGGDVMKNKGTNTM 600
SFWGLS ESALSYFPKDV KKLLYHIS+Y+QNLK G VQHARG+IDGGDVMKNKG NTM
Sbjct: 541 ISFWGLSSESALSYFPKDVGKKLLYHISLYIQNLKFGIVQHARGIIDGGDVMKNKGANTM 600
Query: 601 LPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFV 660
LPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFV
Sbjct: 601 LPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFV 660
Query: 661 SGDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRG 720
SGDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRG
Sbjct: 661 SGDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRG 720
Query: 721 DTFPNFQGQLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFGCAPLEASGDF 780
DTFPNFQGQLDVTGLAFKIFDAPSSFTEI ATLCFRGQRIFVQNASGWFG APLEASGDF
Sbjct: 721 DTFPNFQGQLDVTGLAFKIFDAPSSFTEIVATLCFRGQRIFVQNASGWFGSAPLEASGDF 780
Query: 781 GINPDEGEFHLMCQVPGVEVNALMKTFKMKPFLFPLAGSVTAVFNCQGPLDSPIFVGSGM 840
GINPDEGEFHLMCQVPGVE NALMKTFKMKPF FPLAGSVTAVFNCQGPLDSPIFVGSGM
Sbjct: 781 GINPDEGEFHLMCQVPGVEANALMKTFKMKPFFFPLAGSVTAVFNCQGPLDSPIFVGSGM 840
Query: 841 VSRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFSIDNCVADLYGIR 900
VSRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTF IDNCVADLYGIR
Sbjct: 841 VSRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFGIDNCVADLYGIR 900
Query: 901 ANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHRYMPGYSDWMPLKLGLL 960
ANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMH Y+PGYSDWMPLKLGLL
Sbjct: 901 ANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHLYVPGYSDWMPLKLGLL 960
Query: 961 NGETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTS 1020
NGETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTS
Sbjct: 961 NGETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTS 1020
Query: 1021 YSDKIMLDEEVFDTKRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVKF 1080
YSDKIMLDEEVFD KRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVKF
Sbjct: 1021 YSDKIMLDEEVFDAKRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVKF 1080
Query: 1081 VGKVLRPSSKDFSNEKSKHQVQPIDEENKNGLAGEVSISGLKLNQLVLAPKLAGLLSMTR 1140
VGKVLRPSSKDFSNEKSK QVQPIDEENK+GLAGEVSISGLKLNQLVLAPKLAGLLSMTR
Sbjct: 1081 VGKVLRPSSKDFSNEKSKQQVQPIDEENKDGLAGEVSISGLKLNQLVLAPKLAGLLSMTR 1140
Query: 1141 ESIKLDTTGRPDESLSVEIVGSLKPNSDNSRKSKLFSFNLQRGQLRANARYQPSRSAHLE 1200
ESIKL+TTGRPDESLSVEIVGSLKP+SDNSRKSKLFSFNLQRGQL+ANARYQPSRSAHLE
Sbjct: 1141 ESIKLETTGRPDESLSVEIVGSLKPSSDNSRKSKLFSFNLQRGQLKANARYQPSRSAHLE 1200
Query: 1201 LRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARWSGD 1260
LRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARWSGD
Sbjct: 1201 LRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARWSGD 1260
Query: 1261 VITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMGRWR 1320
VITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMGRWR
Sbjct: 1261 VITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMGRWR 1320
Query: 1321 MRLEVPRAEVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTESVQDLIEVIRRQ 1380
MRLEVP+AEVAEMLPLARLLSRS DPSVHSRSKD FIQNLQAVGLYTESVQDLIEVIRRQ
Sbjct: 1321 MRLEVPKAEVAEMLPLARLLSRSTDPSVHSRSKDFFIQNLQAVGLYTESVQDLIEVIRRQ 1380
Query: 1381 FILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQRVL 1440
FILSDEIVLEDLSLPGLSELRGCW GSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQRVL
Sbjct: 1381 FILSDEIVLEDLSLPGLSELRGCWHGSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQRVL 1440
Query: 1441 AVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVIESS 1500
AVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVIESS
Sbjct: 1441 AVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVIESS 1500
Query: 1501 AKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVASLT 1560
AKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVASLT
Sbjct: 1501 AKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVASLT 1560
Query: 1561 SGSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGTLVHAWGKEKVR 1620
SGSRFLFNAKFEP+IQNGHVHVQGSIPVMFVQN MGEVEEVETDTSRGTLVHAWGKEKVR
Sbjct: 1561 SGSRFLFNAKFEPVIQNGHVHVQGSIPVMFVQNKMGEVEEVETDTSRGTLVHAWGKEKVR 1620
Query: 1621 EKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSPHVN 1680
EKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSPHVN
Sbjct: 1621 EKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSPHVN 1680
Query: 1681 WLHGNADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLHVRSNRLCINSL 1740
WLHG+ADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTL+VRSNRLCINSL
Sbjct: 1681 WLHGSADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLYVRSNRLCINSL 1740
Query: 1741 ESRVSRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGSILQ 1800
ESRV RRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGSILQ
Sbjct: 1741 ESRVGRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGSILQ 1800
Query: 1801 PNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKYASFFNSESTA 1860
PNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFS PPGSSNQVVASKYASFFNSESTA
Sbjct: 1801 PNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSHPPGSSNQVVASKYASFFNSESTA 1860
Query: 1861 LKTRFRVPRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYPLILNFAVSGELE 1920
LKTRF VP+DK VDIEKESRNVN+KPSVDVSLSDLKLVLGPELRILYPLILNFAVSGELE
Sbjct: 1861 LKTRFHVPQDKGVDIEKESRNVNIKPSVDVSLSDLKLVLGPELRILYPLILNFAVSGELE 1920
Query: 1921 LNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLALVGSE 1980
LNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLALVGSE
Sbjct: 1921 LNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLALVGSE 1980
Query: 1981 WQIRIQSRASKWQEKLVVTSTRSVEQDALSPTEMPSVESSGKTLQADLTSRKPELRATRA 2040
WQIRIQSRASKWQEKLVVTSTRSVEQDA SPTE ATRA
Sbjct: 1981 WQIRIQSRASKWQEKLVVTSTRSVEQDAHSPTE-----------------------ATRA 2040
Query: 2041 FENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTLLSFPT 2100
FENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQA WRLVYAPQIPTLLSFPT
Sbjct: 2041 FENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQASWRLVYAPQIPTLLSFPT 2100
Query: 2101 TDPLLSLTSNISIGTVVEVQLGKRIQEVFKNFKQASMIRQMKETEMAMQWMITYKLTSRL 2160
TDPL SLTSNIS GTVVEVQLGKRI QASMIRQMKETEMAMQW TYKLTSRL
Sbjct: 2101 TDPLQSLTSNISFGTVVEVQLGKRI--------QASMIRQMKETEMAMQWTFTYKLTSRL 2153
Query: 2161 RMVLQSAPAQRTLLLVEYSATSLD 2185
RMVLQSAPAQRTLLLVEYSATSLD
Sbjct: 2161 RMVLQSAPAQRTLLLVEYSATSLD 2153
BLAST of MELO3C003666 vs. NCBI nr
Match:
KAE8651368.1 (hypothetical protein Csa_001268 [Cucumis sativus])
HSP 1 Score: 4109.7 bits (10657), Expect = 0.0e+00
Identity = 2086/2208 (94.47%), Postives = 2112/2208 (95.65%), Query Frame = 0
Query: 1 MNVKLDSSFLGTQLHSSLHCVTNGKFVYLGRRRLSKGDSKKYVCAKHNEWNARVERFSRF 60
MNVKLDSSF GTQLHSSLHC+ NGKFVYLG+ RLSK DSKKYVCA+HN+WNARV+RFSRF
Sbjct: 1 MNVKLDSSFFGTQLHSSLHCIKNGKFVYLGQGRLSKRDSKKYVCAEHNDWNARVDRFSRF 60
Query: 61 FGQHLKSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFAAVVSGI 120
FGQHL+SLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAF AVVSGI
Sbjct: 61 FGQHLRSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFVAVVSGI 120
Query: 121 CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDGE 180
CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSV PDGE
Sbjct: 121 CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVSPDGE 180
Query: 181 EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLERH 240
EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTL+RH
Sbjct: 181 EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLQRH 240
Query: 241 SSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVV 300
SSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVV
Sbjct: 241 SSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVV 300
Query: 301 GPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKIRHAKSEKYFDVKSPDTRLKFSSRAM 360
GPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYK RHAKSEKYFDVKSPDTRLKFSSRAM
Sbjct: 301 GPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKTRHAKSEKYFDVKSPDTRLKFSSRAM 360
Query: 361 KTLIKGQSKRNASGDDVYVNSFAAKKRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNN 420
KT IKGQSKRNASGDDVYVNSFAAK+RILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNN
Sbjct: 361 KTPIKGQSKRNASGDDVYVNSFAAKRRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNN 420
Query: 421 ANLDSYLIKSGNETNADSSITDTDVQYGKQSLDARLNSLKEKRDIDIPNHIDDQTSTVTG 480
NLDSYLIK GNETNADSSITDTDVQYGKQSLDARLNSL+EKRDIDIPNHIDDQTSTVTG
Sbjct: 421 VNLDSYLIKRGNETNADSSITDTDVQYGKQSLDARLNSLREKRDIDIPNHIDDQTSTVTG 480
Query: 481 LGNKDRRSFSATPSIDESNMRKEDVIGSDHVPDGISDHMRNTSQTPTSTGHEHQHGTSWP 540
LGNKDRRSFS TPSIDESN+RKEDV+GSDH+PDGISD M NTSQTPTSTGHEHQHGTSWP
Sbjct: 481 LGNKDRRSFSVTPSIDESNVRKEDVVGSDHIPDGISDQMLNTSQTPTSTGHEHQHGTSWP 540
Query: 541 NSFWGLSPESALSYFPKDVAKKLLYHISMYVQNLKSGFVQHARGVIDGGDVMKNKGTNTM 600
SFWGLS ESALSYFPKDV KKLLYHIS+Y+QNLK G VQHARG+IDGGDVMKNKG NTM
Sbjct: 541 ISFWGLSSESALSYFPKDVGKKLLYHISLYIQNLKFGIVQHARGIIDGGDVMKNKGANTM 600
Query: 601 LPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFV 660
LPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFV
Sbjct: 601 LPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFV 660
Query: 661 SGDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRG 720
SGDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRG
Sbjct: 661 SGDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRG 720
Query: 721 DTFPNFQGQLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFGCAPLEASGDF 780
DTFPNFQGQLDVTGLAFKIFDAPSSFTEI ATLCFRGQRIFVQNASGWFG APLEASGDF
Sbjct: 721 DTFPNFQGQLDVTGLAFKIFDAPSSFTEIVATLCFRGQRIFVQNASGWFGSAPLEASGDF 780
Query: 781 GINPDEGEFHLMCQVPGVEVNALMKTFKMKPFLFPLAGSVTAVFNCQGPLDSPIFVGSGM 840
GINPDEGEFHLMCQVPGVE NALMKTFKMKPF FPLAGSVTAVFNCQGPLDSPIFVGSGM
Sbjct: 781 GINPDEGEFHLMCQVPGVEANALMKTFKMKPFFFPLAGSVTAVFNCQGPLDSPIFVGSGM 840
Query: 841 VSRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFSIDNCVADLYGIR 900
VSRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTF IDNCVADLYGIR
Sbjct: 841 VSRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFGIDNCVADLYGIR 900
Query: 901 ANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHRYMPGYSDWMPLKLGLL 960
ANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMH Y+PGYSDWMPLKLGLL
Sbjct: 901 ANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHLYVPGYSDWMPLKLGLL 960
Query: 961 NGETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTS 1020
NGETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTS
Sbjct: 961 NGETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTS 1020
Query: 1021 YSDKIMLDEEVFDTKRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVKF 1080
YSDKIMLDEEVFD KRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVKF
Sbjct: 1021 YSDKIMLDEEVFDAKRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVKF 1080
Query: 1081 VGKVLRPSSKDFSNEKSKHQVQPIDEENKNGLAGEVSISGLKLNQLVLAPKLAGLLSMTR 1140
VGKVLRPSSKDFSNEKSK QVQPIDEENK+GLAGEVSISGLKLNQLVLAPKLAGLLSMTR
Sbjct: 1081 VGKVLRPSSKDFSNEKSKQQVQPIDEENKDGLAGEVSISGLKLNQLVLAPKLAGLLSMTR 1140
Query: 1141 ESIKLDTTGRPDESLSVEIVGSLKPNSDNSRKSKLFSFNLQRGQLRANARYQPSRSAHLE 1200
ESIKL+TTGRPDESLSVEIVGSLKP+SDNSRKSKLFSFNLQRGQL+ANARYQPSRSAHLE
Sbjct: 1141 ESIKLETTGRPDESLSVEIVGSLKPSSDNSRKSKLFSFNLQRGQLKANARYQPSRSAHLE 1200
Query: 1201 LRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARWSGD 1260
LRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARWSGD
Sbjct: 1201 LRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARWSGD 1260
Query: 1261 VITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMGRWR 1320
VITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMGRWR
Sbjct: 1261 VITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMGRWR 1320
Query: 1321 MRLEVPRAEVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTESVQDLIEVIRRQ 1380
MRLEVP+AEVAEMLPLARLLSRS DPSVHSRSKD FIQNLQAVGLYTESVQDLIEVIRRQ
Sbjct: 1321 MRLEVPKAEVAEMLPLARLLSRSTDPSVHSRSKDFFIQNLQAVGLYTESVQDLIEVIRRQ 1380
Query: 1381 FILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQRVL 1440
FILSDEIVLEDLSLPGLSELRGCW GSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQRVL
Sbjct: 1381 FILSDEIVLEDLSLPGLSELRGCWHGSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQRVL 1440
Query: 1441 AVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVIESS 1500
AVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVIESS
Sbjct: 1441 AVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVIESS 1500
Query: 1501 AKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVASLT 1560
AKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVASLT
Sbjct: 1501 AKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVASLT 1560
Query: 1561 SGSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGTLVHAWGKEKVR 1620
SGSRFLFNAKFEP+IQNGHVHVQGSIPVMFVQN MGEVEEVETDTSRGTLVHAWGKEKVR
Sbjct: 1561 SGSRFLFNAKFEPVIQNGHVHVQGSIPVMFVQNKMGEVEEVETDTSRGTLVHAWGKEKVR 1620
Query: 1621 EKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSPHVN 1680
EKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSPHVN
Sbjct: 1621 EKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSPHVN 1680
Query: 1681 WLHGNADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLHVRSNRLCINSL 1740
WLHG+ADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTL+VRSNRLCINSL
Sbjct: 1681 WLHGSADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLYVRSNRLCINSL 1740
Query: 1741 ESRVSRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGSILQ 1800
ESRV RRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGSILQ
Sbjct: 1741 ESRVGRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGSILQ 1800
Query: 1801 PNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKYASFFNSESTA 1860
PNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFS PPGSSNQVVASKYASFFNSESTA
Sbjct: 1801 PNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSHPPGSSNQVVASKYASFFNSESTA 1860
Query: 1861 LKTRFRVPRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYPLILNFAVSGELE 1920
LKTRF VP+DK VDIEKESRNVN+KPSVDVSLSDLKLVLGPELRILYPLILNFAVSGELE
Sbjct: 1861 LKTRFHVPQDKGVDIEKESRNVNIKPSVDVSLSDLKLVLGPELRILYPLILNFAVSGELE 1920
Query: 1921 LNGFAHAKSIKPKGTLTFDNGDVNLLAT------------------------QVRLKREH 1980
LNGFAHAKSIKPKGTLTFDNGDVNLLAT QVRLKREH
Sbjct: 1921 LNGFAHAKSIKPKGTLTFDNGDVNLLATQVSSNLYHTYSFSKQKKLVLTLTMQVRLKREH 1980
Query: 1981 LNIATFEPENGLDPMLDLALVGSEWQIRIQSRASKWQEKLVVTSTRSVEQDALSPTEMPS 2040
LNIATFEPENGLDPMLDLALVGSEWQIRIQSRASKWQEKLVVTSTRSVEQDA SPTE
Sbjct: 1981 LNIATFEPENGLDPMLDLALVGSEWQIRIQSRASKWQEKLVVTSTRSVEQDAHSPTE--- 2040
Query: 2041 VESSGKTLQADLTSRKPELRATRAFENQLAESILESGGQLALEKLATATLEKLMPRIEGK 2100
ATRAFENQLAESILESGGQLALEKLATATLEKLMPRIEGK
Sbjct: 2041 --------------------ATRAFENQLAESILESGGQLALEKLATATLEKLMPRIEGK 2100
Query: 2101 GEFGQARWRLVYAPQIPTLLSFPTTDPLLSLTSNISIGTVVEVQLGKRIQEVFKNFKQAS 2160
GEFGQA WRLVYAPQIPTLLSFPTTDPL SLTSNIS GTVVEVQLGKRI QAS
Sbjct: 2101 GEFGQASWRLVYAPQIPTLLSFPTTDPLQSLTSNISFGTVVEVQLGKRI--------QAS 2160
Query: 2161 MIRQMKETEMAMQWMITYKLTSRLRMVLQSAPAQRTLLLVEYSATSLD 2185
MIRQMKETEMAMQW TYKLTSRLRMVLQSAPAQRTLLLVEYSATSLD
Sbjct: 2161 MIRQMKETEMAMQWTFTYKLTSRLRMVLQSAPAQRTLLLVEYSATSLD 2177
BLAST of MELO3C003666 vs. NCBI nr
Match:
XP_038897772.1 (protein TIC236, chloroplastic [Benincasa hispida] >XP_038897773.1 protein TIC236, chloroplastic [Benincasa hispida])
HSP 1 Score: 3909.0 bits (10136), Expect = 0.0e+00
Identity = 1984/2191 (90.55%), Postives = 2059/2191 (93.98%), Query Frame = 0
Query: 1 MNVKLDSSFLGTQLHSSLHCVTNGKFVYLGRRRLSKGDSKKYVCAKHNEWNARVERFSRF 60
MNVKLDSSF GT LHSSLHC+ NGKFVYL R RL K DSKKYVCAKHN+WNARV+RFSRF
Sbjct: 1 MNVKLDSSFFGTPLHSSLHCIKNGKFVYLRRGRLLKRDSKKYVCAKHNDWNARVDRFSRF 60
Query: 61 FGQHLKSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFAAVVSGI 120
QHLKSLSIKL+PRHESLMKCANEPFVQTKSLSSLLRP WNEGLFLIRCSAF AVVSGI
Sbjct: 61 CVQHLKSLSIKLRPRHESLMKCANEPFVQTKSLSSLLRPAWNEGLFLIRCSAFVAVVSGI 120
Query: 121 CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDGE 180
CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSI+PLSITLESCS+GPDGE
Sbjct: 121 CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSITPLSITLESCSIGPDGE 180
Query: 181 EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLERH 240
EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGT +RH
Sbjct: 181 EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTSQRH 240
Query: 241 SSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVV 300
SSSEEGIDNRTKIRRIARE+AAALW+KDRDDAAREAAEMGFVVFDRSSGLYD+SD KE V
Sbjct: 241 SSSEEGIDNRTKIRRIAREDAAALWAKDRDDAAREAAEMGFVVFDRSSGLYDSSDLKEDV 300
Query: 301 GPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKIRHAKSEKYFDVKSPDTRLKFSSRAM 360
GPT+DI N KT FF D +VH REHHCMDTDVDYKI+HA SEKYFDVKSP+TRLKF SR M
Sbjct: 301 GPTIDIENYKTCFFTDNDVHLREHHCMDTDVDYKIKHADSEKYFDVKSPNTRLKFLSRVM 360
Query: 361 KTLIKGQSKRNASGDDVYVNSFAAKKRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNN 420
K+ IKGQSKR ASGDD+YVN+F AKKR LRRSTLAAQDYFKGASEGKF EPSQLH+SFNN
Sbjct: 361 KSPIKGQSKRKASGDDIYVNNFTAKKRTLRRSTLAAQDYFKGASEGKFVEPSQLHRSFNN 420
Query: 421 ANLDSYLIKSGNETNADSSITDTDVQYGKQSLDARLNSLKEKRDIDIPNHIDDQTSTVTG 480
NLD+YLIKS +ETNA SSI +TDVQY KQSLDA+L+SLKE DIDI NHIDDQ STVTG
Sbjct: 421 VNLDAYLIKSVDETNAASSIANTDVQYEKQSLDAKLHSLKE-GDIDIRNHIDDQISTVTG 480
Query: 481 LGNKDRRSFSATPSIDESNMRKEDVIGSDHVPDGISDHMRNTSQTPTSTGHEHQHGTSWP 540
LGNKD+RSFS TPSIDESN++K+DV+GSDH+ DGISD M NTSQTPTST HEHQHG+S P
Sbjct: 481 LGNKDKRSFSVTPSIDESNVKKDDVVGSDHILDGISDQMCNTSQTPTSTVHEHQHGSSGP 540
Query: 541 NSFWGLSPESALSYFPKDVAKKLLYHISMYVQNLKSGFVQHARGVIDGGDVMKNKGTNTM 600
SFW LSPESALSYFPKDV KKL+YH+SMYVQNLK G VQHARG++DGGDVMKNKGT TM
Sbjct: 541 TSFWALSPESALSYFPKDVRKKLMYHLSMYVQNLKFGLVQHARGIVDGGDVMKNKGTETM 600
Query: 601 LPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFV 660
LPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCK+WRS+FV
Sbjct: 601 LPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKTWRSDFV 660
Query: 661 SGDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRG 720
SGDGGWLSADVFVDIFEQ+WHSNLKITN+FVPLFERILDIPITWSKGRATGEVHLCMSRG
Sbjct: 661 SGDGGWLSADVFVDIFEQQWHSNLKITNLFVPLFERILDIPITWSKGRATGEVHLCMSRG 720
Query: 721 DTFPNFQGQLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFGCAPLEASGDF 780
DTFPNFQGQLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFG APLEASGDF
Sbjct: 721 DTFPNFQGQLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFGSAPLEASGDF 780
Query: 781 GINPDEGEFHLMCQVPGVEVNALMKTFKMKPFLFPLAGSVTAVFNCQGPLDSPIFVGSGM 840
GI+P+EGEFHLMCQVPGVEVNALMKTFKM+PFLFPLAGSVTAVFNCQGPLDSPIFVGSGM
Sbjct: 781 GIHPEEGEFHLMCQVPGVEVNALMKTFKMRPFLFPLAGSVTAVFNCQGPLDSPIFVGSGM 840
Query: 841 VSRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFSIDNCVADLYGIR 900
VSRKMN+ F DLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTF+IDNCVADLYGIR
Sbjct: 841 VSRKMNHSFLDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFNIDNCVADLYGIR 900
Query: 901 ANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHRYMPGYSDWMPLKLGLL 960
ANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNIS DKIMHRYMPGY D MPLKLGLL
Sbjct: 901 ANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISFDKIMHRYMPGYLDLMPLKLGLL 960
Query: 961 NGETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTS 1020
NGETKVSGSLL+PRFNINWTAPLAEGSFRDARGDINISHDYI VNSSSVAFELFSK+QTS
Sbjct: 961 NGETKVSGSLLKPRFNINWTAPLAEGSFRDARGDINISHDYITVNSSSVAFELFSKMQTS 1020
Query: 1021 YSDKIMLDEEVFDTKRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVKF 1080
YSD+IMLDEEVFDTKRTPS IDGVELDLHMRGFEFLSLVSYIFESPRP HLKATGRVKF
Sbjct: 1021 YSDEIMLDEEVFDTKRTPSCIIDGVELDLHMRGFEFLSLVSYIFESPRPTHLKATGRVKF 1080
Query: 1081 VGKVLR----PSSKDFSNEKSKHQVQPIDEENKNGLAGEVSISGLKLNQLVLAPKLAGLL 1140
VGKV+R SS+DFSNEKSK QVQP+DE+ KN LAGEVSISGLKLNQLVLAPKLAGLL
Sbjct: 1081 VGKVMRLSAGSSSQDFSNEKSKQQVQPVDEDYKNSLAGEVSISGLKLNQLVLAPKLAGLL 1140
Query: 1141 SMTRESIKLDTTGRPDESLSVEIVGSLKPNSDNSRKSKLFSFNLQRGQLRANARYQPSRS 1200
SMTRESIKLD TGRPDESLSVEIVGSLKP+SDNSRKSKLFSFNLQRGQLRAN YQPSRS
Sbjct: 1141 SMTRESIKLDATGRPDESLSVEIVGSLKPSSDNSRKSKLFSFNLQRGQLRANVCYQPSRS 1200
Query: 1201 AHLELRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAAR 1260
AHLELRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLS+L PKFSGVLGEALDIAAR
Sbjct: 1201 AHLELRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSILGPKFSGVLGEALDIAAR 1260
Query: 1261 WSGDVITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSM 1320
WSGDVITIEKTILEQSNSRYELQGEYVLPGSRDRNV+ KE +GFLKKAMASHLSSVISSM
Sbjct: 1261 WSGDVITIEKTILEQSNSRYELQGEYVLPGSRDRNVSGKEGSGFLKKAMASHLSSVISSM 1320
Query: 1321 GRWRMRLEVPRAEVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTESVQDLIEV 1380
GRWRMRLEVP AEVAEMLPLARLLSRS DPSVHSRSKDLFIQ+LQAVGLYTESVQDLIEV
Sbjct: 1321 GRWRMRLEVPMAEVAEMLPLARLLSRSTDPSVHSRSKDLFIQSLQAVGLYTESVQDLIEV 1380
Query: 1381 IRRQFILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGVYKT 1440
IRRQFILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWG YKT
Sbjct: 1381 IRRQFILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGTYKT 1440
Query: 1441 QRVLAVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQV 1500
QRVLAVGAYSNNDGLRLEKIFIQKDNAT+HADGTLFGPITNLHFAVLNFPVSLVP VQV
Sbjct: 1441 QRVLAVGAYSNNDGLRLEKIFIQKDNATIHADGTLFGPITNLHFAVLNFPVSLVPTVVQV 1500
Query: 1501 IESSAKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVV 1560
IESSAKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGG+DLGRAEVV
Sbjct: 1501 IESSAKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGIDLGRAEVV 1560
Query: 1561 ASLTSGSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGTLVHAWGK 1620
ASLTS SRFLFNAKFEPIIQNGHVHVQGSIPVMFVQN+MGEVEEVETDTSR TLVHAWGK
Sbjct: 1561 ASLTSSSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNSMGEVEEVETDTSRATLVHAWGK 1620
Query: 1621 EKVREKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALS 1680
EKVR+KFNDRKSSR+RNEEGWNTQLAEGLKGLNW+LLDVGEVRIDADIKDGGMLLLTALS
Sbjct: 1621 EKVRDKFNDRKSSRERNEEGWNTQLAEGLKGLNWNLLDVGEVRIDADIKDGGMLLLTALS 1680
Query: 1681 PHVNWLHGNADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLHVRSNRLC 1740
PHVNWLHGNADILLQVRGTIEEP+LDGSASFHRASISSPVLPKPL NFGGT+HVRSNRLC
Sbjct: 1681 PHVNWLHGNADILLQVRGTIEEPVLDGSASFHRASISSPVLPKPLINFGGTVHVRSNRLC 1740
Query: 1741 INSLESRVSRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITG 1800
INSLESRVSRRGKLI+KGNLPLRSSEA L DKIDLKCEVLEVRAKNIFSGQVDS MQITG
Sbjct: 1741 INSLESRVSRRGKLIVKGNLPLRSSEASLGDKIDLKCEVLEVRAKNIFSGQVDSLMQITG 1800
Query: 1801 SILQPNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKYASFFNS 1860
SILQPNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKYASFFNS
Sbjct: 1801 SILQPNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKYASFFNS 1860
Query: 1861 ESTALKTRFRVPRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYPLILNFAVS 1920
EST LKTRF PRDKA DIEKESRN+N+KPSVDV L +LK+VLGPELRILYPLILNFAVS
Sbjct: 1861 ESTTLKTRFHAPRDKAADIEKESRNLNIKPSVDVYLGNLKVVLGPELRILYPLILNFAVS 1920
Query: 1921 GELELNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLAL 1980
GELELNG AHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLAL
Sbjct: 1921 GELELNGRAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLAL 1980
Query: 1981 VGSEWQIRIQSRASKWQEKLVVTSTRSVEQDALSPTEMPSVESSGKTLQADLTSRKPELR 2040
VGSEWQIRIQSRASKWQ+KLVVTSTRSVEQDALSPTE
Sbjct: 1981 VGSEWQIRIQSRASKWQDKLVVTSTRSVEQDALSPTE----------------------- 2040
Query: 2041 ATRAFENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTLL 2100
A RAFENQLAESILE GQLAL+KLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTLL
Sbjct: 2041 AARAFENQLAESILEGDGQLALKKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTLL 2100
Query: 2101 SFPTTDPLLSLTSNISIGTVVEVQLGKRIQEVFKNFKQASMIRQMKETEMAMQWMITYKL 2160
SFPTTDPL SLTSNIS GTVVEVQLGKRI QAS++RQMKE+EMAMQW TYKL
Sbjct: 2101 SFPTTDPLKSLTSNISFGTVVEVQLGKRI--------QASIVRQMKESEMAMQWTFTYKL 2159
Query: 2161 TSRLRMVLQS---APAQRTLLLVEYSATSLD 2185
TSRLRMVLQS APAQRTL+LVEYSA+SLD
Sbjct: 2161 TSRLRMVLQSAPAAPAQRTLVLVEYSASSLD 2159
BLAST of MELO3C003666 vs. NCBI nr
Match:
XP_022936094.1 (uncharacterized protein LOC111442799 [Cucurbita moschata])
HSP 1 Score: 3807.3 bits (9872), Expect = 0.0e+00
Identity = 1933/2191 (88.22%), Postives = 2025/2191 (92.42%), Query Frame = 0
Query: 2 NVKLDSSFLGTQLHSSLHCVTNGKFVYLGRRRLSKGDSKKYVCAKHNEWNARVERFSRFF 61
NVKLDSSF TQLHSSL+C+ NG FV + R RLSK DSKKY+CAKHN+WNARV+RFSRF
Sbjct: 3 NVKLDSSFFATQLHSSLYCIKNGNFVCVRRGRLSKRDSKKYICAKHNDWNARVDRFSRFC 62
Query: 62 GQHLKSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFAAVVSGIC 121
GQHLKS+S+KL+PRHESLMKCANEP VQTK+LSS LRP+ NEGLFLIRCSAF AVVSGIC
Sbjct: 63 GQHLKSISLKLRPRHESLMKCANEPSVQTKALSSFLRPLRNEGLFLIRCSAFVAVVSGIC 122
Query: 122 LLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDGEE 181
LLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPD EE
Sbjct: 123 LLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDDEE 182
Query: 182 FSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLERHS 241
FSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPS VVVQKRDYTWLGLPFPSEGTL+RHS
Sbjct: 183 FSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSAVVVQKRDYTWLGLPFPSEGTLQRHS 242
Query: 242 SSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVVG 301
SSEEGIDNRTKIRRIARE AAA WSKDRDDAAREAAEMGFVV DRSSGLYD+S+ KE VG
Sbjct: 243 SSEEGIDNRTKIRRIAREEAAACWSKDRDDAAREAAEMGFVVSDRSSGLYDSSNLKEDVG 302
Query: 302 PTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKIRHAKSEKYFDVKSPDTRLKFSSRAMK 361
P VD+ NSK F F DENVHSREH CMDTDVDYKI+HA +EKYFDVKSP +RLKF SR MK
Sbjct: 303 PAVDVENSKAFLFMDENVHSREHRCMDTDVDYKIKHANAEKYFDVKSPGSRLKFLSRVMK 362
Query: 362 TLIKGQSKRNASGDDVYVNSFAAKKRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNNA 421
IKGQSKR ASGD+VYVN+F AKKRILRRSTLAAQDYFK ASE KF EPS+LH+S NN
Sbjct: 363 VPIKGQSKRKASGDNVYVNNFMAKKRILRRSTLAAQDYFKAASEVKFSEPSELHRSLNNV 422
Query: 422 NLDSYLIKSGNETNADSSITDTDVQYGKQSLDARLNSLKEKRDIDIPNHIDDQTSTVTGL 481
NLD+YL+KS NETNADSS+ +TD QYGKQ L A L SL+E+ IDIPNHIDDQ STVTGL
Sbjct: 423 NLDAYLVKSVNETNADSSVMNTDAQYGKQRLYAGLPSLEEEGGIDIPNHIDDQISTVTGL 482
Query: 482 GNKDRRSFSATPSIDESNMRKEDVIGSDHVPDGISDHMRNTSQTPTSTGHEHQHGTSWPN 541
GNKDRR FS TPSI+ESN++ +DV+GSDH+PDGISD M +TSQ PTSTGHEHQ GTS P
Sbjct: 483 GNKDRRFFSVTPSINESNVKNDDVVGSDHIPDGISDQMCHTSQAPTSTGHEHQSGTSGPT 542
Query: 542 SFWGLSPESALSYFPKDVAKKLLYHISMYVQNLKSGFVQHARGVIDGGDVMKNKGTNTML 601
SFW +SP+SALSYFPKD KLLYH++MY +NLK G VQH+R +++GGDVMKNKGT ML
Sbjct: 543 SFWAMSPKSALSYFPKDAGTKLLYHLAMYFKNLKFGLVQHSRVIVNGGDVMKNKGTEAML 602
Query: 602 PVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFVS 661
PVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCK+WRS+ VS
Sbjct: 603 PVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKTWRSDSVS 662
Query: 662 GDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRGD 721
GDGGWLSADVFVDIFEQ+WHSNLKITN+FVPLFERILDIPITWSKGRATGEVHLCMSRGD
Sbjct: 663 GDGGWLSADVFVDIFEQQWHSNLKITNLFVPLFERILDIPITWSKGRATGEVHLCMSRGD 722
Query: 722 TFPNFQGQLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFGCAPLEASGDFG 781
TFPNFQGQL+VTGLAFKIFDAPSSFTE+AA+LCFRGQRIFVQNASGWFG APLEASGDFG
Sbjct: 723 TFPNFQGQLEVTGLAFKIFDAPSSFTEMAASLCFRGQRIFVQNASGWFGSAPLEASGDFG 782
Query: 782 INPDEGEFHLMCQVPGVEVNALMKTFKMKPFLFPLAGSVTAVFNCQGPLDSPIFVGSGMV 841
I+P+EGEFHLMCQVP VEVNALMKTFKM+PFLFPLAGSVTAVFNCQGPLDSPIFVGSGMV
Sbjct: 783 IHPEEGEFHLMCQVPCVEVNALMKTFKMRPFLFPLAGSVTAVFNCQGPLDSPIFVGSGMV 842
Query: 842 SRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFSIDNCVADLYGIRA 901
SRKMN+ SD+PASCASEAIVKSKE GAIAAVDRIPFSYVSANFTF+IDNCVADLYGIRA
Sbjct: 843 SRKMNHSISDIPASCASEAIVKSKEAGAIAAVDRIPFSYVSANFTFNIDNCVADLYGIRA 902
Query: 902 NLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHRYMPGYSDWMPLKLGLLN 961
NLVDGGEIRGAGNAWICPEGELDDTAMDLN SGNIS DKIMHRYMPGY D MPLKLGLLN
Sbjct: 903 NLVDGGEIRGAGNAWICPEGELDDTAMDLNISGNISFDKIMHRYMPGYLDLMPLKLGLLN 962
Query: 962 GETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTSY 1021
GETKVSGSL RPRFNINWTAPLAEGSFRDARGDINISHDY IVNSSSVAFELFSK+QTSY
Sbjct: 963 GETKVSGSLTRPRFNINWTAPLAEGSFRDARGDINISHDYFIVNSSSVAFELFSKMQTSY 1022
Query: 1022 SDKIMLDEEVFDTKRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVKFV 1081
SD+ MLDEE FD KRTPS TIDGVELDLHMRGFEFLSLVSYIFESPRP HLKATGRVKFV
Sbjct: 1023 SDENMLDEEAFDAKRTPSCTIDGVELDLHMRGFEFLSLVSYIFESPRPTHLKATGRVKFV 1082
Query: 1082 GKVLRP----SSKDFSNEKSKHQVQPIDEENKNGLAGEVSISGLKLNQLVLAPKLAGLLS 1141
GKVLRP SS+DF+ EKS QVQ I +ENKN LAGEVSISGLKLNQL+LAPKLAG LS
Sbjct: 1083 GKVLRPSASNSSQDFNTEKSNQQVQTIGDENKNSLAGEVSISGLKLNQLILAPKLAGQLS 1142
Query: 1142 MTRESIKLDTTGRPDESLSVEIVGSLKPNSDNSRKSKLFSFNLQRGQLRANARYQPSRSA 1201
MTRESIKLD TGRPDESLSVEIVGSLKP SDNS KSKLFSFNLQRGQLRAN YQP RSA
Sbjct: 1143 MTRESIKLDATGRPDESLSVEIVGSLKPGSDNSIKSKLFSFNLQRGQLRANVCYQPFRSA 1202
Query: 1202 HLELRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARW 1261
HLELRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVL PKFSGVLGEALDIAARW
Sbjct: 1203 HLELRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLGPKFSGVLGEALDIAARW 1262
Query: 1262 SGDVITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMG 1321
SGDVITIEKTILEQSNSRYELQGEYVLPGSRDRNVT KE+ GFLKKAMASHLSSVISSMG
Sbjct: 1263 SGDVITIEKTILEQSNSRYELQGEYVLPGSRDRNVTGKETHGFLKKAMASHLSSVISSMG 1322
Query: 1322 RWRMRLEVPRAEVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTESVQDLIEVI 1381
RWRMRLEVPRAEVAEMLPLARLLSRS DPSVHSRSKDLFIQ+LQAVGLYTESVQ+LIEVI
Sbjct: 1323 RWRMRLEVPRAEVAEMLPLARLLSRSTDPSVHSRSKDLFIQSLQAVGLYTESVQELIEVI 1382
Query: 1382 RRQFILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQ 1441
RRQFILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWG YKTQ
Sbjct: 1383 RRQFILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGTYKTQ 1442
Query: 1442 RVLAVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVI 1501
RVLA GAYSNNDGLRLEKIFIQKDNAT+HADGTLFGPITNLHFAVLNFPVSLVP VQVI
Sbjct: 1443 RVLAGGAYSNNDGLRLEKIFIQKDNATIHADGTLFGPITNLHFAVLNFPVSLVPTVVQVI 1502
Query: 1502 ESSAKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVA 1561
ESSAKDLVHSLRQLV PIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGG+DLGRAEVVA
Sbjct: 1503 ESSAKDLVHSLRQLVTPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGIDLGRAEVVA 1562
Query: 1562 SLTSGSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGTLVHAWGKE 1621
SLTS SR LFNAKFEPIIQNGHVHVQGSIPV+F QN++ EVEE+ETDTSR TL+HAWGKE
Sbjct: 1563 SLTSSSRLLFNAKFEPIIQNGHVHVQGSIPVLFFQNSVTEVEELETDTSRATLIHAWGKE 1622
Query: 1622 KVREKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSP 1681
KVR+KFNDRKSSR+RNEEGWNTQLAEGLKGLNW+LLDVGEVR+DADIKDGGMLLLTALSP
Sbjct: 1623 KVRDKFNDRKSSRERNEEGWNTQLAEGLKGLNWNLLDVGEVRVDADIKDGGMLLLTALSP 1682
Query: 1682 HVNWLHGNADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLHVRSNRLCI 1741
HVNWLHGNADILLQV+GTIEEP+LDGSASFHRASISSPVLPKPL NFGGT+HVRSNRLCI
Sbjct: 1683 HVNWLHGNADILLQVKGTIEEPVLDGSASFHRASISSPVLPKPLINFGGTVHVRSNRLCI 1742
Query: 1742 NSLESRVSRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGS 1801
NSLESRVSRRGKLI+KGNLPLRSSEA L DKIDLKCEVLEVRAKNIFSGQVDSQMQITGS
Sbjct: 1743 NSLESRVSRRGKLIVKGNLPLRSSEASLGDKIDLKCEVLEVRAKNIFSGQVDSQMQITGS 1802
Query: 1802 ILQPNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKYAS-FFNS 1861
ILQPNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFS P GSSNQVVASKYAS FF+S
Sbjct: 1803 ILQPNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSPPTGSSNQVVASKYASPFFSS 1862
Query: 1862 ESTALKTRFRVPRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYPLILNFAVS 1921
ESTALKTRF PRDKA D EKESRNVN+KPSVDVSLSDLKLVLGPELRILYPLILNFAVS
Sbjct: 1863 ESTALKTRFDAPRDKAADSEKESRNVNIKPSVDVSLSDLKLVLGPELRILYPLILNFAVS 1922
Query: 1922 GELELNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLAL 1981
GELELNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLN+ATFEPENGLDPMLDLAL
Sbjct: 1923 GELELNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNVATFEPENGLDPMLDLAL 1982
Query: 1982 VGSEWQIRIQSRASKWQEKLVVTSTRSVEQDALSPTEMPSVESSGKTLQADLTSRKPELR 2041
VGSEWQIRIQSRASKWQ+KLVVTSTRSVEQDALSPTE
Sbjct: 1983 VGSEWQIRIQSRASKWQDKLVVTSTRSVEQDALSPTE----------------------- 2042
Query: 2042 ATRAFENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTLL 2101
A RAFENQLAESILES GQLAL+KLATATLEKLMPRIEGKGEFGQARWRLVYAPQIP+LL
Sbjct: 2043 AARAFENQLAESILESDGQLALKKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPSLL 2102
Query: 2102 SFPTTDPLLSLTSNISIGTVVEVQLGKRIQEVFKNFKQASMIRQMKETEMAMQWMITYKL 2161
SFPTTDPL SLTSNIS GTVVEVQLGKRI QAS++RQMKE+EMAMQW TYKL
Sbjct: 2103 SFPTTDPLKSLTSNISFGTVVEVQLGKRI--------QASIVRQMKESEMAMQWTFTYKL 2162
Query: 2162 TSRLRMVLQS---APAQRTLLLVEYSATSLD 2185
TSRLRMVLQS APAQRTL+LVEYSA+SLD
Sbjct: 2163 TSRLRMVLQSGPAAPAQRTLVLVEYSASSLD 2162
BLAST of MELO3C003666 vs. ExPASy Swiss-Prot
Match:
F4ISL7 (Protein TIC236, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=TIC236 PE=1 SV=1)
HSP 1 Score: 2555.4 bits (6622), Expect = 0.0e+00
Identity = 1349/2238 (60.28%), Postives = 1661/2238 (74.22%), Query Frame = 0
Query: 1 MNVKLDSSFLGTQLHSSLHCVTNGKFVYLGRRRLSKGDSKKYVCAKHNEWNARVERFSRF 60
M+++L + FL T L LH N + + R + + Y K N+W A+V +FS+F
Sbjct: 1 MSLRLQNPFLSTPL---LHGSFNRREKRINVARRAFRSKRIYSEKKQNDWLAKVAKFSQF 60
Query: 61 FGQHLKSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFAAVVSGI 120
G++++ L L R +KC EPFV++K L L PVW EGLF +RCS F AV+SG+
Sbjct: 61 CGKNVQLLRKSLDSRSRMEVKCLKEPFVRSKDLVRSLAPVWEEGLFFLRCSVFFAVISGV 120
Query: 121 CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDGE 180
CLLVWYGQ KA+ FVE KLLPSVC +S+ IQR++DFGKVR +SPL ITLE+ S+GP GE
Sbjct: 121 CLLVWYGQNKARVFVETKLLPSVCSVLSETIQREVDFGKVRRVSPLCITLEASSIGPHGE 180
Query: 181 EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLERH 240
EFSCGEVPTMK+ V PF SLRRG++++D +LS+P+V+V QK+D+TWLG+P S+ TL H
Sbjct: 181 EFSCGEVPTMKVCVRPFASLRRGKIVVDAILSNPTVLVAQKKDFTWLGIPL-SDTTLPSH 240
Query: 241 SSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVV 300
SSEEGID RTK RR++RE A W ++RD+ AR+AAE+G++V ++ + K
Sbjct: 241 LSSEEGIDFRTKTRRVSREEAGIRWDEERDNDARKAAEIGYIVPCKNYSQAKDNAVKHDR 300
Query: 301 GPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKIRHAKSEKYFDVKSPDTRLKFSSRAM 360
T +I N +F DE +HS E HCMD V+Y ++HA+ EK F +K P + LKF S+ +
Sbjct: 301 RFT-EIANPNSFICMDEKMHSAEQHCMDPGVEYDVKHAELEKSFGIKIPGSGLKFLSKML 360
Query: 361 KTLIKGQSKRNASGDDVYVNSFAAKKRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNN 420
K K + K N+ +++ +AKKRIL RS AA YF S+ K EPS L +++
Sbjct: 361 KVPRKYKFKWNSKSHKNSMSNISAKKRILERSASAALSYFHSLSQQKLDEPSVLSTNYDG 420
Query: 421 ANLDSYLIKSGNETNADSSITDTDVQYGKQS----LDARLNSLKEKRDIDIPNHIDDQTS 480
+LD L+K E S+ D V YG+QS LD + ++ KR + +
Sbjct: 421 LSLDMLLVKGDREI---SNQYDRHVPYGEQSLANDLDGKGYRVRGKRLLGVKKASTLDKF 480
Query: 481 TVT---GLGNKDR----RSFSATPSIDE-SNMRKEDVIGSDHVPDGISDHMRNTSQTP-- 540
TV+ L DR +PS+++ N + + + S ++ +NT P
Sbjct: 481 TVSCDPFLMTVDRLCALLQTKRSPSVEDIVNSSESETLSSQRGDISMNVVNQNTDDVPHG 540
Query: 541 ----------TSTGHEHQ-----HGTSWPNSFWGLSPESALSYFPKDVAKKL-------L 600
T HEHQ SWP + + A+ +KKL
Sbjct: 541 NRSGNQPRDFTFKKHEHQPVANHWRPSWPRN---KKLKEAVFNILTGSSKKLTGRADPNA 600
Query: 601 YHISMYVQNLKSGFVQHARGVIDGGDVMKNKGTNTMLPVTIDSVHFKGGTLMLLAYGDRE 660
H+S ++ L + +V+ LPV +DSV FKGGTL+LLAYGD E
Sbjct: 601 PHLSDELEKLPAVYVEKT------------------LPVMLDSVQFKGGTLLLLAYGDTE 660
Query: 661 PREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFVSGDGGWLSADVFVDIFEQEWHSNL 720
PREM NV+GHVKFQNHYG V+V L GNC WRS+ S DGG LS DVFVD EQ WH+NL
Sbjct: 661 PREMRNVHGHVKFQNHYGRVYVQLGGNCNMWRSDVTSEDGGLLSVDVFVDTVEQNWHANL 720
Query: 721 KITNIFVPLFERILDIPITWSKGRATGEVHLCMSRGDTFPNFQGQLDVTGLAFKIFDAPS 780
+ N FVP+FERIL+IPI WSKGRATGEVHLCMSRG++FPN GQLDVTGL F I DAPS
Sbjct: 721 NVANFFVPIFERILEIPIEWSKGRATGEVHLCMSRGESFPNLHGQLDVTGLGFHINDAPS 780
Query: 781 SFTEIAATLCFRGQRIFVQNASGWFGCAPLEASGDFGINPDEGEFHLMCQVPGVEVNALM 840
SF++++A+L FRGQRIF+ NA+GWFG PLEASGDFGI+PDEGEFHLMCQVP VE+NALM
Sbjct: 781 SFSDVSASLSFRGQRIFLHNANGWFGKVPLEASGDFGIHPDEGEFHLMCQVPYVEINALM 840
Query: 841 KTFKMKPFLFPLAGSVTAVFNCQGPLDSPIFVGSGMVSRKMNNLFSDLPASCASEAIVKS 900
KTFKMKP FPLAGSVTAVFNCQGPLD+P+FVGS MVSRK+ L DLP S A EA++K+
Sbjct: 841 KTFKMKPLFFPLAGSVTAVFNCQGPLDAPVFVGSCMVSRKIAYLSPDLPTSLAYEAMLKN 900
Query: 901 KEGGAIAAVDRIPFSYVSANFTFSIDNCVADLYGIRANLVDGGEIRGAGNAWICPEGELD 960
KE GA+AA DR+PFSY+SANFTF+ DNCVADLYGIRA LVDGGEIRGAGNAWICPEGE+D
Sbjct: 901 KEAGAVAAFDRVPFSYLSANFTFNTDNCVADLYGIRATLVDGGEIRGAGNAWICPEGEVD 960
Query: 961 DTAMDLNFSGNISLDKIMHRYMPGYSDWMPLKLGLLNGETKVSGSLLRPRFNINWTAPLA 1020
DTA+D+NFSGNIS DK++HRYMP Y + LKLG L GETK+SG+LL+PRF+I W AP A
Sbjct: 961 DTALDVNFSGNISFDKVLHRYMPEYFNIGMLKLGDLTGETKLSGALLKPRFDIKWAAPKA 1020
Query: 1021 EGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTSYSDKIMLDEEVFDTKRTPSFTIDG 1080
+GS DARGDI ISHD IIVNSSSVAF+LF+K+ TSY D + ++ + P F ++G
Sbjct: 1021 DGSLTDARGDIVISHDNIIVNSSSVAFDLFTKLDTSYHDPCLSHQDFTQGEAMP-FVVEG 1080
Query: 1081 VELDLHMRGFEFLSLV-SYIFESPRPMHLKATGRVKFVGKVLRPSSK---DFSNEKSKHQ 1140
++LDL MRGFEF SLV SY F+SPRP HLKATGR+KF+GK+ R S+ D ++K +
Sbjct: 1081 LDLDLRMRGFEFFSLVSSYPFDSPRPTHLKATGRIKFLGKIKRHSTTKDGDVGSDKCE-- 1140
Query: 1141 VQPIDEENKNGLAGEVSISGLKLNQLVLAPKLAGLLSMTRESIKLDTTGRPDESLSVEIV 1200
D + L G++SIS LKLNQL+LAP+L+G LS++R+ +KLD GRPDESL+++ +
Sbjct: 1141 ----DAAAISSLDGDISISSLKLNQLILAPQLSGRLSVSRDHVKLDAAGRPDESLTLDFI 1200
Query: 1201 GSLKPNSD-NSRKSKLFSFNLQRGQLRANARYQPSRSAHLELRHLPLDDLELASLRGAIQ 1260
G L+PNSD N + KL SF+LQ+GQLRANA +QP +SA LE+R+ PLD+LELASLRG IQ
Sbjct: 1201 GPLQPNSDENVQSGKLLSFSLQKGQLRANACFQPQQSATLEIRNFPLDELELASLRGLIQ 1260
Query: 1261 RAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARWSGDV-----------ITIEKTI 1320
+AEI+LNLQKRRGHG+LSV+ PKFSGVLGEALD+A RWSGDV IT+EKTI
Sbjct: 1261 KAEIQLNLQKRRGHGLLSVIRPKFSGVLGEALDVAVRWSGDVCFMLSGRLEVMITVEKTI 1320
Query: 1321 LEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMGRWRMRLEVPRA 1380
LEQSNSRYELQGEYVLPGSRDR++ KE+ FL +AM HL SVISSMGRWRMRLEVP+A
Sbjct: 1321 LEQSNSRYELQGEYVLPGSRDRDLGQKEAGSFLMRAMTGHLGSVISSMGRWRMRLEVPKA 1380
Query: 1381 EVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTESVQDLIEVIRRQFILSDEIV 1440
EVAEMLPLARLLSRS DP+VHSRSKDLFIQ++Q + L E+++DL+E IR + E+V
Sbjct: 1381 EVAEMLPLARLLSRSTDPAVHSRSKDLFIQSVQNLCLQAENLRDLLEEIRGYYTPPSEVV 1440
Query: 1441 LEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQRVLAVGAYSNN 1500
LEDLSLPGL+EL+G W GSLDASGGGNGDT+AEFDFHG+DWEWG YKTQRVLA G+Y+N+
Sbjct: 1441 LEDLSLPGLAELKGHWHGSLDASGGGNGDTLAEFDFHGDDWEWGTYKTQRVLATGSYNND 1500
Query: 1501 DGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVIESSAKDLVHSL 1560
DGLRL+++ IQK NAT+HADGTL GP TNLHFAVLNFPVSL+P V+V+ESSA D+VHSL
Sbjct: 1501 DGLRLKEMLIQKGNATLHADGTLLGPKTNLHFAVLNFPVSLIPTLVEVVESSATDIVHSL 1560
Query: 1561 RQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVASLTSGSRFLFN 1620
R+L++PI+GILHMEGDLRG+L KPECDVQVRLLDGA+GG+DLGRAEV ASLTS SRFLFN
Sbjct: 1561 RKLLSPIKGILHMEGDLRGSLEKPECDVQVRLLDGAVGGIDLGRAEVFASLTSNSRFLFN 1620
Query: 1621 AKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGTLVHAWGKEKVREKFNDRKS 1680
+ FEP +QNGHVH+QGS+PV F Q NM E E ETD + +W KEK + +++++
Sbjct: 1621 SNFEPFVQNGHVHIQGSVPVSFSQKNMSEGEVSETDRGGAVKIPSWAKEK---EDDEKRT 1680
Query: 1681 SRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSPHVNWLHGNADI 1740
SRDR+EE W++QLAE LKGL W++LD GEVR++ADIKDGGM LLTA+SP+ NWL GNADI
Sbjct: 1681 SRDRSEERWDSQLAESLKGLYWNILDAGEVRLEADIKDGGMTLLTAISPYANWLQGNADI 1740
Query: 1741 LLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLHVRSNRLCINSLESRVSRRG 1800
LQV GT++ P+LDGSASFHRASISSPVL KPLTNFGGTLHV+SNRLCI SLESRVSR+G
Sbjct: 1741 RLQVGGTVDHPVLDGSASFHRASISSPVLRKPLTNFGGTLHVKSNRLCITSLESRVSRKG 1800
Query: 1801 KLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGSILQPNISGNIQ 1860
KL++KGNLPLRS+EA D I+LKCEVLEVRAKN S QVD+Q+QITGS+LQP ISGNI+
Sbjct: 1801 KLVVKGNLPLRSNEASAGDGIELKCEVLEVRAKNFLSCQVDTQLQITGSMLQPTISGNIK 1860
Query: 1861 LSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKY-ASFFNSESTALKTRFRV 1920
LS+GEAYLPHDKG GAA N++ ++Q+S+P + NQ V+S+Y A FF +E + +F
Sbjct: 1861 LSQGEAYLPHDKGGGAAPLNRLAANQYSIPGAAINQAVSSRYFARFFGTERASSGMKFSQ 1920
Query: 1921 PRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYPLILNFAVSGELELNGFAHA 1980
K+ +EKE V +KP++D+ LSD+KLVLGPELRI+YPLILNFAVSGELEL+G AH
Sbjct: 1921 STGKSNSVEKEIEEVKMKPNMDIRLSDMKLVLGPELRIMYPLILNFAVSGELELDGMAHP 1980
Query: 1981 KSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLALVGSEWQIRIQS 2040
K IKPKG LTF+NGDVNL+ATQVRLKREHLN+A FEPE+GLDP+LDLALVGSEWQ R+QS
Sbjct: 1981 KFIKPKGVLTFENGDVNLVATQVRLKREHLNVAKFEPEHGLDPLLDLALVGSEWQFRVQS 2040
Query: 2041 RASKWQEKLVVTSTRSVEQDALSPTEMPSVESSGKTLQADLTSRKPELRATRAFENQLAE 2100
RAS WQ+KLVVTSTRSVEQDALSP+E A + FE+QLAE
Sbjct: 2041 RASNWQDKLVVTSTRSVEQDALSPSE-----------------------AAKVFESQLAE 2100
Query: 2101 SILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTLLSF-PTTDPLLS 2160
SILE GQLA +KLATATL +MPRIEGKGEFGQARWRLVYAPQIP+LLS PT DPL S
Sbjct: 2101 SILEGDGQLAFKKLATATLGTIMPRIEGKGEFGQARWRLVYAPQIPSLLSVDPTVDPLKS 2160
Query: 2161 LTSNISIGTVVEVQLGKRIQEVFKNFKQASMIRQMKETEMAMQWMITYKLTSRLRMVLQS 2185
L SNIS GT VEVQLGKR+ QAS++RQMK++EMAMQW + Y+LTSRLR++LQS
Sbjct: 2161 LASNISFGTEVEVQLGKRL--------QASVVRQMKDSEMAMQWTLIYQLTSRLRVLLQS 2166
BLAST of MELO3C003666 vs. ExPASy Swiss-Prot
Match:
W0RYD3 (Protein SUBSTANDARD STARCH GRAIN 4, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=SSG4 PE=1 SV=1)
HSP 1 Score: 2257.6 bits (5849), Expect = 0.0e+00
Identity = 1180/2124 (55.56%), Postives = 1530/2124 (72.03%), Query Frame = 0
Query: 91 KSLSSLLRPVWNEGLFLIRCSAFAAVVSGICLLVWYGQTKAKGFVEAKLLPSVCKAVSDC 150
++L + L P+W EGLFL+RCS FAA +S L WY Q +A+ FVE++LLP+ C A+ +
Sbjct: 91 QALVASLAPLWREGLFLVRCSVFAAALSVAAALSWYAQLRARSFVESRLLPAACAALGEF 150
Query: 151 IQRDLDFGKVRSISPLSITLESCSVGPDGEEFSCGEVPTMKLRVLPFTSLRRGRVIIDVV 210
+QR++ G+VRS+SPL ITL +CS+GP EEFSC EVP MK+RV PF SLRRGRV++D V
Sbjct: 151 LQREVHLGRVRSVSPLGITLHTCSIGPHAEEFSCAEVPVMKIRVRPFASLRRGRVVVDAV 210
Query: 211 LSHPSVVVVQKRDYTWLGLPFPSEGTLERHSSSEEGIDNRTKIRRIARENAAALWSKDRD 270
LS PS +V Q++D++WLGLP PSEG+ +RH S EEGID RTK RR+ARE AA W+++RD
Sbjct: 211 LSEPSALVAQRKDFSWLGLPAPSEGSPKRH-SGEEGIDYRTKTRRLAREKAAEQWNEERD 270
Query: 271 DAAREAAEMGFVVFDRSSGLYDTSDYKEVVGPTVDIGNSKTFFFKDENVHSREHHCMDTD 330
AAREAAEMG++V S + E GP VD G S DE +H ++HH +D
Sbjct: 271 KAAREAAEMGYIVPSAQSISPSIDEMMEDDGP-VDTGKSSPHLCPDE-MHRKDHH-IDAG 330
Query: 331 VDYKIRHAKSEKYFDVKSPDTRLKFSSRAMKTLIKGQSKRNASGDDVYVNSFAAKKRILR 390
+D +HA EK F VK+ + F SR + + + +R A + ++++RILR
Sbjct: 331 IDSSSKHADLEKSFGVKARIPGISFWSRMIPNPSRRRYRRKAHSKLISDTDNSSQQRILR 390
Query: 391 RSTLAAQDYFKGASEGKFGEPSQLHKSFNNANLDSYLIKSGNETNADSSITDTDVQYGKQ 450
RS AA YF+ G N D L G E+++D T+ + G
Sbjct: 391 RSAYAAVAYFQNECSG---------------NPDDSLPGPG-ESSSDGGHTNGGGEEGSP 450
Query: 451 SLDARLNSLKEKRDIDIPNHIDDQTSTVTGLGNKDRRSFSATPSIDESNMRKEDVIGSDH 510
+ D P + TS G ++ +F++T I +++ + GS H
Sbjct: 451 N--------------DGPTEYSETTSMDYGELPPEKSNFASTMLIGNTDV----LNGSSH 510
Query: 511 --VPDGISDHM----RNTSQTPTSTGHEHQHGTSWPNSF----WG--LSPESALSYFPKD 570
P IS H S+ P ++ + F +G + LS++P
Sbjct: 511 NQQPSQISSHSWENNEQVSEAPVLKKRKNISEDDYRQEFDFGAFGSCTYAHNWLSFWPFQ 570
Query: 571 VA------KKLLYHISMYVQNLKSGFV----QHARGVIDGGDVMKNKGTNTMLPVTIDSV 630
+ +++ +Q L+S F ++ + G + LP+T+DSV
Sbjct: 571 LKGFPVGFNAPSASLNVQIQKLRSLFAIGPGDNSAELSQGVGQIHPGAVQQTLPITLDSV 630
Query: 631 HFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFVSGDGGWLS 690
+F GG LMLL YGD+EPREM++ NGH+KF+N Y VHVH++GNC WR + S GG+LS
Sbjct: 631 YFNGGNLMLLGYGDQEPREMKHANGHIKFKNSYNRVHVHVTGNCMEWRQDRTSQGGGYLS 690
Query: 691 ADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRGDTFPNFQG 750
DVFVDI EQ WH+NL + N F PLFERIL+IP+ W+KGRATGEVHLCMS+GD+FP+ G
Sbjct: 691 TDVFVDIAEQTWHANLNVVNAFAPLFERILEIPVVWNKGRATGEVHLCMSKGDSFPSIHG 750
Query: 751 QLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFGCAPLEASGDFGINPDEGE 810
QLDV GLAF+I DAPSSF++I ATL FRGQR+F+ NASGWFG AP+EASGDFG+NP++GE
Sbjct: 751 QLDVKGLAFQILDAPSSFSDIVATLSFRGQRVFLHNASGWFGDAPVEASGDFGLNPEDGE 810
Query: 811 FHLMCQVPGVEVNALMKTFKMKPFLFPLAGSVTAVFNCQGPLDSPIFVGSGMVSRKMNNL 870
FHLMCQVP VEVNALMKT KM+P +FPLAG+VTAVFNCQGPLD+P+FVGSG+VSRK ++
Sbjct: 811 FHLMCQVPSVEVNALMKTMKMRPLMFPLAGAVTAVFNCQGPLDAPVFVGSGIVSRKSLSV 870
Query: 871 FSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFSIDNCVADLYGIRANLVDGGE 930
LP S ASEA++++KE GA+AA D IPF++VSANFTF++DNCVADLYGIRA L+DGGE
Sbjct: 871 SGMLP-SAASEAVMQNKESGAVAAFDHIPFTHVSANFTFNLDNCVADLYGIRACLLDGGE 930
Query: 931 IRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHRYMPGYSDWMPLKLGLLNGETKVSG 990
IRGAGN WICPEGE DD+AMD+N SG+I LDK++HRY+PG +PLK+G LNGET++SG
Sbjct: 931 IRGAGNVWICPEGEGDDSAMDINLSGSILLDKVLHRYIPGGIQLIPLKIGELNGETRLSG 990
Query: 991 SLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTSYSDKIMLD 1050
SL+RP+F+I W AP AE SF DARG+I I+HDYI+VNSSSV+F+L + +QTSY D +L
Sbjct: 991 SLIRPKFDIKWAAPNAEDSFSDARGNIVIAHDYIMVNSSSVSFDLNTHIQTSYIDDYLLH 1050
Query: 1051 EEVFDTKRTPSFTIDGVELDLHMRGFEFLSLVSYI-FESPRPMHLKATGRVKFVGKVLRP 1110
+E++ K+ ++GV+LDL MRGFEF + S I F+SPRP+HLKA+GR KF GKV++
Sbjct: 1051 KEMYQRKKIMPLIVEGVDLDLRMRGFEFAHIASSIPFDSPRPLHLKASGRFKFQGKVVKY 1110
Query: 1111 SSKDFSNEKSKHQVQPIDEENK-----NGLAGEVSISGLKLNQLVLAPKLAGLLSMTRES 1170
S +EK+ +Q +++K + L GE+S+SG+KLNQL+LAP+ G LS++ +S
Sbjct: 1111 S--QLVDEKNHGAIQGTIDQSKLENDVSRLVGEISLSGIKLNQLMLAPQSTGFLSISPDS 1170
Query: 1171 IKLDTTGRPDESLSVEI-VGSLKPNSDNSRKSKLFSFNLQRGQLRANARYQPSRSAHLEL 1230
I L+ TGRPDE+ S+E+ V + + +L S LQ+GQLR+N Y P LE+
Sbjct: 1171 IMLNATGRPDENFSIEVNVPLFFGTHEAIQDGRLLSIFLQKGQLRSNICYHPENLTSLEV 1230
Query: 1231 RHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARWSGDV 1290
R+LPLD+LE ASLRG +Q+AE++LN QKRRGHG+LSV+ PKFSG+LGE+LDIAARWSGDV
Sbjct: 1231 RNLPLDELEFASLRGFVQKAELQLNFQKRRGHGLLSVIRPKFSGMLGESLDIAARWSGDV 1290
Query: 1291 ITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMGRWRM 1350
IT+EK++LEQ+NS+YELQGEYV PG+RDR + +S GF++KAM HL S++SSMGRWRM
Sbjct: 1291 ITMEKSVLEQANSKYELQGEYVFPGTRDRFPMESQSNGFIEKAMGGHLGSMMSSMGRWRM 1350
Query: 1351 RLEVPRAEVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTESVQDLIEVIRRQF 1410
RLEVP AEVAEMLPLARLLSRS DP++ SRSK+LF+Q L +VG ES++D ++ +
Sbjct: 1351 RLEVPGAEVAEMLPLARLLSRSTDPAIRSRSKELFMQTLHSVGFNAESLRDQLKALEMYP 1410
Query: 1411 ILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQRVLA 1470
D+ +ED++LPGL+ELRG WRGSLDASGGGNGDTMA+FDF+GEDWEWG YKTQRVLA
Sbjct: 1411 DWLDDDTIEDITLPGLAELRGYWRGSLDASGGGNGDTMADFDFNGEDWEWGTYKTQRVLA 1470
Query: 1471 VGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVIESSA 1530
G++SNNDGLRL+K+FIQKDNAT+HADG++ GP+TNLHFAVLNFPV L+PA VQ IESS
Sbjct: 1471 SGSFSNNDGLRLDKLFIQKDNATLHADGSILGPLTNLHFAVLNFPVGLIPALVQAIESST 1530
Query: 1531 KDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVASLTS 1590
D +H LRQ + PI+GILHMEGDLRG LAKPECDVQ+RLLDG IGG+DLGRAEV+AS+T
Sbjct: 1531 TDSIHFLRQWLTPIKGILHMEGDLRGTLAKPECDVQIRLLDGTIGGIDLGRAEVLASVTP 1590
Query: 1591 GSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGTL-VHAWGKEK-V 1650
SRF+F+A FEP IQ+GHV++QGS+PV +V +N E + D +G + + W K++ +
Sbjct: 1591 TSRFVFDANFEPTIQSGHVNIQGSVPVTYVDSNSIEEDLEGGDGKQGIIRIPVWAKDRGL 1650
Query: 1651 REKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSPHV 1710
++ + RD+ +EGW QLAE LKGL+W++L+ GEVRI+ADIKDGGM L+TALSP+
Sbjct: 1651 TNDISETRIMRDKPDEGWEFQLAESLKGLSWNMLEPGEVRINADIKDGGMTLITALSPYS 1710
Query: 1711 NWLHGNADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLHVRSNRLCINS 1770
NWL G A++LLQV+GT++ P++DGSASFHRA+++SP L PLTNF G +HV SNRLCI+S
Sbjct: 1711 NWLQGYAEVLLQVKGTVDHPVVDGSASFHRATVASPFLRTPLTNFAGNVHVISNRLCISS 1770
Query: 1771 LESRVSRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGSIL 1830
+ESRV R+G+L +KG LPL + E +DKI+LKCEVL++RAKNI SGQVDSQ+Q+TGSIL
Sbjct: 1771 MESRVGRKGRLSMKGTLPLHNIEPSANDKIELKCEVLDIRAKNILSGQVDSQLQVTGSIL 1830
Query: 1831 QPNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKYASFFNSEST 1890
+P++SG I+LS GEAYLPHDKG+GA + + S+P G + V+ + F S ST
Sbjct: 1831 RPDVSGMIRLSHGEAYLPHDKGNGAVATRLSSNKSISVPAGFDQRTVSRDVSHFLGSLST 1890
Query: 1891 ALKTRFRVPRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYPLILNFAVSGEL 1950
+ P + + E+ + + KP++D L+DLKL GPELRI+YPLILNFAVSG+L
Sbjct: 1891 S-------PDGQQSETERTPEHGSFKPNIDARLNDLKLTFGPELRIVYPLILNFAVSGDL 1950
Query: 1951 ELNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLALVGS 2010
ELNG H K I+PKG LTF+NG+VNL+ATQVRLK +HLN+A FEP+ GLDP+LDL LVGS
Sbjct: 1951 ELNGMVHPKYIRPKGVLTFENGEVNLVATQVRLKNDHLNVAKFEPDLGLDPILDLVLVGS 2010
Query: 2011 EWQIRIQSRASKWQEKLVVTSTRSVEQDALSPTEMPSVESSGKTLQADLTSRKPELRATR 2070
EWQ +IQSRAS WQ+ LVVTSTRSV+QD LSP+E A +
Sbjct: 2011 EWQFKIQSRASMWQDNLVVTSTRSVDQDVLSPSE-----------------------AAK 2070
Query: 2071 AFENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTLLSF- 2130
FE+QLAES+LE GQLA +KLATATLE LMPRIEGKGEFGQARWRLVYAPQIP+LLS
Sbjct: 2071 VFESQLAESLLEGDGQLAFKKLATATLETLMPRIEGKGEFGQARWRLVYAPQIPSLLSVD 2130
Query: 2131 PTTDPLLSLTSNISIGTVVEVQLGKRIQEVFKNFKQASMIRQMKETEMAMQWMITYKLTS 2183
PT DPL SL +NIS T VEVQLGKR+ QAS++RQMK++EMAMQW + Y+LTS
Sbjct: 2131 PTVDPLKSLANNISFATEVEVQLGKRL--------QASVVRQMKDSEMAMQWSLIYQLTS 2133
BLAST of MELO3C003666 vs. ExPASy TrEMBL
Match:
A0A1S3CR23 (uncharacterized protein LOC103503795 OS=Cucumis melo OX=3656 GN=LOC103503795 PE=4 SV=1)
HSP 1 Score: 4242.6 bits (11002), Expect = 0.0e+00
Identity = 2153/2184 (98.58%), Postives = 2153/2184 (98.58%), Query Frame = 0
Query: 1 MNVKLDSSFLGTQLHSSLHCVTNGKFVYLGRRRLSKGDSKKYVCAKHNEWNARVERFSRF 60
MNVKLDSSFLGTQLHSSLHCVTNGKFVYLGRRRLSKGDSKKYVCAKHNEWNARVERFSRF
Sbjct: 1 MNVKLDSSFLGTQLHSSLHCVTNGKFVYLGRRRLSKGDSKKYVCAKHNEWNARVERFSRF 60
Query: 61 FGQHLKSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFAAVVSGI 120
FGQHLKSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFAAVVSGI
Sbjct: 61 FGQHLKSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFAAVVSGI 120
Query: 121 CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDGE 180
CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDGE
Sbjct: 121 CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDGE 180
Query: 181 EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLERH 240
EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLERH
Sbjct: 181 EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLERH 240
Query: 241 SSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVV 300
SSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVV
Sbjct: 241 SSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVV 300
Query: 301 GPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKIRHAKSEKYFDVKSPDTRLKFSSRAM 360
GPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKIRHAKSEKYFDVKSPDTRLKFSSRAM
Sbjct: 301 GPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKIRHAKSEKYFDVKSPDTRLKFSSRAM 360
Query: 361 KTLIKGQSKRNASGDDVYVNSFAAKKRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNN 420
KTLIKGQSKRNASGDDVYVNSFAAKKRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNN
Sbjct: 361 KTLIKGQSKRNASGDDVYVNSFAAKKRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNN 420
Query: 421 ANLDSYLIKSGNETNADSSITDTDVQYGKQSLDARLNSLKEKRDIDIPNHIDDQTSTVTG 480
ANLDSYLIKSGNETNADSSITDTDVQYGKQSLDARLNSLKEKRDIDIPNHIDDQTSTVTG
Sbjct: 421 ANLDSYLIKSGNETNADSSITDTDVQYGKQSLDARLNSLKEKRDIDIPNHIDDQTSTVTG 480
Query: 481 LGNKDRRSFSATPSIDESNMRKEDVIGSDHVPDGISDHMRNTSQTPTSTGHEHQHGTSWP 540
LGNKDRRSFSATPSIDESNMRKEDVIGSDHVPDGISDHMRNTSQTPTSTGHEHQHGTSWP
Sbjct: 481 LGNKDRRSFSATPSIDESNMRKEDVIGSDHVPDGISDHMRNTSQTPTSTGHEHQHGTSWP 540
Query: 541 NSFWGLSPESALSYFPKDVAKKLLYHISMYVQNLKSGFVQHARGVIDGGDVMKNKGTNTM 600
NSFWGLSPESALSYFPKDVAKKLLYHISMYVQNLKSGFVQHARGVIDGGDVMKNKGTNTM
Sbjct: 541 NSFWGLSPESALSYFPKDVAKKLLYHISMYVQNLKSGFVQHARGVIDGGDVMKNKGTNTM 600
Query: 601 LPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFV 660
LPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFV
Sbjct: 601 LPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFV 660
Query: 661 SGDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRG 720
SGDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRG
Sbjct: 661 SGDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRG 720
Query: 721 DTFPNFQGQLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFGCAPLEASGDF 780
DTFPNFQGQLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFGCAPLEASGDF
Sbjct: 721 DTFPNFQGQLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFGCAPLEASGDF 780
Query: 781 GINPDEGEFHLMCQVPGVEVNALMKTFKMKPFLFPLAGSVTAVFNCQGPLDSPIFVGSGM 840
GINPDEGEFHLMCQVPGVEVNALMKTFKMKPFLFPLAGSVTAVFNCQGPLDSPIFVGSGM
Sbjct: 781 GINPDEGEFHLMCQVPGVEVNALMKTFKMKPFLFPLAGSVTAVFNCQGPLDSPIFVGSGM 840
Query: 841 VSRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFSIDNCVADLYGIR 900
VSRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFSIDNCVADLYGIR
Sbjct: 841 VSRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFSIDNCVADLYGIR 900
Query: 901 ANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHRYMPGYSDWMPLKLGLL 960
ANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHRYMPGYSDWMPLKLGLL
Sbjct: 901 ANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHRYMPGYSDWMPLKLGLL 960
Query: 961 NGETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTS 1020
NGETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTS
Sbjct: 961 NGETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTS 1020
Query: 1021 YSDKIMLDEEVFDTKRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVKF 1080
YSDKIMLDEEVFDTKRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVKF
Sbjct: 1021 YSDKIMLDEEVFDTKRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVKF 1080
Query: 1081 VGKVLRPSSKDFSNEKSKHQVQPIDEENKNGLAGEVSISGLKLNQLVLAPKLAGLLSMTR 1140
VGKVLRPSSKDFSNEKSKHQVQPIDEENKNGLAGEVSISGLKLNQLVLAPKLAGLLSMTR
Sbjct: 1081 VGKVLRPSSKDFSNEKSKHQVQPIDEENKNGLAGEVSISGLKLNQLVLAPKLAGLLSMTR 1140
Query: 1141 ESIKLDTTGRPDESLSVEIVGSLKPNSDNSRKSKLFSFNLQRGQLRANARYQPSRSAHLE 1200
ESIKLDTTGRPDESLSVEIVGSLKPNSDNSRKSKLFSFNLQRGQLRANARYQPSRSAHLE
Sbjct: 1141 ESIKLDTTGRPDESLSVEIVGSLKPNSDNSRKSKLFSFNLQRGQLRANARYQPSRSAHLE 1200
Query: 1201 LRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARWSGD 1260
LRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARWSGD
Sbjct: 1201 LRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARWSGD 1260
Query: 1261 VITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMGRWR 1320
VITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMGRWR
Sbjct: 1261 VITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMGRWR 1320
Query: 1321 MRLEVPRAEVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTESVQDLIEVIRRQ 1380
MRLEVPRAEVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTESVQDLIEVIRRQ
Sbjct: 1321 MRLEVPRAEVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTESVQDLIEVIRRQ 1380
Query: 1381 FILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQRVL 1440
FILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQRVL
Sbjct: 1381 FILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQRVL 1440
Query: 1441 AVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVIESS 1500
AVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVIESS
Sbjct: 1441 AVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVIESS 1500
Query: 1501 AKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVASLT 1560
AKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVASLT
Sbjct: 1501 AKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVASLT 1560
Query: 1561 SGSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGTLVHAWGKEKVR 1620
SGSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGTLVHAWGKEKVR
Sbjct: 1561 SGSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGTLVHAWGKEKVR 1620
Query: 1621 EKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSPHVN 1680
EKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSPHVN
Sbjct: 1621 EKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSPHVN 1680
Query: 1681 WLHGNADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLHVRSNRLCINSL 1740
WLHGNADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLHVRSNRLCINSL
Sbjct: 1681 WLHGNADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLHVRSNRLCINSL 1740
Query: 1741 ESRVSRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGSILQ 1800
ESRVSRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGSILQ
Sbjct: 1741 ESRVSRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGSILQ 1800
Query: 1801 PNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKYASFFNSESTA 1860
PNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKYASFFNSESTA
Sbjct: 1801 PNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKYASFFNSESTA 1860
Query: 1861 LKTRFRVPRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYPLILNFAVSGELE 1920
LKTRFRVPRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYPLILNFAVSGELE
Sbjct: 1861 LKTRFRVPRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYPLILNFAVSGELE 1920
Query: 1921 LNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLALVGSE 1980
LNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLALVGSE
Sbjct: 1921 LNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLALVGSE 1980
Query: 1981 WQIRIQSRASKWQEKLVVTSTRSVEQDALSPTEMPSVESSGKTLQADLTSRKPELRATRA 2040
WQIRIQSRASKWQEKLVVTSTRSVEQDALSPTE ATRA
Sbjct: 1981 WQIRIQSRASKWQEKLVVTSTRSVEQDALSPTE-----------------------ATRA 2040
Query: 2041 FENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTLLSFPT 2100
FENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTLLSFPT
Sbjct: 2041 FENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTLLSFPT 2100
Query: 2101 TDPLLSLTSNISIGTVVEVQLGKRIQEVFKNFKQASMIRQMKETEMAMQWMITYKLTSRL 2160
TDPLLSLTSNISIGTVVEVQLGKRI QASMIRQMKETEMAMQWMITYKLTSRL
Sbjct: 2101 TDPLLSLTSNISIGTVVEVQLGKRI--------QASMIRQMKETEMAMQWMITYKLTSRL 2153
Query: 2161 RMVLQSAPAQRTLLLVEYSATSLD 2185
RMVLQSAPAQRTLLLVEYSATSLD
Sbjct: 2161 RMVLQSAPAQRTLLLVEYSATSLD 2153
BLAST of MELO3C003666 vs. ExPASy TrEMBL
Match:
A0A6J1F7B9 (uncharacterized protein LOC111442799 OS=Cucurbita moschata OX=3662 GN=LOC111442799 PE=4 SV=1)
HSP 1 Score: 3807.3 bits (9872), Expect = 0.0e+00
Identity = 1933/2191 (88.22%), Postives = 2025/2191 (92.42%), Query Frame = 0
Query: 2 NVKLDSSFLGTQLHSSLHCVTNGKFVYLGRRRLSKGDSKKYVCAKHNEWNARVERFSRFF 61
NVKLDSSF TQLHSSL+C+ NG FV + R RLSK DSKKY+CAKHN+WNARV+RFSRF
Sbjct: 3 NVKLDSSFFATQLHSSLYCIKNGNFVCVRRGRLSKRDSKKYICAKHNDWNARVDRFSRFC 62
Query: 62 GQHLKSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFAAVVSGIC 121
GQHLKS+S+KL+PRHESLMKCANEP VQTK+LSS LRP+ NEGLFLIRCSAF AVVSGIC
Sbjct: 63 GQHLKSISLKLRPRHESLMKCANEPSVQTKALSSFLRPLRNEGLFLIRCSAFVAVVSGIC 122
Query: 122 LLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDGEE 181
LLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPD EE
Sbjct: 123 LLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDDEE 182
Query: 182 FSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLERHS 241
FSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPS VVVQKRDYTWLGLPFPSEGTL+RHS
Sbjct: 183 FSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSAVVVQKRDYTWLGLPFPSEGTLQRHS 242
Query: 242 SSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVVG 301
SSEEGIDNRTKIRRIARE AAA WSKDRDDAAREAAEMGFVV DRSSGLYD+S+ KE VG
Sbjct: 243 SSEEGIDNRTKIRRIAREEAAACWSKDRDDAAREAAEMGFVVSDRSSGLYDSSNLKEDVG 302
Query: 302 PTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKIRHAKSEKYFDVKSPDTRLKFSSRAMK 361
P VD+ NSK F F DENVHSREH CMDTDVDYKI+HA +EKYFDVKSP +RLKF SR MK
Sbjct: 303 PAVDVENSKAFLFMDENVHSREHRCMDTDVDYKIKHANAEKYFDVKSPGSRLKFLSRVMK 362
Query: 362 TLIKGQSKRNASGDDVYVNSFAAKKRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNNA 421
IKGQSKR ASGD+VYVN+F AKKRILRRSTLAAQDYFK ASE KF EPS+LH+S NN
Sbjct: 363 VPIKGQSKRKASGDNVYVNNFMAKKRILRRSTLAAQDYFKAASEVKFSEPSELHRSLNNV 422
Query: 422 NLDSYLIKSGNETNADSSITDTDVQYGKQSLDARLNSLKEKRDIDIPNHIDDQTSTVTGL 481
NLD+YL+KS NETNADSS+ +TD QYGKQ L A L SL+E+ IDIPNHIDDQ STVTGL
Sbjct: 423 NLDAYLVKSVNETNADSSVMNTDAQYGKQRLYAGLPSLEEEGGIDIPNHIDDQISTVTGL 482
Query: 482 GNKDRRSFSATPSIDESNMRKEDVIGSDHVPDGISDHMRNTSQTPTSTGHEHQHGTSWPN 541
GNKDRR FS TPSI+ESN++ +DV+GSDH+PDGISD M +TSQ PTSTGHEHQ GTS P
Sbjct: 483 GNKDRRFFSVTPSINESNVKNDDVVGSDHIPDGISDQMCHTSQAPTSTGHEHQSGTSGPT 542
Query: 542 SFWGLSPESALSYFPKDVAKKLLYHISMYVQNLKSGFVQHARGVIDGGDVMKNKGTNTML 601
SFW +SP+SALSYFPKD KLLYH++MY +NLK G VQH+R +++GGDVMKNKGT ML
Sbjct: 543 SFWAMSPKSALSYFPKDAGTKLLYHLAMYFKNLKFGLVQHSRVIVNGGDVMKNKGTEAML 602
Query: 602 PVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFVS 661
PVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCK+WRS+ VS
Sbjct: 603 PVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKTWRSDSVS 662
Query: 662 GDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRGD 721
GDGGWLSADVFVDIFEQ+WHSNLKITN+FVPLFERILDIPITWSKGRATGEVHLCMSRGD
Sbjct: 663 GDGGWLSADVFVDIFEQQWHSNLKITNLFVPLFERILDIPITWSKGRATGEVHLCMSRGD 722
Query: 722 TFPNFQGQLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFGCAPLEASGDFG 781
TFPNFQGQL+VTGLAFKIFDAPSSFTE+AA+LCFRGQRIFVQNASGWFG APLEASGDFG
Sbjct: 723 TFPNFQGQLEVTGLAFKIFDAPSSFTEMAASLCFRGQRIFVQNASGWFGSAPLEASGDFG 782
Query: 782 INPDEGEFHLMCQVPGVEVNALMKTFKMKPFLFPLAGSVTAVFNCQGPLDSPIFVGSGMV 841
I+P+EGEFHLMCQVP VEVNALMKTFKM+PFLFPLAGSVTAVFNCQGPLDSPIFVGSGMV
Sbjct: 783 IHPEEGEFHLMCQVPCVEVNALMKTFKMRPFLFPLAGSVTAVFNCQGPLDSPIFVGSGMV 842
Query: 842 SRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFSIDNCVADLYGIRA 901
SRKMN+ SD+PASCASEAIVKSKE GAIAAVDRIPFSYVSANFTF+IDNCVADLYGIRA
Sbjct: 843 SRKMNHSISDIPASCASEAIVKSKEAGAIAAVDRIPFSYVSANFTFNIDNCVADLYGIRA 902
Query: 902 NLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHRYMPGYSDWMPLKLGLLN 961
NLVDGGEIRGAGNAWICPEGELDDTAMDLN SGNIS DKIMHRYMPGY D MPLKLGLLN
Sbjct: 903 NLVDGGEIRGAGNAWICPEGELDDTAMDLNISGNISFDKIMHRYMPGYLDLMPLKLGLLN 962
Query: 962 GETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTSY 1021
GETKVSGSL RPRFNINWTAPLAEGSFRDARGDINISHDY IVNSSSVAFELFSK+QTSY
Sbjct: 963 GETKVSGSLTRPRFNINWTAPLAEGSFRDARGDINISHDYFIVNSSSVAFELFSKMQTSY 1022
Query: 1022 SDKIMLDEEVFDTKRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVKFV 1081
SD+ MLDEE FD KRTPS TIDGVELDLHMRGFEFLSLVSYIFESPRP HLKATGRVKFV
Sbjct: 1023 SDENMLDEEAFDAKRTPSCTIDGVELDLHMRGFEFLSLVSYIFESPRPTHLKATGRVKFV 1082
Query: 1082 GKVLRP----SSKDFSNEKSKHQVQPIDEENKNGLAGEVSISGLKLNQLVLAPKLAGLLS 1141
GKVLRP SS+DF+ EKS QVQ I +ENKN LAGEVSISGLKLNQL+LAPKLAG LS
Sbjct: 1083 GKVLRPSASNSSQDFNTEKSNQQVQTIGDENKNSLAGEVSISGLKLNQLILAPKLAGQLS 1142
Query: 1142 MTRESIKLDTTGRPDESLSVEIVGSLKPNSDNSRKSKLFSFNLQRGQLRANARYQPSRSA 1201
MTRESIKLD TGRPDESLSVEIVGSLKP SDNS KSKLFSFNLQRGQLRAN YQP RSA
Sbjct: 1143 MTRESIKLDATGRPDESLSVEIVGSLKPGSDNSIKSKLFSFNLQRGQLRANVCYQPFRSA 1202
Query: 1202 HLELRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARW 1261
HLELRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVL PKFSGVLGEALDIAARW
Sbjct: 1203 HLELRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLGPKFSGVLGEALDIAARW 1262
Query: 1262 SGDVITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMG 1321
SGDVITIEKTILEQSNSRYELQGEYVLPGSRDRNVT KE+ GFLKKAMASHLSSVISSMG
Sbjct: 1263 SGDVITIEKTILEQSNSRYELQGEYVLPGSRDRNVTGKETHGFLKKAMASHLSSVISSMG 1322
Query: 1322 RWRMRLEVPRAEVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTESVQDLIEVI 1381
RWRMRLEVPRAEVAEMLPLARLLSRS DPSVHSRSKDLFIQ+LQAVGLYTESVQ+LIEVI
Sbjct: 1323 RWRMRLEVPRAEVAEMLPLARLLSRSTDPSVHSRSKDLFIQSLQAVGLYTESVQELIEVI 1382
Query: 1382 RRQFILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQ 1441
RRQFILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWG YKTQ
Sbjct: 1383 RRQFILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGTYKTQ 1442
Query: 1442 RVLAVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVI 1501
RVLA GAYSNNDGLRLEKIFIQKDNAT+HADGTLFGPITNLHFAVLNFPVSLVP VQVI
Sbjct: 1443 RVLAGGAYSNNDGLRLEKIFIQKDNATIHADGTLFGPITNLHFAVLNFPVSLVPTVVQVI 1502
Query: 1502 ESSAKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVA 1561
ESSAKDLVHSLRQLV PIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGG+DLGRAEVVA
Sbjct: 1503 ESSAKDLVHSLRQLVTPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGIDLGRAEVVA 1562
Query: 1562 SLTSGSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGTLVHAWGKE 1621
SLTS SR LFNAKFEPIIQNGHVHVQGSIPV+F QN++ EVEE+ETDTSR TL+HAWGKE
Sbjct: 1563 SLTSSSRLLFNAKFEPIIQNGHVHVQGSIPVLFFQNSVTEVEELETDTSRATLIHAWGKE 1622
Query: 1622 KVREKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSP 1681
KVR+KFNDRKSSR+RNEEGWNTQLAEGLKGLNW+LLDVGEVR+DADIKDGGMLLLTALSP
Sbjct: 1623 KVRDKFNDRKSSRERNEEGWNTQLAEGLKGLNWNLLDVGEVRVDADIKDGGMLLLTALSP 1682
Query: 1682 HVNWLHGNADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLHVRSNRLCI 1741
HVNWLHGNADILLQV+GTIEEP+LDGSASFHRASISSPVLPKPL NFGGT+HVRSNRLCI
Sbjct: 1683 HVNWLHGNADILLQVKGTIEEPVLDGSASFHRASISSPVLPKPLINFGGTVHVRSNRLCI 1742
Query: 1742 NSLESRVSRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGS 1801
NSLESRVSRRGKLI+KGNLPLRSSEA L DKIDLKCEVLEVRAKNIFSGQVDSQMQITGS
Sbjct: 1743 NSLESRVSRRGKLIVKGNLPLRSSEASLGDKIDLKCEVLEVRAKNIFSGQVDSQMQITGS 1802
Query: 1802 ILQPNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKYAS-FFNS 1861
ILQPNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFS P GSSNQVVASKYAS FF+S
Sbjct: 1803 ILQPNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSPPTGSSNQVVASKYASPFFSS 1862
Query: 1862 ESTALKTRFRVPRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYPLILNFAVS 1921
ESTALKTRF PRDKA D EKESRNVN+KPSVDVSLSDLKLVLGPELRILYPLILNFAVS
Sbjct: 1863 ESTALKTRFDAPRDKAADSEKESRNVNIKPSVDVSLSDLKLVLGPELRILYPLILNFAVS 1922
Query: 1922 GELELNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLAL 1981
GELELNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLN+ATFEPENGLDPMLDLAL
Sbjct: 1923 GELELNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNVATFEPENGLDPMLDLAL 1982
Query: 1982 VGSEWQIRIQSRASKWQEKLVVTSTRSVEQDALSPTEMPSVESSGKTLQADLTSRKPELR 2041
VGSEWQIRIQSRASKWQ+KLVVTSTRSVEQDALSPTE
Sbjct: 1983 VGSEWQIRIQSRASKWQDKLVVTSTRSVEQDALSPTE----------------------- 2042
Query: 2042 ATRAFENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTLL 2101
A RAFENQLAESILES GQLAL+KLATATLEKLMPRIEGKGEFGQARWRLVYAPQIP+LL
Sbjct: 2043 AARAFENQLAESILESDGQLALKKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPSLL 2102
Query: 2102 SFPTTDPLLSLTSNISIGTVVEVQLGKRIQEVFKNFKQASMIRQMKETEMAMQWMITYKL 2161
SFPTTDPL SLTSNIS GTVVEVQLGKRI QAS++RQMKE+EMAMQW TYKL
Sbjct: 2103 SFPTTDPLKSLTSNISFGTVVEVQLGKRI--------QASIVRQMKESEMAMQWTFTYKL 2162
Query: 2162 TSRLRMVLQS---APAQRTLLLVEYSATSLD 2185
TSRLRMVLQS APAQRTL+LVEYSA+SLD
Sbjct: 2163 TSRLRMVLQSGPAAPAQRTLVLVEYSASSLD 2162
BLAST of MELO3C003666 vs. ExPASy TrEMBL
Match:
A0A6J1III6 (uncharacterized protein LOC111476591 OS=Cucurbita maxima OX=3661 GN=LOC111476591 PE=4 SV=1)
HSP 1 Score: 3806.5 bits (9870), Expect = 0.0e+00
Identity = 1934/2199 (87.95%), Postives = 2029/2199 (92.27%), Query Frame = 0
Query: 2 NVKLDSSFLGTQLHSSLHCVTNGKFVYLGRRRLSKGDSKKYVCAKHNEWNARVERFSRFF 61
NVKLDSSF TQLHSSL+C+ NG FVY+ R +LSK DSKKY+CAKHN+WNARV+RFSRF
Sbjct: 3 NVKLDSSFFATQLHSSLYCIKNGNFVYVRRGQLSKRDSKKYICAKHNDWNARVDRFSRFC 62
Query: 62 GQHLKSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFAAVVSGIC 121
GQHLKS+S+KL+PRHESLMKCANEP VQTK+LSS LRP+ NEGLFLIRCSAF AVVSGIC
Sbjct: 63 GQHLKSISLKLRPRHESLMKCANEPSVQTKALSSFLRPLRNEGLFLIRCSAFVAVVSGIC 122
Query: 122 LLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDGEE 181
LLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPD EE
Sbjct: 123 LLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDDEE 182
Query: 182 FSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLERHS 241
FSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPS VVVQKRDYTWLGLPFPSEGTL+RHS
Sbjct: 183 FSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSAVVVQKRDYTWLGLPFPSEGTLQRHS 242
Query: 242 SSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVVG 301
SSEEGIDNRTKIRRIARE AAA WSKDRDDAAREAAEMGFVV DRSSGLYD+S+ KE VG
Sbjct: 243 SSEEGIDNRTKIRRIAREEAAACWSKDRDDAAREAAEMGFVVSDRSSGLYDSSNLKEDVG 302
Query: 302 PTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKIRHAKSEKYFDVKSPDTRLKFSSRAMK 361
PTVD+ NSK F F DENVHSREH CMDTDVDYKI+HA +EKYFDVKSP +RLKF SR MK
Sbjct: 303 PTVDVENSKAFLFMDENVHSREHRCMDTDVDYKIKHANAEKYFDVKSPGSRLKFLSRVMK 362
Query: 362 TLIKGQSKRNASGDDVYVNSFAAKKRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNNA 421
IKGQSKR ASGD+VYVN+F AKKRILRRSTLAAQDYFK ASE KFGEPS+LH+SFNN
Sbjct: 363 VPIKGQSKRKASGDNVYVNNFMAKKRILRRSTLAAQDYFKAASEVKFGEPSELHRSFNNV 422
Query: 422 NLDSYLIKSGNETNADSSITDTDVQYGKQSLDARLNSLKEKRDIDIPNHIDDQTSTVTGL 481
NLD+YL+KS NETNADSS+ +TD QYGKQ L A SL+E+ IDIPNHIDDQ STVTGL
Sbjct: 423 NLDAYLVKSVNETNADSSVMNTDAQYGKQRLYAGWPSLEEEGGIDIPNHIDDQISTVTGL 482
Query: 482 GNKDRRSFSATPSIDESNMRKEDVIGSDHVPDGISDH--------MRNTSQTPTSTGHEH 541
GNKDRR FS TPSI+ESN++ +DV+GSDH+PDG+SDH M +TSQ PTSTGHEH
Sbjct: 483 GNKDRRFFSVTPSINESNVKNDDVVGSDHIPDGVSDHIPDGVSDQMCHTSQAPTSTGHEH 542
Query: 542 QHGTSWPNSFWGLSPESALSYFPKDVAKKLLYHISMYVQNLKSGFVQHARGVIDGGDVMK 601
Q GTS P SFW +SP+SALSYFPKD KLLYH++MY +NLK G VQH+R +++GGDVMK
Sbjct: 543 QSGTSGPTSFWAMSPKSALSYFPKDAGTKLLYHLAMYFKNLKFGLVQHSRVIVNGGDVMK 602
Query: 602 NKGTNTMLPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCK 661
NKGT MLPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCK
Sbjct: 603 NKGTEAMLPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCK 662
Query: 662 SWRSEFVSGDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEV 721
+WRS+ VSGDGGWLSADVFVDIFEQ+WHSNLKITN+FVPLFERILDIPITWSKGRATGEV
Sbjct: 663 TWRSDSVSGDGGWLSADVFVDIFEQQWHSNLKITNLFVPLFERILDIPITWSKGRATGEV 722
Query: 722 HLCMSRGDTFPNFQGQLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFGCAP 781
HLCMSRGDTFPNFQGQLDVTGLAFKIFDAPSSFTE+AA+LCFRGQRIFVQNASGWFG AP
Sbjct: 723 HLCMSRGDTFPNFQGQLDVTGLAFKIFDAPSSFTEMAASLCFRGQRIFVQNASGWFGSAP 782
Query: 782 LEASGDFGINPDEGEFHLMCQVPGVEVNALMKTFKMKPFLFPLAGSVTAVFNCQGPLDSP 841
LEASGDFGI+P+EGEFHLMCQVP VEVNALMKTFKM+PFLFPLAGSVTAVFNCQGPLDSP
Sbjct: 783 LEASGDFGIHPEEGEFHLMCQVPCVEVNALMKTFKMRPFLFPLAGSVTAVFNCQGPLDSP 842
Query: 842 IFVGSGMVSRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFSIDNCV 901
IFVGSGMVSRKMN+ SD+PASCASEAIVKSKE GAIAAVDRIPFSYVSANFTF+IDNCV
Sbjct: 843 IFVGSGMVSRKMNHSISDIPASCASEAIVKSKEAGAIAAVDRIPFSYVSANFTFNIDNCV 902
Query: 902 ADLYGIRANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHRYMPGYSDWM 961
ADLYGIRANLVDGGEIRGAGNAWICPEGELDDTAMDLN SGNIS DKIMHRYMPGY D M
Sbjct: 903 ADLYGIRANLVDGGEIRGAGNAWICPEGELDDTAMDLNISGNISFDKIMHRYMPGYLDLM 962
Query: 962 PLKLGLLNGETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFEL 1021
PLKLGLLNGETKVSGSL RPRFNINWTAPLAEGSFRDARGDINISHDY IVNSSSVAFEL
Sbjct: 963 PLKLGLLNGETKVSGSLTRPRFNINWTAPLAEGSFRDARGDINISHDYFIVNSSSVAFEL 1022
Query: 1022 FSKVQTSYSDKIMLDEEVFDTKRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLK 1081
FSK+QTSYSD+ MLDEE FD KRTPS TIDGVELDLHMRGFEFLSLVSYIFESPRP HLK
Sbjct: 1023 FSKMQTSYSDENMLDEEAFDAKRTPSCTIDGVELDLHMRGFEFLSLVSYIFESPRPTHLK 1082
Query: 1082 ATGRVKFVGKVLRP----SSKDFSNEKSKHQVQPIDEENKNGLAGEVSISGLKLNQLVLA 1141
ATGRVKFVGKVLRP SS+DF+ EKS QVQ I +ENKN LAGEVSISGLKLNQL+LA
Sbjct: 1083 ATGRVKFVGKVLRPSASNSSQDFNIEKSYQQVQTIGDENKNSLAGEVSISGLKLNQLILA 1142
Query: 1142 PKLAGLLSMTRESIKLDTTGRPDESLSVEIVGSLKPNSDNSRKSKLFSFNLQRGQLRANA 1201
PKLAG LSMTRESIKLD TGRPDESLSVEIVGSLKP SDNS KSKLFSFNLQRGQLRAN
Sbjct: 1143 PKLAGQLSMTRESIKLDATGRPDESLSVEIVGSLKPGSDNSIKSKLFSFNLQRGQLRANV 1202
Query: 1202 RYQPSRSAHLELRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGE 1261
YQP RSAHLELRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVL PKFSGVLGE
Sbjct: 1203 CYQPFRSAHLELRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLGPKFSGVLGE 1262
Query: 1262 ALDIAARWSGDVITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHL 1321
ALDIAARWSGDVITIEKTILEQSNSRYELQGEYVLPGSRDRNVT KE+ GFLKKAMASHL
Sbjct: 1263 ALDIAARWSGDVITIEKTILEQSNSRYELQGEYVLPGSRDRNVTGKETHGFLKKAMASHL 1322
Query: 1322 SSVISSMGRWRMRLEVPRAEVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTES 1381
SSVISSMGRWRMRLEVPRAEVAEMLPLARLLSRS DPSVHSRSKDLFI++LQAVGLYTES
Sbjct: 1323 SSVISSMGRWRMRLEVPRAEVAEMLPLARLLSRSTDPSVHSRSKDLFIRSLQAVGLYTES 1382
Query: 1382 VQDLIEVIRRQFILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDW 1441
VQ+LIEVIRRQFILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDW
Sbjct: 1383 VQELIEVIRRQFILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDW 1442
Query: 1442 EWGVYKTQRVLAVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSL 1501
EWG YKTQRVLA GAYSNNDGLRLEKIFIQKDNAT+HADGTLFGPITNLHFAVLNFPVSL
Sbjct: 1443 EWGTYKTQRVLAGGAYSNNDGLRLEKIFIQKDNATIHADGTLFGPITNLHFAVLNFPVSL 1502
Query: 1502 VPAAVQVIESSAKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVD 1561
VP VQVIESSAKDLVHSLRQLV PIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGG+D
Sbjct: 1503 VPTVVQVIESSAKDLVHSLRQLVTPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGID 1562
Query: 1562 LGRAEVVASLTSGSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGT 1621
LGRAEVVASLTS SR LFNAKFEPIIQNGHVHVQGSIPV+F QN++ EVEE+ETDTSR T
Sbjct: 1563 LGRAEVVASLTSSSRLLFNAKFEPIIQNGHVHVQGSIPVLFFQNSVTEVEELETDTSRAT 1622
Query: 1622 LVHAWGKEKVREKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGM 1681
L+HAWGKEKVR+KFNDRKSSR+RNEEGWNTQLAEGLKGLNW+LLDVGEVR+DADIKDGGM
Sbjct: 1623 LIHAWGKEKVRDKFNDRKSSRERNEEGWNTQLAEGLKGLNWNLLDVGEVRVDADIKDGGM 1682
Query: 1682 LLLTALSPHVNWLHGNADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLH 1741
LLLTALSPHVNWLHGNADILLQV+GTIEEP+LDGSASFHRASISSPVLPKPL NFGGT+H
Sbjct: 1683 LLLTALSPHVNWLHGNADILLQVKGTIEEPVLDGSASFHRASISSPVLPKPLINFGGTVH 1742
Query: 1742 VRSNRLCINSLESRVSRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVD 1801
VRSNRLCINSLESRVSRRGKLI+KGNLPLRSSEA L DKIDLKCEVLEVRAKNIFSGQVD
Sbjct: 1743 VRSNRLCINSLESRVSRRGKLIVKGNLPLRSSEASLGDKIDLKCEVLEVRAKNIFSGQVD 1802
Query: 1802 SQMQITGSILQPNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASK 1861
SQMQITGSILQPNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFS P GSSNQ+VASK
Sbjct: 1803 SQMQITGSILQPNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSPPTGSSNQIVASK 1862
Query: 1862 YAS-FFNSESTALKTRFRVPRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYP 1921
YAS FF+SESTALKTRF PRDKA D EKESRNVN+KPSVDVSLSDLKLVLGPELRILYP
Sbjct: 1863 YASPFFSSESTALKTRFDAPRDKAADSEKESRNVNIKPSVDVSLSDLKLVLGPELRILYP 1922
Query: 1922 LILNFAVSGELELNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGL 1981
LILNFAVSGELELNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLN+ATFEPENGL
Sbjct: 1923 LILNFAVSGELELNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNVATFEPENGL 1982
Query: 1982 DPMLDLALVGSEWQIRIQSRASKWQEKLVVTSTRSVEQDALSPTEMPSVESSGKTLQADL 2041
DPMLDLALVGSEWQIRIQSRASKWQ+KLVVTSTRSVEQDALSPTE
Sbjct: 1983 DPMLDLALVGSEWQIRIQSRASKWQDKLVVTSTRSVEQDALSPTE--------------- 2042
Query: 2042 TSRKPELRATRAFENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVY 2101
A RAFENQLAESILES GQLAL+KLATATLEKLMPRIEGKGEFGQARWRLVY
Sbjct: 2043 --------AARAFENQLAESILESDGQLALKKLATATLEKLMPRIEGKGEFGQARWRLVY 2102
Query: 2102 APQIPTLLSFPTTDPLLSLTSNISIGTVVEVQLGKRIQEVFKNFKQASMIRQMKETEMAM 2161
APQIP+LLSFPTTDPL SLTSNIS GTVVEVQLGKRI QAS++RQMKE+EMAM
Sbjct: 2103 APQIPSLLSFPTTDPLKSLTSNISFGTVVEVQLGKRI--------QASIVRQMKESEMAM 2162
Query: 2162 QWMITYKLTSRLRMVLQS---APAQRTLLLVEYSATSLD 2185
QW TYKLTSRLRMVLQS APAQRTL+LVEYSA+SLD
Sbjct: 2163 QWTFTYKLTSRLRMVLQSGPAAPAQRTLVLVEYSASSLD 2170
BLAST of MELO3C003666 vs. ExPASy TrEMBL
Match:
A0A6J1BWB5 (uncharacterized protein LOC111006106 OS=Momordica charantia OX=3673 GN=LOC111006106 PE=4 SV=1)
HSP 1 Score: 3642.4 bits (9444), Expect = 0.0e+00
Identity = 1860/2190 (84.93%), Postives = 1989/2190 (90.82%), Query Frame = 0
Query: 1 MNVKLDSSFLGTQLHSSLHCVTNGKFVYLGRRRLSKGDSKKYVCAKHNEWNARVERFSRF 60
MNVKLDSSF GT HSSLHC+ NGKFVYL R +LSK DSKKY+ AKHN+WNARV+RFSRF
Sbjct: 1 MNVKLDSSFFGTPFHSSLHCMNNGKFVYLRRGQLSKRDSKKYIHAKHNDWNARVDRFSRF 60
Query: 61 FGQHLKSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFAAVVSGI 120
GQ LKSLSIKL PRHE LMKCANEP QTK+LSS LRP+WNEGLF IRCSAF AV+SG+
Sbjct: 61 CGQQLKSLSIKLGPRHEYLMKCANEPLSQTKALSSFLRPLWNEGLFFIRCSAFVAVISGV 120
Query: 121 CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDGE 180
CLLVWYGQ KAKGF EAKLLPSVCKAVS+CIQRD+DFGKVRSISPLSITLESCSVGPD E
Sbjct: 121 CLLVWYGQAKAKGFAEAKLLPSVCKAVSECIQRDIDFGKVRSISPLSITLESCSVGPDDE 180
Query: 181 EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLERH 240
EFS GEVP+MKLRVLPFTSLRRGRVIIDVVLSHPSV+VVQKRDYTWLG+PFPS+GTL+RH
Sbjct: 181 EFSNGEVPSMKLRVLPFTSLRRGRVIIDVVLSHPSVLVVQKRDYTWLGIPFPSKGTLQRH 240
Query: 241 SSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVV 300
SSSEEGIDNRTKIRRIARE+AAA WSKDRDDAAREAAEMGFVV DRSSGLYD+S KE V
Sbjct: 241 SSSEEGIDNRTKIRRIAREDAAARWSKDRDDAAREAAEMGFVVSDRSSGLYDSSAPKEDV 300
Query: 301 GPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKIRHAKSEKYFDVKSPDTRLKFSSRAM 360
GPTVDI NSKTFF D NVHSREHHCMDTDVDYKI+HA SEKYF+VKSPD RLKF SR M
Sbjct: 301 GPTVDIENSKTFFCTDGNVHSREHHCMDTDVDYKIKHASSEKYFNVKSPDVRLKFLSRVM 360
Query: 361 KTLIKGQSKRNASGDDVYVNSFAAKKRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNN 420
K K QSKR ASGDDVYVNSF AKKRIL RSTLAAQ+YFKG S+GKFGEPS L++S NN
Sbjct: 361 KAPKKDQSKRKASGDDVYVNSFTAKKRILSRSTLAAQEYFKGTSQGKFGEPSPLYRSLNN 420
Query: 421 ANLDSYLIKSGNETNADSSITDTDVQYGKQSLDARLNSLKEKRDIDIPNHIDDQTSTVTG 480
NLD YL++S +ETN DSSI +TDV+YGK+SLD+ L+S E+ DI I N IDDQ +T+TG
Sbjct: 421 VNLDPYLVESVHETNVDSSIMNTDVKYGKESLDSILHSCNEEEDIGISNLIDDQIATITG 480
Query: 481 LGNKDRRSFSATPSIDESNMRKEDV-IGSDHVPDGISDHMRNTSQTPTSTGHEHQHGTSW 540
LG+K+ RSFS T S +ESN++ +DV +GSDH+P+GISD M +TSQTPTST EHQ+GT
Sbjct: 481 LGSKE-RSFSVTSSSNESNVKNDDVDVGSDHIPNGISDQMCHTSQTPTSTIDEHQNGTPG 540
Query: 541 PNSFWGLSPESALSYFPKDVAKKLLYHISMYVQNLKSGFVQHARGVIDGGDVMKNKGTNT 600
LSP+SALSYF KDV K+LLYH+SM+ Q LK G VQ+A+ ++DGGDV KN+GT
Sbjct: 541 QIPILTLSPKSALSYFAKDVGKQLLYHLSMHSQKLKFGLVQYAKSIVDGGDVEKNEGTEM 600
Query: 601 MLPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEF 660
MLPVTID+VHF+GGT+MLLAYGDREPREMENV+GHVKFQNHYGNVHVHLSGNCK+WRS+
Sbjct: 601 MLPVTIDAVHFRGGTVMLLAYGDREPREMENVDGHVKFQNHYGNVHVHLSGNCKTWRSDI 660
Query: 661 VSGDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSR 720
VS DGGWLSADVFVDI EQ+WHSNLKITN+FVPLFERILDIPITW+KGRATGEVH+CMSR
Sbjct: 661 VSEDGGWLSADVFVDIIEQKWHSNLKITNLFVPLFERILDIPITWTKGRATGEVHMCMSR 720
Query: 721 GDTFPNFQGQLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFGCAPLEASGD 780
GDTFPNFQGQLDVTGLAFKIFDAPSSFTE+ A+LCFRGQRIFVQNASGWFG APLEASGD
Sbjct: 721 GDTFPNFQGQLDVTGLAFKIFDAPSSFTEMVASLCFRGQRIFVQNASGWFGSAPLEASGD 780
Query: 781 FGINPDEGEFHLMCQVPGVEVNALMKTFKMKPFLFPLAGSVTAVFNCQGPLDSPIFVGSG 840
FGI+P+EGEFHLMCQVP VEVNALMKTFKM+PFLFPLAGSVTAVFNCQGPLDSPIFVGSG
Sbjct: 781 FGIHPEEGEFHLMCQVPRVEVNALMKTFKMRPFLFPLAGSVTAVFNCQGPLDSPIFVGSG 840
Query: 841 MVSRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFSIDNCVADLYGI 900
MVSRKMN SDLPASCASEAIVKSKE GAIAAVDRIPFSYVSANFTF+IDNCVADLYGI
Sbjct: 841 MVSRKMNQSISDLPASCASEAIVKSKEAGAIAAVDRIPFSYVSANFTFNIDNCVADLYGI 900
Query: 901 RANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHRYMPGYSDWMPLKLGL 960
RAN+VDGGEIRGAGNAWICPEGELDDTAMDLNFSGN+S DKI+HRYMPG D MPLKLGL
Sbjct: 901 RANIVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNLSFDKILHRYMPGDLDVMPLKLGL 960
Query: 961 LNGETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQT 1020
LNGETKVSGSL RPRFNINWTAPLAEGSFRDARGDINISHD IIVNSSSVAFEL+SK+QT
Sbjct: 961 LNGETKVSGSLFRPRFNINWTAPLAEGSFRDARGDINISHDCIIVNSSSVAFELYSKMQT 1020
Query: 1021 SYSDKIMLDEEVFDTKRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVK 1080
SYSD+ MLDEEVF KRTPS TIDGVELDLHMRGFEFLSLVSYIFES RPMHLKATGR+K
Sbjct: 1021 SYSDENMLDEEVF-AKRTPSCTIDGVELDLHMRGFEFLSLVSYIFESQRPMHLKATGRIK 1080
Query: 1081 FVGKVLRPSSK----DFSNEKSKHQVQPIDEENKNGLAGEVSISGLKLNQLVLAPKLAGL 1140
FVGKVLRPSS+ DF EKSK Q+Q IDE K+ LAGEVSISGLKLNQL+LAPKLAGL
Sbjct: 1081 FVGKVLRPSSRSSSQDFCIEKSKQQLQRIDEA-KSSLAGEVSISGLKLNQLILAPKLAGL 1140
Query: 1141 LSMTRESIKLDTTGRPDESLSVEIVGSLKPNSDNSRKSKLFSFNLQRGQLRANARYQPSR 1200
LSMTRESIKLD TGRPDESLSVEIVGSLKP+SDNSRKSK FSF+LQRGQLRAN YQP R
Sbjct: 1141 LSMTRESIKLDATGRPDESLSVEIVGSLKPSSDNSRKSKFFSFSLQRGQLRANVCYQPFR 1200
Query: 1201 SAHLELRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAA 1260
SAHLELRHLPLDDLELASLRGAIQRAE+ELNLQKRRGHGVLSVL PKFSGVLGEALDI+A
Sbjct: 1201 SAHLELRHLPLDDLELASLRGAIQRAELELNLQKRRGHGVLSVLGPKFSGVLGEALDISA 1260
Query: 1261 RWSGDVITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISS 1320
RWSGDVITIEKT+LEQSNSRYELQGEYVLPGSRDRNV S+GFLKKAMASHLSSVISS
Sbjct: 1261 RWSGDVITIEKTVLEQSNSRYELQGEYVLPGSRDRNVASNGSSGFLKKAMASHLSSVISS 1320
Query: 1321 MGRWRMRLEVPRAEVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTESVQDLIE 1380
MGRWRMRLEVP AEVAEMLPLARLLSRS DPSVHSRS+DLFIQ+LQAVGLYTESVQDLIE
Sbjct: 1321 MGRWRMRLEVPSAEVAEMLPLARLLSRSTDPSVHSRSRDLFIQSLQAVGLYTESVQDLIE 1380
Query: 1381 VIRRQFILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGVYK 1440
VIRRQFILSDEIVLEDLSLPGLSELRG W GSLDASGGGNGDTMAEFDFHGEDWEWG YK
Sbjct: 1381 VIRRQFILSDEIVLEDLSLPGLSELRGRWHGSLDASGGGNGDTMAEFDFHGEDWEWGTYK 1440
Query: 1441 TQRVLAVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQ 1500
TQRVLAVGAYSN+DGLRLEKIFIQKDNAT+HADGTLFGP TNLHFAVLNFPVSL+P VQ
Sbjct: 1441 TQRVLAVGAYSNDDGLRLEKIFIQKDNATIHADGTLFGPKTNLHFAVLNFPVSLLPTVVQ 1500
Query: 1501 VIESSAKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEV 1560
V+ESSAKDLVHSLR+LVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGG+DLGRAEV
Sbjct: 1501 VVESSAKDLVHSLRKLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGIDLGRAEV 1560
Query: 1561 VASLTSGSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGTLVHAWG 1620
VASLTS SRFLFNAKFEPIIQNGHVHVQGSIPVMFVQN+M EVEE+ETDTSR TL+HAWG
Sbjct: 1561 VASLTSSSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNSMVEVEELETDTSRTTLIHAWG 1620
Query: 1621 KEKVREKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTAL 1680
KEKVR+KFNDRK+SR+RNEEGWNTQL EGLKGLNW+LLDVGEVR+DADIKDGGMLLLTAL
Sbjct: 1621 KEKVRDKFNDRKNSRERNEEGWNTQLTEGLKGLNWNLLDVGEVRVDADIKDGGMLLLTAL 1680
Query: 1681 SPHVNWLHGNADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLHVRSNRL 1740
SPHVNWLHGNADILLQVRGTIEEP+LDGSASFHRASISSPVLPKPL NFGGT+++RSNRL
Sbjct: 1681 SPHVNWLHGNADILLQVRGTIEEPVLDGSASFHRASISSPVLPKPLINFGGTVYIRSNRL 1740
Query: 1741 CINSLESRVSRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQIT 1800
CINSLESRVSRRGKLI+KGNLPLRSSEA L DKIDLKCEVLEVRAKNIFSGQVDSQMQIT
Sbjct: 1741 CINSLESRVSRRGKLIVKGNLPLRSSEASLGDKIDLKCEVLEVRAKNIFSGQVDSQMQIT 1800
Query: 1801 GSILQPNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKYASFFN 1860
GSILQPNISGNIQLSRGEAYLPHDKGSGAASFN+VV +QFSLP GSSNQV+AS FF+
Sbjct: 1801 GSILQPNISGNIQLSRGEAYLPHDKGSGAASFNRVVPNQFSLPAGSSNQVIAS---PFFS 1860
Query: 1861 SESTALKTRFRVPRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYPLILNFAV 1920
SESTALKTRF PRDK+ +IEKESRNVN+KPSVDV LSDL++VLGPELRILYPLILNFAV
Sbjct: 1861 SESTALKTRFLAPRDKSANIEKESRNVNIKPSVDVRLSDLRVVLGPELRILYPLILNFAV 1920
Query: 1921 SGELELNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLA 1980
SGELELNG AHAK I+PKGTLTFDNGDVNLLATQVRL REHLNIATFEPENGLDPMLDLA
Sbjct: 1921 SGELELNGHAHAKRIEPKGTLTFDNGDVNLLATQVRLNREHLNIATFEPENGLDPMLDLA 1980
Query: 1981 LVGSEWQIRIQSRASKWQEKLVVTSTRSVEQDALSPTEMPSVESSGKTLQADLTSRKPEL 2040
LVGSEWQIRIQSRASKWQ+KLVVTSTRSVEQDALSPTE
Sbjct: 1981 LVGSEWQIRIQSRASKWQDKLVVTSTRSVEQDALSPTE---------------------- 2040
Query: 2041 RATRAFENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTL 2100
A RAFENQLAESILE GQLAL+KLATATLEKLMPRIEGKGEFGQARWRLVYAPQIP+L
Sbjct: 2041 -AARAFENQLAESILEGDGQLALKKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPSL 2100
Query: 2101 LSFPTT-DPLLSLTSNISIGTVVEVQLGKRIQEVFKNFKQASMIRQMKETEMAMQWMITY 2160
LS +T DPL LT NIS GTVVEVQLGKRI QAS++RQMK++EMAMQW Y
Sbjct: 2101 LSVDSTNDPLKLLTGNISFGTVVEVQLGKRI--------QASIVRQMKDSEMAMQWTFMY 2153
Query: 2161 KLTSRLRMVLQSAPAQRTLLLVEYSATSLD 2185
KLTSRLRMV QSAPAQR L+LVEYSA+SLD
Sbjct: 2161 KLTSRLRMVFQSAPAQRMLVLVEYSASSLD 2153
BLAST of MELO3C003666 vs. ExPASy TrEMBL
Match:
A0A6J1IVS8 (uncharacterized protein LOC111480396 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111480396 PE=4 SV=1)
HSP 1 Score: 3410.9 bits (8843), Expect = 0.0e+00
Identity = 1752/2189 (80.04%), Postives = 1903/2189 (86.93%), Query Frame = 0
Query: 1 MNVKLDSSFLGTQLHSSLHCVTNGKFVYLGRRRLSKGDSKKYVCAKHNEWNARVERFSRF 60
MNVK SSF GT LHSSLH V NGKF+YL R RL K DSKKY CAK N+W+ARV+ FSRF
Sbjct: 1 MNVKYHSSFFGTPLHSSLHYVNNGKFIYLRRSRLLKWDSKKYTCAKKNDWDARVDGFSRF 60
Query: 61 FGQHLKSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFAAVVSGI 120
QHLKSLS+KL R+ESLMKCANEPFV TK+LSS LRPVWNEGLFLIRCSAF AVVSGI
Sbjct: 61 CVQHLKSLSMKLGTRYESLMKCANEPFVLTKTLSSFLRPVWNEGLFLIRCSAFVAVVSGI 120
Query: 121 CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDGE 180
CLLVWYGQTKAKGFVEAKLLP VCKAVSD IQRDLDFGKV SISPLSITL+SC VGPDG+
Sbjct: 121 CLLVWYGQTKAKGFVEAKLLPFVCKAVSDHIQRDLDFGKVTSISPLSITLKSCLVGPDGD 180
Query: 181 EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLERH 240
EFSCGEVPTMK+RVLPFTSLRRGRVIIDVVLSHP V+VVQKRDYTWLGLPFPSEGTL H
Sbjct: 181 EFSCGEVPTMKIRVLPFTSLRRGRVIIDVVLSHPIVLVVQKRDYTWLGLPFPSEGTLPTH 240
Query: 241 SSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVV 300
SSSE GID+RTKIRRIARE+AAA WSKDR DAAREAAE+GFVV DRS G YD+S KE +
Sbjct: 241 SSSEGGIDSRTKIRRIAREDAAARWSKDRHDAAREAAEVGFVVSDRSPGSYDSSASKEDI 300
Query: 301 GPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKIRHAKSEKYFDVKSPDTRLKFSSRAM 360
PTVD+ NSKT F DENVH R+HHCMDTDV+YKI+H+ +EKYFDVK+PD RLKF SR M
Sbjct: 301 RPTVDVENSKTSFLTDENVHLRKHHCMDTDVEYKIKHSNTEKYFDVKNPDMRLKFLSRVM 360
Query: 361 KTLIKGQSKRNASGDDVYVNSFAAKKRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNN 420
K +KGQSKR ASGDDVY+NS AKKRILRRSTLAA+ YFKGASEGKFGEPSQLH+SFN
Sbjct: 361 KVPMKGQSKRKASGDDVYINSSTAKKRILRRSTLAARGYFKGASEGKFGEPSQLHRSFNI 420
Query: 421 ANLDSYLIKSGNETNADSSITDTDVQYGKQSLDARLNSLKEKRDIDIPNHIDDQTSTVTG 480
N D+YL+KS NET+ADSSI +T+VQ G QSLDARL+S+KE+ DIDI NHIDD++ST+TG
Sbjct: 421 VNPDAYLVKSVNETDADSSIMNTNVQNGNQSLDARLHSIKEEGDIDILNHIDDKSSTITG 480
Query: 481 LGNKDRRSFSATPSIDESNMRKEDVIGSDHVPDGISDHMRNTSQTPTSTGHEHQHGTSWP 540
LGNKDRRSFS T ES+++K+DVIGSDH+ +G SD M +T QTPTST +EHQHGT+WP
Sbjct: 481 LGNKDRRSFSVTSGSHESSVKKDDVIGSDHISEGTSDQMCHTFQTPTSTIYEHQHGTTWP 540
Query: 541 NSFWGLSPESALSYFPKDVAKKLLYHISMYVQNLKSGFVQHARGVIDGGDVMKNKGTNTM 600
SF L+ +S LSYFPKDV KKLLYH+S +VQNLK VQ+ARGV+D GDV KN+GT TM
Sbjct: 541 ISFSALNQKSDLSYFPKDVGKKLLYHLSTFVQNLKFILVQYARGVVDDGDVWKNEGTETM 600
Query: 601 LPVTIDSVHFKGGTLMLLAYGDREPREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFV 660
LPVTIDSVHF+GGTLM LAYGDREPRE+ENVNGHVKFQNHYGNV VHLSGNCK+WR E V
Sbjct: 601 LPVTIDSVHFRGGTLMFLAYGDREPREIENVNGHVKFQNHYGNVRVHLSGNCKTWR-EIV 660
Query: 661 SGDGGWLSADVFVDIFEQEWHSNLKITNIFVPLFERILDIPITWSKGRATGEVHLCMSRG 720
SGDGGWLSADVFVD FEQ+WHSNLKITN+FVPLFERILDIPITWSKGRATGEVHLCMSRG
Sbjct: 661 SGDGGWLSADVFVDTFEQQWHSNLKITNLFVPLFERILDIPITWSKGRATGEVHLCMSRG 720
Query: 721 DTFPNFQGQLDVTGLAFKIFDAPSSFTEIAATLCFRGQRIFVQNASGWFGCAPLEASGDF 780
DTFPNFQGQLDVTGLAFKI APSSFTEIAA++ F GQRIFVQNASGW G EASGDF
Sbjct: 721 DTFPNFQGQLDVTGLAFKISGAPSSFTEIAASISFHGQRIFVQNASGWLGSTSFEASGDF 780
Query: 781 GINPDEGEFHLMCQVPGVEVNALMKTFKMKPFLFPLAGSVTAVFNCQGPLDSPIFVGSGM 840
GI+P++GEF L+C+V VEVNAL++TFK++PF FPLAGSVTAVFNCQGPLDSPI VG GM
Sbjct: 781 GIHPEKGEFRLICEVSCVEVNALLETFKIRPFSFPLAGSVTAVFNCQGPLDSPILVGRGM 840
Query: 841 VSRKMNNLFSDLPASCASEAIVKSKEGGAIAAVDRIPFSYVSANFTFSIDNCVADLYGIR 900
S KMN+ DLPASCASEA+VKSKE GA+ AVDR P S VSANFTF+ DNCVA+LYGIR
Sbjct: 841 FSGKMNHSILDLPASCASEAVVKSKEAGAMTAVDRFPLSNVSANFTFNFDNCVAELYGIR 900
Query: 901 ANLVDGGEIRGAGNAWICPEGELDDTAMDLNFSGNISLDKIMHRYMPGYSDWMPLKLGLL 960
ANLVDGGEIRGAGNAWICPEGELDDTAMDL FSGN+S DKI+HRYMPGY D MPLKLG+L
Sbjct: 901 ANLVDGGEIRGAGNAWICPEGELDDTAMDLKFSGNVSFDKILHRYMPGYFDPMPLKLGIL 960
Query: 961 NGETKVSGSLLRPRFNINWTAPLAEGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTS 1020
NGETKVSGS LRPR NINW APLAEGSFRDARGDINIS+DYII+NSSSVAFEL++KVQTS
Sbjct: 961 NGETKVSGSFLRPRVNINWIAPLAEGSFRDARGDINISNDYIIINSSSVAFELYTKVQTS 1020
Query: 1021 YSDKIMLDEEVFDTKRTPSFTIDGVELDLHMRGFEFLSLVSYIFESPRPMHLKATGRVKF 1080
Y+D+ ML +E FD K+TP TIDGVELDLHMRGFEFLS S IFESPRP HL+ATGRVKF
Sbjct: 1021 YADENMLGDEAFDAKKTPPCTIDGVELDLHMRGFEFLSFDS-IFESPRPTHLRATGRVKF 1080
Query: 1081 VGKVLRP----SSKDFSNEKSKHQVQPIDEENKNGLAGEVSISGLKLNQLVLAPKLAGLL 1140
VGKVL P SS+DFS EK K QVQ ID+EN N LAGEVSISGLK +QL+LAPKLAGLL
Sbjct: 1081 VGKVLSPSTGSSSQDFSIEKRKQQVQIIDDENINSLAGEVSISGLKFDQLILAPKLAGLL 1140
Query: 1141 SMTRESIKLDTTGRPDESLSVEIVGSLKPNSDNSRKSKLFSFNLQRGQLRANARYQPSRS 1200
SMTRESIKLD TGRPDESLSVEIVGSLKP+SDNS KSKLFSFNLQRGQLRANA YQP RS
Sbjct: 1141 SMTRESIKLDATGRPDESLSVEIVGSLKPSSDNSSKSKLFSFNLQRGQLRANACYQPFRS 1200
Query: 1201 AHLELRHLPLDDLELASLRGAIQRAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAAR 1260
AHLELRHLPLDDLELASLRG IQRAEIEL+LQK+RGHGVLSVL PKFSGV+GEA DIAAR
Sbjct: 1201 AHLELRHLPLDDLELASLRGEIQRAEIELDLQKKRGHGVLSVLGPKFSGVVGEAFDIAAR 1260
Query: 1261 WSGDVITIEKTILEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSM 1320
WSGDVIT+EKTILEQSNSRYELQGE VL GS DRNVT KES+ FLKKAMA HLSSVISSM
Sbjct: 1261 WSGDVITLEKTILEQSNSRYELQGECVLLGSPDRNVTGKESSNFLKKAMALHLSSVISSM 1320
Query: 1321 GRWRMRLEVPRAEVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTESVQDLIEV 1380
GRWRMRLEVPRAEVAEMLPLARLLSR DPSVHSRSKDLFIQ+LQAVGL TESVQDLIEV
Sbjct: 1321 GRWRMRLEVPRAEVAEMLPLARLLSRCTDPSVHSRSKDLFIQSLQAVGLSTESVQDLIEV 1380
Query: 1381 IRRQFILSDEIVLEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGVYKT 1440
IRRQFILS+EIVLEDLSLPGLSELRGC RGSLDASGGGN DTMAEFD GEDWEWG K
Sbjct: 1381 IRRQFILSEEIVLEDLSLPGLSELRGCLRGSLDASGGGNEDTMAEFDIRGEDWEWGTNKM 1440
Query: 1441 QRVLAVGAYSNNDGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQV 1500
QRVL VGAYSNNDGLRLE FIQKDNAT+HADGTLFGP+++LHFAVLN PV LVP QV
Sbjct: 1441 QRVLTVGAYSNNDGLRLENFFIQKDNATIHADGTLFGPLSSLHFAVLNCPVGLVPTVAQV 1500
Query: 1501 IESSAKDLVHSLRQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVV 1560
IESSAKDLVHSLRQL+API+GIL+MEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAE V
Sbjct: 1501 IESSAKDLVHSLRQLLAPIKGILYMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEAV 1560
Query: 1561 ASLTSGSRFLFNAKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGTLVHAWGK 1620
ASLTS SRFLFNAKFEPI QNGHVHVQGSIPVMFVQN+M EVEE+ETD+SR TL+H+WGK
Sbjct: 1561 ASLTSSSRFLFNAKFEPIFQNGHVHVQGSIPVMFVQNSMAEVEELETDSSRATLIHSWGK 1620
Query: 1621 EKVREKFNDRKSSRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALS 1680
E+V +KFNDRK+SR++NE+ W TQL EGLKGLN +LLDVGEVR DADIKDGGMLLLTALS
Sbjct: 1621 ERVMDKFNDRKNSREKNED-WTTQLTEGLKGLNSNLLDVGEVRFDADIKDGGMLLLTALS 1680
Query: 1681 PHVNWLHGNADILLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLHVRSNRLC 1740
PHVNWLHGNADILL+V GTIEEP+ DGSA+FH AS+SSPV PKPL N GG +HVRSNRLC
Sbjct: 1681 PHVNWLHGNADILLKVSGTIEEPVFDGSATFHWASVSSPVFPKPLVNSGGMIHVRSNRLC 1740
Query: 1741 INSLESRVSRRGKLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITG 1800
+SLE RVSR+GKL +KGNLPLRSSEA L DKIDLKCE LEVRAKNIFSGQVDSQMQITG
Sbjct: 1741 FDSLECRVSRKGKLTVKGNLPLRSSEASLSDKIDLKCEALEVRAKNIFSGQVDSQMQITG 1800
Query: 1801 SILQPNISGNIQLSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKYASFFNS 1860
SILQP ISGNIQLSRGEAYLPHDKGSGAAS NKV+ DQ FF+
Sbjct: 1801 SILQPYISGNIQLSRGEAYLPHDKGSGAASSNKVLPDQ------------------FFSP 1860
Query: 1861 ESTALKTRFRVPRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYPLILNFAVS 1920
ESTALKTRF PRDK+ + EK SRNVN+KP V+V LSDLKLVLGPELRILYPLILNFAVS
Sbjct: 1861 ESTALKTRFHPPRDKSAETEKASRNVNIKPRVNVCLSDLKLVLGPELRILYPLILNFAVS 1920
Query: 1921 GELELNGFAHAKSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLAL 1980
GELELNG AH+K I+PKG LTFDNGDVNLLATQVRLKREH NIA FEPENGLDPMLDLAL
Sbjct: 1921 GELELNGCAHSKRIQPKGILTFDNGDVNLLATQVRLKREHRNIAAFEPENGLDPMLDLAL 1980
Query: 1981 VGSEWQIRIQSRASKWQEKLVVTSTRSVEQDALSPTEMPSVESSGKTLQADLTSRKPELR 2040
VGSEWQI+IQSRASKWQ+ LVV S RSVE+ ALSPTE
Sbjct: 1981 VGSEWQIKIQSRASKWQDNLVVMSIRSVERGALSPTE----------------------- 2040
Query: 2041 ATRAFENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTLL 2100
ATRAFENQLA+SILES GQLAL KLA +TLEKLMPRIEGKGEFG+ARWRLVYAPQIP++L
Sbjct: 2041 ATRAFENQLAKSILESNGQLALNKLAASTLEKLMPRIEGKGEFGEARWRLVYAPQIPSVL 2100
Query: 2101 SFP-TTDPLLSLTSNISIGTVVEVQLGKRIQEVFKNFKQASMIRQMKETEMAMQWMITYK 2160
SFP TTDP S++S GTVVEVQLGKRI QAS++RQM+E+EM MQW +TYK
Sbjct: 2101 SFPTTTDPF----SSLSFGTVVEVQLGKRI--------QASVVRQMRESEMGMQWTLTYK 2133
Query: 2161 LTSRLRMVLQSAPAQRTLLLVEYSATSLD 2185
L S LR+V QSAPAQRTL+LVEY A SLD
Sbjct: 2161 LRSGLRLVFQSAPAQRTLVLVEYCAASLD 2133
BLAST of MELO3C003666 vs. TAIR 10
Match:
AT2G25660.1 (embryo defective 2410 )
HSP 1 Score: 2555.4 bits (6622), Expect = 0.0e+00
Identity = 1349/2238 (60.28%), Postives = 1661/2238 (74.22%), Query Frame = 0
Query: 1 MNVKLDSSFLGTQLHSSLHCVTNGKFVYLGRRRLSKGDSKKYVCAKHNEWNARVERFSRF 60
M+++L + FL T L LH N + + R + + Y K N+W A+V +FS+F
Sbjct: 1 MSLRLQNPFLSTPL---LHGSFNRREKRINVARRAFRSKRIYSEKKQNDWLAKVAKFSQF 60
Query: 61 FGQHLKSLSIKLKPRHESLMKCANEPFVQTKSLSSLLRPVWNEGLFLIRCSAFAAVVSGI 120
G++++ L L R +KC EPFV++K L L PVW EGLF +RCS F AV+SG+
Sbjct: 61 CGKNVQLLRKSLDSRSRMEVKCLKEPFVRSKDLVRSLAPVWEEGLFFLRCSVFFAVISGV 120
Query: 121 CLLVWYGQTKAKGFVEAKLLPSVCKAVSDCIQRDLDFGKVRSISPLSITLESCSVGPDGE 180
CLLVWYGQ KA+ FVE KLLPSVC +S+ IQR++DFGKVR +SPL ITLE+ S+GP GE
Sbjct: 121 CLLVWYGQNKARVFVETKLLPSVCSVLSETIQREVDFGKVRRVSPLCITLEASSIGPHGE 180
Query: 181 EFSCGEVPTMKLRVLPFTSLRRGRVIIDVVLSHPSVVVVQKRDYTWLGLPFPSEGTLERH 240
EFSCGEVPTMK+ V PF SLRRG++++D +LS+P+V+V QK+D+TWLG+P S+ TL H
Sbjct: 181 EFSCGEVPTMKVCVRPFASLRRGKIVVDAILSNPTVLVAQKKDFTWLGIPL-SDTTLPSH 240
Query: 241 SSSEEGIDNRTKIRRIARENAAALWSKDRDDAAREAAEMGFVVFDRSSGLYDTSDYKEVV 300
SSEEGID RTK RR++RE A W ++RD+ AR+AAE+G++V ++ + K
Sbjct: 241 LSSEEGIDFRTKTRRVSREEAGIRWDEERDNDARKAAEIGYIVPCKNYSQAKDNAVKHDR 300
Query: 301 GPTVDIGNSKTFFFKDENVHSREHHCMDTDVDYKIRHAKSEKYFDVKSPDTRLKFSSRAM 360
T +I N +F DE +HS E HCMD V+Y ++HA+ EK F +K P + LKF S+ +
Sbjct: 301 RFT-EIANPNSFICMDEKMHSAEQHCMDPGVEYDVKHAELEKSFGIKIPGSGLKFLSKML 360
Query: 361 KTLIKGQSKRNASGDDVYVNSFAAKKRILRRSTLAAQDYFKGASEGKFGEPSQLHKSFNN 420
K K + K N+ +++ +AKKRIL RS AA YF S+ K EPS L +++
Sbjct: 361 KVPRKYKFKWNSKSHKNSMSNISAKKRILERSASAALSYFHSLSQQKLDEPSVLSTNYDG 420
Query: 421 ANLDSYLIKSGNETNADSSITDTDVQYGKQS----LDARLNSLKEKRDIDIPNHIDDQTS 480
+LD L+K E S+ D V YG+QS LD + ++ KR + +
Sbjct: 421 LSLDMLLVKGDREI---SNQYDRHVPYGEQSLANDLDGKGYRVRGKRLLGVKKASTLDKF 480
Query: 481 TVT---GLGNKDR----RSFSATPSIDE-SNMRKEDVIGSDHVPDGISDHMRNTSQTP-- 540
TV+ L DR +PS+++ N + + + S ++ +NT P
Sbjct: 481 TVSCDPFLMTVDRLCALLQTKRSPSVEDIVNSSESETLSSQRGDISMNVVNQNTDDVPHG 540
Query: 541 ----------TSTGHEHQ-----HGTSWPNSFWGLSPESALSYFPKDVAKKL-------L 600
T HEHQ SWP + + A+ +KKL
Sbjct: 541 NRSGNQPRDFTFKKHEHQPVANHWRPSWPRN---KKLKEAVFNILTGSSKKLTGRADPNA 600
Query: 601 YHISMYVQNLKSGFVQHARGVIDGGDVMKNKGTNTMLPVTIDSVHFKGGTLMLLAYGDRE 660
H+S ++ L + +V+ LPV +DSV FKGGTL+LLAYGD E
Sbjct: 601 PHLSDELEKLPAVYVEKT------------------LPVMLDSVQFKGGTLLLLAYGDTE 660
Query: 661 PREMENVNGHVKFQNHYGNVHVHLSGNCKSWRSEFVSGDGGWLSADVFVDIFEQEWHSNL 720
PREM NV+GHVKFQNHYG V+V L GNC WRS+ S DGG LS DVFVD EQ WH+NL
Sbjct: 661 PREMRNVHGHVKFQNHYGRVYVQLGGNCNMWRSDVTSEDGGLLSVDVFVDTVEQNWHANL 720
Query: 721 KITNIFVPLFERILDIPITWSKGRATGEVHLCMSRGDTFPNFQGQLDVTGLAFKIFDAPS 780
+ N FVP+FERIL+IPI WSKGRATGEVHLCMSRG++FPN GQLDVTGL F I DAPS
Sbjct: 721 NVANFFVPIFERILEIPIEWSKGRATGEVHLCMSRGESFPNLHGQLDVTGLGFHINDAPS 780
Query: 781 SFTEIAATLCFRGQRIFVQNASGWFGCAPLEASGDFGINPDEGEFHLMCQVPGVEVNALM 840
SF++++A+L FRGQRIF+ NA+GWFG PLEASGDFGI+PDEGEFHLMCQVP VE+NALM
Sbjct: 781 SFSDVSASLSFRGQRIFLHNANGWFGKVPLEASGDFGIHPDEGEFHLMCQVPYVEINALM 840
Query: 841 KTFKMKPFLFPLAGSVTAVFNCQGPLDSPIFVGSGMVSRKMNNLFSDLPASCASEAIVKS 900
KTFKMKP FPLAGSVTAVFNCQGPLD+P+FVGS MVSRK+ L DLP S A EA++K+
Sbjct: 841 KTFKMKPLFFPLAGSVTAVFNCQGPLDAPVFVGSCMVSRKIAYLSPDLPTSLAYEAMLKN 900
Query: 901 KEGGAIAAVDRIPFSYVSANFTFSIDNCVADLYGIRANLVDGGEIRGAGNAWICPEGELD 960
KE GA+AA DR+PFSY+SANFTF+ DNCVADLYGIRA LVDGGEIRGAGNAWICPEGE+D
Sbjct: 901 KEAGAVAAFDRVPFSYLSANFTFNTDNCVADLYGIRATLVDGGEIRGAGNAWICPEGEVD 960
Query: 961 DTAMDLNFSGNISLDKIMHRYMPGYSDWMPLKLGLLNGETKVSGSLLRPRFNINWTAPLA 1020
DTA+D+NFSGNIS DK++HRYMP Y + LKLG L GETK+SG+LL+PRF+I W AP A
Sbjct: 961 DTALDVNFSGNISFDKVLHRYMPEYFNIGMLKLGDLTGETKLSGALLKPRFDIKWAAPKA 1020
Query: 1021 EGSFRDARGDINISHDYIIVNSSSVAFELFSKVQTSYSDKIMLDEEVFDTKRTPSFTIDG 1080
+GS DARGDI ISHD IIVNSSSVAF+LF+K+ TSY D + ++ + P F ++G
Sbjct: 1021 DGSLTDARGDIVISHDNIIVNSSSVAFDLFTKLDTSYHDPCLSHQDFTQGEAMP-FVVEG 1080
Query: 1081 VELDLHMRGFEFLSLV-SYIFESPRPMHLKATGRVKFVGKVLRPSSK---DFSNEKSKHQ 1140
++LDL MRGFEF SLV SY F+SPRP HLKATGR+KF+GK+ R S+ D ++K +
Sbjct: 1081 LDLDLRMRGFEFFSLVSSYPFDSPRPTHLKATGRIKFLGKIKRHSTTKDGDVGSDKCE-- 1140
Query: 1141 VQPIDEENKNGLAGEVSISGLKLNQLVLAPKLAGLLSMTRESIKLDTTGRPDESLSVEIV 1200
D + L G++SIS LKLNQL+LAP+L+G LS++R+ +KLD GRPDESL+++ +
Sbjct: 1141 ----DAAAISSLDGDISISSLKLNQLILAPQLSGRLSVSRDHVKLDAAGRPDESLTLDFI 1200
Query: 1201 GSLKPNSD-NSRKSKLFSFNLQRGQLRANARYQPSRSAHLELRHLPLDDLELASLRGAIQ 1260
G L+PNSD N + KL SF+LQ+GQLRANA +QP +SA LE+R+ PLD+LELASLRG IQ
Sbjct: 1201 GPLQPNSDENVQSGKLLSFSLQKGQLRANACFQPQQSATLEIRNFPLDELELASLRGLIQ 1260
Query: 1261 RAEIELNLQKRRGHGVLSVLDPKFSGVLGEALDIAARWSGDV-----------ITIEKTI 1320
+AEI+LNLQKRRGHG+LSV+ PKFSGVLGEALD+A RWSGDV IT+EKTI
Sbjct: 1261 KAEIQLNLQKRRGHGLLSVIRPKFSGVLGEALDVAVRWSGDVCFMLSGRLEVMITVEKTI 1320
Query: 1321 LEQSNSRYELQGEYVLPGSRDRNVTDKESTGFLKKAMASHLSSVISSMGRWRMRLEVPRA 1380
LEQSNSRYELQGEYVLPGSRDR++ KE+ FL +AM HL SVISSMGRWRMRLEVP+A
Sbjct: 1321 LEQSNSRYELQGEYVLPGSRDRDLGQKEAGSFLMRAMTGHLGSVISSMGRWRMRLEVPKA 1380
Query: 1381 EVAEMLPLARLLSRSADPSVHSRSKDLFIQNLQAVGLYTESVQDLIEVIRRQFILSDEIV 1440
EVAEMLPLARLLSRS DP+VHSRSKDLFIQ++Q + L E+++DL+E IR + E+V
Sbjct: 1381 EVAEMLPLARLLSRSTDPAVHSRSKDLFIQSVQNLCLQAENLRDLLEEIRGYYTPPSEVV 1440
Query: 1441 LEDLSLPGLSELRGCWRGSLDASGGGNGDTMAEFDFHGEDWEWGVYKTQRVLAVGAYSNN 1500
LEDLSLPGL+EL+G W GSLDASGGGNGDT+AEFDFHG+DWEWG YKTQRVLA G+Y+N+
Sbjct: 1441 LEDLSLPGLAELKGHWHGSLDASGGGNGDTLAEFDFHGDDWEWGTYKTQRVLATGSYNND 1500
Query: 1501 DGLRLEKIFIQKDNATVHADGTLFGPITNLHFAVLNFPVSLVPAAVQVIESSAKDLVHSL 1560
DGLRL+++ IQK NAT+HADGTL GP TNLHFAVLNFPVSL+P V+V+ESSA D+VHSL
Sbjct: 1501 DGLRLKEMLIQKGNATLHADGTLLGPKTNLHFAVLNFPVSLIPTLVEVVESSATDIVHSL 1560
Query: 1561 RQLVAPIRGILHMEGDLRGNLAKPECDVQVRLLDGAIGGVDLGRAEVVASLTSGSRFLFN 1620
R+L++PI+GILHMEGDLRG+L KPECDVQVRLLDGA+GG+DLGRAEV ASLTS SRFLFN
Sbjct: 1561 RKLLSPIKGILHMEGDLRGSLEKPECDVQVRLLDGAVGGIDLGRAEVFASLTSNSRFLFN 1620
Query: 1621 AKFEPIIQNGHVHVQGSIPVMFVQNNMGEVEEVETDTSRGTLVHAWGKEKVREKFNDRKS 1680
+ FEP +QNGHVH+QGS+PV F Q NM E E ETD + +W KEK + +++++
Sbjct: 1621 SNFEPFVQNGHVHIQGSVPVSFSQKNMSEGEVSETDRGGAVKIPSWAKEK---EDDEKRT 1680
Query: 1681 SRDRNEEGWNTQLAEGLKGLNWSLLDVGEVRIDADIKDGGMLLLTALSPHVNWLHGNADI 1740
SRDR+EE W++QLAE LKGL W++LD GEVR++ADIKDGGM LLTA+SP+ NWL GNADI
Sbjct: 1681 SRDRSEERWDSQLAESLKGLYWNILDAGEVRLEADIKDGGMTLLTAISPYANWLQGNADI 1740
Query: 1741 LLQVRGTIEEPILDGSASFHRASISSPVLPKPLTNFGGTLHVRSNRLCINSLESRVSRRG 1800
LQV GT++ P+LDGSASFHRASISSPVL KPLTNFGGTLHV+SNRLCI SLESRVSR+G
Sbjct: 1741 RLQVGGTVDHPVLDGSASFHRASISSPVLRKPLTNFGGTLHVKSNRLCITSLESRVSRKG 1800
Query: 1801 KLILKGNLPLRSSEACLDDKIDLKCEVLEVRAKNIFSGQVDSQMQITGSILQPNISGNIQ 1860
KL++KGNLPLRS+EA D I+LKCEVLEVRAKN S QVD+Q+QITGS+LQP ISGNI+
Sbjct: 1801 KLVVKGNLPLRSNEASAGDGIELKCEVLEVRAKNFLSCQVDTQLQITGSMLQPTISGNIK 1860
Query: 1861 LSRGEAYLPHDKGSGAASFNKVVSDQFSLPPGSSNQVVASKY-ASFFNSESTALKTRFRV 1920
LS+GEAYLPHDKG GAA N++ ++Q+S+P + NQ V+S+Y A FF +E + +F
Sbjct: 1861 LSQGEAYLPHDKGGGAAPLNRLAANQYSIPGAAINQAVSSRYFARFFGTERASSGMKFSQ 1920
Query: 1921 PRDKAVDIEKESRNVNVKPSVDVSLSDLKLVLGPELRILYPLILNFAVSGELELNGFAHA 1980
K+ +EKE V +KP++D+ LSD+KLVLGPELRI+YPLILNFAVSGELEL+G AH
Sbjct: 1921 STGKSNSVEKEIEEVKMKPNMDIRLSDMKLVLGPELRIMYPLILNFAVSGELELDGMAHP 1980
Query: 1981 KSIKPKGTLTFDNGDVNLLATQVRLKREHLNIATFEPENGLDPMLDLALVGSEWQIRIQS 2040
K IKPKG LTF+NGDVNL+ATQVRLKREHLN+A FEPE+GLDP+LDLALVGSEWQ R+QS
Sbjct: 1981 KFIKPKGVLTFENGDVNLVATQVRLKREHLNVAKFEPEHGLDPLLDLALVGSEWQFRVQS 2040
Query: 2041 RASKWQEKLVVTSTRSVEQDALSPTEMPSVESSGKTLQADLTSRKPELRATRAFENQLAE 2100
RAS WQ+KLVVTSTRSVEQDALSP+E A + FE+QLAE
Sbjct: 2041 RASNWQDKLVVTSTRSVEQDALSPSE-----------------------AAKVFESQLAE 2100
Query: 2101 SILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTLLSF-PTTDPLLS 2160
SILE GQLA +KLATATL +MPRIEGKGEFGQARWRLVYAPQIP+LLS PT DPL S
Sbjct: 2101 SILEGDGQLAFKKLATATLGTIMPRIEGKGEFGQARWRLVYAPQIPSLLSVDPTVDPLKS 2160
Query: 2161 LTSNISIGTVVEVQLGKRIQEVFKNFKQASMIRQMKETEMAMQWMITYKLTSRLRMVLQS 2185
L SNIS GT VEVQLGKR+ QAS++RQMK++EMAMQW + Y+LTSRLR++LQS
Sbjct: 2161 LASNISFGTEVEVQLGKRL--------QASVVRQMKDSEMAMQWTLIYQLTSRLRVLLQS 2166
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_008466365.1 | 0.0e+00 | 98.58 | PREDICTED: uncharacterized protein LOC103503795 [Cucumis melo] | [more] |
XP_011652500.1 | 0.0e+00 | 95.51 | protein TIC236, chloroplastic [Cucumis sativus] | [more] |
KAE8651368.1 | 0.0e+00 | 94.47 | hypothetical protein Csa_001268 [Cucumis sativus] | [more] |
XP_038897772.1 | 0.0e+00 | 90.55 | protein TIC236, chloroplastic [Benincasa hispida] >XP_038897773.1 protein TIC236... | [more] |
XP_022936094.1 | 0.0e+00 | 88.22 | uncharacterized protein LOC111442799 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
F4ISL7 | 0.0e+00 | 60.28 | Protein TIC236, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=TIC236 PE=1 SV=... | [more] |
W0RYD3 | 0.0e+00 | 55.56 | Protein SUBSTANDARD STARCH GRAIN 4, chloroplastic OS=Oryza sativa subsp. japonic... | [more] |
Match Name | E-value | Identity | Description | |
A0A1S3CR23 | 0.0e+00 | 98.58 | uncharacterized protein LOC103503795 OS=Cucumis melo OX=3656 GN=LOC103503795 PE=... | [more] |
A0A6J1F7B9 | 0.0e+00 | 88.22 | uncharacterized protein LOC111442799 OS=Cucurbita moschata OX=3662 GN=LOC1114427... | [more] |
A0A6J1III6 | 0.0e+00 | 87.95 | uncharacterized protein LOC111476591 OS=Cucurbita maxima OX=3661 GN=LOC111476591... | [more] |
A0A6J1BWB5 | 0.0e+00 | 84.93 | uncharacterized protein LOC111006106 OS=Momordica charantia OX=3673 GN=LOC111006... | [more] |
A0A6J1IVS8 | 0.0e+00 | 80.04 | uncharacterized protein LOC111480396 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
Match Name | E-value | Identity | Description | |
AT2G25660.1 | 0.0e+00 | 60.28 | embryo defective 2410 | [more] |