Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCGGCTATGGAGAAGCTATTCGTTCAGATCTTCGAAAGGAAGAAGTGGATCATCGACCAGGCCAAGCAGCAGACCGATCTCTTCGACCAACACCTCGCTTCCAAGCTCATTATCGATGGAATTGTTCCTCCTCCTTGGCTTCACTCCTCTTTTCTTCATTCCCACATTTCCCATTTCGAAGGTAATTTCCTACTCTTTTCAATCTTCCCCATTTCTCATGCTTCATGGTTAGTTAGGTTGAAGTAGGAATTTCCCTCTTTCGCCACATTTTTTTGCTCATTTGGTTTCTCGTAGTTGCAGAGGTGAACAAAGGTTTTATTTCTGGCGTTGAGTTCCCACGTTCGCCGCTTGATACCCATCGTTCTAGTTTGAATGAAGCATTTGTTGAAGACAGTGGGGAGGAGTTGGAGCACAGGTCGACTGAAGAAGCTGGTTCCTTAAACGATGATTTTGATGCAGCAAATAGGCCAGCAATTTCACCCCAGTGTGATATAAGTAGTGCCGGTGTCTTAAATTGCGCGCCTTGTATTGAAATGACTCCTGTTTCTCCTCATGGTCGAGGAGGCATAGTCTCAGACAATTACCGGGATCCTACTCTGTCATTGGCTCGGTTACACAGATCTAAATCTAGGCAAAAGGCTTTAGAGTTGCGTAATAGTGTGAAATCTACAAGGTGCCAATCTCGGTGTGAGAACAAGAGTGATTCCCTTGCTGGTGGGATTGTAGGATCTGCTATTGGTTTACTGCAAGCTGATCACGAAGATGAATCAGGGTTGGCAAAGGCTTCCAGTAGCTGTAACGGAATTGGTTCTCTAGAAGAAGAATCTAATGTTGGTTGTGAGCAGAAGGATAGCTCTATTGGCTCGGATAAAGTTGGAGTAGTTGTAAGCCCTGGGTTGCAAAGTAGATTTATTGATGTGGACAATTCTTTAAACATTTCCTCTAAAAATGAAGAGTTATGTATAGCTGGAGGTTCAACACAGAATTCTTATCAAGTAAATGAGCAATTTGACTCACCTAGACCTTCTTCGGGAAAGATTGAAGAAGAGTCCGCATATTGCAGGAGCCAGGAATATAGTTCTGATAAACCTGAAAAGTGTAGGTTGCAATGTAGCTCTTTGGATGCAAATGAGACTTCATGCATTTCCCCGGAAGATGGAAGAGCAGGTCCTATAGGAGGTTCAAAATTCCATTCTGATCAAGTGGACGAGCAATTGGACTTGCCTAAACCTTCTTCTGACAATGTTGAGTGTAATGAAAAGGCTGTATTAGGACATTGCAGGAGCCATGACTATGATCTTGATAACGCTCTACAGTCTGAGTCACAACAAAGGTCCCAGGAAATGGATGATTCATCACGCATTGACGCCAGTGATGGAAGATTGTTGGACTTGTATAACCCTTCTTCTGGAAAAGTTGAATGCTGTGAAGAAACTATTTCAGGACATTGCAGGAGCAAGGAATGTAATTTCGAAATTGCCCAACAGTTTGGGTCGCAATACAGCTCCCAGGATGCGGATAATTCTTCATATGTTGATGAGGTGGGAGGATCCTGTCCCATTGGAAGTTCAAAAGTGCACCCTCATGAAGTGAAAGAGCAATTGGACTTATCTAAATCTTCTTTCGACAATATCGAGTGCTGTGAAGAAAAAATATTAGGTGATTTGAGTAATCAGGAGTATAAACTTAATAATCCTCAAAAGTCTGGGATGCAACATAACTCCCTGGATGGGGACAATTCATCATGCTTTTCTTCTGTAAATGGAACTTTTTGCCCTGTTGGAAGTTCGAAGCAACATTCTGATCAAGTGATTGAGCGGTTGGAGTTGTTTAGACCTTCTTCTGTCAATAGTGAATGTCATGAAGAAGAACTAGAAGACTGTAGGACCCAGGACTGTAATTTTGATAATAATGCGGAACAGTTTGATGTTGACAAAAAATTCAGTTCACCAATAACGGAGGTAAGGGAGAATACATCCGATAAGAAGCCCTCCAGTTTTCTAGATGACAAGAGAGATGTCAGTGAAAATGAAAAATGCAATTCACTCCTTCACATACCTTTGCCACAGATTCAGGTAGACTCAGTGAAGGAAAACGAATCTGATAAAGGTGCATCTGAATCTCACAGTGAGAGGAGATATGAAGACACAGGAGATTTTAATGGAAATACTCTCTCATCTGGCAACAAGTCACTGCAAGGTTATGAAGAAGTAAATACTTGTTCTTTGCTGCAAAGTGATGAACCTGCTGAAAAAAACGTTTCTTGGAAAGATGGAGTATCAGATTTGCAGAATTCCCATGACAATGTAGTTGAAATTCCACCAGTGGATGCAAACGGTGCATCAGTTCCTATAAAAGATACAGAAACATTTAGAGATCATGTAGTAATGGTTCCTTGTGTTCCTTACGTTGGTGAAACAGATGGTTATTTGGAGCAGCAACTGAAAAGTGCAGGCATATCTCAGTGCGCAGATTCAGATTCCTTTGAGTATTGCACTGACGACTTTAATGGTAACCATCATTACTTATCAACAGAGTGCCAGATTGCAGAAACATCAATTGAGTTAAAAACTTTCAGCGCACTTACGAAGGCATCTAGTTCTCCTGAAGACGTGAGAAGGGTTGAGCCAGAATTGGGGATTGGTATTCCTGGATCTTTAGGCTTGGGGAGTGAGCAACTTCAAATTATCAACGGGAGTCCCACAGATAAAATGTTGATGCAGGAATTTGACACTGAAAAACCTGTCCTTGAATTTCAACGATTATCATTTTGTGAAGAAGGTTACCAACAATCAAATGTGAGCACTGTCCCTATTGAAATGTTGCTTTTGGAAAAGGAAGCTCATTCAATGCAATTGTCTGATTCTTCACCCACGCTTCCAGTTAAAGAGGTATACATTTATTACAACGGGAAGAATTTTCAAACAGGCCTTCACAGGTTGACTATTGAAACAAACATCGTTCATGCATAGTCTATAAAATATTTTAAAATAAACTACAAAATCACGATGCAGACCCAAAATTAGTTATCATCACCAAATAGGTTCTTTATCACCTCCAATTTTCTCGTCATTAGACTTATGGCCTTTCTTTTTTTTTCCCACCATAAAACTTGTAAGTTTCCATATTTTCCACCATTTGACTTTACAGCCTTCTGTATTTCCTTCTTTGTTTAGTTGTTTTTATGTTCCTCTACATCTGTAAAAGAAAAAATAGGGTGTGTTGGGTTAGTTAAACAAATGAGAAACAACTTCTTTCTAATTTTCCTTTCTTTTACGTAATTTAATCATAATTTATTTCAAGCACCTAAAAGAGAAGTGCATGCTCACAGGGGGAGGTGGCTTACTTATTTTAATTGTGAAGCGCACTTAAGGACACACTTCTGTGGATTAAAACTATTGTTTTGGTGTACTTAGGAAGTTAAGATCAAGCGATTTAACTCTCTATTATTTCAAGTATATTGATCTGTTGTTATTGAAACATGGGTATAAAAATAAGCGTACCTTCATTACAAAAGTACCTGTGAAGGAGTATGAAGATAATTCTTCCAATTCACTTAACAAAAGTTGTTTTTCTTCTTTTTAATATGTTGTTTTCTGCATGCTGTTTTTGAAAGGATGGACCTAGTTAGGAAGGGGGTGATGGTAGTCATACTGTTTGTGCTTTTAAGAGGGTGTAGTGTTCACTAAATGCTGGCCTTATCATTTTCTGACAAATTTCTAGGCAGGGGATTTGATTGATTTTTTTCCTCCTTTTTTACTTTTTTTATTAACTTCTCTTCGTGATCTTGTTTCTTTGTGCATGATTGACAGGATCTCTCTAGGTTCAGAAGCAATAACAGAGGCACGCTGTTGCAAAATGTCATGCTAGAGAGCCAAAGTTTGGATCCTGAAGAAAATCTTCAGTCTGGAGAAGATAACAAACTTCCTGTTGATACCGGGAAGACGGAAAGAGAGGAGGATAAGGGGAAACTTACTTCTTGCTCACTTCTTACTCCCCTAATCCAAACTTCTCATTATCTTGGTGACGGCAAGGATATGCCTGCATTAGAGGGGTTCCTAATGCAGTCTGATGCTGAACAACCATGCCTATCTGTTCGTGGAATCAACCTTGACACATTAGAACTTTCGAAGTGTATGATAGAACGCGCTAGCATATTGGAGAAAATTTGTAAATCTGCTTGTATAAACAGTCCATTATCCTCATCTTCAGAAAGTCTTAAGTTGAACAAGGTGGCAGATTTGTACCATTCCCTTTCTAATGGTCTGTTAGAGAGCGTGAACTTGAAGAGTAACTTTCTGATGAATGATCAAAATAAGCTACTGAAGGATGGTAGTAACTTCTTGAATGGAGAAGTCAACTGTTCTCCTCATGGGTCTTTTTCTGCTTGCCTGAAAAGCATTGGCAGTCATTCAGCTAGCGATGTTAGGAGGCTGTTTGTATCCCCCTTTAGTAAGTTGTTGGATAGAAATTCATTAAATTCCTCAAGTTCAGGAAAACGAAGCAGCCCGAATATAGAGCTTCCTTGCATTAGTGAAGAAGCTGAGAGTACAGAGGAGACTGATAATGAGTTTGCAAAGGATATGAAATCGAACAAGCGAGTACCACTTGTTGACATTACAGAAAATTCAAATGTTCTGGTAACAGTTTCTGAAGCTATCATGTGTGCTGATAGATTAAGTTTAGAATCTTTAAACACGGAACTCAGCAACACGGGGACTCATAATAGAACCAAAGAGAATCTGGCAAACCAGAAAAAGAGTAAAAGGAAATATTTGAATGAGGCTGTAGATCTTGATATCTTTCCAGGAGCAAACAGAGCTAAAAGAGTCACTAGGTCATCTTATAATAGATTTAGCAGGACAGATTTATCCTGTAAAGAAAATTTCAGAAAAGAAGGCTCTCGATTCTCTGGAAAGGAGACCAAGCATAAAAATATCGTGTCCAATATTACTTCTTTTATTCCTCTTGTCCAACAAAGAGAAGCTGCAACTATTTTGAAAGGTCTGTATATATATTTTTAACTATATCTGCATTCAATATTAACATACTTGGATTGATTATGAATCTTTAACATTTATTGGATATTGCCATGTTAAATATCTCTAGAATTTTATGTCTCCATTACTTGCTACTACTGAATTGAGAAAATGTGTAATGAAGAATGATTTTCATGTATGATATCTAGAAGAATCCCACGACAAAGTCAAGTCCTAGGCTCTCCAACCCATAACACGCCCCTTATTGCTCCACTCTCTTATTCCCCTGTTATCATCCTCCCTTTTTGCACGTCCTATTTTGTTATGGTTATTTTTGGAAATTTATTTTGGCTTCTTCTGTCAGTGTTACCATGTTTGGCGGTTGTGGCCTTGTTGATTGATGTTTGCCAAGTTAGATGTTCCTAGCTTGGTTTTGTCCTACGTTTTGCTTTGAGCTTGATTAAAGCCATCCCTTTTGGTATGAAGGCCTTGTTGATTGATGTTTGCCAAGTTAGATGTTCCTAGCTTGGTTTTGTCCTACGTTTTGCTTTGAGCTTGATTAAAGTCATCCCTTTTGGTATGAAGTTTACGAAGATGGACAGCTTGGTGGATGCTATGGTTTTCTAGCTTCTTCAAAAGCCTGATGTGTTGAGATTTGTTTGTAATTCTTCATTTGTTCATGGTTTATGTTGGTCACACAATCTTTCAATCTTGAGACCTTTGGATACCTCTTAACAAGAACACTCAAGAACAAAAAACAAACAAAGAACACTAAAAGGTAGATTTATATTGCAATAGCAGATCAGATTAACATAGCCTTTCGAGATAACTACTCTCCTAGTATTTCCTACAGGAAATTACCCACAAAATTCTTCACTCCTTCCTCTCAACCCAGTACAGTACCCCCACACTCACTATTTCTAACTAAAAGCCCTCTGTCTAATTACTAATATTCCCCTTCTAATAAGTTGTAACCATACCAATATTTACTAATAATCCTACTAAATCCATAATTAGGGTCTTACAGATTCCAAATCTTGAATTGGTTGTTTTAGATTTATGATTGATTGATGTTTATCCCCTTATCTTATAAAGAATTCGGTTTTGATTTGGCTGGTCTTAAGGCTTCATCTTGGTGCTCCGTTCCCAGATCTTATGTTAGCTTCTTTTTACAGGATATATTTCTTATTGAATGCCTATATCTTTTTGTTGCATTCTCGTTCCATTGTTTATTGCTTTCCTTGTTTACTCCCTTGTGGAATTTGTATACTTTGAGCATGATCTCTTTTTATTATATCAATGTATATGTCCGTATCTTGTTAAAAAGAAATATAAAAAGAAAAGGAAAGAAAAGAAGCATGCACATTAAAACATCAAGTCAACCATAATTGCTAATAGCTAACTAATTATTTCCCATAACTTGGACGCCTTTACTTTTCTTTGTTCTCCTTGTTTGTTAAACAGAATGGGCCTATCAAAATTTTATTTTGGTGTGGCATTGAACTCTATCTATTATTTGGGAGTACAGAGTTTGACTATCGTGTATCTGATGGTCTGAGTATTGTAACAATTTGTAATTTAAGTTTGAAACATGTGCATTCCTGTAGGGAAGAGAGATGTTAAGGTGAAGGCCATTGAGGCTGCTGAGGCTGCAAAACGCCTTGCAGAAAAGAAAGAAAATGAACGTCAAATGAAGAAAAAAGCCCTGAAACTTGAAAGAGCAAGAATGGAGCAAGAGAATTTGAGGCAGCTTGAACTTGAGAAAAAGAAGAAAGAAGAAGAGAGAAAGAAGAAAGAGGAAGAAATGAAGAAAAGGGAGGCTGATAAGGCAGCAAAGAAAAGACAGAGAGAAGAAGAAGAGAGGAAGGAGAAAGAAAGAAAAAGAATGCGTGTAGAAGAAGTTAGGAGACGATTACGAGAACATGGTGGGAAGTCACGATCAGATAAAGAGAATAAGGATGTGAAACCCCAAGCCAATGTAAGAAGCTATATGTGATAATGTTTGAACTTTCCTTGTGCTTATGAGATTTGAGAGGTAAAAATAGCTTATTATTTTTTTCTGGTAGGAACAAAAACCACCCGACAGAAAGGCATGTAAGGATGTGACAAACAAACTGGACAAGGAAAACGGACACGACAAATTTGACAAACTCTCAGTTACCAAGTCCAAGAGTACTACAAGCGATGCTAGGAGGAAAAAATTTGTTGTGGAGAACTCACAACCAACGAGTGTAGAATTTCTAGATCCAGAGGTAAATTGATTTTGTATTTGCAAATTTCCAAAAACTATGACGTGATTTATAGTGAAATGTTATTTTCCCATAATAAGAATGGCCAATTAGCTAGCAATCTTCCATTGTCCTAACCCTCACCAGTCTTCTACTGAAAGCGTTGTTTATTATTATTTTTTTATATTTTTTTATTTAAAATTGTCACAATTTGTTTTGAGGGATAATAGATGGAGGACCCGCATTGAAACCTTTTGGAGGTAATTTAGTGCAGCATCTGAAAGATTCTGGGCTTATAGTGAAGTAGGTTGTAATTTGAAATCATCCTAGGATACTTCTTGTAGATAAGCAATAAAATAATTGAGAAGGCACAACTCTTGAAACATAACACCCTAGGCTTCCAGTTAGTTTTAGGTCTGAATCTACAAGAAAGAAGCATGAAAACTATTTTCTAGATAATTATGTGTCTGTCTAGGAGTGATGATTAAAGCAGTGATTTTAAAATTTTGAGTGAGAGTTAAAGTAGTGATTTTAGAAAGAGAATTTTGTACAAATTATTTTGTAAAGATGATTTGATTCAATTGTATTGATATTTGATTCAGAATTTTAAAAATTGAATTTTATATAGTTAAAAGTGATTTTGTGTTATTATTTTTGGTTTCAAAATTGATTTTAGAATTACTAAATTATGAAAAAAGACATAAAAAAAACGTTCAATTATATGGGACTAATTTTACTTAAGAAGAGACCGAGAAAAAAAAAACAGAGAATGGAGCTAATTTTGATGATTCTTGGTCAATCAGTTCCCTAAAAGCACATTCCAACTCGTTGGAAAATAATTCTATATGATTAGATTTTTCTAAAATAAGATTTCAACATCTGCCAATTATCAACTTTTGAAAGAAAAGTGATTTTGGCAATGACAAATTAATGTTGGCCATGTTATAATCATTCCAAAACATACATTAAATTTTTATGCATCTTGCTGTATCCTGTTACATACTTATAACCACAGAAAATTAATTCCACTTGATATAGAAACTCTGTTGGACGTTTGTAGAGTAAATCTTACTGAAACCTGAATCTAAACTGAACTAAGAATTTTAAGATACTATACAGTATTGACTTTTGCTTGATCCTATCATATGTCGCTTAAAATTGAGAAATTAAATAGTTATATGATTTAACCTACTTGGAATGACTGCTCAAATATTTGGCGACTTAAAGTGAAAGAGAATTAGTTGGCGATTAAAGGTAGGTCTTCGTCTGTTTGAGAGAAGAAAAAGATTTTTATAATTGTAACACAGAAAAAAAGTTTTTATAATTGTAACACACTTTTAGATGTTTGATATTAGGAAGTCTAATTTCTTGGAAGACTATAACATACATGATGAAGGAAGATATTTCTTGTTCCATTTGTGCTTCTTATGCTGAAATTTATGATCTTTAGGCACTTGAAAATGGGATGGAGAGTAGAATCTCCGAAACAAGTGAACGAGAATCATATCAGATATCTCCTTACAAAGCTTCTGATGATGAAGATGAAGAGGATGAAGATGATGGCATACGAAAAAATAAATTTGTTCCTTCGTGGGCAAGGTGTGAAACTTGGTCTGATGTGTTTTCAAATAAGGAAATAACAAATGTGATTTAAACTCCGATGATGCAATTTAGTATATTTTGCATTTTTCATTGCATCCCTCTCTCCTAACATCCTTTTTGTCAAAGATAATCGCGAAGACCATTTCAAATTTGCCATTTTGTGTTAGAACTAATTTCTTCATACTTCTTTTATGCAGTAAGGATCGCGTAGCTGCCCTTTTTGCTTCTCAGCAAAAATTGAATCCAGAAATTATATTTCCACCGAAAAGCTTTTGTGATATAGAGCAAGGTGAAAATTAACACAGAAAGATGGCTGCAGCTTTTAGTCTTTATTAATATTAAATAATAAAAACGCACTGAGCTAGTTTAACTATTTTGCAGTTCTATTGCTTTGACGGCATTGGTTTAAATAGTCGTAACTTTCACTATGTGGATAGATTTTATCTGCAACAGAAACATTCCTCTTTCTCACCGTGCAAGAGATTTGCCTAGGTATGCTTTTATCGAACTGTGTTTTTTCTTACTTGCACTTCACATCATGATTGATCACTTTTGACTTGTTTGTCAATAAGTTAGTTGCAACGTCACTCGTTGGACATTCATGGTGTTTGAGGGTCATTTCTAGTTATTAGGTCTATCTAATCTACATAATGTTGACTTCAGTATTTATGTAAATCTTTTACCCGTAGTTAGCTGTACTGTATCTTTTGCGTGGCTTATCTCTGTCCTTCCCATATAAATTATCATTCGATCTAATGGATGATTGAGAGAAAACATGCACTAGTTGTGTAAATAATCAATAAGAATTTCCCTTAAGCATTATGGCTGCTTGTTTGTTTAATAAATCGATTACTGTTTTCATTTGTCCATGTCTCATTAAAATGGAGTGCTATACTATGTGGCAGATTTTGCTGACTAATTTCTCAAGGAGTTGCTGATCAGGTGATTAGTTTCTTGCTACTCTATATTCTGAATAGCTAAATTGAGATGATAATTGTATAATAGGCAGCTTTATATGTGGGGTTAATTTCATAGCACCGAAATAAGCTGTAAAACCCAGTACCCACCCCCCTCTTTCGACTGGATTCCTCAACCATGAAAAAGTTGGGGAAAACGCCTGTAGTTAGTTCTTGGTGTCAACCTTCCCTTCTTTTGCTTTTTGGTGAGGGACAGGGGGAGGATTTAGGGTTTCCCTGTAGAAAAAGCAGAATGTAACCTGCCAACATGTAGGTGTAAAGCACTAGTTCTTAAGATAACTTCTCTATGTACAATTTATTTTTATTATATTGTCTAATGAAGATGCGATTGTTCATCTTGAAGAGGCAAACATTGATTATATTCTAGAAAAAAAGGGTAACCCTTAATTCCGTAAAGGCTTATGCCAGTATTACATAGGTTGGTTTATGATGAAGGAATTTATAATATGAATAGAATTTATATATAATCAGTTCCCAGCTGAAACTAGGTTTCTATTTTATCTAGTTCTTGCTTTGACAGAATCTTTTTCCTCTCATAAGTTCCATTGCTTGAAAACTTGTGTTATTCCTAAAGTGGAAGGATGGACAACCATGAGTAATAATTCAAAAAAAAAAAAAAAAAAAGGAAAAAAAAGAAATTTTCCCTGTCTTGTTTGGTTTGATAGATCTTGATTGCTTTGTTTCTCCAATTTTCAAGATTATGTTTCCACATTTTGTAGCCCTTTTTTCTGGATTCTTTTTTCTCCCAACATAAGTTGTGGGATATATGTTGGAGAAGAACGAACAGGCATAACAATGGGAGAGAGGTGAAACAAAGTTTGAGAAAGCTATCACCCCTGTAAGGACATATACCAAAACTTTACATATTTGTATTTCCACCAACTACTTCTAATAATTACACCACAAAAATGTCTCACAACTCACAGACGAAAAAAAAACAACCTAAACATCTTTGTTGAAACCAACTTGTTATAAAATAAAATCAAACAAATTGTTAAACCAAAGCCAACTTAACTGCTGAAGTTTTGATCTTGCATTATGAGTTCAAAATTAGCTAATTCATTGAGTCAAGTTGATTTGGTTGATTTTTTGAATTTGAGTTACACTGCAAAAGAAATAAAAGTTCATCTTATTCAATAGACGATTAGACAAACAGGGCAAGAGTGCATGTTGGATATGTTGATGAGCCTTGCTTGACCATATTTGTAATAACATGAGAAAGATGAGAATTAGGGTGACAGAAGAAGAGAGAACATGTACACTCCCAACAAGGACCAAAGTATCTCTTAAGCCAATGGTTGGGTTCTGCTCTCTCTCTCTTCTGTCAACGTCTTTTCATCCATCTTTCACCTCTTCAACTTCACCATTATTTGATAACATATGCATACTCATTCAACCATCTTACAAAAAAAGAAAAAAACCCCTAATTTCATCCATAATATTTAGGGGTACACCTTCTTAAGTGAGAAATTTGAACTTCACATGACATTACACATCATAAATCAGTTAAGTTATGCTTATGTTACCAGTTTCAATTTCTAAACACTAAGTTAGGGTCTGTTTGGCCCAAAGATTACCTAATAGCCCACTCCATGTTTGACCCACTGTTTACCATAATTGGGGTTAGGGTTATCAAACTCTTCTTCACTCCCCCTCTCTCCTTCAGGCGTTGTTCACCACTCCCCCTACCGCACTGTTCACCACTCCGCATCTCTCCTTCCCTCTCCCTTTGTTTTCCCTTCTCCCAATCCCTCCATCTCCCTTACCTTGATTGTCCGTCCCAGCTCTACTTCTGCCACCGCCTCTATTGGACTCGTAGGTATCGGATCAAGCGGTGGCGGAAACATGCTGAGATTCTACATTGTCGTGAGCCTCTGCTTTATTGGCTTCGTCACTAGCCTCCATGTCTTTGGTAAGCTCTACTGTGCACGTTCTGCCCATGGAGTTTAA
mRNA sequence
ATGTCGGCTATGGAGAAGCTATTCGTTCAGATCTTCGAAAGGAAGAAGTGGATCATCGACCAGGCCAAGCAGCAGACCGATCTCTTCGACCAACACCTCGCTTCCAAGCTCATTATCGATGGAATTGTTCCTCCTCCTTGGCTTCACTCCTCTTTTCTTCATTCCCACATTTCCCATTTCGAAGAGGTGAACAAAGGTTTTATTTCTGGCGTTGAGTTCCCACGTTCGCCGCTTGATACCCATCGTTCTAGTTTGAATGAAGCATTTGTTGAAGACAGTGGGGAGGAGTTGGAGCACAGGTCGACTGAAGAAGCTGGTTCCTTAAACGATGATTTTGATGCAGCAAATAGGCCAGCAATTTCACCCCAGTGTGATATAAGTAGTGCCGGTGTCTTAAATTGCGCGCCTTGTATTGAAATGACTCCTGTTTCTCCTCATGGTCGAGGAGGCATAGTCTCAGACAATTACCGGGATCCTACTCTGTCATTGGCTCGGTTACACAGATCTAAATCTAGGCAAAAGGCTTTAGAGTTGCGTAATAGTGTGAAATCTACAAGGTGCCAATCTCGGTGTGAGAACAAGAGTGATTCCCTTGCTGGTGGGATTGTAGGATCTGCTATTGGTTTACTGCAAGCTGATCACGAAGATGAATCAGGGTTGGCAAAGGCTTCCAGTAGCTGTAACGGAATTGGTTCTCTAGAAGAAGAATCTAATGTTGGTTGTGAGCAGAAGGATAGCTCTATTGGCTCGGATAAAGTTGGAGTAGTTGTAAGCCCTGGGTTGCAAAGTAGATTTATTGATGTGGACAATTCTTTAAACATTTCCTCTAAAAATGAAGAGTTATGTATAGCTGGAGGTTCAACACAGAATTCTTATCAAGTAAATGAGCAATTTGACTCACCTAGACCTTCTTCGGGAAAGATTGAAGAAGAGTCCGCATATTGCAGGAGCCAGGAATATAGTTCTGATAAACCTGAAAAGTGTAGGTTGCAATGTAGCTCTTTGGATGCAAATGAGACTTCATGCATTTCCCCGGAAGATGGAAGAGCAGGTCCTATAGGAGGTTCAAAATTCCATTCTGATCAAGTGGACGAGCAATTGGACTTGCCTAAACCTTCTTCTGACAATGTTGAGTGTAATGAAAAGGCTGTATTAGGACATTGCAGGAGCCATGACTATGATCTTGATAACGCTCTACAGTCTGAGTCACAACAAAGGTCCCAGGAAATGGATGATTCATCACGCATTGACGCCAGTGATGGAAGATTGTTGGACTTGTATAACCCTTCTTCTGGAAAAGTTGAATGCTGTGAAGAAACTATTTCAGGACATTGCAGGAGCAAGGAATGTAATTTCGAAATTGCCCAACAGTTTGGGTCGCAATACAGCTCCCAGGATGCGGATAATTCTTCATATGTTGATGAGGTGGGAGGATCCTGTCCCATTGGAAGTTCAAAAGTGCACCCTCATGAAGTGAAAGAGCAATTGGACTTATCTAAATCTTCTTTCGACAATATCGAGTGCTGTGAAGAAAAAATATTAGGTGATTTGAGTAATCAGGAGTATAAACTTAATAATCCTCAAAAGTCTGGGATGCAACATAACTCCCTGGATGGGGACAATTCATCATGCTTTTCTTCTGTAAATGGAACTTTTTGCCCTGTTGGAAGTTCGAAGCAACATTCTGATCAAGTGATTGAGCGGTTGGAGTTGTTTAGACCTTCTTCTGTCAATAGTGAATGTCATGAAGAAGAACTAGAAGACTGTAGGACCCAGGACTGTAATTTTGATAATAATGCGGAACAGTTTGATGTTGACAAAAAATTCAGTTCACCAATAACGGAGGTAAGGGAGAATACATCCGATAAGAAGCCCTCCAGTTTTCTAGATGACAAGAGAGATGTCAGTGAAAATGAAAAATGCAATTCACTCCTTCACATACCTTTGCCACAGATTCAGGTAGACTCAGTGAAGGAAAACGAATCTGATAAAGGTGCATCTGAATCTCACAGTGAGAGGAGATATGAAGACACAGGAGATTTTAATGGAAATACTCTCTCATCTGGCAACAAGTCACTGCAAGGTTATGAAGAAGTAAATACTTGTTCTTTGCTGCAAAGTGATGAACCTGCTGAAAAAAACGTTTCTTGGAAAGATGGAGTATCAGATTTGCAGAATTCCCATGACAATGTAGTTGAAATTCCACCAGTGGATGCAAACGGTGCATCAGTTCCTATAAAAGATACAGAAACATTTAGAGATCATGTAGTAATGGTTCCTTGTGTTCCTTACGTTGGTGAAACAGATGGTTATTTGGAGCAGCAACTGAAAAGTGCAGGCATATCTCAGTGCGCAGATTCAGATTCCTTTGAGTATTGCACTGACGACTTTAATGGTAACCATCATTACTTATCAACAGAGTGCCAGATTGCAGAAACATCAATTGAGTTAAAAACTTTCAGCGCACTTACGAAGGCATCTAGTTCTCCTGAAGACGTGAGAAGGGTTGAGCCAGAATTGGGGATTGGTATTCCTGGATCTTTAGGCTTGGGGAGTGAGCAACTTCAAATTATCAACGGGAGTCCCACAGATAAAATGTTGATGCAGGAATTTGACACTGAAAAACCTGTCCTTGAATTTCAACGATTATCATTTTGTGAAGAAGGTTACCAACAATCAAATGTGAGCACTGTCCCTATTGAAATGTTGCTTTTGGAAAAGGAAGCTCATTCAATGCAATTGTCTGATTCTTCACCCACGCTTCCAGTTAAAGAGGATCTCTCTAGGTTCAGAAGCAATAACAGAGGCACGCTGTTGCAAAATGTCATGCTAGAGAGCCAAAGTTTGGATCCTGAAGAAAATCTTCAGTCTGGAGAAGATAACAAACTTCCTGTTGATACCGGGAAGACGGAAAGAGAGGAGGATAAGGGGAAACTTACTTCTTGCTCACTTCTTACTCCCCTAATCCAAACTTCTCATTATCTTGGTGACGGCAAGGATATGCCTGCATTAGAGGGGTTCCTAATGCAGTCTGATGCTGAACAACCATGCCTATCTGTTCGTGGAATCAACCTTGACACATTAGAACTTTCGAAGTGTATGATAGAACGCGCTAGCATATTGGAGAAAATTTGTAAATCTGCTTGTATAAACAGTCCATTATCCTCATCTTCAGAAAGTCTTAAGTTGAACAAGGTGGCAGATTTGTACCATTCCCTTTCTAATGGTCTGTTAGAGAGCGTGAACTTGAAGAGTAACTTTCTGATGAATGATCAAAATAAGCTACTGAAGGATGGTAGTAACTTCTTGAATGGAGAAGTCAACTGTTCTCCTCATGGGTCTTTTTCTGCTTGCCTGAAAAGCATTGGCAGTCATTCAGCTAGCGATGTTAGGAGGCTGTTTGTATCCCCCTTTAGTAAGTTGTTGGATAGAAATTCATTAAATTCCTCAAGTTCAGGAAAACGAAGCAGCCCGAATATAGAGCTTCCTTGCATTAGTGAAGAAGCTGAGAGTACAGAGGAGACTGATAATGAGTTTGCAAAGGATATGAAATCGAACAAGCGAGTACCACTTGTTGACATTACAGAAAATTCAAATGTTCTGGTAACAGTTTCTGAAGCTATCATGTGTGCTGATAGATTAAGTTTAGAATCTTTAAACACGGAACTCAGCAACACGGGGACTCATAATAGAACCAAAGAGAATCTGGCAAACCAGAAAAAGAGTAAAAGGAAATATTTGAATGAGGCTGTAGATCTTGATATCTTTCCAGGAGCAAACAGAGCTAAAAGAGTCACTAGGTCATCTTATAATAGATTTAGCAGGACAGATTTATCCTGTAAAGAAAATTTCAGAAAAGAAGGCTCTCGATTCTCTGGAAAGGAGACCAAGCATAAAAATATCGTGTCCAATATTACTTCTTTTATTCCTCTTGTCCAACAAAGAGAAGCTGCAACTATTTTGAAAGGGAAGAGAGATGTTAAGGTGAAGGCCATTGAGGCTGCTGAGGCTGCAAAACGCCTTGCAGAAAAGAAAGAAAATGAACGTCAAATGAAGAAAAAAGCCCTGAAACTTGAAAGAGCAAGAATGGAGCAAGAGAATTTGAGGCAGCTTGAACTTGAGAAAAAGAAGAAAGAAGAAGAGAGAAAGAAGAAAGAGGAAGAAATGAAGAAAAGGGAGGCTGATAAGGCAGCAAAGAAAAGACAGAGAGAAGAAGAAGAGAGGAAGGAGAAAGAAAGAAAAAGAATGCGTGTAGAAGAAGTTAGGAGACGATTACGAGAACATGGTGGGAAGTCACGATCAGATAAAGAGAATAAGGATGTGAAACCCCAAGCCAATGAACAAAAACCACCCGACAGAAAGGCATGTAAGGATGTGACAAACAAACTGGACAAGGAAAACGGACACGACAAATTTGACAAACTCTCAGTTACCAAGTCCAAGAGTACTACAAGCGATGCTAGGAGGAAAAAATTTGTTGTGGAGAACTCACAACCAACGAGTGTAGAATTTCTAGATCCAGAGGCACTTGAAAATGGGATGGAGAGTAGAATCTCCGAAACAAGTGAACGAGAATCATATCAGATATCTCCTTACAAAGCTTCTGATGATGAAGATGAAGAGGATGAAGATGATGGCATACGAAAAAATAAATTTGTTCCTTCGTGGGCAAGTAAGGATCGCGTAGCTGCCCTTTTTGCTTCTCAGCAAAAATTGAATCCAGAAATTATATTTCCACCGAAAAGCTTTTGTGATATAGAGCAAGATGCGATTGTTCATCTTGAAGAGGCAAACATTGATTATATTCTAGAAAAAAAGGGCGTTGTTCACCACTCCCCCTACCGCACTGTTCACCACTCCGCATCTCTCCTTCCCTCTCCCTTTGTTTTCCCTTCTCCCAATCCCTCCATCTCCCTTACCTTGATTGTCCGTCCCAGCTCTACTTCTGCCACCGCCTCTATTGGACTCGTAGGTATCGGATCAAGCGGTGGCGGAAACATGCTGAGATTCTACATTGTCGTGAGCCTCTGCTTTATTGGCTTCGTCACTAGCCTCCATGTCTTTGGTAAGCTCTACTGTGCACGTTCTGCCCATGGAGTTTAA
Coding sequence (CDS)
ATGTCGGCTATGGAGAAGCTATTCGTTCAGATCTTCGAAAGGAAGAAGTGGATCATCGACCAGGCCAAGCAGCAGACCGATCTCTTCGACCAACACCTCGCTTCCAAGCTCATTATCGATGGAATTGTTCCTCCTCCTTGGCTTCACTCCTCTTTTCTTCATTCCCACATTTCCCATTTCGAAGAGGTGAACAAAGGTTTTATTTCTGGCGTTGAGTTCCCACGTTCGCCGCTTGATACCCATCGTTCTAGTTTGAATGAAGCATTTGTTGAAGACAGTGGGGAGGAGTTGGAGCACAGGTCGACTGAAGAAGCTGGTTCCTTAAACGATGATTTTGATGCAGCAAATAGGCCAGCAATTTCACCCCAGTGTGATATAAGTAGTGCCGGTGTCTTAAATTGCGCGCCTTGTATTGAAATGACTCCTGTTTCTCCTCATGGTCGAGGAGGCATAGTCTCAGACAATTACCGGGATCCTACTCTGTCATTGGCTCGGTTACACAGATCTAAATCTAGGCAAAAGGCTTTAGAGTTGCGTAATAGTGTGAAATCTACAAGGTGCCAATCTCGGTGTGAGAACAAGAGTGATTCCCTTGCTGGTGGGATTGTAGGATCTGCTATTGGTTTACTGCAAGCTGATCACGAAGATGAATCAGGGTTGGCAAAGGCTTCCAGTAGCTGTAACGGAATTGGTTCTCTAGAAGAAGAATCTAATGTTGGTTGTGAGCAGAAGGATAGCTCTATTGGCTCGGATAAAGTTGGAGTAGTTGTAAGCCCTGGGTTGCAAAGTAGATTTATTGATGTGGACAATTCTTTAAACATTTCCTCTAAAAATGAAGAGTTATGTATAGCTGGAGGTTCAACACAGAATTCTTATCAAGTAAATGAGCAATTTGACTCACCTAGACCTTCTTCGGGAAAGATTGAAGAAGAGTCCGCATATTGCAGGAGCCAGGAATATAGTTCTGATAAACCTGAAAAGTGTAGGTTGCAATGTAGCTCTTTGGATGCAAATGAGACTTCATGCATTTCCCCGGAAGATGGAAGAGCAGGTCCTATAGGAGGTTCAAAATTCCATTCTGATCAAGTGGACGAGCAATTGGACTTGCCTAAACCTTCTTCTGACAATGTTGAGTGTAATGAAAAGGCTGTATTAGGACATTGCAGGAGCCATGACTATGATCTTGATAACGCTCTACAGTCTGAGTCACAACAAAGGTCCCAGGAAATGGATGATTCATCACGCATTGACGCCAGTGATGGAAGATTGTTGGACTTGTATAACCCTTCTTCTGGAAAAGTTGAATGCTGTGAAGAAACTATTTCAGGACATTGCAGGAGCAAGGAATGTAATTTCGAAATTGCCCAACAGTTTGGGTCGCAATACAGCTCCCAGGATGCGGATAATTCTTCATATGTTGATGAGGTGGGAGGATCCTGTCCCATTGGAAGTTCAAAAGTGCACCCTCATGAAGTGAAAGAGCAATTGGACTTATCTAAATCTTCTTTCGACAATATCGAGTGCTGTGAAGAAAAAATATTAGGTGATTTGAGTAATCAGGAGTATAAACTTAATAATCCTCAAAAGTCTGGGATGCAACATAACTCCCTGGATGGGGACAATTCATCATGCTTTTCTTCTGTAAATGGAACTTTTTGCCCTGTTGGAAGTTCGAAGCAACATTCTGATCAAGTGATTGAGCGGTTGGAGTTGTTTAGACCTTCTTCTGTCAATAGTGAATGTCATGAAGAAGAACTAGAAGACTGTAGGACCCAGGACTGTAATTTTGATAATAATGCGGAACAGTTTGATGTTGACAAAAAATTCAGTTCACCAATAACGGAGGTAAGGGAGAATACATCCGATAAGAAGCCCTCCAGTTTTCTAGATGACAAGAGAGATGTCAGTGAAAATGAAAAATGCAATTCACTCCTTCACATACCTTTGCCACAGATTCAGGTAGACTCAGTGAAGGAAAACGAATCTGATAAAGGTGCATCTGAATCTCACAGTGAGAGGAGATATGAAGACACAGGAGATTTTAATGGAAATACTCTCTCATCTGGCAACAAGTCACTGCAAGGTTATGAAGAAGTAAATACTTGTTCTTTGCTGCAAAGTGATGAACCTGCTGAAAAAAACGTTTCTTGGAAAGATGGAGTATCAGATTTGCAGAATTCCCATGACAATGTAGTTGAAATTCCACCAGTGGATGCAAACGGTGCATCAGTTCCTATAAAAGATACAGAAACATTTAGAGATCATGTAGTAATGGTTCCTTGTGTTCCTTACGTTGGTGAAACAGATGGTTATTTGGAGCAGCAACTGAAAAGTGCAGGCATATCTCAGTGCGCAGATTCAGATTCCTTTGAGTATTGCACTGACGACTTTAATGGTAACCATCATTACTTATCAACAGAGTGCCAGATTGCAGAAACATCAATTGAGTTAAAAACTTTCAGCGCACTTACGAAGGCATCTAGTTCTCCTGAAGACGTGAGAAGGGTTGAGCCAGAATTGGGGATTGGTATTCCTGGATCTTTAGGCTTGGGGAGTGAGCAACTTCAAATTATCAACGGGAGTCCCACAGATAAAATGTTGATGCAGGAATTTGACACTGAAAAACCTGTCCTTGAATTTCAACGATTATCATTTTGTGAAGAAGGTTACCAACAATCAAATGTGAGCACTGTCCCTATTGAAATGTTGCTTTTGGAAAAGGAAGCTCATTCAATGCAATTGTCTGATTCTTCACCCACGCTTCCAGTTAAAGAGGATCTCTCTAGGTTCAGAAGCAATAACAGAGGCACGCTGTTGCAAAATGTCATGCTAGAGAGCCAAAGTTTGGATCCTGAAGAAAATCTTCAGTCTGGAGAAGATAACAAACTTCCTGTTGATACCGGGAAGACGGAAAGAGAGGAGGATAAGGGGAAACTTACTTCTTGCTCACTTCTTACTCCCCTAATCCAAACTTCTCATTATCTTGGTGACGGCAAGGATATGCCTGCATTAGAGGGGTTCCTAATGCAGTCTGATGCTGAACAACCATGCCTATCTGTTCGTGGAATCAACCTTGACACATTAGAACTTTCGAAGTGTATGATAGAACGCGCTAGCATATTGGAGAAAATTTGTAAATCTGCTTGTATAAACAGTCCATTATCCTCATCTTCAGAAAGTCTTAAGTTGAACAAGGTGGCAGATTTGTACCATTCCCTTTCTAATGGTCTGTTAGAGAGCGTGAACTTGAAGAGTAACTTTCTGATGAATGATCAAAATAAGCTACTGAAGGATGGTAGTAACTTCTTGAATGGAGAAGTCAACTGTTCTCCTCATGGGTCTTTTTCTGCTTGCCTGAAAAGCATTGGCAGTCATTCAGCTAGCGATGTTAGGAGGCTGTTTGTATCCCCCTTTAGTAAGTTGTTGGATAGAAATTCATTAAATTCCTCAAGTTCAGGAAAACGAAGCAGCCCGAATATAGAGCTTCCTTGCATTAGTGAAGAAGCTGAGAGTACAGAGGAGACTGATAATGAGTTTGCAAAGGATATGAAATCGAACAAGCGAGTACCACTTGTTGACATTACAGAAAATTCAAATGTTCTGGTAACAGTTTCTGAAGCTATCATGTGTGCTGATAGATTAAGTTTAGAATCTTTAAACACGGAACTCAGCAACACGGGGACTCATAATAGAACCAAAGAGAATCTGGCAAACCAGAAAAAGAGTAAAAGGAAATATTTGAATGAGGCTGTAGATCTTGATATCTTTCCAGGAGCAAACAGAGCTAAAAGAGTCACTAGGTCATCTTATAATAGATTTAGCAGGACAGATTTATCCTGTAAAGAAAATTTCAGAAAAGAAGGCTCTCGATTCTCTGGAAAGGAGACCAAGCATAAAAATATCGTGTCCAATATTACTTCTTTTATTCCTCTTGTCCAACAAAGAGAAGCTGCAACTATTTTGAAAGGGAAGAGAGATGTTAAGGTGAAGGCCATTGAGGCTGCTGAGGCTGCAAAACGCCTTGCAGAAAAGAAAGAAAATGAACGTCAAATGAAGAAAAAAGCCCTGAAACTTGAAAGAGCAAGAATGGAGCAAGAGAATTTGAGGCAGCTTGAACTTGAGAAAAAGAAGAAAGAAGAAGAGAGAAAGAAGAAAGAGGAAGAAATGAAGAAAAGGGAGGCTGATAAGGCAGCAAAGAAAAGACAGAGAGAAGAAGAAGAGAGGAAGGAGAAAGAAAGAAAAAGAATGCGTGTAGAAGAAGTTAGGAGACGATTACGAGAACATGGTGGGAAGTCACGATCAGATAAAGAGAATAAGGATGTGAAACCCCAAGCCAATGAACAAAAACCACCCGACAGAAAGGCATGTAAGGATGTGACAAACAAACTGGACAAGGAAAACGGACACGACAAATTTGACAAACTCTCAGTTACCAAGTCCAAGAGTACTACAAGCGATGCTAGGAGGAAAAAATTTGTTGTGGAGAACTCACAACCAACGAGTGTAGAATTTCTAGATCCAGAGGCACTTGAAAATGGGATGGAGAGTAGAATCTCCGAAACAAGTGAACGAGAATCATATCAGATATCTCCTTACAAAGCTTCTGATGATGAAGATGAAGAGGATGAAGATGATGGCATACGAAAAAATAAATTTGTTCCTTCGTGGGCAAGTAAGGATCGCGTAGCTGCCCTTTTTGCTTCTCAGCAAAAATTGAATCCAGAAATTATATTTCCACCGAAAAGCTTTTGTGATATAGAGCAAGATGCGATTGTTCATCTTGAAGAGGCAAACATTGATTATATTCTAGAAAAAAAGGGCGTTGTTCACCACTCCCCCTACCGCACTGTTCACCACTCCGCATCTCTCCTTCCCTCTCCCTTTGTTTTCCCTTCTCCCAATCCCTCCATCTCCCTTACCTTGATTGTCCGTCCCAGCTCTACTTCTGCCACCGCCTCTATTGGACTCGTAGGTATCGGATCAAGCGGTGGCGGAAACATGCTGAGATTCTACATTGTCGTGAGCCTCTGCTTTATTGGCTTCGTCACTAGCCTCCATGTCTTTGGTAAGCTCTACTGTGCACGTTCTGCCCATGGAGTTTAA
Protein sequence
MSAMEKLFVQIFERKKWIIDQAKQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHFEEVNKGFISGVEFPRSPLDTHRSSLNEAFVEDSGEELEHRSTEEAGSLNDDFDAANRPAISPQCDISSAGVLNCAPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKALELRNSVKSTRCQSRCENKSDSLAGGIVGSAIGLLQADHEDESGLAKASSSCNGIGSLEEESNVGCEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNISSKNEELCIAGGSTQNSYQVNEQFDSPRPSSGKIEEESAYCRSQEYSSDKPEKCRLQCSSLDANETSCISPEDGRAGPIGGSKFHSDQVDEQLDLPKPSSDNVECNEKAVLGHCRSHDYDLDNALQSESQQRSQEMDDSSRIDASDGRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAQQFGSQYSSQDADNSSYVDEVGGSCPIGSSKVHPHEVKEQLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKSGMQHNSLDGDNSSCFSSVNGTFCPVGSSKQHSDQVIERLELFRPSSVNSECHEEELEDCRTQDCNFDNNAEQFDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSENEKCNSLLHIPLPQIQVDSVKENESDKGASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVNTCSLLQSDEPAEKNVSWKDGVSDLQNSHDNVVEIPPVDANGASVPIKDTETFRDHVVMVPCVPYVGETDGYLEQQLKSAGISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVEPELGIGIPGSLGLGSEQLQIINGSPTDKMLMQEFDTEKPVLEFQRLSFCEEGYQQSNVSTVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQSLDPEENLQSGEDNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGDGKDMPALEGFLMQSDAEQPCLSVRGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLESVNLKSNFLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRRLFVSPFSKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNEFAKDMKSNKRVPLVDITENSNVLVTVSEAIMCADRLSLESLNTELSNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGANRAKRVTRSSYNRFSRTDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKKALKLERARMEQENLRQLELEKKKKEEERKKKEEEMKKREADKAAKKRQREEEERKEKERKRMRVEEVRRRLREHGGKSRSDKENKDVKPQANEQKPPDRKACKDVTNKLDKENGHDKFDKLSVTKSKSTTSDARRKKFVVENSQPTSVEFLDPEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWASKDRVAALFASQQKLNPEIIFPPKSFCDIEQDAIVHLEEANIDYILEKKGVVHHSPYRTVHHSASLLPSPFVFPSPNPSISLTLIVRPSSTSATASIGLVGIGSSGGGNMLRFYIVVSLCFIGFVTSLHVFGKLYCARSAHGV*
Homology
BLAST of Chy1G011410 vs. ExPASy TrEMBL
Match:
A0A0A0K8D1 (INCENP_ARK-bind domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G115340 PE=3 SV=1)
HSP 1 Score: 2839.7 bits (7360), Expect = 0.0e+00
Identity = 1501/1588 (94.52%), Postives = 1533/1588 (96.54%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQAKQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQ KQQTDLFDQHLASKLIIDGIVPPPWLHS+FLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF 60
Query: 61 EEVNKGFISGVEFPRSPLDTHRSSLNEAFVEDSGEELEHRSTEEAGSLNDDFDAANRPAI 120
+EVNK FISGVEFPRSPLD HRSSLNEAFV DSGEE EHRSTEEAGSLNDDFDA N PAI
Sbjct: 61 QEVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNPAI 120
Query: 121 SPQCDISSAGVLNCAPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKALELRN 180
SPQCDIS+AGVLNC+PCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKA ELRN
Sbjct: 121 SPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFELRN 180
Query: 181 SVKSTRCQSRCENKSDSLAGGIVGSAIGLLQADHEDESGLAKASSSCNGIGSLEEESNVG 240
SVKSTRCQSRCENKSDS+AGGIVGS IG LQ+DHEDESGLAKASSSCNGIGSLEEESNVG
Sbjct: 181 SVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESNVG 240
Query: 241 CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNISSKNEELCIAGGSTQNSYQVNEQFDS 300
CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNI SKNEELCIAGGSTQNSY+VNEQFDS
Sbjct: 241 CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQFDS 300
Query: 301 PRPSSGKIEEESAYCRSQEYSSDKPEKCRLQCSSLDANETSCISPEDGRAGPIGGSKFHS 360
PRPSSGKIEE SAYCRSQEYSSDKPEKCRLQ SSLDANETSCISPEDGRAGPIGGSKFHS
Sbjct: 301 PRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKFHS 360
Query: 361 DQVDEQLDLPKPSSDNVECNEKAVLGHCRSHDYDLDNALQSESQQRSQEMDDSSRIDASD 420
DQVDEQLDLPKPSSDNVECNEKAVLG CRSHDYDLD ALQSESQQRS E+DDSS IDASD
Sbjct: 361 DQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDASD 420
Query: 421 GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAQQFGSQYSSQDADNSSYVDEVGGSC 480
GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIA Q GS+YSSQD DNSSYVDEVGGSC
Sbjct: 421 GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVDEVGGSC 480
Query: 481 PIGSSKVHPHEVKEQLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKSGMQHNSLDGD 540
PIGSSKVHPHEVKE+LDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQK GMQHNSLDGD
Sbjct: 481 PIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSLDGD 540
Query: 541 NSSCFSSVNGTFCPVGSSKQHSDQVIERLELFRPSSVNSECHEEELEDCRTQDCNFDNNA 600
NSSCFSSV+GTFC VGSSKQHSDQ IERLELFRPSSVNSECHEEELEDCRTQDCNFD NA
Sbjct: 541 NSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFD-NA 600
Query: 601 EQFDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSENEKCNSLLHIPLPQIQVDSVKEN 660
EQ DVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSE EKCNSLLHIPLPQIQVDSVKEN
Sbjct: 601 EQSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSVKEN 660
Query: 661 ESDKGASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVNTCSLLQSDEPAEKNVSWKDG 720
ESDK ASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEV TCSLLQSDEPAEKNVS KDG
Sbjct: 661 ESDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSLKDG 720
Query: 721 VSDLQNSHDNVVEIPPVDANGASVPIKDTETFRDHVVMVPCVPYVGETDGYLEQQLKSAG 780
VSDLQNSHDNVVEIPPVDANGASVPI+DTETFRDHVVMVPCVP+VGETDGYLEQQLKSAG
Sbjct: 721 VSDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLKSAG 780
Query: 781 ISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVEPEL 840
ISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRV+PEL
Sbjct: 781 ISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQPEL 840
Query: 841 GIGIPGSLGLGSEQLQIINGSPTDKMLMQEFDTEKPVLEFQRLSFCEEGYQQSNVSTVPI 900
GIGIP SL LGSEQLQIINGSPTDK+LMQEFDTEKPVLEFQRLSFCEEGYQQSNVS VPI
Sbjct: 841 GIGIPESLDLGSEQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCEEGYQQSNVSIVPI 900
Query: 901 EMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQSLDPEENLQSGED 960
EMLLLEKEAHSMQLSDSSPTL VKEDLSRFR+NNRGTLLQNVMLESQSLDPEENLQSG D
Sbjct: 901 EMLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQSLDPEENLQSG-D 960
Query: 961 NKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGDGKDMPALEGFLMQSDAEQPCLSV 1020
NKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLG KDMPALEGFLMQSDAEQPC+SV
Sbjct: 961 NKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFLMQSDAEQPCISV 1020
Query: 1021 RGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLES 1080
GINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLES
Sbjct: 1021 GGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLES 1080
Query: 1081 VNLKSNFLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRRLFVSPFSK 1140
V+LKSN LMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRR FVSPFSK
Sbjct: 1081 VDLKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRRPFVSPFSK 1140
Query: 1141 LLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNEFAKDMKSNKRVPLVDITENSNV 1200
LLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDN+FAKDMKSN RVPLVD+TEN+NV
Sbjct: 1141 LLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMRVPLVDVTENANV 1200
Query: 1201 LVTVSEAIMCADRLSLESLNTELSNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGAN 1260
V VSE +M ADRLSLESLNTE+ NTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGAN
Sbjct: 1201 PVAVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGAN 1260
Query: 1261 RAKRVTRSSYNRFSRTDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAATI 1320
AKRVTRSSY+RFSR+DLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAATI
Sbjct: 1261 GAKRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAATI 1320
Query: 1321 LKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKKALKLERARMEQENLRQLELEKKKKEE 1380
LKGKRDVKVKAIEAAEAAKRLAEKKENERQMKK+ALKLERARMEQENLRQLELEKKKKEE
Sbjct: 1321 LKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQLELEKKKKEE 1380
Query: 1381 ERKKKEEEMKKREADKAAKKRQREEEERKEKERKRMRVEEVRRRLREHGGKSRSDKENKD 1440
+RKKKEEEMKKR+ADKAAKKRQREEEERKEKERKRM VEEVRRRLREHGGK RSDKENKD
Sbjct: 1381 DRKKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREHGGKLRSDKENKD 1440
Query: 1441 VKPQANEQKPPDRKACKDVTNKLDKENGHDKFDKLSVTKSKSTTSDARRKKFVVENSQPT 1500
VKPQANEQKP DRKACKDVTNKLDKENGH+KFDKLSVTKSKSTTSDARR+ FVVEN+QPT
Sbjct: 1441 VKPQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDARRENFVVENAQPT 1500
Query: 1501 SVEFLDPEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWASKD 1560
V FL+ EALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWASKD
Sbjct: 1501 IVGFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWASKD 1560
Query: 1561 RVAALFASQQKLNPEIIFPPKSFCDIEQ 1589
VA LFASQQKLNPEIIFPPKSFCDIEQ
Sbjct: 1561 HVADLFASQQKLNPEIIFPPKSFCDIEQ 1586
BLAST of Chy1G011410 vs. ExPASy TrEMBL
Match:
A0A1S3CJT1 (uncharacterized protein LOC103501253 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501253 PE=3 SV=1)
HSP 1 Score: 2691.8 bits (6976), Expect = 0.0e+00
Identity = 1440/1599 (90.06%), Postives = 1484/1599 (92.81%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQAKQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQA+QQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQARQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
Query: 61 EEVNKGFISGVEFPRSPLDTHRSSLNEAFVEDSGEELEHRSTEEAGSLNDDFDAANRPAI 120
EEVNK FISGVEFPRSPLDTHRSSLNEAFV DSGEELEHRS EE GSLNDDFDA NRPA+
Sbjct: 61 EEVNKSFISGVEFPRSPLDTHRSSLNEAFVADSGEELEHRSNEETGSLNDDFDAGNRPAV 120
Query: 121 SPQCDISSAGVLNCAPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKALELRN 180
SPQCDI SAGVLNCAPCIEMTPVSPHGRG IVS+NYRDPTLSLARLHRSKSRQKALELRN
Sbjct: 121 SPQCDIRSAGVLNCAPCIEMTPVSPHGRGAIVSENYRDPTLSLARLHRSKSRQKALELRN 180
Query: 181 SVKSTRCQSRCENKSDSLAGGIVGSAIGLLQADHEDESGLAKASSSCNGIGSLEEESNVG 240
SVKSTRCQSRCENKSDS+AG IVGSAIGLLQADHEDESGLAKASSSC GIGSLEEE+NVG
Sbjct: 181 SVKSTRCQSRCENKSDSIAGRIVGSAIGLLQADHEDESGLAKASSSCRGIGSLEEETNVG 240
Query: 241 CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNISSKNEELCIAGGSTQNSYQVNEQFDS 300
CEQK SSIGSDKVGVVVSPGLQSRFIDV+NSLNISSKNEELCIAGGSTQNSYQVNEQFDS
Sbjct: 241 CEQKRSSIGSDKVGVVVSPGLQSRFIDVENSLNISSKNEELCIAGGSTQNSYQVNEQFDS 300
Query: 301 PRPSSGKIEEESAYCRSQEYSSDKPEKCRLQCSSLDANETSCISPEDGRAGPIGGSKFHS 360
PRPSSGKIEE S YCRSQEYSSDKPEKCRLQCSSLDAN+TSCISP DGRAG IGG KFHS
Sbjct: 301 PRPSSGKIEEGSTYCRSQEYSSDKPEKCRLQCSSLDANKTSCISPVDGRAGTIGGPKFHS 360
Query: 361 DQVDEQLDLPKPSSDNVECNEKAVLGHCRSHDYDLDNALQSESQQRSQEMDDSSRIDASD 420
DQVDEQLDLPKPSSDNVECNE+AVLGHCRSHDYDLDNALQS SQQ SQE+DDSS IDA D
Sbjct: 361 DQVDEQLDLPKPSSDNVECNEEAVLGHCRSHDYDLDNALQSRSQQSSQEVDDSSIIDACD 420
Query: 421 GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAQQFGSQYSSQDADNSSYVD-EVGGS 480
GRLLDLYNPSSGKVECC ETI GHC S+ECNFEIAQQ GSQYS QD D+SSYVD EVGGS
Sbjct: 421 GRLLDLYNPSSGKVECCGETILGHCWSQECNFEIAQQSGSQYSPQDVDDSSYVDSEVGGS 480
Query: 481 CPIGSSKVHPHEVKEQLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKSGMQHNSLDG 540
CPIGSS VHP EVKEQLDLSK+S NIECCEEKILG LS+Q+YKL+NPQKSGMQHNSLD
Sbjct: 481 CPIGSSNVHPREVKEQLDLSKTSSGNIECCEEKILGGLSSQDYKLDNPQKSGMQHNSLDA 540
Query: 541 DNSSCFSSVNGTFCPVGSSKQHSDQVIERLELFRPSSVNSECHEEELEDCRTQDCNFDNN 600
DNSSCFSSVNGTFC VGSSKQHSD V E LELFRPSSVNSECHEEELEDCRTQDCNF+NN
Sbjct: 541 DNSSCFSSVNGTFCAVGSSKQHSDLVSEPLELFRPSSVNSECHEEELEDCRTQDCNFNNN 600
Query: 601 AEQFDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSENEKCNSLLHIPLPQIQVDSVKE 660
A Q V K FSSPI EVRE TSDKK SSF+DDKRD SE EK NSLLHIPLPQIQVDSVKE
Sbjct: 601 AVQSGVGKNFSSPIMEVREKTSDKKSSSFIDDKRDASEKEKSNSLLHIPLPQIQVDSVKE 660
Query: 661 NESDKGASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVNTCSLLQSDEPAEKNVSWKD 720
NESD+GASESH+ERRYEDTGDFNGNTLSSGNKSLQGYEEV TCSLLQSDEPAE+NVS KD
Sbjct: 661 NESDQGASESHNERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEQNVSLKD 720
Query: 721 GVSDLQNSHDNVVEIPPVDANGASVPIKDTETFRDHVVMVPCVPYVGETDGYLEQQLKSA 780
GVSDLQNSHDNVVEIPPVD NG SVP KDTETFRDHV+M PYVGETDGYLEQQLKS+
Sbjct: 721 GVSDLQNSHDNVVEIPPVDGNGTSVPRKDTETFRDHVIM---APYVGETDGYLEQQLKSS 780
Query: 781 GISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVEPE 840
GISQC SDSFEYCTDDFNGNHHY+STECQ AETSIELKTFS+LTKASSSPEDVRRVE E
Sbjct: 781 GISQCEGSDSFEYCTDDFNGNHHYISTECQTAETSIELKTFSSLTKASSSPEDVRRVELE 840
Query: 841 ----------LGIGIPGSLGLGSEQLQIINGSPTDKMLMQEFDTEKPVLEFQRLSFCEEG 900
LG GIPGSLGLG EQLQIINGSPTD +LM+EFDTEKPVLE QRLSFC EG
Sbjct: 841 LGSGFPGSLGLGSGIPGSLGLGGEQLQIINGSPTDNILMEEFDTEKPVLEIQRLSFCGEG 900
Query: 901 YQQSNVSTVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQSL 960
YQQSNVS VPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQSL
Sbjct: 901 YQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQSL 960
Query: 961 DPEENLQSGEDNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGDGKDMPALEGFLM 1020
D EENLQSGE N+LPVDT K EREEDKGKLTSCSLLTPLIQTSHY G KDMPALEGFLM
Sbjct: 961 DREENLQSGE-NELPVDTEKMEREEDKGKLTSCSLLTPLIQTSHYFGADKDMPALEGFLM 1020
Query: 1021 QSDAEQPCLSVRGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADL 1080
QSDAEQPC+SV GINLDTLELSKCMIERASILEKICKSACI+SPLSSSSES KLNKVADL
Sbjct: 1021 QSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACIDSPLSSSSESFKLNKVADL 1080
Query: 1081 YHSLSNGLLESVNLKSNFLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASD 1140
YHSLSNGLLESV+LKS LM DQNKLLKDGSNFLNGEVNCSPHGSFS CLKS GSHSASD
Sbjct: 1081 YHSLSNGLLESVDLKSKLLMKDQNKLLKDGSNFLNGEVNCSPHGSFSDCLKSTGSHSASD 1140
Query: 1141 VRRLFVSPFSKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNEFAKDMKSNKRV 1200
VRR F SPF KLLDRNSLNSSSSGKRSSPNIELPCISEEAES EE DNEFAKDMKSNKRV
Sbjct: 1141 VRRPFASPFGKLLDRNSLNSSSSGKRSSPNIELPCISEEAESIEEIDNEFAKDMKSNKRV 1200
Query: 1201 PLVDITENSNVLVTVSEAIMCADRLSLESLNTELSNTGTHNRTKENLANQKKSKRKYLNE 1260
PLVDITEN+NV VTV EA+M ADRLSLESLNTELSN GTHNRTKENLANQK SKRKYLNE
Sbjct: 1201 PLVDITENANVSVTVPEAVMFADRLSLESLNTELSNAGTHNRTKENLANQKNSKRKYLNE 1260
Query: 1261 AVDLDIFPGANRAKRVTRSSYNRFSRTDLSCKENFRKEGSRFSGKETKHKNIVSNITSFI 1320
AVDLDI PGAN AKRVTRSSYNRFSR+DLSCKENFRK GSRFSGKETKHKNIVSNITSFI
Sbjct: 1261 AVDLDILPGANGAKRVTRSSYNRFSRSDLSCKENFRK-GSRFSGKETKHKNIVSNITSFI 1320
Query: 1321 PLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKKALKLERARMEQENLR 1380
PLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKK+ALKLERARMEQENLR
Sbjct: 1321 PLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLR 1380
Query: 1381 QLELEKKKKEEERKKKEEEMKKREADKAAKKRQREEEERKEKERKRMRVEEVRRRLREHG 1440
Q+ELEKKKKEEERKKKEEEMKKREADKA KKRQREEEERKEKERKRMRVEEVRRRLREH
Sbjct: 1381 QIELEKKKKEEERKKKEEEMKKREADKAEKKRQREEEERKEKERKRMRVEEVRRRLREHS 1440
Query: 1441 GKSRSDKENKDVKPQANEQKPPDRKACKDVTNKLDKENGHDKFDKLSVTKSKSTTSDARR 1500
GK RSDKENKD KPQANEQKP RKACKDVTNKLDKENGH+KFDKLSVT+SKS+TSDA R
Sbjct: 1441 GKLRSDKENKDAKPQANEQKPRCRKACKDVTNKLDKENGHEKFDKLSVTESKSSTSDAGR 1500
Query: 1501 KKFVVENSQPTSVEFLDPEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRK 1560
+ F+VENSQPTSV+FL+ EALE GMES ISETSER+SYQISPYKASDDEDEEDE+DGIR
Sbjct: 1501 ENFLVENSQPTSVDFLEAEALEIGMESGISETSERQSYQISPYKASDDEDEEDEEDGIRN 1560
Query: 1561 NKFVPSWASKDRVAALFASQQKLNPEIIFPPKSFCDIEQ 1589
NKFVPSWASKDRVAALFASQQKLNPEIIFPPKSFCDIEQ
Sbjct: 1561 NKFVPSWASKDRVAALFASQQKLNPEIIFPPKSFCDIEQ 1594
BLAST of Chy1G011410 vs. ExPASy TrEMBL
Match:
A0A1S3CIN7 (uncharacterized protein LOC103501253 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501253 PE=3 SV=1)
HSP 1 Score: 2686.8 bits (6963), Expect = 0.0e+00
Identity = 1440/1601 (89.94%), Postives = 1484/1601 (92.69%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQAKQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQA+QQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQARQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
Query: 61 E--EVNKGFISGVEFPRSPLDTHRSSLNEAFVEDSGEELEHRSTEEAGSLNDDFDAANRP 120
E EVNK FISGVEFPRSPLDTHRSSLNEAFV DSGEELEHRS EE GSLNDDFDA NRP
Sbjct: 61 EVAEVNKSFISGVEFPRSPLDTHRSSLNEAFVADSGEELEHRSNEETGSLNDDFDAGNRP 120
Query: 121 AISPQCDISSAGVLNCAPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKALEL 180
A+SPQCDI SAGVLNCAPCIEMTPVSPHGRG IVS+NYRDPTLSLARLHRSKSRQKALEL
Sbjct: 121 AVSPQCDIRSAGVLNCAPCIEMTPVSPHGRGAIVSENYRDPTLSLARLHRSKSRQKALEL 180
Query: 181 RNSVKSTRCQSRCENKSDSLAGGIVGSAIGLLQADHEDESGLAKASSSCNGIGSLEEESN 240
RNSVKSTRCQSRCENKSDS+AG IVGSAIGLLQADHEDESGLAKASSSC GIGSLEEE+N
Sbjct: 181 RNSVKSTRCQSRCENKSDSIAGRIVGSAIGLLQADHEDESGLAKASSSCRGIGSLEEETN 240
Query: 241 VGCEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNISSKNEELCIAGGSTQNSYQVNEQF 300
VGCEQK SSIGSDKVGVVVSPGLQSRFIDV+NSLNISSKNEELCIAGGSTQNSYQVNEQF
Sbjct: 241 VGCEQKRSSIGSDKVGVVVSPGLQSRFIDVENSLNISSKNEELCIAGGSTQNSYQVNEQF 300
Query: 301 DSPRPSSGKIEEESAYCRSQEYSSDKPEKCRLQCSSLDANETSCISPEDGRAGPIGGSKF 360
DSPRPSSGKIEE S YCRSQEYSSDKPEKCRLQCSSLDAN+TSCISP DGRAG IGG KF
Sbjct: 301 DSPRPSSGKIEEGSTYCRSQEYSSDKPEKCRLQCSSLDANKTSCISPVDGRAGTIGGPKF 360
Query: 361 HSDQVDEQLDLPKPSSDNVECNEKAVLGHCRSHDYDLDNALQSESQQRSQEMDDSSRIDA 420
HSDQVDEQLDLPKPSSDNVECNE+AVLGHCRSHDYDLDNALQS SQQ SQE+DDSS IDA
Sbjct: 361 HSDQVDEQLDLPKPSSDNVECNEEAVLGHCRSHDYDLDNALQSRSQQSSQEVDDSSIIDA 420
Query: 421 SDGRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAQQFGSQYSSQDADNSSYVD-EVG 480
DGRLLDLYNPSSGKVECC ETI GHC S+ECNFEIAQQ GSQYS QD D+SSYVD EVG
Sbjct: 421 CDGRLLDLYNPSSGKVECCGETILGHCWSQECNFEIAQQSGSQYSPQDVDDSSYVDSEVG 480
Query: 481 GSCPIGSSKVHPHEVKEQLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKSGMQHNSL 540
GSCPIGSS VHP EVKEQLDLSK+S NIECCEEKILG LS+Q+YKL+NPQKSGMQHNSL
Sbjct: 481 GSCPIGSSNVHPREVKEQLDLSKTSSGNIECCEEKILGGLSSQDYKLDNPQKSGMQHNSL 540
Query: 541 DGDNSSCFSSVNGTFCPVGSSKQHSDQVIERLELFRPSSVNSECHEEELEDCRTQDCNFD 600
D DNSSCFSSVNGTFC VGSSKQHSD V E LELFRPSSVNSECHEEELEDCRTQDCNF+
Sbjct: 541 DADNSSCFSSVNGTFCAVGSSKQHSDLVSEPLELFRPSSVNSECHEEELEDCRTQDCNFN 600
Query: 601 NNAEQFDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSENEKCNSLLHIPLPQIQVDSV 660
NNA Q V K FSSPI EVRE TSDKK SSF+DDKRD SE EK NSLLHIPLPQIQVDSV
Sbjct: 601 NNAVQSGVGKNFSSPIMEVREKTSDKKSSSFIDDKRDASEKEKSNSLLHIPLPQIQVDSV 660
Query: 661 KENESDKGASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVNTCSLLQSDEPAEKNVSW 720
KENESD+GASESH+ERRYEDTGDFNGNTLSSGNKSLQGYEEV TCSLLQSDEPAE+NVS
Sbjct: 661 KENESDQGASESHNERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEQNVSL 720
Query: 721 KDGVSDLQNSHDNVVEIPPVDANGASVPIKDTETFRDHVVMVPCVPYVGETDGYLEQQLK 780
KDGVSDLQNSHDNVVEIPPVD NG SVP KDTETFRDHV+M PYVGETDGYLEQQLK
Sbjct: 721 KDGVSDLQNSHDNVVEIPPVDGNGTSVPRKDTETFRDHVIM---APYVGETDGYLEQQLK 780
Query: 781 SAGISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVE 840
S+GISQC SDSFEYCTDDFNGNHHY+STECQ AETSIELKTFS+LTKASSSPEDVRRVE
Sbjct: 781 SSGISQCEGSDSFEYCTDDFNGNHHYISTECQTAETSIELKTFSSLTKASSSPEDVRRVE 840
Query: 841 PE----------LGIGIPGSLGLGSEQLQIINGSPTDKMLMQEFDTEKPVLEFQRLSFCE 900
E LG GIPGSLGLG EQLQIINGSPTD +LM+EFDTEKPVLE QRLSFC
Sbjct: 841 LELGSGFPGSLGLGSGIPGSLGLGGEQLQIINGSPTDNILMEEFDTEKPVLEIQRLSFCG 900
Query: 901 EGYQQSNVSTVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQ 960
EGYQQSNVS VPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQ
Sbjct: 901 EGYQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQ 960
Query: 961 SLDPEENLQSGEDNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGDGKDMPALEGF 1020
SLD EENLQSGE N+LPVDT K EREEDKGKLTSCSLLTPLIQTSHY G KDMPALEGF
Sbjct: 961 SLDREENLQSGE-NELPVDTEKMEREEDKGKLTSCSLLTPLIQTSHYFGADKDMPALEGF 1020
Query: 1021 LMQSDAEQPCLSVRGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVA 1080
LMQSDAEQPC+SV GINLDTLELSKCMIERASILEKICKSACI+SPLSSSSES KLNKVA
Sbjct: 1021 LMQSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACIDSPLSSSSESFKLNKVA 1080
Query: 1081 DLYHSLSNGLLESVNLKSNFLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSA 1140
DLYHSLSNGLLESV+LKS LM DQNKLLKDGSNFLNGEVNCSPHGSFS CLKS GSHSA
Sbjct: 1081 DLYHSLSNGLLESVDLKSKLLMKDQNKLLKDGSNFLNGEVNCSPHGSFSDCLKSTGSHSA 1140
Query: 1141 SDVRRLFVSPFSKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNEFAKDMKSNK 1200
SDVRR F SPF KLLDRNSLNSSSSGKRSSPNIELPCISEEAES EE DNEFAKDMKSNK
Sbjct: 1141 SDVRRPFASPFGKLLDRNSLNSSSSGKRSSPNIELPCISEEAESIEEIDNEFAKDMKSNK 1200
Query: 1201 RVPLVDITENSNVLVTVSEAIMCADRLSLESLNTELSNTGTHNRTKENLANQKKSKRKYL 1260
RVPLVDITEN+NV VTV EA+M ADRLSLESLNTELSN GTHNRTKENLANQK SKRKYL
Sbjct: 1201 RVPLVDITENANVSVTVPEAVMFADRLSLESLNTELSNAGTHNRTKENLANQKNSKRKYL 1260
Query: 1261 NEAVDLDIFPGANRAKRVTRSSYNRFSRTDLSCKENFRKEGSRFSGKETKHKNIVSNITS 1320
NEAVDLDI PGAN AKRVTRSSYNRFSR+DLSCKENFRK GSRFSGKETKHKNIVSNITS
Sbjct: 1261 NEAVDLDILPGANGAKRVTRSSYNRFSRSDLSCKENFRK-GSRFSGKETKHKNIVSNITS 1320
Query: 1321 FIPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKKALKLERARMEQEN 1380
FIPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKK+ALKLERARMEQEN
Sbjct: 1321 FIPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQEN 1380
Query: 1381 LRQLELEKKKKEEERKKKEEEMKKREADKAAKKRQREEEERKEKERKRMRVEEVRRRLRE 1440
LRQ+ELEKKKKEEERKKKEEEMKKREADKA KKRQREEEERKEKERKRMRVEEVRRRLRE
Sbjct: 1381 LRQIELEKKKKEEERKKKEEEMKKREADKAEKKRQREEEERKEKERKRMRVEEVRRRLRE 1440
Query: 1441 HGGKSRSDKENKDVKPQANEQKPPDRKACKDVTNKLDKENGHDKFDKLSVTKSKSTTSDA 1500
H GK RSDKENKD KPQANEQKP RKACKDVTNKLDKENGH+KFDKLSVT+SKS+TSDA
Sbjct: 1441 HSGKLRSDKENKDAKPQANEQKPRCRKACKDVTNKLDKENGHEKFDKLSVTESKSSTSDA 1500
Query: 1501 RRKKFVVENSQPTSVEFLDPEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGI 1560
R+ F+VENSQPTSV+FL+ EALE GMES ISETSER+SYQISPYKASDDEDEEDE+DGI
Sbjct: 1501 GRENFLVENSQPTSVDFLEAEALEIGMESGISETSERQSYQISPYKASDDEDEEDEEDGI 1560
Query: 1561 RKNKFVPSWASKDRVAALFASQQKLNPEIIFPPKSFCDIEQ 1589
R NKFVPSWASKDRVAALFASQQKLNPEIIFPPKSFCDIEQ
Sbjct: 1561 RNNKFVPSWASKDRVAALFASQQKLNPEIIFPPKSFCDIEQ 1596
BLAST of Chy1G011410 vs. ExPASy TrEMBL
Match:
A0A1S3CI78 (uncharacterized protein LOC103501253 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103501253 PE=4 SV=1)
HSP 1 Score: 2625.1 bits (6803), Expect = 0.0e+00
Identity = 1409/1570 (89.75%), Postives = 1453/1570 (92.55%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQAKQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQA+QQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQARQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
Query: 61 E--EVNKGFISGVEFPRSPLDTHRSSLNEAFVEDSGEELEHRSTEEAGSLNDDFDAANRP 120
E EVNK FISGVEFPRSPLDTHRSSLNEAFV DSGEELEHRS EE GSLNDDFDA NRP
Sbjct: 61 EVAEVNKSFISGVEFPRSPLDTHRSSLNEAFVADSGEELEHRSNEETGSLNDDFDAGNRP 120
Query: 121 AISPQCDISSAGVLNCAPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKALEL 180
A+SPQCDI SAGVLNCAPCIEMTPVSPHGRG IVS+NYRDPTLSLARLHRSKSRQKALEL
Sbjct: 121 AVSPQCDIRSAGVLNCAPCIEMTPVSPHGRGAIVSENYRDPTLSLARLHRSKSRQKALEL 180
Query: 181 RNSVKSTRCQSRCENKSDSLAGGIVGSAIGLLQADHEDESGLAKASSSCNGIGSLEEESN 240
RNSVKSTRCQSRCENKSDS+AG IVGSAIGLLQADHEDESGLAKASSSC GIGSLEEE+N
Sbjct: 181 RNSVKSTRCQSRCENKSDSIAGRIVGSAIGLLQADHEDESGLAKASSSCRGIGSLEEETN 240
Query: 241 VGCEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNISSKNEELCIAGGSTQNSYQVNEQF 300
VGCEQK SSIGSDKVGVVVSPGLQSRFIDV+NSLNISSKNEELCIAGGSTQNSYQVNEQF
Sbjct: 241 VGCEQKRSSIGSDKVGVVVSPGLQSRFIDVENSLNISSKNEELCIAGGSTQNSYQVNEQF 300
Query: 301 DSPRPSSGKIEEESAYCRSQEYSSDKPEKCRLQCSSLDANETSCISPEDGRAGPIGGSKF 360
DSPRPSSGKIEE S YCRSQEYSSDKPEKCRLQCSSLDAN+TSCISP DGRAG IGG KF
Sbjct: 301 DSPRPSSGKIEEGSTYCRSQEYSSDKPEKCRLQCSSLDANKTSCISPVDGRAGTIGGPKF 360
Query: 361 HSDQVDEQLDLPKPSSDNVECNEKAVLGHCRSHDYDLDNALQSESQQRSQEMDDSSRIDA 420
HSDQVDEQLDLPKPSSDNVECNE+AVLGHCRSHDYDLDNALQS SQQ SQE+DDSS IDA
Sbjct: 361 HSDQVDEQLDLPKPSSDNVECNEEAVLGHCRSHDYDLDNALQSRSQQSSQEVDDSSIIDA 420
Query: 421 SDGRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAQQFGSQYSSQDADNSSYVD-EVG 480
DGRLLDLYNPSSGKVECC ETI GHC S+ECNFEIAQQ GSQYS QD D+SSYVD EVG
Sbjct: 421 CDGRLLDLYNPSSGKVECCGETILGHCWSQECNFEIAQQSGSQYSPQDVDDSSYVDSEVG 480
Query: 481 GSCPIGSSKVHPHEVKEQLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKSGMQHNSL 540
GSCPIGSS VHP EVKEQLDLSK+S NIECCEEKILG LS+Q+YKL+NPQKSGMQHNSL
Sbjct: 481 GSCPIGSSNVHPREVKEQLDLSKTSSGNIECCEEKILGGLSSQDYKLDNPQKSGMQHNSL 540
Query: 541 DGDNSSCFSSVNGTFCPVGSSKQHSDQVIERLELFRPSSVNSECHEEELEDCRTQDCNFD 600
D DNSSCFSSVNGTFC VGSSKQHSD V E LELFRPSSVNSECHEEELEDCRTQDCNF+
Sbjct: 541 DADNSSCFSSVNGTFCAVGSSKQHSDLVSEPLELFRPSSVNSECHEEELEDCRTQDCNFN 600
Query: 601 NNAEQFDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSENEKCNSLLHIPLPQIQVDSV 660
NNA Q V K FSSPI EVRE TSDKK SSF+DDKRD SE EK NSLLHIPLPQIQVDSV
Sbjct: 601 NNAVQSGVGKNFSSPIMEVREKTSDKKSSSFIDDKRDASEKEKSNSLLHIPLPQIQVDSV 660
Query: 661 KENESDKGASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVNTCSLLQSDEPAEKNVSW 720
KENESD+GASESH+ERRYEDTGDFNGNTLSSGNKSLQGYEEV TCSLLQSDEPAE+NVS
Sbjct: 661 KENESDQGASESHNERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEQNVSL 720
Query: 721 KDGVSDLQNSHDNVVEIPPVDANGASVPIKDTETFRDHVVMVPCVPYVGETDGYLEQQLK 780
KDGVSDLQNSHDNVVEIPPVD NG SVP KDTETFRDHV+M PYVGETDGYLEQQLK
Sbjct: 721 KDGVSDLQNSHDNVVEIPPVDGNGTSVPRKDTETFRDHVIM---APYVGETDGYLEQQLK 780
Query: 781 SAGISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVE 840
S+GISQC SDSFEYCTDDFNGNHHY+STECQ AETSIELKTFS+LTKASSSPEDVRRVE
Sbjct: 781 SSGISQCEGSDSFEYCTDDFNGNHHYISTECQTAETSIELKTFSSLTKASSSPEDVRRVE 840
Query: 841 PE----------LGIGIPGSLGLGSEQLQIINGSPTDKMLMQEFDTEKPVLEFQRLSFCE 900
E LG GIPGSLGLG EQLQIINGSPTD +LM+EFDTEKPVLE QRLSFC
Sbjct: 841 LELGSGFPGSLGLGSGIPGSLGLGGEQLQIINGSPTDNILMEEFDTEKPVLEIQRLSFCG 900
Query: 901 EGYQQSNVSTVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQ 960
EGYQQSNVS VPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQ
Sbjct: 901 EGYQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQ 960
Query: 961 SLDPEENLQSGEDNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGDGKDMPALEGF 1020
SLD EENLQSGE N+LPVDT K EREEDKGKLTSCSLLTPLIQTSHY G KDMPALEGF
Sbjct: 961 SLDREENLQSGE-NELPVDTEKMEREEDKGKLTSCSLLTPLIQTSHYFGADKDMPALEGF 1020
Query: 1021 LMQSDAEQPCLSVRGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVA 1080
LMQSDAEQPC+SV GINLDTLELSKCMIERASILEKICKSACI+SPLSSSSES KLNKVA
Sbjct: 1021 LMQSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACIDSPLSSSSESFKLNKVA 1080
Query: 1081 DLYHSLSNGLLESVNLKSNFLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSA 1140
DLYHSLSNGLLESV+LKS LM DQNKLLKDGSNFLNGEVNCSPHGSFS CLKS GSHSA
Sbjct: 1081 DLYHSLSNGLLESVDLKSKLLMKDQNKLLKDGSNFLNGEVNCSPHGSFSDCLKSTGSHSA 1140
Query: 1141 SDVRRLFVSPFSKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNEFAKDMKSNK 1200
SDVRR F SPF KLLDRNSLNSSSSGKRSSPNIELPCISEEAES EE DNEFAKDMKSNK
Sbjct: 1141 SDVRRPFASPFGKLLDRNSLNSSSSGKRSSPNIELPCISEEAESIEEIDNEFAKDMKSNK 1200
Query: 1201 RVPLVDITENSNVLVTVSEAIMCADRLSLESLNTELSNTGTHNRTKENLANQKKSKRKYL 1260
RVPLVDITEN+NV VTV EA+M ADRLSLESLNTELSN GTHNRTKENLANQK SKRKYL
Sbjct: 1201 RVPLVDITENANVSVTVPEAVMFADRLSLESLNTELSNAGTHNRTKENLANQKNSKRKYL 1260
Query: 1261 NEAVDLDIFPGANRAKRVTRSSYNRFSRTDLSCKENFRKEGSRFSGKETKHKNIVSNITS 1320
NEAVDLDI PGAN AKRVTRSSYNRFSR+DLSCKENFRK GSRFSGKETKHKNIVSNITS
Sbjct: 1261 NEAVDLDILPGANGAKRVTRSSYNRFSRSDLSCKENFRK-GSRFSGKETKHKNIVSNITS 1320
Query: 1321 FIPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKKALKLERARMEQEN 1380
FIPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKK+ALKLERARMEQEN
Sbjct: 1321 FIPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQEN 1380
Query: 1381 LRQLELEKKKKEEERKKKEEEMKKREADKAAKKRQREEEERKEKERKRMRVEEVRRRLRE 1440
LRQ+ELEKKKKEEERKKKEEEMKKREADKA KKRQREEEERKEKERKRMRVEEVRRRLRE
Sbjct: 1381 LRQIELEKKKKEEERKKKEEEMKKREADKAEKKRQREEEERKEKERKRMRVEEVRRRLRE 1440
Query: 1441 HGGKSRSDKENKDVKPQANEQKPPDRKACKDVTNKLDKENGHDKFDKLSVTKSKSTTSDA 1500
H GK RSDKENKD KPQANEQKP RKACKDVTNKLDKENGH+KFDKLSVT+SKS+TSDA
Sbjct: 1441 HSGKLRSDKENKDAKPQANEQKPRCRKACKDVTNKLDKENGHEKFDKLSVTESKSSTSDA 1500
Query: 1501 RRKKFVVENSQPTSVEFLDPEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGI 1558
R+ F+VENSQPTSV+FL+ EALE GMES ISETSER+SYQISPYKASDDEDEEDE+DGI
Sbjct: 1501 GRENFLVENSQPTSVDFLEAEALEIGMESGISETSERQSYQISPYKASDDEDEEDEEDGI 1560
BLAST of Chy1G011410 vs. ExPASy TrEMBL
Match:
A0A5D3C921 (Titin-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold265G001060 PE=4 SV=1)
HSP 1 Score: 2546.5 bits (6599), Expect = 0.0e+00
Identity = 1362/1519 (89.66%), Postives = 1407/1519 (92.63%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQAKQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
MSAMEKLFVQIFERKKWII+QA+QQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIEQARQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
Query: 61 EEVNKGFISGVEFPRSPLDTHRSSLNEAFVEDSGEELEHRSTEEAGSLNDDFDAANRPAI 120
EEVNK FISGVEFPRSPLDTHRSSLNEAFV DSGEELEHRS EE GSLNDDFDA NRPA+
Sbjct: 61 EEVNKSFISGVEFPRSPLDTHRSSLNEAFVADSGEELEHRSNEETGSLNDDFDAGNRPAV 120
Query: 121 SPQCDISSAGVLNCAPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKALELRN 180
SPQCDI SAGVLNCAPCIEMTPVSPHGRG IVS+NYRDPTLSLARLHRSKSRQKALELRN
Sbjct: 121 SPQCDIRSAGVLNCAPCIEMTPVSPHGRGAIVSENYRDPTLSLARLHRSKSRQKALELRN 180
Query: 181 SVKSTRCQSRCENKSDSLAGGIVGSAIGLLQADHEDESGLAKASSSCNGIGSLEEESNVG 240
SVKSTRCQSRCENKSDS+AG IVGSAIGLLQADHEDESGLAKASSSC GIGSLEEE+NVG
Sbjct: 181 SVKSTRCQSRCENKSDSIAGRIVGSAIGLLQADHEDESGLAKASSSCRGIGSLEEETNVG 240
Query: 241 CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNISSKNEELCIAGGSTQNSYQVNEQFDS 300
CEQK SSIGSDKVGVVVSPGLQSRFIDV+NSLNISSKNEELCIAGGSTQNSYQVNEQFDS
Sbjct: 241 CEQKRSSIGSDKVGVVVSPGLQSRFIDVENSLNISSKNEELCIAGGSTQNSYQVNEQFDS 300
Query: 301 PRPSSGKIEEESAYCRSQEYSSDKPEKCRLQCSSLDANETSCISPEDGRAGPIGGSKFHS 360
PRPSSGKIEE S YCRSQEYSSDKPEKCRLQCSSLDAN+TSCISP DGRAG IGG KFHS
Sbjct: 301 PRPSSGKIEEGSTYCRSQEYSSDKPEKCRLQCSSLDANKTSCISPVDGRAGTIGGPKFHS 360
Query: 361 DQVDEQLDLPKPSSDNVECNEKAVLGHCRSHDYDLDNALQSESQQRSQEMDDSSRIDASD 420
DQVDEQLDLPKPSSDNVECNE+AVLGHCRSHDYDLDNALQS SQQ SQE+DDSS IDA D
Sbjct: 361 DQVDEQLDLPKPSSDNVECNEEAVLGHCRSHDYDLDNALQSRSQQSSQEVDDSSIIDACD 420
Query: 421 GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAQQFGSQYSSQDADNSSYVD-EVGGS 480
GRLLDLYNPSSGKVECC ETI GHC S+ECNFEIAQQ GSQYS QD D+SSYVD EVGGS
Sbjct: 421 GRLLDLYNPSSGKVECCGETILGHCWSQECNFEIAQQSGSQYSPQDVDDSSYVDSEVGGS 480
Query: 481 CPIGSSKVHPHEVKEQLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKSGMQHNSLDG 540
CPIGSS VHP EVKEQLDLSK+S NIEC EEKILG LS+Q+YKL+NPQKSGMQHNSLD
Sbjct: 481 CPIGSSNVHPREVKEQLDLSKTSSGNIECGEEKILGGLSSQDYKLDNPQKSGMQHNSLDA 540
Query: 541 DNSSCFSSVNGTFCPVGSSKQHSDQVIERLELFRPSSVNSECHEEELEDCRTQDCNFDNN 600
DNSSCFSSVNGTFC VGSSKQHSD V E LELFRPSSVNSECHEEELEDCRTQDCNFDNN
Sbjct: 541 DNSSCFSSVNGTFCAVGSSKQHSDLVSEPLELFRPSSVNSECHEEELEDCRTQDCNFDNN 600
Query: 601 AEQFDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSENEKCNSLLHIPLPQIQVDSVKE 660
A Q V K FSSPI EVRE TSDKKPSSF+DDKRD SE EK NSLLHIPLPQIQVDSVKE
Sbjct: 601 AVQSGVGKNFSSPIMEVREKTSDKKPSSFIDDKRDASEKEKSNSLLHIPLPQIQVDSVKE 660
Query: 661 NESDKGASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVNTCSLLQSDEPAEKNVSWKD 720
NESD+GASESH+ERRYEDTGDFNGNTLSSGNKSLQGYEEV TCSLLQSDEPAE+NVS KD
Sbjct: 661 NESDQGASESHNERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEQNVSLKD 720
Query: 721 GVSDLQNSHDNVVEIPPVDANGASVPIKDTETFRDHVVMVPCVPYVGETDGYLEQQLKSA 780
GVSDLQNSHDNVVEIPPVD NG SVP KDTETF+DHV+M PYVGETDGYLEQQLKS+
Sbjct: 721 GVSDLQNSHDNVVEIPPVDGNGTSVPRKDTETFKDHVIM---APYVGETDGYLEQQLKSS 780
Query: 781 GISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVEPE 840
GISQC SDSF+YCTDDFNGNHHY+STECQ AETSIELKTFS+LTKASSSPEDVRRVEPE
Sbjct: 781 GISQCEGSDSFQYCTDDFNGNHHYISTECQTAETSIELKTFSSLTKASSSPEDVRRVEPE 840
Query: 841 ----------LGIGIPGSLGLGSEQLQIINGSPTDKMLMQEFDTEKPVLEFQRLSFCEEG 900
LG GIPGSLGLG EQLQIINGSPTD +LM+EFDTEKPVLE QRLSFC EG
Sbjct: 841 LGSGFPGSLGLGSGIPGSLGLGGEQLQIINGSPTDNILMEEFDTEKPVLEIQRLSFCGEG 900
Query: 901 YQQSNVSTVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQSL 960
YQQSNVS VPIEMLLLEKEAHS QLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQSL
Sbjct: 901 YQQSNVSIVPIEMLLLEKEAHSKQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQSL 960
Query: 961 DPEENLQSGEDNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGDGKDMPALEGFLM 1020
D EENLQSGE N+LPVDT K EREEDKGKLTSCSLLTPLIQTSHY G KDMPALEGFLM
Sbjct: 961 DREENLQSGE-NELPVDTEKMEREEDKGKLTSCSLLTPLIQTSHYFGADKDMPALEGFLM 1020
Query: 1021 QSDAEQPCLSVRGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADL 1080
QSDAEQPC+SV GINLDTLELSKCMIERASILEKICKSACI+SPLSSSSES KLNKVADL
Sbjct: 1021 QSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACIDSPLSSSSESFKLNKVADL 1080
Query: 1081 YHSLSNGLLESVNLKSNFLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASD 1140
YHSLSNGLLESV+LKS LM DQNKLLKDGSNFLNGEVNCSPHGSFS CLKS GSHSASD
Sbjct: 1081 YHSLSNGLLESVDLKSKLLMKDQNKLLKDGSNFLNGEVNCSPHGSFSDCLKSTGSHSASD 1140
Query: 1141 VRRLFVSPFSKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNEFAKDMKSNKRV 1200
VRR F SPF KLLDRNSLNSSSSGKRSSPNIELPCISEEAES EE DNEFAKDMKSNKRV
Sbjct: 1141 VRRPFASPFGKLLDRNSLNSSSSGKRSSPNIELPCISEEAESIEEIDNEFAKDMKSNKRV 1200
Query: 1201 PLVDITENSNVLVTVSEAIMCADRLSLESLNTELSNTGTHNRTKENLANQKKSKRKYLNE 1260
PLVD+TEN+NV VTV EA+M ADRLSLESLNTELSN GTHNRTKENLANQK SKRKYLNE
Sbjct: 1201 PLVDVTENANVSVTVPEAVMFADRLSLESLNTELSNAGTHNRTKENLANQKNSKRKYLNE 1260
Query: 1261 AVDLDIFPGANRAKRVTRSSYNRFSRTDLSCKENFRKEGSRFSGKETKHKNIVSNITSFI 1320
AVDLDI PGAN AKRVTRSSYNRFSR+DLSCKENFRK GSRFSGKETKHKNIVSNITSFI
Sbjct: 1261 AVDLDILPGANGAKRVTRSSYNRFSRSDLSCKENFRK-GSRFSGKETKHKNIVSNITSFI 1320
Query: 1321 PLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKKALKLERARMEQENLR 1380
PLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKK+ALKLERARMEQ+NLR
Sbjct: 1321 PLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQDNLR 1380
Query: 1381 QLELEKKKKEEERKKKEEEMKKREADKAAKKRQREEEERKEKERKRMRVEEVRRRLREHG 1440
Q+ELEKKKKEEERKKKEEEMKKREADKA KKRQREEEERKEKERKRMRVEEVRRRLREH
Sbjct: 1381 QIELEKKKKEEERKKKEEEMKKREADKAEKKRQREEEERKEKERKRMRVEEVRRRLREHS 1440
Query: 1441 GKSRSDKENKDVKPQANEQKPPDRKACKDVTNKLDKENGHDKFDKLSVTKSKSTTSDARR 1500
GK RSDKENKD KPQANEQKP RKACKDVTNKLDKENGH+KFDKLSVT+SKS+TSDA R
Sbjct: 1441 GKLRSDKENKDAKPQANEQKPRCRKACKDVTNKLDKENGHEKFDKLSVTESKSSTSDAGR 1500
Query: 1501 KKFVVENSQPTSVEFLDPE 1509
+ FVVENSQPTSV+FL+ E
Sbjct: 1501 ENFVVENSQPTSVDFLEAE 1514
BLAST of Chy1G011410 vs. NCBI nr
Match:
XP_004148933.1 (uncharacterized protein LOC101214907 isoform X2 [Cucumis sativus] >KGN44031.1 hypothetical protein Csa_011858 [Cucumis sativus])
HSP 1 Score: 2844 bits (7373), Expect = 0.0
Identity = 1501/1588 (94.52%), Postives = 1533/1588 (96.54%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQAKQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQ KQQTDLFDQHLASKLIIDGIVPPPWLHS+FLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF 60
Query: 61 EEVNKGFISGVEFPRSPLDTHRSSLNEAFVEDSGEELEHRSTEEAGSLNDDFDAANRPAI 120
+EVNK FISGVEFPRSPLD HRSSLNEAFV DSGEE EHRSTEEAGSLNDDFDA N PAI
Sbjct: 61 QEVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNPAI 120
Query: 121 SPQCDISSAGVLNCAPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKALELRN 180
SPQCDIS+AGVLNC+PCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKA ELRN
Sbjct: 121 SPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFELRN 180
Query: 181 SVKSTRCQSRCENKSDSLAGGIVGSAIGLLQADHEDESGLAKASSSCNGIGSLEEESNVG 240
SVKSTRCQSRCENKSDS+AGGIVGS IG LQ+DHEDESGLAKASSSCNGIGSLEEESNVG
Sbjct: 181 SVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESNVG 240
Query: 241 CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNISSKNEELCIAGGSTQNSYQVNEQFDS 300
CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNI SKNEELCIAGGSTQNSY+VNEQFDS
Sbjct: 241 CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQFDS 300
Query: 301 PRPSSGKIEEESAYCRSQEYSSDKPEKCRLQCSSLDANETSCISPEDGRAGPIGGSKFHS 360
PRPSSGKIEE SAYCRSQEYSSDKPEKCRLQ SSLDANETSCISPEDGRAGPIGGSKFHS
Sbjct: 301 PRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKFHS 360
Query: 361 DQVDEQLDLPKPSSDNVECNEKAVLGHCRSHDYDLDNALQSESQQRSQEMDDSSRIDASD 420
DQVDEQLDLPKPSSDNVECNEKAVLG CRSHDYDLD ALQSESQQRS E+DDSS IDASD
Sbjct: 361 DQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDASD 420
Query: 421 GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAQQFGSQYSSQDADNSSYVDEVGGSC 480
GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIA Q GS+YSSQD DNSSYVDEVGGSC
Sbjct: 421 GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVDEVGGSC 480
Query: 481 PIGSSKVHPHEVKEQLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKSGMQHNSLDGD 540
PIGSSKVHPHEVKE+LDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQK GMQHNSLDGD
Sbjct: 481 PIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSLDGD 540
Query: 541 NSSCFSSVNGTFCPVGSSKQHSDQVIERLELFRPSSVNSECHEEELEDCRTQDCNFDNNA 600
NSSCFSSV+GTFC VGSSKQHSDQ IERLELFRPSSVNSECHEEELEDCRTQDCNFDN A
Sbjct: 541 NSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFDN-A 600
Query: 601 EQFDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSENEKCNSLLHIPLPQIQVDSVKEN 660
EQ DVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSE EKCNSLLHIPLPQIQVDSVKEN
Sbjct: 601 EQSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSVKEN 660
Query: 661 ESDKGASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVNTCSLLQSDEPAEKNVSWKDG 720
ESDK ASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEV TCSLLQSDEPAEKNVS KDG
Sbjct: 661 ESDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSLKDG 720
Query: 721 VSDLQNSHDNVVEIPPVDANGASVPIKDTETFRDHVVMVPCVPYVGETDGYLEQQLKSAG 780
VSDLQNSHDNVVEIPPVDANGASVPI+DTETFRDHVVMVPCVP+VGETDGYLEQQLKSAG
Sbjct: 721 VSDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLKSAG 780
Query: 781 ISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVEPEL 840
ISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRV+PEL
Sbjct: 781 ISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQPEL 840
Query: 841 GIGIPGSLGLGSEQLQIINGSPTDKMLMQEFDTEKPVLEFQRLSFCEEGYQQSNVSTVPI 900
GIGIP SL LGSEQLQIINGSPTDK+LMQEFDTEKPVLEFQRLSFCEEGYQQSNVS VPI
Sbjct: 841 GIGIPESLDLGSEQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCEEGYQQSNVSIVPI 900
Query: 901 EMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQSLDPEENLQSGED 960
EMLLLEKEAHSMQLSDSSPTL VKEDLSRFR+NNRGTLLQNVMLESQSLDPEENLQSG D
Sbjct: 901 EMLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQSLDPEENLQSG-D 960
Query: 961 NKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGDGKDMPALEGFLMQSDAEQPCLSV 1020
NKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLG KDMPALEGFLMQSDAEQPC+SV
Sbjct: 961 NKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFLMQSDAEQPCISV 1020
Query: 1021 RGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLES 1080
GINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLES
Sbjct: 1021 GGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLES 1080
Query: 1081 VNLKSNFLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRRLFVSPFSK 1140
V+LKSN LMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRR FVSPFSK
Sbjct: 1081 VDLKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRRPFVSPFSK 1140
Query: 1141 LLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNEFAKDMKSNKRVPLVDITENSNV 1200
LLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDN+FAKDMKSN RVPLVD+TEN+NV
Sbjct: 1141 LLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMRVPLVDVTENANV 1200
Query: 1201 LVTVSEAIMCADRLSLESLNTELSNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGAN 1260
V VSE +M ADRLSLESLNTE+ NTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGAN
Sbjct: 1201 PVAVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGAN 1260
Query: 1261 RAKRVTRSSYNRFSRTDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAATI 1320
AKRVTRSSY+RFSR+DLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAATI
Sbjct: 1261 GAKRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAATI 1320
Query: 1321 LKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKKALKLERARMEQENLRQLELEKKKKEE 1380
LKGKRDVKVKAIEAAEAAKRLAEKKENERQMKK+ALKLERARMEQENLRQLELEKKKKEE
Sbjct: 1321 LKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQLELEKKKKEE 1380
Query: 1381 ERKKKEEEMKKREADKAAKKRQREEEERKEKERKRMRVEEVRRRLREHGGKSRSDKENKD 1440
+RKKKEEEMKKR+ADKAAKKRQREEEERKEKERKRM VEEVRRRLREHGGK RSDKENKD
Sbjct: 1381 DRKKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREHGGKLRSDKENKD 1440
Query: 1441 VKPQANEQKPPDRKACKDVTNKLDKENGHDKFDKLSVTKSKSTTSDARRKKFVVENSQPT 1500
VKPQANEQKP DRKACKDVTNKLDKENGH+KFDKLSVTKSKSTTSDARR+ FVVEN+QPT
Sbjct: 1441 VKPQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDARRENFVVENAQPT 1500
Query: 1501 SVEFLDPEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWASKD 1560
V FL+ EALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWASKD
Sbjct: 1501 IVGFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWASKD 1560
Query: 1561 RVAALFASQQKLNPEIIFPPKSFCDIEQ 1588
VA LFASQQKLNPEIIFPPKSFCDIEQ
Sbjct: 1561 HVADLFASQQKLNPEIIFPPKSFCDIEQ 1586
BLAST of Chy1G011410 vs. NCBI nr
Match:
XP_011658937.1 (uncharacterized protein LOC101214907 isoform X1 [Cucumis sativus] >XP_011658938.1 uncharacterized protein LOC101214907 isoform X1 [Cucumis sativus] >XP_031744660.1 uncharacterized protein LOC101214907 isoform X1 [Cucumis sativus] >XP_031744661.1 uncharacterized protein LOC101214907 isoform X1 [Cucumis sativus])
HSP 1 Score: 2839 bits (7360), Expect = 0.0
Identity = 1501/1590 (94.40%), Postives = 1533/1590 (96.42%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQAKQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQ KQQTDLFDQHLASKLIIDGIVPPPWLHS+FLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF 60
Query: 61 E--EVNKGFISGVEFPRSPLDTHRSSLNEAFVEDSGEELEHRSTEEAGSLNDDFDAANRP 120
+ EVNK FISGVEFPRSPLD HRSSLNEAFV DSGEE EHRSTEEAGSLNDDFDA N P
Sbjct: 61 QVAEVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNP 120
Query: 121 AISPQCDISSAGVLNCAPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKALEL 180
AISPQCDIS+AGVLNC+PCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKA EL
Sbjct: 121 AISPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFEL 180
Query: 181 RNSVKSTRCQSRCENKSDSLAGGIVGSAIGLLQADHEDESGLAKASSSCNGIGSLEEESN 240
RNSVKSTRCQSRCENKSDS+AGGIVGS IG LQ+DHEDESGLAKASSSCNGIGSLEEESN
Sbjct: 181 RNSVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESN 240
Query: 241 VGCEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNISSKNEELCIAGGSTQNSYQVNEQF 300
VGCEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNI SKNEELCIAGGSTQNSY+VNEQF
Sbjct: 241 VGCEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQF 300
Query: 301 DSPRPSSGKIEEESAYCRSQEYSSDKPEKCRLQCSSLDANETSCISPEDGRAGPIGGSKF 360
DSPRPSSGKIEE SAYCRSQEYSSDKPEKCRLQ SSLDANETSCISPEDGRAGPIGGSKF
Sbjct: 301 DSPRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKF 360
Query: 361 HSDQVDEQLDLPKPSSDNVECNEKAVLGHCRSHDYDLDNALQSESQQRSQEMDDSSRIDA 420
HSDQVDEQLDLPKPSSDNVECNEKAVLG CRSHDYDLD ALQSESQQRS E+DDSS IDA
Sbjct: 361 HSDQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDA 420
Query: 421 SDGRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAQQFGSQYSSQDADNSSYVDEVGG 480
SDGRLLDLYNPSSGKVECCEETISGHCRSKECNFEIA Q GS+YSSQD DNSSYVDEVGG
Sbjct: 421 SDGRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVDEVGG 480
Query: 481 SCPIGSSKVHPHEVKEQLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKSGMQHNSLD 540
SCPIGSSKVHPHEVKE+LDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQK GMQHNSLD
Sbjct: 481 SCPIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSLD 540
Query: 541 GDNSSCFSSVNGTFCPVGSSKQHSDQVIERLELFRPSSVNSECHEEELEDCRTQDCNFDN 600
GDNSSCFSSV+GTFC VGSSKQHSDQ IERLELFRPSSVNSECHEEELEDCRTQDCNFDN
Sbjct: 541 GDNSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFDN 600
Query: 601 NAEQFDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSENEKCNSLLHIPLPQIQVDSVK 660
AEQ DVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSE EKCNSLLHIPLPQIQVDSVK
Sbjct: 601 -AEQSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSVK 660
Query: 661 ENESDKGASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVNTCSLLQSDEPAEKNVSWK 720
ENESDK ASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEV TCSLLQSDEPAEKNVS K
Sbjct: 661 ENESDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSLK 720
Query: 721 DGVSDLQNSHDNVVEIPPVDANGASVPIKDTETFRDHVVMVPCVPYVGETDGYLEQQLKS 780
DGVSDLQNSHDNVVEIPPVDANGASVPI+DTETFRDHVVMVPCVP+VGETDGYLEQQLKS
Sbjct: 721 DGVSDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLKS 780
Query: 781 AGISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVEP 840
AGISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRV+P
Sbjct: 781 AGISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQP 840
Query: 841 ELGIGIPGSLGLGSEQLQIINGSPTDKMLMQEFDTEKPVLEFQRLSFCEEGYQQSNVSTV 900
ELGIGIP SL LGSEQLQIINGSPTDK+LMQEFDTEKPVLEFQRLSFCEEGYQQSNVS V
Sbjct: 841 ELGIGIPESLDLGSEQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCEEGYQQSNVSIV 900
Query: 901 PIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQSLDPEENLQSG 960
PIEMLLLEKEAHSMQLSDSSPTL VKEDLSRFR+NNRGTLLQNVMLESQSLDPEENLQSG
Sbjct: 901 PIEMLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQSLDPEENLQSG 960
Query: 961 EDNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGDGKDMPALEGFLMQSDAEQPCL 1020
DNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLG KDMPALEGFLMQSDAEQPC+
Sbjct: 961 -DNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFLMQSDAEQPCI 1020
Query: 1021 SVRGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLL 1080
SV GINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLL
Sbjct: 1021 SVGGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLL 1080
Query: 1081 ESVNLKSNFLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRRLFVSPF 1140
ESV+LKSN LMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRR FVSPF
Sbjct: 1081 ESVDLKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRRPFVSPF 1140
Query: 1141 SKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNEFAKDMKSNKRVPLVDITENS 1200
SKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDN+FAKDMKSN RVPLVD+TEN+
Sbjct: 1141 SKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMRVPLVDVTENA 1200
Query: 1201 NVLVTVSEAIMCADRLSLESLNTELSNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPG 1260
NV V VSE +M ADRLSLESLNTE+ NTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPG
Sbjct: 1201 NVPVAVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPG 1260
Query: 1261 ANRAKRVTRSSYNRFSRTDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAA 1320
AN AKRVTRSSY+RFSR+DLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAA
Sbjct: 1261 ANGAKRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAA 1320
Query: 1321 TILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKKALKLERARMEQENLRQLELEKKKK 1380
TILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKK+ALKLERARMEQENLRQLELEKKKK
Sbjct: 1321 TILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQLELEKKKK 1380
Query: 1381 EEERKKKEEEMKKREADKAAKKRQREEEERKEKERKRMRVEEVRRRLREHGGKSRSDKEN 1440
EE+RKKKEEEMKKR+ADKAAKKRQREEEERKEKERKRM VEEVRRRLREHGGK RSDKEN
Sbjct: 1381 EEDRKKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREHGGKLRSDKEN 1440
Query: 1441 KDVKPQANEQKPPDRKACKDVTNKLDKENGHDKFDKLSVTKSKSTTSDARRKKFVVENSQ 1500
KDVKPQANEQKP DRKACKDVTNKLDKENGH+KFDKLSVTKSKSTTSDARR+ FVVEN+Q
Sbjct: 1441 KDVKPQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDARRENFVVENAQ 1500
Query: 1501 PTSVEFLDPEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWAS 1560
PT V FL+ EALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWAS
Sbjct: 1501 PTIVGFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWAS 1560
Query: 1561 KDRVAALFASQQKLNPEIIFPPKSFCDIEQ 1588
KD VA LFASQQKLNPEIIFPPKSFCDIEQ
Sbjct: 1561 KDHVADLFASQQKLNPEIIFPPKSFCDIEQ 1588
BLAST of Chy1G011410 vs. NCBI nr
Match:
XP_008463008.1 (PREDICTED: uncharacterized protein LOC103501253 isoform X2 [Cucumis melo])
HSP 1 Score: 2696 bits (6989), Expect = 0.0
Identity = 1440/1599 (90.06%), Postives = 1484/1599 (92.81%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQAKQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQA+QQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQARQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
Query: 61 EEVNKGFISGVEFPRSPLDTHRSSLNEAFVEDSGEELEHRSTEEAGSLNDDFDAANRPAI 120
EEVNK FISGVEFPRSPLDTHRSSLNEAFV DSGEELEHRS EE GSLNDDFDA NRPA+
Sbjct: 61 EEVNKSFISGVEFPRSPLDTHRSSLNEAFVADSGEELEHRSNEETGSLNDDFDAGNRPAV 120
Query: 121 SPQCDISSAGVLNCAPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKALELRN 180
SPQCDI SAGVLNCAPCIEMTPVSPHGRG IVS+NYRDPTLSLARLHRSKSRQKALELRN
Sbjct: 121 SPQCDIRSAGVLNCAPCIEMTPVSPHGRGAIVSENYRDPTLSLARLHRSKSRQKALELRN 180
Query: 181 SVKSTRCQSRCENKSDSLAGGIVGSAIGLLQADHEDESGLAKASSSCNGIGSLEEESNVG 240
SVKSTRCQSRCENKSDS+AG IVGSAIGLLQADHEDESGLAKASSSC GIGSLEEE+NVG
Sbjct: 181 SVKSTRCQSRCENKSDSIAGRIVGSAIGLLQADHEDESGLAKASSSCRGIGSLEEETNVG 240
Query: 241 CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNISSKNEELCIAGGSTQNSYQVNEQFDS 300
CEQK SSIGSDKVGVVVSPGLQSRFIDV+NSLNISSKNEELCIAGGSTQNSYQVNEQFDS
Sbjct: 241 CEQKRSSIGSDKVGVVVSPGLQSRFIDVENSLNISSKNEELCIAGGSTQNSYQVNEQFDS 300
Query: 301 PRPSSGKIEEESAYCRSQEYSSDKPEKCRLQCSSLDANETSCISPEDGRAGPIGGSKFHS 360
PRPSSGKIEE S YCRSQEYSSDKPEKCRLQCSSLDAN+TSCISP DGRAG IGG KFHS
Sbjct: 301 PRPSSGKIEEGSTYCRSQEYSSDKPEKCRLQCSSLDANKTSCISPVDGRAGTIGGPKFHS 360
Query: 361 DQVDEQLDLPKPSSDNVECNEKAVLGHCRSHDYDLDNALQSESQQRSQEMDDSSRIDASD 420
DQVDEQLDLPKPSSDNVECNE+AVLGHCRSHDYDLDNALQS SQQ SQE+DDSS IDA D
Sbjct: 361 DQVDEQLDLPKPSSDNVECNEEAVLGHCRSHDYDLDNALQSRSQQSSQEVDDSSIIDACD 420
Query: 421 GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAQQFGSQYSSQDADNSSYVD-EVGGS 480
GRLLDLYNPSSGKVECC ETI GHC S+ECNFEIAQQ GSQYS QD D+SSYVD EVGGS
Sbjct: 421 GRLLDLYNPSSGKVECCGETILGHCWSQECNFEIAQQSGSQYSPQDVDDSSYVDSEVGGS 480
Query: 481 CPIGSSKVHPHEVKEQLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKSGMQHNSLDG 540
CPIGSS VHP EVKEQLDLSK+S NIECCEEKILG LS+Q+YKL+NPQKSGMQHNSLD
Sbjct: 481 CPIGSSNVHPREVKEQLDLSKTSSGNIECCEEKILGGLSSQDYKLDNPQKSGMQHNSLDA 540
Query: 541 DNSSCFSSVNGTFCPVGSSKQHSDQVIERLELFRPSSVNSECHEEELEDCRTQDCNFDNN 600
DNSSCFSSVNGTFC VGSSKQHSD V E LELFRPSSVNSECHEEELEDCRTQDCNF+NN
Sbjct: 541 DNSSCFSSVNGTFCAVGSSKQHSDLVSEPLELFRPSSVNSECHEEELEDCRTQDCNFNNN 600
Query: 601 AEQFDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSENEKCNSLLHIPLPQIQVDSVKE 660
A Q V K FSSPI EVRE TSDKK SSF+DDKRD SE EK NSLLHIPLPQIQVDSVKE
Sbjct: 601 AVQSGVGKNFSSPIMEVREKTSDKKSSSFIDDKRDASEKEKSNSLLHIPLPQIQVDSVKE 660
Query: 661 NESDKGASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVNTCSLLQSDEPAEKNVSWKD 720
NESD+GASESH+ERRYEDTGDFNGNTLSSGNKSLQGYEEV TCSLLQSDEPAE+NVS KD
Sbjct: 661 NESDQGASESHNERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEQNVSLKD 720
Query: 721 GVSDLQNSHDNVVEIPPVDANGASVPIKDTETFRDHVVMVPCVPYVGETDGYLEQQLKSA 780
GVSDLQNSHDNVVEIPPVD NG SVP KDTETFRDHV+M P YVGETDGYLEQQLKS+
Sbjct: 721 GVSDLQNSHDNVVEIPPVDGNGTSVPRKDTETFRDHVIMAP---YVGETDGYLEQQLKSS 780
Query: 781 GISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVEPE 840
GISQC SDSFEYCTDDFNGNHHY+STECQ AETSIELKTFS+LTKASSSPEDVRRVE E
Sbjct: 781 GISQCEGSDSFEYCTDDFNGNHHYISTECQTAETSIELKTFSSLTKASSSPEDVRRVELE 840
Query: 841 LGIGIPGSLGLGS----------EQLQIINGSPTDKMLMQEFDTEKPVLEFQRLSFCEEG 900
LG G PGSLGLGS EQLQIINGSPTD +LM+EFDTEKPVLE QRLSFC EG
Sbjct: 841 LGSGFPGSLGLGSGIPGSLGLGGEQLQIINGSPTDNILMEEFDTEKPVLEIQRLSFCGEG 900
Query: 901 YQQSNVSTVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQSL 960
YQQSNVS VPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQSL
Sbjct: 901 YQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQSL 960
Query: 961 DPEENLQSGEDNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGDGKDMPALEGFLM 1020
D EENLQSGE N+LPVDT K EREEDKGKLTSCSLLTPLIQTSHY G KDMPALEGFLM
Sbjct: 961 DREENLQSGE-NELPVDTEKMEREEDKGKLTSCSLLTPLIQTSHYFGADKDMPALEGFLM 1020
Query: 1021 QSDAEQPCLSVRGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADL 1080
QSDAEQPC+SV GINLDTLELSKCMIERASILEKICKSACI+SPLSSSSES KLNKVADL
Sbjct: 1021 QSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACIDSPLSSSSESFKLNKVADL 1080
Query: 1081 YHSLSNGLLESVNLKSNFLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASD 1140
YHSLSNGLLESV+LKS LM DQNKLLKDGSNFLNGEVNCSPHGSFS CLKS GSHSASD
Sbjct: 1081 YHSLSNGLLESVDLKSKLLMKDQNKLLKDGSNFLNGEVNCSPHGSFSDCLKSTGSHSASD 1140
Query: 1141 VRRLFVSPFSKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNEFAKDMKSNKRV 1200
VRR F SPF KLLDRNSLNSSSSGKRSSPNIELPCISEEAES EE DNEFAKDMKSNKRV
Sbjct: 1141 VRRPFASPFGKLLDRNSLNSSSSGKRSSPNIELPCISEEAESIEEIDNEFAKDMKSNKRV 1200
Query: 1201 PLVDITENSNVLVTVSEAIMCADRLSLESLNTELSNTGTHNRTKENLANQKKSKRKYLNE 1260
PLVDITEN+NV VTV EA+M ADRLSLESLNTELSN GTHNRTKENLANQK SKRKYLNE
Sbjct: 1201 PLVDITENANVSVTVPEAVMFADRLSLESLNTELSNAGTHNRTKENLANQKNSKRKYLNE 1260
Query: 1261 AVDLDIFPGANRAKRVTRSSYNRFSRTDLSCKENFRKEGSRFSGKETKHKNIVSNITSFI 1320
AVDLDI PGAN AKRVTRSSYNRFSR+DLSCKENFRK GSRFSGKETKHKNIVSNITSFI
Sbjct: 1261 AVDLDILPGANGAKRVTRSSYNRFSRSDLSCKENFRK-GSRFSGKETKHKNIVSNITSFI 1320
Query: 1321 PLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKKALKLERARMEQENLR 1380
PLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKK+ALKLERARMEQENLR
Sbjct: 1321 PLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLR 1380
Query: 1381 QLELEKKKKEEERKKKEEEMKKREADKAAKKRQREEEERKEKERKRMRVEEVRRRLREHG 1440
Q+ELEKKKKEEERKKKEEEMKKREADKA KKRQREEEERKEKERKRMRVEEVRRRLREH
Sbjct: 1381 QIELEKKKKEEERKKKEEEMKKREADKAEKKRQREEEERKEKERKRMRVEEVRRRLREHS 1440
Query: 1441 GKSRSDKENKDVKPQANEQKPPDRKACKDVTNKLDKENGHDKFDKLSVTKSKSTTSDARR 1500
GK RSDKENKD KPQANEQKP RKACKDVTNKLDKENGH+KFDKLSVT+SKS+TSDA R
Sbjct: 1441 GKLRSDKENKDAKPQANEQKPRCRKACKDVTNKLDKENGHEKFDKLSVTESKSSTSDAGR 1500
Query: 1501 KKFVVENSQPTSVEFLDPEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRK 1560
+ F+VENSQPTSV+FL+ EALE GMES ISETSER+SYQISPYKASDDEDEEDE+DGIR
Sbjct: 1501 ENFLVENSQPTSVDFLEAEALEIGMESGISETSERQSYQISPYKASDDEDEEDEEDGIRN 1560
Query: 1561 NKFVPSWASKDRVAALFASQQKLNPEIIFPPKSFCDIEQ 1588
NKFVPSWASKDRVAALFASQQKLNPEIIFPPKSFCDIEQ
Sbjct: 1561 NKFVPSWASKDRVAALFASQQKLNPEIIFPPKSFCDIEQ 1594
BLAST of Chy1G011410 vs. NCBI nr
Match:
XP_008463006.1 (PREDICTED: uncharacterized protein LOC103501253 isoform X1 [Cucumis melo])
HSP 1 Score: 2691 bits (6976), Expect = 0.0
Identity = 1440/1601 (89.94%), Postives = 1484/1601 (92.69%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQAKQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQA+QQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQARQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
Query: 61 E--EVNKGFISGVEFPRSPLDTHRSSLNEAFVEDSGEELEHRSTEEAGSLNDDFDAANRP 120
E EVNK FISGVEFPRSPLDTHRSSLNEAFV DSGEELEHRS EE GSLNDDFDA NRP
Sbjct: 61 EVAEVNKSFISGVEFPRSPLDTHRSSLNEAFVADSGEELEHRSNEETGSLNDDFDAGNRP 120
Query: 121 AISPQCDISSAGVLNCAPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKALEL 180
A+SPQCDI SAGVLNCAPCIEMTPVSPHGRG IVS+NYRDPTLSLARLHRSKSRQKALEL
Sbjct: 121 AVSPQCDIRSAGVLNCAPCIEMTPVSPHGRGAIVSENYRDPTLSLARLHRSKSRQKALEL 180
Query: 181 RNSVKSTRCQSRCENKSDSLAGGIVGSAIGLLQADHEDESGLAKASSSCNGIGSLEEESN 240
RNSVKSTRCQSRCENKSDS+AG IVGSAIGLLQADHEDESGLAKASSSC GIGSLEEE+N
Sbjct: 181 RNSVKSTRCQSRCENKSDSIAGRIVGSAIGLLQADHEDESGLAKASSSCRGIGSLEEETN 240
Query: 241 VGCEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNISSKNEELCIAGGSTQNSYQVNEQF 300
VGCEQK SSIGSDKVGVVVSPGLQSRFIDV+NSLNISSKNEELCIAGGSTQNSYQVNEQF
Sbjct: 241 VGCEQKRSSIGSDKVGVVVSPGLQSRFIDVENSLNISSKNEELCIAGGSTQNSYQVNEQF 300
Query: 301 DSPRPSSGKIEEESAYCRSQEYSSDKPEKCRLQCSSLDANETSCISPEDGRAGPIGGSKF 360
DSPRPSSGKIEE S YCRSQEYSSDKPEKCRLQCSSLDAN+TSCISP DGRAG IGG KF
Sbjct: 301 DSPRPSSGKIEEGSTYCRSQEYSSDKPEKCRLQCSSLDANKTSCISPVDGRAGTIGGPKF 360
Query: 361 HSDQVDEQLDLPKPSSDNVECNEKAVLGHCRSHDYDLDNALQSESQQRSQEMDDSSRIDA 420
HSDQVDEQLDLPKPSSDNVECNE+AVLGHCRSHDYDLDNALQS SQQ SQE+DDSS IDA
Sbjct: 361 HSDQVDEQLDLPKPSSDNVECNEEAVLGHCRSHDYDLDNALQSRSQQSSQEVDDSSIIDA 420
Query: 421 SDGRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAQQFGSQYSSQDADNSSYVD-EVG 480
DGRLLDLYNPSSGKVECC ETI GHC S+ECNFEIAQQ GSQYS QD D+SSYVD EVG
Sbjct: 421 CDGRLLDLYNPSSGKVECCGETILGHCWSQECNFEIAQQSGSQYSPQDVDDSSYVDSEVG 480
Query: 481 GSCPIGSSKVHPHEVKEQLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKSGMQHNSL 540
GSCPIGSS VHP EVKEQLDLSK+S NIECCEEKILG LS+Q+YKL+NPQKSGMQHNSL
Sbjct: 481 GSCPIGSSNVHPREVKEQLDLSKTSSGNIECCEEKILGGLSSQDYKLDNPQKSGMQHNSL 540
Query: 541 DGDNSSCFSSVNGTFCPVGSSKQHSDQVIERLELFRPSSVNSECHEEELEDCRTQDCNFD 600
D DNSSCFSSVNGTFC VGSSKQHSD V E LELFRPSSVNSECHEEELEDCRTQDCNF+
Sbjct: 541 DADNSSCFSSVNGTFCAVGSSKQHSDLVSEPLELFRPSSVNSECHEEELEDCRTQDCNFN 600
Query: 601 NNAEQFDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSENEKCNSLLHIPLPQIQVDSV 660
NNA Q V K FSSPI EVRE TSDKK SSF+DDKRD SE EK NSLLHIPLPQIQVDSV
Sbjct: 601 NNAVQSGVGKNFSSPIMEVREKTSDKKSSSFIDDKRDASEKEKSNSLLHIPLPQIQVDSV 660
Query: 661 KENESDKGASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVNTCSLLQSDEPAEKNVSW 720
KENESD+GASESH+ERRYEDTGDFNGNTLSSGNKSLQGYEEV TCSLLQSDEPAE+NVS
Sbjct: 661 KENESDQGASESHNERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEQNVSL 720
Query: 721 KDGVSDLQNSHDNVVEIPPVDANGASVPIKDTETFRDHVVMVPCVPYVGETDGYLEQQLK 780
KDGVSDLQNSHDNVVEIPPVD NG SVP KDTETFRDHV+M P YVGETDGYLEQQLK
Sbjct: 721 KDGVSDLQNSHDNVVEIPPVDGNGTSVPRKDTETFRDHVIMAP---YVGETDGYLEQQLK 780
Query: 781 SAGISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVE 840
S+GISQC SDSFEYCTDDFNGNHHY+STECQ AETSIELKTFS+LTKASSSPEDVRRVE
Sbjct: 781 SSGISQCEGSDSFEYCTDDFNGNHHYISTECQTAETSIELKTFSSLTKASSSPEDVRRVE 840
Query: 841 PELGIGIPGSLGLGS----------EQLQIINGSPTDKMLMQEFDTEKPVLEFQRLSFCE 900
ELG G PGSLGLGS EQLQIINGSPTD +LM+EFDTEKPVLE QRLSFC
Sbjct: 841 LELGSGFPGSLGLGSGIPGSLGLGGEQLQIINGSPTDNILMEEFDTEKPVLEIQRLSFCG 900
Query: 901 EGYQQSNVSTVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQ 960
EGYQQSNVS VPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQ
Sbjct: 901 EGYQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQ 960
Query: 961 SLDPEENLQSGEDNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGDGKDMPALEGF 1020
SLD EENLQSGE N+LPVDT K EREEDKGKLTSCSLLTPLIQTSHY G KDMPALEGF
Sbjct: 961 SLDREENLQSGE-NELPVDTEKMEREEDKGKLTSCSLLTPLIQTSHYFGADKDMPALEGF 1020
Query: 1021 LMQSDAEQPCLSVRGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVA 1080
LMQSDAEQPC+SV GINLDTLELSKCMIERASILEKICKSACI+SPLSSSSES KLNKVA
Sbjct: 1021 LMQSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACIDSPLSSSSESFKLNKVA 1080
Query: 1081 DLYHSLSNGLLESVNLKSNFLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSA 1140
DLYHSLSNGLLESV+LKS LM DQNKLLKDGSNFLNGEVNCSPHGSFS CLKS GSHSA
Sbjct: 1081 DLYHSLSNGLLESVDLKSKLLMKDQNKLLKDGSNFLNGEVNCSPHGSFSDCLKSTGSHSA 1140
Query: 1141 SDVRRLFVSPFSKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNEFAKDMKSNK 1200
SDVRR F SPF KLLDRNSLNSSSSGKRSSPNIELPCISEEAES EE DNEFAKDMKSNK
Sbjct: 1141 SDVRRPFASPFGKLLDRNSLNSSSSGKRSSPNIELPCISEEAESIEEIDNEFAKDMKSNK 1200
Query: 1201 RVPLVDITENSNVLVTVSEAIMCADRLSLESLNTELSNTGTHNRTKENLANQKKSKRKYL 1260
RVPLVDITEN+NV VTV EA+M ADRLSLESLNTELSN GTHNRTKENLANQK SKRKYL
Sbjct: 1201 RVPLVDITENANVSVTVPEAVMFADRLSLESLNTELSNAGTHNRTKENLANQKNSKRKYL 1260
Query: 1261 NEAVDLDIFPGANRAKRVTRSSYNRFSRTDLSCKENFRKEGSRFSGKETKHKNIVSNITS 1320
NEAVDLDI PGAN AKRVTRSSYNRFSR+DLSCKENFRK GSRFSGKETKHKNIVSNITS
Sbjct: 1261 NEAVDLDILPGANGAKRVTRSSYNRFSRSDLSCKENFRK-GSRFSGKETKHKNIVSNITS 1320
Query: 1321 FIPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKKALKLERARMEQEN 1380
FIPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKK+ALKLERARMEQEN
Sbjct: 1321 FIPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQEN 1380
Query: 1381 LRQLELEKKKKEEERKKKEEEMKKREADKAAKKRQREEEERKEKERKRMRVEEVRRRLRE 1440
LRQ+ELEKKKKEEERKKKEEEMKKREADKA KKRQREEEERKEKERKRMRVEEVRRRLRE
Sbjct: 1381 LRQIELEKKKKEEERKKKEEEMKKREADKAEKKRQREEEERKEKERKRMRVEEVRRRLRE 1440
Query: 1441 HGGKSRSDKENKDVKPQANEQKPPDRKACKDVTNKLDKENGHDKFDKLSVTKSKSTTSDA 1500
H GK RSDKENKD KPQANEQKP RKACKDVTNKLDKENGH+KFDKLSVT+SKS+TSDA
Sbjct: 1441 HSGKLRSDKENKDAKPQANEQKPRCRKACKDVTNKLDKENGHEKFDKLSVTESKSSTSDA 1500
Query: 1501 RRKKFVVENSQPTSVEFLDPEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGI 1560
R+ F+VENSQPTSV+FL+ EALE GMES ISETSER+SYQISPYKASDDEDEEDE+DGI
Sbjct: 1501 GRENFLVENSQPTSVDFLEAEALEIGMESGISETSERQSYQISPYKASDDEDEEDEEDGI 1560
Query: 1561 RKNKFVPSWASKDRVAALFASQQKLNPEIIFPPKSFCDIEQ 1588
R NKFVPSWASKDRVAALFASQQKLNPEIIFPPKSFCDIEQ
Sbjct: 1561 RNNKFVPSWASKDRVAALFASQQKLNPEIIFPPKSFCDIEQ 1596
BLAST of Chy1G011410 vs. NCBI nr
Match:
XP_008463009.1 (PREDICTED: uncharacterized protein LOC103501253 isoform X3 [Cucumis melo])
HSP 1 Score: 2630 bits (6816), Expect = 0.0
Identity = 1409/1570 (89.75%), Postives = 1453/1570 (92.55%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQAKQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQA+QQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQARQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
Query: 61 E--EVNKGFISGVEFPRSPLDTHRSSLNEAFVEDSGEELEHRSTEEAGSLNDDFDAANRP 120
E EVNK FISGVEFPRSPLDTHRSSLNEAFV DSGEELEHRS EE GSLNDDFDA NRP
Sbjct: 61 EVAEVNKSFISGVEFPRSPLDTHRSSLNEAFVADSGEELEHRSNEETGSLNDDFDAGNRP 120
Query: 121 AISPQCDISSAGVLNCAPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKALEL 180
A+SPQCDI SAGVLNCAPCIEMTPVSPHGRG IVS+NYRDPTLSLARLHRSKSRQKALEL
Sbjct: 121 AVSPQCDIRSAGVLNCAPCIEMTPVSPHGRGAIVSENYRDPTLSLARLHRSKSRQKALEL 180
Query: 181 RNSVKSTRCQSRCENKSDSLAGGIVGSAIGLLQADHEDESGLAKASSSCNGIGSLEEESN 240
RNSVKSTRCQSRCENKSDS+AG IVGSAIGLLQADHEDESGLAKASSSC GIGSLEEE+N
Sbjct: 181 RNSVKSTRCQSRCENKSDSIAGRIVGSAIGLLQADHEDESGLAKASSSCRGIGSLEEETN 240
Query: 241 VGCEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNISSKNEELCIAGGSTQNSYQVNEQF 300
VGCEQK SSIGSDKVGVVVSPGLQSRFIDV+NSLNISSKNEELCIAGGSTQNSYQVNEQF
Sbjct: 241 VGCEQKRSSIGSDKVGVVVSPGLQSRFIDVENSLNISSKNEELCIAGGSTQNSYQVNEQF 300
Query: 301 DSPRPSSGKIEEESAYCRSQEYSSDKPEKCRLQCSSLDANETSCISPEDGRAGPIGGSKF 360
DSPRPSSGKIEE S YCRSQEYSSDKPEKCRLQCSSLDAN+TSCISP DGRAG IGG KF
Sbjct: 301 DSPRPSSGKIEEGSTYCRSQEYSSDKPEKCRLQCSSLDANKTSCISPVDGRAGTIGGPKF 360
Query: 361 HSDQVDEQLDLPKPSSDNVECNEKAVLGHCRSHDYDLDNALQSESQQRSQEMDDSSRIDA 420
HSDQVDEQLDLPKPSSDNVECNE+AVLGHCRSHDYDLDNALQS SQQ SQE+DDSS IDA
Sbjct: 361 HSDQVDEQLDLPKPSSDNVECNEEAVLGHCRSHDYDLDNALQSRSQQSSQEVDDSSIIDA 420
Query: 421 SDGRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAQQFGSQYSSQDADNSSYVD-EVG 480
DGRLLDLYNPSSGKVECC ETI GHC S+ECNFEIAQQ GSQYS QD D+SSYVD EVG
Sbjct: 421 CDGRLLDLYNPSSGKVECCGETILGHCWSQECNFEIAQQSGSQYSPQDVDDSSYVDSEVG 480
Query: 481 GSCPIGSSKVHPHEVKEQLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKSGMQHNSL 540
GSCPIGSS VHP EVKEQLDLSK+S NIECCEEKILG LS+Q+YKL+NPQKSGMQHNSL
Sbjct: 481 GSCPIGSSNVHPREVKEQLDLSKTSSGNIECCEEKILGGLSSQDYKLDNPQKSGMQHNSL 540
Query: 541 DGDNSSCFSSVNGTFCPVGSSKQHSDQVIERLELFRPSSVNSECHEEELEDCRTQDCNFD 600
D DNSSCFSSVNGTFC VGSSKQHSD V E LELFRPSSVNSECHEEELEDCRTQDCNF+
Sbjct: 541 DADNSSCFSSVNGTFCAVGSSKQHSDLVSEPLELFRPSSVNSECHEEELEDCRTQDCNFN 600
Query: 601 NNAEQFDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSENEKCNSLLHIPLPQIQVDSV 660
NNA Q V K FSSPI EVRE TSDKK SSF+DDKRD SE EK NSLLHIPLPQIQVDSV
Sbjct: 601 NNAVQSGVGKNFSSPIMEVREKTSDKKSSSFIDDKRDASEKEKSNSLLHIPLPQIQVDSV 660
Query: 661 KENESDKGASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVNTCSLLQSDEPAEKNVSW 720
KENESD+GASESH+ERRYEDTGDFNGNTLSSGNKSLQGYEEV TCSLLQSDEPAE+NVS
Sbjct: 661 KENESDQGASESHNERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEQNVSL 720
Query: 721 KDGVSDLQNSHDNVVEIPPVDANGASVPIKDTETFRDHVVMVPCVPYVGETDGYLEQQLK 780
KDGVSDLQNSHDNVVEIPPVD NG SVP KDTETFRDHV+M P YVGETDGYLEQQLK
Sbjct: 721 KDGVSDLQNSHDNVVEIPPVDGNGTSVPRKDTETFRDHVIMAP---YVGETDGYLEQQLK 780
Query: 781 SAGISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVE 840
S+GISQC SDSFEYCTDDFNGNHHY+STECQ AETSIELKTFS+LTKASSSPEDVRRVE
Sbjct: 781 SSGISQCEGSDSFEYCTDDFNGNHHYISTECQTAETSIELKTFSSLTKASSSPEDVRRVE 840
Query: 841 PELGIGIPGSLGLGS----------EQLQIINGSPTDKMLMQEFDTEKPVLEFQRLSFCE 900
ELG G PGSLGLGS EQLQIINGSPTD +LM+EFDTEKPVLE QRLSFC
Sbjct: 841 LELGSGFPGSLGLGSGIPGSLGLGGEQLQIINGSPTDNILMEEFDTEKPVLEIQRLSFCG 900
Query: 901 EGYQQSNVSTVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQ 960
EGYQQSNVS VPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQ
Sbjct: 901 EGYQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQ 960
Query: 961 SLDPEENLQSGEDNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGDGKDMPALEGF 1020
SLD EENLQSGE N+LPVDT K EREEDKGKLTSCSLLTPLIQTSHY G KDMPALEGF
Sbjct: 961 SLDREENLQSGE-NELPVDTEKMEREEDKGKLTSCSLLTPLIQTSHYFGADKDMPALEGF 1020
Query: 1021 LMQSDAEQPCLSVRGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVA 1080
LMQSDAEQPC+SV GINLDTLELSKCMIERASILEKICKSACI+SPLSSSSES KLNKVA
Sbjct: 1021 LMQSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACIDSPLSSSSESFKLNKVA 1080
Query: 1081 DLYHSLSNGLLESVNLKSNFLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSA 1140
DLYHSLSNGLLESV+LKS LM DQNKLLKDGSNFLNGEVNCSPHGSFS CLKS GSHSA
Sbjct: 1081 DLYHSLSNGLLESVDLKSKLLMKDQNKLLKDGSNFLNGEVNCSPHGSFSDCLKSTGSHSA 1140
Query: 1141 SDVRRLFVSPFSKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNEFAKDMKSNK 1200
SDVRR F SPF KLLDRNSLNSSSSGKRSSPNIELPCISEEAES EE DNEFAKDMKSNK
Sbjct: 1141 SDVRRPFASPFGKLLDRNSLNSSSSGKRSSPNIELPCISEEAESIEEIDNEFAKDMKSNK 1200
Query: 1201 RVPLVDITENSNVLVTVSEAIMCADRLSLESLNTELSNTGTHNRTKENLANQKKSKRKYL 1260
RVPLVDITEN+NV VTV EA+M ADRLSLESLNTELSN GTHNRTKENLANQK SKRKYL
Sbjct: 1201 RVPLVDITENANVSVTVPEAVMFADRLSLESLNTELSNAGTHNRTKENLANQKNSKRKYL 1260
Query: 1261 NEAVDLDIFPGANRAKRVTRSSYNRFSRTDLSCKENFRKEGSRFSGKETKHKNIVSNITS 1320
NEAVDLDI PGAN AKRVTRSSYNRFSR+DLSCKENFRK GSRFSGKETKHKNIVSNITS
Sbjct: 1261 NEAVDLDILPGANGAKRVTRSSYNRFSRSDLSCKENFRK-GSRFSGKETKHKNIVSNITS 1320
Query: 1321 FIPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKKALKLERARMEQEN 1380
FIPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKK+ALKLERARMEQEN
Sbjct: 1321 FIPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQEN 1380
Query: 1381 LRQLELEKKKKEEERKKKEEEMKKREADKAAKKRQREEEERKEKERKRMRVEEVRRRLRE 1440
LRQ+ELEKKKKEEERKKKEEEMKKREADKA KKRQREEEERKEKERKRMRVEEVRRRLRE
Sbjct: 1381 LRQIELEKKKKEEERKKKEEEMKKREADKAEKKRQREEEERKEKERKRMRVEEVRRRLRE 1440
Query: 1441 HGGKSRSDKENKDVKPQANEQKPPDRKACKDVTNKLDKENGHDKFDKLSVTKSKSTTSDA 1500
H GK RSDKENKD KPQANEQKP RKACKDVTNKLDKENGH+KFDKLSVT+SKS+TSDA
Sbjct: 1441 HSGKLRSDKENKDAKPQANEQKPRCRKACKDVTNKLDKENGHEKFDKLSVTESKSSTSDA 1500
Query: 1501 RRKKFVVENSQPTSVEFLDPEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGI 1557
R+ F+VENSQPTSV+FL+ EALE GMES ISETSER+SYQISPYKASDDEDEEDE+DGI
Sbjct: 1501 GRENFLVENSQPTSVDFLEAEALEIGMESGISETSERQSYQISPYKASDDEDEEDEEDGI 1560
BLAST of Chy1G011410 vs. TAIR 10
Match:
AT5G55820.1 (CONTAINS InterPro DOMAIN/s: Inner centromere protein, ARK-binding region (InterPro:IPR005635); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 295.8 bits (756), Expect = 2.2e-79
Identity = 490/1843 (26.59%), Postives = 777/1843 (42.16%), Query Frame = 0
Query: 4 MEKLFVQIFERKKWIIDQAKQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHFEEV 63
+E LFVQIFERK+ I++Q +QQ DL+DQHLASK ++ G+ PP WL S L S S E+
Sbjct: 48 IENLFVQIFERKRRIVEQVQQQVDLYDQHLASKCLLAGVSPPSWLWSPSLPSQTS---EL 107
Query: 64 NK-GFISGVEFPRS----------PLDTHR-------------------SSLNEAFVEDS 123
NK IS + FP S P R + L E +E+
Sbjct: 108 NKEEIISELLFPSSRPSIVCPSSRPFSYQRPVRFLADNVVRQDLTSVVNNPLEEQLLEEE 167
Query: 124 GE-ELEHRSTEEAGSLNDDFDA-----------------------ANRPAISPQ------ 183
+ L H + + + + D N+ SP+
Sbjct: 168 PQHNLSHNLVRQVSNHSHEQDVNIASPRDVHEKERLPESVSIDCRENQSCSSPEHSKNQR 227
Query: 184 ----CDISSAG---------VLNCAPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSK 243
D +S G ++ C + + + I D DP LSLA++ RS+
Sbjct: 228 VETNLDATSPGCSQGEKVPKCVSTTGCKRKSSSLGYCQEEIEPDTCIDPGLSLAKMQRSR 287
Query: 244 SRQKALELRNSVKSTRCQSRCENKSDSLAGGIVGSAIGLLQADHEDESGLAKASSSCNGI 303
SRQKALELR+S K+++ +S N+ GG +G I L++D E L K +
Sbjct: 288 SRQKALELRSSAKASKSRSNSRNELKPSPGGDIGFGIASLRSDSVSEIKLFKHDENDEEC 347
Query: 304 GSLEEESNVGCEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNISSKNEEL------CIA 363
E SN ++ D I K+ V +D S++ISS + C+
Sbjct: 348 REEVENSNSQGKRGDQCI---KISVPTESFTLHHEVD---SVSISSSGDAYASIVPECLL 407
Query: 364 GGSTQNSYQVNEQFDSPRPSSGKIEEESAYCRSQEYSSDKPEKCRLQCSSLDANETSCIS 423
N + + ++ +SGK++E+ D + + + LD + S S
Sbjct: 408 ESGHVNDIDILQSIETIDEASGKVDEQ---------VDDPKSRSCYETAYLDGSTRSKSS 467
Query: 424 PEDG--RAGPIGGSKFH----------SDQVDEQLDLPKPSSDNVECN----------EK 483
+D R + F S D +++LP+ + E + +
Sbjct: 468 IQDNSKRKHQKSSNSFSGNFLLTNSNPSHWADHEVELPQAITTTSEVSMVTDAGTSIFQS 527
Query: 484 AVLGHCRSHDYDLDNALQSESQQRSQEMDDSSRID---------ASDGRLLDLYNPSSGK 543
++ RS NA ++ S+ +SS I+ + D NPSS
Sbjct: 528 EIIARSRS------NARENRSKTEHSGSVESSSINLEPRDSIPVLQGSHVKDSLNPSSVD 587
Query: 544 VE-CCEETISGHCRSKECNFEIAQQFGSQYSSQDADNSSYVDEVGGSCPIGSSKVHPHEV 603
E E I+ +SKE G + ++ V + G + P E
Sbjct: 588 AEGLVVENITSSDQSKET--------GECVDTNRCSSAERVSQTG---------ISPDET 647
Query: 604 --KEQLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKSGMQHNSLDGDNSSCFSSVNG 663
+ S S + + E + S K ++ + ++ +++G+ NG
Sbjct: 648 TFAGAIQDSISQIELLSFVESSSIELQSRHSVKQSDDESVLLKPVTVNGEALLVEEDNNG 707
Query: 664 TFCPV-GSSKQHS---DQVIERLELFRPSSVNSECHEEELED--------CRTQD----- 723
+ G SK S + L + S +N E+L D C +++
Sbjct: 708 ESTEISGISKSRSLSQTDITVVLPVVVESILNESGTPEKLIDHSKRCDISCGSKEVQPLG 767
Query: 724 --CNFDNNAEQFDVDKKFSSPITEVREN----TSDKKPSSFLDDKRDVSENEKCNSLLHI 783
+N + + SS I E N SD D + +V E NSLL
Sbjct: 768 SLTEVGSNQSHGIISRARSSLIEEESANDYKALSDGSNHKSADKQLEVREG---NSLLRT 827
Query: 784 PLPQIQVDS---VKENESDKGASE-----SHSERRYEDTGDFNGNTLSSGNKSLQGYEEV 843
P + VD+ V EN +K + E + + R ++ + S N + E+
Sbjct: 828 PDRPVFVDNFDEVPENSREKSSMEKVPTPAPTARVFDVPSLTDSGVNLSANNEMNDIEDH 887
Query: 844 NTCSL----------------LQSDEPAEKNVSWKDGVSDL----QNSHDNVVEIPPVDA 903
N ++ + +EP E N ++ + L Q+ + +PP+
Sbjct: 888 NGLNIEMVAEMESYASHPGLKVGENEPTESN-TFTGHIDALTKRPQHETSSEKAVPPIKR 947
Query: 904 NGASVPIKDTETFRDHVVMVPCVPYVGETD---GYLEQQ-----LKSAGISQCADSDSFE 963
+ + TE H + P + + G + Q L+ + + S +
Sbjct: 948 D-----VTCTEADECHDLESPIQEFFCSSSPMGGSMRQNKRRRILEKPTRRELSSSPGGD 1007
Query: 964 YCTDDF--NGNHHYLSTEC-QIAETSIELKTFSALTKASSSPEDVRRVEPELGIGIPGSL 1023
D+ HH C + +EL+ S+ VE + IG S
Sbjct: 1008 ILESDYVREAVHHREEAACHNVDNYDVELQKL-----IGSASSHHYSVELQKMIGSASSA 1067
Query: 1024 GLGSEQLQIINGSPTDKMLMQE-----FDTEKPVLEFQRLSFCEEGYQQSNVSTVPIEML 1083
L E+ I+ + + + + +E Q+L + S +E+
Sbjct: 1068 ELRFEEGDILESDYVREAVHHREEAACHNVDNYDVELQKLIGSASSHHYS------VELQ 1127
Query: 1084 LLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQSLDPEENLQSGEDNKL 1143
+ A S +L L + L S + T + + ++ + P+ + S N
Sbjct: 1128 KMIGSASSAELRFEESYLLKEAGLMSPASLSYRT--EQLSVQRSQIAPDHRVGSENINFF 1187
Query: 1144 PVDTGKTE--------REEDKGKLTSCSLLTPLIQTSHYLGDGKDMPALEGFLMQSDAEQ 1203
P G+T R+ D S LTPL S D P LEGF++Q+D E
Sbjct: 1188 PY-AGETSHGLASCIVRDSD-----SSPCLTPLGLIS---SDDGSPPVLEGFIIQTDDEN 1247
Query: 1204 PCLSVRGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSN 1263
S +N D+ +L + E A+++E+ICKSAC+N+P +++ K ++ DL S+S
Sbjct: 1248 QSGSKNQLNHDSFQLPRTTAESAAMIEQICKSACMNTPSLHLAKTFKFDEKLDLDQSVST 1307
Query: 1264 GLLESVNLKSNFLMNDQNKLLKDGSNFLNGEVNCSPHG-SFSACLKSIGSHSASDVRRLF 1323
L + + N L+ S F N +N G S++ L G+ S+++ R
Sbjct: 1308 ELFDGMFFSQN---------LEGSSVFDNLGINHDYTGRSYTDSLP--GTGSSAEARNPC 1367
Query: 1324 VSPFSKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNEFA----KDMKSNK--- 1383
+SP KL R+ SSSS KRS+ +LPCISEE E+ EE K M+S K
Sbjct: 1368 MSPTEKLWYRSLQKSSSSEKRSTQTPDLPCISEENENIEEEAENLCTNTPKSMRSEKRGS 1427
Query: 1384 ------------------------------------RVPLVDITEN-SNVLVTVSEAIMC 1443
R PL D+ E+ +L +VSEA +
Sbjct: 1428 SIPELPCIAEENENIDEISDAVNEASGSERENVSAERKPLGDVNEDPMKLLPSVSEAKIP 1487
Query: 1444 ADRLSLESLNTELSNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGANRAKRVTRSSY 1503
ADR SL+S++T S + N K + K S R++ + + G AKR +
Sbjct: 1488 ADRQSLDSVSTAFSFSAKCNSVKSKVG--KLSNRRFTGKGKENQ---GGAGAKRNVKPPS 1547
Query: 1504 NRFSRTDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQRE-AATILKGKRDVKV 1563
+RFS+ LSC + G R KE +H NIVSNITSF+PLVQQ++ A ++ GKRDVKV
Sbjct: 1548 SRFSKPKLSCNSSLTTVGPRLQEKEPRHNNIVSNITSFVPLVQQQKPAPALITGKRDVKV 1607
Query: 1564 KAIEAAEAAKRLAEKKENERQMKKKALKLERARMEQENLRQLELEKKKKEE--------- 1592
KA+EAAEA+KR+AE+KEN+R++KK+A+KLERA+ EQENL++ E+EKKKKEE
Sbjct: 1608 KALEAAEASKRIAEQKENDRKLKKEAMKLERAKQEQENLKKQEIEKKKKEEDRKKKEAEM 1667
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A0A0K8D1 | 0.0e+00 | 94.52 | INCENP_ARK-bind domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G11... | [more] |
A0A1S3CJT1 | 0.0e+00 | 90.06 | uncharacterized protein LOC103501253 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A1S3CIN7 | 0.0e+00 | 89.94 | uncharacterized protein LOC103501253 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A1S3CI78 | 0.0e+00 | 89.75 | uncharacterized protein LOC103501253 isoform X3 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A5D3C921 | 0.0e+00 | 89.66 | Titin-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold265G0... | [more] |
Match Name | E-value | Identity | Description | |
XP_004148933.1 | 0.0 | 94.52 | uncharacterized protein LOC101214907 isoform X2 [Cucumis sativus] >KGN44031.1 hy... | [more] |
XP_011658937.1 | 0.0 | 94.40 | uncharacterized protein LOC101214907 isoform X1 [Cucumis sativus] >XP_011658938.... | [more] |
XP_008463008.1 | 0.0 | 90.06 | PREDICTED: uncharacterized protein LOC103501253 isoform X2 [Cucumis melo] | [more] |
XP_008463006.1 | 0.0 | 89.94 | PREDICTED: uncharacterized protein LOC103501253 isoform X1 [Cucumis melo] | [more] |
XP_008463009.1 | 0.0 | 89.75 | PREDICTED: uncharacterized protein LOC103501253 isoform X3 [Cucumis melo] | [more] |
Match Name | E-value | Identity | Description | |
AT5G55820.1 | 2.2e-79 | 26.59 | CONTAINS InterPro DOMAIN/s: Inner centromere protein, ARK-binding region (InterP... | [more] |