Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTATCCCAAAATTTGAATTCCTTTGCCCCCGCTACTCGCGAAAAACCCATCTTCTTCCTCAGAAACTCCCTCCATCGAAACCCGAACTCTTCTTCCTCTCCTTTCCTCAAAACTTCCTCTCCTTTCCTCAAAACACCCTCTCCTTTCTCTCTCTCTCTCCTCATGGCTCTTCTGTGAAACATGTCGGCTATGGAGAAGCTATTCGTCCAGATCTTCGAAAGGAAGAAGTGGATCATCGACCAGACCAAGCAGCAGACCGATCTCTTCGACCAACACCTCGCTTCCAAGCTCATTATCGATGGAATTGTTCCTCCTCCTTGGCTCCACTCCACTTTTCTTCATTCCCACATTTCCCATTTCCAAGGTACTTTCCTACTCTTTTCAATTTTCCCCATTTCTTATGCTTCATGGTTAGTTAGGTTGAAGTAAGAATTTCCCTCTTTCGCCACATTTTTTTGCTCAATTGGTTTGTCGTAGTTGCAGAGGTGAACAAAAGTTTTATTTCTGGCGTTGAGTTCCCACGTTCGCCGCTTGATGCCCATCGCTCTAGTTTGAATGAAGCATTTGTTGCAGACAGTGGGGAGGAGTGGGAGCACAGATCGACTGAAGAAGCTGGTTCCTTAAACGATGATTTTGATGCAGGAAATAATCCAGCAATTTCACCCCAGTGTGACATAAGTAATGCCGGTGTCTTAAATTGCTCGCCTTGTATTGAAATGACTCCTGTTTCTCCTCATGGTCGAGGAGGCATAGTCTCAGACAATTACCGGGATCCTACTCTGTCATTGGCTCGGTTACACAGATCAAAATCTAGGCAAAAGGCTTTCGAGTTGCGTAATAGTGTGAAATCTACAAGGTGCCAATCTCGGTGTGAGAACAAGAGTGATTCCATTGCTGGTGGGATTGTAGGATCTGTTATTGGTTCACTGCAATCTGATCACGAAGATGAGTCAGGGTTGGCAAAGGCTTCCAGTAGCTGTAATGGAATTGGTTCTCTAGAAGAAGAATCTAATGTTGGTTGTGAGCAGAAGGATAGCTCTATTGGCTCCGATAAAGTTGGAGTAGTTGTAAGCCCTGGGTTGCAAAGTAGATTTATTGATGTGGACAATTCTTTAAACATTTTCTCTAAAAATGAAGAGTTATGTATAGCTGGAGGTTCAACACAGAATTCTTATAAAGTAAATGAGCAATTTGACTCACCTAGACCTTCTTCGGGAAAGATTGAAGAAGGGTCCGCATATTGCAGGAGCCAGGAGTATAGTTCTGATAAACCTGAAAAGTGTAGGTTGCAAAGTAGCTCTTTGGATGCAAATGAAACTTCATGCATTTCCCCTGAAGATGGAAGAGCAGGTCCTATAGGAGGTTCAAAATTCCATTCTGATCAAGTGGACGAGCAATTGGACTTGCCTAAACCTTCTTCTGACAATGTTGAGTGTAATGAAAAGGCTGTATTAGGAGATTGCAGGAGCCATGACTATGATCTTGATAAGGCTCTACAGTCTGAGTCACAACAAAGGTCCCCGGAAGTGGATGATTCATCATGCATTGATGCCAGTGATGGAAGATTGTTGGACTTGTATAACCCTTCTTCTGGAAAAGTTGAATGCTGCGAAGAAACTATTTCAGGACATTGCAGGAGCAAGGAATGTAATTTCGAAATTGCCCACCAGTCTGGGTCGCGATACAGCTCCCAGGATGTGGATAATTCTTCATATGTTGATGAGGTGGGAGGATCCTGTCCCATTGGAAGTTCAAAAGTGCACCCTCATGAAGTGAAAGAGAAATTGGACTTATCTAAATCTTCTTTCGACAATATCGAGTGCTGTGAAGAAAAAATATTAGGTGATTTGAGTAATCAGGAGTATAAACTTAACAATCCTCAAAAGTTTGGGATGCAACATAACTCCCTGGATGGGGACAATTCATCATGCTTTTCTTCTGTAGATGGAACTTTTTGCCGTGTTGGAAGTTCGAAGCAACATTCTGATCAAGGGATTGAGCGGTTGGAGTTGTTTAGACCTTCTTCTGTCAATAGTGAATGTCATGAAGAAGAACTAGAAGACTGTAGGACCCAGGACTGTAATTTTGATAATGCGGAACAGTCTGATGTAGACAAAAAATTCAGTTCACCAATAACGGAGGTAAGGGAGAATACATCCGATAAGAAGCCCTCCAGTTTTCTAGATGACAAGAGGGATGTCAGTGAAAAAGAAAAATGCAATTCACTCCTTCACATACCTTTGCCACAGATTCAGGTAGACTCAGTGAAGGAAAACGAATCTGATAAATGTGCATCTGAATCTCACAGTGAGAGGAGATATGAAGATACAGGAGACTTTAATGGAAATACTCTCTCATCTGGCAACAAGTCACTGCAAGGTTATGAAGAAGTAACTACTTGTTCTTTGCTGCAAAGTGATGAACCTGCTGAAAAAAATGTTTCTTTGAAAGATGGAGTATCAGATTTGCAGAATTCCCATGACAATGTAGTTGAAATTCCACCAGTGGATGCAAACGGTGCATCAGTTCCTATAGAAGATACAGAAACATTTAGAGATCACGTGGTAATGGTTCCTTGTGTTCCTCACGTTGGTGAAACGGATGGTTATTTGGAGCAGCAACTGAAAAGTGCAGGCATATCTCAGTGCGCAGATTCAGATTCCTTTGAGTATTGCACTGACGACTTTAACGGTAACCATCATTACTTATCAACAGAGTGCCAGATTGCAGAAACATCAATTGAGTTAAAAACTTTCAGCGCACTTACGAAGGCATCTAGTTCTCCTGAAGACGTGAGAAGGGTTCAGCCAGAATTGGGGATTGGTATTCCTGAATCTTTAGACTTGGGGAGTGAGCAACTTCAAATTATCAACGGGAGTCCCACAGATAAAATATTGATGCAGGAATTTGACACTGAAAAACCTGTCCTTGAATTTCAACGATTATCATTTTGTGAAGAAGGTTATCAACAATCAAATGTGAGCATTGTCCCTATTGAAATGTTGCTTTTGGAAAAGGAAGCTCATTCAATGCAATTGTCTGATTCTTCACCCACGCTTCTAGTTAAAGAGGTATACATTTATTACAACGGGAAGAATTTTCAAACAGGCCTTCACAGGTTGACTATTGAAACAAACATCGTTCATGCATAGTCTATAAAATATTTTAGAATAAACTACAAAATCACGATGCAAACCCAAAATTAGTTATCATCACCAAATAGGTTCTTTATCACCTCCAATTTTCTCATCATTAGACTTATGACCTTTTTTTTTCCCACCATCAAACTTATAAGTTTCCATATTTTCCACCATTTGACTTTACAGCCTTCTGTATTTCCTTCTTTGTTTAGTTGTTTTTATGTTCTTCTACATCTGTAAAAGAAAAAATGGGGTGTGTTGGGTTAGTTAAACAAAGGAGAAACAACTTCTTTCTAATTTTCCTTTCTTTTACGTAATATAATCATAATTTATTTCAAGCATCTAAAAGAGAAGTGCATGTTCACAGGGTGAGGTGGCTTACTTAATTTACTTGTGAAGCGCACTTAAGGACACACTTCTGTGGATTAAAACTATTGTTTTGGTGTACTTAGGAAGTTAAGATCAAACGATTTAACTCTCTATTATTTCAAGTATATTGATCTGTTGTTATTGAAACATGGGTATAATAATAAGCGTACCTTCATTGCAAAAGTACCTGTGAATGAGTACGAAGATAATTCTTCCAATTCACTTAACAAAAGTTGTTTTTCTTCTTTTTAATATGTTGTTTTCTGCATGCTGTTTTTGAAAGGATGGACCTAGTTAGGAAGGGGGTGATGGTAGTCATACTCTTTGTGCTTTTAAGAGGGTGTAGTCTTCACTAAATGCTGGCCTTATCATTTTCTGACAAATTTCTAGGCAGGGGATTTGATTGATTTTTTTCCTCCTTCTTTACCTTTTTTATTAACTTCTCTTCATGATCTTGTTTCTTTGTGCATGATTGACAGGATCTATCTAGGTTCAGAAACAATAACAGAGGCACGCTGTTGCAAAATGTCATGCTAGAGAGCCAAAGTTTGGATCCTGAAGAAAATCTTCAGTCTGGAGATAACAAACTTCCTGTTGATACCGGGAAGACGGAAAGAGAGGAGGATAAGGGGAAACTTACTTCTTGCTCACTTCTTACTCCCCTAATCCAAACTTCTCATTATCTTGGTGCCGACAAGGATATGCCTGCATTAGAGGGGTTCCTAATGCAGTCTGATGCTGAACAACCATGCATTTCTGTTGGTGGAATCAACCTTGACACATTAGAACTTTCAAAGTGTATGATAGAACGCGCTAGCATATTGGAGAAAATCTGTAAATCTGCTTGTATAAACAGTCCATTATCCTCATCTTCAGAAAGTCTTAAGTTGAACAAGGTGGCAGATTTGTACCATTCCCTTTCTAATGGTCTGTTAGAGAGCGTGGACTTGAAGAGTAACCTTCTGATGAATGATCAAAATAAGCTACTGAAGGATGGTAGTAACTTCTTGAATGGAGAAGTCAACTGTTCTCCTCATGGGTCATTTTCTGCTTGCCTGAAAAGCATTGGCAGTCATTCAGCTAGCGATGTTAGGAGGCCGTTTGTATCCCCCTTTAGTAAGTTGTTGGATAGAAATTCATTAAATTCCTCAAGTTCAGGAAAACGAAGCAGCCCGAATATAGAGCTTCCTTGCATTAGTGAAGAAGCTGAGAGTACAGAGGAGACTGATAATAAGTTTGCAAAGGATATGAAATCGAACATGCGAGTACCACTTGTTGACGTTACAGAAAATGCAAATGTTCCGGTAGCAGTTTCTGAAACTGTCATGTTTGCTGATAGATTGAGTTTAGAATCTTTAAACACGGAAGTCGGCAACACGGGGACTCATAATAGAACCAAAGAGAATTTGGCAAACCAGAAAAAGAGTAAAAGGAAATATTTGAATGAGGCTGTAGATCTTGATATCTTTCCAGGAGCAAACGGAGCTAAAAGAGTCACTAGGTCATCTTATAGTAGATTTAGCAGGTCAGATTTATCCTGTAAAGAAAATTTCAGAAAAGAAGGCTCTCGATTCTCAGGAAAGGAGACCAAGCATAAAAATATTGTGTCCAATATTACTTCTTTTATTCCTCTTGTCCAACAAAGAGAAGCTGCAACTATTTTGAAAGGTCTGTATATATATTTTTAACTATATCTGCATTCAATATTAACATACTTGGATTGATTATGAATCTTTAACATTTCTTGGATATTGTCATGTTAAATATCTCTAGAATTTTATGTCCCCATTACTTCCTACTACTGAATTGAGAAAATGTGTAATGAAGAATGATTTTCATGTACGATATCTAGAAGAATCCCACAACAAAGTCAAGTCCTAGGCTCTCCTACCCATAACACGCCCCTTATTGCTCCACTCTCTTATTCCCCTGTTATCATCCTCCCTTTTTGCACGTCCTATTTTGTTATGGTTATTTTTGGAAATTTATTTTGGCTTCTTCTGTCAGTGTTACCATGTTCGGCAGTTGTGGCATTGTTGATTGATGTTTGCCCAAGTTAGATGTTCCTAACTTGGTTTTGTCCTACGTTTTGCTTTGAGCTTGATTAAAGTCATCCCTTTTGGTTTAGTTTTTGGTATGAAGTTTTCGAAGATGGACAACTTGGTGGATGCTATGGTTTTCTAGCTTCTTCAAAAGCCTGATGTGTTGAGATCTGTTTTGTAATTTTTCATTTGTTCATGGTTTATGTTGGTCACACAATTTTTCAATCTTGAGGCCTTTGGCTACCTCCTAACAAGAACACTCAAGAACAAAAAACAAACAAAGAACACTAAAATGTAGATTTATATTGCAAATAGCAGATCAGATTAACATAGCCTTTCGTGATAACTACTCTCCTAGTATTTCCTACAGGAAATTACCCACAAATTTCTTCACTCCTTCCTCTCAACCCAGTACAGTACCCCCACACTCACTATTTCTAACTAAAAGCCCTCTGTCTAATTACTAATATTCCCCTTCTTATAAGTAGTAACCATACCAATATTTACAACTAATCCTACTAAATTCATAATTAGGGTCTTACAGATTCCAAATCTTGAATTGGTTGTTTTAGATTTATGATTGATCGTCGTGTATCCCCTTATCTTTTAAAGAATTCAGTTTTGATTTGGCTGGTCTTAAGGCTTCATCTTGGTGCTCCCTTTGCAGATCTTATGTTAGCTTCTTTTTGCAGGATATATTTCTTATTGAATGCCTATATCTTTTTGTTGCATTGTCGTTCCATTGTTTATTGCTTTCCTTGTTTACTCCCTTGTGGAATGTGTGTACTTTGAGCATGATCTCTTTTTATTATATCAATGTATACGTCCGTATCTTGTTAAAAAGAAGTAGAAAAAGAAAAGGAAAGAAAAGAAGCAGGCACATTAAACCACCAACTCAACCATAATTGCTAATGCTAACTAATTTTTTCCCATAACTTGGACGCCTGTACTTTTCTTTGTTTTCCTTGTTTTCCTTGTTTGTTAAACAGAATGGGCCTATCAAAATTTTATTTTGGTGTGGCATTGAACTCTATCTATTATTTGGGAGTACACAGTTTGACTATCGTGTATCTGATGATCTGAGTATTGTAACAAATTGTAATTTAAGTTTGAAACATGTGAATTTCTGTAGGGAAGAGAGATGTTAAGGTGAAGGCCATTGAGGCTGCTGAGGCTGCAAAACGCCTTGCAGAAAAGAAGGAAAATGAACGTCAAATGAAGAAAGAAGCCCTGAAACTTGAGAGAGCAAGAATGGAGCAAGAGAATTTGAGGCAGCTTGAACTTGAGAAAAAGAAGAAAGAAGAAGATAGAAAGAAGAAAGAGGAAGAAATGAAGAAAAGGAAGGCTGATAAGGCAGCAAAGAAAAGACAGAGAGAAGAAGAAGAGAGGAAGGAGAAAGAAAGAAAAAGAATGCATGTTGAAGAGGTTAGGAGACGATTACGAGAACATGGTGGGAAGTTACGATCAGATAAAGAGAATAAGGATGTGAAACCCCAAGCCAATGTAAGAAGCTATATGTGATAATGTTTGAACTTTCCTTGTGCTTATGGGATTTGAGAGGTAAAAATAGCTTCTTATTTTTTTCTGGTAGGAACAAAAACCACTCGACAGAAAGGCATGTAAGGATGTGACAAACAAACTGGACAAGGAAAATGGACATGAAAAATTTGACAAACTCTCAGTTACCAAGTCCAAGAGTACTACAAGCGATGCTAGGAGGGAAAACTTTGTTGTGGAGAACGCACAACCAACGATTGTAGGATTTCTAGAGGCAGAGGTAAATTGATTTTGTATTTGCAAATTTCCAAAAACTATGACGTGATTTATAGTGAAATGTTATTTTCCCATAATAAGCATGGCCAATTAGCTAGCAATCTTCCATTGTTCTAACCCTCACCAGTCTTCTACTGAAAGTGTCGTTTATTATTATTATTATTATTATTTTTTATTTAAGATTGTCACGATTTGTTTTGAGGGATAATAGATGGAGGACGATTGAAACCTTTTGGAGGTAATGTAGTGCAGCATCTGAAAGATTCTGGGCTTATAGATGAAGTAGGTTGTTGTAATTTGAAACCATCCTAGGATACTTCTTGTAGATAAGCAATAAAATAATCGAAAAGGCACAATTCTTGAAACATAACACCCTAGGCTTCCAGTTAGTTTTAGGTCTGAATCTACAAGAAAGAAGCATGAAAACTATTTTCTAGATAATTATTTGTCTGTCTAGGAGTGATGATTAAAGTAGTGATTTTAAAAATTTGAGCGAGTTAAAGTAGTGATTTTAGAAAAAGAGATTTTTGTACAAATTATTTTGTAAAATTGATTTGATTCAATTGTATTGATATTTGGTTCAGAATTTTAAAAATTGAATTTTATATAGTTAAAAGTGATTTTGTGTTATTATTTTTGGTTTCAAAAGTGATTCTAGAATTACTAAATTATGAAAAAAGACATTAAAAAAAACGTTCAAATATATGGGACTAATTTTACTTAAGAAGAGACTGAGAAAAAAAAAAACAGAAAATGGAGCTAATTTTGGTGATTCTTGGTCGATCAGTTCCCTAAAAGCACATTCCAACTCGTTGTTGGAAAATAATTCTATATGATTAGATTTCTCTAAAATAAGATTTCAACATCTGCCAGTTATCAATTTTTGAAAGAAAAGTGATTTTGGCAATGACAAATTAATGTTGGCCATGTTATAATCATTCCAAAACATACACTAAATCTTTATGCATCTTGCTGTATCCTGTTACATACTTATACCCACAGAAAATAATTCCACTTGATCTAGAAACTCTGTTGGACTTTTGTAGAGTAAATCTTACTGGAGCCTGAATCTAAACTGAACTAAGAATTTTAAGATACTATACAGTATTGACTTTTGCTTGATCCTATCATATGTCGCTTAAAATTGAGAAATTAAATAGTTATATGATTTAACCTACTTGGAATGACTGCTCAAATATTTGGCAACTTAAAGTGAGAAAGATAATTAGTGGGTGATTAAAGGTAGGTCTTCGTCTTTTTGAGAGAAGAAAAAGATTTTTATAATTGTAACACAGGAAAAAGGTTTTTATAATTGTAACACACTTTTAGATGTTTGATATTAGGAAGTCTAATTTCTTGGAAGACTTTAACATACATGATGAAGGAAGATATTTTTTGTTCCATTTGAGCTTCTTATGCTGAAATTTATGATCTTTAGGCACTTGAAAATGGGATGGAGAGTAGAATCTCCGAAACAAGTGAACGAGAATCATATCAGATTTCTCCTTACAAAGCTTCTGATGATGAAGATGAAGAGGACGAAGATGATGGCATACGAAAAAATAAATTTGTACCTTCGTGGGCAAGGTGTGTAACTTGGTCTGATGTGTTTTTAAATAAGGAAATAACAAATGTGATTTAAACTCCAATGATGCAATTTAGTCTGTTTTGCATTTTTCATTGTATCCCTCTTTCTTAACATCCTTTTTGTCAAAGATAATCACGAAGACCATTTCAAATTTGCCATTTTGTGTTAGAACTAATTTCTTCATACTTCTTTTATGCAGTAAGGATCACGTAGCTGACCTTTTCGCTTCTCAGCAAAAATTGAATCCAGAAATTATATTTCCACCGAAAAGCTTTTGTGATATAGAGCAAGGTGAAAATTAACACAGAAAGATGGCTGCAGCTTTTAGTCTTTATTAATGTTAAATAATAAAAACGCACTGAGCTAGTTTAACTATTTTGCAGTTCTATTGCTTTGACGGCTCTGGATTAAATAGTAGTAACTTTCACAATGTGGATAGATTTTATCTGCAACAGAAACATTCCTCTTTCTCACCATGCAAGAGATTTGCCTAGGTATGCTTTTATTGAACTGTGTTTTTTCTGACTTGAACTTCACATCATGATTGATCACTTTTGACTTGTTTGTCAATAAGTTAGTTGCAACTTAACTCGTTTGGACATTCTTGGTGTTTGAGGGTCATTGCTAGTTATTAGGTCTATCTAGTCTACATAACGTTGACTTCAGTATTTATGTAAATCTTTTACCCGTAGTTAGCTGTACTGTATCTTTTGCGTGGCTTTTCTCTGTCCTTCCCATACAATCTATAAATTATCATGCAATCTAATGGATGATTGAGAGAAAACGTTCACTAGTTGTGTAAATAATCAATAAGAATTTCCCTTCAGCATTATGGCTGCTTGTTTGTTTAATAAATCGATTACTGTTTTCATTTGTCCATGTCTCATTAAAATAGAGTGCTATATTATGTGGCAGATTTTGCTGACTAATTTCTCAAGGCGTTGCTGATCAGGTGATTAGTTTCTTGCTACTCTATATTCTGAATAGCTAAATTGAGATGATTATTGTACAATAGGCAGCTTTATATGTGGGGTTAATTTCATAGCACCAAAATAAGCTATAAAACCCAGTACCCAGCCCCCTCTTCCGACTGGATTCCTCAACCACGAAAAAGTTGGGTAAAACGCCTGTGGTTAGTTCTTGGTGTCAACCTTCCCTTCTTTTGCTTTGTGGTGAGGGATAGGGGGAGGATTTAGGGTTCCCCTGTAGAAAAAGCAGAATGTAACCTGCCAACATGTAGGTGTAAAGCACTAGTTCTTAAGATAACTTCTCTATGTACAATTTATTTTTATTATATTGTCTAATGAAGATGATGCGATTGTTCATCTTGAAGAGGCAAACATTGATTATATTCTAGAAGAAAAGGTAACCCTTAATTCCGTAAAAGTTTATGCCAGTATTACATAGGTTGGTTTATGATGAAGGAATTTATAATATGAATAGACTTTATATATCATAAGCAGTTCCCAGCGGAAACTAGGTTTCTATTTTATCTAGCTCTTGCTTTGACAGAATCTTTTTCCTCTCATAAGTTCCATTGCTTGAAAAAACTTGTGTTATTCCTAAAGTAGAAGGATGGACAACCATGAGTAATAATTCAAAAAAAAAAAAAGGAAAAAAAAAAAAAATTCCCTCTCTTGTTTGGTTGATAGATCTTGATTGCTTTGTTTCTCCAATTTTCAAGATTAGGTTTCCACATCTAGCCCTTTTTTCTGGATTCTTTTTTCTCCCAGCATAAGTTGTGGGATATATGTTGGAGAAGAACGAACAGGCGTATCAATGGGAGAGAGGTGAAACAAAGTTTGAGAAAGCCATCACCCCTGCAAGGGCATATACCAAAACTTTACATATTTGTATTTCCACCAATCACTTCTAATAATTACACCACAAAAATGTCTCACAACTCACAGGCGAAGAAAAAAGAATATTGAGAAAGCTCCAATGAGTTCAGAGTCCTATTGATGTGGTACTATGTCTTGTCAGTTTACATAAAATACATTCGTCAGGTTCCAATTTCCCTGCAATTCAAGAATATTGAGAAAGCTCCAATGAGTCCAGAGTCCATGCAGGGAAGTTTGTGATGTTATTGGCCTTCAAGTAATTGAAAAGTCGTGTGAATTGTTACATTAAATTTTTAGGTTGTACTACATTTTCCCACCAGATTAAATCAATGTCATTGGGATACAGCAGGTTTACAAACTATTGATGGTAATATGTAGATATGGACATAGTAATAGCACACGCGTAATACATATTTACTTCTACCATCATTTACTTCCTTTCTCTACATAAATGAAGCTGTTGATGAATGATGAAATCATGGGGTAATGAAAATGTTTGGGGTTTCGGGATTACCTGTTGATTGAATGGTTTGGTTTATTTATATGAACATAATGCATTTTGAGACTTGACGTCTCCAGAATCAGTGTTGATACGCTTTGCCTTCGTTGCAACAGTGTAGATGTGTTGGTAACCAGCCAAAAATAGAATGGTATTGGGC
mRNA sequence
ATGTATCCCAAAATTTGAATTCCTTTGCCCCCGCTACTCGCGAAAAACCCATCTTCTTCCTCAGAAACTCCCTCCATCGAAACCCGAACTCTTCTTCCTCTCCTTTCCTCAAAACTTCCTCTCCTTTCCTCAAAACACCCTCTCCTTTCTCTCTCTCTCTCCTCATGGCTCTTCTGTGAAACATGTCGGCTATGGAGAAGCTATTCGTCCAGATCTTCGAAAGGAAGAAGTGGATCATCGACCAGACCAAGCAGCAGACCGATCTCTTCGACCAACACCTCGCTTCCAAGCTCATTATCGATGGAATTGTTCCTCCTCCTTGGCTCCACTCCACTTTTCTTCATTCCCACATTTCCCATTTCCAAGAGGTGAACAAAAGTTTTATTTCTGGCGTTGAGTTCCCACGTTCGCCGCTTGATGCCCATCGCTCTAGTTTGAATGAAGCATTTGTTGCAGACAGTGGGGAGGAGTGGGAGCACAGATCGACTGAAGAAGCTGGTTCCTTAAACGATGATTTTGATGCAGGAAATAATCCAGCAATTTCACCCCAGTGTGACATAAGTAATGCCGGTGTCTTAAATTGCTCGCCTTGTATTGAAATGACTCCTGTTTCTCCTCATGGTCGAGGAGGCATAGTCTCAGACAATTACCGGGATCCTACTCTGTCATTGGCTCGGTTACACAGATCAAAATCTAGGCAAAAGGCTTTCGAGTTGCGTAATAGTGTGAAATCTACAAGGTGCCAATCTCGGTGTGAGAACAAGAGTGATTCCATTGCTGGTGGGATTGTAGGATCTGTTATTGGTTCACTGCAATCTGATCACGAAGATGAGTCAGGGTTGGCAAAGGCTTCCAGTAGCTGTAATGGAATTGGTTCTCTAGAAGAAGAATCTAATGTTGGTTGTGAGCAGAAGGATAGCTCTATTGGCTCCGATAAAGTTGGAGTAGTTGTAAGCCCTGGGTTGCAAAGTAGATTTATTGATGTGGACAATTCTTTAAACATTTTCTCTAAAAATGAAGAGTTATGTATAGCTGGAGGTTCAACACAGAATTCTTATAAAGTAAATGAGCAATTTGACTCACCTAGACCTTCTTCGGGAAAGATTGAAGAAGGGTCCGCATATTGCAGGAGCCAGGAGTATAGTTCTGATAAACCTGAAAAGTGTAGGTTGCAAAGTAGCTCTTTGGATGCAAATGAAACTTCATGCATTTCCCCTGAAGATGGAAGAGCAGGTCCTATAGGAGGTTCAAAATTCCATTCTGATCAAGTGGACGAGCAATTGGACTTGCCTAAACCTTCTTCTGACAATGTTGAGTGTAATGAAAAGGCTGTATTAGGAGATTGCAGGAGCCATGACTATGATCTTGATAAGGCTCTACAGTCTGAGTCACAACAAAGGTCCCCGGAAGTGGATGATTCATCATGCATTGATGCCAGTGATGGAAGATTGTTGGACTTGTATAACCCTTCTTCTGGAAAAGTTGAATGCTGCGAAGAAACTATTTCAGGACATTGCAGGAGCAAGGAATGTAATTTCGAAATTGCCCACCAGTCTGGGTCGCGATACAGCTCCCAGGATGTGGATAATTCTTCATATGTTGATGAGGTGGGAGGATCCTGTCCCATTGGAAGTTCAAAAGTGCACCCTCATGAAGTGAAAGAGAAATTGGACTTATCTAAATCTTCTTTCGACAATATCGAGTGCTGTGAAGAAAAAATATTAGGTGATTTGAGTAATCAGGAGTATAAACTTAACAATCCTCAAAAGTTTGGGATGCAACATAACTCCCTGGATGGGGACAATTCATCATGCTTTTCTTCTGTAGATGGAACTTTTTGCCGTGTTGGAAGTTCGAAGCAACATTCTGATCAAGGGATTGAGCGGTTGGAGTTGTTTAGACCTTCTTCTGTCAATAGTGAATGTCATGAAGAAGAACTAGAAGACTGTAGGACCCAGGACTGTAATTTTGATAATGCGGAACAGTCTGATGTAGACAAAAAATTCAGTTCACCAATAACGGAGGTAAGGGAGAATACATCCGATAAGAAGCCCTCCAGTTTTCTAGATGACAAGAGGGATGTCAGTGAAAAAGAAAAATGCAATTCACTCCTTCACATACCTTTGCCACAGATTCAGGTAGACTCAGTGAAGGAAAACGAATCTGATAAATGTGCATCTGAATCTCACAGTGAGAGGAGATATGAAGATACAGGAGACTTTAATGGAAATACTCTCTCATCTGGCAACAAGTCACTGCAAGGTTATGAAGAAGTAACTACTTGTTCTTTGCTGCAAAGTGATGAACCTGCTGAAAAAAATGTTTCTTTGAAAGATGGAGTATCAGATTTGCAGAATTCCCATGACAATGTAGTTGAAATTCCACCAGTGGATGCAAACGGTGCATCAGTTCCTATAGAAGATACAGAAACATTTAGAGATCACGTGGTAATGGTTCCTTGTGTTCCTCACGTTGGTGAAACGGATGGTTATTTGGAGCAGCAACTGAAAAGTGCAGGCATATCTCAGTGCGCAGATTCAGATTCCTTTGAGTATTGCACTGACGACTTTAACGGTAACCATCATTACTTATCAACAGAGTGCCAGATTGCAGAAACATCAATTGAGTTAAAAACTTTCAGCGCACTTACGAAGGCATCTAGTTCTCCTGAAGACGTGAGAAGGGTTCAGCCAGAATTGGGGATTGGTATTCCTGAATCTTTAGACTTGGGGAGTGAGCAACTTCAAATTATCAACGGGAGTCCCACAGATAAAATATTGATGCAGGAATTTGACACTGAAAAACCTGTCCTTGAATTTCAACGATTATCATTTTGTGAAGAAGGTTATCAACAATCAAATGTGAGCATTGTCCCTATTGAAATGTTGCTTTTGGAAAAGGAAGCTCATTCAATGCAATTGTCTGATTCTTCACCCACGCTTCTAGTTAAAGAGGATCTATCTAGGTTCAGAAACAATAACAGAGGCACGCTGTTGCAAAATGTCATGCTAGAGAGCCAAAGTTTGGATCCTGAAGAAAATCTTCAGTCTGGAGATAACAAACTTCCTGTTGATACCGGGAAGACGGAAAGAGAGGAGGATAAGGGGAAACTTACTTCTTGCTCACTTCTTACTCCCCTAATCCAAACTTCTCATTATCTTGGTGCCGACAAGGATATGCCTGCATTAGAGGGGTTCCTAATGCAGTCTGATGCTGAACAACCATGCATTTCTGTTGGTGGAATCAACCTTGACACATTAGAACTTTCAAAGTGTATGATAGAACGCGCTAGCATATTGGAGAAAATCTGTAAATCTGCTTGTATAAACAGTCCATTATCCTCATCTTCAGAAAGTCTTAAGTTGAACAAGGTGGCAGATTTGTACCATTCCCTTTCTAATGGTCTGTTAGAGAGCGTGGACTTGAAGAGTAACCTTCTGATGAATGATCAAAATAAGCTACTGAAGGATGGTAGTAACTTCTTGAATGGAGAAGTCAACTGTTCTCCTCATGGGTCATTTTCTGCTTGCCTGAAAAGCATTGGCAGTCATTCAGCTAGCGATGTTAGGAGGCCGTTTGTATCCCCCTTTAGTAAGTTGTTGGATAGAAATTCATTAAATTCCTCAAGTTCAGGAAAACGAAGCAGCCCGAATATAGAGCTTCCTTGCATTAGTGAAGAAGCTGAGAGTACAGAGGAGACTGATAATAAGTTTGCAAAGGATATGAAATCGAACATGCGAGTACCACTTGTTGACGTTACAGAAAATGCAAATGTTCCGGTAGCAGTTTCTGAAACTGTCATGTTTGCTGATAGATTGAGTTTAGAATCTTTAAACACGGAAGTCGGCAACACGGGGACTCATAATAGAACCAAAGAGAATTTGGCAAACCAGAAAAAGAGTAAAAGGAAATATTTGAATGAGGCTGTAGATCTTGATATCTTTCCAGGAGCAAACGGAGCTAAAAGAGTCACTAGGTCATCTTATAGTAGATTTAGCAGGTCAGATTTATCCTGTAAAGAAAATTTCAGAAAAGAAGGCTCTCGATTCTCAGGAAAGGAGACCAAGCATAAAAATATTGTGTCCAATATTACTTCTTTTATTCCTCTTGTCCAACAAAGAGAAGCTGCAACTATTTTGAAAGGGAAGAGAGATGTTAAGGTGAAGGCCATTGAGGCTGCTGAGGCTGCAAAACGCCTTGCAGAAAAGAAGGAAAATGAACGTCAAATGAAGAAAGAAGCCCTGAAACTTGAGAGAGCAAGAATGGAGCAAGAGAATTTGAGGCAGCTTGAACTTGAGAAAAAGAAGAAAGAAGAAGATAGAAAGAAGAAAGAGGAAGAAATGAAGAAAAGGAAGGCTGATAAGGCAGCAAAGAAAAGACAGAGAGAAGAAGAAGAGAGGAAGGAGAAAGAAAGAAAAAGAATGCATGTTGAAGAGGTTAGGAGACGATTACGAGAACATGGTGGGAAGTTACGATCAGATAAAGAGAATAAGGATGTGAAACCCCAAGCCAATGAACAAAAACCACTCGACAGAAAGGCATGTAAGGATGTGACAAACAAACTGGACAAGGAAAATGGACATGAAAAATTTGACAAACTCTCAGTTACCAAGTCCAAGAGTACTACAAGCGATGCTAGGAGGGAAAACTTTGTTGTGGAGAACGCACAACCAACGATTGTAGGATTTCTAGAGGCAGAGGCACTTGAAAATGGGATGGAGAGTAGAATCTCCGAAACAAGTGAACGAGAATCATATCAGATTTCTCCTTACAAAGCTTCTGATGATGAAGATGAAGAGGACGAAGATGATGGCATACGAAAAAATAAATTTGTACCTTCGTGGGCAAGTAAGGATCACGTAGCTGACCTTTTCGCTTCTCAGCAAAAATTGAATCCAGAAATTATATTTCCACCGAAAAGCTTTTGTGATATAGAGCAAGTTCTATTGCTTTGACGGCTCTGGATTAAATAGTAGTAACTTTCACAATGTGGATAGATTTTATCTGCAACAGAAACATTCCTCTTTCTCACCATGCAAGAGATTTGCCTAGATTTTGCTGACTAATTTCTCAAGGCGTTGCTGATCAGGTGATTAGTTTCTTGCTACTCTATATTCTGAATAGCTAAATTGAGATGATTATTGTACAATAGGCAGCTTTATATGTGGGGTTAATTTCATAGCACCAAAATAAGCTATAAAACCCAGTACCCAGCCCCCTCTTCCGACTGGATTCCTCAACCACGAAAAAGTTGGGTAAAACGCCTGTGGTTAGTTCTTGGTGTCAACCTTCCCTTCTTTTGCTTTGTGGTGAGGGATAGGGGGAGGATTTAGGGTTCCCCTGTAGAAAAAGCAGAATGTAACCTGCCAACATGTAGGTGTAAAGCACTAGTTCTTAAGATAACTTCTCTATGTACAATTTATTTTTATTATATTGTCTAATGAAGATGATGCGATTGTTCATCTTGAAGAGGCAAACATTGATTATATTCTAGAAGAAAAGATTAGGTTTCCACATCTAGCCCTTTTTTCTGGATTCTTTTTTCTCCCAGCATAAGTTGTGGGATATATGTTGGAGAAGAACGAACAGGCGTATCAATGGGAGAGAGGTGAAACAAAGTTTGAGAAAGCCATCACCCCTGCAAGGGCATATACCAAAACTTTACATATTTGTATTTCCACCAATCACTTCTAATAATTACACCACAAAAATGTCTCACAACTCACAGGCGAAGAAAAAAGAATATTGAGAAAGCTCCAATGAGTTCAGAGTCCTATTGATGTGGTACTATGTCTTGTCAGTTTACATAAAATACATTCGTCAGGTTCCAATTTCCCTGCAATTCAAGAATATTGAGAAAGCTCCAATGAGTCCAGAGTCCATGCAGGGAAGTTTGTGATGTTATTGGCCTTCAAGTAATTGAAAAGTCGTGTGAATTGTTACATTAAATTTTTAGGTTGTACTACATTTTCCCACCAGATTAAATCAATGTCATTGGGATACAGCAGGTTTACAAACTATTGATGGTAATATGTAGATATGGACATAGTAATAGCACACGCGTAATACATATTTACTTCTACCATCATTTACTTCCTTTCTCTACATAAATGAAGCTGTTGATGAATGATGAAATCATGGGGTAATGAAAATGTTTGGGGTTTCGGGATTACCTGTTGATTGAATGGTTTGGTTTATTTATATGAACATAATGCATTTTGAGACTTGACGTCTCCAGAATCAGTGTTGATACGCTTTGCCTTCGTTGCAACAGTGTAGATGTGTTGGTAACCAGCCAAAAATAGAATGGTATTGGGC
Coding sequence (CDS)
ATGTCGGCTATGGAGAAGCTATTCGTCCAGATCTTCGAAAGGAAGAAGTGGATCATCGACCAGACCAAGCAGCAGACCGATCTCTTCGACCAACACCTCGCTTCCAAGCTCATTATCGATGGAATTGTTCCTCCTCCTTGGCTCCACTCCACTTTTCTTCATTCCCACATTTCCCATTTCCAAGAGGTGAACAAAAGTTTTATTTCTGGCGTTGAGTTCCCACGTTCGCCGCTTGATGCCCATCGCTCTAGTTTGAATGAAGCATTTGTTGCAGACAGTGGGGAGGAGTGGGAGCACAGATCGACTGAAGAAGCTGGTTCCTTAAACGATGATTTTGATGCAGGAAATAATCCAGCAATTTCACCCCAGTGTGACATAAGTAATGCCGGTGTCTTAAATTGCTCGCCTTGTATTGAAATGACTCCTGTTTCTCCTCATGGTCGAGGAGGCATAGTCTCAGACAATTACCGGGATCCTACTCTGTCATTGGCTCGGTTACACAGATCAAAATCTAGGCAAAAGGCTTTCGAGTTGCGTAATAGTGTGAAATCTACAAGGTGCCAATCTCGGTGTGAGAACAAGAGTGATTCCATTGCTGGTGGGATTGTAGGATCTGTTATTGGTTCACTGCAATCTGATCACGAAGATGAGTCAGGGTTGGCAAAGGCTTCCAGTAGCTGTAATGGAATTGGTTCTCTAGAAGAAGAATCTAATGTTGGTTGTGAGCAGAAGGATAGCTCTATTGGCTCCGATAAAGTTGGAGTAGTTGTAAGCCCTGGGTTGCAAAGTAGATTTATTGATGTGGACAATTCTTTAAACATTTTCTCTAAAAATGAAGAGTTATGTATAGCTGGAGGTTCAACACAGAATTCTTATAAAGTAAATGAGCAATTTGACTCACCTAGACCTTCTTCGGGAAAGATTGAAGAAGGGTCCGCATATTGCAGGAGCCAGGAGTATAGTTCTGATAAACCTGAAAAGTGTAGGTTGCAAAGTAGCTCTTTGGATGCAAATGAAACTTCATGCATTTCCCCTGAAGATGGAAGAGCAGGTCCTATAGGAGGTTCAAAATTCCATTCTGATCAAGTGGACGAGCAATTGGACTTGCCTAAACCTTCTTCTGACAATGTTGAGTGTAATGAAAAGGCTGTATTAGGAGATTGCAGGAGCCATGACTATGATCTTGATAAGGCTCTACAGTCTGAGTCACAACAAAGGTCCCCGGAAGTGGATGATTCATCATGCATTGATGCCAGTGATGGAAGATTGTTGGACTTGTATAACCCTTCTTCTGGAAAAGTTGAATGCTGCGAAGAAACTATTTCAGGACATTGCAGGAGCAAGGAATGTAATTTCGAAATTGCCCACCAGTCTGGGTCGCGATACAGCTCCCAGGATGTGGATAATTCTTCATATGTTGATGAGGTGGGAGGATCCTGTCCCATTGGAAGTTCAAAAGTGCACCCTCATGAAGTGAAAGAGAAATTGGACTTATCTAAATCTTCTTTCGACAATATCGAGTGCTGTGAAGAAAAAATATTAGGTGATTTGAGTAATCAGGAGTATAAACTTAACAATCCTCAAAAGTTTGGGATGCAACATAACTCCCTGGATGGGGACAATTCATCATGCTTTTCTTCTGTAGATGGAACTTTTTGCCGTGTTGGAAGTTCGAAGCAACATTCTGATCAAGGGATTGAGCGGTTGGAGTTGTTTAGACCTTCTTCTGTCAATAGTGAATGTCATGAAGAAGAACTAGAAGACTGTAGGACCCAGGACTGTAATTTTGATAATGCGGAACAGTCTGATGTAGACAAAAAATTCAGTTCACCAATAACGGAGGTAAGGGAGAATACATCCGATAAGAAGCCCTCCAGTTTTCTAGATGACAAGAGGGATGTCAGTGAAAAAGAAAAATGCAATTCACTCCTTCACATACCTTTGCCACAGATTCAGGTAGACTCAGTGAAGGAAAACGAATCTGATAAATGTGCATCTGAATCTCACAGTGAGAGGAGATATGAAGATACAGGAGACTTTAATGGAAATACTCTCTCATCTGGCAACAAGTCACTGCAAGGTTATGAAGAAGTAACTACTTGTTCTTTGCTGCAAAGTGATGAACCTGCTGAAAAAAATGTTTCTTTGAAAGATGGAGTATCAGATTTGCAGAATTCCCATGACAATGTAGTTGAAATTCCACCAGTGGATGCAAACGGTGCATCAGTTCCTATAGAAGATACAGAAACATTTAGAGATCACGTGGTAATGGTTCCTTGTGTTCCTCACGTTGGTGAAACGGATGGTTATTTGGAGCAGCAACTGAAAAGTGCAGGCATATCTCAGTGCGCAGATTCAGATTCCTTTGAGTATTGCACTGACGACTTTAACGGTAACCATCATTACTTATCAACAGAGTGCCAGATTGCAGAAACATCAATTGAGTTAAAAACTTTCAGCGCACTTACGAAGGCATCTAGTTCTCCTGAAGACGTGAGAAGGGTTCAGCCAGAATTGGGGATTGGTATTCCTGAATCTTTAGACTTGGGGAGTGAGCAACTTCAAATTATCAACGGGAGTCCCACAGATAAAATATTGATGCAGGAATTTGACACTGAAAAACCTGTCCTTGAATTTCAACGATTATCATTTTGTGAAGAAGGTTATCAACAATCAAATGTGAGCATTGTCCCTATTGAAATGTTGCTTTTGGAAAAGGAAGCTCATTCAATGCAATTGTCTGATTCTTCACCCACGCTTCTAGTTAAAGAGGATCTATCTAGGTTCAGAAACAATAACAGAGGCACGCTGTTGCAAAATGTCATGCTAGAGAGCCAAAGTTTGGATCCTGAAGAAAATCTTCAGTCTGGAGATAACAAACTTCCTGTTGATACCGGGAAGACGGAAAGAGAGGAGGATAAGGGGAAACTTACTTCTTGCTCACTTCTTACTCCCCTAATCCAAACTTCTCATTATCTTGGTGCCGACAAGGATATGCCTGCATTAGAGGGGTTCCTAATGCAGTCTGATGCTGAACAACCATGCATTTCTGTTGGTGGAATCAACCTTGACACATTAGAACTTTCAAAGTGTATGATAGAACGCGCTAGCATATTGGAGAAAATCTGTAAATCTGCTTGTATAAACAGTCCATTATCCTCATCTTCAGAAAGTCTTAAGTTGAACAAGGTGGCAGATTTGTACCATTCCCTTTCTAATGGTCTGTTAGAGAGCGTGGACTTGAAGAGTAACCTTCTGATGAATGATCAAAATAAGCTACTGAAGGATGGTAGTAACTTCTTGAATGGAGAAGTCAACTGTTCTCCTCATGGGTCATTTTCTGCTTGCCTGAAAAGCATTGGCAGTCATTCAGCTAGCGATGTTAGGAGGCCGTTTGTATCCCCCTTTAGTAAGTTGTTGGATAGAAATTCATTAAATTCCTCAAGTTCAGGAAAACGAAGCAGCCCGAATATAGAGCTTCCTTGCATTAGTGAAGAAGCTGAGAGTACAGAGGAGACTGATAATAAGTTTGCAAAGGATATGAAATCGAACATGCGAGTACCACTTGTTGACGTTACAGAAAATGCAAATGTTCCGGTAGCAGTTTCTGAAACTGTCATGTTTGCTGATAGATTGAGTTTAGAATCTTTAAACACGGAAGTCGGCAACACGGGGACTCATAATAGAACCAAAGAGAATTTGGCAAACCAGAAAAAGAGTAAAAGGAAATATTTGAATGAGGCTGTAGATCTTGATATCTTTCCAGGAGCAAACGGAGCTAAAAGAGTCACTAGGTCATCTTATAGTAGATTTAGCAGGTCAGATTTATCCTGTAAAGAAAATTTCAGAAAAGAAGGCTCTCGATTCTCAGGAAAGGAGACCAAGCATAAAAATATTGTGTCCAATATTACTTCTTTTATTCCTCTTGTCCAACAAAGAGAAGCTGCAACTATTTTGAAAGGGAAGAGAGATGTTAAGGTGAAGGCCATTGAGGCTGCTGAGGCTGCAAAACGCCTTGCAGAAAAGAAGGAAAATGAACGTCAAATGAAGAAAGAAGCCCTGAAACTTGAGAGAGCAAGAATGGAGCAAGAGAATTTGAGGCAGCTTGAACTTGAGAAAAAGAAGAAAGAAGAAGATAGAAAGAAGAAAGAGGAAGAAATGAAGAAAAGGAAGGCTGATAAGGCAGCAAAGAAAAGACAGAGAGAAGAAGAAGAGAGGAAGGAGAAAGAAAGAAAAAGAATGCATGTTGAAGAGGTTAGGAGACGATTACGAGAACATGGTGGGAAGTTACGATCAGATAAAGAGAATAAGGATGTGAAACCCCAAGCCAATGAACAAAAACCACTCGACAGAAAGGCATGTAAGGATGTGACAAACAAACTGGACAAGGAAAATGGACATGAAAAATTTGACAAACTCTCAGTTACCAAGTCCAAGAGTACTACAAGCGATGCTAGGAGGGAAAACTTTGTTGTGGAGAACGCACAACCAACGATTGTAGGATTTCTAGAGGCAGAGGCACTTGAAAATGGGATGGAGAGTAGAATCTCCGAAACAAGTGAACGAGAATCATATCAGATTTCTCCTTACAAAGCTTCTGATGATGAAGATGAAGAGGACGAAGATGATGGCATACGAAAAAATAAATTTGTACCTTCGTGGGCAAGTAAGGATCACGTAGCTGACCTTTTCGCTTCTCAGCAAAAATTGAATCCAGAAATTATATTTCCACCGAAAAGCTTTTGTGATATAGAGCAAGTTCTATTGCTTTGA
Protein sequence
MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHFQEVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNPAISPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFELRNSVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESNVGCEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQFDSPRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKFHSDQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDASDGRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVDEVGGSCPIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSLDGDNSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFDNAEQSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSVKENESDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSLKDGVSDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLKSAGISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQPELGIGIPESLDLGSEQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCEEGYQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQSLDPEENLQSGDNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFLMQSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLESVDLKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRRPFVSPFSKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMRVPLVDVTENANVPVAVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGANGAKRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQLELEKKKKEEDRKKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREHGGKLRSDKENKDVKPQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDARRENFVVENAQPTIVGFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWASKDHVADLFASQQKLNPEIIFPPKSFCDIEQVLLL*
Homology
BLAST of CsGy7G007850 vs. NCBI nr
Match:
XP_004148933.1 (uncharacterized protein LOC101214907 isoform X2 [Cucumis sativus] >KGN44031.1 hypothetical protein Csa_011858 [Cucumis sativus])
HSP 1 Score: 3048 bits (7903), Expect = 0.0
Identity = 1590/1590 (100.00%), Postives = 1590/1590 (100.00%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF 60
Query: 61 QEVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNPAI 120
QEVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNPAI
Sbjct: 61 QEVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNPAI 120
Query: 121 SPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFELRN 180
SPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFELRN
Sbjct: 121 SPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFELRN 180
Query: 181 SVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESNVG 240
SVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESNVG
Sbjct: 181 SVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESNVG 240
Query: 241 CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQFDS 300
CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQFDS
Sbjct: 241 CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQFDS 300
Query: 301 PRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKFHS 360
PRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKFHS
Sbjct: 301 PRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKFHS 360
Query: 361 DQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDASD 420
DQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDASD
Sbjct: 361 DQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDASD 420
Query: 421 GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVDEVGGSC 480
GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVDEVGGSC
Sbjct: 421 GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVDEVGGSC 480
Query: 481 PIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSLDGD 540
PIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSLDGD
Sbjct: 481 PIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSLDGD 540
Query: 541 NSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFDNAE 600
NSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFDNAE
Sbjct: 541 NSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFDNAE 600
Query: 601 QSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSVKENE 660
QSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSVKENE
Sbjct: 601 QSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSVKENE 660
Query: 661 SDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSLKDGV 720
SDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSLKDGV
Sbjct: 661 SDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSLKDGV 720
Query: 721 SDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLKSAGI 780
SDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLKSAGI
Sbjct: 721 SDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLKSAGI 780
Query: 781 SQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQPELG 840
SQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQPELG
Sbjct: 781 SQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQPELG 840
Query: 841 IGIPESLDLGSEQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCEEGYQQSNVSIVPIE 900
IGIPESLDLGSEQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCEEGYQQSNVSIVPIE
Sbjct: 841 IGIPESLDLGSEQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCEEGYQQSNVSIVPIE 900
Query: 901 MLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQSLDPEENLQSGDNK 960
MLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQSLDPEENLQSGDNK
Sbjct: 901 MLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQSLDPEENLQSGDNK 960
Query: 961 LPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFLMQSDAEQPCISVGG 1020
LPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFLMQSDAEQPCISVGG
Sbjct: 961 LPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFLMQSDAEQPCISVGG 1020
Query: 1021 INLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLESVD 1080
INLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLESVD
Sbjct: 1021 INLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLESVD 1080
Query: 1081 LKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRRPFVSPFSKLL 1140
LKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRRPFVSPFSKLL
Sbjct: 1081 LKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRRPFVSPFSKLL 1140
Query: 1141 DRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMRVPLVDVTENANVPV 1200
DRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMRVPLVDVTENANVPV
Sbjct: 1141 DRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMRVPLVDVTENANVPV 1200
Query: 1201 AVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGANGA 1260
AVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGANGA
Sbjct: 1201 AVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGANGA 1260
Query: 1261 KRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAATILK 1320
KRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAATILK
Sbjct: 1261 KRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAATILK 1320
Query: 1321 GKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQLELEKKKKEEDR 1380
GKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQLELEKKKKEEDR
Sbjct: 1321 GKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQLELEKKKKEEDR 1380
Query: 1381 KKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREHGGKLRSDKENKDVK 1440
KKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREHGGKLRSDKENKDVK
Sbjct: 1381 KKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREHGGKLRSDKENKDVK 1440
Query: 1441 PQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDARRENFVVENAQPTIV 1500
PQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDARRENFVVENAQPTIV
Sbjct: 1441 PQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDARRENFVVENAQPTIV 1500
Query: 1501 GFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWASKDHV 1560
GFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWASKDHV
Sbjct: 1501 GFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWASKDHV 1560
Query: 1561 ADLFASQQKLNPEIIFPPKSFCDIEQVLLL 1590
ADLFASQQKLNPEIIFPPKSFCDIEQVLLL
Sbjct: 1561 ADLFASQQKLNPEIIFPPKSFCDIEQVLLL 1590
BLAST of CsGy7G007850 vs. NCBI nr
Match:
XP_011658937.1 (uncharacterized protein LOC101214907 isoform X1 [Cucumis sativus] >XP_011658938.1 uncharacterized protein LOC101214907 isoform X1 [Cucumis sativus] >XP_031744660.1 uncharacterized protein LOC101214907 isoform X1 [Cucumis sativus] >XP_031744661.1 uncharacterized protein LOC101214907 isoform X1 [Cucumis sativus])
HSP 1 Score: 3043 bits (7890), Expect = 0.0
Identity = 1590/1592 (99.87%), Postives = 1590/1592 (99.87%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF 60
Query: 61 Q--EVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNP 120
Q EVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNP
Sbjct: 61 QVAEVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNP 120
Query: 121 AISPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFEL 180
AISPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFEL
Sbjct: 121 AISPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFEL 180
Query: 181 RNSVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESN 240
RNSVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESN
Sbjct: 181 RNSVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESN 240
Query: 241 VGCEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQF 300
VGCEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQF
Sbjct: 241 VGCEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQF 300
Query: 301 DSPRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKF 360
DSPRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKF
Sbjct: 301 DSPRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKF 360
Query: 361 HSDQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDA 420
HSDQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDA
Sbjct: 361 HSDQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDA 420
Query: 421 SDGRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVDEVGG 480
SDGRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVDEVGG
Sbjct: 421 SDGRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVDEVGG 480
Query: 481 SCPIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSLD 540
SCPIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSLD
Sbjct: 481 SCPIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSLD 540
Query: 541 GDNSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFDN 600
GDNSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFDN
Sbjct: 541 GDNSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFDN 600
Query: 601 AEQSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSVKE 660
AEQSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSVKE
Sbjct: 601 AEQSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSVKE 660
Query: 661 NESDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSLKD 720
NESDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSLKD
Sbjct: 661 NESDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSLKD 720
Query: 721 GVSDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLKSA 780
GVSDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLKSA
Sbjct: 721 GVSDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLKSA 780
Query: 781 GISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQPE 840
GISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQPE
Sbjct: 781 GISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQPE 840
Query: 841 LGIGIPESLDLGSEQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCEEGYQQSNVSIVP 900
LGIGIPESLDLGSEQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCEEGYQQSNVSIVP
Sbjct: 841 LGIGIPESLDLGSEQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCEEGYQQSNVSIVP 900
Query: 901 IEMLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQSLDPEENLQSGD 960
IEMLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQSLDPEENLQSGD
Sbjct: 901 IEMLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQSLDPEENLQSGD 960
Query: 961 NKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFLMQSDAEQPCISV 1020
NKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFLMQSDAEQPCISV
Sbjct: 961 NKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFLMQSDAEQPCISV 1020
Query: 1021 GGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLES 1080
GGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLES
Sbjct: 1021 GGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLES 1080
Query: 1081 VDLKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRRPFVSPFSK 1140
VDLKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRRPFVSPFSK
Sbjct: 1081 VDLKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRRPFVSPFSK 1140
Query: 1141 LLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMRVPLVDVTENANV 1200
LLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMRVPLVDVTENANV
Sbjct: 1141 LLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMRVPLVDVTENANV 1200
Query: 1201 PVAVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGAN 1260
PVAVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGAN
Sbjct: 1201 PVAVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGAN 1260
Query: 1261 GAKRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAATI 1320
GAKRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAATI
Sbjct: 1261 GAKRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAATI 1320
Query: 1321 LKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQLELEKKKKEE 1380
LKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQLELEKKKKEE
Sbjct: 1321 LKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQLELEKKKKEE 1380
Query: 1381 DRKKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREHGGKLRSDKENKD 1440
DRKKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREHGGKLRSDKENKD
Sbjct: 1381 DRKKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREHGGKLRSDKENKD 1440
Query: 1441 VKPQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDARRENFVVENAQPT 1500
VKPQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDARRENFVVENAQPT
Sbjct: 1441 VKPQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDARRENFVVENAQPT 1500
Query: 1501 IVGFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWASKD 1560
IVGFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWASKD
Sbjct: 1501 IVGFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWASKD 1560
Query: 1561 HVADLFASQQKLNPEIIFPPKSFCDIEQVLLL 1590
HVADLFASQQKLNPEIIFPPKSFCDIEQVLLL
Sbjct: 1561 HVADLFASQQKLNPEIIFPPKSFCDIEQVLLL 1592
BLAST of CsGy7G007850 vs. NCBI nr
Match:
XP_008463008.1 (PREDICTED: uncharacterized protein LOC103501253 isoform X2 [Cucumis melo])
HSP 1 Score: 2680 bits (6948), Expect = 0.0
Identity = 1428/1602 (89.14%), Postives = 1480/1602 (92.38%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQ +QQTDLFDQHLASKLIIDGIVPPPWLHS+FLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQARQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
Query: 61 QEVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNPAI 120
+EVNKSFISGVEFPRSPLD HRSSLNEAFVADSGEE EHRS EE GSLNDDFDAGN PA+
Sbjct: 61 EEVNKSFISGVEFPRSPLDTHRSSLNEAFVADSGEELEHRSNEETGSLNDDFDAGNRPAV 120
Query: 121 SPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFELRN 180
SPQCDI +AGVLNC+PCIEMTPVSPHGRG IVS+NYRDPTLSLARLHRSKSRQKA ELRN
Sbjct: 121 SPQCDIRSAGVLNCAPCIEMTPVSPHGRGAIVSENYRDPTLSLARLHRSKSRQKALELRN 180
Query: 181 SVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESNVG 240
SVKSTRCQSRCENKSDSIAG IVGS IG LQ+DHEDESGLAKASSSC GIGSLEEE+NVG
Sbjct: 181 SVKSTRCQSRCENKSDSIAGRIVGSAIGLLQADHEDESGLAKASSSCRGIGSLEEETNVG 240
Query: 241 CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQFDS 300
CEQK SSIGSDKVGVVVSPGLQSRFIDV+NSLNI SKNEELCIAGGSTQNSY+VNEQFDS
Sbjct: 241 CEQKRSSIGSDKVGVVVSPGLQSRFIDVENSLNISSKNEELCIAGGSTQNSYQVNEQFDS 300
Query: 301 PRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKFHS 360
PRPSSGKIEEGS YCRSQEYSSDKPEKCRLQ SSLDAN+TSCISP DGRAG IGG KFHS
Sbjct: 301 PRPSSGKIEEGSTYCRSQEYSSDKPEKCRLQCSSLDANKTSCISPVDGRAGTIGGPKFHS 360
Query: 361 DQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDASD 420
DQVDEQLDLPKPSSDNVECNE+AVLG CRSHDYDLD ALQS SQQ S EVDDSS IDA D
Sbjct: 361 DQVDEQLDLPKPSSDNVECNEEAVLGHCRSHDYDLDNALQSRSQQSSQEVDDSSIIDACD 420
Query: 421 GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVD-EVGGS 480
GRLLDLYNPSSGKVECC ETI GHC S+ECNFEIA QSGS+YS QDVD+SSYVD EVGGS
Sbjct: 421 GRLLDLYNPSSGKVECCGETILGHCWSQECNFEIAQQSGSQYSPQDVDDSSYVDSEVGGS 480
Query: 481 CPIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSLDG 540
CPIGSS VHP EVKE+LDLSK+S NIECCEEKILG LS+Q+YKL+NPQK GMQHNSLD
Sbjct: 481 CPIGSSNVHPREVKEQLDLSKTSSGNIECCEEKILGGLSSQDYKLDNPQKSGMQHNSLDA 540
Query: 541 DNSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFDN- 600
DNSSCFSSV+GTFC VGSSKQHSD E LELFRPSSVNSECHEEELEDCRTQDCNF+N
Sbjct: 541 DNSSCFSSVNGTFCAVGSSKQHSDLVSEPLELFRPSSVNSECHEEELEDCRTQDCNFNNN 600
Query: 601 AEQSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSVKE 660
A QS V K FSSPI EVRE TSDKK SSF+DDKRD SEKEK NSLLHIPLPQIQVDSVKE
Sbjct: 601 AVQSGVGKNFSSPIMEVREKTSDKKSSSFIDDKRDASEKEKSNSLLHIPLPQIQVDSVKE 660
Query: 661 NESDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSLKD 720
NESD+ ASESH+ERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAE+NVSLKD
Sbjct: 661 NESDQGASESHNERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEQNVSLKD 720
Query: 721 GVSDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLKSA 780
GVSDLQNSHDNVVEIPPVD NG SVP +DTETFRDHV+M P +VGETDGYLEQQLKS+
Sbjct: 721 GVSDLQNSHDNVVEIPPVDGNGTSVPRKDTETFRDHVIMAP---YVGETDGYLEQQLKSS 780
Query: 781 GISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQPE 840
GISQC SDSFEYCTDDFNGNHHY+STECQ AETSIELKTFS+LTKASSSPEDVRRV+ E
Sbjct: 781 GISQCEGSDSFEYCTDDFNGNHHYISTECQTAETSIELKTFSSLTKASSSPEDVRRVELE 840
Query: 841 LGIGIPESLDLGS----------EQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCEEG 900
LG G P SL LGS EQLQIINGSPTD ILM+EFDTEKPVLE QRLSFC EG
Sbjct: 841 LGSGFPGSLGLGSGIPGSLGLGGEQLQIINGSPTDNILMEEFDTEKPVLEIQRLSFCGEG 900
Query: 901 YQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQSL 960
YQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTL VKEDLSRFR+NNRGTLLQNVMLESQSL
Sbjct: 901 YQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQSL 960
Query: 961 DPEENLQSGDNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFLMQ 1020
D EENLQSG+N+LPVDT K EREEDKGKLTSCSLLTPLIQTSHY GADKDMPALEGFLMQ
Sbjct: 961 DREENLQSGENELPVDTEKMEREEDKGKLTSCSLLTPLIQTSHYFGADKDMPALEGFLMQ 1020
Query: 1021 SDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLY 1080
SDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACI+SPLSSSSES KLNKVADLY
Sbjct: 1021 SDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACIDSPLSSSSESFKLNKVADLY 1080
Query: 1081 HSLSNGLLESVDLKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDV 1140
HSLSNGLLESVDLKS LLM DQNKLLKDGSNFLNGEVNCSPHGSFS CLKS GSHSASDV
Sbjct: 1081 HSLSNGLLESVDLKSKLLMKDQNKLLKDGSNFLNGEVNCSPHGSFSDCLKSTGSHSASDV 1140
Query: 1141 RRPFVSPFSKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMRVP 1200
RRPF SPF KLLDRNSLNSSSSGKRSSPNIELPCISEEAES EE DN+FAKDMKSN RVP
Sbjct: 1141 RRPFASPFGKLLDRNSLNSSSSGKRSSPNIELPCISEEAESIEEIDNEFAKDMKSNKRVP 1200
Query: 1201 LVDVTENANVPVAVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLNEA 1260
LVD+TENANV V V E VMFADRLSLESLNTE+ N GTHNRTKENLANQK SKRKYLNEA
Sbjct: 1201 LVDITENANVSVTVPEAVMFADRLSLESLNTELSNAGTHNRTKENLANQKNSKRKYLNEA 1260
Query: 1261 VDLDIFPGANGAKRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIP 1320
VDLDI PGANGAKRVTRSSY+RFSRSDLSCKENFRK GSRFSGKETKHKNIVSNITSFIP
Sbjct: 1261 VDLDILPGANGAKRVTRSSYNRFSRSDLSCKENFRK-GSRFSGKETKHKNIVSNITSFIP 1320
Query: 1321 LVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQ 1380
LVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQ
Sbjct: 1321 LVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQ 1380
Query: 1381 LELEKKKKEEDRKKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREHGG 1440
+ELEKKKKEE+RKKKEEEMKKR+ADKA KKRQREEEERKEKERKRM VEEVRRRLREH G
Sbjct: 1381 IELEKKKKEEERKKKEEEMKKREADKAEKKRQREEEERKEKERKRMRVEEVRRRLREHSG 1440
Query: 1441 KLRSDKENKDVKPQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDARRE 1500
KLRSDKENKD KPQANEQKP RKACKDVTNKLDKENGHEKFDKLSVT+SKS+TSDA RE
Sbjct: 1441 KLRSDKENKDAKPQANEQKPRCRKACKDVTNKLDKENGHEKFDKLSVTESKSSTSDAGRE 1500
Query: 1501 NFVVENAQPTIVGFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKN 1560
NF+VEN+QPT V FLEAEALE GMES ISETSER+SYQISPYKASDDEDEEDE+DGIR N
Sbjct: 1501 NFLVENSQPTSVDFLEAEALEIGMESGISETSERQSYQISPYKASDDEDEEDEEDGIRNN 1560
Query: 1561 KFVPSWASKDHVADLFASQQKLNPEIIFPPKSFCDIEQVLLL 1590
KFVPSWASKD VA LFASQQKLNPEIIFPPKSFCDIEQVLLL
Sbjct: 1561 KFVPSWASKDRVAALFASQQKLNPEIIFPPKSFCDIEQVLLL 1598
BLAST of CsGy7G007850 vs. NCBI nr
Match:
XP_008463006.1 (PREDICTED: uncharacterized protein LOC103501253 isoform X1 [Cucumis melo])
HSP 1 Score: 2675 bits (6935), Expect = 0.0
Identity = 1428/1604 (89.03%), Postives = 1480/1604 (92.27%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQ +QQTDLFDQHLASKLIIDGIVPPPWLHS+FLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQARQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
Query: 61 Q--EVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNP 120
+ EVNKSFISGVEFPRSPLD HRSSLNEAFVADSGEE EHRS EE GSLNDDFDAGN P
Sbjct: 61 EVAEVNKSFISGVEFPRSPLDTHRSSLNEAFVADSGEELEHRSNEETGSLNDDFDAGNRP 120
Query: 121 AISPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFEL 180
A+SPQCDI +AGVLNC+PCIEMTPVSPHGRG IVS+NYRDPTLSLARLHRSKSRQKA EL
Sbjct: 121 AVSPQCDIRSAGVLNCAPCIEMTPVSPHGRGAIVSENYRDPTLSLARLHRSKSRQKALEL 180
Query: 181 RNSVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESN 240
RNSVKSTRCQSRCENKSDSIAG IVGS IG LQ+DHEDESGLAKASSSC GIGSLEEE+N
Sbjct: 181 RNSVKSTRCQSRCENKSDSIAGRIVGSAIGLLQADHEDESGLAKASSSCRGIGSLEEETN 240
Query: 241 VGCEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQF 300
VGCEQK SSIGSDKVGVVVSPGLQSRFIDV+NSLNI SKNEELCIAGGSTQNSY+VNEQF
Sbjct: 241 VGCEQKRSSIGSDKVGVVVSPGLQSRFIDVENSLNISSKNEELCIAGGSTQNSYQVNEQF 300
Query: 301 DSPRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKF 360
DSPRPSSGKIEEGS YCRSQEYSSDKPEKCRLQ SSLDAN+TSCISP DGRAG IGG KF
Sbjct: 301 DSPRPSSGKIEEGSTYCRSQEYSSDKPEKCRLQCSSLDANKTSCISPVDGRAGTIGGPKF 360
Query: 361 HSDQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDA 420
HSDQVDEQLDLPKPSSDNVECNE+AVLG CRSHDYDLD ALQS SQQ S EVDDSS IDA
Sbjct: 361 HSDQVDEQLDLPKPSSDNVECNEEAVLGHCRSHDYDLDNALQSRSQQSSQEVDDSSIIDA 420
Query: 421 SDGRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVD-EVG 480
DGRLLDLYNPSSGKVECC ETI GHC S+ECNFEIA QSGS+YS QDVD+SSYVD EVG
Sbjct: 421 CDGRLLDLYNPSSGKVECCGETILGHCWSQECNFEIAQQSGSQYSPQDVDDSSYVDSEVG 480
Query: 481 GSCPIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSL 540
GSCPIGSS VHP EVKE+LDLSK+S NIECCEEKILG LS+Q+YKL+NPQK GMQHNSL
Sbjct: 481 GSCPIGSSNVHPREVKEQLDLSKTSSGNIECCEEKILGGLSSQDYKLDNPQKSGMQHNSL 540
Query: 541 DGDNSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFD 600
D DNSSCFSSV+GTFC VGSSKQHSD E LELFRPSSVNSECHEEELEDCRTQDCNF+
Sbjct: 541 DADNSSCFSSVNGTFCAVGSSKQHSDLVSEPLELFRPSSVNSECHEEELEDCRTQDCNFN 600
Query: 601 N-AEQSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSV 660
N A QS V K FSSPI EVRE TSDKK SSF+DDKRD SEKEK NSLLHIPLPQIQVDSV
Sbjct: 601 NNAVQSGVGKNFSSPIMEVREKTSDKKSSSFIDDKRDASEKEKSNSLLHIPLPQIQVDSV 660
Query: 661 KENESDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSL 720
KENESD+ ASESH+ERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAE+NVSL
Sbjct: 661 KENESDQGASESHNERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEQNVSL 720
Query: 721 KDGVSDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLK 780
KDGVSDLQNSHDNVVEIPPVD NG SVP +DTETFRDHV+M P +VGETDGYLEQQLK
Sbjct: 721 KDGVSDLQNSHDNVVEIPPVDGNGTSVPRKDTETFRDHVIMAP---YVGETDGYLEQQLK 780
Query: 781 SAGISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQ 840
S+GISQC SDSFEYCTDDFNGNHHY+STECQ AETSIELKTFS+LTKASSSPEDVRRV+
Sbjct: 781 SSGISQCEGSDSFEYCTDDFNGNHHYISTECQTAETSIELKTFSSLTKASSSPEDVRRVE 840
Query: 841 PELGIGIPESLDLGS----------EQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCE 900
ELG G P SL LGS EQLQIINGSPTD ILM+EFDTEKPVLE QRLSFC
Sbjct: 841 LELGSGFPGSLGLGSGIPGSLGLGGEQLQIINGSPTDNILMEEFDTEKPVLEIQRLSFCG 900
Query: 901 EGYQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQ 960
EGYQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTL VKEDLSRFR+NNRGTLLQNVMLESQ
Sbjct: 901 EGYQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQ 960
Query: 961 SLDPEENLQSGDNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFL 1020
SLD EENLQSG+N+LPVDT K EREEDKGKLTSCSLLTPLIQTSHY GADKDMPALEGFL
Sbjct: 961 SLDREENLQSGENELPVDTEKMEREEDKGKLTSCSLLTPLIQTSHYFGADKDMPALEGFL 1020
Query: 1021 MQSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVAD 1080
MQSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACI+SPLSSSSES KLNKVAD
Sbjct: 1021 MQSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACIDSPLSSSSESFKLNKVAD 1080
Query: 1081 LYHSLSNGLLESVDLKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSAS 1140
LYHSLSNGLLESVDLKS LLM DQNKLLKDGSNFLNGEVNCSPHGSFS CLKS GSHSAS
Sbjct: 1081 LYHSLSNGLLESVDLKSKLLMKDQNKLLKDGSNFLNGEVNCSPHGSFSDCLKSTGSHSAS 1140
Query: 1141 DVRRPFVSPFSKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMR 1200
DVRRPF SPF KLLDRNSLNSSSSGKRSSPNIELPCISEEAES EE DN+FAKDMKSN R
Sbjct: 1141 DVRRPFASPFGKLLDRNSLNSSSSGKRSSPNIELPCISEEAESIEEIDNEFAKDMKSNKR 1200
Query: 1201 VPLVDVTENANVPVAVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLN 1260
VPLVD+TENANV V V E VMFADRLSLESLNTE+ N GTHNRTKENLANQK SKRKYLN
Sbjct: 1201 VPLVDITENANVSVTVPEAVMFADRLSLESLNTELSNAGTHNRTKENLANQKNSKRKYLN 1260
Query: 1261 EAVDLDIFPGANGAKRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSF 1320
EAVDLDI PGANGAKRVTRSSY+RFSRSDLSCKENFRK GSRFSGKETKHKNIVSNITSF
Sbjct: 1261 EAVDLDILPGANGAKRVTRSSYNRFSRSDLSCKENFRK-GSRFSGKETKHKNIVSNITSF 1320
Query: 1321 IPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENL 1380
IPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENL
Sbjct: 1321 IPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENL 1380
Query: 1381 RQLELEKKKKEEDRKKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREH 1440
RQ+ELEKKKKEE+RKKKEEEMKKR+ADKA KKRQREEEERKEKERKRM VEEVRRRLREH
Sbjct: 1381 RQIELEKKKKEEERKKKEEEMKKREADKAEKKRQREEEERKEKERKRMRVEEVRRRLREH 1440
Query: 1441 GGKLRSDKENKDVKPQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDAR 1500
GKLRSDKENKD KPQANEQKP RKACKDVTNKLDKENGHEKFDKLSVT+SKS+TSDA
Sbjct: 1441 SGKLRSDKENKDAKPQANEQKPRCRKACKDVTNKLDKENGHEKFDKLSVTESKSSTSDAG 1500
Query: 1501 RENFVVENAQPTIVGFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIR 1560
RENF+VEN+QPT V FLEAEALE GMES ISETSER+SYQISPYKASDDEDEEDE+DGIR
Sbjct: 1501 RENFLVENSQPTSVDFLEAEALEIGMESGISETSERQSYQISPYKASDDEDEEDEEDGIR 1560
Query: 1561 KNKFVPSWASKDHVADLFASQQKLNPEIIFPPKSFCDIEQVLLL 1590
NKFVPSWASKD VA LFASQQKLNPEIIFPPKSFCDIEQVLLL
Sbjct: 1561 NNKFVPSWASKDRVAALFASQQKLNPEIIFPPKSFCDIEQVLLL 1600
BLAST of CsGy7G007850 vs. NCBI nr
Match:
XP_008463009.1 (PREDICTED: uncharacterized protein LOC103501253 isoform X3 [Cucumis melo])
HSP 1 Score: 2612 bits (6771), Expect = 0.0
Identity = 1397/1579 (88.47%), Postives = 1452/1579 (91.96%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQ +QQTDLFDQHLASKLIIDGIVPPPWLHS+FLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQARQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
Query: 61 Q--EVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNP 120
+ EVNKSFISGVEFPRSPLD HRSSLNEAFVADSGEE EHRS EE GSLNDDFDAGN P
Sbjct: 61 EVAEVNKSFISGVEFPRSPLDTHRSSLNEAFVADSGEELEHRSNEETGSLNDDFDAGNRP 120
Query: 121 AISPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFEL 180
A+SPQCDI +AGVLNC+PCIEMTPVSPHGRG IVS+NYRDPTLSLARLHRSKSRQKA EL
Sbjct: 121 AVSPQCDIRSAGVLNCAPCIEMTPVSPHGRGAIVSENYRDPTLSLARLHRSKSRQKALEL 180
Query: 181 RNSVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESN 240
RNSVKSTRCQSRCENKSDSIAG IVGS IG LQ+DHEDESGLAKASSSC GIGSLEEE+N
Sbjct: 181 RNSVKSTRCQSRCENKSDSIAGRIVGSAIGLLQADHEDESGLAKASSSCRGIGSLEEETN 240
Query: 241 VGCEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQF 300
VGCEQK SSIGSDKVGVVVSPGLQSRFIDV+NSLNI SKNEELCIAGGSTQNSY+VNEQF
Sbjct: 241 VGCEQKRSSIGSDKVGVVVSPGLQSRFIDVENSLNISSKNEELCIAGGSTQNSYQVNEQF 300
Query: 301 DSPRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKF 360
DSPRPSSGKIEEGS YCRSQEYSSDKPEKCRLQ SSLDAN+TSCISP DGRAG IGG KF
Sbjct: 301 DSPRPSSGKIEEGSTYCRSQEYSSDKPEKCRLQCSSLDANKTSCISPVDGRAGTIGGPKF 360
Query: 361 HSDQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDA 420
HSDQVDEQLDLPKPSSDNVECNE+AVLG CRSHDYDLD ALQS SQQ S EVDDSS IDA
Sbjct: 361 HSDQVDEQLDLPKPSSDNVECNEEAVLGHCRSHDYDLDNALQSRSQQSSQEVDDSSIIDA 420
Query: 421 SDGRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVD-EVG 480
DGRLLDLYNPSSGKVECC ETI GHC S+ECNFEIA QSGS+YS QDVD+SSYVD EVG
Sbjct: 421 CDGRLLDLYNPSSGKVECCGETILGHCWSQECNFEIAQQSGSQYSPQDVDDSSYVDSEVG 480
Query: 481 GSCPIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSL 540
GSCPIGSS VHP EVKE+LDLSK+S NIECCEEKILG LS+Q+YKL+NPQK GMQHNSL
Sbjct: 481 GSCPIGSSNVHPREVKEQLDLSKTSSGNIECCEEKILGGLSSQDYKLDNPQKSGMQHNSL 540
Query: 541 DGDNSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFD 600
D DNSSCFSSV+GTFC VGSSKQHSD E LELFRPSSVNSECHEEELEDCRTQDCNF+
Sbjct: 541 DADNSSCFSSVNGTFCAVGSSKQHSDLVSEPLELFRPSSVNSECHEEELEDCRTQDCNFN 600
Query: 601 N-AEQSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSV 660
N A QS V K FSSPI EVRE TSDKK SSF+DDKRD SEKEK NSLLHIPLPQIQVDSV
Sbjct: 601 NNAVQSGVGKNFSSPIMEVREKTSDKKSSSFIDDKRDASEKEKSNSLLHIPLPQIQVDSV 660
Query: 661 KENESDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSL 720
KENESD+ ASESH+ERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAE+NVSL
Sbjct: 661 KENESDQGASESHNERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEQNVSL 720
Query: 721 KDGVSDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLK 780
KDGVSDLQNSHDNVVEIPPVD NG SVP +DTETFRDHV+M P +VGETDGYLEQQLK
Sbjct: 721 KDGVSDLQNSHDNVVEIPPVDGNGTSVPRKDTETFRDHVIMAP---YVGETDGYLEQQLK 780
Query: 781 SAGISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQ 840
S+GISQC SDSFEYCTDDFNGNHHY+STECQ AETSIELKTFS+LTKASSSPEDVRRV+
Sbjct: 781 SSGISQCEGSDSFEYCTDDFNGNHHYISTECQTAETSIELKTFSSLTKASSSPEDVRRVE 840
Query: 841 PELGIGIPESLDLGS----------EQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCE 900
ELG G P SL LGS EQLQIINGSPTD ILM+EFDTEKPVLE QRLSFC
Sbjct: 841 LELGSGFPGSLGLGSGIPGSLGLGGEQLQIINGSPTDNILMEEFDTEKPVLEIQRLSFCG 900
Query: 901 EGYQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQ 960
EGYQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTL VKEDLSRFR+NNRGTLLQNVMLESQ
Sbjct: 901 EGYQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQ 960
Query: 961 SLDPEENLQSGDNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFL 1020
SLD EENLQSG+N+LPVDT K EREEDKGKLTSCSLLTPLIQTSHY GADKDMPALEGFL
Sbjct: 961 SLDREENLQSGENELPVDTEKMEREEDKGKLTSCSLLTPLIQTSHYFGADKDMPALEGFL 1020
Query: 1021 MQSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVAD 1080
MQSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACI+SPLSSSSES KLNKVAD
Sbjct: 1021 MQSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACIDSPLSSSSESFKLNKVAD 1080
Query: 1081 LYHSLSNGLLESVDLKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSAS 1140
LYHSLSNGLLESVDLKS LLM DQNKLLKDGSNFLNGEVNCSPHGSFS CLKS GSHSAS
Sbjct: 1081 LYHSLSNGLLESVDLKSKLLMKDQNKLLKDGSNFLNGEVNCSPHGSFSDCLKSTGSHSAS 1140
Query: 1141 DVRRPFVSPFSKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMR 1200
DVRRPF SPF KLLDRNSLNSSSSGKRSSPNIELPCISEEAES EE DN+FAKDMKSN R
Sbjct: 1141 DVRRPFASPFGKLLDRNSLNSSSSGKRSSPNIELPCISEEAESIEEIDNEFAKDMKSNKR 1200
Query: 1201 VPLVDVTENANVPVAVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLN 1260
VPLVD+TENANV V V E VMFADRLSLESLNTE+ N GTHNRTKENLANQK SKRKYLN
Sbjct: 1201 VPLVDITENANVSVTVPEAVMFADRLSLESLNTELSNAGTHNRTKENLANQKNSKRKYLN 1260
Query: 1261 EAVDLDIFPGANGAKRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSF 1320
EAVDLDI PGANGAKRVTRSSY+RFSRSDLSCKENFRK GSRFSGKETKHKNIVSNITSF
Sbjct: 1261 EAVDLDILPGANGAKRVTRSSYNRFSRSDLSCKENFRK-GSRFSGKETKHKNIVSNITSF 1320
Query: 1321 IPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENL 1380
IPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENL
Sbjct: 1321 IPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENL 1380
Query: 1381 RQLELEKKKKEEDRKKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREH 1440
RQ+ELEKKKKEE+RKKKEEEMKKR+ADKA KKRQREEEERKEKERKRM VEEVRRRLREH
Sbjct: 1381 RQIELEKKKKEEERKKKEEEMKKREADKAEKKRQREEEERKEKERKRMRVEEVRRRLREH 1440
Query: 1441 GGKLRSDKENKDVKPQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDAR 1500
GKLRSDKENKD KPQANEQKP RKACKDVTNKLDKENGHEKFDKLSVT+SKS+TSDA
Sbjct: 1441 SGKLRSDKENKDAKPQANEQKPRCRKACKDVTNKLDKENGHEKFDKLSVTESKSSTSDAG 1500
Query: 1501 RENFVVENAQPTIVGFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIR 1560
RENF+VEN+QPT V FLEAEALE GMES ISETSER+SYQISPYKASDDEDEEDE+DGIR
Sbjct: 1501 RENFLVENSQPTSVDFLEAEALEIGMESGISETSERQSYQISPYKASDDEDEEDEEDGIR 1560
Query: 1561 KNKFVPSWASKDHVADLFA 1565
NKFVPSWA +D+F+
Sbjct: 1561 NNKFVPSWARCVTWSDVFS 1575
BLAST of CsGy7G007850 vs. ExPASy TrEMBL
Match:
A0A0A0K8D1 (INCENP_ARK-bind domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G115340 PE=3 SV=1)
HSP 1 Score: 3048 bits (7903), Expect = 0.0
Identity = 1590/1590 (100.00%), Postives = 1590/1590 (100.00%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF 60
Query: 61 QEVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNPAI 120
QEVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNPAI
Sbjct: 61 QEVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNPAI 120
Query: 121 SPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFELRN 180
SPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFELRN
Sbjct: 121 SPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFELRN 180
Query: 181 SVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESNVG 240
SVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESNVG
Sbjct: 181 SVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESNVG 240
Query: 241 CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQFDS 300
CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQFDS
Sbjct: 241 CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQFDS 300
Query: 301 PRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKFHS 360
PRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKFHS
Sbjct: 301 PRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKFHS 360
Query: 361 DQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDASD 420
DQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDASD
Sbjct: 361 DQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDASD 420
Query: 421 GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVDEVGGSC 480
GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVDEVGGSC
Sbjct: 421 GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVDEVGGSC 480
Query: 481 PIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSLDGD 540
PIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSLDGD
Sbjct: 481 PIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSLDGD 540
Query: 541 NSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFDNAE 600
NSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFDNAE
Sbjct: 541 NSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFDNAE 600
Query: 601 QSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSVKENE 660
QSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSVKENE
Sbjct: 601 QSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSVKENE 660
Query: 661 SDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSLKDGV 720
SDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSLKDGV
Sbjct: 661 SDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSLKDGV 720
Query: 721 SDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLKSAGI 780
SDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLKSAGI
Sbjct: 721 SDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLKSAGI 780
Query: 781 SQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQPELG 840
SQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQPELG
Sbjct: 781 SQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQPELG 840
Query: 841 IGIPESLDLGSEQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCEEGYQQSNVSIVPIE 900
IGIPESLDLGSEQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCEEGYQQSNVSIVPIE
Sbjct: 841 IGIPESLDLGSEQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCEEGYQQSNVSIVPIE 900
Query: 901 MLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQSLDPEENLQSGDNK 960
MLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQSLDPEENLQSGDNK
Sbjct: 901 MLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQSLDPEENLQSGDNK 960
Query: 961 LPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFLMQSDAEQPCISVGG 1020
LPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFLMQSDAEQPCISVGG
Sbjct: 961 LPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFLMQSDAEQPCISVGG 1020
Query: 1021 INLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLESVD 1080
INLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLESVD
Sbjct: 1021 INLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLESVD 1080
Query: 1081 LKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRRPFVSPFSKLL 1140
LKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRRPFVSPFSKLL
Sbjct: 1081 LKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDVRRPFVSPFSKLL 1140
Query: 1141 DRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMRVPLVDVTENANVPV 1200
DRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMRVPLVDVTENANVPV
Sbjct: 1141 DRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMRVPLVDVTENANVPV 1200
Query: 1201 AVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGANGA 1260
AVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGANGA
Sbjct: 1201 AVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGANGA 1260
Query: 1261 KRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAATILK 1320
KRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAATILK
Sbjct: 1261 KRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQREAATILK 1320
Query: 1321 GKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQLELEKKKKEEDR 1380
GKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQLELEKKKKEEDR
Sbjct: 1321 GKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQLELEKKKKEEDR 1380
Query: 1381 KKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREHGGKLRSDKENKDVK 1440
KKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREHGGKLRSDKENKDVK
Sbjct: 1381 KKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREHGGKLRSDKENKDVK 1440
Query: 1441 PQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDARRENFVVENAQPTIV 1500
PQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDARRENFVVENAQPTIV
Sbjct: 1441 PQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDARRENFVVENAQPTIV 1500
Query: 1501 GFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWASKDHV 1560
GFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWASKDHV
Sbjct: 1501 GFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKNKFVPSWASKDHV 1560
Query: 1561 ADLFASQQKLNPEIIFPPKSFCDIEQVLLL 1590
ADLFASQQKLNPEIIFPPKSFCDIEQVLLL
Sbjct: 1561 ADLFASQQKLNPEIIFPPKSFCDIEQVLLL 1590
BLAST of CsGy7G007850 vs. ExPASy TrEMBL
Match:
A0A1S3CJT1 (uncharacterized protein LOC103501253 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501253 PE=3 SV=1)
HSP 1 Score: 2680 bits (6948), Expect = 0.0
Identity = 1428/1602 (89.14%), Postives = 1480/1602 (92.38%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQ +QQTDLFDQHLASKLIIDGIVPPPWLHS+FLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQARQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
Query: 61 QEVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNPAI 120
+EVNKSFISGVEFPRSPLD HRSSLNEAFVADSGEE EHRS EE GSLNDDFDAGN PA+
Sbjct: 61 EEVNKSFISGVEFPRSPLDTHRSSLNEAFVADSGEELEHRSNEETGSLNDDFDAGNRPAV 120
Query: 121 SPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFELRN 180
SPQCDI +AGVLNC+PCIEMTPVSPHGRG IVS+NYRDPTLSLARLHRSKSRQKA ELRN
Sbjct: 121 SPQCDIRSAGVLNCAPCIEMTPVSPHGRGAIVSENYRDPTLSLARLHRSKSRQKALELRN 180
Query: 181 SVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESNVG 240
SVKSTRCQSRCENKSDSIAG IVGS IG LQ+DHEDESGLAKASSSC GIGSLEEE+NVG
Sbjct: 181 SVKSTRCQSRCENKSDSIAGRIVGSAIGLLQADHEDESGLAKASSSCRGIGSLEEETNVG 240
Query: 241 CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQFDS 300
CEQK SSIGSDKVGVVVSPGLQSRFIDV+NSLNI SKNEELCIAGGSTQNSY+VNEQFDS
Sbjct: 241 CEQKRSSIGSDKVGVVVSPGLQSRFIDVENSLNISSKNEELCIAGGSTQNSYQVNEQFDS 300
Query: 301 PRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKFHS 360
PRPSSGKIEEGS YCRSQEYSSDKPEKCRLQ SSLDAN+TSCISP DGRAG IGG KFHS
Sbjct: 301 PRPSSGKIEEGSTYCRSQEYSSDKPEKCRLQCSSLDANKTSCISPVDGRAGTIGGPKFHS 360
Query: 361 DQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDASD 420
DQVDEQLDLPKPSSDNVECNE+AVLG CRSHDYDLD ALQS SQQ S EVDDSS IDA D
Sbjct: 361 DQVDEQLDLPKPSSDNVECNEEAVLGHCRSHDYDLDNALQSRSQQSSQEVDDSSIIDACD 420
Query: 421 GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVD-EVGGS 480
GRLLDLYNPSSGKVECC ETI GHC S+ECNFEIA QSGS+YS QDVD+SSYVD EVGGS
Sbjct: 421 GRLLDLYNPSSGKVECCGETILGHCWSQECNFEIAQQSGSQYSPQDVDDSSYVDSEVGGS 480
Query: 481 CPIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSLDG 540
CPIGSS VHP EVKE+LDLSK+S NIECCEEKILG LS+Q+YKL+NPQK GMQHNSLD
Sbjct: 481 CPIGSSNVHPREVKEQLDLSKTSSGNIECCEEKILGGLSSQDYKLDNPQKSGMQHNSLDA 540
Query: 541 DNSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFDN- 600
DNSSCFSSV+GTFC VGSSKQHSD E LELFRPSSVNSECHEEELEDCRTQDCNF+N
Sbjct: 541 DNSSCFSSVNGTFCAVGSSKQHSDLVSEPLELFRPSSVNSECHEEELEDCRTQDCNFNNN 600
Query: 601 AEQSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSVKE 660
A QS V K FSSPI EVRE TSDKK SSF+DDKRD SEKEK NSLLHIPLPQIQVDSVKE
Sbjct: 601 AVQSGVGKNFSSPIMEVREKTSDKKSSSFIDDKRDASEKEKSNSLLHIPLPQIQVDSVKE 660
Query: 661 NESDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSLKD 720
NESD+ ASESH+ERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAE+NVSLKD
Sbjct: 661 NESDQGASESHNERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEQNVSLKD 720
Query: 721 GVSDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLKSA 780
GVSDLQNSHDNVVEIPPVD NG SVP +DTETFRDHV+M P +VGETDGYLEQQLKS+
Sbjct: 721 GVSDLQNSHDNVVEIPPVDGNGTSVPRKDTETFRDHVIMAP---YVGETDGYLEQQLKSS 780
Query: 781 GISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQPE 840
GISQC SDSFEYCTDDFNGNHHY+STECQ AETSIELKTFS+LTKASSSPEDVRRV+ E
Sbjct: 781 GISQCEGSDSFEYCTDDFNGNHHYISTECQTAETSIELKTFSSLTKASSSPEDVRRVELE 840
Query: 841 LGIGIPESLDLGS----------EQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCEEG 900
LG G P SL LGS EQLQIINGSPTD ILM+EFDTEKPVLE QRLSFC EG
Sbjct: 841 LGSGFPGSLGLGSGIPGSLGLGGEQLQIINGSPTDNILMEEFDTEKPVLEIQRLSFCGEG 900
Query: 901 YQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQSL 960
YQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTL VKEDLSRFR+NNRGTLLQNVMLESQSL
Sbjct: 901 YQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQSL 960
Query: 961 DPEENLQSGDNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFLMQ 1020
D EENLQSG+N+LPVDT K EREEDKGKLTSCSLLTPLIQTSHY GADKDMPALEGFLMQ
Sbjct: 961 DREENLQSGENELPVDTEKMEREEDKGKLTSCSLLTPLIQTSHYFGADKDMPALEGFLMQ 1020
Query: 1021 SDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLY 1080
SDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACI+SPLSSSSES KLNKVADLY
Sbjct: 1021 SDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACIDSPLSSSSESFKLNKVADLY 1080
Query: 1081 HSLSNGLLESVDLKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDV 1140
HSLSNGLLESVDLKS LLM DQNKLLKDGSNFLNGEVNCSPHGSFS CLKS GSHSASDV
Sbjct: 1081 HSLSNGLLESVDLKSKLLMKDQNKLLKDGSNFLNGEVNCSPHGSFSDCLKSTGSHSASDV 1140
Query: 1141 RRPFVSPFSKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMRVP 1200
RRPF SPF KLLDRNSLNSSSSGKRSSPNIELPCISEEAES EE DN+FAKDMKSN RVP
Sbjct: 1141 RRPFASPFGKLLDRNSLNSSSSGKRSSPNIELPCISEEAESIEEIDNEFAKDMKSNKRVP 1200
Query: 1201 LVDVTENANVPVAVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLNEA 1260
LVD+TENANV V V E VMFADRLSLESLNTE+ N GTHNRTKENLANQK SKRKYLNEA
Sbjct: 1201 LVDITENANVSVTVPEAVMFADRLSLESLNTELSNAGTHNRTKENLANQKNSKRKYLNEA 1260
Query: 1261 VDLDIFPGANGAKRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIP 1320
VDLDI PGANGAKRVTRSSY+RFSRSDLSCKENFRK GSRFSGKETKHKNIVSNITSFIP
Sbjct: 1261 VDLDILPGANGAKRVTRSSYNRFSRSDLSCKENFRK-GSRFSGKETKHKNIVSNITSFIP 1320
Query: 1321 LVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQ 1380
LVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQ
Sbjct: 1321 LVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQ 1380
Query: 1381 LELEKKKKEEDRKKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREHGG 1440
+ELEKKKKEE+RKKKEEEMKKR+ADKA KKRQREEEERKEKERKRM VEEVRRRLREH G
Sbjct: 1381 IELEKKKKEEERKKKEEEMKKREADKAEKKRQREEEERKEKERKRMRVEEVRRRLREHSG 1440
Query: 1441 KLRSDKENKDVKPQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDARRE 1500
KLRSDKENKD KPQANEQKP RKACKDVTNKLDKENGHEKFDKLSVT+SKS+TSDA RE
Sbjct: 1441 KLRSDKENKDAKPQANEQKPRCRKACKDVTNKLDKENGHEKFDKLSVTESKSSTSDAGRE 1500
Query: 1501 NFVVENAQPTIVGFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIRKN 1560
NF+VEN+QPT V FLEAEALE GMES ISETSER+SYQISPYKASDDEDEEDE+DGIR N
Sbjct: 1501 NFLVENSQPTSVDFLEAEALEIGMESGISETSERQSYQISPYKASDDEDEEDEEDGIRNN 1560
Query: 1561 KFVPSWASKDHVADLFASQQKLNPEIIFPPKSFCDIEQVLLL 1590
KFVPSWASKD VA LFASQQKLNPEIIFPPKSFCDIEQVLLL
Sbjct: 1561 KFVPSWASKDRVAALFASQQKLNPEIIFPPKSFCDIEQVLLL 1598
BLAST of CsGy7G007850 vs. ExPASy TrEMBL
Match:
A0A1S3CIN7 (uncharacterized protein LOC103501253 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501253 PE=3 SV=1)
HSP 1 Score: 2675 bits (6935), Expect = 0.0
Identity = 1428/1604 (89.03%), Postives = 1480/1604 (92.27%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQ +QQTDLFDQHLASKLIIDGIVPPPWLHS+FLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQARQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
Query: 61 Q--EVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNP 120
+ EVNKSFISGVEFPRSPLD HRSSLNEAFVADSGEE EHRS EE GSLNDDFDAGN P
Sbjct: 61 EVAEVNKSFISGVEFPRSPLDTHRSSLNEAFVADSGEELEHRSNEETGSLNDDFDAGNRP 120
Query: 121 AISPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFEL 180
A+SPQCDI +AGVLNC+PCIEMTPVSPHGRG IVS+NYRDPTLSLARLHRSKSRQKA EL
Sbjct: 121 AVSPQCDIRSAGVLNCAPCIEMTPVSPHGRGAIVSENYRDPTLSLARLHRSKSRQKALEL 180
Query: 181 RNSVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESN 240
RNSVKSTRCQSRCENKSDSIAG IVGS IG LQ+DHEDESGLAKASSSC GIGSLEEE+N
Sbjct: 181 RNSVKSTRCQSRCENKSDSIAGRIVGSAIGLLQADHEDESGLAKASSSCRGIGSLEEETN 240
Query: 241 VGCEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQF 300
VGCEQK SSIGSDKVGVVVSPGLQSRFIDV+NSLNI SKNEELCIAGGSTQNSY+VNEQF
Sbjct: 241 VGCEQKRSSIGSDKVGVVVSPGLQSRFIDVENSLNISSKNEELCIAGGSTQNSYQVNEQF 300
Query: 301 DSPRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKF 360
DSPRPSSGKIEEGS YCRSQEYSSDKPEKCRLQ SSLDAN+TSCISP DGRAG IGG KF
Sbjct: 301 DSPRPSSGKIEEGSTYCRSQEYSSDKPEKCRLQCSSLDANKTSCISPVDGRAGTIGGPKF 360
Query: 361 HSDQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDA 420
HSDQVDEQLDLPKPSSDNVECNE+AVLG CRSHDYDLD ALQS SQQ S EVDDSS IDA
Sbjct: 361 HSDQVDEQLDLPKPSSDNVECNEEAVLGHCRSHDYDLDNALQSRSQQSSQEVDDSSIIDA 420
Query: 421 SDGRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVD-EVG 480
DGRLLDLYNPSSGKVECC ETI GHC S+ECNFEIA QSGS+YS QDVD+SSYVD EVG
Sbjct: 421 CDGRLLDLYNPSSGKVECCGETILGHCWSQECNFEIAQQSGSQYSPQDVDDSSYVDSEVG 480
Query: 481 GSCPIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSL 540
GSCPIGSS VHP EVKE+LDLSK+S NIECCEEKILG LS+Q+YKL+NPQK GMQHNSL
Sbjct: 481 GSCPIGSSNVHPREVKEQLDLSKTSSGNIECCEEKILGGLSSQDYKLDNPQKSGMQHNSL 540
Query: 541 DGDNSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFD 600
D DNSSCFSSV+GTFC VGSSKQHSD E LELFRPSSVNSECHEEELEDCRTQDCNF+
Sbjct: 541 DADNSSCFSSVNGTFCAVGSSKQHSDLVSEPLELFRPSSVNSECHEEELEDCRTQDCNFN 600
Query: 601 N-AEQSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSV 660
N A QS V K FSSPI EVRE TSDKK SSF+DDKRD SEKEK NSLLHIPLPQIQVDSV
Sbjct: 601 NNAVQSGVGKNFSSPIMEVREKTSDKKSSSFIDDKRDASEKEKSNSLLHIPLPQIQVDSV 660
Query: 661 KENESDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSL 720
KENESD+ ASESH+ERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAE+NVSL
Sbjct: 661 KENESDQGASESHNERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEQNVSL 720
Query: 721 KDGVSDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLK 780
KDGVSDLQNSHDNVVEIPPVD NG SVP +DTETFRDHV+M P +VGETDGYLEQQLK
Sbjct: 721 KDGVSDLQNSHDNVVEIPPVDGNGTSVPRKDTETFRDHVIMAP---YVGETDGYLEQQLK 780
Query: 781 SAGISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQ 840
S+GISQC SDSFEYCTDDFNGNHHY+STECQ AETSIELKTFS+LTKASSSPEDVRRV+
Sbjct: 781 SSGISQCEGSDSFEYCTDDFNGNHHYISTECQTAETSIELKTFSSLTKASSSPEDVRRVE 840
Query: 841 PELGIGIPESLDLGS----------EQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCE 900
ELG G P SL LGS EQLQIINGSPTD ILM+EFDTEKPVLE QRLSFC
Sbjct: 841 LELGSGFPGSLGLGSGIPGSLGLGGEQLQIINGSPTDNILMEEFDTEKPVLEIQRLSFCG 900
Query: 901 EGYQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQ 960
EGYQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTL VKEDLSRFR+NNRGTLLQNVMLESQ
Sbjct: 901 EGYQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQ 960
Query: 961 SLDPEENLQSGDNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFL 1020
SLD EENLQSG+N+LPVDT K EREEDKGKLTSCSLLTPLIQTSHY GADKDMPALEGFL
Sbjct: 961 SLDREENLQSGENELPVDTEKMEREEDKGKLTSCSLLTPLIQTSHYFGADKDMPALEGFL 1020
Query: 1021 MQSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVAD 1080
MQSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACI+SPLSSSSES KLNKVAD
Sbjct: 1021 MQSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACIDSPLSSSSESFKLNKVAD 1080
Query: 1081 LYHSLSNGLLESVDLKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSAS 1140
LYHSLSNGLLESVDLKS LLM DQNKLLKDGSNFLNGEVNCSPHGSFS CLKS GSHSAS
Sbjct: 1081 LYHSLSNGLLESVDLKSKLLMKDQNKLLKDGSNFLNGEVNCSPHGSFSDCLKSTGSHSAS 1140
Query: 1141 DVRRPFVSPFSKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMR 1200
DVRRPF SPF KLLDRNSLNSSSSGKRSSPNIELPCISEEAES EE DN+FAKDMKSN R
Sbjct: 1141 DVRRPFASPFGKLLDRNSLNSSSSGKRSSPNIELPCISEEAESIEEIDNEFAKDMKSNKR 1200
Query: 1201 VPLVDVTENANVPVAVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLN 1260
VPLVD+TENANV V V E VMFADRLSLESLNTE+ N GTHNRTKENLANQK SKRKYLN
Sbjct: 1201 VPLVDITENANVSVTVPEAVMFADRLSLESLNTELSNAGTHNRTKENLANQKNSKRKYLN 1260
Query: 1261 EAVDLDIFPGANGAKRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSF 1320
EAVDLDI PGANGAKRVTRSSY+RFSRSDLSCKENFRK GSRFSGKETKHKNIVSNITSF
Sbjct: 1261 EAVDLDILPGANGAKRVTRSSYNRFSRSDLSCKENFRK-GSRFSGKETKHKNIVSNITSF 1320
Query: 1321 IPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENL 1380
IPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENL
Sbjct: 1321 IPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENL 1380
Query: 1381 RQLELEKKKKEEDRKKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREH 1440
RQ+ELEKKKKEE+RKKKEEEMKKR+ADKA KKRQREEEERKEKERKRM VEEVRRRLREH
Sbjct: 1381 RQIELEKKKKEEERKKKEEEMKKREADKAEKKRQREEEERKEKERKRMRVEEVRRRLREH 1440
Query: 1441 GGKLRSDKENKDVKPQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDAR 1500
GKLRSDKENKD KPQANEQKP RKACKDVTNKLDKENGHEKFDKLSVT+SKS+TSDA
Sbjct: 1441 SGKLRSDKENKDAKPQANEQKPRCRKACKDVTNKLDKENGHEKFDKLSVTESKSSTSDAG 1500
Query: 1501 RENFVVENAQPTIVGFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIR 1560
RENF+VEN+QPT V FLEAEALE GMES ISETSER+SYQISPYKASDDEDEEDE+DGIR
Sbjct: 1501 RENFLVENSQPTSVDFLEAEALEIGMESGISETSERQSYQISPYKASDDEDEEDEEDGIR 1560
Query: 1561 KNKFVPSWASKDHVADLFASQQKLNPEIIFPPKSFCDIEQVLLL 1590
NKFVPSWASKD VA LFASQQKLNPEIIFPPKSFCDIEQVLLL
Sbjct: 1561 NNKFVPSWASKDRVAALFASQQKLNPEIIFPPKSFCDIEQVLLL 1600
BLAST of CsGy7G007850 vs. ExPASy TrEMBL
Match:
A0A1S3CI78 (uncharacterized protein LOC103501253 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103501253 PE=4 SV=1)
HSP 1 Score: 2612 bits (6771), Expect = 0.0
Identity = 1397/1579 (88.47%), Postives = 1452/1579 (91.96%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF 60
MSAMEKLFVQIFERKKWIIDQ +QQTDLFDQHLASKLIIDGIVPPPWLHS+FLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIDQARQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
Query: 61 Q--EVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNP 120
+ EVNKSFISGVEFPRSPLD HRSSLNEAFVADSGEE EHRS EE GSLNDDFDAGN P
Sbjct: 61 EVAEVNKSFISGVEFPRSPLDTHRSSLNEAFVADSGEELEHRSNEETGSLNDDFDAGNRP 120
Query: 121 AISPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFEL 180
A+SPQCDI +AGVLNC+PCIEMTPVSPHGRG IVS+NYRDPTLSLARLHRSKSRQKA EL
Sbjct: 121 AVSPQCDIRSAGVLNCAPCIEMTPVSPHGRGAIVSENYRDPTLSLARLHRSKSRQKALEL 180
Query: 181 RNSVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESN 240
RNSVKSTRCQSRCENKSDSIAG IVGS IG LQ+DHEDESGLAKASSSC GIGSLEEE+N
Sbjct: 181 RNSVKSTRCQSRCENKSDSIAGRIVGSAIGLLQADHEDESGLAKASSSCRGIGSLEEETN 240
Query: 241 VGCEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQF 300
VGCEQK SSIGSDKVGVVVSPGLQSRFIDV+NSLNI SKNEELCIAGGSTQNSY+VNEQF
Sbjct: 241 VGCEQKRSSIGSDKVGVVVSPGLQSRFIDVENSLNISSKNEELCIAGGSTQNSYQVNEQF 300
Query: 301 DSPRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKF 360
DSPRPSSGKIEEGS YCRSQEYSSDKPEKCRLQ SSLDAN+TSCISP DGRAG IGG KF
Sbjct: 301 DSPRPSSGKIEEGSTYCRSQEYSSDKPEKCRLQCSSLDANKTSCISPVDGRAGTIGGPKF 360
Query: 361 HSDQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDA 420
HSDQVDEQLDLPKPSSDNVECNE+AVLG CRSHDYDLD ALQS SQQ S EVDDSS IDA
Sbjct: 361 HSDQVDEQLDLPKPSSDNVECNEEAVLGHCRSHDYDLDNALQSRSQQSSQEVDDSSIIDA 420
Query: 421 SDGRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVD-EVG 480
DGRLLDLYNPSSGKVECC ETI GHC S+ECNFEIA QSGS+YS QDVD+SSYVD EVG
Sbjct: 421 CDGRLLDLYNPSSGKVECCGETILGHCWSQECNFEIAQQSGSQYSPQDVDDSSYVDSEVG 480
Query: 481 GSCPIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSL 540
GSCPIGSS VHP EVKE+LDLSK+S NIECCEEKILG LS+Q+YKL+NPQK GMQHNSL
Sbjct: 481 GSCPIGSSNVHPREVKEQLDLSKTSSGNIECCEEKILGGLSSQDYKLDNPQKSGMQHNSL 540
Query: 541 DGDNSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFD 600
D DNSSCFSSV+GTFC VGSSKQHSD E LELFRPSSVNSECHEEELEDCRTQDCNF+
Sbjct: 541 DADNSSCFSSVNGTFCAVGSSKQHSDLVSEPLELFRPSSVNSECHEEELEDCRTQDCNFN 600
Query: 601 N-AEQSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSV 660
N A QS V K FSSPI EVRE TSDKK SSF+DDKRD SEKEK NSLLHIPLPQIQVDSV
Sbjct: 601 NNAVQSGVGKNFSSPIMEVREKTSDKKSSSFIDDKRDASEKEKSNSLLHIPLPQIQVDSV 660
Query: 661 KENESDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSL 720
KENESD+ ASESH+ERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAE+NVSL
Sbjct: 661 KENESDQGASESHNERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEQNVSL 720
Query: 721 KDGVSDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLK 780
KDGVSDLQNSHDNVVEIPPVD NG SVP +DTETFRDHV+M P +VGETDGYLEQQLK
Sbjct: 721 KDGVSDLQNSHDNVVEIPPVDGNGTSVPRKDTETFRDHVIMAP---YVGETDGYLEQQLK 780
Query: 781 SAGISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQ 840
S+GISQC SDSFEYCTDDFNGNHHY+STECQ AETSIELKTFS+LTKASSSPEDVRRV+
Sbjct: 781 SSGISQCEGSDSFEYCTDDFNGNHHYISTECQTAETSIELKTFSSLTKASSSPEDVRRVE 840
Query: 841 PELGIGIPESLDLGS----------EQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCE 900
ELG G P SL LGS EQLQIINGSPTD ILM+EFDTEKPVLE QRLSFC
Sbjct: 841 LELGSGFPGSLGLGSGIPGSLGLGGEQLQIINGSPTDNILMEEFDTEKPVLEIQRLSFCG 900
Query: 901 EGYQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQ 960
EGYQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTL VKEDLSRFR+NNRGTLLQNVMLESQ
Sbjct: 901 EGYQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQ 960
Query: 961 SLDPEENLQSGDNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFL 1020
SLD EENLQSG+N+LPVDT K EREEDKGKLTSCSLLTPLIQTSHY GADKDMPALEGFL
Sbjct: 961 SLDREENLQSGENELPVDTEKMEREEDKGKLTSCSLLTPLIQTSHYFGADKDMPALEGFL 1020
Query: 1021 MQSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVAD 1080
MQSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACI+SPLSSSSES KLNKVAD
Sbjct: 1021 MQSDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACIDSPLSSSSESFKLNKVAD 1080
Query: 1081 LYHSLSNGLLESVDLKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSAS 1140
LYHSLSNGLLESVDLKS LLM DQNKLLKDGSNFLNGEVNCSPHGSFS CLKS GSHSAS
Sbjct: 1081 LYHSLSNGLLESVDLKSKLLMKDQNKLLKDGSNFLNGEVNCSPHGSFSDCLKSTGSHSAS 1140
Query: 1141 DVRRPFVSPFSKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMR 1200
DVRRPF SPF KLLDRNSLNSSSSGKRSSPNIELPCISEEAES EE DN+FAKDMKSN R
Sbjct: 1141 DVRRPFASPFGKLLDRNSLNSSSSGKRSSPNIELPCISEEAESIEEIDNEFAKDMKSNKR 1200
Query: 1201 VPLVDVTENANVPVAVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLN 1260
VPLVD+TENANV V V E VMFADRLSLESLNTE+ N GTHNRTKENLANQK SKRKYLN
Sbjct: 1201 VPLVDITENANVSVTVPEAVMFADRLSLESLNTELSNAGTHNRTKENLANQKNSKRKYLN 1260
Query: 1261 EAVDLDIFPGANGAKRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSF 1320
EAVDLDI PGANGAKRVTRSSY+RFSRSDLSCKENFRK GSRFSGKETKHKNIVSNITSF
Sbjct: 1261 EAVDLDILPGANGAKRVTRSSYNRFSRSDLSCKENFRK-GSRFSGKETKHKNIVSNITSF 1320
Query: 1321 IPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENL 1380
IPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENL
Sbjct: 1321 IPLVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENL 1380
Query: 1381 RQLELEKKKKEEDRKKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREH 1440
RQ+ELEKKKKEE+RKKKEEEMKKR+ADKA KKRQREEEERKEKERKRM VEEVRRRLREH
Sbjct: 1381 RQIELEKKKKEEERKKKEEEMKKREADKAEKKRQREEEERKEKERKRMRVEEVRRRLREH 1440
Query: 1441 GGKLRSDKENKDVKPQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDAR 1500
GKLRSDKENKD KPQANEQKP RKACKDVTNKLDKENGHEKFDKLSVT+SKS+TSDA
Sbjct: 1441 SGKLRSDKENKDAKPQANEQKPRCRKACKDVTNKLDKENGHEKFDKLSVTESKSSTSDAG 1500
Query: 1501 RENFVVENAQPTIVGFLEAEALENGMESRISETSERESYQISPYKASDDEDEEDEDDGIR 1560
RENF+VEN+QPT V FLEAEALE GMES ISETSER+SYQISPYKASDDEDEEDE+DGIR
Sbjct: 1501 RENFLVENSQPTSVDFLEAEALEIGMESGISETSERQSYQISPYKASDDEDEEDEEDGIR 1560
Query: 1561 KNKFVPSWASKDHVADLFA 1565
NKFVPSWA +D+F+
Sbjct: 1561 NNKFVPSWARCVTWSDVFS 1575
BLAST of CsGy7G007850 vs. ExPASy TrEMBL
Match:
A0A5D3C921 (Titin-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold265G001060 PE=4 SV=1)
HSP 1 Score: 2533 bits (6565), Expect = 0.0
Identity = 1350/1518 (88.93%), Postives = 1401/1518 (92.29%), Query Frame = 0
Query: 1 MSAMEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHF 60
MSAMEKLFVQIFERKKWII+Q +QQTDLFDQHLASKLIIDGIVPPPWLHS+FLHSHISHF
Sbjct: 1 MSAMEKLFVQIFERKKWIIEQARQQTDLFDQHLASKLIIDGIVPPPWLHSSFLHSHISHF 60
Query: 61 QEVNKSFISGVEFPRSPLDAHRSSLNEAFVADSGEEWEHRSTEEAGSLNDDFDAGNNPAI 120
+EVNKSFISGVEFPRSPLD HRSSLNEAFVADSGEE EHRS EE GSLNDDFDAGN PA+
Sbjct: 61 EEVNKSFISGVEFPRSPLDTHRSSLNEAFVADSGEELEHRSNEETGSLNDDFDAGNRPAV 120
Query: 121 SPQCDISNAGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSKSRQKAFELRN 180
SPQCDI +AGVLNC+PCIEMTPVSPHGRG IVS+NYRDPTLSLARLHRSKSRQKA ELRN
Sbjct: 121 SPQCDIRSAGVLNCAPCIEMTPVSPHGRGAIVSENYRDPTLSLARLHRSKSRQKALELRN 180
Query: 181 SVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGIGSLEEESNVG 240
SVKSTRCQSRCENKSDSIAG IVGS IG LQ+DHEDESGLAKASSSC GIGSLEEE+NVG
Sbjct: 181 SVKSTRCQSRCENKSDSIAGRIVGSAIGLLQADHEDESGLAKASSSCRGIGSLEEETNVG 240
Query: 241 CEQKDSSIGSDKVGVVVSPGLQSRFIDVDNSLNIFSKNEELCIAGGSTQNSYKVNEQFDS 300
CEQK SSIGSDKVGVVVSPGLQSRFIDV+NSLNI SKNEELCIAGGSTQNSY+VNEQFDS
Sbjct: 241 CEQKRSSIGSDKVGVVVSPGLQSRFIDVENSLNISSKNEELCIAGGSTQNSYQVNEQFDS 300
Query: 301 PRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPEDGRAGPIGGSKFHS 360
PRPSSGKIEEGS YCRSQEYSSDKPEKCRLQ SSLDAN+TSCISP DGRAG IGG KFHS
Sbjct: 301 PRPSSGKIEEGSTYCRSQEYSSDKPEKCRLQCSSLDANKTSCISPVDGRAGTIGGPKFHS 360
Query: 361 DQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDLDKALQSESQQRSPEVDDSSCIDASD 420
DQVDEQLDLPKPSSDNVECNE+AVLG CRSHDYDLD ALQS SQQ S EVDDSS IDA D
Sbjct: 361 DQVDEQLDLPKPSSDNVECNEEAVLGHCRSHDYDLDNALQSRSQQSSQEVDDSSIIDACD 420
Query: 421 GRLLDLYNPSSGKVECCEETISGHCRSKECNFEIAHQSGSRYSSQDVDNSSYVD-EVGGS 480
GRLLDLYNPSSGKVECC ETI GHC S+ECNFEIA QSGS+YS QDVD+SSYVD EVGGS
Sbjct: 421 GRLLDLYNPSSGKVECCGETILGHCWSQECNFEIAQQSGSQYSPQDVDDSSYVDSEVGGS 480
Query: 481 CPIGSSKVHPHEVKEKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSLDG 540
CPIGSS VHP EVKE+LDLSK+S NIEC EEKILG LS+Q+YKL+NPQK GMQHNSLD
Sbjct: 481 CPIGSSNVHPREVKEQLDLSKTSSGNIECGEEKILGGLSSQDYKLDNPQKSGMQHNSLDA 540
Query: 541 DNSSCFSSVDGTFCRVGSSKQHSDQGIERLELFRPSSVNSECHEEELEDCRTQDCNFDN- 600
DNSSCFSSV+GTFC VGSSKQHSD E LELFRPSSVNSECHEEELEDCRTQDCNFDN
Sbjct: 541 DNSSCFSSVNGTFCAVGSSKQHSDLVSEPLELFRPSSVNSECHEEELEDCRTQDCNFDNN 600
Query: 601 AEQSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVSEKEKCNSLLHIPLPQIQVDSVKE 660
A QS V K FSSPI EVRE TSDKKPSSF+DDKRD SEKEK NSLLHIPLPQIQVDSVKE
Sbjct: 601 AVQSGVGKNFSSPIMEVREKTSDKKPSSFIDDKRDASEKEKSNSLLHIPLPQIQVDSVKE 660
Query: 661 NESDKCASESHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEKNVSLKD 720
NESD+ ASESH+ERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAE+NVSLKD
Sbjct: 661 NESDQGASESHNERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSLLQSDEPAEQNVSLKD 720
Query: 721 GVSDLQNSHDNVVEIPPVDANGASVPIEDTETFRDHVVMVPCVPHVGETDGYLEQQLKSA 780
GVSDLQNSHDNVVEIPPVD NG SVP +DTETF+DHV+M P +VGETDGYLEQQLKS+
Sbjct: 721 GVSDLQNSHDNVVEIPPVDGNGTSVPRKDTETFKDHVIMAP---YVGETDGYLEQQLKSS 780
Query: 781 GISQCADSDSFEYCTDDFNGNHHYLSTECQIAETSIELKTFSALTKASSSPEDVRRVQPE 840
GISQC SDSF+YCTDDFNGNHHY+STECQ AETSIELKTFS+LTKASSSPEDVRRV+PE
Sbjct: 781 GISQCEGSDSFQYCTDDFNGNHHYISTECQTAETSIELKTFSSLTKASSSPEDVRRVEPE 840
Query: 841 LGIGIPESLDLGS----------EQLQIINGSPTDKILMQEFDTEKPVLEFQRLSFCEEG 900
LG G P SL LGS EQLQIINGSPTD ILM+EFDTEKPVLE QRLSFC EG
Sbjct: 841 LGSGFPGSLGLGSGIPGSLGLGGEQLQIINGSPTDNILMEEFDTEKPVLEIQRLSFCGEG 900
Query: 901 YQQSNVSIVPIEMLLLEKEAHSMQLSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQSL 960
YQQSNVSIVPIEMLLLEKEAHS QLSDSSPTL VKEDLSRFR+NNRGTLLQNVMLESQSL
Sbjct: 901 YQQSNVSIVPIEMLLLEKEAHSKQLSDSSPTLPVKEDLSRFRSNNRGTLLQNVMLESQSL 960
Query: 961 DPEENLQSGDNKLPVDTGKTEREEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFLMQ 1020
D EENLQSG+N+LPVDT K EREEDKGKLTSCSLLTPLIQTSHY GADKDMPALEGFLMQ
Sbjct: 961 DREENLQSGENELPVDTEKMEREEDKGKLTSCSLLTPLIQTSHYFGADKDMPALEGFLMQ 1020
Query: 1021 SDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLY 1080
SDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACI+SPLSSSSES KLNKVADLY
Sbjct: 1021 SDAEQPCISVGGINLDTLELSKCMIERASILEKICKSACIDSPLSSSSESFKLNKVADLY 1080
Query: 1081 HSLSNGLLESVDLKSNLLMNDQNKLLKDGSNFLNGEVNCSPHGSFSACLKSIGSHSASDV 1140
HSLSNGLLESVDLKS LLM DQNKLLKDGSNFLNGEVNCSPHGSFS CLKS GSHSASDV
Sbjct: 1081 HSLSNGLLESVDLKSKLLMKDQNKLLKDGSNFLNGEVNCSPHGSFSDCLKSTGSHSASDV 1140
Query: 1141 RRPFVSPFSKLLDRNSLNSSSSGKRSSPNIELPCISEEAESTEETDNKFAKDMKSNMRVP 1200
RRPF SPF KLLDRNSLNSSSSGKRSSPNIELPCISEEAES EE DN+FAKDMKSN RVP
Sbjct: 1141 RRPFASPFGKLLDRNSLNSSSSGKRSSPNIELPCISEEAESIEEIDNEFAKDMKSNKRVP 1200
Query: 1201 LVDVTENANVPVAVSETVMFADRLSLESLNTEVGNTGTHNRTKENLANQKKSKRKYLNEA 1260
LVDVTENANV V V E VMFADRLSLESLNTE+ N GTHNRTKENLANQK SKRKYLNEA
Sbjct: 1201 LVDVTENANVSVTVPEAVMFADRLSLESLNTELSNAGTHNRTKENLANQKNSKRKYLNEA 1260
Query: 1261 VDLDIFPGANGAKRVTRSSYSRFSRSDLSCKENFRKEGSRFSGKETKHKNIVSNITSFIP 1320
VDLDI PGANGAKRVTRSSY+RFSRSDLSCKENFRK GSRFSGKETKHKNIVSNITSFIP
Sbjct: 1261 VDLDILPGANGAKRVTRSSYNRFSRSDLSCKENFRK-GSRFSGKETKHKNIVSNITSFIP 1320
Query: 1321 LVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQENLRQ 1380
LVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQ+NLRQ
Sbjct: 1321 LVQQREAATILKGKRDVKVKAIEAAEAAKRLAEKKENERQMKKEALKLERARMEQDNLRQ 1380
Query: 1381 LELEKKKKEEDRKKKEEEMKKRKADKAAKKRQREEEERKEKERKRMHVEEVRRRLREHGG 1440
+ELEKKKKEE+RKKKEEEMKKR+ADKA KKRQREEEERKEKERKRM VEEVRRRLREH G
Sbjct: 1381 IELEKKKKEEERKKKEEEMKKREADKAEKKRQREEEERKEKERKRMRVEEVRRRLREHSG 1440
Query: 1441 KLRSDKENKDVKPQANEQKPLDRKACKDVTNKLDKENGHEKFDKLSVTKSKSTTSDARRE 1500
KLRSDKENKD KPQANEQKP RKACKDVTNKLDKENGHEKFDKLSVT+SKS+TSDA RE
Sbjct: 1441 KLRSDKENKDAKPQANEQKPRCRKACKDVTNKLDKENGHEKFDKLSVTESKSSTSDAGRE 1500
Query: 1501 NFVVENAQPTIVGFLEAE 1506
NFVVEN+QPT V FLEAE
Sbjct: 1501 NFVVENSQPTSVDFLEAE 1514
BLAST of CsGy7G007850 vs. TAIR 10
Match:
AT5G55820.1 (CONTAINS InterPro DOMAIN/s: Inner centromere protein, ARK-binding region (InterPro:IPR005635); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 302.0 bits (772), Expect = 2.9e-81
Identity = 484/1829 (26.46%), Postives = 773/1829 (42.26%), Query Frame = 0
Query: 4 MEKLFVQIFERKKWIIDQTKQQTDLFDQHLASKLIIDGIVPPPWLHSTFLHSHISHFQEV 63
+E LFVQIFERK+ I++Q +QQ DL+DQHLASK ++ G+ PP WL S L S S E+
Sbjct: 48 IENLFVQIFERKRRIVEQVQQQVDLYDQHLASKCLLAGVSPPSWLWSPSLPSQTS---EL 107
Query: 64 NK-SFISGVEFPRS------------------------------------PLD------- 123
NK IS + FP S PL+
Sbjct: 108 NKEEIISELLFPSSRPSIVCPSSRPFSYQRPVRFLADNVVRQDLTSVVNNPLEEQLLEEE 167
Query: 124 ----------------AHRSSLNEAFVADSGEE-----------WEHRSTEEAGSLNDDF 183
+H +N A D E+ E++S +
Sbjct: 168 PQHNLSHNLVRQVSNHSHEQDVNIASPRDVHEKERLPESVSIDCRENQSCSSPEHSKNQR 227
Query: 184 DAGNNPAISPQCDISN--AGVLNCSPCIEMTPVSPHGRGGIVSDNYRDPTLSLARLHRSK 243
N A SP C ++ + C + + + I D DP LSLA++ RS+
Sbjct: 228 VETNLDATSPGCSQGEKVPKCVSTTGCKRKSSSLGYCQEEIEPDTCIDPGLSLAKMQRSR 287
Query: 244 SRQKAFELRNSVKSTRCQSRCENKSDSIAGGIVGSVIGSLQSDHEDESGLAKASSSCNGI 303
SRQKA ELR+S K+++ +S N+ GG +G I SL+SD E L K +
Sbjct: 288 SRQKALELRSSAKASKSRSNSRNELKPSPGGDIGFGIASLRSDSVSEIKLFKHDENDEEC 347
Query: 304 GSLEEESNVGCEQKDSSIGSDKVGVVVSPGLQSRFID---VDNSLNIFSKNEELCIAGGS 363
E SN ++ D I K+ V +D + +S + ++ C+
Sbjct: 348 REEVENSNSQGKRGDQCI---KISVPTESFTLHHEVDSVSISSSGDAYASIVPECLLESG 407
Query: 364 TQNSYKVNEQFDSPRPSSGKIEEGSAYCRSQEYSSDKPEKCRLQSSSLDANETSCISPED 423
N + + ++ +SGK++E + K C +++ LD + S S +D
Sbjct: 408 HVNDIDILQSIETIDEASGKVDE--------QVDDPKSRSC-YETAYLDGSTRSKSSIQD 467
Query: 424 G--RAGPIGGSKFH----------SDQVDEQLDLPKPSSDNVECNEKAVLGDCRSHDYDL 483
R + F S D +++LP+ + +E +++ D + +
Sbjct: 468 NSKRKHQKSSNSFSGNFLLTNSNPSHWADHEVELPQAITTT---SEVSMVTDAGTSIFQS 527
Query: 484 DKALQSESQQRS--PEVDDSSCIDASD--------------GRLLDLYNPSSGKVE-CCE 543
+ +S S R + + S +++S + D NPSS E
Sbjct: 528 EIIARSRSNARENRSKTEHSGSVESSSINLEPRDSIPVLQGSHVKDSLNPSSVDAEGLVV 587
Query: 544 ETISGHCRSK---ECNFEIAHQSGSRYSSQDV--DNSSYVDEVGGSCPIGSSKVHPHEVK 603
E I+ +SK EC S R S + D +++ + S S++
Sbjct: 588 ENITSSDQSKETGECVDTNRCSSAERVSQTGISPDETTFAGAIQDSI----SQIELLSFV 647
Query: 604 EKLDLSKSSFDNIECCEEKILGDLSNQEYKLNNPQKFGMQHNSLDGDNSSCFSSVDGTFC 663
E + S +++ S+ E L P + ++ DN+ + + G
Sbjct: 648 ESSSIELQSRHSVK---------QSDDESVLLKPVTVNGEALLVEEDNNGESTEISG--- 707
Query: 664 RVGSSKQHSDQGIE-RLELFRPSSVNSECHEEELEDCRTQDCNF---------------- 723
+ S+ S I L + S +N E+L D ++ C+
Sbjct: 708 -ISKSRSLSQTDITVVLPVVVESILNESGTPEKLID-HSKRCDISCGSKEVQPLGSLTEV 767
Query: 724 -DNAEQSDVDKKFSSPITEVRENTSDKKPSSFLDDKRDVS---EKEKCNSLLHIPLPQIQ 783
N + + SS I E E+ +D K S + + E + NSLL P +
Sbjct: 768 GSNQSHGIISRARSSLIEE--ESANDYKALSDGSNHKSADKQLEVREGNSLLRTPDRPVF 827
Query: 784 VDS---VKENESDKCASE-----SHSERRYEDTGDFNGNTLSSGNKSLQGYEEVTTCSL- 843
VD+ V EN +K + E + + R ++ + S N + E+ ++
Sbjct: 828 VDNFDEVPENSREKSSMEKVPTPAPTARVFDVPSLTDSGVNLSANNEMNDIEDHNGLNIE 887
Query: 844 ---------------LQSDEPAEKNV---SLKDGVSDLQNSHDNVVEIPPVDANGASVPI 903
+ +EP E N + Q+ + +PP+ +
Sbjct: 888 MVAEMESYASHPGLKVGENEPTESNTFTGHIDALTKRPQHETSSEKAVPPIKRDVTCTEA 947
Query: 904 EDTETFRDHVVMVPCVPHVGETDGYLEQQ-----LKSAGISQCADSDSFEYCTDDF--NG 963
++ + C G + Q L+ + + S + D+
Sbjct: 948 DECHDLESPIQEFFC--SSSPMGGSMRQNKRRRILEKPTRRELSSSPGGDILESDYVREA 1007
Query: 964 NHHYLSTEC-QIAETSIELKTFSALTKASSSPEDVRRVQPELGIGIPESLDLGSEQLQII 1023
HH C + +EL+ + +++++ IG S +L E+ I+
Sbjct: 1008 VHHREEAACHNVDNYDVELQKLIGSASSHHYSVELQKM-----IGSASSAELRFEEGDIL 1067
Query: 1024 NGSPTDKILMQE-----FDTEKPVLEFQRLSFCEEGYQQSNVSIVPIEMLLLEKEAHSMQ 1083
+ + + + +E Q+L + S +E+ + A S +
Sbjct: 1068 ESDYVREAVHHREEAACHNVDNYDVELQKLIGSASSHHYS------VELQKMIGSASSAE 1127
Query: 1084 LSDSSPTLLVKEDLSRFRNNNRGTLLQNVMLESQSLDPEENLQSGDNKLPVDTGKTE--- 1143
L LL + L + + T + + ++ + P+ + S + G+T
Sbjct: 1128 LRFEESYLLKEAGLMSPASLSYRT--EQLSVQRSQIAPDHRVGSENINFFPYAGETSHGL 1187
Query: 1144 -----REEDKGKLTSCSLLTPLIQTSHYLGADKDMPALEGFLMQSDAEQPCISVGGINLD 1203
R+ D S LTPL S D P LEGF++Q+D E S +N D
Sbjct: 1188 ASCIVRDSD-----SSPCLTPLGLIS---SDDGSPPVLEGFIIQTDDENQSGSKNQLNHD 1247
Query: 1204 TLELSKCMIERASILEKICKSACINSPLSSSSESLKLNKVADLYHSLSNGLLESVDLKSN 1263
+ +L + E A+++E+ICKSAC+N+P +++ K ++ DL S+S L + + N
Sbjct: 1248 SFQLPRTTAESAAMIEQICKSACMNTPSLHLAKTFKFDEKLDLDQSVSTELFDGMFFSQN 1307
Query: 1264 LLMNDQNKLLKDGSNFLNGEVNCSPHG-SFSACLKSIGSHSASDVRRPFVSPFSKLLDRN 1323
L+ S F N +N G S++ L G+ S+++ R P +SP KL R+
Sbjct: 1308 ---------LEGSSVFDNLGINHDYTGRSYTDSLP--GTGSSAEARNPCMSPTEKLWYRS 1367
Query: 1324 SLNSSSSGKRSSPNIELPCISEEAESTEETDNKFA----KDMKSNM-------------- 1383
SSSS KRS+ +LPCISEE E+ EE K M+S
Sbjct: 1368 LQKSSSSEKRSTQTPDLPCISEENENIEEEAENLCTNTPKSMRSEKRGSSIPELPCIAEE 1427
Query: 1384 -------------------------RVPLVDVTEN-ANVPVAVSETVMFADRLSLESLNT 1443
R PL DV E+ + +VSE + ADR SL+S++T
Sbjct: 1428 NENIDEISDAVNEASGSERENVSAERKPLGDVNEDPMKLLPSVSEAKIPADRQSLDSVST 1487
Query: 1444 EVGNTGTHNRTKENLANQKKSKRKYLNEAVDLDIFPGANGAKRVTRSSYSRFSRSDLSCK 1503
+ N K + K S R++ + + G GAKR + SRFS+ LSC
Sbjct: 1488 AFSFSAKCNSVKSKVG--KLSNRRFTGKGKENQ---GGAGAKRNVKPPSSRFSKPKLSCN 1547
Query: 1504 ENFRKEGSRFSGKETKHKNIVSNITSFIPLVQQRE-AATILKGKRDVKVKAIEAAEAAKR 1563
+ G R KE +H NIVSNITSF+PLVQQ++ A ++ GKRDVKVKA+EAAEA+KR
Sbjct: 1548 SSLTTVGPRLQEKEPRHNNIVSNITSFVPLVQQQKPAPALITGKRDVKVKALEAAEASKR 1607
Query: 1564 LAEKKENERQMKKEALKLERARMEQENLRQLELEKKKKEEDR---------------KKK 1585
+AE+KEN+R++KKEA+KLERA+ EQENL++ E+EKKKKEEDR KKK
Sbjct: 1608 IAEQKENDRKLKKEAMKLERAKQEQENLKKQEIEKKKKEEDRKKKEAEMAWKQEMEKKKK 1667
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_004148933.1 | 0.0 | 100.00 | uncharacterized protein LOC101214907 isoform X2 [Cucumis sativus] >KGN44031.1 hy... | [more] |
XP_011658937.1 | 0.0 | 99.87 | uncharacterized protein LOC101214907 isoform X1 [Cucumis sativus] >XP_011658938.... | [more] |
XP_008463008.1 | 0.0 | 89.14 | PREDICTED: uncharacterized protein LOC103501253 isoform X2 [Cucumis melo] | [more] |
XP_008463006.1 | 0.0 | 89.03 | PREDICTED: uncharacterized protein LOC103501253 isoform X1 [Cucumis melo] | [more] |
XP_008463009.1 | 0.0 | 88.47 | PREDICTED: uncharacterized protein LOC103501253 isoform X3 [Cucumis melo] | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0K8D1 | 0.0 | 100.00 | INCENP_ARK-bind domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G11... | [more] |
A0A1S3CJT1 | 0.0 | 89.14 | uncharacterized protein LOC103501253 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A1S3CIN7 | 0.0 | 89.03 | uncharacterized protein LOC103501253 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A1S3CI78 | 0.0 | 88.47 | uncharacterized protein LOC103501253 isoform X3 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A5D3C921 | 0.0 | 88.93 | Titin-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold265G0... | [more] |
Match Name | E-value | Identity | Description | |
AT5G55820.1 | 2.9e-81 | 26.46 | CONTAINS InterPro DOMAIN/s: Inner centromere protein, ARK-binding region (InterP... | [more] |