Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTAACACCCTAAGTCAATCTATCGCTTATTTCTCCTAAATTTTTTTAAAGATTTTATCGGAAAGCAAAACCGAATTAAATTAATAATAAATATAAAAAGCGTATATATTGTTGTATTGCGTTGGCAATGAACGGCGGTTCCGGCACGACGGAGCTCTGCTACATTACGAGTATGATTGTCTTGTCCCTTACAGCCATTTTTCACACACTCATCGTTTTCTGATGGTGAAAATCCGTAGTCCTCTTTTTCAATTTTAAGGAAATTTGATTGAATTACACATTAGAAAGGGATCCCTCTGTGTTGAAATGATGAAAAACCGTTTGATAATTCTCATCCCAACCCTTCAATTCTTCGTATTCTTCCTTTTCTCTGTCATATAATTTACAAATTTCCCACCTACTTTTCCCTCCTTTCTTGTTTGTCTACTATGTCTATTGCATAAAAATAAAGCCCTAGAGAAACCTTGGCCACCTGCCACTGCACCTACGTTTTGCAATTACATGTATCCTAGTTTATTTTATTTTATTTTTTATTTTTATTACTTGCATTGTTCCTCAGGCTAACTGGAATTCTGATTAATCAGAGCTTTGTTTTCACATGAATTATTCGACCCACTTTTTCTATTATTCGATTTTTTTGAGAAGCAATCTTTTTGTGTTTCCCGCCGGCCGTTCATAGATGTGATCGATTCAACTAATTTCCATATTTTTTAATGATTTGTGATGCCGTAGAACTAGCAAAGCTAGCTTTGGTTTTACGACTTGGAAGATATGGGTATTACTGAAGGATTTTGATTTACTGTTTCTTTTAGCTTTGCATTGTTTGGAAAATGACATCGGAGTTTTGATCAATTGAGATGGAGAACAGCAGTGGAGTGGATGATCAGGGATCAGGATGGTTCGAAGTGAAAAAGGTGCCAACTCTATCATCGGCTAACTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCCTTTTCTGATGGTTGGGAAGTGGGTTGAGGGGAGATGAATGGTTCTTATTATTTATGGTTGGGAACGCTGAAAATGAATGAATGGATGAACTGATGAACAATTCTTGGCATCCGAATAAGGTCCCCAAAGCCCACCAATTATTCTATCTGAATCTTTTAGTTACTTTTCATTTTTCTTCCGCGGAGGTATATAAGAGTTTCTTCTCTTCAAAGTTGTGTTTATCAGTGTGAAAAAGATTGGTAGCTACTTTCTTTGTTTGAGTAGTCTTAAGAATTTAAGGACATCAGACTCAGATTCTTTGAAGTGGGGTTATTTTATTGCATCTAAAGATTTTGTTTTGTTATTAGATGAAAGCTTGAGCGACAGAGTTGTTTGTTTGTAATGTAGGTGCTCTAGAAATTTTACCGGCATAGGAGTCCAAATAGACTTGGATATGTGTTGGGCTCTTCTTTCCTTCTCGAGGTGCTGTAAATGTTTTGCAAAGCATTTCTTTATTCACTTTTGTTGAAATTTGAGTTACATGCACTGCACCATATGATATAACGTTAATATTATCCATTTCATTGGCTTTGTCAGCCAATTCCTTTAAATAAGTGTTCAGTTTCTCCACCTTTGGGTACTGGGTTCATGCTTATTTATTTTACGTTCCTGGTATTGATAATGCTCTAATCATATCATATATTTGTGGCGGCGGACCTAACCTTTGTGTTAATTTGTCTACTCTGTTTCCCAGAAGCATAGGAGCAGTTCTAAGTTTTCCCTACAGGGTTGGGTTGGAGGATTTTCAGGAAAAAATTCGTCTAATTCTTCGCGAAGTCCACATCTGGTGAATAAAAATGCCCAAAATGACAGGCCAATTTCAAAGTCTCATCCTCCAGCATCAGGAGGAAGCTATGCTGTATATACCCAAAGTAATACTGAGAATCGTATTGCTACATCAATTGGAGAGGATGGAGGTTCACATTGCCCTGATAAATGCATGGTCAGACAAGACACTGAATTCCCCAAGTCATCTGTTTTGCACGTTACAGATTCTCATGCTGCAAGTGGAGACTGTGAGAAAGTCCCCCATAAGGACATGCCTGGAGTGGTTCAAAAAATCAAATGGGGTGATCTAGAGGATGATAGTTTGGTATTGAATAACAGTGAAAATGGAGTTGAAATAAAGTTTGGTAATATTGGAGAGGTTGATCTAGGGGTCTCCGAAAAAAATGAGGTTAAGGATGGCTTGGCTTCACACGTCTCTACATCCGTAGATACCCAAGCGAACAAATTGGTGGCAGTATCAGTCAGAGAAGAGGAGGCCTCTCATCAGGCCCTTTTATCAACTGATGAGGACAAAGTATGGCGAGTTAGGCATCAAGATATAAATAGGGAATTTATAGAAGCTTTGGAAGTACTGAATAATAGTGAAGCAACAGTCTGTCCGGTTATTGATGGCTCAAATTTCAAGGATACAGGAACTGGACACACCAAACCTGTAGATGATCACAGTTCAAATTTTAGTTCTCCCTCTTGTGTAGAAGCTGGAACTGAGCCAAAAGTCCAGAAGGCTATGAAGTTACCTGAGGTAGAAAATCCAGAGCTCCATGAAGCACCAGGTAAGACTGAGTTTTCTAGCTTGTCATTGCCCGTTCAGAATGCTGAGTTAGTTTCAACTGAAACTAATGGACCTGAAAATTCAGGGGGTACCTCTGATTCGGTTGAAGAAGCGCAAATTGAACAAGGGAGTGGAACTCATAATGTGCAAGTTGTGAGTGTCCCCAGTGAAGGTGAGACAGGTGAGAGTAAAGAAAGGTTTAGGCAGCGGCTCTGGTGCTTCCTTTTTGAGAATCTTAATCGAGCTATAGATGAACTTTATCTCCTTTGTGAACTGGAATGTGACTTGGAACAGATGAAAGAAGCCATTCTTGTTCTTGAAGAGGCTGCATCTGACTTTAAAGAGCTAAATGCTAGGGTGGAGGAGTTTGAAGAGGTAAAGAGGTTGTCTTCTCAGTCTGTTGATGGGATGTCAATCACTATGAAGAGTGACCATTGTAGGCCACATGCTCTCTCTTGGGAGGTGAGCTCGAGTATGTCTGTTTTTCTGCCAAGCAACTTTCAGATTCTCTGATGTGGCTATAGCTTCTCTTTTTGCAGCAATGAGTATATTATATATTAACTCCGATGCTTCTAGTAACTAGTTTTCTGATGTGAATTGTTTGATTGTAATGATGTTTCTTTTGTAAAGAAATTGGAAAATATATCATATATTAACTCCTATGCTTCTTGTAACTAGTTTCCTGGTTGTTGACAGTGGAAAAATTTGGCATGGATCTCTAGTTCTCTTCTTTTGCATTTTATGTTTGTGGTCTTCTAACTTTTCCTTTGTTCCCTCCTTAATGAACCAGAAGATCTTGAAAATTTTGGAACACAAAGGATAAAATAGTGAACACACTTCTCTCAGACTAATTAAATTATCTTTGGTCTGCTTTGCGTGATTGGAAATTAAATACCCTTGGCAATCTTTGCATGAGGAAGCAAGAAATCTTAAATAGTATTGAGCCTTGGATTATGTAGATATGGAGATTTTTTAGACTTCAAGTTTTAGGTCGGTCTTGAAGAGGGATAGAGTTCAAAATTATCCACTGTAGGAAGTGAATTAGTTGGAGCTAGAAGGCTAAAACTAGGTGTGTTGAAAAGAAGATTGCAACGCTGTTCCTCTCCTTAGATTGCTAGTTTTCAAAAATTCATGAATGCCGTTCTCAACTATGAAGATTGTAATTTGGCTACAAGTGATATAGAGATGGAGGGTTAAGTAGTTTCATTTTTCTTGTCCCTTTTTGCTTTCTTTCAAAGCCATAAGCCTTTTTAGATGGCTTTGGATGTACCCTGTAGTGACAAGTGAGAGGATGGTTGGAGAGGTCTTTCTCCCTTTAACGGACTAGGAAAATGGTTTGCAACCTGAGGGGAGTGAAGGCTTGAACTTTGGTGTCTTTTTGTTGTTGGCTAAGGATAGTTGGGATGCCATTAAGCTACCTTTACTTGGGGTGTTATATGAATTCCTAGGAAAGGCATGAGTGATTCCACCATGCACAAGACCTTTAGATGTGTCTGCCTTAAGAAATCAAGAAGAAATTAGATTATTGATTAGCTTGGTTAAATCCATCTCTAAGGCATTAATGAAGAGGATCAAGTGTTGCCTTCTATTATCTCTACCAACTAACATGCTTTTGTGGTGGGTCGACAAACCCTTGATGAGTTTCTTGTTGCTACTAACAAGGCTATTTAAAACTATAATCTTTGCACTTAAGGAGGGTTCTAGTTACAATCATGTGCAATTTGCCCTTATTATTGTAATGTTTAGTACACAATGCTATTATTGTGTATTTACGCGGATAGTTCTTGATTCTTGAATTCTAATTCAGCAGTCATAGGGAAGGTTGGAATATATTTGACCCTCTCTTGTGCGTGATGTATTATCAAAATCTGTTTGTATGACAGAATTAACCTTTTCACGTAAAGTTTATTAGTTAGTCTTTCTCTGAAATTTGAAACCCAGTTATGCATTAATAAATGCAGGTTCGACGGATGACAAATTCTCCTCATAAAGCAGAGATTTTATCGTCATCACTTGAGGCATTTAAGAAAATTCGGCAAGAGAGAGCCAACATGCTTGAAGCAAGCAAAAAACTCCCTGGGAATGAGAGTCTAAGTCCTCAGTGTATGGATCAAATGAAGAAAACATGTACAATAAATTGTGTAGTGCATGATGCTGAAGATTCAGCATCAAAGATAGCAGAAAAGAAAGGTGTAGATCTCACTCCAGGGAGTCTTAGTGGAAAAGAAAAGAACACTGAGTCTCTTGGTACCGAGAAGGTTAATGTTGTGCAAAATATACGTTCACGCCCTCAGAATTCCTTGTCTTCTATAAATTCATCCAAGCCACCTCTTGCTGTAAAATTTAAGCGAGAGCAACTTGAATCTGATGTAGAAAGGTTAGTTTCTAAAAGAGAAAGGGCTCTTGCAGAGGGCACATGTGAGAAAAATCTGAAAGCTATAGAACATTCCAAAAGACAAGCAACTGTTCCTGAAAAAGACAAGGAAAAAGAGAAGAGAAATATTGGTGTTCGGAAGTCCATGGATGCATGGAAAGAGAAGAGGAATTGGGAAGACATACTTTCATCATCCGTTTGGATTAGTTCTCGAGTTTCACATTTGCCAGGCATGAGCAGGAAAAGTGCTGAGCGTGTACGCGTGCTGCATGATAAATTAATGTCCCCAGACAAGAAGAAGAAAACTTCTTTAGATGTGAAAAGAGAGGCAGAAGAAAAGCATGCTCGTGCAATGAAAATTAGAAGTGAATTAGAGAATGAAAGGGTTCAGAAACTGCAGCGTACTTCAGAGAAATTAAATCGTGTTAATGAATGGCAGGCTGTACGCACTATGAAATTACGGGAGGGAATGTATGCTCGCCACCAGCGCAGTGAATCTCGGCATGAAGCTTTCCTAGCTCAAGTTGTGAAGAGAGCTGGTGACGAAAGTAGTAAGGTAAATGAGGTTCGTTTCATCACCTCCTTAAATGAAGAAAATAAAAAGATTATGCTGCGACAGAAGCTTCACGGTTCTGAGTTAAGGAGAGCTGAAAAACTTCAGGTAATGAAAATTAAACAAAAGGAAGACATGGCTAGAGAAGAAGCAGTGTTAGAACGGAAGAAATTGATCGAAGCTGAAAAATTGCAGCGTCTTGCTGAAACACAGAGGAAAAAAGAAGAGGCTCATGTTCGAAGGGAAGAAGAGCGAAAAGCATCAAGTGCAGCAAGGGAGGCGAGGGCCATGGAACAGCTTCGAAGGAAAGAGGAAAGAGCAAGAGCCCAGCAAGAAGAAGCTGAACTTATGGCCCAGAAATTGGCTGAGCGACTTAGTGAAAGTGAACAACGACGAAAGTTTTATTTGGAGCAAATACGGGAGAGGGCTTCTATGGATTTTAGGGACCAATCTTCACCCTTGCTGCGACGATATATGCATAAGGATGGTCCAAATAGATCAACACCAAATAATAATGGTGATGAACAGGGACCATGTAGCTCTGACTTGGGTTCTGGTCTTGCAATGGGTAAGACAACAATGCAACAACACATGAAACGAAGGATCAAAAGAATCCGACAGAGGCTTATGGCTCTGAAATATGAATTTTTTGAGCCCGTTAATGGAGCTGAAAATGTTGGCATTGGATATAGAACATCCATAGGGACTGCAAGGGCAAAAATAGGTAGGTGGCTTCAAGAACTTCAGAAACTTCGCCAAGAAAGAAAAGAAGGGGCTGCAAGTTTAGGGCTGATAATTGCTGAAATGATCAAGGTATTATCAAACACCAACATTCTATTTTTTGGGTGGGTGGATTTTTTGATGAAATATGTCTCATTCTCTTGCGTGTCTATATAGTATTAGATTGTATTTTGAAGGTTAAAGAGTTGAACACTGTGAATTCTCCATTCCTGCATTTTGTCCATTTATCAACTTGTTTGTATGGTTTAATACCTTGTAGAGAAGTGAATTTTATCAAAATTTTTCATGCTTTTGTAATTATTCATATATCATGGTGCTAGCTGGGCATTTGCAGTATCAGTGTATGGATTTAAAGTAATTTGATGTGAAACTATGAGATTTGGTATTTGCAATCTTAGTTGATTGTTGTATTTATTCCTGTCAATACTAGTAGTGATGTGTCATTGGTGTCCCAATTTTTAACTTTTGTCTATGCATAACTCATATTCATCCGTCTGAGGATCGTCACAAAGAATTGTTTCATAGCATGTCTCTCTACTAAGATAGCATAACTTGCTTTTCTCTAAAATAAAATTAATTTCTCGTCTACTTGTATGCTTTTAGATGTTTTATGACAGTTTTTTGTTTTTTAGTACTTGGATGGAAGAGAACTTGAGCTGCAAGCTTCTCGCCAAGCTGGCTTACTTGATTTTATTGCCTCTGCACTTCCTGCTTCTCACACATCTAAACCAGAAGCCTGTCAAGTGATGATACACCTATTAAAATTGCTAAGGGTAGTATTATCTGCATCAGCAAACAGAAGTTACTTTCTTGCGCAGAATCTCTTGCCACCAATCATCCCAATGCTGTCAACAGCCCTTGAGAACTACATTAAGATTGCAGCTTCTATTAATGCCCCTGGTAATGGGGTTCCATCAAGCAGAACGTCTATTGAAAATTTTGAGTCGAGTTCTGAAGTACTGGATGGGTCTTTGTGGACTATCACAACAATAATTGGTCATATAAACCCTGAAGGACCACAACTCCAAATGTGGGATGGTTTGCTAGAACTATTGGTTGCCTACCAGGTTATTCAACGGCTTCGGGATCTTTTTGCACTTTATGATAGACCTCAGGTGGAAGGGTCCCCATTTCCTTCTTCAATTCTCTTGAGTATACGCCTTCTGGTGGTTTTAACATCAAGACCTGGAACTGATAGTACCATTAATTGTGTATTACCATATAGTGAAAAATTGGCAGGAGATGGAAGTGGGATTACTATTTCTGCCGAGTCTAGAGATCTTTTTGGAACTGGTTTCACTGAGGATGGTAGCCCTTCAGAATCTGGGTTAAATGGTGCTAAAATTGTACAAAAGCCAAAAATGACCGTTGATCAACTGGATGATGAATCATGTGAGCAAAAGATAAACGATGGGATGATACCCAGTGATGGTGGTAAGAGGGAGCAGACGGATTGCTCGATTAAGGCAAATGGTGTTAATCTTATCCAGTCTGAAAATCAGGATGAACCGCCAGATAGTGAAGTTGTCTTGAAACCTTTTGTATCCCAAGAGGATCAGAAAAAACCAGTGGATCTTGTTTCTGACTTGAGGATAAAAAATGTTACAAAATTGAAACCACCAATAGCCTATCTATTGTCTGCTATATCTGATACTGGAATTGTTGGTCTCCTGTCTTTATTAACAGCCGTATTGCTTCAAGCAAACAATAGATTATCATCCGAACAGGTAAGTTGACCTGTCAAAAGCATTATATCTCTGAGCTAATTTGTATTCTTATACAACTATAAGGCTTTAATAGTTAATGTACATGTTAACTTATGTTTCTATTTTTAGAACTTGTAGTGTTCAATTTCTTCAAGCCATGTAGACTGGAATCCATCATTTCTTCATGTTCATCTTGCTCCATTCTCTATTTGGTGAATACCGTGAATATTGTAACAAATTCATTATGATATTTATTATTTTTCAGTTTGATGTTTATTATTACTAACTTTTATACCTTTTTTCAGTTTGATGTTTAACATCATTGACGGATTTGTCAGATATGTTTAAAATCCATTTTTTTAGATCTAGCTTAATCTAAGCACATTTAGTGCAACATTTCCTGATAGACTACTGTTCTAACACAATGGGCCAGTTGCTTGTTCCAAACTTGCTTTAACCAAAAATAGCTGTGTTTTCTCTCTCTCTCTCTCTCTCTCTTTTGTCTTTTCTTTTCATTCTTTCTTTCTTTTTTTTTTTTCCTAAAAAATTTATCAGTTTTTCCCCATTTTTATATTTTACGTCAGTTATTGTCTTTACCACATGAAGTAGCTATAGTTTGAAGTTTTAGCAGCTGTTTGTTCATGTTATCAATTTTCAGATACCTTTATTATGTTTCAGGCCGCATATATCCTTCCGTCTAACTTTGAAGATGTTGCTACTGGTGTGCTGAAGGTGTTGAACAATCTGGCATTTTTGGATCTTAAATTTATGCAGCGAGTGCTAGTAAGTTGGCATTTTTTCAAGTGGAACTTTGTCTATGACAATTTTCAATCTCTCGAACTCAGTTCCAATGTGGATTGTTGAATTTAATCTCTAGATATAAATTACTAGACCATCATGGAAAGAGAGATTAATTACTAGATCTTTGACTCCTGGCTAACTAGGAGTTACATCCTTATCTTCTTTCAGGCTAGGCCAGACCTGAAAATGGAGTTTTTCCATTTAATGAGTTTCCTTCTCTCGCATTGCTCAAGCAAGTGGACAGCTCCTAGTGATCCGGTAATATGTTTGCTATCTTTTACAAGCTGGTGTCAAGTGTAGGAAACATGCTCTTTCCATATGCTAGCACACTTGAGACAAAAATTGGATGTATATTTTTGCATGGACTAATTGATTATTTGGCCACGGAAGTGAACGTCATAGCCGTAGTTGTACATTATAATGATGGGATGTTCCCCTTGGTGGTGTTCCTTAGACTGACATTGATTAACGAAGAAGTCGTTGGAATTTAACACTTCTGTGCAGTATTAGGAATCCCAACGGTAAGGAAGTTTGTAGCAGTCAGAGTATCCTCACAAAACAAAATTTTATGACCTTGACCATGATTGAACTCAACCACTCATCCACCTGCACGCAGCGCTCGAAAATAGATGACTAGGTGCTTATAGAGCTTCCAGCGAGTAATTCCCCTCAAATTTGTAATTTCCAATGAAAACCTCCAACTTCGTTTAACAAGAAAGACTTTGTGTAAAGGCAAAAGAAATAACAGGATGTGGTGTTGGAAATGCAAGCATGGATCCAGGAAATTCATTTGGAAGAAACCTTCTTTTCTGCTAAACTTGCAATAATCCAAAATTTTGTTACCTATTAGAATAAAATCTACAATTTGCTTTTTGTTGAGATAGGTCAGTAGCCACGATTTGTCTGCTGTATGTTATTAAGACTTTTCTCTTGAAGGCGCTATGCACGTCCATTGATACTGATCCTCTGTAATCTATCTTCATGGTTCATACACATATCCATTGAAGTCAATTTTATTCACTGCCTCAATAATTTATCCCTTTTTAATTTATTTGCACCATTTTGTGTTACTATTTAGTTTTAAGCTTTCAAACCTGCTTGATCATCCTCAGATTGGTCTTCTGCTTCTCGAGTCTCTGTCCATTCTTGGGCATTTTGCTTTGTTCCATCCTGAGAATCAAGAAGTCCTTCGATGGGGAAAGAGCCCAACCATACTCCACAAAGTTAGTATACTACCATACAGATCCCATCCCTAGTTTCATATTTTGACGAAATTGACGAATGAGATTTATGTGCTTAGGTGTGCGACCTGCCGTTCGTGTTCTTCAGCGACCCCGAATTGATGCCAGTCTTGGCTAGTACCCTGGTTGCTGCCTGTTATGGGTGTGAACAGAACAAGTCTGTTGTTCAGCAAGAGCTAAGTATAGATATGCTGCTCTCATTATTGAGATCCTGCAAAAATAATCCGCTAGTACCAGCTTTACAATCGACTGCAACCCAAGAGAACGAGGAATCTAATGAATCTAACCCAAATGGTTCCGAGTCTAGAAAACCTCAAATGGACGGAACCATTAGAGCCACCCGCAATGTCAGCAGAATCACAAGAACTTCCTTAGGAAGACCTGGAGGAGGTTCCTCAGGAAATAGCAGTAGGAGTAACAAAACAAGAAACCAAAGAGATAACAGATCAGCTAAAGCATCTGATGAATTAGCTTTGAAGCACATTCAACCAGCAATAGAAGTTGCATCTGTGATGTTGCATTATAGATTCCCTAGCAGTTTTCTTGATAGAGCAGAGCAGTTCTTTTCAGCTGGCATTTCCACTGCTATAGATGGATAAAATGAGAAGGCTTGATGTGGGATTTTCCTGGTTTGTGATTAATTAATTTTAATCATAATTTTGTACATAATTTCAGCTACATTATATATTCAGTTTTGTTTCAAAAAACAAAGTTTTCTCTGTTAAAAAGTACAAAAATCCCAGCTAAAATTTAGAATTACTTTTGATCTTATTGTACATTGATGTCTAATCACAGATTTCATATTTGTTTCAGCCTTGCTTTCAAGCCATGATGTGAATGTAATCAAGGAACAGCTGACAGGTTACAAGAACTTTGAAGACAGTTAACTATACCAACAAATTCAAAAGGTTATATTATAGTCATCATCAATACAAAGAAATAAGTTTTCCAGCTCTACTACACACCATGAAAGGAAGAAATGTTCGTATTCTTTCATTTAATTTACTTTGACCATCAGATCAGATGCTGAAAAGGCACTTGTTTGTTATATATATTAAAAATAGTGTTTTTATATATGAATAATTAATGATTTACAGCAGTTCTATTCCACGAGCATGCAAGTTATAGAAGATAACAATTCAC
mRNA sequence
TTTAACACCCTAAGTCAATCTATCGCTTATTTCTCCTAAATTTTTTTAAAGATTTTATCGGAAAGCAAAACCGAATTAAATTAATAATAAATATAAAAAGCGTATATATTGTTGTATTGCGTTGGCAATGAACGGCGGTTCCGGCACGACGGAGCTCTGCTACATTACGAGTATGATTGTCTTGTCCCTTACAGCCATTTTTCACACACTCATCGTTTTCTGATGCTTTGCATTGTTTGGAAAATGACATCGGAGTTTTGATCAATTGAGATGGAGAACAGCAGTGGAGTGGATGATCAGGGATCAGGATGGTTCGAAGTGAAAAAGCCAATTCCTTTAAATAAGTGTTCAGTTTCTCCACCTTTGGGTACTGGGTTCATGCTTATTTATTTTACGTTCCTGAAGCATAGGAGCAGTTCTAAGTTTTCCCTACAGGGTTGGGTTGGAGGATTTTCAGGAAAAAATTCGTCTAATTCTTCGCGAAGTCCACATCTGGTGAATAAAAATGCCCAAAATGACAGGCCAATTTCAAAGTCTCATCCTCCAGCATCAGGAGGAAGCTATGCTGTATATACCCAAAGTAATACTGAGAATCGTATTGCTACATCAATTGGAGAGGATGGAGGTTCACATTGCCCTGATAAATGCATGGTCAGACAAGACACTGAATTCCCCAAGTCATCTGTTTTGCACGTTACAGATTCTCATGCTGCAAGTGGAGACTGTGAGAAAGTCCCCCATAAGGACATGCCTGGAGTGGTTCAAAAAATCAAATGGGGTGATCTAGAGGATGATAGTTTGGTATTGAATAACAGTGAAAATGGAGTTGAAATAAAGTTTGGTAATATTGGAGAGGTTGATCTAGGGGTCTCCGAAAAAAATGAGGTTAAGGATGGCTTGGCTTCACACGTCTCTACATCCGTAGATACCCAAGCGAACAAATTGGTGGCAGTATCAGTCAGAGAAGAGGAGGCCTCTCATCAGGCCCTTTTATCAACTGATGAGGACAAAGTATGGCGAGTTAGGCATCAAGATATAAATAGGGAATTTATAGAAGCTTTGGAAGTACTGAATAATAGTGAAGCAACAGTCTGTCCGGTTATTGATGGCTCAAATTTCAAGGATACAGGAACTGGACACACCAAACCTGTAGATGATCACAGTTCAAATTTTAGTTCTCCCTCTTGTGTAGAAGCTGGAACTGAGCCAAAAGTCCAGAAGGCTATGAAGTTACCTGAGGTAGAAAATCCAGAGCTCCATGAAGCACCAGGTAAGACTGAGTTTTCTAGCTTGTCATTGCCCGTTCAGAATGCTGAGTTAGTTTCAACTGAAACTAATGGACCTGAAAATTCAGGGGGTACCTCTGATTCGGTTGAAGAAGCGCAAATTGAACAAGGGAGTGGAACTCATAATGTGCAAGTTGTGAGTGTCCCCAGTGAAGGTGAGACAGGTGAGAGTAAAGAAAGGTTTAGGCAGCGGCTCTGGTGCTTCCTTTTTGAGAATCTTAATCGAGCTATAGATGAACTTTATCTCCTTTGTGAACTGGAATGTGACTTGGAACAGATGAAAGAAGCCATTCTTGTTCTTGAAGAGGCTGCATCTGACTTTAAAGAGCTAAATGCTAGGGTGGAGGAGTTTGAAGAGGTAAAGAGGTTGTCTTCTCAGTCTGTTGATGGGATGTCAATCACTATGAAGAGTGACCATTGTAGGCCACATGCTCTCTCTTGGGAGGTTCGACGGATGACAAATTCTCCTCATAAAGCAGAGATTTTATCGTCATCACTTGAGGCATTTAAGAAAATTCGGCAAGAGAGAGCCAACATGCTTGAAGCAAGCAAAAAACTCCCTGGGAATGAGAGTCTAAGTCCTCAGTGTATGGATCAAATGAAGAAAACATGTACAATAAATTGTGTAGTGCATGATGCTGAAGATTCAGCATCAAAGATAGCAGAAAAGAAAGGTGTAGATCTCACTCCAGGGAGTCTTAGTGGAAAAGAAAAGAACACTGAGTCTCTTGGTACCGAGAAGGTTAATGTTGTGCAAAATATACGTTCACGCCCTCAGAATTCCTTGTCTTCTATAAATTCATCCAAGCCACCTCTTGCTGTAAAATTTAAGCGAGAGCAACTTGAATCTGATGTAGAAAGGTTAGTTTCTAAAAGAGAAAGGGCTCTTGCAGAGGGCACATGTGAGAAAAATCTGAAAGCTATAGAACATTCCAAAAGACAAGCAACTGTTCCTGAAAAAGACAAGGAAAAAGAGAAGAGAAATATTGGTGTTCGGAAGTCCATGGATGCATGGAAAGAGAAGAGGAATTGGGAAGACATACTTTCATCATCCGTTTGGATTAGTTCTCGAGTTTCACATTTGCCAGGCATGAGCAGGAAAAGTGCTGAGCGTGTACGCGTGCTGCATGATAAATTAATGTCCCCAGACAAGAAGAAGAAAACTTCTTTAGATGTGAAAAGAGAGGCAGAAGAAAAGCATGCTCGTGCAATGAAAATTAGAAGTGAATTAGAGAATGAAAGGGTTCAGAAACTGCAGCGTACTTCAGAGAAATTAAATCGTGTTAATGAATGGCAGGCTGTACGCACTATGAAATTACGGGAGGGAATGTATGCTCGCCACCAGCGCAGTGAATCTCGGCATGAAGCTTTCCTAGCTCAAGTTGTGAAGAGAGCTGGTGACGAAAGTAGTAAGGTAAATGAGGTTCGTTTCATCACCTCCTTAAATGAAGAAAATAAAAAGATTATGCTGCGACAGAAGCTTCACGGTTCTGAGTTAAGGAGAGCTGAAAAACTTCAGGTAATGAAAATTAAACAAAAGGAAGACATGGCTAGAGAAGAAGCAGTGTTAGAACGGAAGAAATTGATCGAAGCTGAAAAATTGCAGCGTCTTGCTGAAACACAGAGGAAAAAAGAAGAGGCTCATGTTCGAAGGGAAGAAGAGCGAAAAGCATCAAGTGCAGCAAGGGAGGCGAGGGCCATGGAACAGCTTCGAAGGAAAGAGGAAAGAGCAAGAGCCCAGCAAGAAGAAGCTGAACTTATGGCCCAGAAATTGGCTGAGCGACTTAGTGAAAGTGAACAACGACGAAAGTTTTATTTGGAGCAAATACGGGAGAGGGCTTCTATGGATTTTAGGGACCAATCTTCACCCTTGCTGCGACGATATATGCATAAGGATGGTCCAAATAGATCAACACCAAATAATAATGGTGATGAACAGGGACCATGTAGCTCTGACTTGGGTTCTGGTCTTGCAATGGGTAAGACAACAATGCAACAACACATGAAACGAAGGATCAAAAGAATCCGACAGAGGCTTATGGCTCTGAAATATGAATTTTTTGAGCCCGTTAATGGAGCTGAAAATGTTGGCATTGGATATAGAACATCCATAGGGACTGCAAGGGCAAAAATAGGTAGGTGGCTTCAAGAACTTCAGAAACTTCGCCAAGAAAGAAAAGAAGGGGCTGCAAGTTTAGGGCTGATAATTGCTGAAATGATCAAGTACTTGGATGGAAGAGAACTTGAGCTGCAAGCTTCTCGCCAAGCTGGCTTACTTGATTTTATTGCCTCTGCACTTCCTGCTTCTCACACATCTAAACCAGAAGCCTGTCAAGTGATGATACACCTATTAAAATTGCTAAGGGTAGTATTATCTGCATCAGCAAACAGAAGTTACTTTCTTGCGCAGAATCTCTTGCCACCAATCATCCCAATGCTGTCAACAGCCCTTGAGAACTACATTAAGATTGCAGCTTCTATTAATGCCCCTGGTAATGGGGTTCCATCAAGCAGAACGTCTATTGAAAATTTTGAGTCGAGTTCTGAAGTACTGGATGGGTCTTTGTGGACTATCACAACAATAATTGGTCATATAAACCCTGAAGGACCACAACTCCAAATGTGGGATGGTTTGCTAGAACTATTGGTTGCCTACCAGGTTATTCAACGGCTTCGGGATCTTTTTGCACTTTATGATAGACCTCAGGTGGAAGGGTCCCCATTTCCTTCTTCAATTCTCTTGAGTATACGCCTTCTGGTGGTTTTAACATCAAGACCTGGAACTGATAGTACCATTAATTGTGTATTACCATATAGTGAAAAATTGGCAGGAGATGGAAGTGGGATTACTATTTCTGCCGAGTCTAGAGATCTTTTTGGAACTGGTTTCACTGAGGATGGTAGCCCTTCAGAATCTGGGTTAAATGGTGCTAAAATTGTACAAAAGCCAAAAATGACCGTTGATCAACTGGATGATGAATCATGTGAGCAAAAGATAAACGATGGGATGATACCCAGTGATGGTGGTAAGAGGGAGCAGACGGATTGCTCGATTAAGGCAAATGGTGTTAATCTTATCCAGTCTGAAAATCAGGATGAACCGCCAGATAGTGAAGTTGTCTTGAAACCTTTTGTATCCCAAGAGGATCAGAAAAAACCAGTGGATCTTGTTTCTGACTTGAGGATAAAAAATGTTACAAAATTGAAACCACCAATAGCCTATCTATTGTCTGCTATATCTGATACTGGAATTGTTGGTCTCCTGTCTTTATTAACAGCCGTATTGCTTCAAGCAAACAATAGATTATCATCCGAACAGGCCGCATATATCCTTCCGTCTAACTTTGAAGATGTTGCTACTGGTGTGCTGAAGGTGTTGAACAATCTGGCATTTTTGGATCTTAAATTTATGCAGCGAGTGCTAGCTAGGCCAGACCTGAAAATGGAGTTTTTCCATTTAATGAGTTTCCTTCTCTCGCATTGCTCAAGCAAGTGGACAGCTCCTAGTGATCCGATTGGTCTTCTGCTTCTCGAGTCTCTGTCCATTCTTGGGCATTTTGCTTTGTTCCATCCTGAGAATCAAGAAGTCCTTCGATGGGGAAAGAGCCCAACCATACTCCACAAAGTGTGCGACCTGCCGTTCGTGTTCTTCAGCGACCCCGAATTGATGCCAGTCTTGGCTAGTACCCTGGTTGCTGCCTGTTATGGGTGTGAACAGAACAAGTCTGTTGTTCAGCAAGAGCTAAGTATAGATATGCTGCTCTCATTATTGAGATCCTGCAAAAATAATCCGCTAGTACCAGCTTTACAATCGACTGCAACCCAAGAGAACGAGGAATCTAATGAATCTAACCCAAATGGTTCCGAGTCTAGAAAACCTCAAATGGACGGAACCATTAGAGCCACCCGCAATGTCAGCAGAATCACAAGAACTTCCTTAGGAAGACCTGGAGGAGGTTCCTCAGGAAATAGCAGTAGGAGTAACAAAACAAGAAACCAAAGAGATAACAGATCAGCTAAAGCATCTGATGAATTAGCTTTGAAGCACATTCAACCAGCAATAGAAGTTGCATCTGTGATGTTGCATTATAGATTCCCTAGCAGTTTTCTTGATAGAGCAGAGCAGTTCTTTTCAGCTGGCATTTCCACTGCTATAGATGGATAAAATGAGAAGGCTTGATGTGGGATTTTCCTGCCTTGCTTTCAAGCCATGATGTGAATGTAATCAAGGAACAGCTGACAGGTTACAAGAACTTTGAAGACAGTTAACTATACCAACAAATTCAAAAGGTTATATTATAGTCATCATCAATACAAAGAAATAAGTTTTCCAGCTCTACTACACACCATGAAAGGAAGAAATGTTCGTATTCTTTCATTTAATTTACTTTGACCATCAGATCAGATGCTGAAAAGGCACTTGTTTGTTATATATATTAAAAATAGTGTTTTTATATATGAATAATTAATGATTTACAGCAGTTCTATTCCACGAGCATGCAAGTTATAGAAGATAACAATTCAC
Coding sequence (CDS)
ATGGAGAACAGCAGTGGAGTGGATGATCAGGGATCAGGATGGTTCGAAGTGAAAAAGCCAATTCCTTTAAATAAGTGTTCAGTTTCTCCACCTTTGGGTACTGGGTTCATGCTTATTTATTTTACGTTCCTGAAGCATAGGAGCAGTTCTAAGTTTTCCCTACAGGGTTGGGTTGGAGGATTTTCAGGAAAAAATTCGTCTAATTCTTCGCGAAGTCCACATCTGGTGAATAAAAATGCCCAAAATGACAGGCCAATTTCAAAGTCTCATCCTCCAGCATCAGGAGGAAGCTATGCTGTATATACCCAAAGTAATACTGAGAATCGTATTGCTACATCAATTGGAGAGGATGGAGGTTCACATTGCCCTGATAAATGCATGGTCAGACAAGACACTGAATTCCCCAAGTCATCTGTTTTGCACGTTACAGATTCTCATGCTGCAAGTGGAGACTGTGAGAAAGTCCCCCATAAGGACATGCCTGGAGTGGTTCAAAAAATCAAATGGGGTGATCTAGAGGATGATAGTTTGGTATTGAATAACAGTGAAAATGGAGTTGAAATAAAGTTTGGTAATATTGGAGAGGTTGATCTAGGGGTCTCCGAAAAAAATGAGGTTAAGGATGGCTTGGCTTCACACGTCTCTACATCCGTAGATACCCAAGCGAACAAATTGGTGGCAGTATCAGTCAGAGAAGAGGAGGCCTCTCATCAGGCCCTTTTATCAACTGATGAGGACAAAGTATGGCGAGTTAGGCATCAAGATATAAATAGGGAATTTATAGAAGCTTTGGAAGTACTGAATAATAGTGAAGCAACAGTCTGTCCGGTTATTGATGGCTCAAATTTCAAGGATACAGGAACTGGACACACCAAACCTGTAGATGATCACAGTTCAAATTTTAGTTCTCCCTCTTGTGTAGAAGCTGGAACTGAGCCAAAAGTCCAGAAGGCTATGAAGTTACCTGAGGTAGAAAATCCAGAGCTCCATGAAGCACCAGGTAAGACTGAGTTTTCTAGCTTGTCATTGCCCGTTCAGAATGCTGAGTTAGTTTCAACTGAAACTAATGGACCTGAAAATTCAGGGGGTACCTCTGATTCGGTTGAAGAAGCGCAAATTGAACAAGGGAGTGGAACTCATAATGTGCAAGTTGTGAGTGTCCCCAGTGAAGGTGAGACAGGTGAGAGTAAAGAAAGGTTTAGGCAGCGGCTCTGGTGCTTCCTTTTTGAGAATCTTAATCGAGCTATAGATGAACTTTATCTCCTTTGTGAACTGGAATGTGACTTGGAACAGATGAAAGAAGCCATTCTTGTTCTTGAAGAGGCTGCATCTGACTTTAAAGAGCTAAATGCTAGGGTGGAGGAGTTTGAAGAGGTAAAGAGGTTGTCTTCTCAGTCTGTTGATGGGATGTCAATCACTATGAAGAGTGACCATTGTAGGCCACATGCTCTCTCTTGGGAGGTTCGACGGATGACAAATTCTCCTCATAAAGCAGAGATTTTATCGTCATCACTTGAGGCATTTAAGAAAATTCGGCAAGAGAGAGCCAACATGCTTGAAGCAAGCAAAAAACTCCCTGGGAATGAGAGTCTAAGTCCTCAGTGTATGGATCAAATGAAGAAAACATGTACAATAAATTGTGTAGTGCATGATGCTGAAGATTCAGCATCAAAGATAGCAGAAAAGAAAGGTGTAGATCTCACTCCAGGGAGTCTTAGTGGAAAAGAAAAGAACACTGAGTCTCTTGGTACCGAGAAGGTTAATGTTGTGCAAAATATACGTTCACGCCCTCAGAATTCCTTGTCTTCTATAAATTCATCCAAGCCACCTCTTGCTGTAAAATTTAAGCGAGAGCAACTTGAATCTGATGTAGAAAGGTTAGTTTCTAAAAGAGAAAGGGCTCTTGCAGAGGGCACATGTGAGAAAAATCTGAAAGCTATAGAACATTCCAAAAGACAAGCAACTGTTCCTGAAAAAGACAAGGAAAAAGAGAAGAGAAATATTGGTGTTCGGAAGTCCATGGATGCATGGAAAGAGAAGAGGAATTGGGAAGACATACTTTCATCATCCGTTTGGATTAGTTCTCGAGTTTCACATTTGCCAGGCATGAGCAGGAAAAGTGCTGAGCGTGTACGCGTGCTGCATGATAAATTAATGTCCCCAGACAAGAAGAAGAAAACTTCTTTAGATGTGAAAAGAGAGGCAGAAGAAAAGCATGCTCGTGCAATGAAAATTAGAAGTGAATTAGAGAATGAAAGGGTTCAGAAACTGCAGCGTACTTCAGAGAAATTAAATCGTGTTAATGAATGGCAGGCTGTACGCACTATGAAATTACGGGAGGGAATGTATGCTCGCCACCAGCGCAGTGAATCTCGGCATGAAGCTTTCCTAGCTCAAGTTGTGAAGAGAGCTGGTGACGAAAGTAGTAAGGTAAATGAGGTTCGTTTCATCACCTCCTTAAATGAAGAAAATAAAAAGATTATGCTGCGACAGAAGCTTCACGGTTCTGAGTTAAGGAGAGCTGAAAAACTTCAGGTAATGAAAATTAAACAAAAGGAAGACATGGCTAGAGAAGAAGCAGTGTTAGAACGGAAGAAATTGATCGAAGCTGAAAAATTGCAGCGTCTTGCTGAAACACAGAGGAAAAAAGAAGAGGCTCATGTTCGAAGGGAAGAAGAGCGAAAAGCATCAAGTGCAGCAAGGGAGGCGAGGGCCATGGAACAGCTTCGAAGGAAAGAGGAAAGAGCAAGAGCCCAGCAAGAAGAAGCTGAACTTATGGCCCAGAAATTGGCTGAGCGACTTAGTGAAAGTGAACAACGACGAAAGTTTTATTTGGAGCAAATACGGGAGAGGGCTTCTATGGATTTTAGGGACCAATCTTCACCCTTGCTGCGACGATATATGCATAAGGATGGTCCAAATAGATCAACACCAAATAATAATGGTGATGAACAGGGACCATGTAGCTCTGACTTGGGTTCTGGTCTTGCAATGGGTAAGACAACAATGCAACAACACATGAAACGAAGGATCAAAAGAATCCGACAGAGGCTTATGGCTCTGAAATATGAATTTTTTGAGCCCGTTAATGGAGCTGAAAATGTTGGCATTGGATATAGAACATCCATAGGGACTGCAAGGGCAAAAATAGGTAGGTGGCTTCAAGAACTTCAGAAACTTCGCCAAGAAAGAAAAGAAGGGGCTGCAAGTTTAGGGCTGATAATTGCTGAAATGATCAAGTACTTGGATGGAAGAGAACTTGAGCTGCAAGCTTCTCGCCAAGCTGGCTTACTTGATTTTATTGCCTCTGCACTTCCTGCTTCTCACACATCTAAACCAGAAGCCTGTCAAGTGATGATACACCTATTAAAATTGCTAAGGGTAGTATTATCTGCATCAGCAAACAGAAGTTACTTTCTTGCGCAGAATCTCTTGCCACCAATCATCCCAATGCTGTCAACAGCCCTTGAGAACTACATTAAGATTGCAGCTTCTATTAATGCCCCTGGTAATGGGGTTCCATCAAGCAGAACGTCTATTGAAAATTTTGAGTCGAGTTCTGAAGTACTGGATGGGTCTTTGTGGACTATCACAACAATAATTGGTCATATAAACCCTGAAGGACCACAACTCCAAATGTGGGATGGTTTGCTAGAACTATTGGTTGCCTACCAGGTTATTCAACGGCTTCGGGATCTTTTTGCACTTTATGATAGACCTCAGGTGGAAGGGTCCCCATTTCCTTCTTCAATTCTCTTGAGTATACGCCTTCTGGTGGTTTTAACATCAAGACCTGGAACTGATAGTACCATTAATTGTGTATTACCATATAGTGAAAAATTGGCAGGAGATGGAAGTGGGATTACTATTTCTGCCGAGTCTAGAGATCTTTTTGGAACTGGTTTCACTGAGGATGGTAGCCCTTCAGAATCTGGGTTAAATGGTGCTAAAATTGTACAAAAGCCAAAAATGACCGTTGATCAACTGGATGATGAATCATGTGAGCAAAAGATAAACGATGGGATGATACCCAGTGATGGTGGTAAGAGGGAGCAGACGGATTGCTCGATTAAGGCAAATGGTGTTAATCTTATCCAGTCTGAAAATCAGGATGAACCGCCAGATAGTGAAGTTGTCTTGAAACCTTTTGTATCCCAAGAGGATCAGAAAAAACCAGTGGATCTTGTTTCTGACTTGAGGATAAAAAATGTTACAAAATTGAAACCACCAATAGCCTATCTATTGTCTGCTATATCTGATACTGGAATTGTTGGTCTCCTGTCTTTATTAACAGCCGTATTGCTTCAAGCAAACAATAGATTATCATCCGAACAGGCCGCATATATCCTTCCGTCTAACTTTGAAGATGTTGCTACTGGTGTGCTGAAGGTGTTGAACAATCTGGCATTTTTGGATCTTAAATTTATGCAGCGAGTGCTAGCTAGGCCAGACCTGAAAATGGAGTTTTTCCATTTAATGAGTTTCCTTCTCTCGCATTGCTCAAGCAAGTGGACAGCTCCTAGTGATCCGATTGGTCTTCTGCTTCTCGAGTCTCTGTCCATTCTTGGGCATTTTGCTTTGTTCCATCCTGAGAATCAAGAAGTCCTTCGATGGGGAAAGAGCCCAACCATACTCCACAAAGTGTGCGACCTGCCGTTCGTGTTCTTCAGCGACCCCGAATTGATGCCAGTCTTGGCTAGTACCCTGGTTGCTGCCTGTTATGGGTGTGAACAGAACAAGTCTGTTGTTCAGCAAGAGCTAAGTATAGATATGCTGCTCTCATTATTGAGATCCTGCAAAAATAATCCGCTAGTACCAGCTTTACAATCGACTGCAACCCAAGAGAACGAGGAATCTAATGAATCTAACCCAAATGGTTCCGAGTCTAGAAAACCTCAAATGGACGGAACCATTAGAGCCACCCGCAATGTCAGCAGAATCACAAGAACTTCCTTAGGAAGACCTGGAGGAGGTTCCTCAGGAAATAGCAGTAGGAGTAACAAAACAAGAAACCAAAGAGATAACAGATCAGCTAAAGCATCTGATGAATTAGCTTTGAAGCACATTCAACCAGCAATAGAAGTTGCATCTGTGATGTTGCATTATAGATTCCCTAGCAGTTTTCTTGATAGAGCAGAGCAGTTCTTTTCAGCTGGCATTTCCACTGCTATAGATGGATAA
Protein sequence
MENSSGVDDQGSGWFEVKKPIPLNKCSVSPPLGTGFMLIYFTFLKHRSSSKFSLQGWVGGFSGKNSSNSSRSPHLVNKNAQNDRPISKSHPPASGGSYAVYTQSNTENRIATSIGEDGGSHCPDKCMVRQDTEFPKSSVLHVTDSHAASGDCEKVPHKDMPGVVQKIKWGDLEDDSLVLNNSENGVEIKFGNIGEVDLGVSEKNEVKDGLASHVSTSVDTQANKLVAVSVREEEASHQALLSTDEDKVWRVRHQDINREFIEALEVLNNSEATVCPVIDGSNFKDTGTGHTKPVDDHSSNFSSPSCVEAGTEPKVQKAMKLPEVENPELHEAPGKTEFSSLSLPVQNAELVSTETNGPENSGGTSDSVEEAQIEQGSGTHNVQVVSVPSEGETGESKERFRQRLWCFLFENLNRAIDELYLLCELECDLEQMKEAILVLEEAASDFKELNARVEEFEEVKRLSSQSVDGMSITMKSDHCRPHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNESLSPQCMDQMKKTCTINCVVHDAEDSASKIAEKKGVDLTPGSLSGKEKNTESLGTEKVNVVQNIRSRPQNSLSSINSSKPPLAVKFKREQLESDVERLVSKRERALAEGTCEKNLKAIEHSKRQATVPEKDKEKEKRNIGVRKSMDAWKEKRNWEDILSSSVWISSRVSHLPGMSRKSAERVRVLHDKLMSPDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLREGMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRRAEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAAREARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQSSPLLRRYMHKDGPNRSTPNNNGDEQGPCSSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMALKYEFFEPVNGAENVGIGYRTSIGTARAKIGRWLQELQKLRQERKEGAASLGLIIAEMIKYLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFLAQNLLPPIIPMLSTALENYIKIAASINAPGNGVPSSRTSIENFESSSEVLDGSLWTITTIIGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVVLTSRPGTDSTINCVLPYSEKLAGDGSGITISAESRDLFGTGFTEDGSPSESGLNGAKIVQKPKMTVDQLDDESCEQKINDGMIPSDGGKREQTDCSIKANGVNLIQSENQDEPPDSEVVLKPFVSQEDQKKPVDLVSDLRIKNVTKLKPPIAYLLSAISDTGIVGLLSLLTAVLLQANNRLSSEQAAYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSHCSSKWTAPSDPIGLLLLESLSILGHFALFHPENQEVLRWGKSPTILHKVCDLPFVFFSDPELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNNPLVPALQSTATQENEESNESNPNGSESRKPQMDGTIRATRNVSRITRTSLGRPGGGSSGNSSRSNKTRNQRDNRSAKASDELALKHIQPAIEVASVMLHYRFPSSFLDRAEQFFSAGISTAIDG
Homology
BLAST of CmUC06G124080 vs. NCBI nr
Match:
XP_038878649.1 (uncharacterized protein LOC120070833 [Benincasa hispida])
HSP 1 Score: 2894.4 bits (7502), Expect = 0.0e+00
Identity = 1549/1686 (91.87%), Postives = 1593/1686 (94.48%), Query Frame = 0
Query: 41 FTFLKHRSSSKFSLQGWVGGFSGKNSSNSSRSPHLVNKNAQNDRPISKSHPPASGGSYAV 100
F F KHRSSSKFSLQ WVGGF+GKNSSNS RSP+LVNKN QNDR SKSHPP SGGSYAV
Sbjct: 29 FLFEKHRSSSKFSLQSWVGGFTGKNSSNSLRSPNLVNKNPQNDRSNSKSHPPTSGGSYAV 88
Query: 101 YTQSNTENRIATSIGEDGGSHCPDKCMVRQDTEFPKSSVLHVTDSHAASGDCEKVPHKDM 160
YTQSNTEN IATSIG +G SHCPDKCMVRQDTEFP+SSVLH+ DSHA SG+CEKVPHKDM
Sbjct: 89 YTQSNTENSIATSIGGNGDSHCPDKCMVRQDTEFPESSVLHIADSHAGSGECEKVPHKDM 148
Query: 161 PGVVQKIKWGDLEDDSLVLNNSENGVEIKFGNIGEVDLGVSEKNEVKDGLASHVSTSVDT 220
PG+VQKIKWGDLEDDSLVLNNS NGVEIKFGNIGEVDLGVSEKNEVK LAS VS+S+DT
Sbjct: 149 PGLVQKIKWGDLEDDSLVLNNSANGVEIKFGNIGEVDLGVSEKNEVKHDLASLVSSSIDT 208
Query: 221 QANKLVAVSVREEEASHQALLSTDEDKVWRVRHQDINREFIEALEVLNNSEATVCPVIDG 280
QANKLVA SVREEEASHQ LLST+E+K+ +V HQDINREFIE LEVL+NSEATVCPV DG
Sbjct: 209 QANKLVAASVREEEASHQTLLSTNEEKLCQVSHQDINREFIEDLEVLSNSEATVCPVTDG 268
Query: 281 SNFKDTGTGHTKPVDDHSSNFSSPSCVEAGTEPKVQKAMKLPEVENPELHEAPGKTEFSS 340
SNFK GT HTKPVDDHSSNF+S SC E TEPKV KA+KLPEVENP LHEA GKTE S
Sbjct: 269 SNFKGIGTEHTKPVDDHSSNFNSLSCEEIVTEPKVLKAIKLPEVENPVLHEAAGKTEICS 328
Query: 341 LSLPVQNAELVSTETNGPENSGGTSDSVEEAQIEQGSGTHNVQVVSVPSEGETGESKERF 400
LSLPVQNAE V TET+ ENSGG+ D VEE QIEQGSGTHNVQV SVPSEGETGESKERF
Sbjct: 329 LSLPVQNAESVPTETSRRENSGGSFDLVEEPQIEQGSGTHNVQVASVPSEGETGESKERF 388
Query: 401 RQRLWCFLFENLNRAIDELYLLCELECDLEQMKEAILVLEEAASDFKELNARVEEFEEVK 460
RQRLWCFLFENLNRA+DELYLLCELECDLEQMKEAILVLEEAASDFKELNARVEEFEEVK
Sbjct: 389 RQRLWCFLFENLNRAVDELYLLCELECDLEQMKEAILVLEEAASDFKELNARVEEFEEVK 448
Query: 461 RLSSQSVDGMSITMKSDHCRPHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLE 520
RLSSQSVDGM ITMKSDHCRPHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLE
Sbjct: 449 RLSSQSVDGMPITMKSDHCRPHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLE 508
Query: 521 ASKKLPGNESLSPQCMDQMKKTCTINCVVHDAEDSASKIAEKKGVDLTPGSLSGKEKNTE 580
ASKKLPGNE LSPQCMDQMKKT TINCVVH+A DSASK AE KGVDLTPGSLSGKEKNTE
Sbjct: 509 ASKKLPGNECLSPQCMDQMKKTSTINCVVHEAADSASKTAENKGVDLTPGSLSGKEKNTE 568
Query: 581 SLGTEKVNVVQNIRSRPQNSLSSINSSKPPLAVKFKREQLESDVERLVSKRERALAEGTC 640
SLGT+KVNVVQNIRSRPQNSLS INSSKPPLAVKFKR+QLESDVERLVS+RERALAEGTC
Sbjct: 569 SLGTDKVNVVQNIRSRPQNSLSFINSSKPPLAVKFKRDQLESDVERLVSRRERALAEGTC 628
Query: 641 EKNLKAIEHSKRQATVPEKDKEKEKRNIGVRKSMDAWKEKRNWEDILSSSVWISSRVSHL 700
EK K IEHSKRQATV EKDKEKEKRN+G RKSMDAWKEKRNWEDILSSSV ISSRVSHL
Sbjct: 629 EKIQKPIEHSKRQATVSEKDKEKEKRNMGARKSMDAWKEKRNWEDILSSSVRISSRVSHL 688
Query: 701 PGMSRKSAERVRVLHDKLMSPDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRT 760
PGMSRKSAERVRVLHDKLMSPDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRT
Sbjct: 689 PGMSRKSAERVRVLHDKLMSPDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRT 748
Query: 761 SEKLNRVNEWQAVRTMKLREGMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSL 820
SEKLNRVNEWQAVRTMKLREGMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSL
Sbjct: 749 SEKLNRVNEWQAVRTMKLREGMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSL 808
Query: 821 NEENKKIMLRQKLHGSELRRAEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQ 880
NEENKKIMLRQKLHGSELRRAEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQ
Sbjct: 809 NEENKKIMLRQKLHGSELRRAEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQ 868
Query: 881 RKKEEAHVRREEERKASSAAREARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQR 940
RKKEEAHVRREEERKASSAAREARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQR
Sbjct: 869 RKKEEAHVRREEERKASSAAREARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQR 928
Query: 941 RKFYLEQIRERASMDFRDQSSPLLRRYMHKDGPNRSTPNNNGDEQGPCSSDLGSGLAMGK 1000
RKFYLEQIRERASMDFRDQSSPLLRRYMHKDGP++STPNNNGDEQGP SSDLGSGLA+GK
Sbjct: 929 RKFYLEQIRERASMDFRDQSSPLLRRYMHKDGPSKSTPNNNGDEQGPSSSDLGSGLALGK 988
Query: 1001 TTMQQHMKRRIKRIRQRLMALKYEFFEPVNGAENVGIGYRTSIGTARAKIGRWLQELQKL 1060
TTMQQHMKRRIKRIRQRLMALKYEF EPVNGAENVGIGYRTSIGTARAKIGRWLQELQKL
Sbjct: 989 TTMQQHMKRRIKRIRQRLMALKYEFVEPVNGAENVGIGYRTSIGTARAKIGRWLQELQKL 1048
Query: 1061 RQERKEGAASLGLIIAEMIKYLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMI 1120
RQ RKEGAASLGLIIAEMIKYLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMI
Sbjct: 1049 RQARKEGAASLGLIIAEMIKYLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMI 1108
Query: 1121 HLLKLLRVVLSASANRSYFLAQNLLPPIIPMLSTALENYIKIAASINAPGNGVPSSRTSI 1180
HLLKLLRVVLSASANRSYFLAQNLLPPIIPMLSTALENYIKIAASINAPGNGVPSSRTSI
Sbjct: 1109 HLLKLLRVVLSASANRSYFLAQNLLPPIIPMLSTALENYIKIAASINAPGNGVPSSRTSI 1168
Query: 1181 ENFESSSEVLDGSLWTITTIIGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRP 1240
ENFESSSEVLDGSLWTITTIIGHIN EGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRP
Sbjct: 1169 ENFESSSEVLDGSLWTITTIIGHINSEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRP 1228
Query: 1241 QVEGSPFPSSILLSIRLLVVLTSRPGTDSTINCVLPYSEKLAGDGSGITISAESRDLFGT 1300
QVEGSPFPSSILLSIRLLVVLTSRPGTDSTI+CVLPYSEK AGD SGI IS S D GT
Sbjct: 1229 QVEGSPFPSSILLSIRLLVVLTSRPGTDSTISCVLPYSEKWAGDESGIAISVVSGDFLGT 1288
Query: 1301 GFTEDGSPSESGLNGAKIVQKPKMTVDQLDDESCEQKINDGMIPSDGGKREQTDCSIKAN 1360
G TED SP ESGLNG KIVQ PKMTVDQLDDESCEQKI +GM PSD G+REQTDCSI+AN
Sbjct: 1289 GSTEDSSPPESGLNGGKIVQNPKMTVDQLDDESCEQKIKNGMKPSDCGQREQTDCSIEAN 1348
Query: 1361 GVNLIQSENQDEPPDSEVVLKPFVSQEDQKKPVDLVSDLRIKNVTKLKPPIAYLLSAISD 1420
GVNLIQ+ D+P D EVVLK FVSQ D+K+PVDLVSD RIKN+TKLKPP+AYLLSAISD
Sbjct: 1349 GVNLIQT---DDPQDGEVVLKSFVSQGDKKQPVDLVSDERIKNITKLKPPLAYLLSAISD 1408
Query: 1421 TGIVGLLSLLTAVLLQANNRLSSEQAAYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVL 1480
TGIVGLLSLLTAVLLQANNRLSSEQA+YILPSNFEDVATGVLKVLNNLA LDLKFMQRVL
Sbjct: 1409 TGIVGLLSLLTAVLLQANNRLSSEQASYILPSNFEDVATGVLKVLNNLASLDLKFMQRVL 1468
Query: 1481 ARPDLKMEFFHLMSFLLSHCSSKWTAPSDPIGLLLLESLSILGHFALFHPENQEVLRWGK 1540
ARPDLKMEFFHLMSFLLSHCSSKWTAPSDPIGLLLLESLSILGHFALFHPENQEVLRWGK
Sbjct: 1469 ARPDLKMEFFHLMSFLLSHCSSKWTAPSDPIGLLLLESLSILGHFALFHPENQEVLRWGK 1528
Query: 1541 SPTILHKVCDLPFVFFSDPELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCK 1600
SPTILHKVCDLPFV FSDPELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCK
Sbjct: 1529 SPTILHKVCDLPFVLFSDPELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCK 1588
Query: 1601 NNPLVPALQSTATQENEESNESNPNGSESRKPQMDGTIRATRNVSRITRTSLGRPGGGSS 1660
NNPLVPALQST+TQENEESNESNPNG ESRKPQ DGTIR TRNV R+T+TSLGRPGGGSS
Sbjct: 1589 NNPLVPALQSTSTQENEESNESNPNGPESRKPQTDGTIRTTRNVGRMTKTSLGRPGGGSS 1648
Query: 1661 GNSSRSNKTRNQRDNRSAKASDELALKHIQPAIEVASVMLHYRFPSSFLDRAEQFFSAGI 1720
GNS+RSNKTRN RDNRSAKASDE+ALKH QPAIEVASVMLHYRFPSSFLDRAEQFFS GI
Sbjct: 1649 GNSNRSNKTRNHRDNRSAKASDEIALKHNQPAIEVASVMLHYRFPSSFLDRAEQFFSTGI 1708
Query: 1721 STAIDG 1727
STA+DG
Sbjct: 1709 STAVDG 1711
BLAST of CmUC06G124080 vs. NCBI nr
Match:
XP_008453238.1 (PREDICTED: uncharacterized protein LOC103494020 isoform X1 [Cucumis melo] >XP_008453239.1 PREDICTED: uncharacterized protein LOC103494020 isoform X1 [Cucumis melo] >XP_008453240.1 PREDICTED: uncharacterized protein LOC103494020 isoform X1 [Cucumis melo] >XP_008453241.1 PREDICTED: uncharacterized protein LOC103494020 isoform X1 [Cucumis melo] >KAA0057973.1 uncharacterized protein E6C27_scaffold274G002770 [Cucumis melo var. makuwa])
HSP 1 Score: 2845.5 bits (7375), Expect = 0.0e+00
Identity = 1535/1727 (88.88%), Postives = 1588/1727 (91.95%), Query Frame = 0
Query: 1 MENSSGVDDQGSGWFEVKKPIPLNKCSVSPPLGTGFMLIYFTFLKHRSSSKFSLQGWVGG 60
MENS GVDDQGSGWFEVKK KHRSSSKFS+Q WVGG
Sbjct: 1 MENSGGVDDQGSGWFEVKK-------------------------KHRSSSKFSVQSWVGG 60
Query: 61 FSGKNSSNSSRSPHLVNKNAQNDRPISKSHPPASGGSYAVYTQSNTENRIATSIGEDGGS 120
FSGKN SNS RS +LVNKN +NDR SKSHPP +GGSYAVYTQSN EN +AT+IG+DGGS
Sbjct: 61 FSGKNLSNSLRSTNLVNKNLRNDRSNSKSHPPTTGGSYAVYTQSNAENCVATAIGDDGGS 120
Query: 121 HCPDKCMVRQDTEFPKSSVLHVTDSHAASGDCEKVPHKDMPGVVQKIKWGDLEDDSLVLN 180
+ PDKCMVRQDTEFPKSSVLH+ DS+A SG+CEKV H+DMPGVVQKIKWGDLEDDSLVLN
Sbjct: 121 YSPDKCMVRQDTEFPKSSVLHIADSNAGSGECEKVSHRDMPGVVQKIKWGDLEDDSLVLN 180
Query: 181 NSENGVEIKFGNIGEVDLGVSEKNEVKDGLASHVSTSVDTQANKLVAVSVREEEASHQAL 240
NS NGVEIKFGNIGEVDLGVSEKNEVK LAS VS+SVDTQ N LVA SVR+EEASHQ L
Sbjct: 181 NSANGVEIKFGNIGEVDLGVSEKNEVKHDLASTVSSSVDTQVNALVAASVRDEEASHQTL 240
Query: 241 LSTDEDKVWRVRHQDINREFIEALEVLNNSEATVCPVIDGSNFKDTGTGHTKPVDDHSSN 300
LST+E +V HQDINREFIE L+VL+N+E TVC VIDGSNFKDT +TKPVD+H SN
Sbjct: 241 LSTNEINSCQVSHQDINREFIEDLKVLSNNEVTVCSVIDGSNFKDTRNENTKPVDNHISN 300
Query: 301 FSSPSCVEAGTEPKVQKAMKLPEVENPELHEAPGKTEFSSLSLPVQNAELVSTETNGPEN 360
F SPSC EAGTEPKVQK +KL EVENP L E GK E SSLSLPVQNAE VST+T G EN
Sbjct: 301 FDSPSCEEAGTEPKVQKVIKLHEVENPVLPETSGKPEISSLSLPVQNAESVSTKTCGHEN 360
Query: 361 SGGTSDSVEEAQIEQGSGTHNVQVVSVPSEGETGESKERFRQRLWCFLFENLNRAIDELY 420
SGG+SDSVEE QIEQGSGTHNVQVVS PSEG TGESKERFRQRLWCFLFENLNRA+DELY
Sbjct: 361 SGGSSDSVEEPQIEQGSGTHNVQVVSAPSEGATGESKERFRQRLWCFLFENLNRAVDELY 420
Query: 421 LLCELECDLEQMKEAILVLEEAASDFKELNARVEEFEEVKRLSSQSVDGMSITMKSDHCR 480
LLCELECDLEQMKEAILVLEEAASDFKELN RVEEFEEVKRLSSQS+DGM ITMKSDHCR
Sbjct: 421 LLCELECDLEQMKEAILVLEEAASDFKELNTRVEEFEEVKRLSSQSIDGMPITMKSDHCR 480
Query: 481 PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNESLSPQCMDQMK 540
PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNE LSPQCMDQMK
Sbjct: 481 PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNECLSPQCMDQMK 540
Query: 541 KTCTINCVVHDAEDSASKIAEKKGVDLTPGSLSGKEKNTESLGTEKVNVVQNIRSRPQNS 600
+ TINCVVHDA DSASK A K VDLTPGSLSGKEKNTESLG++KVNV QNIRSRPQNS
Sbjct: 541 EISTINCVVHDAPDSASKTAGNKDVDLTPGSLSGKEKNTESLGSDKVNVAQNIRSRPQNS 600
Query: 601 LSSINSSKPPLAVKFKREQLESDVERLVSKRERALAEGTCEKNLKAIEHSKRQATVPEKD 660
LSSINSSKPPLAVKFKREQLESDVERLVS+RERALAEGTCEK K IEHSKRQATV EKD
Sbjct: 601 LSSINSSKPPLAVKFKREQLESDVERLVSRRERALAEGTCEKTQKPIEHSKRQATVSEKD 660
Query: 661 KEKEKRNIGVRKSMDAWKEKRNWEDILSSSVWISSRVSHLPGMSRKSAERVRVLHDKLMS 720
KEKEKRN+G RKSMDAWKEKRNWEDILSSSV ISSRVSHLPGMS+KSAERVRVLHDKLMS
Sbjct: 661 KEKEKRNMGARKSMDAWKEKRNWEDILSSSVRISSRVSHLPGMSKKSAERVRVLHDKLMS 720
Query: 721 PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE 780
PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE
Sbjct: 721 PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE 780
Query: 781 GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR 840
GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR
Sbjct: 781 GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR 840
Query: 841 AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA 900
AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA
Sbjct: 841 AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA 900
Query: 901 REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS 960
REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS
Sbjct: 901 REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS 960
Query: 961 SPLLRRYMHKDGPNRSTPNNNGDEQGPCSSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA 1020
SPLLRRYMHKDGP+RS NNNGDEQGP SSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA
Sbjct: 961 SPLLRRYMHKDGPSRSATNNNGDEQGPSSSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA 1020
Query: 1021 LKYEFFEPVNGAENVGIGYRTSIGTARAKIGRWLQELQKLRQERKEGAASLGLIIAEMIK 1080
LKYEF EP+NGAENV IGYRTSIGTARAKIGRWLQELQKLRQ RKEGAASLGLIIAEMIK
Sbjct: 1021 LKYEFVEPINGAENVSIGYRTSIGTARAKIGRWLQELQKLRQARKEGAASLGLIIAEMIK 1080
Query: 1081 YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL 1140
YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL
Sbjct: 1081 YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL 1140
Query: 1141 AQNLLPPIIPMLSTALENYIKIAASINAPGNGVPSSRTSIENFESSSEVLDGSLWTITTI 1200
AQNLLPPIIPMLSTALENYIKIAAS+NAPG+GV SSRTSIENFESSSEVLDGSLWTITTI
Sbjct: 1141 AQNLLPPIIPMLSTALENYIKIAASVNAPGSGVQSSRTSIENFESSSEVLDGSLWTITTI 1200
Query: 1201 IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV 1260
IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV
Sbjct: 1201 IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV 1260
Query: 1261 LTSRPGTDSTINCVLPYSEKLAGDGSGITISAESRDLFGTGFTEDGSPSESGLNGAKIVQ 1320
LTSRPG DSTINCVLP+SE L G SGI IS SRD TGFTEDG P ESGLNG KI+Q
Sbjct: 1261 LTSRPGIDSTINCVLPFSENLTGVESGIAISTMSRDFPRTGFTEDGIPLESGLNGGKILQ 1320
Query: 1321 KPKMTVDQLDDESCEQKINDGMIPSDGGKRE-QTDCSIKANGVNLIQSENQDEPPDSEVV 1380
PKMTVDQL DESCEQKI +GMIPSDGG+RE QTDC I+ANGVN++Q + QDEP D E+V
Sbjct: 1321 NPKMTVDQL-DESCEQKIKNGMIPSDGGQREQQTDCFIEANGVNIVQIDIQDEPQDGEIV 1380
Query: 1381 LKPFVSQEDQKKPVDLVSDLRIKNVTKLKPPIAYLLSAISDTGIVGLLSLLTAVLLQANN 1440
LKP VSQ DQK+ VD+ SD IKNVTK+KPPIAYLLSAISDTGIVGLLSLLTAVLLQANN
Sbjct: 1381 LKPSVSQGDQKQQVDVASDEGIKNVTKMKPPIAYLLSAISDTGIVGLLSLLTAVLLQANN 1440
Query: 1441 RLSSEQAAYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH 1500
RLSSEQA+YILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH
Sbjct: 1441 RLSSEQASYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH 1500
Query: 1501 CSSKWTAPSDPIGLLLLESLSILGHFALFHPENQEVLRWGKSPTILHKVCDLPFVFFSDP 1560
CSSKWTAPSDPIGLLLLESLSILGHFALFHP NQEVLRWGKSPTILHKVCDLPFVFFSDP
Sbjct: 1501 CSSKWTAPSDPIGLLLLESLSILGHFALFHPGNQEVLRWGKSPTILHKVCDLPFVFFSDP 1560
Query: 1561 ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNNPLVPALQSTATQENEES 1620
ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNN LVPAL ST+TQENEES
Sbjct: 1561 ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNNLLVPALPSTSTQENEES 1620
Query: 1621 NESNPNGSESRKPQMDGTIRATRNVSRITRTSLGRPGGGSSGNSSRSNKTRNQRDNRSAK 1680
NESNPNG ESRKPQ DG IRATRNVSRITRTSLGRPGG SSGNS+RSNKTRNQRDNRSAK
Sbjct: 1621 NESNPNGFESRKPQTDGIIRATRNVSRITRTSLGRPGGVSSGNSNRSNKTRNQRDNRSAK 1680
Query: 1681 ASDELALKHIQPAIEVASVMLHYRFPSSFLDRAEQFFSAGISTAIDG 1727
ASDE+ LKH QPAIEVASVMLHYRFPSSFLDRAEQFFSA ISTA+DG
Sbjct: 1681 ASDEITLKHNQPAIEVASVMLHYRFPSSFLDRAEQFFSADISTAVDG 1701
BLAST of CmUC06G124080 vs. NCBI nr
Match:
XP_011660268.1 (uncharacterized protein LOC101216553 isoform X3 [Cucumis sativus] >KGN63704.1 hypothetical protein Csa_013570 [Cucumis sativus])
HSP 1 Score: 2820.4 bits (7310), Expect = 0.0e+00
Identity = 1530/1727 (88.59%), Postives = 1584/1727 (91.72%), Query Frame = 0
Query: 1 MENSSGVDDQGSGWFEVKKPIPLNKCSVSPPLGTGFMLIYFTFLKHRSSSKFSLQGWVGG 60
MENSSGVDDQGSGWFEVKK KHRSSSKFSLQ WVGG
Sbjct: 1 MENSSGVDDQGSGWFEVKK-------------------------KHRSSSKFSLQSWVGG 60
Query: 61 FSGKNSSNSSRSPHLVNKNAQNDRPISKSHPPASGGSYAVYTQSNTENRIATSIGEDGGS 120
FSGKNSS+S +LVNKN QNDR SKSHPP +GGSYAVYTQSNTEN +ATSIG+D GS
Sbjct: 61 FSGKNSSDSMCRTNLVNKNPQNDRSNSKSHPP-TGGSYAVYTQSNTENCVATSIGDDEGS 120
Query: 121 HCPDKCMVRQDTEFPKSSVLHVTDSHAASGDCEKVPHKDMPGVVQKIKWGDLEDDSLVLN 180
H PDKCMVRQDTEFPKSSVLH+ DS+A + +CEKV H+DMPGVVQKIKWGDLEDDSLVLN
Sbjct: 121 HSPDKCMVRQDTEFPKSSVLHIADSNAGNEECEKVSHRDMPGVVQKIKWGDLEDDSLVLN 180
Query: 181 NSENGVEIKFGNIGEVDLGVSEKNEVKDGLASHVSTSVDTQANKLVAVSVREEEASHQAL 240
NS NGVEIKFGNIGEVDLGVSEKNEVK LAS VS+SVDTQ + LVA SVR+EEAS Q L
Sbjct: 181 NSANGVEIKFGNIGEVDLGVSEKNEVKHDLASLVSSSVDTQVSTLVAASVRDEEASDQTL 240
Query: 241 LSTDEDKVWRVRHQDINREFIEALEVLNNSEATVCPVIDGSNFKDTGTGHTKPVDDHSSN 300
LST+E +V HQDINREFIE L+V++NSEATVC VIDGSNFKDT +TKPVD+HSSN
Sbjct: 241 LSTNEINSCQVSHQDINREFIEDLKVISNSEATVCSVIDGSNFKDTRNENTKPVDNHSSN 300
Query: 301 FSSPSCVEAGTEPKVQKAMKLPEVENPELHEAPGKTEFSSLSLPVQNAELVSTETNGPEN 360
F C EAGTE KVQKA+K+ EVENP LHEA G+ E SSLSLPVQNAE VST+T+G EN
Sbjct: 301 FDFLFCEEAGTEAKVQKAIKVHEVENPVLHEAAGEPEISSLSLPVQNAESVSTKTSGHEN 360
Query: 361 SGGTSDSVEEAQIEQGSGTHNVQVVSVPSEGETGESKERFRQRLWCFLFENLNRAIDELY 420
SGG SDSVEE QIEQGSGTHNVQVVS PSEG TGESKERFRQRLWCFLFENLNRA+DELY
Sbjct: 361 SGGCSDSVEETQIEQGSGTHNVQVVSAPSEGATGESKERFRQRLWCFLFENLNRAVDELY 420
Query: 421 LLCELECDLEQMKEAILVLEEAASDFKELNARVEEFEEVKRLSSQSVDGMSITMKSDHCR 480
LLCELECDLEQMKEAILVLEEAASDFKELN RVEEFEEVK+LSSQS+DGM ITMKSDHCR
Sbjct: 421 LLCELECDLEQMKEAILVLEEAASDFKELNTRVEEFEEVKKLSSQSIDGMPITMKSDHCR 480
Query: 481 PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNESLSPQCMDQMK 540
PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNE LSPQCMDQMK
Sbjct: 481 PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNECLSPQCMDQMK 540
Query: 541 KTCTINCVVHDAEDSASKIAEKKGVDLTPGSLSGKEKNTESLGTEKVNVVQNIRSRPQNS 600
KT TINCVVHDA DSASK A KGVDLTPGSLSGKEKNTESLG++KVNV QNIRSRPQNS
Sbjct: 541 KTSTINCVVHDAPDSASKAAGNKGVDLTPGSLSGKEKNTESLGSDKVNVAQNIRSRPQNS 600
Query: 601 LSSINSSKPPLAVKFKREQLESDVERLVSKRERALAEGTCEKNLKAIEHSKRQATVPEKD 660
SSINSSKPPLAVKFKREQLESDVERLVS+RERALAEGTCEK K +EHSKRQATV EKD
Sbjct: 601 -SSINSSKPPLAVKFKREQLESDVERLVSRRERALAEGTCEKTQKPVEHSKRQATVSEKD 660
Query: 661 KEKEKRNIGVRKSMDAWKEKRNWEDILSSSVWISSRVSHLPGMSRKSAERVRVLHDKLMS 720
KEKEKRN+G RKSMDAWKEKRNWEDILSSS+ ISSRVSHLPGMS+KSAERVRVLHDKLMS
Sbjct: 661 KEKEKRNMGARKSMDAWKEKRNWEDILSSSIRISSRVSHLPGMSKKSAERVRVLHDKLMS 720
Query: 721 PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE 780
PDKKKKTSLDVKREAEEKHARAMKIR ELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE
Sbjct: 721 PDKKKKTSLDVKREAEEKHARAMKIRCELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE 780
Query: 781 GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR 840
GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR
Sbjct: 781 GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR 840
Query: 841 AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA 900
AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA
Sbjct: 841 AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA 900
Query: 901 REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS 960
REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS
Sbjct: 901 REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS 960
Query: 961 SPLLRRYMHKDGPNRSTPNNNGDEQGPCSSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA 1020
SPLLRRYMHKDGP+RS NNN DEQGP SSDLGSGLAMGKTT+QQHMKRRIKRIRQRLMA
Sbjct: 961 SPLLRRYMHKDGPSRSATNNNVDEQGPSSSDLGSGLAMGKTTLQQHMKRRIKRIRQRLMA 1020
Query: 1021 LKYEFFEPVNGAENVGIGYRTSIGTARAKIGRWLQELQKLRQERKEGAASLGLIIAEMIK 1080
LKYEF E NGAENV IGYRTSIGTARAKIGRWLQELQKLRQ RKEGAASLGLIIAEMIK
Sbjct: 1021 LKYEFVETTNGAENVSIGYRTSIGTARAKIGRWLQELQKLRQARKEGAASLGLIIAEMIK 1080
Query: 1081 YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL 1140
YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL
Sbjct: 1081 YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL 1140
Query: 1141 AQNLLPPIIPMLSTALENYIKIAASINAPGNGVPSSRTSIENFESSSEVLDGSLWTITTI 1200
AQNLLPPIIPMLSTALENYIKIAASINAPGNGV SSRTSIENFESSSEVLDGSLWTITTI
Sbjct: 1141 AQNLLPPIIPMLSTALENYIKIAASINAPGNGVQSSRTSIENFESSSEVLDGSLWTITTI 1200
Query: 1201 IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV 1260
IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV
Sbjct: 1201 IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV 1260
Query: 1261 LTSRPGTDSTINCVLPYSEKLAGDGSGITISAESRDLFGTGFTEDGSPSESGLNGAKIVQ 1320
LTSRPGTDSTINCVLP+SE L G SGI IS SRD TGFTEDG P ESGLNG KI+Q
Sbjct: 1261 LTSRPGTDSTINCVLPFSENLTGVESGIAISTMSRDFPRTGFTEDGIPLESGLNGGKILQ 1320
Query: 1321 KPKMTVDQLDDESCEQKINDGMIPSDGGKRE-QTDCSIKANGVNLIQSENQDEPPDSEVV 1380
PKMTVDQL DESCEQKI GMIP DGG+RE QTDC I+ANGVNLIQ + QDEP D E+V
Sbjct: 1321 NPKMTVDQL-DESCEQKIKTGMIPIDGGQREPQTDCFIEANGVNLIQIDIQDEPQDGEIV 1380
Query: 1381 LKPFVSQEDQKKPVDLVSDLRIKNVTKLKPPIAYLLSAISDTGIVGLLSLLTAVLLQANN 1440
LKP+VSQ DQK+ VD+VSD IKNVTK+KPPIAYLLSAISDTGIVGLLSLLTAVLLQANN
Sbjct: 1381 LKPYVSQGDQKQHVDVVSDEGIKNVTKMKPPIAYLLSAISDTGIVGLLSLLTAVLLQANN 1440
Query: 1441 RLSSEQAAYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH 1500
RLSSEQA+YILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH
Sbjct: 1441 RLSSEQASYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH 1500
Query: 1501 CSSKWTAPSDPIGLLLLESLSILGHFALFHPENQEVLRWGKSPTILHKVCDLPFVFFSDP 1560
CSSKW APSDPIGLLLLESLSILGHFALFHP NQEVLRWGKSPTILHKVCDLPFVFFSDP
Sbjct: 1501 CSSKWAAPSDPIGLLLLESLSILGHFALFHPGNQEVLRWGKSPTILHKVCDLPFVFFSDP 1560
Query: 1561 ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNNPLVPALQSTATQENEES 1620
ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNN LVPAL ST+TQENEES
Sbjct: 1561 ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNNLLVPALPSTSTQENEES 1620
Query: 1621 NESNPNGSESRKPQMDGTIRATRNVSRITRTSLGRPGGGSSGNSSRSNKTRNQRDNRSAK 1680
NESNPNG ESRKPQ DG IRATRNVSRITRTSLGRPGG SSGNS+R+NKTRNQRDNRSAK
Sbjct: 1621 NESNPNGFESRKPQTDGIIRATRNVSRITRTSLGRPGGVSSGNSNRNNKTRNQRDNRSAK 1680
Query: 1681 ASDELALKHIQPAIEVASVMLHYRFPSSFLDRAEQFFSAGISTAIDG 1727
ASDE+ LKH QPA+EVASVMLHYRFPSSFLDRAEQFFSA ISTA+DG
Sbjct: 1681 ASDEITLKHNQPAMEVASVMLHYRFPSSFLDRAEQFFSADISTAVDG 1699
BLAST of CmUC06G124080 vs. NCBI nr
Match:
XP_031744557.1 (uncharacterized protein LOC101216553 isoform X1 [Cucumis sativus] >XP_031744565.1 uncharacterized protein LOC101216553 isoform X1 [Cucumis sativus])
HSP 1 Score: 2808.9 bits (7280), Expect = 0.0e+00
Identity = 1530/1746 (87.63%), Postives = 1584/1746 (90.72%), Query Frame = 0
Query: 1 MENSSGVDDQGSGWFEVKKPIPLNKCSVSPPLGTGFMLIYFTFLKHRSSSKFSLQGWVGG 60
MENSSGVDDQGSGWFEVKK KHRSSSKFSLQ WVGG
Sbjct: 1 MENSSGVDDQGSGWFEVKK-------------------------KHRSSSKFSLQSWVGG 60
Query: 61 FSGKNSSNSSRSPHLVNKNAQNDRPISKSHPPASGGSYAVYTQSNTENRIATSIGEDGGS 120
FSGKNSS+S +LVNKN QNDR SKSHPP +GGSYAVYTQSNTEN +ATSIG+D GS
Sbjct: 61 FSGKNSSDSMCRTNLVNKNPQNDRSNSKSHPP-TGGSYAVYTQSNTENCVATSIGDDEGS 120
Query: 121 HCPDKCMVRQDTEFPKSSVLHVTDSHAASGDCEKVPHKDMPGVVQKIKWGDLEDDSLVLN 180
H PDKCMVRQDTEFPKSSVLH+ DS+A + +CEKV H+DMPGVVQKIKWGDLEDDSLVLN
Sbjct: 121 HSPDKCMVRQDTEFPKSSVLHIADSNAGNEECEKVSHRDMPGVVQKIKWGDLEDDSLVLN 180
Query: 181 NSENGVEIKFGNIGEVDLGVSEKNEVKDGLASHVSTSVDTQANKLVAVSVREEEASHQAL 240
NS NGVEIKFGNIGEVDLGVSEKNEVK LAS VS+SVDTQ + LVA SVR+EEAS Q L
Sbjct: 181 NSANGVEIKFGNIGEVDLGVSEKNEVKHDLASLVSSSVDTQVSTLVAASVRDEEASDQTL 240
Query: 241 LSTDEDKVWRVRHQDINREFIEALEVLNNSEATVCPVIDGSNFKDTGTGHTKPVDDHSSN 300
LST+E +V HQDINREFIE L+V++NSEATVC VIDGSNFKDT +TKPVD+HSSN
Sbjct: 241 LSTNEINSCQVSHQDINREFIEDLKVISNSEATVCSVIDGSNFKDTRNENTKPVDNHSSN 300
Query: 301 FSSPSCVEAGTEPKVQKAMKLPEVENPELHEAPGKTEFSSLSLPVQNAELVSTETNGPEN 360
F C EAGTE KVQKA+K+ EVENP LHEA G+ E SSLSLPVQNAE VST+T+G EN
Sbjct: 301 FDFLFCEEAGTEAKVQKAIKVHEVENPVLHEAAGEPEISSLSLPVQNAESVSTKTSGHEN 360
Query: 361 SGGTSDSVEEAQIEQGSGTHNVQVVSVPSEGETGESKERFRQRLWCFLFENLNRAIDELY 420
SGG SDSVEE QIEQGSGTHNVQVVS PSEG TGESKERFRQRLWCFLFENLNRA+DELY
Sbjct: 361 SGGCSDSVEETQIEQGSGTHNVQVVSAPSEGATGESKERFRQRLWCFLFENLNRAVDELY 420
Query: 421 LLCELECDLEQMKEAILVLEEAASDFKELNARVEEFEEVKRLSSQSVDGMSITMKSDHCR 480
LLCELECDLEQMKEAILVLEEAASDFKELN RVEEFEEVK+LSSQS+DGM ITMKSDHCR
Sbjct: 421 LLCELECDLEQMKEAILVLEEAASDFKELNTRVEEFEEVKKLSSQSIDGMPITMKSDHCR 480
Query: 481 PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNESLSPQCMDQMK 540
PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNE LSPQCMDQMK
Sbjct: 481 PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNECLSPQCMDQMK 540
Query: 541 KTCTINCVVHDAEDSASKIAEKKGVDLTPGSLSGKEKNTESLGTEKVNVVQNIRSRPQNS 600
KT TINCVVHDA DSASK A KGVDLTPGSLSGKEKNTESLG++KVNV QNIRSRPQNS
Sbjct: 541 KTSTINCVVHDAPDSASKAAGNKGVDLTPGSLSGKEKNTESLGSDKVNVAQNIRSRPQNS 600
Query: 601 LSSINSSKPPLAVKFKREQLESDVERLVSKRERALAEGTCEKNLKAIEHSKRQATVPEKD 660
SSINSSKPPLAVKFKREQLESDVERLVS+RERALAEGTCEK K +EHSKRQATV EKD
Sbjct: 601 -SSINSSKPPLAVKFKREQLESDVERLVSRRERALAEGTCEKTQKPVEHSKRQATVSEKD 660
Query: 661 KEKEKRNIGVRKSMDAWKEKRNWEDILSSSVWISSRVSHLPGMSRKSAERVRVLHDKLMS 720
KEKEKRN+G RKSMDAWKEKRNWEDILSSS+ ISSRVSHLPGMS+KSAERVRVLHDKLMS
Sbjct: 661 KEKEKRNMGARKSMDAWKEKRNWEDILSSSIRISSRVSHLPGMSKKSAERVRVLHDKLMS 720
Query: 721 PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE 780
PDKKKKTSLDVKREAEEKHARAMKIR ELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE
Sbjct: 721 PDKKKKTSLDVKREAEEKHARAMKIRCELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE 780
Query: 781 GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR 840
GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR
Sbjct: 781 GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR 840
Query: 841 AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA 900
AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA
Sbjct: 841 AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA 900
Query: 901 REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS 960
REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS
Sbjct: 901 REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS 960
Query: 961 SPLLRRYMHKDGPNRSTPNNNGDEQGPCSSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA 1020
SPLLRRYMHKDGP+RS NNN DEQGP SSDLGSGLAMGKTT+QQHMKRRIKRIRQRLMA
Sbjct: 961 SPLLRRYMHKDGPSRSATNNNVDEQGPSSSDLGSGLAMGKTTLQQHMKRRIKRIRQRLMA 1020
Query: 1021 LKYEFFEPVNGAENVGIGYRTSIGTARAKIGRWLQELQKLRQERKEGAASLGLIIAEMIK 1080
LKYEF E NGAENV IGYRTSIGTARAKIGRWLQELQKLRQ RKEGAASLGLIIAEMIK
Sbjct: 1021 LKYEFVETTNGAENVSIGYRTSIGTARAKIGRWLQELQKLRQARKEGAASLGLIIAEMIK 1080
Query: 1081 YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL 1140
YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL
Sbjct: 1081 YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL 1140
Query: 1141 AQNLLPPIIPMLSTALENYIKIAASINAPGNGVPSSRTSIENFESSSEVLDGSLWTITTI 1200
AQNLLPPIIPMLSTALENYIKIAASINAPGNGV SSRTSIENFESSSEVLDGSLWTITTI
Sbjct: 1141 AQNLLPPIIPMLSTALENYIKIAASINAPGNGVQSSRTSIENFESSSEVLDGSLWTITTI 1200
Query: 1201 IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV 1260
IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV
Sbjct: 1201 IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV 1260
Query: 1261 LTSRPGTDSTINCVLPYSEKLAGDGSGITISAESRDLFGTGFTEDGSPSESGLNGAKIVQ 1320
LTSRPGTDSTINCVLP+SE L G SGI IS SRD TGFTEDG P ESGLNG KI+Q
Sbjct: 1261 LTSRPGTDSTINCVLPFSENLTGVESGIAISTMSRDFPRTGFTEDGIPLESGLNGGKILQ 1320
Query: 1321 KPKMTVDQLDDESCEQKINDGMIPSDGGKRE-QTDCSIKANGVNLIQSENQDEPPDSEVV 1380
PKMTVDQL DESCEQKI GMIP DGG+RE QTDC I+ANGVNLIQ + QDEP D E+V
Sbjct: 1321 NPKMTVDQL-DESCEQKIKTGMIPIDGGQREPQTDCFIEANGVNLIQIDIQDEPQDGEIV 1380
Query: 1381 LKPFVSQEDQKKPVDLVSDLRIKNVTKLKPPIAYLLSAISDTGIVGLLSLLTAVLLQANN 1440
LKP+VSQ DQK+ VD+VSD IKNVTK+KPPIAYLLSAISDTGIVGLLSLLTAVLLQANN
Sbjct: 1381 LKPYVSQGDQKQHVDVVSDEGIKNVTKMKPPIAYLLSAISDTGIVGLLSLLTAVLLQANN 1440
Query: 1441 RLSSEQ-------------------AAYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVL 1500
RLSSEQ A+YILPSNFEDVATGVLKVLNNLAFLDLKFMQRVL
Sbjct: 1441 RLSSEQPCRLKAMVSSCSCCLILFLASYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVL 1500
Query: 1501 ARPDLKMEFFHLMSFLLSHCSSKWTAPSDPIGLLLLESLSILGHFALFHPENQEVLRWGK 1560
ARPDLKMEFFHLMSFLLSHCSSKW APSDPIGLLLLESLSILGHFALFHP NQEVLRWGK
Sbjct: 1501 ARPDLKMEFFHLMSFLLSHCSSKWAAPSDPIGLLLLESLSILGHFALFHPGNQEVLRWGK 1560
Query: 1561 SPTILHKVCDLPFVFFSDPELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCK 1620
SPTILHKVCDLPFVFFSDPELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCK
Sbjct: 1561 SPTILHKVCDLPFVFFSDPELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCK 1620
Query: 1621 NNPLVPALQSTATQENEESNESNPNGSESRKPQMDGTIRATRNVSRITRTSLGRPGGGSS 1680
NN LVPAL ST+TQENEESNESNPNG ESRKPQ DG IRATRNVSRITRTSLGRPGG SS
Sbjct: 1621 NNLLVPALPSTSTQENEESNESNPNGFESRKPQTDGIIRATRNVSRITRTSLGRPGGVSS 1680
Query: 1681 GNSSRSNKTRNQRDNRSAKASDELALKHIQPAIEVASVMLHYRFPSSFLDRAEQFFSAGI 1727
GNS+R+NKTRNQRDNRSAKASDE+ LKH QPA+EVASVMLHYRFPSSFLDRAEQFFSA I
Sbjct: 1681 GNSNRNNKTRNQRDNRSAKASDEITLKHNQPAMEVASVMLHYRFPSSFLDRAEQFFSADI 1718
BLAST of CmUC06G124080 vs. NCBI nr
Match:
XP_008453242.1 (PREDICTED: uncharacterized protein LOC103494020 isoform X2 [Cucumis melo])
HSP 1 Score: 2798.8 bits (7254), Expect = 0.0e+00
Identity = 1515/1727 (87.72%), Postives = 1567/1727 (90.74%), Query Frame = 0
Query: 1 MENSSGVDDQGSGWFEVKKPIPLNKCSVSPPLGTGFMLIYFTFLKHRSSSKFSLQGWVGG 60
MENS GVDDQGSGWFEVKK KHRSSSKFS+Q WVGG
Sbjct: 1 MENSGGVDDQGSGWFEVKK-------------------------KHRSSSKFSVQSWVGG 60
Query: 61 FSGKNSSNSSRSPHLVNKNAQNDRPISKSHPPASGGSYAVYTQSNTENRIATSIGEDGGS 120
FSGKN SNS RS +LVNKN +NDR SKSHPP +GGSYAVYTQSN EN +AT+IG+DGGS
Sbjct: 61 FSGKNLSNSLRSTNLVNKNLRNDRSNSKSHPPTTGGSYAVYTQSNAENCVATAIGDDGGS 120
Query: 121 HCPDKCMVRQDTEFPKSSVLHVTDSHAASGDCEKVPHKDMPGVVQKIKWGDLEDDSLVLN 180
+ PDKCMVRQDTEFPKSSVLH+ DS+A SG+CEKV H+DMPGVVQKIKWGDLEDDSLVLN
Sbjct: 121 YSPDKCMVRQDTEFPKSSVLHIADSNAGSGECEKVSHRDMPGVVQKIKWGDLEDDSLVLN 180
Query: 181 NSENGVEIKFGNIGEVDLGVSEKNEVKDGLASHVSTSVDTQANKLVAVSVREEEASHQAL 240
NS NGVEIKFGNIGEVDLGVSEKNEVK LAS VS+SVDTQ N LVA SVR+EEASHQ L
Sbjct: 181 NSANGVEIKFGNIGEVDLGVSEKNEVKHDLASTVSSSVDTQVNALVAASVRDEEASHQTL 240
Query: 241 LSTDEDKVWRVRHQDINREFIEALEVLNNSEATVCPVIDGSNFKDTGTGHTKPVDDHSSN 300
LST+E +V HQDINREFIE L+VL+N+E TVC VIDGSNFKDT +TKPVD+H SN
Sbjct: 241 LSTNEINSCQVSHQDINREFIEDLKVLSNNEVTVCSVIDGSNFKDTRNENTKPVDNHISN 300
Query: 301 FSSPSCVEAGTEPKVQKAMKLPEVENPELHEAPGKTEFSSLSLPVQNAELVSTETNGPEN 360
F SPSC EAGTEPKVQK +KL EVENP L PE
Sbjct: 301 FDSPSCEEAGTEPKVQKVIKLHEVENPVL----------------------------PET 360
Query: 361 SGGTSDSVEEAQIEQGSGTHNVQVVSVPSEGETGESKERFRQRLWCFLFENLNRAIDELY 420
SGG+SDSVEE QIEQGSGTHNVQVVS PSEG TGESKERFRQRLWCFLFENLNRA+DELY
Sbjct: 361 SGGSSDSVEEPQIEQGSGTHNVQVVSAPSEGATGESKERFRQRLWCFLFENLNRAVDELY 420
Query: 421 LLCELECDLEQMKEAILVLEEAASDFKELNARVEEFEEVKRLSSQSVDGMSITMKSDHCR 480
LLCELECDLEQMKEAILVLEEAASDFKELN RVEEFEEVKRLSSQS+DGM ITMKSDHCR
Sbjct: 421 LLCELECDLEQMKEAILVLEEAASDFKELNTRVEEFEEVKRLSSQSIDGMPITMKSDHCR 480
Query: 481 PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNESLSPQCMDQMK 540
PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNE LSPQCMDQMK
Sbjct: 481 PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNECLSPQCMDQMK 540
Query: 541 KTCTINCVVHDAEDSASKIAEKKGVDLTPGSLSGKEKNTESLGTEKVNVVQNIRSRPQNS 600
+ TINCVVHDA DSASK A K VDLTPGSLSGKEKNTESLG++KVNV QNIRSRPQNS
Sbjct: 541 EISTINCVVHDAPDSASKTAGNKDVDLTPGSLSGKEKNTESLGSDKVNVAQNIRSRPQNS 600
Query: 601 LSSINSSKPPLAVKFKREQLESDVERLVSKRERALAEGTCEKNLKAIEHSKRQATVPEKD 660
LSSINSSKPPLAVKFKREQLESDVERLVS+RERALAEGTCEK K IEHSKRQATV EKD
Sbjct: 601 LSSINSSKPPLAVKFKREQLESDVERLVSRRERALAEGTCEKTQKPIEHSKRQATVSEKD 660
Query: 661 KEKEKRNIGVRKSMDAWKEKRNWEDILSSSVWISSRVSHLPGMSRKSAERVRVLHDKLMS 720
KEKEKRN+G RKSMDAWKEKRNWEDILSSSV ISSRVSHLPGMS+KSAERVRVLHDKLMS
Sbjct: 661 KEKEKRNMGARKSMDAWKEKRNWEDILSSSVRISSRVSHLPGMSKKSAERVRVLHDKLMS 720
Query: 721 PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE 780
PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE
Sbjct: 721 PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE 780
Query: 781 GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR 840
GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR
Sbjct: 781 GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR 840
Query: 841 AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA 900
AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA
Sbjct: 841 AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA 900
Query: 901 REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS 960
REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS
Sbjct: 901 REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS 960
Query: 961 SPLLRRYMHKDGPNRSTPNNNGDEQGPCSSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA 1020
SPLLRRYMHKDGP+RS NNNGDEQGP SSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA
Sbjct: 961 SPLLRRYMHKDGPSRSATNNNGDEQGPSSSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA 1020
Query: 1021 LKYEFFEPVNGAENVGIGYRTSIGTARAKIGRWLQELQKLRQERKEGAASLGLIIAEMIK 1080
LKYEF EP+NGAENV IGYRTSIGTARAKIGRWLQELQKLRQ RKEGAASLGLIIAEMIK
Sbjct: 1021 LKYEFVEPINGAENVSIGYRTSIGTARAKIGRWLQELQKLRQARKEGAASLGLIIAEMIK 1080
Query: 1081 YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL 1140
YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL
Sbjct: 1081 YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL 1140
Query: 1141 AQNLLPPIIPMLSTALENYIKIAASINAPGNGVPSSRTSIENFESSSEVLDGSLWTITTI 1200
AQNLLPPIIPMLSTALENYIKIAAS+NAPG+GV SSRTSIENFESSSEVLDGSLWTITTI
Sbjct: 1141 AQNLLPPIIPMLSTALENYIKIAASVNAPGSGVQSSRTSIENFESSSEVLDGSLWTITTI 1200
Query: 1201 IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV 1260
IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV
Sbjct: 1201 IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV 1260
Query: 1261 LTSRPGTDSTINCVLPYSEKLAGDGSGITISAESRDLFGTGFTEDGSPSESGLNGAKIVQ 1320
LTSRPG DSTINCVLP+SE L G SGI IS SRD TGFTEDG P ESGLNG KI+Q
Sbjct: 1261 LTSRPGIDSTINCVLPFSENLTGVESGIAISTMSRDFPRTGFTEDGIPLESGLNGGKILQ 1320
Query: 1321 KPKMTVDQLDDESCEQKINDGMIPSDGGKRE-QTDCSIKANGVNLIQSENQDEPPDSEVV 1380
PKMTVDQL DESCEQKI +GMIPSDGG+RE QTDC I+ANGVN++Q + QDEP D E+V
Sbjct: 1321 NPKMTVDQL-DESCEQKIKNGMIPSDGGQREQQTDCFIEANGVNIVQIDIQDEPQDGEIV 1380
Query: 1381 LKPFVSQEDQKKPVDLVSDLRIKNVTKLKPPIAYLLSAISDTGIVGLLSLLTAVLLQANN 1440
LKP VSQ DQK+ VD+ SD IKNVTK+KPPIAYLLSAISDTGIVGLLSLLTAVLLQANN
Sbjct: 1381 LKPSVSQGDQKQQVDVASDEGIKNVTKMKPPIAYLLSAISDTGIVGLLSLLTAVLLQANN 1440
Query: 1441 RLSSEQAAYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH 1500
RLSSEQA+YILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH
Sbjct: 1441 RLSSEQASYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH 1500
Query: 1501 CSSKWTAPSDPIGLLLLESLSILGHFALFHPENQEVLRWGKSPTILHKVCDLPFVFFSDP 1560
CSSKWTAPSDPIGLLLLESLSILGHFALFHP NQEVLRWGKSPTILHKVCDLPFVFFSDP
Sbjct: 1501 CSSKWTAPSDPIGLLLLESLSILGHFALFHPGNQEVLRWGKSPTILHKVCDLPFVFFSDP 1560
Query: 1561 ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNNPLVPALQSTATQENEES 1620
ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNN LVPAL ST+TQENEES
Sbjct: 1561 ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNNLLVPALPSTSTQENEES 1620
Query: 1621 NESNPNGSESRKPQMDGTIRATRNVSRITRTSLGRPGGGSSGNSSRSNKTRNQRDNRSAK 1680
NESNPNG ESRKPQ DG IRATRNVSRITRTSLGRPGG SSGNS+RSNKTRNQRDNRSAK
Sbjct: 1621 NESNPNGFESRKPQTDGIIRATRNVSRITRTSLGRPGGVSSGNSNRSNKTRNQRDNRSAK 1673
Query: 1681 ASDELALKHIQPAIEVASVMLHYRFPSSFLDRAEQFFSAGISTAIDG 1727
ASDE+ LKH QPAIEVASVMLHYRFPSSFLDRAEQFFSA ISTA+DG
Sbjct: 1681 ASDEITLKHNQPAIEVASVMLHYRFPSSFLDRAEQFFSADISTAVDG 1673
BLAST of CmUC06G124080 vs. ExPASy Swiss-Prot
Match:
Q9BY12 (S phase cyclin A-associated protein in the endoplasmic reticulum OS=Homo sapiens OX=9606 GN=SCAPER PE=1 SV=2)
HSP 1 Score: 138.7 bits (348), Expect = 6.4e-31
Identity = 312/1404 (22.22%), Postives = 550/1404 (39.17%), Query Frame = 0
Query: 401 RQRLWCFLFENLNRAIDELYLLCELECDLEQMKEAILVLEEAASDFK------ELNARVE 460
R R W FLF+NL RA+DE+Y+ CE + + + KE +++L+ DFK +L ++E
Sbjct: 99 RARYWAFLFDNLRRAVDEIYVTCESDQSVVECKEVLMMLDNYVRDFKALIDWIQLQEKLE 158
Query: 461 EFE----------EVKRLS-------SQSVDGMSI---------------TMKSDHCRPH 520
+ + EVK++S S S D +++ T+ + P
Sbjct: 159 KTDAQSRPTSLAWEVKKMSPGRHVIPSPSTDRINVTSNARRSLNFGGSTGTVPAPRLAPT 218
Query: 521 ALSWEVRRMTNSPHKAEILSSSL---EAFKKIRQERANMLEASKKLPGNESLSPQCMDQM 580
+SW + + H SS + ++ + ++A+ K G E++ +
Sbjct: 219 GVSWADK--VKAHHTGSTASSEITPAQSCPPMTVQKASRKNERKDAEGWETVQRGRPIRS 278
Query: 581 KKTCTINCVVHDAEDSASK-IAEKKGVDLTPGSL--------SGKEKNTESLGTEKVNVV 640
+ T + V E + SK ++K+ V L P G ES + ++
Sbjct: 279 RSTAVMPKVSLATEATRSKDDSDKENVCLLPDESIQKGQFVGDGTSNTIESHPKDSLHSC 338
Query: 641 QN-IRSRPQNSLSSINSSK---------------------------------PPLAV--- 700
+ + + Q ++S+++ K PPL V
Sbjct: 339 DHPLAEKTQFTVSTLDDVKNSGSIRDNYVRTSEISAVHIDTECVSVMLQAGTPPLQVNEE 398
Query: 701 KFKREQLESDVERLVSKRERALAEGTCEKN------LKAIEHSKRQATVPEKD-----KE 760
KF E+ + E S ++AE +K KA E + A E+ +
Sbjct: 399 KFPAEKARIENEMDPSDISNSMAEVLAKKEELADRLEKANEEAIASAIAEEEQLTREIEA 458
Query: 761 KEKRNIGVR--------------------------------KSMDAWKEKRNWEDILSSS 820
+E +I + ++ ++W++ +W DI
Sbjct: 459 EENNDINIETDNDSDFSASMGSGSVSFCGMSMDWNDVLADYEARESWRQNTSWGDI---- 518
Query: 821 VWISSRVSHLPGMSRKSAERVRVLHDKLMSPDKKKKTSLDVKREAEEKHARAMKIRSELE 880
+ + PG +H+KL SP +K+T + K++ EEK +A ++R +L
Sbjct: 519 --VEEEPARPPGHGIH-------MHEKLSSP-SRKRTIAESKKKHEEKQMKAQQLREKLR 578
Query: 881 NERVQKLQRTSEKLNRVNEWQAVRTMKLREGMYARHQRSESRHEAFLAQVVKRAGDESSK 940
E+ KLQ+ E+ V +W+ + R M + +E + E L +VK+A +E +K
Sbjct: 579 EEKTLKLQKLLEREKDVRKWKEELLDQRRRMMEEKLLHAEFKREVQLQAIVKKAQEEEAK 638
Query: 941 VNEVRFITSLNEENKKIMLRQKLHGSELRRAEKLQVMKIKQKEDMAREEAVLERKKLIEA 1000
VNE+ FI +L +NK+ + KL E R E + + +Q+E AR+EAV ERK+ +EA
Sbjct: 639 VNEIAFINTLEAQNKRHDVLSKLKEYEQRLNELQEERQRRQEEKQARDEAVQERKRALEA 698
Query: 1001 EKLQRLAETQRKKEEAHVRREEERKASSAAREARAMEQLRRKEER----ARAQQEEAELM 1060
E+ R+ E K++E R E++R+ ARE A E+ R +EER AQQE E +
Sbjct: 699 ERQARVEELLMKRKEQEARIEQQRQEKEKAREDAARERARDREERLAALTAAQQEAMEEL 758
Query: 1061 AQKLAERLSESEQRRKFYLEQIRER---------ASMDFRDQSSPLLRR----------- 1120
+K+ + ES +R +EQ +E+ A+ D+ + +P R+
Sbjct: 759 QKKIQLKHDESIRRHMEQIEQRKEKAAELSSGRHANTDYAPKLTPYERKKQCSLCNVLIS 818
Query: 1121 -----YMHKDG-PNRSTPNNNGDEQGPCSSD-------------------LGSGLAMGKT 1180
+ H G ++ N QG SD A+
Sbjct: 819 SEVYLFSHVKGRKHQQAVRENTSIQGRELSDEEVEHLSLKKYIIDIVVESTAPAEALKDG 878
Query: 1181 TMQQHMKRRIKRIRQRLMALKYEFFEPVNGAENVGIGYRTSIGTARAKIGRWLQELQKLR 1240
+Q K++ K+I+ R M + + +E + +N G S +AK+ R ++L K
Sbjct: 879 EERQKNKKKAKKIKAR-MNFRAKEYESLMETKNSG-----SDSPYKAKLQRLAKDLLKQV 938
Query: 1241 QERKEGA------ASLGLIIAEMIKYLDGRELELQASRQA-----GLLDFIASALPASHT 1300
Q + G+ ++L + E+ + L+ + Q + QA L + + +PA++
Sbjct: 939 QVQDSGSWANNKVSALDRTLGEITRILEKENVADQIAFQAAGGLTALEHILQAVVPATNV 998
Query: 1301 S-----KPEACQVMIHLLKLLRVVLSASANRSYFLAQNLLPPIIPMLSTALENYIKIAAS 1360
+ P++ I++ L + S N S L N + ++ +L L Y
Sbjct: 999 NTVLRIPPKSLCNAINVYNL--TCNNCSENCSDVLFSNKITFLMDLLIHQLTVY------ 1058
Query: 1361 INAPGNGVPSSRTSIENFESSSEVLDGSLWTITTIIGHINPEGPQLQMWDGLLELLVAYQ 1420
VP +I ++ +V +G +TT GLL++
Sbjct: 1059 -------VPDENNTILGRNTNKQVFEG----LTT----------------GLLKV----- 1118
Query: 1421 VIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVVLTSRP-GTDSTINCVLPYSEKLAGD 1480
S+++L ++ +RP G +P E
Sbjct: 1119 -----------------------SAVVLG----CLIANRPDGNCQPATPKIPTQEMKNKP 1178
Query: 1481 GSGITISAESRDLFGTGFTEDGSPSESGLNGAKIVQKPKMTVDQLDDESCEQKINDGMIP 1540
G + +DL +N G+I
Sbjct: 1179 SQGDPFNNRVQDLI------------------------------------SYVVNMGLID 1238
Query: 1541 SDGGKREQTDCSIKANGVNLIQSENQDEPPDSEVVLKPFVSQEDQKKPVDLVSDLRIKNV 1600
+ C + G DE P + L+ + R ++
Sbjct: 1239 ------KLCACFLSVQG-------PVDENPKMAIFLQHAAGLLHAMCTLCFAVTGRSYSI 1298
Query: 1601 --TKLKPPIAYLLSAISDTGIVGLLSLLTAVLLQA---NNRLSSEQAAYILPSNFEDVAT 1604
+ P L +A+ T + G+L +L VL + +S + Y N VA
Sbjct: 1299 FDNNRQDPTG-LTAALQATDLAGVLHMLYCVLFHGTILDPSTASPKENY--TQNTIQVAI 1355
BLAST of CmUC06G124080 vs. ExPASy TrEMBL
Match:
A0A1S3BWJ0 (uncharacterized protein LOC103494020 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103494020 PE=4 SV=1)
HSP 1 Score: 2845.5 bits (7375), Expect = 0.0e+00
Identity = 1535/1727 (88.88%), Postives = 1588/1727 (91.95%), Query Frame = 0
Query: 1 MENSSGVDDQGSGWFEVKKPIPLNKCSVSPPLGTGFMLIYFTFLKHRSSSKFSLQGWVGG 60
MENS GVDDQGSGWFEVKK KHRSSSKFS+Q WVGG
Sbjct: 1 MENSGGVDDQGSGWFEVKK-------------------------KHRSSSKFSVQSWVGG 60
Query: 61 FSGKNSSNSSRSPHLVNKNAQNDRPISKSHPPASGGSYAVYTQSNTENRIATSIGEDGGS 120
FSGKN SNS RS +LVNKN +NDR SKSHPP +GGSYAVYTQSN EN +AT+IG+DGGS
Sbjct: 61 FSGKNLSNSLRSTNLVNKNLRNDRSNSKSHPPTTGGSYAVYTQSNAENCVATAIGDDGGS 120
Query: 121 HCPDKCMVRQDTEFPKSSVLHVTDSHAASGDCEKVPHKDMPGVVQKIKWGDLEDDSLVLN 180
+ PDKCMVRQDTEFPKSSVLH+ DS+A SG+CEKV H+DMPGVVQKIKWGDLEDDSLVLN
Sbjct: 121 YSPDKCMVRQDTEFPKSSVLHIADSNAGSGECEKVSHRDMPGVVQKIKWGDLEDDSLVLN 180
Query: 181 NSENGVEIKFGNIGEVDLGVSEKNEVKDGLASHVSTSVDTQANKLVAVSVREEEASHQAL 240
NS NGVEIKFGNIGEVDLGVSEKNEVK LAS VS+SVDTQ N LVA SVR+EEASHQ L
Sbjct: 181 NSANGVEIKFGNIGEVDLGVSEKNEVKHDLASTVSSSVDTQVNALVAASVRDEEASHQTL 240
Query: 241 LSTDEDKVWRVRHQDINREFIEALEVLNNSEATVCPVIDGSNFKDTGTGHTKPVDDHSSN 300
LST+E +V HQDINREFIE L+VL+N+E TVC VIDGSNFKDT +TKPVD+H SN
Sbjct: 241 LSTNEINSCQVSHQDINREFIEDLKVLSNNEVTVCSVIDGSNFKDTRNENTKPVDNHISN 300
Query: 301 FSSPSCVEAGTEPKVQKAMKLPEVENPELHEAPGKTEFSSLSLPVQNAELVSTETNGPEN 360
F SPSC EAGTEPKVQK +KL EVENP L E GK E SSLSLPVQNAE VST+T G EN
Sbjct: 301 FDSPSCEEAGTEPKVQKVIKLHEVENPVLPETSGKPEISSLSLPVQNAESVSTKTCGHEN 360
Query: 361 SGGTSDSVEEAQIEQGSGTHNVQVVSVPSEGETGESKERFRQRLWCFLFENLNRAIDELY 420
SGG+SDSVEE QIEQGSGTHNVQVVS PSEG TGESKERFRQRLWCFLFENLNRA+DELY
Sbjct: 361 SGGSSDSVEEPQIEQGSGTHNVQVVSAPSEGATGESKERFRQRLWCFLFENLNRAVDELY 420
Query: 421 LLCELECDLEQMKEAILVLEEAASDFKELNARVEEFEEVKRLSSQSVDGMSITMKSDHCR 480
LLCELECDLEQMKEAILVLEEAASDFKELN RVEEFEEVKRLSSQS+DGM ITMKSDHCR
Sbjct: 421 LLCELECDLEQMKEAILVLEEAASDFKELNTRVEEFEEVKRLSSQSIDGMPITMKSDHCR 480
Query: 481 PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNESLSPQCMDQMK 540
PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNE LSPQCMDQMK
Sbjct: 481 PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNECLSPQCMDQMK 540
Query: 541 KTCTINCVVHDAEDSASKIAEKKGVDLTPGSLSGKEKNTESLGTEKVNVVQNIRSRPQNS 600
+ TINCVVHDA DSASK A K VDLTPGSLSGKEKNTESLG++KVNV QNIRSRPQNS
Sbjct: 541 EISTINCVVHDAPDSASKTAGNKDVDLTPGSLSGKEKNTESLGSDKVNVAQNIRSRPQNS 600
Query: 601 LSSINSSKPPLAVKFKREQLESDVERLVSKRERALAEGTCEKNLKAIEHSKRQATVPEKD 660
LSSINSSKPPLAVKFKREQLESDVERLVS+RERALAEGTCEK K IEHSKRQATV EKD
Sbjct: 601 LSSINSSKPPLAVKFKREQLESDVERLVSRRERALAEGTCEKTQKPIEHSKRQATVSEKD 660
Query: 661 KEKEKRNIGVRKSMDAWKEKRNWEDILSSSVWISSRVSHLPGMSRKSAERVRVLHDKLMS 720
KEKEKRN+G RKSMDAWKEKRNWEDILSSSV ISSRVSHLPGMS+KSAERVRVLHDKLMS
Sbjct: 661 KEKEKRNMGARKSMDAWKEKRNWEDILSSSVRISSRVSHLPGMSKKSAERVRVLHDKLMS 720
Query: 721 PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE 780
PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE
Sbjct: 721 PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE 780
Query: 781 GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR 840
GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR
Sbjct: 781 GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR 840
Query: 841 AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA 900
AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA
Sbjct: 841 AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA 900
Query: 901 REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS 960
REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS
Sbjct: 901 REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS 960
Query: 961 SPLLRRYMHKDGPNRSTPNNNGDEQGPCSSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA 1020
SPLLRRYMHKDGP+RS NNNGDEQGP SSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA
Sbjct: 961 SPLLRRYMHKDGPSRSATNNNGDEQGPSSSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA 1020
Query: 1021 LKYEFFEPVNGAENVGIGYRTSIGTARAKIGRWLQELQKLRQERKEGAASLGLIIAEMIK 1080
LKYEF EP+NGAENV IGYRTSIGTARAKIGRWLQELQKLRQ RKEGAASLGLIIAEMIK
Sbjct: 1021 LKYEFVEPINGAENVSIGYRTSIGTARAKIGRWLQELQKLRQARKEGAASLGLIIAEMIK 1080
Query: 1081 YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL 1140
YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL
Sbjct: 1081 YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL 1140
Query: 1141 AQNLLPPIIPMLSTALENYIKIAASINAPGNGVPSSRTSIENFESSSEVLDGSLWTITTI 1200
AQNLLPPIIPMLSTALENYIKIAAS+NAPG+GV SSRTSIENFESSSEVLDGSLWTITTI
Sbjct: 1141 AQNLLPPIIPMLSTALENYIKIAASVNAPGSGVQSSRTSIENFESSSEVLDGSLWTITTI 1200
Query: 1201 IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV 1260
IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV
Sbjct: 1201 IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV 1260
Query: 1261 LTSRPGTDSTINCVLPYSEKLAGDGSGITISAESRDLFGTGFTEDGSPSESGLNGAKIVQ 1320
LTSRPG DSTINCVLP+SE L G SGI IS SRD TGFTEDG P ESGLNG KI+Q
Sbjct: 1261 LTSRPGIDSTINCVLPFSENLTGVESGIAISTMSRDFPRTGFTEDGIPLESGLNGGKILQ 1320
Query: 1321 KPKMTVDQLDDESCEQKINDGMIPSDGGKRE-QTDCSIKANGVNLIQSENQDEPPDSEVV 1380
PKMTVDQL DESCEQKI +GMIPSDGG+RE QTDC I+ANGVN++Q + QDEP D E+V
Sbjct: 1321 NPKMTVDQL-DESCEQKIKNGMIPSDGGQREQQTDCFIEANGVNIVQIDIQDEPQDGEIV 1380
Query: 1381 LKPFVSQEDQKKPVDLVSDLRIKNVTKLKPPIAYLLSAISDTGIVGLLSLLTAVLLQANN 1440
LKP VSQ DQK+ VD+ SD IKNVTK+KPPIAYLLSAISDTGIVGLLSLLTAVLLQANN
Sbjct: 1381 LKPSVSQGDQKQQVDVASDEGIKNVTKMKPPIAYLLSAISDTGIVGLLSLLTAVLLQANN 1440
Query: 1441 RLSSEQAAYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH 1500
RLSSEQA+YILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH
Sbjct: 1441 RLSSEQASYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH 1500
Query: 1501 CSSKWTAPSDPIGLLLLESLSILGHFALFHPENQEVLRWGKSPTILHKVCDLPFVFFSDP 1560
CSSKWTAPSDPIGLLLLESLSILGHFALFHP NQEVLRWGKSPTILHKVCDLPFVFFSDP
Sbjct: 1501 CSSKWTAPSDPIGLLLLESLSILGHFALFHPGNQEVLRWGKSPTILHKVCDLPFVFFSDP 1560
Query: 1561 ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNNPLVPALQSTATQENEES 1620
ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNN LVPAL ST+TQENEES
Sbjct: 1561 ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNNLLVPALPSTSTQENEES 1620
Query: 1621 NESNPNGSESRKPQMDGTIRATRNVSRITRTSLGRPGGGSSGNSSRSNKTRNQRDNRSAK 1680
NESNPNG ESRKPQ DG IRATRNVSRITRTSLGRPGG SSGNS+RSNKTRNQRDNRSAK
Sbjct: 1621 NESNPNGFESRKPQTDGIIRATRNVSRITRTSLGRPGGVSSGNSNRSNKTRNQRDNRSAK 1680
Query: 1681 ASDELALKHIQPAIEVASVMLHYRFPSSFLDRAEQFFSAGISTAIDG 1727
ASDE+ LKH QPAIEVASVMLHYRFPSSFLDRAEQFFSA ISTA+DG
Sbjct: 1681 ASDEITLKHNQPAIEVASVMLHYRFPSSFLDRAEQFFSADISTAVDG 1701
BLAST of CmUC06G124080 vs. ExPASy TrEMBL
Match:
A0A5A7UQ26 (SCAPER_N domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G002770 PE=4 SV=1)
HSP 1 Score: 2845.5 bits (7375), Expect = 0.0e+00
Identity = 1535/1727 (88.88%), Postives = 1588/1727 (91.95%), Query Frame = 0
Query: 1 MENSSGVDDQGSGWFEVKKPIPLNKCSVSPPLGTGFMLIYFTFLKHRSSSKFSLQGWVGG 60
MENS GVDDQGSGWFEVKK KHRSSSKFS+Q WVGG
Sbjct: 1 MENSGGVDDQGSGWFEVKK-------------------------KHRSSSKFSVQSWVGG 60
Query: 61 FSGKNSSNSSRSPHLVNKNAQNDRPISKSHPPASGGSYAVYTQSNTENRIATSIGEDGGS 120
FSGKN SNS RS +LVNKN +NDR SKSHPP +GGSYAVYTQSN EN +AT+IG+DGGS
Sbjct: 61 FSGKNLSNSLRSTNLVNKNLRNDRSNSKSHPPTTGGSYAVYTQSNAENCVATAIGDDGGS 120
Query: 121 HCPDKCMVRQDTEFPKSSVLHVTDSHAASGDCEKVPHKDMPGVVQKIKWGDLEDDSLVLN 180
+ PDKCMVRQDTEFPKSSVLH+ DS+A SG+CEKV H+DMPGVVQKIKWGDLEDDSLVLN
Sbjct: 121 YSPDKCMVRQDTEFPKSSVLHIADSNAGSGECEKVSHRDMPGVVQKIKWGDLEDDSLVLN 180
Query: 181 NSENGVEIKFGNIGEVDLGVSEKNEVKDGLASHVSTSVDTQANKLVAVSVREEEASHQAL 240
NS NGVEIKFGNIGEVDLGVSEKNEVK LAS VS+SVDTQ N LVA SVR+EEASHQ L
Sbjct: 181 NSANGVEIKFGNIGEVDLGVSEKNEVKHDLASTVSSSVDTQVNALVAASVRDEEASHQTL 240
Query: 241 LSTDEDKVWRVRHQDINREFIEALEVLNNSEATVCPVIDGSNFKDTGTGHTKPVDDHSSN 300
LST+E +V HQDINREFIE L+VL+N+E TVC VIDGSNFKDT +TKPVD+H SN
Sbjct: 241 LSTNEINSCQVSHQDINREFIEDLKVLSNNEVTVCSVIDGSNFKDTRNENTKPVDNHISN 300
Query: 301 FSSPSCVEAGTEPKVQKAMKLPEVENPELHEAPGKTEFSSLSLPVQNAELVSTETNGPEN 360
F SPSC EAGTEPKVQK +KL EVENP L E GK E SSLSLPVQNAE VST+T G EN
Sbjct: 301 FDSPSCEEAGTEPKVQKVIKLHEVENPVLPETSGKPEISSLSLPVQNAESVSTKTCGHEN 360
Query: 361 SGGTSDSVEEAQIEQGSGTHNVQVVSVPSEGETGESKERFRQRLWCFLFENLNRAIDELY 420
SGG+SDSVEE QIEQGSGTHNVQVVS PSEG TGESKERFRQRLWCFLFENLNRA+DELY
Sbjct: 361 SGGSSDSVEEPQIEQGSGTHNVQVVSAPSEGATGESKERFRQRLWCFLFENLNRAVDELY 420
Query: 421 LLCELECDLEQMKEAILVLEEAASDFKELNARVEEFEEVKRLSSQSVDGMSITMKSDHCR 480
LLCELECDLEQMKEAILVLEEAASDFKELN RVEEFEEVKRLSSQS+DGM ITMKSDHCR
Sbjct: 421 LLCELECDLEQMKEAILVLEEAASDFKELNTRVEEFEEVKRLSSQSIDGMPITMKSDHCR 480
Query: 481 PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNESLSPQCMDQMK 540
PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNE LSPQCMDQMK
Sbjct: 481 PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNECLSPQCMDQMK 540
Query: 541 KTCTINCVVHDAEDSASKIAEKKGVDLTPGSLSGKEKNTESLGTEKVNVVQNIRSRPQNS 600
+ TINCVVHDA DSASK A K VDLTPGSLSGKEKNTESLG++KVNV QNIRSRPQNS
Sbjct: 541 EISTINCVVHDAPDSASKTAGNKDVDLTPGSLSGKEKNTESLGSDKVNVAQNIRSRPQNS 600
Query: 601 LSSINSSKPPLAVKFKREQLESDVERLVSKRERALAEGTCEKNLKAIEHSKRQATVPEKD 660
LSSINSSKPPLAVKFKREQLESDVERLVS+RERALAEGTCEK K IEHSKRQATV EKD
Sbjct: 601 LSSINSSKPPLAVKFKREQLESDVERLVSRRERALAEGTCEKTQKPIEHSKRQATVSEKD 660
Query: 661 KEKEKRNIGVRKSMDAWKEKRNWEDILSSSVWISSRVSHLPGMSRKSAERVRVLHDKLMS 720
KEKEKRN+G RKSMDAWKEKRNWEDILSSSV ISSRVSHLPGMS+KSAERVRVLHDKLMS
Sbjct: 661 KEKEKRNMGARKSMDAWKEKRNWEDILSSSVRISSRVSHLPGMSKKSAERVRVLHDKLMS 720
Query: 721 PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE 780
PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE
Sbjct: 721 PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE 780
Query: 781 GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR 840
GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR
Sbjct: 781 GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR 840
Query: 841 AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA 900
AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA
Sbjct: 841 AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA 900
Query: 901 REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS 960
REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS
Sbjct: 901 REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS 960
Query: 961 SPLLRRYMHKDGPNRSTPNNNGDEQGPCSSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA 1020
SPLLRRYMHKDGP+RS NNNGDEQGP SSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA
Sbjct: 961 SPLLRRYMHKDGPSRSATNNNGDEQGPSSSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA 1020
Query: 1021 LKYEFFEPVNGAENVGIGYRTSIGTARAKIGRWLQELQKLRQERKEGAASLGLIIAEMIK 1080
LKYEF EP+NGAENV IGYRTSIGTARAKIGRWLQELQKLRQ RKEGAASLGLIIAEMIK
Sbjct: 1021 LKYEFVEPINGAENVSIGYRTSIGTARAKIGRWLQELQKLRQARKEGAASLGLIIAEMIK 1080
Query: 1081 YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL 1140
YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL
Sbjct: 1081 YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL 1140
Query: 1141 AQNLLPPIIPMLSTALENYIKIAASINAPGNGVPSSRTSIENFESSSEVLDGSLWTITTI 1200
AQNLLPPIIPMLSTALENYIKIAAS+NAPG+GV SSRTSIENFESSSEVLDGSLWTITTI
Sbjct: 1141 AQNLLPPIIPMLSTALENYIKIAASVNAPGSGVQSSRTSIENFESSSEVLDGSLWTITTI 1200
Query: 1201 IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV 1260
IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV
Sbjct: 1201 IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV 1260
Query: 1261 LTSRPGTDSTINCVLPYSEKLAGDGSGITISAESRDLFGTGFTEDGSPSESGLNGAKIVQ 1320
LTSRPG DSTINCVLP+SE L G SGI IS SRD TGFTEDG P ESGLNG KI+Q
Sbjct: 1261 LTSRPGIDSTINCVLPFSENLTGVESGIAISTMSRDFPRTGFTEDGIPLESGLNGGKILQ 1320
Query: 1321 KPKMTVDQLDDESCEQKINDGMIPSDGGKRE-QTDCSIKANGVNLIQSENQDEPPDSEVV 1380
PKMTVDQL DESCEQKI +GMIPSDGG+RE QTDC I+ANGVN++Q + QDEP D E+V
Sbjct: 1321 NPKMTVDQL-DESCEQKIKNGMIPSDGGQREQQTDCFIEANGVNIVQIDIQDEPQDGEIV 1380
Query: 1381 LKPFVSQEDQKKPVDLVSDLRIKNVTKLKPPIAYLLSAISDTGIVGLLSLLTAVLLQANN 1440
LKP VSQ DQK+ VD+ SD IKNVTK+KPPIAYLLSAISDTGIVGLLSLLTAVLLQANN
Sbjct: 1381 LKPSVSQGDQKQQVDVASDEGIKNVTKMKPPIAYLLSAISDTGIVGLLSLLTAVLLQANN 1440
Query: 1441 RLSSEQAAYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH 1500
RLSSEQA+YILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH
Sbjct: 1441 RLSSEQASYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH 1500
Query: 1501 CSSKWTAPSDPIGLLLLESLSILGHFALFHPENQEVLRWGKSPTILHKVCDLPFVFFSDP 1560
CSSKWTAPSDPIGLLLLESLSILGHFALFHP NQEVLRWGKSPTILHKVCDLPFVFFSDP
Sbjct: 1501 CSSKWTAPSDPIGLLLLESLSILGHFALFHPGNQEVLRWGKSPTILHKVCDLPFVFFSDP 1560
Query: 1561 ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNNPLVPALQSTATQENEES 1620
ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNN LVPAL ST+TQENEES
Sbjct: 1561 ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNNLLVPALPSTSTQENEES 1620
Query: 1621 NESNPNGSESRKPQMDGTIRATRNVSRITRTSLGRPGGGSSGNSSRSNKTRNQRDNRSAK 1680
NESNPNG ESRKPQ DG IRATRNVSRITRTSLGRPGG SSGNS+RSNKTRNQRDNRSAK
Sbjct: 1621 NESNPNGFESRKPQTDGIIRATRNVSRITRTSLGRPGGVSSGNSNRSNKTRNQRDNRSAK 1680
Query: 1681 ASDELALKHIQPAIEVASVMLHYRFPSSFLDRAEQFFSAGISTAIDG 1727
ASDE+ LKH QPAIEVASVMLHYRFPSSFLDRAEQFFSA ISTA+DG
Sbjct: 1681 ASDEITLKHNQPAIEVASVMLHYRFPSSFLDRAEQFFSADISTAVDG 1701
BLAST of CmUC06G124080 vs. ExPASy TrEMBL
Match:
A0A0A0LRS2 (SCAPER_N domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G011560 PE=4 SV=1)
HSP 1 Score: 2820.4 bits (7310), Expect = 0.0e+00
Identity = 1530/1727 (88.59%), Postives = 1584/1727 (91.72%), Query Frame = 0
Query: 1 MENSSGVDDQGSGWFEVKKPIPLNKCSVSPPLGTGFMLIYFTFLKHRSSSKFSLQGWVGG 60
MENSSGVDDQGSGWFEVKK KHRSSSKFSLQ WVGG
Sbjct: 1 MENSSGVDDQGSGWFEVKK-------------------------KHRSSSKFSLQSWVGG 60
Query: 61 FSGKNSSNSSRSPHLVNKNAQNDRPISKSHPPASGGSYAVYTQSNTENRIATSIGEDGGS 120
FSGKNSS+S +LVNKN QNDR SKSHPP +GGSYAVYTQSNTEN +ATSIG+D GS
Sbjct: 61 FSGKNSSDSMCRTNLVNKNPQNDRSNSKSHPP-TGGSYAVYTQSNTENCVATSIGDDEGS 120
Query: 121 HCPDKCMVRQDTEFPKSSVLHVTDSHAASGDCEKVPHKDMPGVVQKIKWGDLEDDSLVLN 180
H PDKCMVRQDTEFPKSSVLH+ DS+A + +CEKV H+DMPGVVQKIKWGDLEDDSLVLN
Sbjct: 121 HSPDKCMVRQDTEFPKSSVLHIADSNAGNEECEKVSHRDMPGVVQKIKWGDLEDDSLVLN 180
Query: 181 NSENGVEIKFGNIGEVDLGVSEKNEVKDGLASHVSTSVDTQANKLVAVSVREEEASHQAL 240
NS NGVEIKFGNIGEVDLGVSEKNEVK LAS VS+SVDTQ + LVA SVR+EEAS Q L
Sbjct: 181 NSANGVEIKFGNIGEVDLGVSEKNEVKHDLASLVSSSVDTQVSTLVAASVRDEEASDQTL 240
Query: 241 LSTDEDKVWRVRHQDINREFIEALEVLNNSEATVCPVIDGSNFKDTGTGHTKPVDDHSSN 300
LST+E +V HQDINREFIE L+V++NSEATVC VIDGSNFKDT +TKPVD+HSSN
Sbjct: 241 LSTNEINSCQVSHQDINREFIEDLKVISNSEATVCSVIDGSNFKDTRNENTKPVDNHSSN 300
Query: 301 FSSPSCVEAGTEPKVQKAMKLPEVENPELHEAPGKTEFSSLSLPVQNAELVSTETNGPEN 360
F C EAGTE KVQKA+K+ EVENP LHEA G+ E SSLSLPVQNAE VST+T+G EN
Sbjct: 301 FDFLFCEEAGTEAKVQKAIKVHEVENPVLHEAAGEPEISSLSLPVQNAESVSTKTSGHEN 360
Query: 361 SGGTSDSVEEAQIEQGSGTHNVQVVSVPSEGETGESKERFRQRLWCFLFENLNRAIDELY 420
SGG SDSVEE QIEQGSGTHNVQVVS PSEG TGESKERFRQRLWCFLFENLNRA+DELY
Sbjct: 361 SGGCSDSVEETQIEQGSGTHNVQVVSAPSEGATGESKERFRQRLWCFLFENLNRAVDELY 420
Query: 421 LLCELECDLEQMKEAILVLEEAASDFKELNARVEEFEEVKRLSSQSVDGMSITMKSDHCR 480
LLCELECDLEQMKEAILVLEEAASDFKELN RVEEFEEVK+LSSQS+DGM ITMKSDHCR
Sbjct: 421 LLCELECDLEQMKEAILVLEEAASDFKELNTRVEEFEEVKKLSSQSIDGMPITMKSDHCR 480
Query: 481 PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNESLSPQCMDQMK 540
PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNE LSPQCMDQMK
Sbjct: 481 PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNECLSPQCMDQMK 540
Query: 541 KTCTINCVVHDAEDSASKIAEKKGVDLTPGSLSGKEKNTESLGTEKVNVVQNIRSRPQNS 600
KT TINCVVHDA DSASK A KGVDLTPGSLSGKEKNTESLG++KVNV QNIRSRPQNS
Sbjct: 541 KTSTINCVVHDAPDSASKAAGNKGVDLTPGSLSGKEKNTESLGSDKVNVAQNIRSRPQNS 600
Query: 601 LSSINSSKPPLAVKFKREQLESDVERLVSKRERALAEGTCEKNLKAIEHSKRQATVPEKD 660
SSINSSKPPLAVKFKREQLESDVERLVS+RERALAEGTCEK K +EHSKRQATV EKD
Sbjct: 601 -SSINSSKPPLAVKFKREQLESDVERLVSRRERALAEGTCEKTQKPVEHSKRQATVSEKD 660
Query: 661 KEKEKRNIGVRKSMDAWKEKRNWEDILSSSVWISSRVSHLPGMSRKSAERVRVLHDKLMS 720
KEKEKRN+G RKSMDAWKEKRNWEDILSSS+ ISSRVSHLPGMS+KSAERVRVLHDKLMS
Sbjct: 661 KEKEKRNMGARKSMDAWKEKRNWEDILSSSIRISSRVSHLPGMSKKSAERVRVLHDKLMS 720
Query: 721 PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE 780
PDKKKKTSLDVKREAEEKHARAMKIR ELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE
Sbjct: 721 PDKKKKTSLDVKREAEEKHARAMKIRCELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE 780
Query: 781 GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR 840
GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR
Sbjct: 781 GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR 840
Query: 841 AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA 900
AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA
Sbjct: 841 AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA 900
Query: 901 REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS 960
REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS
Sbjct: 901 REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS 960
Query: 961 SPLLRRYMHKDGPNRSTPNNNGDEQGPCSSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA 1020
SPLLRRYMHKDGP+RS NNN DEQGP SSDLGSGLAMGKTT+QQHMKRRIKRIRQRLMA
Sbjct: 961 SPLLRRYMHKDGPSRSATNNNVDEQGPSSSDLGSGLAMGKTTLQQHMKRRIKRIRQRLMA 1020
Query: 1021 LKYEFFEPVNGAENVGIGYRTSIGTARAKIGRWLQELQKLRQERKEGAASLGLIIAEMIK 1080
LKYEF E NGAENV IGYRTSIGTARAKIGRWLQELQKLRQ RKEGAASLGLIIAEMIK
Sbjct: 1021 LKYEFVETTNGAENVSIGYRTSIGTARAKIGRWLQELQKLRQARKEGAASLGLIIAEMIK 1080
Query: 1081 YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL 1140
YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL
Sbjct: 1081 YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL 1140
Query: 1141 AQNLLPPIIPMLSTALENYIKIAASINAPGNGVPSSRTSIENFESSSEVLDGSLWTITTI 1200
AQNLLPPIIPMLSTALENYIKIAASINAPGNGV SSRTSIENFESSSEVLDGSLWTITTI
Sbjct: 1141 AQNLLPPIIPMLSTALENYIKIAASINAPGNGVQSSRTSIENFESSSEVLDGSLWTITTI 1200
Query: 1201 IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV 1260
IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV
Sbjct: 1201 IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV 1260
Query: 1261 LTSRPGTDSTINCVLPYSEKLAGDGSGITISAESRDLFGTGFTEDGSPSESGLNGAKIVQ 1320
LTSRPGTDSTINCVLP+SE L G SGI IS SRD TGFTEDG P ESGLNG KI+Q
Sbjct: 1261 LTSRPGTDSTINCVLPFSENLTGVESGIAISTMSRDFPRTGFTEDGIPLESGLNGGKILQ 1320
Query: 1321 KPKMTVDQLDDESCEQKINDGMIPSDGGKRE-QTDCSIKANGVNLIQSENQDEPPDSEVV 1380
PKMTVDQL DESCEQKI GMIP DGG+RE QTDC I+ANGVNLIQ + QDEP D E+V
Sbjct: 1321 NPKMTVDQL-DESCEQKIKTGMIPIDGGQREPQTDCFIEANGVNLIQIDIQDEPQDGEIV 1380
Query: 1381 LKPFVSQEDQKKPVDLVSDLRIKNVTKLKPPIAYLLSAISDTGIVGLLSLLTAVLLQANN 1440
LKP+VSQ DQK+ VD+VSD IKNVTK+KPPIAYLLSAISDTGIVGLLSLLTAVLLQANN
Sbjct: 1381 LKPYVSQGDQKQHVDVVSDEGIKNVTKMKPPIAYLLSAISDTGIVGLLSLLTAVLLQANN 1440
Query: 1441 RLSSEQAAYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH 1500
RLSSEQA+YILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH
Sbjct: 1441 RLSSEQASYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH 1500
Query: 1501 CSSKWTAPSDPIGLLLLESLSILGHFALFHPENQEVLRWGKSPTILHKVCDLPFVFFSDP 1560
CSSKW APSDPIGLLLLESLSILGHFALFHP NQEVLRWGKSPTILHKVCDLPFVFFSDP
Sbjct: 1501 CSSKWAAPSDPIGLLLLESLSILGHFALFHPGNQEVLRWGKSPTILHKVCDLPFVFFSDP 1560
Query: 1561 ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNNPLVPALQSTATQENEES 1620
ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNN LVPAL ST+TQENEES
Sbjct: 1561 ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNNLLVPALPSTSTQENEES 1620
Query: 1621 NESNPNGSESRKPQMDGTIRATRNVSRITRTSLGRPGGGSSGNSSRSNKTRNQRDNRSAK 1680
NESNPNG ESRKPQ DG IRATRNVSRITRTSLGRPGG SSGNS+R+NKTRNQRDNRSAK
Sbjct: 1621 NESNPNGFESRKPQTDGIIRATRNVSRITRTSLGRPGGVSSGNSNRNNKTRNQRDNRSAK 1680
Query: 1681 ASDELALKHIQPAIEVASVMLHYRFPSSFLDRAEQFFSAGISTAIDG 1727
ASDE+ LKH QPA+EVASVMLHYRFPSSFLDRAEQFFSA ISTA+DG
Sbjct: 1681 ASDEITLKHNQPAMEVASVMLHYRFPSSFLDRAEQFFSADISTAVDG 1699
BLAST of CmUC06G124080 vs. ExPASy TrEMBL
Match:
A0A1S3BVR2 (uncharacterized protein LOC103494020 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103494020 PE=4 SV=1)
HSP 1 Score: 2798.8 bits (7254), Expect = 0.0e+00
Identity = 1515/1727 (87.72%), Postives = 1567/1727 (90.74%), Query Frame = 0
Query: 1 MENSSGVDDQGSGWFEVKKPIPLNKCSVSPPLGTGFMLIYFTFLKHRSSSKFSLQGWVGG 60
MENS GVDDQGSGWFEVKK KHRSSSKFS+Q WVGG
Sbjct: 1 MENSGGVDDQGSGWFEVKK-------------------------KHRSSSKFSVQSWVGG 60
Query: 61 FSGKNSSNSSRSPHLVNKNAQNDRPISKSHPPASGGSYAVYTQSNTENRIATSIGEDGGS 120
FSGKN SNS RS +LVNKN +NDR SKSHPP +GGSYAVYTQSN EN +AT+IG+DGGS
Sbjct: 61 FSGKNLSNSLRSTNLVNKNLRNDRSNSKSHPPTTGGSYAVYTQSNAENCVATAIGDDGGS 120
Query: 121 HCPDKCMVRQDTEFPKSSVLHVTDSHAASGDCEKVPHKDMPGVVQKIKWGDLEDDSLVLN 180
+ PDKCMVRQDTEFPKSSVLH+ DS+A SG+CEKV H+DMPGVVQKIKWGDLEDDSLVLN
Sbjct: 121 YSPDKCMVRQDTEFPKSSVLHIADSNAGSGECEKVSHRDMPGVVQKIKWGDLEDDSLVLN 180
Query: 181 NSENGVEIKFGNIGEVDLGVSEKNEVKDGLASHVSTSVDTQANKLVAVSVREEEASHQAL 240
NS NGVEIKFGNIGEVDLGVSEKNEVK LAS VS+SVDTQ N LVA SVR+EEASHQ L
Sbjct: 181 NSANGVEIKFGNIGEVDLGVSEKNEVKHDLASTVSSSVDTQVNALVAASVRDEEASHQTL 240
Query: 241 LSTDEDKVWRVRHQDINREFIEALEVLNNSEATVCPVIDGSNFKDTGTGHTKPVDDHSSN 300
LST+E +V HQDINREFIE L+VL+N+E TVC VIDGSNFKDT +TKPVD+H SN
Sbjct: 241 LSTNEINSCQVSHQDINREFIEDLKVLSNNEVTVCSVIDGSNFKDTRNENTKPVDNHISN 300
Query: 301 FSSPSCVEAGTEPKVQKAMKLPEVENPELHEAPGKTEFSSLSLPVQNAELVSTETNGPEN 360
F SPSC EAGTEPKVQK +KL EVENP L PE
Sbjct: 301 FDSPSCEEAGTEPKVQKVIKLHEVENPVL----------------------------PET 360
Query: 361 SGGTSDSVEEAQIEQGSGTHNVQVVSVPSEGETGESKERFRQRLWCFLFENLNRAIDELY 420
SGG+SDSVEE QIEQGSGTHNVQVVS PSEG TGESKERFRQRLWCFLFENLNRA+DELY
Sbjct: 361 SGGSSDSVEEPQIEQGSGTHNVQVVSAPSEGATGESKERFRQRLWCFLFENLNRAVDELY 420
Query: 421 LLCELECDLEQMKEAILVLEEAASDFKELNARVEEFEEVKRLSSQSVDGMSITMKSDHCR 480
LLCELECDLEQMKEAILVLEEAASDFKELN RVEEFEEVKRLSSQS+DGM ITMKSDHCR
Sbjct: 421 LLCELECDLEQMKEAILVLEEAASDFKELNTRVEEFEEVKRLSSQSIDGMPITMKSDHCR 480
Query: 481 PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNESLSPQCMDQMK 540
PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNE LSPQCMDQMK
Sbjct: 481 PHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNECLSPQCMDQMK 540
Query: 541 KTCTINCVVHDAEDSASKIAEKKGVDLTPGSLSGKEKNTESLGTEKVNVVQNIRSRPQNS 600
+ TINCVVHDA DSASK A K VDLTPGSLSGKEKNTESLG++KVNV QNIRSRPQNS
Sbjct: 541 EISTINCVVHDAPDSASKTAGNKDVDLTPGSLSGKEKNTESLGSDKVNVAQNIRSRPQNS 600
Query: 601 LSSINSSKPPLAVKFKREQLESDVERLVSKRERALAEGTCEKNLKAIEHSKRQATVPEKD 660
LSSINSSKPPLAVKFKREQLESDVERLVS+RERALAEGTCEK K IEHSKRQATV EKD
Sbjct: 601 LSSINSSKPPLAVKFKREQLESDVERLVSRRERALAEGTCEKTQKPIEHSKRQATVSEKD 660
Query: 661 KEKEKRNIGVRKSMDAWKEKRNWEDILSSSVWISSRVSHLPGMSRKSAERVRVLHDKLMS 720
KEKEKRN+G RKSMDAWKEKRNWEDILSSSV ISSRVSHLPGMS+KSAERVRVLHDKLMS
Sbjct: 661 KEKEKRNMGARKSMDAWKEKRNWEDILSSSVRISSRVSHLPGMSKKSAERVRVLHDKLMS 720
Query: 721 PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE 780
PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE
Sbjct: 721 PDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTMKLRE 780
Query: 781 GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR 840
GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR
Sbjct: 781 GMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGSELRR 840
Query: 841 AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA 900
AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA
Sbjct: 841 AEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKASSAA 900
Query: 901 REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS 960
REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS
Sbjct: 901 REARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDFRDQS 960
Query: 961 SPLLRRYMHKDGPNRSTPNNNGDEQGPCSSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA 1020
SPLLRRYMHKDGP+RS NNNGDEQGP SSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA
Sbjct: 961 SPLLRRYMHKDGPSRSATNNNGDEQGPSSSDLGSGLAMGKTTMQQHMKRRIKRIRQRLMA 1020
Query: 1021 LKYEFFEPVNGAENVGIGYRTSIGTARAKIGRWLQELQKLRQERKEGAASLGLIIAEMIK 1080
LKYEF EP+NGAENV IGYRTSIGTARAKIGRWLQELQKLRQ RKEGAASLGLIIAEMIK
Sbjct: 1021 LKYEFVEPINGAENVSIGYRTSIGTARAKIGRWLQELQKLRQARKEGAASLGLIIAEMIK 1080
Query: 1081 YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL 1140
YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL
Sbjct: 1081 YLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANRSYFL 1140
Query: 1141 AQNLLPPIIPMLSTALENYIKIAASINAPGNGVPSSRTSIENFESSSEVLDGSLWTITTI 1200
AQNLLPPIIPMLSTALENYIKIAAS+NAPG+GV SSRTSIENFESSSEVLDGSLWTITTI
Sbjct: 1141 AQNLLPPIIPMLSTALENYIKIAASVNAPGSGVQSSRTSIENFESSSEVLDGSLWTITTI 1200
Query: 1201 IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV 1260
IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV
Sbjct: 1201 IGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIRLLVV 1260
Query: 1261 LTSRPGTDSTINCVLPYSEKLAGDGSGITISAESRDLFGTGFTEDGSPSESGLNGAKIVQ 1320
LTSRPG DSTINCVLP+SE L G SGI IS SRD TGFTEDG P ESGLNG KI+Q
Sbjct: 1261 LTSRPGIDSTINCVLPFSENLTGVESGIAISTMSRDFPRTGFTEDGIPLESGLNGGKILQ 1320
Query: 1321 KPKMTVDQLDDESCEQKINDGMIPSDGGKRE-QTDCSIKANGVNLIQSENQDEPPDSEVV 1380
PKMTVDQL DESCEQKI +GMIPSDGG+RE QTDC I+ANGVN++Q + QDEP D E+V
Sbjct: 1321 NPKMTVDQL-DESCEQKIKNGMIPSDGGQREQQTDCFIEANGVNIVQIDIQDEPQDGEIV 1380
Query: 1381 LKPFVSQEDQKKPVDLVSDLRIKNVTKLKPPIAYLLSAISDTGIVGLLSLLTAVLLQANN 1440
LKP VSQ DQK+ VD+ SD IKNVTK+KPPIAYLLSAISDTGIVGLLSLLTAVLLQANN
Sbjct: 1381 LKPSVSQGDQKQQVDVASDEGIKNVTKMKPPIAYLLSAISDTGIVGLLSLLTAVLLQANN 1440
Query: 1441 RLSSEQAAYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH 1500
RLSSEQA+YILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH
Sbjct: 1441 RLSSEQASYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFLLSH 1500
Query: 1501 CSSKWTAPSDPIGLLLLESLSILGHFALFHPENQEVLRWGKSPTILHKVCDLPFVFFSDP 1560
CSSKWTAPSDPIGLLLLESLSILGHFALFHP NQEVLRWGKSPTILHKVCDLPFVFFSDP
Sbjct: 1501 CSSKWTAPSDPIGLLLLESLSILGHFALFHPGNQEVLRWGKSPTILHKVCDLPFVFFSDP 1560
Query: 1561 ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNNPLVPALQSTATQENEES 1620
ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNN LVPAL ST+TQENEES
Sbjct: 1561 ELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNNLLVPALPSTSTQENEES 1620
Query: 1621 NESNPNGSESRKPQMDGTIRATRNVSRITRTSLGRPGGGSSGNSSRSNKTRNQRDNRSAK 1680
NESNPNG ESRKPQ DG IRATRNVSRITRTSLGRPGG SSGNS+RSNKTRNQRDNRSAK
Sbjct: 1621 NESNPNGFESRKPQTDGIIRATRNVSRITRTSLGRPGGVSSGNSNRSNKTRNQRDNRSAK 1673
Query: 1681 ASDELALKHIQPAIEVASVMLHYRFPSSFLDRAEQFFSAGISTAIDG 1727
ASDE+ LKH QPAIEVASVMLHYRFPSSFLDRAEQFFSA ISTA+DG
Sbjct: 1681 ASDEITLKHNQPAIEVASVMLHYRFPSSFLDRAEQFFSADISTAVDG 1673
BLAST of CmUC06G124080 vs. ExPASy TrEMBL
Match:
A0A6J1BYK0 (uncharacterized protein LOC111006915 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111006915 PE=4 SV=1)
HSP 1 Score: 2680.6 bits (6947), Expect = 0.0e+00
Identity = 1455/1730 (84.10%), Postives = 1545/1730 (89.31%), Query Frame = 0
Query: 1 MENSSGVDDQGSGWFEVKKPIPLNKCSVSPPLGTGFMLIYFTFLKHRSSSKFSLQGWVGG 60
MENS G DDQGSGWFEVKK KHRSSSKFSLQ WVGG
Sbjct: 1 MENSGGGDDQGSGWFEVKK-------------------------KHRSSSKFSLQSWVGG 60
Query: 61 FSGKNSSNSSRSPHLVNKNAQNDRPISKSHPPASGGSYAVYTQSNTENRIATSIGEDGGS 120
SGKNSSNS RS H VNKNA+NDR SKSH PASGGSYA+ TQSN EN + TSIG++ GS
Sbjct: 61 ISGKNSSNSLRSLHSVNKNARNDR--SKSHSPASGGSYAIRTQSNNENSVTTSIGDNEGS 120
Query: 121 HCPDKCMVRQDTEFPKSSVLHVTDSHAASGDCEKVPHKDMPGVVQKIKWGDLEDDSLVLN 180
+CP+KCM++QD E+PK+SVLH+ DSH SGDCEKVP DMPG VQKIKWGD+EDDSLVLN
Sbjct: 121 NCPEKCMIKQDAEYPKASVLHIMDSHGGSGDCEKVPRNDMPGAVQKIKWGDIEDDSLVLN 180
Query: 181 NSENGVEIKFGNIGEVDLGVSEKNEVKDGLASHVSTSVDTQANKLVAVSVREEEASHQAL 240
NS NGVEIKFGNIGEVDL VS KNEVK L SHVS+S+DTQ LVA+SVR+ EASHQAL
Sbjct: 181 NSANGVEIKFGNIGEVDLAVSAKNEVKHDLVSHVSSSLDTQVKNLVALSVRQGEASHQAL 240
Query: 241 LSTDEDKVWRVRHQDINREFIEALEVLNNSEATVCPVIDGSNFKDTGTGHTKPVDDHSSN 300
LST+E+K+WRV HQD N++FIE LE+ ++ EA VC V D SN KD GT H K V+DHSS+
Sbjct: 241 LSTNEEKLWRVSHQDSNKKFIEDLELPSHDEAIVCTVTDDSNCKDIGTEHNKIVNDHSSS 300
Query: 301 FSSPSCVEAGTEPKVQKAMKLPEVENPELHEAPGKTEFSSLSLPVQNAELVSTETNGPEN 360
F+SP+ EAG EPKV+KA +LPEVE ELHEA GK FSS L VQ+ ELVSTET+G EN
Sbjct: 301 FNSPTREEAGIEPKVKKAAELPEVEILELHEAAGKNGFSSSPLIVQDVELVSTETSGHEN 360
Query: 361 SGGTSDSVEEAQIEQGSGTH----NVQVVSVPSEGETGESKERFRQRLWCFLFENLNRAI 420
SGG+ D VE+AQIE GSGTH NVQVVSVPSEGETGESKERFRQRLWCFLFENLNRA+
Sbjct: 361 SGGSCDFVEDAQIEHGSGTHTDKLNVQVVSVPSEGETGESKERFRQRLWCFLFENLNRAV 420
Query: 421 DELYLLCELECDLEQMKEAILVLEEAASDFKELNARVEEFEEVKRLSSQSVDGMSITMKS 480
DELYLLCELECDLEQMKEAILVLEEAASDFKELNARVEEFEEVKRLSSQSVDGM ITMKS
Sbjct: 421 DELYLLCELECDLEQMKEAILVLEEAASDFKELNARVEEFEEVKRLSSQSVDGMPITMKS 480
Query: 481 DHCRPHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERANMLEASKKLPGNESLSPQCM 540
DHCRPHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERA+MLE+SKKLPG E LSPQ M
Sbjct: 481 DHCRPHALSWEVRRMTNSPHKAEILSSSLEAFKKIRQERASMLESSKKLPGTECLSPQRM 540
Query: 541 DQMKKTCTINCVVHDAEDSASKIAEKKGVDLTPGSLSGKEKNTESLGTEKVNVVQNIRSR 600
DQMKKT I+ ++ +A+DSA K E +G DLTPGSLSGKEKN ES+ T+KVN VQN RSR
Sbjct: 541 DQMKKTSMIDNILSNADDSALKTTENRGADLTPGSLSGKEKNIESICTDKVNAVQNGRSR 600
Query: 601 PQNSLSSINSSKPPLAVKFKREQLESDVERLVSKRERALAEGTCEKNLKAIEHSKRQATV 660
PQNSLSSINSSKPPLAVKFKREQLE DVERL+ K+ERALAEGTCE KRQA V
Sbjct: 601 PQNSLSSINSSKPPLAVKFKREQLELDVERLLPKKERALAEGTCE---------KRQANV 660
Query: 661 PEKDKEKEKRNIGVRKSMDAWKEKRNWEDILSSSVWISSRVSHLPGMSRKSAERVRVLHD 720
PEKDKEKEKRN+ RKSMDAWKEKRNWEDILSSSV SSRVSHLPGMSRKSAERVRVLHD
Sbjct: 661 PEKDKEKEKRNLAARKSMDAWKEKRNWEDILSSSVRTSSRVSHLPGMSRKSAERVRVLHD 720
Query: 721 KLMSPDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTM 780
KLMSPDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTM
Sbjct: 721 KLMSPDKKKKTSLDVKREAEEKHARAMKIRSELENERVQKLQRTSEKLNRVNEWQAVRTM 780
Query: 781 KLREGMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGS 840
KLREGMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGS
Sbjct: 781 KLREGMYARHQRSESRHEAFLAQVVKRAGDESSKVNEVRFITSLNEENKKIMLRQKLHGS 840
Query: 841 ELRRAEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKA 900
ELRRAEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKA
Sbjct: 841 ELRRAEKLQVMKIKQKEDMAREEAVLERKKLIEAEKLQRLAETQRKKEEAHVRREEERKA 900
Query: 901 SSAAREARAMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDF 960
SSAAREAR+MEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDF
Sbjct: 901 SSAAREARSMEQLRRKEERARAQQEEAELMAQKLAERLSESEQRRKFYLEQIRERASMDF 960
Query: 961 RDQSSPLLRRYMHKDGPNRSTPNNNGDEQGPCSSDLGSGLAMGKTTMQQHMKRRIKRIRQ 1020
RDQSSPLLRRYMHKD +R T NNN DEQGP SSDLGS LAMGKTTMQQHMKRRIKRIRQ
Sbjct: 961 RDQSSPLLRRYMHKDAQSRLTSNNNCDEQGPSSSDLGSDLAMGKTTMQQHMKRRIKRIRQ 1020
Query: 1021 RLMALKYEFFEPVNGAENVGIGYRTSIGTARAKIGRWLQELQKLRQERKEGAASLGLIIA 1080
RLMALKYEF EP+ GAENVGIGYRTSIGTARAKIGRWLQELQKLRQ RKEGAASLGLIIA
Sbjct: 1021 RLMALKYEFIEPITGAENVGIGYRTSIGTARAKIGRWLQELQKLRQARKEGAASLGLIIA 1080
Query: 1081 EMIKYLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANR 1140
EMIKYLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASAN+
Sbjct: 1081 EMIKYLDGRELELQASRQAGLLDFIASALPASHTSKPEACQVMIHLLKLLRVVLSASANK 1140
Query: 1141 SYFLAQNLLPPIIPMLSTALENYIKIAASINAPGNGVPSSRTSIENFESSSEVLDGSLWT 1200
SYFLAQNLLPPIIPMLSTALENYIKIAAS+NAPGN +PSS+TSIENFESSSEVLDGSLWT
Sbjct: 1141 SYFLAQNLLPPIIPMLSTALENYIKIAASVNAPGNVLPSSKTSIENFESSSEVLDGSLWT 1200
Query: 1201 ITTIIGHINPEGPQLQMWDGLLELLVAYQVIQRLRDLFALYDRPQVEGSPFPSSILLSIR 1260
ITTIIGH++PEGPQLQMWDGLLELLVAYQVI RLRDLFALYDRPQVEGSPFPSSILLSIR
Sbjct: 1201 ITTIIGHVSPEGPQLQMWDGLLELLVAYQVIHRLRDLFALYDRPQVEGSPFPSSILLSIR 1260
Query: 1261 LLVVLTSRPGTDSTINCVLPYSEKLAGDGSGITISAESRDLFGTGFTEDGSPSESGLNGA 1320
LLVVLTSRPGT+STINC+LP SE + GD SGI ISAESRD GTGFTED SPSE LNG
Sbjct: 1261 LLVVLTSRPGTESTINCILPASEAVVGDESGIAISAESRDFHGTGFTEDNSPSEFALNGC 1320
Query: 1321 KIVQKPKMTVDQLDDESCEQKINDGMIPSDGGKREQTDCSIKANGVNLIQSENQDEPPDS 1380
KIVQKPK+ +D+LD ES EQK N+G I DGG+REQ D SI+ANGV+L+ Q+EP DS
Sbjct: 1321 KIVQKPKIAIDKLDGESFEQKKNNGTISGDGGQREQMDSSIEANGVSLV----QNEPQDS 1380
Query: 1381 EVVLKPFVSQEDQKKPVDLVSDLRIKNVTKLKPPIAYLLSAISDTGIVGLLSLLTAVLLQ 1440
EV+LKPFVSQ DQ++PVDLVSD RIKN+TKLKPPIAYLLSAISDTGIVGLLSLLTAVLLQ
Sbjct: 1381 EVILKPFVSQGDQRQPVDLVSDRRIKNITKLKPPIAYLLSAISDTGIVGLLSLLTAVLLQ 1440
Query: 1441 ANNRLSSEQAAYILPSNFEDVATGVLKVLNNLAFLDLKFMQRVLARPDLKMEFFHLMSFL 1500
ANNRLSSEQA+YILPSNFEDVATGVLKVLNNLAFLDLKF+QR+LARPDLKMEFFHLMSFL
Sbjct: 1441 ANNRLSSEQASYILPSNFEDVATGVLKVLNNLAFLDLKFVQRMLARPDLKMEFFHLMSFL 1500
Query: 1501 LSHCSSKWTAPSDPIGLLLLESLSILGHFALFHPENQEVLRWGKSPTILHKVCDLPFVFF 1560
LSHCSSKWTAPSD IGLLLLESLSILGHFALFHPENQEVL WGKSPTILHKVCDLPFVFF
Sbjct: 1501 LSHCSSKWTAPSDQIGLLLLESLSILGHFALFHPENQEVLCWGKSPTILHKVCDLPFVFF 1560
Query: 1561 SDPELMPVLASTLVAACYGCEQNKSVVQQELSIDMLLSLLRSCKNNPLVPALQSTATQEN 1620
SDPELMPVLASTLVAACYG EQNKSVVQQELSIDMLLSLLRSCKNN V A+QS +T EN
Sbjct: 1561 SDPELMPVLASTLVAACYGYEQNKSVVQQELSIDMLLSLLRSCKNNLSVSAVQSISTLEN 1620
Query: 1621 EES-NESNPNGSESRKPQMDGTIRATRNVSRITRTSLGRPGGGSSGNSSRSNKTRNQRDN 1680
+ES NE NPNG+E RKPQMD T+RA+RNVSR TR SLGRPGGGSSGNSSRSNK R+QRDN
Sbjct: 1621 DESANECNPNGTEFRKPQMDSTLRASRNVSRTTRISLGRPGGGSSGNSSRSNKIRSQRDN 1680
Query: 1681 RSAKASDELALKHIQPAIEVASVMLHYRFPSSFLDRAEQFFSAGISTAID 1726
RSAKASDE+ LK+ QPA+EVASVMLHYR PSSF++RAEQFFSAG T +D
Sbjct: 1681 RSAKASDEIVLKNNQPALEVASVMLHYRIPSSFIERAEQFFSAGTPTTVD 1690
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038878649.1 | 0.0e+00 | 91.87 | uncharacterized protein LOC120070833 [Benincasa hispida] | [more] |
XP_008453238.1 | 0.0e+00 | 88.88 | PREDICTED: uncharacterized protein LOC103494020 isoform X1 [Cucumis melo] >XP_00... | [more] |
XP_011660268.1 | 0.0e+00 | 88.59 | uncharacterized protein LOC101216553 isoform X3 [Cucumis sativus] >KGN63704.1 hy... | [more] |
XP_031744557.1 | 0.0e+00 | 87.63 | uncharacterized protein LOC101216553 isoform X1 [Cucumis sativus] >XP_031744565.... | [more] |
XP_008453242.1 | 0.0e+00 | 87.72 | PREDICTED: uncharacterized protein LOC103494020 isoform X2 [Cucumis melo] | [more] |
Match Name | E-value | Identity | Description | |
Q9BY12 | 6.4e-31 | 22.22 | S phase cyclin A-associated protein in the endoplasmic reticulum OS=Homo sapiens... | [more] |
Match Name | E-value | Identity | Description | |
A0A1S3BWJ0 | 0.0e+00 | 88.88 | uncharacterized protein LOC103494020 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A5A7UQ26 | 0.0e+00 | 88.88 | SCAPER_N domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C... | [more] |
A0A0A0LRS2 | 0.0e+00 | 88.59 | SCAPER_N domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G011560 PE... | [more] |
A0A1S3BVR2 | 0.0e+00 | 87.72 | uncharacterized protein LOC103494020 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A6J1BYK0 | 0.0e+00 | 84.10 | uncharacterized protein LOC111006915 isoform X2 OS=Momordica charantia OX=3673 G... | [more] |
Match Name | E-value | Identity | Description | |