Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTAATCCTGGCGTCGGGACCAAGTTTGTGTCCGTGAATCTCAACAAATCGTATGGACAGGCTCATCATCATCATTCATCTCATTCGAATTCTTATGGATCAAATCGAACGCGACCTGGTGGTCATGGGGCCGGCGGAGGAATGGTGGTCCTGTCGAGGCCCCGCAGCTCCCAGAAACCTGGGCCGAAGCTTTCTGTTCCACCCCCCTTGAATCTGCCTTCTTTGCGAAAGGAGCATGAGAGACTTGATTCTTTGGGTTCAGGCACTGGGCCAACTGGTGGAGGGGTTTTGGGAAACGGACAGAGGCCAACTTCAGCTGGTATGGGTTGGACAAAGCCGCGCACCAGCGATTTGCCAGAGAAAGAAGGGGTTAGTGCTAATATAGTTGATAAAATTGATCCGTCTTTGCGAAGTGTTGATGGGGTGAGCGGTGGGAGTAGTGTGTATATGCCTCCTTCTGCTCGTGCTGGTACGACAGGCCCGATTGTGTCTACTTCTGCTTCCTCTCAGGTTCATACTGCAGTTGAAAAAGCCCCCGTTTTGAGAGGTGAGGATTTCCCTTCTTTGCAAGCAACTTTACCTTCTGCAGCTGCGCCTTCTCAGAAACAGAGAGATGGGTTGAGTTCTAAATTGAAGCATGCGGCTGAAGGTTCATATGAAGAGCAGAGGGATACTTCTCATTTAAGTTCAAGAATAGATGCCCGCTCAAAATTTCAGTCATCACAGAAAAGTATTCCCTGTGAAAATGCAAGAAATGGCAACTCTTACAGTTCTGGGAGTTTCCAGTCACCAGAGTTATCTCGGAAGCAGGAAGATATTTTCCCAGGTCCTTTACCACTCGTCTCAATGAATCCTAGATCAGACTGGGCTGATGATGAACGTGATACAAGCCATGGTTTAATTGACAGGGTTAGGGATCGAGGGCATCCAAAAAGTGAGGCTTATTGGGAGAGGGACTTTGACATGCCTCGGGTTAGTTCTCTTCCCCACAAGCCCACTCATAATTTTTCTCAGAGATGGAATCTGCGGGATGATGAATCTGGGAAGTTTCATTCTAGTGACATTCACAAAGTGGACTCTTATGGTCGGGATGTCAGGACGGCTGGTAGAGAAGGCTGGGAAGGAAACTTTCGGAAAAACAACCCTATACCAAAAGATGGATTTGGTTCAGACAGTGGTAATGATAGAAATGATATTGCAGGAAGGCCCACTAGCCTCGATCGAGAAACAAATGCTGATAACATGCATGTTTCACATTTTCGAGAACATGCTAATAAAGATGGGAGGAGAGATACTGGATTTGGACAGAATGGGCGGCAATCTTGGAATAGTGCAACAGAATCTTATAGCTCCCAGGAACCAGATCGGAATGTAAGAGACAAGTATGGTAGTGAGCAACACAGTAGGTACAGGGGCGAAACACATAATACTTCAGTTGCAAACTCATCATACTCTTCAGGTTTAAAACGAATTCCTGCCGATGAGCCATTGCTGAACTTTGGCAGGGATAGACGTTCGTTTGCAAAGATTGAGAAACCTTACATGGAAGATCCTTTTATGAAAGATTTTGGAGCCTCTAGTTTTGATGGACGAGATCCTTTTACTACTGGTCTTGTTGGGGTGGTTAAGAGGAAGAAGGATGTTATTAAGCAGATTGATTTTCATGACCCTGTTAGGGAATCTTTTGAGGCCGAACTTGAGAGAGTTCAACAGATCCAAGAGCAGGAGCGACAGCGAATTATTGAGGAGCAAGAAAGAGCTCTAGAACTAGCTAGGAGAGAAGAGGAAGAGAGACAGAGGCTTGCAAGGGAACATGAAGAAAGGCAGAGGAGAGCTGAAGAAGAAGCCAGAGAAGCAGCATGGAGAGCTGAGCAAGAGCGACTGGAGGCTATCCAAAAGGCTGAAGAACTTCGGATAGCTAGAGAGGAAGAAAAACAGAGGATTCTTCTGGAGGAAGAGAGAAGAAAGCAGGCTGCTAAGCTAAAACTTTTAGAATTAGAGGAAAGGATGGCCAAGAGGCAGGCTGAAGCTGTGAAATCAAGCAGTTTGACTTCAGATATTCCCGAAAAGAAGATTCCCAGTGTTGTTAAAGATGTTTCCAGGTTGGCGGACACAGTTGATTGGGAAGATGGTGAAAAGATGGTGGAGCGAATCACTACATCAGCTTCTTCTGAGTCATCTAGCATAATTAGGTCCTCTGAGGTGGGCCTTAGATCTCAATTTTCTAGAGATGGTTCTCCTTCCTTTGTGGACAGAGGCAAGTCTGTTAATTCATGGAGAAGAGATTTTTATGAGAGAGGAAGTGGCTCTCAATTTGTTCTACAGGATCAGAGTACTGGCTACAATGGCACAAGGCGGGAGGCATCAACTGGTGGGCGGGTATCATCAAGGAAAGAGCTTTATGGGGGAGCTGGATTTACGACTTCCAAGACATCTCATAGAAGAGGTATTACAGAGCCACAATCAGATGAATATTCTCAGCTAAGAGGGCAGAGACCTAACCTTTCTGGAGGTGGCGATCATTATAACAGGAGCCAAGAGTTTGACCCCGAATTTCAGGATAATGTTGAGAATTTTGGTGATCATGGATGGAGGCAGGAGAGTGGTCGCAACAACTTCTATTTTCCTTACCCTGAACGAGTAAATCCAATTTCTGAGACTGATGGGTCCTATTCTGTTGGAAGGTCACGCTATTCCCATAGGCAACCTCGTGTTCTTCCTCCTCCATCTGTAGCTTCTATACAGAAATCTTCTGTCAGGGGCGAATATGAATCTGTTCCCCGGGATATTGTAGAAAGTGAGATACAATATGACCATCCGGCAAGTAATATTTCTACTGCTCAGACAAGGTATATTCATCATGAAAACCGTGCACTTCCTGAGATAATTGATGTTAATTTAGAGAATGGTGAGAATGAGGAGCAGAAACCAGACGGTAACACAACACTGCGGTGTGACTCACAGTCAACCCTTTCTGTTTTTAGCCCCCCAACCTCTCCAACCCATCTATCTCATGAGGACTTGGATGATTCTGGAGATTCTCCTGTTTTATCAGCTAGCAGAGAAGGCACATTGTCAATAGAGGACAATGAATCTGCTGAACCAGCCAAGGTTGGGAAAGAGATCATGATTACCTCTGCTAGGGTATCTACAGGTGATGAAGATGAATGGGGTGTTGTAGATGAGCATGTGCAAGAACAGGAAGAATATGATGAGGATGATGATGGGTATCAGGAAGAAGACGAAGTTCATGAAGGAGAGGACGAGAACATTGACCTTGTACAAGATTTTGATGATTTGCACTTAGATGATAAAGGATCACCCCATATGTTAGATAACTTGGTATTAGGTTTTAATGAAGGCGTTGAAGTGGGGATGCCGAATGACGAGTTTGAAAGAATTCCAGGAAATGAGGAAAATATTTATGTCGCACCAGAAATTTCAAATGGCATCAGGGAAGAGCAGGGGTCTTCTGAAGGATTGCAAGTTGATGGTAAAATCTGTCAATACGTGGATGCTTCTTCTCAAATAAGGATTGACCCTGAGGAGATGCAGGACTTGGTTATGCAGTCTAAAATTGCACAAGCATTGCCAGAATCTGAAATTACCGAGCAAGGAAATTCTTCTTGCAGATCTAGTGTGTCTGTTCAACAGCCAATCTCATCTTCAGTTTCAATGGCCTCACAACCTATATCTGGTCAAGTTATTGTGCCAAGTACTGCCGTTTCAGGTCAAGCTGAGTCTCCTGTTAAGCTTCAGTTTGGGTTGTTCTCAGGTCCTTCTCTCATACCATCTCCTGTACCAGCCATACAGATAGGTTCTATACAGATGCCTCTTCATCTGCATCCTCAGATTACCCAATCTATGACTCACATGCATTCATCACAGCCCCCTCTATTCCAGTTTGGACAGCTAAGGTATACATCTTCTGTCTCCCAAGGTGTACTGCCTTTGGCTCCTCAACCGCTGACATTTGTTCCGCCCACTGTTCAAACTGGTTTTCCTTTAAATAAAAACCCAGGAGAGGCTCTGTCCATTCATCCTTCTCAGGAAACCAGTGCTCATAATTCACGAAAAAATGATGTGCTGCCTTTTTTGATGGATAACCAACAAGGCATTGTGTCAAGATCTTTGAATGTGAACCCATCAGGGGAGTCAAAGTCATTACCATTAGCAGAAAGCATAGAAAGCAAAGTTATGACTCCACAGGATCAAACTGTAGGTTCATGCATTGATGAGAGCAATTCCAGGTCCGAACCAGGTTTTCAAGCAGAACATCAGAGGCACCGTGTTTCAACTTCAGATAATCATTATGTGGTATCTAGGGGAAAAGAATCTGAAGGTCGAGCTCAGGATGGGATGGGATCATTTGATTCTGTTTCAAGAGATAAGGGTTTGAGTGGGTTAAAAGCTCGTGGCCAGTTTCCTGGTGGAAGAGGGAAAAAGTATATCTTTACAGTAAAAAATTCTGGATCTAGATTGCCATTCCCAGGTTCTGAATCTACTCGCTTAGATACTGGTGGATTTCAGAGGCGGCCTAGGCGCAATATTCCTCGTACTGAGTTTCGTGTTCGAGAAACTGTGGATAAAAAATTGTCTAATAGCCAAGTTTCTTCAAACCATGTAGGGGTAGATGATAAGCCAACTGTTAGTGGAAGAACTGCGGTCAATTCTGCCAGAAATGGGACTAGGAAGGTTGTCATATCTAATAAGCCATCGAAAAGAGCATTAGAGTCTGAAGGATTAAGCTCTGGGGCGAGTACTTCTCTAGAGCTTGATGCTGGTAATAGATCTGAAAAGGGAGTGAAAAAAGAGTATTTGGGCAAGAGCCAGGGAAGCCAATATTCTGGAGAAGGTAACTTCAGAAAGAACATTTGTTCTGGGGAGGATGTTGATGCCCCTTTGCAGAGTGGAATCATACGTGTATTTGAGCAACCTGGCATAGAGGCTCCCAGTGATGAGGATGATTTCATTGAGGTGCGATCAAAACGGCAGATGCTGAATGATAGGCGTGAACAAAGAGAGAAGGAGATCAAGGCGAAGTCCCACAATTCAAAGGTTTATACGTTTACCTTCCTTTTCTTTTTTATTAATTTTTTTTTTGTTATTGCAAGTTTGTAGTTCAATAAATTTGACAAGCACTCTTTTAGATCCCACGGAAAAGTCGATCTACTTCGAAAAGTGCATTATCCTCAGTCACTTCAAGCAAAATGTATGCCACTAAGGAAGCAGAAACAGTAAAGAGAACACGATCTGATTTTGTTGCTCCAGATGGAGGAGGACGTGGATCAGGAAATATTGTGGTGTCAAGCGCATTTAGTTCTCCAGTAGTCTCTCAACCATTGGCCCCAATTGGGACTCCTGCTCTGAAATCTGATTCCCAGACCGAGAGATCACATACTGCTAGGTTGGTATTTATTATGAAGGAACTTACATGCATCATCTTTGACATCTGAGCAAATATTTTATATTATTTTGTAGGTCTATCCAGACGAGTGGCCCTGCATTGGCAACTAGTGACGGTAGAAATCTCGACTCAAGCATGATGTTTGATAAGAAGGATGATATTTTGGATAATGTTCAATCATCTTTTACTTCCTGGGGTAATTCACGTATAAATCAACAGGTACAGTGGTGGAAATGGTGACCTAGTTGAGTTTTATTTTGGTCATTGCATAATTATCTGGATGTTAATATGGGATTTTAATGTTGACCTTGTTGAGTGTTATTTGTTCACTGCTTGCTTTGATGGATCTCTATAGGCAATTTTAATATTGCTGTGTTACTGTGTTACTATGTTAGAGGACCTTTATACTATTGATGTATCATGTTTATTTCAGTTTGAGCGCAAGAAATGTTAAGCTGATTGTACCTACTACTTAGATCTATACTTCTAATGTGATTTTACTTGTTTATATGTCATAATTATCTGATAGAATTTTAAACTGCTTGGCAGGTTATGGCTTTGACGCAAACCCAACTTGATGAGGCTATGAAGCCTGCTCAGTTTGATTTACATCCTCCAGTAGGAGATCATTCTGGCTTAGCTGGTGATCCTAATGTGCCGTCACCATCTATCTTAGCAATGGATAGGTCATTTTCTTCTGCTGCTAATCCAATCAGTTCTCTGCTTGCTGGGGAGAAAATTCAGTTTGGTGAGTATCGGAGGAACTGCATGCATCAAAATCTGATTTTAGGAGTTCAATTTTGTTAATTCTAATTTTCTCTTTCCAATTATGTATGCAGGTGCAGTCACATCTCCAACAGTTCTTCCTCCTGGTAGCTGTTCCACTTTGCTTGGGATTGGTGCCCCCACTGGTCTCTGTCACTCGGACATCCCAATTTCTCGCAAACTTTCTGGTGCCGAGAATGATTGTCATATTTTCTTCGAGAAAGAGAAGCATCACTCTGAATCTTGTACTCATATTGAAGATAGTGAAGCTGAAGCTGAGGCAGCTGCTTCTGCTGTTGCTGTTGCAGCTATCAGTAGTGATGAGATGGTCACTAATGGGATTGGCACGTGCTCTGTTTCAGTTACTGATACCAATAATTTTGGTGGTGGGGATATTAACGTTATAACAGCAGGTAGTGTGGAAGGTGTGGTGCAATTACGATGATTAGTTTTACCTCTTATTTTGTTTGTATATTTGGTTACATGACTTCGATGGTCACTGTATTACAGGTTCAGCCGGTGATCAGCAATTAGCCTGCAAAACAAGGGCGGATGACTCTCTTACCGTAGCCCTTCCTGCAGATTTGTCTGTTGAGACTCCCCCAATTTCCCTGTGGCCAACTTTGCCAAGTCCACAAAATTCTTCAAGCCAGATGCTTTCACATTTCCCTGGTGGTTCGCCTTCCCAATTTCCCTTTTATGAGATAAATCCTATGTTGGGAGGTCCTGTCTTTACTTTTGGACCCCATGATGAGTCAGTGCCCACCACCCAAGCTCAAACACAAAAAAGCAGTGCACCAGCACCTGGCCCTCTTGGATCCTGGAAACAGTGTCATTCTGGTGTCGATTCTTTCTATGGGCCTCCTACTGGTTTTACTGGTCCGTTCATAAGTCCTGGAGGCATCCCAGGGGTTCAAGGTCCTCCGCACATGGTTGTATACAATCACTTTGCTCCTGTTGGACAGTTTGGGCAAGTTGGCTTGAGTTTCATGGGTGCTACGTATATTCCTTCTGGAAAACAGCATGACTGGAAGCATAGCCCTGGACCTTCTTCCTTGGGCGTTGAAGGGGATCAGAAAAATTTAAATATGGTTTCAGCTCAACGCATGCCCACCAACTTACCTCCAATCCAGCATCTTGCCCCTGGTTCGCCCCTGCTGCCGATGGCTTCTCCATTAGCTATGTTTGATGTCTCTCCATTTCAGGTACAGTTTGTTGATTTTATTACCCTTTAATCACCTTCTTGTTGTGTAGCTCCCGAGGGAGAACATTTTGTGTTACGGACTCTAGACATTATTTGGAAGAAATGTAAGGTCCAAGAACCAAGATTATATATATATTTCATAAACAGCATCTCTGGTCCTACATAATGAAAGTAGATGCATGGATTGGCAGTTGTGCTAATCAATTGCTTTTATTCCTTCTATAAAGGGTTCTTATGGTTTGCCGTTAAATATTTTTGCAGGCCTCTCCTGAAATGTCAGTTCAAGCTCGGTGGCCTTCTTCAGCATCCTCTGTTCAGCCTGTGCCTCTGTCCATGCCCATGCAGCAGCAGGCGGAAGGCATTCTTCCTTCTCATTTCAGTCATGCATCATCTTCTGATCCGACATTTACAGTTAACAGATTTCCTGGATCACAACCCTCTGTAGCCTCTGACCACAAGCGTAATTTTACTGTGGCAGCCGATGCAACCGTCACCCAACTTCCAGATGAACTTGGAATAGTTGAAGCTTCAAGTTGCGTCGGTTCTGGGGCTTCAGTGCCAAATGCCGACATTAACAGCTTATTGGTGAACTCAGTTACTGATGCTGGCAAGACTGGGGTTCAGAATTGCAGTAGCAGCAACAGTGGCCAGAATGCAGGCACTAATTTAAAATCTCAGTCGTCTCATCATAAGGGCGTATCCGCCCAGCAATACAGTCATTCTTCTGGATACAATTATCAGAGAGGTGGTGCTTCTCAAAAGAATAGTTCAGGTGGCAGCGAATGGCCCCACCGGAGAACAGGGTTCATGGGAAGAAACCAATCTGGAGCTGAAAAGAACTTTTCCTCTGCAAAAATGAAGCAAATTTATGTGGCCAAGCAACCATCGAACGGAAATCTCAGAGTATAG
mRNA sequence
ATGGCTAATCCTGGCGTCGGGACCAAGTTTGTGTCCGTGAATCTCAACAAATCGTATGGACAGGCTCATCATCATCATTCATCTCATTCGAATTCTTATGGATCAAATCGAACGCGACCTGGTGGTCATGGGGCCGGCGGAGGAATGGTGGTCCTGTCGAGGCCCCGCAGCTCCCAGAAACCTGGGCCGAAGCTTTCTGTTCCACCCCCCTTGAATCTGCCTTCTTTGCGAAAGGAGCATGAGAGACTTGATTCTTTGGGTTCAGGCACTGGGCCAACTGGTGGAGGGGTTTTGGGAAACGGACAGAGGCCAACTTCAGCTGGTATGGGTTGGACAAAGCCGCGCACCAGCGATTTGCCAGAGAAAGAAGGGGTTAGTGCTAATATAGTTGATAAAATTGATCCGTCTTTGCGAAGTGTTGATGGGGTGAGCGGTGGGAGTAGTGTGTATATGCCTCCTTCTGCTCGTGCTGGTACGACAGGCCCGATTGTGTCTACTTCTGCTTCCTCTCAGGTTCATACTGCAGTTGAAAAAGCCCCCGTTTTGAGAGGTGAGGATTTCCCTTCTTTGCAAGCAACTTTACCTTCTGCAGCTGCGCCTTCTCAGAAACAGAGAGATGGGTTGAGTTCTAAATTGAAGCATGCGGCTGAAGGTTCATATGAAGAGCAGAGGGATACTTCTCATTTAAGTTCAAGAATAGATGCCCGCTCAAAATTTCAGTCATCACAGAAAAGTATTCCCTGTGAAAATGCAAGAAATGGCAACTCTTACAGTTCTGGGAGTTTCCAGTCACCAGAGTTATCTCGGAAGCAGGAAGATATTTTCCCAGGTCCTTTACCACTCGTCTCAATGAATCCTAGATCAGACTGGGCTGATGATGAACGTGATACAAGCCATGGTTTAATTGACAGGGTTAGGGATCGAGGGCATCCAAAAAGTGAGGCTTATTGGGAGAGGGACTTTGACATGCCTCGGGTTAGTTCTCTTCCCCACAAGCCCACTCATAATTTTTCTCAGAGATGGAATCTGCGGGATGATGAATCTGGGAAGTTTCATTCTAGTGACATTCACAAAGTGGACTCTTATGGTCGGGATGTCAGGACGGCTGGTAGAGAAGGCTGGGAAGGAAACTTTCGGAAAAACAACCCTATACCAAAAGATGGATTTGGTTCAGACAGTGGTAATGATAGAAATGATATTGCAGGAAGGCCCACTAGCCTCGATCGAGAAACAAATGCTGATAACATGCATGTTTCACATTTTCGAGAACATGCTAATAAAGATGGGAGGAGAGATACTGGATTTGGACAGAATGGGCGGCAATCTTGGAATAGTGCAACAGAATCTTATAGCTCCCAGGAACCAGATCGGAATGTAAGAGACAAGTATGGTAGTGAGCAACACAGTAGGTACAGGGGCGAAACACATAATACTTCAGTTGCAAACTCATCATACTCTTCAGGTTTAAAACGAATTCCTGCCGATGAGCCATTGCTGAACTTTGGCAGGGATAGACGTTCGTTTGCAAAGATTGAGAAACCTTACATGGAAGATCCTTTTATGAAAGATTTTGGAGCCTCTAGTTTTGATGGACGAGATCCTTTTACTACTGGTCTTGTTGGGGTGGTTAAGAGGAAGAAGGATGTTATTAAGCAGATTGATTTTCATGACCCTGTTAGGGAATCTTTTGAGGCCGAACTTGAGAGAGTTCAACAGATCCAAGAGCAGGAGCGACAGCGAATTATTGAGGAGCAAGAAAGAGCTCTAGAACTAGCTAGGAGAGAAGAGGAAGAGAGACAGAGGCTTGCAAGGGAACATGAAGAAAGGCAGAGGAGAGCTGAAGAAGAAGCCAGAGAAGCAGCATGGAGAGCTGAGCAAGAGCGACTGGAGGCTATCCAAAAGGCTGAAGAACTTCGGATAGCTAGAGAGGAAGAAAAACAGAGGATTCTTCTGGAGGAAGAGAGAAGAAAGCAGGCTGCTAAGCTAAAACTTTTAGAATTAGAGGAAAGGATGGCCAAGAGGCAGGCTGAAGCTGTGAAATCAAGCAGTTTGACTTCAGATATTCCCGAAAAGAAGATTCCCAGTGTTGTTAAAGATGTTTCCAGGTTGGCGGACACAGTTGATTGGGAAGATGGTGAAAAGATGGTGGAGCGAATCACTACATCAGCTTCTTCTGAGTCATCTAGCATAATTAGGTCCTCTGAGGTGGGCCTTAGATCTCAATTTTCTAGAGATGGTTCTCCTTCCTTTGTGGACAGAGGCAAGTCTGTTAATTCATGGAGAAGAGATTTTTATGAGAGAGGAAGTGGCTCTCAATTTGTTCTACAGGATCAGAGTACTGGCTACAATGGCACAAGGCGGGAGGCATCAACTGGTGGGCGGGTATCATCAAGGAAAGAGCTTTATGGGGGAGCTGGATTTACGACTTCCAAGACATCTCATAGAAGAGGTATTACAGAGCCACAATCAGATGAATATTCTCAGCTAAGAGGGCAGAGACCTAACCTTTCTGGAGGTGGCGATCATTATAACAGGAGCCAAGAGTTTGACCCCGAATTTCAGGATAATGTTGAGAATTTTGGTGATCATGGATGGAGGCAGGAGAGTGGTCGCAACAACTTCTATTTTCCTTACCCTGAACGAGTAAATCCAATTTCTGAGACTGATGGGTCCTATTCTGTTGGAAGGTCACGCTATTCCCATAGGCAACCTCGTGTTCTTCCTCCTCCATCTGTAGCTTCTATACAGAAATCTTCTGTCAGGGGCGAATATGAATCTGTTCCCCGGGATATTGTAGAAAGTGAGATACAATATGACCATCCGGCAAGTAATATTTCTACTGCTCAGACAAGGTATATTCATCATGAAAACCGTGCACTTCCTGAGATAATTGATGTTAATTTAGAGAATGGTGAGAATGAGGAGCAGAAACCAGACGGTAACACAACACTGCGGTGTGACTCACAGTCAACCCTTTCTGTTTTTAGCCCCCCAACCTCTCCAACCCATCTATCTCATGAGGACTTGGATGATTCTGGAGATTCTCCTGTTTTATCAGCTAGCAGAGAAGGCACATTGTCAATAGAGGACAATGAATCTGCTGAACCAGCCAAGGTTGGGAAAGAGATCATGATTACCTCTGCTAGGGTATCTACAGGTGATGAAGATGAATGGGGTGTTGTAGATGAGCATGTGCAAGAACAGGAAGAATATGATGAGGATGATGATGGGTATCAGGAAGAAGACGAAGTTCATGAAGGAGAGGACGAGAACATTGACCTTGTACAAGATTTTGATGATTTGCACTTAGATGATAAAGGATCACCCCATATGTTAGATAACTTGGTATTAGGTTTTAATGAAGGCGTTGAAGTGGGGATGCCGAATGACGAGTTTGAAAGAATTCCAGGAAATGAGGAAAATATTTATGTCGCACCAGAAATTTCAAATGGCATCAGGGAAGAGCAGGGGTCTTCTGAAGGATTGCAAGTTGATGGTAAAATCTGTCAATACGTGGATGCTTCTTCTCAAATAAGGATTGACCCTGAGGAGATGCAGGACTTGGTTATGCAGTCTAAAATTGCACAAGCATTGCCAGAATCTGAAATTACCGAGCAAGGAAATTCTTCTTGCAGATCTAGTGTGTCTGTTCAACAGCCAATCTCATCTTCAGTTTCAATGGCCTCACAACCTATATCTGGTCAAGTTATTGTGCCAAGTACTGCCGTTTCAGGTCAAGCTGAGTCTCCTGTTAAGCTTCAGTTTGGGTTGTTCTCAGGTCCTTCTCTCATACCATCTCCTGTACCAGCCATACAGATAGGTTCTATACAGATGCCTCTTCATCTGCATCCTCAGATTACCCAATCTATGACTCACATGCATTCATCACAGCCCCCTCTATTCCAGTTTGGACAGCTAAGGTATACATCTTCTGTCTCCCAAGGTGTACTGCCTTTGGCTCCTCAACCGCTGACATTTGTTCCGCCCACTGTTCAAACTGGTTTTCCTTTAAATAAAAACCCAGGAGAGGCTCTGTCCATTCATCCTTCTCAGGAAACCAGTGCTCATAATTCACGAAAAAATGATGTGCTGCCTTTTTTGATGGATAACCAACAAGGCATTGTGTCAAGATCTTTGAATGTGAACCCATCAGGGGAGTCAAAGTCATTACCATTAGCAGAAAGCATAGAAAGCAAAGTTATGACTCCACAGGATCAAACTGTAGGTTCATGCATTGATGAGAGCAATTCCAGGTCCGAACCAGGTTTTCAAGCAGAACATCAGAGGCACCGTGTTTCAACTTCAGATAATCATTATGTGGTATCTAGGGGAAAAGAATCTGAAGGTCGAGCTCAGGATGGGATGGGATCATTTGATTCTGTTTCAAGAGATAAGGGTTTGAGTGGGTTAAAAGCTCGTGGCCAGTTTCCTGGTGGAAGAGGGAAAAAGTATATCTTTACAGTAAAAAATTCTGGATCTAGATTGCCATTCCCAGGTTCTGAATCTACTCGCTTAGATACTGGTGGATTTCAGAGGCGGCCTAGGCGCAATATTCCTCGTACTGAGTTTCGTGTTCGAGAAACTGTGGATAAAAAATTGTCTAATAGCCAAGTTTCTTCAAACCATGTAGGGGTAGATGATAAGCCAACTGTTAGTGGAAGAACTGCGGTCAATTCTGCCAGAAATGGGACTAGGAAGGTTGTCATATCTAATAAGCCATCGAAAAGAGCATTAGAGTCTGAAGGATTAAGCTCTGGGGCGAGTACTTCTCTAGAGCTTGATGCTGGTAATAGATCTGAAAAGGGAGTGAAAAAAGAGTATTTGGGCAAGAGCCAGGGAAGCCAATATTCTGGAGAAGGTAACTTCAGAAAGAACATTTGTTCTGGGGAGGATGTTGATGCCCCTTTGCAGAGTGGAATCATACGTGTATTTGAGCAACCTGGCATAGAGGCTCCCAGTGATGAGGATGATTTCATTGAGGTGCGATCAAAACGGCAGATGCTGAATGATAGGCGTGAACAAAGAGAGAAGGAGATCAAGGCGAAGTCCCACAATTCAAAGATCCCACGGAAAAGTCGATCTACTTCGAAAAGTGCATTATCCTCAGTCACTTCAAGCAAAATGTATGCCACTAAGGAAGCAGAAACAGTAAAGAGAACACGATCTGATTTTGTTGCTCCAGATGGAGGAGGACGTGGATCAGGAAATATTGTGGTGTCAAGCGCATTTAGTTCTCCAGTAGTCTCTCAACCATTGGCCCCAATTGGGACTCCTGCTCTGAAATCTGATTCCCAGACCGAGAGATCACATACTGCTAGGTCTATCCAGACGAGTGGCCCTGCATTGGCAACTAGTGACGGTAGAAATCTCGACTCAAGCATGATGTTTGATAAGAAGGATGATATTTTGGATAATGTTCAATCATCTTTTACTTCCTGGGGTAATTCACGTATAAATCAACAGGTTATGGCTTTGACGCAAACCCAACTTGATGAGGCTATGAAGCCTGCTCAGTTTGATTTACATCCTCCAGTAGGAGATCATTCTGGCTTAGCTGGTGATCCTAATGTGCCGTCACCATCTATCTTAGCAATGGATAGGTCATTTTCTTCTGCTGCTAATCCAATCAGTTCTCTGCTTGCTGGGGAGAAAATTCAGTTTGGTGCAGTCACATCTCCAACAGTTCTTCCTCCTGGTAGCTGTTCCACTTTGCTTGGGATTGGTGCCCCCACTGGTCTCTGTCACTCGGACATCCCAATTTCTCGCAAACTTTCTGGTGCCGAGAATGATTGTCATATTTTCTTCGAGAAAGAGAAGCATCACTCTGAATCTTGTACTCATATTGAAGATAGTGAAGCTGAAGCTGAGGCAGCTGCTTCTGCTGTTGCTGTTGCAGCTATCAGTAGTGATGAGATGGTCACTAATGGGATTGGCACGTGCTCTGTTTCAGTTACTGATACCAATAATTTTGGTGGTGGGGATATTAACGTTATAACAGCAGGTAGTGTGGAAGGTTCAGCCGGTGATCAGCAATTAGCCTGCAAAACAAGGGCGGATGACTCTCTTACCGTAGCCCTTCCTGCAGATTTGTCTGTTGAGACTCCCCCAATTTCCCTGTGGCCAACTTTGCCAAGTCCACAAAATTCTTCAAGCCAGATGCTTTCACATTTCCCTGGTGGTTCGCCTTCCCAATTTCCCTTTTATGAGATAAATCCTATGTTGGGAGGTCCTGTCTTTACTTTTGGACCCCATGATGAGTCAGTGCCCACCACCCAAGCTCAAACACAAAAAAGCAGTGCACCAGCACCTGGCCCTCTTGGATCCTGGAAACAGTGTCATTCTGGTGTCGATTCTTTCTATGGGCCTCCTACTGGTTTTACTGGTCCGTTCATAAGTCCTGGAGGCATCCCAGGGGTTCAAGGTCCTCCGCACATGGTTGTATACAATCACTTTGCTCCTGTTGGACAGTTTGGGCAAGTTGGCTTGAGTTTCATGGGTGCTACGTATATTCCTTCTGGAAAACAGCATGACTGGAAGCATAGCCCTGGACCTTCTTCCTTGGGCGTTGAAGGGGATCAGAAAAATTTAAATATGGTTTCAGCTCAACGCATGCCCACCAACTTACCTCCAATCCAGCATCTTGCCCCTGGTTCGCCCCTGCTGCCGATGGCTTCTCCATTAGCTATGTTTGATGTCTCTCCATTTCAGGCCTCTCCTGAAATGTCAGTTCAAGCTCGGTGGCCTTCTTCAGCATCCTCTGTTCAGCCTGTGCCTCTGTCCATGCCCATGCAGCAGCAGGCGGAAGGCATTCTTCCTTCTCATTTCAGTCATGCATCATCTTCTGATCCGACATTTACAGTTAACAGATTTCCTGGATCACAACCCTCTGTAGCCTCTGACCACAAGCGTAATTTTACTGTGGCAGCCGATGCAACCGTCACCCAACTTCCAGATGAACTTGGAATAGTTGAAGCTTCAAGTTGCGTCGGTTCTGGGGCTTCAGTGCCAAATGCCGACATTAACAGCTTATTGGTGAACTCAGTTACTGATGCTGGCAAGACTGGGGTTCAGAATTGCAGTAGCAGCAACAGTGGCCAGAATGCAGGCACTAATTTAAAATCTCAGTCGTCTCATCATAAGGGCGTATCCGCCCAGCAATACAGTCATTCTTCTGGATACAATTATCAGAGAGGTGGTGCTTCTCAAAAGAATAGTTCAGGTGGCAGCGAATGGCCCCACCGGAGAACAGGGTTCATGGGAAGAAACCAATCTGGAGCTGAAAAGAACTTTTCCTCTGCAAAAATGAAGCAAATTTATGTGGCCAAGCAACCATCGAACGGAAATCTCAGAGTATAG
Coding sequence (CDS)
ATGGCTAATCCTGGCGTCGGGACCAAGTTTGTGTCCGTGAATCTCAACAAATCGTATGGACAGGCTCATCATCATCATTCATCTCATTCGAATTCTTATGGATCAAATCGAACGCGACCTGGTGGTCATGGGGCCGGCGGAGGAATGGTGGTCCTGTCGAGGCCCCGCAGCTCCCAGAAACCTGGGCCGAAGCTTTCTGTTCCACCCCCCTTGAATCTGCCTTCTTTGCGAAAGGAGCATGAGAGACTTGATTCTTTGGGTTCAGGCACTGGGCCAACTGGTGGAGGGGTTTTGGGAAACGGACAGAGGCCAACTTCAGCTGGTATGGGTTGGACAAAGCCGCGCACCAGCGATTTGCCAGAGAAAGAAGGGGTTAGTGCTAATATAGTTGATAAAATTGATCCGTCTTTGCGAAGTGTTGATGGGGTGAGCGGTGGGAGTAGTGTGTATATGCCTCCTTCTGCTCGTGCTGGTACGACAGGCCCGATTGTGTCTACTTCTGCTTCCTCTCAGGTTCATACTGCAGTTGAAAAAGCCCCCGTTTTGAGAGGTGAGGATTTCCCTTCTTTGCAAGCAACTTTACCTTCTGCAGCTGCGCCTTCTCAGAAACAGAGAGATGGGTTGAGTTCTAAATTGAAGCATGCGGCTGAAGGTTCATATGAAGAGCAGAGGGATACTTCTCATTTAAGTTCAAGAATAGATGCCCGCTCAAAATTTCAGTCATCACAGAAAAGTATTCCCTGTGAAAATGCAAGAAATGGCAACTCTTACAGTTCTGGGAGTTTCCAGTCACCAGAGTTATCTCGGAAGCAGGAAGATATTTTCCCAGGTCCTTTACCACTCGTCTCAATGAATCCTAGATCAGACTGGGCTGATGATGAACGTGATACAAGCCATGGTTTAATTGACAGGGTTAGGGATCGAGGGCATCCAAAAAGTGAGGCTTATTGGGAGAGGGACTTTGACATGCCTCGGGTTAGTTCTCTTCCCCACAAGCCCACTCATAATTTTTCTCAGAGATGGAATCTGCGGGATGATGAATCTGGGAAGTTTCATTCTAGTGACATTCACAAAGTGGACTCTTATGGTCGGGATGTCAGGACGGCTGGTAGAGAAGGCTGGGAAGGAAACTTTCGGAAAAACAACCCTATACCAAAAGATGGATTTGGTTCAGACAGTGGTAATGATAGAAATGATATTGCAGGAAGGCCCACTAGCCTCGATCGAGAAACAAATGCTGATAACATGCATGTTTCACATTTTCGAGAACATGCTAATAAAGATGGGAGGAGAGATACTGGATTTGGACAGAATGGGCGGCAATCTTGGAATAGTGCAACAGAATCTTATAGCTCCCAGGAACCAGATCGGAATGTAAGAGACAAGTATGGTAGTGAGCAACACAGTAGGTACAGGGGCGAAACACATAATACTTCAGTTGCAAACTCATCATACTCTTCAGGTTTAAAACGAATTCCTGCCGATGAGCCATTGCTGAACTTTGGCAGGGATAGACGTTCGTTTGCAAAGATTGAGAAACCTTACATGGAAGATCCTTTTATGAAAGATTTTGGAGCCTCTAGTTTTGATGGACGAGATCCTTTTACTACTGGTCTTGTTGGGGTGGTTAAGAGGAAGAAGGATGTTATTAAGCAGATTGATTTTCATGACCCTGTTAGGGAATCTTTTGAGGCCGAACTTGAGAGAGTTCAACAGATCCAAGAGCAGGAGCGACAGCGAATTATTGAGGAGCAAGAAAGAGCTCTAGAACTAGCTAGGAGAGAAGAGGAAGAGAGACAGAGGCTTGCAAGGGAACATGAAGAAAGGCAGAGGAGAGCTGAAGAAGAAGCCAGAGAAGCAGCATGGAGAGCTGAGCAAGAGCGACTGGAGGCTATCCAAAAGGCTGAAGAACTTCGGATAGCTAGAGAGGAAGAAAAACAGAGGATTCTTCTGGAGGAAGAGAGAAGAAAGCAGGCTGCTAAGCTAAAACTTTTAGAATTAGAGGAAAGGATGGCCAAGAGGCAGGCTGAAGCTGTGAAATCAAGCAGTTTGACTTCAGATATTCCCGAAAAGAAGATTCCCAGTGTTGTTAAAGATGTTTCCAGGTTGGCGGACACAGTTGATTGGGAAGATGGTGAAAAGATGGTGGAGCGAATCACTACATCAGCTTCTTCTGAGTCATCTAGCATAATTAGGTCCTCTGAGGTGGGCCTTAGATCTCAATTTTCTAGAGATGGTTCTCCTTCCTTTGTGGACAGAGGCAAGTCTGTTAATTCATGGAGAAGAGATTTTTATGAGAGAGGAAGTGGCTCTCAATTTGTTCTACAGGATCAGAGTACTGGCTACAATGGCACAAGGCGGGAGGCATCAACTGGTGGGCGGGTATCATCAAGGAAAGAGCTTTATGGGGGAGCTGGATTTACGACTTCCAAGACATCTCATAGAAGAGGTATTACAGAGCCACAATCAGATGAATATTCTCAGCTAAGAGGGCAGAGACCTAACCTTTCTGGAGGTGGCGATCATTATAACAGGAGCCAAGAGTTTGACCCCGAATTTCAGGATAATGTTGAGAATTTTGGTGATCATGGATGGAGGCAGGAGAGTGGTCGCAACAACTTCTATTTTCCTTACCCTGAACGAGTAAATCCAATTTCTGAGACTGATGGGTCCTATTCTGTTGGAAGGTCACGCTATTCCCATAGGCAACCTCGTGTTCTTCCTCCTCCATCTGTAGCTTCTATACAGAAATCTTCTGTCAGGGGCGAATATGAATCTGTTCCCCGGGATATTGTAGAAAGTGAGATACAATATGACCATCCGGCAAGTAATATTTCTACTGCTCAGACAAGGTATATTCATCATGAAAACCGTGCACTTCCTGAGATAATTGATGTTAATTTAGAGAATGGTGAGAATGAGGAGCAGAAACCAGACGGTAACACAACACTGCGGTGTGACTCACAGTCAACCCTTTCTGTTTTTAGCCCCCCAACCTCTCCAACCCATCTATCTCATGAGGACTTGGATGATTCTGGAGATTCTCCTGTTTTATCAGCTAGCAGAGAAGGCACATTGTCAATAGAGGACAATGAATCTGCTGAACCAGCCAAGGTTGGGAAAGAGATCATGATTACCTCTGCTAGGGTATCTACAGGTGATGAAGATGAATGGGGTGTTGTAGATGAGCATGTGCAAGAACAGGAAGAATATGATGAGGATGATGATGGGTATCAGGAAGAAGACGAAGTTCATGAAGGAGAGGACGAGAACATTGACCTTGTACAAGATTTTGATGATTTGCACTTAGATGATAAAGGATCACCCCATATGTTAGATAACTTGGTATTAGGTTTTAATGAAGGCGTTGAAGTGGGGATGCCGAATGACGAGTTTGAAAGAATTCCAGGAAATGAGGAAAATATTTATGTCGCACCAGAAATTTCAAATGGCATCAGGGAAGAGCAGGGGTCTTCTGAAGGATTGCAAGTTGATGGTAAAATCTGTCAATACGTGGATGCTTCTTCTCAAATAAGGATTGACCCTGAGGAGATGCAGGACTTGGTTATGCAGTCTAAAATTGCACAAGCATTGCCAGAATCTGAAATTACCGAGCAAGGAAATTCTTCTTGCAGATCTAGTGTGTCTGTTCAACAGCCAATCTCATCTTCAGTTTCAATGGCCTCACAACCTATATCTGGTCAAGTTATTGTGCCAAGTACTGCCGTTTCAGGTCAAGCTGAGTCTCCTGTTAAGCTTCAGTTTGGGTTGTTCTCAGGTCCTTCTCTCATACCATCTCCTGTACCAGCCATACAGATAGGTTCTATACAGATGCCTCTTCATCTGCATCCTCAGATTACCCAATCTATGACTCACATGCATTCATCACAGCCCCCTCTATTCCAGTTTGGACAGCTAAGGTATACATCTTCTGTCTCCCAAGGTGTACTGCCTTTGGCTCCTCAACCGCTGACATTTGTTCCGCCCACTGTTCAAACTGGTTTTCCTTTAAATAAAAACCCAGGAGAGGCTCTGTCCATTCATCCTTCTCAGGAAACCAGTGCTCATAATTCACGAAAAAATGATGTGCTGCCTTTTTTGATGGATAACCAACAAGGCATTGTGTCAAGATCTTTGAATGTGAACCCATCAGGGGAGTCAAAGTCATTACCATTAGCAGAAAGCATAGAAAGCAAAGTTATGACTCCACAGGATCAAACTGTAGGTTCATGCATTGATGAGAGCAATTCCAGGTCCGAACCAGGTTTTCAAGCAGAACATCAGAGGCACCGTGTTTCAACTTCAGATAATCATTATGTGGTATCTAGGGGAAAAGAATCTGAAGGTCGAGCTCAGGATGGGATGGGATCATTTGATTCTGTTTCAAGAGATAAGGGTTTGAGTGGGTTAAAAGCTCGTGGCCAGTTTCCTGGTGGAAGAGGGAAAAAGTATATCTTTACAGTAAAAAATTCTGGATCTAGATTGCCATTCCCAGGTTCTGAATCTACTCGCTTAGATACTGGTGGATTTCAGAGGCGGCCTAGGCGCAATATTCCTCGTACTGAGTTTCGTGTTCGAGAAACTGTGGATAAAAAATTGTCTAATAGCCAAGTTTCTTCAAACCATGTAGGGGTAGATGATAAGCCAACTGTTAGTGGAAGAACTGCGGTCAATTCTGCCAGAAATGGGACTAGGAAGGTTGTCATATCTAATAAGCCATCGAAAAGAGCATTAGAGTCTGAAGGATTAAGCTCTGGGGCGAGTACTTCTCTAGAGCTTGATGCTGGTAATAGATCTGAAAAGGGAGTGAAAAAAGAGTATTTGGGCAAGAGCCAGGGAAGCCAATATTCTGGAGAAGGTAACTTCAGAAAGAACATTTGTTCTGGGGAGGATGTTGATGCCCCTTTGCAGAGTGGAATCATACGTGTATTTGAGCAACCTGGCATAGAGGCTCCCAGTGATGAGGATGATTTCATTGAGGTGCGATCAAAACGGCAGATGCTGAATGATAGGCGTGAACAAAGAGAGAAGGAGATCAAGGCGAAGTCCCACAATTCAAAGATCCCACGGAAAAGTCGATCTACTTCGAAAAGTGCATTATCCTCAGTCACTTCAAGCAAAATGTATGCCACTAAGGAAGCAGAAACAGTAAAGAGAACACGATCTGATTTTGTTGCTCCAGATGGAGGAGGACGTGGATCAGGAAATATTGTGGTGTCAAGCGCATTTAGTTCTCCAGTAGTCTCTCAACCATTGGCCCCAATTGGGACTCCTGCTCTGAAATCTGATTCCCAGACCGAGAGATCACATACTGCTAGGTCTATCCAGACGAGTGGCCCTGCATTGGCAACTAGTGACGGTAGAAATCTCGACTCAAGCATGATGTTTGATAAGAAGGATGATATTTTGGATAATGTTCAATCATCTTTTACTTCCTGGGGTAATTCACGTATAAATCAACAGGTTATGGCTTTGACGCAAACCCAACTTGATGAGGCTATGAAGCCTGCTCAGTTTGATTTACATCCTCCAGTAGGAGATCATTCTGGCTTAGCTGGTGATCCTAATGTGCCGTCACCATCTATCTTAGCAATGGATAGGTCATTTTCTTCTGCTGCTAATCCAATCAGTTCTCTGCTTGCTGGGGAGAAAATTCAGTTTGGTGCAGTCACATCTCCAACAGTTCTTCCTCCTGGTAGCTGTTCCACTTTGCTTGGGATTGGTGCCCCCACTGGTCTCTGTCACTCGGACATCCCAATTTCTCGCAAACTTTCTGGTGCCGAGAATGATTGTCATATTTTCTTCGAGAAAGAGAAGCATCACTCTGAATCTTGTACTCATATTGAAGATAGTGAAGCTGAAGCTGAGGCAGCTGCTTCTGCTGTTGCTGTTGCAGCTATCAGTAGTGATGAGATGGTCACTAATGGGATTGGCACGTGCTCTGTTTCAGTTACTGATACCAATAATTTTGGTGGTGGGGATATTAACGTTATAACAGCAGGTAGTGTGGAAGGTTCAGCCGGTGATCAGCAATTAGCCTGCAAAACAAGGGCGGATGACTCTCTTACCGTAGCCCTTCCTGCAGATTTGTCTGTTGAGACTCCCCCAATTTCCCTGTGGCCAACTTTGCCAAGTCCACAAAATTCTTCAAGCCAGATGCTTTCACATTTCCCTGGTGGTTCGCCTTCCCAATTTCCCTTTTATGAGATAAATCCTATGTTGGGAGGTCCTGTCTTTACTTTTGGACCCCATGATGAGTCAGTGCCCACCACCCAAGCTCAAACACAAAAAAGCAGTGCACCAGCACCTGGCCCTCTTGGATCCTGGAAACAGTGTCATTCTGGTGTCGATTCTTTCTATGGGCCTCCTACTGGTTTTACTGGTCCGTTCATAAGTCCTGGAGGCATCCCAGGGGTTCAAGGTCCTCCGCACATGGTTGTATACAATCACTTTGCTCCTGTTGGACAGTTTGGGCAAGTTGGCTTGAGTTTCATGGGTGCTACGTATATTCCTTCTGGAAAACAGCATGACTGGAAGCATAGCCCTGGACCTTCTTCCTTGGGCGTTGAAGGGGATCAGAAAAATTTAAATATGGTTTCAGCTCAACGCATGCCCACCAACTTACCTCCAATCCAGCATCTTGCCCCTGGTTCGCCCCTGCTGCCGATGGCTTCTCCATTAGCTATGTTTGATGTCTCTCCATTTCAGGCCTCTCCTGAAATGTCAGTTCAAGCTCGGTGGCCTTCTTCAGCATCCTCTGTTCAGCCTGTGCCTCTGTCCATGCCCATGCAGCAGCAGGCGGAAGGCATTCTTCCTTCTCATTTCAGTCATGCATCATCTTCTGATCCGACATTTACAGTTAACAGATTTCCTGGATCACAACCCTCTGTAGCCTCTGACCACAAGCGTAATTTTACTGTGGCAGCCGATGCAACCGTCACCCAACTTCCAGATGAACTTGGAATAGTTGAAGCTTCAAGTTGCGTCGGTTCTGGGGCTTCAGTGCCAAATGCCGACATTAACAGCTTATTGGTGAACTCAGTTACTGATGCTGGCAAGACTGGGGTTCAGAATTGCAGTAGCAGCAACAGTGGCCAGAATGCAGGCACTAATTTAAAATCTCAGTCGTCTCATCATAAGGGCGTATCCGCCCAGCAATACAGTCATTCTTCTGGATACAATTATCAGAGAGGTGGTGCTTCTCAAAAGAATAGTTCAGGTGGCAGCGAATGGCCCCACCGGAGAACAGGGTTCATGGGAAGAAACCAATCTGGAGCTGAAAAGAACTTTTCCTCTGCAAAAATGAAGCAAATTTATGTGGCCAAGCAACCATCGAACGGAAATCTCAGAGTATAG
Protein sequence
MANPGVGTKFVSVNLNKSYGQAHHHHSSHSNSYGSNRTRPGGHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPRTSDLPEKEGVSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGTTGPIVSTSASSQVHTAVEKAPVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEGSYEEQRDTSHLSSRIDARSKFQSSQKSIPCENARNGNSYSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERDTSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSDIHKVDSYGRDVRTAGREGWEGNFRKNNPIPKDGFGSDSGNDRNDIAGRPTSLDRETNADNMHVSHFREHANKDGRRDTGFGQNGRQSWNSATESYSSQEPDRNVRDKYGSEQHSRYRGETHNTSVANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDPFTTGLVGVVKRKKDVIKQIDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREEEERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRILLEEERRKQAAKLKLLELEERMAKRQAEAVKSSSLTSDIPEKKIPSVVKDVSRLADTVDWEDGEKMVERITTSASSESSSIIRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQFVLQDQSTGYNGTRREASTGGRVSSRKELYGGAGFTTSKTSHRRGITEPQSDEYSQLRGQRPNLSGGGDHYNRSQEFDPEFQDNVENFGDHGWRQESGRNNFYFPYPERVNPISETDGSYSVGRSRYSHRQPRVLPPPSVASIQKSSVRGEYESVPRDIVESEIQYDHPASNISTAQTRYIHHENRALPEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSASREGTLSIEDNESAEPAKVGKEIMITSARVSTGDEDEWGVVDEHVQEQEEYDEDDDGYQEEDEVHEGEDENIDLVQDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERIPGNEENIYVAPEISNGIREEQGSSEGLQVDGKICQYVDASSQIRIDPEEMQDLVMQSKIAQALPESEITEQGNSSCRSSVSVQQPISSSVSMASQPISGQVIVPSTAVSGQAESPVKLQFGLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVSQGVLPLAPQPLTFVPPTVQTGFPLNKNPGEALSIHPSQETSAHNSRKNDVLPFLMDNQQGIVSRSLNVNPSGESKSLPLAESIESKVMTPQDQTVGSCIDESNSRSEPGFQAEHQRHRVSTSDNHYVVSRGKESEGRAQDGMGSFDSVSRDKGLSGLKARGQFPGGRGKKYIFTVKNSGSRLPFPGSESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSNSQVSSNHVGVDDKPTVSGRTAVNSARNGTRKVVISNKPSKRALESEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQGSQYSGEGNFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREKEIKAKSHNSKIPRKSRSTSKSALSSVTSSKMYATKEAETVKRTRSDFVAPDGGGRGSGNIVVSSAFSSPVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLDSSMMFDKKDDILDNVQSSFTSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSGLAGDPNVPSPSILAMDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGAPTGLCHSDIPISRKLSGAENDCHIFFEKEKHHSESCTHIEDSEAEAEAAASAVAVAAISSDEMVTNGIGTCSVSVTDTNNFGGGDINVITAGSVEGSAGDQQLACKTRADDSLTVALPADLSVETPPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQAQTQKSSAPAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPGSPLLPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPMQQQAEGILPSHFSHASSSDPTFTVNRFPGSQPSVASDHKRNFTVAADATVTQLPDELGIVEASSCVGSGASVPNADINSLLVNSVTDAGKTGVQNCSSSNSGQNAGTNLKSQSSHHKGVSAQQYSHSSGYNYQRGGASQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSNGNLRV
Homology
BLAST of HG10016734 vs. NCBI nr
Match:
XP_038883483.1 (uncharacterized protein LOC120074436 [Benincasa hispida])
HSP 1 Score: 4417.8 bits (11457), Expect = 0.0e+00
Identity = 2318/2453 (94.50%), Postives = 2363/2453 (96.33%), Query Frame = 0
Query: 1 MANPGVGTKFVSVNLNKSYGQA-HHHHSSHSNSYGSNRTRPGGHGAGGGMVVLSRPRSSQ 60
MANPGVGTKFVSVNLNKSYGQA HHHHSSHSNSYGSNRTRPGGHGAGGGMVVLSRPRSSQ
Sbjct: 1 MANPGVGTKFVSVNLNKSYGQAHHHHHSSHSNSYGSNRTRPGGHGAGGGMVVLSRPRSSQ 60
Query: 61 KPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPRTSDL 120
KPGPKLSVPPPLNLPSLRKEHERLDSLGSG GPTGGGVLGNGQRPTSAGMGWTKPRT+DL
Sbjct: 61 KPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPTGGGVLGNGQRPTSAGMGWTKPRTNDL 120
Query: 121 PEKEGVSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGTTGPIVSTSASSQVHTAVEKA 180
PEKEG+SANIVDKIDPSLRSVDGVSGGSSVYMPPSARAG TGP+VSTSASSQV TAVEKA
Sbjct: 121 PEKEGLSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGMTGPVVSTSASSQVLTAVEKA 180
Query: 181 PVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEGSYEEQRDTSHLSSRIDARSKF 240
PVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHA EG YEEQRDTSHLSSRIDA SKF
Sbjct: 181 PVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAPEGLYEEQRDTSHLSSRIDAHSKF 240
Query: 241 QSSQKSIPCENARNGNSYSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERDTSH 300
QSSQ+SIP ENA+NGNS+ SGS QSPELS KQ+DIFPGPLPLVSMNPRSDWADDERDTSH
Sbjct: 241 QSSQESIPSENAKNGNSFGSGSLQSPELSWKQDDIFPGPLPLVSMNPRSDWADDERDTSH 300
Query: 301 GLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSDIHKV 360
GLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHK THNFSQRWNLRDDESGKFHSSDIHK+
Sbjct: 301 GLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKHTHNFSQRWNLRDDESGKFHSSDIHKL 360
Query: 361 DSYGRDVRTAGREGWEGNFRKNNPIPKDGFGSDSGNDRNDIAGRPTSLDRETNADNMHVS 420
D YGRD RTA REGWEGNFR+NNPIPKDGFGSDSGNDRNDIAGRPTS+DRETNADNMHVS
Sbjct: 361 DPYGRDARTASREGWEGNFRRNNPIPKDGFGSDSGNDRNDIAGRPTSIDRETNADNMHVS 420
Query: 421 HFREHANKDGRRDTGFGQNGRQSWNSATESYSSQEPDRNVRDKYGSEQHSRYRGETHNTS 480
HFREH NKDGRRDTGFGQNGRQ+WNSATESYSSQEPDR VRDKY SEQH+RYRGETHNTS
Sbjct: 421 HFREHVNKDGRRDTGFGQNGRQTWNSATESYSSQEPDRTVRDKYVSEQHNRYRGETHNTS 480
Query: 481 VANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDPFTTGL 540
VANSSYS+ LKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDPFT GL
Sbjct: 481 VANSSYSTSLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDPFTAGL 540
Query: 541 VGVVKRKKDVIKQIDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREEEER 600
VGVVKRKKDVIKQ DFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREEEER
Sbjct: 541 VGVVKRKKDVIKQTDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREEEER 600
Query: 601 QRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRILLEEERRKQA 660
QRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELR+AREEEKQRILLEEERRKQA
Sbjct: 601 QRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRMAREEEKQRILLEEERRKQA 660
Query: 661 AKLKLLELEERMAKRQAEAVKSSSLTSDIPEKKIPSVVKDVSRLADTVDWEDGEKMVERI 720
AKLKLLELEERMAKRQAE VKSS+LTSDIPEKKIPSVVKDVSRLADTVDWEDGEKMVERI
Sbjct: 661 AKLKLLELEERMAKRQAEVVKSSTLTSDIPEKKIPSVVKDVSRLADTVDWEDGEKMVERI 720
Query: 721 TTSASSESSSIIRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQFVLQDQS 780
TTSASSESSSI RSSEVG RSQFS DGSPSFVDRGKS+NSWRRDFYERGSGSQFVLQDQS
Sbjct: 721 TTSASSESSSINRSSEVGFRSQFSTDGSPSFVDRGKSINSWRRDFYERGSGSQFVLQDQS 780
Query: 781 TGY-NGTRREASTGGRVSSRKELYGGAGFTTSKTSHRRGITEPQSDEYSQLRGQRPNLSG 840
TGY NG RREASTGGRVSSRKE YGGAGFTTS+TSHRRGITEPQSDEYSQLRGQRPNLSG
Sbjct: 781 TGYNNGPRREASTGGRVSSRKEFYGGAGFTTSRTSHRRGITEPQSDEYSQLRGQRPNLSG 840
Query: 841 GGDHYNRSQEFDPEFQDNVENFGDHGWRQESGRNNFYFPYPERVNPISETDGSYSVGRSR 900
GGDHYNRSQEFD EFQDNVEN+GDHGWRQESGRNNFYFPYPERVNPISE DGSYSVGRSR
Sbjct: 841 GGDHYNRSQEFDSEFQDNVENYGDHGWRQESGRNNFYFPYPERVNPISEADGSYSVGRSR 900
Query: 901 YSHRQPRVLPPPSVASIQKSSVRGEYESVPRDIVESEIQYDHPASNISTAQTRYIHHENR 960
YS RQPRVLPPPSVAS+QKSSVRGEYESVPRDIVESEIQYDHPA NIST+QTRYIHH+NR
Sbjct: 901 YSQRQPRVLPPPSVASVQKSSVRGEYESVPRDIVESEIQYDHPAHNISTSQTRYIHHDNR 960
Query: 961 ALPEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVL 1020
ALPEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVL
Sbjct: 961 ALPEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVL 1020
Query: 1021 SASREGTLSIEDNESAEPAKVGKEIMITSARVSTGDEDEWGVVDEHVQEQEEYDEDDDGY 1080
SASREGTLSIEDNESA PAK GKEIMITS RVSTGDEDEWGVVDEHVQEQEEYDEDDDGY
Sbjct: 1021 SASREGTLSIEDNESAVPAKSGKEIMITSTRVSTGDEDEWGVVDEHVQEQEEYDEDDDGY 1080
Query: 1081 QEEDEVHEGEDENIDLVQDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERIPGN 1140
QEEDEVHEGEDENIDLV+DFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERIPGN
Sbjct: 1081 QEEDEVHEGEDENIDLVEDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERIPGN 1140
Query: 1141 EENIYVAPEISNGIREEQGSSEGLQVDGKICQYVDASSQIRIDPEEMQDLVMQSKIAQAL 1200
+EN+YVAPEISNGI+EEQGSSEGL VDGK+CQY DASSQIRIDPEEMQDLVMQ AQAL
Sbjct: 1141 DENMYVAPEISNGIKEEQGSSEGLPVDGKVCQYADASSQIRIDPEEMQDLVMQPITAQAL 1200
Query: 1201 PESEITEQGNSSCRSSVSVQQPISSSVSMASQPISGQVIVPSTAVSGQAESPVKLQFGLF 1260
PESEITEQGNSSCRSS SVQQP MASQ ISGQVIVP+TAVSGQAE PVKLQFGLF
Sbjct: 1201 PESEITEQGNSSCRSSASVQQP------MASQSISGQVIVPNTAVSGQAEPPVKLQFGLF 1260
Query: 1261 SGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVSQGVLP 1320
SGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQ PLFQFGQLRYTSSVSQGVLP
Sbjct: 1261 SGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQTPLFQFGQLRYTSSVSQGVLP 1320
Query: 1321 LAPQPLTFVPPTVQTGFPLNKNPGEALSIHPSQETSAHNSRKNDVLPFLMDNQQGIVSRS 1380
LAPQPLTFVPPTVQTGFPLNKNPG+ALSIHPSQET HNSRKNDVLPFLMDNQQG+VSRS
Sbjct: 1321 LAPQPLTFVPPTVQTGFPLNKNPGDALSIHPSQETCVHNSRKNDVLPFLMDNQQGLVSRS 1380
Query: 1381 LNVNPSGESKSLPLAESIESKVMTPQDQTVGSCIDESNSRSEPGFQAEHQRHRVSTSDNH 1440
LNVNPS ESKSLPL ES ESK+MTPQDQT GSCIDESNSRSEPGFQAEHQRHRVSTSDN
Sbjct: 1381 LNVNPSMESKSLPLTESTESKLMTPQDQTAGSCIDESNSRSEPGFQAEHQRHRVSTSDNQ 1440
Query: 1441 YVVSRGKESEGRAQDGMGSFDSVSRDKGLSGLKARGQFPGGRGKKYIFTVKNSGSRLPFP 1500
YVVSRGKESEG+ QDGMGSFDSVSRDKGLSGLKARGQF GGRGKKYIFTVKNSGSRLPFP
Sbjct: 1441 YVVSRGKESEGQGQDGMGSFDSVSRDKGLSGLKARGQFHGGRGKKYIFTVKNSGSRLPFP 1500
Query: 1501 GSESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSNSQVSSNHVGVDDKPTVSGRTAVN 1560
GSESTRLDTGGFQRR RRNIPRTEFRVRETVDKKLSNSQVSSNHVGVDDKPTVSGRT V+
Sbjct: 1501 GSESTRLDTGGFQRRTRRNIPRTEFRVRETVDKKLSNSQVSSNHVGVDDKPTVSGRTVVH 1560
Query: 1561 SARNGTRKVVISNKPSKRALESEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQGSQYSG 1620
SARNGTRKVVISNKPSKRALESEGLSSGASTSLELDAGNRS KGVKKEYLGKSQGSQY G
Sbjct: 1561 SARNGTRKVVISNKPSKRALESEGLSSGASTSLELDAGNRSAKGVKKEYLGKSQGSQYPG 1620
Query: 1621 EGNFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREKE 1680
EGNFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREKE
Sbjct: 1621 EGNFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREKE 1680
Query: 1681 IKAKSHNSKIPRKSRSTSKSALSSVTSSKMYATKEAETVKRTRSDFVAPDGGGRGSGNIV 1740
IKAKSHNSKIPRKSRSTSK+ALSSV SSK+YA KEAE VKRTRSDFVA DGGGRGSGNIV
Sbjct: 1681 IKAKSHNSKIPRKSRSTSKNALSSVNSSKVYAAKEAEPVKRTRSDFVAADGGGRGSGNIV 1740
Query: 1741 VSSAFSSPVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLDSSMMFDK 1800
VS+AFSSPVVSQPLAPIGTPALKSDSQ+ERSH ARSIQTSGPALATS+GRNLDSSMMFDK
Sbjct: 1741 VSTAFSSPVVSQPLAPIGTPALKSDSQSERSHAARSIQTSGPALATSEGRNLDSSMMFDK 1800
Query: 1801 KDDILDNVQSSFTSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSGLAGDPNVP 1860
KDDIL+NV SSFTSWG SRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHS LAGDPNVP
Sbjct: 1801 KDDILENVHSSFTSWGTSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSSLAGDPNVP 1860
Query: 1861 SPSILAMDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGAPTGLCHSDI 1920
SPSILA+DRSFSSAANPISSLLAGEKIQFGAVTSPTVL PGSCSTLLGIGAP+ LCHSDI
Sbjct: 1861 SPSILALDRSFSSAANPISSLLAGEKIQFGAVTSPTVLSPGSCSTLLGIGAPSSLCHSDI 1920
Query: 1921 PISRKLSGAENDCHIFFEKEKHHSESCTHIEDSEAEAEAAASAVAVAAISSDEMVTNGIG 1980
PI KLSGAENDCH+FFEKEKHHSESCTHIEDSEAEAEAAASAVAVAAISSDE+V NGIG
Sbjct: 1921 PIPHKLSGAENDCHLFFEKEKHHSESCTHIEDSEAEAEAAASAVAVAAISSDEIVANGIG 1980
Query: 1981 TCSVSVTDTNNFGGGDINVITAGSVEGSAGDQQLACKTRADDSLTVALPADLSVETPPIS 2040
TCSVSVTDTNNFGGGDINVITAGSV GDQQLA KTRADDSLTVALPADLSVETPPIS
Sbjct: 1981 TCSVSVTDTNNFGGGDINVITAGSV----GDQQLASKTRADDSLTVALPADLSVETPPIS 2040
Query: 2041 LWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQAQTQKS 2100
LWPTLPSPQNSSSQ+LSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQAQTQKS
Sbjct: 2041 LWPTLPSPQNSSSQVLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQAQTQKS 2100
Query: 2101 SAPAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQFG 2160
SAPAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQFG
Sbjct: 2101 SAPAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQFG 2160
Query: 2161 QVGLSFMGATYIPSGKQHDWKHSPGPSSLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPGS 2220
QVGLSFMG TYIPSGKQHDWKHSPGPSSLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPGS
Sbjct: 2161 QVGLSFMGTTYIPSGKQHDWKHSPGPSSLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPGS 2220
Query: 2221 PLLPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPMQQQAEGILPSHFSHA 2280
PLLPMASPLAMFDVSPFQASPEMSVQARWPSSASS QPVPLSMPMQQQAEGILPSHFSHA
Sbjct: 2221 PLLPMASPLAMFDVSPFQASPEMSVQARWPSSASSGQPVPLSMPMQQQAEGILPSHFSHA 2280
Query: 2281 SSSDPTFTVNRFPGSQPSVASDHKRNFTVAADATVTQLPDELGIVEASSCVGSGASVPNA 2340
SSSDPTFTVNRFPGSQPSVASDHKRNF VAADATVTQLPDELGIV+ASSCV SGASVPNA
Sbjct: 2281 SSSDPTFTVNRFPGSQPSVASDHKRNFPVAADATVTQLPDELGIVDASSCVSSGASVPNA 2340
Query: 2341 DINSLLVNSVTDAGKTGVQNCSSSNSGQNAGTNLKSQSSHHKGVSAQQYSHSSGYNYQRG 2400
DIN L VN VTDAGKTGVQNCSSSNSGQNAGTNLKSQSSHHKG+SAQQY HSSGYNYQRG
Sbjct: 2341 DINGLSVNLVTDAGKTGVQNCSSSNSGQNAGTNLKSQSSHHKGISAQQYGHSSGYNYQRG 2400
Query: 2401 GASQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSNGNLRV 2452
GASQKN SGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSNGNLRV
Sbjct: 2401 GASQKNGSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSNGNLRV 2443
BLAST of HG10016734 vs. NCBI nr
Match:
XP_004142008.1 (uncharacterized protein LOC101218305 isoform X1 [Cucumis sativus])
HSP 1 Score: 4298.0 bits (11146), Expect = 0.0e+00
Identity = 2278/2460 (92.60%), Postives = 2336/2460 (94.96%), Query Frame = 0
Query: 1 MANPGVGTKFVSVNLNKSYGQA----HHHHSSHSNSYGSNRTRPGGHGAGGGMVVLSRPR 60
MANPGVGTKFVSVNLNKSYGQ HHHHSSHSNSYGSNRTRPGGHG GGGMVVLSRPR
Sbjct: 1 MANPGVGTKFVSVNLNKSYGQTHHHHHHHHSSHSNSYGSNRTRPGGHGVGGGMVVLSRPR 60
Query: 61 SSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPRT 120
SSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPRT
Sbjct: 61 SSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPRT 120
Query: 121 SDLPEKEGVSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGTTGPIVSTSASSQVHTAV 180
+DLPEKEG SA IVDKIDPSLRSVDGVSGGSSVYMPPSARAG TGP+VSTSASS VH V
Sbjct: 121 NDLPEKEGPSATIVDKIDPSLRSVDGVSGGSSVYMPPSARAGMTGPVVSTSASSHVHATV 180
Query: 181 EKAPVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEGSYEEQRDTSHLSSRIDAR 240
EK+PVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKH +EGSYEEQRDT+HLSSRID R
Sbjct: 181 EKSPVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHGSEGSYEEQRDTTHLSSRIDDR 240
Query: 241 SKFQSSQKSIPCENARNGNSYSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERD 300
SK+QSSQKS+ ENA+NGNS+SSG+FQSPE SRKQEDIFPGPLPLVSMNPRSDWADDERD
Sbjct: 241 SKYQSSQKSVRSENAKNGNSFSSGTFQSPESSRKQEDIFPGPLPLVSMNPRSDWADDERD 300
Query: 301 TSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSDI 360
TSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSDI
Sbjct: 301 TSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSDI 360
Query: 361 HKVDSYGRDVRTAGREGWEGNFRKNNPIPKDGFGSDSGNDRNDIAGRPTSLDRETNADNM 420
HKVD YGRD R A REGWEGNFRKNNP+PKDGFGSD+ NDRN IAGRPTS+DRETNADN
Sbjct: 361 HKVDPYGRDARVASREGWEGNFRKNNPVPKDGFGSDNANDRNAIAGRPTSVDRETNADNT 420
Query: 421 HVSHFREHANKDGRRDTGFGQNGRQSWNSATESYSSQEPDRNVRDKYGSEQHSRYRGETH 480
HVSHFREHANKDGRRDTGFGQNGRQ+WNSATESYSSQEPDR V+DKYGSEQH+R+RGETH
Sbjct: 421 HVSHFREHANKDGRRDTGFGQNGRQTWNSATESYSSQEPDRTVKDKYGSEQHNRFRGETH 480
Query: 481 NTSVANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDPFT 540
NTSVANSSYSSGLKRIPADEPLLNFGRDRRS+AKIEKPYMEDPFMKDFGASSFDGRDPFT
Sbjct: 481 NTSVANSSYSSGLKRIPADEPLLNFGRDRRSYAKIEKPYMEDPFMKDFGASSFDGRDPFT 540
Query: 541 TGLVGVVKRKKDVIKQIDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREE 600
GLVGVVKRKKDVIKQ DFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREE
Sbjct: 541 AGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREE 600
Query: 601 EERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRILLEEERR 660
EERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRI LEEERR
Sbjct: 601 EERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRIFLEEERR 660
Query: 661 KQAAKLKLLELEERMAKRQAEAVKSSSLTSDIPEKKIPSVVKDVSRLADTVDWEDGEKMV 720
KQ AKLKLLELEE++AKRQAEAVKSS+ SDIPEKKIPSVVKDVSRL DTVDWEDGEKMV
Sbjct: 661 KQGAKLKLLELEEKIAKRQAEAVKSSTSNSDIPEKKIPSVVKDVSRLVDTVDWEDGEKMV 720
Query: 721 ERITTSASSESSSIIRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQFVLQ 780
ERITTSASSESSSI RSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFY+RGSGSQFVLQ
Sbjct: 721 ERITTSASSESSSINRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYDRGSGSQFVLQ 780
Query: 781 DQSTGYNGTRREASTGGRVSSRKELYGGAGFTTSKTSHRRGITEPQSDEYSQLRGQRPNL 840
DQSTGYNG RRE STGGRVSSRKE YGGA FTTSKTSHRRGITEPQSDEYS LRGQRPNL
Sbjct: 781 DQSTGYNGPRREVSTGGRVSSRKEFYGGAAFTTSKTSHRRGITEPQSDEYS-LRGQRPNL 840
Query: 841 SGGGDHYNRSQEFDPEFQDNVENFGDHGWRQESGRNNFYFPYPERVNPISETDGSYSVGR 900
SGG DHYN++QEFD +FQDNVENFGDHGWRQESG NNFYFPYPERVNPISETDGSYSVGR
Sbjct: 841 SGGVDHYNKTQEFDSDFQDNVENFGDHGWRQESGHNNFYFPYPERVNPISETDGSYSVGR 900
Query: 901 SRYSHRQPRVLPPPSVASIQKSSVRGEYESVPRDIVESEIQYDHPASNISTAQTRYIHHE 960
SRYS RQPRVLPPPSVAS+QKSSVR EYESV RDIVESEIQYDHPASNISTAQT YIHHE
Sbjct: 901 SRYSQRQPRVLPPPSVASMQKSSVRNEYESVSRDIVESEIQYDHPASNISTAQTMYIHHE 960
Query: 961 NRALPEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSP 1020
NRALPEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSP
Sbjct: 961 NRALPEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSP 1020
Query: 1021 VLSASREGTLSIEDNESAEP-AKVGKEIMITSARVSTGDEDEWGVVDEHVQEQEEYDEDD 1080
VLSASREGTLSIEDNESA P AK GKEIMITS RVSTGDEDEWG VDEHVQEQEEYDEDD
Sbjct: 1021 VLSASREGTLSIEDNESAVPAAKAGKEIMITSTRVSTGDEDEWGAVDEHVQEQEEYDEDD 1080
Query: 1081 DGYQEEDEVHEGEDENIDLVQDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERI 1140
DGYQEEDEVHEGEDENIDLVQDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERI
Sbjct: 1081 DGYQEEDEVHEGEDENIDLVQDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERI 1140
Query: 1141 PGNEENIYVAPEISNGIREEQGSSEGLQVDGKICQYVDASSQIRIDPEEMQDLVMQSKIA 1200
PGNEEN+YV EISN IREEQGSS+GLQVDG +CQYVDASSQIRIDPEEMQDLV+QSK A
Sbjct: 1141 PGNEENLYVTSEISNDIREEQGSSKGLQVDGNVCQYVDASSQIRIDPEEMQDLVLQSKTA 1200
Query: 1201 QALPESEITEQGNSSCRSSVSVQQPISSSVSMASQPISGQVIVPSTAVSGQAESPVKLQF 1260
QAL ESEITEQGNSSCRSSVSVQQPISSSVSMA Q ISGQVIVPS AVSGQAE PVKLQF
Sbjct: 1201 QALAESEITEQGNSSCRSSVSVQQPISSSVSMAPQSISGQVIVPS-AVSGQAEPPVKLQF 1260
Query: 1261 GLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVSQG 1320
GLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVS G
Sbjct: 1261 GLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVSPG 1320
Query: 1321 VLPLAPQPLTFVPPTVQTGFPLNKNPGEALSIHPSQETSAHNSRKNDVLPFLMDNQQGIV 1380
VLPLAPQPLTFVPPTVQTGF L KNPG+ LSIHPSQET AH+SRKN+V PFLMDNQQG+V
Sbjct: 1321 VLPLAPQPLTFVPPTVQTGFSLKKNPGDGLSIHPSQETCAHSSRKNNVSPFLMDNQQGLV 1380
Query: 1381 SRSLNVNPSGESKSLPLAESIESKVMTPQDQTVGSCIDESNSRSEPGFQAEHQRHRVSTS 1440
SRSLNVNPSGES+SLPLAESIESKV+TP DQT SCIDESNSR EPGFQAEH R RVS+S
Sbjct: 1381 SRSLNVNPSGESESLPLAESIESKVVTPHDQTAVSCIDESNSRPEPGFQAEHHRLRVSSS 1440
Query: 1441 DNHYVVSRGKESEGRAQDGMGSFDSVSRDKGLSGLKARGQFPGGRGKKYIFTVKNSGSRL 1500
DN YVVSRGKESEGRA DGMGSFDSVSR+KGLSGLK RGQFPGGRGKKYIFTVKNSGSRL
Sbjct: 1441 DNRYVVSRGKESEGRAPDGMGSFDSVSRNKGLSGLKGRGQFPGGRGKKYIFTVKNSGSRL 1500
Query: 1501 PFPGSESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSNSQVSSNHVGVDDKPTVSGRT 1560
PFP SESTRL+TGGFQRRPRRNI RTEFRVRET DKKLSNSQVSSNHVGVDDKPTVSGRT
Sbjct: 1501 PFPVSESTRLETGGFQRRPRRNITRTEFRVRETADKKLSNSQVSSNHVGVDDKPTVSGRT 1560
Query: 1561 AVNSARNGTRKVVISNKPSKRALESEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQGSQ 1620
AVNSARNGTRKV++SNKPSKRALESEGLSSG STS+ELDAGNRSEKGVKKEY GKSQGSQ
Sbjct: 1561 AVNSARNGTRKVIVSNKPSKRALESEGLSSGVSTSVELDAGNRSEKGVKKEYSGKSQGSQ 1620
Query: 1621 YSGEGNFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQR 1680
YSGEGNFR+NICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQR
Sbjct: 1621 YSGEGNFRRNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQR 1680
Query: 1681 EKEIKAKSHNSKIPRKSRSTSKSALSSVTSSKMYATKEAETVKRTRSDFVAPDGGGRGSG 1740
EKEIKAKSHNSKIPRK RSTSKSALSSV SSK+YA KEAETVKRTRSDFVA DGG RGSG
Sbjct: 1681 EKEIKAKSHNSKIPRKGRSTSKSALSSVNSSKVYAPKEAETVKRTRSDFVAADGGVRGSG 1740
Query: 1741 NIVVSSAFSSPVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLDSSMM 1800
N+VVSSAFS PVVSQPLAPIGTPALKSDSQ+ERSHTARSIQTSGP LAT+DGRNLDSSMM
Sbjct: 1741 NVVVSSAFSPPVVSQPLAPIGTPALKSDSQSERSHTARSIQTSGPTLATNDGRNLDSSMM 1800
Query: 1801 FDKKDDILDNVQSSFTSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSGLAGDP 1860
FDKKDDILDNVQSSFTSWGNSRINQQV+ALTQTQLDEAMKPAQFDLHPP AGD
Sbjct: 1801 FDKKDDILDNVQSSFTSWGNSRINQQVIALTQTQLDEAMKPAQFDLHPP-------AGDT 1860
Query: 1861 NVPSPSILAMDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGAPTGLCH 1920
NVPSPSILAMDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGAPTGLCH
Sbjct: 1861 NVPSPSILAMDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGAPTGLCH 1920
Query: 1921 SDIPISRKLSGAENDCHIFFEKEKHHSESCTHIEDSEAEAEAAASAVAVAAISSDEMVTN 1980
SDIPI KLSGA+NDCH+FFEKEKH SESCTHIEDSEAEAEAAASAVAVAAISSDEMVTN
Sbjct: 1921 SDIPIPHKLSGADNDCHLFFEKEKHRSESCTHIEDSEAEAEAAASAVAVAAISSDEMVTN 1980
Query: 1981 GIGTCSVSVTDTNNFGGGDINVITAGSVEGSAGDQQLACKTRADDSLTVALPADLSVETP 2040
GIGTCSVSVTDTNNFGGGDINV T GS GDQQLA KTRADDSLTVALPADLSVETP
Sbjct: 1981 GIGTCSVSVTDTNNFGGGDINVAT-----GSTGDQQLASKTRADDSLTVALPADLSVETP 2040
Query: 2041 PISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQAQT 2100
PISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQAQT
Sbjct: 2041 PISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQAQT 2100
Query: 2101 QKSSAPAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAPVG 2160
QKSSAPAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAPVG
Sbjct: 2101 QKSSAPAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAPVG 2160
Query: 2161 QFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGVEGDQKNLNMVSAQRMPTNLPPIQHLA 2220
QFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGV+GDQKNLNMVSAQRMPTNLPPIQHLA
Sbjct: 2161 QFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGVDGDQKNLNMVSAQRMPTNLPPIQHLA 2220
Query: 2221 PGSPLLPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPM-QQQAEGILPSH 2280
PGSPLLPMASPLAMFDVSPFQASPEMSVQ RWPSSAS VQPVPLSMPM QQQAEGILPSH
Sbjct: 2221 PGSPLLPMASPLAMFDVSPFQASPEMSVQTRWPSSASPVQPVPLSMPMQQQQAEGILPSH 2280
Query: 2281 FSHASSSDPTFTVNRFPGSQPSVASDHKRNFTVAADATVTQLPDELGIVEASSCVGSGAS 2340
FSHASSSDPTF+VNRF GSQPSVASD KRNFTV+ADATVTQLPDELGIV++SSCV SGAS
Sbjct: 2281 FSHASSSDPTFSVNRFSGSQPSVASDLKRNFTVSADATVTQLPDELGIVDSSSCVSSGAS 2340
Query: 2341 VPNADINSLLVNSVTDAGKTGVQNC-SSSNSGQ-NAGTNLKSQSSHHKGV-SAQQYSHSS 2400
VPN DINSL SVTDAGK GVQNC SSSNSGQ NAGT+LKSQ SHHKG+ SAQQYSHSS
Sbjct: 2341 VPNGDINSL---SVTDAGKAGVQNCSSSSNSGQNNAGTSLKSQ-SHHKGITSAQQYSHSS 2400
Query: 2401 GYNYQRGGASQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSNGNLRV 2452
GYNYQR GASQKNSSGGS+W HRRTGFMGR QSGAEKNFSSAKMKQIYVAKQPSNGNLRV
Sbjct: 2401 GYNYQRSGASQKNSSGGSDWTHRRTGFMGRTQSGAEKNFSSAKMKQIYVAKQPSNGNLRV 2442
BLAST of HG10016734 vs. NCBI nr
Match:
TYK12892.1 (uncharacterized protein E5676_scaffold255G004860 [Cucumis melo var. makuwa])
HSP 1 Score: 4271.1 bits (11076), Expect = 0.0e+00
Identity = 2268/2462 (92.12%), Postives = 2333/2462 (94.76%), Query Frame = 0
Query: 1 MANPGVGTKFVSVNLNKSYGQA-----HHHHSSHSNSYGSNRTRPGGHGAGGGMVVLSRP 60
MANPGVGTKFVSVNLNKSYGQ HHHHSSHSNSYGSNRTRPGGHG GGGMVVLSRP
Sbjct: 1 MANPGVGTKFVSVNLNKSYGQTHHHHHHHHHSSHSNSYGSNRTRPGGHGVGGGMVVLSRP 60
Query: 61 RSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPR 120
RSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPR
Sbjct: 61 RSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPR 120
Query: 121 TSDLPEKEGVSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGTTGPIVSTSASSQVHTA 180
T+DLPEKEG SANIVDKIDPSLRSVDGVSGGSSVYMPPSARAG TGP+VSTSASSQVH A
Sbjct: 121 TNDLPEKEGPSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGMTGPVVSTSASSQVHAA 180
Query: 181 VEKAPVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEGSYEEQRDTSHLSSRIDA 240
VEK+PVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKH +EGS EEQRD++HLSSRIDA
Sbjct: 181 VEKSPVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHVSEGSCEEQRDSAHLSSRIDA 240
Query: 241 RSKFQSSQKSIPCENARNGNSYSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDER 300
RS +QSSQKS+ ENA+NGNS+SSG+FQSPE SRKQEDIFPGPLPLVSMNPRSDWADDER
Sbjct: 241 RSNYQSSQKSVRSENAKNGNSFSSGTFQSPESSRKQEDIFPGPLPLVSMNPRSDWADDER 300
Query: 301 DTSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSD 360
DTSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSD
Sbjct: 301 DTSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSD 360
Query: 361 IHKVDSYGRDVRTAGREGWE-GNFRKNNPIPKDGFGSDSGNDRNDIAGRPTSLDRETNAD 420
IHKVD YGRD R A R+GWE GNFRKNNP+PKDGFGSD+GNDRN IAGR TS+DRETNAD
Sbjct: 361 IHKVDPYGRDSRMASRDGWEGGNFRKNNPVPKDGFGSDNGNDRNAIAGRLTSVDRETNAD 420
Query: 421 NMHVSHFREHANKDGRRDTGFGQNGRQSWNSATESYSSQEPDRNVRDKYGSEQHSRYRGE 480
NMHVSHFREHANKDGRRD GFGQNGRQ+WNSATESYSSQEPDR V+DKYGSEQHS++RGE
Sbjct: 421 NMHVSHFREHANKDGRRDAGFGQNGRQTWNSATESYSSQEPDRTVKDKYGSEQHSKFRGE 480
Query: 481 THNTSVANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDP 540
THNTSVANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDP
Sbjct: 481 THNTSVANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDP 540
Query: 541 FTTGLVGVVKRKKDVIKQIDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARR 600
FT GLVGVVKRKKDVIKQ DFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARR
Sbjct: 541 FTAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARR 600
Query: 601 EEEERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRILLEEE 660
EEEERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRI LEEE
Sbjct: 601 EEEERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRIFLEEE 660
Query: 661 RRKQAAKLKLLELEERMAKRQAEAVKSSSLTSDIPEKKIPSVVKDVSRLADTVDWEDGEK 720
RRKQAAKLKLLELEE++AKRQAEAVKSS+ SDIPEKKIPSVVKDVSRL DTVDWEDGEK
Sbjct: 661 RRKQAAKLKLLELEEKIAKRQAEAVKSSTSNSDIPEKKIPSVVKDVSRLVDTVDWEDGEK 720
Query: 721 MVERITTSASSESSSIIRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQFV 780
MVERITTSASSESSSI RSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQFV
Sbjct: 721 MVERITTSASSESSSINRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQFV 780
Query: 781 LQDQSTGYNGTRREASTGGRVSSRKELYGGAGFTTSKTSHRRGITEPQSDEYSQLRGQRP 840
LQDQSTGYNG RRE STGGRVSSRKE YGGA FTTSKTSHRRGITEPQSDEYSQLRGQRP
Sbjct: 781 LQDQSTGYNGPRREVSTGGRVSSRKEFYGGAAFTTSKTSHRRGITEPQSDEYSQLRGQRP 840
Query: 841 NLSGGGDHYNRSQEFDPEFQDNVENFGDHGWRQESGRNNFYFPYPERVNPISETDGSYSV 900
NLSGG DHYNR+QEFD +FQDNVENFGDHGWRQESG NNFYFPYPERVNPISETDGSYSV
Sbjct: 841 NLSGGVDHYNRTQEFDSDFQDNVENFGDHGWRQESGHNNFYFPYPERVNPISETDGSYSV 900
Query: 901 GRSRYSHRQPRVLPPPSVASIQKSSVRGEYESVPRDIVESEIQYDHPASNISTAQTRYIH 960
GRSRYS RQPRVLPPPSVAS+QKSSVR EYESVPRDI ESEIQYDHPASNISTAQT YIH
Sbjct: 901 GRSRYSQRQPRVLPPPSVASMQKSSVRNEYESVPRDI-ESEIQYDHPASNISTAQTMYIH 960
Query: 961 HENRALPEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGD 1020
HENRALPEIIDVNLENGENEEQK DGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGD
Sbjct: 961 HENRALPEIIDVNLENGENEEQKTDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGD 1020
Query: 1021 SPVLSASREGTLSIEDNESAEPAKVGKEIMITSARVSTGDEDEWGVVDEHVQEQEEYDED 1080
SPVLSASREGTLSIEDN+SA PAK GKEIMITS RVSTGDEDEWG VDEHVQEQEEYDED
Sbjct: 1021 SPVLSASREGTLSIEDNDSAVPAKAGKEIMITSTRVSTGDEDEWGAVDEHVQEQEEYDED 1080
Query: 1081 DDGYQEEDEVHEGEDENIDLVQDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFER 1140
DDGYQEEDEVHEGEDENIDLV DFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFER
Sbjct: 1081 DDGYQEEDEVHEGEDENIDLVPDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFER 1140
Query: 1141 IPGNEENIYVAPEISNGIREEQGSSEGLQVDG-KICQYVDASSQIRIDPEEMQDLVMQSK 1200
IPGNEEN+YVA EISN IREE+GSSEGLQVDG K+CQYVDASSQIRIDPEEMQDLVMQSK
Sbjct: 1141 IPGNEENLYVASEISNDIREERGSSEGLQVDGNKVCQYVDASSQIRIDPEEMQDLVMQSK 1200
Query: 1201 IAQALPESEITEQGNSSCRSSVSVQQPISSSVSMASQPISGQVIVPSTAVSGQAESPVKL 1260
IAQALP+SEITEQGN+SCRSSVSV+QPISSSVSMASQ ISGQVIVPS AVSGQAE PVKL
Sbjct: 1201 IAQALPDSEITEQGNASCRSSVSVRQPISSSVSMASQSISGQVIVPS-AVSGQAEPPVKL 1260
Query: 1261 QFGLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVS 1320
QFGLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVS
Sbjct: 1261 QFGLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVS 1320
Query: 1321 QGVLPLAPQPLTFVPPTVQTGFPLNKNPGEALSIHPSQETSAHNSRKNDVLPFLMDNQQG 1380
GVLPLAPQPLTF PTVQTGF LNKNPG+ LSIHPSQET AH+SRKND PF MDNQQG
Sbjct: 1321 PGVLPLAPQPLTFA-PTVQTGFSLNKNPGDGLSIHPSQETCAHSSRKNDSSPFSMDNQQG 1380
Query: 1381 IVSRSLNVNPSGESKSLPLAESIESKVMTPQDQTVGSCIDESNSRSEPGFQAEHQRHRVS 1440
+VSRSLNVNPSGESKSLPL ES+ESKV++PQDQ SCIDESNSRSEPGFQAEH R VS
Sbjct: 1381 LVSRSLNVNPSGESKSLPLTESMESKVVSPQDQAAVSCIDESNSRSEPGFQAEHHRLHVS 1440
Query: 1441 TSDNHYVVSRGKESEGRAQDGMGSFDSVSRDKGLSGLKARGQFPGGRGKKYIFTVKNSGS 1500
TSDNHYVVSRGKESEGRAQDGMGSFDS SR+KG SGLK RGQFPGGRGKKYIFTVKNSGS
Sbjct: 1441 TSDNHYVVSRGKESEGRAQDGMGSFDSASRNKGSSGLKGRGQFPGGRGKKYIFTVKNSGS 1500
Query: 1501 RLPFPGSESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSNSQVSSNHVGVDDKPTVSG 1560
RLPFP SESTRL+TGGFQRRPRRNI RTEFRVRET DKKLSNSQVSSNHVGVDDKPTVSG
Sbjct: 1501 RLPFPVSESTRLETGGFQRRPRRNITRTEFRVRETADKKLSNSQVSSNHVGVDDKPTVSG 1560
Query: 1561 RTAVNSARNGTRKVVISNKPSKRALESEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQG 1620
RTAV+SARNGTRKV++SNK SKRALESEGLSSG STS+ELDAGNRSEKGVKKEYLGKSQG
Sbjct: 1561 RTAVSSARNGTRKVIMSNKSSKRALESEGLSSGVSTSVELDAGNRSEKGVKKEYLGKSQG 1620
Query: 1621 SQYSGEGNFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRRE 1680
SQYSGEG+FR+NICSGED D PLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRRE
Sbjct: 1621 SQYSGEGSFRRNICSGEDADTPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRRE 1680
Query: 1681 QREKEIKAKSHNSKIPRKSRSTSKSALSSVTSSKMYATKEAETVKRTRSDFVAPDGGGRG 1740
QREKEIKAKSHN+KIPRK RST KSALSSV+SSK+YA KEAETVKRTRSDFVA DGG RG
Sbjct: 1681 QREKEIKAKSHNTKIPRKGRSTLKSALSSVSSSKVYAPKEAETVKRTRSDFVAADGGVRG 1740
Query: 1741 SGNIVVSSAFSSPVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLDSS 1800
SGN+VVSSAFS PVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALAT+DGRNLDSS
Sbjct: 1741 SGNVVVSSAFSPPVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATNDGRNLDSS 1800
Query: 1801 MMFDKKDDILDNVQSSFTSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSGLAG 1860
+MFDKKDDILDNVQSSF SWGNSRINQQV+ALTQTQLDEAMKPAQFDLHPP AG
Sbjct: 1801 LMFDKKDDILDNVQSSFASWGNSRINQQVIALTQTQLDEAMKPAQFDLHPP-------AG 1860
Query: 1861 DPNVPSPSILAMDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGAPTGL 1920
D NVPSPSILAMDRS+SSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIG PTGL
Sbjct: 1861 DTNVPSPSILAMDRSYSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGTPTGL 1920
Query: 1921 CHSDIPISRKLSGAENDCHIFFEKEKHHSESCTHIEDSEAEAEAAASAVAVAAISSDEMV 1980
CHSDI I KLSGAENDCH+FFEKEKH ESCTHIEDSEAEAEAAASAVAVAAISSDEMV
Sbjct: 1921 CHSDISIPHKLSGAENDCHLFFEKEKHRPESCTHIEDSEAEAEAAASAVAVAAISSDEMV 1980
Query: 1981 TNGIGTCSVSVTDTNNFGGGDINVITAGSVEGSAGDQQLACKTRADDSLTVALPADLSVE 2040
TNGIGTCSVSV+DTNNFG GDINVI GS GDQQLA KTRADDSLTVALPADLSVE
Sbjct: 1981 TNGIGTCSVSVSDTNNFGSGDINVIAT----GSTGDQQLASKTRADDSLTVALPADLSVE 2040
Query: 2041 TPPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQA 2100
TPPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQA
Sbjct: 2041 TPPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQA 2100
Query: 2101 QTQKSSAPAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAP 2160
QTQKSSAPAPGPLGSWK CHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAP
Sbjct: 2101 QTQKSSAPAPGPLGSWKHCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAP 2160
Query: 2161 VGQFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGVEGDQKNLNMVSAQRMPTNLPPIQH 2220
VGQFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGV+GDQKNLNMVSAQRMP NLPPIQH
Sbjct: 2161 VGQFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGVDGDQKNLNMVSAQRMPANLPPIQH 2220
Query: 2221 LAPGSPLLPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPM-QQQAEGILP 2280
LAPGSPLLPMASPLAMFDVSPFQASPEMSVQ RWPSSAS QPVPLSMPM QQQAEGILP
Sbjct: 2221 LAPGSPLLPMASPLAMFDVSPFQASPEMSVQTRWPSSASPAQPVPLSMPMQQQQAEGILP 2280
Query: 2281 SHFSHASSSDPTFTVNRFPGSQPSVASDHKRNFTVAADATVTQLPDELGIVEASSCVGSG 2340
SHFSHASSSDPTF+VNRFPGSQ SVASDHKRNFTV+ADATVTQLPDELGIV++SSCV SG
Sbjct: 2281 SHFSHASSSDPTFSVNRFPGSQASVASDHKRNFTVSADATVTQLPDELGIVDSSSCVSSG 2340
Query: 2341 ASVPNADINSLLVNSVTDAGKTGVQNC-SSSNSGQ-NAGTNLKSQSSHHKGV-SAQQYSH 2400
ASVPN DINSL SVTDAG+TGV+NC SSSNSGQ NAGTNLKS S HHKG+ SAQQYSH
Sbjct: 2341 ASVPNVDINSL---SVTDAGQTGVKNCSSSSNSGQNNAGTNLKS-SLHHKGISSAQQYSH 2400
Query: 2401 SSGYNYQRGGASQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSNGNL 2452
SSGYNYQRGGASQKNSSGGSEW HRRTGF+GRNQSGAEKNFSSAKMKQIYVAKQPSNGNL
Sbjct: 2401 SSGYNYQRGGASQKNSSGGSEWSHRRTGFVGRNQSGAEKNFSSAKMKQIYVAKQPSNGNL 2444
BLAST of HG10016734 vs. NCBI nr
Match:
XP_008440276.1 (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103484772 [Cucumis melo])
HSP 1 Score: 4269.2 bits (11071), Expect = 0.0e+00
Identity = 2267/2463 (92.04%), Postives = 2331/2463 (94.64%), Query Frame = 0
Query: 1 MANPGVGTKFVSVNLNKSYGQA------HHHHSSHSNSYGSNRTRPGGHGAGGGMVVLSR 60
MANPGVGTKFVSVNLNKSYGQ HHHHSSHSNSYGSNRTRPGGHG GGGMVVLSR
Sbjct: 1 MANPGVGTKFVSVNLNKSYGQTHHHHHHHHHHSSHSNSYGSNRTRPGGHGVGGGMVVLSR 60
Query: 61 PRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKP 120
PRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKP
Sbjct: 61 PRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKP 120
Query: 121 RTSDLPEKEGVSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGTTGPIVSTSASSQVHT 180
RT+DLPEKEG SANIVDKIDPSLRSVDGVSGGSSVYMPPSARAG TGP+VSTSASSQVH
Sbjct: 121 RTNDLPEKEGPSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGMTGPVVSTSASSQVHA 180
Query: 181 AVEKAPVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEGSYEEQRDTSHLSSRID 240
AVEK+PVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKH +EGSYEEQRD++HLSSRID
Sbjct: 181 AVEKSPVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHVSEGSYEEQRDSAHLSSRID 240
Query: 241 ARSKFQSSQKSIPCENARNGNSYSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDE 300
ARS +QSSQKS+ ENA+NGNS+SSG+FQSPE SRKQEDIFPGPLPLVSMNPRSDWADDE
Sbjct: 241 ARSNYQSSQKSVRSENAKNGNSFSSGTFQSPESSRKQEDIFPGPLPLVSMNPRSDWADDE 300
Query: 301 RDTSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSS 360
RDTSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNL DDESGKFHSS
Sbjct: 301 RDTSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLPDDESGKFHSS 360
Query: 361 DIHKVDSYGRDVRTAGREGWE-GNFRKNNPIPKDGFGSDSGNDRNDIAGRPTSLDRETNA 420
DIHKVD YGRD R A R+GWE GNFRKNNP+PKDGFGSD+GNDRN IAGR TS+DRETNA
Sbjct: 361 DIHKVDPYGRDSRMASRDGWEGGNFRKNNPVPKDGFGSDNGNDRNAIAGRLTSVDRETNA 420
Query: 421 DNMHVSHFREHANKDGRRDTGFGQNGRQSWNSATESYSSQEPDRNVRDKYGSEQHSRYRG 480
DNMHVSHFREHANKDGRRD GFGQNGRQ+WNSATESYSSQEPDR V+DKYGSEQHSR+RG
Sbjct: 421 DNMHVSHFREHANKDGRRDAGFGQNGRQTWNSATESYSSQEPDRTVKDKYGSEQHSRFRG 480
Query: 481 ETHNTSVANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRD 540
ETHNTSVANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRD
Sbjct: 481 ETHNTSVANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRD 540
Query: 541 PFTTGLVGVVKRKKDVIKQIDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELAR 600
PFT GLVGVVKRKKDVIKQ DFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELAR
Sbjct: 541 PFTAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELAR 600
Query: 601 REEEERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRILLEE 660
REEEERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRI LEE
Sbjct: 601 REEEERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRIFLEE 660
Query: 661 ERRKQAAKLKLLELEERMAKRQAEAVKSSSLTSDIPEKKIPSVVKDVSRLADTVDWEDGE 720
ERRKQAAKLKLLELEE++AKRQAEAVKSS+ SDIPEKKIPSVVKDVSRL DTVDWEDGE
Sbjct: 661 ERRKQAAKLKLLELEEKIAKRQAEAVKSSTSNSDIPEKKIPSVVKDVSRLVDTVDWEDGE 720
Query: 721 KMVERITTSASSESSSIIRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQF 780
KMVERITTSASSESSSI RSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQF
Sbjct: 721 KMVERITTSASSESSSINRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQF 780
Query: 781 VLQDQSTGYNGTRREASTGGRVSSRKELYGGAGFTTSKTSHRRGITEPQSDEYSQLRGQR 840
VLQDQSTGYNG RRE STGGRVSSRKE YGGA FTTSKTSHRRGITEPQSDEYSQLRGQR
Sbjct: 781 VLQDQSTGYNGPRREVSTGGRVSSRKEFYGGAAFTTSKTSHRRGITEPQSDEYSQLRGQR 840
Query: 841 PNLSGGGDHYNRSQEFDPEFQDNVENFGDHGWRQESGRNNFYFPYPERVNPISETDGSYS 900
PNLSGG DHYNR+QEFD +FQDNVENFGDHGWRQESG NNFYFPYPERVNPISETDGSYS
Sbjct: 841 PNLSGGVDHYNRTQEFDSDFQDNVENFGDHGWRQESGHNNFYFPYPERVNPISETDGSYS 900
Query: 901 VGRSRYSHRQPRVLPPPSVASIQKSSVRGEYESVPRDIVESEIQYDHPASNISTAQTRYI 960
VGRSRYS RQPRVLPPPSVAS+QKSSVR EYESVPRDI ESEIQYDHPASNISTAQT YI
Sbjct: 901 VGRSRYSQRQPRVLPPPSVASMQKSSVRNEYESVPRDI-ESEIQYDHPASNISTAQTMYI 960
Query: 961 HHENRALPEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSG 1020
HHENRALPEIIDVNLENGENEEQK DGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSG
Sbjct: 961 HHENRALPEIIDVNLENGENEEQKTDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSG 1020
Query: 1021 DSPVLSASREGTLSIEDNESAEPAKVGKEIMITSARVSTGDEDEWGVVDEHVQEQEEYDE 1080
DSPVLSASREGTLSIEDN+SA PAK GKEIMITS RVSTGDEDEWG VDEHVQEQEEYDE
Sbjct: 1021 DSPVLSASREGTLSIEDNDSAVPAKAGKEIMITSTRVSTGDEDEWGAVDEHVQEQEEYDE 1080
Query: 1081 DDDGYQEEDEVHEGEDENIDLVQDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFE 1140
DDDGYQEEDEVHEGEDENIDLV DFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFE
Sbjct: 1081 DDDGYQEEDEVHEGEDENIDLVPDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFE 1140
Query: 1141 RIPGNEENIYVAPEISNGIREEQGSSEGLQVDG-KICQYVDASSQIRIDPEEMQDLVMQS 1200
RIPGNEEN+YVA EISN IREE+GSSEGLQVDG K+CQYVDASSQIRIDPEEMQDLVMQS
Sbjct: 1141 RIPGNEENLYVASEISNDIREERGSSEGLQVDGNKVCQYVDASSQIRIDPEEMQDLVMQS 1200
Query: 1201 KIAQALPESEITEQGNSSCRSSVSVQQPISSSVSMASQPISGQVIVPSTAVSGQAESPVK 1260
K AQALP+SEITEQGN+SCRSSVSV+QPISSSVSMASQ ISGQVIVPS AVSGQAE PVK
Sbjct: 1201 KTAQALPDSEITEQGNASCRSSVSVRQPISSSVSMASQSISGQVIVPS-AVSGQAEPPVK 1260
Query: 1261 LQFGLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSV 1320
LQFGLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSV
Sbjct: 1261 LQFGLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSV 1320
Query: 1321 SQGVLPLAPQPLTFVPPTVQTGFPLNKNPGEALSIHPSQETSAHNSRKNDVLPFLMDNQQ 1380
S GVLPLAPQPLTF PTVQTGF LNKNPG+ LSIHPSQET AH+SRKND PF MDNQQ
Sbjct: 1321 SPGVLPLAPQPLTFA-PTVQTGFSLNKNPGDGLSIHPSQETCAHSSRKNDSSPFSMDNQQ 1380
Query: 1381 GIVSRSLNVNPSGESKSLPLAESIESKVMTPQDQTVGSCIDESNSRSEPGFQAEHQRHRV 1440
G+VSRSLNVNPSGESKSLPL ES+ESKV++PQDQ SCIDESNSRSEPGFQAEH R V
Sbjct: 1381 GLVSRSLNVNPSGESKSLPLTESMESKVVSPQDQAAVSCIDESNSRSEPGFQAEHHRLHV 1440
Query: 1441 STSDNHYVVSRGKESEGRAQDGMGSFDSVSRDKGLSGLKARGQFPGGRGKKYIFTVKNSG 1500
STSDNHYVVSRGKESEGRAQDGMGSFDS SR+KG SGLK RGQFPGGRGKKYIFTVKNSG
Sbjct: 1441 STSDNHYVVSRGKESEGRAQDGMGSFDSASRNKGSSGLKGRGQFPGGRGKKYIFTVKNSG 1500
Query: 1501 SRLPFPGSESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSNSQVSSNHVGVDDKPTVS 1560
SRLPFP SESTRL+TGGFQRRPRRNI RTEFRVRET DKKLSNSQVSSNHVGVDDKPTVS
Sbjct: 1501 SRLPFPVSESTRLETGGFQRRPRRNITRTEFRVRETADKKLSNSQVSSNHVGVDDKPTVS 1560
Query: 1561 GRTAVNSARNGTRKVVISNKPSKRALESEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQ 1620
GRTAV+SARNGTRKV++SNK SKRALESEGLSSG STS+ELDAGNRSEKGVKKEYLGKSQ
Sbjct: 1561 GRTAVSSARNGTRKVIMSNKSSKRALESEGLSSGVSTSVELDAGNRSEKGVKKEYLGKSQ 1620
Query: 1621 GSQYSGEGNFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRR 1680
GSQYSGEG+FR+NICSGED D PLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRR
Sbjct: 1621 GSQYSGEGSFRRNICSGEDADTPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRR 1680
Query: 1681 EQREKEIKAKSHNSKIPRKSRSTSKSALSSVTSSKMYATKEAETVKRTRSDFVAPDGGGR 1740
EQREKEIKAKSHN+KIPRK RST KSALSSV+SSK+YA KEAETVKRTRSDFVA DGG R
Sbjct: 1681 EQREKEIKAKSHNTKIPRKGRSTLKSALSSVSSSKVYAPKEAETVKRTRSDFVAADGGVR 1740
Query: 1741 GSGNIVVSSAFSSPVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLDS 1800
GSGN+VVSSAFS PVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALAT+DGRNLDS
Sbjct: 1741 GSGNVVVSSAFSPPVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATNDGRNLDS 1800
Query: 1801 SMMFDKKDDILDNVQSSFTSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSGLA 1860
S+MFDKKDDILDNVQSSF SWGNSRINQQV+ALTQTQLDEAMKPAQFDLHPP A
Sbjct: 1801 SLMFDKKDDILDNVQSSFASWGNSRINQQVIALTQTQLDEAMKPAQFDLHPP-------A 1860
Query: 1861 GDPNVPSPSILAMDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGAPTG 1920
GD NVPSPSILAMDRS+SSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIG PTG
Sbjct: 1861 GDTNVPSPSILAMDRSYSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGTPTG 1920
Query: 1921 LCHSDIPISRKLSGAENDCHIFFEKEKHHSESCTHIEDSEAEAEAAASAVAVAAISSDEM 1980
LCHSDI I KLSGAENDCH+FFEKEKH ESCTHIEDSEAEAEAAASAVAVAAISSDEM
Sbjct: 1921 LCHSDISIPHKLSGAENDCHLFFEKEKHRPESCTHIEDSEAEAEAAASAVAVAAISSDEM 1980
Query: 1981 VTNGIGTCSVSVTDTNNFGGGDINVITAGSVEGSAGDQQLACKTRADDSLTVALPADLSV 2040
VTNGIGTCSVSV+DTNNFG GDINVI GS GDQQLA KTRADDSLTVALPADLSV
Sbjct: 1981 VTNGIGTCSVSVSDTNNFGSGDINVIAT----GSTGDQQLASKTRADDSLTVALPADLSV 2040
Query: 2041 ETPPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQ 2100
ETPPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQ
Sbjct: 2041 ETPPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQ 2100
Query: 2101 AQTQKSSAPAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFA 2160
AQTQKSSAPAPGPLGSWK CHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFA
Sbjct: 2101 AQTQKSSAPAPGPLGSWKHCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFA 2160
Query: 2161 PVGQFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGVEGDQKNLNMVSAQRMPTNLPPIQ 2220
PVGQFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGV+GDQKNLNMVSAQRMP NLPPIQ
Sbjct: 2161 PVGQFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGVDGDQKNLNMVSAQRMPANLPPIQ 2220
Query: 2221 HLAPGSPLLPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPM-QQQAEGIL 2280
HLAPGSPLLPMASPLAMFDVSPFQASPEMSVQ RWPSS S QPVPLSMPM QQQAEGIL
Sbjct: 2221 HLAPGSPLLPMASPLAMFDVSPFQASPEMSVQTRWPSSVSPAQPVPLSMPMQQQQAEGIL 2280
Query: 2281 PSHFSHASSSDPTFTVNRFPGSQPSVASDHKRNFTVAADATVTQLPDELGIVEASSCVGS 2340
PSHFSHASSSDPTF+VNRFPGSQ SVASDHKRNFTV+ADATVTQLPDELGIV++SSCV S
Sbjct: 2281 PSHFSHASSSDPTFSVNRFPGSQASVASDHKRNFTVSADATVTQLPDELGIVDSSSCVSS 2340
Query: 2341 GASVPNADINSLLVNSVTDAGKTGVQNC-SSSNSGQ-NAGTNLKSQSSHHKGV-SAQQYS 2400
GASVPN DINSL SVTDAG+TGV+NC SSSNSGQ NAGTNLKS S HHKG+ SAQQYS
Sbjct: 2341 GASVPNVDINSL---SVTDAGQTGVKNCSSSSNSGQNNAGTNLKS-SLHHKGISSAQQYS 2400
Query: 2401 HSSGYNYQRGGASQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSNGN 2452
HSSGYNYQRGGASQKNSSGGSEW HRRTGF+GRNQSGAEKNFSSAKMKQIYVAKQPSNGN
Sbjct: 2401 HSSGYNYQRGGASQKNSSGGSEWSHRRTGFVGRNQSGAEKNFSSAKMKQIYVAKQPSNGN 2445
BLAST of HG10016734 vs. NCBI nr
Match:
XP_022950041.1 (uncharacterized protein LOC111453246 [Cucurbita moschata] >XP_022950042.1 uncharacterized protein LOC111453246 [Cucurbita moschata] >XP_022950043.1 uncharacterized protein LOC111453246 [Cucurbita moschata])
HSP 1 Score: 4239.1 bits (10993), Expect = 0.0e+00
Identity = 2241/2451 (91.43%), Postives = 2312/2451 (94.33%), Query Frame = 0
Query: 1 MANPGVGTKFVSVNLNKSYGQAHHHHSSHSNSYGSNRTRPGGHGAGGGMVVLSRPRSSQK 60
MANPGVG KFVSVNLNKSYGQAHHHHSSHSNSYGSNRTRPG HGAGGGMVVLSRPRSSQK
Sbjct: 1 MANPGVGAKFVSVNLNKSYGQAHHHHSSHSNSYGSNRTRPGSHGAGGGMVVLSRPRSSQK 60
Query: 61 PGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPRTSDLP 120
PGPKLSVPPPLNLPSLRKEHERLDSLGSG G TGGGVLGN QRPTSAG+GWTKP T+DLP
Sbjct: 61 PGPKLSVPPPLNLPSLRKEHERLDSLGSGAGQTGGGVLGNRQRPTSAGLGWTKPHTNDLP 120
Query: 121 EKEGVSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGTTGPIVSTSASSQVHTAVEKAP 180
EKEG+S NIVDKIDPSLRSVDGV+GGSSVYMPPSARA T GP+VSTSASSQVHTAVEKAP
Sbjct: 121 EKEGLSGNIVDKIDPSLRSVDGVNGGSSVYMPPSARASTAGPVVSTSASSQVHTAVEKAP 180
Query: 181 VLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEGSYEEQRDTSHLSSRIDARSKFQ 240
VLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAE SYEEQRDTSHLSS IDARSKFQ
Sbjct: 181 VLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEVSYEEQRDTSHLSSSIDARSKFQ 240
Query: 241 SSQKSIPCENARNGNSYSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERDTSHG 300
SS+KSIP ENA+NGNS+SSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERDTSHG
Sbjct: 241 SSKKSIPSENAKNGNSFSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERDTSHG 300
Query: 301 LIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSDIHKVD 360
LIDRVRD GHPKSEAYWERDFDMP VSSLPHKP HNFSQRW+ RDDESGKFHSSDIHKVD
Sbjct: 301 LIDRVRDHGHPKSEAYWERDFDMPWVSSLPHKPIHNFSQRWHPRDDESGKFHSSDIHKVD 360
Query: 361 SYGRDVRTAGREGWEGNFRKNNPIPKDGFGSDSGNDRNDIAGRPTSLDRETNADNMHVSH 420
YGRD RT REGWEGNF+KNNPIPKD FGSDSGNDRNDIAGRPTS+DRETNADNMHVS
Sbjct: 361 PYGRDTRTPSREGWEGNFQKNNPIPKDRFGSDSGNDRNDIAGRPTSIDRETNADNMHVSQ 420
Query: 421 FREHANKDGRRDTGFGQNGRQSWNSATESYSSQEPDRNVRDKYGSEQHSRYRGETHNTSV 480
FREHA K GRRDTGF GRQ+WNSA+ESY+SQ+PD V+DK+GSEQH+++RG+THNTSV
Sbjct: 421 FREHAPKVGRRDTGF---GRQTWNSASESYNSQDPDWTVKDKHGSEQHNKFRGQTHNTSV 480
Query: 481 ANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDPFTTGLV 540
+NSSYS GLKRIPAD+ LLNFGRDRRSFAKIEKPYMEDPFMKDFG SSFDGRDP+T GLV
Sbjct: 481 SNSSYSPGLKRIPADDLLLNFGRDRRSFAKIEKPYMEDPFMKDFGGSSFDGRDPYTGGLV 540
Query: 541 GVVKRKKDVIKQIDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREEEERQ 600
GVVKRKKDVIKQ DFHDPVR+SFEAELERVQQIQEQERQRIIEEQERALELARREEEERQ
Sbjct: 541 GVVKRKKDVIKQTDFHDPVRDSFEAELERVQQIQEQERQRIIEEQERALELARREEEERQ 600
Query: 601 RLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRILLEEERRKQAA 660
RLARE EERQRRAEE AREAAWRAEQERLEAIQKAEELRIAREEEKQRI +EEERRKQAA
Sbjct: 601 RLAREQEERQRRAEEIAREAAWRAEQERLEAIQKAEELRIAREEEKQRIFVEEERRKQAA 660
Query: 661 KLKLLELEERMAKRQAEAVKSSSLTSDIPEKKIPSVVKDVSRLADTVDWEDGEKMVERIT 720
KLKLLELEERMAKRQAEAVKSS+LTSDIPEKKI SVVKD SRLADTVDWEDGEKMVERIT
Sbjct: 661 KLKLLELEERMAKRQAEAVKSSTLTSDIPEKKISSVVKDASRLADTVDWEDGEKMVERIT 720
Query: 721 TSASSESSSIIRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQFVLQDQST 780
TSASSESSSI R SEVGLR+Q SRDGSPSFVDRGKSVNSWRRDFY+RGSGSQFVLQDQST
Sbjct: 721 TSASSESSSINRPSEVGLRTQVSRDGSPSFVDRGKSVNSWRRDFYDRGSGSQFVLQDQST 780
Query: 781 GYNGTRREASTGGRVSSRKELYGGAGFTTSKTSHRRGITEPQSDEYSQLRGQRPNLSGGG 840
GY G RREA+TGGRVSSRKE YGGAG TS+ +RRG+TEPQSD+YSQLRGQRPNLSGGG
Sbjct: 781 GYTGPRREATTGGRVSSRKEFYGGAGLATSRIYNRRGMTEPQSDDYSQLRGQRPNLSGGG 840
Query: 841 DHYNRSQEFDPEFQDNVENFGDHGWRQESGRNNFYFPYPERVNPISETDGSYSVGRSRYS 900
D YNRSQEFD EFQDNVENFGDHGWRQE GRNNFYFPYPERVNPISE DGSYSVGRSRYS
Sbjct: 841 DQYNRSQEFDSEFQDNVENFGDHGWRQEGGRNNFYFPYPERVNPISEADGSYSVGRSRYS 900
Query: 901 HRQPRVLPPPSVASIQKSSVRGEYESVPRDIVESEIQYDHPASNISTAQTRYIHHENRAL 960
RQPRVLPPPSVASIQKSSVRGE+ SV RDI ESEIQYDH A N+STAQTRYIHHENR L
Sbjct: 901 QRQPRVLPPPSVASIQKSSVRGEFTSVTRDIAESEIQYDHLARNVSTAQTRYIHHENRTL 960
Query: 961 PEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSA 1020
PEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSA
Sbjct: 961 PEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSA 1020
Query: 1021 SREGTLSIEDNESAEPAKVGKEIMITSARVSTGDEDEWGVVDEHVQEQEEYDEDDDGYQE 1080
SREGTLSIEDNESA PAK GKEIMITS R STGDEDEWGVVDEHVQEQEEYDEDDDGY+E
Sbjct: 1021 SREGTLSIEDNESAVPAKAGKEIMITSTRASTGDEDEWGVVDEHVQEQEEYDEDDDGYRE 1080
Query: 1081 EDEVHEGEDENIDLVQDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERIPGNEE 1140
EDEVHEGEDENIDL Q+FDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERI GNEE
Sbjct: 1081 EDEVHEGEDENIDLAQNFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERILGNEE 1140
Query: 1141 NIYVAPEISNGIREEQGSSEGLQVDGKICQYVDASSQIRIDPEEMQDLVMQSKIAQALPE 1200
N++VAPE+SN IREEQGSSEGLQVDGK+CQY DASSQIRIDPEEMQDLVMQS+ AQALPE
Sbjct: 1141 NMFVAPEVSNCIREEQGSSEGLQVDGKVCQYEDASSQIRIDPEEMQDLVMQSETAQALPE 1200
Query: 1201 SEITEQGNSSCRSSVSVQQPISSSVSMASQPISGQVIVPSTAVSGQAESPVKLQFGLFSG 1260
EI EQGNSSCRSSVSVQQPISSSVS ASQ SGQVIVP+ A SGQAE PVKLQFGLFSG
Sbjct: 1201 PEINEQGNSSCRSSVSVQQPISSSVSTASQSSSGQVIVPNAAGSGQAEPPVKLQFGLFSG 1260
Query: 1261 PSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVSQGVLPLA 1320
PSLIPSPVPAIQIGSIQMPLHLHPQ+T SMTHMHSSQPPLFQFGQLRYTSSVSQGVLPLA
Sbjct: 1261 PSLIPSPVPAIQIGSIQMPLHLHPQMTPSMTHMHSSQPPLFQFGQLRYTSSVSQGVLPLA 1320
Query: 1321 PQPLTFVPPTVQTGFPLNKNPGEALSIHPSQETSAHNSRKNDVLPFLMDNQQGIVSRSLN 1380
PQPLTFVPP VQTGFPLNKNPG+AL I SQET AHNSRKNDVLP LMDNQQG+VSRSLN
Sbjct: 1321 PQPLTFVPPAVQTGFPLNKNPGDALPIQTSQETCAHNSRKNDVLPLLMDNQQGLVSRSLN 1380
Query: 1381 VNPSGESKSLPLAESIESKVMTPQDQTVGSCIDESNSRSEPGFQAEHQRHRVSTSDNHYV 1440
VN SGESKSLPL ESIES+VM Q QT GSCIDESNSRSEPGFQAEHQRH VSTSDNHYV
Sbjct: 1381 VNSSGESKSLPLTESIESQVMAQQYQTAGSCIDESNSRSEPGFQAEHQRHHVSTSDNHYV 1440
Query: 1441 VSRGKESEGRAQDGMGSFDSVSRDKGLSGLKARGQFPGGRGKKYIFTVKNSGSRLPFPGS 1500
VSRGKESEGRAQDGMGS DSVSRDKGLSGLKARGQFPGGRGKKY+FTVKNSGSRLPFPGS
Sbjct: 1441 VSRGKESEGRAQDGMGSLDSVSRDKGLSGLKARGQFPGGRGKKYVFTVKNSGSRLPFPGS 1500
Query: 1501 ESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSNSQVSSNHVGVDDKPTVSGRTAVNSA 1560
ESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLS+SQVSSNHV VDDKPTVSGRTAVNSA
Sbjct: 1501 ESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSSSQVSSNHVEVDDKPTVSGRTAVNSA 1560
Query: 1561 RNGTRKVVISNKPSKRALESEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQGSQYSGEG 1620
RNGTRKV +SNKPSKRALE EGLSSGASTSLELDAGNRSEKGVKKEYLGKSQGSQY GE
Sbjct: 1561 RNGTRKVFVSNKPSKRALEPEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQGSQYYGES 1620
Query: 1621 NFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREKEIK 1680
NFRKNICSGEDVDAP+QSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREKEIK
Sbjct: 1621 NFRKNICSGEDVDAPMQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREKEIK 1680
Query: 1681 AKSHNSKIPRKSRSTSKSALSSVTSSKMYATKEAETVKRTRSDFVAPDGGGRGSGNIVVS 1740
AKSHNSKIPRKSRSTSK ALSSV SSK+YA K AETVKRTRSDFVA DGGGRGSGNIVVS
Sbjct: 1681 AKSHNSKIPRKSRSTSKIALSSVNSSKVYAAKVAETVKRTRSDFVAADGGGRGSGNIVVS 1740
Query: 1741 SAFSSPVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLDSSMMFDKKD 1800
SA SS +VSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNL+SS+MFDKK+
Sbjct: 1741 SALSSSIVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLESSLMFDKKN 1800
Query: 1801 DILDNVQSSFTSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSGLAGDPNVPSP 1860
DILDNV SSF SWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHS LAGDPNVPS
Sbjct: 1801 DILDNVTSSFPSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSSLAGDPNVPSS 1860
Query: 1861 SILAMDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGAPTGLCHSDIPI 1920
SILA+DRSFSSAANPISSLLAGEKIQFGAVTSPTVLPP SCSTLLGIG PTGLCHSD+ I
Sbjct: 1861 SILAIDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPDSCSTLLGIG-PTGLCHSDMQI 1920
Query: 1921 SRKLSGAENDCHIFFEKEKHHSESCTHIEDSEAEAEAAASAVAVAAISSDEMVTNGIGTC 1980
KLSGAENDCH+FFEKEKHHSES T IEDSEAEAEAAASAVAVAAISSDE+VTNG+GT
Sbjct: 1921 PHKLSGAENDCHLFFEKEKHHSESRTRIEDSEAEAEAAASAVAVAAISSDEIVTNGLGTS 1980
Query: 1981 SVSVTDTNNFGGGDINVITAGSVEGSAGDQQLACKTRADDSLTVALPADLSVETPPISLW 2040
SV VTDTNNFGGGDINVI A GSAG+QQ A KTRADDSLTVALPADLSVETPPISLW
Sbjct: 1981 SVPVTDTNNFGGGDINVIIA----GSAGNQQFASKTRADDSLTVALPADLSVETPPISLW 2040
Query: 2041 PTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQAQTQKSSA 2100
P+LPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESV TTQAQTQKSSA
Sbjct: 2041 PSLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQKSSA 2100
Query: 2101 PAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQFGQV 2160
PAPGPLGSWKQCHSGVDSFYGPP GFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQFGQV
Sbjct: 2101 PAPGPLGSWKQCHSGVDSFYGPPAGFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQFGQV 2160
Query: 2161 GLSFMGATYIPSGKQHDWKHSPGPSSLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPGSPL 2220
GLSFMGATYIPSGKQ DWKHSPGP SLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPGSPL
Sbjct: 2161 GLSFMGATYIPSGKQPDWKHSPGP-SLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPGSPL 2220
Query: 2221 LPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPMQQQAEGILPSHFSHASS 2280
LPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMP+QQQAEGILPSHFSHASS
Sbjct: 2221 LPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPLQQQAEGILPSHFSHASS 2280
Query: 2281 SDPTFTVNRFPGSQPSVASDHKRNFTVAADATVTQLPDELGIVEASSCVGSGASVPNADI 2340
+DP+FTVNRFPGSQPSVASDHKRN+TVAADATVTQLPDELGIV+ASSCV SG SVPN DI
Sbjct: 2281 ADPSFTVNRFPGSQPSVASDHKRNYTVAADATVTQLPDELGIVDASSCVSSGGSVPNVDI 2340
Query: 2341 NSLLVNSVTDAGKTGVQNCSSSNSGQNAGTNLKSQSSHHKGVSAQQYSHSSGYNYQRGGA 2400
SL VNSVTDAGKT VQNCSSSNS NAGTNLKSQS HKG+ AQQYSHSSGYNYQRGGA
Sbjct: 2341 KSLSVNSVTDAGKT-VQNCSSSNSSLNAGTNLKSQSPQHKGIPAQQYSHSSGYNYQRGGA 2400
Query: 2401 SQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSNGNLRV 2452
SQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPS+GNLRV
Sbjct: 2401 SQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSSGNLRV 2441
BLAST of HG10016734 vs. ExPASy TrEMBL
Match:
A0A5D3CNG4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G004860 PE=4 SV=1)
HSP 1 Score: 4271.1 bits (11076), Expect = 0.0e+00
Identity = 2268/2462 (92.12%), Postives = 2333/2462 (94.76%), Query Frame = 0
Query: 1 MANPGVGTKFVSVNLNKSYGQA-----HHHHSSHSNSYGSNRTRPGGHGAGGGMVVLSRP 60
MANPGVGTKFVSVNLNKSYGQ HHHHSSHSNSYGSNRTRPGGHG GGGMVVLSRP
Sbjct: 1 MANPGVGTKFVSVNLNKSYGQTHHHHHHHHHSSHSNSYGSNRTRPGGHGVGGGMVVLSRP 60
Query: 61 RSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPR 120
RSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPR
Sbjct: 61 RSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPR 120
Query: 121 TSDLPEKEGVSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGTTGPIVSTSASSQVHTA 180
T+DLPEKEG SANIVDKIDPSLRSVDGVSGGSSVYMPPSARAG TGP+VSTSASSQVH A
Sbjct: 121 TNDLPEKEGPSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGMTGPVVSTSASSQVHAA 180
Query: 181 VEKAPVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEGSYEEQRDTSHLSSRIDA 240
VEK+PVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKH +EGS EEQRD++HLSSRIDA
Sbjct: 181 VEKSPVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHVSEGSCEEQRDSAHLSSRIDA 240
Query: 241 RSKFQSSQKSIPCENARNGNSYSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDER 300
RS +QSSQKS+ ENA+NGNS+SSG+FQSPE SRKQEDIFPGPLPLVSMNPRSDWADDER
Sbjct: 241 RSNYQSSQKSVRSENAKNGNSFSSGTFQSPESSRKQEDIFPGPLPLVSMNPRSDWADDER 300
Query: 301 DTSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSD 360
DTSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSD
Sbjct: 301 DTSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSD 360
Query: 361 IHKVDSYGRDVRTAGREGWE-GNFRKNNPIPKDGFGSDSGNDRNDIAGRPTSLDRETNAD 420
IHKVD YGRD R A R+GWE GNFRKNNP+PKDGFGSD+GNDRN IAGR TS+DRETNAD
Sbjct: 361 IHKVDPYGRDSRMASRDGWEGGNFRKNNPVPKDGFGSDNGNDRNAIAGRLTSVDRETNAD 420
Query: 421 NMHVSHFREHANKDGRRDTGFGQNGRQSWNSATESYSSQEPDRNVRDKYGSEQHSRYRGE 480
NMHVSHFREHANKDGRRD GFGQNGRQ+WNSATESYSSQEPDR V+DKYGSEQHS++RGE
Sbjct: 421 NMHVSHFREHANKDGRRDAGFGQNGRQTWNSATESYSSQEPDRTVKDKYGSEQHSKFRGE 480
Query: 481 THNTSVANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDP 540
THNTSVANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDP
Sbjct: 481 THNTSVANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDP 540
Query: 541 FTTGLVGVVKRKKDVIKQIDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARR 600
FT GLVGVVKRKKDVIKQ DFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARR
Sbjct: 541 FTAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARR 600
Query: 601 EEEERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRILLEEE 660
EEEERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRI LEEE
Sbjct: 601 EEEERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRIFLEEE 660
Query: 661 RRKQAAKLKLLELEERMAKRQAEAVKSSSLTSDIPEKKIPSVVKDVSRLADTVDWEDGEK 720
RRKQAAKLKLLELEE++AKRQAEAVKSS+ SDIPEKKIPSVVKDVSRL DTVDWEDGEK
Sbjct: 661 RRKQAAKLKLLELEEKIAKRQAEAVKSSTSNSDIPEKKIPSVVKDVSRLVDTVDWEDGEK 720
Query: 721 MVERITTSASSESSSIIRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQFV 780
MVERITTSASSESSSI RSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQFV
Sbjct: 721 MVERITTSASSESSSINRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQFV 780
Query: 781 LQDQSTGYNGTRREASTGGRVSSRKELYGGAGFTTSKTSHRRGITEPQSDEYSQLRGQRP 840
LQDQSTGYNG RRE STGGRVSSRKE YGGA FTTSKTSHRRGITEPQSDEYSQLRGQRP
Sbjct: 781 LQDQSTGYNGPRREVSTGGRVSSRKEFYGGAAFTTSKTSHRRGITEPQSDEYSQLRGQRP 840
Query: 841 NLSGGGDHYNRSQEFDPEFQDNVENFGDHGWRQESGRNNFYFPYPERVNPISETDGSYSV 900
NLSGG DHYNR+QEFD +FQDNVENFGDHGWRQESG NNFYFPYPERVNPISETDGSYSV
Sbjct: 841 NLSGGVDHYNRTQEFDSDFQDNVENFGDHGWRQESGHNNFYFPYPERVNPISETDGSYSV 900
Query: 901 GRSRYSHRQPRVLPPPSVASIQKSSVRGEYESVPRDIVESEIQYDHPASNISTAQTRYIH 960
GRSRYS RQPRVLPPPSVAS+QKSSVR EYESVPRDI ESEIQYDHPASNISTAQT YIH
Sbjct: 901 GRSRYSQRQPRVLPPPSVASMQKSSVRNEYESVPRDI-ESEIQYDHPASNISTAQTMYIH 960
Query: 961 HENRALPEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGD 1020
HENRALPEIIDVNLENGENEEQK DGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGD
Sbjct: 961 HENRALPEIIDVNLENGENEEQKTDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGD 1020
Query: 1021 SPVLSASREGTLSIEDNESAEPAKVGKEIMITSARVSTGDEDEWGVVDEHVQEQEEYDED 1080
SPVLSASREGTLSIEDN+SA PAK GKEIMITS RVSTGDEDEWG VDEHVQEQEEYDED
Sbjct: 1021 SPVLSASREGTLSIEDNDSAVPAKAGKEIMITSTRVSTGDEDEWGAVDEHVQEQEEYDED 1080
Query: 1081 DDGYQEEDEVHEGEDENIDLVQDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFER 1140
DDGYQEEDEVHEGEDENIDLV DFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFER
Sbjct: 1081 DDGYQEEDEVHEGEDENIDLVPDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFER 1140
Query: 1141 IPGNEENIYVAPEISNGIREEQGSSEGLQVDG-KICQYVDASSQIRIDPEEMQDLVMQSK 1200
IPGNEEN+YVA EISN IREE+GSSEGLQVDG K+CQYVDASSQIRIDPEEMQDLVMQSK
Sbjct: 1141 IPGNEENLYVASEISNDIREERGSSEGLQVDGNKVCQYVDASSQIRIDPEEMQDLVMQSK 1200
Query: 1201 IAQALPESEITEQGNSSCRSSVSVQQPISSSVSMASQPISGQVIVPSTAVSGQAESPVKL 1260
IAQALP+SEITEQGN+SCRSSVSV+QPISSSVSMASQ ISGQVIVPS AVSGQAE PVKL
Sbjct: 1201 IAQALPDSEITEQGNASCRSSVSVRQPISSSVSMASQSISGQVIVPS-AVSGQAEPPVKL 1260
Query: 1261 QFGLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVS 1320
QFGLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVS
Sbjct: 1261 QFGLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVS 1320
Query: 1321 QGVLPLAPQPLTFVPPTVQTGFPLNKNPGEALSIHPSQETSAHNSRKNDVLPFLMDNQQG 1380
GVLPLAPQPLTF PTVQTGF LNKNPG+ LSIHPSQET AH+SRKND PF MDNQQG
Sbjct: 1321 PGVLPLAPQPLTFA-PTVQTGFSLNKNPGDGLSIHPSQETCAHSSRKNDSSPFSMDNQQG 1380
Query: 1381 IVSRSLNVNPSGESKSLPLAESIESKVMTPQDQTVGSCIDESNSRSEPGFQAEHQRHRVS 1440
+VSRSLNVNPSGESKSLPL ES+ESKV++PQDQ SCIDESNSRSEPGFQAEH R VS
Sbjct: 1381 LVSRSLNVNPSGESKSLPLTESMESKVVSPQDQAAVSCIDESNSRSEPGFQAEHHRLHVS 1440
Query: 1441 TSDNHYVVSRGKESEGRAQDGMGSFDSVSRDKGLSGLKARGQFPGGRGKKYIFTVKNSGS 1500
TSDNHYVVSRGKESEGRAQDGMGSFDS SR+KG SGLK RGQFPGGRGKKYIFTVKNSGS
Sbjct: 1441 TSDNHYVVSRGKESEGRAQDGMGSFDSASRNKGSSGLKGRGQFPGGRGKKYIFTVKNSGS 1500
Query: 1501 RLPFPGSESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSNSQVSSNHVGVDDKPTVSG 1560
RLPFP SESTRL+TGGFQRRPRRNI RTEFRVRET DKKLSNSQVSSNHVGVDDKPTVSG
Sbjct: 1501 RLPFPVSESTRLETGGFQRRPRRNITRTEFRVRETADKKLSNSQVSSNHVGVDDKPTVSG 1560
Query: 1561 RTAVNSARNGTRKVVISNKPSKRALESEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQG 1620
RTAV+SARNGTRKV++SNK SKRALESEGLSSG STS+ELDAGNRSEKGVKKEYLGKSQG
Sbjct: 1561 RTAVSSARNGTRKVIMSNKSSKRALESEGLSSGVSTSVELDAGNRSEKGVKKEYLGKSQG 1620
Query: 1621 SQYSGEGNFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRRE 1680
SQYSGEG+FR+NICSGED D PLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRRE
Sbjct: 1621 SQYSGEGSFRRNICSGEDADTPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRRE 1680
Query: 1681 QREKEIKAKSHNSKIPRKSRSTSKSALSSVTSSKMYATKEAETVKRTRSDFVAPDGGGRG 1740
QREKEIKAKSHN+KIPRK RST KSALSSV+SSK+YA KEAETVKRTRSDFVA DGG RG
Sbjct: 1681 QREKEIKAKSHNTKIPRKGRSTLKSALSSVSSSKVYAPKEAETVKRTRSDFVAADGGVRG 1740
Query: 1741 SGNIVVSSAFSSPVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLDSS 1800
SGN+VVSSAFS PVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALAT+DGRNLDSS
Sbjct: 1741 SGNVVVSSAFSPPVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATNDGRNLDSS 1800
Query: 1801 MMFDKKDDILDNVQSSFTSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSGLAG 1860
+MFDKKDDILDNVQSSF SWGNSRINQQV+ALTQTQLDEAMKPAQFDLHPP AG
Sbjct: 1801 LMFDKKDDILDNVQSSFASWGNSRINQQVIALTQTQLDEAMKPAQFDLHPP-------AG 1860
Query: 1861 DPNVPSPSILAMDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGAPTGL 1920
D NVPSPSILAMDRS+SSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIG PTGL
Sbjct: 1861 DTNVPSPSILAMDRSYSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGTPTGL 1920
Query: 1921 CHSDIPISRKLSGAENDCHIFFEKEKHHSESCTHIEDSEAEAEAAASAVAVAAISSDEMV 1980
CHSDI I KLSGAENDCH+FFEKEKH ESCTHIEDSEAEAEAAASAVAVAAISSDEMV
Sbjct: 1921 CHSDISIPHKLSGAENDCHLFFEKEKHRPESCTHIEDSEAEAEAAASAVAVAAISSDEMV 1980
Query: 1981 TNGIGTCSVSVTDTNNFGGGDINVITAGSVEGSAGDQQLACKTRADDSLTVALPADLSVE 2040
TNGIGTCSVSV+DTNNFG GDINVI GS GDQQLA KTRADDSLTVALPADLSVE
Sbjct: 1981 TNGIGTCSVSVSDTNNFGSGDINVIAT----GSTGDQQLASKTRADDSLTVALPADLSVE 2040
Query: 2041 TPPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQA 2100
TPPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQA
Sbjct: 2041 TPPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQA 2100
Query: 2101 QTQKSSAPAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAP 2160
QTQKSSAPAPGPLGSWK CHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAP
Sbjct: 2101 QTQKSSAPAPGPLGSWKHCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAP 2160
Query: 2161 VGQFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGVEGDQKNLNMVSAQRMPTNLPPIQH 2220
VGQFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGV+GDQKNLNMVSAQRMP NLPPIQH
Sbjct: 2161 VGQFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGVDGDQKNLNMVSAQRMPANLPPIQH 2220
Query: 2221 LAPGSPLLPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPM-QQQAEGILP 2280
LAPGSPLLPMASPLAMFDVSPFQASPEMSVQ RWPSSAS QPVPLSMPM QQQAEGILP
Sbjct: 2221 LAPGSPLLPMASPLAMFDVSPFQASPEMSVQTRWPSSASPAQPVPLSMPMQQQQAEGILP 2280
Query: 2281 SHFSHASSSDPTFTVNRFPGSQPSVASDHKRNFTVAADATVTQLPDELGIVEASSCVGSG 2340
SHFSHASSSDPTF+VNRFPGSQ SVASDHKRNFTV+ADATVTQLPDELGIV++SSCV SG
Sbjct: 2281 SHFSHASSSDPTFSVNRFPGSQASVASDHKRNFTVSADATVTQLPDELGIVDSSSCVSSG 2340
Query: 2341 ASVPNADINSLLVNSVTDAGKTGVQNC-SSSNSGQ-NAGTNLKSQSSHHKGV-SAQQYSH 2400
ASVPN DINSL SVTDAG+TGV+NC SSSNSGQ NAGTNLKS S HHKG+ SAQQYSH
Sbjct: 2341 ASVPNVDINSL---SVTDAGQTGVKNCSSSSNSGQNNAGTNLKS-SLHHKGISSAQQYSH 2400
Query: 2401 SSGYNYQRGGASQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSNGNL 2452
SSGYNYQRGGASQKNSSGGSEW HRRTGF+GRNQSGAEKNFSSAKMKQIYVAKQPSNGNL
Sbjct: 2401 SSGYNYQRGGASQKNSSGGSEWSHRRTGFVGRNQSGAEKNFSSAKMKQIYVAKQPSNGNL 2444
BLAST of HG10016734 vs. ExPASy TrEMBL
Match:
A0A1S3B1H0 (LOW QUALITY PROTEIN: uncharacterized protein LOC103484772 OS=Cucumis melo OX=3656 GN=LOC103484772 PE=4 SV=1)
HSP 1 Score: 4269.2 bits (11071), Expect = 0.0e+00
Identity = 2267/2463 (92.04%), Postives = 2331/2463 (94.64%), Query Frame = 0
Query: 1 MANPGVGTKFVSVNLNKSYGQA------HHHHSSHSNSYGSNRTRPGGHGAGGGMVVLSR 60
MANPGVGTKFVSVNLNKSYGQ HHHHSSHSNSYGSNRTRPGGHG GGGMVVLSR
Sbjct: 1 MANPGVGTKFVSVNLNKSYGQTHHHHHHHHHHSSHSNSYGSNRTRPGGHGVGGGMVVLSR 60
Query: 61 PRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKP 120
PRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKP
Sbjct: 61 PRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKP 120
Query: 121 RTSDLPEKEGVSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGTTGPIVSTSASSQVHT 180
RT+DLPEKEG SANIVDKIDPSLRSVDGVSGGSSVYMPPSARAG TGP+VSTSASSQVH
Sbjct: 121 RTNDLPEKEGPSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGMTGPVVSTSASSQVHA 180
Query: 181 AVEKAPVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEGSYEEQRDTSHLSSRID 240
AVEK+PVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKH +EGSYEEQRD++HLSSRID
Sbjct: 181 AVEKSPVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHVSEGSYEEQRDSAHLSSRID 240
Query: 241 ARSKFQSSQKSIPCENARNGNSYSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDE 300
ARS +QSSQKS+ ENA+NGNS+SSG+FQSPE SRKQEDIFPGPLPLVSMNPRSDWADDE
Sbjct: 241 ARSNYQSSQKSVRSENAKNGNSFSSGTFQSPESSRKQEDIFPGPLPLVSMNPRSDWADDE 300
Query: 301 RDTSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSS 360
RDTSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNL DDESGKFHSS
Sbjct: 301 RDTSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLPDDESGKFHSS 360
Query: 361 DIHKVDSYGRDVRTAGREGWE-GNFRKNNPIPKDGFGSDSGNDRNDIAGRPTSLDRETNA 420
DIHKVD YGRD R A R+GWE GNFRKNNP+PKDGFGSD+GNDRN IAGR TS+DRETNA
Sbjct: 361 DIHKVDPYGRDSRMASRDGWEGGNFRKNNPVPKDGFGSDNGNDRNAIAGRLTSVDRETNA 420
Query: 421 DNMHVSHFREHANKDGRRDTGFGQNGRQSWNSATESYSSQEPDRNVRDKYGSEQHSRYRG 480
DNMHVSHFREHANKDGRRD GFGQNGRQ+WNSATESYSSQEPDR V+DKYGSEQHSR+RG
Sbjct: 421 DNMHVSHFREHANKDGRRDAGFGQNGRQTWNSATESYSSQEPDRTVKDKYGSEQHSRFRG 480
Query: 481 ETHNTSVANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRD 540
ETHNTSVANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRD
Sbjct: 481 ETHNTSVANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRD 540
Query: 541 PFTTGLVGVVKRKKDVIKQIDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELAR 600
PFT GLVGVVKRKKDVIKQ DFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELAR
Sbjct: 541 PFTAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELAR 600
Query: 601 REEEERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRILLEE 660
REEEERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRI LEE
Sbjct: 601 REEEERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRIFLEE 660
Query: 661 ERRKQAAKLKLLELEERMAKRQAEAVKSSSLTSDIPEKKIPSVVKDVSRLADTVDWEDGE 720
ERRKQAAKLKLLELEE++AKRQAEAVKSS+ SDIPEKKIPSVVKDVSRL DTVDWEDGE
Sbjct: 661 ERRKQAAKLKLLELEEKIAKRQAEAVKSSTSNSDIPEKKIPSVVKDVSRLVDTVDWEDGE 720
Query: 721 KMVERITTSASSESSSIIRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQF 780
KMVERITTSASSESSSI RSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQF
Sbjct: 721 KMVERITTSASSESSSINRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQF 780
Query: 781 VLQDQSTGYNGTRREASTGGRVSSRKELYGGAGFTTSKTSHRRGITEPQSDEYSQLRGQR 840
VLQDQSTGYNG RRE STGGRVSSRKE YGGA FTTSKTSHRRGITEPQSDEYSQLRGQR
Sbjct: 781 VLQDQSTGYNGPRREVSTGGRVSSRKEFYGGAAFTTSKTSHRRGITEPQSDEYSQLRGQR 840
Query: 841 PNLSGGGDHYNRSQEFDPEFQDNVENFGDHGWRQESGRNNFYFPYPERVNPISETDGSYS 900
PNLSGG DHYNR+QEFD +FQDNVENFGDHGWRQESG NNFYFPYPERVNPISETDGSYS
Sbjct: 841 PNLSGGVDHYNRTQEFDSDFQDNVENFGDHGWRQESGHNNFYFPYPERVNPISETDGSYS 900
Query: 901 VGRSRYSHRQPRVLPPPSVASIQKSSVRGEYESVPRDIVESEIQYDHPASNISTAQTRYI 960
VGRSRYS RQPRVLPPPSVAS+QKSSVR EYESVPRDI ESEIQYDHPASNISTAQT YI
Sbjct: 901 VGRSRYSQRQPRVLPPPSVASMQKSSVRNEYESVPRDI-ESEIQYDHPASNISTAQTMYI 960
Query: 961 HHENRALPEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSG 1020
HHENRALPEIIDVNLENGENEEQK DGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSG
Sbjct: 961 HHENRALPEIIDVNLENGENEEQKTDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSG 1020
Query: 1021 DSPVLSASREGTLSIEDNESAEPAKVGKEIMITSARVSTGDEDEWGVVDEHVQEQEEYDE 1080
DSPVLSASREGTLSIEDN+SA PAK GKEIMITS RVSTGDEDEWG VDEHVQEQEEYDE
Sbjct: 1021 DSPVLSASREGTLSIEDNDSAVPAKAGKEIMITSTRVSTGDEDEWGAVDEHVQEQEEYDE 1080
Query: 1081 DDDGYQEEDEVHEGEDENIDLVQDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFE 1140
DDDGYQEEDEVHEGEDENIDLV DFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFE
Sbjct: 1081 DDDGYQEEDEVHEGEDENIDLVPDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFE 1140
Query: 1141 RIPGNEENIYVAPEISNGIREEQGSSEGLQVDG-KICQYVDASSQIRIDPEEMQDLVMQS 1200
RIPGNEEN+YVA EISN IREE+GSSEGLQVDG K+CQYVDASSQIRIDPEEMQDLVMQS
Sbjct: 1141 RIPGNEENLYVASEISNDIREERGSSEGLQVDGNKVCQYVDASSQIRIDPEEMQDLVMQS 1200
Query: 1201 KIAQALPESEITEQGNSSCRSSVSVQQPISSSVSMASQPISGQVIVPSTAVSGQAESPVK 1260
K AQALP+SEITEQGN+SCRSSVSV+QPISSSVSMASQ ISGQVIVPS AVSGQAE PVK
Sbjct: 1201 KTAQALPDSEITEQGNASCRSSVSVRQPISSSVSMASQSISGQVIVPS-AVSGQAEPPVK 1260
Query: 1261 LQFGLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSV 1320
LQFGLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSV
Sbjct: 1261 LQFGLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSV 1320
Query: 1321 SQGVLPLAPQPLTFVPPTVQTGFPLNKNPGEALSIHPSQETSAHNSRKNDVLPFLMDNQQ 1380
S GVLPLAPQPLTF PTVQTGF LNKNPG+ LSIHPSQET AH+SRKND PF MDNQQ
Sbjct: 1321 SPGVLPLAPQPLTFA-PTVQTGFSLNKNPGDGLSIHPSQETCAHSSRKNDSSPFSMDNQQ 1380
Query: 1381 GIVSRSLNVNPSGESKSLPLAESIESKVMTPQDQTVGSCIDESNSRSEPGFQAEHQRHRV 1440
G+VSRSLNVNPSGESKSLPL ES+ESKV++PQDQ SCIDESNSRSEPGFQAEH R V
Sbjct: 1381 GLVSRSLNVNPSGESKSLPLTESMESKVVSPQDQAAVSCIDESNSRSEPGFQAEHHRLHV 1440
Query: 1441 STSDNHYVVSRGKESEGRAQDGMGSFDSVSRDKGLSGLKARGQFPGGRGKKYIFTVKNSG 1500
STSDNHYVVSRGKESEGRAQDGMGSFDS SR+KG SGLK RGQFPGGRGKKYIFTVKNSG
Sbjct: 1441 STSDNHYVVSRGKESEGRAQDGMGSFDSASRNKGSSGLKGRGQFPGGRGKKYIFTVKNSG 1500
Query: 1501 SRLPFPGSESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSNSQVSSNHVGVDDKPTVS 1560
SRLPFP SESTRL+TGGFQRRPRRNI RTEFRVRET DKKLSNSQVSSNHVGVDDKPTVS
Sbjct: 1501 SRLPFPVSESTRLETGGFQRRPRRNITRTEFRVRETADKKLSNSQVSSNHVGVDDKPTVS 1560
Query: 1561 GRTAVNSARNGTRKVVISNKPSKRALESEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQ 1620
GRTAV+SARNGTRKV++SNK SKRALESEGLSSG STS+ELDAGNRSEKGVKKEYLGKSQ
Sbjct: 1561 GRTAVSSARNGTRKVIMSNKSSKRALESEGLSSGVSTSVELDAGNRSEKGVKKEYLGKSQ 1620
Query: 1621 GSQYSGEGNFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRR 1680
GSQYSGEG+FR+NICSGED D PLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRR
Sbjct: 1621 GSQYSGEGSFRRNICSGEDADTPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRR 1680
Query: 1681 EQREKEIKAKSHNSKIPRKSRSTSKSALSSVTSSKMYATKEAETVKRTRSDFVAPDGGGR 1740
EQREKEIKAKSHN+KIPRK RST KSALSSV+SSK+YA KEAETVKRTRSDFVA DGG R
Sbjct: 1681 EQREKEIKAKSHNTKIPRKGRSTLKSALSSVSSSKVYAPKEAETVKRTRSDFVAADGGVR 1740
Query: 1741 GSGNIVVSSAFSSPVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLDS 1800
GSGN+VVSSAFS PVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALAT+DGRNLDS
Sbjct: 1741 GSGNVVVSSAFSPPVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATNDGRNLDS 1800
Query: 1801 SMMFDKKDDILDNVQSSFTSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSGLA 1860
S+MFDKKDDILDNVQSSF SWGNSRINQQV+ALTQTQLDEAMKPAQFDLHPP A
Sbjct: 1801 SLMFDKKDDILDNVQSSFASWGNSRINQQVIALTQTQLDEAMKPAQFDLHPP-------A 1860
Query: 1861 GDPNVPSPSILAMDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGAPTG 1920
GD NVPSPSILAMDRS+SSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIG PTG
Sbjct: 1861 GDTNVPSPSILAMDRSYSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGTPTG 1920
Query: 1921 LCHSDIPISRKLSGAENDCHIFFEKEKHHSESCTHIEDSEAEAEAAASAVAVAAISSDEM 1980
LCHSDI I KLSGAENDCH+FFEKEKH ESCTHIEDSEAEAEAAASAVAVAAISSDEM
Sbjct: 1921 LCHSDISIPHKLSGAENDCHLFFEKEKHRPESCTHIEDSEAEAEAAASAVAVAAISSDEM 1980
Query: 1981 VTNGIGTCSVSVTDTNNFGGGDINVITAGSVEGSAGDQQLACKTRADDSLTVALPADLSV 2040
VTNGIGTCSVSV+DTNNFG GDINVI GS GDQQLA KTRADDSLTVALPADLSV
Sbjct: 1981 VTNGIGTCSVSVSDTNNFGSGDINVIAT----GSTGDQQLASKTRADDSLTVALPADLSV 2040
Query: 2041 ETPPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQ 2100
ETPPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQ
Sbjct: 2041 ETPPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQ 2100
Query: 2101 AQTQKSSAPAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFA 2160
AQTQKSSAPAPGPLGSWK CHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFA
Sbjct: 2101 AQTQKSSAPAPGPLGSWKHCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFA 2160
Query: 2161 PVGQFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGVEGDQKNLNMVSAQRMPTNLPPIQ 2220
PVGQFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGV+GDQKNLNMVSAQRMP NLPPIQ
Sbjct: 2161 PVGQFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGVDGDQKNLNMVSAQRMPANLPPIQ 2220
Query: 2221 HLAPGSPLLPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPM-QQQAEGIL 2280
HLAPGSPLLPMASPLAMFDVSPFQASPEMSVQ RWPSS S QPVPLSMPM QQQAEGIL
Sbjct: 2221 HLAPGSPLLPMASPLAMFDVSPFQASPEMSVQTRWPSSVSPAQPVPLSMPMQQQQAEGIL 2280
Query: 2281 PSHFSHASSSDPTFTVNRFPGSQPSVASDHKRNFTVAADATVTQLPDELGIVEASSCVGS 2340
PSHFSHASSSDPTF+VNRFPGSQ SVASDHKRNFTV+ADATVTQLPDELGIV++SSCV S
Sbjct: 2281 PSHFSHASSSDPTFSVNRFPGSQASVASDHKRNFTVSADATVTQLPDELGIVDSSSCVSS 2340
Query: 2341 GASVPNADINSLLVNSVTDAGKTGVQNC-SSSNSGQ-NAGTNLKSQSSHHKGV-SAQQYS 2400
GASVPN DINSL SVTDAG+TGV+NC SSSNSGQ NAGTNLKS S HHKG+ SAQQYS
Sbjct: 2341 GASVPNVDINSL---SVTDAGQTGVKNCSSSSNSGQNNAGTNLKS-SLHHKGISSAQQYS 2400
Query: 2401 HSSGYNYQRGGASQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSNGN 2452
HSSGYNYQRGGASQKNSSGGSEW HRRTGF+GRNQSGAEKNFSSAKMKQIYVAKQPSNGN
Sbjct: 2401 HSSGYNYQRGGASQKNSSGGSEWSHRRTGFVGRNQSGAEKNFSSAKMKQIYVAKQPSNGN 2445
BLAST of HG10016734 vs. ExPASy TrEMBL
Match:
A0A0A0KLC4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G490850 PE=4 SV=1)
HSP 1 Score: 4267.2 bits (11066), Expect = 0.0e+00
Identity = 2266/2460 (92.11%), Postives = 2324/2460 (94.47%), Query Frame = 0
Query: 1 MANPGVGTKFVSVNLNKSYGQA----HHHHSSHSNSYGSNRTRPGGHGAGGGMVVLSRPR 60
MANPGVGTKFVSVNLNKSYGQ HHHHSSHSNSYGSNRTRPGGHG GGGMVVLSRPR
Sbjct: 1 MANPGVGTKFVSVNLNKSYGQTHHHHHHHHSSHSNSYGSNRTRPGGHGVGGGMVVLSRPR 60
Query: 61 SSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPRT 120
SSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPRT
Sbjct: 61 SSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPRT 120
Query: 121 SDLPEKEGVSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGTTGPIVSTSASSQVHTAV 180
+DLPEKEG SA IVDKIDPSLRSVDGVSGGSSVYMPPSARAG TGP+VSTSASS VH V
Sbjct: 121 NDLPEKEGPSATIVDKIDPSLRSVDGVSGGSSVYMPPSARAGMTGPVVSTSASSHVHATV 180
Query: 181 EKAPVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEGSYEEQRDTSHLSSRIDAR 240
EK+PVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKH +EGSYEEQRDT+HLSSRID R
Sbjct: 181 EKSPVLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHGSEGSYEEQRDTTHLSSRIDDR 240
Query: 241 SKFQSSQKSIPCENARNGNSYSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERD 300
SK+QSSQKS+ ENA+NGNS+SSG+FQSPE SRKQEDIFPGPLPLVSMNPRSDWADDERD
Sbjct: 241 SKYQSSQKSVRSENAKNGNSFSSGTFQSPESSRKQEDIFPGPLPLVSMNPRSDWADDERD 300
Query: 301 TSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSDI 360
TSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSDI
Sbjct: 301 TSHGLIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSDI 360
Query: 361 HKVDSYGRDVRTAGREGWEGNFRKNNPIPKDGFGSDSGNDRNDIAGRPTSLDRETNADNM 420
HKVD YGRD R A REGWEGNFRKNNP+PKDGFGSD+ NDRN IAGRPTS+DRETNADN
Sbjct: 361 HKVDPYGRDARVASREGWEGNFRKNNPVPKDGFGSDNANDRNAIAGRPTSVDRETNADNT 420
Query: 421 HVSHFREHANKDGRRDTGFGQNGRQSWNSATESYSSQEPDRNVRDKYGSEQHSRYRGETH 480
HVSHFREHANKDGRRDTGFGQNGRQ+WNSATESYSSQEPDR V+DKYGSEQH+R+RGETH
Sbjct: 421 HVSHFREHANKDGRRDTGFGQNGRQTWNSATESYSSQEPDRTVKDKYGSEQHNRFRGETH 480
Query: 481 NTSVANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDPFT 540
NTSVANSSYSSGLKRIPADEPLLNFGRDRRS+AKIEKPYMEDPFMKDFGASSFDGRDPFT
Sbjct: 481 NTSVANSSYSSGLKRIPADEPLLNFGRDRRSYAKIEKPYMEDPFMKDFGASSFDGRDPFT 540
Query: 541 TGLVGVVKRKKDVIKQIDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREE 600
GLVGVVKRKKDVIKQ DFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREE
Sbjct: 541 AGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREE 600
Query: 601 EERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRILLEEERR 660
EERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRI LEEERR
Sbjct: 601 EERQRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRIFLEEERR 660
Query: 661 KQAAKLKLLELEERMAKRQAEAVKSSSLTSDIPEKKIPSVVKDVSRLADTVDWEDGEKMV 720
KQ AKLKLLELEE++AKRQAEAVKSS+ SDIPEKKIPSVVKDVSRL DTVDWEDGEKMV
Sbjct: 661 KQGAKLKLLELEEKIAKRQAEAVKSSTSNSDIPEKKIPSVVKDVSRLVDTVDWEDGEKMV 720
Query: 721 ERITTSASSESSSIIRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQFVLQ 780
ERITTSASSESSSI RSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFY+RGSGSQFVLQ
Sbjct: 721 ERITTSASSESSSINRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYDRGSGSQFVLQ 780
Query: 781 DQSTGYNGTRREASTGGRVSSRKELYGGAGFTTSKTSHRRGITEPQSDEYSQLRGQRPNL 840
DQSTGYNG RRE STGGRVSSRKE YGGA FTTSKTSHRRGITEPQSDEYS LRGQRPNL
Sbjct: 781 DQSTGYNGPRREVSTGGRVSSRKEFYGGAAFTTSKTSHRRGITEPQSDEYS-LRGQRPNL 840
Query: 841 SGGGDHYNRSQEFDPEFQDNVENFGDHGWRQESGRNNFYFPYPERVNPISETDGSYSVGR 900
SGG DHYN++QEFD +FQDNVENFGDHGWRQESG NNFYFPYPERVNPISETDGSYSVGR
Sbjct: 841 SGGVDHYNKTQEFDSDFQDNVENFGDHGWRQESGHNNFYFPYPERVNPISETDGSYSVGR 900
Query: 901 SRYSHRQPRVLPPPSVASIQKSSVRGEYESVPRDIVESEIQYDHPASNISTAQTRYIHHE 960
SRYS RQPRVLPPPSVAS+QKSSVR EYESV RDIVESEIQYDHPASNISTAQT YIHHE
Sbjct: 901 SRYSQRQPRVLPPPSVASMQKSSVRNEYESVSRDIVESEIQYDHPASNISTAQTMYIHHE 960
Query: 961 NRALPEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSP 1020
NRALPEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSP
Sbjct: 961 NRALPEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSP 1020
Query: 1021 VLSASREGTLSIEDNESAEP-AKVGKEIMITSARVSTGDEDEWGVVDEHVQEQEEYDEDD 1080
VLSASREGTLSIEDNESA P AK GKEIMITS RVSTGDEDEWG VDEHVQEQEEYDEDD
Sbjct: 1021 VLSASREGTLSIEDNESAVPAAKAGKEIMITSTRVSTGDEDEWGAVDEHVQEQEEYDEDD 1080
Query: 1081 DGYQEEDEVHEGEDENIDLVQDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERI 1140
DGYQEEDEVHEGEDENIDLVQDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERI
Sbjct: 1081 DGYQEEDEVHEGEDENIDLVQDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERI 1140
Query: 1141 PGNEENIYVAPEISNGIREEQGSSEGLQVDGKICQYVDASSQIRIDPEEMQDLVMQSKIA 1200
PGNEEN+YV EISN IREEQGSS+GLQVDG +CQYVDASSQIRIDPEEMQDLV+QSK A
Sbjct: 1141 PGNEENLYVTSEISNDIREEQGSSKGLQVDGNVCQYVDASSQIRIDPEEMQDLVLQSKTA 1200
Query: 1201 QALPESEITEQGNSSCRSSVSVQQPISSSVSMASQPISGQVIVPSTAVSGQAESPVKLQF 1260
QAL ESEITEQGNSSCRSSVSVQQPISSSVSMA Q ISGQVIVPS AVSGQAE PVKLQF
Sbjct: 1201 QALAESEITEQGNSSCRSSVSVQQPISSSVSMAPQSISGQVIVPS-AVSGQAEPPVKLQF 1260
Query: 1261 GLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVSQG 1320
GLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVS G
Sbjct: 1261 GLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVSPG 1320
Query: 1321 VLPLAPQPLTFVPPTVQTGFPLNKNPGEALSIHPSQETSAHNSRKNDVLPFLMDNQQGIV 1380
VLPLAPQPLTFVPPTVQTGF L KNPG+ LSIHPSQET AH+SRKN+V PFLMDNQQG+V
Sbjct: 1321 VLPLAPQPLTFVPPTVQTGFSLKKNPGDGLSIHPSQETCAHSSRKNNVSPFLMDNQQGLV 1380
Query: 1381 SRSLNVNPSGESKSLPLAESIESKVMTPQDQTVGSCIDESNSRSEPGFQAEHQRHRVSTS 1440
SRSLNVNPSGES+SLPLAESIESKV+TP DQT SCIDESNSR EPGFQAEH R RVS+S
Sbjct: 1381 SRSLNVNPSGESESLPLAESIESKVVTPHDQTAVSCIDESNSRPEPGFQAEHHRLRVSSS 1440
Query: 1441 DNHYVVSRGKESEGRAQDGMGSFDSVSRDKGLSGLKARGQFPGGRGKKYIFTVKNSGSRL 1500
DN YVVSRGKESEGRA DGMGSFDSVSR+KGLSGLK RGQFPGGRGKKYIFTVKNSGSRL
Sbjct: 1441 DNRYVVSRGKESEGRAPDGMGSFDSVSRNKGLSGLKGRGQFPGGRGKKYIFTVKNSGSRL 1500
Query: 1501 PFPGSESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSNSQVSSNHVGVDDKPTVSGRT 1560
PFP SESTRL+TGGFQRRPRRNI RTEFRVRET DKKLSNSQVSSNHVGVDDKPTVSGRT
Sbjct: 1501 PFPVSESTRLETGGFQRRPRRNITRTEFRVRETADKKLSNSQVSSNHVGVDDKPTVSGRT 1560
Query: 1561 AVNSARNGTRKVVISNKPSKRALESEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQGSQ 1620
AVNSARNGTRKV++SNKPSKRALESEGLSSG STS+ELDAGNRSEKGVKKEY GKSQGSQ
Sbjct: 1561 AVNSARNGTRKVIVSNKPSKRALESEGLSSGVSTSVELDAGNRSEKGVKKEYSGKSQGSQ 1620
Query: 1621 YSGEGNFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQR 1680
YSGEGNFR+NICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQR
Sbjct: 1621 YSGEGNFRRNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQR 1680
Query: 1681 EKEIKAKSHNSKIPRKSRSTSKSALSSVTSSKMYATKEAETVKRTRSDFVAPDGGGRGSG 1740
EKEIKAKSHNSKIPRK RSTSKSALSSV SSK+YA KEAETVKRTRSDFVA DGG RGSG
Sbjct: 1681 EKEIKAKSHNSKIPRKGRSTSKSALSSVNSSKVYAPKEAETVKRTRSDFVAADGGVRGSG 1740
Query: 1741 NIVVSSAFSSPVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLDSSMM 1800
N+VVSSAFS PVVSQPLAPIGTPALKSDSQ+ERSHTARSIQTSGP LAT+DGRNLDSSMM
Sbjct: 1741 NVVVSSAFSPPVVSQPLAPIGTPALKSDSQSERSHTARSIQTSGPTLATNDGRNLDSSMM 1800
Query: 1801 FDKKDDILDNVQSSFTSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSGLAGDP 1860
FDKKDDILDNVQSSFTSWGNSRINQQV+ALTQTQLDEAMKPAQFDLHPP AGD
Sbjct: 1801 FDKKDDILDNVQSSFTSWGNSRINQQVIALTQTQLDEAMKPAQFDLHPP-------AGDT 1860
Query: 1861 NVPSPSILAMDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGAPTGLCH 1920
NVPSPSILAMDRSFSSAANPISSLLAGEKIQF G CSTLLGIGAPTGLCH
Sbjct: 1861 NVPSPSILAMDRSFSSAANPISSLLAGEKIQF-----------GDCSTLLGIGAPTGLCH 1920
Query: 1921 SDIPISRKLSGAENDCHIFFEKEKHHSESCTHIEDSEAEAEAAASAVAVAAISSDEMVTN 1980
SDIPI KLSGA+NDCH+FFEKEKH SESCTHIEDSEAEAEAAASAVAVAAISSDEMVTN
Sbjct: 1921 SDIPIPHKLSGADNDCHLFFEKEKHRSESCTHIEDSEAEAEAAASAVAVAAISSDEMVTN 1980
Query: 1981 GIGTCSVSVTDTNNFGGGDINVITAGSVEGSAGDQQLACKTRADDSLTVALPADLSVETP 2040
GIGTCSVSVTDTNNFGGGDINV T GS GDQQLA KTRADDSLTVALPADLSVETP
Sbjct: 1981 GIGTCSVSVTDTNNFGGGDINVAT-----GSTGDQQLASKTRADDSLTVALPADLSVETP 2040
Query: 2041 PISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQAQT 2100
PISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQAQT
Sbjct: 2041 PISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQAQT 2100
Query: 2101 QKSSAPAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAPVG 2160
QKSSAPAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAPVG
Sbjct: 2101 QKSSAPAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAPVG 2160
Query: 2161 QFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGVEGDQKNLNMVSAQRMPTNLPPIQHLA 2220
QFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGV+GDQKNLNMVSAQRMPTNLPPIQHLA
Sbjct: 2161 QFGQVGLSFMGATYIPSGKQHDWKHSPGPSSLGVDGDQKNLNMVSAQRMPTNLPPIQHLA 2220
Query: 2221 PGSPLLPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPM-QQQAEGILPSH 2280
PGSPLLPMASPLAMFDVSPFQASPEMSVQ RWPSSAS VQPVPLSMPM QQQAEGILPSH
Sbjct: 2221 PGSPLLPMASPLAMFDVSPFQASPEMSVQTRWPSSASPVQPVPLSMPMQQQQAEGILPSH 2280
Query: 2281 FSHASSSDPTFTVNRFPGSQPSVASDHKRNFTVAADATVTQLPDELGIVEASSCVGSGAS 2340
FSHASSSDPTF+VNRF GSQPSVASD KRNFTV+ADATVTQLPDELGIV++SSCV SGAS
Sbjct: 2281 FSHASSSDPTFSVNRFSGSQPSVASDLKRNFTVSADATVTQLPDELGIVDSSSCVSSGAS 2340
Query: 2341 VPNADINSLLVNSVTDAGKTGVQNC-SSSNSGQ-NAGTNLKSQSSHHKGV-SAQQYSHSS 2400
VPN DINSL SVTDAGK GVQNC SSSNSGQ NAGT+LKSQ SHHKG+ SAQQYSHSS
Sbjct: 2341 VPNGDINSL---SVTDAGKAGVQNCSSSSNSGQNNAGTSLKSQ-SHHKGITSAQQYSHSS 2400
Query: 2401 GYNYQRGGASQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSNGNLRV 2452
GYNYQR GASQKNSSGGS+W HRRTGFMGR QSGAEKNFSSAKMKQIYVAKQPSNGNLRV
Sbjct: 2401 GYNYQRSGASQKNSSGGSDWTHRRTGFMGRTQSGAEKNFSSAKMKQIYVAKQPSNGNLRV 2431
BLAST of HG10016734 vs. ExPASy TrEMBL
Match:
A0A6J1GDR0 (uncharacterized protein LOC111453246 OS=Cucurbita moschata OX=3662 GN=LOC111453246 PE=4 SV=1)
HSP 1 Score: 4239.1 bits (10993), Expect = 0.0e+00
Identity = 2241/2451 (91.43%), Postives = 2312/2451 (94.33%), Query Frame = 0
Query: 1 MANPGVGTKFVSVNLNKSYGQAHHHHSSHSNSYGSNRTRPGGHGAGGGMVVLSRPRSSQK 60
MANPGVG KFVSVNLNKSYGQAHHHHSSHSNSYGSNRTRPG HGAGGGMVVLSRPRSSQK
Sbjct: 1 MANPGVGAKFVSVNLNKSYGQAHHHHSSHSNSYGSNRTRPGSHGAGGGMVVLSRPRSSQK 60
Query: 61 PGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPRTSDLP 120
PGPKLSVPPPLNLPSLRKEHERLDSLGSG G TGGGVLGN QRPTSAG+GWTKP T+DLP
Sbjct: 61 PGPKLSVPPPLNLPSLRKEHERLDSLGSGAGQTGGGVLGNRQRPTSAGLGWTKPHTNDLP 120
Query: 121 EKEGVSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGTTGPIVSTSASSQVHTAVEKAP 180
EKEG+S NIVDKIDPSLRSVDGV+GGSSVYMPPSARA T GP+VSTSASSQVHTAVEKAP
Sbjct: 121 EKEGLSGNIVDKIDPSLRSVDGVNGGSSVYMPPSARASTAGPVVSTSASSQVHTAVEKAP 180
Query: 181 VLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEGSYEEQRDTSHLSSRIDARSKFQ 240
VLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAE SYEEQRDTSHLSS IDARSKFQ
Sbjct: 181 VLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEVSYEEQRDTSHLSSSIDARSKFQ 240
Query: 241 SSQKSIPCENARNGNSYSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERDTSHG 300
SS+KSIP ENA+NGNS+SSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERDTSHG
Sbjct: 241 SSKKSIPSENAKNGNSFSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERDTSHG 300
Query: 301 LIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSDIHKVD 360
LIDRVRD GHPKSEAYWERDFDMP VSSLPHKP HNFSQRW+ RDDESGKFHSSDIHKVD
Sbjct: 301 LIDRVRDHGHPKSEAYWERDFDMPWVSSLPHKPIHNFSQRWHPRDDESGKFHSSDIHKVD 360
Query: 361 SYGRDVRTAGREGWEGNFRKNNPIPKDGFGSDSGNDRNDIAGRPTSLDRETNADNMHVSH 420
YGRD RT REGWEGNF+KNNPIPKD FGSDSGNDRNDIAGRPTS+DRETNADNMHVS
Sbjct: 361 PYGRDTRTPSREGWEGNFQKNNPIPKDRFGSDSGNDRNDIAGRPTSIDRETNADNMHVSQ 420
Query: 421 FREHANKDGRRDTGFGQNGRQSWNSATESYSSQEPDRNVRDKYGSEQHSRYRGETHNTSV 480
FREHA K GRRDTGF GRQ+WNSA+ESY+SQ+PD V+DK+GSEQH+++RG+THNTSV
Sbjct: 421 FREHAPKVGRRDTGF---GRQTWNSASESYNSQDPDWTVKDKHGSEQHNKFRGQTHNTSV 480
Query: 481 ANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDPFTTGLV 540
+NSSYS GLKRIPAD+ LLNFGRDRRSFAKIEKPYMEDPFMKDFG SSFDGRDP+T GLV
Sbjct: 481 SNSSYSPGLKRIPADDLLLNFGRDRRSFAKIEKPYMEDPFMKDFGGSSFDGRDPYTGGLV 540
Query: 541 GVVKRKKDVIKQIDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREEEERQ 600
GVVKRKKDVIKQ DFHDPVR+SFEAELERVQQIQEQERQRIIEEQERALELARREEEERQ
Sbjct: 541 GVVKRKKDVIKQTDFHDPVRDSFEAELERVQQIQEQERQRIIEEQERALELARREEEERQ 600
Query: 601 RLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRILLEEERRKQAA 660
RLARE EERQRRAEE AREAAWRAEQERLEAIQKAEELRIAREEEKQRI +EEERRKQAA
Sbjct: 601 RLAREQEERQRRAEEIAREAAWRAEQERLEAIQKAEELRIAREEEKQRIFVEEERRKQAA 660
Query: 661 KLKLLELEERMAKRQAEAVKSSSLTSDIPEKKIPSVVKDVSRLADTVDWEDGEKMVERIT 720
KLKLLELEERMAKRQAEAVKSS+LTSDIPEKKI SVVKD SRLADTVDWEDGEKMVERIT
Sbjct: 661 KLKLLELEERMAKRQAEAVKSSTLTSDIPEKKISSVVKDASRLADTVDWEDGEKMVERIT 720
Query: 721 TSASSESSSIIRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQFVLQDQST 780
TSASSESSSI R SEVGLR+Q SRDGSPSFVDRGKSVNSWRRDFY+RGSGSQFVLQDQST
Sbjct: 721 TSASSESSSINRPSEVGLRTQVSRDGSPSFVDRGKSVNSWRRDFYDRGSGSQFVLQDQST 780
Query: 781 GYNGTRREASTGGRVSSRKELYGGAGFTTSKTSHRRGITEPQSDEYSQLRGQRPNLSGGG 840
GY G RREA+TGGRVSSRKE YGGAG TS+ +RRG+TEPQSD+YSQLRGQRPNLSGGG
Sbjct: 781 GYTGPRREATTGGRVSSRKEFYGGAGLATSRIYNRRGMTEPQSDDYSQLRGQRPNLSGGG 840
Query: 841 DHYNRSQEFDPEFQDNVENFGDHGWRQESGRNNFYFPYPERVNPISETDGSYSVGRSRYS 900
D YNRSQEFD EFQDNVENFGDHGWRQE GRNNFYFPYPERVNPISE DGSYSVGRSRYS
Sbjct: 841 DQYNRSQEFDSEFQDNVENFGDHGWRQEGGRNNFYFPYPERVNPISEADGSYSVGRSRYS 900
Query: 901 HRQPRVLPPPSVASIQKSSVRGEYESVPRDIVESEIQYDHPASNISTAQTRYIHHENRAL 960
RQPRVLPPPSVASIQKSSVRGE+ SV RDI ESEIQYDH A N+STAQTRYIHHENR L
Sbjct: 901 QRQPRVLPPPSVASIQKSSVRGEFTSVTRDIAESEIQYDHLARNVSTAQTRYIHHENRTL 960
Query: 961 PEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSA 1020
PEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSA
Sbjct: 961 PEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSA 1020
Query: 1021 SREGTLSIEDNESAEPAKVGKEIMITSARVSTGDEDEWGVVDEHVQEQEEYDEDDDGYQE 1080
SREGTLSIEDNESA PAK GKEIMITS R STGDEDEWGVVDEHVQEQEEYDEDDDGY+E
Sbjct: 1021 SREGTLSIEDNESAVPAKAGKEIMITSTRASTGDEDEWGVVDEHVQEQEEYDEDDDGYRE 1080
Query: 1081 EDEVHEGEDENIDLVQDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERIPGNEE 1140
EDEVHEGEDENIDL Q+FDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERI GNEE
Sbjct: 1081 EDEVHEGEDENIDLAQNFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERILGNEE 1140
Query: 1141 NIYVAPEISNGIREEQGSSEGLQVDGKICQYVDASSQIRIDPEEMQDLVMQSKIAQALPE 1200
N++VAPE+SN IREEQGSSEGLQVDGK+CQY DASSQIRIDPEEMQDLVMQS+ AQALPE
Sbjct: 1141 NMFVAPEVSNCIREEQGSSEGLQVDGKVCQYEDASSQIRIDPEEMQDLVMQSETAQALPE 1200
Query: 1201 SEITEQGNSSCRSSVSVQQPISSSVSMASQPISGQVIVPSTAVSGQAESPVKLQFGLFSG 1260
EI EQGNSSCRSSVSVQQPISSSVS ASQ SGQVIVP+ A SGQAE PVKLQFGLFSG
Sbjct: 1201 PEINEQGNSSCRSSVSVQQPISSSVSTASQSSSGQVIVPNAAGSGQAEPPVKLQFGLFSG 1260
Query: 1261 PSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVSQGVLPLA 1320
PSLIPSPVPAIQIGSIQMPLHLHPQ+T SMTHMHSSQPPLFQFGQLRYTSSVSQGVLPLA
Sbjct: 1261 PSLIPSPVPAIQIGSIQMPLHLHPQMTPSMTHMHSSQPPLFQFGQLRYTSSVSQGVLPLA 1320
Query: 1321 PQPLTFVPPTVQTGFPLNKNPGEALSIHPSQETSAHNSRKNDVLPFLMDNQQGIVSRSLN 1380
PQPLTFVPP VQTGFPLNKNPG+AL I SQET AHNSRKNDVLP LMDNQQG+VSRSLN
Sbjct: 1321 PQPLTFVPPAVQTGFPLNKNPGDALPIQTSQETCAHNSRKNDVLPLLMDNQQGLVSRSLN 1380
Query: 1381 VNPSGESKSLPLAESIESKVMTPQDQTVGSCIDESNSRSEPGFQAEHQRHRVSTSDNHYV 1440
VN SGESKSLPL ESIES+VM Q QT GSCIDESNSRSEPGFQAEHQRH VSTSDNHYV
Sbjct: 1381 VNSSGESKSLPLTESIESQVMAQQYQTAGSCIDESNSRSEPGFQAEHQRHHVSTSDNHYV 1440
Query: 1441 VSRGKESEGRAQDGMGSFDSVSRDKGLSGLKARGQFPGGRGKKYIFTVKNSGSRLPFPGS 1500
VSRGKESEGRAQDGMGS DSVSRDKGLSGLKARGQFPGGRGKKY+FTVKNSGSRLPFPGS
Sbjct: 1441 VSRGKESEGRAQDGMGSLDSVSRDKGLSGLKARGQFPGGRGKKYVFTVKNSGSRLPFPGS 1500
Query: 1501 ESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSNSQVSSNHVGVDDKPTVSGRTAVNSA 1560
ESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLS+SQVSSNHV VDDKPTVSGRTAVNSA
Sbjct: 1501 ESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSSSQVSSNHVEVDDKPTVSGRTAVNSA 1560
Query: 1561 RNGTRKVVISNKPSKRALESEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQGSQYSGEG 1620
RNGTRKV +SNKPSKRALE EGLSSGASTSLELDAGNRSEKGVKKEYLGKSQGSQY GE
Sbjct: 1561 RNGTRKVFVSNKPSKRALEPEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQGSQYYGES 1620
Query: 1621 NFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREKEIK 1680
NFRKNICSGEDVDAP+QSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREKEIK
Sbjct: 1621 NFRKNICSGEDVDAPMQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREKEIK 1680
Query: 1681 AKSHNSKIPRKSRSTSKSALSSVTSSKMYATKEAETVKRTRSDFVAPDGGGRGSGNIVVS 1740
AKSHNSKIPRKSRSTSK ALSSV SSK+YA K AETVKRTRSDFVA DGGGRGSGNIVVS
Sbjct: 1681 AKSHNSKIPRKSRSTSKIALSSVNSSKVYAAKVAETVKRTRSDFVAADGGGRGSGNIVVS 1740
Query: 1741 SAFSSPVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLDSSMMFDKKD 1800
SA SS +VSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNL+SS+MFDKK+
Sbjct: 1741 SALSSSIVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLESSLMFDKKN 1800
Query: 1801 DILDNVQSSFTSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSGLAGDPNVPSP 1860
DILDNV SSF SWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHS LAGDPNVPS
Sbjct: 1801 DILDNVTSSFPSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSSLAGDPNVPSS 1860
Query: 1861 SILAMDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGAPTGLCHSDIPI 1920
SILA+DRSFSSAANPISSLLAGEKIQFGAVTSPTVLPP SCSTLLGIG PTGLCHSD+ I
Sbjct: 1861 SILAIDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPDSCSTLLGIG-PTGLCHSDMQI 1920
Query: 1921 SRKLSGAENDCHIFFEKEKHHSESCTHIEDSEAEAEAAASAVAVAAISSDEMVTNGIGTC 1980
KLSGAENDCH+FFEKEKHHSES T IEDSEAEAEAAASAVAVAAISSDE+VTNG+GT
Sbjct: 1921 PHKLSGAENDCHLFFEKEKHHSESRTRIEDSEAEAEAAASAVAVAAISSDEIVTNGLGTS 1980
Query: 1981 SVSVTDTNNFGGGDINVITAGSVEGSAGDQQLACKTRADDSLTVALPADLSVETPPISLW 2040
SV VTDTNNFGGGDINVI A GSAG+QQ A KTRADDSLTVALPADLSVETPPISLW
Sbjct: 1981 SVPVTDTNNFGGGDINVIIA----GSAGNQQFASKTRADDSLTVALPADLSVETPPISLW 2040
Query: 2041 PTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQAQTQKSSA 2100
P+LPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESV TTQAQTQKSSA
Sbjct: 2041 PSLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQKSSA 2100
Query: 2101 PAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQFGQV 2160
PAPGPLGSWKQCHSGVDSFYGPP GFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQFGQV
Sbjct: 2101 PAPGPLGSWKQCHSGVDSFYGPPAGFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQFGQV 2160
Query: 2161 GLSFMGATYIPSGKQHDWKHSPGPSSLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPGSPL 2220
GLSFMGATYIPSGKQ DWKHSPGP SLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPGSPL
Sbjct: 2161 GLSFMGATYIPSGKQPDWKHSPGP-SLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPGSPL 2220
Query: 2221 LPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPMQQQAEGILPSHFSHASS 2280
LPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMP+QQQAEGILPSHFSHASS
Sbjct: 2221 LPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPLQQQAEGILPSHFSHASS 2280
Query: 2281 SDPTFTVNRFPGSQPSVASDHKRNFTVAADATVTQLPDELGIVEASSCVGSGASVPNADI 2340
+DP+FTVNRFPGSQPSVASDHKRN+TVAADATVTQLPDELGIV+ASSCV SG SVPN DI
Sbjct: 2281 ADPSFTVNRFPGSQPSVASDHKRNYTVAADATVTQLPDELGIVDASSCVSSGGSVPNVDI 2340
Query: 2341 NSLLVNSVTDAGKTGVQNCSSSNSGQNAGTNLKSQSSHHKGVSAQQYSHSSGYNYQRGGA 2400
SL VNSVTDAGKT VQNCSSSNS NAGTNLKSQS HKG+ AQQYSHSSGYNYQRGGA
Sbjct: 2341 KSLSVNSVTDAGKT-VQNCSSSNSSLNAGTNLKSQSPQHKGIPAQQYSHSSGYNYQRGGA 2400
Query: 2401 SQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSNGNLRV 2452
SQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPS+GNLRV
Sbjct: 2401 SQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSSGNLRV 2441
BLAST of HG10016734 vs. ExPASy TrEMBL
Match:
A0A6J1IST3 (uncharacterized protein LOC111478360 OS=Cucurbita maxima OX=3661 GN=LOC111478360 PE=4 SV=1)
HSP 1 Score: 4205.2 bits (10905), Expect = 0.0e+00
Identity = 2226/2451 (90.82%), Postives = 2301/2451 (93.88%), Query Frame = 0
Query: 1 MANPGVGTKFVSVNLNKSYGQAHHHHSSHSNSYGSNRTRPGGHGAGGGMVVLSRPRSSQK 60
MANPGVG KFVSVNLNKSYGQAHHHHSSHSNSYGSNRTRPG HGAGGGMVVLSRPRSSQK
Sbjct: 1 MANPGVGAKFVSVNLNKSYGQAHHHHSSHSNSYGSNRTRPGSHGAGGGMVVLSRPRSSQK 60
Query: 61 PGPKLSVPPPLNLPSLRKEHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPRTSDLP 120
PGPKLSVPPPLNLPSLRKEHERLDSLGSG G TGGGVLGN QRPTSAG+GWTKP T+DLP
Sbjct: 61 PGPKLSVPPPLNLPSLRKEHERLDSLGSGAGQTGGGVLGNRQRPTSAGLGWTKPHTNDLP 120
Query: 121 EKEGVSANIVDKIDPSLRSVDGVSGGSSVYMPPSARAGTTGPIVSTSASSQVHTAVEKAP 180
EKEG+S NIVDKIDPSLRSVDGV+GGSSVYMPPSARA T GP+VSTSA SQVHTAVEKAP
Sbjct: 121 EKEGLSGNIVDKIDPSLRSVDGVNGGSSVYMPPSARASTAGPVVSTSALSQVHTAVEKAP 180
Query: 181 VLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEGSYEEQRDTSHLSSRIDARSKFQ 240
VLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAE SYEEQRDTSHLSS IDARSKFQ
Sbjct: 181 VLRGEDFPSLQATLPSAAAPSQKQRDGLSSKLKHAAEVSYEEQRDTSHLSSSIDARSKFQ 240
Query: 241 SSQKSIPCENARNGNSYSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERDTSHG 300
SS+KSIP ENA+NGNS+SSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERDTSHG
Sbjct: 241 SSKKSIPSENAKNGNSFSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERDTSHG 300
Query: 301 LIDRVRDRGHPKSEAYWERDFDMPRVSSLPHKPTHNFSQRWNLRDDESGKFHSSDIHKVD 360
LIDRVRDRGHPKSEAYWERDFDMP VSSLPHKP HNFSQRW+ RDDESGKFHSSDIHKVD
Sbjct: 301 LIDRVRDRGHPKSEAYWERDFDMPWVSSLPHKPIHNFSQRWHPRDDESGKFHSSDIHKVD 360
Query: 361 SYGRDVRTAGREGWEGNFRKNNPIPKDGFGSDSGNDRNDIAGRPTSLDRETNADNMHVSH 420
YGRD RT REGWEGNF+KNNPIPKD FGSDSGNDRNDIAGRPTS+DRETNADNMHVS
Sbjct: 361 PYGRDARTPSREGWEGNFQKNNPIPKDRFGSDSGNDRNDIAGRPTSIDRETNADNMHVSQ 420
Query: 421 FREHANKDGRRDTGFGQNGRQSWNSATESYSSQEPDRNVRDKYGSEQHSRYRGETHNTSV 480
FREHA K GRRD GF GRQ+WNSA+ESY+SQ+PD +DK+GSEQH+++RG+THNTSV
Sbjct: 421 FREHAPKVGRRDAGF---GRQTWNSASESYNSQDPDWTAKDKHGSEQHNKFRGQTHNTSV 480
Query: 481 ANSSYSSGLKRIPADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDPFTTGLV 540
+NSSYS GLKRIPAD+ LLNFGRDRRSFAKIEKPYMEDPFMKDFG SSFDGRDP+T GLV
Sbjct: 481 SNSSYSPGLKRIPADDLLLNFGRDRRSFAKIEKPYMEDPFMKDFGGSSFDGRDPYTGGLV 540
Query: 541 GVVKRKKDVIKQIDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREEEERQ 600
GVVKRKKDVIKQ DFHDPVR+SFEAELERVQQIQEQERQRIIEEQERALELARREEEERQ
Sbjct: 541 GVVKRKKDVIKQTDFHDPVRDSFEAELERVQQIQEQERQRIIEEQERALELARREEEERQ 600
Query: 601 RLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRILLEEERRKQAA 660
RLARE EERQRRAEE AREAAWRAEQERLEA+QKAEELRIAREEEKQRI +EEERRKQAA
Sbjct: 601 RLAREQEERQRRAEEIAREAAWRAEQERLEAVQKAEELRIAREEEKQRIFVEEERRKQAA 660
Query: 661 KLKLLELEERMAKRQAEAVKSSSLTSDIPEKKIPSVVKDVSRLADTVDWEDGEKMVERIT 720
KLKLLELEERMAKRQAEAVKSS+LTSDIPEKKI SVVKDVSRLAD+VDWEDGEKMVERIT
Sbjct: 661 KLKLLELEERMAKRQAEAVKSSTLTSDIPEKKISSVVKDVSRLADSVDWEDGEKMVERIT 720
Query: 721 TSASSESSSIIRSSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQFVLQDQST 780
TSASSESS I R SEVGLR+Q SRDGSPSFVDRGKSVNSWRRDFY+RGSGSQFVLQDQST
Sbjct: 721 TSASSESSCINRPSEVGLRTQVSRDGSPSFVDRGKSVNSWRRDFYDRGSGSQFVLQDQST 780
Query: 781 GYNGTRREASTGGRVSSRKELYGGAGFTTSKTSHRRGITEPQSDEYSQLRGQRPNLSGGG 840
GY G REA+TGGRVSSRKE YGGAG TTS+ +RRG+TEPQSD+YSQLRGQRPNLSGGG
Sbjct: 781 GYTGPWREATTGGRVSSRKEFYGGAGLTTSRIYNRRGMTEPQSDDYSQLRGQRPNLSGGG 840
Query: 841 DHYNRSQEFDPEFQDNVENFGDHGWRQESGRNNFYFPYPERVNPISETDGSYSVGRSRYS 900
D YNRSQEFD EFQDNVENFGDH WRQE RNNFYFPYPERVNPISE DGSYSVGRSRYS
Sbjct: 841 DQYNRSQEFDSEFQDNVENFGDHAWRQEGSRNNFYFPYPERVNPISEADGSYSVGRSRYS 900
Query: 901 HRQPRVLPPPSVASIQKSSVRGEYESVPRDIVESEIQYDHPASNISTAQTRYIHHENRAL 960
RQPRVLPPPSVASIQKSSVRGE+ SV RDI ESEIQYDH A N+STAQTRYIHHENR L
Sbjct: 901 QRQPRVLPPPSVASIQKSSVRGEFTSVTRDIAESEIQYDHLARNVSTAQTRYIHHENRTL 960
Query: 961 PEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSA 1020
PEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSA
Sbjct: 961 PEIIDVNLENGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSA 1020
Query: 1021 SREGTLSIEDNESAEPAKVGKEIMITSARVSTGDEDEWGVVDEHVQEQEEYDEDDDGYQE 1080
SREGTLSIEDNESA PAK GKEIMI+S R STGDEDEWGVVDEHVQEQEEYDEDDDGY+E
Sbjct: 1021 SREGTLSIEDNESAVPAKAGKEIMISSTRASTGDEDEWGVVDEHVQEQEEYDEDDDGYRE 1080
Query: 1081 EDEVHEGEDENIDLVQDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERIPGNEE 1140
EDEVHEGEDENIDL Q+FDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERI GNEE
Sbjct: 1081 EDEVHEGEDENIDLAQNFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERILGNEE 1140
Query: 1141 NIYVAPEISNGIREEQGSSEGLQVDGKICQYVDASSQIRIDPEEMQDLVMQSKIAQALPE 1200
N++ PEISN IREEQGSSEGLQVDGK+CQY DASSQIRIDPEEMQDLVMQS+ AQALPE
Sbjct: 1141 NMFATPEISNCIREEQGSSEGLQVDGKVCQYEDASSQIRIDPEEMQDLVMQSETAQALPE 1200
Query: 1201 SEITEQGNSSCRSSVSVQQPISSSVSMASQPISGQVIVPSTAVSGQAESPVKLQFGLFSG 1260
EI EQGNSSCRSSVSVQQPISSSVSMASQ SGQVIVP+ A SGQAE PVKLQFGLFSG
Sbjct: 1201 PEINEQGNSSCRSSVSVQQPISSSVSMASQSSSGQVIVPNAAGSGQAEPPVKLQFGLFSG 1260
Query: 1261 PSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVSQGVLPLA 1320
PSLIPSPVPAIQIGSIQMPLHLHPQ+T SMTHMHSSQPPLFQFGQLRYTSSVSQGVLPLA
Sbjct: 1261 PSLIPSPVPAIQIGSIQMPLHLHPQMTPSMTHMHSSQPPLFQFGQLRYTSSVSQGVLPLA 1320
Query: 1321 PQPLTFVPPTVQTGFPLNKNPGEALSIHPSQETSAHNSRKNDVLPFLMDNQQGIVSRSLN 1380
PQPLTFVPP VQTGFPLNKNPG+AL I SQET AHNSRKNDVLP LMDNQQG+VSRS N
Sbjct: 1321 PQPLTFVPPAVQTGFPLNKNPGDALLIQTSQETCAHNSRKNDVLPLLMDNQQGLVSRSSN 1380
Query: 1381 VNPSGESKSLPLAESIESKVMTPQDQTVGSCIDESNSRSEPGFQAEHQRHRVSTSDNHYV 1440
VN SGESKSLPL ESIES+VM Q QT GSCIDE+NSRSE GFQAEHQR VSTSDNHYV
Sbjct: 1381 VNSSGESKSLPLTESIESQVMAQQYQTAGSCIDENNSRSELGFQAEHQRQHVSTSDNHYV 1440
Query: 1441 VSRGKESEGRAQDGMGSFDSVSRDKGLSGLKARGQFPGGRGKKYIFTVKNSGSRLPFPGS 1500
VSRGKESEGRAQDGMGS DSVSRDKGLSGLKARGQFPGGRGKKYIFTVKNSGSRLPFPGS
Sbjct: 1441 VSRGKESEGRAQDGMGSLDSVSRDKGLSGLKARGQFPGGRGKKYIFTVKNSGSRLPFPGS 1500
Query: 1501 ESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSNSQVSSNHVGVDDKPTVSGRTAVNSA 1560
ESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLS+SQVSSNHV VDDKPTVSGRTAVNSA
Sbjct: 1501 ESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSSSQVSSNHVEVDDKPTVSGRTAVNSA 1560
Query: 1561 RNGTRKVVISNKPSKRALESEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQGSQYSGEG 1620
RNGTRKV +SNKPSKRALE EGLSS ASTSLELDAGNRSEK VKKEYLGKSQGSQY GE
Sbjct: 1561 RNGTRKVFVSNKPSKRALEPEGLSSRASTSLELDAGNRSEKEVKKEYLGKSQGSQYYGES 1620
Query: 1621 NFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREKEIK 1680
NFRKNICSGEDVDAP+QSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREKEIK
Sbjct: 1621 NFRKNICSGEDVDAPMQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRREQREKEIK 1680
Query: 1681 AKSHNSKIPRKSRSTSKSALSSVTSSKMYATKEAETVKRTRSDFVAPDGGGRGSGNIVVS 1740
AKSHNSKIPRKSRSTSK ALSSV SSK+YA K AETVKRTRS+F+A D GGRGSGNIVVS
Sbjct: 1681 AKSHNSKIPRKSRSTSKIALSSVNSSKVYAAKVAETVKRTRSEFIAAD-GGRGSGNIVVS 1740
Query: 1741 SAFSSPVVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLDSSMMFDKKD 1800
SA SS +VSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNL+SS+MFDKK+
Sbjct: 1741 SALSSSIVSQPLAPIGTPALKSDSQTERSHTARSIQTSGPALATSDGRNLESSLMFDKKN 1800
Query: 1801 DILDNVQSSFTSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSGLAGDPNVPSP 1860
DILDNV SSF SWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHS LAGDPNVPS
Sbjct: 1801 DILDNVPSSFPSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHSSLAGDPNVPSS 1860
Query: 1861 SILAMDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGAPTGLCHSDIPI 1920
SILA+DRSFSSAANPISSLLAGEKIQFGAVTSPTVLPP SCSTLLGIG PTGLCHSD+ I
Sbjct: 1861 SILAIDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPDSCSTLLGIG-PTGLCHSDMQI 1920
Query: 1921 SRKLSGAENDCHIFFEKEKHHSESCTHIEDSEAEAEAAASAVAVAAISSDEMVTNGIGTC 1980
KLSGAENDCH+FFEKEKHHSES T IEDSEAEAEAAASAVAVAAISSDE+VTNG+GT
Sbjct: 1921 PHKLSGAENDCHLFFEKEKHHSESRTRIEDSEAEAEAAASAVAVAAISSDEIVTNGLGTS 1980
Query: 1981 SVSVTDTNNFGGGDINVITAGSVEGSAGDQQLACKTRADDSLTVALPADLSVETPPISLW 2040
SV VTDTNNFGGGDINVI A GSAG+QQ A KTRADDSLTVALPADLSVETPPISLW
Sbjct: 1981 SVPVTDTNNFGGGDINVIIA----GSAGNQQFASKTRADDSLTVALPADLSVETPPISLW 2040
Query: 2041 PTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVPTTQAQTQKSSA 2100
P+LPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESV TTQAQTQKSSA
Sbjct: 2041 PSLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESVSTTQAQTQKSSA 2100
Query: 2101 PAPGPLGSWKQCHSGVDSFYGPPTGFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQFGQV 2160
PAPGPLGSWKQCHSGVDSFYGPP GFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQFGQV
Sbjct: 2101 PAPGPLGSWKQCHSGVDSFYGPPAGFTGPFISPGGIPGVQGPPHMVVYNHFAPVGQFGQV 2160
Query: 2161 GLSFMGATYIPSGKQHDWKHSPGPSSLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPGSPL 2220
GLSFMGATYIPSGKQ DWKHSPGP SLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPGSPL
Sbjct: 2161 GLSFMGATYIPSGKQPDWKHSPGP-SLGVEGDQKNLNMVSAQRMPTNLPPIQHLAPGSPL 2220
Query: 2221 LPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPMQQQAEGILPSHFSHASS 2280
LPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMP+QQQAEGILPSHFSHASS
Sbjct: 2221 LPMASPLAMFDVSPFQASPEMSVQARWPSSASSVQPVPLSMPLQQQAEGILPSHFSHASS 2280
Query: 2281 SDPTFTVNRFPGSQPSVASDHKRNFTVAADATVTQLPDELGIVEASSCVGSGASVPNADI 2340
+DP+FTVNRFPGSQPSVASDHKRN+TVAADATVTQLPDELGIV+ASSCV SG SVPN DI
Sbjct: 2281 ADPSFTVNRFPGSQPSVASDHKRNYTVAADATVTQLPDELGIVDASSCVSSGGSVPNVDI 2340
Query: 2341 NSLLVNSVTDAGKTGVQNCSSSNSGQNAGTNLKSQSSHHKGVSAQQYSHSSGYNYQRGGA 2400
SL VNSVTDAGKTGVQNCSSSNS NAGTNLKSQS HKG+ QQYSHSSGYNYQRGGA
Sbjct: 2341 KSLSVNSVTDAGKTGVQNCSSSNSSLNAGTNLKSQSPQHKGIPVQQYSHSSGYNYQRGGA 2400
Query: 2401 SQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSNGNLRV 2452
SQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPS+GNLRV
Sbjct: 2401 SQKNSSGGSEWPHRRTGFMGRNQSGAEKNFSSAKMKQIYVAKQPSSGNLRV 2441
BLAST of HG10016734 vs. TAIR 10
Match:
AT3G50370.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 26 plant structures; EXPRESSED DURING: 15 growth stages; Has 27734 Blast hits to 16708 proteins in 1259 species: Archae - 81; Bacteria - 3434; Metazoa - 10876; Fungi - 2514; Plants - 987; Viruses - 212; Other Eukaryotes - 9630 (source: NCBI BLink). )
HSP 1 Score: 1236.9 bits (3199), Expect = 0.0e+00
Identity = 1001/2407 (41.59%), Postives = 1332/2407 (55.34%), Query Frame = 0
Query: 79 EHERLDSLGSGTGPTGGGVLGNGQRPTSAGMGWTKPRTSDLPEKEGVSANIVDKIDPSLR 138
EHER+DS GS + +GGG+ G+G RP S+G+GW+KP +A D D
Sbjct: 27 EHERVDSSGS-SFHSGGGIAGSGTRPASSGIGWSKP-----------AATATDG-DIGNH 86
Query: 139 SVDGVSGGSS-VYMPPSARAGTTGPIVSTSASSQVHTAVEKAPVLRGEDFPSLQATLPSA 198
+ +GV+ GS+ + ++R G P+ + VEK LRGEDFPSL+A+LPSA
Sbjct: 87 TGEGVTRGSNGLNTSLASRVGAAEPM------ERAFHHVEKVATLRGEDFPSLKASLPSA 146
Query: 199 AAPSQKQRDGLSSKLKHAA-EGSYEEQRDTSHLSSR-IDARSKFQSSQKSIPCENARNGN 258
+ QKQ++GL+ K K AA E +E R S +SS +D R + QS + + E + +
Sbjct: 147 SVSGQKQKEGLNQKQKQAAGEDFSKEPRGVSGMSSSLVDMRPQNQSGRSRLGNELSES-P 206
Query: 259 SYSSGSFQSPELSRKQEDIFPGPLPLVSMNPRSDWADDERDTSHGLIDRVRDRGHPKSEA 318
S+S G S + +K + F GPLPLV + PRSDWADDERDTSHGL DR RD G+ K+E
Sbjct: 207 SFSDGLHSSEHVRKK--EYFAGPLPLVRLAPRSDWADDERDTSHGLRDRDRDHGYSKNEP 266
Query: 319 YWERDFDMPRVSSLPHK-PTHNFSQRWNLRDDESGKFHSSDIHKVDSYGRDVRTAGREGW 378
+W+R FD+ R LP K N + R++E K + + V GR+
Sbjct: 267 FWDRGFDL-RPHVLPQKHAASNVFDKPGQRENEIAKSSLTQVRPVSGGGREANA------ 326
Query: 379 EGNFRKNNPIPKDGFGSDSGNDRNDIAGRPTSLDRE-TNADNMHVSHFREHA-NKDGRRD 438
+R ++P+ + G + N+ RP+S RE N +S RE+ N G R+
Sbjct: 327 ---WRVSSPL------QNEGANHNNYGARPSSRGREAAKKSNYVLSSSRENVWNNSGARE 386
Query: 439 TGFGQNGRQSWNSATESYSSQEPDRNVRDKYGSEQHSRYRGETHNTSVANSSYSSGLKRI 498
+ GRQ WN+ +S SS N RD YG E +
Sbjct: 387 APYQHGGRQPWNNNMDS-SSNRGTYN-RDGYGIEHQN----------------------- 446
Query: 499 PADEPLLNFGRDRRSFAKIEKPYMEDPFMKDFGASSFDGRDPFTTGLVGVVKRKKDVIKQ 558
RD+RSF K +KP++EDPFMKDFG S FD DPF ++GV K+KK+ +KQ
Sbjct: 447 ----------RDKRSFFKSDKPHVEDPFMKDFGDSGFDVHDPFP--VLGVTKKKKEALKQ 506
Query: 559 IDFHDPVRESFEAELERVQQIQEQERQRIIEEQERALELARREEEERQRLAREHEERQRR 618
+FHDPVRESFEAELERVQ++QE+ER+RIIEEQER +ELAR EEEER RLARE +ERQRR
Sbjct: 507 TEFHDPVRESFEAELERVQKMQEEERRRIIEEQERVIELARTEEEERLRLAREQDERQRR 566
Query: 619 AEEEAREAAWRAEQERLEAIQKAEELRIAREEEKQRILLEEERRKQAAKLKLLELEERMA 678
EEEAREAA+R EQERLEA ++AEELR ++EEEK R+ +EEERRKQAAK KLLELEE+++
Sbjct: 567 LEEEAREAAFRNEQERLEATRRAEELRKSKEEEKHRLFMEEERRKQAAKQKLLELEEKIS 626
Query: 679 KRQAEAVKSSSLTSDIPEKKIPSVVKDVSRLADTVDWEDGEKMVERITTSASSESSSIIR 738
+RQAEA K S +S I E K +VK+ AD VDWED E+MV+RITTS++ + S +R
Sbjct: 627 RRQAEAAKGCSSSSTISEDKFLDIVKEKDS-ADVVDWEDSERMVDRITTSSTLDLSVPMR 686
Query: 739 SSEVGLRSQFSRDGSPSFVDRGKSVNSWRRDFYERGSGSQFVLQDQSTGYNGTRREASTG 798
S E SQFSRDGS F DR K +WR++ E GS S+F+ Q+ +
Sbjct: 687 SFESNATSQFSRDGSFGFPDRQKP--TWRKEDIESGSNSRFIPQNLENVPH--------- 746
Query: 799 GRVSSRKELYGGAGFTTSKTSHRRGITEPQSDEYSQLRGQRPNLSGGGDHYNRSQEFDPE 858
S ++E +G AG+ ++ + + G E D Q + G G + R+ + E
Sbjct: 747 ---SPQEEFFGTAGYLSAPSYFKPGFPEHSID-------QSWRIPGDGRTHGRNYGMESE 806
Query: 859 FQDNV-ENFGDHGWRQESG--RNNFYFPYPERVNPISETDGSYSVGRSRYSHRQPRVLPP 918
++N E +GD GW Q G R+ Y PYPE++ E D Y GR RYS RQPRVLPP
Sbjct: 807 SRENFGEQYGDPGWGQSQGRPRHGPYSPYPEKLYQNPEGDDYYPFGRPRYSVRQPRVLPP 866
Query: 919 PSVASIQKSSVRGEYESVPRDIVESEIQYDHPASNISTAQTRYIHHENRALPEIIDVNLE 978
P S QK+S R E E I Y H ST YI ++ LP
Sbjct: 867 PQ-ESRQKTSFRSEVEHPGPSTSIGGINYSHKGRTNSTVLANYI-EDHHVLP-------G 926
Query: 979 NGENEEQKPDGNTTLRCDSQSTLSVFSPPTSPTHLSHEDLDDSGDSPVLSASREGTLSIE 1038
+G +E ++ D T RCDSQS+LSV SPP SP HLSH+DLD+S DS VL SR G +
Sbjct: 927 SGIDEHRRFDTKLTGRCDSQSSLSVTSPPDSPVHLSHDDLDESADSTVLPTSRMGEDAGL 986
Query: 1039 DNESAEP---AKVGKE-IMITSARVSTGDEDEWGV-VDEHVQEQEEYDEDDDGYQEEDEV 1098
+ P + +GK+ +M+ + VS D +EW + +E +QEQEEYDED+DGYQEED++
Sbjct: 987 LEKGGAPIISSDIGKDSLMMATGSVSCWDNEEWTLDSNERLQEQEEYDEDEDGYQEEDKI 1046
Query: 1099 HEGEDENIDLVQDFDDLHLDDKGSPHMLDNLVLGFNEGVEVGMPNDEFERIPGNEENIY- 1158
H G DENIDL Q+ +++HLDDK S NLVLGFNEGVEV +P+D+FE+ N E+ +
Sbjct: 1047 H-GVDENIDLAQELEEMHLDDKDS-----NLVLGFNEGVEVEIPSDDFEKCQRNSESTFP 1106
Query: 1159 VAPEISNGIREEQGSSEGLQVDGKICQYV--------DASSQIRIDPEEMQDLVMQSKIA 1218
+ + + +E+ S E + + V +AS + MQ+L + I
Sbjct: 1107 LHQHTVDSLDDERPSIETSRGEQAAQPAVVSDPLGMHNASRTFQGAETTMQNLTVHPNIG 1166
Query: 1219 QALPESEITEQGNSSCRSSVSVQQPISSSVSMASQPISGQVIVPS-TAVSGQAESPVKLQ 1278
+ E+ + +S+ S+VS P+ S A P S Q +P + S Q E PVK Q
Sbjct: 1167 R--QSFEVASKVDSTSNSTVST-HPVIPLHSAALHPSSLQTAIPPVSTTSAQMEEPVKFQ 1226
Query: 1279 FGLFSGPSLIPSPVPAIQIGSIQMPLHLHPQITQSMTHMHSSQPPLFQFGQLRYTSSVSQ 1338
FGLFSGPSLIPSP PAIQIGSIQMPL LHPQ S+THM QPPL QFGQL YTS +SQ
Sbjct: 1227 FGLFSGPSLIPSPFPAIQIGSIQMPLPLHPQFGSSLTHMQQPQPPLIQFGQLPYTSPISQ 1286
Query: 1339 GVLPLAPQPLTFVPPTVQTGFPLNKNPGEALSIHPSQETSAHNSRKNDVLPFLMDNQQGI 1398
GVLP P + V + + LN+NPG +++ Q SA+ +N + Q +
Sbjct: 1287 GVLP--PPHHSVVQANGLSTYALNQNPGSLVTVQLGQGNSANLLARN-AATSVSHPQLSV 1346
Query: 1399 VSRSLNVNPSGESK--SLPLAESIESKVMTPQDQTVGSCIDESNSRSEPGFQAEHQRHRV 1458
+ R NV+ G K +LP A + ++PQ Q S + SR
Sbjct: 1347 LRRPTNVSDEGTLKNANLPPARASIEAAVSPQKQPELSGNSQLPSRK------------- 1406
Query: 1459 STSDNHYVVSRGKESEGRAQDGMGSFDSVSRDKGLSGLKARGQFPGGRGKKYIFTVKNSG 1518
+S GK + Q G V D V+NSG
Sbjct: 1407 --------MSHGKSNFAERQSGY----QVQTDTS--------------------AVRNSG 1466
Query: 1519 SRLPFPGSESTRLDTGGFQRRPRRNIPRTEFRVRETVDKKLSNSQVSSNHVGVDDKPTVS 1578
R +E +R+D+GG RR RR R EFRVRE SN D+ +
Sbjct: 1467 LR-SSGTAEVSRVDSGG-NRRYRRQ--RVEFRVRE------------SNWPSSDENRNGN 1526
Query: 1579 GRTAVNSARNGTRKVVISNKPSKRALESEGLSSGASTSLELDAGNRSEKGVKKEYLGKSQ 1638
GR A S + G+RK V+SNK K+AL+S +SG + + +G E + K+ + K+
Sbjct: 1527 GR-AQTSTKIGSRKYVVSNKSQKQALDSS--ASGLNAMQKTVSGGSFENRLGKDAVVKNP 1586
Query: 1639 GSQYSGEGNFRKNICSGEDVDAPLQSGIIRVFEQPGIEAPSDEDDFIEVRSKRQMLNDRR 1698
S SG+ N ++N+ S +++DAPLQ GI+RVFEQ GIEAPSD+DDFIEVRSKRQMLNDRR
Sbjct: 1587 LSPNSGQANLKRNMVSEKEIDAPLQIGIVRVFEQQGIEAPSDDDDFIEVRSKRQMLNDRR 1646
Query: 1699 EQREKEIKAKSHNSKIPRKSRSTSKSALSSVTSSKMYATKEAETVKRTRSDFVAPDGGGR 1758
EQREKEIK KS +K RK RST ++ ++ S++ A K+
Sbjct: 1647 EQREKEIKEKSQAAKAFRKPRSTFQNNTTAARSNRSPPASRAANNKQ------------- 1706
Query: 1759 GSGNIVVSSAFSSPVVSQPLAPIGTPALKSDSQT-ERSHTARSIQTSG--PALATSDGRN 1818
F+ Q LAPIGTP+ K DS E+S + +S Q S P + +D +N
Sbjct: 1707 ----------FNPVSNRQTLAPIGTPSPKIDSHVDEKSGSNKSTQESSALPVIPKND-QN 1766
Query: 1819 LDSSMMFDKKDDILDNVQSSFTSWGNSRINQQVMALTQTQLDEAMKPAQFDLHPPVGDHS 1878
S +F K+ +LDN + +WGN Q VMALTQ+QLDEAMKP V + +
Sbjct: 1767 PASGFVFSNKNKVLDNSHTPVGTWGNQLTYQPVMALTQSQLDEAMKPVSHLSCVSVENGA 1826
Query: 1879 GLAGDPNVPSPSILAMDRSFSSAANPISSLLAGEKIQFGAVTSPTVLPPGSCSTLLGIGA 1938
+ N S S++ + +FSS+ +PI+SLLA KIQFGAVTS TV+PP T
Sbjct: 1827 NRISESNSTSTSVVPKNNTFSSSTSPINSLLAEGKIQFGAVTSSTVIPPCGGRT------ 1886
Query: 1939 PTGLCHSDIPISRKLSGAENDCHIFFEKE-KHHSESCTHIEDSEAEAEAAASAVAVAAIS 1998
E D ++FEK+ KH + S T IE EAEAEAAASA+AVAAI+
Sbjct: 1887 ------------------EKDSSLYFEKDNKHRNPSSTGIEICEAEAEAAASAIAVAAIT 1946
Query: 1999 SDEMVTNGIGTCSVSVTDTNNFGGGDINVITAGSVEGSAGDQQLACKTRADDSLTVALPA 2058
+DE N + T SV +T +GG +++ G+ G+ G Q +++A++SL V+LPA
Sbjct: 1947 NDETSGNALSTGSVLPVETKIYGGTELD---DGAASGTVGGQ--TSRSKAEESLIVSLPA 2006
Query: 2059 DLSVETPPISLWPTLPSPQNSSSQMLSHFPGGSPSQFPFYEINPMLGGPVFTFGPHDESV 2118
DLSV+T PISLWP LPSP N S+QM++HFP G P +PFY++NPML GP+F FGPH ++
Sbjct: 2007 DLSVDT-PISLWPQLPSPHN-SNQMITHFPPG-PPHYPFYDVNPMLRGPIFAFGPHHDA- 2066
Query: 2119 PTTQAQTQKSSAPAPGPLGSW-KQCHSGVDSFYGPPTGFTGPFIS-PGGIPGVQGPPHMV 2178
TQ+Q+QK GP +W +Q HSGVDSFY PP GFTGPF++ PG IPGVQGPPHM
Sbjct: 2067 GATQSQSQKGPVTVSGPPTTWQQQGHSGVDSFYAPPAGFTGPFLTPPGAIPGVQGPPHMF 2126
Query: 2179 VYNHFAPVGQFGQVGLSFMGATYIPSGKQHDWKHSPGPSS--LGVEGDQKNLNMVSAQRM 2238
VYNHFAPVGQFG GLSFMG TYIPSGKQ DWKH+P SS +G +GD N N+ S M
Sbjct: 2127 VYNHFAPVGQFG--GLSFMGTTYIPSGKQPDWKHNPNVSSSPVGGDGDVNNPNVAS---M 2140
Query: 2239 PTNLPP--IQHLAPGSPLLPMASPLAMFDVSPFQ-ASPEMSVQARWPSSASSVQPVPLSM 2298
N+ P +QHL P+ MFD SPFQ +S EM ++ARWP S P +M
Sbjct: 2187 QCNIVPASLQHL-----------PMPMFDPSPFQSSSQEMPIRARWPYMPFSGPP---TM 2140
Query: 2299 PMQQQAEGILPSHFSHASSSDPTFTVNRFPGSQPSVASDHKRNFTVAADATVTQLPDELG 2358
MQ+Q EG S+ P F N P P+ R V A V
Sbjct: 2247 QMQKQQEGTDGSNL-----PSPQFNNNMLPPPPPN------RYPNVQASTVVD------A 2140
Query: 2359 IVEASSCVGSGASVPNADINSLLVNSVTDAGKTGVQNCSSSNSGQNAGTNLKSQSSHHKG 2418
+V++S+ S P A S L +D QN + G + QSS K
Sbjct: 2307 MVDSSNAYSSTTGAPPAKPTSTL----SDPNSNNTQN--PNGPGFKPPQQQQQQSSQEKN 2140
Query: 2419 VSAQQYSHSSGYNYQRGGASQKNSSGGSEWPHRRTGFMGRNQSGA-EKNF-SSAKMKQIY 2447
+Q H G ++ +N RR+G+ GRNQ A E+ F ++ K+KQIY
Sbjct: 2367 TQSQ---HVGGPSHHHQHQHHQN---------RRSGYHGRNQPMARERGFPNNPKVKQIY 2140
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038883483.1 | 0.0e+00 | 94.50 | uncharacterized protein LOC120074436 [Benincasa hispida] | [more] |
XP_004142008.1 | 0.0e+00 | 92.60 | uncharacterized protein LOC101218305 isoform X1 [Cucumis sativus] | [more] |
TYK12892.1 | 0.0e+00 | 92.12 | uncharacterized protein E5676_scaffold255G004860 [Cucumis melo var. makuwa] | [more] |
XP_008440276.1 | 0.0e+00 | 92.04 | PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103484772 [Cucumis me... | [more] |
XP_022950041.1 | 0.0e+00 | 91.43 | uncharacterized protein LOC111453246 [Cucurbita moschata] >XP_022950042.1 unchar... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A5D3CNG4 | 0.0e+00 | 92.12 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A1S3B1H0 | 0.0e+00 | 92.04 | LOW QUALITY PROTEIN: uncharacterized protein LOC103484772 OS=Cucumis melo OX=365... | [more] |
A0A0A0KLC4 | 0.0e+00 | 92.11 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G490850 PE=4 SV=1 | [more] |
A0A6J1GDR0 | 0.0e+00 | 91.43 | uncharacterized protein LOC111453246 OS=Cucurbita moschata OX=3662 GN=LOC1114532... | [more] |
A0A6J1IST3 | 0.0e+00 | 90.82 | uncharacterized protein LOC111478360 OS=Cucurbita maxima OX=3661 GN=LOC111478360... | [more] |
Match Name | E-value | Identity | Description | |
AT3G50370.1 | 0.0e+00 | 41.59 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |