Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGAGTGTTCCGGGTGTATGCCCAACCGAGGATGCCATACATGCATTATTAGATTATTTAGTTGAGCCTATGCTTCCTGCAAAGTCATCTTCGAGAGAAAATCCACCACAAGCTCTACTGCAATCAGTTGCAAAACAGGTACTTATGAAATTGACTTTCGTGTGTTTAATAATTTAATTTCATTCGCTTTCATGCTTGAAAATATTCGAACGATCATAATCAAACTAATAATTTTTGGTTTTAGACTGGGTGGTTGGAGTTGTTTACTGGAAACATCAAAGTTGTGCATGCGCAATCTGTGTTTTAATATCCGTTATCGTTTGCAAGTACCAACAGTTTATGAAAATACTATTGCTTGTGGAATTTCTTCAGTGGTCTCTCTTCCTCAACTAACCACGTGCTTTAACATTTTTTAATCTTTTGAAAGGGCCAGCCGGCCAGTTTTTGCAAGCACAAAACTACTTTTGTAAGTAAATATAATTTTCTCCAGAAAACTCTCTCTTTCTCCTCTCTCTCTCACACACACACGCCCACAGACTTGTTAATTAAAAGAAGGCACATAGAGGGAGGAAGGAGAGGGGGGACTATGTATCCAGGCTCTGGCTACTCCATGATAATTGAAGTTACAAGAAATGTTCATTCAGAAAATTTATGTGAGCAAATGGATTCATTTATTTTCCGATATTTGAAACAACTGCAAATAAAAAGTAATTCATGGCACTTCTGCTGCTTTTGCTAGTCTATGATTTATTTGATTTTCTCATTTACACATTTTAATGTAGCACCATTCTTCTTGCAGGAAAAGTTTGGATTTTCCTATATTTAAAGAGCAGCAGAAATTGACTAATACTGTTCATATATTATAAAATGCTATGGGGTGTTTGGGCCACCAACTTGAAGTTGGTAATAGCCCACTCCACGTTTGGGCACCAATTATAATAGTTGGTGTTTCCAACTATTATAGTTTACAAGTTACAATTAACTACAACATTACTATTTCAAATCTTTCTTTGCCACAATGTTTACTATTTTCTATTCATCATTTCTTTCCTTCTTTGTTACCGTGTTTACTATTTCTTATCTAAACTAAAATAGTCTGCACCCCAAACACAGACTATTGGAGCCTACAAATTATCATAAACCACAAACTATAATAACCATTGACTATACTAACCAACTCAGCGCCCCAAACGCTCCCAATAGATTTAAACTTTAAAGTAGGTTATTTTAACTTTATGTGGCAGCTTATTTCACACCACTCCTTCTCGTAAGAAAGAAAATGTAAAGAGAGAGAATAAAAGTGTCATCTAGCTTTAAAGATAGTTTATTGTTTAAAAAAATGTTTATCTAACTTAGTGATTGCCCTTGAATCTACAATCAGATGCATGCAGTTGTTCTATTATACAACTACTACCACCGGAAACAACATCCACACCTTGAATATTTGAGTTTTGAGGCATTTTGCAAGTTAGCTGTGGTCGTTAAACCAGCTTTGTTGTCTCACATGAAACTTATGCAAAGCTCAGATGATATCAAATTGGAAAATCCCGAGAAGCAGCTTTCTCCAGCAGAAAAAGCAATTATGGATGCATGTGATATAGCCACGTGTCTAGAGGCATCAACAGATGAAAACGTAGAGGGCTGGCCTCTTTCCAAGGTTGCTGTTTTTTTAATTGACTCCAAGAAGGAGCATTGTTATTTGCTATTTAGTCTCATCACTCAAGGAGTTTGGTCTGTCATTGAACAAGATATAGATACCTCTGAATGCCAACCAGAAACTGTGGATGAGGAAAAACATGTAAACAAAAAGAAAAGAGTGATTAAGAAACCTTCAAAAGAGGGGCTAGTTCTTGATGAAACTAAGACTCAGCAGCTTGCATATACAGCAGTTAAGGAAGCAACTGGTATGTATCTTACTGCTTTAACTCATCAGATAGTTCAATGGTTTTATCTTCTAATTCTCAAAAGGTTTTAAGTATGCCGGGGAGAACTGCTTTTAT
TTCCTACCTTTTTATTTCGTATGAGATGTCCCCTTCTGTTGTTTGATTCAAAGATACCTTAAGAGTACTAATGATTATAGCAGGGATGATTGTCATATATCATATCTTCCTCAATTAGGATCATGTATGGTTAATTATTAGGAAATGATGATATTCTTCATCATCATCATCTTCCCCTTCCTCCTTTTCCTTTTCTTTCTTTTTTTTTTTGGGGGGGGGGGGGGGGGGGGAAGGTTGCTGATTTATATCAATTTCTAATTATGATAGTGGTGGGCTGGTCGGATGGTGCCTCATCATAGTGGCGTTATGTACTCTTTCTATTTTCTTTTGTAATTACCTCAAGAAATGGTGAAGAACTTGAATATAGCATGATCATGTATTTGCTCATCATAGTGGCGTGTGTTTCTTTTGTTTAATTGTATCATGTGTTTTGGAGCATTAGCCTTTTTTTTTCATTACATTAAAGAAAAGTTCTGTTTTCTTCCAAAAAAAAAAAAAAAAACTTGGATATAGCATGATATTGTTTGGAGTATATGTAAAAAAGTAAACTAGATTTTTAAGACAAAAATATTATTTCAAGCTCGTTGCGAGCAACTTGGCCATGGGTTAAGGGTATAGGTCTTGTGGGGGAGTCCTGCCTTTAGGTAGTGGGGTGGGGGTGCTAGTTTTGTTTCCTTTACCTTGTCCCATGGGTGGTCCGGGCGAGCTGTCAATTGGGGGCTTCTAAGGGCACTGACCATCTGTGAGTTCCATGCTAGCGGTGGTTGGAATAGAGGCAATGGAAATTGTTGTCAATTACTCTCGAGCCAATTCCCTCCCTCATTCTAGGGAGTTTCTATGGAGGTGAGCCTCCCCCTGCTAGTTGATTGTGGTTTGAATCAGCAGAAGTAAAATTCTTGGGAAATCAAAAGGGGGAGGAAGTGAGGAGAAGGGACATGGAGCTTAATAAATTGGAATGTTCGGTTAATTATGAGGGAAGGAAAACAGGAAGGAAAGACTCAGAGAAGTCATGGAATCAAAAAGTTCTAAAATGAAAGTGATTACTTGGAATGTGAGAGGTTTGGGGGTAGGGAGAACGAGGAGATTGGTCAAAGTGGTTTTGTCACAAGGAAAACCCTGGTATTGTAATTCTTGTTGAATCCAAATGGGAGAGTTGTTGTAGGGATTTGATTAAGAGTCTCTTGGGTTGAAGGCATGTGGAGTGGGTTTGGTTGGAGGCTCTAGCAGTTTGGGAGGTTTAATAATAATGTGGTATGTTTTGGTTGGAACCTCCTTGGAAGTGGTTAAAGGGGTTTTTTCAATGTTCATGACCTGTAGGTTCTTGGTAGGGACAAAGGGGTGAGTGTTGGGGGTGTACGGACTGCCAAGACAAAAAGGGAGGAAAGAGTTTTGGAAGGAGTTAGGGTGATCTTTTTGGTTTATGTGGTCCTTATTGGTGCCTTGCAGGTGACTTTAATGTGATCCGTGTTTTACCTGTTCCAATAGTGGATACAGAAGCCAAATTGATAGCCTTTTTTCTTTCGAAAGGATGGATGGATGGAAGTGTTTCATAATGTTAGGCATTTCTTATGTCCTAGGGTCACTTTGGACCACTGGCCTCTCTGCTTGGATTCGGGATCAGTGAAATAGGGTCCTACTCCCTTCAAATTTGATACCATGTGGCTTTCTCACCCATCGTTTAAGTCTTCCTTCCCTCGGTGGTGGCAGATAAAGATTCAAGGGAGAAGGGACTGGTTTTGCTTTTTAGGGAAGCGAAAATGGTTCTGAAGTCTTGGAATAGAGAGGTTTTGAGGATGTTAGGCTCAAGAAACATGAGATCTTGGGGAGACTTAAGGTTTTGGATAGCCTGGATGAGATGGAGGAGGTTTTATCTGAGGCAAGGAAGCAGGAGCAGTTGAACCTAAAGGCTGATATTAGTTGGGAAGGAGAATGTGAGTTGGAGACAAAAAGAAAAAGTTAAATGGGCTAGTGAGAGCGACTGTAATCCAACTATTTATTTC
CACAGAGTTGCTAGCGGAAGAAGAAACAAGAACTTCAACTCCATTGAGAAGGAGTTTTGAGAGTTGTTAGTTGAGGACCATGAGATTGAGGACGAGATTCTTCACCATTTCTCCTCATTGTATAGCCCTATGGTTGGGCCTCCTCCGTTTGGCGGTGGATTGGTGTCCTGACTCCTTTGGTCCCTTGCTAGCTTCTTTGTTTCTCTTTGTTGTCCAAATTTCCTATCCTCAAACCATCTTATTGTCAAATTTCAACACCACACTAAATGATTTCCTATCCTTAAACCACTTCCTTGTTAGGTTTCGACAACAAACTGCAATTTACTAAATGCAACCCACCTCCCTCCTCAGCCCCTCCCCATAGAAAGTGCCTTGTGACTTCCAACTTTTCCTTGATGGATCCATGAGTGTTGAAAAGAAAGAAATTACAAGGGATTTGAGGGTAGTCTTCCTCTTTATAGAAGAAAGCTTTTCTCCAAGCTCTTTTGAAATTCTCCAAGATCCATGAGTGAATGAATTTTAAATCTTTCACCACTGTTGATGTTAATTTTCTTTTACTTAATTATTAAACAAATGCTACTTCAATTAAAAGAAAAAAAATTGTTCATAGTACAAATATTTTTAAAATTTTCAATTATTCTAAGGCCCCCCCTTCCCCAGTCTTATACAGCAACGGAGTGGAAGTAAAATTCATGGAGTCTATCTTTTTTGTTGCTATTTGTTTCCTTTTCAAAAAAAAAATCTTTGTTGTTGCTATTATCACCCTTGCGGTCCTGTATTATGGACATGGCACTTATATTTGGTACTTACAGATACTTGCTAATCCTATTTATTTTCATGCAAAACAGGGATTAATCAAACCGATCTCAAAATTTTAGAAAGTCATGTTGTATACTCTCTAAGTAAAGAAAAATCAGCAGTCTGCTTTTATATGATTCAGTGCACTCGATCAGGAACCGAAGAAGTAATTCAAGTTCCCATAAGAGATGCCGTTGACAGGTTTCATTATTTTCTTTCGCATCTGTGCTCTTCAATTCTGTGTCTTGGTTTGAGAATGCATGTTCCAGGACCTCATGTGCAATGACCATGGATGAAGGTAAGTCCTCAAATGCATAGATACCCTTTTTTGCTCATGATTTTTCTCTTTTCCATGGTATTAGTTTGCAGGATTCGTTGTTTAAAATAAATGGTAGGAGATGGAGCATTACTTCAAAAGTTGAGTACTTCCACATTCTTCCTTATGCTAAGATGGCGCTAACCTGGTTTCGTAGGTATACTCTTAATTGCTATGGATATTGGTATTAATGAATTTTCGTGACCTTGTTTGTTTCTACTTCCTGTTGCTGTATCCAACCTCTTGGCTTTTGCTAGTTGACAGTAATGTGAGGGTTAACTATTTTTATTATCTTTTGACAAGAAACAAAACATTTCATTACGTAATGCTCTGAAGATATAAAATGTGAAGGTTAACTATTTGATATTCTTTCTTTTGTTTCACTTGATTCCGCTCTGTTACTGTAAACTTCCTGGCTCGTGCTAGTTAACTGTGATGGGAGGGTTAACTATTACTAGATACCTTCTGGAAATTGAGCCTTCTGAACCATTAATTTAGGCTTTACTTAATTCGTTATTATTTCTTCATTCACTTAACTCCACTATCGAAACAGTGTTTTACCCTGTACTTCAAAACCAAAAATCAAAGCACATTCATGATGTTTTTCTTTTAATCAATCAATTAATAAAGTTATGATCAGAATAAATATAGCTGACGAGGAATTGAAAGGGATTGAGTTATTTGTTGCCCATATTTTTGTACGCAACAAATATTTATTATTTGTTGTAGAAAAGAAGCTTTTTTCGTTTGTTTGTTTGTTTCCCTTTCACCACATTTGTGCTTCCGTTGTTATAACACTTGAGTCAGTGAATCACAATTTTCTTCATATTAATTTCCAAATTAAGGGTTAAAGAAATTTTCTTAAAATTTTTTCAGTGTCTGTGGA
CTTTTTATTTTTTATTTTTTTTTAATTATTTAATTAATGAAAGAATATACAAGGTCTCACATATCTGGGGACTCTTGGTTAGCGAAAGGACAATGAATGCTCAAAGAAAGCAAAGGAGACATGTCCTAGAAAGACAAAACAGAGCCCACCAGTGCAACCCCTTACCAGACAAGTGGTAAAGCAGCTGGAACAACATGCAAAAGGGCTTCTCTCCCACCCAACAATCCTCCCAAAAATGAGTATCTGACCCATGCTCGATAGAACATATAACAAACTGGGAAAACTGAGAAAAACCCGAAGCAATAACATTCCACAAGTTTTTGCCTCCGGCCTTTTAACCCATCACTTGGATACCACTCAATAGGATGCAGATCATACTTGCTTCCTGTGATCCTATACCAAAGATCACCTGGTTCCCTGGAGAACCCCATAACTATTTTGCTAGCATAACCTCATTCTGTAATCTCAGGTTGCCAATACCTAAACCTCCAGCCTCCACCAAACCATCTCCCAACTTCCCAAATGAGAACCATTGCCCTCCTCATTCCTTCCCATTAAAAAATCCTCATCGTACTCAATGTTTTCATATAGTGAAGTAGGGATTCTAAACAGGGAAATGAAGTAAATAGGTATCCTTCTCAGCACACATTGAATCAAGGAAAGGATTTTTGGATATTCTTTATATAGTTCCAATTCACTACTTTGTTTGGATATTTTATGTAGCATTGCTAGTAATAGATTGATGACATTCTTAAACCTGTAAAGTTTCCCACTTGAGCAATATGGTTAGAAAGGAATCTTTGAAAATAAGTGCGATGAATGAGAAATTGTGAGGGTCTTTTTTAGAAAGGAAACAAGACTTTTCATCGATGAAATGAAAAGAGATTAATGCTCAAAGTACATAAATTCTAAAGGAAGAAAAGAAAAAGAACAAGAGAACATTAAATAGAGGAAAACTAAGCATACTAAACATACATAAACAGTGAACAAAGTAGACAACAAAATGATGGAAAAGAGTGCAACACGATGAGACTTTGAGACGAGCCGACTTCAAACGAGCAAAATACATAAAACCATGAGAAGACTACCAGAATATCAAGACCAAGGCCTTTCAAATCAAAATACCAAGAACATGGCCTGCGATACATAAACCCAAAAGAAAAAGTAATCTTCTAGGAACTTTGCACCAATACCAATTTAACAACTTTCATATCTGCGATAAAAGCTTTACTGCTATAGGAATGTGGTTGAACCCAAGACCATGAGAAATCCAAGAAAATAATTCAAGTCATATTAGAAAATCTTTGCACGATCAAGCTGAAACTGTGAAATACTTCTCAACTAGAGCCCACAAATGTCAACCAGAGAAATTGTGAGGGTTAAAATTTTATTTTTACCTGTTCTTGGTCAAAGAATGTGCAACTCGTAAATTTATTTGGCATGCTCTCTAGTGGACAAACTTCCTTTTTTTTTTATCGGAGACAAAAGAACTTTTAGAAGAAAATATTAAAGTATGGAACAGCAGAATCCCAGGCTGGATGGAGCCAGAATAAAGAATAAGATATGTGTAAACTAAAGATAAATCTAGATAGCAAACGATAATATAAATAAAGAAACAAACAAGTAGTTGGGAGATAGACCTCTCTATTTTCTTTTTCCTTGTTTGTACGATTTATCCTCAATGAAAAACAAGTTGGTGGCTGATGTTTTGCCTACTCGTTCCTCTTCTTTTCCCCTGTTTTAGGGTTCCTGCGCCCTCTTTCCAACAGGGAAACAATGGATATTACTGCCCTTTTGTTGTTGATTGGGTTGTTTTTCCCTTTTCCAAGGAGAAGGGGTGTTTTGTTGTTTTCCAAGAAGAACGGATGCTTGTATTTGGAGTTCTTGTCCCTTTTAAGGCTTCTTTTGTAAATCCTTTTTTTGTTGTTTAGCCAATCTCTCTATCTTGAGTGATTTTCTCTTTGCCACATTAGAGAGATGATCTAGGAGTTCTTCAACGT
CTCTTTGTGGGCTTTGATGGCCAAGCCAATTATCCATTAGGTGTTATTGTATTGCATTGGAGGTCCTTTCCTGCTCTCCTTCTTGTGGGTTTTATTTTCTTTTTTACAATTTCATTTTTCTCAATTAAAGTTTAGTTTCTTCTTATTCTTCTTCTATGTATATAAACATAATGTGGGTGGTGATATATACATACATACATATATGAACATTATGTGTGTAGTTTGTTGGGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTATGCCTTTGTATTCTTTCAAGTTTAGTTTCTTCTTCTTATTAATATGTGTGTGTGTGTGTGTGTGTGTTAACATTATGTGGGTGGTGTGCAAGTAGTTTGTTCGAACATGAAACTGGCAGGAGTTTACTAGAGGTGTACCTTATAGGGAAGGGCAAGGTACATATCTCCGGAAAACAGTTTGGGTATAGATTGCATTCATGTAAGGATGATAAAAGCCCACATTATCTAACCTCGGTGCTCAAGGTTTTTGAGGCTTGGTCTCTAAAAATCAAAGCAATTTTTCTTTAATAAGAAACCAAACCTTCATTGAGATAGATGAATACAAGGGCATACAAATGAAGTGTCCAAGAACAAAAAAGCATTCAAGAGAAGGGTCCCGATCCAAAAGAATAAAGTTAAGTTGATAATCGTTTCTCAAAGTTCCTCCCCAGACCGCTTCACCCTACAAAAAATTCTACCATTTGTCTCAACCAAGATGCCCTACAAGGGAAAAAAACTGCCACAAGACCTTTCCTTTGTCAAGAAAGGGAGGGTTCAAGAGCACCTGCTCAAACATAGAGCAATGCTGTCTTACAAGCCAAATTACACAATCTCTAAAGTCCTAAAGAACCAATTCCAACATGGACATAACAACATGACAATCCCAAAGCAAATGGTACATATCCTCTCCCTAATTCCTCCAAAGAATACACCAATGAGGCTGCAACACTAAGGAAAAAGCCTCTGAAGATGGTCCTCAGTATTCAATCTCCCATGCAAAACTTTCCCGACAAAGGATTTTAACCTTCTAAAGAGTGAAAAGCGTGTCTATGCTTTAAGTAAAGCTTGTATACTGTGCTTGGGTTTTGAAAAAAGTCTTTTGAAAGTTTGTTAAATGGAAGAAAATCTTCTAATATAAGGATGGAAGGGCTTTTTCCTCTTCTTCTTCTTTCTTTATTCATTATTGTTATTTTTTAAAAAAGTTTATTTTAATTATTTATTTATTTAATTTAAATTTATTTTTTCTGGACAAGAAACTTAACTTTTATTATCAGAAGAGAGTGCAAATGAAGAGTTTATGTTGAGTGGGTCTAGCAAACTTCTGCAACTAAAAACCTACTGTCTTTCTTCACATTCTATCTCGCTTGAGGATGGGCCGTGTGGAAAGTTGACCTGGAAATTTGAGATCTTGGCTGCATATCTCCTTTATTTCCAGCCGTTGATTCAGTCATTTATCCTAAACCTGTCATTCTAGTAGTTTGCATGTGTGGAGAATGGGCTCTTTTGTAATGTAGCATTTTCTTAAGTAATGAAACATCAATTAGGTACAGCAAAGGAAGTATATTTTGGCTACATAGCGGGAAGAGTTATTCTTCTTTTTTTTTTTTTGTTTTTTTGTTTTGACTTGTATACGGGGAGATGATCGCATACCAATATCTCTATTCTTTCTGTCACGTAGGGAAACTTCAACAGACAATTTGGGAGTCATAGGTGAAGAAAAGATTGATTCAAACTTGAACAGGCCTGAGAGAATAGATGTAACCAGGAAGCTTAAAATTCAAAACAATCAAAATGGTGCTAGTGCGAACAATTTGAATAAAGGGGCTAACACTTATTGTGAAGGATTGGAAAGCTTGCCAGATAAAACTAACTGTGTGAGTAGTTTGTTTGATGCAATCTGTAGGCCCCACAGTATTAGTGTGGATGACTTAGTTCCCTCCTATCCAGTGGAGAAGAAAAAGGATCT
ACCTAACACTAGCCAAGTTATCGTTTCCTATACAAAGAAAAGAAATGCTAGGCAAGTTGACAATCGCCATGAAGTGATGATCCCATGTATGGTAAATGAATCAAATGCCTCAGAAAGTGGTATCAAAGTCAAGGTAAGACATTGAAGTCTCTTCACTTAGTCTTGATTTTGTATTTATATTTAGGGACCCATATTGTCTTAATTTTGTATTTTCTCATGGGAACATTGACTCAGATTTTATTCTCTCCTATAAATTTTGGCATTTAATCATTTCTATATTTTAAGAATTATTATGGGTTCATCACCTGAAGGATTTGGACATGATAATTTTTGGAGCAGATTTTGAATAATATTTTTTTGGTTATTTTGTATGCACTACTTTAATCACTCAAGATGAGATTCAATGTTGGGACCTATGGGCCAAGGGACAGCTTGAGGATGGATTGACGTGGTTACTCTGCTTTAGACTCCACAAAAGATACTATTTGGTTTTTGGCTTCATTTTTGTGATGGATTAAATTGAAGAACTTGTACATTTAAATCCCCGACTTTTTTGATAATGTAAAGAATTATGTCGGGTGTAGCAAGGTAAATGATTAAAAAGAATGAAGTTTTTGAAGGAATTGTTAAGTCCTTTGAAAATGATTAATTACTTCTCAACAAACATAGTGTACACTTGGTGTTCAAATTCTGGCATGGCCATATCTGTCCTTCAAATCAATTATCTGTGTGATGAGCTTGTTAATCAAAGGAGAGTTTCAATCTGCATGTGACAAAATGATAGTTATTAATGTTGTAAAATTAATTAAATAATCTTGTGCTAAATCTCAGACTTTTCAGATTGTATAATCTGGTAGCTTTTAATGAATTAGTTTTTACTATATTTTCTTTTATTTATAATTCAGCTGATAGCATATTTTATTTTCCAGGATGGGATATTAGCTACAAACCCGTGTATTGCTGAATGCAGTGGTGAAAAGATTGCTTCTGGAAATCTCTCTGACAATATTTCATTTGATCAAAATAGGAATGGTGATCATGCTCTCATCACCTGTCAATCAAACACAGAGCATCTTTCCAAGTTACAAGCAATTATTGTTTCGAAAGAAACTGCATTGTCACAAGCTGCAATTAAGGCTCTAATCAGAAAGAGGGATAAACTGGTACAAACATACATCTTATATTGTGGTTAAGTTTAGGGATTATCCTTTTCTCATGAATTTAATTTGCACCAGAAACTTTCTTTTAAAGTTTAGCATGATGCAAATAATGGGAACACATATTATATGATGAAGAGCTGAGAGCTTACACCATAATTGTAAAATAGAGATGAGCATGTAATGATTTAAATCTCAACCTTGGTAGACGAGATCATAGGAAGATAATTGATCTTGCTCTGGTATATTATTGTATGTATTACCTTATAATGGTGTCGCCACGTGCCCTAAAAAAAAGACAAAATTAATACAAAATTAAATTAATAATTAACTATATTTTAAATAAGGCTGGACAAAAATAGGTGGCTACAAATGGCATTCAGCTGTTTGTTCTTCTTTCTTTGAACAAGGGTTGCTCATAGATCGAGCCCTTTTCAATTGTATTACATAATTGTTATTTTTGACACTGTAATTCATTCATTCTGTCTGTAGTCTCATCAACAGCGCCTCATTGAAGATGAGATCGCTCAGTGTGATAAAAACATGCAGACAATATTAAGGGGTATGCTCTATTGTCTCTTTTCCTTTTTCTGTTCCTCCTGTTTTGTTTTGTTTCTACACTTTTGTCTATTTGAAATGTTACAAATACAAAAGCCTCTAGAAATTTAGATAATTTCCTTCTTCAAATACTTGAAGTCCTCGGTCCTCTTTTTTGCTTCAGTGTTAATTGATGAACCCTCACACCCCAGCTATCTATGATTTATGGTCTTACAAGCCTATGTTGCACCCAAGATAAAGTATTTTTCAGTATTGGCTAGAGACTAAGATTTGAATTGTCGTCAT
GATCACTATTCCAAGGATTATGCAGATCCTGAGCTTCAAGCAGGAGTAATATTTGTATTTTTGGTTTTGGTATTGCTTTACTAAATCGTTTGATGTATATGTTGGAGCTAAATTTTCAATATTGTATGTATGTTGATGAATTGTGTTAAATTAGAACAAGTAATATTAGAAACTCAGATTTAAATTAAAAATATGTATTGAAAAAAGTTTGAGATTGTTATTAGTGCATCCTAGTATTTTAAACAAAAGAAAAAATATTGGATAGGCTTAGCAAAAACATGTGGGGTCTAAGTACTCATCATGTTTATTTATTTATTTGTTTTTTTGTTTGTTTATTGAATATATATAATTAACACTAATGTACTTGTAGCTAACACCATTGGAGATGCTTTTTAACATTTAAAGGAAATGCTTTAAATTTGACATTGAGTTTTACTTTTGTTTCCTGGTTTTACTTATTTGATTCCATAATTCTTTCTAGATCTTATTTTGATCTATTTCATCAGATCAGTTTGATCAATAGTTATCATAATTAGCCAAAGAAAATGTCTGAATTACATTCATATTTTGGTTGCTAAAGGTGACGAAGATGATTTGGTTATAAAGCTGGATTCTGTGATTGAATGTTGTAATGATGTATGCCTAAGAAGCACTGCCGAAGATAAATCTTATCAATACTTCGAAGAAAACTGCTCATCTCAATATGTCACAAGGAAGAGGTTGTCAGAAGCAATTCTCTGCGTACAGAATCCATGTCAGGTGGGTTAATACTTGAAGAAAATAATATTAAATTTTTTTAATATACCAGGGTATTTGCTTGGTTGTATTTGTGTTATATTTATGTCTTAATGCTGCATACAACATGGACATATATCTTTTTTTTTTTTTAATTATGGGAAACTCATCTTTCATTACAAGAGATATAAGTAAAAGAGAGGCTACAAGATCAGAGGGGCATGAAAGGAACTTTTCAAAAGAATTGCCAAGCTAAAATTATACAGCACAAAAACACAGAAGACATTGGTCTATTATTTCAGACAACTGAACTGGAAGAATAGAGTGATATTAAATAGCTTTGTTATGGCGCTTAGTATACAAGTTGGAATTACTTGGACTTGAAACAAATTACACCTTCCATAGGGTGTCCCACCACCTTCTAGCCGCTTACTATTTTATCATAGAGATATAGTATAGCCAATATCACCAATATCATTGCTACCATCAACTAGTAAACCCTTAGGCTTATTATTTGAGGTCTTACTATAAAATACAGTGACAAAAGGAAGATCAAAAGGTGAGAATTAACTACAATGGGTTGACCTAGGGGTTAAGTTGAAAAAATTAAAGGGACTTTGAGTTGTGGTTTCAAACATCAATGTGCAAGTTGGTCTGGTACTTAACAGTATAGAGAAAGTTGGTATGGTACTCAACAGTATAGAGAAAAAGAAAAGAAAAGAAATAAAGGAATTTTTGGCTAGGTAATTGACCAAATAAATTTTTTCGGGTTATCATAATTCATAGAGATCCTATCACTGAAATAATTTTTTTTTTTAATATGAACAGTTATCAGCCTGATAACACTGGTACCCCTGAAGGTGTGAAAGTTGATGTAGCTATTTGCTATGTTGTGGTAGTAGTGTGAAGATTGGATATTCGTCAAACGTAGTTCTTTGTGATGCCAAACAGTCTAACAAATGTAAACATGGACATTATGATATCATTTTTCAAACTAGAAATGACATAAGAATGCATTATAAAAAAGGTTGGAAGTAAGACTTCTAGTTTATGCAAGGAAAAAACTACCTCAATATTCTTTATGCTATGCACCTAACTAGTTTTGTCTACTTGTTCATTTTATGTTTTAAACTTCTTTATGGCCCCTCTTGTATGAAAGAGATCTTTTCTCTGATTTCTCACTGTAGAGCTTTCTCTCAATTATCTTTTATAACTGGGACACCTGCTATCTCATTTTTTTAAAAAAATACAAATCTTTATTGTGG
ATGGAACGTGTTCCTGATAAAATAAAAATTGAGAACTTAGTCTATGAGCTTAGTTATAAATACAAATTTTATCTTTCTATTTCAGGGTGATGTTAGGCCTCCAATAGTAAGAGTGTACGAGAGGAGGGGTAAAAAGTGAAATGTATCCTTTAAAAGTTAGATAGGATGAATTGGGATTGATGTAAGGGTGAGGGAGAAGAGGAATCTTGTATGACGTTTGTATAGGGAGAGAAGCTGGCCCTCAAGTTTTGTAAGGTGCTTGGTTCACCTCTTTCTTTGTCAATAGTAATAATATCAGTGTTCGCACATGCTTTCTCTATTATATACTTTCTCGTTCTTTTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAGAAAAAAGAAAAAAAGAAAAAGGAAAAGATCTTAGTTAATGATAATTGTAGAAAAACCAAACCAAACCTTAAGATTGACTTCAGCTTGGTCGAGCCTGTAATTGGAATGCTAACTTTTGAAATTGGCTTAACAGATCCCTGTCTTCTAGATTCTTTAATTTAAACTGCAATATCCACGTGGATCCGGTGGATTTTCCTTTTTGTTAAGAGAGAATTTCCTTTCTAGGACAAGGTACACATAAATGTCAAGTTGTAATGGCCTCAAAGTAACAAACCTTTATATGTTACCCAGGATGCAGTGAATTTATCATAAAACCTTGTAACTTGTAATGCCATTGATGAGATATGCAGTTGAGGATGCTGCTAAGATGGTATTTGTTATGTATGGCAAACTTAGGCTCTAATAAAATGTGTATTGTATTCATCTCTCTAATGTGTTGACAAGAACTGTATCTTGTATGTCAAATTTATTGAGACCGGTTATCAATTTGAACTTCCTAATGCACAAGAAACATCGAACATCATAAATTCTGAAAAGCATGTAGCATAAGGATTCAGAAACGGTTTAAAATGGCGTTGGGATGTACATTTATTCTATACAGGCATTTAAGAGTGAAGAAATGGTAGTCTGCTATACATTTTTATTTCATTATTTCCATATTTGTTTACATTTGTAGTTGTAAATGTAATGATGCTCTGTTGACTTGGCAGGAACTGGATGGTATATGTCATAAGAATAATTGGATATTGCCAGTTTACGGAGTTTCGTCATTAGATGGTAAGATCTTTGTAGATTCAGTCTGTTATTACTCTCTCTCTCTCTTTTTCCATTTTGCATTTTTGTATTTTGTATTTTTTTTTCATTTTGTTTTGTTTTTTGTTATTTTTTGAAAGGTAATGATTGTTACTGATCGATAGACATTAAATTCTTTTTAGCACTGTGTTTCTTTCAAGTATAGATTATGTAGAATGAATGATAAAATATACTTCTCCCTTTCCCCTGGAGTACAATGTGATCGCATTATTTTTTTGGGATTGCACATTATTTTGTTTCTATTCTGTCTTTTTTTTATGTTTATTGCAAATCATTTTGATTTACATTGTGGATTCTACCTTCTTGCATCTCAAAATATCCTACAAATACAAATTATTTACTCAGGAAAAAGTACAGTGGATTGCAAATTAGATGGTGCCTCATTCTTGAGGCTGGTATGGTTTTTCTACAATGTGGGATGGCTGATCATGCATTTTTCTGGTGTTCTTTGCAATTTGAAATCATTTGACAAGCATCTAATATGGAATAGTGGTGCTAAATTGTATTCTGTGTTTTGAAGGTGGATTCCAAGCTAATGTATTTTTAAAAGGGATGGATTTTGAGTATTCAAGCTGTGGCGAGCCGTGTTCCGACCCCCGTGAAGCGAGGGAATCAGCTGCAACGAAGATGTTAGGTCAACTATGGAGGATGGCAAATCTGGCCAAGTAGTTTTAGGAGCTTTGAAGAAGGATGTTGTAGTATGTTTTGCCCCTAATGGATCTGTTTTTGGGCTCTCCAGGGAGGAAAGGGGGAAAGAGAAGCTTATGCTCTCCAAGATCGTGTAG
mRNA sequence
ATGAGTGTTCCGGGTGTATGCCCAACCGAGGATGCCATACATGCATTATTAGATTATTTAGTTGAGCCTATGCTTCCTGCAAAGTCATCTTCGAGAGAAAATCCACCACAAGCTCTACTGCAATCAGTTGCAAAACAGATGCATGCAGTTGTTCTATTATACAACTACTACCACCGGAAACAACATCCACACCTTGAATATTTGAGTTTTGAGGCATTTTGCAAGTTAGCTGTGGTCGTTAAACCAGCTTTGTTGTCTCACATGAAACTTATGCAAAGCTCAGATGATATCAAATTGGAAAATCCCGAGAAGCAGCTTTCTCCAGCAGAAAAAGCAATTATGGATGCATGTGATATAGCCACGTGTCTAGAGGCATCAACAGATGAAAACGTAGAGGGCTGGCCTCTTTCCAAGGTTGCTGTTTTTTTAATTGACTCCAAGAAGGAGCATTGTTATTTGCTATTTAGTCTCATCACTCAAGGAGTTTGGTCTGTCATTGAACAAGATATAGATACCTCTGAATGCCAACCAGAAACTGTGGATGAGGAAAAACATGTAAACAAAAAGAAAAGAGTGATTAAGAAACCTTCAAAAGAGGGGCTAGTTCTTGATGAAACTAAGACTCAGCAGCTTGCATATACAGCAGTTAAGGAAGCAACTGGGATTAATCAAACCGATCTCAAAATTTTAGAAAGTCATGTTGTATACTCTCTAAGTAAAGAAAAATCAGCAGTCTGCTTTTATATGATTCAGTGCACTCGATCAGGAACCGAAGAAGTAATTCAAGTTCCCATAAGAGATGCCGTTGACAGTTTGCAGGATTCGTTGTTTAAAATAAATGGTAGGAGATGGAGCATTACTTCAAAAGTTGAGTACTTCCACATTCTTCCTTATGCTAAGATGGCGCTAACCTGGTTTCGTAGGGAAACTTCAACAGACAATTTGGGAGTCATAGGTGAAGAAAAGATTGATTCAAACTTGAACAGGCCTGAGAGAATAGATGTAACCAGGAAGCTTAAAATTCAAAACAATCAAAATGGTGCTAGTGCGAACAATTTGAATAAAGGGGCTAACACTTATTGTGAAGGATTGGAAAGCTTGCCAGATAAAACTAACTGTGTGAGTAGTTTGTTTGATGCAATCTGTAGGCCCCACAGTATTAGTGTGGATGACTTAGTTCCCTCCTATCCAGTGGAGAAGAAAAAGGATCTACCTAACACTAGCCAAGTTATCGTTTCCTATACAAAGAAAAGAAATGCTAGGCAAGTTGACAATCGCCATGAAGTGATGATCCCATGTATGGTAAATGAATCAAATGCCTCAGAAAGTGGTATCAAAGTCAAGGATGGGATATTAGCTACAAACCCGTGTATTGCTGAATGCAGTGGTGAAAAGATTGCTTCTGGAAATCTCTCTGACAATATTTCATTTGATCAAAATAGGAATGGTGATCATGCTCTCATCACCTGTCAATCAAACACAGAGCATCTTTCCAAGTTACAAGCAATTATTGTTTCGAAAGAAACTGCATTGTCACAAGCTGCAATTAAGGCTCTAATCAGAAAGAGGGATAAACTGTCTCATCAACAGCGCCTCATTGAAGATGAGATCGCTCAGTGTGATAAAAACATGCAGACAATATTAAGGGGTGACGAAGATGATTTGGTTATAAAGCTGGATTCTGTGATTGAATGTTGTAATGATGTATGCCTAAGAAGCACTGCCGAAGATAAATCTTATCAATACTTCGAAGAAAACTGCTCATCTCAATATGTCACAAGGAAGAGGTTGTCAGAAGCAATTCTCTGCGTACAGAATCCATGTCAGGAACTGGATGGTATATGTCATAAGAATAATTGGATATTGCCAGTTTACGGAGTTTCGTCATTAGATGGTGGATTCCAAGCTAATGTATTTTTAAAAGGGATGGATTTTGAGTATTCAAGCTGTGGCGAGCCGTGTTCCGACCCCCGTGAAGCGAGGGAATCAGCTGCAACGAA
GATGTTAGGTCAACTATGGAGGATGGCAAATCTGGCCAAGGAGGAAAGGGGGAAAGAGAAGCTTATGCTCTCCAAGATCGTGTAG
Coding sequence (CDS)
ATGAGTGTTCCGGGTGTATGCCCAACCGAGGATGCCATACATGCATTATTAGATTATTTAGTTGAGCCTATGCTTCCTGCAAAGTCATCTTCGAGAGAAAATCCACCACAAGCTCTACTGCAATCAGTTGCAAAACAGATGCATGCAGTTGTTCTATTATACAACTACTACCACCGGAAACAACATCCACACCTTGAATATTTGAGTTTTGAGGCATTTTGCAAGTTAGCTGTGGTCGTTAAACCAGCTTTGTTGTCTCACATGAAACTTATGCAAAGCTCAGATGATATCAAATTGGAAAATCCCGAGAAGCAGCTTTCTCCAGCAGAAAAAGCAATTATGGATGCATGTGATATAGCCACGTGTCTAGAGGCATCAACAGATGAAAACGTAGAGGGCTGGCCTCTTTCCAAGGTTGCTGTTTTTTTAATTGACTCCAAGAAGGAGCATTGTTATTTGCTATTTAGTCTCATCACTCAAGGAGTTTGGTCTGTCATTGAACAAGATATAGATACCTCTGAATGCCAACCAGAAACTGTGGATGAGGAAAAACATGTAAACAAAAAGAAAAGAGTGATTAAGAAACCTTCAAAAGAGGGGCTAGTTCTTGATGAAACTAAGACTCAGCAGCTTGCATATACAGCAGTTAAGGAAGCAACTGGGATTAATCAAACCGATCTCAAAATTTTAGAAAGTCATGTTGTATACTCTCTAAGTAAAGAAAAATCAGCAGTCTGCTTTTATATGATTCAGTGCACTCGATCAGGAACCGAAGAAGTAATTCAAGTTCCCATAAGAGATGCCGTTGACAGTTTGCAGGATTCGTTGTTTAAAATAAATGGTAGGAGATGGAGCATTACTTCAAAAGTTGAGTACTTCCACATTCTTCCTTATGCTAAGATGGCGCTAACCTGGTTTCGTAGGGAAACTTCAACAGACAATTTGGGAGTCATAGGTGAAGAAAAGATTGATTCAAACTTGAACAGGCCTGAGAGAATAGATGTAACCAGGAAGCTTAAAATTCAAAACAATCAAAATGGTGCTAGTGCGAACAATTTGAATAAAGGGGCTAACACTTATTGTGAAGGATTGGAAAGCTTGCCAGATAAAACTAACTGTGTGAGTAGTTTGTTTGATGCAATCTGTAGGCCCCACAGTATTAGTGTGGATGACTTAGTTCCCTCCTATCCAGTGGAGAAGAAAAAGGATCTACCTAACACTAGCCAAGTTATCGTTTCCTATACAAAGAAAAGAAATGCTAGGCAAGTTGACAATCGCCATGAAGTGATGATCCCATGTATGGTAAATGAATCAAATGCCTCAGAAAGTGGTATCAAAGTCAAGGATGGGATATTAGCTACAAACCCGTGTATTGCTGAATGCAGTGGTGAAAAGATTGCTTCTGGAAATCTCTCTGACAATATTTCATTTGATCAAAATAGGAATGGTGATCATGCTCTCATCACCTGTCAATCAAACACAGAGCATCTTTCCAAGTTACAAGCAATTATTGTTTCGAAAGAAACTGCATTGTCACAAGCTGCAATTAAGGCTCTAATCAGAAAGAGGGATAAACTGTCTCATCAACAGCGCCTCATTGAAGATGAGATCGCTCAGTGTGATAAAAACATGCAGACAATATTAAGGGGTGACGAAGATGATTTGGTTATAAAGCTGGATTCTGTGATTGAATGTTGTAATGATGTATGCCTAAGAAGCACTGCCGAAGATAAATCTTATCAATACTTCGAAGAAAACTGCTCATCTCAATATGTCACAAGGAAGAGGTTGTCAGAAGCAATTCTCTGCGTACAGAATCCATGTCAGGAACTGGATGGTATATGTCATAAGAATAATTGGATATTGCCAGTTTACGGAGTTTCGTCATTAGATGGTGGATTCCAAGCTAATGTATTTTTAAAAGGGATGGATTTTGAGTATTCAAGCTGTGGCGAGCCGTGTTCCGACCCCCGTGAAGCGAGGGAATCAGCTGCAACGAA
GATGTTAGGTCAACTATGGAGGATGGCAAATCTGGCCAAGGAGGAAAGGGGGAAAGAGAAGCTTATGCTCTCCAAGATCGTGTAG
Protein sequence
MSVPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPQALLQSVAKQMHAVVLLYNYYHRKQHPHLEYLSFEAFCKLAVVVKPALLSHMKLMQSSDDIKLENPEKQLSPAEKAIMDACDIATCLEASTDENVEGWPLSKVAVFLIDSKKEHCYLLFSLITQGVWSVIEQDIDTSECQPETVDEEKHVNKKKRVIKKPSKEGLVLDETKTQQLAYTAVKEATGINQTDLKILESHVVYSLSKEKSAVCFYMIQCTRSGTEEVIQVPIRDAVDSLQDSLFKINGRRWSITSKVEYFHILPYAKMALTWFRRETSTDNLGVIGEEKIDSNLNRPERIDVTRKLKIQNNQNGASANNLNKGANTYCEGLESLPDKTNCVSSLFDAICRPHSISVDDLVPSYPVEKKKDLPNTSQVIVSYTKKRNARQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNISFDQNRNGDHALITCQSNTEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDEIAQCDKNMQTILRGDEDDLVIKLDSVIECCNDVCLRSTAEDKSYQYFEENCSSQYVTRKRLSEAILCVQNPCQELDGICHKNNWILPVYGVSSLDGGFQANVFLKGMDFEYSSCGEPCSDPREARESAATKMLGQLWRMANLAKEERGKEKLMLSKIV
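The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the table-building approach are illustrative choices, not part of the source database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGAGTGTTCCGGGTGTATGCCCAACCGAG"))  # MSVPGVCPTE
```

Passing the full CDS to `translate` should give the protein sequence listed above, provided the wrapped sequence lines are joined into a single string first.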
Homology
BLAST of HG10003813.1 vs. NCBI nr
Match:
XP_008445716.1 (PREDICTED: uncharacterized protein LOC103488666 isoform X1 [Cucumis melo] >XP_008445717.1 PREDICTED: uncharacterized protein LOC103488666 isoform X1 [Cucumis melo])
HSP 1 Score: 1220.3 bits (3156), Expect = 0.0e+00
Identity = 614/684 (89.77%), Positives = 647/684 (94.59%), Query Frame = 0
Query: 1 MSVPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPQALLQSVAKQMHAVVLLYNYYHRK 60
MS PGVCPTEDAIHALLDYLVEPMLPAKSSSRENPP+ALLQSVAKQMHAVVLLYN+YHRK
Sbjct: 1 MSAPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPEALLQSVAKQMHAVVLLYNFYHRK 60
Query: 61 QHPHLEYLSFEAFCKLAVVVKPALLSHMKLMQSSDDIKLENPEKQLSPAEKAIMDACDIA 120
QHPHLE+LSFEAFCKLAV+VKPALLSHMKLMQSSDDI+LENPEKQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEAFCKLAVIVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACDIA 120
Query: 121 TCLEASTDENVEGWPLSKVAVFLIDSKKEHCYLLFSLITQGVWSVIEQDIDTSECQPETV 180
TCLEAS DEN+EGWPLSKVAVFL+DSKKEHCYLLFS ITQGVWSVIEQDID+SE QPETV
Sbjct: 121 TCLEASPDENIEGWPLSKVAVFLVDSKKEHCYLLFSFITQGVWSVIEQDIDSSEWQPETV 180
Query: 181 DEEKHVNKKKRVIKKPSKEGLVLDETKTQQLAYTAVKEATGINQTDLKILESHVVYSLSK 240
DEE+HVNKKKRVIKKPSKEGLV+DETKTQQ+AYTAVKEATGINQ+DLKILESHVVYSLSK
Sbjct: 181 DEERHVNKKKRVIKKPSKEGLVVDETKTQQVAYTAVKEATGINQSDLKILESHVVYSLSK 240
Query: 241 EKSAVCFYMIQCTRSGTEEVIQVPIRDAVDSLQDSLFKINGRRWSITSKVEYFHILPYAK 300
EKSAVCFYMIQCTRS TE+VIQVPIRD V+SLQDSLF+ +GRRWSITSKVEYFHILPYAK
Sbjct: 241 EKSAVCFYMIQCTRSATEDVIQVPIRDVVNSLQDSLFRKSGRRWSITSKVEYFHILPYAK 300
Query: 301 MALTWFRRETSTDNLGVIGEEKIDSNLNRPERIDVTRKLKIQNNQNGASANNLNKGANTY 360
MALTWF RE+S+D LGVIGEEK+D NLNRPERIDV R+LK+QNNQNGASANNLN AN Y
Sbjct: 301 MALTWFHRESSSDKLGVIGEEKVDENLNRPERIDVIRRLKVQNNQNGASANNLNIRANIY 360
Query: 361 CEGLESLPDKTNCVSSLFDAICRPHSISVDDLVPSYPVEKKKDLPNTSQVIVS----YTK 420
+G E LPDKTNCV SL DAI RP S SVDDLVPSYPVEKKKD+PNTSQ IVS YTK
Sbjct: 361 GKGFERLPDKTNCVGSLHDAIYRPQSTSVDDLVPSYPVEKKKDVPNTSQAIVSYTKTYTK 420
Query: 421 KRNARQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNIS 480
K RQVDN +E+MIPCMVNES+ASESGIK KDGILATNPCIAECSGEKIASGNLSDNIS
Sbjct: 421 KITDRQVDNSYELMIPCMVNESDASESGIKAKDGILATNPCIAECSGEKIASGNLSDNIS 480
Query: 481 FDQNRNGDHALITCQSNTEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDE 540
FDQNRNGDHALITCQSN EHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDE
Sbjct: 481 FDQNRNGDHALITCQSNAEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDE 540
Query: 541 IAQCDKNMQTILRGDEDDLVIKLDSVIECCNDVCLRSTAEDKSYQYFEENCSSQYVTRKR 600
IAQCDKNMQTILRGDEDDLV+KLDSVI+CCND+C +STAEDKSYQYFEENCSSQYVTRKR
Sbjct: 541 IAQCDKNMQTILRGDEDDLVLKLDSVIDCCNDLC-QSTAEDKSYQYFEENCSSQYVTRKR 600
Query: 601 LSEAILCVQNPCQELDGICHKNNWILPVYGVSSLDGGFQANVFLKGMDFEYSSCGEPCSD 660
LSEAILC+QNPCQELDGICHKNNWILPVYGVSSLDGGFQANVF+KGMDFEYSSCGE CSD
Sbjct: 601 LSEAILCIQNPCQELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCGELCSD 660
Query: 661 PREARESAATKMLGQLWRMANLAK 681
PR+ARESAA KMLGQLWRMAN AK
Sbjct: 661 PRDARESAAMKMLGQLWRMANQAK 683
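The Identity and Positives figures in an HSP summary are counts of matching alignment columns divided by the alignment length. As a minimal sketch (the helper name `percent_of_alignment` is hypothetical), the percentages reported for the hit above can be reproduced like this:

```python
def percent_of_alignment(matches: int, alignment_length: int) -> str:
    """Format a match count as a percentage of the alignment length."""
    return f"{100 * matches / alignment_length:.2f}%"


# Values taken from the HSP summary of XP_008445716.1 above.
print(percent_of_alignment(614, 684))  # identity: 89.77%
print(percent_of_alignment(647, 684))  # positives: 94.59%
```

Note that the denominator is the alignment length (684 here, including gap columns), not the length of the 681-residue query region.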
BLAST of HG10003813.1 vs. NCBI nr
Match:
XP_038884896.1 (uncharacterized protein LOC120075512 isoform X2 [Benincasa hispida])
HSP 1 Score: 1211.8 bits (3134), Expect = 0.0e+00
Identity = 610/680 (89.71%), Positives = 639/680 (93.97%), Query Frame = 0
Query: 1 MSVPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPQALLQSVAKQMHAVVLLYNYYHRK 60
MS P VCPTEDAIHALLDYLVEPMLPAKSSSRENPP+ALLQSVAKQMHAV+LLYNYYHRK
Sbjct: 1 MSTPDVCPTEDAIHALLDYLVEPMLPAKSSSRENPPEALLQSVAKQMHAVILLYNYYHRK 60
Query: 61 QHPHLEYLSFEAFCKLAVVVKPALLSHMKLMQSSDDIKLENPEKQLSPAEKAIMDACDIA 120
QHPHLE+LSFEAFCKLAV+VKPALLSHMKLMQSSDDI+LENPEKQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEAFCKLAVIVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACDIA 120
Query: 121 TCLEASTDENVEGWPLSKVAVFLIDSKKEHCYLLFSLITQGVWSVIEQDIDTSECQPETV 180
TCLEAST+ENVEGWPLSKVAVFLIDSK+EHCYLLFS ITQGVWSVIEQDIDTSECQPETV
Sbjct: 121 TCLEASTNENVEGWPLSKVAVFLIDSKREHCYLLFSFITQGVWSVIEQDIDTSECQPETV 180
Query: 181 DEEKHVNKKKRVIKKPSKEGLVLDETKTQQLAYTAVKEATGINQTDLKILESHVVYSLSK 240
DEEKHVNKKKRVIKK SKEGLV+DE KTQQLAY AVKEATGINQ+DLKILESHVVYSLSK
Sbjct: 181 DEEKHVNKKKRVIKKASKEGLVVDEAKTQQLAYKAVKEATGINQSDLKILESHVVYSLSK 240
Query: 241 EKSAVCFYMIQCTRSGTEEVIQVPIRDAVDSLQDSLFKINGRRWSITSKVEYFHILPYAK 300
EKSAVCFY+IQCTRS TE+VIQVPIRDAV+SLQD LFK +GRRW ITSKVEYFHILPYAK
Sbjct: 241 EKSAVCFYIIQCTRSATEDVIQVPIRDAVNSLQDLLFKRSGRRWGITSKVEYFHILPYAK 300
Query: 301 MALTWFRRETSTDNLGVIGEEKIDSNLNRPERIDVTRKLKIQNNQNGASANNLNKGANTY 360
M LTWF RETS DNLG IGEEKID NLNRPERIDVTRKLKIQN+QNGASAN++ A+T
Sbjct: 301 MVLTWFHRETSLDNLGGIGEEKIDENLNRPERIDVTRKLKIQNDQNGASANHMYTEASTC 360
Query: 361 CEGLESLPDKTNCVSSLFDAICRPHSISVDDLVPSYPVEKKKDLPNTSQVIVSYTKKRNA 420
EGLE L D TNCV L DAICRP S +VDD+VPSY EKKKD+PNTSQVI+SYTKKRNA
Sbjct: 361 GEGLERLSDNTNCVGGLHDAICRPQSANVDDIVPSYTAEKKKDVPNTSQVIISYTKKRNA 420
Query: 421 RQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNISFDQN 480
RQ DN +EVM PCM+NESNA ES IKVKDGILATNPCIAECSGEKIASGNLSDNISFDQN
Sbjct: 421 RQADNHYEVMTPCMINESNALES-IKVKDGILATNPCIAECSGEKIASGNLSDNISFDQN 480
Query: 481 RNGDHALITCQSNTEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDEIAQC 540
RN DHALITCQSNTEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQ LIEDEIAQC
Sbjct: 481 RNDDHALITCQSNTEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQHLIEDEIAQC 540
Query: 541 DKNMQTILRGDEDDLVIKLDSVIECCNDVCLRSTAEDKSYQYFEENCSSQYVTRKRLSEA 600
DKNMQTIL+GDEDDLVIKLDSVIECCNDVCLRSTAEDKSYQYFEENCSSQYVTRKRLSEA
Sbjct: 541 DKNMQTILKGDEDDLVIKLDSVIECCNDVCLRSTAEDKSYQYFEENCSSQYVTRKRLSEA 600
Query: 601 ILCVQNPCQELDGICHKNNWILPVYGVSSLDGGFQANVFLKGMDFEYSSCGEPCSDPREA 660
ILCVQNPC+ELDGICHKN WILPVYGVSS+DGGFQANVF+KGMDFEYSSCGE CSDPREA
Sbjct: 601 ILCVQNPCKELDGICHKNYWILPVYGVSSIDGGFQANVFVKGMDFEYSSCGELCSDPREA 660
Query: 661 RESAATKMLGQLWRMANLAK 681
RESAA KMLGQLWRMA++ K
Sbjct: 661 RESAAMKMLGQLWRMASVGK 679
BLAST of HG10003813.1 vs. NCBI nr
Match:
XP_011656540.1 (uncharacterized protein LOC101206764 isoform X1 [Cucumis sativus] >XP_011656541.1 uncharacterized protein LOC101206764 isoform X1 [Cucumis sativus])
HSP 1 Score: 1208.4 bits (3125), Expect = 0.0e+00
Identity = 605/682 (88.71%), Positives = 642/682 (94.13%), Query Frame = 0
Query: 1 MSVPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPQALLQSVAKQMHAVVLLYNYYHRK 60
MS PGVCPTEDAIHALLDYLVEPMLPAKSSSRENPP+ALLQSVAKQMHAVVLLYN+YH+K
Sbjct: 1 MSAPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPEALLQSVAKQMHAVVLLYNFYHQK 60
Query: 61 QHPHLEYLSFEAFCKLAVVVKPALLSHMKLMQSSDDIKLENPEKQLSPAEKAIMDACDIA 120
QHPHLE+LSFE FCKLAV++KPALLSHMKLMQSSDDI+LENPEKQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFETFCKLAVIIKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACDIA 120
Query: 121 TCLEASTDENVEGWPLSKVAVFLIDSKKEHCYLLFSLITQGVWSVIEQDIDTSECQPETV 180
TCLEASTDENVEGWPLSKVAVFL+DSKKEHCYLLFS ITQGVWSVIEQDID+SE QPETV
Sbjct: 121 TCLEASTDENVEGWPLSKVAVFLVDSKKEHCYLLFSFITQGVWSVIEQDIDSSEWQPETV 180
Query: 181 DEEKHVNKKKRVIKKPSKEGLVLDETKTQQLAYTAVKEATGINQTDLKILESHVVYSLSK 240
D E+HVNKKKRVIKKPSKEGLV+DE KTQQLAYTAVKEATGINQ+DLKILESHVVYSLSK
Sbjct: 181 DVERHVNKKKRVIKKPSKEGLVVDEAKTQQLAYTAVKEATGINQSDLKILESHVVYSLSK 240
Query: 241 EKSAVCFYMIQCTRSGTEEVIQVPIRDAVDSLQDSLFKINGRRWSITSKVEYFHILPYAK 300
EKSAVCFYMIQCTRS TE+VIQVPIRD +SLQDSLF+ +GRRWSITSKVEYFHILPYAK
Sbjct: 241 EKSAVCFYMIQCTRSATEDVIQVPIRDVANSLQDSLFRKSGRRWSITSKVEYFHILPYAK 300
Query: 301 MALTWFRRETSTDNLGVIGEEKIDSNLNRPERIDVTRKLKIQNNQNGASANNLNKGANTY 360
MALTWF RE+S+D LGVIGEEK+D NLNR ERIDVTRKLK++NNQNGASANNLNK AN Y
Sbjct: 301 MALTWFHRESSSDKLGVIGEEKVDENLNRRERIDVTRKLKVENNQNGASANNLNKSANIY 360
Query: 361 CEGLESLPDKTNCVSSLFDAICRPHSISVDDLVPSYPVEKKKDLPNTSQVIVSYTKKRNA 420
+GLE LPDKTNCV SL DAI RP S S DLVP YPVEKKKD+PNTSQ I+SYT K
Sbjct: 361 GKGLERLPDKTNCVGSLHDAIYRPQSTSAVDLVPFYPVEKKKDVPNTSQDIISYTSKITD 420
Query: 421 RQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNISFDQN 480
R+VDN +E+MIPC+VNESNASESGIKV+DGILATNPCIAECSGEK+ASGNLSDNISFDQN
Sbjct: 421 RKVDNSYELMIPCIVNESNASESGIKVEDGILATNPCIAECSGEKLASGNLSDNISFDQN 480
Query: 481 RNGDHALITCQSN--TEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDEIA 540
RNGDHALITCQSN +EHLSKLQAIIVSKE ALSQAAI+ALIRKRDKLSHQQRLIEDEIA
Sbjct: 481 RNGDHALITCQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIA 540
Query: 541 QCDKNMQTILRGDEDDLVIKLDSVIECCNDVCLRSTAEDKSYQYFEENCSSQYVTRKRLS 600
QCDKNMQTILRGDEDDLV+KLDSVIECCND+C RSTAEDKSYQYFEENCSSQYVTRKRLS
Sbjct: 541 QCDKNMQTILRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLS 600
Query: 601 EAILCVQNPCQELDGICHKNNWILPVYGVSSLDGGFQANVFLKGMDFEYSSCGEPCSDPR 660
EAILC+QNPC ELDGICHKNNWILPVYGVSSLDGGFQANVF+KGMDFEYSSC E CSDPR
Sbjct: 601 EAILCIQNPCLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPR 660
Query: 661 EARESAATKMLGQLWRMANLAK 681
+ARESAA KMLGQLWRMANLAK
Sbjct: 661 DARESAAMKMLGQLWRMANLAK 682
BLAST of HG10003813.1 vs. NCBI nr
Match:
XP_038884894.1 (uncharacterized protein LOC120075512 isoform X1 [Benincasa hispida] >XP_038884895.1 uncharacterized protein LOC120075512 isoform X1 [Benincasa hispida])
HSP 1 Score: 1204.5 bits (3115), Expect = 0.0e+00
Identity = 610/688 (88.66%), Positives = 639/688 (92.88%), Query Frame = 0
Query: 1 MSVPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPQALLQSVAKQMHAVVLLYNYYHRK 60
MS P VCPTEDAIHALLDYLVEPMLPAKSSSRENPP+ALLQSVAKQMHAV+LLYNYYHRK
Sbjct: 1 MSTPDVCPTEDAIHALLDYLVEPMLPAKSSSRENPPEALLQSVAKQMHAVILLYNYYHRK 60
Query: 61 QHPHLEYLSFEAFCKLAVVVKPALLSHMKLMQSSDDIKLENPEKQLSPAEKAIMDACDIA 120
QHPHLE+LSFEAFCKLAV+VKPALLSHMKLMQSSDDI+LENPEKQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEAFCKLAVIVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACDIA 120
Query: 121 TCLEASTDENVEGWPLSKVAVFLIDSKKEHCYLLFSLITQGVWSVIEQDIDTSECQPETV 180
TCLEAST+ENVEGWPLSKVAVFLIDSK+EHCYLLFS ITQGVWSVIEQDIDTSECQPETV
Sbjct: 121 TCLEASTNENVEGWPLSKVAVFLIDSKREHCYLLFSFITQGVWSVIEQDIDTSECQPETV 180
Query: 181 DEEKHVNKKKRVIKKPSKEGLVLDETKTQQLAYTAVKEATGINQTDLKILESHVVYSLSK 240
DEEKHVNKKKRVIKK SKEGLV+DE KTQQLAY AVKEATGINQ+DLKILESHVVYSLSK
Sbjct: 181 DEEKHVNKKKRVIKKASKEGLVVDEAKTQQLAYKAVKEATGINQSDLKILESHVVYSLSK 240
Query: 241 EKSAVCFYMIQCTRSGTEEVIQVPIRDAVDSLQDSLFKINGRRWSITSKVEYFHILPYAK 300
EKSAVCFY+IQCTRS TE+VIQVPIRDAV+SLQD LFK +GRRW ITSKVEYFHILPYAK
Sbjct: 241 EKSAVCFYIIQCTRSATEDVIQVPIRDAVNSLQDLLFKRSGRRWGITSKVEYFHILPYAK 300
Query: 301 MALTWFRRETSTDNLGVIGEEKIDSNLNRPERIDVTRKLKIQNNQNGASANNLNKGANTY 360
M LTWF RETS DNLG IGEEKID NLNRPERIDVTRKLKIQN+QNGASAN++ A+T
Sbjct: 301 MVLTWFHRETSLDNLGGIGEEKIDENLNRPERIDVTRKLKIQNDQNGASANHMYTEASTC 360
Query: 361 CEGLESLPDKTNCVSSLFDAICRPHSISVDDLVPSYPVEKKKDLPNTSQVIVSYTKKRNA 420
EGLE L D TNCV L DAICRP S +VDD+VPSY EKKKD+PNTSQVI+SYTKKRNA
Sbjct: 361 GEGLERLSDNTNCVGGLHDAICRPQSANVDDIVPSYTAEKKKDVPNTSQVIISYTKKRNA 420
Query: 421 RQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNISFDQN 480
RQ DN +EVM PCM+NESNA ES IKVKDGILATNPCIAECSGEKIASGNLSDNISFDQN
Sbjct: 421 RQADNHYEVMTPCMINESNALES-IKVKDGILATNPCIAECSGEKIASGNLSDNISFDQN 480
Query: 481 RNGDHALITCQSNTEHLSKLQAIIVSKETALSQAAIKALIRKRDKL--------SHQQRL 540
RN DHALITCQSNTEHLSKLQAIIVSKETALSQAAIKALIRKRDKL SHQQ L
Sbjct: 481 RNDDHALITCQSNTEHLSKLQAIIVSKETALSQAAIKALIRKRDKLCNPFILSQSHQQHL 540
Query: 541 IEDEIAQCDKNMQTILRGDEDDLVIKLDSVIECCNDVCLRSTAEDKSYQYFEENCSSQYV 600
IEDEIAQCDKNMQTIL+GDEDDLVIKLDSVIECCNDVCLRSTAEDKSYQYFEENCSSQYV
Sbjct: 541 IEDEIAQCDKNMQTILKGDEDDLVIKLDSVIECCNDVCLRSTAEDKSYQYFEENCSSQYV 600
Query: 601 TRKRLSEAILCVQNPCQELDGICHKNNWILPVYGVSSLDGGFQANVFLKGMDFEYSSCGE 660
TRKRLSEAILCVQNPC+ELDGICHKN WILPVYGVSS+DGGFQANVF+KGMDFEYSSCGE
Sbjct: 601 TRKRLSEAILCVQNPCKELDGICHKNYWILPVYGVSSIDGGFQANVFVKGMDFEYSSCGE 660
Query: 661 PCSDPREARESAATKMLGQLWRMANLAK 681
CSDPREARESAA KMLGQLWRMA++ K
Sbjct: 661 LCSDPREARESAAMKMLGQLWRMASVGK 687
BLAST of HG10003813.1 vs. NCBI nr
Match:
XP_022150346.1 (uncharacterized protein LOC111018541 isoform X1 [Momordica charantia] >XP_022150347.1 uncharacterized protein LOC111018541 isoform X1 [Momordica charantia] >XP_022150348.1 uncharacterized protein LOC111018541 isoform X1 [Momordica charantia] >XP_022150349.1 uncharacterized protein LOC111018541 isoform X1 [Momordica charantia])
HSP 1 Score: 1135.6 bits (2936), Expect = 0.0e+00
Identity = 569/680 (83.68%), Positives = 623/680 (91.62%), Query Frame = 0
Query: 1 MSVPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPQALLQSVAKQMHAVVLLYNYYHRK 60
MS GVCPTEDAIHALLDYLVEPMLPAKSSSR+NPPQ+L QSVAKQ+HAVV+LYNYYHRK
Sbjct: 1 MSALGVCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSLQQSVAKQVHAVVILYNYYHRK 60
Query: 61 QHPHLEYLSFEAFCKLAVVVKPALLSHMKLMQSSDDIKLENPEKQLSPAEKAIMDACDIA 120
QHPHLE LSFEAFCKLAVVVKPALLSHMKLMQSSDD +LENPEKQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLELLSFEAFCKLAVVVKPALLSHMKLMQSSDDTELENPEKQLSPAEKAIMDACDIA 120
Query: 121 TCLEASTDENVEGWPLSKVAVFLIDSKKEHCYLLFSLITQGVWSVIEQDIDTSECQPETV 180
TCLEAS DENVEGWPLSKVAV LIDS+KE C+LLFS ITQGVWSVIEQD+DTSECQPETV
Sbjct: 121 TCLEASKDENVEGWPLSKVAVLLIDSRKECCHLLFSFITQGVWSVIEQDLDTSECQPETV 180
Query: 181 DEEKHVNKKKRVIKKPSKEGLVLDETKTQQLAYTAVKEATGINQTDLKILESHVVYSLSK 240
+EEKHVNKK+RVIKKPSKE V+DE KTQQLAY+AVKEATGINQ DLKIL+ HVVYSLSK
Sbjct: 181 EEEKHVNKKRRVIKKPSKEVSVVDEAKTQQLAYSAVKEATGINQRDLKILDGHVVYSLSK 240
Query: 241 EKSAVCFYMIQCTRSGTEEVIQVPIRDAVDSLQDSLFKINGRRWSITSKVEYFHILPYAK 300
EKSAV FYMIQCT+S TE+VIQVPI+DA+DSLQ SLF+ +GRRWSITSKVE+FHILPYAK
Sbjct: 241 EKSAVRFYMIQCTQSATEDVIQVPIKDAMDSLQGSLFRKDGRRWSITSKVEHFHILPYAK 300
Query: 301 MALTWFRRETSTDNLGVIGEEKIDSNLNRPERIDVTRKLKIQNNQNGASANNLNKGANTY 360
M LTW +RETS D+L V+ EK+D NL++ ERID RKL+IQN+Q+G SAN+L+KG + Y
Sbjct: 301 MVLTWLQRETSRDSLRVVSGEKMDENLSKLERIDAPRKLEIQNDQDGDSANDLSKGTSIY 360
Query: 361 CEGLESLPDKTNCVSSLFDAICRPHSISVDDLVPSYPVEKKKDLPNTSQVIVSYTKKRNA 420
EGLE L +KTN V SL DAICRP +VDDLVPSYPV+KKKD+PNTSQVIVSYTKKRNA
Sbjct: 361 GEGLEKLHNKTNHVGSLHDAICRPQITNVDDLVPSYPVDKKKDVPNTSQVIVSYTKKRNA 420
Query: 421 RQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNISFDQN 480
RQVDN HEVMIPC NESNASESGIK+KDG+LATNPCIAECSGEKIASGN SDN+SFDQN
Sbjct: 421 RQVDNGHEVMIPCTGNESNASESGIKIKDGVLATNPCIAECSGEKIASGNFSDNVSFDQN 480
Query: 481 RNGDHALITCQSNTEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDEIAQC 540
RNGDHALITCQSN EHLSKLQAI+VSKETALSQAAI+ALIRKRDKLSHQQR+IEDEIAQC
Sbjct: 481 RNGDHALITCQSNIEHLSKLQAILVSKETALSQAAIRALIRKRDKLSHQQRIIEDEIAQC 540
Query: 541 DKNMQTILRGDEDDLVIKLDSVIECCNDVCLRSTAEDKSYQYFEENCSSQYVTRKRLSEA 600
DK +QTILRGDEDDLVIKLDSVIECCNDVCLR+TAED SYQ F+ENCSSQYVTRKRLSEA
Sbjct: 541 DKKVQTILRGDEDDLVIKLDSVIECCNDVCLRNTAEDGSYQCFKENCSSQYVTRKRLSEA 600
Query: 601 ILCVQNPCQELDGICHKNNWILPVYGVSSLDGGFQANVFLKGMDFEYSSCGEPCSDPREA 660
+LCV++PCQELD ICHKNNWILPVY +SS DGGFQANVF+KG+DFEYSSC E CS+PREA
Sbjct: 601 VLCVRSPCQELDAICHKNNWILPVYSISSSDGGFQANVFVKGLDFEYSSCSETCSNPREA 660
Query: 661 RESAATKMLGQLWRMANLAK 681
R SAATKMLGQLW +A+ K
Sbjct: 661 RASAATKMLGQLWSIASQRK 680
BLAST of HG10003813.1 vs. ExPASy TrEMBL
Match:
A0A1S3BE29 (uncharacterized protein LOC103488666 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103488666 PE=4 SV=1)
HSP 1 Score: 1220.3 bits (3156), Expect = 0.0e+00
Identity = 614/684 (89.77%), Positives = 647/684 (94.59%), Query Frame = 0
Query: 1 MSVPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPQALLQSVAKQMHAVVLLYNYYHRK 60
MS PGVCPTEDAIHALLDYLVEPMLPAKSSSRENPP+ALLQSVAKQMHAVVLLYN+YHRK
Sbjct: 1 MSAPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPEALLQSVAKQMHAVVLLYNFYHRK 60
Query: 61 QHPHLEYLSFEAFCKLAVVVKPALLSHMKLMQSSDDIKLENPEKQLSPAEKAIMDACDIA 120
QHPHLE+LSFEAFCKLAV+VKPALLSHMKLMQSSDDI+LENPEKQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEAFCKLAVIVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACDIA 120
Query: 121 TCLEASTDENVEGWPLSKVAVFLIDSKKEHCYLLFSLITQGVWSVIEQDIDTSECQPETV 180
TCLEAS DEN+EGWPLSKVAVFL+DSKKEHCYLLFS ITQGVWSVIEQDID+SE QPETV
Sbjct: 121 TCLEASPDENIEGWPLSKVAVFLVDSKKEHCYLLFSFITQGVWSVIEQDIDSSEWQPETV 180
Query: 181 DEEKHVNKKKRVIKKPSKEGLVLDETKTQQLAYTAVKEATGINQTDLKILESHVVYSLSK 240
DEE+HVNKKKRVIKKPSKEGLV+DETKTQQ+AYTAVKEATGINQ+DLKILESHVVYSLSK
Sbjct: 181 DEERHVNKKKRVIKKPSKEGLVVDETKTQQVAYTAVKEATGINQSDLKILESHVVYSLSK 240
Query: 241 EKSAVCFYMIQCTRSGTEEVIQVPIRDAVDSLQDSLFKINGRRWSITSKVEYFHILPYAK 300
EKSAVCFYMIQCTRS TE+VIQVPIRD V+SLQDSLF+ +GRRWSITSKVEYFHILPYAK
Sbjct: 241 EKSAVCFYMIQCTRSATEDVIQVPIRDVVNSLQDSLFRKSGRRWSITSKVEYFHILPYAK 300
Query: 301 MALTWFRRETSTDNLGVIGEEKIDSNLNRPERIDVTRKLKIQNNQNGASANNLNKGANTY 360
MALTWF RE+S+D LGVIGEEK+D NLNRPERIDV R+LK+QNNQNGASANNLN AN Y
Sbjct: 301 MALTWFHRESSSDKLGVIGEEKVDENLNRPERIDVIRRLKVQNNQNGASANNLNIRANIY 360
Query: 361 CEGLESLPDKTNCVSSLFDAICRPHSISVDDLVPSYPVEKKKDLPNTSQVIVS----YTK 420
+G E LPDKTNCV SL DAI RP S SVDDLVPSYPVEKKKD+PNTSQ IVS YTK
Sbjct: 361 GKGFERLPDKTNCVGSLHDAIYRPQSTSVDDLVPSYPVEKKKDVPNTSQAIVSYTKTYTK 420
Query: 421 KRNARQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNIS 480
K RQVDN +E+MIPCMVNES+ASESGIK KDGILATNPCIAECSGEKIASGNLSDNIS
Sbjct: 421 KITDRQVDNSYELMIPCMVNESDASESGIKAKDGILATNPCIAECSGEKIASGNLSDNIS 480
Query: 481 FDQNRNGDHALITCQSNTEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDE 540
FDQNRNGDHALITCQSN EHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDE
Sbjct: 481 FDQNRNGDHALITCQSNAEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDE 540
Query: 541 IAQCDKNMQTILRGDEDDLVIKLDSVIECCNDVCLRSTAEDKSYQYFEENCSSQYVTRKR 600
IAQCDKNMQTILRGDEDDLV+KLDSVI+CCND+C +STAEDKSYQYFEENCSSQYVTRKR
Sbjct: 541 IAQCDKNMQTILRGDEDDLVLKLDSVIDCCNDLC-QSTAEDKSYQYFEENCSSQYVTRKR 600
Query: 601 LSEAILCVQNPCQELDGICHKNNWILPVYGVSSLDGGFQANVFLKGMDFEYSSCGEPCSD 660
LSEAILC+QNPCQELDGICHKNNWILPVYGVSSLDGGFQANVF+KGMDFEYSSCGE CSD
Sbjct: 601 LSEAILCIQNPCQELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCGELCSD 660
Query: 661 PREARESAATKMLGQLWRMANLAK 681
PR+ARESAA KMLGQLWRMAN AK
Sbjct: 661 PRDARESAAMKMLGQLWRMANQAK 683
BLAST of HG10003813.1 vs. ExPASy TrEMBL
Match:
A0A6J1DAH9 (uncharacterized protein LOC111018541 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111018541 PE=4 SV=1)
HSP 1 Score: 1135.6 bits (2936), Expect = 0.0e+00
Identity = 569/680 (83.68%), Positives = 623/680 (91.62%), Query Frame = 0
Query: 1 MSVPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPQALLQSVAKQMHAVVLLYNYYHRK 60
MS GVCPTEDAIHALLDYLVEPMLPAKSSSR+NPPQ+L QSVAKQ+HAVV+LYNYYHRK
Sbjct: 1 MSALGVCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSLQQSVAKQVHAVVILYNYYHRK 60
Query: 61 QHPHLEYLSFEAFCKLAVVVKPALLSHMKLMQSSDDIKLENPEKQLSPAEKAIMDACDIA 120
QHPHLE LSFEAFCKLAVVVKPALLSHMKLMQSSDD +LENPEKQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLELLSFEAFCKLAVVVKPALLSHMKLMQSSDDTELENPEKQLSPAEKAIMDACDIA 120
Query: 121 TCLEASTDENVEGWPLSKVAVFLIDSKKEHCYLLFSLITQGVWSVIEQDIDTSECQPETV 180
TCLEAS DENVEGWPLSKVAV LIDS+KE C+LLFS ITQGVWSVIEQD+DTSECQPETV
Sbjct: 121 TCLEASKDENVEGWPLSKVAVLLIDSRKECCHLLFSFITQGVWSVIEQDLDTSECQPETV 180
Query: 181 DEEKHVNKKKRVIKKPSKEGLVLDETKTQQLAYTAVKEATGINQTDLKILESHVVYSLSK 240
+EEKHVNKK+RVIKKPSKE V+DE KTQQLAY+AVKEATGINQ DLKIL+ HVVYSLSK
Sbjct: 181 EEEKHVNKKRRVIKKPSKEVSVVDEAKTQQLAYSAVKEATGINQRDLKILDGHVVYSLSK 240
Query: 241 EKSAVCFYMIQCTRSGTEEVIQVPIRDAVDSLQDSLFKINGRRWSITSKVEYFHILPYAK 300
EKSAV FYMIQCT+S TE+VIQVPI+DA+DSLQ SLF+ +GRRWSITSKVE+FHILPYAK
Sbjct: 241 EKSAVRFYMIQCTQSATEDVIQVPIKDAMDSLQGSLFRKDGRRWSITSKVEHFHILPYAK 300
Query: 301 MALTWFRRETSTDNLGVIGEEKIDSNLNRPERIDVTRKLKIQNNQNGASANNLNKGANTY 360
M LTW +RETS D+L V+ EK+D NL++ ERID RKL+IQN+Q+G SAN+L+KG + Y
Sbjct: 301 MVLTWLQRETSRDSLRVVSGEKMDENLSKLERIDAPRKLEIQNDQDGDSANDLSKGTSIY 360
Query: 361 CEGLESLPDKTNCVSSLFDAICRPHSISVDDLVPSYPVEKKKDLPNTSQVIVSYTKKRNA 420
EGLE L +KTN V SL DAICRP +VDDLVPSYPV+KKKD+PNTSQVIVSYTKKRNA
Sbjct: 361 GEGLEKLHNKTNHVGSLHDAICRPQITNVDDLVPSYPVDKKKDVPNTSQVIVSYTKKRNA 420
Query: 421 RQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNISFDQN 480
RQVDN HEVMIPC NESNASESGIK+KDG+LATNPCIAECSGEKIASGN SDN+SFDQN
Sbjct: 421 RQVDNGHEVMIPCTGNESNASESGIKIKDGVLATNPCIAECSGEKIASGNFSDNVSFDQN 480
Query: 481 RNGDHALITCQSNTEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDEIAQC 540
RNGDHALITCQSN EHLSKLQAI+VSKETALSQAAI+ALIRKRDKLSHQQR+IEDEIAQC
Sbjct: 481 RNGDHALITCQSNIEHLSKLQAILVSKETALSQAAIRALIRKRDKLSHQQRIIEDEIAQC 540
Query: 541 DKNMQTILRGDEDDLVIKLDSVIECCNDVCLRSTAEDKSYQYFEENCSSQYVTRKRLSEA 600
DK +QTILRGDEDDLVIKLDSVIECCNDVCLR+TAED SYQ F+ENCSSQYVTRKRLSEA
Sbjct: 541 DKKVQTILRGDEDDLVIKLDSVIECCNDVCLRNTAEDGSYQCFKENCSSQYVTRKRLSEA 600
Query: 601 ILCVQNPCQELDGICHKNNWILPVYGVSSLDGGFQANVFLKGMDFEYSSCGEPCSDPREA 660
+LCV++PCQELD ICHKNNWILPVY +SS DGGFQANVF+KG+DFEYSSC E CS+PREA
Sbjct: 601 VLCVRSPCQELDAICHKNNWILPVYSISSSDGGFQANVFVKGLDFEYSSCSETCSNPREA 660
Query: 661 RESAATKMLGQLWRMANLAK 681
R SAATKMLGQLW +A+ K
Sbjct: 661 RASAATKMLGQLWSIASQRK 680
BLAST of HG10003813.1 vs. ExPASy TrEMBL
Match:
A0A6J1KZE5 (uncharacterized protein LOC111497732 OS=Cucurbita maxima OX=3661 GN=LOC111497732 PE=4 SV=1)
HSP 1 Score: 1122.8 bits (2903), Expect = 0.0e+00
Identity = 565/681 (82.97%), Positives = 616/681 (90.46%), Query Frame = 0
Query: 1 MSVPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPQALLQSVAKQMHAVVLLYNYYHRK 60
MS PGVCPTEDAI LLDYLVEPMLPAKS SRENPPQ+LLQSVAKQ+HAVVLLYNYYHRK
Sbjct: 1 MSAPGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
Query: 61 QHPHLEYLSFEAFCKLAVVVKPALLSHMKLMQSSDDIKLENPEKQLSPAEKAIMDACDIA 120
QHPHLE+LSFE FCKLAVVVKPALLSHMKLMQ+SDDI+LENPE QLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEEFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
Query: 121 TCLEASTDENVEGWPLSKVAVFLIDSKKEHCYLLFSLITQGVWSVIEQDIDTSECQPETV 180
TCL+AS D++VEGWPLSKVAV LIDSK+E C+LLFS+ITQGVWSVIEQD+DTSECQPET+
Sbjct: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETM 180
Query: 181 DEEKHVNKKKRVIKKPSKEGLVLDETKTQQLAYTAVKEATGINQTDLKILESHVVYSLSK 240
DEEKHVNKKKRVIKKPSKEG V DE KTQQLAY+ V++ATGINQ+DLKILESHVVYS SK
Sbjct: 181 DEEKHVNKKKRVIKKPSKEGPV-DEIKTQQLAYSTVRKATGINQSDLKILESHVVYSHSK 240
Query: 241 EKSAVCFYMIQCTRSGTEEVIQVPIRDAVDSLQDSLFKINGRRWSITSKVEYFHILPYAK 300
KSAVCFY+IQCTRS TE+VIQVPI+D +DSLQDSLFKINGRRWSITSKVEYFHILPYA+
Sbjct: 241 AKSAVCFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYAR 300
Query: 301 MALTWFRRETSTDNLGVIGEEKIDSNLNRPERIDVTRKLKIQNNQNGASANNLNKGANTY 360
M L WF TST++L VIG K+D NLN+PERIDVTR L+IQ+NQ+GA+A NLNKG +TY
Sbjct: 301 MMLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVTRTLEIQDNQDGANAYNLNKGTSTY 360
Query: 361 CEGLESLPDKTNCVSSLFDAICRPHSISVDDLVPSYPVEKKKDLPNTSQVIVSYTKKRNA 420
EGLE LPDKTN +SSL D +CRP + +VDDLVPSYPVEKKKD+PNTSQV S TKK+NA
Sbjct: 361 GEGLERLPDKTNYISSLNDVMCRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSCTKKKNA 420
Query: 421 RQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNISFDQN 480
RQVDN + VMIPCMVNESNASESGIKVKD ILA NPC+AECSGEKIASGNLSDNIS DQ
Sbjct: 421 RQVDNSYAVMIPCMVNESNASESGIKVKDRILAANPCLAECSGEKIASGNLSDNISLDQY 480
Query: 481 RNGDHALITCQSNTEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDEIAQC 540
RNGDHAL+TCQSNTEHL+KLQ II+SKETALSQAAIKAL RKRDKLSHQQR+IED+IAQC
Sbjct: 481 RNGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIAQC 540
Query: 541 DKNMQTILRGDEDDLVIKLDSVIECCNDVCLRSTAEDKSYQYFEENCSSQYVTRKRLSEA 600
DKNMQTILRGDED LVIKLDSVIECC DVC+RS AED+SYQ FEENCSSQY T KRLSEA
Sbjct: 541 DKNMQTILRGDEDGLVIKLDSVIECCYDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEA 600
Query: 601 ILCVQNPCQELDGICHKNNWILPVYGVSSLDGGFQANVFLKGMDFEYSSCGEPCSDPREA 660
ILCVQNPCQELD IC KNNWILPVYGVS+ DGGFQANV +KGMDF YSSC E C DP EA
Sbjct: 601 ILCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVLVKGMDFAYSSCSELCPDPCEA 660
Query: 661 RESAATKMLGQLWRMANLAKE 682
R+SAATKMLGQLW MA+ K+
Sbjct: 661 RKSAATKMLGQLWTMASQTKQ 680
BLAST of HG10003813.1 vs. ExPASy TrEMBL
Match:
A0A6J1HAN9 (uncharacterized protein LOC111461089 OS=Cucurbita moschata OX=3662 GN=LOC111461089 PE=4 SV=1)
HSP 1 Score: 1120.1 bits (2896), Expect = 0.0e+00
Identity = 566/681 (83.11%), Positives = 614/681 (90.16%), Query Frame = 0
Query: 1 MSVPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPQALLQSVAKQMHAVVLLYNYYHRK 60
MS GVCPTEDAI LLDYLVEPMLPAKS SRENPPQ+LLQSVAKQ+HAVVLLYNYYHRK
Sbjct: 1 MSATGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
Query: 61 QHPHLEYLSFEAFCKLAVVVKPALLSHMKLMQSSDDIKLENPEKQLSPAEKAIMDACDIA 120
QHPHLE+LSFEAFCKLAVVVKPALLSHMKLMQ+SDDI+LENPE QLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
Query: 121 TCLEASTDENVEGWPLSKVAVFLIDSKKEHCYLLFSLITQGVWSVIEQDIDTSECQPETV 180
TCL+AS D++VEGWPLSKVAV LIDSK+E C+LLFS+ITQGVWSVIEQD+DTSECQPETV
Sbjct: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
Query: 181 DEEKHVNKKKRVIKKPSKEGLVLDETKTQQLAYTAVKEATGINQTDLKILESHVVYSLSK 240
DEEKHVNKKKRVIKKPSKEG V DE KTQQLAY+ V++ATGINQTDLKILESHVVYS SK
Sbjct: 181 DEEKHVNKKKRVIKKPSKEGPV-DEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSK 240
Query: 241 EKSAVCFYMIQCTRSGTEEVIQVPIRDAVDSLQDSLFKINGRRWSITSKVEYFHILPYAK 300
KSAV FY+IQCTRS TE+VIQVPI+D +DSLQDSLFKINGRRWSITSKVEYFHILPYA+
Sbjct: 241 AKSAVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYAR 300
Query: 301 MALTWFRRETSTDNLGVIGEEKIDSNLNRPERIDVTRKLKIQNNQNGASANNLNKGANTY 360
M L WF TST++L VIG K+D NLN+PERIDV R L+IQ+NQ+GASANNLNKG +TY
Sbjct: 301 MMLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTY 360
Query: 361 CEGLESLPDKTNCVSSLFDAICRPHSISVDDLVPSYPVEKKKDLPNTSQVIVSYTKKRNA 420
EGLE LPDKTN +SSL D + RP + +VDDLVPSYPVEKKKD+PNTSQV SY KK+NA
Sbjct: 361 GEGLERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNA 420
Query: 421 RQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNISFDQN 480
RQ DNR VMIPCMVNE NASESGIKVKD ILATNPC AECSGEKIASGNLSDNIS DQ
Sbjct: 421 RQADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSGEKIASGNLSDNISLDQY 480
Query: 481 RNGDHALITCQSNTEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDEIAQC 540
RNGDHAL+TCQSNTEHL+KLQ II+SKETALSQAAIKAL RKRDKLSHQQR+IED+IA+C
Sbjct: 481 RNGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARC 540
Query: 541 DKNMQTILRGDEDDLVIKLDSVIECCNDVCLRSTAEDKSYQYFEENCSSQYVTRKRLSEA 600
DKNMQTILRGDED LVIKLDSVIECCNDVC+RS AED+SYQ FEENCSSQY T KRLSEA
Sbjct: 541 DKNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEA 600
Query: 601 ILCVQNPCQELDGICHKNNWILPVYGVSSLDGGFQANVFLKGMDFEYSSCGEPCSDPREA 660
ILCVQNPCQELD IC KNNWILPVYGVS+ DGGFQANV++KGMDF YSSC E C DP EA
Sbjct: 601 ILCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGMDFAYSSCSELCPDPCEA 660
Query: 661 RESAATKMLGQLWRMANLAKE 682
R+SAATKMLGQLW MA+ K+
Sbjct: 661 RKSAATKMLGQLWTMASQTKQ 680
BLAST of HG10003813.1 vs. ExPASy TrEMBL
Match:
A0A6J1EQ29 (uncharacterized protein LOC111436360 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111436360 PE=4 SV=1)
HSP 1 Score: 1106.7 bits (2861), Expect = 0.0e+00
Identity = 567/693 (81.82%), Positives = 612/693 (88.31%), Query Frame = 0
Query: 1 MSVPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPQALLQSVAKQMHAVVLLYNYYHRK 60
MS GVCPTEDAI ALLDYLVEPMLP+KSSS ENPP ALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1 MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
Query: 61 QHPHLEYLSFEAFCKLAVVVKPALLSHMKLMQSSDDIKLENPEKQLSPAEKAIMDACDIA 120
QHPHLE+LSFEAFCKLAVVVKPALLSHMKLMQSSDDI+LENPEKQLSPAEKAIMDAC +A
Sbjct: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120
Query: 121 TCLEASTDENVEGWPLSKVAVFLIDSKKEHCYLLFSLITQGVWSVIEQDIDTSECQPETV 180
TCL S DEN+EGWPLSKVAVFLIDSKKEHC+LLFS ITQGVWSVIEQ++DTSECQP++V
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180
Query: 181 DEEKHVNKKKRVIKKPSKEGLVLDETKTQQLAYTAVKEATGINQTDLKILESHVVYSLSK 240
+EEKHVNKKKRVIKKPSKEGLV+ TKTQQLAY+AVKEATGINQ DLKILESHV YSLSK
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240
Query: 241 EKSAVCFYMIQCTRSGTEEVIQVPIRDAVDSLQDSLFKINGRRWSITSKVEYFHILPYAK 300
EKSAV FYM+QCTRS TE+VIQVPI+DAVDSLQDSLFK NGRRWS+TSKVEY+HILPY K
Sbjct: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300
Query: 301 MALTWFRRETSTDNLGVIGEEKIDSNLNRPERIDVTRKLKIQNNQNGASANNLNKGANTY 360
M LTWFRRET TDNLGV+G EKID NLN+P+R DVTRKL QNNQ+ A+ NN+NKG + Y
Sbjct: 301 MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360
Query: 361 CEGLESLPDKTNCVSSLFDAICRPHSISVDDLVPSYPVEKKKDLPNTSQVIVSYTKKRN- 420
GLE LP+KTNC+SSL DAICRP S SVDDLVPS P+EK+K +P +QVI+SY KK +
Sbjct: 361 DAGLERLPNKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVPTPTQVIMSYVKKIHG 420
Query: 421 ----------------ARQVDNRHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSG 480
RQV N +E IPC VNES ASESGIKV+DGILATNPCIAECSG
Sbjct: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAECSG 480
Query: 481 EKIASGNLSDNISFDQNRNGDHALITCQSNTEHLSKLQAIIVSKETALSQAAIKALIRKR 540
EK+ASGNLSDNIS DQNRN DHALITCQSNT++LSK+QAII SKETALSQAAIKALIRKR
Sbjct: 481 EKVASGNLSDNIS-DQNRNDDHALITCQSNTKNLSKMQAII-SKETALSQAAIKALIRKR 540
Query: 541 DKLSHQQRLIEDEIAQCDKNMQTILRGDEDDLVIKLDSVIECCNDVCLRSTAEDKSYQYF 600
DKLSHQQR+IEDEIAQCDKNMQTILRGDEDD V+KLDSVIECCNDVCLRS AEDK YQY
Sbjct: 541 DKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYS 600
Query: 601 EENCSSQYVTRKRLSEAILCVQNPCQELDGICHKNNWILPVYGVSSLDGGFQANVFLKGM 660
EENCSSQ VTRKRLSE ILC++NPCQELD ICHKNNWILPVYGVSS DGGFQANV LKG+
Sbjct: 601 EENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGL 660
Query: 661 DFEYSSCGEPCSDPREARESAATKMLGQLWRMA 677
DFEYSS GE C +PREARESAA KMLGQLWRMA
Sbjct: 661 DFEYSSNGEVCHNPREARESAAMKMLGQLWRMA 691
BLAST of HG10003813.1 vs. TAIR 10
Match:
AT1G05950.1 (unknown protein; Has 50 Blast hits to 45 proteins in 14 species: Archae - 5; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )
HSP 1 Score: 340.9 bits (873), Expect = 2.4e-93
Identity = 242/671 (36.07%), Positives = 364/671 (54.25%), Query Frame = 0
Query: 7 CPTEDAIHALLDYLVEPMLPAKSSSRENPPQALLQSVAKQMHAVVLLYNYYHRKQHPHLE 66
CPTEDAI ALL+ LV+P+LP+K + + P ++ +SVAKQ+HAVVLLYNYYHRK +PHLE
Sbjct: 17 CPTEDAIRALLESLVDPLLPSKPTD-DLPSTSIRESVAKQVHAVVLLYNYYHRKDNPHLE 76
Query: 67 YLSFEAFCKLAVVVKPALLSHMKLMQSSDDIKLENPEKQLSPAEKAIMDACDIATCLEAS 126
LSFE+F LA V+KPALL H+K +D + Q EK I+DAC ++ L+AS
Sbjct: 77 CLSFESFRSLATVMKPALLQHLK-----EDGGVSG---QTVLLEKVIVDACSLSMSLDAS 136
Query: 127 TDENV-EGWPLSKVAVFLIDSKKEHCYLLFSLITQGVWSVIEQDIDTSECQPETVDEEKH 186
+D + P+ +VAV L+DS+K+ CYL S ITQGVWS++ EK
Sbjct: 137 SDLFILNKCPIRRVAVLLVDSEKKSCYLQHSSITQGVWSLL----------------EKP 196
Query: 187 VNKKKRVIKKPSKEGLVLDETKTQQLAYTAVKEATGINQTDLKILESHVVYSLSKEKSAV 246
+ K+K + +EG+ Q++A+ VKEATG+N D+ ILE H+V SLS+EK+AV
Sbjct: 197 IEKEKAARENQKEEGVF------QKVAFAVVKEATGVNHKDIVILERHLVCSLSEEKTAV 256
Query: 247 CFYMIQCTRSGTEEVIQVPIRDAVDSLQDSLFKINGRRWSITSKVEYFHILPYAKMALTW 306
FY+++CT S + + P+ + + +Q LF+ + W++ S VEYFH+LPYA + W
Sbjct: 257 RFYIMKCT-SQDKFSGENPVEEVLSCMQGPLFEKSFSDWTMNSIVEYFHVLPYATLIEDW 316
Query: 307 FRRETSTDNLGVIGEEKIDSNLNRPERIDVTRKLKIQNNQNGASANNLNKGANTYCEGLE 366
F R T+ + E + ++ ++D T++ ++ + L + + +
Sbjct: 317 FSRRGDTEFVIEKEPEAVCDDI-ESNKVDATKESEVSDIFERREKAALKRRYEIKAKKVA 376
Query: 367 SLPDKTNCVSSLFDAICRPHSISVDDLVPSYPVEKKKDLPNTSQVIVSYTKKRNARQVDN 426
+L + + L S K+ PN V K +N V N
Sbjct: 377 ALLSHPGARGKATTRLQNRY------LKGSMSGAKE---PNVHSETVVALKAKN---VGN 436
Query: 427 RHEVMIPCMVNESNASESGIKVKDGILATNPCIAECSGEKIASGNLSDNISFDQNRNGDH 486
M PC N SN + G +V A++P +++ L + N H
Sbjct: 437 E---MSPCKDNYSNGEKGGFEV-----ASDP-------KELKERGLQRKKAVPDRLNSIH 496
Query: 487 AL----ITCQSNTEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDEIAQCD 546
L + ++ +L +LQ ++SK T+LS+ A+K L+ KRDKL+ QQR IEDEIA+CD
Sbjct: 497 KLNSTPASAHNSNPNLEELQTSLLSKATSLSETALKVLLCKRDKLTRQQRNIEDEIAKCD 556
Query: 547 KNMQTILRGDEDDLVIKLDSVIECCNDVCLRSTAEDKSYQYFEENCSSQYVTRKRLSEAI 606
K +Q I + D ++L++V+ECCN+ T ++ Q + + Q R +LSE +
Sbjct: 557 KCIQNI----KGDWELQLETVLECCNE-----TYPRRNLQESLDKSACQSNKRLKLSETL 616
Query: 607 LCVQNPCQELDGICHKNNWILPVYGVSSLDGGFQANVFLKGMDFEYSSCGEPCSDPREAR 666
++ CQ LD IC NNW+LP Y V+ DGG++A V + G + GE SD EAR
Sbjct: 617 PSTKSLCQRLDDICLMNNWVLPNYRVAPSDGGYEAEVRITGNHVACTIHGEEKSDAEEAR 618
Query: 667 ESAATKMLGQL 673
ESAA +L +L
Sbjct: 677 ESAAACLLTKL 618
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_008445716.1 | 0.0e+00 | 89.77 | PREDICTED: uncharacterized protein LOC103488666 isoform X1 [Cucumis melo] >XP_00...
XP_038884896.1 | 0.0e+00 | 89.71 | uncharacterized protein LOC120075512 isoform X2 [Benincasa hispida]
XP_011656540.1 | 0.0e+00 | 88.71 | uncharacterized protein LOC101206764 isoform X1 [Cucumis sativus] >XP_011656541....
XP_038884894.1 | 0.0e+00 | 88.66 | uncharacterized protein LOC120075512 isoform X1 [Benincasa hispida] >XP_03888489...
XP_022150346.1 | 0.0e+00 | 83.68 | uncharacterized protein LOC111018541 isoform X1 [Momordica charantia] >XP_022150...

Match Name | E-value | Identity | Description
A0A1S3BE29 | 0.0e+00 | 89.77 | uncharacterized protein LOC103488666 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1DAH9 | 0.0e+00 | 83.68 | uncharacterized protein LOC111018541 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1KZE5 | 0.0e+00 | 82.97 | uncharacterized protein LOC111497732 OS=Cucurbita maxima OX=3661 GN=LOC111497732...
A0A6J1HAN9 | 0.0e+00 | 83.11 | uncharacterized protein LOC111461089 OS=Cucurbita moschata OX=3662 GN=LOC1114610...
A0A6J1EQ29 | 0.0e+00 | 81.82 | uncharacterized protein LOC111436360 isoform X2 OS=Cucurbita moschata OX=3662 GN...

Match Name | E-value | Identity | Description
AT1G05950.1 | 2.4e-93 | 36.07 | unknown protein; Has 50 Blast hits to 45 proteins in 14 species: Archae - 5; Bac...
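
The identity and positive-match percentages reported in the HSP header lines are ratios over the alignment length, for example 569/680 = 83.68%. The following is a minimal sketch of how such figures could be recomputed from one aligned block. The function names `hsp_stats` and `percent` are hypothetical, and the positives count is taken from the `+` marks in the midline rather than from a substitution matrix. The sample strings are the final block of the alignment that ends with `Sbjct: 661 LCSDPREARESAAMKMLGQLWRMASVGK 687`, so the counts cover that block only, not the whole HSP.

```python
def hsp_stats(query: str, midline: str, sbjct: str) -> dict:
    """Count identities and positives for one aligned BLAST block.

    query and sbjct are the aligned residue strings (gaps as '-').
    midline is the BLAST match line; only its '+' marks are used here,
    so stripped leading or trailing spaces do not affect the result.
    """
    if len(query) != len(sbjct):
        raise ValueError("aligned query and subject must have equal length")
    # An identity is a column where both rows carry the same residue.
    identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    # BLAST counts identities plus conservative ('+') substitutions as positives.
    positives = identities + midline.count("+")
    return {"length": len(query), "identities": identities, "positives": positives}


def percent(count: int, total: int) -> float:
    """Percentage rounded to two decimals, as shown in the HSP headers."""
    return round(100 * count / total, 2)


if __name__ == "__main__":
    stats = hsp_stats(
        "PCSDPREARESAATKMLGQLWRMANLAK",
        "CSDPREARESAA KMLGQLWRMA++ K",
        "LCSDPREARESAAMKMLGQLWRMASVGK",
    )
    print(stats)
    print(percent(569, 680), percent(623, 680))
```

Summing the per-block counts over every block of an HSP and dividing by the total alignment length would give the header values, assuming the aligned rows are complete.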