Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAATCAAAAGCTGCAATATCAATAATAGATACTACTGCATTTGGCTTGATGATTCTGGATTTTCGGTAGAAGATTTGGAAAGAGGTCAGAAGATTACGCTTTCTTCTCAAATGATGGAGTGGTTTGTAGAAAACCTTAGCAACATGATCAAGGACCAAGTTCAAGCCTTCTTTTCAGATAAATCCAGAAATGATCGTGGTATTTCTCGGCTGGCCAAGTTTCGCTCTAAAGAAGAATGGTTTGTAGAATATGCCTTTTGGCCTTCCTCGGGTGGTAGAAAAAATATCCATATTCCAGCTGGAAGAAATAAGCAAGGTTGGCTCTCTTTCTACTCAATGCTCAAGGAATTCAAGACCTACACAGATAGTAATGAAGTTCTTACTGGGGATCAAAGAATCAGAGCTTTTTCAGGAATGGCTACAGCTGAAGTGCTATCAGAATCTGAAAAAATCAGCCAAGATAGCTTTGAAAGCTTGAAGGATAATCAGGTACATTTCACATCCTCTTCATTTTGGGTAAAAAAGGAGAAAGATATGCTGAATATGGATTTTCACTCCCTTCTCGTGGTGACGAAACTAATGGAATGTTACTCTTGGAATGATGTTAAAATCACCCTCGAAGAATTTTTCCAAGCCTCCATTTCAATTAACCCTTATCTAGCTGATAAGGCCTTGATCAAGTTCAACAAACAGATAGACTCAGATTTCACAAATGGAGAATGGTTTGAATATGGAAACTTCCATCTCAAGATGGAGCATTGGTCCAAAATGAACCACAACCTGCCGGAAGTGATTAATTGTTATGGGGGTTGGATTTCCATCAAGAATCTACCTTTGCCCTTTTGGAAAAAGTCAGTGTTTGAAGCTATTGGGTATTTTTTGGGTGGTTTGATAGAGATTTCTTCCCAAACGCTAAATTTTTTAGATTGTTGTTCAGCAATTATTAAAGTTCAAAGTAATTTATGTGGTTTCATCCCAGCATCCCTACCTATTGAAGATTCAAGTTTGGGGAAACTTTTGATCCATTTCGAAGGCATGGACTCTTTGGCTGATCAGATTAAAGTTCTACCCAAAAAAATTGCTTTTTTGCATCTGACTTTTCCAATTCTTTGGATATTATTAGAATTAAAGCTGCCATGACAGATGAGGGCTTCTCAGTAGACATGTTTGAACAAAAAGATAACATTGTAAAGGTAGTTGAAACATTACAACCAGTCGGAGTTGAGACAAAATCAAGAGATATTACATTGGAAAATGGATCTAGTGGCTTGAATATTGAAGTAACAGCACAAACCTCGGAGAATGAAAAGGCAGAACCAAGTACAACTGGTGCATTTAATCAAATCCTTCCTCTCTCCTTCGAATTAATGAGGAACGAAACACAAGGCACATTGACAATAGGAAAGCAGACGATTCATACATCCTCTTTATTCGAGATTACAGCATTCAAGAGGGAAATTCAAAAGAAAAGGCAGATCGGCCAGAAGGAGAAGATGAATCAAAATGTGAACGGGCTTATTTTCACTCCCAACCAACTCCATCTCTCTCTCCAAATTCACATTTCCCAAGCGCCCATCTGGAAGTGAAAGAGTCTGACCCGGATTTTCAAAAGGATTTAAATGAGATAGCAAGTCCTCGAAGCAACATTGATACACCCACATTTTCTAGAAAGCAGAACTTCAAATCCTCAGTTGGTTCATTATTAAATCATTTGACAAATCCCGATTCCTTGGAGGATATTTGTGTTCATGCTATGGTTCCACAGAGTAAAAGTCCACCAAGAATAAGAAATTCTGCTGCTGTAGTTGTTCGTCCCAATTTCACCATCCCTGGTTCACAAGTTTCTTTTGTCCAAGGCACCTTCTCTCAAGACTACAAGAAACAAGCAACCCATGATTCAGAGGATGAGTCCATTGATGAGTCAAATGTCAGTGTAAGTAGTGAAGAGTTTGATCAAGACTTTATAGAAGCCACAACTGAGGGGGAAATCTTGCCGGATCAAATGGGGGAAGACTTTAGAACTCTCTTCCTACAACAACAGTACACTCCGGTCAGAGATTCCTACTTATCTCCTTCTCAAATTCCCTCTCAGTTTTCTTCTTTAGTGGCTGCATGCGGTTTTCAAATGCTGGAAATTGCATCTCCAAGGGAGGTGCTTCAAACATGATCATTTCTTGGAACATCATGGGCCTTAATGATAAATCCAAATAGGCAGCCTTGAAACATTTTATACAAAGTCAACTCTCGGATCTAGAAATCATTCAAGAAACCAATTGTCAGGAGTTTGACAAACAGTTCATTAAGGCAATATGGAGTTCAAATGGAATCAGTTGGATGTCAGTGGAAGATTATGAAAAATAAAGAGGCCTATCTGAAATTCCTCAAGGGTGGTTATTCACTTTTAATCAAAATATGACCATTTGCAAAGATACTTGTTGGGTGACTAATGTTTTTCAAAATGTGCTGGAATTATTTTTATTTCTACATTCTTCAAACGGGCGAAAACAGTGGGAACACTTTCTTTTCTAGGATAATGAACGTGCTTGTGCTGCACTTCGATGGAGCAACCTATGGAGGGCATCCTCTGGAATATTACGGTGGAAGCTTTCCTCTATTTGTTTTGGGCTTGATCGAGTTAAAATCTTATGGAATATGGGTTTCTTGAATTATGTGTTTAATTGGTTTGTTTTCCTTTTTCTCCAGAATTTTAGTTTTTATGATGTAATTTGGTTCTAGACTTGTATTTGCATGTTTGATGTTTTAACTTTGTATCCTCTCATGTACTTTGAGCATTAGACTCTTTCATTATATCAATGAATAGTCTTGTTTCTGTTTCAAAAAGAAATAAGTTTCCCGACAAGTATTATAGGTAATTTAAGGCTGAATTTTTATCCTCAAGGTAGTTCTTTCCAAGAAAGGATATGTAGGGAGTGGGTTGGTTGGGGGCTCTGCTGCTGCGTGTTGGGCCTCATTGAAATCAATTTGAGCCCTTTCTGGACCTACTGATTGACTAATGGAGCAAAACCGACCAACGGACGTGATCGAAAAAATACATAATCCAACCGAACAAAGTTAGTCTAGCTGGTTTTAACTAAGAAGCACATACACTTTAGTTTGGCTAACATGTCCGTGTCTAACACGTGACGGACACTTGGACACTGGACACTCAAGCACTTGTTGGGACACTTGTTAGTGCAACAAATGTGTGAAGACATGGATAGAACACTTGTTGAGTAGACTAAAAAGACATATGACAATAATAGTAACTTTTGAGCGTGAAATACATCAAGCTAAGTTTTTTAAGCATATAAATGCACCCAACCCATTGACTTTGAATTTTCTTTTGGTATAAAAATGATATATATTTTAAAAATGTACATTTTAATAAGTGTGTCCATGTAGTGTTGTGTCGTAAATTTTTAAAATATGGCATGTTGTCGTGTCCGTGTCGTGTTGTATCCGTGTCTATATCTGTATCTGTGCTTCTTAGGGTTTTAGTAAGTTGAATCGGTTTGGGAGAATATTTTGATTACCCAAACCGCTAGGTGCTTATTGGGGCAAATGATTGAATTTCATCATTGCTTTGTTGATTTGGCATGTTGATATTCAACTTGATTAGTTTCTGAATGTTATAGCTTTTAAGTTCAACTTGATTAGTTTATGGCTTGCGGGGGTGTGTGCGCGATGTTATGGACTTTGTGGGGTGCGTAGAATAATAGGACTTTTAGAGGGAGGGATAGGGACCCTAGTAAGGTTTGCTCTCTTGTCGTGTTCGTTATCATGTTTCTTTGGGAGCTTCGATTTTGAAGCCCTTTTGTAACTATTCTAAAAGCACTATTTTGCATAGCTGGAGGCCCTTCCTGGAGAGGGTTGTCTCTTTTTATGGGTTTGGTTTTTTTTATGCTCGTGTATTCTTTCATTTATTCTCAATGAAAGTTGTTTCTATTAAAAATAAAAAGTTCACAATTGTTTAGTAGGAGACTGATAGCATTTGACTTGTTATTTATATACTTCTTTTTCTCTATTTCAATTTCTTATAACTTAATACTGTACTTGCAGAATTCCAGCTTGAATTTTTGATGCAGTCACTATTCTTTTCTCGGGTAATGTTTTACACAGTTGCTGGTTTTAATTGAAATTGTACCAAAGAATTGTTTTTTAAATCTCTTGGTTTGGTGGGGCTAGAACGGACGCGTGTGGGACATGGATATGTTTGAGCATGTTTCATACTTGTGTTTGACTAGTGACAAAAATTCTTATGTTAATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAAAATTTTAGACACACTGAACATGGGACATTTCACACTGAGACATGTCAACCAAAAAAACACGGATGAGAAAATAATATAAACAACACTTAACCCAGCCTATTATCAGGCCCATTTTTTAACCCATTTGTCAATAGAAGTTTCCTCTTTAAAAAAAGGTAATGTTAGTTATTTGTCATTTTGTTGTATTATTTTCCTACTTTGTATCCTGGTCTAGTGGCTTTAACCAATTACAGATTATTGGAAAACTTCAATCCTAGGTCTTGTTGTGAATTATCAACTTTCTTCTTTAAACGCCACCCCAATCATACCTCAATATGGATGCTCTTTTATTGCAGGCTAGTGTTAGCGATGGACAAAATGACAATAACTTGACGAGAGTCATGATTGGTGGACTTTTTTTGAGGTAGCTCGAGTTTTCATCTCTATTGATCATCCTTCTTATTTTTACTTTTATTTTTTGGATAAAGAATTGAACTTTTATTGAGAAAGAACAAAAGAATGACTGCATAAAAAAACCCAAAAACCCCAAAAAAGGTCCCACGGAAGGAAAGAACTCCAGTTAAGTACAATAAGACCTAACTTATAGTTACAAAAAAATTGCATCACTGACGCGCAAAGAGAAGCATGGAACTTAGTGAGGGACCAAACATCACAAAGATCGCTCTCTACACCTCAAAAGGTTATTTGTTTCTTTCTTCTCAAAGATACGACAAAATAGCACAAATCCTAACTTGCCAAAGTAACCTCCCTTCATCACGAAAAGGAAGGTGATAGAGGAACCTTCTCAATCATCTCATCATAATCTCCGTGTCTGGCAACCTGAACTCCGAACATAGCAAAAAAGTAGGTAAAATATATTGAGCAAACTTGCCACTTCAAAGGATGTGATTCAAGTTTTTTTTCCACTTTCCGACAAAGATATAACAAATTGACCCTATCAAGGATGGAAACTTTCTCGGAGCCTATCAAGAGTATTAACTACTCCTTGCCAGATCAAGAACTTAACCTTCTTTGGAATTTTGGACCCCAAAGACGGAAGTTCGTTTTCCACAGTCTAAACTTAGCCTCCCCAATCAAAGATAAGAGAGTCATGTCATCTGTTGTTTTCCTATTAGACAAATGACGACAGGACCTAAGCGATAAACTGTGAGAGGTCCTTAAACGAACTATGGTATCAACCATCAAACGTTTTTTTCATAGAACACAGGTGGTATAAACAAGGAAATGTATGGCGGTGGAGTTTATCCTCCATCCACTTGTTCTTCCAACAATATGTTTCCTTCCATCAACCACAAAACAATGAATCGGGTGAGAGAAAGAAGGGAGCTCATTGGATAATTTTTTCCAAAGGTTTTTGTGAGTGTTTATGACCTCACCTGACATCCACTTAAAAGGATGAGGACTGTAGTTGCTTGCTGTAATCCTCTACCACAGGGTATTAGACCCAAGGGGAAAACGCCACAACTATTTAGCCAATGGGCTTTGTTTTGAGCTTTTAAATTCCCTAGTTCTAAATCCCCCAGGTCCATTGGCTTTCCCACTACCTCCTAATTCACCAAATGGGAGCCTTTCCCCTCAACAATCCATTCCCACAAAAAACCCTAATCAACTTCGCAAGATTCTTACATAGCAAATGGAACCCTAAAAAGGGAAAGAAGTAACTGGGGGTTCCACTCAAAACCGGATGGATGAGGTACCTTCTGCCTCTGGAGAAGAAACTTACTCTTCCAAGATGCAATAATCCTTTTATGCATCATATCCACCACGGGTCCCAAAACGGTAGGTTCCTCAGATTATCACCCGGGGGAAGACCAAGATCAGAGGAGGAAAGCGTACCAAAAATTTTATTTTCTTGACAACATTTTATTGAATACAATTGTAGCTAGTTGTATGTATTAATTTGTCTAAATGGTGTTGTGAATTAATAGGATGTGCTATTTGACCTAAATTTGACATTGTATGCATGCATGCAGTTGATTCTTCTTTTTGAATGTGTACTATTTTAGGGATACTTTTTCTCGCCCTCCATGCACATTAGTTCAACCAGCAATGCAGGCTGTTACAGATGACTTTTTACATGTTCCAGAATTTGGTAATGTCTGACTGACCCTTTGTATAATTTCAAGAAATCATTCTTGGTTCTGTAGCCTAATAATATGCATTTAAGAATTGAGATTTGAAATGTATCTCGTCTAATGCTGATGATAACTTCTTTGGTGGACAGCTAAGAACTTTTGCCCACCAATATATCCTTTCAAGGACAAGCAGTGGGGATTAAGTGGAAGTGTTCCTTTACTGTGCCTCCACTCTGTGCAGGTCAAACCTTCTCCAGTCCCCCCATCTTTTGCCACCCAGACAGTCATCCACTGCCAACCTCTCACAGTATGTACACTATTCTTAAAATGACCATTCGAGTTTACTTAAGCATATTTGAAACCATCATTTCTCTAAACTCCTTTTCTATTTTTGTACTTCTTTTCCAGATTCATCTTCAGGAAAAATCATGTTTGAGGATATCATCTTTCCTAGCTGATGGAATAGTTGTGAATCCTGGTTCTGTTTTACCAGATTTCTCCATTAGTTCTATAATACTTTCTCTCAAGGAGTTAGATGTTACTGTCCCACTAGACGTGGCCAAATCTACTGATTATCATAGCAGCTGGGATGGCATCTCTCAAAGCACTTTTGATGGAGCTCGGCTTCATATTAAGAACATGCAATTTTCTGAATCACCTTCTCTGAATCTTAGACTATTGAATTTGGATAAAGATCCTGCTTGCTTTCTTCTCTGGGAAGGTCAACCAGTTGATGCTAGCCAGAAGAAATGGGCCACTAGCGTGTCTCAGATTAGTTTATCGTTAGAAACATACAATAAAGTGTCTGGATCTAAGAGTTCTGATGCTATTTTAGCCTTGTTGAGATGTGTGGAGCTGACAGAGGTTTCCATTGAAGTAGCTATGGCCACTGCAGATGGAAAAACATTAACGGCAGTTCCTCCTCCTGGGGGTGTTGTAAGAGTTGGGGTTTCCTGTCAACAGTATCTATCAAACACCTCAGTTGATCAACTATTTTTCGTTCTAGATCTTTATGCTTACTTTGGTAGAGTTACTGAAAAGATAGCCCTTGTTGGAAAGAAAAATCAACCAAAAGAAAGTAGAAGTAACTTGTTGGTTGGAAAGCTTGTGGATAAGGTACCAAGTGATACTGCTGTTAGTTTATTGCTCAGGAACCTTCAACTTAGATTTCTGGAGTCTTCTTCCTCAATTGTTGAGGAACTGCCTCTTGTTCAATTTGTTGGCAACAATATGTTCATCAAAGTTTCTCATAGAACACTTGGTGGTGCTGTTGCTATTTCATCCACAGTACGATGGGATAATGTTGAAGTGGATTGTGTAGACACCGACGGAAATATTGCATATGACAATGGAACCGTGTCAACTTCAATTGAAAATGGTTCTCTTATGAATGGGAATGGATTATCTCAACTAAGAGCAATCCTTTGGGTACATAACAAAGGGGATAGATTTACAACCCCGTTTCTTGATGTTAGCATAGTGCATGTGATTCCCTTAAATGAGCGGGACATGGAGTGTCACAGTTTGAATGTGTCAGCTTGTATTGCTGGGGTGCGCCTAAGTGGAGGAATGAACTATGCTGAAGCCTTGCTGCATCGATTTGGAATTCTTGGTCCTGATGGGGGCCCAGGAAAGGGTCTGATGAAAGGTTTGGAGAATCTACGGGCTGGGCCACTCGTGAAACTTTTCAAAACTTCACCTCTTCTTGCTGGCAGTTTGGAAGGTACAGGAACTTAAAATATGTTGGCTGCTTAAGGATTACATTGTTCATGCACACTAATTTGTAATCTTAGACTTTAGGGATGCATGGTATCGTCTGTTCTCGTCACAGACGTTTCCTTTGTTATGTGTCCTTTAAATTCTTTAGAATTTGTCTTTACAATCTTCAAGTTTCGATTTTCTTTGGAATTTCAGGAGACGGGAAAGAAAGTCCTCTATTGCAATTAGGAAAGCCAGATGATGTGGATGTTTCCATAGAACTTAAAAATTGGTTATTTGCACTTGAAGGTGCAGAGGAAATGGCAGAAAGATGGTGGTTTTATAATCCTAATTACGCAGGCAGAGAAGAGAGGTGTTGGCACACTTCTTTCCAGAGCTTCCGAGTAAAAGCGCAGAGTAGACCGAAGGATCTACTTAATGGCAAAGGAAGCTCATGTGGAACTCAACAGTATCCCGTGGAGTTGGTGATAGTAAGCACTCCCCAACTCGAAAAATTTAATTGACATAAACTAATTAACCATCAGCAACTTGCCCCTTTCCCGGTGCTTCTGATTTTGTTGAACCATTCTCTTTCTGTCTTTCTCCATGCTTGTGGTTTAAAATGCATGAGCTTGAAATATTTGTGTCAAATTCAGGTCAGCGTAGAAGGCCTGCAAACATTGAAGCCTCAGGTTCAAAAGAACACTCATCATAATGTCTCTCTCCTCAATGGGGCGAATGAAACAGTCGAGCCACTTGGAGGGATAAATCTTGAAGCTCGCATGGTGGTGTCTGAGGATGATGTTGATGTTGAGATGGCCAACTGGATTATGGAAAACTTGAAGTTCTCTGTCAAGCATCCGGTATTTGACAATCAATTTTCCTTGTGTGACATCATCTGTTTGTTTAGGAACCGTTTTCACACATTATTATTTCATGATAGATTGAGGCGGTTGTTACAAAGAATGAACTGCAACATCTTGCCTTACTCTTCAAGTCAGAAGTTGATTCGATGGGTCGAATTGCTGCTGGAATTCTTAGGCTTCTAAAGCTGGAGGGTTCTATTGGTCAAGCCACCTTGGATCAGCTAAGCAACCTTGGTATGTCAGTTTGCTTACTCATTCTAGGTTCTTTTGAGGTCTCTTAGACAGCAACAAATATTTACTGGTTGATGCTTAATTCCAGGTAGTGAGAGCATTGACAAAATCTTCACTCCAGAAAAGCTTAGCAGGGGAAGTAGTGCGGCCAGTTTGGGAATCTCTCCGTCAGCATATTTGATTGGGGAAAGCCCTCATCGTCCAACTGTAGAATCTACAGTGACTTCTCTGGAGCAGGCTGTTCTTGATTCCCAATCTAAATGCACTTCTCTCATGACTGGACTTAGTAGTTCAGATTCTTCATTACATGTTGCAACGATTAAACAACTCTACGAGAAACTTGATAGCATGCAGACATTACTGTCAAGGTTGCGGAATCAAATCTGA
mRNA sequence
ATGGAAATCAAAAGCTGCAATATCAATAATAGATACTACTGCATTTGGCTTGATGATTCTGGATTTTCGGTAGAAGATTTGGAAAGAGGTCAGAAGATTACGCTTTCTTCTCAAATGATGGAGTGGTTTGTAGAAAACCTTAGCAACATGATCAAGGACCAAGTTCAAGCCTTCTTTTCAGATAAATCCAGAAATGATCGTGGTATTTCTCGGCTGGCCAAGTTTCGCTCTAAAGAAGAATGGTTTGTAGAATATGCCTTTTGGCCTTCCTCGGGTGGTAGAAAAAATATCCATATTCCAGCTGGAAGAAATAAGCAAGGTTGGCTCTCTTTCTACTCAATGCTCAAGGAATTCAAGACCTACACAGATAGTAATGAAGTTCTTACTGGGGATCAAAGAATCAGAGCTTTTTCAGGAATGGCTACAGCTGAAGTGCTATCAGAATCTGAAAAAATCAGCCAAGATAGCTTTGAAAGCTTGAAGGATAATCAGGTACATTTCACATCCTCTTCATTTTGGGTAAAAAAGGAGAAAGATATGCTGAATATGGATTTTCACTCCCTTCTCGTGGTGACGAAACTAATGGAATGTTACTCTTGGAATGATGTTAAAATCACCCTCGAAGAATTTTTCCAAGCCTCCATTTCAATTAACCCTTATCTAGCTGATAAGGCCTTGATCAAGTTCAACAAACAGATAGACTCAGATTTCACAAATGGAGAATGGTTTGAATATGGAAACTTCCATCTCAAGATGGAGCATTGGTCCAAAATGAACCACAACCTGCCGGAAGTGATTAATTGTTATGGGGGTTGGATTTCCATCAAGAATCTACCTTTGCCCTTTTGGAAAAAGTCAGTGTTTGAAGCTATTGGAATTAAAGCTGCCATGACAGATGAGGGCTTCTCAGTAGACATGTTTGAACAAAAAGATAACATTGTAAAGGTAGTTGAAACATTACAACCAGTCGGAGTTGAGACAAAATCAAGAGATATTACATTGGAAAATGGATCTAGTGGCTTGAATATTGAAGTAACAGCACAAACCTCGGAGAATGAAAAGGCAGAACCAAGTACAACTGATTACAGCATTCAAGAGGGAAATTCAAAAGAAAAGGCAGATCGGCCAGAAGGAGAAGATGAATCAAAATGTGAACGGGCTTATTTTCACTCCCAACCAACTCCATCTCTCTCTCCAAATTCACATTTCCCAAGCGCCCATCTGGAAGTGAAAGAGTCTGACCCGGATTTTCAAAAGGATTTAAATGAGATAGCAAGTCCTCGAAGCAACATTGATACACCCACATTTTCTAGAAAGCAGAACTTCAAATCCTCAGTTGGTTCATTATTAAATCATTTGACAAATCCCGATTCCTTGGAGGATATTTGTGTTCATGCTATGGTTCCACAGAGTAAAAGTCCACCAAGAATAAGAAATTCTGCTGCTGTAGTTGTTCGTCCCAATTTCACCATCCCTGGTTCACAAGTTTCTTTTGTCCAAGGCACCTTCTCTCAAGACTACAAGAAACAAGCAACCCATGATTCAGAGGATGAGTCCATTGATGAGTCAAATGTCAGTGTAAGTAGTGAAGAGTTTGATCAAGACTTTATAGAAGCCACAACTGAGGGGGAAATCTTGCCGGATCAAATGGGGGAAGACTTTAGAACTCTCTTCCTACAACAACAGTACACTCCGGTCAGAGATTCCTACTTATCTCCTTCTCAAATTCCCTCTCAGTTTTCTTCTTTAGTGGCTGCATGCGGTTTTCAAATGCTGGAAATTGCATCTCCAAGGGAGGCTAGTGTTAGCGATGGACAAAATGACAATAACTTGACGAGAGTCATGATTGGTGGACTTTTTTTGAGGGATACTTTTTCTCGCCCTCCATGCACATTAGTTCAACCAGCAATGCAGGCTGTTACAGATGACTTTTTACATGTTCCAGAATTTGCTAAGAACTTTTGCCCACCAATATATCCTTTCAAGGACAAGCAGTGGGGATTAAGTGGAAGTGTTCCTTTACTGTGCCTCCACTCTGTGCAGGTCAAACCTTCTCCAGTCCCCCCATCTTTTGCCACCCAGACAGTCATCCACTGCCAACCTCTCACAATTCATCTTCAGGAAAAATCATGTTTGAGGATATCATCTTTCCTAGCTGATGGAATAGTTGTGAATCCTGGTTCTGTTTTACCAGATTTCTCCATTAGTTCTATAATACTTTCTCTCAAGGAGTTAGATGTTACTGTCCCACTAGACGTGGCCAAATCTACTGATTATCATAGCAGCTGGGATGGCATCTCTCAAAGCACTTTTGATGGAGCTCGGCTTCATATTAAGAACATGCAATTTTCTGAATCACCTTCTCTGAATCTTAGACTATTGAATTTGGATAAAGATCCTGCTTGCTTTCTTCTCTGGGAAGGTCAACCAGTTGATGCTAGCCAGAAGAAATGGGCCACTAGCGTGTCTCAGATTAGTTTATCGTTAGAAACATACAATAAAGTGTCTGGATCTAAGAGTTCTGATGCTATTTTAGCCTTGTTGAGATGTGTGGAGCTGACAGAGGTTTCCATTGAAGTAGCTATGGCCACTGCAGATGGAAAAACATTAACGGCAGTTCCTCCTCCTGGGGGTGTTGTAAGAGTTGGGGTTTCCTGTCAACAGTATCTATCAAACACCTCAGTTGATCAACTATTTTTCGTTCTAGATCTTTATGCTTACTTTGGTAGAGTTACTGAAAAGATAGCCCTTGTTGGAAAGAAAAATCAACCAAAAGAAAGTAGAAGTAACTTGTTGGTTGGAAAGCTTGTGGATAAGGTACCAAGTGATACTGCTGTTAGTTTATTGCTCAGGAACCTTCAACTTAGATTTCTGGAGTCTTCTTCCTCAATTGTTGAGGAACTGCCTCTTGTTCAATTTGTTGGCAACAATATGTTCATCAAAGTTTCTCATAGAACACTTGGTGGTGCTGTTGCTATTTCATCCACAGTACGATGGGATAATGTTGAAGTGGATTGTGTAGACACCGACGGAAATATTGCATATGACAATGGAACCGTGTCAACTTCAATTGAAAATGGTTCTCTTATGAATGGGAATGGATTATCTCAACTAAGAGCAATCCTTTGGGTACATAACAAAGGGGATAGATTTACAACCCCGTTTCTTGATGTTAGCATAGTGCATGTGATTCCCTTAAATGAGCGGGACATGGAGTGTCACAGTTTGAATGTGTCAGCTTGTATTGCTGGGGTGCGCCTAAGTGGAGGAATGAACTATGCTGAAGCCTTGCTGCATCGATTTGGAATTCTTGGTCCTGATGGGGGCCCAGGAAAGGGTCTGATGAAAGGTTTGGAGAATCTACGGGCTGGGCCACTCGTGAAACTTTTCAAAACTTCACCTCTTCTTGCTGGCAGTTTGGAAGGAGACGGGAAAGAAAGTCCTCTATTGCAATTAGGAAAGCCAGATGATGTGGATGTTTCCATAGAACTTAAAAATTGGTTATTTGCACTTGAAGGTGCAGAGGAAATGGCAGAAAGATGGTGGTTTTATAATCCTAATTACGCAGGCAGAGAAGAGAGGTGTTGGCACACTTCTTTCCAGAGCTTCCGAGTAAAAGCGCAGAGTAGACCGAAGGATCTACTTAATGGCAAAGGAAGCTCATGTGGAACTCAACAGTATCCCGTGGAGTTGGTGATAGTCAGCGTAGAAGGCCTGCAAACATTGAAGCCTCAGGTTCAAAAGAACACTCATCATAATGTCTCTCTCCTCAATGGGGCGAATGAAACAGTCGAGCCACTTGGAGGGATAAATCTTGAAGCTCGCATGGTGGTGTCTGAGGATGATGTTGATGTTGAGATGGCCAACTGGATTATGGAAAACTTGAAGTTCTCTGTCAAGCATCCGATTGAGGCGGTTGTTACAAAGAATGAACTGCAACATCTTGCCTTACTCTTCAAGTCAGAAGTTGATTCGATGGGTCGAATTGCTGCTGGAATTCTTAGGCTTCTAAAGCTGGAGGGTTCTATTGGTCAAGCCACCTTGGATCAGCTAAGCAACCTTGGTAGTGAGAGCATTGACAAAATCTTCACTCCAGAAAAGCTTAGCAGGGGAAGTAGTGCGGCCAGTTTGGGAATCTCTCCGTCAGCATATTTGATTGGGGAAAGCCCTCATCGTCCAACTGTAGAATCTACAGTGACTTCTCTGGAGCAGGCTGTTCTTGATTCCCAATCTAAATGCACTTCTCTCATGACTGGACTTAGTAGTTCAGATTCTTCATTACATGTTGCAACGATTAAACAACTCTACGAGAAACTTGATAGCATGCAGACATTACTGTCAAGGTTGCGGAATCAAATCTGA
Coding sequence (CDS)
ATGGAAATCAAAAGCTGCAATATCAATAATAGATACTACTGCATTTGGCTTGATGATTCTGGATTTTCGGTAGAAGATTTGGAAAGAGGTCAGAAGATTACGCTTTCTTCTCAAATGATGGAGTGGTTTGTAGAAAACCTTAGCAACATGATCAAGGACCAAGTTCAAGCCTTCTTTTCAGATAAATCCAGAAATGATCGTGGTATTTCTCGGCTGGCCAAGTTTCGCTCTAAAGAAGAATGGTTTGTAGAATATGCCTTTTGGCCTTCCTCGGGTGGTAGAAAAAATATCCATATTCCAGCTGGAAGAAATAAGCAAGGTTGGCTCTCTTTCTACTCAATGCTCAAGGAATTCAAGACCTACACAGATAGTAATGAAGTTCTTACTGGGGATCAAAGAATCAGAGCTTTTTCAGGAATGGCTACAGCTGAAGTGCTATCAGAATCTGAAAAAATCAGCCAAGATAGCTTTGAAAGCTTGAAGGATAATCAGGTACATTTCACATCCTCTTCATTTTGGGTAAAAAAGGAGAAAGATATGCTGAATATGGATTTTCACTCCCTTCTCGTGGTGACGAAACTAATGGAATGTTACTCTTGGAATGATGTTAAAATCACCCTCGAAGAATTTTTCCAAGCCTCCATTTCAATTAACCCTTATCTAGCTGATAAGGCCTTGATCAAGTTCAACAAACAGATAGACTCAGATTTCACAAATGGAGAATGGTTTGAATATGGAAACTTCCATCTCAAGATGGAGCATTGGTCCAAAATGAACCACAACCTGCCGGAAGTGATTAATTGTTATGGGGGTTGGATTTCCATCAAGAATCTACCTTTGCCCTTTTGGAAAAAGTCAGTGTTTGAAGCTATTGGAATTAAAGCTGCCATGACAGATGAGGGCTTCTCAGTAGACATGTTTGAACAAAAAGATAACATTGTAAAGGTAGTTGAAACATTACAACCAGTCGGAGTTGAGACAAAATCAAGAGATATTACATTGGAAAATGGATCTAGTGGCTTGAATATTGAAGTAACAGCACAAACCTCGGAGAATGAAAAGGCAGAACCAAGTACAACTGATTACAGCATTCAAGAGGGAAATTCAAAAGAAAAGGCAGATCGGCCAGAAGGAGAAGATGAATCAAAATGTGAACGGGCTTATTTTCACTCCCAACCAACTCCATCTCTCTCTCCAAATTCACATTTCCCAAGCGCCCATCTGGAAGTGAAAGAGTCTGACCCGGATTTTCAAAAGGATTTAAATGAGATAGCAAGTCCTCGAAGCAACATTGATACACCCACATTTTCTAGAAAGCAGAACTTCAAATCCTCAGTTGGTTCATTATTAAATCATTTGACAAATCCCGATTCCTTGGAGGATATTTGTGTTCATGCTATGGTTCCACAGAGTAAAAGTCCACCAAGAATAAGAAATTCTGCTGCTGTAGTTGTTCGTCCCAATTTCACCATCCCTGGTTCACAAGTTTCTTTTGTCCAAGGCACCTTCTCTCAAGACTACAAGAAACAAGCAACCCATGATTCAGAGGATGAGTCCATTGATGAGTCAAATGTCAGTGTAAGTAGTGAAGAGTTTGATCAAGACTTTATAGAAGCCACAACTGAGGGGGAAATCTTGCCGGATCAAATGGGGGAAGACTTTAGAACTCTCTTCCTACAACAACAGTACACTCCGGTCAGAGATTCCTACTTATCTCCTTCTCAAATTCCCTCTCAGTTTTCTTCTTTAGTGGCTGCATGCGGTTTTCAAATGCTGGAAATTGCATCTCCAAGGGAGGCTAGTGTTAGCGATGGACAAAATGACAATAACTTGACGAGAGTCATGATTGGTGGACTTTTTTTGAGGGATACTTTTTCTCGCCCTCCATGCACATTAGTTCAACCAGCAATGCAGGCTGTTACAGATGACTTTTTACATGTTCCAGAATTTGCTAAGAACTTTTGCCCACCAATATATCCTTTCAAGGACAAGCAGTGGGGATTAAGTGGAAGTGTTCCTTTACTGTGCCTCCACTCTGTGCAGGTCAAACCTTCTCCAGTCCCCCCATCTTTTGCCACCCAGACAGTCATCCACTGCCAACCTCTCACAATTCATCTTCAGGAAAAATCATGTTTGAGGATATCATCTTTCCTAGCTGATGGAATAGTTGTGAATCCTGGTTCTGTTTTACCAGATTTCTCCATTAGTTCTATAATACTTTCTCTCAAGGAGTTAGATGTTACTGTCCCACTAGACGTGGCCAAATCTACTGATTATCATAGCAGCTGGGATGGCATCTCTCAAAGCACTTTTGATGGAGCTCGGCTTCATATTAAGAACATGCAATTTTCTGAATCACCTTCTCTGAATCTTAGACTATTGAATTTGGATAAAGATCCTGCTTGCTTTCTTCTCTGGGAAGGTCAACCAGTTGATGCTAGCCAGAAGAAATGGGCCACTAGCGTGTCTCAGATTAGTTTATCGTTAGAAACATACAATAAAGTGTCTGGATCTAAGAGTTCTGATGCTATTTTAGCCTTGTTGAGATGTGTGGAGCTGACAGAGGTTTCCATTGAAGTAGCTATGGCCACTGCAGATGGAAAAACATTAACGGCAGTTCCTCCTCCTGGGGGTGTTGTAAGAGTTGGGGTTTCCTGTCAACAGTATCTATCAAACACCTCAGTTGATCAACTATTTTTCGTTCTAGATCTTTATGCTTACTTTGGTAGAGTTACTGAAAAGATAGCCCTTGTTGGAAAGAAAAATCAACCAAAAGAAAGTAGAAGTAACTTGTTGGTTGGAAAGCTTGTGGATAAGGTACCAAGTGATACTGCTGTTAGTTTATTGCTCAGGAACCTTCAACTTAGATTTCTGGAGTCTTCTTCCTCAATTGTTGAGGAACTGCCTCTTGTTCAATTTGTTGGCAACAATATGTTCATCAAAGTTTCTCATAGAACACTTGGTGGTGCTGTTGCTATTTCATCCACAGTACGATGGGATAATGTTGAAGTGGATTGTGTAGACACCGACGGAAATATTGCATATGACAATGGAACCGTGTCAACTTCAATTGAAAATGGTTCTCTTATGAATGGGAATGGATTATCTCAACTAAGAGCAATCCTTTGGGTACATAACAAAGGGGATAGATTTACAACCCCGTTTCTTGATGTTAGCATAGTGCATGTGATTCCCTTAAATGAGCGGGACATGGAGTGTCACAGTTTGAATGTGTCAGCTTGTATTGCTGGGGTGCGCCTAAGTGGAGGAATGAACTATGCTGAAGCCTTGCTGCATCGATTTGGAATTCTTGGTCCTGATGGGGGCCCAGGAAAGGGTCTGATGAAAGGTTTGGAGAATCTACGGGCTGGGCCACTCGTGAAACTTTTCAAAACTTCACCTCTTCTTGCTGGCAGTTTGGAAGGAGACGGGAAAGAAAGTCCTCTATTGCAATTAGGAAAGCCAGATGATGTGGATGTTTCCATAGAACTTAAAAATTGGTTATTTGCACTTGAAGGTGCAGAGGAAATGGCAGAAAGATGGTGGTTTTATAATCCTAATTACGCAGGCAGAGAAGAGAGGTGTTGGCACACTTCTTTCCAGAGCTTCCGAGTAAAAGCGCAGAGTAGACCGAAGGATCTACTTAATGGCAAAGGAAGCTCATGTGGAACTCAACAGTATCCCGTGGAGTTGGTGATAGTCAGCGTAGAAGGCCTGCAAACATTGAAGCCTCAGGTTCAAAAGAACACTCATCATAATGTCTCTCTCCTCAATGGGGCGAATGAAACAGTCGAGCCACTTGGAGGGATAAATCTTGAAGCTCGCATGGTGGTGTCTGAGGATGATGTTGATGTTGAGATGGCCAACTGGATTATGGAAAACTTGAAGTTCTCTGTCAAGCATCCGATTGAGGCGGTTGTTACAAAGAATGAACTGCAACATCTTGCCTTACTCTTCAAGTCAGAAGTTGATTCGATGGGTCGAATTGCTGCTGGAATTCTTAGGCTTCTAAAGCTGGAGGGTTCTATTGGTCAAGCCACCTTGGATCAGCTAAGCAACCTTGGTAGTGAGAGCATTGACAAAATCTTCACTCCAGAAAAGCTTAGCAGGGGAAGTAGTGCGGCCAGTTTGGGAATCTCTCCGTCAGCATATTTGATTGGGGAAAGCCCTCATCGTCCAACTGTAGAATCTACAGTGACTTCTCTGGAGCAGGCTGTTCTTGATTCCCAATCTAAATGCACTTCTCTCATGACTGGACTTAGTAGTTCAGATTCTTCATTACATGTTGCAACGATTAAACAACTCTACGAGAAACTTGATAGCATGCAGACATTACTGTCAAGGTTGCGGAATCAAATCTGA
Protein sequence
MEIKSCNINNRYYCIWLDDSGFSVEDLERGQKITLSSQMMEWFVENLSNMIKDQVQAFFSDKSRNDRGISRLAKFRSKEEWFVEYAFWPSSGGRKNIHIPAGRNKQGWLSFYSMLKEFKTYTDSNEVLTGDQRIRAFSGMATAEVLSESEKISQDSFESLKDNQVHFTSSSFWVKKEKDMLNMDFHSLLVVTKLMECYSWNDVKITLEEFFQASISINPYLADKALIKFNKQIDSDFTNGEWFEYGNFHLKMEHWSKMNHNLPEVINCYGGWISIKNLPLPFWKKSVFEAIGIKAAMTDEGFSVDMFEQKDNIVKVVETLQPVGVETKSRDITLENGSSGLNIEVTAQTSENEKAEPSTTDYSIQEGNSKEKADRPEGEDESKCERAYFHSQPTPSLSPNSHFPSAHLEVKESDPDFQKDLNEIASPRSNIDTPTFSRKQNFKSSVGSLLNHLTNPDSLEDICVHAMVPQSKSPPRIRNSAAVVVRPNFTIPGSQVSFVQGTFSQDYKKQATHDSEDESIDESNVSVSSEEFDQDFIEATTEGEILPDQMGEDFRTLFLQQQYTPVRDSYLSPSQIPSQFSSLVAACGFQMLEIASPREASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFAKNFCPPIYPFKDKQWGLSGSVPLLCLHSVQVKPSPVPPSFATQTVIHCQPLTIHLQEKSCLRISSFLADGIVVNPGSVLPDFSISSIILSLKELDVTVPLDVAKSTDYHSSWDGISQSTFDGARLHIKNMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGSKSSDAILALLRCVELTEVSIEVAMATADGKTLTAVPPPGGVVRVGVSCQQYLSNTSVDQLFFVLDLYAYFGRVTEKIALVGKKNQPKESRSNLLVGKLVDKVPSDTAVSLLLRNLQLRFLESSSSIVEELPLVQFVGNNMFIKVSHRTLGGAVAISSTVRWDNVEVDCVDTDGNIAYDNGTVSTSIENGSLMNGNGLSQLRAILWVHNKGDRFTTPFLDVSIVHVIPLNERDMECHSLNVSACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLLAGSLEGDGKESPLLQLGKPDDVDVSIELKNWLFALEGAEEMAERWWFYNPNYAGREERCWHTSFQSFRVKAQSRPKDLLNGKGSSCGTQQYPVELVIVSVEGLQTLKPQVQKNTHHNVSLLNGANETVEPLGGINLEARMVVSEDDVDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLFKSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSSAASLGISPSAYLIGESPHRPTVESTVTSLEQAVLDSQSKCTSLMTGLSSSDSSLHVATIKQLYEKLDSMQTLLSRLRNQI
Homology
BLAST of HG10011577 vs. NCBI nr
Match:
XP_038904052.1 (uncharacterized protein LOC120090451 isoform X2 [Benincasa hispida])
HSP 1 Score: 1588.5 bits (4112), Expect = 0.0e+00
Identity = 807/853 (94.61%), Postives = 830/853 (97.30%), Query Frame = 0
Query: 601 SVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFAKNFCPPIYP 660
SVS+G+NDNNLT+VMIGGLFLRDTF RPPCTLVQP MQ VTD LHVPEFAKNFCPPIYP
Sbjct: 226 SVSNGKNDNNLTKVMIGGLFLRDTFLRPPCTLVQPTMQTVTDGILHVPEFAKNFCPPIYP 285
Query: 661 FKDKQWGLSGSVPLLCLHSVQVKPSPVPPSFATQTVIHCQPLTIHLQEKSCLRISSFLAD 720
FKDKQWG SGSVPL CLHSVQVKPSPVPPSFAT+TVIHCQPLTIHLQEKSCLRISSFLAD
Sbjct: 286 FKDKQWGFSGSVPLFCLHSVQVKPSPVPPSFATRTVIHCQPLTIHLQEKSCLRISSFLAD 345
Query: 721 GIVVNPGSVLPDFSISSIILSLKELDVTVPLDVAKSTDYHSSWDGISQSTFDGARLHIKN 780
GIVVNPGSVLPDFSISSIILSLKELDVTVPLDVAKS+DYHSSWDGISQS+FDGARLHIKN
Sbjct: 346 GIVVNPGSVLPDFSISSIILSLKELDVTVPLDVAKSSDYHSSWDGISQSSFDGARLHIKN 405
Query: 781 MQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGSKS 840
MQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETY+KVSGSKS
Sbjct: 406 MQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYDKVSGSKS 465
Query: 841 SDAILALLRCVELTEVSIEVAMATADGKTLTAVPPPGGVVRVGVSCQQYLSNTSVDQLFF 900
SDAILALLRCVELT+VSIEVAMATADGKTLT VPPPGGVVR+GVSCQQYLSNTSVDQLFF
Sbjct: 466 SDAILALLRCVELTDVSIEVAMATADGKTLTEVPPPGGVVRIGVSCQQYLSNTSVDQLFF 525
Query: 901 VLDLYAYFGRVTEKIALVGKKNQPKESRSNLLVGKLVDKVPSDTAVSLLLRNLQLRFLES 960
VLDLYAYFGRVTEKIALVGKKNQPKESRSNLLVGKLVDKVPSDTAVSLL+RNLQLRFLES
Sbjct: 526 VLDLYAYFGRVTEKIALVGKKNQPKESRSNLLVGKLVDKVPSDTAVSLLVRNLQLRFLES 585
Query: 961 SSSIVEELPLVQFVGNNMFIKVSHRTLGGAVAISSTVRWDNVEVDCVDTDGNIAYDNGTV 1020
SS+IVEELPLVQF+G++MFIKVSHRTLGGAVAISSTVRWD+VEVDCVDTDGNIAYDNGT+
Sbjct: 586 SSTIVEELPLVQFIGDDMFIKVSHRTLGGAVAISSTVRWDSVEVDCVDTDGNIAYDNGTM 645
Query: 1021 STSIENGSLMNGNGLSQLRAILWVHNKGDRFTTPFLDVSIVHVIPLNERDMECHSLNVSA 1080
STSIENGSLMNGNGLSQLRAILWV NKGDRFT PFLDVSIVHVIPLNERDMECHSLNVSA
Sbjct: 646 STSIENGSLMNGNGLSQLRAILWVRNKGDRFTAPFLDVSIVHVIPLNERDMECHSLNVSA 705
Query: 1081 CIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLLAGSL 1140
CIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGL+KGLENLRAGPL KLFKTSPLLAG L
Sbjct: 706 CIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLVKGLENLRAGPLAKLFKTSPLLAGGL 765
Query: 1141 EGDGKESPLLQLGKPDDVDVSIELKNWLFALEGAEEMAERWWFYNPNYAGREERCWHTSF 1200
EGDGKESPLLQLGKPDDVD+SIELKNWLFALEGA+E+AERWWFYN N AGREERCWHTSF
Sbjct: 766 EGDGKESPLLQLGKPDDVDISIELKNWLFALEGAQEVAERWWFYNTNNAGREERCWHTSF 825
Query: 1201 QSFRVKAQSRPKDLLNGKGSSCGTQQYPVELVIVSVEGLQTLKPQVQKNTHHNVSLLNGA 1260
QSFRVKAQSRPKDL KG+SCGTQQYPVELVIVSVEGLQTLKPQVQKNTHHNV LLNG
Sbjct: 826 QSFRVKAQSRPKDLHVAKGNSCGTQQYPVELVIVSVEGLQTLKPQVQKNTHHNVPLLNGV 885
Query: 1261 NETVEPLGGINLEARMVVSEDDVDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLFK 1320
NETVEPLGGINLEARMVVSEDD+DVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLFK
Sbjct: 886 NETVEPLGGINLEARMVVSEDDIDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLFK 945
Query: 1321 SEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSSAASLGIS 1380
SEVDSMGRIAAGILRLLKLEGSIG ATLDQLSNLGSESIDKIFTPEKLSRGSS ASLGIS
Sbjct: 946 SEVDSMGRIAAGILRLLKLEGSIGHATLDQLSNLGSESIDKIFTPEKLSRGSSLASLGIS 1005
Query: 1381 PSAYLIGESPHRPTVESTVTSLEQAVLDSQSKCTSLMTGLSSSDSSLHVATIKQLYEKLD 1440
PSAYLIGESP RPTVESTVTSLEQAVLDSQSKCTSLMT LSSS+SSLHVATIKQLYEKLD
Sbjct: 1006 PSAYLIGESP-RPTVESTVTSLEQAVLDSQSKCTSLMTELSSSNSSLHVATIKQLYEKLD 1065
Query: 1441 SMQTLLSRLRNQI 1454
SMQTLLSRLRNQI
Sbjct: 1066 SMQTLLSRLRNQI 1077
BLAST of HG10011577 vs. NCBI nr
Match:
XP_038904051.1 (uncharacterized protein LOC120090451 isoform X1 [Benincasa hispida])
HSP 1 Score: 1588.5 bits (4112), Expect = 0.0e+00
Identity = 807/853 (94.61%), Postives = 830/853 (97.30%), Query Frame = 0
Query: 601 SVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFAKNFCPPIYP 660
SVS+G+NDNNLT+VMIGGLFLRDTF RPPCTLVQP MQ VTD LHVPEFAKNFCPPIYP
Sbjct: 352 SVSNGKNDNNLTKVMIGGLFLRDTFLRPPCTLVQPTMQTVTDGILHVPEFAKNFCPPIYP 411
Query: 661 FKDKQWGLSGSVPLLCLHSVQVKPSPVPPSFATQTVIHCQPLTIHLQEKSCLRISSFLAD 720
FKDKQWG SGSVPL CLHSVQVKPSPVPPSFAT+TVIHCQPLTIHLQEKSCLRISSFLAD
Sbjct: 412 FKDKQWGFSGSVPLFCLHSVQVKPSPVPPSFATRTVIHCQPLTIHLQEKSCLRISSFLAD 471
Query: 721 GIVVNPGSVLPDFSISSIILSLKELDVTVPLDVAKSTDYHSSWDGISQSTFDGARLHIKN 780
GIVVNPGSVLPDFSISSIILSLKELDVTVPLDVAKS+DYHSSWDGISQS+FDGARLHIKN
Sbjct: 472 GIVVNPGSVLPDFSISSIILSLKELDVTVPLDVAKSSDYHSSWDGISQSSFDGARLHIKN 531
Query: 781 MQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGSKS 840
MQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETY+KVSGSKS
Sbjct: 532 MQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYDKVSGSKS 591
Query: 841 SDAILALLRCVELTEVSIEVAMATADGKTLTAVPPPGGVVRVGVSCQQYLSNTSVDQLFF 900
SDAILALLRCVELT+VSIEVAMATADGKTLT VPPPGGVVR+GVSCQQYLSNTSVDQLFF
Sbjct: 592 SDAILALLRCVELTDVSIEVAMATADGKTLTEVPPPGGVVRIGVSCQQYLSNTSVDQLFF 651
Query: 901 VLDLYAYFGRVTEKIALVGKKNQPKESRSNLLVGKLVDKVPSDTAVSLLLRNLQLRFLES 960
VLDLYAYFGRVTEKIALVGKKNQPKESRSNLLVGKLVDKVPSDTAVSLL+RNLQLRFLES
Sbjct: 652 VLDLYAYFGRVTEKIALVGKKNQPKESRSNLLVGKLVDKVPSDTAVSLLVRNLQLRFLES 711
Query: 961 SSSIVEELPLVQFVGNNMFIKVSHRTLGGAVAISSTVRWDNVEVDCVDTDGNIAYDNGTV 1020
SS+IVEELPLVQF+G++MFIKVSHRTLGGAVAISSTVRWD+VEVDCVDTDGNIAYDNGT+
Sbjct: 712 SSTIVEELPLVQFIGDDMFIKVSHRTLGGAVAISSTVRWDSVEVDCVDTDGNIAYDNGTM 771
Query: 1021 STSIENGSLMNGNGLSQLRAILWVHNKGDRFTTPFLDVSIVHVIPLNERDMECHSLNVSA 1080
STSIENGSLMNGNGLSQLRAILWV NKGDRFT PFLDVSIVHVIPLNERDMECHSLNVSA
Sbjct: 772 STSIENGSLMNGNGLSQLRAILWVRNKGDRFTAPFLDVSIVHVIPLNERDMECHSLNVSA 831
Query: 1081 CIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLLAGSL 1140
CIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGL+KGLENLRAGPL KLFKTSPLLAG L
Sbjct: 832 CIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLVKGLENLRAGPLAKLFKTSPLLAGGL 891
Query: 1141 EGDGKESPLLQLGKPDDVDVSIELKNWLFALEGAEEMAERWWFYNPNYAGREERCWHTSF 1200
EGDGKESPLLQLGKPDDVD+SIELKNWLFALEGA+E+AERWWFYN N AGREERCWHTSF
Sbjct: 892 EGDGKESPLLQLGKPDDVDISIELKNWLFALEGAQEVAERWWFYNTNNAGREERCWHTSF 951
Query: 1201 QSFRVKAQSRPKDLLNGKGSSCGTQQYPVELVIVSVEGLQTLKPQVQKNTHHNVSLLNGA 1260
QSFRVKAQSRPKDL KG+SCGTQQYPVELVIVSVEGLQTLKPQVQKNTHHNV LLNG
Sbjct: 952 QSFRVKAQSRPKDLHVAKGNSCGTQQYPVELVIVSVEGLQTLKPQVQKNTHHNVPLLNGV 1011
Query: 1261 NETVEPLGGINLEARMVVSEDDVDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLFK 1320
NETVEPLGGINLEARMVVSEDD+DVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLFK
Sbjct: 1012 NETVEPLGGINLEARMVVSEDDIDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLFK 1071
Query: 1321 SEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSSAASLGIS 1380
SEVDSMGRIAAGILRLLKLEGSIG ATLDQLSNLGSESIDKIFTPEKLSRGSS ASLGIS
Sbjct: 1072 SEVDSMGRIAAGILRLLKLEGSIGHATLDQLSNLGSESIDKIFTPEKLSRGSSLASLGIS 1131
Query: 1381 PSAYLIGESPHRPTVESTVTSLEQAVLDSQSKCTSLMTGLSSSDSSLHVATIKQLYEKLD 1440
PSAYLIGESP RPTVESTVTSLEQAVLDSQSKCTSLMT LSSS+SSLHVATIKQLYEKLD
Sbjct: 1132 PSAYLIGESP-RPTVESTVTSLEQAVLDSQSKCTSLMTELSSSNSSLHVATIKQLYEKLD 1191
Query: 1441 SMQTLLSRLRNQI 1454
SMQTLLSRLRNQI
Sbjct: 1192 SMQTLLSRLRNQI 1203
BLAST of HG10011577 vs. NCBI nr
Match:
XP_004152911.1 (uncharacterized protein LOC101210396 isoform X1 [Cucumis sativus] >KGN56161.1 hypothetical protein Csa_011016 [Cucumis sativus])
HSP 1 Score: 1582.8 bits (4097), Expect = 0.0e+00
Identity = 801/854 (93.79%), Postives = 834/854 (97.66%), Query Frame = 0
Query: 600 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFAKNFCPPIY 659
ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFA+NFCPPIY
Sbjct: 351 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFARNFCPPIY 410
Query: 660 PFKDKQWGLSGSVPLLCLHSVQVKPSPVPPSFATQTVIHCQPLTIHLQEKSCLRISSFLA 719
PFKDKQWGLSG+VPLLCLHSVQVKPSPVPPSFA+QTVIHCQPLTIHLQEKSCLRISSFLA
Sbjct: 411 PFKDKQWGLSGNVPLLCLHSVQVKPSPVPPSFASQTVIHCQPLTIHLQEKSCLRISSFLA 470
Query: 720 DGIVVNPGSVLPDFSISSIILSLKELDVTVPLDVAKSTDYHSSWDGISQSTFDGARLHIK 779
DGIVVNPGSVLPDFS+SSI+LSLKELDV+VPLDVAKS+DYH SWDGIS S+FDGARLHIK
Sbjct: 471 DGIVVNPGSVLPDFSVSSIVLSLKELDVSVPLDVAKSSDYHGSWDGISHSSFDGARLHIK 530
Query: 780 NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGSK 839
NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGSK
Sbjct: 531 NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGSK 590
Query: 840 SSDAILALLRCVELTEVSIEVAMATADGKTLTAVPPPGGVVRVGVSCQQYLSNTSVDQLF 899
SDAILALLRCVELT+VSIEVAMATADGKTLTA+PPPGGVVRVGVSCQQYLSNTSVDQLF
Sbjct: 591 RSDAILALLRCVELTDVSIEVAMATADGKTLTAIPPPGGVVRVGVSCQQYLSNTSVDQLF 650
Query: 900 FVLDLYAYFGRVTEKIALVGKKNQPKESRSNLLVGKLVDKVPSDTAVSLLLRNLQLRFLE 959
FVLDLYAYFGRVTEKIALVGKKN+PKES SN+LVGKLVDKVPSDTAVSLL+RNLQLRFLE
Sbjct: 651 FVLDLYAYFGRVTEKIALVGKKNRPKESGSNMLVGKLVDKVPSDTAVSLLVRNLQLRFLE 710
Query: 960 SSSSIVEELPLVQFVGNNMFIKVSHRTLGGAVAISSTVRWDNVEVDCVDTDGNIAYDNGT 1019
SSS+I+EELPLVQFVGN+MFIKVSHRTLGGAVAI+STVRWDNVEVDCVDT+GN AYDNGT
Sbjct: 711 SSSTIIEELPLVQFVGNDMFIKVSHRTLGGAVAITSTVRWDNVEVDCVDTEGNTAYDNGT 770
Query: 1020 VSTSIENGSLMNGNGLSQLRAILWVHNKGDRFTTPFLDVSIVHVIPLNERDMECHSLNVS 1079
+STSIENGSLM GN LSQLRAILWVHNKGDRF TPFLDVSIVHVIPLNERDMECHSLNVS
Sbjct: 771 MSTSIENGSLMKGNELSQLRAILWVHNKGDRFPTPFLDVSIVHVIPLNERDMECHSLNVS 830
Query: 1080 ACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLLAGS 1139
ACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLL G+
Sbjct: 831 ACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLLTGN 890
Query: 1140 LEGDGKESPLLQLGKPDDVDVSIELKNWLFALEGAEEMAERWWFYNPNYAGREERCWHTS 1199
LEGDGKES LLQLGKPDDVDVSIELKNWLFALEGA+EMAERWWFYNPN AGREERCWHTS
Sbjct: 891 LEGDGKESSLLQLGKPDDVDVSIELKNWLFALEGAQEMAERWWFYNPNNAGREERCWHTS 950
Query: 1200 FQSFRVKAQSRPKDLLNGKGSSCGTQQYPVELVIVSVEGLQTLKPQVQKNTHHNVSLLNG 1259
FQSFRVKAQSR K+ L+GKGSS GTQQ+PVELVI+SVEGLQTLKP VQKN+HHNVSL+NG
Sbjct: 951 FQSFRVKAQSRRKEPLSGKGSSRGTQQFPVELVILSVEGLQTLKPHVQKNSHHNVSLING 1010
Query: 1260 ANETVEPLGGINLEARMVVSEDDVDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF 1319
NET+EPLGGI+LEARMVVSED+VDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF
Sbjct: 1011 VNETIEPLGGISLEARMVVSEDNVDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF 1070
Query: 1320 KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSSAASLGI 1379
KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSS ASLG+
Sbjct: 1071 KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSSMASLGV 1130
Query: 1380 SPSAYLIGESPHRPTVESTVTSLEQAVLDSQSKCTSLMTGLSSSDSSLHVATIKQLYEKL 1439
SPSAYLIGESP RPT+ESTVTSLEQAVLDSQSKCTSLMT LSSSDSS HVATIKQL+EKL
Sbjct: 1131 SPSAYLIGESP-RPTIESTVTSLEQAVLDSQSKCTSLMTELSSSDSSSHVATIKQLHEKL 1190
Query: 1440 DSMQTLLSRLRNQI 1454
DSMQTLLSRLRNQI
Sbjct: 1191 DSMQTLLSRLRNQI 1203
BLAST of HG10011577 vs. NCBI nr
Match:
XP_031738234.1 (uncharacterized protein LOC101210396 isoform X2 [Cucumis sativus])
HSP 1 Score: 1582.8 bits (4097), Expect = 0.0e+00
Identity = 801/854 (93.79%), Postives = 834/854 (97.66%), Query Frame = 0
Query: 600 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFAKNFCPPIY 659
ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFA+NFCPPIY
Sbjct: 156 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFARNFCPPIY 215
Query: 660 PFKDKQWGLSGSVPLLCLHSVQVKPSPVPPSFATQTVIHCQPLTIHLQEKSCLRISSFLA 719
PFKDKQWGLSG+VPLLCLHSVQVKPSPVPPSFA+QTVIHCQPLTIHLQEKSCLRISSFLA
Sbjct: 216 PFKDKQWGLSGNVPLLCLHSVQVKPSPVPPSFASQTVIHCQPLTIHLQEKSCLRISSFLA 275
Query: 720 DGIVVNPGSVLPDFSISSIILSLKELDVTVPLDVAKSTDYHSSWDGISQSTFDGARLHIK 779
DGIVVNPGSVLPDFS+SSI+LSLKELDV+VPLDVAKS+DYH SWDGIS S+FDGARLHIK
Sbjct: 276 DGIVVNPGSVLPDFSVSSIVLSLKELDVSVPLDVAKSSDYHGSWDGISHSSFDGARLHIK 335
Query: 780 NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGSK 839
NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGSK
Sbjct: 336 NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGSK 395
Query: 840 SSDAILALLRCVELTEVSIEVAMATADGKTLTAVPPPGGVVRVGVSCQQYLSNTSVDQLF 899
SDAILALLRCVELT+VSIEVAMATADGKTLTA+PPPGGVVRVGVSCQQYLSNTSVDQLF
Sbjct: 396 RSDAILALLRCVELTDVSIEVAMATADGKTLTAIPPPGGVVRVGVSCQQYLSNTSVDQLF 455
Query: 900 FVLDLYAYFGRVTEKIALVGKKNQPKESRSNLLVGKLVDKVPSDTAVSLLLRNLQLRFLE 959
FVLDLYAYFGRVTEKIALVGKKN+PKES SN+LVGKLVDKVPSDTAVSLL+RNLQLRFLE
Sbjct: 456 FVLDLYAYFGRVTEKIALVGKKNRPKESGSNMLVGKLVDKVPSDTAVSLLVRNLQLRFLE 515
Query: 960 SSSSIVEELPLVQFVGNNMFIKVSHRTLGGAVAISSTVRWDNVEVDCVDTDGNIAYDNGT 1019
SSS+I+EELPLVQFVGN+MFIKVSHRTLGGAVAI+STVRWDNVEVDCVDT+GN AYDNGT
Sbjct: 516 SSSTIIEELPLVQFVGNDMFIKVSHRTLGGAVAITSTVRWDNVEVDCVDTEGNTAYDNGT 575
Query: 1020 VSTSIENGSLMNGNGLSQLRAILWVHNKGDRFTTPFLDVSIVHVIPLNERDMECHSLNVS 1079
+STSIENGSLM GN LSQLRAILWVHNKGDRF TPFLDVSIVHVIPLNERDMECHSLNVS
Sbjct: 576 MSTSIENGSLMKGNELSQLRAILWVHNKGDRFPTPFLDVSIVHVIPLNERDMECHSLNVS 635
Query: 1080 ACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLLAGS 1139
ACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLL G+
Sbjct: 636 ACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLLTGN 695
Query: 1140 LEGDGKESPLLQLGKPDDVDVSIELKNWLFALEGAEEMAERWWFYNPNYAGREERCWHTS 1199
LEGDGKES LLQLGKPDDVDVSIELKNWLFALEGA+EMAERWWFYNPN AGREERCWHTS
Sbjct: 696 LEGDGKESSLLQLGKPDDVDVSIELKNWLFALEGAQEMAERWWFYNPNNAGREERCWHTS 755
Query: 1200 FQSFRVKAQSRPKDLLNGKGSSCGTQQYPVELVIVSVEGLQTLKPQVQKNTHHNVSLLNG 1259
FQSFRVKAQSR K+ L+GKGSS GTQQ+PVELVI+SVEGLQTLKP VQKN+HHNVSL+NG
Sbjct: 756 FQSFRVKAQSRRKEPLSGKGSSRGTQQFPVELVILSVEGLQTLKPHVQKNSHHNVSLING 815
Query: 1260 ANETVEPLGGINLEARMVVSEDDVDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF 1319
NET+EPLGGI+LEARMVVSED+VDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF
Sbjct: 816 VNETIEPLGGISLEARMVVSEDNVDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF 875
Query: 1320 KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSSAASLGI 1379
KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSS ASLG+
Sbjct: 876 KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSSMASLGV 935
Query: 1380 SPSAYLIGESPHRPTVESTVTSLEQAVLDSQSKCTSLMTGLSSSDSSLHVATIKQLYEKL 1439
SPSAYLIGESP RPT+ESTVTSLEQAVLDSQSKCTSLMT LSSSDSS HVATIKQL+EKL
Sbjct: 936 SPSAYLIGESP-RPTIESTVTSLEQAVLDSQSKCTSLMTELSSSDSSSHVATIKQLHEKL 995
Query: 1440 DSMQTLLSRLRNQI 1454
DSMQTLLSRLRNQI
Sbjct: 996 DSMQTLLSRLRNQI 1008
BLAST of HG10011577 vs. NCBI nr
Match:
KAA0025451.1 (Chorein_N domain-containing protein [Cucumis melo var. makuwa])
HSP 1 Score: 1579.7 bits (4089), Expect = 0.0e+00
Identity = 801/854 (93.79%), Postives = 831/854 (97.31%), Query Frame = 0
Query: 600 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFAKNFCPPIY 659
ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAV DDFLHVPEFA+NFCPPIY
Sbjct: 351 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVIDDFLHVPEFARNFCPPIY 410
Query: 660 PFKDKQWGLSGSVPLLCLHSVQVKPSPVPPSFATQTVIHCQPLTIHLQEKSCLRISSFLA 719
PFKDKQWGLSG+VPLLCLHSVQVKPSPVPPSFA+QTVIHCQPLTIHLQEKSCLRISSFLA
Sbjct: 411 PFKDKQWGLSGNVPLLCLHSVQVKPSPVPPSFASQTVIHCQPLTIHLQEKSCLRISSFLA 470
Query: 720 DGIVVNPGSVLPDFSISSIILSLKELDVTVPLDVAKSTDYHSSWDGISQSTFDGARLHIK 779
DGIVVNPGSVLPDFSISSI+LSLKELDV+VPLDVAKSTDYH SWDGIS +FDGARLHIK
Sbjct: 471 DGIVVNPGSVLPDFSISSIVLSLKELDVSVPLDVAKSTDYHGSWDGISHCSFDGARLHIK 530
Query: 780 NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGSK 839
NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKW+TSVSQISLSLETYNKVSGSK
Sbjct: 531 NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWSTSVSQISLSLETYNKVSGSK 590
Query: 840 SSDAILALLRCVELTEVSIEVAMATADGKTLTAVPPPGGVVRVGVSCQQYLSNTSVDQLF 899
SDAILALLRCVELT+VSIEVAMATADG+TLTA+PPPGGVVRVGVSCQQYLSNTSVDQLF
Sbjct: 591 RSDAILALLRCVELTDVSIEVAMATADGRTLTAIPPPGGVVRVGVSCQQYLSNTSVDQLF 650
Query: 900 FVLDLYAYFGRVTEKIALVGKKNQPKESRSNLLVGKLVDKVPSDTAVSLLLRNLQLRFLE 959
FVLDLYAYFGRVTEKIALVGKKN+PKES SNLLVGKLVDKVPSDTAVSLL+RNLQLRFLE
Sbjct: 651 FVLDLYAYFGRVTEKIALVGKKNRPKESGSNLLVGKLVDKVPSDTAVSLLVRNLQLRFLE 710
Query: 960 SSSSIVEELPLVQFVGNNMFIKVSHRTLGGAVAISSTVRWDNVEVDCVDTDGNIAYDNGT 1019
SSS+I+EELPLVQFVGN+MFIKVSHRTLGGAVAI+STVRWDNVEVDCVDT+GN YDNGT
Sbjct: 711 SSSTIIEELPLVQFVGNDMFIKVSHRTLGGAVAITSTVRWDNVEVDCVDTEGNTTYDNGT 770
Query: 1020 VSTSIENGSLMNGNGLSQLRAILWVHNKGDRFTTPFLDVSIVHVIPLNERDMECHSLNVS 1079
VSTSIENGSLMNGN LS+LRAILWVHNKGDRF TPFLDVSIVHVIPLNERDMECHSLNVS
Sbjct: 771 VSTSIENGSLMNGNELSRLRAILWVHNKGDRFPTPFLDVSIVHVIPLNERDMECHSLNVS 830
Query: 1080 ACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLLAGS 1139
ACIAGVRLSGGMNYAEALLHRFGILG DGGPGKGLMKGLENLRAGPLVKLFKTSPLL GS
Sbjct: 831 ACIAGVRLSGGMNYAEALLHRFGILGLDGGPGKGLMKGLENLRAGPLVKLFKTSPLLTGS 890
Query: 1140 LEGDGKESPLLQLGKPDDVDVSIELKNWLFALEGAEEMAERWWFYNPNYAGREERCWHTS 1199
LEGDGKES LLQLGKPDDVDVSIELKNWLFALEGA+EMAERWWFYNPN AGREERCWHTS
Sbjct: 891 LEGDGKESSLLQLGKPDDVDVSIELKNWLFALEGAQEMAERWWFYNPNNAGREERCWHTS 950
Query: 1200 FQSFRVKAQSRPKDLLNGKGSSCGTQQYPVELVIVSVEGLQTLKPQVQKNTHHNVSLLNG 1259
FQSFRVKAQSR KD L+GKGSS G+QQ+PVELVI+SVEGLQTLKPQ QKN+HHNVSL+NG
Sbjct: 951 FQSFRVKAQSRRKDPLSGKGSSLGSQQFPVELVIMSVEGLQTLKPQAQKNSHHNVSLING 1010
Query: 1260 ANETVEPLGGINLEARMVVSEDDVDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF 1319
NET+EPLGGINLEARMVVSED+VDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF
Sbjct: 1011 VNETIEPLGGINLEARMVVSEDNVDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF 1070
Query: 1320 KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSSAASLGI 1379
KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSS ASLG+
Sbjct: 1071 KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSSLASLGV 1130
Query: 1380 SPSAYLIGESPHRPTVESTVTSLEQAVLDSQSKCTSLMTGLSSSDSSLHVATIKQLYEKL 1439
SPSAYLIGESP RPT+ESTVTSLEQAVLDSQSKCTSLMT LSSSDSS HVATIKQL+EKL
Sbjct: 1131 SPSAYLIGESP-RPTIESTVTSLEQAVLDSQSKCTSLMTELSSSDSSSHVATIKQLHEKL 1190
Query: 1440 DSMQTLLSRLRNQI 1454
DSMQTLLSRLRNQI
Sbjct: 1191 DSMQTLLSRLRNQI 1203
BLAST of HG10011577 vs. ExPASy TrEMBL
Match:
A0A0A0L7Q7 (Chorein_N domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G081370 PE=4 SV=1)
HSP 1 Score: 1582.8 bits (4097), Expect = 0.0e+00
Identity = 801/854 (93.79%), Postives = 834/854 (97.66%), Query Frame = 0
Query: 600 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFAKNFCPPIY 659
ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFA+NFCPPIY
Sbjct: 351 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFARNFCPPIY 410
Query: 660 PFKDKQWGLSGSVPLLCLHSVQVKPSPVPPSFATQTVIHCQPLTIHLQEKSCLRISSFLA 719
PFKDKQWGLSG+VPLLCLHSVQVKPSPVPPSFA+QTVIHCQPLTIHLQEKSCLRISSFLA
Sbjct: 411 PFKDKQWGLSGNVPLLCLHSVQVKPSPVPPSFASQTVIHCQPLTIHLQEKSCLRISSFLA 470
Query: 720 DGIVVNPGSVLPDFSISSIILSLKELDVTVPLDVAKSTDYHSSWDGISQSTFDGARLHIK 779
DGIVVNPGSVLPDFS+SSI+LSLKELDV+VPLDVAKS+DYH SWDGIS S+FDGARLHIK
Sbjct: 471 DGIVVNPGSVLPDFSVSSIVLSLKELDVSVPLDVAKSSDYHGSWDGISHSSFDGARLHIK 530
Query: 780 NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGSK 839
NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGSK
Sbjct: 531 NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGSK 590
Query: 840 SSDAILALLRCVELTEVSIEVAMATADGKTLTAVPPPGGVVRVGVSCQQYLSNTSVDQLF 899
SDAILALLRCVELT+VSIEVAMATADGKTLTA+PPPGGVVRVGVSCQQYLSNTSVDQLF
Sbjct: 591 RSDAILALLRCVELTDVSIEVAMATADGKTLTAIPPPGGVVRVGVSCQQYLSNTSVDQLF 650
Query: 900 FVLDLYAYFGRVTEKIALVGKKNQPKESRSNLLVGKLVDKVPSDTAVSLLLRNLQLRFLE 959
FVLDLYAYFGRVTEKIALVGKKN+PKES SN+LVGKLVDKVPSDTAVSLL+RNLQLRFLE
Sbjct: 651 FVLDLYAYFGRVTEKIALVGKKNRPKESGSNMLVGKLVDKVPSDTAVSLLVRNLQLRFLE 710
Query: 960 SSSSIVEELPLVQFVGNNMFIKVSHRTLGGAVAISSTVRWDNVEVDCVDTDGNIAYDNGT 1019
SSS+I+EELPLVQFVGN+MFIKVSHRTLGGAVAI+STVRWDNVEVDCVDT+GN AYDNGT
Sbjct: 711 SSSTIIEELPLVQFVGNDMFIKVSHRTLGGAVAITSTVRWDNVEVDCVDTEGNTAYDNGT 770
Query: 1020 VSTSIENGSLMNGNGLSQLRAILWVHNKGDRFTTPFLDVSIVHVIPLNERDMECHSLNVS 1079
+STSIENGSLM GN LSQLRAILWVHNKGDRF TPFLDVSIVHVIPLNERDMECHSLNVS
Sbjct: 771 MSTSIENGSLMKGNELSQLRAILWVHNKGDRFPTPFLDVSIVHVIPLNERDMECHSLNVS 830
Query: 1080 ACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLLAGS 1139
ACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLL G+
Sbjct: 831 ACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLLTGN 890
Query: 1140 LEGDGKESPLLQLGKPDDVDVSIELKNWLFALEGAEEMAERWWFYNPNYAGREERCWHTS 1199
LEGDGKES LLQLGKPDDVDVSIELKNWLFALEGA+EMAERWWFYNPN AGREERCWHTS
Sbjct: 891 LEGDGKESSLLQLGKPDDVDVSIELKNWLFALEGAQEMAERWWFYNPNNAGREERCWHTS 950
Query: 1200 FQSFRVKAQSRPKDLLNGKGSSCGTQQYPVELVIVSVEGLQTLKPQVQKNTHHNVSLLNG 1259
FQSFRVKAQSR K+ L+GKGSS GTQQ+PVELVI+SVEGLQTLKP VQKN+HHNVSL+NG
Sbjct: 951 FQSFRVKAQSRRKEPLSGKGSSRGTQQFPVELVILSVEGLQTLKPHVQKNSHHNVSLING 1010
Query: 1260 ANETVEPLGGINLEARMVVSEDDVDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF 1319
NET+EPLGGI+LEARMVVSED+VDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF
Sbjct: 1011 VNETIEPLGGISLEARMVVSEDNVDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF 1070
Query: 1320 KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSSAASLGI 1379
KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSS ASLG+
Sbjct: 1071 KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSSMASLGV 1130
Query: 1380 SPSAYLIGESPHRPTVESTVTSLEQAVLDSQSKCTSLMTGLSSSDSSLHVATIKQLYEKL 1439
SPSAYLIGESP RPT+ESTVTSLEQAVLDSQSKCTSLMT LSSSDSS HVATIKQL+EKL
Sbjct: 1131 SPSAYLIGESP-RPTIESTVTSLEQAVLDSQSKCTSLMTELSSSDSSSHVATIKQLHEKL 1190
Query: 1440 DSMQTLLSRLRNQI 1454
DSMQTLLSRLRNQI
Sbjct: 1191 DSMQTLLSRLRNQI 1203
BLAST of HG10011577 vs. ExPASy TrEMBL
Match:
A0A5A7SMI5 (Chorein_N domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold417G00470 PE=4 SV=1)
HSP 1 Score: 1579.7 bits (4089), Expect = 0.0e+00
Identity = 801/854 (93.79%), Postives = 831/854 (97.31%), Query Frame = 0
Query: 600 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFAKNFCPPIY 659
ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAV DDFLHVPEFA+NFCPPIY
Sbjct: 351 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVIDDFLHVPEFARNFCPPIY 410
Query: 660 PFKDKQWGLSGSVPLLCLHSVQVKPSPVPPSFATQTVIHCQPLTIHLQEKSCLRISSFLA 719
PFKDKQWGLSG+VPLLCLHSVQVKPSPVPPSFA+QTVIHCQPLTIHLQEKSCLRISSFLA
Sbjct: 411 PFKDKQWGLSGNVPLLCLHSVQVKPSPVPPSFASQTVIHCQPLTIHLQEKSCLRISSFLA 470
Query: 720 DGIVVNPGSVLPDFSISSIILSLKELDVTVPLDVAKSTDYHSSWDGISQSTFDGARLHIK 779
DGIVVNPGSVLPDFSISSI+LSLKELDV+VPLDVAKSTDYH SWDGIS +FDGARLHIK
Sbjct: 471 DGIVVNPGSVLPDFSISSIVLSLKELDVSVPLDVAKSTDYHGSWDGISHCSFDGARLHIK 530
Query: 780 NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGSK 839
NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKW+TSVSQISLSLETYNKVSGSK
Sbjct: 531 NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWSTSVSQISLSLETYNKVSGSK 590
Query: 840 SSDAILALLRCVELTEVSIEVAMATADGKTLTAVPPPGGVVRVGVSCQQYLSNTSVDQLF 899
SDAILALLRCVELT+VSIEVAMATADG+TLTA+PPPGGVVRVGVSCQQYLSNTSVDQLF
Sbjct: 591 RSDAILALLRCVELTDVSIEVAMATADGRTLTAIPPPGGVVRVGVSCQQYLSNTSVDQLF 650
Query: 900 FVLDLYAYFGRVTEKIALVGKKNQPKESRSNLLVGKLVDKVPSDTAVSLLLRNLQLRFLE 959
FVLDLYAYFGRVTEKIALVGKKN+PKES SNLLVGKLVDKVPSDTAVSLL+RNLQLRFLE
Sbjct: 651 FVLDLYAYFGRVTEKIALVGKKNRPKESGSNLLVGKLVDKVPSDTAVSLLVRNLQLRFLE 710
Query: 960 SSSSIVEELPLVQFVGNNMFIKVSHRTLGGAVAISSTVRWDNVEVDCVDTDGNIAYDNGT 1019
SSS+I+EELPLVQFVGN+MFIKVSHRTLGGAVAI+STVRWDNVEVDCVDT+GN YDNGT
Sbjct: 711 SSSTIIEELPLVQFVGNDMFIKVSHRTLGGAVAITSTVRWDNVEVDCVDTEGNTTYDNGT 770
Query: 1020 VSTSIENGSLMNGNGLSQLRAILWVHNKGDRFTTPFLDVSIVHVIPLNERDMECHSLNVS 1079
VSTSIENGSLMNGN LS+LRAILWVHNKGDRF TPFLDVSIVHVIPLNERDMECHSLNVS
Sbjct: 771 VSTSIENGSLMNGNELSRLRAILWVHNKGDRFPTPFLDVSIVHVIPLNERDMECHSLNVS 830
Query: 1080 ACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLLAGS 1139
ACIAGVRLSGGMNYAEALLHRFGILG DGGPGKGLMKGLENLRAGPLVKLFKTSPLL GS
Sbjct: 831 ACIAGVRLSGGMNYAEALLHRFGILGLDGGPGKGLMKGLENLRAGPLVKLFKTSPLLTGS 890
Query: 1140 LEGDGKESPLLQLGKPDDVDVSIELKNWLFALEGAEEMAERWWFYNPNYAGREERCWHTS 1199
LEGDGKES LLQLGKPDDVDVSIELKNWLFALEGA+EMAERWWFYNPN AGREERCWHTS
Sbjct: 891 LEGDGKESSLLQLGKPDDVDVSIELKNWLFALEGAQEMAERWWFYNPNNAGREERCWHTS 950
Query: 1200 FQSFRVKAQSRPKDLLNGKGSSCGTQQYPVELVIVSVEGLQTLKPQVQKNTHHNVSLLNG 1259
FQSFRVKAQSR KD L+GKGSS G+QQ+PVELVI+SVEGLQTLKPQ QKN+HHNVSL+NG
Sbjct: 951 FQSFRVKAQSRRKDPLSGKGSSLGSQQFPVELVIMSVEGLQTLKPQAQKNSHHNVSLING 1010
Query: 1260 ANETVEPLGGINLEARMVVSEDDVDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF 1319
NET+EPLGGINLEARMVVSED+VDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF
Sbjct: 1011 VNETIEPLGGINLEARMVVSEDNVDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF 1070
Query: 1320 KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSSAASLGI 1379
KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSS ASLG+
Sbjct: 1071 KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSSLASLGV 1130
Query: 1380 SPSAYLIGESPHRPTVESTVTSLEQAVLDSQSKCTSLMTGLSSSDSSLHVATIKQLYEKL 1439
SPSAYLIGESP RPT+ESTVTSLEQAVLDSQSKCTSLMT LSSSDSS HVATIKQL+EKL
Sbjct: 1131 SPSAYLIGESP-RPTIESTVTSLEQAVLDSQSKCTSLMTELSSSDSSSHVATIKQLHEKL 1190
Query: 1440 DSMQTLLSRLRNQI 1454
DSMQTLLSRLRNQI
Sbjct: 1191 DSMQTLLSRLRNQI 1203
BLAST of HG10011577 vs. ExPASy TrEMBL
Match:
A0A1S3CJR3 (uncharacterized protein LOC103501618 OS=Cucumis melo OX=3656 GN=LOC103501618 PE=4 SV=1)
HSP 1 Score: 1576.6 bits (4081), Expect = 0.0e+00
Identity = 800/854 (93.68%), Postives = 831/854 (97.31%), Query Frame = 0
Query: 600 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFAKNFCPPIY 659
ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAV DDFLHVPEFA+NFCPPIY
Sbjct: 351 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVIDDFLHVPEFARNFCPPIY 410
Query: 660 PFKDKQWGLSGSVPLLCLHSVQVKPSPVPPSFATQTVIHCQPLTIHLQEKSCLRISSFLA 719
PFKDKQWGLSG+VPLLCLHSVQVKPSPVPPSFA+QTVIHCQPLTIHLQEKSCLRISSFLA
Sbjct: 411 PFKDKQWGLSGNVPLLCLHSVQVKPSPVPPSFASQTVIHCQPLTIHLQEKSCLRISSFLA 470
Query: 720 DGIVVNPGSVLPDFSISSIILSLKELDVTVPLDVAKSTDYHSSWDGISQSTFDGARLHIK 779
DGIVVNPGSVLPDFSISSI+LSLKELDV+VPLDVAKSTDYH SWDGIS S+FDGARLHIK
Sbjct: 471 DGIVVNPGSVLPDFSISSIVLSLKELDVSVPLDVAKSTDYHGSWDGISHSSFDGARLHIK 530
Query: 780 NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGSK 839
NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKW+TSVSQISLSLETYNKVSGSK
Sbjct: 531 NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWSTSVSQISLSLETYNKVSGSK 590
Query: 840 SSDAILALLRCVELTEVSIEVAMATADGKTLTAVPPPGGVVRVGVSCQQYLSNTSVDQLF 899
SDAILALLRCVELT+VSIEVAMATADG+TLTA+PPPGGVVRVGVSCQQYLSNTSVDQLF
Sbjct: 591 RSDAILALLRCVELTDVSIEVAMATADGRTLTAIPPPGGVVRVGVSCQQYLSNTSVDQLF 650
Query: 900 FVLDLYAYFGRVTEKIALVGKKNQPKESRSNLLVGKLVDKVPSDTAVSLLLRNLQLRFLE 959
FVLDLYAYFGRVTEKIALVGKKN+PKES SNLLVGKLVDKVPSDTAVSLL+RNLQLRFLE
Sbjct: 651 FVLDLYAYFGRVTEKIALVGKKNRPKESGSNLLVGKLVDKVPSDTAVSLLVRNLQLRFLE 710
Query: 960 SSSSIVEELPLVQFVGNNMFIKVSHRTLGGAVAISSTVRWDNVEVDCVDTDGNIAYDNGT 1019
SSS+I+EELPLVQF+GN+MFIKVSHRTLGGAVAI+STVRWDNVEVDCVDT+GN YDNGT
Sbjct: 711 SSSTIIEELPLVQFIGNDMFIKVSHRTLGGAVAITSTVRWDNVEVDCVDTEGNTTYDNGT 770
Query: 1020 VSTSIENGSLMNGNGLSQLRAILWVHNKGDRFTTPFLDVSIVHVIPLNERDMECHSLNVS 1079
VSTSIENGSLMNGN LS+LRAILWVHNKGDRF TPFLDVSIVHVIPLNERDMECHSLNVS
Sbjct: 771 VSTSIENGSLMNGNELSRLRAILWVHNKGDRFPTPFLDVSIVHVIPLNERDMECHSLNVS 830
Query: 1080 ACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLLAGS 1139
ACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLL GS
Sbjct: 831 ACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLLTGS 890
Query: 1140 LEGDGKESPLLQLGKPDDVDVSIELKNWLFALEGAEEMAERWWFYNPNYAGREERCWHTS 1199
LEGDGKES LLQLGKPDDVDVSIELKNWLFALEGA+EMAERWWFYNPN AGREERCWHTS
Sbjct: 891 LEGDGKESSLLQLGKPDDVDVSIELKNWLFALEGAQEMAERWWFYNPNNAGREERCWHTS 950
Query: 1200 FQSFRVKAQSRPKDLLNGKGSSCGTQQYPVELVIVSVEGLQTLKPQVQKNTHHNVSLLNG 1259
FQSFRVKAQSR KD L+GKGSS G+QQ+PVELVI+SVEGLQTLKPQ QKN+HHNVSL+NG
Sbjct: 951 FQSFRVKAQSRRKDPLSGKGSSLGSQQFPVELVIMSVEGLQTLKPQAQKNSHHNVSLING 1010
Query: 1260 ANETVEPLGGINLEARMVVSEDDVDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF 1319
NET+EPLGGINLEARMVVSED+V VEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF
Sbjct: 1011 VNETIEPLGGINLEARMVVSEDNV-VEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF 1070
Query: 1320 KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSSAASLGI 1379
KSEVDSMGRIAAG LRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSS ASLG+
Sbjct: 1071 KSEVDSMGRIAAGFLRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSRGSSLASLGV 1130
Query: 1380 SPSAYLIGESPHRPTVESTVTSLEQAVLDSQSKCTSLMTGLSSSDSSLHVATIKQLYEKL 1439
SPSAYLIGESP RPT+ESTVTSLEQAVLDSQSKCTSLMT LSSSDSS HVATIKQL+EKL
Sbjct: 1131 SPSAYLIGESP-RPTIESTVTSLEQAVLDSQSKCTSLMTELSSSDSSSHVATIKQLHEKL 1190
Query: 1440 DSMQTLLSRLRNQI 1454
DSMQTLLSRLRNQI
Sbjct: 1191 DSMQTLLSRLRNQI 1202
BLAST of HG10011577 vs. ExPASy TrEMBL
Match:
A0A6J1FP42 (uncharacterized protein LOC111447221 OS=Cucurbita moschata OX=3662 GN=LOC111447221 PE=4 SV=1)
HSP 1 Score: 1554.7 bits (4024), Expect = 0.0e+00
Identity = 785/855 (91.81%), Postives = 821/855 (96.02%), Query Frame = 0
Query: 600 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFAKNFCPPIY 659
ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAM+AVTDDFLHVPEFAKNFCPPIY
Sbjct: 350 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMRAVTDDFLHVPEFAKNFCPPIY 409
Query: 660 PFKDKQWGLSGSVPLLCLHSVQVKPSPVPPSFATQTVIHCQPLTIHLQEKSCLRISSFLA 719
PFKDKQW LSG+VPLLCLHSVQVKPSPVPPSFATQTVIHCQPLTIHLQEKSCLRISSFLA
Sbjct: 410 PFKDKQWELSGNVPLLCLHSVQVKPSPVPPSFATQTVIHCQPLTIHLQEKSCLRISSFLA 469
Query: 720 DGIVVNPGSVLPDFSISSIILSLKELDVTVPLDVAKSTDYHSSWDGISQSTFDGARLHIK 779
DGIVVNPGSVLPDFSI+SI+LSLKELDVTVP+DVAKST+YHSSW G SQS+FDGARLHIK
Sbjct: 470 DGIVVNPGSVLPDFSINSILLSLKELDVTVPIDVAKSTNYHSSWVGTSQSSFDGARLHIK 529
Query: 780 NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGSK 839
NMQFSESPSL LRLLNL+KDPACFLLWEGQP+DASQKKWATSVSQ+SLSLETYNKV GSK
Sbjct: 530 NMQFSESPSLKLRLLNLEKDPACFLLWEGQPIDASQKKWATSVSQVSLSLETYNKVIGSK 589
Query: 840 SSDAILALLRCVELTEVSIEVAMATADGKTLTAVPPPGGVVRVGVSCQQYLSNTSVDQLF 899
SSDAILA LRCVELT+VS+EVAMATADGK LT +PPPGG VRVGVSCQQYLSNTSVDQLF
Sbjct: 590 SSDAILASLRCVELTDVSVEVAMATADGKILTVLPPPGGFVRVGVSCQQYLSNTSVDQLF 649
Query: 900 FVLDLYAYFGRVTEKIALVGKKNQPKESRSNLLVGKLVDKVPSDTAVSLLLRNLQLRFLE 959
FVLDLYAYFGRVTEKIALVGKKN+PKESRSNLL GKLVDKVPSDTAVSLL++N+QLRFLE
Sbjct: 650 FVLDLYAYFGRVTEKIALVGKKNRPKESRSNLLAGKLVDKVPSDTAVSLLVKNIQLRFLE 709
Query: 960 SSSSIVEELPLVQFVGNNMFIKVSHRTLGGAVAISSTVRWDNVEVDCVDTDGNIAYDNGT 1019
SSS+IV ELPLVQF+GN+MFIKV+HRTLGGAVAISSTVRWDNVEVDCVDT+GNIAYDNGT
Sbjct: 710 SSSTIVGELPLVQFIGNDMFIKVAHRTLGGAVAISSTVRWDNVEVDCVDTEGNIAYDNGT 769
Query: 1020 VSTSIENGSLMNGNGLSQLRAILWVHNKGDRFTTPFLDVSIVHVIPLNERDMECHSLNVS 1079
VSTSIENGS +NGNGLSQLRAILWVHNKGDRFTTPFLDVSIVHVIPLNERDMECHSLNVS
Sbjct: 770 VSTSIENGSFVNGNGLSQLRAILWVHNKGDRFTTPFLDVSIVHVIPLNERDMECHSLNVS 829
Query: 1080 ACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLLAGS 1139
AC+AGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPL KLFKTSPLLAGS
Sbjct: 830 ACVAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLAKLFKTSPLLAGS 889
Query: 1140 LEGDGKESPLLQLGKPDDVDVSIELKNWLFALEGAEEMAERWWFYNPNYAGREERCWHTS 1199
LEGDGKES +LQLGKPDDVDVSIELKNWLFALEG +EM+ERWWFYNPN AGREERCWHTS
Sbjct: 890 LEGDGKESTVLQLGKPDDVDVSIELKNWLFALEGEQEMSERWWFYNPNNAGREERCWHTS 949
Query: 1200 FQSFRVKAQSRPKDLLNGKGSSCGTQQYPVELVIVSVEGLQTLKPQVQKNTHHNVSLLNG 1259
FQSFRVKA SRPK+ LNGKG SCG Q+YPVELVIVSVEGLQTLKPQ+QKNTHH VSLLNG
Sbjct: 950 FQSFRVKAHSRPKEPLNGKGRSCGAQRYPVELVIVSVEGLQTLKPQIQKNTHHTVSLLNG 1009
Query: 1260 ANETVEPLGGINLEARMVVSEDDVDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF 1319
NETVEPLGGINLEAR+VV ED+VD EMANWIMENLKFSVKHPIEAVVTKNELQHLALLF
Sbjct: 1010 VNETVEPLGGINLEARLVVPEDNVDDEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF 1069
Query: 1320 KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKL-SRGSSAASLG 1379
KSEVDSMGRIAAG+LRLLKLE SIG TLDQL+NLGSESIDKIFTPEKL SRGSSAAS G
Sbjct: 1070 KSEVDSMGRIAAGVLRLLKLESSIGLTTLDQLNNLGSESIDKIFTPEKLSSRGSSAASFG 1129
Query: 1380 ISPSAYLIGESPHRPTVESTVTSLEQAVLDSQSKCTSLMTGLSSSDSSLHVATIKQLYEK 1439
SPS YLIGESP RPT+ESTVTSLEQAVLDSQSKCTSLMT LSSSDS +HVATIKQLYEK
Sbjct: 1130 FSPSTYLIGESP-RPTIESTVTSLEQAVLDSQSKCTSLMTELSSSDSLVHVATIKQLYEK 1189
Query: 1440 LDSMQTLLSRLRNQI 1454
LDSMQTLLSRLRNQI
Sbjct: 1190 LDSMQTLLSRLRNQI 1203
BLAST of HG10011577 vs. ExPASy TrEMBL
Match:
A0A6J1IS31 (uncharacterized protein LOC111477917 OS=Cucurbita maxima OX=3661 GN=LOC111477917 PE=4 SV=1)
HSP 1 Score: 1550.8 bits (4014), Expect = 0.0e+00
Identity = 785/855 (91.81%), Postives = 819/855 (95.79%), Query Frame = 0
Query: 600 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFAKNFCPPIY 659
ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAM+AVTDDFLHVPEFAKNFCPPIY
Sbjct: 350 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMRAVTDDFLHVPEFAKNFCPPIY 409
Query: 660 PFKDKQWGLSGSVPLLCLHSVQVKPSPVPPSFATQTVIHCQPLTIHLQEKSCLRISSFLA 719
PFKDKQW LSGSVPLLCLHSVQ KPSPVPPSFATQTVIHCQPLTIHLQEKSCLRISSFLA
Sbjct: 410 PFKDKQWELSGSVPLLCLHSVQFKPSPVPPSFATQTVIHCQPLTIHLQEKSCLRISSFLA 469
Query: 720 DGIVVNPGSVLPDFSISSIILSLKELDVTVPLDVAKSTDYHSSWDGISQSTFDGARLHIK 779
DGIVVNPGSVLPDFSI+SI+LSLKELDVTVP+DVAKST+YHSSW G SQS+FDGARLHIK
Sbjct: 470 DGIVVNPGSVLPDFSINSILLSLKELDVTVPIDVAKSTNYHSSWVGTSQSSFDGARLHIK 529
Query: 780 NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGSK 839
NMQFSESPSL LRLLNL+KDPACFLLWEGQP+DASQKKWATSVSQ+SLSLETYNKV GSK
Sbjct: 530 NMQFSESPSLKLRLLNLEKDPACFLLWEGQPIDASQKKWATSVSQVSLSLETYNKVIGSK 589
Query: 840 SSDAILALLRCVELTEVSIEVAMATADGKTLTAVPPPGGVVRVGVSCQQYLSNTSVDQLF 899
SSDAILA LRCVELT+VSIEVAMATADGK LT +PPPGG VRVGVSCQQYLSNTSVDQLF
Sbjct: 590 SSDAILASLRCVELTDVSIEVAMATADGKILTVLPPPGGFVRVGVSCQQYLSNTSVDQLF 649
Query: 900 FVLDLYAYFGRVTEKIALVGKKNQPKESRSNLLVGKLVDKVPSDTAVSLLLRNLQLRFLE 959
FVLDLYAYFGRVTEKIALVGKKN+PKESRSNLL GKLVDKVPSDTAVSLL++N+QLRFLE
Sbjct: 650 FVLDLYAYFGRVTEKIALVGKKNRPKESRSNLLAGKLVDKVPSDTAVSLLVKNIQLRFLE 709
Query: 960 SSSSIVEELPLVQFVGNNMFIKVSHRTLGGAVAISSTVRWDNVEVDCVDTDGNIAYDNGT 1019
SSS+IV ELPLVQF+GN+MFIKV+HRTLGGAVAISSTV+WDNVEVDCVDT+GNIAYDNGT
Sbjct: 710 SSSTIVGELPLVQFIGNDMFIKVAHRTLGGAVAISSTVKWDNVEVDCVDTEGNIAYDNGT 769
Query: 1020 VSTSIENGSLMNGNGLSQLRAILWVHNKGDRFTTPFLDVSIVHVIPLNERDMECHSLNVS 1079
VSTSIENGS +NGNGLSQLRAILWVHNKGDRFTTPFLDVSIVHVIPLNERDMECHSLNVS
Sbjct: 770 VSTSIENGSFVNGNGLSQLRAILWVHNKGDRFTTPFLDVSIVHVIPLNERDMECHSLNVS 829
Query: 1080 ACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTSPLLAGS 1139
AC+AGVRLSGGMNYAEALLHRFGILGPDGGPGKGLM+GLENLRAGPL KLFKTSPLLAGS
Sbjct: 830 ACVAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMRGLENLRAGPLAKLFKTSPLLAGS 889
Query: 1140 LEGDGKESPLLQLGKPDDVDVSIELKNWLFALEGAEEMAERWWFYNPNYAGREERCWHTS 1199
LEGDGKES +LQLGKPDDVDVSIELKNWLFALEG +EM+ERWWFYNPN AGREERCWHTS
Sbjct: 890 LEGDGKESTVLQLGKPDDVDVSIELKNWLFALEGEQEMSERWWFYNPNNAGREERCWHTS 949
Query: 1200 FQSFRVKAQSRPKDLLNGKGSSCGTQQYPVELVIVSVEGLQTLKPQVQKNTHHNVSLLNG 1259
FQSFRVKA SRPK+LLNGKG S G QQYPVELVIVSVEGLQTLKPQ+QKNTHH VSL NG
Sbjct: 950 FQSFRVKAHSRPKELLNGKGRSFGAQQYPVELVIVSVEGLQTLKPQIQKNTHHTVSLPNG 1009
Query: 1260 ANETVEPLGGINLEARMVVSEDDVDVEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF 1319
NETVEPLGGINLEAR+VVSED+VD EMANWIMENLKFSVKHPIEAVVTKNELQHLALLF
Sbjct: 1010 VNETVEPLGGINLEARLVVSEDNVDDEMANWIMENLKFSVKHPIEAVVTKNELQHLALLF 1069
Query: 1320 KSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKL-SRGSSAASLG 1379
KSEVDSMGRIAAG+LRLLKLE SIG TLDQLSNLGSESIDKIFTPEKL SRGSSAAS G
Sbjct: 1070 KSEVDSMGRIAAGVLRLLKLESSIGLTTLDQLSNLGSESIDKIFTPEKLSSRGSSAASFG 1129
Query: 1380 ISPSAYLIGESPHRPTVESTVTSLEQAVLDSQSKCTSLMTGLSSSDSSLHVATIKQLYEK 1439
SPS YLIGESP RPT+ESTVTSLEQAVLDSQSKCTSLMT LSSSDS +HVATIKQLYEK
Sbjct: 1130 FSPSTYLIGESP-RPTIESTVTSLEQAVLDSQSKCTSLMTELSSSDSLVHVATIKQLYEK 1189
Query: 1440 LDSMQTLLSRLRNQI 1454
DSMQTLLSRLRNQI
Sbjct: 1190 FDSMQTLLSRLRNQI 1203
BLAST of HG10011577 vs. TAIR 10
Match:
AT3G20720.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages. )
HSP 1 Score: 906.0 bits (2340), Expect = 4.0e-263
Identity = 477/870 (54.83%), Postives = 635/870 (72.99%), Query Frame = 0
Query: 600 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFAKNFCPPIY 659
A VSDG++ N LT+++IGGLFLRD FSR PC L+QP+M+A +D L +P+FAKNFCP IY
Sbjct: 349 ACVSDGESANYLTKILIGGLFLRDAFSRSPCALIQPSMKAAAED-LAIPDFAKNFCPLIY 408
Query: 660 PFKDKQWGLSGSVPLLCLHSVQVKPSPVPPSFATQTVIHCQPLTIHLQEKSCLRISSFLA 719
P W + VPL+ LHS+QVKPSP PP F ++TVI CQPL +HLQE++CLRISSFLA
Sbjct: 409 PLDSGPWQIVQDVPLISLHSLQVKPSPKPPHFFSKTVIQCQPLMVHLQEEACLRISSFLA 468
Query: 720 DGIVVNPGSVLPDFSISSIILSLKELDVTVPLDVAKSTDYHSSWDGISQSTFDGARLHIK 779
DGIVVNPG VLPD S++S++ +LKELDV+VPLD++ D D + +F GARLHI+
Sbjct: 469 DGIVVNPGDVLPDNSVNSLLFTLKELDVSVPLDMSNLQDSAIEEDLSVKKSFVGARLHIE 528
Query: 780 NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETY-NKVSGS 839
N+ F+ESP+L +RLLNL+KDPACF LW GQP+DASQKKW S SL+LET N
Sbjct: 529 NLSFAESPTLKVRLLNLEKDPACFCLWPGQPIDASQKKWTAGASHFSLALETSPNSTQLQ 588
Query: 840 KSSDAILALLRCVELTEVSIEVAMATADGKTLTAVPPPGGVVRVGVSCQQYLSNTSVDQL 899
+ L CVE +VSIEVAM +ADGK L +PPPGG+VR+GV+C+QY+S SV+QL
Sbjct: 589 SPRGPEMGLWNCVEGKDVSIEVAMVSADGKPLITIPPPGGIVRIGVACEQYISRASVEQL 648
Query: 900 FFVLDLYAYFGRVTEKIALVGKKNQPKESRSNLLVGKLVDKVPSDTAVSLLLRNLQLRFL 959
FFVLDLY+YFG+V+EKI++V + K + L G L++KVPSDTAV L L++LQL+FL
Sbjct: 649 FFVLDLYSYFGKVSEKISIV---KESKRQNTVSLTGGLLEKVPSDTAVKLALKDLQLKFL 708
Query: 960 ESSSSIVEELPLVQFVGNNMFIKVSHRTLGGAVAISSTVRWDNVEVDCVDTDGNIAYDNG 1019
ESS + +++PLVQF+G ++ +KV+HRTLGGA+A+SS + W+N+EVDCVDTD ++N
Sbjct: 709 ESSFTSTQDMPLVQFLGKDLSVKVTHRTLGGAIAVSSNIYWENIEVDCVDTDVEHEHENS 768
Query: 1020 TVSTSIENGSLMNGNGLSQLRAILWV----HNKGDRFT-TPFLDVSIVHVIPLNERDMEC 1079
NG L++ NG + LR + WV H++ T TPFLD+SI HVIPL+E+DMEC
Sbjct: 769 W------NGHLVSCNGSTPLRRVFWVVNGRHDEHSGSTLTPFLDISITHVIPLSEKDMEC 828
Query: 1080 HSLNVSACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTS 1139
HS+++ ACI+GVRL GGM+YAEALLHRFGIL DGGPG+GL +GL++L +GP+ KLFK S
Sbjct: 829 HSVSIVACISGVRLGGGMSYAEALLHRFGILNHDGGPGEGLSRGLDHLSSGPMSKLFKAS 888
Query: 1140 PL-------LAGSLEGDGKESPLLQLGKPDDVDVSIELKNWLFALEGAEEMAERWWFYNP 1199
+ G+ GDG LG+PDD+DVS+EL++WLFALEG E + R N
Sbjct: 889 IVDDRKKDGTPGNWNGDG----FPHLGRPDDIDVSVELRDWLFALEGREGVGTR--ILNN 948
Query: 1200 NYAGREERCWHTSFQSFRVKAQSRPKDL-LNGKGSSCGTQQYPVELVIVSVEGLQTLKPQ 1259
GREERCWHT+F++FRV A+S PK++ NG + C +YPV+ +IVSVEGLQT+KPQ
Sbjct: 949 EDIGREERCWHTNFRTFRVIAKSTPKNVDSNGTENQCDAHKYPVDSIIVSVEGLQTVKPQ 1008
Query: 1260 VQKNTHH-NVSLLNGANETVEPLGGINLEARMVVSED-DVDVEMANWIMENLKFSVKHPI 1319
+QK T N NG +E + GG+N+EA +V SED V ++ NW+ E+LKFSVK P+
Sbjct: 1009 MQKGTDSCNGLSTNGVHENGQMHGGVNIEANIVASEDKSVHDDLLNWVAESLKFSVKQPV 1068
Query: 1320 EAVVTKNELQHLALLFKSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIF 1379
EAVVTK+ELQHL L KSE+D+MGRI AG+LR+LKLE SIGQATL+QLSNLGSE DK+F
Sbjct: 1069 EAVVTKDELQHLTFLCKSEIDAMGRIVAGVLRVLKLEESIGQATLNQLSNLGSEGFDKMF 1128
Query: 1380 TPEKLSRGSSAASLGISPSAYLIGESPHRPTVESTVTSLEQAVLDSQSKCTSLMTGLSSS 1439
+P K SR S S + S + E R +EST++S+E+A ++ ++KC++L++ L+ S
Sbjct: 1129 SP-KASRAGSPKSSPFAASLDSMREISLRANLESTISSIEEASMELEAKCSALVSDLNDS 1188
Query: 1440 DSSLHVATIKQLYEKLDSMQTLLSRLRNQI 1454
+SS A +L +KL+S+Q+L+++LR QI
Sbjct: 1189 ESSAKHA--NELKQKLESLQSLMAKLRTQI 1199
BLAST of HG10011577 vs. TAIR 10
Match:
AT3G20720.1 (unknown protein; Has 184 Blast hits to 181 proteins in 66 species: Archae - 0; Bacteria - 2; Metazoa - 137; Fungi - 0; Plants - 40; Viruses - 0; Other Eukaryotes - 5 (source: NCBI BLink). )
HSP 1 Score: 807.7 bits (2085), Expect = 1.5e-233
Identity = 441/863 (51.10%), Postives = 590/863 (68.37%), Query Frame = 0
Query: 600 ASVSDGQNDNNLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFAKNFCPPIY 659
A VSDG++ N LT+++IGGLFLRD FSR PC L+QP+M+A +D L +P+FAKNFCP IY
Sbjct: 332 ACVSDGESANYLTKILIGGLFLRDAFSRSPCALIQPSMKAAAED-LAIPDFAKNFCPLIY 391
Query: 660 PFKDKQWGLSGSVPLLCLHSVQVKPSPVPPSFATQTVIHCQPLTIHLQEKSCLRISSFLA 719
P W + VPL+ LHS+QVKPSP PP F ++TVI CQPL +HLQE++CLRISSFLA
Sbjct: 392 PLDSGPWQIVQDVPLISLHSLQVKPSPKPPHFFSKTVIQCQPLMVHLQEEACLRISSFLA 451
Query: 720 DGIVVNPGSVLPDFSISSIILSLKELDVTVPLDVAKSTDYHSSWDGISQSTFDGARLHIK 779
DGIVVNPG VLPD S++S++ +LKELDV+VPLD++ D D + +F GARLHI+
Sbjct: 452 DGIVVNPGDVLPDNSVNSLLFTLKELDVSVPLDMSNLQDSAIEEDLSVKKSFVGARLHIE 511
Query: 780 NMQFSESPSLNLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETY-NKVSGS 839
N+ F+ESP+L +RLLNL+KDPACF LW GQP+DASQKKW S SL+LET N
Sbjct: 512 NLSFAESPTLKVRLLNLEKDPACFCLWPGQPIDASQKKWTAGASHFSLALETSPNSTQLQ 571
Query: 840 KSSDAILALLRCVELTEVSIEVAMATADGKTLTAVPPPGGVVRVGVSCQQYLSNTSVDQL 899
+ L CVE +VSIEVAM +ADGK L +PPPGG+VR+GV+C+QY+S SV+QL
Sbjct: 572 SPRGPEMGLWNCVEGKDVSIEVAMVSADGKPLITIPPPGGIVRIGVACEQYISRASVEQL 631
Query: 900 FFVLDLYAYFGRVTEKIALVGKKNQPKESRSNLLVGKLVDKVPSDTAVSLLLRNLQLRFL 959
FFVLDLY+YFG+V+EKI++V + K + L G L++KVPSDTAV L L++LQL+FL
Sbjct: 632 FFVLDLYSYFGKVSEKISIV---KESKRQNTVSLTGGLLEKVPSDTAVKLALKDLQLKFL 691
Query: 960 ESSSSIVEELPLVQFVGNNMFIKVSHRTLGGAVAISSTVRWDNVEVDCVDTDGNIAYDNG 1019
ESS + +++PLVQF+G ++ +KV+HRTLGGA+A+SS + W+N+EVDCVDTD ++N
Sbjct: 692 ESSFTSTQDMPLVQFLGKDLSVKVTHRTLGGAIAVSSNIYWENIEVDCVDTDVEHEHENS 751
Query: 1020 TVSTSIENGSLMNGNGLSQLRAILWV----HNKGDRFT-TPFLDVSIVHVIPLNERDMEC 1079
NG L++ NG + LR + WV H++ T TPFLD+SI HVIPL+E+DMEC
Sbjct: 752 W------NGHLVSCNGSTPLRRVFWVVNGRHDEHSGSTLTPFLDISITHVIPLSEKDMEC 811
Query: 1080 HSLNVSACIAGVRLSGGMNYAEALLHRFGILGPDGGPGKGLMKGLENLRAGPLVKLFKTS 1139
HS+++ A G P
Sbjct: 812 HSVSIVAY--------------------------GTP----------------------- 871
Query: 1140 PLLAGSLEGDGKESPLLQLGKPDDVDVSIELKNWLFALEGAEEMAERWWFYNPNYAGREE 1199
G+ GDG LG+PDD+DVS+EL++WLFALEG E + R N GREE
Sbjct: 872 ----GNWNGDG----FPHLGRPDDIDVSVELRDWLFALEGREGVGTR--ILNNEDIGREE 931
Query: 1200 RCWHTSFQSFRVKAQSRPKDL-LNGKGSSCGTQQYPVELVIVSVEGLQTLKPQVQKNTHH 1259
RCWHT+F++FRV A+S PK++ NG + C +YPV+ +IVSVEGLQT+KPQ+QK T
Sbjct: 932 RCWHTNFRTFRVIAKSTPKNVDSNGTENQCDAHKYPVDSIIVSVEGLQTVKPQMQKGTDS 991
Query: 1260 -NVSLLNGANETVEPLGGINLEARMVVSED-DVDVEMANWIMENLKFSVKHPIEAVVTKN 1319
N NG +E + GG+N+EA +V SED V ++ NW+ E+LKFSVK P+EAVVTK+
Sbjct: 992 CNGLSTNGVHENGQMHGGVNIEANIVASEDKSVHDDLLNWVAESLKFSVKQPVEAVVTKD 1051
Query: 1320 ELQHLALLFKSEVDSMGRIAAGILRLLKLEGSIGQATLDQLSNLGSESIDKIFTPEKLSR 1379
ELQHL L KSE+D+MGRI AG+LR+LKLE SIGQATL+QLSNLGSE DK+F+P K SR
Sbjct: 1052 ELQHLTFLCKSEIDAMGRIVAGVLRVLKLEESIGQATLNQLSNLGSEGFDKMFSP-KASR 1111
Query: 1380 GSSAASLGISPSAYLIGESPHRPTVESTVTSLEQAVLDSQSKCTSLMTGLSSSDSSLHVA 1439
S S + S + E R +EST++S+E+A ++ ++KC++L++ L+ S+SS A
Sbjct: 1112 AGSPKSSPFAASLDSMREISLRANLESTISSIEEASMELEAKCSALVSDLNDSESSAKHA 1122
Query: 1440 TIKQLYEKLDSMQTLLSRLRNQI 1454
+L +KL+S+Q+L+++LR QI
Sbjct: 1172 --NELKQKLESLQSLMAKLRTQI 1122
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038904052.1 | 0.0e+00 | 94.61 | uncharacterized protein LOC120090451 isoform X2 [Benincasa hispida] | [more] |
XP_038904051.1 | 0.0e+00 | 94.61 | uncharacterized protein LOC120090451 isoform X1 [Benincasa hispida] | [more] |
XP_004152911.1 | 0.0e+00 | 93.79 | uncharacterized protein LOC101210396 isoform X1 [Cucumis sativus] >KGN56161.1 hy... | [more] |
XP_031738234.1 | 0.0e+00 | 93.79 | uncharacterized protein LOC101210396 isoform X2 [Cucumis sativus] | [more] |
KAA0025451.1 | 0.0e+00 | 93.79 | Chorein_N domain-containing protein [Cucumis melo var. makuwa] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A0A0L7Q7 | 0.0e+00 | 93.79 | Chorein_N domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G081370 P... | [more] |
A0A5A7SMI5 | 0.0e+00 | 93.79 | Chorein_N domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6... | [more] |
A0A1S3CJR3 | 0.0e+00 | 93.68 | uncharacterized protein LOC103501618 OS=Cucumis melo OX=3656 GN=LOC103501618 PE=... | [more] |
A0A6J1FP42 | 0.0e+00 | 91.81 | uncharacterized protein LOC111447221 OS=Cucurbita moschata OX=3662 GN=LOC1114472... | [more] |
A0A6J1IS31 | 0.0e+00 | 91.81 | uncharacterized protein LOC111477917 OS=Cucurbita maxima OX=3661 GN=LOC111477917... | [more] |
Match Name | E-value | Identity | Description | |
AT3G20720.2 | 4.0e-263 | 54.83 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
AT3G20720.1 | 1.5e-233 | 51.10 | unknown protein; Has 184 Blast hits to 181 proteins in 66 species: Archae - 0; B... | [more] |