Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: utr5polypeptideCDSutr3
Hold the cursor over a type above to highlight its positions in the sequence below.
TTATTTTTTGTAAGTTAGCCTAAAAGATACCGTTGAAAAATATGGCAATACAATAAAGCAGTGGAATACATGAGTAAATTATCTTTAAAATGGAAAAAAATAACAATAATAATTACAAAAAAGGTGATTTTGTGTAAAATTGGCGGACTTGGCATAAAAGTTTTTGTCATTTTTTCTCTTCTTTTGCATTTGTCTGCTTCTAAATGGAAGCCAAGCCATGGCAGTGACCGCAATTCACATCTGAGTTTTCACATCTCCATTCTTCGTTTCTTCTCATCAATCCTCGGAAAAAAATTTCCCATCACGTTTCCGAGAAAAAAGTTCAGAGCACTGAATCATCTGGATGAGACGAAGTTCATCTTCGGAGATCGACGACAATGGGAGTGCAAATGTCGTCCCCGGCGTTCACTCGATTCGCGATCGTTTTCCTTTCAAGCGGAATTCTAGTCACTTCCGTTTGCGAGTCAAGGACTCATTGGATCATGCAGCCTCCCGCTCCCGATCTCACCAGAGCCGGATCAATCGCAAGGGCTTCCTCTGGTGGCTCCCGGCTAGAGGGAATACGCTTTTCTACTGTGTTGTCATTTTTGCGGTCTTCGCCTTTGTTACCGGCTCTGTGATGTTGCAGAGCTCGATTACTTTGATGTCCAGCCATGGAAGTGAAAAGGGGCGGTGGCTTATGGAGCGTATTAAGTTTGGGAGCTCGCTGAAGTTTGTGCCGGGGAGGATTTCGAGGAGGCTGGTGGAAGGTGATGGGCTCGACGAGGCGCGGAAGAAGGACCGCGTTGGCGTTCGTGCACCGAGGCTTGCTCTAGTGAGTTCTTCAAGTTTTTGATATCTGTTTGAAACTGTTTGCGTTCAAGTCACTGATTAGCTATCAGTACTTAGCATATGCCACTTAGTTGATTGCGTTTCTTATTGGTTACGTGCACGAACACTCAAGTCTCACGAATGGAAGATTTACCAAAGCTAACTTGGGGCTTATTTCTGTAGATTCCTATTAGTTGGAACCTTCCTTGCACACCCTCTTCCCCTTCCCGAACTTTTATGTGAACTCCGCGTTGTGTTTATTTATTTAGATCAATCCACCTTTCCTATCTCACATTAGTTAAATTATGTGAAGTAGATTAACTTAGAACCATTTGTTTGGATTCTGACCACCCCTCGGTGGCAGTAGTATATCATCCATTTGAAGGATATCTTCAAGTTCTAAGGTTTTGTATAATTTTTGAAAATGAGTTTATGGAAACATCGGAAACAAGATAAATAAAATTGGTCAAGGAGACTTGTTACAGTGTAGGAAGGCGTATGCTTGCAATAACCACAAAAGTTTTCATAATACTGTGGTCCTTTAAAAGCATTGGATATTTGGATCTTGAAATTTTCATAATCGACTTGACTTTAATTTCCATTTGATTACTTTAGATGGTTGATTATCATAATCTTTATATGACCCAGTTAACCAAATTATTTTAAAATATTTATTTTTCTACATTTAAATTTAAACTGATAGTTTATTATATTGATTAAAGCTCTTTGGTTTTATGAAAAATTTTCTCTGAAAAATTAACATAATAACATTTAGAAACAGAGAGAGTGTTTTTAACATTTTTCATAATTTCACAAGCTTTTGTAAAACTGTATCAAAATTCGAACAGAACTATTGTGATTCTTCTTAATTTTTTGTTTGTTAAGAAACCAATATGTTAGGCTACCAATAATAAAACGTACCTTAGGAGGGGTAAAAGAGGGAATGTACCAAGAGTTAATCAAGAAATTGTACATAATGAGCATATATTTTTGGGGTTGAGATATGGTGAGGTAAGCAGAATTTTGGTTAGGGAATAGTAGGAGAGAAGCTGGTGCTCTCGGACTGGCCCAAAGTTTGTAGTTGAGCTCGTAGCTCTTTGTGATTGCTCTCACTCGTTTCTGTGAATACAATACCCATCAAGATTGTTCCATTCTTCTCCATCCAAAAAACCCAATTAATGGCTTTAATGGCACTGCTCCATAATCTTTGGCTTCATTCTTAAAGAAGTGCACACGCAGAAGTTTAATGATGTCCTGTTTAAAATCTTTATGAAAACACTAATGTATTCCATAGTATCCTTGGGAGTTGGAGCTTTTTTCGGTTAAGTCCATGCTCGTAAGAAGTTGACGAACCCTTACCACTGGTTTGGTATAAGATTCTTTGGGAAGATAAGTATTAGTATCCAAAGAAAGTTAAGGTCTTCTTTTGGTCCCTTTATCTTGGTGGTATTAGTACAAAAGATAAAGTGCAACCAGAAATCCCCAACCAGTTGCTTCAACCAAATTTGCATGTCTACTCTAATCTAAATACTCCTCTGAAGATGCATCACATCGAAACATAATTTTTAATGAAAAGTGAAATGAAAGGAAAGGGAACCGAAGCTTGTCAAATGATTGACGGAAGTGAAATGAAGTATTTTTGTAACTAAAAGATTCGAGTAATTTAAAAAAAATAAGAAATGAAGTTCATTGAACAAATAATTTAAACGAGAAGCTAAATTCCGGGGAACAGTCAAGGAGTCAAAAGAGGATCTCCAATTGAATACAATCATAGAAATATAATAATTATAAGAAGGTCTAAAAAGACGAGCCAAAGAGGAGGCAGAAAAAGCTTCGAGCAATATTTTCTTCTTATTATTTCATTACTGACTGGTACTGTTCTGACTGGTACTGATTATTCAATGAAATATAATTTTAAACTAAGTACTAGTTCATGAAAAATTCTATGACCTATTTAGAATCTTAGTCCTTGTGGTAGAGATTACCCCTGTGGTCTCGTACTTGTGCTCTTACGTGCATTAATATGGTTGTTATGATTATGGATTAATTTTAGACTTGATTGTCAAAAAGTTTGAATCGGAATGCATTCTCAATGTCATGGTTATAAGCAAATGCTGAATTCTGATAACATAAGAATTTAACTCATATGCATGGCTTCCCAGTTTAATATATACTGCGTAAGAACAGTATCTGGATCTAGATACATCAAGGAATTAAGTTATGTACCAATCTTCTTCTTTTTCCTTTTTCCTTTTTCCTTTTTCTCTTTTCCTTTTTCCTTATGCTTTCATTGGTACCAGATCTTAGGAAGCACGGAGAGAGATCCACAATCATTAATGTTGGTTACTGTGATGAAGAACATTCAGAAACTTGGATATGTGCTTGAGGTGAGTTATTGGAATTTATTTGTTGTCATTCATGTAAAAAAGATATTTTATTGGGATTTTTATTTTTCATTTTATGAGAGTAGAAGGTGTAAAGTTATCTTCTTTTGTCCACTGTTCTAGTAGTATGTTTCCAGAATTTATATATGGTGCATGGATTTCTTGTAGGTTTATGTAGTTATTTATGATTGGTGTTTTAATTTCTGGCTACAATTTTTTTTGGATCCTCATTAAGGCGTATCTGACAGATTTTTGCAGTAGAGGGTGGAAATAAACATTCAATGTGGAAACAGATAGGTGGCCAGCCTTCAATATTAAGTCCAGAGCATTATGGTCATGTTGATTGGTCAATGTACATGCTCCTTTTGTTCGTTAAACAGTTTTCTTACCATCCTTTTTCTTTATTTGTTCTCTGGTTGATATTTTTTTCGTTGCCTTTCAAGAACTTTTTGGGGACTACTTTTCAAAGATATCCTTGGCAGAAATGACTGATTCTACTTTCATATATATATTGTTTCAGATATGATGGTATTGTTGCTGACTCCTTGGAAGCAGAGGGGGCCATAGCAAGGTTTTTTTTCTTTATAGCATAAAGTACCTAGTGAAATCTTTAAGTATGTAGAGTAATCTTTATTTATACTTGCAATGAGGTGAATGGTTCTTTGTCTAACTGTATTTATGATGTGCTTTATGAACTTATCTGCATTAGGCGTTACAATAGATTATGGGCATAGAAGCAAGTGCAAACTATTATCGTTGGAGAGTTGGAGTTGCTCTATCTCTTCCTAACTAGATTTTTTGTTAAGTTTTTTCATAAAAGATATGAAAAACATACCATAATGCTTTAGATTAACAGGTAATTCATTGTTGTTAAGCATTTGATTTCGTATGTCTCATGTTCATTGGAACATAAATTATTGGAGCTTTAACATGTTCCTTTAAATATGAATTACAAAATTTTAAGAGGCTTGTATTCTACAAATTCCATCATAAAAGTGGAACATTTAGCTCCAAGGTTGGGCAGTTTATCTAGATATTAAGGCTGCCTCCAAGGGTTGGTAATTGATTTATACTTAGGGCTGCCGGATAGGTTGTTTTCCAGACTTGTTTATAGAAATGTGAAACAGGTGGCTTACAAAGATTGAGTGTCCTATATGTTGAGGCCTTTGATTTTTGGAGAAGTAACAATTGGAGAAGATCTCAAACACAAGTCTCTAAGAGGGAAATACAACTTTCAAAATGTAATCTCCCAAATTTGCTTTCTCTCGTCTTTTGTTATATATATAAAAAAACAATTTATAAATTCAACTCCAGAGTACCGTTTGGTAGTTTGATTTAAAGACCTATTTTATATAGATGTGATCAAGGGTGACTTGTAAAGAACGAGTTTACTTACTGTGTACTTGAAATAACTGTTCAAAAACTTGTAGCCTTCACAGTATAATTTTAGATGTTTTGAAAAAAGCAGGCTATCCATGAGAACAAACCAAGAGAACAACTTTTAGAAATAGTTACCAAACATGTTTTCATACGCCTTTTTTTTTGTTTTGTTATAGAAATTTTTAACATTGACTCCCGGACAATTGGTCAAGTGTTGTGGGGGATTGAATCTAGTGAATATGAAGGGTTAATACATTTCTACATTTAAGAGATTGATGAAAATCTAACCATTCTTACCAAAAAAAAAAAAAAGAAAAAATGAAAATCTAACCATAATTAATGCTTAATTCTAATAAGAACCTATAAAAATAGTGCCTATGTTTTTTCTTGGGACGAGAAACTTTATTCTTCTTCCTTTTTTTGTTTTGTTTTGTTTTTGTTTTTGTTTTTTTTTTTAAATATAAGAGCCTATCAAATTCAGGGAAGTTAATTTTGGATGCATGGAATGTCAGTGGTTCTGATTTGACTTGGAGGAATTAGTTAGAGTTTCAAAGAAACTATACATTTTAGTTTATCTTGTTAATATGGAGAAACTTATGTTGGGGTTGTGAACATTCTTTATGTTTAGATAGTGAGCATCTCACATGTAAAAGAAACTGTGATTAATATAGGTAAGTGCATGGGCGTGCATGTGCAGATAATAATACTCGTCTTTCTGAGTTATGTTGTTTTCATGTGCTTAAAGCATGTGATAATCATGTAACATATTGTTTTAGACTTTAGTTTGAGAATTGAGATACTATGATTAGTGAGGTTTATATTTATGCAGCCTTATGCAGGAACCTTTTTGTTCTATACCACTCGTATGGATAATTCGAGAAGATACACTTGCCAACCGCTTGCCGATGTACGAACAAAGGGGCTGGAAGCATCTTATTTCACATTGGAAGAGTTCTTTCAGAAGGGCCAATGTTGTCGTGTTCCCTGATTTTGCCCTCCCAGTAAATCTTCTTTTCATCTCTATTTGTTATTGGATGTTACGATTATTTATAGTTAATCTTGTATTCAACATTTCTGTTTTTTTCATTTATCAATAAAATCTCTTTATTGTTTAAAAAAAAAAAACACTCTCATGCCATTGCCATTTCCTTACCATATCAGATGATGTATAGCACTTTGGACAATGGAAACTTCTATGTGATTCCTGGATCCCCAGCAGATGTTTATGCTGCAGAAAACTTCAAAAATGTTCACTCCAAAAGTCAATTAAGAGAGAAAAATGGATTTAATGAAGATGATATACTGGTTCTAGTTGTTGGAAGTTTGTTCTTCCCAAATGAGCTGTCATGGGACTATGCTGTGGCAATGCATAGCATTGGACCTCTACTCACAAATTATGCAAGGCAAGAAGTAGGAGGGTCGTTTAAATTTGTTTTCCTATGTTGCAATTCAACTGATGGATCCCATGATGCTTTACAGGTATATTTAATAATTATGCTTCTGCATTTAGATTTATTTATTTGTTATTTTATATATATATATATGCATTATGCTCATGTCCCACTCTAAAGTCAGCTCTGGGAGATAACAAGCTTTCAGTTGACAAGTTTTTGGGGACATTGTGGCTTTTCTTCCAGTGTGTGTATGGTATTCATATTCTATACGGATGTTTAGATTTAAGAGAATATTTATCTAGTTCCTCAACTCTTTTATTATGTATCTGAAAATTATAACTTCATTCTTCTTTGACCATTGGGTGATTCATTGCCACTTACATCCTCCATATCGAAAGCACGTCAAGATTATCCTCGCTTCATTTTGTTGCAAATATGCATATACCATTCGTGTGTATGCAACTTGACTATGAAATTCCAGAATATTCCAGAATAACTTTTAATTTTTTAATGAAAGTTATGCTTGTGGTTTGTTTATTCATTTCCTTTTTATGGTCTTTAGCTTTTTGTGGTGAGGAGAAAAGGAGCAAGTAAAATATGAAGGAAAAAAAACTAGACATGGTTATTTTTCAAAGATGGAAATGGATTGATTGCCCTCTTTGTCTAATTATTCTTAATCAAGGTAAGGTTAGGCAGGATCTGGGTGACTACTGAATAGATGAAGTATTTCATGGATTTACATATAATTCCAAAAGCCTAGCTTTGAATTCTCTTCATATGACCATACACAAACTAAATTATGTGAAGCCAATCATTTGCTTGCTTACTTTTTAAATGCTTCACACATGGAAAAAGTTGCTTTCTTACCTGGGTGCTCTATATTTCCTCTTGTATGTAAAAATATATTGATTTTGTGGGGATTGTTTATCTTCTTATGTACAATTCCTTTGTTTTTTTTTTTGTTTTTTTTTTCATCTTACTTAGTTATGCCAAATGATTTTGAATATATTTTCTTGTGTTTCTCATTAAAAAAAAAAAGTTATGCCAAATGATTGTTATTGGCATCATTATTCCTGATCTTTGTATTGGTATCTTTTATATTGTTATGTATACTATTGTTTTGTACCTGATTTTTCTTTGAAAAAAAAGGAAGATATTTTGTACCTGTATACTCCTTTTGTTTTGTACCTGACAATGGATATATCTTTAACCTACGTGCAGGAGATTGCGTCACGTCTAGGACTTCCTGATGGTTCTATAACACATTATGGCTTAAATGGAGATGTCAATGGTGTGCTGATGATGGCTGACATTGTGCTTTATGGATCTTCACAAGAAATACAGAGTTTTCCACTTCTACTCATCCGAGCCATGTCCTTTGGAATCCCAATCATGGTGCCTGATTTACCTGCCTTGAGAAATTATGTAAGTTTGTATGAGATTTGCTGCCATGATGTAACAGCACGACTCCACCTCCACCCTGTTTTATCTTTCCCCTTCTTGGGTTGTTTCTTTTGCTAAAGCATTTTCCCTGGTTCTCAGATCGTTGATGGTGTCCATGGAATTATCTTCCCAAAACATGATTCTGATGCTTTATTGAGAGCTTTCTCAGAGATGATATCAGATGGGAAGCTCTCCAGATATGCACAAGCAATAGCTTCTTCTGGAAGATTGCTTGCTAAGAATATACTTGCATCAGAATGCGTTACCAGTTACGCACGGCTCCTAGAGAATGTTCTGAATTTCCCATCAGATGTTAAGCTTCCAGGCTCTTTCTCTCAGCTTCAACTAGGGGCATGGGAATGGAATTTGTTCAGGAAGGAAACGGTACAAACAATTGACGGAAATGAAGATGCTGAAGAGGGGATTACAGCAATAAGTAAATCCAGTGTTATTTTTGCTCTTGAAGCACAATTAACTAATTTTGTTAACTTAACAAATTTTTCTGAGACTGGAAATGGGACTCTGGAGCAAGATCTTCCAACTCCACAAGACTGGGATATTTTGGAGGAAATACAAAATGCTGAAGAGTATGAAACTGTTGAAATGGAAGAGGTATGTGGTACTTCTTGCAATTGGATTACTTGCTCGATCCATTGCCTTTCCCTGTTTCTGAACTTCCTTTTTCACACATCCTTCTCAAATGGATAGAATCATCTTCCTTTTCTGTAGTTTCAAGAAAGAATGGAGAGGGATCTAGGTGCATGGGATGAAATATATCGCAATGCCCGGAAATCAGAAAAGCTCAAGTTTGAAGCCAATGAACGAGATGAGGGGGAGCTTGAAAGGACAGGACAGCCTGTATCCATTTATGAGATATACAGTGGTGCTGGAGCTTGGCCATTCATGCACCATGGTTCTTTTTACCGTGGACTAAGTCTTGTATGCTCCATCCATCTTCTGTTTATCATGTGATACTTTCCATTTAGTTTAACTCGTTTTTGTTTTCTTAGTCAATTGCATAAGAATTGCTTTCACTCATCTGGTTTTCTAGAACTACTCCGAACTTTTTACTTGTAGTTATATTTCTGTTGGAAATTCATCCTTGTAGTTACTTCATTTAGTTTGAAGTTTTACTCATTGGATGCTTTAGTTCTTGAATCTTTCCTGAAAAGTTTGCAAGGTGCTGTGGAATAACTCCCTCCCTTCCTCCTTGAGTAGTTGAATCTATAAGGTTTACAAAGAAAATTTGGAATTTGATACTTTTGCTTGTCATTCATTTCTTAAGGGCAGCCGTGAATCTTTTTACGTTGTTTTGCTTGAGTTTGTAAAGAATTTATCCTAAATTTTATGGCAGTCCACAAGAGCACTGAGGTTAAAATCTGATGATGTCAATGCTGTGGGACGACTTCCTCTTCTGAATGACTCTTACCATATGGATATTCTCTGTGAGATCGGAGGAATGTTTGCCATTGCAAATAAGATTGATAACATTCATAAGAGACCTTGGATTGGGTTCCAATCATGGCGGGCTTCTGGCAGAAAGGTATTTATTTACTGGTTGACTATTACAAAGTTTGCAGTTTTGCAAAGTCCCCCCTCCTTATTTTTTCCATTTTAAGGAAATTTTTGCTTCCCCCTCCTTGGGTGGATAGGGTGCCCTCAAGGCTACAAATATTATTCAGCCCTTCATCTTTAACAGGTTTCCTTGTGCACAAAAGCTGAAAATGTTTTGGAAGAGACTATACGGGACAAGCCTAAAGGAGATGTTATATACTTCTGGGCACACTTCCACGTGAATGGTGGAATCATAGGGAGCAGTAATGCACCCACTTTCTGGTCAGCGTGTGATATCTTGAACGGTGGGCTCTGCAGGTAATTTTACCATGTCTGGTATGAAATACCTGTAATCTAAGCAAGTTGGGATGCATGCTCTGTACTGGGATGCGTTATTTGTTTTCATCCTACTTTTAAATGTTTACTTTTCTAGTGCTTTCTATTGTTTTTGAGTCATGAAATTGTGAATGATGATCTTCGTTACCCATGTTCCTACATTGTAACGTTTTTAGTTTCTGGACGTCATTTCAAAAAGATGTTAAAAAATCATATATGAAACAGTGATTCCTGCTTAAGTCTCCGTTGTGGATGTGCATACACTAATAAATGGCCTGAAATTATATAGTAAATGTAAATAAAACATGCTGAGAATGTGTTAAAGCGTGGTCTTATCTATTTATTTTCTTATTTATGGACATTTCTATGAATTGAACTTGCCCCTAGCTATAGTTGTGATTTATGAAGTAGCTATTCAAACTTGCCTTATTTTACTTTAAATGAGTGCAGAACCGCCTTCGAAAACACCTTTCGTGAGATGTATGGATTGTCAGCAAATATGGAAGCTCTTCCTCCTATGCCAGAAGATGGCGGTTGCTGGTCTGCCCTCCATAGCTGGGTGATGCCAACCCCATCCTTCTTGGAGTTCATGATGTTTTCCAGGTATTTTCCTCCAGTTCATAAGCCATGATGCATATACAGTAATTTTTCTTCATTATTTGGATATTGTGTCAATATAAAGGAGGCAATCCCCTTAAATTGCAATTTGAATTGGGAACTACAGGATGTTCACCCATTACCTTGATGCTCTCAATAGAAATCAGAGTCAACCATATGGATGTATGTTGGCTTCCTCAGAGCTTGAGGTAAGGATTTGAGTCCCAATCCATTTCCCTTCATATGATTCTCTCTCTCAACTCCTGTTAGTTTCTATGATGAGCAGAAAAAACATTGTTACTGTCGGATCTCGGAAATCCTGGTCAATGTCTGGGCTTACCACAGTGGACGGAGGATGGTTTACATTGATCCCCACTCGGGTTTCCTAGAAGAACAACATCCAGTTGAGCAACGCATGGAATTTATGTGGGCAAAGTATTTCAACCTCACGTTGTTGAAGAGTATGGATGAAGATTTAGCAGAAGTTGCCGACGACGAAGGCGGTTCGAATAAAATGGGGCTGTGGCCATTAACAGGGGAAGTGCACTGGCAAGGGATCTATGAAAGAGCGAGAGAAGAAAGGTATAGGCTGAAGATGGACAAGAAGAGAACTACAAAAGTAAAGTTAATTGAGAGGATGAAATTTGGATACAAGCAAAAATCACTTGGAGGATGAGAAACTGGAATAGTTCTGAACCACAATTAGCCTGCTCATTCTGAAAGCGCTTCTGACTGCTAACTTGGCTAGTAGTAGAGATTGAAAGAAAGAGCCTCTCCCCCTCCTCTTACGATTTGGTACGTAAAAGAATTCGAGTTCTTTCCGAATTATTTCTCACTCTACAGAGCATTTCTAGGCAACAGAACTGAGGAAAGACAAAGCTGAAACTGCAGCTCGGTTATTGGAGGCATTACCAGAGAATACAAGGTTATTTTGGCAGACAGAGGTAACTAAATATTCTTTGAAATTCTTTCTTCTATGTTCAACATACCTAGTATAGCAACAAGATAGGCAAATTCCTTTGCTGTTGTAGTACAACCAAAATAGCAATTATTAAGCCAATTTTGGTCACACGTAATAATAGATGCTTTCCACTTTCAGATATATTGTTGTCCTC
mRNA sequence
TTATTTTTTGTAAGTTAGCCTAAAAGATACCGTTGAAAAATATGGCAATACAATAAAGCAGTGGAATACATGAGTAAATTATCTTTAAAATGGAAAAAAATAACAATAATAATTACAAAAAAGGTGATTTTGTGTAAAATTGGCGGACTTGGCATAAAAGTTTTTGTCATTTTTTCTCTTCTTTTGCATTTGTCTGCTTCTAAATGGAAGCCAAGCCATGGCAGTGACCGCAATTCACATCTGAGTTTTCACATCTCCATTCTTCGTTTCTTCTCATCAATCCTCGGAAAAAAATTTCCCATCACGTTTCCGAGAAAAAAGTTCAGAGCACTGAATCATCTGGATGAGACGAAGTTCATCTTCGGAGATCGACGACAATGGGAGTGCAAATGTCGTCCCCGGCGTTCACTCGATTCGCGATCGTTTTCCTTTCAAGCGGAATTCTAGTCACTTCCGTTTGCGAGTCAAGGACTCATTGGATCATGCAGCCTCCCGCTCCCGATCTCACCAGAGCCGGATCAATCGCAAGGGCTTCCTCTGGTGGCTCCCGGCTAGAGGGAATACGCTTTTCTACTGTGTTGTCATTTTTGCGGTCTTCGCCTTTGTTACCGGCTCTGTGATGTTGCAGAGCTCGATTACTTTGATGTCCAGCCATGGAAGTGAAAAGGGGCGGTGGCTTATGGAGCGTATTAAGTTTGGGAGCTCGCTGAAGTTTGTGCCGGGGAGGATTTCGAGGAGGCTGGTGGAAGGTGATGGGCTCGACGAGGCGCGGAAGAAGGACCGCGTTGGCGTTCGTGCACCGAGGCTTGCTCTAATCTTAGGAAGCACGGAGAGAGATCCACAATCATTAATGTTGGTTACTGTGATGAAGAACATTCAGAAACTTGGATATGTGCTTGAGATTTTTGCAGTAGAGGGTGGAAATAAACATTCAATGTGGAAACAGATAGGTGGCCAGCCTTCAATATTAAGTCCAGAGCATTATGGTCATGTTGATTGGTCAATATATGATGGTATTGTTGCTGACTCCTTGGAAGCAGAGGGGGCCATAGCAAGCCTTATGCAGGAACCTTTTTGTTCTATACCACTCGTATGGATAATTCGAGAAGATACACTTGCCAACCGCTTGCCGATGTACGAACAAAGGGGCTGGAAGCATCTTATTTCACATTGGAAGAGTTCTTTCAGAAGGGCCAATGTTGTCGTGTTCCCTGATTTTGCCCTCCCAATGATGTATAGCACTTTGGACAATGGAAACTTCTATGTGATTCCTGGATCCCCAGCAGATGTTTATGCTGCAGAAAACTTCAAAAATGTTCACTCCAAAAGTCAATTAAGAGAGAAAAATGGATTTAATGAAGATGATATACTGGTTCTAGTTGTTGGAAGTTTGTTCTTCCCAAATGAGCTGTCATGGGACTATGCTGTGGCAATGCATAGCATTGGACCTCTACTCACAAATTATGCAAGGCAAGAAGTAGGAGGGTCGTTTAAATTTGTTTTCCTATGTTGCAATTCAACTGATGGATCCCATGATGCTTTACAGGAGATTGCGTCACGTCTAGGACTTCCTGATGGTTCTATAACACATTATGGCTTAAATGGAGATGTCAATGGTGTGCTGATGATGGCTGACATTGTGCTTTATGGATCTTCACAAGAAATACAGAGTTTTCCACTTCTACTCATCCGAGCCATGTCCTTTGGAATCCCAATCATGGTGCCTGATTTACCTGCCTTGAGAAATTATATCGTTGATGGTGTCCATGGAATTATCTTCCCAAAACATGATTCTGATGCTTTATTGAGAGCTTTCTCAGAGATGATATCAGATGGGAAGCTCTCCAGATATGCACAAGCAATAGCTTCTTCTGGAAGATTGCTTGCTAAGAATATACTTGCATCAGAATGCGTTACCAGTTACGCACGGCTCCTAGAGAATGTTCTGAATTTCCCATCAGATGTTAAGCTTCCAGGCTCTTTCTCTCAGCTTCAACTAGGGGCATGGGAATGGAATTTGTTCAGGAAGGAAACGGTACAAACAATTGACGGAAATGAAGATGCTGAAGAGGGGATTACAGCAATAAGTAAATCCAGTGTTATTTTTGCTCTTGAAGCACAATTAACTAATTTTGTTAACTTAACAAATTTTTCTGAGACTGGAAATGGGACTCTGGAGCAAGATCTTCCAACTCCACAAGACTGGGATATTTTGGAGGAAATACAAAATGCTGAAGAGTATGAAACTGTTGAAATGGAAGAGTTTCAAGAAAGAATGGAGAGGGATCTAGGTGCATGGGATGAAATATATCGCAATGCCCGGAAATCAGAAAAGCTCAAGTTTGAAGCCAATGAACGAGATGAGGGGGAGCTTGAAAGGACAGGACAGCCTGTATCCATTTATGAGATATACAGTGGTGCTGGAGCTTGGCCATTCATGCACCATGGTTCTTTTTACCGTGGACTAAGTCTTTCCACAAGAGCACTGAGGTTAAAATCTGATGATGTCAATGCTGTGGGACGACTTCCTCTTCTGAATGACTCTTACCATATGGATATTCTCTGTGAGATCGGAGGAATGTTTGCCATTGCAAATAAGATTGATAACATTCATAAGAGACCTTGGATTGGGTTCCAATCATGGCGGGCTTCTGGCAGAAAGGTTTCCTTGTGCACAAAAGCTGAAAATGTTTTGGAAGAGACTATACGGGACAAGCCTAAAGGAGATGTTATATACTTCTGGGCACACTTCCACGTGAATGGTGGAATCATAGGGAGCAGTAATGCACCCACTTTCTGGTCAGCGTGTGATATCTTGAACGGTGGGCTCTGCAGAACCGCCTTCGAAAACACCTTTCGTGAGATGTATGGATTGTCAGCAAATATGGAAGCTCTTCCTCCTATGCCAGAAGATGGCGGTTGCTGGTCTGCCCTCCATAGCTGGGTGATGCCAACCCCATCCTTCTTGGAGTTCATGATGTTTTCCAGGATGTTCACCCATTACCTTGATGCTCTCAATAGAAATCAGAGTCAACCATATGGATGTATGTTGGCTTCCTCAGAGCTTGAGAAAAAACATTGTTACTGTCGGATCTCGGAAATCCTGGTCAATGTCTGGGCTTACCACAGTGGACGGAGGATGGTTTACATTGATCCCCACTCGGGTTTCCTAGAAGAACAACATCCAGTTGAGCAACGCATGGAATTTATGTGGGCAAAGTATTTCAACCTCACGTTGTTGAAGAGTATGGATGAAGATTTAGCAGAAGTTGCCGACGACGAAGGCGGTTCGAATAAAATGGGGCTGTGGCCATTAACAGGGGAAGTGCACTGGCAAGGGATCTATGAAAGAGCGAGAGAAGAAAGGTATAGGCTGAAGATGGACAAGAAGAGAACTACAAAAGTAAAGTTAATTGAGAGGATGAAATTTGGATACAAGCAAAAATCACTTGGAGGATGAGAAACTGGAATAGTTCTGAACCACAATTAGCCTGCTCATTCTGAAAGCGCTTCTGACTGCTAACTTGGCTAGTAGTAGAGATTGAAAGAAAGAGCCTCTCCCCCTCCTCTTACGATTTGGTACGTAAAAGAATTCGAGTTCTTTCCGAATTATTTCTCACTCTACAGAGCATTTCTAGGCAACAGAACTGAGGAAAGACAAAGCTGAAACTGCAGCTCGGTTATTGGAGGCATTACCAGAGAATACAAGGTTATTTTGGCAGACAGAGGTAACTAAATATTCTTTGAAATTCTTTCTTCTATGTTCAACATACCTAGTATAGCAACAAGATAGGCAAATTCCTTTGCTGTTGTAGTACAACCAAAATAGCAATTATTAAGCCAATTTTGGTCACACGTAATAATAGATGCTTTCCACTTTCAGATATATTGTTGTCCTC
Coding sequence (CDS)
ATGAGACGAAGTTCATCTTCGGAGATCGACGACAATGGGAGTGCAAATGTCGTCCCCGGCGTTCACTCGATTCGCGATCGTTTTCCTTTCAAGCGGAATTCTAGTCACTTCCGTTTGCGAGTCAAGGACTCATTGGATCATGCAGCCTCCCGCTCCCGATCTCACCAGAGCCGGATCAATCGCAAGGGCTTCCTCTGGTGGCTCCCGGCTAGAGGGAATACGCTTTTCTACTGTGTTGTCATTTTTGCGGTCTTCGCCTTTGTTACCGGCTCTGTGATGTTGCAGAGCTCGATTACTTTGATGTCCAGCCATGGAAGTGAAAAGGGGCGGTGGCTTATGGAGCGTATTAAGTTTGGGAGCTCGCTGAAGTTTGTGCCGGGGAGGATTTCGAGGAGGCTGGTGGAAGGTGATGGGCTCGACGAGGCGCGGAAGAAGGACCGCGTTGGCGTTCGTGCACCGAGGCTTGCTCTAATCTTAGGAAGCACGGAGAGAGATCCACAATCATTAATGTTGGTTACTGTGATGAAGAACATTCAGAAACTTGGATATGTGCTTGAGATTTTTGCAGTAGAGGGTGGAAATAAACATTCAATGTGGAAACAGATAGGTGGCCAGCCTTCAATATTAAGTCCAGAGCATTATGGTCATGTTGATTGGTCAATATATGATGGTATTGTTGCTGACTCCTTGGAAGCAGAGGGGGCCATAGCAAGCCTTATGCAGGAACCTTTTTGTTCTATACCACTCGTATGGATAATTCGAGAAGATACACTTGCCAACCGCTTGCCGATGTACGAACAAAGGGGCTGGAAGCATCTTATTTCACATTGGAAGAGTTCTTTCAGAAGGGCCAATGTTGTCGTGTTCCCTGATTTTGCCCTCCCAATGATGTATAGCACTTTGGACAATGGAAACTTCTATGTGATTCCTGGATCCCCAGCAGATGTTTATGCTGCAGAAAACTTCAAAAATGTTCACTCCAAAAGTCAATTAAGAGAGAAAAATGGATTTAATGAAGATGATATACTGGTTCTAGTTGTTGGAAGTTTGTTCTTCCCAAATGAGCTGTCATGGGACTATGCTGTGGCAATGCATAGCATTGGACCTCTACTCACAAATTATGCAAGGCAAGAAGTAGGAGGGTCGTTTAAATTTGTTTTCCTATGTTGCAATTCAACTGATGGATCCCATGATGCTTTACAGGAGATTGCGTCACGTCTAGGACTTCCTGATGGTTCTATAACACATTATGGCTTAAATGGAGATGTCAATGGTGTGCTGATGATGGCTGACATTGTGCTTTATGGATCTTCACAAGAAATACAGAGTTTTCCACTTCTACTCATCCGAGCCATGTCCTTTGGAATCCCAATCATGGTGCCTGATTTACCTGCCTTGAGAAATTATATCGTTGATGGTGTCCATGGAATTATCTTCCCAAAACATGATTCTGATGCTTTATTGAGAGCTTTCTCAGAGATGATATCAGATGGGAAGCTCTCCAGATATGCACAAGCAATAGCTTCTTCTGGAAGATTGCTTGCTAAGAATATACTTGCATCAGAATGCGTTACCAGTTACGCACGGCTCCTAGAGAATGTTCTGAATTTCCCATCAGATGTTAAGCTTCCAGGCTCTTTCTCTCAGCTTCAACTAGGGGCATGGGAATGGAATTTGTTCAGGAAGGAAACGGTACAAACAATTGACGGAAATGAAGATGCTGAAGAGGGGATTACAGCAATAAGTAAATCCAGTGTTATTTTTGCTCTTGAAGCACAATTAACTAATTTTGTTAACTTAACAAATTTTTCTGAGACTGGAAATGGGACTCTGGAGCAAGATCTTCCAACTCCACAAGACTGGGATATTTTGGAGGAAATACAAAATGCTGAAGAGTATGAAACTGTTGAAATGGAAGAGTTTCAAGAAAGAATGGAGAGGGATCTAGGTGCATGGGATGAAATATATCGCAATGCCCGGAAATCAGAAAAGCTCAAGTTTGAAGCCAATGAACGAGATGAGGGGGAGCTTGAAAGGACAGGACAGCCTGTATCCATTTATGAGATATACAGTGGTGCTGGAGCTTGGCCATTCATGCACCATGGTTCTTTTTACCGTGGACTAAGTCTTTCCACAAGAGCACTGAGGTTAAAATCTGATGATGTCAATGCTGTGGGACGACTTCCTCTTCTGAATGACTCTTACCATATGGATATTCTCTGTGAGATCGGAGGAATGTTTGCCATTGCAAATAAGATTGATAACATTCATAAGAGACCTTGGATTGGGTTCCAATCATGGCGGGCTTCTGGCAGAAAGGTTTCCTTGTGCACAAAAGCTGAAAATGTTTTGGAAGAGACTATACGGGACAAGCCTAAAGGAGATGTTATATACTTCTGGGCACACTTCCACGTGAATGGTGGAATCATAGGGAGCAGTAATGCACCCACTTTCTGGTCAGCGTGTGATATCTTGAACGGTGGGCTCTGCAGAACCGCCTTCGAAAACACCTTTCGTGAGATGTATGGATTGTCAGCAAATATGGAAGCTCTTCCTCCTATGCCAGAAGATGGCGGTTGCTGGTCTGCCCTCCATAGCTGGGTGATGCCAACCCCATCCTTCTTGGAGTTCATGATGTTTTCCAGGATGTTCACCCATTACCTTGATGCTCTCAATAGAAATCAGAGTCAACCATATGGATGTATGTTGGCTTCCTCAGAGCTTGAGAAAAAACATTGTTACTGTCGGATCTCGGAAATCCTGGTCAATGTCTGGGCTTACCACAGTGGACGGAGGATGGTTTACATTGATCCCCACTCGGGTTTCCTAGAAGAACAACATCCAGTTGAGCAACGCATGGAATTTATGTGGGCAAAGTATTTCAACCTCACGTTGTTGAAGAGTATGGATGAAGATTTAGCAGAAGTTGCCGACGACGAAGGCGGTTCGAATAAAATGGGGCTGTGGCCATTAACAGGGGAAGTGCACTGGCAAGGGATCTATGAAAGAGCGAGAGAAGAAAGGTATAGGCTGAAGATGGACAAGAAGAGAACTACAAAAGTAAAGTTAATTGAGAGGATGAAATTTGGATACAAGCAAAAATCACTTGGAGGATGA
Protein sequence
MRRSSSSEIDDNGSANVVPGVHSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQSRINRKGFLWWLPARGNTLFYCVVIFAVFAFVTGSVMLQSSITLMSSHGSEKGRWLMERIKFGSSLKFVPGRISRRLVEGDGLDEARKKDRVGVRAPRLALILGSTERDPQSLMLVTVMKNIQKLGYVLEIFAVEGGNKHSMWKQIGGQPSILSPEHYGHVDWSIYDGIVADSLEAEGAIASLMQEPFCSIPLVWIIREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMMYSTLDNGNFYVIPGSPADVYAAENFKNVHSKSQLREKNGFNEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLTNYARQEVGGSFKFVFLCCNSTDGSHDALQEIASRLGLPDGSITHYGLNGDVNGVLMMADIVLYGSSQEIQSFPLLLIRAMSFGIPIMVPDLPALRNYIVDGVHGIIFPKHDSDALLRAFSEMISDGKLSRYAQAIASSGRLLAKNILASECVTSYARLLENVLNFPSDVKLPGSFSQLQLGAWEWNLFRKETVQTIDGNEDAEEGITAISKSSVIFALEAQLTNFVNLTNFSETGNGTLEQDLPTPQDWDILEEIQNAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFMHHGSFYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYHMDILCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCTKAENVLEETIRDKPKGDVIYFWAHFHVNGGIIGSSNAPTFWSACDILNGGLCRTAFENTFREMYGLSANMEALPPMPEDGGCWSALHSWVMPTPSFLEFMMFSRMFTHYLDALNRNQSQPYGCMLASSELEKKHCYCRISEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRMEFMWAKYFNLTLLKSMDEDLAEVADDEGGSNKMGLWPLTGEVHWQGIYERAREERYRLKMDKKRTTKVKLIERMKFGYKQKSLGG
Homology
BLAST of MC04g1449 vs. NCBI nr
Match:
XP_022133863.1 (uncharacterized protein LOC111006310 isoform X1 [Momordica charantia])
HSP 1 Score: 2096 bits (5431), Expect = 0.0
Identity = 1036/1038 (99.81%), Postives = 1038/1038 (100.00%), Query Frame = 0
Query: 1 MRRSSSSEIDDNGSANVVPGVHSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQSRIN 60
MRRSSSSEIDDNGSANVVPGVHSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQSRIN
Sbjct: 1 MRRSSSSEIDDNGSANVVPGVHSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQSRIN 60
Query: 61 RKGFLWWLPARGNTLFYCVVIFAVFAFVTGSVMLQSSITLMSSHGSEKGRWLMERIKFGS 120
RKGFLWWLPARGNTLFYCVVIFAVFAFVTGSVMLQSS+TLMSSHGSEKGRWLMERIKFGS
Sbjct: 61 RKGFLWWLPARGNTLFYCVVIFAVFAFVTGSVMLQSSVTLMSSHGSEKGRWLMERIKFGS 120
Query: 121 SLKFVPGRISRRLVEGDGLDEARKKDRVGVRAPRLALILGSTERDPQSLMLVTVMKNIQK 180
SLKFVPGRISRRLVEGDGLDEARKKDRVGVRAPRLALILGSTERDPQSLMLVTVMKNIQK
Sbjct: 121 SLKFVPGRISRRLVEGDGLDEARKKDRVGVRAPRLALILGSTERDPQSLMLVTVMKNIQK 180
Query: 181 LGYVLEIFAVEGGNKHSMWKQIGGQPSILSPEHYGHVDWSIYDGIVADSLEAEGAIASLM 240
LGYVLEIFAVEGGNKHSMWKQIGGQPSILSPEHYGHVDWSIYDGIVADSLEAEGAIASLM
Sbjct: 181 LGYVLEIFAVEGGNKHSMWKQIGGQPSILSPEHYGHVDWSIYDGIVADSLEAEGAIASLM 240
Query: 241 QEPFCSIPLVWIIREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMMYST 300
QEPFCSIPLVWIIREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMMYST
Sbjct: 241 QEPFCSIPLVWIIREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMMYST 300
Query: 301 LDNGNFYVIPGSPADVYAAENFKNVHSKSQLREKNGFNEDDILVLVVGSLFFPNELSWDY 360
LDNGNFYVIPGSPADVYAAENFKNVHSKSQLREKNGFNEDDILVLVVGSLFFPNELSWDY
Sbjct: 301 LDNGNFYVIPGSPADVYAAENFKNVHSKSQLREKNGFNEDDILVLVVGSLFFPNELSWDY 360
Query: 361 AVAMHSIGPLLTNYARQEVGGSFKFVFLCCNSTDGSHDALQEIASRLGLPDGSITHYGLN 420
AVAMHSIGPLLTNYARQEVGGSFKFVFLCCNSTDGSHDALQEIASRLGLPDGSITHYGLN
Sbjct: 361 AVAMHSIGPLLTNYARQEVGGSFKFVFLCCNSTDGSHDALQEIASRLGLPDGSITHYGLN 420
Query: 421 GDVNGVLMMADIVLYGSSQEIQSFPLLLIRAMSFGIPIMVPDLPALRNYIVDGVHGIIFP 480
GDVNGVLMMADIVLYGSSQEIQSFPLLLIRAMSFGIPIMVPDLPALRNYIVDGVHGIIFP
Sbjct: 421 GDVNGVLMMADIVLYGSSQEIQSFPLLLIRAMSFGIPIMVPDLPALRNYIVDGVHGIIFP 480
Query: 481 KHDSDALLRAFSEMISDGKLSRYAQAIASSGRLLAKNILASECVTSYARLLENVLNFPSD 540
KHDSDALLRAFSEMISDGKLSRYAQAIASSGRLLAKNILASECVTSYARLLENVLNFPSD
Sbjct: 481 KHDSDALLRAFSEMISDGKLSRYAQAIASSGRLLAKNILASECVTSYARLLENVLNFPSD 540
Query: 541 VKLPGSFSQLQLGAWEWNLFRKETVQTIDGNEDAEEGITAISKSSVIFALEAQLTNFVNL 600
VKLPGSFSQLQLGAWEWNLFRKETVQTIDGNEDAEEGITAISKSSVIFALEAQLTNFVNL
Sbjct: 541 VKLPGSFSQLQLGAWEWNLFRKETVQTIDGNEDAEEGITAISKSSVIFALEAQLTNFVNL 600
Query: 601 TNFSETGNGTLEQDLPTPQDWDILEEIQNAEEYETVEMEEFQERMERDLGAWDEIYRNAR 660
TNFSETGNGTLEQDLPTPQDWDILEEIQNAEEYETVEMEEFQERMERDLGAWDEIYRNAR
Sbjct: 601 TNFSETGNGTLEQDLPTPQDWDILEEIQNAEEYETVEMEEFQERMERDLGAWDEIYRNAR 660
Query: 661 KSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFMHHGSFYRGLSLSTRALRLKSD 720
KSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFMHHGSFYRGLSLSTRALRLKSD
Sbjct: 661 KSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFMHHGSFYRGLSLSTRALRLKSD 720
Query: 721 DVNAVGRLPLLNDSYHMDILCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCTKA 780
DVNAVGRLPLLNDSY+MDILCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCTKA
Sbjct: 721 DVNAVGRLPLLNDSYYMDILCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCTKA 780
Query: 781 ENVLEETIRDKPKGDVIYFWAHFHVNGGIIGSSNAPTFWSACDILNGGLCRTAFENTFRE 840
ENVLEETIRDKPKGDVIYFWAHFHVNGGIIGSSNAPTFWSACDILNGGLCRTAFENTFRE
Sbjct: 781 ENVLEETIRDKPKGDVIYFWAHFHVNGGIIGSSNAPTFWSACDILNGGLCRTAFENTFRE 840
Query: 841 MYGLSANMEALPPMPEDGGCWSALHSWVMPTPSFLEFMMFSRMFTHYLDALNRNQSQPYG 900
MYGLSANMEALPPMPEDGGCWSALHSWVMPTPSFLEFMMFSRMFTHYLDALNRNQSQPYG
Sbjct: 841 MYGLSANMEALPPMPEDGGCWSALHSWVMPTPSFLEFMMFSRMFTHYLDALNRNQSQPYG 900
Query: 901 CMLASSELEKKHCYCRISEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRMEFMWAKY 960
CMLASSELEKKHCYCRISEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRMEFMWAKY
Sbjct: 901 CMLASSELEKKHCYCRISEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRMEFMWAKY 960
Query: 961 FNLTLLKSMDEDLAEVADDEGGSNKMGLWPLTGEVHWQGIYERAREERYRLKMDKKRTTK 1020
FNLTLLKSMDEDLAEVADDEGGSNKMGLWPLTGEVHWQGIYERAREERYRLKMDKKRTTK
Sbjct: 961 FNLTLLKSMDEDLAEVADDEGGSNKMGLWPLTGEVHWQGIYERAREERYRLKMDKKRTTK 1020
Query: 1021 VKLIERMKFGYKQKSLGG 1038
VKLIERMKFGYKQKSLGG
Sbjct: 1021 VKLIERMKFGYKQKSLGG 1038
BLAST of MC04g1449 vs. NCBI nr
Match:
XP_022958089.1 (uncharacterized protein LOC111459418 isoform X1 [Cucurbita moschata] >KAG6602279.1 hypothetical protein SDJN03_07512, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1910 bits (4947), Expect = 0.0
Identity = 936/1038 (90.17%), Postives = 980/1038 (94.41%), Query Frame = 0
Query: 1 MRRSSSSEIDDNGSANVVPGVHSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQSRIN 60
MRRSSS+EIDDNGS N VP +HSIRDRFPFKRNSSHFRLR KDSLDHA RSRSHQSRIN
Sbjct: 1 MRRSSSTEIDDNGSGNAVPVLHSIRDRFPFKRNSSHFRLRAKDSLDHATPRSRSHQSRIN 60
Query: 61 RKGFLWWLPARGNTLFYCVVIFAVFAFVTGSVMLQSSITLMSSHGSEKGRWLMERIKFGS 120
RKG LWWLPARG T FY VV+FAVFAFV+GS++LQSSI+LMSS GSE+GRWLMERIKFGS
Sbjct: 61 RKGLLWWLPARGQTFFYFVVVFAVFAFVSGSMLLQSSISLMSSPGSERGRWLMERIKFGS 120
Query: 121 SLKFVPGRISRRLVEGDGLDEARKKDRVGVRAPRLALILGSTERDPQSLMLVTVMKNIQK 180
SLKF PGRISRRLVEG GLDE RKKDRVGVRAPRLALILGS E +PQSLML+TVMKNIQK
Sbjct: 121 SLKFFPGRISRRLVEGVGLDEVRKKDRVGVRAPRLALILGSMESNPQSLMLITVMKNIQK 180
Query: 181 LGYVLEIFAVEGGNKHSMWKQIGGQPSILSPEHYGHVDWSIYDGIVADSLEAEGAIASLM 240
LGYVLEIFAVE GN+HSMWKQIGGQPSILSPEHYGHVDWSIYDGI+ADSLEAEGAIASLM
Sbjct: 181 LGYVLEIFAVESGNEHSMWKQIGGQPSILSPEHYGHVDWSIYDGIIADSLEAEGAIASLM 240
Query: 241 QEPFCSIPLVWIIREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMMYST 300
QEPFCS+PL+WI+REDTLANRLPMYEQRGWKHLISHWKSSFRRAN+VVFPDF+LPM+YS
Sbjct: 241 QEPFCSVPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANIVVFPDFSLPMLYSI 300
Query: 301 LDNGNFYVIPGSPADVYAAENFKNVHSKSQLREKNGFNEDDILVLVVGSLFFPNELSWDY 360
LDNGNFYVIPGSPADVYAAEN+KNVHSKSQLREKNGFNEDDILV+VVGSLFFPNELSWDY
Sbjct: 301 LDNGNFYVIPGSPADVYAAENYKNVHSKSQLREKNGFNEDDILVIVVGSLFFPNELSWDY 360
Query: 361 AVAMHSIGPLLTNYARQEVGGSFKFVFLCCNSTDGSHDALQEIASRLGLPDGSITHYGLN 420
AVAMHSIGPLLT YARQEVGGSFKFVFLCCNSTDGSH ALQEIASRLGLPD SITHYGLN
Sbjct: 361 AVAMHSIGPLLTKYARQEVGGSFKFVFLCCNSTDGSHGALQEIASRLGLPDASITHYGLN 420
Query: 421 GDVNGVLMMADIVLYGSSQEIQSFPLLLIRAMSFGIPIMVPDLPALRNYIVDGVHGIIFP 480
GDVN VLMMADIVLYGSSQEIQSFP LLIRAMSFGIPIMVPDLPALRNYIVDGVHG+IFP
Sbjct: 421 GDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPALRNYIVDGVHGVIFP 480
Query: 481 KHDSDALLRAFSEMISDGKLSRYAQAIASSGRLLAKNILASECVTSYARLLENVLNFPSD 540
KH+ DALL +FS MISDGKLSR++QAIASSG+LLAKNILASECVTSYARLLENVLNFPSD
Sbjct: 481 KHNPDALLDSFSRMISDGKLSRFSQAIASSGKLLAKNILASECVTSYARLLENVLNFPSD 540
Query: 541 VKLPGSFSQLQLGAWEWNLFRKETVQTIDGNEDAEEGITAISKSSVIFALEAQLTNFVNL 600
VKLPGS SQLQLGAWEWNLFR+E VQTI D EE I A SKSSVIFALEAQ+TNFVNL
Sbjct: 541 VKLPGSVSQLQLGAWEWNLFREEAVQTIGKKVDTEERIAATSKSSVIFALEAQITNFVNL 600
Query: 601 TNFSETGNGTLEQDLPTPQDWDILEEIQNAEEYETVEMEEFQERMERDLGAWDEIYRNAR 660
TN SET NGTLEQD+PTP DWDILEEI+NAEEYETVEMEEFQERMERDLGAWDEIYRNAR
Sbjct: 601 TNSSETENGTLEQDIPTPHDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNAR 660
Query: 661 KSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFMHHGSFYRGLSLSTRALRLKSD 720
KSEKLKFEANERDEGELERTGQPVSIYEIY+GAGAWPFMHHGS YRGLSLSTRALRLKSD
Sbjct: 661 KSEKLKFEANERDEGELERTGQPVSIYEIYNGAGAWPFMHHGSLYRGLSLSTRALRLKSD 720
Query: 721 DVNAVGRLPLLNDSYHMDILCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCTKA 780
DVNAVGRLPLLNDSY+ D LCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL KA
Sbjct: 721 DVNAVGRLPLLNDSYYPDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLFPKA 780
Query: 781 ENVLEETIRDKPKGDVIYFWAHFHVNGGIIGSSNAPTFWSACDILNGGLCRTAFENTFRE 840
ENVLE+TIRD KGDVIYFWAH VN GI+G SNAPTFWS CDILNGGLCRTAFENTFRE
Sbjct: 781 ENVLEDTIRDNTKGDVIYFWAHLQVNRGILGGSNAPTFWSVCDILNGGLCRTAFENTFRE 840
Query: 841 MYGLSANMEALPPMPEDGGCWSALHSWVMPTPSFLEFMMFSRMFTHYLDALNRNQSQPYG 900
M+GLS+NMEALPPMP+DGG WSALHSWVMPTPSFLEF+MFSRMFTHYLDA+NRNQSQPYG
Sbjct: 841 MFGLSSNMEALPPMPDDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAVNRNQSQPYG 900
Query: 901 CMLASSELEKKHCYCRISEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRMEFMWAKY 960
C++ASSELEKKHCYCRI EILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQR EFMWAKY
Sbjct: 901 CLVASSELEKKHCYCRILEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRREFMWAKY 960
Query: 961 FNLTLLKSMDEDLAEVADDEGGSNKMGLWPLTGEVHWQGIYERAREERYRLKMDKKRTTK 1020
FN TLLKSMDEDLAE ADDEGGSN+MGLWPLTGEVHWQGIYER REERYR+KMDKKRTTK
Sbjct: 961 FNSTLLKSMDEDLAEAADDEGGSNQMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTK 1020
Query: 1021 VKLIERMKFGYKQKSLGG 1038
VKL+ERMKFGYKQKSL G
Sbjct: 1021 VKLMERMKFGYKQKSLAG 1038
BLAST of MC04g1449 vs. NCBI nr
Match:
KAG7032959.1 (hypothetical protein SDJN02_07010 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1907 bits (4940), Expect = 0.0
Identity = 935/1038 (90.08%), Postives = 979/1038 (94.32%), Query Frame = 0
Query: 1 MRRSSSSEIDDNGSANVVPGVHSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQSRIN 60
MRRSSS+EIDDNGS N VP +HSIRDRFPFKRNSSHFRLR KDSLDHA RSRSHQSRIN
Sbjct: 1 MRRSSSTEIDDNGSGNAVPVLHSIRDRFPFKRNSSHFRLRAKDSLDHATPRSRSHQSRIN 60
Query: 61 RKGFLWWLPARGNTLFYCVVIFAVFAFVTGSVMLQSSITLMSSHGSEKGRWLMERIKFGS 120
RKG LWWLPARG T FY VV+FAVFAFV+GS++LQSSI+LMSS GSE+GRWLMERIKFGS
Sbjct: 61 RKGLLWWLPARGQTFFYFVVVFAVFAFVSGSMLLQSSISLMSSPGSERGRWLMERIKFGS 120
Query: 121 SLKFVPGRISRRLVEGDGLDEARKKDRVGVRAPRLALILGSTERDPQSLMLVTVMKNIQK 180
SLKF PGRISRRLVEG GLDE RKKDRVGVRAPRLALILGS E +PQSLML+TVMKNIQK
Sbjct: 121 SLKFFPGRISRRLVEGVGLDEVRKKDRVGVRAPRLALILGSMESNPQSLMLITVMKNIQK 180
Query: 181 LGYVLEIFAVEGGNKHSMWKQIGGQPSILSPEHYGHVDWSIYDGIVADSLEAEGAIASLM 240
LGYVLEIFAVE GN+HSMWKQIGGQPSILSPEHYGHVDWSIYDGI+ADSLEAEGAIASLM
Sbjct: 181 LGYVLEIFAVESGNEHSMWKQIGGQPSILSPEHYGHVDWSIYDGIIADSLEAEGAIASLM 240
Query: 241 QEPFCSIPLVWIIREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMMYST 300
QEPFCS+PL+WI+REDTLANRLPMYEQRGWKHLISHWKSSFRRAN+VVFPDF+LPM+YS
Sbjct: 241 QEPFCSVPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANIVVFPDFSLPMLYSI 300
Query: 301 LDNGNFYVIPGSPADVYAAENFKNVHSKSQLREKNGFNEDDILVLVVGSLFFPNELSWDY 360
LDNGNFYVIPGSPADVYAAEN+KNVHSKSQLREKNGFNEDDILV+VVGSLFFPNELSWDY
Sbjct: 301 LDNGNFYVIPGSPADVYAAENYKNVHSKSQLREKNGFNEDDILVIVVGSLFFPNELSWDY 360
Query: 361 AVAMHSIGPLLTNYARQEVGGSFKFVFLCCNSTDGSHDALQEIASRLGLPDGSITHYGLN 420
AVAMHSIGPLLT YARQEVGGSFKFVFLCCNSTDGSH ALQEIASRLGLPD SITHYGLN
Sbjct: 361 AVAMHSIGPLLTKYARQEVGGSFKFVFLCCNSTDGSHGALQEIASRLGLPDASITHYGLN 420
Query: 421 GDVNGVLMMADIVLYGSSQEIQSFPLLLIRAMSFGIPIMVPDLPALRNYIVDGVHGIIFP 480
GDVN VLMMADIVLYGSSQEIQSFP LLIRAMSFGIPIMVPDLPALRNYIVDGVHG+IFP
Sbjct: 421 GDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPALRNYIVDGVHGVIFP 480
Query: 481 KHDSDALLRAFSEMISDGKLSRYAQAIASSGRLLAKNILASECVTSYARLLENVLNFPSD 540
KH+ DALL +FS MISDGKLSR++QAIASSG+LLAKNILASECVTSYARLLENVLNFPSD
Sbjct: 481 KHNPDALLDSFSRMISDGKLSRFSQAIASSGKLLAKNILASECVTSYARLLENVLNFPSD 540
Query: 541 VKLPGSFSQLQLGAWEWNLFRKETVQTIDGNEDAEEGITAISKSSVIFALEAQLTNFVNL 600
VKLPGS SQLQLGAWEWNLFR+E VQTI D EE I A SKSSVIFALEAQ+TNFVNL
Sbjct: 541 VKLPGSVSQLQLGAWEWNLFREEAVQTIGKKVDTEERIAATSKSSVIFALEAQITNFVNL 600
Query: 601 TNFSETGNGTLEQDLPTPQDWDILEEIQNAEEYETVEMEEFQERMERDLGAWDEIYRNAR 660
TN SET NGTLEQD+PTP DWDILEEI+NAEEYETVEMEEFQERMERDLGAWDEIYRNAR
Sbjct: 601 TNSSETENGTLEQDIPTPHDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNAR 660
Query: 661 KSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFMHHGSFYRGLSLSTRALRLKSD 720
KSEKLKFEANERDEGELERTGQPVSIYEIY+GAGAWPFMHHGS YRGLSLSTRALRLKSD
Sbjct: 661 KSEKLKFEANERDEGELERTGQPVSIYEIYNGAGAWPFMHHGSLYRGLSLSTRALRLKSD 720
Query: 721 DVNAVGRLPLLNDSYHMDILCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCTKA 780
DVNAVGRLPLLNDSY+ D LCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL KA
Sbjct: 721 DVNAVGRLPLLNDSYYPDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLFPKA 780
Query: 781 ENVLEETIRDKPKGDVIYFWAHFHVNGGIIGSSNAPTFWSACDILNGGLCRTAFENTFRE 840
ENVLE+TIRD KGDVIYFWAH VN GI+G SNAPTFWS CDILNGGLCRTAFENTFRE
Sbjct: 781 ENVLEDTIRDNTKGDVIYFWAHLQVNRGILGGSNAPTFWSVCDILNGGLCRTAFENTFRE 840
Query: 841 MYGLSANMEALPPMPEDGGCWSALHSWVMPTPSFLEFMMFSRMFTHYLDALNRNQSQPYG 900
M+GLS+NMEALPPMP+DGG WSALHSWVMPTPSFLEF+MFSRMFTHYLDA+NRN SQPYG
Sbjct: 841 MFGLSSNMEALPPMPDDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAVNRNLSQPYG 900
Query: 901 CMLASSELEKKHCYCRISEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRMEFMWAKY 960
C++ASSELEKKHCYCRI EILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQR EFMWAKY
Sbjct: 901 CLVASSELEKKHCYCRILEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRREFMWAKY 960
Query: 961 FNLTLLKSMDEDLAEVADDEGGSNKMGLWPLTGEVHWQGIYERAREERYRLKMDKKRTTK 1020
FN TLLKSMDEDLAE ADDEGGSN+MGLWPLTGEVHWQGIYER REERYR+KMDKKRTTK
Sbjct: 961 FNSTLLKSMDEDLAEAADDEGGSNQMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTK 1020
Query: 1021 VKLIERMKFGYKQKSLGG 1038
VKL+ERMKFGYKQKSL G
Sbjct: 1021 VKLMERMKFGYKQKSLAG 1038
BLAST of MC04g1449 vs. NCBI nr
Match:
XP_022990225.1 (uncharacterized protein LOC111487177 isoform X1 [Cucurbita maxima])
HSP 1 Score: 1902 bits (4927), Expect = 0.0
Identity = 931/1038 (89.69%), Postives = 976/1038 (94.03%), Query Frame = 0
Query: 1 MRRSSSSEIDDNGSANVVPGVHSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQSRIN 60
MRRSSS+EIDDNGS N VP +HS RDRFPFKRNSSHFRLR KDSLDHA RSRSHQSRIN
Sbjct: 1 MRRSSSTEIDDNGSGNAVPVLHSSRDRFPFKRNSSHFRLRAKDSLDHATPRSRSHQSRIN 60
Query: 61 RKGFLWWLPARGNTLFYCVVIFAVFAFVTGSVMLQSSITLMSSHGSEKGRWLMERIKFGS 120
RKG LWWLPARG T FY VV+FAVFAFV+GS++LQSSI+LMSS GSE+GRWLMERIKFGS
Sbjct: 61 RKGLLWWLPARGQTFFYFVVVFAVFAFVSGSMLLQSSISLMSSPGSERGRWLMERIKFGS 120
Query: 121 SLKFVPGRISRRLVEGDGLDEARKKDRVGVRAPRLALILGSTERDPQSLMLVTVMKNIQK 180
SLKF PGRISRRLVEG GLDE RKKDRVGVRAPRLALILGS E +PQSLML+TVMKNIQK
Sbjct: 121 SLKFFPGRISRRLVEGVGLDEVRKKDRVGVRAPRLALILGSMESNPQSLMLITVMKNIQK 180
Query: 181 LGYVLEIFAVEGGNKHSMWKQIGGQPSILSPEHYGHVDWSIYDGIVADSLEAEGAIASLM 240
LGYVLEIFAVE GN+HSMWKQIGGQPSILSPEHYGHVDWSIYDGI+ADSLEAEG IASLM
Sbjct: 181 LGYVLEIFAVESGNEHSMWKQIGGQPSILSPEHYGHVDWSIYDGIIADSLEAEGVIASLM 240
Query: 241 QEPFCSIPLVWIIREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMMYST 300
QEPFCS+PL+WI+REDTLANRLPMYEQRGWKHLISHWKSSFRRAN+VVFPDF+LPM+YS
Sbjct: 241 QEPFCSVPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANIVVFPDFSLPMLYSI 300
Query: 301 LDNGNFYVIPGSPADVYAAENFKNVHSKSQLREKNGFNEDDILVLVVGSLFFPNELSWDY 360
LDNGNFYVIPGSPADVYAAEN+KNVHSKSQLREKNGFNEDDILV+VVGSLFFPNELSWDY
Sbjct: 301 LDNGNFYVIPGSPADVYAAENYKNVHSKSQLREKNGFNEDDILVIVVGSLFFPNELSWDY 360
Query: 361 AVAMHSIGPLLTNYARQEVGGSFKFVFLCCNSTDGSHDALQEIASRLGLPDGSITHYGLN 420
AVAMHSIGPLLT YARQEVGGSFKF+FLCCNSTDGSH ALQEIASRLGLPD SITHYGLN
Sbjct: 361 AVAMHSIGPLLTKYARQEVGGSFKFIFLCCNSTDGSHGALQEIASRLGLPDDSITHYGLN 420
Query: 421 GDVNGVLMMADIVLYGSSQEIQSFPLLLIRAMSFGIPIMVPDLPALRNYIVDGVHGIIFP 480
GDVN VLMMADIVLYGSSQEIQSFP LLIRAMSFGIPIMVPDLPALRNYIVDGVHG+IFP
Sbjct: 421 GDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPALRNYIVDGVHGVIFP 480
Query: 481 KHDSDALLRAFSEMISDGKLSRYAQAIASSGRLLAKNILASECVTSYARLLENVLNFPSD 540
KH+ DALL +FS MISDGKLSR++QAIASSG+LLAKNILASECVTSYARLLENVLNFPSD
Sbjct: 481 KHNPDALLDSFSRMISDGKLSRFSQAIASSGKLLAKNILASECVTSYARLLENVLNFPSD 540
Query: 541 VKLPGSFSQLQLGAWEWNLFRKETVQTIDGNEDAEEGITAISKSSVIFALEAQLTNFVNL 600
VKLPGS SQLQL AWEWNLFR+E VQTI D EE I A SKSSVIFALEAQ+TNFVNL
Sbjct: 541 VKLPGSVSQLQLEAWEWNLFREEVVQTIGKKVDTEERIAATSKSSVIFALEAQITNFVNL 600
Query: 601 TNFSETGNGTLEQDLPTPQDWDILEEIQNAEEYETVEMEEFQERMERDLGAWDEIYRNAR 660
TN SETGNGTLEQD+PTP DWDILEEI+N EEYETVEMEEFQERMERDLGAWDEIYRNAR
Sbjct: 601 TNSSETGNGTLEQDIPTPHDWDILEEIENTEEYETVEMEEFQERMERDLGAWDEIYRNAR 660
Query: 661 KSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFMHHGSFYRGLSLSTRALRLKSD 720
KSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPF+HHGS YRGLSLSTRALRLKSD
Sbjct: 661 KSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFLHHGSLYRGLSLSTRALRLKSD 720
Query: 721 DVNAVGRLPLLNDSYHMDILCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCTKA 780
DVNAVGRLPLLNDSY+ D LCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL KA
Sbjct: 721 DVNAVGRLPLLNDSYYQDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLFPKA 780
Query: 781 ENVLEETIRDKPKGDVIYFWAHFHVNGGIIGSSNAPTFWSACDILNGGLCRTAFENTFRE 840
ENVLE+TIRD KGDVIYFW H VN GI+G SNAPTFWS CDILNGGLCRTAFENTFRE
Sbjct: 781 ENVLEDTIRDNTKGDVIYFWTHLQVNRGILGGSNAPTFWSVCDILNGGLCRTAFENTFRE 840
Query: 841 MYGLSANMEALPPMPEDGGCWSALHSWVMPTPSFLEFMMFSRMFTHYLDALNRNQSQPYG 900
M+GLS+NMEALPPMP++GG WSALHSWVMPTPSFLEF+MFSRMFTHYLDA+NRNQSQPYG
Sbjct: 841 MFGLSSNMEALPPMPDNGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAVNRNQSQPYG 900
Query: 901 CMLASSELEKKHCYCRISEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRMEFMWAKY 960
C+LASSELEKKHCYCRI EILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQR EFMWAKY
Sbjct: 901 CLLASSELEKKHCYCRILEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRREFMWAKY 960
Query: 961 FNLTLLKSMDEDLAEVADDEGGSNKMGLWPLTGEVHWQGIYERAREERYRLKMDKKRTTK 1020
FN TLLKSMDEDLAE ADDEGGSN+MGLWPLTGEVHWQGIYER REERYR+KMDKKRTTK
Sbjct: 961 FNSTLLKSMDEDLAEAADDEGGSNQMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTK 1020
Query: 1021 VKLIERMKFGYKQKSLGG 1038
VKL+ERMKFGYKQKSL G
Sbjct: 1021 VKLMERMKFGYKQKSLAG 1038
BLAST of MC04g1449 vs. NCBI nr
Match:
XP_038884759.1 (uncharacterized protein LOC120075439 isoform X1 [Benincasa hispida])
HSP 1 Score: 1841 bits (4768), Expect = 0.0
Identity = 909/1039 (87.49%), Postives = 962/1039 (92.59%), Query Frame = 0
Query: 1 MRRSSSSEIDDNGSANVVPGVHSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQSRIN 60
MRRSSSSEIDDNGS N VPG HSIRDRFPFKRNSSHFRLR KDSLDHAASRSRSHQSRIN
Sbjct: 1 MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRIN 60
Query: 61 RKGFLWWLPARGNTLFYCVVIFAVFAFVTGSVMLQSSITLMSSHGSEKGRWLMERIKFGS 120
RKG LWW+PARG TLFY +V+FAVF FVTGS++LQSSI+LMSS GSE+ RWLMERIKFGS
Sbjct: 61 RKGLLWWIPARGQTLFYFIVVFAVFGFVTGSMLLQSSISLMSSPGSERERWLMERIKFGS 120
Query: 121 SLKFVPGRISRRLVEGDGLDEARKKDRVGVRAPRLALILGSTERDPQSLMLVTVMKNIQK 180
SLKFVPG ISR+LVEGDGLDE RKKDRVGVR+PRLALILGS E DPQSLML+TVMKNIQK
Sbjct: 121 SLKFVPGGISRKLVEGDGLDEMRKKDRVGVRSPRLALILGSMENDPQSLMLITVMKNIQK 180
Query: 181 LGYVLEIFAVEGGNKHSMWKQIGGQPSILSPEHYGHVDWSIYDGIVADSLEAEGAIASLM 240
LGY+LEIFAVE GNKHS+W+QIGGQPSILSP HYG VDWSIYDGI+ADSLEAEGAIASLM
Sbjct: 181 LGYLLEIFAVESGNKHSIWEQIGGQPSILSPRHYGRVDWSIYDGIIADSLEAEGAIASLM 240
Query: 241 QEPFCSIPLVWIIREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMMYST 300
QEPFCS+PL+WI+REDTLANRLP+YEQRGWKHLISHWKSSFRRANVVVFPDFALPM+YST
Sbjct: 241 QEPFCSLPLIWIVREDTLANRLPVYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST 300
Query: 301 LDNGNFYVIPGSPADVYAAENFKNVHSKSQLREKNGFNEDDILVLVVGSLFFPNELSWDY 360
LD+GNF+VIPGSPADVYAAEN+KN HSKSQLREKNGF+EDDILVLVVGSLFFPNELSWDY
Sbjct: 301 LDSGNFHVIPGSPADVYAAENYKNNHSKSQLREKNGFSEDDILVLVVGSLFFPNELSWDY 360
Query: 361 AVAMHSIGPLLTNYARQ-EVGGSFKFVFLCCNSTDGSHDALQEIASRLGLPDGSITHYGL 420
AVAMHSIGPLL+ YAR+ EVGGSFKFVFLCCNSTDGSHDAL+EIASRLGLPDGSITHYGL
Sbjct: 361 AVAMHSIGPLLSIYARRKEVGGSFKFVFLCCNSTDGSHDALKEIASRLGLPDGSITHYGL 420
Query: 421 NGDVNGVLMMADIVLYGSSQEIQSFPLLLIRAMSFGIPIMVPDLPALRNYIVDGVHGIIF 480
NGDVN VLMMADIVLYGSSQEIQSFP LLIRAMSFGIPIMVPDLPALRNYIVDGVHG+IF
Sbjct: 421 NGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPALRNYIVDGVHGVIF 480
Query: 481 PKHDSDALLRAFSEMISDGKLSRYAQAIASSGRLLAKNILASECVTSYARLLENVLNFPS 540
PKH+ DALL +FS+MISDGKLSR+AQAIASSGRLLAKNILASECVT YA+LLENVLNFP
Sbjct: 481 PKHNPDALLSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVTGYAQLLENVLNFPL 540
Query: 541 DVKLPGSFSQLQLGAWEWNLFRKETVQTIDGNEDAEEGITAISKSSVIFALEAQLTNFVN 600
DVKLP S SQLQLGAWEWNLFRKE V+ ID D EE I A +K+SVIFALEAQLTN VN
Sbjct: 541 DVKLPSSASQLQLGAWEWNLFRKEMVKKIDEYADDEERIAAKNKASVIFALEAQLTNSVN 600
Query: 601 LTNFSETGNGTLEQDLPTPQDWDILEEIQNAEEYETVEMEEFQERMERDLGAWDEIYRNA 660
LT SE NGTLE D+PT QDWD+LEEI+NAEEYETVEMEEFQERMERDLGAWD+IYRNA
Sbjct: 601 LTILSENENGTLEYDIPTSQDWDVLEEIENAEEYETVEMEEFQERMERDLGAWDDIYRNA 660
Query: 661 RKSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFMHHGSFYRGLSLSTRALRLKS 720
RKSEKLKFEANERDEGELERTGQ VSIYEIYSGAGAWPFMHHGS YRGLSLST+ALRLKS
Sbjct: 661 RKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFMHHGSLYRGLSLSTKALRLKS 720
Query: 721 DDVNAVGRLPLLNDSYHMDILCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCTK 780
DDVNAVGRLPLLNDSY++D LCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLC K
Sbjct: 721 DDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKK 780
Query: 781 AENVLEETIRDKPKGDVIYFWAHFHVNGGIIGSSNAPTFWSACDILNGGLCRTAFENTFR 840
AEN LE+ IRD PKGDVIYFWAH VN GII TFWS CDILNGGLCRT F++TFR
Sbjct: 781 AENALEDAIRDNPKGDVIYFWAHLQVNRGII----PLTFWSVCDILNGGLCRTTFKSTFR 840
Query: 841 EMYGLSANMEALPPMPEDGGCWSALHSWVMPTPSFLEFMMFSRMFTHYLDALNRNQSQPY 900
+MYGLS+NM ALPPMPEDGG WSALHSWVMPTPSFLEF+MFSRMFTHYLDALNRNQS P
Sbjct: 841 KMYGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDALNRNQSHPN 900
Query: 901 GCMLASSELEKKHCYCRISEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRMEFMWAK 960
GC+LASSELEKKHCYCRI E+LVNVWAYHSGRR+VYI+P SGFLEEQHPVEQR EFMWAK
Sbjct: 901 GCLLASSELEKKHCYCRILEMLVNVWAYHSGRRIVYINPQSGFLEEQHPVEQRKEFMWAK 960
Query: 961 YFNLTLLKSMDEDLAEVADDEGGSNKMGLWPLTGEVHWQGIYERAREERYRLKMDKKRTT 1020
YFN TLLKSMDEDLAE DDEG S K GLWPLTGEVHWQGIYER REERYR+KMDKKRTT
Sbjct: 961 YFNFTLLKSMDEDLAEAVDDEGSSGKTGLWPLTGEVHWQGIYEREREERYRVKMDKKRTT 1020
Query: 1021 KVKLIERMKFGYKQKSLGG 1038
KVKL ERMKFGYKQKSLGG
Sbjct: 1021 KVKLAERMKFGYKQKSLGG 1035
BLAST of MC04g1449 vs. ExPASy TrEMBL
Match:
A0A6J1C0E9 (uncharacterized protein LOC111006310 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111006310 PE=4 SV=1)
HSP 1 Score: 2096 bits (5431), Expect = 0.0
Identity = 1036/1038 (99.81%), Postives = 1038/1038 (100.00%), Query Frame = 0
Query: 1 MRRSSSSEIDDNGSANVVPGVHSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQSRIN 60
MRRSSSSEIDDNGSANVVPGVHSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQSRIN
Sbjct: 1 MRRSSSSEIDDNGSANVVPGVHSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQSRIN 60
Query: 61 RKGFLWWLPARGNTLFYCVVIFAVFAFVTGSVMLQSSITLMSSHGSEKGRWLMERIKFGS 120
RKGFLWWLPARGNTLFYCVVIFAVFAFVTGSVMLQSS+TLMSSHGSEKGRWLMERIKFGS
Sbjct: 61 RKGFLWWLPARGNTLFYCVVIFAVFAFVTGSVMLQSSVTLMSSHGSEKGRWLMERIKFGS 120
Query: 121 SLKFVPGRISRRLVEGDGLDEARKKDRVGVRAPRLALILGSTERDPQSLMLVTVMKNIQK 180
SLKFVPGRISRRLVEGDGLDEARKKDRVGVRAPRLALILGSTERDPQSLMLVTVMKNIQK
Sbjct: 121 SLKFVPGRISRRLVEGDGLDEARKKDRVGVRAPRLALILGSTERDPQSLMLVTVMKNIQK 180
Query: 181 LGYVLEIFAVEGGNKHSMWKQIGGQPSILSPEHYGHVDWSIYDGIVADSLEAEGAIASLM 240
LGYVLEIFAVEGGNKHSMWKQIGGQPSILSPEHYGHVDWSIYDGIVADSLEAEGAIASLM
Sbjct: 181 LGYVLEIFAVEGGNKHSMWKQIGGQPSILSPEHYGHVDWSIYDGIVADSLEAEGAIASLM 240
Query: 241 QEPFCSIPLVWIIREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMMYST 300
QEPFCSIPLVWIIREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMMYST
Sbjct: 241 QEPFCSIPLVWIIREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMMYST 300
Query: 301 LDNGNFYVIPGSPADVYAAENFKNVHSKSQLREKNGFNEDDILVLVVGSLFFPNELSWDY 360
LDNGNFYVIPGSPADVYAAENFKNVHSKSQLREKNGFNEDDILVLVVGSLFFPNELSWDY
Sbjct: 301 LDNGNFYVIPGSPADVYAAENFKNVHSKSQLREKNGFNEDDILVLVVGSLFFPNELSWDY 360
Query: 361 AVAMHSIGPLLTNYARQEVGGSFKFVFLCCNSTDGSHDALQEIASRLGLPDGSITHYGLN 420
AVAMHSIGPLLTNYARQEVGGSFKFVFLCCNSTDGSHDALQEIASRLGLPDGSITHYGLN
Sbjct: 361 AVAMHSIGPLLTNYARQEVGGSFKFVFLCCNSTDGSHDALQEIASRLGLPDGSITHYGLN 420
Query: 421 GDVNGVLMMADIVLYGSSQEIQSFPLLLIRAMSFGIPIMVPDLPALRNYIVDGVHGIIFP 480
GDVNGVLMMADIVLYGSSQEIQSFPLLLIRAMSFGIPIMVPDLPALRNYIVDGVHGIIFP
Sbjct: 421 GDVNGVLMMADIVLYGSSQEIQSFPLLLIRAMSFGIPIMVPDLPALRNYIVDGVHGIIFP 480
Query: 481 KHDSDALLRAFSEMISDGKLSRYAQAIASSGRLLAKNILASECVTSYARLLENVLNFPSD 540
KHDSDALLRAFSEMISDGKLSRYAQAIASSGRLLAKNILASECVTSYARLLENVLNFPSD
Sbjct: 481 KHDSDALLRAFSEMISDGKLSRYAQAIASSGRLLAKNILASECVTSYARLLENVLNFPSD 540
Query: 541 VKLPGSFSQLQLGAWEWNLFRKETVQTIDGNEDAEEGITAISKSSVIFALEAQLTNFVNL 600
VKLPGSFSQLQLGAWEWNLFRKETVQTIDGNEDAEEGITAISKSSVIFALEAQLTNFVNL
Sbjct: 541 VKLPGSFSQLQLGAWEWNLFRKETVQTIDGNEDAEEGITAISKSSVIFALEAQLTNFVNL 600
Query: 601 TNFSETGNGTLEQDLPTPQDWDILEEIQNAEEYETVEMEEFQERMERDLGAWDEIYRNAR 660
TNFSETGNGTLEQDLPTPQDWDILEEIQNAEEYETVEMEEFQERMERDLGAWDEIYRNAR
Sbjct: 601 TNFSETGNGTLEQDLPTPQDWDILEEIQNAEEYETVEMEEFQERMERDLGAWDEIYRNAR 660
Query: 661 KSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFMHHGSFYRGLSLSTRALRLKSD 720
KSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFMHHGSFYRGLSLSTRALRLKSD
Sbjct: 661 KSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFMHHGSFYRGLSLSTRALRLKSD 720
Query: 721 DVNAVGRLPLLNDSYHMDILCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCTKA 780
DVNAVGRLPLLNDSY+MDILCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCTKA
Sbjct: 721 DVNAVGRLPLLNDSYYMDILCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCTKA 780
Query: 781 ENVLEETIRDKPKGDVIYFWAHFHVNGGIIGSSNAPTFWSACDILNGGLCRTAFENTFRE 840
ENVLEETIRDKPKGDVIYFWAHFHVNGGIIGSSNAPTFWSACDILNGGLCRTAFENTFRE
Sbjct: 781 ENVLEETIRDKPKGDVIYFWAHFHVNGGIIGSSNAPTFWSACDILNGGLCRTAFENTFRE 840
Query: 841 MYGLSANMEALPPMPEDGGCWSALHSWVMPTPSFLEFMMFSRMFTHYLDALNRNQSQPYG 900
MYGLSANMEALPPMPEDGGCWSALHSWVMPTPSFLEFMMFSRMFTHYLDALNRNQSQPYG
Sbjct: 841 MYGLSANMEALPPMPEDGGCWSALHSWVMPTPSFLEFMMFSRMFTHYLDALNRNQSQPYG 900
Query: 901 CMLASSELEKKHCYCRISEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRMEFMWAKY 960
CMLASSELEKKHCYCRISEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRMEFMWAKY
Sbjct: 901 CMLASSELEKKHCYCRISEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRMEFMWAKY 960
Query: 961 FNLTLLKSMDEDLAEVADDEGGSNKMGLWPLTGEVHWQGIYERAREERYRLKMDKKRTTK 1020
FNLTLLKSMDEDLAEVADDEGGSNKMGLWPLTGEVHWQGIYERAREERYRLKMDKKRTTK
Sbjct: 961 FNLTLLKSMDEDLAEVADDEGGSNKMGLWPLTGEVHWQGIYERAREERYRLKMDKKRTTK 1020
Query: 1021 VKLIERMKFGYKQKSLGG 1038
VKLIERMKFGYKQKSLGG
Sbjct: 1021 VKLIERMKFGYKQKSLGG 1038
BLAST of MC04g1449 vs. ExPASy TrEMBL
Match:
A0A6J1H431 (uncharacterized protein LOC111459418 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111459418 PE=4 SV=1)
HSP 1 Score: 1910 bits (4947), Expect = 0.0
Identity = 936/1038 (90.17%), Postives = 980/1038 (94.41%), Query Frame = 0
Query: 1 MRRSSSSEIDDNGSANVVPGVHSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQSRIN 60
MRRSSS+EIDDNGS N VP +HSIRDRFPFKRNSSHFRLR KDSLDHA RSRSHQSRIN
Sbjct: 1 MRRSSSTEIDDNGSGNAVPVLHSIRDRFPFKRNSSHFRLRAKDSLDHATPRSRSHQSRIN 60
Query: 61 RKGFLWWLPARGNTLFYCVVIFAVFAFVTGSVMLQSSITLMSSHGSEKGRWLMERIKFGS 120
RKG LWWLPARG T FY VV+FAVFAFV+GS++LQSSI+LMSS GSE+GRWLMERIKFGS
Sbjct: 61 RKGLLWWLPARGQTFFYFVVVFAVFAFVSGSMLLQSSISLMSSPGSERGRWLMERIKFGS 120
Query: 121 SLKFVPGRISRRLVEGDGLDEARKKDRVGVRAPRLALILGSTERDPQSLMLVTVMKNIQK 180
SLKF PGRISRRLVEG GLDE RKKDRVGVRAPRLALILGS E +PQSLML+TVMKNIQK
Sbjct: 121 SLKFFPGRISRRLVEGVGLDEVRKKDRVGVRAPRLALILGSMESNPQSLMLITVMKNIQK 180
Query: 181 LGYVLEIFAVEGGNKHSMWKQIGGQPSILSPEHYGHVDWSIYDGIVADSLEAEGAIASLM 240
LGYVLEIFAVE GN+HSMWKQIGGQPSILSPEHYGHVDWSIYDGI+ADSLEAEGAIASLM
Sbjct: 181 LGYVLEIFAVESGNEHSMWKQIGGQPSILSPEHYGHVDWSIYDGIIADSLEAEGAIASLM 240
Query: 241 QEPFCSIPLVWIIREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMMYST 300
QEPFCS+PL+WI+REDTLANRLPMYEQRGWKHLISHWKSSFRRAN+VVFPDF+LPM+YS
Sbjct: 241 QEPFCSVPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANIVVFPDFSLPMLYSI 300
Query: 301 LDNGNFYVIPGSPADVYAAENFKNVHSKSQLREKNGFNEDDILVLVVGSLFFPNELSWDY 360
LDNGNFYVIPGSPADVYAAEN+KNVHSKSQLREKNGFNEDDILV+VVGSLFFPNELSWDY
Sbjct: 301 LDNGNFYVIPGSPADVYAAENYKNVHSKSQLREKNGFNEDDILVIVVGSLFFPNELSWDY 360
Query: 361 AVAMHSIGPLLTNYARQEVGGSFKFVFLCCNSTDGSHDALQEIASRLGLPDGSITHYGLN 420
AVAMHSIGPLLT YARQEVGGSFKFVFLCCNSTDGSH ALQEIASRLGLPD SITHYGLN
Sbjct: 361 AVAMHSIGPLLTKYARQEVGGSFKFVFLCCNSTDGSHGALQEIASRLGLPDASITHYGLN 420
Query: 421 GDVNGVLMMADIVLYGSSQEIQSFPLLLIRAMSFGIPIMVPDLPALRNYIVDGVHGIIFP 480
GDVN VLMMADIVLYGSSQEIQSFP LLIRAMSFGIPIMVPDLPALRNYIVDGVHG+IFP
Sbjct: 421 GDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPALRNYIVDGVHGVIFP 480
Query: 481 KHDSDALLRAFSEMISDGKLSRYAQAIASSGRLLAKNILASECVTSYARLLENVLNFPSD 540
KH+ DALL +FS MISDGKLSR++QAIASSG+LLAKNILASECVTSYARLLENVLNFPSD
Sbjct: 481 KHNPDALLDSFSRMISDGKLSRFSQAIASSGKLLAKNILASECVTSYARLLENVLNFPSD 540
Query: 541 VKLPGSFSQLQLGAWEWNLFRKETVQTIDGNEDAEEGITAISKSSVIFALEAQLTNFVNL 600
VKLPGS SQLQLGAWEWNLFR+E VQTI D EE I A SKSSVIFALEAQ+TNFVNL
Sbjct: 541 VKLPGSVSQLQLGAWEWNLFREEAVQTIGKKVDTEERIAATSKSSVIFALEAQITNFVNL 600
Query: 601 TNFSETGNGTLEQDLPTPQDWDILEEIQNAEEYETVEMEEFQERMERDLGAWDEIYRNAR 660
TN SET NGTLEQD+PTP DWDILEEI+NAEEYETVEMEEFQERMERDLGAWDEIYRNAR
Sbjct: 601 TNSSETENGTLEQDIPTPHDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNAR 660
Query: 661 KSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFMHHGSFYRGLSLSTRALRLKSD 720
KSEKLKFEANERDEGELERTGQPVSIYEIY+GAGAWPFMHHGS YRGLSLSTRALRLKSD
Sbjct: 661 KSEKLKFEANERDEGELERTGQPVSIYEIYNGAGAWPFMHHGSLYRGLSLSTRALRLKSD 720
Query: 721 DVNAVGRLPLLNDSYHMDILCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCTKA 780
DVNAVGRLPLLNDSY+ D LCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL KA
Sbjct: 721 DVNAVGRLPLLNDSYYPDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLFPKA 780
Query: 781 ENVLEETIRDKPKGDVIYFWAHFHVNGGIIGSSNAPTFWSACDILNGGLCRTAFENTFRE 840
ENVLE+TIRD KGDVIYFWAH VN GI+G SNAPTFWS CDILNGGLCRTAFENTFRE
Sbjct: 781 ENVLEDTIRDNTKGDVIYFWAHLQVNRGILGGSNAPTFWSVCDILNGGLCRTAFENTFRE 840
Query: 841 MYGLSANMEALPPMPEDGGCWSALHSWVMPTPSFLEFMMFSRMFTHYLDALNRNQSQPYG 900
M+GLS+NMEALPPMP+DGG WSALHSWVMPTPSFLEF+MFSRMFTHYLDA+NRNQSQPYG
Sbjct: 841 MFGLSSNMEALPPMPDDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAVNRNQSQPYG 900
Query: 901 CMLASSELEKKHCYCRISEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRMEFMWAKY 960
C++ASSELEKKHCYCRI EILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQR EFMWAKY
Sbjct: 901 CLVASSELEKKHCYCRILEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRREFMWAKY 960
Query: 961 FNLTLLKSMDEDLAEVADDEGGSNKMGLWPLTGEVHWQGIYERAREERYRLKMDKKRTTK 1020
FN TLLKSMDEDLAE ADDEGGSN+MGLWPLTGEVHWQGIYER REERYR+KMDKKRTTK
Sbjct: 961 FNSTLLKSMDEDLAEAADDEGGSNQMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTK 1020
Query: 1021 VKLIERMKFGYKQKSLGG 1038
VKL+ERMKFGYKQKSL G
Sbjct: 1021 VKLMERMKFGYKQKSLAG 1038
BLAST of MC04g1449 vs. ExPASy TrEMBL
Match:
A0A6J1JPJ0 (uncharacterized protein LOC111487177 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111487177 PE=4 SV=1)
HSP 1 Score: 1902 bits (4927), Expect = 0.0
Identity = 931/1038 (89.69%), Postives = 976/1038 (94.03%), Query Frame = 0
Query: 1 MRRSSSSEIDDNGSANVVPGVHSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQSRIN 60
MRRSSS+EIDDNGS N VP +HS RDRFPFKRNSSHFRLR KDSLDHA RSRSHQSRIN
Sbjct: 1 MRRSSSTEIDDNGSGNAVPVLHSSRDRFPFKRNSSHFRLRAKDSLDHATPRSRSHQSRIN 60
Query: 61 RKGFLWWLPARGNTLFYCVVIFAVFAFVTGSVMLQSSITLMSSHGSEKGRWLMERIKFGS 120
RKG LWWLPARG T FY VV+FAVFAFV+GS++LQSSI+LMSS GSE+GRWLMERIKFGS
Sbjct: 61 RKGLLWWLPARGQTFFYFVVVFAVFAFVSGSMLLQSSISLMSSPGSERGRWLMERIKFGS 120
Query: 121 SLKFVPGRISRRLVEGDGLDEARKKDRVGVRAPRLALILGSTERDPQSLMLVTVMKNIQK 180
SLKF PGRISRRLVEG GLDE RKKDRVGVRAPRLALILGS E +PQSLML+TVMKNIQK
Sbjct: 121 SLKFFPGRISRRLVEGVGLDEVRKKDRVGVRAPRLALILGSMESNPQSLMLITVMKNIQK 180
Query: 181 LGYVLEIFAVEGGNKHSMWKQIGGQPSILSPEHYGHVDWSIYDGIVADSLEAEGAIASLM 240
LGYVLEIFAVE GN+HSMWKQIGGQPSILSPEHYGHVDWSIYDGI+ADSLEAEG IASLM
Sbjct: 181 LGYVLEIFAVESGNEHSMWKQIGGQPSILSPEHYGHVDWSIYDGIIADSLEAEGVIASLM 240
Query: 241 QEPFCSIPLVWIIREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMMYST 300
QEPFCS+PL+WI+REDTLANRLPMYEQRGWKHLISHWKSSFRRAN+VVFPDF+LPM+YS
Sbjct: 241 QEPFCSVPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANIVVFPDFSLPMLYSI 300
Query: 301 LDNGNFYVIPGSPADVYAAENFKNVHSKSQLREKNGFNEDDILVLVVGSLFFPNELSWDY 360
LDNGNFYVIPGSPADVYAAEN+KNVHSKSQLREKNGFNEDDILV+VVGSLFFPNELSWDY
Sbjct: 301 LDNGNFYVIPGSPADVYAAENYKNVHSKSQLREKNGFNEDDILVIVVGSLFFPNELSWDY 360
Query: 361 AVAMHSIGPLLTNYARQEVGGSFKFVFLCCNSTDGSHDALQEIASRLGLPDGSITHYGLN 420
AVAMHSIGPLLT YARQEVGGSFKF+FLCCNSTDGSH ALQEIASRLGLPD SITHYGLN
Sbjct: 361 AVAMHSIGPLLTKYARQEVGGSFKFIFLCCNSTDGSHGALQEIASRLGLPDDSITHYGLN 420
Query: 421 GDVNGVLMMADIVLYGSSQEIQSFPLLLIRAMSFGIPIMVPDLPALRNYIVDGVHGIIFP 480
GDVN VLMMADIVLYGSSQEIQSFP LLIRAMSFGIPIMVPDLPALRNYIVDGVHG+IFP
Sbjct: 421 GDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPALRNYIVDGVHGVIFP 480
Query: 481 KHDSDALLRAFSEMISDGKLSRYAQAIASSGRLLAKNILASECVTSYARLLENVLNFPSD 540
KH+ DALL +FS MISDGKLSR++QAIASSG+LLAKNILASECVTSYARLLENVLNFPSD
Sbjct: 481 KHNPDALLDSFSRMISDGKLSRFSQAIASSGKLLAKNILASECVTSYARLLENVLNFPSD 540
Query: 541 VKLPGSFSQLQLGAWEWNLFRKETVQTIDGNEDAEEGITAISKSSVIFALEAQLTNFVNL 600
VKLPGS SQLQL AWEWNLFR+E VQTI D EE I A SKSSVIFALEAQ+TNFVNL
Sbjct: 541 VKLPGSVSQLQLEAWEWNLFREEVVQTIGKKVDTEERIAATSKSSVIFALEAQITNFVNL 600
Query: 601 TNFSETGNGTLEQDLPTPQDWDILEEIQNAEEYETVEMEEFQERMERDLGAWDEIYRNAR 660
TN SETGNGTLEQD+PTP DWDILEEI+N EEYETVEMEEFQERMERDLGAWDEIYRNAR
Sbjct: 601 TNSSETGNGTLEQDIPTPHDWDILEEIENTEEYETVEMEEFQERMERDLGAWDEIYRNAR 660
Query: 661 KSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFMHHGSFYRGLSLSTRALRLKSD 720
KSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPF+HHGS YRGLSLSTRALRLKSD
Sbjct: 661 KSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFLHHGSLYRGLSLSTRALRLKSD 720
Query: 721 DVNAVGRLPLLNDSYHMDILCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCTKA 780
DVNAVGRLPLLNDSY+ D LCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL KA
Sbjct: 721 DVNAVGRLPLLNDSYYQDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLFPKA 780
Query: 781 ENVLEETIRDKPKGDVIYFWAHFHVNGGIIGSSNAPTFWSACDILNGGLCRTAFENTFRE 840
ENVLE+TIRD KGDVIYFW H VN GI+G SNAPTFWS CDILNGGLCRTAFENTFRE
Sbjct: 781 ENVLEDTIRDNTKGDVIYFWTHLQVNRGILGGSNAPTFWSVCDILNGGLCRTAFENTFRE 840
Query: 841 MYGLSANMEALPPMPEDGGCWSALHSWVMPTPSFLEFMMFSRMFTHYLDALNRNQSQPYG 900
M+GLS+NMEALPPMP++GG WSALHSWVMPTPSFLEF+MFSRMFTHYLDA+NRNQSQPYG
Sbjct: 841 MFGLSSNMEALPPMPDNGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAVNRNQSQPYG 900
Query: 901 CMLASSELEKKHCYCRISEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRMEFMWAKY 960
C+LASSELEKKHCYCRI EILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQR EFMWAKY
Sbjct: 901 CLLASSELEKKHCYCRILEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRREFMWAKY 960
Query: 961 FNLTLLKSMDEDLAEVADDEGGSNKMGLWPLTGEVHWQGIYERAREERYRLKMDKKRTTK 1020
FN TLLKSMDEDLAE ADDEGGSN+MGLWPLTGEVHWQGIYER REERYR+KMDKKRTTK
Sbjct: 961 FNSTLLKSMDEDLAEAADDEGGSNQMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTK 1020
Query: 1021 VKLIERMKFGYKQKSLGG 1038
VKL+ERMKFGYKQKSL G
Sbjct: 1021 VKLMERMKFGYKQKSLAG 1038
BLAST of MC04g1449 vs. ExPASy TrEMBL
Match:
A0A5A7UUA8 (UDP-Glycosyltransferase superfamily protein isoform 3 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold371G00160 PE=4 SV=1)
HSP 1 Score: 1837 bits (4759), Expect = 0.0
Identity = 911/1039 (87.68%), Postives = 963/1039 (92.69%), Query Frame = 0
Query: 1 MRRSSSSEIDDNGSANVVPGVHSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQSRIN 60
MRRSSSSEIDDN SAN VPG HSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQ+RIN
Sbjct: 1 MRRSSSSEIDDNASANAVPGTHSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQTRIN 60
Query: 61 RKGFLWWLPARGNTLFYCVVIFAVFAFVTGSVMLQSSITLMSSHGSEKGRWLMERIKFGS 120
RKG L W+PARG TLFY +V+FAVF F TGS++LQSSI+L+SSHGS++ RWLMERIKFGS
Sbjct: 61 RKGLLSWIPARGQTLFYFLVVFAVFGFFTGSMLLQSSISLLSSHGSQRERWLMERIKFGS 120
Query: 121 SLKFVPGRISRRLVEGDGLDEARKKDRVGVRAPRLALILGSTERDPQSLMLVTVMKNIQK 180
SLKFVPGRISRRLVEGDGL+E RKKDRVGVRAPRLALILGS E DPQSLML+TVMKN+QK
Sbjct: 121 SLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNMQK 180
Query: 181 LGYVLEIFAVEGGNKHSMWKQIGGQPSILSPEHYGHVDWSIYDGIVADSLEAEGAIASLM 240
LGYV EIFAVE GNK SMW+QIG QPSILSP HYG VDWSIYDGI+ADSLE EGAIASLM
Sbjct: 181 LGYVFEIFAVESGNKQSMWEQIG-QPSILSPGHYGRVDWSIYDGIIADSLETEGAIASLM 240
Query: 241 QEPFCSIPLVWIIREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMMYST 300
QEPFCS+PL+WI+REDTLA+RLPMYEQRGWKHLISHWK SFRRANVVVFPDFALPM+YS
Sbjct: 241 QEPFCSLPLIWIVREDTLASRLPMYEQRGWKHLISHWKRSFRRANVVVFPDFALPMLYSI 300
Query: 301 LDNGNFYVIPGSPADVYAAENFKNVHSKSQLREKNGFNEDDILVLVVGSLFFPNELSWDY 360
LDNGNF+VIPGSPADVYAAEN+ NVHSKSQLREKNGFN DDILVLVVGSLFFPNELSWDY
Sbjct: 301 LDNGNFHVIPGSPADVYAAENYMNVHSKSQLREKNGFNGDDILVLVVGSLFFPNELSWDY 360
Query: 361 AVAMHSIGPLLTNYARQ-EVGGSFKFVFLCCNSTDGSHDALQEIASRLGLPDGSITHYGL 420
AVAMHSIGPLL+ YAR+ EV GSFKFVFLCCNSTDGSHDAL+EIASRLGLPDGSITHYGL
Sbjct: 361 AVAMHSIGPLLSIYARRREVEGSFKFVFLCCNSTDGSHDALKEIASRLGLPDGSITHYGL 420
Query: 421 NGDVNGVLMMADIVLYGSSQEIQSFPLLLIRAMSFGIPIMVPDLPALRNYIVDGVHGIIF 480
NGDVN VLMMADIVLYGSSQEIQSFP LLIRAMSFGIPIMVPDLPALRNYIVDGVHG+IF
Sbjct: 421 NGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPALRNYIVDGVHGVIF 480
Query: 481 PKHDSDALLRAFSEMISDGKLSRYAQAIASSGRLLAKNILASECVTSYARLLENVLNFPS 540
PKH+ DALL +FS+MISDGKLSR+AQAIASSGRLLAKNILASECVT Y +LLENVLNFPS
Sbjct: 481 PKHNPDALLSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVTGYVQLLENVLNFPS 540
Query: 541 DVKLPGSFSQLQLGAWEWNLFRKETVQTIDGNEDAEEGITAISKSSVIFALEAQLTNFVN 600
DVKLPG SQLQLGAWEWNLFRKE V+TID N D EE I AISK+SVIFALEAQLTN VN
Sbjct: 541 DVKLPGPASQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKASVIFALEAQLTNSVN 600
Query: 601 LTNFSETGNGTLEQDLPTPQDWDILEEIQNAEEYETVEMEEFQERMERDLGAWDEIYRNA 660
LT SE NGTLEQD+PTPQDWDILEEI++AEEYETVEMEEFQERMERDLGAWDEIYRNA
Sbjct: 601 LTILSENENGTLEQDIPTPQDWDILEEIESAEEYETVEMEEFQERMERDLGAWDEIYRNA 660
Query: 661 RKSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFMHHGSFYRGLSLSTRALRLKS 720
RKSEKLKFEANERDEGELERTGQ VSIYEIYSGAGAWPFMHHGS YRGLSLSTRALRLKS
Sbjct: 661 RKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFMHHGSLYRGLSLSTRALRLKS 720
Query: 721 DDVNAVGRLPLLNDSYHMDILCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCTK 780
DDVNAVGRLPLLNDSY++D LCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL K
Sbjct: 721 DDVNAVGRLPLLNDSYYLDALCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLGKK 780
Query: 781 AENVLEETIRDKPKGDVIYFWAHFHVNGGIIGSSNAPTFWSACDILNGGLCRTAFENTFR 840
AENVLE+TIRD P+GDVIYFWAH VN G + PTFWS CDILNGGLCRT F +TFR
Sbjct: 781 AENVLEDTIRDNPQGDVIYFWAHLQVNRGTL----PPTFWSVCDILNGGLCRTTFGSTFR 840
Query: 841 EMYGLSANMEALPPMPEDGGCWSALHSWVMPTPSFLEFMMFSRMFTHYLDALNRNQSQPY 900
EM+GLS+NM ALPPMPEDGG WSALHSWVMPTPSFLEF+MFSRMFTHYLDALNRNQSQP
Sbjct: 841 EMFGLSSNMGALPPMPEDGGHWSALHSWVMPTPSFLEFIMFSRMFTHYLDALNRNQSQPN 900
Query: 901 GCMLASSELEKKHCYCRISEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRMEFMWAK 960
GC+ A SE+EKKHCYCRI E+LVNVWAYHSGRRMVYI+P SGFLEEQHPVEQR EFMWAK
Sbjct: 901 GCLFAFSEIEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAK 960
Query: 961 YFNLTLLKSMDEDLAEVADDEGGSNKMGLWPLTGEVHWQGIYERAREERYRLKMDKKRTT 1020
YFN TLLKSMDEDLAE ADDEGGS K+GLWPLTGEVHWQGIYER REERYR+KMDKKRTT
Sbjct: 961 YFNFTLLKSMDEDLAEAADDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTT 1020
Query: 1021 KVKLIERMKFGYKQKSLGG 1038
KVKL+ERMKFGYKQKSLGG
Sbjct: 1021 KVKLMERMKFGYKQKSLGG 1034
BLAST of MC04g1449 vs. ExPASy TrEMBL
Match:
A0A1S3C3I4 (uncharacterized protein LOC103496475 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496475 PE=4 SV=1)
HSP 1 Score: 1837 bits (4759), Expect = 0.0
Identity = 911/1039 (87.68%), Postives = 963/1039 (92.69%), Query Frame = 0
Query: 1 MRRSSSSEIDDNGSANVVPGVHSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQSRIN 60
MRRSSSSEIDDN SAN VPG HSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQ+RIN
Sbjct: 1 MRRSSSSEIDDNASANAVPGTHSIRDRFPFKRNSSHFRLRVKDSLDHAASRSRSHQTRIN 60
Query: 61 RKGFLWWLPARGNTLFYCVVIFAVFAFVTGSVMLQSSITLMSSHGSEKGRWLMERIKFGS 120
RKG L W+PARG TLFY +V+FAVF F TGS++LQSSI+L+SSHGS++ RWLMERIKFGS
Sbjct: 61 RKGLLSWIPARGQTLFYFLVVFAVFGFFTGSMLLQSSISLLSSHGSQRERWLMERIKFGS 120
Query: 121 SLKFVPGRISRRLVEGDGLDEARKKDRVGVRAPRLALILGSTERDPQSLMLVTVMKNIQK 180
SLKFVPGRISRRLVEGDGL+E RKKDRVGVRAPRLALILGS E DPQSLML+TVMKN+QK
Sbjct: 121 SLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNMQK 180
Query: 181 LGYVLEIFAVEGGNKHSMWKQIGGQPSILSPEHYGHVDWSIYDGIVADSLEAEGAIASLM 240
LGYV EIFAVE GNK SMW+QIG QPSILSP HYG VDWSIYDGI+ADSLE EGAIASLM
Sbjct: 181 LGYVFEIFAVESGNKQSMWEQIG-QPSILSPGHYGRVDWSIYDGIIADSLETEGAIASLM 240
Query: 241 QEPFCSIPLVWIIREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMMYST 300
QEPFCS+PL+WI+REDTLA+RLPMYEQRGWKHLISHWK SFRRANVVVFPDFALPM+YS
Sbjct: 241 QEPFCSLPLIWIVREDTLASRLPMYEQRGWKHLISHWKRSFRRANVVVFPDFALPMLYSI 300
Query: 301 LDNGNFYVIPGSPADVYAAENFKNVHSKSQLREKNGFNEDDILVLVVGSLFFPNELSWDY 360
LDNGNF+VIPGSPADVYAAEN+ NVHSKSQLREKNGFN DDILVLVVGSLFFPNELSWDY
Sbjct: 301 LDNGNFHVIPGSPADVYAAENYMNVHSKSQLREKNGFNGDDILVLVVGSLFFPNELSWDY 360
Query: 361 AVAMHSIGPLLTNYARQ-EVGGSFKFVFLCCNSTDGSHDALQEIASRLGLPDGSITHYGL 420
AVAMHSIGPLL+ YAR+ EV GSFKFVFLCCNSTDGSHDAL+EIASRLGLPDGSITHYGL
Sbjct: 361 AVAMHSIGPLLSIYARRREVEGSFKFVFLCCNSTDGSHDALKEIASRLGLPDGSITHYGL 420
Query: 421 NGDVNGVLMMADIVLYGSSQEIQSFPLLLIRAMSFGIPIMVPDLPALRNYIVDGVHGIIF 480
NGDVN VLMMADIVLYGSSQEIQSFP LLIRAMSFGIPIMVPDLPALRNYIVDGVHG+IF
Sbjct: 421 NGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPALRNYIVDGVHGVIF 480
Query: 481 PKHDSDALLRAFSEMISDGKLSRYAQAIASSGRLLAKNILASECVTSYARLLENVLNFPS 540
PKH+ DALL +FS+MISDGKLSR+AQAIASSGRLLAKNILASECVT Y +LLENVLNFPS
Sbjct: 481 PKHNPDALLSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVTGYVQLLENVLNFPS 540
Query: 541 DVKLPGSFSQLQLGAWEWNLFRKETVQTIDGNEDAEEGITAISKSSVIFALEAQLTNFVN 600
DVKLPG SQLQLGAWEWNLFRKE V+TID N D EE I AISK+SVIFALEAQLTN VN
Sbjct: 541 DVKLPGPASQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKASVIFALEAQLTNSVN 600
Query: 601 LTNFSETGNGTLEQDLPTPQDWDILEEIQNAEEYETVEMEEFQERMERDLGAWDEIYRNA 660
LT SE NGTLEQD+PTPQDWDILEEI++AEEYETVEMEEFQERMERDLGAWDEIYRNA
Sbjct: 601 LTILSENENGTLEQDIPTPQDWDILEEIESAEEYETVEMEEFQERMERDLGAWDEIYRNA 660
Query: 661 RKSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFMHHGSFYRGLSLSTRALRLKS 720
RKSEKLKFEANERDEGELERTGQ VSIYEIYSGAGAWPFMHHGS YRGLSLSTRALRLKS
Sbjct: 661 RKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFMHHGSLYRGLSLSTRALRLKS 720
Query: 721 DDVNAVGRLPLLNDSYHMDILCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCTK 780
DDVNAVGRLPLLNDSY++D LCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL K
Sbjct: 721 DDVNAVGRLPLLNDSYYLDALCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLGKK 780
Query: 781 AENVLEETIRDKPKGDVIYFWAHFHVNGGIIGSSNAPTFWSACDILNGGLCRTAFENTFR 840
AENVLE+TIRD P+GDVIYFWAH VN G + PTFWS CDILNGGLCRT F +TFR
Sbjct: 781 AENVLEDTIRDNPQGDVIYFWAHLQVNRGTL----PPTFWSVCDILNGGLCRTTFGSTFR 840
Query: 841 EMYGLSANMEALPPMPEDGGCWSALHSWVMPTPSFLEFMMFSRMFTHYLDALNRNQSQPY 900
EM+GLS+NM ALPPMPEDGG WSALHSWVMPTPSFLEF+MFSRMFTHYLDALNRNQSQP
Sbjct: 841 EMFGLSSNMGALPPMPEDGGHWSALHSWVMPTPSFLEFIMFSRMFTHYLDALNRNQSQPN 900
Query: 901 GCMLASSELEKKHCYCRISEILVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRMEFMWAK 960
GC+ A SE+EKKHCYCRI E+LVNVWAYHSGRRMVYI+P SGFLEEQHPVEQR EFMWAK
Sbjct: 901 GCLFAFSEIEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAK 960
Query: 961 YFNLTLLKSMDEDLAEVADDEGGSNKMGLWPLTGEVHWQGIYERAREERYRLKMDKKRTT 1020
YFN TLLKSMDEDLAE ADDEGGS K+GLWPLTGEVHWQGIYER REERYR+KMDKKRTT
Sbjct: 961 YFNFTLLKSMDEDLAEAADDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTT 1020
Query: 1021 KVKLIERMKFGYKQKSLGG 1038
KVKL+ERMKFGYKQKSLGG
Sbjct: 1021 KVKLMERMKFGYKQKSLGG 1034
BLAST of MC04g1449 vs. TAIR 10
Match:
AT5G04480.1 (UDP-Glycosyltransferase superfamily protein )
HSP 1 Score: 1303.9 bits (3373), Expect = 0.0e+00
Identity = 644/1054 (61.10%), Postives = 793/1054 (75.24%), Query Frame = 0
Query: 1 MRRSSSSEIDDNG--------SANVVPG-----VHSIRDRFPFKRNSSHFRLRVKDSLDH 60
+R S S EIDDNG +AN V G HSIRDR KRNSS R R LD
Sbjct: 2 VRNSLSLEIDDNGGAGRDGNHNANNVAGNGDTSFHSIRDRLRLKRNSSDRRDRSHSGLDR 61
Query: 61 AASRSRSHQ--SRINRKGFLWWLPARGNTLFYCVVIFAVFAFVTGSVMLQSSITLMSSHG 120
+ R+R H +NRKG L L RG L Y +V F V AFV S++LQ+SIT G
Sbjct: 62 PSLRTRPHHIGRSLNRKGLLSLLKPRGTCLLYFLVAFTVCAFVMSSLLLQNSITW---QG 121
Query: 121 SEKGRWLMERIKFGSSLKFVPGRISRRLVEGDGLDEARKKDRVGVRAPRLALILGSTERD 180
+ KG + +I GS+LK+VPG I+R L+EG GLD R R+GVR PRLAL+LG+ ++D
Sbjct: 122 NVKGGQVRSQIGLGSTLKYVPGGIARTLIEGKGLDPLRSAVRIGVRPPRLALVLGNMKKD 181
Query: 181 PQSLMLVTVMKNIQKLGYVLEIFAVEGGNKHSMWKQIGGQPSILSPEHYGHVDWSIYDGI 240
P++LMLVTVMKN+QKLGYV ++FAVE G S+W+Q+ G +L E GH DW+I++G+
Sbjct: 182 PRTLMLVTVMKNLQKLGYVFKVFAVENGEARSLWEQLAGHVKVLVSEQLGHADWTIFEGV 241
Query: 241 VADSLEAEGAIASLMQEPFCSIPLVWIIREDTLANRLPMYEQRGWKHLISHWKSSFRRAN 300
+ADSLEA+ AI+SLMQEPF S+PL+WI+ ED LANRLP+Y++ G LISHW+S+F RA+
Sbjct: 242 IADSLEAKEAISSLMQEPFRSVPLIWIVHEDILANRLPVYQRMGQNSLISHWRSAFARAD 301
Query: 301 VVVFPDFALPMMYSTLDNGNFYVIPGSPADVYAAENFKNVHSKSQLREKNGFNEDDILVL 360
VVVFP F LPM++S LD+GNF VIP S DV+AAE++ H+K LRE N F EDD+++L
Sbjct: 302 VVVFPQFTLPMLHSVLDDGNFVVIPESVVDVWAAESYSETHTKQNLREINEFGEDDVIIL 361
Query: 361 VVGSLFFPNELSWDYAVAMHSIGPLLTNYA-RQEVGGSFKFVFLCCNSTDGSHDALQEIA 420
V+GS FF +E SWD AVAMH +GPLLT Y R++ GSFKFVFL NST G DA+QE+A
Sbjct: 362 VLGSSFFYDEFSWDNAVAMHMLGPLLTRYGRRKDTSGSFKFVFLYGNSTKGQSDAVQEVA 421
Query: 421 SRLGLPDGSITHYGLNGDVNGVLMMADIVLYGSSQEIQSFPLLLIRAMSFGIPIMVPDLP 480
SRLGL +G++ H+GLN DVN VL MADI++Y SSQE Q+FP L++RAMSFGIPI+ PD P
Sbjct: 422 SRLGLTEGTVRHFGLNEDVNRVLRMADILVYASSQEEQNFPPLIVRAMSFGIPIITPDFP 481
Query: 481 ALRNYIVDGVHGIIFPKHDSDALLRAFSEMISDGKLSRYAQAIASSGRLLAKNILASECV 540
++ Y+ D VHGI F ++D DALL+AFS +ISDG+LS++AQ IASSGRLL KN++A+EC+
Sbjct: 482 IMKKYMADEVHGIFFRRNDPDALLKAFSPLISDGRLSKFAQTIASSGRLLTKNLMATECI 541
Query: 541 TSYARLLENVLNFPSDVKLPGSFSQLQLGAWEWNLFRKETVQTIDGNEDAEEGITAISKS 600
T YARLLEN+L+FPSD LPGS SQLQ+ AWEWN FR E Q D+ I KS
Sbjct: 542 TGYARLLENMLHFPSDTFLPGSISQLQVAAWEWNFFRSELEQPKSFILDS--AYAFIGKS 601
Query: 601 SVIFALEAQLTNFVNLTNFSETGNGTLEQDLPTPQDWDILEEIQNAEEYETVEMEEFQER 660
++F +E + + TN + + +LP+ DWD+LEEI+ AEEYE VE EE ++R
Sbjct: 602 GIVFQVEEKFMGVIESTNPVDNNTLFVSDELPSKLDWDVLEEIEGAEEYEKVESEELEDR 661
Query: 661 MERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFMHHGSF 720
MERD+ W+EIYRNARKSEKLKFE NERDEGELERTG+P+ IYEIY+GAGAWPF+HHGS
Sbjct: 662 MERDVEDWEEIYRNARKSEKLKFEVNERDEGELERTGEPLCIYEIYNGAGAWPFLHHGSL 721
Query: 721 YRGLSLSTRALRLKSDDVNAVGRLPLLNDSYHMDILCEIGGMFAIANKIDNIHKRPWIGF 780
YRGLSLS++ RL SDDV+A RLPLLND+Y+ DILCEIGGMF++ANK+D+IH RPWIGF
Sbjct: 722 YRGLSLSSKDRRLSSDDVDAADRLPLLNDTYYRDILCEIGGMFSVANKVDSIHMRPWIGF 781
Query: 781 QSWRASGRKVSLCTKAENVLEETIRDKPKGDVIYFWAHFHVNGGIIGSSNAPTFWSACDI 840
QSWRA+GRKVSL +KAE LE I+ + KG++IYFW ++G GS NA TFWS CDI
Sbjct: 782 QSWRAAGRKVSLSSKAEESLENIIKQETKGEIIYFWTRLDIDGDAYGSKNALTFWSMCDI 841
Query: 841 LNGGLCRTAFENTFREMYGLSANMEALPPMPEDGGCWSALHSWVMPTPSFLEFMMFSRMF 900
LN G CRT FE+ FR MYGL ++EALPPMPEDG WS+LH+WVMPTPSFLEF+MFSRMF
Sbjct: 842 LNQGNCRTTFEDAFRHMYGLPEHIEALPPMPEDGHHWSSLHNWVMPTPSFLEFVMFSRMF 901
Query: 901 THYLDALNRNQSQPYGCMLASSELEKKHCYCRISEILVNVWAYHSGRRMVYIDPHSGFLE 960
+ LDAL+ N + C LASS LE+KHCYCR+ E+LVNVWAYHSGR+MVYI+P G LE
Sbjct: 902 SESLDALHNNLNDSKSCSLASSLLERKHCYCRVLELLVNVWAYHSGRKMVYINPRDGSLE 961
Query: 961 EQHPVEQRMEFMWAKYFNLTLLKSMDEDLAEVADDEGGSNKMGLWPLTGEVHWQGIYERA 1020
EQHP++QR MWAKYFN TLLKSMDEDLAE ADD+ + LWPLTGEVHW+G+YER
Sbjct: 962 EQHPLQQRKGLMWAKYFNFTLLKSMDEDLAEAADDKDHPRERWLWPLTGEVHWKGVYERE 1021
Query: 1021 REERYRLKMDKKRTTKVKLIERMKFGYKQKSLGG 1039
REERYRLKMDKKR TK KL +R+K GYKQKSLGG
Sbjct: 1022 REERYRLKMDKKRKTKEKLYDRIKNGYKQKSLGG 1050
BLAST of MC04g1449 vs. TAIR 10
Match:
AT5G04480.2 (UDP-Glycosyltransferase superfamily protein )
HSP 1 Score: 1270.0 bits (3285), Expect = 0.0e+00
Identity = 633/1054 (60.06%), Postives = 779/1054 (73.91%), Query Frame = 0
Query: 1 MRRSSSSEIDDNG--------SANVVPG-----VHSIRDRFPFKRNSSHFRLRVKDSLDH 60
+R S S EIDDNG +AN V G HSIRDR KRNSS R R LD
Sbjct: 2 VRNSLSLEIDDNGGAGRDGNHNANNVAGNGDTSFHSIRDRLRLKRNSSDRRDRSHSGLDR 61
Query: 61 AASRSRSHQ--SRINRKGFLWWLPARGNTLFYCVVIFAVFAFVTGSVMLQSSITLMSSHG 120
+ R+R H +NRKG L L RG L Y +V F V AFV S++LQ+SIT G
Sbjct: 62 PSLRTRPHHIGRSLNRKGLLSLLKPRGTCLLYFLVAFTVCAFVMSSLLLQNSITW---QG 121
Query: 121 SEKGRWLMERIKFGSSLKFVPGRISRRLVEGDGLDEARKKDRVGVRAPRLALILGSTERD 180
+ KG + +I GS+LK+VPG I+R L+EG GLD R R+GVR PRLAL+LG+ ++D
Sbjct: 122 NVKGGQVRSQIGLGSTLKYVPGGIARTLIEGKGLDPLRSAVRIGVRPPRLALVLGNMKKD 181
Query: 181 PQSLMLVTVMKNIQKLGYVLEIFAVEGGNKHSMWKQIGGQPSILSPEHYGHVDWSIYDGI 240
P++LMLV FAVE G S+W+Q+ G +L E GH DW+I++G+
Sbjct: 182 PRTLMLV---------------FAVENGEARSLWEQLAGHVKVLVSEQLGHADWTIFEGV 241
Query: 241 VADSLEAEGAIASLMQEPFCSIPLVWIIREDTLANRLPMYEQRGWKHLISHWKSSFRRAN 300
+ADSLEA+ AI+SLMQEPF S+PL+WI+ ED LANRLP+Y++ G LISHW+S+F RA+
Sbjct: 242 IADSLEAKEAISSLMQEPFRSVPLIWIVHEDILANRLPVYQRMGQNSLISHWRSAFARAD 301
Query: 301 VVVFPDFALPMMYSTLDNGNFYVIPGSPADVYAAENFKNVHSKSQLREKNGFNEDDILVL 360
VVVFP F LPM++S LD+GNF VIP S DV+AAE++ H+K LRE N F EDD+++L
Sbjct: 302 VVVFPQFTLPMLHSVLDDGNFVVIPESVVDVWAAESYSETHTKQNLREINEFGEDDVIIL 361
Query: 361 VVGSLFFPNELSWDYAVAMHSIGPLLTNYA-RQEVGGSFKFVFLCCNSTDGSHDALQEIA 420
V+GS FF +E SWD AVAMH +GPLLT Y R++ GSFKFVFL NST G DA+QE+A
Sbjct: 362 VLGSSFFYDEFSWDNAVAMHMLGPLLTRYGRRKDTSGSFKFVFLYGNSTKGQSDAVQEVA 421
Query: 421 SRLGLPDGSITHYGLNGDVNGVLMMADIVLYGSSQEIQSFPLLLIRAMSFGIPIMVPDLP 480
SRLGL +G++ H+GLN DVN VL MADI++Y SSQE Q+FP L++RAMSFGIPI+ PD P
Sbjct: 422 SRLGLTEGTVRHFGLNEDVNRVLRMADILVYASSQEEQNFPPLIVRAMSFGIPIITPDFP 481
Query: 481 ALRNYIVDGVHGIIFPKHDSDALLRAFSEMISDGKLSRYAQAIASSGRLLAKNILASECV 540
++ Y+ D VHGI F ++D DALL+AFS +ISDG+LS++AQ IASSGRLL KN++A+EC+
Sbjct: 482 IMKKYMADEVHGIFFRRNDPDALLKAFSPLISDGRLSKFAQTIASSGRLLTKNLMATECI 541
Query: 541 TSYARLLENVLNFPSDVKLPGSFSQLQLGAWEWNLFRKETVQTIDGNEDAEEGITAISKS 600
T YARLLEN+L+FPSD LPGS SQLQ+ AWEWN FR E Q D+ I KS
Sbjct: 542 TGYARLLENMLHFPSDTFLPGSISQLQVAAWEWNFFRSELEQPKSFILDS--AYAFIGKS 601
Query: 601 SVIFALEAQLTNFVNLTNFSETGNGTLEQDLPTPQDWDILEEIQNAEEYETVEMEEFQER 660
++F +E + + TN + + +LP+ DWD+LEEI+ AEEYE VE EE ++R
Sbjct: 602 GIVFQVEEKFMGVIESTNPVDNNTLFVSDELPSKLDWDVLEEIEGAEEYEKVESEELEDR 661
Query: 661 MERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQPVSIYEIYSGAGAWPFMHHGSF 720
MERD+ W+EIYRNARKSEKLKFE NERDEGELERTG+P+ IYEIY+GAGAWPF+HHGS
Sbjct: 662 MERDVEDWEEIYRNARKSEKLKFEVNERDEGELERTGEPLCIYEIYNGAGAWPFLHHGSL 721
Query: 721 YRGLSLSTRALRLKSDDVNAVGRLPLLNDSYHMDILCEIGGMFAIANKIDNIHKRPWIGF 780
YRGLSLS++ RL SDDV+A RLPLLND+Y+ DILCEIGGMF++ANK+D+IH RPWIGF
Sbjct: 722 YRGLSLSSKDRRLSSDDVDAADRLPLLNDTYYRDILCEIGGMFSVANKVDSIHMRPWIGF 781
Query: 781 QSWRASGRKVSLCTKAENVLEETIRDKPKGDVIYFWAHFHVNGGIIGSSNAPTFWSACDI 840
QSWRA+GRKVSL +KAE LE I+ + KG++IYFW ++G GS NA TFWS CDI
Sbjct: 782 QSWRAAGRKVSLSSKAEESLENIIKQETKGEIIYFWTRLDIDGDAYGSKNALTFWSMCDI 841
Query: 841 LNGGLCRTAFENTFREMYGLSANMEALPPMPEDGGCWSALHSWVMPTPSFLEFMMFSRMF 900
LN G CRT FE+ FR MYGL ++EALPPMPEDG WS+LH+WVMPTPSFLEF+MFSRMF
Sbjct: 842 LNQGNCRTTFEDAFRHMYGLPEHIEALPPMPEDGHHWSSLHNWVMPTPSFLEFVMFSRMF 901
Query: 901 THYLDALNRNQSQPYGCMLASSELEKKHCYCRISEILVNVWAYHSGRRMVYIDPHSGFLE 960
+ LDAL+ N + C LASS LE+KHCYCR+ E+LVNVWAYHSGR+MVYI+P G LE
Sbjct: 902 SESLDALHNNLNDSKSCSLASSLLERKHCYCRVLELLVNVWAYHSGRKMVYINPRDGSLE 961
Query: 961 EQHPVEQRMEFMWAKYFNLTLLKSMDEDLAEVADDEGGSNKMGLWPLTGEVHWQGIYERA 1020
EQHP++QR MWAKYFN TLLKSMDEDLAE ADD+ + LWPLTGEVHW+G+YER
Sbjct: 962 EQHPLQQRKGLMWAKYFNFTLLKSMDEDLAEAADDKDHPRERWLWPLTGEVHWKGVYERE 1021
Query: 1021 REERYRLKMDKKRTTKVKLIERMKFGYKQKSLGG 1039
REERYRLKMDKKR TK KL +R+K GYKQKSLGG
Sbjct: 1022 REERYRLKMDKKRKTKEKLYDRIKNGYKQKSLGG 1035
BLAST of MC04g1449 vs. TAIR 10
Match:
AT4G01210.1 (glycosyl transferase family 1 protein )
HSP 1 Score: 656.8 bits (1693), Expect = 3.0e-188
Identity = 350/899 (38.93%), Postives = 521/899 (57.95%), Query Frame = 0
Query: 147 RVGVRAPRLALILGSTERDPQSLMLVTVMKNIQKLGYVLEIFAVEGGNKHSMWKQIGGQP 206
R G R P+LAL+ G DP+ +++V++ K +Q++GY +E++++E G +S+W+++G
Sbjct: 141 RFGFRKPKLALVFGDLLADPEQVLMVSLSKALQEVGYAIEVYSLEDGPVNSIWQKMGVPV 200
Query: 207 SILSPEHYGH--VDWSIYDGIVADSLEAEGAIASLMQEPFCSIPLVWIIREDTLANRLPM 266
+IL P +DW YDGI+ +SL A MQEPF S+PL+W+I E+TLA R
Sbjct: 201 TILKPNQESSCVIDWLSYDGIIVNSLRARSMFTCFMQEPFKSLPLIWVINEETLAVRSRQ 260
Query: 267 YEQRGWKHLISHWKSSFRRANVVVFPDFALPMMYSTLDNGNFYVIPGSPADVYAAENFKN 326
Y G L++ WK F RA+VVVF ++ LP++Y+ D GNFYVIPGSP +V A+N +
Sbjct: 261 YNSTGQTELLTDWKKIFSRASVVVFHNYLLPILYTEFDAGNFYVIPGSPEEVCKAKNLEF 320
Query: 327 VHSKSQLREKNGFNEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLTNYARQEVGGSFK 386
K DD+++ +VGS F ++A+ + ++ PL + + K
Sbjct: 321 PPQK-----------DDVVISIVGSQFLYKGQWLEHALLLQALRPLFSGNYLESDNSHLK 380
Query: 387 FVFLCCNSTDGSHDALQEIASRLGLPDGSITHYGLNGDVNGVLMMADIVLYGSSQEIQSF 446
+ L + A++ I+ L P ++ H + G+V+ +L +D+V+YGS E QSF
Sbjct: 381 IIVLGGETASNYSVAIETISQNLTYPKEAVKHVRVAGNVDKILESSDLVIYGSFLEEQSF 440
Query: 447 PLLLIRAMSFGIPIMVPDLPALRNYIVDGVHGIIFPKHDSDALLRAFSEMISDGKLSRYA 506
P +L++AMS G PI+ PDL +R Y+ D V G +FPK + L + E+I++GK+S A
Sbjct: 441 PEILMKAMSLGKPIVAPDLFNIRKYVDDRVTGYLFPKQNLKVLSQVVLEVITEGKISPLA 500
Query: 507 QAIASSGRLLAKNILASECVTSYARLLENVLNFPSDVKLPGSFSQLQ---LGAWEWNLFR 566
Q IA G+ KN++A E + YA LLEN+L F S+V P ++ W W+ F
Sbjct: 501 QKIAMMGKTTVKNMMARETIEGYAALLENMLKFSSEVASPKDVQKVPPELREEWSWHPF- 560
Query: 567 KETVQTIDGNEDAEEGITAISKSSVIFALEAQLTNFVNLTNFSETGNGTLEQDLPTPQDW 626
+ + T N A + A++ N T G + D + W
Sbjct: 561 EAFMDTSPNNRIARS-----------YEFLAKVEGHWNYTPGEAMKFGAVNDDSFVYEIW 620
Query: 627 DILEEIQNAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTG 686
+ +Q + E EE + R+ + G W+++Y++A+++++ K + +ERDEGEL RTG
Sbjct: 621 EEERYLQMMNSKKRREDEELKSRVLQYRGTWEDVYKSAKRADRSKNDLHERDEGELLRTG 680
Query: 687 QPVSIYEIYSGAGAWPFMHHGSFYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYHMDILC 746
QP+ IYE Y G G W F+H YRG+ LS + R + DDV+A RLPL N+ Y+ D L
Sbjct: 681 QPLCIYEPYFGEGTWSFLHQDPLYRGVGLSVKGRRPRMDDVDASSRLPLFNNPYYRDALG 740
Query: 747 EIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCTKAENVLEETIRDKPKGDVIYFWA 806
+ G FAI+NKID +HK WIGFQSWRA+ RK SL AE+ L I+ + GD +YFW
Sbjct: 741 DFGAFFAISNKIDRLHKNSWIGFQSWRATARKESLSKIAEDALLNAIQTRKHGDALYFWV 800
Query: 807 HFHVNGGIIGSSNAPTFWSACDILNGGLCRTAFENTFREMYGLSANMEALPPMPEDGGCW 866
+ + FWS CD +N G CR A+ T ++MY + N+++LPPMPEDG W
Sbjct: 801 RMDKDP---RNPLQKPFWSFCDAINAGNCRFAYNETLKKMYSIK-NLDSLPPMPEDGDTW 860
Query: 867 SALHSWVMPTPSFLEFMMFSRMFTHYLDA-LNRNQSQPYGCMLASSELEKKHCYCRISEI 926
S + SW +PT SFLEF+MFSRMF LDA + + C L S + KHCY R+ E+
Sbjct: 861 SVMQSWALPTRSFLEFVMFSRMFVDSLDAQIYEEHHRTNRCYL--SLTKDKHCYSRVLEL 920
Query: 927 LVNVWAYHSGRRMVYIDPHSGFLEEQHPVEQRMEFMWAKYFNLTLLKSMDEDLAEVADDE 986
LVNVWAYHS RR+VYIDP +G ++EQH + R MW K+F+ T LK+MDEDLAE AD +
Sbjct: 921 LVNVWAYHSARRIVYIDPETGLMQEQHKQKNRRGKMWVKWFDYTTLKTMDEDLAEEADSD 980
Query: 987 GGSNKMG--LWPLTGEVHWQGIYERAREERYRLKMDKKRTTKVKLIERMKFGYKQKSLG 1038
++G LWP TGE+ W+G E+ ++++ K +KK+ ++ KL +QK +G
Sbjct: 981 ---RRVGHWLWPWTGEIVWRGTLEKEKQKKNLEKEEKKKKSRDKLSRMRSRSGRQKVIG 1007
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_022133863.1 | 0.0 | 99.81 | uncharacterized protein LOC111006310 isoform X1 [Momordica charantia] | [more] |
XP_022958089.1 | 0.0 | 90.17 | uncharacterized protein LOC111459418 isoform X1 [Cucurbita moschata] >KAG6602279... | [more] |
KAG7032959.1 | 0.0 | 90.08 | hypothetical protein SDJN02_07010 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
XP_022990225.1 | 0.0 | 89.69 | uncharacterized protein LOC111487177 isoform X1 [Cucurbita maxima] | [more] |
XP_038884759.1 | 0.0 | 87.49 | uncharacterized protein LOC120075439 isoform X1 [Benincasa hispida] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1C0E9 | 0.0 | 99.81 | uncharacterized protein LOC111006310 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1H431 | 0.0 | 90.17 | uncharacterized protein LOC111459418 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1JPJ0 | 0.0 | 89.69 | uncharacterized protein LOC111487177 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A5A7UUA8 | 0.0 | 87.68 | UDP-Glycosyltransferase superfamily protein isoform 3 OS=Cucumis melo var. makuw... | [more] |
A0A1S3C3I4 | 0.0 | 87.68 | uncharacterized protein LOC103496475 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |