MS016640 (gene) Bitter gourd (TR) v1

Overview
Name: MS016640
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: ARM repeat superfamily protein, putative
Location: scaffold587: 283032 .. 304854 (-)
RNA-Seq Expression: MS016640
Synteny: MS016640
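
The Location value packs the scaffold name, start, end, and strand into one string. The sketch below shows one way to split it apart and derive the gene span. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

# Pattern for strings such as "scaffold587: 283032 .. 304854 (-)".
_LOCATION_RE = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into scaffold, start, end, strand, and length.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = _LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("scaffold587: 283032 .. 304854 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 21823 bp on the minus strand of scaffold587.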
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGAAGACGAAGGGGGACTGCTCCTATGGAAGTCGGACTCAGCACCTCAATCTATGGTCTCAGTCACGGTTGGCAGAGTCATGGCCACTCTCCTCGGTGCTCGCCCTAAGAAGCTCGCCGACGCCGTCTCCCGTCTCTTCCCCGACCACCGCCGCGGAGCTTCTTCACTAGGTTCGTCTCTCATTTCACCTTCAATCGTTTCCGTTTGAGTGGTAGTATACAGCAATGAATTATGCTGGTTCCACAGATTCTCTGGACGAATCCCTCTGGTTTCTGCACAACTATGTCAGGGACGCTGCTCAAAACCACGCCTCTTTCGATGAAATCCTCGTCCCCATGATCGAACATGTAACCTCATTCATCATCATCTTTTTCTGTTTTATAATTTTATGTTTGCTAATGAAACTTCCAGTCCGTCGATTAAAACTTCAAGTGTAATCGCGCAGTCATTGAGATTCAAGGACAGGAACTGGAAGCGAGGGGGGCAAGTTATGGTGCTCCTCAACTGGCTGTTCCTTGAAGAGCTTATTTTCCAGCCGCTTATCAAAAATCTTGCTGATATTATTGCGAGGAAGGATGATCGCTATGTCGCCCTCGGCTGGTGTATCCTTGTTCGTAGTCTTGTGGAGTACGAGTCTGTCGCCAGTGAATTTTCATCGAATGGTACTGAAATTTGTATTTATAAACTAAAATGCTGTTGATAACGTCCCTTATTTTTGGTATTGACTTAATTTTCTTTTGTTTTGAATAGGGTTAAGGGAGAGATTCAACGATATGTTGAAGGTATTCTGCACATGCATTCCACGTCTGGCATGTATTTTAAGTAAAGGAAGGTAAATTCCTTCTCCTTGTCATAACAAGGCTGTCAGCACCTGCTTTTCATGTTAACTTGTGTATTTTGCCTTTTATCTTTTTGAATGTTGTAACAATTGGCACAGTCCAGTTGGCAAGAGCTTAGCTCTCTTGGTCATACGGGCTTAGAAGTCTCAAATTCAAATATTTTGGTGAGCTTAATTTCAAAAACCCTTGATGTCTCCCGAGTCCAGACCTTGAGACGGGCACGGGTGTCTTGGGAGCGAAGCTCTGACTCCCAGCTCTAAAAAGAAAAAAAAAGAAGAATAAAAAAAAGAATGTTGTAACAATTGGCTAATGCTGCAATGAATACCATATACCATCTTTCATCTTAAATATGCTGTTCCAATATTTCTCTAGACAACTTCACTTTCCAATGTGGAACTGTGTTATCTAATTTTGTTTGTGTTGCGTTTGCATTTTTGGCTTTTACATCTCATGGATTAATTTATTTTGCAGTAGTATGCAGGAAGGATTTGAGTTACCTTCTCGCCTCTCAGTCTGTGCTGCTGATTGTGTTGTATCTCTGACTAATGCGCTGACCAGAAAGTCTGAGGTTCAAACTAGGCAGAAAAGATTAAATTCAAGCTCATCTCATCAGCAAGTAACTTTCTTTTCAACTATTGTTGATGACCAGCGAGAGAAACAAATTAGCAACTCTTCAAAAGACTCAGATTCAGGCATGGAATATTTACTCTGGCATCAATTAAAGGATCTCGTGATTTTAGTACAGAGGCTTCTTGCAGTATGTTTCTGTTTGCAATTCCATGTTATTCTTTTCTGTTAAGAATGAATAGACCACAAAAAGTTTAAGCTGATTGGTTGGGTAAATTTTCATTATAAATGGGGAGGAAACAACAATACAGCGCAAAATCTCCCACTTTGATACCATGTTAAGGATGAATAAATCACCACTCAACCCAAAAGTTTAAGCTGTTGGGTTGGGTAAATTTAATTCATATATATGGAGTCTCAACATTTTCTTCTATTTTTCATGGTACAATTTTAATTTGTAACAATCTTACATGGAACCTTGGTTTGCACGCTGTTTTTATCCCTATTAGCTTTCATTACTTATATATCTTTTTACATAAGAAATGTATTTATAACTTTATATTATATGCTTCACAAAAAAAGAAAAAAT
CTGCCTTTTCTATTATTCATTTATTTATTTTTTCCTTGCTTCACAATTAAGTTCAAGGGAGTTTTATTATAGTATGCCTCACTTTATATAAAAACACTAAATTTGAGTATGAATGATTATTTTATCTCTCTACGATAATTTACTTCTAATCCCATGCTGGCCAGTTACCTTTTGTGTTACGTCATCCTTAAAAAATAAGAGGATTTTTTAATCAAGGCAAATAGGTTCAAACTATGGTAGCCACCCACCTTGGATTTAAAATACTTTAAAGTTTTTTTTGCAACGATGCTTGCAAGGTAGCCACTCACAAATATCACAAAGGTGGATGATTCATTGGGGGATGAGGATGAGGGTGTGTTATTTTTTTATTTTTTGGAAATGAAATGATTTCATTGATAGAATGAAATAGGAGAGAGAAGATTACAAAATGTGGGAGTGGGAGAGAACCCAATAAGGGTCACAAAAAAGCTTTCCAATTGGATAATAATGAAGCAAAACTATAATGATGAAAAAGAAGAGAGCATTTACACCAAGATATAACTAAAAAAAAACTATAGATTCAAAAGAGTGATGAAAAGAGCGTTCCTTATCATGAAAGATCCTTCGATTTCTTTCCACCGATATGATCCATGGTAGAGCCCGTTTAAAATGAAGCTAGATACACTTTTTCTCATTCTTGAACGAATGACCATTGAGCGTAGAAGCCAAGAAAAGGGACATGTCGATAGGGAGAGTAGTCAACCATCCGAAAAGGGTAAGGATTTTGTTCCAGAGGCCAGATGTGTATTTGCATGAGACGAAAAGGTGGCCGATAGACCCACTATGATTTTTACATGAGATCAAATAATTAAGGAGCCTTTTAAAAACTTTGGATAATGCTCCTAGTGAAGGTACAGGTTCGGTAACTTTGACAAGTACAACTTCTCAGCCTAAACGTTGTACTTGGATTATTGATTCTGGTGCAACTGACCATATGACGAATAGTAACTGTCATTTTATCTCTTGTAGTCTTTGACTATCGAGCAATAAAAAATTTCCACAGCAGATGGTACCTTAGTAACAGTGGCTGGTCAAGGAGACATCAAGATAAATTCGAACATTATGTTAAAAAATATGCTCCATATACCACGTCTAACTGTTAATTTGATCTCAGTGAAGAATTTGATCAAAGATTTAGGATGTAGATTAATTTTTTTTGATTCTTCTTGTATTTTACAGGACCAGGGTAATGGGAAGACGATTGGACTTGCGAACGAAAAAAATGGACTCTATTATCTTGATGAACCTATTGGTCAAAATAGTTTTCCCAAACAATTTCGCTCTGACAATGCTCTAGCTTATTTCAATCAAAATCTAACATCATTATTTCAGGAAAATGGAGTCATCTTGTGTGGACACTCCTCAACAAAATGGAGTGGCCGAAAGAAAAATTGGTTACATTCTTTCTGTGGCACGTGCCATTCTTTTTCAAAGAAATTTTCCTAAATCCTTATTGGGGGAAGCTGTGCTAACAGCCACTTATTTAATCAATCGTTTACCATCTGCTACTCTAGATTAGAAAACTCCTATGGATATCCTATCAAAATTTTATCCTGATTTTCAAACGACAAATAACTTGATTCCTCGAGTATTTGGTTGTGTAGCCTTTGTTCATGTTCACAGTCATCGGCGAGGTAAACTTGATCCACGAGTGCTCAAGTGTATTTTTGTAGGATACTTGAACACTCAAAAGGGATATAAATGCTATCATCCTCCTTAAAAAAGGTACTTCATCTCTACTGATGTATCTTTTATGAAGACATGAGCTTCTTTACCTTACCTTATCTTTAGGGGGAGGAAATATTGTTGGAAGATAAGAATTTGCCCATTCCCAACCTCACATTTCCTGTATTAAATCCTTTACCATCACCTGTACCAAGCCCTATTGAAATTGACACCAACATTAGTCAATCCTCATCTTTACCACCCATAGACACCTCACCAGGGCCAAATAACCCTTT
ACCTCCCATAGGCACCTCACCAAGGCTATCTAACCCTTCTTCTACTATACCATTACAGATTTATTCAAGAAGGAACATTGAAAAAGATACGCAAGTCCAAGAGTTTGATCATATCTCTAGAACTGAAATTGACGATAAAAGAATCACTGATGAATCAAACATTACAATATCTGCTAATGATGAGTCAGACCTGGATTTGCCTATTGCTTTAAGGAAGGGAAAACGAACTTGTACTCAATATCCATTGTCTCACTTTGTATCATATGACAAATTCTCAAGTCAGTACAGAAGTTTCATGGTTAATCTTACTCAGGTAAATATTCCAAAAACATATATGGAGGCAATACAACATGAAGAATGGAGGCAAGCAATGAATCTTGAAATGTAGGCTTTAGAAAAAAATAGAACGTGGGAGTTGGTTACTATGCCTAAAGATAAGAAAGCTATAGGATATCGATGGATATATACTGTGAAACATAGAGCAGATGGAACACTTGAGAGGTACAAAACACGGTTGGTTGCTAAAGGATATACTCAAACATATGGAGTTGATTACCTTGAAACCTTTGCACCCGTTGCCAAGATGAACACGATACGAGTATTTCTATCTCTTGCTATAAATCATGGGTGAGACATGTTACAATATGATGTAAAAAATGCATTTTTGCATGGTGATCTTGAAGAAGAAATTTATATCGAGATTCCACCTGGGTATGAGCAAGCAGGAGATAAAGTTTTTAAATTAAGAAAGATGTTGTATGGATTAAAGCAATCTCCTAGGACATGATTTGGTAGATTTTTTCAAGTTATGAGGAAGATGAGCTATAAACAAAGTTAGGGAGATCATACATTGTTTATAAAATATTCTACTACAGGGGGAGTGACAACACTAATTGTTTATATCGATGATATTATTGTCACTGGTAATGATAATAACGAACAAGATAAATTGAGAAATTGTTTAATCCAAGAGTTTGAAGTTAAAAAACTAGGAAAGCTGAAATATTTTCTTGGAATAGAAGTTGCCTATTCTAAACAAGACATATTCCTATCACAACAGAAATATGTGATTGACTTTCTTTTTGAAACAGGAAAGCTTGGCAGCAAACCGATGGGAACGCCAATTGATCAAAACCATAGATTATGTGCATTTGAAGAGAGTCCTCCAGTAAATAAGGAAACTTACCAGAGGTTAGTTGGAAAATTAATATATCTTTCTCATACGAGACCAGATATTGCGTATGTGGTAGGAGTAGTAAGTCAATTTATGCATAGTCCAAAAGAAGTTCATCTCCAAGCAGTGTACCGAATACTTCACTATTTGAAAAGCTCCATTGGGAGAGGATTGTTGTTTGAGAAAGGCGAGAAACTCAATCTAGAAGTTTATACAGATGCAGATTATGCAGGATCGATAGATGATAGATAATCTACTTCTGACTATTGTACTTTATTTGGTGGAAACTTAGTGACATGGAGAAGTAGAAAACAAAATGTAGTTGCAAGATTGAGCTCAGAAGCAGAATTTAGAGCAATGACATTGGGCATCTGCGAATTACCGTGGATAAAAGTTATTTTAGAAGATTTGAAGATTCCATAGGAAATGCCTATGAAACTTTACTGTGATAATAAATCTGCCATCAGTATTGCACATAATCCAGTTCAACATGATAGAACCAAACATGTTGAGGTTGATAGACACTTCATAAAAGAGAAGTTAGACAGCGATTTCATATGTACTCCATTTGTACCAACAAATAATCAAATAGCAGATATCCTAACTAAAGGATTGAACAACGACAACATTGAGCATTTGGTTAGCAAGCTGGGAATGGAAAATATTCATTCACCAACTTGAGGAGAGTGTTAAATTATATTAGTTTCTTTTAATTTCCTATTGTATTTAGGTTAGGTTTCCTTGTTCTAACGGGTAAGTGTTTTCTTATCCTAATAGGCAAGAGTTTCTTTTCTTTTCTTCTTTGTAGCTCCTATTTATAGTCTTG
TATCAATTCTTTGGAAGTAAGTAAGAAATAATTATTTTTTCTTATCAATTAACACAAAGAGTACACCAGCAAGGAGAGAGTTGCATATAAGACATTCTCTCGTTTCTAGCTTGTCATGCGTGTTTAGTGCAAGAGGCAAGTTTCCCAAAGAAAGAACTTAATCTTTTTAGGATAGGAGTCTTTCCATATGGCCCTATAAAATTCGGGCGAAGATGAAGATTCAATGGGGCTTTGCATATCTTTTAGTAAAGAGTTGGTGGTGAAGGAGCCATTCGATTCCAAACAAAGGCGTCCCTAGTTGAGGAGAGTCGGATGGGAAACAAAATATGGAGAAGAGCAGCCAATTCCACAATCTCAATATCCTTAAGATTCCTTTGGGTAAATGGTGTCAAAGCTATCATCTCGAAGATTTGGTTTGAGAGAAGCCAAAGGATTTTTGAGGAAAAAAGAAGACCTTCGGTGGAGTGCTTCAGCTTGGCTAAGTTTAAAGCGTCTCAATGGTGTGCTCTATCCAATCTTTTTGCTAATTATTCTCCAAACATGATATATGTTAATTGGGGGGCTTTCATTTTCCCTAGTTAGACATACACAGGATGATACTCTGTCTTTTTATTTTTGCTATTCTGGATTGTATCTTTTTATTTTATGTTTTTTCACTCCCTTGGGAGTTTGTATCTTTGAACATTTTATGTTCCTTTTCATCTATCAATGAAAAGTTCGTTTCTTGTTAAAAAAAAAATATCCTTAAGATTCCTTCCAAAATGAATGTCCCATGAGTTGCTATCCAGGGACCAAAAATCTCTAATCCGGTCTTCTTTCGTGATATGCAAGGCAAAGAGGAGGGGATAGGTGTGGAAAAGAGGCTTGTTAGTGAGCCATATATCCTTCCAGAAAAGGGTAAGGGAGCCATAACCCACCTTATATGTGATTATATCCAATATCAAAGGGTTGTGCTTGGTAATGTCTCTCCATGGTCCTCTGGCCATTTTGAGGGAACGATTGGGGGCTGTTGTGCAAGAATCAAGAATGGAGTACTTAGCTTTAATAAGGCTTCTCCATAGAGTATGGTCTTCGGTCAAGTACCTCCAATTCCACTTGGTGAAGAGAGCTCTGTTTTTTGCAAGCTAAAAAGACCGAAGCGACCTTCCTCAAAAGATTTATGCACTTGTTCCCATCTAACAAGGTTCAAACCCCCATTGTTATTCTTTCCTTTGCCCATCTAACAAGGTTTAGACCCTATTGTTATTCATTCCCTTCAAAGGAAGGTTTAGATGACTTTCACTTTATCAAGCTCATTAGCAACTTTCGTGGGAATGGAGAAGAGAGAGGTAATAGATGGGGAGATTGGATTGGAAAGGGCAGCATTGGTGAGTGTGTGTCGTCTACGTTTGGAGATGTGTGTCTGATTCCATGACGAGAGCCGTTTTTTAACTTCCTCCAAAAACAGATCCCAAAAGGAGGGAGAGTGATAGTTGCTATTAAGAGGAAGACCAAGGTAGGTATTCGGCCAACTATCTTGTCTACACTCAAACCTGTCAGCAAGAGCCTCTAAGGCAGAATTCCCAACATTGATGCCCAAAATCTCAAATTTAGTTGGAGTTAAAGACTCAAATCCTCCCCAGAGACCTTGATGAGGTGAAACAGATTCTCGATTTTGTGATTATCATCTACAGAAAAGAGAGGGATATTATCAGCAAATTGAAGGTGGGTGATCTGTTGTGAGGAGCATCCCACTGAGAAACCTTAAATGGTTCTGGCGCTCGCTGCATGTTTTAAGAGCTTGCTAAGACCGTCAACAATAAGGATAAAAAGGGCTTGGGGTCCGAGCCTTGGAGTGGGCGCGGGTGCCCTGGGTATAGGGTAGGGGAGCAAAGCTCTGACTTCCAGTTATCAAAAAAGAAAAAACAAATCAATAAGGATAAAAAAAAGGGAGAGACGGGGTCTCCTTGCCTCATTCCCCTTGTGGCTTTGATCTTACCTTGAGGTTGACCATGAATGAT
AATCGAGAAGTTAGTTGATGAGATGTAGCCTCGGATCCATTTTCTCCACGTATGACTAAAACCTTTGGTCGCCAAGACTCCTTCGAGGAAGTCCCAATCGACCTTGTCAAATACCTTTTCTATATCAAGCTTGATTACCTCCCCTTTCTTCTTCTTCCTAACCCATTCTTCAATGAGCTCGTTGGCAATTAAGGAGCCATCTAGGATCTGTCTATTAGCCATAGATTGATAAGCTATTATAGTGTGTGGAAGAACACGTTTGAGCCTCTATGACTGGACACAGGCGATGAGCTTATATAAACAGGAGATCAAGCTGATCGGGCGATAGTAACTGATGGTTTTGGCATCCACCTTTTCTGAATAAGGTAGATATACATCTCGTTAAGGCAAGTGTTTATGATACCCATTTGAAAGACATCTTGGAAAACCCTCGTAATATCTTGTTTGAGGATGTTCCAATATATTTTTTTGTAGAATTCAGCCGTGAATCCATCCGCGCTGGACGTTTTGTTGGAACCGAGGTCTTTAACTGCTTGCCAGACTTCAGACTCTAGGAAGGGAGCTTCTAAGTGAAATGGGATCCCAATTGACTGGATGCGACAGAAACCCGACCCGTCGTTTATGTTATATAGGTCTCTATAAAAGGAGAAGAATTCTGTTTCTATAGCAACCTGTTCCATCAAGCTTCGGCCGTTCCATCAAGCTTCGGCCGCCCCTTGATAGGATCTCAAAAATAGTGCTTTTCTTATGCCTTGCAGCCAACACTCTGTGAAAAAAGCTCGTATTCATGTTAGGGCGACTTAGTAGAAAAAAGTTTAACCATTACTTTTTTGATTTAATTTTGGCTATTAGAGGGTCTTGTGGTTATCTTCTTGTCTATGGGTTTGGGTGGAAATCTTAGTTTTTTTTCCTTCATTGTTTTTTCCTTTTATGTATATTTTAAAATTTTTTAAAAACGGTTACGTTCATGCTTTTTTGGTACACAATTCACTTGACACCCTTTTTTTATTATAAAAAATGCAACCACATGTACAAGTATCTATTGCTCTGGTTAATCTTACCAAAAATTTTATTTTTATTTTTTAAACTATTATTTGTTTTTTATTCTCAACTACATTCTTTGGCTTTAGGTTTAGTGGTGCTGCTTTTGCAGTTTGTAGTGTGTGGTTTTGGCTTTTTGTTCTATTCTCTTGTATCTTTTTCATGGTGGCTTTTCGTTGGTTGTTCTTTCTTTTTTTTTGCTCCCTCTGGGAGTTTGTATCCCTGAACATTTTCTTCTTTTCATATTATCATTGAAAGGTTTGTTTCTTGTTAAAAAAAAACCATATTTTATATGTTCTCTTGGCCAGGATGTAGTACATTTAACATTCTTTTAATGTCTTATTTGATGGGCTTGGGAAGTCTAGCATGAATAATTTGATATCTAACAGTGTACTTTGGCTCAGGATATGAGTTGATTTAATTGTACTGATATTCTTCATTTGTACTTTAGTGGAGCAGGAAAAGTCGGCCATTGCATGCAAAAGGTCTGGAGCAAGTGCTTACGTGGTTGCATGAGATAAATGTGCATTATGGTAACTTCCAAAATGAGGCAGGTCCTACTTCTAATAAATTATTCAAATCATATCTTAATCGATCAGTTAATGTTGTATGTTAAATCATTTGTTTGGTTTGCCATAGGAAAGGCAAAATCAAATATTCTGCAAACTGGAGCATTGCTACTCTCTTCTTGTTGGAGGCATTACAGCATTTTATTATTTTTGGAAGATTATAGATTTTCTCAGCACTACCAAGAATGGTTGAACCAGTACTTGTCAGGCATCCAGGTTAAGTATTAAAAATTTAGTTTGACAATTTACATCATGATACTTTTAAGATTTTAATTTTAGGTAGTGCAGTTTGTTAATTTTTTGGAAAATTGGATTGCGAGCAACCGGTGCGGTTTTCCAATTGTTATTTTTTTACTTTCCTTTTTGTCTTATCTAATATATATTTACTTT
TCGTTTTCCATTAATGATTATACTATTTTATTTTATATGTTCGGTTTGAGTGTTTTTCTTTCTTTGTCTGCTCGATTTTGTATATCAATATCATGTAAATCTATGAAGTTTTTCTTTATCTAGAGAAGAAAGGTAAAAATTAATTTCTTCATGCATCAGGGCTCATCTAGTCTTTCCTTATTCTTACCCTAGATGTCGATGGTTTTTCCTTTCCACCGTTGTTGTTGAATGTTTCTGACAATATGGTAAACATCCCCTTCTGTAATTCTCTTCACTATGTCAGAATTTGGTGAAGGAGAATCCTAAAACAATTGAACCAACCGTGTTAAATGTGTTGGAGAAATTTGAAATCAAAAGCATGGAACCAGGCATCAAGTGCCCATCAGCTCCCACAAGTTCGGTGGTCTGAATTACTTGCACTAGCCACAAGTTATTTTCTTCATTGGGAGGATAAAACTGGTTACTTGACTGTATTAAAAATTTTAAAAGGCTCCTCACATGGATGAGGAAGTTACGTACCTATGGCTTGGCTAATGGATTCATAGGTCTTGGCATTTGTCAAACTAATTTTATTATTATTTTTTTTTGGAAGAGAAACAATTTCATTAATGCAATGAAATGTAAGATATACTAGAGGGGATACCATCCCAAACCAATGGAGCTACAAAAATGCTTTCCATTTGGATTTGGTTAAAGAGAGACAAAACTATAGTGTTGAAAGATGGGAGAACATTTGCACCAAGATACAACCAATGACACTATAGATTAAAAAATGTTACTGATAAATCACTCTTTTCCTTGAAAGATCCTATGATTTCTTTCTAGCCGAATCGACCACAGTAAGACCCTAATAAAATGCATCCACAGGATTTTAGCCTCATTAGTAAAGGGGTGTCCTGTGAGAGTAGAAGATAAAAGAGTGTGGAGTTCTGTCGGTAGGACCAATGACCAGCTAAAGGATGATAGGATGGTGTGCCAGGGTGTGGTTGAAAAGCTACAAGTAGCGAAAAGATGTCCCTGTGATTCACCATGCTGCTTACACAAAACACACCAGTGAGGAGAAAGGCACATCAACGGCAACCTCCTTTGGAGCTTGTCGTGGGTGTTTAAAGCTAGCAAACTGGCCTCCCAAAGAAAAAACTTGATATACTTTGGGTCTTTCCATATCGATTTATAGAGTAGACCATCAGTTTGAGGGGTTGAGGGATTTGCCATGTCGTGGATGAGAGTGCGGGTTGTGAAGAGTCCCGATGGTTCCAATCTCCATATAAGTCTGTCACTTTCTTGATTTTGATGATAGGAAGACAAACGGTGGAGAAACGATACTGATTTTGCCTCTTTGAGGTTCCTTCTAAAGCTGATGTCTGATGAATTTGTTGCTGTCGACCAACATTTCTTAACAGTAGCATCTTTTCTGGAGTGCAAGGGAAAGAGGAGAGGGCAAGTCTAGGCATTAGACAAAGGCACATTCTCCGGCAAATTATCCTTCCAGAAGTAAGTTTTTGCTCCATCACCAACCTTAAAGATGCTTTTGTCCATGATAAGTTGATGATGTCTGAGGATAGATTTCCAAGGACCACGACAGATGTTATGGAGGACTGTAAGGAGCTGTGGTGACGGTTTAGGAGCCCATATTTTTCAGCAATGACTCTTTTCCATAGGGCGGATTCTTCGACGTGGAATCTCCAAATCCATTTAGCAAGGAAGGCTTTGTTTTTCCTATTATGTAGCCAATTCCCAGCCCTCTATCCTCTAAGAGGAGGATTACCTTTTCTCATCTCACAAGATTCTGTCCCTTATTTGTGCTGTTTCCTTTCCATAGAAAGTTCCTCAGGAGAGAACTCTTTGGCTACTAGTGATTGAATGGAGAAGATAGATAGATAATATATAGGGAGATTGGTTAGTGTAGCTTGTAGGAGGGTGGGCCTACCACCTTTTGAGATAGGATAATTCTCCAAAGAGCTGAGCTTTTTCCTCATCTTTTCCAATATGGGATCCCA
AAAAGAGATTGTATACGGTTTACCATTCAAGGGCAGGCCAAGATATGAGTTGGGCCAGGATCCTTTTTTGCAAACAAACCAACCTGCCAAGAGATCCAAATCAGATTCCCTCATGTTGATACTCGTTAATTCGGCCTTTTGGAGATTGGAGTTCAGCCCTTACCCTTCTTCAAAGATTCGAATGGTGTTGAAAAGATTTGCAAGATGGCCGTCCTTTGTTGAAGAGAAAAGAATGGTATTGTCTACGAATTGGAGGTGGTGTATTTCCAATTGATTCTGCCCCACCAAGAAGCCCTTTATCAAGTTGGAAGCAGTAGCATGGGTGAGGAGTCTACTAAAAGTCTAAAACAGTCCACAACAAGGGTGAAGAGGGAAAGGTAGCATGGGTGAGGAGTCTACTAAACAGTCCACAACAAGGGTGAAGAGGAAAGGGGAGAGGGGGTCTCCTTGCTTGAGACCTCCAGAGGCTCTAATTTTACCTCTGGGCTTGCCATTGATGATAATAGAGAAATTTGGGGAGGATATACAACCTCTGATCCAAGATTCCATCGGTGACTGAAGATCACCTTCTGCTTCTGAGATGCCATCCAAGAAATCCTAATCCATCTTGTCAAAGGCTTTTTCAATGTCTAACTTTATTACCACCCCTTTCTCGTTTATCCGTTACAATCATCAATGAGCTTGTTTGCAATTAAGGATGCATATAGGATTTGTCTGTCAGCCACAAAAGTGGACTGTTGGGAGTGATGGTGCACGACAAGACCTTCTTCAGTCTCTCAGAAAGGACTCTAGCAATGCTTTTGTACATACATGAGATCAATCTAATGGGGCGAAAATCACTGACTGTGTGGGCATCGACCTTTTTTGGGATTAAGCAAATGCATGTCTCGTTAAGGTTGACATTTACAATACCCTGCTGGAAAAAATCTTGGAACACTCTTAAGGATGTCTTCTTTAAGAATGTTCCAATATTTTTTTATTTTTTTATAGAATTCTGCTGTATATCCTTCGAGGCCCGGTGTTTTGTTGTGTCCTGGATCCTTCATAGTCATCCAAATTTCTGTTTCTGTAAAGGGTGATTCAAGAAGAGCTCTCTGGTGTGCAGAGATGGGATCCCATTGGAGGGGATGTGGGAGAAATCTAGTACCTTTCTTTTTAGTGTAGAGGTTTTCATAAAAAGATAGGAACTCCTGTTCTATATCAATGTCTTCGATCAAACTTCTACCATATTGGGAAAGCACCTCCAAAATTGAGCTTCTTCTACGTCTGGCTGCAACCAAGCGGTGAAAGAAGCCTGACTTCACATCTCTTTCATTAAGGCATTTCACCTTGCAACGTTGGCGCCAATGCATTTCCTCTTTGACTGCTAAATCGAGGAGCTGTGCTTTCAAGGTGGTCCTTTGGGAGATCTGGGCTGAATGGCCCCCCTTTGTTCTTTTGTGTCTAGGGCCGCTAGATCCAGAAGAAGTTGGTCTCTTTGCCTATGAACGCATCCAAAAGTATATTTTTTCCATGATTGGATGATTGCTTCAAAGCCCTTTAGTTTTTGGATGAAACCGTGGCCATACCATCCTTGCAAGGGATTTGATTTCCACTAGAGATCCATCATGGGAGAGAAAGATGAATGTTGCATCCACATGTTTTCAAAACGGGAGGGAGAAGGGCCCCATTGGTTAATACACAGAGAGAGCTGAATCGGGAAGTGGTCTGATGTTTCCCTGTCCAGACGTTTGACCAATGCATTAGGGAATTTAGAGAGCAAAACATCAGTACATAAGAACCTATCAATAAGAGAGGGAGAAAGGACTCCCTCAAGTTGGACTAGGTGAAGAGACCATTGGTGAATGGGAGGTCATTGAGATTAGATTCTGCAATGAACTTGTTGAAGAGTCGCATGCCTTGGATGGGCCTCATATGATGGGACTTTTCATGAGACCATCTGGTGATGTTAAAATCGCCTCCGAGGAGCGAGAAGGTGGTGCAGCGGTTAGCAAGGCTGT
GGAGCTCCTTCCAAAAAAGGTGTCTTTCTCTAGCTTTGGAGGGGCCGTAAAGACCAGAGAGCCAAAAATGGAACCCTTCAGAAAGAGTGATCTTGAGCGGGAGGGAGAAGGAGCCAACAATTATATCTGCCACAGAGAAGGATGTGTCATTCCATGAGATGATAAGGCCCCTAAAAGAACTCGTGGATCCTTTGTAAGCCCAAGCAATGTTTCTGAAGCTCCAGAGTGATTTGATTTAAAGTCTATCATCTTGTTTTGAGTATTTATAGAAACTCGCCTTGAGTAAGGATCACAACTATTTGAAGTTATTCTCTTTGAGAGTTTTCCTTGATAGCATAGATCTCCTTTCAGACGACACTCATTTGAATGTTCCTAAAGAATTCATCGTCTCTTCTCTCAAATATATTTTAAGCCTTTCTCCTTCTTAGTTGAATGAGTATCTTGCCTTTCTTGATTCCGAACTCGTTAGGCTTTGAGAAGAATGGACCTTGACGTTAGAAGAATTTTCTAGAAGAAGGTTGGAGATTGAAGTCGCTCGGGCTAGGGAGCTAGCTGAAATTCAAGCCAAATTCTCCCTGGTTTTCAAAGATCTATATGCAGAAGATAAAGATGGAGAACCCGAAGTTAAAGACCTCTAAAATTTCTCTTTTGTGTCATTAGATCTTGGTGGATCTTTTTGTTTGAAGTTAGCTTGTTTCTTTTGTTGGCTTTTGATATCACTCTCATGTTGATTGTATTGCTCTCTGTCGAGCCTATATTTTGCTTTCGGGAGTTTGTATTCTTGAACTTTTTCTCCTTTTCATATATCAATGAAAAGTTGTTTCTTGTTAAAAAAAAAAAAGTTGGTCAGCAACTTCGATCTTGGTTTCTTGAAGAATAACGATGAAAGGGTTGAGGTGTTGGATGAAGCTCTTGGTTTGCGCTCGTTTATCTCTTGAGCCAATACCTCTGACATCCCAAGAAAGAATGTTCATTAAGAAGTGGAAGGCCCGCCGTTGGTTAGGGAGGAAGAGGGTCCATCGTAGTTGATGGAAGAATAAAGGTCGGATAATTTCTCTAACATACTTGGTCTACTTTCGCTGTTCATTATTTTGCTTTGGTGTCTTTGTGGGTAAAGGGTGGATGCCCAACCCCATGGTCTTAAGCGAAGGAACCACTGATCATAGATAAACAGAGAGGTGGATGGATGGGTCACGAAAGTATGGAGGGGAGGGGGGCAAATCCGCTTGTCTATCATCAAACACTTGCAATGCCAAGGAGTTAGCGTTGGTGGGGGTTGGTTTGAGGAGAAAGTCCATAGGTGGGAGATAGGGGAGGGGAAGATTGGAGAAGAGAGACGTGGGAGGTTGGATAGGGGGTGGAAAGGGGTTTTCTGTATCTGGTTCGGAGGGAAGGGGATTAGTGGTGAATTTTGTGCCAGAGACAAAGAATGCCTTTTTTCTCTATCTCAATGACTCTTTTGGGTTTGTAGTGAAGTTGACAAGGGTAAGGTAAGGGGGATAGGGGGATGTTTGGATGGGGGGTGGGTGATAAGGGCGTGTGAAGGGACTCTGTCCGTTTGACAGGGGGACGCAAATAAAGGCAAATGGAAGGTAGGTTTGATTCAAACACTGGGCTCTCATCCACGGGGCTCTTCTTTTGAGCAATGGTCAATTTGTGATGCAGTCGAGGTGGCAGGTGCAATGTGTTGACAACTAGCACATTTAATGCATCTTTATTGTTGGAGCAGCCAAAGTCAATGCTGGGACAAAAAAGGTCAGAGGGAGCCTGTTGCAAATCTAAAATATTGTTGTATCTCGAAGGCTACTGTAAGTCTGAATAAGTGCTGCTTCTCTTGTAGGGGCAGGTCTCGAGGGTACAGCCTGAGGAGGCCATCTTCTTTTCCTTTGGCGGTGGTCAAACTCAATGGATTTCAGCCGCATATCCGATGTATTTTTCGCTAGAGAAAAAAGGGTCTATCTAGACGATAAGAGCTTTGATTTTTTCAGGGCTTAGGTCAA
TGGTGGCTGGGATAAAACCCGTAGGATTTTTCTCCACACAAATAGTAGCTTCGAAAAGGTCCAGCCATGAGACAATCTTCTTTGAGGTCTCAATGAAACCGCCGCATATGTTGTCTATAAATCTTAGGGTGTCTTCATTCCATCGATCGGGAGGCAGATATCTCAGATCCAGCCCCCATATGAAGGAACCGCAGATGCTTTGTAGGAGGCGTGGCTATCCCGAAGTGCAATTTCAGGTCAAACTTACCCACTCTATACCAATCTTCAAAGTTGGTGTGAATTTGTGCTAGATCATCTATTCACATGTGAGAAGGGCTCTGTCGGCCGCAAATGGGTTAAAAAGGAGCAGAAAAAACTAATAGATTCTTGCATGGCCCTCATGATTTTGAACCAGTCATCATGGAGATGTTTTCTGAAAACAATAATGGAGGAGTCAAGGTTCGGGTTTGCGTTTATAGAGTTCCTGTTTGATTCTTAACTCCACCTTTGTTAATGGCTTCACCTACTTTATTTTCTTTTTGAACATGAAACAACTTTTCATTGATATATGAAAAGGAGAAAAAGTTCAAGAATACAAACCCCATAGGAGTGAGAGAAAAAGGAAAAAATGAAAGCAAAATATAGGTTCGCCAGAGAGCAATACAAACAACATGAGAGTGATACCAAAAACCAACAAAAGAAACAAGCTAACTTCAAACAAAAAGATCCACCAAGACGTGATGACACAAAAGAGAAATTTTAGAGGTCTTTAACTTCGGGTTCTCCATCCTTATTTTCTGCATATAGATGTTTGAAAACCAGGGAGAATTTGACTTGAAATTCAGCTAGCTCCCTAGCCCGAGCGACGTCAATATCCAGCCTTCTAGAAAATTCTTCTAACGTTAAGGTCTATTCTTCTCGAAGGCTAATGAGTTCGGAATCGAGAAGGCAAGATACTCATTCAACTAAGAGAAGAGACGATGAAATCTTTAGGAACATTCAAAGGAGTGTCGTCTGAAGGGAGATCCATGCTATCAAGGAAAACTCTCCAAGAGAAGAACTTCAAATAGTTGTGATCCTTCCTCAAGGTGAGACTTCACCTACTTTCAATACACCACAGTACTAGAAAGAAGGTTTTTGTTGTTGACAGACCGATGTTGTGGCAATAAATTTCTTGGGTTGGGGGTTTGGTGTCAAGTATTTAACGAGCTTGTAAAAGGAGTGCCAATCTCTTCTTTCTTCACCCACAGGGACCAGGATTTTGTTGATACTCCCATTATTGTCCAGCCGTTCAATTTCTACAATGGTGCCTTTCTTGTTGGACGATTTTTCAACCCAAATGGTATGGCCATTGATCCTTGTTTCTGTGAAGAATCTTTGATTAATAGGGACAGATAGGAGGGAGGAGAAACATATAGCAAGCCAGGATATGGAGGACCAATGTAAGTAAATGAAGAAGTACTTATCTTTTGAGGTTTAAGATATTTTGAGGCGACTACCTCGGTACCTTTGGTCAACTTCTATAGAGAAGCGTTTCCTTCCTATGGTGGCGGATCTGTGGGTTGATGCTGGTTTGTTGGTCATGGCTATTCGTACAGAAGCTTAATATCGTCATGAGGTCGGAGATGAGTGATGGGTAGGGTTGATGGTGGGGAGAGAGGTGACAGGATTCCATGGTCAACTATGGATTGGTCTAAGTATTGTAGTGATGGATAGAGTGCATTTGTCAAACTTATTGATCAAGGATATCATGGATGTTGAGCTTCTCCACGAAGAGATGGTATAGGTTGAGATATTTCTTCATCAATTTTTTCTGAGTACATAAAATTGGATCTGACATACTGGACAAGCTAAGGCTTCATGACTTTGATCAAGTGGTACAAAATTCTCGAGGGGTTATGGCAGGAGTTGGATTTTGTAATGATCCTGACTTAGATTTTCCTATAGTTGTAAAAATTAAAATACCTGAAAATGATGGGAGATTGAGAGAAAAGTGAGTACTTGCTGAAGTATCTGATTCA
TTGTTCGAGATTATGAGCATCTGCAAGCAATCCATCAGACATTATAGCATACTAAACAGTGGAGTAGCGAGTAGTGATGAATCTGTTGCACCTTCTTGAGATGTTAATTTGACTACAAAAGCTTTTGTGTGGCTTGCGACTATTCTAAGAGCATGAAAGACTATAATACTTATTGAATTTGAATTATGGCCTTCTGAATTGTATATTTTGTTTTCTTTCTCAGTATTATTCAGGGCTGCATACTGGGGAACATATCGGAAATAAGGATGGGAGAGAGACCACAATTTTTTTCCTGAATTGTTTATGCCTTCTACTGGGTAGGCTTGACAGTAAAAGATTTGAAAGCACAATATCAGAATATGGAACTCAGATTTCTCAGGTTCTGCTATTGCAGGTATTTATTTATGTATGTAACATTTTTAAGCTGCTAAAAATATATGTGAAGAAGATAGGCATATAGAGACTTTCATCTTTCATTGAAATGAGTGCCCAACTGTCAAATCCTTGAAAGAAAGTTCTATAGTATTCTTCTTCTTAAACCACTGACGGAACATAGAGGGATAAGTTGAACTCATTTCTGGTAAAGGACCAGCCCCTTGCAGCGAAATGGCAATGGATAAAAGTATGATCAATTTTTTCATTATCAATAAAATAAAAGTTTCGTATAGAAGGGTGGAGAGCCATTTCAGATAAGATTTTTATAGGAACCAAAAGCTAAAATAGATTGAAAGATAAAATGATAAAAGGACTCCCAAGTAATTGAAGGTACTTCAACCCTCATTCTCTTGAGTATACTTACGCAAGCCTAAACCACACTAAACCAATGCCCTACCTTCACCCTTTCCCTCTCACTCTATTTGTTTTTTTTGAAAGAAGATACAAAATTTTCATTGATAAATGAGAAAAACACAGATATAAACTGAAAGGAGTGAAAGAAAATACAAAAACCAGATAACAAGAAGAAATAGAAGACTAAGAAGAAATAAAAACTTCCCAAGAACAAAAAATGATACTAGAAGAATAACTCGAATAAAACCGAAAGAGAACACCATTGAGAAGCCTTCTCTTATGCCTTCTGCAGTTTTCAAAGCTTTGCAGTTTTAGTTTGGATGGTTGGTTTGTGAGGCTTTCAAAGTTTTTCTCCAATCCAATTGGCATATGATGGTTTATATTTGGAGGTTTTTTGAAGACAGCATTGAGGCTTAGTATTTTTCCGGCTGCGGTCTAGTTCTTGTGTCCCTAGCTTCTACTTGGTTCATTTTGGAGTTTGATCTTCGCTCTTGGTGCTAAGGAAGGTTTTTCCCCTCATGCAGATGCTGACTCAGGAACTAGTTTGGTTTTCCAAGTTTTGTAGCAAGCAGTTTCTCTGAGTTCTGTTTTAGAATTTTTTATCTTTTTTGAGCTTGTATTGCTCTTTTTGTGCTATTTTTTCATTATTTTCTCTTCTTCTCTCATTTCTTTTTTCGCCCTTGAGGTTTGTATCTTGAACATTTTTCCTTTCCAATATCAATGAAAAGTTGTTTCTTGTTAAAAAAAAAACACCATTGAGAAGCCTTGAACTTTGCAAGGTCATAACATTCGAACTTGTTTCGTAAGATTCTTTGAATCCTTTCTAACCAAATTTCGGATAATAAGGCTTTGACCACTCAAACCCAAATTAAATGGGCCTTAGGAAGATAAAAGGGGCAATAACTTTCCTAACAGCTTTCCTAACCAATTACTAGTATACCCTTAATACCCTAATAATATTCTTAATTTTGTCCTAATAGCAATCCTATCAGATTTCTGCATCTTCTCATTTGTATTCTAGTATGGCTTTTAACACTTGATCTTTTGGATTTAGATTTTTCCAATAGATATATCAAAATCTACATGCCATATATTTATTTAAAATAAAATATGAAAATTTCATTGATAAATAGAAAGAAAAAAATAATAAAAGTTTTAAAAGGATCCAAACTCCTTTAAGATGTGAAAAGATAACGGTGAGAGACGTTCTT
AGAATTAAGCTACCATAAACATAGGATCTTAACTTCTTTAGAAGTGTAGCCAGCAGTGATTGTGTCACAAAATCATTAATATGATGGGTGCACACTTGGTGGTTGAATTTTTCAAAAAAGGGCCACCGAGTCTTTCGAGTAACTCATAACTCAAGCAAATATCTTTTTTTTGAACAAGATACTAACTCTTCATTGAATTAATGAAAAGGAACAAAATTGTTCAAAGATACAAACTCTCGAAAGAGTGAAAGTACAAATAAGATTGTAAATAAAAGACTGTTAATAAAATTGCACTCAAGAAAATATATTTTTAAATTTAATTTTTTTATTTAATGAAAAACCATTCATGGCACACATGAAAAATTACTCCTCGGACTGGACTTGTCTCCACAAGGTCGTCTTTTATGGATAAATGCAATTAAAGCTTTGCTCTATGAGCTTTGGTTTGAAAGAAATCTCAGAACCTTCGAGAATCAGCACAAACCTCCGTTTGATCGGTTTAGCCTAGCTAAGTTTAAAGCTTCACAATGGTGTTCTATGTCATCTCTCTTTGGGAATTATTCTTCTTCCCTCATTTGTTCTAATTGGGGGGCCTTTTGCTTTCACTTTTAATCTTTGCTGTTTGTTTTTATTTTGTTTTTCTTTCACTCTTTCGAGAGTTTGTATCGTTCAACATTTATACTCCTTTTCATTATTTCAGTGAGAAGTTTGTTTCTTGTTAAAAAAAAAAAAATTGCCACACATATAAAAATAAACTATACAAAAAATGGGCTTCAATCTAAGATGATAAGACCTCAAGAATAACTCTAGGATATATGTTACTGGTTCCACTAGTTGCGTAACAGACACCAATTAGGAGATAGTATGATGGCAGGATTCCTCTATTGTACGATGTTGAAAGGGCTTGTCTTCCATAGCTCGTGGAACAAATGAAGATTTTTGTAATCTTTGGTATCTTGAGTTTCTAGAGTAGAAGTTACCATCACTTTATAATATGCTTGGTTAGCAGTTCCTAATGAGATGGATATGTTCTTTCTTCCATGGAACTGATGCTTTTGTAGCTTGATAATTGATATTACTATCCTTGTTAAGACTTACCTTAAATTTATAATCTCTAATGCAATGCTTCGATTTGATACTATTTGCTTTATTTCTGTTGAACATGTGATTTATGCTTACTTGATTACATAGTTATTTTTGTGTTCTGTTTTTAAGTTCCATAGTACGGATGAAGATGTCATTGACGAGGTTGTTAGCATATTTAAGGCAGTTTTTCTCAATTCAAATTTATCATCTGGAGGCAGTATCCCTGATATTAGGCAACTAGATGTTGTGATGCCGTTGTTGCTTAACCTTCTAGACGAGCGGGATATGATAGCTAGAGCTGTCACCATTCTCATTTCTGAATGCTGTGTAATGTACTCTCTCTTTCTCTCTCTCTGATCTTGGAATTCATTGTCTTCTTTTGGTTCACTTTCACCTGCTGGTTTATCAGGAGCGGAGATAATCAGTTCCTTTCGGAAGTCTTTAAGCGATTTGATTCTGATAGTATAATACAGAGGAGGAATGCTCTTGATGTGATTTCTGAAATTGTTCAGATGTCATCAAATACGAGAAATTTACTGACTCAGTCAGCATGGTACATATGCACTAGCATTGCCGTCATTTTTTTGTTTTTCCTACATTAGTTATCTAGAACTGGCATAACTTTGTATGACGTTAATTGCTGCAGGCAAGATACTACTAACCGATTACTCAAATGCCTAGAAGATGAAGAAATTCTAATCTGTAAACAGGCTGCTAATTTGCTTCCTTGCATTGGT

mRNA sequence

ATGGAAGAAGACGAAGGGGGACTGCTCCTATGGAAGTCGGACTCAGCACCTCAATCTATGGTCTCAGTCACGGTTGGCAGAGTCATGGCCACTCTCCTCGGTGCTCGCCCTAAGAAGCTCGCCGACGCCGTCTCCCGTCTCTTCCCCGACCACCGCCGCGGAGCTTCTTCACTAGATTCTCTGGACGAATCCCTCTGGTTTCTGCACAACTATGTCAGGGACGCTGCTCAAAACCACGCCTCTTTCGATGAAATCCTCGTCCCCATGATCGAACATTCATTGAGATTCAAGGACAGGAACTGGAAGCGAGGGGGGCAAGTTATGGTGCTCCTCAACTGGCTGTTCCTTGAAGAGCTTATTTTCCAGCCGCTTATCAAAAATCTTGCTGATATTATTGCGAGGAAGGATGATCGCTATGTCGCCCTCGGCTGGTGTATCCTTGTTCGTAGTCTTGTGGAGTACGAGTCTGTCGCCAGTGAATTTTCATCGAATGGGTTAAGGGAGAGATTCAACGATATGTTGAAGGTATTCTGCACATGCATTCCACGTCTGGCATGTATTTTAAGTAAAGGAAGTAGTATGCAGGAAGGATTTGAGTTACCTTCTCGCCTCTCAGTCTGTGCTGCTGATTGTGTTGTATCTCTGACTAATGCGCTGACCAGAAAGTCTGAGGTTCAAACTAGGCAGAAAAGATTAAATTCAAGCTCATCTCATCAGCAAGTAACTTTCTTTTCAACTATTGTTGATGACCAGCGAGAGAAACAAATTAGCAACTCTTCAAAAGACTCAGATTCAGGCATGGAATATTTACTCTGGCATCAATTAAAGGATCTCGTGATTTTAGTACAGAGGCTTCTTGCATGGAGCAGGAAAAGTCGGCCATTGCATGCAAAAGGTCTGGAGCAAGTGCTTACGTGGTTGCATGAGATAAATGTGCATTATGGTAACTTCCAAAATGAGGCAGGAAAGGCAAAATCAAATATTCTGCAAACTGGAGCATTGCTACTCTCTTCTTGTTGGAGGCATTACAGCATTTTATTATTTTTGGAAGATTATAGATTTTCTCAGCACTACCAAGAATGGTTGAACCAGTACTTGTCAGGCATCCAGTATTATTCAGGGCTGCATACTGGGGAACATATCGGAAATAAGGATGGGAGAGAGACCACAATTTTTTTCCTGAATTGTTTATGCCTTCTACTGGGTAGGCTTGACAGTAAAAGATTTGAAAGCACAATATCAGAATATGGAACTCAGATTTCTCAGGTTCTGCTATTGCAGTTCCATAGTACGGATGAAGATGTCATTGACGAGGTTGTTAGCATATTTAAGGCAGTTTTTCTCAATTCAAATTTATCATCTGGAGGCAGTATCCCTGATATTAGGCAACTAGATGTTGTGATGCCGTTGTTGCTTAACCTTCTAGACGAGCGGGATATGATAGCTAGAGCTGTCACCATTCTCATTTCTGAATGCTGTGTAATGAGCGGAGATAATCAGTTCCTTTCGGAAGTCTTTAAGCGATTTGATTCTGATAGTATAATACAGAGGAGGAATGCTCTTGATGTGATTTCTGAAATTGTTCAGATGTCATCAAATACGAGAAATTTACTGACTCAGTCAGCATGGCAAGATACTACTAACCGATTACTCAAATGCCTAGAAGATGAAGAAATTCTAATCTGTAAACAGGCTGCTAATTTGCTTCCTTGCATTGGT

Coding sequence (CDS)

ATGGAAGAAGACGAAGGGGGACTGCTCCTATGGAAGTCGGACTCAGCACCTCAATCTATGGTCTCAGTCACGGTTGGCAGAGTCATGGCCACTCTCCTCGGTGCTCGCCCTAAGAAGCTCGCCGACGCCGTCTCCCGTCTCTTCCCCGACCACCGCCGCGGAGCTTCTTCACTAGATTCTCTGGACGAATCCCTCTGGTTTCTGCACAACTATGTCAGGGACGCTGCTCAAAACCACGCCTCTTTCGATGAAATCCTCGTCCCCATGATCGAACATTCATTGAGATTCAAGGACAGGAACTGGAAGCGAGGGGGGCAAGTTATGGTGCTCCTCAACTGGCTGTTCCTTGAAGAGCTTATTTTCCAGCCGCTTATCAAAAATCTTGCTGATATTATTGCGAGGAAGGATGATCGCTATGTCGCCCTCGGCTGGTGTATCCTTGTTCGTAGTCTTGTGGAGTACGAGTCTGTCGCCAGTGAATTTTCATCGAATGGGTTAAGGGAGAGATTCAACGATATGTTGAAGGTATTCTGCACATGCATTCCACGTCTGGCATGTATTTTAAGTAAAGGAAGTAGTATGCAGGAAGGATTTGAGTTACCTTCTCGCCTCTCAGTCTGTGCTGCTGATTGTGTTGTATCTCTGACTAATGCGCTGACCAGAAAGTCTGAGGTTCAAACTAGGCAGAAAAGATTAAATTCAAGCTCATCTCATCAGCAAGTAACTTTCTTTTCAACTATTGTTGATGACCAGCGAGAGAAACAAATTAGCAACTCTTCAAAAGACTCAGATTCAGGCATGGAATATTTACTCTGGCATCAATTAAAGGATCTCGTGATTTTAGTACAGAGGCTTCTTGCATGGAGCAGGAAAAGTCGGCCATTGCATGCAAAAGGTCTGGAGCAAGTGCTTACGTGGTTGCATGAGATAAATGTGCATTATGGTAACTTCCAAAATGAGGCAGGAAAGGCAAAATCAAATATTCTGCAAACTGGAGCATTGCTACTCTCTTCTTGTTGGAGGCATTACAGCATTTTATTATTTTTGGAAGATTATAGATTTTCTCAGCACTACCAAGAATGGTTGAACCAGTACTTGTCAGGCATCCAGTATTATTCAGGGCTGCATACTGGGGAACATATCGGAAATAAGGATGGGAGAGAGACCACAATTTTTTTCCTGAATTGTTTATGCCTTCTACTGGGTAGGCTTGACAGTAAAAGATTTGAAAGCACAATATCAGAATATGGAACTCAGATTTCTCAGGTTCTGCTATTGCAGTTCCATAGTACGGATGAAGATGTCATTGACGAGGTTGTTAGCATATTTAAGGCAGTTTTTCTCAATTCAAATTTATCATCTGGAGGCAGTATCCCTGATATTAGGCAACTAGATGTTGTGATGCCGTTGTTGCTTAACCTTCTAGACGAGCGGGATATGATAGCTAGAGCTGTCACCATTCTCATTTCTGAATGCTGTGTAATGAGCGGAGATAATCAGTTCCTTTCGGAAGTCTTTAAGCGATTTGATTCTGATAGTATAATACAGAGGAGGAATGCTCTTGATGTGATTTCTGAAATTGTTCAGATGTCATCAAATACGAGAAATTTACTGACTCAGTCAGCATGGCAAGATACTACTAACCGATTACTCAAATGCCTAGAAGATGAAGAAATTCTAATCTGTAAACAGGCTGCTAATTTGCTTCCTTGCATTGGT

Protein sequence

MEEDEGGLLLWKSDSAPQSMVSVTVGRVMATLLGARPKKLADAVSRLFPDHRRGASSLDSLDESLWFLHNYVRDAAQNHASFDEILVPMIEHSLRFKDRNWKRGGQVMVLLNWLFLEELIFQPLIKNLADIIARKDDRYVALGWCILVRSLVEYESVASEFSSNGLRERFNDMLKVFCTCIPRLACILSKGSSMQEGFELPSRLSVCAADCVVSLTNALTRKSEVQTRQKRLNSSSSHQQVTFFSTIVDDQREKQISNSSKDSDSGMEYLLWHQLKDLVILVQRLLAWSRKSRPLHAKGLEQVLTWLHEINVHYGNFQNEAGKAKSNILQTGALLLSSCWRHYSILLFLEDYRFSQHYQEWLNQYLSGIQYYSGLHTGEHIGNKDGRETTIFFLNCLCLLLGRLDSKRFESTISEYGTQISQVLLLQFHSTDEDVIDEVVSIFKAVFLNSNLSSGGSIPDIRQLDVVMPLLLNLLDERDMIARAVTILISECCVMSGDNQFLSEVFKRFDSDSIIQRRNALDVISEIVQMSSNTRNLLTQSAWQDTTNRLLKCLEDEEILICKQAANLLPCIG
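
The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard genetic code applies to this nuclear gene, the snippet below translates the first and last few codons of the CDS shown on this page. The `translate` helper is hypothetical and written only for illustration.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence into one-letter amino acids.

    Stop codons are rendered as '*'; codons with ambiguous bases as 'X'.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[pos:pos + 3], "X") for pos in range(0, len(cds), 3)
    )


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGGAAGAAGACGAAGGGGGACTGCTCCTA"))  # MEEDEGGLLL
print(translate("CCTTGCATTGGT"))                    # PCIG
```

Both fragments agree with the start (MEEDEGGLLL) and end (PCIG) of the protein sequence listed above.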
Homology
BLAST of MS016640 vs. NCBI nr
Match: XP_022151623.1 (uncharacterized protein LOC111019538 [Momordica charantia])

HSP 1 Score: 1131.3 bits (2925), Expect = 0.0e+00
Identity = 568/572 (99.30%), Positives = 571/572 (99.83%), Query Frame = 0

Query: 1   MEEDEGGLLLWKSDSAPQSMVSVTVGRVMATLLGARPKKLADAVSRLFPDHRRGASSLDS 60
           MEEDEGGLLLWKSDSAPQSM+SVTVGRVM TLLGARPKKLADAVSRLFPDHRRGASSLDS
Sbjct: 1   MEEDEGGLLLWKSDSAPQSMISVTVGRVMVTLLGARPKKLADAVSRLFPDHRRGASSLDS 60

Query: 61  LDESLWFLHNYVRDAAQNHASFDEILVPMIEHSLRFKDRNWKRGGQVMVLLNWLFLEELI 120
           LDESLWFLHNYVRDAAQNHASFDEILVPMIEHSLRFKDRNWKRGGQVMVLLNWLFLEELI
Sbjct: 61  LDESLWFLHNYVRDAAQNHASFDEILVPMIEHSLRFKDRNWKRGGQVMVLLNWLFLEELI 120

Query: 121 FQPLIKNLADIIARKDDRYVALGWCILVRSLVEYESVASEFSSNGLRERFNDMLKVFCTC 180
           FQPLIKNLADIIARKDDRYVALGWCILVRSLVEYESVASEFSSNGLRERFNDMLKVFCTC
Sbjct: 121 FQPLIKNLADIIARKDDRYVALGWCILVRSLVEYESVASEFSSNGLRERFNDMLKVFCTC 180

Query: 181 IPRLACILSKGSSMQEGFELPSRLSVCAADCVVSLTNALTRKSEVQTRQKRLNSSSSHQQ 240
           IPRLACILSKGSSMQEGFELPSRLSVCAADCVVSLTNALTRKSEVQTRQKRLNSSSSHQQ
Sbjct: 181 IPRLACILSKGSSMQEGFELPSRLSVCAADCVVSLTNALTRKSEVQTRQKRLNSSSSHQQ 240

Query: 241 VTFFSTIVDDQREKQISNSSKDSDSGMEYLLWHQLKDLVILVQRLLAWSRKSRPLHAKGL 300
           VTFFSTIVDDQREKQISNSSKDSDSGMEYLLWHQLKDLVILVQRLLAWSRKSRPLHAKGL
Sbjct: 241 VTFFSTIVDDQREKQISNSSKDSDSGMEYLLWHQLKDLVILVQRLLAWSRKSRPLHAKGL 300

Query: 301 EQVLTWLHEINVHYGNFQNEAGKAKSNILQTGALLLSSCWRHYSILLFLEDYRFSQHYQE 360
           EQVLTWLHEINVHYGNFQNEAGKAKSNILQTGALLLSSCWRHYSILLFLEDYRFSQHYQ+
Sbjct: 301 EQVLTWLHEINVHYGNFQNEAGKAKSNILQTGALLLSSCWRHYSILLFLEDYRFSQHYQD 360

Query: 361 WLNQYLSGIQYYSGLHTGEHIGNKDGRETTIFFLNCLCLLLGRLDSKRFESTISEYGTQI 420
           WLNQYLSGIQYYSGLHTGEHIGNKDGRETTIFFLNCLCLLLGRLDSKRFESTISEYGTQI
Sbjct: 361 WLNQYLSGIQYYSGLHTGEHIGNKDGRETTIFFLNCLCLLLGRLDSKRFESTISEYGTQI 420

Query: 421 SQVLLLQFHSTDEDVIDEVVSIFKAVFLNSNLSSGGSIPDIRQLDVVMPLLLNLLDERDM 480
           SQVLLLQFHSTDEDVIDEVVSIFKAVFLNSNLSSGGSIPDIRQLDVVMPLLLNLLDERDM
Sbjct: 421 SQVLLLQFHSTDEDVIDEVVSIFKAVFLNSNLSSGGSIPDIRQLDVVMPLLLNLLDERDM 480

Query: 481 IARAVTILISECCVMSGDNQFLSEVFKRFDSDSIIQRRNALDVISEIVQMSSNTRNLLTQ 540
           IARAVTILISECCVMSGDNQFLSEVFKRFDSDSIIQRRNALDVISEI+QMSSNTRNLLTQ
Sbjct: 481 IARAVTILISECCVMSGDNQFLSEVFKRFDSDSIIQRRNALDVISEIIQMSSNTRNLLTQ 540

Query: 541 SAWQDTTNRLLKCLEDEEILICKQAANLLPCI 573
           SAWQDTTNRLLKCLEDEEILICKQAANLLPCI
Sbjct: 541 SAWQDTTNRLLKCLEDEEILICKQAANLLPCI 572
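
The percentages in an HSP header are the match counts divided by the alignment length. The short sketch below reproduces the figures reported for this first hit from the counts given in its header; the helper name `percent_of_alignment` is hypothetical.

```python
def percent_of_alignment(count, aligned_length):
    """Return count / aligned_length as a percentage string with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return f"{100.0 * count / aligned_length:.2f}"


# Counts taken from the header of the hit against XP_022151623.1.
print(percent_of_alignment(568, 572))  # identical residues   -> 99.30
print(percent_of_alignment(571, 572))  # conservative matches -> 99.83
```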

BLAST of MS016640 vs. NCBI nr
Match: XP_038882127.1 (uncharacterized protein LOC120073376 isoform X3 [Benincasa hispida])

HSP 1 Score: 954.1 bits (2465), Expect = 5.4e-274
Identity = 484/572 (84.62%), Positives = 517/572 (90.38%), Query Frame = 0

Query: 1   MEEDEGGLLLWKSDSAPQSMVSVTVGRVMATLLGARPKKLADAVSRLFPDHRRGASSLDS 60
           MEEDEG LLLWKSDSAPQSM SVT+GRVM TLL ARPKKL DA+S L PDHR GASSLDS
Sbjct: 1   MEEDEGELLLWKSDSAPQSMASVTIGRVMVTLLAARPKKLHDAISSLSPDHRHGASSLDS 60

Query: 61  LDESLWFLHNYVRDAAQNHASFDEILVPMIEHSLRFKDRNWKRGGQVMVLLNWLFLEELI 120
           LD+SLWFLH YV DA QNHAS DEILVP+IEH+LRFKD+NWKRGGQVMVLLNWLFL+ELI
Sbjct: 61  LDQSLWFLHQYVGDAVQNHASLDEILVPIIEHTLRFKDKNWKRGGQVMVLLNWLFLDELI 120

Query: 121 FQPLIKNLADIIARKDDRYVALGWCILVRSLVEYESVASEFSSNGLRERFNDMLKVFCTC 180
           FQ LIKNLADII RKDDRYVALGWCILVRSLVEYESV  E   NGLRERFNDMLKV C+C
Sbjct: 121 FQTLIKNLADIIVRKDDRYVALGWCILVRSLVEYESVPCELPLNGLRERFNDMLKVLCSC 180

Query: 181 IPRLACILSKGSSMQEGFELPSRLSVCAADCVVSLTNALTRKSEVQTRQKRLNSSSSHQQ 240
           IPRL CILSKGS +QEGFELPSRLSVCAADCVVSLTNALT K+EVQTRQKRLN+SSS+QQ
Sbjct: 181 IPRLTCILSKGSIIQEGFELPSRLSVCAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQ 240

Query: 241 VTFFSTIVDDQREKQISNSSKDSDSGMEYLLWHQLKDLVILVQRLLAWSRKSRPLHAKGL 300
            TFFS +VDDQREK ISNSSK SDS M+YLLWHQLKDL+ILVQ+LLAWSRKSRPLHAKGL
Sbjct: 241 DTFFSNVVDDQREKPISNSSKHSDSDMDYLLWHQLKDLMILVQKLLAWSRKSRPLHAKGL 300

Query: 301 EQVLTWLHEINVHYGNFQNEAGKAKSNILQTGALLLSSCWRHYSILLFLEDYRFSQHYQE 360
           EQVL WLHEIN+HYGNFQNEAGKAKS I +TGALLLSSCWRHYSILLFLED RFSQHY+E
Sbjct: 301 EQVLKWLHEINLHYGNFQNEAGKAKSKIPRTGALLLSSCWRHYSILLFLEDCRFSQHYEE 360

Query: 361 WLNQYLSGIQYYSGLHTGEHIGNKDGRETTIFFLNCLCLLLGRLDSKRFESTISEYGTQI 420
            L QYLSGIQY SG HTGE I N+DGRET IFFLNCLCLLLGR DSKRFESTISEYGTQI
Sbjct: 361 CLKQYLSGIQYCSGHHTGEGIRNEDGRETIIFFLNCLCLLLGRHDSKRFESTISEYGTQI 420

Query: 421 SQVLLLQFHSTDEDVIDEVVSIFKAVFLNSNLSSGGSIPDIRQLDVVMPLLLNLLDERDM 480
            QVLLLQFHSTD DV+DEVVSIFKAVFLNS LSSGGSI D RQLD+VMPLLLNLLDE D+
Sbjct: 421 FQVLLLQFHSTD-DVVDEVVSIFKAVFLNSKLSSGGSITDNRQLDIVMPLLLNLLDEPDV 480

Query: 481 IARAVTILISECCVMSGDNQFLSEVFKRFDSDSIIQRRNALDVISEIVQMSSNTRNLLTQ 540
            ARAV ILI+E C+MS DNQFL EVFKRFDSD I+ RRNA+DVISEIVQMSSNTRNLL+Q
Sbjct: 481 TARAVIILIAESCLMSRDNQFLLEVFKRFDSDRIMPRRNAIDVISEIVQMSSNTRNLLSQ 540

Query: 541 SAWQDTTNRLLKCLEDEEILICKQAANLLPCI 573
           SAWQDT N+L++CLEDEEILI KQAA+LLPC+
Sbjct: 541 SAWQDTANQLIRCLEDEEILIRKQAADLLPCV 571

BLAST of MS016640 vs. NCBI nr
Match: XP_038882125.1 (uncharacterized protein LOC120073376 isoform X1 [Benincasa hispida])

HSP 1 Score: 954.1 bits (2465), Expect = 5.4e-274
Identity = 484/572 (84.62%), Positives = 517/572 (90.38%), Query Frame = 0

Query: 1   MEEDEGGLLLWKSDSAPQSMVSVTVGRVMATLLGARPKKLADAVSRLFPDHRRGASSLDS 60
           MEEDEG LLLWKSDSAPQSM SVT+GRVM TLL ARPKKL DA+S L PDHR GASSLDS
Sbjct: 1   MEEDEGELLLWKSDSAPQSMASVTIGRVMVTLLAARPKKLHDAISSLSPDHRHGASSLDS 60

Query: 61  LDESLWFLHNYVRDAAQNHASFDEILVPMIEHSLRFKDRNWKRGGQVMVLLNWLFLEELI 120
           LD+SLWFLH YV DA QNHAS DEILVP+IEH+LRFKD+NWKRGGQVMVLLNWLFL+ELI
Sbjct: 61  LDQSLWFLHQYVGDAVQNHASLDEILVPIIEHTLRFKDKNWKRGGQVMVLLNWLFLDELI 120

Query: 121 FQPLIKNLADIIARKDDRYVALGWCILVRSLVEYESVASEFSSNGLRERFNDMLKVFCTC 180
           FQ LIKNLADII RKDDRYVALGWCILVRSLVEYESV  E   NGLRERFNDMLKV C+C
Sbjct: 121 FQTLIKNLADIIVRKDDRYVALGWCILVRSLVEYESVPCELPLNGLRERFNDMLKVLCSC 180

Query: 181 IPRLACILSKGSSMQEGFELPSRLSVCAADCVVSLTNALTRKSEVQTRQKRLNSSSSHQQ 240
           IPRL CILSKGS +QEGFELPSRLSVCAADCVVSLTNALT K+EVQTRQKRLN+SSS+QQ
Sbjct: 181 IPRLTCILSKGSIIQEGFELPSRLSVCAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQ 240

Query: 241 VTFFSTIVDDQREKQISNSSKDSDSGMEYLLWHQLKDLVILVQRLLAWSRKSRPLHAKGL 300
            TFFS +VDDQREK ISNSSK SDS M+YLLWHQLKDL+ILVQ+LLAWSRKSRPLHAKGL
Sbjct: 241 DTFFSNVVDDQREKPISNSSKHSDSDMDYLLWHQLKDLMILVQKLLAWSRKSRPLHAKGL 300

Query: 301 EQVLTWLHEINVHYGNFQNEAGKAKSNILQTGALLLSSCWRHYSILLFLEDYRFSQHYQE 360
           EQVL WLHEIN+HYGNFQNEAGKAKS I +TGALLLSSCWRHYSILLFLED RFSQHY+E
Sbjct: 301 EQVLKWLHEINLHYGNFQNEAGKAKSKIPRTGALLLSSCWRHYSILLFLEDCRFSQHYEE 360

Query: 361 WLNQYLSGIQYYSGLHTGEHIGNKDGRETTIFFLNCLCLLLGRLDSKRFESTISEYGTQI 420
            L QYLSGIQY SG HTGE I N+DGRET IFFLNCLCLLLGR DSKRFESTISEYGTQI
Sbjct: 361 CLKQYLSGIQYCSGHHTGEGIRNEDGRETIIFFLNCLCLLLGRHDSKRFESTISEYGTQI 420

Query: 421 SQVLLLQFHSTDEDVIDEVVSIFKAVFLNSNLSSGGSIPDIRQLDVVMPLLLNLLDERDM 480
            QVLLLQFHSTD DV+DEVVSIFKAVFLNS LSSGGSI D RQLD+VMPLLLNLLDE D+
Sbjct: 421 FQVLLLQFHSTD-DVVDEVVSIFKAVFLNSKLSSGGSITDNRQLDIVMPLLLNLLDEPDV 480

Query: 481 IARAVTILISECCVMSGDNQFLSEVFKRFDSDSIIQRRNALDVISEIVQMSSNTRNLLTQ 540
            ARAV ILI+E C+MS DNQFL EVFKRFDSD I+ RRNA+DVISEIVQMSSNTRNLL+Q
Sbjct: 481 TARAVIILIAESCLMSRDNQFLLEVFKRFDSDRIMPRRNAIDVISEIVQMSSNTRNLLSQ 540

Query: 541 SAWQDTTNRLLKCLEDEEILICKQAANLLPCI 573
           SAWQDT N+L++CLEDEEILI KQAA+LLPC+
Sbjct: 541 SAWQDTANQLIRCLEDEEILIRKQAADLLPCV 571

BLAST of MS016640 vs. NCBI nr
Match: XP_023543535.1 (uncharacterized protein LOC111803391 isoform X3 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 953.0 bits (2462), Expect = 1.2e-273
Identity = 488/573 (85.17%), Positives = 517/573 (90.23%), Query Frame = 0

Query: 1   MEEDEGGLLLWKSDSAPQSMVSVTVGRVMATLLGARPKKLADAVSRLFPDHRRGASSLDS 60
           ME+DEG LLLWKSDSAPQSMVS+TVGRVMATLL ARPKKL DAVS L PDHR GA SLDS
Sbjct: 1   MEDDEGELLLWKSDSAPQSMVSITVGRVMATLLAARPKKLHDAVSGLSPDHRHGA-SLDS 60

Query: 61  LDESLWFLHNYVRDAAQNHASFDEILVPMIEHSLRFKDRNWKRGGQVMVLLNWLFLEELI 120
           LD+SLWFLH YVRDA QNHAS DEILVPMIEH+LRFKD+NWKRGGQVMVLLNWLFL+EL 
Sbjct: 61  LDQSLWFLHKYVRDAVQNHASLDEILVPMIEHTLRFKDKNWKRGGQVMVLLNWLFLDELT 120

Query: 121 FQPLIKNLADIIARKDDRYVALGWCILVRSLVEYESVASEFSSNGLRERFNDMLKVFCTC 180
           FQ LIKNLADII RKDDRYVALGWCILVRSLVE+ESV +E S NGLRERF DMLKVF +C
Sbjct: 121 FQSLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPTELSLNGLRERFKDMLKVFSSC 180

Query: 181 IPRLACILSKGSSMQEGFELPSRLSVCAADCVVSLTNALTRKSEVQTRQKRLNSSSSHQQ 240
           IPRL CILSKGS +QEGFELPSRLSVCAADCVVSLTNALTRK E QTRQKRLN+SSS+QQ
Sbjct: 181 IPRLTCILSKGSILQEGFELPSRLSVCAADCVVSLTNALTRKPEAQTRQKRLNTSSSYQQ 240

Query: 241 VTFFSTIVDDQREKQISNSSKDSDSGMEYLLWHQLKDLVILVQRLLAWSRKSRPLHAKGL 300
           VTFFS  VDDQREK IS+SSKDS+  MEYLLW QLKDLVILVQRLLAWS KSRPLHAKGL
Sbjct: 241 VTFFSNAVDDQREKPISSSSKDSNLDMEYLLWDQLKDLVILVQRLLAWSMKSRPLHAKGL 300

Query: 301 EQVLTWLHEINVHYGNFQNEAGKAKSNILQTGALLLSSCWRHYSILLFLEDYRFSQHYQE 360
           EQVL WL EINVHYG+F+NEAGK K+ I QTGALLLSSCWRHYSILLFL+D RFSQHY+E
Sbjct: 301 EQVLKWLQEINVHYGSFRNEAGKEKAKISQTGALLLSSCWRHYSILLFLDDCRFSQHYEE 360

Query: 361 WLNQYLSGIQYYSGLHTGEHIGNKDGRETTIFFLNCLCLLLGRLDSKRFESTISEYGTQI 420
           WLNQYLSGIQYYSG HTGE IGNKDGRET IFFLNCLCLLLGRLDSK+FEST+SEYG+QI
Sbjct: 361 WLNQYLSGIQYYSGHHTGESIGNKDGRETIIFFLNCLCLLLGRLDSKKFESTVSEYGSQI 420

Query: 421 SQVLLLQFHSTDEDVIDEVVSIFKAVFLNSNLSSGGSIPDIRQLDVVMPLLLNLLDERDM 480
           SQVLL QFHSTDEDVI EVVSIFKAVFLN  LSSG SI DIRQLDVVMP LLNLLDERD+
Sbjct: 421 SQVLLSQFHSTDEDVITEVVSIFKAVFLNPKLSSGDSITDIRQLDVVMPSLLNLLDERDV 480

Query: 481 IARAVTILISECCVMSGDN-QFLSEVFKRFDSDSIIQRRNALDVISEIVQMSSNTRNLLT 540
            ARAV ILI+E C+MS DN QFL EVFKRFDSDSIIQRRNA+DVISEIVQMSSN RNLLT
Sbjct: 481 TARAVIILIAESCLMSRDNDQFLLEVFKRFDSDSIIQRRNAIDVISEIVQMSSNKRNLLT 540

Query: 541 QSAWQDTTNRLLKCLEDEEILICKQAANLLPCI 573
           QSAWQD   +L+KCLEDEEILI KQAA+LLPCI
Sbjct: 541 QSAWQDIAKQLIKCLEDEEILIRKQAADLLPCI 572

BLAST of MS016640 vs. NCBI nr
Match: XP_023543533.1 (uncharacterized protein LOC111803391 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 953.0 bits (2462), Expect = 1.2e-273
Identity = 488/573 (85.17%), Positives = 517/573 (90.23%), Query Frame = 0

Query: 1   MEEDEGGLLLWKSDSAPQSMVSVTVGRVMATLLGARPKKLADAVSRLFPDHRRGASSLDS 60
           ME+DEG LLLWKSDSAPQSMVS+TVGRVMATLL ARPKKL DAVS L PDHR GA SLDS
Sbjct: 1   MEDDEGELLLWKSDSAPQSMVSITVGRVMATLLAARPKKLHDAVSGLSPDHRHGA-SLDS 60

Query: 61  LDESLWFLHNYVRDAAQNHASFDEILVPMIEHSLRFKDRNWKRGGQVMVLLNWLFLEELI 120
           LD+SLWFLH YVRDA QNHAS DEILVPMIEH+LRFKD+NWKRGGQVMVLLNWLFL+EL 
Sbjct: 61  LDQSLWFLHKYVRDAVQNHASLDEILVPMIEHTLRFKDKNWKRGGQVMVLLNWLFLDELT 120

Query: 121 FQPLIKNLADIIARKDDRYVALGWCILVRSLVEYESVASEFSSNGLRERFNDMLKVFCTC 180
           FQ LIKNLADII RKDDRYVALGWCILVRSLVE+ESV +E S NGLRERF DMLKVF +C
Sbjct: 121 FQSLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPTELSLNGLRERFKDMLKVFSSC 180

Query: 181 IPRLACILSKGSSMQEGFELPSRLSVCAADCVVSLTNALTRKSEVQTRQKRLNSSSSHQQ 240
           IPRL CILSKGS +QEGFELPSRLSVCAADCVVSLTNALTRK E QTRQKRLN+SSS+QQ
Sbjct: 181 IPRLTCILSKGSILQEGFELPSRLSVCAADCVVSLTNALTRKPEAQTRQKRLNTSSSYQQ 240

Query: 241 VTFFSTIVDDQREKQISNSSKDSDSGMEYLLWHQLKDLVILVQRLLAWSRKSRPLHAKGL 300
           VTFFS  VDDQREK IS+SSKDS+  MEYLLW QLKDLVILVQRLLAWS KSRPLHAKGL
Sbjct: 241 VTFFSNAVDDQREKPISSSSKDSNLDMEYLLWDQLKDLVILVQRLLAWSMKSRPLHAKGL 300

Query: 301 EQVLTWLHEINVHYGNFQNEAGKAKSNILQTGALLLSSCWRHYSILLFLEDYRFSQHYQE 360
           EQVL WL EINVHYG+F+NEAGK K+ I QTGALLLSSCWRHYSILLFL+D RFSQHY+E
Sbjct: 301 EQVLKWLQEINVHYGSFRNEAGKEKAKISQTGALLLSSCWRHYSILLFLDDCRFSQHYEE 360

Query: 361 WLNQYLSGIQYYSGLHTGEHIGNKDGRETTIFFLNCLCLLLGRLDSKRFESTISEYGTQI 420
           WLNQYLSGIQYYSG HTGE IGNKDGRET IFFLNCLCLLLGRLDSK+FEST+SEYG+QI
Sbjct: 361 WLNQYLSGIQYYSGHHTGESIGNKDGRETIIFFLNCLCLLLGRLDSKKFESTVSEYGSQI 420

Query: 421 SQVLLLQFHSTDEDVIDEVVSIFKAVFLNSNLSSGGSIPDIRQLDVVMPLLLNLLDERDM 480
           SQVLL QFHSTDEDVI EVVSIFKAVFLN  LSSG SI DIRQLDVVMP LLNLLDERD+
Sbjct: 421 SQVLLSQFHSTDEDVITEVVSIFKAVFLNPKLSSGDSITDIRQLDVVMPSLLNLLDERDV 480

Query: 481 IARAVTILISECCVMSGDN-QFLSEVFKRFDSDSIIQRRNALDVISEIVQMSSNTRNLLT 540
            ARAV ILI+E C+MS DN QFL EVFKRFDSDSIIQRRNA+DVISEIVQMSSN RNLLT
Sbjct: 481 TARAVIILIAESCLMSRDNDQFLLEVFKRFDSDSIIQRRNAIDVISEIVQMSSNKRNLLT 540

Query: 541 QSAWQDTTNRLLKCLEDEEILICKQAANLLPCI 573
           QSAWQD   +L+KCLEDEEILI KQAA+LLPCI
Sbjct: 541 QSAWQDIAKQLIKCLEDEEILIRKQAADLLPCI 572

BLAST of MS016640 vs. ExPASy TrEMBL
Match: A0A6J1DCN8 (uncharacterized protein LOC111019538 OS=Momordica charantia OX=3673 GN=LOC111019538 PE=4 SV=1)

HSP 1 Score: 1131.3 bits (2925), Expect = 0.0e+00
Identity = 568/572 (99.30%), Positives = 571/572 (99.83%), Query Frame = 0

Query: 1   MEEDEGGLLLWKSDSAPQSMVSVTVGRVMATLLGARPKKLADAVSRLFPDHRRGASSLDS 60
           MEEDEGGLLLWKSDSAPQSM+SVTVGRVM TLLGARPKKLADAVSRLFPDHRRGASSLDS
Sbjct: 1   MEEDEGGLLLWKSDSAPQSMISVTVGRVMVTLLGARPKKLADAVSRLFPDHRRGASSLDS 60

Query: 61  LDESLWFLHNYVRDAAQNHASFDEILVPMIEHSLRFKDRNWKRGGQVMVLLNWLFLEELI 120
           LDESLWFLHNYVRDAAQNHASFDEILVPMIEHSLRFKDRNWKRGGQVMVLLNWLFLEELI
Sbjct: 61  LDESLWFLHNYVRDAAQNHASFDEILVPMIEHSLRFKDRNWKRGGQVMVLLNWLFLEELI 120

Query: 121 FQPLIKNLADIIARKDDRYVALGWCILVRSLVEYESVASEFSSNGLRERFNDMLKVFCTC 180
           FQPLIKNLADIIARKDDRYVALGWCILVRSLVEYESVASEFSSNGLRERFNDMLKVFCTC
Sbjct: 121 FQPLIKNLADIIARKDDRYVALGWCILVRSLVEYESVASEFSSNGLRERFNDMLKVFCTC 180

Query: 181 IPRLACILSKGSSMQEGFELPSRLSVCAADCVVSLTNALTRKSEVQTRQKRLNSSSSHQQ 240
           IPRLACILSKGSSMQEGFELPSRLSVCAADCVVSLTNALTRKSEVQTRQKRLNSSSSHQQ
Sbjct: 181 IPRLACILSKGSSMQEGFELPSRLSVCAADCVVSLTNALTRKSEVQTRQKRLNSSSSHQQ 240

Query: 241 VTFFSTIVDDQREKQISNSSKDSDSGMEYLLWHQLKDLVILVQRLLAWSRKSRPLHAKGL 300
           VTFFSTIVDDQREKQISNSSKDSDSGMEYLLWHQLKDLVILVQRLLAWSRKSRPLHAKGL
Sbjct: 241 VTFFSTIVDDQREKQISNSSKDSDSGMEYLLWHQLKDLVILVQRLLAWSRKSRPLHAKGL 300

Query: 301 EQVLTWLHEINVHYGNFQNEAGKAKSNILQTGALLLSSCWRHYSILLFLEDYRFSQHYQE 360
           EQVLTWLHEINVHYGNFQNEAGKAKSNILQTGALLLSSCWRHYSILLFLEDYRFSQHYQ+
Sbjct: 301 EQVLTWLHEINVHYGNFQNEAGKAKSNILQTGALLLSSCWRHYSILLFLEDYRFSQHYQD 360

Query: 361 WLNQYLSGIQYYSGLHTGEHIGNKDGRETTIFFLNCLCLLLGRLDSKRFESTISEYGTQI 420
           WLNQYLSGIQYYSGLHTGEHIGNKDGRETTIFFLNCLCLLLGRLDSKRFESTISEYGTQI
Sbjct: 361 WLNQYLSGIQYYSGLHTGEHIGNKDGRETTIFFLNCLCLLLGRLDSKRFESTISEYGTQI 420

Query: 421 SQVLLLQFHSTDEDVIDEVVSIFKAVFLNSNLSSGGSIPDIRQLDVVMPLLLNLLDERDM 480
           SQVLLLQFHSTDEDVIDEVVSIFKAVFLNSNLSSGGSIPDIRQLDVVMPLLLNLLDERDM
Sbjct: 421 SQVLLLQFHSTDEDVIDEVVSIFKAVFLNSNLSSGGSIPDIRQLDVVMPLLLNLLDERDM 480

Query: 481 IARAVTILISECCVMSGDNQFLSEVFKRFDSDSIIQRRNALDVISEIVQMSSNTRNLLTQ 540
           IARAVTILISECCVMSGDNQFLSEVFKRFDSDSIIQRRNALDVISEI+QMSSNTRNLLTQ
Sbjct: 481 IARAVTILISECCVMSGDNQFLSEVFKRFDSDSIIQRRNALDVISEIIQMSSNTRNLLTQ 540

Query: 541 SAWQDTTNRLLKCLEDEEILICKQAANLLPCI 573
           SAWQDTTNRLLKCLEDEEILICKQAANLLPCI
Sbjct: 541 SAWQDTTNRLLKCLEDEEILICKQAANLLPCI 572

BLAST of MS016640 vs. ExPASy TrEMBL
Match: A0A6J1EPJ3 (uncharacterized protein LOC111435467 OS=Cucurbita moschata OX=3662 GN=LOC111435467 PE=4 SV=1)

HSP 1 Score: 952.2 bits (2460), Expect = 9.9e-274
Identity = 486/573 (84.82%), Positives = 518/573 (90.40%), Query Frame = 0

Query: 1   MEEDEGGLLLWKSDSAPQSMVSVTVGRVMATLLGARPKKLADAVSRLFPDHRRGASSLDS 60
           ME+DEG LLLWKSDSAPQSMVSVTVGRVMATLL ARPKKL DAVS L PDHR GA SLDS
Sbjct: 1   MEDDEGELLLWKSDSAPQSMVSVTVGRVMATLLAARPKKLHDAVSGLSPDHRHGA-SLDS 60

Query: 61  LDESLWFLHNYVRDAAQNHASFDEILVPMIEHSLRFKDRNWKRGGQVMVLLNWLFLEELI 120
           LD+SLWFLH YVRDA QNHAS DEILVPMIEH+LRFKD+NWKRGGQVMVLLNWLFL+EL 
Sbjct: 61  LDQSLWFLHKYVRDAVQNHASLDEILVPMIEHTLRFKDKNWKRGGQVMVLLNWLFLDELT 120

Query: 121 FQPLIKNLADIIARKDDRYVALGWCILVRSLVEYESVASEFSSNGLRERFNDMLKVFCTC 180
           FQ LIKNLADII RKDDRYVALGWCILVRSLVE+ESV +E S NGLRERF DMLKVF +C
Sbjct: 121 FQSLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPNELSLNGLRERFKDMLKVFSSC 180

Query: 181 IPRLACILSKGSSMQEGFELPSRLSVCAADCVVSLTNALTRKSEVQTRQKRLNSSSSHQQ 240
           IPRL CILSKGS +QEGFELPSRLSVCAADCVVSLTNALTRK+E QTRQKRLN+SSS+QQ
Sbjct: 181 IPRLTCILSKGSILQEGFELPSRLSVCAADCVVSLTNALTRKAEAQTRQKRLNASSSYQQ 240

Query: 241 VTFFSTIVDDQREKQISNSSKDSDSGMEYLLWHQLKDLVILVQRLLAWSRKSRPLHAKGL 300
           VT FS  VDDQREK IS+SSKDS+  MEYLLW QLKDL+ILVQRLLAWS KSRPLHAKGL
Sbjct: 241 VTLFSNAVDDQREKPISSSSKDSNLDMEYLLWDQLKDLLILVQRLLAWSMKSRPLHAKGL 300

Query: 301 EQVLTWLHEINVHYGNFQNEAGKAKSNILQTGALLLSSCWRHYSILLFLEDYRFSQHYQE 360
           EQVL WL EINVHYG+F+NEAGK K+ I QTGALLLSSCWRHYSILLFL+D RFSQHY+E
Sbjct: 301 EQVLKWLQEINVHYGSFRNEAGKEKAKISQTGALLLSSCWRHYSILLFLDDCRFSQHYEE 360

Query: 361 WLNQYLSGIQYYSGLHTGEHIGNKDGRETTIFFLNCLCLLLGRLDSKRFESTISEYGTQI 420
           WLNQYLSGIQYYSG HTGE +GNKDGRET IFFLNCLCLLLGRLDSK+FEST+SEYG+QI
Sbjct: 361 WLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKKFESTVSEYGSQI 420

Query: 421 SQVLLLQFHSTDEDVIDEVVSIFKAVFLNSNLSSGGSIPDIRQLDVVMPLLLNLLDERDM 480
           SQVLLLQFHSTDEDVI EVVSIFKAVFLN  LSSG SI DIRQLDVVMP LLNLLDERD+
Sbjct: 421 SQVLLLQFHSTDEDVITEVVSIFKAVFLNPKLSSGDSITDIRQLDVVMPSLLNLLDERDV 480

Query: 481 IARAVTILISECCVMSGDN-QFLSEVFKRFDSDSIIQRRNALDVISEIVQMSSNTRNLLT 540
            ARAV ILI+E C+MS DN QFL EVFKRFDSDSI+QRRNA+DVISEIVQMSSN RNLLT
Sbjct: 481 TARAVIILIAESCLMSRDNDQFLLEVFKRFDSDSIVQRRNAIDVISEIVQMSSNKRNLLT 540

Query: 541 QSAWQDTTNRLLKCLEDEEILICKQAANLLPCI 573
           QSAWQD   +L+KCLEDEEILI KQAA+LLPCI
Sbjct: 541 QSAWQDIAKQLIKCLEDEEILIRKQAADLLPCI 572

BLAST of MS016640 vs. ExPASy TrEMBL
Match: A0A1S4DUC1 (uncharacterized protein LOC103486160 isoform X7 OS=Cucumis melo OX=3656 GN=LOC103486160 PE=4 SV=1)

HSP 1 Score: 936.4 bits (2419), Expect = 5.6e-269
Identity = 472/572 (82.52%), Positives = 511/572 (89.34%), Query Frame = 0

Query: 1   MEEDEGGLLLWKSDSAPQSMVSVTVGRVMATLLGARPKKLADAVSRLFPDHRRGASSLDS 60
           MEEDEG LLLWKSD AP+SMVSVTVGRVMATLL ARPKKL +AVS L PDHR+GASSLDS
Sbjct: 1   MEEDEGELLLWKSDLAPESMVSVTVGRVMATLLVARPKKLHNAVSGLSPDHRQGASSLDS 60

Query: 61  LDESLWFLHNYVRDAAQNHASFDEILVPMIEHSLRFKDRNWKRGGQVMVLLNWLFLEELI 120
           + +SLWFLH YV+DA QNH S DEIL+PMIEH+LR KD+NWKRGGQV+VLLNWLFL+EL 
Sbjct: 61  IHQSLWFLHQYVKDAVQNHVSLDEILIPMIEHALRLKDKNWKRGGQVLVLLNWLFLDELT 120

Query: 121 FQPLIKNLADIIARKDDRYVALGWCILVRSLVEYESVASEFSSNGLRERFNDMLKVFCTC 180
           FQ LIKN+ADII RKDDRYVALGWCILVRSLVE+ESV  E   NGLRERFNDMLKV C+C
Sbjct: 121 FQTLIKNIADIIVRKDDRYVALGWCILVRSLVEFESVPCELPLNGLRERFNDMLKVLCSC 180

Query: 181 IPRLACILSKGSSMQEGFELPSRLSVCAADCVVSLTNALTRKSEVQTRQKRLNSSSSHQQ 240
           IPRL CILSKGS +QEGFELPSRL+VCAADC+VSLTNALTRK+EVQTRQKR N++SS+QQ
Sbjct: 181 IPRLTCILSKGSMLQEGFELPSRLAVCAADCIVSLTNALTRKAEVQTRQKRSNANSSYQQ 240

Query: 241 VTFFSTIVDDQREKQISNSSKDSDSGMEYLLWHQLKDLVILVQRLLAWSRKSRPLHAKGL 300
           VT FS  VDDQREK ISN+SKDS   MEYLLW QLKDL  LVQRLLAWS+ SRPLHAKGL
Sbjct: 241 VTIFSNTVDDQREKPISNASKDSYLDMEYLLWDQLKDLAKLVQRLLAWSKNSRPLHAKGL 300

Query: 301 EQVLTWLHEINVHYGNFQNEAGKAKSNILQTGALLLSSCWRHYSILLFLEDYRFSQHYQE 360
           EQVL WL EIN+HYGNFQ+EAGK KS I +TG+LLLSSCWRHYSILLFLED  FSQHY+E
Sbjct: 301 EQVLKWLDEINLHYGNFQDEAGKVKSKIPRTGSLLLSSCWRHYSILLFLEDRLFSQHYKE 360

Query: 361 WLNQYLSGIQYYSGLHTGEHIGNKDGRETTIFFLNCLCLLLGRLDSKRFESTISEYGTQI 420
           WLNQYLSGIQYYSG HT E IGNK  RET IFFLNCLCLLLGRLDSK+ EST+SEYGTQI
Sbjct: 361 WLNQYLSGIQYYSGHHTEETIGNKKARETMIFFLNCLCLLLGRLDSKKIESTVSEYGTQI 420

Query: 421 SQVLLLQFHSTDEDVIDEVVSIFKAVFLNSNLSSGGSIPDIRQLDVVMPLLLNLLDERDM 480
           SQVLLLQFHSTDEDVIDEVVSIFKAVFLNS LSSGGSI D RQLD VMPLLLNLLDERD+
Sbjct: 421 SQVLLLQFHSTDEDVIDEVVSIFKAVFLNSKLSSGGSITDHRQLDSVMPLLLNLLDERDV 480

Query: 481 IARAVTILISECCVMSGDNQFLSEVFKRFDSDSIIQRRNALDVISEIVQMSSNTRNLLTQ 540
            ARAV ILI+E C+MS DNQFL EVFKRFDSDSI+QRRNA+DVISEIVQMSSNTRNLLTQ
Sbjct: 481 TARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQ 540

Query: 541 SAWQDTTNRLLKCLEDEEILICKQAANLLPCI 573
           SAWQD  N+L+KCLEDEEILI KQAA+LLPC+
Sbjct: 541 SAWQDIANQLIKCLEDEEILIRKQAADLLPCV 572

BLAST of MS016640 vs. ExPASy TrEMBL
Match: A0A1S4DUB5 (uncharacterized protein LOC103486160 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103486160 PE=4 SV=1)

HSP 1 Score: 936.4 bits (2419), Expect = 5.6e-269
Identity = 472/572 (82.52%), Positives = 511/572 (89.34%), Query Frame = 0

Query: 1   MEEDEGGLLLWKSDSAPQSMVSVTVGRVMATLLGARPKKLADAVSRLFPDHRRGASSLDS 60
           MEEDEG LLLWKSD AP+SMVSVTVGRVMATLL ARPKKL +AVS L PDHR+GASSLDS
Sbjct: 1   MEEDEGELLLWKSDLAPESMVSVTVGRVMATLLVARPKKLHNAVSGLSPDHRQGASSLDS 60

Query: 61  LDESLWFLHNYVRDAAQNHASFDEILVPMIEHSLRFKDRNWKRGGQVMVLLNWLFLEELI 120
           + +SLWFLH YV+DA QNH S DEIL+PMIEH+LR KD+NWKRGGQV+VLLNWLFL+EL 
Sbjct: 61  IHQSLWFLHQYVKDAVQNHVSLDEILIPMIEHALRLKDKNWKRGGQVLVLLNWLFLDELT 120

Query: 121 FQPLIKNLADIIARKDDRYVALGWCILVRSLVEYESVASEFSSNGLRERFNDMLKVFCTC 180
           FQ LIKN+ADII RKDDRYVALGWCILVRSLVE+ESV  E   NGLRERFNDMLKV C+C
Sbjct: 121 FQTLIKNIADIIVRKDDRYVALGWCILVRSLVEFESVPCELPLNGLRERFNDMLKVLCSC 180

Query: 181 IPRLACILSKGSSMQEGFELPSRLSVCAADCVVSLTNALTRKSEVQTRQKRLNSSSSHQQ 240
           IPRL CILSKGS +QEGFELPSRL+VCAADC+VSLTNALTRK+EVQTRQKR N++SS+QQ
Sbjct: 181 IPRLTCILSKGSMLQEGFELPSRLAVCAADCIVSLTNALTRKAEVQTRQKRSNANSSYQQ 240

Query: 241 VTFFSTIVDDQREKQISNSSKDSDSGMEYLLWHQLKDLVILVQRLLAWSRKSRPLHAKGL 300
           VT FS  VDDQREK ISN+SKDS   MEYLLW QLKDL  LVQRLLAWS+ SRPLHAKGL
Sbjct: 241 VTIFSNTVDDQREKPISNASKDSYLDMEYLLWDQLKDLAKLVQRLLAWSKNSRPLHAKGL 300

Query: 301 EQVLTWLHEINVHYGNFQNEAGKAKSNILQTGALLLSSCWRHYSILLFLEDYRFSQHYQE 360
           EQVL WL EIN+HYGNFQ+EAGK KS I +TG+LLLSSCWRHYSILLFLED  FSQHY+E
Sbjct: 301 EQVLKWLDEINLHYGNFQDEAGKVKSKIPRTGSLLLSSCWRHYSILLFLEDRLFSQHYKE 360

Query: 361 WLNQYLSGIQYYSGLHTGEHIGNKDGRETTIFFLNCLCLLLGRLDSKRFESTISEYGTQI 420
           WLNQYLSGIQYYSG HT E IGNK  RET IFFLNCLCLLLGRLDSK+ EST+SEYGTQI
Sbjct: 361 WLNQYLSGIQYYSGHHTEETIGNKKARETMIFFLNCLCLLLGRLDSKKIESTVSEYGTQI 420

Query: 421 SQVLLLQFHSTDEDVIDEVVSIFKAVFLNSNLSSGGSIPDIRQLDVVMPLLLNLLDERDM 480
           SQVLLLQFHSTDEDVIDEVVSIFKAVFLNS LSSGGSI D RQLD VMPLLLNLLDERD+
Sbjct: 421 SQVLLLQFHSTDEDVIDEVVSIFKAVFLNSKLSSGGSITDHRQLDSVMPLLLNLLDERDV 480

Query: 481 IARAVTILISECCVMSGDNQFLSEVFKRFDSDSIIQRRNALDVISEIVQMSSNTRNLLTQ 540
            ARAV ILI+E C+MS DNQFL EVFKRFDSDSI+QRRNA+DVISEIVQMSSNTRNLLTQ
Sbjct: 481 TARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQ 540

Query: 541 SAWQDTTNRLLKCLEDEEILICKQAANLLPCI 573
           SAWQD  N+L+KCLEDEEILI KQAA+LLPC+
Sbjct: 541 SAWQDIANQLIKCLEDEEILIRKQAADLLPCV 572

BLAST of MS016640 vs. ExPASy TrEMBL
Match: A0A1S4DUA1 (uncharacterized protein LOC103486160 isoform X4 OS=Cucumis melo OX=3656 GN=LOC103486160 PE=4 SV=1)

HSP 1 Score: 936.4 bits (2419), Expect = 5.6e-269
Identity = 472/572 (82.52%), Positives = 511/572 (89.34%), Query Frame = 0

Query: 1   MEEDEGGLLLWKSDSAPQSMVSVTVGRVMATLLGARPKKLADAVSRLFPDHRRGASSLDS 60
           MEEDEG LLLWKSD AP+SMVSVTVGRVMATLL ARPKKL +AVS L PDHR+GASSLDS
Sbjct: 1   MEEDEGELLLWKSDLAPESMVSVTVGRVMATLLVARPKKLHNAVSGLSPDHRQGASSLDS 60

Query: 61  LDESLWFLHNYVRDAAQNHASFDEILVPMIEHSLRFKDRNWKRGGQVMVLLNWLFLEELI 120
           + +SLWFLH YV+DA QNH S DEIL+PMIEH+LR KD+NWKRGGQV+VLLNWLFL+EL 
Sbjct: 61  IHQSLWFLHQYVKDAVQNHVSLDEILIPMIEHALRLKDKNWKRGGQVLVLLNWLFLDELT 120

Query: 121 FQPLIKNLADIIARKDDRYVALGWCILVRSLVEYESVASEFSSNGLRERFNDMLKVFCTC 180
           FQ LIKN+ADII RKDDRYVALGWCILVRSLVE+ESV  E   NGLRERFNDMLKV C+C
Sbjct: 121 FQTLIKNIADIIVRKDDRYVALGWCILVRSLVEFESVPCELPLNGLRERFNDMLKVLCSC 180

Query: 181 IPRLACILSKGSSMQEGFELPSRLSVCAADCVVSLTNALTRKSEVQTRQKRLNSSSSHQQ 240
           IPRL CILSKGS +QEGFELPSRL+VCAADC+VSLTNALTRK+EVQTRQKR N++SS+QQ
Sbjct: 181 IPRLTCILSKGSMLQEGFELPSRLAVCAADCIVSLTNALTRKAEVQTRQKRSNANSSYQQ 240

Query: 241 VTFFSTIVDDQREKQISNSSKDSDSGMEYLLWHQLKDLVILVQRLLAWSRKSRPLHAKGL 300
           VT FS  VDDQREK ISN+SKDS   MEYLLW QLKDL  LVQRLLAWS+ SRPLHAKGL
Sbjct: 241 VTIFSNTVDDQREKPISNASKDSYLDMEYLLWDQLKDLAKLVQRLLAWSKNSRPLHAKGL 300

Query: 301 EQVLTWLHEINVHYGNFQNEAGKAKSNILQTGALLLSSCWRHYSILLFLEDYRFSQHYQE 360
           EQVL WL EIN+HYGNFQ+EAGK KS I +TG+LLLSSCWRHYSILLFLED  FSQHY+E
Sbjct: 301 EQVLKWLDEINLHYGNFQDEAGKVKSKIPRTGSLLLSSCWRHYSILLFLEDRLFSQHYKE 360

Query: 361 WLNQYLSGIQYYSGLHTGEHIGNKDGRETTIFFLNCLCLLLGRLDSKRFESTISEYGTQI 420
           WLNQYLSGIQYYSG HT E IGNK  RET IFFLNCLCLLLGRLDSK+ EST+SEYGTQI
Sbjct: 361 WLNQYLSGIQYYSGHHTEETIGNKKARETMIFFLNCLCLLLGRLDSKKIESTVSEYGTQI 420

Query: 421 SQVLLLQFHSTDEDVIDEVVSIFKAVFLNSNLSSGGSIPDIRQLDVVMPLLLNLLDERDM 480
           SQVLLLQFHSTDEDVIDEVVSIFKAVFLNS LSSGGSI D RQLD VMPLLLNLLDERD+
Sbjct: 421 SQVLLLQFHSTDEDVIDEVVSIFKAVFLNSKLSSGGSITDHRQLDSVMPLLLNLLDERDV 480

Query: 481 IARAVTILISECCVMSGDNQFLSEVFKRFDSDSIIQRRNALDVISEIVQMSSNTRNLLTQ 540
            ARAV ILI+E C+MS DNQFL EVFKRFDSDSI+QRRNA+DVISEIVQMSSNTRNLLTQ
Sbjct: 481 TARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQ 540

Query: 541 SAWQDTTNRLLKCLEDEEILICKQAANLLPCI 573
           SAWQD  N+L+KCLEDEEILI KQAA+LLPC+
Sbjct: 541 SAWQDIANQLIKCLEDEEILIRKQAADLLPCV 572

BLAST of MS016640 vs. TAIR 10
Match: AT3G57570.1 (ARM repeat superfamily protein )

HSP 1 Score: 501.1 bits (1289), Expect = 1.2e-141
Identity = 267/560 (47.68%), Positives = 379/560 (67.68%), Query Frame = 0

Query: 15  SAPQSMVSVTVGRVMATLLGARPKKLADAVSRLFPDHRRGASSLDSLDESLWFLHNYVRD 74
           S P+S+VSVTV R M+TLL ARPKKL +++SRL PD ++G S   S+DE+LWFL   V D
Sbjct: 9   SEPESLVSVTVARFMSTLLSARPKKLRESISRLTPDSQKGVSG--SIDEALWFLEKCVID 68

Query: 75  AAQNHASFDEILVPMIEHSLRFKDRNWKRGGQVMVLLNWLFLEELIFQPLIKNLADIIAR 134
           AA+   +  EILVP+IEH+LRFKD   K G   M+LLNWLF +E++FQ + +NL++II R
Sbjct: 69  AAERDEAMSEILVPIIEHTLRFKDS--KHGNPAMILLNWLFQDEVLFQAVSRNLSNIILR 128

Query: 135 KDDRYVALGWCILVRSLVEYESVASEFSSNGLRERFNDMLKVFCTCIPRLACILSKGSSM 194
            +DR++ALGWC+L+R LVE E    +   +G+RE+ +  +++  +C+P L  I+  GS +
Sbjct: 129 NEDRFLALGWCLLIRRLVECEDTGDQGFWHGIREKHSMFVEIVSSCVPHLLMIVRNGSIL 188

Query: 195 QEGFELPSRLSVCAADCVVSLTNALT-RKSEVQTRQKRLNSSSSHQQVTFFSTIVDDQRE 254
           Q+G+E+PSRLS+ AADC++S+T AL  R + +  R K    + SHQ V     I   +++
Sbjct: 189 QDGYEVPSRLSLSAADCLLSITGALAKRDNTLINRPKSPTITGSHQPVALTPNI--SEKK 248

Query: 255 KQISNSSKDSDSGMEYLLWHQLKDLVILVQRLLAWSRKSRPLHAKGLEQVLTWLHEINVH 314
           K+ ++  +DS+     +LW+ ++DL  LVQ L AW+RK+R LHAKGL QVL WL E+  H
Sbjct: 249 KRPTSLPEDSNIETNCILWNHMEDLTRLVQCLFAWNRKTRLLHAKGLSQVLKWLEELKEH 308

Query: 315 YGNFQNEAGKAKSNILQTGALLLSSCWRHYSILLFLEDYRFSQHYQEWLNQYLSGIQYYS 374
           +G  Q EAG   + +   GALLLSSCW+HYS+LL +ED +FS+  +E L QYLSGI+YYS
Sbjct: 309 HGGSQKEAG---TEVSMGGALLLSSCWKHYSVLLHMEDQKFSKISKELLEQYLSGIKYYS 368

Query: 375 GLHTGEHIGNKDGR-ETTIFFLNCLCLLLGRLDSKRFESTISEYGTQISQVLLLQFHSTD 434
             +       K+G  ET  FFLNCLCLLLGR + K+FES +SEYG ++  +LL Q  S +
Sbjct: 369 ESYPQGCSDTKNGGIETQKFFLNCLCLLLGRFEGKKFESILSEYGMKLVPILLHQLRSNN 428

Query: 435 EDVIDEVVSIFKAVFLNSNLSSGGSIPDIRQLDVVMPLLLNLLDERDMIARAVTILISEC 494
           E++ + VV+IFKAVF      SG S  D   +DVV+P LL+LLDERD  A+AV++L+++ 
Sbjct: 429 EEISEGVVAIFKAVFFKLQSQSGDSFSDTMCMDVVIPSLLHLLDERDGAAKAVSVLLADY 488

Query: 495 CVMSGDNQFLSEVFKRFDSDSIIQRRNALDVISEIVQMSSNTRNLLTQSAWQDTTNRLLK 554
           C  +  N  LSE+ +R  S + +QR N+LDVISE++ MS +  +  +   W++  + LLK
Sbjct: 489 CSKNAGNSCLSEILQRLASGTTVQRLNSLDVISEVILMSKD--SFPSHIPWKEIADCLLK 548

Query: 555 CLEDEEILICKQAANLLPCI 573
           CL+DEE  ICKQ + LL  I
Sbjct: 549 CLDDEETCICKQTSELLKSI 557

BLAST of MS016640 vs. TAIR 10
Match: AT3G57570.2 (ARM repeat superfamily protein )

HSP 1 Score: 492.3 bits (1266), Expect = 5.4e-139
Identity = 266/564 (47.16%), Positives = 377/564 (66.84%), Query Frame = 0

Query: 15  SAPQSMVSVTVGRVMATLLGARPKKLADAVSRLFPDHRRGASSLDSLDESLWFLHNYVRD 74
           S P+S+VSVTV R M+TLL ARPKKL +++SRL PD ++G S   S+DE+LWFL   V D
Sbjct: 9   SEPESLVSVTVARFMSTLLSARPKKLRESISRLTPDSQKGVSG--SIDEALWFLEKCVID 68

Query: 75  AAQNHASFDEILVPMIEHSLRFKDRNWKRGGQVMVLLNWLFLEELIFQPLIKNLADIIAR 134
           AA+   +  EILVP+IEH+LRFKD   K G   M+LLNWLF +E++FQ + +NL++II R
Sbjct: 69  AAERDEAMSEILVPIIEHTLRFKDS--KHGNPAMILLNWLFQDEVLFQAVSRNLSNIILR 128

Query: 135 KDDRYVALGWCILVRSLVEYESVASEFSSNGLRERFNDMLKVFCTCIPRLACILSKG--- 194
            +DR++ALGWC+L+R LVE E    +   +G+RE+ +  +++  +C+P L  I+  G   
Sbjct: 129 NEDRFLALGWCLLIRRLVECEDTGDQGFWHGIREKHSMFVEIVSSCVPHLLMIVRNGRYK 188

Query: 195 -SSMQEGFELPSRLSVCAADCVVSLTNALT-RKSEVQTRQKRLNSSSSHQQVTFFSTIVD 254
            S   +G+E+PSRLS+ AADC++S+T AL  R + +  R K    + SHQ V     I  
Sbjct: 189 TSLSMDGYEVPSRLSLSAADCLLSITGALAKRDNTLINRPKSPTITGSHQPVALTPNI-- 248

Query: 255 DQREKQISNSSKDSDSGMEYLLWHQLKDLVILVQRLLAWSRKSRPLHAKGLEQVLTWLHE 314
            +++K+ ++  +DS+     +LW+ ++DL  LVQ L AW+RK+R LHAKGL QVL WL E
Sbjct: 249 SEKKKRPTSLPEDSNIETNCILWNHMEDLTRLVQCLFAWNRKTRLLHAKGLSQVLKWLEE 308

Query: 315 INVHYGNFQNEAGKAKSNILQTGALLLSSCWRHYSILLFLEDYRFSQHYQEWLNQYLSGI 374
           +  H+G  Q EAG   + +   GALLLSSCW+HYS+LL +ED +FS+  +E L QYLSGI
Sbjct: 309 LKEHHGGSQKEAG---TEVSMGGALLLSSCWKHYSVLLHMEDQKFSKISKELLEQYLSGI 368

Query: 375 QYYSGLHTGEHIGNKDGR-ETTIFFLNCLCLLLGRLDSKRFESTISEYGTQISQVLLLQF 434
           +YYS  +       K+G  ET  FFLNCLCLLLGR + K+FES +SEYG ++  +LL Q 
Sbjct: 369 KYYSESYPQGCSDTKNGGIETQKFFLNCLCLLLGRFEGKKFESILSEYGMKLVPILLHQL 428

Query: 435 HSTDEDVIDEVVSIFKAVFLNSNLSSGGSIPDIRQLDVVMPLLLNLLDERDMIARAVTIL 494
            S +E++ + VV+IFKAVF      SG S  D   +DVV+P LL+LLDERD  A+AV++L
Sbjct: 429 RSNNEEISEGVVAIFKAVFFKLQSQSGDSFSDTMCMDVVIPSLLHLLDERDGAAKAVSVL 488

Query: 495 ISECCVMSGDNQFLSEVFKRFDSDSIIQRRNALDVISEIVQMSSNTRNLLTQSAWQDTTN 554
           +++ C  +  N  LSE+ +R  S + +QR N+LDVISE++ MS +  +  +   W++  +
Sbjct: 489 LADYCSKNAGNSCLSEILQRLASGTTVQRLNSLDVISEVILMSKD--SFPSHIPWKEIAD 548

Query: 555 RLLKCLEDEEILICKQAANLLPCI 573
            LLKCL+DEE  ICKQ + LL  I
Sbjct: 549 CLLKCLDDEETCICKQTSELLKSI 561
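
Each HSP summary line above reports identity and positives as a count over the alignment length, followed by a percentage. The following is a minimal sketch, not part of the database export, of how such a line could be parsed and the percentages re-derived. The names `parse_hsp_stats` and `percent` are hypothetical helpers introduced here for illustration, and the pattern assumes the exact line layout shown in this report.

```python
import re

# Pattern for a summary line such as
#   "Identity = 484/572 (84.62%), Positives = 517/572 (90.38%), Query Frame = 0"
# The optional "i" also accepts the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract identity/positive counts and reported percentages from one line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP identity/positives line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def percent(count, total):
    """Percentage rounded to two decimals, matching the report's precision."""
    return round(100.0 * count / total, 2)


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 267/560 (47.68%), Positives = 379/560 (67.68%), Query Frame = 0"
    )
    # Recompute the identity percentage from the raw counts.
    print(stats, percent(stats["identical"], stats["aligned_length"]))
```

Recomputing the percentage from the counts is a cheap consistency check when these lines are copied between tools; the reported value is simply the count divided by the aligned length, which includes gap columns.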

The following BLAST results are available for this feature:
NCBI nr
Match Name       E-value    Identity  Description
XP_022151623.1   0.0e+00    99.30     uncharacterized protein LOC111019538 [Momordica charantia]
XP_038882127.1   5.4e-274   84.62     uncharacterized protein LOC120073376 isoform X3 [Benincasa hispida]
XP_038882125.1   5.4e-274   84.62     uncharacterized protein LOC120073376 isoform X1 [Benincasa hispida]
XP_023543535.1   1.2e-273   85.17     uncharacterized protein LOC111803391 isoform X3 [Cucurbita pepo subsp. pepo]
XP_023543533.1   1.2e-273   85.17     uncharacterized protein LOC111803391 isoform X1 [Cucurbita pepo subsp. pepo]

ExPASy TrEMBL
Match Name       E-value    Identity  Description
A0A6J1DCN8       0.0e+00    99.30     uncharacterized protein LOC111019538 OS=Momordica charantia OX=3673 GN=LOC111019...
A0A6J1EPJ3       9.9e-274   84.82     uncharacterized protein LOC111435467 OS=Cucurbita moschata OX=3662 GN=LOC1114354...
A0A1S4DUC1       5.6e-269   82.52     uncharacterized protein LOC103486160 isoform X7 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S4DUB5       5.6e-269   82.52     uncharacterized protein LOC103486160 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S4DUA1       5.6e-269   82.52     uncharacterized protein LOC103486160 isoform X4 OS=Cucumis melo OX=3656 GN=LOC10...

TAIR 10
Match Name       E-value    Identity  Description
AT3G57570.1      1.2e-141   47.68     ARM repeat superfamily protein
AT3G57570.2      5.4e-139   47.16     ARM repeat superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term    IPR Description         Source       Source Term  Source Description              Alignment
IPR011989   Armadillo-like helical  GENE3D       1.25.10.10   -                               coord: 379..573; e-value: 5.3E-6; score: 27.3
None        No IPR available        PANTHER      PTHR37743    ARM REPEAT SUPERFAMILY PROTEIN  coord: 10..572
IPR016024   Armadillo-type fold     SUPERFAMILY  48371        ARM repeat                      coord: 21..572

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name   Type
MS016640.1     MS016640.1    mRNA