HG10007167 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10007167
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: protein MON2 homolog isoform X2
Location: Chr10: 1988532 .. 2012047 (+)
RNA-Seq Expression: HG10007167
Synteny: HG10007167
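
The Location field can be turned into a chromosome, a coordinate pair, a strand and a span length. The sketch below is a minimal illustration, assuming the coordinates are 1-based and inclusive (common for GFF-style annotation, but not stated on this page); the function name `parse_location` is hypothetical.

```python
import re

# Location string exactly as shown in the Overview.
LOCATION = "Chr10: 1988532 .. 2012047 (+)"


def parse_location(text):
    """Split 'Chr: start .. end (strand)' into its parts.

    Assumption: start and end are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }


info = parse_location(LOCATION)
print(info["chrom"], info["strand"], info["length"])
```

Under that assumption the gene spans 23,516 bp on the forward strand of Chr10.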
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTTCATGGCTGTACTGGAATCTGATCTTCGCGCGCTGTCCGTTGAAGCTCGCCGCCGTCACCCGGCCGTCAAAGATGGAGCTGAGCACGCTATTTTGAAGGTCCAACTTCATTCTTGCTCTTATACCAAGGGTGGACTGTGCATTTAATTGGCGTCACAATAGCTGTAATTGTATATTTTGATGATTCAATTTTAATAGCTCATTCGAAGCTGCTGGAGTTGAATTTCTGGTTTGAATCTGTGTAGTAAAGGAACCGAAACAAGTAAAGAGTGTAGAATTTCTTGTTCCGATAATATTTTGTTATTTCTCTCCGTGTGTGTGTTCGGGAGCGTGGGCGTGGGTGCAGGGGCTATTTTCAAGAGAAATGGACATTTTCTTCAATTTCTCTTGCAATGAGCCAAATTTCAACTTTTTTGGGCTCGAAAGAAATTTGTGGCTAGATTGTTAGTGTTGCGACGTGCTAGCATCAGTCGATGAAGCATGAATGCTACAACATAATTCATTATCTATGCTTCGATATATTAGGTTGTCATGAATTTGATCTCACAGAGTATCGAGACTGTACTGAATCTTGATATTTATTTTCATTTTTTGGTTTGCAGCTTCGATCAATGTCAAGTCCCAGTGACATTGCAGAAAATGAAGACATATTGCGAATTTTTCTGCTGGCCTGTGAAGCCAAAACTATAAAGTTGAGTGTAATTGGGCTTTCGTCTCTACAAAAGCTCATATCACATGATGCAGTTAGTCCTTCTGCTTTGAAAGAAATACTCTTTACGCTGAAAGATGTAAGTTCTTAATAATAGTCATGGTGTACTGTCTGCTTTACTGTGTTAAGGGAAATTATTAACTTAAATGACTTAATTTCATCGATTACTTTTATTACATTAAGGGCTATATACTGCTTATTGTGCTTTTCATTACCGAAACGTTTATTTGAGTTGATCAACTATTCTTGTTATGTTAAGCTAGTTACAGTTAGCTTATGGTTTTTGTAATAAAATCTAAGTCTACAGTTAGCTTATGGTTTTTGATAATAAAATCTAAGTCTACAGTTAACTAATACCAATTGAAACGTAAGTTTGTTACGCTTCTGCCACCCAAAGCGTTGTCCTCTTCCCTCTAGTGGGTTTCAGTTCTGGGAAGTGGGAACTACTGCCGTTTGGTTGGAGAGAAATTCTCTTTTTGTCAGTGGGTTCCCGTTCTCACCTAAATGAAACATAAGTAGCCAAAAATTCTGTCTTTACTTGATGAGTTCAATTTTAGGTCTGGGAGAAGAGACGATAGTGTTTGGAATCCTAATCCTATTGAAGGTTTTTCTTGTAAGTACTTTTTTCTGTTGTTGGATACCTCTCCCACTAGCAAGTTAGTCTTTGTTGTATTTTTTTTTTTTGCGGGTTTGGTTAGGTAATAGAAGCTAAGGAGAGATGCTTTCCAAAATGCCCCCCACCAACCCCAATGCGAACATTTTGGATAGGCGTTTGAGAAAGAGGCCTACATTAATGGGTTTTTCTTTTTTTTGGGGAGGGGGGGGGGGGGGTTGTTGTATTCTTTGTTGGAAAGAGGAGGAGGAGACCTAGATCACCTTCTTTAGAGTTGTCAATTTTTGAGATCTGTGTGGAATTATTTCTTTCAGGAATTTGATCTGCAAGGGTGTGTGCTTTATTGTGGGATATTTGGGAGGAGAACAACAATAAATGTTGAGAGGACCCTTATGATGTTTGGTCCTTTGTGAGGTTCCTTTTCTCTCCAGGCTTCAATTTTGAAGACTTTTTGTAATTATTCTCTAGGTAGCATCGTACTTAGTCGGAAACCCTTTCTTTAGAATGGGTTTTGCGGGCTTGGTTTCTTGTATGCCCTTATATTCCTTCATTTTTTCTCAATGAAAGCAATTGTTTCCATTAAAAAAATACAAGTAGCTAGCGGAGAAAATAATCCAGCCTTTCAAATAATTTTTTTCTTCCTTTTAATAAGACGTGAAGATCACATTATAT
AAGCCATCTCCGTCTGTGGCTCATCATGATAATACTTCTACACCCATATTTCCTAGTTAGTTTTATTTTTATTTTTCTATTTTCTCTCTCAATGGAAGATTGTATCCTTGAACCAATTCCATTTCTTTTATCAATCAATAGACAAATAGTTCCTTGCTGATATATATATATATATAGTTTGTTTGAATGCTCGAACAAAACTTTACACTTACTAAAGGGATGGAGATGTGTGTTAAACCTAACAAAGAGAGCTTGGGACTGTAATACGAAGGCTAAGGACTGACTGCTTAATGAGAATTGAACTTTACATCTAACAAATGGAGCTCCAAAATCTTCAAACAACCGTGAATTATGATAAGAGGAAGAAAGGAAGTGGCAGGGAGGGTTCCTCTCGTGTTTTATGAAGATCCTCTCGTGTAAACATATGTGGCTTGGGCGCATCTACTAGGTGCCGTGACTTCGCGAGTGGTGTTTGAGATTCCTATTTCCAGACGTTTGGTACGTTGACTGTTTGTCACAGAGATGTCAGTGCTATGATCGAGGAGTTTCTCCTCAATTCGCTTTTTGGGGAGAAAGGCCGGTTTCTTTGGTTTGCTGGTGTTTGTGCGGTTTTATGGGTTTTGTGGGGGTGAGAGGAACAGTAGGGTGTTTAGGGGAGTGGAGAGGGATCCTAGGGAGTTTTGGTCCCTTGTTCTCTATCTTGTTTCCCTTTGGGCTTCGATTTCGAAGACCTTTTGTAATTATTCTATAGGTTTTACTTAGTTGGAGTCCCTTCTTGTAATGGGAGTCCCTCATTTGTTGTAGGCTTGGTTTTTTGTATGCCCGTGAATTCTTTCATTCTTTCTCAATGAAAGTCGTTATTTTCATTAAAAAAAAACTGAAAAAGGGCTTATGATTAAGGATTTGGTGATGGATTGTAATTTGGATGTTATTCTTTGGGAAACAAAACCCCTTTCATTGACAAAATGATGATGAAATCAGTTTGGAGTTCTAGAGCCGTTTGTTGGGCTTATCAACCATCCGTTGGAGCTTTGGTAGGCATTGCAATTTTATGGAATGAAGACTTAGTCTACACAGATGATTCTCTTGGGGGTTTTTAGGCCCTCCGTTTAATGTGGCAATCAAGTTGATTCTAGGCTCTAGGTTGTTTTGTCCAAAGACCTCCATCTCAAAATCCTCTTTCTGGTTCAAGAAAAGAAAAAGAAGTAGTTGATTTAAATTGGAATTCTTTGTTGGTAGTTTCTATTTTGAGAGAGATCGAAAAGGAGAGGGTAAATAGTAGCAAGGAAATCGTTTGCCATCCATTGGTGGGATAAAAAAGAAGTATGAGCACCTATCTCCAACCCTAAATTTAATTATGTTTTTCACAGGTCTTTGCTATTTGATAAGGAAGTGCTCCCATTGGTGGGATAAAAAGGATGTATGTAGCCTGCAAAAGTGCGAGGCAACCGCCTTTGGAAATGAAAGAAGAGCTCCATGAAGGGAACCTTCTTTCAATTTTTTCAATTTAAAAAATGGACTAAAAGGATATAGATTTATGAGCACCATTTTAAGGGAGACCCAACTGTGTGTTTGGCTTGTGACCCACCTTACAATTGAATCTTTGAGCAATAGCCTTTGTTAGATTACCCTTAAGGTTGATTCCTGAAACTTTTGACTTCTGGAGAATGGTTTGATTCGAAATTTCTTGAGGAAAAAACCCAATTGTTGACTGAGCTTGAAAAAAATCATTGATGTGGGGAAGAATAACCTTCATTATCTTGCATTGGTCATCATGGAAGAACTTTTAGGTGACAATGATGGTGGTGTCCTAGTGAGTTATTACTAAGGGTAGGTGTAGGACATAAGGAAAATGGATTTCATATCCATGCTTTTAGCGACGAGGGTGGAGGGGAAAACGATACATATGGGTGAGATAGAGGATTAGTAAAAGGGGGCCAAGTGGGTAGGTGGGGTAGGTTTTTGTGGGTAACTTCTCTTGAATGTGAGTTAGTAGGGTA
GGTAGAGTGGGACTATAAAAAGCCTTTCCATCGTTTCCTTTCTTCACCCCTGGGACAGTGATTTTGTTTAAACCTCCATTGTTTTGAAGAAGGGTGATCTTTGGTATCATTCCTACTTTGTTTAAAGACTTTCGAGGCCAAGAGGGGTGGTGAGGATAGATGATTGAGGAGTTTGGCTTGAGTCTTGCTCAACTCATGCTATTTGGAGTTGTTTCTTTGAGGATTTTGGCTTGAGTTTTGCTATTTGGAGTTGTTTATTTGGGTTGTAGAGAGATGGTCGATGAGTTTTTCTCCACCATCCTTTTTGTGAGAAGGGAAGATTTTTGTGACAGACCAGAGTGCGTGTTATCCTGTTGAGTCTTTGGGGGAAAGGAATAACAAAGTTGAGGGTTCAAGAGGCCTCTGAGTGACGTTCGAACTCTCATTAGGTTCATTGTTTCTTTGTAGGCTTCAAGGGCTAAGACGTCCTGTAATTACCTGTTAGGTTTTATTTGATTTGATTGGGGGCCCTTCCTTTAGTTTGCCTCCTTTAATGGGCTCCATTTTTTCTGTATGCTCTTGTATTATTCTTTTCCTCAATGAAGGCTCGATCATTCACCAAGTGTGCAAAACTATGGCCGTATTTAGTCTATTGATTTTATACTTGTGAAGCCCTTTCCTAATATTATTATTTTGGAAAAAAAATCATATACCCCTAAACTTTGAGGTTGTATCACTCAAAACCCCAAACGAATAATTGAATAAATTTAAACCCTAAATTTTCATAAGTTAATTAATTTAGACCTTTTGTTAAGAATTTGTTTTAAATTGTTATAGTTATTACATAATCTTCACCCTCGTATAAGACTTCTTTAAATGCATGGATTAACATAAAAAAATATCACTATTTCATCTCATCAATGAAATTATAGTATGTTTCTTATGAAAAAAAATAGTTCTGAATGATATGAGTTCGATCCTGATTCCTGCCTCTGACATGTCAAGTTAAATGTGAAAATGAATTTTCATATTGTTCTAATATAAATTTCTTTTACCCTGCAGCACGCTGAAGTATCAGATGAGACTATTCAGTTAAAGACACTTCAAACTATACTAATAATTTTTCAATCTCGTTTACATCCTGAAAGTGAGGTAATGAGCACTTGAAAACTTTCTCTCCATAAGATAACCACTGTGCAAGAAACAATACATCATTAACACGTATAAACGGACAACTGAACAAATTTTACTTCTCTTGTTTATTTTTTGTTGATGGGATATTATTTTAGATTGCACGGTTCTTAGATTCAAAATCACGTATGATATGTCATTGTGCACATAATGTTGGTTTTTCATTGATGAAAAGAAGAGAGACTCATGTTTAAACTATACAAGCTCTATAAGGGAGTGAAATAAAAAAGGTAAATGAAAGCAATAAGTAAAGAGAATTACAATGGAATGATAAAGGCATTTTGGCTAAACCATGAAACTTGTAAAGAGAAATCAATAAAAGATTTAGAAAGAGAACACCATAAAAACTCTTGGAAATGAGCCGAAACAACGCGATCTATGTGAGGGAGGACCTTGATATGGAAGATCCTTTGATTTCTTTCAAATCGGATCTCAAAATGAAAATCTTTACCAACAAAAGACCACAAAAGATTAATTTGGAGCTGAAGATGGAATATTAAGCAATTTAATGAACTTGCATCATTTATGCAGTTCTACTGATTCTCGTTTTGGAATGATAATTGGCTGAATTTTTCAACCCTTAACCTGAGTCAATTTTGGGGCTAAGTCAATTTTGGAGCGCAACATGAGAAATTTCTTTTGGGAAGGACATAAAGGGGGAAAATTAAACCACCTAGTGAAATGGGAGATTGTCACTCGAGCACAAGAAGATGGAGGGCTTGGCCTTGGTGGACTAAAATCAAGAAACTTGGCTTTACTAGCTAAGTGGGGTTGGAGATATTTTAATGAAGAGAACTCCCTTTGGTGCCAAGTAATTAAAAGCATACATG
GTAGCAGCCCTTACAATTGGCATACGGCTGGTAAAGCTTGTCATAGCCTTCGAAGCCCTTGGATAAGCATTTCGAGGTCTTGGCTTAAAGTGGAAGCCTTGGCAGTTTACAAGCTTGGATCCGGGAGTAGGATTGGTTTTTGGCATGATCCATGGATTGATGATTTGTCTTTGGAGGTCAGATTTCCTTGTTTATTCAAGATTGCCCTAAATCCTAGTGATTCAATATTGGACCATTGGGACTCTTCTACCTTCTCTAGGTCAATATTCTTTCATAGACTTTTAAAGGAGGAGGAGATATTACATTTTCAGAACCCTCTTAGCCTAATCTCAGCCCAAAGAGTAAAAGAAGGCCATGATAAAAGGGTATGGTCACTAGAAGGCAGCGGTACATTCTCAGTTAAGTCTCTTGTTACACACTTATCTTTGGCCTCCCCTCTTGATAGCCATTTGCTGAAAGCTTTATGGAAGTCCAGGAGCCCTCGAAGAGTGAATATTACAATGTGGATCATGTTATGTGGTCATTTAAATACCAATTCAGTTATGCAAAAAAAGTTGCCTATCTACTGTTTGTCCCCCAATATTTGTTCATTATGTTGGGCTAATTCAGAAGAACCACAACAATTGTTCTTTGAGTGCAATTATGTAAGGAATTGCTGGCACTGACTGTTTAGCATTTTTAATTTGTGCTGGGTCTTTAGCAATGAGTTGCGGGATAATATTTTGTAGATTTTAGTTGGCTCAAAGCTTAAGTCATCCACAAGATTACTGTGGAATAACGCGGTCAAAGCCTTACTTGCAGAAATTGGTTTCAAAGAAACCAAAGAGTGTTTCATGATAAGCCAACATCATGGTTAGATCGGCTGGAGATTGCTAGGCTCAATGCTTCCTCTTGGTGTTCCCTCTCTAAAGAATATGCAGATTTCTCAATACGAGATATTAGCCTAAATTGGAAGGCATTTGTTGTTCCAGCTCAGTAATTGTGATCTGCAACAGGTCTTATGGTTTAAATTTGTGCTAGTGTGGATGTTTGTAATTTTTGGTTGGTGCTGTGAATTTTTCCAACATAGTAATTTGTAGAATATTTGGGCTGGAAGTGTTTTTATTTGTGTCCCATTGTATTCAGGATGTTTTTCTTATACGGTTGAACTACTATGTGGCCTTGGCTTGTGGTGTTCTTTTTTATTTTATTTGAATATGATGAGAGTGCTATGGAGGTGTCAACATAATTGAGATGTCCGGGTGCACTCACTGATCCTAAGGATTTTATGTTTGTTTCCCTCATTGTAACTTGAGCATTAGTCTCATTTCATTATTTCAATGAAGAGACTCGTTTCCTTTTTAAAAAAAACCCTTAGCCTGAGTCTTCCTTGGTTATTTGGTGGTACTTATTTGTCATTTGCTTGATTGCATTTCTCTGGAGATCCTGTGCCGCAGACTTTTATTTTATTTTAGAAGGCCTTAAAGATGCAGATTTGTGTATTTTTAGCAGCCGCTTCACGTTTAATCAAGTTCTAAGCAGGGGGTTCTTCCCGTTTTAAATTGTTGGTAGCTGGAGTCGTTTGGGAAGATCGATGTTCTATCTCTATGATTTTTGTATTTTCTTTATTTAAGTTGTAGTTTTGGAGGGTTTTTTCTCTTTTGCTTAATTATTGTCATTTGGATGTATTTGCTCTCATTTTTGTTGTTTTTCACCTACTCTCGAATTTTCTATTTCCCTGTATTTTGAGCATTAGCCTTTTTTTCATTATATAAAATAAAAGTTCTGTTTCGTTTCCAAAAGAAAGCTTGAATCTCACCTAAGCTGATAATCTTAAGCCAAAATTATTCTATGAATACTTGCGAATGAATGCCGAGCCCAGTAACGTGGACTTGACCTGAGTGGGATGTCATTAAGAGAGTTTGGAAGGTATGGAGTAGGATAGACTGATCGGATTGGGGGATTCGTTTGGTTGATCGGGATTCCTTATGGTGGGTGACCGAGCACAAAGAGAAAGTGTT
TCTTGCCATAAAAAGGATAGAATACTCTCAAACTTGACTTATCATCAATGATTCTTTTATTGTAACAAACAAAAATTTGTCTGCTTATTGGTATTGTAACTTTATGTGCGACCCCAATTATACACACGTATGAAAAGATGGGTAGAAAAGGGACTTCACAGCTGACAAATCGATAGTGGGACGTGGAATTCTGTTATGTGAGAAGCGTGGGGACCAACTGTGGGGACTTGAGGGGTTTATAAAGGTTTTGGAGGATTAGTTCTCGGATGTGAAGAATTTGGCAAGAAGTCGAACTTGGGGAGGATCTCAGGCTCTCTTGAATGTGCTTGATTTTTATGTTTCTGCATTATTGTTTATTTTGTTCTTCCTTTATATTGTGCTATGTTGGAATTGTGAAAAACTTTGTTATCTTTATTGTTCATCAATAGACTATCAGAGAAGTCGATATTCTCCAGTTTTTTGGAATCTGTTGGGTTTTTTTTTTGTTGCAGGATAAATTGAAACCTAACAATTTGTAGTGGTATTCAAAAGTTTTCAATTGTGTAATTGCAGGAGAATATGGCTCAAGCTCTTGGTATCTGTATTCGGCTTTTAGAAAACAACAGATCTTCTGACAGTGTGCGGAAGTATGTCTGTATTTAGTCGATTGAATTTAACTTATGAATCCTTTTCTAATTTTATTATTTTGGAGAAAAATCAATTTATACCTCTAAAATTTTGGGATTGTATAACTTTTTTTAATCTCACGGGACAACCCGCCTGACCCTACAACATTTGGGTGTCAAGAAAACTCGTAAAAAATTTATTCCTAGGTAGGTGGCCACCATGGATTGAACCCATGACCTCTTAGCCCTAAACTAATAATTGTATCAATTGAAATTCTGAACTTGCATAAGTGAGTCAATTTAGATACTCTAGTTTAGATTTCATTTTAAGACCATCAATGCATAAACTTCATACATGTGTAGGACTTCTTTAAATGCATGGATTGATACAATATTAGAGAGAAATGCTCGTACAAAATAGATAGTAACAATTTTTCAGTAATATGAGCGATTTTTCTTCTCCTCATCTTCCTCTATTTTCTCTACATATTTAGGAAGTACATGATAATTTTCTTATCACTTTACTAGATTTTCGGAGAAGAATAATTTTGTTTAGGTAATTTTAAGTTGATTTTAAATGGAATCAAGTGAAAATAGTTATATATCACTAATGGAGAGTTTTAAGTTGATTTAACTAGGAAAGTTTAGGGTTTAAACTGACACAATTATTAGTTTGGGGGTATAAATTGATTTTTTCTCTATTATTTTTCATCAGCTACTTGCTAATTTTCCACTTTATCTAAAAAAACCTATTTCATTTGAGTGCTATACAGTACTGCAGCGGCTACCTTTAGGCAAGCAGTAGCTCTGATTTTTGATCATGTAATTTTTGGCGAGTCGCTTCCGGCTGGAAAGTTTGGTACTGGAAGTCAACACTCTCGAACCAGTATGGTTATTTCTGATGTTGATCGTAACATCAATAGCTCAGAGTATGCTTGTCATTTGATTGACATTAATAATGATCACTTTACTCTCAGTTCACAGCTTTCTATGGATAGTTAATATTTTGTTGTGCTATTTGAATGTTATTTAATGTGTGCCTATTGAAGTTCACTTTTTCTGCAACCCATGTTTGTCTAACATTGAAATAGAAGGAAAACTATTGTCTTCCTCTGGTTTCATTGTTCGTTGGGGGGATATCTGTTGTCCAATTTATTTATTTTGGATGAAATATCGTGCTTTCAAGAAAAATAAATGAAACAAAGGGGGCTTACAAAAACCATAGCCAGAGAAAACAACCATCTTTTTATGTTCTGTACAATGTCCTGTCCCGTCTCTTCTTTATTATCCGTCAATTACTAAATTTGTTTTCCATTAAAAAGGCAGAAGTTATTATGTACTATAAGCATGGTGTTGTGACCTGAACTACTTTATTTCATTAATCATGACATGAAC
AGGACACTGAAAAATGGGTCTCTTTCTGGAGGGCCATTGTTGAAGCGAGAGAACTTGACTAAAGCTGGGAGGCTTGGGCTACAATTGCTTGAAGATCTTACAGCTCTTGCCGCAGGTGGATCTGTATGTACTCACTTTAAGCTATTAGATTTTTTCTTTTATTTATTTATTATTATTTTTCTAAAATCTAAAATAAAAGAAACGAAACTTTTCAGTGATGTAATGAAAATAGACTAATGCTCAATGGATACGAACTCCACCAAGGAGTGACTAAAAACAATAATAAATAGTAAGAGTTAAACTATTATGTTGTGAATTAAGATAATTCTTGCACACCTTTGGGATTTTCTCGTTATGCTAAAACTAGTTGGGAAATTAGTCTACATCTTCACATTCTTCCTTTTAGTATTTTATACGCTTAGGGAAATTGTGATTAAATAGTTTTGTTTTTGTTTTTTTCTTCCTTTTCAATTTCTTAGTAAATGACGTTGGTTTTCTTCTACAGGCAACTTGGTTACGCCCGATTTCTTTCCAGCGCACCTTTGCCCTTGATATACTAGAGTAAGCAGCTCTCTTCTTAGAAGGTTTTGTATACTTAACCACTATTTCAAGATAATTTCGATTAGGATACTTTTCCACGATGTCGTATTATCTTTTCACCTATATCTATTCCTTCTAGAGGTACAAAGAGTAATAATTTTGATTATCGACTCAAATTTTTCTTGTTAAATACAACAAATACACATGCAGAAACATATTGTATTGCAGCTCCAATTTTGTTCAGATGGTCATTCTATAAAAAATAATTTAGTCATCCTAACTAAGGATGTGGGAACTGAAAAGATATTATGGACAAAAAGAAGAACAACAACCTAAGGACCGGGGGTAGAGGACATCCCCGTTCAAGAACTACATGATGAGAGCCTTCTAATCAAGATGCTGTAATTACAAATGAAGACTCAATTTACTAGAATTATTGCTTGTCATTCTTCTTCCAAGGATCTCTCTTGCCTCTATGATTATGATGATGACTCGGTAGTTAGCGTGAGTAGTGAAGAATATGATCTTCATCATTTGGACCCAGTTTTTGAGGAACCATTGAACGTATTGAGGAAGCTTTTGATTTTTTTTTTCCCAAAGGGATGAAGAAAAGCAGTGTGTTCAAGATACAAGGTTGAGTGCTTCTCCTTTGTCTCACTCTCATATACCATCCAAATTTTCTTCAATTGTAGAAGCCTGTGGTCTTCAATTGTGTAAACTGGGTTTCTGTCATTGGCCTCTCAGAATGCTCCTTTGCACATTGTCTCGGTTTCAGTCCTTTGGGTTGAAGCTACACTTTCAAGGATTCTCTCTTCTTGAAGATTGTCAAAAGCTGTTGATTCATCTGTTGGGTTCTTCATTTGCCAGCTTCCTACCCTCTTTGTTTGGCTGTCTGCATAGTCTTGCAGTGTATCTTTTTTGTGTGCATTTTTTCAAGCTTTGTTGGCGTTAATGTTCTTCTGTTTAAGGTTTGGTTTGAAAGAGTTCAATGGTTTTTTCAGGATAAGTCTCTTCTTTTGATTGCTCACTTTGTTTCAGTAAGGTTTCAAGCCTCTTCACATTCGACATGGATTGATTCTATTTCTCTTGCTTATTTTTTGTTTTTGTTTTAGTATTTTTGTTTGGTTTCCTTCTACTTTGAACATTAGTCTCTTTTCATTATATCAATGAAAAGCTTCATTTCCATTTTTAAAAAGCAAAAAAGTGGAACTTAGGTTTGAAATTTTTGTAATAATGTGAATTTGGCTTTATTAGCTAAATGGGTCTGGAGATATATTAATGAGCCTCACGCTTATTGGAGAAAAGTGATCACAAGCATCCATGGCAAAGATTTTTATGATTGGCACAAAATGGGTAAATCAAATCTTAGTCTTCGTAGTTCCTGGGGAATTATTTCTAAAGCTTGGAAGTCAGTAGATTCTTATGCTTGTTTTTCTTTAGGGAATGGCTCTAGAATTCAA
TTTTGGCATGATGTTTGGATCGGTGAAACTCGTTTAGAGTTCAGCTTCCCTCACTTATTAAGTATTTGTACCATTCTCAATGGTTCCGTTTCAAGTTTTTGGGACTTACTCTGCCATGCTTGGAATATTTTATATAGAAGATTATTGAACTCGTTATATAGCAAGCACTGTTTCCTCCCCTTTATGAAAAGATTTGTTCTCTGCCTTAAGGAAATCAAAGTCTAATTCTTTCCTCTCTTTCTGTTTCATTTTTATAATCCTCTGTGTTAGGAGTCTGGCATGTGTGTTTTTCTAAAATGGCTGTAACCATTTTGGAATTGGGATCTTCAATTAAATAAGGGAAACGTATATACTCGTACATGGAGTCTCATTGTAACTCTTCACTAATTTAATGAAATTCATTTCTCCATGTTCTTTTTCTTAATATTTATTTGTTGCTTGTAGGTTCATTTTGTCAAATTATGTTGCTGTTTTCAGGATATTAGTTCCATATGAACAGGTGATTACTATAGCTTACCTAAGAAAACTCTCTTTTTGCGTTTATCTGTGTTTTTTTGATGAAAGGTTTAAAGCTGAAAAGATTGCTTAAAAAATCAAACAGGTTTTGCGCCATCAGATATGTTCCCTTCTCATGACATCACTTCGTACAAATGCTGAGGTGATTAATTTGGGAAATTTGTTCAATAAATCTTTCACTTTGGTAGCCATTCTTATTGAGATCTTGGTAACAATTGGCTTCCTTTTGCCCCATTTCTATGTCATGTTAGCTTGAAGGGGAAGCAGGGGAGCCTTATTTTCGACGTCTAGTCTTGCGGTCAGTTGCTCATATTATCAGACTATATAGTACATCCCTCATCACTGAATGTGAGGTCTGTTCTATAAGAATAAACTTTTTAATTGTGACATGAAAATTTGAATCATGAGCTTTATCAATTGTTGTTTTGTTATTGTTTAACAACCTTAACTCTTGGTATACGCTTATCCAACCTTAATTAGTTCACAGATTAAGTTCTACACTTATCCAACCTTAATTAGTTCACATATTAAGTTCTATGCTTATCCAACCTTAATTAGTTCACATATTAAGTTCTATGCTTATCCAACCTTAATTAGTTCACATATTAAGTTCTTGCTTTATTTTCTCTGGTTAGTCTTTGATTAGTGTTGGTTAGTCTTGGCATTGAACGGATTAAGTTCTTCTTGCTTTATCTTTTTATTTCTTTATATTTTGAGCATTAGCCTCTTTTTATTTTATCAATGAAAAGTTCTATTTCCTTTAGAAAAATAAAATAAAATTAGTTCGCAGATTTCAATAGCGCGACTTAAATGTTTCATATGTGTCTTATTTATATGTTTTTTCTAAACAAGAAACAATGTCTCATTATTTAGTGACAGAAGAATGTTAAAGGATACAAACTACAAAGGAAGTGAGAATAAGAGAAGAAGAATCACGAACTTAGAGAACAAAGCAAAATCAAGAACTTAGAGATAAATCAAGCAAATAGAGGTCAGCTGAGAGAACAATAAATCTATTAAAGAACAATGGATTCATCTATCCAATAATCCCGCCTTCTTTGAACCAAAAATCTATTAGAGCTTGCAAATCATGGCAAAGGTGAAGAAGGAAGGCTCCACCAATCACTCCAAAAGGAGCTGCAACAACCCAAAACACACACAGTGACTGGAACGGTACCAAAAAGTAAATTTCAACCAGTGCTTGTTGAAAAAATGCTGGATAGAAGCTTGTTACTTCCATTTCCATCTATCTTGTGTTTTACAGAAATGCTCCCCCCTCTCTCCTCTCTCCCAATCTAACCTTCCATAACTGTCCCTCCGCCATGGTTTCCACCGTGCAACTGCCGTCTGCCTCCTATCCCAGCCGTTCCATCCTCATTGATCGGAAAGTCTTCACGCTGCATTGTGATAAAACAACTAACGGCCAATCGGTGATAATCACTGAACAAGGAACATCATCCACACAATCCCTCAGCCTCTCTTTG
AAAGCACTTGATTGGCTGGTCTCCTCGTTCAATTCTCTCCATCGAGACCCATGCAATTACAAGTTCTTCAAGAAGTTCAATGATGTTGATGCTACTCTTTGGCTGGAAAAACAAATCAACGAAAGTGGCTACTTTGTTGAATTAACCCACCTATACCACATAAGGCATCGAACCAAATTATTTATCCCATGAGAAGATAAGAAACAGGGCTGGTTCTCATTTTTCTCTCCCCCCCTAGTCACAAGACTGTGGTCCAATCAAAACCCATTGGTAAAGCATCTCCAAGGATTCAATTGCCGAAAGGCACTCCACTGCCCTCTATCTCAACAACCAATGCTCATTATCTTCCTCAATTTGATTGGTCTTCTATGGTGGTGGTGCAACGACATAAATTATAGGATTCCTGGTCAGACATGGTTGAAGATTCAATTCTCGCTGGGAACTCGTTGTACTATCAACCCATTCCTTGATGAAAAGGATCTGTTACTTGTCTACGACTCACAGATTGTTTGAAAGAAACCAAAGAGTGTTTCACGATAAGTCAACTCCATGGTTAGATCGACTGGAGATTGCTAGGCTCAATGCCTCCTCTTGGTGTTCCCTCTCCAAAGAATATGCGGATTTCTCACTACAAGATATTAACCTTAATTGGAAGGCTTTTATTGTTCCAGCTCAATAATTGTGATATGTAACAGAGAGTATGCTTTAAATTTGTTCTAGTATGGATGTTTGTAATTTTTGGTTGGTGATGTGTATTTTCCAGCATAGTAATTTGTTCAATACTTGGGCTGGAAGTGTTTTTTAGTTGTGTCCCATTGTATTTGGGATGTCTTTTCTTATACATTTGAACTTCTATGTGGTATTGGTTCGTGGTGTTCTTATTAAAAATGAACGAGGTTTGCTCTCTTTATTTTGTTTTGGATATGATGAAGAGTGCTATGGGAGTGTCAACATAGTTGAGATGTCCGGGTGCACTTGCTGATCCTAAGGTGTTTATATTGGTACCCCTCATTGTAACTTGAGCATTAGTCTCATTTCATTACTTCAATGAAGATACTTGTTTCCTTTGCAAAAAAAAAAAAAAAAACCAACCAAGACTTATTTAAAAAGACCAATCTGGGACCATCAACAACCTTCTTTCGAAGAGCTTCAACTATACAATCAAGAAATACTATTCCAGGATCTACTTCCAATCAACCGACCAAGAAGGATGATCCAAATGCTAAGCAACCGAATACAGAAGGAGCTGAGTCTAATCTTCCAAAGAGAAATCCGAATCCTTGCAATCGTCCAACCTTGGGCAAGTTCTTTCGATGTGGTCAACAAGGCCACCTCTCTAATGAATGTCCCCAACGCAGAACCTTGGCAATTCAAGAAGAAAAGCAGATGATGAAGATGGCATTGTTGATGAGAACACAAATGGGCTTGCATACGACCAAATGACGACGACTAGCTTTCATGTGTTCTTCAAAGAGTTATGTTGATCATTGATAGTGGGAGCAGCGAAACTATTGTATCCAAGAAGCTCGTTTCTGCCCTTAATTTGAAAGCTGAGCCACATTCGAATTCGTATAAGGTGAGTTGGATAAAAGGGGGGGGGGGGGCTTCTGTCTAAGAGATTTGCACAGTGCCCCTGTCTATAGGAAATTTGTACGAAGCTCAAATTACACGCGATGTCTTGGAAATGGATGCCTGTCACCTACTCTTGGGTCGCCCATGGCAATATGACAACAAGACATTCCACAATGGTGGAGACAATACATATGAATTCAGTTAGATGAGTAAGAAAGTGGTTCTACTTCCCCTCAGTAAGTCTACCACAACAAGTAAGGTACCTTCTTCTACAAAAAAGCAACTTTTTCTTTTATCTCCCGGCATAGATCTTTTATACTTTAAAGAGCAATCTCTTTTAGTGTTTGTTGTTAAAAAAAGATCCTTGTGAAACATCTAACCAAAACCAATTAGATCCCCAGATTTCCCATATTAAAAGAGTTCCCATC
CCTCTTAGAAGAACCTAAGACTTTACCTCCACTAAGAAATATTCAACATCATATAGACTTAATATCGGGCAGTACCTTACCTAATTTACCACATTATTGTATGAGTCCTAAGGAATATGAAGCATTGCACACACAAATACATATTCTAGACAAAGAACATATACAACCTAGTCTTAGCCCATGTGCTATGCCCACATTACTTGCACCTAAAAAGGATGGCTCTTGGCGACTTTGTGTAGATAATAGGGCAATCAACAAAATCATAATCAAATATAGGTTTCCTATCCCATGTGTATTAGATTTATTAGATCAATTAGGAGGAGCATCAATTTTTTGTGGTGTGTGGGGTGTGCACGATCTTATGGGTTTTATGTGGTGAGCAAAATAGTAGAGTGTTTAGGGGAGTGGATAGGGATCCTTTGGATATTTGGTCCCTGGTTCATTTTCATGATTCATTGTGGGCTTCTGCTTTGTTCTCGAAGATTTTTTGCAATTATACTATTGACATGATTTTGCATAGTTGGAACTTGGAATCCCTTCTTGTAATGGGAGCTTCCCTTTTTTGTGAGCTTGTTTTTTGTATGCCCGTGTATTCTTTTGGAGTTGTAGCCTGGAGAGCATTACTTTTAGTGGCTGATTAGTAATCCAGTATTCCTCTCAAGATTCAAGATTACAACCTACTGTCAAAGTGCCAACTTTGAATTTGTCTGCAGTTCCTATGAGATATTGCAGAAGAAATGAATTTCCAAGGACTCTCCCACTAAAGCCAAATGTTTTATTAATACAAAAAGATTTTTTCCTCCTCCTAGGGGTTTTCCATTTCTCATATAATATGATAATAAAATCACTATTTTTAACTCATAATTGAGTTTCTAGTTATGTATGTATGTTTAATCATATATGTGTGTGTATATATGTATATATCAAGAAACAAAACTTTTCATTGATGGAAGGAAAAGAAGATTAAGTTTCTTTTTTCATATCTTTATTTATTTATTTATTTTTGTGGTACCAAAATTTAGTCACTTCTGTTGTAGGTAAGCAATACAATTTTTAGCTTCTTATGGAAAGGCTTTGAACAGCATTATACTGCTGCTCCATTTGTCTACATTAAAGCTTAGACTTCAAGAGAATCTGTCTACTGTATAACTATAGTTTTCCTTTTTTTTTTACCAAATGCAGGTTTTCCTCAGTATGTTATTGAAGGTTACTTTTCTTGATCTACCATTGTGGCATAGGATTCTTGTTCTTGAAAATTTGAGGGTAACTGTAATTAAGTAGAGTATATGTGAATTTGCGGTTATCTTAGATATATATATTTTGGGTAGAAGTTTCTAATAATTTGAATTTGTTGATCCTAATAGGGTTTTTGCATGGAGGCTAGAACTTTGCAGGTTCTTTTCCAGAACTTTGATATGTAAGCTTACTAACTCCAGTTTATTTTATATATTGTACATTCAATACACAATAGAAAATGCATGTTCATAAAGTTTAAAATCTTCGAAAATGCTATATAGTTATATAATTAAAAGAATTTATTTGGATCTCATGGAAAATATTGTAAAACATATGTGAGAGTACTAGTTAGGGATATATGGAGAAGATTAGTATGATACTAGTAAAGCTCATAATAAGTAATTAACTAGTAGTGTCTTGGGTATAAATAAGGGGAGTGTTGGACCTTAGAGGGTGTGGATGCTTTTGGTGGAGTTTCCATTGTGTGACTTTGGGAGAGACATAGTCATCTCAAAAGGCTATTGGTATATGGTAACTTCTCTTGATATTGCAATATATTGCTATCTTTGTATTCTTTGTGCTTCTTTGAGTTTTCTCATTAGGTGATATCCTGCAATTTAATAAAATAAATTGAAAAATAGTAATACTTCTTTTTTGGTTGAGGAGGATCTTCAGAATCTTTGATAACAATCATCTTCCTTGGCTTGCGCTCTTTGAATCTGCAAGTCTGAAAGCTTTCTATACGGGTTATCTATCCAACCTTT
AGGCTTTTATTTTGCAAGATTGTCTGGATTGCGCTTAATTGGTTATCAATCCACCTTTGGCTCCAAAATCCAATCTTTTGTGGTCTTTTGCTGTTAAAACTTTTCATTATGAGTTACGTTTTGAAACAAATCAAAGCATCTTCCATATCAAACTCCTCAAAACCACTGTTTCCAACTGATACCAGATCATGTTAGTGATTTTAGCCTTCTTTTTTCAGCTTCAATTTCCAAGAATCTTCTTTATCCCTGCATTGAAATTTTTTTTCGGGGGGGGGGGGGATTAAATTGGAAAGTTGAGCCCATCCTTTTTAAGCGACAAGGCAAATAAGGATGGACATTCATCCTTAAGAATCAGATTGTCCAACTATGAATCCATAAAAAAACTGAGTTTTGTTTTCCCTACCGGCATTGTGAGTGTCGACGTTGGGCTTGTAAAGGATTTGAAGAGGTTGTAATTCGAATTGATTAGAGGTTGATTGCTTTTAGTGGGAAGGACCTTGTGGAAGGTTCAGATCTCTTGACCTTTGTTTCCACAATTCTCCTACCGGCATTGTGAGTGATATTGTTGTAAACGTGGACACTGGACATTTACTTTATCAATATACTTTCAAGGCCCTTTAGCCAACTTTTTCTTGTTTTTTAAATGTAAAAACAATTGCGATGATTTGGTTGTTACATTCTTGGCACTTTTTCCCTTTAGAAATGGAATACTCAACGGTCATTGATTTTTTTCCATCCTGATTAGTAAATTTGATTTTTCTCAATAGTTTTTACAAAGATGTCAATTAGTGCTTCAGGATGAAGGCTTAGCATGGAAGAACCTTTTAGGTTTTCTTTTTTCTTGGGGGAGGCATCTGTATATTTTAGAGTATATTAATTACTGATTTTCATTGGTTTTGCAGGCATCCGAAAAACACAAATGTTGTTGAAGGCATCGTTAAATCCCTCGCTAGAGTTGTCTCCAATGTACAGGTATTTATTTAGATTGCTTGCCATTATCACCTTTTCAGGTAGCTTTGTGGCTGGGAGCCTGAGATAAATTAGTAAAAGGAGCAAAGCAAAAAATAGATGGAGAAAATGGGAAATTGAGAGTGGTTGAGAGAAAGGAATGAACACGAGGACACTGGATCGACCATTTGGTGTCCTTGTATTTTTTGCTTTAAGTATTCTCTATGTTGCGTCTGTATGTTTGGTTAGTGTTTCCTGTTCCTAAGGGGTTTAGCTTTTGGCCTATTGGTTCTCTTTTTACTATATACTAATCAAAAGATGTAAATATTGTAGGTCCACGAGACGAGTGAAGAAAGCTTGGCAGCTGTCGCAGGAATGTTTAGTAGCAAGGCCAAAGGTAACTATTTTCTTTATTTAACTTGCCTTGTTAATTCAAATTATCCCCTCTCTACGGGAACCCTGTTAGTTGTATTTTGCTTTCTTTTCCCACCATACGTCTGTTGCTACAATAAATCTCTCTTGGTCTATTATATTTTGTGCACATCTTTGTCTGGGAAGTTGTTTTTCCTCCATGTCATGTAATGAGAAGTATAAACTTTTGGGCGATAAACATTTCAGACATGCTAATCCTAATGAATTTTTAACTGCATGTAATTGATAATGCTAAACATCTAAACTAATTTATATGTTTTCTCAATCAAAATCTTTTAACTGCGTGTAATGAGGAGTATAAACTTATGGGCGAAGAATATTTCAGACATGGTAATCCTAATAAAATTGTAACTGCATGTACTTGATAATGCTAAACATCTAAACTTATTTCTATGTTTTCTCAATCAAAATTAAAGTACTCGATAAAATTCATTCTCAAATATGATATGGAGATTGTACTTTCTTACTCTAGCTAATATTTTGTTAAGGATTTAAAATTCTTCTATATATATGGATTTGTTCGGTCTAGATCAGAGTGACTCAAACTGTAGTCTACTGTTATACCTCTATTTGCCATCAGTGCTTACTATTCAGTCTTTTTTTTTAATCAGAAAAAGAAT
TCTACATGCCCTTCACTTATTATCCATGGAATCTGTTCTAGAGATTAAATTGGAATGTCAGGATATTTATTTTTTTAGTAAAAGAAACAATTTCATTGATAAATGAAAGTAGGGGAAATACTCCAAATACCAAGAGGTGATTACATCAAAGAACGCCAATTACTGACTAAAAAGGATGAGTTGAAATGACTAAAAGGGTGTTTAGTTTTACACCAAGAAAAAGCAGTAAACAAGACAGATTCCATAAAACGGTTGAAAGGAGTAAAAGAGTCATTGAAGAGCCGACTGTTTCGTTCACCCCACAATCTCGAGAAGAAAGCACGCAAAGGGCCAACCAAACTATCTCTTTTGTACCACCAAAAGGATGACCCACCAATAAAGAAGCTAGTGCATCAGAGATGGAATTAGGGCAAATGAAAGACCATCCAAAAGTAGACAAAACAAGATCCCAAAAGTTGGCAGCAAATGGACATTGCAAAAACAAGTGAACAGGGGATTCGACATGTTGTCGACACATGATGCACCAAGAAGGAGAAATACACATATAGGGAAATCGTCGCTGCAAACGATCAGCAGTGTTAATGGCACCCCAACTAAGCTCCCAAAGAAAAATTTTAATTTTCTTTGGATAGTGGTCCTTCCAAATCAACGCATACAGATCAGATAAAACAGAATCAGTAGCACCCACCAGATCAGTCATAAGTGATTTAACTGTAAAAGTCATAGATGAATCCAGAGGCCAAATCCAAGAATCAGGATAGGAGCGCAACCTGACAGAAGCCAAATGGTGTGACAAAGTAGCCCATTCTGCAATTTCCAATTCAGTAAGACTGCGACGAAGATTTAAGTTCCAGGCACTAGAGGAGGCAACCCACACATTAGCCACAGCAGTCTTCGGTTGCCAAGCAAGGTGAAAAAGCCTCGGAAACACAGTTGAAAGGACACCACAACTAAGCCAAGGATCATTCCAAAAAGAGGTAGTAGTACCATCACCAAGCCGACGATGAATACGATCAGCAACCAAATCAATGTATTGGCAAATACCTCCAAGGGGCTTTAGAAGGAGCACGAGAGATAGGAGTAGGCCACATACTATCAATAGGATAATATTTAGCCACAATAAATGTTTGCCATAAAGCATCTCGCTCGGAAAGAAACCACCAAGTCCACTTTGCAAGAAGGGAGGAATTACGGTGTTTGAAGTTTCCAATATCAAGACCTCTCGAAACTTGAGGGCGTTGAGAGATCCCCCAATTCACGTTATGCATACCACCATCCCCATGAGCACCTTCCCAAAAGAAATCACGAACCATCTTATCAAGGCAATTGATAACAGAGGAGGGAGCTTTGAACAAAGATAAATAATACGTGGGGCGATTAGATAGAGTAGCCTGAATGAGAGTGTGCCTACCACCTTTCGAGATATAAGCATATTTCCAATTATGAAGCTTGTGCTGAATTCTCTCCACCACCGGCTGCCAGAAATTGATAGAATTGGAGTTACCACGCAAAGGAAGCCCGAGATAAGTAGATGGCCAAAAACCCCTTTTGCAACCAAAAGAGTTCACCAACCATTCAAAATCTAAGTCCGGAACATGGATCCCAAGGAGCCCACTCTTTGCAAAATTAATTTTCAGCCCATAAGCAACTTCAAAGATATGGACAACATCAATCAAATTTTGAAGGGCAGGATGCTCAGCAGTGGAGAATAACGAAGTATCATATGCAAACTGTAAATGATTCAAAGAAAAAGTCGAGTTTCCAATAGGGTGTGCAACAGTGAGACCAAGCGAAGCACTATGATAAAGAAGGCGACTCAGGCAATCAACAACAATAATAAATAAGAAAGGGGAGAGAGGATCGCCTTGTCTAATACCCCTTGATGGAACAATCTTCCCCCGAGGACGACCATTAATAATAATAGAGTGATTAGCACTAGTGACACAACCTCTGATCCATGTATGCCAAAGATTACTAAAACTCTTAGCCTTGAGAATAGA
GTCAAGAAGGCCTTCTCCAAATCAAGCTTCAGAACGACCCCTGCCTTCTTGTTGGAGGACCAATCATTAATAAGCTCGTTAGCCATCAAAGAGGCATCAAGAATTTGTCTGTTTGTCACAAATGCAAGCTGATTGGCTGCAATTGTAGAAGGGAGGAGCAACTTGAGACAATTCGATAAAACACAGGCCACAATCTTATTGCACATGGAATGAGACTAATAGGTCGGTAGTCAGAAACTGATTTAGAATCCACTTTTTTGGAATCAAACAAATATAAGTCTCATTCAACTTAACATTAACAACCCCAGAGGTATAAAAATCCTGTATCAACATCATCATGTCAGATTTAATGGTGGACCATAAAAATTTGAAAAATTCAGCTGTAAAACCATCCAGACCAGGAGATTTTCTTGTGTCAAGAGAGGAGATAGCTATATACACTTCCTCTTCTGTAAAAGGATGCTCCAAATCAGTAGTTTGTGTGGAAGAAATAGGACTCCAATCAAGGTTGGTTGGAAGGAAGCGCTGATGATTATCCTTTTTGAAAAGGGTAGAGTAAAAACCCAAGAATTCAGCCTCTATGTCTTGATCAATCAATAGACTAGTACCATTCCTGGAGAGAATTTCAATGATAGAACTTTTACGACGCCTAGCAGCAATGATTCTGTGAAAAAAACGGGTATTTTCCTCCCCTTCAAGAAGCCATTTAAGCTTACATCGTTGTTGCCAATGGATATGATCTTGAACAGTCAAATTTTAAATCTGTTCTCGAAGTAATCGACGCCTGATTATCTGTTCATTGCTAAGAGGACCGCGGTCTTCCATATCATCTAGTTGTGACAGTTGTGTTACAAGAGTGGGAAGTTTTGATGCTTCAGTACGTTGTGTAGCATTCCATTCACGGATAATCAATTTCAGACCTTGTAATTTCATCATAAAACCATGTCCTAGCCAGTCAGTAATAAGGTGAAGGTTCCACCAATTCTCTACTAAAGTGCGGAAACAAGGAATTTTAAGCCATGAGTTTTCAAATCTGAAGGAACAAGGACCCGAAGCAATACTTCCCATAGAAAGAGCCAAAGGAAAGTGATCAGAGGTAATTCGATCCAGGCGTTTGAGAGCAACATTTCCAAATTTGTGGAGACAATTCTCTGTAAGAAGGAAACGAGCCAAAAGAGAGAGATATTGGGTAGGACCCGAGCTAGACCTTGTAAAGCAACCATTCGCCAAAGGAATATCATGCAATTGATAATTAGAAATCCATTGATTAAAAGTTCTCATACCGGAAGAGATTGTATTATCATGAGATTTTCATGGTTCAGATGTAGAATCAATTGTCAGTGCCAACAGTGAAGAGTTTGAGGGCTATGATGAAGATGATATACAGGAAGTTGTAAAGGCTTCAGAAGATTTTAGCACAGAATTAAAAGAATTGCTACAAGGGGAAGACATTCATGAATCTGAAGCAGGTGAAGGTTTCTCTAATGTACCCTTATCAAAGAATGTTATCCCTTAG

mRNA sequence

ATGGCTTTCATGGCTGTACTGGAATCTGATCTTCGCGCGCTGTCCGTTGAAGCTCGCCGCCGTCACCCGGCCGTCAAAGATGGAGCTGAGCACGCTATTTTGAAGCTTCGATCAATGTCAAGTCCCAGTGACATTGCAGAAAATGAAGACATATTGCGAATTTTTCTGCTGGCCTGTGAAGCCAAAACTATAAAGTTGAGTGTAATTGGGCTTTCGTCTCTACAAAAGCTCATATCACATGATGCAGTTAGTCCTTCTGCTTTGAAAGAAATACTCTTTACGCTGAAAGATCACGCTGAAGTATCAGATGAGACTATTCAGTTAAAGACACTTCAAACTATACTAATAATTTTTCAATCTCGTTTACATCCTGAAAGTGAGGAGAATATGGCTCAAGCTCTTGGTATCTGTATTCGGCTTTTAGAAAACAACAGATCTTCTGACAGTGTGCGGAATACTGCAGCGGCTACCTTTAGGCAAGCAGTAGCTCTGATTTTTGATCATGTAATTTTTGGCGAGTCGCTTCCGGCTGGAAAGTTTGGTACTGGAAGTCAACACTCTCGAACCAGTATGGTTATTTCTGATGTTGATCGTAACATCAATAGCTCAGAGACACTGAAAAATGGGTCTCTTTCTGGAGGGCCATTGTTGAAGCGAGAGAACTTGACTAAAGCTGGGAGGCTTGGGCTACAATTGCTTGAAGATCTTACAGCTCTTGCCGCAGGTGGATCTGCAACTTGGTTACGCCCGATTTCTTTCCAGCGCACCTTTGCCCTTGATATACTAGAGTTCATTTTGTCAAATTATGTTGCTGTTTTCAGGATATTAGTTCCATATGAACAGGTTTTGCGCCATCAGATATGTTCCCTTCTCATGACATCACTTCGTACAAATGCTGAGCTTGAAGGGGAAGCAGGGGAGCCTTATTTTCGACGTCTAGTCTTGCGGTCAGTTGCTCATATTATCAGACTATATAGTACATCCCTCATCACTGAATGTGAGGTTTTCCTCAGTATGTTATTGAAGGTTACTTTTCTTGATCTACCATTGTGGCATAGGATTCTTGTTCTTGAAAATTTGAGGGGTTTTTGCATGGAGGCTAGAACTTTGCAGGTTCTTTTCCAGAACTTTGATATGCATCCGAAAAACACAAATGTTGTTGAAGGCATCGTTAAATCCCTCGCTAGAGTTGTCTCCAATGTACAGGTCCACGAGACGAGTGAAGAAAGCTTGGCAGCTGTCGCAGGAATGTTTAGTAGCAAGGCCAAAGAAGATTTTAGCACAGAATTAAAAGAATTGCTACAAGGGGAAGACATTCATGAATCTGAAGCAGGTGAAGGTTTCTCTAATGTACCCTTATCAAAGAATGTTATCCCTTAG

Coding sequence (CDS)

ATGGCTTTCATGGCTGTACTGGAATCTGATCTTCGCGCGCTGTCCGTTGAAGCTCGCCGCCGTCACCCGGCCGTCAAAGATGGAGCTGAGCACGCTATTTTGAAGCTTCGATCAATGTCAAGTCCCAGTGACATTGCAGAAAATGAAGACATATTGCGAATTTTTCTGCTGGCCTGTGAAGCCAAAACTATAAAGTTGAGTGTAATTGGGCTTTCGTCTCTACAAAAGCTCATATCACATGATGCAGTTAGTCCTTCTGCTTTGAAAGAAATACTCTTTACGCTGAAAGATCACGCTGAAGTATCAGATGAGACTATTCAGTTAAAGACACTTCAAACTATACTAATAATTTTTCAATCTCGTTTACATCCTGAAAGTGAGGAGAATATGGCTCAAGCTCTTGGTATCTGTATTCGGCTTTTAGAAAACAACAGATCTTCTGACAGTGTGCGGAATACTGCAGCGGCTACCTTTAGGCAAGCAGTAGCTCTGATTTTTGATCATGTAATTTTTGGCGAGTCGCTTCCGGCTGGAAAGTTTGGTACTGGAAGTCAACACTCTCGAACCAGTATGGTTATTTCTGATGTTGATCGTAACATCAATAGCTCAGAGACACTGAAAAATGGGTCTCTTTCTGGAGGGCCATTGTTGAAGCGAGAGAACTTGACTAAAGCTGGGAGGCTTGGGCTACAATTGCTTGAAGATCTTACAGCTCTTGCCGCAGGTGGATCTGCAACTTGGTTACGCCCGATTTCTTTCCAGCGCACCTTTGCCCTTGATATACTAGAGTTCATTTTGTCAAATTATGTTGCTGTTTTCAGGATATTAGTTCCATATGAACAGGTTTTGCGCCATCAGATATGTTCCCTTCTCATGACATCACTTCGTACAAATGCTGAGCTTGAAGGGGAAGCAGGGGAGCCTTATTTTCGACGTCTAGTCTTGCGGTCAGTTGCTCATATTATCAGACTATATAGTACATCCCTCATCACTGAATGTGAGGTTTTCCTCAGTATGTTATTGAAGGTTACTTTTCTTGATCTACCATTGTGGCATAGGATTCTTGTTCTTGAAAATTTGAGGGGTTTTTGCATGGAGGCTAGAACTTTGCAGGTTCTTTTCCAGAACTTTGATATGCATCCGAAAAACACAAATGTTGTTGAAGGCATCGTTAAATCCCTCGCTAGAGTTGTCTCCAATGTACAGGTCCACGAGACGAGTGAAGAAAGCTTGGCAGCTGTCGCAGGAATGTTTAGTAGCAAGGCCAAAGAAGATTTTAGCACAGAATTAAAAGAATTGCTACAAGGGGAAGACATTCATGAATCTGAAGCAGGTGAAGGTTTCTCTAATGTACCCTTATCAAAGAATGTTATCCCTTAG

Protein sequence

MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSDIAENEDILRIFLLACEAKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQSRLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGKFGTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALAAGGSATWLRPISFQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAELEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENLRGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEESLAAVAGMFSSKAKEDFSTELKELLQGEDIHESEAGEGFSNVPLSKNVIP
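
The protein sequence above corresponds to the CDS read three bases at a time. The sketch below shows one way to check this, assuming the standard nuclear genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and only the first and last few codons of the CDS are used as input to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in
# T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last five codons of the CDS listed above.
cds_start = "ATGGCTTTCATGGCTGTACTG"
cds_end = "AATGTTATCCCTTAG"
print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS string to `translate` in place of the short fragments would give the complete polypeptide for comparison with the record.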
Homology
BLAST of HG10007167 vs. NCBI nr
Match: XP_038879344.1 (protein MON2 homolog isoform X1 [Benincasa hispida])

HSP 1 Score: 786.9 bits (2031), Expect = 9.1e-224
Identity = 418/423 (98.82%), Positives = 421/423 (99.53%), Query Frame = 0

Query: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSDIAENEDILRIFLLACE 60
           MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSDIAENEDILRIFLLACE
Sbjct: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSDIAENEDILRIFLLACE 60

Query: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQS 120
           AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDET+QLKTLQTILIIFQS
Sbjct: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETVQLKTLQTILIIFQS 120

Query: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGKF 180
           RLHPE+EENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGKF
Sbjct: 121 RLHPENEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGKF 180

Query: 181 GTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 240
           GTGSQ SRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAG+LGLQLLEDLTALA
Sbjct: 181 GTGSQSSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGKLGLQLLEDLTALA 240

Query: 241 AGGSATWLRPISFQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300
           AGGSATWLRPISFQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTN E
Sbjct: 241 AGGSATWLRPISFQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNVE 300

Query: 301 LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360
           LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL
Sbjct: 301 LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360

Query: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEESLAAVAGMFSS 420
           RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEESLAAVAGMFSS
Sbjct: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEESLAAVAGMFSS 420

Query: 421 KAK 423
           KAK
Sbjct: 421 KAK 423
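
The identity figure reported for an HSP is the number of aligned columns where query and subject carry the same residue, divided by the alignment length. The sketch below is a minimal illustration using the second row (positions 61 to 120) of the alignment above; the function name `count_identities` is hypothetical, and gap handling is simplified on the assumption that gaps appear as `-` and never count as identities.

```python
def count_identities(query_row, sbjct_row):
    """Return (identical_columns, aligned_columns) for two aligned rows."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have equal length")
    same = sum(
        1 for q, s in zip(query_row, sbjct_row) if q == s and q != "-"
    )
    return same, len(query_row)


def percent(numerator, denominator):
    """Percentage rounded to two decimals, as in the BLAST summary line."""
    return round(100.0 * numerator / denominator, 2)


# Row 61..120 of the HG10007167 vs. XP_038879344.1 alignment.
QUERY_ROW = "AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQS"
SBJCT_ROW = "AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETVQLKTLQTILIIFQS"

same, total = count_identities(QUERY_ROW, SBJCT_ROW)
print(same, total)           # identities in this row only
print(percent(418, 423))     # whole-HSP identity from the summary line
print(percent(421, 423))     # whole-HSP positives from the summary line
```

Summing the per-row counts over every row of the HSP gives the numerator of the summary line; the positives count additionally includes conservative substitutions (marked `+` in the middle line), which requires a substitution matrix and is not recomputed here.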

Match: XP_011659942.1 (protein MON2 homolog isoform X1 [Cucumis sativus] >KAE8653396.1 hypothetical protein Csa_007592 [Cucumis sativus])

HSP 1 Score: 773.9 bits (1997), Expect = 8.0e-220
Identity = 412/423 (97.40%), Positives = 417/423 (98.58%), Query Frame = 0

Query: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSDIAENEDILRIFLLACE 60
           MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLR+MS PSDIAENEDILRIFLLACE
Sbjct: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRTMSCPSDIAENEDILRIFLLACE 60

Query: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQS 120
           AKTIKLSVIGLSSLQKLISHDAV+PSALKEIL TLKDHAEVSDET+QLKTLQTILIIFQS
Sbjct: 61  AKTIKLSVIGLSSLQKLISHDAVTPSALKEILLTLKDHAEVSDETVQLKTLQTILIIFQS 120

Query: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGKF 180
           RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVI GESLPAGKF
Sbjct: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVILGESLPAGKF 180

Query: 181 GTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 240
           GTGSQ+SRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLT+AGRLGLQLLEDLTALA
Sbjct: 181 GTGSQNSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTRAGRLGLQLLEDLTALA 240

Query: 241 AGGSATWLRPISFQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300
           AGGSATWLR IS QRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTN E
Sbjct: 241 AGGSATWLRSISSQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNVE 300

Query: 301 LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360
           LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL
Sbjct: 301 LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360

Query: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEESLAAVAGMFSS 420
           RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEESLAAVAGMFSS
Sbjct: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEESLAAVAGMFSS 420

Query: 421 KAK 423
           KAK
Sbjct: 421 KAK 423

Match: XP_008450757.1 (PREDICTED: LOW QUALITY PROTEIN: protein MON2 homolog [Cucumis melo])

HSP 1 Score: 767.7 bits (1981), Expect = 5.7e-218
Identity = 409/423 (96.69%), Positives = 417/423 (98.58%), Query Frame = 0

Query: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSDIAENEDILRIFLLACE 60
           MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLR+MS PSDIAENEDILRIFLLACE
Sbjct: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRTMSCPSDIAENEDILRIFLLACE 60

Query: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQS 120
           AKTIKLSVIGLSSLQKLISHDAVSP+ALKEIL TLKDHAEVSDET+QLKTLQTILIIFQS
Sbjct: 61  AKTIKLSVIGLSSLQKLISHDAVSPNALKEILLTLKDHAEVSDETVQLKTLQTILIIFQS 120

Query: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGKF 180
           RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVI GESLPAGKF
Sbjct: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVILGESLPAGKF 180

Query: 181 GTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 240
           GTGSQ+SRTSMVISDVDR+INSSETLKNGSLSGG LLKRENLT+AGRLGL+LLEDLTALA
Sbjct: 181 GTGSQNSRTSMVISDVDRSINSSETLKNGSLSGGQLLKRENLTRAGRLGLKLLEDLTALA 240

Query: 241 AGGSATWLRPISFQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300
           AGGSATWLR IS QRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE
Sbjct: 241 AGGSATWLRSISSQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300

Query: 301 LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360
           LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL
Sbjct: 301 LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360

Query: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEESLAAVAGMFSS 420
           RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEE+LAAVAGMFSS
Sbjct: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEENLAAVAGMFSS 420

Query: 421 KAK 424
           KAK
Sbjct: 421 KAK 423

BLAST of HG10007167 vs. NCBI nr
Match: KAG7020887.1 (Protein MON2-like protein [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 756.5 bits (1952), Expect = 1.3e-214
Identity = 404/423 (95.51%), Positives = 411/423 (97.16%), Query Frame = 0

Query: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSDIAENEDILRIFLLACE 60
           MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSM SPSDIAENEDILRIFLLACE
Sbjct: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMMSPSDIAENEDILRIFLLACE 60

Query: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQS 120
           AKTIKLSVIGLSSLQKLISHDAVSPSALKEIL TLKDHAE+SDET+QLKTLQTILIIFQS
Sbjct: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILSTLKDHAEISDETVQLKTLQTILIIFQS 120

Query: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGKF 180
           RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIF ESLPAGKF
Sbjct: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFAESLPAGKF 180

Query: 181 GTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 240
           GTGS  SRT MV +DVD NINSSET+ NGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA
Sbjct: 181 GTGSLTSRT-MVTADVDHNINSSETMNNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 240

Query: 241 AGGSATWLRPISFQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300
           A GSATWLRPISFQR+FALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE
Sbjct: 241 ADGSATWLRPISFQRSFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300

Query: 301 LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360
           +EGEAGEPYFRR+VLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL
Sbjct: 301 IEGEAGEPYFRRIVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360

Query: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEESLAAVAGMFSS 420
           RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHE SEESL AVAGMFSS
Sbjct: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHEMSEESLVAVAGMFSS 420

Query: 421 KAK 424
           KAK
Sbjct: 421 KAK 422

BLAST of HG10007167 vs. NCBI nr
Match: KAG6588324.1 (Protein MON2-like protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 756.5 bits (1952), Expect = 1.3e-214
Identity = 404/423 (95.51%), Positives = 411/423 (97.16%), Query Frame = 0

Query: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSDIAENEDILRIFLLACE 60
           MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSM SPSDIAENEDILRIFLLACE
Sbjct: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMMSPSDIAENEDILRIFLLACE 60

Query: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQS 120
           AKTIKLSVIGLSSLQKLISHDAVSPSALKEIL TLKDHAE+SDET+QLKTLQTILIIFQS
Sbjct: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILSTLKDHAEISDETVQLKTLQTILIIFQS 120

Query: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGKF 180
           RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIF ESLPAGKF
Sbjct: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFAESLPAGKF 180

Query: 181 GTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 240
           GTGS  SRT MV +DVD NINSSET+ NGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA
Sbjct: 181 GTGSLTSRT-MVTADVDHNINSSETMNNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 240

Query: 241 AGGSATWLRPISFQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300
           A GSATWLRPISFQR+FALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE
Sbjct: 241 ADGSATWLRPISFQRSFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300

Query: 301 LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360
           +EGEAGEPYFRR+VLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL
Sbjct: 301 IEGEAGEPYFRRIVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360

Query: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEESLAAVAGMFSS 420
           RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHE SEESL AVAGMFSS
Sbjct: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHEMSEESLVAVAGMFSS 420

Query: 421 KAK 424
           KAK
Sbjct: 421 KAK 422

BLAST of HG10007167 vs. ExPASy Swiss-Prot
Match: Q7Z3U7 (Protein MON2 homolog OS=Homo sapiens OX=9606 GN=MON2 PE=1 SV=3)

HSP 1 Score: 147.9 bits (372), Expect = 2.8e-34
Identity = 113/413 (27.36%), Positives = 214/413 (51.82%), Query Frame = 0

Query: 7   LESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSD-----IAEN-EDILRIFLLACE 66
           ++SDLRALS+E +++ P VK+ AE  I+K++++++ +      + EN  ++++ FL+ C 
Sbjct: 17  MQSDLRALSLECKKKFPPVKEAAESGIIKVKTIAARNTEILAALKENSSEVVQPFLMGCG 76

Query: 67  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQS 126
            K  K++ + L+++Q+L+SH+ VS +A   I+  L    E S E  +LK LQT+L++  +
Sbjct: 77  TKEPKITQLCLAAIQRLMSHEVVSETAAGNIINMLWQLMENSLE--ELKLLQTVLVLLTT 136

Query: 127 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGKF 186
                 +E +++A+ +C RL  +    +   NTAAAT RQ V ++F+ ++  +       
Sbjct: 137 NT-VVHDEALSKAIVLCFRL--HFTKDNITNNTAAATVRQVVTVVFERMVAED------- 196

Query: 187 GTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 246
                         +  R+I     L  G+ +   +     L    +    L +DL  L 
Sbjct: 197 --------------ERHRDIIEQPVLVQGNSNRRSV---STLKPCAKDAYMLFQDLCQLV 256

Query: 247 AGGSATWLRPIS-FQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNA 306
              +  WL  ++   RTF L++LE +L+++  VF     +  +L+ ++C L++     N 
Sbjct: 257 NADAPYWLVGMTEMTRTFGLELLESVLNDFPQVFLQHQEFSFLLKERVCPLVIKLFSPNI 316

Query: 307 ELE---------GEAGEPYFR--RLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDL 366
           +               +PYF     +LR V+ +I+ + + L+TECE+FLS+L+K    D 
Sbjct: 317 KFRQGSSTSSSPAPVEKPYFPICMRLLRVVSVLIKQFYSLLVTECEIFLSLLVKFLDADK 376

Query: 367 PLWHRILVLENLRGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNV 402
           P W R + +E++  FC++ + L+   Q++DM   +T V   IV +L   + ++
Sbjct: 377 PQWLRAVAVESIHRFCVQPQLLRSFCQSYDMKQHSTKVFRDIVNALGSFIQSL 400

BLAST of HG10007167 vs. ExPASy Swiss-Prot
Match: Q80TL7 (Protein MON2 homolog OS=Mus musculus OX=10090 GN=Mon2 PE=1 SV=2)

HSP 1 Score: 143.7 bits (361), Expect = 5.3e-33
Identity = 117/429 (27.27%), Positives = 217/429 (50.58%), Query Frame = 0

Query: 7   LESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSD-----IAEN-EDILRIFLLACE 66
           ++SDLRALS+E +++ P VK+ AE  I+K++++++ +      + EN  ++++ FL+ C 
Sbjct: 17  MQSDLRALSLECKKKFPPVKEAAESGIIKVKTIAARNTEILAALKENSSEVVQPFLMGCG 76

Query: 67  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQS 126
            K  K++ + L+++Q+L+SH+ VS +A   I+  L    E S E  +LK LQT+L++  +
Sbjct: 77  TKEPKITQLCLAAIQRLMSHEVVSETAAGNIINMLWQLMENSLE--ELKLLQTVLVLLTT 136

Query: 127 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGKF 186
                 +E +++A+ +C RL  +    +   NTAAAT RQ V ++F+ ++  +       
Sbjct: 137 NT-VVHDEALSKAIVLCFRL--HFTKDNITNNTAAATVRQVVTVVFERMVAED------- 196

Query: 187 GTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 246
                         D  R+I     ++  S           L    +    L +DL  L 
Sbjct: 197 --------------DRHRDIEPPVPIQGNSNRRSV----STLRPCAKDAYMLFQDLCQLV 256

Query: 247 AGGSATWLRPIS-FQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNA 306
              +  WL  ++   RTF L++LE +L+++  VF     +  +L+ ++C L++     N 
Sbjct: 257 NADAPYWLVGMTEMTRTFGLELLESVLNDFPQVFLQHQEFSFLLKERVCPLVIKLFSPNI 316

Query: 307 ELE---------GEAGEPYFR--RLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDL 366
           +               +PYF     +LR V+ +I+ + + L+TECE+FLS+L+K    D 
Sbjct: 317 KFRQGSSTSSSPAPVEKPYFPICMRLLRVVSVLIKQFYSLLVTECEIFLSLLVKFLDSDK 376

Query: 367 PLWHRILVLENLRGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNV-QVHETS 417
           P W R + +E++   C++ + L+   Q++DM   +T V   IV +L   + ++  V  T 
Sbjct: 377 PQWLRAVAVESIHRLCVQPQLLRSFCQSYDMKQHSTKVFRDIVNALGSFIQSLFLVPPTG 415

BLAST of HG10007167 vs. ExPASy Swiss-Prot
Match: Q6GP04 (Protein MON2 homolog OS=Xenopus laevis OX=8355 GN=mon2 PE=2 SV=1)

HSP 1 Score: 142.9 bits (359), Expect = 9.0e-33
Identity = 112/413 (27.12%), Positives = 213/413 (51.57%), Query Frame = 0

Query: 7   LESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSD-----IAEN-EDILRIFLLACE 66
           ++SDLR LS+E +++ P VK+ AE  I+K++++++ S      + EN  ++++ FL+ C 
Sbjct: 20  MQSDLRGLSMECKKKFPPVKEAAESGIVKVKNIAARSPDVLTALKENSSEVVQPFLMGCG 79

Query: 67  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQS 126
            K  K++ + L+++Q+L+SH+ VS  A   I+  L    E   E  +LK LQT+L++  +
Sbjct: 80  TKEQKITQLCLAAIQRLMSHEVVSEGAAGNIINMLWQLMENGLE--ELKLLQTVLVLLTT 139

Query: 127 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGKF 186
                 +E++++A+ +C RL  +    +   NTAAAT RQ V ++F+ ++          
Sbjct: 140 NT-VVHDESLSKAIVLCFRL--HFTKDNITNNTAAATVRQVVTVVFERMV---------- 199

Query: 187 GTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 246
            T  +  + +     V++ I  +      S+S         L    +    L +DL  L 
Sbjct: 200 -TEDERHKDA-----VEQPIPVTGNSNRRSVS--------TLKPCAKDAYMLFQDLCQLV 259

Query: 247 AGGSATWLRPIS-FQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNA 306
              +  WL  ++   RTF L++LE +L+++  VF     +  +L+ ++C L++     N 
Sbjct: 260 NADAPYWLVGMTEMTRTFGLELLESVLNDFPQVFLQHQEFSFLLKERVCPLVIKLFSPNI 319

Query: 307 ELE---------GEAGEPYFR--RLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDL 366
           +               +PYF     +LR V+ +I+ + + L+TECE+FLS+L+K    D 
Sbjct: 320 KFRQGSNSNSSPAPVEKPYFPICMRLLRVVSVLIKQFYSLLVTECEIFLSLLVKFLDADK 379

Query: 367 PLWHRILVLENLRGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNV 402
           P W R + +E++   C++ + L+   Q++DM   +T V   IV +L   + ++
Sbjct: 380 PQWLRAVAVESIHRLCVQPQLLRSFCQSYDMKQHSTKVFRDIVNALGSFIQSL 403

BLAST of HG10007167 vs. ExPASy Swiss-Prot
Match: Q29L43 (Protein MON2 homolog OS=Drosophila pseudoobscura pseudoobscura OX=46245 GN=mon2 PE=3 SV=2)

HSP 1 Score: 127.9 bits (320), Expect = 3.0e-28
Identity = 105/414 (25.36%), Positives = 194/414 (46.86%), Query Frame = 0

Query: 3   FMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSP---SDIAENEDILRIFLLAC 62
           F+  L++D + LS+E ++++P +K+  E AI KL +  S    S       IL   +  C
Sbjct: 15  FVEALQADFKTLSLETKKKYPQIKEACEEAISKLCTAGSSQQNSVYYTVNQILYPLVQGC 74

Query: 63  EAKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQ 122
           E K +K+    L  +Q+LI+   V       I   L    E + E +++    T+L+   
Sbjct: 75  ETKDLKIIKFCLGMMQRLITQQVVDQKGALYITNALWTLMENNIEEVKVLQTVTLLLTTN 134

Query: 123 SRLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHV-IFGESLPAG 182
           + +H ++   +A+AL +C RL  +   + ++ NTA AT RQ V+L+F+ V +  +S+P+ 
Sbjct: 135 TVVHGDT---LAKALVLCFRL--HYTKNPTIVNTAGATIRQLVSLVFERVYLEKDSVPS- 194

Query: 183 KFGTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGRLGLQLLEDLTA 242
                            + +      +  NG +        +++         L +DL  
Sbjct: 195 -----------------LQQQQQQPSSSSNGPVEADGATAGQDVQTFASDAFLLFQDLVQ 254

Query: 243 LAAGGSATWLRPIS-FQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRT 302
           L       WL  ++   RTF L++LE +L+N+ AVF     +  +L+ ++C+L++     
Sbjct: 255 LVNAEQPYWLVGMTEMTRTFGLELLEAVLTNFSAVFHESNDFRLLLKERVCALVIKLFSP 314

Query: 303 NAE-----------LEGEAGEPYF--RRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVT 362
           N +               A +PYF     +LR V+ +I+ Y T L+TECE+FLS+++K  
Sbjct: 315 NVKHRQLPAPNNGTAPVPAEKPYFPISMRLLRLVSILIQKYHTILVTECEIFLSLIIKFL 374

Query: 363 FLDLPLWHRILVLENLRGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVV 399
             D P W R L +E +      +  +    +++D+    TN+V  ++ ++   V
Sbjct: 375 DPDKPAWQRALAVEVIHKLVTRSSLIAFFCKSYDLKNHATNIVHDMIAAMGSFV 405

BLAST of HG10007167 vs. ExPASy Swiss-Prot
Match: Q9VLT1 (Protein MON2 homolog OS=Drosophila melanogaster OX=7227 GN=mon2 PE=2 SV=4)

HSP 1 Score: 127.1 bits (318), Expect = 5.1e-28
Identity = 109/413 (26.39%), Positives = 193/413 (46.73%), Query Frame = 0

Query: 3   FMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSP---SDIAENEDILRIFLLAC 62
           F+  L++D + LS+E ++++P +K+  E AI KL +  S    S       IL   +  C
Sbjct: 17  FVEALQADFKTLSLETKKKYPQIKEACEEAISKLCTAGSSQQNSVYYTVNQILYPLVQGC 76

Query: 63  EAKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQ 122
           E K +K+    L  +Q+LI+   V       I   L    E + E +++    T+L+   
Sbjct: 77  ETKDLKIIKFCLGMMQRLITQQVVDQKGALYITNALWTLMENNIEEVKVLQTVTLLLTTN 136

Query: 123 SRLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGK 182
           + +H ++   +A+AL +C RL  +   + ++ NTA AT RQ V+L+F+ V          
Sbjct: 137 TVVHGDT---LAKALVLCFRL--HYAKNPTIVNTAGATIRQLVSLVFERVYL-------- 196

Query: 183 FGTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGRLGLQLLEDLTAL 242
                          + D   +  +   +GS + G    ++  T A    L L +DL  L
Sbjct: 197 ---------------EKDSVSSLQQQQSSGSPAEGEGGNQDVQTFASDAFL-LFQDLVQL 256

Query: 243 AAGGSATWLRPIS-FQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLR-- 302
                  WL  ++   RTF L++LE +L+N+ AVF     +  +L+ ++C+L++      
Sbjct: 257 VNADQPYWLLGMTEMTRTFGLELLEAVLTNFSAVFHESNDFRLLLKERVCALVIKLFSPN 316

Query: 303 ---------TNAELEGEAGEPYF--RRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTF 362
                    +N      A +PYF     +LR VA +I+ Y T L+TECE+FLS+++K   
Sbjct: 317 VKHRQLPAPSNGNAPVPAEKPYFPISMRLLRLVAILIQKYHTILVTECEIFLSLIIKFLD 376

Query: 363 LDLPLWHRILVLENLRGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVV 399
            D P W R L LE +      +  +    +++D+    TN+V  ++ ++   +
Sbjct: 377 PDKPAWQRALALEVIHKLVTRSSLIAFFCKSYDLKNHATNIVHDMIAAMGSYI 400

BLAST of HG10007167 vs. ExPASy TrEMBL
Match: A0A1S3BPZ4 (LOW QUALITY PROTEIN: protein MON2 homolog OS=Cucumis melo OX=3656 GN=LOC103492245 PE=4 SV=1)

HSP 1 Score: 767.7 bits (1981), Expect = 2.8e-218
Identity = 409/423 (96.69%), Positives = 417/423 (98.58%), Query Frame = 0

Query: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSDIAENEDILRIFLLACE 60
           MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLR+MS PSDIAENEDILRIFLLACE
Sbjct: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRTMSCPSDIAENEDILRIFLLACE 60

Query: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQS 120
           AKTIKLSVIGLSSLQKLISHDAVSP+ALKEIL TLKDHAEVSDET+QLKTLQTILIIFQS
Sbjct: 61  AKTIKLSVIGLSSLQKLISHDAVSPNALKEILLTLKDHAEVSDETVQLKTLQTILIIFQS 120

Query: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGKF 180
           RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVI GESLPAGKF
Sbjct: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVILGESLPAGKF 180

Query: 181 GTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 240
           GTGSQ+SRTSMVISDVDR+INSSETLKNGSLSGG LLKRENLT+AGRLGL+LLEDLTALA
Sbjct: 181 GTGSQNSRTSMVISDVDRSINSSETLKNGSLSGGQLLKRENLTRAGRLGLKLLEDLTALA 240

Query: 241 AGGSATWLRPISFQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300
           AGGSATWLR IS QRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE
Sbjct: 241 AGGSATWLRSISSQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300

Query: 301 LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360
           LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL
Sbjct: 301 LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360

Query: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEESLAAVAGMFSS 420
           RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEE+LAAVAGMFSS
Sbjct: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEENLAAVAGMFSS 420

Query: 421 KAK 424
           KAK
Sbjct: 421 KAK 423

BLAST of HG10007167 vs. ExPASy TrEMBL
Match: A0A6J1HQ18 (protein MON2 homolog isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111465638 PE=4 SV=1)

HSP 1 Score: 755.0 bits (1948), Expect = 1.9e-214
Identity = 403/423 (95.27%), Positives = 411/423 (97.16%), Query Frame = 0

Query: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSDIAENEDILRIFLLACE 60
           MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSM SPSDIAENEDILRIFLLACE
Sbjct: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMMSPSDIAENEDILRIFLLACE 60

Query: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQS 120
           AKTIKLSVIGLSSLQKLISHDAVSPSALKEIL TLKDHAE+SDET+QLKTLQTILIIFQS
Sbjct: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILSTLKDHAEISDETVQLKTLQTILIIFQS 120

Query: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGKF 180
           RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIF ESLPAGKF
Sbjct: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFAESLPAGKF 180

Query: 181 GTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 240
           GTGS  SR +MV +DVD NINSSET+ NGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA
Sbjct: 181 GTGSLTSR-AMVTADVDHNINSSETMNNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 240

Query: 241 AGGSATWLRPISFQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300
           A GSATWLRPISFQR+FALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE
Sbjct: 241 ADGSATWLRPISFQRSFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300

Query: 301 LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360
           +EGEAGEPYFRR+VLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL
Sbjct: 301 IEGEAGEPYFRRIVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360

Query: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEESLAAVAGMFSS 420
           RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHE SEESL AVAGMFSS
Sbjct: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHEMSEESLVAVAGMFSS 420

Query: 421 KAK 424
           KAK
Sbjct: 421 KAK 422

BLAST of HG10007167 vs. ExPASy TrEMBL
Match: A0A6J1HLJ3 (protein MON2 homolog isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111465638 PE=4 SV=1)

HSP 1 Score: 755.0 bits (1948), Expect = 1.9e-214
Identity = 403/423 (95.27%), Positives = 411/423 (97.16%), Query Frame = 0

Query: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSDIAENEDILRIFLLACE 60
           MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSM SPSDIAENEDILRIFLLACE
Sbjct: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMMSPSDIAENEDILRIFLLACE 60

Query: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQS 120
           AKTIKLSVIGLSSLQKLISHDAVSPSALKEIL TLKDHAE+SDET+QLKTLQTILIIFQS
Sbjct: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILSTLKDHAEISDETVQLKTLQTILIIFQS 120

Query: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGKF 180
           RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIF ESLPAGKF
Sbjct: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFAESLPAGKF 180

Query: 181 GTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 240
           GTGS  SR +MV +DVD NINSSET+ NGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA
Sbjct: 181 GTGSLTSR-AMVTADVDHNINSSETMNNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 240

Query: 241 AGGSATWLRPISFQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300
           A GSATWLRPISFQR+FALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE
Sbjct: 241 ADGSATWLRPISFQRSFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300

Query: 301 LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360
           +EGEAGEPYFRR+VLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL
Sbjct: 301 IEGEAGEPYFRRIVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360

Query: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEESLAAVAGMFSS 420
           RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHE SEESL AVAGMFSS
Sbjct: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHEMSEESLVAVAGMFSS 420

Query: 421 KAK 424
           KAK
Sbjct: 421 KAK 422

BLAST of HG10007167 vs. ExPASy TrEMBL
Match: A0A6J1EX82 (protein MON2 homolog isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439021 PE=4 SV=1)

HSP 1 Score: 753.8 bits (1945), Expect = 4.1e-214
Identity = 403/423 (95.27%), Positives = 410/423 (96.93%), Query Frame = 0

Query: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSDIAENEDILRIFLLACE 60
           MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSM SPSDIAENEDILRIFLLACE
Sbjct: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMMSPSDIAENEDILRIFLLACE 60

Query: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQS 120
           AKTIKLSVIGLSSLQKLISHDAVSPSALKEIL TLKDHAE+SD T+QLKTLQTILIIFQS
Sbjct: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILSTLKDHAEISDGTVQLKTLQTILIIFQS 120

Query: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGKF 180
           RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIF ESLPAGKF
Sbjct: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFAESLPAGKF 180

Query: 181 GTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 240
           GTGS  SRT MV +DVD NINSSET+ NGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA
Sbjct: 181 GTGSLTSRT-MVTADVDHNINSSETMNNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 240

Query: 241 AGGSATWLRPISFQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300
           A GSATWLRPISFQR+FALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE
Sbjct: 241 ADGSATWLRPISFQRSFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300

Query: 301 LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360
           +EGEAGEPYFRR+VLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL
Sbjct: 301 IEGEAGEPYFRRIVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360

Query: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEESLAAVAGMFSS 420
           RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHE SEESL AVAGMFSS
Sbjct: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHEMSEESLVAVAGMFSS 420

Query: 421 KAK 424
           KAK
Sbjct: 421 KAK 422

BLAST of HG10007167 vs. ExPASy TrEMBL
Match: A0A6J1EWL3 (protein MON2 homolog isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111439021 PE=4 SV=1)

HSP 1 Score: 753.8 bits (1945), Expect = 4.1e-214
Identity = 403/423 (95.27%), Positives = 410/423 (96.93%), Query Frame = 0

Query: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSDIAENEDILRIFLLACE 60
           MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSM SPSDIAENEDILRIFLLACE
Sbjct: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMMSPSDIAENEDILRIFLLACE 60

Query: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQS 120
           AKTIKLSVIGLSSLQKLISHDAVSPSALKEIL TLKDHAE+SD T+QLKTLQTILIIFQS
Sbjct: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILSTLKDHAEISDGTVQLKTLQTILIIFQS 120

Query: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGKF 180
           RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIF ESLPAGKF
Sbjct: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFAESLPAGKF 180

Query: 181 GTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 240
           GTGS  SRT MV +DVD NINSSET+ NGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA
Sbjct: 181 GTGSLTSRT-MVTADVDHNINSSETMNNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 240

Query: 241 AGGSATWLRPISFQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300
           A GSATWLRPISFQR+FALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE
Sbjct: 241 ADGSATWLRPISFQRSFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300

Query: 301 LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360
           +EGEAGEPYFRR+VLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL
Sbjct: 301 IEGEAGEPYFRRIVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360

Query: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETSEESLAAVAGMFSS 420
           RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHE SEESL AVAGMFSS
Sbjct: 361 RGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHEMSEESLVAVAGMFSS 420

Query: 421 KAK 424
           KAK
Sbjct: 421 KAK 422

BLAST of HG10007167 vs. TAIR 10
Match: AT5G27970.1 (ARM repeat superfamily protein )

HSP 1 Score: 582.4 bits (1500), Expect = 3.2e-166
Identity = 314/436 (72.02%), Positives = 364/436 (83.49%), Query Frame = 0

Query: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSDIAENEDILRIFLLACE 60
           MA +A LE+DLRALS EARRR+PAVKDGAEHAILKLRS SS SD++ NEDILRIFL+AC 
Sbjct: 1   MALVAALEADLRALSAEARRRYPAVKDGAEHAILKLRSSSSASDLSSNEDILRIFLMACG 60

Query: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKD-------------HAEVSDETIQ 120
            +  KLSVIGLS LQKLISHDAV PS+LKEIL+TLKD             H+E+++E IQ
Sbjct: 61  VRNTKLSVIGLSCLQKLISHDAVEPSSLKEILYTLKDAKQLSDAVFPYLQHSEMAEENIQ 120

Query: 121 LKTLQTILIIFQSRLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFD 180
           LKTLQTILIIFQSRLHPE+E+NM   L IC+ LL+NNR   SV NTAAATFRQAVALIFD
Sbjct: 121 LKTLQTILIIFQSRLHPETEDNMVLGLSICLTLLDNNR-PPSVYNTAAATFRQAVALIFD 180

Query: 181 HVIFGESLPAGKFGTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGR 240
            V+  ESLP  KFG+ SQ +RT  V  D+ +NIN+S  L+   + GG L  R+ L++ G+
Sbjct: 181 QVVSAESLPMPKFGSSSQTARTGSVTGDLSQNINNSGPLEK-DVIGGRLTIRDTLSETGK 240

Query: 241 LGLQLLEDLTALAAGGSATWLRPISFQRTFALDILEFILSNYVAVFRILVPYEQVLRHQI 300
           LGL+LLEDLTA AAGGSA WL   S  RTF+L+++EF+LSNY++VF+IL+PYEQVLRHQI
Sbjct: 241 LGLRLLEDLTASAAGGSAAWLHVTSLPRTFSLELIEFVLSNYISVFKILLPYEQVLRHQI 300

Query: 301 CSLLMTSLRTNAELEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLD 360
           CSLLMTSLRT++ELEGE  EPYFRRLVLRSVAHIIRLYS+SLITECEVFLSML+K TFLD
Sbjct: 301 CSLLMTSLRTSSELEGEMVEPYFRRLVLRSVAHIIRLYSSSLITECEVFLSMLVKATFLD 360

Query: 361 LPLWHRILVLENLRGFCMEARTLQVLFQNFDMHPKNTNVVEGIVKSLARVVSNVQVHETS 420
           LPLWHRILVLE LRGFC+EARTL++LFQNFDMHPKNTNVVE +VK+LARVVS++Q  ETS
Sbjct: 361 LPLWHRILVLEILRGFCVEARTLRILFQNFDMHPKNTNVVESMVKALARVVSSIQFQETS 420

Query: 421 EESLAAVAGMFSSKAK 424
           EESLAAVAGMFSSKAK
Sbjct: 421 EESLAAVAGMFSSKAK 434

BLAST of HG10007167 vs. TAIR 10
Match: AT5G27970.2 (ARM repeat superfamily protein )

HSP 1 Score: 582.0 bits (1499), Expect = 4.1e-166
Identity = 314/437 (71.85%), Positives = 364/437 (83.30%), Query Frame = 0

Query: 1   MAFMAVLESDLRALSVEARRRHPAVKDGAEHAILKLRSMSSPSDIAENEDILRIFLLACE 60
           MA +A LE+DLRALS EARRR+PAVKDGAEHAILKLRS SS SD++ NEDILRIFL+AC 
Sbjct: 1   MALVAALEADLRALSAEARRRYPAVKDGAEHAILKLRSSSSASDLSSNEDILRIFLMACG 60

Query: 61  AKTIKLSVIGLSSLQKLISHDAVSPSALKEILFTLKDHAEVSDETIQLKTLQTILIIFQS 120
            +  KLSVIGLS LQKLISHDAV PS+LKEIL+TLKDH+E+++E IQLKTLQTILIIFQS
Sbjct: 61  VRNTKLSVIGLSCLQKLISHDAVEPSSLKEILYTLKDHSEMAEENIQLKTLQTILIIFQS 120

Query: 121 RLHPESEENMAQALGICIRLLENNRSSDSVRNTAAATFRQAVALIFDHVIFGESLPAGKF 180
           RLHPE+E+NM   L IC+ LL+NNR   SV NTAAATFRQAVALIFD V+  ESLP  KF
Sbjct: 121 RLHPETEDNMVLGLSICLTLLDNNR-PPSVYNTAAATFRQAVALIFDQVVSAESLPMPKF 180

Query: 181 GTGSQHSRTSMVISDVDRNINSSETLKNGSLSGGPLLKRENLTKAGRLGLQLLEDLTALA 240
           G+ SQ +RT  V  D+ +NIN+S  L+   + GG L  R+ L++ G+LGL+LLEDLTA A
Sbjct: 181 GSSSQTARTGSVTGDLSQNINNSGPLEK-DVIGGRLTIRDTLSETGKLGLRLLEDLTASA 240

Query: 241 AGGSATWLRPISFQRTFALDILEFILSNYVAVFRILVPYEQVLRHQICSLLMTSLRTNAE 300
           AGGSA WL   S  RTF+L+++EF+LSNY++VF+IL+PYEQVLRHQICSLLMTSLRT++E
Sbjct: 241 AGGSAAWLHVTSLPRTFSLELIEFVLSNYISVFKILLPYEQVLRHQICSLLMTSLRTSSE 300

Query: 301 LEGEAGEPYFRRLVLRSVAHIIRLYSTSLITECEVFLSMLLKVTFLDLPLWHRILVLENL 360
           LEGE  EPYFRRLVLRSVAHIIRLYS+SLITECEVFLSML+K TFLDLPLWHRILVLE L
Sbjct: 301 LEGEMVEPYFRRLVLRSVAHIIRLYSSSLITECEVFLSMLVKATFLDLPLWHRILVLEIL 360

Query: 361 RGFCMEARTLQVLFQNFDM--------------HPKNTNVVEGIVKSLARVVSNVQVHET 420
           RGFC+EARTL++LFQNFDM              HPKNTNVVE +VK+LARVVS++Q  ET
Sbjct: 361 RGFCVEARTLRILFQNFDMKLPSRSFFTLQLKKHPKNTNVVESMVKALARVVSSIQFQET 420

Query: 421 SEESLAAVAGMFSSKAK 424
           SEESLAAVAGMFSSKAK
Sbjct: 421 SEESLAAVAGMFSSKAK 435
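
The Identity and Positives percentages in the HSP headers above are the matched counts divided by the alignment length (for example, 409/423 gives 96.69%). The sketch below is an illustrative way to recompute them from a header line; the function names and the regular expression are assumptions made for this example and are not part of the database or of any BLAST tool.

```python
import re

# Matches the "Identity = a/b (...), Positives = c/d (...)" header line.
# "Posi?tives" also accepts the "Postives" spelling seen in some exports.
HSP_HEADER = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage rounded to 2 places."""
    if aligned_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / aligned_length, 2)


def hsp_percentages(header_line):
    """Recompute (identity %, positives %) from an HSP header line."""
    m = HSP_HEADER.search(header_line)
    if m is None:
        raise ValueError("not an HSP identity line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return percent(ident, ident_len), percent(pos, pos_len)


if __name__ == "__main__":
    line = "Identity = 314/436 (72.02%), Positives = 364/436 (83.49%), Query Frame = 0"
    print(hsp_percentages(line))
```

Recomputing the values this way is a quick consistency check when alignment records are copied between tools, since the counts and the printed percentages should always agree.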

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038879344.1  9.1e-224  98.82     protein MON2 homolog isoform X1 [Benincasa hispida]
XP_011659942.1  8.0e-220  97.40     protein MON2 homolog isoform X1 [Cucumis sativus] >KAE8653396.1 hypothetical pro...
XP_008450757.1  5.7e-218  96.69     PREDICTED: LOW QUALITY PROTEIN: protein MON2 homolog [Cucumis melo]
KAG7020887.1    1.3e-214  95.51     Protein MON2-like protein [Cucurbita argyrosperma subsp. argyrosperma]
KAG6588324.1    1.3e-214  95.51     Protein MON2-like protein, partial [Cucurbita argyrosperma subsp. sororia]

Match Name  E-value  Identity  Description
Q7Z3U7      2.8e-34  27.36     Protein MON2 homolog OS=Homo sapiens OX=9606 GN=MON2 PE=1 SV=3
Q80TL7      5.3e-33  27.27     Protein MON2 homolog OS=Mus musculus OX=10090 GN=Mon2 PE=1 SV=2
Q6GP04      9.0e-33  27.12     Protein MON2 homolog OS=Xenopus laevis OX=8355 GN=mon2 PE=2 SV=1
Q29L43      3.0e-28  25.36     Protein MON2 homolog OS=Drosophila pseudoobscura pseudoobscura OX=46245 GN=mon2 ...
Q9VLT1      5.1e-28  26.39     Protein MON2 homolog OS=Drosophila melanogaster OX=7227 GN=mon2 PE=2 SV=4

Match Name  E-value   Identity  Description
A0A1S3BPZ4  2.8e-218  96.69     LOW QUALITY PROTEIN: protein MON2 homolog OS=Cucumis melo OX=3656 GN=LOC10349224...
A0A6J1HQ18  1.9e-214  95.27     protein MON2 homolog isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111465638 PE=4...
A0A6J1HLJ3  1.9e-214  95.27     protein MON2 homolog isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111465638 PE=4...
A0A6J1EX82  4.1e-214  95.27     protein MON2 homolog isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439021 PE...
A0A6J1EWL3  4.1e-214  95.27     protein MON2 homolog isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111439021 PE...

Match Name   E-value   Identity  Description
AT5G27970.1  3.2e-166  72.02     ARM repeat superfamily protein
AT5G27970.2  4.1e-166  71.85     ARM repeat superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                    Source       Source Term      Source Description                 Alignment
IPR032629  Mon2, dimerisation and cyclophilin-binding domain  PFAM         PF16213          DCB                                coord: 3..169; e-value: 6.8E-39; score: 133.3
IPR032691  Guanine nucleotide exchange factor, N-terminal     PFAM         PF12783          Sec7_N                             coord: 230..386; e-value: 3.3E-32; score: 111.6
None       No IPR available                                   PANTHER      PTHR10663        GUANYL-NUCLEOTIDE EXCHANGE FACTOR  coord: 6..409
IPR026829  Protein Mon2-like                                  PANTHER      PTHR10663:SF333  PROTEIN MON2 HOMOLOG               coord: 6..409
IPR016024  Armadillo-type fold                                SUPERFAMILY  48371            ARM repeat                         coord: 21..412

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10007167.1  HG10007167.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015031      protein transport
cellular_component  GO:0005737      cytoplasm