HG10007663 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10007663
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Unknown protein
Location: Chr10: 9556595 .. 9603873 (+)
RNA-Seq Expression: HG10007663
Synteny: HG10007663
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTCATGATCGTGGATCTTCTACAGTCGCTGCTGCTAAACTCTTCAGCTTATCTGGGAGATTCACGATTCCCAAGCGTTTGCACCTACTCTGTATTGTTTTGCTGGTAACATTCCTCTACAATTTGTTTCTTTTATGTTTTCAATACAATGGAAATTTTATGGTTGATTATCTCGTTTGTCATATATCTCATTGGAGATTCGCTTAGGATTTTTTTTGGGTATTTATTGCTTTCATTTGTTGTTTTAAGCTATTTTCTTCACGAAGTTGAATAAACCTGTTATTGGAGTTTAATCGCGTCTCGGAGAAAGAAATTGGTGAAGTAAATGAAAGACTGAGACTGAGATACCCGAATTTTACATTGGTGATGATATAAAAGAAAGTAGTAAAACATATCTCAATTCATGAGATTGATATGTAAAAACAACAACTGAAATGGAAATGAACAGAAAATACGTAAATGGCGTTGATAAGTTCCAAGCATTTCGAGACGCTAGGCTTTCCCCAAAATTAATCCCAGAATATATTGCCTTCAACCTCTTGTAACTACTTTTCCCAGAATCAAAATCAGTCTTTCCTGTTGGTTTCTGTTTTGGCCTTTTTGGTTAGTTGAATTTTGTTTGGTAGTTGTTGATTTTCTCTCACTGGGTGGTCGTTTCCTTGAATATTATGTATCATTTCATCTACCCATGAAAACTTGTTTCTTGTTGGAAAAAGAACCTTTCTTTTAATCGGCAATAAAAGTATTTAATCACTCCCTCTAAAAGCGTAACAAATATTGATGTTAAAGCACATCGTTTTCATCTTTTCTTATATAAATCTGGTCTGGTTGGGAATTAGTCAGAAAGTGCACAGCCTTGAACAAGTTTCCAATAGCTGCTGTTTCATCTAGAACACTGGTGGACCAGTTAGTTCTTTTAATAATAGAATGCATAGTGACCTCTGGGAAATGGTTAGCGTGTTAACTCACAAATACATTTGTATTAACTTCTTTCATGTGCTCTCATCAGCTATTAGCAGCAAGACCGTTTGCATCCTCCTCTGGAAATCGTAAAAGTGGAAAGTCCTCTGTATTTTCCTTGTTTAACCTTAAAGATAAGAGTAGATTTTGGAGTGAGACAGTCATACGTGGTGGTATGTCTAGCTTTCTTATCTTTGTTGCCTCAAAGTTTATACGTTCCTTAATCATGGTTATTGCTAATGAGTTCTTTTTTTTGTTTTACTCTTTTTGACACATTGTTTATTTATTTGTTTAGATTTTGATGATTTGGAATCATCCACCACTGAGAAATTGAGTGTTGTTAACTACACGAAGGCAGGTAATTGTATGCTCGAAACAGTGAAGAGTTTCTTTTTTGTCAATGTAATAAATATTGTGAGAGTTGGGGGCTTTTAAGCCCTTGTTATGCTTATGTTACATTAAGCATGAGAGTTGGTTGGATGTCACATAAGTGAAGAGGAGTTCGTGGAGTGAGGTAAAAAAATTTGAATTTTTACTGGAACGAAACTCTAGTTGGGAGAGTTCTAAGTTTTTCAAAATCTCGGAGTACTTAATGGCTAATGCAACAGTCTTTAGTCCTATCACAAACCAGCAACACCAATGGATTCATGCCAGATACAGAGCCATCTGAAATGTCTTTTAAGCTCCATTTGGGATCTCACCCTGCTGATTTTTGTTTACATTCTTGAAATTGTTATATTCATTTCCTTTTCATTTCCTTGCCTGTTTCTTCTTATAAATGAACATGTAGAAAGGTTTCTAAATCGTCATTACTCACTCAGTGATATATGTTTTTGGAGGGGAAATCTTTTAGCTTGGTTGTGTTGGGTTTACAGAATTTATTATTTCATCTTATGTCTTCCTTTAGAAAGTCTCCGGTACCCGTGAAAGATAATCTGAAATATATGTGTATTTTATGGCAATTATTTAAATCTGGAGTTGGTGCGGTGTAAAATATCTTTTGTATTGTCATTTTGAGCGCCAGTGACAAGGGAGTT
GTATTATTTGTTTGTTACTCATTCCATATATTTATTATTTAAACTATTTATTCTTTTTGGTAATTTCCACAACGAAAAAGCAAGAGCACCATTAGAAGTCTCTTTTTCACATCCATCTTCCATTGCAGGTAATGTAGCAAATTACTTAAAGCTTCTTGAAGTTCATTCCCTGTACCTTCCAGTCCCTGTGATTTCATTGTATAGGATTTGAAGGGAAAGGTAACCATGGTATGTCATGCTCCTGATTATGAATTTGTTTAACGGTTGGACTTTTAGCTTTGGTTCCGAATTGTTATGTGGATGATCTAAGAGTTCTCTTTCTTGTTAGAATTCAAGCTGCATCCAGAAGAGCTTGAACGTTGGTTCATGAAACCTGATCATATCTTTGAACATACACGGATTCCGCAAGTTAGCGAGGTGCTAACCCCTTTTTATAATATCAGCATGGACAAAGTTTTGAGGCACCAACTACCCCTCGTCAGTCACATAAACTACAAGTAATGCTGTACTGGTACTTTGTATATTATGGGACCTGAAGTTGACATGCTTCTTTGAGAAAGTTAAATTTCTTATTTTATGTTTGTCATAAGTTTGTAAATGACGATTTAGCTCATGCCTACAAAGTTTGAAATGAGATCCTTAATCTTCTCATGAAAAATGTTGTTGAAATTTTGCAGTTTTTCTGTTCATGTAATACAAACGGGCGAGAAGGTTACTTCAATCTTTGAGCTTGCAAGAAATGTCTTATCTCGCAAAGAAGACGTATCCAATAATGGGTAATTTTGATATTATTATACTATTGTTAGTTTTATCATTTTTTATCTCATTAGATTCTCATTGTTGAAGCTATTTGAGCAATGTTGGCTTTGGCTTTTTTTTTTTTTTTTATTTTATTTTATTTATTTATGTCTAAATTTTTTAAAATTATTTATTATTATATGTATGTTTCTTTTTATAGTTTGGCTCTTCTGGCTGTTAATTGTATCTCATGGTTGCAGGGATGGGAATGATGCTTTTTGGCAAGTAGACGTGGACCTGATGGATGTACTTTTCACTAGCTTTGTGGAGTACCTTCAACTTGAAAATGCTTATAACATTTTTATTCTAAATCTCAAGCGTGACACAAAAAGGGCCAGATATGGATACCGGTGTGTAAATTTTTTAGCAAATCCTACTGCAATTTTGTTTGTTTGCTTTGTTTCTTTTCTGAACTCTAATACTAGAGATTAACCATACAGTTGAACCCTCAGTGATGTAAATTTTTTTTTCCTGGTCCATCTTGAATGGTCTTGGAATCTGTTAAATATTGCAATTGGATCTGTATCTTATAAAGAAAATCAAATCCATAGAAAGTTTATGACCATGTTCATGGAAAGCAGTTAATAGTATTGTCAGCATATCCTTCTTTGTTATTCTTGCAAAAGGAAATTACTTATTTTGAATAACCTGGAATCACACACTATTTAAATAAAGCCTAAAGGTGTGCATAGGTTTCTTTGTAGATTCTCATGTTTCTCTCCTTTTGCTTTGGTGGTGTTTGGAAATTGGCATTGTCATTTGGCTCTTGTATTAGGCTGCTTTTTCGCAGAGCTTTGCAGGCTGGGAATCCTTTTTGGTTGTTATTTTTTTTTCTTGGAAAATTGTCTAATTTAGTCTTTAGGTTGTGAAAACTTACACTTTCATCTCTAAGTTTTAGGAAATATTCCATTTTAGTCCTCTGATGGTTAGTTAACGTAGTGGAAAAGTGATGTGACATATTTTATAATAAAATAGGTAATATTGTAATTTTTTAAAGAAAAATATAAAAAGAGAAAAGCTATCTCTCCCCCTACCCACCCCACCCCCTTTCCTTTCCCCTTCCCCCTTCCCAAGTTCTACGGTCCCCCCTGTCGGGCCCACCCTTTCGACTCTCTCATCTCCATTGAGCAACATTCGACGGGGGTGTGACCTATGACCTCGGCATGGCCTCCTCTAGACAGCCTCCCCCTGCTTCCATCCCC
ACTCGCTCCATCCTCATTGACTAGAAGATATTCACGCTACAATGTGATATCTCTTCCAATGGTAAACAACTGCAAATCACGGAACATAGCAGAATGTCCTCTCAATCCCTTACCCTCACTGGGAAGGCACTCGATTGGTTTTCCTCCCCCTTCAATTCACCCTACCGTGATCCATGTAACTTTAAATTCTTTAGAAAGTTCATGGATGTTGGGGCCACCATTTGGCTAGAAAAACACAGTAACAAAAATGGCTACTTTGCTGAGTTAACTCACCTGTATTACTAAGGGCGTCAGATCAAACTTTTCATCCCATCTGAAGATGACAACGAGGGTGGTTTTCCTTCTTCTCATTGCTTTTAGAGTACAAGAGTGGACCAACATCGGGTGTCACTAAGCCTCACAGTTATAAGGCTGCTGTGCAATCAAAACCTTCTTTTAATTCAAATCCACTTCCCCCACTACCGATTACAAGAGCCCAAGTCTGTATGAGCTCTACTCCACTTGCCGATAACACCTGCTTACTGCAATATGCTTGGACCTCAATGGTCGTGGTGCAACGATACCAATTACATGATTCTTGGTCGGAGGTTTGCACTGCAATTTAATCCTCTTTGTCACTTGATGCACAATCAACCCGTTCCTTGAAGACAAGGCCCTACTACATGTCTATGATCCACAAATCTCTGAAAATCTCAAAAGTCATACTGATTGGGTTAACATTGGAAAGTATCGGCTAAAGTTTCTAGATACCTCTATTTTATCCTCCTTTTAACCGCAGCTAACGTCTTCCTATGGTGGCTGGATCGATGTACACAATCTTTCTCCAACTCTTTGGAATGACAAGGTATTTGAGTTTATTGGTGATAAATGTGGTGGCTACAGCCGTGCATCGAATAATTCAGAAAGGGGAATCAACTTCATGGCAACACGATTGAGAATAGCTTTGAATACAGTGGGATTTATCCCGTCCCACTTGAATTTACCGATCGATCTGGCTGGAATGGAATGGACGGTGAACATTCGTGGTCTTGACGCAAGAATTGCCACTTTGCTTCCGGGGAAGAATCAACGCTGGCAGCACCATTAATGGGCATTATTTCGGCTTCAAACTTGCCATCTAAGGCTATCAACTAAATTGACAAAGGTAAAAATCTCATATCGAATCTGTTGAGATTACGAGTGATTCACCTTCTTCCCCTATATTGCAAATATCAGCCTCTCTTCTTGATTTGGTGGAGAAAATCATACCCAAAGAATCAACAACCACTCTCCCTGAATTGGGGATAATAATATCCCCTTTGTGTCGACGCCCTCCCACACATAATCATTGACCTTCCCAATTTTTATACTTGGCCCCACGTAGTCTCCCTCTCCTTCATTTACCCAGCCCTCTCTCACAACAGTTGAATTCCATGGAGGCCCATCTACTTTAAGTGTTGGCGGCAAACACTTCACTAATACAAAAGGCCCAACCCACTTATCTGATATGGAAGCATGGACCTTGTTGTATTCCGTGAAGATCCAGAAGAAGCCCATCAACCTTTAGCCACTCTCTCTCCTATACTTGCTAACTTACATAGTCCATTGCCCGAAACATCTCAGCCGTTAAACTCATCTTTACCATCTTTTGAGCAAATTATCCCACTTGTAAATCTCATTGTCCACTTAATTTACGTGACATGACCTCTATTCCTATTGATACTTAGGTTATGCATTATGGTCATCCCCTCTCTTCTACCACCGAAAGCTAACAAGCCTTACTCCAACAATGAAAAGTGCTATAAATTATAACGGGAACTATAGAACCTTCAACGTTTGTGGTTTAAACTCCTGGAATATGAGGGCCTTGGTTAAGAGATTGATAATTCGGGCATTGCCTTTCTTCAGGAGACAAAACTATTAGAGGTCGATTCTTTTATGATTAAATCCATTTGGAGTTCTTCTTTTATTGGATGGGTGGCGCTTGATGCGATAAACACCTTATGAGGTATTCTTATTCT
TTGGCGTTCTCCCAATTTTTTTGTTCATGATGTTATTCATGGACACGACATTGTTTCTATTCATGTATCTATGGCGGATGGTTTCTCATTTTGGCTTTCAACGGTTTATAGGCCCTCAGATAGTACCTATCATTCCGATTATTGGCAAGAGCTGGATGACCTGGCTAGATTGGGAGGTGATTGTTGGGTTATTGGTGGCGATTTTAATGTGACCCGATGGTCTGGGAGAGATCACATGATCACTCTATTTCTTCTAGTATGCATGCTTTCAATTAGTGGATCTTCAATTATAAACTTCGTGATATTCCAATGATGATGTAGTAGGATTAGGGTAAATTAGGGTATCTTGGTCAAATCCTTAATTGAAGGTTTAGGATTAGTTTCATTGATTGATTAGGATAAGGATTAGTTTCCTTGGTTTATTAGGATTAATATTAGTTTCTTTGATTAATTATAATTAGGATTAGTTTGCTTTCCAATTCTCTATAAATAGAGGGATTGTGTTCTTGTTTTGATAACTTTTGATTCATAATAATGACTTTGATTTTATTATTGGAGATTTCTCTCCTTTTATCTTTTCCACTATATCAATTTGGTATCAAAACAGTTGCTTGGGCCAATGTTGACTGCTGCTGGTGGAAAACTGGAAAACTCTTGGCGACACCAATCGCCAGAGGGAGCTTGTGAGTTGTGCTTGTTGCCAAACAACGCACATCTAATTAGGGGGAACGCTTTCAATTAATTTTTTTTTTTTAAAAAATATTTTGCAAGAATGAGCTACTACCGATCAAGGATAATGTCCAAGAAAAAATGCAGCAGTTGGTATAGGCAAGAAAGTCAAATTTACTTCAAAAAACAATGCTGGAATTTTGACTCAAGTGATTCAGAGGAAGAATTTCAATATCATGTCGCTCATGCTCGATTTTATCAAACAACCTATCAAGATTCACATTGGAAATTTGATTCTAGTGATTCAAACAATGTCAATGAAGATTTAAATACAGTTCGAAAAATCAAAACAAATCGTCATTACAAGGAACAAACTCAAAGATCTAATCATACCAAAATTTCAAGAAATTGACGTGAGAATTTGGACTTAGATGATTCAAAAAGTGATTCAGAGTGTGAAAATAATTATGACCAACATCAAAAAGCACCCATGAATCACTACTACAACAATATGAGGCTGAAAATTATTCAAGAATACCCAATTTGTACATCAATCCTAAAGTATCAGAGGAAATGAAGAAGGAACTAAAAGTTATCCAACGTTTGTTGGCTAAGGTGGCTCAAAATATGGATCAACTTTTAGAACAATGGCAAGAGTTAAAAAAAGAAATAATTCAAGAAAATCAAAACCAAGAACTTGTTGGATTATAGGAAAATCAAGAATTCAACAAAGGTTGTGGAATCTTGCAAGAAACACATAAGACCGAAAACCGAAAGGAAATTTGGAACCCACACATGGACCAACTGAGCGAAGAAGAAAAAATGATCGAAGAAAAATCTAAAGCAAAAGCTAAGGAAGTTGTGTTCAAGAAAGAACACGTAATAATAGACGAAGGTAATTTTGGCAAAAAAGATGACAACCACAGGCAACTCGATTTGAAGAAGTTTGAATCTCTGGTTGTAGTTCATTGCGTGGAAGACGAGGAAGAATATGTGGAAACACATAATCCATTGGAACTGACCCAAGGGATCCGACCCAACATGTGTGCCCAATTCCTAGCATGTGTTTCACGCAATCCGACCTAGGTTCAATCCTTCCCAGTTTTAATTTGCTGCCTCCCTTTCCGTTAGTTTATTTCAACCATTTTCTTCACAAATTTTGGGTTATACAACAACGATTTTTCATATGGGTTTATGGGTTGCTTTGATATTCCAACTTAGGCTAAAATGATTGAGCAAAATCTTGGTGGGTGGTGTGCATACAATTCTCAACAACCAAAGAAAATTTTTGATGAGGGTCATGCTCTACAGTTCTCAGATTTGTTATGGGC
TCCTTCAATCTTTGTAGTTGCAGTTGTAGTGCAAGATTTTCACAGGCCAACGATTTTATTTGGGATTTTTACAAATCATCGAAAAAAACTTTGTATTCTTCATTTTCCCTTTGAAACTCGAGGACGAGTTTTTTTGGGAGGGATAGAATGATGTAGTAGGATCATGGTAAATTAAGATATCTTATCAAATCCTTAACTGATTAGGTTAAGGATTAGTTTTATTGATTGATTAGGATTAGGATTAGTTTCCTTGGTTTATTAGGATTAGGATTAGTTTCTTTGATTAATTAGGATTAAGATTAGTTTGCTTTCCAATTCTCTATAAACAGAGGAATTGTGTTCTTGTTTTGATAAATTTTGATTCATAATAAAGACTTTGATTTTATTATTGGAAATTTCTCTCCTTTTATCTTTTAGGTTACATCAGATGAGTAATGGTAATTTTACTTGGTCCAGTTCAAGACCGATTCAATATTCATCTCTAATTGATCGATTCTTCATAACTGACTCCTGTCTAGAGAAATTTGGAACAGTTAAGGTATAATGATTCGATTGGGTTACTTCAGACCATTTTCCTTTGGCCTTTATTTTTGGTGATATCAATTTGGCCCTTGCTCGTTTAGATTTGATAACTCTTGGCTTCAGGATCCTTCTTTCCAACCTTTAGTGGAATCCTGGTGGGCCAATAACCCTATGGCTAGATGGCCGGAGCATGGTTTGATGATGAAATTAGAAGGCAAACTTATGAGTGATCAACTCTCATCGTTTGCTTATGGAACGAATTGAATGTTTGATGGCTTAGGAACATATCTATTGGAGATAGCATTGCAAGCTTAAATGCCCATTGGGGGGATGAAAACACATGTTTTTTTCATTGTATTATTGCGGCCTACAAGAGAAAGAGTTCGATTACTGAAATTATTTCTAGAGATGGCCAAAGCCTTTTGACAGGTATTGGTATTGAAACTGAATTTTGTGATTTCTACTCTCAGTTATTCACTAGGAAATGTTGCCAGTAGTTTCTCCCTCTTAATGTCGATTGGAGCCTTATTAGTGATGAACAAGTAGCGGATCTTGAGCGGCCATTTTTTGAAGAGGAGGTTTTCAGAGCAGTTTCCTCTTTGGGTGTAGGTAAATTCCCTAGTCCTGATGGTTTTACTGTGTACTTTTTCAAAATTTTCTGGCCTACTCTGAAGCGTGACATTATGACTATGATCCATGATTTTTATGATGTTGGGGTTATTAATGCTTCCCTCAATGAAATTTACATTTGTCTCATCCCGAAGAAAACCGATTTCAAGTTGGTATCTGATTATCGTCTTATTACCCTTATCCCGTGTGCTTATAAGATTATTGCTCGAGTCTTGTCTAATCGACTGAAGATGGTCCTTCCTTGCACCATTGCTCCTAAGAAGCTTGCTTTTGTTGGGAATAGGCAGATCCTAATGGTTCAGTTATGGAGCATTGGGATCCTTCCACCTCCTCTTGGTCCATCACATTTCGAAGACTTCTAAAAGAGGAAAAAATTAATGATTTTCAGAGTTTGTTGAGTTCCATCTCCAAGATAACAGTGCGGGATTACCCTGACAAGCGTTTGTCATGGTCAATTGAACCTAAGGGAACTTTCTCGGTCAAATCCCTTATTAATCATCTTTCTTTGGCCTCCCCCCTAGACAAGCAATTGAAGAAAGCTTTGTGGAAATCCAAGAGCCCTCGGAGGGTTAATATAACGATTTGGATCATGCTTTTCGGGCTCCTAAATTTCTCCTCAATTTTGCAAAGGAAACTGCCCTCCCACTACTTATCTCCTAATATGTGTGCTTTGTGCAAGGCTGATTGGGAAGATCTTTAGCACCTCTTTTTTGACTGTTGTTATGCTGGAAAATGTTGGGAATATCTTTTCGCTTGCTTCAACATTAGTCGGGTGTTCGGAAAAGATTTTAAGGATAATATTTTGCAGGTTCTTGTGGGGCCAAAGCTCGGTTCTTGTGGGGCCAAAG
CTCAAAGCAACACCAAAACTTCTATGAAACAATGCTATCAAGGCTTTGCTTGCAGATTTATGGTTCGAAAGAAATCAAATAGTGTTCCATGACAAAGGAACTCCTTGGTTTTTGTTAGATATCTACCCTGCACTCGGTAAAACCCAACAAGATATATAGAAAGAAGAAGAGAAGATATAATCCTGTGCTGTGATATTTGTATTCCAAATGGTCTCAAAGAAGTAAATACTAAGTTCAGAACAATGTTGAGATAAGCAAGAAACAAGAATACTCAAAATAGAGAAAGGAAACAAGTGACCAGCACATTCGAGAGAGCTGGCTATCCTCTCCTACTCCCTAGATTAGGGCCTGCTCCAAGTATTCCCCCATCTCCTTTGTCTCAGCCATTCTTTATTTATCAACATTTCAAGAGGAGGTCCCCCCCACACTACTAATCGCTTCTCCCACTACTAACTAATTCCACCTCACTTCTCATTTCCCTTTTTGCCCTTCCTTTTATAAGTATAAATGATTAGGGGCCTAACAATACCCCCCCCCCCCCCCTCAAAATGCACCTTGTCCTCAAGGTGGAGAGTGGGAAACTGTTGTTGTATCAAATAGGCAGACTCCCACATGGCTTCTTCTTCATCCATGTCCTTCCATTTAACTAGCCACTCATTCATTGCCAGCTCCTTGTTCCATCTAACTCCCTGAATATCTTCAGGTATCGCTTGCAACTCGAATTCTTCTGTCAGCACCGGTGGATGATGTTGCGCTGCCATACCTTGTCCCATTTTCTTACAATTTGCTAATGATACCCTGCTCTTCTCCACATCTGATCGTACTGCTTTACAACATCTTTTTGAGGTTATCATTATTTTTGAGCAGGCTTCGGGCTTGTCTATCAATTTATCTAAGAGTGAGCTTTTGGGGCTTAACTTATCTGACTCTAAAATGGACTGGATGGTGAATACTTTTGGTTGCAACTTGCAAGTAGGGCCATTGGCCAACTTCTTATCTTGGCTTGCTCTAGGAGGTAATTCTTTTTCTCCAAGTTTTTGGCAATCGATTGTGGAGAGAATGCATCATGAACTTCACAATTGGAAGTATGCATATATATCTAAAGGTGGTCGGCATATCCTTATTTAGGCTACTTTGTCTAGTCTTCCCACCTACTATTTATCCCTTTATCAAGTTCCTTGTGATGTTATTATGTGATTGAATAAATTGATTCGTGATTTTTTTTTGGGAAAGCTCTCATGGTGATGCTGGCATGCATAACGTGAAGTGGGAAACCACGCTACGTCAAAAATTGATGGGTGGTCTCGGTGTTGGAAATTTCCATTGTTGTAATTCTGCTCTATTGGCCAAATGGATTTGGCAGTTTTTGTATGAACACGATGCTTTGTGGAGAAAGCTCATTGTGGCTAAGTATTGTGATCCTTCTTGCATTTGGCCGACTCTTATTCACGGTTCTTCCAAATCGCCTTGGAGATTTATCTGTCAGAATATTGATTTGGTGGCTTCTCTTTTGCACTCAGAGTTGTCTTGGTATTGGTACATTTATTTCTTTTTGAAAGGACCCTTGGCTTAGTTGTGGTATTCTCTCATCGATGTTCCCTCGATTGTTTTGCCTTGCTCGTTTTTCGAACATCAAGATGGCTGATATGTGTGGAATCTGAGACTTGAGACTTGCGTCTTCGTCGTAATCTTAATGATAATGAAACTCTTGAATGAGCTTCTTTGTCTCATCTGTTAAGCTCTGTCAGGTTTCGTCTATCTCCTGACTCTTGGGTATGGATTACTCTTCATCGTTTTTGGTGAAATCTCTCATGGACGATTTGGTGAGTACAGTTGAGCCTCTTGTGGAAGATTTATATTCTGCCATTCGGATGAATCATTTCCCTTAGAAAATTAAATTTTCTTATGGGAGCTTAGTCTTGGTGCTGTTAATACTGAGGATCGCCTTTAGTGTTGTATACCCTATTTGTCTATCTCTCCATCTTGGAGTCTCGTGTGTC
AAAGACATTTTGAATCCTTTGCTCATCCTTTTTTTCATTGTTCTTATGCTTCTCATTTTTGGCATATTGTTTGGGATGCATCTGATTGGTCGTTGGTTTGTTCCGACAATATGTTTGACATTCTTGCATGTTTATTGGTGGGTCATCCTTTTCATGGCGCAAAGAAGATGGTTTGGTTGGCCATTATTCGAGCTTTTTTTTGGACTTTATGGGGCGAGAGCAATAAATTTGTTTTTAGGGATTCTTTTTTTGCTTTTGATTATTTTATGGATTTGGTTCTGTCTACAACTTTGTATTGGTGCAAAACTAAGCATCCTTTTAAGCATTTTAGTCGTTCTTATCTTATTTTCAATTGGAAATCTCTATTTTAATCACCTCTAAGCGCTTTGGGGTTCCCCTATTTCATTTATTCAATGAAATGTTTCCTATCTAAAAAAACATATCCTTTCCTCTTCCATTTTCCCTTTCATACACCCACCATCTTCCTTGAGAAGAACACTCTCTGCCAACATGTCAGTCCACGTGGTTCTTGTTAAAAAGAAATAGATATAGTTGCCCACACACGAGCATAGTTTAGTGGTAAATGGGAATTGGCATCTAATCCTTTCCTTGAGGTTGAAGATTCAAATCTCCATGCTCACATTTATTGTACTAAAGAAAAAAAAGAGAGAGATATAGTTACTGTTGTGTCTGAAGTCTGATAATGGAAGTCCTATAATAAATTTTGGTATCTTCTTGGTTTTGTACTCGGACTGCTGTAAATTCACATGTACAAATCATGCAGGCCGCATAAATAATTTATGACTTGCAGCATTATGATAGATGAATGCAAAAATCTGTGATTTGAATATAGATCTCTTTGACTCGAATAATTTGAATAGAAGAAAGGTTTGTGGAGTGAGCCTTGTCACATGTTAAAGTGTTGAAACGTGAAAGATATTATGGTATTTTCTCGTTCTTCTGTTTCATTATTGATTATTCTAATACTATTGCAGGAAAGGTTTATCTGAATCGGAGATAAATTTTCTTAAAGAGGTATTTACTTTTTTCTTCTTCTTTCTATTCCCTTTGCATAACTGGAAAATCTTTCTATAATCTACTTCTGGGCTTTGAGGCTATTTGGCCCACTTTTGTCATTTGTATATTTTATTTAATATTATCAACGCAATATCTGTTTTCTTTAAAGATTCTTTTAAAGAACAACGAAAATATACAAAACTTCATATGTAAGAAGCGTGGGCATGGCCACGTCTACACAATTTGAACATGCTGGTATATTATTTTCAAAGAAACATGATCATGCACACTACCGATGCACAAATATTAAAAGTTTTTTCCCCTTTCCCTTTCATTTGCTGTGATTAATACTTGAAGAATTTAGAATTTCTGGCACCAAAACTTCTTCAACCACATTGGTCTCTTCATCTTCTAAGACTTGAATGCGAGCTTCATCCAAAGGATTTGAGAAGAGCTTTGCATCTAATGGAAATATGTTTGAAAGCAGAGTGGCCGGAGCTAAATCAGAGCCAAACTCTCCAAATTTGAGGGAAAATTCTCCTCGATGACTAATGTTGCTGGGATAAAACCACATTTGTTTCTTTCCACTTCAATAATTGCTGAAGAACAATCAATACAATTAACTGTCTGTGATGAAATACCCACAAAACCACCTAATTGAATACCTATGGCTTCAAAGACAGAGCGTTTCCAAAATTGCAACGGTAAGATTTTAATGAAAATCTTCCCCCCATAGCTTTTGATGAATTCAGGAAGTGAATGCTTCTCTTTGGACCAATGCTCAATTTTTAAATAGAATTTACCTATAAAGGTTCCATTTTTCTGCAAACTTGAAAGAATCAATATCCTTTCTGAAATGGATCAAGGCTTTATCATCTAAAAAAGGATTAATTGAAACTTCGGATTCGAAATTTTCCTCAGTAGCTTCTTTAATCTCCCCTCCAGAATAATGAGCTATCAGTTTCGAAACAACCATACTTGAT
TCAAGATTCAGCTCCAATATTTCTCTTTCTTTTCTAACCCAGAAGGATTTTTCTACTTCAACCTTATTTTAACTTTCAAACTTCTGAACAAGTGGGAGGGATGTACCTAGGTTTTATCTGTTTGGATTGTGGAGCTTTGTATCTTTGAGCACCAGTCTCTTTTTAATGAAAAATGTTGTAGCTTGTTTTAAAAAAAAAAGAATTCTGAGGTGTTGAGCTTAGTATATTCATATTATATAAGGAACAATTTACTGCTCCCCCATGTGCCTTAGTGTGATCAACACGTGCAACTTTGAAAATGCCAGGAACTATGTTAGTAACTTAGCATAATACTTTTTTTTATAACCAGAAATAACTTCACATTGATATAATAAAAAGGAAAATATATAAAGGATACAAACCCCCTTAGGGAGTGAAAAAGAAAATAAATCTAGAAAACAGAATAAGATCTTAAAACAACAATCAAAAGTGAACGACCAAGAAACAAATAAGATCCATCAAAACAAAGTTCAGCTCCAAAGAAAACTCGAACAACAACCACAACAGCTGAAAATATGAACCAATGCAAGAATAAGCTGGCACCTTAGCTTAGTACTTGGTATTTGTTAATGTGAATTTCCTTATATCATGCAATTTTTGGTGTGATTTTGCCTTGGTTCCTTATACTAGTTTCTGGTATTTATAAATAATTTATTTTTATGATAATATTAATTTTCTTATTGATGAGATTGACAACGATTGATTGGTTTCTAATAGAACGCACACTTGCAATCAAGAATTCTTCAATCAGAAAGTACACCAGAAACTATTCTTGGTATGATCAATTCTTTTGGGGGTTTTCAAATGTTCCTTTTCTTTATAAGTTATCAGAGTTTACAAAATTCTATGATATTTAATGATTGACAGCTCTTGAGAAGATTAAAAGGCCCTTATATGAAAAGCATCCCATGAGTAAGTTTGCATGGACAATAGCTGAAGACACTGATACTGTAAGTATCCTTGTTAATCAGACTGTCGAGATAAAATTCCATTAACTCCACTAACTGACATTTTAAATTTTTGTATGATTTCAAATCTTTTATGGTTTTTATTTGAAGATGGAATGGTACAACATCTGCCAAGATGCCCTAAGAAAAGTTAATGAATCGTATCAAGGAAAAGAGACAGCTGATATCATTCAAAACAAAGTTTTGCAGGTGCATATAACTCTCACATGTTTTGTGGCTAAAATATCTTAGACTATGATTTATAATGATTATTTCCTTCCGGTATGTTCTTATGTTTATTTGATCTAGATATTGAAGGGGAAAGATAGAGAGATGAGGCTTCGTCTTGATAAGGAACTAAAATCTTTTGATTTCAGTGGTTTCCATGCTGAATGTCTCACAGACACATGGATTGGCAATGACAGGTACCTAGCATCTATATGTGCATGCACAAAGTATATTATGATTGAGTATTACATCATTGATATCTGCTTTGCATCAGGTGGGCATTTATTGATTTAAACGCAGGCCCTTTTTCATGGGGTCCCACTGTTGGTGGTGAAGGTGTGCGAACTGAGCTAAGCCTACCAAATGTTGAAAAGACGGTTGGTGCTGTTCAAGGTACTGATGACTTTAGTTTTGCCAAAACAGTTTTCTTTTTTCAAATTATGTATAACACATTTATTCTAATAAAAAAGAAAACTAAACAAATAATAGTTAATATTATTTGGACTTTTTTCCTGCAGAATATATATTACTTCTTCATGTAGACTAATGAGAAACTTAACATATCAACTGAAATTTAACTTGATGGTTTCCCAAGCGAAAGAAGTTCCCTATTTTTAAGCTCCATCTTGTTTTATTTTTTTCATTGGTTTACCTTTTTCATAAATTTTTATTTTATTTCTGGCAGCTGGTAATTATTGAGCATTTTATCTTGTGAGAAAACATTTAAGAACCTTGTAGTCCATGCCATTTAAAATGCTCAGTAATTACCAGCCACCATAAATAAAG
TAAAACTCATTTACCGTTGTTGATAGCATCTTGTGAGAAAACATTTAACATTTGTTTGGCTTTCCATAGTTTTCTGAGCCAAACAAGCGTTACTCTATTTTTAAGTTATAACATCATTATATCTTCGTACTGGTTTACCGTCAAAAGTTTTTACTCTATTTCTGGCGTGCAGAAATCTCAGAAGATGAAGCTGAAGATCGCCTGCAAGATGCTATTCAGGAGAAATTTGCTGTTTTTGGTGATGTATGAACTTTCTTAAGAAACCAGTAAGCAATTGATATGGGAAGTTACAGGGGTCAATTAGAATTGACAAATATTTTTTCTATAACTTCTATAAGTACAATAGAGTTTCTATTCAAGAAGCCTATTCAGATGCTTTATTGGATGCCTCCGTTGAATTTAGAAGTTAAAACACAGGCACATCAAGTTGATCGACAGCTTGCATTTCATTTTTGGTTGAGAATCATCTTATTTGGTACTAATTAATTGATAATACTCTAAGTGATAGGATATTAATTAATGAATCAAAAGCAATGTCCATTGATGAAATTAATTACATGGTCAAAAGTAATATTTTATTTGTACCCAATGCTTAAGCTTCACCGATGTTCCTCAGTAGCTCAGTGGTAGAGCGGTCGATTGTTAACCATTGGTTGTATAAGCTACAGGATATGCTTAGAGTAATGTTCATTTGTTGAGATACTGGATGCTTACTGACAGTGGGGAATTGCATGTAAAAAAGGGAGAATCTTCATTAAATGGGTACCACTGATCAAAGTTCAGCACATTCCAGATGGTGTAAAGTAAAGTGAGACCACGTAAAGAGCTCTTCCTTCTTCAGTCAGATTATTGTTAGACCGTTTTTCTTTTTCTTCTCATCTTCATATAGAAAGCCAACCTTCCTCATATTTCTCCTTTCATTAATTGAATTGGAGGAATCCATAGGCGGCCTGTCCAGTTATGCATTACATGAAAGAACATTTCTTTATATGACTTCTCTTTTGCCTCTGGTCGAAGGAATACAAAAGGGAAATCAATCTTCAACGTGGTTAATTGTCTGGAGAAATTTCTTAGAGACTTCATTTGGGATGGCAATGCCTTTAAACCACTTCTCACTTGGTTACTGTTAGGTACTTAAGCACCTTAGAGAACCACACCCCAAAAGCTAGCTATTGAGGTGAGAGAGCCAAGCCGCTTAAGTACCACATTGGTCATCCCATTCTAACCGATGTGGGACAAAGGTAGCCCATACTACCTTGGTTCCTAACAGTTGCCTGGAAATGAACTTCTTTACTATAATGTATGGCAGAATCGGGGTTGGATCTCTTTACCAAAAAAGCAACCTCTCCTTATGAAATGACTTTGGAGATATATCAAGAGGAAAATGCTTTGTAGAGAAGGGTCATTTGTCCTAATTATGGAGTGGAGCATAGGGGTTTGGAAACAAAAGACACAAGAAAAGGGAAGACAGCGGTTGTGGAGTAATATTATGAAGAATAGAGAGCATTTTTCAGCTTTGTTGAGTTTGTGGTGGGAAATGAGAGAAGAATTAAGTTGTGGGAAGACAAGTGGTGTGGGCCAGATCAACGTTTTAAAAAGCGCCGTCAGGCGCGCGACTAGGCGCAAGGCGCCCCTTTGGCGCTTTGCATTGATTAGGCGAGGTGAACAACATAAGTTGCGCGCCTCCATTGTTTGAGGACGTGGAGGCAATATCTACTGGGATCATGTTTTGTCGTAAAGACCGACAATAGTGCAATATGCCATTTCTTTAGCCAGCCGAAGTTAACTTCCAAGCAGGCACGGTGGCTAGAATTCTTGGCAGAGTTTAACTTTAAGTTCAAGCACAAGACAGGAAAGAGCAATCAGGCTGCCGACGCCCTGAGTCGGAAAGGCGAGCATGCGGCCCTGTGCATGTTAGCCATCTTCACTCGAGCAAGATTGATGGTTCAATGCGAGACATCATTGGGGAATATCTTCAGAAAGACCCTTCTGCCCAAACT
GTCATGGCCTTAGCTAAAGCCGGTAAGACCTGACAGTTCTGGATTGAAGGGGACTTATTGTTGACAAAAGGGAATCGACTATACGTTCCTAGAGCAGGAGACCTAAGGAAGAAGCTGCTACATGAGTGTCATGACACCTTATGGGCAGGCCACCCGGGGTGGCAAAGAACGTATGCGTTGTTGAAAAAGGGCTACTTCTGGCCGAAGATGCGAGACGACGTCATGCAATACACCAAGACTTGCCTCATCTGCCAGCAAGACAAGGTTGAGAGAGCGAAAATTTCTGGGCTCCTCGAACCACTGCCAGTCCCTACCAGACCGTGGGAGAGTGTATTCATGGATTTCATTACTCATCTCCCAAAGGTCGGCAATCATGAAGCCATCTTGGTCATCATCGATCGGTTCTCAAAATATGCCACCTTCATTCCAACTCCCAAACAATGTTCAGCTGAATTGACAGCTCACTTATTCTTCAAGAACCTAGTGAAATTGTGGGGCGTCCCAACGAGCATTGTTAGCGACAGGGATGGTAGGTTTATTGGTACGTTTTGGACCGAATTGTTCTCCTTCCTAGGGACAAACCTGAACATATCTTCGAGTTATCACTCCCAGACTGATGGCCAAACAGAAAGATTTAATTGTCTGCTCGAAGAATATCTATGCCACTTCGTTGACGCGCGACAGAAAAATTGGGTCCAATTGTTAGATGTGGCTCAGTTTTGTTTCAACTGCCAAACAAGTTCGTCAACAAGGAAGAGTCCCTTTGAAATCGTAAGTGGAAGACAACCCGTGTTACCTCATATTCTTGATCATCTTTACACAGGAAAAAATCCCCAAGCCCACAACTTCACAAGAGAGTGGAAGCAGACCACCGACATCGCATGTGCTTACTTGGAGAAAGCCTGCAAGCATATGAAGAAGTGGGCAGACAAGAAGCGTCGTCCACTCGAGTTTCGAGCTGGAGATCGAGTCCTTGTAAAGCTGAAACCAGAACAGATCAGATTTCGAAGCCGCAAAGATCAACGGCTTGTCAGGAAGTATGAGGGTCCCGTGGAGATTCTGAAGAAGATCGGGAATGTTTCGTATAAGGTAGAGTTACCTACATGGATGAAAATCCACCAGTAATTCATGTAAGTAACCTAAAACCTTATCATCCCGATCCGAATGACGATTGTCGCAACACCACAATTCTACCGGGGATAAACCTGAAGCAGAAGGAGGACAAAGAAGTTGAAGAGATCCTTACTGATAGGATCAGAAAAGTTGGAAGACCCACCCGGAGAGTTCTCGAGTTCCTTGTCAAATGGAAGAATCTCCCTGTGGAAGAAACAAGTTGGAAACGTGCAAAGACTTGGATTCCTGGAAGCAGAAGATCGAAAAGTTCAAGCTCCGCCAGTCGACAGGAACGTCAACTGTTTAAGTGGGGGAGAATGTCTTGGGCATGCTTGTCCAGGGCATATTTGCCTATGGCCGCATGTCCAAACCATCCTCAACCACGCCATATGTATGTCTTACTATTAGTTTATCAAGTTGTTTTTGTTGTATGTTGTGTTTTTATTTTCTTGCGACGAGGTAGCACCTTTAGCGTGCAAGCCTGCCAGTTTAGTTTTTGCTCAGAATGTACTATAGGGGAAACACACCTGTCATTTCCCCGACCAAGTATTGTTGTACTCTTTGCAATCAAAGCTTTCCTTGCAATTGTCAGTCTTCAATGCTTTCTTTAAAGATGTTTTCAATGTTATTTCTCTCTCCCGTTTTATAAAACCATTTTCTCTCAAAACGTGGGAGGAGGCTAGACCTTATAGCGAGATTGTCCGCGACTTCCGAATTTGTTCGGCTGACTTGCTCGCCGGGTTAGAGCAAGTGCTGCACAATCGCCCGTCTCGTATTTGAGAAGTTGCGATTGTGACACCTCACACGCGCTTTCCATGAGGCGCAAAGCGCGAGTGTAGGCGTGAGGCGCGTGCTTTTGGTGTTTGTAAGTGGGGACCCTCAAAATG
CAAGAAATCTTGCGATTTACTATTAAAAAAAAAATAATTTTTAAACCTAGTTGCTTAAACGGGAAGGAAAACGATTGGATTGACTCATGAGCATGAATGTCGTTTGGTATTCGTATTCCTCGGCGCCATTAAATCTCTCGCAGCCGGCAGCCCCATGACCATCTCTGTTCCATCTTCAACGAGGGATTAAACGCAACCGGCGCCGGCCACTTCTTCTTCTTCAATCACGTTTCAATTCAGCCACATTGCTACTGCAACCCTGCTACCCTTCTTCAGCTCTACCCTTCTTCAGCTCTTTGAAGATTAAACGACAAGGTCCAACACTCTGTTTTTTTATGAAAAAACAGAGGAGAAGACAAGACAATACAACAGAAGCCCAACACTCTGTTTTTTGAAAAAAAAAACAGAGAAGCCAAGAGTCACAACACAAGGCAACACAAAATAAGCCAAGCAATCTGTTTTTTGTGAAGAAAAAACAAAAGAGAAGAGTCAGTAGACTATAGACAATCCAAAATCTCTGTTTTTCCACCTTTGATCGTTCGAGTTAGTCAAGCTTCAATCATCTCTACTTGAAAGTTGAACGGTTTGTCATTTCTTTTTTTTCAATCCAAAAATTTGTTCTATGAAGTTTGTTTTGTATTTCGTTCTAGCAGTAGCCCCACTTCAATGCAATTTTATTTTTTAATCTTGCTTTCTTGAGGGAATTTTGAATTAGATTTGTTATGGCATGCTATTGTAATGTTGGACTTGGAGCTGTTGTATTCTTTATAATCTTGGACTTGGAGTTGTTGCATGAATTGTGAATTATTTGGTTCTAATTGCAGTATTTAAACAATTAAACCTTGATGTTGATGTTGATGTTGGAACTCTAGAAATTAGAAATTTAGAACTTATTCTTTAAAAGTAATGAATTATGATTGTGTGTTGTGTAAGTTGTGTTGTGTTGCAGACTTGCAGTGTTGTGTTTGTGAATTGTGAGTTATTGGATGTCGTTGTCTTAGGCAAAGTTGAGATTGCTTGTCTACTTTAAACAGAAACATACAAGGAATGAATGAAGGAAAATTGATATCATGCATAAATTGTTGGAATAAATTTATAAACAGAACTTCAAATCGCATAATGAAACCCTTACTTTGCAGCCCGCACTTGAACTTTTGTTTTAGAATGTGGATTCCACTACTTCCTATAAGTTATAATCTTATTTAGTCTTTGACTCTTTTCTAAATAGATGGCTGAGGATAGTTTAAGAAAAGATCCAGCATGGAAATATGCTCGATTGCAGAATGACCAAGATATAAATACATTTGTCTGTGGATTTTGTTCAAAAATAACCAAAGGAGGGGTCTATAGGATGAAACAACACCTTGTTGGTGGTTACAAAAATGTCACGGCCGGTAGAAAATGTCCAGATCATGTGAAGGAGGAAATTAAAGAGTTTATGTCCAAGAAAAAGGAGATTAAAGAACAAAGAAATTTTATTGTGGACATTGATGCAGAAAATTACGGTATTGAGGACGAGGATGAAGATGAAGTTATTGTAAGCAATTCAAATAAAAGACCAACACCTAGTGGTCCAAGTTCAAAGAAGCCAAGACAAAAGGGTCCAATGGATGCTTATTTTGCACCAAACCCAGAAACTGTCGTTCGAGTAGAAAGAATGACAAAGGAAAGGGAAAACAAACAACGATAAATATGGCCTACAAAAAGGAAACGAGGGAGCACACCATCCAAAGGATTGCTCGATGGTTTTATGATGCTGGAATACCGCTCAACGCTTGCAATTATGAAAGTTTTGCCCCTATCATTGAAGCAATAGGGTAATTCGATCTTGGTTTGAAACCACCTTCTTATCATGAGTTAAGAGTGCGATGTTTAAAAAATGAGTTAGATGCCACTCATGAGCTCATGAAGAGTCATAAGGCAGAGTTGGTCAAAGTTGGATGCACGGTTATGGCCGATGGATGGACGGATAGAAGAAATAGAACATTAATTAACTTTTT
AGTTAATAGTCCAAAAGGCACTATGTTTATTGAATTCATCATATGTAAAGGACGGGGAAAAATGTTTGAATTGCTCGATAGTTTTGTTGAGCGCATTGGAGAAGCGAATGTTGTGCAAGTAGTTACTGTTAGTGCCTCTGCAAATGTAATGGCAGGTAAGAATGGTGGGATAATTTTGTTACTTCTTTTGCAATTAGCTAGTACCTTGTTAAAAGGCTAATATGTTCTTATTTTACTTTATTAATAGGGAGATTGTTAGAAGCAAAACGACCACAATTAGTTTGGTCGCCATGTGCCGCCCATTGCATAGACTTGATGTTGGAGGATATATACAAGATCCCCAATATTCTCAAAGCATTGAAAAGAGGCATGGAGATTAGTAATTTTATTTATGTTCGTCCGGAATTATTAAATATGATGAGACGATTTACTAATCAAAAAGAGTTGATTAGACCAGCTAAAACTTGTTTTGCTACTGCTTGCATTACATTATCAAGTATACATCGTCAAAAGAACAACTTGAGGAAGATGTTTACTTCTGACGAATGGAAGAATAGCAAATGGAGCAAGGAGCAACAAGGCAAATGAGTAGTTTAGACTATTTTGTTGGCTACTTTTTGGACTACAATTGTGTTTGCTCTTAAAGTATCAGGCCCACTGGTGCGAGTACTTAGATTGGTGGATGACGAGAAGAAGTCACCTATGGGATATATTTATGAGGCCATGGACAGATCTAAGGAAGCTATTGCTAAATCTTTTAGTGATAGGGAAGAAAAATACAAGGACATTTTTACAATCATTGATAGAAGATGGGAGCTCCAATTGCATCGTCCTTTGCATGCAGCGGGGTATTATTTAAACTTAGGATTCTATTATTCAAATCCTAACATTCAGGAGGACGATGAAATAGTTAATAGGTTGTACTCATGTATAACAAAAATGGTTGCTTCTTTGGACATAAAAGACAAGATACTTGTAGAACTAAGCAAGTACAAAAGAGCTGAAGGATTGTTTGGACAAACTTTAGCAATCAGACAAAGAGACAAAATATCTCCAGGTAAATTTTAAATTTTTAAGTGTTTGATAACTAAGGGATATTAATATAAGCTATTAATATATATTTTCAGTGGAATGGTGGGATAATTTTGGACAATCAACTCCAAACTTTCAAAAGTTTGTTGTGAGAATTTTAGGTCTTACTTGTAGTGCCTTCGGATGCGAACGTAATTGGAGTGTGTTTGAGCAGCTTCATAGCAAGAAACGAAATAGGCTTGCTCAAAGTCGTCTGAATGATTTAGTTTTCATCAAATACAATAGAGCATTAAAACGTCGATACAACCTTCGGGATATCGTCGACCCTATCTCTTTAAAAGATATTGATGATAGTAATGAGTGGTTGATTGGAAGAATGGATGATGATTCTGAGGAGGAGGATGAGCTTGTATTCGATGATGATTTTTTAACGTGGGGTGATGTTTCAAGAGCAACTGGAGCAAAAGAACCTGCCTACTATTCTAGAGGTAGTACCTCAAGAGCTAAGAGTAACGTTTCATGTCTATCCTCGTCCTCTACACAACCACTGACACAACCCACCCCCACACAAGTAAATTTAGATGACTCTGAGATGGAAGAAGAACAGATGGCTATAAGTCCAATGATGGAGTGAATGAAGATGAGGACCAATTTAGTGATGATGAATTTGATCTTTAGGACTCTAAATTTTATATGTGCTTGTTTCTAAATACTGTATAGTTTTGTGTTTGTTTTTTCGATGAGAATTGAATATTGACTATTGAGATATTGGCTTCATTTTGTGTTGAAAAACTAATATTATAACATTGGATATCTTTTATATATTGTAGTTAAGCCTCATTTCTTCCCATGAACATTTTCTATTTTGTATACACACATATATATATATATATATATTTTTTTTTTTTATACATAGTGCGCCTTAAAAAAAAAAGCCCGCGCCTTTTTGTGCGCCTTGCGTCTAGGCT
GCAGAGGGCCATTGTGCCTTAGTGCGCCTCGGGCTTTATAAAACACTGGGCCAGATACTTTAGCCACAAAATTTCTAGATATATACCCCATTTCCAATAAAAGGGAGCCTCCATTGTAGATTGTTGGGATAAGCAAACATCAAGCTCGGGACTTGGGGTTTAGGAGACCTTTTTTTGATAGAGAGTTAGGGGAATTCGGACAACCTTTTGCTCAACTCTTAAATGATTGGATTGTCACTGGTTGGTGGACAATGACCATGATAGAATCAGGTGGAGGATTGATTCAACGGGTTTCTTCATTGGTAATTCGACCTCTATGAAACTAGCAAAGGGGCCACAAAGTTAAACAATTCCTAGGGTGAGTTTAATTTGGAAATTCAAGATTCCAATAATAGTGAACTTTCTCATGGTCGTTTGCCTATAGAAGCTTGAATACTCACGAGAAGCTGCAAAGGAAGTTTCAAAATTGGACAGTCTCTCCTATTTGCTGTTTATGCCTTCCATAGGAGTCCATCGACCACTTATTTCTACATTGTCCTTTTGCAACCAAAAGATGGTAGACTATTCTTATTGAGTATTGGTGGAGTTGTGTTTACCTAGGAAGATTGATGATTGGCTTAGGGAGGCACTTGATGGTAGGATTTTTGGCAGAAAAGGGAAAGTTCTTTGGAGATGTGGTACTGATTCTCTCCTTTGGGGCCTTTGGAAGGAAAAGAATAGCCAGGTGTTCGAAGACAAGTCTTCTTCTTTTGACTCTTTATGTCTTTTAGTGCAACACAATGCCTATTGGTGGTATACGAATTATACTAAATTCTTTTGTAGTTACAGCTTTTTGATGATTATTAAGAATTGCAACACTTTCCTTTGTTAGTTTTGTGGGGAGGAGTAACCTCTGCCCTTGCCCTTAGGCTGTTCTTTGGTTCTTCTGAGAAATATATGTCTCCGTTTCGTATATAAAATAAGAGAATACATTTTAATTGGATAATATTGTTTTTTGAAGCTGTTTATTAGGTTTTACAGCTCCTGTTTTGAATTGTCAAATGTTGTGGAATTTGCAGAAAGATCATCAAGCCATTGATATTCTTTTAGCAGAGATTGACATATATGAGCTTTTTGCTTTCAAACATTGCAAGGGAAGGAAAGTCAAACTTGCTCTTTGTGAAGGTTTTTTCTTTTTTCCTTTAAAATGCAAGCTTCAATATTAATTTTATATTTTACTTTAATTGATCTCGTTGCTTGGTAAATTGTGTTTCAAGAGGATGCATGTTGTATAATTGTTTGATTTCAAAGTGTACAATTTGGTGTTTTTTTGACACAGAACTTGATGAGAGGATGCGGGACTTAAAAAATGAGCTTCAGTCATTTGATGGTGAAGAATATGATGAAGATCATAAGAGGAAGGCCATAGATGCATTAAAACGGATGGAAAATTGGAATTTATTTAGTGATACGTACGAGGTAAAAGGCAGACTTTTATAGGTTTTGTTATCTTTCACTTTTATGCAACATTTCTTGCATTCTTTATTATGTTTGGAAGAAGCGCTATTGACTCTCAACATGAATGTATATTTTTTTCTCTAATTTCTTGTACCATGCAGGAGTTCCAAAACTACACTGTAGCACGTGATACTTTTCTGGCTCACCTAGGTGCTACTCTCTGGGGGTCAATGAGACATATTATATCACCTTCACTTTCTGACGGGTCATTCCATTATTTTGAGAAAATATCATTTCAATTGTTTTTCATCACACAGGAGGTATGGTTCATAATTCTAAATTTATGGAAGGACCAAGGGTATCTATTAATTTCCTCCAAACTCTCTGACGCAGAAAGTTAGACAAATTAAACAATTGCCCGTGGATCTTAAAGCTCTAATGGATGGGCTCTCCTCTTTGTTGTTACCTTCACAGAAAGCACTATTTAGTCAGACCATGTATGTCGTTTATCATGTATTCATACAATGTATATTCTATATTTTATATTTATATCTTAAATATAT
AAGATGGGCTTTAGACAATAAATTTTAGGGAAATTTTTAAAAATAGAAAAATAAGGGAAACTATTTACACAAAATAGCAAAATTTTTAGATAGTTGTGATAGACGCTAATAGAAGTCTATCAAGGTCTATCAGTGATAGAAATGATAGAAAAGTCTATCACTGATAAATGCTGACAGAAGTCTATCAATGTCTATCAGTATCTCTTTTTTGCTATTTTCTGTAAATAGTTTGACATTTTTTTTAGCGGTGAAAATTTCCCTAAATTTTATCATCTAATCTTAAATAGCATTTTTTATATTATAAAATGGAATTCTTTGGATTGAAAGATCCTTTCAGTGCATTGTTTTTTGTTTTTTGTTTTTTAAATTATTTATTTATTAATGTGAAACTTCCTTAGTATATGACCAATGAAGTTGGTGAGTAACCACAAAAAGTGTGAATTTATAACACCAAGAAGGCTAGAACAAAATAACATCTCAAATATGGTCCAAATTTCTATGTTTTTCTAAACAAGAAACGAAACGTTTCATTGATGAAATGAAAAGAGACTAAATGACTAATCCTCAAGGATATGAACTCGAAAGGGGGAGAAATGAAGAAAACAAATAAATCTACTGAACAAAACAAAGCAAAGCAAACCAAAGGATCCAAAAGAAACTAAAGAAAGCCAAAGAAAACCAAACACAATGCTAGACACGGCCAAACTAAAGTGCACTAAAATAAAAAATAAGAAGCAACAACAACCGAACCTCGAGAAAGCAACAAAGAGCCAAAGACCAAAAGTAGCTAAAACCAAAACTACATAATCGAAAACAACCAGGAGGCAAACAAAGAGCCTGAAGCCCCAAGCCAAAATAAAACTCTTCATTAGGCTCGGAAGTACTGGTCCAAAAGCCTTGAGAAAGAAACCAATGATAGGAAACTGTGAGAAATATTATTAAGCCCATCCACTGATGCCGTCAACAAAGATAGGAATTTCACGCAACTCCAGGGCACAAGCTGCAATAAGTGGACCAAGTTTGCTTGGAATCAGGGAAGGATGTAGAGAAGACTCGTTAGAACAATCCTGAAATAAAGAAACTAGAGCTTCTGAAAAATAATCTTAAGAAATATCATCCGAAATTTAATTAATGGTGGGCAATATGGACTCCACATTACTCATGCTGACCTTAGAATCAGCATCCGAAGCATCAAATCCAAGTGGAGAAATAATAGAAGCTGAAGGATCTTTTGGTGAATGAATGACTTCTTTAGTGAATAAATGATTAGACAAATCCTTAGAGATTACTTGTTGGCTGCAAAAATTGGGACTGGAGACCAAGAAGAAGATGATTGTTTACTCGAGAAAATCACAAGACTTGCCTTGAATTTTGATGGAAGCAGGAATAAAACCACAAGAATGAAGGCAGAACTACGATCCAAAACATATAGGGTTTTAGAAGCAACCATAACCAGACCCCGAAAACACTCCCTAAATTCTTAATCTGAGTCCATCCACCATAGCTTACACAAAATTCAGGGTTGCTATGTTTTGCGACAACCCAAATTTTAAATGAAAAGGCCCAATGTGCATCCATTTCCCATTACTGCCAAGTCCCCCAAGAATCGTACATCATTCATTTGTAGCGTTATCAACTATAAAATAGTTTGAGCACAAAAGGAACATTAAAGTGATCCTCAATGACTTCCTTAATTTCCTTCTAGTCATTACACGTATTCAACTTGGAGATTACCATGGGAGAACTCCCATCCAAATCAATCACTTTACTTCTTTGAATCCAATGAGAACTTGGAACACTTAGACAAGGCAACACTTTTGGAGGATCACTTGACTTGGATGCATTCTTAGAAAAGTTCTTAAAATTGGAGATAAAATCCTTCAATATTTCCCAAAAAGTAAGGCTACCCGGCTTAAAATGACCAACAGGCACACAAGTGACCTTCCTTCCGACTGTGATTGGCCATATGATACACTCCATTTTCCAATTGGAAAAATCTA
GTCGCTTTTGTATTCTAGTGGTACGAAATTCACCCCTCTTTTACTTAAGAACTTTGAAAAGACTACACACATTAACTCAAGCAAAGAATGTCCAAACCAAACGAGGTGAGAATAAGGGAGAGGAATGGGTGACTTAAGATCCATATCTTCAATGAAGAAATGTTCATCTTTGAACCAAATGTGCGGAAATACATGTTGTCAATACAACAATTAGCATGCTTCATGACAAAGTAACTTCTATAAACATAGTGCTCAAAATATGTATTTTCCTCTCTTATTATTCACAAGGCTAGGATGAATTTCTTAAATAGAAGAAACACCCAATTACAAATAAGGAAATTACAAATAAGGAAAATATAACTATGACAAATATAATAAAATAAATACAGTGAAAATATTAACAAAATGGGAAATAATCAACACTCCCCCTCAAGCTGGTTGAAAGATATCATTCATGGCCAGCTTGTCAATTAAGTTGATGAATTGCCACTTTGGAAGGCCTTTAGTTAGCACGTTTGCAATTTGTTTTGTTGTTGGAAGATAAGGTATGCATATTATTCCAGCATCAATCTTCTCCTTTATGAAATGTTTATCAACTCCAATATGTTTTGTCCTATCATGAAGGACCGGATTGTGGGCAATAGAAATTGCTGCTTTATTATCACAATAAATTCTCATGGGAATCGTCTGATAGAATTTCAGTTCTTCTAGTAGTCTTCTTATCCATATGCCTTCACAAATACCATGGGCTAATGCCCTAAATTCAGCTTCAGCACTACTTCTCGCGACCACACTTTGTTTTTTACTTCACCAAGTAACAAAATTTCCTCCAACAAAGGAGCAATAACCCGAAGTGGATCTTCTATCAGTAGTACTACCTGCCCAATCTGCATCCGTGTAAACTTCGACATTTAGATGATCATGCTTTCTAAACAATATGCCTTTCCCAGGGGTACCTTTCAAATACCTCAAGATTCTATAAACGGCATCAAAATGTGTTGGTCCAGGTGCATGCATGAATTGACTCACCATACTGACGGCAAAAGCAATGTCAGGACGTGTGTGTGAGAGATATATGAGTCTCCCCACAAGTCTTTGATACTTTTCTTTGTCTTTTATTTCTTTTTCAGTTGCAGCTTCTAATTTTAAGTTCTGCTCAATGGGTGTGTCAACTACCTTGCAACCAAGCAAACCTGTTTCTTTCAGGAGGTCAAGAACATATTTCCTTTGGTTGACAATAATGCCACTCTTGGACCTGGCAAACTCCATGCCTAGGAAGTACTTTAAGGTTCCTAGGTCTTTGATTTGGAAATCAGAAGCTAGTAGTTCCTTCACACAAGTCAGTCCTATTTCATCATTACCTGTAAGAATAATATCATCAACATACACTATCAAGACAATAACTTTGTCATTTTTTGTATGTTTATAGAACATAGTGTGATCAGCTTGACTTTGACTGAATCCATAGCTGGTGACTGCCTTCTCAAACCGTTCAAACCAGGCTCTAGGAGATTGTTAAGGCCGTATAATGATTTCTTTAACTTGCACACTTTGTTAACCCCGAGATCCATCTCGAAGCCAGGTGGCAGGTCCATAAAAACCTCTTCTTCAAGATCCCCATTAAGAAAAGGATTTTTAACGTCAAGTTGATAGAGAGGCCAATCAAAATTAACTGCAACAGACAACAAAATTTTGATAAAGTTAATTTTAGCAACGGGCGCAAATGTTTCCTGGTAGTCAACTCCATAAGTCTGAGTGAACCCCTTAAGTCTGAGTGAACCCCTTAGCAACCAATCTAGCCTTGTATCTTTCGATACTACCATCTGCGTTACATTTTACAGTGAACACCCACTTGCATCCTACTGTTTTTTTGTCATTTGGTAGATCAACTATGTCCCATGTGCAATTTTGTTTCAGCGCATTCATTTCTTCCATCACTGCTAAATTCCAGTTCAAATCATTTAGGGCCTCCTGAATATTCCTTGGAACTAACAGGTCGG
TTATTTTGGATGTGAAGGCTTTATGACTGTCAGACAATCTATGATAAGAAAGATAATTTGCAATGGGATATTTGACATATTTACGAGTACCTTTCCTAAGAGCAATTGGAAGATCAAGATCAGGAACATCAGGTAGGGAATTATGAGAAGAAGAAGAGTGTATGTTACCTGGATCTTCAAGATCATTCATCAGAGTATTAGATTGGTCCTGTGATAGATCAGATGTCTGTTCTCGATCCCTTTGAGTCAAGTTTCTTCTAGTATAAACTTGAAGTTCAGGTGTTAGTGTTGCTCCCCCTGAAGAGGAACTTTCTATACTTGATATCACAGAACTAGTGTTCATAATTTCAGGGTCAATAATGTGTGGGGTTTCCCAAAAATGATCTTCAAGATTAGATTTCTCCCCCTGAAGAGAATTTGGGCTAAAATAAGGTTGATTTTCTAGAAATACTACATCTAAACTCTCTACATACTTGTTGGTCGAGGGGTCAAAACATTTATAAGCTTTCTTATGAGGAGCATAACCTACAAAGATGCATTTAATGGCTCGAGGATCAAGTTTAGTGCGAGCAAGAGAAGTATGAACATATGCAAGACACCCAAATAATTTCATTGGTAAGTCAGAAAAAAGCCTAACATTAGGAAAAAGATCTTTAAAGTGATTGAGAGGCGTTTTAAAATTCAAAACCTTAGTTGGCCTTCGATTGATCAAGTAGGTAGCAGTAAGGACTGCATCACCCCACAAATATTTTGGAACATTCATAGAAAACATAAGGGCACGGGCAACTTCAAGGAGATGTCTATTTTTTCGTTCAACAATGCCATTTTGTTGAGGAGTATCATGACATGTAGCTTGATGAACAATACCCTTATCGTCAAAAAAGTTTTCAAGTGTTCATTGAAATATTCAGTACCATTATCAGAGTGAAGAATGCGGATTTTATTTTGAAATTGAGTCTCAATCATATTGTAAAAACGAACAAAAACGTCGTTTACTTCTGATTTTTTGGTTAATAAATAAAGCCAGGTTAGACGAGTGTGATCATCTATAAAGGTAACAAACCAACGCTTACCACTATGTGTCAAGATTTTAGACGGTCCCCACACATCAGTATGAATTAAAGAGAAAGGTGAGGAAGCCTTGTAAGGTTTAGGCAAATAAGTGGATCAATGATGTTTGGCAAAAATGCAACTTTCACAATGAAAATCAGAACAATCAATTCCTTTAAATAAATTTGGAAATAAATATTTTAAATAAAAGAAATTTGGATGCCCTAATCTACGATGCCAAAGCATGATAGTTTCTAGAACAAAGGAAGAACTGACACTACTGAAGCCCTGAGCCGTTTTATAACTAGAAGAAACTTTAACATCAAAGTAATAGAGACCATCAATCATTCTAGCACTGCCAATCGTCTCCCCCGAGTCCTGATCCTGAAAGGTACAATGAGTTCCACAAAAAATAACACGACAGTTAGCGTCCTTAGAGATTTGACTGACAGATAACAGATTACATGCTAACTTTGGAACATGAAGGACAGAATGTAAAGTAAATTTTTGAGTCAAAGGAATATGTCCTTTGTCTGCAATAGGGGCAAAACTACCATCTGCAATGCGAATTTTGGAAGTACTATATATCGGAGAGTATGATTCAAACAAAGAGGAGGAACTAGTCATATGATCAGTGACTCCAGAATCTATAATCTGTGACAAGCTCCAAGGGCAATACATCTCCTACCTCAACATTCTTCAATCTCCTTTTCCTAACATGGTCAACTATAGATGGTCACCTGTTGCCAGGGATGGATGTCAAGCACACATAGTGCTTCATGCGTTGATAATTCCTCTTGCCCATTGCCATGCGCAATTGCTGGGCTAACCTTCTTTTTTCTAAATGATAACGAAGTTTTTCATTGATGAAACGAAAAGAGTCTAATGCTCAAAGTACAATGAAACAGAATTGCTGGGCTAACCTTAAAAGTATTGGTTCGTACATAACA
GTTCATTTAAACACATTCAATTCATATGTCTTACGTAGTGTTATGAATGAGTTCATGGATATTGTTAATTTGAAATGTTCACATTTTTATTTTATTTTATTTTCTCTTAAATCTATCCTTGTATTTCATATCCTGCCATGAGATTTTTACTATCTGTAATGTGAATGGTCATCTCTTCTTTTGGAATTATAGTTCCTGTGGTATTTGTAAATATGTATTATAGGTTACCACTCTCAGAGGATCCTGCTTTGGCGATGGCCTTCTCAGTGGCACGACGTGCAGCAGCTGTTCCACTGTTGCTTGTTAATGGAACATATCGGAAAACAATTCGTACCTATCTCGATTCATCTATACTCCAGTATCAATTGCAGAGATTGGATCATTCCCTTAAAGGTTTTGGGCCACATTTAATTTATATATATGTATGTATGTATACATATTTCTAATTGGATTTCATGATTTCATGGTCACTCTATATCTTTGTATTAACTCTCCAGTGATTTTGTTTTTTTGTTCACCACAAAAAATGACCTATTATGGTAGTTAATAACATTTTGGAGAGAAGACATTCTTTCTTAGTCATATTAACTGAGGAATTAGAGTCATCTTACGAATATGATGCTTTGTTTAAATATATGTAACCTAGACAAGTTAAGAAAACTAACAGTCTTATTCTATCTATGAAGTTCAGGAGGTACTGGTGGACTTGAGGTAAGATGAAGATGAAGAAAGAAGAACATATCTGGAAGATATCAACCTCGAATAGGATATAATCATATATTACAAGTTAAAAGTTTCACCTTTTCCTAGAGATCACTCAGGCCATAGGTGTTCAAACTTGTTTTTCAAGCCTCCTTGTTGCCTTTCTTATTTATCGTTCGATGAGGTTTTTGCAATTCCATTTGGTATATGTTGGACGATGATTCCAAAACTTCCACCAATGCCATTTGAAATGTTCAATTGCCTTGCGTAAAAAGAGTTCTATGATCGACTTCTGATACTTCTACTGTACCTAAGTAAGGAAGAAAGAAAAAGGACGTTGAGAAGGATGTTCTTTGAGTGATTCCATGGAATTCTTGTAACATTGGGAAGGTGGAAATGAATAATGGAAAAATACATGATGTATAAATTCACCAGCCTTAAAGCAGCTTAGGATTGATACCCAATCTTCTGGAGATTTTCTTTACAATTTGTTCCACGAAGTTAACTTTTCAAACCAATTAGGTTTGAATATTCACCTTTTTTGGTCTTTTAGTCTCCTGTAAAATACGGCTGGAAAAAAATATTTTGGATATCTTCCCCCTTTGGGCCTCCTTAATAAGGGTTTTACGCATCTCTTATCTCTCGATGATGTTGAACTGGAACCCTTTTTTGATAACGATGTAGTTTGTAGCCTTCTGTTTCTGGCTCCTTTTCTCCCTTTATTTTTTTTTTAATGGAAACAAAACTTTTCATTGAAATAATGAAATGAGACTAATGCTCAAATTACAAAAGAAGATACTAAACATAAAGCTAGCTTGCCAGTACAAGAAGAGAAAATTAAAATAGACCAATAACAAATAAAAGAAGAGAAGATAAAAGCTTGACATGAAATGAGACTTCTTCTAGGACAGCTTGAATTCTGAAGAAACCAAAAACCAGCTTCTTGAGGCTAATCTATCAACCAATCTTTGACCCTCTTTCAAACAACAAGATCAAAACCTATAAGATTAGCAAAATCCTTCTCAAAACCCAACAGGCTATAAGATCCATCCAAGCTGCCAATAGATCCTTAAAACTAAACAAACTTTCCCAAACTTCATGGGTAGTGAATGAAGCTTTACCCAAGATTATCTCAAAATGATCCTAGTAAGCAAACAAACGCCCCAAGACTTCTGAAATCTTTTTTTATCTTATTTCTTTTGCATTGTCCTTGTATATTCCTTCTTTCTTCTTTTCTTTTTTTCTATGTGAGTTCCTTAAATATTTTTTTTTGAACAAGCAATATAATTTCTAGCTTG
ATTAACTTAAAACTAAAGCTGCTCGAGAACCTTAAATATTTGTTTTTGAACAAGAGGCGAAACTTTTCATTAATGAAATGAAAAGAGACTAATGCTCAATGAAACAAACTCCGCAAGGGAGTGAAATTCTAGTGGCAAAAAAGTAAAAAAACAAATACAAAACTGAATTAAAAAGGACAGAAAACAAATCAAAAGAAGATAATAAGGCTCTATTAAGCATCAATAAAAACAGCCTAATTTAAAACGTATCTTGAAGAGAGTATCCCTAAAAAAATTTGGATAAAGAGCACCAAAGTGATGCAGTGCGGTGAGCTTAATCAAAGTCGTCCACCCAAGAGCTATATTCATCTTCAAAGATTTGTTGACTTTCCATTAGAAAAGGGAAGTTCCAACACACCCAGATCATCAAATTGATCCTTAAAAAGCTTCATGCTTCTAGTTGCTCTTCATGTAACCAGCATGTAACAGTGAAAATACCACATAAACACCAAGGTTGAAGCCAAAAAGCAGCTACTGATCAAATCTCGATCCAAAAAAAGAGATTTTTCCTTACTATCATCCCATAGAGTCAGCAAGTCACCCGATCTACCAACAAAATGTACCGCAGACCAATCAACACTCTTGGAGCTCCAAATAGATTTAAGAAGTTCGAAGGAGGTTAACTTTCTTGAATCAAAATTATATCCAGGTTTTGTTGCTTAGGAAACTTTTTGAGAGAAGCTCTTTTTTTTTTTCACATCCCCCAGACTCCTTGTATTCCAGGAAAATATTTCCGTCGTTTAGGGAGAATCAGATGGTTCCCGAGGGAACAATTTCTATAACCTAAGTCAACTTTCTTTAAAAAATGAAGAGAGATTTTCACTGTCATGGCCAACTGCACCTAATTCTGAAGTGTGATCTCCTCCTCATATGTGCAGAAGAAAAAAAAGGGTTTGTAGTCTCCAAACTGAAATTCGGTCAAGTTACTTAACGTGTTGAAGGGGAAACTCTCAGAGAAGTAGTATTCTTCTCCATCTAATCAGGCTATACAAAATGGTAACTAGGATAATTTAAGGTATCTCCTTCAAATTACTCAAAGAATCCTCCAAACAAACCATCTCACTGACTAACTTCTTGATCTTTTCTTTTTGCGCATGGGCGTACTTATAAGGTCTAACATTCACAGGCCCTCCCCTTCCTTCAAAGTAATCCGATGATCCACCTCCCTTTTTTAGGGTAATGTAGTTGGCCATTCGAAAACTGATTCAAACACATCTAGCAATGGTTGGATTATCCTCGGGGTCTCATCAGTTTCTGCTTGCTTCGATTGGCCCTGCTGTAGGTTTTGCTCCTCGGTTGCAATTGGTTATAATTCAACCAAGAAACCTTGATCCTCCTCAGACCAAGTTTGTGCCAACATCTTGAGAGAAACCTCTATTTTAGATTTTAGTCAATAGATCTCCTCTGATAGTTACTGTATTCTCTTCCGAGGTGAATGTCATAGTAAGGGCAGCCCAATTGATCCCCATAAATTCTATCTTTCTCAGCCATTACATTCCCAATATCACGTCTACATTTCCCAAATCCAGAGGCAAAAAGTCTTCCAAAATTGACAACTCCGGTAGACACACCACCACAAATTTGCACATGCCTTTAGCTCTAATTGTCGTCCCATTTATGAAAATTACCCCATAGTTAGAAGTCTTTGTAATCAGAATATATAGTTTCTCTTCTCTTTGCTGAAGAATGGATTTATGTGTAACATCGCAATCAATTACCACGACAACTTGTCTTCCCTTACCTTTCCTTACCTCTCCTTTTACTTTCATGGTCCTAGGAGCAGAAAAACGCAAAACTGAATGTAAAGCCAATTCGGCGATTTCGTCCACCTCTAGGGTTTTAGTTTCCAGTTGCACTCTTCCAAACTTGCCAATTCAAATTTAGTTCCATTCAGTGTTTTAAAAAGCCCTCTCGGGTGTGCGCCTAGGCTCAAGGCGCAGGTCTGGTGCCTCGCCTTAGAA
AGGCGAGGCTCACAAAATAAGACGCGCCTTTTGTGAAGCCCCAAGGCTCAAAGCCCTAGGGCTTTGGGGATTTTTCATTATTTTTTAAAAATAATAATAATTAAGGGTTTTCCTTCTTTATAAACTAAAAGAATCAAGTTTACTAAGCCTAAATGCAAAATTTCTTATGTTTGGGGTCTTTTTTTTCTTCCATATTTCCACTTCAACTATGCTCTCTCTTTTTAATGCAATATTTTTATATATAGTGCACCTCACAAAAAAAAAACCCTGCACCTTTTTGTGCGCCTTGCGCCTAGGCTCCAAAGAGCCATTGCGCCTTAGTGCGCTTGGGCATTTAAAAACATTGGTTCATTCACTATCAGCACTCTTAGCTCCCGTACTTCACGATTCTTGCAATGATGGCCAATAAAGAACTTCTCATCACAACGAAAACACAATCCATTTTCCCGTGTTGCTTGGTATTTTGCATTAGATAGGCATTTCATCTGTGCATCCTTCCATTGAGTCCTTGTGACTGTGCGAGTTATCAAAGGGTCTGAAGCCCTATTCGACAAGAGGTCCTTTAGAAACATACTCTCCAAGACATCTTCAGCACCGTTAGGTAACTGAGCAGAGTATGATTAAAAAACATGCCTAAATTCAGACACTGTACCTTGTTTTATCTTCAGAAATCGAACCATCAAACTTCTTGTGTTGGAAAAAATGGTGCAAATAATCGGGCCTTCAAGTCCGCCCAGGTGACGAACTTATGACAATTGTTAGCCCATCGATATGAGTCAACTGCACCTTTTTCAAAACTCACTATCAGAATCTTAATCTTCTCTGAATCTGTCAGCTTCTATATTTCAAAATGCCTCTCTGCCCTAAACATGAATTAGGATTTTCTTAAAGAAAACCCAGCATCTCTAGCTTTTTAAACTTGCCCCGATCCTCGATCGAAGTGGAATCTTCAACGATTTCTGTTTTCCTTTTCACGTTAGAACCATTAGAGATCTTCGATGTTTTTCAACCCTTGCTGAACAAAATTTCACGCACCTGGATTGTAAGTCTTTCCAAACATTTGTTCATATTTTGAAGCACATCTTTAACGTTTGATACTTCCTTCTCCATTCCTTGCATTCATTCTTCAAGTTATTTTGAGCCATTTTTTTCATATTCTGGATTAAGTGCTTTAATCTAAGAACCCTTGTTAATTAATGGCAAACATAGGCTTTCTGGAAGTCAACTTTGAGGATCACCCCCACACTAATTTTTAATTCTCATCTCATCAAGTCCTTTTACAATGAGGATCACATCCAAATATGTGTGTATGCATGGCAAAGGTAACTTGACTCTCAACAATGTTACCTAGGGGGATTTTAATATCAAATTTATGTCATCAGAACACTTCAATGCTGGGAATGCACCAAAGTCTTGGTGGCCACCCTTGTAAATTAAATGTAGTAACAGGGGTTGTCTTGTGAGAACAGGACAAATGTAAAAAAAAAGGACGGAGTACATTACATGTATTTTTCCAATACACTGTACTTTTATGGTTAGAAGATTCAAGTTTGATAGCATACTTAAAAGGGTTTCTTTTTTTTTTTTTTTCCTACTGATCTGTATCCTTTTCTTTTGTTTTTCTTTTTTTTTTTTTTTTTTTTGTTTTTTTTGCTGACTATGGATATTGCAGGAACAAATGCTCCTCATAGTTCTACACTGGAAGTTCCAATATTTTGGTTCATTCACACAGAACCCTTATTAGTTGACAAACATTATCAGGCAAAGGCACTCTCCGACATGGTTATTGTAGTACAGTCAGAGATTTCATCCTGGGAAAGCCATTTGCAGTGCAATGGGAAATCACTTATATGGGATATGAGGTTAGTTCAAGCATCTTTAGTTTTCTTGTGACATATGATGTTTGGACTGAGAAAATACGATGGAATTGTATAACTTCCAGAGTTCGCTGCTTTCCCATCATTCTAGGTGCTGGGCTCATTTGAATGAATTATTACTTG
CCTTGGGTTGGGAGTCAAGTGCATGTCATTTTACTCCCTATTTCCCTATCTTCTCCCCTTTTCTATAGAATCTTTGTCTCTGTCCCTCTCTCATTCTATCTATATATTTATTTATATTTATGAATAACCAAGCTTTCATTGCGAAAAAATGAAAGAATATATGGACATAGGGGATTGGTCGGTCAACATCGATTTTGGGATCAAACTAGTGTCAACCACTGACATGTCTGTTTTCAACCGTCGATTTTGGTTGGTTTTGAAGAGGCTATGTATGATGATTGACTAACCACACACGCTGATGCTTGGTTGGTTTTGGTCGGTTTTATGAATCTGGAAAACTCAAGAAAGAATAATACTAAAAAGAAACTAAATATATATTTAGTCTATTGTTTCTATTGTTCAAATGATCCATTAATCAGGATGGAAAATTATAGTTAGAAAAGGGGAGCAAAATGTAATTTGTGTTTTATTCTATTTTGAAACAAAAAGCTAATAATATAATCCTTTTGCTAACCCGTGCATATTGCAATTGGTTTAGGCAACCACCCTTGACTAGGAAGTCAAAGGTTAAAATTCTCCACATGTGGTTGAACTATAAAATATTATAATCCTTTACTTTTCTATTTGGATCAATATGTATGATTGTGTGCTAGGATAATAAATACAAAATGCTTTCTCTGTCACTTATGTCATTCTGCAGATTACCATAAGTTGCTGAGGTAAAAATGTAGAACCCAAACTTGTTGGTAATCATCAGTGACACCGTAATACAACTATACAAATGCTAAACACGTGGAAGACATTTCAATACGATTATCTCAGGAGTGCTTCTTGCTTCAAAGTTTGCTTTTCTTTTCTGTTTTAAACCCTTTTTTGTAAATCAAATGTGATTACAGTTTCTAGTTTGGTTGGGCTCTTCAGAAGTTTTCCGTCAGTTGTTTTGTCTACTTGTGGTGTTTGTTTACATTGAGGAGTTTCACTTTGGAAGAATTCTTCGAGCCTTTCAAGATTAGTTGCTATTGGATTTTTAGTTTAGAATCGATTAGAGTTTCTCTTCTCTTTTTTGTTGTATTTGGCCGTAATGGTTGTAATTGCTATATTGTCTTTCATTTTGTTTCATTGTATTTTGAGCCTTTTTATTTAATCAATGAAAAGTCTTGTTCCTTTTAAGAAAAAGAAGTGTGATAAAATTAGTATGAATAGCTTCATTTTCAATATATGTGGTTTACCACTTGTTTTAGCAGTTAACCTTAAAGGCCTCTGTCATCAGGAATTATCTTAGAATATTAATGGTTTGTTTCTTCTTATACCTTTCAATACGTAGGAAGCCAATCAAAGCTGCTCTTTCTGCTGCTGCAGAACATCTTTCCGGTCTGCTCCCTCTCCATCTTGCATACAGTCCATCGCATGATACAGCAGTTGAGGTACCATAGTGAAATCCTTTGGGTTGGAGAGATGTAGGCATATTAGTAGATATTTGTTGTAATACTTTCAGTTATATTCTGTTATTAGTTAGGTTTGTTTTAGAGGTTGCGCTCGGTTGTTGATTAGGTAGTTATGAGCCATGTGCCTCTTGCTTGAATCCTATAAATAGGTTGCATCTTGACATCTTCCCATCAATAAAATCATTCAACATATTATTCAGTTTACTTTTTGTCTCCAAGAAGTGTCTTCACCATGGAGTAACCATACGTTTTTCTTTCTTTCTTTCTTTCTTATTAAAAAAACAAACAAACAGAATTCCATAAATTTGGGTACACAATCTCTAACTGGATTTTGTTCAATCTGCTGGTGTAGTGTACTTTCTTTACCTTCTTGAAGTCCCCATTCCAATATTTTATCCTTTTAAATCTCTTAATCAAACTCAATTAGTTTTTTCTACCATTGAATTTTAAATGTCATCTATAGCAATTCGTCTCTCCTCTTTCTACTATGATCCCTCTCCTCTTGTTGCCCTTGTAGAAGCTAAGTAGAAACCTTCCACCACTAAAACATCTTTA
GAACCCTCACCAGTTGAAAATTCTGTCGAAAATAAGTGACGTATGCTTGTTGGGGGTTTGAATTGGATTATAAATGTGTAAAGTTCTGTTATTTGTTTTATTATCGTTTTATTTTAGTTTCAGTTAGATAGGGATAATTCGGGAATGCTAGTAGGTTTACTGTTTTACTACTCTTCCTGTAGGTAGAAGGTTAATTTTTAATTTTTTTTAATTAATTAAAATTAAAAAAAAAATAAAAAACAGAACTTTTTATTGATGCAATAAAAGAAGACTAATGCTCAATAGATACGAACTCCAAAAAGGAGCAAAGAGAAATAATGAAAATGACATAACTATAATAGTACCAAATAATAGCAGAATTAAAAGATATGAAGGATATAGAAAAGCATTCCAATTGAGATGTATACTTGAAGAGTAAACCTCACAAATGTTTGACAGTGAACACCAAGATGAAGTGTTTAAATGGGCTGATTCGAATCGATCCTAGGGGAGAAACTTACTGTGAAAAAATATGTGATTTCTTTCCATTCACAACTCTGAAAATATAGCTTTCACCACATTAATCCAGTTTGGCTTTAGAGTTCAAAATTGGCTCCATTAGAAGCTAGGAAATATTGCCTCAAAATATTGCTGAAAGCCCAATGAACATTGAATATCGATAACAATCTTGAACCAACCGTGTTTGGCATATCAGCAATAAAAAAAAAATATGGTTGTGAAGCTAGGTATAGGGAAAGATTATTCTTAGCCATCGAGTTATGAGAAAGGCTCAATATATACAAGAGGATTCTCAGTCATATATGCTGGAATTAAGCTATCAATCTATACCTATATGAAAGGAAAAATTACACAGTATACAAAGAAAGAATTTGCATTGCTCCAAGCTAAGCACGCTTAACTTTAGAGTTCCTATGATTGAGCTACCGAAAAGGAAGGTGCACCTTGTTGGTATAAGTAGAAGCTATAAATTCTTTTAACCTTTTCTTAACCATGCTTTTATATCCTCAGGATCCCTTTCATTCAGATGTGATATTGGTTCATTCATGTACCCCTCCGAAACTTGGAGAGATTGCCCATCAACTTTCGCTTGGTTCGTCCTCGAACAACATCTTACTGGGAGAGATTCTGCTCTAATACCATCTATAACGCCCCAGGCCCAACATTCAGATCAGGATTCAAAGTTCGGATTTAACACCTAATGGCTTCAGCATTCTCCTGCATCCCCTGCAACCTGGTTACGCTTTCTACTTGTCTTAAACTGCTTCAAAGAGTGAAGACTATCCCCACAAACTAACACGAATCTTTTCAGCATACTTTGTTCTCGCTCACATGCTTCCAAGGAAAATTCCCAGGTCACCCAACATAGAATTGCTCCAAACTAAGCACGTTTGACTTTGGAGTTCCTATGATTGCTCTACTGAAAAAGAAGGTGCACTTTGTTGGTATAGGTAGTAGCTAATCAATTCTTTTAAGCCTTTCTTAACCATGCTTTCATATCCTCATGATCCCTCTCATTTAGATGTGATATCAATTCATTCACATACCCCTTCTAAACTCGGGCGTTACAAAGATTATTCTTAGTCATCAAGTGATGAGAAAGGCTCAATATACAAGGGGATTCTCAGTCATATGTGTTGGAATGAAGCTATCAATCCAAACCTTTATGGAAGAAAAAATTACACAATATACAAGGAAAGAATCTAAATTAATTCTTAGTATACAGCTGTATGATTTAGCTTAATTTCCTGCACTCCCTCAAGCTGAATGGTATGTGCCAGTCATTTCAAGCTTGAAAACTAGTCCACTAAGAAGGGAAATAAAACCAAAATCTTAAAACTTCCCTAAGAAGGGAAATAAAAGGTAACCAAAATTTCGAAAATTCTTTCTTCCACATTGAAATATTGAATAACCAGATTGATTTTATAAGATGAAAATGAAAACAAAGAATTGGTGCATCTCAATACCAATTATTTCAGTGAAGTCATTGTTTGGCTTTCTAG
TTGTCAATCTCTCTACTCCTAGTTACCTCCCTTGCTGGTTTTCTTCTCACCAATGGAGGTAAATATTTATTTATTTTTTTATACAAGAAACAACTTTTCATTAAAGAAATAAACAGAGCATTTTCCTGATGAGGTACAAACTCTCACGGGAGTGAAAAAGAATAGAGAAAATAGGGATACAAAGGAAAGTAAAAATGAAGGTAAATAAATTTCAGATCTAAATCCATTTGATATCGAAAGTGGGTAAAAACTAAGAAACACAAACACTTTAAGACCTGCTTGGATTGACTAGAGGAAAAAGTGTTTTTCAAAAAATTTGTTTTTATTTAAACTCTTTTGACAAAATGGATTTAAAATACACTTCAAAAGCTATTTTGTGTGATTGCCAAACACTCCAATTTTTTTCAAAATGACATTTTTTTTTAAATTAAACAATTGAAAATGTATTCCAAACACACCCTTAGTTTGGCTAATGTGTTTGTGCTCAACACATGTCGGACATTTGAACACTTTGACACTTGTTGAGACGTATCAAAACACTTGTTAGTGCAATAGATGTGTTAAACACTATCGCACTAACAAGTATCGAAGACTTGTTAAGTATACTAAATAGATACATATATGACTAGAATAATAAATTTTGAGTGAAAATCTTCTCGTATTTGCCTTTGTCCTTCCTCGTCCCAATGATTCCGTTTCGACCCTCCTCCTTCTTACCTTAGCCTCTTCTTTGCCGCTTCCATCTTGGCCGCCGCCACCGTCCTCCGCTGCTTCCCGACCAAAGCCTTGACCTCCATTTTCCATCCCGTCTTTGGTCGCCATAGACTTGTCCGAGCACCTCTTCTTGCCGTTTTTGGTCCTCCGATTGTCTTTTTTTTTTTTTTGACCTCTTCTTTTTGCCGCTTAATTTACCACAACCATTTTCGTTTGACGCTCAGCTACTGTTTACATGGATCCTTTGAGTTGTATTAGTAACCTCTATTTCAACATTTGGATTGAAGATGATATGGGTTTTGTTGAAGATTTGAAGAACAAAATGAAGATTCCCCTCACATTCACTCATTTGATTTGGAATGAAAATTCTTTGGCTGAAATGTTTATTTTTTCGTAGGAAATTCAAGGATGCTTCTGGCACTATCTGCTTGGCCAAGTTTAGATCTCCCCTTGGTTGGAAACTTGAATGTGTTGTTTGGCCACCTATTGGAGGTAGAATGAGATTATTTGTTCTCGTTGGGGTTGAAAAGAAAGGTTGGTCCATTTTTTGGGGCATGTTATAAGATTTTCTCTTAAAAGTTGAGAGAATTCAGCATTCACCTTATATTTCTTCAACGGATGTTGAGGTAGGAGATGTATTGCCCTTGGAGCTTGTCAAAGATAGTCATCTTCACAACAATATTATGGGAGTGGTACGTATGCAGAGTTTGGATGCTTCTAATTTTTGGGTGAGGAAAGAAAAAGAGGTAGTGGAAATGAAATTCAATTCTCTTTTGGTGGTTTCAAAACTGTTAGCTCATAACTCTTGGTCGGACATTCATGATACTTCAGAAGTTTATTTTCAATCTAAGATTTCTATTAACCCATTTATGGCATATGAAGCGTTATTGAAATTGAGTGATGATTTCACTGATTTGGATTTTGATGGTAAATGGAGGCAAATTGGTAATTTTCACCTAAAAATTGAGGAAAGGTTATTAAGGGTTATGGTGGTTCGATTTTATTGAAGAACTTACCAATGCCTCACTGGAAACATTCTACTTTTGAAGCAATTGGTCATCATTTTGTAGGCTTAGTCGGTATTTCTTCTTAGACTCCTAATCTTCTTGATTGCACAACTACTATATTTGAAGTGGAGGAAAACAATTGTGGTTTCATTCCAGTTGAGATTGAGATAACAGATTGGAAGTTGACAGTTTTCTCTTCGTTTTGGGGATATACATTCTCTAGTTCCGCCTTGTATTGATCATGGGGATTTGTCAATTTTTTATTATTCAAAACTCATT
GGATCTTTAGATAATAAACCAAATTATGTTGGATGAAGGTTTTAGGTTTACCTTTTGCAACTTTTAAGGGTAAAAATGTTATCTCGAACTCTTCTAATTTGCATGTTCAATGACAAAATTCAGATAGTCTCCTTCCCCATGATGACCTTTAGGTTGCACACCTTTCTTCAGAATTTGGTCAGTTGTCCTAGCCATCTTGTCAGGTTTCCAATTCTACAATTTAGGCCCCTATCTTGAATTCTGGGCTTCTTGAACCTCACCCTCTTTTTTATGCTCAGAAGTTCACTAGCGCTGGTATCCTTGAAAACTCAAAGTTCAATGTAAATATTTTGGAAGAAACTTGTGTTCGGATCACTTCCTCATCGGAGTTATAGCTGCCTCCTTCACAACCTACCTCTTCATAGACTCCTTCAGAGTTCACTCCTTTAGGATCAAATGTTACTTTTACAAAAGGTTCCTTGTGGACTTCCACTACTAAGACATCTAACTCCACTCATTTGGTCATGGTGGTGAAGCCGATGTAGCATCTGAGGTTAGCATGAATGGTGAGGAGGTCGAAGAGCAGTTAGTTGATATCAAGATCGATGATCTTCCTATTGTGGACTCTTTAAGAAGCTTTTGAATATCTTTTTGCAATTGATGATAGTAAGATTTCGAATGAAATTTCGCAATTATCAACATCAGTGAATAAGTGTTCTTCAATTCCTCACTAATTTCATTTTTTAAATGAGGTTTGTGGTTTATATATGAAAGAGATTAGACCTCTGTCGCCACAAATTAATATTTAATTGTGCCCTAAAAATGGGTGGGTAGTTCTTCTTGGTTTTTCAAAGGCTTTGGAAATTGTTTATTAAAGTTGATTTGAAGATCTTTTTCAATGCCGTTTTGTCGTTCGTAGCTATTCTGTTATTGCACCTGTTGAAGCCTGTAGGGTTCCTATTTTGTTTGTTTGGAAGACCCTTGTTCTCTAGTTTTCCTTGTCTTCTCCCTTCTTAGTTTTCTTACGAGAGTTCTAGGTTCTTGGCCTTTATTTTGATATTTTGTTTTCCGTGCATTTCACTTGTTGTCCTTCTTTTCGTTAGTGGAGATTTGTATTTTTGAGTGTTAGTCTCCTTTTTTATATCAGTGAATCTTCTTGTTGCCTGTTTTAAAAAAGAAAAAAAGATTATTTTTTAGTGAAATACATCAAACTAATTTTTTTAAGCATATAAATGCATGAACTCATTGATTTTAAATTTTCTTCTCGTACAAAAATGATTTGTATTTTAAAAAATGTATATTTAATAAATGTGTCCTTTCCATGTTCATGTCTTAGATTTTGAAAAAATGATGTGTTGGTTGCCACGTCGTATCCATGTCTCTATCCATATCCGTGCTCCTTAGGGTACAGATTGCATATTCTTAGAGCATTTGTTTAGGTTGTATAGTAGGTAATGCCTAGAAATGGTTGCTTTCCATGTTTATTTCCACGTTTACTTTCATGTGTTCTTGTACGTTTCTTATTAAAAGGATTTATGAGTTTGAGTCCTTTCTACTCCTCGTGGTTTCTTTAAATATATTTACACATCTGAATACGAACATTCTGTTTTCATGCATTACCAACTAGGTGTATATCACATCTATTCCAACCACTCTTTAATATTATAGGATTGGATATGGTCAGTAGGCTGCAACCCGTTTTCCATTACTTCGCGTGGATGGCATGTTTCGCAGTTTCAGTCTGATACTATTGCTCGAAGTTATATCATCACAGCCCTCGAAGAATCAATACAGCGAGTCAATTCAGCTATTCATCTTTTACTAATGGAGCGTACCAGTATCCTTGCCGGCTCTTGGAATACAAATGTTAAAATGGCTTATCACTAAGATTTTTTTTTTTTTCATGCTAAACTTATTTGGTATTTAAAATGCACTGTTATTGTTTTAAATTAAAGTTCTGTGGGTGTAAAACCTTTACAAAAATTAGCTGAAAAGTCCTTCAAGCTCTTCCTGTCACAGGA
GCGTGAGCTTGTAAAAAAGCATCAGTATGTTGTTAGCCTGTGGAGAAGAGTAAGTGCCTTAAACTTCTCCAAAAATTCAGTGGTTCTAACATTCTTCCTGGCTTACGAACAACTGAGATCATGATTATGTATAATCTCTATATTCTCAGATCTCAACCGTTTCTGGAGAGTTGCGGTATATTGATGCAGTAAGATTGTTGCACACCCTAAATGAGGCATCTAAAGGGTAAGGCTCTTCAAATCATTGCTTTTATTTTATGCAAACGTTTATTGTTTAAATGTTTTTTTTTAACCTTCAAACCACGTTTCTTAATGGTTGAAAAGTAATATTAGGATTGGTTATCTACATTCTTTTCATAATCACTTGTGTATGATATGAAATAGTATGCCAACACAAATAAATTTTGGTGTCATTTACTGAACATTTGCCTCCGTCGTTTAAGTACAAAAAAACACACAATACTTTCTATGCAACTCATTGTTTCTCGAATGTAAAAAAGTTTCTCTTAGGGGGTGTTTGGCCCACCGATTTCATCAGTTTGTTAAGTAACTCAACTCTACCAACTTCAACAGTGTTGCACACTTATCGAGGAACTCCATCTTCCCTTCTTTCTCTCTCCCTCCCTCTAATATTTTCCAGCTCCACACTAAACAACTCTGTACCCCAGACATAGACTTCCCAACTCTAACATATTAACTCCGGACTACAGAACTCAACTCAGTGCCCCAAACAGCCTGTAATATTTTTTATTTTGTATGCAACAGAAGTGCATGGTACTCATGTTTGTATCAATCGGCTTCTACTTTATTTATGGCATTTGTTAAGATCTCTTCAAGAAAATTCCTTTTGTATAATATTGGGAAAGAAAAACAAAGATGTGGCCATGATTCTGAATCTCTTTGGGACCTAAACTTGCGGGATATTGTGTCCACCATGATTTCCACACCTTTTGTATTATTATTATTTTTATTCATCATTTTTAACTTACATTGATGTTGAACAAAAAATAAATAGTAAATAATCTTGCTGACCTTTTATGGTCATTATGCAGAGGGAGTAATTGAAATTTAATATGGATTGCATATTCACTTTTGTATGTTTTGTAGGTTCGCAGATCAAGTAAACACGACATTAGCTCTTCTTCATCCCATTCACTGCTCACGAGAGAGAAAAGTAGAGGTTGTGTTTGATGGAACAACCATTCCTGCCTTCATGGTTGTTTTGGGCCTTCTCTACGTTCTTCTAAGACCAAGGCGTCCGAAGCCAAAAATTAACTGA

mRNA sequence

ATGGGTCATGATCGTGGATCTTCTACAGTCGCTGCTGCTAAACTCTTCAGCTTATCTGGGAGATTCACGATTCCCAAGCGTTTGCACCTACTCTGTATTGTTTTGCTGCTATTAGCAGCAAGACCGTTTGCATCCTCCTCTGGAAATCGTAAAAGTGGAAAGTCCTCTGTATTTTCCTTGTTTAACCTTAAAGATAAGAGTAGATTTTGGAGTGAGACAGTCATACGTGGTGATTTTGATGATTTGGAATCATCCACCACTGAGAAATTGAGTGTTGTTAACTACACGAAGGCAGAATTCAAGCTGCATCCAGAAGAGCTTGAACGTTGGTTCATGAAACCTGATCATATCTTTGAACATACACGGATTCCGCAAGTTAGCGAGGTGCTAACCCCTTTTTATAATATCAGCATGGACAAAGTTTTGAGGCACCAACTACCCCTCGTCAGTCACATAAACTACAATTTTTCTGTTCATGTAATACAAACGGGCGAGAAGGTTACTTCAATCTTTGAGCTTGCAAGAAATGTCTTATCTCGCAAAGAAGACGTATCCAATAATGGGGATGGGAATGATGCTTTTTGGCAAGTAGACGTGGACCTGATGGATGTACTTTTCACTAGCTTTGTGGAGTACCTTCAACTTGAAAATGCTTATAACATTTTTATTCTAAATCTCAAGCGTGACACAAAAAGGGCCAGATATGGATACCGGAAAGGTTTATCTGAATCGGAGATAAATTTTCTTAAAGAGAACGCACACTTGCAATCAAGAATTCTTCAATCAGAAAGTACACCAGAAACTATTCTTGCTCTTGAGAAGATTAAAAGGCCCTTATATGAAAAGCATCCCATGAGTAAGTTTGCATGGACAATAGCTGAAGACACTGATACTATGGAATGGTACAACATCTGCCAAGATGCCCTAAGAAAAGTTAATGAATCGTATCAAGGAAAAGAGACAGCTGATATCATTCAAAACAAAGTTTTGCAGATATTGAAGGGGAAAGATAGAGAGATGAGGCTTCGTCTTGATAAGGAACTAAAATCTTTTGATTTCAGTGGTTTCCATGCTGAATGTCTCACAGACACATGGATTGGCAATGACAGGTGGGCATTTATTGATTTAAACGCAGGCCCTTTTTCATGGGGTCCCACTGTTGGTGGTGAAGGTGTGCGAACTGAGCTAAGCCTACCAAATGTTGAAAAGACGGTTGGTGCTGTTCAAGAAATCTCAGAAGATGAAGCTGAAGATCGCCTGCAAGATGCTATTCAGGAGAAATTTGCTGTTTTTGGTGATAAAGATCATCAAGCCATTGATATTCTTTTAGCAGAGATTGACATATATGAGCTTTTTGCTTTCAAACATTGCAAGGGAAGGAAAGTCAAACTTGCTCTTTGTGAAGAACTTGATGAGAGGATGCGGGACTTAAAAAATGAGCTTCAGTCATTTGATGGTGAAGAATATGATGAAGATCATAAGAGGAAGGCCATAGATGCATTAAAACGGATGGAAAATTGGAATTTATTTAGTGATACGTACGAGGAGTTCCAAAACTACACTGTAGCACGTGATACTTTTCTGGCTCACCTAGGTGCTACTCTCTGGGGGTCAATGAGACATATTATATCACCTTCACTTTCTGACGGGTCATTCCATTATTTTGAGAAAATATCATTTCAATTGTTTTTCATCACACAGGAGAAAGTTAGACAAATTAAACAATTGCCCGTGGATCTTAAAGCTCTAATGGATGGGCTCTCCTCTTTGTTGTTACCTTCACAGAAAGCACTATTTAGTCAGACCATGTTACCACTCTCAGAGGATCCTGCTTTGGCGATGGCCTTCTCAGTGGCACGACGTGCAGCAGCTGTTCCACTGTTGCTTGTTAATGGAACATATCGGAAAACAATTCGTACCTATCTCGATTCATCTATACTCCAGTATCAATTGCAGAGATTGGATCATTCCCTTAAAGGAACAAATGCTCCTCATAGTTC
TACACTGGAAGTTCCAATATTTTGGTTCATTCACACAGAACCCTTATTAGTTGACAAACATTATCAGGCAAAGGCACTCTCCGACATGGTTATTGTAGTACAGTCAGAGATTTCATCCTGGGAAAGCCATTTGCAGTGCAATGGGAAATCACTTATATGGGATATGAGGAAGCCAATCAAAGCTGCTCTTTCTGCTGCTGCAGAACATCTTTCCGGTCTGCTCCCTCTCCATCTTGCATACAGTCCATCGCATGATACAGCAGTTGAGTTTCAGTCTGATACTATTGCTCGAAGTTATATCATCACAGCCCTCGAAGAATCAATACAGCGAGTCAATTCAGCTATTCATCTTTTACTAATGGAGCGTACCACTGAAAAGTCCTTCAAGCTCTTCCTGTCACAGGAGCGTGAGCTTGTAAAAAAGCATCAGTATGTTGTTAGCCTGTGGAGAAGAATCTCAACCGTTTCTGGAGAGTTGCGGTATATTGATGCAGTAAGATTGTTGCACACCCTAAATGAGGCATCTAAAGGGTTCGCAGATCAAGTAAACACGACATTAGCTCTTCTTCATCCCATTCACTGCTCACGAGAGAGAAAAGTAGAGGTTGTGTTTGATGGAACAACCATTCCTGCCTTCATGGTTGTTTTGGGCCTTCTCTACGTTCTTCTAAGACCAAGGCGTCCGAAGCCAAAAATTAACTGA

Coding sequence (CDS)

ATGGGTCATGATCGTGGATCTTCTACAGTCGCTGCTGCTAAACTCTTCAGCTTATCTGGGAGATTCACGATTCCCAAGCGTTTGCACCTACTCTGTATTGTTTTGCTGCTATTAGCAGCAAGACCGTTTGCATCCTCCTCTGGAAATCGTAAAAGTGGAAAGTCCTCTGTATTTTCCTTGTTTAACCTTAAAGATAAGAGTAGATTTTGGAGTGAGACAGTCATACGTGGTGATTTTGATGATTTGGAATCATCCACCACTGAGAAATTGAGTGTTGTTAACTACACGAAGGCAGAATTCAAGCTGCATCCAGAAGAGCTTGAACGTTGGTTCATGAAACCTGATCATATCTTTGAACATACACGGATTCCGCAAGTTAGCGAGGTGCTAACCCCTTTTTATAATATCAGCATGGACAAAGTTTTGAGGCACCAACTACCCCTCGTCAGTCACATAAACTACAATTTTTCTGTTCATGTAATACAAACGGGCGAGAAGGTTACTTCAATCTTTGAGCTTGCAAGAAATGTCTTATCTCGCAAAGAAGACGTATCCAATAATGGGGATGGGAATGATGCTTTTTGGCAAGTAGACGTGGACCTGATGGATGTACTTTTCACTAGCTTTGTGGAGTACCTTCAACTTGAAAATGCTTATAACATTTTTATTCTAAATCTCAAGCGTGACACAAAAAGGGCCAGATATGGATACCGGAAAGGTTTATCTGAATCGGAGATAAATTTTCTTAAAGAGAACGCACACTTGCAATCAAGAATTCTTCAATCAGAAAGTACACCAGAAACTATTCTTGCTCTTGAGAAGATTAAAAGGCCCTTATATGAAAAGCATCCCATGAGTAAGTTTGCATGGACAATAGCTGAAGACACTGATACTATGGAATGGTACAACATCTGCCAAGATGCCCTAAGAAAAGTTAATGAATCGTATCAAGGAAAAGAGACAGCTGATATCATTCAAAACAAAGTTTTGCAGATATTGAAGGGGAAAGATAGAGAGATGAGGCTTCGTCTTGATAAGGAACTAAAATCTTTTGATTTCAGTGGTTTCCATGCTGAATGTCTCACAGACACATGGATTGGCAATGACAGGTGGGCATTTATTGATTTAAACGCAGGCCCTTTTTCATGGGGTCCCACTGTTGGTGGTGAAGGTGTGCGAACTGAGCTAAGCCTACCAAATGTTGAAAAGACGGTTGGTGCTGTTCAAGAAATCTCAGAAGATGAAGCTGAAGATCGCCTGCAAGATGCTATTCAGGAGAAATTTGCTGTTTTTGGTGATAAAGATCATCAAGCCATTGATATTCTTTTAGCAGAGATTGACATATATGAGCTTTTTGCTTTCAAACATTGCAAGGGAAGGAAAGTCAAACTTGCTCTTTGTGAAGAACTTGATGAGAGGATGCGGGACTTAAAAAATGAGCTTCAGTCATTTGATGGTGAAGAATATGATGAAGATCATAAGAGGAAGGCCATAGATGCATTAAAACGGATGGAAAATTGGAATTTATTTAGTGATACGTACGAGGAGTTCCAAAACTACACTGTAGCACGTGATACTTTTCTGGCTCACCTAGGTGCTACTCTCTGGGGGTCAATGAGACATATTATATCACCTTCACTTTCTGACGGGTCATTCCATTATTTTGAGAAAATATCATTTCAATTGTTTTTCATCACACAGGAGAAAGTTAGACAAATTAAACAATTGCCCGTGGATCTTAAAGCTCTAATGGATGGGCTCTCCTCTTTGTTGTTACCTTCACAGAAAGCACTATTTAGTCAGACCATGTTACCACTCTCAGAGGATCCTGCTTTGGCGATGGCCTTCTCAGTGGCACGACGTGCAGCAGCTGTTCCACTGTTGCTTGTTAATGGAACATATCGGAAAACAATTCGTACCTATCTCGATTCATCTATACTCCAGTATCAATTGCAGAGATTGGATCATTCCCTTAAAGGAACAAATGCTCCTCATAGTTC
TACACTGGAAGTTCCAATATTTTGGTTCATTCACACAGAACCCTTATTAGTTGACAAACATTATCAGGCAAAGGCACTCTCCGACATGGTTATTGTAGTACAGTCAGAGATTTCATCCTGGGAAAGCCATTTGCAGTGCAATGGGAAATCACTTATATGGGATATGAGGAAGCCAATCAAAGCTGCTCTTTCTGCTGCTGCAGAACATCTTTCCGGTCTGCTCCCTCTCCATCTTGCATACAGTCCATCGCATGATACAGCAGTTGAGTTTCAGTCTGATACTATTGCTCGAAGTTATATCATCACAGCCCTCGAAGAATCAATACAGCGAGTCAATTCAGCTATTCATCTTTTACTAATGGAGCGTACCACTGAAAAGTCCTTCAAGCTCTTCCTGTCACAGGAGCGTGAGCTTGTAAAAAAGCATCAGTATGTTGTTAGCCTGTGGAGAAGAATCTCAACCGTTTCTGGAGAGTTGCGGTATATTGATGCAGTAAGATTGTTGCACACCCTAAATGAGGCATCTAAAGGGTTCGCAGATCAAGTAAACACGACATTAGCTCTTCTTCATCCCATTCACTGCTCACGAGAGAGAAAAGTAGAGGTTGTGTTTGATGGAACAACCATTCCTGCCTTCATGGTTGTTTTGGGCCTTCTCTACGTTCTTCTAAGACCAAGGCGTCCGAAGCCAAAAATTAACTGA

Protein sequence

MGHDRGSSTVAAAKLFSLSGRFTIPKRLHLLCIVLLLLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSTTEKLSVVNYTKAEFKLHPEELERWFMKPDHIFEHTRIPQVSEVLTPFYNISMDKVLRHQLPLVSHINYNFSVHVIQTGEKVTSIFELARNVLSRKEDVSNNGDGNDAFWQVDVDLMDVLFTSFVEYLQLENAYNIFILNLKRDTKRARYGYRKGLSESEINFLKENAHLQSRILQSESTPETILALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADIIQNKVLQILKGKDREMRLRLDKELKSFDFSGFHAECLTDTWIGNDRWAFIDLNAGPFSWGPTVGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKQLPVDLKALMDGLSSLLLPSQKALFSQTMLPLSEDPALAMAFSVARRAAAVPLLLVNGTYRKTIRTYLDSSILQYQLQRLDHSLKGTNAPHSSTLEVPIFWFIHTEPLLVDKHYQAKALSDMVIVVQSEISSWESHLQCNGKSLIWDMRKPIKAALSAAAEHLSGLLPLHLAYSPSHDTAVEFQSDTIARSYIITALEESIQRVNSAIHLLLMERTTEKSFKLFLSQERELVKKHQYVVSLWRRISTVSGELRYIDAVRLLHTLNEASKGFADQVNTTLALLHPIHCSRERKVEVVFDGTTIPAFMVVLGLLYVLLRPRRPKPKIN
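
The protein sequence above should correspond to the translation of the coding sequence (CDS) listed before it. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration and are not part of the database record.

```python
# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace (for example line breaks inside a pasted sequence) is removed
    first; codons containing ambiguous bases are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":  # stop codon ends the polypeptide
            break
        protein.append(residue)
    return "".join(protein)


# First 36 bases of the CDS listed above.
print(translate("ATGGGTCATGATCGTGGATCTTCTACAGTCGCTGCT"))  # MGHDRGSSTVAA
```

Passing the full CDS to `translate` would be expected to reproduce the protein sequence shown above, ending at the terminal `TGA` stop codon.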
Homology
BLAST of HG10007663 vs. NCBI nr
Match: XP_038880657.1 (uncharacterized protein LOC120072284 isoform X2 [Benincasa hispida])

HSP 1 Score: 1677.1 bits (4342), Expect = 0.0e+00
Identity = 869/956 (90.90%), Positives = 878/956 (91.84%), Query Frame = 0

Query: 1   MGHDRGSSTVAAAKLFSLSGRFTIPKRLHLLCIVLLLLAARPFASSSGNRKSGKSSVFSL 60
           MGHDRGSSTVAAAK FS SGRFTIP RL L CIVLLLLAARPFASSSGNRKSGKSSVFSL
Sbjct: 2   MGHDRGSSTVAAAKFFSFSGRFTIPMRLQLFCIVLLLLAARPFASSSGNRKSGKSSVFSL 61

Query: 61  FNLKDKSRFWSETVIRGDFDDLESSTTEKLSVVNYTKA---------------------- 120
           FNLKDKSRFWSETVIRGDFDDLESSTTEK+SVVNYTKA                      
Sbjct: 62  FNLKDKSRFWSETVIRGDFDDLESSTTEKMSVVNYTKAGNVANYLKLLEVDSLYLPVPVN 121

Query: 121 ------------EFKLHPEELERWFMKPDHIFEHTRIPQVSEVLTPFYNISMDKVLRHQL 180
                       EFKLHPEELERWFMK DHIFEHTRIPQV EVLTPFY IS+DKVLRHQL
Sbjct: 122 FIFIGFEGKGNHEFKLHPEELERWFMKLDHIFEHTRIPQVREVLTPFYKISVDKVLRHQL 181

Query: 181 PLVSHINYNFSVHVIQTGEKVTSIFELARNVLSRKEDVSNNGDGNDAFWQVDVDLMDVLF 240
           PLVSHINYNFSVHVIQTGEKVTSIFELARNVLSRK+DVSNNGD N A WQVDVDLMDVLF
Sbjct: 182 PLVSHINYNFSVHVIQTGEKVTSIFELARNVLSRKDDVSNNGDEN-ALWQVDVDLMDVLF 241

Query: 241 TSFVEYLQLENAYNIFILNLKRDTKRARYGYRKGLSESEINFLKENAHLQSRILQSESTP 300
           TSFVEYLQLENAYNIFILNLKRDTKRARYGYRKGLSESEINFLKEN HLQSRILQSE+ P
Sbjct: 242 TSFVEYLQLENAYNIFILNLKRDTKRARYGYRKGLSESEINFLKENTHLQSRILQSETIP 301

Query: 301 ETILALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADIIQ 360
           ETILALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADIIQ
Sbjct: 302 ETILALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADIIQ 361

Query: 361 NKVLQILKGKDREMRLRLDKELKSFDFSGFHAECLTDTWIGNDRWAFIDLNAGPFSWGPT 420
           NKVLQILKGKDREMRLRLDKELKSFDFSGF AECLTDTWIGNDRWAFIDLNAGPFSWGP 
Sbjct: 362 NKVLQILKGKDREMRLRLDKELKSFDFSGFDAECLTDTWIGNDRWAFIDLNAGPFSWGPA 421

Query: 421 VGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEI 480
           VGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEI
Sbjct: 422 VGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEI 481

Query: 481 DIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMEN 540
           DIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMEN
Sbjct: 482 DIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMEN 541

Query: 541 WNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFIT 600
           WNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDG FHYFEKISFQLFFIT
Sbjct: 542 WNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGPFHYFEKISFQLFFIT 601

Query: 601 QEKVRQIKQLPVDLKALMDGLSSLLLPSQKALFSQTMLPLSEDPALAMAFSVARRAAAVP 660
           QEK R IKQLPVDLKA+MDGLSSLLLPSQKALFSQTMLPLSEDPALAMAFSVARRAAAVP
Sbjct: 602 QEKARHIKQLPVDLKAIMDGLSSLLLPSQKALFSQTMLPLSEDPALAMAFSVARRAAAVP 661

Query: 661 LLLVNGTYRKTIRTYLDSSILQYQLQRLDHSLKGTNAPHSSTLEVPIFWFIHTEPLLVDK 720
           LLLVNGTYRKTIR+YLDSSILQYQLQRLDHSLKGTNAP SSTLEVPIFWFIHTEPLLVDK
Sbjct: 662 LLLVNGTYRKTIRSYLDSSILQYQLQRLDHSLKGTNAPQSSTLEVPIFWFIHTEPLLVDK 721

Query: 721 HYQAKALSDMVIVVQSEISSWESHLQCNGKSLIWDMRKPIKAALSAAAEHLSGLLPLHLA 780
           HYQAKALSDMVIVVQSEISSWESHLQCNGKSLIWDMRKPIKAALSA AEHLSGLLPLHLA
Sbjct: 722 HYQAKALSDMVIVVQSEISSWESHLQCNGKSLIWDMRKPIKAALSATAEHLSGLLPLHLA 781

Query: 781 YSPSHDTAVE----------------------FQSDTIARSYIITALEESIQRVNSAIHL 840
           YS SHDTAVE                      FQSDTIARSYIITALEESIQ+VNSAIHL
Sbjct: 782 YSASHDTAVEDWIWSVGCNPFSITSRGWHVSKFQSDTIARSYIITALEESIQQVNSAIHL 841

Query: 841 LLMERTTEKSFKLFLSQERELVKKHQYVVSLWRRISTVSGELRYIDAVRLLHTLNEASKG 900
           LLMERTTEKSFKLFLSQERELVKKHQYVVSLWRRISTVSGELRYIDAVRLLH LNEASKG
Sbjct: 842 LLMERTTEKSFKLFLSQERELVKKHQYVVSLWRRISTVSGELRYIDAVRLLHVLNEASKG 901
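
The identity and positives percentages reported for each HSP are the match counts divided by the alignment length. A minimal sketch of that arithmetic, using the counts reported for the hit above; the function name `percent` is a hypothetical helper, not part of any BLAST tool.

```python
def percent(matches: int, aligned_length: int) -> str:
    """Format matches / aligned_length as a percentage with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return f"{100 * matches / aligned_length:.2f}%"


# Counts taken from the HSP summary line of the hit above.
print(percent(869, 956))  # identity: 90.90%
print(percent(878, 956))  # positives: 91.84%
```

Note that the denominator is the alignment length including gaps, so the same number of identical residues gives a lower percentage in a longer, gapped alignment.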

BLAST of HG10007663 vs. NCBI nr
Match: XP_004139093.1 (uncharacterized protein LOC101207480 isoform X1 [Cucumis sativus] >KAE8653558.1 hypothetical protein Csa_007415 [Cucumis sativus])

HSP 1 Score: 1670.2 bits (4324), Expect = 0.0e+00
Identity = 862/956 (90.17%), Positives = 876/956 (91.63%), Query Frame = 0

Query: 1   MGHDRGSSTVAAAKLFSLSGRFTIPKRLHLLCIVLLLLAARPFASSSGNRKSGKSSVFSL 60
           MGH RGSSTVAAAKLFSLSGRFTI  RL LLC+VLLLLAARP ASSSGNRKS KSSVFSL
Sbjct: 2   MGHGRGSSTVAAAKLFSLSGRFTISMRLQLLCLVLLLLAARPLASSSGNRKSRKSSVFSL 61

Query: 61  FNLKDKSRFWSETVIRGDFDDLESSTTEKLSVVNYTKA---------------------- 120
           FNLKDKS+FWSETVIRGDFDDLESSTTEK+SVVNYTKA                      
Sbjct: 62  FNLKDKSKFWSETVIRGDFDDLESSTTEKMSVVNYTKAGNVANYLKLLEVDSLYLPVPVN 121

Query: 121 ------------EFKLHPEELERWFMKPDHIFEHTRIPQVSEVLTPFYNISMDKVLRHQL 180
                       EFKLHPEELERWF+K DHIFEHTRIPQ  EVLTPFY +SMDKVLRHQL
Sbjct: 122 FIFIGFEGKGNHEFKLHPEELERWFIKLDHIFEHTRIPQFREVLTPFYKMSMDKVLRHQL 181

Query: 181 PLVSHINYNFSVHVIQTGEKVTSIFELARNVLSRKEDVSNNGDGNDAFWQVDVDLMDVLF 240
           PL+SH NYNFSVHVIQTGEKVTSIFELARNVLSRKEDVSNNGDGNDA WQVDVDLMDVLF
Sbjct: 182 PLISHTNYNFSVHVIQTGEKVTSIFELARNVLSRKEDVSNNGDGNDALWQVDVDLMDVLF 241

Query: 241 TSFVEYLQLENAYNIFILNLKRDTKRARYGYRKGLSESEINFLKENAHLQSRILQSESTP 300
           TSFVEYLQLENAYNIFILNLKRDTKRARYGYRKGLSESEINFLKENAHL SRILQSESTP
Sbjct: 242 TSFVEYLQLENAYNIFILNLKRDTKRARYGYRKGLSESEINFLKENAHLHSRILQSESTP 301

Query: 301 ETILALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADIIQ 360
           ET LALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADII 
Sbjct: 302 ETNLALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADIIH 361

Query: 361 NKVLQILKGKDREMRLRLDKELKSFDFSGFHAECLTDTWIGNDRWAFIDLNAGPFSWGPT 420
           NKVLQILKGKDREMRL LDKE KSFDFSGFHAECLTDTWIG+DRWAFIDLNAGPFSWGP 
Sbjct: 362 NKVLQILKGKDREMRLSLDKESKSFDFSGFHAECLTDTWIGDDRWAFIDLNAGPFSWGPA 421

Query: 421 VGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEI 480
           VGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEI
Sbjct: 422 VGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEI 481

Query: 481 DIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMEN 540
           DIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMEN
Sbjct: 482 DIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMEN 541

Query: 541 WNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFIT 600
           WNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDG+FHYFEKISFQLFFIT
Sbjct: 542 WNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGAFHYFEKISFQLFFIT 601

Query: 601 QEKVRQIKQLPVDLKALMDGLSSLLLPSQKALFSQTMLPLSEDPALAMAFSVARRAAAVP 660
           QEK R IKQLPVDLKA+ DGLSSLLLPSQK LFSQTMLPLSEDPALAMAFSVARRAAAVP
Sbjct: 602 QEKARNIKQLPVDLKAIKDGLSSLLLPSQKPLFSQTMLPLSEDPALAMAFSVARRAAAVP 661

Query: 661 LLLVNGTYRKTIRTYLDSSILQYQLQRLDHSLKGTNAPHSSTLEVPIFWFIHTEPLLVDK 720
           LLLVNGTYRKTIRTYLDSSILQYQLQRLDHSLKGTNAPHSSTLEVPIFWFIHTEPLLVDK
Sbjct: 662 LLLVNGTYRKTIRTYLDSSILQYQLQRLDHSLKGTNAPHSSTLEVPIFWFIHTEPLLVDK 721

Query: 721 HYQAKALSDMVIVVQSEISSWESHLQCNGKSLIWDMRKPIKAALSAAAEHLSGLLPLHLA 780
           HYQAKALSDMVIVVQSEISSWESHLQCNGKSL+WDMRKPIKAALSA AEHLSGLLPLHLA
Sbjct: 722 HYQAKALSDMVIVVQSEISSWESHLQCNGKSLVWDMRKPIKAALSATAEHLSGLLPLHLA 781

Query: 781 YSPSHDTAVE----------------------FQSDTIARSYIITALEESIQRVNSAIHL 840
           YSPSHDTAVE                      FQSDTIARSYIITALEESIQRVNSAIHL
Sbjct: 782 YSPSHDTAVEDWIWSVGCNPFSITSRGWHVSQFQSDTIARSYIITALEESIQRVNSAIHL 841

Query: 841 LLMERTTEKSFKLFLSQERELVKKHQYVVSLWRRISTVSGELRYIDAVRLLHTLNEASKG 900
           LLMERTTEKSFKLFLSQER+LVKKHQYVVSLWRRISTVSGELRYIDAVRLL+TLNEASKG
Sbjct: 842 LLMERTTEKSFKLFLSQERDLVKKHQYVVSLWRRISTVSGELRYIDAVRLLYTLNEASKG 901

BLAST of HG10007663 vs. NCBI nr
Match: XP_038880656.1 (uncharacterized protein LOC120072284 isoform X1 [Benincasa hispida])

HSP 1 Score: 1669.4 bits (4322), Expect = 0.0e+00
Identity = 869/965 (90.05%), Positives = 878/965 (90.98%), Query Frame = 0

Query: 1   MGHDRGSSTVAAAKLFSLSGRFTIPKRLHLLCIVLLLLAARPFASSSGNRKSGKSSVFSL 60
           MGHDRGSSTVAAAK FS SGRFTIP RL L CIVLLLLAARPFASSSGNRKSGKSSVFSL
Sbjct: 2   MGHDRGSSTVAAAKFFSFSGRFTIPMRLQLFCIVLLLLAARPFASSSGNRKSGKSSVFSL 61

Query: 61  FNLKDKSRFWSETVIRGDFDDLESSTTEKLSVVNYTKA---------------------- 120
           FNLKDKSRFWSETVIRGDFDDLESSTTEK+SVVNYTKA                      
Sbjct: 62  FNLKDKSRFWSETVIRGDFDDLESSTTEKMSVVNYTKAGNVANYLKLLEVDSLYLPVPVN 121

Query: 121 ------------EFKLHPEELERWFMKPDHIFEHTRIPQVSEVLTPFYNISMDKVLRHQL 180
                       EFKLHPEELERWFMK DHIFEHTRIPQV EVLTPFY IS+DKVLRHQL
Sbjct: 122 FIFIGFEGKGNHEFKLHPEELERWFMKLDHIFEHTRIPQVREVLTPFYKISVDKVLRHQL 181

Query: 181 PLVSHINYNFSVHVIQTGEKVTSIFELARNVLSRKEDVSNNGDGNDAFWQVDVDLMDVLF 240
           PLVSHINYNFSVHVIQTGEKVTSIFELARNVLSRK+DVSNNGD N A WQVDVDLMDVLF
Sbjct: 182 PLVSHINYNFSVHVIQTGEKVTSIFELARNVLSRKDDVSNNGDEN-ALWQVDVDLMDVLF 241

Query: 241 TSFVEYLQLENAYNIFILNLKRDTKRARYGYRKGLSESEINFLK---------ENAHLQS 300
           TSFVEYLQLENAYNIFILNLKRDTKRARYGYRKGLSESEINFLK         EN HLQS
Sbjct: 242 TSFVEYLQLENAYNIFILNLKRDTKRARYGYRKGLSESEINFLKEDSHRMFQTENTHLQS 301

Query: 301 RILQSESTPETILALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQ 360
           RILQSE+ PETILALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQ
Sbjct: 302 RILQSETIPETILALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQ 361

Query: 361 GKETADIIQNKVLQILKGKDREMRLRLDKELKSFDFSGFHAECLTDTWIGNDRWAFIDLN 420
           GKETADIIQNKVLQILKGKDREMRLRLDKELKSFDFSGF AECLTDTWIGNDRWAFIDLN
Sbjct: 362 GKETADIIQNKVLQILKGKDREMRLRLDKELKSFDFSGFDAECLTDTWIGNDRWAFIDLN 421

Query: 421 AGPFSWGPTVGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQ 480
           AGPFSWGP VGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQ
Sbjct: 422 AGPFSWGPAVGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQ 481

Query: 481 AIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKA 540
           AIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKA
Sbjct: 482 AIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKA 541

Query: 541 IDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEK 600
           IDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDG FHYFEK
Sbjct: 542 IDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGPFHYFEK 601

Query: 601 ISFQLFFITQEKVRQIKQLPVDLKALMDGLSSLLLPSQKALFSQTMLPLSEDPALAMAFS 660
           ISFQLFFITQEK R IKQLPVDLKA+MDGLSSLLLPSQKALFSQTMLPLSEDPALAMAFS
Sbjct: 602 ISFQLFFITQEKARHIKQLPVDLKAIMDGLSSLLLPSQKALFSQTMLPLSEDPALAMAFS 661

Query: 661 VARRAAAVPLLLVNGTYRKTIRTYLDSSILQYQLQRLDHSLKGTNAPHSSTLEVPIFWFI 720
           VARRAAAVPLLLVNGTYRKTIR+YLDSSILQYQLQRLDHSLKGTNAP SSTLEVPIFWFI
Sbjct: 662 VARRAAAVPLLLVNGTYRKTIRSYLDSSILQYQLQRLDHSLKGTNAPQSSTLEVPIFWFI 721

Query: 721 HTEPLLVDKHYQAKALSDMVIVVQSEISSWESHLQCNGKSLIWDMRKPIKAALSAAAEHL 780
           HTEPLLVDKHYQAKALSDMVIVVQSEISSWESHLQCNGKSLIWDMRKPIKAALSA AEHL
Sbjct: 722 HTEPLLVDKHYQAKALSDMVIVVQSEISSWESHLQCNGKSLIWDMRKPIKAALSATAEHL 781

Query: 781 SGLLPLHLAYSPSHDTAVE----------------------FQSDTIARSYIITALEESI 840
           SGLLPLHLAYS SHDTAVE                      FQSDTIARSYIITALEESI
Sbjct: 782 SGLLPLHLAYSASHDTAVEDWIWSVGCNPFSITSRGWHVSKFQSDTIARSYIITALEESI 841

Query: 841 QRVNSAIHLLLMERTTEKSFKLFLSQERELVKKHQYVVSLWRRISTVSGELRYIDAVRLL 900
           Q+VNSAIHLLLMERTTEKSFKLFLSQERELVKKHQYVVSLWRRISTVSGELRYIDAVRLL
Sbjct: 842 QQVNSAIHLLLMERTTEKSFKLFLSQERELVKKHQYVVSLWRRISTVSGELRYIDAVRLL 901
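
In each alignment row, the unlabeled line between Query and Sbjct is the BLAST match line: a letter marks an identical residue, `+` marks a conservative substitution (counted as a positive), and a space marks a mismatch or a gap. A minimal Python sketch of how identities and positives could be tallied from one match line follows; the function name `midline_counts` is hypothetical and is not part of any BLAST tool.

```python
def midline_counts(midline):
    """Return (identities, positives) for one BLAST match line.

    Assumes the 'Query: N ' style prefix has already been stripped, so the
    string holds only the per-column match characters.
    """
    # Letters mark columns where query and subject residues are identical.
    identities = sum(1 for ch in midline if ch.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    conservative = midline.count("+")
    return identities, identities + conservative


# Short example built from the start of one match line on this page.
print(midline_counts("RILQSE+ PETI"))
```

The rows shown on this page may not cover the full HSP, so per-row tallies need not add up to the totals in the Identity and Positives header line.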

BLAST of HG10007663 vs. NCBI nr
Match: XP_008443650.1 (PREDICTED: uncharacterized protein LOC103487197 [Cucumis melo])

HSP 1 Score: 1662.9 bits (4305), Expect = 0.0e+00
Identity = 858/956 (89.75%), Positives = 879/956 (91.95%), Query Frame = 0

Query: 1   MGHDRGSSTVAAAKLFSLSGRFTIPKRLHLLCIVLLLLAARPFASSSGNRKSGKSSVFSL 60
           MGHDRGSSTVAAAKLFSLSGRFTI  RL LL +VLLLLAARPFASSSGNRKS KSSVFSL
Sbjct: 2   MGHDRGSSTVAAAKLFSLSGRFTI-MRLQLLFLVLLLLAARPFASSSGNRKSRKSSVFSL 61

Query: 61  FNLKDKSRFWSETVIRGDFDDLESSTTEKLSVVNYTKA---------------------- 120
           FNLKDKS+FWSETVIRGDFDDLESSTTEK+SVVNYTKA                      
Sbjct: 62  FNLKDKSKFWSETVIRGDFDDLESSTTEKMSVVNYTKAGNVANYLKLLEVDSLYLPVPVN 121

Query: 121 ------------EFKLHPEELERWFMKPDHIFEHTRIPQVSEVLTPFYNISMDKVLRHQL 180
                       EFKLHPEELERWF+K DHIFEHTRIPQV EVLTPFY +SMDKVLRHQL
Sbjct: 122 FIFIGFEGKGNHEFKLHPEELERWFIKLDHIFEHTRIPQVREVLTPFYKMSMDKVLRHQL 181

Query: 181 PLVSHINYNFSVHVIQTGEKVTSIFELARNVLSRKEDVSNNGDGNDAFWQVDVDLMDVLF 240
           PL+SH NYNFSVHVIQTGEKVTSIFELARNVLSRKE VSNNGDGNDA WQVDVDLMDVLF
Sbjct: 182 PLISHTNYNFSVHVIQTGEKVTSIFELARNVLSRKEVVSNNGDGNDALWQVDVDLMDVLF 241

Query: 241 TSFVEYLQLENAYNIFILNLKRDTKRARYGYRKGLSESEINFLKENAHLQSRILQSESTP 300
           TSFVEYLQLENAYNIFILNLKRD++RARYGYRKGLSESEINFLKEN HLQSRILQSESTP
Sbjct: 242 TSFVEYLQLENAYNIFILNLKRDSERARYGYRKGLSESEINFLKENTHLQSRILQSESTP 301

Query: 301 ETILALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADIIQ 360
           ET LALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADIIQ
Sbjct: 302 ETNLALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADIIQ 361

Query: 361 NKVLQILKGKDREMRLRLDKELKSFDFSGFHAECLTDTWIGNDRWAFIDLNAGPFSWGPT 420
           NKVLQILK KDR+MRLRLDKE KSFDFSGFHAECLTDTWIGNDRWAFIDLNAGPFSWGP 
Sbjct: 362 NKVLQILKEKDRDMRLRLDKESKSFDFSGFHAECLTDTWIGNDRWAFIDLNAGPFSWGPA 421

Query: 421 VGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEI 480
           VGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQ+AIQEKFAVFGDKDHQAIDILLAEI
Sbjct: 422 VGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQNAIQEKFAVFGDKDHQAIDILLAEI 481

Query: 481 DIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMEN 540
           DIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMEN
Sbjct: 482 DIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMEN 541

Query: 541 WNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFIT 600
           WNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDG+FHYFEKISFQLFFIT
Sbjct: 542 WNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGAFHYFEKISFQLFFIT 601

Query: 601 QEKVRQIKQLPVDLKALMDGLSSLLLPSQKALFSQTMLPLSEDPALAMAFSVARRAAAVP 660
           QEK R IKQLP+DLKA+MDGLSSLLLPSQK LFSQTMLPLSEDPALAMAFSVARRAAAVP
Sbjct: 602 QEKARNIKQLPIDLKAIMDGLSSLLLPSQKPLFSQTMLPLSEDPALAMAFSVARRAAAVP 661

Query: 661 LLLVNGTYRKTIRTYLDSSILQYQLQRLDHSLKGTNAPHSSTLEVPIFWFIHTEPLLVDK 720
           LLLVNGTYRKTIRTYLDSSILQYQLQRLDHSLKGTNAPHSSTLEVPIFWFIHTEPLLVDK
Sbjct: 662 LLLVNGTYRKTIRTYLDSSILQYQLQRLDHSLKGTNAPHSSTLEVPIFWFIHTEPLLVDK 721

Query: 721 HYQAKALSDMVIVVQSEISSWESHLQCNGKSLIWDMRKPIKAALSAAAEHLSGLLPLHLA 780
           HYQAKALSDMVIVVQSEISSWESHLQCNGKSL+WDMRKPIKAALSA AEHLSGLLPLHLA
Sbjct: 722 HYQAKALSDMVIVVQSEISSWESHLQCNGKSLLWDMRKPIKAALSATAEHLSGLLPLHLA 781

Query: 781 YSPSHDTAVE----------------------FQSDTIARSYIITALEESIQRVNSAIHL 840
           YSPSHDTAVE                      FQSDTIARSYIITALEESI RVNSAIHL
Sbjct: 782 YSPSHDTAVEDWIWSVGCNPFSITSRGWYVSQFQSDTIARSYIITALEESILRVNSAIHL 841

Query: 841 LLMERTTEKSFKLFLSQERELVKKHQYVVSLWRRISTVSGELRYIDAVRLLHTLNEASKG 900
           L+MERTTEKSFKLFLSQER+LVKKHQYVVSLWRRISTVSGELRYIDAVRLL+TLNEASKG
Sbjct: 842 LMMERTTEKSFKLFLSQERDLVKKHQYVVSLWRRISTVSGELRYIDAVRLLYTLNEASKG 901

BLAST of HG10007663 vs. NCBI nr
Match: XP_022157070.1 (uncharacterized protein LOC111023880 [Momordica charantia])

HSP 1 Score: 1605.9 bits (4157), Expect = 0.0e+00
Identity = 827/956 (86.51%), Positives = 863/956 (90.27%), Query Frame = 0

Query: 1   MGHDRGSSTVAAAKLFSLSGRFTIPKRLHLLCIVLLLLAARPFASSSGNRKSGKSSVFSL 60
           MG+ R SSTV  A L S SGRF+IP RL LLCI+LLLLAARP ASSSGNRKSGKSSVFSL
Sbjct: 2   MGYHRRSSTV-TANLCSFSGRFSIPMRLQLLCIILLLLAARPAASSSGNRKSGKSSVFSL 61

Query: 61  FNLKDKSRFWSETVIRGDFDDLESSTTEKLSVVNYTKA---------------------- 120
           FNLKDKSRFWSETVIRGDFDDLESS+TEK+S VNYTKA                      
Sbjct: 62  FNLKDKSRFWSETVIRGDFDDLESSSTEKMSAVNYTKAGNIANHLKLLEVDSLYLPVPVN 121

Query: 121 ------------EFKLHPEELERWFMKPDHIFEHTRIPQVSEVLTPFYNISMDKVLRHQL 180
                       EFKLHPEELERWF+K DHIFEHTRIPQV EVLTPFY IS+DKVLRHQL
Sbjct: 122 FIFIGFEGKGNHEFKLHPEELERWFLKLDHIFEHTRIPQVREVLTPFYKISVDKVLRHQL 181

Query: 181 PLVSHINYNFSVHVIQTGEKVTSIFELARNVLSRKEDVSNNGDGNDAFWQVDVDLMDVLF 240
           PLVSHINYNFSVH IQTGEKVTSIFELARNVL+RKEDVS+NGDG+DA WQVDVDLMDVLF
Sbjct: 182 PLVSHINYNFSVHAIQTGEKVTSIFELARNVLARKEDVSSNGDGDDALWQVDVDLMDVLF 241

Query: 241 TSFVEYLQLENAYNIFILNLKRDTKRARYGYRKGLSESEINFLKENAHLQSRILQSESTP 300
           TSFVEYLQLENAYNIFILNLKRD KRARYGYRKGLSESEINFLKEN HLQS+ILQSESTP
Sbjct: 242 TSFVEYLQLENAYNIFILNLKRDAKRARYGYRKGLSESEINFLKENTHLQSKILQSESTP 301

Query: 301 ETILALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADIIQ 360
           E IL LEKIKRPLYEKHPM+KFAWTIAEDTDTMEWYNICQDALRKV+E YQGKET+DIIQ
Sbjct: 302 EAILVLEKIKRPLYEKHPMTKFAWTIAEDTDTMEWYNICQDALRKVDELYQGKETSDIIQ 361

Query: 361 NKVLQILKGKDREMRLRLDKELKSFDFSGFHAECLTDTWIGNDRWAFIDLNAGPFSWGPT 420
           NKVLQILKGK+REMRLRLDKELKSFDFSGFHAECLTDTWIGNDRWAFIDL+AGPFSWGP 
Sbjct: 362 NKVLQILKGKEREMRLRLDKELKSFDFSGFHAECLTDTWIGNDRWAFIDLSAGPFSWGPA 421

Query: 421 VGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEI 480
           VGGEGVRTELSLPNVE+TVGAVQEISEDEAEDRLQDAIQEKF+VFGDKDHQAIDILLAEI
Sbjct: 422 VGGEGVRTELSLPNVERTVGAVQEISEDEAEDRLQDAIQEKFSVFGDKDHQAIDILLAEI 481

Query: 481 DIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMEN 540
           DIYELFAFK+CKGRKVKLALCEELDERMRDLKNELQSF+GEEYDE+HKRKAIDALKRMEN
Sbjct: 482 DIYELFAFKNCKGRKVKLALCEELDERMRDLKNELQSFEGEEYDENHKRKAIDALKRMEN 541

Query: 541 WNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFIT 600
           WNLFSDTYEEFQNY+VARDTFLAHLG+TLWGSMRHIISPSLSDGSFHYFEK+SFQLFFIT
Sbjct: 542 WNLFSDTYEEFQNYSVARDTFLAHLGSTLWGSMRHIISPSLSDGSFHYFEKVSFQLFFIT 601

Query: 601 QEKVRQIKQLPVDLKALMDGLSSLLLPSQKALFSQTMLPLSEDPALAMAFSVARRAAAVP 660
           QEKVRQIK LPVDLKALMDGLSSLLLPSQKALFSQTMLPLS+DPALAMAFSVARR+AAVP
Sbjct: 602 QEKVRQIKHLPVDLKALMDGLSSLLLPSQKALFSQTMLPLSDDPALAMAFSVARRSAAVP 661

Query: 661 LLLVNGTYRKTIRTYLDSSILQYQLQRLDHSLKGTNAPHSSTLEVPIFWFIHTEPLLVDK 720
           LLLVNGTYRKTIRTYLDSSILQYQLQRLDHS KGTNAP  STLEVPIFWFIH EPLLVDK
Sbjct: 662 LLLVNGTYRKTIRTYLDSSILQYQLQRLDHSHKGTNAPLMSTLEVPIFWFIHAEPLLVDK 721

Query: 721 HYQAKALSDMVIVVQSEISSWESHLQCNGKSLIWDMRKPIKAALSAAAEHLSGLLPLHLA 780
           HYQAKALSDMVIVVQSEISSWESHLQCNGKSL+WDMRKP+KAALSA +EHL GLLPLHLA
Sbjct: 722 HYQAKALSDMVIVVQSEISSWESHLQCNGKSLLWDMRKPVKAALSATSEHLFGLLPLHLA 781

Query: 781 YSPSHDTAVE----------------------FQSDTIARSYIITALEESIQRVNSAIHL 840
           YSPSHDTAVE                      FQSDTIARSYIITALEESIQ VNSAIH 
Sbjct: 782 YSPSHDTAVEDWIWSVGCNPFSITSRGWHVSQFQSDTIARSYIITALEESIQLVNSAIHR 841

Query: 841 LLMERTTEKSFKLFLSQERELVKKHQYVVSLWRRISTVSGELRYIDAVRLLHTLNEASKG 900
           LLMERTTEKSFK F SQ+RELVKKHQYVVSLWRRIS + GE+RYIDA+RLLH L+EASKG
Sbjct: 842 LLMERTTEKSFKPFHSQQRELVKKHQYVVSLWRRISNLIGEMRYIDAIRLLHVLDEASKG 901

BLAST of HG10007663 vs. ExPASy TrEMBL
Match: A0A1S3B823 (uncharacterized protein LOC103487197 OS=Cucumis melo OX=3656 GN=LOC103487197 PE=4 SV=1)

HSP 1 Score: 1662.9 bits (4305), Expect = 0.0e+00
Identity = 858/956 (89.75%), Positives = 879/956 (91.95%), Query Frame = 0

Query: 1   MGHDRGSSTVAAAKLFSLSGRFTIPKRLHLLCIVLLLLAARPFASSSGNRKSGKSSVFSL 60
           MGHDRGSSTVAAAKLFSLSGRFTI  RL LL +VLLLLAARPFASSSGNRKS KSSVFSL
Sbjct: 2   MGHDRGSSTVAAAKLFSLSGRFTI-MRLQLLFLVLLLLAARPFASSSGNRKSRKSSVFSL 61

Query: 61  FNLKDKSRFWSETVIRGDFDDLESSTTEKLSVVNYTKA---------------------- 120
           FNLKDKS+FWSETVIRGDFDDLESSTTEK+SVVNYTKA                      
Sbjct: 62  FNLKDKSKFWSETVIRGDFDDLESSTTEKMSVVNYTKAGNVANYLKLLEVDSLYLPVPVN 121

Query: 121 ------------EFKLHPEELERWFMKPDHIFEHTRIPQVSEVLTPFYNISMDKVLRHQL 180
                       EFKLHPEELERWF+K DHIFEHTRIPQV EVLTPFY +SMDKVLRHQL
Sbjct: 122 FIFIGFEGKGNHEFKLHPEELERWFIKLDHIFEHTRIPQVREVLTPFYKMSMDKVLRHQL 181

Query: 181 PLVSHINYNFSVHVIQTGEKVTSIFELARNVLSRKEDVSNNGDGNDAFWQVDVDLMDVLF 240
           PL+SH NYNFSVHVIQTGEKVTSIFELARNVLSRKE VSNNGDGNDA WQVDVDLMDVLF
Sbjct: 182 PLISHTNYNFSVHVIQTGEKVTSIFELARNVLSRKEVVSNNGDGNDALWQVDVDLMDVLF 241

Query: 241 TSFVEYLQLENAYNIFILNLKRDTKRARYGYRKGLSESEINFLKENAHLQSRILQSESTP 300
           TSFVEYLQLENAYNIFILNLKRD++RARYGYRKGLSESEINFLKEN HLQSRILQSESTP
Sbjct: 242 TSFVEYLQLENAYNIFILNLKRDSERARYGYRKGLSESEINFLKENTHLQSRILQSESTP 301

Query: 301 ETILALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADIIQ 360
           ET LALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADIIQ
Sbjct: 302 ETNLALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADIIQ 361

Query: 361 NKVLQILKGKDREMRLRLDKELKSFDFSGFHAECLTDTWIGNDRWAFIDLNAGPFSWGPT 420
           NKVLQILK KDR+MRLRLDKE KSFDFSGFHAECLTDTWIGNDRWAFIDLNAGPFSWGP 
Sbjct: 362 NKVLQILKEKDRDMRLRLDKESKSFDFSGFHAECLTDTWIGNDRWAFIDLNAGPFSWGPA 421

Query: 421 VGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEI 480
           VGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQ+AIQEKFAVFGDKDHQAIDILLAEI
Sbjct: 422 VGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQNAIQEKFAVFGDKDHQAIDILLAEI 481

Query: 481 DIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMEN 540
           DIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMEN
Sbjct: 482 DIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMEN 541

Query: 541 WNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFIT 600
           WNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDG+FHYFEKISFQLFFIT
Sbjct: 542 WNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGAFHYFEKISFQLFFIT 601

Query: 601 QEKVRQIKQLPVDLKALMDGLSSLLLPSQKALFSQTMLPLSEDPALAMAFSVARRAAAVP 660
           QEK R IKQLP+DLKA+MDGLSSLLLPSQK LFSQTMLPLSEDPALAMAFSVARRAAAVP
Sbjct: 602 QEKARNIKQLPIDLKAIMDGLSSLLLPSQKPLFSQTMLPLSEDPALAMAFSVARRAAAVP 661

Query: 661 LLLVNGTYRKTIRTYLDSSILQYQLQRLDHSLKGTNAPHSSTLEVPIFWFIHTEPLLVDK 720
           LLLVNGTYRKTIRTYLDSSILQYQLQRLDHSLKGTNAPHSSTLEVPIFWFIHTEPLLVDK
Sbjct: 662 LLLVNGTYRKTIRTYLDSSILQYQLQRLDHSLKGTNAPHSSTLEVPIFWFIHTEPLLVDK 721

Query: 721 HYQAKALSDMVIVVQSEISSWESHLQCNGKSLIWDMRKPIKAALSAAAEHLSGLLPLHLA 780
           HYQAKALSDMVIVVQSEISSWESHLQCNGKSL+WDMRKPIKAALSA AEHLSGLLPLHLA
Sbjct: 722 HYQAKALSDMVIVVQSEISSWESHLQCNGKSLLWDMRKPIKAALSATAEHLSGLLPLHLA 781

Query: 781 YSPSHDTAVE----------------------FQSDTIARSYIITALEESIQRVNSAIHL 840
           YSPSHDTAVE                      FQSDTIARSYIITALEESI RVNSAIHL
Sbjct: 782 YSPSHDTAVEDWIWSVGCNPFSITSRGWYVSQFQSDTIARSYIITALEESILRVNSAIHL 841

Query: 841 LLMERTTEKSFKLFLSQERELVKKHQYVVSLWRRISTVSGELRYIDAVRLLHTLNEASKG 900
           L+MERTTEKSFKLFLSQER+LVKKHQYVVSLWRRISTVSGELRYIDAVRLL+TLNEASKG
Sbjct: 842 LMMERTTEKSFKLFLSQERDLVKKHQYVVSLWRRISTVSGELRYIDAVRLLYTLNEASKG 901

BLAST of HG10007663 vs. ExPASy TrEMBL
Match: A0A6J1DS42 (uncharacterized protein LOC111023880 OS=Momordica charantia OX=3673 GN=LOC111023880 PE=4 SV=1)

HSP 1 Score: 1605.9 bits (4157), Expect = 0.0e+00
Identity = 827/956 (86.51%), Positives = 863/956 (90.27%), Query Frame = 0

Query: 1   MGHDRGSSTVAAAKLFSLSGRFTIPKRLHLLCIVLLLLAARPFASSSGNRKSGKSSVFSL 60
           MG+ R SSTV  A L S SGRF+IP RL LLCI+LLLLAARP ASSSGNRKSGKSSVFSL
Sbjct: 2   MGYHRRSSTV-TANLCSFSGRFSIPMRLQLLCIILLLLAARPAASSSGNRKSGKSSVFSL 61

Query: 61  FNLKDKSRFWSETVIRGDFDDLESSTTEKLSVVNYTKA---------------------- 120
           FNLKDKSRFWSETVIRGDFDDLESS+TEK+S VNYTKA                      
Sbjct: 62  FNLKDKSRFWSETVIRGDFDDLESSSTEKMSAVNYTKAGNIANHLKLLEVDSLYLPVPVN 121

Query: 121 ------------EFKLHPEELERWFMKPDHIFEHTRIPQVSEVLTPFYNISMDKVLRHQL 180
                       EFKLHPEELERWF+K DHIFEHTRIPQV EVLTPFY IS+DKVLRHQL
Sbjct: 122 FIFIGFEGKGNHEFKLHPEELERWFLKLDHIFEHTRIPQVREVLTPFYKISVDKVLRHQL 181

Query: 181 PLVSHINYNFSVHVIQTGEKVTSIFELARNVLSRKEDVSNNGDGNDAFWQVDVDLMDVLF 240
           PLVSHINYNFSVH IQTGEKVTSIFELARNVL+RKEDVS+NGDG+DA WQVDVDLMDVLF
Sbjct: 182 PLVSHINYNFSVHAIQTGEKVTSIFELARNVLARKEDVSSNGDGDDALWQVDVDLMDVLF 241

Query: 241 TSFVEYLQLENAYNIFILNLKRDTKRARYGYRKGLSESEINFLKENAHLQSRILQSESTP 300
           TSFVEYLQLENAYNIFILNLKRD KRARYGYRKGLSESEINFLKEN HLQS+ILQSESTP
Sbjct: 242 TSFVEYLQLENAYNIFILNLKRDAKRARYGYRKGLSESEINFLKENTHLQSKILQSESTP 301

Query: 301 ETILALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADIIQ 360
           E IL LEKIKRPLYEKHPM+KFAWTIAEDTDTMEWYNICQDALRKV+E YQGKET+DIIQ
Sbjct: 302 EAILVLEKIKRPLYEKHPMTKFAWTIAEDTDTMEWYNICQDALRKVDELYQGKETSDIIQ 361

Query: 361 NKVLQILKGKDREMRLRLDKELKSFDFSGFHAECLTDTWIGNDRWAFIDLNAGPFSWGPT 420
           NKVLQILKGK+REMRLRLDKELKSFDFSGFHAECLTDTWIGNDRWAFIDL+AGPFSWGP 
Sbjct: 362 NKVLQILKGKEREMRLRLDKELKSFDFSGFHAECLTDTWIGNDRWAFIDLSAGPFSWGPA 421

Query: 421 VGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEI 480
           VGGEGVRTELSLPNVE+TVGAVQEISEDEAEDRLQDAIQEKF+VFGDKDHQAIDILLAEI
Sbjct: 422 VGGEGVRTELSLPNVERTVGAVQEISEDEAEDRLQDAIQEKFSVFGDKDHQAIDILLAEI 481

Query: 481 DIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMEN 540
           DIYELFAFK+CKGRKVKLALCEELDERMRDLKNELQSF+GEEYDE+HKRKAIDALKRMEN
Sbjct: 482 DIYELFAFKNCKGRKVKLALCEELDERMRDLKNELQSFEGEEYDENHKRKAIDALKRMEN 541

Query: 541 WNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFIT 600
           WNLFSDTYEEFQNY+VARDTFLAHLG+TLWGSMRHIISPSLSDGSFHYFEK+SFQLFFIT
Sbjct: 542 WNLFSDTYEEFQNYSVARDTFLAHLGSTLWGSMRHIISPSLSDGSFHYFEKVSFQLFFIT 601

Query: 601 QEKVRQIKQLPVDLKALMDGLSSLLLPSQKALFSQTMLPLSEDPALAMAFSVARRAAAVP 660
           QEKVRQIK LPVDLKALMDGLSSLLLPSQKALFSQTMLPLS+DPALAMAFSVARR+AAVP
Sbjct: 602 QEKVRQIKHLPVDLKALMDGLSSLLLPSQKALFSQTMLPLSDDPALAMAFSVARRSAAVP 661

Query: 661 LLLVNGTYRKTIRTYLDSSILQYQLQRLDHSLKGTNAPHSSTLEVPIFWFIHTEPLLVDK 720
           LLLVNGTYRKTIRTYLDSSILQYQLQRLDHS KGTNAP  STLEVPIFWFIH EPLLVDK
Sbjct: 662 LLLVNGTYRKTIRTYLDSSILQYQLQRLDHSHKGTNAPLMSTLEVPIFWFIHAEPLLVDK 721

Query: 721 HYQAKALSDMVIVVQSEISSWESHLQCNGKSLIWDMRKPIKAALSAAAEHLSGLLPLHLA 780
           HYQAKALSDMVIVVQSEISSWESHLQCNGKSL+WDMRKP+KAALSA +EHL GLLPLHLA
Sbjct: 722 HYQAKALSDMVIVVQSEISSWESHLQCNGKSLLWDMRKPVKAALSATSEHLFGLLPLHLA 781

Query: 781 YSPSHDTAVE----------------------FQSDTIARSYIITALEESIQRVNSAIHL 840
           YSPSHDTAVE                      FQSDTIARSYIITALEESIQ VNSAIH 
Sbjct: 782 YSPSHDTAVEDWIWSVGCNPFSITSRGWHVSQFQSDTIARSYIITALEESIQLVNSAIHR 841

Query: 841 LLMERTTEKSFKLFLSQERELVKKHQYVVSLWRRISTVSGELRYIDAVRLLHTLNEASKG 900
           LLMERTTEKSFK F SQ+RELVKKHQYVVSLWRRIS + GE+RYIDA+RLLH L+EASKG
Sbjct: 842 LLMERTTEKSFKPFHSQQRELVKKHQYVVSLWRRISNLIGEMRYIDAIRLLHVLDEASKG 901

BLAST of HG10007663 vs. ExPASy TrEMBL
Match: A0A6J1H937 (uncharacterized protein LOC111461618 OS=Cucurbita moschata OX=3662 GN=LOC111461618 PE=4 SV=1)

HSP 1 Score: 1603.2 bits (4150), Expect = 0.0e+00
Identity = 830/956 (86.82%), Positives = 855/956 (89.44%), Query Frame = 0

Query: 1   MGHDRGSSTVAAAKLFSLSGRFTIPKRLHLLCIVLLLLAARPFASSSGNRKSGKSSVFSL 60
           MGH RGSS          +GRF IP RL LLCIV LLLAAR FASSSGNRKS KSSVFSL
Sbjct: 2   MGHHRGSS----------AGRFWIPMRLQLLCIVFLLLAARSFASSSGNRKSVKSSVFSL 61

Query: 61  FNLKDKSRFWSETVIRGDFDDLESSTTEKLSVVNYTKA---------------------- 120
           FNLKDKSRFWSETVIRGDFDDLESS+ EK+SVVNYTKA                      
Sbjct: 62  FNLKDKSRFWSETVIRGDFDDLESSSPEKMSVVNYTKAGNIANYLKLLEVESLYLPVPVN 121

Query: 121 ------------EFKLHPEELERWFMKPDHIFEHTRIPQVSEVLTPFYNISMDKVLRHQL 180
                       EFKLHPEELERWF K DHIFEHTRIPQV EVLTPFY IS+DKVL+HQL
Sbjct: 122 FIFIGFEGKGNHEFKLHPEELERWFTKLDHIFEHTRIPQVREVLTPFYKISVDKVLKHQL 181

Query: 181 PLVSHINYNFSVHVIQTGEKVTSIFELARNVLSRKEDVSNNGDGNDAFWQVDVDLMDVLF 240
           PLVSHINYNFSVH IQTGEKVTSIFELARNVLSRKEDVSNNGDGND  WQVDVDLMDVLF
Sbjct: 182 PLVSHINYNFSVHAIQTGEKVTSIFELARNVLSRKEDVSNNGDGNDTLWQVDVDLMDVLF 241

Query: 241 TSFVEYLQLENAYNIFILNLKRDTKRARYGYRKGLSESEINFLKENAHLQSRILQSESTP 300
           TSFVEYLQLENAYNIFILNLKRD KR RYGYRKGLSESE++FLKE+ +LQSRILQSESTP
Sbjct: 242 TSFVEYLQLENAYNIFILNLKRDPKRPRYGYRKGLSESEMDFLKEDINLQSRILQSESTP 301

Query: 301 ETILALEKIKRPLYEKHPMSKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADIIQ 360
           ETILALEK+KRPLYEKHPMSKFAWT AEDTDTMEWYNICQDALRKVNE Y+GKETADIIQ
Sbjct: 302 ETILALEKVKRPLYEKHPMSKFAWTTAEDTDTMEWYNICQDALRKVNELYEGKETADIIQ 361

Query: 361 NKVLQILKGKDREMRLRLDKELKSFDFSGFHAECLTDTWIGNDRWAFIDLNAGPFSWGPT 420
            KV Q+LK KDREMRL LDK LKSFDFSG HAECLTDTWIGNDRWAFIDLNAGPFSWGP 
Sbjct: 362 IKVKQMLKAKDREMRLPLDKGLKSFDFSGLHAECLTDTWIGNDRWAFIDLNAGPFSWGPA 421

Query: 421 VGGEGVRTELSLPNVEKTVGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEI 480
           VGGEGVRTE+SLPNVEKTVGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEI
Sbjct: 422 VGGEGVRTEISLPNVEKTVGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEI 481

Query: 481 DIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMEN 540
           DIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDE+HKRKAIDALKRMEN
Sbjct: 482 DIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDENHKRKAIDALKRMEN 541

Query: 541 WNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFIT 600
           WNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFIT
Sbjct: 542 WNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFIT 601

Query: 601 QEKVRQIKQLPVDLKALMDGLSSLLLPSQKALFSQTMLPLSEDPALAMAFSVARRAAAVP 660
           QEKVR IKQLPVDLKALMDGLSSLLLPSQKALFSQTMLPLSEDPALAMAFSVARRAAAVP
Sbjct: 602 QEKVRHIKQLPVDLKALMDGLSSLLLPSQKALFSQTMLPLSEDPALAMAFSVARRAAAVP 661

Query: 661 LLLVNGTYRKTIRTYLDSSILQYQLQRLDHSLKGTNAPHSSTLEVPIFWFIHTEPLLVDK 720
           LLLVNGTYRKTIRTYLDSSILQYQLQRLDHS KGTN P SSTLEVPIFWFIH+EPLLVDK
Sbjct: 662 LLLVNGTYRKTIRTYLDSSILQYQLQRLDHSPKGTNGPRSSTLEVPIFWFIHSEPLLVDK 721

Query: 721 HYQAKALSDMVIVVQSEISSWESHLQCNGKSLIWDMRKPIKAALSAAAEHLSGLLPLHLA 780
           HYQAKALSDMVIV QSE+SSWESHLQCNGKSLIWDMRKPIKAALSA +EHLSGLLPLHLA
Sbjct: 722 HYQAKALSDMVIVAQSEVSSWESHLQCNGKSLIWDMRKPIKAALSATSEHLSGLLPLHLA 781

Query: 781 YSPSHDTAVE----------------------FQSDTIARSYIITALEESIQRVNSAIHL 840
           YSPSHDTAVE                      FQSDTIARSYIITALEESIQR+NSAIHL
Sbjct: 782 YSPSHDTAVEDWIWSVGCNPFSITSRGWHVSQFQSDTIARSYIITALEESIQRINSAIHL 841

Query: 841 LLMERTTEKSFKLFLSQERELVKKHQYVVSLWRRISTVSGELRYIDAVRLLHTLNEASKG 900
           LL+ERTTEKSFKLFLSQER+LVKKHQYVVSLWRRIST+SGELRY+DAVRLLH LNEASKG
Sbjct: 842 LLVERTTEKSFKLFLSQERDLVKKHQYVVSLWRRISTLSGELRYVDAVRLLHVLNEASKG 901

BLAST of HG10007663 vs. ExPASy TrEMBL
Match: A0A6J1JJ89 (uncharacterized protein LOC111484942 OS=Cucurbita maxima OX=3661 GN=LOC111484942 PE=4 SV=1)

HSP 1 Score: 1587.4 bits (4109), Expect = 0.0e+00
Identity = 819/930 (88.06%), Positives = 840/930 (90.32%), Query Frame = 0

Query: 27  RLHLLCIVLLLLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESST 86
           RL LLCIV LLLAAR FASSSGNRKS KSSVFSLFNLKDKSRFWSETVIRGDFDDLESS+
Sbjct: 2   RLQLLCIVFLLLAARSFASSSGNRKSVKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSS 61

Query: 87  TEKLSVVNYTKA----------------------------------EFKLHPEELERWFM 146
            EK SVVNYTKA                                  EFKLHPEELERWF 
Sbjct: 62  PEKTSVVNYTKAGNIANYLKLLEVESLYLPVPVNFIFIGFEGKGNHEFKLHPEELERWFT 121

Query: 147 KPDHIFEHTRIPQVSEVLTPFYNISMDKVLRHQLPLVSHINYNFSVHVIQTGEKVTSIFE 206
           K DHIFEHTRIPQV EVLTPFY IS+DKVL+HQLP VSHINYNFSVH IQTGEKVTSIFE
Sbjct: 122 KLDHIFEHTRIPQVREVLTPFYKISVDKVLKHQLPFVSHINYNFSVHAIQTGEKVTSIFE 181

Query: 207 LARNVLSRKEDVSNNGDGNDAFWQVDVDLMDVLFTSFVEYLQLENAYNIFILNLKRDTKR 266
            ARNVLSRKEDVSNNGDGND  WQVDVDLMDVLFTSFVEYLQLENAYNIFILNLKRD KR
Sbjct: 182 HARNVLSRKEDVSNNGDGNDTLWQVDVDLMDVLFTSFVEYLQLENAYNIFILNLKRDPKR 241

Query: 267 ARYGYRKGLSESEINFLKENAHLQSRILQSESTPETILALEKIKRPLYEKHPMSKFAWTI 326
           ARYGYRKGLSESEINFLKE+ HLQSRILQSESTPETILAL+K+KRPLYEKHPMSKFAWT 
Sbjct: 242 ARYGYRKGLSESEINFLKEDTHLQSRILQSESTPETILALDKVKRPLYEKHPMSKFAWTT 301

Query: 327 AEDTDTMEWYNICQDALRKVNESYQGKETADIIQNKVLQILKGKDREMRLRLDKELKSFD 386
           AEDTDTMEWYNICQDALRKV+E YQGKETADIIQ KV Q+LKGKDREMRL LDK LKSFD
Sbjct: 302 AEDTDTMEWYNICQDALRKVDELYQGKETADIIQIKVKQMLKGKDREMRLPLDKGLKSFD 361

Query: 387 FSGFHAECLTDTWIGNDRWAFIDLNAGPFSWGPTVGGEGVRTELSLPNVEKTVGAVQEIS 446
           FSG HAECLTDTWIGNDRWAFIDLNAGPFSWGP VGGEGVRTE+SLPNVEKTVGAVQEIS
Sbjct: 362 FSGLHAECLTDTWIGNDRWAFIDLNAGPFSWGPAVGGEGVRTEISLPNVEKTVGAVQEIS 421

Query: 447 EDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDE 506
           EDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDE
Sbjct: 422 EDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDE 481

Query: 507 RMRDLKNELQSFDGEEYDEDHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLG 566
           RMRDLKNELQSFDGEEYDE+HKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLG
Sbjct: 482 RMRDLKNELQSFDGEEYDENHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLG 541

Query: 567 ATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKQLPVDLKALMDGLSSLLL 626
           ATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVR IKQLPVDLKALMDGLSSLLL
Sbjct: 542 ATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRHIKQLPVDLKALMDGLSSLLL 601

Query: 627 PSQKALFSQTMLPLSEDPALAMAFSVARRAAAVPLLLVNGTYRKTIRTYLDSSILQYQLQ 686
           PSQKALFSQTMLPLSEDPALAMAFSVARRAAAVPLLLVNGTYRKTIRTYLDSSILQYQLQ
Sbjct: 602 PSQKALFSQTMLPLSEDPALAMAFSVARRAAAVPLLLVNGTYRKTIRTYLDSSILQYQLQ 661

Query: 687 RLDHSLKGTNAPHSSTLEVPIFWFIHTEPLLVDKHYQAKALSDMVIVVQSEISSWESHLQ 746
           RLDHS KGTN P SSTLEVPIFWFIH+EPLLVDKHYQAKALSDMVIV QSE+SSWESHLQ
Sbjct: 662 RLDHSPKGTNGPRSSTLEVPIFWFIHSEPLLVDKHYQAKALSDMVIVAQSEVSSWESHLQ 721

Query: 747 CNGKSLIWDMRKPIKAALSAAAEHLSGLLPLHLAYSPSHDTAVE---------------- 806
           CNGKSLIWDMRKPIKAALSA +EHLSGLLPLHLAYSPSHDTAVE                
Sbjct: 722 CNGKSLIWDMRKPIKAALSATSEHLSGLLPLHLAYSPSHDTAVEDWIWSVGCNPFSITSR 781

Query: 807 ------FQSDTIARSYIITALEESIQRVNSAIHLLLMERTTEKSFKLFLSQERELVKKHQ 866
                 FQSDTIARSYIITALEESIQR+NSAIHLLL+E TTEKSFKLFLSQER+LVKKHQ
Sbjct: 782 GWHVSQFQSDTIARSYIITALEESIQRINSAIHLLLVEHTTEKSFKLFLSQERDLVKKHQ 841

Query: 867 YVVSLWRRISTVSGELRYIDAVRLLHTLNEASKGFADQVNTTLALLHPIHCSRERKVEVV 901
           YVVSLWRRIST+SGELRY+DAVRLLH LNEASKGF+D+VNTTLALLHPIHCSRERKV+VV
Sbjct: 842 YVVSLWRRISTLSGELRYVDAVRLLHVLNEASKGFSDRVNTTLALLHPIHCSRERKVDVV 901

BLAST of HG10007663 vs. ExPASy TrEMBL
Match: A0A5E4FEA8 (PREDICTED: ZEAMMB73_Zm00001d016452 OS=Prunus dulcis OX=3755 GN=ALMOND_2B028330 PE=4 SV=1)

HSP 1 Score: 1374.8 bits (3557), Expect = 0.0e+00
Identity = 705/939 (75.08%), Positives = 787/939 (83.81%), Query Frame = 0

Query: 20  GRFTIPKRLHLLCIVLLLLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDF 79
           G F +P    ++ I LLLLA     S SG  KS +SSVFSLFNLK+KSRFWSE VIRGDF
Sbjct: 14  GLFPLP--FFIISIFLLLLATTSAGSPSG--KSTRSSVFSLFNLKEKSRFWSEAVIRGDF 73

Query: 80  DDLESSTTEKLSVVNYTKA----------------------------------EFKLHPE 139
           DDLESS   K+ V+NYT A                                  EFKLHPE
Sbjct: 74  DDLESSIPGKMGVLNYTNAGNIANYLKFLEVDSMYLPVPVNFIFIGFDGKGNQEFKLHPE 133

Query: 140 ELERWFMKPDHIFEHTRIPQVSEVLTPFYNISMDKVLRHQLPLVSHINYNFSVHVIQTGE 199
           ELERWF K DH FEHTRIPQ+ EVLTPFY IS+DK  RH LP+VSHINYNFSVH IQ GE
Sbjct: 134 ELERWFTKIDHTFEHTRIPQIGEVLTPFYRISVDKEQRHHLPIVSHINYNFSVHAIQMGE 193

Query: 200 KVTSIFELARNVLSRKEDVSNNGDGNDAFWQVDVDLMDVLFTSFVEYLQLENAYNIFILN 259
           KVTSIFE A NV SRK+D   N D  DA WQVDVD+MDVLFTS V YL+LENAYNIFILN
Sbjct: 194 KVTSIFEKAINVFSRKDDSYGNRDDGDALWQVDVDMMDVLFTSLVGYLELENAYNIFILN 253

Query: 260 LKRDTKRARYGYRKGLSESEINFLKENAHLQSRILQSESTPETILALEKIKRPLYEKHPM 319
            K D+KRA+YGYR+GLSESEI FLKEN +LQ++ILQS S PET+LAL+KIKRPLYEKHPM
Sbjct: 254 PKHDSKRAKYGYRRGLSESEIKFLKENKNLQTKILQSGSIPETVLALDKIKRPLYEKHPM 313

Query: 320 SKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADIIQNKVLQILKGKDREMRLRLD 379
           +KFAW++ EDTDT+EWYN CQDAL  V + Y+GKET DI+QNKVLQ+LKGK+ +M+L   
Sbjct: 314 AKFAWSVTEDTDTVEWYNACQDALNNVEKLYKGKETVDIVQNKVLQLLKGKNEDMKLLFS 373

Query: 380 KELKSFDFSGFHAECLTDTWIGNDRWAFIDLNAGPFSWGPTVGGEGVRTELSLPNVEKTV 439
           KELKS +F+  HAECLTDTWIG +RWAFIDL+AGPFSWGP VGGEGVRTELS PNV+KT+
Sbjct: 374 KELKSGEFNNLHAECLTDTWIGKERWAFIDLSAGPFSWGPAVGGEGVRTELSSPNVQKTI 433

Query: 440 GAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEIDIYELFAFKHCKGRKVKLA 499
           GAV EISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEIDIYELFAFKHCKGRKVKLA
Sbjct: 434 GAVSEISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEIDIYELFAFKHCKGRKVKLA 493

Query: 500 LCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMENWNLFSDTYEEFQNYTVARD 559
           LCEELDERMRDLKNELQSF+GEEYDE HKRKA++ALKRMENWNLFSDT+EEFQNYTVARD
Sbjct: 494 LCEELDERMRDLKNELQSFEGEEYDESHKRKALEALKRMENWNLFSDTHEEFQNYTVARD 553

Query: 560 TFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKQLPVDLKALMD 619
           TFL+HLGA LWGSMRHIISPS++DG+FHY++KISFQLFFITQEKVR IKQLPVDLKALMD
Sbjct: 554 TFLSHLGANLWGSMRHIISPSIADGAFHYYDKISFQLFFITQEKVRHIKQLPVDLKALMD 613

Query: 620 GLSSLLLPSQKALFSQTMLPLSEDPALAMAFSVARRAAAVPLLLVNGTYRKTIRTYLDSS 679
           GLSSLLLPSQK  FSQ +LPLSEDPALAMAFSVARRAAAVPLLLVNGTYRK++R+YLDSS
Sbjct: 614 GLSSLLLPSQKPAFSQHLLPLSEDPALAMAFSVARRAAAVPLLLVNGTYRKSVRSYLDSS 673

Query: 680 ILQYQLQRL-DH-SLKGTNAPHSSTLEVPIFWFIHTEPLLVDKHYQAKALSDMVIVVQSE 739
           I+QYQLQR+ DH SLKG  A   STLEVPIFWFIH EPLLVDKHYQAKALSDMVIVVQSE
Sbjct: 674 IVQYQLQRMNDHGSLKGKLAHSRSTLEVPIFWFIHGEPLLVDKHYQAKALSDMVIVVQSE 733

Query: 740 ISSWESHLQCNGKSLIWDMRKPIKAALSAAAEHLSGLLPLHLAYSPSHDTAVE------- 799
            SSWESHLQCNG+ L+WD+R+PIKAAL+AA+EHL+GLLPLHLAYS +H+TA+E       
Sbjct: 734 PSSWESHLQCNGQPLLWDLRRPIKAALAAASEHLAGLLPLHLAYSQAHETAIEDWMWSVG 793

Query: 800 ---------------FQSDTIARSYIITALEESIQRVNSAIHLLLMERTTEKSFKLFLSQ 859
                          FQSDTI+RSYIIT LEES+Q VNSAIHLL+MERTTEK+FKL  SQ
Sbjct: 794 CNPYSITSQGWNISQFQSDTISRSYIITTLEESVQMVNSAIHLLVMERTTEKTFKLVQSQ 853

Query: 860 ERELVKKHQYVVSLWRRISTVSGELRYIDAVRLLHTLNEASKGFADQVNTTLALLHPIHC 901
           EREL+ K+ YVVSLWRRISTV+GELRY+DA+RLL+TL EASKGF DQVNTT+A+LHPIHC
Sbjct: 854 ERELINKYNYVVSLWRRISTVTGELRYVDAMRLLYTLEEASKGFVDQVNTTIAILHPIHC 913

BLAST of HG10007663 vs. TAIR 10
Match: AT5G58100.1 (unknown protein; INVOLVED IN: pollen exine formation; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 8 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G28720.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 1238.0 bits (3202), Expect = 0.0e+00
Identity = 621/940 (66.06%), Positives = 750/940 (79.79%), Query Frame = 0

Query: 19  SGRFTIPKRLHLLCIVLLLLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGD 78
           +G  ++ K +  +C+ +L + +  + +S GNRK+ KSSVFSLFNL+DKSRFWSE+V R D
Sbjct: 6   AGNRSVSKLVLTICVAILFIPSLSYGASQGNRKTAKSSVFSLFNLRDKSRFWSESVFRTD 65

Query: 79  FDDLESSTTEKLSVVNYTKA----------------------------------EFKLHP 138
           FDDLESS      V+NYTK+                                  +FKL P
Sbjct: 66  FDDLESSVHSNSGVLNYTKSGNIASYLELMEVDSVYLPVPVNFIFIGFEGKGNQDFKLRP 125

Query: 139 EELERWFMKPDHIFEHTRIPQVSEVLTPFYNISMDKVLRHQLPLVSHINYNFSVHVIQTG 198
           EELERWF K DH+FEHTR+PQ+ EVL PFY I+++K ++H LP++S +NYNFSVH IQ G
Sbjct: 126 EELERWFNKLDHMFEHTRVPQIKEVLNPFYKINIEKEVQHHLPIISRVNYNFSVHAIQMG 185

Query: 199 EKVTSIFELARNVLSRKEDVSNNGDGNDAFWQVDVDLMDVLFTSFVEYLQLENAYNIFIL 258
           EKVTS+ E A  VL+RK+DV+ N D   A  QVD ++M+ +FTS VEY  LE+AYN+FIL
Sbjct: 186 EKVTSVIEHAIKVLARKDDVATNKDEESALLQVDAEMMEFIFTSLVEYFHLEDAYNLFIL 245

Query: 259 NLKRDTKRARYGYRKGLSESEINFLKENAHLQSRILQSESTPETILALEKIKRPLYEKHP 318
           N K D K+A+YGYR+G SESEI++LKEN  +   +LQS    E ILA + +++PLY++HP
Sbjct: 246 NPKHDNKKAKYGYRRGFSESEISYLKENKEILKNLLQSGKPSENILAFDMVRKPLYDRHP 305

Query: 319 MSKFAWTIAEDTDTMEWYNICQDALRKVNESYQGKETADIIQNKVLQILKGKDREMRLRL 378
           M KF+WT AE+TDT EW+N CQDAL K+ +   GK+ A++IQ+KVLQ+L+GK+ +M++ L
Sbjct: 306 MLKFSWTNAEETDTAEWFNACQDALNKLEQLSLGKDAAELIQSKVLQLLRGKNEDMKVFL 365

Query: 379 DKELKSFDFSGFHAECLTDTWIGNDRWAFIDLNAGPFSWGPTVGGEGVRTELSLPNVEKT 438
           +K+L++ DFS  +AECLTD WIG  RWAFIDL AGPFSWGP+VGGEGVRTELSLPNV  T
Sbjct: 366 EKDLRAGDFSNLNAECLTDIWIGKGRWAFIDLTAGPFSWGPSVGGEGVRTELSLPNVGTT 425

Query: 439 VGAVQEISEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEIDIYELFAFKHCKGRKVKL 498
           +GAV EISEDEAED+LQ AIQ+KF+VFG+ DHQA+DILLAEID+YELFAFKHCKGRKVKL
Sbjct: 426 IGAVAEISEDEAEDKLQTAIQDKFSVFGENDHQAVDILLAEIDVYELFAFKHCKGRKVKL 485

Query: 499 ALCEELDERMRDLKNELQSFDGEEYDEDHKRKAIDALKRMENWNLFSDTYEEFQNYTVAR 558
           ALCEELDERMRDLK ELQSFDGEEYDE HKRKA+DAL+RME+WNLFSD  EEFQNYTVAR
Sbjct: 486 ALCEELDERMRDLKTELQSFDGEEYDETHKRKAMDALRRMESWNLFSDEREEFQNYTVAR 545

Query: 559 DTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKQLPVDLKALM 618
           DTFLAHLGATLWGSMRHIISPS++DG+FH++EKISFQL FITQEKVRQIKQLPVDLKALM
Sbjct: 546 DTFLAHLGATLWGSMRHIISPSVADGAFHHYEKISFQLVFITQEKVRQIKQLPVDLKALM 605

Query: 619 DGLSSLLLPSQKALFSQTMLPLSEDPALAMAFSVARRAAAVPLLLVNGTYRKTIRTYLDS 678
           DGLSSLLLPSQK LFSQ ML LSEDPALAMAFSVARRAAAVPLLLVNGTYRKT+R+YLDS
Sbjct: 606 DGLSSLLLPSQKPLFSQHMLTLSEDPALAMAFSVARRAAAVPLLLVNGTYRKTVRSYLDS 665

Query: 679 SILQYQLQRL-DH-SLKGTNAPHSSTLEVPIFWFIHTEPLLVDKHYQAKALSDMVIVVQS 738
           SILQYQLQR+ DH SLKG +A   STLE+PIFW I  +PLL+DKHYQAKALS+MV+VVQS
Sbjct: 666 SILQYQLQRVNDHTSLKGGHAHSRSTLEIPIFWLISGDPLLIDKHYQAKALSNMVVVVQS 725

Query: 739 EISSWESHLQCNGKSLIWDMRKPIKAALSAAAEHLSGLLPLHLAYSPSHDTAVE------ 798
           E SSWESHLQCNG+SL+WD+R P+KAA+++ AEHL+GLLPLHL YS +H++A+E      
Sbjct: 726 EASSWESHLQCNGRSLLWDLRSPVKAAMASVAEHLAGLLPLHLVYSVAHESAIEDWTWSV 785

Query: 799 ----------------FQSDTIARSYIITALEESIQRVNSAIHLLLMERTTEKSFKLFLS 858
                           FQSDTIARSY+ITALEESIQ VNS IHLL +ERT +K+FKLF S
Sbjct: 786 GCNPFSVTSQGWLLSQFQSDTIARSYMITALEESIQAVNSGIHLLRLERTNKKTFKLFQS 845

Query: 859 QERELVKKHQYVVSLWRRISTVSGELRYIDAVRLLHTLNEASKGFADQVNTTLALLHPIH 901
           +EREL+ K++YVVSLWRR+S V+GE RY DA+R LHTL EA+  F  +VN T+ +LHPIH
Sbjct: 846 RERELMNKYKYVVSLWRRLSNVAGETRYGDAMRFLHTLEEATSSFVREVNATVGVLHPIH 905

BLAST of HG10007663 vs. TAIR 10
Match: AT3G28720.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G58100.1); Has 1610 Blast hits to 344 proteins in 85 species: Archae - 0; Bacteria - 567; Metazoa - 95; Fungi - 71; Plants - 145; Viruses - 0; Other Eukaryotes - 732 (source: NCBI BLink). )

HSP 1 Score: 57.8 bits (138), Expect = 5.3e-08
Identity = 56/239 (23.43%), Positives = 106/239 (44.35%), Query Frame = 0

Query: 528 LAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKQLPVDL--KALMD 587
           LA L + ++ + + +I PSL     ++ + +  Q   +   +V+    L ++   +  MD
Sbjct: 286 LADLASLVYNAYQVLIVPSLRI-PVYFEDTLVVQFIHVYGSEVKDSSGLDLEFVKRTFMD 345

Query: 588 GLSS--LLLPSQKALFSQTMLPLSEDPALAMAFSVARRAAAVPLLLVNGTYRKTIRTYLD 647
              S  LLL  QK  F    +   E      +F+V+R   +     +   Y   +  YLD
Sbjct: 346 EAESGGLLLGEQKLSFKSYSVNYRE--CSICSFAVSRGMNSYTSRFLFDNYTLIVSEYLD 405

Query: 648 SSILQ-------YQLQRLDHSLKGTNAPHSSTLEVPIFWFIHTEPLLVDKHYQAKALSDM 707
           S  +         +L+R+   ++      +  L V +F      PLL+D+++Q+ A  DM
Sbjct: 406 SKHMHRALTDSAEELRRVAGIVEEEGNEFARVLPVYVFDLDINTPLLLDRYHQSVAFRDM 465

Query: 708 VIVVQSEISSWESHLQCNGKSLIWDMRKPIKAALSAAAEHLSGLLPLHLAYSPSHDTAV 756
           VI V++  +   S   CNG+ +    R   +  + +  + + G+   HL +SP H+T +
Sbjct: 466 VIAVRTRGTQTVSDYTCNGRHVFVHTRDLERPLVGSILQSMWGVSSTHLTWSPRHNTTL 521


HSP 2 Score: 44.7 bits (104), Expect = 4.7e-04
Identity = 44/190 (23.16%), Positives = 78/190 (41.05%), Query Frame = 0

Query: 352 DFSGFHAECLTDTWIGNDRWAFIDLNAGPFSWGPTVGGEGVRTELSLPNVEKTVGAVQEI 411
           D S    +CL   W G DR+ +IDL+AGP  +GP + G+GV     LP      G    +
Sbjct: 223 DSSAGFTKCLGSIWTGKDRYLWIDLSAGPVDYGPALSGDGV-----LPR-----GEFHPL 282

Query: 412 SEDEAEDRLQDAIQEKFAVFGDKDHQAIDILLAEIDIY----ELFAFKHCKGRKVKLALC 471
           +      + + A+    A      +Q + +    I +Y     +  F H  G +VK +  
Sbjct: 283 AALHGRPKSEKALLADLASLVYNAYQVLIVPSLRIPVYFEDTLVVQFIHVYGSEVKDSSG 342

Query: 472 EELDERMRDLKNELQS---------FDGEEYDEDHKRKAIDALKRMENWNLFSDTYEEFQ 529
            +L+   R   +E +S            + Y  +++  +I +       N ++  +  F 
Sbjct: 343 LDLEFVKRTFMDEAESGGLLLGEQKLSFKSYSVNYRECSICSFAVSRGMNSYTSRF-LFD 401

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038880657.1  0.0e+00   90.90     uncharacterized protein LOC120072284 isoform X2 [Benincasa hispida]
XP_004139093.1  0.0e+00   90.17     uncharacterized protein LOC101207480 isoform X1 [Cucumis sativus] >KAE8653558.1 ...
XP_038880656.1  0.0e+00   90.05     uncharacterized protein LOC120072284 isoform X1 [Benincasa hispida]
XP_008443650.1  0.0e+00   89.75     PREDICTED: uncharacterized protein LOC103487197 [Cucumis melo]
XP_022157070.1  0.0e+00   86.51     uncharacterized protein LOC111023880 [Momordica charantia]
Match Name      E-value   Identity  Description
A0A1S3B823      0.0e+00   89.75     uncharacterized protein LOC103487197 OS=Cucumis melo OX=3656 GN=LOC103487197 PE=...
A0A6J1DS42      0.0e+00   86.51     uncharacterized protein LOC111023880 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A6J1H937      0.0e+00   86.82     uncharacterized protein LOC111461618 OS=Cucurbita moschata OX=3662 GN=LOC1114616...
A0A6J1JJ89      0.0e+00   88.06     uncharacterized protein LOC111484942 OS=Cucurbita maxima OX=3661 GN=LOC111484942...
A0A5E4FEA8      0.0e+00   75.08     PREDICTED: ZEAMMB73_Zm00001d016452 OS=Prunus dulcis OX=3755 GN=ALMOND_2B028330 P...
Match Name      E-value   Identity  Description
AT5G58100.1     0.0e+00   66.06     unknown protein; INVOLVED IN: pollen exine formation; EXPRESSED IN: 19 plant str...
AT3G28720.1     5.3e-08   23.43     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source   Source Term    Source Description             Alignment
None      No IPR available  COILS    Coil           Coil                           coord: 460..487
None      No IPR available  PANTHER  PTHR31515:SF2  TRANSMEMBRANE PROTEIN          coord: 29..98; coord: 98..899
None      No IPR available  PANTHER  PTHR31515      TRANSMEMBRANE PROTEIN-RELATED  coord: 29..98; coord: 98..899

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10007663.1  HG10007663.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane