HG10006289 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10006289
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: zinc finger CCCH domain-containing protein 40-like
Location: Chr07: 16987019 .. 17008374 (+)
RNA-Seq Expression: HG10006289
Synteny: HG10006289
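
The location field above can be split into its parts programmatically. The following is a minimal sketch, not part of the original record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split a string like 'Chr07: 16987019 .. 17008374 (+)' into its parts."""
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr07: 16987019 .. 17008374 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the gene spans 21,356 bp on the plus strand of Chr07.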
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTATATCCGAATCAAGCGCCATAAGACAACTTACTTTATCCAGTGTGATCCAATTGAGACAACTTTAAATATCAAGCAAAAATTAGAGTCCCTTATTGACCAACCAGTAGTTGACCAGCGCTTGATCCTGGTGGGGAGTGGGGAAGTATTGGAGGATTCAAAGACACTGGCTGATCAGAAGGTAGGTCTTAAAGACATATATTTATTACATTTGGTCATCAATTGTCGGATCTTTTGGAAATATTAAGGTTAAAAAAATACAACGGCTGCAACTCTGATAAACGTTTTCTATGTATTTCCAGGTTGAAAATGATGCAGTTGTGGCTCTAACGTTGCGAAAAGGTTTTTCCCTACCAACTCCCAAAAATTTTTGTGAACTTCTTGTTCTAGCACTTGTTTTTAAAAGCCTGTGTTCTGATCAGGGACTGAAAATTATTTGTGGGCTTGTGCTTGGATTTGGAGTCTTTATAGTTATGATCTTCATTGTCATTGGATCTGGATTCGAAATATGAAACTGATAGGAACGGTATTAGGGTGTTATTAGGATATTAAGGATATAATAGTAATTAGATAGGTGGTTGGTTATGATATTTGGTTATAAATAGAGAGGCAGAGGAATGGAGTAGTTAGGCAAAATCTTAGTAAGTGAATTAGGGCTTGAGAGAGGTCTCAAGAGAGGAGGGTTTATGTTCCTCAAATTACTTGACTATCGTGTAAGTTTTCGTACTTTACATTTCAATATAATGGATTGTGTTTTGGTTCCTATAAAAACGTTGTCTCATCATATGTTTTCTTGACTTTTTTCCCCTTCTTTGTGTAGATGATAATGACTTCGAGGAGATCGACATCGTCCAGCCAAATGATTTCTACCAATCTCGTGATGCCGATTCTGGCAATTGGTAAGGACTGAAAATTTGCTTCTACTGTGGGTAGTATATGATGTTCAGCCATCGATCTTGTGGGTGTACTGTTTAAATCACTTTTCTATCTGTTGGATGGAATTTGAATATGATGGAAAAGGAAGTATACTCCATCCTTTCTATTTGTTACCCCGTTAAGTGGTATCCTTCAGGGGTTGCATTAGTGATTTAACTCAACATGTATTGTTGCTACAAGTACTCTATATGAGTTGAGGATTTTGATTAGGACTTTTTAATTGGCCTCAAATGATATGAATGGGTCTAATAAAGTTGTAAATTGATTTTGTTAAATCATCGACTGATCCAAAAGCTTAAGGTAATGTGTGAAGATAAATTTACTTATCATATCTTCTAACACTCCTTCTCACTTGTTCGCTTGAAGTATTATGGAGACCCAACAAGTGGAAATGAATATTAATCGGGGAGGAAATAACATTACATAGGCTTGAACATAAGAGCTCATTGGACCACATGCTCTGATACTATGTTAAATCATCGAGTTTTTTTCTTATGTCAATTGGGTAAGACACTGATTACCATTTTAAAGGTTGATGATTCAATCTTCTACTCCGATAATGATTGAACTTAAAAGGAAGATAAAACTATATGAATTGTTTCAAGATATTCTTAAAATACGAGTTTCGTGGTTAAGATAATGTTTGTTTAAATAGAAATTTAATTTTTGAATTTGTTACGAAAAATGTAATTGGAAACATGAAATCTCGTTTCCAACTTTACATTTATAGATTGCCTACCAAACAAATCATAAATGAAGACAATTGTTTTTTTGAAACTCAATGATTTTTTTCAAAACTTTCTCACTAGTTGTAGGACATAAATAAATATTTTAAAGTTAAAATACTAATTTGGTTCCTATACTTTGTGTCTCATTCTATTTTAGTTTATGCACTTTTAAATTTTTTAAATTTAATTTATGTACTTTCAATAAATCTTGTAAATTAGTCATGTTTATTTGTATATAATTAACTTTATCGATCTTAGTAAAAAAATAGCAATAATTCTCATTTGTGTTCGTACACATTGTGAAAGAAAAAAAAGCTAACAATAACCTAGTAGT
AAGACTAAATTTAACCTTTATCGGAAGTACAATTATTAACATTTAATATTCAACTTCTAAAATATTAAATTTGAGAAATTACTATAAATAGAAAAAATTTCAAATTATTTACAAATATGGAAAAATTTCACTATCTATCATAGATCGCATTAGAATTATTTTTTTATATTTGAAACCCGAAAGTTTTGTGGAATTTTTTTTTGGTTATCTAATTTTTAAGTATATTTTCAAAATTCAAGTTAAATTATAAAAACTAAAATAGTAAATTTTTAAACATTTATTTACTTGTTTATCTTGTCTAAGAAATCAAATATCTCCTTATATAATATGAAAATTAGTGCAAAATATGGTGGGAAACCCAGCAGAACTTTAAAAAGCAAACAACAAAAATGTTATCCAATGATTTTTTTTTTTATTAGTATAATAGGGATGAAGGAATTCAACCTACAAATCTTATAGTTCTTATAGTCACACATACTATATGTTAATTGAACTATATTCGTTTTCACAATAAAATAAAATTTAATTTTCAAACTTTCAAAAGTGTACGTCCTTAGACTTTCAATTATGTATCTAACAATTCCTTGCATTTTTAGTTTTATGTCTAACGGGTATTTTCATTTTCAATTTCGCGTCTGATAAATTATTTACACTCCAAATTTAAAAAATCAGCTAATCTATTATATATAAATTTGAATTTTATATCTATTTTTTTTACTTTAAACTTTAAATTTTATGTATAATAGATCAATAAATTTTAAAATTTTAAAAAACTCACTGAACAAATTTGTAACTTTCAAAGTTTAAATAAACACAAAGCTCAAAATTAAGAGTCTAAATTTATAATTTGACCAAAACAAAAAACTCAAAAATGATATTTAAACTATTAAAATAAGAACAATAGTTATGTGTGTCTCGTACTGCATCTATTTTTTTAGTCCATCCAATTATACAACAGAAAAATGTAAGAATATTTTATGAACAAGAAAAAGAAATATTTATCACACAAACTTTAGCTGCATAGAAATAGATGCAGGCAATGATGCAATTAAAATCATCTCGTTAAAATAACATTTTGGTTCTTCTCTTTTGAAGTTTGTTCAAATTTAGTCCATGTTCTTTCAATTGTCAAATATTAGCCATTGTAATTTTAATTGTCTAATTTTAGTCTCTCTATTTTCAATAAATCTTAAAATTAGTCTCTCAAAGTAAAAGTTAATGTTTAACGAAATTGGTTAAATAATAATAATAATTACCTTCAAGAAAAATACAATATATATTGTAAAAAAAAAAAACAAATATATGAATATATTTTCAAAATTAATAGTAAAAATGCCCATAACAATTATTTTATTTTATTTTTTGTTTAAAAAAATCAACAACAAACTAGTAAGCGATGTCTAAATTTAAAATTTATTAAGATAAGGAAAATTACTATAAACAGAAAAAATATCAAACTATTTACAAATATAGAAAAATTTTACAATGATACACCGCGATATAATTCTATCACTTTCTATCGCTCAAACAAATAAATTCTATAGTGATAGAAAGTGATAGAATTTTATCACGATCTATTAGACTATAAAATTTTTCTATATTTATAAATAGTTTCGCTCATTTTTTTTATATTTAAAAAAAACTCTAAAGATAAAATGTAGTAAAATTAAACAAATTTGAAAGTATGACTAAAATGTTAATTTAACTAATTTGTGGTCTTACAATAATAAAATATTTTATTAAAATAATACATAACAAACAAACAAATGCAAGGGATTGAGTTTTCACAGTATTCCGTTGGTCATCCGAATTCCAAATTCTGCAGTTGCAGTCTACTGCTTTCGTTCTCCGTGGGCTACTCCCACTCCATCCGCCATGGCTCTTCTTCCCACCAAGCTCGCTCTCGGATGCCCTAATTCTACACTCTTTTCTGCAGCTCCCAGGCTCTCTTCTCTTCGTCTTCCACCATTCCGAGTCTCATGCGCCAACAAACGGACCGG
TAAGAGGAGGTATCCATCGGAAAAGAAGAAGCTCAAATTGAAACACAAAGAAGTCCTCACGACCGTCGAGAACAAGTTCGAAGGCATTTGGAGGTTGTTCAAGCTCGGAGTTCCCGTGGAGAAGGATCCTGGCAAGGATTTTCATGGCCTCTCGGATGCTTTGATGCAAGAGATTGCTAAAGTGCTCGAGTTCCCGGTACTTTTGGCAGCTTCATTGATTGTTATATTTTGAAAAACATTGTGAGTCGAATTTATTTTGTTTTCCAGGTCGCTTCGTTGCTACCGCGGGAAGCTTTCTCAGTTATTCGTAAATCTTTTGACGCTAGAAAGGTATGTTTAATGATATTCTTCTATTTTTTTTATTTAACATTTTTACGTTTTCTTAGCAATGTCTGTTGTTTAGAAGGAATAATTTATTATCAAGAAGTTGCTGGGTTATTATCTTGTGCTAATTCATCACCGATGCTTCCAGATGTTGAAGGAACCAAAGTTTGTTTATACTGTGGACATGGATGTACATAGGCTACTGATTCTTGAACCTCGTGCTCAGGATTTCATTTCGGACTTGGAGCCTAAAGTTGGATTGATGGAACATTTTGCAAAAGAAAAGGTATCCAATGATGTAATCAGTATCGTTCATGATCTCAAAAGTAATCAGGAAGTGGTGGGAGCAAATGGACTTAACGGTCATTCTGGCCCGTACTTGCGCTTGTCAAATGGTAAACCAAAAGTTGCTGTTGTTGGCAGTGGGCCATCTGGCCTTTTTGCCTCACTTGTCCTTGCAGAGTTTGGGGCCGATGTTACCTTGATTGAAAGAGGTCAACCAGTGGAACAAAGAGGGCGTGATATTGGTGCATTGGTAGCTCGTCGGATTCTGGAGCTGGATAGCAATTTTTGCTTTGGGGAGGTTATTTTTCTTACTTCTTTTCTTACTTCTTCCATAGACTTATTTGGTTTTAAGTTATATATGCATTGCATTATTGCTTGATTTGAATGCCTCACTTTTGACCGTTGTTGTATGATCTTTCAGTGTCACTTCTCATCTGGTCTGAAAGATTTGTTGTTATACTTGATGCTTATGATTGCATATACACGAACAACAAATTGAAAAATAAACTCATACTTACTGTTGTATATTCTGAAATTAGAAACGATGTCAAAGATTGGACAGAACCAACTTCTTAATTTTTTCACTGTTGCTAGACAAAGTAAGGTTTAAAACATTTCCAAATGAAGAGCTGTAACATTCCAAGTCGACTGAACAACAACTGACCGCAGAACCTTTCCTCTATTTCTCAACCTCCATCCTGACAACGCCTCTATCAACCAAGTACAGGGGTATATTGAACCCCTATAACACCCATTCCAAGCCAATGAAGTAAATGGGCAATGAATTAACAAGTTCTCAAGGGACTCAACAATTTTGAGGCACAACAAGCACACTGAAGGAGATAAAACCCAACTAACACATTTCCTTTGGAGCAAATCATGAGTGTTTAGCTCTTTGTGAGCCAAAGACCACAAAAAAAAAACTTTCACCTTCTTTGGAGTGCAATCCTTCCAAATAGCATCAATTAAGGAAATTTCTTCACAAATATGCCATAGCTTAAATACTAATTTACGTCCAAGATTTCTTTAGATGAACTTCAAATCCAGGGGATGAATATTATTCTTAGGAACTGAACTATTTGATAGTCAAACTTGAAGCTAAGTTTATTTAGATAGAGTAGTTTTGGTCAGAAAGAAACGATAGAATTTTCCTAGGAGCAGAGAGTTATTATTTTTTTGGAGCATATTTTTCTTGTGGTGTGGCTCTGTTGCAGATTGTCCAAAGACCAAACCTTTTGTAGCTATGTGGCATGTGCTATTTTCTCTTTCTGCTAATTGAATATATATATATATATATATATATAAATGTACGCATATTTTGGCTAACATCTTTAGCTGAGGCTATATAATAGTCAATGCATCAGTTTATCTAAACAAACATTACCAATGAT
TGAATTCTCACTGTTGTGTGCAATGTGGGTGTCACTTTATTCATTAATTCAATTTTGGTATTCTTGACTATATCTATATGCACTGTTGATTGGGTATCTTTTATATTTTTCTCATTGGCATATTGTCCTACAGTCATTTAATGGTTTTGTTTCTCTTACAACCTTACTTAAACAGGATCTGTTCTATAGGTTGTACAAATGTGTCCTTGAGTTCTAATGGTGTTTCTTTTTTCTTCAAAGATATCATGATGCTTGGGTCTTCAGTTATTTATCTGAGTAATTCATTTAGAACTTATGCGTGGCAGGGTGGTGCAGGTACCTGGAGTGATGGGAAGTTGGTCACTAGAATTGGTAGAAACAGTGGCAGCGTGCAAGCGGTTAGACTTTTCCTCCTTATAGATCTATAAGGGTGTCTTTTCTTTTCTATTATTTTTAATAATATTACACTAAATAGTTGTCACAATATTAGGTGTGCTTTTACAATAAGGGACCTAAATCTTGTGACCTTCTGAATGTTGTTCTGTAGTTTCTCCATGGTGTCTAATCTTCGTTACAATTCATACTTCCTTTATGTATTCTTTTATTTCTATCTATGAAAGCACAATCTCCTTAAAAAAAAAACAAAAGAAAAAAAAAGAAAAAAAAAAGGGAAAAACCACTTTCTCGCCTCAACTTGCTGCACATTCCAATTTCCAACAATATTATGTCATGTGCTTTATCAAGTATATCCCTCGACAACTGAATTTTTTGTTCTCATTCTAATTTCTAGGTTATGAAATCTTTAGTTTATTTTGGGGCCCCAAAGAAAATCTTACTCAATGGAAAGCCTCACCTTGGAACAGACAGGTTGATTCCATTGCTTAGGAACATTCGGCAGCACTTAGAAATGTTGGGTGTGAGTATCTACCCTATTCTGTCCTTGGATGTATGTAAGATTAAATAGTTTTCTAACTTCCACATTTTTCAAATTTTCCCTTTAACTTTCATGACTTCATTATAGGTCAATATCAAGTTCGGGACCAGGGTTGATGATCTAATTGAAGAGAGTGGACATATAGTGGGTGTTAAAGTTTCTGATTCAAGAGACAAGTCAAAGCTCAGCAACCAGAAGCTTGAATTTGATGCCATTGTCCTAGCTGTTGGCCATTCCGCGCGCGATGTATATCAAATGCTTGTGTCTCATAACATTCCCGTGGTTCCCAAGGAGTTTGCCGTTGGTTTCTTCTTGATCCCTGTTGTTTTGTGGATAACTTTGCTTCTTTAACAAGGATATTGCATCTCTACATGCTCTCTTTATACCAAAAAAATGTCGAATACATGGATGCTAAAATAATGCTGGAAAACTTAGCTGGATTCACTTTTTGTGCCAGATTTGTTGCCGGTTTACAGGATGTCTTTCATATATTCTAACATTGTTGTCCTCCGTGCCTGAATAGGTTGGTTTAAGGATCGAGCATCCTCAAGAATTAATAAACAGCATACAGGTTAGTTTTATCGCATTGTTTTAAATTGATTGTTTTTCACTATATTTCCTACTAAAAAGAATTAAAAAATTCCAATAATTTTCCCAAAATTACGTTGGATGCATTCCTATCAAACAACTACTTGCTAATTATGGTGGTTTCATCATCATCTTCATGCAAGCATGCATACAAATTGAGTCGGTAAAATAAAAAAAATAAAAAAAAAAGAACAAAACTTTGCAACTACTCTAAAGCTGAGTTGATTTTATGACTAATTTTTCTCTCTTATCTACTCATACTACCTGGATTGGTTATCTTTCATTTCATATATTTTTCACCTTGTTTTAAGATGTAAACCTGCTCATGTATTCAAATGTAGTATTCTGGATTGGCCAATGAGGTAGAGAAAGGACGTGGAAAAGTACCCGTGGCAGATTACAAAGTTTCCAAGTATGTTAACGTAGACACGGAGAATCCGTCCTCCAATTCTCTTGCAGCAAGTCGCAGTTGCTATTCATTTTGCATGTGTCCTGGTGG
CCAGGTACGTTTCTCAATTGATTGCTGTGACCATTGGTCATGAATGCACATTTTAAAGACTGGCTTTAGGTTGCAAGCTCTTCTGGTCTCTGATTCCACCATAATTGGATGATTTGGCATTGACTGTGCAGTTATGTTAAAAAAAAAAAACTGTCACTTATTAGAAGAGATAAACTCAGTTACTTCCTAATGTCACATTTGCATCAATTCATGTCTTGATGTCAAAAAACGGACATTATTTGCTCTTTAATTTCTGATACTTAAGAAAAGTAGAATATTAGTATGTTTAAGTGTCGTTTCATTAGGAAATTGGCATCTGTCGTAAATCCACCATCTTAACAAATTTTATCATTGTATTATCTCTGTCCATAGTAACTTGGTATGGTCATTTCTAGGTTGTCCTCACAAGTACAAACCCGGGAGAACTTTGTATCAATGGCATGTCATTCTCTCGACGTTCATCAAAATGGGCAAATGCTGCCCTTGTTGTTACTGTTTCAACTAAGGACTTTAATGATCATGGTTTCCACGGACCTCTTGCTGGGGTTGAATTCCAGGTAGCTTTTCTGTTGTTCGCTCACTTTGTCCTAAAGAACATTTTATCTCTTTCAGCAATTTTTTCAGGCATATCTCCTTTTGTGATCTAATAAATCTATCTTTTCCTAAAATAACTTCCGGCCATTAACTTCTGCTATTAGATATTAGTGACTTATCCAAATCACTGACTTTCATAAAGGATGTCTCTTACTGCTATTACACTATTTTCTTGTTAGAGAGAACTTGAGCAAAGAGCAGCCGTCATGGGAGGTGGAAATTTTGTTTTGCCCGTGCAGACAGCTACTGATTTTATGGACAGAAGATTAAGAGGTATGTCAAGTCACTGATTTCTCTGAAACTTCAGAGTTGAGAAGGATACTGCCACTCATTAAGTCTAGACTCCAGGCACCTGACGTACAAATGTGCTTCTTATTGTTTGACTGAAACACTGAAATGTTCTATCGTGAAAATCAAGTTTCTAACCAACTGACCTATGCTCAAGATGGCTGTTAAGAAAATCTACATGATCATGTCCGACATTTACTATCAGCATGAGAATCTGCCCTAAATTTTATTATACCAACTATTTCCCTTTCAGATCGATTTTTTTCTCTCTATTCTTAGTGACATCTGTGCCGCCATCAAGTTACCGGTTAGGAGTGAAGGCCTCAAATCTCCACGAGTTATTCCCTGGTCATATAACAGAAGCTTTACAGCAATCTGTCCTTGCATTTGATCAAGAGGTCCTCTCTTACTATCTTTTTGACTGTATTCATTTTCTTAATTATCCAAACTTATTCACCTTTTCTTTTCAAATTTTGGGCGTTCAGTTACCAGGTTTTCTCTCAAGTGACGCCCTTCTACATGGAGTGGAGGTAGTTTTTCTCTGTTTGTTAGGCAGTTTGTGTTCAAAGATTATCTGGTCCTTTTTTCCTTCGAACGACAAATGGTCTGAATGATAAATGCATTTGGGGAAAGTATTACTTTCAAGGCTGTGTGTTATAAGTTTATGAACTTCAAAAAATGTCTAAATACATCTCTAAACTTTCCATTTCGAAAAGTCAGGTTCCAGTTTTATACTAATAGCTTCTTCGTGAATCATAAAAAATATTAAATTGATCACATATTTAGTAGACACAAAATTAAAAGTTCAAGGATTTATCGAACACTTTTATAACTTAGGGGATCTATAAGAAACAAAATCAAAAGTTTTGAGACTTATTGGAGACATTTTCTAAAGATTTAGAACATATAAGACACAACTTTGAAAGTTCAAGGACTTGACAAATGACATGTCATTTAACCAAAACATTTTTAACTAATTAAACATCATAACATTTTTACTTTTGAGTAGAATCACCTTCCACTGAAATTTGTTTTGCTCTCAGACGAGAACAAGTTCCCCTGTTCAAATCCCACGCAACCCTGAGACTTATGAAAGCACATGTCTTAGAGGACTCTAT
CCGGTTGGTGAAGGAGCAGGCTATGCGGGAGGAATTGTAAGTGCAGCAGTAGATGGCATGTATGCAGGCTTTGCGGTAGCCAAGAGTTTCAATCTTTACCATGGTGACCTTGAGACGGTTTTGGGTAAGGCTCAAAGTTCTGGGTCCGTAATGTATTAGAACTAATCTGGTTCTCTTGCCAGTAATATAAGAGCCTGTTACTACAAATCTGGCCCCTCTGCATGTCAATTCCGGCAAGGCCATCTAAATCAACCAGATTCCACCTTCAACTTAAGTATAAAGGTACGTAAATGCAGTAATTAGTTTCTTGCTATGAAGTGTTAAGAACTTTTTAAGGTTATTCTTTTAAATAAAAAATTCTTGGTATGAATGAAGTTAGGTTCAAAGTATGTTTAAATTCTTTTTTGTCCTATAATTTTGCGAACATTCTATCTCAATCATTGAACTTTATTAAAAAAGAGTCATTCTGGTTATAGTCATTGATTTTTTTAACATATGGTTTAGGAGACACTTGACATAAGCATATTCATTCACTTGATCATTAGATAACTAGATATACAAACAAATATTTACAATTTTCTGATGAACCATCATGGGTTGGCCTAGTAGTAAAAAAGGAGACAAAGTCTTAATAATTGACTAAGAGGTCATGAGTTCAATCTATGGTGGCCACCTACCTAGGAATTAATTTCCTACGAGTTTCTTTGACACCCAAATGTTGTAGAGTGGTGCGCGTAAGCCAAACTGACATTTAAGAGTAGAAAGACTGAAATATAGTATTTGAAAGTTTAGGGACTGAAATATTCTAAGGAAGAAAGTGAGAACAAATGACTAAACTAAGACAGTTTTTTACCCTGTCATTATTTTTAACACAATTTACAGTTGTACATGCACTCAGTAAATGGAGATATATTCGTAAATCATGTGCTATTTTGTTTTTTTGTTTGTTTTTTTTTCAGCATTAAAAAAAATTAAAAAAAGACCATCCCTTTCAAGTTATAATTTGAATGGGTGCATACCGATAATGAAAACTTGTATGCTTTCTGTAAGCAGCTATACTTATGACCTTCGAAAGGAGCATCAAATTCATGTGTTGTAATTCGTACAAGGTTCTGCATCGAGTTGTTCGATTGGACGTGGAGACCTACATCTTCTATTTCGTCAATATGTGACAATTAAACGAGCTGCTCGGGACTTATACTGCAGCTGATTCAGACTGATTTCAAGCAGACTCTACTTGGTCCAGGTAAATGCTCATTTAAGTATCTAAACTCGTAAATTTTCGTGAATTAACTCATGATTATATTTTATTTTTACTTCTTCTAAATCATTGCAACAGTTGAGTTGATTTTTTGTTTTGTTTTTAATATACAAGTTTAGTTCTCGAACTCTCTAGATAGTATTTAATAGAATTTTTAAACTATTAATTTTGTGTTTAATGGATCCTTTTAACTTTAAAATGTTAAACCAAAATATGAATTTCAATTGTGTCCCTTACCTATTAGACGCTTTCTCAAAAACAAAATATTCACAGAACTTTTAATCACAAAATTGAAAGTTTAGGTTTCTATTACGCACAAAATTGAATTTTGACCAATTGTTGTATAAGTTAAAGATTTATTAGACAAAAAATTGAAAGTTCAGACTCTGTTTGGTAACTATTTCATTTTTTATTTTTTGTTTTTGAAAATTAAGCCTATTTCTTCTCAATTTCTTATAATGATTTGCATCTTTTTTAAGCACAATTCTAAAAACAATAACAAGTTTTTAAAAACTATTTTTTTAATTTTCAAATTTTGACTTAGTTTTTGAAAACATAGGTAAAAAATAGATAACAAAGCAAGAAATTGAGGAGTGATATGGGTGTTCATAGACTTAATTTTCAAAAACTCAAAATCAAAAAGCCAAATGATTACTAAATGGAACCTTAACGATTTTGTTTTACAGTTTTGAAGTTAAAAAATTTATTGGACATAGTTTGAAAGGAAGCTATTGATACA
ATTTTAAAAGTTCAAGACTTAATCTATAATTTAATATTTTCTTAAGTGAATTTATAATTTAATATTTACTGCCTAGTGTTGTTGTTATACAATTTTCTTGAAAATTTAAGGTTAGAACTGACTCTTTAAATTTATCCAACTTTTAATTGTTTATTCAGATAACAGCGACTCATAAAAATAACAAACTTATTTTCTCTCGAATCATTGACAAAATTATCTTACTGACATAATTTTTCTGTTTAAAAGAAATTGTTTGGATTATTTAAAAAGAAAAGTAAAAGAAAAAAAGAACATTCTTTATGATCCTTAAGGAATCACAAACTCATGAATAATCTTTAAAAAAAAAAAACTCATGAATAGTATGACTTTATTTCTAGGGTGAACACTAAAACCGATCAAACCAAACCGAATTGAACCGTATATATATATATATATATATATATATATAGAAACAACAACTTTCATTAGAAAAAATGAAAGAGTACAAGGGCCACAAAAAATTAAGCCCTCAAAACCTCTCTGAAGAAAGGATTTCCAACTAAGTAATATGCTACTTACTACCTAGAGGATAATTACAAAAAGTCTTTAAAACCAAAGTCTAAAAAGAAACATGGAACCTTATCAAGGACTAAATATATACGTATATTTTAAAAAATATGTATTATAGGAAAACCAAATCGAACTTAATATTTGTGGTTCGGTTTGGTCTAAACAAACGGTTTGGTTTGGTATATTCAAAACCGAAATTTTCAGTTCGAACCATGAAAACACCTACTTATTTCCTATGTTACATCAGGTTGGGAGTAAAATATACATGAAATATCATTTGTTTGTTTCATATGCAGCAGAGGGAACTTTCAGTGAGAGCTGAAGTCTCCACTCCAATATCTTATACATACTCTCAACACAGTTGCTTTTCGATCCTATGAGACATCAAATCAACACTGCAGATTCTGCATATTATTATCGGTATGTATTAACTTACATATTGCTGTATGACAATAATAAGATTATGCCAATAATATTAGCTATAGGATAACTTTCATTAAGACGTGAAAGTTAGGAGTAGAAGAGATATAGGAAAACATTAGTCCTATAGCCAAATTTACGAGGTTAAATTATAAAAAATACTCCTAAGTATGGAGTCGGTTTTAATTATACCTTGAACTCTGAAAAGTTTTTTATTTTAACTCTTGAAGTTTGAATTTTGTTGTTGAATTTGTCGGTAGAAAAGTTGGCCTGTACACTCGCGGATATAAAAAAAACTTAAAGTTCTCTCTCGGATTATAATTATGTTTTAGCTTTGATTACTTTTATGTAAACCATCTCATAACACATGCGATGTCCGAGAAATTTATACATTTATTTTAGTACATAAAAAAAGTCTAAGAATATTTTAATTAATGCAATAGCCTTTTACATTATTTGATCCAAATAGTTACAACTATTTGAAGAAAAAGACAAGATAGGAATTTATGAAATTTTATAACTCAAAGATTGTCAGCTCAAACATATATCCACTAAGGTCAACTCACGCTCCAACCATATTCAAACCCTTTTTGGTTTTACGGTCTTTAACCATTTTGCAGCTCCAAATTCGACTAAGGTCATTGTAGGCATGGATTTCGGACTCCACGCAACATCTTTGGCTTCTAAATTGTGAAAAAGTCCTCAATAATAATCAATCTCCAACTCTTGGAAGGAAACCCACGAAAATAATTTAAAGACTCTTTAAGAATGAAAGTTCCTCTGAATAGATTAGTTCTTTAGGTCATTCTTACCCATGAGGATGTGTATAGAGTGAGAATTCACCTCCAGTCTTGTGTATTAGCTAGCTTTTGAAGATGTAGTGTGAAAGCCTCCTAAATCTTCTCTGTCTTACTTTTTGTAATAGCTCCTTCGAATATATGCAATAAATCAATGGTTGGAATCACATAAAAATGGAGATATAAAGATTATTCTCTCATAACCATTTTTTATATATATAGTAAGTATTGATT
TTAACAAATTAGAATATACTGTTTGTTACATTTGATCAAGGATGAAAATATTCGTGGATATGTCGATATATCCGTAAATTGAAGGGTTTGATATCGATATTAAATATCCATGGATATCTCCAACATCTTTTATTAATGCTTGTAACACATAAAAATGTCATATATTTAGTTCAATATGAACAAGAATACTAATAACAATATACATAACTTAGTTTGGGAGTAAAAACAACTTAATTATTAATTTAAATAAATTTTATAAAATTTGTGATTTACAAAAATATCCGTGGATATCGATATTTTTGTCGATACATCCATCAATATATCTATAAAATTGAAATGTCGATATCAATATCGACATCGATATTTTAATCCAAATTATTTATCGAGTTCAAAACTAAGTTTAAAGTTTAAGTTCAAAGTAGAAGTTTAAGAAAAGAGAAGAGGGCGTGAACACAGATGAGAAAAATGGGAAGAGAGAGGAGATAAATTTCTGTAATTATTTTTTAATAAATATCTATGCATGTCAGACATTTTATTAATACTTTTGTCGACATTTATGTAAAATTGAGATTTTGAACATCGACATTTTATTAATACTTCTGCCAATATTTCTATAAAATTGAGACATCGACATTTTTGTTTACGTTGATATTTTAAAAGCTTGACTGTAACTAGTTTAATATTTAGTTTTCAATTTAATTTAAATTCTATTTAATTGAACTGTGAAATTCAATAATTCAATTTTGGATGAAACGAAATCTTCTCAAATTTTTTAATCTATAGTTCTTCCATGGAAAAACTAGCAATTCAGTTAACATTAATCCACAAAATATTCCATCAGTCACGAAAATGCTCAAATTCAGCGTACATACGTGAATGTTTAAAACGTCAACATCAACATTAATATCGAGATCTCAATTTTATATAAATGTTGATAAAAATATCGATAAAATATCAATGTTGATAAATATTTCTGATTTTTTTTTTCAATAAAACAAAATACTTAACACATATTTAAATTAATTCTTGGAAGTTGGCCAAGGCAAGATCCAATTCTACGGACAAACCCAAGCTCATAGGGTATCGGGAGGAGGCATATGAGAACACACGTCCCAGAAATGGAAAGTCAATGTTTCAGAGTTTGGGTCGACCAACGAAATCGATAAATATATACGAATCAAGAAGTAATCGAGCTATTCTTGGTCTAGCACCCTACAAAATCGAGTGTTAGAAAAAATCTTGACATTGCCAGGGTTGCACGTAGTTAGAGACAAGTGTTAGAGTAAATAATCTTCAAATGAAAGTTATGACTTTACTTCTTGTGTGCTTAGTTGAACTTTACGCTTCTTATACTAACTTAGATCATAGTATGGTAGACTTTGCGATGATCTCTTACACAAGCTAGCTCGAAAAGCGAGACCAACTCAGGAAAAGTGGGTCGAAGGTCAAGCCCACAAGCAGCAACGAAAATGAACGTGGACGAGATCCAAACGCTAAGATTAATAAATAATAATTTTACACTTTTAAAATGAGTTAAAATACTTGTTTTAATATTATGGTCATGTCATGTTAGGTTACATTTTTTTGGTTTTTTTTTAATATTTTATCATTTTAGTACCGGTGTAACCACACATTCTCGTGTGCCATATATTTTTTTTTACTATATTAGAGATAAAAAGATTTGAACCACATACCTCTTAGTTGCTAACATATACTAAACTATGATTTTTTTTTGTTTTAAAAAATATACTTGCTTTGTCTTTTAGATTTAAATTTAATACTATGGCGTGCGTGCGTGTGTGTATATATATTTACTTATTGTAATATATTTATAAAAATACTTTGCGAAAAGGGACTTATAAATACTGCAATTTACATAAGTAGTTTTGTGCGGCCACAAGCACGTAAAAAACTATCCTTTAAAAAAAAAAAGCACGTCAAAGACTAATAAAAATTCTTCAAAAATATAATAATGATGATAATAATAATAATGAGATTTT
AGGGTTTTCATTCTCCTTCGTTTGAGCAGCCGTGCTGAGTGAGTACCCTCTCTCTCGTCCATCTGCTGGCCTGACGTTCGTCGCTCGCTCGCCGCGCCGCCGTCGTTTATCCTTCGTCGCGCCGCCGGCTTTCCTTTTGCTGCACCTCCGTTCTTCTCGTGAGTCGATTTCCTCCTCACCTTCTTATTTTCTTTCGTCCACCAGCCCACGGTTGTAGATCTAAGAATCTCCATTTCTGATCTACATGCGCATTTAATTCTGTTCTCTATGATGTTTATGGGAAGAATTAGGTTCGGGTTATTAAGCGAATGCTTTAATCTTTGATTTGCATATTCATGGGATGCTAATGGAGTATTTTATGTAATTTTCTTGTAGTTTTCTGATAACCGGCCCTTCCATGGCTATGTTGATCATTGAGTTTGAAGGCTGAGTTATTGTTTGAACATTTCTGAATAGTTCATTGATTTATTGGAAGTTCCACTTAAGGCAACAAGGTCAGAGTTGGGCTCGACTAGGATTTTCCATGGCGCATCGGTTGCTAAGGGATCTGGAAGCTGATGGCTGGGAGCGCTCTGATTTTCCCATCATCTGCGAGTCTTGCCTCGGTGATAACCCCTACGTTCGCATGGTACGTTCGTTCTTGGTTCTTCTATCTTATTTAACGTTAGTACCCAGACCCCATGAGTTACTTTGATCCCGTTCTTGAATTACGTTCAAAATTTCACAATTAATCACTTGATTGTTAGAATAAGTCCTCCTTAAATTGTCCTGTTCGTGAAAATACTGTAAATTAATGTACTTGGTAGAACACGAACCCAGTTGGTTTCTGTCAACCGTTATGAATTTTATGTTCACTAGCATGAAATGTGTGGGAGAGCAATGGATAGATGATGGTTGTCGGAAGTTGTGTTCTTTTGCAATTATTGTCTTAGTATTTGATTAAGGCATAACTTTGATTAAATGAATGTGATATTTGAATGCAGACGAGAGCTGATTATGATAAAGAATGCAAAATTTGCACGCGCCCATTTACAGTTTTCAGGTGGAGGCCTGGACGTGATGCTAGATATAAGAAAACAGAGATTTGCCAGACATGTAGTAAGTTAAAAAATGTATGTCAAGTTTGCCTTTTGGACCTGGAATATGGCTTACCAGTTCAAGTTAGAGACACTGCCCTGTCTATCAACTCCAATGATGCTATTCCCAAGAGTGATGTGAATAGGGAATATTTTGCCGAGGAGCATGATCGGAAGGTATACACTTTCTGCTGAAGTTTTTCCTTTAATTATAGCATGATTACTCCTTGTAGTTGTTGGTGAAATACATTTGGTGAGGATTAAAGTAATTGTAATTTTTCAAGTTTATCGCTTTAGCAGGTCTTTGCAACTTTGTTGTTGCTTACTTTATGCTAATTTGGTTCAATGTTGACCATGCAGGCCAGAGCTGGTATTGATTATGAGTCCTCATATGGAAAGGCACGACCAAATGATACTATTCTGAAGCTTCAACGAACCACACCGTACTACAAGAGGAACCGTGCACATGTTTGTAGTTTCTATATTAGGGGTGAATGCACTAGAGGTTCTGAATGCCCTTACAGACATGAGATGCCTGAAACTGGAGAGTTGTCCCAGCAAAATATTAAGGATCGTTATTATGGGTATGGTTGATCTATTCCTTTTGAATGATTGGCTGTAGTGTGTAATCTTTGTCCCTGACCACCCAAATTTCTCTTTGATTTTTTCAGTTATTATCTTGCATTTGTTTGCATATTTTAAGCTTCGTTTGAGCTTTATATCATATGAAGTATAGTCTTCCGAAAGTAATAATTTGTTGGAAAACAAGGATAATTGCTCGGTTATTCATGCGTAAACAATTGATCTAAAGTGCCGGACACTTTATAAGATTATCGAGCAACTTATAAGGGGTCTTACTTGGTTTGATGAGAGATTCTTGAGAGGTTGGTTTACTTAGGGGGCTTGAAATTGGGAATTTAAGGA
TTTAAGGATTCACAACAAAGCTTTATTGGCCATATGGTTGTGGCATTTTCCCCTTGAATCCAACTCTCTTTGGCATAGGATTATTGTAGGTAAGTTTAGGCCTAATCCTTTTAACTGGCTGGCTGGTAGGGTTAAAGGCACTAATTGGAATTAAGGGAAAGATATTTCTTCAGAGCTTCCTTTTCTCTCTGACTTGGTTCATTTTGTGGTGGGGGCGGGGAAAGAAACATAGTTCTAGGAGGATCATTGGGTGGGATTAAGTTTCCTTTGCTCTGCGTTTCCTCATCCTTTTCATTTGTCCTCCTGTAAAAATAGAGCAGTTTCTGATTTTTTTGGTATGGACACGGAGCTCTAGTTCCTTTTCGTTGGGATTTTGTTGCTCTTTGACCAATAGAGAAGCAACGAATGTGGCTTCTCTTTTGGCTTTGCTGGGGAAGGTTGTCTTTAGGCTTGGGAGAAGGGAATTTTGTGTGTGGAGTCCAACCCTTCGGAGGGATTCTCTTGCAAGTTTTTTTGTTTTTTGTTGGACCCTTCTCCCATTTGTGAGTTTGTTTTTCTTCTCTATGGAGGATTAGGATTTCAAGGAAAGTGAAAATTTACACCTGGCAAGTGTTGCATGGCCATGTTAACACTTTGGATTGGCTTGTGAGAAAGTTGCCTTTGCTAGTTGGCCGTTTTTGCTCACTGGACACAGGAACATCAGTGAGTTGATTGGGGAGTTCCTTCTCCATTTGCCTTTGCACGTAAGGGCCAGTTTTTTATGGTTTGTTAGGAAATGTACTATTTTGTGGGACCTTTGGGGCGAGCAAAATGACAGTGTTTAGAGAGTTGGAGAGGGAGCTTAGTGAAGTTTGTTCCCTTGTTAGATATCATGTTTCTTATATTTCTAGATTGATCATTTTCTTTCTTCTCTTTGTGGGGAAGCTTCTATTTTAGATTTTTCTTTTACATTCATTCTGACCAACAAGTGGCCATTGGATAAAAAATCTACTCTTGGTGCTAGCTACTTTAGTTCCTGCACCATTTCTTGCTGTTAGGTTGCCTTACCCTCTGTTTTATCCATCCACAGGATTTTTAATTCCAACCTCTATTTCTTCTCTTTCATTAGAACACTGTTCTGCAGTCATTTTCCGATAGATACTGGATGGAACCACCTTTAGGCTACCATTGTCTTTCTCTATGAACTACCTTTTTCTTGAAATTCAGAAATTGTATGTATAGAAATAGTAGGATATGCAACTTTATTATTGTTTTAGATAATTACCTAATTTTTAGATATCCTAAACATTGATAATTGTTTAGAATTTTATTTCTACAGTTAGTTAAGATTAAATATACCTGTTTGTCAAATAATAGTATATACTGATTATTTGGTATTTATTTCATTCATCGTGGAAAAAACTAGGGTTGGTTTGCTTGGTATTTCTTAGCTTCGATGTTTTCTTGGAAATTCTCTCTTTCATAAAAGTCTCAACAATGTATGATATGTACATAGTCATTATCATTCCTTGTTTTAAATTGAATGGCATGCAAAACGCCAAAACCCTAGTTAAATAGGAGTTAGTGGTTCTGTGTACAAGAGTTTGATGCCTATGATTCAGAGTTGCACAAGTCTCTTATTGGGACTAAAGTTGCTATGCACTTTGGTACTCGTATTTATTTATTATTATTATTTGTTTGGGGGGGGGGGGGGGGGTGGGGCAGCATCCAGAGAAAATGAAAGAATACAGGGTGAACATAAACTGGAACCTACATGCATTTGAAAGAAGGGGGAGAGAGTGTGAGAGAGAGAGACCAAACCTCCTCCCAAATACCTCTCCAAACAGTATAATCTACTAGCACTCTAGAGAACACTACCCTTAAGGAAGATGAAACAGAACCTCCTCGTATTAAGTGTTTCACAAGGCCAGAATCCCTGTTATAGGCCACACAAACCTCAAAGGTCTCCGATATACAGTCCCAAATTGAACAAGTGAACATAACTTCCTCGATCTAAGAAC
TCGCTCCGGCAAATTACCCCCCACCAGAGGGTACAGGTAAGCAATTTGGGAATTAGGAGGGAATTAGCGTTGATATAGGTGGGTCACCCTTCATTTGATCAAAGTAAGGGTACACATCACCAATTCGGGAGTTATAAAGGAGTTACGACCTTACTTGAACAGGAGTTGTTCTAGAGGTTGTACAACCTTGCTTGAATAGGAGTTCGATAAATGAAGGGTAGTATCTAATTTAATTTGATTGATGGTTGCATGCTGATTACAAAATCGAAATAGATGACAGATATATTTCATTCTCATATTTCGGTCTTCTATTTGCTAGAATGTTAAAAATGTTTTTGAACGTGCCACATATTTTATCGTTTACAATGTGGGCAGATATATTACTAATGAAATTACTTTCTTTTCTAACTCCTCAGGGTTAATGATCCAGTTGCACTGAAGCTTCTTAACAAGGCTGGAGAGATGCCTTCTCTGGAACCTCCAGAGGATGAGAGTATTAGAACCTTGTATGTGGGCGGACTTGATGCTCGAGTCACCGAGCAAGATCTTCGAGATAACTTTTACGCTCATGGTGAAATTGAATCTATCAGAATGGTGCTACAACGGGCATGCGCTTTTGTAACCTACACCACCCGAGAAGGTGCAGAGAAGGCTGCAGAAGAGCTTTCAAACAAGCTGGTAATTAAGGGCCTTAGGCTGAAATTGATGTGGGGCAGACCTCAAGCACCAAAAGCCGAGTCAGAAGGTTCTGATGAAGCAAAACAAGCGGCAGTGGCTCACAGTGGAATGTTGCCGCGAGCAGTTATCTCGCAGCAGCACAACCAATTACACCCTCCGGGAACTCATGACCAACCTCAAGCTATGCACTACTTCAATATTCCACCGCCACCACCTCAGCAGGAGAGAGCATTCTACCCATCAATGGACCCTCAAAGAATGGGAGCTCTGGTTTCGACTCATGATGTGGGGGTGCCCCCTAACGGACCTACAGGTTCAACCGAAACTAGACCCGGTTCTGAGAAACAACACCAACAGGGACATCAATTTCCCTATCACTCAATGCACCCACCCCCACCTGCACAATATCAACAGCAATTTTATCCGCCGTACGGGTACATGCAACATTATCCACCGTACCCTCCTTATCATTCGAACATGCCACCCCCGCCCCCGTCCCAGTCCCAGTCCCAGCCCCATCCTCCTTCAGGTTTGCAGCAATATCAGCAGCAGCATTCTACACCACCAGGCTCAGCCCCTCAGTCTCATGGAGGAGCATCTTCAGTTTCAGCCCCGCTGGGATCAACACCTTCAGTGTCTGCTCCATCTTCAACGTCTTCCGAACCTGCATCATCGTAG

mRNA sequence

ATGTATATCCGAATCAAGCGCCATAAGACAACTTACTTTATCCAGTGTGATCCAATTGAGACAACTTTAAATATCAAGCAAAAATTAGAGTCCCTTATTGACCAACCAGTAGTTGACCAGCGCTTGATCCTGGTGGGGAGTGGGGAAGTATTGGAGGATTCAAAGACACTGGCTGATCAGAAGGTTGAAAATGATGCAGTTGTGGCTCTAACGTTGCGAAAAGATGATAATGACTTCGAGGAGATCGACATCGTCCAGCCAAATGATTTCTACCAATCTCGTGATGCCGATTCTGGCAATTGTATTCCGTTGGTCATCCGAATTCCAAATTCTGCAGTTGCAGTCTACTGCTTTCGTTCTCCGTGGGCTACTCCCACTCCATCCGCCATGGCTCTTCTTCCCACCAAGCTCGCTCTCGGATGCCCTAATTCTACACTCTTTTCTGCAGCTCCCAGGCTCTCTTCTCTTCGTCTTCCACCATTCCGAGTCTCATGCGCCAACAAACGGACCGGTAAGAGGAGGTATCCATCGGAAAAGAAGAAGCTCAAATTGAAACACAAAGAAGTCCTCACGACCGTCGAGAACAAGTTCGAAGGCATTTGGAGGTTGTTCAAGCTCGGAGTTCCCGTGGAGAAGGATCCTGGCAAGGATTTTCATGGCCTCTCGGATGCTTTGATGCAAGAGATTGCTAAAGTGCTCGAGTTCCCGGTCGCTTCGTTGCTACCGCGGGAAGCTTTCTCAGTTATTCGTAAATCTTTTGACGCTAGAAAGATGTTGAAGGAACCAAAGTTTGTTTATACTGTGGACATGGATGTACATAGGCTACTGATTCTTGAACCTCGTGCTCAGGATTTCATTTCGGACTTGGAGCCTAAAGTTGGATTGATGGAACATTTTGCAAAAGAAAAGGTATCCAATGATGTAATCAGTATCGTTCATGATCTCAAAAGTAATCAGGAAGTGGTGGGAGCAAATGGACTTAACGGTCATTCTGGCCCGTACTTGCGCTTGTCAAATGGTAAACCAAAAGTTGCTGTTGTTGGCAGTGGGCCATCTGGCCTTTTTGCCTCACTTGTCCTTGCAGAGTTTGGGGCCGATGTTACCTTGATTGAAAGAGGTCAACCAGTGGAACAAAGAGGGCGTGATATTGGTGCATTGGTAGCTCGTCGGATTCTGGAGCTGGATAGCAATTTTTGCTTTGGGGAGGGTGGTGCAGGTACCTGGAGTGATGGGAAGTTGGTCACTAGAATTGGTAGAAACAGTGGCAGCGTGCAAGCGGTTATGAAATCTTTAGTTTATTTTGGGGCCCCAAAGAAAATCTTACTCAATGGAAAGCCTCACCTTGGAACAGACAGGTTGATTCCATTGCTTAGGAACATTCGGCAGCACTTAGAAATGTTGGGTGTGAGTATCTACCCTATTCTGTCCTTGGATGTCAATATCAAGTTCGGGACCAGGGTTGATGATCTAATTGAAGAGAGTGGACATATAGTGGGTGTTAAAGTTTCTGATTCAAGAGACAAGTCAAAGCTCAGCAACCAGAAGCTTGAATTTGATGCCATTGTCCTAGCTGTTGGCCATTCCGCGCGCGATGTATATCAAATGCTTGTGTCTCATAACATTCCCGTGGTTCCCAAGGAGTTTGCCGTTGGTTTAAGGATCGAGCATCCTCAAGAATTAATAAACAGCATACAGTATTCTGGATTGGCCAATGAGGTAGAGAAAGGACGTGGAAAAGTACCCGTGGCAGATTACAAAGTTTCCAAGTATGTTAACGTAGACACGGAGAATCCGTCCTCCAATTCTCTTGCAGCAAGTCGCAGTTGCTATTCATTTTGCATGTGTCCTGGTGGCCAGGTTGTCCTCACAAGTACAAACCCGGGAGAACTTTGTATCAATGGCATGTCATTCTCTCGACGTTCATCAAAATGGGCAAATGCTGCCCTTGTTGTTACTGTTTCAACTAAGGACTTTAATGATCATGGTTTCCACGGACCTCT
TGCTGGGGTTGAATTCCAGAGAGAACTTGAGCAAAGAGCAGCCGTCATGGGAGGTGGAAATTTTGTTTTGCCCGTGCAGACAGCTACTGATTTTATGGACAGAAGATTAAGAGTGACATCTGTGCCGCCATCAAGTTACCGGTTAGGAGTGAAGGCCTCAAATCTCCACGAGTTATTCCCTGGTCATATAACAGAAGCTTTACAGCAATCTGTCCTTGCATTTGATCAAGAGACGAGAACAAGTTCCCCTGTTCAAATCCCACGCAACCCTGAGACTTATGAAAGCACATGTCTTAGAGGACTCTATCCGGTTGGTGAAGGAGCAGGCTATGCGGGAGGAATTGTAAGTGCAGCAGTAGATGGCATGTATGCAGGCTTTGCGGTAGCCAAGAGTTTCAATCTTTACCATGGTGACCTTGAGACGGTTTTGGGTAAGGCTCAAAGTTCTGGGCAACAAGGTCAGAGTTGGGCTCGACTAGGATTTTCCATGGCGCATCGGTTGCTAAGGGATCTGGAAGCTGATGGCTGGGAGCGCTCTGATTTTCCCATCATCTGCGAGTCTTGCCTCGGTGATAACCCCTACGTTCGCATGACGAGAGCTGATTATGATAAAGAATGCAAAATTTGCACGCGCCCATTTACAGTTTTCAGGTGGAGGCCTGGACGTGATGCTAGATATAAGAAAACAGAGATTTGCCAGACATGTAGTAAGTTAAAAAATGTATGTCAAGTTTGCCTTTTGGACCTGGAATATGGCTTACCAGTTCAAGTTAGAGACACTGCCCTGTCTATCAACTCCAATGATGCTATTCCCAAGAGTGATGTGAATAGGGAATATTTTGCCGAGGAGCATGATCGGAAGGCCAGAGCTGGTATTGATTATGAGTCCTCATATGGAAAGGCACGACCAAATGATACTATTCTGAAGCTTCAACGAACCACACCGTACTACAAGAGGAACCGTGCACATGTTTGTAGTTTCTATATTAGGGGTGAATGCACTAGAGGTTCTGAATGCCCTTACAGACATGAGATGCCTGAAACTGGAGAGTTGTCCCAGCAAAATATTAAGGATCGTTATTATGGGGTTAATGATCCAGTTGCACTGAAGCTTCTTAACAAGGCTGGAGAGATGCCTTCTCTGGAACCTCCAGAGGATGAGAGTATTAGAACCTTGTATGTGGGCGGACTTGATGCTCGAGTCACCGAGCAAGATCTTCGAGATAACTTTTACGCTCATGGTGAAATTGAATCTATCAGAATGGTGCTACAACGGGCATGCGCTTTTGTAACCTACACCACCCGAGAAGGTGCAGAGAAGGCTGCAGAAGAGCTTTCAAACAAGCTGGTAATTAAGGGCCTTAGGCTGAAATTGATGTGGGGCAGACCTCAAGCACCAAAAGCCGAGTCAGAAGGTTCTGATGAAGCAAAACAAGCGGCAGTGGCTCACAGTGGAATGTTGCCGCGAGCAGTTATCTCGCAGCAGCACAACCAATTACACCCTCCGGGAACTCATGACCAACCTCAAGCTATGCACTACTTCAATATTCCACCGCCACCACCTCAGCAGGAGAGAGCATTCTACCCATCAATGGACCCTCAAAGAATGGGAGCTCTGGTTTCGACTCATGATGTGGGGGTGCCCCCTAACGGACCTACAGGTTCAACCGAAACTAGACCCGGTTCTGAGAAACAACACCAACAGGGACATCAATTTCCCTATCACTCAATGCACCCACCCCCACCTGCACAATATCAACAGCAATTTTATCCGCCGTACGGGTACATGCAACATTATCCACCGTACCCTCCTTATCATTCGAACATGCCACCCCCGCCCCCGTCCCAGTCCCAGTCCCAGCCCCATCCTCCTTCAGGTTTGCAGCAATATCAGCAGCAGCATTCTACACCACCAGGCTCAGCCCCTCAGTCTCATGGAGGAGCATCTTCAGTTTCAGCCCCGCTGGGATCAACACCTTCAGTGTCTGCTCCATCTTCAA
CGTCTTCCGAACCTGCATCATCGTAG

Coding sequence (CDS)

ATGTATATCCGAATCAAGCGCCATAAGACAACTTACTTTATCCAGTGTGATCCAATTGAGACAACTTTAAATATCAAGCAAAAATTAGAGTCCCTTATTGACCAACCAGTAGTTGACCAGCGCTTGATCCTGGTGGGGAGTGGGGAAGTATTGGAGGATTCAAAGACACTGGCTGATCAGAAGGTTGAAAATGATGCAGTTGTGGCTCTAACGTTGCGAAAAGATGATAATGACTTCGAGGAGATCGACATCGTCCAGCCAAATGATTTCTACCAATCTCGTGATGCCGATTCTGGCAATTGTATTCCGTTGGTCATCCGAATTCCAAATTCTGCAGTTGCAGTCTACTGCTTTCGTTCTCCGTGGGCTACTCCCACTCCATCCGCCATGGCTCTTCTTCCCACCAAGCTCGCTCTCGGATGCCCTAATTCTACACTCTTTTCTGCAGCTCCCAGGCTCTCTTCTCTTCGTCTTCCACCATTCCGAGTCTCATGCGCCAACAAACGGACCGGTAAGAGGAGGTATCCATCGGAAAAGAAGAAGCTCAAATTGAAACACAAAGAAGTCCTCACGACCGTCGAGAACAAGTTCGAAGGCATTTGGAGGTTGTTCAAGCTCGGAGTTCCCGTGGAGAAGGATCCTGGCAAGGATTTTCATGGCCTCTCGGATGCTTTGATGCAAGAGATTGCTAAAGTGCTCGAGTTCCCGGTCGCTTCGTTGCTACCGCGGGAAGCTTTCTCAGTTATTCGTAAATCTTTTGACGCTAGAAAGATGTTGAAGGAACCAAAGTTTGTTTATACTGTGGACATGGATGTACATAGGCTACTGATTCTTGAACCTCGTGCTCAGGATTTCATTTCGGACTTGGAGCCTAAAGTTGGATTGATGGAACATTTTGCAAAAGAAAAGGTATCCAATGATGTAATCAGTATCGTTCATGATCTCAAAAGTAATCAGGAAGTGGTGGGAGCAAATGGACTTAACGGTCATTCTGGCCCGTACTTGCGCTTGTCAAATGGTAAACCAAAAGTTGCTGTTGTTGGCAGTGGGCCATCTGGCCTTTTTGCCTCACTTGTCCTTGCAGAGTTTGGGGCCGATGTTACCTTGATTGAAAGAGGTCAACCAGTGGAACAAAGAGGGCGTGATATTGGTGCATTGGTAGCTCGTCGGATTCTGGAGCTGGATAGCAATTTTTGCTTTGGGGAGGGTGGTGCAGGTACCTGGAGTGATGGGAAGTTGGTCACTAGAATTGGTAGAAACAGTGGCAGCGTGCAAGCGGTTATGAAATCTTTAGTTTATTTTGGGGCCCCAAAGAAAATCTTACTCAATGGAAAGCCTCACCTTGGAACAGACAGGTTGATTCCATTGCTTAGGAACATTCGGCAGCACTTAGAAATGTTGGGTGTGAGTATCTACCCTATTCTGTCCTTGGATGTCAATATCAAGTTCGGGACCAGGGTTGATGATCTAATTGAAGAGAGTGGACATATAGTGGGTGTTAAAGTTTCTGATTCAAGAGACAAGTCAAAGCTCAGCAACCAGAAGCTTGAATTTGATGCCATTGTCCTAGCTGTTGGCCATTCCGCGCGCGATGTATATCAAATGCTTGTGTCTCATAACATTCCCGTGGTTCCCAAGGAGTTTGCCGTTGGTTTAAGGATCGAGCATCCTCAAGAATTAATAAACAGCATACAGTATTCTGGATTGGCCAATGAGGTAGAGAAAGGACGTGGAAAAGTACCCGTGGCAGATTACAAAGTTTCCAAGTATGTTAACGTAGACACGGAGAATCCGTCCTCCAATTCTCTTGCAGCAAGTCGCAGTTGCTATTCATTTTGCATGTGTCCTGGTGGCCAGGTTGTCCTCACAAGTACAAACCCGGGAGAACTTTGTATCAATGGCATGTCATTCTCTCGACGTTCATCAAAATGGGCAAATGCTGCCCTTGTTGTTACTGTTTCAACTAAGGACTTTAATGATCATGGTTTCCACGGACCTCT
TGCTGGGGTTGAATTCCAGAGAGAACTTGAGCAAAGAGCAGCCGTCATGGGAGGTGGAAATTTTGTTTTGCCCGTGCAGACAGCTACTGATTTTATGGACAGAAGATTAAGAGTGACATCTGTGCCGCCATCAAGTTACCGGTTAGGAGTGAAGGCCTCAAATCTCCACGAGTTATTCCCTGGTCATATAACAGAAGCTTTACAGCAATCTGTCCTTGCATTTGATCAAGAGACGAGAACAAGTTCCCCTGTTCAAATCCCACGCAACCCTGAGACTTATGAAAGCACATGTCTTAGAGGACTCTATCCGGTTGGTGAAGGAGCAGGCTATGCGGGAGGAATTGTAAGTGCAGCAGTAGATGGCATGTATGCAGGCTTTGCGGTAGCCAAGAGTTTCAATCTTTACCATGGTGACCTTGAGACGGTTTTGGGTAAGGCTCAAAGTTCTGGGCAACAAGGTCAGAGTTGGGCTCGACTAGGATTTTCCATGGCGCATCGGTTGCTAAGGGATCTGGAAGCTGATGGCTGGGAGCGCTCTGATTTTCCCATCATCTGCGAGTCTTGCCTCGGTGATAACCCCTACGTTCGCATGACGAGAGCTGATTATGATAAAGAATGCAAAATTTGCACGCGCCCATTTACAGTTTTCAGGTGGAGGCCTGGACGTGATGCTAGATATAAGAAAACAGAGATTTGCCAGACATGTAGTAAGTTAAAAAATGTATGTCAAGTTTGCCTTTTGGACCTGGAATATGGCTTACCAGTTCAAGTTAGAGACACTGCCCTGTCTATCAACTCCAATGATGCTATTCCCAAGAGTGATGTGAATAGGGAATATTTTGCCGAGGAGCATGATCGGAAGGCCAGAGCTGGTATTGATTATGAGTCCTCATATGGAAAGGCACGACCAAATGATACTATTCTGAAGCTTCAACGAACCACACCGTACTACAAGAGGAACCGTGCACATGTTTGTAGTTTCTATATTAGGGGTGAATGCACTAGAGGTTCTGAATGCCCTTACAGACATGAGATGCCTGAAACTGGAGAGTTGTCCCAGCAAAATATTAAGGATCGTTATTATGGGGTTAATGATCCAGTTGCACTGAAGCTTCTTAACAAGGCTGGAGAGATGCCTTCTCTGGAACCTCCAGAGGATGAGAGTATTAGAACCTTGTATGTGGGCGGACTTGATGCTCGAGTCACCGAGCAAGATCTTCGAGATAACTTTTACGCTCATGGTGAAATTGAATCTATCAGAATGGTGCTACAACGGGCATGCGCTTTTGTAACCTACACCACCCGAGAAGGTGCAGAGAAGGCTGCAGAAGAGCTTTCAAACAAGCTGGTAATTAAGGGCCTTAGGCTGAAATTGATGTGGGGCAGACCTCAAGCACCAAAAGCCGAGTCAGAAGGTTCTGATGAAGCAAAACAAGCGGCAGTGGCTCACAGTGGAATGTTGCCGCGAGCAGTTATCTCGCAGCAGCACAACCAATTACACCCTCCGGGAACTCATGACCAACCTCAAGCTATGCACTACTTCAATATTCCACCGCCACCACCTCAGCAGGAGAGAGCATTCTACCCATCAATGGACCCTCAAAGAATGGGAGCTCTGGTTTCGACTCATGATGTGGGGGTGCCCCCTAACGGACCTACAGGTTCAACCGAAACTAGACCCGGTTCTGAGAAACAACACCAACAGGGACATCAATTTCCCTATCACTCAATGCACCCACCCCCACCTGCACAATATCAACAGCAATTTTATCCGCCGTACGGGTACATGCAACATTATCCACCGTACCCTCCTTATCATTCGAACATGCCACCCCCGCCCCCGTCCCAGTCCCAGTCCCAGCCCCATCCTCCTTCAGGTTTGCAGCAATATCAGCAGCAGCATTCTACACCACCAGGCTCAGCCCCTCAGTCTCATGGAGGAGCATCTTCAGTTTCAGCCCCGCTGGGATCAACACCTTCAGTGTCTGCTCCATCTTCAA
CGTCTTCCGAACCTGCATCATCGTAG

Protein sequence

MYIRIKRHKTTYFIQCDPIETTLNIKQKLESLIDQPVVDQRLILVGSGEVLEDSKTLADQKVENDAVVALTLRKDDNDFEEIDIVQPNDFYQSRDADSGNCIPLVIRIPNSAVAVYCFRSPWATPTPSAMALLPTKLALGCPNSTLFSAAPRLSSLRLPPFRVSCANKRTGKRRYPSEKKKLKLKHKEVLTTVENKFEGIWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVIRKSFDARKMLKEPKFVYTVDMDVHRLLILEPRAQDFISDLEPKVGLMEHFAKEKVSNDVISIVHDLKSNQEVVGANGLNGHSGPYLRLSNGKPKVAVVGSGPSGLFASLVLAEFGADVTLIERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMKSLVYFGAPKKILLNGKPHLGTDRLIPLLRNIRQHLEMLGVSIYPILSLDVNIKFGTRVDDLIEESGHIVGVKVSDSRDKSKLSNQKLEFDAIVLAVGHSARDVYQMLVSHNIPVVPKEFAVGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVSKYVNVDTENPSSNSLAASRSCYSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFNDHGFHGPLAGVEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGHITEALQQSVLAFDQETRTSSPVQIPRNPETYESTCLRGLYPVGEGAGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQSSGQQGQSWARLGFSMAHRLLRDLEADGWERSDFPIICESCLGDNPYVRMTRADYDKECKICTRPFTVFRWRPGRDARYKKTEICQTCSKLKNVCQVCLLDLEYGLPVQVRDTALSINSNDAIPKSDVNREYFAEEHDRKARAGIDYESSYGKARPNDTILKLQRTTPYYKRNRAHVCSFYIRGECTRGSECPYRHEMPETGELSQQNIKDRYYGVNDPVALKLLNKAGEMPSLEPPEDESIRTLYVGGLDARVTEQDLRDNFYAHGEIESIRMVLQRACAFVTYTTREGAEKAAEELSNKLVIKGLRLKLMWGRPQAPKAESEGSDEAKQAAVAHSGMLPRAVISQQHNQLHPPGTHDQPQAMHYFNIPPPPPQQERAFYPSMDPQRMGALVSTHDVGVPPNGPTGSTETRPGSEKQHQQGHQFPYHSMHPPPPAQYQQQFYPPYGYMQHYPPYPPYHSNMPPPPPSQSQSQPHPPSGLQQYQQQHSTPPGSAPQSHGGASSVSAPLGSTPSVSAPSSTSSEPASS
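
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. This is a rough sketch that assumes the standard nuclear genetic code applies to this gene; the `translate` helper is hypothetical and only the first and last codons of the CDS are used as sample input.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons and last six codons of the CDS listed above.
print(translate("ATGTATATCCGAATCAAGCGCCATAAG"))
print(translate("GAACCTGCATCATCGTAG"))
```

The two fragments translate to `MYIRIKRHK` and `EPASS`, which match the start and end of the protein sequence shown above.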
Homology
BLAST of HG10006289 vs. NCBI nr
Match: XP_038890425.1 (uncharacterized protein Cbei_0202 isoform X1 [Benincasa hispida] >XP_038890426.1 uncharacterized protein Cbei_0202 isoform X1 [Benincasa hispida])

HSP 1 Score: 1271.1 bits (3288), Expect = 0.0e+00
Identity = 648/703 (92.18%), Positives = 667/703 (94.88%), Query Frame = 0
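
The percentages on the line above follow directly from the match counts divided by the alignment length. A small sketch, with a hypothetical `percent` helper, that reproduces them:

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the HSP summary line: 648 identical and 667 similar
# positions over an alignment of 703 columns.
print(percent(648, 703), percent(667, 703))
```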

Query: 130 MALLPTKLALGCPNSTLFSAAPRLSSLRLPPFRVSCANKRTGKRRYPSEKKKLKLKHKEV 189
           MALLP+ LALGCPNSTLFSAAPRLSSLRLPPFRVSCA KRTGK+RYPSEKKKLKLKHKEV
Sbjct: 1   MALLPSNLALGCPNSTLFSAAPRLSSLRLPPFRVSCA-KRTGKKRYPSEKKKLKLKHKEV 60

Query: 190 LTTVENKFEGIWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 249
           LTTVENKFEGIWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI
Sbjct: 61  LTTVENKFEGIWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 120

Query: 250 RKSFDARKMLKEPKFVYTVDMDVHRLLILEPRAQDFISDLEPKVGLMEHFAKEKVSNDVI 309
           RKSFDARKMLKEPKFVYTVDMDVH+LLILEPRA+DFISDLEPKVGLMEHFAKEKVSNDVI
Sbjct: 121 RKSFDARKMLKEPKFVYTVDMDVHKLLILEPRARDFISDLEPKVGLMEHFAKEKVSNDVI 180

Query: 310 SIVHDLKSNQEVVGANGLNGHSGPYLRLSNGKPKVAVVGSGPSGLFASLVLAEFGADVTL 369
           SIVHDLKSNQEVV ANGLNGHSGPYLR+SNGKPK+AVVGSGPSGLFASLVLAEFGADVTL
Sbjct: 181 SIVHDLKSNQEVVRANGLNGHSGPYLRMSNGKPKIAVVGSGPSGLFASLVLAEFGADVTL 240

Query: 370 IERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 429
           IERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK
Sbjct: 241 IERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 300

Query: 430 SLVYFGAPKKILLNGKPHLGTDRLIPLLRNIRQHLEMLGVSIYPILSLDVNIKFGTRVDD 489
           SLVYFGAPK ILLNGKPHLGTDRLIPLLRNIRQHL+ LG          VNIKFGTRVDD
Sbjct: 301 SLVYFGAPKNILLNGKPHLGTDRLIPLLRNIRQHLKTLG----------VNIKFGTRVDD 360

Query: 490 LIEESGHIVGVKVSDSRDKSKLSNQKLEFDAIVLAVGHSARDVYQMLVSHNIPVVPKEFA 549
           LIEESGH+VG+KVSDSRDK KLSNQKLEFDAIVLAVGHSARDVYQML+SHNIP+VPKEFA
Sbjct: 361 LIEESGHVVGIKVSDSRDKLKLSNQKLEFDAIVLAVGHSARDVYQMLISHNIPMVPKEFA 420

Query: 550 VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVSKYVNVDTENPSSNSLAASRSC 609
           VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKV+KYVN+ T+NPSSNSLA SRSC
Sbjct: 421 VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVAKYVNIHTDNPSSNSLAPSRSC 480

Query: 610 YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFNDHGFHGPLAG 669
           YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFND GFHGPLAG
Sbjct: 481 YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFNDLGFHGPLAG 540

Query: 670 VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGH 729
           VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTS+PPSSYRLGVKASNLH+LFP H
Sbjct: 541 VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSMPPSSYRLGVKASNLHDLFPDH 600

Query: 730 ITEALQQSVLAFDQ---------------ETRTSSPVQIPRNPETYESTCLRGLYPVGEG 789
           ITEALQQS+LAFDQ               ETRTSSP+QIPRNPETYEST LRGLYPVGEG
Sbjct: 601 ITEALQQSILAFDQELPGFLSSDALLHGVETRTSSPIQIPRNPETYESTSLRGLYPVGEG 660

Query: 790 AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQSSG 818
           AGYAGGIVSAAVDGMYAGFAVAKSFNL+HGDLETVLGKAQSSG
Sbjct: 661 AGYAGGIVSAAVDGMYAGFAVAKSFNLFHGDLETVLGKAQSSG 692

BLAST of HG10006289 vs. NCBI nr
Match: XP_004144792.2 (uncharacterized protein LOC101214567 [Cucumis sativus] >KAE8651641.1 hypothetical protein Csa_021224 [Cucumis sativus])

HSP 1 Score: 1241.5 bits (3211), Expect = 0.0e+00
Identity = 635/703 (90.33%), Positives = 656/703 (93.31%), Query Frame = 0

Query: 130 MALLPTKLALGCPNSTLFSAAPRLSSLRLPPFRVSCANKRTGKRRYPSEKKKLKLKHKEV 189
           MALLP+KL    PNSTLFS+ PRLSSL LPPFRVSCA KRTGK+RYPSEKKKLKLKHKEV
Sbjct: 1   MALLPSKLPFTYPNSTLFSSPPRLSSLHLPPFRVSCA-KRTGKKRYPSEKKKLKLKHKEV 60

Query: 190 LTTVENKFEGIWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 249
           LTTVENKFEGIWRLFKL VPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI
Sbjct: 61  LTTVENKFEGIWRLFKLVVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 120

Query: 250 RKSFDARKMLKEPKFVYTVDMDVHRLLILEPRAQDFISDLEPKVGLMEHFAKEKVSNDVI 309
           RKSFDARKMLKEPKFVYTVDMDVH LLILEPRA+DFISDLEPKVGLMEHFAKEKVSNDVI
Sbjct: 121 RKSFDARKMLKEPKFVYTVDMDVHSLLILEPRARDFISDLEPKVGLMEHFAKEKVSNDVI 180

Query: 310 SIVHDLKSNQEVVGANGLNGHSGPYLRLSNGKPKVAVVGSGPSGLFASLVLAEFGADVTL 369
           SIVHDLKSNQEVVGANGL GHSGPYLR+SNGKPK+AVVGSGPSGLFASLVLAEFGADVTL
Sbjct: 181 SIVHDLKSNQEVVGANGLTGHSGPYLRMSNGKPKIAVVGSGPSGLFASLVLAEFGADVTL 240

Query: 370 IERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 429
           IERGQPVEQRGRDIGALV+RRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK
Sbjct: 241 IERGQPVEQRGRDIGALVSRRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 300

Query: 430 SLVYFGAPKKILLNGKPHLGTDRLIPLLRNIRQHLEMLGVSIYPILSLDVNIKFGTRVDD 489
           SLVYFGAPK ILLNGKPHLGTD+LIPLLRNIRQHLE LGV+          IKFGTRVDD
Sbjct: 301 SLVYFGAPKNILLNGKPHLGTDKLIPLLRNIRQHLETLGVT----------IKFGTRVDD 360

Query: 490 LIEESGHIVGVKVSDSRDKSKLSNQKLEFDAIVLAVGHSARDVYQMLVSHNIPVVPKEFA 549
           LIEE GH+ GVKVSDSRDK KLS Q LE+DAIVLAVGHSARDVYQML+SHNIPV+PKEF+
Sbjct: 361 LIEEGGHVAGVKVSDSRDKLKLSKQTLEYDAIVLAVGHSARDVYQMLLSHNIPVIPKEFS 420

Query: 550 VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVSKYVNVDTENPSSNSLAASRSC 609
           VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKV+KYVN+DT+NPSSN LAASRSC
Sbjct: 421 VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVAKYVNIDTKNPSSNFLAASRSC 480

Query: 610 YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFNDHGFHGPLAG 669
           YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFND GF GPLAG
Sbjct: 481 YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFNDLGFRGPLAG 540

Query: 670 VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGH 729
           VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFP H
Sbjct: 541 VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPDH 600

Query: 730 ITEALQQSVLAFDQ---------------ETRTSSPVQIPRNPETYESTCLRGLYPVGEG 789
           ITEALQQS+LAFDQ               ETRTSSP+QIPRNPETYEST +RGLYPVGEG
Sbjct: 601 ITEALQQSILAFDQELPGFLSSDALLHGVETRTSSPIQIPRNPETYESTSVRGLYPVGEG 660

Query: 790 AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQSSG 818
           AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQ+SG
Sbjct: 661 AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQNSG 692

BLAST of HG10006289 vs. NCBI nr
Match: XP_008452584.1 (PREDICTED: uncharacterized protein Cbei_0202 [Cucumis melo])

HSP 1 Score: 1240.7 bits (3209), Expect = 0.0e+00
Identity = 635/703 (90.33%), Positives = 655/703 (93.17%), Query Frame = 0

Query: 130 MALLPTKLALGCPNSTLFSAAPRLSSLRLPPFRVSCANKRTGKRRYPSEKKKLKLKHKEV 189
           MALLP+KL L  PNSTLFS+ PRLSSL LPPFRVSCA KRTGK+RYPSEKKKLKLKHKEV
Sbjct: 1   MALLPSKLPLTYPNSTLFSSPPRLSSLHLPPFRVSCA-KRTGKKRYPSEKKKLKLKHKEV 60

Query: 190 LTTVENKFEGIWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 249
           LTTVENKFEG WRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI
Sbjct: 61  LTTVENKFEGTWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 120

Query: 250 RKSFDARKMLKEPKFVYTVDMDVHRLLILEPRAQDFISDLEPKVGLMEHFAKEKVSNDVI 309
           RKSFDARKMLKEPKFVYTVDMDVHRLLILEPRA+DFISDLEPKVGLMEHFAKEKVSNDVI
Sbjct: 121 RKSFDARKMLKEPKFVYTVDMDVHRLLILEPRARDFISDLEPKVGLMEHFAKEKVSNDVI 180

Query: 310 SIVHDLKSNQEVVGANGLNGHSGPYLRLSNGKPKVAVVGSGPSGLFASLVLAEFGADVTL 369
           SIVHDLKSNQEVVGANG N HSGPYLR+SNGKPK+AVVGSGPSGLFASLVLAEFGADVTL
Sbjct: 181 SIVHDLKSNQEVVGANGFNSHSGPYLRMSNGKPKIAVVGSGPSGLFASLVLAEFGADVTL 240

Query: 370 IERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 429
            ERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK
Sbjct: 241 FERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 300

Query: 430 SLVYFGAPKKILLNGKPHLGTDRLIPLLRNIRQHLEMLGVSIYPILSLDVNIKFGTRVDD 489
           SLVYFGAPK ILLNGKPHLGTD+LIPLLRN RQHLE LGV+          IKFGTRVDD
Sbjct: 301 SLVYFGAPKNILLNGKPHLGTDKLIPLLRNFRQHLETLGVT----------IKFGTRVDD 360

Query: 490 LIEESGHIVGVKVSDSRDKSKLSNQKLEFDAIVLAVGHSARDVYQMLVSHNIPVVPKEFA 549
           LIEE GH+ GVKVSDSRDK KLS QKL +DAIVLAVGHSARDVYQML+SHNIPV+PKEFA
Sbjct: 361 LIEEGGHLTGVKVSDSRDKLKLSKQKLGYDAIVLAVGHSARDVYQMLLSHNIPVIPKEFA 420

Query: 550 VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVSKYVNVDTENPSSNSLAASRSC 609
           VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKV+KYVN+DT+NPSSNS+AASRSC
Sbjct: 421 VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVAKYVNIDTKNPSSNSVAASRSC 480

Query: 610 YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFNDHGFHGPLAG 669
           YSFCMCPGGQVVLTSTNP ELCINGMSFSRRSSKWANAALVVTVSTKDFND GF GPLAG
Sbjct: 481 YSFCMCPGGQVVLTSTNPEELCINGMSFSRRSSKWANAALVVTVSTKDFNDLGFQGPLAG 540

Query: 670 VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGH 729
           VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGH
Sbjct: 541 VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGH 600

Query: 730 ITEALQQSVLAFDQ---------------ETRTSSPVQIPRNPETYESTCLRGLYPVGEG 789
           ITEALQQS+LAFDQ               ETRTSSP+QIPRN ETYEST +RGLYPVGEG
Sbjct: 601 ITEALQQSILAFDQELPGFLSSDALLHGVETRTSSPIQIPRNLETYESTSVRGLYPVGEG 660

Query: 790 AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQSSG 818
           AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQ+SG
Sbjct: 661 AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQNSG 692

BLAST of HG10006289 vs. NCBI nr
Match: KAA0056117.1 (FAD/NAD(P)-binding oxidoreductase family protein [Cucumis melo var. makuwa] >TYJ96400.1 FAD/NAD(P)-binding oxidoreductase family protein [Cucumis melo var. makuwa])

HSP 1 Score: 1235.7 bits (3196), Expect = 0.0e+00
Identity = 633/703 (90.04%), Positives = 654/703 (93.03%), Query Frame = 0

Query: 130 MALLPTKLALGCPNSTLFSAAPRLSSLRLPPFRVSCANKRTGKRRYPSEKKKLKLKHKEV 189
           MALLP+KL L   NSTLFS+ PRLSSL LPPFRVSCA KRTGK+RYPSEKKKLKLKHKEV
Sbjct: 1   MALLPSKLPLTYLNSTLFSSPPRLSSLHLPPFRVSCA-KRTGKKRYPSEKKKLKLKHKEV 60

Query: 190 LTTVENKFEGIWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 249
           LTTVENKFEG WRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI
Sbjct: 61  LTTVENKFEGTWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 120

Query: 250 RKSFDARKMLKEPKFVYTVDMDVHRLLILEPRAQDFISDLEPKVGLMEHFAKEKVSNDVI 309
           RKSFDARKMLKEPKFVYTVDMDVHRLLILEPRA+DFISDLEPKVGLMEHFAKEKVSNDVI
Sbjct: 121 RKSFDARKMLKEPKFVYTVDMDVHRLLILEPRARDFISDLEPKVGLMEHFAKEKVSNDVI 180

Query: 310 SIVHDLKSNQEVVGANGLNGHSGPYLRLSNGKPKVAVVGSGPSGLFASLVLAEFGADVTL 369
           SIVHDLKSNQEVVGANG N HSGPYLR+SNGKPK+AVVGSGPSGLFASLVLAEFGADVTL
Sbjct: 181 SIVHDLKSNQEVVGANGFNSHSGPYLRMSNGKPKIAVVGSGPSGLFASLVLAEFGADVTL 240

Query: 370 IERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 429
            ERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK
Sbjct: 241 FERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 300

Query: 430 SLVYFGAPKKILLNGKPHLGTDRLIPLLRNIRQHLEMLGVSIYPILSLDVNIKFGTRVDD 489
           SLVYFGAPK ILLNGKPHLGTD+LIPLLRN RQHLE LGV+          IKFGTRVDD
Sbjct: 301 SLVYFGAPKNILLNGKPHLGTDKLIPLLRNFRQHLETLGVT----------IKFGTRVDD 360

Query: 490 LIEESGHIVGVKVSDSRDKSKLSNQKLEFDAIVLAVGHSARDVYQMLVSHNIPVVPKEFA 549
           LIEE GH+ GVKVSDSRDK KLS QKL +DAIVLAVGHSARDVYQML+SHNIP++PKEFA
Sbjct: 361 LIEEGGHLTGVKVSDSRDKLKLSKQKLGYDAIVLAVGHSARDVYQMLLSHNIPLIPKEFA 420

Query: 550 VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVSKYVNVDTENPSSNSLAASRSC 609
           VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKV+KYVN+DT+NPSSNS+AASRSC
Sbjct: 421 VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVAKYVNIDTKNPSSNSVAASRSC 480

Query: 610 YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFNDHGFHGPLAG 669
           YSFCMCPGGQVVLTSTNP ELCINGMSFSRRSSKWANAALVVTVSTKDFND GF GPLAG
Sbjct: 481 YSFCMCPGGQVVLTSTNPEELCINGMSFSRRSSKWANAALVVTVSTKDFNDLGFQGPLAG 540

Query: 670 VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGH 729
           VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGH
Sbjct: 541 VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGH 600

Query: 730 ITEALQQSVLAFDQ---------------ETRTSSPVQIPRNPETYESTCLRGLYPVGEG 789
           ITEALQQS+LAFDQ               ETRTSSP+QIPRN ETYEST +RGLYPVGEG
Sbjct: 601 ITEALQQSILAFDQELPGFLSSDALLHGVETRTSSPIQIPRNLETYESTSVRGLYPVGEG 660

Query: 790 AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQSSG 818
           AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQ+SG
Sbjct: 661 AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQNSG 692

BLAST of HG10006289 vs. NCBI nr
Match: KAG7020822.1 (hypothetical protein SDJN02_17510 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1227.6 bits (3175), Expect = 0.0e+00
Identity = 624/703 (88.76%), Positives = 661/703 (94.03%), Query Frame = 0

Query: 130 MALLPTKLALGCPNSTLFSAAPRLSSLRLPPFRVSCANKRTGKRRYPSEKKKLKLKHKEV 189
           MALLP  LALGCPNS+LFSA PRL S RLPPFRVSCA KRTGK+RYPSEKKKLKLKHKEV
Sbjct: 1   MALLPFNLALGCPNSSLFSATPRLMSPRLPPFRVSCA-KRTGKKRYPSEKKKLKLKHKEV 60

Query: 190 LTTVENKFEGIWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 249
           LTTVENKFEGIWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI
Sbjct: 61  LTTVENKFEGIWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 120

Query: 250 RKSFDARKMLKEPKFVYTVDMDVHRLLILEPRAQDFISDLEPKVGLMEHFAKEKVSNDVI 309
           RKSFDARKMLKEPKFVYTVDMDV+ LLILEPRA+DFISDLEPKVGL+EH  KEKVSNDVI
Sbjct: 121 RKSFDARKMLKEPKFVYTVDMDVNNLLILEPRARDFISDLEPKVGLIEHIVKEKVSNDVI 180

Query: 310 SIVHDLKSNQEVVGANGLNGHSGPYLRLSNGKPKVAVVGSGPSGLFASLVLAEFGADVTL 369
           SIVHDLKSN EVV  +GLNGHSGP++RL + KPK+AVVGSGPSGLFA+LVLAEFGADVTL
Sbjct: 181 SIVHDLKSNHEVVEESGLNGHSGPFMRLPSSKPKIAVVGSGPSGLFAALVLAEFGADVTL 240

Query: 370 IERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 429
           IERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK
Sbjct: 241 IERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 300

Query: 430 SLVYFGAPKKILLNGKPHLGTDRLIPLLRNIRQHLEMLGVSIYPILSLDVNIKFGTRVDD 489
           SLV+FGAP+ ILL+GKPHLGTDRL+PLLRNIRQHLEMLGVSIY ++S+   +KFGTRVDD
Sbjct: 301 SLVHFGAPENILLSGKPHLGTDRLVPLLRNIRQHLEMLGVSIY-LISILATVKFGTRVDD 360

Query: 490 LIEESGHIVGVKVSDSRDKSKLSNQKLEFDAIVLAVGHSARDVYQMLVSHNIPVVPKEFA 549
           LI+ESGH+VGVKVSDSRDK KL++QKLEFDA VLAVGHSARDVYQML+SHNIPVVPKEFA
Sbjct: 361 LIQESGHVVGVKVSDSRDKLKLNSQKLEFDATVLAVGHSARDVYQMLMSHNIPVVPKEFA 420

Query: 550 VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVSKYVNVDTENPSSNSLAASRSC 609
           VGLRIEHPQ LINSIQYSGLANEVEKGRGKVPVADYKV+KYVN+DTE+PSSNS+AA+RSC
Sbjct: 421 VGLRIEHPQALINSIQYSGLANEVEKGRGKVPVADYKVAKYVNIDTEDPSSNSVAANRSC 480

Query: 610 YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFNDHGFHGPLAG 669
           YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFND GFHGPLAG
Sbjct: 481 YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFNDLGFHGPLAG 540

Query: 670 VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGH 729
           VEFQRE E+RAA+MGGGNFVLPVQTATDFMDR+L+VTSVPPSSYRLGVKASNLHELFPGH
Sbjct: 541 VEFQREFERRAALMGGGNFVLPVQTATDFMDRKLKVTSVPPSSYRLGVKASNLHELFPGH 600

Query: 730 ITEALQQSVLAFDQ---------------ETRTSSPVQIPRNPETYESTCLRGLYPVGEG 789
           ITEALQQS+LAFD+               ETRTSSPVQIPRN ETYEST LRGLYPVGEG
Sbjct: 601 ITEALQQSILAFDKELPGFLSSDALLHGVETRTSSPVQIPRNSETYESTSLRGLYPVGEG 660

Query: 790 AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQSSG 818
           AGYAGGIVSAAVDGMYAGFAVAK+FNL +GDLETVLGKAQSSG
Sbjct: 661 AGYAGGIVSAAVDGMYAGFAVAKNFNLCNGDLETVLGKAQSSG 701

BLAST of HG10006289 vs. ExPASy Swiss-Prot
Match: Q5SNN4 (Zinc finger CCCH domain-containing protein 40 OS=Oryza sativa subsp. japonica OX=39947 GN=Os06g0170500 PE=2 SV=1)

HSP 1 Score: 679.1 bits (1751), Expect = 1.0e-193
Identity = 359/488 (73.57%), Positives = 390/488 (79.92%), Query Frame = 0

Query: 830  MAHRLLRDLEADGWERSDFPIICESCLGDNPYVRMTRADYDKECKICTRPFTVFRWRPGR 889
            MAHRLLRD +ADGWERSDFPIICESCLGDNPYVRM RA+YDKECKIC RPFTVFRWRPGR
Sbjct: 1    MAHRLLRDAQADGWERSDFPIICESCLGDNPYVRMLRAEYDKECKICARPFTVFRWRPGR 60

Query: 890  DARYKKTEICQTCSKLKNVCQVCLLDLEYGLPVQVRDTALSINSNDAIPKSDVNREYFAE 949
            DARYKKTEICQTC KLKNVCQVCLLDLEYGLPVQVRDTAL+INSNDAIP+SDVNREYFAE
Sbjct: 61   DARYKKTEICQTCCKLKNVCQVCLLDLEYGLPVQVRDTALAINSNDAIPRSDVNREYFAE 120

Query: 950  EHDRKARAGIDYESSYGKARPNDTILKLQRTTPYYKRNRAHVCSFYIRGECTRGSECPYR 1009
            EHDRKARAGIDY+SS+GKARPNDTILKLQRT PYYKRNRAHVCSFY+RGECTRG+ECPYR
Sbjct: 121  EHDRKARAGIDYDSSHGKARPNDTILKLQRTAPYYKRNRAHVCSFYVRGECTRGAECPYR 180

Query: 1010 HEMPETGELSQQNIKDRYYGVNDPVALKLLNKAGEMPSLEPPEDESIRTLYVGGLDARVT 1069
            HEMPETGELSQQNIKDRYYGVNDPVALKLL KAGEMPSL PP+DESIRTLY+GGL+ R+T
Sbjct: 181  HEMPETGELSQQNIKDRYYGVNDPVALKLLGKAGEMPSLTPPDDESIRTLYIGGLNNRIT 240

Query: 1070 EQDLRDNFYAHGEIESIRMVLQRACAFVTYTTREGAEKAAEELSNKLVIKGLRLKLMWGR 1129
            EQDLRD FYAHGEIESIRMVLQRACAFVTYTTREGAEKAAEEL+NKLVIKG+RLKLMWG+
Sbjct: 241  EQDLRDQFYAHGEIESIRMVLQRACAFVTYTTREGAEKAAEELANKLVIKGIRLKLMWGK 300

Query: 1130 PQAPKAESEGSDEAKQAAVAHSGMLPRAVISQQH--NQLHPPGTHDQPQA---MHYFNIP 1189
            PQAPK E +  +  +Q  VAH GMLPRAVISQQ   +Q  PPG   Q QA    +YFNI 
Sbjct: 301  PQAPKPEDD--EAGRQGHVAHGGMLPRAVISQQQSGDQPQPPGMEGQQQAPSGSYYFNI- 360

Query: 1190 PPPPQQERAFYPSMDPQRMGALVSTHDVGVPPNGPTGSTETRPGSEKQHQQGHQFP---- 1249
            P PP  ER  YPSMDPQRMGALV + +    P GP  + + +  S      G  +P    
Sbjct: 361  PAPPGAERTLYPSMDPQRMGALVKSQEGDGKP-GPQQAAQAQASS----SSGQSYPMPPQ 420

Query: 1250 -YHSMHPPPPAQYQQQFYPPY-GYM----QHYPP---YPPYHSNMPPPPPSQSQSQPHP- 1299
             YH  +PP        +YPPY GYM      YPP   YPPY   +  P  SQ+ S   P 
Sbjct: 421  YYHGQYPP--------YYPPYGGYMPPPRMPYPPPPQYPPYQPMLATPAQSQASSSQQPA 472

BLAST of HG10006289 vs. ExPASy Swiss-Prot
Match: Q6Z358 (Zinc finger CCCH domain-containing protein 49 OS=Oryza sativa subsp. japonica OX=39947 GN=Os07g0281000 PE=2 SV=1)

HSP 1 Score: 670.6 bits (1729), Expect = 3.6e-191
Identity = 359/496 (72.38%), Positives = 390/496 (78.63%), Query Frame = 0

Query: 830  MAHRLLRDLEADGWERSDFPIICESCLGDNPYVRMTRADYDKECKICTRPFTVFRWRPGR 889
            MAHRLLRD +ADGWERSDFPIICESCLGDNPYVRM RA+YDKECKIC RPFTVFRWRPGR
Sbjct: 1    MAHRLLRDAQADGWERSDFPIICESCLGDNPYVRMLRAEYDKECKICARPFTVFRWRPGR 60

Query: 890  DARYKKTEICQTCSKLKNVCQVCLLDLEYGLPVQVRDTALSINSNDAIPKSDVNREYFAE 949
            DARYKKTEICQTC KLKNVCQVCLLDLEYGLPVQVRDTALS NSNDAIP+SDVNREYFAE
Sbjct: 61   DARYKKTEICQTCCKLKNVCQVCLLDLEYGLPVQVRDTALSTNSNDAIPRSDVNREYFAE 120

Query: 950  EHDRKARAGIDYESSYGKARPNDTILKLQRTTPYYKRNRAHVCSFYIRGECTRGSECPYR 1009
            EHDR+ARAGIDY+SS GKAR NDTILKLQRT PYYKRNRAHVCSFY+RGECTRG+ECPYR
Sbjct: 121  EHDRRARAGIDYDSSNGKARANDTILKLQRTAPYYKRNRAHVCSFYVRGECTRGAECPYR 180

Query: 1010 HEMPETGELSQQNIKDRYYGVNDPVALKLLNKAGEMPSLEPPEDESIRTLYVGGLDARVT 1069
            HEMPETGELSQQNIKDRYYGVNDPVALKLL+KAGEMPSL PP+DESIRTLY+GGLD+RVT
Sbjct: 181  HEMPETGELSQQNIKDRYYGVNDPVALKLLSKAGEMPSLTPPDDESIRTLYIGGLDSRVT 240

Query: 1070 EQDLRDNFYAHGEIESIRMVLQRACAFVTYTTREGAEKAAEELSNKLVIKGLRLKLMWGR 1129
            EQDLRD FYAHGEIE+IRMVLQRACAFVTYTTREGAEKAAEEL+NKLVIKG+RLKLMWG+
Sbjct: 241  EQDLRDQFYAHGEIETIRMVLQRACAFVTYTTREGAEKAAEELANKLVIKGVRLKLMWGK 300

Query: 1130 PQAPKAESEGSDEAKQAAVAHSGMLPRAVISQQH--NQLHPPGTHDQPQ---AMHYFNIP 1189
            PQAPK E +  +  +Q  VAH GMLPRAVISQQ   +Q  PPG   Q Q   A +YFNI 
Sbjct: 301  PQAPKPEED--EAGRQGHVAHGGMLPRAVISQQQSGDQPQPPGMEGQQQPASASYYFNI- 360

Query: 1190 PPPPQQERAFYPSMDPQRMGALVSTHDVGVPPNGPTGSTETRPGSEKQHQQGHQFPYHSM 1249
            P PP  ER  YPSMDPQRMGALV + +     +G  G  +   G +     G  +P    
Sbjct: 361  PAPPAAERTLYPSMDPQRMGALVESQE----GDGKPGPQQAGQG-QASSSSGQSYP---- 420

Query: 1250 HPPPPAQYQQQ---FYPPY-GYM-------QHYPPYPPYHSNMPPPPPSQSQSQPHPPSG 1307
             PPPP  +  Q   +YPPY GYM       Q  P YP Y   + PP  SQ+ S   P   
Sbjct: 421  EPPPPYYHGGQYPPYYPPYGGYMPPPRMPYQQPPQYPAYQPMLAPPAQSQASSLQQPAPA 480

BLAST of HG10006289 vs. ExPASy Swiss-Prot
Match: Q9ZW36 (Zinc finger CCCH domain-containing protein 25 OS=Arabidopsis thaliana OX=3702 GN=At2g29580 PE=1 SV=1)

HSP 1 Score: 642.5 bits (1656), Expect = 1.1e-182
Identity = 342/507 (67.46%), Positives = 389/507 (76.73%), Query Frame = 0

Query: 830  MAHRLLRDLEADGWERSDFPIICESCLGDNPYVRMTRADYDKECKICTRPFTVFRWRPGR 889
            MAHR+LRD EADGWERSDFPIICESCLGDNPYVRMT+A+YDKECKICTRPFTVFRWRPGR
Sbjct: 1    MAHRILRDHEADGWERSDFPIICESCLGDNPYVRMTKANYDKECKICTRPFTVFRWRPGR 60

Query: 890  DARYKKTEICQTCSKLKNVCQVCLLDLEYGLPVQVRDTALSINSNDAIPKSDVNREYFAE 949
            DARYKKTE+CQTC KLKNVCQVCLLDLEYGLPVQVRDTAL+I+++D+IPKSDVNRE+FAE
Sbjct: 61   DARYKKTEVCQTCCKLKNVCQVCLLDLEYGLPVQVRDTALNISTHDSIPKSDVNREFFAE 120

Query: 950  EHDRKARAGIDYESSYGKARPNDTILKLQRTTPYYKRNRAHVCSFYIRGECTRGSECPYR 1009
            EHDRK RAG+DYESS+GK RPNDTI  LQRTTPYYKRNRAH+CSF+IRGECTRG ECPYR
Sbjct: 121  EHDRKTRAGLDYESSFGKIRPNDTIRMLQRTTPYYKRNRAHICSFFIRGECTRGDECPYR 180

Query: 1010 HEMPETGELSQQNIKDRYYGVNDPVALKLLNKAGEMPSLEPPEDESIRTLYVGGLDARVT 1069
            HEMPETGELSQQNIKDRYYGVNDPVALKLL KAGEM +LE PED+SIRTLYVGGL++RV 
Sbjct: 181  HEMPETGELSQQNIKDRYYGVNDPVALKLLGKAGEMGTLESPEDQSIRTLYVGGLNSRVL 240

Query: 1070 EQDLRDNFYAHGEIESIRMVLQRACAFVTYTTREGAEKAAEELSNKLVIKGLRLKLMWGR 1129
            EQD+RD FYAHGEIESIR++ ++ACAFVTYTTREGAEKAAEELSN+LV+ G RLKL WGR
Sbjct: 241  EQDIRDQFYAHGEIESIRILAEKACAFVTYTTREGAEKAAEELSNRLVVNGQRLKLTWGR 300

Query: 1130 PQAPKAESEGSDEAKQAAVAHSGMLPRAVISQQHNQLHPPGTHDQPQAMHYFNIPPP-PP 1189
            PQ PK + +GS++  Q +VAHSG+LPRAVISQQ NQ  PP     P   +Y + PPP PP
Sbjct: 301  PQVPKPDQDGSNQ--QGSVAHSGLLPRAVISQQQNQ--PP-----PMLQYYMHPPPPQPP 360

Query: 1190 QQERAFYPSMDPQRMGALVSTHDVGVPPNGPTGSTETRPGSEKQHQQGHQFPYHSMHPPP 1249
             Q+R FYPSMDPQRMGA+ S+ + G   +   G++ +   S      GH +P H  +PPP
Sbjct: 361  HQDRPFYPSMDPQRMGAVSSSKESGSSTSDNRGASSS---SYTMPPHGH-YPQHQPYPPP 420

Query: 1250 PAQYQQQFYPPYGYMQHYPPYPPYHSNM------PPPPPSQSQSQPHPPSGLQQYQQQHS 1309
               Y     PPY   Q YPPY   HS          P P    + PHP S         +
Sbjct: 421  --SYGGYMQPPY---QQYPPYHHGHSQQADHDYPQQPGPGSRPNPPHPSS-------VSA 480

Query: 1310 TPPGSAPQSHGGASSVSAPLGSTPSVS 1330
             PP S   +  G+S  SA    T   S
Sbjct: 481  PPPDSVSAAPSGSSQQSADAAVTTGSS 482

BLAST of HG10006289 vs. ExPASy Swiss-Prot
Match: Q9LNV5 (Zinc finger CCCH domain-containing protein 4 OS=Arabidopsis thaliana OX=3702 GN=At1g07360 PE=1 SV=1)

HSP 1 Score: 638.3 bits (1645), Expect = 2.0e-181
Identity = 337/510 (66.08%), Positives = 388/510 (76.08%), Query Frame = 0

Query: 830  MAHRLLRDLEADGWERSDFPIICESCLGDNPYVRMTRADYDKECKICTRPFTVFRWRPGR 889
            MAHR+LRD EADGWERSDFPIICESCLGDNPYVRMT+A+YDKECKICTRPFTVFRWRPGR
Sbjct: 1    MAHRILRDHEADGWERSDFPIICESCLGDNPYVRMTKANYDKECKICTRPFTVFRWRPGR 60

Query: 890  DARYKKTEICQTCSKLKNVCQVCLLDLEYGLPVQVRDTALSINSNDAIPKSDVNREYFAE 949
            DARYKKTEICQTC KLKNVCQVCLLDLEYGLPVQVRDTAL+I+++D+IPKSDVNREYFAE
Sbjct: 61   DARYKKTEICQTCCKLKNVCQVCLLDLEYGLPVQVRDTALNISTHDSIPKSDVNREYFAE 120

Query: 950  EHDRKARAGIDYESSYGKARPNDTILKLQRTTPYYKRNRAHVCSFYIRGECTRGSECPYR 1009
            EHDRKARAG+DYESS+GK RPNDTILKLQRTTPYYKRNRAHVCSF+IRGECTRG+ECPYR
Sbjct: 121  EHDRKARAGLDYESSFGKMRPNDTILKLQRTTPYYKRNRAHVCSFFIRGECTRGAECPYR 180

Query: 1010 HEMPETGELSQQNIKDRYYGVNDPVALKLLNKAGEMPSLEPPEDESIRTLYVGGLDARVT 1069
            HEMPETGELSQQNIKDRYYGVNDPVA+KLL KAGEM +LE P+DESI+TLYVGGL++R+ 
Sbjct: 181  HEMPETGELSQQNIKDRYYGVNDPVAMKLLGKAGEMGTLESPDDESIKTLYVGGLNSRIL 240

Query: 1070 EQDLRDNFYAHGEIESIRMVLQRACAFVTYTTREGAEKAAEELSNKLVIKGLRLKLMWGR 1129
            EQD+RD FYAHGEIESIR++  +ACAFVTYT+REGAEKAA+ELSN+LVI G RLKL WGR
Sbjct: 241  EQDIRDQFYAHGEIESIRILADKACAFVTYTSREGAEKAAQELSNRLVINGQRLKLTWGR 300

Query: 1130 PQAPKAESEGSDEAKQAAVAHSGMLPRAVISQQHNQLHPPGTHDQPQAMHYFNIPPPPPQ 1189
               PK + +G+++  Q  VAHSG+LPRAVISQQHN         QP  M  + + PPP  
Sbjct: 301  ---PKPDQDGANQ--QGGVAHSGLLPRAVISQQHN---------QPPPMQQYYMHPPPAN 360

Query: 1190 QERAFYPSMDPQRMGALVSTHDVGVPPNGPTGSTETRPGSEKQHQQGHQFPYHSMHPPPP 1249
            Q++ +YPSMDPQRMGA++ST + G       GS+    G+       +  P H  +PP  
Sbjct: 361  QDKPYYPSMDPQRMGAVISTQEAG-------GSSTENNGAS---SSSYMMPPHQSYPP-- 420

Query: 1250 AQYQQQFYPPYGYM--QHYPPYPPYHSNMPPPPPSQSQSQPHPPSGLQQYQQQHSTPPGS 1309
                    PPYGYM   +   YPP H + P P    +     PP     Y QQ    PGS
Sbjct: 421  --------PPYGYMPSPYQQQYPPNHHHQPSPMQHYA-----PPPAAYPYPQQPG--PGS 469

Query: 1310 APQSHGGASSVSAPLGSTPSVSAPSSTSSE 1338
             P     A S  +P  +     APS +S +
Sbjct: 481  RPAPSPTAVSAISPDSAPAGSGAPSGSSQQ 469

BLAST of HG10006289 vs. ExPASy Swiss-Prot
Match: Q9FL40 (Zinc finger CCCH domain-containing protein 53 OS=Arabidopsis thaliana OX=3702 GN=At5g07060 PE=2 SV=1)

HSP 1 Score: 454.1 bits (1167), Expect = 5.3e-126
Identity = 233/414 (56.28%), Positives = 284/414 (68.60%), Query Frame = 0

Query: 836  RDLEADGWERSDFPIICESCLGDNPYVRMTRADYDKECKICTRPFTVFRWRPGRDARYKK 895
            RD  ADGWE +DFPI CESC GDNPY+RMTRADYDKECKIC+RPFT FRWRPGR+AR+KK
Sbjct: 4    RDHGADGWESADFPITCESCFGDNPYMRMTRADYDKECKICSRPFTAFRWRPGRNARFKK 63

Query: 896  TEICQTCSKLKNVCQVCLLDLEYGLPVQVRDTALSINSNDAIPKSDVNREYFAEEHDRKA 955
            TEICQTCSKLKNVCQVCLLDL +GLPVQVRD+AL+INS+ ++P S VNREYFA+EHD K 
Sbjct: 64   TEICQTCSKLKNVCQVCLLDLGFGLPVQVRDSALNINSHYSVPMSHVNREYFADEHDPKT 123

Query: 956  RAGIDYESSYGKARPNDTILKLQRTTPYYKRNRAHVCSFYIRGECTRGSECPYRHEMPET 1015
            RAG+DYESS+GK +PNDTILKLQR TP Y++NR  +CSFY  G+C RG+EC +RHEMPET
Sbjct: 124  RAGLDYESSFGKMQPNDTILKLQRRTPSYEKNRPKICSFYTIGQCKRGAECSFRHEMPET 183

Query: 1016 GELSQQNIKDRYYGVNDPVALKLLNKAGEMPSLEPPEDESIRTLYVGGLDARVTEQDLRD 1075
            GELS QNI+DRYY VNDPVA+KLL KAGEM +LEPPEDESI+TLYVGGL++R+ EQD+ D
Sbjct: 184  GELSHQNIRDRYYSVNDPVAMKLLRKAGEMGTLEPPEDESIKTLYVGGLNSRIFEQDIHD 243

Query: 1076 NFYAHGEIESIRMVLQRACAFVTYTTREGAEKAAEELSNKLVIKGLRLKLMWGRPQAPKA 1135
            +FYA+GE+ESIR++ +                                          K 
Sbjct: 244  HFYAYGEMESIRVMAEDG----------------------------------------KY 303

Query: 1136 ESEGSDEAKQAAVAHSGMLPRAVISQQHNQLHPPGTHDQPQAMHYFNIPPPPPQQERAFY 1195
            +  GS++ +Q ++AH+G+     ISQQ NQ      H Q Q   Y+  PPPP   E + Y
Sbjct: 304  DQSGSNQQQQGSIAHTGL-----ISQQQNQ------HSQMQ--QYYMQPPPP--NEYSHY 354

Query: 1196 PSMDPQRMGALVSTHDVGVPPNGPTGSTETRPGSEKQHQQGHQFPYHSMHPPPP 1250
            PSMD QRMGA  ST +        +  + T   +       +  P H  +P PP
Sbjct: 364  PSMDTQRMGAAFSTQE--------SDGSSTSENNRAYSSYSYPMPPHQPYPTPP 354

BLAST of HG10006289 vs. ExPASy TrEMBL
Match: A0A0A0LM76 (FAD_binding_3 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G036590 PE=4 SV=1)

HSP 1 Score: 1249.6 bits (3232), Expect = 0.0e+00
Identity = 644/722 (89.20%), Positives = 665/722 (92.11%), Query Frame = 0

Query: 111 SAVAVYCFRSPWATPTPSAMALLPTKLALGCPNSTLFSAAPRLSSLRLPPFRVSCANKRT 170
           S  AVY FR       PSAMALLP+KL    PNSTLFS+ PRLSSL LPPFRVSCA KRT
Sbjct: 36  SFTAVYFFR------CPSAMALLPSKLPFTYPNSTLFSSPPRLSSLHLPPFRVSCA-KRT 95

Query: 171 GKRRYPSEKKKLKLKHKEVLTTVENKFEGIWRLFKLGVPVEKDPGKDFHGLSDALMQEIA 230
           GK+RYPSEKKKLKLKHKEVLTTVENKFEGIWRLFKL VPVEKDPGKDFHGLSDALMQEIA
Sbjct: 96  GKKRYPSEKKKLKLKHKEVLTTVENKFEGIWRLFKLVVPVEKDPGKDFHGLSDALMQEIA 155

Query: 231 KVLEFPVASLLPREAFSVIRKSFDARKMLKEPKFVYTVDMDVHRLLILEPRAQDFISDLE 290
           KVLEFPVASLLPREAFSVIRKSFDARKMLKEPKFVYTVDMDVH LLILEPRA+DFISDLE
Sbjct: 156 KVLEFPVASLLPREAFSVIRKSFDARKMLKEPKFVYTVDMDVHSLLILEPRARDFISDLE 215

Query: 291 PKVGLMEHFAKEKVSNDVISIVHDLKSNQEVVGANGLNGHSGPYLRLSNGKPKVAVVGSG 350
           PKVGLMEHFAKEKVSNDVISIVHDLKSNQEVVGANGL GHSGPYLR+SNGKPK+AVVGSG
Sbjct: 216 PKVGLMEHFAKEKVSNDVISIVHDLKSNQEVVGANGLTGHSGPYLRMSNGKPKIAVVGSG 275

Query: 351 PSGLFASLVLAEFGADVTLIERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSD 410
           PSGLFASLVLAEFGADVTLIERGQPVEQRGRDIGALV+RRILELDSNFCFGEGGAGTWSD
Sbjct: 276 PSGLFASLVLAEFGADVTLIERGQPVEQRGRDIGALVSRRILELDSNFCFGEGGAGTWSD 335

Query: 411 GKLVTRIGRNSGSVQAVMKSLVYFGAPKKILLNGKPHLGTDRLIPLLRNIRQHLEMLGVS 470
           GKLVTRIGRNSGSVQAVMKSLVYFGAPK ILLNGKPHLGTD+LIPLLRNIRQHLE LGV+
Sbjct: 336 GKLVTRIGRNSGSVQAVMKSLVYFGAPKNILLNGKPHLGTDKLIPLLRNIRQHLETLGVT 395

Query: 471 IYPILSLDVNIKFGTRVDDLIEESGHIVGVKVSDSRDKSKLSNQKLEFDAIVLAVGHSAR 530
                     IKFGTRVDDLIEE GH+ GVKVSDSRDK KLS Q LE+DAIVLAVGHSAR
Sbjct: 396 ----------IKFGTRVDDLIEEGGHVAGVKVSDSRDKLKLSKQTLEYDAIVLAVGHSAR 455

Query: 531 DVYQMLVSHNIPVVPKEFAVGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVSKY 590
           DVYQML+SHNIPV+PKEF+VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKV+KY
Sbjct: 456 DVYQMLLSHNIPVIPKEFSVGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVAKY 515

Query: 591 VNVDTENPSSNSLAASRSCYSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALV 650
           VN+DT+NPSSN LAASRSCYSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALV
Sbjct: 516 VNIDTKNPSSNFLAASRSCYSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALV 575

Query: 651 VTVSTKDFNDHGFHGPLAGVEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPP 710
           VTVSTKDFND GF GPLAGVEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPP
Sbjct: 576 VTVSTKDFNDLGFRGPLAGVEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPP 635

Query: 711 SSYRLGVKASNLHELFPGHITEALQQSVLAFDQ---------------ETRTSSPVQIPR 770
           SSYRLGVKASNLHELFP HITEALQQS+LAFDQ               ETRTSSP+QIPR
Sbjct: 636 SSYRLGVKASNLHELFPDHITEALQQSILAFDQELPGFLSSDALLHGVETRTSSPIQIPR 695

Query: 771 NPETYESTCLRGLYPVGEGAGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQS 818
           NPETYEST +RGLYPVGEGAGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQ+
Sbjct: 696 NPETYESTSVRGLYPVGEGAGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQN 740

BLAST of HG10006289 vs. ExPASy TrEMBL
Match: A0A1S3BV00 (uncharacterized protein Cbei_0202 OS=Cucumis melo OX=3656 GN=LOC103493563 PE=4 SV=1)

HSP 1 Score: 1240.7 bits (3209), Expect = 0.0e+00
Identity = 635/703 (90.33%), Positives = 655/703 (93.17%), Query Frame = 0

Query: 130 MALLPTKLALGCPNSTLFSAAPRLSSLRLPPFRVSCANKRTGKRRYPSEKKKLKLKHKEV 189
           MALLP+KL L  PNSTLFS+ PRLSSL LPPFRVSCA KRTGK+RYPSEKKKLKLKHKEV
Sbjct: 1   MALLPSKLPLTYPNSTLFSSPPRLSSLHLPPFRVSCA-KRTGKKRYPSEKKKLKLKHKEV 60

Query: 190 LTTVENKFEGIWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 249
           LTTVENKFEG WRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI
Sbjct: 61  LTTVENKFEGTWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 120

Query: 250 RKSFDARKMLKEPKFVYTVDMDVHRLLILEPRAQDFISDLEPKVGLMEHFAKEKVSNDVI 309
           RKSFDARKMLKEPKFVYTVDMDVHRLLILEPRA+DFISDLEPKVGLMEHFAKEKVSNDVI
Sbjct: 121 RKSFDARKMLKEPKFVYTVDMDVHRLLILEPRARDFISDLEPKVGLMEHFAKEKVSNDVI 180

Query: 310 SIVHDLKSNQEVVGANGLNGHSGPYLRLSNGKPKVAVVGSGPSGLFASLVLAEFGADVTL 369
           SIVHDLKSNQEVVGANG N HSGPYLR+SNGKPK+AVVGSGPSGLFASLVLAEFGADVTL
Sbjct: 181 SIVHDLKSNQEVVGANGFNSHSGPYLRMSNGKPKIAVVGSGPSGLFASLVLAEFGADVTL 240

Query: 370 IERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 429
            ERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK
Sbjct: 241 FERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 300

Query: 430 SLVYFGAPKKILLNGKPHLGTDRLIPLLRNIRQHLEMLGVSIYPILSLDVNIKFGTRVDD 489
           SLVYFGAPK ILLNGKPHLGTD+LIPLLRN RQHLE LGV+          IKFGTRVDD
Sbjct: 301 SLVYFGAPKNILLNGKPHLGTDKLIPLLRNFRQHLETLGVT----------IKFGTRVDD 360

Query: 490 LIEESGHIVGVKVSDSRDKSKLSNQKLEFDAIVLAVGHSARDVYQMLVSHNIPVVPKEFA 549
           LIEE GH+ GVKVSDSRDK KLS QKL +DAIVLAVGHSARDVYQML+SHNIPV+PKEFA
Sbjct: 361 LIEEGGHLTGVKVSDSRDKLKLSKQKLGYDAIVLAVGHSARDVYQMLLSHNIPVIPKEFA 420

Query: 550 VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVSKYVNVDTENPSSNSLAASRSC 609
           VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKV+KYVN+DT+NPSSNS+AASRSC
Sbjct: 421 VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVAKYVNIDTKNPSSNSVAASRSC 480

Query: 610 YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFNDHGFHGPLAG 669
           YSFCMCPGGQVVLTSTNP ELCINGMSFSRRSSKWANAALVVTVSTKDFND GF GPLAG
Sbjct: 481 YSFCMCPGGQVVLTSTNPEELCINGMSFSRRSSKWANAALVVTVSTKDFNDLGFQGPLAG 540

Query: 670 VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGH 729
           VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGH
Sbjct: 541 VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGH 600

Query: 730 ITEALQQSVLAFDQ---------------ETRTSSPVQIPRNPETYESTCLRGLYPVGEG 789
           ITEALQQS+LAFDQ               ETRTSSP+QIPRN ETYEST +RGLYPVGEG
Sbjct: 601 ITEALQQSILAFDQELPGFLSSDALLHGVETRTSSPIQIPRNLETYESTSVRGLYPVGEG 660

Query: 790 AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQSSG 818
           AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQ+SG
Sbjct: 661 AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQNSG 692

BLAST of HG10006289 vs. ExPASy TrEMBL
Match: A0A5D3B957 (FAD/NAD(P)-binding oxidoreductase family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold546G00270 PE=4 SV=1)

HSP 1 Score: 1235.7 bits (3196), Expect = 0.0e+00
Identity = 633/703 (90.04%), Positives = 654/703 (93.03%), Query Frame = 0

Query: 130 MALLPTKLALGCPNSTLFSAAPRLSSLRLPPFRVSCANKRTGKRRYPSEKKKLKLKHKEV 189
           MALLP+KL L   NSTLFS+ PRLSSL LPPFRVSCA KRTGK+RYPSEKKKLKLKHKEV
Sbjct: 1   MALLPSKLPLTYLNSTLFSSPPRLSSLHLPPFRVSCA-KRTGKKRYPSEKKKLKLKHKEV 60

Query: 190 LTTVENKFEGIWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 249
           LTTVENKFEG WRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI
Sbjct: 61  LTTVENKFEGTWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 120

Query: 250 RKSFDARKMLKEPKFVYTVDMDVHRLLILEPRAQDFISDLEPKVGLMEHFAKEKVSNDVI 309
           RKSFDARKMLKEPKFVYTVDMDVHRLLILEPRA+DFISDLEPKVGLMEHFAKEKVSNDVI
Sbjct: 121 RKSFDARKMLKEPKFVYTVDMDVHRLLILEPRARDFISDLEPKVGLMEHFAKEKVSNDVI 180

Query: 310 SIVHDLKSNQEVVGANGLNGHSGPYLRLSNGKPKVAVVGSGPSGLFASLVLAEFGADVTL 369
           SIVHDLKSNQEVVGANG N HSGPYLR+SNGKPK+AVVGSGPSGLFASLVLAEFGADVTL
Sbjct: 181 SIVHDLKSNQEVVGANGFNSHSGPYLRMSNGKPKIAVVGSGPSGLFASLVLAEFGADVTL 240

Query: 370 IERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 429
            ERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK
Sbjct: 241 FERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 300

Query: 430 SLVYFGAPKKILLNGKPHLGTDRLIPLLRNIRQHLEMLGVSIYPILSLDVNIKFGTRVDD 489
           SLVYFGAPK ILLNGKPHLGTD+LIPLLRN RQHLE LGV+          IKFGTRVDD
Sbjct: 301 SLVYFGAPKNILLNGKPHLGTDKLIPLLRNFRQHLETLGVT----------IKFGTRVDD 360

Query: 490 LIEESGHIVGVKVSDSRDKSKLSNQKLEFDAIVLAVGHSARDVYQMLVSHNIPVVPKEFA 549
           LIEE GH+ GVKVSDSRDK KLS QKL +DAIVLAVGHSARDVYQML+SHNIP++PKEFA
Sbjct: 361 LIEEGGHLTGVKVSDSRDKLKLSKQKLGYDAIVLAVGHSARDVYQMLLSHNIPLIPKEFA 420

Query: 550 VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVSKYVNVDTENPSSNSLAASRSC 609
           VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKV+KYVN+DT+NPSSNS+AASRSC
Sbjct: 421 VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVAKYVNIDTKNPSSNSVAASRSC 480

Query: 610 YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFNDHGFHGPLAG 669
           YSFCMCPGGQVVLTSTNP ELCINGMSFSRRSSKWANAALVVTVSTKDFND GF GPLAG
Sbjct: 481 YSFCMCPGGQVVLTSTNPEELCINGMSFSRRSSKWANAALVVTVSTKDFNDLGFQGPLAG 540

Query: 670 VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGH 729
           VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGH
Sbjct: 541 VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGH 600

Query: 730 ITEALQQSVLAFDQ---------------ETRTSSPVQIPRNPETYESTCLRGLYPVGEG 789
           ITEALQQS+LAFDQ               ETRTSSP+QIPRN ETYEST +RGLYPVGEG
Sbjct: 601 ITEALQQSILAFDQELPGFLSSDALLHGVETRTSSPIQIPRNLETYESTSVRGLYPVGEG 660

Query: 790 AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQSSG 818
           AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQ+SG
Sbjct: 661 AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQNSG 692
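
The percentages in the HSP summary line of this hit can be rechecked from the matched/aligned counts it reports. The following is a minimal Python sketch, not part of the database output; the function name `parse_hsp_stats` and the dictionary layout are illustrative assumptions. The regular expression captures whatever label precedes each `matched/length (pct%)` group, so it does not depend on how the label is spelled.

```python
import re

# Matches groups such as "Identity = 633/703 (90.04%)".
# "Query Frame = 0" has no "/" and is therefore ignored.
HSP_STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {label: {...}} with reported and recomputed percentages (hypothetical helper)."""
    stats = {}
    for label, matched, length, reported in HSP_STAT.findall(line):
        matched, length = int(matched), int(length)
        stats[label] = {
            "matched": matched,
            "length": length,
            "reported_pct": float(reported),
            # Percentage recomputed from the counts, rounded like the report.
            "computed_pct": round(100.0 * matched / length, 2),
        }
    return stats


# HSP summary line of the A0A5D3B957 hit, copied from the report.
line = "Identity = 633/703 (90.04%), Positives = 654/703 (93.03%), Query Frame = 0"
stats = parse_hsp_stats(line)

for label, s in stats.items():
    print(label, s["matched"], s["length"], s["reported_pct"], s["computed_pct"])
```

The same check can be applied to the summary line of any other hit on this page by replacing `line`.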

BLAST of HG10006289 vs. ExPASy TrEMBL
Match: A0A6J1HYE6 (uncharacterized protein LOC111468611 OS=Cucurbita maxima OX=3661 GN=LOC111468611 PE=4 SV=1)

HSP 1 Score: 1215.7 bits (3144), Expect = 0.0e+00
Identity = 619/703 (88.05%), Positives = 653/703 (92.89%), Query Frame = 0

Query: 130 MALLPTKLALGCPNSTLFSAAPRLSSLRLPPFRVSCANKRTGKRRYPSEKKKLKLKHKEV 189
           MALLP KLALGCPNS+LFSA PRL S RLPPFRVSCA KRTGK++YPSEKKKLKLKHKEV
Sbjct: 1   MALLPFKLALGCPNSSLFSATPRLMSPRLPPFRVSCA-KRTGKKKYPSEKKKLKLKHKEV 60

Query: 190 LTTVENKFEGIWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 249
           LTTVENKFEGIWRLFKLGV VEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI
Sbjct: 61  LTTVENKFEGIWRLFKLGVSVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 120

Query: 250 RKSFDARKMLKEPKFVYTVDMDVHRLLILEPRAQDFISDLEPKVGLMEHFAKEKVSNDVI 309
           RKSFDARKMLKEPKFVYTVDMDV+ LLILEPRA+DFISDLEPKVGL+EH  KEKVSNDVI
Sbjct: 121 RKSFDARKMLKEPKFVYTVDMDVNNLLILEPRARDFISDLEPKVGLIEHIVKEKVSNDVI 180

Query: 310 SIVHDLKSNQEVVGANGLNGHSGPYLRLSNGKPKVAVVGSGPSGLFASLVLAEFGADVTL 369
           SIVHDLKSN E+V  +GLNGHSGPY+RL + KPK+AVVGSGPSGLFA+LVLAEFGADVTL
Sbjct: 181 SIVHDLKSNHELVEESGLNGHSGPYMRLPSSKPKIAVVGSGPSGLFAALVLAEFGADVTL 240

Query: 370 IERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 429
           IERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK
Sbjct: 241 IERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 300

Query: 430 SLVYFGAPKKILLNGKPHLGTDRLIPLLRNIRQHLEMLGVSIYPILSLDVNIKFGTRVDD 489
           SLV+FGAP+ ILL+GKPHLGTDRL+PLLRNIRQHLE LG +          +KFGTRVDD
Sbjct: 301 SLVHFGAPENILLSGKPHLGTDRLVPLLRNIRQHLETLGAT----------VKFGTRVDD 360

Query: 490 LIEESGHIVGVKVSDSRDKSKLSNQKLEFDAIVLAVGHSARDVYQMLVSHNIPVVPKEFA 549
           LI+ESGH+VGVKVSDSRDK KL+NQKLEFDA VLAVGHSARDVYQML+SHNIPVVPKEFA
Sbjct: 361 LIQESGHVVGVKVSDSRDKLKLNNQKLEFDATVLAVGHSARDVYQMLMSHNIPVVPKEFA 420

Query: 550 VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVSKYVNVDTENPSSNSLAASRSC 609
           VGLRIEHPQ LINSIQYSGLANEVEKGRGKVPVADYKV+KYVN+DTE+PSSNS+AASRSC
Sbjct: 421 VGLRIEHPQALINSIQYSGLANEVEKGRGKVPVADYKVAKYVNIDTEDPSSNSVAASRSC 480

Query: 610 YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFNDHGFHGPLAG 669
           YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFND GFHGPLAG
Sbjct: 481 YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFNDLGFHGPLAG 540

Query: 670 VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGH 729
           VEFQRE E+RAA+MGGG FVLPVQTATDFMDR+L+VTSVPPSSYRLGVKASNLHELFPGH
Sbjct: 541 VEFQREFERRAALMGGGKFVLPVQTATDFMDRKLKVTSVPPSSYRLGVKASNLHELFPGH 600

Query: 730 ITEALQQSVLAFDQ---------------ETRTSSPVQIPRNPETYESTCLRGLYPVGEG 789
           ITEALQQS+LAFD+               ETRTSSPVQIPRNP TYEST LRGLYPVGEG
Sbjct: 601 ITEALQQSILAFDKELPGFLSGDALLHGVETRTSSPVQIPRNPGTYESTSLRGLYPVGEG 660

Query: 790 AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQSSG 818
           AGYAGGIVSAAVDGMYAGFAVAK+FNLY+GDLETVLGKAQSSG
Sbjct: 661 AGYAGGIVSAAVDGMYAGFAVAKNFNLYNGDLETVLGKAQSSG 692

BLAST of HG10006289 vs. ExPASy TrEMBL
Match: A0A6J1FBU8 (uncharacterized protein LOC111444197 OS=Cucurbita moschata OX=3662 GN=LOC111444197 PE=4 SV=1)

HSP 1 Score: 1215.7 bits (3144), Expect = 0.0e+00
Identity = 618/703 (87.91%), Positives = 655/703 (93.17%), Query Frame = 0

Query: 130 MALLPTKLALGCPNSTLFSAAPRLSSLRLPPFRVSCANKRTGKRRYPSEKKKLKLKHKEV 189
           MALLP  L+LGCPNS+LFSA PRL S RLPPFRVSCA KRTGK+RYPSEKKKLKLKHKEV
Sbjct: 1   MALLPFNLSLGCPNSSLFSATPRLMSPRLPPFRVSCA-KRTGKKRYPSEKKKLKLKHKEV 60

Query: 190 LTTVENKFEGIWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 249
           LTTVENKFEGIWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI
Sbjct: 61  LTTVENKFEGIWRLFKLGVPVEKDPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVI 120

Query: 250 RKSFDARKMLKEPKFVYTVDMDVHRLLILEPRAQDFISDLEPKVGLMEHFAKEKVSNDVI 309
           RKSFDARKMLKEPKFVYTVDMDV+ LLILEPRA+DFISDLEPKVGL+EH  KEKVSNDVI
Sbjct: 121 RKSFDARKMLKEPKFVYTVDMDVNNLLILEPRARDFISDLEPKVGLIEHIVKEKVSNDVI 180

Query: 310 SIVHDLKSNQEVVGANGLNGHSGPYLRLSNGKPKVAVVGSGPSGLFASLVLAEFGADVTL 369
           SIVHDLKSN EVV  +GLNGHSGP++RL + KPK+AVVGSGPSGLFA+LVLAEFGADVTL
Sbjct: 181 SIVHDLKSNHEVVEESGLNGHSGPFMRLPSSKPKIAVVGSGPSGLFAALVLAEFGADVTL 240

Query: 370 IERGQPVEQRGRDIGALVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 429
           IERGQPVEQRGRDIGALVARRILEL+SNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK
Sbjct: 241 IERGQPVEQRGRDIGALVARRILELNSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMK 300

Query: 430 SLVYFGAPKKILLNGKPHLGTDRLIPLLRNIRQHLEMLGVSIYPILSLDVNIKFGTRVDD 489
           SLV+FGAP+ ILL+GKPHLGTDRL+PLLRNIRQHLEMLG +          +KFGTRVDD
Sbjct: 301 SLVHFGAPENILLSGKPHLGTDRLVPLLRNIRQHLEMLGAT----------VKFGTRVDD 360

Query: 490 LIEESGHIVGVKVSDSRDKSKLSNQKLEFDAIVLAVGHSARDVYQMLVSHNIPVVPKEFA 549
           LI+ESGH+VGVKVSDSRDK KL++QKLEFDA VLAVGHSARDVYQML+SHNIPVVPKEFA
Sbjct: 361 LIQESGHVVGVKVSDSRDKLKLNSQKLEFDATVLAVGHSARDVYQMLMSHNIPVVPKEFA 420

Query: 550 VGLRIEHPQELINSIQYSGLANEVEKGRGKVPVADYKVSKYVNVDTENPSSNSLAASRSC 609
           VGLRIEHPQ LINSIQYSGLANEVEKGRGKVPVADYKV+KYVN+DTE+PSSNS+AA+RSC
Sbjct: 421 VGLRIEHPQALINSIQYSGLANEVEKGRGKVPVADYKVAKYVNIDTEDPSSNSVAANRSC 480

Query: 610 YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFNDHGFHGPLAG 669
           YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFND GFHGPLAG
Sbjct: 481 YSFCMCPGGQVVLTSTNPGELCINGMSFSRRSSKWANAALVVTVSTKDFNDLGFHGPLAG 540

Query: 670 VEFQRELEQRAAVMGGGNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGH 729
           VEFQRE E+RAA+MGGGNFVLPVQTATDFMDR+L+VTSVPPSSYRLGVKASNLHELFPGH
Sbjct: 541 VEFQREFERRAALMGGGNFVLPVQTATDFMDRKLKVTSVPPSSYRLGVKASNLHELFPGH 600

Query: 730 ITEALQQSVLAFDQ---------------ETRTSSPVQIPRNPETYESTCLRGLYPVGEG 789
           ITEALQQS+LAFD+               ETRTSSPVQIPRN ETYEST LRGLYPVGEG
Sbjct: 601 ITEALQQSILAFDKELPGFLSSDALLHGVETRTSSPVQIPRNSETYESTSLRGLYPVGEG 660

Query: 790 AGYAGGIVSAAVDGMYAGFAVAKSFNLYHGDLETVLGKAQSSG 818
           AGYAGGIVSAAVDGMYAGFAVAK+FNLY+GDLETVLGKAQSSG
Sbjct: 661 AGYAGGIVSAAVDGMYAGFAVAKNFNLYNGDLETVLGKAQSSG 692

BLAST of HG10006289 vs. TAIR 10
Match: AT4G30720.1 (FAD/NAD(P)-binding oxidoreductase family protein )

HSP 1 Score: 891.3 bits (2302), Expect = 9.3e-259
Identity = 461/687 (67.10%), Positives = 547/687 (79.62%), Query Frame = 0

Query: 153 LSSLRLPPFRVSCANKRTGKRRYPSEKKKLKLKHKEVLTTVENKFEGIWRLFKLGVPVEK 212
           LS  R+   R+ CA KRTGKRRYPSE++KL+ + KE +  V+NK EG+WRL KLGVPV  
Sbjct: 29  LSYPRIQTHRILCAAKRTGKRRYPSERRKLRTEQKEAVAKVKNKLEGVWRLSKLGVPVGD 88

Query: 213 DPGKDFHGLSDALMQEIAKVLEFPVASLLPREAFSVIRKSFDARKMLKEPKFVYTVDMDV 272
           DPGKDF G+S+ L+Q IAKV+EFPVAS+LP EAFSVIRKSFDARK+LKE KFVYTVD+DV
Sbjct: 89  DPGKDFLGISEGLLQAIAKVIEFPVASMLPEEAFSVIRKSFDARKILKEAKFVYTVDLDV 148

Query: 273 HRLLILEPRAQDFISDLEPKVGLMEHFAKEK-VSNDVISIVHDLKS-NQEVVGANG---- 332
             LL LEPRA DFI  LEPK+GL+EH   EK VS D+IS+V+D K  N E          
Sbjct: 149 KTLLELEPRAHDFIFRLEPKIGLIEHVPTEKSVSGDLISVVNDCKRINSETASGEYEPQI 208

Query: 333 LNGHSGPYLR-LSNGKPKVAVVGSGPSGLFASLVLAEFGADVTLIERGQPVEQRGRDIGA 392
           +NG   P+       KPK+AVVG GPSGLFA+LVLAEFGADVTLIERGQ VE+RGRDIGA
Sbjct: 209 INGSGDPHHHGGGRSKPKIAVVGGGPSGLFAALVLAEFGADVTLIERGQAVEERGRDIGA 268

Query: 393 LVARRILELDSNFCFGEGGAGTWSDGKLVTRIGRNSGSVQAVMKSLVYFGAPKKILLNGK 452
           LV R+IL+++SNFCFGEGGAGTWSDGKLVTRIG+NS +V AV+K+LV FGAP  IL+NGK
Sbjct: 269 LVVRKILDMESNFCFGEGGAGTWSDGKLVTRIGKNSATVLAVLKTLVRFGAPDNILVNGK 328

Query: 453 PHLGTDRLIPLLRNIRQHLEMLGVSIYPILSLDVNIKFGTRVDDLIEESGHIVGVKVSDS 512
           PHLGTD+L+PLLRN R +L+  GV+          IKFGTRVDDL+ E   +VGV+VSDS
Sbjct: 329 PHLGTDKLVPLLRNFRHYLQSAGVT----------IKFGTRVDDLLVEDSRVVGVRVSDS 388

Query: 513 RDKSKLSNQKLEFDAIVLAVGHSARDVYQMLVSHNIPVVPKEFAVGLRIEHPQELINSIQ 572
            ++ + ++Q L+ DA+VLAVGHSARD Y+ML S N+ ++PK+FAVGLRIEHPQELINSIQ
Sbjct: 389 TNQLQTTSQNLKVDAVVLAVGHSARDTYEMLHSRNVELIPKDFAVGLRIEHPQELINSIQ 448

Query: 573 YSGLANEVEKGRGKVPVADYKVSKYVNVDTENPSSNSLAASRSCYSFCMCPGGQVVLTST 632
           YS LANEV KGRGKVPVADYKV +YVN  TE+ S +S  + RSCYSFCMCPGGQVVLTST
Sbjct: 449 YSDLANEVLKGRGKVPVADYKVVQYVNDKTEDLSQSS--SKRSCYSFCMCPGGQVVLTST 508

Query: 633 NPGELCINGMSFSRRSSKWANAALVVTVSTKDFNDHGFHGPLAGVEFQRELEQRAAVMGG 692
           NP ELCINGMSFSRRSSKWANAALVVTVS KDF+     GPLAG+EFQRE E+RAA+MGG
Sbjct: 509 NPTELCINGMSFSRRSSKWANAALVVTVSAKDFDVLNLKGPLAGIEFQREFERRAAIMGG 568

Query: 693 GNFVLPVQTATDFMDRRLRVTSVPPSSYRLGVKASNLHELFPGHITEALQQSVLAFDQ-- 752
           G+F +PVQ  TDF+  +L  T +PPSSYRLGVK++NLHELFP HITEAL++S+  F++  
Sbjct: 569 GDFTVPVQRVTDFLQNKLSETPLPPSSYRLGVKSANLHELFPAHITEALRESISMFEKEL 628

Query: 753 -------------ETRTSSPVQIPRNPETYESTCLRGLYPVGEGAGYAGGIVSAAVDGMY 812
                        ETRTSSPV+IPR+ ETYEST L+GLYPVGEGAGYAGGIVSAAVDGM+
Sbjct: 629 PGFISEEALLHGVETRTSSPVRIPRSNETYESTSLKGLYPVGEGAGYAGGIVSAAVDGMF 688

Query: 813 AGFAVAKSFNLYHGDLETVLGKAQSSG 818
           +GFAVAKSF+L+ G +E+V+GKAQ +G
Sbjct: 689 SGFAVAKSFDLFDGTIESVIGKAQGAG 703

BLAST of HG10006289 vs. TAIR 10
Match: AT2G29580.1 (CCCH-type zinc fingerfamily protein with RNA-binding domain )

HSP 1 Score: 642.5 bits (1656), Expect = 7.5e-184
Identity = 342/507 (67.46%), Positives = 389/507 (76.73%), Query Frame = 0

Query: 830  MAHRLLRDLEADGWERSDFPIICESCLGDNPYVRMTRADYDKECKICTRPFTVFRWRPGR 889
            MAHR+LRD EADGWERSDFPIICESCLGDNPYVRMT+A+YDKECKICTRPFTVFRWRPGR
Sbjct: 1    MAHRILRDHEADGWERSDFPIICESCLGDNPYVRMTKANYDKECKICTRPFTVFRWRPGR 60

Query: 890  DARYKKTEICQTCSKLKNVCQVCLLDLEYGLPVQVRDTALSINSNDAIPKSDVNREYFAE 949
            DARYKKTE+CQTC KLKNVCQVCLLDLEYGLPVQVRDTAL+I+++D+IPKSDVNRE+FAE
Sbjct: 61   DARYKKTEVCQTCCKLKNVCQVCLLDLEYGLPVQVRDTALNISTHDSIPKSDVNREFFAE 120

Query: 950  EHDRKARAGIDYESSYGKARPNDTILKLQRTTPYYKRNRAHVCSFYIRGECTRGSECPYR 1009
            EHDRK RAG+DYESS+GK RPNDTI  LQRTTPYYKRNRAH+CSF+IRGECTRG ECPYR
Sbjct: 121  EHDRKTRAGLDYESSFGKIRPNDTIRMLQRTTPYYKRNRAHICSFFIRGECTRGDECPYR 180

Query: 1010 HEMPETGELSQQNIKDRYYGVNDPVALKLLNKAGEMPSLEPPEDESIRTLYVGGLDARVT 1069
            HEMPETGELSQQNIKDRYYGVNDPVALKLL KAGEM +LE PED+SIRTLYVGGL++RV 
Sbjct: 181  HEMPETGELSQQNIKDRYYGVNDPVALKLLGKAGEMGTLESPEDQSIRTLYVGGLNSRVL 240

Query: 1070 EQDLRDNFYAHGEIESIRMVLQRACAFVTYTTREGAEKAAEELSNKLVIKGLRLKLMWGR 1129
            EQD+RD FYAHGEIESIR++ ++ACAFVTYTTREGAEKAAEELSN+LV+ G RLKL WGR
Sbjct: 241  EQDIRDQFYAHGEIESIRILAEKACAFVTYTTREGAEKAAEELSNRLVVNGQRLKLTWGR 300

Query: 1130 PQAPKAESEGSDEAKQAAVAHSGMLPRAVISQQHNQLHPPGTHDQPQAMHYFNIPPP-PP 1189
            PQ PK + +GS++  Q +VAHSG+LPRAVISQQ NQ  PP     P   +Y + PPP PP
Sbjct: 301  PQVPKPDQDGSNQ--QGSVAHSGLLPRAVISQQQNQ--PP-----PMLQYYMHPPPPQPP 360

Query: 1190 QQERAFYPSMDPQRMGALVSTHDVGVPPNGPTGSTETRPGSEKQHQQGHQFPYHSMHPPP 1249
             Q+R FYPSMDPQRMGA+ S+ + G   +   G++ +   S      GH +P H  +PPP
Sbjct: 361  HQDRPFYPSMDPQRMGAVSSSKESGSSTSDNRGASSS---SYTMPPHGH-YPQHQPYPPP 420

Query: 1250 PAQYQQQFYPPYGYMQHYPPYPPYHSNM------PPPPPSQSQSQPHPPSGLQQYQQQHS 1309
               Y     PPY   Q YPPY   HS          P P    + PHP S         +
Sbjct: 421  --SYGGYMQPPY---QQYPPYHHGHSQQADHDYPQQPGPGSRPNPPHPSS-------VSA 480

Query: 1310 TPPGSAPQSHGGASSVSAPLGSTPSVS 1330
             PP S   +  G+S  SA    T   S
Sbjct: 481  PPPDSVSAAPSGSSQQSADAAVTTGSS 482

BLAST of HG10006289 vs. TAIR 10
Match: AT1G07360.1 (CCCH-type zinc fingerfamily protein with RNA-binding domain )

HSP 1 Score: 638.3 bits (1645), Expect = 1.4e-182
Identity = 337/510 (66.08%), Positives = 388/510 (76.08%), Query Frame = 0

Query: 830  MAHRLLRDLEADGWERSDFPIICESCLGDNPYVRMTRADYDKECKICTRPFTVFRWRPGR 889
            MAHR+LRD EADGWERSDFPIICESCLGDNPYVRMT+A+YDKECKICTRPFTVFRWRPGR
Sbjct: 1    MAHRILRDHEADGWERSDFPIICESCLGDNPYVRMTKANYDKECKICTRPFTVFRWRPGR 60

Query: 890  DARYKKTEICQTCSKLKNVCQVCLLDLEYGLPVQVRDTALSINSNDAIPKSDVNREYFAE 949
            DARYKKTEICQTC KLKNVCQVCLLDLEYGLPVQVRDTAL+I+++D+IPKSDVNREYFAE
Sbjct: 61   DARYKKTEICQTCCKLKNVCQVCLLDLEYGLPVQVRDTALNISTHDSIPKSDVNREYFAE 120

Query: 950  EHDRKARAGIDYESSYGKARPNDTILKLQRTTPYYKRNRAHVCSFYIRGECTRGSECPYR 1009
            EHDRKARAG+DYESS+GK RPNDTILKLQRTTPYYKRNRAHVCSF+IRGECTRG+ECPYR
Sbjct: 121  EHDRKARAGLDYESSFGKMRPNDTILKLQRTTPYYKRNRAHVCSFFIRGECTRGAECPYR 180

Query: 1010 HEMPETGELSQQNIKDRYYGVNDPVALKLLNKAGEMPSLEPPEDESIRTLYVGGLDARVT 1069
            HEMPETGELSQQNIKDRYYGVNDPVA+KLL KAGEM +LE P+DESI+TLYVGGL++R+ 
Sbjct: 181  HEMPETGELSQQNIKDRYYGVNDPVAMKLLGKAGEMGTLESPDDESIKTLYVGGLNSRIL 240

Query: 1070 EQDLRDNFYAHGEIESIRMVLQRACAFVTYTTREGAEKAAEELSNKLVIKGLRLKLMWGR 1129
            EQD+RD FYAHGEIESIR++  +ACAFVTYT+REGAEKAA+ELSN+LVI G RLKL WGR
Sbjct: 241  EQDIRDQFYAHGEIESIRILADKACAFVTYTSREGAEKAAQELSNRLVINGQRLKLTWGR 300

Query: 1130 PQAPKAESEGSDEAKQAAVAHSGMLPRAVISQQHNQLHPPGTHDQPQAMHYFNIPPPPPQ 1189
               PK + +G+++  Q  VAHSG+LPRAVISQQHN         QP  M  + + PPP  
Sbjct: 301  ---PKPDQDGANQ--QGGVAHSGLLPRAVISQQHN---------QPPPMQQYYMHPPPAN 360

Query: 1190 QERAFYPSMDPQRMGALVSTHDVGVPPNGPTGSTETRPGSEKQHQQGHQFPYHSMHPPPP 1249
            Q++ +YPSMDPQRMGA++ST + G       GS+    G+       +  P H  +PP  
Sbjct: 361  QDKPYYPSMDPQRMGAVISTQEAG-------GSSTENNGAS---SSSYMMPPHQSYPP-- 420

Query: 1250 AQYQQQFYPPYGYM--QHYPPYPPYHSNMPPPPPSQSQSQPHPPSGLQQYQQQHSTPPGS 1309
                    PPYGYM   +   YPP H + P P    +     PP     Y QQ    PGS
Sbjct: 421  --------PPYGYMPSPYQQQYPPNHHHQPSPMQHYA-----PPPAAYPYPQQPG--PGS 469

Query: 1310 APQSHGGASSVSAPLGSTPSVSAPSSTSSE 1338
             P     A S  +P  +     APS +S +
Sbjct: 481  RPAPSPTAVSAISPDSAPAGSGAPSGSSQQ 469

BLAST of HG10006289 vs. TAIR 10
Match: AT5G07060.1 (CCCH-type zinc fingerfamily protein with RNA-binding domain )

HSP 1 Score: 454.1 bits (1167), Expect = 3.8e-127
Identity = 233/414 (56.28%), Positives = 284/414 (68.60%), Query Frame = 0

Query: 836  RDLEADGWERSDFPIICESCLGDNPYVRMTRADYDKECKICTRPFTVFRWRPGRDARYKK 895
            RD  ADGWE +DFPI CESC GDNPY+RMTRADYDKECKIC+RPFT FRWRPGR+AR+KK
Sbjct: 4    RDHGADGWESADFPITCESCFGDNPYMRMTRADYDKECKICSRPFTAFRWRPGRNARFKK 63

Query: 896  TEICQTCSKLKNVCQVCLLDLEYGLPVQVRDTALSINSNDAIPKSDVNREYFAEEHDRKA 955
            TEICQTCSKLKNVCQVCLLDL +GLPVQVRD+AL+INS+ ++P S VNREYFA+EHD K 
Sbjct: 64   TEICQTCSKLKNVCQVCLLDLGFGLPVQVRDSALNINSHYSVPMSHVNREYFADEHDPKT 123

Query: 956  RAGIDYESSYGKARPNDTILKLQRTTPYYKRNRAHVCSFYIRGECTRGSECPYRHEMPET 1015
            RAG+DYESS+GK +PNDTILKLQR TP Y++NR  +CSFY  G+C RG+EC +RHEMPET
Sbjct: 124  RAGLDYESSFGKMQPNDTILKLQRRTPSYEKNRPKICSFYTIGQCKRGAECSFRHEMPET 183

Query: 1016 GELSQQNIKDRYYGVNDPVALKLLNKAGEMPSLEPPEDESIRTLYVGGLDARVTEQDLRD 1075
            GELS QNI+DRYY VNDPVA+KLL KAGEM +LEPPEDESI+TLYVGGL++R+ EQD+ D
Sbjct: 184  GELSHQNIRDRYYSVNDPVAMKLLRKAGEMGTLEPPEDESIKTLYVGGLNSRIFEQDIHD 243

Query: 1076 NFYAHGEIESIRMVLQRACAFVTYTTREGAEKAAEELSNKLVIKGLRLKLMWGRPQAPKA 1135
            +FYA+GE+ESIR++ +                                          K 
Sbjct: 244  HFYAYGEMESIRVMAEDG----------------------------------------KY 303

Query: 1136 ESEGSDEAKQAAVAHSGMLPRAVISQQHNQLHPPGTHDQPQAMHYFNIPPPPPQQERAFY 1195
            +  GS++ +Q ++AH+G+     ISQQ NQ      H Q Q   Y+  PPPP   E + Y
Sbjct: 304  DQSGSNQQQQGSIAHTGL-----ISQQQNQ------HSQMQ--QYYMQPPPP--NEYSHY 354

Query: 1196 PSMDPQRMGALVSTHDVGVPPNGPTGSTETRPGSEKQHQQGHQFPYHSMHPPPP 1250
            PSMD QRMGA  ST +        +  + T   +       +  P H  +P PP
Sbjct: 364  PSMDTQRMGAAFSTQE--------SDGSSTSENNRAYSSYSYPMPPHQPYPTPP 354

BLAST of HG10006289 vs. TAIR 10
Match: AT5G07060.2 (CCCH-type zinc fingerfamily protein with RNA-binding domain )

HSP 1 Score: 454.1 bits (1167), Expect = 3.8e-127
Identity = 227/376 (60.37%), Positives = 273/376 (72.61%), Query Frame = 0

Query: 836  RDLEADGWERSDFPIICESCLGDNPYVRMTRADYDKECKICTRPFTVFRWRPGRDARYKK 895
            RD  ADGWE +DFPI CESC GDNPY+RMTRADYDKECKIC+RPFT FRWRPGR+AR+KK
Sbjct: 4    RDHGADGWESADFPITCESCFGDNPYMRMTRADYDKECKICSRPFTAFRWRPGRNARFKK 63

Query: 896  TEICQTCSKLKNVCQVCLLDLEYGLPVQVRDTALSINSNDAIPKSDVNREYFAEEHDRKA 955
            TEICQTCSKLKNVCQVCLLDL +GLPVQVRD+AL+INS+ ++P S VNREYFA+EHD K 
Sbjct: 64   TEICQTCSKLKNVCQVCLLDLGFGLPVQVRDSALNINSHYSVPMSHVNREYFADEHDPKT 123

Query: 956  RAGIDYESSYGKARPNDTILKLQRTTPYYKRNRAHVCSFYIRGECTRGSECPYRHEMPET 1015
            RAG+DYESS+GK +PNDTILKLQR TP Y++NR  +CSFY  G+C RG+EC +RHEMPET
Sbjct: 124  RAGLDYESSFGKMQPNDTILKLQRRTPSYEKNRPKICSFYTIGQCKRGAECSFRHEMPET 183

Query: 1016 GELSQQNIKDRYYGVNDPVALKLLNKAGEMPSLEPPEDESIRTLYVGGLDARVTEQDLRD 1075
            GELS QNI+DRYY VNDPVA+KLL KAGEM +LEPPEDESI+TLYVGGL++R+ EQD+ D
Sbjct: 184  GELSHQNIRDRYYSVNDPVAMKLLRKAGEMGTLEPPEDESIKTLYVGGLNSRIFEQDIHD 243

Query: 1076 NFYAHGEIESIRMVLQRACAFVTYTTREGAEKAAEELSNKLVIKGLRLKLMWGRPQAPKA 1135
            +FYA+GE+ESIR++ +                                          K 
Sbjct: 244  HFYAYGEMESIRVMAEDG----------------------------------------KY 303

Query: 1136 ESEGSDEAKQAAVAHSGMLPRAVISQQHNQLHPPGTHDQPQAMHYFNIPPPPPQQERAFY 1195
            +  GS++ +Q ++AH+G+     ISQQ NQ      H Q Q   Y+  PPPP   E + Y
Sbjct: 304  DQSGSNQQQQGSIAHTGL-----ISQQQNQ------HSQMQ--QYYMQPPPP--NEYSHY 324

Query: 1196 PSMDPQRMGALVSTHD 1212
            PSMD QRMGA  ST +
Sbjct: 364  PSMDTQRMGAAFSTQE 324

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038890425.1  0.0e+00   92.18         uncharacterized protein Cbei_0202 isoform X1 [Benincasa hispida] >XP_038890426.1...
XP_004144792.2  0.0e+00   90.33         uncharacterized protein LOC101214567 [Cucumis sativus] >KAE8651641.1 hypothetica...
XP_008452584.1  0.0e+00   90.33         PREDICTED: uncharacterized protein Cbei_0202 [Cucumis melo]
KAA0056117.1    0.0e+00   90.04         FAD/NAD(P)-binding oxidoreductase family protein [Cucumis melo var. makuwa] >TYJ...
KAG7020822.1    0.0e+00   88.76         hypothetical protein SDJN02_17510 [Cucurbita argyrosperma subsp. argyrosperma]

Match Name      E-value   Identity (%)  Description
Q5SNN4          1.0e-193  73.57         Zinc finger CCCH domain-containing protein 40 OS=Oryza sativa subsp. japonica OX...
Q6Z358          3.6e-191  72.38         Zinc finger CCCH domain-containing protein 49 OS=Oryza sativa subsp. japonica OX...
Q9ZW36          1.1e-182  67.46         Zinc finger CCCH domain-containing protein 25 OS=Arabidopsis thaliana OX=3702 GN...
Q9LNV5          2.0e-181  66.08         Zinc finger CCCH domain-containing protein 4 OS=Arabidopsis thaliana OX=3702 GN=...
Q9FL40          5.3e-126  56.28         Zinc finger CCCH domain-containing protein 53 OS=Arabidopsis thaliana OX=3702 GN...

Match Name      E-value   Identity (%)  Description
A0A0A0LM76      0.0e+00   89.20         FAD_binding_3 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G0365...
A0A1S3BV00      0.0e+00   90.33         uncharacterized protein Cbei_0202 OS=Cucumis melo OX=3656 GN=LOC103493563 PE=4 S...
A0A5D3B957      0.0e+00   90.04         FAD/NAD(P)-binding oxidoreductase family protein OS=Cucumis melo var. makuwa OX=...
A0A6J1HYE6      0.0e+00   88.05         uncharacterized protein LOC111468611 OS=Cucurbita maxima OX=3661 GN=LOC111468611...
A0A6J1FBU8      0.0e+00   87.91         uncharacterized protein LOC111444197 OS=Cucurbita moschata OX=3662 GN=LOC1114441...

Match Name      E-value   Identity (%)  Description
AT4G30720.1     9.3e-259  67.10         FAD/NAD(P)-binding oxidoreductase family protein
AT2G29580.1     7.5e-184  67.46         CCCH-type zinc fingerfamily protein with RNA-binding domain
AT1G07360.1     1.4e-182  66.08         CCCH-type zinc fingerfamily protein with RNA-binding domain
AT5G07060.1     3.8e-127  56.28         CCCH-type zinc fingerfamily protein with RNA-binding domain
AT5G07060.2     3.8e-127  60.37         CCCH-type zinc fingerfamily protein with RNA-binding domain

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                         Source       Source Term   Source Description                 Alignment
IPR000571  Zinc finger, CCCH-type                                  SMART        SM00356       c3hfinal6                          coord: 986..1012; e-value: 8.7E-6; score: 35.2
IPR000571  Zinc finger, CCCH-type                                  PROSITE      PS50103       ZF_C3H1                            coord: 986..1013; score: 14.1828
IPR000626  Ubiquitin-like domain                                   SMART        SM00213       ubq_7                              coord: 1..73; e-value: 1.7E-9; score: 47.5
IPR000626  Ubiquitin-like domain                                   PFAM         PF00240       ubiquitin                          coord: 3..74; e-value: 1.3E-8; score: 34.5
IPR000626  Ubiquitin-like domain                                   PROSITE      PS50053       UBIQUITIN_2                        coord: 1..77; score: 15.078674
IPR000504  RNA recognition motif domain                            SMART        SM00360       rrm1_1                             coord: 1058..1126; e-value: 4.8E-17; score: 72.6
IPR000504  RNA recognition motif domain                            PFAM         PF00076       RRM_1                              coord: 1059..1117; e-value: 5.6E-12; score: 45.3
IPR000504  RNA recognition motif domain                            PROSITE      PS50102       RRM                                coord: 1057..1130; score: 15.715584
None       No IPR available                                        GENE3D       3.10.20.90    -                                  coord: 1..90; e-value: 9.2E-17; score: 63.0
None       No IPR available                                        GENE3D       4.10.1000.10  -                                  coord: 978..1037; e-value: 1.2E-5; score: 27.2
None       No IPR available                                        MOBIDB_LITE  mobidb-lite   disorder_prediction                coord: 1163..1195
None       No IPR available                                        MOBIDB_LITE  mobidb-lite   disorder_prediction                coord: 1288..1341
None       No IPR available                                        MOBIDB_LITE  mobidb-lite   disorder_prediction                coord: 1269..1341
None       No IPR available                                        MOBIDB_LITE  mobidb-lite   disorder_prediction                coord: 1208..1255
None       No IPR available                                        MOBIDB_LITE  mobidb-lite   disorder_prediction                coord: 1215..1239
None       No IPR available                                        MOBIDB_LITE  mobidb-lite   disorder_prediction                coord: 1269..1287
None       No IPR available                                        CDD          cd12224       RRM_RBM22                          coord: 1056..1129; e-value: 5.69691E-43; score: 148.947
IPR036188  FAD/NAD(P)-binding domain superfamily                   GENE3D       3.50.50.60    -                                  coord: 336..571; e-value: 2.2E-67; score: 228.8
IPR036188  FAD/NAD(P)-binding domain superfamily                   GENE3D       3.50.50.60    -                                  coord: 712..801; e-value: 4.3E-10; score: 41.2
IPR036188  FAD/NAD(P)-binding domain superfamily                   SUPERFAMILY  51905         FAD/NAD(P)-binding domain          coord: 343..797
IPR002938  FAD-binding domain                                      PFAM         PF01494       FAD_binding_3                      coord: 343..381; e-value: 8.7E-6; score: 25.2
IPR012677  Nucleotide-binding alpha-beta plait domain superfamily  GENE3D       3.30.70.330   -                                  coord: 1040..1139; e-value: 1.5E-18; score: 68.8
IPR032297  Torus domain                                            PFAM         PF16131       Torus                              coord: 967..1023; e-value: 9.8E-7; score: 29.4
IPR028348  FAD dependent protein                                   PANTHER      PTHR42842     FAD/NAD(P)-BINDING OXIDOREDUCTASE  coord: 162..818
IPR036855  Zinc finger, CCCH-type superfamily                      SUPERFAMILY  90229         CCCH zinc finger                   coord: 988..1011
IPR029071  Ubiquitin-like domain superfamily                       SUPERFAMILY  54236         Ubiquitin-like                     coord: 2..77
IPR035979  RNA-binding domain superfamily                          SUPERFAMILY  54928         RNA-binding domain, RBD            coord: 1042..1132
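
The `coord:` values in the InterPro table above give each match as a start..end range on the protein. As a minimal Python sketch (the names `Hit`, `parse_coord` and `overlaps` are illustrative assumptions, and only four rows from the table are included), the ranges can be parsed and sorted to list the annotated regions in sequence order:

```python
from typing import NamedTuple, Tuple


class Hit(NamedTuple):
    name: str
    source: str
    start: int
    end: int


def parse_coord(text: str) -> Tuple[int, int]:
    """Parse a string such as 'coord: 986..1012' into (986, 1012)."""
    span = text.split(":", 1)[1].strip()
    start, end = span.split("..")
    return int(start), int(end)


def overlaps(a: Hit, b: Hit) -> bool:
    """True if the two closed intervals share at least one position."""
    return a.start <= b.end and b.start <= a.end


# Four rows copied from the InterPro table (description, source, alignment).
rows = [
    ("RNA recognition motif domain", "PROSITE", "coord: 1057..1130"),
    ("Ubiquitin-like domain", "PROSITE", "coord: 1..77"),
    ("Zinc finger, CCCH-type", "PROSITE", "coord: 986..1013"),
    ("FAD/NAD(P)-binding domain superfamily", "SUPERFAMILY", "coord: 343..797"),
]

# Sort the hits by start coordinate to obtain the order along the protein.
hits = sorted(
    (Hit(name, source, *parse_coord(coord)) for name, source, coord in rows),
    key=lambda h: h.start,
)

for h in hits:
    print(f"{h.start:>5}..{h.end:<5} {h.name} ({h.source})")
```

Sorting by start coordinate is a design choice for readability; `overlaps` can be used to check whether two matches cover the same stretch of the protein.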

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10006289.1  HG10006289.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0000974 Prp19 complex
cellular_component GO:0071006 U2-type catalytic step 1 spliceosome
cellular_component GO:0071007 U2-type catalytic step 2 spliceosome
molecular_function GO:0003677 DNA binding
molecular_function GO:0071949 FAD binding
molecular_function GO:0046872 metal ion binding
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0036002 pre-mRNA binding
molecular_function GO:0005515 protein binding
molecular_function GO:0017070 U6 snRNA binding
molecular_function GO:0003676 nucleic acid binding