HG10023039 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10023039
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: DUF724 domain-containing protein 7-like isoform X3
Location: Chr05: 30679743 .. 30694909 (-)
RNA-Seq Expression: HG10023039
Synteny: HG10023039
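
As a rough illustration of how the Location field in the Overview can be read, the Python sketch below parses that string and derives the span of the feature. It assumes 1-based, inclusive coordinates (a common convention for genome annotation, but not stated on this page), and the helper name parse_location is hypothetical.

```python
import re

# Hypothetical helper: matches strings such as "Chr05: 30679743 .. 30694909 (-)".
LOCATION_RE = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Split a location string into chromosome, start, end, strand and span length."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: coordinates are 1-based and inclusive, hence the +1.
        "length": end - start + 1,
    }


loc = parse_location("Chr05: 30679743 .. 30694909 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the feature spans 15,167 bp on the minus strand of Chr05.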
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGGAATTCGGCTCCCCTTCAACTAATACTCACACTCACCACCACAACCACCACCGTCATCACCAGCTTCCCTTCACAGTCGGCTCTGAAATCGAGGTCTCCATCGATGAAGAAGGCTTTAAGGGCGCTTTATTCAAAGCCACCATTTTGAAGCTCCCCACTACATTTTCTCCTTCAAAGAAGAAAAAGGCTTTAGTTGAGTACAAAACCCTTGTCACTGAAGACGGATCTACCCCTTTGAAGGAGCATGTTGATGCTCTTAGTTTAAGGCCTCTGCCTCCTGATACTGCTAATAAGGATTTCGAGGAATGCGACATTGTTGATGCCGCTGATAAAGATGGATGGTGGACTGGTGTCGTCTGTAAGGCTTTGGAAGGTGGGAGCTACTCTGTTTTGTTCAAAAATCCTATGCATGTCATGGATTTTCAGCGGAACCACTTGAGATTGCATCAGGATTGGGTTGATGGCAAGTGGGTTGTTCCTCAAAAGATGGTAACTTATCTGGGTTACTTTTCATTTTAAGAATTATTCGCTTGTTAGGATCAAGAGTGCATTAGGGTTTTTTTCTTTGCGTTTTAGAGATCTTTAATGCTCTTCATTTTCTGAATTAATGAGTTTAAATGAGTGGTTGTTCAGTTACAAGTTTAATGGCTTCTGCTCAGTCAATCAGTTAAGTTTTGTTTGACTTGTTAGAAAACTGATCGACGAGGACTATTACTACGGTTCACCTTGGTAGTTGGACGATTTTTTTTTTTTTTTTTTTTGGGGTTGGGTTGGGTTGGCTAAAGATTCCTAATTACCGTATCCTAAGTTAAAACGGTGTCCGAGCACCCAATTCCAGGTCTTCTTTTTGGCCACTAAGAGTATCTGGTGATTGTGTTGCCAACGATGGTAAGTTTGAAGCTGTAGTTAAAACTCCAACAACACATTCTTTTCTCTTGAAGAAAGATCCATCGGTTTGCATTCAAAAGGTTCCTTGGTTGGGGTTTAGTGTGAAGTTTACGCATTGTCAAAGGAATTGGATAGCGTAAGATGTTTTGGAAGTCTTTGGAGGGTATCTTTAGGAATAGCATTTTCCTTTTGTTTTTGGAATTTGGAATGTGGGAATGGATAGCAATCAAAACATGGTGGGGTTGGGGATGGGACGGTGTATACTCATGAAAATCATATGCGACTGATCTGCCAATCTGAGGGAAACATCCCCAGTATTTATCAAAATATTTTTATCTGAGAGTTCAGGCTAGACACTCCAATTTGAGGACGTGTTTTCTTTTGATAGGAGGTTTATTTCTTGATAATTTGTTAGATTCAGTTATAACCAATGACAGTAGACTTTTAGGCTACTGATTGGCCCAAGACTAGTAATCATGTGTAATTGTGAAACTGGATGTGTGTGTGGGTAAAAAAGAAGGATCATTAACAATAAAAAGTGGTCTATAGACAAATCAGTGTTGGAAGGTCTAAAATTTCTTTGTTGTATGATTTACATGTTACTTTCACCATGTTGATACGTTCTTCGATCTTGTCATTTATTATGCTATTTCTTGGTGTAAAAGTTCAAATCTTTTTGCTTCATATTCATACGCATCCCTACTAAACAACTGGGATAGCTTATTGTAATATCATAGATTGTGTATCCTCTTTGTGAATTTCACTCATAAATGAAATTGTTTTTTATTAAAAAAAAAAAGTTACTTTCACCCCAAAGTTTTTTAATTGTAAAGCATACACCATGAAACATGAAATTTCTGATATTATGCTAATTAGTATGTTGGAACTTTCAGGAGCACATGAAACTGTCTTACTACTTGTATTTCTCTGAAATGCTGAATCAGCAACAATTGCACTTTCTTTGACAAAATCCCACTTTTTATAGTGAGCAGTTGGCTTCCCATTGTGCAACTGGGTTTTGTCTTTGATAAATTAACGTCCTTTATCATTATGTTTAATGCTTCTCTTCTAATTTTACTCATTGCTTGAAAGATCATTCGCCACAA
AATCTTTCATTCTAACTTTTTGTGGGAGTTGTTATGTAGAAAGGTAACATCCGATATCTCTTGTTGTGCAACAGTTCTGCATAACTGTGTAGTCCATGCTACCTATTATTATTATTTTTTTTCCATTGCTTTAGTCTGCTTGGCAGTCTTAGTGACATCCAGTTTTATTTTTGCTCTGCACCTAGATCATCTTTGAGAACTTAGATATCATGATGAAAGTCATATGTTCATTCAATACATTTTACTTGCAATTAAAACAATTTCTTCCACTTTAGGATGCGTCAATCTTGAGAGATCAATTGAGTATCATTTCTGAGGATGCTAATGTACCTGAGAATGTCCAGCACGAAAGCTTAAAAAATACTGAAACTAACATTGAAAAAAAAAACTCTTATTCCATAAACTCAAGGAATGACTTGATGGAAAAGCCAAGTATTCATGATGAAAGTTCTGCCTCATTTGCCTTGACCTCGAGCAAGAGAAGAAGAAGTCTCAGTTCTAAGTCAAGAGTTTCAAATCCTTTAAAGAAGTTGGGAGAAGGAAATGTTCCTGGGATACCAGCAGCAGATGGGTCGAGGATGATGGAAAGCAAAACATCGAGGGGAAAAGCTTTCAGTAAAAGTGCCACACCGAACAGAGATCGCAGAAGAAGAAGCTACCTAAATTTTCATTGTGACGATGATAATGTTTCACCTAATAAATTTGAGAGTCCTAAGGGAGGCAAGAAACTGGTAATTTGTCCCTGTGCTTTGAATCTTGCATTACCTTTTTAATATTTATAGCAGTCCGAACATCACCTGGGAAATTATCATATTTTAATTAAGGGAAATCTTTCTAAATTATTGTGTCATTATTGAGTGCAAGCATGTCAATCACCTCCCCCCTTCATTTACTTATGTTTGTATGTATCAAAGTGGTCTTGCTAAGAACTTAATACAGTCACTGACGTGGATGTCCCAACATTTCCTTAGGCAGTATGTATATGAATTTCCTCATCCTTTCCTCAGTTTTTTCTACTTGCCATTACCTATAATATATTCCACTAAACCAGTTAAAACTATTATGGGGCATTCTGCTCTTCACTGTTTTTCTTGTTACAATGACTCTAAATTATTGAAGATGATTCAATTTCATGTGTAGAGAACTAAGGAAGATGTTGATGGAAGTGATGAGTTGAAAGAGCAAACATTAAGTCTCATAAATGGTAGCAGAGGAAACACACTTAAACGAAGCCAACGGACTCTAGTTACAGGTACACATAGCTCTTGATGGACATTGGCCGTTGATTTTTGCTTGAATAATTTCTTATTCTGCATCTCAGATAAGGAGGGGAAAGAAGATTATGATGTCTTAGAGACCATCCCTAAGGAAGTGACTACAAGCAATAAATCTGAAAGGAATAAACATTTGGCTTCAGATGAAAAACAGACACCAGTTAACAATTCTCTTCACCTTCCTGATGCGGTTGGAGATGGGGAGGAAAATTCAAATAATCAGACAAAAGAGAAGGGCATGGTAAGCAAACTAAAACGATTAACTTGATATAAATCGAATGGATTTTAATAATGAGTATGTCTTGTGGTGACTTCTAGGTTATGTTTTTTTAATATATTTTTTTCCGCGAAGACTAAAGCCTTGTTACTAATATATTACTATGCACAATGTCAGGCCTTAAGATCAATAGAAGTAAATGCCAAATTCTACACTCATCGATCCTGATAAGGTAAGGTGAGAGAGTGGGCCAATCTAGTGTGGTGTGAGTTGGGTATTTTTCTGACGATGTATCATGGTCTACCTCTTGGAGATAATCCGAGGTAGATTTCTTTTTAGAGTTTCGTGGAGGACAAGGTGAGAGAGACTAGCCTCCTGGAAGAGAAGCTACTTCTCAAAAAGAGGGAGACGTACTCTTATTCGTTCAGTTTTAAGTGGTATCACAATTTACTTTTTCTCTCTCTTTAGAGCCCTATGCTCAATGTGTAAAAAGCTTGGATAAGCTCATG
AGAGACTTCCTGTGGGAAGGTGTGGAAGAAGGAAACGGAGCTCACTTGGTGAATTAGGAGATAGTTGGGAAACTATTCACTTGGGAGGGTTGGAGGTTGGGAATTTAAGAACTTGTAATATTGCATTATTGGCAAAATGGTTATGACGTTTTACCCTTGAGACATCCTCCCTGTAGTCTAGGATTATCGTAAACAAGTACGACCCTCATTCTTTTGATTGGATGTCAGGTGGGGTTAAAGGCACATCTAGAAATCTATGGGAAGAAATATATTTTGTGCTCCCTTCCTTTTTCAAATCTTGTTCAATGTGTAGTGGCTGATGGGTGTGGCACCTTCTTTTGGGAGGATAGATGGTTGGAGGATCTTCCCTTGCGCAACAGATTTCCCAACCCTTCTGCCATTTATCTGCTTTGAAAAATCATTATGTTGCTGATTTTCTGTTGTACTCGGGGAGTTCTTCCTCTCTTTCGTTCGGTTTTTGTTGTCCTTTGTCTGATAGGGAAACGACGAAGGTGATGACCTTTCTTTACTTGATGATGTGAGCTTAGATAAGGGAGAAGGGATATTTGAATTTGGAGTCCTAGTCCTTGAGGGCTTTTCTTGTAAATCTTTCTTTAATCATTTATTGGATCCCTCTACTCGTGGTGAGTCTTTTTTTCAATTCTTTAGAAGCTGAAAATCCGGAAAAAGATGAAGTTATTTGTCTGGTAGGTTATTCATGGAAGAATCAATACCTTGGAGGGGTCGAGTAAGTTGTCATCCCTAGTTGGGCCTTTTTGCTGTATTCTCTGTCAAAGAGGGAAGGAAAATGTAGATCATCTATTATGGAATTGTAATTATGTTTGTACGGTTTGGAACTTCTTTGATGTGTTCGACATCCATTCAGCCCAACACAGAACTGTTAGTGATATGATCGAGGACCTCCTCCATCCGTCGTTTAGAGGAAAAGAAAGCTTACTTTGGAAGGTTGGGGTGTGTGCTTTGTTGTGGAGTATTTGTTGGGGGGGGGGGGGGGGGGGGCGGACACACTGGGATTTTTAGAGGGAGAGGGATCTGGTTGATGTTTGGTCTCTTGCGAAGTTTAATATTTGTCTTTGGGCATCGGTGACAAGTCCTTTTTGTAATTACTCTTTAGGTCATATTTTGTTAGATTATAGACCCTTCCTCTAGGGTGCTCCATTTTTTCGGTTTTTTTTTTTTATATGCCCTTGTATTCTATCATTTTTTTCTCAATGAAAATTGGTTCCCCCGTCCCCCCCCCCCCCCAAAAAAAAAAAAAAAAAAAAAATACTGTACACGATTCTTAAAGTTACAAAACAATATATTAACATACAAGATGTGGACACTTCTATTAAGTCATTCTAAAAGTTACTAACAAGCTAACAAGGTAAGAAACAGATGCAATGGAGTAGAGGATTGGTGATGACTCATCTCATACACATACTTGGTCATCCCAGTTAAAACACTATATTGTGACAAATAATTCAAGATCCTTCTAGAAAATTCTTATCTAAAGAAACATTCTTATAAAAAGAATTAATTGTTACAAATTTATAAGAAAATACTTACTCGTCCCTTCCCACATGCATGAAGAATCATCTCCACACCTTTATGACATATCTAGAAAGAAAAGGTGCAATTCTTCCTGTGGGAACTGTGCCCCTAGGCCATAAACACCCCTCCATATTGGTGTCATTTTTTTTCACATCATTTTTTCACTGCTCTGCTCCTATGTATTGACCTTCTGGAAAATTATTCTATCCTCCCACAAATGGTACACTGCCATGCCAAATGACCTGAAGCAGCTCTGCACCCATACATTAACGGGTCATCCACTCAAAAAAGGAGAAAAGCAGCACTGCCATGTCAAGCCCTCCCAACCTAAAACCTACTCCGGGTCATGGACCTTTTCTCATCATTAGTCACATGAAAAAGTGGCTTATCAGTAATAGTTTCTCCTCCTCCAACTACTCACCTGTGTTTCTTGTATCATTGTTAAA
ACCTGAATTGGTTCTAATATTGACTCCAGTCCTTTATTTTCAAGGCTTTCAACATATTTGGAGCTCTCGTTGTAGCCATTTGTAGTTTTGGATGAGGATGGCAAAATGGGGGGGGGGGGGGGGGGGGGGGGCGGACACACTGGGATTTTTAGAGGGAGAGGGATCTGGTTGATGTTTGGTCTCTTGCGAAGTTTAATATTTGTCTTTGGGCATCGGTGACAAGTCCTTTTTGTAATTACTCTTTAGGTCATATTTTGTTAGATTATAGACCCTTCCTCTAGGGTGCTCCATTTTTTCGGTTTTTTTTTTTTATATGCCCTTGTATTCTATCATTTTTTTCTCAATGAAAATTGGTTCCCCCGTCCCCCCCCCCCCAAAAAAAAAAAAAAAAAAATACTGTACACGATTCTTAAAGTTACAAAACAATATATTAACATACAAGATGTGGACACTTCTATTAAGTCATTCTAAAAGTTACTAACAAGCTAACAAGGTAAGAAACAGATGCAATGGAGTAGAGGATTGGTGATGACTCATCTCATACACATACTTGGTCATCCCAGTTAAAACACTATATTGTGACAAATAATTCAAGATCCTTCTAGAAAATTCTTATCTAAAGAAACATTCTTATAAAAAGAATTAATTGTTACAAATTTATAAGAAAATACTTACTCGTCCCTTCCCACATGCATGAAGAATCATCTCCACACCTTTATGACATATCTAGAAAGAAAAGGTGCAATTCTTCCTGTGGGAACTGTGCCCTAGGCCATAAACACCCCTCCATATTGGTGTCATTTTTTTCACATCATTTTTTCACTGCTCTGCTCCTATGTATTGACCTTCTGGAAAATTATTCTATCCTCCCACAAATGGTACACTGCCATGCCAAATGACCTGAAGCAGCTCTGCACCCATACATTAACGGGTCATCCACTCAAAAAAGGAGAAAAGCAGCACTGCCATGTCAAGCCCTCCCAACCTAAAACCTACTCCGGGTCATGGACCTTTTCTCATCATTAGTCACATGAAAAAGTGGCTTATCAGTAATAGTTTCTCCTCCTCCAACTACTCACCTGTGTTTCTTGTATCATTGTTAAAACCTGAATTGGTTCTAATATTGACTCCAGTCCTTTATTTCAAGGCTTTCAACATATTTGGAGCTCTCGTTGTAGCCATTTGTAGTTTTGGATGAGGATGGCAAAATGGGGGGGGGGGGGGGGGGGGGGCGGACACACTGGGATTTTTAGAGGGAGAGGGATCTGGTTGATGTTTGGTCTCTTGCGAAGTTTAATATTTGTCTTTGGGCATCGGTGACAAGTCCTTTTTGTAATTACTCTTTAGGTCATATTTTGTTAGATTATAGACCCTTCCTCTAGGGTGCTCCATTTTTCGGTTTTTTTTTTTTATATGCCCTTGTATTCTATCATTTTTTTCTCAATGAAAATTGGTTCCCCCGTCCCCCCCCCCCAAAAAAAAAAAAAAAAAAATACTGTACACGATTCTTAAAGTTACAAAACAATATATTAACATACAAGATGTGGACACTTCTATTAAGTCATTCTAAAAGTTACTAACAAGCTAACAAGGTAAGAAACAGATGCAATGGAGTAGAGGATTGGTGATGACTCATCTCATACACATACTTGGTCATCCCAGTTAAAACACTATATTGTGACAAATAATTCAAGATCCTTCTAGAAAATTCTTATCTAAAGAAACATTCTTATAAAAAGAATTAATTGTTACAAATTTATAAGAAAATACTTACTCGTCCCTTCCCACATGCATGAAGAATCATCTCCACACCTTTATGACATATCTAGAAAGAAAAGGTGCAATTCTTCCTGTGGGAACTGTGCCCCTAGGCCATAAACACCCTCCATATTGGTGTCATTTTTTTTCACATCATTTTTCACTGCTCTGCTCCTATGTATTGACCTTCTGGAAAATTATTCTATCCTCCCACAAATGGTACACTGCCATGCCAAATGACC
TGAAGCAGCTCTGCACCCATACATTAACGGGTCATCCACTCAAAAAAGGAGAAAAGCAGCACTGCCATGTCAAGCCCTCCCAACCTAAAACCTACTCCGGGTCATGGACCTTTTCTCATCATTAGTCACATGAAAAAGTGGCTTATCAGTAATAGTTTCTCCTCCTCCAACTACTCACCTGTGTTTCTTGTATCATTGTTAAAACCTGAATTGGTTCTAATATTGACTCCAGTCCTTTATTTTCAAGGCTTTCAACATATTTGGAGCTCTCGTTGTAGCCATTTGTAGTTTTGGATGAGGATGGCAAAATGGGGGGGGGGGGGGGGGGGGGGGGGGGCGGACACACTGGGATTTTTAGAGGGAGAGGGATCTGGTTGATGTTTGGTCTCTTGCGAAGTTTAATATTTGTCTTTGGGCATCGGTGACAAGTCCTTTTTGTAATTACTCTTTAGGTCATATTTTGTTAGATTATAGACCCTTCCTCTAGGGTGCTCCATTTTTTCGGTTTTTTTTTTTATATGCCCTTGTATTCTATCATTTTTTCTCAATGAAAATTGGTTCCCCCGTCCCCCCCCCCCAAAAAAAAAAAAAAAAAAAAAAAAATACTGTACACGATTCTTAAAGTTACAAAACAATATATTAACATACAAGATGTGGACACTTCTATTAAGTCATTCTAAAAGTTACTAACAAGCTAACAAGGTAAGAAACAGATGCAATGGAGTAGAGGATTGGTGATGACTCATCTCATACACATACTTGGTCATCCCAGTTAAAACACTATATTGTGACAAATAATTCAAGATCCTTCTAGAAAATTCTTATCTAAAGAAACATTCTTATAAAAAGAATTAATTGTTACAAATTTATAAGAAAATACTTACTCGTCCCTTCCCACATGCATGAAGAATCATCTCCACACCTTTATGACATATCTAGAAAGAAAAGGTGCAATTCTTCCTGTGGGAACTGTGCCCCTAGGCCATAAACACCCCTCCATATTGGTGTCATTTTTTTTCACATCATTTTTTCACTGCTCTGCTCCTATGTATTGACCTTCTGGAAAATTATTCTATCCTCCCACAAATGGTACACTGCCATGCCAAATGACCTGAAGCAGCTCTGCACCCATACATTAACGGGTCATCCACTCAAAAAGGAGAAAAGCAGCACTGCCATGTCAAGCCCTCCCAACCTAAAACCTACTCCGGGTCATGGACCTTTTCTCATCATTAGTCACATGAAAAAGTGGCTTATCAGTAATAGTTTCTCCTCCTCCAACTACTCACCTGTGTTTCTTGTATCATTGTTAAAACCTGAATTGGTTCTAATATTGACTCCAGTCCTTTATTTTCAAGGCTTTCAACATATTTGGAGCTCTCGTTGTAGCCATTTGTAGTTTTGGATGAGGATGGCAAAATGGGGGGGGGGGGGGGGGGGGGGCGGACACACTGGGATTTTTAGAGGGAGAGGGATCTGGTTGATGTTTGGTCTCTTGCGAAGTTTAATATTTGTCTTTGGGCATCGGTGACAAGTCCTTTTTGTAATTACTCTTTAGGTCATATTTTGTTAGATTATAGACCCTTCCTCTAGGGTGCTCCATTTTTTCGGTTTTTTTTTTTTATATGCCCTTGTATTCTATCATTTTTTTCTCAATGAAAATTGGTTCCCCGTCCCCCCCCCCCCCCCAAAAAAAAAAAAAAAAATACTGTACACGATTCTTAAAGTTACAAAACAATATATTAACATACAAGATGTGGACACTTCTATTAAGTCATTCTAAAAGTTACTAACAAGCTAACAAGGTAAGAAACAGATGCAATGGAGTAGAGGATTGGTGATGACTCATCTCATACACATACTTGGTCATCCCAGTTAAAACACTATATTGTGACAAATAATTCAAGATCCTTCTAGAAAATTCTTATCTAAAGAAACATTCTTATAAAAAGAATTAATTGTTACAAATTTATAAGAAAATACTTACTCGTCCCTTCCCA
CATGCATGAAGAATCATCTCCACACCTTTATGACATATCTAGAAAGAAAAGGTGCAATTCTTCCTGTGGGAACTGTGCCCCTAGGCCATAAACACCCCTCCATATTGGTGTCATTTTTTTTCACATCATTTTTTCACTGCTCTGCTCCTATGTATTGACCTTCTGGAAAATTATTCTATCCTCCCACAAATGGTACACTGCCATGCCAAATGACCTGAAGCAGCTCTGCACCCATACATTAACGGGTCATCCACTCAAAAAAGGAGAAAAGCAGCACTGCCATGTCAAGCCCTCCCAACCTAAAACCTACTCCGGGTCATGGACCTTTTCTCATCATTAGTCACATGAAAAAGTGGCTTATCAGTAATAGTTTCTCCTCCTCCAACTACTCACCTGTGTTTCTTGTATCATTGTTAAAACCTGAATTGGTTCTAATATTGACTCCAGTCCTTTATTTTCAAGGCTTTCAACATATTTGGAGCTCTCGTTGTAGCCATTTGTAGTTTTGGATGAGGATGGCAAAATGGGTTGGCCTTTTGGCATGCTCTGGTCCCTTGTTTCTTACGGTTCTTTCTTTCACACATTTTAATGAAATCATGTTTGCCTGTTAAAAGAAATTGTTCCAATAAAAATACTATTTGATAGTACTTTTGGATGCTTTTAGTATTTACCATTGGCAAGAGGAACGCCATCCAAAATGTAGCAATATTGTTGGGGTCATGTGGGGTCACTTATCATGTCTCAGCCTTACACTGATAACTCAATAAGGTTAACCTGCTCTGTAGTTCAGGTCTTTGACCAGTTCTGAGATTTGGCATGTGTTTTGATTTACATCTTCTTCAGTGTGCTTGGAGTTTGATAGGATTGATGATGTCTATTCTTTATCAGGAACCTGAGCAGCAAGAAGCTACTGAAAACAGTGATAAAAGAAAGAGGGGAAGGCCTCGTAAAATCATGCAGGTAAGATGCCAAAGAGGTACATCAATCTTCAGTCTGTTTGAACCCCATAAAAACACCACTCACTAGGAAACCTCCCTATTGAGAATATACCAAATCTCCCACAAAATTGGTAAAGTGTTATGTTGGAGCAGCAGATCTTCAAACAGTGAAGAGGCCAGCCAAGGTAAAAGGATGATGTGCTAGAGGTTCCCCATAGACTTTAGGCAACGGAGATCTTTGAACTTCAAAAAAGTGTCTCCTAAATCAGGAACCTAGAATCTAGGCAAAGAAGGAATGAGCAGAAGAGTGTGATAGATCTTGACCTGCCTTGCACAAAGAATGTACGTATTAGGGTTGAGAACATGGACGGTAAAAGTGTATCATACCTTTTGTAAAACCCTTTCGGGAGTGTTTGGAGCAAGGAGTGGAATAGTGAGCTCTATGGAGTTGTGAACTCATGAGACCCACGTAAGGAGTTGATAAGGTATAAAATGATGTTTTTTAGTGGTGGGATCCACAAACGTTGGGCCAAACAAGGAGTAGAGTTCCCAACTCCTTAGACCAAATACACCCTTCAGAGCTATTTGAGCTTCTAAGTATCAATGCCACAAAACTAAAAGTCCTTTGAAATTCATATTTAAGGACCCTAGAACTCTGTTGCAAGAATCTTCTCACGGTATGTTCCTAAAGGATTTGGGAACTTTTTCCTTCAATTATCTTGGTTCATCCTTCAATACAGCAAAAATTCCTTGGGCTTTCATAGAATAATTTAGTCATTGAACATTAATTTGTAACGATTTAGTCCACTTTGAAAATTGTAACAATTTAATCTCCAAATTTTAATAAGTAACACTATAGTCCTTGTGCTTTACAAAGTTTAATGATTTAGTCCATCTCATGCAAAATATTATTAAGATTTAATGAAATTTCTTATAAACTATAAATAGTAAGTAGTTACGTAATAAAATTGACATTTAATGTTGATAAAATCCATTTGACTAAATAGTCACAAGTTTGTAAGTACAGAGACTAAATTGTTACAAATTGAAGTTAAGGGCTAA
ATTGTTACTTTTATGAAAGTTCAGGGACCAAAACTGTTTCTTAACTTAGAGAATGGAAAAACAAGAAACAAAAAATCACTCAAATCTTTTATGATGAAAGAAAAAGTGAGGGCATGTGATCTAATAATTGGTTTCAAAGATTGCTAGTTGAGCGGCTTCCACAAAATTTTTTCAGACTCTCCTTTTTCCTCAATTCTATCAGCTAAGGATCTACCTTATAGATAATATAGGACACCAAATCATCTCCTCTCTTATTATTAATTTATGAATAACCAAGCTTTCATTAAATGTAAGAATACAGGGGCTAACAACAAAAAAGACCCACAAAATGGGGAGACCAACTTAAGAAAAGGGTTTTCAATCAAGCACAATGAGACCTAACGAATAATTACAAAAAAATCTTCGCAACCAACGCCCACAACGGAACATCAAATCTGTTAAGGAACCAAATATCACGCAAAGAATGCTTAATCCCTCTAAAAAACTCTACTGTTCCTCTCAACACAAAAGCTTCACAAAATAGCACACCCTGACCTGCAACACAAAGTGCCTTGTTCATGAGGGGGGAGGGGGGTGTGAAGTAGGAACATCATCTCCTTAAAGATCCTGTTATTTTAATACGACAGTTCTATCAAAGGAAAGGAAAATAATATCCATCTGAGCCTAGTCTGCACAATTCCCTGTCTCCCAAAGAAAAGACACGTTCCCCATAGATCTTGACAGTTGATTCCAGTATTCTAGCTAGTTAGAGCTTACCTCTGTAACAGTTATTGTCCCTTCCTAATCACGATATACCTCAGATTGTCATGGAGGAGGAAAAAGATGAAACTGAAACCACCAGATTCAGTGCTAAGGAATCTTGCTTCCAACTGGAAAGTTGAAATTCTTCTAGTCATTGTGGGGTGGGGGCACGAAAGGAATGATTGGTACTAAAATCTCTGAAGCTTACAAAATTACCCTTCATTTTTTTCTTAAGACAAAAGCTTGAGAGACTGCCGAATATTTGTTATCAAATGTTATATCATATCCATCTGACCAAGTTTTTGCTTTTCCCAGTGTAGAGAGACATGATTCTCATGGATCTTGTTTTCTACATAGTCAGAGATCTCTGTAACTGTTACTTTCTTTCCTAACCACAAGGTACCCCCCTCAGAAGGCTGTGGAATAGGGAGAAGATTACTGAATCTTGTGAAGAAGAGATTCTTCCAAGGATATATATTGCTTCAAGCTTGAAAAGTTGAAGAAGTTATAGCTAATTTTGGGGTTGGGCATGAAAGAAATGAATGATACTCAGATCTTTGAACTTTAAAGCTTACAAACTGAACCATCATAACTCTCTTAAGACAGGAAGTTGAGGGACTACTGAAAACCGTATCGTGTAATAGATTAAAAGCACTTGCCATAATGTGAAAGCACCAAAAAACAAAGCATATGGCTCAAGAAATTCAACCAGCATTTTTACCCTCGCAAGCAAAGACTGTTGGATCACAAATATTGTTATTTGTAAATGTAAATTGTGAACAAATTAGAAAAAATCCACTCAACCAAATGAGATCAGGAGTCCACCCATCTAGGAGTTTAAGGGAAAGTACACCGTCTTCCACAGACAACACCAACATTTCCTCCAGCTGCGTGGAATATTAAGAAGTCTAGTTATATAATGGTGTATGTGGGATCATTTGAAACGTCTCAGGTTTCAGCCTTTCACTGATAACTCATTTAGTTTAACCAGCTCTGTAGTTCAGGACCCTCAGATCTTTCACCATTTTGTGAGATTTGGCAGGTGTTTTATTACCTCTTCTTCAGTGTGCTTAGAGTTTGATAGGATTGATGTTTTATATCTTCATCAGGAACCTGGGCAGCAACAAGCTAGTAAAAACAGTTACAAAAGAAAGAGAGGACGGCCTCGAAAATTGATGATAGTTCCAACAACTGCAGAAGGTATCCTGTCCTTACGCCTGTGATCATGGGTTTCTTTTTATATATTTTTTTTTTACATT
ACATATAACTGTGATAAAGATTGGAGTTTTTGTGTTACAGATATGGAGCAAGATGGAAGTGGATGGAAGCCCGAGAAAGCAACTGTAAAAAGTTGTGTAACTGGTGAGGCTGCTATTCTTTAATGTCAAATATCCTTAAACCCGTACAGTAGTGTATTTGTTTCACCTTTCTGAAATTATCATTTCACTAGCCCAAAGAACCAAAAGAAAAAAGGCTTTAGGAAAATGCCAATGTATTTTGTGGATTTTCAAGATCTGAACAGAAGGAACGGGAAGGAACTCTCAGCAAACAAGACAAATGGGACTGGAACTAACAGTGTCGACGATGATGATCGACCACTGTTAATGTGGCTTGGAGGAATGCAAGGTTCAGCAAGCAACAATTCGTTGAGTATGTATACAAAACTTACACTAGCACATCAGAAATGAGTACTGTTTGTGGAGATTTTGATTGTATTTGTTTCAATTTTAGAATTAGGACAAACATATGGTTCCAAGCGAAGGACAAAAGGGAGTGAACAAGTAGATGCTGTGAATAAGGTGAGAACAGTTGATGGAACGCCTGAAAATGAAGTGGTCAAGAATCAAGGTTGGCCTTTTGTAAAGAATTCACCTGTCTGGAGTGCCATTGATTCTTTAGGAGTCTTCAAGCAAATTCCACAAAAGCCTCATTTCCACCCTTTAAGTACATACAAGGAGGAATGTCGCGAAGGATTAGCTATTGGCTGTATGGTAACTTTTGCAAGTTTGGTTGAAAAGATAACCAAACTACAATTTGATTATCCGAGATACATTTTCGAAAGCACGTTGGCCAGTCTATATGACTTGGAACAACATGGATTCAATATTTCAATGCTCTGCAATCGAGTGAATGAGCTACTATTTATCAAAGATACTGAAATGAGATGCATAGAGGAAACAAAAGTAGCAGAAAACAAAATATTGGAGCATACCAAAAACAAAACTAAGTTGGCAGAAGAGAGTAAGGCTATTGAACAGAAGATAATAGAGCTGCAGAGGAGACAATCATCAATCAAACTGGAAATTGAGACTAATGACAACGAGATCGAGGCACTGCAATCACATGTGGAGACCATCAGGGAATGTACCATGAGTACTAAGCTACATTTTGAAAACCAGATTGCCCTCCCCTTGTGGCCACTTTGA

mRNA sequence

ATGGGGGAATTCGGCTCCCCTTCAACTAATACTCACACTCACCACCACAACCACCACCGTCATCACCAGCTTCCCTTCACAGTCGGCTCTGAAATCGAGGTCTCCATCGATGAAGAAGGCTTTAAGGGCGCTTTATTCAAAGCCACCATTTTGAAGCTCCCCACTACATTTTCTCCTTCAAAGAAGAAAAAGGCTTTAGTTGAGTACAAAACCCTTGTCACTGAAGACGGATCTACCCCTTTGAAGGAGCATGTTGATGCTCTTAGTTTAAGGCCTCTGCCTCCTGATACTGCTAATAAGGATTTCGAGGAATGCGACATTGTTGATGCCGCTGATAAAGATGGATGGTGGACTGGTGTCGTCTGTAAGGCTTTGGAAGGTGGGAGCTACTCTGTTTTGTTCAAAAATCCTATGCATGTCATGGATTTTCAGCGGAACCACTTGAGATTGCATCAGGATTGGGTTGATGGCAAGTGGGTTGTTCCTCAAAAGATGGATGCGTCAATCTTGAGAGATCAATTGAGTATCATTTCTGAGGATGCTAATGTACCTGAGAATGTCCAGCACGAAAGCTTAAAAAATACTGAAACTAACATTGAAAAAAAAAACTCTTATTCCATAAACTCAAGGAATGACTTGATGGAAAAGCCAAGTATTCATGATGAAAGTTCTGCCTCATTTGCCTTGACCTCGAGCAAGAGAAGAAGAAGTCTCAGTTCTAAGTCAAGAGTTTCAAATCCTTTAAAGAAGTTGGGAGAAGGAAATGTTCCTGGGATACCAGCAGCAGATGGGTCGAGGATGATGGAAAGCAAAACATCGAGGGGAAAAGCTTTCAGTAAAAGTGCCACACCGAACAGAGATCGCAGAAGAAGAAGCTACCTAAATTTTCATTGTGACGATGATAATGTTTCACCTAATAAATTTGAGAGTCCTAAGGGAGGCAAGAAACTGAGAACTAAGGAAGATGTTGATGGAAGTGATGAGTTGAAAGAGCAAACATTAAGTCTCATAAATGGTAGCAGAGGAAACACACTTAAACGAAGCCAACGGACTCTAGTTACAGATAAGGAGGGGAAAGAAGATTATGATGTCTTAGAGACCATCCCTAAGGAAGTGACTACAAGCAATAAATCTGAAAGGAATAAACATTTGGCTTCAGATGAAAAACAGACACCAGTTAACAATTCTCTTCACCTTCCTGATGCGGTTGGAGATGGGGAGGAAAATTCAAATAATCAGACAAAAGAGAAGGGCATGGAACCTGAGCAGCAAGAAGCTACTGAAAACAGTGATAAAAGAAAGAGGGGAAGGCCTCGTAAAATCATGCAGGAACCTGGGCAGCAACAAGCTAGTAAAAACAGTTACAAAAGAAAGAGAGGACGGCCTCGAAAATTGATGATAGTTCCAACAACTGCAGAAGATATGGAGCAAGATGGAAGTGGATGGAAGCCCGAGAAAGCAACTGTAAAAAGTTGTGTAACTGATCTGAACAGAAGGAACGGGAAGGAACTCTCAGCAAACAAGACAAATGGGACTGGAACTAACAGTGTCGACGATGATGATCGACCACTGTTAATGTGGCTTGGAGGAATGCAAGGTTCAGCAAGCAACAATTCGTTGAAATTAGGACAAACATATGGTTCCAAGCGAAGGACAAAAGGGAGTGAACAAGTAGATGCTGTGAATAAGGTGAGAACAGTTGATGGAACGCCTGAAAATGAAGTGGTCAAGAATCAAGGTTGGCCTTTTGTAAAGAATTCACCTGTCTGGAGTGCCATTGATTCTTTAGGAGTCTTCAAGCAAATTCCACAAAAGCCTCATTTCCACCCTTTAAGTACATACAAGGAGGAATGTCGCGAAGGATTAGCTATTGGCTGTATGGTAACTTTTGCAAGTTTGGTTGAAAAGATAACCAAACTACAATTTGATTATCCGAGATACATTTTCGAAAGCACGTTGGCCAGTCTATATGACTTGGAACAACATGGATTCAATATTTC
AATGCTCTGCAATCGAGTGAATGAGCTACTATTTATCAAAGATACTGAAATGAGATGCATAGAGGAAACAAAAGTAGCAGAAAACAAAATATTGGAGCATACCAAAAACAAAACTAAGTTGGCAGAAGAGAGTAAGGCTATTGAACAGAAGATAATAGAGCTGCAGAGGAGACAATCATCAATCAAACTGGAAATTGAGACTAATGACAACGAGATCGAGGCACTGCAATCACATGTGGAGACCATCAGGGAATGTACCATGAGTACTAAGCTACATTTTGAAAACCAGATTGCCCTCCCCTTGTGGCCACTTTGA

Coding sequence (CDS)

ATGGGGGAATTCGGCTCCCCTTCAACTAATACTCACACTCACCACCACAACCACCACCGTCATCACCAGCTTCCCTTCACAGTCGGCTCTGAAATCGAGGTCTCCATCGATGAAGAAGGCTTTAAGGGCGCTTTATTCAAAGCCACCATTTTGAAGCTCCCCACTACATTTTCTCCTTCAAAGAAGAAAAAGGCTTTAGTTGAGTACAAAACCCTTGTCACTGAAGACGGATCTACCCCTTTGAAGGAGCATGTTGATGCTCTTAGTTTAAGGCCTCTGCCTCCTGATACTGCTAATAAGGATTTCGAGGAATGCGACATTGTTGATGCCGCTGATAAAGATGGATGGTGGACTGGTGTCGTCTGTAAGGCTTTGGAAGGTGGGAGCTACTCTGTTTTGTTCAAAAATCCTATGCATGTCATGGATTTTCAGCGGAACCACTTGAGATTGCATCAGGATTGGGTTGATGGCAAGTGGGTTGTTCCTCAAAAGATGGATGCGTCAATCTTGAGAGATCAATTGAGTATCATTTCTGAGGATGCTAATGTACCTGAGAATGTCCAGCACGAAAGCTTAAAAAATACTGAAACTAACATTGAAAAAAAAAACTCTTATTCCATAAACTCAAGGAATGACTTGATGGAAAAGCCAAGTATTCATGATGAAAGTTCTGCCTCATTTGCCTTGACCTCGAGCAAGAGAAGAAGAAGTCTCAGTTCTAAGTCAAGAGTTTCAAATCCTTTAAAGAAGTTGGGAGAAGGAAATGTTCCTGGGATACCAGCAGCAGATGGGTCGAGGATGATGGAAAGCAAAACATCGAGGGGAAAAGCTTTCAGTAAAAGTGCCACACCGAACAGAGATCGCAGAAGAAGAAGCTACCTAAATTTTCATTGTGACGATGATAATGTTTCACCTAATAAATTTGAGAGTCCTAAGGGAGGCAAGAAACTGAGAACTAAGGAAGATGTTGATGGAAGTGATGAGTTGAAAGAGCAAACATTAAGTCTCATAAATGGTAGCAGAGGAAACACACTTAAACGAAGCCAACGGACTCTAGTTACAGATAAGGAGGGGAAAGAAGATTATGATGTCTTAGAGACCATCCCTAAGGAAGTGACTACAAGCAATAAATCTGAAAGGAATAAACATTTGGCTTCAGATGAAAAACAGACACCAGTTAACAATTCTCTTCACCTTCCTGATGCGGTTGGAGATGGGGAGGAAAATTCAAATAATCAGACAAAAGAGAAGGGCATGGAACCTGAGCAGCAAGAAGCTACTGAAAACAGTGATAAAAGAAAGAGGGGAAGGCCTCGTAAAATCATGCAGGAACCTGGGCAGCAACAAGCTAGTAAAAACAGTTACAAAAGAAAGAGAGGACGGCCTCGAAAATTGATGATAGTTCCAACAACTGCAGAAGATATGGAGCAAGATGGAAGTGGATGGAAGCCCGAGAAAGCAACTGTAAAAAGTTGTGTAACTGATCTGAACAGAAGGAACGGGAAGGAACTCTCAGCAAACAAGACAAATGGGACTGGAACTAACAGTGTCGACGATGATGATCGACCACTGTTAATGTGGCTTGGAGGAATGCAAGGTTCAGCAAGCAACAATTCGTTGAAATTAGGACAAACATATGGTTCCAAGCGAAGGACAAAAGGGAGTGAACAAGTAGATGCTGTGAATAAGGTGAGAACAGTTGATGGAACGCCTGAAAATGAAGTGGTCAAGAATCAAGGTTGGCCTTTTGTAAAGAATTCACCTGTCTGGAGTGCCATTGATTCTTTAGGAGTCTTCAAGCAAATTCCACAAAAGCCTCATTTCCACCCTTTAAGTACATACAAGGAGGAATGTCGCGAAGGATTAGCTATTGGCTGTATGGTAACTTTTGCAAGTTTGGTTGAAAAGATAACCAAACTACAATTTGATTATCCGAGATACATTTTCGAAAGCACGTTGGCCAGTCTATATGACTTGGAACAACATGGATTCAATATTTC
AATGCTCTGCAATCGAGTGAATGAGCTACTATTTATCAAAGATACTGAAATGAGATGCATAGAGGAAACAAAAGTAGCAGAAAACAAAATATTGGAGCATACCAAAAACAAAACTAAGTTGGCAGAAGAGAGTAAGGCTATTGAACAGAAGATAATAGAGCTGCAGAGGAGACAATCATCAATCAAACTGGAAATTGAGACTAATGACAACGAGATCGAGGCACTGCAATCACATGTGGAGACCATCAGGGAATGTACCATGAGTACTAAGCTACATTTTGAAAACCAGATTGCCCTCCCCTTGTGGCCACTTTGA

Protein sequence

MGEFGSPSTNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPSKKKKALVEYKTLVTEDGSTPLKEHVDALSLRPLPPDTANKDFEECDIVDAADKDGWWTGVVCKALEGGSYSVLFKNPMHVMDFQRNHLRLHQDWVDGKWVVPQKMDASILRDQLSIISEDANVPENVQHESLKNTETNIEKKNSYSINSRNDLMEKPSIHDESSASFALTSSKRRRSLSSKSRVSNPLKKLGEGNVPGIPAADGSRMMESKTSRGKAFSKSATPNRDRRRRSYLNFHCDDDNVSPNKFESPKGGKKLRTKEDVDGSDELKEQTLSLINGSRGNTLKRSQRTLVTDKEGKEDYDVLETIPKEVTTSNKSERNKHLASDEKQTPVNNSLHLPDAVGDGEENSNNQTKEKGMEPEQQEATENSDKRKRGRPRKIMQEPGQQQASKNSYKRKRGRPRKLMIVPTTAEDMEQDGSGWKPEKATVKSCVTDLNRRNGKELSANKTNGTGTNSVDDDDRPLLMWLGGMQGSASNNSLKLGQTYGSKRRTKGSEQVDAVNKVRTVDGTPENEVVKNQGWPFVKNSPVWSAIDSLGVFKQIPQKPHFHPLSTYKEECREGLAIGCMVTFASLVEKITKLQFDYPRYIFESTLASLYDLEQHGFNISMLCNRVNELLFIKDTEMRCIEETKVAENKILEHTKNKTKLAEESKAIEQKIIELQRRQSSIKLEIETNDNEIEALQSHVETIRECTMSTKLHFENQIALPLWPL
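
The relationship between the coding sequence and the protein sequence listed above can be sketched in a few lines of Python. This is only an illustration: it assumes the standard nuclear genetic code, the function name translate is hypothetical, and only the first 24 nucleotides of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS shown on this page.
print(translate("ATGGGGGAATTCGGCTCCCCTTCA"))  # MGEFGSPS
```

The printed peptide matches the first eight residues of the protein sequence; the final codons of the CDS (CCA CTT TGA) likewise give the terminal residues P and L followed by a stop.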
Homology
BLAST of HG10023039 vs. NCBI nr
Match: XP_038899475.1 (DUF724 domain-containing protein 3-like isoform X2 [Benincasa hispida])

HSP 1 Score: 1319.3 bits (3413), Expect = 0.0e+00
Identity = 681/781 (87.20%), Positives = 720/781 (92.19%), Query Frame = 0

Query: 1   MGEFGSPSTNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS 60
           MGEFGSP TNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALF+ATILKLPTTF PS
Sbjct: 1   MGEFGSPPTNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALFRATILKLPTTFPPS 60

Query: 61  KKKKALVEYKTLVTEDGSTPLKEHVDALSLRPLPPDTANKDFEECDIVDAADKDGWWTGV 120
           KKKKALVEY+TLVTEDGSTPLKEHVDALSLRPLPPDTA+KDF+ECDIVDAADKDGWWTGV
Sbjct: 61  KKKKALVEYQTLVTEDGSTPLKEHVDALSLRPLPPDTADKDFQECDIVDAADKDGWWTGV 120

Query: 121 VCKALEGGSYSVLFKNPMHVMDFQRNHLRLHQDWVDGKWVVPQKMDASILRDQLSIISED 180
           VCK LEGG YSVLFKNPMHVMDFQRNHLRLHQDWV G WVVPQKMDASILRDQLSIISED
Sbjct: 121 VCKVLEGGGYSVLFKNPMHVMDFQRNHLRLHQDWVGGNWVVPQKMDASILRDQLSIISED 180

Query: 181 ANVPENVQHESLKNTETNIEKKNSYSINSRNDLMEKPSIHDESSASFALTSSKRRRSLSS 240
           ANVPENVQ ESLK TET   K+NSY++NSRND+MEKP ++DESSASFALTSSKRRRSLSS
Sbjct: 181 ANVPENVQRESLKGTETINGKENSYTVNSRNDVMEKPGVYDESSASFALTSSKRRRSLSS 240

Query: 241 KSRVSNPLKKLGEGNVPGIPAADGSRMMESKTSRGKAFSKSATPNRDRRRRSYLNFHCDD 300
           KSRV NPLKKL EG V G P ADGSRM+ESKT RGKAF+KSATPNRDRRRRSYLNFH DD
Sbjct: 241 KSRVLNPLKKLREGIVLGTPVADGSRMIESKTLRGKAFNKSATPNRDRRRRSYLNFHSDD 300

Query: 301 DNVSPNKFESPKGGKKLRTKEDVDGSDELKEQTLSLINGSRGNTLKRSQRTLVTDKEGKE 360
           D+ SPN F SP+GGKK RTKEDVDGSD+LKEQ LS ING+ GN  KRSQ+T VTDKEGKE
Sbjct: 301 DSASPNSFGSPRGGKKPRTKEDVDGSDKLKEQGLSSINGNGGNKCKRSQQTQVTDKEGKE 360

Query: 361 DYDVLETIPKEVTTSNKSERNKHLASDEKQTPVNNSLHLPDAVGDGEENSNNQTKEKGME 420
           DYDVLE IPKEVTTSN+SERN+H+ASD KQTPV NSLH+P  VGDGEE+SNNQ  EKG+E
Sbjct: 361 DYDVLEIIPKEVTTSNESERNRHVASDGKQTPVKNSLHIPKEVGDGEEDSNNQATEKGVE 420

Query: 421 PEQQEATENSDK--------RKRGRPRKIMQEPGQQQASKNSYKRKRGRPRKLMIVPTTA 480
           PEQQEATENSDK        RKRGRPRKIMQE GQQQASKNSYKRKRGRPRKLMIVPTTA
Sbjct: 421 PEQQEATENSDKEATENSDRRKRGRPRKIMQESGQQQASKNSYKRKRGRPRKLMIVPTTA 480

Query: 481 EDMEQDGSGWKPEKATVKSCVTDLNRRNGKELSANKTNGTGTNSVDDDDRPLLMWLGGMQ 540
           ED+EQDGSGWKPEKAT+KSCVTDLNRRNG  +SA KTNGTGTNSVDDDDRPLL+WLGGMQ
Sbjct: 481 EDVEQDGSGWKPEKATIKSCVTDLNRRNGNNVSAFKTNGTGTNSVDDDDRPLLLWLGGMQ 540

Query: 541 GSASNNSLKLGQTYG--SKRRTKGSEQVDAVNKVRTVDGTPENEVVKNQGWPFVKNSPVW 600
           GSASNNSLKLGQT G  SKRRT GSEQVD VN++RTVDG PENEVVKNQGWPFVKNSPVW
Sbjct: 541 GSASNNSLKLGQTSGSTSKRRTNGSEQVDGVNEMRTVDGAPENEVVKNQGWPFVKNSPVW 600

Query: 601 SAIDSLGVFKQIPQKPHFHPLSTYKEECREGLAIGCMVTFASLVEKITKLQFDYPRYIFE 660
           SAIDSL VFKQIPQKPHFHPL+TYKEECREGLAIGCMVTFASLVEKITKLQF YPR++FE
Sbjct: 601 SAIDSLEVFKQIPQKPHFHPLNTYKEECREGLAIGCMVTFASLVEKITKLQFSYPRHVFE 660

Query: 661 STLASLYDLEQHGFNISMLCNRVNELLFIKDTEMRCIEETKVAENKILEHTKNKTKLAEE 720
           STLASLYDLEQHGF+ISMLCNRVNELLFIKDTEMR IEETKVAENKILEHT+NK+KLAEE
Sbjct: 661 STLASLYDLEQHGFDISMLCNRVNELLFIKDTEMRYIEETKVAENKILEHTENKSKLAEE 720

Query: 721 SKAIEQKIIELQRRQSSIKLEIETNDNEIEALQSHVETIRECTMSTKLHFENQIALPLWP 772
           SKAIEQKI ELQRRQSSIKLEIET +NEI+ALQSHVETIRECTM+TKLHFENQIALPL P
Sbjct: 721 SKAIEQKITELQRRQSSIKLEIETKENEIQALQSHVETIRECTMNTKLHFENQIALPLCP 780
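
The percentages in the HSP header follow directly from the reported counts: each is the number of matching alignment columns divided by the alignment length. A minimal Python sketch, using the counts reported for this HSP (the function name percent is hypothetical):

```python
def percent(matches, alignment_length):
    """Return matches / alignment_length as a percentage with two decimals."""
    return f"{100.0 * matches / alignment_length:.2f}"


# Counts reported for the hit against XP_038899475.1.
identity = percent(681, 781)
positives = percent(720, 781)
print(identity, positives)  # 87.20 92.19
```

Here the alignment length (781) counts alignment columns including gaps, which is why it can exceed the length of the query protein.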

BLAST of HG10023039 vs. NCBI nr
Match: XP_038899474.1 (DUF724 domain-containing protein 3-like isoform X1 [Benincasa hispida])

HSP 1 Score: 1307.0 bits (3381), Expect = 0.0e+00
Identity = 681/802 (84.91%), Positives = 720/802 (89.78%), Query Frame = 0

Query: 1   MGEFGSPSTNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS 60
           MGEFGSP TNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALF+ATILKLPTTF PS
Sbjct: 1   MGEFGSPPTNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALFRATILKLPTTFPPS 60

Query: 61  KKKKALVEYKTLVTEDGSTPLKEHVDALSLRPLPPDTANKDFEECDIVDAADKDGWWTGV 120
           KKKKALVEY+TLVTEDGSTPLKEHVDALSLRPLPPDTA+KDF+ECDIVDAADKDGWWTGV
Sbjct: 61  KKKKALVEYQTLVTEDGSTPLKEHVDALSLRPLPPDTADKDFQECDIVDAADKDGWWTGV 120

Query: 121 VCKALEGGSYSVLFKNPMHVMDFQRNHLRLHQDWVDGKWVVPQKMDASILRDQLSIISED 180
           VCK LEGG YSVLFKNPMHVMDFQRNHLRLHQDWV G WVVPQKMDASILRDQLSIISED
Sbjct: 121 VCKVLEGGGYSVLFKNPMHVMDFQRNHLRLHQDWVGGNWVVPQKMDASILRDQLSIISED 180

Query: 181 ANVPENVQHESLKNTETNIEKKNSYSINSRNDLMEKPSIHDESSASFALTSSKRRRSLSS 240
           ANVPENVQ ESLK TET   K+NSY++NSRND+MEKP ++DESSASFALTSSKRRRSLSS
Sbjct: 181 ANVPENVQRESLKGTETINGKENSYTVNSRNDVMEKPGVYDESSASFALTSSKRRRSLSS 240

Query: 241 KSRVSNPLKKLGEGNVPGIPAADGSRMMESKTSRGKAFSKSATPNRDRRRRSYLNFHCDD 300
           KSRV NPLKKL EG V G P ADGSRM+ESKT RGKAF+KSATPNRDRRRRSYLNFH DD
Sbjct: 241 KSRVLNPLKKLREGIVLGTPVADGSRMIESKTLRGKAFNKSATPNRDRRRRSYLNFHSDD 300

Query: 301 DNVSPNKFESPKGGKKLRTKEDVDGSDELKEQTLSLINGSRGNTLKRSQRTLVTDKEGKE 360
           D+ SPN F SP+GGKK RTKEDVDGSD+LKEQ LS ING+ GN  KRSQ+T VTDKEGKE
Sbjct: 301 DSASPNSFGSPRGGKKPRTKEDVDGSDKLKEQGLSSINGNGGNKCKRSQQTQVTDKEGKE 360

Query: 361 DYDVLETIPKEVTTSNKSERNKHLASDEKQTPVNNSLHLPDAVGDGEENSNNQTKEKGME 420
           DYDVLE IPKEVTTSN+SERN+H+ASD KQTPV NSLH+P  VGDGEE+SNNQ  EKG+E
Sbjct: 361 DYDVLEIIPKEVTTSNESERNRHVASDGKQTPVKNSLHIPKEVGDGEEDSNNQATEKGVE 420

Query: 421 PEQQEATENSDK--------RKRGRPRKIMQEPGQQQASKNSYKRKRGRPRKLMIVPTTA 480
           PEQQEATENSDK        RKRGRPRKIMQE GQQQASKNSYKRKRGRPRKLMIVPTTA
Sbjct: 421 PEQQEATENSDKEATENSDRRKRGRPRKIMQESGQQQASKNSYKRKRGRPRKLMIVPTTA 480

Query: 481 EDMEQDGSGWKPEKATVKSCVT---------------------DLNRRNGKELSANKTNG 540
           ED+EQDGSGWKPEKAT+KSCVT                     DLNRRNG  +SA KTNG
Sbjct: 481 EDVEQDGSGWKPEKATIKSCVTAKRTKRKKGFRKMPMYFVDFQDLNRRNGNNVSAFKTNG 540

Query: 541 TGTNSVDDDDRPLLMWLGGMQGSASNNSLKLGQTYG--SKRRTKGSEQVDAVNKVRTVDG 600
           TGTNSVDDDDRPLL+WLGGMQGSASNNSLKLGQT G  SKRRT GSEQVD VN++RTVDG
Sbjct: 541 TGTNSVDDDDRPLLLWLGGMQGSASNNSLKLGQTSGSTSKRRTNGSEQVDGVNEMRTVDG 600

Query: 601 TPENEVVKNQGWPFVKNSPVWSAIDSLGVFKQIPQKPHFHPLSTYKEECREGLAIGCMVT 660
            PENEVVKNQGWPFVKNSPVWSAIDSL VFKQIPQKPHFHPL+TYKEECREGLAIGCMVT
Sbjct: 601 APENEVVKNQGWPFVKNSPVWSAIDSLEVFKQIPQKPHFHPLNTYKEECREGLAIGCMVT 660

Query: 661 FASLVEKITKLQFDYPRYIFESTLASLYDLEQHGFNISMLCNRVNELLFIKDTEMRCIEE 720
           FASLVEKITKLQF YPR++FESTLASLYDLEQHGF+ISMLCNRVNELLFIKDTEMR IEE
Sbjct: 661 FASLVEKITKLQFSYPRHVFESTLASLYDLEQHGFDISMLCNRVNELLFIKDTEMRYIEE 720

Query: 721 TKVAENKILEHTKNKTKLAEESKAIEQKIIELQRRQSSIKLEIETNDNEIEALQSHVETI 772
           TKVAENKILEHT+NK+KLAEESKAIEQKI ELQRRQSSIKLEIET +NEI+ALQSHVETI
Sbjct: 721 TKVAENKILEHTENKSKLAEESKAIEQKITELQRRQSSIKLEIETKENEIQALQSHVETI 780

BLAST of HG10023039 vs. NCBI nr
Match: XP_038899477.1 (DUF724 domain-containing protein 3-like isoform X4 [Benincasa hispida])

HSP 1 Score: 1295.4 bits (3351), Expect = 0.0e+00
Identity = 673/781 (86.17%), Positives = 711/781 (91.04%), Query Frame = 0

Query: 1   MGEFGSPSTNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS 60
           MGEFGSP TNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALF+ATILKLPTTF PS
Sbjct: 1   MGEFGSPPTNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALFRATILKLPTTFPPS 60

Query: 61  KKKKALVEYKTLVTEDGSTPLKEHVDALSLRPLPPDTANKDFEECDIVDAADKDGWWTGV 120
           KKKKALVEY+TLVTEDGSTPLKEHVDALSLRPLPPDTA+KDF+ECDIVDAADKDGWWTGV
Sbjct: 61  KKKKALVEYQTLVTEDGSTPLKEHVDALSLRPLPPDTADKDFQECDIVDAADKDGWWTGV 120

Query: 121 VCKALEGGSYSVLFKNPMHVMDFQRNHLRLHQDWVDGKWVVPQKMDASILRDQLSIISED 180
           VCK LEGG YSVLFKNPMHVMDFQRNHLRLHQDWV G WVVPQKMDASILRDQLSIISED
Sbjct: 121 VCKVLEGGGYSVLFKNPMHVMDFQRNHLRLHQDWVGGNWVVPQKMDASILRDQLSIISED 180

Query: 181 ANVPENVQHESLKNTETNIEKKNSYSINSRNDLMEKPSIHDESSASFALTSSKRRRSLSS 240
           ANVPENVQ ESLK TET   K+NSY++NSRND+MEKP ++DESSASFALTSSKRRRSLSS
Sbjct: 181 ANVPENVQRESLKGTETINGKENSYTVNSRNDVMEKPGVYDESSASFALTSSKRRRSLSS 240

Query: 241 KSRVSNPLKKLGEGNVPGIPAADGSRMMESKTSRGKAFSKSATPNRDRRRRSYLNFHCDD 300
           KSRV NPLKKL EG V G P ADGSRM+ESKT RGKAF+KSATPNRDRRRRSYLNFH DD
Sbjct: 241 KSRVLNPLKKLREGIVLGTPVADGSRMIESKTLRGKAFNKSATPNRDRRRRSYLNFHSDD 300

Query: 301 DNVSPNKFESPKGGKKLRTKEDVDGSDELKEQTLSLINGSRGNTLKRSQRTLVTDKEGKE 360
           D+ SPN F SP+GGKK RTKEDVDGSD+LKEQ LS ING+ GN  KRSQ+T VTDKEGKE
Sbjct: 301 DSASPNSFGSPRGGKKPRTKEDVDGSDKLKEQGLSSINGNGGNKCKRSQQTQVTDKEGKE 360

Query: 361 DYDVLETIPKEVTTSNKSERNKHLASDEKQTPVNNSLHLPDAVGDGEENSNNQTKEKGME 420
           DYDVLE IPKEVTTSN+SERN+H+ASD KQTPV NSLH+P  VGDGEE+SNNQ  EKG+E
Sbjct: 361 DYDVLEIIPKEVTTSNESERNRHVASDGKQTPVKNSLHIPKEVGDGEEDSNNQATEKGVE 420

Query: 421 PEQQEATENSDK--------RKRGRPRKIMQEPGQQQASKNSYKRKRGRPRKLMIVPTTA 480
           PEQQEATENSDK        RKRGRPRKIMQE GQQQASKNSYKRKRGRPRKLMIVPTTA
Sbjct: 421 PEQQEATENSDKEATENSDRRKRGRPRKIMQESGQQQASKNSYKRKRGRPRKLMIVPTTA 480

Query: 481 EDMEQDGSGWKPEKATVKSCVTDLNRRNGKELSANKTNGTGTNSVDDDDRPLLMWLGGMQ 540
           ED+EQDGSGWKPEKAT+KSCVT           A KTNGTGTNSVDDDDRPLL+WLGGMQ
Sbjct: 481 EDVEQDGSGWKPEKATIKSCVT-----------AFKTNGTGTNSVDDDDRPLLLWLGGMQ 540

Query: 541 GSASNNSLKLGQTYG--SKRRTKGSEQVDAVNKVRTVDGTPENEVVKNQGWPFVKNSPVW 600
           GSASNNSLKLGQT G  SKRRT GSEQVD VN++RTVDG PENEVVKNQGWPFVKNSPVW
Sbjct: 541 GSASNNSLKLGQTSGSTSKRRTNGSEQVDGVNEMRTVDGAPENEVVKNQGWPFVKNSPVW 600

Query: 601 SAIDSLGVFKQIPQKPHFHPLSTYKEECREGLAIGCMVTFASLVEKITKLQFDYPRYIFE 660
           SAIDSL VFKQIPQKPHFHPL+TYKEECREGLAIGCMVTFASLVEKITKLQF YPR++FE
Sbjct: 601 SAIDSLEVFKQIPQKPHFHPLNTYKEECREGLAIGCMVTFASLVEKITKLQFSYPRHVFE 660

Query: 661 STLASLYDLEQHGFNISMLCNRVNELLFIKDTEMRCIEETKVAENKILEHTKNKTKLAEE 720
           STLASLYDLEQHGF+ISMLCNRVNELLFIKDTEMR IEETKVAENKILEHT+NK+KLAEE
Sbjct: 661 STLASLYDLEQHGFDISMLCNRVNELLFIKDTEMRYIEETKVAENKILEHTENKSKLAEE 720

Query: 721 SKAIEQKIIELQRRQSSIKLEIETNDNEIEALQSHVETIRECTMSTKLHFENQIALPLWP 772
           SKAIEQKI ELQRRQSSIKLEIET +NEI+ALQSHVETIRECTM+TKLHFENQIALPL P
Sbjct: 721 SKAIEQKITELQRRQSSIKLEIETKENEIQALQSHVETIRECTMNTKLHFENQIALPLCP 770

BLAST of HG10023039 vs. NCBI nr
Match: XP_038899476.1 (DUF724 domain-containing protein 7-like isoform X3 [Benincasa hispida])

HSP 1 Score: 1265.0 bits (3272), Expect = 0.0e+00
Identity = 665/802 (82.92%), Positives = 700/802 (87.28%), Query Frame = 0

Query: 1   MGEFGSPSTNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS 60
           MGEFGSP TNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALF+ATILKLPTTF PS
Sbjct: 1   MGEFGSPPTNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALFRATILKLPTTFPPS 60

Query: 61  KKKKALVEYKTLVTEDGSTPLKEHVDALSLRPLPPDTANKDFEECDIVDAADKDGWWTGV 120
           KKKKALVEY+TLVTEDGSTPLKEHVDALSLRPLPPDTA+KDF+ECDIVDAADKDGWWTGV
Sbjct: 61  KKKKALVEYQTLVTEDGSTPLKEHVDALSLRPLPPDTADKDFQECDIVDAADKDGWWTGV 120

Query: 121 VCKALEGGSYSVLFKNPMHVMDFQRNHLRLHQDWVDGKWVVPQKMDASILRDQLSIISED 180
           VCK LEGG YSVLFKNPMHVMDFQRNHLRLHQDWV G WVVPQKMDASILRDQLSIISED
Sbjct: 121 VCKVLEGGGYSVLFKNPMHVMDFQRNHLRLHQDWVGGNWVVPQKMDASILRDQLSIISED 180

Query: 181 ANVPENVQHESLKNTETNIEKKNSYSINSRNDLMEKPSIHDESSASFALTSSKRRRSLSS 240
           ANVPENVQ ESLK                         ++DESSASFALTSSKRRRSLSS
Sbjct: 181 ANVPENVQRESLK------------------------GVYDESSASFALTSSKRRRSLSS 240

Query: 241 KSRVSNPLKKLGEGNVPGIPAADGSRMMESKTSRGKAFSKSATPNRDRRRRSYLNFHCDD 300
           KSRV NPLKKL EG V G P ADGSRM+ESKT RGKAF+KSATPNRDRRRRSYLNFH DD
Sbjct: 241 KSRVLNPLKKLREGIVLGTPVADGSRMIESKTLRGKAFNKSATPNRDRRRRSYLNFHSDD 300

Query: 301 DNVSPNKFESPKGGKKLRTKEDVDGSDELKEQTLSLINGSRGNTLKRSQRTLVTDKEGKE 360
           D+ SPN F SP+GGKK RTKEDVDGSD+LKEQ LS ING+ GN  KRSQ+T VTDKEGKE
Sbjct: 301 DSASPNSFGSPRGGKKPRTKEDVDGSDKLKEQGLSSINGNGGNKCKRSQQTQVTDKEGKE 360

Query: 361 DYDVLETIPKEVTTSNKSERNKHLASDEKQTPVNNSLHLPDAVGDGEENSNNQTKEKGME 420
           DYDVLE IPKEVTTSN+SERN+H+ASD KQTPV NSLH+P  VGDGEE+SNNQ  EKG+E
Sbjct: 361 DYDVLEIIPKEVTTSNESERNRHVASDGKQTPVKNSLHIPKEVGDGEEDSNNQATEKGVE 420

Query: 421 PEQQEATENSDK--------RKRGRPRKIMQEPGQQQASKNSYKRKRGRPRKLMIVPTTA 480
           PEQQEATENSDK        RKRGRPRKIMQE GQQQASKNSYKRKRGRPRKLMIVPTTA
Sbjct: 421 PEQQEATENSDKEATENSDRRKRGRPRKIMQESGQQQASKNSYKRKRGRPRKLMIVPTTA 480

Query: 481 EDMEQDGSGWKPEKATVKSCVT---------------------DLNRRNGKELSANKTNG 540
           ED+EQDGSGWKPEKAT+KSCVT                     DLNRRNG  +SA KTNG
Sbjct: 481 EDVEQDGSGWKPEKATIKSCVTAKRTKRKKGFRKMPMYFVDFQDLNRRNGNNVSAFKTNG 540

Query: 541 TGTNSVDDDDRPLLMWLGGMQGSASNNSLKLGQTYG--SKRRTKGSEQVDAVNKVRTVDG 600
           TGTNSVDDDDRPLL+WLGGMQGSASNNSLKLGQT G  SKRRT GSEQVD VN++RTVDG
Sbjct: 541 TGTNSVDDDDRPLLLWLGGMQGSASNNSLKLGQTSGSTSKRRTNGSEQVDGVNEMRTVDG 600

Query: 601 TPENEVVKNQGWPFVKNSPVWSAIDSLGVFKQIPQKPHFHPLSTYKEECREGLAIGCMVT 660
            PENEVVKNQGWPFVKNSPVWSAIDSL VFKQIPQKPHFHPL+TYKEECREGLAIGCMVT
Sbjct: 601 APENEVVKNQGWPFVKNSPVWSAIDSLEVFKQIPQKPHFHPLNTYKEECREGLAIGCMVT 660

Query: 661 FASLVEKITKLQFDYPRYIFESTLASLYDLEQHGFNISMLCNRVNELLFIKDTEMRCIEE 720
           FASLVEKITKLQF YPR++FESTLASLYDLEQHGF+ISMLCNRVNELLFIKDTEMR IEE
Sbjct: 661 FASLVEKITKLQFSYPRHVFESTLASLYDLEQHGFDISMLCNRVNELLFIKDTEMRYIEE 720

Query: 721 TKVAENKILEHTKNKTKLAEESKAIEQKIIELQRRQSSIKLEIETNDNEIEALQSHVETI 772
           TKVAENKILEHT+NK+KLAEESKAIEQKI ELQRRQSSIKLEIET +NEI+ALQSHVETI
Sbjct: 721 TKVAENKILEHTENKSKLAEESKAIEQKITELQRRQSSIKLEIETKENEIQALQSHVETI 778

BLAST of HG10023039 vs. NCBI nr
Match: XP_038899478.1 (DUF724 domain-containing protein 3-like isoform X5 [Benincasa hispida])

HSP 1 Score: 1260.4 bits (3260), Expect = 0.0e+00
Identity = 657/794 (82.75%), Positives = 696/794 (87.66%), Query Frame = 0

Query: 1   MGEFGSPSTNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS 60
           MGEFGSP TNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALF+ATILKLPTTF PS
Sbjct: 1   MGEFGSPPTNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALFRATILKLPTTFPPS 60

Query: 61  KKKKALVEYKTLVTEDGSTPLKEHVDALSLRPLPPDTANKDFEECDIVDAADKDGWWTGV 120
           KKKKALVEY+TLVTEDGSTPLKEHVDALSLRPLPPDTA+KDF+ECDIVDAADKDGWWTGV
Sbjct: 61  KKKKALVEYQTLVTEDGSTPLKEHVDALSLRPLPPDTADKDFQECDIVDAADKDGWWTGV 120

Query: 121 VCKALEGGSYSVLFKNPMHVMDFQRNHLRLHQDWVDGKWVVPQKMDASILRDQLSIISED 180
           VCK LEGG YSVLFKNPMHVMDFQRNHLRLHQDWV G WVVPQKMDASILRDQLSIISED
Sbjct: 121 VCKVLEGGGYSVLFKNPMHVMDFQRNHLRLHQDWVGGNWVVPQKMDASILRDQLSIISED 180

Query: 181 ANVPENVQHESLKNTETNIEKKNSYSINSRNDLMEKPSIHDESSASFALTSSKRRRSLSS 240
           ANVPENVQ ESLK TET   K+NSY++NSRND+MEKP ++DESSASFALTSSKRRRSLSS
Sbjct: 181 ANVPENVQRESLKGTETINGKENSYTVNSRNDVMEKPGVYDESSASFALTSSKRRRSLSS 240

Query: 241 KSRVSNPLKKLGEGNVPGIPAADGSRMMESKTSRGKAFSKSATPNRDRRRRSYLNFHCDD 300
           KSRV NPLKKL EG V G P ADGSRM+ESKT RGKAF+KSATPNRDRRRRSYLNFH DD
Sbjct: 241 KSRVLNPLKKLREGIVLGTPVADGSRMIESKTLRGKAFNKSATPNRDRRRRSYLNFHSDD 300

Query: 301 DNVSPNKFESPKGGKKLRTKEDVDGSDELKEQTLSLINGSRGNTLKRSQRTLVTDKEGKE 360
           D+ SPN F SP+GGKK RTKEDVDGSD+LKEQ LS ING+ GN  KRSQ+T VTDKEGKE
Sbjct: 301 DSASPNSFGSPRGGKKPRTKEDVDGSDKLKEQGLSSINGNGGNKCKRSQQTQVTDKEGKE 360

Query: 361 DYDVLETIPKEVTTSNKSERNKHLASDEKQTPVNNSLHLPDAVGDGEENSNNQTKEKGME 420
           DYDVLE IPKEVTTSN+SERN+H+ASD KQTPV NSLH+P  VGDGEE+SNNQ  EKG+ 
Sbjct: 361 DYDVLEIIPKEVTTSNESERNRHVASDGKQTPVKNSLHIPKEVGDGEEDSNNQATEKGV- 420

Query: 421 PEQQEATENSDKRKRGRPRKIMQEPGQQQASKNSYKRKRGRPRKLMIVPTTAEDMEQDGS 480
                                  E GQQQASKNSYKRKRGRPRKLMIVPTTAED+EQDGS
Sbjct: 421 -----------------------ESGQQQASKNSYKRKRGRPRKLMIVPTTAEDVEQDGS 480

Query: 481 GWKPEKATVKSCVT---------------------DLNRRNGKELSANKTNGTGTNSVDD 540
           GWKPEKAT+KSCVT                     DLNRRNG  +SA KTNGTGTNSVDD
Sbjct: 481 GWKPEKATIKSCVTAKRTKRKKGFRKMPMYFVDFQDLNRRNGNNVSAFKTNGTGTNSVDD 540

Query: 541 DDRPLLMWLGGMQGSASNNSLKLGQTYG--SKRRTKGSEQVDAVNKVRTVDGTPENEVVK 600
           DDRPLL+WLGGMQGSASNNSLKLGQT G  SKRRT GSEQVD VN++RTVDG PENEVVK
Sbjct: 541 DDRPLLLWLGGMQGSASNNSLKLGQTSGSTSKRRTNGSEQVDGVNEMRTVDGAPENEVVK 600

Query: 601 NQGWPFVKNSPVWSAIDSLGVFKQIPQKPHFHPLSTYKEECREGLAIGCMVTFASLVEKI 660
           NQGWPFVKNSPVWSAIDSL VFKQIPQKPHFHPL+TYKEECREGLAIGCMVTFASLVEKI
Sbjct: 601 NQGWPFVKNSPVWSAIDSLEVFKQIPQKPHFHPLNTYKEECREGLAIGCMVTFASLVEKI 660

Query: 661 TKLQFDYPRYIFESTLASLYDLEQHGFNISMLCNRVNELLFIKDTEMRCIEETKVAENKI 720
           TKLQF YPR++FESTLASLYDLEQHGF+ISMLCNRVNELLFIKDTEMR IEETKVAENKI
Sbjct: 661 TKLQFSYPRHVFESTLASLYDLEQHGFDISMLCNRVNELLFIKDTEMRYIEETKVAENKI 720

Query: 721 LEHTKNKTKLAEESKAIEQKIIELQRRQSSIKLEIETNDNEIEALQSHVETIRECTMSTK 772
           LEHT+NK+KLAEESKAIEQKI ELQRRQSSIKLEIET +NEI+ALQSHVETIRECTM+TK
Sbjct: 721 LEHTENKSKLAEESKAIEQKITELQRRQSSIKLEIETKENEIQALQSHVETIRECTMNTK 770

BLAST of HG10023039 vs. ExPASy Swiss-Prot
Match: Q9FZD9 (DUF724 domain-containing protein 3 OS=Arabidopsis thaliana OX=3702 GN=DUF3 PE=1 SV=1)

HSP 1 Score: 181.8 bits (460), Expect = 2.9e-44
Identity = 204/752 (27.13%), Positives = 330/752 (43.88%), Query Frame = 0

Query: 32  IEVSIDEEGFKGALFKATILKLPTTFSPSKKKKALVEYKTLVTEDGSTPLKEHVDALSLR 91
           +EVS +EEGF+GA F+A + + P     S ++K  V Y TL+  DGS+PL EH++   +R
Sbjct: 14  VEVSSEEEGFEGAWFRAVLEENP---GNSSRRKLRVRYSTLLDMDGSSPLIEHIEQRFIR 73

Query: 92  PLPP-DTANKD--FEECDIVDAADKDGWWTGVVCKALEGGSYSVLFKNPMHVMDFQRNHL 151
           P+PP +   KD   EE  +VDA  KDGWWTGVV K +E  +Y V F  P  ++ F+R  L
Sbjct: 74  PVPPEENQQKDVVLEEGLLVDADHKDGWWTGVVVKKMEDDNYLVYFDLPPDIIQFERKQL 133

Query: 152 RLHQDWVDGKWVVPQKMDASILRDQLSIISEDANVPENVQHESLKNTETNIEKKNSYSIN 211
           R H  W  G W+ P+  +++        + E  +  E V   ++   ET+++ K  + + 
Sbjct: 134 RTHLIWTGGTWIQPEIEESNKSMFSPGTMVEVFSAKEAVWSPAMVVKETDVDDKKKFIVK 193

Query: 212 SRNDLMEKPSIHDESSASFALTSSKRRRSLSSKSRVSNPLKKLGEGNVPGIPAADGSRMM 271
             N  +   S + + +    + +S+R R +   S V             G+    G ++ 
Sbjct: 194 DCNRYL---SCNGDEARPTNIVNSRRVRPIPPPSSVDKYALLESVETFSGLGWHKG-QVR 253

Query: 272 ESKTSRGKAFSKSATPNRDRRRRSYLN-FHCDDDNVSPNKFESPKGGKKLRTKEDVDGSD 331
           +  +         AT      R S L  F   +D V  N        K+   KE      
Sbjct: 254 KILSENRYTVRLEATQQESTIRHSDLRPFMVWEDGVWYNDL------KQKPIKE--TPPT 313

Query: 332 ELKEQTLSLINGSRGNTLKRSQRTLVTDKEGKEDYDVLETIPKEVTTSNKSERNK----- 391
            LK + +   + ++  T   + + L +    KE  +   T  K V+ + +  +NK     
Sbjct: 314 ILKRKPMRSCSAAKSMTPTSATKHLRSFLNSKEISET-PTKAKFVSATRELGKNKADAVM 373

Query: 392 ----HLASDEKQTPVNNSLHLPDAVGDGEENSNNQTKEKGMEP-EQQEATENSDKRKRGR 451
               HL    ++T +   + +        E    ++ +K  EP + Q   ENS  +    
Sbjct: 374 NDKTHLLITPQETSIAPVITVTPLKQQDAETEGKKSPKKTPEPVKHQNGLENSSTQH--- 433

Query: 452 PRKIMQEPGQQQASKNSYKRKRGRPRKLMIVPTTAEDMEQDGSGWKPEKATVKSCVTDLN 511
                + P ++ +++ S KRKR                EQ+ +    E  T ++C     
Sbjct: 434 -----EMPEEENSNEKSRKRKR----------------EQNQNSNLNE--TDETC----- 493

Query: 512 RRNGKELSANKTNGTG-TNSVDD-DDRPLLMWLGGMQGSASNNSLKLGQTYGSKRRTKGS 571
                 +S    NGT  T  VDD DD+PL  W+      +S+ S               S
Sbjct: 494 -----NVSKAGVNGTSDTIRVDDVDDQPLSSWINIPTVLSSDQS---------------S 553

Query: 572 EQVDAVNKVRTVDGTPENEVVKNQGWPFVKNSPVWSAIDSLGVFKQIPQKPHFHPLSTYK 631
             VD  N    V+ T     +  +  PF KN P W   +    +K +PQ PHF PL  +K
Sbjct: 554 NVVD--NSAADVEETQAKGALTIE--PFTKNLPFWKTYEMEKGYKTVPQNPHFSPLLEFK 613

Query: 632 EECREGLAIGCMVTFASLVEKITKLQFDYPRYIFESTLASLYDLEQHGFNISMLCNRVNE 691
           E+ RE  A+G MV+F  L+E++ KLQ D       S      +LE+HGF+I+   +R+N+
Sbjct: 614 EDIREWSAVGMMVSFYGLLEEVKKLQLDVSSSKLGSLSTCFAELEKHGFDIATPQSRINK 673

Query: 692 LLFIKDTEMRCIEETKVAENKILEHTKNKTKLAEESKAIEQKIIELQRRQSSIKLEIETN 751
           +L ++    + +EE K  E +I        K   E   +E+K++EL+RR    K + E  
Sbjct: 674 VLSLQVGRAKKVEERKCLEKRIEAEEIEMQKFEHEMVEVERKMLELKRRAEVAKEKKEAA 694

Query: 752 DNEIEALQSHVETIRECTMSTKLHFENQIALP 768
           D  I  ++S  ETI +   + +L F   +  P
Sbjct: 734 DKMIVEMKSSAETIDQEIANVELEFITSVLAP 694

BLAST of HG10023039 vs. ExPASy Swiss-Prot
Match: O22897 (DUF724 domain-containing protein 6 OS=Arabidopsis thaliana OX=3702 GN=DUF6 PE=2 SV=1)

HSP 1 Score: 173.3 bits (438), Expect = 1.0e-41
Identity = 202/782 (25.83%), Positives = 320/782 (40.92%), Query Frame = 0

Query: 29  GSEIEVSIDEEGFKGALFKATILKLPTTFSPSKKKKALVEYKTLVTEDGSTPLKEHVDAL 88
           GSE+EVS  EEGF  A F+  + + PT    S +KK  V Y TL+ +D  +PL E+++  
Sbjct: 8   GSEVEVSSTEEGFADAWFRGILQENPT---KSGRKKLRVRYLTLLNDDALSPLIENIEPR 67

Query: 89  SLRPLPPDTANKD--FEECDIVDAADKDGWWTGVVCKALEGGSYSVLFKNPMHVMDFQRN 148
            +RP+PP+        EE  +VDA  KDGWWTGV+ K LE G + V + +P  +++F+RN
Sbjct: 68  FIRPVPPENEYNGIVLEEGTVVDADHKDGWWTGVIIKKLENGKFWVYYDSPPDIIEFERN 127

Query: 149 HLRLHQDWVDGKWVVP--QKMDASILRD-QLSIISEDANVPENVQHESLKNTETNIEKKN 208
            LR H  W   KW+ P  Q++D S+     ++ +S   +  E     ++   E  ++ + 
Sbjct: 128 QLRPHLRWSGWKWLRPDIQELDKSMFSSGTMAEVSTIVDKAEVAWFPAMIIKEIEVDGEK 187

Query: 209 SYSINSRNDLMEKPSIHDESSASFALTSSKRRRSLSSKSRVSNPLKKLGEGNVPGIPAAD 268
            + +   N  +      DE+  +  + SS+ R +                   P  P   
Sbjct: 188 KFIVKDCNKHLSFSG--DEARTNSTIDSSRVRPT------------------PPPFPVEK 247

Query: 269 GSRMMESKTSRGKAFSKSATPNRDRRRRSYLNFHC--------------DDDNVSPNK-- 328
              M   +  RG  + +          R  L+ +C                 ++ P K  
Sbjct: 248 YELMDRVEVFRGSVWRQGLV-------RGVLDHNCYMVCLVVTKEEPVVKHSDLRPCKVW 307

Query: 329 --------------FESPKGGKKLRTKEDVDGSDELKEQTLSLINGSRGNTLKRSQRTLV 388
                          E+P    K +      G+  +  +  +  +  R   L++S  TL 
Sbjct: 308 EDGVWQDGPKQTPVIETPSNVMKTKPMRSCSGAKSMTPKRTTK-HARRSLNLEKSAETLT 367

Query: 389 TDKEGKEDYDVLETIPKEVTTSNK----SERNKHLASDEKQTP--VNNSLHLPDAVGDGE 448
             +      ++      +V   N     + + K +AS E  TP  V  +  L     D +
Sbjct: 368 KAESRAATGELRSKRANDVINDNTPLVITPQVKPIASVEPVTPSRVRTATPLKQTKADTQ 427

Query: 449 ENSNNQTKEKGMEPEQQE-ATENSDKRKRGRPRKIMQEPGQQQASKNSYKRKRGRPRKLM 508
             S   + +K +EP + E   ENS +      +K+++E       KNS K+ R R R+  
Sbjct: 428 GKS---SPKKTLEPMRDENGLENSTR------QKVLEE-------KNSEKKGRKRKRQ-- 487

Query: 509 IVPTTAEDMEQDGSGWKPEKATVKSCVTDLNRRNGKELSANKTNGTGTNSVDDDDRPLLM 568
                 E+   D       K T +SC       NG+    N T+       D DD+PL  
Sbjct: 488 ------EEHNSD------LKETDESC-------NGQMAEINDTSSICN---DVDDQPLAA 547

Query: 569 WLGGMQGSASNNSLKLGQTYGSKRRTKGSEQVDAVNKVRTVDGTPENEVVKN-QGWPFVK 628
           W+                       T        VN         E +        PF K
Sbjct: 548 WI------------------NLPTETSIDHSPIVVNNAAIATDVEERQANDTLMILPFAK 607

Query: 629 NSPVWSAIDSLGVFKQIPQKPHFHPLSTYKEECREGLAIGCMVTFASLVEKITKLQFDYP 688
            SP W   ++  V K  PQ PHF PL   KEE RE  A+G MV+F  L+E++  LQ D  
Sbjct: 608 KSPFWKMYETQEVCKIAPQSPHFSPLFEAKEELREWTAVGMMVSFYGLLEEVKNLQLDVS 667

Query: 689 RYIFESTLASLYDLEQHGFNISMLCNRVNELLFIKDTEMRCIEETKVAENKILEHTKNKT 748
                S   S  +LE+HGF+++   +R+N++L ++D   +  EE K  E KI        
Sbjct: 668 PSTLGSLSCSFAELEKHGFDVAAPQSRINKMLSLQDERAKKAEERKGLEKKIEAGEIEGH 700

Query: 749 KLAEESKAIEQKIIELQRRQSSIKLEIETNDNEIEALQSHVETIRECTMSTKLHFENQIA 768
              EE   +E KI+EL+R+Q   K   E  D     ++S+ E I +     +L F++  +
Sbjct: 728 TYEEEMAELELKILELKRQQVVAKEMKEATDKVTSGMKSYAEMINQEIEDLRLEFQSTAS 700

BLAST of HG10023039 vs. ExPASy Swiss-Prot
Match: Q8H0V4 (DUF724 domain-containing protein 7 OS=Arabidopsis thaliana OX=3702 GN=DUF7 PE=1 SV=1)

HSP 1 Score: 152.9 bits (385), Expect = 1.5e-35
Identity = 214/805 (26.58%), Positives = 332/805 (41.24%), Query Frame = 0

Query: 20  RHHQLPFTVGSEIEVSIDE-EGFKGALFKATILKLPTTFSPSKKKKALVEY-KTLVTEDG 79
           R  +L  + GSEIE+S  E E   G ++   IL+     + SK+KK  V +   L+  D 
Sbjct: 7   RKEKLSVSKGSEIEISSQEYEYGSGNVWYCVILE--ENLAKSKRKKLSVRHLDPLLKYDY 66

Query: 80  STPLKEHVDALSLRPLPPDT--ANKDFEECDIVDAADKDGWWTGVVCKALEGGSYSVLFK 139
           S PL +      +RP+PP       DFEE D+VDAA K GW +G V K L    + V  +
Sbjct: 67  SPPLIKTTVHRFMRPVPPPDPFPEVDFEEGDVVDAAYKGGWCSGSVVKVLGNRRFLVYLR 126

Query: 140 NPMHVMDFQRNHLRLHQDWVDGKWVVPQK-------------MDASILRDQLS------- 199
               V++  R  LR H  W D +W   +K             ++     D+L        
Sbjct: 127 FQPDVIELLRKDLRPHFVWKDEEWFRCEKQQLIESDFSAGKSVEVRTKVDKLGDVWAPAM 186

Query: 200 IISEDANVPENVQHESLKNTETNIEKKN-SYSINSRNDLMEKPSIHDESSASFALTSSKR 259
           +I ED +    V+ ++LK  E N  K + SYS                      +  S  
Sbjct: 187 VIKEDEDGTMLVKLKTLKEEEVNCTKISVSYS---------------------EIRPSPL 246

Query: 260 RRSLSSKSRVSNPLKKLGEGNVPGIPAADGSRMMESKTSRGKAFSKSATPNRDRRRRSYL 319
              L     + N    +  G  PG+          SK   GK ++    PNR+ +  S L
Sbjct: 247 PIGLRDYKLMENVDALVESGWCPGV---------VSKVLAGKRYAVDLGPNRESKEFSRL 306

Query: 320 NFHCDDDNVSPNKFESPKGGKKLRTKEDVDGSDELKEQTLSLINGSRGNTLKRSQRTLVT 379
                         E   G      KE V GS+E           +R   ++ + RT + 
Sbjct: 307 QLR--------PSIEWKDG--IWHRKEKVSGSEESSHAVEETAASTR---IRITVRTALK 366

Query: 380 DKEG-----------KEDYDVLETIPKEVTTSNKSERNKHLASDEKQTPVNNSLHLPDAV 439
           +K+                 +   +P      + +E  + ++    +TP+  +  L   +
Sbjct: 367 EKKALGTGINVRTTRSSSGAMHNPLPASFNGGDVAEAGR-VSVTVNETPLFETAALSGEL 426

Query: 440 GDG-EENSNNQTKEKGMEPEQQEATE---------NSDKRKRGR--PRKIMQEPGQQQAS 499
           G+   +   N++     +PE     E          +  + +G+  P+K +Q    Q++S
Sbjct: 427 GNSLADVVMNESAPVTSQPEIAAPKEFHPSVVLGVAAAVKTQGKTTPKKKLQAMKNQKSS 486

Query: 500 KNS--------YKRKRGRPRKLMIVPTTAEDMEQDG-SGWKPEKATVKSCVTDLNRRNGK 559
            N          KRKRG+PRK ++    AE  ++ G SG   + AT++            
Sbjct: 487 TNDSVGEKVSVNKRKRGQPRKFIV----AEPKQKIGVSGNNSKAATIEHA---------- 546

Query: 560 ELSANKTNGTGTNSVDDDDRPLLMWLGGMQGSASNNSLKLGQTYGSKRRTKGSEQVDAVN 619
                         + DDDRPL  W   +    S++   + +T      T   + VD   
Sbjct: 547 -------------DMTDDDRPLASW---VHTGNSSSGQSVSRTPDIGLNTVVEKHVDI-- 606

Query: 620 KVRTVDGTPENEVVKNQGWPFVKNSPVWSAIDSLGVFKQIPQKPHFHPLSTYKEECREGL 679
            V T  G     V+     PFVK S +W  ++S+ VFK +PQ PHF PL   +EECREG 
Sbjct: 607 -VETPPGRESTMVL-----PFVKKSQLWKVLESMEVFKVVPQSPHFSPLLESEEECREGD 666

Query: 680 AIGCMVTFASLVEKITKLQFDYPRYIFESTLASLYDLEQHGFNISMLCNRVNELLFIKDT 739
           AIG MV F+SL+EK+  LQ D P             LE+HGFN++   +R+ ++L IK+ 
Sbjct: 667 AIGRMVMFSSLLEKVNNLQVDDPISSINRIDECFLKLEKHGFNVTTPRSRIAKILSIKER 720

Query: 740 EMRCIEETKVAENKILEHTKNKTKLAEESKAIEQKIIELQRRQSSIKLEIETNDNEIEAL 768
           +   +EE K  E KI E+   + K        E+ I+ELQR++  +K    T DNEI  +
Sbjct: 727 QTCALEELKAVEEKITENDNKRRK-------YEEDIVELQRQEVLMKEAKVTLDNEIARM 720

BLAST of HG10023039 vs. ExPASy Swiss-Prot
Match: Q9ZVT1 (DUF724 domain-containing protein 1 OS=Arabidopsis thaliana OX=3702 GN=DUF1 PE=2 SV=1)

HSP 1 Score: 143.3 bits (360), Expect = 1.2e-32
Identity = 192/758 (25.33%), Positives = 317/758 (41.82%), Query Frame = 0

Query: 31  EIEVSIDEEGFKGALFKATILKLPTT-FSPSKKKKALVEYKTLVTEDGSTPLKEHVDALS 90
           E+E+  +E+GF+ A ++A + + PT   S SKK +     K+L  E  S+P    V+   
Sbjct: 8   EVEIFSEEDGFRNAWYRAILEETPTNPTSESKKLRFSYMTKSLNKEGSSSP--PTVEQRF 67

Query: 91  LRPLPPDTANKD--FEECDIVDAADKDGWWTGVVCKALEGGSYSVLFKNPMHVMDFQRNH 150
           +RP+PP+       FEE  +VDA  K  W TGVV   +E  SY VLF  P  ++ F+  H
Sbjct: 68  IRPVPPENLYNGVVFEEGTMVDADYKHRWRTGVVINKMENDSYLVLFDCPPDIIQFETKH 127

Query: 151 LRLHQDWVDGKWVVPQKMDASILRDQLSIISEDANVPENVQHE-----SLKNTETNIEKK 210
           LR H DW   +WV P+  + S        + E + V + V+        +K  E + EKK
Sbjct: 128 LRAHLDWTGSEWVQPEVRELSKSMFSPGTLVEVSCVIDKVEVSWVTAMIVKEIEESGEKK 187

Query: 211 NSYSINSRNDLMEKPSIHDESSASFALTSSKRRRSLSSKS-RVSNPLKKLGEGNVPGIPA 270
               + +++              S  +  +K   ++ S   R   PL  + E ++     
Sbjct: 188 FIVKVCNKH-------------LSCRVDEAKPNMTVDSCCVRPRPPLFFVEEYDLRDCVE 247

Query: 271 ADGSRMMESKTSRGKAFSKSATPNRDRRRRSYLNFHCD-------DDNVSPNKFESPKGG 330
                       +G    K  T   +  +   +  H D       +D V  N      G 
Sbjct: 248 VFHGSSWRQGVVKGVHIEKQYTVTLEATKDKLVVKHSDLRPFKVWEDGVWHN------GP 307

Query: 331 KKLRTKEDVDGSDELKEQTLSLINGSRGNTLK---RSQRTLVTDKEGKEDYDVLETIPK- 390
           ++   KE    S+ +K++ +   +G+R  T K   +  R     +E  E+  V ET+   
Sbjct: 308 QQKPVKE--SPSNAIKQKPMCSSSGARPMTPKMATKHARISFNPEENVEELSVAETVAAT 367

Query: 391 -EVTTSNKSERNKHLASDEKQTPVNNSLHLPDAVGDGEENSNNQTKEKGMEPEQQEATEN 450
            ++     +E +    +  KQT  N   +  + + +     N+ T++  M PE++ + + 
Sbjct: 368 GKLEKMGIAEESVSCVTPLKQTEANAEGNKLEPMRNQNCLRNDSTQQ--MLPEEENSKDG 427

Query: 451 SDKRKRGRPRKIMQEPGQQQASKNSYKRKRGRPRKLMIVPTTAEDMEQDGSGWKPEKATV 510
           S KRKR                    + K      +M         E DG          
Sbjct: 428 STKRKR--------------------EEKHNSASSVM--------DEIDG---------- 487

Query: 511 KSCVTDLNRRNGKELSANKTNGTGTNSVDDDDRPLLMWLGGMQGSASNNSLKLGQTYGSK 570
            +C       NG E   + T  +  N+ D DD+PL   L   Q  +  NS          
Sbjct: 488 -TC-------NGSESEISNTGKSICNNDDVDDQPLSTELPYYQSLSVVNSF--------- 547

Query: 571 RRTKGSEQVDAVNKVRTVDGTPENEVVKNQGWPFVKNSPVWSAIDSLGVFKQIPQKPHFH 630
                +E+  A    RT+              PF K  P W + ++  ++K +PQ PHF 
Sbjct: 548 --AADAEETPA-KSARTIS-------------PFAKKLPFWKSYETDELYKSLPQSPHFS 607

Query: 631 PLSTYKEECREGLAIGCMVTFASLVEKITKLQFDYPRYIFESTLASLYDLEQHGFNISML 690
           PL   KE+ RE  A+G MVTF  L++++  LQ D       S  +SL +LE+HGFN++  
Sbjct: 608 PLFKAKEDIREWSAVGMMVTFYCLLKEVKDLQLDDSSSKLSSLSSSLAELEKHGFNVTDP 667

Query: 691 CNRVNELLFIKDTEMRCIEETKVAENKILEHTKNKTKLAEESKAIEQKIIELQRRQSSIK 750
            +R++++L ++D   +  EE K  E KI      + +  EE    E+ IIE +R+    K
Sbjct: 668 LSRISKVLPLQDKRAKKAEERKCLEKKIECEEIERKRFEEEFADFERIIIEKKRQALVAK 669

Query: 751 LEIETNDNEIEALQSHVETIRECTMSTKLHFENQIALP 768
            + E  D  I  +++  ETI +     +L F+  ++ P
Sbjct: 728 EKKEAADKRIGEMKTCAETIDQEIKDEELEFQTTVSTP 669

BLAST of HG10023039 vs. ExPASy Swiss-Prot
Match: F4I8W1 (DUF724 domain-containing protein 2 OS=Arabidopsis thaliana OX=3702 GN=DUF2 PE=2 SV=1)

HSP 1 Score: 124.8 bits (312), Expect = 4.3e-27
Identity = 188/742 (25.34%), Positives = 293/742 (39.49%), Query Frame = 0

Query: 31  EIEVSIDEEGFKGALFKATILKLPTTFSPSKKKKALVEYKTLVTEDGSTPLKEHVDALSL 90
           ++EV  +EE  KG+ ++A +   PT    +K K   V Y T + E    PL E VD   +
Sbjct: 12  KVEVFSEEEELKGSYYRAILEDNPTKSGHNKLK---VRYLTQLNEHRLAPLTEFVDQRFI 71

Query: 91  RPLPPDTANKD--FEECDIVDAADKDGWWTGVVCKALEGGSYSVLFKNPMHVMDFQRNHL 150
           RP+P +  N    F E  +VDA  KDGWWTGVV K +E   + V F  P  ++ F++  L
Sbjct: 72  RPVPSEDVNDGVVFVEGLMVDAYLKDGWWTGVVVKTMEDEKFLVYFDCPPDIIQFEKKKL 131

Query: 151 RLHQDWVDGKWVVP--QKMDASILR--DQLSIISEDANVPENVQHESLKNTETNIEKKNS 210
           R+H DW   KW+ P  +++  S+      + +  + A +P  V  E        +EK   
Sbjct: 132 RVHLDWTGFKWIRPDNKELVKSVFSCGTMVELRFDCAWIPVIVIKE--------LEKDKR 191

Query: 211 YSINSRNDLMEKPSIHDESSASFALTSSKRRRSLSSKSRVSNPLKKLGEGNVPGIPAADG 270
           + +   N              S++   SK     S + R   P   +G+  +     A  
Sbjct: 192 FLVKYWN-------------KSYSCRESKNLIVDSLRLRPMQPPLSVGKYELLDHVEAFS 251

Query: 271 SRMMESKTSRGKAFSKSATPNRDRRRRSYLNFHCDDDNVSPNKFESPKGGKKLRTKEDVD 330
                    RG  F      +    + +    H D   + P   E   G    RTK    
Sbjct: 252 GFEWRQGVVRGIVFEGRYMVSFGATKEASQFNHSD---IRP-PMEWEDGVWHKRTKP--- 311

Query: 331 GSDELKEQTLSLINGSRGNTLKRSQRTLVTDKEGKEDYDVLETIPKEVTTSNKSERNKHL 390
                K Q  + ++G+R    K      + D       DV +     +T    + +NK  
Sbjct: 312 -----KRQKETSLDGNRNVQTKEPPGNEMAD-------DVKKESGLPITLGVTATKNK-- 371

Query: 391 ASDEKQTPVNNSLHLPDAVGDGEENSNNQTKEKGMEPEQQEATENSDKRKRGRPRKIMQE 450
            +  K +PV      P   G G    N  T+EK   PE+ +    + KRKRG        
Sbjct: 372 -TQGKVSPV------PMKNGFG----NESTREK--MPEEPKIKYYTRKRKRG-------- 431

Query: 451 PGQQQASKNSYKRKRGRPRKLMIVPTTAEDMEQDGSGWKPEKATVKSCVTDLNRRNGKEL 510
                   NSY  K                             TV              L
Sbjct: 432 ----GLKLNSYINK-----------------------------TV--------------L 491

Query: 511 SANKTNGTGTNSVDDDDRPLLMWLGGMQGSASNNSLKLGQTYGSKRRTKGSEQVDAVNKV 570
           S+++T                     ++ SASN                 +E+  A + +
Sbjct: 492 SSDRTPNV------------------VKNSASN-----------------AEENHAKHTI 551

Query: 571 RTVDGTPENEVVKNQGWPFVKNSPVWSAIDSLGVFKQIPQKPHFHPLSTYKEECREGLAI 630
             +              PF K SPVW   +SL VFK +    HF PL   K++ REG AI
Sbjct: 552 MVL--------------PFAKKSPVWKTYESLEVFKSVSHSLHFSPLFETKQDFREGYAI 591

Query: 631 GCMVTFASLVEKITKLQFDYPRYIFESTLASLYDLEQHGFNISMLCNRVNELLFIKDTEM 690
           G MVT+  L+EK   L+ D P     S   S  +LE+HGFN++   +R+++LL +KD ++
Sbjct: 612 GMMVTYFGLLEKFKDLEADVPVSQLNSLKDSFSELEKHGFNVTTPLSRIDKLLALKDRQL 591

Query: 691 RCIEETKVAENKIL-EHTKNKTKLAE-ESKAIE--QKIIELQRRQSSIKLEIETNDNEIE 750
             +EE K  + ++  E +K K +  + E K +E   KIIELQR+++++K + E    + +
Sbjct: 672 YIMEELKGFDKEMTNEFSKAKQEFDDMEQKILEVKHKIIELQRQEAALKEQKEAEKEQKD 591

Query: 751 ALQSHVETIRECTMSTKLHFEN 763
           A    +  +  C     +  E+
Sbjct: 732 AAWKKICQMESCAKDLNVELED 591

BLAST of HG10023039 vs. ExPASy TrEMBL
Match: A0A1S3BSM1 (uncharacterized protein LOC103492739 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103492739 PE=4 SV=1)

HSP 1 Score: 1243.4 bits (3216), Expect = 0.0e+00
Identity = 651/776 (83.89%), Positives = 705/776 (90.85%), Query Frame = 0

Query: 1   MGEFGSPSTNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS 60
           MGEFGSP+TNTHTHHHN+HR HQ PFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS
Sbjct: 1   MGEFGSPATNTHTHHHNNHRLHQFPFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS 60

Query: 61  KKKKALVEYKTLVTEDGSTPLKEHVDALSLRPLPPDTANKDFEECDIVDAADKDGWWTGV 120
           K+KKALVEYKTLVTEDGS+PLKE VDALSLRPLPPDTA+KDFEECDIVDA DKDGWWTGV
Sbjct: 61  KRKKALVEYKTLVTEDGSSPLKEQVDALSLRPLPPDTADKDFEECDIVDATDKDGWWTGV 120

Query: 121 VCKALEGGSYSVLFKNPMHVMDFQRNHLRLHQDWVDGKWVVPQKMDASILRDQLSIISED 180
           VCKALE G YSV FKNPMHVMDFQRNHLRLHQDWVDGKWVVP+KMDAS+LRDQLSIISED
Sbjct: 121 VCKALEDGGYSVFFKNPMHVMDFQRNHLRLHQDWVDGKWVVPRKMDASLLRDQLSIISED 180

Query: 181 ANVPENVQHESLKNTETNIEKKNSYSINSRNDLMEKPSIHDESSASFALTSSKRRRSLSS 240
           ANVPENV+HESLKN ETN  K+NSY++NSRNDLMEKPSI+DES ASFALTSSKRRRSL+S
Sbjct: 181 ANVPENVEHESLKNNETNNGKENSYTVNSRNDLMEKPSIYDESPASFALTSSKRRRSLTS 240

Query: 241 KSRVSNPLKKLGEGNVPGIPAADGSRMMESKTSRGKAFSKSATPNRD-RRRRSYLNFHCD 300
           KSRVSNPLK+L EG + G PAAD SRM++ KTSRGKAFSKSATPN+D RRRRSYLNFH D
Sbjct: 241 KSRVSNPLKRLREGVILGTPAADRSRMID-KTSRGKAFSKSATPNKDRRRRRSYLNFHGD 300

Query: 301 DDNVSPNKFESPKGGKKLRTKEDVDGSDELKEQTLSLINGSRGNTLKRSQRTLVTDKEGK 360
           DD+ SPN+   PKGGKK RTKEDVDGSD+LKEQ LS ING++GNT KRSQRT VTDKE K
Sbjct: 301 DDSASPNRSGIPKGGKKPRTKEDVDGSDKLKEQVLSFINGNKGNTYKRSQRTQVTDKERK 360

Query: 361 EDYDV--LETIPKEVTTSNKSERNKHLASDEKQTPVNNSLHLPDAVGDGEENSNNQTKEK 420
           E YDV  LETI K+VTT+N+SERNKHLA DE+QTPV  SL   D VGDGEENSNNQTKEK
Sbjct: 361 EGYDVIDLETISKDVTTNNESERNKHLAPDEQQTPVKISL---DVVGDGEENSNNQTKEK 420

Query: 421 GMEPEQQEATENSDKRKRGRPRKIMQEPGQQQASKNSYKRKRGRPRKLMIVPTTAEDMEQ 480
           GMEPEQQEATENSD+RKRGRPRKI QE  QQQASKNSYKRKRGRPRKLM+VPTTAED  Q
Sbjct: 421 GMEPEQQEATENSDRRKRGRPRKITQEIEQQQASKNSYKRKRGRPRKLMLVPTTAEDSNQ 480

Query: 481 DGSGWKPEKATVKSCVTDLNRRNGKELSANKTNGTGTNSVDDDDRPLLMWLGGMQGSASN 540
           DGS WKPEKAT+KS VTDLNRRNG E+SA KTNG+GTNSVDDDDRPLLMWLGG+QGSA+N
Sbjct: 481 DGSLWKPEKATLKSSVTDLNRRNGSEISAYKTNGSGTNSVDDDDRPLLMWLGGIQGSANN 540

Query: 541 NSLKLGQTYGS--KRRTKGSEQVDAVNKVRTVDGTPENEVVKNQGWPFVKNSPVWSAIDS 600
           N+LKLGQ  GS  KRRTKGSE+VDA+N+VR VD  PE+EV KN+ WPFVKNSPVWSAIDS
Sbjct: 541 NALKLGQASGSIAKRRTKGSERVDAMNEVRRVDRMPEHEVDKNRDWPFVKNSPVWSAIDS 600

Query: 601 LGVFKQIPQKPHFHPLSTYKEECREGLAIGCMVTFASLVEKITKLQFDYPRYIFESTLAS 660
           L VFK IPQKPHF PLSTYKEECREGLAIGCMVTFASLVEK+TKLQF YPR+IFESTLAS
Sbjct: 601 LEVFKHIPQKPHFQPLSTYKEECREGLAIGCMVTFASLVEKVTKLQFSYPRHIFESTLAS 660

Query: 661 LYDLEQHGFNISMLCNRVNELLFIKDTEMRCIEETKVAENKILEHTKNKTKLAEESKAIE 720
           LY+LEQHGFNISMLCNRVNELLFIKD+E+R  EETKVAENKILE+ +NKTKLAEE  AIE
Sbjct: 661 LYELEQHGFNISMLCNRVNELLFIKDSEVRYAEETKVAENKILEYIENKTKLAEERHAIE 720

Query: 721 QKIIELQRRQSSIKLEIETNDNEIEALQSHVETIRECTMSTKLHFENQIALPLWPL 772
           QKI ELQ+RQ+SIK E+ET D+EI+ALQSHVETIRECT +TKLHFENQIALPLWP+
Sbjct: 721 QKITELQKRQASIKQEMETTDHEIDALQSHVETIRECTTNTKLHFENQIALPLWPV 772

BLAST of HG10023039 vs. ExPASy TrEMBL
Match: A0A5D3D3Y2 (DUF724 domain-containing protein 7-like isoform X3 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold411G001890 PE=4 SV=1)

HSP 1 Score: 1243.0 bits (3215), Expect = 0.0e+00
Identity = 651/776 (83.89%), Positives = 705/776 (90.85%), Query Frame = 0

Query: 1   MGEFGSPSTNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS 60
           MGEFGSP+TNTHTHHHN+HR HQ PFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS
Sbjct: 1   MGEFGSPATNTHTHHHNNHRLHQFPFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS 60

Query: 61  KKKKALVEYKTLVTEDGSTPLKEHVDALSLRPLPPDTANKDFEECDIVDAADKDGWWTGV 120
           K+KKALVEYKTLVTEDGS+PLKE VDALSLRPLPPDTA+KDFEECDIVDA DKDGWWTGV
Sbjct: 61  KRKKALVEYKTLVTEDGSSPLKEQVDALSLRPLPPDTADKDFEECDIVDATDKDGWWTGV 120

Query: 121 VCKALEGGSYSVLFKNPMHVMDFQRNHLRLHQDWVDGKWVVPQKMDASILRDQLSIISED 180
           VCKALE G YSV FKNPMHVMDFQRNHLRLHQDWVDGKWVVP+KMDAS+LRDQLSIISED
Sbjct: 121 VCKALEDGGYSVFFKNPMHVMDFQRNHLRLHQDWVDGKWVVPRKMDASLLRDQLSIISED 180

Query: 181 ANVPENVQHESLKNTETNIEKKNSYSINSRNDLMEKPSIHDESSASFALTSSKRRRSLSS 240
           ANVPENV+HESLKN ETN  K+NSY++NSRNDLMEKPSI+DES ASFALTSSKRRRSL+S
Sbjct: 181 ANVPENVEHESLKNNETNNGKENSYTVNSRNDLMEKPSIYDESPASFALTSSKRRRSLTS 240

Query: 241 KSRVSNPLKKLGEGNVPGIPAADGSRMMESKTSRGKAFSKSATPNRD-RRRRSYLNFHCD 300
           KSRVSNPLK+L EG + G PAAD SRM++ KTSRGKAFSKSATPN+D RRRRSYLNFH D
Sbjct: 241 KSRVSNPLKRLREGVILGTPAADRSRMID-KTSRGKAFSKSATPNKDRRRRRSYLNFHGD 300

Query: 301 DDNVSPNKFESPKGGKKLRTKEDVDGSDELKEQTLSLINGSRGNTLKRSQRTLVTDKEGK 360
           DD+ SPN+   PKGGKK RTKEDVDGSD+LKEQ LS ING++GNT KRSQRT VTDKE K
Sbjct: 301 DDSASPNRSGIPKGGKKPRTKEDVDGSDKLKEQVLSFINGNKGNTYKRSQRTQVTDKERK 360

Query: 361 EDYDV--LETIPKEVTTSNKSERNKHLASDEKQTPVNNSLHLPDAVGDGEENSNNQTKEK 420
           E YDV  LETI K+VTT+N+SERNKHLA DE+QTPV  SL   D VGDGEENSNNQTKEK
Sbjct: 361 EGYDVIDLETISKDVTTNNESERNKHLAPDEQQTPVKISL---DEVGDGEENSNNQTKEK 420

Query: 421 GMEPEQQEATENSDKRKRGRPRKIMQEPGQQQASKNSYKRKRGRPRKLMIVPTTAEDMEQ 480
           GMEPEQQEATENSD+RKRGRPRKI QE  QQQASKNSYKRKRGRPRKLM+VPTTAED  Q
Sbjct: 421 GMEPEQQEATENSDRRKRGRPRKITQEIEQQQASKNSYKRKRGRPRKLMLVPTTAEDSNQ 480

Query: 481 DGSGWKPEKATVKSCVTDLNRRNGKELSANKTNGTGTNSVDDDDRPLLMWLGGMQGSASN 540
           DGS WKPEKAT+KS VTDLNRRNG E+SA KTNG+GTNSVDDDDRPLLMWLGG+QGSA+N
Sbjct: 481 DGSLWKPEKATLKSSVTDLNRRNGSEISAYKTNGSGTNSVDDDDRPLLMWLGGIQGSANN 540

Query: 541 NSLKLGQTYGS--KRRTKGSEQVDAVNKVRTVDGTPENEVVKNQGWPFVKNSPVWSAIDS 600
           N+LKLGQ  GS  KRRTKGSE+VDA+N+VR VD  PE+EV KN+ WPFVKNSPVWSAIDS
Sbjct: 541 NALKLGQASGSIAKRRTKGSERVDAMNEVRRVDRMPEHEVDKNRDWPFVKNSPVWSAIDS 600

Query: 601 LGVFKQIPQKPHFHPLSTYKEECREGLAIGCMVTFASLVEKITKLQFDYPRYIFESTLAS 660
           L VFK IPQKPHF PLSTYKEECREGLAIGCMVTFASLVEK+TKLQF YPR+IFESTLAS
Sbjct: 601 LEVFKHIPQKPHFQPLSTYKEECREGLAIGCMVTFASLVEKVTKLQFSYPRHIFESTLAS 660

Query: 661 LYDLEQHGFNISMLCNRVNELLFIKDTEMRCIEETKVAENKILEHTKNKTKLAEESKAIE 720
           LY+LEQHGFNISMLCNRVNELLFIKD+E+R  EETKVAENKILE+ +NKTKLAEE  AIE
Sbjct: 661 LYELEQHGFNISMLCNRVNELLFIKDSEVRYAEETKVAENKILEYIENKTKLAEERHAIE 720

Query: 721 QKIIELQRRQSSIKLEIETNDNEIEALQSHVETIRECTMSTKLHFENQIALPLWPL 772
           QKI ELQ+RQ+SIK E+ET D+EI+ALQSHVETIRECT +TKLHFENQIALPLWP+
Sbjct: 721 QKITELQKRQASIKQEMETTDHEIDALQSHVETIRECTTNTKLHFENQIALPLWPV 772

BLAST of HG10023039 vs. ExPASy TrEMBL
Match: A0A1S3BSB2 (uncharacterized protein LOC103492739 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103492739 PE=4 SV=1)

HSP 1 Score: 1231.1 bits (3184), Expect = 0.0e+00
Identity = 651/797 (81.68%), Positives = 705/797 (88.46%), Query Frame = 0

Query: 1   MGEFGSPSTNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS 60
           MGEFGSP+TNTHTHHHN+HR HQ PFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS
Sbjct: 1   MGEFGSPATNTHTHHHNNHRLHQFPFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS 60

Query: 61  KKKKALVEYKTLVTEDGSTPLKEHVDALSLRPLPPDTANKDFEECDIVDAADKDGWWTGV 120
           K+KKALVEYKTLVTEDGS+PLKE VDALSLRPLPPDTA+KDFEECDIVDA DKDGWWTGV
Sbjct: 61  KRKKALVEYKTLVTEDGSSPLKEQVDALSLRPLPPDTADKDFEECDIVDATDKDGWWTGV 120

Query: 121 VCKALEGGSYSVLFKNPMHVMDFQRNHLRLHQDWVDGKWVVPQKMDASILRDQLSIISED 180
           VCKALE G YSV FKNPMHVMDFQRNHLRLHQDWVDGKWVVP+KMDAS+LRDQLSIISED
Sbjct: 121 VCKALEDGGYSVFFKNPMHVMDFQRNHLRLHQDWVDGKWVVPRKMDASLLRDQLSIISED 180

Query: 181 ANVPENVQHESLKNTETNIEKKNSYSINSRNDLMEKPSIHDESSASFALTSSKRRRSLSS 240
           ANVPENV+HESLKN ETN  K+NSY++NSRNDLMEKPSI+DES ASFALTSSKRRRSL+S
Sbjct: 181 ANVPENVEHESLKNNETNNGKENSYTVNSRNDLMEKPSIYDESPASFALTSSKRRRSLTS 240

Query: 241 KSRVSNPLKKLGEGNVPGIPAADGSRMMESKTSRGKAFSKSATPNRD-RRRRSYLNFHCD 300
           KSRVSNPLK+L EG + G PAAD SRM++ KTSRGKAFSKSATPN+D RRRRSYLNFH D
Sbjct: 241 KSRVSNPLKRLREGVILGTPAADRSRMID-KTSRGKAFSKSATPNKDRRRRRSYLNFHGD 300

Query: 301 DDNVSPNKFESPKGGKKLRTKEDVDGSDELKEQTLSLINGSRGNTLKRSQRTLVTDKEGK 360
           DD+ SPN+   PKGGKK RTKEDVDGSD+LKEQ LS ING++GNT KRSQRT VTDKE K
Sbjct: 301 DDSASPNRSGIPKGGKKPRTKEDVDGSDKLKEQVLSFINGNKGNTYKRSQRTQVTDKERK 360

Query: 361 EDYDV--LETIPKEVTTSNKSERNKHLASDEKQTPVNNSLHLPDAVGDGEENSNNQTKEK 420
           E YDV  LETI K+VTT+N+SERNKHLA DE+QTPV  SL   D VGDGEENSNNQTKEK
Sbjct: 361 EGYDVIDLETISKDVTTNNESERNKHLAPDEQQTPVKISL---DVVGDGEENSNNQTKEK 420

Query: 421 GMEPEQQEATENSDKRKRGRPRKIMQEPGQQQASKNSYKRKRGRPRKLMIVPTTAEDMEQ 480
           GMEPEQQEATENSD+RKRGRPRKI QE  QQQASKNSYKRKRGRPRKLM+VPTTAED  Q
Sbjct: 421 GMEPEQQEATENSDRRKRGRPRKITQEIEQQQASKNSYKRKRGRPRKLMLVPTTAEDSNQ 480

Query: 481 DGSGWKPEKATVKSCVT---------------------DLNRRNGKELSANKTNGTGTNS 540
           DGS WKPEKAT+KS VT                     DLNRRNG E+SA KTNG+GTNS
Sbjct: 481 DGSLWKPEKATLKSSVTAKRTKRKKGFRKMPMYFVDFQDLNRRNGSEISAYKTNGSGTNS 540

Query: 541 VDDDDRPLLMWLGGMQGSASNNSLKLGQTYGS--KRRTKGSEQVDAVNKVRTVDGTPENE 600
           VDDDDRPLLMWLGG+QGSA+NN+LKLGQ  GS  KRRTKGSE+VDA+N+VR VD  PE+E
Sbjct: 541 VDDDDRPLLMWLGGIQGSANNNALKLGQASGSIAKRRTKGSERVDAMNEVRRVDRMPEHE 600

Query: 601 VVKNQGWPFVKNSPVWSAIDSLGVFKQIPQKPHFHPLSTYKEECREGLAIGCMVTFASLV 660
           V KN+ WPFVKNSPVWSAIDSL VFK IPQKPHF PLSTYKEECREGLAIGCMVTFASLV
Sbjct: 601 VDKNRDWPFVKNSPVWSAIDSLEVFKHIPQKPHFQPLSTYKEECREGLAIGCMVTFASLV 660

Query: 661 EKITKLQFDYPRYIFESTLASLYDLEQHGFNISMLCNRVNELLFIKDTEMRCIEETKVAE 720
           EK+TKLQF YPR+IFESTLASLY+LEQHGFNISMLCNRVNELLFIKD+E+R  EETKVAE
Sbjct: 661 EKVTKLQFSYPRHIFESTLASLYELEQHGFNISMLCNRVNELLFIKDSEVRYAEETKVAE 720

Query: 721 NKILEHTKNKTKLAEESKAIEQKIIELQRRQSSIKLEIETNDNEIEALQSHVETIRECTM 772
           NKILE+ +NKTKLAEE  AIEQKI ELQ+RQ+SIK E+ET D+EI+ALQSHVETIRECT 
Sbjct: 721 NKILEYIENKTKLAEERHAIEQKITELQKRQASIKQEMETTDHEIDALQSHVETIRECTT 780

BLAST of HG10023039 vs. ExPASy TrEMBL
Match: A0A5A7TTF2 (DUF724 domain-containing protein 7-like isoform X3 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold427G00820 PE=4 SV=1)

HSP 1 Score: 1230.7 bits (3183), Expect = 0.0e+00
Identity = 651/797 (81.68%), Positives = 705/797 (88.46%), Query Frame = 0

Query: 1   MGEFGSPSTNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS 60
           MGEFGSP+TNTHTHHHN+HR HQ PFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS
Sbjct: 1   MGEFGSPATNTHTHHHNNHRLHQFPFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS 60

Query: 61  KKKKALVEYKTLVTEDGSTPLKEHVDALSLRPLPPDTANKDFEECDIVDAADKDGWWTGV 120
           K+KKALVEYKTLVTEDGS+PLKE VDALSLRPLPPDTA+KDFEECDIVDA DKDGWWTGV
Sbjct: 61  KRKKALVEYKTLVTEDGSSPLKEQVDALSLRPLPPDTADKDFEECDIVDATDKDGWWTGV 120

Query: 121 VCKALEGGSYSVLFKNPMHVMDFQRNHLRLHQDWVDGKWVVPQKMDASILRDQLSIISED 180
           VCKALE G YSV FKNPMHVMDFQRNHLRLHQDWVDGKWVVP+KMDAS+LRDQLSIISED
Sbjct: 121 VCKALEDGGYSVFFKNPMHVMDFQRNHLRLHQDWVDGKWVVPRKMDASLLRDQLSIISED 180

Query: 181 ANVPENVQHESLKNTETNIEKKNSYSINSRNDLMEKPSIHDESSASFALTSSKRRRSLSS 240
           ANVPENV+HESLKN ETN  K+NSY++NSRNDLMEKPSI+DES ASFALTSSKRRRSL+S
Sbjct: 181 ANVPENVEHESLKNNETNNGKENSYTVNSRNDLMEKPSIYDESPASFALTSSKRRRSLTS 240

Query: 241 KSRVSNPLKKLGEGNVPGIPAADGSRMMESKTSRGKAFSKSATPNRD-RRRRSYLNFHCD 300
           KSRVSNPLK+L EG + G PAAD SRM++ KTSRGKAFSKSATPN+D RRRRSYLNFH D
Sbjct: 241 KSRVSNPLKRLREGVILGTPAADRSRMID-KTSRGKAFSKSATPNKDRRRRRSYLNFHGD 300

Query: 301 DDNVSPNKFESPKGGKKLRTKEDVDGSDELKEQTLSLINGSRGNTLKRSQRTLVTDKEGK 360
           DD+ SPN+   PKGGKK RTKEDVDGSD+LKEQ LS ING++GNT KRSQRT VTDKE K
Sbjct: 301 DDSASPNRSGIPKGGKKPRTKEDVDGSDKLKEQVLSFINGNKGNTYKRSQRTQVTDKERK 360

Query: 361 EDYDV--LETIPKEVTTSNKSERNKHLASDEKQTPVNNSLHLPDAVGDGEENSNNQTKEK 420
           E YDV  LETI K+VTT+N+SERNKHLA DE+QTPV  SL   D VGDGEENSNNQTKEK
Sbjct: 361 EGYDVIDLETISKDVTTNNESERNKHLAPDEQQTPVKISL---DEVGDGEENSNNQTKEK 420

Query: 421 GMEPEQQEATENSDKRKRGRPRKIMQEPGQQQASKNSYKRKRGRPRKLMIVPTTAEDMEQ 480
           GMEPEQQEATENSD+RKRGRPRKI QE  QQQASKNSYKRKRGRPRKLM+VPTTAED  Q
Sbjct: 421 GMEPEQQEATENSDRRKRGRPRKITQEIEQQQASKNSYKRKRGRPRKLMLVPTTAEDSNQ 480

Query: 481 DGSGWKPEKATVKSCVT---------------------DLNRRNGKELSANKTNGTGTNS 540
           DGS WKPEKAT+KS VT                     DLNRRNG E+SA KTNG+GTNS
Sbjct: 481 DGSLWKPEKATLKSSVTAKRTKRKKGFRKMPMYFVDFQDLNRRNGSEISAYKTNGSGTNS 540

Query: 541 VDDDDRPLLMWLGGMQGSASNNSLKLGQTYGS--KRRTKGSEQVDAVNKVRTVDGTPENE 600
           VDDDDRPLLMWLGG+QGSA+NN+LKLGQ  GS  KRRTKGSE+VDA+N+VR VD  PE+E
Sbjct: 541 VDDDDRPLLMWLGGIQGSANNNALKLGQASGSIAKRRTKGSERVDAMNEVRRVDRMPEHE 600

Query: 601 VVKNQGWPFVKNSPVWSAIDSLGVFKQIPQKPHFHPLSTYKEECREGLAIGCMVTFASLV 660
           V KN+ WPFVKNSPVWSAIDSL VFK IPQKPHF PLSTYKEECREGLAIGCMVTFASLV
Sbjct: 601 VDKNRDWPFVKNSPVWSAIDSLEVFKHIPQKPHFQPLSTYKEECREGLAIGCMVTFASLV 660

Query: 661 EKITKLQFDYPRYIFESTLASLYDLEQHGFNISMLCNRVNELLFIKDTEMRCIEETKVAE 720
           EK+TKLQF YPR+IFESTLASLY+LEQHGFNISMLCNRVNELLFIKD+E+R  EETKVAE
Sbjct: 661 EKVTKLQFSYPRHIFESTLASLYELEQHGFNISMLCNRVNELLFIKDSEVRYAEETKVAE 720

Query: 721 NKILEHTKNKTKLAEESKAIEQKIIELQRRQSSIKLEIETNDNEIEALQSHVETIRECTM 772
           NKILE+ +NKTKLAEE  AIEQKI ELQ+RQ+SIK E+ET D+EI+ALQSHVETIRECT 
Sbjct: 721 NKILEYIENKTKLAEERHAIEQKITELQKRQASIKQEMETTDHEIDALQSHVETIRECTT 780
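
The percentages in an HSP statistics line follow directly from the counts shown beside them (for example, 651/797 gives 81.68%). A minimal Python sketch for checking them is given here; the function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool.

```python
import re

# Pattern for an HSP statistics line as printed on this page.
# "Posi?tives" also accepts the misspelling "Postives" (assumed to be a page artifact).
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return recomputed and reported percentages for one HSP line, or None if no match."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        # Percentages recomputed from the raw counts.
        "identity": round(100 * int(ident) / int(ident_len), 2),
        "positives": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed on the page, for comparison.
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }
```

Comparing the recomputed and reported values is a quick sanity check when HSP lines are copied out of the page by hand.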

BLAST of HG10023039 vs. ExPASy TrEMBL
Match: A0A0A0K5U9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G412860 PE=4 SV=1)

HSP 1 Score: 1223.4 bits (3164), Expect = 0.0e+00
Identity = 646/776 (83.25%), Positives = 695/776 (89.56%), Query Frame = 0

Query: 1   MGEFGSPSTNTHTHHHNHHRHHQLPFTVGSEIEVSIDEEGFKGALFKATILKLPTTFSPS 60
           MGEFGSPSTNTH HH N+HR HQ PFTVGSEIEVSIDEEGFKGALFKATILKLPT FSPS
Sbjct: 1   MGEFGSPSTNTHIHHPNNHRLHQFPFTVGSEIEVSIDEEGFKGALFKATILKLPTIFSPS 60

Query: 61  KKKKALVEYKTLVTEDGSTPLKEHVDALSLRPLPPDTANKDFEECDIVDAADKDGWWTGV 120
           KKKKALVEYKTLVTEDGSTPLKEHVDALSLRPLPPDTA KDFEECDIVDA DKDGWWTGV
Sbjct: 61  KKKKALVEYKTLVTEDGSTPLKEHVDALSLRPLPPDTAAKDFEECDIVDATDKDGWWTGV 120

Query: 121 VCKALEGGSYSVLFKNPMHVMDFQRNHLRLHQDWVDGKWVVPQKMDASILRDQLSIISED 180
           VCK LE G YSV FKNPMHVMDFQ NHLRLHQDWVDGKW+VPQKMDAS+LR QLSIISED
Sbjct: 121 VCKVLEDGGYSVFFKNPMHVMDFQGNHLRLHQDWVDGKWIVPQKMDASLLRGQLSIISED 180

Query: 181 ANVPENVQHESLKNTETNIEKKNSYSINSRNDLMEKPSIHDESSASFALTSSKRRRSLSS 240
           ANVPENV+H SLKN ETN EK+NSY++NSRNDLME+PSI+D+SSASFALTSSKRRRS SS
Sbjct: 181 ANVPENVEHRSLKNNETNNEKENSYTVNSRNDLMERPSIYDDSSASFALTSSKRRRSFSS 240

Query: 241 KSRVSNPLKKLGEGNVPGIPAADGSRMMESKTSRGKAFSKSATPNRD-RRRRSYLNFHCD 300
           KSRVSNPLKKL EG + G PAAD SRM++ KTSRGKAFSKSATPN+D RRRRSYL F+ D
Sbjct: 241 KSRVSNPLKKLREGVILGKPAADRSRMID-KTSRGKAFSKSATPNKDRRRRRSYLKFNGD 300

Query: 301 DDNVSPNKFESPKGGKKLRTKEDVDGSDELKEQTLSLINGSRGNTLKRSQRTLVTDKEGK 360
           DD+ SP +  SPKGGKK RTKEDVDGSD+LK Q LS ING +GNT K+SQ+T VTDKE K
Sbjct: 301 DDSASPIRSGSPKGGKKPRTKEDVDGSDKLKVQVLSFINGKKGNTYKQSQQTQVTDKERK 360

Query: 361 EDYDV--LETIPKEVTTSNKSERNKHLASDEKQTPVNNSLHLPDAVGDGEENSNNQTKEK 420
           E YDV  LETI KEVTT+N+SERN+HLASDE+Q PV NSL     VGDGEENS NQTKEK
Sbjct: 361 EGYDVIDLETIYKEVTTNNESERNEHLASDEQQAPVKNSL---GEVGDGEENSKNQTKEK 420

Query: 421 GMEPEQQEATENSDKRKRGRPRKIMQEPGQQQASKNSYKRKRGRPRKLMIVPTTAEDMEQ 480
           GMEP+QQEATENSD+RKRGRPRKIMQE  QQQASKNSYKRKRGRPRKLM+VPTTAED  +
Sbjct: 421 GMEPQQQEATENSDRRKRGRPRKIMQEIEQQQASKNSYKRKRGRPRKLMLVPTTAEDSNK 480

Query: 481 DGSGWKPEKATVKSCVTDLNRRNGKELSANKTNGTGTNSVDDDDRPLLMWLGGMQGSASN 540
           DGS WKPEKAT+KS VTDLNRRNG E+S  KTNG+GTNSVDDDDRPLLMWLGG+QGSASN
Sbjct: 481 DGSVWKPEKATLKSSVTDLNRRNGSEISEYKTNGSGTNSVDDDDRPLLMWLGGIQGSASN 540

Query: 541 NSLKLGQTYGS--KRRTKGSEQVDAVNKVRTVDGTPENEVVKNQGWPFVKNSPVWSAIDS 600
           N+LKLGQ  GS  KRRTKGSEQVDAVN VR VDGTPE+EV KNQ WPFVKNSPVWSAIDS
Sbjct: 541 NALKLGQASGSSAKRRTKGSEQVDAVNGVRRVDGTPEHEVDKNQDWPFVKNSPVWSAIDS 600

Query: 601 LGVFKQIPQKPHFHPLSTYKEECREGLAIGCMVTFASLVEKITKLQFDYPRYIFESTLAS 660
           L VFK IPQKPHF PLST+KEECREGLAIGCMVTFASLVEKITKLQF  PR+IFESTLAS
Sbjct: 601 LEVFKHIPQKPHFQPLSTHKEECREGLAIGCMVTFASLVEKITKLQFSNPRHIFESTLAS 660

Query: 661 LYDLEQHGFNISMLCNRVNELLFIKDTEMRCIEETKVAENKILEHTKNKTKLAEESKAIE 720
           LY+LEQHGFNISMLCNRVNELLFIKD+EMR  EETKV ENKI+E+ +NKTKLAEES AIE
Sbjct: 661 LYELEQHGFNISMLCNRVNELLFIKDSEMRYGEETKVTENKIMEYIENKTKLAEESNAIE 720

Query: 721 QKIIELQRRQSSIKLEIETNDNEIEALQSHVETIRECTMSTKLHFENQIALPLWPL 772
           +KI ELQ+RQ+SIK E+ET DNEI+ALQSHV TIRECTM+TKLHFENQIALPLWP+
Sbjct: 721 EKITELQKRQASIKKEMETTDNEIDALQSHVVTIRECTMNTKLHFENQIALPLWPV 772

BLAST of HG10023039 vs. TAIR 10
Match: AT1G26540.1 (Agenet domain-containing protein )

HSP 1 Score: 181.8 bits (460), Expect = 2.1e-45
Identity = 204/752 (27.13%), Positives = 330/752 (43.88%), Query Frame = 0

Query: 32  IEVSIDEEGFKGALFKATILKLPTTFSPSKKKKALVEYKTLVTEDGSTPLKEHVDALSLR 91
           +EVS +EEGF+GA F+A + + P     S ++K  V Y TL+  DGS+PL EH++   +R
Sbjct: 14  VEVSSEEEGFEGAWFRAVLEENP---GNSSRRKLRVRYSTLLDMDGSSPLIEHIEQRFIR 73

Query: 92  PLPP-DTANKD--FEECDIVDAADKDGWWTGVVCKALEGGSYSVLFKNPMHVMDFQRNHL 151
           P+PP +   KD   EE  +VDA  KDGWWTGVV K +E  +Y V F  P  ++ F+R  L
Sbjct: 74  PVPPEENQQKDVVLEEGLLVDADHKDGWWTGVVVKKMEDDNYLVYFDLPPDIIQFERKQL 133

Query: 152 RLHQDWVDGKWVVPQKMDASILRDQLSIISEDANVPENVQHESLKNTETNIEKKNSYSIN 211
           R H  W  G W+ P+  +++        + E  +  E V   ++   ET+++ K  + + 
Sbjct: 134 RTHLIWTGGTWIQPEIEESNKSMFSPGTMVEVFSAKEAVWSPAMVVKETDVDDKKKFIVK 193

Query: 212 SRNDLMEKPSIHDESSASFALTSSKRRRSLSSKSRVSNPLKKLGEGNVPGIPAADGSRMM 271
             N  +   S + + +    + +S+R R +   S V             G+    G ++ 
Sbjct: 194 DCNRYL---SCNGDEARPTNIVNSRRVRPIPPPSSVDKYALLESVETFSGLGWHKG-QVR 253

Query: 272 ESKTSRGKAFSKSATPNRDRRRRSYLN-FHCDDDNVSPNKFESPKGGKKLRTKEDVDGSD 331
           +  +         AT      R S L  F   +D V  N        K+   KE      
Sbjct: 254 KILSENRYTVRLEATQQESTIRHSDLRPFMVWEDGVWYNDL------KQKPIKE--TPPT 313

Query: 332 ELKEQTLSLINGSRGNTLKRSQRTLVTDKEGKEDYDVLETIPKEVTTSNKSERNK----- 391
            LK + +   + ++  T   + + L +    KE  +   T  K V+ + +  +NK     
Sbjct: 314 ILKRKPMRSCSAAKSMTPTSATKHLRSFLNSKEISET-PTKAKFVSATRELGKNKADAVM 373

Query: 392 ----HLASDEKQTPVNNSLHLPDAVGDGEENSNNQTKEKGMEP-EQQEATENSDKRKRGR 451
               HL    ++T +   + +        E    ++ +K  EP + Q   ENS  +    
Sbjct: 374 NDKTHLLITPQETSIAPVITVTPLKQQDAETEGKKSPKKTPEPVKHQNGLENSSTQH--- 433

Query: 452 PRKIMQEPGQQQASKNSYKRKRGRPRKLMIVPTTAEDMEQDGSGWKPEKATVKSCVTDLN 511
                + P ++ +++ S KRKR                EQ+ +    E  T ++C     
Sbjct: 434 -----EMPEEENSNEKSRKRKR----------------EQNQNSNLNE--TDETC----- 493

Query: 512 RRNGKELSANKTNGTG-TNSVDD-DDRPLLMWLGGMQGSASNNSLKLGQTYGSKRRTKGS 571
                 +S    NGT  T  VDD DD+PL  W+      +S+ S               S
Sbjct: 494 -----NVSKAGVNGTSDTIRVDDVDDQPLSSWINIPTVLSSDQS---------------S 553

Query: 572 EQVDAVNKVRTVDGTPENEVVKNQGWPFVKNSPVWSAIDSLGVFKQIPQKPHFHPLSTYK 631
             VD  N    V+ T     +  +  PF KN P W   +    +K +PQ PHF PL  +K
Sbjct: 554 NVVD--NSAADVEETQAKGALTIE--PFTKNLPFWKTYEMEKGYKTVPQNPHFSPLLEFK 613

Query: 632 EECREGLAIGCMVTFASLVEKITKLQFDYPRYIFESTLASLYDLEQHGFNISMLCNRVNE 691
           E+ RE  A+G MV+F  L+E++ KLQ D       S      +LE+HGF+I+   +R+N+
Sbjct: 614 EDIREWSAVGMMVSFYGLLEEVKKLQLDVSSSKLGSLSTCFAELEKHGFDIATPQSRINK 673

Query: 692 LLFIKDTEMRCIEETKVAENKILEHTKNKTKLAEESKAIEQKIIELQRRQSSIKLEIETN 751
           +L ++    + +EE K  E +I        K   E   +E+K++EL+RR    K + E  
Sbjct: 674 VLSLQVGRAKKVEERKCLEKRIEAEEIEMQKFEHEMVEVERKMLELKRRAEVAKEKKEAA 694

Query: 752 DNEIEALQSHVETIRECTMSTKLHFENQIALP 768
           D  I  ++S  ETI +   + +L F   +  P
Sbjct: 734 DKMIVEMKSSAETIDQEIANVELEFITSVLAP 694

BLAST of HG10023039 vs. TAIR 10
Match: AT2G47230.2 (DOMAIN OF UNKNOWN FUNCTION 724 6 )

HSP 1 Score: 174.9 bits (442), Expect = 2.6e-43
Identity = 202/774 (26.10%), Positives = 325/774 (41.99%), Query Frame = 0

Query: 29  GSEIEVSIDEEGFKGALFKATILKLPTTFSPSKKKKALVEYKTLVTEDGSTPLKEHVDAL 88
           GSE+EVS  EEGF  A F+  + + PT    S +KK  V Y TL+ +D  +PL E+++  
Sbjct: 8   GSEVEVSSTEEGFADAWFRGILQENPT---KSGRKKLRVRYLTLLNDDALSPLIENIEPR 67

Query: 89  SLRPLPPDTANKD--FEECDIVDAADKDGWWTGVVCKALEGGSYSVLFKNPMHVMDFQRN 148
            +RP+PP+        EE  +VDA  KDGWWTGV+ K LE G + V + +P  +++F+RN
Sbjct: 68  FIRPVPPENEYNGIVLEEGTVVDADHKDGWWTGVIIKKLENGKFWVYYDSPPDIIEFERN 127

Query: 149 HLRLHQDWVDGKWVVP--QKMDASILRD-QLSIISEDANVPENVQHESLKNTETNIEKKN 208
            LR H  W   KW+ P  Q++D S+     ++ +S   +  E     ++   E  ++ + 
Sbjct: 128 QLRPHLRWSGWKWLRPDIQELDKSMFSSGTMAEVSTIVDKAEVAWFPAMIIKEIEVDGEK 187

Query: 209 SYSINSRNDLMEKPSIHDESSASFALTSSKRRRSLSSKSRVSNPLKKLGEGNVPGIPAAD 268
            + +   N  +      DE+  +  + SS+ R +                   P  P   
Sbjct: 188 KFIVKDCNKHLSFSG--DEARTNSTIDSSRVRPT------------------PPPFPVEK 247

Query: 269 GSRMMESKTSRGKAFSKSATPNRDRRRRSYLNFHC--------------DDDNVSPNK-- 328
              M   +  RG  + +          R  L+ +C                 ++ P K  
Sbjct: 248 YELMDRVEVFRGSVWRQGLV-------RGVLDHNCYMVCLVVTKEEPVVKHSDLRPCKVW 307

Query: 329 -------FESPKGGKKLRTKEDVDGSDELKEQTLSLINGSRGNTLKRSQRTLVTDKEGKE 388
                   E+P    K +      G+  +  +  +  +  R   L++S  TL   +    
Sbjct: 308 EDGQTPVIETPSNVMKTKPMRSCSGAKSMTPKRTTK-HARRSLNLEKSAETLTKAESRAA 367

Query: 389 DYDVLETIPKEVTTSNK----SERNKHLASDEKQTP--VNNSLHLPDAVGDGEENSNNQT 448
             ++      +V   N     + + K +AS E  TP  V  +  L     D +  S   +
Sbjct: 368 TGELRSKRANDVINDNTPLVITPQVKPIASVEPVTPSRVRTATPLKQTKADTQGKS---S 427

Query: 449 KEKGMEPEQQE-ATENSDKRKRGRPRKIMQEPGQQQASKNSYKRKRGRPRKLMIVPTTAE 508
            +K +EP + E   ENS +      +K+++E       KNS K+ R R R+        E
Sbjct: 428 PKKTLEPMRDENGLENSTR------QKVLEE-------KNSEKKGRKRKRQ--------E 487

Query: 509 DMEQDGSGWKPEKATVKSCVTDLNRRNGKELSANKTNGTGTNSVDDDDRPLLMWLGGMQG 568
           +   D       K T +SC       NG+    N T+       D DD+PL  W+     
Sbjct: 488 EHNSD------LKETDESC-------NGQMAEINDTSSICN---DVDDQPLAAWINLPTD 547

Query: 569 SASNNSLKLGQTYGSKRRTKGSEQVDAVNKVRTVDGTPENEVVKNQGWPFVKNSPVWSAI 628
           +       +   Y           V+       V+    N+ +     PF K SP W   
Sbjct: 548 NFFFFFFFIVLIYPETSIDHSPIVVNNAAIATDVEERQANDTL--MILPFAKKSPFWKMY 607

Query: 629 DSLGVFKQIPQKPHFHPLSTYKEECREGLAIGCMVTFASLVEKITKLQFDYPRYIFESTL 688
           ++  V K  PQ PHF PL   KEE RE  A+G MV+F  L+E++  LQ D       S  
Sbjct: 608 ETQEVCKIAPQSPHFSPLFEAKEELREWTAVGMMVSFYGLLEEVKNLQLDVSPSTLGSLS 667

Query: 689 ASLYDLEQHGFNISMLCNRVNELLFIKDTEMRCIEETKVAENKILEHTKNKTKLAEESKA 748
            S  +LE+HGF+++   +R+N++L ++D   +  EE K  E KI           EE   
Sbjct: 668 CSFAELEKHGFDVAAPQSRINKMLSLQDERAKKAEERKGLEKKIEAGEIEGHTYEEEMAE 708

Query: 749 IEQKIIELQRRQSSIKLEIETNDNEIEALQSHVETIRECTMSTKLHFENQIALP 768
           +E KI+EL+R+Q   K   E  D     ++S+ E I +     +L F++  + P
Sbjct: 728 LELKILELKRQQVVAKEMKEATDKVTSGMKSYAEMINQEIEDLRLEFQSTASAP 708

BLAST of HG10023039 vs. TAIR 10
Match: AT2G47230.1 (DOMAIN OF UNKNOWN FUNCTION 724 6 )

HSP 1 Score: 173.3 bits (438), Expect = 7.4e-43
Identity = 202/782 (25.83%), Positives = 320/782 (40.92%), Query Frame = 0

Query: 29  GSEIEVSIDEEGFKGALFKATILKLPTTFSPSKKKKALVEYKTLVTEDGSTPLKEHVDAL 88
           GSE+EVS  EEGF  A F+  + + PT    S +KK  V Y TL+ +D  +PL E+++  
Sbjct: 8   GSEVEVSSTEEGFADAWFRGILQENPT---KSGRKKLRVRYLTLLNDDALSPLIENIEPR 67

Query: 89  SLRPLPPDTANKD--FEECDIVDAADKDGWWTGVVCKALEGGSYSVLFKNPMHVMDFQRN 148
            +RP+PP+        EE  +VDA  KDGWWTGV+ K LE G + V + +P  +++F+RN
Sbjct: 68  FIRPVPPENEYNGIVLEEGTVVDADHKDGWWTGVIIKKLENGKFWVYYDSPPDIIEFERN 127

Query: 149 HLRLHQDWVDGKWVVP--QKMDASILRD-QLSIISEDANVPENVQHESLKNTETNIEKKN 208
            LR H  W   KW+ P  Q++D S+     ++ +S   +  E     ++   E  ++ + 
Sbjct: 128 QLRPHLRWSGWKWLRPDIQELDKSMFSSGTMAEVSTIVDKAEVAWFPAMIIKEIEVDGEK 187

Query: 209 SYSINSRNDLMEKPSIHDESSASFALTSSKRRRSLSSKSRVSNPLKKLGEGNVPGIPAAD 268
            + +   N  +      DE+  +  + SS+ R +                   P  P   
Sbjct: 188 KFIVKDCNKHLSFSG--DEARTNSTIDSSRVRPT------------------PPPFPVEK 247

Query: 269 GSRMMESKTSRGKAFSKSATPNRDRRRRSYLNFHC--------------DDDNVSPNK-- 328
              M   +  RG  + +          R  L+ +C                 ++ P K  
Sbjct: 248 YELMDRVEVFRGSVWRQGLV-------RGVLDHNCYMVCLVVTKEEPVVKHSDLRPCKVW 307

Query: 329 --------------FESPKGGKKLRTKEDVDGSDELKEQTLSLINGSRGNTLKRSQRTLV 388
                          E+P    K +      G+  +  +  +  +  R   L++S  TL 
Sbjct: 308 EDGVWQDGPKQTPVIETPSNVMKTKPMRSCSGAKSMTPKRTTK-HARRSLNLEKSAETLT 367

Query: 389 TDKEGKEDYDVLETIPKEVTTSNK----SERNKHLASDEKQTP--VNNSLHLPDAVGDGE 448
             +      ++      +V   N     + + K +AS E  TP  V  +  L     D +
Sbjct: 368 KAESRAATGELRSKRANDVINDNTPLVITPQVKPIASVEPVTPSRVRTATPLKQTKADTQ 427

Query: 449 ENSNNQTKEKGMEPEQQE-ATENSDKRKRGRPRKIMQEPGQQQASKNSYKRKRGRPRKLM 508
             S   + +K +EP + E   ENS +      +K+++E       KNS K+ R R R+  
Sbjct: 428 GKS---SPKKTLEPMRDENGLENSTR------QKVLEE-------KNSEKKGRKRKRQ-- 487

Query: 509 IVPTTAEDMEQDGSGWKPEKATVKSCVTDLNRRNGKELSANKTNGTGTNSVDDDDRPLLM 568
                 E+   D       K T +SC       NG+    N T+       D DD+PL  
Sbjct: 488 ------EEHNSD------LKETDESC-------NGQMAEINDTSSICN---DVDDQPLAA 547

Query: 569 WLGGMQGSASNNSLKLGQTYGSKRRTKGSEQVDAVNKVRTVDGTPENEVVKN-QGWPFVK 628
           W+                       T        VN         E +        PF K
Sbjct: 548 WI------------------NLPTETSIDHSPIVVNNAAIATDVEERQANDTLMILPFAK 607

Query: 629 NSPVWSAIDSLGVFKQIPQKPHFHPLSTYKEECREGLAIGCMVTFASLVEKITKLQFDYP 688
            SP W   ++  V K  PQ PHF PL   KEE RE  A+G MV+F  L+E++  LQ D  
Sbjct: 608 KSPFWKMYETQEVCKIAPQSPHFSPLFEAKEELREWTAVGMMVSFYGLLEEVKNLQLDVS 667

Query: 689 RYIFESTLASLYDLEQHGFNISMLCNRVNELLFIKDTEMRCIEETKVAENKILEHTKNKT 748
                S   S  +LE+HGF+++   +R+N++L ++D   +  EE K  E KI        
Sbjct: 668 PSTLGSLSCSFAELEKHGFDVAAPQSRINKMLSLQDERAKKAEERKGLEKKIEAGEIEGH 700

Query: 749 KLAEESKAIEQKIIELQRRQSSIKLEIETNDNEIEALQSHVETIRECTMSTKLHFENQIA 768
              EE   +E KI+EL+R+Q   K   E  D     ++S+ E I +     +L F++  +
Sbjct: 728 TYEEEMAELELKILELKRQQVVAKEMKEATDKVTSGMKSYAEMINQEIEDLRLEFQSTAS 700

BLAST of HG10023039 vs. TAIR 10
Match: AT3G62300.2 (DOMAIN OF UNKNOWN FUNCTION 724 7 )

HSP 1 Score: 154.5 bits (389), Expect = 3.6e-37
Identity = 215/805 (26.71%), Positives = 320/805 (39.75%), Query Frame = 0

Query: 20  RHHQLPFTVGSEIEVSIDE-EGFKGALFKATILKLPTTFSPSKKKKALVEY-KTLVTEDG 79
           R  +L  + GSEIE+S  E E   G ++   IL+     + SK+KK  V +   L+  D 
Sbjct: 7   RKEKLSVSKGSEIEISSQEYEYGSGNVWYCVILE--ENLAKSKRKKLSVRHLDPLLKYDY 66

Query: 80  STPLKEHVDALSLRPLPPDT--ANKDFEECDIVDAADKDGWWTGVVCKALEGGSYSVLFK 139
           S PL +      +RP+PP       DFEE D+VDAA K GW +G V K L    + V  +
Sbjct: 67  SPPLIKTTVHRFMRPVPPPDPFPEVDFEEGDVVDAAYKGGWCSGSVVKVLGNRRFLVYLR 126

Query: 140 NPMHVMDFQRNHLRLHQDWVDGKWVVPQK-------------MDASILRDQLS------- 199
               V++  R  LR H  W D +W   +K             ++     D+L        
Sbjct: 127 FQPDVIELLRKDLRPHFVWKDEEWFRCEKQQLIESDFSAGKSVEVRTKVDKLGDVWAPAM 186

Query: 200 IISEDANVPENVQHESLKNTETNIEKKN-SYSINSRNDLMEKPSIHDESSASFALTSSKR 259
           +I ED +    V+ ++LK  E N  K + SYS                      +  S  
Sbjct: 187 VIKEDEDGTMLVKLKTLKEEEVNCTKISVSYS---------------------EIRPSPL 246

Query: 260 RRSLSSKSRVSNPLKKLGEGNVPGIPAADGSRMMESKTSRGKAFSKSATPNRDRRRRSYL 319
              L     + N    +  G  PG+          SK   GK ++    PNR+ +  S L
Sbjct: 247 PIGLRDYKLMENVDALVESGWCPGV---------VSKVLAGKRYAVDLGPNRESKEFSRL 306

Query: 320 NFH---------------CDDDNVSPNKFESPKGGKKLR------TKEDVDGSDELKEQT 379
                                   S +  E      ++R       KE       +  +T
Sbjct: 307 QLRPSIEWKDGIWHRKEKVSGSEESSHAVEETAASTRIRITVRTALKEKKALGTGINVRT 366

Query: 380 LSLINGSRGNTLKRSQRTLVTDKEGKEDYDVLETIPKEVTTSNKSERNKHLAS--DEKQT 439
               +G+  N L  S       + G+    V ET   E       E    LA     +  
Sbjct: 367 TRSSSGAMHNPLPASFNGGDVAEAGRVSVTVNETPLFETAAQLSGELGNSLADVVMNESA 426

Query: 440 PVNNSLHLPDAVGDGEENSNNQTKEKGMEPEQQEATENSDKRKRGRPRKIMQEPGQQQAS 499
           PV +    P+     E + +           Q + T          P+K +Q    Q++S
Sbjct: 427 PVTSQ---PEIAAPKEFHPSVVLGVAAAVKTQGKTT----------PKKKLQAMKNQKSS 486

Query: 500 KNS--------YKRKRGRPRKLMIVPTTAEDMEQDG-SGWKPEKATVKSCVTDLNRRNGK 559
            N          KRKRG+PRK ++    AE  ++ G SG   + AT++            
Sbjct: 487 TNDSVGEKVSVNKRKRGQPRKFIV----AEPKQKIGVSGNNSKAATIEHA---------- 546

Query: 560 ELSANKTNGTGTNSVDDDDRPLLMWLGGMQGSASNNSLKLGQTYGSKRRTKGSEQVDAVN 619
                         + DDDRPL  W   +    S++   + +T      T   + VD   
Sbjct: 547 -------------DMTDDDRPLASW---VHTGNSSSGQSVSRTPDIGLNTVVEKHVDI-- 606

Query: 620 KVRTVDGTPENEVVKNQGWPFVKNSPVWSAIDSLGVFKQIPQKPHFHPLSTYKEECREGL 679
            V T  G     V+     PFVK S +W  ++S+ VFK +PQ PHF PL   +EECREG 
Sbjct: 607 -VETPPGRESTMVL-----PFVKKSQLWKVLESMEVFKVVPQSPHFSPLLESEEECREGD 666

Query: 680 AIGCMVTFASLVEKITKLQFDYPRYIFESTLASLYDLEQHGFNISMLCNRVNELLFIKDT 739
           AIG MV F+SL+EK+  LQ D P             LE+HGFN++   +R+ ++L IK+ 
Sbjct: 667 AIGRMVMFSSLLEKVNNLQVDDPISSINRIDECFLKLEKHGFNVTTPRSRIAKILSIKER 721

Query: 740 EMRCIEETKVAENKILEHTKNKTKLAEESKAIEQKIIELQRRQSSIKLEIETNDNEIEAL 768
           +   +EE K  E KI E+   + K        E+ I+ELQR++  +K    T DNEI  +
Sbjct: 727 QTCALEELKAVEEKITENDNKRRK-------YEEDIVELQRQEVLMKEAKVTLDNEIARM 721

BLAST of HG10023039 vs. TAIR 10
Match: AT3G62300.1 (DOMAIN OF UNKNOWN FUNCTION 724 7 )

HSP 1 Score: 152.9 bits (385), Expect = 1.0e-36
Identity = 214/805 (26.58%), Positives = 332/805 (41.24%), Query Frame = 0

Query: 20  RHHQLPFTVGSEIEVSIDE-EGFKGALFKATILKLPTTFSPSKKKKALVEY-KTLVTEDG 79
           R  +L  + GSEIE+S  E E   G ++   IL+     + SK+KK  V +   L+  D 
Sbjct: 7   RKEKLSVSKGSEIEISSQEYEYGSGNVWYCVILE--ENLAKSKRKKLSVRHLDPLLKYDY 66

Query: 80  STPLKEHVDALSLRPLPPDT--ANKDFEECDIVDAADKDGWWTGVVCKALEGGSYSVLFK 139
           S PL +      +RP+PP       DFEE D+VDAA K GW +G V K L    + V  +
Sbjct: 67  SPPLIKTTVHRFMRPVPPPDPFPEVDFEEGDVVDAAYKGGWCSGSVVKVLGNRRFLVYLR 126

Query: 140 NPMHVMDFQRNHLRLHQDWVDGKWVVPQK-------------MDASILRDQLS------- 199
               V++  R  LR H  W D +W   +K             ++     D+L        
Sbjct: 127 FQPDVIELLRKDLRPHFVWKDEEWFRCEKQQLIESDFSAGKSVEVRTKVDKLGDVWAPAM 186

Query: 200 IISEDANVPENVQHESLKNTETNIEKKN-SYSINSRNDLMEKPSIHDESSASFALTSSKR 259
           +I ED +    V+ ++LK  E N  K + SYS                      +  S  
Sbjct: 187 VIKEDEDGTMLVKLKTLKEEEVNCTKISVSYS---------------------EIRPSPL 246

Query: 260 RRSLSSKSRVSNPLKKLGEGNVPGIPAADGSRMMESKTSRGKAFSKSATPNRDRRRRSYL 319
              L     + N    +  G  PG+          SK   GK ++    PNR+ +  S L
Sbjct: 247 PIGLRDYKLMENVDALVESGWCPGV---------VSKVLAGKRYAVDLGPNRESKEFSRL 306

Query: 320 NFHCDDDNVSPNKFESPKGGKKLRTKEDVDGSDELKEQTLSLINGSRGNTLKRSQRTLVT 379
                         E   G      KE V GS+E           +R   ++ + RT + 
Sbjct: 307 QLR--------PSIEWKDG--IWHRKEKVSGSEESSHAVEETAASTR---IRITVRTALK 366

Query: 380 DKEG-----------KEDYDVLETIPKEVTTSNKSERNKHLASDEKQTPVNNSLHLPDAV 439
           +K+                 +   +P      + +E  + ++    +TP+  +  L   +
Sbjct: 367 EKKALGTGINVRTTRSSSGAMHNPLPASFNGGDVAEAGR-VSVTVNETPLFETAALSGEL 426

Query: 440 GDG-EENSNNQTKEKGMEPEQQEATE---------NSDKRKRGR--PRKIMQEPGQQQAS 499
           G+   +   N++     +PE     E          +  + +G+  P+K +Q    Q++S
Sbjct: 427 GNSLADVVMNESAPVTSQPEIAAPKEFHPSVVLGVAAAVKTQGKTTPKKKLQAMKNQKSS 486

Query: 500 KNS--------YKRKRGRPRKLMIVPTTAEDMEQDG-SGWKPEKATVKSCVTDLNRRNGK 559
            N          KRKRG+PRK ++    AE  ++ G SG   + AT++            
Sbjct: 487 TNDSVGEKVSVNKRKRGQPRKFIV----AEPKQKIGVSGNNSKAATIEHA---------- 546

Query: 560 ELSANKTNGTGTNSVDDDDRPLLMWLGGMQGSASNNSLKLGQTYGSKRRTKGSEQVDAVN 619
                         + DDDRPL  W   +    S++   + +T      T   + VD   
Sbjct: 547 -------------DMTDDDRPLASW---VHTGNSSSGQSVSRTPDIGLNTVVEKHVDI-- 606

Query: 620 KVRTVDGTPENEVVKNQGWPFVKNSPVWSAIDSLGVFKQIPQKPHFHPLSTYKEECREGL 679
            V T  G     V+     PFVK S +W  ++S+ VFK +PQ PHF PL   +EECREG 
Sbjct: 607 -VETPPGRESTMVL-----PFVKKSQLWKVLESMEVFKVVPQSPHFSPLLESEEECREGD 666

Query: 680 AIGCMVTFASLVEKITKLQFDYPRYIFESTLASLYDLEQHGFNISMLCNRVNELLFIKDT 739
           AIG MV F+SL+EK+  LQ D P             LE+HGFN++   +R+ ++L IK+ 
Sbjct: 667 AIGRMVMFSSLLEKVNNLQVDDPISSINRIDECFLKLEKHGFNVTTPRSRIAKILSIKER 720

Query: 740 EMRCIEETKVAENKILEHTKNKTKLAEESKAIEQKIIELQRRQSSIKLEIETNDNEIEAL 768
           +   +EE K  E KI E+   + K        E+ I+ELQR++  +K    T DNEI  +
Sbjct: 727 QTCALEELKAVEEKITENDNKRRK-------YEEDIVELQRQEVLMKEAKVTLDNEIARM 720

The following BLAST results are available for this feature:
Match Name      E-value  Identity  Description
XP_038899475.1  0.0e+00  87.20     DUF724 domain-containing protein 3-like isoform X2 [Benincasa hispida]
XP_038899474.1  0.0e+00  84.91     DUF724 domain-containing protein 3-like isoform X1 [Benincasa hispida]
XP_038899477.1  0.0e+00  86.17     DUF724 domain-containing protein 3-like isoform X4 [Benincasa hispida]
XP_038899476.1  0.0e+00  82.92     DUF724 domain-containing protein 7-like isoform X3 [Benincasa hispida]
XP_038899478.1  0.0e+00  82.75     DUF724 domain-containing protein 3-like isoform X5 [Benincasa hispida]

Match Name      E-value  Identity  Description
Q9FZD9          2.9e-44  27.13     DUF724 domain-containing protein 3 OS=Arabidopsis thaliana OX=3702 GN=DUF3 PE=1 ...
O22897          1.0e-41  25.83     DUF724 domain-containing protein 6 OS=Arabidopsis thaliana OX=3702 GN=DUF6 PE=2 ...
Q8H0V4          1.5e-35  26.58     DUF724 domain-containing protein 7 OS=Arabidopsis thaliana OX=3702 GN=DUF7 PE=1 ...
Q9ZVT1          1.2e-32  25.33     DUF724 domain-containing protein 1 OS=Arabidopsis thaliana OX=3702 GN=DUF1 PE=2 ...
F4I8W1          4.3e-27  25.34     DUF724 domain-containing protein 2 OS=Arabidopsis thaliana OX=3702 GN=DUF2 PE=2 ...

Match Name      E-value  Identity  Description
A0A1S3BSM1      0.0e+00  83.89     uncharacterized protein LOC103492739 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5D3D3Y2      0.0e+00  83.89     DUF724 domain-containing protein 7-like isoform X3 OS=Cucumis melo var. makuwa O...
A0A1S3BSB2      0.0e+00  81.68     uncharacterized protein LOC103492739 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5A7TTF2      0.0e+00  81.68     DUF724 domain-containing protein 7-like isoform X3 OS=Cucumis melo var. makuwa O...
A0A0A0K5U9      0.0e+00  83.25     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G412860 PE=4 SV=1

Match Name      E-value  Identity  Description
AT1G26540.1     2.1e-45  27.13     Agenet domain-containing protein
AT2G47230.2     2.6e-43  26.10     DOMAIN OF UNKNOWN FUNCTION 724 6
AT2G47230.1     7.4e-43  25.83     DOMAIN OF UNKNOWN FUNCTION 724 6
AT3G62300.2     3.6e-37  26.71     DOMAIN OF UNKNOWN FUNCTION 724 7
AT3G62300.1     1.0e-36  26.58     DOMAIN OF UNKNOWN FUNCTION 724 7

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                     Source       Source Term     Source Description                        Alignment
None       No IPR available                    COILS        Coil            Coil                                      coord: 704..724
None       No IPR available                    MOBIDB_LITE  mobidb-lite     disorder_prediction                       coord: 1..24
None       No IPR available                    MOBIDB_LITE  mobidb-lite     disorder_prediction                       coord: 217..328
None       No IPR available                    MOBIDB_LITE  mobidb-lite     disorder_prediction                       coord: 364..387
None       No IPR available                    MOBIDB_LITE  mobidb-lite     disorder_prediction                       coord: 364..466
None       No IPR available                    MOBIDB_LITE  mobidb-lite     disorder_prediction                       coord: 410..441
None       No IPR available                    MOBIDB_LITE  mobidb-lite     disorder_prediction                       coord: 223..246
None       No IPR available                    MOBIDB_LITE  mobidb-lite     disorder_prediction                       coord: 281..328
None       No IPR available                    PANTHER      PTHR31917       AGENET DOMAIN-CONTAINING PROTEIN-RELATED  coord: 578..752
None       No IPR available                    PANTHER      PTHR31917:SF81  BINDING PROTEIN, PUTATIVE-RELATED         coord: 578..752
None       No IPR available                    PANTHER      PTHR31917:SF81  BINDING PROTEIN, PUTATIVE-RELATED         coord: 3..575
None       No IPR available                    PANTHER      PTHR31917       AGENET DOMAIN-CONTAINING PROTEIN-RELATED  coord: 3..575
IPR014002  Agenet domain, plant type           SMART        SM00743         agenet_At_2                               coord: 24..98, e-value: 4.7E-8, score: 42.8
IPR014002  Agenet domain, plant type           SMART        SM00743         agenet_At_2                               coord: 100..156, e-value: 7.6E-8, score: 42.1
IPR017956  AT hook, DNA-binding motif          SMART        SM00384         AT_hook_2                                 coord: 456..469, e-value: 19.0, score: 8.0
IPR017956  AT hook, DNA-binding motif          SMART        SM00384         AT_hook_2                                 coord: 432..444, e-value: 0.21, score: 19.0
IPR007930  Protein of unknown function DUF724  PFAM         PF05266         DUF724                                    coord: 582..764, e-value: 3.1E-57, score: 193.5
IPR008395  Agenet-like domain                  PFAM         PF05641         Agenet                                    coord: 29..95, e-value: 4.7E-16, score: 59.0
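
The `coord: a..b` entries in the InterPro annotations give the span of each predicted feature on the protein. As a rough sketch, the spans and their lengths can be pulled out of the text as shown here, assuming the coordinates are 1-based and inclusive (the page does not state this); `domain_spans` is a hypothetical helper name.

```python
import re

# Matches entries such as "coord: 24..98" in the InterPro annotation text.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def domain_spans(text):
    """Return (start, end, length) for each 'coord: a..b' entry found in text.

    Length assumes 1-based inclusive coordinates, which is an assumption
    and not something stated on the page.
    """
    return [(int(a), int(b), int(b) - int(a) + 1) for a, b in COORD.findall(text)]
```

Under that assumption, the Pfam DUF724 hit at `coord: 582..764` spans 183 residues.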

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10023039.1  HG10023039.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003677      DNA binding