HG10022099 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10022099
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionFAD/NAD(P)-binding oxidoreductase family protein
LocationChr05: 20813560 .. 20825870 (-)
RNA-Seq ExpressionHG10022099
SyntenyHG10022099
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGTCAAACGAGAGCTTTAACCTCCATTGTTGCAGTCCAAAAGGTGATTTTTTTTGGACTCCCATCATCAATTTCTTTGTTGGTTTACTCAGAAACGGAACATATTAGAGTTATTTAGTTTGTGTTTTGTGATGGATAGTTGAATGAAGAACTGTTGGTAGTGGTAGGAGGTGGAGCAGCAGGTGTTTATGGAGCTATTAGAGCTAAAACCCTCGCCCCCAATCTCAATGTCATGGTTATTGAGAAAGGAAGACCCCTTTCCAAGGTTCTGCACTATCCCCCATTCTCGGCTTCACGTTGTTATATATATATATATATATATATATATATATATAGATGTATATATATACACTTTAATCCTTTATTTACTAATGGGTAGGTCAAAATTTCTGGAGGAGGCCGATGCAATGTGACGAATGGGCATTGTATCGATGCAAAGGTGAGGGGTTTTCATCAATTAATTCTTACTTTATCTTGGATTGATAATCAAGAACATGGAATGCATATTATCTGTGTGATAATTGTACTTCTTCTAGAGTTTGGCAGAGCATTACCCTAGAGGCCATAAAGAATTTAGGGGCCCTTTCTTCAATGTTCACGGTCCAATGGATACAATGTCCTGGTTTTCCAATCATGGAGTTGAACTGAAGGTTGGTTCAAAGCTGTTGATCTCTTGCTTCCTTATTGCCATATGTGAATAATTTGTGTTTAAATTTGGAATAGGTTGAGGATGATGGAAGGGTTTTTCCTGTCAGCAATTGTTCTGCTTCTATAGTCGATTGTCTGATTTCTGAAGCAAAACGCACTGGAGGTTTGTTAATTGATGCTCTGTTCTCGGTGTTTGGTATTTGGTGGTTCGACTAATTTCCACTTTCCTTTTCTGGAAACTCTAGTGCAGCCAAGTAGTTTGTGCAAATATATCTATGCTGAGTTTTGAATTATTTGATCAGTTTCCTTGCAGACTGGAAAGGTTGTTACAAGTGCATCGATTAGTAGTGGCGTGAAGTTCGCTTTGAAGATTCAAAAGCTTATGAATTGTTTTGAACACGTTGAAGCAAACTATTTACTGATTGCTAGTGGGAGTAGTCGGCAGGTTGCAATTGCTGTCTGGTTTTTATGCTTGATTGTGTTACGATTGACTTGATCCCTTAAACTTGTTACTTTTATCAAGGGCTTTAGTCTAGCTGCTCAGCTTGGACATTCACTTATAGACCCAGTGCCAAGCCTATTTACTTTCAAGATTGAAGATCCTCAATTGGCAGAGTTGTCTGGGGTAAGCTCCAATCTCCTTGTATCTCTATAGTGATTAGTTCCTTTTTGTACCACTTAGCATCAGTTTGAAATGACTTTTTAAGTACATAAAAAGTGTTTACTTGAAAACATTTTATAAACACTAGAAAATCATTCTAAACATGTCCTAAGTTCAATGGATGGGAAAAAATTATGAGACCCCTAATAGTGAAGACGTATGTGAGGAAGGGCAAAAAGGTAATTGACAGGGGGCAGATAGTTAGTGGTGCGTGATAGTTGTTTTCTCGATATGGGGACCATGGGAGTGAGAGTATGGAAAGAAGCAAGAGGTGGCAGTGTGGGGTGTGAAGAATTTTAGGCAGAGGCTGAGTGTGGAGAGAACTCTAGCTGAGACTACTATCCTTTCTTTGTTTTCTTCTTCTTCTATTTTTCGTTGTGGAAACTTGTTTTCTCTGGTTTTGCTGTGTTACTAAGACTTTGAAGTTATTATTATCAATATATTGTGTCACAAAGATTGTTTTGCTGTGTGACTTGGCTTCGAAACCTAACAAAATCCTTCATTTTATGATTTTTGGTGCACTCTCTTTCATCACACTGCATGTCTCCTTTTGTCAGAGGCAAAATGCCTTCGAGATGTGCTATAGGGATTGCAACTAGTGGCATCAAACCTCACTATCTCATGTTGAACTTCATGCTTTTTCACCATTAGTTTATTTAGTTGGACATTGTTGATTGGTTAATTATTTAGACCCCATTTGTCACCCTAATCATCATAAATTAGCACGTCCAAACATTTGCTAGAGTATGCACTGATGGACGGCCAAATGGGGCCAAGCAGACCTATTAAGAGATTCTTAGGGGCTTTTTCATTGTGCACATGCACGATTGCAGGGCAGTGGCCACGTGTGGGCGCATGGTTGCTCATCCTTATCCTCTTTTTGATGTCTTTGTTGGGTTTTGACAAAATATTAGAAGAGTGGTTGTCCTATATCGGAAAGATTCAAGGAAGTGATGGGCTCCAAGCACTATAAAAGGAGTTTATGCCTTTAGTCTCAAATCGTCCCATTCAGAAATACTTTAAGCTTGGGTGTGCTAATTCTATTGTATTCTCTAGTTTAAGAGTGGGATTAAAAGGTGTAAGTAATATAAAAGCTTTGAGACAGTGCTCTTGTAATTGGGATCGGGGAGAAAATTATTCTGTGTGATTATAACAAATTTTCACAGTGGATTCCTTTCTGGTGGGTTGTAGTTTTTCTCCAGATTGGAGTTTTTCACGTAAATTTTTGTGTTTCCTAGTTTTGTGATTTTCTTTTAATTTTTCAATTATTTTTCTGCTTATTCTTGTCGTAGTAGTGGGAGGTTTTTTCAACAAGTGGTATTAGAGCTTTAGGTTTCTAGTTTCTTGAATCTCTGAGTATGTTCTGTGGTTGCATAATTGTCTGAACTTCCACATAAAAAAAGAATTTTTGAAACAGTCTACGGTAGTGTCATTAGTGTGGGTTGTTCACTGTTTGCATATTTGTTAGGAGTGAAGCAGATATGTCAAGTTTTGTGTGTCCAATAAAGTTTGATGTGGAGAAATTTGATGGAAGGATCAACTTCTGCATGTTGCAAGTACAAGTCAAGGACGTTCTGATTCAGTCTGGGTTGCCCAAGGCCTTGAAGGGAAGACCAAGTTGTGATACTTCTGATGAGTTAATCGGTGATGGTGGTCCAGGAGACTCCAATAAAGGTTCTAAGAAGAATAACATAGGCGATGAAGATTGGGAGGAAATGGATTTAATAGCTGCAAGTGCAATCAGGCTAAATTTGGCTAAAATGTTCTTGCAAATATGCGTGGATTTTTTACAGCAAAAAAGCTTCGGGAGAAGTTTGAAGCGATGTGTCAAGCAAAGAGTATCTCAAATCAATTTTACCTGAAGGAGAAGTTTCACACCTTGCAGATGGTTGAAGGTACGAAAATCTCAGATCATTTAGGTATTCTGAATGACATTGTTTTTGAGCTGAAGGTGATCGAAGTGAAGATAAAGCACCTAGACTTATCTTGTCACTTCCTCTTCTTATGAACACATGGAGCCAATCTTGATGTACAAGAAGGAGACTTTAGATTTTATCGAGGCTACTAGTGAACTTTTGTTAGAGGGAAGGAGACTGAAGAGTGAAGGGCGTACTTCACAAGAGATTCAATACTGAAAGCTAACAATTGGAAGAAGAAGAAAGAGTTTGTGCAGAAAAAGGGTTTCTGGAAATGTGGATAATCTAGACACATGGAAAAGGATTGTCCTAACTGAGCAGGTTTGCCAAAAGGCTTTGAGTCAGATGCTAACGTCTCCATCATCATGGAAGATTGTGATTTGTTCTTTAGAAGAAAGACAGTATTCATCCTCGTGGTATGTTCGCTTCACCATGATAGAGGATGTAATGGTAGCGGTTCCATAAGTTTGCACACAGACATTGACTTGGTGGTTATGCAAGGTGTGTGGTGGAAGTAGTGTCGATGGCTAATGAACTTCTGAATGAGCTAAGTAAGTGGAAGTTGCACAATAAAGTTTATCAACATGTGACAAAAAAGGAAAATCTTGGAAGGTGGTATTTCAAGTGGTTTACTCTTTATGGCGGAGTGAGTTATAATTCTCTAAGTTGGGAATTATGATTTTCAGTGAAGACAATGGTTGCCTTAGAAGTTGTGGATTCTTCAAATATCTTGCCAAGGTGGAATTTGATGGATTTTGGCACAATATTAAGAGCAGTTGTCCCACATTGAAAAGGCTCAAGCAAGTGATGGGCTTCAAGCACTGTAAAAGGAGCCTATGCTTTTAGTTTTAAATCATCTCAATTTGAAGCAGTTTAAGCCTAATATACTAACTCAGTTGTGGAGTGTGTAAGGACTGCTCCAAGAGAGTCACAATATATTACTATGATCGCCAAAATATAAGTTGCACACCTTTCGGGAGGGTGGAGCTCTCCCAAGTTCTAGTCTTACACCAATAGATACAAACTAACCATAACCACGAAATACAACAACGCGCACTATAAATATAACTAATTGTACAACTAACTAAGGGGTGAATTTCCATTTTTTACCATTCTAACATAAGTATAAATAAGTGGGGGCCTAACAATACCCCTCAGGATGAAAGATTCCTTGTCCTCAAGGGAGAAAGACAAGAATTGTTCTCAGATCAAGTTGATACTTTCCTAGGCCGTTTCATGTTCAGGCAAACCTTTCCACTGGATTATCAGCTCCCACTCCTTGATAGTGTCATTAAATCGATAAGCAAATACCTCTTCAAGAGCAACTTTCCATTCAAAGTTAGCGGTCAGCAGCAGGGGTTGGGGTTCAATAGGAGTAGTGTTGCCCAATGCTTGCTTCAACTATGATACATGGAAGACTGGATGAATGGACGATTCTACAGGCAATGCTAACTTGTAGGCAAGTTTCCCTATATGCTCTAGGATTTGGAAGGGGGATGTCTTTAATGTTGCGTGATACATGGTGTTGTGCCAATATTCAACCCAACTCAACCATTGAGCCCATTGCTTAGGTTTCTTGCTGCAAGAGCAACGAAAATACGTTTCCAGACTCTTGTTCACAACTTCTGTTTGACCATCCGTTTGGGGATAGTAGGTTGTACTCAAGGTTCGAAATATCGATGTCGATGGAAATATCGAGGTCCCGATTTTACGGAAATATCAATATCGATGGAAATTTTGGGAAACTTTATGGAAATTGTTATAATTAGTTAATGAAACTATGATTGTAACTGAGTTAGACTATAATTTACCATTTTAATCATTATTTCTATAAAGTAAGACAATATTTAGATGTATATTGTAAAATATTTGTGTAAATGTCAATGCGAGAGCAAAAATTTAAAAAAGAAATGGAAAAATAATTACAAAATTAAAGAACATGAAATATTTGGACATAAAATAATTGTAAATTTCTATTATAAAATGAAATGGATATAAAAATGAAAAATTATAGCATTAATCTAACATAACACAATAAGGATAGAAAAATCATAACATTAAAATAAAAGGAATGTAAAAAAACATAACATTAAACATAATACATAATGGAAAAGAAAGAAGAAAAAGAAAGCTCTCGAAGTAGATAGAGCTTGGGAGAAAATGAGAGGAATATTAAAGAACTCACATCAATTCAAGTTTGAGGTGAAATAATGTGAAAATTCAATGGAAAAAATTGAATGAAAATATGTTGTTTCTTGATAGTAAAAGAGAAAAAAAAAACCTCGAAATTTCGAGAAATTGGATTTTTTTCCCCTACTGAAATTTTGAGCCTATTGAAATCTCGAAATTTCGAGATTTTGATGGAAATTTGAAACTATGGTTATACTACGCATGAGTTGCTTTCCTTGTAAGCTAAATAATTCACTCCAAAAATGGCTAATAAAATTTTTATCACAGTCAAACACAATTGATTTGGGAAATCTATGAAACTTCACAATTTCTTTCACAAATAGGTTGGCCACCGATTTGGCTATATACGGATGACTCAAAGGCAGAAAATGAACATACTTGCTAAAACGATCCACACGACAAAAATGGTGGTGTATCCCTACGAACGGGGTAATCCTTCCATGAAGTCCATTGATATGACATCCCAGATTGGGTTCGACACTGGTAGAGGCTATAACAGACCAGCCGGGGAGAGAGTGCTAGGCATCAAGGGCAGAAAATGAACATACTTGCTAAAACGATCCACATGACGAAAATGGTGTTGTATCCCTGCGAACAGAGTAATCCTTCCACGAAGTCCATCAGTATGTCATCCCAGATTCGGTCCGACACTGGTAGGGGCTATAACAAACCAGCCGGGGAGAGAGTGCTAGGCATATTCTTCTGACACACTTCACATTCCTCTGCGTACTTCACCATATTTTTCATGTCAGGCCAATATAATTCACTTGTCACTTGTCAGGCGTTTGTTAGTGTGAAGAAACCCTGAGTGGCTTCCCAGAACTAAGAGTGGACTTGACCGAAAGGACAAGTTTGTCCTTGTACAGTAGATTCCTTTGGAGAATTGAGAACTTTGGATGATTCTACAGATTTGATTCTAACTTTTCACGAATCTTACAGAGGTACTCGTCTGCCTCTAGTTCCGATCGAACAACTTCCAGATCGAGAATGGAAGGGGCGTCAATAGGGGTGATCATCGGTCAGTCGACGTTGGTTTTGGGCTCAAACCGGCATCGACTACCGACTAGTCGATTTCCAGCGGTCTCAAAGGATGTTGCCACACCGGCCGACCGACCACATTACCAGTTGGTCAGTTTTTCCACAATTTTGATAAGGGAATGAAAAATATGTAAGAAATGAGAAGAAAAAAGGTAATAAGAAAGAAAAAAGAAAAATGGGGGAAGAAGAAGAAGAAAACAAACCAGAAAAAGATGAAGAAAAAAGGAAAGAAGAAGAAAAACAATCAAACAAGAATTGAAAGAAAAAATAAAGAAGGAAAAAAAGAAAACAAATGACGAAGAACAAAGAGAAGAAAAAAACGTAGATAAAGGAAAGTTTAGAAAAATAAAAAAGTTTTTAAGTTTTAAAATTATTAAATAACATAAAACAAAATAAAAATAATATATATATATATTTAAATGAAACTCACCTTGGGAAGAGATCAATTGTCTTCCCAAGGTTTGCGTTGCATTGGGACGCCAATCACCATGGGGAGGGTTTTTTAGATCACTTTGGGAAGATATAAGTTTTTAATATAATATATATATATTTTAATATTGTTAACAATGCTAGTTGGTTTTGGTCAGAATTTAGTTTTTTACTTGACCGGCTGGCCGACTGGTCAATTTTTTAGATAACGAAATTGACCGCCAACTGCCGTCAGTGCAGTTGGGTTTCGATAATTTGACTCAGTTTTTTGGATTCTTTTGCTCACCACTAGCTTGACTTGCTAGGTGCACTCATGGGGGTATGCGAGAAAATTCATCAACGACCTTATTCTGCAGTCTCGACTTGTAATGTATCAGCAACTTTGATGCCCAATTTTGGTACTCATGCTGGATCGTCTTCTATTCAATAATATGTCTGATGGATTGTTGATCGGTACAAACAATAAAACACCTACCCAACATATATGACCTCCACCTTTGAACTTCCATCACTATAGCCATCAATTCCCTTTCATAAGCTGATTTGGCTAAAGCTCGAGTGGACAAAATATTGTTGAAATATGGGATGGGTCGCTGGTTTTGGGTTAGGACTGCCCCTACCCTCGTTCCTGAGGCATCCGTTTCTACGATGAAACAGATGGAAAAATCAAGTAACGCCAACATTGGAAGGGTCATCATCACACGCTTCAGCGTTTCAAAGGCCTCAATGAACTCTTCAGTCCATAAGAATACATTGGTCCTTGTCATCCTGTGCAGGGGTTAAGCTATTTGCCCATAACCGACCACAAACTGGCGATAATAGCTTGTCAAACATAGAAAGCTTTTCAATTCTCTCACATTCTCCAATTGACACCATTCAAGAATCGCACAAACCTTTTCGGGATCTACCTCTACCCTTTCTACTGAGACCCAGTGCCCTAAATACTTAAGTCAATCTTGGGCAAAGTGACACTTCTTGAAACTAGCAAACAAAGAATTTTCTCGTACCATGTCGAAAACTGAAGTCAGGTGGGACATATGTGTTCCCATATCTAAAACTATAAACCAAAATGTCATAAAAAAATACCAACATAGAACGGCCTAGAAACGGACGAAAAACATTGTTCATAAGGGATTGAAACGTGCCAGGAGCATTCTTTAACCTGAGCGACATGACGACAAACTCATAGTGGCCATCGTGGGTACAGAAGGCCATTTTGGGAATGTTAGTTTCATGGACCTGGATTTGACGGTACCTAGACCAAAGATCAAGTTTCGAAAATACTCTTGCCCATGCAACTCATCCAACAACTCTTTGATTACGGGGATAAAAAAACTTATCAAATACAGTCAGCTCGTTCAACGCAAAGTAGTTAACATAGAAACACCAGCTATCATTTTCTTCTTAACCAACAAAACAGGACTCGCATATGGTCCATGGCTTGATCTGATGATACTCGCCTTCAATATCTTCCCCACTAACCTCTCTATTTCATCCTTTTGCACAATTGAAAATCTATAGGGTCGTAGATTAACTAGCTCACGATCATCTTTCAAGTTAATTCATTGGTCAGACTCGCGATGAGGGGGTAACCGATCAGATATTCGAAAAATATCAGAATAATGGGACAAGATTTCATGTACCTGCAGGTGAATACTAACGATCGGGGCTGCTAGTGGCAGTTTATCGTCAAACTTAGTGGTGATCGTTCGCATCTCAACCATAAAACCTTGATCAGCTTTATTCCACGATTTCTGTATACTCTTGAGCGTCAGTTCCCTTCGCATGAGACTGGGGTCCCCTTTTATAACAATTCACTCATTCCCAGTCCCAATCGATAAGTCAAACTCTTCCAGTCGACCTTTGTCACCCCCAATGTTCGTAACCATTGCATGCCTAGAACAATCGATTCCACCCAACTCTAGTGGTAAAAAATCTTGCTTGACGGTGAGTCCCAATAGCGCAACGACCACATTTTTGTAGATACCTTTACCGCAGCTGTACCTGTTCCTACTATCACTCCGTAGTTCGTTGTATCAGTCACTGAAAGCTTCAAGCTTTCCACGACTCGCGATGTGATGAAGTTGTGGGTAGCCCCACAATCAACTAAGATGATAGCATCTTCCTTTCCAATCCTCCCCATCGCTTAATCGTACTAGGGTTTGATATTCCAACAACAGTATTCAAGGAAAGCTCAGCAACTTCTCCCATGACTTCACCCGTTGTTTCAGTTTCACTATTGATAACCATCGCGTCTCCCCACACCTCGTCATCTGCTACTAGTAGTCTAAGTTCTTTGCATTTACATTTATGGCCCATTGAGTATTTTTTATCACATTGGTAACACAAGCCTTTCTCTCTCCTTGCTTGCAACTCAGCGTCCGAAAGCCTCTTGCAGGGGACTTCCTTTTTTGTTGTAGGCTGCTGACTGGTTAACGTGATGGATGGGTCAGGAACAACTTCGGTTAATTTCAATGGGCATTTGGTAGGCGACCCAAAGTTAGTGTTTAAAGGTTGATAAGGCTTACTGGCCTTGGATTAGCCCAGACAGTTATTTATAACCACAGTCTTATCCTCAACCCGTTGGACTACCTTCATGATTTGACTTAGGCCCACAGACTTGTAACAGATGACCTCTACCCAAATTTCTGGGTCTAACCCATTCAGAAATGTACTTTTCAACATTTCATCGATAAGGTGAGGTAAAGAGGCTAACAAAGCCTCAAATCGCTCCCTATAATACACAACAGTGGTCGACTGTTTGATAGCTAAAAATCGAGCACACAGTGGCCTTTCCTGCAATAGACAGAACCACTCTGTCATATGGTGTTTTAAGTCTTTCCATTTGTCAAATTGTTCCCTCCCTTCGACCCAAGGGTACGAAGGCTATGCCAGTGTAGCTAATCGACATCACCGTAACTTTGCTTCCGTTTAGGTTGACTGTGTTTTTCCTCCAGCGACGAGACTCCCAACATTCCTTTAGCCAGAATCAATATTGCTCTTTGTGTTTCTTTAGTTGCATTTGTCTGTTCCTCGAAGTTATGGAACATGAGTTCAATACTTTTCTGATTGTGCACCAAAACTTTCTTTTGATCTTCCATTTGTTGAAGGATTTTACCCATGTTTTTGGCAAGTTCTTTCTTGGCTTGATCCAACTTAGGAAGCTTTTGAACTTCTTCCCTTACCTCACTCAATGTACGTTCCAACTCATTTTGTTCTTTCAGTTGTCGTCTCACCATTTTCTTAGGTTTACTCAGAATTACCGCTCTGATACTAATGTAAGAACTCCTCCAAGAGAGTCACAATATATTACTACAATCACCAAAATACAAGTTTCACACCTTTCGAGAGGGTGGAGCTCTCTCAAGTTTTGGTATTACACCAATGGGCACAAAATAAGCATAACACAAATTACAACAACGCACATTATATATATAACTAATCGTACAACTAAGGGGTTGGATTTCTACTTTTGTCCCTCCTAACATAAGTATGAATAAGTGGGGGCCTAACAGAGTGGTATAATGCTTTGAGGGTGTTTTCTTTGTAATTTGAGTCAAGAAGAGATTTTTGTGTAATTGTAGTAATTTTTTACATTGTGGAATTGGTCTGTGGTTTTTTCTCTGATTTGGAATCTTTTCACGTTAATTATTGTGTTCCCAGTTTTATTGTTTTCTTTGAATTCCTCTGTTATTTTTCTGCTTATTCTAGTAATAGGAGATTTTTTCCCAACAATTTTGTTGTTGGTTGTGTCCTGAGGCCTCGTGATCACATCAAAGGACTTCTAAGATCATCTTGCAATGAGAACTATATGATAGCTGGCATTTTTCTCTCTATTTTTGTATGAATACTATTAGAAAATAAGTAACATCATCGTCTCCTCATATCTCAATTATATGCCAATTTCAGGTTTCATTCCCTAAGGTGAGAGCAAAGCTTAAGTTAGAAAACATACATCGGCATCTTCCACAATATACACAGGCAGGTTCTGTTTTCTTTTTTCCAGTTAGGCAAGTTCTTCTTCATGTAGAGTATTAACGCACAAAATATAACGATGCATACATTTTTTTCATATTTAGGTTGGGCCTATGCTTGTCACACATTGGGGACTTAGTGGACCGGTAATTCTTCGTTTATCGGCTTGGGGAGCTCGTGACCTATTTGCTTCAGATTATAAAGGTTTCTTAAATTCCCTACTTGTGAAAGGTTATGGTTTAAAATTTTCTCAGCTAATAATTTTTAGTGTAGGCCTGCTCATTGTGGATTTTACACCTGATTTACATTTGGAAGATGTCAAAACAATTCTTAGCCGGCACAAATCTCAGTTTATGGTATTCTCGGATCACATACTGTTCATGCAAGTTCCCAGTTTCTTCCATTTCAGTAATATCTAATTTTTGGTTTTTCATCCAGACGTCATTTTCTATTAGTATCTTATAAGTTGAATTAAAAATCACAAAAATTATGTATACTTTAAGGTGTTGAAGAAACTGCTCATTTTGATGACTCTCCTGTCTCATATATTGTATTACTTATCAATTCCTCTGTGCTCTCAACACATTTGCAGAAACAAAAAGTGCACAGTTCATGTCCTTCAGATTTTGGCCTTGTGAAAAGATTTTGGAAATATTTATTGGATCGAGAGGTTTCAACGCTATTTTCCATTATAAAATCATCACTATAGTTTTGTAGAAGTTCCCCTCGCCAATTACTTTTCTTTTTTTCATGTCCAGGAAATAAATGATGAGATTTTGTGGGCTTCCATCTCAAACAAATCATTAGCTTCCATTTCTTCTCTGTTGAAACAATGCATATTTAAGATCTTGGGGAAGGTATACATGGTAGCTTTTCCATGTAAAATGAGATATAGCTTCCTTTTCTGATTTTATAAGCTTTCTCTTCATGAAGGGTCAATTTAAGGATGAATTTGTCACTGCTGGTGGTGTTCCGCTGTCAGAGGTTCTCTTCTACCACTCTTTCAAAAGGATAATAAGAAAAGAAAACACATATAACTTGCAACTTCTTTTTTTTTTTTTTCCTCGTCTCTCCTGTCTAGATCTCACTTAAAACAATGGAGAGCAAAATTCATTCTCGCCTATTCTTTGCTGGGGAGGTAAATTTTTATGATACCTTAGAGTTATCAATACATCAAAAACATAGCAGACTTCATTGAGATTCTATCATGTTCCATCGTAGGTGCTAAATATCGATGGCGTAACGGGTGGTTTCAACTTTCAGGTAAAATGCAACGTCATGTTTAAGAACCAGGAAGAGTAAGTTTATTTATAAGATACATCCTTGTCTGTGTGTTCCACCATTTTTATTCTTTTGCAGAATGCTTGGTCCGGTGGCTACATTGCTGGAACTAGCATTGGTAAACTTGCAAATGGTGAGTCTCTAGGGAGGGATATAAGCAATTTGGCTTGA

mRNA sequence

ATGAGTCAAACGAGAGCTTTAACCTCCATTGTTGCAGTCCAAAAGTTGAATGAAGAACTGTTGGTAGTGGTAGGAGGTGGAGCAGCAGGTGTTTATGGAGCTATTAGAGCTAAAACCCTCGCCCCCAATCTCAATGTCATGGTTATTGAGAAAGGAAGACCCCTTTCCAAGGTCAAAATTTCTGGAGGAGGCCGATGCAATGTGACGAATGGGCATTGTATCGATGCAAAGAGTTTGGCAGAGCATTACCCTAGAGGCCATAAAGAATTTAGGGGCCCTTTCTTCAATGTTCACGGTCCAATGGATACAATGTCCTGGTTTTCCAATCATGGAGTTGAACTGAAGGTTGAGGATGATGGAAGGGTTTTTCCTGTCAGCAATTGTTCTGCTTCTATAGTCGATTGTCTGATTTCTGAAGCAAAACGCACTGGAGTTTCCTTGCAGACTGGAAAGGTTGTTACAAGTGCATCGATTAGTAGTGGCGTGAAGTTCGCTTTGAAGATTCAAAAGCTTATGAATTGTTTTGAACACGTTGAAGCAAACTATTTACTGATTGCTAGTGGGAGTAGTCGGCAGGGCTTTAGTCTAGCTGCTCAGCTTGGACATTCACTTATAGACCCAGTGCCAAGCCTATTTACTTTCAAGATTGAAGATCCTCAATTGGCAGAGTTGTCTGGGGTTGGGCCTATGCTTGTCACACATTGGGGACTTAGTGGACCGGTAATTCTTCGTTTATCGGCTTGGGGAGCTCGTGACCTATTTGCTTCAGATTATAAAGGTTTCTTAAATTCCCTACTTGTGAAAGGTTATGGTTTAAAATTTTCTCAGCTAATAATTTTTAGTGTAGGCCTGCTCATTGTGGATTTTACACCTGATTTACATTTGGAAGATGTCAAAACAATTCTTAGCCGGCACAAATCTCAGTTTATGGAAATAAATGATGAGATTTTGTGGGCTTCCATCTCAAACAAATCATTAGCTTCCATTTCTTCTCTGTTGAAACAATGCATATTTAAGATCTTGGGGAAGGGTCAATTTAAGGATGAATTTGTCACTGCTGGTGGTGTTCCGCTGTCAGAGATCTCACTTAAAACAATGGAGAGCAAAATTCATTCTCGCCTATTCTTTGCTGGGGAGGTGCTAAATATCGATGGCGTAACGGGTGGTTTCAACTTTCAGAATGCTTGGTCCGGTGGCTACATTGCTGGAACTAGCATTGGTAAACTTGCAAATGGTGAGTCTCTAGGGAGGGATATAAGCAATTTGGCTTGA

Coding sequence (CDS)

ATGAGTCAAACGAGAGCTTTAACCTCCATTGTTGCAGTCCAAAAGTTGAATGAAGAACTGTTGGTAGTGGTAGGAGGTGGAGCAGCAGGTGTTTATGGAGCTATTAGAGCTAAAACCCTCGCCCCCAATCTCAATGTCATGGTTATTGAGAAAGGAAGACCCCTTTCCAAGGTCAAAATTTCTGGAGGAGGCCGATGCAATGTGACGAATGGGCATTGTATCGATGCAAAGAGTTTGGCAGAGCATTACCCTAGAGGCCATAAAGAATTTAGGGGCCCTTTCTTCAATGTTCACGGTCCAATGGATACAATGTCCTGGTTTTCCAATCATGGAGTTGAACTGAAGGTTGAGGATGATGGAAGGGTTTTTCCTGTCAGCAATTGTTCTGCTTCTATAGTCGATTGTCTGATTTCTGAAGCAAAACGCACTGGAGTTTCCTTGCAGACTGGAAAGGTTGTTACAAGTGCATCGATTAGTAGTGGCGTGAAGTTCGCTTTGAAGATTCAAAAGCTTATGAATTGTTTTGAACACGTTGAAGCAAACTATTTACTGATTGCTAGTGGGAGTAGTCGGCAGGGCTTTAGTCTAGCTGCTCAGCTTGGACATTCACTTATAGACCCAGTGCCAAGCCTATTTACTTTCAAGATTGAAGATCCTCAATTGGCAGAGTTGTCTGGGGTTGGGCCTATGCTTGTCACACATTGGGGACTTAGTGGACCGGTAATTCTTCGTTTATCGGCTTGGGGAGCTCGTGACCTATTTGCTTCAGATTATAAAGGTTTCTTAAATTCCCTACTTGTGAAAGGTTATGGTTTAAAATTTTCTCAGCTAATAATTTTTAGTGTAGGCCTGCTCATTGTGGATTTTACACCTGATTTACATTTGGAAGATGTCAAAACAATTCTTAGCCGGCACAAATCTCAGTTTATGGAAATAAATGATGAGATTTTGTGGGCTTCCATCTCAAACAAATCATTAGCTTCCATTTCTTCTCTGTTGAAACAATGCATATTTAAGATCTTGGGGAAGGGTCAATTTAAGGATGAATTTGTCACTGCTGGTGGTGTTCCGCTGTCAGAGATCTCACTTAAAACAATGGAGAGCAAAATTCATTCTCGCCTATTCTTTGCTGGGGAGGTGCTAAATATCGATGGCGTAACGGGTGGTTTCAACTTTCAGAATGCTTGGTCCGGTGGCTACATTGCTGGAACTAGCATTGGTAAACTTGCAAATGGTGAGTCTCTAGGGAGGGATATAAGCAATTTGGCTTGA

Protein sequence

MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSGVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQLIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFMEINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
Homology
BLAST of HG10022099 vs. NCBI nr
Match: XP_038889404.1 (uncharacterized protein YtfP isoform X2 [Benincasa hispida])

HSP 1 Score: 722.2 bits (1863), Expect = 2.5e-204
Identity = 381/473 (80.55%), Postives = 392/473 (82.88%), Query Frame = 0

Query: 1   MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKI 60
           M+ T+ALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNV+VIEKGRPLSKVKI
Sbjct: 1   MNSTKALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVIVIEKGRPLSKVKI 60

Query: 61  SGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG 120
           SGGGRCNVTNGHC DAKSLAEHYPRG+KEFRGPFFNVHGPMDTMSWFSNHGVELK+E+DG
Sbjct: 61  SGGGRCNVTNGHCTDAKSLAEHYPRGYKEFRGPFFNVHGPMDTMSWFSNHGVELKIEEDG 120

Query: 121 RVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEA 180
           RVFPVSNCSASIVDCL+SE+KRTGVSLQTGKVVTSAS+SSG KFALKIQKLMNC EH+EA
Sbjct: 121 RVFPVSNCSASIVDCLMSESKRTGVSLQTGKVVTSASVSSGGKFALKIQKLMNCVEHIEA 180

Query: 181 NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG-------------- 240
           NYLLIASGSSRQGFSLAAQ GHSLIDPVPSLFTFKIEDPQLAELSG              
Sbjct: 181 NYLLIASGSSRQGFSLAAQFGHSLIDPVPSLFTFKIEDPQLAELSGVSFPKVRAKLKLEN 240

Query: 241 ----------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ 300
                     VGPMLVTHWGLSGPVILRLSAWGARDLF SDYK                 
Sbjct: 241 IQRHHPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFTSDYK----------------- 300

Query: 301 LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM-------------------------- 360
                 GLLIVDFTPDLHLEDVKTILSRHKSQFM                          
Sbjct: 301 ------GLLIVDFTPDLHLEDVKTILSRHKSQFMKQKVHSSCPSDFGLVKRFWKYLLDRE 360

Query: 361 EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKI 420
           EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKI
Sbjct: 361 EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKI 420

Query: 421 HSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA 424
            SRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANGE LGRDISNLA
Sbjct: 421 QSRLFFAGEVLNVDGVTGGFNFQNAWSGGYIAGTSIGKLANGEFLGRDISNLA 450

BLAST of HG10022099 vs. NCBI nr
Match: KAG7016151.1 (ytfP [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 718.8 bits (1854), Expect = 2.8e-203
Identity = 389/500 (77.80%), Postives = 401/500 (80.20%), Query Frame = 0

Query: 1   MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKI 60
           M+ T+A+TSIVAVQKLNEELLVVVGGGAAGVYGA+RAKTLAPNLNVMVIEKGRPLSKVKI
Sbjct: 1   MNLTKAVTSIVAVQKLNEELLVVVGGGAAGVYGAVRAKTLAPNLNVMVIEKGRPLSKVKI 60

Query: 61  SGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG 120
           SGGGRCNVTNGH  DAKSLAEHYPRGHKEFRGPFFNVHGP+DTMSWFSNHGV+LKVEDDG
Sbjct: 61  SGGGRCNVTNGHFTDAKSLAEHYPRGHKEFRGPFFNVHGPIDTMSWFSNHGVQLKVEDDG 120

Query: 121 RVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEA 180
           RVFPV+N SASIVDCL+SEAKRTGVSLQTGKVVTSASISSG KFALKIQKLMN  EHVEA
Sbjct: 121 RVFPVTNSSASIVDCLMSEAKRTGVSLQTGKVVTSASISSGGKFALKIQKLMNSVEHVEA 180

Query: 181 NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSGVGPMLVTHWGLSGP 240
           NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSGVGPMLVTHWGLSGP
Sbjct: 181 NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSGVGPMLVTHWGLSGP 240

Query: 241 VILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQLIIFSVGLLIVDFTPDLHLEDVKT 300
           VILRLSAWGARDLFASDYKGFLNSL          +LI+F VGLLIVDF PD HLEDVKT
Sbjct: 241 VILRLSAWGARDLFASDYKGFLNSL----------KLILFYVGLLIVDFAPDWHLEDVKT 300

Query: 301 ILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLK 360
           ILSRHKSQFM                          EINDEILWASISNKSLASISSLLK
Sbjct: 301 ILSRHKSQFMKQKVHSSCPSEFGLVKRFWKYLLDREEINDEILWASISNKSLASISSLLK 360

Query: 361 QCIFKILG---------------------------KGQFKDEFVTAGGVPLSE------- 420
           QCIFK+LG                           KGQFKDEFVTAGGV LSE       
Sbjct: 361 QCIFKVLGKVHVLAFPCSKMVYSSSFTDFISSLFMKGQFKDEFVTAGGVQLSEVLFYSSL 420

Query: 421 -----------------ISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAG 424
                            ISLKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAG
Sbjct: 421 KNRTRKGNTSNSQLFLFISLKTMESKIHSRLFFAGEVLNVDGVTGGFNFQNAWSGGYIAG 480

BLAST of HG10022099 vs. NCBI nr
Match: XP_004148683.1 (uncharacterized protein LOC101210627 isoform X2 [Cucumis sativus] >KGN52437.1 hypothetical protein Csa_008489 [Cucumis sativus])

HSP 1 Score: 718.0 bits (1852), Expect = 4.8e-203
Identity = 379/473 (80.13%), Postives = 392/473 (82.88%), Query Frame = 0

Query: 1   MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKI 60
           M+ T+ALTS VA QKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNV+VIEKGRPLSKVKI
Sbjct: 1   MNLTKALTSFVAAQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVVVIEKGRPLSKVKI 60

Query: 61  SGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG 120
           SGGGRCNVTNGH  DAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG
Sbjct: 61  SGGGRCNVTNGHYTDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG 120

Query: 121 RVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEA 180
           RVFPVSNCS+S+VDCL+SEAKRTGVSLQTGKVV SASIS+G KFALKIQKL+NCFEHVEA
Sbjct: 121 RVFPVSNCSSSVVDCLMSEAKRTGVSLQTGKVVASASISTGGKFALKIQKLINCFEHVEA 180

Query: 181 NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG-------------- 240
           NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG              
Sbjct: 181 NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSGVSFPKVRAKLKLEN 240

Query: 241 ----------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ 300
                     VGPMLVTHWGLSGPVILRLSAWGARDLFASDYK                 
Sbjct: 241 IQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYK----------------- 300

Query: 301 LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM-------------------------- 360
                 GLLIVDFTPDLHLE+VKTIL+RHKSQFM                          
Sbjct: 301 ------GLLIVDFTPDLHLEEVKTILTRHKSQFMKQKVHSSCPSEFGLVKRFWKYLLDRE 360

Query: 361 EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKI 420
           EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKI
Sbjct: 361 EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKI 420

Query: 421 HSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA 424
           HSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIG+LANGE LGRDI+NLA
Sbjct: 421 HSRLFFAGEVLNVDGVTGGFNFQNAWSGGYIAGTSIGRLANGEFLGRDITNLA 450

BLAST of HG10022099 vs. NCBI nr
Match: XP_022993604.1 (uncharacterized protein LOC111489549 isoform X1 [Cucurbita maxima] >XP_022993605.1 uncharacterized protein LOC111489549 isoform X1 [Cucurbita maxima])

HSP 1 Score: 713.8 bits (1841), Expect = 9.0e-202
Identity = 380/473 (80.34%), Postives = 390/473 (82.45%), Query Frame = 0

Query: 1   MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKI 60
           M+ T+A+TSIVAVQKLNEE+LVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKI
Sbjct: 1   MNLTKAVTSIVAVQKLNEEVLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKI 60

Query: 61  SGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG 120
           SGGGRCNVTNGH  DAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGV+LKVEDDG
Sbjct: 61  SGGGRCNVTNGHFTDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVQLKVEDDG 120

Query: 121 RVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEA 180
           RVFPVSN SASIVDCL+SEAKRTGVSLQTGKVVTSASISSG KFALKIQKLMN  EHVEA
Sbjct: 121 RVFPVSNSSASIVDCLMSEAKRTGVSLQTGKVVTSASISSGGKFALKIQKLMNSVEHVEA 180

Query: 181 NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG-------------- 240
           NYLLIASGSSRQGFSLAAQLGHSL+DPVPSLFTFKIEDPQLAELSG              
Sbjct: 181 NYLLIASGSSRQGFSLAAQLGHSLVDPVPSLFTFKIEDPQLAELSGVSFPKVRAKLKLEN 240

Query: 241 ----------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ 300
                     VGPMLVTHWGLSGPVILRLSAWGARDLFASDYK                 
Sbjct: 241 IQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYK----------------- 300

Query: 301 LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM-------------------------- 360
                 GLLIVDF PDLHLEDVKTILSRHKSQFM                          
Sbjct: 301 ------GLLIVDFAPDLHLEDVKTILSRHKSQFMKQKVHSSCPSEFGLVKRFWKYLLDRE 360

Query: 361 EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKI 420
           EINDEILWASISNKSLASISSLLKQCIFK+LGKGQFKDEFVTAGGV LSEISLKTMESKI
Sbjct: 361 EINDEILWASISNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVQLSEISLKTMESKI 420

Query: 421 HSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA 424
           HSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANGE LGRD+SNLA
Sbjct: 421 HSRLFFAGEVLNVDGVTGGFNFQNAWSGGYIAGTSIGKLANGEFLGRDVSNLA 450

BLAST of HG10022099 vs. NCBI nr
Match: XP_023549941.1 (uncharacterized protein LOC111808280 [Cucurbita pepo subsp. pepo] >XP_023549943.1 uncharacterized protein LOC111808280 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 712.2 bits (1837), Expect = 2.6e-201
Identity = 379/473 (80.13%), Postives = 389/473 (82.24%), Query Frame = 0

Query: 1   MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKI 60
           M+ T+A+TSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKI
Sbjct: 1   MNLTKAVTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKI 60

Query: 61  SGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG 120
           SGGGRCNVTNGH  DAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGV+LKVEDDG
Sbjct: 61  SGGGRCNVTNGHFTDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVQLKVEDDG 120

Query: 121 RVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEA 180
           RVFPVSN SASI+DCL++EAKRTGVSLQTGKVVTSASISSG KFALKIQKLMN  EHVEA
Sbjct: 121 RVFPVSNSSASIIDCLMAEAKRTGVSLQTGKVVTSASISSGGKFALKIQKLMNSVEHVEA 180

Query: 181 NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG-------------- 240
           NYLLIASGSSRQGFSLAAQLGHSL+DPVPSLFTFKIEDPQLAELSG              
Sbjct: 181 NYLLIASGSSRQGFSLAAQLGHSLVDPVPSLFTFKIEDPQLAELSGVSFPKVRAKLKLEN 240

Query: 241 ----------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ 300
                     VGPMLVTHWGLSGPVILRLSAWGARDLF SDYK                 
Sbjct: 241 IQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFTSDYK----------------- 300

Query: 301 LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM-------------------------- 360
                 GLLIVDF PDLHLEDVKTILSRHKSQFM                          
Sbjct: 301 ------GLLIVDFAPDLHLEDVKTILSRHKSQFMKQKVHSSCPSEFGLVKRFWKYLLDRE 360

Query: 361 EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKI 420
           EINDEILWASISNKSLASISSLLKQCIFK+LGKGQFKDEFVTAGGV LSEISLKTMESKI
Sbjct: 361 EINDEILWASISNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVQLSEISLKTMESKI 420

Query: 421 HSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA 424
           HSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANGE LGRDISNLA
Sbjct: 421 HSRLFFAGEVLNVDGVTGGFNFQNAWSGGYIAGTSIGKLANGEFLGRDISNLA 450

BLAST of HG10022099 vs. ExPASy Swiss-Prot
Match: Q795R8 (Uncharacterized protein YtfP OS=Bacillus subtilis (strain 168) OX=224308 GN=ytfP PE=4 SV=2)

HSP 1 Score: 144.1 bits (362), Expect = 3.7e-33
Identity = 123/427 (28.81%), Postives = 198/427 (46.37%), Query Frame = 0

Query: 21  LVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLS-KVKISGGGRCNVTNGHCIDAKSL 80
           ++V+GGG +G+  AI A        V++I+KG  L  K+ ISGGGRCNVTN   +  + +
Sbjct: 6   VIVIGGGPSGLMAAIAAG--EQGAGVLLIDKGNKLGRKLAISGGGRCNVTNR--LPVEEI 65

Query: 81  AEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISE 140
            +H P G+  F    F+     D + +F N G++LK ED GR+FPV++ + S+VD L++ 
Sbjct: 66  IKHIP-GNGRFLYSAFSEFNNEDIIKFFENLGIQLKEEDHGRMFPVTDKAQSVVDALLNR 125

Query: 141 AKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIA--------SGSSR 200
            K+  V+++T + + S     G    +    + N  E + +  ++IA        +GS+ 
Sbjct: 126 LKQLRVTIRTNEKIKSVLYEDGQAAGI----VTNNGEMIHSQAVIIAVGGKSVPHTGSTG 185

Query: 201 QGFSLAAQLGHSLIDPVP---------------SLFTFKIEDPQLAELSGVG-------- 260
            G+  A   GH++ +  P               +L    + D  ++ L+  G        
Sbjct: 186 DGYEWAEAAGHTITELFPTEVPVTSGEPFIKQKTLQGLSLRDVAVSVLNKKGKPIITHKM 245

Query: 261 PMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQLIIFSVGLLIVD 320
            ML TH+GLSGP ILR S +  ++L           L         ++  +F      + 
Sbjct: 246 DMLFTHFGLSGPAILRCSQFVVKELKKQPQVPIRIDLYP-----DINEETLFQKMYKELK 305

Query: 321 FTPDLHLEDV--KTILSRHKSQFMEINDEILWASISNKSLASISSLLKQC-IFKILGKG- 380
             P   +++V    +  R+    +E N      S S          ++ C  F +L  G 
Sbjct: 306 EAPKKTIKNVLKPWMQERYLLFLLEKNGISPNVSFSELPKDPFRQFVRDCKQFTVLANGT 365

Query: 381 -QFKDEFVTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAG 411
                 FVT GGV + EI  K M SK    L+F GE+L+I G TGG+N  +A   G +AG
Sbjct: 366 LSLDKAFVTGGGVSVKEIDPKKMASKKMEGLYFCGEILDIHGYTGGYNITSALVTGRLAG 418

BLAST of HG10022099 vs. ExPASy Swiss-Prot
Match: B0NAQ4 (3-dehydro-bile acid delta(4,6)-reductase OS=Clostridium scindens (strain ATCC 35704 / DSM 5676 / VPI 13733 / 19) OX=411468 GN=baiN PE=1 SV=1)

HSP 1 Score: 119.8 bits (299), Expect = 7.5e-26
Identity = 108/433 (24.94%), Postives = 188/433 (43.42%), Query Frame = 0

Query: 23  VVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPL-SKVKISGGGRCNVTNGHCIDAKSLAE 82
           ++GGGA+G+  AI A     +  V ++E+   +  K+  +G GRCN+TN   +DA     
Sbjct: 6   IIGGGASGIVAAIAAARSDGDAQVFILEQKENIGKKILATGNGRCNLTN-EAMDASC--- 65

Query: 83  HYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAK 142
            Y     EF        G  +T+ +F++ G+  K    G ++P S+ +AS+++ L  E +
Sbjct: 66  -YHGEDPEFARNVLKQFGYGETLEFFASLGLFTK-SRGGYIYPRSDQAASVLELLEMELR 125

Query: 143 RTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIAS--------GSSRQG 202
           R  V + TG  V +  +S+   F ++        +   A+ +++A         GS   G
Sbjct: 126 RQKVKIYTGVRVEALKLSA-KGFVIRADG-----QRFPADRVILACGGKASKSLGSDGSG 185

Query: 203 FSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSGV-------------------GPMLVTH 262
           ++LA  +GH+L   VP+L   K++    A+ +GV                   G M +T 
Sbjct: 186 YALARSMGHTLSPVVPALVQLKVKKHPFAKAAGVRTDAKVAALLGRQVLAEDTGEMQITA 245

Query: 263 WGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQLIIFSVGLLIVDFTPDLH 322
           +G+SG  + ++S   A+ L+             +G  +K           + VDF P++ 
Sbjct: 246 YGISGIPVFQISRHIAKGLY-------------EGKEMK-----------VRVDFLPEME 305

Query: 323 LEDVKTILSRHKSQFMEINDEILWASISNKSL---------------------ASISSLL 382
              V+   + H  +      +     I  K L                     A    L+
Sbjct: 306 ASQVRKAFNTHLDKCPYATCQEFLTGIFPKKLIPRLLELSHIRQNFPASELKPAQWEDLI 365

Query: 383 KQC---IFKILGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGF 404
           + C   +  I     F +  V AGGV   E+   T+ES+    L+  GE+L+++G+ GG+
Sbjct: 366 RACKQTLLTIEDTNGFDNAQVCAGGVRTGEVYPDTLESRYADGLYLTGELLDVEGICGGY 402

BLAST of HG10022099 vs. ExPASy Swiss-Prot
Match: P44941 (Uncharacterized protein HI_0933 OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) OX=71421 GN=HI_0933 PE=1 SV=1)

HSP 1 Score: 106.7 bits (265), Expect = 6.6e-22
Identity = 114/420 (27.14%), Postives = 183/420 (43.57%), Query Frame = 0

Query: 22  VVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLS-KVKISGGGRCNVTNGHCIDAKSLA 81
           +++G GAAG++ A +   L    +V V + G+ +  K+ +SGGG CN TN     A  L+
Sbjct: 8   IIIGAGAAGLFCAAQLAKLGK--SVTVFDNGKKIGRKILMSGGGFCNFTNLEVTPAHYLS 67

Query: 82  EHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEA 141
           ++ P   K     + N     D +S  +  G+    ++ G++F     +  IV+ L SE 
Sbjct: 68  QN-PHFVKSALARYTN----WDFISLVAEQGITYHEKELGQLF-CDEGAEQIVEMLKSEC 127

Query: 142 KRTGVSLQTGKVVTSA---SISSGVKFALKIQKLM-NCFEHVEA--NYLLIASGSSRQGF 201
            + G  +     V+          V+F L++      C   + A     +   G++  G+
Sbjct: 128 DKYGAKILLRSEVSQVERIQNDEKVRFVLQVNSTQWQCKNLIVATGGLSMPGLGATPFGY 187

Query: 202 SLAAQLGHSLIDPVPSL--FTFKIEDPQLAELSGV---------------GPMLVTHWGL 261
            +A Q G  +I P  SL  FT++  D  L  LSG+                 +L TH G+
Sbjct: 188 QIAEQFGIPVIPPRASLVPFTYRETDKFLTALSGISLPVTITALCGKSFYNQLLFTHRGI 247

Query: 262 SGPVILRLS-AWGARDLFASDYKGFLNSLLVKGYGLKFSQLIIFSVGLLIVDFTPDLHLE 321
           SGP +L++S  W   +                   ++   L   +V   I         +
Sbjct: 248 SGPAVLQISNYWQPTE------------------SVEIDLLPNHNVEEEINQAKQSSPKQ 307

Query: 322 DVKTILSR-HKSQFME-------INDEILWASISNKSLASISSLLKQCIFKILGKGQFKD 381
            +KTIL R    + +E       + DE++ A+IS   + ++   +    F   G   ++ 
Sbjct: 308 MLKTILVRLLPKKLVELWIEQGIVQDEVI-ANISKVRVKNLVDFIHHWEFTPNGTEGYRT 367

Query: 382 EFVTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGK 409
             VT GGV    IS KTMES   S L+F GEVL++ G  GG+NFQ AWS  Y    SI +
Sbjct: 368 AEVTMGGVDTKVISSKTMESNQVSGLYFIGEVLDVTGWLGGYNFQWAWSSAYACALSISR 400

BLAST of HG10022099 vs. ExPASy Swiss-Prot
Match: P37631 (Uncharacterized protein YhiN OS=Escherichia coli (strain K12) OX=83333 GN=yhiN PE=4 SV=3)

HSP 1 Score: 100.5 bits (249), Expect = 4.7e-20
Identity = 108/408 (26.47%), Postives = 181/408 (44.36%), Query Frame = 0

Query: 22  VVVGGGAAGVYGAIRAKTLAPNLNVMVIEKG-RPLSKVKISGGGRCNVTNGHCIDAKSLA 81
           +++G GAAG++ +  A        V++I+ G +P  K+ +SGGGRCN TN +      L+
Sbjct: 7   IIIGAGAAGMFCSALAG--QAGRRVLLIDNGKKPGRKILMSGGGRCNFTNLYVEPGAYLS 66

Query: 82  EHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEA 141
           ++ P   K     F       D +   + HG+    +  G++F   + +  IVD L+ E 
Sbjct: 67  QN-PHFCKSALARFTQ----WDFIDLVNKHGIAWHEKTLGQLF-CDDSAQQIVDMLVDEC 126

Query: 142 KRTGVSLQTGKVVTSASISSGVKFALKIQKL-MNCFEHVEA--NYLLIASGSSRQGFSLA 201
           ++  V+ +    V S +      F L +  + + C + V A     +   G+S  G+ +A
Sbjct: 127 EKGNVTFRLRSEVLSVA-KDETGFTLDLNGMTVGCEKLVIATGGLSMPGLGASPFGYKIA 186

Query: 202 AQLGHSLIDPVPSLFTFKIEDPQLAE---LSGVG---------------PMLVTHWGLSG 261
            Q G +++     L  F +  P L E   L+GV                 +L TH GLSG
Sbjct: 187 EQFGLNVLPTRAGLVPFTLHKPLLEELQVLAGVAVPSVITAENGTVFRENLLFTHRGLSG 246

Query: 262 PVILRLSAWGARDLFAS-------DYKGFLNSLLVKGYGLKFSQLIIFSVGLLIVDFTPD 321
           P +L++S++     F S       D + FLN                          T  
Sbjct: 247 PAVLQISSYWQPGEFVSINLLPDVDLETFLNEQRNAHPNQSLKN-------------TLA 306

Query: 322 LHLEDVKTILSRHKSQFMEINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVT 381
           +HL   K ++ R   Q  +I D  L   ++ +   ++ S L     +  G   ++   VT
Sbjct: 307 VHLP--KRLVER-LQQLGQIPDVSL-KQLNVRDQQALISTLTDWRVQPNGTEGYRTAEVT 366

Query: 382 AGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGY 401
            GGV  +E+S +TME++    L+F GEV+++ G  GG+NFQ AWS  +
Sbjct: 367 LGGVDTNELSSRTMEARKVPGLYFIGEVMDVTGWLGGYNFQWAWSSAW 388

BLAST of HG10022099 vs. ExPASy TrEMBL
Match: A0A0A0KVG6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G634330 PE=4 SV=1)

HSP 1 Score: 718.0 bits (1852), Expect = 2.3e-203
Identity = 379/473 (80.13%), Postives = 392/473 (82.88%), Query Frame = 0

Query: 1   MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKI 60
           M+ T+ALTS VA QKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNV+VIEKGRPLSKVKI
Sbjct: 1   MNLTKALTSFVAAQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVVVIEKGRPLSKVKI 60

Query: 61  SGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG 120
           SGGGRCNVTNGH  DAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG
Sbjct: 61  SGGGRCNVTNGHYTDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG 120

Query: 121 RVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEA 180
           RVFPVSNCS+S+VDCL+SEAKRTGVSLQTGKVV SASIS+G KFALKIQKL+NCFEHVEA
Sbjct: 121 RVFPVSNCSSSVVDCLMSEAKRTGVSLQTGKVVASASISTGGKFALKIQKLINCFEHVEA 180

Query: 181 NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG-------------- 240
           NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG              
Sbjct: 181 NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSGVSFPKVRAKLKLEN 240

Query: 241 ----------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ 300
                     VGPMLVTHWGLSGPVILRLSAWGARDLFASDYK                 
Sbjct: 241 IQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYK----------------- 300

Query: 301 LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM-------------------------- 360
                 GLLIVDFTPDLHLE+VKTIL+RHKSQFM                          
Sbjct: 301 ------GLLIVDFTPDLHLEEVKTILTRHKSQFMKQKVHSSCPSEFGLVKRFWKYLLDRE 360

Query: 361 EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKI 420
           EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKI
Sbjct: 361 EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKI 420

Query: 421 HSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA 424
           HSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIG+LANGE LGRDI+NLA
Sbjct: 421 HSRLFFAGEVLNVDGVTGGFNFQNAWSGGYIAGTSIGRLANGEFLGRDITNLA 450

BLAST of HG10022099 vs. ExPASy TrEMBL
Match: A0A6J1K0M0 (uncharacterized protein LOC111489549 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489549 PE=4 SV=1)

HSP 1 Score: 713.8 bits (1841), Expect = 4.4e-202
Identity = 380/473 (80.34%), Postives = 390/473 (82.45%), Query Frame = 0

Query: 1   MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKI 60
           M+ T+A+TSIVAVQKLNEE+LVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKI
Sbjct: 1   MNLTKAVTSIVAVQKLNEEVLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKI 60

Query: 61  SGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG 120
           SGGGRCNVTNGH  DAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGV+LKVEDDG
Sbjct: 61  SGGGRCNVTNGHFTDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVQLKVEDDG 120

Query: 121 RVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEA 180
           RVFPVSN SASIVDCL+SEAKRTGVSLQTGKVVTSASISSG KFALKIQKLMN  EHVEA
Sbjct: 121 RVFPVSNSSASIVDCLMSEAKRTGVSLQTGKVVTSASISSGGKFALKIQKLMNSVEHVEA 180

Query: 181 NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG-------------- 240
           NYLLIASGSSRQGFSLAAQLGHSL+DPVPSLFTFKIEDPQLAELSG              
Sbjct: 181 NYLLIASGSSRQGFSLAAQLGHSLVDPVPSLFTFKIEDPQLAELSGVSFPKVRAKLKLEN 240

Query: 241 ----------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ 300
                     VGPMLVTHWGLSGPVILRLSAWGARDLFASDYK                 
Sbjct: 241 IQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYK----------------- 300

Query: 301 LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM-------------------------- 360
                 GLLIVDF PDLHLEDVKTILSRHKSQFM                          
Sbjct: 301 ------GLLIVDFAPDLHLEDVKTILSRHKSQFMKQKVHSSCPSEFGLVKRFWKYLLDRE 360

Query: 361 EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKI 420
           EINDEILWASISNKSLASISSLLKQCIFK+LGKGQFKDEFVTAGGV LSEISLKTMESKI
Sbjct: 361 EINDEILWASISNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVQLSEISLKTMESKI 420

Query: 421 HSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA 424
           HSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANGE LGRD+SNLA
Sbjct: 421 HSRLFFAGEVLNVDGVTGGFNFQNAWSGGYIAGTSIGKLANGEFLGRDVSNLA 450

BLAST of HG10022099 vs. ExPASy TrEMBL
Match: A0A6J1FLC1 (uncharacterized protein LOC111445022 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445022 PE=4 SV=1)

HSP 1 Score: 710.7 bits (1833), Expect = 3.7e-201
Identity = 379/473 (80.13%), Postives = 388/473 (82.03%), Query Frame = 0

Query: 1   MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKI 60
           M+ TRA+TSIV VQKLNEELLVVVGGGAAGVYGA+RAKTLAPNLNVMVIEKGRPLSKVKI
Sbjct: 1   MNLTRAVTSIVPVQKLNEELLVVVGGGAAGVYGAVRAKTLAPNLNVMVIEKGRPLSKVKI 60

Query: 61  SGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG 120
           SGGGRCNVTNGH  DAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGV+LKVEDDG
Sbjct: 61  SGGGRCNVTNGHFTDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVQLKVEDDG 120

Query: 121 RVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEA 180
           RVFPV+N SASIVDCL+SEAKRTGVSLQTGKVVTSASISSG KFALKIQKLMN  EHVEA
Sbjct: 121 RVFPVTNSSASIVDCLMSEAKRTGVSLQTGKVVTSASISSGGKFALKIQKLMNSVEHVEA 180

Query: 181 NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG-------------- 240
           NYLLIASGSSRQGFSLAAQLGHSL+DPVPSLFTFKIEDPQLAELSG              
Sbjct: 181 NYLLIASGSSRQGFSLAAQLGHSLVDPVPSLFTFKIEDPQLAELSGVSFPKVRAKLKLEN 240

Query: 241 ----------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ 300
                     VGPMLVTHWGLSGPVILRLSAWGARDLFASDYK                 
Sbjct: 241 IQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYK----------------- 300

Query: 301 LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM-------------------------- 360
                 GLLIVDF PD HLEDVKTILSRHKSQFM                          
Sbjct: 301 ------GLLIVDFAPDWHLEDVKTILSRHKSQFMKQKVHSSCPSEFGLVKRFWKYLLDRE 360

Query: 361 EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKI 420
           EINDEILWASISNKSLASISSLLKQCIFK+LGKGQFKDEFVTAGGV LSEISLKTMESKI
Sbjct: 361 EINDEILWASISNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVQLSEISLKTMESKI 420

Query: 421 HSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA 424
           HSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANGE LGRDISNLA
Sbjct: 421 HSRLFFAGEVLNVDGVTGGFNFQNAWSGGYIAGTSIGKLANGEFLGRDISNLA 450

BLAST of HG10022099 vs. ExPASy TrEMBL
Match: A0A1S3C9B6 (uncharacterized protein YtfP isoform X1 OS=Cucumis melo OX=3656 GN=LOC103498464 PE=4 SV=1)

HSP 1 Score: 709.1 bits (1829), Expect = 1.1e-200
Identity = 375/473 (79.28%), Postives = 387/473 (81.82%), Query Frame = 0

Query: 1   MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKI 60
           M+ T+ALTS VA QKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNV+VIEKGRPLSKVKI
Sbjct: 1   MNLTKALTSFVATQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVVVIEKGRPLSKVKI 60

Query: 61  SGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG 120
           SGGGRCNVTNGHC DAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG
Sbjct: 61  SGGGRCNVTNGHCTDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG 120

Query: 121 RVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEA 180
           RVFPVSNCS+S+VDCL+SEAKRTGVSLQTGKVV SASIS+G KFALKIQKL+NCFEHVEA
Sbjct: 121 RVFPVSNCSSSVVDCLMSEAKRTGVSLQTGKVVASASISTGGKFALKIQKLINCFEHVEA 180

Query: 181 NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG-------------- 240
           NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG              
Sbjct: 181 NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSGVSFPKVRAKLKLEN 240

Query: 241 ----------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ 300
                     VGPMLVTHWGLSGPVILRLSAWGARDLFASDYK                 
Sbjct: 241 IQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYK----------------- 300

Query: 301 LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM-------------------------- 360
                 GLLIVDFTPDLHLEDVK IL+RHKSQFM                          
Sbjct: 301 ------GLLIVDFTPDLHLEDVKRILTRHKSQFMKQKVHSSCPSEFGLVKRFWKYLLDRE 360

Query: 361 EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKI 420
           EINDEILWASISNKSLASIS LLKQCIFKILGKGQFKDEFVTAGGVPLSE+SLKTMESKI
Sbjct: 361 EINDEILWASISNKSLASISYLLKQCIFKILGKGQFKDEFVTAGGVPLSEVSLKTMESKI 420

Query: 421 HSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA 424
           HSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIG LANGE L  DI+N A
Sbjct: 421 HSRLFFAGEVLNVDGVTGGFNFQNAWSGGYIAGTSIGGLANGEFLRGDITNWA 450

BLAST of HG10022099 vs. ExPASy TrEMBL
Match: A0A6J1BWQ2 (uncharacterized protein LOC111006025 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111006025 PE=4 SV=1)

HSP 1 Score: 679.9 bits (1753), Expect = 7.0e-192
Identity = 364/474 (76.79%), Postives = 382/474 (80.59%), Query Frame = 0

Query: 1   MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKI 60
           M+  +ALTS VAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKG+PLSKVKI
Sbjct: 1   MNLAKALTSSVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKI 60

Query: 61  SGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG 120
           SGGGRCNVTNGH  D+KSLAEHYPRGHKEFRG FFNVHGPMDTMSWFSNHGVELK+EDDG
Sbjct: 61  SGGGRCNVTNGHSTDSKSLAEHYPRGHKEFRGSFFNVHGPMDTMSWFSNHGVELKIEDDG 120

Query: 121 RVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEA 180
           RVFPVSNCSASIVDCL+ EA R GVSLQTGKVVTSAS SSG KF LKIQK++   EHVEA
Sbjct: 121 RVFPVSNCSASIVDCLMYEATRVGVSLQTGKVVTSASTSSGGKFVLKIQKIV--VEHVEA 180

Query: 181 NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG-------------- 240
           NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG              
Sbjct: 181 NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSGVSFPKVRAKLELEN 240

Query: 241 ----------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ 300
                     VGPMLVTHWGLSGPVILRLSAWGARDLFAS+YK                 
Sbjct: 241 MQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASNYK----------------- 300

Query: 301 LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM-------------------------- 360
                 GLLIVDFTPDLHLEDVK+ILSRHKSQFM                          
Sbjct: 301 ------GLLIVDFTPDLHLEDVKSILSRHKSQFMKQKVHSSCPSDFGLVKRFWKYLLDRE 360

Query: 361 EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKI 420
           EI+DEILWAS+SNKSLAS+SSLLK+CIFK+LGKGQFKDEFVTAGGVPLSEISLKTMESKI
Sbjct: 361 EIHDEILWASVSNKSLASVSSLLKKCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKI 420

Query: 421 HSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGR-DISNLA 424
           HSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANG  LGR D+ N+A
Sbjct: 421 HSRLFFAGEVLNVDGVTGGFNFQNAWSGGYIAGTSIGKLANGGLLGRGDVGNVA 449

BLAST of HG10022099 vs. TAIR 10
Match: AT5G39940.1 (FAD/NAD(P)-binding oxidoreductase family protein )

HSP 1 Score: 496.5 bits (1277), Expect = 2.1e-140
Identity = 268/462 (58.01%), Postives = 315/462 (68.18%), Query Frame = 0

Query: 6   ALTSIV-AVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGG 65
           A+TS+    +K   ELLVVVGGGAAGVYGAIRAKTL+P+L V+VIEKG  LSKVKISGGG
Sbjct: 31  AITSLADKGEKDESELLVVVGGGAAGVYGAIRAKTLSPDLRVLVIEKGSFLSKVKISGGG 90

Query: 66  RCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDGRVFP 125
           RCNVTNGHC D  +LA HYPRGHKE +G FF  HGP DTMSWFS HGV LK EDDGRVFP
Sbjct: 91  RCNVTNGHCNDTINLAGHYPRGHKELKGSFFYTHGPADTMSWFSEHGVPLKTEDDGRVFP 150

Query: 126 VSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQK-LMNCFEHVEANYL 185
           VS+ S S+VDCL++EA   GV L+ GK V +ASI    KF +K+ K   +  E +EA YL
Sbjct: 151 VSDNSLSVVDCLLNEANIRGVRLERGKSVLAASIKPDGKFLVKVGKQSADTSESIEATYL 210

Query: 186 LIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG----------------- 245
           LIA+GSS++G SLA + GHS++DPVPSLFTFKI DP L EL+G                 
Sbjct: 211 LIATGSSQKGHSLATKFGHSIVDPVPSLFTFKINDPLLTELAGISFSKVQAKLKLDNPCP 270

Query: 246 -------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQLII 305
                  +GPMLVTHWGLSGPVILRLSAWGAR LF+S YKG                   
Sbjct: 271 DLSNLVQIGPMLVTHWGLSGPVILRLSAWGARYLFSSKYKGH------------------ 330

Query: 306 FSVGLLIVDFTPDLHLEDVKTILSRHKSQFME--------------------------IN 365
                LIVDF PD+++E  K++L  HK QF +                           +
Sbjct: 331 -----LIVDFIPDINIETAKSVLKEHKLQFSKHKVSNSYPPQFGLVNRFWRYILDREGSS 390

Query: 366 DEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR 416
            + LWAS+SN SL+SIS LLK C F++ GKGQ+KDEFVTAGGVPLSE+SLKTMESK+   
Sbjct: 391 KDTLWASLSNNSLSSISDLLKHCTFQVTGKGQYKDEFVTAGGVPLSEVSLKTMESKLVPN 450

BLAST of HG10022099 vs. TAIR 10
Match: AT5G39940.2 (FAD/NAD(P)-binding oxidoreductase family protein )

HSP 1 Score: 465.3 bits (1196), Expect = 5.2e-131
Identity = 253/440 (57.50%), Postives = 296/440 (67.27%), Query Frame = 0

Query: 6   ALTSIV-AVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGG 65
           A+TS+    +K   ELLVVVGGGAAGVYGAIRAKTL+P+L V+VIEKG  LSKVKISGGG
Sbjct: 31  AITSLADKGEKDESELLVVVGGGAAGVYGAIRAKTLSPDLRVLVIEKGSFLSKVKISGGG 90

Query: 66  RCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDGRVFP 125
           RCNVTNGHC D  +LA HYPRGHKE +G FF  HGP DTMSWFS HGV LK EDDGRVFP
Sbjct: 91  RCNVTNGHCNDTINLAGHYPRGHKELKGSFFYTHGPADTMSWFSEHGVPLKTEDDGRVFP 150

Query: 126 VSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQK-LMNCFEHVEANYL 185
           VS+ S S+VDCL++EA   GV L+ GK V +ASI    KF +K+ K   +  E +EA YL
Sbjct: 151 VSDNSLSVVDCLLNEANIRGVRLERGKSVLAASIKPDGKFLVKVGKQSADTSESIEATYL 210

Query: 186 LIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG----------------- 245
           LIA+GSS++G SLA + GHS++DPVPSLFTFKI DP L EL+G                 
Sbjct: 211 LIATGSSQKGHSLATKFGHSIVDPVPSLFTFKINDPLLTELAGISFSKVQAKLKLDNPCP 270

Query: 246 -------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQLII 305
                  +GPMLVTHWGLSGPVILRLSAWGAR LF+S YKG                   
Sbjct: 271 DLSNLVQIGPMLVTHWGLSGPVILRLSAWGARYLFSSKYKGH------------------ 330

Query: 306 FSVGLLIVDFTPDLHLEDVKTILSRHKSQFME--------------------------IN 365
                LIVDF PD+++E  K++L  HK QF +                           +
Sbjct: 331 -----LIVDFIPDINIETAKSVLKEHKLQFSKHKVSNSYPPQFGLVNRFWRYILDREGSS 390

Query: 366 DEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR 394
            + LWAS+SN SL+SIS LLK C F++ GKGQ+KDEFVTAGGVPLSE+SLKTMESK+   
Sbjct: 391 KDTLWASLSNNSLSSISDLLKHCTFQVTGKGQYKDEFVTAGGVPLSEVSLKTMESKLVPN 447

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038889404.12.5e-20480.55uncharacterized protein YtfP isoform X2 [Benincasa hispida][more]
KAG7016151.12.8e-20377.80ytfP [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_004148683.14.8e-20380.13uncharacterized protein LOC101210627 isoform X2 [Cucumis sativus] >KGN52437.1 hy... [more]
XP_022993604.19.0e-20280.34uncharacterized protein LOC111489549 isoform X1 [Cucurbita maxima] >XP_022993605... [more]
XP_023549941.12.6e-20180.13uncharacterized protein LOC111808280 [Cucurbita pepo subsp. pepo] >XP_023549943.... [more]
Match NameE-valueIdentityDescription
Q795R83.7e-3328.81Uncharacterized protein YtfP OS=Bacillus subtilis (strain 168) OX=224308 GN=ytfP... [more]
B0NAQ47.5e-2624.943-dehydro-bile acid delta(4,6)-reductase OS=Clostridium scindens (strain ATCC 35... [more]
P449416.6e-2227.14Uncharacterized protein HI_0933 OS=Haemophilus influenzae (strain ATCC 51907 / D... [more]
P376314.7e-2026.47Uncharacterized protein YhiN OS=Escherichia coli (strain K12) OX=83333 GN=yhiN P... [more]
Match NameE-valueIdentityDescription
A0A0A0KVG62.3e-20380.13Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G634330 PE=4 SV=1[more]
A0A6J1K0M04.4e-20280.34uncharacterized protein LOC111489549 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1FLC13.7e-20180.13uncharacterized protein LOC111445022 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A1S3C9B61.1e-20079.28uncharacterized protein YtfP isoform X1 OS=Cucumis melo OX=3656 GN=LOC103498464 ... [more]
A0A6J1BWQ27.0e-19276.79uncharacterized protein LOC111006025 isoform X1 OS=Momordica charantia OX=3673 G... [more]
Match NameE-valueIdentityDescription
AT5G39940.12.1e-14058.01FAD/NAD(P)-binding oxidoreductase family protein [more]
AT5G39940.25.2e-13157.50FAD/NAD(P)-binding oxidoreductase family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePRINTSPR00368FADPNRcoord: 21..40
score: 41.99
coord: 180..198
score: 22.94
NoneNo IPR availablePFAMPF03486HI0933_likecoord: 21..227
e-value: 4.3E-49
score: 167.4
coord: 312..404
e-value: 3.2E-32
score: 111.8
coord: 228..254
e-value: 1.8E-5
score: 23.7
NoneNo IPR availableGENE3D2.40.30.10Translation factorscoord: 208..351
e-value: 1.1E-79
score: 270.2
NoneNo IPR availablePIRSRPIRSR000350-3PIRSR000350-3coord: 19..80
e-value: 0.0037
score: 13.8
NoneNo IPR availableSUPERFAMILY160996HI0933 insert domain-likecoord: 207..352
IPR036188FAD/NAD(P)-binding domain superfamilyGENE3D3.50.50.60coord: 22..401
e-value: 1.1E-79
score: 270.2
IPR036188FAD/NAD(P)-binding domain superfamilySUPERFAMILY51905FAD/NAD(P)-binding domaincoord: 22..406
IPR023166HI0933-like insert domain superfamilyGENE3D1.10.8.260coord: 271..334
e-value: 1.1E-79
score: 270.2
IPR0047923-Dehydro-bile acid delta(4,6)-reductase-likeTIGRFAMTIGR00275TIGR00275coord: 22..403
e-value: 1.5E-88
score: 295.3
IPR0047923-Dehydro-bile acid delta(4,6)-reductase-likePANTHERPTHR42887OS12G0638800 PROTEINcoord: 312..412
coord: 10..310

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10022099.1HG10022099.1mRNA