HG10007441 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10007441
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Selenoprotein O
Location: Chr10: 5132762 .. 5141774 (+)
RNA-Seq Expression: HG10007441
Synteny: HG10007441
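
The Location field combines a chromosome name, start and end coordinates, and a strand. The following is a minimal sketch of reading it and deriving the length of the genomic span. It assumes the coordinates are 1-based and inclusive (as in GFF3); the helper names `parse_location` and `span_length` are hypothetical and not part of the database.

```python
import re

def parse_location(text):
    """Split a string such as 'Chr10: 5132762 .. 5141774 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr10: 5132762 .. 5141774 (+)")
print(chrom, strand, span_length(start, end))  # Chr10 + 9013
```

Under that assumption the gene spans 9013 bp on the plus strand of Chr10.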
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGGGAAAGAAACCAAAGTTGCCGGCAGTGGAGGGTTGAGGATCTTCAAGTTGTTGGCATCTGATGGAGGAGGAAGAATTGTGAGTGGGTGGGCAGGCATGGGTTCAGCAAGTGTGTAAGTCAGTGTGTGGTAAAACAAGTTTAAAGATTTAAAAAACTTGGTCTGGGCTGGGCTCAAAATGCCCAGAATGGGTTTTAATCCTCCTCCTCCTCCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTTTCCTCTCAATAATTTGCAGCTCTCATGTCTTAGGCATGTCCATGAGGTGTCTTGCTGTGTCGAAAATAAGAAAATTAATAACAAAATAATGGACACATGTCGGGGTCTAACACATGTCTGACACCAACACTTGGCCATTTTAGGAGTGTCGGTGCTTTATAGCCTGAGACCTATTCAATAGGTGAGATCCATACTTACTCACAATAATCTTCTGCTACAAAGAATGAGGTACCAAGTAGAAACACCACAACTATTTTGCCAAGAGAGCCTTATTACGTAACTGTAGATTCTTGATGCCTAGGCCCCACTCTCCAATGGCTTCAAAACAACCTCCCAATTAACTAAGTGGGCCCCTCCCCTTTATCAACCCTTTCCCAAAAAAAAATTCCTCATTGCCTTTCAATTTTCTTACTAATCGAGATGTGCACCCTAAAAAGGGAAGGGAAGTATAACAGGATTCCACTCAACACCGACTAAATCAAAGTTAATCTTCCCTCTTAGGGAAGAAAGCCTTTTTTTTAATGATGATAGCCTTTTTCTGAATTTTCTCCATCACCGGGTTCCAAAAATGCACAAATTTGGGTTATCACTTAAAGGAAGGCCAAGATAAGCATCAGGAAGCGAGTTAACTTCACACCTCACCAAAGAGTTTTTCATGATCAATCTAAAGATTGGCCAGATAGGTGGGGTCAGCCCACCTCCTAGCTTCTTATTGGAGTTCGTAGTTCAAGTTTTTAAAGAGTATTCTATTAAGGATATTTGTCTTAATTGGAATGCTTTTATTTTTACTTTGTAATGAAGCAATTTAGCATGTTTGTAATCTCTGATTTCTTGTATTTCGAATAGGAAATGATGAGGGTGCTACGGGGATGTCAACTTGGTTGAGATGTCTGGGTGCACCTCCTGATCCACAGTTTTTAGTTGTCTTGTTTTGCTATTTTGTTTCTTTGTAATTGAGCTATTGCTCTTTTCATTATATCAATGAAAAGTTTCGTTTCCATTTAAAAAAAAAAAAGACGTCACACCTCACCAAAGGTGCCCAATAACTTAACTTAGCAGACTCACAGTTCACTCCAAAATGGTGCTTTTCTTGCTATTAAATTTTAGACCTAATATAGCTTCACAGAAAGACAAAAGTCATTAAGGTTTCTAAAGGAGTTCTCCTTTTCAGAAGAAAAAAAGAGTGTCATCAACAAATTGAAGATGGGACAATTGAACTCCATTAGAACCCACCAGAAGGCCCTTCACCACACCATCATACATTGACTTGTTGAGAACGTCGACGTCCACCACCTGATTACACAGAAAGGGCGAACTAGAATCACCTTGCCTAAGACCTTTTGTCGCAAAATCTTCCCCTTGATTGCCATTGACCAATATGGAGAAGTTAGTGGACTAAAGACAATTCCGACTCCAAGCTCTACACTTAAACCCAAAACCTTTTATGCGCAAACCTTTATCTAGCAAGTCTCAACCCACGTGATCATAAACTTTCTCGAAATCCACCTCTAGAATCGCCCCTTTCTTTTTTCTCACTTGAAAACCTTCAATGGCTTCAATCACGATACCCTTAGGCATTCAAAATAGTACTAGGGAGCACTTAGGCCTAAGGTTTTCAACTTCGATAGCTTTTTTTTTTTTTTTTTTTTTGGGAATCAGACAAACCCTAGCAATTTTTTCTTGTTAATCTTCTATTTTAAGATGTTTATCTTCTAGCTTATAGCAATTTTTACATGATCTTTTGATTTTCAA
GTATCCTTGTATTGTGAACATTAGTCTCTTTTCATTATATCAATGAAAAATTTTGTTTCCTTTAAAAAAAAAAGTCTCACTCATAGCAACGCCTACAATCCCTCTGTTGTGAAACTCACTAAAGACTTGCCATATGTCTTCCACTTCACATCCTCTCGATTATCTTGATAATAAAATGCCATAGTGAAATCATCTGACCTTGGCGATTTATTCTTATCAAAACCAAAAACGACCTTCTGGATCTCTTCCAAGGAGAACGGGGCCTTCAGGACCTCCTTCGCTAATAGAGATAGGCCTCCAATCCAACCTTTCCACAAAAGGCCTGGAGATTCATTGAGAAGAAGGAAAGGACCTTTTGAATCTCGCTTTCCTCCTTCACCATTTCACCATTTTCCTCATCCAAAAAATTAATAAGGTTTTTACTCCTCCCCCTTGCACTGGCTTGTAAGAAAGAGGTATTGGCATACCCTTCCTCGGTCCACTTCATTGTAGTCTTTTGGCGCCAGCTAATATCATCACTTCAAATCAAGGATTCAAGCTCCACTCTGAGACCATCCTTCTCCCTCTCCAACCTCATATGTGAGAACAAGCTCCCTTCCATCTCCAAGTCATCTTCCGAGTAACGCCTTCATCCACTACATTGTCTCCTTCTTACGCTTCAAGTGCCCAAATGCAGACAAGTTTTTAACACTTAAAGCCAAAGGTCTCCTTGAGAGCCCTATAGATAACTATTTTAGTGGCTGCATGTCTAGCAATTGGATTTTTGTAAGCATTCTTGTGAAGGATTTATTTGCGTGACTAACACATGGCCTATAGGAATTCTTCACCTTGTAACTTTTCACTTATCAATGAAAGCGTGTCAATTTTTCATTTTTTAAAAAGAAGATAATGTAGACGCGTATGATTGGTCATCATATGAGTGTTCTAATGATACTTTTCTCTTTGTATTTCAGTCTTAATCTACTGTAGTCAATTAGATGTCTGTGTATGTTCTCTCCCTTAGAGTTGTTTTCTTCTTGCAAGAGCTGAAGGGTTTGCTTTTGTCATTGCAGATTTGAAAGGCCAGATTTCCCCCTCTTGTTCTCTGGCGCATCTCCATTAGTTGGAGTGTAAGTGTGAAATACTAATTGGTTGACAGCACCAGCGTTGTGATTTTCATTATCCCGACTGCTGATATTGTGTATTATAGGTCGCCTTATGCTCAATGTTATGGGGGCCATCAGTTTGGCATGTGGGCTGGACAGTTGGGTGATGGCCGAGCAATAACCCTTGGGGAGATACTTAATTCACAATCTGAAAGGTGGGAGTTGCAGCTAAAAGGTGCTGGGAAGACTCCATACAGTCGGTTTGCAGATGGCTTAGCTGTGTTACGAAGTAGCATCAGGGAGTTCCTTTGTAGTGAAGCAATGCACAGTCTTGGAATACCAACAACTCGTGCACTTTGTCTTCTGACCACAGGAACATTTGTTACCCGAGATATGTTTTATGAGTACGTAAACAAAAAAATCTGTTCAAGATGGAAGTTATAGAAATTTTGTAGTTATACGATGATCTTATATCTTCTTGTGCATTTGAATAGGGTTTTCCCATAAGTTCAAGAATTCTTAATCTCATAATATGGTTTTTTTTTCTTTATCTGCAAAACCCTTGTTAACACAAATAAATTTCTTTAACTAAATTTTTTGTTTTAATTAATTTTAGGTCAGGGATGTATATATTTTGTTTTTCTAAAAAAATTGAAATCCTGATTTTGTCAAATACTTATATGTAGGTACTTATAGTTTTGAACATGTAAATAACTGTAAAATACTAAGAGACATAAATTCATGTTGGTTATGACAATGCAATAATAAGTTCCGATCGTTTCAATTTATAGTGGGAATGCAAAAGAAGAGCCTGGTGCAATTGTATGCAGAGTGGCTCAATCCTTTCTGCGTTTTGGATCATACCAAATTCATGCCTCTAGAGGGAAAGATGACTACAAAATTGTTCGGGCTTTA
GCAGACTACGTCATCCGTCACCATTTTCCGCACCTAGAGAATATGAGCAGCAGTCAGAGTTTATCTTTCAGCACAGGCAATACAGATAGTTCAGTCGTTGATCTCACTTCAAACAAGTATGCAGGTAATCATTTCTGTTAAAAAATTGTGTGATGTTGTAGTAAAAAGTAGTTTGTGACAAGAAGTACAGGTAGTGAAAAGGCTACTTGAACTTAACCGTGTGTAGGCCGGTGTACATTGCTATTTTATGTGAAACTTAATTAATTGGTACGTGGAAGAAAAAATAAACTAATTCTCCAATTTTCCTTTTCTTGAGTTGCAGTGAGCTGTAAGGGGTAGGTTAATTCTTTTCCTTTCTAATGTTTAGATTAATTCTTTTCCAGAGCCATGTTCCAGTTGTCTGAAATTATAAAAGAATATTTAATTAAAAACACGTTTTTAGCATGTTAGACACGTGGCTAGAGCATGTCTGGATGTATCCAGGTTCGACAAATGTCCATGCTTCCTACGGTGTTAGGTCATTCCATCCTTAGCAAAGCTAGGTTATCATTTACTAGTTCTTTTTTATTTTATTTTATTTTATTTTAGATGAAAACAAGGACTTCCATTGAGAAAAAATGAAAGAATACAAGGGCATACAAAAAATCAAGTCCACAAAAAAAGGGGGAGCACCCTCTACAAGAAGGTGCTCCAACTGTGCACAATAATGCCTATAGGATAATTACAAAAATCCTTCGAATACGAAGTCCAAAGAGAAACATAAAAACGAACAATGGACCATAGATCAGAAGGATCCCTATTCACCCCTTTAAACACTCTTCTATTCCGCTCGCCCCACAAAGCCCACAAGATCGCAACCACCGTTGCACTCCATAAAAAGCTGCCTTTCCCTCCAAAAGGCGTCCTTGATTAGATCACTGACATCTTTGTGCTGCACAGTCGTCAAACCAAACGTCAAAGAGTCCCAAACACGACTCGCATACTCACAGCGCCAGAGAATATGGTTAAGGTCTTCCTCTACCTTCTGATAAAGAATGCAACAAAAAGGCCCCACCTGCTAGGGCAACTGCCTTACGAGCCTATCCATCGTGTTAGCGCGGCCGTGGAGAACCTGCCAAGTAAAGAACCTCACCTTCTTAGGGATTTTAATCCTCCAAAGAACTGTAAAAACCAACACCCCTAAAAGAGAAACATAATGTGAGGCAGTGCTAAGGCTATATAGTTTATGATCCTTCATTGTCTGAATTTGCACGTTCTTTTGTGCTATTGTATTGTTTATCTATTGAGCAAACTATTCTTCCGCCTTTTACTAAAGATTGCTTTTGTCATCTTAAGATCAAAATGTTACTCAAATATAATCAAGGCTTCAATTTGTCAGAGGCATGGGTGAAAATCTTAGGATGGTATTTTTATTTGAAATTTATGAAGATGAAGAATTTGCCTCGTGCTTCCTGAATTTATGATGTTCCGCATGTATCTTTCTCTTGAACTGCATTAAAGAAGCTGCCTCAAGTTTGGCAAAACATGTGTAGTTTGCCTCTTTCAAATACTTTAACTACTCTACTTATACAAGCAAGATAGCGTAATTGATGCATCATGGGCTTGTGCTTTGTGCCTTTAGGCTCGTTTTACACACTGTATCCTTCATATTGCCTTTTTTTTTTTGTTTTTTTAAATTTTTTACTTTTTATTTTTTGTTTGAAAAAAAATAAGAAACGAACTTTTAGTTTTTATTGATGATTGAAAAGTTACAAGTTTGTTCAAAGGTGCAAACTCCCGTAAGGATTTAGAATGGAAAATGAGAACGGGTCTTTCATAAGTTAAATGAACAGTTTTGAGTTTGACCAAAATAACCTTTGTTTCCTTCCTTGGTTACACTTACAAGTTACAAATCTTATTGGGACACTGTTTCGTGCCTATTATTAAGCAACACTTTCAATCATAACTCTCTCAAGAGTCAAGACATTGGTCTAGATTCATGTTTAAATTTTATTCCTTTA
CGGCATTTCGCAGCAACTGGGAATTTGTTAGCAAACCTAGGATTGCATGACCTATGTGATGAATCCTGGTTCAAAGCTTACTTCATGGCTTGTCTGTGAACTTTTGTAGAGCCGAGCGCATGGGAACTATTTTTATTTTTATTAATTTATATTTTTTTGGTAAGAAACCAAGCTTTTATTGGGAGAAGTGAAAGAATGTACAACTGGATACAAAATTCTCCATTTTATTGCATGGTTGATGCAGTTCTCCTTTTAGTTGTATTCAACTTTCTGTTTCTTAACAAAAAGGGTTACACATTTGCTTCTGTTTTCATTGCCTTTACATGGATGATGGTATCAGCGTGGACTGTAGAGGTTGCTGAGCGAACTGCTTCCTTAATAGCAAGTTGGCAGGGAGTTGGGTTCACACATGGTGTACTCAACACTGACAATATGAGCATCTTGGGTCTAACCATCGATTATGGTCCCTTCGGATTTTTGGATGCTTTTGATCCTAGTTTTACGCCTAATACAACTGATCTTCCGGGCAGAAGATACTGTTTTGCAAATCAGCCAGATATAGGCTTATGGAATATAGCCCAGTTTGCCTCAACTCTTTCAGCTGCTGAATTAATAAATGACAAAGAAGCAAACTATGCCATGGAAAGGTAGTTTTACTTTATCGAAGTTTTGTCTTTTCTAGAATTGTTACTCCGTTGCACTCATATTATGACATTTCCCCTCTCTTCTTACAGATACGGAGACAAATTTATGGATGACTATCAAGCCATTATGACCAAGAAAATTGGTCTGCCAAAGTACAACAAACAGTTAATCGGCAAGCTTCTCAACAACATGGCTGTTGACAAGGTTGATTATACAAATTTTTTTAGATCACTTTCCAATATCAAAGCTGATCCCAGCACCCCAGAGGAGGAGCTGTTGGTCCCTCTGAAGGCAGTTCTGTTAGATATTGGCAAAGAGCGTAAGGAAGCTTGGGTTAGCTGGGTAAAGACCTACATGGAGGAGGTATAGTGATATGAAATACCTGATGGTTCTTTGGAGTCGGATTATTACAATATATATAGGGGAATTTTTAAAAATAGAAAAATAGGGAAACTATTTACACAAAATAACAAAATTTTTAGATAGTTGTGATAGACGCTGATAAAAGTCTATCAGGATCTATCAGTGATAAAAATGATAGAAGTGTATCATTGATAGATGTTGATAGAAGTCTATCAATGTTTATCAGTGTTTTTTTTTGGTATTTTCTGTAAATAGTTTGACATTTTTTCTATCATGAAAATTTCCAACATATATATATTTTTGATGAGAAACGAGACAAAATCTATTGATGAAAAGAGAATACAACAAACTTGATCATCCTCTTCAGAAGGTCTTTTGACTTCTGAGATTGAAATAAGGCTATCCCATTCTTCCATCTCTCTATCAAGAAGCCCTATTCTAAAATTCAGATTCCACGCATCCCCTTTGGAAAACCAACAATTCTCACCGAAAGGCTTTTATTTTAGAGTTAACAATAGCGTAAATTGAGGGAAACTCCTGACTAGATGGTCTGTTTGTGGCTCAATGATCTTTCAAAAAATGATTCTTATCCCCTGTTCCCAACATATAACTGGTAATTCTTTGAAAGAAATTGCAAGTTTTGCTAATGTAACTCCAGTGGCTGAACTTGGATCTTTCTCATCTCATTAGGAAACCAATTACCATTTGATTGCCCACATTTGCTTCTTAGAACTCAAGGATACACTTGTACAACACTACAACCTATAGAATAGATTCTGTTCAAGTAAGGTCGTTGCCCACATTTGCTTCTAATAACCGATTTCCAGGAGGGCAGAATCCTTCTGAGTATATCTCCACAGCCATTTAGAGAGTAAGGCTGTATCTCTTCTTCTCAGATCTCCAATCCCCAGCCCACCATCACATAAAGGCACCGTGACCTTCCAACAACTAAATGGCTGCCACCATCCATGTTTGCCCGTTCCCAAAGAAAG
TGCCTTAAAAATTTATCTATGGACAAACTAACTTTGGAAGGGATTTGAAAGAACATGAATTCACCAAAAGCATATTGCCTCCTTTTGAGAGGGGAAAACCTCTCCACTTAGCAAGCTTGTTCTTGATTTTATCAACAACCGGATCCCAAGACAACACCTTCTCGGGATTGTCACCTAGAGGCAATCCAAGGTAGTTAAAAGGCAGGGCTTCCACTTTACAAACAAACAACAGAGCCTCTTTTCTAACCTTGGTTTCTTAAGGAAAGCTTTGTTTCTTATATGTAAGAGAAGAGGGGGGGGGGGGGGGGGGGGGGACAGAGCCTCTTTTCTAATCTCTTTCCTATGGATTATTACAATATTTCCTTGAGTGTATCAACACATTTGGTCATTCAACCACTGATTGGATGGGGTTTCCACGTTTTCAGTGGGCTTTTATTTCCATTCATTGTTGAATCTTGTCTCTTTTTTTTTTTTTTTCTTAGAATAAATATTTGGGATTTAATTTTCTTTGACCATTGCCTACTGTACATATCTTCTGCTGGGGTGGTCTTGGCTTTCTTGGCTGTAGCTTGCAAACTTTGTTATTGGACTTAATTCAAATTTTAAAGAACTCTATAATGCTACTAGATGCTTTTTCTGTTGGTAGTTGAATTTTCATAAAATAGAGTCTTCATTTAAGGAACGTTGTTGGCATGCTTTGAACATGCAAATAAATTAGTTTTTCCTTTATCTTATTCATTATTCTATTGCTTATGCAGCTGGCTGGAAGTGGCATTTCAGATGAGGAGCGGAAGGCCTCTATGGATGCAGTAAATCCTAAATATATTCTGAGGAACTACCTTTGCCAGACTGCCATAGATGCAGCTGAACAGGGTGATTTTGGAGAGGTTCGTCGGCTGCTGAAGATAATGGAACGGCCATTCGATGAGCAGCCAGGAATGGAGAAATATGCACGGTTGCCCCCAGCGTGGGCTTATCGGCCGGGTGTTTGTATGCTTTCTTGTTCTTCTTGA

mRNA sequence

ATGGAAGGGAAAGAAACCAAAGTTGCCGGCAGTGGAGGGTTGAGGATCTTCAAGTTGTTGGCATCTGATGGAGGAGGAAGAATTGTGAGTGGGTGGGCAGGCATGGGTTCAGCAAGTGTATTTGAAAGGCCAGATTTCCCCCTCTTGTTCTCTGGCGCATCTCCATTAGTTGGAGTGTCGCCTTATGCTCAATGTTATGGGGGCCATCAGTTTGGCATGTGGGCTGGACAGTTGGGTGATGGCCGAGCAATAACCCTTGGGGAGATACTTAATTCACAATCTGAAAGGTGGGAGTTGCAGCTAAAAGGTGCTGGGAAGACTCCATACAGTCGGTTTGCAGATGGCTTAGCTGTGTTACGAAGTAGCATCAGGGAGTTCCTTTGTAGTGAAGCAATGCACAGTCTTGGAATACCAACAACTCGTGCACTTTGTCTTCTGACCACAGGAACATTTGTTACCCGAGATATGTTTTATGATGGGAATGCAAAAGAAGAGCCTGGTGCAATTGTATGCAGAGTGGCTCAATCCTTTCTGCGTTTTGGATCATACCAAATTCATGCCTCTAGAGGGAAAGATGACTACAAAATTGTTCGGGCTTTAGCAGACTACGTCATCCGTCACCATTTTCCGCACCTAGAGAATATGAGCAGCAGTCAGAGTTTATCTTTCAGCACAGGCAATACAGATAGTTCAGTCGTTGATCTCACTTCAAACAAGTATGCAGTTCTCCTTTTAGTTGTATTCAACTTTCTGTTTCTTAACAAAAAGGGTTACACATTTGCTTCTGTTTTCATTGCCTTTACATGGATGATGGTATCAGCGTGGACTGTAGAGGTTGCTGAGCGAACTGCTTCCTTAATAGCAAGTTGGCAGGGAGTTGGGTTCACACATGGTGTACTCAACACTGACAATATGAGCATCTTGGGTCTAACCATCGATTATGGTCCCTTCGGATTTTTGGATGCTTTTGATCCTAGTTTTACGCCTAATACAACTGATCTTCCGGGCAGAAGATACTGTTTTGCAAATCAGCCAGATATAGGCTTATGGAATATAGCCCAGTTTGCCTCAACTCTTTCAGCTGCTGAATTAATAAATGACAAAGAAGCAAACTATGCCATGGAAAGATACGGAGACAAATTTATGGATGACTATCAAGCCATTATGACCAAGAAAATTGGTCTGCCAAAGTACAACAAACAGTTAATCGGCAAGCTTCTCAACAACATGGCTGTTGACAAGGTTGATTATACAAATTTTTTTAGATCACTTTCCAATATCAAAGCTGATCCCAGCACCCCAGAGGAGGAGCTGTTGGTCCCTCTGAAGGCAGTTCTGTTAGATATTGGCAAAGAGCGTAAGGAAGCTTGGGTTAGCTGGGTAAAGACCTACATGGAGGAGCTGGCTGGAAGTGGCATTTCAGATGAGGAGCGGAAGGCCTCTATGGATGCAGTAAATCCTAAATATATTCTGAGGAACTACCTTTGCCAGACTGCCATAGATGCAGCTGAACAGGGTGATTTTGGAGAGGTTCGTCGGCTGCTGAAGATAATGGAACGGCCATTCGATGAGCAGCCAGGAATGGAGAAATATGCACGGTTGCCCCCAGCGTGGGCTTATCGGCCGGGTGTTTGTATGCTTTCTTGTTCTTCTTGA

Coding sequence (CDS)

ATGGAAGGGAAAGAAACCAAAGTTGCCGGCAGTGGAGGGTTGAGGATCTTCAAGTTGTTGGCATCTGATGGAGGAGGAAGAATTGTGAGTGGGTGGGCAGGCATGGGTTCAGCAAGTGTATTTGAAAGGCCAGATTTCCCCCTCTTGTTCTCTGGCGCATCTCCATTAGTTGGAGTGTCGCCTTATGCTCAATGTTATGGGGGCCATCAGTTTGGCATGTGGGCTGGACAGTTGGGTGATGGCCGAGCAATAACCCTTGGGGAGATACTTAATTCACAATCTGAAAGGTGGGAGTTGCAGCTAAAAGGTGCTGGGAAGACTCCATACAGTCGGTTTGCAGATGGCTTAGCTGTGTTACGAAGTAGCATCAGGGAGTTCCTTTGTAGTGAAGCAATGCACAGTCTTGGAATACCAACAACTCGTGCACTTTGTCTTCTGACCACAGGAACATTTGTTACCCGAGATATGTTTTATGATGGGAATGCAAAAGAAGAGCCTGGTGCAATTGTATGCAGAGTGGCTCAATCCTTTCTGCGTTTTGGATCATACCAAATTCATGCCTCTAGAGGGAAAGATGACTACAAAATTGTTCGGGCTTTAGCAGACTACGTCATCCGTCACCATTTTCCGCACCTAGAGAATATGAGCAGCAGTCAGAGTTTATCTTTCAGCACAGGCAATACAGATAGTTCAGTCGTTGATCTCACTTCAAACAAGTATGCAGTTCTCCTTTTAGTTGTATTCAACTTTCTGTTTCTTAACAAAAAGGGTTACACATTTGCTTCTGTTTTCATTGCCTTTACATGGATGATGGTATCAGCGTGGACTGTAGAGGTTGCTGAGCGAACTGCTTCCTTAATAGCAAGTTGGCAGGGAGTTGGGTTCACACATGGTGTACTCAACACTGACAATATGAGCATCTTGGGTCTAACCATCGATTATGGTCCCTTCGGATTTTTGGATGCTTTTGATCCTAGTTTTACGCCTAATACAACTGATCTTCCGGGCAGAAGATACTGTTTTGCAAATCAGCCAGATATAGGCTTATGGAATATAGCCCAGTTTGCCTCAACTCTTTCAGCTGCTGAATTAATAAATGACAAAGAAGCAAACTATGCCATGGAAAGATACGGAGACAAATTTATGGATGACTATCAAGCCATTATGACCAAGAAAATTGGTCTGCCAAAGTACAACAAACAGTTAATCGGCAAGCTTCTCAACAACATGGCTGTTGACAAGGTTGATTATACAAATTTTTTTAGATCACTTTCCAATATCAAAGCTGATCCCAGCACCCCAGAGGAGGAGCTGTTGGTCCCTCTGAAGGCAGTTCTGTTAGATATTGGCAAAGAGCGTAAGGAAGCTTGGGTTAGCTGGGTAAAGACCTACATGGAGGAGCTGGCTGGAAGTGGCATTTCAGATGAGGAGCGGAAGGCCTCTATGGATGCAGTAAATCCTAAATATATTCTGAGGAACTACCTTTGCCAGACTGCCATAGATGCAGCTGAACAGGGTGATTTTGGAGAGGTTCGTCGGCTGCTGAAGATAATGGAACGGCCATTCGATGAGCAGCCAGGAATGGAGAAATATGCACGGTTGCCCCCAGCGTGGGCTTATCGGCCGGGTGTTTGTATGCTTTCTTGTTCTTCTTGA

Protein sequence

MEGKETKVAGSGGLRIFKLLASDGGGRIVSGWAGMGSASVFERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDGNAKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIRHHFPHLENMSSSQSLSFSTGNTDSSVVDLTSNKYAVLLLVVFNFLFLNKKGYTFASVFIAFTWMMVSAWTVEVAERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNKQLIGKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLLDIGKERKEAWVSWVKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMERPFDEQPGMEKYARLPPAWAYRPGVCMLSCSS
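
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this on the first 13 codons of the CDS and on its last 6 codons, using the standard genetic code (NCBI translation table 1). The function `translate` and the compact table construction are illustrative assumptions, not the pipeline used to produce this annotation.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 13 codons of the CDS listed above.
print(translate("ATGGAAGGGAAAGAAACCAAAGTTGCCGGCAGTGGAGGG"))  # MEGKETKVAGSGG
# Last 6 codons of the CDS, ending in the TGA stop codon.
print(translate("CTTTCTTGTTCTTCTTGA"))  # LSCSS
```

Both fragments agree with the start (MEGKETKVAGSGG) and the end (LSCSS) of the protein sequence above.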
Homology
BLAST of HG10007441 vs. NCBI nr
Match: XP_038878878.1 (protein adenylyltransferase SelO [Benincasa hispida])

HSP 1 Score: 952.2 bits (2460), Expect = 2.0e-273
Identity = 470/511 (91.98%), Positives = 475/511 (92.95%), Query Frame = 0

Query: 41  FERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQSERWELQ 100
           FERPDFPLLFSGA+PLVG SPYAQCYGGHQFGMWAGQLGDGRAITLGEI+NS+SERWELQ
Sbjct: 171 FERPDFPLLFSGATPLVGASPYAQCYGGHQFGMWAGQLGDGRAITLGEIINSRSERWELQ 230

Query: 101 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 160
           LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG
Sbjct: 231 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 290

Query: 161 NAKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIRHHFPHLENMSSSQS 220
           N KEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIV ALADYVIRHHFPHLENMSSSQS
Sbjct: 291 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVGALADYVIRHHFPHLENMSSSQS 350

Query: 221 LSFSTGNTDSSVVDLTSNKYAVLLLVVFNFLFLNKKGYTFASVFIAFTWMMVSAWTVEVA 280
           LSFSTGNTDSSVVDLTSNKYA                                AWTVEVA
Sbjct: 351 LSFSTGNTDSSVVDLTSNKYA--------------------------------AWTVEVA 410

Query: 281 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 340
           ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC
Sbjct: 411 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 470

Query: 341 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 400
           FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK
Sbjct: 471 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 530

Query: 401 QLIGKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLLDIGKERKEAWVSW 460
           QLI KLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLLDIGKERKEAWVSW
Sbjct: 531 QLISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLLDIGKERKEAWVSW 590

Query: 461 VKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMER 520
           VKTY+EELAGSGISDEERKASMDAVNPKY+LRNYLCQTAIDAAEQGDFGEVRRLLKIMER
Sbjct: 591 VKTYIEELAGSGISDEERKASMDAVNPKYVLRNYLCQTAIDAAEQGDFGEVRRLLKIMER 649

Query: 521 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 552
           PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 651 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 649
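
The percentages in each HSP summary line are the ratio of matching columns to alignment length. As a rough sketch, such a line can be parsed and the percentages recomputed as follows. The function name `hsp_percentages` is hypothetical, and the regular expression only assumes the `label = n/m` layout used in the reports on this page.

```python
import re

def hsp_percentages(line):
    """Return {label: percent} for every 'label = n/m' pair in an HSP summary line."""
    result = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        result[label] = round(100 * int(num) / int(den), 2)
    return result

line = "Identity = 470/511 (91.98%), Positives = 475/511 (92.95%), Query Frame = 0"
print(hsp_percentages(line))  # {'Identity': 91.98, 'Positives': 92.95}
```

The recomputed values agree with the percentages printed for the first NCBI nr hit (XP_038878878.1).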

BLAST of HG10007441 vs. NCBI nr
Match: XP_004149028.1 (uncharacterized protein LOC101218327 [Cucumis sativus] >KGN66443.1 hypothetical protein Csa_007588 [Cucumis sativus])

HSP 1 Score: 951.4 bits (2458), Expect = 3.3e-273
Identity = 470/511 (91.98%), Positives = 475/511 (92.95%), Query Frame = 0

Query: 41  FERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQSERWELQ 100
           FERPDFPLLFSGASPLVG SPYAQCYGGHQFGMWAGQLGDGRAITLGEILNS+SERWELQ
Sbjct: 171 FERPDFPLLFSGASPLVGASPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSRSERWELQ 230

Query: 101 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 160
           LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG
Sbjct: 231 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 290

Query: 161 NAKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIRHHFPHLENMSSSQS 220
           N KEEPGAIVCRVAQSFLRFGSYQIHASRGKDD+KIVRALADYVIRHHFPHLENMSSSQS
Sbjct: 291 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDFKIVRALADYVIRHHFPHLENMSSSQS 350

Query: 221 LSFSTGNTDSSVVDLTSNKYAVLLLVVFNFLFLNKKGYTFASVFIAFTWMMVSAWTVEVA 280
           +SFSTGNTDSSVVDLTSNKYA                                AWTVEVA
Sbjct: 351 VSFSTGNTDSSVVDLTSNKYA--------------------------------AWTVEVA 410

Query: 281 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 340
           ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC
Sbjct: 411 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 470

Query: 341 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 400
           FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK
Sbjct: 471 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 530

Query: 401 QLIGKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLLDIGKERKEAWVSW 460
           QLI KLLNNMAVDKVDYTNFFRSLSN+KADPS PEEELLVPLKAVLLDIGKERKEAWVSW
Sbjct: 531 QLISKLLNNMAVDKVDYTNFFRSLSNLKADPSIPEEELLVPLKAVLLDIGKERKEAWVSW 590

Query: 461 VKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMER 520
           VKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVR+LLKIMER
Sbjct: 591 VKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRQLLKIMER 649

Query: 521 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 552
           PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 651 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 649

BLAST of HG10007441 vs. NCBI nr
Match: XP_008451128.1 (PREDICTED: UPF0061 protein azo1574 [Cucumis melo])

HSP 1 Score: 948.7 bits (2451), Expect = 2.2e-272
Identity = 470/511 (91.98%), Positives = 473/511 (92.56%), Query Frame = 0

Query: 41  FERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQSERWELQ 100
           FERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNS+SERWELQ
Sbjct: 171 FERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSRSERWELQ 230

Query: 101 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 160
           LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG
Sbjct: 231 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 290

Query: 161 NAKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIRHHFPHLENMSSSQS 220
           N KEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVI HHFPHLENMSSSQS
Sbjct: 291 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIHHHFPHLENMSSSQS 350

Query: 221 LSFSTGNTDSSVVDLTSNKYAVLLLVVFNFLFLNKKGYTFASVFIAFTWMMVSAWTVEVA 280
           +SFSTGNTDSSVVDLTSNKYA                                AWTVEVA
Sbjct: 351 VSFSTGNTDSSVVDLTSNKYA--------------------------------AWTVEVA 410

Query: 281 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 340
           ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC
Sbjct: 411 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 470

Query: 341 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 400
           FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK
Sbjct: 471 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 530

Query: 401 QLIGKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLLDIGKERKEAWVSW 460
           QLI KLLNNMAVDKVDYTNFFRSLSNIKAD S PEEELLVPLKAVLLDIGKERKEAWVSW
Sbjct: 531 QLISKLLNNMAVDKVDYTNFFRSLSNIKADSSIPEEELLVPLKAVLLDIGKERKEAWVSW 590

Query: 461 VKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMER 520
           VKTYMEELAGSGISDEERKASMD VNPKYILRNYLCQTAIDAAEQGDFGEVR+LLKIMER
Sbjct: 591 VKTYMEELAGSGISDEERKASMDVVNPKYILRNYLCQTAIDAAEQGDFGEVRQLLKIMER 649

Query: 521 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 552
           PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 651 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 649

BLAST of HG10007441 vs. NCBI nr
Match: TYK26910.1 (UPF0061 protein [Cucumis melo var. makuwa])

HSP 1 Score: 947.2 bits (2447), Expect = 6.3e-272
Identity = 469/511 (91.78%), Positives = 472/511 (92.37%), Query Frame = 0

Query: 41  FERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQSERWELQ 100
           FERPDFPLLFSGASPLVG SPYAQCYGGHQFGMWAGQLGDGRAITLGEILNS+SERWELQ
Sbjct: 171 FERPDFPLLFSGASPLVGASPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSRSERWELQ 230

Query: 101 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 160
           LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG
Sbjct: 231 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 290

Query: 161 NAKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIRHHFPHLENMSSSQS 220
           N KEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVI HHFPHLENMSSSQS
Sbjct: 291 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIHHHFPHLENMSSSQS 350

Query: 221 LSFSTGNTDSSVVDLTSNKYAVLLLVVFNFLFLNKKGYTFASVFIAFTWMMVSAWTVEVA 280
           +SFSTGNTDSSVVDLTSNKYA                                AWTVEVA
Sbjct: 351 VSFSTGNTDSSVVDLTSNKYA--------------------------------AWTVEVA 410

Query: 281 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 340
           ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC
Sbjct: 411 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 470

Query: 341 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 400
           FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK
Sbjct: 471 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 530

Query: 401 QLIGKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLLDIGKERKEAWVSW 460
           QLI KLLNNMAVDKVDYTNFFRSLSNIKAD S PEEELLVPLKAVLLDIGKERKEAWVSW
Sbjct: 531 QLISKLLNNMAVDKVDYTNFFRSLSNIKADSSIPEEELLVPLKAVLLDIGKERKEAWVSW 590

Query: 461 VKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMER 520
           VKTYMEELAGSGISDEERKASMD VNPKYILRNYLCQTAIDAAEQGDFGEVR+LLKIMER
Sbjct: 591 VKTYMEELAGSGISDEERKASMDVVNPKYILRNYLCQTAIDAAEQGDFGEVRQLLKIMER 649

Query: 521 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 552
           PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 651 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 649

BLAST of HG10007441 vs. NCBI nr
Match: KAA0042281.1 (UPF0061 protein [Cucumis melo var. makuwa])

HSP 1 Score: 947.2 bits (2447), Expect = 6.3e-272
Identity = 469/511 (91.78%), Positives = 472/511 (92.37%), Query Frame = 0

Query: 41  FERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQSERWELQ 100
           FERPDFPLLFSGASPLVG SPYAQCYGGHQFGMWAGQLGDGRAITLGEILNS+SERWELQ
Sbjct: 107 FERPDFPLLFSGASPLVGASPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSRSERWELQ 166

Query: 101 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 160
           LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG
Sbjct: 167 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 226

Query: 161 NAKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIRHHFPHLENMSSSQS 220
           N KEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVI HHFPHLENMSSSQS
Sbjct: 227 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIHHHFPHLENMSSSQS 286

Query: 221 LSFSTGNTDSSVVDLTSNKYAVLLLVVFNFLFLNKKGYTFASVFIAFTWMMVSAWTVEVA 280
           +SFSTGNTDSSVVDLTSNKYA                                AWTVEVA
Sbjct: 287 VSFSTGNTDSSVVDLTSNKYA--------------------------------AWTVEVA 346

Query: 281 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 340
           ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC
Sbjct: 347 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 406

Query: 341 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 400
           FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK
Sbjct: 407 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 466

Query: 401 QLIGKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLLDIGKERKEAWVSW 460
           QLI KLLNNMAVDKVDYTNFFRSLSNIKAD S PEEELLVPLKAVLLDIGKERKEAWVSW
Sbjct: 467 QLISKLLNNMAVDKVDYTNFFRSLSNIKADSSIPEEELLVPLKAVLLDIGKERKEAWVSW 526

Query: 461 VKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMER 520
           VKTYMEELAGSGISDEERKASMD VNPKYILRNYLCQTAIDAAEQGDFGEVR+LLKIMER
Sbjct: 527 VKTYMEELAGSGISDEERKASMDVVNPKYILRNYLCQTAIDAAEQGDFGEVRQLLKIMER 585

Query: 521 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 552
           PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 587 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 585

BLAST of HG10007441 vs. ExPASy Swiss-Prot
Match: A1K5T6 (Protein adenylyltransferase SelO OS=Azoarcus sp. (strain BH72) OX=418699 GN=selO PE=3 SV=1)

HSP 1 Score: 438.7 bits (1127), Expect = 9.6e-122
Identity = 248/532 (46.62%), Positives = 310/532 (58.27%), Query Frame = 0

Query: 31  GWAGMGSASVFERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEIL 90
           GW     AS    P+F  +F G   L G+ PYA CYGGHQFG WAGQLGDGRAITLGE+L
Sbjct: 57  GWDESDIAS----PEFAEVFGGNRLLDGMEPYAACYGGHQFGNWAGQLGDGRAITLGEVL 116

Query: 91  NSQSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGT 150
           N Q  RWELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL L+ TG 
Sbjct: 117 NGQGGRWELQLKGAGPTPYSRRADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGE 176

Query: 151 FVTRDMFYDGNAKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIRHHFP 210
            V RDMFYDGN + EPGAIVCRVA SF+RFG++++ A+RG  D  ++  L D+ I   FP
Sbjct: 177 KVVRDMFYDGNPQAEPGAIVCRVAPSFIRFGNFELLAARG--DLDLLNRLIDFTIARDFP 236

Query: 211 HLENMSSSQSLSFSTGNTDSSVVDLTSNKYAVLLLVVFNFLFLNKKGYTFASVFIAFTWM 270
            +E  +  +                                                   
Sbjct: 237 GIEGSARDKR-------------------------------------------------- 296

Query: 271 MVSAWTVEVAERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPN 330
             + W   V  RTA+++A W  VGF HGV+NTDNMSILGLTIDYGP+G++D FDP +TPN
Sbjct: 297 --ARWFETVCARTATMVAHWMRVGFVHGVMNTDNMSILGLTIDYGPYGWVDNFDPGWTPN 356

Query: 331 TTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYA-MERYGDKFMDDYQAIM 390
           TTD  GRRY F +QP I  WN+ Q A+ L  A      EA  A +  Y + +  + +A+ 
Sbjct: 357 TTDAGGRRYRFGHQPRIANWNLLQLANALFPA--FGSTEALQAGLNTYAEVYDRESRAMT 416

Query: 391 TKKIGLPKY---NKQLIGKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVL 450
             K+GL      +  ++  L   M   +VD T FFR+L+         E +LL P  A+ 
Sbjct: 417 AAKLGLAALADADLPMVDALHGWMKRAEVDMTLFFRALA---------EVDLLKPDPALF 476

Query: 451 LDI------GKERKEAWVSWVKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAI 510
           LD         E  E +  W++ Y +     G+  ++R+A M+A NP+Y++RNYL Q AI
Sbjct: 477 LDAFYDDAKRLETAEEFSGWLRLYADRCRQEGLDADQRRARMNAANPRYVMRNYLAQQAI 519

Query: 511 DAAEQGDFGEVRRLLKIMERPFDEQPGMEKYARLPPAWA-YRPGVCMLSCSS 552
           DAAEQGD+G VR LL +M RP+DEQP    YA+  P WA  R G  MLSCSS
Sbjct: 537 DAAEQGDYGPVRSLLDVMRRPYDEQPERAAYAQRRPDWARERAGCSMLSCSS 519

BLAST of HG10007441 vs. ExPASy Swiss-Prot
Match: Q5NYD9 (Protein adenylyltransferase SelO OS=Aromatoleum aromaticum (strain EbN1) OX=76114 GN=selO PE=3 SV=1)

HSP 1 Score: 424.5 bits (1090), Expect = 1.9e-117
Identity = 233/522 (44.64%), Positives = 304/522 (58.24%), Query Frame = 0

Query: 44  PDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQSE----RWEL 103
           P+F  +F+G + + G+ PYA CYGGHQFG WAGQLGDGRAITLGE + ++ +    RWEL
Sbjct: 66  PEFAAVFAGNALMPGMEPYAACYGGHQFGNWAGQLGDGRAITLGEAVTTRGDGHTGRWEL 125

Query: 104 QLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYD 163
           QLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRALCL+ TG  V RDMFYD
Sbjct: 126 QLKGAGPTPYSRHADGRAVLRSSIREFLCSEAMHHLGVPTTRALCLVGTGEKVVRDMFYD 185

Query: 164 GNAKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIRHHFPHLENMSSSQ 223
           G  K EPGA+VCRVA SF+RFG+++I  SRG  D  ++  L D+ I   FP L    +++
Sbjct: 186 GRPKAEPGAVVCRVAPSFIRFGNFEIFTSRG--DEALLTRLVDFTIARDFPELGGEPATR 245

Query: 224 SLSFSTGNTDSSVVDLTSNKYAVLLLVVFNFLFLNKKGYTFASVFIAFTWMMVSAWTVEV 283
                                                                + W  +V
Sbjct: 246 R----------------------------------------------------AEWFCKV 305

Query: 284 AERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRY 343
            ERTA +IA W  VGF HGV+NTDNMSILGLTIDYGP+G++D FDP +TPNTTD  G+RY
Sbjct: 306 CERTARMIAQWMRVGFVHGVMNTDNMSILGLTIDYGPYGWIDNFDPGWTPNTTDAGGKRY 365

Query: 344 CFANQPDIGLWNIAQFASTL----SAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGL 403
            F NQP I  WN+ Q A+ L     AAE +++      ++ Y   F ++ + ++  K+G 
Sbjct: 366 RFGNQPHIAHWNLLQLANALYPVFGAAEPLHE-----GLDLYARVFDEENRRMLAAKLGF 425

Query: 404 PKYNKQ---LIGKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLLDIGKE 463
             +  +   L+  L   +   +VD T FFR L+++       E   + PL+       K 
Sbjct: 426 EAFGDEDATLVETLHALLTRAEVDMTIFFRGLASLDL-----EAPSIDPLRDAFYSAEKA 485

Query: 464 --RKEAWVSWVKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGE 523
              +    SW+  Y +         ++R+  M+AVNP+++LRNYL Q AIDAAEQG++  
Sbjct: 486 AVAEPEMNSWLAAYTKRTKQERTPGDQRRVRMNAVNPRFVLRNYLAQEAIDAAEQGEYAL 523

Query: 524 VRRLLKIMERPFDEQPGMEKYARLPPAWA-YRPGVCMLSCSS 552
           V  LL +M  P+DEQPG E++A   P WA  R G  MLSCSS
Sbjct: 546 VSELLDVMRHPYDEQPGRERFAARRPDWARNRAGCSMLSCSS 523

BLAST of HG10007441 vs. ExPASy Swiss-Prot
Match: Q7UKT5 (Protein adenylyltransferase SelO OS=Rhodopirellula baltica (strain DSM 10527 / NCIMB 13988 / SH1) OX=243090 GN=selO PE=3 SV=1)

HSP 1 Score: 424.5 bits (1090), Expect = 1.9e-117
Identity = 236/536 (44.03%), Positives = 321/536 (59.89%), Query Frame = 0

Query: 35  MGSASVFERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQS 94
           +GSA + E      + +G +   G+ P+A CYGGHQFG WAGQLGDGRAI LGE++ +  
Sbjct: 64  LGSAELTE------VLAGNALADGMDPFAMCYGGHQFGNWAGQLGDGRAINLGEVVTADE 123

Query: 95  ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTR 154
           + W LQLKGAG TPYSR ADGLAVLRSS+REFLCSEAMH LG+PTTRAL L+ TG  V R
Sbjct: 124 KHWTLQLKGAGLTPYSRTADGLAVLRSSVREFLCSEAMHHLGVPTTRALSLVLTGEKVLR 183

Query: 155 DMFYDGNAKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIRHHFPHLEN 214
           DMFYDG+ + E GAIVCRVA SF+RFG+++I ASR  +D + ++ L ++ IR  F HL  
Sbjct: 184 DMFYDGHPEHELGAIVCRVAPSFIRFGNFEIFASR--EDTETLQTLVEHTIRSEFSHL-- 243

Query: 215 MSSSQSLSFSTGNTDSSVVDLTSNKYAVLLLVVFNFLFLNKKGYTFASVFIAFTWMMVSA 274
                 LS         V                                       ++A
Sbjct: 244 ------LSEPDAEIGPDV---------------------------------------IAA 303

Query: 275 WTVEVAERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 334
              EV   TA ++  W  VGF HGV+NTDNMSILGLTIDYGP+G+L+ +DP +TPNTTD 
Sbjct: 304 MFEEVCRTTAEMVVHWMRVGFVHGVMNTDNMSILGLTIDYGPYGWLEDYDPDWTPNTTDA 363

Query: 335 PGRRYCFANQPDIGLWNIAQFASTLSAAELINDKE-ANYAMERYGDKFMDDYQAIMTKKI 394
            GRRY +A+QP I  WN+   A+ L    L+ + E     +  Y ++F   + ++M  K+
Sbjct: 364 QGRRYRYAHQPQIAQWNLVALANAL--VPLVKEAEPLQRGIAVYVEEFQKSWHSMMAGKL 423

Query: 395 GLPKY----NKQLIGKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLL-- 454
           GL KY    + +L+  LL  + + + D T F+R L++I+    T E+ + + L AVL   
Sbjct: 424 GLSKYESETDDELVDSLLTLLQLAETDMTIFYRRLADIEL--GTREQPVTLELAAVLRHL 483

Query: 455 --------DIGKERKEAWVSWVKTYMEE-LAGSGI--SDEERKASMDAVNPKYILRNYLC 514
                   ++ +E ++A + W+++Y    LA  G    D +R+  M+AVNPKY+LRNYL 
Sbjct: 484 SEAHYVADEVTEEYQQALMDWMRSYQSRVLADDGFPAEDSQRRQRMNAVNPKYVLRNYLA 540

Query: 515 QTAIDAAEQGDFGEVRRLLKIMERPFDEQPGMEKYARLPPAWA-YRPGVCMLSCSS 552
           Q AIDA ++GD   V  LL+++ RP+D+QPG E++A   P WA +RPG  MLSCSS
Sbjct: 544 QLAIDACDKGDDSLVSELLEVLRRPYDDQPGKERFAEKRPEWARHRPGCSMLSCSS 540

BLAST of HG10007441 vs. ExPASy Swiss-Prot
Match: Q1H0D2 (Protein adenylyltransferase SelO OS=Methylobacillus flagellatus (strain KT / ATCC 51484 / DSM 6875) OX=265072 GN=selO PE=3 SV=1)

HSP 1 Score: 423.7 bits (1088), Expect = 3.2e-117
Identity = 227/508 (44.69%), Positives = 297/508 (58.46%), Query Frame = 0

Query: 51  SGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQSERWELQLKGAGKTPYS 110
           +G   + G+ PYA CYGGHQFG WAGQLGDGRAI+LGE++N Q +RWELQLKGAG TPYS
Sbjct: 71  AGNGLMPGMEPYAACYGGHQFGHWAGQLGDGRAISLGEVVNRQGQRWELQLKGAGVTPYS 130

Query: 111 RFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDGNAKEEPGAIV 170
           R ADG AVLRSS+REFLCSEAMH LGIPTTRAL L+ TG  V RDMFYDG+ + E GAIV
Sbjct: 131 RMADGRAVLRSSVREFLCSEAMHHLGIPTTRALSLVQTGDVVIRDMFYDGHPQAEKGAIV 190

Query: 171 CRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIRHHFPHLENMSSSQSLSFSTGNTDS 230
           CRV+ SF+RFG+++I A R  DD + ++ L D+ I   FP L N    + L         
Sbjct: 191 CRVSPSFIRFGNFEIFAMR--DDKQTLQKLVDFTIDRDFPELRNYPEEERL--------- 250

Query: 231 SVVDLTSNKYAVLLLVVFNFLFLNKKGYTFASVFIAFTWMMVSAWTVEVAERTASLIASW 290
                                                     + W   +  RTA LIA W
Sbjct: 251 ------------------------------------------AEWFAIICVRTARLIAQW 310

Query: 291 QGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLW 350
             VGF HGV+NTDNMSILGLTIDYGP+G++D FDP +TPNTTD  GRRYCF  QPDI  W
Sbjct: 311 MRVGFVHGVMNTDNMSILGLTIDYGPYGWVDNFDPGWTPNTTDAAGRRYCFGRQPDIARW 370

Query: 351 N---IAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNKQ---LIG 410
           N   +AQ   TL     I D+     +  Y   + +++ A++  K G   +  +   L+ 
Sbjct: 371 NLERLAQALYTLKPEREIYDE----GLMLYDQAYNNEWGAVLAAKFGFSAWRDEYEPLLN 430

Query: 411 KLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLLDIGKERKEAWVSWVKTY 470
           ++   M   ++D T FFR L+ +  D + P+  +L    A    + +  K  +  W+  Y
Sbjct: 431 EVFGLMTQAEIDMTEFFRKLALV--DAAQPDLGIL-QSAAYSPALWETFKPRFSDWLGQY 490

Query: 471 MEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMERPFDE 530
            +     G    ER+ +M+ VNP+Y+LRNYL Q AID A+ GD   +  L+ ++ +P+DE
Sbjct: 491 AQATLADGRDPAERREAMNRVNPRYVLRNYLAQQAIDLADTGDTSMIEALMDVLRKPYDE 518

Query: 531 QPGMEKYARLPPAWA-YRPGVCMLSCSS 552
           QPG E++A L P WA ++ G  MLSCSS
Sbjct: 551 QPGKERFAALRPDWARHKAGCSMLSCSS 518
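
The percentages in each HSP summary line follow directly from the counts beside them: 227 identical positions out of an alignment length of 508 gives 44.69%. A minimal Python sketch for reading such a line and re-deriving the percentages is shown here. The function names `parse_hsp_summary` and `percent` are hypothetical helpers written for this illustration; they are not part of BLAST or of this database.

```python
import re

# Matches the summary line printed under each "HSP 1 Score" line.
# The pattern accepts either spelling of the "Positives" label.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Extract identity and positive counts from one HSP summary line."""
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


def percent(count, length):
    """Percentage of aligned positions, rounded to two decimals as on the page."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    hsp = parse_hsp_summary(
        "Identity = 227/508 (44.69%), Positives = 297/508 (58.46%), Query Frame = 0"
    )
    print(percent(hsp["identities"], hsp["alignment_length"]))
    print(percent(hsp["positives"], hsp["alignment_length"]))
```

The same check can be run over every hit on the page to confirm that the reported percentages agree with the raw counts.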

BLAST of HG10007441 vs. ExPASy Swiss-Prot
Match: C4LAV8 (Protein adenylyltransferase SelO OS=Tolumonas auensis (strain DSM 9187 / TA4) OX=595494 GN=selO PE=3 SV=1)

HSP 1 Score: 417.5 bits (1072), Expect = 2.3e-115
Identity = 228/519 (43.93%), Positives = 305/519 (58.77%), Query Frame = 0

Query: 37  SASVFERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQSER 96
           S +  ++P +    SG   L G+SP+A CYGGHQFG WAGQLGDGRAI+LGE++++   R
Sbjct: 59  SLAELQQPAWVAALSGNGLLDGMSPFATCYGGHQFGNWAGQLGDGRAISLGELIHNDL-R 118

Query: 97  WELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDM 156
           WELQLKGAG TPYSR  DG AVLRSSIREFLCSEAM  LG+PTTRAL L+ TG  + RDM
Sbjct: 119 WELQLKGAGVTPYSRRGDGKAVLRSSIREFLCSEAMFHLGVPTTRALSLVLTGEQIWRDM 178

Query: 157 FYDGNAKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIRHHFPHLENMS 216
           FYDGN ++EPGAIVCRVA SF+RFG +Q+ A RG+ D  ++  L D+ I   FPHL    
Sbjct: 179 FYDGNPQQEPGAIVCRVAPSFIRFGHFQLPAMRGESD--LLNQLIDFTIDRDFPHL---- 238

Query: 217 SSQSLSFSTGNTDSSVVDLTSNKYAVLLLVVFNFLFLNKKGYTFASVFIAFTWMMVSAWT 276
           S+Q  +   G                                                W 
Sbjct: 239 SAQPATVRRG-----------------------------------------------VWF 298

Query: 277 VEVAERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 336
            EV   TA L+  W  VGF HGV+NTDNMSILGLTIDYGP+G++D FD ++TPNTTD  G
Sbjct: 299 SEVCITTAKLMVEWTRVGFVHGVMNTDNMSILGLTIDYGPYGWVDNFDLNWTPNTTDAEG 358

Query: 337 RRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLP 396
            RYCF  QP I  WN+ + A  L    + +       +E + + F  +  A++  K+G  
Sbjct: 359 LRYCFGRQPAIARWNLERLAEALGTV-MTDHAILAQGIEMFDETFAQEMAAMLAAKLGWQ 418

Query: 397 KY---NKQLIGKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLLDIGKER 456
           ++   + +L+ +L + +   +VD T FFR L+ +  D S P+  +L        D+  + 
Sbjct: 419 QWLPEDSELVNRLFDLLQQAEVDMTLFFRRLALV--DVSAPDLTVLAD-AFYRDDLFCQH 478

Query: 457 KEAWVSWVKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRR 516
           + A+  W+  Y + +   G+   ER A M+ VNP Y+LRNYL Q  IDAAEQG++  +  
Sbjct: 479 QPAFTQWLTNYSQRVLSEGVLPAERAARMNQVNPVYVLRNYLAQQVIDAAEQGNYQPIAE 519

Query: 517 LLKIMERPFDEQPGMEKYARLPPAWA-YRPGVCMLSCSS 552
           LL+++ +P+ EQ G E YA+  P WA ++PG  MLSCSS
Sbjct: 539 LLEVLRQPYTEQSGKEAYAQKRPDWARHKPGCSMLSCSS 519

BLAST of HG10007441 vs. ExPASy TrEMBL
Match: A0A0A0LXE5 (Selenoprotein O OS=Cucumis sativus OX=3659 GN=Csa_1G605720 PE=3 SV=1)

HSP 1 Score: 951.4 bits (2458), Expect = 1.6e-273
Identity = 470/511 (91.98%), Positives = 475/511 (92.95%), Query Frame = 0

Query: 41  FERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQSERWELQ 100
           FERPDFPLLFSGASPLVG SPYAQCYGGHQFGMWAGQLGDGRAITLGEILNS+SERWELQ
Sbjct: 171 FERPDFPLLFSGASPLVGASPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSRSERWELQ 230

Query: 101 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 160
           LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG
Sbjct: 231 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 290

Query: 161 NAKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIRHHFPHLENMSSSQS 220
           N KEEPGAIVCRVAQSFLRFGSYQIHASRGKDD+KIVRALADYVIRHHFPHLENMSSSQS
Sbjct: 291 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDFKIVRALADYVIRHHFPHLENMSSSQS 350

Query: 221 LSFSTGNTDSSVVDLTSNKYAVLLLVVFNFLFLNKKGYTFASVFIAFTWMMVSAWTVEVA 280
           +SFSTGNTDSSVVDLTSNKYA                                AWTVEVA
Sbjct: 351 VSFSTGNTDSSVVDLTSNKYA--------------------------------AWTVEVA 410

Query: 281 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 340
           ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC
Sbjct: 411 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 470

Query: 341 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 400
           FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK
Sbjct: 471 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 530

Query: 401 QLIGKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLLDIGKERKEAWVSW 460
           QLI KLLNNMAVDKVDYTNFFRSLSN+KADPS PEEELLVPLKAVLLDIGKERKEAWVSW
Sbjct: 531 QLISKLLNNMAVDKVDYTNFFRSLSNLKADPSIPEEELLVPLKAVLLDIGKERKEAWVSW 590

Query: 461 VKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMER 520
           VKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVR+LLKIMER
Sbjct: 591 VKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRQLLKIMER 649

Query: 521 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 552
           PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 651 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 649

BLAST of HG10007441 vs. ExPASy TrEMBL
Match: A0A1S3BRJ3 (Selenoprotein O OS=Cucumis melo OX=3656 GN=LOC103492506 PE=3 SV=1)

HSP 1 Score: 948.7 bits (2451), Expect = 1.0e-272
Identity = 470/511 (91.98%), Positives = 473/511 (92.56%), Query Frame = 0

Query: 41  FERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQSERWELQ 100
           FERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNS+SERWELQ
Sbjct: 171 FERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSRSERWELQ 230

Query: 101 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 160
           LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG
Sbjct: 231 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 290

Query: 161 NAKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIRHHFPHLENMSSSQS 220
           N KEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVI HHFPHLENMSSSQS
Sbjct: 291 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIHHHFPHLENMSSSQS 350

Query: 221 LSFSTGNTDSSVVDLTSNKYAVLLLVVFNFLFLNKKGYTFASVFIAFTWMMVSAWTVEVA 280
           +SFSTGNTDSSVVDLTSNKYA                                AWTVEVA
Sbjct: 351 VSFSTGNTDSSVVDLTSNKYA--------------------------------AWTVEVA 410

Query: 281 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 340
           ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC
Sbjct: 411 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 470

Query: 341 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 400
           FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK
Sbjct: 471 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 530

Query: 401 QLIGKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLLDIGKERKEAWVSW 460
           QLI KLLNNMAVDKVDYTNFFRSLSNIKAD S PEEELLVPLKAVLLDIGKERKEAWVSW
Sbjct: 531 QLISKLLNNMAVDKVDYTNFFRSLSNIKADSSIPEEELLVPLKAVLLDIGKERKEAWVSW 590

Query: 461 VKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMER 520
           VKTYMEELAGSGISDEERKASMD VNPKYILRNYLCQTAIDAAEQGDFGEVR+LLKIMER
Sbjct: 591 VKTYMEELAGSGISDEERKASMDVVNPKYILRNYLCQTAIDAAEQGDFGEVRQLLKIMER 649

Query: 521 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 552
           PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 651 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 649

BLAST of HG10007441 vs. ExPASy TrEMBL
Match: A0A5D3DUC9 (Selenoprotein O OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold699G00140 PE=3 SV=1)

HSP 1 Score: 947.2 bits (2447), Expect = 3.1e-272
Identity = 469/511 (91.78%), Positives = 472/511 (92.37%), Query Frame = 0

Query: 41  FERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQSERWELQ 100
           FERPDFPLLFSGASPLVG SPYAQCYGGHQFGMWAGQLGDGRAITLGEILNS+SERWELQ
Sbjct: 171 FERPDFPLLFSGASPLVGASPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSRSERWELQ 230

Query: 101 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 160
           LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG
Sbjct: 231 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 290

Query: 161 NAKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIRHHFPHLENMSSSQS 220
           N KEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVI HHFPHLENMSSSQS
Sbjct: 291 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIHHHFPHLENMSSSQS 350

Query: 221 LSFSTGNTDSSVVDLTSNKYAVLLLVVFNFLFLNKKGYTFASVFIAFTWMMVSAWTVEVA 280
           +SFSTGNTDSSVVDLTSNKYA                                AWTVEVA
Sbjct: 351 VSFSTGNTDSSVVDLTSNKYA--------------------------------AWTVEVA 410

Query: 281 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 340
           ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC
Sbjct: 411 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 470

Query: 341 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 400
           FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK
Sbjct: 471 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 530

Query: 401 QLIGKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLLDIGKERKEAWVSW 460
           QLI KLLNNMAVDKVDYTNFFRSLSNIKAD S PEEELLVPLKAVLLDIGKERKEAWVSW
Sbjct: 531 QLISKLLNNMAVDKVDYTNFFRSLSNIKADSSIPEEELLVPLKAVLLDIGKERKEAWVSW 590

Query: 461 VKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMER 520
           VKTYMEELAGSGISDEERKASMD VNPKYILRNYLCQTAIDAAEQGDFGEVR+LLKIMER
Sbjct: 591 VKTYMEELAGSGISDEERKASMDVVNPKYILRNYLCQTAIDAAEQGDFGEVRQLLKIMER 649

Query: 521 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 552
           PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 651 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 649

BLAST of HG10007441 vs. ExPASy TrEMBL
Match: A0A5A7TL19 (Selenoprotein O OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold824G00700 PE=3 SV=1)

HSP 1 Score: 947.2 bits (2447), Expect = 3.1e-272
Identity = 469/511 (91.78%), Positives = 472/511 (92.37%), Query Frame = 0

Query: 41  FERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQSERWELQ 100
           FERPDFPLLFSGASPLVG SPYAQCYGGHQFGMWAGQLGDGRAITLGEILNS+SERWELQ
Sbjct: 107 FERPDFPLLFSGASPLVGASPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSRSERWELQ 166

Query: 101 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 160
           LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG
Sbjct: 167 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 226

Query: 161 NAKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIRHHFPHLENMSSSQS 220
           N KEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVI HHFPHLENMSSSQS
Sbjct: 227 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIHHHFPHLENMSSSQS 286

Query: 221 LSFSTGNTDSSVVDLTSNKYAVLLLVVFNFLFLNKKGYTFASVFIAFTWMMVSAWTVEVA 280
           +SFSTGNTDSSVVDLTSNKYA                                AWTVEVA
Sbjct: 287 VSFSTGNTDSSVVDLTSNKYA--------------------------------AWTVEVA 346

Query: 281 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 340
           ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC
Sbjct: 347 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 406

Query: 341 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 400
           FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK
Sbjct: 407 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 466

Query: 401 QLIGKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLLDIGKERKEAWVSW 460
           QLI KLLNNMAVDKVDYTNFFRSLSNIKAD S PEEELLVPLKAVLLDIGKERKEAWVSW
Sbjct: 467 QLISKLLNNMAVDKVDYTNFFRSLSNIKADSSIPEEELLVPLKAVLLDIGKERKEAWVSW 526

Query: 461 VKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMER 520
           VKTYMEELAGSGISDEERKASMD VNPKYILRNYLCQTAIDAAEQGDFGEVR+LLKIMER
Sbjct: 527 VKTYMEELAGSGISDEERKASMDVVNPKYILRNYLCQTAIDAAEQGDFGEVRQLLKIMER 585

Query: 521 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 552
           PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 587 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 585

BLAST of HG10007441 vs. ExPASy TrEMBL
Match: A0A6J1DIN2 (Selenoprotein O OS=Momordica charantia OX=3673 GN=LOC111021431 PE=3 SV=1)

HSP 1 Score: 936.0 bits (2418), Expect = 7.0e-269
Identity = 459/511 (89.82%), Positives = 469/511 (91.78%), Query Frame = 0

Query: 41  FERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQSERWELQ 100
           F+RPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQSERWELQ
Sbjct: 172 FQRPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQSERWELQ 231

Query: 101 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 160
           LKGAGKTPYSRFADGLAVLRSSIREFLCSE+MH LGIPTTRALC++TTGT VTRDMFYDG
Sbjct: 232 LKGAGKTPYSRFADGLAVLRSSIREFLCSESMHGLGIPTTRALCVVTTGTLVTRDMFYDG 291

Query: 161 NAKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIRHHFPHLENMSSSQS 220
           N KEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADY I HHFPHLENMSSSQS
Sbjct: 292 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIHHHFPHLENMSSSQS 351

Query: 221 LSFSTGNTDSSVVDLTSNKYAVLLLVVFNFLFLNKKGYTFASVFIAFTWMMVSAWTVEVA 280
           LSFSTGN D SVVDLTSNKYA                                AWTVEVA
Sbjct: 352 LSFSTGNEDDSVVDLTSNKYA--------------------------------AWTVEVA 411

Query: 281 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 340
           ERTASL+ASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS+TPNTTDLPGRRYC
Sbjct: 412 ERTASLVASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYC 471

Query: 341 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 400
           FANQPDIGLWNIAQF+STLSAAELINDKEANYAMERYG KFMDDYQ IMTKKIGLPKYNK
Sbjct: 472 FANQPDIGLWNIAQFSSTLSAAELINDKEANYAMERYGTKFMDDYQTIMTKKIGLPKYNK 531

Query: 401 QLIGKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLLDIGKERKEAWVSW 460
           QLI KLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELL+PLKAVLLDIGKERKEAWVSW
Sbjct: 532 QLISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSW 591

Query: 461 VKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMER 520
           VKTY+EELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMER
Sbjct: 592 VKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMER 650

Query: 521 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 552
           P+DEQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 652 PYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 650

BLAST of HG10007441 vs. TAIR 10
Match: AT5G13030.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Uncharacterised protein family UPF0061 (InterPro:IPR003846); Has 5046 Blast hits to 4997 proteins in 1211 species: Archae - 8; Bacteria - 2327; Metazoa - 120; Fungi - 134; Plants - 48; Viruses - 0; Other Eukaryotes - 2409 (source: NCBI BLink). )

HSP 1 Score: 833.9 bits (2153), Expect = 7.2e-242
Identity = 396/511 (77.50%), Positives = 439/511 (85.91%), Query Frame = 0

Query: 41  FERPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSQSERWELQ 100
           FERPDFPL+ SGA PL G   YAQCYGGHQFGMWAGQLGDGRAITLGE+LNS+ ERWELQ
Sbjct: 155 FERPDFPLMLSGAKPLPGAMSYAQCYGGHQFGMWAGQLGDGRAITLGEVLNSKGERWELQ 214

Query: 101 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGTFVTRDMFYDG 160
           LKGAG+TPYSRFADGLAVLRSSIREFLCSE MH LGIPTTRALCLLTTG  VTRDMFYDG
Sbjct: 215 LKGAGRTPYSRFADGLAVLRSSIREFLCSETMHCLGIPTTRALCLLTTGQNVTRDMFYDG 274

Query: 161 NAKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYVIRHHFPHLENMSSSQS 220
           N KEEPGAIVCRV+QSFLRFGSYQIHASRGK+D  IVR LADY I+HHFPH+E+M  S S
Sbjct: 275 NPKEEPGAIVCRVSQSFLRFGSYQIHASRGKEDLDIVRKLADYAIKHHFPHIESMDRSDS 334

Query: 221 LSFSTGNTDSSVVDLTSNKYAVLLLVVFNFLFLNKKGYTFASVFIAFTWMMVSAWTVEVA 280
           LSF TG+ D SVVDLTSNKYA                                AW VE+A
Sbjct: 335 LSFKTGDEDDSVVDLTSNKYA--------------------------------AWIVEIA 394

Query: 281 ERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 340
           ERTA+L+A WQGVGFTHGVLNTDNMSILG TIDYGPFGFLDAFDPS+TPNTTDLPGRRYC
Sbjct: 395 ERTATLVARWQGVGFTHGVLNTDNMSILGQTIDYGPFGFLDAFDPSYTPNTTDLPGRRYC 454

Query: 341 FANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNK 400
           FANQPDIGLWNIAQF+ TL+ A+LIN KEANYAMERYGDKFMD+YQAIM+KK+GL KYNK
Sbjct: 455 FANQPDIGLWNIAQFSKTLAVAQLINQKEANYAMERYGDKFMDEYQAIMSKKLGLTKYNK 514

Query: 401 QLIGKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLVPLKAVLLDIGKERKEAWVSW 460
           ++I KLLNNM+VDKVDYTNFFR L+N+KA+P+TPE ELL PLKAVLLDIGKERKEAW+ W
Sbjct: 515 EVISKLLNNMSVDKVDYTNFFRLLANVKANPNTPENELLKPLKAVLLDIGKERKEAWIKW 574

Query: 461 VKTYMEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMER 520
           +++Y++E+ GS +SDEERKA MD+VNPKYILRNYLCQ+AIDAAEQGDF EV  L+++M+R
Sbjct: 575 MRSYIQEVGGSEVSDEERKARMDSVNPKYILRNYLCQSAIDAAEQGDFSEVNNLIRLMKR 633

Query: 521 PFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 552
           P++EQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 635 PYEEQPGMEKYARLPPAWAYRPGVCMLSCSS 633

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038878878.1  2.0e-273  91.98         protein adenylyltransferase SelO [Benincasa hispida]
XP_004149028.1  3.3e-273  91.98         uncharacterized protein LOC101218327 [Cucumis sativus] >KGN66443.1 hypothetical ...
XP_008451128.1  2.2e-272  91.98         PREDICTED: UPF0061 protein azo1574 [Cucumis melo]
TYK26910.1      6.3e-272  91.78         UPF0061 protein [Cucumis melo var. makuwa]
KAA0042281.1    6.3e-272  91.78         UPF0061 protein [Cucumis melo var. makuwa]

Match Name      E-value   Identity (%)  Description
A1K5T6          9.6e-122  46.62         Protein adenylyltransferase SelO OS=Azoarcus sp. (strain BH72) OX=418699 GN=selO...
Q5NYD9          1.9e-117  44.64         Protein adenylyltransferase SelO OS=Aromatoleum aromaticum (strain EbN1) OX=7611...
Q7UKT5          1.9e-117  44.03         Protein adenylyltransferase SelO OS=Rhodopirellula baltica (strain DSM 10527 / N...
Q1H0D2          3.2e-117  44.69         Protein adenylyltransferase SelO OS=Methylobacillus flagellatus (strain KT / ATC...
C4LAV8          2.3e-115  43.93         Protein adenylyltransferase SelO OS=Tolumonas auensis (strain DSM 9187 / TA4) OX...

Match Name      E-value   Identity (%)  Description
A0A0A0LXE5      1.6e-273  91.98         Selenoprotein O OS=Cucumis sativus OX=3659 GN=Csa_1G605720 PE=3 SV=1
A0A1S3BRJ3      1.0e-272  91.98         Selenoprotein O OS=Cucumis melo OX=3656 GN=LOC103492506 PE=3 SV=1
A0A5D3DUC9      3.1e-272  91.78         Selenoprotein O OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold699G0014...
A0A5A7TL19      3.1e-272  91.78         Selenoprotein O OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold824G0070...
A0A6J1DIN2      7.0e-269  89.82         Selenoprotein O OS=Momordica charantia OX=3673 GN=LOC111021431 PE=3 SV=1

Match Name      E-value   Identity (%)  Description
AT5G13030.1     7.2e-242  77.50         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                   Source   Source Term     Source Description                               Alignment
IPR003846  Protein adenylyltransferase SelO  PFAM     PF02696         UPF0061                                          coord: 44..517; e-value: 7.3E-124; score: 414.1
IPR003846  Protein adenylyltransferase SelO  PANTHER  PTHR32057       PROTEIN ADENYLYLTRANSFERASE SELO, MITOCHONDRIAL  coord: 270..551
IPR003846  Protein adenylyltransferase SelO  PANTHER  PTHR32057       PROTEIN ADENYLYLTRANSFERASE SELO, MITOCHONDRIAL  coord: 40..243
IPR003846  Protein adenylyltransferase SelO  HAMAP    MF_00692        SelO                                             coord: 1..543; score: 33.839348
None       No IPR available                  PANTHER  PTHR32057:SF15  UPF0061 PROTEIN AZO1574-LIKE                     coord: 270..551
None       No IPR available                  PANTHER  PTHR32057:SF15  UPF0061 PROTEIN AZO1574-LIKE                     coord: 40..243
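
The `coord:` values in the InterPro rows are 1-based, inclusive residue ranges on the predicted protein, so the length of a match is `end - start + 1`. The PANTHER hit is reported as two segments (40..243 and 270..551), leaving a short stretch between them unmatched. A small Python sketch of that arithmetic follows; `parse_coord`, `span_length` and `gap_between` are hypothetical helper names introduced only for this illustration.

```python
import re


def parse_coord(text):
    """Return (start, end) from a string such as 'coord: 44..517'."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError("no coordinate range in %r" % text)
    return int(match.group(1)), int(match.group(2))


def span_length(start, end):
    """Number of residues in a 1-based, inclusive range."""
    return end - start + 1


def gap_between(first, second):
    """Residues lying strictly between two segments, or None if they touch or overlap."""
    (start1, end1), (start2, end2) = sorted([first, second])
    if start2 <= end1 + 1:
        return None
    return end1 + 1, start2 - 1


if __name__ == "__main__":
    pfam = parse_coord("coord: 44..517")
    panther_a = parse_coord("coord: 270..551")
    panther_b = parse_coord("coord: 40..243")
    print(span_length(*pfam))
    print(gap_between(panther_a, panther_b))
```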

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10007441.1  HG10007441.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005524      ATP binding
molecular_function  GO:0046872      metal ion binding
molecular_function  GO:0016779      nucleotidyltransferase activity