HG10022874 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10022874
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionFe2OG dioxygenase domain-containing protein
LocationChr05: 29171766 .. 29176412 (+)
RNA-Seq ExpressionHG10022874
SyntenyHG10022874
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGGCGGGGGCGACTGATCGAGCGCGGCCGGTGGTGATGCCGGCGGCGGCGGCGATGACGGTGACGGACACAATAGCGAAGGACGCTGTGTTGGGATGGTTCAGAGGGGAGTTCGCGGCGGCGAATGCGATAATAGATGCGTTGTGTGGGCATCTGGCGCAGGTGAGTGAAAGTGGAGGATCGGAGTATGAAGCAGTGTTTGGTGCGATTCATAGACGGCGGTTGAATTGGATCCCGGTCCTGCAAATGCAGAAGTATCATCCGATCGCAGACGTCGCGGTGGAGCTTCGGAAAGTGACGGCGAAGAAGAATAATAATAAGAATCAGGAAGAGGTGAAAGGAGGCGAGGTGGAGGCCGTGGCGGTGGCTGAGGGCGAAGGCGAAGGCGATGTTGAAATGGAGGTGAAGAAGATGAGTGAAGAGGATGAGAAAGAATTTGTCGAAGAAGAAATGAATAATGGAAAATTGAAGATCGAGGAGATGTCGATTGAGATTAACGAAACTGATGGCGGAAGAAATGAGGTTTTGCCTCCCATTGAAGAAGAAGATTCCATTGGAAGCGAGATAACTGATTCAGGTAAGTATAATTCTAGTTTCTTTCTTTTCATCAATCAATCAAAATCTGTAATCCATTTTTTCAATTCTGTCGATTTCTGTTGATCTCGGAACTTGAACGCTATCTCAACTGTTCACGATCCGAAAACTTTTGATCGATCCTCCGAAATTTTCTCAAAATTCCGTCTCTGTCTGTGTCTGATGAACAGTGAAATCAAAATTCTGAATCCAGAAAATTTGTCAAAAGTGCATTTCTCAACTGTTCGAGAAATCAACAAAAAAAGCTAATAATTCTTTTCCACATTCAAAGGATTCAAGAGTCAGATCCGTGATTTTAGTCTTTACACAAACCCTGTTTCGTCAAATTTGTTCAGCTACGGATTTGATTTCTTTCTTTTTCTTTTACAATTACCTTCCAATTCATCCAAAAATTTAAAGTACTCATAAATTCAAAATCATTACGTTTTTTTATCCCATCGCTTCATCTTTACGCGAAAATTCTTTCCAAATCAAAATTTCTTAAATCCAATCGAAAATTGACTGCTAGTTGAATATCTTCCCGAGGTTTAAGTGGGGGTTTGAGAATTTCGAGGTGAAGGAAGAAAAAGTTAAGCTGTTACAAGATTCTACAGTTAGTTCCGGAATGACGAATACTGTCAGATACACAATGCCTCGAGATGGAAGCTTTGTTTAATCAAAATCAATACCCATCTTTCAATCTCAGCCCTCCATTCTATCCATAACTAAAATAAAATTAAAAAAAAAAAACCCTTTCTCCTATATTATTATTATTTGGGGGTAGGTTGAAGGGCACTTTTTTGACAAATTTCATTTGTTTTTTAAAAAAAAAAAATTCCTTTGACGACAAATTTAATATTAATTGTATCCTAGATCGTGGGTCTTTCCTATACTCTTTTTATCACTTGTAATTGGCGTATACTCTTTTTTTTTATGAACAGTAATTCAGATATTTATACTTGTAATTCAGATATCTATACTTCATATTTATTGAGAGATTCGAACTTCTAACTTGATATCAATAGTATAAATTTTATGTAATATCTAATAGTGGTGGGTAAAAAATGTCTATGACTATGTGATGACCGCTAGAGCATAAAAATTGTCACGGACAAGGAGTTGACTAAAAAGAGGCGCAATGAAAAATTAAAAACACCATCTGGCTCCATTTCTGACCATTATTTTAAATTGCACAACTCATTAAATGCATTTATTTAGTTATTTGTTAAGTAATTTATTGAGGCTGAGATGTAAATGTTTCTGTTTTCTTTTGGGAATTTTTTAAAAAATAAAATAAAATATTGGGGCAGGATCTCAAGGAGGTGGAGTTCAGGCCAATTATGCAGAAGTTGAGATTTGCTCTAACCATGAAGAGTGTGAAGCACGTCCAGGGCAGATGAAGTTGACAAAAGGTTTTTCCGCCAAGGAGCCAGTAAAAGGCCACATGGCGAGTTCTTTCTTTTCCATTGAAATTTATTTAGTGGCCCCCCTTAATTCAATTCCCTTTCTGTAAAAATTGGCAGATTTTTCTCTCTATCATTCAATAAACGTTTCAATAGTAATTAACATGTGTTTCTACTTTTAGATTCAAAATTTTCAAATCTTTCATACCTCACACTTGTTGTACAACGATCTGAGTCAATCAAATTATATATATATATATATATATATATATATATATATATATATATATTTGTGGTAATTTAGATTTGAAATTTTCTCGAAGTTTGGTTTGGGACTGATTGTTTCATTCATTTGGTGACCCATGGTGGCAGGTGAATGTTGTGAAAGGATTGAAGTGTTATGAAGATATTTTCACCCAGTCTGAATTGGCCAGGTTGAATGATTTTGTTGATGACCTTCGTTCTGCTGCAAACAATGGGGAGCTCTCTGGTTAGTCTCTTTCATTTTATCTTTATTTGATTCTTTTTCTATTATTGATTTTGGGTCTTTTAACAGTTGTGTATGCCCTGTTTTTTTAAAGTATAAATTATGTGAGAACATGTTTTTGTTAATTGCTATTTTAAAGAGGTCCATCTTTTACTTAATTCTTTTGGATGTGTGTACTATCTTTAACATTTCTTGAAATCTATTCTAATATGGACACAATATCCTTCCTGTACTCTGTACCCATATTGTCCTACCATGTGTTTGTTATTCTCAAGGGTTTAAAATGATGATATCCACAACATTAAAGATAAAATAAAATACTGAAGTCCTAAATTGACAAAGACTTTATATTGGTGTCAACAAAATTTCATGAAAAAAACCAAAGGAACGAAGAAATTTATGTATAAAGAAATAAAATAAAATTTATATACGTGTTTTATTTTTCTTAAATAAGTTGTACTGTCTTTATTATGAATGTCAAATCCTTTCATTTATCAAAATATTGGTAGGAATATTCATCGATCTTTTATTGTTGGTTCTATTATATTAGAGAAAATGTATGAAATGACTAGTTTGAAGGCCTTTCTCTAACTTATTTTATTAATCAAACTACTATTTTATTTTCTTATCCATGTATGGGTTCTTGAAACAACGTGTTGGTTTGTTAACAAAGCTAAAATTATAAGATGTTTGTATTGGTTCAGAATCAATTATAAGATGTTTGTATTGGTTCAGAATCTGAATTGAATCTTCTTGACGTGTGCAGGAGACACATTTGTTTTATTCAATAAGCAGGTGAAAGGCAACCGGCGAGAGATGATCCAGCTTGGCGTGCCCATTTTTGGACAGATAAAAGATGATTCAACCAATATCAGCCAAACAAGTATGACATAAACTCAACTCATCCAATTGTTTAAGGTCGTTGAAACCGATGTAGGCTACTTTCGTATAAAATGAACTAAACTATAATATTGCCGACCTGAACATAATTCAATAAGTTAAAACATTAGTAACATCTATTTGTGAATGCTCATCTTCACATAATATTTTTGAAAAGAAAAGAAAAACAAAGAAAATAGCTGAGTAATGATATTCTCCATGCTTACAGGCAACATAGAGCCAATTCCACCTCTTCTTATGACGGTCATAGATCATCTCATTCAGTGGCAACTGATTCCAGAGTACAAAAGACCAAATGGATGTCTTTTCAATTTCTTTGAAGAGGTATTCTATTTCTAAACAACGTTGATTTGTCTACAGTGAGACCTATATTCAACTTACCATTTTGGTCTGAAACCAAAAATTTGATTTCTTTGATGATCACTGAAATTGGTAAGTAATTAAACTGTAGGGTGAGTACTCACAGCCATTCCAGAAACCTCCACACTTGGAACAACCAATTTCCACTCTCTTCCTTTCTGAATCAACTATGGCTTTTGGTCGTTCTATTGTCAGTGATAACGAAGGCAACTATAAGGGGCCACTCATGCTGTCCTTGAAGGAAGGGTATGCCATTTAAACTCTGCTATCTTTTTCTCGATTCATTAATGACCTTTTGTTATAGTTTCTCGTACTCGAATCATGACCTTTTGTTATAGTTTCTCGTACTCGAATCATGACGCTTCAAAACTGATATACATGTTTTCACAGTGTTGTAGCCTTGTTTGAGTTGTATTGATGATAATATGACAAGCAATGTAATTTTACAGGTCTCTTTTGGTCATGAGAGGCAACAGTGCAGATGTTGCACGCCATGTCATGTGTGCATCTCCTAACAAAAGAGTCACCATCACGTTCTTCCGAGTTCGGCCAGACTACGATCAATGCCAATCACCAACTCCTCAGATGTCGAACGCCATGACTCTATGGCAACCGGGAGTTGCAGCCGCATGTGCCTTGCCTAATGGAGCCACCTACGGCTATGAAGCAATGGAGGTAATGCCAAAATGGGGGATCCTTCGCGCACCGGTGGTCATGTTAGCTCCTGTTCGCCCTATGGTGATGAGCCCTGGAAGATCTCAACGTGATGGCACTGGAGTGTTCTTACCATGGGCTGTTAATTCAAGAAAACCAGCTAAACATCTTCCTCCCCGTGCTCGAAAAGGACGGTTCCTTGCATTACCTTCCGCTGTCGAAACTCGTCTACCAGACTCATCTCACGAGCCAGGCATAAGTGTTTGA

mRNA sequence

ATGGCGGCGGGGGCGACTGATCGAGCGCGGCCGGTGGTGATGCCGGCGGCGGCGGCGATGACGGTGACGGACACAATAGCGAAGGACGCTGTGTTGGGATGGTTCAGAGGGGAGTTCGCGGCGGCGAATGCGATAATAGATGCGTTGTGTGGGCATCTGGCGCAGGTGAGTGAAAGTGGAGGATCGGAGTATGAAGCAGTGTTTGGTGCGATTCATAGACGGCGGTTGAATTGGATCCCGGTCCTGCAAATGCAGAAGTATCATCCGATCGCAGACGTCGCGGTGGAGCTTCGGAAAGTGACGGCGAAGAAGAATAATAATAAGAATCAGGAAGAGGTGAAAGGAGGCGAGGTGGAGGCCGTGGCGGTGGCTGAGGGCGAAGGCGAAGGCGATGTTGAAATGGAGGTGAAGAAGATGAGTGAAGAGGATGAGAAAGAATTTGTCGAAGAAGAAATGAATAATGGAAAATTGAAGATCGAGGAGATGTCGATTGAGATTAACGAAACTGATGGCGGAAGAAATGAGGTTTTGCCTCCCATTGAAGAAGAAGATTCCATTGGAAGCGAGATAACTGATTCAGGATCTCAAGGAGGTGGAGTTCAGGCCAATTATGCAGAAGTTGAGATTTGCTCTAACCATGAAGAGTGTGAAGCACGTCCAGGGCAGATGAAGTTGACAAAAGGTTTTTCCGCCAAGGAGCCAGTGAATGTTGTGAAAGGATTGAAGTGTTATGAAGATATTTTCACCCAGTCTGAATTGGCCAGGTTGAATGATTTTGTTGATGACCTTCGTTCTGCTGCAAACAATGGGGAGCTCTCTGGAGACACATTTGTTTTATTCAATAAGCAGGTGAAAGGCAACCGGCGAGAGATGATCCAGCTTGGCGTGCCCATTTTTGGACAGATAAAAGATGATTCAACCAATATCAGCCAAACAAGCAACATAGAGCCAATTCCACCTCTTCTTATGACGGTCATAGATCATCTCATTCAGTGGCAACTGATTCCAGAGTACAAAAGACCAAATGGATGTCTTTTCAATTTCTTTGAAGAGGGTGAGTACTCACAGCCATTCCAGAAACCTCCACACTTGGAACAACCAATTTCCACTCTCTTCCTTTCTGAATCAACTATGGCTTTTGGTCGTTCTATTGTCAGTGATAACGAAGGCAACTATAAGGGGCCACTCATGCTGTCCTTGAAGGAAGGGTCTCTTTTGGTCATGAGAGGCAACAGTGCAGATGTTGCACGCCATGTCATGTGTGCATCTCCTAACAAAAGAGTCACCATCACGTTCTTCCGAGTTCGGCCAGACTACGATCAATGCCAATCACCAACTCCTCAGATGTCGAACGCCATGACTCTATGGCAACCGGGAGTTGCAGCCGCATGTGCCTTGCCTAATGGAGCCACCTACGGCTATGAAGCAATGGAGGTAATGCCAAAATGGGGGATCCTTCGCGCACCGGTGGTCATGTTAGCTCCTGTTCGCCCTATGGTGATGAGCCCTGGAAGATCTCAACGTGATGGCACTGGAGTGTTCTTACCATGGGCTGTTAATTCAAGAAAACCAGCTAAACATCTTCCTCCCCGTGCTCGAAAAGGACGGTTCCTTGCATTACCTTCCGCTGTCGAAACTCGTCTACCAGACTCATCTCACGAGCCAGGCATAAGTGTTTGA

Coding sequence (CDS)

ATGGCGGCGGGGGCGACTGATCGAGCGCGGCCGGTGGTGATGCCGGCGGCGGCGGCGATGACGGTGACGGACACAATAGCGAAGGACGCTGTGTTGGGATGGTTCAGAGGGGAGTTCGCGGCGGCGAATGCGATAATAGATGCGTTGTGTGGGCATCTGGCGCAGGTGAGTGAAAGTGGAGGATCGGAGTATGAAGCAGTGTTTGGTGCGATTCATAGACGGCGGTTGAATTGGATCCCGGTCCTGCAAATGCAGAAGTATCATCCGATCGCAGACGTCGCGGTGGAGCTTCGGAAAGTGACGGCGAAGAAGAATAATAATAAGAATCAGGAAGAGGTGAAAGGAGGCGAGGTGGAGGCCGTGGCGGTGGCTGAGGGCGAAGGCGAAGGCGATGTTGAAATGGAGGTGAAGAAGATGAGTGAAGAGGATGAGAAAGAATTTGTCGAAGAAGAAATGAATAATGGAAAATTGAAGATCGAGGAGATGTCGATTGAGATTAACGAAACTGATGGCGGAAGAAATGAGGTTTTGCCTCCCATTGAAGAAGAAGATTCCATTGGAAGCGAGATAACTGATTCAGGATCTCAAGGAGGTGGAGTTCAGGCCAATTATGCAGAAGTTGAGATTTGCTCTAACCATGAAGAGTGTGAAGCACGTCCAGGGCAGATGAAGTTGACAAAAGGTTTTTCCGCCAAGGAGCCAGTGAATGTTGTGAAAGGATTGAAGTGTTATGAAGATATTTTCACCCAGTCTGAATTGGCCAGGTTGAATGATTTTGTTGATGACCTTCGTTCTGCTGCAAACAATGGGGAGCTCTCTGGAGACACATTTGTTTTATTCAATAAGCAGGTGAAAGGCAACCGGCGAGAGATGATCCAGCTTGGCGTGCCCATTTTTGGACAGATAAAAGATGATTCAACCAATATCAGCCAAACAAGCAACATAGAGCCAATTCCACCTCTTCTTATGACGGTCATAGATCATCTCATTCAGTGGCAACTGATTCCAGAGTACAAAAGACCAAATGGATGTCTTTTCAATTTCTTTGAAGAGGGTGAGTACTCACAGCCATTCCAGAAACCTCCACACTTGGAACAACCAATTTCCACTCTCTTCCTTTCTGAATCAACTATGGCTTTTGGTCGTTCTATTGTCAGTGATAACGAAGGCAACTATAAGGGGCCACTCATGCTGTCCTTGAAGGAAGGGTCTCTTTTGGTCATGAGAGGCAACAGTGCAGATGTTGCACGCCATGTCATGTGTGCATCTCCTAACAAAAGAGTCACCATCACGTTCTTCCGAGTTCGGCCAGACTACGATCAATGCCAATCACCAACTCCTCAGATGTCGAACGCCATGACTCTATGGCAACCGGGAGTTGCAGCCGCATGTGCCTTGCCTAATGGAGCCACCTACGGCTATGAAGCAATGGAGGTAATGCCAAAATGGGGGATCCTTCGCGCACCGGTGGTCATGTTAGCTCCTGTTCGCCCTATGGTGATGAGCCCTGGAAGATCTCAACGTGATGGCACTGGAGTGTTCTTACCATGGGCTGTTAATTCAAGAAAACCAGCTAAACATCTTCCTCCCCGTGCTCGAAAAGGACGGTTCCTTGCATTACCTTCCGCTGTCGAAACTCGTCTACCAGACTCATCTCACGAGCCAGGCATAAGTGTTTGA

Protein sequence

MAAGATDRARPVVMPAAAAMTVTDTIAKDAVLGWFRGEFAAANAIIDALCGHLAQVSESGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAKKNNNKNQEEVKGGEVEAVAVAEGEGEGDVEMEVKKMSEEDEKEFVEEEMNNGKLKIEEMSIEINETDGGRNEVLPPIEEEDSIGSEITDSGSQGGGVQANYAEVEICSNHEECEARPGQMKLTKGFSAKEPVNVVKGLKCYEDIFTQSELARLNDFVDDLRSAANNGELSGDTFVLFNKQVKGNRREMIQLGVPIFGQIKDDSTNISQTSNIEPIPPLLMTVIDHLIQWQLIPEYKRPNGCLFNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMTLWQPGVAAACALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALPSAVETRLPDSSHEPGISV
Homology
BLAST of HG10022874 vs. NCBI nr
Match: XP_038896503.1 (RNA demethylase ALKBH10B [Benincasa hispida])

HSP 1 Score: 1025.4 bits (2650), Expect = 1.8e-295
Identity = 528/576 (91.67%), Postives = 544/576 (94.44%), Query Frame = 0

Query: 1   MAAGATDRARPVVMPAAAAMTVTDTIAKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG 60
           MAAGATDRARPVVMPAAAAMTVTDT+AKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG
Sbjct: 3   MAAGATDRARPVVMPAAAAMTVTDTLAKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG 62

Query: 61  GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKV-TAKKNNNK--NQEEVKGGE 120
           GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKV TAKK NNK   +EEVKG E
Sbjct: 63  GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTTAKKKNNKIQEEEEVKGDE 122

Query: 121 VEAVAVAEGEGEGDVEMEVKKMS--------EEDEKEFVEEEMNNGKLKIEEMSIEINET 180
           VEAVAVAEG+G G++EMEVKKMS        EEDEKE+VEEE NNGKLKIE+MSIEINET
Sbjct: 123 VEAVAVAEGDG-GEIEMEVKKMSEEDEKEYVEEDEKEYVEEETNNGKLKIEDMSIEINET 182

Query: 181 DGGRNEVLPPIEEEDSIGSEITDSGSQGGGVQANYAEVEICSNHEECEARPGQMKLTKGF 240
           DGGRNEVLPPIEEEDSIGSEITDSGSQ GGVQANYAEVEICSNHEECEARPGQMKLTKGF
Sbjct: 183 DGGRNEVLPPIEEEDSIGSEITDSGSQEGGVQANYAEVEICSNHEECEARPGQMKLTKGF 242

Query: 241 SAKEP-----VNVVKGLKCYEDIFTQSELARLNDFVDDLRSAANNGELSGDTFVLFNKQV 300
           SAKEP     VNVVKGLKCYED+FTQSELARLNDFVDDLRSA NNGELSG+TFVLFN QV
Sbjct: 243 SAKEPVKGHMVNVVKGLKCYEDVFTQSELARLNDFVDDLRSAGNNGELSGETFVLFNTQV 302

Query: 301 KGNRREMIQLGVPIFGQIKDDSTNISQTS-NIEPIPPLLMTVIDHLIQWQLIPEYKRPNG 360
           KG+RRE+IQ GVPIF QI++DS N SQTS NIEPIPP+L+TVIDHLIQWQLIPEYKRPNG
Sbjct: 303 KGSRREIIQFGVPIFRQIRNDSANNSQTSKNIEPIPPILVTVIDHLIQWQLIPEYKRPNG 362

Query: 361 CLFNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEG 420
           CLFNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEG
Sbjct: 363 CLFNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEG 422

Query: 421 SLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMTLWQPGVAAA 480
           SLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMTLWQPGVA A
Sbjct: 423 SLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMTLWQPGVAGA 482

Query: 481 CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNSRK 540
           CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVN+RK
Sbjct: 483 CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNTRK 542

Query: 541 PAKHLPPRARKGRFLALPSAVETRLPDSSHEPGISV 560
           PAKHLPPRARKGRFLALP AVET LPDSSHEPGISV
Sbjct: 543 PAKHLPPRARKGRFLALPPAVETHLPDSSHEPGISV 577

BLAST of HG10022874 vs. NCBI nr
Match: XP_008451400.1 (PREDICTED: uncharacterized protein LOC103492703 [Cucumis melo])

HSP 1 Score: 1009.2 bits (2608), Expect = 1.4e-290
Identity = 524/576 (90.97%), Postives = 535/576 (92.88%), Query Frame = 0

Query: 1   MAAGATDRARPVVMPAAAAMTVTDTIAKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG 60
           MAAGATDR RPVVMPAAAAMTVTDT+AKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG
Sbjct: 3   MAAGATDRGRPVVMPAAAAMTVTDTLAKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG 62

Query: 61  GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTA--KKNNNKNQ---EEVKG 120
           GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTA  KK  NKNQ   EEVKG
Sbjct: 63  GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAAKKKKMNKNQEEEEEVKG 122

Query: 121 GEVEA--VAVAEGEGEGDVEMEVKKMSEEDEKEFVEEEMNNGKLKIEEMSIEINETDGGR 180
           GEVEA  VA A  EG+GDVEME KKMSEEDEKEFVEEE N+G LKIEE+SIEINE DGGR
Sbjct: 123 GEVEAVEVAAAVAEGDGDVEMEGKKMSEEDEKEFVEEETNDGNLKIEEISIEINEIDGGR 182

Query: 181 NEVLPPIEEEDSIGSEITDSGSQGGG-----VQANYAEVEICSNHEECEARPGQMKLTKG 240
           NEVL PIEEEDSIGSEITDSGSQGGG     VQANYA+VEICSNHEECEARPGQMKLTKG
Sbjct: 183 NEVLAPIEEEDSIGSEITDSGSQGGGGGEEEVQANYADVEICSNHEECEARPGQMKLTKG 242

Query: 241 FSAKEP-----VNVVKGLKCYEDIFTQSELARLNDFVDDLRSAANNGELSGDTFVLFNKQ 300
           FSAKEP     VNVVKGLKCYEDIFTQSELARLNDFVD LRSAANNGELSG TF+LFNKQ
Sbjct: 243 FSAKEPVKGHMVNVVKGLKCYEDIFTQSELARLNDFVDGLRSAANNGELSGGTFILFNKQ 302

Query: 301 VKGNRREMIQLGVPIFGQIKDDSTNISQTSNIEPIPPLLMTVIDHLIQWQLIPEYKRPNG 360
           VKG+RREMIQLGVPIF QI ++S N SQTSNIEPIP +LMTVIDHLIQWQLIPEYKRPNG
Sbjct: 303 VKGSRREMIQLGVPIFRQIGEESGNNSQTSNIEPIPHILMTVIDHLIQWQLIPEYKRPNG 362

Query: 361 CLFNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEG 420
           CLFNFFEEGEYSQPFQKPPHLEQPISTL LSESTMAFGRSIVSDNEGNYKGPL LSLKEG
Sbjct: 363 CLFNFFEEGEYSQPFQKPPHLEQPISTLVLSESTMAFGRSIVSDNEGNYKGPLTLSLKEG 422

Query: 421 SLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMTLWQPGVAAA 480
           SLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMTLWQP VA  
Sbjct: 423 SLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMTLWQPTVAGT 482

Query: 481 CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNSRK 540
           CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVN+RK
Sbjct: 483 CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNTRK 542

Query: 541 PAKHLPPRARKGRFLALPSAVETRLPDSSHEPGISV 560
           PAKHLPPRARKGRFLALP AVETRLPDSSHEPGISV
Sbjct: 543 PAKHLPPRARKGRFLALPPAVETRLPDSSHEPGISV 578

BLAST of HG10022874 vs. NCBI nr
Match: TYK21007.1 (uncharacterized protein E5676_scaffold328G00370 [Cucumis melo var. makuwa])

HSP 1 Score: 1006.5 bits (2601), Expect = 8.9e-290
Identity = 523/576 (90.80%), Postives = 534/576 (92.71%), Query Frame = 0

Query: 1   MAAGATDRARPVVMPAAAAMTVTDTIAKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG 60
           MAAGATDR RPVVMPAAAAMTVTDT+AKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG
Sbjct: 3   MAAGATDRGRPVVMPAAAAMTVTDTLAKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG 62

Query: 61  GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTA--KKNNNKNQ---EEVKG 120
           GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTA  KK  NKNQ   EEVKG
Sbjct: 63  GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAAKKKKMNKNQEEEEEVKG 122

Query: 121 GEVEA--VAVAEGEGEGDVEMEVKKMSEEDEKEFVEEEMNNGKLKIEEMSIEINETDGGR 180
           GEVEA  VA A  EG+GDVEME KKMSEEDEKEFVEEE N+G LKIEE+SIEINE DGGR
Sbjct: 123 GEVEAVEVAAAVAEGDGDVEMEGKKMSEEDEKEFVEEETNDGNLKIEEISIEINEIDGGR 182

Query: 181 NEVLPPIEEEDSIGSEITDSGSQGGG-----VQANYAEVEICSNHEECEARPGQMKLTKG 240
           NEVL PIEEEDSIGSEITDSGSQGGG     VQANYA+VEICSNHEECEARPG MKLTKG
Sbjct: 183 NEVLAPIEEEDSIGSEITDSGSQGGGGGEEEVQANYADVEICSNHEECEARPGLMKLTKG 242

Query: 241 FSAKEP-----VNVVKGLKCYEDIFTQSELARLNDFVDDLRSAANNGELSGDTFVLFNKQ 300
           FSAKEP     VNVVKGLKCYEDIFTQSELARLNDFVD LRSAANNGELSG TF+LFNKQ
Sbjct: 243 FSAKEPVKGHMVNVVKGLKCYEDIFTQSELARLNDFVDGLRSAANNGELSGGTFILFNKQ 302

Query: 301 VKGNRREMIQLGVPIFGQIKDDSTNISQTSNIEPIPPLLMTVIDHLIQWQLIPEYKRPNG 360
           VKG+RREMIQLGVPIF QI ++S N SQTSNIEPIP +LMTVIDHLIQWQLIPEYKRPNG
Sbjct: 303 VKGSRREMIQLGVPIFRQIGEESGNNSQTSNIEPIPHILMTVIDHLIQWQLIPEYKRPNG 362

Query: 361 CLFNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEG 420
           CLFNFFEEGEYSQPFQKPPHLEQPISTL LSESTMAFGRSIVSDNEGNYKGPL LSLKEG
Sbjct: 363 CLFNFFEEGEYSQPFQKPPHLEQPISTLVLSESTMAFGRSIVSDNEGNYKGPLTLSLKEG 422

Query: 421 SLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMTLWQPGVAAA 480
           SLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMTLWQP VA  
Sbjct: 423 SLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMTLWQPTVAGT 482

Query: 481 CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNSRK 540
           CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVN+RK
Sbjct: 483 CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNTRK 542

Query: 541 PAKHLPPRARKGRFLALPSAVETRLPDSSHEPGISV 560
           PAKHLPPRARKGRFLALP AVETRLPDSSHEPGISV
Sbjct: 543 PAKHLPPRARKGRFLALPPAVETRLPDSSHEPGISV 578

BLAST of HG10022874 vs. NCBI nr
Match: XP_011659328.2 (RNA demethylase ALKBH10B [Cucumis sativus] >KAE8646325.1 hypothetical protein Csa_016374 [Cucumis sativus])

HSP 1 Score: 989.6 bits (2557), Expect = 1.1e-284
Identity = 522/597 (87.44%), Postives = 536/597 (89.78%), Query Frame = 0

Query: 1   MAAGATDRARPVVMPAAAAMTVTDTIAKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG 60
           MAAGAT+RARPVVMPAAAAMTVTDT+AKDAVLGWFRGEFAAANAIIDALCGH+AQVSESG
Sbjct: 3   MAAGATERARPVVMPAAAAMTVTDTLAKDAVLGWFRGEFAAANAIIDALCGHMAQVSESG 62

Query: 61  GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTA--KKNNNKNQ-EEVK-GG 120
           GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTA  KK  NKNQ EEVK GG
Sbjct: 63  GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAAKKKKMNKNQEEEVKGGG 122

Query: 121 EVEAVAVA----EGEGEGDVEMEVKKMSEEDEKEF------------------------V 180
           EVEAV VA    +G+G GDVEMEVKKMSEEDEKEF                        V
Sbjct: 123 EVEAVEVALAEGDGDGYGDVEMEVKKMSEEDEKEFVEEDEKEFVEEDEKEFVEEDEKEIV 182

Query: 181 EEEMNNGKLKIEEMSIEINETDGGRNEVLPPIEEEDSIGSEITDSGSQGG-GVQANYAEV 240
           EEE N+GKLKIEE+SIEINE DGGRNEVL PIEEEDSIGSEITDSGSQGG  VQAN A V
Sbjct: 183 EEETNDGKLKIEEISIEINEIDGGRNEVLAPIEEEDSIGSEITDSGSQGGEEVQANSASV 242

Query: 241 EICSNHEECEARPGQMKLTKGFSAKEP-----VNVVKGLKCYEDIFTQSELARLNDFVDD 300
           EICSNHEECEARPGQMKLTKGFSAKEP     VNVVKGLKCYEDIFTQSEL RLNDFVDD
Sbjct: 243 EICSNHEECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTQSELGRLNDFVDD 302

Query: 301 LRSAANNGELSGDTFVLFNKQVKGNRREMIQLGVPIFGQIKDDSTNISQTSNIEPIPPLL 360
           LRSAANNGELSG TF+LFNKQVKG+RREMIQLGVPIF QI ++S N SQTSNIEPIP +L
Sbjct: 303 LRSAANNGELSGGTFILFNKQVKGSRREMIQLGVPIFRQIGEESGNNSQTSNIEPIPHIL 362

Query: 361 MTVIDHLIQWQLIPEYKRPNGCLFNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGR 420
           MTVIDHLIQWQLIPEYKRPNGCLFNFFEEGEYSQPFQKPPHLEQPISTL LSESTMAFGR
Sbjct: 363 MTVIDHLIQWQLIPEYKRPNGCLFNFFEEGEYSQPFQKPPHLEQPISTLVLSESTMAFGR 422

Query: 421 SIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQC 480
           SIVSDNEGNYKGPL LSLKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRP+YDQC
Sbjct: 423 SIVSDNEGNYKGPLTLSLKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPEYDQC 482

Query: 481 QSPTPQMSNAMTLWQPGVAAACALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMS 540
           QSPTPQMSNAMTLWQP VA  CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMS
Sbjct: 483 QSPTPQMSNAMTLWQPTVAGTCALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMS 542

Query: 541 PGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALPSAVETRLPDSSHEPGISV 560
           PGRSQRDGTGVFLPWAVN+RKPAKHLPPRARKGRFLALP AVETRLPDSSHEPGISV
Sbjct: 543 PGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALPPAVETRLPDSSHEPGISV 599

BLAST of HG10022874 vs. NCBI nr
Match: XP_022992509.1 (uncharacterized protein LOC111488819 [Cucurbita maxima])

HSP 1 Score: 952.6 bits (2461), Expect = 1.5e-273
Identity = 498/578 (86.16%), Postives = 517/578 (89.45%), Query Frame = 0

Query: 1   MAAGATDRARPVVMPAAAAMTVTDTIAKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG 60
           MAAGATDRARPV+MP AAA  VTDT+AKDAVLGWFRGEFAAANAIIDALCGHLAQVS+ G
Sbjct: 3   MAAGATDRARPVMMPPAAAAAVTDTLAKDAVLGWFRGEFAAANAIIDALCGHLAQVSDIG 62

Query: 61  GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTA--KKNNNKNQEEVKGGEV 120
           G EYEAVFGAIHRRRLNWIPVLQMQKYHPI DVAVELRKVTA  KK   KNQE     E 
Sbjct: 63  GLEYEAVFGAIHRRRLNWIPVLQMQKYHPIGDVAVELRKVTAEKKKKKKKNQE-----EE 122

Query: 121 EAVAVAEGEGEGDVEMEVKKMSEED--------EKEFVEEEMNNGKLKIEEMSIEINETD 180
           E  A A  E + DVEME KK SE D        E+EFVEEE NNGK+KIEEMSIEINET+
Sbjct: 123 EEAAAAVAEDDCDVEMEAKKTSEADENGGKMCSEEEFVEEEANNGKVKIEEMSIEINETE 182

Query: 181 GGRNEVLPPIEEEDSIGSEITDSGSQ--GGGVQANYAEVEICSNHEECEARPGQMKLTKG 240
           GGRNE L PIEEEDSIGSEITDSGSQ  GGGVQA+ AEVEICSNH ECEARPG MKLTKG
Sbjct: 183 GGRNEDLAPIEEEDSIGSEITDSGSQGGGGGVQASSAEVEICSNHGECEARPGLMKLTKG 242

Query: 241 FSAKEP-----VNVVKGLKCYEDIFTQSELARLNDFVDDLRSAANNGELSGDTFVLFNKQ 300
           FSAKEP     VNVVKGLKCYEDIFT+SEL +LNDFVDDLRSAA NGELSG+TFVLFN+Q
Sbjct: 243 FSAKEPVKGHMVNVVKGLKCYEDIFTESELVKLNDFVDDLRSAAKNGELSGETFVLFNQQ 302

Query: 301 VKGNRREMIQLGVPIFGQIKDDSTNISQTSNIEPIPPLLMTVIDHLIQWQLIPEYKRPNG 360
           VKGNRREMIQLGVPIFGQI+DDS N S+TSNIEPIPPLL+TVIDHLIQWQLIPEYKRPNG
Sbjct: 303 VKGNRREMIQLGVPIFGQIRDDSANNSRTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNG 362

Query: 361 CLFNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEG 420
           CL NFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEG
Sbjct: 363 CLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEG 422

Query: 421 SLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTP-QMSNAMTLWQPGVAA 480
           SLLVMRGNSADVARHV+CASPNKRVTITFFRVRPDYDQCQSPTP Q+SN +TLWQPGVA 
Sbjct: 423 SLLVMRGNSADVARHVICASPNKRVTITFFRVRPDYDQCQSPTPQQISNTVTLWQPGVAG 482

Query: 481 ACALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNSR 540
            CALPNG TYGYEAMEVMPKWGIL APVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNSR
Sbjct: 483 TCALPNGVTYGYEAMEVMPKWGILHAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNSR 542

Query: 541 KPAKHLPPRARKGRFLALPSAVETRLPDSSHE-PGISV 560
           KPAKHLPPRARKGRFLALPS VETRLPDSS+E PGISV
Sbjct: 543 KPAKHLPPRARKGRFLALPSPVETRLPDSSYEQPGISV 575

BLAST of HG10022874 vs. ExPASy Swiss-Prot
Match: Q9ZT92 (RNA demethylase ALKBH10B OS=Arabidopsis thaliana OX=3702 GN=ALKBH10B PE=1 SV=1)

HSP 1 Score: 537.0 bits (1382), Expect = 2.6e-151
Identity = 309/574 (53.83%), Postives = 398/574 (69.34%), Query Frame = 0

Query: 6   TDRARPVVMPAAAAMTVTDTIAKDAVLGWFRGEFAAANAIIDALCGHLAQVSES-GGSEY 65
           T +A  V +    A  V++ + KDA++ WFRGEFAAANAIIDA+C HL    E+  GSEY
Sbjct: 25  TAKAVSVPVQVPPATVVSEGLGKDALISWFRGEFAAANAIIDAMCSHLRIAEEAVSGSEY 84

Query: 66  EAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAKKNNNKNQEEVKGGEVEAVAVA 125
           EAVF AIHRRRLNWIPVLQMQKYH IA+VA+EL+KV AKK  +  Q++            
Sbjct: 85  EAVFAAIHRRRLNWIPVLQMQKYHSIAEVAIELQKVAAKKAEDLKQKKT----------- 144

Query: 126 EGEGEGDVEMEVKKMSEEDEKEFVEEEMNNGKLKIEEMSIEINETDGGRNEVLPPIEEED 185
               E + E ++K++   +E+E V++E  NG+ K+ E     N+ +G   +V     E+D
Sbjct: 145 ----EEEAEEDLKEVVATEEEE-VKKECFNGE-KVTE-----NDVNGDVEDV-----EDD 204

Query: 186 SIGSEITDSGSQGG---GVQANYAEVEICSNHEECEARPGQMKLTKGFSAKE-----PVN 245
           S  S+ITDSGS       V A+ A   IC +HE+C+AR  ++K  KGF AKE      VN
Sbjct: 205 SPTSDITDSGSHQDVHQTVVADTAHQIICHSHEDCDARSCEIKPIKGFQAKEQVKGHTVN 264

Query: 246 VVKGLKCYEDIFTQSELARLNDFVDDLRSAANNGELSGDTFVLFNKQVKGNRREMIQLGV 305
           VVKGLK YE++  + E+++L DFV +LR A  NG+L+G++F+LFNKQ+KGN+RE+IQLGV
Sbjct: 265 VVKGLKLYEELLKEDEISKLLDFVAELREAGINGKLAGESFILFNKQIKGNKRELIQLGV 324

Query: 306 PIFGQIKDD--STNISQTSNIEPIPPLLMTVIDHLIQWQLIPEYKRPNGCLFNFFEEGEY 365
           PIFG +K D  S + + + NIEPIPPLL +VIDH + W+LIPEYKRPNGC+ NFFEEGEY
Sbjct: 325 PIFGHVKADENSNDTNNSVNIEPIPPLLESVIDHFVTWRLIPEYKRPNGCVINFFEEGEY 384

Query: 366 SQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSAD 425
           SQPF KPPHLEQPISTL LSESTMA+GR + SDNEGN++GPL LSLK+GSLLVMRGNSAD
Sbjct: 385 SQPFLKPPHLEQPISTLVLSESTMAYGRILSSDNEGNFRGPLTLSLKQGSLLVMRGNSAD 444

Query: 426 VARHVMCASPNKRVTITFFRVRPD--YDQCQSPTPQMSNAMTLWQPGVAAACALPNGATY 485
           +ARHVMC S NKRV+ITFFR+RPD  ++  Q  +P+    MT+WQP         NG  +
Sbjct: 445 MARHVMCPSQNKRVSITFFRIRPDTYHNHSQPNSPRNDGVMTMWQPYQMTPTPFLNGYDH 504

Query: 486 GYEAMEVMPKWGILRAPVVMLA--PVRPMVM-SPG-RSQRDGTGVFLPWAV--NSRKPAK 545
              ++++MPK G+LR P+VM+A  PV+PM++ SP       GTGVFLPWA   +SRK  K
Sbjct: 505 ---SIDMMPKLGVLRPPMVMMAPPPVQPMILPSPNVMGTGGGTGVFLPWASVNSSRKHVK 564

Query: 546 HLPPRARKGRFLALPSAVETR-LPDSSHEPGISV 560
           HLPPRA+K R L LP A  +     S+ EP I+V
Sbjct: 565 HLPPRAQKKRLLPLPPAASSSPAGGSTSEPVITV 568

BLAST of HG10022874 vs. ExPASy Swiss-Prot
Match: Q9SL49 (RNA demethylase ALKBH9B OS=Arabidopsis thaliana OX=3702 GN=ALKBH9B PE=1 SV=1)

HSP 1 Score: 125.9 bits (315), Expect = 1.4e-27
Identity = 72/207 (34.78%), Postives = 113/207 (54.59%), Query Frame = 0

Query: 235 VNVVKGLKCYEDIFTQSELARLNDFVDDLRSAANNGELSGDTFVLFNKQVKGNRREMIQL 294
           VNV+ GL+ +  +F+  E  R+ D V  L+     GEL   TF   +K ++G  RE IQ 
Sbjct: 210 VNVLDGLELHTGVFSAVEQKRIVDQVYQLQEKGRRGELKKRTFTAPHKWMRGKGRETIQF 269

Query: 295 GVPIFGQIKDDSTN---ISQTSNIEPIPPLLMTVIDHLIQWQLIPEYKRPNGCLFNFFEE 354
           G   +    D + N   I Q   ++P+P L   +I  LI+W ++P    P+ C+ N ++E
Sbjct: 270 GC-CYNYAPDRAGNPPGILQREEVDPLPHLFKVIIRKLIKWHVLPPTCVPDSCIVNIYDE 329

Query: 355 GEYSQPFQKPPHLE-----QPISTL-FLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSL 414
           G+       PPH++     +P  T+ FLSE  + FG ++  +  G++ G   + L  GS+
Sbjct: 330 GDCI-----PPHIDNHDFLRPFCTISFLSECDILFGSNLKVEGPGDFSGSYSIPLPVGSV 389

Query: 415 LVMRGNSADVARHVMCASPNKRVTITF 433
           LV+ GN ADVA+H + A P KR++ITF
Sbjct: 390 LVLNGNGADVAKHCVPAVPTKRISITF 410

BLAST of HG10022874 vs. ExPASy TrEMBL
Match: A0A1S3BQT2 (uncharacterized protein LOC103492703 OS=Cucumis melo OX=3656 GN=LOC103492703 PE=4 SV=1)

HSP 1 Score: 1009.2 bits (2608), Expect = 6.6e-291
Identity = 524/576 (90.97%), Postives = 535/576 (92.88%), Query Frame = 0

Query: 1   MAAGATDRARPVVMPAAAAMTVTDTIAKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG 60
           MAAGATDR RPVVMPAAAAMTVTDT+AKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG
Sbjct: 3   MAAGATDRGRPVVMPAAAAMTVTDTLAKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG 62

Query: 61  GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTA--KKNNNKNQ---EEVKG 120
           GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTA  KK  NKNQ   EEVKG
Sbjct: 63  GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAAKKKKMNKNQEEEEEVKG 122

Query: 121 GEVEA--VAVAEGEGEGDVEMEVKKMSEEDEKEFVEEEMNNGKLKIEEMSIEINETDGGR 180
           GEVEA  VA A  EG+GDVEME KKMSEEDEKEFVEEE N+G LKIEE+SIEINE DGGR
Sbjct: 123 GEVEAVEVAAAVAEGDGDVEMEGKKMSEEDEKEFVEEETNDGNLKIEEISIEINEIDGGR 182

Query: 181 NEVLPPIEEEDSIGSEITDSGSQGGG-----VQANYAEVEICSNHEECEARPGQMKLTKG 240
           NEVL PIEEEDSIGSEITDSGSQGGG     VQANYA+VEICSNHEECEARPGQMKLTKG
Sbjct: 183 NEVLAPIEEEDSIGSEITDSGSQGGGGGEEEVQANYADVEICSNHEECEARPGQMKLTKG 242

Query: 241 FSAKEP-----VNVVKGLKCYEDIFTQSELARLNDFVDDLRSAANNGELSGDTFVLFNKQ 300
           FSAKEP     VNVVKGLKCYEDIFTQSELARLNDFVD LRSAANNGELSG TF+LFNKQ
Sbjct: 243 FSAKEPVKGHMVNVVKGLKCYEDIFTQSELARLNDFVDGLRSAANNGELSGGTFILFNKQ 302

Query: 301 VKGNRREMIQLGVPIFGQIKDDSTNISQTSNIEPIPPLLMTVIDHLIQWQLIPEYKRPNG 360
           VKG+RREMIQLGVPIF QI ++S N SQTSNIEPIP +LMTVIDHLIQWQLIPEYKRPNG
Sbjct: 303 VKGSRREMIQLGVPIFRQIGEESGNNSQTSNIEPIPHILMTVIDHLIQWQLIPEYKRPNG 362

Query: 361 CLFNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEG 420
           CLFNFFEEGEYSQPFQKPPHLEQPISTL LSESTMAFGRSIVSDNEGNYKGPL LSLKEG
Sbjct: 363 CLFNFFEEGEYSQPFQKPPHLEQPISTLVLSESTMAFGRSIVSDNEGNYKGPLTLSLKEG 422

Query: 421 SLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMTLWQPGVAAA 480
           SLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMTLWQP VA  
Sbjct: 423 SLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMTLWQPTVAGT 482

Query: 481 CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNSRK 540
           CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVN+RK
Sbjct: 483 CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNTRK 542

Query: 541 PAKHLPPRARKGRFLALPSAVETRLPDSSHEPGISV 560
           PAKHLPPRARKGRFLALP AVETRLPDSSHEPGISV
Sbjct: 543 PAKHLPPRARKGRFLALPPAVETRLPDSSHEPGISV 578

BLAST of HG10022874 vs. ExPASy TrEMBL
Match: A0A5D3DBI8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold328G00370 PE=4 SV=1)

HSP 1 Score: 1006.5 bits (2601), Expect = 4.3e-290
Identity = 523/576 (90.80%), Postives = 534/576 (92.71%), Query Frame = 0

Query: 1   MAAGATDRARPVVMPAAAAMTVTDTIAKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG 60
           MAAGATDR RPVVMPAAAAMTVTDT+AKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG
Sbjct: 3   MAAGATDRGRPVVMPAAAAMTVTDTLAKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG 62

Query: 61  GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTA--KKNNNKNQ---EEVKG 120
           GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTA  KK  NKNQ   EEVKG
Sbjct: 63  GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAAKKKKMNKNQEEEEEVKG 122

Query: 121 GEVEA--VAVAEGEGEGDVEMEVKKMSEEDEKEFVEEEMNNGKLKIEEMSIEINETDGGR 180
           GEVEA  VA A  EG+GDVEME KKMSEEDEKEFVEEE N+G LKIEE+SIEINE DGGR
Sbjct: 123 GEVEAVEVAAAVAEGDGDVEMEGKKMSEEDEKEFVEEETNDGNLKIEEISIEINEIDGGR 182

Query: 181 NEVLPPIEEEDSIGSEITDSGSQGGG-----VQANYAEVEICSNHEECEARPGQMKLTKG 240
           NEVL PIEEEDSIGSEITDSGSQGGG     VQANYA+VEICSNHEECEARPG MKLTKG
Sbjct: 183 NEVLAPIEEEDSIGSEITDSGSQGGGGGEEEVQANYADVEICSNHEECEARPGLMKLTKG 242

Query: 241 FSAKEP-----VNVVKGLKCYEDIFTQSELARLNDFVDDLRSAANNGELSGDTFVLFNKQ 300
           FSAKEP     VNVVKGLKCYEDIFTQSELARLNDFVD LRSAANNGELSG TF+LFNKQ
Sbjct: 243 FSAKEPVKGHMVNVVKGLKCYEDIFTQSELARLNDFVDGLRSAANNGELSGGTFILFNKQ 302

Query: 301 VKGNRREMIQLGVPIFGQIKDDSTNISQTSNIEPIPPLLMTVIDHLIQWQLIPEYKRPNG 360
           VKG+RREMIQLGVPIF QI ++S N SQTSNIEPIP +LMTVIDHLIQWQLIPEYKRPNG
Sbjct: 303 VKGSRREMIQLGVPIFRQIGEESGNNSQTSNIEPIPHILMTVIDHLIQWQLIPEYKRPNG 362

Query: 361 CLFNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEG 420
           CLFNFFEEGEYSQPFQKPPHLEQPISTL LSESTMAFGRSIVSDNEGNYKGPL LSLKEG
Sbjct: 363 CLFNFFEEGEYSQPFQKPPHLEQPISTLVLSESTMAFGRSIVSDNEGNYKGPLTLSLKEG 422

Query: 421 SLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMTLWQPGVAAA 480
           SLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMTLWQP VA  
Sbjct: 423 SLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMTLWQPTVAGT 482

Query: 481 CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNSRK 540
           CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVN+RK
Sbjct: 483 CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNTRK 542

Query: 541 PAKHLPPRARKGRFLALPSAVETRLPDSSHEPGISV 560
           PAKHLPPRARKGRFLALP AVETRLPDSSHEPGISV
Sbjct: 543 PAKHLPPRARKGRFLALPPAVETRLPDSSHEPGISV 578

BLAST of HG10022874 vs. ExPASy TrEMBL
Match: A0A0A0K544 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G392330 PE=4 SV=1)

HSP 1 Score: 995.7 bits (2573), Expect = 7.6e-287
Identity = 522/581 (89.85%), Postives = 536/581 (92.25%), Query Frame = 0

Query: 1   MAAGATDRARPVVMPAAAAMTVTDTIAKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG 60
           MAAGAT+RARPVVMPAAAAMTVTDT+AKDAVLGWFRGEFAAANAIIDALCGH+AQVSESG
Sbjct: 3   MAAGATERARPVVMPAAAAMTVTDTLAKDAVLGWFRGEFAAANAIIDALCGHMAQVSESG 62

Query: 61  GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTA--KKNNNKNQ-EEVK-GG 120
           GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTA  KK  NKNQ EEVK GG
Sbjct: 63  GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAAKKKKMNKNQEEEVKGGG 122

Query: 121 EVEAVAVA----EGEGEGDVEMEVKKMSEEDEKEF--------VEEEMNNGKLKIEEMSI 180
           EVEAV VA    +G+G GDVEMEVKKMSEEDEKEF        VEEE N+GKLKIEE+SI
Sbjct: 123 EVEAVEVALAEGDGDGYGDVEMEVKKMSEEDEKEFVEEDEKEIVEEETNDGKLKIEEISI 182

Query: 181 EINETDGGRNEVLPPIEEEDSIGSEITDSGSQGG-GVQANYAEVEICSNHEECEARPGQM 240
           EINE DGGRNEVL PIEEEDSIGSEITDSGSQGG  VQAN A VEICSNHEECEARPGQM
Sbjct: 183 EINEIDGGRNEVLAPIEEEDSIGSEITDSGSQGGEEVQANSASVEICSNHEECEARPGQM 242

Query: 241 KLTKGFSAKEP-----VNVVKGLKCYEDIFTQSELARLNDFVDDLRSAANNGELSGDTFV 300
           KLTKGFSAKEP     VNVVKGLKCYEDIFTQSEL RLNDFVDDLRSAANNGELSG TF+
Sbjct: 243 KLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTQSELGRLNDFVDDLRSAANNGELSGGTFI 302

Query: 301 LFNKQVKGNRREMIQLGVPIFGQIKDDSTNISQTSNIEPIPPLLMTVIDHLIQWQLIPEY 360
           LFNKQVKG+RREMIQLGVPIF QI ++S N SQTSNIEPIP +LMTVIDHLIQWQLIPEY
Sbjct: 303 LFNKQVKGSRREMIQLGVPIFRQIGEESGNNSQTSNIEPIPHILMTVIDHLIQWQLIPEY 362

Query: 361 KRPNGCLFNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLML 420
           KRPNGCLFNFFEEGEYSQPFQKPPHLEQPISTL LSESTMAFGRSIVSDNEGNYKGPL L
Sbjct: 363 KRPNGCLFNFFEEGEYSQPFQKPPHLEQPISTLVLSESTMAFGRSIVSDNEGNYKGPLTL 422

Query: 421 SLKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMTLWQP 480
           SLKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRP+YDQCQSPTPQMSNAMTLWQP
Sbjct: 423 SLKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPEYDQCQSPTPQMSNAMTLWQP 482

Query: 481 GVAAACALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWA 540
            VA  CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWA
Sbjct: 483 TVAGTCALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWA 542

Query: 541 VNSRKPAKHLPPRARKGRFLALPSAVETRLPDSSHEPGISV 560
           VN+RKPAKHLPPRARKGRFLALP AVETRLPDSSHEPGISV
Sbjct: 543 VNTRKPAKHLPPRARKGRFLALPPAVETRLPDSSHEPGISV 583

BLAST of HG10022874 vs. ExPASy TrEMBL
Match: A0A6J1JXR1 (uncharacterized protein LOC111488819 OS=Cucurbita maxima OX=3661 GN=LOC111488819 PE=4 SV=1)

HSP 1 Score: 952.6 bits (2461), Expect = 7.4e-274
Identity = 498/578 (86.16%), Postives = 517/578 (89.45%), Query Frame = 0

Query: 1   MAAGATDRARPVVMPAAAAMTVTDTIAKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG 60
           MAAGATDRARPV+MP AAA  VTDT+AKDAVLGWFRGEFAAANAIIDALCGHLAQVS+ G
Sbjct: 3   MAAGATDRARPVMMPPAAAAAVTDTLAKDAVLGWFRGEFAAANAIIDALCGHLAQVSDIG 62

Query: 61  GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTA--KKNNNKNQEEVKGGEV 120
           G EYEAVFGAIHRRRLNWIPVLQMQKYHPI DVAVELRKVTA  KK   KNQE     E 
Sbjct: 63  GLEYEAVFGAIHRRRLNWIPVLQMQKYHPIGDVAVELRKVTAEKKKKKKKNQE-----EE 122

Query: 121 EAVAVAEGEGEGDVEMEVKKMSEED--------EKEFVEEEMNNGKLKIEEMSIEINETD 180
           E  A A  E + DVEME KK SE D        E+EFVEEE NNGK+KIEEMSIEINET+
Sbjct: 123 EEAAAAVAEDDCDVEMEAKKTSEADENGGKMCSEEEFVEEEANNGKVKIEEMSIEINETE 182

Query: 181 GGRNEVLPPIEEEDSIGSEITDSGSQ--GGGVQANYAEVEICSNHEECEARPGQMKLTKG 240
           GGRNE L PIEEEDSIGSEITDSGSQ  GGGVQA+ AEVEICSNH ECEARPG MKLTKG
Sbjct: 183 GGRNEDLAPIEEEDSIGSEITDSGSQGGGGGVQASSAEVEICSNHGECEARPGLMKLTKG 242

Query: 241 FSAKEP-----VNVVKGLKCYEDIFTQSELARLNDFVDDLRSAANNGELSGDTFVLFNKQ 300
           FSAKEP     VNVVKGLKCYEDIFT+SEL +LNDFVDDLRSAA NGELSG+TFVLFN+Q
Sbjct: 243 FSAKEPVKGHMVNVVKGLKCYEDIFTESELVKLNDFVDDLRSAAKNGELSGETFVLFNQQ 302

Query: 301 VKGNRREMIQLGVPIFGQIKDDSTNISQTSNIEPIPPLLMTVIDHLIQWQLIPEYKRPNG 360
           VKGNRREMIQLGVPIFGQI+DDS N S+TSNIEPIPPLL+TVIDHLIQWQLIPEYKRPNG
Sbjct: 303 VKGNRREMIQLGVPIFGQIRDDSANNSRTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNG 362

Query: 361 CLFNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEG 420
           CL NFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEG
Sbjct: 363 CLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEG 422

Query: 421 SLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTP-QMSNAMTLWQPGVAA 480
           SLLVMRGNSADVARHV+CASPNKRVTITFFRVRPDYDQCQSPTP Q+SN +TLWQPGVA 
Sbjct: 423 SLLVMRGNSADVARHVICASPNKRVTITFFRVRPDYDQCQSPTPQQISNTVTLWQPGVAG 482

Query: 481 ACALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNSR 540
            CALPNG TYGYEAMEVMPKWGIL APVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNSR
Sbjct: 483 TCALPNGVTYGYEAMEVMPKWGILHAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNSR 542

Query: 541 KPAKHLPPRARKGRFLALPSAVETRLPDSSHE-PGISV 560
           KPAKHLPPRARKGRFLALPS VETRLPDSS+E PGISV
Sbjct: 543 KPAKHLPPRARKGRFLALPSPVETRLPDSSYEQPGISV 575

BLAST of HG10022874 vs. ExPASy TrEMBL
Match: A0A6J1GNJ1 (uncharacterized protein LOC111456041 OS=Cucurbita moschata OX=3662 GN=LOC111456041 PE=4 SV=1)

HSP 1 Score: 950.3 bits (2455), Expect = 3.7e-273
Identity = 497/577 (86.14%), Postives = 517/577 (89.60%), Query Frame = 0

Query: 1   MAAGATDRARPVVMPAAAAMTVTDTIAKDAVLGWFRGEFAAANAIIDALCGHLAQVSESG 60
           MAAGATDRARPV+MP AAA  VTDT+AKDAVLGWFRGEFAAANAIIDALCGHLAQVS+ G
Sbjct: 3   MAAGATDRARPVMMPPAAAAAVTDTLAKDAVLGWFRGEFAAANAIIDALCGHLAQVSDIG 62

Query: 61  GSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTA-KKNNNKNQEEVKGGEVE 120
           G EYEAVFGAIHRRRLNWIPVLQMQKYHPI DVAVELRKVTA KK   K Q + +  E E
Sbjct: 63  GLEYEAVFGAIHRRRLNWIPVLQMQKYHPIGDVAVELRKVTAEKKKKKKKQNQEEEEEEE 122

Query: 121 AVAVAEGEGEGDVEMEVKKMSEEDE--------KEFVEEEMNNGKLKIEEMSIEINETDG 180
           A  VA  E +GDVEME KK SE DE        +EFVEEE NN K+KIEEMSIEINET+G
Sbjct: 123 AAEVA--EDDGDVEMEAKKTSEADENGGKMSSDEEFVEEEANNEKVKIEEMSIEINETEG 182

Query: 181 GRNEVLPPIEEEDSIGSEITDSGSQ--GGGVQANYAEVEICSNHEECEARPGQMKLTKGF 240
           GRNE L PIEEEDSIGSEITDSGSQ  GGGVQA+ AEVEICSNH ECEARPG MKLTKGF
Sbjct: 183 GRNEDLAPIEEEDSIGSEITDSGSQGGGGGVQASSAEVEICSNHGECEARPGLMKLTKGF 242

Query: 241 SAKEP-----VNVVKGLKCYEDIFTQSELARLNDFVDDLRSAANNGELSGDTFVLFNKQV 300
           SAKEP     VNVVKGLKCYEDIFT+SEL +LNDFVDDLRSAA NGELSG+TFVLFN+QV
Sbjct: 243 SAKEPVKGHMVNVVKGLKCYEDIFTESELVKLNDFVDDLRSAAKNGELSGETFVLFNQQV 302

Query: 301 KGNRREMIQLGVPIFGQIKDDSTNISQTSNIEPIPPLLMTVIDHLIQWQLIPEYKRPNGC 360
           KGNRREMIQLGVPIFGQI+DDS N ++TSNIEPIPPLL TVIDHLIQWQLIPEYKRPNGC
Sbjct: 303 KGNRREMIQLGVPIFGQIRDDSANNNRTSNIEPIPPLLATVIDHLIQWQLIPEYKRPNGC 362

Query: 361 LFNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGS 420
           L NFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGS
Sbjct: 363 LVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGS 422

Query: 421 LLVMRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTP-QMSNAMTLWQPGVAAA 480
           LLVMRGNSADVARHV+CASPNKRVTITFFRVRPDYDQCQSPTP QMSNA+TLWQPGVA  
Sbjct: 423 LLVMRGNSADVARHVICASPNKRVTITFFRVRPDYDQCQSPTPQQMSNAVTLWQPGVAGT 482

Query: 481 CALPNGATYGYEAMEVMPKWGILRAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNSRK 540
           C LPNGATYGYEAMEVMPKWGIL APVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNSRK
Sbjct: 483 CTLPNGATYGYEAMEVMPKWGILHAPVVMLAPVRPMVMSPGRSQRDGTGVFLPWAVNSRK 542

Query: 541 PAKHLPPRARKGRFLALPSAVETRLPDSSHE-PGISV 560
           PAKHLPPRARKGRFLALPS VETR PDSS+E PGISV
Sbjct: 543 PAKHLPPRARKGRFLALPSPVETRRPDSSYEQPGISV 577

BLAST of HG10022874 vs. TAIR 10
Match: AT4G02940.1 (oxidoreductase, 2OG-Fe(II) oxygenase family protein )

HSP 1 Score: 537.0 bits (1382), Expect = 1.9e-152
Identity = 309/574 (53.83%), Postives = 398/574 (69.34%), Query Frame = 0

Query: 6   TDRARPVVMPAAAAMTVTDTIAKDAVLGWFRGEFAAANAIIDALCGHLAQVSES-GGSEY 65
           T +A  V +    A  V++ + KDA++ WFRGEFAAANAIIDA+C HL    E+  GSEY
Sbjct: 25  TAKAVSVPVQVPPATVVSEGLGKDALISWFRGEFAAANAIIDAMCSHLRIAEEAVSGSEY 84

Query: 66  EAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAKKNNNKNQEEVKGGEVEAVAVA 125
           EAVF AIHRRRLNWIPVLQMQKYH IA+VA+EL+KV AKK  +  Q++            
Sbjct: 85  EAVFAAIHRRRLNWIPVLQMQKYHSIAEVAIELQKVAAKKAEDLKQKKT----------- 144

Query: 126 EGEGEGDVEMEVKKMSEEDEKEFVEEEMNNGKLKIEEMSIEINETDGGRNEVLPPIEEED 185
               E + E ++K++   +E+E V++E  NG+ K+ E     N+ +G   +V     E+D
Sbjct: 145 ----EEEAEEDLKEVVATEEEE-VKKECFNGE-KVTE-----NDVNGDVEDV-----EDD 204

Query: 186 SIGSEITDSGSQGG---GVQANYAEVEICSNHEECEARPGQMKLTKGFSAKE-----PVN 245
           S  S+ITDSGS       V A+ A   IC +HE+C+AR  ++K  KGF AKE      VN
Sbjct: 205 SPTSDITDSGSHQDVHQTVVADTAHQIICHSHEDCDARSCEIKPIKGFQAKEQVKGHTVN 264

Query: 246 VVKGLKCYEDIFTQSELARLNDFVDDLRSAANNGELSGDTFVLFNKQVKGNRREMIQLGV 305
           VVKGLK YE++  + E+++L DFV +LR A  NG+L+G++F+LFNKQ+KGN+RE+IQLGV
Sbjct: 265 VVKGLKLYEELLKEDEISKLLDFVAELREAGINGKLAGESFILFNKQIKGNKRELIQLGV 324

Query: 306 PIFGQIKDD--STNISQTSNIEPIPPLLMTVIDHLIQWQLIPEYKRPNGCLFNFFEEGEY 365
           PIFG +K D  S + + + NIEPIPPLL +VIDH + W+LIPEYKRPNGC+ NFFEEGEY
Sbjct: 325 PIFGHVKADENSNDTNNSVNIEPIPPLLESVIDHFVTWRLIPEYKRPNGCVINFFEEGEY 384

Query: 366 SQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSAD 425
           SQPF KPPHLEQPISTL LSESTMA+GR + SDNEGN++GPL LSLK+GSLLVMRGNSAD
Sbjct: 385 SQPFLKPPHLEQPISTLVLSESTMAYGRILSSDNEGNFRGPLTLSLKQGSLLVMRGNSAD 444

Query: 426 VARHVMCASPNKRVTITFFRVRPD--YDQCQSPTPQMSNAMTLWQPGVAAACALPNGATY 485
           +ARHVMC S NKRV+ITFFR+RPD  ++  Q  +P+    MT+WQP         NG  +
Sbjct: 445 MARHVMCPSQNKRVSITFFRIRPDTYHNHSQPNSPRNDGVMTMWQPYQMTPTPFLNGYDH 504

Query: 486 GYEAMEVMPKWGILRAPVVMLA--PVRPMVM-SPG-RSQRDGTGVFLPWAV--NSRKPAK 545
              ++++MPK G+LR P+VM+A  PV+PM++ SP       GTGVFLPWA   +SRK  K
Sbjct: 505 ---SIDMMPKLGVLRPPMVMMAPPPVQPMILPSPNVMGTGGGTGVFLPWASVNSSRKHVK 564

Query: 546 HLPPRARKGRFLALPSAVETR-LPDSSHEPGISV 560
           HLPPRA+K R L LP A  +     S+ EP I+V
Sbjct: 565 HLPPRAQKKRLLPLPPAASSSPAGGSTSEPVITV 568

BLAST of HG10022874 vs. TAIR 10
Match: AT2G48080.1 (oxidoreductase, 2OG-Fe(II) oxygenase family protein )

HSP 1 Score: 429.9 bits (1104), Expect = 3.2e-120
Identity = 254/545 (46.61%), Postives = 319/545 (58.53%), Query Frame = 0

Query: 22  VTDTIAKDAVLGWFRGEFAAANAIIDALCGHLAQVSESGGSEYEAVFGAIHRRRLNWIPV 81
           ++D+ AKDA+L WFRGEFAAANAIIDALC HL Q S  G ++YE+V  A+HRRRLNWIPV
Sbjct: 17  LSDSAAKDAMLTWFRGEFAAANAIIDALCAHLMQAS-GGSAQYESVMAALHRRRLNWIPV 76

Query: 82  LQMQKYHPIADVAVELRKVTAKKNNNKNQEEVKGGEVEAVAVAEGEGEGDVEMEVKKMSE 141
           LQMQKYH I+ V ++L++  AK                                      
Sbjct: 77  LQMQKYHSISQVTLQLQQHLAK-------------------------------------- 136

Query: 142 EDEKEFVEEEMNNGKLKIEEMSIEINETDGGRNEVLPPIEEEDSIGSEITDSGSQGGGVQ 201
                                         G +  L    ++DS  S+ITD GS+     
Sbjct: 137 ------------------------------GFHHHLDDDHDDDSPSSDITDGGSR----- 196

Query: 202 ANYAEVEICSNHE-ECEARPGQ-MKLTKGFSAKEPV-----NVVKGLKCYEDIFTQSELA 261
                + IC  HE ECE+R    +K +K FSAKE V     NVVKGLK Y+D+FT+ +L+
Sbjct: 197 -EEETLSICCKHEDECESRGASLLKQSKRFSAKEHVRGHTANVVKGLKLYQDVFTRPQLS 256

Query: 262 RLNDFVDDLRSAANNGELSGDTFVLFNKQVKGNRREMIQLGVPIFGQIKDDSTNISQTSN 321
           +L D ++ LR A  N +LSG+TFVLFNK  KG +RE++QLGVPIFG   D+        +
Sbjct: 257 KLLDSINQLREAGRNHQLSGETFVLFNKNTKGTKRELLQLGVPIFGNTTDE-------HS 316

Query: 322 IEPIPPLLMTVIDHLIQWQLIPEYKRPNGCLFNFFEEGEYSQPFQKPPHLEQPISTLFLS 381
           +EPIP L+ +VIDHL+QW+LIPEYKRPNGC+ NFF+E E+SQPFQKPPH++QPISTL LS
Sbjct: 317 VEPIPTLVQSVIDHLLQWRLIPEYKRPNGCVINFFDEDEHSQPFQKPPHVDQPISTLVLS 376

Query: 382 ESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMCASPNKRVTITFFR 441
           ESTM FG  +  DN+GN++G L L LKEGSLLVMRGNSAD+ARHVMC SPNKRV ITFF+
Sbjct: 377 ESTMVFGHRLGVDNDGNFRGSLTLPLKEGSLLVMRGNSADMARHVMCPSPNKRVAITFFK 436

Query: 442 VRPDYDQCQSPTPQMSNAMTLWQPGVAAACALPNGATYGYEAMEVMPKWGILRAPVVMLA 501
           ++PD  + Q P        TLW+PG                            +P+VMLA
Sbjct: 437 LKPDSGKVQPPP-------TLWRPGTP--------------------------SPLVMLA 438

Query: 502 PVRPMVMSPGRSQRDGTGVFLPWAVN-SRKPAKHLPPRARKGRFLALPSAVETRLPDSSH 559
           P      +P R    GTGVFLPW    SRKPAKHLPPR ++ R L+   +V      SS 
Sbjct: 497 P------APKRLDA-GTGVFLPWTPPVSRKPAKHLPPRVQRLRLLSSSKSVADS-ESSSP 438

BLAST of HG10022874 vs. TAIR 10
Match: AT1G14710.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 221.1 bits (562), Expect = 2.3e-57
Identity = 151/450 (33.56%), Postives = 235/450 (52.22%), Query Frame = 0

Query: 28  KDAVLGWFRGEFAAANAIIDALCGHLAQVSESGGSEYEAVFGAIHRRRLNWIPVLQMQKY 87
           +D  + W R EFAAANAIID+LC HL  V +   +EYE+V G+IH RRL W  VL MQ++
Sbjct: 29  RDGFISWLRAEFAAANAIIDSLCQHLQAVGDH--NEYESVIGSIHHRRLAWSQVLTMQQF 88

Query: 88  HPIADVAVELRKVTAKKNN--------NKNQEEVKGGEVEAVAVAEGEGEGDVEMEVKKM 147
            P+ADV+  L+++  K+          N +Q    G         +  G G        M
Sbjct: 89  FPVADVSYNLQQIAWKRQQQMPPQRHYNSDQVGKFGARRSGPGFNKHHGGGGGYRGADSM 148

Query: 148 S---------EEDEKEFVEEEMNNGKLKIEEMSIEINETDGGR-----NEVLPPIEE--- 207
           +           D  E  EE      +K   +S+   + DG       ++V   +EE   
Sbjct: 149 ARNGHNFNGVNSDRVEHREEAKLASDVK--ALSVAEEKRDGSEKPRSDSKVEKKLEESET 208

Query: 208 -EDSIGSEITDSGSQGGGVQANYAEVEICSNHEECEARPGQMKLTKGFSAKEPVNVVKGL 267
            E+ + +   +SGS+   + +   + E   N +EC A   +  + +     + VNVV+GL
Sbjct: 209 QEEIVKNHKCNSGSKDNSLISEQKQEE---NDKECPASMAKTFVVQEMYDAKMVNVVEGL 268

Query: 268 KCYEDIFTQSELARLNDFVDDLRSAANNGELSGDTFVLFNKQVKGNRREMIQLGVPIFGQ 327
           K Y+ +   +E+++L   V +LR A   G+L  + +V + +  +G+ REMIQLG+PI   
Sbjct: 269 KLYDKMLDANEVSQLVSLVTNLRLAGRRGQLQSEAYVGYKRPNRGHGREMIQLGLPIADT 328

Query: 328 IKDDSTNISQTSNIEPIPPLLMTVIDHLIQWQLIPEYKRPNGCLFNFFEEGEYSQPFQKP 387
             DD +   +   IEPIP  L  +I+ L+  Q+IP   +P+ C+ +FF EG++SQP    
Sbjct: 329 PPDDDS--IKDRRIEPIPSALSDIIERLVSKQIIP--VKPDACIIDFFSEGDHSQPHMFV 388

Query: 388 PHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMC 447
           P   +PIS L LSE    FGR IVS+N G+YKG L LSL  GS+L++ G SA++A++ + 
Sbjct: 389 PWFGRPISVLSLSECDYTFGRVIVSENPGDYKGSLKLSLTPGSVLLVEGKSANLAKYAIH 448

Query: 448 ASPNKRVTITFFRVRPDYDQCQSPTPQMSN 452
           A+  +R+ I+F + +P       P  +  N
Sbjct: 449 ATRKQRILISFIKSKPRNSNWGPPPSRSPN 467

BLAST of HG10022874 vs. TAIR 10
Match: AT1G14710.2 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 221.1 bits (562), Expect = 2.3e-57
Identity = 151/450 (33.56%), Postives = 235/450 (52.22%), Query Frame = 0

Query: 28  KDAVLGWFRGEFAAANAIIDALCGHLAQVSESGGSEYEAVFGAIHRRRLNWIPVLQMQKY 87
           +D  + W R EFAAANAIID+LC HL  V +   +EYE+V G+IH RRL W  VL MQ++
Sbjct: 29  RDGFISWLRAEFAAANAIIDSLCQHLQAVGDH--NEYESVIGSIHHRRLAWSQVLTMQQF 88

Query: 88  HPIADVAVELRKVTAKKNN--------NKNQEEVKGGEVEAVAVAEGEGEGDVEMEVKKM 147
            P+ADV+  L+++  K+          N +Q    G         +  G G        M
Sbjct: 89  FPVADVSYNLQQIAWKRQQQMPPQRHYNSDQVGKFGARRSGPGFNKHHGGGGGYRGADSM 148

Query: 148 S---------EEDEKEFVEEEMNNGKLKIEEMSIEINETDGGR-----NEVLPPIEE--- 207
           +           D  E  EE      +K   +S+   + DG       ++V   +EE   
Sbjct: 149 ARNGHNFNGVNSDRVEHREEAKLASDVK--ALSVAEEKRDGSEKPRSDSKVEKKLEESET 208

Query: 208 -EDSIGSEITDSGSQGGGVQANYAEVEICSNHEECEARPGQMKLTKGFSAKEPVNVVKGL 267
            E+ + +   +SGS+   + +   + E   N +EC A   +  + +     + VNVV+GL
Sbjct: 209 QEEIVKNHKCNSGSKDNSLISEQKQEE---NDKECPASMAKTFVVQEMYDAKMVNVVEGL 268

Query: 268 KCYEDIFTQSELARLNDFVDDLRSAANNGELSGDTFVLFNKQVKGNRREMIQLGVPIFGQ 327
           K Y+ +   +E+++L   V +LR A   G+L  + +V + +  +G+ REMIQLG+PI   
Sbjct: 269 KLYDKMLDANEVSQLVSLVTNLRLAGRRGQLQSEAYVGYKRPNRGHGREMIQLGLPIADT 328

Query: 328 IKDDSTNISQTSNIEPIPPLLMTVIDHLIQWQLIPEYKRPNGCLFNFFEEGEYSQPFQKP 387
             DD +   +   IEPIP  L  +I+ L+  Q+IP   +P+ C+ +FF EG++SQP    
Sbjct: 329 PPDDDS--IKDRRIEPIPSALSDIIERLVSKQIIP--VKPDACIIDFFSEGDHSQPHMFV 388

Query: 388 PHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMC 447
           P   +PIS L LSE    FGR IVS+N G+YKG L LSL  GS+L++ G SA++A++ + 
Sbjct: 389 PWFGRPISVLSLSECDYTFGRVIVSENPGDYKGSLKLSLTPGSVLLVEGKSANLAKYAIH 448

Query: 448 ASPNKRVTITFFRVRPDYDQCQSPTPQMSN 452
           A+  +R+ I+F + +P       P  +  N
Sbjct: 449 ATRKQRILISFIKSKPRNSNWGPPPSRSPN 467

BLAST of HG10022874 vs. TAIR 10
Match: AT2G17970.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 125.9 bits (315), Expect = 9.9e-29
Identity = 72/207 (34.78%), Postives = 113/207 (54.59%), Query Frame = 0

Query: 235 VNVVKGLKCYEDIFTQSELARLNDFVDDLRSAANNGELSGDTFVLFNKQVKGNRREMIQL 294
           VNV+ GL+ +  +F+  E  R+ D V  L+     GEL   TF   +K ++G  RE IQ 
Sbjct: 210 VNVLDGLELHTGVFSAVEQKRIVDQVYQLQEKGRRGELKKRTFTAPHKWMRGKGRETIQF 269

Query: 295 GVPIFGQIKDDSTN---ISQTSNIEPIPPLLMTVIDHLIQWQLIPEYKRPNGCLFNFFEE 354
           G   +    D + N   I Q   ++P+P L   +I  LI+W ++P    P+ C+ N ++E
Sbjct: 270 GC-CYNYAPDRAGNPPGILQREEVDPLPHLFKVIIRKLIKWHVLPPTCVPDSCIVNIYDE 329

Query: 355 GEYSQPFQKPPHLE-----QPISTL-FLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSL 414
           G+       PPH++     +P  T+ FLSE  + FG ++  +  G++ G   + L  GS+
Sbjct: 330 GDCI-----PPHIDNHDFLRPFCTISFLSECDILFGSNLKVEGPGDFSGSYSIPLPVGSV 389

Query: 415 LVMRGNSADVARHVMCASPNKRVTITF 433
           LV+ GN ADVA+H + A P KR++ITF
Sbjct: 390 LVLNGNGADVAKHCVPAVPTKRISITF 410

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038896503.11.8e-29591.67RNA demethylase ALKBH10B [Benincasa hispida][more]
XP_008451400.11.4e-29090.97PREDICTED: uncharacterized protein LOC103492703 [Cucumis melo][more]
TYK21007.18.9e-29090.80uncharacterized protein E5676_scaffold328G00370 [Cucumis melo var. makuwa][more]
XP_011659328.21.1e-28487.44RNA demethylase ALKBH10B [Cucumis sativus] >KAE8646325.1 hypothetical protein Cs... [more]
XP_022992509.11.5e-27386.16uncharacterized protein LOC111488819 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
Q9ZT922.6e-15153.83RNA demethylase ALKBH10B OS=Arabidopsis thaliana OX=3702 GN=ALKBH10B PE=1 SV=1[more]
Q9SL491.4e-2734.78RNA demethylase ALKBH9B OS=Arabidopsis thaliana OX=3702 GN=ALKBH9B PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A1S3BQT26.6e-29190.97uncharacterized protein LOC103492703 OS=Cucumis melo OX=3656 GN=LOC103492703 PE=... [more]
A0A5D3DBI84.3e-29090.80Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A0A0K5447.6e-28789.85Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G392330 PE=4 SV=1[more]
A0A6J1JXR17.4e-27486.16uncharacterized protein LOC111488819 OS=Cucurbita maxima OX=3661 GN=LOC111488819... [more]
A0A6J1GNJ13.7e-27386.14uncharacterized protein LOC111456041 OS=Cucurbita moschata OX=3662 GN=LOC1114560... [more]
Match NameE-valueIdentityDescription
AT4G02940.11.9e-15253.83oxidoreductase, 2OG-Fe(II) oxygenase family protein [more]
AT2G48080.13.2e-12046.61oxidoreductase, 2OG-Fe(II) oxygenase family protein [more]
AT1G14710.12.3e-5733.56hydroxyproline-rich glycoprotein family protein [more]
AT1G14710.22.3e-5733.56hydroxyproline-rich glycoprotein family protein [more]
AT2G17970.19.9e-2934.782-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 145..165
NoneNo IPR availablePANTHERPTHR31447:SF102OG-FE(II) OXYGENASE FAMILY OXIDOREDUCTASEcoord: 131..559
coord: 14..111
NoneNo IPR availableSUPERFAMILY51197Clavaminate synthase-likecoord: 234..436
IPR037151Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamilyGENE3D2.60.120.590coord: 227..443
e-value: 3.5E-44
score: 153.0
IPR044842RNA demethylase ALKBH9B/ALKBH10B-likePANTHERPTHR31447HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY PROTEIN-RELATEDcoord: 131..559
coord: 14..111

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10022874.1HG10022874.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0070988 demethylation
biological_process GO:0006402 mRNA catabolic process
molecular_function GO:0032451 demethylase activity
molecular_function GO:0003729 mRNA binding