HG10007636 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10007636
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: DUF4220 domain-containing protein
Location: Chr10: 8883122 .. 8884931 (-)
RNA-Seq Expression: HG10007636
Synteny: HG10007636
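
As a rough illustration of how the Location field can be read, the sketch below splits the string into chromosome, start, end and strand, then computes the span. The helper name `parse_location` is hypothetical, and the inclusive 1-based coordinate convention is an assumption that this page does not state.

```python
import re

def parse_location(text):
    """Split a 'Chr: start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumes both coordinates are inclusive.
        "length": end - start + 1,
    }

loc = parse_location("Chr10: 8883122 .. 8884931 (-)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr10 - 1810
```

Under that assumption the feature spans 1810 bp on the minus strand of Chr10.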
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGTTGAAGAGAAAACACAGCTCAAGCCAATTCCTTCGCATGTTTCTCTTATTTACATACTCCTTTGCTGATTGGATCACCAATTTTTCATTCGGCATACTCGTCGAAAAATATGGTAGCGGCTGCTACGACGAGTTCTTTCACCCCACATACATCATTAGAGCCTTCTTAGCTCCATTTTTACTACTTCACTTAGGTGGCTCTGACACTATCACTGCCTACTCAATGGAAGACAATGAGCTATGGCTTAGAGCTCTTCTTTCTATGCTCCTTCAACTTGTTGCTTCACTTTACATCTTCTTACTTGCCCTACAGCCAACTTCTCTCAACTATGTAGCCATTCCAGTTTTTGTTGCTGGCATAATCAAGTATGATGAGAAGATTTGGGCTTTCAGATCTGCTTCCGCCGAACGCTTGCGAGATTTTCTTGTTGTTTCAAATCATTCTCCAATCACTATCCATAATCAGGAGGAAGTGCTAGATCTTCAGATGCTTCACCTTACTTACAACTTTTTCAATAGAGACAAAAGGTTGTTTGTAGGTTTAGGTCCGACTTCCTATGATCGCCATCAAAACCGTCTTTGCTATTATGACAAATTCAACTCTAAATCCGTCTTTAAAATCATTGAGCTTGAACTTGGATTTATGTATGATTTCTTCTACACAAAATCCTTCATCAACCACTCTATGTGGGGTTGTCTTCTACGCCTCACAACCTTTTCTTCTCTTGTTAGCCATTGTCATCTATTGTATAATTGATAAGCATGAATATCCTTTAAAGTATGTACGTCTTATATTTCTCCTCTTCTTTGGAGCTTTGGGGCTTGAAATCTACTCACTTTTTTTGTTTCTATACTCTGATTGGAATGTATTATGGTTGACCACTCAATCCCCTTCTCATTCCCTGCCTCGTTTGGCTTTTAAACTCATCTCTCTTTGTGGATGGTCTGTCAAGAAAAGAAGATGCTCTAATTCTATTTTTCAGTACAATCTAATATCTCATTGTTTGAAACAGAAAAACCAAAGCTATTATTCCAAGTTCACCAAAAGCATGGCAGCATTTTCAGTACAACGACGTCCCATCTCGAATAACCTCGAAACACAAATCTTCCAACAACTAAAGCAAAAGTTGGAAGGATCATCAAGTACATTAGAAGACAGCAAGAAGGTAATTATTAATGAAGTTGGTTGGAGCCTAAATTTTGATTTGGACGAAACCATCCTCCTCTGGCACATCGCTACCGATATTTGCTATCATTCTTCAAAAATTGAAGAAAGGGCGGAAGAATCATCAAAGTCAAGCATATTGGTGTCTAATTTCTTGGCTTACCTTGTAGTGCACTGTCCATCCTTATTTCCCAGTGGAATGAGTCAAATAAGGCATAAAGCCACTAGTGAACATGTCCTTCAATTTGTGGAAGACAAGAAGTTAATGTTGAAGATCAATTTGGAGTTGAAGATTGAGGAAGCTAAGGAAAGCAAGTCAATGTTGTTTGATGCTTGTCGTGTTGCAAGGCAGCTTGAGAAAGTAGAAGGGTTAAAGAAGTGGGAGATAATAGGGAATGTGTGGGTAGAATTGTTAGCACGTATTTCATGTGAATGTGAATGGTATGACCATGCTAAAATGCTTACACAAGGAGGTAATTTGTTAACACGTGTCTGGATTTTGATGCATCATCTTGGATATCTCAAACCAGCCGATGTCTTGACCATGGAAGAACATCGACCACTTCTAGACCATGAAATCATACCTGATTCTGTGGTCGCTCAAATGTTTGATGTTATATTTAATATTGCCTCCCTCTAA

mRNA sequence

ATGGGGTTGAAGAGAAAACACAGCTCAAGCCAATTCCTTCGCATGTTTCTCTTATTTACATACTCCTTTGCTGATTGGATCACCAATTTTTCATTCGGCATACTCGTCGAAAAATATGGTAGCGGCTGCTACGACGAGTTCTTTCACCCCACATACATCATTAGAGCCTTCTTAGCTCCATTTTTACTACTTCACTTAGGTGGCTCTGACACTATCACTGCCTACTCAATGGAAGACAATGAGCTATGGCTTAGAGCTCTTCTTTCTATGCTCCTTCAACTTGTTGCTTCACTTTACATCTTCTTACTTGCCCTACAGCCAACTTCTCTCAACTATGTAGCCATTCCAGTTTTTGTTGCTGGCATAATCAAGTATGATGAGAAGATTTGGGCTTTCAGATCTGCTTCCGCCGAACGCTTGCGAGATTTTCTTGTTGTTTCAAATCATTCTCCAATCACTATCCATAATCAGGAGGAAGTGCTAGATCTTCAGATGCTTCACCTTACTTACAACTTTTTCAATAGAGACAAAAGGTTGTTTGTAGGTTTAGGTCCGACTTCCTATGATCGCCATCAAAACCGTCTTTGCTATTATGACAAATTCAACTCTAAATCCGTCTTTAAAATCATTGAGCTTGAACTTGGATTTATGTATGATTTCTTCTACACAAAATCCTTCATCAACCACTCTATGTGGGGTTGTCTTCTACGCCTCACAACCTTTTCTTCTCTTAAAAACCAAAGCTATTATTCCAAGTTCACCAAAAGCATGGCAGCATTTTCAGTACAACGACGTCCCATCTCGAATAACCTCGAAACACAAATCTTCCAACAACTAAAGCAAAAGTTGGAAGGATCATCAAGTACATTAGAAGACAGCAAGAAGGTAATTATTAATGAAGTTGGTTGGAGCCTAAATTTTGATTTGGACGAAACCATCCTCCTCTGGCACATCGCTACCGATATTTGCTATCATTCTTCAAAAATTGAAGAAAGGGCGGAAGAATCATCAAAGTCAAGCATATTGGTGTCTAATTTCTTGGCTTACCTTGTAGTGCACTGTCCATCCTTATTTCCCAGTGGAATGAGTCAAATAAGGCATAAAGCCACTAGTGAACATGTCCTTCAATTTGTGGAAGACAAGAAGTTAATGTTGAAGATCAATTTGGAGTTGAAGATTGAGGAAGCTAAGGAAAGCAAGTCAATGTTGTTTGATGCTTGTCGTGTTGCAAGGCAGCTTGAGAAAGTAGAAGGGTTAAAGAAGTGGGAGATAATAGGGAATGTGTGGGTAGAATTGTTAGCACGTATTTCATGTGAATGTGAATGGTATGACCATGCTAAAATGCTTACACAAGGAGGTAATTTGTTAACACGTGTCTGGATTTTGATGCATCATCTTGGATATCTCAAACCAGCCGATGTCTTGACCATGGAAGAACATCGACCACTTCTAGACCATGAAATCATACCTGATTCTGTGGTCGCTCAAATGTTTGATGTTATATTTAATATTGCCTCCCTCTAA

Coding sequence (CDS)

ATGGGGTTGAAGAGAAAACACAGCTCAAGCCAATTCCTTCGCATGTTTCTCTTATTTACATACTCCTTTGCTGATTGGATCACCAATTTTTCATTCGGCATACTCGTCGAAAAATATGGTAGCGGCTGCTACGACGAGTTCTTTCACCCCACATACATCATTAGAGCCTTCTTAGCTCCATTTTTACTACTTCACTTAGGTGGCTCTGACACTATCACTGCCTACTCAATGGAAGACAATGAGCTATGGCTTAGAGCTCTTCTTTCTATGCTCCTTCAACTTGTTGCTTCACTTTACATCTTCTTACTTGCCCTACAGCCAACTTCTCTCAACTATGTAGCCATTCCAGTTTTTGTTGCTGGCATAATCAAGTATGATGAGAAGATTTGGGCTTTCAGATCTGCTTCCGCCGAACGCTTGCGAGATTTTCTTGTTGTTTCAAATCATTCTCCAATCACTATCCATAATCAGGAGGAAGTGCTAGATCTTCAGATGCTTCACCTTACTTACAACTTTTTCAATAGAGACAAAAGGTTGTTTGTAGGTTTAGGTCCGACTTCCTATGATCGCCATCAAAACCGTCTTTGCTATTATGACAAATTCAACTCTAAATCCGTCTTTAAAATCATTGAGCTTGAACTTGGATTTATGTATGATTTCTTCTACACAAAATCCTTCATCAACCACTCTATGTGGGGTTGTCTTCTACGCCTCACAACCTTTTCTTCTCTTAAAAACCAAAGCTATTATTCCAAGTTCACCAAAAGCATGGCAGCATTTTCAGTACAACGACGTCCCATCTCGAATAACCTCGAAACACAAATCTTCCAACAACTAAAGCAAAAGTTGGAAGGATCATCAAGTACATTAGAAGACAGCAAGAAGGTAATTATTAATGAAGTTGGTTGGAGCCTAAATTTTGATTTGGACGAAACCATCCTCCTCTGGCACATCGCTACCGATATTTGCTATCATTCTTCAAAAATTGAAGAAAGGGCGGAAGAATCATCAAAGTCAAGCATATTGGTGTCTAATTTCTTGGCTTACCTTGTAGTGCACTGTCCATCCTTATTTCCCAGTGGAATGAGTCAAATAAGGCATAAAGCCACTAGTGAACATGTCCTTCAATTTGTGGAAGACAAGAAGTTAATGTTGAAGATCAATTTGGAGTTGAAGATTGAGGAAGCTAAGGAAAGCAAGTCAATGTTGTTTGATGCTTGTCGTGTTGCAAGGCAGCTTGAGAAAGTAGAAGGGTTAAAGAAGTGGGAGATAATAGGGAATGTGTGGGTAGAATTGTTAGCACGTATTTCATGTGAATGTGAATGGTATGACCATGCTAAAATGCTTACACAAGGAGGTAATTTGTTAACACGTGTCTGGATTTTGATGCATCATCTTGGATATCTCAAACCAGCCGATGTCTTGACCATGGAAGAACATCGACCACTTCTAGACCATGAAATCATACCTGATTCTGTGGTCGCTCAAATGTTTGATGTTATATTTAATATTGCCTCCCTCTAA

Protein sequence

MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSLKNQSYYSKFTKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINEVGWSLNFDLDETILLWHIATDICYHSSKIEERAEESSKSSILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKLMLKINLELKIEEAKESKSMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLGYLKPADVLTMEEHRPLLDHEIIPDSVVAQMFDVIFNIASL
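
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below shows that relationship on the first and last few codons of the CDS listed above. It assumes the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

prefix = translate("ATGGGGTTGAAGAGAAAACAC")  # first seven codons of the CDS
suffix = translate("GCCTCCCTCTAA")           # last four codons, including stop
print(prefix, suffix)  # MGLKRKH ASL
```

Both fragments agree with the start (MGLKRKH) and end (ASL) of the protein sequence shown above.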
Homology
BLAST of HG10007636 vs. NCBI nr
Match: KAA0037446.1 (uncharacterized protein E6C27_scaffold277G00320 [Cucumis melo var. makuwa] >TYK01920.1 uncharacterized protein E5676_scaffold808G00060 [Cucumis melo var. makuwa])

HSP 1 Score: 638.3 bits (1645), Expect = 5.8e-179
Identity = 369/623 (59.23%), Positives = 410/623 (65.81%), Query Frame = 0

Query: 1   MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAP 60
           MGLKRKHSSSQFLR FLL  YSF+DWI NFSF +LVE+YG+GCYD+F  P YIIRAFLA 
Sbjct: 34  MGLKRKHSSSQFLRFFLLIAYSFSDWIANFSFVMLVERYGTGCYDDFTDPNYIIRAFLAH 93

Query: 61  FLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLALQPTSLNYVAIPVFVA 120
           FLLLHLGGSDTITAYSMEDNELWLR LLS+L  L AS+YIFL AL PTSLNYV+IPV +A
Sbjct: 94  FLLLHLGGSDTITAYSMEDNELWLRTLLSLLAVLAASIYIFLQALLPTSLNYVSIPVIIA 153

Query: 121 GIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLF 180
           GIIK  EKIWA RSASAERLRDFL VS  SPIT HN+EEV D +MLH+ Y FFNRDKRLF
Sbjct: 154 GIIKNCEKIWALRSASAERLRDFLAVSTPSPITTHNEEEVQDFEMLHIAYYFFNRDKRLF 213

Query: 181 VGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTT 240
           VGLGPTSYD  QNRL YY+KF S S FKIIELELGFMYDFFYTK+ INHS+ G L RLTT
Sbjct: 214 VGLGPTSYDLQQNRLSYYEKFQSNSAFKIIELELGFMYDFFYTKASINHSLCGRLFRLTT 273

Query: 241 FSSL-------------------------------------------------------- 300
           FSSL                                                        
Sbjct: 274 FSSLFIAILTYCLIDKQEYPSTYVNLIFLLFFGALSIEIYSLFLILFSDWNVIWLLTTQS 333

Query: 301 ------------------------------------------KNQSYYSKF--TKSMAAF 360
                                                      + SYY KF  TK+MAAF
Sbjct: 334 PSNPLPRLALKLISLCGWSFKKRRCSNSISQYNLISHCLKQKNDDSYYCKFHNTKTMAAF 393

Query: 361 SVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINEVGWSLNFDLDETILLWHIAT 420
           SVQ RPISNNLE  IFQQLK+KL      L        NE+GWSL  DLD++ILLWHIAT
Sbjct: 394 SVQ-RPISNNLEAHIFQQLKKKL-----VLNQEYDSGYNEIGWSLKLDLDQSILLWHIAT 453

Query: 421 DICYHSSKIEERAEESSKS------SILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHV 480
           D CY+SS   + +EE S+S      SI +SNFLAY +VH PSLFPSGMSQIRHKATSE V
Sbjct: 454 DFCYYSSPKFKESEEYSESCIPPQDSISLSNFLAYFIVHHPSLFPSGMSQIRHKATSEDV 513

Query: 481 LQFVEDKKL------MLKINLELKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIG 504
           L+ ++DKKL      MLK NLELKIE  KE +  SM+ DACR+A  LEK+E  +KWEIIG
Sbjct: 514 LELLQDKKLGRCNSNMLK-NLELKIEVVKEERKESMVLDACRLAGILEKLEQSQKWEIIG 573
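
The percentages in an HSP summary are the match counts divided by the alignment length. The sketch below parses the two summary lines of the first hit and recomputes them. The function name `parse_hsp_summary` is hypothetical, and the regular expressions are written only against the line layout shown on this page.

```python
import re

def parse_hsp_summary(score_line, identity_line):
    """Extract scores and recompute percentages from an HSP summary."""
    score = re.search(
        r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)", score_line
    )
    ident = re.search(r"Identity = (\d+)/(\d+)", identity_line)
    # Tolerates spelling variants of the "Positives" label.
    posit = re.search(r"Pos\w*ives = (\d+)/(\d+)", identity_line)
    if not (score and ident and posit):
        raise ValueError("unrecognized HSP summary")
    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
        "identity_pct": round(100 * int(ident.group(1)) / int(ident.group(2)), 2),
        "positives_pct": round(100 * int(posit.group(1)) / int(posit.group(2)), 2),
    }

hsp = parse_hsp_summary(
    "HSP 1 Score: 638.3 bits (1645), Expect = 5.8e-179",
    "Identity = 369/623 (59.23%), Positives = 410/623 (65.81%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"])  # 59.23 65.81
```

The recomputed values match the percentages printed in the summary line.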

BLAST of HG10007636 vs. NCBI nr
Match: XP_008458716.1 (PREDICTED: uncharacterized protein LOC103498043 [Cucumis melo])

HSP 1 Score: 635.6 bits (1638), Expect = 3.7e-178
Identity = 367/623 (58.91%), Positives = 409/623 (65.65%), Query Frame = 0

Query: 1   MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAP 60
           MGLKRKHSSSQF R FLL  YSF+DWI NFSF +LVE+YG+GCYD+F  P YIIRAFLA 
Sbjct: 34  MGLKRKHSSSQFRRFFLLIAYSFSDWIANFSFVMLVERYGTGCYDDFTDPNYIIRAFLAH 93

Query: 61  FLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLALQPTSLNYVAIPVFVA 120
           FLLLHLGG DTITAYSMEDNELWLR LLS+L  L AS+YIFL AL PTSLNYV+IPV +A
Sbjct: 94  FLLLHLGGFDTITAYSMEDNELWLRTLLSLLAVLAASIYIFLQALLPTSLNYVSIPVIIA 153

Query: 121 GIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLF 180
           GIIK  EKIWA RSASAERLRDFL VS  SPIT HN+EEV D +MLH+ Y FFNRDKRLF
Sbjct: 154 GIIKNCEKIWALRSASAERLRDFLAVSTPSPITTHNEEEVQDFEMLHIAYYFFNRDKRLF 213

Query: 181 VGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTT 240
           VGLGPTSYD  QNRLCYY+KF S S FKIIELELGFMYDFFYTK+ INHS+ G L RLTT
Sbjct: 214 VGLGPTSYDLQQNRLCYYEKFQSNSAFKIIELELGFMYDFFYTKASINHSLCGRLFRLTT 273

Query: 241 FSSL-------------------------------------------------------- 300
           FSSL                                                        
Sbjct: 274 FSSLFIAILTYCLIDKQEYPSTYVNLIFLLFFGALSIEIYSLFLILFSDWNVIWLLTTQS 333

Query: 301 ------------------------------------------KNQSYYSKF--TKSMAAF 360
                                                      + SYY KF  TK+MAAF
Sbjct: 334 PSNPLPRLALKLISLCGWSFKKRRCSNSISQYNLISHCLKQKNDDSYYCKFHNTKTMAAF 393

Query: 361 SVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINEVGWSLNFDLDETILLWHIAT 420
           SVQ RPISNNLE  IFQQLK+KL      L        NE+GWSL  DLD++ILLWHIAT
Sbjct: 394 SVQ-RPISNNLEAHIFQQLKKKL-----VLNQEYDSGYNEIGWSLKLDLDQSILLWHIAT 453

Query: 421 DICYHSSKIEERAEESSKS------SILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHV 480
           D CY+SS   + +EE S+S      SI +SNFLAY +VH PSLFPS MSQIRHKATSE V
Sbjct: 454 DFCYYSSPKFKESEEYSESCIPPQDSISLSNFLAYFIVHHPSLFPSRMSQIRHKATSEDV 513

Query: 481 LQFVEDKKL------MLKINLELKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIG 504
           L+ ++DKKL      MLK NLELKIE  KE +  SM+ DACR+A  LEK+E  +KWEIIG
Sbjct: 514 LELLQDKKLGRCNSNMLK-NLELKIEVVKEERKESMVLDACRLAGILEKLEQSQKWEIIG 573

BLAST of HG10007636 vs. NCBI nr
Match: XP_004139148.1 (uncharacterized protein LOC101222078 [Cucumis sativus] >KGN66604.1 hypothetical protein Csa_007023 [Cucumis sativus])

HSP 1 Score: 628.6 bits (1620), Expect = 4.6e-176
Identity = 360/624 (57.69%), Positives = 409/624 (65.54%), Query Frame = 0

Query: 1   MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAP 60
           +GLKRK SSSQFLR FLL  Y+F+DWI NFSF +LVE+YG+GCYD+F  P Y+IRAFLA 
Sbjct: 34  LGLKRKCSSSQFLRFFLLIAYTFSDWIANFSFVMLVERYGTGCYDDFTDPMYMIRAFLAH 93

Query: 61  FLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLALQPTSLNYVAIPVFVA 120
           FLLLHLGGSDTITAYSMEDNELWLR LLSML  L AS+YIFL AL PTSLNY++IPV +A
Sbjct: 94  FLLLHLGGSDTITAYSMEDNELWLRTLLSMLAILAASIYIFLQALLPTSLNYISIPVIIA 153

Query: 121 GIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLF 180
           G+IK  EKIWA RSASAERLRDFL VS  SPIT HN+EEV D ++L + Y FF RDKRLF
Sbjct: 154 GVIKNSEKIWALRSASAERLRDFLAVSTPSPITTHNEEEVQDFEVLRIAYYFFIRDKRLF 213

Query: 181 VGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTT 240
           VGLGPTSYD  QNRL YY+KF SKS FKIIELELGFMYDFFYTK+ INHS+ G L RLTT
Sbjct: 214 VGLGPTSYDLQQNRLSYYEKFESKSAFKIIELELGFMYDFFYTKASINHSLCGRLFRLTT 273

Query: 241 FSSL-------------------------------------------------------- 300
           FSSL                                                        
Sbjct: 274 FSSLVIAFLTYCLIDKQEYPSTYVNLIFLLFSGALSIEIYSLFLFLFSDWNVIWLLTTQS 333

Query: 301 -----------------------------------------KNQSYYSKF--TKSMAAFS 360
                                                    KN SYY KF  TK++AAFS
Sbjct: 334 PSNPLPRLALKLISLCGWSLKKRRCSNSISQYNLISHCLEQKNDSYYFKFPSTKTIAAFS 393

Query: 361 VQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINEVGWSLNFDLDETILLWHIATD 420
           VQ RPISNNLE  IFQQLKQKL      L        NE+GWSL  DLD++IL+WHIATD
Sbjct: 394 VQ-RPISNNLEAHIFQQLKQKL-----VLNQEYDYGYNEIGWSLKLDLDQSILIWHIATD 453

Query: 421 ICYHSSKIEERAEESS-----KSSILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQ 480
            CYHSS   + +EES      + S+ +SNFLAY +VH PSLFPSGMSQIRHKATSEHVL+
Sbjct: 454 FCYHSSPKFKESEESKSCIPPQDSVSLSNFLAYFIVHHPSLFPSGMSQIRHKATSEHVLE 513

Query: 481 FVEDKKL------MLKINLELKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIGNV 507
            ++D+KL      MLK NLEL IE  KE +  S + DA R+A  LEK+E  +KWEIIGNV
Sbjct: 514 LLQDEKLDRCRSNMLK-NLELNIEVVKEERKESRVLDAFRLAGFLEKLEQSQKWEIIGNV 573

BLAST of HG10007636 vs. NCBI nr
Match: XP_038880416.1 (uncharacterized protein LOC120072067 [Benincasa hispida])

HSP 1 Score: 540.8 bits (1392), Expect = 1.3e-149
Identity = 302/518 (58.30%), Positives = 341/518 (65.83%), Query Frame = 0

Query: 1   MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAP 60
           MGL RK +S+QFLR+FLLF YSFADWIT+FSFGILVEKYGSGCYDEF  PTYIIRA LAP
Sbjct: 34  MGLMRKRTSNQFLRLFLLFAYSFADWITSFSFGILVEKYGSGCYDEFTDPTYIIRALLAP 93

Query: 61  FLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLALQPTSLNYVAIPVFVA 120
           FLLLHLGGSDTITAYSMED ELWLR LL ML QL+AS Y+FLLALQPTSL YVAIP+FVA
Sbjct: 94  FLLLHLGGSDTITAYSMEDKELWLRTLLPMLDQLLASFYLFLLALQPTSLKYVAIPIFVA 153

Query: 121 GIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLF 180
           GIIKY EKIWA R+ASAERLRDF+ VS  S I  H+QEE+ D+QMLH  Y+FFN+DKR+F
Sbjct: 154 GIIKYGEKIWALRTASAERLRDFVAVSTPSTIITHDQEELKDVQMLHTAYHFFNKDKRMF 213

Query: 181 VGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTT 240
           VGLGPTS+DRHQN L YY++FNSK  FKIIELELGFMYDFFYTK+ INHS  G L  L T
Sbjct: 214 VGLGPTSFDRHQNGLSYYEEFNSKLPFKIIELELGFMYDFFYTKTSINHSRRGLLFLLIT 273

Query: 241 FSSLKNQSYYSKFTKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINE 300
           FSSL                                                        
Sbjct: 274 FSSL-------------------------------------------------------- 333

Query: 301 VGWSLNFDLDETILLWHIATDICYHSSKIEERAEESSKSSILVSNFLAYLVVHCPSLFPS 360
                                                         +  +V +C      
Sbjct: 334 ----------------------------------------------VIAIVTYC------ 393

Query: 361 GMSQIRHKATSEHVLQFVEDKKL-----MLKINLELKIE------EAKESKSMLFDACRV 420
            M   + K TSEHVL+ ++DKKL     MLK N+ELKIE      E + +KSML D CR+
Sbjct: 394 -MIDKQPKDTSEHVLELLQDKKLGRKSSMLK-NMELKIEVNNVEKEQRNNKSMLLDGCRL 441

Query: 421 ARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLGYL 480
           ARQLE++E  KKWEIIGNVW+ELL RISCECEWYDHAK LTQGGNLLTRVWILMHHLGY+
Sbjct: 454 ARQLEEIEESKKWEIIGNVWMELLGRISCECEWYDHAKHLTQGGNLLTRVWILMHHLGYV 441

Query: 481 KPADVLTMEEHRPLLDHEIIPDSVVAQMFDVIFNIASL 508
           KP++V TMEE +PLLDHEI+PD V+ QMFDVIFNI SL
Sbjct: 514 KPSNVFTMEEDQPLLDHEILPDYVLEQMFDVIFNIVSL 441

BLAST of HG10007636 vs. NCBI nr
Match: XP_022141971.1 (uncharacterized protein LOC111012216 [Momordica charantia])

HSP 1 Score: 389.8 bits (1000), Expect = 3.6e-104
Identity = 251/628 (39.97%), Positives = 334/628 (53.18%), Query Frame = 0

Query: 1   MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAP 60
           MGL+RK+SS+  LR+ LL  Y  ADW    S G LV+ YGS   D FF   +I    LAP
Sbjct: 37  MGLRRKYSSNNALRLLLLLFYLSADWAATTSLGTLVKFYGS-YEDAFFGRLFI----LAP 96

Query: 61  FLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLALQPTSLNYVAIPVFVA 120
           F+LLHLGGSDTITAYSMEDN+LW R+     +Q+  + YI LLALQP  L+++ IP+FVA
Sbjct: 97  FMLLHLGGSDTITAYSMEDNDLWYRSFFGFFVQVGIAFYILLLALQPQHLDFLGIPIFVA 156

Query: 121 GIIKYDEKIWAFRSASAERLRDFLV-VSNHSPITIH----------------------NQ 180
           GIIKY E+IW FRS S +RL D L+  +  SPI I+                        
Sbjct: 157 GIIKYGERIWVFRSTSTQRLPDLLLSTTRFSPIQINAAKSKHLHLEFPIQINVDHEATQN 216

Query: 181 EEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFM 240
                L +LH+ Y FF  +K LFV L  TSYD  Q+ L Y+ +F+S+  FK+IELELGFM
Sbjct: 217 PHFSHLHLLHIAYYFFKTNKFLFVDLTLTSYDLQQS-LHYFMQFDSREAFKVIELELGFM 276

Query: 241 YDFFYTKSFINHSMWGCLLRLTTFSS---------------------------------- 300
           YDFFYTK+ I HS WG +LRLTT  S                                  
Sbjct: 277 YDFFYTKASIIHSRWGPILRLTTLFSIVVVIVTFHIDHFDPGFNNPLTNILTLILLYGAL 336

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 337 SLEISSFILFLCSDWNVIRLTKSSYSLAHLTFKAISRCGWSVKKYRWSNSVRQYNLISCC 396

Query: 361 LKNQSY--YSKF--TKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKV--- 420
           LK   Y  Y K+  T  ++      R IS+ L+T+IFQQL QKLE +    E+++K+   
Sbjct: 397 LKETKYGRYCKYFRTSYISKIMTASRNISDELKTRIFQQLTQKLEVN----EENRKLPGW 456

Query: 421 ------IINEVGWSLNFDLDETILLWHIATDICYHSSKIEERAEES-SKSSILVSNFLAY 480
                   N++GWSL  D D++ILLWHIAT+ICYH  K  E +  S  +   L+S+FL Y
Sbjct: 457 ILRKHNCYNQLGWSLELDSDQSILLWHIATNICYHRDKETEASNCSLLEDGTLLSDFLTY 516

Query: 481 LVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKLM--------LKINLELKIEEAKESKS 489
           L+V+  SLF  GMS+IR   T +  ++F++ +K +          ++LE          S
Sbjct: 517 LLVYHHSLFLDGMSEIRFCETVDSAIEFMQQRKSIETTSDACKSMLDLETSTVYKDAGNS 576

BLAST of HG10007636 vs. ExPASy TrEMBL
Match: A0A5D3BS41 (DUF4220 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold808G00060 PE=4 SV=1)

HSP 1 Score: 638.3 bits (1645), Expect = 2.8e-179
Identity = 369/623 (59.23%), Positives = 410/623 (65.81%), Query Frame = 0

Query: 1   MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAP 60
           MGLKRKHSSSQFLR FLL  YSF+DWI NFSF +LVE+YG+GCYD+F  P YIIRAFLA 
Sbjct: 34  MGLKRKHSSSQFLRFFLLIAYSFSDWIANFSFVMLVERYGTGCYDDFTDPNYIIRAFLAH 93

Query: 61  FLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLALQPTSLNYVAIPVFVA 120
           FLLLHLGGSDTITAYSMEDNELWLR LLS+L  L AS+YIFL AL PTSLNYV+IPV +A
Sbjct: 94  FLLLHLGGSDTITAYSMEDNELWLRTLLSLLAVLAASIYIFLQALLPTSLNYVSIPVIIA 153

Query: 121 GIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLF 180
           GIIK  EKIWA RSASAERLRDFL VS  SPIT HN+EEV D +MLH+ Y FFNRDKRLF
Sbjct: 154 GIIKNCEKIWALRSASAERLRDFLAVSTPSPITTHNEEEVQDFEMLHIAYYFFNRDKRLF 213

Query: 181 VGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTT 240
           VGLGPTSYD  QNRL YY+KF S S FKIIELELGFMYDFFYTK+ INHS+ G L RLTT
Sbjct: 214 VGLGPTSYDLQQNRLSYYEKFQSNSAFKIIELELGFMYDFFYTKASINHSLCGRLFRLTT 273

Query: 241 FSSL-------------------------------------------------------- 300
           FSSL                                                        
Sbjct: 274 FSSLFIAILTYCLIDKQEYPSTYVNLIFLLFFGALSIEIYSLFLILFSDWNVIWLLTTQS 333

Query: 301 ------------------------------------------KNQSYYSKF--TKSMAAF 360
                                                      + SYY KF  TK+MAAF
Sbjct: 334 PSNPLPRLALKLISLCGWSFKKRRCSNSISQYNLISHCLKQKNDDSYYCKFHNTKTMAAF 393

Query: 361 SVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINEVGWSLNFDLDETILLWHIAT 420
           SVQ RPISNNLE  IFQQLK+KL      L        NE+GWSL  DLD++ILLWHIAT
Sbjct: 394 SVQ-RPISNNLEAHIFQQLKKKL-----VLNQEYDSGYNEIGWSLKLDLDQSILLWHIAT 453

Query: 421 DICYHSSKIEERAEESSKS------SILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHV 480
           D CY+SS   + +EE S+S      SI +SNFLAY +VH PSLFPSGMSQIRHKATSE V
Sbjct: 454 DFCYYSSPKFKESEEYSESCIPPQDSISLSNFLAYFIVHHPSLFPSGMSQIRHKATSEDV 513

Query: 481 LQFVEDKKL------MLKINLELKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIG 504
           L+ ++DKKL      MLK NLELKIE  KE +  SM+ DACR+A  LEK+E  +KWEIIG
Sbjct: 514 LELLQDKKLGRCNSNMLK-NLELKIEVVKEERKESMVLDACRLAGILEKLEQSQKWEIIG 573

BLAST of HG10007636 vs. ExPASy TrEMBL
Match: A0A1S3C8L7 (uncharacterized protein LOC103498043 OS=Cucumis melo OX=3656 GN=LOC103498043 PE=4 SV=1)

HSP 1 Score: 635.6 bits (1638), Expect = 1.8e-178
Identity = 367/623 (58.91%), Positives = 409/623 (65.65%), Query Frame = 0

Query: 1   MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAP 60
           MGLKRKHSSSQF R FLL  YSF+DWI NFSF +LVE+YG+GCYD+F  P YIIRAFLA 
Sbjct: 34  MGLKRKHSSSQFRRFFLLIAYSFSDWIANFSFVMLVERYGTGCYDDFTDPNYIIRAFLAH 93

Query: 61  FLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLALQPTSLNYVAIPVFVA 120
           FLLLHLGG DTITAYSMEDNELWLR LLS+L  L AS+YIFL AL PTSLNYV+IPV +A
Sbjct: 94  FLLLHLGGFDTITAYSMEDNELWLRTLLSLLAVLAASIYIFLQALLPTSLNYVSIPVIIA 153

Query: 121 GIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLF 180
           GIIK  EKIWA RSASAERLRDFL VS  SPIT HN+EEV D +MLH+ Y FFNRDKRLF
Sbjct: 154 GIIKNCEKIWALRSASAERLRDFLAVSTPSPITTHNEEEVQDFEMLHIAYYFFNRDKRLF 213

Query: 181 VGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTT 240
           VGLGPTSYD  QNRLCYY+KF S S FKIIELELGFMYDFFYTK+ INHS+ G L RLTT
Sbjct: 214 VGLGPTSYDLQQNRLCYYEKFQSNSAFKIIELELGFMYDFFYTKASINHSLCGRLFRLTT 273

Query: 241 FSSL-------------------------------------------------------- 300
           FSSL                                                        
Sbjct: 274 FSSLFIAILTYCLIDKQEYPSTYVNLIFLLFFGALSIEIYSLFLILFSDWNVIWLLTTQS 333

Query: 301 ------------------------------------------KNQSYYSKF--TKSMAAF 360
                                                      + SYY KF  TK+MAAF
Sbjct: 334 PSNPLPRLALKLISLCGWSFKKRRCSNSISQYNLISHCLKQKNDDSYYCKFHNTKTMAAF 393

Query: 361 SVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINEVGWSLNFDLDETILLWHIAT 420
           SVQ RPISNNLE  IFQQLK+KL      L        NE+GWSL  DLD++ILLWHIAT
Sbjct: 394 SVQ-RPISNNLEAHIFQQLKKKL-----VLNQEYDSGYNEIGWSLKLDLDQSILLWHIAT 453

Query: 421 DICYHSSKIEERAEESSKS------SILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHV 480
           D CY+SS   + +EE S+S      SI +SNFLAY +VH PSLFPS MSQIRHKATSE V
Sbjct: 454 DFCYYSSPKFKESEEYSESCIPPQDSISLSNFLAYFIVHHPSLFPSRMSQIRHKATSEDV 513

Query: 481 LQFVEDKKL------MLKINLELKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIG 504
           L+ ++DKKL      MLK NLELKIE  KE +  SM+ DACR+A  LEK+E  +KWEIIG
Sbjct: 514 LELLQDKKLGRCNSNMLK-NLELKIEVVKEERKESMVLDACRLAGILEKLEQSQKWEIIG 573

BLAST of HG10007636 vs. ExPASy TrEMBL
Match: A0A0A0LZZ2 (DUF4220 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G638510 PE=4 SV=1)

HSP 1 Score: 628.6 bits (1620), Expect = 2.2e-176
Identity = 360/624 (57.69%), Positives = 409/624 (65.54%), Query Frame = 0

Query: 1   MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAP 60
           +GLKRK SSSQFLR FLL  Y+F+DWI NFSF +LVE+YG+GCYD+F  P Y+IRAFLA 
Sbjct: 34  LGLKRKCSSSQFLRFFLLIAYTFSDWIANFSFVMLVERYGTGCYDDFTDPMYMIRAFLAH 93

Query: 61  FLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLALQPTSLNYVAIPVFVA 120
           FLLLHLGGSDTITAYSMEDNELWLR LLSML  L AS+YIFL AL PTSLNY++IPV +A
Sbjct: 94  FLLLHLGGSDTITAYSMEDNELWLRTLLSMLAILAASIYIFLQALLPTSLNYISIPVIIA 153

Query: 121 GIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLF 180
           G+IK  EKIWA RSASAERLRDFL VS  SPIT HN+EEV D ++L + Y FF RDKRLF
Sbjct: 154 GVIKNSEKIWALRSASAERLRDFLAVSTPSPITTHNEEEVQDFEVLRIAYYFFIRDKRLF 213

Query: 181 VGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTT 240
           VGLGPTSYD  QNRL YY+KF SKS FKIIELELGFMYDFFYTK+ INHS+ G L RLTT
Sbjct: 214 VGLGPTSYDLQQNRLSYYEKFESKSAFKIIELELGFMYDFFYTKASINHSLCGRLFRLTT 273

Query: 241 FSSL-------------------------------------------------------- 300
           FSSL                                                        
Sbjct: 274 FSSLVIAFLTYCLIDKQEYPSTYVNLIFLLFSGALSIEIYSLFLFLFSDWNVIWLLTTQS 333

Query: 301 -----------------------------------------KNQSYYSKF--TKSMAAFS 360
                                                    KN SYY KF  TK++AAFS
Sbjct: 334 PSNPLPRLALKLISLCGWSLKKRRCSNSISQYNLISHCLEQKNDSYYFKFPSTKTIAAFS 393

Query: 361 VQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINEVGWSLNFDLDETILLWHIATD 420
           VQ RPISNNLE  IFQQLKQKL      L        NE+GWSL  DLD++IL+WHIATD
Sbjct: 394 VQ-RPISNNLEAHIFQQLKQKL-----VLNQEYDYGYNEIGWSLKLDLDQSILIWHIATD 453

Query: 421 ICYHSSKIEERAEESS-----KSSILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQ 480
            CYHSS   + +EES      + S+ +SNFLAY +VH PSLFPSGMSQIRHKATSEHVL+
Sbjct: 454 FCYHSSPKFKESEESKSCIPPQDSVSLSNFLAYFIVHHPSLFPSGMSQIRHKATSEHVLE 513

Query: 481 FVEDKKL------MLKINLELKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIGNV 507
            ++D+KL      MLK NLEL IE  KE +  S + DA R+A  LEK+E  +KWEIIGNV
Sbjct: 514 LLQDEKLDRCRSNMLK-NLELNIEVVKEERKESRVLDAFRLAGFLEKLEQSQKWEIIGNV 573

BLAST of HG10007636 vs. ExPASy TrEMBL
Match: A0A6J1CKT2 (uncharacterized protein LOC111012216 OS=Momordica charantia OX=3673 GN=LOC111012216 PE=4 SV=1)

HSP 1 Score: 389.8 bits (1000), Expect = 1.7e-104
Identity = 251/628 (39.97%), Positives = 334/628 (53.18%), Query Frame = 0

Query: 1   MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAP 60
           MGL+RK+SS+  LR+ LL  Y  ADW    S G LV+ YGS   D FF   +I    LAP
Sbjct: 37  MGLRRKYSSNNALRLLLLLFYLSADWAATTSLGTLVKFYGS-YEDAFFGRLFI----LAP 96

Query: 61  FLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLALQPTSLNYVAIPVFVA 120
           F+LLHLGGSDTITAYSMEDN+LW R+     +Q+  + YI LLALQP  L+++ IP+FVA
Sbjct: 97  FMLLHLGGSDTITAYSMEDNDLWYRSFFGFFVQVGIAFYILLLALQPQHLDFLGIPIFVA 156

Query: 121 GIIKYDEKIWAFRSASAERLRDFLV-VSNHSPITIH----------------------NQ 180
           GIIKY E+IW FRS S +RL D L+  +  SPI I+                        
Sbjct: 157 GIIKYGERIWVFRSTSTQRLPDLLLSTTRFSPIQINAAKSKHLHLEFPIQINVDHEATQN 216

Query: 181 EEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFM 240
                L +LH+ Y FF  +K LFV L  TSYD  Q+ L Y+ +F+S+  FK+IELELGFM
Sbjct: 217 PHFSHLHLLHIAYYFFKTNKFLFVDLTLTSYDLQQS-LHYFMQFDSREAFKVIELELGFM 276

Query: 241 YDFFYTKSFINHSMWGCLLRLTTFSS---------------------------------- 300
           YDFFYTK+ I HS WG +LRLTT  S                                  
Sbjct: 277 YDFFYTKASIIHSRWGPILRLTTLFSIVVVIVTFHIDHFDPGFNNPLTNILTLILLYGAL 336

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 337 SLEISSFILFLCSDWNVIRLTKSSYSLAHLTFKAISRCGWSVKKYRWSNSVRQYNLISCC 396

Query: 361 LKNQSY--YSKF--TKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKV--- 420
           LK   Y  Y K+  T  ++      R IS+ L+T+IFQQL QKLE +    E+++K+   
Sbjct: 397 LKETKYGRYCKYFRTSYISKIMTASRNISDELKTRIFQQLTQKLEVN----EENRKLPGW 456

Query: 421 ------IINEVGWSLNFDLDETILLWHIATDICYHSSKIEERAEES-SKSSILVSNFLAY 480
                   N++GWSL  D D++ILLWHIAT+ICYH  K  E +  S  +   L+S+FL Y
Sbjct: 457 ILRKHNCYNQLGWSLELDSDQSILLWHIATNICYHRDKETEASNCSLLEDGTLLSDFLTY 516

Query: 481 LVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKLM--------LKINLELKIEEAKESKS 489
           L+V+  SLF  GMS+IR   T +  ++F++ +K +          ++LE          S
Sbjct: 517 LLVYHHSLFLDGMSEIRFCETVDSAIEFMQQRKSIETTSDACKSMLDLETSTVYKDAGNS 576

BLAST of HG10007636 vs. ExPASy TrEMBL
Match: A0A5B7BVN3 (DUF4220 domain-containing protein OS=Davidia involucrata OX=16924 GN=Din_042423 PE=4 SV=1)

HSP 1 Score: 288.9 bits (738), Expect = 4.2e-74
Identity = 197/633 (31.12%), Positives = 307/633 (48.50%), Query Frame = 0

Query: 1   MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAP 60
           +G +RK+++  +LR+ L  +Y  ADW+   + G+L    G     +   P+Y+I AF AP
Sbjct: 34  LGNRRKYNTRIWLRIILWLSYLSADWVATVALGVLSNSQGD---SDSEGPSYVIMAFWAP 93

Query: 61  FLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLALQPTSLNYVAIPVFVA 120
           FLL+HLGG DTITAY++EDNELWLR LL +++Q+  +LY+F+ +L PT LN+VA+P+FVA
Sbjct: 94  FLLVHLGGPDTITAYALEDNELWLRHLLGLVVQVGVALYVFVRSLNPTDLNFVALPIFVA 153

Query: 121 GIIKYDEKIWAFRSASAERLRDFLVVS--------------------------NHSPITI 180
           GIIKY E+ W  RSAS++  R+ L+                              +P   
Sbjct: 154 GIIKYGERTWVLRSASSQHFRESLLPRPDPGPNFAKFMEEYNLKDREGYKLSWTVTPAPT 213

Query: 181 HNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELEL 240
                  D   L+  Y+FF   +RLF  L   S+   +  L ++ + + ++ F++IE+EL
Sbjct: 214 TTAHYYADATFLNAAYDFFLTYRRLFADL-ILSFQDLEKSLSFFQESSWENAFQVIEVEL 273

Query: 241 GFMYDFFYTKSFINHSMWGCLLRLTTFSSL------------------------------ 300
           GFMYD  YTK+ I +S+WG  LR T  SS                               
Sbjct: 274 GFMYDILYTKATIVYSVWGVFLRFTCLSSTIIALVVFCTIDWHGYSQVDVGISFLLLLGA 333

Query: 301 ---------------------------------KNQSYYS------KFTKSMAAFSV--- 360
                                            K  S ++      K++ SMA +++   
Sbjct: 334 IGLEIYAILLLLSSDWTLLWFSKHDNLGVNLIKKVISPFNCVTSKRKWSNSMAQYNLLSS 393

Query: 361 ----------------------------QRRPISNNLETQIFQQLKQKLEGSSSTLEDSK 420
                                           +S NL+  IF+QL +  +G+S    D K
Sbjct: 394 CFKYKPAMCCKILKCACIHRIIDDYLYESSEDVSPNLKESIFKQLVENSKGASE-FRDCK 453

Query: 421 KV-------------IINEVGWSLNFDLDETILLWHIATDICYHSSKIEERA----EESS 468
           K+              + ++GWS+  + D +ILLWHIATD+CY+S   +E A        
Sbjct: 454 KLCACRGEQVLQKHDCLEKLGWSVKDEFDYSILLWHIATDLCYYSDYGDEGANFVPHAKC 513

BLAST of HG10007636 vs. TAIR 10
Match: AT5G45540.1 (Protein of unknown function (DUF594))

HSP 1 Score: 134.4 bits (337), Expect = 2.5e-31
Identity = 165/749 (22.03%), Positives = 272/749 (36.32%), Query Frame = 0

Query: 5   RKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLL 64
           R+ ++ +   + +   Y  ADW  +++ G + +                + AF +PFLLL
Sbjct: 39  RRRTAKKLFLVLIWSAYLLADWAADYAVGQISDSQEEEAESNKPSKNRELLAFWSPFLLL 98

Query: 65  HLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLALQPTSLNYVAIPVFVAGIIK 124
           HLGG DTITA ++EDNELW R L S++ Q VA++Y+ LL++ P  L    + +FV G+IK
Sbjct: 99  HLGGPDTITALALEDNELWDRHLFSLVCQAVATVYVILLSI-PNRLLTPTLIMFVGGVIK 158

Query: 125 YDEKIWAFRSASAERLRDFLV-----VSNHSPI--------------------------- 184
           Y E+  A  SAS ++ +D ++      +N++ +                           
Sbjct: 159 YVERTAALFSASLDKFKDSMLDDPDPGANYAKLMEEYEARKKMNMPTDVIVVKDPEKGRE 218

Query: 185 ---TIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKI 244
               +    E+  LQ++   Y +FN  K L V L  T+ +R ++R  ++DK  ++   +I
Sbjct: 219 GNTPVRPDNELTALQVIQYAYKYFNIFKGLIVDLIFTNQERDESRK-FFDKLTAEEALRI 278

Query: 245 IELELGFMYDFFYTKSFINHSMWGCLLR----------LTTFSSLKNQSY---------- 304
           IE+ELG +YD  +TK+ I H+  G + R          L  F   K   Y          
Sbjct: 279 IEVELGLIYDCLFTKAEILHNWTGAVFRFIALGCLVASLCLFKMNKKDQYDGFDVVLTYA 338

Query: 305 ------------------------------------------------------------ 364
                                                                       
Sbjct: 339 LLICGIALDSIALLMFCVSDWTIARLRKLKEDLEEKDTLTDRVLNWILDFKTLRWKRSKC 398

Query: 365 ---------------------------------------------YSKFTKSMAAFSVQ- 424
                                                        +S F +++   S+  
Sbjct: 399 SQDGHQVLNRNFMFRRWSEYVHAYNLIGFCLGIRPKRIHYTKGKIHSFFHQTVHILSIDT 458

Query: 425 ------------------------------------------------------------ 468
                                                                       
Sbjct: 459 AIENATRGTRQFHNWIGRFLSNLSKRDNSVIRTGLRWFLFFPQLLGLLIYNFLDFFGIKD 518

BLAST of HG10007636 vs. TAIR 10
Match: AT5G45530.1 (Protein of unknown function (DUF594))

HSP 1 Score: 127.9 bits (320), Expect = 2.4e-29
Identity = 158/743 (21.27%), Positives = 263/743 (35.40%), Query Frame = 0

Query: 5   RKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLL 64
           RK +S + L   L   Y  ADW  N++   + +  G             + A  APFLLL
Sbjct: 38  RKRTSKKLLAAVLWTAYLLADWTANYAVSQITKNQGKETEPGDPPKNKKLLALWAPFLLL 97

Query: 65  HLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLALQPTSLNYVAIPVFVAGIIK 124
           HLGG DTITA ++EDN LW R L  ++ Q +A +Y  + +L+      + + +F+ G IK
Sbjct: 98  HLGGPDTITALALEDNALWQRHLFGLVSQALAGVYAVVQSLENVLWPPITL-LFITGTIK 157

Query: 125 YDEKIWAFRSASAERLRDFLVV-----SNHS----------------------------- 184
           Y E+  A  SAS ++ +D ++      SN++                             
Sbjct: 158 YVERTRALYSASLDKFKDRMLQRADAGSNYAKLMEEFASRKMSNLPTEIFLTDEPDKHER 217

Query: 185 -PITIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKI 244
            P  +    ++ DL+++   + FFN  K L V L  +  +R ++R  ++ +       +I
Sbjct: 218 PPTLVKPDRDLTDLEIVQYGFKFFNTFKGLVVDLIFSFRERDESR-DFFKELKPGEALRI 277

Query: 245 IELELGFMYDFFYTKSFINHSMWGCLLRLTTFSS------------LKNQSYYSK----- 304
           IE ELGF+Y+  YTK+ I H+  G L RL +F S            LK++ ++       
Sbjct: 278 IETELGFLYESMYTKTAILHTGIGTLFRLISFGSLLSSFFVFHRRPLKSEDFHGADVVIT 337

Query: 305 ------------------------------------------------------------ 364
                                                                       
Sbjct: 338 YVLFIVGIALDLASMVIFLLSDWTFAVLRNLKDDPEEKSTSIDSLFNWFLEFRKPRWKKH 397

Query: 365 ---------------FTKSMAA-------------------------------------- 424
                          FT+  +                                       
Sbjct: 398 TCNGNQTHEVLSTGFFTRRWSGTIYGFNFIGFCLKAKVSRIHQKRNCNLLVWDYVVSLFD 457

Query: 425 ------------------------------------------------------------ 468
                                                                       
Sbjct: 458 LVIRRIQMMIGWIKNVNRSIRSVLRQWSKKNPMIRCTVYPLYLVFFAGIPEVFRVLWKYI 517

BLAST of HG10007636 vs. TAIR 10
Match: AT5G45460.1 (unknown protein; BEST Arabidopsis thaliana protein match is: Protein of unknown function (DUF594) (TAIR:AT5G45470.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 122.9 bits (307), Expect = 7.6e-28
Identity = 85/282 (30.14%), Positives = 138/282 (48.94%), Query Frame = 0

Query: 5   RKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLL 64
           RK +  + L + +  +Y  ADW  NF+ G++ +  G     +       + A  APFLLL
Sbjct: 39  RKRTPRRHLIIVIWSSYLLADWSANFAVGLISKNQGKDLKPDDPPQDKKLMALWAPFLLL 98

Query: 65  HLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLALQPTSLNYVAIPVFVAGIIK 124
           HLGG DTITA+++EDN LWLR +  ++ Q +A +Y+ L +L P SL    + VF++G IK
Sbjct: 99  HLGGPDTITAFALEDNALWLRNVFGLVFQAIAGVYVVLQSL-PNSLWVTILLVFISGTIK 158

Query: 125 YDEKIWAFRSASAERLRDFLVV----------------------------------SNHS 184
           Y E+  A  SAS ++ RD ++                                     H 
Sbjct: 159 YLERTTALYSASLDKFRDSMIQGPDPGPNYAKLMEEYKAKKEAKLPTKIILIDEPDKEHR 218

Query: 185 PITIHN--------QEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFN 244
           P  + +        ++E+  L++    Y FFN  K L V L  +  +R Q+   + +  +
Sbjct: 219 PKKLEHPSLASETKRKELTHLEIAQYAYKFFNTFKGLVVNLIFSFRERDQSIEIFQNLED 278

BLAST of HG10007636 vs. TAIR 10
Match: AT5G45470.1 (Protein of unknown function (DUF594) )

HSP 1 Score: 120.9 bits (302), Expect = 2.9e-27
Identity = 84/282 (29.79%), Positives = 142/282 (50.35%), Query Frame = 0

Query: 5   RKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLL 64
           RK +  + L + +  +Y  ADW  NF+ G++ +  G     +       + A  APFLLL
Sbjct: 39  RKRTPRRLLIVLVWSSYLLADWSANFAVGLISKNQGKDLKPDDPPQDKKVMALWAPFLLL 98

Query: 65  HLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLALQPTSLNYVAIPVFVAGIIK 124
           HLGG DTITA+++EDN LWLR +  ++ Q +A +Y+ +++L P SL  V + VFV+G IK
Sbjct: 99  HLGGPDTITAFALEDNALWLRHVFGLVFQAIAGVYVVVMSL-PNSLWVVIVLVFVSGTIK 158

Query: 125 YDEKIWAFRSASAERLRDFLVVS-----NHSPI--------------------------- 184
           Y E+  A  SAS ++ RD ++ +     N++ +                           
Sbjct: 159 YLERTTALYSASLDKFRDSMIQAPDPGPNYAKLMEEYKAKKEARLPTKIVLIDEPDKENR 218

Query: 185 ----------TIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFN 244
                     +   ++++ DL+++   Y FFN  K L V L  +  +R ++   + +  +
Sbjct: 219 PKKLEHPALASKKRKKDLTDLEIVQYAYKFFNTFKGLVVNLIFSFRERDESLEIFENLND 278


HSP 2 Score: 108.6 bits (270), Expect = 1.5e-23
Identity = 75/252 (29.76%), Positives = 128/252 (50.79%), Query Frame = 0

Query: 262 VQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINEVGWSL---------------- 321
           V   P++  L   IF++LK K +   S  E++K++ +    W+L                
Sbjct: 596 VHGEPMTRELWKFIFEELKNKSKYGDSP-ENAKRISLARGEWTLRENLPVDAEREKLVRY 655

Query: 322 --NFDLDETILLWHIATDICYHSSK---IEERAEESSKS------SILVSNFLAYLVVHC 381
               D D+++L+WHIAT++CY   +   I E  +E  K       S ++S+++ YL++  
Sbjct: 656 VTKVDYDQSLLMWHIATELCYQQHEKETIPEGYDEQRKHYSNREFSKIISDYMMYLLILQ 715

Query: 382 PSLFP--SGMSQIRHKATSEHVLQFVEDKKL----------MLKINLELKIE----EAKE 441
           P L    +G+ +IR + T     +F + + +          +  +++E +IE    +   
Sbjct: 716 PGLMSEVAGIGKIRFRDTLAETHKFFQRRHIENDRSVETATLNILDVESEIEPMGVKGDR 775

Query: 442 SKSMLFDACRVAR---QLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNL 468
           SKS+LFDA R+A+   ++EK     KWEI+  VWVELL   +C C+   H + L++GG L
Sbjct: 776 SKSVLFDASRLAKDLAEMEKTHNKDKWEILSKVWVELLCYAACHCDSTAHVEQLSRGGEL 835

BLAST of HG10007636 vs. TAIR 10
Match: AT5G45480.1 (Protein of unknown function (DUF594) )

HSP 1 Score: 115.9 bits (289), Expect = 9.3e-26
Identity = 87/276 (31.52%), Positives = 131/276 (47.46%), Query Frame = 0

Query: 4   KRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLL 63
           +RK SS + L  F+   Y  ADW  NF+ G + +  G          +  + AF  PFLL
Sbjct: 38  QRKRSSRKVLLSFIWSAYLLADWSANFAAGQISDSQGDDPEPGEPKKSAELFAFWVPFLL 97

Query: 64  LHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLALQPTSLNYVAIPVFVAGII 123
           LHLGG DTITA ++EDNELWLR LL +  Q VA++Y+ L +L P +L    + VF  G+I
Sbjct: 98  LHLGGPDTITALALEDNELWLRHLLGLFFQSVATVYVLLQSL-PNALWKPILLVFATGVI 157

Query: 124 KYDEKIWAFRSASAERLRDFLVV-----SNHSPI-------------------------- 183
           KY E+  A   AS ++ +D ++       N++ +                          
Sbjct: 158 KYVERTLALYLASLDKFKDSMIQRPDPGPNYAKLMEEYAAKKDMKMPTQIIKVGEPEKDP 217

Query: 184 ----TIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFK 243
                +   +    L +L   Y +FN  K L V L  T   R +++  ++D   ++   +
Sbjct: 218 RDDAPVKPPDGFTPLNILQYAYKYFNIFKGLVVDLIFTFQQRAESKR-FFDSLKAEEALR 277

Query: 244 IIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL 245
           I+E+EL F+Y   YTK+ I H+  G L R      L
Sbjct: 278 ILEVELNFIYAALYTKAEILHNWIGFLFRFIALGCL 311


HSP 2 Score: 97.1 bits (240), Expect = 4.5e-20
Identity = 59/213 (27.70%), Positives = 114/213 (53.52%), Query Frame = 0

Query: 283 LEGSSSTLEDSKKVIINEVGWSLNFDLDETILLWHIATDICYHSSKIEERAEESSKSSIL 342
           ++G   T +  +K++     + +  D D+++L+WHIAT++ Y + K  +    + + S +
Sbjct: 649 IQGDPETEKKREKLL----RYVMEMDYDQSLLVWHIATELLYQTKKGTKANHSAREFSKI 708

Query: 343 VSNFLAYLVVHCPSLFPS--GMSQIRHKATSEHVLQFVEDKKLM---------------- 402
           +S+++ YL++  P+L  +  G+ +IR + T E   +F + + +M                
Sbjct: 709 LSDYMMYLLMMQPTLMSAVVGIGKIRFRDTCEEAQRFFDRRHIMGISAKKAPDAKEASVA 768

Query: 403 -LKINLELKIE----EAKESKSMLFDACRVARQLE-----KVEGLKKWEIIGNVWVELLA 462
            L + +  K E    +   SKS+LFD   +A++L+     K +  + W+I+  VWVELL+
Sbjct: 769 ILSVAVPAKAEPIDVKGDRSKSVLFDGAMLAKELKGLRKNKEDDSEMWKIMSQVWVELLS 828

Query: 463 RISCECEWYDHAKMLTQGGNLLTRVWILMHHLG 468
             + +C   +HA  L++GG L++ VW+LM H G
Sbjct: 829 YAATKCGAIEHAAQLSKGGELISFVWLLMAHFG 857
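
The HSP summary lines above follow a fixed pattern (bit score, raw score, E-value, identity counts), so they can be read programmatically. The following is a minimal sketch, assuming the two-line layout shown on this page; `parse_hsp` and the regular expressions are hypothetical helpers written for illustration and are not part of the database or of any BLAST tool. It also recomputes the identity percentage from the counts as a consistency check.

```python
import re

# Hypothetical patterns for the HSP summary layout shown on this page.
SCORE_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)"
)
IDENT_RE = re.compile(
    r"Identity\s*=\s*(?P<num>\d+)/(?P<den>\d+)\s*\((?P<pct>[\d.]+)%\)"
)


def parse_hsp(score_line, identity_line):
    """Parse one HSP summary (score line + identity line) into a dict."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP summary")
    identical, aligned = int(i["num"]), int(i["den"])
    return {
        "bits": float(s["bits"]),
        "raw_score": int(s["raw"]),
        "evalue": float(s["evalue"]),
        "identical": identical,
        "aligned": aligned,
        "reported_pct": float(i["pct"]),
        # Recompute the percentage from the counts as a sanity check.
        "computed_pct": round(100 * identical / aligned, 2),
    }


# Values copied from the AT5G45530.1 hit (HSP 1) on this page.
hsp = parse_hsp(
    "HSP 1 Score: 127.9 bits (320), Expect = 2.4e-29",
    "Identity = 158/743 (21.27%)",
)
print(hsp["computed_pct"], hsp["reported_pct"])
```

For the AT5G45530.1 hit, the recomputed identity agrees with the reported 21.27%.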

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
KAA0037446.1    5.8e-179  59.23         uncharacterized protein E6C27_scaffold277G00320 [Cucumis melo var. makuwa] >TYK0...
XP_008458716.1  3.7e-178  58.91         PREDICTED: uncharacterized protein LOC103498043 [Cucumis melo]
XP_004139148.1  4.6e-176  57.69         uncharacterized protein LOC101222078 [Cucumis sativus] >KGN66604.1 hypothetical ...
XP_038880416.1  1.3e-149  58.30         uncharacterized protein LOC120072067 [Benincasa hispida]
XP_022141971.1  3.6e-104  39.97         uncharacterized protein LOC111012216 [Momordica charantia]

Match Name      E-value   Identity (%)  Description
A0A5D3BS41      2.8e-179  59.23         DUF4220 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A1S3C8L7      1.8e-178  58.91         uncharacterized protein LOC103498043 OS=Cucumis melo OX=3656 GN=LOC103498043 PE=...
A0A0A0LZZ2      2.2e-176  57.69         DUF4220 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G638510 PE=...
A0A6J1CKT2      1.7e-104  39.97         uncharacterized protein LOC111012216 OS=Momordica charantia OX=3673 GN=LOC111012...
A0A5B7BVN3      4.2e-74   31.12         DUF4220 domain-containing protein OS=Davidia involucrata OX=16924 GN=Din_042423 ...

Match Name      E-value   Identity (%)  Description
AT5G45540.1     2.5e-31   22.03         Protein of unknown function (DUF594)
AT5G45530.1     2.4e-29   21.27         Protein of unknown function (DUF594)
AT5G45460.1     7.6e-28   30.14         unknown protein; BEST Arabidopsis thaliana protein match is: Protein of unknown ...
AT5G45470.1     2.9e-27   29.79         Protein of unknown function (DUF594)
AT5G45480.1     9.3e-26   31.52         Protein of unknown function (DUF594)

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                     Source   Source Term      Source Description            Alignment
IPR007658  Protein of unknown function DUF594  PFAM     PF04578          DUF594                        coord: 419..467; e-value: 1.1E-16; score: 60.1
IPR025315  Domain of unknown function DUF4220  PFAM     PF13968          DUF4220                       coord: 20..243; e-value: 2.3E-43; score: 148.9
None       No IPR available                    PANTHER  PTHR31325:SF207  OS08G0149333 PROTEIN          coord: 262..467
None       No IPR available                    PANTHER  PTHR31325:SF207  OS08G0149333 PROTEIN          coord: 3..243
None       No IPR available                    PANTHER  PTHR31325        OS01G0798800 PROTEIN-RELATED  coord: 3..243
None       No IPR available                    PANTHER  PTHR31325        OS01G0798800 PROTEIN-RELATED  coord: 262..467

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10007636.1  HG10007636.1  mRNA