HG10008672 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10008672
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Protein of unknown function (DUF1645)
Location: Chr10: 25102372 .. 25103304 (-)
RNA-Seq Expression: HG10008672
Synteny: HG10008672
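
The Location field can be turned into a feature length with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, as in GFF3; that convention is an assumption, not something this record states. The helper names `parse_location` and `feature_length` are hypothetical.

```python
import re

# Pattern for strings shaped like "Chr10: 25102372 .. 25103304 (-)".
LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text):
    """Split a location string into (chromosome, start, end, strand)."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def feature_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


chrom, start, end, strand = parse_location("Chr10: 25102372 .. 25103304 (-)")
length = feature_length(start, end)
print(chrom, strand, length)  # Chr10 - 933
```

Under that assumption the gene spans 933 bp, which is 311 codons: enough for a 310-residue protein plus a stop codon. That agrees with the alignment length of 310 in the top BLAST hit, which has no gaps in the query.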
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGACCGAGGAAGGTCCCGCTCATCTCTCGCTGTCTCCGACTTTTAGCAGCTATTCTTCAGGTAGCAGTAGCCTTGCTGAAATTGCGGCCAGAGTTGTTAGGGAAGTCGGTGAAGAGCCGTTTGCAGATCCTGAGAATTACGGCTGGGAGTCTCAAGGTTCGGTTTATCGTTTCCGAGAAAATTTGTCCGGTGGCCATGCGAGGGAATCATCGGAAGGAGTTAGAAGTAATGACTGTGGAAAAAATGGTGATGAGGATGAATTTGAGTTTGCTGTCCTCTGCAGAGAACCAGATGCATCTACGAGCTCTGCTCATGAAATCTTTTACAATGGCCAGATTAAGCCGGTTTATCCGGTATTCAACATGGACCTACTGCTAGACAACGGCTCGCGAGTAGATATCGGATTGGAGAATTTGAAGAAGAAGCCGGCTGTGCGTCGTTCGCCGTTGAGGAAATTGATGAACGAGGAGCGTAAAACGACGCCGTTCTCCTCATCTGGAGCGGACGATCCAGGAAGCGTTCCATCGGATACGTATTGCGTCTGGTCTCCAAGCACTGAAAAAACATCGCCTGGAAGGAGTAATAAAAGAAATTCTACAGGATCGTCGAACAGGTGGAAATTTAGGGATCTTCTGTACAACAGTCGAAGTAAAAGTGAAGGAGAGGATGAACTCATGAAGAGGAAGACTATTAAGAAGGACGATAAACCTGGAAATGTATCGAAGGGAAGGGAAGATTGTAGATCAGGCGTAGGCGTAAGCGTAACATCTTCTACTAGTTCCAATTCTGGTTTCTTCACATCCTTCACTGCACAGAACGCTCAGTATGGGAGGAACAGAACGGTGAAAGACGCGGAAAAGAGAAGAACCTACTTGCCATACAGACAACATTTGGTGGGATGTGTGGCTGATGCTAAAGCAACGATCTAG

mRNA sequence

ATGAAGACCGAGGAAGGTCCCGCTCATCTCTCGCTGTCTCCGACTTTTAGCAGCTATTCTTCAGGTAGCAGTAGCCTTGCTGAAATTGCGGCCAGAGTTGTTAGGGAAGTCGGTGAAGAGCCGTTTGCAGATCCTGAGAATTACGGCTGGGAGTCTCAAGGTTCGGTTTATCGTTTCCGAGAAAATTTGTCCGGTGGCCATGCGAGGGAATCATCGGAAGGAGTTAGAAGTAATGACTGTGGAAAAAATGGTGATGAGGATGAATTTGAGTTTGCTGTCCTCTGCAGAGAACCAGATGCATCTACGAGCTCTGCTCATGAAATCTTTTACAATGGCCAGATTAAGCCGGTTTATCCGGTATTCAACATGGACCTACTGCTAGACAACGGCTCGCGAGTAGATATCGGATTGGAGAATTTGAAGAAGAAGCCGGCTGTGCGTCGTTCGCCGTTGAGGAAATTGATGAACGAGGAGCGTAAAACGACGCCGTTCTCCTCATCTGGAGCGGACGATCCAGGAAGCGTTCCATCGGATACGTATTGCGTCTGGTCTCCAAGCACTGAAAAAACATCGCCTGGAAGGAGTAATAAAAGAAATTCTACAGGATCGTCGAACAGGTGGAAATTTAGGGATCTTCTGTACAACAGTCGAAGTAAAAGTGAAGGAGAGGATGAACTCATGAAGAGGAAGACTATTAAGAAGGACGATAAACCTGGAAATGTATCGAAGGGAAGGGAAGATTGTAGATCAGGCGTAGGCGTAAGCGTAACATCTTCTACTAGTTCCAATTCTGGTTTCTTCACATCCTTCACTGCACAGAACGCTCAGTATGGGAGGAACAGAACGGTGAAAGACGCGGAAAAGAGAAGAACCTACTTGCCATACAGACAACATTTGGTGGGATGTGTGGCTGATGCTAAAGCAACGATCTAG

Coding sequence (CDS)

ATGAAGACCGAGGAAGGTCCCGCTCATCTCTCGCTGTCTCCGACTTTTAGCAGCTATTCTTCAGGTAGCAGTAGCCTTGCTGAAATTGCGGCCAGAGTTGTTAGGGAAGTCGGTGAAGAGCCGTTTGCAGATCCTGAGAATTACGGCTGGGAGTCTCAAGGTTCGGTTTATCGTTTCCGAGAAAATTTGTCCGGTGGCCATGCGAGGGAATCATCGGAAGGAGTTAGAAGTAATGACTGTGGAAAAAATGGTGATGAGGATGAATTTGAGTTTGCTGTCCTCTGCAGAGAACCAGATGCATCTACGAGCTCTGCTCATGAAATCTTTTACAATGGCCAGATTAAGCCGGTTTATCCGGTATTCAACATGGACCTACTGCTAGACAACGGCTCGCGAGTAGATATCGGATTGGAGAATTTGAAGAAGAAGCCGGCTGTGCGTCGTTCGCCGTTGAGGAAATTGATGAACGAGGAGCGTAAAACGACGCCGTTCTCCTCATCTGGAGCGGACGATCCAGGAAGCGTTCCATCGGATACGTATTGCGTCTGGTCTCCAAGCACTGAAAAAACATCGCCTGGAAGGAGTAATAAAAGAAATTCTACAGGATCGTCGAACAGGTGGAAATTTAGGGATCTTCTGTACAACAGTCGAAGTAAAAGTGAAGGAGAGGATGAACTCATGAAGAGGAAGACTATTAAGAAGGACGATAAACCTGGAAATGTATCGAAGGGAAGGGAAGATTGTAGATCAGGCGTAGGCGTAAGCGTAACATCTTCTACTAGTTCCAATTCTGGTTTCTTCACATCCTTCACTGCACAGAACGCTCAGTATGGGAGGAACAGAACGGTGAAAGACGCGGAAAAGAGAAGAACCTACTTGCCATACAGACAACATTTGGTGGGATGTGTGGCTGATGCTAAAGCAACGATCTAG

Protein sequence

MKTEEGPAHLSLSPTFSSYSSGSSSLAEIAARVVREVGEEPFADPENYGWESQGSVYRFRENLSGGHARESSEGVRSNDCGKNGDEDEFEFAVLCREPDASTSSAHEIFYNGQIKPVYPVFNMDLLLDNGSRVDIGLENLKKKPAVRRSPLRKLMNEERKTTPFSSSGADDPGSVPSDTYCVWSPSTEKTSPGRSNKRNSTGSSNRWKFRDLLYNSRSKSEGEDELMKRKTIKKDDKPGNVSKGREDCRSGVGVSVTSSTSSNSGFFTSFTAQNAQYGRNRTVKDAEKRRTYLPYRQHLVGCVADAKATI
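
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch that assumes the standard nuclear genetic code; `translate` and `CODON_TABLE` are illustrative names, not part of any tool used to build this record.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven and last five codons of the CDS listed above.
print(translate("ATGAAGACCGAGGAAGGTCCC"))  # MKTEEGP
print(translate("AAAGCAACGATCTAG"))        # KATI
```

The two fragments reproduce the start (MKTEEGP) and end (KATI) of the protein sequence shown above.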
Homology
BLAST of HG10008672 vs. NCBI nr
Match: XP_038878608.1 (uncharacterized protein LOC120070794 [Benincasa hispida])

HSP 1 Score: 549.3 bits (1414), Expect = 2.2e-152
Identity = 282/310 (90.97%), Positives = 296/310 (95.48%), Query Frame = 0

Query: 1   MKTEEGPAHLSLSPTFSSYSSGSSSLAEIAARVVREVGEEPFADPENYGWESQGSVYRFR 60
           M+ EEGPAHLSLSPTFSSYSSGS SLAEIAARVVREVGEEPFAD +NYGWE++GSVYRFR
Sbjct: 1   MEAEEGPAHLSLSPTFSSYSSGSCSLAEIAARVVREVGEEPFADADNYGWEAEGSVYRFR 60

Query: 61  ENLSGGHARESSEGVRSNDCGKNGDEDEFEFAVLCREPDASTSSAHEIFYNGQIKPVYPV 120
           ENLS G ARES+EGVRSND GKNGDEDEFEFAVL REPD STSSAHEIFYNGQIKPVYP+
Sbjct: 61  ENLSSGRARESAEGVRSNDGGKNGDEDEFEFAVL-REPDVSTSSAHEIFYNGQIKPVYPL 120

Query: 121 FNMDLLLDNGSRVDIGLENLKKKPAVRRSPLRKLMNEERKTTPFSSSGADDPGSVPSDTY 180
           FNMDLLLDNGSRVD GLEN+KKKPAVRR PLRKLMNEERKTTPFSSSGADD G VPS+TY
Sbjct: 121 FNMDLLLDNGSRVDSGLENVKKKPAVRRLPLRKLMNEERKTTPFSSSGADDLGGVPSETY 180

Query: 181 CVWSPSTEKTSPGRSNKRNSTGSSNRWKFRDLLYNSRSKSEGEDELMKRKTIKKDDKPGN 240
           C+WSPSTEKTSPGRSNKRNSTGSSNRWKFRDLLYNSRS+SEGEDELMKRKTIKKDDK GN
Sbjct: 181 CIWSPSTEKTSPGRSNKRNSTGSSNRWKFRDLLYNSRSRSEGEDELMKRKTIKKDDKLGN 240

Query: 241 VSKGREDCRSGVGVSVTSSTSSNSGFFTSFTAQNAQYGRNRTVKDAEKRRTYLPYRQHLV 300
           VSKG+EDCRSG+GVSVTSST+SNSGFFTSFTAQNA YGRNRTVK+AEKRRTYLPYRQHLV
Sbjct: 241 VSKGKEDCRSGLGVSVTSSTNSNSGFFTSFTAQNAPYGRNRTVKEAEKRRTYLPYRQHLV 300

Query: 301 GCVADAKATI 311
           GCVADAKATI
Sbjct: 301 GCVADAKATI 309
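
The Identity and Positives percentages in each HSP header are simple ratios over the alignment length. The sketch below recomputes them and counts identities directly from a pair of aligned rows; the function names are hypothetical, and gaps are assumed to be written as `-`.

```python
def percent(matches, alignment_length):
    """Percentage rounded to two decimals, as in the HSP headers."""
    return round(100 * matches / alignment_length, 2)


def count_identities(query_row, subject_row):
    """Return (identical positions, alignment length) for two aligned rows."""
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have the same length")
    identical = sum(
        1 for q, s in zip(query_row, subject_row) if q == s and q != "-"
    )
    return identical, len(query_row)


# Header values of the first hit (XP_038878608.1).
print(percent(282, 310))  # 90.97
print(percent(296, 310))  # 95.48

# A short stretch of the second alignment row, which contains a subject gap.
print(count_identities("VLCREPDA", "VL-REPDV"))  # (6, 8)
```

Positives additionally count conservative substitutions (the `+` marks on the midline), which is why that figure is never lower than Identity.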

BLAST of HG10008672 vs. NCBI nr
Match: XP_008464565.1 (PREDICTED: uncharacterized protein LOC103502406 [Cucumis melo] >KAA0057852.1 uncharacterized protein E6C27_scaffold274G001390 [Cucumis melo var. makuwa] >TYJ98535.1 uncharacterized protein E5676_scaffold350G001410 [Cucumis melo var. makuwa])

HSP 1 Score: 441.8 bits (1135), Expect = 4.8e-120
Identity = 245/313 (78.27%), Positives = 264/313 (84.35%), Query Frame = 0

Query: 1   MKTEEGPAHLSLSPTFSSYSSGSSSLAEIAARVVREVGEEPFADPENYGWESQGSVYRFR 60
           M+ +EGPAHLSLSPTFSSYSSGS SLAEIAARVVREVGEEPFAD +NYGWE+QGSVYRFR
Sbjct: 1   MEPDEGPAHLSLSPTFSSYSSGSCSLAEIAARVVREVGEEPFADADNYGWEAQGSVYRFR 60

Query: 61  ENLSGGHARESSEGVRSNDCGKNGDED-EFEFAVLCREPDASTSSAHEIFYNGQIKPVYP 120
           EN+S G A   SEGVRSND GKN D+D EFEFAVL REP +STSSA+EIFYNGQIKPVYP
Sbjct: 61  ENVSTGSA---SEGVRSNDGGKNCDDDEEFEFAVL-REPGSSTSSANEIFYNGQIKPVYP 120

Query: 121 VFNMDLLLDNGSRVDIGLENLKKKPAVRRSPLRKLMNEERKTTPFSSSGADDPGSVPSDT 180
           VFNMDLLLDNGS VD GLENLKKKPAVRR PLRKLMNEERK T FSSSG DD G VP D+
Sbjct: 121 VFNMDLLLDNGSPVDNGLENLKKKPAVRRLPLRKLMNEERKITSFSSSGVDDLGGVPLDS 180

Query: 181 YCVWSPSTEKTSPGRSNKRNSTGSSNRWKFRDLLY-NSRSKSEGEDELMKRK-TIKKDDK 240
           YCVWSPS EKTSPG+ NKRNST SSNRWKFRDLLY NSRSKSE EDEL+KRK +I K+D+
Sbjct: 181 YCVWSPSPEKTSPGKRNKRNSTASSNRWKFRDLLYNNSRSKSEREDELVKRKSSIMKNDE 240

Query: 241 PGNVSKGREDCRSGVGVSVTSSTSSNSGFFTSFTAQNAQYGRNRTVKDAEKRRTYLPYRQ 300
            GNVSKG+ D R              SGFFTSF+AQNAQYGRNR+VK+ EKRR+YLPYRQ
Sbjct: 241 TGNVSKGKVDYR--------------SGFFTSFSAQNAQYGRNRSVKEPEKRRSYLPYRQ 295

Query: 301 HLVGCVADAKATI 311
           HLVGCVADAK  I
Sbjct: 301 HLVGCVADAKGRI 295

BLAST of HG10008672 vs. NCBI nr
Match: XP_031739680.1 (uncharacterized protein LOC105435813 [Cucumis sativus] >KGN63579.1 hypothetical protein Csa_014104 [Cucumis sativus])

HSP 1 Score: 425.6 bits (1093), Expect = 3.6e-115
Identity = 238/313 (76.04%), Positives = 256/313 (81.79%), Query Frame = 0

Query: 1   MKTEEGPAHLSLSPTFSSYSSGSSSLAEIAARVVREVGEEPFADPENYGWESQGSVYRFR 60
           M+  EGPA LS+SPTFSSYSSGS SLAEIAARVVREVGEEPFAD +NYGWE+QGSVYRFR
Sbjct: 1   MEPGEGPARLSMSPTFSSYSSGSCSLAEIAARVVREVGEEPFADADNYGWEAQGSVYRFR 60

Query: 61  ENLSGGHARESSEGVRSNDCGKNGDED-EFEFAVLCREPDASTSSAHEIFYNGQIKPVYP 120
           ENLS G     SEGVRSND GKNGD+D EFEFAVL REPDA TSSAHEIFYNGQIKPVYP
Sbjct: 61  ENLSNGSV---SEGVRSNDGGKNGDDDEEFEFAVL-REPDAPTSSAHEIFYNGQIKPVYP 120

Query: 121 VFNMDLLLDNGSRVDIGLENLKKKPAVRRSPLRKLMNEERKTTPFSSSGADDPGSVPSDT 180
           VFNMDLLLDNGS VD GLE LKKKPAVRR PLRKLMNEERK T FSSSGADD G VP DT
Sbjct: 121 VFNMDLLLDNGSPVDNGLEKLKKKPAVRRLPLRKLMNEERKLTSFSSSGADDLGGVPLDT 180

Query: 181 YCVWSPSTEKTSPGRSNKRNSTGSSNRWKFRDLLY-NSRSKSEGEDELMKRK-TIKKDDK 240
           YCVWSPS EK S G+ NK  ST SSNRWKFRDLLY NSRSKSE ED+L KRK +I K+++
Sbjct: 181 YCVWSPSPEKKSTGKRNKTISTASSNRWKFRDLLYNNSRSKSEREDKLTKRKSSIMKNNE 240

Query: 241 PGNVSKGREDCRSGVGVSVTSSTSSNSGFFTSFTAQNAQYGRNRTVKDAEKRRTYLPYRQ 300
            GNVSK +ED R              SGFFTSF+AQN+ YGRNR+VK+ EKRR+YLPYR+
Sbjct: 241 TGNVSKEKEDYR--------------SGFFTSFSAQNSHYGRNRSVKEPEKRRSYLPYRE 295

Query: 301 HLVGCVADAKATI 311
           HLVGCVADAK  I
Sbjct: 301 HLVGCVADAKGRI 295

BLAST of HG10008672 vs. NCBI nr
Match: KAG6587470.1 (Serine/threonine-protein kinase ATM, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 403.7 bits (1036), Expect = 1.5e-108
Identity = 230/301 (76.41%), Positives = 252/301 (83.72%), Query Frame = 0

Query: 1    MKTEEGPAHLSLSPTFSSYSSGSSSLAEIAARVVREVGEEPFADPENYGWESQGSVYRFR 60
            M+ E+GP +LSLSPTFSSYSSGSSSLAEIAARVVREVGEEPF D +NYGWE+QGSVYRFR
Sbjct: 2996 MEAEKGPTYLSLSPTFSSYSSGSSSLAEIAARVVREVGEEPFVDDDNYGWEAQGSVYRFR 3055

Query: 61   ENLSGGHARESSEGVRSNDCGKN-GDEDEFEFAVLCREPDASTSSAHEIFYNGQIKPVYP 120
            ENLS GHARES+EGV  +DCG+N GD+DEFEFAVLCREPDASTSSAHEIFYNGQIKPVYP
Sbjct: 3056 ENLSIGHARESAEGVTIDDCGENGGDDDEFEFAVLCREPDASTSSAHEIFYNGQIKPVYP 3115

Query: 121  VFNMDLLLDNGSRVDIGLENLKKKPAVRRSPLRKLMN-EERKTTPFSSSGADDPGSVPSD 180
            VFN +LLLDN SRVD G EN K KPAVRRSPLRKLMN EER T   SSS AD+ G V S+
Sbjct: 3116 VFNRNLLLDNASRVDTGSENPKTKPAVRRSPLRKLMNEEERNTMSLSSSEADNLGGVSSE 3175

Query: 181  TYCVWSPSTEKTSPGRSNKRNSTGSSNRWKFRDLLYN-SRSKSEGEDELMKRKT-IKKDD 240
            TYCVWSP+TEKTSPGRSNKR STGSSNRWK RDLLYN SR  +EGED +M RKT IKK+D
Sbjct: 3176 TYCVWSPNTEKTSPGRSNKRISTGSSNRWKLRDLLYNYSRRHNEGEDAVMMRKTSIKKND 3235

Query: 241  KPGNVSKGREDCRSGVGVSVTSSTSSNSGFFTSFTAQNAQYGRNRTVKDAEKRRTY-LPY 297
            +PGNVSK  E   SG G+SVTSSTSSNSG  TSF +  A+Y   R VK+A KRR+Y  PY
Sbjct: 3236 QPGNVSKVNEG--SGSGISVTSSTSSNSG-STSFPSHTARY---RAVKEAGKRRSYFFPY 3290

BLAST of HG10008672 vs. NCBI nr
Match: KAG7021458.1 (hypothetical protein SDJN02_15183, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 399.1 bits (1024), Expect = 3.6e-107
Identity = 228/301 (75.75%), Positives = 251/301 (83.39%), Query Frame = 0

Query: 1   MKTEEGPAHLSLSPTFSSYSSGSSSLAEIAARVVREVGEEPFADPENYGWESQGSVYRFR 60
           M+ E+GP +LSLSPTFSSYSSGSSSLAEIAARVVREVGEEPF D +NYGWE+QGSVYRFR
Sbjct: 1   MEAEKGPTYLSLSPTFSSYSSGSSSLAEIAARVVREVGEEPFVDDDNYGWEAQGSVYRFR 60

Query: 61  ENLSGGHARESSEGVRSNDCGKN-GDEDEFEFAVLCREPDASTSSAHEIFYNGQIKPVYP 120
           ENLS GHARES+ GV  +DCG+N GD+DEFEFAVLCREPDASTSSAHEIFYNGQIKPVYP
Sbjct: 61  ENLSIGHARESAGGVTIDDCGENGGDDDEFEFAVLCREPDASTSSAHEIFYNGQIKPVYP 120

Query: 121 VFNMDLLLDNGSRVDIGLENLKKKPAVRRSPLRKLMN-EERKTTPFSSSGADDPGSVPSD 180
           VFN +LLLDN SRVD G EN K KPAVRRSPLRKLMN EER T   SSS AD+ G V S+
Sbjct: 121 VFNRNLLLDNASRVDTGSENPKTKPAVRRSPLRKLMNEEERNTMSLSSSEADNLGGVSSE 180

Query: 181 TYCVWSPSTEKTSPGRSNKRNSTGSSNRWKFRDLLYN-SRSKSEGEDELMKRKT-IKKDD 240
           TYCVWSP+TEKTSPGRSNKR STGSSNRWK RDLLYN SR  +EGE+ +M RKT IKK+D
Sbjct: 181 TYCVWSPNTEKTSPGRSNKRISTGSSNRWKLRDLLYNYSRRHNEGENAVMMRKTSIKKND 240

Query: 241 KPGNVSKGREDCRSGVGVSVTSSTSSNSGFFTSFTAQNAQYGRNRTVKDAEKRRTY-LPY 297
           +PGNVSK  E   SG G+SVTSSTSSNSG  TSF +  A+Y   R VK+A KRR+Y  PY
Sbjct: 241 QPGNVSKVNEG--SGSGISVTSSTSSNSG-STSFPSHTARY---RAVKEAGKRRSYFFPY 295

BLAST of HG10008672 vs. ExPASy TrEMBL
Match: A0A5A7UPR1 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold350G001410 PE=4 SV=1)

HSP 1 Score: 441.8 bits (1135), Expect = 2.3e-120
Identity = 245/313 (78.27%), Positives = 264/313 (84.35%), Query Frame = 0

Query: 1   MKTEEGPAHLSLSPTFSSYSSGSSSLAEIAARVVREVGEEPFADPENYGWESQGSVYRFR 60
           M+ +EGPAHLSLSPTFSSYSSGS SLAEIAARVVREVGEEPFAD +NYGWE+QGSVYRFR
Sbjct: 1   MEPDEGPAHLSLSPTFSSYSSGSCSLAEIAARVVREVGEEPFADADNYGWEAQGSVYRFR 60

Query: 61  ENLSGGHARESSEGVRSNDCGKNGDED-EFEFAVLCREPDASTSSAHEIFYNGQIKPVYP 120
           EN+S G A   SEGVRSND GKN D+D EFEFAVL REP +STSSA+EIFYNGQIKPVYP
Sbjct: 61  ENVSTGSA---SEGVRSNDGGKNCDDDEEFEFAVL-REPGSSTSSANEIFYNGQIKPVYP 120

Query: 121 VFNMDLLLDNGSRVDIGLENLKKKPAVRRSPLRKLMNEERKTTPFSSSGADDPGSVPSDT 180
           VFNMDLLLDNGS VD GLENLKKKPAVRR PLRKLMNEERK T FSSSG DD G VP D+
Sbjct: 121 VFNMDLLLDNGSPVDNGLENLKKKPAVRRLPLRKLMNEERKITSFSSSGVDDLGGVPLDS 180

Query: 181 YCVWSPSTEKTSPGRSNKRNSTGSSNRWKFRDLLY-NSRSKSEGEDELMKRK-TIKKDDK 240
           YCVWSPS EKTSPG+ NKRNST SSNRWKFRDLLY NSRSKSE EDEL+KRK +I K+D+
Sbjct: 181 YCVWSPSPEKTSPGKRNKRNSTASSNRWKFRDLLYNNSRSKSEREDELVKRKSSIMKNDE 240

Query: 241 PGNVSKGREDCRSGVGVSVTSSTSSNSGFFTSFTAQNAQYGRNRTVKDAEKRRTYLPYRQ 300
            GNVSKG+ D R              SGFFTSF+AQNAQYGRNR+VK+ EKRR+YLPYRQ
Sbjct: 241 TGNVSKGKVDYR--------------SGFFTSFSAQNAQYGRNRSVKEPEKRRSYLPYRQ 295

Query: 301 HLVGCVADAKATI 311
           HLVGCVADAK  I
Sbjct: 301 HLVGCVADAKGRI 295

BLAST of HG10008672 vs. ExPASy TrEMBL
Match: A0A1S3CM99 (uncharacterized protein LOC103502406 OS=Cucumis melo OX=3656 GN=LOC103502406 PE=4 SV=1)

HSP 1 Score: 441.8 bits (1135), Expect = 2.3e-120
Identity = 245/313 (78.27%), Positives = 264/313 (84.35%), Query Frame = 0

Query: 1   MKTEEGPAHLSLSPTFSSYSSGSSSLAEIAARVVREVGEEPFADPENYGWESQGSVYRFR 60
           M+ +EGPAHLSLSPTFSSYSSGS SLAEIAARVVREVGEEPFAD +NYGWE+QGSVYRFR
Sbjct: 1   MEPDEGPAHLSLSPTFSSYSSGSCSLAEIAARVVREVGEEPFADADNYGWEAQGSVYRFR 60

Query: 61  ENLSGGHARESSEGVRSNDCGKNGDED-EFEFAVLCREPDASTSSAHEIFYNGQIKPVYP 120
           EN+S G A   SEGVRSND GKN D+D EFEFAVL REP +STSSA+EIFYNGQIKPVYP
Sbjct: 61  ENVSTGSA---SEGVRSNDGGKNCDDDEEFEFAVL-REPGSSTSSANEIFYNGQIKPVYP 120

Query: 121 VFNMDLLLDNGSRVDIGLENLKKKPAVRRSPLRKLMNEERKTTPFSSSGADDPGSVPSDT 180
           VFNMDLLLDNGS VD GLENLKKKPAVRR PLRKLMNEERK T FSSSG DD G VP D+
Sbjct: 121 VFNMDLLLDNGSPVDNGLENLKKKPAVRRLPLRKLMNEERKITSFSSSGVDDLGGVPLDS 180

Query: 181 YCVWSPSTEKTSPGRSNKRNSTGSSNRWKFRDLLY-NSRSKSEGEDELMKRK-TIKKDDK 240
           YCVWSPS EKTSPG+ NKRNST SSNRWKFRDLLY NSRSKSE EDEL+KRK +I K+D+
Sbjct: 181 YCVWSPSPEKTSPGKRNKRNSTASSNRWKFRDLLYNNSRSKSEREDELVKRKSSIMKNDE 240

Query: 241 PGNVSKGREDCRSGVGVSVTSSTSSNSGFFTSFTAQNAQYGRNRTVKDAEKRRTYLPYRQ 300
            GNVSKG+ D R              SGFFTSF+AQNAQYGRNR+VK+ EKRR+YLPYRQ
Sbjct: 241 TGNVSKGKVDYR--------------SGFFTSFSAQNAQYGRNRSVKEPEKRRSYLPYRQ 295

Query: 301 HLVGCVADAKATI 311
           HLVGCVADAK  I
Sbjct: 301 HLVGCVADAKGRI 295

BLAST of HG10008672 vs. ExPASy TrEMBL
Match: A0A0A0LRG8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G004920 PE=4 SV=1)

HSP 1 Score: 425.6 bits (1093), Expect = 1.7e-115
Identity = 238/313 (76.04%), Positives = 256/313 (81.79%), Query Frame = 0

Query: 1   MKTEEGPAHLSLSPTFSSYSSGSSSLAEIAARVVREVGEEPFADPENYGWESQGSVYRFR 60
           M+  EGPA LS+SPTFSSYSSGS SLAEIAARVVREVGEEPFAD +NYGWE+QGSVYRFR
Sbjct: 1   MEPGEGPARLSMSPTFSSYSSGSCSLAEIAARVVREVGEEPFADADNYGWEAQGSVYRFR 60

Query: 61  ENLSGGHARESSEGVRSNDCGKNGDED-EFEFAVLCREPDASTSSAHEIFYNGQIKPVYP 120
           ENLS G     SEGVRSND GKNGD+D EFEFAVL REPDA TSSAHEIFYNGQIKPVYP
Sbjct: 61  ENLSNGSV---SEGVRSNDGGKNGDDDEEFEFAVL-REPDAPTSSAHEIFYNGQIKPVYP 120

Query: 121 VFNMDLLLDNGSRVDIGLENLKKKPAVRRSPLRKLMNEERKTTPFSSSGADDPGSVPSDT 180
           VFNMDLLLDNGS VD GLE LKKKPAVRR PLRKLMNEERK T FSSSGADD G VP DT
Sbjct: 121 VFNMDLLLDNGSPVDNGLEKLKKKPAVRRLPLRKLMNEERKLTSFSSSGADDLGGVPLDT 180

Query: 181 YCVWSPSTEKTSPGRSNKRNSTGSSNRWKFRDLLY-NSRSKSEGEDELMKRK-TIKKDDK 240
           YCVWSPS EK S G+ NK  ST SSNRWKFRDLLY NSRSKSE ED+L KRK +I K+++
Sbjct: 181 YCVWSPSPEKKSTGKRNKTISTASSNRWKFRDLLYNNSRSKSEREDKLTKRKSSIMKNNE 240

Query: 241 PGNVSKGREDCRSGVGVSVTSSTSSNSGFFTSFTAQNAQYGRNRTVKDAEKRRTYLPYRQ 300
            GNVSK +ED R              SGFFTSF+AQN+ YGRNR+VK+ EKRR+YLPYR+
Sbjct: 241 TGNVSKEKEDYR--------------SGFFTSFSAQNSHYGRNRSVKEPEKRRSYLPYRE 295

Query: 301 HLVGCVADAKATI 311
           HLVGCVADAK  I
Sbjct: 301 HLVGCVADAKGRI 295

BLAST of HG10008672 vs. ExPASy TrEMBL
Match: A0A6J1F6C3 (uncharacterized protein LOC111441224 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111441224 PE=4 SV=1)

HSP 1 Score: 370.5 bits (950), Expect = 6.6e-99
Identity = 200/249 (80.32%), Positives = 217/249 (87.15%), Query Frame = 0

Query: 1   MKTEEGPAHLSLSPTFSSYSSGSSSLAEIAARVVREVGEEPFADPENYGWESQGSVYRFR 60
           M+ E+GP +LSLSPTFSSYSSGSSSLAEIAARVVREVGEEPF D +NYGWE+QGSVYRFR
Sbjct: 1   MEAEKGPTYLSLSPTFSSYSSGSSSLAEIAARVVREVGEEPFVDDDNYGWEAQGSVYRFR 60

Query: 61  ENLSGGHARESSEGVRSNDCGKN-GDEDEFEFAVLCREPDASTSSAHEIFYNGQIKPVYP 120
           ENLS GHARES+EGV  +DCG+N GD+DEFEFAVLCREPDASTSSAHEIFYNGQIKPVYP
Sbjct: 61  ENLSIGHARESAEGVTIDDCGENGGDDDEFEFAVLCREPDASTSSAHEIFYNGQIKPVYP 120

Query: 121 VFNMDLLLDNGSRVDIGLENLKKKPAVRRSPLRKLMN-EERKTTPFSSSGADDPGSVPSD 180
           VFN +LLLDN SRVD G EN K KPAVRRSPLRKLMN EER T   SSS AD+ G V S+
Sbjct: 121 VFNRNLLLDNASRVDTGSENPKTKPAVRRSPLRKLMNEEERNTMSLSSSEADNLGGVSSE 180

Query: 181 TYCVWSPSTEKTSPGRSNKRNSTGSSNRWKFRDLLYN-SRSKSEGEDELMKRKT-IKKDD 240
           TYCVWSP+TEKTSPGRSNKR STGSSNRWK RDLLYN SR  +EGED +M RKT IKK+D
Sbjct: 181 TYCVWSPNTEKTSPGRSNKRISTGSSNRWKLRDLLYNYSRRHNEGEDAVMMRKTSIKKND 240

Query: 241 KPGNVSKGR 246
           +PGNVSK R
Sbjct: 241 QPGNVSKAR 249

BLAST of HG10008672 vs. ExPASy TrEMBL
Match: A0A6J1I4D5 (uncharacterized protein LOC111470540 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111470540 PE=4 SV=1)

HSP 1 Score: 369.4 bits (947), Expect = 1.5e-98
Identity = 200/250 (80.00%), Positives = 216/250 (86.40%), Query Frame = 0

Query: 1   MKTEEGPAHLSLSPTFSSYSSGSSSLAEIAARVVREVGEEPFADPENYGWESQGSVYRFR 60
           M+ E+GP +LSLSPTFSSYSSGSSSLAEIAARVVREVGEEPF D +NYGWE+QGSVYRFR
Sbjct: 1   MEAEKGPTYLSLSPTFSSYSSGSSSLAEIAARVVREVGEEPFVDDDNYGWEAQGSVYRFR 60

Query: 61  ENLSGGHARESSEGVRSNDCGKN-GDEDEFEFAVLCREPDASTSSAHEIFYNGQIKPVYP 120
           ENLS GHARES+EGV  +DCG+N GD+DEFEFAVLCREPDASTSSAHEIFYNGQIKPVYP
Sbjct: 61  ENLSIGHARESAEGVTIDDCGENGGDDDEFEFAVLCREPDASTSSAHEIFYNGQIKPVYP 120

Query: 121 VFNMDLLLDNGSRVDIGLENLKKKPAVRRSPLRKLMN-EERKTTPFSSSGADDPGSVPSD 180
           VFN +LLLDN SRVD G EN K KPAVRRSPLRKLMN EER TT  SSS  D+ G V S+
Sbjct: 121 VFNTNLLLDNASRVDTGSENPKTKPAVRRSPLRKLMNEEERNTTSLSSSETDNLGGVSSE 180

Query: 181 TYCVWSPSTEKTSPGRSNKRNSTGSSNRWKFRDLLYN-SRSKSEGEDELMKRKT-IKKDD 240
           TYCVWSP+TEKTSPGRSNKR STGSSNRWK RDLLYN SR   EGED +M RKT IKK+D
Sbjct: 181 TYCVWSPNTEKTSPGRSNKRISTGSSNRWKLRDLLYNYSRRHIEGEDAVMMRKTSIKKND 240

Query: 241 KPGNVSKGRE 247
           +PGNVSK  E
Sbjct: 241 RPGNVSKVNE 250

BLAST of HG10008672 vs. TAIR 10
Match: AT5G62770.1 (Protein of unknown function (DUF1645) )

HSP 1 Score: 89.7 bits (221), Expect = 4.4e-18
Identity = 100/317 (31.55%), Positives = 141/317 (44.48%), Query Frame = 0

Query: 1   MKTEEGPAHLSLSPTFSSYSSGSSSLAEIAARVVREVGEEPFADPENYGWESQGSVYRFR 60
           M+T    +  S SP+F S+SS +  LA IAARVV E     F D +              
Sbjct: 1   MQTSRLLSFSSNSPSFGSFSS-AVDLAAIAARVVEE-----FRDHDQ------------T 60

Query: 61  ENLSGGHARESSEGVRSNDCGKNGDEDEFEFAVLCREPDASTSSAHEIFYNGQIKPVYPV 120
           ++ S  H  + ++   + DC  N           C +P    ++A EIF NGQI+P+ P 
Sbjct: 61  QSDSSPHRDDDNDSDFAFDCPSN----------TCSQP---LATADEIFCNGQIRPLNPY 120

Query: 121 FNMDLLLDNGSRVDIGLENLKK----KPAVRRSPLRKLMNEERKTTPFSSSGA-DDPGSV 180
                    G    +  +   K     P  RR  LRKLM+E+R     SSS A +D   V
Sbjct: 121 ---------GGNAPVESQPTSKITTLPPRRRRPALRKLMSEDRDPASNSSSEAEEDLTGV 180

Query: 181 PSDTYCVWSPSTE----------KTSPGRSN-KRNSTGSSNRWKFRDLLYNSRSKSEGED 240
           P +TYCVW P              +SP  S  K +S G S RWK R+LLY  RS SEG D
Sbjct: 181 PPETYCVWKPKQSNSGDDDLQRLSSSPSHSKIKSHSAGFSKRWKLRNLLY-VRSSSEGND 240

Query: 241 ELMKRKTIKKDDKPGNVSKGREDCRSGVGVSVTSSTSSNSGFFTSFTAQNAQYGRNRTVK 300
           +L+    +KK+D+  +  +  E+  S V                       + GR R   
Sbjct: 241 KLVFPAPVKKNDETVSDQREEEEPPSKV--------------------DGEEEGRER--- 253

Query: 301 DAEKRRTYLPYRQHLVG 302
           +  KR+TY+PYR+ ++G
Sbjct: 301 EETKRQTYVPYRKDMIG 253

BLAST of HG10008672 vs. TAIR 10
Match: AT1G23710.1 (Protein of unknown function (DUF1645) )

HSP 1 Score: 85.9 bits (211), Expect = 6.3e-17
Identity = 81/249 (32.53%), Positives = 119/249 (47.79%), Query Frame = 0

Query: 69  RESSEGVRSNDCGKNGDEDEFEFAVLCREPDASTSSAHEIFYNGQIKPVYPVFNMDLLL- 128
           R  S+   S +  ++ +E+E EF+  C   + S  +A E F +GQI+PV+P+FN DLL  
Sbjct: 43  RSWSKLEESVEFNEDDEEEEEEFSFACVNGEGSPITADEAFEDGQIRPVFPLFNRDLLFE 102

Query: 129 -----DNGSRVDIGLENLKKKPAVRRSPLRKLMNEERKTTPFSSSGADDPGS--VPSDTY 188
                D    V +  EN        R  LRKL  E+R     +  G +  GS   P   Y
Sbjct: 103 YENEDDKNDNVSVTDEN--------RPRLRKLFVEDRNG---NGDGEETEGSEKEPLGPY 162

Query: 189 CVWSPST-EKTSPGRSNKRNSTGSSNRWKFRDLLYNSRSKSEGEDELM-----KRKTIKK 248
           C W+  T  + SP    K NSTG S  W+FRDL+   RS S+G D  +       KT  +
Sbjct: 163 CSWTGGTVAEASPETCRKSNSTGFSKLWRFRDLVL--RSNSDGRDAFVFLNNSNDKTRTR 222

Query: 249 DDKPGNVSKGREDCRSGV-----GVSVTSSTS-SNSGFFTSFTAQNAQYGRNRTVKDAEK 298
                + +   E+ +  +     G   TS++S +     T+ +A    Y RNR +K+  K
Sbjct: 223 SSSSSSSTAAEENDKKVITEKKKGKEKTSTSSETKKKTTTTKSAHEKLYMRNRAMKEEVK 278

BLAST of HG10008672 vs. TAIR 10
Match: AT1G70420.1 (Protein of unknown function (DUF1645) )

HSP 1 Score: 75.1 bits (183), Expect = 1.1e-13
Identity = 73/263 (27.76%), Positives = 110/263 (41.83%), Query Frame = 0

Query: 35  REVGEEPFADPENYGWESQGSVYRFRENLSGGHARESSEGVRSNDCGKNGDEDEFEFAVL 94
           R  G+E F++      +  GS  +           +S       D    G E++F FA +
Sbjct: 12  RFAGKERFSEVYGELNDDFGSKLKISSKEEENRDDDSWWNQNGTDKDDEGREEDFSFASV 71

Query: 95  CREPDASTSSAHEIFYNGQIKPVYPVFNMDLLLDNGSRVDIGLENLKKKPAVRRSPLRKL 154
               D S  +A E F +GQI+PVYP+FN ++  D+     +            RSPL+KL
Sbjct: 72  --NADNSPITADEAFEDGQIRPVYPLFNRNIFFDDPEEKTL------------RSPLKKL 131

Query: 155 MNEERKTTPFSSSGADDPGSVPSDTYCVWSPST-EKTSPGRSNKRNSTGSSNRWKFRDLL 214
              E  TT      +D  G      YC W+  T E+ SP    K NSTG S  W+FRDL+
Sbjct: 132 F-VESTTTEEEEEESDTVG-----PYCSWTNRTVEQASPETCRKSNSTGFSKLWRFRDLV 191

Query: 215 YNSRSKSEGEDELMKRKTIKKDDKPGNVSKGREDCRSGVGVSVTSSTSSNSGFFTSFTAQ 274
             S S  +     +   +          ++     +S       + T        + +A 
Sbjct: 192 LRSNSDGKDAFVFLSNGSSSTSSTSSTAARLSGVVKSSEKGKEKTKTDKKKDKMRTKSAH 251

Query: 275 NAQYGRNRTVKDAEKRRTYLPYR 297
              Y RNR +++  KRR+YLPY+
Sbjct: 252 EKLYMRNRAMREEGKRRSYLPYK 254

BLAST of HG10008672 vs. TAIR 10
Match: AT3G27880.1 (Protein of unknown function (DUF1645) )

HSP 1 Score: 70.5 bits (171), Expect = 2.7e-12
Identity = 82/253 (32.41%), Positives = 120/253 (47.43%), Query Frame = 0

Query: 64  SGGHARESSEGVRSNDCGKNGDEDEFEFAVLCREPDASTSSAHEIFYNGQIKPVYPVFNM 123
           SG    E +E V    C K   + EFEF+       A+TSS      +G    V+PVFN 
Sbjct: 18  SGDRLVEIAERV----CYKEKSDAEFEFST------AATSS------SGDGGLVFPVFNK 77

Query: 124 DLLLDNGSRVDIGLENLKKKPAVRRSPLRKLMNEERKTTPFSSSG-----ADDPGSVPSD 183
           +L+  + S         +K   ++   LR+  +++     +SSS       D+  S+PS+
Sbjct: 78  NLISGDVSP--------EKVIPLKDLFLRERNDQQPPQQTYSSSSDEEEEDDEFDSIPSE 137

Query: 184 TYCVWSP--STEKTSP-GRSNKRNSTGSSN-------RWKFRDLLYNSRSKSEGEDELMK 243
            YC W+P  ST   SP G   K  STGSS+       RW+ RD L   RSKS+G+  L  
Sbjct: 138 IYCPWTPARSTADMSPSGGCRKSKSTGSSSTSTWSTKRWRLRDFL--KRSKSDGKQSLKF 197

Query: 244 RKTIKKDDKPGNVSKGREDCRSGVGVSVTSSTSSNSGFFTSFTAQNAQYGRNRTVKDAEK 302
                +DD         E  +S   VSV+ + S++  F          Y RN+ +K+ +K
Sbjct: 198 LNPTNRDDDD-------ESSKSKKKVSVSVTVSAHEKF----------YLRNKAIKEEDK 227

BLAST of HG10008672 vs. TAIR 10
Match: AT2G15760.1 (Protein of unknown function (DUF1645) )

HSP 1 Score: 51.2 bits (121), Expect = 1.7e-06
Identity = 72/242 (29.75%), Positives = 106/242 (43.80%), Query Frame = 0

Query: 84  GDEDEFEFAVLCREPDASTSSAHEIFYNGQIKPVYPVFNMDLLLDNGSRVDIGLENLKKK 143
           G ED+FEF    +    S S+A E+F  G+I+P+       +       ++I   + +K 
Sbjct: 70  GFEDDFEFNFSGQLEKTSFSAADELFDGGKIRPLRTPLTPTVSSPRSRGLEIEDSDDQKD 129

Query: 144 PAVRRSPLRKLMNEERK----TTPFSSSG--ADDPGSVPSDTYCVWSPSTEKTS------ 203
               RSP       +RK     +P   S    D+   V S      + S +K+S      
Sbjct: 130 RGRDRSPGSSSSRYDRKGSRSMSPLRVSDIMVDEEEEVQSTKMVASNTSNQKSSVFLSAI 189

Query: 204 --PGRSNKRNSTGSSNRWKFRDLLYNSRSKSEGE----DELMKRKTI--KKD-DKPGNVS 263
             PGR+ K        +WK +DLL   RS S+G      E + R  I  KKD ++  N S
Sbjct: 190 LFPGRAYK--------KWKLKDLLL-FRSASDGRPIPTKESLNRYDILTKKDAEEVRNSS 249

Query: 264 -KGREDCRSGVGVSVTSSTSSNSGFFTSFTAQNAQYGRNRTVKDAEKRRTYLPYRQHLVG 304
            + RE C S    SV+ S   N       +A    Y  NR V +  KR+T+LPY+Q  +G
Sbjct: 250 IRSRESCES----SVSRSRRRNGAV---VSAHEMHYTENRAVSEELKRKTFLPYKQGWLG 295

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_038878608.1  2.2e-152  90.97         uncharacterized protein LOC120070794 [Benincasa hispida]
XP_008464565.1  4.8e-120  78.27         PREDICTED: uncharacterized protein LOC103502406 [Cucumis melo] >KAA0057852.1 unc...
XP_031739680.1  3.6e-115  76.04         uncharacterized protein LOC105435813 [Cucumis sativus] >KGN63579.1 hypothetical ...
KAG6587470.1    1.5e-108  76.41         Serine/threonine-protein kinase ATM, partial [Cucurbita argyrosperma subsp. soro...
KAG7021458.1    3.6e-107  75.75         hypothetical protein SDJN02_15183, partial [Cucurbita argyrosperma subsp. argyro...

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A5A7UPR1      2.3e-120  78.27         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3CM99      2.3e-120  78.27         uncharacterized protein LOC103502406 OS=Cucumis melo OX=3656 GN=LOC103502406 PE=...
A0A0A0LRG8      1.7e-115  76.04         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G004920 PE=4 SV=1
A0A6J1F6C3      6.6e-99   80.32         uncharacterized protein LOC111441224 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1I4D5      1.5e-98   80.00         uncharacterized protein LOC111470540 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...

TAIR 10
Match Name      E-value   Identity (%)  Description
AT5G62770.1     4.4e-18   31.55         Protein of unknown function (DUF1645)
AT1G23710.1     6.3e-17   32.53         Protein of unknown function (DUF1645)
AT1G70420.1     1.1e-13   27.76         Protein of unknown function (DUF1645)
AT3G27880.1     2.7e-12   32.41         Protein of unknown function (DUF1645)
AT2G15760.1     1.7e-06   29.75         Protein of unknown function (DUF1645)

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                             Source       Source Term     Source Description     Alignment
IPR012442  Protein of unknown function DUF1645, plant  PFAM         PF07816         DUF1645                coord: 105..294; e-value: 9.9E-25; score: 88.2
None       No IPR available                            MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 160..204
None       No IPR available                            MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 148..204
None       No IPR available                            MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 234..255
None       No IPR available                            MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 1..22
None       No IPR available                            PANTHER      PTHR33095       FAMILY NOT NAMED       coord: 9..307
None       No IPR available                            PANTHER      PTHR33095:SF23  DUF1645 FAMILY PROTEIN coord: 9..307
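
Two of the MobiDB-lite disorder predictions overlap (160..204 lies inside 148..204), so adding up the listed ranges would count some residues twice. The sketch below merges the ranges before counting; it assumes the `coord` values are 1-based inclusive protein positions, and the function names are hypothetical.

```python
def merge_intervals(intervals):
    """Merge overlapping or adjacent 1-based inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous range: extend it.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


def covered_residues(intervals):
    """Number of distinct positions covered by the ranges."""
    return sum(end - start + 1 for start, end in merge_intervals(intervals))


# Coordinates copied from the MOBIDB_LITE rows above.
disorder = [(160, 204), (148, 204), (234, 255), (1, 22)]
print(merge_intervals(disorder))   # [(1, 22), (148, 204), (234, 255)]
print(covered_residues(disorder))  # 101

# Span of the PFAM DUF1645 match (coord: 105..294).
print(covered_residues([(105, 294)]))  # 190
```

On these assumptions, the predicted disordered regions cover 101 residues and the DUF1645 match spans 190 residues.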

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10008672.1  HG10008672.1  mRNA