HG10004066 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10004066
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Expansin-A1-like
Location: Chr08: 13322014 .. 13325818 (-)
RNA-Seq Expression: HG10004066
Synteny: HG10004066
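The Location field can be read programmatically. The sketch below is only an illustration: `parse_location` is a hypothetical helper name, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re


def parse_location(text):
    """Split a location string such as 'Chr08: 13322014 .. 13325818 (-)'
    into (chromosome, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


chrom, start, end, strand = parse_location("Chr08: 13322014 .. 13325818 (-)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 3805 bp on the minus strand of Chr08.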
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGAAGGTGGACGTGCCCATGTTGGGTTTGTTTCTTGTTGTTCTCCTGGCAGCAATGACATTTGAGCCCATCTCATCTTTGCCATCTACTATTCCTGCATTCCTTTGGTCCCCTCACCATCGCCACAGGTATGCTGCTTGATTGTTTGTGTTTCTTTTATGTGATATGTGCCATTTTTTGTGTACCAACTCATTTCAAATGAACAATAAACACATCCCAACATATCTACTCAGGATTGTTGGGATCTTCCTTTTCATTGTTGGATCCATTGTCTTTTAACTTTTTGGCCTTTCCTTAAATTACTTTTTGGAAAATTTTCGTCTTGTTATTTAAACTGTTCTTCCTGGATTTGCCAATTGAAGGTGTGGGATTTTCCGCTTGAGTGATATGAGGGTTCATATAAGAGCTCAATAGCCTGTAAAATTAATGTTCACATAATACTACAGGCAGTTTGGCACAACTTGCATAGTATAAACTCTACATAGATGAAGCTGTAGTGAATTTTTTGCCTTTCTCCCTGCTTAGCTGATTAAGATTTATTGTTATTATTTTAAATTTTAGATTCCGCATTGTTTTTTTCTTTCAGTCTATATGGTTAGTTTCCCCATTATCATCCTTCATTTTATTCTATAGGAATGAGTTCTCCTATTTGGTGGGGGGGGGGGGGGGAGGGACCACTTATTGAACAACTGTTCTAAACGTGTTACAATAAACTCTATATCATTGCTCTTTGTTCGATTTAAAAATTGTACTGTATGGCTGCAGGTTTTCTAACAACATTCTAGAAAAATATGTTGATTATCAGACCATTTCCCCAGAGGAGCTGGCAAAGTCTGTTCTGTATGAAGGGGGCTGGTCAAAAATTCTGGTGTGTTTGTAACATTTCAAATCTTGTTCTAGAGTTCTCTGTCATTAGTTTTTAAGAATGTCCTCCATGCGCAATTTTTATTTTTTTAAAAAAGCTACATGTTTTGCTTTTTGGTATGTCCTCTTTTTTAGTTTGTCTCCCTTTCTTTTTGGGCTTGGTTTTGGACACCCTTTTATTCTTTCATTTTGTTTAATCTCAATGTAAATTTGGTTATTCGTAAAAAAATTCTTTTCTCTTGATTTCTTTTCTCATATTTATATATTATTAACATTATGAATTGTTTAATTCTTTAGTGCACGGGAAAGGAAGTAGCGCAGCATGTGGATCTTGCAATAATCTTTGTTGGTTCAGAGGTAAGAATATATGGAGTTTGAGTACCACTTTTCGCTGTGTAATTCTTTCAGTTGTTGTGAATTTGTTTGGTCTCTCACATGGTTTATTTCATACTTCATTGTGTTTTTCCTTATAGCCAAAAATCATCCCTTGTTTATGAAGTTTGGATTTGGGATTATTGTTTGAGGTTAGAACATTTAAAAGACCTAGTTTTGTAATATACTAGATGAGTTATCCTAAAGATGCGTTTGATGATTCCAGCATTTTTATAAAATAGAAAACCAACTTTTAACGTGTTCACCGATTGACGTTGACCCAAAAGCTTAAGGTGATAGGTTATGGTAAATTTAATTATATCAACACTTTAACACTCTCCCTCACTCGTGGGCTTTGAAATTTGTACAAAACCCAACAAGTGGAAATCAATATTAGTTGGGGAGCAAATGACATTACGAGGGTAGGGGTTTGAACACAAGACCTCCCCGCTGCAGAAGAAACCAAAATAACCCAGAGTCCTTCTCCCAAGAGTGGAACAATCTCTTCCAACAAGACGAAGAGAGATGCAACATCCGTAGCTTTCCCATTGGTCAAAGGGCGACGAAACCCAAATGAAAGAGAAACTGAGCTCCTTGATCAGATCAAAAAATCAGAAACCACCCTATTCTTAAAGGAAGACAAATAATAGATATAAAGAAACACATAGAAAAGGGGATTAGATCTCACCCAATGATCTTCCCACATTTCCTTCCCTTCCACCACTACACAATGAACTAAGTGAGAGAAAGAAGGGAACTCA
AAAGAAATATCTTTCCTTAAATTTCGGTAAGCGCTTTATCCCCACCCGACAACCATTCAAAAGGATGAGGACCAAATTTACTCACGATAATCCTGTTCCAAAGAGAGTTGGGTTCAAGGGGAAAATGCCACAACTATTTGGCTAACAAAGCTTTGTGGCTTTTTGGAATACATTGAAGTTATTAGTCTTTCCCAAGGCAACCAACCAAGTAAAGAACTAGGTTTTCTTATAGATTTTAACCTCCACAGGGTAGAGAAGAGAGAGCATACGAAGGAACAAGATGAGAAGGGGACTAAGAAGGAAACTGAATGAATAAACAAGAATCTACAAGTATATGGCAAACAATATTTTATGTTTTCCTTTGTTATGTTGTTTTGGTTATGTTTTTGGAGCTTTTATTATGTTTTTGGAGCTTTTATGTCTCTAAGGTTTGCTTTTTCCTTTTTGCCTCTAATTTCTTCTATCAATCATTTTTAACTTAAGAGGAGGTTTACATCTTACCTTCAGAAGGGAAGGACTGGAGCTAAAATTTCTCCTGTTTCTTTTAGAAAAAATTCTTATATACACACCCCTCTTTGTCAATTGCCAAAGACTATTATAATATTCTAAGACAGTATCGGGTATACACCATGTTAATTGTTATGTGTTATTGAAAAATGAATGCAAAGCTTGAAAATAGTAACTATGAATCTTTGAAATCCACGTTCTGCTTGCTTCTCTCTCTCTCTCTTTTTAATAAATAATAATAATTTATTTATTTATTCATACTCAAATTATGATGCTAGTTCAAAACTTTCTTTTATGTCAATGTGCCACAAATAGTGTTTCTACTAAAGGATAATAATTTGGTGCTTTGTTATGTTGGTTTCTTGATCAACATTATGTTCGTTCACGGTGTGACAAGTCTTTTGCTTTTTATAGTTGCAGTCAGATTTCATGTTGAGCAGGCACGTGGATCCAAATCTTATGGACTTACTTAAGGTTTGGAAAAGCAATTTGTTTTATAAATTCTAAAACTGCATATCTCGAGTTTCAACTCTAAATTTAGGATCAATTTTCACAGGTCTCTTTCTCAAGGTCTAACTTCTCTATGGCATTTCCCTATGTGGCTGCACCAGAAAGGGGTGCAATAGAAAAGTTATTGATCTCAGAGTTCAAAAAATCATGTGGGCATGACCTCAGAATCAGCACTAGCGCTTTCCAGGAGTTGTCCTCTGTTGAGGATGAATCATTCCAGAAGCTTCAACTGCTGCCACATTCGATTAATGTAAATCTGTCTTTATATGACTAATGTCATCTGTTTTCTTATCAGAACATTAATGATATCCTTATTTATGTTTTTTTCATTAAACGAGTAGGATTATATGGTTTCAAGAATGGAAAAGAAGCCAAAGGGAGAGACAGATTTGGTCGTTTTCTCTCATGGAGATTTCAGTTCTCCTCAAGAAGGAAATCTATGGACTTCTGAAAGTATGTTGACTCATTTTATTTTGTCTTCAGATGGTCATTAGGACAAGCAACATGATTGATGTTATGGTATAAAATTCTTATGTTGAATTTTTGAAATTATTATAGGCAAAACTTTGTTGGAGATCATGACTTCTGCGGAGCATGTTGGGGCAAAATATGAAATTCTCTATATATCAGATCCATTTAGGTCCATTCGCCATTCTTATGTGGAGCTGGGAAGATTTATGGCTGAAGGTTCCTCTGGAAATGGATCAGCTAAATCAGAAAATCTTTGTGATGAAGTCTGCCAAATTAAATCATCTCTTCTCGAGGGCCTCTTTGTTGTGAGTCACTAA

mRNA sequence

ATGAAGAAGGTGGACGTGCCCATGTTGGGTTTGTTTCTTGTTGTTCTCCTGGCAGCAATGACATTTGAGCCCATCTCATCTTTGCCATCTACTATTCCTGCATTCCTTTGGTCCCCTCACCATCGCCACAGGTTTTCTAACAACATTCTAGAAAAATATGTTGATTATCAGACCATTTCCCCAGAGGAGCTGGCAAAGTCTGTTCTGTATGAAGGGGGCTGGTCAAAAATTCTGTGCACGGGAAAGGAAGTAGCGCAGCATGTGGATCTTGCAATAATCTTTGTTGGTTCAGAGTCAGATTTCATGTTGAGCAGGCACGTGGATCCAAATCTTATGGACTTACTTAAGGTCTCTTTCTCAAGGTCTAACTTCTCTATGGCATTTCCCTATGTGGCTGCACCAGAAAGGGGTGCAATAGAAAAGTTATTGATCTCAGAGTTCAAAAAATCATGTGGGCATGACCTCAGAATCAGCACTAGCGCTTTCCAGGAGTTGTCCTCTGTTGAGGATGAATCATTCCAGAAGCTTCAACTGCTGCCACATTCGATTAATGATTATATGGTTTCAAGAATGGAAAAGAAGCCAAAGGGAGAGACAGATTTGGTCGTTTTCTCTCATGGAGATTTCAGTTCTCCTCAAGAAGGAAATCTATGGACTTCTGAAAGCAAAACTTTGTTGGAGATCATGACTTCTGCGGAGCATGTTGGGGCAAAATATGAAATTCTCTATATATCAGATCCATTTAGGTCCATTCGCCATTCTTATGTGGAGCTGGGAAGATTTATGGCTGAAGGTTCCTCTGGAAATGGATCAGCTAAATCAGAAAATCTTTGTGATGAAGTCTGCCAAATTAAATCATCTCTTCTCGAGGGCCTCTTTGTTGTGAGTCACTAA

Coding sequence (CDS)

ATGAAGAAGGTGGACGTGCCCATGTTGGGTTTGTTTCTTGTTGTTCTCCTGGCAGCAATGACATTTGAGCCCATCTCATCTTTGCCATCTACTATTCCTGCATTCCTTTGGTCCCCTCACCATCGCCACAGGTTTTCTAACAACATTCTAGAAAAATATGTTGATTATCAGACCATTTCCCCAGAGGAGCTGGCAAAGTCTGTTCTGTATGAAGGGGGCTGGTCAAAAATTCTGTGCACGGGAAAGGAAGTAGCGCAGCATGTGGATCTTGCAATAATCTTTGTTGGTTCAGAGTCAGATTTCATGTTGAGCAGGCACGTGGATCCAAATCTTATGGACTTACTTAAGGTCTCTTTCTCAAGGTCTAACTTCTCTATGGCATTTCCCTATGTGGCTGCACCAGAAAGGGGTGCAATAGAAAAGTTATTGATCTCAGAGTTCAAAAAATCATGTGGGCATGACCTCAGAATCAGCACTAGCGCTTTCCAGGAGTTGTCCTCTGTTGAGGATGAATCATTCCAGAAGCTTCAACTGCTGCCACATTCGATTAATGATTATATGGTTTCAAGAATGGAAAAGAAGCCAAAGGGAGAGACAGATTTGGTCGTTTTCTCTCATGGAGATTTCAGTTCTCCTCAAGAAGGAAATCTATGGACTTCTGAAAGCAAAACTTTGTTGGAGATCATGACTTCTGCGGAGCATGTTGGGGCAAAATATGAAATTCTCTATATATCAGATCCATTTAGGTCCATTCGCCATTCTTATGTGGAGCTGGGAAGATTTATGGCTGAAGGTTCCTCTGGAAATGGATCAGCTAAATCAGAAAATCTTTGTGATGAAGTCTGCCAAATTAAATCATCTCTTCTCGAGGGCCTCTTTGTTGTGAGTCACTAA

Protein sequence

MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTISPEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSESDFMLSRHVDPNLMDLLKVSFSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQLLPHSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAKYEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFVVSH
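The protein sequence should follow from the CDS by translation. A minimal sketch, assuming the standard nuclear genetic code (the record does not name the translation table used) and using only the first and last few codons of the CDS listed above; `translate` is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon.
    Codons containing unexpected characters are reported as 'X'."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 bases and last 12 bases of the CDS shown above.
print(translate("ATGAAGAAGGTGGACGTGCCCATGTTGGGTTTG"))
print(translate("GTGAGTCACTAA"))
```

The two fragments correspond to the start (MKKVDVPMLGL) and the end (VSH) of the protein sequence above; passing the full CDS string to `translate` would be the natural way to check the whole record.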
Homology
BLAST of HG10004066 vs. NCBI nr
Match: XP_038885229.1 (uncharacterized protein LOC120075692 [Benincasa hispida] >XP_038885230.1 uncharacterized protein LOC120075692 [Benincasa hispida] >XP_038885231.1 uncharacterized protein LOC120075692 [Benincasa hispida] >XP_038885232.1 uncharacterized protein LOC120075692 [Benincasa hispida])

HSP 1 Score: 540.0 bits (1390), Expect = 1.3e-149
Identity = 272/296 (91.89%), Positives = 282/296 (95.27%), Query Frame = 0

Query: 1   MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
           MKKVDVPMLGL LVV LAA TFEPISSLPST+PAFLWSPHHRHRFSNNI +KYVDYQTIS
Sbjct: 1   MKKVDVPMLGLSLVVFLAAATFEPISSLPSTVPAFLWSPHHRHRFSNNIEDKYVDYQTIS 60

Query: 61  PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSE--SDFMLSRHVDPNLMDLLKVS 120
           P+ELAKSVLYEGGWSKILC GKEV QHVDLAIIF+GSE  SDFMLSR VDPNLMDLLKVS
Sbjct: 61  PQELAKSVLYEGGWSKILCMGKEVEQHVDLAIIFIGSELQSDFMLSRQVDPNLMDLLKVS 120

Query: 121 FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQL 180
           FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRIS SAFQELSSVEDESFQKL +
Sbjct: 121 FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISNSAFQELSSVEDESFQKLPM 180

Query: 181 LPHSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAK 240
           L HSINDYMVSRMEKKP+GET+LVVFSHGDFSSP+EGN WTSESKTLLEIMTSAEHVGAK
Sbjct: 181 LAHSINDYMVSRMEKKPEGETELVVFSHGDFSSPKEGNPWTSESKTLLEIMTSAEHVGAK 240

Query: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
           YEILYISDPFRSIRHSYVELGRFMAEGS+GNGSAKSE+ CDEVCQIKSSLLEGLFV
Sbjct: 241 YEILYISDPFRSIRHSYVELGRFMAEGSAGNGSAKSEDFCDEVCQIKSSLLEGLFV 296

BLAST of HG10004066 vs. NCBI nr
Match: XP_008456729.1 (PREDICTED: uncharacterized protein LOC103496586 isoform X2 [Cucumis melo])

HSP 1 Score: 527.3 bits (1357), Expect = 8.4e-146
Identity = 264/294 (89.80%), Positives = 278/294 (94.56%), Query Frame = 0

Query: 1   MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
           MK+  VP L LFLVVLLAA TF+PISSLPSTIPAFLWSPHHRH FSNNILEKYVDYQTIS
Sbjct: 1   MKQAGVPTLDLFLVVLLAAATFKPISSLPSTIPAFLWSPHHRHGFSNNILEKYVDYQTIS 60

Query: 61  PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSESDFMLSRHVDPNLMDLLKVSFS 120
           P+ELAKSVL EGGWS++LCTGKEV Q VDLAIIFVGS+SDF  SRHVDPNLM+LLKVSFS
Sbjct: 61  PQELAKSVLNEGGWSQLLCTGKEVKQPVDLAIIFVGSKSDFTSSRHVDPNLMNLLKVSFS 120

Query: 121 RSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQLLP 180
           RSNFSMAFPYVAAPERGA+EKLLISEFK+SCGHDLRIS SAFQELSSVEDESFQKL LLP
Sbjct: 121 RSNFSMAFPYVAAPERGAVEKLLISEFKQSCGHDLRISNSAFQELSSVEDESFQKLPLLP 180

Query: 181 HSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAKYE 240
           HSINDYMVSRME KP+GET+LV+FSHGDFSSP+EGN WTSESKTL EIMTSAEHVGAKYE
Sbjct: 181 HSINDYMVSRMENKPEGETELVIFSHGDFSSPEEGNPWTSESKTLSEIMTSAEHVGAKYE 240

Query: 241 ILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
           ILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSE+LCDEVCQIKSSLLEGLFV
Sbjct: 241 ILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSESLCDEVCQIKSSLLEGLFV 294

BLAST of HG10004066 vs. NCBI nr
Match: XP_011656616.1 (uncharacterized protein LOC101220040 isoform X2 [Cucumis sativus])

HSP 1 Score: 526.6 bits (1355), Expect = 1.4e-145
Identity = 264/294 (89.80%), Positives = 277/294 (94.22%), Query Frame = 0

Query: 1   MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
           MK+ DVP LGLFLVVLLAA TFEPISSLPSTIPAFLWSPH RH FSNNILEKYVDYQTIS
Sbjct: 1   MKQADVPTLGLFLVVLLAAATFEPISSLPSTIPAFLWSPHQRHGFSNNILEKYVDYQTIS 60

Query: 61  PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSESDFMLSRHVDPNLMDLLKVSFS 120
           P+ELAKSVL EGGWS++LCTGKEV QHVDLAIIFVGSESDF  SRHVDPNLMDLLKVSFS
Sbjct: 61  PQELAKSVLNEGGWSQLLCTGKEVKQHVDLAIIFVGSESDFTSSRHVDPNLMDLLKVSFS 120

Query: 121 RSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQLLP 180
           RSNFSMAFPYVAAPE+GA+EKLLISEFK+SCGHDLRIS+SAFQELSSVEDESFQKL LLP
Sbjct: 121 RSNFSMAFPYVAAPEKGAVEKLLISEFKQSCGHDLRISSSAFQELSSVEDESFQKLSLLP 180

Query: 181 HSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAKYE 240
           HSINDYMVSRME K +GET+LV+FSHGDFSSP+EGN WTSESKTL EIMTSAEHVGAKYE
Sbjct: 181 HSINDYMVSRMENKREGETELVIFSHGDFSSPEEGNPWTSESKTLSEIMTSAEHVGAKYE 240

Query: 241 ILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
           ILYISDPFRSIRHSYVELGRFMAEGSS N SAKSE+ CDEVCQIKSSLLEGLFV
Sbjct: 241 ILYISDPFRSIRHSYVELGRFMAEGSSVNESAKSESFCDEVCQIKSSLLEGLFV 294

BLAST of HG10004066 vs. NCBI nr
Match: XP_008456728.1 (PREDICTED: uncharacterized protein LOC103496586 isoform X1 [Cucumis melo])

HSP 1 Score: 522.7 bits (1345), Expect = 2.1e-144
Identity = 264/296 (89.19%), Positives = 278/296 (93.92%), Query Frame = 0

Query: 1   MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
           MK+  VP L LFLVVLLAA TF+PISSLPSTIPAFLWSPHHRH FSNNILEKYVDYQTIS
Sbjct: 1   MKQAGVPTLDLFLVVLLAAATFKPISSLPSTIPAFLWSPHHRHGFSNNILEKYVDYQTIS 60

Query: 61  PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGS--ESDFMLSRHVDPNLMDLLKVS 120
           P+ELAKSVL EGGWS++LCTGKEV Q VDLAIIFVGS  +SDF  SRHVDPNLM+LLKVS
Sbjct: 61  PQELAKSVLNEGGWSQLLCTGKEVKQPVDLAIIFVGSKLQSDFTSSRHVDPNLMNLLKVS 120

Query: 121 FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQL 180
           FSRSNFSMAFPYVAAPERGA+EKLLISEFK+SCGHDLRIS SAFQELSSVEDESFQKL L
Sbjct: 121 FSRSNFSMAFPYVAAPERGAVEKLLISEFKQSCGHDLRISNSAFQELSSVEDESFQKLPL 180

Query: 181 LPHSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAK 240
           LPHSINDYMVSRME KP+GET+LV+FSHGDFSSP+EGN WTSESKTL EIMTSAEHVGAK
Sbjct: 181 LPHSINDYMVSRMENKPEGETELVIFSHGDFSSPEEGNPWTSESKTLSEIMTSAEHVGAK 240

Query: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
           YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSE+LCDEVCQIKSSLLEGLFV
Sbjct: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSESLCDEVCQIKSSLLEGLFV 296

BLAST of HG10004066 vs. NCBI nr
Match: TYK04490.1 (uncharacterized protein E5676_scaffold409G001040 [Cucumis melo var. makuwa])

HSP 1 Score: 522.7 bits (1345), Expect = 2.1e-144
Identity = 264/296 (89.19%), Positives = 278/296 (93.92%), Query Frame = 0

Query: 1   MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
           MK+  VP L LFLVVLLAA TF+PISSLPSTIPAFLWSPHHRH FSNNILEKYVDYQTIS
Sbjct: 7   MKQAGVPTLDLFLVVLLAAATFKPISSLPSTIPAFLWSPHHRHGFSNNILEKYVDYQTIS 66

Query: 61  PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGS--ESDFMLSRHVDPNLMDLLKVS 120
           P+ELAKSVL EGGWS++LCTGKEV Q VDLAIIFVGS  +SDF  SRHVDPNLM+LLKVS
Sbjct: 67  PQELAKSVLNEGGWSQLLCTGKEVKQPVDLAIIFVGSKLQSDFTSSRHVDPNLMNLLKVS 126

Query: 121 FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQL 180
           FSRSNFSMAFPYVAAPERGA+EKLLISEFK+SCGHDLRIS SAFQELSSVEDESFQKL L
Sbjct: 127 FSRSNFSMAFPYVAAPERGAVEKLLISEFKQSCGHDLRISNSAFQELSSVEDESFQKLPL 186

Query: 181 LPHSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAK 240
           LPHSINDYMVSRME KP+GET+LV+FSHGDFSSP+EGN WTSESKTL EIMTSAEHVGAK
Sbjct: 187 LPHSINDYMVSRMENKPEGETELVIFSHGDFSSPEEGNPWTSESKTLSEIMTSAEHVGAK 246

Query: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
           YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSE+LCDEVCQIKSSLLEGLFV
Sbjct: 247 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSESLCDEVCQIKSSLLEGLFV 302

BLAST of HG10004066 vs. ExPASy TrEMBL
Match: A0A1S3C4L5 (uncharacterized protein LOC103496586 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496586 PE=4 SV=1)

HSP 1 Score: 527.3 bits (1357), Expect = 4.1e-146
Identity = 264/294 (89.80%), Positives = 278/294 (94.56%), Query Frame = 0

Query: 1   MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
           MK+  VP L LFLVVLLAA TF+PISSLPSTIPAFLWSPHHRH FSNNILEKYVDYQTIS
Sbjct: 1   MKQAGVPTLDLFLVVLLAAATFKPISSLPSTIPAFLWSPHHRHGFSNNILEKYVDYQTIS 60

Query: 61  PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSESDFMLSRHVDPNLMDLLKVSFS 120
           P+ELAKSVL EGGWS++LCTGKEV Q VDLAIIFVGS+SDF  SRHVDPNLM+LLKVSFS
Sbjct: 61  PQELAKSVLNEGGWSQLLCTGKEVKQPVDLAIIFVGSKSDFTSSRHVDPNLMNLLKVSFS 120

Query: 121 RSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQLLP 180
           RSNFSMAFPYVAAPERGA+EKLLISEFK+SCGHDLRIS SAFQELSSVEDESFQKL LLP
Sbjct: 121 RSNFSMAFPYVAAPERGAVEKLLISEFKQSCGHDLRISNSAFQELSSVEDESFQKLPLLP 180

Query: 181 HSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAKYE 240
           HSINDYMVSRME KP+GET+LV+FSHGDFSSP+EGN WTSESKTL EIMTSAEHVGAKYE
Sbjct: 181 HSINDYMVSRMENKPEGETELVIFSHGDFSSPEEGNPWTSESKTLSEIMTSAEHVGAKYE 240

Query: 241 ILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
           ILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSE+LCDEVCQIKSSLLEGLFV
Sbjct: 241 ILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSESLCDEVCQIKSSLLEGLFV 294

BLAST of HG10004066 vs. ExPASy TrEMBL
Match: A0A5D3BZE4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold409G001040 PE=4 SV=1)

HSP 1 Score: 522.7 bits (1345), Expect = 1.0e-144
Identity = 264/296 (89.19%), Positives = 278/296 (93.92%), Query Frame = 0

Query: 1   MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
           MK+  VP L LFLVVLLAA TF+PISSLPSTIPAFLWSPHHRH FSNNILEKYVDYQTIS
Sbjct: 7   MKQAGVPTLDLFLVVLLAAATFKPISSLPSTIPAFLWSPHHRHGFSNNILEKYVDYQTIS 66

Query: 61  PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGS--ESDFMLSRHVDPNLMDLLKVS 120
           P+ELAKSVL EGGWS++LCTGKEV Q VDLAIIFVGS  +SDF  SRHVDPNLM+LLKVS
Sbjct: 67  PQELAKSVLNEGGWSQLLCTGKEVKQPVDLAIIFVGSKLQSDFTSSRHVDPNLMNLLKVS 126

Query: 121 FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQL 180
           FSRSNFSMAFPYVAAPERGA+EKLLISEFK+SCGHDLRIS SAFQELSSVEDESFQKL L
Sbjct: 127 FSRSNFSMAFPYVAAPERGAVEKLLISEFKQSCGHDLRISNSAFQELSSVEDESFQKLPL 186

Query: 181 LPHSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAK 240
           LPHSINDYMVSRME KP+GET+LV+FSHGDFSSP+EGN WTSESKTL EIMTSAEHVGAK
Sbjct: 187 LPHSINDYMVSRMENKPEGETELVIFSHGDFSSPEEGNPWTSESKTLSEIMTSAEHVGAK 246

Query: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
           YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSE+LCDEVCQIKSSLLEGLFV
Sbjct: 247 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSESLCDEVCQIKSSLLEGLFV 302

BLAST of HG10004066 vs. ExPASy TrEMBL
Match: A0A1S3C3X2 (uncharacterized protein LOC103496586 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496586 PE=4 SV=1)

HSP 1 Score: 522.7 bits (1345), Expect = 1.0e-144
Identity = 264/296 (89.19%), Positives = 278/296 (93.92%), Query Frame = 0

Query: 1   MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
           MK+  VP L LFLVVLLAA TF+PISSLPSTIPAFLWSPHHRH FSNNILEKYVDYQTIS
Sbjct: 1   MKQAGVPTLDLFLVVLLAAATFKPISSLPSTIPAFLWSPHHRHGFSNNILEKYVDYQTIS 60

Query: 61  PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGS--ESDFMLSRHVDPNLMDLLKVS 120
           P+ELAKSVL EGGWS++LCTGKEV Q VDLAIIFVGS  +SDF  SRHVDPNLM+LLKVS
Sbjct: 61  PQELAKSVLNEGGWSQLLCTGKEVKQPVDLAIIFVGSKLQSDFTSSRHVDPNLMNLLKVS 120

Query: 121 FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQL 180
           FSRSNFSMAFPYVAAPERGA+EKLLISEFK+SCGHDLRIS SAFQELSSVEDESFQKL L
Sbjct: 121 FSRSNFSMAFPYVAAPERGAVEKLLISEFKQSCGHDLRISNSAFQELSSVEDESFQKLPL 180

Query: 181 LPHSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAK 240
           LPHSINDYMVSRME KP+GET+LV+FSHGDFSSP+EGN WTSESKTL EIMTSAEHVGAK
Sbjct: 181 LPHSINDYMVSRMENKPEGETELVIFSHGDFSSPEEGNPWTSESKTLSEIMTSAEHVGAK 240

Query: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
           YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSE+LCDEVCQIKSSLLEGLFV
Sbjct: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSESLCDEVCQIKSSLLEGLFV 296

BLAST of HG10004066 vs. ExPASy TrEMBL
Match: A0A0A0KB93 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G053330 PE=4 SV=1)

HSP 1 Score: 521.5 bits (1342), Expect = 2.2e-144
Identity = 264/296 (89.19%), Positives = 277/296 (93.58%), Query Frame = 0

Query: 1   MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
           MK+ DVP LGLFLVVLLAA TFEPISSLPSTIPAFLWSPH RH FSNNILEKYVDYQTIS
Sbjct: 75  MKQADVPTLGLFLVVLLAAATFEPISSLPSTIPAFLWSPHQRHGFSNNILEKYVDYQTIS 134

Query: 61  PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSE--SDFMLSRHVDPNLMDLLKVS 120
           P+ELAKSVL EGGWS++LCTGKEV QHVDLAIIFVGSE  SDF  SRHVDPNLMDLLKVS
Sbjct: 135 PQELAKSVLNEGGWSQLLCTGKEVKQHVDLAIIFVGSELQSDFTSSRHVDPNLMDLLKVS 194

Query: 121 FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQL 180
           FSRSNFSMAFPYVAAPE+GA+EKLLISEFK+SCGHDLRIS+SAFQELSSVEDESFQKL L
Sbjct: 195 FSRSNFSMAFPYVAAPEKGAVEKLLISEFKQSCGHDLRISSSAFQELSSVEDESFQKLSL 254

Query: 181 LPHSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAK 240
           LPHSINDYMVSRME K +GET+LV+FSHGDFSSP+EGN WTSESKTL EIMTSAEHVGAK
Sbjct: 255 LPHSINDYMVSRMENKREGETELVIFSHGDFSSPEEGNPWTSESKTLSEIMTSAEHVGAK 314

Query: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
           YEILYISDPFRSIRHSYVELGRFMAEGSS N SAKSE+ CDEVCQIKSSLLEGLFV
Sbjct: 315 YEILYISDPFRSIRHSYVELGRFMAEGSSVNESAKSESFCDEVCQIKSSLLEGLFV 370

BLAST of HG10004066 vs. ExPASy TrEMBL
Match: A0A6J1H415 (uncharacterized protein LOC111459798 OS=Cucurbita moschata OX=3662 GN=LOC111459798 PE=4 SV=1)

HSP 1 Score: 475.7 bits (1223), Expect = 1.4e-130
Identity = 241/296 (81.42%), Positives = 258/296 (87.16%), Query Frame = 0

Query: 1   MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
           MKKVDVP LGL LVVLL A TFEP SSLPST+PAFLWSPHH H FSNN++EK VDYQTIS
Sbjct: 1   MKKVDVPKLGLLLVVLLVAATFEPSSSLPSTVPAFLWSPHHHHGFSNNMIEKSVDYQTIS 60

Query: 61  PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSE--SDFMLSRHVDPNLMDLLKVS 120
           P+ELAKSVLYEGGWSK LC+ K V QHVDLAI+FVGSE  SDFMLSRHVDPNL DLLKVS
Sbjct: 61  PQELAKSVLYEGGWSKFLCSRKNVEQHVDLAIVFVGSELQSDFMLSRHVDPNLKDLLKVS 120

Query: 121 FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQL 180
           FSRSNFS+AFPYVAAPE G IE  LISEFKKSCGHDL IS SAF EL S+EDESFQ+L  
Sbjct: 121 FSRSNFSLAFPYVAAPESGTIENSLISEFKKSCGHDLGISNSAFHELCSIEDESFQRLP- 180

Query: 181 LPHSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAK 240
           L HSINDYMVSRMEKKPKGETDLVVF HG  +SP+E N W SESK LLEIMTSAEHVG+K
Sbjct: 181 LQHSINDYMVSRMEKKPKGETDLVVFCHGGSNSPKEVNSWASESKALLEIMTSAEHVGSK 240

Query: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
           YEILY+SDPFRSIRH+ ++L RF+AEGSSGNGS KS N CDEVCQIKSSLLEGLFV
Sbjct: 241 YEILYVSDPFRSIRHTSMKLERFLAEGSSGNGSTKSANFCDEVCQIKSSLLEGLFV 295

BLAST of HG10004066 vs. TAIR 10
Match: AT3G13410.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endoplasmic reticulum; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G55546.1); Has 49 Blast hits to 49 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 48; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 222.2 bits (565), Expect = 5.4e-58
Identity = 129/298 (43.29%), Positives = 189/298 (63.42%), Query Frame = 0

Query: 1   MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
           MKK+ +  + L LV L  A  FE   + P+T+PAFLWSPH +   +N  L++ V+YQ +S
Sbjct: 1   MKKIQIGAVAL-LVFLSVASLFEIGLASPNTVPAFLWSPHLQS--ANGELDEAVNYQVMS 60

Query: 61  PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSE---SDFMLSRHVDPNLMDLLKV 120
            ++L  SV  +GGWS  LC+ K++ Q VD+A++F+G E   SD    R+ DP L++ L  
Sbjct: 61  AKDLVGSVFTQGGWSNFLCSEKKLEQPVDVALVFIGRELLSSDVSSKRNSDPALVNTLNN 120

Query: 121 SFSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQ 180
            F+ SNFS+AFPY+AAPE   +E LL+S  K++C +++ +S   F +   VED + QKL 
Sbjct: 121 LFTASNFSLAFPYIAAPEEERMENLLLSGLKEACPNNVGVSNIVFSDSCFVEDGTIQKLS 180

Query: 181 LLPHSINDYMVSRMEKKPKGETDLVVF-SHGDFSSPQEGNLWTSESKTLLEIMTSAEHVG 240
            L  S  D++++R E + +GETDLVV  S G  S+ Q G    SE ++ LE+++S E  G
Sbjct: 181 DL-QSFKDHLLARRETRKEGETDLVVLCSEGSESNSQAGQS-HSERESFLELVSSVEQSG 240

Query: 241 AKYEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
           +KY  LY+SDP+     SY  L RF+AE + GN + +    CDE+C+ KSSLLEG+ V
Sbjct: 241 SKYTALYVSDPYWYT--SYKTLQRFLAETAKGNSTPEIATGCDELCKFKSSLLEGILV 291

BLAST of HG10004066 vs. TAIR 10
Match: AT1G55546.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G13410.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 77.0 bits (188), Expect = 2.8e-14
Identity = 47/119 (39.50%), Positives = 72/119 (60.50%), Query Frame = 0

Query: 1   MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
           + K+ +      LVVL  A   +   + PST+PAFLWSPH   +++N   E  V+YQ +S
Sbjct: 15  LMKLAINYYQYLLVVLEFASLVDFGLASPSTVPAFLWSPH--LQYANG--ETDVNYQVMS 74

Query: 61  PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSE---SDFMLSRHVDPNLMDLLK 117
            ++L  SV   GGWS  LC+ K++ Q VD+A++F+G E   SD   +++ DP L++ LK
Sbjct: 75  AKDLVDSVFTLGGWSNFLCSEKKLQQPVDVALVFIGRELLSSDVSSNQNSDPVLVNTLK 129
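The percentages in each HSP header are the matched-residue counts divided by the alignment length. The sketch below recomputes them from one header line; `hsp_percentages` is a hypothetical helper written for this illustration and is not part of BLAST or of the database software.

```python
import re


def hsp_percentages(header):
    """Recompute percentages from every 'Label = matched/length' pair
    found in an HSP header line, rounded to two decimals."""
    result = {}
    for label, matched, length in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        result[label] = round(100 * int(matched) / int(length), 2)
    return result


# Header of the first NCBI nr hit (XP_038885229.1).
line = "Identity = 272/296 (91.89%), Positives = 282/296 (95.27%), Query Frame = 0"
print(hsp_percentages(line))
```

For the first NCBI nr hit this gives 91.89 for identity and 95.27 for positives, in agreement with the values printed in the header.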

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name        E-value     Identity (%)   Description
XP_038885229.1    1.3e-149    91.89          uncharacterized protein LOC120075692 [Benincasa hispida] >XP_038885230.1 unchara...
XP_008456729.1    8.4e-146    89.80          PREDICTED: uncharacterized protein LOC103496586 isoform X2 [Cucumis melo]
XP_011656616.1    1.4e-145    89.80          uncharacterized protein LOC101220040 isoform X2 [Cucumis sativus]
XP_008456728.1    2.1e-144    89.19          PREDICTED: uncharacterized protein LOC103496586 isoform X1 [Cucumis melo]
TYK04490.1        2.1e-144    89.19          uncharacterized protein E5676_scaffold409G001040 [Cucumis melo var. makuwa]

vs. ExPASy TrEMBL
Match Name        E-value     Identity (%)   Description
A0A1S3C4L5        4.1e-146    89.80          uncharacterized protein LOC103496586 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5D3BZE4        1.0e-144    89.19          Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3C3X2        1.0e-144    89.19          uncharacterized protein LOC103496586 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A0A0KB93        2.2e-144    89.19          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G053330 PE=4 SV=1
A0A6J1H415        1.4e-130    81.42          uncharacterized protein LOC111459798 OS=Cucurbita moschata OX=3662 GN=LOC1114597...

vs. TAIR 10
Match Name        E-value     Identity (%)   Description
AT3G13410.1       5.4e-58     43.29          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G55546.1       2.8e-14     39.50          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term: None
IPR Description: No IPR available
Source: PANTHER
Source Term: PTHR35285
Source Description: 2-C-METHYL-D-ERYTHRITOL 4-PHOSPHATE CYTIDYLYLTRANSFERASE
Alignment: coord: 1..295

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
HG10004066.1    HG10004066.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
cellular_component   GO:0016021       integral component of membrane