HG10023351 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10023351
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: LEA_2 domain-containing protein
Location: Chr05: 33291345 .. 33292822 (+)
RNA-Seq Expression: HG10023351
Synteny: HG10023351
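
The Location field can be turned into a chromosome, start, end and strand, from which the genomic span follows. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF3 convention), which this page does not state explicitly.

```python
import re

def parse_location(loc: str):
    """Parse a string such as 'Chr05: 33291345 .. 33292822 (+)'.

    Returns (seqid, start, end, strand). The pattern is an assumption
    based on the single Location value shown on this page.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1

seqid, start, end, strand = parse_location("Chr05: 33291345 .. 33292822 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 33292822 - 33291345 + 1 = 1478 bp on the forward strand of Chr05.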
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCACACCACTCCAATCTATAACAACAGCCCAATTGAGTCACCTTCACACCCGTCATTTGGTCGCCATTCAAGAAACTCATCGGCGAGCCGATTTTCGGGTATTTTCCGGTCATCTTCAGGCAGAAAAGGGAGCAGCAAGAAGCAGATTAGCAATGAAAAAGGGTGGCCTGAGTGTAATGTGATCATGGAAGAAGGGCCTTATGATGACCTTGAGGATAAGGCTCTCTCAAGACGCTTTCAGGCATTAATTGCTCTCCTGAGTTTCATTGCTTTGTTCACACTTTTCTGCCTAATCATTTGGGGCGCCAGCAGGCCTTTCAAGGCTCAGATTTCTGTCAAGGTAATCAAGTTACTAATCAAACACTATTGATATGGTTTGTATTCTCTTTACACTGTTCTCGATGATTGCCAACATGAGTTTAGCTAAACAGCTAAAATCTTCATCCACAAATGATGTGATATTCTTTAAAAGAAAAAGTGATTACCTATGTTTACTCGAGAAGAAAGAAATGACATTAGATAAAATAAAGAAACTGACAGCTTAAATAACTATGTTTAATCTTCTATATATATATGCACAACAATTCTTTTAACTGAATCAAAATTCAAATTTATGAAGCTAGATCATTTGATTAGCTTCTCTCTTCTTGACCATGTCAATTTCCAGCCCTTCATGAACAGAGACGTTCATTTGTTTAATGTGAAATCTGATCTTTTCAGAGCTTGGCTGTGCATAATTTTTATGTTGGGGAAGGTTCAGATTCCACTGGGGTACCTACCAAGTTGCTGACATTGAATAGCACATTGAGGTTAAGTGTATACAACCCTGCTACAGTATTTGGCATTCACGTTACCTCCACACCAATTGATCTCATTTATTCAGAGATTGTTGTGGCCTCTGGTCAGGTAATAAACTAATGCGAGCTTCTGTTTTCTGATTCATAACTTTGTTACTAATTGATTTAGAAAAAGAGTTTGGATTGGAGTCCAACTTGCCTTAGGCAATGAAGTTAAGAGTTAAAAACTAAATCCTGCTTGAAATGACTTTCTAAGTACTTAAAAAAAAGTGTCTTCAAGTGCTTTTAAACACTTGGTCATTCCAAACAAGCCCAAAATCATCTTAAACAGGAATTTCTGATTGTTTCATCCTTAATTGCCAAGGACATTAACTCTCATGCCTTGTGATCCAATGTGCAGTTGAAGAAATATTACCAGCCAAGAAACAGTCACCGGACAGTGTCTGTCAATTTGGAAGGAATAAAGGTTCCTATGTATGGAGCTGCATCAACTTTGACTATTCCCCCAGCAAGTAGCCCGGTTCCAATGACATTGGCATTCAAAATTCGATCACGAGGATACGTCGTAGGGCAGCTAGTAAGAACAACACATATAAAGCAAATCTCTTGCCCTGTGGGTATTGATTCTACCAGCAACAAAGCCATCGTGTTCAAGAAGAATTCCTGCACATATGAGTGA

mRNA sequence

ATGCACACCACTCCAATCTATAACAACAGCCCAATTGAGTCACCTTCACACCCGTCATTTGGTCGCCATTCAAGAAACTCATCGGCGAGCCGATTTTCGGGTATTTTCCGGTCATCTTCAGGCAGAAAAGGGAGCAGCAAGAAGCAGATTAGCAATGAAAAAGGGTGGCCTGAGTGTAATGTGATCATGGAAGAAGGGCCTTATGATGACCTTGAGGATAAGGCTCTCTCAAGACGCTTTCAGGCATTAATTGCTCTCCTGAGTTTCATTGCTTTGTTCACACTTTTCTGCCTAATCATTTGGGGCGCCAGCAGGCCTTTCAAGGCTCAGATTTCTGTCAAGAGCTTGGCTGTGCATAATTTTTATGTTGGGGAAGGTTCAGATTCCACTGGGGTACCTACCAAGTTGCTGACATTGAATAGCACATTGAGGTTAAGTGTATACAACCCTGCTACAGTATTTGGCATTCACGTTACCTCCACACCAATTGATCTCATTTATTCAGAGATTGTTGTGGCCTCTGGTCAGTTGAAGAAATATTACCAGCCAAGAAACAGTCACCGGACAGTGTCTGTCAATTTGGAAGGAATAAAGGTTCCTATGTATGGAGCTGCATCAACTTTGACTATTCCCCCAGCAAGTAGCCCGGTTCCAATGACATTGGCATTCAAAATTCGATCACGAGGATACGTCGTAGGGCAGCTAGTAAGAACAACACATATAAAGCAAATCTCTTGCCCTGTGGGTATTGATTCTACCAGCAACAAAGCCATCGTGTTCAAGAAGAATTCCTGCACATATGAGTGA

Coding sequence (CDS)

ATGCACACCACTCCAATCTATAACAACAGCCCAATTGAGTCACCTTCACACCCGTCATTTGGTCGCCATTCAAGAAACTCATCGGCGAGCCGATTTTCGGGTATTTTCCGGTCATCTTCAGGCAGAAAAGGGAGCAGCAAGAAGCAGATTAGCAATGAAAAAGGGTGGCCTGAGTGTAATGTGATCATGGAAGAAGGGCCTTATGATGACCTTGAGGATAAGGCTCTCTCAAGACGCTTTCAGGCATTAATTGCTCTCCTGAGTTTCATTGCTTTGTTCACACTTTTCTGCCTAATCATTTGGGGCGCCAGCAGGCCTTTCAAGGCTCAGATTTCTGTCAAGAGCTTGGCTGTGCATAATTTTTATGTTGGGGAAGGTTCAGATTCCACTGGGGTACCTACCAAGTTGCTGACATTGAATAGCACATTGAGGTTAAGTGTATACAACCCTGCTACAGTATTTGGCATTCACGTTACCTCCACACCAATTGATCTCATTTATTCAGAGATTGTTGTGGCCTCTGGTCAGTTGAAGAAATATTACCAGCCAAGAAACAGTCACCGGACAGTGTCTGTCAATTTGGAAGGAATAAAGGTTCCTATGTATGGAGCTGCATCAACTTTGACTATTCCCCCAGCAAGTAGCCCGGTTCCAATGACATTGGCATTCAAAATTCGATCACGAGGATACGTCGTAGGGCAGCTAGTAAGAACAACACATATAAAGCAAATCTCTTGCCCTGTGGGTATTGATTCTACCAGCAACAAAGCCATCGTGTTCAAGAAGAATTCCTGCACATATGAGTGA

Protein sequence

MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSSKKQISNEKGWPECNVIMEEGPYDDLEDKALSRRFQALIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHNFYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLKKYYQPRNSHRTVSVNLEGIKVPMYGAASTLTIPPASSPVPMTLAFKIRSRGYVVGQLVRTTHIKQISCPVGIDSTSNKAIVFKKNSCTYE
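
The protein sequence is the conceptual translation of the CDS. A minimal sketch of that step is given below, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical and the short inputs are just the first 18 and last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGCACACCACTCCAATC"))  # first six codons of the CDS
print(translate("TGCACATATGAGTGA"))     # last five codons, ending in TGA
```

The two fragments translate to MHTTPI and CTYE, which agree with the start and end of the protein sequence shown above.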
Homology
BLAST of HG10023351 vs. NCBI nr
Match: XP_038897873.1 (uncharacterized protein LOC120085763 [Benincasa hispida])

HSP 1 Score: 515.4 bits (1326), Expect = 3.0e-142
Identity = 261/268 (97.39%), Positives = 263/268 (98.13%), Query Frame = 0

Query: 1   MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSSKKQISNEKGWPECN 60
           MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSSKKQISNEKGWPECN
Sbjct: 45  MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSSKKQISNEKGWPECN 104

Query: 61  VIMEEGPYDDLEDKALSRRFQALIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN 120
           VIMEEGPYDDLEDKALSRRFQA+IA LSFIALFT FCLIIWGASRPFKAQISVKSLAVHN
Sbjct: 105 VIMEEGPYDDLEDKALSRRFQAIIAFLSFIALFTFFCLIIWGASRPFKAQISVKSLAVHN 164

Query: 121 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLKKY 180
           FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLK Y
Sbjct: 165 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLKNY 224

Query: 181 YQPRNSHRTVSVNLEGIKVPMYGAASTLTIPPASSPVPMTLAFKIRSRGYVVGQLVRTTH 240
           YQPRNSHRTVSVNLEGIKVPMYGAASTLT+PP SSPVPM LAFKIRSRGYVVGQLVRTTH
Sbjct: 225 YQPRNSHRTVSVNLEGIKVPMYGAASTLTVPPTSSPVPMILAFKIRSRGYVVGQLVRTTH 284

Query: 241 IKQISCPVGIDSTSNKAIVFKKNSCTYE 269
           IKQISCPVGIDSTSNKAIVFKKNSCTYE
Sbjct: 285 IKQISCPVGIDSTSNKAIVFKKNSCTYE 312
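
The identity and positive counts reported for an HSP can be recomputed from the middle line of each alignment block: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch. The sketch below applies this to the five middle lines of the Benincasa hispida alignment above; the function names are hypothetical, and the approach assumes the whitespace in the middle lines has been preserved exactly.

```python
def midline_stats(midlines):
    """Count identities, positives and aligned columns from BLAST middle lines."""
    joined = "".join(midlines)
    identities = sum(ch.isalpha() for ch in joined)
    positives = identities + joined.count("+")
    return identities, positives, len(joined)

def percent(n, total):
    """Percentage rounded to two decimals, as printed in the report."""
    return round(100 * n / total, 2)

# Middle lines copied from the XP_038897873.1 alignment blocks.
MIDLINES = [
    "MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSSKKQISNEKGWPECN",
    "VIMEEGPYDDLEDKALSRRFQA+IA LSFIALFT FCLIIWGASRPFKAQISVKSLAVHN",
    "FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLK Y",
    "YQPRNSHRTVSVNLEGIKVPMYGAASTLT+PP SSPVPM LAFKIRSRGYVVGQLVRTTH",
    "IKQISCPVGIDSTSNKAIVFKKNSCTYE",
]

identities, positives, length = midline_stats(MIDLINES)
print(identities, positives, length)
print(percent(identities, length), percent(positives, length))
```

For this HSP the counts come out as 261 identities and 263 positives over 268 aligned columns, matching the reported 97.39% and 98.13%.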

BLAST of HG10023351 vs. NCBI nr
Match: XP_004141769.1 (uncharacterized protein LOC101220910 [Cucumis sativus] >KGN45378.1 hypothetical protein Csa_016273 [Cucumis sativus])

HSP 1 Score: 512.3 bits (1318), Expect = 2.5e-141
Identity = 261/268 (97.39%), Positives = 264/268 (98.51%), Query Frame = 0

Query: 1   MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSSKKQISNEKGWPECN 60
           MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKG SKKQISNEKGWPECN
Sbjct: 45  MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKG-SKKQISNEKGWPECN 104

Query: 61  VIMEEGPYDDLEDKALSRRFQALIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN 120
           VIMEEGPYDDLEDKALSRRFQ LIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN
Sbjct: 105 VIMEEGPYDDLEDKALSRRFQVLIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN 164

Query: 121 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLKKY 180
           FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPAT+FGIHVTSTPIDLIYSEIVVASGQLKKY
Sbjct: 165 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATIFGIHVTSTPIDLIYSEIVVASGQLKKY 224

Query: 181 YQPRNSHRTVSVNLEGIKVPMYGAASTLTIPPASSPVPMTLAFKIRSRGYVVGQLVRTTH 240
           YQPRNSHR VSVNLEGIKVPMYGAASTLT+PP SSPVPMTLAFKIRSRGYVVGQLV+TTH
Sbjct: 225 YQPRNSHRRVSVNLEGIKVPMYGAASTLTVPPTSSPVPMTLAFKIRSRGYVVGQLVKTTH 284

Query: 241 IKQISCPVGIDSTSNKAIVFKKNSCTYE 269
           IKQISCPVGIDSTSNKAIVFKKNSCTYE
Sbjct: 285 IKQISCPVGIDSTSNKAIVFKKNSCTYE 311
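
Each HSP reports a bit score followed by the raw score in parentheses, for example 515.4 bits (1326). The two are related by S' = (lambda * S - ln K) / ln 2. The page does not state the scoring system, so the sketch below is an assumption: it uses lambda = 0.267 and K = 0.041, the commonly tabulated Karlin-Altschul parameters for gapped BLASTP with BLOSUM62 and gap costs 11/1. The function name is hypothetical.

```python
import math

# Assumed Karlin-Altschul parameters (gapped BLOSUM62, gap open 11, extend 1).
# These are not given on this page.
LAMBDA = 0.267
K = 0.041

def bit_score(raw_score: float) -> float:
    """Convert a raw alignment score to a bit score."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)

for raw in (1326, 1318, 532):
    print(raw, round(bit_score(raw), 1))
```

With these assumed parameters, raw scores of 1326, 1318 and 532 convert to 515.4, 512.3 and 209.5 bits, consistent with the values reported for the corresponding hits.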

BLAST of HG10023351 vs. NCBI nr
Match: XP_008462165.1 (PREDICTED: uncharacterized protein LOC103500589 [Cucumis melo] >KAA0059303.1 uncharacterized protein E6C27_scaffold242G00260 [Cucumis melo var. makuwa] >TYK04025.1 uncharacterized protein E5676_scaffold347G002240 [Cucumis melo var. makuwa])

HSP 1 Score: 508.8 bits (1309), Expect = 2.8e-140
Identity = 259/268 (96.64%), Positives = 263/268 (98.13%), Query Frame = 0

Query: 1   MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSSKKQISNEKGWPECN 60
           MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKG SKKQISNEKGWPECN
Sbjct: 45  MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKG-SKKQISNEKGWPECN 104

Query: 61  VIMEEGPYDDLEDKALSRRFQALIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN 120
           VIMEEGPYDDLEDKALSRR Q LIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN
Sbjct: 105 VIMEEGPYDDLEDKALSRRIQVLIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN 164

Query: 121 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLKKY 180
           FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPAT+FGIHVTSTPIDLIYSEIVVASGQLKKY
Sbjct: 165 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATIFGIHVTSTPIDLIYSEIVVASGQLKKY 224

Query: 181 YQPRNSHRTVSVNLEGIKVPMYGAASTLTIPPASSPVPMTLAFKIRSRGYVVGQLVRTTH 240
           YQPRNSHR VSVNLEGIKVPMYGAASTLT+PP SSPVPMTL+FKIRSRGYVVGQLV+TTH
Sbjct: 225 YQPRNSHRRVSVNLEGIKVPMYGAASTLTVPPTSSPVPMTLSFKIRSRGYVVGQLVKTTH 284

Query: 241 IKQISCPVGIDSTSNKAIVFKKNSCTYE 269
           IKQISCPVGIDSTSNKAIVFKKNSCTYE
Sbjct: 285 IKQISCPVGIDSTSNKAIVFKKNSCTYE 311

BLAST of HG10023351 vs. NCBI nr
Match: XP_022964211.1 (uncharacterized protein LOC111464298 [Cucurbita moschata] >XP_023513962.1 uncharacterized protein LOC111778399 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 488.8 bits (1257), Expect = 3.0e-134
Identity = 249/268 (92.91%), Positives = 257/268 (95.90%), Query Frame = 0

Query: 1   MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSSKKQISNEKGWPECN 60
           MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGS K+  SN+KGWPECN
Sbjct: 45  MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSKKQ--SNDKGWPECN 104

Query: 61  VIMEEGPYDDLEDKALSRRFQALIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN 120
           VIMEEGPYDDLEDKALSRRFQA+IALLSFI LFTL CLIIWGASRPFKAQISVKSLAVHN
Sbjct: 105 VIMEEGPYDDLEDKALSRRFQAIIALLSFITLFTLLCLIIWGASRPFKAQISVKSLAVHN 164

Query: 121 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLKKY 180
           FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPAT+FGIHVTSTPIDLIYSEIVVASGQL+KY
Sbjct: 165 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATIFGIHVTSTPIDLIYSEIVVASGQLRKY 224

Query: 181 YQPRNSHRTVSVNLEGIKVPMYGAASTLTIPPASSPVPMTLAFKIRSRGYVVGQLVRTTH 240
           YQPRNSHRTVSVNLEGIKVPMYGAASTL++P  S PVPMTL FKIRSRG VVG+LVRTTH
Sbjct: 225 YQPRNSHRTVSVNLEGIKVPMYGAASTLSLPSTSGPVPMTLVFKIRSRGNVVGKLVRTTH 284

Query: 241 IKQISCPVGIDSTSNKAIVFKKNSCTYE 269
           IKQISCPVGIDSTS KAIVFKKNSCTYE
Sbjct: 285 IKQISCPVGIDSTSTKAIVFKKNSCTYE 310

BLAST of HG10023351 vs. NCBI nr
Match: XP_023000375.1 (uncharacterized protein LOC111494630 [Cucurbita maxima])

HSP 1 Score: 488.0 bits (1255), Expect = 5.1e-134
Identity = 249/268 (92.91%), Positives = 256/268 (95.52%), Query Frame = 0

Query: 1   MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSSKKQISNEKGWPECN 60
           MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGS K+  SN+KGWPECN
Sbjct: 45  MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSKKQ--SNDKGWPECN 104

Query: 61  VIMEEGPYDDLEDKALSRRFQALIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN 120
           VIMEEGPYDDLEDKALSRRFQA+IALLSFI LFTL CLIIWGASRPFKAQISVKSLAVHN
Sbjct: 105 VIMEEGPYDDLEDKALSRRFQAIIALLSFITLFTLLCLIIWGASRPFKAQISVKSLAVHN 164

Query: 121 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLKKY 180
           FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPAT+FGIHVTSTPIDLIYSEIVVASGQL+KY
Sbjct: 165 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATIFGIHVTSTPIDLIYSEIVVASGQLRKY 224

Query: 181 YQPRNSHRTVSVNLEGIKVPMYGAASTLTIPPASSPVPMTLAFKIRSRGYVVGQLVRTTH 240
           YQPRNSHRTVSVNLEGIKVPMYGAASTL+ P  S PVPMTL FKIRSRG VVG+LVRTTH
Sbjct: 225 YQPRNSHRTVSVNLEGIKVPMYGAASTLSFPSTSGPVPMTLVFKIRSRGNVVGKLVRTTH 284

Query: 241 IKQISCPVGIDSTSNKAIVFKKNSCTYE 269
           IKQISCPVGIDSTS KAIVFKKNSCTYE
Sbjct: 285 IKQISCPVGIDSTSTKAIVFKKNSCTYE 310

BLAST of HG10023351 vs. ExPASy TrEMBL
Match: A0A0A0K7D1 (LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G446930 PE=4 SV=1)

HSP 1 Score: 512.3 bits (1318), Expect = 1.2e-141
Identity = 261/268 (97.39%), Positives = 264/268 (98.51%), Query Frame = 0

Query: 1   MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSSKKQISNEKGWPECN 60
           MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKG SKKQISNEKGWPECN
Sbjct: 45  MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKG-SKKQISNEKGWPECN 104

Query: 61  VIMEEGPYDDLEDKALSRRFQALIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN 120
           VIMEEGPYDDLEDKALSRRFQ LIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN
Sbjct: 105 VIMEEGPYDDLEDKALSRRFQVLIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN 164

Query: 121 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLKKY 180
           FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPAT+FGIHVTSTPIDLIYSEIVVASGQLKKY
Sbjct: 165 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATIFGIHVTSTPIDLIYSEIVVASGQLKKY 224

Query: 181 YQPRNSHRTVSVNLEGIKVPMYGAASTLTIPPASSPVPMTLAFKIRSRGYVVGQLVRTTH 240
           YQPRNSHR VSVNLEGIKVPMYGAASTLT+PP SSPVPMTLAFKIRSRGYVVGQLV+TTH
Sbjct: 225 YQPRNSHRRVSVNLEGIKVPMYGAASTLTVPPTSSPVPMTLAFKIRSRGYVVGQLVKTTH 284

Query: 241 IKQISCPVGIDSTSNKAIVFKKNSCTYE 269
           IKQISCPVGIDSTSNKAIVFKKNSCTYE
Sbjct: 285 IKQISCPVGIDSTSNKAIVFKKNSCTYE 311

BLAST of HG10023351 vs. ExPASy TrEMBL
Match: A0A5D3BXX9 (LEA_2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold347G002240 PE=4 SV=1)

HSP 1 Score: 508.8 bits (1309), Expect = 1.3e-140
Identity = 259/268 (96.64%), Positives = 263/268 (98.13%), Query Frame = 0

Query: 1   MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSSKKQISNEKGWPECN 60
           MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKG SKKQISNEKGWPECN
Sbjct: 45  MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKG-SKKQISNEKGWPECN 104

Query: 61  VIMEEGPYDDLEDKALSRRFQALIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN 120
           VIMEEGPYDDLEDKALSRR Q LIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN
Sbjct: 105 VIMEEGPYDDLEDKALSRRIQVLIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN 164

Query: 121 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLKKY 180
           FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPAT+FGIHVTSTPIDLIYSEIVVASGQLKKY
Sbjct: 165 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATIFGIHVTSTPIDLIYSEIVVASGQLKKY 224

Query: 181 YQPRNSHRTVSVNLEGIKVPMYGAASTLTIPPASSPVPMTLAFKIRSRGYVVGQLVRTTH 240
           YQPRNSHR VSVNLEGIKVPMYGAASTLT+PP SSPVPMTL+FKIRSRGYVVGQLV+TTH
Sbjct: 225 YQPRNSHRRVSVNLEGIKVPMYGAASTLTVPPTSSPVPMTLSFKIRSRGYVVGQLVKTTH 284

Query: 241 IKQISCPVGIDSTSNKAIVFKKNSCTYE 269
           IKQISCPVGIDSTSNKAIVFKKNSCTYE
Sbjct: 285 IKQISCPVGIDSTSNKAIVFKKNSCTYE 311

BLAST of HG10023351 vs. ExPASy TrEMBL
Match: A0A1S3CHT6 (uncharacterized protein LOC103500589 OS=Cucumis melo OX=3656 GN=LOC103500589 PE=4 SV=1)

HSP 1 Score: 508.8 bits (1309), Expect = 1.3e-140
Identity = 259/268 (96.64%), Positives = 263/268 (98.13%), Query Frame = 0

Query: 1   MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSSKKQISNEKGWPECN 60
           MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKG SKKQISNEKGWPECN
Sbjct: 45  MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKG-SKKQISNEKGWPECN 104

Query: 61  VIMEEGPYDDLEDKALSRRFQALIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN 120
           VIMEEGPYDDLEDKALSRR Q LIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN
Sbjct: 105 VIMEEGPYDDLEDKALSRRIQVLIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN 164

Query: 121 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLKKY 180
           FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPAT+FGIHVTSTPIDLIYSEIVVASGQLKKY
Sbjct: 165 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATIFGIHVTSTPIDLIYSEIVVASGQLKKY 224

Query: 181 YQPRNSHRTVSVNLEGIKVPMYGAASTLTIPPASSPVPMTLAFKIRSRGYVVGQLVRTTH 240
           YQPRNSHR VSVNLEGIKVPMYGAASTLT+PP SSPVPMTL+FKIRSRGYVVGQLV+TTH
Sbjct: 225 YQPRNSHRRVSVNLEGIKVPMYGAASTLTVPPTSSPVPMTLSFKIRSRGYVVGQLVKTTH 284

Query: 241 IKQISCPVGIDSTSNKAIVFKKNSCTYE 269
           IKQISCPVGIDSTSNKAIVFKKNSCTYE
Sbjct: 285 IKQISCPVGIDSTSNKAIVFKKNSCTYE 311

BLAST of HG10023351 vs. ExPASy TrEMBL
Match: A0A6J1HIA4 (uncharacterized protein LOC111464298 OS=Cucurbita moschata OX=3662 GN=LOC111464298 PE=4 SV=1)

HSP 1 Score: 488.8 bits (1257), Expect = 1.4e-134
Identity = 249/268 (92.91%), Positives = 257/268 (95.90%), Query Frame = 0

Query: 1   MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSSKKQISNEKGWPECN 60
           MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGS K+  SN+KGWPECN
Sbjct: 45  MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSKKQ--SNDKGWPECN 104

Query: 61  VIMEEGPYDDLEDKALSRRFQALIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN 120
           VIMEEGPYDDLEDKALSRRFQA+IALLSFI LFTL CLIIWGASRPFKAQISVKSLAVHN
Sbjct: 105 VIMEEGPYDDLEDKALSRRFQAIIALLSFITLFTLLCLIIWGASRPFKAQISVKSLAVHN 164

Query: 121 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLKKY 180
           FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPAT+FGIHVTSTPIDLIYSEIVVASGQL+KY
Sbjct: 165 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATIFGIHVTSTPIDLIYSEIVVASGQLRKY 224

Query: 181 YQPRNSHRTVSVNLEGIKVPMYGAASTLTIPPASSPVPMTLAFKIRSRGYVVGQLVRTTH 240
           YQPRNSHRTVSVNLEGIKVPMYGAASTL++P  S PVPMTL FKIRSRG VVG+LVRTTH
Sbjct: 225 YQPRNSHRTVSVNLEGIKVPMYGAASTLSLPSTSGPVPMTLVFKIRSRGNVVGKLVRTTH 284

Query: 241 IKQISCPVGIDSTSNKAIVFKKNSCTYE 269
           IKQISCPVGIDSTS KAIVFKKNSCTYE
Sbjct: 285 IKQISCPVGIDSTSTKAIVFKKNSCTYE 310

BLAST of HG10023351 vs. ExPASy TrEMBL
Match: A0A6J1KI59 (uncharacterized protein LOC111494630 OS=Cucurbita maxima OX=3661 GN=LOC111494630 PE=4 SV=1)

HSP 1 Score: 488.0 bits (1255), Expect = 2.5e-134
Identity = 249/268 (92.91%), Positives = 256/268 (95.52%), Query Frame = 0

Query: 1   MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSSKKQISNEKGWPECN 60
           MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGS K+  SN+KGWPECN
Sbjct: 45  MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSKKQ--SNDKGWPECN 104

Query: 61  VIMEEGPYDDLEDKALSRRFQALIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN 120
           VIMEEGPYDDLEDKALSRRFQA+IALLSFI LFTL CLIIWGASRPFKAQISVKSLAVHN
Sbjct: 105 VIMEEGPYDDLEDKALSRRFQAIIALLSFITLFTLLCLIIWGASRPFKAQISVKSLAVHN 164

Query: 121 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLKKY 180
           FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPAT+FGIHVTSTPIDLIYSEIVVASGQL+KY
Sbjct: 165 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATIFGIHVTSTPIDLIYSEIVVASGQLRKY 224

Query: 181 YQPRNSHRTVSVNLEGIKVPMYGAASTLTIPPASSPVPMTLAFKIRSRGYVVGQLVRTTH 240
           YQPRNSHRTVSVNLEGIKVPMYGAASTL+ P  S PVPMTL FKIRSRG VVG+LVRTTH
Sbjct: 225 YQPRNSHRTVSVNLEGIKVPMYGAASTLSFPSTSGPVPMTLVFKIRSRGNVVGKLVRTTH 284

Query: 241 IKQISCPVGIDSTSNKAIVFKKNSCTYE 269
           IKQISCPVGIDSTS KAIVFKKNSCTYE
Sbjct: 285 IKQISCPVGIDSTSTKAIVFKKNSCTYE 310

BLAST of HG10023351 vs. TAIR 10
Match: AT1G45688.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G42860.1); Has 258 Blast hits to 242 proteins in 39 species: Archae - 0; Bacteria - 11; Metazoa - 10; Fungi - 14; Plants - 198; Viruses - 17; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 209.5 bits (532), Expect = 3.3e-54
Identity = 130/297 (43.77%), Positives = 177/297 (59.60%), Query Frame = 0

Query: 2   HTTPIYNNSPIESP--SHPSFGRHSRNSSASRFSGIFRSSSGR----KGSSKKQISNEKG 61
           H+TP+   SP+ SP  SH S GRHSR SS+SRFSG  +  S +     GS +K    EK 
Sbjct: 46  HSTPVL--SPMGSPPHSHSSMGRHSRESSSSRFSGSLKPGSRKVNPNDGSKRKGHGGEKQ 105

Query: 62  WPECNVIMEEGPYDDLE-DKALSRRFQALIALLSFIALFTLFCLIIWGASRPFKAQISVK 121
           W EC VI EEG  DD + D  + RR   L  ++ F  LF  F LI++GA++P K +I+VK
Sbjct: 106 WKECAVIEEEGLLDDGDRDGGVPRRCYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVK 165

Query: 122 SLAVHNFYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVAS 181
           S+      +  G D+ GV T ++T+N+TLR+   N  T FG+HVTSTPIDL +S+I + S
Sbjct: 166 SITFETLKIQAGQDAGGVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGS 225

Query: 182 GQLKKYYQPRNSHRTVSVNLEGIKVPMYGAASTLTIPPA--------------------- 241
           G +KK+YQ R S RTV V++ G K+P+YG+ STL +PPA                     
Sbjct: 226 GSVKKFYQGRKSERTVLVHVIGEKIPLYGSGSTL-LPPAPPAPLPKPKKKKGAPVPIPDP 285

Query: 242 ---SSPVPMTLAFKIRSRGYVVGQLVRTTHIKQISCPVGIDSTS-NKAIVFKKNSCT 267
               +PVPMTL+F +RSR YV+G+LV+    K+I C +  +  + NK IV  KN CT
Sbjct: 286 PAPPAPVPMTLSFVVRSRAYVLGKLVQPKFYKKIECDINFEHKNLNKHIVITKN-CT 338

BLAST of HG10023351 vs. TAIR 10
Match: AT5G42860.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G45688.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 174.5 bits (441), Expect = 1.2e-43
Identity = 113/289 (39.10%), Positives = 159/289 (55.02%), Query Frame = 0

Query: 2   HTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSSKKQISNEKGWPECNV 61
           H+TP+   SP+ SP H        +SS+SRFS I        GS +K  + EK   +  +
Sbjct: 46  HSTPVL-TSPMGSPPH-------SHSSSSRFSKI-------NGSKRKGHAGEK---QFAM 105

Query: 62  IMEEGPYD--DLEDKALSRRFQALIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVH 121
           I EEG  D  D E +AL RR   L  ++ F  LF  F LI++ A++P K +ISVKS+   
Sbjct: 106 IEEEGLLDDGDREQEALPRRCYVLAFIVGFSLLFAFFSLILYAAAKPQKPKISVKSITFE 165

Query: 122 NFYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLKK 181
              V  G D+ G+ T ++T+N+TLR+   N  T FG+HVTS+PIDL +S+I + SG +KK
Sbjct: 166 QLKVQAGQDAGGIGTDMITMNATLRMLYRNTGTFFGVHVTSSPIDLSFSQITIGSGSIKK 225

Query: 182 YYQPRNSHRTVSVNLEGIKVPMYGAASTLTIPP----------------------ASSPV 241
           +YQ R S RTV VN+ G K+P+YG+ STL  PP                        +PV
Sbjct: 226 FYQSRKSQRTVVVNVLGDKIPLYGSGSTLVPPPPPAPIPKPKKKKGPIVIVEPPAPPAPV 285

Query: 242 PMTLAFKIRSRGYVVGQLVRTTHIKQISCPVGIDSTSNKAIVFKKNSCT 267
           PM L F +RSR YV+G+LV+    K+I C +  +       +   N+CT
Sbjct: 286 PMRLNFTVRSRAYVLGKLVQPKFYKRIVCLINFEHKKLSKHIPITNNCT 316

BLAST of HG10023351 vs. TAIR 10
Match: AT3G24600.1 (Late embryogenesis abundant protein, group 2 )

HSP 1 Score: 173.3 bits (438), Expect = 2.6e-43
Identity = 100/258 (38.76%), Positives = 157/258 (60.85%), Query Frame = 0

Query: 11  PIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSSKKQISNEKGWPE-CNVIMEEGPYD 70
           P+ +P++        +SS+   +G        KGSS++   +   WPE    I E+  YD
Sbjct: 249 PVHTPNYTILSESRLSSSSRTSNGTSGMGFRWKGSSRR---SNMYWPEKPYTINEDEVYD 308

Query: 71  DLEDKALS-RRFQALIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHNFYVGEGSD 130
           D  ++ LS  + +A++ +L  + +F++FC ++WGAS PF   +SVKS+ +H+FY GEG D
Sbjct: 309 D--NRGLSVGQCRAVLVILGTVVVFSVFCSVLWGASHPFSPIVSVKSVDIHSFYYGEGID 368

Query: 131 STGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLKKYYQPRNSHR 190
            TGV TK+L+ NS++++++ +PA  FGIHV+S+   L +S + +A+GQLK YYQPR S  
Sbjct: 369 RTGVATKILSFNSSVKVTIDSPAPYFGIHVSSSTFKLTFSALTLATGQLKSYYQPRKSKH 428

Query: 191 TVSVNLEGIKVPMYGAASTLTIPPASSPVPMTLAFKIRSRGYVVGQLVRTTHIKQISCPV 250
              V L G +VP+YGA   L        VP+ L F+IRSRG ++G+LV++ H   +SC  
Sbjct: 429 ISIVKLTGAEVPLYGAGPHLAASDKKGKVPVKLEFEIRSRGNLLGKLVKSKHENHVSCSF 488

Query: 251 GIDST-SNKAIVFKKNSC 266
            I S+ ++K I F   +C
Sbjct: 489 FISSSKTSKPIEFTHKTC 501

BLAST of HG10023351 vs. TAIR 10
Match: AT1G45688.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G42860.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 142.9 bits (359), Expect = 3.7e-34
Identity = 90/205 (43.90%), Positives = 124/205 (60.49%), Query Frame = 0

Query: 2   HTTPIYNNSPIESP--SHPSFGRHSRNSSASRFSGIFRSSSGR----KGSSKKQISNEKG 61
           H+TP+   SP+ SP  SH S GRHSR SS+SRFSG  +  S +     GS +K    EK 
Sbjct: 46  HSTPVL--SPMGSPPHSHSSMGRHSRESSSSRFSGSLKPGSRKVNPNDGSKRKGHGGEKQ 105

Query: 62  WPECNVIMEEGPYDDLE-DKALSRRFQALIALLSFIALFTLFCLIIWGASRPFKAQISVK 121
           W EC VI EEG  DD + D  + RR   L  ++ F  LF  F LI++GA++P K +I+VK
Sbjct: 106 WKECAVIEEEGLLDDGDRDGGVPRRCYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVK 165

Query: 122 SLAVHNFYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVAS 181
           S+      +  G D+ GV T ++T+N+TLR+   N  T FG+HVTSTPIDL +S+I + S
Sbjct: 166 SITFETLKIQAGQDAGGVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGS 225

Query: 182 GQ----LKKYYQPRNSHRTVSVNLE 196
           G     ++K Y+ R    T ++NLE
Sbjct: 226 GSVSLPIQKLYRMREEIDT-NMNLE 247

BLAST of HG10023351 vs. TAIR 10
Match: AT4G35170.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 136.0 bits (341), Expect = 4.6e-32
Identity = 89/251 (35.46%), Positives = 140/251 (55.78%), Query Frame = 0

Query: 10  SPIESPSH-----PSFGRHS--RNSSASRFSGIFRS--SSGRKGSSKKQISNEKGWPECN 69
           SP  SP +      +F  HS   +SS  R SG  R+  SS +     ++   ++ + E  
Sbjct: 38  SPFGSPLNDQGQVSNFQHHSVAESSSYPRSSGPLRNEYSSVQVHDLDRRTHEDEDYDEM- 97

Query: 70  VIMEEGPYDDLEDKALSRRFQALIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN 129
               +GP  D + + ++R +  L  L + +  FTLFCLI+WG S+ F    ++K + + N
Sbjct: 98  ----DGP--DEKRRRITRFYSCL--LFTLVLAFTLFCLILWGVSKSFAPIATLKEMVLEN 157

Query: 130 FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLKKY 189
             V  G+D +GV T +LTLNST+R+   NPAT F +HVTS P+ L YS++++ASGQ+ ++
Sbjct: 158 LNVQSGNDQSGVLTDMLTLNSTVRILYRNPATFFTVHVTSAPLQLSYSQLILASGQMGEF 217

Query: 190 YQPRNSHRTVSVNLEGIKVPMYGAASTL---TIPPASSPVPMTLAFKIRSRGYVVGQLVR 249
            Q R S R +   + G ++P+YG    L      P    +P+ L F +R+R YV+G+LV+
Sbjct: 218 SQRRKSERIIETKVFGDQIPLYGGVPALFGQRAEPDQVVLPLNLTFTLRARAYVLGRLVK 277

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038897873.1 | 3.0e-142 | 97.39 | uncharacterized protein LOC120085763 [Benincasa hispida]
XP_004141769.1 | 2.5e-141 | 97.39 | uncharacterized protein LOC101220910 [Cucumis sativus] >KGN45378.1 hypothetical ...
XP_008462165.1 | 2.8e-140 | 96.64 | PREDICTED: uncharacterized protein LOC103500589 [Cucumis melo] >KAA0059303.1 unc...
XP_022964211.1 | 3.0e-134 | 92.91 | uncharacterized protein LOC111464298 [Cucurbita moschata] >XP_023513962.1 unchar...
XP_023000375.1 | 5.1e-134 | 92.91 | uncharacterized protein LOC111494630 [Cucurbita maxima]

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0K7D1 | 1.2e-141 | 97.39 | LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G446930 PE=4 ...
A0A5D3BXX9 | 1.3e-140 | 96.64 | LEA_2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_...
A0A1S3CHT6 | 1.3e-140 | 96.64 | uncharacterized protein LOC103500589 OS=Cucumis melo OX=3656 GN=LOC103500589 PE=...
A0A6J1HIA4 | 1.4e-134 | 92.91 | uncharacterized protein LOC111464298 OS=Cucurbita moschata OX=3662 GN=LOC1114642...
A0A6J1KI59 | 2.5e-134 | 92.91 | uncharacterized protein LOC111494630 OS=Cucurbita maxima OX=3661 GN=LOC111494630...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G45688.1 | 3.3e-54 | 43.77 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G42860.1 | 1.2e-43 | 39.10 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G24600.1 | 2.6e-43 | 38.76 | Late embryogenesis abundant protein, group 2
AT1G45688.2 | 3.7e-34 | 43.90 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G35170.1 | 4.6e-32 | 35.46 | Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR004864 | Late embryogenesis abundant protein, LEA_2 subgroup | PFAM | PF03168 | LEA_2 | coord: 144..240; e-value: 2.6E-7; score: 31.1
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..52
None | No IPR available | PANTHER | PTHR31852:SF141 | LATE EMBRYOGENESIS ABUNDANT PROTEIN, GROUP 2 | coord: 7..268
None | No IPR available | PANTHER | PTHR31852 | LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY | coord: 7..268
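
The coord values in the InterPro annotations refer to positions in the protein sequence. A minimal sketch of extracting an annotated region is shown below. It assumes the coordinates are 1-based and inclusive, which is how InterProScan output is normally reported but is not stated on this page; the `region` helper is hypothetical, and the protein is the sequence listed in the Sequences section, split into lines of 60 residues.

```python
# Protein sequence of HG10023351 as listed under "Protein sequence".
PROTEIN = (
    "MHTTPIYNNSPIESPSHPSFGRHSRNSSASRFSGIFRSSSGRKGSSKKQISNEKGWPECN"
    "VIMEEGPYDDLEDKALSRRFQALIALLSFIALFTLFCLIIWGASRPFKAQISVKSLAVHN"
    "FYVGEGSDSTGVPTKLLTLNSTLRLSVYNPATVFGIHVTSTPIDLIYSEIVVASGQLKKY"
    "YQPRNSHRTVSVNLEGIKVPMYGAASTLTIPPASSPVPMTLAFKIRSRGYVVGQLVRTTH"
    "IKQISCPVGIDSTSNKAIVFKKNSCTYE"
)

def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]

lea2 = region(PROTEIN, 144, 240)      # Pfam PF03168 (LEA_2) match
disordered = region(PROTEIN, 1, 52)   # MobiDB-lite disorder prediction

print(len(PROTEIN), len(lea2), len(disordered))
print(lea2)
```

Under that assumption the LEA_2 match covers 97 of the 268 residues, and the predicted disordered segment covers the first 52.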

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10023351.1 | HG10023351.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0016021 | integral component of membrane