HG10014307 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10014307
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Hydroxyproline-rich glycoprotein family protein
Location: Chr02: 9399050 .. 9400047 (-)
RNA-Seq Expression: HG10014307
Synteny: HG10014307
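
The Location field gives a chromosome, a start and end coordinate, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; this page does not state its convention), the span of the feature could be derived as below. The function name parse_location is hypothetical and not part of any database API.

```python
import re

def parse_location(location):
    """Split a 'Chr: start .. end (strand)' string and compute its span.

    Assumes 1-based inclusive coordinates, so span = end - start + 1.
    """
    match = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return chrom, start, end, strand, end - start + 1

chrom, start, end, strand, span = parse_location("Chr02: 9399050 .. 9400047 (-)")
print(chrom, strand, span)  # Chr02 - 998
```

Under that assumption the gene spans 998 bp on the minus strand of Chr02.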
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACTCTACAGACCAACTCTGCAACTTTGAAGCTGTTGCAAAAATCTCACAGCCAAAGCCAGATGGAGAACCAAAGAAACAGGTTAGAAGGAGGCGTCAAAGCCGGCGGCTTTACAAGGAAACGCCTCTGGATATGGCTGAGGCTAGAAGAGAGATTGTAACTGCACTTAAACTCCACAGAGCATCAACTAAAGAAGCAAGAGAACAGCAACAAAAACAAGACCAAGAAATTAAACAATCAGTTCCTCTGTTTCCTCAATTATGCCCATGTTTTGAAGCTGAAGGAAGAAGGAAATCCAGGAGAAATCCCAGGATATACCCAGGTTACTCCTATGATTGCTCATTTTATTTGGAAAATGGGTCTGGTTTTGTTGCTCCTCCACCTGTTGCACAGAATCTCAATGCAGAAATCCCTATACAAATCTTTGATGATGATTTCAAAACTTCATTCTGGCCCCCATCTTCATATATTTGTCTCACTGTTTCTTGTCCTGATACTCATCAGGAAGTTCCCAAATCAATTTCATTATCTGAGGAAGAAGGGAAGTTAATGGCTTCTGATTTCTTGTTTTGGTCCAATAATGATCCAACTAGAGAGAGTGAAAAAGATATGCAGCAAGGGGCAGTGGAGGAAGCTATGGCTATGGCTGAGATCAGGTCCATGTCCATGGATGTGAAAGCTTTGGAGTTTGATGTTCACCATAGTTCTGATAATGCTATGGAATTTCCAGAGTGGTTGAGCATCAATAATGATATTTTGCAGCAGCATTCGAATTATAAATGCGTAGAGGAGGATTATCTTCAATATCCTGACCTATCCTGGTATGGAATTAACTTTTCTAAATATTATTGAAATTCAACTCCAAAAATAAAGAAAATAGACATTTAGGAGCTAGCTTTTTTTTTTTTTTTTTTAAGTTCAAAAATACATATTTTGTATTTTGCAGCTTCGACTTTGGGAAGATTGAAGATGTGGATGGAGATTGGTTAGCATGA

mRNA sequence

ATGAACTCTACAGACCAACTCTGCAACTTTGAAGCTGTTGCAAAAATCTCACAGCCAAAGCCAGATGGAGAACCAAAGAAACAGGTTAGAAGGAGGCGTCAAAGCCGGCGGCTTTACAAGGAAACGCCTCTGGATATGGCTGAGGCTAGAAGAGAGATTGTAACTGCACTTAAACTCCACAGAGCATCAACTAAAGAAGCAAGAGAACAGCAACAAAAACAAGACCAAGAAATTAAACAATCAGTTCCTCTGTTTCCTCAATTATGCCCATGTTTTGAAGCTGAAGGAAGAAGGAAATCCAGGAGAAATCCCAGGATATACCCAGGTTACTCCTATGATTGCTCATTTTATTTGGAAAATGGGTCTGGTTTTGTTGCTCCTCCACCTGTTGCACAGAATCTCAATGCAGAAATCCCTATACAAATCTTTGATGATGATTTCAAAACTTCATTCTGGCCCCCATCTTCATATATTTGTCTCACTGTTTCTTGTCCTGATACTCATCAGGAAGTTCCCAAATCAATTTCATTATCTGAGGAAGAAGGGAAGTTAATGGCTTCTGATTTCTTGTTTTGGTCCAATAATGATCCAACTAGAGAGAGTGAAAAAGATATGCAGCAAGGGGCAGTGGAGGAAGCTATGGCTATGGCTGAGATCAGGTCCATGTCCATGGATGTGAAAGCTTTGGAGTTTGATGTTCACCATAGTTCTGATAATGCTATGGAATTTCCAGAGTGGTTGAGCATCAATAATGATATTTTGCAGCAGCATTCGAATTATAAATGCGTAGAGGAGGATTATCTTCAATATCCTGACCTATCCTGCTTCGACTTTGGGAAGATTGAAGATGTGGATGGAGATTGGTTAGCATGA

Coding sequence (CDS)

ATGAACTCTACAGACCAACTCTGCAACTTTGAAGCTGTTGCAAAAATCTCACAGCCAAAGCCAGATGGAGAACCAAAGAAACAGGTTAGAAGGAGGCGTCAAAGCCGGCGGCTTTACAAGGAAACGCCTCTGGATATGGCTGAGGCTAGAAGAGAGATTGTAACTGCACTTAAACTCCACAGAGCATCAACTAAAGAAGCAAGAGAACAGCAACAAAAACAAGACCAAGAAATTAAACAATCAGTTCCTCTGTTTCCTCAATTATGCCCATGTTTTGAAGCTGAAGGAAGAAGGAAATCCAGGAGAAATCCCAGGATATACCCAGGTTACTCCTATGATTGCTCATTTTATTTGGAAAATGGGTCTGGTTTTGTTGCTCCTCCACCTGTTGCACAGAATCTCAATGCAGAAATCCCTATACAAATCTTTGATGATGATTTCAAAACTTCATTCTGGCCCCCATCTTCATATATTTGTCTCACTGTTTCTTGTCCTGATACTCATCAGGAAGTTCCCAAATCAATTTCATTATCTGAGGAAGAAGGGAAGTTAATGGCTTCTGATTTCTTGTTTTGGTCCAATAATGATCCAACTAGAGAGAGTGAAAAAGATATGCAGCAAGGGGCAGTGGAGGAAGCTATGGCTATGGCTGAGATCAGGTCCATGTCCATGGATGTGAAAGCTTTGGAGTTTGATGTTCACCATAGTTCTGATAATGCTATGGAATTTCCAGAGTGGTTGAGCATCAATAATGATATTTTGCAGCAGCATTCGAATTATAAATGCGTAGAGGAGGATTATCTTCAATATCCTGACCTATCCTGCTTCGACTTTGGGAAGATTGAAGATGTGGATGGAGATTGGTTAGCATGA

Protein sequence

MNSTDQLCNFEAVAKISQPKPDGEPKKQVRRRRQSRRLYKETPLDMAEARREIVTALKLHRASTKEAREQQQKQDQEIKQSVPLFPQLCPCFEAEGRRKSRRNPRIYPGYSYDCSFYLENGSGFVAPPPVAQNLNAEIPIQIFDDDFKTSFWPPSSYICLTVSCPDTHQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAVEEAMAMAEIRSMSMDVKALEFDVHHSSDNAMEFPEWLSINNDILQQHSNYKCVEEDYLQYPDLSCFDFGKIEDVDGDWLA
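
The protein sequence above is the conceptual translation of the coding sequence. As a rough illustration, assuming the standard nuclear genetic code, the sketch below translates only the first 36 and the last 15 nucleotides of the CDS (copied from the sequence above) and reproduces the start and end of the protein. The helper name translate is hypothetical.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAACTCTACAGACCAACTCTGCAACTTTGAAGCT"  # first 12 codons of the CDS
cds_end = "GATTGGTTAGCATGA"                          # last 4 codons plus the stop codon

print(translate(cds_start))  # MNSTDQLCNFEA
print(translate(cds_end))    # DWLA
```

Both fragments agree with the first and last residues of the protein sequence listed above.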
Homology
BLAST of HG10014307 vs. NCBI nr
Match: KAE8649926.1 (hypothetical protein Csa_011922 [Cucumis sativus])

HSP 1 Score: 426.0 bits (1094), Expect = 2.6e-115
Identity = 239/307 (77.85%), Positives = 252/307 (82.08%), Query Frame = 0

Query: 1   MNSTDQLCNFEAVAKISQPKPDGEPKKQVRRRRQS-RRLYKETPLDMAEARREIVTALKL 60
           MNSTDQL NFEA A+IS  KPD EPKKQVRRRR S RRLYKE PLDMAEARREIVTALKL
Sbjct: 1   MNSTDQL-NFEAAAQIS--KPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKL 60

Query: 61  HRA-STKE-AREQQQKQDQEIKQSVPLFPQLCPCFEAEGRRKSRRNPRIYPGYSYDCSFY 120
           HRA STKE AREQQQKQDQE KQS PLFPQ   CFEAEGRRKSRRNPRIYP  SYDCSFY
Sbjct: 61  HRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKSRRNPRIYPDCSYDCSFY 120

Query: 121 LENGSGFVAPPPVAQNLNAEIPIQIFDDDFKT----------SFW-PPSSYICLTVSCPD 180
           LENGSG VAPPP  +NLN EIPIQ FDDDFKT          SFW PPSSYIC T+SCPD
Sbjct: 121 LENGSGLVAPPP--ENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTLSCPD 180

Query: 181 THQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV--EEAM-AMAEIRSMS 240
           THQE+PKS+SL EEEG LMASD +FW NNDPT  SEKDMQQ  V  EEAM AMA+I+SMS
Sbjct: 181 THQELPKSVSLREEEGNLMASD-VFWFNNDPTGVSEKDMQQEGVLEEEAMHAMADIKSMS 240

Query: 241 MDVKALEFDVHHSSDNAMEFPEWLSINNDILQQHSNYKCVEEDYLQYPDLSCFDFGKIED 291
           MDVKALE D  HSSDNAMEFP+WLSIN+D L Q+SNY CVEEDYLQ PDLSCFD  KIED
Sbjct: 241 MDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQDPDLSCFDTWKIED 300

BLAST of HG10014307 vs. NCBI nr
Match: XP_016901295.1 (PREDICTED: uncharacterized protein LOC103493717 [Cucumis melo] >KAA0064553.1 putative WRKY transcription factor protein 1 isoform X2 [Cucumis melo var. makuwa] >TYK20037.1 putative WRKY transcription factor protein 1 isoform X2 [Cucumis melo var. makuwa])

HSP 1 Score: 421.4 bits (1082), Expect = 6.3e-114
Identity = 237/309 (76.70%), Positives = 253/309 (81.88%), Query Frame = 0

Query: 1   MNSTDQLCNFEAVAKISQPKPDGEPKKQVRRRRQS-RRLYKETPLDMAEARREIVTALKL 60
           MNS DQL NFEA A+IS  KPD EPKKQVRRRR S RRLYKE PLDMAEARREIVTALKL
Sbjct: 1   MNSPDQL-NFEAAAQIS--KPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKL 60

Query: 61  HRA-STKE-AREQQQKQDQEIKQSVPLFPQLCPCFEAEGRRKSRRNPRIYPGYSYDCSFY 120
           HRA STKE AREQQQKQDQE KQS PLFP+L  CFEAEGRRKS+RNPRIYP  SYDCSFY
Sbjct: 61  HRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFY 120

Query: 121 LENGSGFVAPPPVAQNLNAEIPIQIFDDDFKT----------SFW-PPSSYICLTVSCPD 180
           LENGSGFVAPPP  +NLN EIPIQ FDDDFKT          SFW PPSSYIC TVSCPD
Sbjct: 121 LENGSGFVAPPP--ENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTVSCPD 180

Query: 181 T-HQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV--EEAMAMA--EIRS 240
           T HQE PKS+SL EEEG LMASD +FW NNDPT  +EKDMQQ AV  EEAMAMA  +++S
Sbjct: 181 THHQEFPKSVSLREEEGNLMASD-VFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKS 240

Query: 241 MSMDVKALEFDVHHSSDNAMEFPEWLSINNDILQQHSNYKCVEEDYLQYPDLSCFDFGKI 291
           MSMDVKALE D HHSSDNAM FP+W+SIN+D LQQ+SNY CVEED LQ PDLSCFD GKI
Sbjct: 241 MSMDVKALEIDCHHSSDNAMAFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKI 300

BLAST of HG10014307 vs. NCBI nr
Match: XP_038897806.1 (uncharacterized protein LOC120085720 [Benincasa hispida])

HSP 1 Score: 409.1 bits (1050), Expect = 3.3e-110
Identity = 220/287 (76.66%), Positives = 234/287 (81.53%), Query Frame = 0

Query: 1   MNSTDQLCNFEAVAKISQPKPDGEPKKQVRRRRQS-RRLYKETPLDMAEARREIVTALKL 60
           MNS DQLCNFEA A+ISQPKPDGE KKQVRRRR S RRLYKE PLDMAEARREIVTALKL
Sbjct: 1   MNSADQLCNFEAAAQISQPKPDGESKKQVRRRRHSRRRLYKEMPLDMAEARREIVTALKL 60

Query: 61  HRASTKEAREQQQKQDQEIKQSVPLFPQLCPCFEAEGRRKSRRNPRIYPGYSYDCSFYLE 120
           HRASTKEAREQQQKQDQ+I QS+P+FPQL PCFE +GRRKSRRN R YP    DCSFYLE
Sbjct: 61  HRASTKEAREQQQKQDQQINQSIPIFPQLGPCFEGDGRRKSRRNTRTYP----DCSFYLE 120

Query: 121 NGSGFVAPPPVAQNLNAEIPIQIFDDDFKT------SFWPPSSYICLTVSCPDTHQEVPK 180
           NGSGFVAPP VAQNL  EIP Q FDDDFKT      SFWPPSSYI  TVSC  THQEVPK
Sbjct: 121 NGSGFVAPPSVAQNLITEIPTQSFDDDFKTSSYCPLSFWPPSSYIYPTVSCSATHQEVPK 180

Query: 181 SISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAVEE--AMAMAEIRSMSMDVKALEF 240
           SISLSEEEG LMASD +FW NND     +KDMQ+GAVEE  A AMAE+R M+MDVKALE 
Sbjct: 181 SISLSEEEGNLMASD-VFWFNND-----QKDMQEGAVEEARARAMAEVRPMTMDVKALES 240

Query: 241 DVHHSSDNAMEFPEWLSINNDILQQHSNYKCVEEDYLQYPDLSCFDF 279
           D HHS +N MEF +W SIN+D LQQHSNY CVEEDYLQ PDLS + F
Sbjct: 241 DGHHSCENPMEFSDWPSINDDFLQQHSNYHCVEEDYLQDPDLSWYQF 277

BLAST of HG10014307 vs. NCBI nr
Match: XP_022940715.1 (uncharacterized protein LOC111446225 [Cucurbita moschata])

HSP 1 Score: 352.1 bits (902), Expect = 4.7e-93
Identity = 210/329 (63.83%), Positives = 234/329 (71.12%), Query Frame = 0

Query: 1   MNSTDQLCNFEAVAKISQPKPD--GEPKKQVRRRRQSRRLYKETPLDMAEARREIVTALK 60
           MNSTDQLCNFEA  KI QP+P   GE KKQVRRRRQSRRLYK+ PL+MAEARREIVTALK
Sbjct: 1   MNSTDQLCNFEA-TKIPQPQPQPHGERKKQVRRRRQSRRLYKQMPLNMAEARREIVTALK 60

Query: 61  LHRASTKEAREQQQKQDQEIKQSVPLFP-QLCPCFEAEGRRKSRRNPRIYPGYSYDCSFY 120
           LHRASTKEA+EQQQKQDQ+IK S+P++P Q  PCFE E R KSRRNPRIYP    DCSFY
Sbjct: 61  LHRASTKEAKEQQQKQDQQIKHSLPMYPHQFTPCFEPERRMKSRRNPRIYP----DCSFY 120

Query: 121 LENGSGFVAPPPVAQNLNAEIPIQI------FDDD--------------FKTSFWPPSSY 180
            ENGS F+APPPVAQ+L+ +IPIQ       F+D               +  SF PPSSY
Sbjct: 121 FENGSDFIAPPPVAQSLHLDIPIQTLGLNPNFEDTSSVVCSNNNNNHSFYSLSFLPPSSY 180

Query: 181 ICLTVS-CPDTHQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV-----E 240
           IC T      THQEVPKSISLSEEEG+LMASD LFWSNN PT ESEK++  GAV     E
Sbjct: 181 ICPTFDYAATTHQEVPKSISLSEEEGRLMASD-LFWSNNFPTGESEKEI-HGAVEEEEEE 240

Query: 241 EAMAMAEIRSMSMDVKALEFD--VH--------HSSDNAMEFPEWLSINNDILQQHSNYK 291
           E   +AEIR  S+D K LE D   H          S+ AMEFP+WLSIN+D LQ  SNY+
Sbjct: 241 EEAMVAEIR--SVDEKPLEIDGQTHCTFENVPTGQSEEAMEFPDWLSINDDFLQPRSNYQ 300

BLAST of HG10014307 vs. NCBI nr
Match: KAG6608324.1 (hypothetical protein SDJN03_01666, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 351.7 bits (901), Expect = 6.2e-93
Identity = 206/325 (63.38%), Positives = 230/325 (70.77%), Query Frame = 0

Query: 1   MNSTDQLCNFEAVAKISQPKPD--GEPKKQVRRRRQSRRLYKETPLDMAEARREIVTALK 60
           MNSTDQLCNFEA  KI QP+P   GE KKQVRRRRQSRRLYK+ PL+MAEARREIVTALK
Sbjct: 1   MNSTDQLCNFEA-TKIPQPQPQPHGERKKQVRRRRQSRRLYKQMPLNMAEARREIVTALK 60

Query: 61  LHRASTKEAREQQQKQDQEIKQSVPLFP-QLCPCFEAEGRRKSRRNPRIYPGYSYDCSFY 120
           LHRASTKEA+EQQQKQDQ+IK S+P++P Q  PCFE E R KSRRNPRIYP    DCSFY
Sbjct: 61  LHRASTKEAKEQQQKQDQQIKHSLPMYPHQFTPCFEPERRMKSRRNPRIYP----DCSFY 120

Query: 121 LENGSGFVAPPPVAQNLNAEIPIQI------FDDD------------FKTSFWPPSSYIC 180
            ENGS F+APPPVAQ+L+ +IPIQ       F+D             +  SF PPSSYIC
Sbjct: 121 FENGSHFIAPPPVAQSLHLDIPIQTLGLNPNFEDTSSVVCNNNNHSFYSLSFLPPSSYIC 180

Query: 181 LTVS-CPDTHQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV-----EEA 240
            T      THQEVPKSISLSEEEG+LMASD LFWSNN PT ESEK++  GAV     EE 
Sbjct: 181 PTFDYAATTHQEVPKSISLSEEEGRLMASD-LFWSNNFPTGESEKEI-HGAVEEEEEEEE 240

Query: 241 MAMAEIRSMSMDVKALEFDVH--------HSSDNAMEFPEWLSINNDILQQHSNYKCVEE 291
             +AEIRSM      ++   H          S+ AMEFP+WLSIN+D LQ  SNY    E
Sbjct: 241 AMVAEIRSMEEKPLEIDGQTHCTFENVPTGQSEEAMEFPDWLSINDDFLQPRSNYPFSNE 300

BLAST of HG10014307 vs. ExPASy TrEMBL
Match: A0A5A7V8V7 (Putative WRKY transcription factor protein 1 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold134G001570 PE=4 SV=1)

HSP 1 Score: 421.4 bits (1082), Expect = 3.1e-114
Identity = 237/309 (76.70%), Positives = 253/309 (81.88%), Query Frame = 0

Query: 1   MNSTDQLCNFEAVAKISQPKPDGEPKKQVRRRRQS-RRLYKETPLDMAEARREIVTALKL 60
           MNS DQL NFEA A+IS  KPD EPKKQVRRRR S RRLYKE PLDMAEARREIVTALKL
Sbjct: 1   MNSPDQL-NFEAAAQIS--KPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKL 60

Query: 61  HRA-STKE-AREQQQKQDQEIKQSVPLFPQLCPCFEAEGRRKSRRNPRIYPGYSYDCSFY 120
           HRA STKE AREQQQKQDQE KQS PLFP+L  CFEAEGRRKS+RNPRIYP  SYDCSFY
Sbjct: 61  HRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFY 120

Query: 121 LENGSGFVAPPPVAQNLNAEIPIQIFDDDFKT----------SFW-PPSSYICLTVSCPD 180
           LENGSGFVAPPP  +NLN EIPIQ FDDDFKT          SFW PPSSYIC TVSCPD
Sbjct: 121 LENGSGFVAPPP--ENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTVSCPD 180

Query: 181 T-HQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV--EEAMAMA--EIRS 240
           T HQE PKS+SL EEEG LMASD +FW NNDPT  +EKDMQQ AV  EEAMAMA  +++S
Sbjct: 181 THHQEFPKSVSLREEEGNLMASD-VFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKS 240

Query: 241 MSMDVKALEFDVHHSSDNAMEFPEWLSINNDILQQHSNYKCVEEDYLQYPDLSCFDFGKI 291
           MSMDVKALE D HHSSDNAM FP+W+SIN+D LQQ+SNY CVEED LQ PDLSCFD GKI
Sbjct: 241 MSMDVKALEIDCHHSSDNAMAFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKI 300

BLAST of HG10014307 vs. ExPASy TrEMBL
Match: A0A1S4DZY0 (uncharacterized protein LOC103493717 OS=Cucumis melo OX=3656 GN=LOC103493717 PE=4 SV=1)

HSP 1 Score: 421.4 bits (1082), Expect = 3.1e-114
Identity = 237/309 (76.70%), Positives = 253/309 (81.88%), Query Frame = 0

Query: 1   MNSTDQLCNFEAVAKISQPKPDGEPKKQVRRRRQS-RRLYKETPLDMAEARREIVTALKL 60
           MNS DQL NFEA A+IS  KPD EPKKQVRRRR S RRLYKE PLDMAEARREIVTALKL
Sbjct: 1   MNSPDQL-NFEAAAQIS--KPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKL 60

Query: 61  HRA-STKE-AREQQQKQDQEIKQSVPLFPQLCPCFEAEGRRKSRRNPRIYPGYSYDCSFY 120
           HRA STKE AREQQQKQDQE KQS PLFP+L  CFEAEGRRKS+RNPRIYP  SYDCSFY
Sbjct: 61  HRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFY 120

Query: 121 LENGSGFVAPPPVAQNLNAEIPIQIFDDDFKT----------SFW-PPSSYICLTVSCPD 180
           LENGSGFVAPPP  +NLN EIPIQ FDDDFKT          SFW PPSSYIC TVSCPD
Sbjct: 121 LENGSGFVAPPP--ENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTVSCPD 180

Query: 181 T-HQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV--EEAMAMA--EIRS 240
           T HQE PKS+SL EEEG LMASD +FW NNDPT  +EKDMQQ AV  EEAMAMA  +++S
Sbjct: 181 THHQEFPKSVSLREEEGNLMASD-VFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKS 240

Query: 241 MSMDVKALEFDVHHSSDNAMEFPEWLSINNDILQQHSNYKCVEEDYLQYPDLSCFDFGKI 291
           MSMDVKALE D HHSSDNAM FP+W+SIN+D LQQ+SNY CVEED LQ PDLSCFD GKI
Sbjct: 241 MSMDVKALEIDCHHSSDNAMAFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKI 300

BLAST of HG10014307 vs. ExPASy TrEMBL
Match: A0A6J1FRD8 (uncharacterized protein LOC111446225 OS=Cucurbita moschata OX=3662 GN=LOC111446225 PE=4 SV=1)

HSP 1 Score: 352.1 bits (902), Expect = 2.3e-93
Identity = 210/329 (63.83%), Positives = 234/329 (71.12%), Query Frame = 0

Query: 1   MNSTDQLCNFEAVAKISQPKPD--GEPKKQVRRRRQSRRLYKETPLDMAEARREIVTALK 60
           MNSTDQLCNFEA  KI QP+P   GE KKQVRRRRQSRRLYK+ PL+MAEARREIVTALK
Sbjct: 1   MNSTDQLCNFEA-TKIPQPQPQPHGERKKQVRRRRQSRRLYKQMPLNMAEARREIVTALK 60

Query: 61  LHRASTKEAREQQQKQDQEIKQSVPLFP-QLCPCFEAEGRRKSRRNPRIYPGYSYDCSFY 120
           LHRASTKEA+EQQQKQDQ+IK S+P++P Q  PCFE E R KSRRNPRIYP    DCSFY
Sbjct: 61  LHRASTKEAKEQQQKQDQQIKHSLPMYPHQFTPCFEPERRMKSRRNPRIYP----DCSFY 120

Query: 121 LENGSGFVAPPPVAQNLNAEIPIQI------FDDD--------------FKTSFWPPSSY 180
            ENGS F+APPPVAQ+L+ +IPIQ       F+D               +  SF PPSSY
Sbjct: 121 FENGSDFIAPPPVAQSLHLDIPIQTLGLNPNFEDTSSVVCSNNNNNHSFYSLSFLPPSSY 180

Query: 181 ICLTVS-CPDTHQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV-----E 240
           IC T      THQEVPKSISLSEEEG+LMASD LFWSNN PT ESEK++  GAV     E
Sbjct: 181 ICPTFDYAATTHQEVPKSISLSEEEGRLMASD-LFWSNNFPTGESEKEI-HGAVEEEEEE 240

Query: 241 EAMAMAEIRSMSMDVKALEFD--VH--------HSSDNAMEFPEWLSINNDILQQHSNYK 291
           E   +AEIR  S+D K LE D   H          S+ AMEFP+WLSIN+D LQ  SNY+
Sbjct: 241 EEAMVAEIR--SVDEKPLEIDGQTHCTFENVPTGQSEEAMEFPDWLSINDDFLQPRSNYQ 300

BLAST of HG10014307 vs. ExPASy TrEMBL
Match: A0A0A0L091 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G649650 PE=4 SV=1)

HSP 1 Score: 349.0 bits (894), Expect = 1.9e-92
Identity = 192/249 (77.11%), Positives = 203/249 (81.53%), Query Frame = 0

Query: 46  MAEARREIVTALKLHRA-STKE-AREQQQKQDQEIKQSVPLFPQLCPCFEAEGRRKSRRN 105
           MAEARREIVTALKLHRA STKE AREQQQKQDQE KQS PLFPQ   CFEAEGRRKSRRN
Sbjct: 1   MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKSRRN 60

Query: 106 PRIYPGYSYDCSFYLENGSGFVAPPPVAQNLNAEIPIQIFDDDFKT----------SFW- 165
           PRIYP  SYDCSFYLENGSG VAPPP  +NLN EIPIQ FDDDFKT          SFW 
Sbjct: 61  PRIYPDCSYDCSFYLENGSGLVAPPP--ENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWP 120

Query: 166 PPSSYICLTVSCPDTHQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV-- 225
           PPSSYIC T+SCPDTHQE+PKS+SL EEEG LMASD +FW NNDPT  SEKDMQQ  V  
Sbjct: 121 PPSSYICPTLSCPDTHQELPKSVSLREEEGNLMASD-VFWFNNDPTGVSEKDMQQEGVLE 180

Query: 226 EEAM-AMAEIRSMSMDVKALEFDVHHSSDNAMEFPEWLSINNDILQQHSNYKCVEEDYLQ 279
           EEAM AMA+I+SMSMDVKALE D  HSSDNAMEFP+WLSIN+D L Q+SNY CVEEDYLQ
Sbjct: 181 EEAMHAMADIKSMSMDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQ 240

BLAST of HG10014307 vs. ExPASy TrEMBL
Match: A0A6J1IXC1 (uncharacterized protein LOC111480786 OS=Cucurbita maxima OX=3661 GN=LOC111480786 PE=4 SV=1)

HSP 1 Score: 343.6 bits (880), Expect = 8.1e-91
Identity = 206/332 (62.05%), Positives = 233/332 (70.18%), Query Frame = 0

Query: 1   MNSTDQLCNFEAVAKISQPKPD------GEPKKQVRRRRQSRRLYKETPLDMAEARREIV 60
           MNSTDQLCNFEA  KI QP+P       GE KKQVRRRR++RRLYK+ PL+MAEARREIV
Sbjct: 1   MNSTDQLCNFEA-TKIPQPQPQPQPQPHGERKKQVRRRRETRRLYKQMPLNMAEARREIV 60

Query: 61  TALKLHRASTKEAREQQQKQDQEIKQSVPLFP-QLCPCFEAEGRRKSRRNPRIYPGYSYD 120
           TALKLHRASTKEA+EQQQKQDQ+IK S+P++P Q  PCFE E R KSRRNPRIYP    D
Sbjct: 61  TALKLHRASTKEAKEQQQKQDQQIKHSLPVYPHQFTPCFEPERRMKSRRNPRIYP----D 120

Query: 121 CSFYLENGSGFVAPPPVAQNLNAEIPIQI------FDDD-------------FKTSFWPP 180
           CSFY +NGS F+APPPVAQ+L+ +IPIQ       F+D              +  SF  P
Sbjct: 121 CSFYFQNGSDFIAPPPVAQSLHLDIPIQTLGLNPNFEDTSSVVCNNNNNHSFYSLSFLHP 180

Query: 181 SSYICLTVS-CPDTHQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV--- 240
           SSYIC T      TH+EVPKSISLSEEEG+LMASD LFWSNN PT ESEK++  GAV   
Sbjct: 181 SSYICPTFDYAATTHREVPKSISLSEEEGRLMASD-LFWSNNFPTGESEKEI-HGAVEEE 240

Query: 241 --EEAMAMAEIRSMSMDVKALEFD--VH--------HSSDNAMEFPEWLSINNDILQQHS 291
             EE   +AEIR  SMD K LE D   H          S+ AMEFP+WLSIN+D LQ  S
Sbjct: 241 EEEEEAMVAEIR--SMDEKPLEIDGQTHCTFENVPTGQSEEAMEFPDWLSINDDFLQPRS 300

BLAST of HG10014307 vs. TAIR 10
Match: AT5G21280.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 77.0 bits (188), Expect = 2.7e-14
Identity = 81/272 (29.78%), Positives = 117/272 (43.01%), Query Frame = 0

Query: 26  KKQVRRRRQSRRLYKETPLDMAEARREIVTALKLHRASTKEAREQQQKQDQEIKQSVPLF 85
           KKQVRRR  + R Y+E  L+MAEARREIVTALK HRAS ++A      Q     Q + LF
Sbjct: 53  KKQVRRRLHTSRPYQERLLNMAEARREIVTALKQHRASMRQATRIPPPQPPPPPQPLNLF 112

Query: 86  PQLCPCFEAEGRRKSRRNPR---IYPGYSYDCSFYLENGSGFVAPPPVAQNLNAEIPIQI 145
               P         S  NP    + P      +   ++ + F+       + ++      
Sbjct: 113 SPPPP--PPPPDPFSWTNPSLNFLLPNQPLGLNLNFQDFNDFIQTSSTTSSSSSSSTSSS 172

Query: 146 FDDDFKTS---FWPPSSYICLTVSCPDTHQEVPKSISLSEEEGKLMASDFLFWSNNDPTR 205
               F T+   +  PS     T +  D+  ++P S   S  E  ++ S   +WS      
Sbjct: 173 SSSIFPTNPHIYSSPSPPPTFTTATSDSAPQLPSS---SNGENNVVTS--AWWS------ 232

Query: 206 ESEKDMQQGAVEEAMAMAEIRSMSMDVKALEFDVHHSSDNAMEFPEWLSINNDILQQHSN 265
               ++    VE      EI+  + +V  +E DV     + MEFP WL+   + L    N
Sbjct: 233 ----ELMLKTVE-----PEIKPETEEVIVVEDDVFPKFSDVMEFPSWLNQTEEELFHPYN 292

Query: 266 YKCVEEDYLQYPDLSCFDFGKIEDVDG-DWLA 291
                      P LSC + G+IE +DG DWLA
Sbjct: 293 LTDHYSSSPHNPPLSCMEIGEIEGMDGDDWLA 302
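
Each HSP summary line above reports identical and similar residues as a count over the alignment length, followed by the percentage. A minimal sketch of how those percentages follow from the counts is given below; the function name hsp_percentages is hypothetical, and the regular expression only assumes the "label = matches/length" pattern seen in the lines above.

```python
import re

def hsp_percentages(summary_line):
    """Return (label, matches, length, percent) for each 'label = n/m' field."""
    results = []
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", summary_line):
        matches, length = int(matches), int(length)
        results.append((label, matches, length, round(100 * matches / length, 2)))
    return results

line = "Identity = 239/307 (77.85%), Positives = 252/307 (82.08%), Query Frame = 0"
for label, matches, length, percent in hsp_percentages(line):
    print(f"{label}: {matches}/{length} = {percent}%")
```

For the first Cucumis sativus hit this gives 77.85% identity and 82.08% positives, matching the reported values; the Arabidopsis hit (81/272 and 117/272) works out to 29.78% and 43.01% in the same way.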

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity  Description
KAE8649926.1    2.6e-115  77.85     hypothetical protein Csa_011922 [Cucumis sativus]
XP_016901295.1  6.3e-114  76.70     PREDICTED: uncharacterized protein LOC103493717 [Cucumis melo] >KAA0064553.1 put...
XP_038897806.1  3.3e-110  76.66     uncharacterized protein LOC120085720 [Benincasa hispida]
XP_022940715.1  4.7e-93   63.83     uncharacterized protein LOC111446225 [Cucurbita moschata]
KAG6608324.1    6.2e-93   63.38     hypothetical protein SDJN03_01666, partial [Cucurbita argyrosperma subsp. sorori...

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A5A7V8V7      3.1e-114  76.70     Putative WRKY transcription factor protein 1 isoform X2 OS=Cucumis melo var. mak...
A0A1S4DZY0      3.1e-114  76.70     uncharacterized protein LOC103493717 OS=Cucumis melo OX=3656 GN=LOC103493717 PE=...
A0A6J1FRD8      2.3e-93   63.83     uncharacterized protein LOC111446225 OS=Cucurbita moschata OX=3662 GN=LOC1114462...
A0A0A0L091      1.9e-92   77.11     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G649650 PE=4 SV=1
A0A6J1IXC1      8.1e-91   62.05     uncharacterized protein LOC111480786 OS=Cucurbita maxima OX=3661 GN=LOC111480786...

TAIR 10
Match Name      E-value   Identity  Description
AT5G21280.1     2.7e-14   29.78     hydroxyproline-rich glycoprotein family protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term    Source Description             Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction            coord: 1..44
None      No IPR available  PANTHER      PTHR37256:SF1  E1A-BINDING PROTEIN P400-LIKE  coord: 17..148
None      No IPR available  PANTHER      PTHR37256      E1A-BINDING PROTEIN P400-LIKE  coord: 17..148, coord: 229..290
None      No IPR available  PANTHER      PTHR37256:SF1  E1A-BINDING PROTEIN P400-LIKE  coord: 229..290

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10014307.1  HG10014307.1  mRNA