HG10003846 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10003846
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Transmembrane protein
Location: Chr08: 10248488 .. 10249399 (+)
RNA-Seq Expression: HG10003846
Synteny: HG10003846
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGTTCCTGTTCTGAACAGGATTACTGAATTAGGAGCAGATCTGGGGTCGCTTCCAAATCCCAATTTTCTCTCTCGAATTTTCACTTCCTTTTCCCCATCCCAACATTTCTGGAAATGGGGTGCTCTGATTATTGCTTTGTTGGCTACATTTACCGGAATAATCAATCGGGTCAAGATTTTGATCATCGTCATTCGCCGGAGAACTCGAACCACTTCGATTTCCGAACCTCTCTACCGATCTCTCCACGGCGGAGAGACCGGTGGTTTAGTTTCAGAGAATCTCAAATCTCCCCTGTTTTCAAGCTCGGAATCGGAGGATGAGAATGAGGGGGACCGGGAACCGGACGACGGCTTGGATTTCCGGGTTAAAGGCTCAGGTCGGTTTTCTGGTGAATTTGACGGTCGGTGTTGTTCTCGTCTCCGGCGGCGGCACTGCGACGGAGATTTGTTTTCGTGGCCGTGTTTTGGGTTGGACAGGAGCGTAGTGAGGCAATGGGGTGATGTGAAATTGAAATGCGAGTTTGAGGGTTTGAGTGGGAGTGTGATTTCGTTGTACGATGAGAATAAAGAGGCGGAGATCTGCTCCATTTTCAGCGGTGGAGCTCCGCTGCAAGCGGCGGTGTTATCGCCGAGGAGAATGGTGGTGGCCGCCAGTGAGGGTGTTTCGGCGAATGTGTCGCTGAAGCTTTGGGACACGCGTGGCCGGAGCCGGACGCCAGTGGTGGCGGCGGAGTGGGATTCGCCGTCGGGAAAGATTGTCGACGTCTATTACGAGGATGTAGAAAATGTCTATCTCAGAGATAATGGAGCCGCCGGAATAATGGTCGGCGATGTTAGAAAAGTTAGTTCGGCGACAGAGAATTCCCTGGCGGGAGGCGGTGACGGCTTGTGGGAATCGGGACACTAA

mRNA sequence

ATGGAAGTTCCTGTTCTGAACAGGATTACTGAATTAGGAGCAGATCTGGGGTCGCTTCCAAATCCCAATTTTCTCTCTCGAATTTTCACTTCCTTTTCCCCATCCCAACATTTCTGGAAATGGGGTGCTCTGATTATTGCTTTGTTGGCTACATTTACCGGAATAATCAATCGGGTCAAGATTTTGATCATCGTCATTCGCCGGAGAACTCGAACCACTTCGATTTCCGAACCTCTCTACCGATCTCTCCACGGCGGAGAGACCGGTGGTTTAGTTTCAGAGAATCTCAAATCTCCCCTGTTTTCAAGCTCGGAATCGGAGGATGAGAATGAGGGGGACCGGGAACCGGACGACGGCTTGGATTTCCGGGTTAAAGGCTCAGGTCGGTTTTCTGGTGAATTTGACGGTCGGTGTTGTTCTCGTCTCCGGCGGCGGCACTGCGACGGAGATTTGTTTTCGTGGCCGTGTTTTGGGTTGGACAGGAGCGTAGTGAGGCAATGGGGTGATGTGAAATTGAAATGCGAGTTTGAGGGTTTGAGTGGGAGTGTGATTTCGTTGTACGATGAGAATAAAGAGGCGGAGATCTGCTCCATTTTCAGCGGTGGAGCTCCGCTGCAAGCGGCGGTGTTATCGCCGAGGAGAATGGTGGTGGCCGCCAGTGAGGGTGTTTCGGCGAATGTGTCGCTGAAGCTTTGGGACACGCGTGGCCGGAGCCGGACGCCAGTGGTGGCGGCGGAGTGGGATTCGCCGTCGGGAAAGATTGTCGACGTCTATTACGAGGATGTAGAAAATGTCTATCTCAGAGATAATGGAGCCGCCGGAATAATGGTCGGCGATGTTAGAAAAGTTAGTTCGGCGACAGAGAATTCCCTGGCGGGAGGCGGTGACGGCTTGTGGGAATCGGGACACTAA

Coding sequence (CDS)

ATGGAAGTTCCTGTTCTGAACAGGATTACTGAATTAGGAGCAGATCTGGGGTCGCTTCCAAATCCCAATTTTCTCTCTCGAATTTTCACTTCCTTTTCCCCATCCCAACATTTCTGGAAATGGGGTGCTCTGATTATTGCTTTGTTGGCTACATTTACCGGAATAATCAATCGGGTCAAGATTTTGATCATCGTCATTCGCCGGAGAACTCGAACCACTTCGATTTCCGAACCTCTCTACCGATCTCTCCACGGCGGAGAGACCGGTGGTTTAGTTTCAGAGAATCTCAAATCTCCCCTGTTTTCAAGCTCGGAATCGGAGGATGAGAATGAGGGGGACCGGGAACCGGACGACGGCTTGGATTTCCGGGTTAAAGGCTCAGGTCGGTTTTCTGGTGAATTTGACGGTCGGTGTTGTTCTCGTCTCCGGCGGCGGCACTGCGACGGAGATTTGTTTTCGTGGCCGTGTTTTGGGTTGGACAGGAGCGTAGTGAGGCAATGGGGTGATGTGAAATTGAAATGCGAGTTTGAGGGTTTGAGTGGGAGTGTGATTTCGTTGTACGATGAGAATAAAGAGGCGGAGATCTGCTCCATTTTCAGCGGTGGAGCTCCGCTGCAAGCGGCGGTGTTATCGCCGAGGAGAATGGTGGTGGCCGCCAGTGAGGGTGTTTCGGCGAATGTGTCGCTGAAGCTTTGGGACACGCGTGGCCGGAGCCGGACGCCAGTGGTGGCGGCGGAGTGGGATTCGCCGTCGGGAAAGATTGTCGACGTCTATTACGAGGATGTAGAAAATGTCTATCTCAGAGATAATGGAGCCGCCGGAATAATGGTCGGCGATGTTAGAAAAGTTAGTTCGGCGACAGAGAATTCCCTGGCGGGAGGCGGTGACGGCTTGTGGGAATCGGGACACTAA

Protein sequence

MEVPVLNRITELGADLGSLPNPNFLSRIFTSFSPSQHFWKWGALIIALLATFTGIINRVKILIIVIRRRTRTTSISEPLYRSLHGGETGGLVSENLKSPLFSSSESEDENEGDREPDDGLDFRVKGSGRFSGEFDGRCCSRLRRRHCDGDLFSWPCFGLDRSVVRQWGDVKLKCEFEGLSGSVISLYDENKEAEICSIFSGGAPLQAAVLSPRRMVVAASEGVSANVSLKLWDTRGRSRTPVVAAEWDSPSGKIVDVYYEDVENVYLRDNGAAGIMVGDVRKVSSATENSLAGGGDGLWESGH
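
As a quick consistency check, the coordinates in the Location field can be compared with the protein length. The sketch below assumes a single-exon gene (here the gene, mRNA and CDS sequences are identical) and 1-based inclusive coordinates. The function names are illustrative and not part of any database API.

```python
import re

def parse_location(location):
    """Split a location string such as 'Chr08: 10248488 .. 10249399 (+)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, location)
    if m is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length in nucleotides of a 1-based, inclusive interval."""
    return end - start + 1

def protein_length_from_cds(cds_length):
    """Residue count for a CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon

chrom, start, end, strand = parse_location("Chr08: 10248488 .. 10249399 (+)")
cds_length = span_length(start, end)
print(chrom, strand, cds_length, protein_length_from_cds(cds_length))
```

For this record the span is 912 nt, which corresponds to 303 codons plus a stop codon, in line with the 303-residue protein sequence above.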
Homology
BLAST of HG10003846 vs. NCBI nr
Match: XP_038885552.1 (uncharacterized protein LOC120075889 [Benincasa hispida])

HSP 1 Score: 550.8 bits (1418), Expect = 7.2e-153
Identity = 278/307 (90.55%), Positives = 284/307 (92.51%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLGSLPNPNFLSRIFTSFSPSQHFWKWGALIIALLATFTGIINRVK 60
           MEVPVLNRITE+GADL SLPNP FLSRIFTSFSPSQHFWKWGALIIALLATFTGIINRVK
Sbjct: 1   MEVPVLNRITEIGADLSSLPNPKFLSRIFTSFSPSQHFWKWGALIIALLATFTGIINRVK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSLHGGETGGLVSENLKSPLFSSSESEDENEGDREPDDGL 120
           ILIIVIRRRTRTTSISEPLYRSLHGGETGGLVSENLKSPL SSSESEDENEGDR+ DD  
Sbjct: 61  ILIIVIRRRTRTTSISEPLYRSLHGGETGGLVSENLKSPLLSSSESEDENEGDRKQDDSS 120

Query: 121 DFRVKGSGRFSGEFDGRCCSRLRRRHC----DGDLFSWPCFGLDRSVVRQWGDVKLKCEF 180
           DFRVKGSGRFSGEFDGRCCSRLRRRHC    DGDLFSWPCFG D SVVRQWGDVKLKCEF
Sbjct: 121 DFRVKGSGRFSGEFDGRCCSRLRRRHCDGGGDGDLFSWPCFGSDGSVVRQWGDVKLKCEF 180

Query: 181 EGLSGSVISLYDENKEAEICSIFSGGAPLQAAVLSPRRMVVAASEGVSANVSLKLWDTRG 240
           E LSGSVISLYDEN+E EICSIF+GG PLQAA LSPR+MVVAASEGVSANVSLKLWD RG
Sbjct: 181 EELSGSVISLYDENEETEICSIFNGGTPLQAAALSPRKMVVAASEGVSANVSLKLWDMRG 240

Query: 241 RSRTPVVAAEWDSPSGKIVDVYYEDVENVYLRDNGAAGIMVGDVRKVSSATENSLAGGGD 300
           RSR PVVAAEWDSPSGKIVDVYYEDVENVYLRD GAAGIMVGDVRKVSSA+ NS  G GD
Sbjct: 241 RSRRPVVAAEWDSPSGKIVDVYYEDVENVYLRDKGAAGIMVGDVRKVSSASGNSPVGSGD 300

Query: 301 GLWESGH 304
           GLWESGH
Sbjct: 301 GLWESGH 307
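
The percentages on the HSP statistics line are the matched counts divided by the alignment length. A minimal sketch, assuming the line layout shown above; `hsp_percentages` is an illustrative helper, not part of BLAST.

```python
import re

def hsp_percentages(stats_line):
    """Recompute the percentage for every 'Label = matched/total' pair."""
    result = {}
    for label, matched, total in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", stats_line):
        result[label] = round(100 * int(matched) / int(total), 2)
    return result

line = "Identity = 278/307 (90.55%), Positives = 284/307 (92.51%), Query Frame = 0"
print(hsp_percentages(line))
```

The alignment length (307 here) counts alignment columns, including gap positions, so it can exceed the length of the query protein.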

BLAST of HG10003846 vs. NCBI nr
Match: XP_011656511.1 (uncharacterized protein LOC105435751 [Cucumis sativus] >KGN45941.1 hypothetical protein Csa_004792 [Cucumis sativus])

HSP 1 Score: 474.2 bits (1219), Expect = 8.6e-130
Identity = 242/306 (79.08%), Positives = 263/306 (85.95%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLGSLPNPNFLSRIFTSFSPSQHFWKWGALIIALLATFTGIINRVK 60
           MEVPVLNRITELG +LGSLPNP+FLSRIFTS  PSQHFWKW ALIIA LATF GIINRVK
Sbjct: 1   MEVPVLNRITELGPNLGSLPNPSFLSRIFTSVFPSQHFWKWAALIIAFLATFPGIINRVK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSLHGGETGGLVSENLKSPLFSSSESEDENEGDREPDDGL 120
           + IIV RRRT+TTSISEPLYRSLH G++ GLVS+NLKSPL SSSESEDENE DRE ++  
Sbjct: 61  VFIIVCRRRTKTTSISEPLYRSLHFGDSRGLVSKNLKSPLLSSSESEDENERDREHNNDS 120

Query: 121 DFRVKGSGRFSGEFDGRCCSRLRRRHC----DGDLFSWPCFGLDRSVVRQWGDVKLKCEF 180
           DFRVKGS  FSGEFDG C SR RRR C    +GDLFSWPCFGL+RSVVRQWGDVKLKCEF
Sbjct: 121 DFRVKGSSLFSGEFDGGCRSRHRRRPCNGGGNGDLFSWPCFGLERSVVRQWGDVKLKCEF 180

Query: 181 EGLSGSVISLYDENKEAEICSIFSGGAPLQAAVLSPRRMVVAASEGVSANVSLKLWDTRG 240
           E LSGS+ISLYD N+EAEICSI SGG  L+AA +SPRRMVVAA+EGVSANVSLKLWDTRG
Sbjct: 181 EELSGSMISLYDVNEEAEICSILSGGGSLKAAAVSPRRMVVAANEGVSANVSLKLWDTRG 240

Query: 241 RSRTPVVAAEWDSPSGKIVDVYYEDVENVYLRDNGAAGIMVGDVRKVSSATENSLAGGGD 300
           RSRTPVV  EWDSPSG IVDVYYEDV N+Y+RDN AAGIM+GDVR+ SS  E   AGGG+
Sbjct: 241 RSRTPVVGMEWDSPSGNIVDVYYEDVGNLYVRDNEAAGIMIGDVRRASSGWEKLTAGGGE 300

Query: 301 GLWESG 303
           GLWE G
Sbjct: 301 GLWEVG 306

BLAST of HG10003846 vs. NCBI nr
Match: XP_008445638.2 (PREDICTED: uncharacterized protein LOC103488596 [Cucumis melo] >KAA0036127.1 uncharacterized protein E6C27_scaffold338G00340 [Cucumis melo var. makuwa] >TYK18508.1 uncharacterized protein E5676_scaffold5066G00090 [Cucumis melo var. makuwa])

HSP 1 Score: 473.0 bits (1216), Expect = 1.9e-129
Identity = 244/307 (79.48%), Positives = 261/307 (85.02%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLGSLPNPNFLSRIFTSFSPSQHFWKWGALIIALLATFTGIINRVK 60
           MEVPVLNRITELGA LGSLPNPNFLSRIFTSF PSQHFWKWGALIIALLATFTGIINRVK
Sbjct: 1   MEVPVLNRITELGAHLGSLPNPNFLSRIFTSFFPSQHFWKWGALIIALLATFTGIINRVK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSLHGGETGGLVSENLKSPLFSSSESEDENEGDREPDDGL 120
           + II+ RRRT+TTSISEPLYRSLH  ++GGLVS+NLKSP  SSSESEDENE  RE ++  
Sbjct: 61  VFIIICRRRTKTTSISEPLYRSLHCRDSGGLVSKNLKSPPLSSSESEDENERGRERNNDS 120

Query: 121 DFRVKGSGRFSGEFDGRCCSRLRRRHC----DGDLFSWPCFGLDRSVVRQWGDVKLKCEF 180
           +FRVK S RFS E DG C SRLRRRHC    +GDLF WPCFGLDRSVVRQWGDV    EF
Sbjct: 121 NFRVKVSSRFSCELDGGCHSRLRRRHCNGGSNGDLFPWPCFGLDRSVVRQWGDVISNSEF 180

Query: 181 EGLSGSVISLYDENKEAEICSIFSGGAPLQAAVLSPRRMVVAASEGVSANVSLKLWDTRG 240
           E LSGS+ISLYD+N+EAEICSIF+ G  L+A  +SPRRMVVAASEGVSANVSLKLWDTRG
Sbjct: 181 EELSGSMISLYDDNEEAEICSIFNEGGSLKAVAVSPRRMVVAASEGVSANVSLKLWDTRG 240

Query: 241 RSRTPVVAAEWDSPSGKIVDVYYEDVENVYLRDNGAAGIMVGDVRKVSSATENSLAGGGD 300
           RSRTPVVA EWDSPS  IVDVYYEDVENVYLRDNGAAGIMVGDVRK SS +E   AG G 
Sbjct: 241 RSRTPVVAVEWDSPSRNIVDVYYEDVENVYLRDNGAAGIMVGDVRKASSGSEKLTAGDGG 300

Query: 301 GLWESGH 304
           GLW  GH
Sbjct: 301 GLWNLGH 307

BLAST of HG10003846 vs. NCBI nr
Match: XP_022139617.1 (uncharacterized protein LOC111010475 [Momordica charantia])

HSP 1 Score: 417.2 bits (1071), Expect = 1.2e-112
Identity = 218/302 (72.19%), Positives = 246/302 (81.46%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLGSLPNPNFLSRIFTSFSPSQHFWKWGALIIALLATFTGIINRVK 60
           MEVPVLNRITELGADL SLPN NFLSRI TSFSPSQHFWKWGA++IALLATF+G+INRVK
Sbjct: 1   MEVPVLNRITELGADLSSLPNANFLSRILTSFSPSQHFWKWGAVVIALLATFSGLINRVK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSLHGGETGGLVSENLKSPLFSSSESEDENEGDREPDDGL 120
           ILIIVIRRR RTT I EPL RSLHGGE GG VSENL SP F SSESEDENE    P+DG 
Sbjct: 61  ILIIVIRRR-RTTPIYEPLSRSLHGGENGGFVSENLGSPPFFSSESEDENELS-SPEDGS 120

Query: 121 DFRVKGSGRFSGEFD-GRCCSRLRRRHCDGDLFSWPCFGLDRSVVRQWGDVKLKCEFEGL 180
           DF VKGSGR S ++  GR CS LRRRH  GD  SW CFG +RSVVRQWG+V+LKC+F+ L
Sbjct: 121 DFGVKGSGRSSDDYSCGRRCSGLRRRHYGGDSLSWSCFGSERSVVRQWGEVQLKCKFDEL 180

Query: 181 SGSVISLYDENKEAEICSIFSGGAPLQAAVLSPRRMVVAASEGVSANVSLKLWDTRGRSR 240
           SGSVISLYDEN+E EICSIFSGGAP++AA +SP  MVV+A + V  NVS+KLWDTR RS+
Sbjct: 181 SGSVISLYDENEEKEICSIFSGGAPVRAAAMSPAGMVVSAGQSVFGNVSVKLWDTRSRSQ 240

Query: 241 TPVVAAEWDSPSGKIVDVYYEDVENVYLRDNGAAGIMVGDVRKVSSATENSLAGGGDGLW 300
           TP+VAAEW+SP+ KIVDVYYE+ E VYLR++  A + V DVRKV SA ENS  GG D  W
Sbjct: 241 TPIVAAEWNSPAAKIVDVYYEESEKVYLRNDSDAKLTVADVRKVCSALENSNLGGVDRWW 300

Query: 301 ES 302
            +
Sbjct: 301 HA 300

BLAST of HG10003846 vs. NCBI nr
Match: XP_022971770.1 (uncharacterized protein LOC111470454 [Cucurbita maxima])

HSP 1 Score: 414.8 bits (1065), Expect = 6.2e-112
Identity = 219/307 (71.34%), Positives = 246/307 (80.13%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLGSLPNPNFLSRIFTSFSPSQHFWKWGALIIALLATFTGIINRVK 60
           MEVPVLN IT  G +LGS+P+PNFLSRIFTSFS  Q FWKWGAL IALLATF+GIINR+K
Sbjct: 1   MEVPVLNMITGFGRELGSIPDPNFLSRIFTSFSIFQQFWKWGALFIALLATFSGIINRIK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSLHGGETGGLVSENLKSPLFSSSESEDENEG--DREPDD 120
             +IVI RRTRTT ISEPL  SLHGGE GGL+SEN +SP  SSSESEDENEG  DREPDD
Sbjct: 61  TSVIVIHRRTRTTPISEPLSSSLHGGENGGLISENFRSPPLSSSESEDENEGDQDREPDD 120

Query: 121 GLDFRVKGSGRFSGEFDGRCCSRLRRRH----CDGDLFSWPCFGLDRSVVRQWGDVKLKC 180
            LDF VKGS RFSGEFD R  + LRRRH     +GD FSWPCF  D+SVV+QWGDVKLKC
Sbjct: 121 RLDFLVKGSVRFSGEFDDRRFTGLRRRHGSRGGNGDSFSWPCFVSDKSVVKQWGDVKLKC 180

Query: 181 EFEGLSGSVISLYDENKEAEICSIFSGGAPLQAAVLSPRRMVVAASEGVSANVSLKLWDT 240
           EFE LSGSVI +YDEN+EAEICSIFSGG PL+AA LS  +MVVAA E    N+SLK+WDT
Sbjct: 181 EFEELSGSVILVYDENEEAEICSIFSGGDPLKAAALSAAKMVVAARESGLGNMSLKIWDT 240

Query: 241 RGRSRTPVVAAEWDSPSGKIVDVYYEDVENVYLRDNGAAGIMVGDVRKVSSATENSLAGG 300
           R RS+TPV+AAEW+SP  KIVDVY E++E V + D GAAG+MVGDVRK  SA+E    GG
Sbjct: 241 RDRSQTPVIAAEWNSP--KIVDVYSEEIEKVDIGDKGAAGMMVGDVRKFWSASEKWRKGG 300

Query: 301 GDGLWES 302
           G+G WES
Sbjct: 301 GEGWWES 305

BLAST of HG10003846 vs. ExPASy TrEMBL
Match: A0A0A0KDT7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G028940 PE=4 SV=1)

HSP 1 Score: 474.2 bits (1219), Expect = 4.2e-130
Identity = 242/306 (79.08%), Positives = 263/306 (85.95%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLGSLPNPNFLSRIFTSFSPSQHFWKWGALIIALLATFTGIINRVK 60
           MEVPVLNRITELG +LGSLPNP+FLSRIFTS  PSQHFWKW ALIIA LATF GIINRVK
Sbjct: 1   MEVPVLNRITELGPNLGSLPNPSFLSRIFTSVFPSQHFWKWAALIIAFLATFPGIINRVK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSLHGGETGGLVSENLKSPLFSSSESEDENEGDREPDDGL 120
           + IIV RRRT+TTSISEPLYRSLH G++ GLVS+NLKSPL SSSESEDENE DRE ++  
Sbjct: 61  VFIIVCRRRTKTTSISEPLYRSLHFGDSRGLVSKNLKSPLLSSSESEDENERDREHNNDS 120

Query: 121 DFRVKGSGRFSGEFDGRCCSRLRRRHC----DGDLFSWPCFGLDRSVVRQWGDVKLKCEF 180
           DFRVKGS  FSGEFDG C SR RRR C    +GDLFSWPCFGL+RSVVRQWGDVKLKCEF
Sbjct: 121 DFRVKGSSLFSGEFDGGCRSRHRRRPCNGGGNGDLFSWPCFGLERSVVRQWGDVKLKCEF 180

Query: 181 EGLSGSVISLYDENKEAEICSIFSGGAPLQAAVLSPRRMVVAASEGVSANVSLKLWDTRG 240
           E LSGS+ISLYD N+EAEICSI SGG  L+AA +SPRRMVVAA+EGVSANVSLKLWDTRG
Sbjct: 181 EELSGSMISLYDVNEEAEICSILSGGGSLKAAAVSPRRMVVAANEGVSANVSLKLWDTRG 240

Query: 241 RSRTPVVAAEWDSPSGKIVDVYYEDVENVYLRDNGAAGIMVGDVRKVSSATENSLAGGGD 300
           RSRTPVV  EWDSPSG IVDVYYEDV N+Y+RDN AAGIM+GDVR+ SS  E   AGGG+
Sbjct: 241 RSRTPVVGMEWDSPSGNIVDVYYEDVGNLYVRDNEAAGIMIGDVRRASSGWEKLTAGGGE 300

Query: 301 GLWESG 303
           GLWE G
Sbjct: 301 GLWEVG 306

BLAST of HG10003846 vs. ExPASy TrEMBL
Match: A0A5A7T3F5 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold5066G00090 PE=4 SV=1)

HSP 1 Score: 473.0 bits (1216), Expect = 9.3e-130
Identity = 244/307 (79.48%), Positives = 261/307 (85.02%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLGSLPNPNFLSRIFTSFSPSQHFWKWGALIIALLATFTGIINRVK 60
           MEVPVLNRITELGA LGSLPNPNFLSRIFTSF PSQHFWKWGALIIALLATFTGIINRVK
Sbjct: 1   MEVPVLNRITELGAHLGSLPNPNFLSRIFTSFFPSQHFWKWGALIIALLATFTGIINRVK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSLHGGETGGLVSENLKSPLFSSSESEDENEGDREPDDGL 120
           + II+ RRRT+TTSISEPLYRSLH  ++GGLVS+NLKSP  SSSESEDENE  RE ++  
Sbjct: 61  VFIIICRRRTKTTSISEPLYRSLHCRDSGGLVSKNLKSPPLSSSESEDENERGRERNNDS 120

Query: 121 DFRVKGSGRFSGEFDGRCCSRLRRRHC----DGDLFSWPCFGLDRSVVRQWGDVKLKCEF 180
           +FRVK S RFS E DG C SRLRRRHC    +GDLF WPCFGLDRSVVRQWGDV    EF
Sbjct: 121 NFRVKVSSRFSCELDGGCHSRLRRRHCNGGSNGDLFPWPCFGLDRSVVRQWGDVISNSEF 180

Query: 181 EGLSGSVISLYDENKEAEICSIFSGGAPLQAAVLSPRRMVVAASEGVSANVSLKLWDTRG 240
           E LSGS+ISLYD+N+EAEICSIF+ G  L+A  +SPRRMVVAASEGVSANVSLKLWDTRG
Sbjct: 181 EELSGSMISLYDDNEEAEICSIFNEGGSLKAVAVSPRRMVVAASEGVSANVSLKLWDTRG 240

Query: 241 RSRTPVVAAEWDSPSGKIVDVYYEDVENVYLRDNGAAGIMVGDVRKVSSATENSLAGGGD 300
           RSRTPVVA EWDSPS  IVDVYYEDVENVYLRDNGAAGIMVGDVRK SS +E   AG G 
Sbjct: 241 RSRTPVVAVEWDSPSRNIVDVYYEDVENVYLRDNGAAGIMVGDVRKASSGSEKLTAGDGG 300

Query: 301 GLWESGH 304
           GLW  GH
Sbjct: 301 GLWNLGH 307

BLAST of HG10003846 vs. ExPASy TrEMBL
Match: A0A1S3BCP7 (uncharacterized protein LOC103488596 OS=Cucumis melo OX=3656 GN=LOC103488596 PE=4 SV=1)

HSP 1 Score: 473.0 bits (1216), Expect = 9.3e-130
Identity = 244/307 (79.48%), Positives = 261/307 (85.02%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLGSLPNPNFLSRIFTSFSPSQHFWKWGALIIALLATFTGIINRVK 60
           MEVPVLNRITELGA LGSLPNPNFLSRIFTSF PSQHFWKWGALIIALLATFTGIINRVK
Sbjct: 1   MEVPVLNRITELGAHLGSLPNPNFLSRIFTSFFPSQHFWKWGALIIALLATFTGIINRVK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSLHGGETGGLVSENLKSPLFSSSESEDENEGDREPDDGL 120
           + II+ RRRT+TTSISEPLYRSLH  ++GGLVS+NLKSP  SSSESEDENE  RE ++  
Sbjct: 61  VFIIICRRRTKTTSISEPLYRSLHCRDSGGLVSKNLKSPPLSSSESEDENERGRERNNDS 120

Query: 121 DFRVKGSGRFSGEFDGRCCSRLRRRHC----DGDLFSWPCFGLDRSVVRQWGDVKLKCEF 180
           +FRVK S RFS E DG C SRLRRRHC    +GDLF WPCFGLDRSVVRQWGDV    EF
Sbjct: 121 NFRVKVSSRFSCELDGGCHSRLRRRHCNGGSNGDLFPWPCFGLDRSVVRQWGDVISNSEF 180

Query: 181 EGLSGSVISLYDENKEAEICSIFSGGAPLQAAVLSPRRMVVAASEGVSANVSLKLWDTRG 240
           E LSGS+ISLYD+N+EAEICSIF+ G  L+A  +SPRRMVVAASEGVSANVSLKLWDTRG
Sbjct: 181 EELSGSMISLYDDNEEAEICSIFNEGGSLKAVAVSPRRMVVAASEGVSANVSLKLWDTRG 240

Query: 241 RSRTPVVAAEWDSPSGKIVDVYYEDVENVYLRDNGAAGIMVGDVRKVSSATENSLAGGGD 300
           RSRTPVVA EWDSPS  IVDVYYEDVENVYLRDNGAAGIMVGDVRK SS +E   AG G 
Sbjct: 241 RSRTPVVAVEWDSPSRNIVDVYYEDVENVYLRDNGAAGIMVGDVRKASSGSEKLTAGDGG 300

Query: 301 GLWESGH 304
           GLW  GH
Sbjct: 301 GLWNLGH 307

BLAST of HG10003846 vs. ExPASy TrEMBL
Match: A0A6J1CEG4 (uncharacterized protein LOC111010475 OS=Momordica charantia OX=3673 GN=LOC111010475 PE=4 SV=1)

HSP 1 Score: 417.2 bits (1071), Expect = 6.0e-113
Identity = 218/302 (72.19%), Positives = 246/302 (81.46%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLGSLPNPNFLSRIFTSFSPSQHFWKWGALIIALLATFTGIINRVK 60
           MEVPVLNRITELGADL SLPN NFLSRI TSFSPSQHFWKWGA++IALLATF+G+INRVK
Sbjct: 1   MEVPVLNRITELGADLSSLPNANFLSRILTSFSPSQHFWKWGAVVIALLATFSGLINRVK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSLHGGETGGLVSENLKSPLFSSSESEDENEGDREPDDGL 120
           ILIIVIRRR RTT I EPL RSLHGGE GG VSENL SP F SSESEDENE    P+DG 
Sbjct: 61  ILIIVIRRR-RTTPIYEPLSRSLHGGENGGFVSENLGSPPFFSSESEDENELS-SPEDGS 120

Query: 121 DFRVKGSGRFSGEFD-GRCCSRLRRRHCDGDLFSWPCFGLDRSVVRQWGDVKLKCEFEGL 180
           DF VKGSGR S ++  GR CS LRRRH  GD  SW CFG +RSVVRQWG+V+LKC+F+ L
Sbjct: 121 DFGVKGSGRSSDDYSCGRRCSGLRRRHYGGDSLSWSCFGSERSVVRQWGEVQLKCKFDEL 180

Query: 181 SGSVISLYDENKEAEICSIFSGGAPLQAAVLSPRRMVVAASEGVSANVSLKLWDTRGRSR 240
           SGSVISLYDEN+E EICSIFSGGAP++AA +SP  MVV+A + V  NVS+KLWDTR RS+
Sbjct: 181 SGSVISLYDENEEKEICSIFSGGAPVRAAAMSPAGMVVSAGQSVFGNVSVKLWDTRSRSQ 240

Query: 241 TPVVAAEWDSPSGKIVDVYYEDVENVYLRDNGAAGIMVGDVRKVSSATENSLAGGGDGLW 300
           TP+VAAEW+SP+ KIVDVYYE+ E VYLR++  A + V DVRKV SA ENS  GG D  W
Sbjct: 241 TPIVAAEWNSPAAKIVDVYYEESEKVYLRNDSDAKLTVADVRKVCSALENSNLGGVDRWW 300

Query: 301 ES 302
            +
Sbjct: 301 HA 300

BLAST of HG10003846 vs. ExPASy TrEMBL
Match: A0A6J1I6N0 (uncharacterized protein LOC111470454 OS=Cucurbita maxima OX=3661 GN=LOC111470454 PE=4 SV=1)

HSP 1 Score: 414.8 bits (1065), Expect = 3.0e-112
Identity = 219/307 (71.34%), Positives = 246/307 (80.13%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLGSLPNPNFLSRIFTSFSPSQHFWKWGALIIALLATFTGIINRVK 60
           MEVPVLN IT  G +LGS+P+PNFLSRIFTSFS  Q FWKWGAL IALLATF+GIINR+K
Sbjct: 1   MEVPVLNMITGFGRELGSIPDPNFLSRIFTSFSIFQQFWKWGALFIALLATFSGIINRIK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSLHGGETGGLVSENLKSPLFSSSESEDENEG--DREPDD 120
             +IVI RRTRTT ISEPL  SLHGGE GGL+SEN +SP  SSSESEDENEG  DREPDD
Sbjct: 61  TSVIVIHRRTRTTPISEPLSSSLHGGENGGLISENFRSPPLSSSESEDENEGDQDREPDD 120

Query: 121 GLDFRVKGSGRFSGEFDGRCCSRLRRRH----CDGDLFSWPCFGLDRSVVRQWGDVKLKC 180
            LDF VKGS RFSGEFD R  + LRRRH     +GD FSWPCF  D+SVV+QWGDVKLKC
Sbjct: 121 RLDFLVKGSVRFSGEFDDRRFTGLRRRHGSRGGNGDSFSWPCFVSDKSVVKQWGDVKLKC 180

Query: 181 EFEGLSGSVISLYDENKEAEICSIFSGGAPLQAAVLSPRRMVVAASEGVSANVSLKLWDT 240
           EFE LSGSVI +YDEN+EAEICSIFSGG PL+AA LS  +MVVAA E    N+SLK+WDT
Sbjct: 181 EFEELSGSVILVYDENEEAEICSIFSGGDPLKAAALSAAKMVVAARESGLGNMSLKIWDT 240

Query: 241 RGRSRTPVVAAEWDSPSGKIVDVYYEDVENVYLRDNGAAGIMVGDVRKVSSATENSLAGG 300
           R RS+TPV+AAEW+SP  KIVDVY E++E V + D GAAG+MVGDVRK  SA+E    GG
Sbjct: 241 RDRSQTPVIAAEWNSP--KIVDVYSEEIEKVDIGDKGAAGMMVGDVRKFWSASEKWRKGG 300

Query: 301 GDGLWES 302
           G+G WES
Sbjct: 301 GEGWWES 305

BLAST of HG10003846 vs. TAIR 10
Match: AT1G68440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G25400.2); Has 86 Blast hits to 86 proteins in 29 species: Archae - 0; Bacteria - 6; Metazoa - 27; Fungi - 11; Plants - 24; Viruses - 0; Other Eukaryotes - 18 (source: NCBI BLink). )

HSP 1 Score: 108.6 bits (270), Expect = 8.8e-24
Identity = 96/326 (29.45%), Positives = 147/326 (45.09%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLGSLPNPNFLSRIFT-----SFSPSQHFWKWGALIIALLATFTGI 60
           MEVPV+NRI +    + S+ +P+FLSR            +  FWKWGALIIA LA FT  
Sbjct: 1   MEVPVINRIRDFEVGINSINDPSFLSRSVAVSGIGKLHQAYGFWKWGALIIAFLAYFTNF 60

Query: 61  INRVKILIIVIRRRTRTTSISEPLYRSLHGGETGGLVSENLKSPLFSSSESEDENEGDRE 120
           ++++  L  V+R R    S+S P     +  ++      +  S + S  E ++E+E D E
Sbjct: 61  VSKLNSL--VVRLRKIDVSVSSPTLFDDYDSDS----DVSCSSTVSSDDEKDEEDEADDE 120

Query: 121 PDD----------GLDFRVKGSGRF---SGEFDGRCCSRLRRRHCD--GDLFSWPCFG-- 180
            +D             FRV+GS  +     + D   C+ + RR+    GDLFSWP  G  
Sbjct: 121 DEDVDSIFNRRRVNGGFRVRGSDYYDDDDDQGDNGNCTWMGRRYSGSFGDLFSWPDLGGI 180

Query: 181 LDRSVVRQWGDVKLKCEFEGLSGSVISLYDENKEAEICSIFSGGAPLQAAVLSPRRMVVA 240
               VV+ W  + +  +      +V++ + +N  +     F                  A
Sbjct: 181 GSSGVVKLWDHLDIDGDDH---ENVVATFLKNYNSTSSPFF----------------WAA 240

Query: 241 ASEGVSANVSLKLWDTRGRSRTPVVAAEWDSPS---GKIVDVYYEDVENVYLRDNGAAGI 300
             +GV A V +K  D R   R P + AEW  P    G I+ V    VE VY+RD+ +  I
Sbjct: 241 EKKGVDA-VKVKACDPRAGFRMPALLAEWRQPGRLLGNIIGVDTGGVEKVYVRDDVSGEI 300

Query: 301 MVGDVRKVSSATENSLAGGGDGLWES 302
            VGD+RK +    +      +  W++
Sbjct: 301 AVGDLRKFNGVLTDLTECEAETWWDA 300

BLAST of HG10003846 vs. TAIR 10
Match: AT1G25400.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G68440.1); Has 21 Blast hits to 21 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 21; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 97.1 bits (240), Expect = 2.7e-20
Identity = 89/298 (29.87%), Positives = 136/298 (45.64%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLGSLPNPNFLSRIFT-----SFSPSQHFWKWGA-LIIALLATFTG 60
           MEVP++NRI +    + S+ +P++LSR            +  FWKWGA L++A  A+FT 
Sbjct: 1   MEVPIINRIGDFDMGINSINDPSYLSRALAVSGVGKLHQAYSFWKWGALLLLAFFASFTS 60

Query: 61  IINRVKILIIVIRRRTRTTSISEPLYRSLHGGETGGLVSENLKSPLFSSSESEDENEGDR 120
           +  R+K L+     R R  ++S P    L   ++    S        S S  E+++E D 
Sbjct: 61  LTTRIKTLVF----RLRNVNVSLPSQTLLCNYDSDSDWS------FSSDSSDEEKDEDDN 120

Query: 121 EPDDGL--DFRVKGSGRFSGEFD-GRCCSRLRRRHCDGDLFSWPCFGLDRSVVRQWGDVK 180
           + DD +  D RV+  G +  + D G   S    R C G        G    VV+ W ++ 
Sbjct: 121 KEDDSVNGDSRVQRFGYYHDDDDKGISGSVPWLRRCSGSFGDLLDLG-SSGVVKLWDNL- 180

Query: 181 LKCEFEGLSGSVISLYDENKEAEICSIFSGGAPLQAAVLSPRRMVVAASEGVSANVSLKL 240
              +F G    V S + +      C  +S    L +AVL      +AA +  S  + +  
Sbjct: 181 ---DFNGEGSPVASFFSK------CGSYS---LLSSAVL------LAAEKKGSDGLEVSA 240

Query: 241 WDTRGRSRTPVVAAEWDSPS---GKIVDVYYEDVENVYLRDNGAAGIMVGDVRKVSSA 287
           WD R     P + AEW  P    GKI+ V   DV+ +Y+ D+    I VGD+R V+ A
Sbjct: 241 WDARVGFGVPALLAEWKQPGRLLGKIIRVDVGDVDKIYVGDDVEGEITVGDMRMVNGA 268

BLAST of HG10003846 vs. TAIR 10
Match: AT1G25400.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 10 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G68440.1). )

HSP 1 Score: 97.1 bits (240), Expect = 2.7e-20
Identity = 89/298 (29.87%), Positives = 136/298 (45.64%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLGSLPNPNFLSRIFT-----SFSPSQHFWKWGA-LIIALLATFTG 60
           MEVP++NRI +    + S+ +P++LSR            +  FWKWGA L++A  A+FT 
Sbjct: 1   MEVPIINRIGDFDMGINSINDPSYLSRALAVSGVGKLHQAYSFWKWGALLLLAFFASFTS 60

Query: 61  IINRVKILIIVIRRRTRTTSISEPLYRSLHGGETGGLVSENLKSPLFSSSESEDENEGDR 120
           +  R+K L+     R R  ++S P    L   ++    S        S S  E+++E D 
Sbjct: 61  LTTRIKTLVF----RLRNVNVSLPSQTLLCNYDSDSDWS------FSSDSSDEEKDEDDN 120

Query: 121 EPDDGL--DFRVKGSGRFSGEFD-GRCCSRLRRRHCDGDLFSWPCFGLDRSVVRQWGDVK 180
           + DD +  D RV+  G +  + D G   S    R C G        G    VV+ W ++ 
Sbjct: 121 KEDDSVNGDSRVQRFGYYHDDDDKGISGSVPWLRRCSGSFGDLLDLG-SSGVVKLWDNL- 180

Query: 181 LKCEFEGLSGSVISLYDENKEAEICSIFSGGAPLQAAVLSPRRMVVAASEGVSANVSLKL 240
              +F G    V S + +      C  +S    L +AVL      +AA +  S  + +  
Sbjct: 181 ---DFNGEGSPVASFFSK------CGSYS---LLSSAVL------LAAEKKGSDGLEVSA 240

Query: 241 WDTRGRSRTPVVAAEWDSPS---GKIVDVYYEDVENVYLRDNGAAGIMVGDVRKVSSA 287
           WD R     P + AEW  P    GKI+ V   DV+ +Y+ D+    I VGD+R V+ A
Sbjct: 241 WDARVGFGVPALLAEWKQPGRLLGKIIRVDVGDVDKIYVGDDVEGEITVGDMRMVNGA 268

The following BLAST results are available for this feature:
Match Name       E-value    Identity  Description
XP_038885552.1   7.2e-153   90.55     uncharacterized protein LOC120075889 [Benincasa hispida]
XP_011656511.1   8.6e-130   79.08     uncharacterized protein LOC105435751 [Cucumis sativus] >KGN45941.1 hypothetical ...
XP_008445638.2   1.9e-129   79.48     PREDICTED: uncharacterized protein LOC103488596 [Cucumis melo] >KAA0036127.1 unc...
XP_022139617.1   1.2e-112   72.19     uncharacterized protein LOC111010475 [Momordica charantia]
XP_022971770.1   6.2e-112   71.34     uncharacterized protein LOC111470454 [Cucurbita maxima]

Match Name       E-value    Identity  Description
A0A0A0KDT7       4.2e-130   79.08     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G028940 PE=4 SV=1
A0A5A7T3F5       9.3e-130   79.48     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BCP7       9.3e-130   79.48     uncharacterized protein LOC103488596 OS=Cucumis melo OX=3656 GN=LOC103488596 PE=...
A0A6J1CEG4       6.0e-113   72.19     uncharacterized protein LOC111010475 OS=Momordica charantia OX=3673 GN=LOC111010...
A0A6J1I6N0       3.0e-112   71.34     uncharacterized protein LOC111470454 OS=Cucurbita maxima OX=3661 GN=LOC111470454...

Match Name       E-value    Identity  Description
AT1G68440.1      8.8e-24    29.45     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G25400.1      2.7e-20    29.87     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G25400.2      2.7e-20    29.87     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term    Source Description     Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction    coord: 100..121
None      No IPR available  PANTHER      PTHR36715      BNAANNG41370D PROTEIN  coord: 1..301
None      No IPR available  PANTHER      PTHR36715:SF1  BNAANNG41370D PROTEIN  coord: 1..301

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10003846.1  HG10003846.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane