HG10023036 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10023036
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Unknown protein
Location: Chr05: 30653832 .. 30657615 (+)
RNA-Seq Expression: HG10023036
Synteny: HG10023036
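
The Location field can be turned into a chromosome, a start, an end, a strand, and a span length. The following is a minimal sketch, not the database's own tooling: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr05: 30653832 .. 30657615 (+)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "chrom": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Chr05: 30653832 .. 30657615 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr05 + 3784
```

Under that assumption the gene spans 3784 bp on the forward strand of Chr05.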
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGGCCGACGATTTCAAAATCAAATTTCATGGGAATTATTGTTTCCATTGGTTTTTGACCCGTTTCAAGCAAATATAAATATCATCTAAGGACTGACTTAGTCGTCGGTCGCCGTCTCCGATCAGAACAAATTCTCACGCCCGATTTTGATAAAGAAAGGTTTACTTCCATTATTTTACGATTACCCATTTCATTAAACTATACTTTTCCTTTTCCAATAATGCTTGTTTTAATTTTCAGATGATCAGAAGAACAGTACCGAAATCGATTTGGAATTTGTCGGTCTGAGCAGGAAAACGCTTGGAGATTTGGGTATGTTTGTACTGAATCCTTGCTATTGAGTTTCTGGGTTATTCGATTCGACTCATTCTAGGTTATTTTGCGAAATGGGTTTCTTTTTAATTTCTGTTTGCTTGGATCCTTTGTTTAGTTTTTCTATTATCGTTTCTGGGATGATGGGTTTTTGGGGATTTTAGTGGTGATGTTGTTTGTGTAGTATTGTTTTTATTCATTTTTTTTTTTGTGTGTTTTGTTAAATTTGTTTTGGCATTACCAAAGTTGTTATTTGGTTATTGGTATCGAAATATTGGTGTCGGCTCATTTCCTGTGTTGTTCTTTTTTCTTCTTCAACTATTGCTTTTATGGGGTGGGTTTTTACCAATGCTATGTTAATTATGTTAACTTTTTTATATGTTCTGGGAGTTTGTATTTTTTTTTCCTTCTTGTTTTGATTGTTGTTTTCTTGCTTATTTTGTGTTAATTTAGGAGACTCATGGGTTAGATTCTTTGGATGTCCCTTTGATGTCCTTGAGATTGACTTTCTGAGTATGGTTCAGTTATTTCAAGACTTCTGAAGTTTTGCATTTGCAGTATGTTGGAAATTTCATTTTATTGAAGTGCTGCGTGTTTTACTTTTTCCCTTCTCATAGCTTTATGGCCATTGTTATGTTACGGCGATTCAAAAAGTTGCATTGGGATTGAACATTCGTCTTTGATTGCGGAGTTCGACTTCCCTTTTCTCCATTCTTTTGTGGCGTGGAGTGAATTTTGGTTTCTATAGTCAGAAGGCTATGAGATTGAGTACATGGCTTTAAGAGAAAGGGATCTCTCATTTGATCTCGAAAGTGGGGGGAAGATCGTTGAAGAGGTTGGAAGTGTGGAACCAAGTTCAATCAAAAGAGATGTAAAGAATATTTGGAGTAGGTTGACAGAGGATTCACTGCTAAAAGATGAGCGACTCATAGCCTCAAACAGTAATTTTGCTAATTCTGTTGCCGACATCGTTGCTGATGAGAACATAGAATTCTTGATAGATAAGAATTTGGAAGGGGAAGATGATCATGAAGTTTTTGCGCATATGGAGAAAAATAATGCTAGAGGGAAGCATAAGAATAAGAAAAAGGCTCCAAAGCCACCACGGCCGCCCAAAGGTCCTTCACTTGATGCCGCTGACCGAATGATGGTGAAGGAAATTGCAGTGCTTGCCATGAAAAAACGTGCAAGAGTTGAGCGAATGAAAGCATTGAAGAAGGCAAAAGCAGAGAAAACATCCTCTTTCAATAGTTGCATACCTGCCTTGATTATCACATTCCTCTTCTTCCTTGTAATCATCATTCAAGGTAAGCTTATTATAGGTCCAATTTCAGTATTACTTTTACTTTTAGAAACTGCTTGGTCTTGAAAACTTGAACTGCTTCGATTTATCAGCCTCTGCTATATCTGAGCTGCGGTTTGGGAAGGGCATTTGTATTGTTGGGTCCTTCAAATCTGGTTATTAGTTTATGTCTCTTTCTAGCATTATGTTTCTAAGGGAATTTTGTTCAACTGTGTTTTACAGGAATAAGCTCCAGAAGCAGTTCGATATTGCAGGGGTCGCCTGAACCTGCTGTTGGTGGTAGTAGCGGTTTCATATCCGTTCAGTACATTAAAAGCTTTCCCCCGAATGAAAGCAATGTATCCAATCCCTCCTCGTCTAAGTGCGTATTTTTAGCCGAATGG
TTATGATTGATGGATGGAATAGAAGTTTCCAAGTATTCCTAGTCTAATTTTGTACTATCTTGTTGATGCAGTTCTGCTGCATAGCTCGTTTCTGGTTTTGTTCCTCTAGAAGGCAAAAGGGTGGCTGATTTGAGGAGTTTTTTGAAGTTGAGATCCTAAGTGCCAGCTTGGTGGCTCGTATGGCCTTGATTTTTGAATAGTTATTTTGATACGCAGTTGCGTCTGACGGTTTCTGTAAATAAATACGAAGAAACATCGTTTTTTTTTTTTTTTTTGTAATCAAATGCTTTTATATTTTAATTTTATGTTAACAATTTATACTGAAAAGTTCATGGTGACCTTCTAGGATGTAGGTAAGCTGAACCTACTTAAAGGTTAAGCAGACAGTCCTTTCTTCCAGGCTTTCTTTTGATGAGTTGAATTATGAACTTTTTCCTTTTCAATTTTGATAACAAAATTATCTCTGCAGAGTTATGATGATATTATTGGCTGATGCTCATCCCTCTACACTTTTGGACATATCCAATCCCGTTCATCGAGGACAAACAGGTATGGCCACTGTCCAATTGTTTCTTAAATTTGTTAGATTTATGGGCAAATCTCTCAAGTTTCAAATCAATGAAGCTATAAACTGTTGTGTCACAAATCAATGAAGCTATAAACTGTTGTGTCACTGACTTAAAAAGCTAAGGTCACGTTGAACAACACTTTAGTTTTTAGTTTCTTATGTTTGAAATTTATGTTTGTTTCTTCTCAATTCTAGAAACAAAAACAAGTTTTTATAACTTTTTTTTTTTTTAGTTTTCAAAACTTGGCTTTGTTTCTACAACATTTGGGTATCAAAGAAACTTATAGGATATTAAATCCTAGATAGGTAGGTGGCCTTGAACCCATGACTCTTTAGCCTTTTAGTTTTTTTTTTTTTTATGTTCCCACTACCACTAGACCAACCTATAATGGTTAAATAGGGACTAATATATTTCTGTTAATGTGAAAGGTATGGAGTCTGATGATAATATCAAATTCGTGAGATGATTAAACAGTATCTTCTTTTTACTTATACTTGAGATTTTTTCATCATCAAAATATATACATGTGAAGAAATTTATTGGCATAATGTTGCAAATATGGAGGGATCCAAAAATGACTACCATAAATCTTGATTGGTGCAGATTTGCAAAGTACATTTCTTAAGCTCGTGTTATTTCAGATATTGGAGAATATCCTCTTTGGTGATATGTTTTCTTTCAAATATTGAATTCTGCTTGATGGGATTTTATGATATTATGATTCTATGTACTTTAGAGTTTCCTTAATGCTCAGTTTGAACTCTACACATAGACTGTGTCTTGCAATGAAACAAAAGCTTCACCCTTTGACACCGAACCGTCTGTCCGTCATTTTCCCCTCTTAACCAATCTTCATCTTTGGTTGCAGCCAATCCCATTACTGATACATGTATGAAAACATCACATGTCGATTTCTACATTTCGAGCCTCAGGGTCGATCCTCGCAGAAAACAGCCGACATGAAAAGGGTATAGCCGAGATCATTCTCGAAGATATGTTAGCGAGTGCTAAAGAAATTTTAGCTCAGAGTTGAGAGCTTCAACAAGACAGACTGATCCTGTGGTGAAGCTATGTCTAGCTGCCTGCGATGTGCTGAGTTGTAACCGCTACGACGATCCAATGCTCAATGGGGCATCTTATGTCGAAGGAGTACAAGATTGCAATTCCGTGGATGTAGCTCGGAAGGTCGAGGCTTGCAAAGGAGCCATTCAGTAA

mRNA sequence

ATGCGGCCGACGATTTCAAAATCAAATTTCATGGGAATTATTGTTTCCATTGATGATCAGAAGAACAGTACCGAAATCGATTTGGAATTTGTCGGTCTGAGCAGGAAAACGCTTGGAGATTTGGATAAGAATTTGGAAGGGGAAGATGATCATGAAGTTTTTGCGCATATGGAGAAAAATAATGCTAGAGGGAAGCATAAGAATAAGAAAAAGGCTCCAAAGCCACCACGGCCGCCCAAAGGTCCTTCACTTGATGCCGCTGACCGAATGATGGTGAAGGAAATTGCAGTGCTTGCCATGAAAAAACGTGCAAGAGTTGAGCGAATGAAAGCATTGAAGAAGGCAAAAGCAGAGAAAACATCCTCTTTCAATAGTTGCATACCTGCCTTGATTATCACATTCCTCTTCTTCCTTGTAATCATCATTCAAGGAATAAGCTCCAGAAGCAGTTCGATATTGCAGGGGTCGCCTGAACCTGCTGTTGGTGGTAGTAGCGGTTTCATATCCGTTCAGTACATTAAAAGCTTTCCCCCGAATGAAAGCAATGTATCCAATCCCTCCTCGTCTAAAGTTATGATGATATTATTGGCTGATGCTCATCCCTCTACACTTTTGGACATATCCAATCCCGTTCATCGAGGACAAACAGAGTTGAGAGCTTCAACAAGACAGACTGATCCTGTGGTGAAGCTATGTCTAGCTGCCTGCGATGTGCTGAGTTGTAACCGCTACGACGATCCAATGCTCAATGGGGCATCTTATGTCGAAGGAGTACAAGATTGCAATTCCGTGGATGTAGCTCGGAAGGTCGAGGCTTGCAAAGGAGCCATTCAGTAA

Coding sequence (CDS)

ATGCGGCCGACGATTTCAAAATCAAATTTCATGGGAATTATTGTTTCCATTGATGATCAGAAGAACAGTACCGAAATCGATTTGGAATTTGTCGGTCTGAGCAGGAAAACGCTTGGAGATTTGGATAAGAATTTGGAAGGGGAAGATGATCATGAAGTTTTTGCGCATATGGAGAAAAATAATGCTAGAGGGAAGCATAAGAATAAGAAAAAGGCTCCAAAGCCACCACGGCCGCCCAAAGGTCCTTCACTTGATGCCGCTGACCGAATGATGGTGAAGGAAATTGCAGTGCTTGCCATGAAAAAACGTGCAAGAGTTGAGCGAATGAAAGCATTGAAGAAGGCAAAAGCAGAGAAAACATCCTCTTTCAATAGTTGCATACCTGCCTTGATTATCACATTCCTCTTCTTCCTTGTAATCATCATTCAAGGAATAAGCTCCAGAAGCAGTTCGATATTGCAGGGGTCGCCTGAACCTGCTGTTGGTGGTAGTAGCGGTTTCATATCCGTTCAGTACATTAAAAGCTTTCCCCCGAATGAAAGCAATGTATCCAATCCCTCCTCGTCTAAAGTTATGATGATATTATTGGCTGATGCTCATCCCTCTACACTTTTGGACATATCCAATCCCGTTCATCGAGGACAAACAGAGTTGAGAGCTTCAACAAGACAGACTGATCCTGTGGTGAAGCTATGTCTAGCTGCCTGCGATGTGCTGAGTTGTAACCGCTACGACGATCCAATGCTCAATGGGGCATCTTATGTCGAAGGAGTACAAGATTGCAATTCCGTGGATGTAGCTCGGAAGGTCGAGGCTTGCAAAGGAGCCATTCAGTAA

Protein sequence

MRPTISKSNFMGIIVSIDDQKNSTEIDLEFVGLSRKTLGDLDKNLEGEDDHEVFAHMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSSSKVMMILLADAHPSTLLDISNPVHRGQTELRASTRQTDPVVKLCLAACDVLSCNRYDDPMLNGASYVEGVQDCNSVDVARKVEACKGAIQ
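
The protein sequence above is the conceptual translation of the CDS. As a rough illustration, the sketch below translates a DNA coding sequence with the standard genetic code (NCBI translation table 1). It assumes this is a nuclear gene using the standard code; the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # X marks a codon that is not in the table (for example one containing N).
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 24 and last 12 nucleotides of the CDS listed above.
print(translate("ATGCGGCCGACGATTTCAAAATCA"))  # MRPTISKS
print(translate("GCCATTCAGTAA"))              # AIQ
```

Both fragments agree with the start (MRPTISKS) and end (AIQ) of the protein sequence listed in this record.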
Homology
BLAST of HG10023036 vs. NCBI nr
Match: XP_038898210.1 (uncharacterized protein LOC120085950 [Benincasa hispida] >XP_038898211.1 uncharacterized protein LOC120085950 [Benincasa hispida] >XP_038898212.1 uncharacterized protein LOC120085950 [Benincasa hispida] >XP_038898213.1 uncharacterized protein LOC120085950 [Benincasa hispida] >XP_038898214.1 uncharacterized protein LOC120085950 [Benincasa hispida] >XP_038898215.1 uncharacterized protein LOC120085950 [Benincasa hispida] >XP_038898216.1 uncharacterized protein LOC120085950 [Benincasa hispida] >XP_038898217.1 uncharacterized protein LOC120085950 [Benincasa hispida] >XP_038898219.1 uncharacterized protein LOC120085950 [Benincasa hispida] >XP_038898220.1 uncharacterized protein LOC120085950 [Benincasa hispida] >XP_038898221.1 uncharacterized protein LOC120085950 [Benincasa hispida] >XP_038898222.1 uncharacterized protein LOC120085950 [Benincasa hispida] >XP_038898223.1 uncharacterized protein LOC120085950 [Benincasa hispida] >XP_038898224.1 uncharacterized protein LOC120085950 [Benincasa hispida])

HSP 1 Score: 270.0 bits (689), Expect = 2.3e-68
Identity = 142/148 (95.95%), Positives = 145/148 (97.97%), Query Frame = 0

Query: 41  LDKNLEGEDDHEVFAHMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAM 100
           +DKNLEGEDDHEVF HMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAM
Sbjct: 76  IDKNLEGEDDHEVFVHMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAM 135

Query: 101 KKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISSRSSSILQGSPEPA 160
           KKRAR ERMKALKKAKAEKTSSFNSCIPA+IITFLFFLVIIIQGISSRSSSILQGSPEPA
Sbjct: 136 KKRARAERMKALKKAKAEKTSSFNSCIPAMIITFLFFLVIIIQGISSRSSSILQGSPEPA 195

Query: 161 VGGSSGFISVQYIKSFPPNESNVSNPSS 189
           VGGSSGFISVQYIKSFPPNESN+SNP S
Sbjct: 196 VGGSSGFISVQYIKSFPPNESNISNPPS 223
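
The percentages on each HSP statistics line are the matching positions divided by the alignment length. A minimal sketch of that arithmetic follows; the helper name `hsp_fractions` is hypothetical, and it simply reads every `a/b` fraction on the line.

```python
import re

def hsp_fractions(stats_line):
    """Return (matches, alignment_length, percent) for each 'a/b' fraction."""
    results = []
    for num, den in re.findall(r"(\d+)/(\d+)", stats_line):
        matches, length = int(num), int(den)
        results.append((matches, length, round(100.0 * matches / length, 2)))
    return results

line = "Identity = 142/148 (95.95%), Positives = 145/148 (97.97%), Query Frame = 0"
for matches, length, percent in hsp_fractions(line):
    print(f"{matches}/{length} = {percent}%")
```

For the first hit this reproduces the reported values: 142 identical positions out of 148 aligned gives 95.95%, and 145 out of 148 gives 97.97%.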

BLAST of HG10023036 vs. NCBI nr
Match: XP_031744965.1 (uncharacterized protein LOC101218752 [Cucumis sativus] >XP_031744966.1 uncharacterized protein LOC101218752 [Cucumis sativus] >XP_031744967.1 uncharacterized protein LOC101218752 [Cucumis sativus] >XP_031744968.1 uncharacterized protein LOC101218752 [Cucumis sativus] >XP_031744969.1 uncharacterized protein LOC101218752 [Cucumis sativus] >KGN45581.2 hypothetical protein Csa_016785 [Cucumis sativus])

HSP 1 Score: 263.5 bits (672), Expect = 2.1e-66
Identity = 140/148 (94.59%), Positives = 143/148 (96.62%), Query Frame = 0

Query: 41  LDKNLEGEDDHEVFAHMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAM 100
           +DKNLEGED HEVF H+EKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKE+AVLAM
Sbjct: 76  IDKNLEGEDVHEVFVHVEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAM 135

Query: 101 KKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISSRSSSILQGSPEPA 160
           KKRAR ERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGIS RSSSILQGSPEPA
Sbjct: 136 KKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPA 195

Query: 161 VGGSSGFISVQYIKSFPPNESNVSNPSS 189
           VGGSSGFISVQYIKSFPPNESNVSNP S
Sbjct: 196 VGGSSGFISVQYIKSFPPNESNVSNPPS 223

BLAST of HG10023036 vs. NCBI nr
Match: KAA0046709.1 (putative transmembrane protein [Cucumis melo var. makuwa])

HSP 1 Score: 260.8 bits (665), Expect = 1.4e-65
Identity = 149/187 (79.68%), Positives = 156/187 (83.42%), Query Frame = 0

Query: 2   RPTISKSNFMGIIVSIDDQKNSTEIDLEFVGLSRKTLGDLDKNLEGEDDHEVFAHMEKNN 61
           R   S SNF   +  I        I  E +GL       +DKNLEGEDDHEVFAHMEK+N
Sbjct: 102 RAIASNSNFANTVADI--------IADESLGLL------IDKNLEGEDDHEVFAHMEKSN 161

Query: 62  ARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTS 121
           ARGKHKNKKKA KPPRPPKGPSLDAADRMMVKE+AVLAMKKRAR ERMKALKKAKAEKTS
Sbjct: 162 ARGKHKNKKKALKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTS 221

Query: 122 SFNSCIPALIITFLFFLVIIIQGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNES 181
           SFNSCIPALIITFLFFLVIIIQGIS RSSSILQGSPEPAVGGSSGFISVQYIKSFPP+ES
Sbjct: 222 SFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSSGFISVQYIKSFPPSES 274

Query: 182 NVSNPSS 189
           +VSNP S
Sbjct: 282 DVSNPPS 274

BLAST of HG10023036 vs. NCBI nr
Match: XP_008451449.1 (PREDICTED: uncharacterized protein LOC103492741 [Cucumis melo] >XP_008451450.1 PREDICTED: uncharacterized protein LOC103492741 [Cucumis melo] >XP_016901101.1 PREDICTED: uncharacterized protein LOC103492741 [Cucumis melo] >XP_016901102.1 PREDICTED: uncharacterized protein LOC103492741 [Cucumis melo] >XP_016901103.1 PREDICTED: uncharacterized protein LOC103492741 [Cucumis melo] >XP_016901104.1 PREDICTED: uncharacterized protein LOC103492741 [Cucumis melo] >XP_016901105.1 PREDICTED: uncharacterized protein LOC103492741 [Cucumis melo] >XP_016901106.1 PREDICTED: uncharacterized protein LOC103492741 [Cucumis melo] >XP_016901107.1 PREDICTED: uncharacterized protein LOC103492741 [Cucumis melo] >XP_016901108.1 PREDICTED: uncharacterized protein LOC103492741 [Cucumis melo] >XP_016901109.1 PREDICTED: uncharacterized protein LOC103492741 [Cucumis melo] >XP_016901110.1 PREDICTED: uncharacterized protein LOC103492741 [Cucumis melo])

HSP 1 Score: 258.8 bits (660), Expect = 5.2e-65
Identity = 148/187 (79.14%), Positives = 156/187 (83.42%), Query Frame = 0

Query: 2   RPTISKSNFMGIIVSIDDQKNSTEIDLEFVGLSRKTLGDLDKNLEGEDDHEVFAHMEKNN 61
           R   S SNF   +  I        I  E +GL       ++KNLEGEDDHEVFAHMEK+N
Sbjct: 51  RAIASNSNFANTVADI--------IADESLGLL------INKNLEGEDDHEVFAHMEKSN 110

Query: 62  ARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTS 121
           ARGKHKNKKKA KPPRPPKGPSLDAADRMMVKE+AVLAMKKRAR ERMKALKKAKAEKTS
Sbjct: 111 ARGKHKNKKKALKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTS 170

Query: 122 SFNSCIPALIITFLFFLVIIIQGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNES 181
           SFNSCIPALIITFLFFLVIIIQGIS RSSSILQGSPEPAVGGSSGFISVQYIKSFPP+ES
Sbjct: 171 SFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSSGFISVQYIKSFPPSES 223

Query: 182 NVSNPSS 189
           +VSNP S
Sbjct: 231 DVSNPPS 223

BLAST of HG10023036 vs. NCBI nr
Match: TYK18245.1 (putative transmembrane protein [Cucumis melo var. makuwa])

HSP 1 Score: 257.3 bits (656), Expect = 1.5e-64
Identity = 147/184 (79.89%), Positives = 154/184 (83.70%), Query Frame = 0

Query: 2   RPTISKSNFMGIIVSIDDQKNSTEIDLEFVGLSRKTLGDLDKNLEGEDDHEVFAHMEKNN 61
           R   S SNF   +  I        I  E +GL       +DKNLEGEDDHEVFAHMEK+N
Sbjct: 102 RAIASNSNFANTVADI--------IADESLGLL------IDKNLEGEDDHEVFAHMEKSN 161

Query: 62  ARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTS 121
           ARGKHKNKKKA KPPRPPKGPSLDAADRMMVKE+AVLAMKKRAR ERMKALKKAKAEKTS
Sbjct: 162 ARGKHKNKKKALKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTS 221

Query: 122 SFNSCIPALIITFLFFLVIIIQGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNES 181
           SFNSCIPALIITFLFFLVIIIQGIS RSSSILQGSPEPAVGGSSGFISVQYIKSFPP+ES
Sbjct: 222 SFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSSGFISVQYIKSFPPSES 271

Query: 182 NVSN 186
           +VSN
Sbjct: 282 DVSN 271

BLAST of HG10023036 vs. ExPASy TrEMBL
Match: A0A5A7TZQ1 (Putative transmembrane protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold427G00780 PE=4 SV=1)

HSP 1 Score: 260.8 bits (665), Expect = 6.6e-66
Identity = 149/187 (79.68%), Positives = 156/187 (83.42%), Query Frame = 0

Query: 2   RPTISKSNFMGIIVSIDDQKNSTEIDLEFVGLSRKTLGDLDKNLEGEDDHEVFAHMEKNN 61
           R   S SNF   +  I        I  E +GL       +DKNLEGEDDHEVFAHMEK+N
Sbjct: 102 RAIASNSNFANTVADI--------IADESLGLL------IDKNLEGEDDHEVFAHMEKSN 161

Query: 62  ARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTS 121
           ARGKHKNKKKA KPPRPPKGPSLDAADRMMVKE+AVLAMKKRAR ERMKALKKAKAEKTS
Sbjct: 162 ARGKHKNKKKALKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTS 221

Query: 122 SFNSCIPALIITFLFFLVIIIQGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNES 181
           SFNSCIPALIITFLFFLVIIIQGIS RSSSILQGSPEPAVGGSSGFISVQYIKSFPP+ES
Sbjct: 222 SFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSSGFISVQYIKSFPPSES 274

Query: 182 NVSNPSS 189
           +VSNP S
Sbjct: 282 DVSNPPS 274

BLAST of HG10023036 vs. ExPASy TrEMBL
Match: A0A1S4DYQ5 (uncharacterized protein LOC103492741 OS=Cucumis melo OX=3656 GN=LOC103492741 PE=4 SV=1)

HSP 1 Score: 258.8 bits (660), Expect = 2.5e-65
Identity = 148/187 (79.14%), Positives = 156/187 (83.42%), Query Frame = 0

Query: 2   RPTISKSNFMGIIVSIDDQKNSTEIDLEFVGLSRKTLGDLDKNLEGEDDHEVFAHMEKNN 61
           R   S SNF   +  I        I  E +GL       ++KNLEGEDDHEVFAHMEK+N
Sbjct: 51  RAIASNSNFANTVADI--------IADESLGLL------INKNLEGEDDHEVFAHMEKSN 110

Query: 62  ARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTS 121
           ARGKHKNKKKA KPPRPPKGPSLDAADRMMVKE+AVLAMKKRAR ERMKALKKAKAEKTS
Sbjct: 111 ARGKHKNKKKALKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTS 170

Query: 122 SFNSCIPALIITFLFFLVIIIQGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNES 181
           SFNSCIPALIITFLFFLVIIIQGIS RSSSILQGSPEPAVGGSSGFISVQYIKSFPP+ES
Sbjct: 171 SFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSSGFISVQYIKSFPPSES 223

Query: 182 NVSNPSS 189
           +VSNP S
Sbjct: 231 DVSNPPS 223

BLAST of HG10023036 vs. ExPASy TrEMBL
Match: A0A5D3D3X4 (Putative transmembrane protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold411G001850 PE=4 SV=1)

HSP 1 Score: 257.3 bits (656), Expect = 7.3e-65
Identity = 147/184 (79.89%), Positives = 154/184 (83.70%), Query Frame = 0

Query: 2   RPTISKSNFMGIIVSIDDQKNSTEIDLEFVGLSRKTLGDLDKNLEGEDDHEVFAHMEKNN 61
           R   S SNF   +  I        I  E +GL       +DKNLEGEDDHEVFAHMEK+N
Sbjct: 102 RAIASNSNFANTVADI--------IADESLGLL------IDKNLEGEDDHEVFAHMEKSN 161

Query: 62  ARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTS 121
           ARGKHKNKKKA KPPRPPKGPSLDAADRMMVKE+AVLAMKKRAR ERMKALKKAKAEKTS
Sbjct: 162 ARGKHKNKKKALKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTS 221

Query: 122 SFNSCIPALIITFLFFLVIIIQGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNES 181
           SFNSCIPALIITFLFFLVIIIQGIS RSSSILQGSPEPAVGGSSGFISVQYIKSFPP+ES
Sbjct: 222 SFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSSGFISVQYIKSFPPSES 271

Query: 182 NVSN 186
           +VSN
Sbjct: 282 DVSN 271

BLAST of HG10023036 vs. ExPASy TrEMBL
Match: A0A0A0K5U6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G407830 PE=4 SV=1)

HSP 1 Score: 253.1 bits (645), Expect = 1.4e-63
Identity = 140/164 (85.37%), Positives = 143/164 (87.20%), Query Frame = 0

Query: 41  LDKNLEGEDDHEVFAHMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAM 100
           +DKNLEGED HEVF H+EKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKE+AVLAM
Sbjct: 76  IDKNLEGEDVHEVFVHVEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAM 135

Query: 101 KKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQ----------------G 160
           KKRAR ERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQ                G
Sbjct: 136 KKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQASALSELQVQKAFLLLG 195

Query: 161 ISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSS 189
           IS RSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNP S
Sbjct: 196 ISPRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPPS 239

BLAST of HG10023036 vs. ExPASy TrEMBL
Match: A0A6J1JVV1 (uncharacterized protein LOC111488345 OS=Cucurbita maxima OX=3661 GN=LOC111488345 PE=4 SV=1)

HSP 1 Score: 243.8 bits (621), Expect = 8.4e-61
Identity = 128/146 (87.67%), Positives = 139/146 (95.21%), Query Frame = 0

Query: 41  LDKNLEGEDDHEVFAHMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAM 100
           +DKNLEGEDD E FAH+EK NARGKHKNKKKA KPPRPPKGPSLDAADR +VKEIAV+AM
Sbjct: 76  IDKNLEGEDDCEAFAHVEKTNARGKHKNKKKALKPPRPPKGPSLDAADRRLVKEIAVIAM 135

Query: 101 KKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISSRSSSILQGSPEPA 160
           KKRARVERMKAL+K+KAEKTSSFNSCIPA+IITFLFFLVII+QGISSRSS +LQGSPEPA
Sbjct: 136 KKRARVERMKALRKSKAEKTSSFNSCIPAMIITFLFFLVIILQGISSRSSPMLQGSPEPA 195

Query: 161 VGGSSGFISVQYIKSFPPNESNVSNP 187
           V GSSGFISVQYIKSFPPNESN++NP
Sbjct: 196 VDGSSGFISVQYIKSFPPNESNMANP 221

BLAST of HG10023036 vs. TAIR 10
Match: AT1G02380.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G01960.1); Has 66 Blast hits to 66 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 66; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 100.1 bits (248), Expect = 2.9e-21
Identity = 73/168 (43.45%), Positives = 99/168 (58.93%), Query Frame = 0

Query: 32  GLSRKTLGDL------DKNLEGEDDHEVFAHMEKNNARGKHKNKKKAPKPPRPPKGPSLD 91
           G+S K   DL      D+N   E   +     EK    GK K  +KA KPPRPPKGPSL 
Sbjct: 44  GVSEKIADDLSYPLIRDEN-RVETSSQSLDLSEKKCGNGKFKKSRKASKPPRPPKGPSLS 103

Query: 92  AADRMMVKEIAVLAMKKRARVERM-KALKKAKAEKTSSFNSCIP--ALIITFLFFLVIII 151
             DR ++++I  LAM+KRAR+ERM K+LK+ KA KTS  + CI   ++IIT +FF  ++ 
Sbjct: 104 ENDRKIMRDIQELAMRKRARIERMKKSLKRLKAAKTSPSSPCITIFSMIITAIFFAFLVF 163

Query: 152 QGISSRSSSI-LQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSSS 190
           QG S+ SSS+    SP P V  ++  ISVQ+   F P E    +P++S
Sbjct: 164 QGFSTGSSSMNSDKSPAPTVSPNNQMISVQFYNDFAPVEQTDPSPTTS 210

BLAST of HG10023036 vs. TAIR 10
Match: AT3G17120.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G02380.1); Has 67 Blast hits to 67 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 67; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 94.4 bits (233), Expect = 1.6e-19
Identity = 62/131 (47.33%), Positives = 84/131 (64.12%), Query Frame = 0

Query: 65  KHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFN 124
           K K KK A KPPRPP+GPSLDAAD+ +++EIA LAM KRAR+ERM+ALKK++A K +S  
Sbjct: 69  KEKRKKSASKPPRPPRGPSLDAADQKLIREIAELAMLKRARIERMRALKKSRAAKAASAA 128

Query: 125 SC---IPALIITFLFFLVIIIQGISSRSSSILQGSPEPAVGG--SSGFISVQYIKSFPPN 184
           S    + A + T +FF V++ QG+S R++    G     V G  + GF+SVQY       
Sbjct: 129 SSLGNVLATLFTAIFFFVLVFQGLSPRAAG-SSGKSHLVVAGKANGGFVSVQY------- 188

Query: 185 ESNVSNPSSSK 191
                NPS+S+
Sbjct: 189 ---AGNPSASE 188

BLAST of HG10023036 vs. TAIR 10
Match: AT3G17120.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G02380.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 94.4 bits (233), Expect = 1.6e-19
Identity = 62/131 (47.33%), Positives = 84/131 (64.12%), Query Frame = 0

Query: 65  KHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFN 124
           K K KK A KPPRPP+GPSLDAAD+ +++EIA LAM KRAR+ERM+ALKK++A K +S  
Sbjct: 69  KEKRKKSASKPPRPPRGPSLDAADQKLIREIAELAMLKRARIERMRALKKSRAAKAASAA 128

Query: 125 SC---IPALIITFLFFLVIIIQGISSRSSSILQGSPEPAVGG--SSGFISVQYIKSFPPN 184
           S    + A + T +FF V++ QG+S R++    G     V G  + GF+SVQY       
Sbjct: 129 SSLGNVLATLFTAIFFFVLVFQGLSPRAAG-SSGKSHLVVAGKANGGFVSVQY------- 188

Query: 185 ESNVSNPSSSK 191
                NPS+S+
Sbjct: 189 ---AGNPSASE 188

BLAST of HG10023036 vs. TAIR 10
Match: AT4G01960.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G02380.1); Has 67 Blast hits to 67 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 67; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 93.2 bits (230), Expect = 3.5e-19
Identity = 56/126 (44.44%), Positives = 82/126 (65.08%), Query Frame = 0

Query: 65  KHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFN 124
           K K  +K  KPPRPPKGP L A D+ +++EI  LAM+KRAR+ERMK L++ KA K+SS  
Sbjct: 91  KFKKTRKPSKPPRPPKGPLLSANDQKLMREITELAMRKRARIERMKTLRRLKAAKSSSPC 150

Query: 125 SCIPALIITFLFFLVIIIQGISSRSSSI-LQGSPEPAVGGSSGFISVQYIKSFPPNESNV 184
           S I A+I+T +FF+ +I QG  + ++S+    SP P    ++  +SVQ+   F P E   
Sbjct: 151 SSIFAMIVTVIFFVFLIFQGFFTSNASLNSDNSPAPNNSANNRMVSVQFYNEFAPRERID 210

Query: 185 SNPSSS 190
            +P++S
Sbjct: 211 PSPTTS 216

The following BLAST results are available for this feature:
Match Name      E-value  Identity (%)  Description
XP_038898210.1  2.3e-68  95.95         uncharacterized protein LOC120085950 [Benincasa hispida] >XP_038898211.1 unchara...
XP_031744965.1  2.1e-66  94.59         uncharacterized protein LOC101218752 [Cucumis sativus] >XP_031744966.1 uncharact...
KAA0046709.1    1.4e-65  79.68         putative transmembrane protein [Cucumis melo var. makuwa]
XP_008451449.1  5.2e-65  79.14         PREDICTED: uncharacterized protein LOC103492741 [Cucumis melo] >XP_008451450.1 P...
TYK18245.1      1.5e-64  79.89         putative transmembrane protein [Cucumis melo var. makuwa]

Match Name      E-value  Identity (%)  Description
A0A5A7TZQ1      6.6e-66  79.68         Putative transmembrane protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_s...
A0A1S4DYQ5      2.5e-65  79.14         uncharacterized protein LOC103492741 OS=Cucumis melo OX=3656 GN=LOC103492741 PE=...
A0A5D3D3X4      7.3e-65  79.89         Putative transmembrane protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s...
A0A0A0K5U6      1.4e-63  85.37         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G407830 PE=4 SV=1
A0A6J1JVV1      8.4e-61  87.67         uncharacterized protein LOC111488345 OS=Cucurbita maxima OX=3661 GN=LOC111488345...

Match Name      E-value  Identity (%)  Description
AT1G02380.1     2.9e-21  43.45         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G17120.1     1.6e-19  47.33         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G17120.2     1.6e-19  47.33         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G01960.1     3.5e-19  44.44         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term    Source Description         Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction        coord: 57..86
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction        coord: 62..76
None      No IPR available  PANTHER      PTHR34188      OS01G0299500 PROTEIN       coord: 40..188
None      No IPR available  PANTHER      PTHR34188:SF9  PROTEIN, PUTATIVE-RELATED  coord: 40..188

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10023036.1  HG10023036.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane