HG10013855 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10013855
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Unknown protein
Location: Chr02: 5433738 .. 5437056 (-)
RNA-Seq Expression: HG10013855
Synteny: HG10013855
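
The Location field packs chromosome, start, end and strand into a single string. The sketch below shows one way to split it into parts. It assumes the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state), and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'Chr02: 5433738 .. 5437056 (-)' style string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so span = end - start + 1.
        "span": end - start + 1,
    }

loc = parse_location("Chr02: 5433738 .. 5437056 (-)")
print(loc)
```

Under that assumption, the span should equal the length of the gene sequence (with introns) listed under Sequences. The "(-)" strand means the listed sequence is presumably the reverse complement of the reference strand.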
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTTTCTTTCCGAGTTCCTGAATAATACAATATTCTTGGTCACGAGGCCTTTCTCGTTTTTCATGCGTACATGCTCATTTATCTTGAAAACCTTTGTTGTTGTTGTTCAAACTTGGTTGGAGCTGTTGAAGACTTCGGTCAGTCTTCACTTGAACATATTTTGGACAACTTTGATGTGGATAATCGCAATCGTCTCTCTTCCTGGACGAATTTTAGCCGCTTTACAGAGGGAAAGGCAGGTTAGCCCTCTCGCCCTGTTGTTTATTGATTGCCATCTGTTCTTGTTCAATGATGATGTATTTTGTTTCTATCTTTTTTTTTTCCTCTGTTTTTTATTCTAGATTTGTCTCCGAGGAAAATATGCACAAAGAAAAATATTCTTCAACTTACGGTAGTAGATTTTTTCATAGACAGCATTTTGATGATTTTCATTTGCTTGCAAGTAGGAGTTAACTGCGTACTCAAACTTCTCATACGTGTCACGTTTTAACTACTATTTCAGCTATAAATGGGTATGATATTAAAGTGAAAAAATTTGGAAAGATCAAATAATTACGACTGTTTCCAGAGATCAAAATTATTTGGGAGTCCTTAATATTCGAAAGCATACAAAATATTATGTTAGTAATGTGTTTAAAACTTGGAAAGGGGTGAAGAAACATACCCTTGAGTTCTTTGCAGCCTGAATTATGGGACAATAGTTTGAACTATTTTACTTATTACTAGTCAATTGGAGAAGTGTTGTTCAAGGTTTTGTTTGTTCATCTTCGTTCTTTATTAGTCCCAAATGACTACAGTAGAAAGTTCATATGATTGACAATTTGAATTTGCTCTGGTTTTGTTTATTGGTGGCTTTTGGGATCTTTACTAGTCGATAAGTAATTGAAGTATTGGAGTACCGCATAGGCTACTTGGGCAGGTTCTGCTAATTTGGTTTGCCCTTTTTTTATACTTGATAATGATTGATTTCATGCAGTTGCAACAAAATTTGCAATTTCTGGAAATTGAGTTCGATAATGTTCTGTGGGAAAGAAAGGAGCTCCAAAAACAATTCCAGGCTGCTATGAAAGAGCATAAGATGATGGAATTGATGTTGGACGAACTTGAAATGATACATGAAAAGGCGACCAACAAGATTGCACTCTTAGAAAGTCAGGTAATGGCCCATCTATCTCCTTCAAGCTCTAGCTCCTGTACTAAAGGTCATAAATTACATCATGGAAGGTTGAAACCGAGAGCAAGTATCATAGTTTTGATAAGAAGGGGAAGGAGAAATTTATGCAGTTGTTGGAAAGGCTGAAATTTGACATAAAAACTGGGTTTACTTTTAATGAGTTGAATCTTTGTATATCTGTCAAGAACTCGGTGACAAACTCCATATAATGTCTTAACAAATGAAATCCCTCCCCCTCCCTATATATATATATATATAATATCTCTCTCACACGCACACAATGATACAGTTATTATATGAATGTGTATTTTAAGACGTTAGAATGTCCTGCAATACTGTTTGCAGTTCTGTTAACCCATGTTCCATGTGTAAACTGATTGAGCTTCTGTGATCTCTCTTAGAATGATATTAACATGGAATTGTGTTGACGAGTGCAATAAAACATAGATGTAAATTGATTCCGAGAAAGTTTTTTTTTCTTTTAGGTAGATAAACTGGCTTAAACATACCAGGAAAGCTTCACTTGTGTTGAACCATGTGGTTCTGAATTCTGATATGCATTGGCGCTATGCTAATCAGTTGTGGTATTCCAAGTAATTACCTAACAGCACATGTTTTACATGAAGCCAAGGGAACTGCCGGTTCAAATTTTTATTCAGGACTCATACTAGTTTGTTTTTTTCACCCTTAACATGTTTTTCCTAAATTTATTCAAAATCATACATCATAGTTTGTTCTTTCACTCTTGACATGTTTTTCCTAAATTTATTTAAAATCATACAGCGCTCTGTATTCTATTAAGTTAGACCAAAGTAATTCTTT
AAGCTGATACCTAGTTGCTTTGGAAGTTCAATTGTGTCTCAAGTTTTACTGCCAAGGCAGGCTCTTGCAGACATGTAGTCTAATTTTTGTTGTAAAGTGGTTAGTTAGTTTTTTTGTAATTGGTTGTGTGTTTGAGCTGTAACTAATTATGTAATTAAGAGTTAGTTTGTTGTGTTATTTTCCTTATGCTTTTCTAATTTAAGGAGCTTATATCAGGCTTGTAAAGCATCTTTTATATATCAAAAATACGTTCTGCCAACACATCATGTCAATAATTTCTCTGCGGTGTGAATTAAGATTGGTTGTGTGTATTGACTTGCGGGATGGGATATTTAATTATTTGCTAGTACTCTGAGTTTTCCTGCTAAATTCTAATATAGCTAAGTTTTAATGTTTTTCTGGCTTCTCATTAAAATCATTTACTGCAACACATCATTTCTCTTCAGCGATTGTTCTTAATAATTACAATAGTTTTTTCCACCAAGCTTATGTGAAGAGTTCAAATCAGTTTTAGCTATCTTATGTAGAGAAAGAGGAAAGACTTGGCCATGGATGTAGGATGGGAAAACGTCAGTTCATGCGGAGATTTACGTCATAACCTTGGCTGTATTTTTACATGTAGTTTTAATATTGTTCTGTCATATGTTAATAATAGCTTAGCAATATCCTTCGCTTCAAATAAACCAATTGCCGTTATTTGTTCCTAACAGATGCAGAAATTGAGAAATGAAAATCTTCGACTGCAAGAAATCAAGGGTAAGGCTTATTGGAGCTTAAAAGGTCTTGGTGTCAAAAGTGAAGCACAAAAAACTGTCAGAGTTGACAGCGATATTACCTATGGTATCTCATCATGCTCATCCAGCTATAGTGGCAGCAGTGTTATTCAAGACCTCTGTCAAAGTGATGCTTTGAAAGATGATGGTATATCTGAAGAAAAATTTATCAAAATTTTAGAATCTGGGTTAAAATCTGGCATGTTCATCCACTCTCATACTGAAATCCTATCAAAAGATGAAGATGTCACTGAAATTCTTGATGAACAAAGGGAGGTTGCAGTCTCCCGAAGTCTATTTAGTACCCTATTGTCACTCTTGGTTGGAGTGATTATATGGGAAGCTGAAGAGCCTCACTTGTGCCTTGTAGTGGCTCTCATGTTTGTGGTTAGCATCTCATTGAAGAGCGTAGTTGAGTTTTTCACCACTATTAAGAATAAACCTGCTTGGGATGCTGTTGCTCTTTTGAGCTTCAACTGGTTTGCTCGTTTGCTTGCTCCTCCGGCGTTGAGGGTTATGGAATGGTTCGGTTTCTCCATTTCCTGA

mRNA sequence

ATGGCTTTTCTTTCCGAGTTCCTGAATAATACAATATTCTTGGTCACGAGGCCTTTCTCGTTTTTCATGCGTACATGCTCATTTATCTTGAAAACCTTTGTTGTTGTTGTTCAAACTTGGTTGGAGCTGTTGAAGACTTCGGTCAGTCTTCACTTGAACATATTTTGGACAACTTTGATGTGGATAATCGCAATCGTCTCTCTTCCTGGACGAATTTTAGCCGCTTTACAGAGGGAAAGGCAGTTGCAACAAAATTTGCAATTTCTGGAAATTGAGTTCGATAATGTTCTGTGGGAAAGAAAGGAGCTCCAAAAACAATTCCAGGCTGCTATGAAAGAGCATAAGATGATGGAATTGATGTTGGACGAACTTGAAATGATACATGAAAAGGCGACCAACAAGATTGCACTCTTAGAAAGTCAGATGCAGAAATTGAGAAATGAAAATCTTCGACTGCAAGAAATCAAGGGTAAGGCTTATTGGAGCTTAAAAGGTCTTGGTGTCAAAAGTGAAGCACAAAAAACTGTCAGAGTTGACAGCGATATTACCTATGGTATCTCATCATGCTCATCCAGCTATAGTGGCAGCAGTGTTATTCAAGACCTCTGTCAAAGTGATGCTTTGAAAGATGATGGTATATCTGAAGAAAAATTTATCAAAATTTTAGAATCTGGGTTAAAATCTGGCATGTTCATCCACTCTCATACTGAAATCCTATCAAAAGATGAAGATGTCACTGAAATTCTTGATGAACAAAGGGAGGTTGCAGTCTCCCGAAGTCTATTTAGTACCCTATTGTCACTCTTGGTTGGAGTGATTATATGGGAAGCTGAAGAGCCTCACTTGTGCCTTGTAGTGGCTCTCATGTTTGTGGTTAGCATCTCATTGAAGAGCGTAGTTGAGTTTTTCACCACTATTAAGAATAAACCTGCTTGGGATGCTGTTGCTCTTTTGAGCTTCAACTGGTTTGCTCGTTTGCTTGCTCCTCCGGCGTTGAGGGTTATGGAATGGTTCGGTTTCTCCATTTCCTGA

Coding sequence (CDS)

ATGGCTTTTCTTTCCGAGTTCCTGAATAATACAATATTCTTGGTCACGAGGCCTTTCTCGTTTTTCATGCGTACATGCTCATTTATCTTGAAAACCTTTGTTGTTGTTGTTCAAACTTGGTTGGAGCTGTTGAAGACTTCGGTCAGTCTTCACTTGAACATATTTTGGACAACTTTGATGTGGATAATCGCAATCGTCTCTCTTCCTGGACGAATTTTAGCCGCTTTACAGAGGGAAAGGCAGTTGCAACAAAATTTGCAATTTCTGGAAATTGAGTTCGATAATGTTCTGTGGGAAAGAAAGGAGCTCCAAAAACAATTCCAGGCTGCTATGAAAGAGCATAAGATGATGGAATTGATGTTGGACGAACTTGAAATGATACATGAAAAGGCGACCAACAAGATTGCACTCTTAGAAAGTCAGATGCAGAAATTGAGAAATGAAAATCTTCGACTGCAAGAAATCAAGGGTAAGGCTTATTGGAGCTTAAAAGGTCTTGGTGTCAAAAGTGAAGCACAAAAAACTGTCAGAGTTGACAGCGATATTACCTATGGTATCTCATCATGCTCATCCAGCTATAGTGGCAGCAGTGTTATTCAAGACCTCTGTCAAAGTGATGCTTTGAAAGATGATGGTATATCTGAAGAAAAATTTATCAAAATTTTAGAATCTGGGTTAAAATCTGGCATGTTCATCCACTCTCATACTGAAATCCTATCAAAAGATGAAGATGTCACTGAAATTCTTGATGAACAAAGGGAGGTTGCAGTCTCCCGAAGTCTATTTAGTACCCTATTGTCACTCTTGGTTGGAGTGATTATATGGGAAGCTGAAGAGCCTCACTTGTGCCTTGTAGTGGCTCTCATGTTTGTGGTTAGCATCTCATTGAAGAGCGTAGTTGAGTTTTTCACCACTATTAAGAATAAACCTGCTTGGGATGCTGTTGCTCTTTTGAGCTTCAACTGGTTTGCTCGTTTGCTTGCTCCTCCGGCGTTGAGGGTTATGGAATGGTTCGGTTTCTCCATTTCCTGA

Protein sequence

MAFLSEFLNNTIFLVTRPFSFFMRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAIVSLPGRILAALQRERQLQQNLQFLEIEFDNVLWERKELQKQFQAAMKEHKMMELMLDELEMIHEKATNKIALLESQMQKLRNENLRLQEIKGKAYWSLKGLGVKSEAQKTVRVDSDITYGISSCSSSYSGSSVIQDLCQSDALKDDGISEEKFIKILESGLKSGMFIHSHTEILSKDEDVTEILDEQREVAVSRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSVVEFFTTIKNKPAWDAVALLSFNWFARLLAPPALRVMEWFGFSIS
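
The protein sequence can be checked against the CDS by translating it codon by codon. This is a minimal sketch, assuming the standard nuclear genetic code applies; `translate` is an illustrative helper, not a function from this database. The two test strings are the first eight codons and the last five codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCTTTTCTTTCCGAGTTCCTG"))  # start of the CDS
print(translate("TTCTCCATTTCCTGA"))           # end of the CDS, including the stop codon
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed above, provided the standard code is the right one for this gene.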
Homology
BLAST of HG10013855 vs. NCBI nr
Match: XP_038898361.1 (uncharacterized protein LOC120086031 isoform X1 [Benincasa hispida] >XP_038898362.1 uncharacterized protein LOC120086031 isoform X1 [Benincasa hispida])

HSP 1 Score: 581.6 bits (1498), Expect = 4.3e-162
Identity = 314/356 (88.20%), Positives = 330/356 (92.70%), Query Frame = 0

Query: 1   MAFLSEFLNNTIFLVTRPFSFFMRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLM 60
           MAFLSEFLN TIFLVTRPFSFFMRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLM
Sbjct: 5   MAFLSEFLNTTIFLVTRPFSFFMRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLM 64

Query: 61  WIIAIVSLPGRILAALQRERQLQQNLQFLEIEFDNVLWERKELQKQFQAAMKEHKMMELM 120
           W++AIVSLPGRILAALQRERQL+Q LQFLEIEF+NVLWERKELQKQFQAAM+EHKMMELM
Sbjct: 65  WVVAIVSLPGRILAALQRERQLRQYLQFLEIEFNNVLWERKELQKQFQAAMREHKMMELM 124

Query: 121 LDELEMIHEKATNKIALLESQMQKLRNENLRLQEIKGKAYWSLKGLGVKSEAQKTVRVDS 180
           LDELEMIHEKATNKI+LLES+MQKLRNENLRLQEIKGKAYWSLKGL VKSEAQKT RVDS
Sbjct: 125 LDELEMIHEKATNKISLLESEMQKLRNENLRLQEIKGKAYWSLKGLDVKSEAQKTGRVDS 184

Query: 181 DITYGISSCSSSYSGSSVIQDLCQSDALKDDGISEEKFIKILESGLKSGMFIHSHTEILS 240
           DIT+GISSCSSSY  SS+IQDL QSDALKD  IS+EK IKIL+SGLKSG+FIHSHTEILS
Sbjct: 185 DITHGISSCSSSYGSSSIIQDLFQSDALKDGSISKEKLIKILDSGLKSGVFIHSHTEILS 244

Query: 241 KDEDVTEILDEQREVAVSRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSVV 300
           KDEDVTEILDEQREVA+SRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSVV
Sbjct: 245 KDEDVTEILDEQREVAISRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSVV 304

Query: 301 EFFTTIKNKPAWDAVALLSFNWF-------------ARLLAPPALRVMEWFGFSIS 344
           EFFTTIKNKPA DAV+LLSFNWF             ARLLAPP LR++EWF FSIS
Sbjct: 305 EFFTTIKNKPALDAVSLLSFNWFVLGILAYPTLPIIARLLAPPTLRIVEWFSFSIS 360
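
The Identity and Positives figures in each HSP header can be recomputed from the alignment's middle line. In the usual BLAST text layout, a letter marks an identical residue, "+" marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below applies that reading; `midline_stats` and `percent` are hypothetical helper names, and the sample string is the middle line of the second alignment block of the HSP above.

```python
def midline_stats(midline):
    """Return (identities, positives, columns) for one BLAST midline string."""
    identities = sum(ch.isalpha() for ch in midline)
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

def percent(part, whole):
    """Percentage rounded to two decimals, as in the HSP header."""
    return round(100.0 * part / whole, 2)

block = "W++AIVSLPGRILAALQRERQL+Q LQFLEIEF+NVLWERKELQKQFQAAM+EHKMMELM"
print(midline_stats(block))
print(percent(314, 356), percent(330, 356))
```

Summing the per-block counts over every block of an HSP should give the numerators in its header. The denominator is the alignment length including gap columns.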

BLAST of HG10013855 vs. NCBI nr
Match: XP_031740628.1 (uncharacterized protein LOC101204571 isoform X1 [Cucumis sativus] >XP_031740629.1 uncharacterized protein LOC101204571 isoform X1 [Cucumis sativus])

HSP 1 Score: 563.9 bits (1452), Expect = 9.3e-157
Identity = 310/360 (86.11%), Positives = 323/360 (89.72%), Query Frame = 0

Query: 1   MAFLSEFLNNTIFLVTRPFSFFMRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLM 60
           MAFLSEFLN TI LVTRPFSFFMRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLM
Sbjct: 1   MAFLSEFLNTTILLVTRPFSFFMRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLM 60

Query: 61  WIIAIVSLPGRILAALQRERQLQQNLQFLEIEFDNVLWERKELQKQFQAAMKEHKMMELM 120
           WIIAIVSLPGRILAAL+RERQLQQ LQFLEI+FDNVLWERKELQKQFQAAMKEHKMMELM
Sbjct: 61  WIIAIVSLPGRILAALRRERQLQQYLQFLEIKFDNVLWERKELQKQFQAAMKEHKMMELM 120

Query: 121 LDELEMIHEKATNKIALLESQMQKLRNENLRLQEIKGKAYWSLKGLGVKSEAQKTVRVDS 180
           LDELEMIHEKATNKIALLES+MQ+LRN+NLRLQEIKGK YWSLKGL VKSEAQKT RVD 
Sbjct: 121 LDELEMIHEKATNKIALLESEMQQLRNQNLRLQEIKGKDYWSLKGLDVKSEAQKTGRVDR 180

Query: 181 DITYGISSCSSSYSGSSVIQDLCQSDALKDDGISEEKFIKILESGLKSGMFIHSHTEILS 240
           DITYGISSCSS  S SS++QDLCQ DALKD  IS+EK IKILESGLKSG+ IHSHTEILS
Sbjct: 181 DITYGISSCSSRSSSSSIVQDLCQIDALKDASISKEKLIKILESGLKSGVLIHSHTEILS 240

Query: 241 KDEDVTEILDEQREVAVSRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSVV 300
           KDE VT++LDEQREVA+SRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSVV
Sbjct: 241 KDEYVTQLLDEQREVAMSRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSVV 300

Query: 301 EFFTTIKNKPAWDAVALLSFNWF-----------------ARLLAPPALRVMEWFGFSIS 344
           EFFTTIKNKPA DAVALLSFNWF                 AR LAP A RV+EWFGFSIS
Sbjct: 301 EFFTTIKNKPALDAVALLSFNWFVLGILAYPTLPNISRFLARFLAPLASRVVEWFGFSIS 360

BLAST of HG10013855 vs. NCBI nr
Match: KAG7024718.1 (hypothetical protein SDJN02_13536, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 513.1 bits (1320), Expect = 1.9e-141
Identity = 286/349 (81.95%), Positives = 305/349 (87.39%), Query Frame = 0

Query: 1   MAFLSEFLNNTIFLVTRPFSFFMRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLM 60
           MA L EFLN TI  +T PFSFF  TCS ILKTFVVVVQTWLELLKTSVSLHLNI WTTLM
Sbjct: 87  MALLYEFLNITILFITWPFSFFKSTCSLILKTFVVVVQTWLELLKTSVSLHLNILWTTLM 146

Query: 61  WIIAIVSLPGRILAALQRERQLQQNLQFLEIEFDNVLWERKELQKQFQAAMKEHKMMELM 120
           WIIA VSLPGRILAAL+RERQLQ+NLQFL IEFDNVLWERKELQKQFQ AMKE KMMELM
Sbjct: 147 WIIAFVSLPGRILAALKRERQLQKNLQFLAIEFDNVLWERKELQKQFQTAMKEQKMMELM 206

Query: 121 LDELEMIHEKATNKIALLESQMQKLRNENLRLQEIKGKAYWSLKGLGVKSEAQKTVRVDS 180
           LDELEMIHEKATNKIALLES++QKLRNENLRLQEIKGKAYWSLKGL VKSEAQK  RV S
Sbjct: 207 LDELEMIHEKATNKIALLESEVQKLRNENLRLQEIKGKAYWSLKGLDVKSEAQKAGRVGS 266

Query: 181 DITYGISSCSSSYSGSSVIQDLCQSDALKDDGISEEKFIKILESGLKSGMFIHSHT-EIL 240
           DITYGISSCSSSYS SS++QDL +SDALKD  +S+EK I ILESG +SG+ IH+HT +IL
Sbjct: 267 DITYGISSCSSSYSDSSLVQDLSRSDALKDGNVSKEKLITILESGFQSGVLIHNHTSKIL 326

Query: 241 SKDEDVTEILDEQREVAVSRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSV 300
           S+DED+TEILDEQREVAV RSLFSTLLSLLVGVIIW+AEEPHLCLVVALMFVVSISLKSV
Sbjct: 327 SEDEDITEILDEQREVAVHRSLFSTLLSLLVGVIIWKAEEPHLCLVVALMFVVSISLKSV 386

Query: 301 VEFFTTIKNKPAWDAVALLSFNWF-------------ARLLAPPALRVM 336
           VEFFTTIKNKPA DAVALLSFNWF             AR+LAP A RV+
Sbjct: 387 VEFFTTIKNKPALDAVALLSFNWFVLGILAYPTLPNMARVLAPLASRVV 435

BLAST of HG10013855 vs. NCBI nr
Match: XP_038898364.1 (uncharacterized protein LOC120086031 isoform X3 [Benincasa hispida] >XP_038898365.1 uncharacterized protein LOC120086031 isoform X3 [Benincasa hispida])

HSP 1 Score: 501.1 bits (1289), Expect = 7.4e-138
Identity = 270/311 (86.82%), Positives = 286/311 (91.96%), Query Frame = 0

Query: 46  TSVSLHLNIFWTTLMWIIAIVSLPGRILAALQRERQLQQNLQFLEIEFDNVLWERKELQK 105
           TSVSLHLNIFWTTLMW++AIVSLPGRILAALQRERQL+Q LQFLEIEF+NVLWERKELQK
Sbjct: 2   TSVSLHLNIFWTTLMWVVAIVSLPGRILAALQRERQLRQYLQFLEIEFNNVLWERKELQK 61

Query: 106 QFQAAMKEHKMMELMLDELEMIHEKATNKIALLESQMQKLRNENLRLQEIKGKAYWSLKG 165
           QFQAAM+EHKMMELMLDELEMIHEKATNKI+LLES+MQKLRNENLRLQEIKGKAYWSLKG
Sbjct: 62  QFQAAMREHKMMELMLDELEMIHEKATNKISLLESEMQKLRNENLRLQEIKGKAYWSLKG 121

Query: 166 LGVKSEAQKTVRVDSDITYGISSCSSSYSGSSVIQDLCQSDALKDDGISEEKFIKILESG 225
           L VKSEAQKT RVDSDIT+GISSCSSSY  SS+IQDL QSDALKD  IS+EK IKIL+SG
Sbjct: 122 LDVKSEAQKTGRVDSDITHGISSCSSSYGSSSIIQDLFQSDALKDGSISKEKLIKILDSG 181

Query: 226 LKSGMFIHSHTEILSKDEDVTEILDEQREVAVSRSLFSTLLSLLVGVIIWEAEEPHLCLV 285
           LKSG+FIHSHTEILSKDEDVTEILDEQREVA+SRSLFSTLLSLLVGVIIWEAEEPHLCLV
Sbjct: 182 LKSGVFIHSHTEILSKDEDVTEILDEQREVAISRSLFSTLLSLLVGVIIWEAEEPHLCLV 241

Query: 286 VALMFVVSISLKSVVEFFTTIKNKPAWDAVALLSFNWF-------------ARLLAPPAL 344
           VALMFVVSISLKSVVEFFTTIKNKPA DAV+LLSFNWF             ARLLAPP L
Sbjct: 242 VALMFVVSISLKSVVEFFTTIKNKPALDAVSLLSFNWFVLGILAYPTLPIIARLLAPPTL 301

BLAST of HG10013855 vs. NCBI nr
Match: XP_038898366.1 (uncharacterized protein LOC120086031 isoform X4 [Benincasa hispida] >XP_038898367.1 uncharacterized protein LOC120086031 isoform X4 [Benincasa hispida])

HSP 1 Score: 474.2 bits (1219), Expect = 9.7e-130
Identity = 256/297 (86.20%), Positives = 272/297 (91.58%), Query Frame = 0

Query: 60  MWIIAIVSLPGRILAALQRERQLQQNLQFLEIEFDNVLWERKELQKQFQAAMKEHKMMEL 119
           MW++AIVSLPGRILAALQRERQL+Q LQFLEIEF+NVLWERKELQKQFQAAM+EHKMMEL
Sbjct: 1   MWVVAIVSLPGRILAALQRERQLRQYLQFLEIEFNNVLWERKELQKQFQAAMREHKMMEL 60

Query: 120 MLDELEMIHEKATNKIALLESQMQKLRNENLRLQEIKGKAYWSLKGLGVKSEAQKTVRVD 179
           MLDELEMIHEKATNKI+LLES+MQKLRNENLRLQEIKGKAYWSLKGL VKSEAQKT RVD
Sbjct: 61  MLDELEMIHEKATNKISLLESEMQKLRNENLRLQEIKGKAYWSLKGLDVKSEAQKTGRVD 120

Query: 180 SDITYGISSCSSSYSGSSVIQDLCQSDALKDDGISEEKFIKILESGLKSGMFIHSHTEIL 239
           SDIT+GISSCSSSY  SS+IQDL QSDALKD  IS+EK IKIL+SGLKSG+FIHSHTEIL
Sbjct: 121 SDITHGISSCSSSYGSSSIIQDLFQSDALKDGSISKEKLIKILDSGLKSGVFIHSHTEIL 180

Query: 240 SKDEDVTEILDEQREVAVSRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSV 299
           SKDEDVTEILDEQREVA+SRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSV
Sbjct: 181 SKDEDVTEILDEQREVAISRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSV 240

Query: 300 VEFFTTIKNKPAWDAVALLSFNWF-------------ARLLAPPALRVMEWFGFSIS 344
           VEFFTTIKNKPA DAV+LLSFNWF             ARLLAPP LR++EWF FSIS
Sbjct: 241 VEFFTTIKNKPALDAVSLLSFNWFVLGILAYPTLPIIARLLAPPTLRIVEWFSFSIS 297

BLAST of HG10013855 vs. ExPASy TrEMBL
Match: A0A6J1F6D4 (uncharacterized protein LOC111442593 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111442593 PE=4 SV=1)

HSP 1 Score: 469.2 bits (1206), Expect = 1.5e-128
Identity = 269/348 (77.30%), Positives = 283/348 (81.32%), Query Frame = 0

Query: 1   MAFLSEFLNNTIFLVTRPFSFFMRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLM 60
           MA L EFLN TI  +T PFSFF+ TCS ILKTFVVVVQTWLELLKTSVSLHLNI WTTLM
Sbjct: 1   MALLYEFLNITILFITWPFSFFISTCSLILKTFVVVVQTWLELLKTSVSLHLNILWTTLM 60

Query: 61  WIIAIVSLPGRILAALQRERQLQQNLQFLEIEFDNVLWERKELQKQFQAAMKEHKMMELM 120
           WIIA VSLPGRILAAL+RERQLQ+NLQFL IEFDNVLWERKELQKQFQ AMKE KMMELM
Sbjct: 61  WIIAFVSLPGRILAALKRERQLQKNLQFLAIEFDNVLWERKELQKQFQTAMKEQKMMELM 120

Query: 121 LDELEMIHEKATNKIALLESQMQKLRNENLRLQEIKGKAYWSLKGLGVKSEAQKTVRVDS 180
           LDELEMIHEKATNKIALLES++QKLRNENLRLQEIKGKAYWSLKGL VKSEAQKT RV S
Sbjct: 121 LDELEMIHEKATNKIALLESEVQKLRNENLRLQEIKGKAYWSLKGLDVKSEAQKTGRVGS 180

Query: 181 DITYGISSCSSSYSGSSVIQDLCQSDALKDDGISEEKFIKILESGLKSGMFIHSHTEILS 240
           DITYGISSCSSSYS SS++QDL +SDALKD+                             
Sbjct: 181 DITYGISSCSSSYSDSSLVQDLSRSDALKDE----------------------------- 240

Query: 241 KDEDVTEILDEQREVAVSRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSVV 300
            DED+TEILDEQREVAV RSLFSTLLSLLVGVIIW+AEEPHLCLVVALMFVVSISLKSVV
Sbjct: 241 -DEDITEILDEQREVAVHRSLFSTLLSLLVGVIIWKAEEPHLCLVVALMFVVSISLKSVV 300

Query: 301 EFFTTIKNKPAWDAVALLSFNWF-------------ARLLAPPALRVM 336
           EFFTTIKNKPA DAVALLSFNWF             AR+LAP A RV+
Sbjct: 301 EFFTTIKNKPALDAVALLSFNWFVLGILAYPTLPNMARVLAPLASRVV 318

BLAST of HG10013855 vs. ExPASy TrEMBL
Match: A0A6J1F5N4 (uncharacterized protein LOC111442593 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111442593 PE=4 SV=1)

HSP 1 Score: 469.2 bits (1206), Expect = 1.5e-128
Identity = 269/348 (77.30%), Positives = 283/348 (81.32%), Query Frame = 0

Query: 1   MAFLSEFLNNTIFLVTRPFSFFMRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLM 60
           MA L EFLN TI  +T PFSFF+ TCS ILKTFVVVVQTWLELLKTSVSLHLNI WTTLM
Sbjct: 85  MALLYEFLNITILFITWPFSFFISTCSLILKTFVVVVQTWLELLKTSVSLHLNILWTTLM 144

Query: 61  WIIAIVSLPGRILAALQRERQLQQNLQFLEIEFDNVLWERKELQKQFQAAMKEHKMMELM 120
           WIIA VSLPGRILAAL+RERQLQ+NLQFL IEFDNVLWERKELQKQFQ AMKE KMMELM
Sbjct: 145 WIIAFVSLPGRILAALKRERQLQKNLQFLAIEFDNVLWERKELQKQFQTAMKEQKMMELM 204

Query: 121 LDELEMIHEKATNKIALLESQMQKLRNENLRLQEIKGKAYWSLKGLGVKSEAQKTVRVDS 180
           LDELEMIHEKATNKIALLES++QKLRNENLRLQEIKGKAYWSLKGL VKSEAQKT RV S
Sbjct: 205 LDELEMIHEKATNKIALLESEVQKLRNENLRLQEIKGKAYWSLKGLDVKSEAQKTGRVGS 264

Query: 181 DITYGISSCSSSYSGSSVIQDLCQSDALKDDGISEEKFIKILESGLKSGMFIHSHTEILS 240
           DITYGISSCSSSYS SS++QDL +SDALKD+                             
Sbjct: 265 DITYGISSCSSSYSDSSLVQDLSRSDALKDE----------------------------- 324

Query: 241 KDEDVTEILDEQREVAVSRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSVV 300
            DED+TEILDEQREVAV RSLFSTLLSLLVGVIIW+AEEPHLCLVVALMFVVSISLKSVV
Sbjct: 325 -DEDITEILDEQREVAVHRSLFSTLLSLLVGVIIWKAEEPHLCLVVALMFVVSISLKSVV 384

Query: 301 EFFTTIKNKPAWDAVALLSFNWF-------------ARLLAPPALRVM 336
           EFFTTIKNKPA DAVALLSFNWF             AR+LAP A RV+
Sbjct: 385 EFFTTIKNKPALDAVALLSFNWFVLGILAYPTLPNMARVLAPLASRVV 402

BLAST of HG10013855 vs. ExPASy TrEMBL
Match: A0A1S3B9G5 (uncharacterized protein LOC103487633 OS=Cucumis melo OX=3656 GN=LOC103487633 PE=4 SV=1)

HSP 1 Score: 459.1 bits (1180), Expect = 1.6e-125
Identity = 254/297 (85.52%), Positives = 264/297 (88.89%), Query Frame = 0

Query: 60  MWIIAIVSLPGRILAALQRERQLQQNLQFLEIEFDNVLWERKELQKQFQAAMKEHKMMEL 119
           MWIIAIVSLPGRILAAL+RERQLQQ LQFLEIEFDNVL ERKELQKQFQAA+KEHKMMEL
Sbjct: 1   MWIIAIVSLPGRILAALRRERQLQQYLQFLEIEFDNVLLERKELQKQFQAALKEHKMMEL 60

Query: 120 MLDELEMIHEKATNKIALLESQMQKLRNENLRLQEIKGKAYWSLKGLGVKSEAQKTVRVD 179
           MLDELEMIHEKATNKIALLES+MQKLRNENLRLQEIKGKAYWSLKGL VKSE QKT RVD
Sbjct: 61  MLDELEMIHEKATNKIALLESEMQKLRNENLRLQEIKGKAYWSLKGLDVKSEEQKTGRVD 120

Query: 180 SDITYGISSCSSSYSGSSVIQDLCQSDALKDDGISEEKFIKILESGLKSGMFIHSHTEIL 239
            DITYGISSCSSSYS SSV+QDLCQ DALKD  IS+EK +KILESGLKSG+ IHSHTEIL
Sbjct: 121 RDITYGISSCSSSYSRSSVVQDLCQIDALKDGSISKEKLVKILESGLKSGVLIHSHTEIL 180

Query: 240 SKDEDVTEILDEQREVAVSRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSV 299
           SKDE VTE+LDEQREVA+SRSLFS LLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSV
Sbjct: 181 SKDEYVTELLDEQREVAISRSLFSILLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSV 240

Query: 300 VEFFTTIKNKPAWDAVALLSFNWF-------------ARLLAPPALRVMEWFGFSIS 344
           VEFFTTIKNKPA DAVALLSFNWF             AR LAP A RV+EW GFS S
Sbjct: 241 VEFFTTIKNKPALDAVALLSFNWFVLGILAYPTLPNIARFLAPLASRVVEWLGFSTS 297

BLAST of HG10013855 vs. ExPASy TrEMBL
Match: A0A0A0KWK5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G242360 PE=4 SV=1)

HSP 1 Score: 458.8 bits (1179), Expect = 2.1e-125
Identity = 253/301 (84.05%), Positives = 266/301 (88.37%), Query Frame = 0

Query: 60  MWIIAIVSLPGRILAALQRERQLQQNLQFLEIEFDNVLWERKELQKQFQAAMKEHKMMEL 119
           MWIIAIVSLPGRILAAL+RERQLQQ LQFLEI+FDNVLWERKELQKQFQAAMKEHKMMEL
Sbjct: 1   MWIIAIVSLPGRILAALRRERQLQQYLQFLEIKFDNVLWERKELQKQFQAAMKEHKMMEL 60

Query: 120 MLDELEMIHEKATNKIALLESQMQKLRNENLRLQEIKGKAYWSLKGLGVKSEAQKTVRVD 179
           MLDELEMIHEKATNKIALLES+MQ+LRN+NLRLQEIKGK YWSLKGL VKSEAQKT RVD
Sbjct: 61  MLDELEMIHEKATNKIALLESEMQQLRNQNLRLQEIKGKDYWSLKGLDVKSEAQKTGRVD 120

Query: 180 SDITYGISSCSSSYSGSSVIQDLCQSDALKDDGISEEKFIKILESGLKSGMFIHSHTEIL 239
            DITYGISSCSS  S SS++QDLCQ DALKD  IS+EK IKILESGLKSG+ IHSHTEIL
Sbjct: 121 RDITYGISSCSSRSSSSSIVQDLCQIDALKDASISKEKLIKILESGLKSGVLIHSHTEIL 180

Query: 240 SKDEDVTEILDEQREVAVSRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSV 299
           SKDE VT++LDEQREVA+SRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSV
Sbjct: 181 SKDEYVTQLLDEQREVAMSRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSV 240

Query: 300 VEFFTTIKNKPAWDAVALLSFNWF-----------------ARLLAPPALRVMEWFGFSI 344
           VEFFTTIKNKPA DAVALLSFNWF                 AR LAP A RV+EWFGFSI
Sbjct: 241 VEFFTTIKNKPALDAVALLSFNWFVLGILAYPTLPNISRFLARFLAPLASRVVEWFGFSI 300

BLAST of HG10013855 vs. ExPASy TrEMBL
Match: A0A6J1IHW7 (uncharacterized protein LOC111477072 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111477072 PE=4 SV=1)

HSP 1 Score: 456.4 bits (1173), Expect = 1.0e-124
Identity = 263/348 (75.57%), Positives = 280/348 (80.46%), Query Frame = 0

Query: 1   MAFLSEFLNNTIFLVTRPFSFFMRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLM 60
           MA L EFLN TI  +T PFSFF+ TCS ILKTFVVVVQTWLELLKTSVSLHLNI WTTLM
Sbjct: 1   MALLYEFLNITILFITWPFSFFISTCSLILKTFVVVVQTWLELLKTSVSLHLNILWTTLM 60

Query: 61  WIIAIVSLPGRILAALQRERQLQQNLQFLEIEFDNVLWERKELQKQFQAAMKEHKMMELM 120
           WIIA VSLPGRILAAL+RERQLQ+NLQFL IEFDNVLWERKELQKQFQ AMKE KMMELM
Sbjct: 61  WIIAFVSLPGRILAALKRERQLQKNLQFLAIEFDNVLWERKELQKQFQTAMKEQKMMELM 120

Query: 121 LDELEMIHEKATNKIALLESQMQKLRNENLRLQEIKGKAYWSLKGLGVKSEAQKTVRVDS 180
           LDELEMI+EKATNKIALLES++QKLRNEN RLQEIKGKAYWSLKG  VKSEAQKT RV S
Sbjct: 121 LDELEMINEKATNKIALLESEVQKLRNENHRLQEIKGKAYWSLKGFDVKSEAQKTSRVGS 180

Query: 181 DITYGISSCSSSYSGSSVIQDLCQSDALKDDGISEEKFIKILESGLKSGMFIHSHTEILS 240
           +ITYGISSCSSSYS SS++QDL +S+A KD+                             
Sbjct: 181 NITYGISSCSSSYSDSSLLQDLSRSEASKDE----------------------------- 240

Query: 241 KDEDVTEILDEQREVAVSRSLFSTLLSLLVGVIIWEAEEPHLCLVVALMFVVSISLKSVV 300
            DED+TEILDEQREVAV RSLFSTLLSLLVGVIIW+AEEPHLCLVVALMFVVSISLKSVV
Sbjct: 241 -DEDITEILDEQREVAVHRSLFSTLLSLLVGVIIWKAEEPHLCLVVALMFVVSISLKSVV 300

Query: 301 EFFTTIKNKPAWDAVALLSFNWF-------------ARLLAPPALRVM 336
           EFFTTIKNKPA DAV+LLSFNWF             ARLLAP A RV+
Sbjct: 301 EFFTTIKNKPALDAVSLLSFNWFVLGILAYPTLPNMARLLAPLASRVV 318

BLAST of HG10013855 vs. TAIR 10
Match: AT5G45310.1 (unknown protein; LOCATED IN: endomembrane system; EXPRESSED IN: stem, inflorescence meristem, root, leaf; EXPRESSED DURING: LP.04 four leaves visible; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 175.6 bits (444), Expect = 6.7e-44
Identity = 118/327 (36.09%), Positives = 195/327 (59.63%), Query Frame = 0

Query: 4   LSEFLNNTIFLVTRPFSFFMRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTL---M 63
           +S  ++++++L+TRPF F +  C F L+T +V      +++ +++  +L++ W  +   +
Sbjct: 7   ISGLVSSSLYLMTRPFFFCIYACVFCLRTALVTTFVSTDMVTSAIWFNLSMLWRAVRGSI 66

Query: 64  W-IIAIVSLPGRILAALQRERQLQQNLQFLEIEFDNVLWERKELQKQFQAAMKEHKMMEL 123
           W  + + + P R  A++ RER L+Q++  L  E +++ W RKE++K  + A+KE+++ME 
Sbjct: 67  WGSVLLFTFPIRFFASIPRERLLEQSIYDLRYELESLEWNRKEIEKNLREAIKEYRIMEQ 126

Query: 124 MLDELEMIHEKATNKIALLESQMQKLRNENLRLQEIKGKAYWSLKGLGVKSEAQKTVRVD 183
            LDELE  H++A +KI  LE+++Q+L+ ENL+L E+ GK Y S KG    SE    +R  
Sbjct: 127 DLDELEDEHDEAISKIEKLEAELQELKEENLQLMEVNGKDYRSKKGKVKPSEEPSEIR-- 186

Query: 184 SDITYGISSCSSSYSGSSVIQDLCQSDALKDDGISEEKFIKILESGLKSGMFIHSHTEIL 243
                                        K   I      K   + +KS ++  + + I 
Sbjct: 187 --------------------------SIHKPKNIPYASKGKAEFTSVKSPLYPFAKSTI- 246

Query: 244 SKDEDVT-EILDEQREVAVSRSLFSTLLSLLVGVIIWEAEEPHLC--LVVALMFVVSISL 303
            KDE++T  +L  ++ +AVSRS+FS +L+L+VG++++EA+E  LC  L+ AL  VV ISL
Sbjct: 247 PKDEELTPRVLGLEKNIAVSRSVFSAMLALVVGIVMYEAKEQELCTPLIGALFTVVGISL 304

Query: 304 KSVVEFFTTIKNKPAWDAVALLSFNWF 324
           KSVV+FF+T+KNKPA DAVAL+S NWF
Sbjct: 307 KSVVQFFSTVKNKPALDAVALMSLNWF 304

The following BLAST results are available for this feature:
NCBI nr
Match Name        E-value    Identity  Description
XP_038898361.1    4.3e-162   88.20     uncharacterized protein LOC120086031 isoform X1 [Benincasa hispida] >XP_03889836...
XP_031740628.1    9.3e-157   86.11     uncharacterized protein LOC101204571 isoform X1 [Cucumis sativus] >XP_031740629....
KAG7024718.1      1.9e-141   81.95     hypothetical protein SDJN02_13536, partial [Cucurbita argyrosperma subsp. argyro...
XP_038898364.1    7.4e-138   86.82     uncharacterized protein LOC120086031 isoform X3 [Benincasa hispida] >XP_03889836...
XP_038898366.1    9.7e-130   86.20     uncharacterized protein LOC120086031 isoform X4 [Benincasa hispida] >XP_03889836...

ExPASy TrEMBL
Match Name        E-value    Identity  Description
A0A6J1F6D4        1.5e-128   77.30     uncharacterized protein LOC111442593 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1F5N4        1.5e-128   77.30     uncharacterized protein LOC111442593 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A1S3B9G5        1.6e-125   85.52     uncharacterized protein LOC103487633 OS=Cucumis melo OX=3656 GN=LOC103487633 PE=...
A0A0A0KWK5        2.1e-125   84.05     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G242360 PE=4 SV=1
A0A6J1IHW7        1.0e-124   75.57     uncharacterized protein LOC111477072 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...

TAIR 10
Match Name        E-value    Identity  Description
AT5G45310.1       6.7e-44    36.09     unknown protein; LOCATED IN: endomembrane system; EXPRESSED IN: stem, infloresce...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source   Source Term  Source Description  Alignment
None      No IPR available  COILS    Coil         Coil                coord: 72..158
None      No IPR available  PANTHER  PTHR36073    FAMILY NOT NAMED    coord: 4..324
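
The Alignment column gives residue ranges on the protein. To look at the residues behind a hit such as the COILS prediction at 72..158, the range can be sliced out of the protein sequence. This sketch assumes the coordinates are 1-based and inclusive (not stated on this page). `region` is a hypothetical helper, and `N_TERM` holds only the first 180 residues of the protein shown above, which is enough to cover the coil range.

```python
# First 180 residues of the HG10013855 protein (three 60-residue rows).
N_TERM = (
    "MAFLSEFLNNTIFLVTRPFSFFMRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLM"
    "WIIAIVSLPGRILAALQRERQLQQNLQFLEIEFDNVLWERKELQKQFQAAMKEHKMMELM"
    "LDELEMIHEKATNKIALLESQMQKLRNENLRLQEIKGKAYWSLKGLGVKSEAQKTVRVDS"
)

def region(seq, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]

coil = region(N_TERM, 72, 158)
print(len(coil), coil)
```

The PANTHER range 4..324 extends past this excerpt, so slicing it would need the full 344-residue protein.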

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10013855.1  HG10013855.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane