HG10009044 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10009044
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: protein ZGRF1 isoform X2
Location: Chr06: 1961948 .. 1966584 (-)
RNA-Seq Expression: HG10009044
Synteny: HG10009044
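The location string above can be turned into structured coordinates with a few lines of Python. This is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(text):
    """Split a string such as 'Chr06: 1961948 .. 1966584 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

seqid, start, end, strand = parse_location("Chr06: 1961948 .. 1966584 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 4637 bp on the minus strand of Chr06.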
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAGAAATGAATCGATGGAAGGTGACCTATACCAAGCACCTCAAGCAGAAGCGCAAAGTTTACCACGATGGTTTCTTAGACATCCACCGTTCCAGCAACAAGGTTTCTATTTCTTTCATACTTTTAACTTCACTACGCATCTTTTATTTGAACTCCTAGATATGTAATTCTGCAGAAGAACATGTCGAATTTCGGCGATTTTGAACTGGAAAGTATGGAATTAACTAGCATTGTGGCGCTAAATCCTAGAATTGGATGTAGAGCTTTGGAAGTCTCTTGATGGAAATGATTATAGTTGATAGAATTGCTGAGATGTGATTCAGTAGTTGGTTTATGCTGTTCTGGTTTCGATTCGGAGTAACATTGTGCTAATTTGTTTGATTCCTAGAAATTTGGATTTTCTCATTCTAATTGTTAGCCTAGCCGATTTGTTCTAATGGCTAGATTGACACACAGACAATGCTCTATGATGAGTGCGAGAAGCTCCTCGAATGCAGGATCCTAAAGCAAGATGAAGTTATTTGCTCTGGCGAAACGCTTATATTCAATAGTTACCTTGTTGACATTGACACTCCTCTGGGAGATCACAAGCCGGAGTCTGGTCTGAATTTCCAAGCAGGTGATGATAAGATATCTGAAAAATCTGGTGTACTGCGAGGGAAAAATTTCCGAAATAACTCAGTTTGCTTTGGTAATTAGTCTAGCTTTCCCTTATGGCTTTCTTCTTTTTAAAAAAAGTTCTTTCCATAATCAATGAGCACAGGATTAAGTTTAATTCTCTTATTGCTTGGATTATGATTAGTGTTATTTTCTTGTTTGTTACTTCAAAAACGAAGAAAGTGCAGAGAAGAATAAAACTCGGCCGAGTCTCAGCCCTTCACATAAAGTAATCAGAGGTAACTTACATTATTTGGCTAATGCAACATTTCTGTTATTATTGTTCTCCCAATTGCTTACTTAAAGCGGTTTTAATCATAACTTGGGCTAGATGTAATCTTACTGGTGATTACCATTTATTTCAGACGTGCTATTCTCATCATGTGTGTACCAATGTGCTTTTAGATTTGGCTGTAACACTGTCATTCCCTTTACATTCCAGAATTTAAGAAGAGAAGATTGAAGTGCTATGGATCGCCACAAAGTAGTCCGGACACTAGAAAGACAGAAGAGACAGGTTAGTTCATGATTTCTTTCCTGGGTTGTTATCATCATAATTTTTTTTCTTGGGGAGGGGGGCAAATGTGCAATGGATGTAGAGCATAATGTTGATGAATACGAGTCTGGCCGTCTTGATGTATTTGAAATATGTGAAAGTACTCTATATGTTTACATACGTCAGTTCAGTATGGTAAACAATATTATAAATAGCCAAATACAGAGTGTAACGAAGGTACGTCTGACAAAACAAATTAATAAACAACAGAAACTAAAATCAACTACTTTGCCTAATATGCTTTTTAAGTATATTTTCATTCCGCAGAGTCAAATTTATTATGTATCTACCAATTTATGCCATTCTGTCCTTATCCATCTTCTGTTGTCTTTATTGGCATGATTAACGTTATGATGTTTGTAGAGTGGCAGGTCCTTTACACCACAAATATTACTCAGAAAGCTAAGAAGTATCATGATGGTTTCTTAAAACTTTCAATTTGTGGATTCCTCGGGAGGCAGGTTAACTTCTTGTTTGTTACTTGAATTATCTAGTTTACAGGTCTACCATACTTCTTGATTATTTGTGAACTACTCAATTTGTGAAATTTATATCTTCGACTCAAGATCAATACATATTAGATTTTTTGTAGGGATGTATGTCATCGATATGATGTATTGATCAATTGAAAGGTAGGCCGAGAAACTGAGAGGAGCAATAAAATTTGTAGGCAGGTCGAGGGATCAGTCAGATATGGGAGGTGGCCAACTTTAATGTCTCCTTGTAGGCAATAGTCTCTAGACTTTTTTGTAATTATGATCTAAGTTTTGGTTTTTTGGATTGG
AGTCCTATATAAATAGATTTTTTTTGTTTGGTATGCTCTAGTACTCCACTGTTTTTCTCAATGAAAGAGCAGTTTCGTGGGAAAAAAAGAAAAGAAAAGGTACAATTGAAAGTTAAAGTTTCTGCCTGTACCTCTCCTGGTCTTTCTGCAGGTCACGCTCTTTGATGAAAACAGGAAACTATTGGATAGCAGATTTATTAAGAAAGATGAAACAGTAAAATGTGGAGAGTCAATGGCCTTTGATGCTCACTTAGTAGAAATTGGAGAATGTGAAAGTGACCATAAGCCTCCTAAAATTCCTATAAATCAAGGTAGTAGCAGTTCTGGAAATGGGGGAACCAGGGTACTGCATGGACAGAAAAGCTGTTTCAGTGAAAATGAAATATCAACTGGAAAAGGTCAGCTGTTTTTGCTACATTAATCATGATCATCGTAGATGAAATTTGTCAGCATTGTTCATTTTTTTCCTCTTTCGGCCATAAGGTGTCATGAAGTCTGTTTTCCTCTTGTTGTACGTCTACTTAGGCATTCGTAGTGATTTAGTGCAGAATGGAATGTTTTGTACACTAGTCAGATAACTCAGAAGTCCAAGAAATATCACAACGGGATCATAAAAAATTTCTCTGGCTCTCACCATAGGAGCTGTTTTTGCTACATCAATCATAATCATCGTAGATAAAATTTGTCAGCATTGTTCACCTTTTTCCTCGCCATAAGGTGTCATGAAGTCTGTTTTCCTCTTGTTGTACGTCTACTTAGGCATTGTTTTAGTGCAGAATGGAATGCTTTGTACACTAGCCAGATAACTCAGAAGTCCAAGAAATATCACAACGGGATCATCAAAATTTCCTCCTCTGGCTCTCTCCATATGCAGGTTAATTTACTTTGTCTTGAACATCATGATACCTGCTGTCTGTTTTGCAGTTGTACGAAAATACAATTCGGTCACTCCCCTCCCCCTCCCAGAGAAGAATGTAAATTTTAGTTCACAATATGAGTTTCATGTTTAAGTTCAACTGTTTTCGTAAGCCAAACAACATTTTTGGTGCCATGGATACAGATAGATACATGCCTAAAGGTGCAAATGAGATTGTTGGAAAAGAACATCAGAACTACTGATGCCTTACTCTAACAATAGGTAGTTTACACAAAATTTCAGCTGTTATCAATCAAGTGATCACTTCTGGGACATGCTGGTCATTATCCAGCATTGATAAATGAGTAGCTGGCTAATCATCAAGTAAAAAATAAAACAAAATTCTTATCAGATTTGACCATCACATTTATAGTTCTAGAGTCGTCCGATGACTATGGATTCAAGCTTAGATTGTATAGCAGATGTACAAGAAAGATGGAAGGTTTAGGATAGGATATTAACATATTAACCAGGGTGTTTTTTTTTTTCAATATGTTATTAATGCAATTCAAAGAATTCCAAAGGTTTCTTATTATATTTTTATGAGACTAACTCTTACATGTATTTCAGGTTGCTTTACTGAATGAAGATAGAATTATATTAAGCAGCAAACACTTCAGTTTATCTAAAAAAGTGAGGTCGGGGGAGATACTTGAGCTACCAAAATACTTGGTGGAGATTGGTGAGGCTTGTGAAAGTGTTAAAGGTCCGATGCACATGTAATATTTTGGTTCCTAAACCAATATTCGTATGGCTATCTAATAATTTTTCTGGCTTGCTATGCGACCTTTGGTCAAATGCAGTAGAGCTCGGCAACAGAAATTTTGATATAAGAAAAGACGCAAGTTTTTGCCTTTCTGGTGGAAATGAAGAGGGATCGGGCAGAGAGACTATGAAAAAGTCTTTACGGGATGGTTTGTGAATGGTTACTTGATGAATTCTGTCTGAACGTCTTAGTTGTTTATGCCTTGTTCCTTCTGAACTAAATATTTATTTGTTTTATGTCATGAACATGAAGCCCATCAAATATTGTCCATTCTTCAAAGGCCAAGGGCTAGAGTGAGCCTTTCTTCAGGCCATACTG
ATGAAAATATTAGCGTATCTGTTTTGTCATATGAGGTTCCTGAACCTTCCCTTGCGGAGGCATTACATCTTTCTGTAGATGACCAATCTCATAAAAAACCAAGTGAAAGACTGGACACGAGGGAATCAACTAAGAATGCAGAAAACAACCAATCCATTGCTCTAACTCAATGTGATGTCGTGCAAAAGCACCTCAATTTAAGTCTAAAATTTCATAATTTTACTATGAGATTGTCGTGTCAAATTACAGACTTATTTCAAATTTGATCCTGCAGCAACATTCACTGGCAATGCCGGAACATTAACAGAAGATGTTGAAATTGGACACTCCAGCCAGGTGTGGTTAAATTTCTCTTTGGTGTCCTTTGAATGTACATTAGATGATTGTCATATAAGATCTTTAGTAAGTTTCCCATGACTTGACTCTGGCTTTATTCCTTTCCGAAATGTTTGTGTAGCTTTTTCGGTCAGACCATGTGGAGGCCGAAAGTATTTCTCTCAGAAATACAATTTCTAGGACTCGAGCTTCAAGTGACTCTGCTGCTTGTTGCCTTGTTAACGATGAAGGGAAAATCTGCGAGGAGATTACATATGAAAGAGAACTGGATGCATGCCCAAGTTTTGATCTTGGAATTTGA

mRNA sequence

ATGGGAGAAATGAATCGATGGAAGGTGACCTATACCAAGCACCTCAAGCAGAAGCGCAAAGTTTACCACGATGGTTTCTTAGACATCCACCGTTCCAGCAACAAGATTGACACACAGACAATGCTCTATGATGAGTGCGAGAAGCTCCTCGAATGCAGGATCCTAAAGCAAGATGAAGTTATTTGCTCTGGCGAAACGCTTATATTCAATAGTTACCTTGTTGACATTGACACTCCTCTGGGAGATCACAAGCCGGAGTCTGGTCTGAATTTCCAAGCAGGTGATGATAAGATATCTGAAAAATCTGGTGTACTGCGAGGGAAAAATTTCCGAAATAACTCAGTTTGCTTTGAATTTAAGAAGAGAAGATTGAAGTGCTATGGATCGCCACAAAGTAGTCCGGACACTAGAAAGACAGAAGAGACAGAGTGGCAGGTCCTTTACACCACAAATATTACTCAGAAAGCTAAGAAGTATCATGATGGTTTCTTAAAACTTTCAATTTGTGGATTCCTCGGGAGGCAGGTCACGCTCTTTGATGAAAACAGGAAACTATTGGATAGCAGATTTATTAAGAAAGATGAAACAGTAAAATGTGGAGAGTCAATGGCCTTTGATGCTCACTTAGTAGAAATTGGAGAATGTGAAAGTGACCATAAGCCTCCTAAAATTCCTATAAATCAAGGTAGTAGCAGTTCTGGAAATGGGGGAACCAGGGTACTGCATGGACAGAAAAGCTGTTTCAGTGAAAATGAAATATCAACTGGAAAAGAATGGAATGCTTTGTACACTAGCCAGATAACTCAGAAGTCCAAGAAATATCACAACGGGATCATCAAAATTTCCTCCTCTGGCTCTCTCCATATGCAGGTTGCTTTACTGAATGAAGATAGAATTATATTAAGCAGCAAACACTTCAGTTTATCTAAAAAAGTGAGGTCGGGGGAGATACTTGAGCTACCAAAATACTTGGTGGAGATTGGTGAGGCTTGTGAAAGTGTTAAAGTAGAGCTCGGCAACAGAAATTTTGATATAAGAAAAGACGCAAGTTTTTGCCTTTCTGGTGGAAATGAAGAGGGATCGGGCAGAGAGACTATGAAAAAGTCTTTACGGGATGCCCATCAAATATTGTCCATTCTTCAAAGGCCAAGGGCTAGAGTGAGCCTTTCTTCAGGCCATACTGATGAAAATATTAGCGTATCTGTTTTGTCATATGAGGTTCCTGAACCTTCCCTTGCGGAGGCATTACATCTTTCTGTAGATGACCAATCTCATAAAAAACCAAGTGAAAGACTGGACACGAGGGAATCAACTAAGAATGCAGAAAACAACCAATCCATTGCTCTAACTCAATCAACATTCACTGGCAATGCCGGAACATTAACAGAAGATGTTGAAATTGGACACTCCAGCCAGCTTTTTCGGTCAGACCATGTGGAGGCCGAAAGTATTTCTCTCAGAAATACAATTTCTAGGACTCGAGCTTCAAGTGACTCTGCTGCTTGTTGCCTTGTTAACGATGAAGGGAAAATCTGCGAGGAGATTACATATGAAAGAGAACTGGATGCATGCCCAAGTTTTGATCTTGGAATTTGA

Coding sequence (CDS)

ATGGGAGAAATGAATCGATGGAAGGTGACCTATACCAAGCACCTCAAGCAGAAGCGCAAAGTTTACCACGATGGTTTCTTAGACATCCACCGTTCCAGCAACAAGATTGACACACAGACAATGCTCTATGATGAGTGCGAGAAGCTCCTCGAATGCAGGATCCTAAAGCAAGATGAAGTTATTTGCTCTGGCGAAACGCTTATATTCAATAGTTACCTTGTTGACATTGACACTCCTCTGGGAGATCACAAGCCGGAGTCTGGTCTGAATTTCCAAGCAGGTGATGATAAGATATCTGAAAAATCTGGTGTACTGCGAGGGAAAAATTTCCGAAATAACTCAGTTTGCTTTGAATTTAAGAAGAGAAGATTGAAGTGCTATGGATCGCCACAAAGTAGTCCGGACACTAGAAAGACAGAAGAGACAGAGTGGCAGGTCCTTTACACCACAAATATTACTCAGAAAGCTAAGAAGTATCATGATGGTTTCTTAAAACTTTCAATTTGTGGATTCCTCGGGAGGCAGGTCACGCTCTTTGATGAAAACAGGAAACTATTGGATAGCAGATTTATTAAGAAAGATGAAACAGTAAAATGTGGAGAGTCAATGGCCTTTGATGCTCACTTAGTAGAAATTGGAGAATGTGAAAGTGACCATAAGCCTCCTAAAATTCCTATAAATCAAGGTAGTAGCAGTTCTGGAAATGGGGGAACCAGGGTACTGCATGGACAGAAAAGCTGTTTCAGTGAAAATGAAATATCAACTGGAAAAGAATGGAATGCTTTGTACACTAGCCAGATAACTCAGAAGTCCAAGAAATATCACAACGGGATCATCAAAATTTCCTCCTCTGGCTCTCTCCATATGCAGGTTGCTTTACTGAATGAAGATAGAATTATATTAAGCAGCAAACACTTCAGTTTATCTAAAAAAGTGAGGTCGGGGGAGATACTTGAGCTACCAAAATACTTGGTGGAGATTGGTGAGGCTTGTGAAAGTGTTAAAGTAGAGCTCGGCAACAGAAATTTTGATATAAGAAAAGACGCAAGTTTTTGCCTTTCTGGTGGAAATGAAGAGGGATCGGGCAGAGAGACTATGAAAAAGTCTTTACGGGATGCCCATCAAATATTGTCCATTCTTCAAAGGCCAAGGGCTAGAGTGAGCCTTTCTTCAGGCCATACTGATGAAAATATTAGCGTATCTGTTTTGTCATATGAGGTTCCTGAACCTTCCCTTGCGGAGGCATTACATCTTTCTGTAGATGACCAATCTCATAAAAAACCAAGTGAAAGACTGGACACGAGGGAATCAACTAAGAATGCAGAAAACAACCAATCCATTGCTCTAACTCAATCAACATTCACTGGCAATGCCGGAACATTAACAGAAGATGTTGAAATTGGACACTCCAGCCAGCTTTTTCGGTCAGACCATGTGGAGGCCGAAAGTATTTCTCTCAGAAATACAATTTCTAGGACTCGAGCTTCAAGTGACTCTGCTGCTTGTTGCCTTGTTAACGATGAAGGGAAAATCTGCGAGGAGATTACATATGAAAGAGAACTGGATGCATGCCCAAGTTTTGATCTTGGAATTTGA

Protein sequence

MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCFEFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDENISVSVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERELDACPSFDLGI
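The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the record does not state explicitly; `translate` is a hypothetical helper, and only short excerpts from the start and end of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed: table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGGAGAAATGAATCGATGG"))  # first seven codons of the CDS
print(translate("GATCTTGGAATTTGA"))        # last five codons, including the stop
```

The two excerpts translate to MGEMNRW and DLGI, matching the first and last residues of the protein sequence above.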
Homology
BLAST of HG10009044 vs. NCBI nr
Match: XP_038874789.1 (uncharacterized protein LOC120067307 isoform X3 [Benincasa hispida])

HSP 1 Score: 911.4 bits (2354), Expect = 3.7e-261
Identity = 475/551 (86.21%), Positives = 492/551 (89.29%), Query Frame = 0

Query: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEV 60
           MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK    TMLYDECEKLLECRILKQDEV
Sbjct: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK----TMLYDECEKLLECRILKQDEV 60

Query: 61  ICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF--- 120
           I SGETLIFNSYLVDIDTPLGDHKPES LNFQ GDDKISEKSGVLRGKNFRNNSV F   
Sbjct: 61  IGSGETLIFNSYLVDIDTPLGDHKPESDLNFQPGDDKISEKSGVLRGKNFRNNSVSFAST 120

Query: 121 -----------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYH 180
                            EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+H
Sbjct: 121 EKNKARPSLSPSHRIIREFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKFH 180

Query: 181 DGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHK 240
           DGFLKLSICG LGRQV LFDENRKLLDSRFIKKDETVK GES+AFDAHLVEIGECE DHK
Sbjct: 181 DGFLKLSICGSLGRQVMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHK 240

Query: 241 PPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK 300
           PPKI  NQG SSSG GGTRVLHG+K+CFSENEISTGKEWN LYTSQ+TQKSKKYHNGIIK
Sbjct: 241 PPKILSNQG-SSSGEGGTRVLHGRKNCFSENEISTGKEWNVLYTSQMTQKSKKYHNGIIK 300

Query: 301 ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGN 360
           +SSSGS   QV LLNEDR ILSSKHFSLSK VR GEILELPKYLVEIGEACE+VKVELGN
Sbjct: 301 VSSSGSHLRQVTLLNEDRSILSSKHFSLSKNVRIGEILELPKYLVEIGEACENVKVELGN 360

Query: 361 RNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDENISV 420
           RNFDIRKDASFC+SGG+E+GSGRETMKKSLR+AHQILSILQRPRARV LSSGH DENISV
Sbjct: 361 RNFDIRKDASFCISGGDEKGSGRETMKKSLRNAHQILSILQRPRARVILSSGHMDENISV 420

Query: 421 SVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGT 480
           SV SY  PEPSLAEALHLS+DDQSH++PSE  D RESTKNAENNQSI LTQ TFTGNAGT
Sbjct: 421 SVSSYN-PEPSLAEALHLSIDDQSHQQPSEGQDKRESTKNAENNQSIVLTQPTFTGNAGT 480

Query: 481 LTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERE 532
           LTEDVEIGHSSQL RSDH EAE ISLRN+ISRTR SSD+AAC LVNDEGKICEEITYERE
Sbjct: 481 LTEDVEIGHSSQLLRSDHEEAEIISLRNSISRTRTSSDTAACSLVNDEGKICEEITYERE 540
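The percentages in an HSP statistics line are simply the reported counts divided by the alignment length. As a hedged illustration, the sketch below recomputes them from the line for this first hit; `hsp_percentages` is a hypothetical helper, and it relies only on the `count/length` pairs so it does not depend on how the field labels are spelled.

```python
import re

def hsp_percentages(stats_line):
    """Return the percentage for every 'count/length' pair in an HSP line."""
    pairs = re.findall(r"=\s*(\d+)/(\d+)", stats_line)
    return [round(100 * int(count) / int(length), 2) for count, length in pairs]

line = "Identity = 475/551 (86.21%), Positives = 492/551 (89.29%), Query Frame = 0"
print(hsp_percentages(line))  # [86.21, 89.29]
```

The first value is the identity and the second is the fraction of positive-scoring columns; `Query Frame = 0` is ignored because it contains no `count/length` pair.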

BLAST of HG10009044 vs. NCBI nr
Match: XP_038874787.1 (uncharacterized protein LOC120067307 isoform X1 [Benincasa hispida])

HSP 1 Score: 904.8 bits (2337), Expect = 3.5e-259
Identity = 475/559 (84.97%), Positives = 493/559 (88.19%), Query Frame = 0

Query: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEV 60
           MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK    TMLYDECEKLLECRILKQDEV
Sbjct: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK----TMLYDECEKLLECRILKQDEV 60

Query: 61  ICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF--- 120
           I SGETLIFNSYLVDIDTPLGDHKPES LNFQ GDDKISEKSGVLRGKNFRNNSV F   
Sbjct: 61  IGSGETLIFNSYLVDIDTPLGDHKPESDLNFQPGDDKISEKSGVLRGKNFRNNSVSFAST 120

Query: 121 -----------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYH 180
                            EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+H
Sbjct: 121 EKNKARPSLSPSHRIIREFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKFH 180

Query: 181 DGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHK 240
           DGFLKLSICG LGRQV LFDENRKLLDSRFIKKDETVK GES+AFDAHLVEIGECE DHK
Sbjct: 181 DGFLKLSICGSLGRQVMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHK 240

Query: 241 PPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK 300
           PPKI  NQG SSSG GGTRVLHG+K+CFSENEISTGKEWN LYTSQ+TQKSKKYHNGIIK
Sbjct: 241 PPKILSNQG-SSSGEGGTRVLHGRKNCFSENEISTGKEWNVLYTSQMTQKSKKYHNGIIK 300

Query: 301 ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGN 360
           +SSSGS   QV LLNEDR ILSSKHFSLSK VR GEILELPKYLVEIGEACE+VKVELGN
Sbjct: 301 VSSSGSHLRQVTLLNEDRSILSSKHFSLSKNVRIGEILELPKYLVEIGEACENVKVELGN 360

Query: 361 RNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDENISV 420
           RNFDIRKDASFC+SGG+E+GSGRETMKKSLR+AHQILSILQRPRARV LSSGH DENISV
Sbjct: 361 RNFDIRKDASFCISGGDEKGSGRETMKKSLRNAHQILSILQRPRARVILSSGHMDENISV 420

Query: 421 SVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQ--------S 480
           SV SY  PEPSLAEALHLS+DDQSH++PSE  D RESTKNAENNQSI LTQ        +
Sbjct: 421 SVSSYN-PEPSLAEALHLSIDDQSHQQPSEGQDKRESTKNAENNQSIVLTQRDDVQMQLT 480

Query: 481 TFTGNAGTLTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKIC 532
           TFTGNAGTLTEDVEIGHSSQL RSDH EAE ISLRN+ISRTR SSD+AAC LVNDEGKIC
Sbjct: 481 TFTGNAGTLTEDVEIGHSSQLLRSDHEEAEIISLRNSISRTRTSSDTAACSLVNDEGKIC 540

BLAST of HG10009044 vs. NCBI nr
Match: XP_038874788.1 (uncharacterized protein LOC120067307 isoform X2 [Benincasa hispida])

HSP 1 Score: 898.7 bits (2321), Expect = 2.5e-257
Identity = 474/559 (84.79%), Positives = 492/559 (88.01%), Query Frame = 0

Query: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEV 60
           MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK    TMLYDECEKLLECRILKQDEV
Sbjct: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK----TMLYDECEKLLECRILKQDEV 60

Query: 61  ICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF--- 120
           I SGETLIFNSYLVDIDTPLGDHKPES LNFQ GDDKISEKSGVLRGKNFRNNSV F   
Sbjct: 61  IGSGETLIFNSYLVDIDTPLGDHKPESDLNFQPGDDKISEKSGVLRGKNFRNNSVSFAST 120

Query: 121 -----------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYH 180
                            EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+H
Sbjct: 121 EKNKARPSLSPSHRIIREFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKFH 180

Query: 181 DGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHK 240
           DGFLKLSICG LGRQV LFDENRKLLDSRFIKKDETVK GES+AFDAHLVEIGECE DHK
Sbjct: 181 DGFLKLSICGSLGRQVMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHK 240

Query: 241 PPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK 300
           PPKI  NQG SSSG GGTRVLHG+K+CFSENEISTGKEWN LYTSQ+TQKSKKYHNGIIK
Sbjct: 241 PPKILSNQG-SSSGEGGTRVLHGRKNCFSENEISTGKEWNVLYTSQMTQKSKKYHNGIIK 300

Query: 301 ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGN 360
           +SSSGS   QV LLNEDR ILSSKHFSLSK VR GEILELPKYLVEIGEACE+VK ELGN
Sbjct: 301 VSSSGSHLRQVTLLNEDRSILSSKHFSLSKNVRIGEILELPKYLVEIGEACENVK-ELGN 360

Query: 361 RNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDENISV 420
           RNFDIRKDASFC+SGG+E+GSGRETMKKSLR+AHQILSILQRPRARV LSSGH DENISV
Sbjct: 361 RNFDIRKDASFCISGGDEKGSGRETMKKSLRNAHQILSILQRPRARVILSSGHMDENISV 420

Query: 421 SVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQ--------S 480
           SV SY  PEPSLAEALHLS+DDQSH++PSE  D RESTKNAENNQSI LTQ        +
Sbjct: 421 SVSSYN-PEPSLAEALHLSIDDQSHQQPSEGQDKRESTKNAENNQSIVLTQRDDVQMQLT 480

Query: 481 TFTGNAGTLTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKIC 532
           TFTGNAGTLTEDVEIGHSSQL RSDH EAE ISLRN+ISRTR SSD+AAC LVNDEGKIC
Sbjct: 481 TFTGNAGTLTEDVEIGHSSQLLRSDHEEAEIISLRNSISRTRTSSDTAACSLVNDEGKIC 540

BLAST of HG10009044 vs. NCBI nr
Match: XP_022928770.1 (uncharacterized protein LOC111435594 isoform X2 [Cucurbita moschata])

HSP 1 Score: 834.7 bits (2155), Expect = 4.4e-238
Identity = 435/555 (78.38%), Positives = 476/555 (85.77%), Query Frame = 0

Query: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEV 60
           M EMNRWKVTYTKHLKQ+RKVYHDGFLD+HRSSNK    TMLYDECEKLLECRILKQ+EV
Sbjct: 1   MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNK----TMLYDECEKLLECRILKQEEV 60

Query: 61  ICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF--- 120
           +CSGETLIFNSYLVDIDTPLGDHKPESGLNFQAG DKI EKSGVLRGKNFRNNSVCF   
Sbjct: 61  VCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGVLRGKNFRNNSVCFENK 120

Query: 121 --------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAK 180
                               EFKK RLKCYGSPQSSPDTR+TEETEWQVL+T+NITQKAK
Sbjct: 121 ASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETEWQVLHTSNITQKAK 180

Query: 181 KYHDGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECES 240
           KYHDGFLKL ICG LGRQV LFDENRKLLDSRF+KKDETVK GES+AFDAHLV+IGECE 
Sbjct: 181 KYHDGFLKLLICGSLGRQVMLFDENRKLLDSRFMKKDETVKSGESIAFDAHLVDIGECER 240

Query: 241 DHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNG 300
           +HKPPKIP++QG SS G+ GTRVL+  K CFSENEISTGKEW+ LYTSQITQKSKKYHNG
Sbjct: 241 EHKPPKIPLSQG-SSFGDRGTRVLNEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNG 300

Query: 301 IIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVE 360
           IIKISSSGS HMQV LLNEDR ILSSKH SLSKK+  GEILELPKYLVEIGEAC +VKVE
Sbjct: 301 IIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIGEACGNVKVE 360

Query: 361 LGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDEN 420
           + NR+FDIRKD SFC+SG +E+GS R TMKKSLRDAH+ILSILQRP+ARVSLSSGH+D+N
Sbjct: 361 IANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKN 420

Query: 421 ISVSVLSYEVPEPSL-AEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTG 480
           I VSV S +VPEPSL AEAL L +DD+SHKKPSE LDTR+STKNAENNQSIALT S    
Sbjct: 421 ICVSVPSSKVPEPSLAAEALDLPMDDRSHKKPSENLDTRDSTKNAENNQSIALTPS---- 480

Query: 481 NAGTLTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEIT 532
              TLTE++EIGHS+QL +++HVEAES SLR+TISRT+ +S  AAC LVNDEGK+CEEIT
Sbjct: 481 ---TLTEELEIGHSNQLLQTEHVEAESSSLRDTISRTQGTSQFAACELVNDEGKMCEEIT 540

BLAST of HG10009044 vs. NCBI nr
Match: XP_011654696.1 (uncharacterized protein LOC101209453 isoform X2 [Cucumis sativus])

HSP 1 Score: 830.5 bits (2144), Expect = 8.3e-237
Identity = 445/552 (80.62%), Positives = 468/552 (84.78%), Query Frame = 0

Query: 1   MGEM-NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDE 60
           MGE+ NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK    TMLYDECEKLLECR+LKQDE
Sbjct: 1   MGEITNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK----TMLYDECEKLLECRMLKQDE 60

Query: 61  VICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF-- 120
           VICSGETLIFNS+LVDIDTPLGD KPESGLNFQ GDDKISE SGV+RGK+  NNSVC   
Sbjct: 61  VICSGETLIFNSFLVDIDTPLGDQKPESGLNFQEGDDKISENSGVVRGKSILNNSVCSGA 120

Query: 121 -----------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYH 180
                            EFKKRRLKCYGSPQ+S DTRKTEETEWQVLYTTNITQKAKK+H
Sbjct: 121 EKNKTRPSFSPSQQIIREFKKRRLKCYGSPQTSLDTRKTEETEWQVLYTTNITQKAKKFH 180

Query: 181 DGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHK 240
           DGFLKLSICG LG QV LFDENRKLLDSRFIKK ETVK GES+AFDAHLVEIGECE DHK
Sbjct: 181 DGFLKLSICGSLGSQVMLFDENRKLLDSRFIKKHETVKSGESIAFDAHLVEIGECEKDHK 240

Query: 241 PPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK 300
           P KIP+N+G+SS   GG  VLHGQKSCFSENEISTGKEWN LYTSQITQKSKKYHNGIIK
Sbjct: 241 PSKIPLNEGTSSK-EGGASVLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIK 300

Query: 301 ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGN 360
           ISSSGS  MQV LLNEDR ILS KH SLSK VR GE LELPKYLVEIGEACESVKVELG+
Sbjct: 301 ISSSGSHQMQVTLLNEDRNILSRKHLSLSKNVRVGEKLELPKYLVEIGEACESVKVELGD 360

Query: 361 RNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDENISV 420
           R  DIRKDASFC+SGG+E GSGRET +KSLRDAHQILSILQRPR RV+LSSGHTDENISV
Sbjct: 361 RKCDIRKDASFCISGGDENGSGRETTQKSLRDAHQILSILQRPRGRVNLSSGHTDENISV 420

Query: 421 SVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGT 480
           SV S   P+PSLAEALHL  D QSH+KPSE  +TRES KN EN+QSIALTQSTFTGNA T
Sbjct: 421 SVSSRN-PKPSLAEALHLPKDYQSHQKPSEGQNTRESIKNTENSQSIALTQSTFTGNAET 480

Query: 481 LTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVN-DEGKICEEITYER 532
           LTED E G SS+L RSDHVEAESISLRN+I R    SDS AC LVN DEGKICEEITYER
Sbjct: 481 LTEDGEFGQSSKLLRSDHVEAESISLRNSIPR---RSDSTACSLVNDDEGKICEEITYER 540

BLAST of HG10009044 vs. ExPASy TrEMBL
Match: A0A6J1ESH9 (uncharacterized protein LOC111435594 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111435594 PE=4 SV=1)

HSP 1 Score: 834.7 bits (2155), Expect = 2.1e-238
Identity = 435/555 (78.38%), Positives = 476/555 (85.77%), Query Frame = 0

Query: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEV 60
           M EMNRWKVTYTKHLKQ+RKVYHDGFLD+HRSSNK    TMLYDECEKLLECRILKQ+EV
Sbjct: 1   MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNK----TMLYDECEKLLECRILKQEEV 60

Query: 61  ICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF--- 120
           +CSGETLIFNSYLVDIDTPLGDHKPESGLNFQAG DKI EKSGVLRGKNFRNNSVCF   
Sbjct: 61  VCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGVLRGKNFRNNSVCFENK 120

Query: 121 --------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAK 180
                               EFKK RLKCYGSPQSSPDTR+TEETEWQVL+T+NITQKAK
Sbjct: 121 ASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETEWQVLHTSNITQKAK 180

Query: 181 KYHDGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECES 240
           KYHDGFLKL ICG LGRQV LFDENRKLLDSRF+KKDETVK GES+AFDAHLV+IGECE 
Sbjct: 181 KYHDGFLKLLICGSLGRQVMLFDENRKLLDSRFMKKDETVKSGESIAFDAHLVDIGECER 240

Query: 241 DHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNG 300
           +HKPPKIP++QG SS G+ GTRVL+  K CFSENEISTGKEW+ LYTSQITQKSKKYHNG
Sbjct: 241 EHKPPKIPLSQG-SSFGDRGTRVLNEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNG 300

Query: 301 IIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVE 360
           IIKISSSGS HMQV LLNEDR ILSSKH SLSKK+  GEILELPKYLVEIGEAC +VKVE
Sbjct: 301 IIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIGEACGNVKVE 360

Query: 361 LGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDEN 420
           + NR+FDIRKD SFC+SG +E+GS R TMKKSLRDAH+ILSILQRP+ARVSLSSGH+D+N
Sbjct: 361 IANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKN 420

Query: 421 ISVSVLSYEVPEPSL-AEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTG 480
           I VSV S +VPEPSL AEAL L +DD+SHKKPSE LDTR+STKNAENNQSIALT S    
Sbjct: 421 ICVSVPSSKVPEPSLAAEALDLPMDDRSHKKPSENLDTRDSTKNAENNQSIALTPS---- 480

Query: 481 NAGTLTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEIT 532
              TLTE++EIGHS+QL +++HVEAES SLR+TISRT+ +S  AAC LVNDEGK+CEEIT
Sbjct: 481 ---TLTEELEIGHSNQLLQTEHVEAESSSLRDTISRTQGTSQFAACELVNDEGKMCEEIT 540

BLAST of HG10009044 vs. ExPASy TrEMBL
Match: A0A0A0KQR3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G148820 PE=4 SV=1)

HSP 1 Score: 830.5 bits (2144), Expect = 4.0e-237
Identity = 445/552 (80.62%), Positives = 468/552 (84.78%), Query Frame = 0

Query: 1   MGEM-NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDE 60
           MGE+ NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK    TMLYDECEKLLECR+LKQDE
Sbjct: 1   MGEITNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK----TMLYDECEKLLECRMLKQDE 60

Query: 61  VICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF-- 120
           VICSGETLIFNS+LVDIDTPLGD KPESGLNFQ GDDKISE SGV+RGK+  NNSVC   
Sbjct: 61  VICSGETLIFNSFLVDIDTPLGDQKPESGLNFQEGDDKISENSGVVRGKSILNNSVCSGA 120

Query: 121 -----------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYH 180
                            EFKKRRLKCYGSPQ+S DTRKTEETEWQVLYTTNITQKAKK+H
Sbjct: 121 EKNKTRPSFSPSQQIIREFKKRRLKCYGSPQTSLDTRKTEETEWQVLYTTNITQKAKKFH 180

Query: 181 DGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHK 240
           DGFLKLSICG LG QV LFDENRKLLDSRFIKK ETVK GES+AFDAHLVEIGECE DHK
Sbjct: 181 DGFLKLSICGSLGSQVMLFDENRKLLDSRFIKKHETVKSGESIAFDAHLVEIGECEKDHK 240

Query: 241 PPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK 300
           P KIP+N+G+SS   GG  VLHGQKSCFSENEISTGKEWN LYTSQITQKSKKYHNGIIK
Sbjct: 241 PSKIPLNEGTSSK-EGGASVLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIK 300

Query: 301 ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGN 360
           ISSSGS  MQV LLNEDR ILS KH SLSK VR GE LELPKYLVEIGEACESVKVELG+
Sbjct: 301 ISSSGSHQMQVTLLNEDRNILSRKHLSLSKNVRVGEKLELPKYLVEIGEACESVKVELGD 360

Query: 361 RNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDENISV 420
           R  DIRKDASFC+SGG+E GSGRET +KSLRDAHQILSILQRPR RV+LSSGHTDENISV
Sbjct: 361 RKCDIRKDASFCISGGDENGSGRETTQKSLRDAHQILSILQRPRGRVNLSSGHTDENISV 420

Query: 421 SVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGT 480
           SV S   P+PSLAEALHL  D QSH+KPSE  +TRES KN EN+QSIALTQSTFTGNA T
Sbjct: 421 SVSSRN-PKPSLAEALHLPKDYQSHQKPSEGQNTRESIKNTENSQSIALTQSTFTGNAET 480

Query: 481 LTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVN-DEGKICEEITYER 532
           LTED E G SS+L RSDHVEAESISLRN+I R    SDS AC LVN DEGKICEEITYER
Sbjct: 481 LTEDGEFGQSSKLLRSDHVEAESISLRNSIPR---RSDSTACSLVNDDEGKICEEITYER 540

BLAST of HG10009044 vs. ExPASy TrEMBL
Match: A0A1S4E617 (uncharacterized protein LOC103482830 isoform X4 OS=Cucumis melo OX=3656 GN=LOC103482830 PE=4 SV=1)

HSP 1 Score: 830.1 bits (2143), Expect = 5.2e-237
Identity = 446/554 (80.51%), Positives = 474/554 (85.56%), Query Frame = 0

Query: 4   MNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICS 63
           MNRWKVTYT HLKQKRKVYHDGFLDIHRSSNK    TMLYDECEKLLECRIL++DEVICS
Sbjct: 1   MNRWKVTYTNHLKQKRKVYHDGFLDIHRSSNK----TMLYDECEKLLECRILRKDEVICS 60

Query: 64  GETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF------ 123
           GETLIFNS+LVDIDTPLGDHKPE GLNFQ GDDKISEKSGVLRGK+ RNNSVCF      
Sbjct: 61  GETLIFNSFLVDIDTPLGDHKPEFGLNFQEGDDKISEKSGVLRGKSIRNNSVCFASAEKN 120

Query: 124 --------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGF 183
                         EFKKRRLK YGSPQ+SPDTRKTEETEWQVLYTTNITQKAKK+HDGF
Sbjct: 121 KTRPSLSPSHQIIREFKKRRLKSYGSPQTSPDTRKTEETEWQVLYTTNITQKAKKFHDGF 180

Query: 184 LKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPK 243
           LKLSICG LG QV LFDENRKLL+SRFIKK ETVK GES+AFDAHLVEIGECE DHKP K
Sbjct: 181 LKLSICGSLGSQVMLFDENRKLLNSRFIKKHETVKSGESIAFDAHLVEIGECEKDHKPSK 240

Query: 244 IPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIKISS 303
           IP+N+G+SS   GG RVLHGQKSCFSENEIS GKEW+ LYTSQITQKSKKY NGIIKISS
Sbjct: 241 IPLNEGTSSK-EGGDRVLHGQKSCFSENEISAGKEWHVLYTSQITQKSKKYQNGIIKISS 300

Query: 304 SGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNF 363
           SGS  MQV LLNEDR ILS KH SLSK V+ GE LELPKYLVEIGEACESVKVELGNRNF
Sbjct: 301 SGSHQMQVTLLNEDRNILSRKHLSLSKNVKVGEKLELPKYLVEIGEACESVKVELGNRNF 360

Query: 364 DIRKDASFCLSGGNEEGSGRETMKKSLRD-----AHQILSILQRPRARVSLSSGHTDENI 423
           DIRKDASFC+SGG+E+GSGRET +KSLRD     AHQILSILQRPRARV+LSSGHTDENI
Sbjct: 361 DIRKDASFCISGGDEKGSGRETTQKSLRDVLYHEAHQILSILQRPRARVNLSSGHTDENI 420

Query: 424 SVSVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNA 483
           SVSV S   P+PS+AEALHL +DDQSH+KPSE  +TRES KNAEN+QSIALTQSTFTGNA
Sbjct: 421 SVSVSSRN-PKPSVAEALHLPIDDQSHQKPSEGQNTRESIKNAENSQSIALTQSTFTGNA 480

Query: 484 GTLTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVN-DEGKICEEITY 532
            TLTED EIG SS+L RSDHVEAESISLRN+I R    SDSAA  LVN DEGKIC+EITY
Sbjct: 481 ETLTEDGEIGQSSKL-RSDHVEAESISLRNSIPR---KSDSAAYSLVNDDEGKICQEITY 540

BLAST of HG10009044 vs. ExPASy TrEMBL
Match: A0A6J1I1P0 (protein ZGRF1 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111469041 PE=4 SV=1)

HSP 1 Score: 827.0 bits (2135), Expect = 4.4e-236
Identity = 431/555 (77.66%), Positives = 476/555 (85.77%), Query Frame = 0

Query: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEV 60
           M EMNRWKVTYTKHLKQ+RKVYHDGFLD+HRSSNK    TMLYDECEKLLECRILKQ+EV
Sbjct: 1   MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNK----TMLYDECEKLLECRILKQEEV 60

Query: 61  ICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF--- 120
           +CSGETLIFNSYLV+IDTPLGD+KPESGLNFQAG D+ISEKSGVLRGKNFRNNSVCF   
Sbjct: 61  VCSGETLIFNSYLVEIDTPLGDNKPESGLNFQAGHDEISEKSGVLRGKNFRNNSVCFENK 120

Query: 121 --------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAK 180
                               EFKK RLKCYGSPQ+SPDTR+TEETEWQVLYT+NITQKAK
Sbjct: 121 ASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQTSPDTRQTEETEWQVLYTSNITQKAK 180

Query: 181 KYHDGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECES 240
           KYHDGFLKL ICG LGRQV LFDENRKLLDSRF+KKDE VK GES+AFDAHLV+IGECE 
Sbjct: 181 KYHDGFLKLLICGSLGRQVMLFDENRKLLDSRFMKKDERVKSGESIAFDAHLVDIGECER 240

Query: 241 DHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNG 300
           +HKPPKIP++QG SS G+ GTRVLH  K CFSENEISTGKEW+ LYTSQITQKSKKYHNG
Sbjct: 241 EHKPPKIPVSQG-SSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNG 300

Query: 301 IIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVE 360
           IIKISSSGS HMQV LLNEDRIILSSKH SLSKK+  GEILELPKYLVEIGEACE+VKVE
Sbjct: 301 IIKISSSGSHHMQVTLLNEDRIILSSKHISLSKKLGMGEILELPKYLVEIGEACENVKVE 360

Query: 361 LGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDEN 420
           L NR+FDIRKDASFC+SG +E+GS R TMKKSLRDAH+ILSILQRP+ARVSLSSG +D+N
Sbjct: 361 LANRDFDIRKDASFCISGEDEKGSARATMKKSLRDAHEILSILQRPKARVSLSSGQSDKN 420

Query: 421 ISVSVLSYEVPEPSLA-EALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTG 480
           ISVSV S +VPEPSLA EAL L +D++SH+KPSE LDTRESTKNAE+NQS ALTQST T 
Sbjct: 421 ISVSVSSSKVPEPSLATEALDLPMDERSHQKPSENLDTRESTKNAESNQSFALTQSTLT- 480

Query: 481 NAGTLTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEIT 532
                  ++EIGHS+QL ++++VEAES SLR+TIS T+ +S  AAC LVNDEGK+CEEIT
Sbjct: 481 -------ELEIGHSNQLLQTEYVEAESSSLRDTISWTQGTSQFAACKLVNDEGKMCEEIT 540

BLAST of HG10009044 vs. ExPASy TrEMBL
Match: A0A6J1HXZ5 (protein ZGRF1 isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111469041 PE=4 SV=1)

HSP 1 Score: 819.3 bits (2115), Expect = 9.3e-234
Identity = 430/555 (77.48%), Positives = 474/555 (85.41%), Query Frame = 0

Query: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEV 60
           M EMNRWKVTYTKHLKQ+RKVYHDGFLD+HRSSNK    TMLYDECEKLLECRILKQ+EV
Sbjct: 1   MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNK----TMLYDECEKLLECRILKQEEV 60

Query: 61  ICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF--- 120
           +CSGETLIFNSYLV+IDTPLGD+KPESGLNFQAG D+ISEKSGVLRGKNFRNNSVCF   
Sbjct: 61  VCSGETLIFNSYLVEIDTPLGDNKPESGLNFQAGHDEISEKSGVLRGKNFRNNSVCFENK 120

Query: 121 --------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAK 180
                               EFKK RLKCYGSPQ+SPDTR+TEETEWQVLYT+NITQKAK
Sbjct: 121 ASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQTSPDTRQTEETEWQVLYTSNITQKAK 180

Query: 181 KYHDGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECES 240
           KYHDGFLKL ICG LGRQV LFDENRKLLDSRF+KKDE VK GES+AFDAHLV+IGECE 
Sbjct: 181 KYHDGFLKLLICGSLGRQVMLFDENRKLLDSRFMKKDERVKSGESIAFDAHLVDIGECER 240

Query: 241 DHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNG 300
           +HKPPKIP++QG SS G+ GTRVLH  K CFSENEISTGKEW+ LYTSQITQKSKKYHNG
Sbjct: 241 EHKPPKIPVSQG-SSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNG 300

Query: 301 IIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVE 360
           IIKISSSGS HMQV LLNEDRIILSSKH SLSKK+  GEILELPKYLVEIGEACE+VKVE
Sbjct: 301 IIKISSSGSHHMQVTLLNEDRIILSSKHISLSKKLGMGEILELPKYLVEIGEACENVKVE 360

Query: 361 LGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDEN 420
           L NR+FDIRKDASFC+SG +E+GS R TMKKSLRDAH+ILSILQRP+ARVSLSSG +D+N
Sbjct: 361 LANRDFDIRKDASFCISGEDEKGSARATMKKSLRDAHEILSILQRPKARVSLSSGQSDKN 420

Query: 421 ISVSVLSYEVPEPSLA-EALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTG 480
           ISVSV S +VPEPSLA EAL L +D++SH+KPSE LDTRESTKNAE+NQS ALTQST T 
Sbjct: 421 ISVSVSSSKVPEPSLATEALDLPMDERSHQKPSENLDTRESTKNAESNQSFALTQSTLT- 480

Query: 481 NAGTLTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEIT 532
                  ++EIGHS+Q   +++VEAES SLR+TIS T+ +S  AAC LVNDEGK+CEEIT
Sbjct: 481 -------ELEIGHSNQ---TEYVEAESSSLRDTISWTQGTSQFAACKLVNDEGKMCEEIT 539

BLAST of HG10009044 vs. TAIR 10
Match: AT4G10890.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2439 (InterPro:IPR018838); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G43722.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 106.3 bits (264), Expect = 7.7e-23
Identity = 62/135 (45.93%), Positives = 77/135 (57.04%), Query Frame = 0

Query: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEV 60
           M E  RW   YTKHLKQKRKVYHDGFLD+H +  K+    MLYDE + LLE R LK  EV
Sbjct: 241 MAEKQRWIAMYTKHLKQKRKVYHDGFLDLHIARKKV----MLYDEDDNLLESRTLKACEV 300

Query: 61  ICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSV-CFEF 120
           + +GETL F +YLVDI  P    K  S    +  D K + K   +   NF+ +S+ C E 
Sbjct: 301 VNTGETLTFQAYLVDICDPKDGSKASSEPKVEPSDQKCARKPFTVLRPNFKKSSLRCDEK 360

Query: 121 KKRRLKCYGSPQSSP 135
           K   +  + S   SP
Sbjct: 361 KPDLVNKFSSKSLSP 371

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038874789.1  3.7e-261  86.21         uncharacterized protein LOC120067307 isoform X3 [Benincasa hispida]
XP_038874787.1  3.5e-259  84.97         uncharacterized protein LOC120067307 isoform X1 [Benincasa hispida]
XP_038874788.1  2.5e-257  84.79         uncharacterized protein LOC120067307 isoform X2 [Benincasa hispida]
XP_022928770.1  4.4e-238  78.38         uncharacterized protein LOC111435594 isoform X2 [Cucurbita moschata]
XP_011654696.1  8.3e-237  80.62         uncharacterized protein LOC101209453 isoform X2 [Cucumis sativus]

Match Name      E-value   Identity (%)  Description
A0A6J1ESH9      2.1e-238  78.38         uncharacterized protein LOC111435594 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A0A0KQR3      4.0e-237  80.62         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G148820 PE=4 SV=1
A0A1S4E617      5.2e-237  80.51         uncharacterized protein LOC103482830 isoform X4 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1I1P0      4.4e-236  77.66         protein ZGRF1 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111469041 PE=4 SV=1
A0A6J1HXZ5      9.3e-234  77.48         protein ZGRF1 isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111469041 PE=4 SV=1

Match Name      E-value   Identity (%)  Description
AT4G10890.1     7.7e-23   45.93         unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2439...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                     Source       Source Term  Source Description                 Alignment
IPR018838  Domain of unknown function DUF2439  PFAM         PF10382      DUF2439                            coord: 257..333, e-value: 4.0E-14, score: 52.7; coord: 5..78, e-value: 3.7E-13, score: 49.6; coord: 143..218, e-value: 1.2E-18, score: 67.1
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction                coord: 417..445
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction                coord: 419..438
None       No IPR available                    PANTHER      PTHR28535    ZINC FINGER GRF-TYPE CONTAINING 1  coord: 254..336; coord: 1..252
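The three PF10382 (DUF2439) hits listed above are easier to compare once they are ordered along the protein. The following sketch is illustrative only: the tuples are copied from the coordinates, e-values and scores in the table, the helper names are hypothetical, and lengths assume 1-based inclusive residue coordinates.

```python
# (start, end, e-value, score) for the three PF10382 hits reported above.
pfam_hits = [
    (257, 333, 4.0e-14, 52.7),
    (5, 78, 3.7e-13, 49.6),
    (143, 218, 1.2e-18, 67.1),
]

def summarize(hits):
    """Sort hits by start position and report (start, end, length) for each."""
    ordered = sorted(hits, key=lambda h: h[0])
    # Assumption: residue coordinates are 1-based and inclusive.
    return [(start, end, end - start + 1) for start, end, _, _ in ordered]

def any_overlap(hits):
    """True if two consecutive hits (ordered by start) share residues."""
    ordered = sorted(hits, key=lambda h: h[0])
    return any(a[1] >= b[0] for a, b in zip(ordered, ordered[1:]))

for start, end, length in summarize(pfam_hits):
    print(f"{start}..{end}  ({length} residues)")
print("overlapping:", any_overlap(pfam_hits))
```

Read this way, the hits form three non-overlapping copies of the domain of 74, 76 and 77 residues, in that order from the N-terminus.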

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10009044.1  HG10009044.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006302      double-strand break repair
cellular_component  GO:0035861      site of double-strand break