Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGGAGAAATGAATCGATGGAAGGTGACCTATACCAAGCACCTCAAGCAGAAGCGCAAAGTTTACCACGATGGTTTCTTAGACATCCACCGTTCCAGCAACAAGGTTTCTATTTCTTTCATACTTTTAACTTCACTACGCATCTTTTATTTGAACTCCTAGATATGTAATTCTGCAGAAGAACATGTCGAATTTCGGCGATTTTGAACTGGAAAGTATGGAATTAACTAGCATTGTGGCGCTAAATCCTAGAATTGGATGTAGAGCTTTGGAAGTCTCTTGATGGAAATGATTATAGTTGATAGAATTGCTGAGATGTGATTCAGTAGTTGGTTTATGCTGTTCTGGTTTCGATTCGGAGTAACATTGTGCTAATTTGTTTGATTCCTAGAAATTTGGATTTTCTCATTCTAATTGTTAGCCTAGCCGATTTGTTCTAATGGCTAGATTGACACACAGACAATGCTCTATGATGAGTGCGAGAAGCTCCTCGAATGCAGGATCCTAAAGCAAGATGAAGTTATTTGCTCTGGCGAAACGCTTATATTCAATAGTTACCTTGTTGACATTGACACTCCTCTGGGAGATCACAAGCCGGAGTCTGGTCTGAATTTCCAAGCAGGTGATGATAAGATATCTGAAAAATCTGGTGTACTGCGAGGGAAAAATTTCCGAAATAACTCAGTTTGCTTTGGTAATTAGTCTAGCTTTCCCTTATGGCTTTCTTCTTTTTAAAAAAAGTTCTTTCCATAATCAATGAGCACAGGATTAAGTTTAATTCTCTTATTGCTTGGATTATGATTAGTGTTATTTTCTTGTTTGTTACTTCAAAAACGAAGAAAGTGCAGAGAAGAATAAAACTCGGCCGAGTCTCAGCCCTTCACATAAAGTAATCAGAGGTAACTTACATTATTTGGCTAATGCAACATTTCTGTTATTATTGTTCTCCCAATTGCTTACTTAAAGCGGTTTTAATCATAACTTGGGCTAGATGTAATCTTACTGGTGATTACCATTTATTTCAGACGTGCTATTCTCATCATGTGTGTACCAATGTGCTTTTAGATTTGGCTGTAACACTGTCATTCCCTTTACATTCCAGAATTTAAGAAGAGAAGATTGAAGTGCTATGGATCGCCACAAAGTAGTCCGGACACTAGAAAGACAGAAGAGACAGGTTAGTTCATGATTTCTTTCCTGGGTTGTTATCATCATAATTTTTTTTCTTGGGGAGGGGGGCAAATGTGCAATGGATGTAGAGCATAATGTTGATGAATACGAGTCTGGCCGTCTTGATGTATTTGAAATATGTGAAAGTACTCTATATGTTTACATACGTCAGTTCAGTATGGTAAACAATATTATAAATAGCCAAATACAGAGTGTAACGAAGGTACGTCTGACAAAACAAATTAATAAACAACAGAAACTAAAATCAACTACTTTGCCTAATATGCTTTTTAAGTATATTTTCATTCCGCAGAGTCAAATTTATTATGTATCTACCAATTTATGCCATTCTGTCCTTATCCATCTTCTGTTGTCTTTATTGGCATGATTAACGTTATGATGTTTGTAGAGTGGCAGGTCCTTTACACCACAAATATTACTCAGAAAGCTAAGAAGTATCATGATGGTTTCTTAAAACTTTCAATTTGTGGATTCCTCGGGAGGCAGGTTAACTTCTTGTTTGTTACTTGAATTATCTAGTTTACAGGTCTACCATACTTCTTGATTATTTGTGAACTACTCAATTTGTGAAATTTATATCTTCGACTCAAGATCAATACATATTAGATTTTTTGTAGGGATGTATGTCATCGATATGATGTATTGATCAATTGAAAGGTAGGCCGAGAAACTGAGAGGAGCAATAAAATTTGTAGGCAGGTCGAGGGATCAGTCAGATATGGGAGGTGGCCAACTTTAATGTCTCCTTGTAGGCAATAGTCTCTAGACTTTTTTGTAATTATGATCTAAGTTTTGGTTTTTTGGATTGG
AGTCCTATATAAATAGATTTTTTTTGTTTGGTATGCTCTAGTACTCCACTGTTTTTCTCAATGAAAGAGCAGTTTCGTGGGAAAAAAAGAAAAGAAAAGGTACAATTGAAAGTTAAAGTTTCTGCCTGTACCTCTCCTGGTCTTTCTGCAGGTCACGCTCTTTGATGAAAACAGGAAACTATTGGATAGCAGATTTATTAAGAAAGATGAAACAGTAAAATGTGGAGAGTCAATGGCCTTTGATGCTCACTTAGTAGAAATTGGAGAATGTGAAAGTGACCATAAGCCTCCTAAAATTCCTATAAATCAAGGTAGTAGCAGTTCTGGAAATGGGGGAACCAGGGTACTGCATGGACAGAAAAGCTGTTTCAGTGAAAATGAAATATCAACTGGAAAAGGTCAGCTGTTTTTGCTACATTAATCATGATCATCGTAGATGAAATTTGTCAGCATTGTTCATTTTTTTCCTCTTTCGGCCATAAGGTGTCATGAAGTCTGTTTTCCTCTTGTTGTACGTCTACTTAGGCATTCGTAGTGATTTAGTGCAGAATGGAATGTTTTGTACACTAGTCAGATAACTCAGAAGTCCAAGAAATATCACAACGGGATCATAAAAAATTTCTCTGGCTCTCACCATAGGAGCTGTTTTTGCTACATCAATCATAATCATCGTAGATAAAATTTGTCAGCATTGTTCACCTTTTTCCTCGCCATAAGGTGTCATGAAGTCTGTTTTCCTCTTGTTGTACGTCTACTTAGGCATTGTTTTAGTGCAGAATGGAATGCTTTGTACACTAGCCAGATAACTCAGAAGTCCAAGAAATATCACAACGGGATCATCAAAATTTCCTCCTCTGGCTCTCTCCATATGCAGGTTAATTTACTTTGTCTTGAACATCATGATACCTGCTGTCTGTTTTGCAGTTGTACGAAAATACAATTCGGTCACTCCCCTCCCCCTCCCAGAGAAGAATGTAAATTTTAGTTCACAATATGAGTTTCATGTTTAAGTTCAACTGTTTTCGTAAGCCAAACAACATTTTTGGTGCCATGGATACAGATAGATACATGCCTAAAGGTGCAAATGAGATTGTTGGAAAAGAACATCAGAACTACTGATGCCTTACTCTAACAATAGGTAGTTTACACAAAATTTCAGCTGTTATCAATCAAGTGATCACTTCTGGGACATGCTGGTCATTATCCAGCATTGATAAATGAGTAGCTGGCTAATCATCAAGTAAAAAATAAAACAAAATTCTTATCAGATTTGACCATCACATTTATAGTTCTAGAGTCGTCCGATGACTATGGATTCAAGCTTAGATTGTATAGCAGATGTACAAGAAAGATGGAAGGTTTAGGATAGGATATTAACATATTAACCAGGGTGTTTTTTTTTTTCAATATGTTATTAATGCAATTCAAAGAATTCCAAAGGTTTCTTATTATATTTTTATGAGACTAACTCTTACATGTATTTCAGGTTGCTTTACTGAATGAAGATAGAATTATATTAAGCAGCAAACACTTCAGTTTATCTAAAAAAGTGAGGTCGGGGGAGATACTTGAGCTACCAAAATACTTGGTGGAGATTGGTGAGGCTTGTGAAAGTGTTAAAGGTCCGATGCACATGTAATATTTTGGTTCCTAAACCAATATTCGTATGGCTATCTAATAATTTTTCTGGCTTGCTATGCGACCTTTGGTCAAATGCAGTAGAGCTCGGCAACAGAAATTTTGATATAAGAAAAGACGCAAGTTTTTGCCTTTCTGGTGGAAATGAAGAGGGATCGGGCAGAGAGACTATGAAAAAGTCTTTACGGGATGGTTTGTGAATGGTTACTTGATGAATTCTGTCTGAACGTCTTAGTTGTTTATGCCTTGTTCCTTCTGAACTAAATATTTATTTGTTTTATGTCATGAACATGAAGCCCATCAAATATTGTCCATTCTTCAAAGGCCAAGGGCTAGAGTGAGCCTTTCTTCAGGCCATACTG
ATGAAAATATTAGCGTATCTGTTTTGTCATATGAGGTTCCTGAACCTTCCCTTGCGGAGGCATTACATCTTTCTGTAGATGACCAATCTCATAAAAAACCAAGTGAAAGACTGGACACGAGGGAATCAACTAAGAATGCAGAAAACAACCAATCCATTGCTCTAACTCAATGTGATGTCGTGCAAAAGCACCTCAATTTAAGTCTAAAATTTCATAATTTTACTATGAGATTGTCGTGTCAAATTACAGACTTATTTCAAATTTGATCCTGCAGCAACATTCACTGGCAATGCCGGAACATTAACAGAAGATGTTGAAATTGGACACTCCAGCCAGGTGTGGTTAAATTTCTCTTTGGTGTCCTTTGAATGTACATTAGATGATTGTCATATAAGATCTTTAGTAAGTTTCCCATGACTTGACTCTGGCTTTATTCCTTTCCGAAATGTTTGTGTAGCTTTTTCGGTCAGACCATGTGGAGGCCGAAAGTATTTCTCTCAGAAATACAATTTCTAGGACTCGAGCTTCAAGTGACTCTGCTGCTTGTTGCCTTGTTAACGATGAAGGGAAAATCTGCGAGGAGATTACATATGAAAGAGAACTGGATGCATGCCCAAGTTTTGATCTTGGAATTTGA
mRNA sequence
ATGGGAGAAATGAATCGATGGAAGGTGACCTATACCAAGCACCTCAAGCAGAAGCGCAAAGTTTACCACGATGGTTTCTTAGACATCCACCGTTCCAGCAACAAGATTGACACACAGACAATGCTCTATGATGAGTGCGAGAAGCTCCTCGAATGCAGGATCCTAAAGCAAGATGAAGTTATTTGCTCTGGCGAAACGCTTATATTCAATAGTTACCTTGTTGACATTGACACTCCTCTGGGAGATCACAAGCCGGAGTCTGGTCTGAATTTCCAAGCAGGTGATGATAAGATATCTGAAAAATCTGGTGTACTGCGAGGGAAAAATTTCCGAAATAACTCAGTTTGCTTTGAATTTAAGAAGAGAAGATTGAAGTGCTATGGATCGCCACAAAGTAGTCCGGACACTAGAAAGACAGAAGAGACAGAGTGGCAGGTCCTTTACACCACAAATATTACTCAGAAAGCTAAGAAGTATCATGATGGTTTCTTAAAACTTTCAATTTGTGGATTCCTCGGGAGGCAGGTCACGCTCTTTGATGAAAACAGGAAACTATTGGATAGCAGATTTATTAAGAAAGATGAAACAGTAAAATGTGGAGAGTCAATGGCCTTTGATGCTCACTTAGTAGAAATTGGAGAATGTGAAAGTGACCATAAGCCTCCTAAAATTCCTATAAATCAAGGTAGTAGCAGTTCTGGAAATGGGGGAACCAGGGTACTGCATGGACAGAAAAGCTGTTTCAGTGAAAATGAAATATCAACTGGAAAAGAATGGAATGCTTTGTACACTAGCCAGATAACTCAGAAGTCCAAGAAATATCACAACGGGATCATCAAAATTTCCTCCTCTGGCTCTCTCCATATGCAGGTTGCTTTACTGAATGAAGATAGAATTATATTAAGCAGCAAACACTTCAGTTTATCTAAAAAAGTGAGGTCGGGGGAGATACTTGAGCTACCAAAATACTTGGTGGAGATTGGTGAGGCTTGTGAAAGTGTTAAAGTAGAGCTCGGCAACAGAAATTTTGATATAAGAAAAGACGCAAGTTTTTGCCTTTCTGGTGGAAATGAAGAGGGATCGGGCAGAGAGACTATGAAAAAGTCTTTACGGGATGCCCATCAAATATTGTCCATTCTTCAAAGGCCAAGGGCTAGAGTGAGCCTTTCTTCAGGCCATACTGATGAAAATATTAGCGTATCTGTTTTGTCATATGAGGTTCCTGAACCTTCCCTTGCGGAGGCATTACATCTTTCTGTAGATGACCAATCTCATAAAAAACCAAGTGAAAGACTGGACACGAGGGAATCAACTAAGAATGCAGAAAACAACCAATCCATTGCTCTAACTCAATCAACATTCACTGGCAATGCCGGAACATTAACAGAAGATGTTGAAATTGGACACTCCAGCCAGCTTTTTCGGTCAGACCATGTGGAGGCCGAAAGTATTTCTCTCAGAAATACAATTTCTAGGACTCGAGCTTCAAGTGACTCTGCTGCTTGTTGCCTTGTTAACGATGAAGGGAAAATCTGCGAGGAGATTACATATGAAAGAGAACTGGATGCATGCCCAAGTTTTGATCTTGGAATTTGA
Coding sequence (CDS)
ATGGGAGAAATGAATCGATGGAAGGTGACCTATACCAAGCACCTCAAGCAGAAGCGCAAAGTTTACCACGATGGTTTCTTAGACATCCACCGTTCCAGCAACAAGATTGACACACAGACAATGCTCTATGATGAGTGCGAGAAGCTCCTCGAATGCAGGATCCTAAAGCAAGATGAAGTTATTTGCTCTGGCGAAACGCTTATATTCAATAGTTACCTTGTTGACATTGACACTCCTCTGGGAGATCACAAGCCGGAGTCTGGTCTGAATTTCCAAGCAGGTGATGATAAGATATCTGAAAAATCTGGTGTACTGCGAGGGAAAAATTTCCGAAATAACTCAGTTTGCTTTGAATTTAAGAAGAGAAGATTGAAGTGCTATGGATCGCCACAAAGTAGTCCGGACACTAGAAAGACAGAAGAGACAGAGTGGCAGGTCCTTTACACCACAAATATTACTCAGAAAGCTAAGAAGTATCATGATGGTTTCTTAAAACTTTCAATTTGTGGATTCCTCGGGAGGCAGGTCACGCTCTTTGATGAAAACAGGAAACTATTGGATAGCAGATTTATTAAGAAAGATGAAACAGTAAAATGTGGAGAGTCAATGGCCTTTGATGCTCACTTAGTAGAAATTGGAGAATGTGAAAGTGACCATAAGCCTCCTAAAATTCCTATAAATCAAGGTAGTAGCAGTTCTGGAAATGGGGGAACCAGGGTACTGCATGGACAGAAAAGCTGTTTCAGTGAAAATGAAATATCAACTGGAAAAGAATGGAATGCTTTGTACACTAGCCAGATAACTCAGAAGTCCAAGAAATATCACAACGGGATCATCAAAATTTCCTCCTCTGGCTCTCTCCATATGCAGGTTGCTTTACTGAATGAAGATAGAATTATATTAAGCAGCAAACACTTCAGTTTATCTAAAAAAGTGAGGTCGGGGGAGATACTTGAGCTACCAAAATACTTGGTGGAGATTGGTGAGGCTTGTGAAAGTGTTAAAGTAGAGCTCGGCAACAGAAATTTTGATATAAGAAAAGACGCAAGTTTTTGCCTTTCTGGTGGAAATGAAGAGGGATCGGGCAGAGAGACTATGAAAAAGTCTTTACGGGATGCCCATCAAATATTGTCCATTCTTCAAAGGCCAAGGGCTAGAGTGAGCCTTTCTTCAGGCCATACTGATGAAAATATTAGCGTATCTGTTTTGTCATATGAGGTTCCTGAACCTTCCCTTGCGGAGGCATTACATCTTTCTGTAGATGACCAATCTCATAAAAAACCAAGTGAAAGACTGGACACGAGGGAATCAACTAAGAATGCAGAAAACAACCAATCCATTGCTCTAACTCAATCAACATTCACTGGCAATGCCGGAACATTAACAGAAGATGTTGAAATTGGACACTCCAGCCAGCTTTTTCGGTCAGACCATGTGGAGGCCGAAAGTATTTCTCTCAGAAATACAATTTCTAGGACTCGAGCTTCAAGTGACTCTGCTGCTTGTTGCCTTGTTAACGATGAAGGGAAAATCTGCGAGGAGATTACATATGAAAGAGAACTGGATGCATGCCCAAGTTTTGATCTTGGAATTTGA
Protein sequence
MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCFEFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDENISVSVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGTLTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERELDACPSFDLGI
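The protein sequence above should correspond to a frame-0 translation of the CDS. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the function name `translate` and the short test fragments are illustrative only and are not part of the database record.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as 'X' (an assumption of this sketch).
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nt and last 15 nt of the CDS listed above.
print(translate("ATGGGAGAAATGAATCGATGG"))  # MGEMNRW
print(translate("GATCTTGGAATTTGA"))        # DLGI
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence would confirm that the two records agree.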
Homology
BLAST of HG10009044 vs. NCBI nr
Match:
XP_038874789.1 (uncharacterized protein LOC120067307 isoform X3 [Benincasa hispida])
HSP 1 Score: 911.4 bits (2354), Expect = 3.7e-261
Identity = 475/551 (86.21%), Positives = 492/551 (89.29%), Query Frame = 0
Query: 1 MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEV 60
MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK TMLYDECEKLLECRILKQDEV
Sbjct: 1 MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK----TMLYDECEKLLECRILKQDEV 60
Query: 61 ICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF--- 120
I SGETLIFNSYLVDIDTPLGDHKPES LNFQ GDDKISEKSGVLRGKNFRNNSV F
Sbjct: 61 IGSGETLIFNSYLVDIDTPLGDHKPESDLNFQPGDDKISEKSGVLRGKNFRNNSVSFAST 120
Query: 121 -----------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYH 180
EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+H
Sbjct: 121 EKNKARPSLSPSHRIIREFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKFH 180
Query: 181 DGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHK 240
DGFLKLSICG LGRQV LFDENRKLLDSRFIKKDETVK GES+AFDAHLVEIGECE DHK
Sbjct: 181 DGFLKLSICGSLGRQVMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHK 240
Query: 241 PPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK 300
PPKI NQG SSSG GGTRVLHG+K+CFSENEISTGKEWN LYTSQ+TQKSKKYHNGIIK
Sbjct: 241 PPKILSNQG-SSSGEGGTRVLHGRKNCFSENEISTGKEWNVLYTSQMTQKSKKYHNGIIK 300
Query: 301 ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGN 360
+SSSGS QV LLNEDR ILSSKHFSLSK VR GEILELPKYLVEIGEACE+VKVELGN
Sbjct: 301 VSSSGSHLRQVTLLNEDRSILSSKHFSLSKNVRIGEILELPKYLVEIGEACENVKVELGN 360
Query: 361 RNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDENISV 420
RNFDIRKDASFC+SGG+E+GSGRETMKKSLR+AHQILSILQRPRARV LSSGH DENISV
Sbjct: 361 RNFDIRKDASFCISGGDEKGSGRETMKKSLRNAHQILSILQRPRARVILSSGHMDENISV 420
Query: 421 SVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGT 480
SV SY PEPSLAEALHLS+DDQSH++PSE D RESTKNAENNQSI LTQ TFTGNAGT
Sbjct: 421 SVSSYN-PEPSLAEALHLSIDDQSHQQPSEGQDKRESTKNAENNQSIVLTQPTFTGNAGT 480
Query: 481 LTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEITYERE 532
LTEDVEIGHSSQL RSDH EAE ISLRN+ISRTR SSD+AAC LVNDEGKICEEITYERE
Sbjct: 481 LTEDVEIGHSSQLLRSDHEEAEIISLRNSISRTRTSSDTAACSLVNDEGKICEEITYERE 540
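The percentages on each HSP statistics line are simply the identical (or positive-scoring) positions divided by the alignment length, for example 475/551 = 86.21%. The sketch below parses such a line and recomputes the percentages. The helper name `parse_hsp_stats` is hypothetical, and the pattern accepts both the "Positives" spelling and the "Postives" spelling that some BLAST report pages emit.

```python
import re

# Matches e.g. "Identity = 475/551 (86.21%), Positives = 492/551 (89.29%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP statistics line and recompute the percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, _, positives, pos_length, _ = match.groups()
    return {
        "identical": int(identical),
        "positives": int(positives),
        "aligned_length": int(length),
        "identity_pct": round(100 * int(identical) / int(length), 2),
        "positives_pct": round(100 * int(positives) / int(pos_length), 2),
    }


stats = parse_hsp_stats(
    "Identity = 475/551 (86.21%), Positives = 492/551 (89.29%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # 86.21 89.29
```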
BLAST of HG10009044 vs. NCBI nr
Match:
XP_038874787.1 (uncharacterized protein LOC120067307 isoform X1 [Benincasa hispida])
HSP 1 Score: 904.8 bits (2337), Expect = 3.5e-259
Identity = 475/559 (84.97%), Positives = 493/559 (88.19%), Query Frame = 0
Query: 1 MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEV 60
MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK TMLYDECEKLLECRILKQDEV
Sbjct: 1 MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK----TMLYDECEKLLECRILKQDEV 60
Query: 61 ICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF--- 120
I SGETLIFNSYLVDIDTPLGDHKPES LNFQ GDDKISEKSGVLRGKNFRNNSV F
Sbjct: 61 IGSGETLIFNSYLVDIDTPLGDHKPESDLNFQPGDDKISEKSGVLRGKNFRNNSVSFAST 120
Query: 121 -----------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYH 180
EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+H
Sbjct: 121 EKNKARPSLSPSHRIIREFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKFH 180
Query: 181 DGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHK 240
DGFLKLSICG LGRQV LFDENRKLLDSRFIKKDETVK GES+AFDAHLVEIGECE DHK
Sbjct: 181 DGFLKLSICGSLGRQVMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHK 240
Query: 241 PPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK 300
PPKI NQG SSSG GGTRVLHG+K+CFSENEISTGKEWN LYTSQ+TQKSKKYHNGIIK
Sbjct: 241 PPKILSNQG-SSSGEGGTRVLHGRKNCFSENEISTGKEWNVLYTSQMTQKSKKYHNGIIK 300
Query: 301 ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGN 360
+SSSGS QV LLNEDR ILSSKHFSLSK VR GEILELPKYLVEIGEACE+VKVELGN
Sbjct: 301 VSSSGSHLRQVTLLNEDRSILSSKHFSLSKNVRIGEILELPKYLVEIGEACENVKVELGN 360
Query: 361 RNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDENISV 420
RNFDIRKDASFC+SGG+E+GSGRETMKKSLR+AHQILSILQRPRARV LSSGH DENISV
Sbjct: 361 RNFDIRKDASFCISGGDEKGSGRETMKKSLRNAHQILSILQRPRARVILSSGHMDENISV 420
Query: 421 SVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQ--------S 480
SV SY PEPSLAEALHLS+DDQSH++PSE D RESTKNAENNQSI LTQ +
Sbjct: 421 SVSSYN-PEPSLAEALHLSIDDQSHQQPSEGQDKRESTKNAENNQSIVLTQRDDVQMQLT 480
Query: 481 TFTGNAGTLTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKIC 532
TFTGNAGTLTEDVEIGHSSQL RSDH EAE ISLRN+ISRTR SSD+AAC LVNDEGKIC
Sbjct: 481 TFTGNAGTLTEDVEIGHSSQLLRSDHEEAEIISLRNSISRTRTSSDTAACSLVNDEGKIC 540
BLAST of HG10009044 vs. NCBI nr
Match:
XP_038874788.1 (uncharacterized protein LOC120067307 isoform X2 [Benincasa hispida])
HSP 1 Score: 898.7 bits (2321), Expect = 2.5e-257
Identity = 474/559 (84.79%), Positives = 492/559 (88.01%), Query Frame = 0
Query: 1 MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEV 60
MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK TMLYDECEKLLECRILKQDEV
Sbjct: 1 MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK----TMLYDECEKLLECRILKQDEV 60
Query: 61 ICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF--- 120
I SGETLIFNSYLVDIDTPLGDHKPES LNFQ GDDKISEKSGVLRGKNFRNNSV F
Sbjct: 61 IGSGETLIFNSYLVDIDTPLGDHKPESDLNFQPGDDKISEKSGVLRGKNFRNNSVSFAST 120
Query: 121 -----------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYH 180
EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKK+H
Sbjct: 121 EKNKARPSLSPSHRIIREFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKFH 180
Query: 181 DGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHK 240
DGFLKLSICG LGRQV LFDENRKLLDSRFIKKDETVK GES+AFDAHLVEIGECE DHK
Sbjct: 181 DGFLKLSICGSLGRQVMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHK 240
Query: 241 PPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK 300
PPKI NQG SSSG GGTRVLHG+K+CFSENEISTGKEWN LYTSQ+TQKSKKYHNGIIK
Sbjct: 241 PPKILSNQG-SSSGEGGTRVLHGRKNCFSENEISTGKEWNVLYTSQMTQKSKKYHNGIIK 300
Query: 301 ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGN 360
+SSSGS QV LLNEDR ILSSKHFSLSK VR GEILELPKYLVEIGEACE+VK ELGN
Sbjct: 301 VSSSGSHLRQVTLLNEDRSILSSKHFSLSKNVRIGEILELPKYLVEIGEACENVK-ELGN 360
Query: 361 RNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDENISV 420
RNFDIRKDASFC+SGG+E+GSGRETMKKSLR+AHQILSILQRPRARV LSSGH DENISV
Sbjct: 361 RNFDIRKDASFCISGGDEKGSGRETMKKSLRNAHQILSILQRPRARVILSSGHMDENISV 420
Query: 421 SVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQ--------S 480
SV SY PEPSLAEALHLS+DDQSH++PSE D RESTKNAENNQSI LTQ +
Sbjct: 421 SVSSYN-PEPSLAEALHLSIDDQSHQQPSEGQDKRESTKNAENNQSIVLTQRDDVQMQLT 480
Query: 481 TFTGNAGTLTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKIC 532
TFTGNAGTLTEDVEIGHSSQL RSDH EAE ISLRN+ISRTR SSD+AAC LVNDEGKIC
Sbjct: 481 TFTGNAGTLTEDVEIGHSSQLLRSDHEEAEIISLRNSISRTRTSSDTAACSLVNDEGKIC 540
BLAST of HG10009044 vs. NCBI nr
Match:
XP_022928770.1 (uncharacterized protein LOC111435594 isoform X2 [Cucurbita moschata])
HSP 1 Score: 834.7 bits (2155), Expect = 4.4e-238
Identity = 435/555 (78.38%), Positives = 476/555 (85.77%), Query Frame = 0
Query: 1 MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEV 60
M EMNRWKVTYTKHLKQ+RKVYHDGFLD+HRSSNK TMLYDECEKLLECRILKQ+EV
Sbjct: 1 MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNK----TMLYDECEKLLECRILKQEEV 60
Query: 61 ICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF--- 120
+CSGETLIFNSYLVDIDTPLGDHKPESGLNFQAG DKI EKSGVLRGKNFRNNSVCF
Sbjct: 61 VCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGVLRGKNFRNNSVCFENK 120
Query: 121 --------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAK 180
EFKK RLKCYGSPQSSPDTR+TEETEWQVL+T+NITQKAK
Sbjct: 121 ASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETEWQVLHTSNITQKAK 180
Query: 181 KYHDGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECES 240
KYHDGFLKL ICG LGRQV LFDENRKLLDSRF+KKDETVK GES+AFDAHLV+IGECE
Sbjct: 181 KYHDGFLKLLICGSLGRQVMLFDENRKLLDSRFMKKDETVKSGESIAFDAHLVDIGECER 240
Query: 241 DHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNG 300
+HKPPKIP++QG SS G+ GTRVL+ K CFSENEISTGKEW+ LYTSQITQKSKKYHNG
Sbjct: 241 EHKPPKIPLSQG-SSFGDRGTRVLNEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNG 300
Query: 301 IIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVE 360
IIKISSSGS HMQV LLNEDR ILSSKH SLSKK+ GEILELPKYLVEIGEAC +VKVE
Sbjct: 301 IIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIGEACGNVKVE 360
Query: 361 LGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDEN 420
+ NR+FDIRKD SFC+SG +E+GS R TMKKSLRDAH+ILSILQRP+ARVSLSSGH+D+N
Sbjct: 361 IANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKN 420
Query: 421 ISVSVLSYEVPEPSL-AEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTG 480
I VSV S +VPEPSL AEAL L +DD+SHKKPSE LDTR+STKNAENNQSIALT S
Sbjct: 421 ICVSVPSSKVPEPSLAAEALDLPMDDRSHKKPSENLDTRDSTKNAENNQSIALTPS---- 480
Query: 481 NAGTLTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEIT 532
TLTE++EIGHS+QL +++HVEAES SLR+TISRT+ +S AAC LVNDEGK+CEEIT
Sbjct: 481 ---TLTEELEIGHSNQLLQTEHVEAESSSLRDTISRTQGTSQFAACELVNDEGKMCEEIT 540
BLAST of HG10009044 vs. NCBI nr
Match:
XP_011654696.1 (uncharacterized protein LOC101209453 isoform X2 [Cucumis sativus])
HSP 1 Score: 830.5 bits (2144), Expect = 8.3e-237
Identity = 445/552 (80.62%), Positives = 468/552 (84.78%), Query Frame = 0
Query: 1 MGEM-NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDE 60
MGE+ NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK TMLYDECEKLLECR+LKQDE
Sbjct: 1 MGEITNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK----TMLYDECEKLLECRMLKQDE 60
Query: 61 VICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF-- 120
VICSGETLIFNS+LVDIDTPLGD KPESGLNFQ GDDKISE SGV+RGK+ NNSVC
Sbjct: 61 VICSGETLIFNSFLVDIDTPLGDQKPESGLNFQEGDDKISENSGVVRGKSILNNSVCSGA 120
Query: 121 -----------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYH 180
EFKKRRLKCYGSPQ+S DTRKTEETEWQVLYTTNITQKAKK+H
Sbjct: 121 EKNKTRPSFSPSQQIIREFKKRRLKCYGSPQTSLDTRKTEETEWQVLYTTNITQKAKKFH 180
Query: 181 DGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHK 240
DGFLKLSICG LG QV LFDENRKLLDSRFIKK ETVK GES+AFDAHLVEIGECE DHK
Sbjct: 181 DGFLKLSICGSLGSQVMLFDENRKLLDSRFIKKHETVKSGESIAFDAHLVEIGECEKDHK 240
Query: 241 PPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK 300
P KIP+N+G+SS GG VLHGQKSCFSENEISTGKEWN LYTSQITQKSKKYHNGIIK
Sbjct: 241 PSKIPLNEGTSSK-EGGASVLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIK 300
Query: 301 ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGN 360
ISSSGS MQV LLNEDR ILS KH SLSK VR GE LELPKYLVEIGEACESVKVELG+
Sbjct: 301 ISSSGSHQMQVTLLNEDRNILSRKHLSLSKNVRVGEKLELPKYLVEIGEACESVKVELGD 360
Query: 361 RNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDENISV 420
R DIRKDASFC+SGG+E GSGRET +KSLRDAHQILSILQRPR RV+LSSGHTDENISV
Sbjct: 361 RKCDIRKDASFCISGGDENGSGRETTQKSLRDAHQILSILQRPRGRVNLSSGHTDENISV 420
Query: 421 SVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGT 480
SV S P+PSLAEALHL D QSH+KPSE +TRES KN EN+QSIALTQSTFTGNA T
Sbjct: 421 SVSSRN-PKPSLAEALHLPKDYQSHQKPSEGQNTRESIKNTENSQSIALTQSTFTGNAET 480
Query: 481 LTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVN-DEGKICEEITYER 532
LTED E G SS+L RSDHVEAESISLRN+I R SDS AC LVN DEGKICEEITYER
Sbjct: 481 LTEDGEFGQSSKLLRSDHVEAESISLRNSIPR---RSDSTACSLVNDDEGKICEEITYER 540
BLAST of HG10009044 vs. ExPASy TrEMBL
Match:
A0A6J1ESH9 (uncharacterized protein LOC111435594 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111435594 PE=4 SV=1)
HSP 1 Score: 834.7 bits (2155), Expect = 2.1e-238
Identity = 435/555 (78.38%), Positives = 476/555 (85.77%), Query Frame = 0
Query: 1 MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEV 60
M EMNRWKVTYTKHLKQ+RKVYHDGFLD+HRSSNK TMLYDECEKLLECRILKQ+EV
Sbjct: 1 MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNK----TMLYDECEKLLECRILKQEEV 60
Query: 61 ICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF--- 120
+CSGETLIFNSYLVDIDTPLGDHKPESGLNFQAG DKI EKSGVLRGKNFRNNSVCF
Sbjct: 61 VCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGVLRGKNFRNNSVCFENK 120
Query: 121 --------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAK 180
EFKK RLKCYGSPQSSPDTR+TEETEWQVL+T+NITQKAK
Sbjct: 121 ASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETEWQVLHTSNITQKAK 180
Query: 181 KYHDGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECES 240
KYHDGFLKL ICG LGRQV LFDENRKLLDSRF+KKDETVK GES+AFDAHLV+IGECE
Sbjct: 181 KYHDGFLKLLICGSLGRQVMLFDENRKLLDSRFMKKDETVKSGESIAFDAHLVDIGECER 240
Query: 241 DHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNG 300
+HKPPKIP++QG SS G+ GTRVL+ K CFSENEISTGKEW+ LYTSQITQKSKKYHNG
Sbjct: 241 EHKPPKIPLSQG-SSFGDRGTRVLNEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNG 300
Query: 301 IIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVE 360
IIKISSSGS HMQV LLNEDR ILSSKH SLSKK+ GEILELPKYLVEIGEAC +VKVE
Sbjct: 301 IIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIGEACGNVKVE 360
Query: 361 LGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDEN 420
+ NR+FDIRKD SFC+SG +E+GS R TMKKSLRDAH+ILSILQRP+ARVSLSSGH+D+N
Sbjct: 361 IANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKN 420
Query: 421 ISVSVLSYEVPEPSL-AEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTG 480
I VSV S +VPEPSL AEAL L +DD+SHKKPSE LDTR+STKNAENNQSIALT S
Sbjct: 421 ICVSVPSSKVPEPSLAAEALDLPMDDRSHKKPSENLDTRDSTKNAENNQSIALTPS---- 480
Query: 481 NAGTLTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEIT 532
TLTE++EIGHS+QL +++HVEAES SLR+TISRT+ +S AAC LVNDEGK+CEEIT
Sbjct: 481 ---TLTEELEIGHSNQLLQTEHVEAESSSLRDTISRTQGTSQFAACELVNDEGKMCEEIT 540
BLAST of HG10009044 vs. ExPASy TrEMBL
Match:
A0A0A0KQR3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G148820 PE=4 SV=1)
HSP 1 Score: 830.5 bits (2144), Expect = 4.0e-237
Identity = 445/552 (80.62%), Positives = 468/552 (84.78%), Query Frame = 0
Query: 1 MGEM-NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDE 60
MGE+ NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK TMLYDECEKLLECR+LKQDE
Sbjct: 1 MGEITNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNK----TMLYDECEKLLECRMLKQDE 60
Query: 61 VICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF-- 120
VICSGETLIFNS+LVDIDTPLGD KPESGLNFQ GDDKISE SGV+RGK+ NNSVC
Sbjct: 61 VICSGETLIFNSFLVDIDTPLGDQKPESGLNFQEGDDKISENSGVVRGKSILNNSVCSGA 120
Query: 121 -----------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYH 180
EFKKRRLKCYGSPQ+S DTRKTEETEWQVLYTTNITQKAKK+H
Sbjct: 121 EKNKTRPSFSPSQQIIREFKKRRLKCYGSPQTSLDTRKTEETEWQVLYTTNITQKAKKFH 180
Query: 181 DGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHK 240
DGFLKLSICG LG QV LFDENRKLLDSRFIKK ETVK GES+AFDAHLVEIGECE DHK
Sbjct: 181 DGFLKLSICGSLGSQVMLFDENRKLLDSRFIKKHETVKSGESIAFDAHLVEIGECEKDHK 240
Query: 241 PPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIK 300
P KIP+N+G+SS GG VLHGQKSCFSENEISTGKEWN LYTSQITQKSKKYHNGIIK
Sbjct: 241 PSKIPLNEGTSSK-EGGASVLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIK 300
Query: 301 ISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGN 360
ISSSGS MQV LLNEDR ILS KH SLSK VR GE LELPKYLVEIGEACESVKVELG+
Sbjct: 301 ISSSGSHQMQVTLLNEDRNILSRKHLSLSKNVRVGEKLELPKYLVEIGEACESVKVELGD 360
Query: 361 RNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDENISV 420
R DIRKDASFC+SGG+E GSGRET +KSLRDAHQILSILQRPR RV+LSSGHTDENISV
Sbjct: 361 RKCDIRKDASFCISGGDENGSGRETTQKSLRDAHQILSILQRPRGRVNLSSGHTDENISV 420
Query: 421 SVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNAGT 480
SV S P+PSLAEALHL D QSH+KPSE +TRES KN EN+QSIALTQSTFTGNA T
Sbjct: 421 SVSSRN-PKPSLAEALHLPKDYQSHQKPSEGQNTRESIKNTENSQSIALTQSTFTGNAET 480
Query: 481 LTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVN-DEGKICEEITYER 532
LTED E G SS+L RSDHVEAESISLRN+I R SDS AC LVN DEGKICEEITYER
Sbjct: 481 LTEDGEFGQSSKLLRSDHVEAESISLRNSIPR---RSDSTACSLVNDDEGKICEEITYER 540
BLAST of HG10009044 vs. ExPASy TrEMBL
Match:
A0A1S4E617 (uncharacterized protein LOC103482830 isoform X4 OS=Cucumis melo OX=3656 GN=LOC103482830 PE=4 SV=1)
HSP 1 Score: 830.1 bits (2143), Expect = 5.2e-237
Identity = 446/554 (80.51%), Positives = 474/554 (85.56%), Query Frame = 0
Query: 4 MNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEVICS 63
MNRWKVTYT HLKQKRKVYHDGFLDIHRSSNK TMLYDECEKLLECRIL++DEVICS
Sbjct: 1 MNRWKVTYTNHLKQKRKVYHDGFLDIHRSSNK----TMLYDECEKLLECRILRKDEVICS 60
Query: 64 GETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF------ 123
GETLIFNS+LVDIDTPLGDHKPE GLNFQ GDDKISEKSGVLRGK+ RNNSVCF
Sbjct: 61 GETLIFNSFLVDIDTPLGDHKPEFGLNFQEGDDKISEKSGVLRGKSIRNNSVCFASAEKN 120
Query: 124 --------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKYHDGF 183
EFKKRRLK YGSPQ+SPDTRKTEETEWQVLYTTNITQKAKK+HDGF
Sbjct: 121 KTRPSLSPSHQIIREFKKRRLKSYGSPQTSPDTRKTEETEWQVLYTTNITQKAKKFHDGF 180
Query: 184 LKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECESDHKPPK 243
LKLSICG LG QV LFDENRKLL+SRFIKK ETVK GES+AFDAHLVEIGECE DHKP K
Sbjct: 181 LKLSICGSLGSQVMLFDENRKLLNSRFIKKHETVKSGESIAFDAHLVEIGECEKDHKPSK 240
Query: 244 IPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNGIIKISS 303
IP+N+G+SS GG RVLHGQKSCFSENEIS GKEW+ LYTSQITQKSKKY NGIIKISS
Sbjct: 241 IPLNEGTSSK-EGGDRVLHGQKSCFSENEISAGKEWHVLYTSQITQKSKKYQNGIIKISS 300
Query: 304 SGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVELGNRNF 363
SGS MQV LLNEDR ILS KH SLSK V+ GE LELPKYLVEIGEACESVKVELGNRNF
Sbjct: 301 SGSHQMQVTLLNEDRNILSRKHLSLSKNVKVGEKLELPKYLVEIGEACESVKVELGNRNF 360
Query: 364 DIRKDASFCLSGGNEEGSGRETMKKSLRD-----AHQILSILQRPRARVSLSSGHTDENI 423
DIRKDASFC+SGG+E+GSGRET +KSLRD AHQILSILQRPRARV+LSSGHTDENI
Sbjct: 361 DIRKDASFCISGGDEKGSGRETTQKSLRDVLYHEAHQILSILQRPRARVNLSSGHTDENI 420
Query: 424 SVSVLSYEVPEPSLAEALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTGNA 483
SVSV S P+PS+AEALHL +DDQSH+KPSE +TRES KNAEN+QSIALTQSTFTGNA
Sbjct: 421 SVSVSSRN-PKPSVAEALHLPIDDQSHQKPSEGQNTRESIKNAENSQSIALTQSTFTGNA 480
Query: 484 GTLTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVN-DEGKICEEITY 532
TLTED EIG SS+L RSDHVEAESISLRN+I R SDSAA LVN DEGKIC+EITY
Sbjct: 481 ETLTEDGEIGQSSKL-RSDHVEAESISLRNSIPR---KSDSAAYSLVNDDEGKICQEITY 540
BLAST of HG10009044 vs. ExPASy TrEMBL
Match:
A0A6J1I1P0 (protein ZGRF1 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111469041 PE=4 SV=1)
HSP 1 Score: 827.0 bits (2135), Expect = 4.4e-236
Identity = 431/555 (77.66%), Positives = 476/555 (85.77%), Query Frame = 0
Query: 1 MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEV 60
M EMNRWKVTYTKHLKQ+RKVYHDGFLD+HRSSNK TMLYDECEKLLECRILKQ+EV
Sbjct: 1 MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNK----TMLYDECEKLLECRILKQEEV 60
Query: 61 ICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF--- 120
+CSGETLIFNSYLV+IDTPLGD+KPESGLNFQAG D+ISEKSGVLRGKNFRNNSVCF
Sbjct: 61 VCSGETLIFNSYLVEIDTPLGDNKPESGLNFQAGHDEISEKSGVLRGKNFRNNSVCFENK 120
Query: 121 --------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAK 180
EFKK RLKCYGSPQ+SPDTR+TEETEWQVLYT+NITQKAK
Sbjct: 121 ASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQTSPDTRQTEETEWQVLYTSNITQKAK 180
Query: 181 KYHDGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECES 240
KYHDGFLKL ICG LGRQV LFDENRKLLDSRF+KKDE VK GES+AFDAHLV+IGECE
Sbjct: 181 KYHDGFLKLLICGSLGRQVMLFDENRKLLDSRFMKKDERVKSGESIAFDAHLVDIGECER 240
Query: 241 DHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNG 300
+HKPPKIP++QG SS G+ GTRVLH K CFSENEISTGKEW+ LYTSQITQKSKKYHNG
Sbjct: 241 EHKPPKIPVSQG-SSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNG 300
Query: 301 IIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVE 360
IIKISSSGS HMQV LLNEDRIILSSKH SLSKK+ GEILELPKYLVEIGEACE+VKVE
Sbjct: 301 IIKISSSGSHHMQVTLLNEDRIILSSKHISLSKKLGMGEILELPKYLVEIGEACENVKVE 360
Query: 361 LGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDEN 420
L NR+FDIRKDASFC+SG +E+GS R TMKKSLRDAH+ILSILQRP+ARVSLSSG +D+N
Sbjct: 361 LANRDFDIRKDASFCISGEDEKGSARATMKKSLRDAHEILSILQRPKARVSLSSGQSDKN 420
Query: 421 ISVSVLSYEVPEPSLA-EALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTG 480
ISVSV S +VPEPSLA EAL L +D++SH+KPSE LDTRESTKNAE+NQS ALTQST T
Sbjct: 421 ISVSVSSSKVPEPSLATEALDLPMDERSHQKPSENLDTRESTKNAESNQSFALTQSTLT- 480
Query: 481 NAGTLTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEIT 532
++EIGHS+QL ++++VEAES SLR+TIS T+ +S AAC LVNDEGK+CEEIT
Sbjct: 481 -------ELEIGHSNQLLQTEYVEAESSSLRDTISWTQGTSQFAACKLVNDEGKMCEEIT 540
BLAST of HG10009044 vs. ExPASy TrEMBL
Match:
A0A6J1HXZ5 (protein ZGRF1 isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111469041 PE=4 SV=1)
HSP 1 Score: 819.3 bits (2115), Expect = 9.3e-234
Identity = 430/555 (77.48%), Positives = 474/555 (85.41%), Query Frame = 0
Query: 1 MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEV 60
M EMNRWKVTYTKHLKQ+RKVYHDGFLD+HRSSNK TMLYDECEKLLECRILKQ+EV
Sbjct: 1 MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNK----TMLYDECEKLLECRILKQEEV 60
Query: 61 ICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSVCF--- 120
+CSGETLIFNSYLV+IDTPLGD+KPESGLNFQAG D+ISEKSGVLRGKNFRNNSVCF
Sbjct: 61 VCSGETLIFNSYLVEIDTPLGDNKPESGLNFQAGHDEISEKSGVLRGKNFRNNSVCFENK 120
Query: 121 --------------------EFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAK 180
EFKK RLKCYGSPQ+SPDTR+TEETEWQVLYT+NITQKAK
Sbjct: 121 ASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQTSPDTRQTEETEWQVLYTSNITQKAK 180
Query: 181 KYHDGFLKLSICGFLGRQVTLFDENRKLLDSRFIKKDETVKCGESMAFDAHLVEIGECES 240
KYHDGFLKL ICG LGRQV LFDENRKLLDSRF+KKDE VK GES+AFDAHLV+IGECE
Sbjct: 181 KYHDGFLKLLICGSLGRQVMLFDENRKLLDSRFMKKDERVKSGESIAFDAHLVDIGECER 240
Query: 241 DHKPPKIPINQGSSSSGNGGTRVLHGQKSCFSENEISTGKEWNALYTSQITQKSKKYHNG 300
+HKPPKIP++QG SS G+ GTRVLH K CFSENEISTGKEW+ LYTSQITQKSKKYHNG
Sbjct: 241 EHKPPKIPVSQG-SSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNG 300
Query: 301 IIKISSSGSLHMQVALLNEDRIILSSKHFSLSKKVRSGEILELPKYLVEIGEACESVKVE 360
IIKISSSGS HMQV LLNEDRIILSSKH SLSKK+ GEILELPKYLVEIGEACE+VKVE
Sbjct: 301 IIKISSSGSHHMQVTLLNEDRIILSSKHISLSKKLGMGEILELPKYLVEIGEACENVKVE 360
Query: 361 LGNRNFDIRKDASFCLSGGNEEGSGRETMKKSLRDAHQILSILQRPRARVSLSSGHTDEN 420
L NR+FDIRKDASFC+SG +E+GS R TMKKSLRDAH+ILSILQRP+ARVSLSSG +D+N
Sbjct: 361 LANRDFDIRKDASFCISGEDEKGSARATMKKSLRDAHEILSILQRPKARVSLSSGQSDKN 420
Query: 421 ISVSVLSYEVPEPSLA-EALHLSVDDQSHKKPSERLDTRESTKNAENNQSIALTQSTFTG 480
ISVSV S +VPEPSLA EAL L +D++SH+KPSE LDTRESTKNAE+NQS ALTQST T
Sbjct: 421 ISVSVSSSKVPEPSLATEALDLPMDERSHQKPSENLDTRESTKNAESNQSFALTQSTLT- 480
Query: 481 NAGTLTEDVEIGHSSQLFRSDHVEAESISLRNTISRTRASSDSAACCLVNDEGKICEEIT 532
++EIGHS+Q +++VEAES SLR+TIS T+ +S AAC LVNDEGK+CEEIT
Sbjct: 481 -------ELEIGHSNQ---TEYVEAESSSLRDTISWTQGTSQFAACKLVNDEGKMCEEIT 539
BLAST of HG10009044 vs. TAIR 10
Match:
AT4G10890.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2439 (InterPro:IPR018838); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G43722.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 106.3 bits (264), Expect = 7.7e-23
Identity = 62/135 (45.93%), Positives = 77/135 (57.04%), Query Frame = 0
Query: 1 MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKIDTQTMLYDECEKLLECRILKQDEV 60
M E RW YTKHLKQKRKVYHDGFLD+H + K+ MLYDE + LLE R LK EV
Sbjct: 241 MAEKQRWIAMYTKHLKQKRKVYHDGFLDLHIARKKV----MLYDEDDNLLESRTLKACEV 300
Query: 61 ICSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGDDKISEKSGVLRGKNFRNNSV-CFEF 120
+ +GETL F +YLVDI P K S + D K + K + NF+ +S+ C E
Sbjct: 301 VNTGETLTFQAYLVDICDPKDGSKASSEPKVEPSDQKCARKPFTVLRPNFKKSSLRCDEK 360
Query: 121 KKRRLKCYGSPQSSP 135
K + + S SP
Sbjct: 361 KPDLVNKFSSKSLSP 371
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038874789.1 | 3.7e-261 | 86.21 | uncharacterized protein LOC120067307 isoform X3 [Benincasa hispida]
XP_038874787.1 | 3.5e-259 | 84.97 | uncharacterized protein LOC120067307 isoform X1 [Benincasa hispida]
XP_038874788.1 | 2.5e-257 | 84.79 | uncharacterized protein LOC120067307 isoform X2 [Benincasa hispida]
XP_022928770.1 | 4.4e-238 | 78.38 | uncharacterized protein LOC111435594 isoform X2 [Cucurbita moschata]
XP_011654696.1 | 8.3e-237 | 80.62 | uncharacterized protein LOC101209453 isoform X2 [Cucumis sativus]
Match Name | E-value | Identity (%) | Description
A0A6J1ESH9 | 2.1e-238 | 78.38 | uncharacterized protein LOC111435594 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A0A0KQR3 | 4.0e-237 | 80.62 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G148820 PE=4 SV=1
A0A1S4E617 | 5.2e-237 | 80.51 | uncharacterized protein LOC103482830 isoform X4 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1I1P0 | 4.4e-236 | 77.66 | protein ZGRF1 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111469041 PE=4 SV=1
A0A6J1HXZ5 | 9.3e-234 | 77.48 | protein ZGRF1 isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111469041 PE=4 SV=1
Match Name | E-value | Identity (%) | Description
AT4G10890.1 | 7.7e-23 | 45.93 | unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2439...
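The summary rows above are pipe-delimited, so they can be loaded and ranked with a few lines of code. This is a rough sketch, assuming each row carries the match name, E-value, percent identity, and description in that order; the function name `parse_summary_row` is hypothetical, and any trailing columns (such as a "[more]" link) are ignored.

```python
def parse_summary_row(row: str) -> dict:
    """Split one pipe-delimited summary row into typed fields."""
    fields = [field.strip() for field in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return {
        "match": name,
        "evalue": float(evalue),
        "identity_pct": float(identity),
        "description": description,
    }


rows = [
    "XP_038874789.1 | 3.7e-261 | 86.21 | uncharacterized protein LOC120067307 isoform X3 [Benincasa hispida]",
    "XP_011654696.1 | 8.3e-237 | 80.62 | uncharacterized protein LOC101209453 isoform X2 [Cucumis sativus]",
    "AT4G10890.1 | 7.7e-23 | 45.93 | unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2439...",
]
hits = [parse_summary_row(row) for row in rows]

# The smallest E-value marks the most significant hit.
best = min(hits, key=lambda hit: hit["evalue"])
print(best["match"])  # XP_038874789.1
```

Sorting on the E-value rather than on percent identity keeps hits from different alignment lengths comparable, which matters here because the Arabidopsis match covers only 135 aligned positions.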