HG10017546 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10017546
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionZinc finger family protein, putative isoform 1
LocationChr03: 15464545 .. 15468928 (+)
RNA-Seq ExpressionHG10017546
SyntenyHG10017546
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCCTTGACCCACTTCGCCGGCATTGTTCCCGATGGGGAAAAACGACGGAGAACAGCCACTGCCGTCCGCCATCGGCTACAGGCCATCTGGCCAGGCTGCCGATGGCCGATGCTGTTGTGGGTGTGTTTCGATTCCAAGACTCATTGGCTTCAGATGCATCTTCATTCTGCTATTGTCCGTTGCCTTGTTCGTTTCTGCTGTTTTTTGGTTGCCCCCTTTTCTCCATTATGCAGATCAAAAGGATCTGGGTCTCAATTCGTCGTATCGAGGTGGGACTCTTCATCGATTCTGATATCTTTGGGGTTTTAGGGGTTCGTCGATTGCTGTTTCTTTTGTGGGGTAAATAGATCTGGGAATTTTTTTGAGAATCTGGTTTAGATTTTTGAGATTTTGTTGCTTGTTCCTGCTGATTGTGTGTGGTTCTGTGTTGGCTGAGGAAAATGAGGGTTATTAGAGGGATTTAGATTGGTTTGTTTGATGGATTTGGGCTAATTTTGATCTGGAACTTGATGAGTTTTTTATGGCGTTCTTCTTGGTGGGTTCCTAGAGGCTTGCCTGTAGAATCGTTTTGGTTTGGGGAAGTGAGAAGTAGAGAAAATCAAAAGTAGTTTTGAGTAACGGAAGGCCCTGATGTTCAGGCTTTGATTATCGTCTTTCCTAATTAAACATGGGGCACCTCATTGGAGCTTGCACTAGGATTGTTCTAGAATCAGTTATGCTTGTTATTCTTTACATAATTAAATTAATTACATGTCTTCATCATTAGTGAGAAATTTTGCTCAGAGGGCGTCTGTACCTCTACCTTCCATTATCCCAAAACAAAAAAAAAAAAAAAAAAGAAGAAAAACATAGAGAGCATGGATTGCTTGTTTCAGCCAAAAATGTATTTATTTTTCAACTTTTTTTTTGGGGGGAGGGGGTGTTGAACCGGAAGTTTGAATCAACTAAATTTTCCATTTCTAGGGATTTCAATTAATCTATCGGAGTTCATGTGACAAGTTACATGGCTTATTGGCTTAAGTAAATGGAGAAAAATCTAACATTTAAAACTTGTGTGTATGTGTATGTGTATATTTTGAGTTCAATAAATAAGAGGCGGAGATTCGAAACTCCGACCTCAAGGATAACAAGTGGTACTTAAACCGGTGAACTCTTATATTGGCTAAAACTTGTGTAATTGACCTTTATCACGTGGTTTGTCAAAAATTGGATAAACATATAATTGTTTATTTTCAATCCTTTTTCTGAATAATTGGATGTTAGCACTTTTACCCACTTAAATTTTATGTACTCATGCAAAATGCTAAGATTCTTTTTTTTTTTTTTTGAATAAAATCATTTACTCCGGTCATCAAGGCACAATGCTGTCGTATGCCTCCTCCCTTCAAAAGTTTTGCCTTAGAGGTGAGAAGCCTCACCATGCAAATAAGAAAAAGGAAGAAAGTAATTTCTCTTAACTGCCTACACCATCCATATGCAGCTAGAGTGCAATGTATATTAAGCTAATGATTGGTTTGGCCTTTGTTTAGTTTAAAGTTAGATACTTGTATTTGAAATGGATCATCCATTTAATGAACCAACTTTGCGAGTTACTTGTGATTTTTCCCTTCTCCTTGGATATTGGTTAGGTTGGTAGGGCCATAAGGTTAAAAGGTTGTTTTATAATGATGCGTGGAGATTCTTCTTCGCCTTTAGTTGGAACTGAACTGTAGAATGTAGAGAGTGTTTTGCATTCAAGGAGTTTTGGATTTGGCTCGTCTCTGCTTTTTATTGCAGCTCCTTGTCCAAATACTTCCACAATTCAAGAGCCCTATTGTAGTCTTCTTTTTTGGTTATGGTGTATTTCTTATCTTCCCCTCCTTTTTGTCAAATGAATAAATTGTTTTGTGCGTAATGAGTCAGAACAGTGTTTTGGTTTTTCAAATGATCTATTTTGGCTGTTAAAGTGTATATATCAATTTAAAATAAAGGATGAGTTATATGGCTAGTTCATAAACTAATGTTTTAGTTGATTTCAGCAGATTTATGTAAAATGATATTATTTGATTTCCATATCGATAGAATGTTACCTAATATTCGTGTGTTGAAGTAAGGTATTGATTCCACAGTTCTACATGGTTGAAAATTGCAAGTTGTTTGAGGTCGATACACACGCCGATGCAACAGTTTCTTGACTTGAGGGTGGGAGGGTATAGTATTTTTCTTTAGTTGCTTTTGTTTTTGTTGACATTTTCAGTATTGTTTTGCGACTATAATAGTAACAATAGTAGTATGTTGACTTCTTAAAATTGTTTAGAATCACAAAACTGTTTTATATGGGTATAGTTAGAGCAACTGTAATTGACATTGTAGTTCCTTAAAGCTATCCTTCCATCTTGCAAACTTGACATATTATGTGTGTGCAGTGGGAATTTATATTAACGACGAAAGTCAACATAAATTCTTTAACAAGAATCAAAATGGCTCGACATGTCATCTCTGACTAATTTCTGTTTCCAGGTCATGATATAGTAGCAACATTCAATGTTGAGAGACCGGTTTCTTTGCTGGAAGACAATATCGAGCAACTCCAAACCGACATTTTTGAAGAGTTCCCTATACCTTCTATCAAAGTATCTACTCTTTTTTATTCTGATTGATTGTGAACTCTATATACTTTATGCATATTGATCTGACGCTCTTTCAAATCAGGTGGATATACTATCTCTAGAACCATTATCTGGATCCAACCGTACAAAAGTTGTGTTCAGCCTCGATCCAGATGGCGATGACTTGGAAATCTCGTCAACTTATCTAAGTTTAATCAGGTCAACCATTGCAAGTCTAGTAACGAATCAGTTCCTCCGCATTACTAAATCCATGTTTGGGGAGGCTTTTTCGTTTGAAGTACTGAAATTCCCCAGAGGAATAACTATAATCCCGCCACAGAGTGCATTTCTTTTGCAGAAAGTGCAAATTCTTTTCAACTTTACGTTGAACTTCTCTATTCATCAGATTCAAGTACATTTCAGTGAACTGACCAGCCAACTGGAGGCGGGATTACGACTAGCTCCATATGAGGTTTGGTGTCTCTCTATCTGCTCATTCGATTTGCAATGCATTAATTTGGCTGGATGAATTTAGGTTTCCTTTAAATAATGTTTTTGTTTTTCATTTTCTAATATTAAAAATAAGATTGTTATCAATCGAGACCTTAGTTTAACTCTCATGAGCATACAGGCTACCAAAATTGTTTTCATTGTTCTTTCTGTAGCTGGATATCTTTTAACATGATGAGTAAAAAAAGTTGTATTTCTTGAGCTATTATTATTCCTATTTTTCCTTGTTTCAACGGAAATGTGTTCCGATATTTGAATTACTTTGCACTTTTATTGTAAAAATTAGCAAATGCTTGAGTTTGATCTTTGACGGTTACGAGTTGTTTTGTCCTAAATGCAGATTTTATATATTAAGCTGTGGAATGCAGAAGGTTCGACTGTGACTGCCCCTACGATTGTCCAGTCGTCTGTACTTCTTGAAGTTGGAAATACTCCATCAATGCGACGGCTGAAGCAGCTAGCTCAAACAATCTCAGGTTCTAATTCTAGCAATCTCGGTCTGAATAATACGGAGTTTGGAAAGGTGAAGCAAGTTCGCCTTTCGTCGATTCTTAAATACTCCCTCAATGGGAGTGACGGGAACGGCCCCGCAAGGTCACCTTCTCCTGCTCCTGCACCCCAGTCCCATAACTACCCTCATCCCCCGACTCACCACCATCACCACCATCACACCCCTCTAACCCCTGCAATTTCACCTGCCCCTGCAACCGAGAAGGGTGCACCAGAATACAGTTCGCCTGCCCCTGAAAGAAGCGCAGCATCACCTAAGAGAAGTTACGCGGCAAAGCCGCCTGGTTGTCAGTATAGATACAAGAGGAAGTCTGGTAGGAAAGAAGGAAAGCAATCTCATTTAACCCCGCTTGCTTCACCCATTATATCTCCTGATCATTCTGCTGCATCGCCATCGCCATCGCCACAACATCAAGTAAACCCACCAGCAGCACCCGTCTCTCGAGCTCCGGCATTAACTCCATTGCCAAATGTCGTTTACGCTCATGTTCAACCACCTTCCAAAAGCGACTCCAGCCACCCGGAAAAATCCACGACAAATCCATTAGTTGCGCCATCTCCATCTCCATGTGAGTAACACACTGATTCCGGTAGGAAACTGGATAGTGTTTTAACTACAAAAGTCCCATATATTCCATTGAACAATCCCAAATTTTCCATTGTGAAAACTTTTTCAACTGACTCAACTGATTTTGCAGCTGGTGCTGATCGTCGTCGTATGATCACTCAATGGGGATTCACACTGTTTCTTATTCTCGCACGCCATATGTAA

mRNA sequence

ATGGCCCTTGACCCACTTCGCCGGCATTGTTCCCGATGGGGAAAAACGACGGAGAACAGCCACTGCCGTCCGCCATCGGCTACAGGCCATCTGGCCAGGCTGCCGATGGCCGATGCTGTTGTGGGTGTGTTTCGATTCCAAGACTCATTGGCTTCAGATGCATCTTCATTCTGCTATTGTCCGTTGCCTTGTTCGTTTCTGCTGTTTTTTGGTTGCCCCCTTTTCTCCATTATGCAGATCAAAAGGATCTGGGTCTCAATTCGTCGTATCGAGGTGGGACTCTTCATCGATTCTGATATCTTTGGGGTTTTAGGGGTTCGTCGATTGCTGTTTCTTTTGTGGGGTCATGATATAGTAGCAACATTCAATGTTGAGAGACCGGTTTCTTTGCTGGAAGACAATATCGAGCAACTCCAAACCGACATTTTTGAAGAGTTCCCTATACCTTCTATCAAAGTGGATATACTATCTCTAGAACCATTATCTGGATCCAACCGTACAAAAGTTGTGTTCAGCCTCGATCCAGATGGCGATGACTTGGAAATCTCGTCAACTTATCTAAGTTTAATCAGGTCAACCATTGCAAGTCTAGTAACGAATCAGTTCCTCCGCATTACTAAATCCATGTTTGGGGAGGCTTTTTCGTTTGAAGTACTGAAATTCCCCAGAGGAATAACTATAATCCCGCCACAGAGTGCATTTCTTTTGCAGAAAGTGCAAATTCTTTTCAACTTTACGTTGAACTTCTCTATTCATCAGATTCAAGTACATTTCAGTGAACTGACCAGCCAACTGGAGGCGGGATTACGACTAGCTCCATATGAGATTTTATATATTAAGCTGTGGAATGCAGAAGGTTCGACTGTGACTGCCCCTACGATTGTCCAGTCGTCTGTACTTCTTGAAGTTGGAAATACTCCATCAATGCGACGGCTGAAGCAGCTAGCTCAAACAATCTCAGGTTCTAATTCTAGCAATCTCGGTCTGAATAATACGGAGTTTGGAAAGGTGAAGCAAGTTCGCCTTTCGTCGATTCTTAAATACTCCCTCAATGGGAGTGACGGGAACGGCCCCGCAAGGTCACCTTCTCCTGCTCCTGCACCCCAGTCCCATAACTACCCTCATCCCCCGACTCACCACCATCACCACCATCACACCCCTCTAACCCCTGCAATTTCACCTGCCCCTGCAACCGAGAAGGGTGCACCAGAATACAGTTCGCCTGCCCCTGAAAGAAGCGCAGCATCACCTAAGAGAAGTTACGCGGCAAAGCCGCCTGGTTGTCAGTATAGATACAAGAGGAAGTCTGGTAGGAAAGAAGGAAAGCAATCTCATTTAACCCCGCTTGCTTCACCCATTATATCTCCTGATCATTCTGCTGCATCGCCATCGCCATCGCCACAACATCAAGTAAACCCACCAGCAGCACCCGTCTCTCGAGCTCCGGCATTAACTCCATTGCCAAATGTCGTTTACGCTCATGTTCAACCACCTTCCAAAAGCGACTCCAGCCACCCGGAAAAATCCACGACAAATCCATTAGTTGCGCCATCTCCATCTCCATCTGGTGCTGATCGTCGTCGTATGATCACTCAATGGGGATTCACACTGTTTCTTATTCTCGCACGCCATATGTAA

Coding sequence (CDS)

ATGGCCCTTGACCCACTTCGCCGGCATTGTTCCCGATGGGGAAAAACGACGGAGAACAGCCACTGCCGTCCGCCATCGGCTACAGGCCATCTGGCCAGGCTGCCGATGGCCGATGCTGTTGTGGGTGTGTTTCGATTCCAAGACTCATTGGCTTCAGATGCATCTTCATTCTGCTATTGTCCGTTGCCTTGTTCGTTTCTGCTGTTTTTTGGTTGCCCCCTTTTCTCCATTATGCAGATCAAAAGGATCTGGGTCTCAATTCGTCGTATCGAGGTGGGACTCTTCATCGATTCTGATATCTTTGGGGTTTTAGGGGTTCGTCGATTGCTGTTTCTTTTGTGGGGTCATGATATAGTAGCAACATTCAATGTTGAGAGACCGGTTTCTTTGCTGGAAGACAATATCGAGCAACTCCAAACCGACATTTTTGAAGAGTTCCCTATACCTTCTATCAAAGTGGATATACTATCTCTAGAACCATTATCTGGATCCAACCGTACAAAAGTTGTGTTCAGCCTCGATCCAGATGGCGATGACTTGGAAATCTCGTCAACTTATCTAAGTTTAATCAGGTCAACCATTGCAAGTCTAGTAACGAATCAGTTCCTCCGCATTACTAAATCCATGTTTGGGGAGGCTTTTTCGTTTGAAGTACTGAAATTCCCCAGAGGAATAACTATAATCCCGCCACAGAGTGCATTTCTTTTGCAGAAAGTGCAAATTCTTTTCAACTTTACGTTGAACTTCTCTATTCATCAGATTCAAGTACATTTCAGTGAACTGACCAGCCAACTGGAGGCGGGATTACGACTAGCTCCATATGAGATTTTATATATTAAGCTGTGGAATGCAGAAGGTTCGACTGTGACTGCCCCTACGATTGTCCAGTCGTCTGTACTTCTTGAAGTTGGAAATACTCCATCAATGCGACGGCTGAAGCAGCTAGCTCAAACAATCTCAGGTTCTAATTCTAGCAATCTCGGTCTGAATAATACGGAGTTTGGAAAGGTGAAGCAAGTTCGCCTTTCGTCGATTCTTAAATACTCCCTCAATGGGAGTGACGGGAACGGCCCCGCAAGGTCACCTTCTCCTGCTCCTGCACCCCAGTCCCATAACTACCCTCATCCCCCGACTCACCACCATCACCACCATCACACCCCTCTAACCCCTGCAATTTCACCTGCCCCTGCAACCGAGAAGGGTGCACCAGAATACAGTTCGCCTGCCCCTGAAAGAAGCGCAGCATCACCTAAGAGAAGTTACGCGGCAAAGCCGCCTGGTTGTCAGTATAGATACAAGAGGAAGTCTGGTAGGAAAGAAGGAAAGCAATCTCATTTAACCCCGCTTGCTTCACCCATTATATCTCCTGATCATTCTGCTGCATCGCCATCGCCATCGCCACAACATCAAGTAAACCCACCAGCAGCACCCGTCTCTCGAGCTCCGGCATTAACTCCATTGCCAAATGTCGTTTACGCTCATGTTCAACCACCTTCCAAAAGCGACTCCAGCCACCCGGAAAAATCCACGACAAATCCATTAGTTGCGCCATCTCCATCTCCATCTGGTGCTGATCGTCGTCGTATGATCACTCAATGGGGATTCACACTGTTTCTTATTCTCGCACGCCATATGTAA

Protein sequence

MALDPLRRHCSRWGKTTENSHCRPPSATGHLARLPMADAVVGVFRFQDSLASDASSFCYCPLPCSFLLFFGCPLFSIMQIKRIWVSIRRIEVGLFIDSDIFGVLGVRRLLFLLWGHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPLVAPSPSPSGADRRRMITQWGFTLFLILARHM
Homology
BLAST of HG10017546 vs. NCBI nr
Match: XP_038882638.1 (uncharacterized protein LOC120073837 [Benincasa hispida])

HSP 1 Score: 731.5 bits (1887), Expect = 5.4e-207
Identity = 391/431 (90.72%), Postives = 403/431 (93.50%), Query Frame = 0

Query: 115 GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLD 174
           GHDIVATFNVERPVSLLEDNIEQL+TDIFEEF IPSIKVDILSLE L GSNRTKVVFSLD
Sbjct: 80  GHDIVATFNVERPVSLLEDNIEQLRTDIFEEFNIPSIKVDILSLESLPGSNRTKVVFSLD 139

Query: 175 PDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAF 234
           PD D+ EISSTYLSLIRSTI SLVTNQFLRITKSMFGEAFSFEVLKFP GITIIPPQSAF
Sbjct: 140 PDTDESEISSTYLSLIRSTIVSLVTNQFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAF 199

Query: 235 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTI 294
           LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILY+KLWNAEGSTVTAPTI
Sbjct: 200 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYVKLWNAEGSTVTAPTI 259

Query: 295 VQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSD 354
           VQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILK+SLNGS+
Sbjct: 260 VQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSE 319

Query: 355 GNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA 414
           GNGP RSPSPAP PQ HN  +PPT HHHHHHT LTPAISPAPATEKGAPEY SPAPERS 
Sbjct: 320 GNGPTRSPSPAPMPQPHN--NPPT-HHHHHHTRLTPAISPAPATEKGAPEYGSPAPERST 379

Query: 415 ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPP 474
           ASPKRSY AKPPGCQY  KRKSGRKEGKQSHLTPLASP +SPDHSAASPSP PQH+VNPP
Sbjct: 380 ASPKRSYTAKPPGCQY-IKRKSGRKEGKQSHLTPLASPNVSPDHSAASPSPLPQHKVNPP 439

Query: 475 AAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPLVAPSPSPSGADRRRMITQWG 534
           AAP+  APALTPLPNV+YAHVQPPSKS+S+HPEKSTTNP  APSPSPSGADR  MITQWG
Sbjct: 440 AAPIVPAPALTPLPNVIYAHVQPPSKSNSNHPEKSTTNPSDAPSPSPSGADRCCMITQWG 499

Query: 535 FTLFLILARHM 546
           FTLFLILA HM
Sbjct: 500 FTLFLILACHM 506

BLAST of HG10017546 vs. NCBI nr
Match: KGN54878.2 (hypothetical protein Csa_012907 [Cucumis sativus])

HSP 1 Score: 730.7 bits (1885), Expect = 9.2e-207
Identity = 388/431 (90.02%), Postives = 399/431 (92.58%), Query Frame = 0

Query: 115 GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLD 174
           GHDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFSLD
Sbjct: 80  GHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLD 139

Query: 175 PDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAF 234
           PD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA+SFEVLKFP GITIIPPQSAF
Sbjct: 140 PDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF 199

Query: 235 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTI 294
           LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVT PTI
Sbjct: 200 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTI 259

Query: 295 VQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSD 354
           VQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILK+SLNGSD
Sbjct: 260 VQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSD 319

Query: 355 GNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA 414
           GNGP RSPSPAP PQ HN  HPPTHHHHHHHTPLTPAISPAPATEKGAPEY SPAPER+A
Sbjct: 320 GNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNA 379

Query: 415 ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPP 474
           ASPKRSY AKPPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPP
Sbjct: 380 ASPKRSYTAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAA--SPSPQHQINPP 439

Query: 475 AAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPLVAPSPSPSGADRRRMITQWG 534
           AAPVS APALTPLPNV+YAHVQPPSKSDS+HP     NP +A  PSPSGADR  MITQWG
Sbjct: 440 AAPVSPAPALTPLPNVIYAHVQPPSKSDSNHP----ANPSIA--PSPSGADRCHMITQWG 499

Query: 535 FTLFLILARHM 546
           FTLFLILA HM
Sbjct: 500 FTLFLILACHM 502

BLAST of HG10017546 vs. NCBI nr
Match: XP_004144318.1 (uncharacterized protein LOC101216010 isoform X1 [Cucumis sativus])

HSP 1 Score: 730.7 bits (1885), Expect = 9.2e-207
Identity = 388/431 (90.02%), Postives = 399/431 (92.58%), Query Frame = 0

Query: 115 GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLD 174
           GHDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFSLD
Sbjct: 80  GHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLD 139

Query: 175 PDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAF 234
           PD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA+SFEVLKFP GITIIPPQSAF
Sbjct: 140 PDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF 199

Query: 235 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTI 294
           LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVT PTI
Sbjct: 200 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTI 259

Query: 295 VQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSD 354
           VQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILK+SLNGSD
Sbjct: 260 VQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSD 319

Query: 355 GNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA 414
           GNGP RSPSPAP PQ HN  HPPTHHHHHHHTPLTPAISPAPATEKGAPEY SPAPER+A
Sbjct: 320 GNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNA 379

Query: 415 ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPP 474
           ASPKRSY AKPPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPP
Sbjct: 380 ASPKRSYTAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAA--SPSPQHQINPP 439

Query: 475 AAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPLVAPSPSPSGADRRRMITQWG 534
           AAPVS APALTPLPNV+YAHVQPPSKSDS+HP     NP +A  PSPSGADR  MITQWG
Sbjct: 440 AAPVSPAPALTPLPNVIYAHVQPPSKSDSNHP----ANPSIA--PSPSGADRCHMITQWG 499

Query: 535 FTLFLILARHM 546
           FTLFLILA HM
Sbjct: 500 FTLFLILACHM 502

BLAST of HG10017546 vs. NCBI nr
Match: KAA0025811.1 (Zinc finger family protein, putative isoform 1 [Cucumis melo var. makuwa] >TYK09645.1 Zinc finger family protein, putative isoform 1 [Cucumis melo var. makuwa])

HSP 1 Score: 721.1 bits (1860), Expect = 7.3e-204
Identity = 383/431 (88.86%), Postives = 398/431 (92.34%), Query Frame = 0

Query: 115 GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLD 174
           GHDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFS+D
Sbjct: 11  GHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSID 70

Query: 175 PDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAF 234
           PD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA+SFEVLKFP GITIIPPQSAF
Sbjct: 71  PDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF 130

Query: 235 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTI 294
           LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTI
Sbjct: 131 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYEILYIKLWNAEGSTVTAPTI 190

Query: 295 VQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSD 354
           VQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNN EFGKVKQVRLSSILK+SLNGS+
Sbjct: 191 VQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNAEFGKVKQVRLSSILKHSLNGSE 250

Query: 355 GNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA 414
           GNGP RSPSPAP PQ HN+ HPPTHHHHHHHTPL  AISPAPATEKGAPEY SPAPERSA
Sbjct: 251 GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSA 310

Query: 415 ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPP 474
           ASP+RSY A+PPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPP
Sbjct: 311 ASPQRSYTAEPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAA--SPSPQHQINPP 370

Query: 475 AAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPLVAPSPSPSGADRRRMITQWG 534
           AAPVS APALTPLPNV+YAHVQPPSKSDS+ P     NP VA  PSPSGADR  MITQWG
Sbjct: 371 AAPVSPAPALTPLPNVIYAHVQPPSKSDSNDP----ANPSVA--PSPSGADRCHMITQWG 430

Query: 535 FTLFLILARHM 546
           FTLFLILARHM
Sbjct: 431 FTLFLILARHM 433

BLAST of HG10017546 vs. NCBI nr
Match: XP_008455751.1 (PREDICTED: uncharacterized protein LOC103495852 [Cucumis melo])

HSP 1 Score: 721.1 bits (1860), Expect = 7.3e-204
Identity = 383/431 (88.86%), Postives = 398/431 (92.34%), Query Frame = 0

Query: 115 GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLD 174
           GHDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFS+D
Sbjct: 80  GHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSID 139

Query: 175 PDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAF 234
           PD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA+SFEVLKFP GITIIPPQSAF
Sbjct: 140 PDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF 199

Query: 235 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTI 294
           LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTI
Sbjct: 200 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYEILYIKLWNAEGSTVTAPTI 259

Query: 295 VQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSD 354
           VQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNN EFGKVKQVRLSSILK+SLNGS+
Sbjct: 260 VQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNAEFGKVKQVRLSSILKHSLNGSE 319

Query: 355 GNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA 414
           GNGP RSPSPAP PQ HN+ HPPTHHHHHHHTPL  AISPAPATEKGAPEY SPAPERSA
Sbjct: 320 GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSA 379

Query: 415 ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPP 474
           ASP+RSY A+PPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPP
Sbjct: 380 ASPQRSYTAEPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAA--SPSPQHQINPP 439

Query: 475 AAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPLVAPSPSPSGADRRRMITQWG 534
           AAPVS APALTPLPNV+YAHVQPPSKSDS+ P     NP VA  PSPSGADR  MITQWG
Sbjct: 440 AAPVSPAPALTPLPNVIYAHVQPPSKSDSNDP----ANPSVA--PSPSGADRCHMITQWG 499

Query: 535 FTLFLILARHM 546
           FTLFLILARHM
Sbjct: 500 FTLFLILARHM 502

BLAST of HG10017546 vs. ExPASy TrEMBL
Match: A0A0A0KYS3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G420140 PE=4 SV=1)

HSP 1 Score: 730.7 bits (1885), Expect = 4.4e-207
Identity = 388/431 (90.02%), Postives = 399/431 (92.58%), Query Frame = 0

Query: 115 GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLD 174
           GHDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFSLD
Sbjct: 80  GHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLD 139

Query: 175 PDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAF 234
           PD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA+SFEVLKFP GITIIPPQSAF
Sbjct: 140 PDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF 199

Query: 235 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTI 294
           LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVT PTI
Sbjct: 200 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTI 259

Query: 295 VQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSD 354
           VQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILK+SLNGSD
Sbjct: 260 VQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSD 319

Query: 355 GNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA 414
           GNGP RSPSPAP PQ HN  HPPTHHHHHHHTPLTPAISPAPATEKGAPEY SPAPER+A
Sbjct: 320 GNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNA 379

Query: 415 ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPP 474
           ASPKRSY AKPPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPP
Sbjct: 380 ASPKRSYTAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAA--SPSPQHQINPP 439

Query: 475 AAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPLVAPSPSPSGADRRRMITQWG 534
           AAPVS APALTPLPNV+YAHVQPPSKSDS+HP     NP +A  PSPSGADR  MITQWG
Sbjct: 440 AAPVSPAPALTPLPNVIYAHVQPPSKSDSNHP----ANPSIA--PSPSGADRCHMITQWG 499

Query: 535 FTLFLILARHM 546
           FTLFLILA HM
Sbjct: 500 FTLFLILACHM 502

BLAST of HG10017546 vs. ExPASy TrEMBL
Match: A0A5A7SNH7 (Zinc finger family protein, putative isoform 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold447G00440 PE=4 SV=1)

HSP 1 Score: 721.1 bits (1860), Expect = 3.5e-204
Identity = 383/431 (88.86%), Postives = 398/431 (92.34%), Query Frame = 0

Query: 115 GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLD 174
           GHDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFS+D
Sbjct: 11  GHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSID 70

Query: 175 PDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAF 234
           PD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA+SFEVLKFP GITIIPPQSAF
Sbjct: 71  PDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF 130

Query: 235 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTI 294
           LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTI
Sbjct: 131 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYEILYIKLWNAEGSTVTAPTI 190

Query: 295 VQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSD 354
           VQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNN EFGKVKQVRLSSILK+SLNGS+
Sbjct: 191 VQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNAEFGKVKQVRLSSILKHSLNGSE 250

Query: 355 GNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA 414
           GNGP RSPSPAP PQ HN+ HPPTHHHHHHHTPL  AISPAPATEKGAPEY SPAPERSA
Sbjct: 251 GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSA 310

Query: 415 ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPP 474
           ASP+RSY A+PPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPP
Sbjct: 311 ASPQRSYTAEPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAA--SPSPQHQINPP 370

Query: 475 AAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPLVAPSPSPSGADRRRMITQWG 534
           AAPVS APALTPLPNV+YAHVQPPSKSDS+ P     NP VA  PSPSGADR  MITQWG
Sbjct: 371 AAPVSPAPALTPLPNVIYAHVQPPSKSDSNDP----ANPSVA--PSPSGADRCHMITQWG 430

Query: 535 FTLFLILARHM 546
           FTLFLILARHM
Sbjct: 431 FTLFLILARHM 433

BLAST of HG10017546 vs. ExPASy TrEMBL
Match: A0A1S3C173 (uncharacterized protein LOC103495852 OS=Cucumis melo OX=3656 GN=LOC103495852 PE=4 SV=1)

HSP 1 Score: 721.1 bits (1860), Expect = 3.5e-204
Identity = 383/431 (88.86%), Postives = 398/431 (92.34%), Query Frame = 0

Query: 115 GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLD 174
           GHDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFS+D
Sbjct: 80  GHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSID 139

Query: 175 PDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAF 234
           PD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA+SFEVLKFP GITIIPPQSAF
Sbjct: 140 PDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF 199

Query: 235 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTI 294
           LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTI
Sbjct: 200 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYEILYIKLWNAEGSTVTAPTI 259

Query: 295 VQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSD 354
           VQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNN EFGKVKQVRLSSILK+SLNGS+
Sbjct: 260 VQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNAEFGKVKQVRLSSILKHSLNGSE 319

Query: 355 GNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA 414
           GNGP RSPSPAP PQ HN+ HPPTHHHHHHHTPL  AISPAPATEKGAPEY SPAPERSA
Sbjct: 320 GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSA 379

Query: 415 ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPP 474
           ASP+RSY A+PPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPP
Sbjct: 380 ASPQRSYTAEPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAA--SPSPQHQINPP 439

Query: 475 AAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPLVAPSPSPSGADRRRMITQWG 534
           AAPVS APALTPLPNV+YAHVQPPSKSDS+ P     NP VA  PSPSGADR  MITQWG
Sbjct: 440 AAPVSPAPALTPLPNVIYAHVQPPSKSDSNDP----ANPSVA--PSPSGADRCHMITQWG 499

Query: 535 FTLFLILARHM 546
           FTLFLILARHM
Sbjct: 500 FTLFLILARHM 502

BLAST of HG10017546 vs. ExPASy TrEMBL
Match: A0A6J1HW39 (uncharacterized protein LOC111467196 OS=Cucurbita maxima OX=3661 GN=LOC111467196 PE=4 SV=1)

HSP 1 Score: 658.7 bits (1698), Expect = 2.1e-185
Identity = 360/438 (82.19%), Postives = 377/438 (86.07%), Query Frame = 0

Query: 115 GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLD 174
           GHDI+ATFNVERPVSLL+DN+EQLQTDIFEEFPIPSIKVD+L L+ LSGSN T VVFSLD
Sbjct: 80  GHDILATFNVERPVSLLKDNVEQLQTDIFEEFPIPSIKVDVLYLDSLSGSNCTTVVFSLD 139

Query: 175 PDGDDLEISSTYLSLIRSTIASLVTNQ-FLRITKSMFGEAFSFEVLKFPRGITIIPPQSA 234
            D DD EIS TYLSLIRST ASLVTNQ FL +TKSMFGEAFSFEVLKFP GITIIPPQSA
Sbjct: 140 SDMDDSEISQTYLSLIRSTFASLVTNQSFLHVTKSMFGEAFSFEVLKFPGGITIIPPQSA 199

Query: 235 FLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPT 294
           FLLQKVQILFNFTLNFS+HQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPT
Sbjct: 200 FLLQKVQILFNFTLNFSVHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPT 259

Query: 295 IVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGS 354
           IVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILK+ LNGS
Sbjct: 260 IVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHYLNGS 319

Query: 355 DGNGPARSPSPAPA------PQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSS 414
           +GN P RSPSPAPA      P +HNY HPPT HHHHHHTP+TPAISPAP TEKGAPEY S
Sbjct: 320 EGNSPVRSPSPAPAPAPTPQPHNHNYHHPPTRHHHHHHTPVTPAISPAPTTEKGAPEYGS 379

Query: 415 PAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSP 474
           PAPER+AASPKRS  A+PPGCQYRYKRKS RKEGKQ           SP HSA  PSPSP
Sbjct: 380 PAPERTAASPKRSSKAEPPGCQYRYKRKSSRKEGKQ-----------SPVHSA--PSPSP 439

Query: 475 QHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPLVAPSPSPSGADRR 534
           +H+V      VS APAL PLPNVVY HVQPPSKS+S+H E S  NP  APSPSPSGADR 
Sbjct: 440 KHRV------VSPAPALAPLPNVVYTHVQPPSKSNSNHHEPSMMNPSFAPSPSPSGADRH 498

Query: 535 RMITQWGFTLFLILARHM 546
           R ITQWGFTLFLILA HM
Sbjct: 500 RTITQWGFTLFLILAHHM 498

BLAST of HG10017546 vs. ExPASy TrEMBL
Match: A0A6J1HSR1 (uncharacterized protein LOC111466276 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111466276 PE=4 SV=1)

HSP 1 Score: 654.1 bits (1686), Expect = 5.3e-184
Identity = 358/432 (82.87%), Postives = 380/432 (87.96%), Query Frame = 0

Query: 115 GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLD 174
           GHDIVATF VERPVSLLEDNIE+L+TDIFEEFPIPSIKVDILSL  LSGSNRTKVVF +D
Sbjct: 76  GHDIVATFVVERPVSLLEDNIERLRTDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGID 135

Query: 175 PDGDDLEISSTYLSLIRSTIASLVTNQ-FLRITKSMFGEAFSFEVLKFPRGITIIPPQSA 234
           PD DD EI STYLSLIRST ASLVTNQ FLRITKSMFGEAFSFEVLKFP GITIIPPQSA
Sbjct: 136 PDTDDPEIPSTYLSLIRSTCASLVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSA 195

Query: 235 FLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPT 294
           FLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPT
Sbjct: 196 FLLQKVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPT 255

Query: 295 IVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGS 354
           IVQSSVLLEVGNTPSM+RLKQLAQTIS SNSSNLGLNNTEFGKVKQVRLSSILK+SLNG 
Sbjct: 256 IVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGM 315

Query: 355 DGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERS 414
           DG GP RSPSPAP PQSHN+ HPP+HHHHHHH+PLTP ISPAPA E GAPEY  PAP +S
Sbjct: 316 DGKGPIRSPSPAPTPQSHNFHHPPSHHHHHHHSPLTPVISPAPAPETGAPEYGLPAP-KS 375

Query: 415 AASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNP 474
           AASPKRSY AKPPGCQ  YKRKSGRKEGKQ +L+PLASP ISP HSAA  SPS QH V+P
Sbjct: 376 AASPKRSYEAKPPGCQ--YKRKSGRKEGKQPYLSPLASPSISPVHSAA--SPSQQHHVSP 435

Query: 475 PAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPLVAPSPSPSGADRRRMITQW 534
                    A TPLP+V+YAHVQPPSKS+S+HPEKSTT+P + PSPSPS A    MIT+W
Sbjct: 436 -------TQASTPLPSVIYAHVQPPSKSESNHPEKSTTSPSIVPSPSPSSAHHWCMITRW 495

Query: 535 GFTLFLILARHM 546
            FTL LI+A +M
Sbjct: 496 RFTLSLIVAFYM 495

BLAST of HG10017546 vs. TAIR 10
Match: AT3G56590.2 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 310.8 bits (795), Expect = 2.1e-84
Identity = 201/408 (49.26%), Postives = 254/408 (62.25%), Query Frame = 0

Query: 116 HDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDP 175
           H IVA+F+V +P+S +EDN+ QL+ DI +E   P  KV +L+LE L   NRT V+F++DP
Sbjct: 84  HRIVASFDVGKPISFMEDNLMQLENDITDEISFPMTKVVVLALERLGDLNRTMVIFAIDP 143

Query: 176 DGDDLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGEAFSFEVLKFPRGITIIPPQSAF 235
           + ++ +I +   SLI++   +LV  Q   R+T+S+FGE F FEVLKFP GIT+IPPQ  F
Sbjct: 144 EKENSKIPAEIESLIKAAFETLVQKQLSFRLTESLFGEPFFFEVLKFPGGITVIPPQPIF 203

Query: 236 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTI 295
            LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE LYI L N+ GSTV  PTI
Sbjct: 204 PLQKAQLLFNFTLNFSIYQIQSNFEELASQLKKGINLASYENLYITLSNSRGSTVAPPTI 263

Query: 296 VQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSD 355
           V SSVLL  G   S  RLKQLAQTI+ S+S NLGLN+T FGKVKQVRLSSIL +S     
Sbjct: 264 VHSSVLLTFG---SSSRLKQLAQTITSSHSKNLGLNHTVFGKVKQVRLSSILPHS----- 323

Query: 356 GNGPARS--PSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPER 415
              PA S  PSP+P P++H YPH   HHHHHHH  L P  S +P T+  AP   + AP +
Sbjct: 324 ---PATSSTPSPSPQPETHQYPHHHPHHHHHHH-ELAPEPSLSPPTKGFAP---ASAPTK 383

Query: 416 SAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVN 475
            +  P R+     P C Y  +R  G          P  +P  S  H  A     P+H   
Sbjct: 384 HSPLPPRN-----PPCPYEQRRPKGNSALNHHTAPPTPAPHRSQPHPPAPNPAPPRHH-- 443

Query: 476 PPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPLVAPSPS 521
             A PVS     +PLP+VV+AH+ PPSKS          +P  AP+PS
Sbjct: 444 --AIPVS-----SPLPHVVFAHIPPPSKSSPESEPTGEKSPSPAPTPS 462

BLAST of HG10017546 vs. TAIR 10
Match: AT3G56590.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 310.5 bits (794), Expect = 2.8e-84
Identity = 201/412 (48.79%), Postives = 255/412 (61.89%), Query Frame = 0

Query: 116 HDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDP 175
           H IVA+F+V +P+S +EDN+ QL+ DI +E   P  KV +L+LE L   NRT V+F++DP
Sbjct: 84  HRIVASFDVGKPISFMEDNLMQLENDITDEISFPMTKVVVLALERLGDLNRTMVIFAIDP 143

Query: 176 DGDDLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGEAFSFEVLKFPRGITIIPPQSAF 235
           + ++ +I +   SLI++   +LV  Q   R+T+S+FGE F FEVLKFP GIT+IPPQ  F
Sbjct: 144 EKENSKIPAEIESLIKAAFETLVQKQLSFRLTESLFGEPFFFEVLKFPGGITVIPPQPIF 203

Query: 236 LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTI 295
            LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE LYI L N+ GSTV  PTI
Sbjct: 204 PLQKAQLLFNFTLNFSIYQIQSNFEELASQLKKGINLASYENLYITLSNSRGSTVAPPTI 263

Query: 296 VQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSD 355
           V SSVLL  G   S  RLKQLAQTI+ S+S NLGLN+T FGKVKQVRLSSIL +S     
Sbjct: 264 VHSSVLLTFG---SSSRLKQLAQTITSSHSKNLGLNHTVFGKVKQVRLSSILPHS----- 323

Query: 356 GNGPARS--PSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPER 415
              PA S  PSP+P P++H YPH   HHHHHHH  L P  S +P T+  AP   + AP +
Sbjct: 324 ---PATSSTPSPSPQPETHQYPHHHPHHHHHHH-ELAPEPSLSPPTKGFAP---ASAPTK 383

Query: 416 SAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVN 475
            +  P R+     P C Y  +R  G          P  +P  S  H  A     P+H   
Sbjct: 384 HSPLPPRN-----PPCPYEQRRPKGNSALNHHTAPPTPAPHRSQPHPPAPNPAPPRHH-- 443

Query: 476 PPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPLVAPSPSPSGA 525
             A PVS     +PLP+VV+AH+ PPSKS          +P  AP+P  S +
Sbjct: 444 --AIPVS-----SPLPHVVFAHIPPPSKSSPESEPTGEKSPSPAPTPCKSSS 466

BLAST of HG10017546 vs. TAIR 10
Match: AT3G10810.1 (zinc finger (C3HC4-type RING finger) family protein )

HSP 1 Score: 301.2 bits (770), Expect = 1.7e-81
Identity = 217/443 (48.98%), Postives = 267/443 (60.27%), Query Frame = 0

Query: 115 GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLD 174
           GH IVA+F++ R  S L +N  QLQ DIF+E    SIKV IL++EP    N TKVVF +D
Sbjct: 78  GHAIVASFSINRSASFLNENTLQLQNDIFQEMSYISIKVTILAVEPSDELNITKVVFGID 137

Query: 175 PDGDDLEISSTYLSLIRSTIASLVTNQ-FLRITKSMFGEAFSFEVLKFPRGITIIPPQSA 234
           PD    EI    LS I+    S++ NQ  L++TKS+FGE F FEVLKFP GIT+IPPQSA
Sbjct: 138 PDTGYREILPLSLSSIKEMFESVLINQSTLQLTKSLFGETFLFEVLKFPGGITVIPPQSA 197

Query: 235 FLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPT 294
           F LQK +I+FNFTLN+SIHQIQ++F+ L SQL+ GL LAPYE LY+ L N+EGSTV+ PT
Sbjct: 198 FPLQKFKIVFNFTLNYSIHQIQINFNTLASQLKNGLNLAPYENLYVSLSNSEGSTVSPPT 257

Query: 295 IVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGS 354
            V SSVLL VG + S  RLKQL  TI+GS S NLGLNNT FGKVKQVRLSS L    N S
Sbjct: 258 TVHSSVLLRVGTSNSSPRLKQLTDTITGSRSKNLGLNNTIFGKVKQVRLSSFLP---NSS 317

Query: 355 DGNGPARSPSPAPAPQS-HNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYS---SPA 414
           D +   +SPSP+P+P S H++ H   HHHHHHH            + K APE S   SPA
Sbjct: 318 DSS--TKSPSPSPSPHSKHHHHHHHHHHHHHHHHHNHHHHHHHNLSPKMAPEVSPVASPA 377

Query: 415 PERSAASPKRSYAAKP---PGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPS 474
           P RS    KR+ +A P   PG +  +K K       Q   TP  +P           + +
Sbjct: 378 PHRSR---KRAPSAPPPCNPGNRVHFKEKR-----VQFSSTPAPAP----------SAGA 437

Query: 475 PQHQVNPPAAPVSRA-----PALTPLPNVVYAH-VQPPSKSDSSHPEKSTTNPLVAPSP- 534
           P HQ++ P AP+S A     P   PLP+VV+AH  QPP     + P +   N +  P P 
Sbjct: 438 PHHQLHSP-APISAAKSHIVPISAPLPHVVFAHAAQPP----ITEPREPHANEVAHPQPQ 492

Query: 535 SPSGADRRRMITQWGFTLFLILA 543
           S S A        W   L LI+A
Sbjct: 498 SSSSAIEVLPAMPWIVLLMLIVA 492

BLAST of HG10017546 vs. TAIR 10
Match: AT1G10790.1 (BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2); Has 78 Blast hits to 78 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 78; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 117.1 bits (292), Expect = 4.5e-26
Identity = 88/253 (34.78%), Postives = 130/253 (51.38%), Query Frame = 0

Query: 118 IVATFNVERPVSLLEDNIEQLQTDIFEEFPIP-SIKVDILSLEPLSGSNRTKVVFSLDPD 177
           + A+F +++PVS +  +  +++ DI     +  + KV +LSL     SN T V F++ P 
Sbjct: 84  VQASFRLQKPVSEVVRHKGKIEHDILRSIGLSNNSKVTVLSLNQSGASNYTDVEFAVLPV 143

Query: 178 GDDLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGEAFSFEVLKFPRGITIIPPQSAFL 237
             D EIS   LSL+RS+   L   +  L++T S FG+  SF+VLKFP GIT+ P + A +
Sbjct: 144 PPDHEISKHSLSLLRSSFVKLFAKRSKLKLTTSGFGKPTSFQVLKFPGGITVDPLEPAPV 203

Query: 238 LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIV 297
                +LF+ T+  SI  +Q     L    E  L L PYE ++ +L N +GST++ P   
Sbjct: 204 SGVALVLFSVTIKTSISTVQDRLDLLNGLFEHMLSLEPYESVHFQLTNKQGSTISPPLTF 263

Query: 298 QSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDG 357
           Q  V   +      +RL    Q I  S + NLGL+   FG+VK +  S+ L       DG
Sbjct: 264 QVYVAFTM-RKYLHQRLNHFTQIIQTSRAKNLGLDEAVFGEVKDITFSTYL-------DG 323

Query: 358 NGPARSPSPAPAP 369
             P      APAP
Sbjct: 324 KVPDSDLELAPAP 328

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038882638.15.4e-20790.72uncharacterized protein LOC120073837 [Benincasa hispida][more]
KGN54878.29.2e-20790.02hypothetical protein Csa_012907 [Cucumis sativus][more]
XP_004144318.19.2e-20790.02uncharacterized protein LOC101216010 isoform X1 [Cucumis sativus][more]
KAA0025811.17.3e-20488.86Zinc finger family protein, putative isoform 1 [Cucumis melo var. makuwa] >TYK09... [more]
XP_008455751.17.3e-20488.86PREDICTED: uncharacterized protein LOC103495852 [Cucumis melo][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KYS34.4e-20790.02Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G420140 PE=4 SV=1[more]
A0A5A7SNH73.5e-20488.86Zinc finger family protein, putative isoform 1 OS=Cucumis melo var. makuwa OX=11... [more]
A0A1S3C1733.5e-20488.86uncharacterized protein LOC103495852 OS=Cucumis melo OX=3656 GN=LOC103495852 PE=... [more]
A0A6J1HW392.1e-18582.19uncharacterized protein LOC111467196 OS=Cucurbita maxima OX=3661 GN=LOC111467196... [more]
A0A6J1HSR15.3e-18482.87uncharacterized protein LOC111466276 isoform X3 OS=Cucurbita maxima OX=3661 GN=L... [more]
Match NameE-valueIdentityDescription
AT3G56590.22.1e-8449.26hydroxyproline-rich glycoprotein family protein [more]
AT3G56590.12.8e-8448.79hydroxyproline-rich glycoprotein family protein [more]
AT3G10810.11.7e-8148.98zinc finger (C3HC4-type RING finger) family protein [more]
AT1G10790.14.5e-2634.78BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein fam... [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 496..518
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 349..527
NoneNo IPR availablePANTHERPTHR33826F20B24.21coord: 115..541
NoneNo IPR availablePANTHERPTHR33826:SF2HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY PROTEINcoord: 115..541

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10017546.1HG10017546.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004175 endopeptidase activity
molecular_function GO:0008236 serine-type peptidase activity