HG10020873 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10020873
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of Unknown Function (DUF239)
LocationChr05: 3155822 .. 3159037 (+)
RNA-Seq ExpressionHG10020873
SyntenyHG10020873
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTTCTTCTTCAACTTCTTCTTCTTCTTCTTGTTCTTGTTCTTGTTTTGTTGTTTTCCTTCTGCTTTTTACTTCTTTCTCCTCTGTTTTCTCAACTTCCATATCCCATCAAATCCCACCAAAAAACCAAACTTTTTTCCACCCAGCTAAAGAGCTCAAGAAACTCAAGCACGTCAGAGCTTATTTACGCAAAATCAACAAGCCTCCAATCAAGACAATTCGGGTAAAAAAAATTTATAAAAAAAATGAAGTTTTTTTTTTTTTCTTTTGAGTGTTCTGCAAAATTTAACAATTTGGGTTTTTTTTTTTTTTTTTTTTTTTTGGGTTTGTTTGTATTGTTGAAAGCAGAGTTCAGATGGTGATGTTATAGACTGTGTTCTTTCTCATCTTCAGCCTGCTTTTGACCATCCTGAGCTCAAAGGGCAAACTCCATTGGTAGTTAACCAATTTCCCCCCTTTTGTGGGTGAAAATTTTAGTGATTATTTATGAATTTAATTGAGAAAATTGGATGAACAGGAACCACCTGAGAGGCCAAGAGGGAACAAATCCATGGAAGAAGTAGCAGAAAATTTCCAATTATGGTCAGTTTCAGGGGATTTTTGCCCTGAAGGAACAATTCCAATAAGAAGAACAACAGAAAAAGACATTTTCAGAGCAAGTTCTTTTCGAAGATTTGGAAGAAAACCCATTAGACATGTGAGAAGAGATTCTTCTGGGAATGGTCATGAGGTTAGAAATTACCTTCAATTTTCATTTTATTTATACAAAAATTAATGGAAATTAAAAGAGTATTATTGAATGGATTTGCAGCATGCTGTGGTATTTGTGAATGGAGAACAATATTATGGAGCAAAGGCGAGTTTAAACATATGGGCGCCACGTGTAACTGATCAATACGAATTTAGTTTATCACAAATATGGGTAATTTCAGGGTCTTTTGGGAATGATTTGAACACCATTGAAGCTGGATGGCAGGTAAGTTTATGATCTGAATGTTCTTGGAAATTGTTATTTTGGATTTGAAATTTTTGTGGTTTAATCTTTTTTTTTTTTTTTGGGTTCAGGTTAGTCCTGAACTGTATGGCGACAACAATCCTAGATTCTTTACGTATTGGACAGTGAGTTTCATGTTTTTTTTTTTTTTTTTTTGGTTGGATTATGTTTAAGGATTTGCAGAATTTGAAGAACAGAGAGACAATTAATTATGATTTTTAATTTTGTTTAATTTTTGGGCAGACTGATGCTTATCAAGCCACTGGGTGTTATAATTTACTTTGTTCTGGGTTCGTTCAAACCAATAATAGGATCGCCATTGGAGCAGCAATTTCGCCTATATCTTCTTACAGTGGCAAGCAATTCGATATTGGTTTAATGGTTTGGAAGGTAAGTTCATTTACTGTTTCCCTTTTTTATTATTATTTTAATTCATTCATATTGAGTTCGTCGAATGTGAGGTGGAAGTCAAATATTCGATTTTTTTCCTTTTTCTTTTTGAAAAATCAAACCTTCGATCTTAAGGTTTTGCATACAACTTACAAGTGCCTTAAACTAAATTATGTTTTTTCTTGTCTTTAATTATTTATTTTTAGAAAATTAGAGTAATTAATAGGAGAATTTTTAAAAATAAAAAAAATAAAGAAAACTATTTACACAAAATAACAAATATGTTAGATAATTGTAATAGACGCTGATAGAAGTCTATCACTAATAGATGCTTATAGAAGTCTATCAATATTTTTTTTTTGTTATTTTCTGTAAATAGTTTGACATTTTTTTTATATGTGAAAATTTTTCTAATTAATAGAAAAATTATTATAAACAGAGAAAATATAAATATTTGTTAATATTTGTTTTCAAATAATATTTCTATATTTGTAAATAGTTTGACTCATTTTTCTATATTTGAAAACAACTCTAAATGAATTTTGTGAGACTACAACTTTCCATTTGTATGTTTAGTTAAATTTTATAAATAATATAAAATTGTTCAATACAAAATTGAATGTATAAAGACTAAATAATACATTTTTGGAGAATTTTTAAAAATAAAAAATAAAGAAAACTATTTACACAAAATAGCAAAATTTTTAGATAGTTGTAATGGACGTTGATAGAAGTATATCAATATCAGTATTTTTTTTTTGTTTTTTTTCTGTAAATAGTTTGTAATTTTGTATCGAACAATTTTCTATTATTTATAAAGTTTAACTAAACATGCAAATGGAAAGTTGTAGTCTCGTAAAGTAAATTTATGTGTTTGATAAGTTAGTAATTTGTGAAATGTCACAGACTTATTAAACATATAATTATAATTATAGAAATTGAAGAAATACTTTGAAACTTAACATATGATTACTAAACACTTTTTAACCGTTTTTTTCAAATTAAAAACACTTTTTTACGACACACCCCCTTAAAAGTTTAAAGACTACAGACTACTAAACCGTATTTTAAGAAATCAAAACTGTTAACCAAACAATTCTTACATTTTATTCCTTCAAAAACTAAAGAAAAGACATGTCCATTCACTGCATCCACTATGGTTATTTCATATCCTCTTTTTTCTTTTCATATTATTTTGCTTACTATACTTGCCGTTGTATCATTATAAACATTATTATTACTCTTACTCCCCTTAGGGATAAATCTGCCCCTTCCCTTGCCTACGTAGATAGAATATTTTAAGAAGTGAACAACTTTCTTAGTCGTTCAAAAACTAAAAAACAAGATAAACAAAAAGAAAAAAAATTAGAAAACGTTACGAAACGAGCACTAAGCTAAGTTTGATAAGTTCATTAATGATTGAACGTACGTACATATACGGTTTTCAAATAGGATCCAAAGCACGGGCATTGGTGGTTGGAATACGGGTCGGGTTTGCTAGTCGGGTATTGGCCAGCATTTCTATTCAGCCATTTACGGAGCCATGCTAGCATGGTACAATTTGGAGGGGAAATAGTGAACAGCAGATCTTCAGGGTTTCATACAGCCACTCAAATGGGGAGTGGTCATTTTGCTGAAGAAGGGTTTGGAAAAGCTTCTTATTTCAGGAACTTGCAAGTAGTTGATTGGGATAACAATTTGCTTCCTCTTACAAATCTTCATCTCTTGGCTGACCATTCTGATTGCTATGATATTAGACAAGCCTCTAATAATGTTTGGGGCACTTATTTTTACTATGGAGGGCCTGGTAGAAATGTCAAATGCCCTTAG

mRNA sequence

ATGGCTTCTTCTTCAACTTCTTCTTCTTCTTCTTGTTCTTGTTCTTGTTTTGTTGTTTTCCTTCTGCTTTTTACTTCTTTCTCCTCTGTTTTCTCAACTTCCATATCCCATCAAATCCCACCAAAAAACCAAACTTTTTTCCACCCAGCTAAAGAGCTCAAGAAACTCAAGCACGTCAGAGCTTATTTACGCAAAATCAACAAGCCTCCAATCAAGACAATTCGGAGTTCAGATGGTGATGTTATAGACTGTGTTCTTTCTCATCTTCAGCCTGCTTTTGACCATCCTGAGCTCAAAGGGCAAACTCCATTGGAACCACCTGAGAGGCCAAGAGGGAACAAATCCATGGAAGAAGTAGCAGAAAATTTCCAATTATGGTCAGTTTCAGGGGATTTTTGCCCTGAAGGAACAATTCCAATAAGAAGAACAACAGAAAAAGACATTTTCAGAGCAAGTTCTTTTCGAAGATTTGGAAGAAAACCCATTAGACATGTGAGAAGAGATTCTTCTGGGAATGGTCATGAGCATGCTGTGGTATTTGTGAATGGAGAACAATATTATGGAGCAAAGGCGAGTTTAAACATATGGGCGCCACGTGTAACTGATCAATACGAATTTAGTTTATCACAAATATGGGTAATTTCAGGGTCTTTTGGGAATGATTTGAACACCATTGAAGCTGGATGGCAGGTTAGTCCTGAACTGTATGGCGACAACAATCCTAGATTCTTTACGTATTGGACAACTGATGCTTATCAAGCCACTGGGTGTTATAATTTACTTTGTTCTGGGTTCGTTCAAACCAATAATAGGATCGCCATTGGAGCAGCAATTTCGCCTATATCTTCTTACAGTGGCAAGCAATTCGATATTGGTTTAATGGTTTGGAAGGATCCAAAGCACGGGCATTGGTGGTTGGAATACGGGTCGGGTTTGCTAGTCGGGTATTGGCCAGCATTTCTATTCAGCCATTTACGGAGCCATGCTAGCATGGTACAATTTGGAGGGGAAATAGTGAACAGCAGATCTTCAGGGTTTCATACAGCCACTCAAATGGGGAGTGGTCATTTTGCTGAAGAAGGGTTTGGAAAAGCTTCTTATTTCAGGAACTTGCAAGTAGTTGATTGGGATAACAATTTGCTTCCTCTTACAAATCTTCATCTCTTGGCTGACCATTCTGATTGCTATGATATTAGACAAGCCTCTAATAATGTTTGGGGCACTTATTTTTACTATGGAGGGCCTGGTAGAAATGTCAAATGCCCTTAG

Coding sequence (CDS)

ATGGCTTCTTCTTCAACTTCTTCTTCTTCTTCTTGTTCTTGTTCTTGTTTTGTTGTTTTCCTTCTGCTTTTTACTTCTTTCTCCTCTGTTTTCTCAACTTCCATATCCCATCAAATCCCACCAAAAAACCAAACTTTTTTCCACCCAGCTAAAGAGCTCAAGAAACTCAAGCACGTCAGAGCTTATTTACGCAAAATCAACAAGCCTCCAATCAAGACAATTCGGAGTTCAGATGGTGATGTTATAGACTGTGTTCTTTCTCATCTTCAGCCTGCTTTTGACCATCCTGAGCTCAAAGGGCAAACTCCATTGGAACCACCTGAGAGGCCAAGAGGGAACAAATCCATGGAAGAAGTAGCAGAAAATTTCCAATTATGGTCAGTTTCAGGGGATTTTTGCCCTGAAGGAACAATTCCAATAAGAAGAACAACAGAAAAAGACATTTTCAGAGCAAGTTCTTTTCGAAGATTTGGAAGAAAACCCATTAGACATGTGAGAAGAGATTCTTCTGGGAATGGTCATGAGCATGCTGTGGTATTTGTGAATGGAGAACAATATTATGGAGCAAAGGCGAGTTTAAACATATGGGCGCCACGTGTAACTGATCAATACGAATTTAGTTTATCACAAATATGGGTAATTTCAGGGTCTTTTGGGAATGATTTGAACACCATTGAAGCTGGATGGCAGGTTAGTCCTGAACTGTATGGCGACAACAATCCTAGATTCTTTACGTATTGGACAACTGATGCTTATCAAGCCACTGGGTGTTATAATTTACTTTGTTCTGGGTTCGTTCAAACCAATAATAGGATCGCCATTGGAGCAGCAATTTCGCCTATATCTTCTTACAGTGGCAAGCAATTCGATATTGGTTTAATGGTTTGGAAGGATCCAAAGCACGGGCATTGGTGGTTGGAATACGGGTCGGGTTTGCTAGTCGGGTATTGGCCAGCATTTCTATTCAGCCATTTACGGAGCCATGCTAGCATGGTACAATTTGGAGGGGAAATAGTGAACAGCAGATCTTCAGGGTTTCATACAGCCACTCAAATGGGGAGTGGTCATTTTGCTGAAGAAGGGTTTGGAAAAGCTTCTTATTTCAGGAACTTGCAAGTAGTTGATTGGGATAACAATTTGCTTCCTCTTACAAATCTTCATCTCTTGGCTGACCATTCTGATTGCTATGATATTAGACAAGCCTCTAATAATGTTTGGGGCACTTATTTTTACTATGGAGGGCCTGGTAGAAATGTCAAATGCCCTTAG

Protein sequence

MASSSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGGPGRNVKCP
Homology
BLAST of HG10020873 vs. NCBI nr
Match: XP_038895687.1 (uncharacterized protein LOC120083860 isoform X1 [Benincasa hispida])

HSP 1 Score: 849.7 bits (2194), Expect = 1.0e-242
Identity = 405/420 (96.43%), Postives = 412/420 (98.10%), Query Frame = 0

Query: 4   SSTSSSSSCSCSCFVVFLLLF-TSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAY 63
           +S+SSSSSCSCSCFVVFLL+F TSFSSVFS+SISHQIPPKNQTFFHPAKELKKLKH+R Y
Sbjct: 2   ASSSSSSSCSCSCFVVFLLVFLTSFSSVFSSSISHQIPPKNQTFFHPAKELKKLKHIRNY 61

Query: 64  LRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRGNKSMEEVAEN 123
           LRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKG TPLEPPERPRGN S EEVAEN
Sbjct: 62  LRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPLEPPERPRGNNSKEEVAEN 121

Query: 124 FQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVN 183
           FQLWS SGDFCPEGTIPIRRTTE+DIFRASSFRRFGRKPIR VRRDSSGNGHEHAVVFVN
Sbjct: 122 FQLWSASGDFCPEGTIPIRRTTERDIFRASSFRRFGRKPIRRVRRDSSGNGHEHAVVFVN 181

Query: 184 GEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPR 243
           GEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPR
Sbjct: 182 GEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPR 241

Query: 244 FFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG 303
           FFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG
Sbjct: 242 FFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG 301

Query: 304 HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGF 363
           HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHT TQMGSGHFAEEGF
Sbjct: 302 HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTGTQMGSGHFAEEGF 361

Query: 364 GKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGGPGRNVKCP 423
           GKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQA+NNVWGTYFYYGGPGRNVKCP
Sbjct: 362 GKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQANNNVWGTYFYYGGPGRNVKCP 421

BLAST of HG10020873 vs. NCBI nr
Match: KAG6573073.1 (hypothetical protein SDJN03_26960, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 820.5 bits (2118), Expect = 6.8e-234
Identity = 385/420 (91.67%), Postives = 403/420 (95.95%), Query Frame = 0

Query: 3   SSSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAY 62
           +SS SSSS  SCSCFVV LL+FTS +SVFSTSI+HQ+P KNQT FHP KELKKLKH+RAY
Sbjct: 2   ASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAY 61

Query: 63  LRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRGNKSMEEVAEN 122
           LRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKG +PLEPPERPR NKSMEEVA+N
Sbjct: 62  LRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPRSNKSMEEVADN 121

Query: 123 FQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVN 182
            QLWS SG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVN
Sbjct: 122 HQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVN 181

Query: 183 GEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPR 242
           GEQYYGAKASLNIWAPRVTDQYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPR
Sbjct: 182 GEQYYGAKASLNIWAPRVTDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPR 241

Query: 243 FFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG 302
           FFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY GKQFDIGLMVWKDPKHG
Sbjct: 242 FFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHG 301

Query: 303 HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGF 362
           HWWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGF
Sbjct: 302 HWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGF 361

Query: 363 GKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGGPGRNVKCP 422
           GKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ SNNVWGTYFYYGGPGR V+CP
Sbjct: 362 GKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFYYGGPGRTVRCP 421

BLAST of HG10020873 vs. NCBI nr
Match: XP_022994621.1 (uncharacterized protein LOC111490277 [Cucurbita maxima])

HSP 1 Score: 818.1 bits (2112), Expect = 3.4e-233
Identity = 383/419 (91.41%), Postives = 403/419 (96.18%), Query Frame = 0

Query: 4   SSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYL 63
           +S+ SSS  SCSCFVVFLL+FTS +SVFSTSI+HQ+P KNQT FHP KELKKLK++RAYL
Sbjct: 2   ASSCSSSCSSCSCFVVFLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKNIRAYL 61

Query: 64  RKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRGNKSMEEVAENF 123
           RKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKG +PLEPPERPR NKSMEEVA+N 
Sbjct: 62  RKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPRSNKSMEEVADNH 121

Query: 124 QLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNG 183
           QLWS SG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNG
Sbjct: 122 QLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNG 181

Query: 184 EQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRF 243
           EQYYGAKASLNIWAPRVTDQYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRF
Sbjct: 182 EQYYGAKASLNIWAPRVTDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRF 241

Query: 244 FTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGH 303
           FTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY GKQFDIGLMVWKDPKHGH
Sbjct: 242 FTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGH 301

Query: 304 WWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFG 363
           WWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFG
Sbjct: 302 WWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFG 361

Query: 364 KASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGGPGRNVKCP 423
           KASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ SNNVWGTYFYYGGPGR V+CP
Sbjct: 362 KASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFYYGGPGRTVRCP 420

BLAST of HG10020873 vs. NCBI nr
Match: XP_023542314.1 (uncharacterized protein LOC111802247 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 817.4 bits (2110), Expect = 5.8e-233
Identity = 385/420 (91.67%), Postives = 402/420 (95.71%), Query Frame = 0

Query: 3   SSSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAY 62
           S S+SSSSS SCSCFVV LL+FTS +S FSTSI+HQ+P KNQT FHP KELKKLK++RAY
Sbjct: 6   SCSSSSSSSSSCSCFVVVLLVFTSLASAFSTSIAHQMPLKNQTLFHPTKELKKLKNIRAY 65

Query: 63  LRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRGNKSMEEVAEN 122
           LRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKG +PLEPPERPR NKSMEEVA+N
Sbjct: 66  LRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPRSNKSMEEVADN 125

Query: 123 FQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVN 182
            QLWS SG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVN
Sbjct: 126 HQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVN 185

Query: 183 GEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPR 242
           GEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPR
Sbjct: 186 GEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPR 245

Query: 243 FFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG 302
           FFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY GKQFDIGLMVWKDPKHG
Sbjct: 246 FFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHG 305

Query: 303 HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGF 362
           HWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGF
Sbjct: 306 HWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGF 365

Query: 363 GKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGGPGRNVKCP 422
           GKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ SNNVWGTYFYYGGPGR V+CP
Sbjct: 366 GKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFYYGGPGRTVRCP 425

BLAST of HG10020873 vs. NCBI nr
Match: XP_022954586.1 (uncharacterized protein LOC111456810 [Cucurbita moschata])

HSP 1 Score: 817.4 bits (2110), Expect = 5.8e-233
Identity = 384/420 (91.43%), Postives = 402/420 (95.71%), Query Frame = 0

Query: 3   SSSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAY 62
           +SS SSSS  SCSCFVV LL+FTS +SVFSTSI+HQ+P KNQT FHP KELKKLKH+RAY
Sbjct: 2   ASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAY 61

Query: 63  LRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRGNKSMEEVAEN 122
           LRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKG +PLEPPERPR NKSMEEVA+N
Sbjct: 62  LRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPRSNKSMEEVADN 121

Query: 123 FQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVN 182
            QLWS SG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVN
Sbjct: 122 HQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVN 181

Query: 183 GEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPR 242
           GEQYYGAKASLNIWAPRVTDQYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPR
Sbjct: 182 GEQYYGAKASLNIWAPRVTDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPR 241

Query: 243 FFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG 302
           FFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISP+SSY GKQFDIGLMVWKDPKHG
Sbjct: 242 FFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHG 301

Query: 303 HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGF 362
           HWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGF
Sbjct: 302 HWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGF 361

Query: 363 GKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGGPGRNVKCP 422
           GKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ SNNVWGTYFYYGGPGR V+CP
Sbjct: 362 GKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFYYGGPGRTVRCP 421

BLAST of HG10020873 vs. ExPASy TrEMBL
Match: A0A6J1JZN8 (uncharacterized protein LOC111490277 OS=Cucurbita maxima OX=3661 GN=LOC111490277 PE=4 SV=1)

HSP 1 Score: 818.1 bits (2112), Expect = 1.6e-233
Identity = 383/419 (91.41%), Postives = 403/419 (96.18%), Query Frame = 0

Query: 4   SSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYL 63
           +S+ SSS  SCSCFVVFLL+FTS +SVFSTSI+HQ+P KNQT FHP KELKKLK++RAYL
Sbjct: 2   ASSCSSSCSSCSCFVVFLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKNIRAYL 61

Query: 64  RKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRGNKSMEEVAENF 123
           RKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKG +PLEPPERPR NKSMEEVA+N 
Sbjct: 62  RKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPRSNKSMEEVADNH 121

Query: 124 QLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNG 183
           QLWS SG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNG
Sbjct: 122 QLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNG 181

Query: 184 EQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRF 243
           EQYYGAKASLNIWAPRVTDQYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRF
Sbjct: 182 EQYYGAKASLNIWAPRVTDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRF 241

Query: 244 FTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGH 303
           FTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY GKQFDIGLMVWKDPKHGH
Sbjct: 242 FTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGH 301

Query: 304 WWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFG 363
           WWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFG
Sbjct: 302 WWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFG 361

Query: 364 KASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGGPGRNVKCP 423
           KASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ SNNVWGTYFYYGGPGR V+CP
Sbjct: 362 KASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFYYGGPGRTVRCP 420

BLAST of HG10020873 vs. ExPASy TrEMBL
Match: A0A6J1GRA4 (uncharacterized protein LOC111456810 OS=Cucurbita moschata OX=3662 GN=LOC111456810 PE=4 SV=1)

HSP 1 Score: 817.4 bits (2110), Expect = 2.8e-233
Identity = 384/420 (91.43%), Postives = 402/420 (95.71%), Query Frame = 0

Query: 3   SSSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAY 62
           +SS SSSS  SCSCFVV LL+FTS +SVFSTSI+HQ+P KNQT FHP KELKKLKH+RAY
Sbjct: 2   ASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAY 61

Query: 63  LRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRGNKSMEEVAEN 122
           LRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKG +PLEPPERPR NKSMEEVA+N
Sbjct: 62  LRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPRSNKSMEEVADN 121

Query: 123 FQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVN 182
            QLWS SG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVN
Sbjct: 122 HQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVN 181

Query: 183 GEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPR 242
           GEQYYGAKASLNIWAPRVTDQYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPR
Sbjct: 182 GEQYYGAKASLNIWAPRVTDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPR 241

Query: 243 FFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG 302
           FFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISP+SSY GKQFDIGLMVWKDPKHG
Sbjct: 242 FFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHG 301

Query: 303 HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGF 362
           HWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGF
Sbjct: 302 HWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGF 361

Query: 363 GKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGGPGRNVKCP 422
           GKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ SNNVWGTYFYYGGPGR V+CP
Sbjct: 362 GKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFYYGGPGRTVRCP 421

BLAST of HG10020873 vs. ExPASy TrEMBL
Match: A0A1S3AXP9 (uncharacterized protein LOC103483723 OS=Cucumis melo OX=3656 GN=LOC103483723 PE=4 SV=1)

HSP 1 Score: 813.1 bits (2099), Expect = 5.3e-232
Identity = 390/428 (91.12%), Postives = 408/428 (95.33%), Query Frame = 0

Query: 1   MASSSTSSSS----SCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKL 60
           MASSS+SSSS    SCSCSCFVV LL+FTSFSSVFS+SISHQIP KNQT FHPA+ELKKL
Sbjct: 1   MASSSSSSSSCSYCSCSCSCFVVLLLVFTSFSSVFSSSISHQIPTKNQTSFHPARELKKL 60

Query: 61  KHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRG-NKS 120
           KH+R YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHP+LKG TPLEPPERPRG N S
Sbjct: 61  KHIRNYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPDLKGHTPLEPPERPRGNNNS 120

Query: 121 MEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPI-RHVRRDSSGNGH 180
           +EE  ENFQLWS SG+FCPEGTIPIRRTTEKDI+RASS+RR+GRKPI RHVRRDSSGNGH
Sbjct: 121 IEEAVENFQLWSESGEFCPEGTIPIRRTTEKDIYRASSYRRYGRKPIRRHVRRDSSGNGH 180

Query: 181 EHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPE 240
           EHAVV+VNGEQYYGAKASLNIWAPRVTDQYEFS+SQIWVISGSF NDLNTIEAGWQVSPE
Sbjct: 181 EHAVVYVNGEQYYGAKASLNIWAPRVTDQYEFSISQIWVISGSFENDLNTIEAGWQVSPE 240

Query: 241 LYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLM 300
           LYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSY GKQFDIGLM
Sbjct: 241 LYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLM 300

Query: 301 VWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGS 360
           VWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGE+VNSRSSGFHT TQMGS
Sbjct: 301 VWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEVVNSRSSGFHTGTQMGS 360

Query: 361 GHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGG 420
           GHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL +LADHSDCYDIRQ +N+VWGTYFYYGG
Sbjct: 361 GHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKVLADHSDCYDIRQTTNSVWGTYFYYGG 420

Query: 421 PGRNVKCP 423
           PGRNVKCP
Sbjct: 421 PGRNVKCP 428

BLAST of HG10020873 vs. ExPASy TrEMBL
Match: A0A6J1KJ26 (uncharacterized protein LOC111495054 OS=Cucurbita maxima OX=3661 GN=LOC111495054 PE=4 SV=1)

HSP 1 Score: 804.3 bits (2076), Expect = 2.4e-229
Identity = 373/410 (90.98%), Postives = 392/410 (95.61%), Query Frame = 0

Query: 13  SCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIK 72
           S SCFVV LL+FTSFSSVF TSI+H+ PPKNQT+FHP+KEL +LKH+RAYLRKINKPP K
Sbjct: 4   SSSCFVVLLLVFTSFSSVFPTSIAHKTPPKNQTYFHPSKELNELKHIRAYLRKINKPPTK 63

Query: 73  TIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRGNKSMEEVAENFQLWSVSGDF 132
           TI+SSDGDVIDCVLSHLQPAFDHP LKG TPL PPERPRGN S EEVAENFQLWS SGDF
Sbjct: 64  TIQSSDGDVIDCVLSHLQPAFDHPHLKGHTPLRPPERPRGNNSGEEVAENFQLWSASGDF 123

Query: 133 CPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKAS 192
           CPEGTIPIRRTTE+DI+RASSFRRFGRKPIRH+RRDSSGNGHEHAVVFVNGEQYYGAKAS
Sbjct: 124 CPEGTIPIRRTTEQDIYRASSFRRFGRKPIRHMRRDSSGNGHEHAVVFVNGEQYYGAKAS 183

Query: 193 LNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAY 252
           LNIWAPRVTDQ EFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAY
Sbjct: 184 LNIWAPRVTDQNEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAY 243

Query: 253 QATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSGL 312
           QATGCYNLLCSGFVQTNNRIAIGAAISP+SSY+GKQFD+G+MVWKDPKHGHWWLEYGSGL
Sbjct: 244 QATGCYNLLCSGFVQTNNRIAIGAAISPMSSYNGKQFDVGIMVWKDPKHGHWWLEYGSGL 303

Query: 313 LVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQ 372
           LVGYWPAFLFSHLRSH SMVQFGGEIVNSR SGFHTAT+MGSGHF EEGF KASYFRNLQ
Sbjct: 304 LVGYWPAFLFSHLRSHGSMVQFGGEIVNSRGSGFHTATEMGSGHFGEEGFAKASYFRNLQ 363

Query: 373 VVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGGPGRNVKCP 423
           VVDWDNNLLPLTNLH+LADHSDCYDIRQ +N+VWGTYFYYGGPGRNVKCP
Sbjct: 364 VVDWDNNLLPLTNLHVLADHSDCYDIRQGTNDVWGTYFYYGGPGRNVKCP 413

BLAST of HG10020873 vs. ExPASy TrEMBL
Match: A0A0A0LTK3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G046260 PE=4 SV=1)

HSP 1 Score: 804.3 bits (2076), Expect = 2.4e-229
Identity = 382/421 (90.74%), Postives = 401/421 (95.25%), Query Frame = 0

Query: 4   SSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYL 63
           +S+SSSSS SCSCFVV LL+FTSFSSV S+SISHQIP KNQT FHPAKELKKLKH+R YL
Sbjct: 2   ASSSSSSSSSCSCFVVLLLVFTSFSSVLSSSISHQIPTKNQTLFHPAKELKKLKHIRNYL 61

Query: 64  RKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRGN-KSMEEVAEN 123
           RKINKPPIK I+SSDGDVIDCVLSHLQPAFDHP+LKG +PLEPPERPRGN  S EE  EN
Sbjct: 62  RKINKPPIKIIQSSDGDVIDCVLSHLQPAFDHPDLKGHSPLEPPERPRGNSNSTEEAIEN 121

Query: 124 FQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVN 183
           FQLWS SG+FCPEGTIPIRRTTEKDI+RASS+RR+GRKPI+HV+RDSSGNGHEHAVV+VN
Sbjct: 122 FQLWSESGEFCPEGTIPIRRTTEKDIYRASSYRRYGRKPIKHVKRDSSGNGHEHAVVYVN 181

Query: 184 GEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPR 243
           GEQYYGAKASLNIWAPRVTDQYEFS+SQIWVISGSF NDLNTIEAGWQVSPELYGDNNPR
Sbjct: 182 GEQYYGAKASLNIWAPRVTDQYEFSISQIWVISGSFENDLNTIEAGWQVSPELYGDNNPR 241

Query: 244 FFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG 303
           FFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSY GKQFDIGLMVWKDPKHG
Sbjct: 242 FFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHG 301

Query: 304 HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRS-SGFHTATQMGSGHFAEEG 363
           HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGE+VNSRS SGFHT TQMGSGHFAEEG
Sbjct: 302 HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEVVNSRSNSGFHTGTQMGSGHFAEEG 361

Query: 364 FGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGGPGRNVKC 423
           FGKASYFRNLQVVDWDNNLLPLTNL +LADHSDCYDIRQ +NNVWGTYFYYGGPGRNVKC
Sbjct: 362 FGKASYFRNLQVVDWDNNLLPLTNLQVLADHSDCYDIRQTTNNVWGTYFYYGGPGRNVKC 421

BLAST of HG10020873 vs. TAIR 10
Match: AT5G50150.1 (Protein of Unknown Function (DUF239) )

HSP 1 Score: 662.1 bits (1707), Expect = 2.9e-190
Identity = 319/423 (75.41%), Postives = 361/423 (85.34%), Query Frame = 0

Query: 1   MASSSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVR 60
           MASSS+SSS++ + +   + L+L  S        +   I  KNQT F P +E++KL+ V 
Sbjct: 1   MASSSSSSSTTSTITSKFIGLMLLLSLQ--LDLLLGSAIHLKNQTSFRPNREIQKLRRVE 60

Query: 61  AYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERP-RGNKSMEEV 120
           AYL KINKP IKTI S DGDVI+CV SHLQPAFDHP+L+GQ PL+ P RP +GN++  E 
Sbjct: 61  AYLSKINKPSIKTIHSPDGDVIECVPSHLQPAFDHPQLQGQKPLDSPYRPSKGNETTYEE 120

Query: 121 AENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVV 180
           + N QLWS+SG+ CP G+IPIR+TT+ D+ RA+S RRFGRK  R +RRDSSG GHEHAVV
Sbjct: 121 SFN-QLWSMSGESCPIGSIPIRKTTKNDVLRANSVRRFGRKLRRPIRRDSSGGGHEHAVV 180

Query: 181 FVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDN 240
           FVNGEQYYGAKAS+N+WAPRVTD YEFSLSQIW+ISGSFG+DLNTIEAGWQVSPELYGDN
Sbjct: 181 FVNGEQYYGAKASINVWAPRVTDAYEFSLSQIWLISGSFGHDLNTIEAGWQVSPELYGDN 240

Query: 241 NPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDP 300
            PRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP SSY+G+QFDIGLM+WKDP
Sbjct: 241 YPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSSYNGRQFDIGLMIWKDP 300

Query: 301 KHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAE 360
           KHGHWWLE G+GLLVGYWPAFLFSHLRSHASMVQFGGE+VNSRSSG HT TQMGSGHFA+
Sbjct: 301 KHGHWWLELGNGLLVGYWPAFLFSHLRSHASMVQFGGEVVNSRSSGAHTGTQMGSGHFAD 360

Query: 361 EGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGGPGRNV 420
           EGF KA+YFRNLQVVDWDNNLLPL NLH+LADH  CYDIRQ  NNVWGTYFYYGGPGRN 
Sbjct: 361 EGFEKAAYFRNLQVVDWDNNLLPLKNLHVLADHPACYDIRQGKNNVWGTYFYYGGPGRNP 420

Query: 421 KCP 423
           +CP
Sbjct: 421 RCP 420

BLAST of HG10020873 vs. TAIR 10
Match: AT1G23340.1 (Protein of Unknown Function (DUF239) )

HSP 1 Score: 626.7 bits (1615), Expect = 1.3e-179
Identity = 295/416 (70.91%), Postives = 340/416 (81.73%), Query Frame = 0

Query: 7   SSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKI 66
           SSSSSC    F++ L LF+S++S  S S S  +P        P +E++K+K +R  L+KI
Sbjct: 2   SSSSSCLFFTFILLLSLFSSYASP-SNSTSETVP------LRPQREIQKMKLIRKQLQKI 61

Query: 67  NKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRGNKSMEEVAENFQLW 126
           NKP IKTI SSDGD IDCV SH QPAFDHP L+GQ P++PPE P G     E  ENFQLW
Sbjct: 62  NKPAIKTIHSSDGDTIDCVPSHHQPAFDHPLLQGQRPMDPPEMPIGYSQENESHENFQLW 121

Query: 127 SVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQY 186
           S+ G+ CPEGTIPIRRTTE+D+ RA+S RRFGRK IR VRRDSS NGHEHAV +V+G QY
Sbjct: 122 SLYGESCPEGTIPIRRTTEQDMLRANSVRRFGRK-IRRVRRDSSSNGHEHAVGYVSGSQY 181

Query: 187 YGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTY 246
           YGAKAS+N+W PRV  QYEFSLSQIW+I+GSF  DLNTIEAGWQ+SPELYGD NPRFFTY
Sbjct: 182 YGAKASINVWTPRVISQYEFSLSQIWIIAGSFAGDLNTIEAGWQISPELYGDTNPRFFTY 241

Query: 247 WTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWL 306
           WT+DAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY G QFDI L++WKDPKHGHWWL
Sbjct: 242 WTSDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYKGGQFDISLLIWKDPKHGHWWL 301

Query: 307 EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKAS 366
           ++GSG LVGYWP  LF+HLR H +MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKAS
Sbjct: 302 QFGSGTLVGYWPVSLFTHLREHGNMVQFGGEIVNTRPGGSHTSTQMGSGHFAGEGFGKAS 361

Query: 367 YFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGGPGRNVKCP 423
           YFRNLQ+VDWDN L+P++NL +LADH +CYDIR   N VWG +FYYGGPG+N KCP
Sbjct: 362 YFRNLQMVDWDNTLIPISNLKVLADHPNCYDIRGGVNRVWGNFFYYGGPGKNSKCP 409

BLAST of HG10020873 vs. TAIR 10
Match: AT1G23340.2 (Protein of Unknown Function (DUF239) )

HSP 1 Score: 626.7 bits (1615), Expect = 1.3e-179
Identity = 295/416 (70.91%), Postives = 340/416 (81.73%), Query Frame = 0

Query: 7   SSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKI 66
           SSSSSC    F++ L LF+S++S  S S S  +P        P +E++K+K +R  L+KI
Sbjct: 2   SSSSSCLFFTFILLLSLFSSYASP-SNSTSETVP------LRPQREIQKMKLIRKQLQKI 61

Query: 67  NKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRGNKSMEEVAENFQLW 126
           NKP IKTI SSDGD IDCV SH QPAFDHP L+GQ P++PPE P G     E  ENFQLW
Sbjct: 62  NKPAIKTIHSSDGDTIDCVPSHHQPAFDHPLLQGQRPMDPPEMPIGYSQENESHENFQLW 121

Query: 127 SVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQY 186
           S+ G+ CPEGTIPIRRTTE+D+ RA+S RRFGRK IR VRRDSS NGHEHAV +V+G QY
Sbjct: 122 SLYGESCPEGTIPIRRTTEQDMLRANSVRRFGRK-IRRVRRDSSSNGHEHAVGYVSGSQY 181

Query: 187 YGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTY 246
           YGAKAS+N+W PRV  QYEFSLSQIW+I+GSF  DLNTIEAGWQ+SPELYGD NPRFFTY
Sbjct: 182 YGAKASINVWTPRVISQYEFSLSQIWIIAGSFAGDLNTIEAGWQISPELYGDTNPRFFTY 241

Query: 247 WTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWL 306
           WT+DAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY G QFDI L++WKDPKHGHWWL
Sbjct: 242 WTSDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYKGGQFDISLLIWKDPKHGHWWL 301

Query: 307 EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKAS 366
           ++GSG LVGYWP  LF+HLR H +MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKAS
Sbjct: 302 QFGSGTLVGYWPVSLFTHLREHGNMVQFGGEIVNTRPGGSHTSTQMGSGHFAGEGFGKAS 361

Query: 367 YFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGGPGRNVKCP 423
           YFRNLQ+VDWDN L+P++NL +LADH +CYDIR   N VWG +FYYGGPG+N KCP
Sbjct: 362 YFRNLQMVDWDNTLIPISNLKVLADHPNCYDIRGGVNRVWGNFFYYGGPGKNSKCP 409

BLAST of HG10020873 vs. TAIR 10
Match: AT1G10750.1 (Protein of Unknown Function (DUF239) )

HSP 1 Score: 624.8 bits (1610), Expect = 5.1e-179
Identity = 290/410 (70.73%), Postives = 340/410 (82.93%), Query Frame = 0

Query: 13  SCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIK 72
           +CS  ++FL L    SS FS+ +S  + P+NQT   P  EL KLK +  +LRKINKP IK
Sbjct: 61  TCSSNILFLSLLL-LSSSFSSVLSENLSPRNQT-LRPLDELNKLKAINQHLRKINKPSIK 120

Query: 73  TIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRGNKSMEEVAENFQLWSVSGDF 132
           TI S DGD+IDCVL H QPAFDHP L+GQ PL+PPERPRG+       ++FQLW + G+ 
Sbjct: 121 TIHSPDGDIIDCVLLHHQPAFDHPSLRGQKPLDPPERPRGHNRRGLRPKSFQLWGMEGET 180

Query: 133 CPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKAS 192
           CPEGT+PIRRT E+DI RA+S   FG+K +RH RRD+S NGHEHAV +V+GE+YYGAKAS
Sbjct: 181 CPEGTVPIRRTKEEDILRANSVSSFGKK-LRHYRRDTSSNGHEHAVGYVSGEKYYGAKAS 240

Query: 193 LNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAY 252
           +N+WAP+V +QYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDN PRFFTYWT DAY
Sbjct: 241 INVWAPQVQNQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNYPRFFTYWTNDAY 300

Query: 253 QATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSGL 312
           QATGCYNLLCSGFVQTN+ IAIGAAISP SSY G QFDI L++WKDPKHG+WWLE+GSG+
Sbjct: 301 QATGCYNLLCSGFVQTNSEIAIGAAISPSSSYKGGQFDITLLIWKDPKHGNWWLEFGSGI 360

Query: 313 LVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQ 372
           LVGYWP+FLF+HL+ HASMVQ+GGEIVNS   G HT+TQMGSGHFAEEGF K+SYFRN+Q
Sbjct: 361 LVGYWPSFLFTHLKEHASMVQYGGEIVNSSPFGAHTSTQMGSGHFAEEGFTKSSYFRNIQ 420

Query: 373 VVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGGPGRNVKCP 423
           VVDWDNNL+P  NL +LADH +CYDI+  SN  WG+YFYYGGPG+N KCP
Sbjct: 421 VVDWDNNLVPSPNLRVLADHPNCYDIQGGSNRAWGSYFYYGGPGKNPKCP 467

BLAST of HG10020873 vs. TAIR 10
Match: AT1G70550.2 (Protein of Unknown Function (DUF239) )

HSP 1 Score: 610.5 bits (1573), Expect = 1.0e-174
Identity = 285/408 (69.85%), Postives = 330/408 (80.88%), Query Frame = 0

Query: 15  SCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTI 74
           S F+  +LL    SS FS++ S            P +EL+KL  +R  L KINKP +KTI
Sbjct: 4   SSFLRLILLLCLVSSSFSSTTSSSNSTAADQTLRPQEELQKLTLIRQELDKINKPAVKTI 63

Query: 75  RSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRGNKSMEEVAENFQLWSVSGDFCP 134
           +SSDGD IDCV +H QPAFDHP L+GQ PL+PPE P+G    +   EN QLWS+SG+ CP
Sbjct: 64  QSSDGDKIDCVSTHQQPAFDHPLLQGQKPLDPPEIPKGYSEDDGSYENSQLWSLSGESCP 123

Query: 135 EGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLN 194
           EGTIPIRRTTE+D+ RASS +RFGRK IR V+RDS+ NGHEHAV +V G QYYGAKAS+N
Sbjct: 124 EGTIPIRRTTEQDMLRASSVQRFGRK-IRRVKRDSTNNGHEHAVGYVTGRQYYGAKASIN 183

Query: 195 IWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQA 254
           +W+PRVT QYEFSLSQIWVI+GSF +DLNTIEAGWQ+SPELYGD  PRFFTYWT+DAY+ 
Sbjct: 184 VWSPRVTSQYEFSLSQIWVIAGSFTHDLNTIEAGWQISPELYGDTYPRFFTYWTSDAYRT 243

Query: 255 TGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSGLLV 314
           TGCYNLLCSGFVQTN RIAIGAAISP SSY G QFDI L++WKDPKHGHWWL++GSG LV
Sbjct: 244 TGCYNLLCSGFVQTNRRIAIGAAISPRSSYKGGQFDISLLIWKDPKHGHWWLQFGSGALV 303

Query: 315 GYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVV 374
           GYWPAFLF+HL+ H SMVQFGGEIVN+R  G HT TQMGSGHFA EGFGKASYFRNLQ+V
Sbjct: 304 GYWPAFLFTHLKQHGSMVQFGGEIVNNRPGGSHTTTQMGSGHFAGEGFGKASYFRNLQIV 363

Query: 375 DWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGGPGRNVKCP 423
           DWDN L+P +NL +LADH +CYDIR  +N VWG YFYYGGPG+N +CP
Sbjct: 364 DWDNTLIPASNLKILADHPNCYDIRGGTNRVWGNYFYYGGPGKNPRCP 410

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038895687.11.0e-24296.43uncharacterized protein LOC120083860 isoform X1 [Benincasa hispida][more]
KAG6573073.16.8e-23491.67hypothetical protein SDJN03_26960, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022994621.13.4e-23391.41uncharacterized protein LOC111490277 [Cucurbita maxima][more]
XP_023542314.15.8e-23391.67uncharacterized protein LOC111802247 [Cucurbita pepo subsp. pepo][more]
XP_022954586.15.8e-23391.43uncharacterized protein LOC111456810 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1JZN81.6e-23391.41uncharacterized protein LOC111490277 OS=Cucurbita maxima OX=3661 GN=LOC111490277... [more]
A0A6J1GRA42.8e-23391.43uncharacterized protein LOC111456810 OS=Cucurbita moschata OX=3662 GN=LOC1114568... [more]
A0A1S3AXP95.3e-23291.12uncharacterized protein LOC103483723 OS=Cucumis melo OX=3656 GN=LOC103483723 PE=... [more]
A0A6J1KJ262.4e-22990.98uncharacterized protein LOC111495054 OS=Cucurbita maxima OX=3661 GN=LOC111495054... [more]
A0A0A0LTK32.4e-22990.74Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G046260 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G50150.12.9e-19075.41Protein of Unknown Function (DUF239) [more]
AT1G23340.11.3e-17970.91Protein of Unknown Function (DUF239) [more]
AT1G23340.21.3e-17970.91Protein of Unknown Function (DUF239) [more]
AT1G10750.15.1e-17970.73Protein of Unknown Function (DUF239) [more]
AT1G70550.21.0e-17469.85Protein of Unknown Function (DUF239) [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004314NeprosinPFAMPF03080Neprosincoord: 193..415
e-value: 2.3E-89
score: 298.6
NoneNo IPR availableGENE3D3.90.1320.10coord: 196..309
e-value: 1.9E-16
score: 62.0
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 94..115
NoneNo IPR availablePANTHERPTHR31589:SF154SUBFAMILY NOT NAMEDcoord: 10..422
NoneNo IPR availablePANTHERPTHR31589PROTEIN, PUTATIVE (DUF239)-RELATED-RELATEDcoord: 10..422
IPR025521Neprosin activation peptidePFAMPF14365Neprosin_APcoord: 73..179
e-value: 1.1E-35
score: 122.8

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10020873.1HG10020873.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane