HG10013707 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10013707
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionUnknown protein
LocationChr02: 3998396 .. 4000198 (+)
RNA-Seq ExpressionHG10013707
SyntenyHG10013707
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCACTCCCTTGCAATTCCCTTTCTCTAAAACCCTAAACCCTTCATCTCCATTTCTCCACTCGACCTCCTTCTCACCATTTTCCAATCCTCTTCTTCAAACCATAACCCTAACCCTAAAATCCCATCAAACTCATAAACCCCTTTCCATTGTTTCCGGTCACCCAAATCCTTCCCTTCTTTCGATCTCCCGCCAAATTTCGCATTTCTCATTCGCAAACGCCCGTCGGGACATTCGTACACACGCCGGCCGGAGCAAGAAGAAGGGTGGAGGGCCCTCTCCCGGTAGGATAGAAGGCAACGCCGAGTTCCGACGGAAATTGAGGCATAATGCCCGCCGGAAAAGCCAGAAGCTCGCCGAGTCCCATTTCTACCGCCGCAAGAAGTCGAACAGCAATACAGCGGATAACTTCAGTGAGGATGAGCTTCAGCAGATCGGCCTCGGCTACGATCGGATGGTCCGATTCATGGAGAAAGACGACCCGAACTTACGCCATCCCTACGACTGGTACAAGTACGGCGAGTTCGGCCCGTACTCGTGGCGTGGAGTCGTCGTCGGCGAGCCTATTCGTGGGCGGTTCACGGATGAGCGAGTTACGATTATCAGCGAGGTTAAGGATCATGAGGAGTGGGAGAAGATCGAGCAATCAGAAATGGCTTCTGATTTCAGCGAAGGATTGCAGCGGATGGACAAGAGCAAAGGGCTTCGGTATTTTTGGGTGTTCGTGAGACACCCGCGGTGGAGGATTTCGGAGCTTCCCTGGCAGCAGTGGACTTTGATTGCAGAGGTTGTAGTTGAAGCTGGTAAAGAAAGGTTAGATAAATGGAGCTTAATGGGTCGACTTGGAAATAAGTCAAGAAAAAATATAACTCAATGTGCAGCTTGGATGAGACCTGATATCATATATGTGAAAAAGCCTGTTTACCAATGCAGATTTGAGCCCTCGGATGAGTTTTTCCAGGCAATAATGCCATTTCTTGATCCCAAAACAGAGCAAGATTTTCTGTTTGAGTTGCAGGATGATGAAGGAGATATTGAATGGGTGACTTATTTTGGTGGGTTGTGTAAGATTGTGAGGATAAATCCAAAGGCATTTGTGGATGATGTGGTGAATGCTTATGAGAAGCTAAGTGATGAGAAGAAATCCAAGTGTTTGGAGTTTCTTTTGACTAACCACCCTGTGCCATTGCTGCATCCATATACAAAAGAGTGGAAGGCTAAGTTGGAGGAAGAGGAGTTGGGTTGTGATGCCCCGGACGACATCGAGAATCGAGGTGGTGAGGAAAATGTGATCACGGAGTGGATTGAGACTGATGATGACAATGAAGAAGAGTATGAGGATCAGCCTGAGGAGGATGTCGTAATGGAGACCGAGAACGAGGACGAGGATGAGGATGATAAACGAGAGGATGGAAATGAGGAAGAAGATGAGAATTATTGGGATGAAAGGTTCAGGAAGGCAATAAGTAGTCCAGAAGAACTGGAGAAGCTGTTTAAACGCAGTGCAGAAGTGACTGATGAATTTTATGAGAAGGAGAAGGAGAAGGAGAAAGTGGGAAGTAGAAGGGCCACAGCCATGGAAGATGGGAGTGAAACAGAAATGAGAGGGAAGAGAGCAAAAGTGAGACCAGAAGAATGGGAGTATATTGGGTATGGGCCATGGAGGAAGAAGATAAAGAAAAGTCAGATTCCTCCAGAGCTGTTTTTGAGATCTACAGTAAGGCCATTCACTTACAGAAACCTTGTGAAGGAAATTGTATTGACAAGGCATGCTATTTTGGATGGTGTAATTGGGGTATGA

mRNA sequence

ATGGCCACTCCCTTGCAATTCCCTTTCTCTAAAACCCTAAACCCTTCATCTCCATTTCTCCACTCGACCTCCTTCTCACCATTTTCCAATCCTCTTCTTCAAACCATAACCCTAACCCTAAAATCCCATCAAACTCATAAACCCCTTTCCATTGTTTCCGGTCACCCAAATCCTTCCCTTCTTTCGATCTCCCGCCAAATTTCGCATTTCTCATTCGCAAACGCCCGTCGGGACATTCGTACACACGCCGGCCGGAGCAAGAAGAAGGGTGGAGGGCCCTCTCCCGGTAGGATAGAAGGCAACGCCGAGTTCCGACGGAAATTGAGGCATAATGCCCGCCGGAAAAGCCAGAAGCTCGCCGAGTCCCATTTCTACCGCCGCAAGAAGTCGAACAGCAATACAGCGGATAACTTCAGTGAGGATGAGCTTCAGCAGATCGGCCTCGGCTACGATCGGATGGTCCGATTCATGGAGAAAGACGACCCGAACTTACGCCATCCCTACGACTGGTACAAGTACGGCGAGTTCGGCCCGTACTCGTGGCGTGGAGTCGTCGTCGGCGAGCCTATTCGTGGGCGGTTCACGGATGAGCGAGTTACGATTATCAGCGAGGTTAAGGATCATGAGGAGTGGGAGAAGATCGAGCAATCAGAAATGGCTTCTGATTTCAGCGAAGGATTGCAGCGGATGGACAAGAGCAAAGGGCTTCGGTATTTTTGGGTGTTCGTGAGACACCCGCGGTGGAGGATTTCGGAGCTTCCCTGGCAGCAGTGGACTTTGATTGCAGAGGTTGTAGTTGAAGCTGGTAAAGAAAGGTTAGATAAATGGAGCTTAATGGGTCGACTTGGAAATAAGTCAAGAAAAAATATAACTCAATGTGCAGCTTGGATGAGACCTGATATCATATATGTGAAAAAGCCTGTTTACCAATGCAGATTTGAGCCCTCGGATGAGTTTTTCCAGGCAATAATGCCATTTCTTGATCCCAAAACAGAGCAAGATTTTCTGTTTGAGTTGCAGGATGATGAAGGAGATATTGAATGGGTGACTTATTTTGGTGGGTTGTGTAAGATTGTGAGGATAAATCCAAAGGCATTTGTGGATGATGTGGTGAATGCTTATGAGAAGCTAAGTGATGAGAAGAAATCCAAGTGTTTGGAGTTTCTTTTGACTAACCACCCTGTGCCATTGCTGCATCCATATACAAAAGAGTGGAAGGCTAAGTTGGAGGAAGAGGAGTTGGGTTGTGATGCCCCGGACGACATCGAGAATCGAGGTGGTGAGGAAAATGTGATCACGGAGTGGATTGAGACTGATGATGACAATGAAGAAGAGTATGAGGATCAGCCTGAGGAGGATGTCGTAATGGAGACCGAGAACGAGGACGAGGATGAGGATGATAAACGAGAGGATGGAAATGAGGAAGAAGATGAGAATTATTGGGATGAAAGGTTCAGGAAGGCAATAAGTAGTCCAGAAGAACTGGAGAAGCTGTTTAAACGCAGTGCAGAAGTGACTGATGAATTTTATGAGAAGGAGAAGGAGAAGGAGAAAGTGGGAAGTAGAAGGGCCACAGCCATGGAAGATGGGAGTGAAACAGAAATGAGAGGGAAGAGAGCAAAAGTGAGACCAGAAGAATGGGAGTATATTGGGTATGGGCCATGGAGGAAGAAGATAAAGAAAAGTCAGATTCCTCCAGAGCTGTTTTTGAGATCTACAGTAAGGCCATTCACTTACAGAAACCTTGTGAAGGAAATTGTATTGACAAGGCATGCTATTTTGGATGGTGTAATTGGGGTATGA

Coding sequence (CDS)

ATGGCCACTCCCTTGCAATTCCCTTTCTCTAAAACCCTAAACCCTTCATCTCCATTTCTCCACTCGACCTCCTTCTCACCATTTTCCAATCCTCTTCTTCAAACCATAACCCTAACCCTAAAATCCCATCAAACTCATAAACCCCTTTCCATTGTTTCCGGTCACCCAAATCCTTCCCTTCTTTCGATCTCCCGCCAAATTTCGCATTTCTCATTCGCAAACGCCCGTCGGGACATTCGTACACACGCCGGCCGGAGCAAGAAGAAGGGTGGAGGGCCCTCTCCCGGTAGGATAGAAGGCAACGCCGAGTTCCGACGGAAATTGAGGCATAATGCCCGCCGGAAAAGCCAGAAGCTCGCCGAGTCCCATTTCTACCGCCGCAAGAAGTCGAACAGCAATACAGCGGATAACTTCAGTGAGGATGAGCTTCAGCAGATCGGCCTCGGCTACGATCGGATGGTCCGATTCATGGAGAAAGACGACCCGAACTTACGCCATCCCTACGACTGGTACAAGTACGGCGAGTTCGGCCCGTACTCGTGGCGTGGAGTCGTCGTCGGCGAGCCTATTCGTGGGCGGTTCACGGATGAGCGAGTTACGATTATCAGCGAGGTTAAGGATCATGAGGAGTGGGAGAAGATCGAGCAATCAGAAATGGCTTCTGATTTCAGCGAAGGATTGCAGCGGATGGACAAGAGCAAAGGGCTTCGGTATTTTTGGGTGTTCGTGAGACACCCGCGGTGGAGGATTTCGGAGCTTCCCTGGCAGCAGTGGACTTTGATTGCAGAGGTTGTAGTTGAAGCTGGTAAAGAAAGGTTAGATAAATGGAGCTTAATGGGTCGACTTGGAAATAAGTCAAGAAAAAATATAACTCAATGTGCAGCTTGGATGAGACCTGATATCATATATGTGAAAAAGCCTGTTTACCAATGCAGATTTGAGCCCTCGGATGAGTTTTTCCAGGCAATAATGCCATTTCTTGATCCCAAAACAGAGCAAGATTTTCTGTTTGAGTTGCAGGATGATGAAGGAGATATTGAATGGGTGACTTATTTTGGTGGGTTGTGTAAGATTGTGAGGATAAATCCAAAGGCATTTGTGGATGATGTGGTGAATGCTTATGAGAAGCTAAGTGATGAGAAGAAATCCAAGTGTTTGGAGTTTCTTTTGACTAACCACCCTGTGCCATTGCTGCATCCATATACAAAAGAGTGGAAGGCTAAGTTGGAGGAAGAGGAGTTGGGTTGTGATGCCCCGGACGACATCGAGAATCGAGGTGGTGAGGAAAATGTGATCACGGAGTGGATTGAGACTGATGATGACAATGAAGAAGAGTATGAGGATCAGCCTGAGGAGGATGTCGTAATGGAGACCGAGAACGAGGACGAGGATGAGGATGATAAACGAGAGGATGGAAATGAGGAAGAAGATGAGAATTATTGGGATGAAAGGTTCAGGAAGGCAATAAGTAGTCCAGAAGAACTGGAGAAGCTGTTTAAACGCAGTGCAGAAGTGACTGATGAATTTTATGAGAAGGAGAAGGAGAAGGAGAAAGTGGGAAGTAGAAGGGCCACAGCCATGGAAGATGGGAGTGAAACAGAAATGAGAGGGAAGAGAGCAAAAGTGAGACCAGAAGAATGGGAGTATATTGGGTATGGGCCATGGAGGAAGAAGATAAAGAAAAGTCAGATTCCTCCAGAGCTGTTTTTGAGATCTACAGTAAGGCCATTCACTTACAGAAACCTTGTGAAGGAAATTGTATTGACAAGGCATGCTATTTTGGATGGTGTAATTGGGGTATGA

Protein sequence

MATPLQFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTHKPLSIVSGHPNPSLLSISRQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLAESHFYRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYFWVFVRHPRWRISELPWQQWTLIAEVVVEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAPDDIENRGGEENVITEWIETDDDNEEEYEDQPEEDVVMETENEDEDEDDKREDGNEEEDENYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSETEMRGKRAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGVIGV
Homology
BLAST of HG10013707 vs. NCBI nr
Match: XP_038898752.1 (uncharacterized protein LOC120086270 [Benincasa hispida])

HSP 1 Score: 1065.1 bits (2753), Expect = 2.3e-307
Identity = 551/605 (91.07%), Postives = 565/605 (93.39%), Query Frame = 0

Query: 1   MATPLQFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTHKPLSIVSGHPNPSL 60
           MAT  QFP SKTLN SS FLHSTS SPF +PLLQ  TLTLKSHQTHKPLSI SG PNPS 
Sbjct: 1   MATS-QFPLSKTLNLSSSFLHSTSLSPFFHPLLQ--TLTLKSHQTHKPLSIRSGPPNPSF 60

Query: 61  LSISRQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLA 120
           L ISRQISH  FAN+ R+IRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLA
Sbjct: 61  LPISRQISHLQFANSHRNIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLA 120

Query: 121 ESHFYRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYS 180
           ESHFYRRKK NSN ADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYS
Sbjct: 121 ESHFYRRKKPNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYS 180

Query: 181 WRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYFW 240
           WRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFS+GL RMDKSKG RYFW
Sbjct: 181 WRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSQGLLRMDKSKGFRYFW 240

Query: 241 VFVRHPRWRISELPWQQWTLIAEVVVEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPD 300
           VFVRHPRWRISELPWQQWTLIAEVV+EAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPD
Sbjct: 241 VFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPD 300

Query: 301 IIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDE-GDIEWVTYFGGLCKIV 360
           IIYVKKPVYQCRFEP DEFFQAIMPFLDPKTEQDFLFELQDDE GD+EWVTYF GLCKIV
Sbjct: 301 IIYVKKPVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGGDVEWVTYFAGLCKIV 360

Query: 361 RINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP 420
           R+NPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP
Sbjct: 361 RVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP 420

Query: 421 DDIENRGGEENVITEWIETDDDNEEEY-EDQPEEDVVMET--ENEDEDEDDKREDGN--E 480
           DDIE R G+ENVITEWIETDDDN E+Y EDQPEE+VVMET  E+EDEDEDDKREDGN  E
Sbjct: 421 DDIEKRCGDENVITEWIETDDDNGEDYEEDQPEENVVMETEDEDEDEDEDDKREDGNQEE 480

Query: 481 EEDENYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSETE 540
           EEDE YWDERFRKAISSPEELEKLFK SAEV DEFY  EKEKE VGSRRATAMEDG ETE
Sbjct: 481 EEDEGYWDERFRKAISSPEELEKLFKHSAEVADEFY--EKEKESVGSRRATAMEDGDETE 540

Query: 541 MRGKRAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL 600
           +RGKRAKV+ EEWEYIGYGPWRKKIKKS+IPPELFLRSTVRPFTYRNLVKEIVLTRHAIL
Sbjct: 541 LRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL 600

BLAST of HG10013707 vs. NCBI nr
Match: XP_008463741.1 (PREDICTED: uncharacterized protein LOC103501814 [Cucumis melo] >KAA0066766.1 uncharacterized protein E6C27_scaffold271G001050 [Cucumis melo var. makuwa] >TYK27913.1 uncharacterized protein E5676_scaffold384G00980 [Cucumis melo var. makuwa])

HSP 1 Score: 1005.4 bits (2598), Expect = 2.1e-289
Identity = 522/607 (86.00%), Postives = 551/607 (90.77%), Query Frame = 0

Query: 1   MATPLQFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTH--KPLSIVSGHPNP 60
           MAT  QFP  KTLNPSSPFL+STS +PFSNPLLQ  TLTLKSHQTH  KPLSI+SG  NP
Sbjct: 1   MATS-QFPSPKTLNPSSPFLNSTSLTPFSNPLLQ--TLTLKSHQTHYYKPLSILSGPSNP 60

Query: 61  SLLSISRQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQK 120
                  QIS     ++R DIRTHAGRSKK  GGPSPGRIEGNAEFRRKLRHNARRKSQK
Sbjct: 61  ------YQISLLPSPHSRPDIRTHAGRSKKNPGGPSPGRIEGNAEFRRKLRHNARRKSQK 120

Query: 121 LAESHFYRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGP 180
           LAESHFYRRKK NSN ADNFSEDELQQIGLGYDRMVRF+EKDDPNLRHPYDWYKYGEFGP
Sbjct: 121 LAESHFYRRKKPNSNYADNFSEDELQQIGLGYDRMVRFIEKDDPNLRHPYDWYKYGEFGP 180

Query: 181 YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRY 240
           YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMA+DFS GLQRMDKSKG RY
Sbjct: 181 YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRY 240

Query: 241 FWVFVRHPRWRISELPWQQWTLIAEVVVEAGKERLDKWSLMGRLGNKSRKNITQCAAWMR 300
           FWVFVRHPRWRISELPWQQWTLIAEVV+EAGKERLDKWSLMGRLGNKSRKNITQCAAWMR
Sbjct: 241 FWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMR 300

Query: 301 PDIIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKI 360
           PDIIYVKKPVYQCRFEP DEFFQA+MPFLDPKTEQDFLFELQDDEG++EWVTYFGGLCKI
Sbjct: 301 PDIIYVKKPVYQCRFEPQDEFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKI 360

Query: 361 VRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDA 420
           VRI+PKAFVDDVVNAYEKLSDEKKS CLEFLL+NHPVPLLHPYTKEWKAKLEEEELGCDA
Sbjct: 361 VRISPKAFVDDVVNAYEKLSDEKKSICLEFLLSNHPVPLLHPYTKEWKAKLEEEELGCDA 420

Query: 421 PDDIENRGGEENVITEWIETDDDNEEEYEDQPEEDVVME--TENEDEDEDDKREDGN--- 480
           PD++ENR  ++NVITEWIET  DNEEEYEDQPEED+VME   E++D+++DD+RE+GN   
Sbjct: 421 PDEMENRRRDDNVITEWIET--DNEEEYEDQPEEDIVMEDMDEDKDDEDDDEREEGNQEE 480

Query: 481 EEEDENYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSET 540
           EEEDE+YWDERFRKAISSPEELEKLFKRS E+ DE Y    EKE VG RRATAM+DG E 
Sbjct: 481 EEEDESYWDERFRKAISSPEELEKLFKRSGEMADELY----EKENVGRRRATAMKDGDEM 540

Query: 541 EMRGKRAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI 600
           EMRGKR KV+ EEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI
Sbjct: 541 EMRGKRPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI 592

BLAST of HG10013707 vs. NCBI nr
Match: XP_022976454.1 (uncharacterized protein LOC111476853 [Cucurbita maxima])

HSP 1 Score: 1001.1 bits (2587), Expect = 4.0e-288
Identity = 514/604 (85.10%), Postives = 549/604 (90.89%), Query Frame = 0

Query: 1   MATPLQFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTHKPLSIVSGHPNPSL 60
           MAT  QFP  KTLNPSSPFLHSTS +PFSNPLLQT+TLTLKSH+T KPLSI+SG PN S+
Sbjct: 1   MATS-QFPLCKTLNPSSPFLHSTSLTPFSNPLLQTLTLTLKSHKTRKPLSIISGLPNASV 60

Query: 61  LSISRQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLA 120
           L I RQIS F FAN+R DIRT AGRSKKKGGG SPGRIEGNAEFRRKLR+N RRKSQK A
Sbjct: 61  LPIFRQISQFPFANSRPDIRTFAGRSKKKGGGTSPGRIEGNAEFRRKLRNNVRRKSQKPA 120

Query: 121 ESHFYRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYS 180
           ESHFYRRK SNSN ADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYS
Sbjct: 121 ESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPHDWYKYGEFGPYS 180

Query: 181 WRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYFW 240
           WRGVV+GEPIRGRFTDERVT+I EVKDHEEWEKIEQSEMASDFSEGLQRMD++KG R+FW
Sbjct: 181 WRGVVIGEPIRGRFTDERVTMIREVKDHEEWEKIEQSEMASDFSEGLQRMDRNKGFRHFW 240

Query: 241 VFVRHPRWRISELPWQQWTLIAEVVVEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRP 300
           VFVRHPRWRISELPWQQWTLIAEVV+EAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRP
Sbjct: 241 VFVRHPRWRISELPWQQWTLIAEVVLEAGKEERLDKWSLMGRLGNKSRKNITQCAAWMRP 300

Query: 301 DIIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKIV 360
           DIIYVKKPVYQCRFEP  EFFQA+MPFLDPKTEQD LFELQDDEG++EWVTYFGGLCKI+
Sbjct: 301 DIIYVKKPVYQCRFEPQAEFFQALMPFLDPKTEQDVLFELQDDEGNVEWVTYFGGLCKIL 360

Query: 361 RINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP 420
           R+NPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP
Sbjct: 361 RVNPKAFVDDVANAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP 420

Query: 421 DDIE---NRGGEENVITEWIETDDDNEEEYEDQPEEDVVMETENEDEDEDDKREDGNEEE 480
           DD +   NR  +ENVI EWIETDDDN+ +YED+  EDVVMET  E EDE+D  E  NEEE
Sbjct: 421 DDDDDNKNRPSDENVIMEWIETDDDNDHDYEDE-AEDVVMETNEEAEDEEDGGEHQNEEE 480

Query: 481 DENYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSETEMR 540
           DE+YWDERFRKAISSPEELEKL KRS E +DEFYEK+K +  +GSR+A   +DG ETE+R
Sbjct: 481 DEDYWDERFRKAISSPEELEKLLKRSEEASDEFYEKQKGR-NMGSRKAME-DDGDETELR 540

Query: 541 GKRAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG 600
           GKRAKV+PEEWE IGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+G
Sbjct: 541 GKRAKVKPEEWERIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILEG 600

BLAST of HG10013707 vs. NCBI nr
Match: XP_004146025.1 (uncharacterized protein LOC101207599 [Cucumis sativus] >KGN55067.1 hypothetical protein Csa_012426 [Cucumis sativus])

HSP 1 Score: 993.8 bits (2568), Expect = 6.4e-286
Identity = 514/602 (85.38%), Postives = 545/602 (90.53%), Query Frame = 0

Query: 7   FPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTH--KPLSIVSGHPNPSLLSIS 66
           FP  KTLNPSSPFL+STS +PFSNPLLQ  TLTLK H TH  KPLSI+SG      +S  
Sbjct: 6   FPPPKTLNPSSPFLNSTSLTPFSNPLLQ--TLTLKPHHTHYYKPLSIISG------ISYP 65

Query: 67  RQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLAESHF 126
            QIS FS    R DIRTHAGRSKKK GGPSPGRIEGNA+FRRKLR NARRK+QKLAESHF
Sbjct: 66  YQISLFS----RPDIRTHAGRSKKKPGGPSPGRIEGNADFRRKLRDNARRKTQKLAESHF 125

Query: 127 YRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGV 186
           YRRKKSN N ADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGV
Sbjct: 126 YRRKKSNRNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGV 185

Query: 187 VVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYFWVFVR 246
           VVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMA+DFS GLQRMDKSKG RYFWVFVR
Sbjct: 186 VVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVR 245

Query: 247 HPRWRISELPWQQWTLIAEVVVEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV 306
           HPRWRISELPWQQWTLIAEVV+E+GKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV
Sbjct: 246 HPRWRISELPWQQWTLIAEVVLESGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV 305

Query: 307 KKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKIVRINPK 366
           KKPVYQCRFEP DEFFQA+MPFLDPKTEQDFLFELQDDEG++EWVTYFGGLCKIVRINPK
Sbjct: 306 KKPVYQCRFEPQDEFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPK 365

Query: 367 AFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAPDDIEN 426
           AF+DDVVNAYEKLSDEKKSKCLEFLL+NHPVPLLHPYTKEWKAKLEEEELGCDAPD++EN
Sbjct: 366 AFIDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKAKLEEEELGCDAPDEMEN 425

Query: 427 RGGEENVITEWIETDDDNEEEYEDQPEEDVVM----ETENEDEDEDDKREDGN--EEEDE 486
           R  ++NVITEWIET  DNEEEYE+QP+ED+VM    E E+EDE++DD++E+GN  EEEDE
Sbjct: 426 RRRDDNVITEWIET--DNEEEYEEQPKEDIVMEDMDEDEDEDEEDDDEQEEGNQEEEEDE 485

Query: 487 NYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSETEMRGK 546
            YWDERFRKAISSPEELEKLFKRS E+ DE Y    EKE VG RRATAM+DG E EMRGK
Sbjct: 486 GYWDERFRKAISSPEELEKLFKRSGEMADELY----EKENVGRRRATAMKDGDEVEMRGK 545

Query: 547 RAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGVI 601
           + KV+ EEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG I
Sbjct: 546 KPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEI 589

BLAST of HG10013707 vs. NCBI nr
Match: XP_023535253.1 (uncharacterized protein LOC111796741 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 993.4 bits (2567), Expect = 8.4e-286
Identity = 514/602 (85.38%), Postives = 546/602 (90.70%), Query Frame = 0

Query: 1   MATPLQFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTHKPLSIVSGHPNPSL 60
           MAT  QFP  KTLNPSSPFL STS +PFSNPLLQ  TLTLKSH+T KPL+I+SG PN S+
Sbjct: 1   MATS-QFPLCKTLNPSSPFLPSTSLTPFSNPLLQ--TLTLKSHKTRKPLTIISGLPNASV 60

Query: 61  LSISRQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLA 120
           L I RQIS F FAN+R DIRT AGRSKKKGGGPSPGRIEGNAEFRRKLR+N RRKSQK A
Sbjct: 61  LPIFRQISQFPFANSRPDIRTCAGRSKKKGGGPSPGRIEGNAEFRRKLRNNVRRKSQKPA 120

Query: 121 ESHFYRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYS 180
           ESHFYRRK SNSN ADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYS
Sbjct: 121 ESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPHDWYKYGEFGPYS 180

Query: 181 WRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYFW 240
           WRGVV+GEPIRGRFTDERVT+I EVKDHEEWEKIEQSEMASDFSEGLQRMD+SKG ++FW
Sbjct: 181 WRGVVIGEPIRGRFTDERVTMIREVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFQHFW 240

Query: 241 VFVRHPRWRISELPWQQWTLIAEVVVEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRP 300
           VFVRHPRWRISELPWQQWTLIAEVV+EAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRP
Sbjct: 241 VFVRHPRWRISELPWQQWTLIAEVVLEAGKEERLDKWSLMGRLGNKSRKNITQCAAWMRP 300

Query: 301 DIIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKIV 360
           DIIYVKKPVYQCRFEP  EFFQA+MPFLDPKTEQD LFELQDDEG++EWVTYFGGLCKI+
Sbjct: 301 DIIYVKKPVYQCRFEPQAEFFQALMPFLDPKTEQDVLFELQDDEGNVEWVTYFGGLCKIL 360

Query: 361 RINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP 420
           R+NPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP
Sbjct: 361 RVNPKAFVDDVANAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP 420

Query: 421 -DDIENRGGEENVITEWIETDDDNEEEYEDQPEEDVVMETENEDEDEDDKREDGNEEEDE 480
            DD ENR  +ENVI EWIETDDDN+ +YED+  EDVVMET  E EDE+D  E  NEEEDE
Sbjct: 421 DDDSENRPSDENVIMEWIETDDDNDHDYEDE-AEDVVMETNEEAEDEEDGGEHQNEEEDE 480

Query: 481 NYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSETEMRGK 540
           +YWDERFRKAISSPEELEKL KRS E +DEFYEK+K +   GSR+A   EDG ETE+RGK
Sbjct: 481 DYWDERFRKAISSPEELEKLLKRSEEASDEFYEKQKGR-NAGSRKAME-EDGDETELRGK 540

Query: 541 RAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGVI 600
           RAKV+PEEWE IGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+G I
Sbjct: 541 RAKVKPEEWERIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILEGEI 596

BLAST of HG10013707 vs. ExPASy TrEMBL
Match: A0A5A7VK56 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold384G00980 PE=4 SV=1)

HSP 1 Score: 1005.4 bits (2598), Expect = 1.0e-289
Identity = 522/607 (86.00%), Postives = 551/607 (90.77%), Query Frame = 0

Query: 1   MATPLQFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTH--KPLSIVSGHPNP 60
           MAT  QFP  KTLNPSSPFL+STS +PFSNPLLQ  TLTLKSHQTH  KPLSI+SG  NP
Sbjct: 1   MATS-QFPSPKTLNPSSPFLNSTSLTPFSNPLLQ--TLTLKSHQTHYYKPLSILSGPSNP 60

Query: 61  SLLSISRQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQK 120
                  QIS     ++R DIRTHAGRSKK  GGPSPGRIEGNAEFRRKLRHNARRKSQK
Sbjct: 61  ------YQISLLPSPHSRPDIRTHAGRSKKNPGGPSPGRIEGNAEFRRKLRHNARRKSQK 120

Query: 121 LAESHFYRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGP 180
           LAESHFYRRKK NSN ADNFSEDELQQIGLGYDRMVRF+EKDDPNLRHPYDWYKYGEFGP
Sbjct: 121 LAESHFYRRKKPNSNYADNFSEDELQQIGLGYDRMVRFIEKDDPNLRHPYDWYKYGEFGP 180

Query: 181 YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRY 240
           YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMA+DFS GLQRMDKSKG RY
Sbjct: 181 YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRY 240

Query: 241 FWVFVRHPRWRISELPWQQWTLIAEVVVEAGKERLDKWSLMGRLGNKSRKNITQCAAWMR 300
           FWVFVRHPRWRISELPWQQWTLIAEVV+EAGKERLDKWSLMGRLGNKSRKNITQCAAWMR
Sbjct: 241 FWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMR 300

Query: 301 PDIIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKI 360
           PDIIYVKKPVYQCRFEP DEFFQA+MPFLDPKTEQDFLFELQDDEG++EWVTYFGGLCKI
Sbjct: 301 PDIIYVKKPVYQCRFEPQDEFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKI 360

Query: 361 VRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDA 420
           VRI+PKAFVDDVVNAYEKLSDEKKS CLEFLL+NHPVPLLHPYTKEWKAKLEEEELGCDA
Sbjct: 361 VRISPKAFVDDVVNAYEKLSDEKKSICLEFLLSNHPVPLLHPYTKEWKAKLEEEELGCDA 420

Query: 421 PDDIENRGGEENVITEWIETDDDNEEEYEDQPEEDVVME--TENEDEDEDDKREDGN--- 480
           PD++ENR  ++NVITEWIET  DNEEEYEDQPEED+VME   E++D+++DD+RE+GN   
Sbjct: 421 PDEMENRRRDDNVITEWIET--DNEEEYEDQPEEDIVMEDMDEDKDDEDDDEREEGNQEE 480

Query: 481 EEEDENYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSET 540
           EEEDE+YWDERFRKAISSPEELEKLFKRS E+ DE Y    EKE VG RRATAM+DG E 
Sbjct: 481 EEEDESYWDERFRKAISSPEELEKLFKRSGEMADELY----EKENVGRRRATAMKDGDEM 540

Query: 541 EMRGKRAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI 600
           EMRGKR KV+ EEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI
Sbjct: 541 EMRGKRPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI 592

BLAST of HG10013707 vs. ExPASy TrEMBL
Match: A0A1S3CKF2 (uncharacterized protein LOC103501814 OS=Cucumis melo OX=3656 GN=LOC103501814 PE=4 SV=1)

HSP 1 Score: 1005.4 bits (2598), Expect = 1.0e-289
Identity = 522/607 (86.00%), Postives = 551/607 (90.77%), Query Frame = 0

Query: 1   MATPLQFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTH--KPLSIVSGHPNP 60
           MAT  QFP  KTLNPSSPFL+STS +PFSNPLLQ  TLTLKSHQTH  KPLSI+SG  NP
Sbjct: 1   MATS-QFPSPKTLNPSSPFLNSTSLTPFSNPLLQ--TLTLKSHQTHYYKPLSILSGPSNP 60

Query: 61  SLLSISRQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQK 120
                  QIS     ++R DIRTHAGRSKK  GGPSPGRIEGNAEFRRKLRHNARRKSQK
Sbjct: 61  ------YQISLLPSPHSRPDIRTHAGRSKKNPGGPSPGRIEGNAEFRRKLRHNARRKSQK 120

Query: 121 LAESHFYRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGP 180
           LAESHFYRRKK NSN ADNFSEDELQQIGLGYDRMVRF+EKDDPNLRHPYDWYKYGEFGP
Sbjct: 121 LAESHFYRRKKPNSNYADNFSEDELQQIGLGYDRMVRFIEKDDPNLRHPYDWYKYGEFGP 180

Query: 181 YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRY 240
           YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMA+DFS GLQRMDKSKG RY
Sbjct: 181 YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRY 240

Query: 241 FWVFVRHPRWRISELPWQQWTLIAEVVVEAGKERLDKWSLMGRLGNKSRKNITQCAAWMR 300
           FWVFVRHPRWRISELPWQQWTLIAEVV+EAGKERLDKWSLMGRLGNKSRKNITQCAAWMR
Sbjct: 241 FWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMR 300

Query: 301 PDIIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKI 360
           PDIIYVKKPVYQCRFEP DEFFQA+MPFLDPKTEQDFLFELQDDEG++EWVTYFGGLCKI
Sbjct: 301 PDIIYVKKPVYQCRFEPQDEFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKI 360

Query: 361 VRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDA 420
           VRI+PKAFVDDVVNAYEKLSDEKKS CLEFLL+NHPVPLLHPYTKEWKAKLEEEELGCDA
Sbjct: 361 VRISPKAFVDDVVNAYEKLSDEKKSICLEFLLSNHPVPLLHPYTKEWKAKLEEEELGCDA 420

Query: 421 PDDIENRGGEENVITEWIETDDDNEEEYEDQPEEDVVME--TENEDEDEDDKREDGN--- 480
           PD++ENR  ++NVITEWIET  DNEEEYEDQPEED+VME   E++D+++DD+RE+GN   
Sbjct: 421 PDEMENRRRDDNVITEWIET--DNEEEYEDQPEEDIVMEDMDEDKDDEDDDEREEGNQEE 480

Query: 481 EEEDENYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSET 540
           EEEDE+YWDERFRKAISSPEELEKLFKRS E+ DE Y    EKE VG RRATAM+DG E 
Sbjct: 481 EEEDESYWDERFRKAISSPEELEKLFKRSGEMADELY----EKENVGRRRATAMKDGDEM 540

Query: 541 EMRGKRAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI 600
           EMRGKR KV+ EEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI
Sbjct: 541 EMRGKRPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI 592

BLAST of HG10013707 vs. ExPASy TrEMBL
Match: A0A6J1INI9 (uncharacterized protein LOC111476853 OS=Cucurbita maxima OX=3661 GN=LOC111476853 PE=4 SV=1)

HSP 1 Score: 1001.1 bits (2587), Expect = 1.9e-288
Identity = 514/604 (85.10%), Postives = 549/604 (90.89%), Query Frame = 0

Query: 1   MATPLQFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTHKPLSIVSGHPNPSL 60
           MAT  QFP  KTLNPSSPFLHSTS +PFSNPLLQT+TLTLKSH+T KPLSI+SG PN S+
Sbjct: 1   MATS-QFPLCKTLNPSSPFLHSTSLTPFSNPLLQTLTLTLKSHKTRKPLSIISGLPNASV 60

Query: 61  LSISRQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLA 120
           L I RQIS F FAN+R DIRT AGRSKKKGGG SPGRIEGNAEFRRKLR+N RRKSQK A
Sbjct: 61  LPIFRQISQFPFANSRPDIRTFAGRSKKKGGGTSPGRIEGNAEFRRKLRNNVRRKSQKPA 120

Query: 121 ESHFYRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYS 180
           ESHFYRRK SNSN ADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYS
Sbjct: 121 ESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPHDWYKYGEFGPYS 180

Query: 181 WRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYFW 240
           WRGVV+GEPIRGRFTDERVT+I EVKDHEEWEKIEQSEMASDFSEGLQRMD++KG R+FW
Sbjct: 181 WRGVVIGEPIRGRFTDERVTMIREVKDHEEWEKIEQSEMASDFSEGLQRMDRNKGFRHFW 240

Query: 241 VFVRHPRWRISELPWQQWTLIAEVVVEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRP 300
           VFVRHPRWRISELPWQQWTLIAEVV+EAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRP
Sbjct: 241 VFVRHPRWRISELPWQQWTLIAEVVLEAGKEERLDKWSLMGRLGNKSRKNITQCAAWMRP 300

Query: 301 DIIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKIV 360
           DIIYVKKPVYQCRFEP  EFFQA+MPFLDPKTEQD LFELQDDEG++EWVTYFGGLCKI+
Sbjct: 301 DIIYVKKPVYQCRFEPQAEFFQALMPFLDPKTEQDVLFELQDDEGNVEWVTYFGGLCKIL 360

Query: 361 RINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP 420
           R+NPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP
Sbjct: 361 RVNPKAFVDDVANAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP 420

Query: 421 DDIE---NRGGEENVITEWIETDDDNEEEYEDQPEEDVVMETENEDEDEDDKREDGNEEE 480
           DD +   NR  +ENVI EWIETDDDN+ +YED+  EDVVMET  E EDE+D  E  NEEE
Sbjct: 421 DDDDDNKNRPSDENVIMEWIETDDDNDHDYEDE-AEDVVMETNEEAEDEEDGGEHQNEEE 480

Query: 481 DENYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSETEMR 540
           DE+YWDERFRKAISSPEELEKL KRS E +DEFYEK+K +  +GSR+A   +DG ETE+R
Sbjct: 481 DEDYWDERFRKAISSPEELEKLLKRSEEASDEFYEKQKGR-NMGSRKAME-DDGDETELR 540

Query: 541 GKRAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG 600
           GKRAKV+PEEWE IGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+G
Sbjct: 541 GKRAKVKPEEWERIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILEG 600

BLAST of HG10013707 vs. ExPASy TrEMBL
Match: A0A0A0L3A4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G627170 PE=4 SV=1)

HSP 1 Score: 993.8 bits (2568), Expect = 3.1e-286
Identity = 514/602 (85.38%), Postives = 545/602 (90.53%), Query Frame = 0

Query: 7   FPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTH--KPLSIVSGHPNPSLLSIS 66
           FP  KTLNPSSPFL+STS +PFSNPLLQ  TLTLK H TH  KPLSI+SG      +S  
Sbjct: 6   FPPPKTLNPSSPFLNSTSLTPFSNPLLQ--TLTLKPHHTHYYKPLSIISG------ISYP 65

Query: 67  RQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLAESHF 126
            QIS FS    R DIRTHAGRSKKK GGPSPGRIEGNA+FRRKLR NARRK+QKLAESHF
Sbjct: 66  YQISLFS----RPDIRTHAGRSKKKPGGPSPGRIEGNADFRRKLRDNARRKTQKLAESHF 125

Query: 127 YRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGV 186
           YRRKKSN N ADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGV
Sbjct: 126 YRRKKSNRNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGV 185

Query: 187 VVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYFWVFVR 246
           VVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMA+DFS GLQRMDKSKG RYFWVFVR
Sbjct: 186 VVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVR 245

Query: 247 HPRWRISELPWQQWTLIAEVVVEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV 306
           HPRWRISELPWQQWTLIAEVV+E+GKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV
Sbjct: 246 HPRWRISELPWQQWTLIAEVVLESGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV 305

Query: 307 KKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKIVRINPK 366
           KKPVYQCRFEP DEFFQA+MPFLDPKTEQDFLFELQDDEG++EWVTYFGGLCKIVRINPK
Sbjct: 306 KKPVYQCRFEPQDEFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPK 365

Query: 367 AFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAPDDIEN 426
           AF+DDVVNAYEKLSDEKKSKCLEFLL+NHPVPLLHPYTKEWKAKLEEEELGCDAPD++EN
Sbjct: 366 AFIDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKAKLEEEELGCDAPDEMEN 425

Query: 427 RGGEENVITEWIETDDDNEEEYEDQPEEDVVM----ETENEDEDEDDKREDGN--EEEDE 486
           R  ++NVITEWIET  DNEEEYE+QP+ED+VM    E E+EDE++DD++E+GN  EEEDE
Sbjct: 426 RRRDDNVITEWIET--DNEEEYEEQPKEDIVMEDMDEDEDEDEEDDDEQEEGNQEEEEDE 485

Query: 487 NYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSETEMRGK 546
            YWDERFRKAISSPEELEKLFKRS E+ DE Y    EKE VG RRATAM+DG E EMRGK
Sbjct: 486 GYWDERFRKAISSPEELEKLFKRSGEMADELY----EKENVGRRRATAMKDGDEVEMRGK 545

Query: 547 RAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGVI 601
           + KV+ EEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG I
Sbjct: 546 KPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEI 589

BLAST of HG10013707 vs. ExPASy TrEMBL
Match: A0A6J1FAH0 (uncharacterized protein LOC111443567 OS=Cucurbita moschata OX=3662 GN=LOC111443567 PE=4 SV=1)

HSP 1 Score: 991.1 bits (2561), Expect = 2.0e-285
Identity = 511/597 (85.59%), Postives = 543/597 (90.95%), Query Frame = 0

Query: 6   QFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTHKPLSIVSGHPNPSLLSISR 65
           QFP  KTLNPSSPFL STS +PFSNPLLQ  TLTLKSHQT KPLSI+SG PN S+L I R
Sbjct: 5   QFPLCKTLNPSSPFLPSTSLTPFSNPLLQ--TLTLKSHQTRKPLSIISGLPNASVLPIFR 64

Query: 66  QISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLAESHFY 125
           QIS F FAN+R DIRT AGRSKKKGGGPSPGRIEGNAEFRRKLR+N RRKSQK AESHFY
Sbjct: 65  QISQFPFANSRPDIRTFAGRSKKKGGGPSPGRIEGNAEFRRKLRNNVRRKSQKPAESHFY 124

Query: 126 RRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVV 185
           RRK SNSN ADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV
Sbjct: 125 RRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPHDWYKYGEFGPYSWRGVV 184

Query: 186 VGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYFWVFVRH 245
           +GEPIRGRFTDERVT+I EVKDHEEWEKIEQSEMASDFSEGLQRMD+SKG R+FWVFVRH
Sbjct: 185 IGEPIRGRFTDERVTMIREVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRHFWVFVRH 244

Query: 246 PRWRISELPWQQWTLIAEVVVEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV 305
           PRWRISELPWQQWTLIAEVV+EAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV
Sbjct: 245 PRWRISELPWQQWTLIAEVVLEAGKEERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV 304

Query: 306 KKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKIVRINPK 365
           KKPVYQCRFEP  EFFQA+MPFLDPKTEQD LFELQDDEG++EWVTYFGGLCKI+R+NPK
Sbjct: 305 KKPVYQCRFEPQAEFFQALMPFLDPKTEQDVLFELQDDEGNVEWVTYFGGLCKILRVNPK 364

Query: 366 AFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP-DDIE 425
           AFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP DD E
Sbjct: 365 AFVDDVANAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAPDDDNE 424

Query: 426 NRGGEENVITEWIETDDDNEEEYEDQPEEDVVMETENEDEDEDDKREDGNEEEDENYWDE 485
           NR  +ENV+ EWIET DDN+++YED+  EDVVMET  E EDE+D  E  NEEEDE+YWDE
Sbjct: 425 NRHSDENVVMEWIET-DDNDDDYEDE-AEDVVMETNEEAEDEEDGGEHQNEEEDEDYWDE 484

Query: 486 RFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSETEMRGKRAKVR 545
           RFRKAISSPEELEKL KRS E +DEFYEK+K +   GSR+A   +DG ETE+RGKRAKV+
Sbjct: 485 RFRKAISSPEELEKLLKRSEEASDEFYEKQKGR-NAGSRKAME-DDGDETELRGKRAKVK 544

Query: 546 PEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGVIGV 601
           PEEWE IGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+G IGV
Sbjct: 545 PEEWERIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILEGEIGV 595

BLAST of HG10013707 vs. TAIR 10
Match: AT3G14900.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: embryo development; LOCATED IN: chloroplast; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 13 growth stages; Has 17135 Blast hits to 10204 proteins in 644 species: Archae - 47; Bacteria - 1684; Metazoa - 5536; Fungi - 2506; Plants - 1043; Viruses - 361; Other Eukaryotes - 5958 (source: NCBI BLink). )

HSP 1 Score: 640.6 bits (1651), Expect = 1.3e-183
Identity = 347/619 (56.06%), Postives = 450/619 (72.70%), Query Frame = 0

Query: 9   FSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTHKPLSIVSGHPNPSLLSISRQIS 68
           FSKTLNPS  F      SP ++ + + +++     + +   S+           + R+  
Sbjct: 8   FSKTLNPSFSFRK----SPLNSGVRRIVSVLPAITERNYAFSVKRSE------LLLREDG 67

Query: 69  HFSFANARRDIRTHAGRSKKK-GGGPSPGRIEGNAEFRRKLRHNARRKSQKLAESHFYR- 128
            F     RRD+R  AGRSKKK GGG S GRIEG+++ R++++ NAR KS+KLAES FYR 
Sbjct: 68  GF-----RRDVRALAGRSKKKLGGGSSGGRIEGDSDMRKQVKRNAREKSKKLAESLFYRL 127

Query: 129 -------RKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPY 188
                  R +  S+  D F+E+EL+ IGLGYDRMVRFM+KDDP LRHPYDW+KYGEFGPY
Sbjct: 128 YNNPDKSRSQILSSHPDKFTEEELEMIGLGYDRMVRFMDKDDPRLRHPYDWFKYGEFGPY 187

Query: 189 SWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYF 248
           SWRGVVVG+P+RG  +DE VT+I EV++HEE+EKIEQ EM   F + ++ +D + GLRYF
Sbjct: 188 SWRGVVVGDPVRGTISDECVTMIGEVENHEEFEKIEQHEMNIAFQKRVKELDSNVGLRYF 247

Query: 249 WVFVRHPRWRISELPWQQWTLIAEVVVEAG-KERLDKWSLMGRLGNKSRKNITQCAAWMR 308
           WVFVRHP+WR+SELPW+QWTL++EVVVEA  K+RLDKW+LMGRLGNKSR  I QCAAW R
Sbjct: 248 WVFVRHPKWRLSELPWEQWTLVSEVVVEADKKQRLDKWNLMGRLGNKSRSLICQCAAWFR 307

Query: 309 PDIIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKI 368
           PDI+YVKKPV+QCRFEP ++FF +++P+L+P TE  F+ E++DDEG +E  TY+GGLCK+
Sbjct: 308 PDIVYVKKPVFQCRFEPQEDFFNSLIPYLNPVTESGFVCEVEDDEGRVELSTYYGGLCKM 367

Query: 369 VRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDA 428
           +++   AFVDDVVNAYEKLSDEKKS+ L+FLL NHP  LLHPYTKEWKAKLEE ELGCDA
Sbjct: 368 LKVRQTAFVDDVVNAYEKLSDEKKSRVLKFLLGNHPNELLHPYTKEWKAKLEEMELGCDA 427

Query: 429 PDDIENR-----GGEENVITEWIETDDDNEEEYEDQPEEDVVMETENEDE---------- 488
           PD+ E+        E+   +EWIE + DN+++ +D  ++D  +E  ++D+          
Sbjct: 428 PDEDEDEISISGSSEKAEFSEWIEDEADNDDDDDDDDDDDGEVEEVDDDDNMVVDVEGNV 487

Query: 489 DED---DKREDGNEEEDENYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVG 548
           +ED   D+ E+ + EEDE YW+E+F KA ++ E +EKL + S  V+D+FYEK+    K  
Sbjct: 488 EEDSLEDEIEESDPEEDERYWEEQFNKATNNAERMEKLAEMSMVVSDKFYEKQL---KAL 547

Query: 549 SRRATAMEDGSETEMRGKRAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYR 600
             R     +G E EMRGK+AKV+PEEW+ +GYG W KKIKKS+IPPELFLR+ VRPF YR
Sbjct: 548 EEREKGEIEGDELEMRGKKAKVKPEEWKTVGYGRWMKKIKKSRIPPELFLRAAVRPFVYR 607

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038898752.12.3e-30791.07uncharacterized protein LOC120086270 [Benincasa hispida][more]
XP_008463741.12.1e-28986.00PREDICTED: uncharacterized protein LOC103501814 [Cucumis melo] >KAA0066766.1 unc... [more]
XP_022976454.14.0e-28885.10uncharacterized protein LOC111476853 [Cucurbita maxima][more]
XP_004146025.16.4e-28685.38uncharacterized protein LOC101207599 [Cucumis sativus] >KGN55067.1 hypothetical ... [more]
XP_023535253.18.4e-28685.38uncharacterized protein LOC111796741 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5A7VK561.0e-28986.00Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A1S3CKF21.0e-28986.00uncharacterized protein LOC103501814 OS=Cucumis melo OX=3656 GN=LOC103501814 PE=... [more]
A0A6J1INI91.9e-28885.10uncharacterized protein LOC111476853 OS=Cucurbita maxima OX=3661 GN=LOC111476853... [more]
A0A0A0L3A43.1e-28685.38Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G627170 PE=4 SV=1[more]
A0A6J1FAH02.0e-28585.59uncharacterized protein LOC111443567 OS=Cucurbita moschata OX=3662 GN=LOC1114435... [more]
Match NameE-valueIdentityDescription
AT3G14900.11.3e-18356.06unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: embryo d... [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 433..478
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 115..135
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 516..539
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 417..432
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 79..137
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 417..479
NoneNo IPR availablePANTHERPTHR37911OSJNBA0067K08.20 PROTEINcoord: 5..599

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10013707.1HG10013707.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009793 embryo development ending in seed dormancy