HG10019701 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10019701
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Protein of unknown function (DUF1666)
Location: Chr04: 24535619 .. 24540920 (-)
RNA-Seq Expression: HG10019701
Synteny: HG10019701
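
The Location field packs the chromosome, the start and end coordinates, and the strand into one string. A minimal sketch of how it might be parsed is given here, assuming the coordinates are 1-based and inclusive (the usual convention in GFF-style annotation, but not stated on this page). The helper name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr04: 24535619 .. 24540920 (-)'."""
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so both ends count.
        "length": end - start + 1,
    }

loc = parse_location("Chr04: 24535619 .. 24540920 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 5302 bp on the minus strand of Chr04.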
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGTTCTCGGATTCTTATAGGTCTTTTGATGAGGGGAAAATCGTGGAAAGTTCTGCTGGTGGTGGTGGTGGTGACGATTCAGTGTCGCTGGAGGAGGAGACTGACGGCGGGGGAGCGACTGAGAGTTTAGAGGAGAAGGGAAGGAATGAATTTGCTTTCAGTTTCCGGTTCCAAACGTATGAGGAATTTACCAAATCGAGTAAAGAAAATTTTGGTTGTGAGAAACTGGACTCGAGTGGTTGTTCTTCATCATTGAGCAACAGATATGAGTTTTTGCCGGAGAAAAGTACGAGCCACTTTGTGGAAGAAGCTGAAATTCCGAGCTATACGGTGGAAATTTTGAATTCTTGTTCCAATCACGAAATTTTGGGAAATGGGAATTTTTCTGTTCGAGAATTTTCTGGGAAAGTCTTGGAATGTGAAGCTGTTGATCAGAAAATTACAGAGTGCTCTGCTGATGGAACAGAGGAATTTTCTGGGAAAATTTTGAAACTTGAAGCTATTGAGGAAGGAAAATCATTTTCTGTGGAAATTTCAAAATCCCAACCTGTTGAAGAAGAAGAAGAAATTACAGAGAGCTTTTCAAATGGGAAAGAAGAATCTTCCGGGAAAATTCAGAGAGAAGAAGAAGAAGACGACAATGACTTTCTAAGGGAAACAGATTTCGCCGGTTCCGATTCAGACGCCGATGTCGACATTGGAGGCCGGTTTCTCTCCGATACCGATTTCGATTTCGATTTCAAAATCGGCGGGTACGAACCAGACGACGAGATAAACGAAGAATCGGAGAAGTCGCCGGAGGAAAATGGAAACGGTGAGGATTCAGAAGAGTTGAATGGATTGGAAACAGAGTGGGAACATCAAGAATTGATTGAACAATTAAAAATGGAATTGAAGAAAGTGAGAGCCACAGGGCTGGCCACCATTTTTGAAGAATCAGAGTCACCCAAGATAATGGGAGAATTGAAACCATGGAAGATTGATGAGAGATTTCAACATGGAGATTTAATGGAAGAGCTTCATAAATTCTATAGAACATACAGAGAACGCATGAGGAAATTGGATATCTTGAATTACCAGAAGATGTATGCAATGGGTGAGTTTCTTCTTGTTCTTCATAATTTTTTTATTATTTTTTCCCTTTAACATAATAATTTTTTAAATTTGATTTTCTAATGTATTAGTTAGATAAATTTGGATGATGGGTATGTTTTACCTTAGTTTGGTTTAATAAGTTACAGAGGGATTATCATATAAGAGAGAGAGAAGGGGTTGGTTTAACCATCTATGCATTTTTATTTTATTTTATTTTTATTTGCTCATTTTTATTTAGGGTTGGAAAATTGGAACTTTCTGAGAGGGAGTTTCCCATTTTCATTTTGTTTTGGAACTTTCTTTGGGAGGGGAAAGTTCCAAACTTTTTAAAGAGGTAGGAGCTAGTTTGATGACAATTTTATTTTCAGAAAATTAAAAATAAATTTTTGATATTTTATTATCAATTATTTCTATAAGCTAAGGTTAAAAAACTACTGTAATCAAACCATATCCTAACATCTAACATTGTTGGTGGGGATAATAATTTCATAATTTGAGGGTATTATTGCAAAATCTCCATAATTTAGAGATAGTTAGAATAGATTTATAGTCTATAATTGCAACAACTTTTGTAATTGAGTGGAATTATAGACTAAATGCAACAAATCCTATTGTTTGGATTTTTGTACATTTTTCAAATTTTATATGGGTAGCTCATTGAAAGTTGAAAATTAAACCCTATCCTACCGTTTCTTCATTTTATTTGTTTATGTTGTTAAAACAAAATTAAAATTATTTAATATTTATATTAGATGGCTTTGGTTTTGGAACTTGAAAGATTTATTATACTCTATGAATGAATTTAATGGTCTATTTTAGAAAAAAATTATTAAATGCTCATAGTTTGGTCATGTTGTGCATAAACCAAATAATTCCTTGTGACAAAGTTGAGGAAGAAACAA
AGCTAACTTTCTTGTGGGTTTTAAATGGCCACTCCTTTTTGGGTTATATGGTTAATGCATATGTGTTCACTTTGGGGTTTCTTTATTTATTTATTTATTATTATTATTTATTTTTTTTAATTTCATAATGTTTAGGTCAACAAGGTGCTTTTGTCTCTAACCTATTCCCCACTTTTTATTTTTTCCCTTTTTTTCTTTTTAAAATATAATAATTAATGTCCTAGGTTCTTACCTATGGTATGACTTTTTCACATAACCCTAGTGCTAGTTAGAAATTATTCTTTAAAAACTATTTTTTTAAACCAAAAATTCAATAGAAATATATATTGACTATAGTAAAAGTTTGAATGGTTTAAAGATTTTTTTACGTGGTAATACAATTTTTCTTTACACAAGCTTAGCAAAGTTGATAAATTAAATATTAAGATATACTATTATAGGGAACAATCTAGTTAGAATCGTATTCAATTGAAGTCAATAAGTTTTTTAGTTGAGGAGGCTCTCTTTATTGCTTCTTATGTAAATGTTACTCATTTTTGTAAAATTTTTAGAGAGGAGAACAAAGTTTCCCAAAATCTAGCTGCTTTGATATGGGGCTTCCTCTTTTTGGAGAGAGTTTCCTTCCTCTGTATTATCCCTCATCCTCGAGAGGGATTGATAGTTGGTTGTTTAGTTTGTTTGGTCAATCCTATTGAATCTCGGTTTTAAAAAAAAAATTATAGCTATAAATAGTTTTTTTTAAGGGGAATTTTTAAAAATAGAAAAATAAGGAAAACTATTTACAAAAAATAGCAAAAATTTTAGATAGTTGTGATAGAAGTCTATCGCTGATAGATGCTAATAGAAGTCTGTCAATGTCTATTGGTGTTATTTTATTTTATTTTATTTTTTTTTTTTTTTTTGCTATCTTCAGTAAATAGTTTGACATTTTATCTATGGGTGAAAATTTCCTTCTTTTAATGTACCCTTTATCATTTGGCTTTTACTTATTGTGTTATTTTTTCTTTCCTAATATTTGTTTGGAAATCATAGGAGTATTGCAATCAAAGGACCCACTAAAATCATTTTCAAGCAATAGCAAATCTTCATCCCCATCAATCATCTCTCTTCTCTCACACAACCTAAGGCTATACAGACAGAAGAAATGTCAAGTGGACCCAATGGAAGACTTTATAAGAGAAGTTCATTGTGATTTGGAGATGGTATATGTAGGGCAAACGTGTCTTTCTTGGGAATTCATTCAATGGCAATATGAAAAGGCTTTAGATTTGTGGGAATCTGAACCTCATGGACTTCACCATTACAATGAAGTTGCTGGTGAGTTCCAACAATTCCAAGTTATCTTACAAAGATTCTTGGAAAATGAAGCTTTCGAAGGCCCGAGAGTAGAGAATTACGTCAAGCATCGATGTGTCGCTCGTAATCTTCTTCAAGTTCCCGTCATCAGAGGTAAATATATACTAAGAGCCTGTTTGAAATGACAGGTGCTTTTACTTGACGGTTTCTTGAAACGATCGATTCATATATAATTTTGCTTTTTTGTGTGTGTAGAGGACAAAAAGAGGGAGAGAAGGAAAGCAAGAAGAGGGAAATTAGATGATGGATATGAAGCAATAACAAGTGATATGGTGGTAGAGATGTTGCAAGAATCAATCAGAGTAATTTGGCAATTCATTAGAGCAGATAAAGATTGTCATCACAACACAAATGCAAGCCTAAAGCGCTCAAAAAAGTTGCAAGTAGAACTCCAAGATCCAGCTGATGAACAACTTCTAACTGACATCCAAACAGATCTCCAAAAGGTTAGTTTTTTTTTTTTTTTTTAACTTATATTTTATTAAAATTTTCTCATTTCACATACATTGAGCCAAAATCTCAATTTTTGTTTTTCGTTTCTTGTTTTCAATTTTTTTTAAAAAGTGTGTGGATAATCAATAACGAGAAATGAGAAATTAAGGCCTAGTTTGGTAATCATTTGGTTTTTAGTTTTTAGTTTTTCG
AAAATTAAGTCTACTTTCTCTTAATTTCTTATGATGATTTTCATTTCTTTTTTTTTTTTTTTAAGTAAGAGTTGAATTCTGCCGAGTTTCAAAAACGAAAATAAGTTTTTAAAAACTACTTTTTTTAGTTTCCAAAACTTAACTTGTTTTTTAAAAATATCGGTGAAAGTAAACAACAAAGCAATAAATTTAAAGGTGGAGGAGTTGTTTATAAACTTAATTTTCAAAAACAAAAAATCAAGTAGTTATCAAATGTTGCTTAATGGTTCATTTGAGTTCTTGTTTTTGGTTCCCGAACAGAAACCAAAAACAATTGTTGGTAACTGTTTTTGTTTATTGTTACTCAATTTTTTGAAAACATTTCTAAAACGTTTTTAAAATTCTGTAAAATTGGGGGATTGAGTTTTTTTTTGTTTTTGTTTTATGTTTTTTTCCCTAAAAATATAAATAAAATGTTAAATTACAAGTTTAGTTTTTGGACTTTGAAGTTTGTGTTGAATTTTTCTTTGAACTTTAAAAAATGTCAATAGGGTTCGTAAAATTTCAATTTTGTGTCTAATATGTCTCACTTTTAATTATGTGGCCTCATTGGTCCTTGATAAATTCAAAATTTTTAAAAAAATTTGATTGATCTATTAGCTATAAATTTAAATTTAATGTGTCTAATGTCTTAGAATTTTTAATTTTGTATAAGTAAATATGTAAATATTTAAAAAAAAAATTTTGAAAGATTAAGGCCTAAATTTGTAATTTTGAAAGTTTATAAATCAAATAGGCACAAACATCTAAGTTCAAGGTAAAACTTTGTAATTTAACTTAAATAGAAGTATTATTTTATCATTTTCACTTCTAAAATGTATAATAAATATAACAAATATTTTTTATATTTAATTTTAAAAACTTATTTTCACATAACTTCTTACGTTTTTAATTTCAAACTTTAGAGAGTAAGAAACAAAAAATGATAATTACCTAACATGGTTTTTTGATTTGTTTTTCTTTATTAAAAATGAATAAACAAGAAACAATGAACAACAATAATGTTATCGTAGCGGTTATAAAAATGTTGATGAAGATAAGTTGATTGGTTTCGATTTTTCATGTTGCAGAAAGAAAAGAAGCTAAAGGAAATTGTGAGAAGTGGACATTGCATACTGAAGAAATTGCAAAAGAATGAAGAAAATGAAGAGGCAGAAGGTGCATTGTATTTCTTTTGTGAAGTGGATATGAAATTGGTGGGAAGAGTTCTTAAAATGTCGAGAATAACAACAGATCAATTGATTTGGTGTAGGAACAAATTGA

mRNA sequence

ATGGGGTTCTCGGATTCTTATAGGTCTTTTGATGAGGGGAAAATCGTGGAAAGTTCTGCTGGTGGTGGTGGTGGTGACGATTCAGTGTCGCTGGAGGAGGAGACTGACGGCGGGGGAGCGACTGAGAGTTTAGAGGAGAAGGGAAGGAATGAATTTGCTTTCAGTTTCCGGTTCCAAACGTATGAGGAATTTACCAAATCGAGTAAAGAAAATTTTGGTTGTGAGAAACTGGACTCGAGTGGTTGTTCTTCATCATTGAGCAACAGATATGAGTTTTTGCCGGAGAAAAGTACGAGCCACTTTGTGGAAGAAGCTGAAATTCCGAGCTATACGGTGGAAATTTTGAATTCTTGTTCCAATCACGAAATTTTGGGAAATGGGAATTTTTCTGTTCGAGAATTTTCTGGGAAAGTCTTGGAATGTGAAGCTGTTGATCAGAAAATTACAGAGTGCTCTGCTGATGGAACAGAGGAATTTTCTGGGAAAATTTTGAAACTTGAAGCTATTGAGGAAGGAAAATCATTTTCTGTGGAAATTTCAAAATCCCAACCTGTTGAAGAAGAAGAAGAAATTACAGAGAGCTTTTCAAATGGGAAAGAAGAATCTTCCGGGAAAATTCAGAGAGAAGAAGAAGAAGACGACAATGACTTTCTAAGGGAAACAGATTTCGCCGGTTCCGATTCAGACGCCGATGTCGACATTGGAGGCCGGTTTCTCTCCGATACCGATTTCGATTTCGATTTCAAAATCGGCGGGTACGAACCAGACGACGAGATAAACGAAGAATCGGAGAAGTCGCCGGAGGAAAATGGAAACGGTGAGGATTCAGAAGAGTTGAATGGATTGGAAACAGAGTGGGAACATCAAGAATTGATTGAACAATTAAAAATGGAATTGAAGAAAGTGAGAGCCACAGGGCTGGCCACCATTTTTGAAGAATCAGAGTCACCCAAGATAATGGGAGAATTGAAACCATGGAAGATTGATGAGAGATTTCAACATGGAGATTTAATGGAAGAGCTTCATAAATTCTATAGAACATACAGAGAACGCATGAGGAAATTGGATATCTTGAATTACCAGAAGATGTATGCAATGGGAGTATTGCAATCAAAGGACCCACTAAAATCATTTTCAAGCAATAGCAAATCTTCATCCCCATCAATCATCTCTCTTCTCTCACACAACCTAAGGCTATACAGACAGAAGAAATGTCAAGTGGACCCAATGGAAGACTTTATAAGAGAAGTTCATTGTGATTTGGAGATGGTATATGTAGGGCAAACGTGTCTTTCTTGGGAATTCATTCAATGGCAATATGAAAAGGCTTTAGATTTGTGGGAATCTGAACCTCATGGACTTCACCATTACAATGAAGTTGCTGGTGAGTTCCAACAATTCCAAGTTATCTTACAAAGATTCTTGGAAAATGAAGCTTTCGAAGGCCCGAGAGTAGAGAATTACGTCAAGCATCGATGTGTCGCTCGTAATCTTCTTCAAGTTCCCGTCATCAGAGAGGACAAAAAGAGGGAGAGAAGGAAAGCAAGAAGAGGGAAATTAGATGATGGATATGAAGCAATAACAAGTGATATGGTGGTAGAGATGTTGCAAGAATCAATCAGAGTAATTTGGCAATTCATTAGAGCAGATAAAGATTGTCATCACAACACAAATGCAAGCCTAAAGCGCTCAAAAAAGTTGCAAGTAGAACTCCAAGATCCAGCTGATGAACAACTTCTAACTGACATCCAAACAGATCTCCAAAAGAAAGAAAAGAAGCTAAAGGAAATTGTGAGAAGTGGACATTGCATACTGAAGAAATTGCAAAAGAATGAAGAAAATGAAGAGGCAGAAGGAACAAATTGA

Coding sequence (CDS)

ATGGGGTTCTCGGATTCTTATAGGTCTTTTGATGAGGGGAAAATCGTGGAAAGTTCTGCTGGTGGTGGTGGTGGTGACGATTCAGTGTCGCTGGAGGAGGAGACTGACGGCGGGGGAGCGACTGAGAGTTTAGAGGAGAAGGGAAGGAATGAATTTGCTTTCAGTTTCCGGTTCCAAACGTATGAGGAATTTACCAAATCGAGTAAAGAAAATTTTGGTTGTGAGAAACTGGACTCGAGTGGTTGTTCTTCATCATTGAGCAACAGATATGAGTTTTTGCCGGAGAAAAGTACGAGCCACTTTGTGGAAGAAGCTGAAATTCCGAGCTATACGGTGGAAATTTTGAATTCTTGTTCCAATCACGAAATTTTGGGAAATGGGAATTTTTCTGTTCGAGAATTTTCTGGGAAAGTCTTGGAATGTGAAGCTGTTGATCAGAAAATTACAGAGTGCTCTGCTGATGGAACAGAGGAATTTTCTGGGAAAATTTTGAAACTTGAAGCTATTGAGGAAGGAAAATCATTTTCTGTGGAAATTTCAAAATCCCAACCTGTTGAAGAAGAAGAAGAAATTACAGAGAGCTTTTCAAATGGGAAAGAAGAATCTTCCGGGAAAATTCAGAGAGAAGAAGAAGAAGACGACAATGACTTTCTAAGGGAAACAGATTTCGCCGGTTCCGATTCAGACGCCGATGTCGACATTGGAGGCCGGTTTCTCTCCGATACCGATTTCGATTTCGATTTCAAAATCGGCGGGTACGAACCAGACGACGAGATAAACGAAGAATCGGAGAAGTCGCCGGAGGAAAATGGAAACGGTGAGGATTCAGAAGAGTTGAATGGATTGGAAACAGAGTGGGAACATCAAGAATTGATTGAACAATTAAAAATGGAATTGAAGAAAGTGAGAGCCACAGGGCTGGCCACCATTTTTGAAGAATCAGAGTCACCCAAGATAATGGGAGAATTGAAACCATGGAAGATTGATGAGAGATTTCAACATGGAGATTTAATGGAAGAGCTTCATAAATTCTATAGAACATACAGAGAACGCATGAGGAAATTGGATATCTTGAATTACCAGAAGATGTATGCAATGGGAGTATTGCAATCAAAGGACCCACTAAAATCATTTTCAAGCAATAGCAAATCTTCATCCCCATCAATCATCTCTCTTCTCTCACACAACCTAAGGCTATACAGACAGAAGAAATGTCAAGTGGACCCAATGGAAGACTTTATAAGAGAAGTTCATTGTGATTTGGAGATGGTATATGTAGGGCAAACGTGTCTTTCTTGGGAATTCATTCAATGGCAATATGAAAAGGCTTTAGATTTGTGGGAATCTGAACCTCATGGACTTCACCATTACAATGAAGTTGCTGGTGAGTTCCAACAATTCCAAGTTATCTTACAAAGATTCTTGGAAAATGAAGCTTTCGAAGGCCCGAGAGTAGAGAATTACGTCAAGCATCGATGTGTCGCTCGTAATCTTCTTCAAGTTCCCGTCATCAGAGAGGACAAAAAGAGGGAGAGAAGGAAAGCAAGAAGAGGGAAATTAGATGATGGATATGAAGCAATAACAAGTGATATGGTGGTAGAGATGTTGCAAGAATCAATCAGAGTAATTTGGCAATTCATTAGAGCAGATAAAGATTGTCATCACAACACAAATGCAAGCCTAAAGCGCTCAAAAAAGTTGCAAGTAGAACTCCAAGATCCAGCTGATGAACAACTTCTAACTGACATCCAAACAGATCTCCAAAAGAAAGAAAAGAAGCTAAAGGAAATTGTGAGAAGTGGACATTGCATACTGAAGAAATTGCAAAAGAATGAAGAAAATGAAGAGGCAGAAGGAACAAATTGA

Protein sequence

MGFSDSYRSFDEGKIVESSAGGGGGDDSVSLEEETDGGGATESLEEKGRNEFAFSFRFQTYEEFTKSSKENFGCEKLDSSGCSSSLSNRYEFLPEKSTSHFVEEAEIPSYTVEILNSCSNHEILGNGNFSVREFSGKVLECEAVDQKITECSADGTEEFSGKILKLEAIEEGKSFSVEISKSQPVEEEEEITESFSNGKEESSGKIQREEEEDDNDFLRETDFAGSDSDADVDIGGRFLSDTDFDFDFKIGGYEPDDEINEESEKSPEENGNGEDSEELNGLETEWEHQELIEQLKMELKKVRATGLATIFEESESPKIMGELKPWKIDERFQHGDLMEELHKFYRTYRERMRKLDILNYQKMYAMGVLQSKDPLKSFSSNSKSSSPSIISLLSHNLRLYRQKKCQVDPMEDFIREVHCDLEMVYVGQTCLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQFQVILQRFLENEAFEGPRVENYVKHRCVARNLLQVPVIREDKKRERRKARRGKLDDGYEAITSDMVVEMLQESIRVIWQFIRADKDCHHNTNASLKRSKKLQVELQDPADEQLLTDIQTDLQKKEKKLKEIVRSGHCILKKLQKNEENEEAEGTN
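
The protein sequence above should follow from the CDS by translation. As a rough check, the sketch below translates short excerpts of the CDS, assuming the standard nuclear genetic code (NCBI translation table 1). The function name `translate` and the choice to stop at the first stop codon are illustrative and not taken from the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last four codons of the CDS shown above.
print(translate("ATGGGGTTCTCGGATTCTTATAGG"))
print(translate("GGAACAAATTGA"))
```

The first call gives `MGFSDSYR` and the second gives `GTN`, which agree with the start and end of the listed protein sequence. Passing the full CDS string to the same function would be the way to check the remainder.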
Homology
BLAST of HG10019701 vs. NCBI nr
Match: XP_038905369.1 (uncharacterized protein LOC120091420 [Benincasa hispida])

HSP 1 Score: 1050.0 bits (2714), Expect = 7.8e-303
Identity = 553/619 (89.34%), Positives = 577/619 (93.21%), Query Frame = 0

Query: 3   FSDSYRSFDEGKIVESSAGGGGG-DDSVSLEEETDGGGATESLEEKGRNEFAFSFRFQTY 62
           FS  +R FDEGK+VESS GGGGG DDS  LEEE DG GATE+LEEKG NEFAFSFRFQTY
Sbjct: 35  FSFLFRFFDEGKVVESSVGGGGGDDDSRPLEEEADGRGATENLEEKGTNEFAFSFRFQTY 94

Query: 63  EEFTKSSKENFGCEKLDSSGCSSSLSNRYEFLPEKSTSHFVEEAEIPSYTVEILNSCSNH 122
           EEF+KSSK+NFGCEKLD SGCSSSLSNRYEFLPEKSTSHFVEE EIPSYTVE+LNSCSNH
Sbjct: 95  EEFSKSSKKNFGCEKLDWSGCSSSLSNRYEFLPEKSTSHFVEETEIPSYTVEVLNSCSNH 154

Query: 123 EILGNGNFSVREFSGKVLECEAVDQKITECSADGTEEFSGKILKLEAIEEGKSFSVEISK 182
           EI GNG+FSVREFS  VLE EAVDQ+ITE SADGTEEFS KILK E IEEGK+F VE+SK
Sbjct: 155 EISGNGDFSVREFSELVLEFEAVDQEITESSADGTEEFSEKILKFEVIEEGKTFPVEVSK 214

Query: 183 SQPV-EEEEEITESFSNGKEESSGKIQREEEEDDNDFLRETDFAGSDSDADVDIGGRFLS 242
           SQP+ EEEEEITESF+NGKEESSGKIQREEEE DNDFLRETDF GSDSD D+DIGGRFLS
Sbjct: 215 SQPIEEEEEEITESFTNGKEESSGKIQREEEE-DNDFLRETDFNGSDSDGDIDIGGRFLS 274

Query: 243 DTDFDFDFKIGGYEPDDEINEESEKSPEENGNGEDSEELNGLETEWEHQELIEQLKMELK 302
           DTDFD DFKIGGYEPD+EINEE EKSPE NG+ EDSE   GLETEWEHQELIEQLKMELK
Sbjct: 275 DTDFDLDFKIGGYEPDEEINEELEKSPEGNGDEEDSE---GLETEWEHQELIEQLKMELK 334

Query: 303 KVRATGLATIFEESESPKIMGELKPWKIDERFQHGDLMEELHKFYRTYRERMRKLDILNY 362
           KVRATGLATIFEESESPKIM ELKPWKIDERFQHGDLMEELHKFYRTYRERMRKLDILNY
Sbjct: 335 KVRATGLATIFEESESPKIMDELKPWKIDERFQHGDLMEELHKFYRTYRERMRKLDILNY 394

Query: 363 QKMYAMGVLQSKDPLKSFSSNSKSSSPSIISLLSHNLRLYRQKKCQVDPMEDFIREVHCD 422
           QKMYAMGVLQSKDPLKSFSS SKSSSPSI+SLLSHNLRLYRQKKCQVDPM+DFIREVHCD
Sbjct: 395 QKMYAMGVLQSKDPLKSFSSTSKSSSPSILSLLSHNLRLYRQKKCQVDPMKDFIREVHCD 454

Query: 423 LEMVYVGQTCLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQFQVILQRFLENEAF 482
           LEMVYVGQ CLSWEFIQWQY+KALDLWESEPHGLHHYNEVAGEFQQFQV+LQRFLENEAF
Sbjct: 455 LEMVYVGQMCLSWEFIQWQYQKALDLWESEPHGLHHYNEVAGEFQQFQVLLQRFLENEAF 514

Query: 483 EGPRVENYVKHRCVARNLLQVPVIREDKKRERRKARRGKLDDGYEAITSDMVVEMLQESI 542
           EGPRVENYVKHRCV RNLLQVPVIREDKK +RRKARRGKL+DGYEAITSDMVVEMLQESI
Sbjct: 515 EGPRVENYVKHRCVVRNLLQVPVIREDKKGDRRKARRGKLEDGYEAITSDMVVEMLQESI 574

Query: 543 RVIWQFIRADKDCHHNTNASLKRSKKLQVELQDPADEQLLTDIQTDLQKKEKKLKEIVRS 602
           RVIWQFIRADKDCHH++N SLKR KKLQVELQDPADEQLLT IQTDLQKKEKKLKEI+RS
Sbjct: 575 RVIWQFIRADKDCHHHSNGSLKRPKKLQVELQDPADEQLLTHIQTDLQKKEKKLKEIMRS 634

Query: 603 GHCILKKLQKNEENEEAEG 620
           GHCILKKLQKNEENEE  G
Sbjct: 635 GHCILKKLQKNEENEETGG 649
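
The percentages in each HSP header are the match counts divided by the alignment length, gaps included. A small sketch that reproduces the figures for the Benincasa hispida hit, using the counts reported above; the helper name `percent` is hypothetical.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the HSP header of the XP_038905369.1 hit.
identity = percent(553, 619)
positives = percent(577, 619)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The same calculation applies to the other hits on this page by substituting their reported counts.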

BLAST of HG10019701 vs. NCBI nr
Match: XP_011652238.1 (uncharacterized protein LOC101211770 isoform X1 [Cucumis sativus] >KGN59576.1 hypothetical protein Csa_001432 [Cucumis sativus])

HSP 1 Score: 949.5 bits (2453), Expect = 1.4e-272
Identity = 518/626 (82.75%), Positives = 549/626 (87.70%), Query Frame = 0

Query: 3   FSDSYRSFDEGKIVESSAGGGGGDDSVSL--EEETDGGGATESLEEKGRNEFAFSFRFQT 62
           FS  +RSF+EGKIVESS    GGDD VSL  E+ET GGG  ESL EK RNEF+FSF+FQT
Sbjct: 21  FSFLFRSFNEGKIVESSVVVHGGDDDVSLSPEDETKGGGVIESL-EKERNEFSFSFKFQT 80

Query: 63  YEEFTKSSKENFGCEKLDSSGCSSSLSNRYEFLPEKSTSHFVEEAEIPSYTVEILNSCSN 122
           YEEF+KS+KEN  CEKLD SG SSSL NRYE LPEKSTSHFVEEAEIPSYTVE+LNSC N
Sbjct: 81  YEEFSKSNKENICCEKLDWSGGSSSLGNRYEILPEKSTSHFVEEAEIPSYTVEVLNSCLN 140

Query: 123 HEILGNGNFSVREFSGKVLECEAVDQKITECS-ADGTEEFSGKILKLEAIEEGKSFSVEI 182
           H +LGN +    E SGKVLE E V Q+ITECS  DGTEE SGK  K EA+EE K F    
Sbjct: 141 HGVLGNES----EVSGKVLEHEIVSQEITECSTVDGTEEVSGKFFKFEAVEEEKPF---- 200

Query: 183 SKSQPVEEEEEITESFSNGKEESSGKIQ---REEEEDDNDFLRETDFAGSDSDADVDIGG 242
             ++  +EEEEITE F N KEESS KIQ    EEEE+DNDFL+ETDFAGSDSDADVDIGG
Sbjct: 201 --TKFEDEEEEITERFRNEKEESSPKIQSEEEEEEEEDNDFLKETDFAGSDSDADVDIGG 260

Query: 243 RFLSDTDFDFDFKIGGYEPDDEIN-EESEKSPEENGNG-EDSEELNGLETEWEHQELIEQ 302
           RFLSDTDFD DFK GGYEPDDEIN EESEKS E NG G EDSEELNGLETEWEHQELIEQ
Sbjct: 261 RFLSDTDFDLDFKTGGYEPDDEINVEESEKSAEGNGKGEEDSEELNGLETEWEHQELIEQ 320

Query: 303 LKMELKKVRATGLATIFEESESPKIMGELKPWKIDERFQHGDLMEELHKFYRTYRERMRK 362
           LKMELKKVRATGLATIFEESESPKIMGELKPWKIDE+FQHGDLMEELHKFYR+YRERMRK
Sbjct: 321 LKMELKKVRATGLATIFEESESPKIMGELKPWKIDEKFQHGDLMEELHKFYRSYRERMRK 380

Query: 363 LDILNYQKMYAMGVLQSKDPLKSFSSNSK-SSSPSIISLLSHNLRLYRQKKCQVDPMEDF 422
           LDILNYQKMYAMGVLQSKDPL SFSSN K SSS SIIS  +HNLRLYR+ KCQVDPM+DF
Sbjct: 381 LDILNYQKMYAMGVLQSKDPLNSFSSNDKSSSSSSIISAFTHNLRLYRRNKCQVDPMKDF 440

Query: 423 IREVHCDLEMVYVGQTCLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQFQVILQR 482
           IREVHCDLEMVYVGQ CLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQFQV+LQR
Sbjct: 441 IREVHCDLEMVYVGQLCLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQFQVLLQR 500

Query: 483 FLENEAFEGPRVENYVKHRCVARNLLQVPVIREDKKRERRKARRGKLDDGYEAITSDMVV 542
           FLENE FEGPRVENYVKHRCVARNLLQVPVIREDK+R+RRK RRGKL+DGYEAITSDM+V
Sbjct: 501 FLENEPFEGPRVENYVKHRCVARNLLQVPVIREDKRRDRRKGRRGKLEDGYEAITSDMLV 560

Query: 543 EMLQESIRVIWQFIRADKDCHHNTNASLKRSKKLQVELQDPADEQLLTDIQTDLQKKEKK 602
           EMLQESIRVIWQFIRADKDCHH+TN SLKR KKLQVELQ+PADEQLLT IQ DLQKKEK+
Sbjct: 561 EMLQESIRVIWQFIRADKDCHHSTNGSLKRPKKLQVELQEPADEQLLTHIQIDLQKKEKR 620

Query: 603 LKEIVRSGHCILKKLQKNEENEEAEG 620
           LKEIVRSGHCILKKL+KNEENEE EG
Sbjct: 621 LKEIVRSGHCILKKLKKNEENEETEG 635

BLAST of HG10019701 vs. NCBI nr
Match: KAA0053756.1 (uncharacterized protein E6C27_scaffold135G001360 [Cucumis melo var. makuwa])

HSP 1 Score: 903.7 bits (2334), Expect = 9.0e-259
Identity = 495/604 (81.95%), Positives = 525/604 (86.92%), Query Frame = 0

Query: 3   FSDSYRSFDEGKIVESSAGGGGGD---DSVSLEEETDGGGATESLEEKGRNEFAFSFRFQ 62
           FS  +RSF+E KIVESS    GGD   DS+  E+ET GGG  ESLEEK RNEFAFSF+FQ
Sbjct: 30  FSFLFRSFNEAKIVESSVVVHGGDDDGDSLLPEDETKGGGVIESLEEKERNEFAFSFKFQ 89

Query: 63  TYEEFTKSSKENFGCEKLDSSGCSSSLSNRYEFLPEKSTSHFVEEAEIPSYTVEILNSCS 122
           TYEEF+KSSKEN  CE L+ SG SSSLSNRYE LPEKSTSHFVEEAEIPSYTVE+LNSCS
Sbjct: 90  TYEEFSKSSKENIFCENLEWSGGSSSLSNRYEILPEKSTSHFVEEAEIPSYTVEVLNSCS 149

Query: 123 NHEILGNGNFSVREFSGKVLECEAVDQKITECSA-DGTEEFSGKILKLEAIEEGKSFSVE 182
           NHEILGN N    EFSGKVLE E V Q+ITE SA +GTEE SGK LK EA+EE + F   
Sbjct: 150 NHEILGNEN----EFSGKVLEHEIVGQEITEGSAVNGTEEVSGKFLKFEAVEEERPF--- 209

Query: 183 ISKSQPVEEEEEITESFSNGKEESSGKIQ------REEEEDDNDFLRETDFAGSDSDADV 242
              ++  ++EEEI E F N KEESS KIQ       EEEE+DNDFL+ETDFAGSDSDADV
Sbjct: 210 ---TKFEDDEEEIMERFRNEKEESSWKIQSEEEEEEEEEEEDNDFLKETDFAGSDSDADV 269

Query: 243 DIGGRFLSDTDFDFDFKIGGYEPDDEIN-EESEKSPEENGNGEDSEELNGLETEWEHQEL 302
           DIGGRFLSDTDFD DFK GGYEPDDEIN EESEKSPE+    EDSEELNGLETEWEHQEL
Sbjct: 270 DIGGRFLSDTDFDLDFKTGGYEPDDEINIEESEKSPEK--GEEDSEELNGLETEWEHQEL 329

Query: 303 IEQLKMELKKVRATGLATIFEESESPKIMGELKPWKIDERFQHGDLMEELHKFYRTYRER 362
           IEQLKMELKKVRATGLATIFEESESPKIMGELKPWKIDE+FQHGDLMEELHKFYRTYRER
Sbjct: 330 IEQLKMELKKVRATGLATIFEESESPKIMGELKPWKIDEKFQHGDLMEELHKFYRTYRER 389

Query: 363 MRKLDILNYQKMYAMGVLQSKDPLKSFSSNSK----SSSPSIISLLSHNLRLYRQKKCQV 422
           MRKLDILNYQKMYAMGVLQSKDPL SFSSNSK    SSSPSIIS  +HNLRLYRQKKCQV
Sbjct: 390 MRKLDILNYQKMYAMGVLQSKDPLNSFSSNSKSSSSSSSPSIISAFTHNLRLYRQKKCQV 449

Query: 423 DPMEDFIREVHCDLEMVYVGQTCLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQF 482
           DPM+DFIREVHCDLEMVYVGQ CLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQF
Sbjct: 450 DPMKDFIREVHCDLEMVYVGQLCLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQF 509

Query: 483 QVILQRFLENEAFEGPRVENYVKHRCVARNLLQVPVIREDKKRERRKARRGKLDDGYEAI 542
           QV+LQRFLENE FEGPRVENYVKHRCVARNLLQVPVIREDK+R+RRK RRGKL+DGYEAI
Sbjct: 510 QVLLQRFLENEPFEGPRVENYVKHRCVARNLLQVPVIREDKRRDRRKGRRGKLEDGYEAI 569

Query: 543 TSDMVVEMLQESIRVIWQFIRADKDCHHNTNASLKRSKKLQVELQDPADEQLLTDIQTDL 592
            SDM+VEMLQESIRVIWQFIRADKDCHH+TN SLKR KKLQVELQDPADEQLLT IQ DL
Sbjct: 570 RSDMLVEMLQESIRVIWQFIRADKDCHHSTNGSLKRPKKLQVELQDPADEQLLTHIQIDL 621

BLAST of HG10019701 vs. NCBI nr
Match: XP_031739052.1 (uncharacterized protein LOC101211770 isoform X2 [Cucumis sativus])

HSP 1 Score: 899.0 bits (2322), Expect = 2.2e-257
Identity = 491/596 (82.38%), Positives = 520/596 (87.25%), Query Frame = 0

Query: 3   FSDSYRSFDEGKIVESSAGGGGGDDSVSL--EEETDGGGATESLEEKGRNEFAFSFRFQT 62
           FS  +RSF+EGKIVESS    GGDD VSL  E+ET GGG  ESL EK RNEF+FSF+FQT
Sbjct: 21  FSFLFRSFNEGKIVESSVVVHGGDDDVSLSPEDETKGGGVIESL-EKERNEFSFSFKFQT 80

Query: 63  YEEFTKSSKENFGCEKLDSSGCSSSLSNRYEFLPEKSTSHFVEEAEIPSYTVEILNSCSN 122
           YEEF+KS+KEN  CEKLD SG SSSL NRYE LPEKSTSHFVEEAEIPSYTVE+LNSC N
Sbjct: 81  YEEFSKSNKENICCEKLDWSGGSSSLGNRYEILPEKSTSHFVEEAEIPSYTVEVLNSCLN 140

Query: 123 HEILGNGNFSVREFSGKVLECEAVDQKITECS-ADGTEEFSGKILKLEAIEEGKSFSVEI 182
           H +LGN +    E SGKVLE E V Q+ITECS  DGTEE SGK  K EA+EE K F    
Sbjct: 141 HGVLGNES----EVSGKVLEHEIVSQEITECSTVDGTEEVSGKFFKFEAVEEEKPF---- 200

Query: 183 SKSQPVEEEEEITESFSNGKEESSGKIQ---REEEEDDNDFLRETDFAGSDSDADVDIGG 242
             ++  +EEEEITE F N KEESS KIQ    EEEE+DNDFL+ETDFAGSDSDADVDIGG
Sbjct: 201 --TKFEDEEEEITERFRNEKEESSPKIQSEEEEEEEEDNDFLKETDFAGSDSDADVDIGG 260

Query: 243 RFLSDTDFDFDFKIGGYEPDDEIN-EESEKSPEENGNG-EDSEELNGLETEWEHQELIEQ 302
           RFLSDTDFD DFK GGYEPDDEIN EESEKS E NG G EDSEELNGLETEWEHQELIEQ
Sbjct: 261 RFLSDTDFDLDFKTGGYEPDDEINVEESEKSAEGNGKGEEDSEELNGLETEWEHQELIEQ 320

Query: 303 LKMELKKVRATGLATIFEESESPKIMGELKPWKIDERFQHGDLMEELHKFYRTYRERMRK 362
           LKMELKKVRATGLATIFEESESPKIMGELKPWKIDE+FQHGDLMEELHKFYR+YRERMRK
Sbjct: 321 LKMELKKVRATGLATIFEESESPKIMGELKPWKIDEKFQHGDLMEELHKFYRSYRERMRK 380

Query: 363 LDILNYQKMYAMGVLQSKDPLKSFSSNSK-SSSPSIISLLSHNLRLYRQKKCQVDPMEDF 422
           LDILNYQKMYAMGVLQSKDPL SFSSN K SSS SIIS  +HNLRLYR+ KCQVDPM+DF
Sbjct: 381 LDILNYQKMYAMGVLQSKDPLNSFSSNDKSSSSSSIISAFTHNLRLYRRNKCQVDPMKDF 440

Query: 423 IREVHCDLEMVYVGQTCLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQFQVILQR 482
           IREVHCDLEMVYVGQ CLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQFQV+LQR
Sbjct: 441 IREVHCDLEMVYVGQLCLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQFQVLLQR 500

Query: 483 FLENEAFEGPRVENYVKHRCVARNLLQVPVIREDKKRERRKARRGKLDDGYEAITSDMVV 542
           FLENE FEGPRVENYVKHRCVARNLLQVPVIREDK+R+RRK RRGKL+DGYEAITSDM+V
Sbjct: 501 FLENEPFEGPRVENYVKHRCVARNLLQVPVIREDKRRDRRKGRRGKLEDGYEAITSDMLV 560

Query: 543 EMLQESIRVIWQFIRADKDCHHNTNASLKRSKKLQVELQDPADEQLLTDIQTDLQK 590
           EMLQESIRVIWQFIRADKDCHH+TN SLKR KKLQVELQ+PADEQLLT IQ DLQK
Sbjct: 561 EMLQESIRVIWQFIRADKDCHHSTNGSLKRPKKLQVELQEPADEQLLTHIQIDLQK 605

BLAST of HG10019701 vs. NCBI nr
Match: XP_022934863.1 (uncharacterized protein LOC111441903 [Cucurbita moschata])

HSP 1 Score: 747.7 bits (1929), Expect = 8.3e-212
Identity = 427/618 (69.09%), Positives = 492/618 (79.61%), Query Frame = 0

Query: 3   FSDSYRSFDEGKIVESSAGGGGGDDSVSLEEETD-GGGATESLEEKGRNEFAFSFRFQTY 62
           F+  +RSF   KI+E+S      D S  LEEE D GGG TESLEEK   EF F+FRFQT+
Sbjct: 21  FTFLFRSFHGEKILETS----DDDHSRPLEEEADGGGGVTESLEEKETEEFVFNFRFQTF 80

Query: 63  EEFTKSSKENFGCEKLDSSGC-SSSLSNRYEFLPEKSTSHFVEEAEIPSYTVEILNSCSN 122
           EEF+KS++ NFG +KLDSSGC SSS+SNRYEF PEKSTSHFVEEA++PS+TVE+LNSCS 
Sbjct: 81  EEFSKSNQGNFGSQKLDSSGCSSSSISNRYEFSPEKSTSHFVEEAKVPSFTVEVLNSCSK 140

Query: 123 HEILGNGNFSVREFSGKVLECEAVDQKITECSADGTEEFSGKILKLEAIEEGKSFSVEIS 182
           H  LGNGNFSVRE SGKVLE E     +T+C +DGTEEFSGKILK EA+ EG        
Sbjct: 141 HVGLGNGNFSVREVSGKVLESE-----VTDCFSDGTEEFSGKILKFEAVGEG-------- 200

Query: 183 KSQPVEEEEEITESFSNGKEESSGK--IQREEEEDDNDFLRETDFAGSDSDADVDIGGRF 242
                    EITE+  NG E+ S K   + EEEE+DN+F +      SDSD  VD+GG F
Sbjct: 201 ---------EITETSVNGGEQISEKQSEEEEEEEEDNEFRQ-----SSDSDTGVDVGGGF 260

Query: 243 LSDTDFDFDFKIGGYEPDDEINEESEKSPEE-NGNGEDSEELNGLETEWEHQELIEQLKM 302
            SD D D + + GGYEPD+EINEE EKS EE N N EDSE       EWE     E+LKM
Sbjct: 261 PSDLDLDLELETGGYEPDEEINEEPEKSREEGNENREDSE-------EWED----EELKM 320

Query: 303 ELKKVRATGLATIFEESESPKIMGELKPWKIDERFQHGDLMEELHKFYRTYRERMRKLDI 362
           E+KK R  GLATI+EESESPK+MGEL+ WKIDER ++GDLMEELHKFY+ YRERMRKLDI
Sbjct: 321 EMKKGRGRGLATIYEESESPKVMGELRAWKIDERSEYGDLMEELHKFYKAYRERMRKLDI 380

Query: 363 LNYQKMYAMGVLQSKDPLKSFSSNSKSSSP-SIISLLSHNLRLYRQKKCQVDPMEDFIRE 422
           LN+QKMYAMGVLQSKDPLKSF S+ KSS P SI SLLSH+LRLYRQKKCQVDPM+ FIRE
Sbjct: 381 LNFQKMYAMGVLQSKDPLKSFCSDRKSSWPSSITSLLSHDLRLYRQKKCQVDPMKKFIRE 440

Query: 423 VHCDLEMVYVGQTCLSWEFIQWQYEKALDLWESEPH-GLHHYNEVAGEFQQFQVILQRFL 482
           VHC+LEMVYVGQ CLSWEFIQWQYEKALDLWES+PH  LHHYN+VA +FQQFQV+LQRFL
Sbjct: 441 VHCELEMVYVGQMCLSWEFIQWQYEKALDLWESQPHTRLHHYNQVADDFQQFQVLLQRFL 500

Query: 483 ENEAFEGPRVENYVKHRCVARNLLQVPVIREDKKRERRKARRGKLDDGYEAITSDMVVEM 542
           ENEAF+GPR++NYVK+R +ARNLLQVPVI+EDK R+R+KARRGK +DGYEAITSD++VE+
Sbjct: 501 ENEAFQGPRLQNYVKNRFIARNLLQVPVIKEDKTRDRKKARRGK-EDGYEAITSDILVEI 560

Query: 543 LQESIRVIWQFIRADKDCHHNTNASLKRSKKLQVELQDPADEQLLTDIQTDLQKKEKKLK 602
           LQESIRVIWQFIRADKD H NT    KRSKK Q ELQ+PAD+QLLT+IQTDLQKKEKK+K
Sbjct: 561 LQESIRVIWQFIRADKD-HSNTT---KRSKKFQAELQNPADKQLLTEIQTDLQKKEKKVK 591

Query: 603 EIVRSGHCILKKLQKNEE 614
           +I+RSG CILKKLQK E+
Sbjct: 621 DIMRSGDCILKKLQKKEK 591

BLAST of HG10019701 vs. ExPASy TrEMBL
Match: A0A0A0LHR8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G826700 PE=4 SV=1)

HSP 1 Score: 949.5 bits (2453), Expect = 6.9e-273
Identity = 518/626 (82.75%), Positives = 549/626 (87.70%), Query Frame = 0

Query: 3   FSDSYRSFDEGKIVESSAGGGGGDDSVSL--EEETDGGGATESLEEKGRNEFAFSFRFQT 62
           FS  +RSF+EGKIVESS    GGDD VSL  E+ET GGG  ESL EK RNEF+FSF+FQT
Sbjct: 21  FSFLFRSFNEGKIVESSVVVHGGDDDVSLSPEDETKGGGVIESL-EKERNEFSFSFKFQT 80

Query: 63  YEEFTKSSKENFGCEKLDSSGCSSSLSNRYEFLPEKSTSHFVEEAEIPSYTVEILNSCSN 122
           YEEF+KS+KEN  CEKLD SG SSSL NRYE LPEKSTSHFVEEAEIPSYTVE+LNSC N
Sbjct: 81  YEEFSKSNKENICCEKLDWSGGSSSLGNRYEILPEKSTSHFVEEAEIPSYTVEVLNSCLN 140

Query: 123 HEILGNGNFSVREFSGKVLECEAVDQKITECS-ADGTEEFSGKILKLEAIEEGKSFSVEI 182
           H +LGN +    E SGKVLE E V Q+ITECS  DGTEE SGK  K EA+EE K F    
Sbjct: 141 HGVLGNES----EVSGKVLEHEIVSQEITECSTVDGTEEVSGKFFKFEAVEEEKPF---- 200

Query: 183 SKSQPVEEEEEITESFSNGKEESSGKIQ---REEEEDDNDFLRETDFAGSDSDADVDIGG 242
             ++  +EEEEITE F N KEESS KIQ    EEEE+DNDFL+ETDFAGSDSDADVDIGG
Sbjct: 201 --TKFEDEEEEITERFRNEKEESSPKIQSEEEEEEEEDNDFLKETDFAGSDSDADVDIGG 260

Query: 243 RFLSDTDFDFDFKIGGYEPDDEIN-EESEKSPEENGNG-EDSEELNGLETEWEHQELIEQ 302
           RFLSDTDFD DFK GGYEPDDEIN EESEKS E NG G EDSEELNGLETEWEHQELIEQ
Sbjct: 261 RFLSDTDFDLDFKTGGYEPDDEINVEESEKSAEGNGKGEEDSEELNGLETEWEHQELIEQ 320

Query: 303 LKMELKKVRATGLATIFEESESPKIMGELKPWKIDERFQHGDLMEELHKFYRTYRERMRK 362
           LKMELKKVRATGLATIFEESESPKIMGELKPWKIDE+FQHGDLMEELHKFYR+YRERMRK
Sbjct: 321 LKMELKKVRATGLATIFEESESPKIMGELKPWKIDEKFQHGDLMEELHKFYRSYRERMRK 380

Query: 363 LDILNYQKMYAMGVLQSKDPLKSFSSNSK-SSSPSIISLLSHNLRLYRQKKCQVDPMEDF 422
           LDILNYQKMYAMGVLQSKDPL SFSSN K SSS SIIS  +HNLRLYR+ KCQVDPM+DF
Sbjct: 381 LDILNYQKMYAMGVLQSKDPLNSFSSNDKSSSSSSIISAFTHNLRLYRRNKCQVDPMKDF 440

Query: 423 IREVHCDLEMVYVGQTCLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQFQVILQR 482
           IREVHCDLEMVYVGQ CLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQFQV+LQR
Sbjct: 441 IREVHCDLEMVYVGQLCLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQFQVLLQR 500

Query: 483 FLENEAFEGPRVENYVKHRCVARNLLQVPVIREDKKRERRKARRGKLDDGYEAITSDMVV 542
           FLENE FEGPRVENYVKHRCVARNLLQVPVIREDK+R+RRK RRGKL+DGYEAITSDM+V
Sbjct: 501 FLENEPFEGPRVENYVKHRCVARNLLQVPVIREDKRRDRRKGRRGKLEDGYEAITSDMLV 560

Query: 543 EMLQESIRVIWQFIRADKDCHHNTNASLKRSKKLQVELQDPADEQLLTDIQTDLQKKEKK 602
           EMLQESIRVIWQFIRADKDCHH+TN SLKR KKLQVELQ+PADEQLLT IQ DLQKKEK+
Sbjct: 561 EMLQESIRVIWQFIRADKDCHHSTNGSLKRPKKLQVELQEPADEQLLTHIQIDLQKKEKR 620

Query: 603 LKEIVRSGHCILKKLQKNEENEEAEG 620
           LKEIVRSGHCILKKL+KNEENEE EG
Sbjct: 621 LKEIVRSGHCILKKLKKNEENEETEG 635

BLAST of HG10019701 vs. ExPASy TrEMBL
Match: A0A5A7UHN5 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold135G001360 PE=4 SV=1)

HSP 1 Score: 903.7 bits (2334), Expect = 4.4e-259
Identity = 495/604 (81.95%), Positives = 525/604 (86.92%), Query Frame = 0

Query: 3   FSDSYRSFDEGKIVESSAGGGGGD---DSVSLEEETDGGGATESLEEKGRNEFAFSFRFQ 62
           FS  +RSF+E KIVESS    GGD   DS+  E+ET GGG  ESLEEK RNEFAFSF+FQ
Sbjct: 30  FSFLFRSFNEAKIVESSVVVHGGDDDGDSLLPEDETKGGGVIESLEEKERNEFAFSFKFQ 89

Query: 63  TYEEFTKSSKENFGCEKLDSSGCSSSLSNRYEFLPEKSTSHFVEEAEIPSYTVEILNSCS 122
           TYEEF+KSSKEN  CE L+ SG SSSLSNRYE LPEKSTSHFVEEAEIPSYTVE+LNSCS
Sbjct: 90  TYEEFSKSSKENIFCENLEWSGGSSSLSNRYEILPEKSTSHFVEEAEIPSYTVEVLNSCS 149

Query: 123 NHEILGNGNFSVREFSGKVLECEAVDQKITECSA-DGTEEFSGKILKLEAIEEGKSFSVE 182
           NHEILGN N    EFSGKVLE E V Q+ITE SA +GTEE SGK LK EA+EE + F   
Sbjct: 150 NHEILGNEN----EFSGKVLEHEIVGQEITEGSAVNGTEEVSGKFLKFEAVEEERPF--- 209

Query: 183 ISKSQPVEEEEEITESFSNGKEESSGKIQ------REEEEDDNDFLRETDFAGSDSDADV 242
              ++  ++EEEI E F N KEESS KIQ       EEEE+DNDFL+ETDFAGSDSDADV
Sbjct: 210 ---TKFEDDEEEIMERFRNEKEESSWKIQSEEEEEEEEEEEDNDFLKETDFAGSDSDADV 269

Query: 243 DIGGRFLSDTDFDFDFKIGGYEPDDEIN-EESEKSPEENGNGEDSEELNGLETEWEHQEL 302
           DIGGRFLSDTDFD DFK GGYEPDDEIN EESEKSPE+    EDSEELNGLETEWEHQEL
Sbjct: 270 DIGGRFLSDTDFDLDFKTGGYEPDDEINIEESEKSPEK--GEEDSEELNGLETEWEHQEL 329

Query: 303 IEQLKMELKKVRATGLATIFEESESPKIMGELKPWKIDERFQHGDLMEELHKFYRTYRER 362
           IEQLKMELKKVRATGLATIFEESESPKIMGELKPWKIDE+FQHGDLMEELHKFYRTYRER
Sbjct: 330 IEQLKMELKKVRATGLATIFEESESPKIMGELKPWKIDEKFQHGDLMEELHKFYRTYRER 389

Query: 363 MRKLDILNYQKMYAMGVLQSKDPLKSFSSNSK----SSSPSIISLLSHNLRLYRQKKCQV 422
           MRKLDILNYQKMYAMGVLQSKDPL SFSSNSK    SSSPSIIS  +HNLRLYRQKKCQV
Sbjct: 390 MRKLDILNYQKMYAMGVLQSKDPLNSFSSNSKSSSSSSSPSIISAFTHNLRLYRQKKCQV 449

Query: 423 DPMEDFIREVHCDLEMVYVGQTCLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQF 482
           DPM+DFIREVHCDLEMVYVGQ CLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQF
Sbjct: 450 DPMKDFIREVHCDLEMVYVGQLCLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQF 509

Query: 483 QVILQRFLENEAFEGPRVENYVKHRCVARNLLQVPVIREDKKRERRKARRGKLDDGYEAI 542
           QV+LQRFLENE FEGPRVENYVKHRCVARNLLQVPVIREDK+R+RRK RRGKL+DGYEAI
Sbjct: 510 QVLLQRFLENEPFEGPRVENYVKHRCVARNLLQVPVIREDKRRDRRKGRRGKLEDGYEAI 569

Query: 543 TSDMVVEMLQESIRVIWQFIRADKDCHHNTNASLKRSKKLQVELQDPADEQLLTDIQTDL 592
            SDM+VEMLQESIRVIWQFIRADKDCHH+TN SLKR KKLQVELQDPADEQLLT IQ DL
Sbjct: 570 RSDMLVEMLQESIRVIWQFIRADKDCHHSTNGSLKRPKKLQVELQDPADEQLLTHIQIDL 621

BLAST of HG10019701 vs. ExPASy TrEMBL
Match: A0A6J1F8X7 (uncharacterized protein LOC111441903 OS=Cucurbita moschata OX=3662 GN=LOC111441903 PE=4 SV=1)

HSP 1 Score: 747.7 bits (1929), Expect = 4.0e-212
Identity = 427/618 (69.09%), Positives = 492/618 (79.61%), Query Frame = 0

Query: 3   FSDSYRSFDEGKIVESSAGGGGGDDSVSLEEETD-GGGATESLEEKGRNEFAFSFRFQTY 62
           F+  +RSF   KI+E+S      D S  LEEE D GGG TESLEEK   EF F+FRFQT+
Sbjct: 21  FTFLFRSFHGEKILETS----DDDHSRPLEEEADGGGGVTESLEEKETEEFVFNFRFQTF 80

Query: 63  EEFTKSSKENFGCEKLDSSGC-SSSLSNRYEFLPEKSTSHFVEEAEIPSYTVEILNSCSN 122
           EEF+KS++ NFG +KLDSSGC SSS+SNRYEF PEKSTSHFVEEA++PS+TVE+LNSCS 
Sbjct: 81  EEFSKSNQGNFGSQKLDSSGCSSSSISNRYEFSPEKSTSHFVEEAKVPSFTVEVLNSCSK 140

Query: 123 HEILGNGNFSVREFSGKVLECEAVDQKITECSADGTEEFSGKILKLEAIEEGKSFSVEIS 182
           H  LGNGNFSVRE SGKVLE E     +T+C +DGTEEFSGKILK EA+ EG        
Sbjct: 141 HVGLGNGNFSVREVSGKVLESE-----VTDCFSDGTEEFSGKILKFEAVGEG-------- 200

Query: 183 KSQPVEEEEEITESFSNGKEESSGK--IQREEEEDDNDFLRETDFAGSDSDADVDIGGRF 242
                    EITE+  NG E+ S K   + EEEE+DN+F +      SDSD  VD+GG F
Sbjct: 201 ---------EITETSVNGGEQISEKQSEEEEEEEEDNEFRQ-----SSDSDTGVDVGGGF 260

Query: 243 LSDTDFDFDFKIGGYEPDDEINEESEKSPEE-NGNGEDSEELNGLETEWEHQELIEQLKM 302
            SD D D + + GGYEPD+EINEE EKS EE N N EDSE       EWE     E+LKM
Sbjct: 261 PSDLDLDLELETGGYEPDEEINEEPEKSREEGNENREDSE-------EWED----EELKM 320

Query: 303 ELKKVRATGLATIFEESESPKIMGELKPWKIDERFQHGDLMEELHKFYRTYRERMRKLDI 362
           E+KK R  GLATI+EESESPK+MGEL+ WKIDER ++GDLMEELHKFY+ YRERMRKLDI
Sbjct: 321 EMKKGRGRGLATIYEESESPKVMGELRAWKIDERSEYGDLMEELHKFYKAYRERMRKLDI 380

Query: 363 LNYQKMYAMGVLQSKDPLKSFSSNSKSSSP-SIISLLSHNLRLYRQKKCQVDPMEDFIRE 422
           LN+QKMYAMGVLQSKDPLKSF S+ KSS P SI SLLSH+LRLYRQKKCQVDPM+ FIRE
Sbjct: 381 LNFQKMYAMGVLQSKDPLKSFCSDRKSSWPSSITSLLSHDLRLYRQKKCQVDPMKKFIRE 440

Query: 423 VHCDLEMVYVGQTCLSWEFIQWQYEKALDLWESEPH-GLHHYNEVAGEFQQFQVILQRFL 482
           VHC+LEMVYVGQ CLSWEFIQWQYEKALDLWES+PH  LHHYN+VA +FQQFQV+LQRFL
Sbjct: 441 VHCELEMVYVGQMCLSWEFIQWQYEKALDLWESQPHTRLHHYNQVADDFQQFQVLLQRFL 500

Query: 483 ENEAFEGPRVENYVKHRCVARNLLQVPVIREDKKRERRKARRGKLDDGYEAITSDMVVEM 542
           ENEAF+GPR++NYVK+R +ARNLLQVPVI+EDK R+R+KARRGK +DGYEAITSD++VE+
Sbjct: 501 ENEAFQGPRLQNYVKNRFIARNLLQVPVIKEDKTRDRKKARRGK-EDGYEAITSDILVEI 560

Query: 543 LQESIRVIWQFIRADKDCHHNTNASLKRSKKLQVELQDPADEQLLTDIQTDLQKKEKKLK 602
           LQESIRVIWQFIRADKD H NT    KRSKK Q ELQ+PAD+QLLT+IQTDLQKKEKK+K
Sbjct: 561 LQESIRVIWQFIRADKD-HSNTT---KRSKKFQAELQNPADKQLLTEIQTDLQKKEKKVK 591

Query: 603 EIVRSGHCILKKLQKNEE 614
           +I+RSG CILKKLQK E+
Sbjct: 621 DIMRSGDCILKKLQKKEK 591

BLAST of HG10019701 vs. ExPASy TrEMBL
Match: A0A6J1BVT2 (uncharacterized protein LOC111005168 OS=Momordica charantia OX=3673 GN=LOC111005168 PE=4 SV=1)

HSP 1 Score: 731.5 bits (1887), Expect = 3.0e-207
Identity = 424/641 (66.15%), Positives = 485/641 (75.66%), Query Frame = 0

Query: 3   FSDSYRSFDEGKIVESSAGGGGGD--DSVSLEEETDGGGATESLEEKGRNEFAFSFRFQT 62
           F+  +R+ D GK ++      G         EEE   GG     +E+  N+F FSF+F++
Sbjct: 36  FTFFFRALDGGKKLDMEMENSGDSRLPEEEEEEEEPAGGRPPISDEEESNDFVFSFKFRS 95

Query: 63  YEEFTKSSKENFGCEKLDSSGCSSSLSNRYEFLPEKSTSHFVEEAEIPSYTVEILNSCSN 122
           YEEF+K+++E+FG E        +SLSNRYEFLPEKSTS FV    IPS+ VE+LNSCS+
Sbjct: 96  YEEFSKTNREDFGSEGF------NSLSNRYEFLPEKSTSFFV----IPSFKVEVLNSCSS 155

Query: 123 HEILGNGNFSVREFSGKVLECEAVDQKITECSADGTEE-FSGKILKLEAIEEGKSFSVEI 182
           ++I G G+F V EF GK+ E E++ ++ITE S  G EE FSGK+ + E  +   + + E 
Sbjct: 156 NQIPGTGDFPVEEFPGKIPESESIGEEITESSVYGGEEGFSGKVSEEEITQSSANGNEEF 215

Query: 183 SKSQPVEEEEEITESFSNGKEESSGKI------------------QREEEEDDNDFLRET 242
                   EEEITE  +NGKEE SGKI                  QR EEE++ +   E 
Sbjct: 216 PGK---VSEEEITECSANGKEEFSGKISESKPVEDEITEKQRKVEQRVEEEEEEE---EE 275

Query: 243 DFAGSDSDADVDIGGRFLSDTDFDFDFKIGGYEPDDEINEESEKSPEENGNG---EDSEE 302
           DF GSD            SD+D  FDF +GGYEPD+EINEE+E+     G G   E+S+E
Sbjct: 276 DFTGSD------------SDSDSGFDFHVGGYEPDEEINEEAER----GGGGERTEESDE 335

Query: 303 LNGLETEWEHQELIEQLKMELKKVRATGLATIFEESESPKIMGELKPWKIDERFQHGDLM 362
           LNGLETEWEHQELIEQLKMELKKVRA GL TIFEESESPKIM ELKPWKID++FQHGDLM
Sbjct: 336 LNGLETEWEHQELIEQLKMELKKVRARGLPTIFEESESPKIMEELKPWKIDDKFQHGDLM 395

Query: 363 EELHKFYRTYRERMRKLDILNYQKMYAMGVLQSKDPLKSFSSNSKSSSPSIISLLSHNLR 422
           EELHKFYRTYRERMRKLDILNYQKMYAMGVLQSKDPLKSF SNSKSSSPSI SLLS NLR
Sbjct: 396 EELHKFYRTYRERMRKLDILNYQKMYAMGVLQSKDPLKSFCSNSKSSSPSITSLLSQNLR 455

Query: 423 LYRQKKCQVDPMEDFIREVHCDLEMVYVGQTCLSWEFIQWQYEKALDLWESEPHGLHHYN 482
           LYRQKK QVDPM++FIREVHCDLEMVYVGQ CLSWEFIQWQYEKALDLWESEPHGLHHYN
Sbjct: 456 LYRQKKSQVDPMKNFIREVHCDLEMVYVGQMCLSWEFIQWQYEKALDLWESEPHGLHHYN 515

Query: 483 EVAGEFQQFQVILQRFLENEAFEGPRVENYVKHRCVARNLLQVPVIREDKKRERRKARRG 542
           EVAGEFQQFQV+LQRFLENEAFEGPRVENYVKHRCVAR+LLQVPVIREDK R+RRKARR 
Sbjct: 516 EVAGEFQQFQVLLQRFLENEAFEGPRVENYVKHRCVARHLLQVPVIREDKMRDRRKARR- 575

Query: 543 KLDDGYEAITSDMVVEMLQESIRVIWQFIRADKDCHHNTNASLKRSKKLQVELQDPADEQ 602
            +D   EAITSDM+VE+LQESIR+IWQFIR+DKD      A+LKRSKK QVELQDP DEQ
Sbjct: 576 VIDHENEAITSDMLVEILQESIRIIWQFIRSDKD-----QATLKRSKKFQVELQDPVDEQ 635

Query: 603 LLTDIQTDLQKKEKKLKEIVRSGHCILKKLQKNEENEEAEG 620
           LL +IQ DL KKE+KLKEIVRSGHCILK+LQ+NEE EE EG
Sbjct: 636 LLMEIQADLHKKERKLKEIVRSGHCILKRLQRNEE-EETEG 637

BLAST of HG10019701 vs. ExPASy TrEMBL
Match: A0A6J1EQK9 (uncharacterized protein LOC111436904 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111436904 PE=4 SV=1)

HSP 1 Score: 726.5 bits (1874), Expect = 9.5e-206
Identity = 425/622 (68.33%), Positives = 474/622 (76.21%), Query Frame = 0

Query: 3   FSDSYRSFDEGKIVESS-----AGGGGGDDSVSLEEETDGGGATESLEEKGRNEFAFSFR 62
           FS  +R  D G+ +E S       GGG DD    E+ET GGGA E  +EKG NEF FSFR
Sbjct: 41  FSFLFRPLDGGRNMEISDVVLVDDGGGCDDLQPPEKETSGGGAPEVSDEKGTNEFVFSFR 100

Query: 63  FQTYEEFTKSSKENFGCEKLDSSGCSSSLSNRYEFLPEKSTSHFVEEAEIPSYTVEILNS 122
           FQTYEEFTKS+K+N GCE+LD      SLSNRYEFLPEKSTSHFVEE EIPS+TVE+LNS
Sbjct: 101 FQTYEEFTKSNKKNLGCERLD------SLSNRYEFLPEKSTSHFVEEPEIPSFTVEVLNS 160

Query: 123 CSNHEILGNGNFSVREFSGKVLECEAVDQKITECSADGTEEFSGKILKLEAIEEGKSFSV 182
           CSN+EI   G+FSVREFSGKVL+     Q+IT      + E SG+I + E I E    + 
Sbjct: 161 CSNYEIPRTGDFSVREFSGKVLD----SQRITV-----SYEVSGEIPESEVIGE----TA 220

Query: 183 EISKSQPVEEEEEITESFSNGKEESSGKIQREEEEDDNDFLRETDFAGSDSDADVDIGGR 242
           E+S ++    EE++ E  S  K       QR+E        R TDF              
Sbjct: 221 ELSSTK--SREEQMAEKQSELK-------QRDE--------RGTDFV------------- 280

Query: 243 FLSDTDFDFDFKIGGYEPDDEINEESEKSPEENGNGEDSEELNGLETEWEHQELIEQLKM 302
               +D+D + K+GGYEPD+E NEE EK  EE    E+ EELNGLETEWEHQELIEQLKM
Sbjct: 281 ----SDYDMNLKMGGYEPDEETNEELEKWGEE----EEEEELNGLETEWEHQELIEQLKM 340

Query: 303 ELKKVRATGLATIFEESESPKIMGELKPWKIDERFQHGDLMEELHKFYRTYRERMRKLDI 362
           ELKKVRA+GL TI EESESPKIM ELKPWKIDERF+ GDLMEELH FYR+YRERMRKLDI
Sbjct: 341 ELKKVRASGLPTISEESESPKIMEELKPWKIDERFKRGDLMEELHSFYRSYRERMRKLDI 400

Query: 363 LNYQKMYAMGVLQSKDPLKSFSSNSKSSSPSIISLLSHNLRLYRQKKCQVDPMEDFIREV 422
           LNYQKMYAMGVLQSKDPLKSFSSN+KSSSPSI S+    LRLYRQKKCQVDPM+DFIREV
Sbjct: 401 LNYQKMYAMGVLQSKDPLKSFSSNTKSSSPSITSV----LRLYRQKKCQVDPMKDFIREV 460

Query: 423 HCDLEMVYVGQTCLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQFQVILQRFLEN 482
           HCDLEMVYV Q CLSWEFIQWQY KALDLWESEPHGLHHYNEVAGEFQ FQV+L+RFLEN
Sbjct: 461 HCDLEMVYVAQMCLSWEFIQWQYGKALDLWESEPHGLHHYNEVAGEFQHFQVLLERFLEN 520

Query: 483 EAFEGPRVENYVKHRCVARNLLQVPVIREDKKRERRKARRGKLDDGYEAITSDMVVEMLQ 542
           EAFEGPRVENYVK RCV RNLLQVPVIREDK R+RRKAR+ + D   EAIT+DM+VE+LQ
Sbjct: 521 EAFEGPRVENYVKQRCVVRNLLQVPVIREDKTRDRRKARQWR-DHENEAITNDMLVEILQ 580

Query: 543 ESIRVIWQFIRADKDCHHNTNASLKRSKKLQVELQDPADEQLLTDIQTDLQKKEKKLKEI 602
           ESIRVI QFIRADK   HN  A+LKR KK QVELQDPAD QLLTDIQ DLQKKE+K+KE 
Sbjct: 581 ESIRVISQFIRADK--VHNLTATLKRPKKFQVELQDPADMQLLTDIQVDLQKKERKVKEK 598

Query: 603 VRSGHCILKKLQKNEENEEAEG 620
           +RSGHCILKKL+KNEE EE EG
Sbjct: 641 MRSGHCILKKLKKNEEEEETEG 598

BLAST of HG10019701 vs. TAIR 10
Match: AT5G39785.1 (Protein of unknown function (DUF1666) )

HSP 1 Score: 342.8 bits (878), Expect = 5.7e-94
Identity = 206/464 (44.40%), Positives = 301/464 (64.87%), Query Frame = 0

Query: 185 VEEEEEITESFSNGKEESSGKIQREE------EEDDNDFLRETDFAGSDSD--------A 244
           ++E++E TE           K++ E+      ++    FL E DF  SDSD         
Sbjct: 98  LKEKQEKTEDLGYSVFHGEDKVKTEDYSVSSFKKKKIRFLTEEDFLESDSDFVDSSQTFT 157

Query: 245 DVDIGGRFLSDTDFDFDFKIGGYEPDDEINEESEKSPEENGNGEDSEE-----LNGLETE 304
             D  G FLSD+DF           +  + +   +  + +G+G DSEE      NG E+ 
Sbjct: 158 SNDEDG-FLSDSDF----------AETSLKKGQNRKSDNSGSGSDSEEEEEEDTNGFESL 217

Query: 305 WEHQELIEQLKMELKKVRAT-GLATIFEESES----PKIMGELKPWKIDE--RFQHGDLM 364
           WEHQ+LIEQLKME+KKV+A  GL TI EE E     PKIM +LKPW+I+E  +F+H D +
Sbjct: 218 WEHQDLIEQLKMEMKKVKAIGGLTTILEEEEEDDDCPKIMEDLKPWRIEEEKKFKHVDTI 277

Query: 365 EELHKFYRTYRERMRKLDILNYQKMYAMGVLQSKDPLKSFSS-NSKSSSPSIISLLSHNL 424
            E+HKF+R+YRERMRKLDIL++QK YA+G+LQSK P ++ S+  S  S  S  S+ S N+
Sbjct: 278 GEVHKFHRSYRERMRKLDILSFQKSYALGLLQSKSPQQATSTLGSNPSQTSFSSVFSVNI 337

Query: 425 RLYRQKKCQVDPMEDFIREVHCDLEMVYVGQTCLSWEFIQWQYEKALDLWESEPHGLHHY 484
           RL++ KK +++PM  F++E+  +LE VYVGQ CLSWE + WQYEKA++L ES+ +G   Y
Sbjct: 338 RLWKAKKSEIEPMVQFVKEIQGELENVYVGQMCLSWEILHWQYEKAIELLESDVYGSRRY 397

Query: 485 NEVAGEFQQFQVILQRFLENEAFEGPRVENYVKHRCVARNLLQVPVIREDKKRERRKARR 544
           NEVAGEFQQFQV+LQRFLENE FE PRV++Y+K RCV RNLLQ+PVIRED  ++++  RR
Sbjct: 398 NEVAGEFQQFQVLLQRFLENEPFEEPRVQHYIKRRCVLRNLLQIPVIREDGNKDKKNGRR 457

Query: 545 GKLDDGYE-AITSDMVVEMLQESIRVIWQFIRADK--DCHHNTNASLKRSKKLQVELQDP 604
              ++  +  I SD +VE+++E+IR+ W+F+R DK     H+  +  K   +   E +D 
Sbjct: 458 RDYEENNDGVIKSDQLVEIMEETIRLFWRFVRCDKLTSSIHDQKSRTKSQIEPDHE-EDS 517

Query: 605 ADEQLLTDIQTDLQKKEKKLKEIVRSGHCILKKLQKNEENEEAE 619
            D ++  ++++ LQ KEK+L+++++S  CI+++ QK++E +  E
Sbjct: 518 EDLEMFAEVKSQLQNKEKRLRDVLKSERCIIRRFQKHKEEDSTE 549

BLAST of HG10019701 vs. TAIR 10
Match: AT5G39785.2 (Protein of unknown function (DUF1666) )

HSP 1 Score: 336.3 bits (861), Expect = 5.3e-92
Identity = 205/465 (44.09%), Positives = 300/465 (64.52%), Query Frame = 0

Query: 185 VEEEEEITESFSNGKEESSGKIQREE------EEDDNDFLRETDFAGSDSD--------A 244
           ++E++E TE           K++ E+      ++    FL E DF  SDSD         
Sbjct: 98  LKEKQEKTEDLGYSVFHGEDKVKTEDYSVSSFKKKKIRFLTEEDFLESDSDFVDSSQTFT 157

Query: 245 DVDIGGRFLSDTDFDFDFKIGGYEPDDEINEESEKSPEENGNGEDSEE-----LNGLETE 304
             D  G FLSD+DF           +  + +   +  + +G+G DSEE      NG E+ 
Sbjct: 158 SNDEDG-FLSDSDF----------AETSLKKGQNRKSDNSGSGSDSEEEEEEDTNGFESL 217

Query: 305 WEHQELIEQLKMELKKVRAT-GLATIFEESES----PKIMGELKPWKIDE--RFQHGDLM 364
           WEHQ+LIEQLKME+KKV+A  GL TI EE E     PKIM +LKPW+I+E  +F+H D +
Sbjct: 218 WEHQDLIEQLKMEMKKVKAIGGLTTILEEEEEDDDCPKIMEDLKPWRIEEEKKFKHVDTI 277

Query: 365 EELHKFYRTYRERMRKLDILNYQKMYAMGVLQSKDPLKSFSS-NSKSSSPSIISLLSHNL 424
            E+HKF+R+YRERMRKLDIL++QK YA+G+LQSK P ++ S+  S  S  S  S+ S N+
Sbjct: 278 GEVHKFHRSYRERMRKLDILSFQKSYALGLLQSKSPQQATSTLGSNPSQTSFSSVFSVNI 337

Query: 425 RLYRQKKCQVDPMEDFIREVHCDLEMVYVGQTCLSWEFIQWQYEKALDLWESEPHGLHHY 484
           RL++ KK +++PM  F++E+  +LE VYVGQ CLSWE + WQYEKA++L ES+ +G   Y
Sbjct: 338 RLWKAKKSEIEPMVQFVKEIQGELENVYVGQMCLSWEILHWQYEKAIELLESDVYGSRRY 397

Query: 485 NEVAGEFQQFQVILQRFLENEAFEGPRVENYVKHRCVARNLLQVPVIREDKKRERRKARR 544
           NEVAGEFQQFQV+LQRFLENE FE PRV++Y+K RCV RNLLQ+PVIRED  ++++  RR
Sbjct: 398 NEVAGEFQQFQVLLQRFLENEPFEEPRVQHYIKRRCVLRNLLQIPVIREDGNKDKKNGRR 457

Query: 545 GKLDDGYE-AITSDMVVEMLQESIRVIWQFIRADK--DCHHNTNASLKRSKKLQVELQDP 604
              ++  +  I SD +VE+++E+IR+ W+F+R DK     H+  +  K   +   E +D 
Sbjct: 458 RDYEENNDGVIKSDQLVEIMEETIRLFWRFVRCDKLTSSIHDQKSRTKSQIEPDHE-EDS 517

Query: 605 ADEQLLTDIQTDLQK-KEKKLKEIVRSGHCILKKLQKNEENEEAE 619
            D ++  ++++ LQ   EK+L+++++S  CI+++ QK++E +  E
Sbjct: 518 EDLEMFAEVKSQLQNVSEKRLRDVLKSERCIIRRFQKHKEEDSTE 550

BLAST of HG10019701 vs. TAIR 10
Match: AT1G69610.1 (Protein of unknown function (DUF1666) )

HSP 1 Score: 241.9 bits (616), Expect = 1.4e-63
Identity = 196/554 (35.38%), Positives = 295/554 (53.25%), Query Frame = 0

Query: 85  SLSNRYEFLPEKSTSHFVEEAEIPSYTVEILNSCSNHEILGNGNFSVREFSGKVL--ECE 144
           S SN +   P+K TS  + E +    +    +    ++I G  +    E S   +   CE
Sbjct: 46  SFSNLFRVNPDKETSLVLFEKQNTKDSSVFGDHQDQNQICGTQDDDQMESSLVSISKRCE 105

Query: 145 AV-DQKITECSADGTEEFSGKILKLEAIEEGKSFSVEISKSQPVEEEEEITESFSNGKEE 204
              D K      +  + FS K  +  AI        E  K + V   +E T S    K E
Sbjct: 106 FYSDNKSIVGFVEEPKAFSFKFHEYSAIT-----VQEEEKKKMVSFSDEKTSSMVLEKSE 165

Query: 205 SSGKIQREEEED------------DNDFLRETDFAGSDSDADVDIGGRFLSDTDFDFDFK 264
           S G ++   +E+                + E    G D+  D D G   L+ +    +F 
Sbjct: 166 SDGFMKEHVKEEMLVYEFMSCGVLKESLVHENLVGGEDNLFDDDDGFIELNPSLQISNFA 225

Query: 265 IGGYEPDDEIN---------EESEKSPEENGNGEDSEELNGLETEWEHQELIEQLKMELK 324
            G  E ++ I+         +E E+  +E  +G DS+     + E+EH ++IE+LK EL+
Sbjct: 226 YGEEEEEEFIHREEEMKMGFDEQEEVYDEFDDGSDSD-----DDEFEHSDVIEKLKTELR 285

Query: 325 KVRATGLATIFEESESPKIMGELKPWKIDER-FQHGDLMEELHKFYRTYRERMRKLDILN 384
             R  GL TI EESE+P  + ELKP KI+ +  QH D + E+HK Y+ Y  +MRKLD+++
Sbjct: 286 TARTGGLCTILEESETP--LQELKPLKIEPKPDQHKDRIAEIHKVYKNYAVKMRKLDVID 345

Query: 385 YQKMYAMGVLQSKDPLKSFSSNSKSSSPSIISLLSHNLRLYRQKKCQVDPMEDFIREVHC 444
            Q M+++ +L+ KD  K   +  K    S    L  N+  +++   + DP E  ++E   
Sbjct: 346 SQTMHSISLLKLKDSSKPSRNTDKPPKSS----LHQNIWPFKKHTLECDPSERLVKEASR 405

Query: 445 DLEMVYVGQTCLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQFQVILQRFLENEA 504
           D E VYVGQ CLSWE ++WQY+K L+         + YN VAGEFQ FQV+LQRF+ENE 
Sbjct: 406 DFETVYVGQVCLSWEMLRWQYDKVLEF--DSQVTTYQYNLVAGEFQLFQVLLQRFVENEP 465

Query: 505 FE-GPRVENYVKHRCVARNLLQVPVIREDKKRERRKARRGKLDDGYEAITSDMVVEMLQE 564
           F+   RVE Y+K+R   +N LQ+P++R+D+  +++    G+      A+ ++M+ E+++E
Sbjct: 466 FQNSSRVETYLKNRRHFQNFLQIPLVRDDRSSKKKCRYEGEF-----AVKTEMLREIIRE 525

Query: 565 SIRVIWQFIRADKDCHHNTNASLKRSKKLQVELQDPADEQLLTDIQTDLQKKEKKLKEIV 613
           S+ V W+F+ ADKD      + +K S + QV  QD  D +LLTDI+T LQKKEKKLKEI 
Sbjct: 526 SMSVFWEFLCADKD---EFTSMMKVSHQTQVSPQDSLDLELLTDIRTHLQKKEKKLKEIQ 573

BLAST of HG10019701 vs. TAIR 10
Match: AT3G20260.1 (Protein of unknown function (DUF1666) )

HSP 1 Score: 87.0 bits (214), Expect = 5.6e-17
Identity = 106/379 (27.97%), Positives = 158/379 (41.69%), Query Frame = 0

Query: 254 EPDDEINEESEKSPEENGNGEDSEELNGLETEWEHQELI-EQLKMELKKVRATGLATIFE 313
           E +D+I  + E++  EN      +   G E E +  + I  ++K  LK++R      +  
Sbjct: 20  EKEDKILAQQEQARSENTEAGVIDSGKGDEIEDDDDDFITNEVKRRLKELRRNSFMVLIP 79

Query: 314 ESESPKIMGELKPWKIDERFQHG---------DLMEE-------LHKFYRTYRERMRKLD 373
           E E      E +   +DE    G         D++ E           Y  Y ERM   D
Sbjct: 80  EEEEE----EEEESYLDEDDDDGEDKCSSEWRDVVAEGLQWWGGFDAVYEKYCERMLFFD 139

Query: 374 ILNYQKMYAMGVLQSKDPLKSFSSNSKSSSPSIISLLSHNLRLYRQKKCQVDPMEDF--- 433
            L+ Q++   G+  +  P       S  S  S    LS   R    KK  V P ED    
Sbjct: 140 RLSSQQLKETGIGIAPSP-------STPSPRSASKKLSSPFRCLSLKKFDV-PEEDIEHL 199

Query: 434 ----IREVHCDLEMVYVGQTCLSWEFIQWQYEKALDLWESEPHGLHHYNEVAGEFQQFQV 493
               + + + DLE  YV Q CL+WE +  QY +   L   +P     YN  A  FQQF V
Sbjct: 200 QPTEVDDPYQDLETAYVAQLCLTWEALHCQYTQLSHLISCQPETPTCYNHTAQLFQQFLV 259

Query: 494 ILQRFLENEAFE-GPRVENYVKHRCVARNLLQVPVIREDKKRERRKARRGKLDDGYEAIT 553
           +LQR++ENE FE G R E Y + R     LLQ P I+   K+E  K      D G+  + 
Sbjct: 260 LLQRYIENEPFEQGSRSELYARARNAMPKLLQAPKIQGSDKKEMEK------DTGFMVLA 319

Query: 554 SDMVVEMLQESIRVIWQFIRADKDCHHNTNASLKRSKKLQVELQDPADEQLLTDIQTDLQ 608
            D+ +++++ SI     F++ DK   +             V    P     L  +Q+ + 
Sbjct: 320 DDL-IKVIESSILTFNVFLKMDKKKPNGGIHLFGNHNNNHVNSTTP-----LLLVQSSID 374

BLAST of HG10019701 vs. TAIR 10
Match: AT1G73850.1 (Protein of unknown function (DUF1666) )

HSP 1 Score: 79.3 bits (194), Expect = 1.2e-14
Identity = 122/477 (25.58%), Positives = 205/477 (42.98%), Query Frame = 0

Query: 192 TESFSNGKEESSGKIQREEEEDDNDFLRETDFAGSD--------SDADVDIGGRFLSD-- 251
           T+ F+N  +    + + E+++D N    E  F+ ++        S  + D G   L D  
Sbjct: 121 TKEFNNVYQTHEHEEEEEDDQDSNTSSTEEHFSSANVSPYRSESSIEEEDDGVEDLPDDR 180

Query: 252 -TDFDFDFKIGGYEPDDEINEESEKSPEENGNGEDSEELNGL---ETEWEHQELIE---- 311
             D D D ++GG    D + +   K P        S   +GL   +  +  +E +     
Sbjct: 181 YDDDDEDEEVGGVSRYDVVEDLVRKQPNTTSTRGPSRFQSGLVFNDKSYNAREFVSNGGI 240

Query: 312 ---------QLKMELKKVRATGLATIFEESESPKIMGE----------LKPW----KIDE 371
                        ++++++A  L    EE E  +I GE             W    K D+
Sbjct: 241 KNDVDEIQPSSGFDMREIKAEELEE-EEEEERGQIFGESCTNGSTSKSSSEWRNSVKTDD 300

Query: 372 RFQHGDLME----ELHKFYRTYRERMRKLDILNYQKMYAMGVLQSKDPLKSFSSNSKSSS 431
            F           E +  ++ Y E M  L  ++ QK      L   + LKS     +S S
Sbjct: 301 PFSTSSRRSCPKWESYTVFQKYDEEMTFLTRISAQK------LHEAESLKSIMVEPRSIS 360

Query: 432 PSIISLLSHNLRLYRQKKCQVDPMEDFIREVHCDLEMVYVGQTCLSWEFIQWQY---EKA 491
             I+  LS N   +++K+ Q           + +LE  YV Q CL+WE + W Y   E+ 
Sbjct: 361 ERIVHKLSSN--GHKKKQKQYPGSNGSRPNPYVELESAYVAQICLTWEALSWNYKNFERK 420

Query: 492 LDLWESEPHGLHHYNEVAGEFQQFQVILQRFLENEAFE-GPRVENYVKHRCVARNLLQVP 551
               +   + +     +A +F+ F ++LQR++ENE +E G R E Y + R +A  LL VP
Sbjct: 421 RSTTQRSFNDVGCPAGIADQFRTFHILLQRYVENEPYEHGRRPEIYARMRTLAPKLLLVP 480

Query: 552 VIREDKKRERRKARRGKLDDGYEA-ITSDMVVEMLQESIRVIWQFIRADKD--CHHNTNA 611
             ++ ++ E ++      ++G+ + I+S   + +++E IR    F++ADK+  C     A
Sbjct: 481 EYQDYEEEEEKEDEN---EEGFRSRISSASFLMIMEECIRTFMNFLQADKEKPCQKIIKA 540

Query: 612 SLKRSKKLQVELQDPADEQLLTDIQTDLQKKEKKLKEIVRSGHCILKKLQKNEENEE 617
              RSK+  V   DP    L+  + T   KK+ KLKE+ R G  + KK    EE  E
Sbjct: 541 FFGRSKRGPV---DPTLVHLMKKVNT---KKKTKLKEMRRGGKYMRKKKMSIEEEME 579
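
The HSP statistics lines in the alignments above follow a fixed layout, for example "Identity = 424/641 (66.15%)". A minimal sketch, assuming that layout holds for every hit, of how such a line could be parsed and its percentages recomputed from the raw counts. The function name parse_hsp_stats is hypothetical and is not part of BLAST or of this database.

```python
import re

# Matches the counts in an HSP statistics line as printed in this record.
# "Posi?tives" also tolerates the variant spelling "Postives".
_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP statistics line."""
    m = _STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, id_len, positives, pos_len = (int(g) for g in m.groups())
    return {
        "identical": identical,
        "positives": positives,
        "aligned_length": id_len,
        # Percentages are derived from the counts, not copied from the text.
        "identity_pct": round(100 * identical / id_len, 2),
        "positives_pct": round(100 * positives / pos_len, 2),
    }


if __name__ == "__main__":
    line = "Identity = 424/641 (66.15%), Positives = 485/641 (75.66%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

Recomputing the percentages rather than trusting the printed values gives a quick consistency check on each hit.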

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038905369.1  7.8e-303  89.34     uncharacterized protein LOC120091420 [Benincasa hispida]
XP_011652238.1  1.4e-272  82.75     uncharacterized protein LOC101211770 isoform X1 [Cucumis sativus] >KGN59576.1 hy...
KAA0053756.1    9.0e-259  81.95     uncharacterized protein E6C27_scaffold135G001360 [Cucumis melo var. makuwa]
XP_031739052.1  2.2e-257  82.38     uncharacterized protein LOC101211770 isoform X2 [Cucumis sativus]
XP_022934863.1  8.3e-212  69.09     uncharacterized protein LOC111441903 [Cucurbita moschata]
Match Name  E-value   Identity  Description
A0A0A0LHR8  6.9e-273  82.75     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G826700 PE=4 SV=1
A0A5A7UHN5  4.4e-259  81.95     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A6J1F8X7  4.0e-212  69.09     uncharacterized protein LOC111441903 OS=Cucurbita moschata OX=3662 GN=LOC1114419...
A0A6J1BVT2  3.0e-207  66.15     uncharacterized protein LOC111005168 OS=Momordica charantia OX=3673 GN=LOC111005...
A0A6J1EQK9  9.5e-206  68.33     uncharacterized protein LOC111436904 isoform X1 OS=Cucurbita moschata OX=3662 GN...
Match Name   E-value  Identity  Description
AT5G39785.1  5.7e-94  44.40     Protein of unknown function (DUF1666)
AT5G39785.2  5.3e-92  44.09     Protein of unknown function (DUF1666)
AT1G69610.1  1.4e-63  35.38     Protein of unknown function (DUF1666)
AT3G20260.1  5.6e-17  27.97     Protein of unknown function (DUF1666)
AT1G73850.1  1.2e-14  25.58     Protein of unknown function (DUF1666)
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                      Source       Source Term    Source Description    Alignment
None       No IPR available                     COILS        Coil           Coil                  coord: 580..600
None       No IPR available                     MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 1..44
None       No IPR available                     MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 179..234
None       No IPR available                     MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 250..280
None       No IPR available                     PANTHER      PTHR46741:SF2  OS09G0413600 PROTEIN  coord: 41..231; coord: 260..616
None       No IPR available                     PANTHER      PTHR46741      OS09G0413600 PROTEIN  coord: 41..231; coord: 260..616
IPR012870  Protein of unknown function DUF1666  PFAM         PF07891        DUF1666               coord: 420..616; e-value: 6.3E-69; score: 232.4
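
The "coord: start..end" entries above give the span of each predicted feature on the protein. A short sketch of how such an entry could be read and its length computed, assuming the coordinates are 1-based and inclusive (an assumption, since the page does not state the convention). The helper names parse_coord and span_length are hypothetical.

```python
import re


def parse_coord(text):
    """Extract (start, end) from text such as 'coord: 420..616'."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError("no coordinate range in %r" % text)
    return int(m.group(1)), int(m.group(2))


def span_length(start, end):
    """Number of residues covered, assuming 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    start, end = parse_coord("coord: 420..616")  # Pfam DUF1666 match
    print(start, end, span_length(start, end))
```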

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10019701.1  HG10019701.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane