HG10023239 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10023239
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPlant protein of unknown function (DUF247)
LocationChr05: 32442978 .. 32444720 (+)
RNA-Seq ExpressionHG10023239
SyntenyHG10023239
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGTTTCTCCAGCAAATCACGATTACATTCTCTTCCTGCTAAAAATTCTTGGGGATTATCCTCAGGCTTTGAGGAAAGATGGGTTAGTCTAATTCGTCAATCCATAGATGAAGAAGAGCTCGAGGAAGACATAGGATTTCCAGTATGCATATGCACTGTCCCTAAGTCTCTTATGGCCATTGATCCTGATTCCTATACTCCACAGGAAGTCGCAATTGGTCCATACCATCACTGGTGCCAAGAGCTGTATGTGATGGAGAGGTACAAGATTGTTGCAGCCAAAAGAGCTCAAAAACAGCTCCAAAGTCTCAAGTTTCACAATCTTGTTGAGAAATTGACCAAGTATGAGAGAAAGACCCGGGCGTACTACCACAAATACCTCAATTTCAATAGTGAAACATTTGCTTGGATGATGGCCATCGACGCCTCCTTCCTGCTGGAGGTCCTCCGAGTATATACCATCAGAGAAGTCAGCACATCCCTGGCCAACGACCCTGGTGTAAATACTTTAAAGTTGTCATGTTTGGTAGATTATGAGGGAAGGAAGTCAATAAATAATGCCATTCTGAGAGACATAATAATGCTTGAGAATCAGATGCCTTTATTTGTTTTGAGAAAGATGTTGGAACTTCAATCTTCAGCTCTGGAACCAGCAGAGCAACTGTTGCTTTCTATGTTGGTGGGACTGTATGAAGATCTTTCTCCTTTCAAGGTGATGGAAGATTTGGTGGAACTTCAAGTGTCGGTCTCGGAATGCTTTCATTTGCTTGATTTTCTGTACAGAGTGATTACTCCAAAATTGGCTGACTCATTGGAGATATCGGAAAATGATCAAAATCAAAAGGAAACCACCAAAGAAAATGTTGAAAACGCAAATGCCTTTAAGCACTTATGCAGTTGCTTGATTAGACTGGGAAGTGAAATTTGGAAGATTCTCTCAAAGTTGAACAAAGGTCCGGTACATTTATTCAGAAGAATAGTTAGTTCTAGACCCTTACAAGTGATCTTCAAACTGCCGTGGACAATTGTCTCTAAACTTCCTGGAATTGGGATTCTGATGAAGCCTCTCGAATGCCTATTTTCACTAAGAAAGGGTGAAGAAGAAAATGATCTAGAAAAGGGGAGTTCAAGGAAAGTAAAAGTTAAGCCTCCTTTGTTGGAGGAAATAACAATTCCTTCAGTGTCCGAGCTGACAAAATCAGGTGTTTGTTTCTTGCCCATCGATGGAGGCGTCTCAGCCGTTGCCTTTGACTCAAAGGCAGTGATATTTTACCTTCCCATCGTTAATCTGGATGTGAACTCTGAAGTAGTATTGAGAAACTTAGTCGCATATGAAGCTTCAAAACCATCGGGGCCTTTGGTTTTCACCCGTTTCATTGAACTAATGAACGGCATCATCGATTCTGAGGAGGATGTGAAATTGCTGAAGAAAAAAGGAATCATTCTGAACCATTTGGATAGCGATGCAGAAGTTGCCAAGCTCTGGAACGGGATGAGCAAATCCATCAAGTTGACGAAGGTGCCATTCTTGGATAAGGTAATTGAAGATGTAAATAAGCATTACAGTAGTAGATGGAAGGTTAAAGCTGCAAAATTTGTAGAAAAGTATGTGTTTGGTTCATGGCCATTGCTTGCACTTCTGGCTACCATTTTGCTCTTGGCCATGACTGCATTGCAAGCATTTTGCTCAGTTTATAGCTGCTCTCGGTTCTTCCATGATCTCAACACAGACGGGACCTAG

mRNA sequence

ATGAGTTTCTCCAGCAAATCACGATTACATTCTCTTCCTGCTAAAAATTCTTGGGGATTATCCTCAGGCTTTGAGGAAAGATGGGTTAGTCTAATTCGTCAATCCATAGATGAAGAAGAGCTCGAGGAAGACATAGGATTTCCAGTATGCATATGCACTGTCCCTAAGTCTCTTATGGCCATTGATCCTGATTCCTATACTCCACAGGAAGTCGCAATTGGTCCATACCATCACTGGTGCCAAGAGCTGTATGTGATGGAGAGGTACAAGATTGTTGCAGCCAAAAGAGCTCAAAAACAGCTCCAAAGTCTCAAGTTTCACAATCTTGTTGAGAAATTGACCAAGTATGAGAGAAAGACCCGGGCGTACTACCACAAATACCTCAATTTCAATAGTGAAACATTTGCTTGGATGATGGCCATCGACGCCTCCTTCCTGCTGGAGGTCCTCCGAGTATATACCATCAGAGAAGTCAGCACATCCCTGGCCAACGACCCTGGTGTAAATACTTTAAAGTTGTCATGTTTGGTAGATTATGAGGGAAGGAAGTCAATAAATAATGCCATTCTGAGAGACATAATAATGCTTGAGAATCAGATGCCTTTATTTGTTTTGAGAAAGATGTTGGAACTTCAATCTTCAGCTCTGGAACCAGCAGAGCAACTGTTGCTTTCTATGTTGGTGGGACTGTATGAAGATCTTTCTCCTTTCAAGGTGATGGAAGATTTGGTGGAACTTCAAGTGTCGGTCTCGGAATGCTTTCATTTGCTTGATTTTCTGTACAGAGTGATTACTCCAAAATTGGCTGACTCATTGGAGATATCGGAAAATGATCAAAATCAAAAGGAAACCACCAAAGAAAATGTTGAAAACGCAAATGCCTTTAAGCACTTATGCAGTTGCTTGATTAGACTGGGAAGTGAAATTTGGAAGATTCTCTCAAAGTTGAACAAAGGTCCGGTACATTTATTCAGAAGAATAGTTAGTTCTAGACCCTTACAAGTGATCTTCAAACTGCCGTGGACAATTGTCTCTAAACTTCCTGGAATTGGGATTCTGATGAAGCCTCTCGAATGCCTATTTTCACTAAGAAAGGGTGAAGAAGAAAATGATCTAGAAAAGGGGAGTTCAAGGAAAGTAAAAGTTAAGCCTCCTTTGTTGGAGGAAATAACAATTCCTTCAGTGTCCGAGCTGACAAAATCAGGTGTTTGTTTCTTGCCCATCGATGGAGGCGTCTCAGCCGTTGCCTTTGACTCAAAGGCAGTGATATTTTACCTTCCCATCGTTAATCTGGATGTGAACTCTGAAGTAGTATTGAGAAACTTAGTCGCATATGAAGCTTCAAAACCATCGGGGCCTTTGGTTTTCACCCGTTTCATTGAACTAATGAACGGCATCATCGATTCTGAGGAGGATGTGAAATTGCTGAAGAAAAAAGGAATCATTCTGAACCATTTGGATAGCGATGCAGAAGTTGCCAAGCTCTGGAACGGGATGAGCAAATCCATCAAGTTGACGAAGGTGCCATTCTTGGATAAGGTAATTGAAGATGTAAATAAGCATTACAGTAGTAGATGGAAGGTTAAAGCTGCAAAATTTGTAGAAAAGTATGTGTTTGGTTCATGGCCATTGCTTGCACTTCTGGCTACCATTTTGCTCTTGGCCATGACTGCATTGCAAGCATTTTGCTCAGTTTATAGCTGCTCTCGGTTCTTCCATGATCTCAACACAGACGGGACCTAG

Coding sequence (CDS)

ATGAGTTTCTCCAGCAAATCACGATTACATTCTCTTCCTGCTAAAAATTCTTGGGGATTATCCTCAGGCTTTGAGGAAAGATGGGTTAGTCTAATTCGTCAATCCATAGATGAAGAAGAGCTCGAGGAAGACATAGGATTTCCAGTATGCATATGCACTGTCCCTAAGTCTCTTATGGCCATTGATCCTGATTCCTATACTCCACAGGAAGTCGCAATTGGTCCATACCATCACTGGTGCCAAGAGCTGTATGTGATGGAGAGGTACAAGATTGTTGCAGCCAAAAGAGCTCAAAAACAGCTCCAAAGTCTCAAGTTTCACAATCTTGTTGAGAAATTGACCAAGTATGAGAGAAAGACCCGGGCGTACTACCACAAATACCTCAATTTCAATAGTGAAACATTTGCTTGGATGATGGCCATCGACGCCTCCTTCCTGCTGGAGGTCCTCCGAGTATATACCATCAGAGAAGTCAGCACATCCCTGGCCAACGACCCTGGTGTAAATACTTTAAAGTTGTCATGTTTGGTAGATTATGAGGGAAGGAAGTCAATAAATAATGCCATTCTGAGAGACATAATAATGCTTGAGAATCAGATGCCTTTATTTGTTTTGAGAAAGATGTTGGAACTTCAATCTTCAGCTCTGGAACCAGCAGAGCAACTGTTGCTTTCTATGTTGGTGGGACTGTATGAAGATCTTTCTCCTTTCAAGGTGATGGAAGATTTGGTGGAACTTCAAGTGTCGGTCTCGGAATGCTTTCATTTGCTTGATTTTCTGTACAGAGTGATTACTCCAAAATTGGCTGACTCATTGGAGATATCGGAAAATGATCAAAATCAAAAGGAAACCACCAAAGAAAATGTTGAAAACGCAAATGCCTTTAAGCACTTATGCAGTTGCTTGATTAGACTGGGAAGTGAAATTTGGAAGATTCTCTCAAAGTTGAACAAAGGTCCGGTACATTTATTCAGAAGAATAGTTAGTTCTAGACCCTTACAAGTGATCTTCAAACTGCCGTGGACAATTGTCTCTAAACTTCCTGGAATTGGGATTCTGATGAAGCCTCTCGAATGCCTATTTTCACTAAGAAAGGGTGAAGAAGAAAATGATCTAGAAAAGGGGAGTTCAAGGAAAGTAAAAGTTAAGCCTCCTTTGTTGGAGGAAATAACAATTCCTTCAGTGTCCGAGCTGACAAAATCAGGTGTTTGTTTCTTGCCCATCGATGGAGGCGTCTCAGCCGTTGCCTTTGACTCAAAGGCAGTGATATTTTACCTTCCCATCGTTAATCTGGATGTGAACTCTGAAGTAGTATTGAGAAACTTAGTCGCATATGAAGCTTCAAAACCATCGGGGCCTTTGGTTTTCACCCGTTTCATTGAACTAATGAACGGCATCATCGATTCTGAGGAGGATGTGAAATTGCTGAAGAAAAAAGGAATCATTCTGAACCATTTGGATAGCGATGCAGAAGTTGCCAAGCTCTGGAACGGGATGAGCAAATCCATCAAGTTGACGAAGGTGCCATTCTTGGATAAGGTAATTGAAGATGTAAATAAGCATTACAGTAGTAGATGGAAGGTTAAAGCTGCAAAATTTGTAGAAAAGTATGTGTTTGGTTCATGGCCATTGCTTGCACTTCTGGCTACCATTTTGCTCTTGGCCATGACTGCATTGCAAGCATTTTGCTCAGTTTATAGCTGCTCTCGGTTCTTCCATGATCTCAACACAGACGGGACCTAG

Protein sequence

MSFSSKSRLHSLPAKNSWGLSSGFEERWVSLIRQSIDEEELEEDIGFPVCICTVPKSLMAIDPDSYTPQEVAIGPYHHWCQELYVMERYKIVAAKRAQKQLQSLKFHNLVEKLTKYERKTRAYYHKYLNFNSETFAWMMAIDASFLLEVLRVYTIREVSTSLANDPGVNTLKLSCLVDYEGRKSINNAILRDIIMLENQMPLFVLRKMLELQSSALEPAEQLLLSMLVGLYEDLSPFKVMEDLVELQVSVSECFHLLDFLYRVITPKLADSLEISENDQNQKETTKENVENANAFKHLCSCLIRLGSEIWKILSKLNKGPVHLFRRIVSSRPLQVIFKLPWTIVSKLPGIGILMKPLECLFSLRKGEEENDLEKGSSRKVKVKPPLLEEITIPSVSELTKSGVCFLPIDGGVSAVAFDSKAVIFYLPIVNLDVNSEVVLRNLVAYEASKPSGPLVFTRFIELMNGIIDSEEDVKLLKKKGIILNHLDSDAEVAKLWNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKYVFGSWPLLALLATILLLAMTALQAFCSVYSCSRFFHDLNTDGT
Homology
BLAST of HG10023239 vs. NCBI nr
Match: XP_038899461.1 (LOW QUALITY PROTEIN: putative UPF0481 protein At3g02645 [Benincasa hispida])

HSP 1 Score: 947.2 bits (2447), Expect = 6.6e-272
Identity = 501/584 (85.79%), Postives = 526/584 (90.07%), Query Frame = 0

Query: 1   MSFSSKSRLHSLPAKNSWGLSSGFEERWVSLIRQSIDEEELEEDIGFPVCICTVPKSLMA 60
           MSFSSKSRLHSLPA NSWGL+SGFEERWVS IRQS DEEE EEDIG P CICTVPKSLMA
Sbjct: 1   MSFSSKSRLHSLPAGNSWGLNSGFEERWVSQIRQSFDEEEHEEDIGNPACICTVPKSLMA 60

Query: 61  IDPDSYTPQEVAIGPYHHWCQELYVMERYKIVAAKRAQKQLQSLKFHNLVEKLTKYERKT 120
             PDSYTPQEVAIGPYHHW QELYVMERYKI AAK+AQKQLQSLKFH+LVEKLTKYERKT
Sbjct: 61  -HPDSYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQLQSLKFHHLVEKLTKYERKT 120

Query: 121 RAYYHKYLNFNSETFAWMMAIDASFLLEVLRVYTI-REVSTSLANDPGVNTLKLSCL--V 180
           RAYYHKYLNFNSETFAWMMAID SFLLEVLRVYT+  E S S  +       KLSCL  V
Sbjct: 121 RAYYHKYLNFNSETFAWMMAIDVSFLLEVLRVYTVSEEKSVSRVSS------KLSCLVMV 180

Query: 181 DYEGRKSINNAILRDIIMLENQMPLFVLRKMLELQSSALEPAEQLLLSMLVGLYEDLSPF 240
           DYEGR S +NAILRDI+MLENQ+PLF+LRKMLELQSSALEPA+QLLLSML+GLYEDLSPF
Sbjct: 181 DYEGRMSAHNAILRDILMLENQIPLFLLRKMLELQSSALEPADQLLLSMLLGLYEDLSPF 240

Query: 241 KVMEDLVELQVSVSECFHLLDFLYRVITPKLADSLEISENDQNQKETTKENVENANAFKH 300
           KV ED VELQVSVSECFHL+DFLYR+ITPKL D LEI EN+QNQKE  KENVENANAFKH
Sbjct: 241 KVTEDXVELQVSVSECFHLVDFLYRMITPKLTDPLEILENNQNQKEPVKENVENANAFKH 300

Query: 301 LCSCLIRLGSEIWKILSKLNKGPVHLFRRIVSSRPLQVIFKLPWTIVSKLPGIGILMKPL 360
            C  L RLGSEIWKILSK NKGPVHLFRRIV SRPLQVIFKLPWTI+SKLPGI ILMKPL
Sbjct: 301 FCCSLSRLGSEIWKILSKFNKGPVHLFRRIVCSRPLQVIFKLPWTIISKLPGIVILMKPL 360

Query: 361 ECLFSLRKGEEENDLEKGSSRKV-KVKPPLLEEITIPSVSELTKSGVCFLPIDGGVSAVA 420
           E LFSL KGEEENDLEKGSSRKV KVK PLLEEI IPSVSELTKSGVCF PIDGGVSAVA
Sbjct: 361 EHLFSLGKGEEENDLEKGSSRKVGKVKLPLLEEIAIPSVSELTKSGVCFSPIDGGVSAVA 420

Query: 421 FDSKAVIFYLPIVNLDVNSEVVLRNLVAYEASKPSGPLVFTRFIELMNGIIDSEEDVKLL 480
           F+SKA I YL  +NLDVNSEVVLRNL AYEASKPSGPLVFTRFIEL+NGIIDSEEDV+LL
Sbjct: 421 FNSKAAILYLTTINLDVNSEVVLRNLAAYEASKPSGPLVFTRFIELVNGIIDSEEDVRLL 480

Query: 481 KKKGIILNHLDSDAEVAKLWNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEK 540
           K+KGIILNHL+SDAEVA+LWNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAK ++K
Sbjct: 481 KEKGIILNHLNSDAEVAELWNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKLLQK 540

Query: 541 YVFGSWPLLALLATILLLAMTALQAFCSVYSCSRFFHDLNTDGT 581
           YVFGSWPLLALLATILLLAMTALQAF SVYSCSRFF+  NTD T
Sbjct: 541 YVFGSWPLLALLATILLLAMTALQAFSSVYSCSRFFNHFNTDET 577

BLAST of HG10023239 vs. NCBI nr
Match: KAA0058728.1 (putative UPF0481 protein [Cucumis melo var. makuwa])

HSP 1 Score: 912.9 bits (2358), Expect = 1.4e-261
Identity = 489/583 (83.88%), Postives = 518/583 (88.85%), Query Frame = 0

Query: 1   MSFSSKSRLHSLPAKNSWGLSSGF-EERWVSLIRQSIDEEELEEDIGFPVCICTVPKSLM 60
           MSFSSKSRLHSLPA NSWGL+SGF EERWV  IRQS+DEEELEEDIG PVCI  VPKSLM
Sbjct: 1   MSFSSKSRLHSLPAGNSWGLNSGFDEERWVIQIRQSLDEEELEEDIGIPVCIFNVPKSLM 60

Query: 61  AIDPDSYTPQEVAIGPYHHWCQELYVMERYKIVAAKRAQKQLQSLKFHNLVEKLTKYERK 120
            IDPDSYTPQEVAIGPYHHW QELY MERYKI AAKRAQKQLQSLKFH+LVEKLTK+E+K
Sbjct: 61  VIDPDSYTPQEVAIGPYHHWRQELYEMERYKIAAAKRAQKQLQSLKFHDLVEKLTKHEQK 120

Query: 121 TRAYYHKYLNFNSETFAWMMAIDASFLLEVLRVYTIREVSTSLANDPGVNTLKLSCLVDY 180
           TRA YHKYLNFNSETFAWMMA+DASFLLEVLRVYT  E S S  +       KLS LVDY
Sbjct: 121 TRACYHKYLNFNSETFAWMMAVDASFLLEVLRVYTREETSISSVSS------KLSYLVDY 180

Query: 181 EGRKSINNAILRDIIMLENQMPLFVLRKMLELQSSALEPAEQLLLSMLVGLYEDLSPFKV 240
           EGRKS +NAILRDI+MLENQ+PLFVLR MLELQ SA+EPA+QLLLSML+GLYEDLSPFKV
Sbjct: 181 EGRKSAHNAILRDIVMLENQIPLFVLRMMLELQFSAVEPADQLLLSMLLGLYEDLSPFKV 240

Query: 241 MEDLVELQVSVSECFHLLDFLYRVITPKLADSLEISENDQNQKETTKENVENANAFKHLC 300
           MEDLVELQVSVSECFHLLDFLYR+ITPKL D+LE  E+DQNQ+E   E VE  + FKH C
Sbjct: 241 MEDLVELQVSVSECFHLLDFLYRMITPKLVDTLETMEDDQNQQEPAIEIVE--STFKHPC 300

Query: 301 SCLIRLGSEIWKILSKLNKGPVHLFRRIVSSRPLQVIFKLPWTIVSKLPGIGILMKPLEC 360
           S L  LGSEIWKILSKLN+GPVH F+RIV SRPLQVIFKLPWTIVS +PGIG LMKPLE 
Sbjct: 301 SPLSSLGSEIWKILSKLNEGPVHFFKRIVGSRPLQVIFKLPWTIVSNIPGIGTLMKPLEY 360

Query: 361 LFSLRKGEEENDLEK-GSSRK-VKVKPPLLEEITIPSVSELTKSGVCFLPIDGGVSAVAF 420
           +FSLRK EEE D EK GSSRK  K K PLLEEITIPSVSELTKSGV FLPI+GGVSA+AF
Sbjct: 361 IFSLRKVEEEKDPEKGGSSRKDGKTKLPLLEEITIPSVSELTKSGVRFLPINGGVSAIAF 420

Query: 421 DSKAVIFYLPIVNLDVNSEVVLRNLVAYEASKPSGPLVFTRFIELMNGIIDSEEDVKLLK 480
           DSKAVIF LPI+ LDVNSEVVLRNLVAYEASK SGPLVFTRFIELMNGIIDSEEDVKLLK
Sbjct: 421 DSKAVIFNLPIIKLDVNSEVVLRNLVAYEASKSSGPLVFTRFIELMNGIIDSEEDVKLLK 480

Query: 481 KKGIILNHLDSDAEVAKLWNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKY 540
           +KGIILNHL SDAEVA++WNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKY
Sbjct: 481 EKGIILNHLKSDAEVAEVWNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKY 540

Query: 541 VFGSWPLLALLATILLLAMTALQAFCSVYSCSRFFHDLNTDGT 581
           VFGSWPLLALLATILLLAM ALQAFCSVYSCSRFF  L+T GT
Sbjct: 541 VFGSWPLLALLATILLLAMNALQAFCSVYSCSRFFDHLSTAGT 575

BLAST of HG10023239 vs. NCBI nr
Match: XP_008461155.1 (PREDICTED: putative UPF0481 protein At3g02645 [Cucumis melo] >XP_008461156.1 PREDICTED: putative UPF0481 protein At3g02645 [Cucumis melo] >XP_016902665.1 PREDICTED: putative UPF0481 protein At3g02645 [Cucumis melo])

HSP 1 Score: 911.0 bits (2353), Expect = 5.3e-261
Identity = 488/583 (83.70%), Postives = 518/583 (88.85%), Query Frame = 0

Query: 1   MSFSSKSRLHSLPAKNSWGLSSGF-EERWVSLIRQSIDEEELEEDIGFPVCICTVPKSLM 60
           MSFSSKSRLHSLPA NSWGL+SGF EERWV  IRQS+DEEELEEDIG PVCI  VPKSLM
Sbjct: 1   MSFSSKSRLHSLPAGNSWGLNSGFDEERWVIQIRQSLDEEELEEDIGIPVCIFNVPKSLM 60

Query: 61  AIDPDSYTPQEVAIGPYHHWCQELYVMERYKIVAAKRAQKQLQSLKFHNLVEKLTKYERK 120
            IDPDSYTPQEVAIGPYHHW QELY MERYKI AAKRAQKQLQSLKFH+LVEKLTK+E+K
Sbjct: 61  VIDPDSYTPQEVAIGPYHHWRQELYEMERYKIAAAKRAQKQLQSLKFHDLVEKLTKHEQK 120

Query: 121 TRAYYHKYLNFNSETFAWMMAIDASFLLEVLRVYTIREVSTSLANDPGVNTLKLSCLVDY 180
           TRA YHKYLNFNSETFAWMMA+DASFLLEVLRVYT  E S S  +       KLS LVDY
Sbjct: 121 TRACYHKYLNFNSETFAWMMAVDASFLLEVLRVYTREETSISSVSS------KLSYLVDY 180

Query: 181 EGRKSINNAILRDIIMLENQMPLFVLRKMLELQSSALEPAEQLLLSMLVGLYEDLSPFKV 240
           EGRKS +NAILRDI+MLENQ+PLFVLR MLELQ SA+EPA+QLLLSML+GLYEDLSPFKV
Sbjct: 181 EGRKSAHNAILRDIVMLENQIPLFVLRMMLELQFSAVEPADQLLLSMLLGLYEDLSPFKV 240

Query: 241 MEDLVELQVSVSECFHLLDFLYRVITPKLADSLEISENDQNQKETTKENVENANAFKHLC 300
           MEDLVELQVSVSECFHLLDFLYR+ITPKL D+LE  E+DQNQ+E   E VE  + FKH C
Sbjct: 241 MEDLVELQVSVSECFHLLDFLYRMITPKLVDTLETMEDDQNQQEPAIEIVE--STFKHPC 300

Query: 301 SCLIRLGSEIWKILSKLNKGPVHLFRRIVSSRPLQVIFKLPWTIVSKLPGIGILMKPLEC 360
           S L  LGSEIWKILSKLN+GPVHLF+RIV SRPLQVIFKLPWTIVS +PGIG LMKPLE 
Sbjct: 301 SPLSSLGSEIWKILSKLNEGPVHLFKRIVGSRPLQVIFKLPWTIVSNIPGIGTLMKPLEY 360

Query: 361 LFSLRKGEEENDLEK-GSSRKVKV-KPPLLEEITIPSVSELTKSGVCFLPIDGGVSAVAF 420
           +FSLRK EEE D EK GSSRK  + K PLLEEITIPSVSELTKSGV FLPI+GGVSA+AF
Sbjct: 361 IFSLRKVEEEKDPEKGGSSRKDGITKLPLLEEITIPSVSELTKSGVHFLPINGGVSAIAF 420

Query: 421 DSKAVIFYLPIVNLDVNSEVVLRNLVAYEASKPSGPLVFTRFIELMNGIIDSEEDVKLLK 480
           DSKAVIF LP + LDVNSEVVLRNLVAYEASK SGPLVFTRFIELMNGIIDSEEDVKLLK
Sbjct: 421 DSKAVIFNLPTIKLDVNSEVVLRNLVAYEASKSSGPLVFTRFIELMNGIIDSEEDVKLLK 480

Query: 481 KKGIILNHLDSDAEVAKLWNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKY 540
           +KGIILNHL SDAEVA++WNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKY
Sbjct: 481 EKGIILNHLKSDAEVAEVWNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKY 540

Query: 541 VFGSWPLLALLATILLLAMTALQAFCSVYSCSRFFHDLNTDGT 581
           VFGSWPLLALLATILLLAM ALQAFCSVYSCSRFF  L+T GT
Sbjct: 541 VFGSWPLLALLATILLLAMNALQAFCSVYSCSRFFDHLSTAGT 575

BLAST of HG10023239 vs. NCBI nr
Match: XP_011659516.1 (putative UPF0481 protein At3g02645 [Cucumis sativus] >KAE8646429.1 hypothetical protein Csa_015865 [Cucumis sativus])

HSP 1 Score: 909.1 bits (2348), Expect = 2.0e-260
Identity = 485/583 (83.19%), Postives = 517/583 (88.68%), Query Frame = 0

Query: 1   MSFSSKSRLHSLPAKNSWGLSSGF-EERWVSLIRQSIDEEELEEDIGFPVCICTVPKSLM 60
           M FSSKSRLHSLPA NSWGL+S F EERWV  IRQS+DEEELEED G PVCI  VPKSLM
Sbjct: 1   MRFSSKSRLHSLPAGNSWGLNSDFDEERWVIQIRQSLDEEELEEDTGIPVCIFNVPKSLM 60

Query: 61  AIDPDSYTPQEVAIGPYHHWCQELYVMERYKIVAAKRAQKQLQSLKFHNLVEKLTKYERK 120
            IDPDSY PQEVAIGPYHHW QELY MERYKI AAKRAQKQLQSLKFH+LVEKLTK+E+K
Sbjct: 61  VIDPDSYIPQEVAIGPYHHWRQELYEMERYKIAAAKRAQKQLQSLKFHDLVEKLTKHEQK 120

Query: 121 TRAYYHKYLNFNSETFAWMMAIDASFLLEVLRVYTIREVSTSLANDPGVNTLKLSCLVDY 180
           TRA YHKYLNFNSETFAWMMA+DASFLLEVLRVYT  E S S  +       KLS LVDY
Sbjct: 121 TRACYHKYLNFNSETFAWMMAVDASFLLEVLRVYTREETSISSVSS------KLSYLVDY 180

Query: 181 EGRKSINNAILRDIIMLENQMPLFVLRKMLELQSSALEPAEQLLLSMLVGLYEDLSPFKV 240
           EGRKS +NAILRDI+MLENQ+PLFVLRKMLELQ SA+EPA+QLLLSML+GLYE LSPFKV
Sbjct: 181 EGRKSAHNAILRDIVMLENQIPLFVLRKMLELQFSAVEPADQLLLSMLLGLYEHLSPFKV 240

Query: 241 MEDLVELQVSVSECFHLLDFLYRVITPKLADSLEISENDQNQKETTKENVENANAFKHLC 300
           MEDLVELQVSVSECFHLLDFLYR+ITPKLAD+LE  E+DQNQ+E   E VE  + FKH C
Sbjct: 241 MEDLVELQVSVSECFHLLDFLYRIITPKLADTLETMEDDQNQQEPAIEIVE--STFKHPC 300

Query: 301 SCLIRLGSEIWKILSKLNKGPVHLFRRIVSSRPLQVIFKLPWTIVSKLPGIGILMKPLEC 360
           S L  LGSEIWKILSKLNKGPVHLF+RI  SRPL VIFKLPWTIVS +PGIGILMKPLE 
Sbjct: 301 SPLSSLGSEIWKILSKLNKGPVHLFKRIAGSRPLLVIFKLPWTIVSNIPGIGILMKPLEY 360

Query: 361 LFSLRKGEEENDLEK-GSSRK-VKVKPPLLEEITIPSVSELTKSGVCFLPIDGGVSAVAF 420
           +FSL+KGEEEND EK GSSRK  K++ PLLEEITIPSVSELTKSGV FLPI GGVSA+AF
Sbjct: 361 IFSLKKGEEENDPEKGGSSRKDGKIRLPLLEEITIPSVSELTKSGVLFLPIGGGVSAIAF 420

Query: 421 DSKAVIFYLPIVNLDVNSEVVLRNLVAYEASKPSGPLVFTRFIELMNGIIDSEEDVKLLK 480
           DSKAVIF LP + LDVNSEVVLRNLVAYEASK SGPLVFTRFIELMNGIIDSEEDVKLL+
Sbjct: 421 DSKAVIFNLPTIKLDVNSEVVLRNLVAYEASKSSGPLVFTRFIELMNGIIDSEEDVKLLR 480

Query: 481 KKGIILNHLDSDAEVAKLWNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKY 540
           +KGIILNHL SDAEVA+LWNGMSKSIKLTKVPFLDKVIEDVNKHYS+RWKVKAAKFVEKY
Sbjct: 481 EKGIILNHLKSDAEVAELWNGMSKSIKLTKVPFLDKVIEDVNKHYSNRWKVKAAKFVEKY 540

Query: 541 VFGSWPLLALLATILLLAMTALQAFCSVYSCSRFFHDLNTDGT 581
           VFGSWPLLALLATILLLAM ALQAFCSVYSCSRFF+ L+T GT
Sbjct: 541 VFGSWPLLALLATILLLAMNALQAFCSVYSCSRFFNHLSTAGT 575

BLAST of HG10023239 vs. NCBI nr
Match: XP_004136095.1 (putative UPF0481 protein At3g02645 [Cucumis sativus] >XP_011659517.1 putative UPF0481 protein At3g02645 [Cucumis sativus] >XP_031745026.1 putative UPF0481 protein At3g02645 [Cucumis sativus] >KGN45239.1 hypothetical protein Csa_015617 [Cucumis sativus])

HSP 1 Score: 902.1 bits (2330), Expect = 2.4e-258
Identity = 481/583 (82.50%), Postives = 514/583 (88.16%), Query Frame = 0

Query: 1   MSFSSKSRLHSLPAKNSWGLSSGFEERWVSLIRQSIDEEELEEDIGFPVCICTVPKSLMA 60
           MS SSKSRLHSLPA + WGL+  +EE WV+ IRQS+DEEELEEDIG P CICTVP+SLMA
Sbjct: 1   MSLSSKSRLHSLPAGHYWGLNLSYEEGWVNQIRQSMDEEELEEDIGHPACICTVPRSLMA 60

Query: 61  IDPDSYTPQEVAIGPYHHWCQELYVMERYKIVAAKRAQKQLQSLKFHNLVEKLTKYERKT 120
           IDPDSYTPQEVAIGPYHHW QELYVMERYKI AA++AQKQLQSLKFHNLVEKL KYERKT
Sbjct: 61  IDPDSYTPQEVAIGPYHHWRQELYVMERYKIAAARKAQKQLQSLKFHNLVEKLAKYERKT 120

Query: 121 RAYYHKYLNFNSETFAWMMAIDASFLLEVLRVYTIREVSTSLANDPGVNTLKLSCL--VD 180
           RA+YHKYLNFNSETFAWMMAIDASFLLEVL+VYTIRE   S++      + KLSCL  VD
Sbjct: 121 RAFYHKYLNFNSETFAWMMAIDASFLLEVLQVYTIRE-EKSISR----VSSKLSCLVVVD 180

Query: 181 YEGRKSINNAILRDIIMLENQMPLFVLRKMLELQSSALEPAEQLLLSMLVGLYEDLSPFK 240
            EGR+S  N ILRDI+MLENQ+PLFVLRKMLELQS ALE   QLLLSML+GL EDLSPF+
Sbjct: 181 NEGRRSAQNTILRDIVMLENQIPLFVLRKMLELQSPALEQTNQLLLSMLLGLCEDLSPFE 240

Query: 241 VMEDLVELQVSVSECFHLLDFLYRVITPKLADSLEISENDQNQKETTKENVENANAFKHL 300
           ++E     QVSVSECFHLLDFLYR+ITPKLAD LEI ENDQNQKE+TKEN E+ NAFKH 
Sbjct: 241 MLEP----QVSVSECFHLLDFLYRMITPKLADPLEILENDQNQKESTKENFEDENAFKHF 300

Query: 301 CSCLIRLGSEIWKILSKLNKGPVHLFRRIVSSRPLQVIFKLPWTIVSKLPGIGILMKPLE 360
           C  L RLGSEIWKILSK NKGPVHLFRRI++SRPLQVIFKLP TIVSKLPGI +LMKPL 
Sbjct: 301 CCSLSRLGSEIWKILSKFNKGPVHLFRRILNSRPLQVIFKLP-TIVSKLPGIVVLMKPLN 360

Query: 361 CLFSLRKGEEENDLEKGSSRKV-KVKPPLLEEITIPSVSELTKSGVCFLPIDGGVSAVAF 420
            L SLRKGEEENDLEKGSS KV K+K PL EEI IPSVS+LTKSGV F  IDGGVSAVAF
Sbjct: 361 HLCSLRKGEEENDLEKGSSWKVGKIKLPLSEEIAIPSVSQLTKSGVHFSAIDGGVSAVAF 420

Query: 421 DSKAVIFYLPIVNLDVNSEVVLRNLVAYEASKPSGPLVFTRFIELMNGIIDSEEDVKLLK 480
           D KAVIFYLP +NLDVNSEVVLRNLVAYEASK SGPLVFTRFIELMNGIIDSEEDVKLLK
Sbjct: 421 DPKAVIFYLPTINLDVNSEVVLRNLVAYEASKASGPLVFTRFIELMNGIIDSEEDVKLLK 480

Query: 481 KKGIILNHLDSDAEVAKLWNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKY 540
           +KGIILNHL SDAEVA LWNGMSKSIKLTKVPFLDKVIEDVNK+YS RWKVKA+KFV+KY
Sbjct: 481 EKGIILNHLKSDAEVADLWNGMSKSIKLTKVPFLDKVIEDVNKYYSGRWKVKASKFVKKY 540

Query: 541 VFGSWPLLALLATILLLAMTALQAFCSVYSCSRFFHDLNTDGT 581
           VFGSWPLLA LATILLLAMTALQAFCSVYSCSRFFH LN DGT
Sbjct: 541 VFGSWPLLAFLATILLLAMTALQAFCSVYSCSRFFHHLNADGT 573

BLAST of HG10023239 vs. ExPASy Swiss-Prot
Match: P0C897 (Putative UPF0481 protein At3g02645 OS=Arabidopsis thaliana OX=3702 GN=At3g02645 PE=3 SV=1)

HSP 1 Score: 463.8 bits (1192), Expect = 2.9e-129
Identity = 263/546 (48.17%), Postives = 355/546 (65.02%), Query Frame = 0

Query: 25  EERWVSLIRQSIDEEELEEDI-GFPVCICTVPKSLMAIDPDSYTPQEVAIGPYHHWCQEL 84
           E RWV  +++S+D E  E D+    V I  VPK+LM   PDSYTP  V+IGPYH    EL
Sbjct: 18  ETRWVINVQKSLDAELEEHDLEEVTVSIFNVPKALMCSHPDSYTPHRVSIGPYHCLKPEL 77

Query: 85  YVMERYKIVAAKRAQKQLQSLKFHNLVEKLTKYERKTRAYYHKYLNFNSETFAWMMAIDA 144
           + MERYK++ A++ + Q  S +FH+LVEKL   E K RA YHKY+ FN ET  W+MA+D+
Sbjct: 78  HEMERYKLMIARKIRNQYNSFRFHDLVEKLQSMEIKIRACYHKYIGFNGETLLWIMAVDS 137

Query: 145 SFLLEVLRVYTIREVSTSLANDPGVNTLKLSCLVDYEGRKSINNAILRDIIMLENQMPLF 204
           SFL+E L++Y+ R+V T L N  G                  +N ILRDI+M+ENQ+PLF
Sbjct: 138 SFLIEFLKIYSFRKVET-LINRVG------------------HNEILRDIMMIENQIPLF 197

Query: 205 VLRKMLELQSSALEPAEQLLLSMLVGLYEDLSPFKV-MEDLVELQVSVSECFHLLDFLYR 264
           VLRK LE Q  + E A+ LLLS+L GL +DLSP  +  +D   L+    EC H+LDFLY+
Sbjct: 198 VLRKTLEFQLESTESADDLLLSVLTGLCKDLSPLVIKFDDDQILKAQFQECNHILDFLYQ 257

Query: 265 VITPKLADSLEISENDQNQKETTKENVENANAFKHLCSCLIRLGSEIWKILSKLNKGPVH 324
           +I P++ +  E+ E+D+  +    EN  N           IR   EI            H
Sbjct: 258 MIVPRIEEE-ELEEDDEENR--ADENGGNR---------AIRFMDEI-----------KH 317

Query: 325 LFRRIVSSRPLQVIFKLPWTIVSKLPGIGILMKPLECLFSLRKGEEENDLEKGSSRKVKV 384
            F+R+ +SRP  +I + PW I+S LPG   L    + LF+ ++ E     ++  S     
Sbjct: 318 QFKRVFASRPADLILRFPWRIISNLPGFMALKLSADYLFTRQENEATTTRQESVSILDIE 377

Query: 385 KPPLLEEITIPSVSELTKSGVCFLP-IDGGVSAVAFDSKAVIFYLPIVNLDVNSEVVLRN 444
           KPPL+EE+TIPSVS+L K+GV F P   G +S V FDS +  FYLP++NLD+N+E VLRN
Sbjct: 378 KPPLVEELTIPSVSDLHKAGVRFKPTAHGNISTVTFDSNSGQFYLPVINLDINTETVLRN 437

Query: 445 LVAYEASKPSGPLVFTRFIELMNGIIDSEEDVKLLKKKGIILNHLDSDAEVAKLWNGMSK 504
           LVAYEA+  SGPLVFTR+ EL+NGIIDSEEDV+LL+++G++++ L SD E A++WNGMSK
Sbjct: 438 LVAYEATNTSGPLVFTRYTELINGIIDSEEDVRLLREQGVLVSRLKSDQEAAEMWNGMSK 497

Query: 505 SIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKYVFGSWPLLALLATILLLAMTALQA 564
           S++LTKV FLDK IEDVN++Y+ RWKVK  + VE YV+GSW +LA LA +LLL + +LQ 
Sbjct: 498 SVRLTKVGFLDKTIEDVNRYYTGRWKVKIGRLVEVYVYGSWQILAFLAAVLLLMLVSLQL 521

Query: 565 FCSVYS 568
           F  V+S
Sbjct: 558 FSLVFS 521

BLAST of HG10023239 vs. ExPASy Swiss-Prot
Match: Q9SD53 (UPF0481 protein At3g47200 OS=Arabidopsis thaliana OX=3702 GN=At3g47200 PE=2 SV=1)

HSP 1 Score: 60.5 bits (145), Expect = 7.4e-08
Identity = 58/232 (25.00%), Postives = 110/232 (47.41%), Query Frame = 0

Query: 50  CICTVPKSLMAIDPDSYTPQEVAIGPYHHWCQELYVMERYK--IVAAKRAQKQLQSLKFH 109
           CI  VP+S +A++P +Y P+ V+IGPYH+  + L +++++K  ++     + + + ++ +
Sbjct: 47  CIFRVPESFVALNPKAYKPKVVSIGPYHYGEKHLQMIQQHKPRLLQLFLDEAKKKDVEEN 106

Query: 110 NLVEKLTKYERKTRAYYHKYLNFNSETFAWMMAIDASFLLEVLRVYTIREVSTSLANDPG 169
            LV+ +   E K R  Y + L        +MM +D  F   +L V+ I   +  L+ DP 
Sbjct: 107 VLVKAVVDLEDKIRKSYSEELK-TGHDLMFMMVLDGCF---ILMVFLIMSGNIELSEDP- 166

Query: 170 VNTLKLSCLVDYEGRKSINNAILRDIIMLENQMPLFVLRKMLELQSSALEPAEQLLLSML 229
                +  L+         ++I  D+++LENQ+P FVL              + L +   
Sbjct: 167 --IFSIPWLL---------SSIQSDLLLLENQVPFFVL--------------QTLYVGSK 226

Query: 230 VGLYEDLS--PFKVMEDLVELQVSVSE------CFHLLDFLYRVITPKLADS 272
           +G+  DL+   F   ++ ++ + S  E        HLLD +     P  ++S
Sbjct: 227 IGVSSDLNRIAFHFFKNPIDKEGSYWEKHRNYKAKHLLDLIRETFLPNTSES 248

BLAST of HG10023239 vs. ExPASy TrEMBL
Match: A0A5A7UU59 (Putative UPF0481 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold339G001980 PE=4 SV=1)

HSP 1 Score: 912.9 bits (2358), Expect = 6.7e-262
Identity = 489/583 (83.88%), Postives = 518/583 (88.85%), Query Frame = 0

Query: 1   MSFSSKSRLHSLPAKNSWGLSSGF-EERWVSLIRQSIDEEELEEDIGFPVCICTVPKSLM 60
           MSFSSKSRLHSLPA NSWGL+SGF EERWV  IRQS+DEEELEEDIG PVCI  VPKSLM
Sbjct: 1   MSFSSKSRLHSLPAGNSWGLNSGFDEERWVIQIRQSLDEEELEEDIGIPVCIFNVPKSLM 60

Query: 61  AIDPDSYTPQEVAIGPYHHWCQELYVMERYKIVAAKRAQKQLQSLKFHNLVEKLTKYERK 120
            IDPDSYTPQEVAIGPYHHW QELY MERYKI AAKRAQKQLQSLKFH+LVEKLTK+E+K
Sbjct: 61  VIDPDSYTPQEVAIGPYHHWRQELYEMERYKIAAAKRAQKQLQSLKFHDLVEKLTKHEQK 120

Query: 121 TRAYYHKYLNFNSETFAWMMAIDASFLLEVLRVYTIREVSTSLANDPGVNTLKLSCLVDY 180
           TRA YHKYLNFNSETFAWMMA+DASFLLEVLRVYT  E S S  +       KLS LVDY
Sbjct: 121 TRACYHKYLNFNSETFAWMMAVDASFLLEVLRVYTREETSISSVSS------KLSYLVDY 180

Query: 181 EGRKSINNAILRDIIMLENQMPLFVLRKMLELQSSALEPAEQLLLSMLVGLYEDLSPFKV 240
           EGRKS +NAILRDI+MLENQ+PLFVLR MLELQ SA+EPA+QLLLSML+GLYEDLSPFKV
Sbjct: 181 EGRKSAHNAILRDIVMLENQIPLFVLRMMLELQFSAVEPADQLLLSMLLGLYEDLSPFKV 240

Query: 241 MEDLVELQVSVSECFHLLDFLYRVITPKLADSLEISENDQNQKETTKENVENANAFKHLC 300
           MEDLVELQVSVSECFHLLDFLYR+ITPKL D+LE  E+DQNQ+E   E VE  + FKH C
Sbjct: 241 MEDLVELQVSVSECFHLLDFLYRMITPKLVDTLETMEDDQNQQEPAIEIVE--STFKHPC 300

Query: 301 SCLIRLGSEIWKILSKLNKGPVHLFRRIVSSRPLQVIFKLPWTIVSKLPGIGILMKPLEC 360
           S L  LGSEIWKILSKLN+GPVH F+RIV SRPLQVIFKLPWTIVS +PGIG LMKPLE 
Sbjct: 301 SPLSSLGSEIWKILSKLNEGPVHFFKRIVGSRPLQVIFKLPWTIVSNIPGIGTLMKPLEY 360

Query: 361 LFSLRKGEEENDLEK-GSSRK-VKVKPPLLEEITIPSVSELTKSGVCFLPIDGGVSAVAF 420
           +FSLRK EEE D EK GSSRK  K K PLLEEITIPSVSELTKSGV FLPI+GGVSA+AF
Sbjct: 361 IFSLRKVEEEKDPEKGGSSRKDGKTKLPLLEEITIPSVSELTKSGVRFLPINGGVSAIAF 420

Query: 421 DSKAVIFYLPIVNLDVNSEVVLRNLVAYEASKPSGPLVFTRFIELMNGIIDSEEDVKLLK 480
           DSKAVIF LPI+ LDVNSEVVLRNLVAYEASK SGPLVFTRFIELMNGIIDSEEDVKLLK
Sbjct: 421 DSKAVIFNLPIIKLDVNSEVVLRNLVAYEASKSSGPLVFTRFIELMNGIIDSEEDVKLLK 480

Query: 481 KKGIILNHLDSDAEVAKLWNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKY 540
           +KGIILNHL SDAEVA++WNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKY
Sbjct: 481 EKGIILNHLKSDAEVAEVWNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKY 540

Query: 541 VFGSWPLLALLATILLLAMTALQAFCSVYSCSRFFHDLNTDGT 581
           VFGSWPLLALLATILLLAM ALQAFCSVYSCSRFF  L+T GT
Sbjct: 541 VFGSWPLLALLATILLLAMNALQAFCSVYSCSRFFDHLSTAGT 575

BLAST of HG10023239 vs. ExPASy TrEMBL
Match: A0A1S3CDL4 (putative UPF0481 protein At3g02645 OS=Cucumis melo OX=3656 GN=LOC103499824 PE=4 SV=1)

HSP 1 Score: 911.0 bits (2353), Expect = 2.6e-261
Identity = 488/583 (83.70%), Postives = 518/583 (88.85%), Query Frame = 0

Query: 1   MSFSSKSRLHSLPAKNSWGLSSGF-EERWVSLIRQSIDEEELEEDIGFPVCICTVPKSLM 60
           MSFSSKSRLHSLPA NSWGL+SGF EERWV  IRQS+DEEELEEDIG PVCI  VPKSLM
Sbjct: 1   MSFSSKSRLHSLPAGNSWGLNSGFDEERWVIQIRQSLDEEELEEDIGIPVCIFNVPKSLM 60

Query: 61  AIDPDSYTPQEVAIGPYHHWCQELYVMERYKIVAAKRAQKQLQSLKFHNLVEKLTKYERK 120
            IDPDSYTPQEVAIGPYHHW QELY MERYKI AAKRAQKQLQSLKFH+LVEKLTK+E+K
Sbjct: 61  VIDPDSYTPQEVAIGPYHHWRQELYEMERYKIAAAKRAQKQLQSLKFHDLVEKLTKHEQK 120

Query: 121 TRAYYHKYLNFNSETFAWMMAIDASFLLEVLRVYTIREVSTSLANDPGVNTLKLSCLVDY 180
           TRA YHKYLNFNSETFAWMMA+DASFLLEVLRVYT  E S S  +       KLS LVDY
Sbjct: 121 TRACYHKYLNFNSETFAWMMAVDASFLLEVLRVYTREETSISSVSS------KLSYLVDY 180

Query: 181 EGRKSINNAILRDIIMLENQMPLFVLRKMLELQSSALEPAEQLLLSMLVGLYEDLSPFKV 240
           EGRKS +NAILRDI+MLENQ+PLFVLR MLELQ SA+EPA+QLLLSML+GLYEDLSPFKV
Sbjct: 181 EGRKSAHNAILRDIVMLENQIPLFVLRMMLELQFSAVEPADQLLLSMLLGLYEDLSPFKV 240

Query: 241 MEDLVELQVSVSECFHLLDFLYRVITPKLADSLEISENDQNQKETTKENVENANAFKHLC 300
           MEDLVELQVSVSECFHLLDFLYR+ITPKL D+LE  E+DQNQ+E   E VE  + FKH C
Sbjct: 241 MEDLVELQVSVSECFHLLDFLYRMITPKLVDTLETMEDDQNQQEPAIEIVE--STFKHPC 300

Query: 301 SCLIRLGSEIWKILSKLNKGPVHLFRRIVSSRPLQVIFKLPWTIVSKLPGIGILMKPLEC 360
           S L  LGSEIWKILSKLN+GPVHLF+RIV SRPLQVIFKLPWTIVS +PGIG LMKPLE 
Sbjct: 301 SPLSSLGSEIWKILSKLNEGPVHLFKRIVGSRPLQVIFKLPWTIVSNIPGIGTLMKPLEY 360

Query: 361 LFSLRKGEEENDLEK-GSSRKVKV-KPPLLEEITIPSVSELTKSGVCFLPIDGGVSAVAF 420
           +FSLRK EEE D EK GSSRK  + K PLLEEITIPSVSELTKSGV FLPI+GGVSA+AF
Sbjct: 361 IFSLRKVEEEKDPEKGGSSRKDGITKLPLLEEITIPSVSELTKSGVHFLPINGGVSAIAF 420

Query: 421 DSKAVIFYLPIVNLDVNSEVVLRNLVAYEASKPSGPLVFTRFIELMNGIIDSEEDVKLLK 480
           DSKAVIF LP + LDVNSEVVLRNLVAYEASK SGPLVFTRFIELMNGIIDSEEDVKLLK
Sbjct: 421 DSKAVIFNLPTIKLDVNSEVVLRNLVAYEASKSSGPLVFTRFIELMNGIIDSEEDVKLLK 480

Query: 481 KKGIILNHLDSDAEVAKLWNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKY 540
           +KGIILNHL SDAEVA++WNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKY
Sbjct: 481 EKGIILNHLKSDAEVAEVWNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKY 540

Query: 541 VFGSWPLLALLATILLLAMTALQAFCSVYSCSRFFHDLNTDGT 581
           VFGSWPLLALLATILLLAM ALQAFCSVYSCSRFF  L+T GT
Sbjct: 541 VFGSWPLLALLATILLLAMNALQAFCSVYSCSRFFDHLSTAGT 575

BLAST of HG10023239 vs. ExPASy TrEMBL
Match: A0A0A0K8N9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G432200 PE=4 SV=1)

HSP 1 Score: 902.1 bits (2330), Expect = 1.2e-258
Identity = 481/583 (82.50%), Postives = 514/583 (88.16%), Query Frame = 0

Query: 1   MSFSSKSRLHSLPAKNSWGLSSGFEERWVSLIRQSIDEEELEEDIGFPVCICTVPKSLMA 60
           MS SSKSRLHSLPA + WGL+  +EE WV+ IRQS+DEEELEEDIG P CICTVP+SLMA
Sbjct: 1   MSLSSKSRLHSLPAGHYWGLNLSYEEGWVNQIRQSMDEEELEEDIGHPACICTVPRSLMA 60

Query: 61  IDPDSYTPQEVAIGPYHHWCQELYVMERYKIVAAKRAQKQLQSLKFHNLVEKLTKYERKT 120
           IDPDSYTPQEVAIGPYHHW QELYVMERYKI AA++AQKQLQSLKFHNLVEKL KYERKT
Sbjct: 61  IDPDSYTPQEVAIGPYHHWRQELYVMERYKIAAARKAQKQLQSLKFHNLVEKLAKYERKT 120

Query: 121 RAYYHKYLNFNSETFAWMMAIDASFLLEVLRVYTIREVSTSLANDPGVNTLKLSCL--VD 180
           RA+YHKYLNFNSETFAWMMAIDASFLLEVL+VYTIRE   S++      + KLSCL  VD
Sbjct: 121 RAFYHKYLNFNSETFAWMMAIDASFLLEVLQVYTIRE-EKSISR----VSSKLSCLVVVD 180

Query: 181 YEGRKSINNAILRDIIMLENQMPLFVLRKMLELQSSALEPAEQLLLSMLVGLYEDLSPFK 240
            EGR+S  N ILRDI+MLENQ+PLFVLRKMLELQS ALE   QLLLSML+GL EDLSPF+
Sbjct: 181 NEGRRSAQNTILRDIVMLENQIPLFVLRKMLELQSPALEQTNQLLLSMLLGLCEDLSPFE 240

Query: 241 VMEDLVELQVSVSECFHLLDFLYRVITPKLADSLEISENDQNQKETTKENVENANAFKHL 300
           ++E     QVSVSECFHLLDFLYR+ITPKLAD LEI ENDQNQKE+TKEN E+ NAFKH 
Sbjct: 241 MLEP----QVSVSECFHLLDFLYRMITPKLADPLEILENDQNQKESTKENFEDENAFKHF 300

Query: 301 CSCLIRLGSEIWKILSKLNKGPVHLFRRIVSSRPLQVIFKLPWTIVSKLPGIGILMKPLE 360
           C  L RLGSEIWKILSK NKGPVHLFRRI++SRPLQVIFKLP TIVSKLPGI +LMKPL 
Sbjct: 301 CCSLSRLGSEIWKILSKFNKGPVHLFRRILNSRPLQVIFKLP-TIVSKLPGIVVLMKPLN 360

Query: 361 CLFSLRKGEEENDLEKGSSRKV-KVKPPLLEEITIPSVSELTKSGVCFLPIDGGVSAVAF 420
            L SLRKGEEENDLEKGSS KV K+K PL EEI IPSVS+LTKSGV F  IDGGVSAVAF
Sbjct: 361 HLCSLRKGEEENDLEKGSSWKVGKIKLPLSEEIAIPSVSQLTKSGVHFSAIDGGVSAVAF 420

Query: 421 DSKAVIFYLPIVNLDVNSEVVLRNLVAYEASKPSGPLVFTRFIELMNGIIDSEEDVKLLK 480
           D KAVIFYLP +NLDVNSEVVLRNLVAYEASK SGPLVFTRFIELMNGIIDSEEDVKLLK
Sbjct: 421 DPKAVIFYLPTINLDVNSEVVLRNLVAYEASKASGPLVFTRFIELMNGIIDSEEDVKLLK 480

Query: 481 KKGIILNHLDSDAEVAKLWNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKY 540
           +KGIILNHL SDAEVA LWNGMSKSIKLTKVPFLDKVIEDVNK+YS RWKVKA+KFV+KY
Sbjct: 481 EKGIILNHLKSDAEVADLWNGMSKSIKLTKVPFLDKVIEDVNKYYSGRWKVKASKFVKKY 540

Query: 541 VFGSWPLLALLATILLLAMTALQAFCSVYSCSRFFHDLNTDGT 581
           VFGSWPLLA LATILLLAMTALQAFCSVYSCSRFFH LN DGT
Sbjct: 541 VFGSWPLLAFLATILLLAMTALQAFCSVYSCSRFFHHLNADGT 573

BLAST of HG10023239 vs. ExPASy TrEMBL
Match: A0A5A7UUK5 (Putative UPF0481 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold339G002000 PE=4 SV=1)

HSP 1 Score: 895.6 bits (2313), Expect = 1.1e-256
Identity = 474/583 (81.30%), Postives = 506/583 (86.79%), Query Frame = 0

Query: 1   MSFSSKSRLHSLPAKNSWGLSSGFEERWVSLIRQSIDEEELEEDIGFPVCICTVPKSLMA 60
           M  SSKSRLHSLPA + WGL+  +EE WV+ IRQSIDEEELEEDIG P CICTVPKSLM 
Sbjct: 1   MRLSSKSRLHSLPAGHYWGLNLSYEEGWVNQIRQSIDEEELEEDIGHPACICTVPKSLMV 60

Query: 61  IDPDSYTPQEVAIGPYHHWCQELYVMERYKIVAAKRAQKQLQSLKFHNLVEKLTKYERKT 120
            DPDSYTPQEVAIGPYHHW QELYVMERYKI AAK+ QKQLQSLKFHNLVEKL KYERK 
Sbjct: 61  FDPDSYTPQEVAIGPYHHWRQELYVMERYKIAAAKKVQKQLQSLKFHNLVEKLAKYERKI 120

Query: 121 RAYYHKYLNFNSETFAWMMAIDASFLLEVLRVYTIREVSTSLANDPGVNTLKLSCL--VD 180
           RAYYHKYLNFNSETF WMMAIDASFLLEVL+VYTIRE   S++      + KLSCL  VD
Sbjct: 121 RAYYHKYLNFNSETFVWMMAIDASFLLEVLQVYTIRE-EKSISR----ISSKLSCLVVVD 180

Query: 181 YEGRKSINNAILRDIIMLENQMPLFVLRKMLELQSSALEPAEQLLLSMLVGLYEDLSPFK 240
            EGR+S  N ILRDI+MLENQ+PLFVLRKML+LQS ALE  +QLLLSML+GLYEDLSPF+
Sbjct: 181 NEGRRSAQNTILRDIVMLENQIPLFVLRKMLKLQSPALEQTDQLLLSMLLGLYEDLSPFE 240

Query: 241 VMEDLVELQVSVSECFHLLDFLYRVITPKLADSLEISENDQNQKETTKENVENANAFKHL 300
           ++E     QVSVSECFHLLDFLYR+ITPKLA  LEI END+NQKE+TKEN E+ NAFKH 
Sbjct: 241 MLEP----QVSVSECFHLLDFLYRMITPKLAGPLEILENDRNQKESTKENAEDENAFKHF 300

Query: 301 CSCLIRLGSEIWKILSKLNKGPVHLFRRIVSSRPLQVIFKLPWTIVSKLPGIGILMKPLE 360
           C  L  LGS IWKILSK NKGPVHLFRRI+SSRPLQVIFKLPWTIVSKLPGI ILMKPL 
Sbjct: 301 CRSLSELGSAIWKILSKFNKGPVHLFRRILSSRPLQVIFKLPWTIVSKLPGIVILMKPLS 360

Query: 361 CLFSLRKGEEENDLEKGSSRKV-KVKPPLLEEITIPSVSELTKSGVCFLPIDGGVSAVAF 420
            L SLRKGEEEND+EKGSS KV K+K PL EEI IPSVS+LTKSGV F  IDGGVSAVAF
Sbjct: 361 HLCSLRKGEEENDIEKGSSWKVGKIKLPLSEEIAIPSVSQLTKSGVHFSSIDGGVSAVAF 420

Query: 421 DSKAVIFYLPIVNLDVNSEVVLRNLVAYEASKPSGPLVFTRFIELMNGIIDSEEDVKLLK 480
           D  AVIFYLP +NLDVNSEVVLRNLVAYEASK SGPLVFTRFIELMNGIIDSEEDV+LLK
Sbjct: 421 DPNAVIFYLPTINLDVNSEVVLRNLVAYEASKASGPLVFTRFIELMNGIIDSEEDVRLLK 480

Query: 481 KKGIILNHLDSDAEVAKLWNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKY 540
           +KGIILNHL SDAEVA LWNGMSKSIKLTKVPFLDKVIEDVNK+YS RWKVKAAKFV+KY
Sbjct: 481 EKGIILNHLKSDAEVADLWNGMSKSIKLTKVPFLDKVIEDVNKYYSGRWKVKAAKFVKKY 540

Query: 541 VFGSWPLLALLATILLLAMTALQAFCSVYSCSRFFHDLNTDGT 581
           VFGSWPLLA LATILLLA+TALQAFCSVYSCSRF H LN DGT
Sbjct: 541 VFGSWPLLAFLATILLLALTALQAFCSVYSCSRFIHHLNADGT 574

BLAST of HG10023239 vs. ExPASy TrEMBL
Match: A0A1S4E348 (putative UPF0481 protein At3g02645 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499825 PE=4 SV=1)

HSP 1 Score: 895.6 bits (2313), Expect = 1.1e-256
Identity = 474/583 (81.30%), Postives = 507/583 (86.96%), Query Frame = 0

Query: 1   MSFSSKSRLHSLPAKNSWGLSSGFEERWVSLIRQSIDEEELEEDIGFPVCICTVPKSLMA 60
           M  SSKSRLHSLPA + WGL+  +EE WV+ IRQSIDEEELEEDIG P CICTVPKSL+ 
Sbjct: 1   MRLSSKSRLHSLPAGHYWGLNLSYEEGWVNQIRQSIDEEELEEDIGHPACICTVPKSLIV 60

Query: 61  IDPDSYTPQEVAIGPYHHWCQELYVMERYKIVAAKRAQKQLQSLKFHNLVEKLTKYERKT 120
            DPDSYTPQEVAIGPYHHW QELYVMERYKI AAK+AQKQLQSLKFHNLVEKL KYERK 
Sbjct: 61  FDPDSYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQLQSLKFHNLVEKLAKYERKI 120

Query: 121 RAYYHKYLNFNSETFAWMMAIDASFLLEVLRVYTIREVSTSLANDPGVNTLKLSCL--VD 180
           RAYYHKYLNFNSETF WMMAIDASFLLEVL+VYTIRE   S++      + KLSCL  VD
Sbjct: 121 RAYYHKYLNFNSETFVWMMAIDASFLLEVLQVYTIRE-EKSISR----ISSKLSCLVVVD 180

Query: 181 YEGRKSINNAILRDIIMLENQMPLFVLRKMLELQSSALEPAEQLLLSMLVGLYEDLSPFK 240
            EGR+S  N ILRDI+MLENQ+PLFVLRKML+LQS ALE  +QLLLSML+GLYEDLSPF+
Sbjct: 181 NEGRRSAQNTILRDIVMLENQIPLFVLRKMLKLQSPALEQTDQLLLSMLLGLYEDLSPFE 240

Query: 241 VMEDLVELQVSVSECFHLLDFLYRVITPKLADSLEISENDQNQKETTKENVENANAFKHL 300
           ++E     QVSVSECFHLLDFLYR+ITPKLA  LEI END+NQKE+TKEN E+ NAFKH 
Sbjct: 241 MLEP----QVSVSECFHLLDFLYRMITPKLAGPLEILENDRNQKESTKENAEDENAFKHF 300

Query: 301 CSCLIRLGSEIWKILSKLNKGPVHLFRRIVSSRPLQVIFKLPWTIVSKLPGIGILMKPLE 360
           C  L  LGS IWKILSK NKGPVHLFRRI+SSRPLQVIFKLPWTIVSKLPGI ILMKPL 
Sbjct: 301 CRSLSELGSAIWKILSKFNKGPVHLFRRILSSRPLQVIFKLPWTIVSKLPGIVILMKPLS 360

Query: 361 CLFSLRKGEEENDLEKGSSRKV-KVKPPLLEEITIPSVSELTKSGVCFLPIDGGVSAVAF 420
            L SLRKGEEEND+EKGSS KV K+K PL EEI IPSVS+LTKSGV F  IDGGVSAVAF
Sbjct: 361 HLCSLRKGEEENDIEKGSSWKVGKIKLPLSEEIAIPSVSQLTKSGVHFSSIDGGVSAVAF 420

Query: 421 DSKAVIFYLPIVNLDVNSEVVLRNLVAYEASKPSGPLVFTRFIELMNGIIDSEEDVKLLK 480
           D  AVIFYLP +NLDVNSEVVLRNLVAYEASK SGPLVFTRFIELMNGIIDSEEDV+LLK
Sbjct: 421 DPNAVIFYLPTINLDVNSEVVLRNLVAYEASKASGPLVFTRFIELMNGIIDSEEDVRLLK 480

Query: 481 KKGIILNHLDSDAEVAKLWNGMSKSIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKY 540
           +KGIILNHL SDAEVA LWNGMSKSIKLTKVPFLDKVIEDVNK+YS RWKVKAAKFV+KY
Sbjct: 481 EKGIILNHLKSDAEVADLWNGMSKSIKLTKVPFLDKVIEDVNKYYSGRWKVKAAKFVKKY 540

Query: 541 VFGSWPLLALLATILLLAMTALQAFCSVYSCSRFFHDLNTDGT 581
           VFGSWPLLA LATILLLA+TALQAFCSVYSCSRF H LN DGT
Sbjct: 541 VFGSWPLLAFLATILLLALTALQAFCSVYSCSRFIHHLNADGT 574

BLAST of HG10023239 vs. TAIR 10
Match: AT3G02645.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 463.8 bits (1192), Expect = 2.1e-130
Identity = 263/546 (48.17%), Postives = 355/546 (65.02%), Query Frame = 0

Query: 25  EERWVSLIRQSIDEEELEEDI-GFPVCICTVPKSLMAIDPDSYTPQEVAIGPYHHWCQEL 84
           E RWV  +++S+D E  E D+    V I  VPK+LM   PDSYTP  V+IGPYH    EL
Sbjct: 18  ETRWVINVQKSLDAELEEHDLEEVTVSIFNVPKALMCSHPDSYTPHRVSIGPYHCLKPEL 77

Query: 85  YVMERYKIVAAKRAQKQLQSLKFHNLVEKLTKYERKTRAYYHKYLNFNSETFAWMMAIDA 144
           + MERYK++ A++ + Q  S +FH+LVEKL   E K RA YHKY+ FN ET  W+MA+D+
Sbjct: 78  HEMERYKLMIARKIRNQYNSFRFHDLVEKLQSMEIKIRACYHKYIGFNGETLLWIMAVDS 137

Query: 145 SFLLEVLRVYTIREVSTSLANDPGVNTLKLSCLVDYEGRKSINNAILRDIIMLENQMPLF 204
           SFL+E L++Y+ R+V T L N  G                  +N ILRDI+M+ENQ+PLF
Sbjct: 138 SFLIEFLKIYSFRKVET-LINRVG------------------HNEILRDIMMIENQIPLF 197

Query: 205 VLRKMLELQSSALEPAEQLLLSMLVGLYEDLSPFKV-MEDLVELQVSVSECFHLLDFLYR 264
           VLRK LE Q  + E A+ LLLS+L GL +DLSP  +  +D   L+    EC H+LDFLY+
Sbjct: 198 VLRKTLEFQLESTESADDLLLSVLTGLCKDLSPLVIKFDDDQILKAQFQECNHILDFLYQ 257

Query: 265 VITPKLADSLEISENDQNQKETTKENVENANAFKHLCSCLIRLGSEIWKILSKLNKGPVH 324
           +I P++ +  E+ E+D+  +    EN  N           IR   EI            H
Sbjct: 258 MIVPRIEEE-ELEEDDEENR--ADENGGNR---------AIRFMDEI-----------KH 317

Query: 325 LFRRIVSSRPLQVIFKLPWTIVSKLPGIGILMKPLECLFSLRKGEEENDLEKGSSRKVKV 384
            F+R+ +SRP  +I + PW I+S LPG   L    + LF+ ++ E     ++  S     
Sbjct: 318 QFKRVFASRPADLILRFPWRIISNLPGFMALKLSADYLFTRQENEATTTRQESVSILDIE 377

Query: 385 KPPLLEEITIPSVSELTKSGVCFLP-IDGGVSAVAFDSKAVIFYLPIVNLDVNSEVVLRN 444
           KPPL+EE+TIPSVS+L K+GV F P   G +S V FDS +  FYLP++NLD+N+E VLRN
Sbjct: 378 KPPLVEELTIPSVSDLHKAGVRFKPTAHGNISTVTFDSNSGQFYLPVINLDINTETVLRN 437

Query: 445 LVAYEASKPSGPLVFTRFIELMNGIIDSEEDVKLLKKKGIILNHLDSDAEVAKLWNGMSK 504
           LVAYEA+  SGPLVFTR+ EL+NGIIDSEEDV+LL+++G++++ L SD E A++WNGMSK
Sbjct: 438 LVAYEATNTSGPLVFTRYTELINGIIDSEEDVRLLREQGVLVSRLKSDQEAAEMWNGMSK 497

Query: 505 SIKLTKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKYVFGSWPLLALLATILLLAMTALQA 564
           S++LTKV FLDK IEDVN++Y+ RWKVK  + VE YV+GSW +LA LA +LLL + +LQ 
Sbjct: 498 SVRLTKVGFLDKTIEDVNRYYTGRWKVKIGRLVEVYVYGSWQILAFLAAVLLLMLVSLQL 521

Query: 565 FCSVYS 568
           F  V+S
Sbjct: 558 FSLVFS 521

BLAST of HG10023239 vs. TAIR 10
Match: AT3G50170.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 127.1 bits (318), Expect = 4.6e-29
Identity = 143/549 (26.05%), Postives = 234/549 (42.62%), Query Frame = 0

Query: 26  ERWVSLIRQSIDEEELEED--IGFPVCICTVPKSLMAIDPDSYTPQEVAIGPYHHWCQEL 85
           + WV  IR  +++ + ++D  I   +CI  VP  L   D  SY PQ V++GPYHH  + L
Sbjct: 88  DSWVISIRDKLEQADRDDDTTIWGKLCIYRVPHYLQENDKKSYFPQTVSLGPYHHGKKRL 147

Query: 86  YVMERYKIVAAKRAQKQLQSLKFHNLVEKLTKYERKTRAYYHKYLNFNSETFAWMMAIDA 145
             MER+K  A  +  K+L+  +       + + E K RA Y   ++ +   F  M+ +D 
Sbjct: 148 RPMERHKWRALNKVLKRLKQ-RIEMYTNAMRELEEKARACYEGPISLSRNEFTEMLVLDG 207

Query: 146 SFLLEVLR--VYTIREVSTSLANDPGVNTLKLSCLVDYEGRKSINNAILRDIIMLENQMP 205
            F+LE+ R  V    E+  +  NDP                + + ++I RD+IMLENQ+P
Sbjct: 208 CFVLELFRGTVEGFTEIGYA-RNDP------------VFAMRGLMHSIQRDMIMLENQLP 267

Query: 206 LFVLRKMLELQSSALEPAEQLLLSMLVGLYEDLSPFKVMEDLVELQVSVSECFHLLDFLY 265
           LFVL ++LELQ    +    ++  + V  ++ L P                     + L 
Sbjct: 268 LFVLDRLLELQLGT-QNQTGIVAHVAVKFFDPLMPTG-------------------EALT 327

Query: 266 RVITPKLADSLEISENDQNQKETTKENVENANAFKHLCSCLIRLGSEIWKILSKLNKGPV 325
           +    KL + LE S                          L  LG          +KG +
Sbjct: 328 KPDQSKLMNWLEKS--------------------------LDTLG----------DKGEL 387

Query: 326 HLFRRIVSSRPLQVIFKLPWTIVSKLPGIGILMKPLECLFSLRKGEEENDLEKGSSRKVK 385
           H             +F+      S  P    L+K L                   +R  +
Sbjct: 388 HCLD----------VFRRSLLQSSPTPNTRSLLKRL-------------------TRNTR 447

Query: 386 VKPPLLEEITIPSVSELTKSGVCFLPIDGGVSAVAFDSKAVIFYLPIVNLDVN--SEVVL 445
           V     +++ +  V+EL ++GV F       +   +D +    YL I  L ++  ++ + 
Sbjct: 448 VVDKRQQQL-VHCVTELREAGVKFRK---RKTDRFWDIEFKNGYLEIPKLLIHDGTKSLF 507

Query: 446 RNLVAYEASKPSGPLVFTRFIELMNGIIDSEEDVKLLKKKGIILNHLDSDAEVAKLWNGM 505
            NL+A+E          T +I  M+ +I+S EDV  L   GII + L SD+EVA L+N +
Sbjct: 508 SNLIAFEQCHIESSNHITSYIIFMDNLINSSEDVSYLHYCGIIEHWLGSDSEVADLFNRL 533

Query: 506 SKSIKL-TKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKYVFGSWPLLALLATILLLAMTA 565
            + +    K   L ++  DVN++Y+ +W V  A    KY    W   +  A ++LL +T 
Sbjct: 568 CQEVVFDPKDSHLSRLSGDVNRYYNRKWNVLKATLTHKYFNNPWAYFSFSAAVILLLLTL 533

Query: 566 LQAFCSVYS 568
            Q+F +VY+
Sbjct: 628 CQSFYAVYA 533

BLAST of HG10023239 vs. TAIR 10
Match: AT3G50120.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 124.4 bits (311), Expect = 3.0e-28
Identity = 139/550 (25.27%), Postives = 235/550 (42.73%), Query Frame = 0

Query: 28  WVSLIRQSIDEEELEED--IGFPVCICTVPKSLMAIDPDSYTPQEVAIGPYHHWCQELYV 87
           WV  I   +++   ++D  +   +CI  VP  L   D  SY PQ V++GPYHH  + L  
Sbjct: 80  WVISITDKLEQAHRDDDTTLWGKLCIYRVPYYLQENDNKSYFPQTVSLGPYHHGKKRLRS 139

Query: 88  MERYKIVAAKRAQKQL-QSLKFHNLVEKLTKYERKTRAYYHKYLNFNSETFAWMMAIDAS 147
           M+R+K  A  R  K+  Q +K +  ++ + + E K RA Y   L+ +S  F  M+ +D  
Sbjct: 140 MDRHKWRAVNRVLKRTNQGIKMY--IDAMRELEEKARACYEGPLSLSSNEFIEMLVLDGC 199

Query: 148 FLLEVLRVYTIREVSTSLANDPGVNTLKLSCLVDYEGRKSINNAILRDIIMLENQMPLFV 207
           F+LE+ R           A +  V  ++ S            ++I RD++MLENQ+PLFV
Sbjct: 200 FVLELFRGAVEGFTELGYARNDPVFAMRGSM-----------HSIQRDMVMLENQLPLFV 259

Query: 208 LRKMLELQSSALEPAEQLLLSMLVGLYEDLSPFKVMEDLVELQVSVSECFHLLDFLYRVI 267
           L ++LELQ                GL   L+                          R  
Sbjct: 260 LNRLLELQLGTRNQ---------TGLVAQLA-------------------------IRFF 319

Query: 268 TPKLADSLEISENDQNQKETTKENVENANAFKHLCSCLIRLGSEIWKILSKLNKGPVH-- 327
            P +     ++++ Q++ E +    ++ + F  +                    G +H  
Sbjct: 320 DPLMPTDEPLTKSGQSKLENSLARDKSFDPFADM--------------------GELHCL 379

Query: 328 -LFRR-IVSSRPLQVIFKLPWTIVSKLPGIGILMKPLECLFSLRKGEEENDLEKGSSRKV 387
            +FRR ++ S P                      KP           E     K  SR  
Sbjct: 380 DVFRRSLLRSSP----------------------KP-----------EPRLTRKRWSRNT 439

Query: 388 KVKPPLLEEITIPSVSELTKSGVCFLPIDGGVSAVAFDSKAVIFYLPIVNLDVN--SEVV 447
           +V     +++ I  V+EL ++G+ F       +   +D +    YL I  L ++  ++ +
Sbjct: 440 RVADKRRQQL-IHCVTELKEAGIKF---RRRKTDRFWDMQFKNGYLEIPRLLIHDGTKSL 499

Query: 448 LRNLVAYEASKPSGPLVFTRFIELMNGIIDSEEDVKLLKKKGIILNHLDSDAEVAKLWNG 507
             NL+A+E          T +I  M+ +IDS EDV  L   GII + L SD+EVA L+N 
Sbjct: 500 FLNLIAFEQCHIDSSNDITSYIIFMDNLIDSHEDVSYLHYCGIIEHWLGSDSEVADLFNR 525

Query: 508 MSKSIKL-TKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKYVFGSWPLLALLATILLLAMT 567
           + + +   T+  +L ++  +VN++Y  +W    A    KY    W +++  A ++LL +T
Sbjct: 560 LCQEVVFDTEDSYLSRLSIEVNRYYDHKWNAWRATLKHKYFNNPWAIVSFCAAVILLVLT 525

BLAST of HG10023239 vs. TAIR 10
Match: AT3G50150.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 112.8 bits (281), Expect = 9.0e-25
Identity = 137/554 (24.73%), Postives = 231/554 (41.70%), Query Frame = 0

Query: 26  ERWVSLIRQSIDEEELEEDI---GFPVCICTVPKSLMAIDPDSYTPQEVAIGPYHHWCQE 85
           E WV  I+  + E+ L  D       +CI  VP  L   D  SY PQ V+IGPYHH    
Sbjct: 65  EEWVISIKDKM-EKALSYDATNSWDKLCIYRVPFYLQENDKKSYLPQTVSIGPYHHGKVH 124

Query: 86  LYVMERYKIVAAKRAQKQLQSLKFHNL---VEKLTKYERKTRAYYHKYLNF-NSETFAWM 145
           L  MER+K     RA   + +   HN+   ++ + + E + RA Y   ++  NS  F  M
Sbjct: 125 LRPMERHK----WRAVNMIMARTKHNIEMYIDAMKELEEEARACYQGPIDMKNSNEFTEM 184

Query: 146 MAIDASFLLEVLR--VYTIREVSTSLANDPGVNTLKLSCLVDYEGRKSINNAILRDIIML 205
           + +D  F+LE+ +  +   +++  +  NDP               ++ + ++I RD+IML
Sbjct: 185 LVLDGCFVLELFKGTIQGFQKIGYA-RNDP------------VFAKRGLMHSIQRDMIML 244

Query: 206 ENQMPLFVLRKMLELQSSALEPAEQLLLSMLVGLYEDLSPFKVMEDLVELQVSVSECFHL 265
           ENQ+PLFVL ++L LQ+        ++  + V  ++ L P                    
Sbjct: 245 ENQLPLFVLDRLLGLQTGTPNQT-GIVAEVAVRFFKTLMP-------------------- 304

Query: 266 LDFLYRVITPKLADSLEISENDQNQKETTKENVENANAFKHLCSCLIRLGSEIWKILSKL 325
                       ++ L  SE   + +E + E  +N         CL      + +     
Sbjct: 305 -----------TSEVLTKSERSLDSQEKSDELGDNGG-----LHCLDVFHRSLIQSSETT 364

Query: 326 NKGPVHLFRRIVSSRPLQVIFKLPWTIVSKLPGIGILMKPLECLFSLRKGEEENDLEKGS 385
           N+G  +                                            E+ + +EK  
Sbjct: 365 NQGTPY--------------------------------------------EDMSMVEK-- 424

Query: 386 SRKVKVKPPLLEEITIPSVSELTKSGVCFLPIDGGVSAVAFDSKAVIFYLPIVNLDVN-- 445
                      ++  I  V+EL  +GV F+  + G     +D +    YL I  L ++  
Sbjct: 425 -----------QQQLIHCVTELRGAGVNFMRKETG---QLWDIEFKNGYLKIPKLLIHDG 484

Query: 446 SEVVLRNLVAYEASKPSGPLVFTRFIELMNGIIDSEEDVKLLKKKGIILNHLDSDAEVAK 505
           ++ +  NL+A+E          T +I  M+ +I+S +DV  L   GII + L SD+EVA 
Sbjct: 485 TKSLFSNLIAFEQCHTQSSNNITSYIIFMDNLINSSQDVSYLHHDGIIEHWLGSDSEVAD 503

Query: 506 LWNGMSKSIKL-TKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKYVFGSWPLLALLATILL 565
           L+N + K +    K  +L ++  +VN++YS +W    A   +KY    W   +  A ++L
Sbjct: 545 LFNRLCKEVIFDPKDGYLSQLSREVNRYYSRKWNSLKATLRQKYFNNPWAYFSFSAAVIL 503

Query: 566 LAMTALQAFCSVYS 568
           L +T  Q+F +VY+
Sbjct: 605 LFLTFFQSFFAVYA 503

BLAST of HG10023239 vs. TAIR 10
Match: AT3G50130.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 109.0 bits (271), Expect = 1.3e-23
Identity = 134/548 (24.45%), Postives = 226/548 (41.24%), Query Frame = 0

Query: 26  ERWVSLIRQSIDEEELEEDIGF---PVCICTVPKSLMAIDPDSYTPQEVAIGPYHHWCQE 85
           E WV  IR  + E+ L ED       +CI  VP+ L   +  SY PQ V++GP+HH  + 
Sbjct: 114 EEWVISIRDKM-EQALREDATTSWDKLCIYRVPQYLQENNKKSYFPQTVSLGPFHHGNKH 173

Query: 86  LYVMERYKIVAAKRAQKQLQSLKFHNL---VEKLTKYERKTRAYYHKYLNFNSETFAWMM 145
           L  M+R+K     RA   + +   H++   ++ + + E + RA Y   ++ +S  F+ M+
Sbjct: 174 LLPMDRHK----WRAVNMVMARTKHDIEMYIDAMKELEDRARACYEGPIDLSSNKFSEML 233

Query: 146 AIDASFLLEVLRVYTIREVSTSLANDPGVNTLKLSCLVDYEGRKSINNAILRDIIMLENQ 205
            +D  F+LE+ R             D G + L           +   ++I RD++MLENQ
Sbjct: 234 VLDGCFVLELFR-----------GADEGFSELGYDRNDPVFAMRGSMHSIQRDMVMLENQ 293

Query: 206 MPLFVLRKMLELQSSALEPAEQLLLSMLVGLYEDLSPFKVMEDLVELQVSVSECFHLLDF 265
           +PLFVL ++LE+Q         L+  + V  ++ L P    E L +   S+ +     D 
Sbjct: 294 LPLFVLNRLLEIQLGKRHQT-GLVSRLAVRFFDPLMP--TDEPLTKTDDSLEQ-----DK 353

Query: 266 LYRVITPKLADSLEISENDQNQKETTKENVENANAFKHLCSCLIRLGSEIWKILSKLNKG 325
            +  I             D+++ E    +V   N  +   +   RL    W   +     
Sbjct: 354 FFNPIA------------DKDKGELHCLDVFRRNLLRPCSNPEPRLSRMRWSWRT----- 413

Query: 326 PVHLFRRIVSSRPLQVIFKLPWTIVSKLPGIGILMKPLECLFSLRKGEEENDLEKGSSRK 385
                 R+   R  Q+I       V++L   GI        F  RK +   D        
Sbjct: 414 ------RVADKRQQQLIH-----CVTELREAGI-------KFRTRKTDRFWD-------- 473

Query: 386 VKVKPPLLEEITIPSVSELTKSGVCFLPIDGGVSAVAFDSKAVIFYLPIVNLDVNSEVVL 445
           ++ K   LE                                     +P + +   ++ + 
Sbjct: 474 IRFKNGYLE-------------------------------------IPKLLIHDGTKSLF 533

Query: 446 RNLVAYEASKPSGPLVFTRFIELMNGIIDSEEDVKLLKKKGIILNHLDSDAEVAKLWNGM 505
            NL+A+E          T +I  M+ +IDS EDV+ L   GII + L +D EVA L+N +
Sbjct: 534 SNLIAFEQCHIDSSNDITSYIIFMDNLIDSSEDVRYLHYCGIIEHWLGNDYEVADLFNRL 557

Query: 506 SKSIKL-TKVPFLDKVIEDVNKHYSSRWKVKAAKFVEKYVFGSWPLLALLATILLLAMTA 565
            + +    +  +L ++   V+++YS +W V  A    KY    W   +  A ++LL +T 
Sbjct: 594 CQEVAFDPQNSYLSQLSNKVDRNYSRKWNVLKAILKHKYFNNPWAYFSFFAALVLLVLTL 557

Query: 566 LQAFCSVY 567
            Q+F + Y
Sbjct: 654 FQSFFTAY 557

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038899461.16.6e-27285.79LOW QUALITY PROTEIN: putative UPF0481 protein At3g02645 [Benincasa hispida][more]
KAA0058728.11.4e-26183.88putative UPF0481 protein [Cucumis melo var. makuwa][more]
XP_008461155.15.3e-26183.70PREDICTED: putative UPF0481 protein At3g02645 [Cucumis melo] >XP_008461156.1 PRE... [more]
XP_011659516.12.0e-26083.19putative UPF0481 protein At3g02645 [Cucumis sativus] >KAE8646429.1 hypothetical ... [more]
XP_004136095.12.4e-25882.50putative UPF0481 protein At3g02645 [Cucumis sativus] >XP_011659517.1 putative UP... [more]
Match NameE-valueIdentityDescription
P0C8972.9e-12948.17Putative UPF0481 protein At3g02645 OS=Arabidopsis thaliana OX=3702 GN=At3g02645 ... [more]
Q9SD537.4e-0825.00UPF0481 protein At3g47200 OS=Arabidopsis thaliana OX=3702 GN=At3g47200 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A5A7UU596.7e-26283.88Putative UPF0481 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffol... [more]
A0A1S3CDL42.6e-26183.70putative UPF0481 protein At3g02645 OS=Cucumis melo OX=3656 GN=LOC103499824 PE=4 ... [more]
A0A0A0K8N91.2e-25882.50Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G432200 PE=4 SV=1[more]
A0A5A7UUK51.1e-25681.30Putative UPF0481 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffol... [more]
A0A1S4E3481.1e-25681.30putative UPF0481 protein At3g02645 isoform X2 OS=Cucumis melo OX=3656 GN=LOC1034... [more]
Match NameE-valueIdentityDescription
AT3G02645.12.1e-13048.17Plant protein of unknown function (DUF247) [more]
AT3G50170.14.6e-2926.05Plant protein of unknown function (DUF247) [more]
AT3G50120.13.0e-2825.27Plant protein of unknown function (DUF247) [more]
AT3G50150.19.0e-2524.73Plant protein of unknown function (DUF247) [more]
AT3G50130.11.3e-2324.45Plant protein of unknown function (DUF247) [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 272..292
NoneNo IPR availablePANTHERPTHR31549:SF177BNACNNG05850D PROTEINcoord: 25..564
IPR004158Protein of unknown function DUF247, plantPFAMPF03140DUF247coord: 51..556
e-value: 1.8E-94
score: 317.3
IPR004158Protein of unknown function DUF247, plantPANTHERPTHR31549PROTEIN, PUTATIVE (DUF247)-RELATED-RELATEDcoord: 25..564

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10023239.1HG10023239.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane