HG10010971 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10010971
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: MSC domain-containing protein
Location: Chr01: 1062149 .. 1067681 (+)
RNA-Seq Expression: HG10010971
Synteny: HG10010971
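
The Location field can be converted into a feature span with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF3 convention; this page does not state it explicitly). `parse_location` is a hypothetical helper written for illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'Chr: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr01: 1062149 .. 1067681 (+)")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
length = end - start + 1
print(chrom, strand, length)
```

Under that assumption the gene spans 5533 bp on the plus strand of Chr01.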
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCTTCAACTCCGAAGAAGCGAACGAAATTCAAGCGTAATCCGAACTCCGATGTCGGTTCTGGAGGCGATTCCTCTGCTTCATCTTCTACAGTGTTGCTGAAGTCTATCAAGGAACCGCCTCGCGATTTCTTCCCCTCGAAGGATGATCTTGCTGCGCTATTTACTGTACTTTTCATCGCCTGCTTGGTTTTTGTGACTTGTAACTTCTTCGTATCTAGACTTTCAAGTCGCCACCCGAGGCCTTTCTGTGATACCGACGCCGATTCTTTGGATTTTCTTTCTGGTAAGTCTTCCATTGTTACGTTCCTTGTGTTTGTCTGTTTGGTGTTTTAGAAAATCGGAGACGAGGACATTAGAAACGACATTGTTGAATGATTTCTAGCGTATGTTCGTTCTTTGTGAAGTAATGTTCTTCACAATTAGGATGCATATTTTAGCTCACTTCTGTACATTGCATTTAGTTTTTCTTCTTTCATTTGCCAACAATAATTTCCAGTTGCATGAAATGACTTTTTCAAGCGCAATGGCGCCACGTTAGTACATTTTGTTATGCTGGAAGTAACTGGCTTGGGCGGCTGTAGGATCGGATTGCCGCTGCTTCTTTATGTCAAAATTTTAGCCGTTTTAGAGTGCAAAGGACACGCTTCTTGATTAACCAAGTGTTTAATCGTCTGTGGAATATTTGAGATTAAATTTTCATCTATCATGTTGTCTAGAGGTGACATAGAATTAAAAGTTTTTCTGTTCTCTGCTTTGTTCTGATTGCATCGATTGTTTTTCTGTAAATACTTAGAGGGCCCTTTCGTTTTACCCTAATATGTGCTTCTTATAACATTCTCTTTATGCCATTTATATCCTCCAAGTTGAATGTAAAGGTGGTGTAACTAGTTTCATTTGAAGCGATATGTTGATAAGTGATATTCATTATTTTGGTTCTTGTCTGATAGTTCTATACTTCTTACCAAAAACATGTAACTAATTTCAGATGTTTGTGAGCCTTGTCCAAGGCATGGAGAATGTCGTGATGGTAAGTTGGAATGCCTTCATGGTTATAGAAAGCATGGAAGGTTATGTATAGAAGATGGAGTAATCAATGAAGCAGTTAATAAACTTGTACGTATTGAGATTTTTCCCTGGAAGCTGGAGCCTGGAATCGGTCTTTATCTATTTTTTATTTGTCAAAATTTCACACTTTATTGCATTCTTGCAGGTAGAATGGCTAGAATCTCGCCTCTGTGAAGCAAATGCCAAGTTTTTATGCGATGGAATTGGGATAGTTTGGGTTTGTTATCTTCATGTCCCTATCAAATTTTCACTTTTGTATTTATTTTTCTCTTATAAGTATCTCTAGCCTTGCCCCTTTTTAGTTTGTCACCTATATTTCTCTTGATGAATGATTCGTTCTTATTAATTGTTAAATTTAATGACTAAAATTTAGTTGTAAATTTGATCTGAAAGGACTGAGTATTTTTGTTGTCAGCATACTGCATACGTAGTTTTCATTTGAATTCATTATCAGGTTAAAGAGGACAATATATGGGATGATCTAGATGGTAAAGAACTGGTGGACAGTATTGGCTCTGACAATACCACTCTTATGTATGCAAAGAGCAAGGCGTTGGAAACTATTGGTAGGTTATTTCAGACGCGACAAAATTCTCTTGGGTATGTGTTGGTGAAATACGTCATTCAATATTTTGTTTGAAGCCTGAAGTTTAGTGGAAACAATGTATTATTTGATTGAATTTCTTTGCATCCTCCTTGTTGCGTAGGATCAAGGAATTGAAATGCCCAGATCTGCTAGCTGAAAGTTACAAGCCTTTTCGTTGCCGTATTCATCACTGGGTTTTGCAGCATGCTTTTGTTGTTTTACCAGTTCTCTTACTGGTATTCTACTCTACAAGGGAGCTTTGAACTGATTAGGTTTTAGTTGCAATATGTTGAAGCTAGTTGATGATATTGAGATGTACAGCTTGTGGGATGCACATGGTTACTAT
GGAAACTTTTCCAAAGACAATATCTAACAAATAGAGCTGAAGATCTGTACAACCAGGTATAAGGCTATGGCCTTCTCAATTTTATTTGTTGAGCAATCATTTTTTCTTTGCATGGTTTGCTTTGTCGTTAGGTTGGTTCCTATTTCAAGCTGAAAATTTTGTTCTCAAAGCATGCGTTAGCATACTACATAGGTTGCAAATTTCCAACAAAAAGGAGTTTGGCCTAACTATAGAAAAGGACTCCATCCAAAAGAATAAGACCAAGCTCATCATTACGAAAAGACCTTAGATCTCTTAACCTTTCTAACTATTCTACAATTCTTCTCAAGCCAAATGTCCCACAAAATGACAAAGAAAGCATGCTACAAAACTTTGCCCTTCTCCATCTCTTTAAAGGAAGTAGGTAACAACTCTACCATATTTATGTCGAGCCCAACAAACATGGAACAACCTCATCCAACAACCCCAAAGGTAATGAGCAGACTAGCAATCCCACATCATACAATCGAGGTCCTTCTCACGCCTCCCACCATTGCAGATACAACACATAGAAGTAATTTCTTTGAACACGCTTCATAATATTAACACTTCCATGTAAAACTTGCCATGCAAAGCCTTAAACTTCTTTAGAATTTTAATCTTCCAGAGTAAAGAGAAAGGAGAAGGGTAGAATAGAAAAGAATTGTAGCATGCATGTCTGTTAATTTTCTTTTCCATGTAGTATTTGAGATTTACCCTCCTGAAACAATTTCAAATTCCTATAAATCTCATTGGTTTGTTGTCCTTCTTCGAAGCTATCCTAGGTCTAAAAATCAATTGTTGGAAGAGACAAGGGTGCCATTTTTGGAGTGAATTGTGACTTGTGAGGGTGCTATGTTGAACTACTAGGGTTCTTTGGTGGGGTGTGCAGTCAGTCAGCTTCTTGGGTCCTATCTTGGTCTTCCTTTAGGAGATAGCTTGAGATTTTTCTCCATGGGAAAGTGAGTACACATGATTGTATCCAACAGTTTTTCTCCATAGTGTTACATCCTCAATGGTGTGTTCTTTGTAGGAGAGAGGAAGAGGATCTTTACCTTGGGATTGTGAGTTTGTTACCTCTATGTAGAACAGGTCCTTTAGGACATTTGGGCTTGTGCTTGCTTGTAATAGAGGTTTTTGTTTGATGTTTGAGAAGGTGCTGTTGAATTTTCATTTTTATGACAAAGGAAGTGTGCTTTGGCAGTCTTATTATTATTATTTTTTGCTTTGTTGTGTGGTGTGGGTTCGAGTGGTTTGGGGAGGATTTTGGGATGTGGTTAGGTTCAAGTTCAACTCATCCCATTGGGCGTATGTTAATATGAATTTTTTTGAACAATCAGCTTAATAGGATTCTTTTGGATTGGAGCCCCTTTCTATCTTGGGTGTGAGGGCTTTTTTTTGGGGGGGGGGGGGGGGGGGGGGTTCTTTTTCTATTTCTATTTTTTACCTTTGTATATCCTTTAATCTATCTGAATGAAAGTCCGGTTGCTTTATAAACAATCAGTTTCTTTCGTTGAAGCATCATTGAGATGCAAAGTTTTATTTATGCTTGAATATTCATTTCTTAACTCTTTGATTTCCCAAACAAGATTCTTTTAGAAGTTTATCTTAATAAGTTTATTGAAGTCCCAAAACCCTTTTATATTTATTGCCTTCAAGTTTTTATCTTTTATTATCAAGAATGTCTTCTATTGTCAAGAAAATAGAAGTGTAAACGTTTTGCCGATACAAATTCATTCCAGTAGATTTTTCCATTCTAGTGCTACTCTCTCTGTCTTCCTCACAGGTTTGCGAAATACTCGAGGAAAATGCTTTGATGTCAACGAGAAACAGTGGTCAATGTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCACGGGAGAGGAAGAATCCTTTGTTATGGAGGAAGGTATCATGTCCCTCCTTGACGCTCATTCTGTCTTATTTCATTTAGAATGAATATATCT
CACCTGGAGGTGAAGAACAATTTAACTTGCTTCTTAAGAGCTATCAAGTAATTATTTCATCATCTTGTTTTGTGTTACTTGATGACCATTCTTCTTCTTCTTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTATGTTAAATGTTAGAACTTTTTTTCTGAAATAACAATCTGATTGGTGAATGTTGTTCTGGATATTTGGCCTTCAGAGTTTGCTATACTACTTGGGTCGTTTAAATTCATTGAAGCCTACTCTTTTTTTTTTTTTTTTTCATATCACAGCTTTAGATGACATAATTTCTCGATTCATTATGAGGTTCATTCTGTTAAATAGGTTCTATTTGGATGTAGCCATGGCTAAATGTTGGAAACCAAGTAATTATATGCAACTCTATGCAGTGTTAATGAACTCCATCCATAAAGAGATCACATGGTGTGCTCGCCCTCTCCACATGCCAATTTATAATCTCACTTCATTCTTGATGCTTTCCCTTGTAATTTTTGACCAACCACAAGTTCAATAATTTCTCGATTCATTAGTTTGTTTGCTTGAATGAAGTTTGACAATTAATTTATTATATGCAGGTAGAGGAGTTGGTTCAGGAAGACTCACGAATAGATCGTTACCCGAGACTAGTTAAGGGTGATGGAAAAGAAGTATGGGAATGGCAAGGTATGAAAATATTGCCCTTTCTTTTATTTCTCTTTCTACCTTAGAATGGAACAATCTCATCTATAAATGAAATTGTGGTGAGTGAATGAAGGATTCAACAATGCAGAAAACACATGTCAACATGAATTTAGCTTAGTGGTTAAGTTATCTATACTTTCTCTAGGCTTAAATATTCAACTCCACGTGGTGTAATATTCTGACAAACAAAATACTTATATTTCTAAATTGAGATGTGTGCAAAATCCTAATTAGTAGATGTTTATACAGACAAACATTTTTAAGTCATGTGTAATGGAGCTTCGTATTGAATTTCCATCTAGATTTGGGTTATTGTTAAATATTTCAAAAATGTTAATGATAAGGACTAAAATGGGGTTTTATAATAATTGAGGCACTGAAACATAACTTTTCAAGTCGAGTTAATAACAGAATGTTGAAAACTAGGAACAAAAATGAATAAATTTGGAAGTTCAAAGTCCAAAATAGGAGTGAAACCTATCTTCTTAAGCTTAGAAGCACTAGAGGAATGTTGGAGGCTCAATGTGTTAGTAAGAAGCCTGTGCATCTCGGTGATGGAACGGTATTTGTTATGCTATCTTATTTCTATCTCTACTACATTCTGATATCCATATTCTGTTGAACAGTAGAAGGCTCTTTGAGCTCTTCTAAGGAAAAGAAACTGGCCAGCAAATCCAGTTCCAGGAAGGCAATGGGAGTAAGTACTGATCGAATGTATCATAAAATAGAGAACGGTGAGTTGTGTAATGATCTTTTACACAAACATATGCTTGATCAAGTTACAGAATCTGTTCTCATTCTATTCTCATGTGATCAACTTTTTGACTTTGAAATATAA

mRNA sequence

ATGTCTTCAACTCCGAAGAAGCGAACGAAATTCAAGCGTAATCCGAACTCCGATGTCGGTTCTGGAGGCGATTCCTCTGCTTCATCTTCTACAGTGTTGCTGAAGTCTATCAAGGAACCGCCTCGCGATTTCTTCCCCTCGAAGGATGATCTTGCTGCGCTATTTACTGTACTTTTCATCGCCTGCTTGGTTTTTGTGACTTGTAACTTCTTCGTATCTAGACTTTCAAGTCGCCACCCGAGGCCTTTCTGTGATACCGACGCCGATTCTTTGGATTTTCTTTCTGATGTTTGTGAGCCTTGTCCAAGGCATGGAGAATGTCGTGATGGTAAGTTGGAATGCCTTCATGGTTATAGAAAGCATGGAAGGTTATGTATAGAAGATGGAGTAATCAATGAAGCAGTTAATAAACTTGTAGAATGGCTAGAATCTCGCCTCTGTGAAGCAAATGCCAAGTTTTTATGCGATGGAATTGGGATAGTTTGGGTTAAAGAGGACAATATATGGGATGATCTAGATGGTAAAGAACTGGTGGACAGTATTGGCTCTGACAATACCACTCTTATGTATGCAAAGAGCAAGGCGTTGGAAACTATTGGTAGGTTATTTCAGACGCGACAAAATTCTCTTGGGATCAAGGAATTGAAATGCCCAGATCTGCTAGCTGAAAGTTACAAGCCTTTTCGTTGCCGTATTCATCACTGGGTTTTGCAGCATGCTTTTGTTGTTTTACCAGTTCTCTTACTGCTTGTGGGATGCACATGGTTACTATGGAAACTTTTCCAAAGACAATATCTAACAAATAGAGCTGAAGATCTGTACAACCAGGTTTGCGAAATACTCGAGGAAAATGCTTTGATGTCAACGAGAAACAGTGGTCAATGTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCACGGGAGAGGAAGAATCCTTTGTTATGGAGGAAGGTAGAGGAGTTGGTTCAGGAAGACTCACGAATAGATCGTTACCCGAGACTAGTTAAGGGTGATGGAAAAGAAGTATGGGAATGGCAAGTAGAAGGCTCTTTGAGCTCTTCTAAGGAAAAGAAACTGGCCAGCAAATCCAGTTCCAGGAAGGCAATGGGAGTAAGTACTGATCGAATGTATCATAAAATAGAGAACGGTGAGTTGTGTAATGATCTTTTACACAAACATATGCTTGATCAAGTTACAGAATCTGTTCTCATTCTATTCTCATGTGATCAACTTTTTGACTTTGAAATATAA

Coding sequence (CDS)

ATGTCTTCAACTCCGAAGAAGCGAACGAAATTCAAGCGTAATCCGAACTCCGATGTCGGTTCTGGAGGCGATTCCTCTGCTTCATCTTCTACAGTGTTGCTGAAGTCTATCAAGGAACCGCCTCGCGATTTCTTCCCCTCGAAGGATGATCTTGCTGCGCTATTTACTGTACTTTTCATCGCCTGCTTGGTTTTTGTGACTTGTAACTTCTTCGTATCTAGACTTTCAAGTCGCCACCCGAGGCCTTTCTGTGATACCGACGCCGATTCTTTGGATTTTCTTTCTGATGTTTGTGAGCCTTGTCCAAGGCATGGAGAATGTCGTGATGGTAAGTTGGAATGCCTTCATGGTTATAGAAAGCATGGAAGGTTATGTATAGAAGATGGAGTAATCAATGAAGCAGTTAATAAACTTGTAGAATGGCTAGAATCTCGCCTCTGTGAAGCAAATGCCAAGTTTTTATGCGATGGAATTGGGATAGTTTGGGTTAAAGAGGACAATATATGGGATGATCTAGATGGTAAAGAACTGGTGGACAGTATTGGCTCTGACAATACCACTCTTATGTATGCAAAGAGCAAGGCGTTGGAAACTATTGGTAGGTTATTTCAGACGCGACAAAATTCTCTTGGGATCAAGGAATTGAAATGCCCAGATCTGCTAGCTGAAAGTTACAAGCCTTTTCGTTGCCGTATTCATCACTGGGTTTTGCAGCATGCTTTTGTTGTTTTACCAGTTCTCTTACTGCTTGTGGGATGCACATGGTTACTATGGAAACTTTTCCAAAGACAATATCTAACAAATAGAGCTGAAGATCTGTACAACCAGGTTTGCGAAATACTCGAGGAAAATGCTTTGATGTCAACGAGAAACAGTGGTCAATGTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCACGGGAGAGGAAGAATCCTTTGTTATGGAGGAAGGTAGAGGAGTTGGTTCAGGAAGACTCACGAATAGATCGTTACCCGAGACTAGTTAAGGGTGATGGAAAAGAAGTATGGGAATGGCAAGTAGAAGGCTCTTTGAGCTCTTCTAAGGAAAAGAAACTGGCCAGCAAATCCAGTTCCAGGAAGGCAATGGGAGTAAGTACTGATCGAATGTATCATAAAATAGAGAACGGTGAGTTGTGTAATGATCTTTTACACAAACATATGCTTGATCAAGTTACAGAATCTGTTCTCATTCTATTCTCATGTGATCAACTTTTTGACTTTGAAATATAA

Protein sequence

MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVVASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSRKAMGVSTDRMYHKIENGELCNDLLHKHMLDQVTESVLILFSCDQLFDFEI
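
The relationship between the coding sequence and the protein sequence can be checked by translating codons with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear code (NCBI translation table 1) applies; `translate` is an illustrative helper, and only the first and last few codons of the CDS above are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 51 nt and last 15 nt of the CDS listed above.
cds_start = "ATGTCTTCAACTCCGAAGAAGCGAACGAAATTCAAGCGTAATCCGAACTCC"
cds_end = "GACTTTGAAATATAA"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MSSTPKKRTKFKRNPNS and DFEI, which agree with the first and last residues of the protein sequence above.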
Homology
BLAST of HG10010971 vs. NCBI nr
Match: XP_038888162.1 (uncharacterized protein LOC120078048 [Benincasa hispida])

HSP 1 Score: 728.4 bits (1879), Expect = 3.5e-206
Identity = 353/387 (91.21%), Positives = 370/387 (95.61%), Query Frame = 0

Query: 1   MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFI 60
           MSSTPKKRTK KRN NSDVGS GDSS SSST+LLKSIKEPPRDFFPSKDDLAAL TVLFI
Sbjct: 1   MSSTPKKRTKVKRNTNSDVGSRGDSSVSSSTMLLKSIKEPPRDFFPSKDDLAALITVLFI 60

Query: 61  ACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEPCPRHGECRDGKLECLHGYRK 120
           ACL+FV+C+FFVSRL+SR PRPFCDTDADSLD LSDVCEPCPRHGECRDGKL+CLHGYRK
Sbjct: 61  ACLIFVSCDFFVSRLASRQPRPFCDTDADSLDLLSDVCEPCPRHGECRDGKLKCLHGYRK 120

Query: 121 HGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDS 180
           HGRLCIEDGVINEAVNKL EWLES LCEANAKFLCDGIGIVWVKED+IWDDLDGKELV+S
Sbjct: 121 HGRLCIEDGVINEAVNKLSEWLESHLCEANAKFLCDGIGIVWVKEDDIWDDLDGKELVES 180

Query: 181 IGSDNTTLMYAKSKALETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHA 240
           IGSDNTTL YAKSKALETIG LFQTRQNSLGIKELKCPDLLAESYKPF CRI HWVLQHA
Sbjct: 181 IGSDNTTLTYAKSKALETIGGLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHA 240

Query: 241 FVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300
           F VLPV LLLVGCTWLLWKL++RQY+TNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
Sbjct: 241 FAVLPVFLLLVGCTWLLWKLYRRQYITNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300

Query: 301 ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360
           ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK
Sbjct: 301 ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360

Query: 361 EKKLASKSSSRKAMGVSTDRMYHKIEN 388
           EK+LA+KS+S KAMGVSTD+M+ K+EN
Sbjct: 361 EKRLATKSNSGKAMGVSTDQMHSKMEN 387
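
The percentages in an HSP header are plain ratios over the alignment length: Identity counts identical residue pairs, and Positives additionally counts conservative substitutions (shown as "+" in the midline). A small sketch that recomputes them from the header text follows; `hsp_fractions` is a hypothetical helper, not part of BLAST or any parsing library.

```python
import re

def hsp_fractions(line):
    """Return {label: (matches, aligned_length, percent)} for each 'X = a/b' field."""
    out = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        out[label] = (int(num), int(den), round(100 * int(num) / int(den), 2))
    return out

stats = hsp_fractions("Identity = 353/387 (91.21%), Positives = 370/387 (95.61%)")
for label, (matches, length, pct) in stats.items():
    print(f"{label}: {matches}/{length} = {pct}%")
```

For the Benincasa hispida hit this reproduces the reported 91.21% identity and 95.61% positives.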

BLAST of HG10010971 vs. NCBI nr
Match: KAA0038534.1 (MSC domain-containing protein [Cucumis melo var. makuwa] >TYK31131.1 MSC domain-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 714.5 bits (1843), Expect = 5.3e-202
Identity = 352/395 (89.11%), Positives = 367/395 (92.91%), Query Frame = 0

Query: 1   MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFI 60
           MSSTPKKRTK KRNPNSDVGSG DSS SSS++LLKS+KEPPRDFFPSKDDLAAL TVL I
Sbjct: 1   MSSTPKKRTKVKRNPNSDVGSGVDSSVSSSSLLLKSMKEPPRDFFPSKDDLAALITVLII 60

Query: 61  ACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEPCPRHGECRDGKLECLHGYRK 120
           A LVFV+CNFFVSRLSSRHP PFCDTDADSLD LSDVCEPCPRHGECRDGKLECLHGYRK
Sbjct: 61  ASLVFVSCNFFVSRLSSRHPVPFCDTDADSLDLLSDVCEPCPRHGECRDGKLECLHGYRK 120

Query: 121 HGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDS 180
           HGRLCIEDGVINEAVNKL EWLES LCE+NAKFLCDGIGIVWVKE++IWDDLDGKELV+S
Sbjct: 121 HGRLCIEDGVINEAVNKLSEWLESHLCESNAKFLCDGIGIVWVKENDIWDDLDGKELVES 180

Query: 181 IGSDNTTLMYAKSKALETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHA 240
           IGSDNTTLMYAKSKALETIG L QTRQNS GIKELKCPDLLAESYKPF CRI HWVLQHA
Sbjct: 181 IGSDNTTLMYAKSKALETIGGLLQTRQNSFGIKELKCPDLLAESYKPFTCRIRHWVLQHA 240

Query: 241 FVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300
           FVVLPV LLLVGCTWLLWKL++RQ LTNRAEDLYNQVCEILEENAL STRNS QCESWVV
Sbjct: 241 FVVLPVFLLLVGCTWLLWKLYRRQNLTNRAEDLYNQVCEILEENALTSTRNSDQCESWVV 300

Query: 301 ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360
           ASRLRDHLLLPRERKNPLLW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK
Sbjct: 301 ASRLRDHLLLPRERKNPLLWKKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360

Query: 361 EKKLASKSSSR------KAMGVSTDRMYHKIENGE 390
           +KKLASKS+S       KA+GV+ D MYHKIENGE
Sbjct: 361 KKKLASKSNSASKSNFWKAIGVNPDPMYHKIENGE 395

BLAST of HG10010971 vs. NCBI nr
Match: XP_004148518.1 (uncharacterized protein LOC101208017 isoform X1 [Cucumis sativus] >KGN60427.1 hypothetical protein Csa_002483 [Cucumis sativus])

HSP 1 Score: 713.8 bits (1841), Expect = 9.0e-202
Identity = 352/397 (88.66%), Positives = 367/397 (92.44%), Query Frame = 0

Query: 1   MSSTPKKRTKFKRNPNSDVGSGG----DSSASSSTVLLKSIKEPPRDFFPSKDDLAALFT 60
           MSSTPKKRTK KRNPNSDVGSG     DSS SSS++LLKSIKEPPRDFFPSKDDLAAL T
Sbjct: 1   MSSTPKKRTKVKRNPNSDVGSGSGSGVDSSVSSSSLLLKSIKEPPRDFFPSKDDLAALIT 60

Query: 61  VLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEPCPRHGECRDGKLECLH 120
           VL IAC VFV+CNFFVSRLSSRHP PFCDTDADS DF+SDVCEPCPRHGECRDGKLECLH
Sbjct: 61  VLIIACFVFVSCNFFVSRLSSRHPIPFCDTDADSSDFISDVCEPCPRHGECRDGKLECLH 120

Query: 121 GYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKE 180
           GYRKHGRLCIEDGVINEAVNKL EWLES LCEANAKFLCDGIGIVWVKE++IWDDLDGKE
Sbjct: 121 GYRKHGRLCIEDGVINEAVNKLSEWLESHLCEANAKFLCDGIGIVWVKENDIWDDLDGKE 180

Query: 181 LVDSIGSDNTTLMYAKSKALETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWV 240
           LV+SIGSDNTTLMYAKSKALETIG L QTRQNSLGIKELKCPDLLAESYKPF CRI HWV
Sbjct: 181 LVESIGSDNTTLMYAKSKALETIGGLLQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWV 240

Query: 241 LQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCE 300
           LQHAFVVLPV LLLVGCTWLLWKL++RQYLTNRAEDLYNQVCEILEENAL STRNSGQCE
Sbjct: 241 LQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALTSTRNSGQCE 300

Query: 301 SWVVASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSL 360
           SWVVASRLRDHLLLPRER+NPLLW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSL
Sbjct: 301 SWVVASRLRDHLLLPRERRNPLLWKKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSL 360

Query: 361 SSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN 388
           SSS +KKLASKS+S       KA+GV+ D MYHKIEN
Sbjct: 361 SSSMKKKLASKSNSASKSNFWKAIGVNPDPMYHKIEN 397

BLAST of HG10010971 vs. NCBI nr
Match: XP_008465930.1 (PREDICTED: uncharacterized protein LOC103503505 isoform X3 [Cucumis melo])

HSP 1 Score: 710.3 bits (1832), Expect = 9.9e-201
Identity = 350/393 (89.06%), Positives = 365/393 (92.88%), Query Frame = 0

Query: 1   MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFI 60
           MSSTPKKRTK KRNPNSDVGSG DSS SSS++LLKS+KEPPRDFFPSKDDLAAL TVL I
Sbjct: 1   MSSTPKKRTKVKRNPNSDVGSGVDSSVSSSSLLLKSMKEPPRDFFPSKDDLAALITVLII 60

Query: 61  ACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEPCPRHGECRDGKLECLHGYRK 120
           A LVFV+CNFFVSRLSSRHP PFCDTDADSLD LSDVCEPCPRHGECRDGKLECLHGYRK
Sbjct: 61  ASLVFVSCNFFVSRLSSRHPVPFCDTDADSLDLLSDVCEPCPRHGECRDGKLECLHGYRK 120

Query: 121 HGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDS 180
           HGRLCIEDGVINEAVNKL EWLES LCE+NAKFLCDGIGIVWVKE++IWDDLDGKELV+S
Sbjct: 121 HGRLCIEDGVINEAVNKLSEWLESHLCESNAKFLCDGIGIVWVKENDIWDDLDGKELVES 180

Query: 181 IGSDNTTLMYAKSKALETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHA 240
           IGSDNTTLMYAKSKALETIG L QTRQNS GIKELKCPDLLAESYKPF CRI HWVLQHA
Sbjct: 181 IGSDNTTLMYAKSKALETIGGLLQTRQNSFGIKELKCPDLLAESYKPFTCRIRHWVLQHA 240

Query: 241 FVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300
           FVVLPV LLLVGCTWLLWKL++RQ LTNRAEDLYNQVCEILEENAL STRNS QCESWVV
Sbjct: 241 FVVLPVFLLLVGCTWLLWKLYRRQNLTNRAEDLYNQVCEILEENALTSTRNSDQCESWVV 300

Query: 301 ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360
           ASRLRDHLLLPRERKNPLLW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK
Sbjct: 301 ASRLRDHLLLPRERKNPLLWKKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360

Query: 361 EKKLASKSSSR------KAMGVSTDRMYHKIEN 388
           +KKLASKS+S       KA+GV+ D MYHKIEN
Sbjct: 361 KKKLASKSNSASKSNFWKAIGVNPDPMYHKIEN 393

BLAST of HG10010971 vs. NCBI nr
Match: XP_023533380.1 (uncharacterized protein LOC111795284 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 674.9 bits (1740), Expect = 4.6e-190
Identity = 330/388 (85.05%), Positives = 350/388 (90.21%), Query Frame = 0

Query: 1   MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFI 60
           MSSTPK+RTKFK N NSDV S  DS  SSS VLL S+K PPRDFFPSKDDL  L TVLFI
Sbjct: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSVKGPPRDFFPSKDDLTRLITVLFI 60

Query: 61  ACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEPCPRHGECRDGKLECLHGYRK 120
           A LVFV+CNFFVSRL +R PRPFCD+DADS D LSD CEPCP HGEC +GKLEC HGYR+
Sbjct: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120

Query: 121 HGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDS 180
           HGRLCIEDGVIN+AV KL EWLES LCEANAKFLCDGIGIVWV+ED IWDDLDGK LV++
Sbjct: 121 HGRLCIEDGVINKAVKKLSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN 180

Query: 181 IGSDNTTLMYAKSKALETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHA 240
           I SDNTT+MYAKSKALETIG LFQ RQN+LGIKELKCPD LAESYKPF CRI HWVLQHA
Sbjct: 181 IDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQHA 240

Query: 241 FVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300
           FVVLPV LLLVGCTWLLWKL +RQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
Sbjct: 241 FVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300

Query: 301 ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360
           ASRLRDHLLLPRERK+PLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK
Sbjct: 301 ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360

Query: 361 EKKLASKSSSRKAMGVSTDRMYHKIENG 389
           EK+LASKSSSR AMGV++D +Y K+ENG
Sbjct: 361 EKRLASKSSSRMAMGVNSDVIYSKMENG 388

BLAST of HG10010971 vs. ExPASy TrEMBL
Match: A0A5A7T509 (MSC domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G004100 PE=4 SV=1)

HSP 1 Score: 714.5 bits (1843), Expect = 2.5e-202
Identity = 352/395 (89.11%), Positives = 367/395 (92.91%), Query Frame = 0

Query: 1   MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFI 60
           MSSTPKKRTK KRNPNSDVGSG DSS SSS++LLKS+KEPPRDFFPSKDDLAAL TVL I
Sbjct: 1   MSSTPKKRTKVKRNPNSDVGSGVDSSVSSSSLLLKSMKEPPRDFFPSKDDLAALITVLII 60

Query: 61  ACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEPCPRHGECRDGKLECLHGYRK 120
           A LVFV+CNFFVSRLSSRHP PFCDTDADSLD LSDVCEPCPRHGECRDGKLECLHGYRK
Sbjct: 61  ASLVFVSCNFFVSRLSSRHPVPFCDTDADSLDLLSDVCEPCPRHGECRDGKLECLHGYRK 120

Query: 121 HGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDS 180
           HGRLCIEDGVINEAVNKL EWLES LCE+NAKFLCDGIGIVWVKE++IWDDLDGKELV+S
Sbjct: 121 HGRLCIEDGVINEAVNKLSEWLESHLCESNAKFLCDGIGIVWVKENDIWDDLDGKELVES 180

Query: 181 IGSDNTTLMYAKSKALETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHA 240
           IGSDNTTLMYAKSKALETIG L QTRQNS GIKELKCPDLLAESYKPF CRI HWVLQHA
Sbjct: 181 IGSDNTTLMYAKSKALETIGGLLQTRQNSFGIKELKCPDLLAESYKPFTCRIRHWVLQHA 240

Query: 241 FVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300
           FVVLPV LLLVGCTWLLWKL++RQ LTNRAEDLYNQVCEILEENAL STRNS QCESWVV
Sbjct: 241 FVVLPVFLLLVGCTWLLWKLYRRQNLTNRAEDLYNQVCEILEENALTSTRNSDQCESWVV 300

Query: 301 ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360
           ASRLRDHLLLPRERKNPLLW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK
Sbjct: 301 ASRLRDHLLLPRERKNPLLWKKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360

Query: 361 EKKLASKSSSR------KAMGVSTDRMYHKIENGE 390
           +KKLASKS+S       KA+GV+ D MYHKIENGE
Sbjct: 361 KKKLASKSNSASKSNFWKAIGVNPDPMYHKIENGE 395

BLAST of HG10010971 vs. ExPASy TrEMBL
Match: A0A0A0LI89 (MSC domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G910640 PE=4 SV=1)

HSP 1 Score: 713.8 bits (1841), Expect = 4.3e-202
Identity = 352/397 (88.66%), Positives = 367/397 (92.44%), Query Frame = 0

Query: 1   MSSTPKKRTKFKRNPNSDVGSGG----DSSASSSTVLLKSIKEPPRDFFPSKDDLAALFT 60
           MSSTPKKRTK KRNPNSDVGSG     DSS SSS++LLKSIKEPPRDFFPSKDDLAAL T
Sbjct: 1   MSSTPKKRTKVKRNPNSDVGSGSGSGVDSSVSSSSLLLKSIKEPPRDFFPSKDDLAALIT 60

Query: 61  VLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEPCPRHGECRDGKLECLH 120
           VL IAC VFV+CNFFVSRLSSRHP PFCDTDADS DF+SDVCEPCPRHGECRDGKLECLH
Sbjct: 61  VLIIACFVFVSCNFFVSRLSSRHPIPFCDTDADSSDFISDVCEPCPRHGECRDGKLECLH 120

Query: 121 GYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKE 180
           GYRKHGRLCIEDGVINEAVNKL EWLES LCEANAKFLCDGIGIVWVKE++IWDDLDGKE
Sbjct: 121 GYRKHGRLCIEDGVINEAVNKLSEWLESHLCEANAKFLCDGIGIVWVKENDIWDDLDGKE 180

Query: 181 LVDSIGSDNTTLMYAKSKALETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWV 240
           LV+SIGSDNTTLMYAKSKALETIG L QTRQNSLGIKELKCPDLLAESYKPF CRI HWV
Sbjct: 181 LVESIGSDNTTLMYAKSKALETIGGLLQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWV 240

Query: 241 LQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCE 300
           LQHAFVVLPV LLLVGCTWLLWKL++RQYLTNRAEDLYNQVCEILEENAL STRNSGQCE
Sbjct: 241 LQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALTSTRNSGQCE 300

Query: 301 SWVVASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSL 360
           SWVVASRLRDHLLLPRER+NPLLW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSL
Sbjct: 301 SWVVASRLRDHLLLPRERRNPLLWKKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSL 360

Query: 361 SSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN 388
           SSS +KKLASKS+S       KA+GV+ D MYHKIEN
Sbjct: 361 SSSMKKKLASKSNSASKSNFWKAIGVNPDPMYHKIEN 397

BLAST of HG10010971 vs. ExPASy TrEMBL
Match: A0A1S3CQ15 (uncharacterized protein LOC103503505 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103503505 PE=4 SV=1)

HSP 1 Score: 710.3 bits (1832), Expect = 4.8e-201
Identity = 350/393 (89.06%), Positives = 365/393 (92.88%), Query Frame = 0

Query: 1   MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFI 60
           MSSTPKKRTK KRNPNSDVGSG DSS SSS++LLKS+KEPPRDFFPSKDDLAAL TVL I
Sbjct: 1   MSSTPKKRTKVKRNPNSDVGSGVDSSVSSSSLLLKSMKEPPRDFFPSKDDLAALITVLII 60

Query: 61  ACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEPCPRHGECRDGKLECLHGYRK 120
           A LVFV+CNFFVSRLSSRHP PFCDTDADSLD LSDVCEPCPRHGECRDGKLECLHGYRK
Sbjct: 61  ASLVFVSCNFFVSRLSSRHPVPFCDTDADSLDLLSDVCEPCPRHGECRDGKLECLHGYRK 120

Query: 121 HGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDS 180
           HGRLCIEDGVINEAVNKL EWLES LCE+NAKFLCDGIGIVWVKE++IWDDLDGKELV+S
Sbjct: 121 HGRLCIEDGVINEAVNKLSEWLESHLCESNAKFLCDGIGIVWVKENDIWDDLDGKELVES 180

Query: 181 IGSDNTTLMYAKSKALETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHA 240
           IGSDNTTLMYAKSKALETIG L QTRQNS GIKELKCPDLLAESYKPF CRI HWVLQHA
Sbjct: 181 IGSDNTTLMYAKSKALETIGGLLQTRQNSFGIKELKCPDLLAESYKPFTCRIRHWVLQHA 240

Query: 241 FVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300
           FVVLPV LLLVGCTWLLWKL++RQ LTNRAEDLYNQVCEILEENAL STRNS QCESWVV
Sbjct: 241 FVVLPVFLLLVGCTWLLWKLYRRQNLTNRAEDLYNQVCEILEENALTSTRNSDQCESWVV 300

Query: 301 ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360
           ASRLRDHLLLPRERKNPLLW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK
Sbjct: 301 ASRLRDHLLLPRERKNPLLWKKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360

Query: 361 EKKLASKSSSR------KAMGVSTDRMYHKIEN 388
           +KKLASKS+S       KA+GV+ D MYHKIEN
Sbjct: 361 KKKLASKSNSASKSNFWKAIGVNPDPMYHKIEN 393

BLAST of HG10010971 vs. ExPASy TrEMBL
Match: A0A6J1H2A7 (uncharacterized protein LOC111459381 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111459381 PE=4 SV=1)

HSP 1 Score: 671.4 bits (1731), Expect = 2.5e-189
Identity = 329/387 (85.01%), Positives = 348/387 (89.92%), Query Frame = 0

Query: 1   MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFI 60
           MSSTPK+RTKFK N NSDV S  DS  SSS VLL SIK PPRDFFPSKDDL  L TVLFI
Sbjct: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60

Query: 61  ACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEPCPRHGECRDGKLECLHGYRK 120
           A LVFV+CNFFVSRL +R PRPFCD+DADS D LSD CEPCP HGEC +GKLEC HGYR+
Sbjct: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120

Query: 121 HGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDS 180
           HGRLCIEDGVIN+AV KL EWLES LCEANAKFLCDGIGIVWV+ED IWDDLDGK LV++
Sbjct: 121 HGRLCIEDGVINKAVKKLSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN 180

Query: 181 IGSDNTTLMYAKSKALETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHA 240
           I SDNTT+MYAKSKALETIG LFQ RQN+LGIKELKCPD LAESYKPF CRI HWVLQHA
Sbjct: 181 IDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQHA 240

Query: 241 FVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300
           FVVLPV LLLVGCTWLLWKL +RQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
Sbjct: 241 FVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300

Query: 301 ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360
           ASRLRDHLLLPRERK+PLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK
Sbjct: 301 ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360

Query: 361 EKKLASKSSSRKAMGVSTDRMYHKIEN 388
           EK+LASKSSSR  MGV++D +Y K+EN
Sbjct: 361 EKRLASKSSSRMVMGVNSDVIYSKMEN 387

BLAST of HG10010971 vs. ExPASy TrEMBL
Match: A0A6J1E026 (uncharacterized protein LOC111026156 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111026156 PE=4 SV=1)

HSP 1 Score: 668.7 bits (1724), Expect = 1.6e-188
Identity = 321/386 (83.16%), Positives = 354/386 (91.71%), Query Frame = 0

Query: 1   MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFI 60
           MSSTPK+R K K NP+SD GS GDSSASSSTVLLKS+K+PPRDFFPS++DL  L TVLFI
Sbjct: 1   MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFI 60

Query: 61  ACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEPCPRHGECRDGKLECLHGYRK 120
           ACLVF++CNFFVSRL+SR P PFCDTDADSLD LSD C+PCP HGECR G+LEC+ GYRK
Sbjct: 61  ACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKPCPSHGECRGGELECVRGYRK 120

Query: 121 HGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDS 180
           HGRLCIEDGVINEAV KL EWLES LCEANAKF+CDG+G VWVKED+IWDDLDG+ LV++
Sbjct: 121 HGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVEN 180

Query: 181 IGSDNTTLMYAKSKALETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHA 240
           IGSDNTT MYAK KALETI  LFQT+QNSLGI+ELKCPDLLAESYKPF CRIHHWVL+HA
Sbjct: 181 IGSDNTTFMYAKRKALETIIGLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHA 240

Query: 241 FVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300
           FVVLPV LLLVGCTWLLWKL++RQ+LTNRAE+LYNQVCEILEENALMS R SGQCESWVV
Sbjct: 241 FVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV 300

Query: 301 ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360
           ASRLRDHLLLPRERK+PLLWRKVEELVQEDSRIDRYPRLVKG+GKEVWEWQVEGSLSSSK
Sbjct: 301 ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSK 360

Query: 361 EKKLASKSSSRKAMGVSTDRMYHKIE 387
           EK+LASK SSR AM V++DR+Y K++
Sbjct: 361 EKRLASKLSSRVAMEVNSDRIYRKVD 386

BLAST of HG10010971 vs. TAIR 10
Match: AT5G46560.1 (CONTAINS InterPro DOMAIN/s: Inner nuclear membrane protein MAN1 (InterPro:IPR018996); Has 58 Blast hits to 58 proteins in 29 species: Archae - 0; Bacteria - 4; Metazoa - 11; Fungi - 15; Plants - 20; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 349.0 bits (894), Expect = 5.4e-96
Identity = 174/380 (45.79%), Positives = 248/380 (65.26%), Query Frame = 0

Query: 1   MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFI 60
           M S P+KR      P S+  +G    +SSS+  ++S+ EPP+  FPSK +   L  VL +
Sbjct: 1   MDSIPRKR------PKSETRTGRTPKSSSSSSPIRSMLEPPQSLFPSKGEFFTLLKVLLV 60

Query: 61  ACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEPCPRHGECRDGKLECLHGYRK 120
           AC V  TCNF    LSS   + FCD++ + +D   D+CEPCP +GEC  GKL+C  GY+ 
Sbjct: 61  ACAVAFTCNFLSKSLSSNPSKSFCDSNFNPIDSDLDICEPCPINGECYQGKLQCNLGYKN 120

Query: 121 HGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDS 180
              LC+EDG INE+  KLV + E ++CE+ A   C G G +WV E+++W +L     + +
Sbjct: 121 QRNLCVEDGEINESTKKLVGYFERKVCESYAHNECYGTGTIWVPENDVWTELRSNSFLSN 180

Query: 181 IGSDNTTLMYAKSKALETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHA 240
           +  D +   + K KA+E +  L + R NS GI ELKCP+ +A+SYKP  CR+H W+L+H 
Sbjct: 181 L--DESAYNFLKGKAVEGVTELLEKRTNSNGIDELKCPESVAKSYKPLTCRLHQWILRHI 240

Query: 241 FVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMS-TRNSGQCESWV 300
            ++     +LVG   L  ++ ++Q  + R E+LY+QVC+ LEENA+ S +  +  CE WV
Sbjct: 241 LIISSSCAMLVGSAMLRRRIQRKQCFSRRVEELYDQVCDFLEENAVASNSAETSNCEPWV 300

Query: 301 VASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSS 360
           +AS LRD+LLLPRER++PLLW KVEEL++EDSRIDRY +L+KG+ K VWEWQVEGSLS S
Sbjct: 301 IASWLRDYLLLPRERRDPLLWTKVEELIKEDSRIDRYEKLLKGEKKVVWEWQVEGSLSLS 360

Query: 361 K-EKKLASKSSSRKAMGVST 379
           K +K+  ++   RK++  ST
Sbjct: 361 KLKKQRETQKKVRKSIDSST 372

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value   Identity (%)  Description
XP_038888162.1  3.5e-206  91.21         uncharacterized protein LOC120078048 [Benincasa hispida]
KAA0038534.1    5.3e-202  89.11         MSC domain-containing protein [Cucumis melo var. makuwa] >TYK31131.1 MSC domain-...
XP_004148518.1  9.0e-202  88.66         uncharacterized protein LOC101208017 isoform X1 [Cucumis sativus] >KGN60427.1 hy...
XP_008465930.1  9.9e-201  89.06         PREDICTED: uncharacterized protein LOC103503505 isoform X3 [Cucumis melo]
XP_023533380.1  4.6e-190  85.05         uncharacterized protein LOC111795284 isoform X2 [Cucurbita pepo subsp. pepo]

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A5A7T509      2.5e-202  89.11         MSC domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sc...
A0A0A0LI89      4.3e-202  88.66         MSC domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G910640 PE=4 SV...
A0A1S3CQ15      4.8e-201  89.06         uncharacterized protein LOC103503505 isoform X3 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1H2A7      2.5e-189  85.01         uncharacterized protein LOC111459381 isoform X3 OS=Cucurbita moschata OX=3662 GN...
A0A6J1E026      1.6e-188  83.16         uncharacterized protein LOC111026156 isoform X1 OS=Momordica charantia OX=3673 G...

TAIR 10
Match Name      E-value   Identity (%)  Description
AT5G46560.1     5.4e-96   45.79         CONTAINS InterPro DOMAIN/s: Inner nuclear membrane protein MAN1 (InterPro:IPR018...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description            Source       Source Term   Source Description                           Alignment
IPR041885  MAN1, winged-helix domain  GENE3D       1.10.10.1180  -                                            coord: 256..350; e-value: 1.8E-12; score: 49.4
IPR018996  Man1/Src1, C-terminal      PFAM         PF09402       MSC                                          coord: 92..350; e-value: 9.9E-14; score: 51.5
None       No IPR available           MOBIDB_LITE  mobidb-lite   disorder_prediction                          coord: 15..29
None       No IPR available           MOBIDB_LITE  mobidb-lite   disorder_prediction                          coord: 1..29
IPR044780  Heh2/Src1-like             PANTHER      PTHR47808     INNER NUCLEAR MEMBRANE PROTEIN HEH2-RELATED  coord: 47..371
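
The coord values can be compared directly to see how the predicted domains nest within the protein. The sketch below assumes the coordinates are 1-based, inclusive residue positions on the protein sequence above (the page does not say so explicitly); the helper names are illustrative only.

```python
# (signature, start, end) copied from the coord values in the InterPro table
hits = [
    ("GENE3D 1.10.10.1180", 256, 350),
    ("PFAM PF09402", 92, 350),
    ("PANTHER PTHR47808", 47, 371),
]

def span_length(start, end):
    """Length of an inclusive residue range."""
    return end - start + 1

def overlap(a, b):
    """Number of residues shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)

lengths = {name: span_length(start, end) for name, start, end in hits}
for name, n in lengths.items():
    print(name, n)

# The winged-helix (GENE3D) hit lies entirely inside the Pfam MSC hit.
shared = overlap((256, 350), (92, 350))
print(shared)
```

Under these assumptions the Pfam MSC hit covers 259 residues and fully contains the 95-residue GENE3D winged-helix hit, while the PANTHER family hit covers 325 residues.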

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10010971.1  HG10010971.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0005637      nuclear inner membrane
molecular_function  GO:0003682      chromatin binding