HG10016628 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10016628
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: DUF21 domain-containing protein
Location: Chr03: 6584650 .. 6587563 (+)
RNA-Seq Expression: HG10016628
Synteny: HG10016628
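As a small worked example, the genomic span of the feature can be derived from the Location coordinates. This sketch assumes the coordinates are 1-based and inclusive (the usual GFF convention; the page itself does not state this), and `span_length` is an illustrative helper name, not part of any database API.

```python
def span_length(start: int, end: int) -> int:
    """Length in bases of an interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field: Chr03: 6584650 .. 6587563 (+)
gene_length = span_length(6584650, 6587563)
print(gene_length)  # 2914
```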
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAGTGGAATATAGCTGCTGCACTACAGGATTTTTCAGTCGCATTGGAATAGTCATCTTCTTGGTGTTGTTTGCTGGGTTGATGTCCGGCCTCACTCTTGGCCTCATGTCTATGAGCATTGTTGAGATTGAAGTCCTTGCCAAGTCTGGAAAACCAAGTGACCGTAAACATGCTGGTCATCTCTTCTTCTTCATTCTCTTCCTTTTTCTTTCACTTTCCTGGGAAAATGTTTCTGTCTTCTCTCTCTTAATGGAGTTCTTCTTTTAATGTTTGTTACTAACAAGAACTTGGCAATCCTTCATCAACATAACAGCAAGGATACTGCCAGTTGTTAGAGGACAACACTTATTGCTTTGCACTTTATTGATCTGTAATGCTGCAGCCATGGAGGTAACTGTAAATTGTTGGAAAGTTTAGATTTTTGCTGTATTGTATCAGCTGGAAAAAAATTCATAGTTTTGTATTTACTTTGACAGGCACTTCCAATATTTTTGGACAGTTTGGTGACCGCTTGGGGAGCCATATTGATCTCTGTCACTTTGATCTTATTATTTGGGGAGGTGATCCATCAAACTTTCACTTATCTTTGTTAGAATCCTCAGTGAGAACAAGTTAAGCTTTTAGGCCAAGTTAATGTAAAAGGTTAGAAATTAGAACATGTTCTCATGCTCTGTTTCTGCAGATTATTCCTCAAGCTGTTTGTTCTAGATATGGTTTAGCAATTGGTGCAACAGTGGCTCCTTTTGTGAGGGTTCTTGTTTGGATTTGCTTTCCCGTTGCATATCCTATAAGCAAGGTAATTGCTTTCTTCAGCCAAACTTAATTGCTAAAGCACATTGAATAACAAAAAGTTGTTCACTCATTATATTTGTCGTTTTAAGAGTAGAAGTACATCACATAATAGAAGAAAAGGTTAACAAGCATCACATAATGTGTGTGTGTGTGTCTCAATTCATACCGTCTCAATTTCAATTTTTCAAGAATGAACTGAATTGGTTTTGGTCCCCAAATGGGTGAAAACAGATCAGCATCAAAAACCAAAAAATTGATTATTTGGTTGTGTTTTGACTAGCTATTGGACCTTTCACTGGGTAAAGGACACAAAGCCTTGTTCCGTAGAGCAGAACTGAAAACACTCGTAGATTTTCATGGCAATGAGGTACTTTATCTCAAGCACAAGTAACTATCCATCTATCTCTCATGTTCTCTCTTGTGGCTGAGTTTGGTTCCTTAAAAATACAGGCTGGAAAAGGAGGAGAGTTGACACGAGACGAAACAACCATAATAGGAGGAGCACTGGAACTCAGTGAGAAGGTGGCGAGAGACGCCATGACTCCCATTTCTGAAACATTTGCGATCGATATTAACGCTAATCTTGACAGGTTCACTGCCTAATCTGATCATCACCTTCACCACAGAGAAATACACCAACTTCATTCTTCTTTTCTCCTCCATCCTAATAAATATACTTAACAATGCAGCACCTTAATCAAGTTAATTCTGGAGAAGGGACATAGCAGAGTGCCTGTATTCTACGAACGCCCTACAAGTATCATTGGCCTCGTATTGGTAACGTTCGTCCATGCCAATTAGTTATAATAATAACAACAGCAACATTCTTTTACCTGAAATCTTCATATTTAAATTGTAGGTGAAGAATTTAATAACTAGGCTTTCACCAGATGGGGTACCAATTAAGAACTTCCCGATTCGGAAAATTCCAAGGTACTTTTATGCCCCTAATTTTTCTGTATACTGCATGTTAGGAAATCTTTCCAGATAACACAGCTGATCTAAGAGAAGCTTACTGATGAACGACTTCCACATCATTCCCTTTCACTATCAGCCAATATGAATTTTTGTTCTTTAATCATCAGTAATGGACACTAATGCAACAGTAAGCAAAATGAGAATTCTTTCTGGACTCCTTGTTTACTTCCCACTAGGCTTATGATTTACATACAAATGAATGCCTGCTCATTTAGGCTAATGTTATT
ACAGAAACCCCTTCACTTGGGTTGAATCTAAGAATAGCATAGTTACAAAACCTAAATATAAAAGACATCGCCTAGCTTTTTTGGTAAAAGAAACATTTATTGATAAAGCGGGAGAGAACTCCAAGAATAGAATGGTTACATAAGTGAATGTCAATTACTGACTAAAAAAGATAAACTGAAATGACTAAAAGGGTGTTTAGTTTTACACCAAGAAAAAGCGGTAGAAAGAACTTGTTCCATAAAACGAGGAAACGAGGAAAAAGCATCATGAAAGTTGAAAGAAAAGGACTTCCATTTTCACTAAAAATCCCAACTTGTCCATCAGGGTCTCAGAAACAATGCCGTTGTACGACATACTAAATGATTTCCAGAAAGGTCACAGTCATATGGCTGTTGTCGTAAGAGAAAAAGAAAATCCAGAGAAGTCAGTCAATGGAAATCAACTTAAGGGTAAGCATTTACCTTTTTAATTTCTACTTAACTTGGCTTCTTCAGTTTGACGATGATACATATAACACAAATGAACATGCAGGAAAAGATGTGAAAGTGGACATCGATGGTGAAAATCACCCGGAAAAATGTTTAAAGAGCAAGAGATCGCTAAAAAGGCTAAACACATTTGTCGATCGTAGTAATTCCTATCGAAAGTTCTCTGGAAGTAAGAAATGGTCGAAAGACTTCAACTCAGAGGTCCTGCATATTGCTGATGACCTGCTGCCCAAGCTCTCTGAAGAGGGGGAAGCAATCGGCATCATAACACTTGAAGATGTCATTGAGGAGCTTTTACAGGTACAGCACATCCTCTATCTACAGTTCTGTCTTCCTATCAATGTTGCAAAGTTTGCTAATTGATTACTTTTACTTTATGTTGACTGCAGGAGGAAATCTATGATGAGACAGATTACCGTACTTAG

mRNA sequence

ATGGGAGTGGAATATAGCTGCTGCACTACAGGATTTTTCAGTCGCATTGGAATAGTCATCTTCTTGGTGTTGTTTGCTGGGTTGATGTCCGGCCTCACTCTTGGCCTCATGTCTATGAGCATTGTTGAGATTGAAGTCCTTGCCAAGTCTGGAAAACCAAGTGACCGTAAACATGCTGCAAGGATACTGCCAGTTGTTAGAGGACAACACTTATTGCTTTGCACTTTATTGATCTGTAATGCTGCAGCCATGGAGGCACTTCCAATATTTTTGGACAGTTTGGTGACCGCTTGGGGAGCCATATTGATCTCTGTCACTTTGATCTTATTATTTGGGGAGATTATTCCTCAAGCTGTTTGTTCTAGATATGGTTTAGCAATTGGTGCAACAGTGGCTCCTTTTGTGAGGGTTCTTGTTTGGATTTGCTTTCCCGTTGCATATCCTATAAGCAAGCTATTGGACCTTTCACTGGGTAAAGGACACAAAGCCTTGTTCCGTAGAGCAGAACTGAAAACACTCGTAGATTTTCATGGCAATGAGGCTGGAAAAGGAGGAGAGTTGACACGAGACGAAACAACCATAATAGGAGGAGCACTGGAACTCAGTGAGAAGGTGGCGAGAGACGCCATGACTCCCATTTCTGAAACATTTGCGATCGATATTAACGCTAATCTTGACAGCACCTTAATCAAGTTAATTCTGGAGAAGGGACATAGCAGAGTGCCTGTATTCTACGAACGCCCTACAAGTATCATTGGCCTCGTATTGGTGAAGAATTTAATAACTAGGCTTTCACCAGATGGGGTACCAATTAAGAACTTCCCGATTCGGAAAATTCCAAGGGTCTCAGAAACAATGCCGTTGTACGACATACTAAATGATTTCCAGAAAGGTCACAGTCATATGGCTGTTGTCGTAAGAGAAAAAGAAAATCCAGAGAAGTCAGTCAATGGAAATCAACTTAAGGGAAAAGATGTGAAAGTGGACATCGATGGTGAAAATCACCCGGAAAAATGTTTAAAGAGCAAGAGATCGCTAAAAAGGCTAAACACATTTGTCGATCGTAGTAATTCCTATCGAAAGTTCTCTGGAAGTAAGAAATGGTCGAAAGACTTCAACTCAGAGGTCCTGCATATTGCTGATGACCTGCTGCCCAAGCTCTCTGAAGAGGGGGAAGCAATCGGCATCATAACACTTGAAGATGTCATTGAGGAGCTTTTACAGGAGGAAATCTATGATGAGACAGATTACCGTACTTAG

Coding sequence (CDS)

ATGGGAGTGGAATATAGCTGCTGCACTACAGGATTTTTCAGTCGCATTGGAATAGTCATCTTCTTGGTGTTGTTTGCTGGGTTGATGTCCGGCCTCACTCTTGGCCTCATGTCTATGAGCATTGTTGAGATTGAAGTCCTTGCCAAGTCTGGAAAACCAAGTGACCGTAAACATGCTGCAAGGATACTGCCAGTTGTTAGAGGACAACACTTATTGCTTTGCACTTTATTGATCTGTAATGCTGCAGCCATGGAGGCACTTCCAATATTTTTGGACAGTTTGGTGACCGCTTGGGGAGCCATATTGATCTCTGTCACTTTGATCTTATTATTTGGGGAGATTATTCCTCAAGCTGTTTGTTCTAGATATGGTTTAGCAATTGGTGCAACAGTGGCTCCTTTTGTGAGGGTTCTTGTTTGGATTTGCTTTCCCGTTGCATATCCTATAAGCAAGCTATTGGACCTTTCACTGGGTAAAGGACACAAAGCCTTGTTCCGTAGAGCAGAACTGAAAACACTCGTAGATTTTCATGGCAATGAGGCTGGAAAAGGAGGAGAGTTGACACGAGACGAAACAACCATAATAGGAGGAGCACTGGAACTCAGTGAGAAGGTGGCGAGAGACGCCATGACTCCCATTTCTGAAACATTTGCGATCGATATTAACGCTAATCTTGACAGCACCTTAATCAAGTTAATTCTGGAGAAGGGACATAGCAGAGTGCCTGTATTCTACGAACGCCCTACAAGTATCATTGGCCTCGTATTGGTGAAGAATTTAATAACTAGGCTTTCACCAGATGGGGTACCAATTAAGAACTTCCCGATTCGGAAAATTCCAAGGGTCTCAGAAACAATGCCGTTGTACGACATACTAAATGATTTCCAGAAAGGTCACAGTCATATGGCTGTTGTCGTAAGAGAAAAAGAAAATCCAGAGAAGTCAGTCAATGGAAATCAACTTAAGGGAAAAGATGTGAAAGTGGACATCGATGGTGAAAATCACCCGGAAAAATGTTTAAAGAGCAAGAGATCGCTAAAAAGGCTAAACACATTTGTCGATCGTAGTAATTCCTATCGAAAGTTCTCTGGAAGTAAGAAATGGTCGAAAGACTTCAACTCAGAGGTCCTGCATATTGCTGATGACCTGCTGCCCAAGCTCTCTGAAGAGGGGGAAGCAATCGGCATCATAACACTTGAAGATGTCATTGAGGAGCTTTTACAGGAGGAAATCTATGATGAGACAGATTACCGTACTTAG

Protein sequence

MGVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAARILPVVRGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVCSRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNEAGKGGELTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSRVPVFYERPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHSHMAVVVREKENPEKSVNGNQLKGKDVKVDIDGENHPEKCLKSKRSLKRLNTFVDRSNSYRKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT
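The relationship between the CDS and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration that assumes the standard nuclear genetic code; `translate` is a hypothetical helper, and only the first 24 and last 12 bases of the CDS listed above are used to keep the example short.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":  # stop codon: end of the polypeptide
            break
        protein.append(residue)
    return "".join(protein)


# First eight codons and last four codons of the CDS shown above.
print(translate("ATGGGAGTGGAATATAGCTGCTGC"))  # MGVEYSCC
print(translate("TACCGTACTTAG"))              # YRT
```

Both fragments agree with the start (MGVEYSCC...) and end (...YRT) of the protein sequence given in this record.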
Homology
BLAST of HG10016628 vs. NCBI nr
Match: XP_008440429.1 (PREDICTED: DUF21 domain-containing protein At4g33700-like isoform X1 [Cucumis melo] >KAA0036392.1 DUF21 domain-containing protein [Cucumis melo var. makuwa] >TYK12788.1 DUF21 domain-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 761.9 bits (1966), Expect = 2.9e-216
Identity = 398/419 (94.99%), Positives = 407/419 (97.14%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAA 60
           M VEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMS+VEIEVLAKSGKPSDRKHAA
Sbjct: 1   MEVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSDRKHAA 60

Query: 61  RILPVVRGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           RILPV R QHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  RILPVCRRQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNE 180
           SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLD+SLGK HKALFRRAELKTLVDFHGNE
Sbjct: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDVSLGKEHKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSR 240
           AGKGGELTRDETTIIGGALEL+EKVARD MTPISETFAIDINANLDS LIKLILE+GHSR
Sbjct: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPISETFAIDINANLDSNLIKLILERGHSR 240

Query: 241 VPVFYERPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           VPVFYERPT+IIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS
Sbjct: 241 VPVFYERPTNIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300

Query: 301 HMAVVVREKENPEKSVNGNQLKGKDVKVDIDGENHPEKCLKSKRSLKRLNTFVDRSNSYR 360
           HMAVVVREKENPE SV GNQL+ KDVKV+IDGEN  EK LK+KRSLKRLNTFVDRSNS+R
Sbjct: 301 HMAVVVREKENPEGSVGGNQLEAKDVKVEIDGENQQEKGLKTKRSLKRLNTFVDRSNSHR 360

Query: 361 KFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT 420
           KFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT
Sbjct: 361 KFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT 419
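The percentages in the HSP summary line follow directly from the match counts divided by the alignment length. A minimal sketch, with `percent` as an illustrative helper name, using the counts reported for this hit:

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of alignment columns that match, as a percentage rounded to 2 places."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the HSP summary for XP_008440429.1
print(percent(398, 419))  # identical residues: 94.99
print(percent(407, 419))  # identical or conservatively substituted residues: 97.14
```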

BLAST of HG10016628 vs. NCBI nr
Match: XP_004143412.1 (DUF21 domain-containing protein At4g33700 isoform X2 [Cucumis sativus])

HSP 1 Score: 749.6 bits (1934), Expect = 1.5e-212
Identity = 390/420 (92.86%), Positives = 405/420 (96.43%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAA 60
           M VEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMS+VEIEVLAKSGKPSDRK+AA
Sbjct: 1   MEVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSDRKYAA 60

Query: 61  RILPVVRGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           RILPV R QHLLLCTLLICNA AMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  RILPVCRKQHLLLCTLLICNAVAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNE 180
           SRYGLA+GATVAPFVRVLVWICFPVAYPISKLLD+SLGK HKALFRRAELKTLVDFHGNE
Sbjct: 121 SRYGLAVGATVAPFVRVLVWICFPVAYPISKLLDISLGKEHKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSR 240
           AGKGGELTRDETTII GALEL+EKVARD MTPISETFAIDINANLDS L+KLILEKGHSR
Sbjct: 181 AGKGGELTRDETTIIAGALELTEKVARDVMTPISETFAIDINANLDSNLVKLILEKGHSR 240

Query: 241 VPVFYERPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           VPVFYERPT+IIGLVLVKNLITRLSPDG+PIK+FPIRKIPRVSETMPLY+ILNDFQKGHS
Sbjct: 241 VPVFYERPTNIIGLVLVKNLITRLSPDGIPIKSFPIRKIPRVSETMPLYNILNDFQKGHS 300

Query: 301 HMAVVVREKENPEKSVNGNQLKGKDVKVDIDGENH-PEKCLKSKRSLKRLNTFVDRSNSY 360
           HMAV+VREKENPE+SV GNQL+ KDVKV+IDGENH  EK L +KRSLKRLNT VDRSNSY
Sbjct: 301 HMAVIVREKENPERSVKGNQLEAKDVKVEIDGENHQQEKGLNTKRSLKRLNTLVDRSNSY 360

Query: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT 420
           RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT
Sbjct: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT 420

BLAST of HG10016628 vs. NCBI nr
Match: XP_038882679.1 (DUF21 domain-containing protein At4g33700-like isoform X2 [Benincasa hispida])

HSP 1 Score: 748.8 bits (1932), Expect = 2.5e-212
Identity = 394/420 (93.81%), Positives = 402/420 (95.71%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAA 60
           MGVEYSCCTTGFF  IGIVIFLVLFAGLMSGLTLGLMSMS+VEIEVLAKSGKPS RKHAA
Sbjct: 1   MGVEYSCCTTGFFGCIGIVIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSHRKHAA 60

Query: 61  RILPVVRGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           RILPVVR QHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  RILPVVRRQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNE 180
           SRYGLAIGATV PFVRVLV ICFPVAYPISKLLD SLGK HKALFRRAELKTLVDFHGNE
Sbjct: 121 SRYGLAIGATVTPFVRVLVCICFPVAYPISKLLDFSLGKDHKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSR 240
           AGKGGELTRDETTIIGGALEL+EKVARD MTPIS+TF IDINANLDS LIKLILEKGHSR
Sbjct: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPISKTFGIDINANLDSNLIKLILEKGHSR 240

Query: 241 VPVFYERPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           VPVFYERPT+IIGL+LVKNLITRLSPDGVPIKNFPIRKIPRVS+T+PLYDILNDFQKGHS
Sbjct: 241 VPVFYERPTNIIGLILVKNLITRLSPDGVPIKNFPIRKIPRVSKTIPLYDILNDFQKGHS 300

Query: 301 HMAVVVREKENPEKSVNGNQLKGKDVKVDIDGENH-PEKCLKSKRSLKRLNTFVDRSNSY 360
           HMAVVVREKENPEKSVNGNQL+  DVKVDIDGENH  EK LKSKRSLKRLNTFVDRSNSY
Sbjct: 301 HMAVVVREKENPEKSVNGNQLEANDVKVDIDGENHQQEKSLKSKRSLKRLNTFVDRSNSY 360

Query: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT 420
           RKFSGSKKWSKD NSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETD RT
Sbjct: 361 RKFSGSKKWSKDLNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDNRT 420

BLAST of HG10016628 vs. NCBI nr
Match: XP_022963200.1 (DUF21 domain-containing protein At2g14520-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 743.8 bits (1919), Expect = 8.1e-211
Identity = 385/420 (91.67%), Positives = 403/420 (95.95%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAA 60
           MGVEYSCCT+GFFSRIGIV+FLVLFAG+MSGLTLGLMSMS+VEIEVLAKSGKPSDRKHAA
Sbjct: 1   MGVEYSCCTSGFFSRIGIVVFLVLFAGMMSGLTLGLMSMSLVEIEVLAKSGKPSDRKHAA 60

Query: 61  RILPVVRGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           +ILPVVR QHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  KILPVVRRQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNE 180
           SRYGLAIGA VAPFVR+LVWICFPVAYPISKLLD SLGK HKALFRRAELKTLVDFHGNE
Sbjct: 121 SRYGLAIGAAVAPFVRLLVWICFPVAYPISKLLDFSLGKEHKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSR 240
           AGKGGELTRDETTIIGGALEL+EKVARD MTP+SETFAID+NANLDS LIKLILEKGHSR
Sbjct: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDVNANLDSNLIKLILEKGHSR 240

Query: 241 VPVFYERPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           VPVFYE P +IIGLVLVKNLIT  SPDGVPIKNFPIRK PRVSETMPLYDILNDFQKGHS
Sbjct: 241 VPVFYEHPANIIGLVLVKNLITGHSPDGVPIKNFPIRKCPRVSETMPLYDILNDFQKGHS 300

Query: 301 HMAVVVREKENPEKSVNGNQLKGKDVKVDIDGENHP-EKCLKSKRSLKRLNTFVDRSNSY 360
           HMAVVVREKENPEK ++GNQL+ +DVKVDIDGENHP EK LKSKRSLKRLNTFVDRSN +
Sbjct: 301 HMAVVVREKENPEKPISGNQLEARDVKVDIDGENHPQEKSLKSKRSLKRLNTFVDRSNYH 360

Query: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT 420
           RKFSGSKKWSKDF+SEVLHIADD+L KL+EEGEAIGIITLEDVIEELLQEEIYDETDYRT
Sbjct: 361 RKFSGSKKWSKDFDSEVLHIADDMLAKLTEEGEAIGIITLEDVIEELLQEEIYDETDYRT 420

BLAST of HG10016628 vs. NCBI nr
Match: XP_031743033.1 (DUF21 domain-containing protein At4g33700 isoform X1 [Cucumis sativus])

HSP 1 Score: 741.5 bits (1913), Expect = 4.0e-210
Identity = 390/430 (90.70%), Positives = 405/430 (94.19%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHA- 60
           M VEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMS+VEIEVLAKSGKPSDRK+A 
Sbjct: 1   MEVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSDRKYAV 60

Query: 61  ---------ARILPVVRGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILL 120
                    ARILPV R QHLLLCTLLICNA AMEALPIFLDSLVTAWGAILISVTLILL
Sbjct: 61  GFTLSLNITARILPVCRKQHLLLCTLLICNAVAMEALPIFLDSLVTAWGAILISVTLILL 120

Query: 121 FGEIIPQAVCSRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAEL 180
           FGEIIPQAVCSRYGLA+GATVAPFVRVLVWICFPVAYPISKLLD+SLGK HKALFRRAEL
Sbjct: 121 FGEIIPQAVCSRYGLAVGATVAPFVRVLVWICFPVAYPISKLLDISLGKEHKALFRRAEL 180

Query: 181 KTLVDFHGNEAGKGGELTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLI 240
           KTLVDFHGNEAGKGGELTRDETTII GALEL+EKVARD MTPISETFAIDINANLDS L+
Sbjct: 181 KTLVDFHGNEAGKGGELTRDETTIIAGALELTEKVARDVMTPISETFAIDINANLDSNLV 240

Query: 241 KLILEKGHSRVPVFYERPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYD 300
           KLILEKGHSRVPVFYERPT+IIGLVLVKNLITRLSPDG+PIK+FPIRKIPRVSETMPLY+
Sbjct: 241 KLILEKGHSRVPVFYERPTNIIGLVLVKNLITRLSPDGIPIKSFPIRKIPRVSETMPLYN 300

Query: 301 ILNDFQKGHSHMAVVVREKENPEKSVNGNQLKGKDVKVDIDGENH-PEKCLKSKRSLKRL 360
           ILNDFQKGHSHMAV+VREKENPE+SV GNQL+ KDVKV+IDGENH  EK L +KRSLKRL
Sbjct: 301 ILNDFQKGHSHMAVIVREKENPERSVKGNQLEAKDVKVEIDGENHQQEKGLNTKRSLKRL 360

Query: 361 NTFVDRSNSYRKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQE 420
           NT VDRSNSYRKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQE
Sbjct: 361 NTLVDRSNSYRKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQE 420

BLAST of HG10016628 vs. ExPASy Swiss-Prot
Match: Q9ZQR4 (DUF21 domain-containing protein At2g14520 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF3 PE=2 SV=2)

HSP 1 Score: 557.4 bits (1435), Expect = 1.4e-157
Identity = 297/420 (70.71%), Positives = 351/420 (83.57%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAA 60
           M VEY CC T FF  I +++ LVLFAGLMSGLTLGLMSMS+V++EVLAKSG P DR HAA
Sbjct: 1   MAVEYECCGTSFFIHIAVIVLLVLFAGLMSGLTLGLMSMSLVDLEVLAKSGTPRDRIHAA 60

Query: 61  RILPVVRGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           +ILPVV+ QHLLLCTLLICNAAAMEALPIFLD+LVTAWGAILISVTLILLFGEIIPQ+VC
Sbjct: 61  KILPVVKNQHLLLCTLLICNAAAMEALPIFLDALVTAWGAILISVTLILLFGEIIPQSVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNE 180
           SR+GLAIGATVAPFVRVLVWIC PVA+PISKLLD  LG G  ALFRRAELKTLVD HGNE
Sbjct: 121 SRHGLAIGATVAPFVRVLVWICLPVAWPISKLLDFLLGHGRVALFRRAELKTLVDLHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSR 240
           AGKGGELT DETTII GALELSEK+A+DAMTPIS+TF IDINA LD  L+ LIL+KGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELSEKMAKDAMTPISDTFVIDINAKLDRDLMNLILDKGHSR 240

Query: 241 VPVFYERPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           VPV+YE+ T+IIGLVLVKNL+T    + + +KN  IR+IPRV ET+PLYDILN+FQKGHS
Sbjct: 241 VPVYYEQRTNIIGLVLVKNLLTINPDEEIQVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300

Query: 301 HMAVVVR--EKENPEKSVNGNQLKGKDVKVDIDGENHPEKC-LKSKRSLKRLNTFVDRSN 360
           HMAVVVR  +K +P +S +       +V+VD+D E  P++  LK +RSL++  +F +R+N
Sbjct: 301 HMAVVVRQCDKIHPLQSNDAANETVNEVRVDVDYERSPQETKLKRRRSLQKWKSFPNRAN 360

Query: 361 SYRKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDY 418
           S    S SK+WSKD ++++L + +  LPKL EE +A+GIIT+EDVIEELLQEEI+DETD+
Sbjct: 361 SLG--SRSKRWSKDNDADILQLNEHPLPKLDEEEDAVGIITMEDVIEELLQEEIFDETDH 418

BLAST of HG10016628 vs. ExPASy Swiss-Prot
Match: Q8VZI2 (DUF21 domain-containing protein At4g33700 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF6 PE=1 SV=1)

HSP 1 Score: 554.7 bits (1428), Expect = 9.1e-157
Identity = 295/422 (69.91%), Positives = 352/422 (83.41%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAA 60
           M VEY CC+  FF  I +++FLVLFAGLMSGLTLGLMS+S+V++EVLAKSG P  RK+AA
Sbjct: 1   MAVEYVCCSPNFFIHIAVIVFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPEHRKYAA 60

Query: 61  RILPVVRGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           +ILPVV+ QHLLL TLLICNAAAME LPIFLD LVTAWGAILISVTLILLFGEIIPQ++C
Sbjct: 61  KILPVVKNQHLLLVTLLICNAAAMETLPIFLDGLVTAWGAILISVTLILLFGEIIPQSIC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNE 180
           SRYGLAIGATVAPFVRVLV+IC PVA+PISKLLD  LG    ALFRRAELKTLVDFHGNE
Sbjct: 121 SRYGLAIGATVAPFVRVLVFICLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSR 240
           AGKGGELT DETTII GALELSEK+ +DAMTPIS+ F IDINA LD  L+ LILEKGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELSEKMVKDAMTPISDIFVIDINAKLDRDLMNLILEKGHSR 240

Query: 241 VPVFYERPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           VPV+YE+PT+IIGLVLVKNL+T    + +P+KN  IR+IPRV E +PLYDILN+FQKG S
Sbjct: 241 VPVYYEQPTNIIGLVLVKNLLTINPDEEIPVKNVTIRRIPRVPEILPLYDILNEFQKGLS 300

Query: 301 HMAVVVR--EKENPEKSVNGNQLKGKDVKVDIDGENHP---EKCLKSKRSLKRLNTFVDR 360
           HMAVVVR  +K +P  S NG+    K+ +VD+D E  P   E+ L++KRSL++  +F +R
Sbjct: 301 HMAVVVRQCDKIHPLPSKNGSV---KEARVDVDSEGTPTPQERMLRTKRSLQKWKSFPNR 360

Query: 361 SNSYRKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDET 418
           ++S++  S SKKWSKD ++++L +  + LPKL+EE EA+GIIT+EDVIEELLQEEI+DET
Sbjct: 361 ASSFKGGSKSKKWSKDNDADILQLNGNPLPKLAEEEEAVGIITMEDVIEELLQEEIFDET 419

BLAST of HG10016628 vs. ExPASy Swiss-Prot
Match: Q8RY60 (DUF21 domain-containing protein At1g47330 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF7 PE=1 SV=1)

HSP 1 Score: 417.9 bits (1073), Expect = 1.3e-115
Identity = 237/446 (53.14%), Positives = 302/446 (67.71%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAA 60
           M  +  CC T F   + I+I LV FAGLM+GLTLGLMS+ +V++EVL KSG+P DR +A 
Sbjct: 1   MSSDIPCCGTTFSLYVVIIIALVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRINAG 60

Query: 61  RILPVVRGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           +I PVV+ QHLLLCTLLI N+ AMEALPIFLD +V  W AIL+SVTLIL+FGEI+PQAVC
Sbjct: 61  KIFPVVKNQHLLLCTLLIGNSMAMEALPIFLDKIVPPWLAILLSVTLILVFGEIMPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNE 180
           +RYGL +GA +APFVRVL+ + FP++YPISK+LD  LGKGH  L RRAELKT V+FHGNE
Sbjct: 121 TRYGLKVGAIMAPFVRVLLVLFFPISYPISKVLDWMLGKGHGVLLRRAELKTFVNFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSR 240
           AGKGG+LT DET+II GALEL+EK A+DAMTPIS  F+++++  L+   +  I+  GHSR
Sbjct: 181 AGKGGDLTTDETSIITGALELTEKTAKDAMTPISNAFSLELDTPLNLETLNTIMSVGHSR 240

Query: 241 VPVFYERPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           VPV++  PT IIGL+LVKNL+   +   VP++   +RKIPRVSETMPLYDILN+FQKGHS
Sbjct: 241 VPVYFRNPTHIIGLILVKNLLAVDARKEVPLRKMSMRKIPRVSETMPLYDILNEFQKGHS 300

Query: 301 HMAVVVR----EKENPEKSVNG-NQLKGKDVKVDIDGENHPEKCLKSKRSLKRLNTFVDR 360
           H+AVV +    ++++PE S NG  + K K  K     E   + C K K   +     V  
Sbjct: 301 HIAVVYKDLDEQEQSPETSENGIERRKNKKTK----DELFKDSCRKPKAQFE-----VSE 360

Query: 361 SNSYRKFSGSKKWSKDFNSE-------------------------VLHIADDLLPKLSEE 417
              ++  +G  K  K  N E                         +L I +  +P     
Sbjct: 361 KEVFKIETGDAKSGKSENGEEQQGSGKTSLLAAPAKKRHRGCSFCILDIENTPIPDFPTN 420

BLAST of HG10016628 vs. ExPASy Swiss-Prot
Match: Q9LTD8 (DUF21 domain-containing protein At5g52790 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF5 PE=2 SV=2)

HSP 1 Score: 401.0 bits (1029), Expect = 1.7e-110
Identity = 228/417 (54.68%), Positives = 287/417 (68.82%), Query Frame = 0

Query: 7   CCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAARILPVV 66
           CC T F+  + + + LV+FAGLMSGLTLGLMS+SIVE+EV+ K+G+P DRK+A +ILP+V
Sbjct: 8   CCETMFWVYLLVCVALVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNAEKILPLV 67

Query: 67  RGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVCSRYGLA 126
           + QHLLLCTLLI NA AMEALPIF+DSL+ AWGAILISVTLIL FGEIIPQAVCSRYGL+
Sbjct: 68  KNQHLLLCTLLIGNALAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAVCSRYGLS 127

Query: 127 IGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNEAGKGGE 186
           IGA ++  VR+++ + FP++YPISKLLDL LGK H  L  RAELK+LV  HGNEAGKGGE
Sbjct: 128 IGAKLSFLVRLIIIVFFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGNEAGKGGE 187

Query: 187 LTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSRVPVFYE 246
           LT DETTII GAL++S+K A+DAMTP+S+ F++DIN  LD   + LI   GHSR+P++  
Sbjct: 188 LTHDETTIISGALDMSQKSAKDAMTPVSQIFSLDINFKLDEKTMGLIASAGHSRIPIYSV 247

Query: 247 RPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHSHMAVVV 306
            P  IIG +LVKNLI     D   I++ PIR++P+V   +PLYDILN FQ G SHMA VV
Sbjct: 248 NPNVIIGFILVKNLIKVRPEDETSIRDLPIRRMPKVDLNLPLYDILNIFQTGRSHMAAVV 307

Query: 307 REKENP-------EKSVNGNQLKGKDVKVDIDGENHPEKCLKSKRSLKRLNTFVDRSNSY 366
             K +        EKS+NG+  K  +V + I   N  E   +S   ++ +++  D     
Sbjct: 308 GTKNHTNTNTPVHEKSINGSPNKDANVFLSIPALNSSETSHQS--PIRYIDSISD----- 367

Query: 367 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETD 417
                                        E+ E IGIITLEDV+EEL+QEEIYDETD
Sbjct: 368 -----------------------------EDEEVIGIITLEDVMEELIQEEIYDETD 388

BLAST of HG10016628 vs. ExPASy Swiss-Prot
Match: Q67XQ0 (DUF21 domain-containing protein At4g14240 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF1 PE=1 SV=1)

HSP 1 Score: 367.1 bits (941), Expect = 2.7e-100
Identity = 208/400 (52.00%), Positives = 274/400 (68.50%), Query Frame = 0

Query: 17  GIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAARILPVVRGQHLLLCTL 76
           GI  FLVLFAG+MSGLTLGLMS+ +VE+E+L +SG P+++K AA I PVV+ QH LL TL
Sbjct: 40  GISCFLVLFAGIMSGLTLGLMSLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTL 99

Query: 77  LICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVCSRYGLAIGATVAPFVR 136
           L+CNA AME LPI+LD L   + AI++SVT +L FGE+IPQA+C+RYGLA+GA     VR
Sbjct: 100 LLCNAMAMEGLPIYLDKLFNEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVR 159

Query: 137 VLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNEAGKGGELTRDETTIIG 196
           +L+ +C+P+A+PI K+LDL LG  + ALFRRA+LK LV  H  EAGKGGELT DETTII 
Sbjct: 160 ILMTLCYPIAFPIGKILDLVLGH-NDALFRRAQLKALVSIHSQEAGKGGELTHDETTIIS 219

Query: 197 GALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSRVPVFYERPTSIIGLVL 256
           GAL+L+EK A++AMTPI  TF++D+N+ LD   +  IL +GHSRVPV+   P ++IGL+L
Sbjct: 220 GALDLTEKTAQEAMTPIESTFSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLL 279

Query: 257 VKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHSHMAVVVREKENPEKSV 316
           VK+L+T        +    IR+IPRV   MPLYDILN+FQKG SHMA VV+ K   +  V
Sbjct: 280 VKSLLTVRPETETLVSAVCIRRIPRVPADMPLYDILNEFQKGSSHMAAVVKVK--GKSKV 339

Query: 317 NGNQLKGKDVKVDIDGENHPEKCLKSKRSLKRLNTFVDRSNSYRKFSGSKKWSKDFNSEV 376
             + L  +      D +      LK + +   +   +D++N    F  ++     F+   
Sbjct: 340 PPSTLLEEHTDESNDSDLTAPLLLKREGNHDNVIVTIDKANGQSFFQNNESGPHGFS--- 399

Query: 377 LHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETD 417
            H ++ +     E+GE IGIITLEDV EELLQEEI DETD
Sbjct: 400 -HTSEAI-----EDGEVIGIITLEDVFEELLQEEIVDETD 427

BLAST of HG10016628 vs. ExPASy TrEMBL
Match: A0A5A7SYT7 (DUF21 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G003770 PE=4 SV=1)

HSP 1 Score: 761.9 bits (1966), Expect = 1.4e-216
Identity = 398/419 (94.99%), Positives = 407/419 (97.14%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAA 60
           M VEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMS+VEIEVLAKSGKPSDRKHAA
Sbjct: 1   MEVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSDRKHAA 60

Query: 61  RILPVVRGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           RILPV R QHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  RILPVCRRQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNE 180
           SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLD+SLGK HKALFRRAELKTLVDFHGNE
Sbjct: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDVSLGKEHKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSR 240
           AGKGGELTRDETTIIGGALEL+EKVARD MTPISETFAIDINANLDS LIKLILE+GHSR
Sbjct: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPISETFAIDINANLDSNLIKLILERGHSR 240

Query: 241 VPVFYERPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           VPVFYERPT+IIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS
Sbjct: 241 VPVFYERPTNIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300

Query: 301 HMAVVVREKENPEKSVNGNQLKGKDVKVDIDGENHPEKCLKSKRSLKRLNTFVDRSNSYR 360
           HMAVVVREKENPE SV GNQL+ KDVKV+IDGEN  EK LK+KRSLKRLNTFVDRSNS+R
Sbjct: 301 HMAVVVREKENPEGSVGGNQLEAKDVKVEIDGENQQEKGLKTKRSLKRLNTFVDRSNSHR 360

Query: 361 KFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT 420
           KFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT
Sbjct: 361 KFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT 419

BLAST of HG10016628 vs. ExPASy TrEMBL
Match: A0A1S3B147 (DUF21 domain-containing protein At4g33700-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103484876 PE=4 SV=1)

HSP 1 Score: 761.9 bits (1966), Expect = 1.4e-216
Identity = 398/419 (94.99%), Positives = 407/419 (97.14%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAA 60
           M VEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMS+VEIEVLAKSGKPSDRKHAA
Sbjct: 1   MEVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSDRKHAA 60

Query: 61  RILPVVRGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           RILPV R QHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  RILPVCRRQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNE 180
           SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLD+SLGK HKALFRRAELKTLVDFHGNE
Sbjct: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDVSLGKEHKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSR 240
           AGKGGELTRDETTIIGGALEL+EKVARD MTPISETFAIDINANLDS LIKLILE+GHSR
Sbjct: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPISETFAIDINANLDSNLIKLILERGHSR 240

Query: 241 VPVFYERPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           VPVFYERPT+IIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS
Sbjct: 241 VPVFYERPTNIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300

Query: 301 HMAVVVREKENPEKSVNGNQLKGKDVKVDIDGENHPEKCLKSKRSLKRLNTFVDRSNSYR 360
           HMAVVVREKENPE SV GNQL+ KDVKV+IDGEN  EK LK+KRSLKRLNTFVDRSNS+R
Sbjct: 301 HMAVVVREKENPEGSVGGNQLEAKDVKVEIDGENQQEKGLKTKRSLKRLNTFVDRSNSHR 360

Query: 361 KFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT 420
           KFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT
Sbjct: 361 KFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT 419

BLAST of HG10016628 vs. ExPASy TrEMBL
Match: A0A0A0KG11 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G496430 PE=4 SV=1)

HSP 1 Score: 749.6 bits (1934), Expect = 7.1e-213
Identity = 390/420 (92.86%), Positives = 405/420 (96.43%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAA 60
           M VEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMS+VEIEVLAKSGKPSDRK+AA
Sbjct: 1   MEVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSDRKYAA 60

Query: 61  RILPVVRGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           RILPV R QHLLLCTLLICNA AMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  RILPVCRKQHLLLCTLLICNAVAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNE 180
           SRYGLA+GATVAPFVRVLVWICFPVAYPISKLLD+SLGK HKALFRRAELKTLVDFHGNE
Sbjct: 121 SRYGLAVGATVAPFVRVLVWICFPVAYPISKLLDISLGKEHKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSR 240
           AGKGGELTRDETTII GALEL+EKVARD MTPISETFAIDINANLDS L+KLILEKGHSR
Sbjct: 181 AGKGGELTRDETTIIAGALELTEKVARDVMTPISETFAIDINANLDSNLVKLILEKGHSR 240

Query: 241 VPVFYERPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           VPVFYERPT+IIGLVLVKNLITRLSPDG+PIK+FPIRKIPRVSETMPLY+ILNDFQKGHS
Sbjct: 241 VPVFYERPTNIIGLVLVKNLITRLSPDGIPIKSFPIRKIPRVSETMPLYNILNDFQKGHS 300

Query: 301 HMAVVVREKENPEKSVNGNQLKGKDVKVDIDGENH-PEKCLKSKRSLKRLNTFVDRSNSY 360
           HMAV+VREKENPE+SV GNQL+ KDVKV+IDGENH  EK L +KRSLKRLNT VDRSNSY
Sbjct: 301 HMAVIVREKENPERSVKGNQLEAKDVKVEIDGENHQQEKGLNTKRSLKRLNTLVDRSNSY 360

Query: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT 420
           RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT
Sbjct: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT 420

BLAST of HG10016628 vs. ExPASy TrEMBL
Match: A0A6J1HHB3 (DUF21 domain-containing protein At2g14520-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111463484 PE=4 SV=1)

HSP 1 Score: 743.8 bits (1919), Expect = 3.9e-211
Identity = 385/420 (91.67%), Positives = 403/420 (95.95%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAA 60
           MGVEYSCCT+GFFSRIGIV+FLVLFAG+MSGLTLGLMSMS+VEIEVLAKSGKPSDRKHAA
Sbjct: 1   MGVEYSCCTSGFFSRIGIVVFLVLFAGMMSGLTLGLMSMSLVEIEVLAKSGKPSDRKHAA 60

Query: 61  RILPVVRGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           +ILPVVR QHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  KILPVVRRQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNE 180
           SRYGLAIGA VAPFVR+LVWICFPVAYPISKLLD SLGK HKALFRRAELKTLVDFHGNE
Sbjct: 121 SRYGLAIGAAVAPFVRLLVWICFPVAYPISKLLDFSLGKEHKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSR 240
           AGKGGELTRDETTIIGGALEL+EKVARD MTP+SETFAID+NANLDS LIKLILEKGHSR
Sbjct: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDVNANLDSNLIKLILEKGHSR 240

Query: 241 VPVFYERPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           VPVFYE P +IIGLVLVKNLIT  SPDGVPIKNFPIRK PRVSETMPLYDILNDFQKGHS
Sbjct: 241 VPVFYEHPANIIGLVLVKNLITGHSPDGVPIKNFPIRKCPRVSETMPLYDILNDFQKGHS 300

Query: 301 HMAVVVREKENPEKSVNGNQLKGKDVKVDIDGENHP-EKCLKSKRSLKRLNTFVDRSNSY 360
           HMAVVVREKENPEK ++GNQL+ +DVKVDIDGENHP EK LKSKRSLKRLNTFVDRSN +
Sbjct: 301 HMAVVVREKENPEKPISGNQLEARDVKVDIDGENHPQEKSLKSKRSLKRLNTFVDRSNYH 360

Query: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRT 420
           RKFSGSKKWSKDF+SEVLHIADD+L KL+EEGEAIGIITLEDVIEELLQEEIYDETDYRT
Sbjct: 361 RKFSGSKKWSKDFDSEVLHIADDMLAKLTEEGEAIGIITLEDVIEELLQEEIYDETDYRT 420

BLAST of HG10016628 vs. ExPASy TrEMBL
Match: A0A6J1HJD9 (DUF21 domain-containing protein At2g14520-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111463484 PE=4 SV=1)

HSP 1 Score: 734.2 bits (1894), Expect = 3.1e-208
Identity = 387/443 (87.36%), Positives = 404/443 (91.20%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAA 60
           MGVEYSCCT+GFFSRIGIV+FLVLFAG+MSGLTLGLMSMS+VEIEVLAKSGKPSDRKHAA
Sbjct: 1   MGVEYSCCTSGFFSRIGIVVFLVLFAGMMSGLTLGLMSMSLVEIEVLAKSGKPSDRKHAA 60

Query: 61  RILPVVRGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           +ILPVVR QHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  KILPVVRRQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNE 180
           SRYGLAIGA VAPFVR+LVWICFPVAYPISKLLD SLGK HKALFRRAELKTLVDFHGNE
Sbjct: 121 SRYGLAIGAAVAPFVRLLVWICFPVAYPISKLLDFSLGKEHKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSR 240
           AGKGGELTRDETTIIGGALEL+EKVARD MTP+SETFAID+NANLDS LIKLILEKGHSR
Sbjct: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDVNANLDSNLIKLILEKGHSR 240

Query: 241 VPVFYERPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           VPVFYE P +IIGLVLVKNLIT  SPDGVPIKNFPIRK PRVSETMPLYDILNDFQKGHS
Sbjct: 241 VPVFYEHPANIIGLVLVKNLITGHSPDGVPIKNFPIRKCPRVSETMPLYDILNDFQKGHS 300

Query: 301 HMAVVVREKENPEKSVNGNQLKGK-----------------------DVKVDIDGENHP- 360
           HMAVVVREKENPEK ++GNQL+GK                       DVKVDIDGENHP 
Sbjct: 301 HMAVVVREKENPEKPISGNQLEGKHLPSSFLFNLVDDTYNTNAHAARDVKVDIDGENHPQ 360

Query: 361 EKCLKSKRSLKRLNTFVDRSNSYRKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGI 420
           EK LKSKRSLKRLNTFVDRSN +RKFSGSKKWSKDF+SEVLHIADD+L KL+EEGEAIGI
Sbjct: 361 EKSLKSKRSLKRLNTFVDRSNYHRKFSGSKKWSKDFDSEVLHIADDMLAKLTEEGEAIGI 420

BLAST of HG10016628 vs. TAIR 10
Match: AT2G14520.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 557.4 bits (1435), Expect = 1.0e-158
Identity = 297/420 (70.71%), Positives = 351/420 (83.57%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAA 60
           M VEY CC T FF  I +++ LVLFAGLMSGLTLGLMSMS+V++EVLAKSG P DR HAA
Sbjct: 1   MAVEYECCGTSFFIHIAVIVLLVLFAGLMSGLTLGLMSMSLVDLEVLAKSGTPRDRIHAA 60

Query: 61  RILPVVRGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           +ILPVV+ QHLLLCTLLICNAAAMEALPIFLD+LVTAWGAILISVTLILLFGEIIPQ+VC
Sbjct: 61  KILPVVKNQHLLLCTLLICNAAAMEALPIFLDALVTAWGAILISVTLILLFGEIIPQSVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNE 180
           SR+GLAIGATVAPFVRVLVWIC PVA+PISKLLD  LG G  ALFRRAELKTLVD HGNE
Sbjct: 121 SRHGLAIGATVAPFVRVLVWICLPVAWPISKLLDFLLGHGRVALFRRAELKTLVDLHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSR 240
           AGKGGELT DETTII GALELSEK+A+DAMTPIS+TF IDINA LD  L+ LIL+KGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELSEKMAKDAMTPISDTFVIDINAKLDRDLMNLILDKGHSR 240

Query: 241 VPVFYERPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           VPV+YE+ T+IIGLVLVKNL+T    + + +KN  IR+IPRV ET+PLYDILN+FQKGHS
Sbjct: 241 VPVYYEQRTNIIGLVLVKNLLTINPDEEIQVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300

Query: 301 HMAVVVR--EKENPEKSVNGNQLKGKDVKVDIDGENHPEKC-LKSKRSLKRLNTFVDRSN 360
           HMAVVVR  +K +P +S +       +V+VD+D E  P++  LK +RSL++  +F +R+N
Sbjct: 301 HMAVVVRQCDKIHPLQSNDAANETVNEVRVDVDYERSPQETKLKRRRSLQKWKSFPNRAN 360

Query: 361 SYRKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDY 418
           S    S SK+WSKD ++++L + +  LPKL EE +A+GIIT+EDVIEELLQEEI+DETD+
Sbjct: 361 SLG--SRSKRWSKDNDADILQLNEHPLPKLDEEEDAVGIITMEDVIEELLQEEIFDETDH 418

BLAST of HG10016628 vs. TAIR 10
Match: AT4G33700.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 554.7 bits (1428), Expect = 6.5e-158
Identity = 295/422 (69.91%), Positives = 352/422 (83.41%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAA 60
           M VEY CC+  FF  I +++FLVLFAGLMSGLTLGLMS+S+V++EVLAKSG P  RK+AA
Sbjct: 1   MAVEYVCCSPNFFIHIAVIVFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPEHRKYAA 60

Query: 61  RILPVVRGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           +ILPVV+ QHLLL TLLICNAAAME LPIFLD LVTAWGAILISVTLILLFGEIIPQ++C
Sbjct: 61  KILPVVKNQHLLLVTLLICNAAAMETLPIFLDGLVTAWGAILISVTLILLFGEIIPQSIC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNE 180
           SRYGLAIGATVAPFVRVLV+IC PVA+PISKLLD  LG    ALFRRAELKTLVDFHGNE
Sbjct: 121 SRYGLAIGATVAPFVRVLVFICLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSR 240
           AGKGGELT DETTII GALELSEK+ +DAMTPIS+ F IDINA LD  L+ LILEKGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELSEKMVKDAMTPISDIFVIDINAKLDRDLMNLILEKGHSR 240

Query: 241 VPVFYERPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           VPV+YE+PT+IIGLVLVKNL+T    + +P+KN  IR+IPRV E +PLYDILN+FQKG S
Sbjct: 241 VPVYYEQPTNIIGLVLVKNLLTINPDEEIPVKNVTIRRIPRVPEILPLYDILNEFQKGLS 300

Query: 301 HMAVVVR--EKENPEKSVNGNQLKGKDVKVDIDGENHP---EKCLKSKRSLKRLNTFVDR 360
           HMAVVVR  +K +P  S NG+    K+ +VD+D E  P   E+ L++KRSL++  +F +R
Sbjct: 301 HMAVVVRQCDKIHPLPSKNGSV---KEARVDVDSEGTPTPQERMLRTKRSLQKWKSFPNR 360

Query: 361 SNSYRKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDET 418
           ++S++  S SKKWSKD ++++L +  + LPKL+EE EA+GIIT+EDVIEELLQEEI+DET
Sbjct: 361 ASSFKGGSKSKKWSKDNDADILQLNGNPLPKLAEEEEAVGIITMEDVIEELLQEEIFDET 419

BLAST of HG10016628 vs. TAIR 10
Match: AT1G47330.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 417.9 bits (1073), Expect = 9.4e-117
Identity = 237/446 (53.14%), Positives = 302/446 (67.71%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAA 60
           M  +  CC T F   + I+I LV FAGLM+GLTLGLMS+ +V++EVL KSG+P DR +A 
Sbjct: 1   MSSDIPCCGTTFSLYVVIIIALVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRINAG 60

Query: 61  RILPVVRGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           +I PVV+ QHLLLCTLLI N+ AMEALPIFLD +V  W AIL+SVTLIL+FGEI+PQAVC
Sbjct: 61  KIFPVVKNQHLLLCTLLIGNSMAMEALPIFLDKIVPPWLAILLSVTLILVFGEIMPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNE 180
           +RYGL +GA +APFVRVL+ + FP++YPISK+LD  LGKGH  L RRAELKT V+FHGNE
Sbjct: 121 TRYGLKVGAIMAPFVRVLLVLFFPISYPISKVLDWMLGKGHGVLLRRAELKTFVNFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSR 240
           AGKGG+LT DET+II GALEL+EK A+DAMTPIS  F+++++  L+   +  I+  GHSR
Sbjct: 181 AGKGGDLTTDETSIITGALELTEKTAKDAMTPISNAFSLELDTPLNLETLNTIMSVGHSR 240

Query: 241 VPVFYERPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           VPV++  PT IIGL+LVKNL+   +   VP++   +RKIPRVSETMPLYDILN+FQKGHS
Sbjct: 241 VPVYFRNPTHIIGLILVKNLLAVDARKEVPLRKMSMRKIPRVSETMPLYDILNEFQKGHS 300

Query: 301 HMAVVVR----EKENPEKSVNG-NQLKGKDVKVDIDGENHPEKCLKSKRSLKRLNTFVDR 360
           H+AVV +    ++++PE S NG  + K K  K     E   + C K K   +     V  
Sbjct: 301 HIAVVYKDLDEQEQSPETSENGIERRKNKKTK----DELFKDSCRKPKAQFE-----VSE 360

Query: 361 SNSYRKFSGSKKWSKDFNSE-------------------------VLHIADDLLPKLSEE 417
              ++  +G  K  K  N E                         +L I +  +P     
Sbjct: 361 KEVFKIETGDAKSGKSENGEEQQGSGKTSLLAAPAKKRHRGCSFCILDIENTPIPDFPTN 420

BLAST of HG10016628 vs. TAIR 10
Match: AT5G52790.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 401.0 bits (1029), Expect = 1.2e-111
Identity = 228/417 (54.68%), Positives = 287/417 (68.82%), Query Frame = 0

Query: 7   CCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAARILPVV 66
           CC T F+  + + + LV+FAGLMSGLTLGLMS+SIVE+EV+ K+G+P DRK+A +ILP+V
Sbjct: 8   CCETMFWVYLLVCVALVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNAEKILPLV 67

Query: 67  RGQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVCSRYGLA 126
           + QHLLLCTLLI NA AMEALPIF+DSL+ AWGAILISVTLIL FGEIIPQAVCSRYGL+
Sbjct: 68  KNQHLLLCTLLIGNALAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAVCSRYGLS 127

Query: 127 IGATVAPFVRVLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNEAGKGGE 186
           IGA ++  VR+++ + FP++YPISKLLDL LGK H  L  RAELK+LV  HGNEAGKGGE
Sbjct: 128 IGAKLSFLVRLIIIVFFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGNEAGKGGE 187

Query: 187 LTRDETTIIGGALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSRVPVFYE 246
           LT DETTII GAL++S+K A+DAMTP+S+ F++DIN  LD   + LI   GHSR+P++  
Sbjct: 188 LTHDETTIISGALDMSQKSAKDAMTPVSQIFSLDINFKLDEKTMGLIASAGHSRIPIYSV 247

Query: 247 RPTSIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHSHMAVVV 306
            P  IIG +LVKNLI     D   I++ PIR++P+V   +PLYDILN FQ G SHMA VV
Sbjct: 248 NPNVIIGFILVKNLIKVRPEDETSIRDLPIRRMPKVDLNLPLYDILNIFQTGRSHMAAVV 307

Query: 307 REKENP-------EKSVNGNQLKGKDVKVDIDGENHPEKCLKSKRSLKRLNTFVDRSNSY 366
             K +        EKS+NG+  K  +V + I   N  E   +S   ++ +++  D     
Sbjct: 308 GTKNHTNTNTPVHEKSINGSPNKDANVFLSIPALNSSETSHQS--PIRYIDSISD----- 367

Query: 367 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETD 417
                                        E+ E IGIITLEDV+EEL+QEEIYDETD
Sbjct: 368 -----------------------------EDEEVIGIITLEDVMEELIQEEIYDETD 388

BLAST of HG10016628 vs. TAIR 10
Match: AT4G14240.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 367.1 bits (941), Expect = 1.9e-101
Identity = 208/400 (52.00%), Positives = 274/400 (68.50%), Query Frame = 0

Query: 17  GIVIFLVLFAGLMSGLTLGLMSMSIVEIEVLAKSGKPSDRKHAARILPVVRGQHLLLCTL 76
           GI  FLVLFAG+MSGLTLGLMS+ +VE+E+L +SG P+++K AA I PVV+ QH LL TL
Sbjct: 40  GISCFLVLFAGIMSGLTLGLMSLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTL 99

Query: 77  LICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVCSRYGLAIGATVAPFVR 136
           L+CNA AME LPI+LD L   + AI++SVT +L FGE+IPQA+C+RYGLA+GA     VR
Sbjct: 100 LLCNAMAMEGLPIYLDKLFNEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVR 159

Query: 137 VLVWICFPVAYPISKLLDLSLGKGHKALFRRAELKTLVDFHGNEAGKGGELTRDETTIIG 196
           +L+ +C+P+A+PI K+LDL LG  + ALFRRA+LK LV  H  EAGKGGELT DETTII 
Sbjct: 160 ILMTLCYPIAFPIGKILDLVLGH-NDALFRRAQLKALVSIHSQEAGKGGELTHDETTIIS 219

Query: 197 GALELSEKVARDAMTPISETFAIDINANLDSTLIKLILEKGHSRVPVFYERPTSIIGLVL 256
           GAL+L+EK A++AMTPI  TF++D+N+ LD   +  IL +GHSRVPV+   P ++IGL+L
Sbjct: 220 GALDLTEKTAQEAMTPIESTFSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLL 279

Query: 257 VKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHSHMAVVVREKENPEKSV 316
           VK+L+T        +    IR+IPRV   MPLYDILN+FQKG SHMA VV+ K   +  V
Sbjct: 280 VKSLLTVRPETETLVSAVCIRRIPRVPADMPLYDILNEFQKGSSHMAAVVKVK--GKSKV 339

Query: 317 NGNQLKGKDVKVDIDGENHPEKCLKSKRSLKRLNTFVDRSNSYRKFSGSKKWSKDFNSEV 376
             + L  +      D +      LK + +   +   +D++N    F  ++     F+   
Sbjct: 340 PPSTLLEEHTDESNDSDLTAPLLLKREGNHDNVIVTIDKANGQSFFQNNESGPHGFS--- 399

Query: 377 LHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETD 417
            H ++ +     E+GE IGIITLEDV EELLQEEI DETD
Sbjct: 400 -HTSEAI-----EDGEVIGIITLEDVFEELLQEEIVDETD 427

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_008440429.1  2.9e-216  94.99         PREDICTED: DUF21 domain-containing protein At4g33700-like isoform X1 [Cucumis me...
XP_004143412.1  1.5e-212  92.86         DUF21 domain-containing protein At4g33700 isoform X2 [Cucumis sativus]
XP_038882679.1  2.5e-212  93.81         DUF21 domain-containing protein At4g33700-like isoform X2 [Benincasa hispida]
XP_022963200.1  8.1e-211  91.67         DUF21 domain-containing protein At2g14520-like isoform X2 [Cucurbita moschata]
XP_031743033.1  4.0e-210  90.70         DUF21 domain-containing protein At4g33700 isoform X1 [Cucumis sativus]

Match Name      E-value   Identity (%)  Description
Q9ZQR4          1.4e-157  70.71         DUF21 domain-containing protein At2g14520 OS=Arabidopsis thaliana OX=3702 GN=CBS...
Q8VZI2          9.1e-157  69.91         DUF21 domain-containing protein At4g33700 OS=Arabidopsis thaliana OX=3702 GN=CBS...
Q8RY60          1.3e-115  53.14         DUF21 domain-containing protein At1g47330 OS=Arabidopsis thaliana OX=3702 GN=CBS...
Q9LTD8          1.7e-110  54.68         DUF21 domain-containing protein At5g52790 OS=Arabidopsis thaliana OX=3702 GN=CBS...
Q67XQ0          2.7e-100  52.00         DUF21 domain-containing protein At4g14240 OS=Arabidopsis thaliana OX=3702 GN=CBS...

Match Name      E-value   Identity (%)  Description
A0A5A7SYT7      1.4e-216  94.99         DUF21 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_...
A0A1S3B147      1.4e-216  94.99         DUF21 domain-containing protein At4g33700-like isoform X1 OS=Cucumis melo OX=365...
A0A0A0KG11      7.1e-213  92.86         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G496430 PE=4 SV=1
A0A6J1HHB3      3.9e-211  91.67         DUF21 domain-containing protein At2g14520-like isoform X2 OS=Cucurbita moschata ...
A0A6J1HJD9      3.1e-208  87.36         DUF21 domain-containing protein At2g14520-like isoform X1 OS=Cucurbita moschata ...

Match Name      E-value   Identity (%)  Description
AT2G14520.1     1.0e-158  70.71         CBS domain-containing protein with a domain of unknown function (DUF21)
AT4G33700.1     6.5e-158  69.91         CBS domain-containing protein with a domain of unknown function (DUF21)
AT1G47330.1     9.4e-117  53.14         CBS domain-containing protein with a domain of unknown function (DUF21)
AT5G52790.1     1.2e-111  54.68         CBS domain-containing protein with a domain of unknown function (DUF21)
AT4G14240.1     1.9e-101  52.00         CBS domain-containing protein with a domain of unknown function (DUF21)
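
The Identity values in the summary tables repeat the percentages printed on each match's HSP line, for example 385/420 for A0A6J1HHB3. As a minimal sketch, assuming the HSP line keeps the layout shown in the alignments above, the percentages can be recomputed from the raw counts. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of the database's own tooling.

```python
import re

# Matches the HSP summary line from the alignments above.
# Assumption: the label may be spelled "Positives" or "Postives"
# depending on the export, so the pattern accepts both.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return recomputed identity/positives percentages, or None if no match."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return {
        "alignment_length": ident_len,
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
    }


if __name__ == "__main__":
    line = "Identity = 385/420 (91.67%), Positives = 403/420 (95.95%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

Recomputing from the counts rather than reading the printed percentage gives a quick consistency check between the alignment blocks and the summary tables.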
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                          Source       Source Term     Source Description                                Alignment
IPR002550  CNNM, transmembrane domain               PFAM         PF01595         DUF21                                             coord: 18..185; e-value: 6.4E-36; score: 123.6
IPR002550  CNNM, transmembrane domain               PROSITE      PS51846         CNNM                                              coord: 8..191; score: 53.826393
None       No IPR available                         GENE3D       3.10.580.10     -                                                 coord: 191..323; e-value: 2.6E-38; score: 133.2
None       No IPR available                         GENE3D       3.10.580.10     -                                                 coord: 343..414; e-value: 3.7E-7; score: 32.1
None       No IPR available                         PANTHER      PTHR12064:SF57  AND COBALT EFFLUX PROTEIN CORC, PUTATIVE-RELATED  coord: 1..418
None       No IPR available                         SUPERFAMILY  54631           CBS-domain pair                                   coord: 197..408
IPR045095  Ancient conserved domain protein family  PANTHER      PTHR12064       ANCIENT CONSERVED DOMAIN PROTEIN-RELATED          coord: 1..418
IPR000644  CBS domain                               PROSITE      PS51371         CBS                                               coord: 210..271; score: 8.588084
IPR044751  Ion transporter-like, CBS domain         CDD          cd04590         CBS_pair_CorC_HlyC_assoc                          coord: 205..308; e-value: 5.48341E-27; score: 102.189

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10016628.1  HG10016628.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0010960      magnesium ion homeostasis
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0043231      intracellular membrane-bounded organelle