Sed0021149 (gene) Chayote v1

Overview
NameSed0021149
Typegene
OrganismSechium edule (Chayote v1)
DescriptionDUF21 domain-containing protein
LocationLG14: 18631656 .. 18636212 (-)
RNA-Seq ExpressionSed0021149
SyntenySed0021149
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAAATTAACCACATTAATCGACCAACCATACAGAATCCAAGCAGGAACATGCATATATAACATGTCTTTTTTTGCTGTCTCCGAGAATTAGTACTTTTCAGCTAGTTTTCAAAGAAACTCTTTCTGAACCAACACACCAAGTCAAGCAGAGGAAGACAGCACTGGAAAATGGGAGTAGAGTATAGCTGCTGCACAACAGGATTTTTCAGTCGCATTGGAATAATCATCTTCTTGGTGTTGTTTGCTGGGTTGATGTCTGGCCTAACTCTTGGTCTCATGTCTATGAGCCTGGTTGAGATCGAAGTTCTTGCCAAATCTGGAAAGCCGAGCAACCGTAAACATGCTGGTGCTCATCTTCATCATGGGAAAAAATTGACTCTTTTACTTGATGGGTTTCTTATTTTGATGTTTTTTACTAACAAGAACTTGACAATCTCTCATTGACAGCAAAGATTCTACCAGTTGTTAGAAGACAACACTTGTTGCTTTGCACTTTATTGATCTGCAATGCTGCAGCCATGGAGGTATAACTGTAAACCGATTGAAAGTTGAGTTTTTTTGCCATTTTGTATCAGCGGAAAGGAGTCCCCAAGGGGTGGCTTCGTTGGTTGGGGCTGAGAGCCTATAAGGGATGCATTTTTGGAGGTCTCAGGTTTGAGACCTACGAGTGTAGGTCATTGTAAATCTCTTGTTGTCTCTCGAGTTTGAGCCTTGTGATAAGTGCGAATGCCCCTGGGTATAGGGGAGAGAAGCTTTGATTCTCGATCATAAAAAAAGGCAAAAAACTAATGGATTTTTGTTTACTTTGACAGACACTTCCAATATTTTTGGACAGTTTGGTGACAGCTTGGGGAGCTATATTGATCTCTGTCACTTTGATCTTGTTATTTGGTGAGGTGATGTATCAAATTTTCACTTAGTTTGTTAGTATTTTCATGGAGAACAATTTGTGTGAATTTTAGGTGAAGCTTATGTAAGAGGTTAGAATTTTGAACTTGTTCTTATGCTGTATGTTTCTGCAGATTATTCCACAAGCTGTTTGTTCTAGATATGGTTTGGCAATTGGTGCAACCGTGGCTCCATTTGTGAGGGTTCTTGTTTGGATTTGCTTTCCTGTTGCATATCCTATTAGCAAGGTCATTGTGTTTCTTCAGCTGAATTTTATTGCCAAAGAACACTTAATACATACAAAATCTTTTATTTCACTAAGGGTGTGTTTGGGCCACGTTTATGTAAACGACCGAGTTGGTTATAATATAACTCAACCGATGTTTACCCTGCCGTTTATTATAAACGGTGGTTACCTAAACCTGCGTGCCCATTTATCTAAACCTACGTGCCCAGTTATCTAAACCTTACATATATTACTTTTTGTTGCATAACCAGCTCCTCAAACACCATTTATCATAATCCATGTTATAATAACTTGCCCCTCAAACACAATATAATAACACACCGTTAATTATAACCCACCGTTTATCATAACTTTCCGTCTATCATAACCAAACAGGTGTTCCAAACACACCCTAAATATATATTTTTTAAATCTAGTTTTGAAGGAGATGTTTGGTAGGAGGGTTTGGAGGGATTTGAAGGGTTTTGGAGTTGAAACCTTGTTTGGTATGAGAAATTTGAAAGAGATAAAACTGGTTGGTACAAGAAATTTGGAAGAGATAAATCCTTGCTTGGTATGAGAAGTTTGGAGGAGATAAAAACTTGTTTGGTACAAGGGATTAAAAATAATTGAGATGGAAGTGCTAGAAAACCGTACTTGGTACGAAGGGTTGAGAGTGGTTGAAAATGGGAGTGGCAGAAAGCTTGTTTGATACGAAGGATTGGAAATGGTTGGTGTTTGGGAGGTATACTATTTGCCCCTCCCACCAAACAAGAGAAAAGAAATCTTACTCCAAACTCCCCTCTTCCACTCATACCAAACACCCCCAAACTGTTTAACAAAAGAAAAACGTAAGTAAACATGAAAGCAAAAACAGAGAATTTATGATCTGAGCCTCAGTTCATGCAGTCACATGGTCAATTTATCAAGAAGAATGAATTAAATTGGAGTTTTGGTTCCAAATGGGTAGAAACGGAACAGTACCAGAGCCTGAAAATTGATATTTTGATTTTGTTTTGATCAGCTGTTGGACTTTTCACTTGGTAAAGAACACAAAGCTCTGTTACGTAGATCAGAACTAAAAACACTTGTAGACTTTCATGGCAATGAGGTACTTTGGCTCAAACACAAGCAACAACCCTCTTCATCTCTTGTTCTCACTTGTGTGGCTGAGATTGGTTTCATAAAAATGCAGGCTGGGAAAGGAGGAGAACTGACACGAGACGAAACAACGATAATAGGAGGAGCACTCGAACTCACCGAGAAGGTTGCAAGAGACGTCATGACTCCCCTCTCTGAAACTTTTGCGATCGATATTAATGCTCATCTCGATAGGTTTCATCTTCAACACAAAAATACACCAACTGCATTATTCTTCTTTCCTCCATCCTAATAAATATGCTTATGAAAACAAATGCAGCAACTTGATCAAGTTAATTATGGAGAAGGGGCATAGCAGATTACCTGTGTTTTATGAACACCCTGAAAATATCATTGGCCTCGTATTGGTAATGTTTATCAATGATGATAATAATCATAACTATAACATTCTTTTCTCTGATATCATCATATTTAAATTGCAGGTGAAGAATTTAATAACTAGGCAGTCACCAGATGGGGTACCAATTAAGAACTTCCCAATTCGAAAAATTCCGAGGTACTTTTATGCCCCAGCTTTTATAATTAACCTAACTCATTTGTTTTGAACCTAAGACCTCTAAAGAAGAAAATCTTGGAGACCCCAAACCTCTACCAATGAGACCATTGTGAGACCTCAAGGAAGTGTTATTTTTGGGGACTGTATTATAATATGGCTAAGGGCATAGTAGAAGTTAGCTATTAGTGTTTTGCTTAAGTATGGGAATTGGTTCTAAGAAGGGTATGCATATTTTGGTATTGTTGTCTATTGAACTATGCTAAATCACCAATCAATCCAAAAACCTAAGCTGATGGATTGAGGTAAAATTTAATTATATCAATCAACATTGTTCAAGAGGTTGAGAGAGTGTAGTCCTCTTGAATGGCTATGATATTGTAACCCTTATCTATTTTTGAAACAAGGGTAACCACGCCTGTCCCTAGACTAGGTACTGGAGACATCGAAGGAGTAATATCACAGGTGAGTCTCGAACCTAGGACCTAAGCTCAAGTCTTCAACCACTGCACCACCCCTTGGGGACTGTAACCCTTATCTATTACATCAATATAATTTTCTTGTGTTTATTATTTTCGGTGTTAGTTCTATCTTGTGTTTATTGAGATGATCGGTGTTCTAACAGCCGCCCCTTGGGTTGAATTGAAAAAGAGCATAGTTAGAAAACCTACATGTAAAGAGCATCACCTAGTTGAAAGAAGAAAAATAAAACTTTCACTAAGAATCCAAACTTGTATATCAGGGTCTCAGAAACAATGCCATTGTACGACATACTAAATGATTTCCAGAAAGGTCACAGTCATATGGCAGTTGTTATTAGAGAAAAGGAAACTCTAGAGAAGTCCATTAGCAGTAACCAACTTGAAAGTAAGCATTACCTTCTTCATTTCTGCTTAACTTGGCTTCTTCACTTTGACAATGACACATATAACACAAATTTACATATGCAGCAAGAGATGTAAGAGTGGATATCGATGGTGAAAATCAACCACAGGAAAAAAGTTTAAAGAGCAAGAGATCACTGAAAAGACTAAACACATTTGTCGATCGCAGTAATTCATACCGAAAGTTCTCCGGAAGCAAGAAATGGTCTAAAGACTTCAATTCAGAGGTCTTGCATATTGCTGATGACCTGCTGCCCAAGCTCTCTGAAGAGGGGGAAGCAATTGGCATCATAACACTTGAAGATGTCATTGAGGAACTTTTACAGGTATTGATCCTCTATCTATAGTCTGTTCAACTATCATTGTTACAGAATTTGACTACCTTTTAACTTTTGTTCACTGCAGGAGGAAATCTATGATGAGACGGATTATCGAATCGTTTACTAGAAGACAATATGAAGAACTCAAGTAAAGTTTTGGATGATATGATTTCTTCTAGGAAAATAAGGAATGAACAATTACTTTGGAAATTATCTTTAGAGAGCTTGTCTTACTATTCAGTTTCCCGTATCCCTGTGACTGTAGAAGATATGCCCACCCAATGCAGACTTTTCTTCAAGATGTAACTTTTTTTAGTAGAAACATACAACTGCAAGTAGATAAGTGCAAATAGAAATCTGAAAACCAAATAGGCTATTGTCATCATGAATTTTCATACTAAATAAAAAATGAAGGGAAGTAGGAGTGGCTGTCTAAGTTGAAGGAAAAGTAAGAAGTAACAGTTCTTAAGTAATTGGTCAATTCAGAAATTTACATACCTGAAATTGTAGATGTCTGGAACTTTATACTCCTTGAAAATTGAAATTGTCAGATAGAAGCCATGTATAGTAGAGTTACACTTGGGC

mRNA sequence

CAAATTAACCACATTAATCGACCAACCATACAGAATCCAAGCAGGAACATGCATATATAACATGTCTTTTTTTGCTGTCTCCGAGAATTAGTACTTTTCAGCTAGTTTTCAAAGAAACTCTTTCTGAACCAACACACCAAGTCAAGCAGAGGAAGACAGCACTGGAAAATGGGAGTAGAGTATAGCTGCTGCACAACAGGATTTTTCAGTCGCATTGGAATAATCATCTTCTTGGTGTTGTTTGCTGGGTTGATGTCTGGCCTAACTCTTGGTCTCATGTCTATGAGCCTGGTTGAGATCGAAGTTCTTGCCAAATCTGGAAAGCCGAGCAACCGTAAACATGCTGCAAAGATTCTACCAGTTGTTAGAAGACAACACTTGTTGCTTTGCACTTTATTGATCTGCAATGCTGCAGCCATGGAGACACTTCCAATATTTTTGGACAGTTTGGTGACAGCTTGGGGAGCTATATTGATCTCTGTCACTTTGATCTTGTTATTTGGTGAGATTATTCCACAAGCTGTTTGTTCTAGATATGGTTTGGCAATTGGTGCAACCGTGGCTCCATTTGTGAGGGTTCTTGTTTGGATTTGCTTTCCTGTTGCATATCCTATTAGCAAGCTGTTGGACTTTTCACTTGGTAAAGAACACAAAGCTCTGTTACGTAGATCAGAACTAAAAACACTTGTAGACTTTCATGGCAATGAGGCTGGGAAAGGAGGAGAACTGACACGAGACGAAACAACGATAATAGGAGGAGCACTCGAACTCACCGAGAAGGTTGCAAGAGACGTCATGACTCCCCTCTCTGAAACTTTTGCGATCGATATTAATGCTCATCTCGATAGCAACTTGATCAAGTTAATTATGGAGAAGGGGCATAGCAGATTACCTGTGTTTTATGAACACCCTGAAAATATCATTGGCCTCGTATTGGTGAAGAATTTAATAACTAGGCAGTCACCAGATGGGGTACCAATTAAGAACTTCCCAATTCGAAAAATTCCGAGGGTCTCAGAAACAATGCCATTGTACGACATACTAAATGATTTCCAGAAAGGTCACAGTCATATGGCAGTTGTTATTAGAGAAAAGGAAACTCTAGAGAAGTCCATTAGCAGTAACCAACTTGAAACAAGAGATGTAAGAGTGGATATCGATGGTGAAAATCAACCACAGGAAAAAAGTTTAAAGAGCAAGAGATCACTGAAAAGACTAAACACATTTGTCGATCGCAGTAATTCATACCGAAAGTTCTCCGGAAGCAAGAAATGGTCTAAAGACTTCAATTCAGAGGTCTTGCATATTGCTGATGACCTGCTGCCCAAGCTCTCTGAAGAGGGGGAAGCAATTGGCATCATAACACTTGAAGATGTCATTGAGGAACTTTTACAGGAGGAAATCTATGATGAGACGGATTATCGAATCGTTTACTAGAAGACAATATGAAGAACTCAAGTAAAGTTTTGGATGATATGATTTCTTCTAGGAAAATAAGGAATGAACAATTACTTTGGAAATTATCTTTAGAGAGCTTGTCTTACTATTCAGTTTCCCGTATCCCTGTGACTGTAGAAGATATGCCCACCCAATGCAGACTTTTCTTCAAGATGTAACTTTTTTTAGTAGAAACATACAACTGCAAGTAGATAAGTGCAAATAGAAATCTGAAAACCAAATAGGCTATTGTCATCATGAATTTTCATACTAAATAAAAAATGAAGGGAAGTAGGAGTGGCTGTCTAAGTTGAAGGAAAAGTAAGAAGTAACAGTTCTTAAGTAATTGGTCAATTCAGAAATTTACATACCTGAAATTGTAGATGTCTGGAACTTTATACTCCTTGAAAATTGAAATTGTCAGATAGAAGCCATGTATAGTAGAGTTACACTTGGGC

Coding sequence (CDS)

ATGGGAGTAGAGTATAGCTGCTGCACAACAGGATTTTTCAGTCGCATTGGAATAATCATCTTCTTGGTGTTGTTTGCTGGGTTGATGTCTGGCCTAACTCTTGGTCTCATGTCTATGAGCCTGGTTGAGATCGAAGTTCTTGCCAAATCTGGAAAGCCGAGCAACCGTAAACATGCTGCAAAGATTCTACCAGTTGTTAGAAGACAACACTTGTTGCTTTGCACTTTATTGATCTGCAATGCTGCAGCCATGGAGACACTTCCAATATTTTTGGACAGTTTGGTGACAGCTTGGGGAGCTATATTGATCTCTGTCACTTTGATCTTGTTATTTGGTGAGATTATTCCACAAGCTGTTTGTTCTAGATATGGTTTGGCAATTGGTGCAACCGTGGCTCCATTTGTGAGGGTTCTTGTTTGGATTTGCTTTCCTGTTGCATATCCTATTAGCAAGCTGTTGGACTTTTCACTTGGTAAAGAACACAAAGCTCTGTTACGTAGATCAGAACTAAAAACACTTGTAGACTTTCATGGCAATGAGGCTGGGAAAGGAGGAGAACTGACACGAGACGAAACAACGATAATAGGAGGAGCACTCGAACTCACCGAGAAGGTTGCAAGAGACGTCATGACTCCCCTCTCTGAAACTTTTGCGATCGATATTAATGCTCATCTCGATAGCAACTTGATCAAGTTAATTATGGAGAAGGGGCATAGCAGATTACCTGTGTTTTATGAACACCCTGAAAATATCATTGGCCTCGTATTGGTGAAGAATTTAATAACTAGGCAGTCACCAGATGGGGTACCAATTAAGAACTTCCCAATTCGAAAAATTCCGAGGGTCTCAGAAACAATGCCATTGTACGACATACTAAATGATTTCCAGAAAGGTCACAGTCATATGGCAGTTGTTATTAGAGAAAAGGAAACTCTAGAGAAGTCCATTAGCAGTAACCAACTTGAAACAAGAGATGTAAGAGTGGATATCGATGGTGAAAATCAACCACAGGAAAAAAGTTTAAAGAGCAAGAGATCACTGAAAAGACTAAACACATTTGTCGATCGCAGTAATTCATACCGAAAGTTCTCCGGAAGCAAGAAATGGTCTAAAGACTTCAATTCAGAGGTCTTGCATATTGCTGATGACCTGCTGCCCAAGCTCTCTGAAGAGGGGGAAGCAATTGGCATCATAACACTTGAAGATGTCATTGAGGAACTTTTACAGGAGGAAATCTATGATGAGACGGATTATCGAATCGTTTACTAG

Protein sequence

MGVEYSCCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAAKILPVVRRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVCSRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNEAGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSRLPVFYEHPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHSHMAVVIREKETLEKSISSNQLETRDVRVDIDGENQPQEKSLKSKRSLKRLNTFVDRSNSYRKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRIVY
Homology
BLAST of Sed0021149 vs. NCBI nr
Match: XP_022963200.1 (DUF21 domain-containing protein At2g14520-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 750.4 bits (1936), Expect = 8.7e-213
Identity = 387/419 (92.36%), Postives = 404/419 (96.42%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAA 60
           MGVEYSCCT+GFFSRIGI++FLVLFAG+MSGLTLGLMSMSLVEIEVLAKSGKPS+RKHAA
Sbjct: 1   MGVEYSCCTSGFFSRIGIVVFLVLFAGMMSGLTLGLMSMSLVEIEVLAKSGKPSDRKHAA 60

Query: 61  KILPVVRRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           KILPVVRRQHLLLCTLLICNAAAME LPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  KILPVVRRQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNE 180
           SRYGLAIGA VAPFVR+LVWICFPVAYPISKLLDFSLGKEHKAL RR+ELKTLVDFHGNE
Sbjct: 121 SRYGLAIGAAVAPFVRLLVWICFPVAYPISKLLDFSLGKEHKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSR 240
           AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAID+NA+LDSNLIKLI+EKGHSR
Sbjct: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDVNANLDSNLIKLILEKGHSR 240

Query: 241 LPVFYEHPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           +PVFYEHP NIIGLVLVKNLIT  SPDGVPIKNFPIRK PRVSETMPLYDILNDFQKGHS
Sbjct: 241 VPVFYEHPANIIGLVLVKNLITGHSPDGVPIKNFPIRKCPRVSETMPLYDILNDFQKGHS 300

Query: 301 HMAVVIREKETLEKSISSNQLETRDVRVDIDGENQPQEKSLKSKRSLKRLNTFVDRSNSY 360
           HMAVV+REKE  EK IS NQLE RDV+VDIDGEN PQEKSLKSKRSLKRLNTFVDRSN +
Sbjct: 301 HMAVVVREKENPEKPISGNQLEARDVKVDIDGENHPQEKSLKSKRSLKRLNTFVDRSNYH 360

Query: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYR 420
           RKFSGSKKWSKDF+SEVLHIADD+L KL+EEGEAIGIITLEDVIEELLQEEIYDETDYR
Sbjct: 361 RKFSGSKKWSKDFDSEVLHIADDMLAKLTEEGEAIGIITLEDVIEELLQEEIYDETDYR 419

BLAST of Sed0021149 vs. NCBI nr
Match: XP_023518289.1 (DUF21 domain-containing protein At4g33700-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 747.7 bits (1929), Expect = 5.6e-212
Identity = 387/419 (92.36%), Postives = 403/419 (96.18%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAA 60
           MGVEYSCCT+GFFSRIGI++FLVLFAG+MSGLTLGLMSMSLVEIEVLAKSGKPS+RKHAA
Sbjct: 1   MGVEYSCCTSGFFSRIGIVVFLVLFAGMMSGLTLGLMSMSLVEIEVLAKSGKPSHRKHAA 60

Query: 61  KILPVVRRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           KILPVVRRQHLLLCTLLICNAAAME LPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  KILPVVRRQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNE 180
           SRYGLAIGA VAPFVR+LVWICFPVAYPISKLLDFSLGKEHKAL RR+ELKTLVDFHGNE
Sbjct: 121 SRYGLAIGAAVAPFVRLLVWICFPVAYPISKLLDFSLGKEHKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSR 240
           AGKGGELTRDETTII GALELTEKVARDVMTPLSETFAIDINA+LDSNLIKLI+EKGHSR
Sbjct: 181 AGKGGELTRDETTIIRGALELTEKVARDVMTPLSETFAIDINANLDSNLIKLILEKGHSR 240

Query: 241 LPVFYEHPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           +PVFYEHP NIIGLVLVKNLIT  SPDGVPIKNFPIRK PRVSETMPLYDILNDFQKGHS
Sbjct: 241 VPVFYEHPANIIGLVLVKNLITGHSPDGVPIKNFPIRKCPRVSETMPLYDILNDFQKGHS 300

Query: 301 HMAVVIREKETLEKSISSNQLETRDVRVDIDGENQPQEKSLKSKRSLKRLNTFVDRSNSY 360
           HMAVV+REKE  EK IS NQLE RDV+VDIDGEN PQEKSLKSKRSLKRLNTFVDRSN +
Sbjct: 301 HMAVVVREKENPEKPISGNQLEARDVKVDIDGENHPQEKSLKSKRSLKRLNTFVDRSNYH 360

Query: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYR 420
           RKFSGSKKWSKDF+SEVLHIADD+L KL+EEGEAIGIITLEDVIEELLQEEIYDETDYR
Sbjct: 361 RKFSGSKKWSKDFDSEVLHIADDMLAKLTEEGEAIGIITLEDVIEELLQEEIYDETDYR 419

BLAST of Sed0021149 vs. NCBI nr
Match: KAG6594951.1 (DUF21 domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 744.2 bits (1920), Expect = 6.2e-211
Identity = 387/419 (92.36%), Postives = 403/419 (96.18%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAA 60
           MGVEYSCCT+GFFSRIGI++FLVLFAG+MSGLTLGLMSMSLVEIEVLAKSGKPS+RKHAA
Sbjct: 1   MGVEYSCCTSGFFSRIGIVVFLVLFAGMMSGLTLGLMSMSLVEIEVLAKSGKPSDRKHAA 60

Query: 61  KILPVVRRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           KILPVVRRQHLLLCTLLICNAAAME LPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  KILPVVRRQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNE 180
           SRYGLAIGA VAPFVR+LVWICFPVAYPISKLLDFSLGK HKAL RR+ELKTLVDFHGNE
Sbjct: 121 SRYGLAIGAAVAPFVRLLVWICFPVAYPISKLLDFSLGK-HKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSR 240
           AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINA+LDSNLIKLI+EKGHSR
Sbjct: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINANLDSNLIKLILEKGHSR 240

Query: 241 LPVFYEHPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           +PVFYEHP NIIGLVLVKNLIT  SPDGVPIKNFPIRK PRVSETMPLYDILNDFQKGHS
Sbjct: 241 VPVFYEHPANIIGLVLVKNLITGHSPDGVPIKNFPIRKCPRVSETMPLYDILNDFQKGHS 300

Query: 301 HMAVVIREKETLEKSISSNQLETRDVRVDIDGENQPQEKSLKSKRSLKRLNTFVDRSNSY 360
           HMAVV+REKE  EK IS NQLE RDV+VDIDGEN PQEKSLKSKRSLKRLNTFVDRSN +
Sbjct: 301 HMAVVVREKENPEKPISGNQLEARDVKVDIDGENHPQEKSLKSKRSLKRLNTFVDRSNYH 360

Query: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYR 420
           RKFSGSKKWSKDF+SEVLHIADD+L KL+EEGEAIGIITLEDVIEELLQEEIYDETDYR
Sbjct: 361 RKFSGSKKWSKDFDSEVLHIADDMLAKLTEEGEAIGIITLEDVIEELLQEEIYDETDYR 418

BLAST of Sed0021149 vs. NCBI nr
Match: KAG7026912.1 (DUF21 domain-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 744.2 bits (1920), Expect = 6.2e-211
Identity = 387/419 (92.36%), Postives = 403/419 (96.18%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAA 60
           MGVEYSCCT+GFFSRIGI++FLVLFAG+MSGLTLGLMSMSLVEIEVLAKSGKPS+RKHAA
Sbjct: 34  MGVEYSCCTSGFFSRIGIVVFLVLFAGMMSGLTLGLMSMSLVEIEVLAKSGKPSDRKHAA 93

Query: 61  KILPVVRRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           KILPVVRRQHLLLCTLLICNAAAME LPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 94  KILPVVRRQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 153

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNE 180
           SRYGLAIGA VAPFVR+LVWICFPVAYPISKLLDFSLGK HKAL RR+ELKTLVDFHGNE
Sbjct: 154 SRYGLAIGAAVAPFVRLLVWICFPVAYPISKLLDFSLGK-HKALFRRAELKTLVDFHGNE 213

Query: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSR 240
           AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINA+LDSNLIKLI+EKGHSR
Sbjct: 214 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINANLDSNLIKLILEKGHSR 273

Query: 241 LPVFYEHPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           +PVFYEHP NIIGLVLVKNLIT  SPDGVPIKNFPIRK PRVSETMPLYDILNDFQKGHS
Sbjct: 274 VPVFYEHPANIIGLVLVKNLITGHSPDGVPIKNFPIRKCPRVSETMPLYDILNDFQKGHS 333

Query: 301 HMAVVIREKETLEKSISSNQLETRDVRVDIDGENQPQEKSLKSKRSLKRLNTFVDRSNSY 360
           HMAVV+REKE  EK IS NQLE RDV+VDIDGEN PQEKSLKSKRSLKRLNTFVDRSN +
Sbjct: 334 HMAVVVREKENPEKPISGNQLEARDVKVDIDGENHPQEKSLKSKRSLKRLNTFVDRSNYH 393

Query: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYR 420
           RKFSGSKKWSKDF+SEVLHIADD+L KL+EEGEAIGIITLEDVIEELLQEEIYDETDYR
Sbjct: 394 RKFSGSKKWSKDFDSEVLHIADDMLAKLTEEGEAIGIITLEDVIEELLQEEIYDETDYR 451

BLAST of Sed0021149 vs. NCBI nr
Match: XP_008440429.1 (PREDICTED: DUF21 domain-containing protein At4g33700-like isoform X1 [Cucumis melo] >KAA0036392.1 DUF21 domain-containing protein [Cucumis melo var. makuwa] >TYK12788.1 DUF21 domain-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 742.3 bits (1915), Expect = 2.4e-210
Identity = 387/419 (92.36%), Postives = 403/419 (96.18%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAA 60
           M VEYSCCTTGFFSRIGI+IFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPS+RKHAA
Sbjct: 1   MEVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSDRKHAA 60

Query: 61  KILPVVRRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           +ILPV RRQHLLLCTLLICNAAAME LPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  RILPVCRRQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNE 180
           SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLD SLGKEHKAL RR+ELKTLVDFHGNE
Sbjct: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDVSLGKEHKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSR 240
           AGKGGELTRDETTIIGGALELTEKVARDVMTP+SETFAIDINA+LDSNLIKLI+E+GHSR
Sbjct: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPISETFAIDINANLDSNLIKLILERGHSR 240

Query: 241 LPVFYEHPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           +PVFYE P NIIGLVLVKNLITR SPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS
Sbjct: 241 VPVFYERPTNIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300

Query: 301 HMAVVIREKETLEKSISSNQLETRDVRVDIDGENQPQEKSLKSKRSLKRLNTFVDRSNSY 360
           HMAVV+REKE  E S+  NQLE +DV+V+IDGENQ QEK LK+KRSLKRLNTFVDRSNS+
Sbjct: 301 HMAVVVREKENPEGSVGGNQLEAKDVKVEIDGENQ-QEKGLKTKRSLKRLNTFVDRSNSH 360

Query: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYR 420
           RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYR
Sbjct: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYR 418

BLAST of Sed0021149 vs. ExPASy Swiss-Prot
Match: Q8VZI2 (DUF21 domain-containing protein At4g33700 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF6 PE=1 SV=1)

HSP 1 Score: 554.7 bits (1428), Expect = 9.1e-157
Identity = 290/420 (69.05%), Postives = 350/420 (83.33%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAA 60
           M VEY CC+  FF  I +I+FLVLFAGLMSGLTLGLMS+SLV++EVLAKSG P +RK+AA
Sbjct: 1   MAVEYVCCSPNFFIHIAVIVFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPEHRKYAA 60

Query: 61  KILPVVRRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           KILPVV+ QHLLL TLLICNAAAMETLPIFLD LVTAWGAILISVTLILLFGEIIPQ++C
Sbjct: 61  KILPVVKNQHLLLVTLLICNAAAMETLPIFLDGLVTAWGAILISVTLILLFGEIIPQSIC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNE 180
           SRYGLAIGATVAPFVRVLV+IC PVA+PISKLLDF LG    AL RR+ELKTLVDFHGNE
Sbjct: 121 SRYGLAIGATVAPFVRVLVFICLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSR 240
           AGKGGELT DETTII GALEL+EK+ +D MTP+S+ F IDINA LD +L+ LI+EKGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELSEKMVKDAMTPISDIFVIDINAKLDRDLMNLILEKGHSR 240

Query: 241 LPVFYEHPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           +PV+YE P NIIGLVLVKNL+T    + +P+KN  IR+IPRV E +PLYDILN+FQKG S
Sbjct: 241 VPVYYEQPTNIIGLVLVKNLLTINPDEEIPVKNVTIRRIPRVPEILPLYDILNEFQKGLS 300

Query: 301 HMAVVIREKETLEKSISSNQLETRDVRVDIDGEN--QPQEKSLKSKRSLKRLNTFVDRSN 360
           HMAVV+R+ + +    S N    ++ RVD+D E    PQE+ L++KRSL++  +F +R++
Sbjct: 301 HMAVVVRQCDKIHPLPSKNG-SVKEARVDVDSEGTPTPQERMLRTKRSLQKWKSFPNRAS 360

Query: 361 SYRKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDY 419
           S++  S SKKWSKD ++++L +  + LPKL+EE EA+GIIT+EDVIEELLQEEI+DETD+
Sbjct: 361 SFKGGSKSKKWSKDNDADILQLNGNPLPKLAEEEEAVGIITMEDVIEELLQEEIFDETDH 419

BLAST of Sed0021149 vs. ExPASy Swiss-Prot
Match: Q9ZQR4 (DUF21 domain-containing protein At2g14520 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF3 PE=2 SV=2)

HSP 1 Score: 553.1 bits (1424), Expect = 2.7e-156
Identity = 292/421 (69.36%), Postives = 349/421 (82.90%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAA 60
           M VEY CC T FF  I +I+ LVLFAGLMSGLTLGLMSMSLV++EVLAKSG P +R HAA
Sbjct: 1   MAVEYECCGTSFFIHIAVIVLLVLFAGLMSGLTLGLMSMSLVDLEVLAKSGTPRDRIHAA 60

Query: 61  KILPVVRRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           KILPVV+ QHLLLCTLLICNAAAME LPIFLD+LVTAWGAILISVTLILLFGEIIPQ+VC
Sbjct: 61  KILPVVKNQHLLLCTLLICNAAAMEALPIFLDALVTAWGAILISVTLILLFGEIIPQSVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNE 180
           SR+GLAIGATVAPFVRVLVWIC PVA+PISKLLDF LG    AL RR+ELKTLVD HGNE
Sbjct: 121 SRHGLAIGATVAPFVRVLVWICLPVAWPISKLLDFLLGHGRVALFRRAELKTLVDLHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSR 240
           AGKGGELT DETTII GALEL+EK+A+D MTP+S+TF IDINA LD +L+ LI++KGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELSEKMAKDAMTPISDTFVIDINAKLDRDLMNLILDKGHSR 240

Query: 241 LPVFYEHPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           +PV+YE   NIIGLVLVKNL+T    + + +KN  IR+IPRV ET+PLYDILN+FQKGHS
Sbjct: 241 VPVYYEQRTNIIGLVLVKNLLTINPDEEIQVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300

Query: 301 HMAVVIREKE---TLEKSISSNQLETRDVRVDIDGENQPQEKSLKSKRSLKRLNTFVDRS 360
           HMAVV+R+ +    L+ + ++N+    +VRVD+D E  PQE  LK +RSL++  +F +R+
Sbjct: 301 HMAVVVRQCDKIHPLQSNDAANE-TVNEVRVDVDYERSPQETKLKRRRSLQKWKSFPNRA 360

Query: 361 NSYRKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETD 419
           NS    S SK+WSKD ++++L + +  LPKL EE +A+GIIT+EDVIEELLQEEI+DETD
Sbjct: 361 NSLG--SRSKRWSKDNDADILQLNEHPLPKLDEEEDAVGIITMEDVIEELLQEEIFDETD 418

BLAST of Sed0021149 vs. ExPASy Swiss-Prot
Match: Q8RY60 (DUF21 domain-containing protein At1g47330 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF7 PE=1 SV=1)

HSP 1 Score: 410.6 bits (1054), Expect = 2.1e-113
Identity = 232/437 (53.09%), Postives = 309/437 (70.71%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAA 60
           M  +  CC T F   + III LV FAGLM+GLTLGLMS+ LV++EVL KSG+P +R +A 
Sbjct: 1   MSSDIPCCGTTFSLYVVIIIALVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRINAG 60

Query: 61  KILPVVRRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           KI PVV+ QHLLLCTLLI N+ AME LPIFLD +V  W AIL+SVTLIL+FGEI+PQAVC
Sbjct: 61  KIFPVVKNQHLLLCTLLIGNSMAMEALPIFLDKIVPPWLAILLSVTLILVFGEIMPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNE 180
           +RYGL +GA +APFVRVL+ + FP++YPISK+LD+ LGK H  LLRR+ELKT V+FHGNE
Sbjct: 121 TRYGLKVGAIMAPFVRVLLVLFFPISYPISKVLDWMLGKGHGVLLRRAELKTFVNFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSR 240
           AGKGG+LT DET+II GALELTEK A+D MTP+S  F+++++  L+   +  IM  GHSR
Sbjct: 181 AGKGGDLTTDETSIITGALELTEKTAKDAMTPISNAFSLELDTPLNLETLNTIMSVGHSR 240

Query: 241 LPVFYEHPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           +PV++ +P +IIGL+LVKNL+   +   VP++   +RKIPRVSETMPLYDILN+FQKGHS
Sbjct: 241 VPVYFRNPTHIIGLILVKNLLAVDARKEVPLRKMSMRKIPRVSETMPLYDILNEFQKGHS 300

Query: 301 HMAVVIREKETLEKS--ISSNQLETRDVR-----VDIDGENQPQEKSLKSKRSLKRLNTF 360
           H+AVV ++ +  E+S   S N +E R  +     +  D   +P+ +   S++ + ++ T 
Sbjct: 301 HIAVVYKDLDEQEQSPETSENGIERRKNKKTKDELFKDSCRKPKAQFEVSEKEVFKIETG 360

Query: 361 VDRS-----NSYRKFSG--------SKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITL 418
             +S        ++ SG        +KK  +  +  +L I +  +P      E +G+IT+
Sbjct: 361 DAKSGKSENGEEQQGSGKTSLLAAPAKKRHRGCSFCILDIENTPIPDFPTNEEVVGVITM 420

BLAST of Sed0021149 vs. ExPASy Swiss-Prot
Match: Q9LTD8 (DUF21 domain-containing protein At5g52790 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF5 PE=2 SV=2)

HSP 1 Score: 385.2 bits (988), Expect = 9.6e-106
Identity = 224/412 (54.37%), Postives = 286/412 (69.42%), Query Frame = 0

Query: 7   CCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAAKILPVV 66
           CC T F+  + + + LV+FAGLMSGLTLGLMS+S+VE+EV+ K+G+P +RK+A KILP+V
Sbjct: 8   CCETMFWVYLLVCVALVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNAEKILPLV 67

Query: 67  RRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVCSRYGLA 126
           + QHLLLCTLLI NA AME LPIF+DSL+ AWGAILISVTLIL FGEIIPQAVCSRYGL+
Sbjct: 68  KNQHLLLCTLLIGNALAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAVCSRYGLS 127

Query: 127 IGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNEAGKGGE 186
           IGA ++  VR+++ + FP++YPISKLLD  LGK H  LL R+ELK+LV  HGNEAGKGGE
Sbjct: 128 IGAKLSFLVRLIIIVFFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGNEAGKGGE 187

Query: 187 LTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSRLPVFYE 246
           LT DETTII GAL++++K A+D MTP+S+ F++DIN  LD   + LI   GHSR+P++  
Sbjct: 188 LTHDETTIISGALDMSQKSAKDAMTPVSQIFSLDINFKLDEKTMGLIASAGHSRIPIYSV 247

Query: 247 HPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHSHMAVVI 306
           +P  IIG +LVKNLI  +  D   I++ PIR++P+V   +PLYDILN FQ G SHMA V+
Sbjct: 248 NPNVIIGFILVKNLIKVRPEDETSIRDLPIRRMPKVDLNLPLYDILNIFQTGRSHMAAVV 307

Query: 307 REKETLEKSISSNQLETRDVRVDIDGENQP-QEKSLKSKRSLKRLNTFVDRSNSYRKFSG 366
             K         N   T          N P  EKS+    + K  N F+    S    + 
Sbjct: 308 GTK---------NHTNT----------NTPVHEKSINGSPN-KDANVFL----SIPALNS 367

Query: 367 SKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETD 418
           S+   +     +  I+D       E+ E IGIITLEDV+EEL+QEEIYDETD
Sbjct: 368 SETSHQSPIRYIDSISD-------EDEEVIGIITLEDVMEELIQEEIYDETD 388

BLAST of Sed0021149 vs. ExPASy Swiss-Prot
Match: Q67XQ0 (DUF21 domain-containing protein At4g14240 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF1 PE=1 SV=1)

HSP 1 Score: 362.5 bits (929), Expect = 6.7e-99
Identity = 205/401 (51.12%), Postives = 274/401 (68.33%), Query Frame = 0

Query: 17  GIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAAKILPVVRRQHLLLCTL 76
           GI  FLVLFAG+MSGLTLGLMS+ LVE+E+L +SG P+ +K AA I PVV++QH LL TL
Sbjct: 40  GISCFLVLFAGIMSGLTLGLMSLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTL 99

Query: 77  LICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVCSRYGLAIGATVAPFVR 136
           L+CNA AME LPI+LD L   + AI++SVT +L FGE+IPQA+C+RYGLA+GA     VR
Sbjct: 100 LLCNAMAMEGLPIYLDKLFNEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVR 159

Query: 137 VLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNEAGKGGELTRDETTIIG 196
           +L+ +C+P+A+PI K+LD  LG  + AL RR++LK LV  H  EAGKGGELT DETTII 
Sbjct: 160 ILMTLCYPIAFPIGKILDLVLG-HNDALFRRAQLKALVSIHSQEAGKGGELTHDETTIIS 219

Query: 197 GALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSRLPVFYEHPENIIGLVL 256
           GAL+LTEK A++ MTP+  TF++D+N+ LD   +  I+ +GHSR+PV+  +P+N+IGL+L
Sbjct: 220 GALDLTEKTAQEAMTPIESTFSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLL 279

Query: 257 VKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHSHMAVVIREKETLEKSI 316
           VK+L+T +      +    IR+IPRV   MPLYDILN+FQKG SHMA V++ K   +   
Sbjct: 280 VKSLLTVRPETETLVSAVCIRRIPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSKVPP 339

Query: 317 SSNQLETRDVRVDIDGENQPQEKSLKSKRSLKRLNTFVDRSNSYRKFSGSKKWSKDFNSE 376
           S+   E  D   D D         LK + +   +   +D++N    F  ++     F+  
Sbjct: 340 STLLEEHTDESNDSD---LTAPLLLKREGNHDNVIVTIDKANGQSFFQNNESGPHGFS-- 399

Query: 377 VLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETD 418
             H ++ +     E+GE IGIITLEDV EELLQEEI DETD
Sbjct: 400 --HTSEAI-----EDGEVIGIITLEDVFEELLQEEIVDETD 427

BLAST of Sed0021149 vs. ExPASy TrEMBL
Match: A0A6J1HHB3 (DUF21 domain-containing protein At2g14520-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111463484 PE=4 SV=1)

HSP 1 Score: 750.4 bits (1936), Expect = 4.2e-213
Identity = 387/419 (92.36%), Postives = 404/419 (96.42%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAA 60
           MGVEYSCCT+GFFSRIGI++FLVLFAG+MSGLTLGLMSMSLVEIEVLAKSGKPS+RKHAA
Sbjct: 1   MGVEYSCCTSGFFSRIGIVVFLVLFAGMMSGLTLGLMSMSLVEIEVLAKSGKPSDRKHAA 60

Query: 61  KILPVVRRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           KILPVVRRQHLLLCTLLICNAAAME LPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  KILPVVRRQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNE 180
           SRYGLAIGA VAPFVR+LVWICFPVAYPISKLLDFSLGKEHKAL RR+ELKTLVDFHGNE
Sbjct: 121 SRYGLAIGAAVAPFVRLLVWICFPVAYPISKLLDFSLGKEHKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSR 240
           AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAID+NA+LDSNLIKLI+EKGHSR
Sbjct: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDVNANLDSNLIKLILEKGHSR 240

Query: 241 LPVFYEHPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           +PVFYEHP NIIGLVLVKNLIT  SPDGVPIKNFPIRK PRVSETMPLYDILNDFQKGHS
Sbjct: 241 VPVFYEHPANIIGLVLVKNLITGHSPDGVPIKNFPIRKCPRVSETMPLYDILNDFQKGHS 300

Query: 301 HMAVVIREKETLEKSISSNQLETRDVRVDIDGENQPQEKSLKSKRSLKRLNTFVDRSNSY 360
           HMAVV+REKE  EK IS NQLE RDV+VDIDGEN PQEKSLKSKRSLKRLNTFVDRSN +
Sbjct: 301 HMAVVVREKENPEKPISGNQLEARDVKVDIDGENHPQEKSLKSKRSLKRLNTFVDRSNYH 360

Query: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYR 420
           RKFSGSKKWSKDF+SEVLHIADD+L KL+EEGEAIGIITLEDVIEELLQEEIYDETDYR
Sbjct: 361 RKFSGSKKWSKDFDSEVLHIADDMLAKLTEEGEAIGIITLEDVIEELLQEEIYDETDYR 419

BLAST of Sed0021149 vs. ExPASy TrEMBL
Match: A0A5A7SYT7 (DUF21 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G003770 PE=4 SV=1)

HSP 1 Score: 742.3 bits (1915), Expect = 1.1e-210
Identity = 387/419 (92.36%), Postives = 403/419 (96.18%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAA 60
           M VEYSCCTTGFFSRIGI+IFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPS+RKHAA
Sbjct: 1   MEVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSDRKHAA 60

Query: 61  KILPVVRRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           +ILPV RRQHLLLCTLLICNAAAME LPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  RILPVCRRQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNE 180
           SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLD SLGKEHKAL RR+ELKTLVDFHGNE
Sbjct: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDVSLGKEHKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSR 240
           AGKGGELTRDETTIIGGALELTEKVARDVMTP+SETFAIDINA+LDSNLIKLI+E+GHSR
Sbjct: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPISETFAIDINANLDSNLIKLILERGHSR 240

Query: 241 LPVFYEHPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           +PVFYE P NIIGLVLVKNLITR SPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS
Sbjct: 241 VPVFYERPTNIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300

Query: 301 HMAVVIREKETLEKSISSNQLETRDVRVDIDGENQPQEKSLKSKRSLKRLNTFVDRSNSY 360
           HMAVV+REKE  E S+  NQLE +DV+V+IDGENQ QEK LK+KRSLKRLNTFVDRSNS+
Sbjct: 301 HMAVVVREKENPEGSVGGNQLEAKDVKVEIDGENQ-QEKGLKTKRSLKRLNTFVDRSNSH 360

Query: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYR 420
           RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYR
Sbjct: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYR 418

BLAST of Sed0021149 vs. ExPASy TrEMBL
Match: A0A1S3B147 (DUF21 domain-containing protein At4g33700-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103484876 PE=4 SV=1)

HSP 1 Score: 742.3 bits (1915), Expect = 1.1e-210
Identity = 387/419 (92.36%), Postives = 403/419 (96.18%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAA 60
           M VEYSCCTTGFFSRIGI+IFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPS+RKHAA
Sbjct: 1   MEVEYSCCTTGFFSRIGIVIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSDRKHAA 60

Query: 61  KILPVVRRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           +ILPV RRQHLLLCTLLICNAAAME LPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  RILPVCRRQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNE 180
           SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLD SLGKEHKAL RR+ELKTLVDFHGNE
Sbjct: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDVSLGKEHKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSR 240
           AGKGGELTRDETTIIGGALELTEKVARDVMTP+SETFAIDINA+LDSNLIKLI+E+GHSR
Sbjct: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPISETFAIDINANLDSNLIKLILERGHSR 240

Query: 241 LPVFYEHPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           +PVFYE P NIIGLVLVKNLITR SPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS
Sbjct: 241 VPVFYERPTNIIGLVLVKNLITRLSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300

Query: 301 HMAVVIREKETLEKSISSNQLETRDVRVDIDGENQPQEKSLKSKRSLKRLNTFVDRSNSY 360
           HMAVV+REKE  E S+  NQLE +DV+V+IDGENQ QEK LK+KRSLKRLNTFVDRSNS+
Sbjct: 301 HMAVVVREKENPEGSVGGNQLEAKDVKVEIDGENQ-QEKGLKTKRSLKRLNTFVDRSNSH 360

Query: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYR 420
           RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYR
Sbjct: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYR 418

BLAST of Sed0021149 vs. ExPASy TrEMBL
Match: A0A6J1KTE8 (DUF21 domain-containing protein At2g14520-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111497073 PE=4 SV=1)

HSP 1 Score: 737.3 bits (1902), Expect = 3.7e-209
Identity = 382/420 (90.95%), Postives = 400/420 (95.24%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAA 60
           MGVEYSCCT+GFFSRIGI++FLVLFAG+MSGLTLGLMSMSLVEIEVLA SGKPS+RKHAA
Sbjct: 1   MGVEYSCCTSGFFSRIGIVVFLVLFAGMMSGLTLGLMSMSLVEIEVLANSGKPSDRKHAA 60

Query: 61  KILPVVRRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           KILPVVRRQHLLLCTLLICNAAAME LPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  KILPVVRRQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNE 180
           SRYGLAIGA VAPFVR+LVWICFPVAYPISKLLDFSLGKEHKAL RR+ELKTLVDFHGNE
Sbjct: 121 SRYGLAIGAAVAPFVRLLVWICFPVAYPISKLLDFSLGKEHKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSR 240
           AGKGGELTRDETTII GALELTEKVARDVMTPLSETFAIDINA+LDSNLIKLI+EKGHSR
Sbjct: 181 AGKGGELTRDETTIIRGALELTEKVARDVMTPLSETFAIDINANLDSNLIKLILEKGHSR 240

Query: 241 LPVFYEHPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           +PVFYEHP NIIGLVLVKNLIT  SPDGVPIKNFPIRK PRVSETMPLYDILNDFQKGHS
Sbjct: 241 VPVFYEHPANIIGLVLVKNLITGHSPDGVPIKNFPIRKCPRVSETMPLYDILNDFQKGHS 300

Query: 301 HMAVVIREKETLEKSISSNQLETRDVRVDIDGENQPQEKSLKSKRSLKRLNTFVDRSNSY 360
           HMAVV+REKE  EK IS  QLE RDV+VDIDGEN PQEKSL+SKRSLKRLNT VDRSN +
Sbjct: 301 HMAVVVREKENPEKPISGIQLEVRDVKVDIDGENHPQEKSLRSKRSLKRLNTCVDRSNYH 360

Query: 361 RKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDYRI 420
           RKFSGSKKWSKDF+SEVLHIADD+L K +EEGEAIGIITLEDVIEE+LQEEIYDETDYRI
Sbjct: 361 RKFSGSKKWSKDFDSEVLHIADDMLAKFTEEGEAIGIITLEDVIEEILQEEIYDETDYRI 420

BLAST of Sed0021149 vs. ExPASy TrEMBL
Match: A0A6J1HJD9 (DUF21 domain-containing protein At2g14520-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111463484 PE=4 SV=1)

HSP 1 Score: 737.3 bits (1902), Expect = 3.7e-209
Identity = 387/442 (87.56%), Postives = 404/442 (91.40%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAA 60
           MGVEYSCCT+GFFSRIGI++FLVLFAG+MSGLTLGLMSMSLVEIEVLAKSGKPS+RKHAA
Sbjct: 1   MGVEYSCCTSGFFSRIGIVVFLVLFAGMMSGLTLGLMSMSLVEIEVLAKSGKPSDRKHAA 60

Query: 61  KILPVVRRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           KILPVVRRQHLLLCTLLICNAAAME LPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC
Sbjct: 61  KILPVVRRQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNE 180
           SRYGLAIGA VAPFVR+LVWICFPVAYPISKLLDFSLGKEHKAL RR+ELKTLVDFHGNE
Sbjct: 121 SRYGLAIGAAVAPFVRLLVWICFPVAYPISKLLDFSLGKEHKALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSR 240
           AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAID+NA+LDSNLIKLI+EKGHSR
Sbjct: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDVNANLDSNLIKLILEKGHSR 240

Query: 241 LPVFYEHPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           +PVFYEHP NIIGLVLVKNLIT  SPDGVPIKNFPIRK PRVSETMPLYDILNDFQKGHS
Sbjct: 241 VPVFYEHPANIIGLVLVKNLITGHSPDGVPIKNFPIRKCPRVSETMPLYDILNDFQKGHS 300

Query: 301 HMAVVIREKETLEKSISSNQLE-----------------------TRDVRVDIDGENQPQ 360
           HMAVV+REKE  EK IS NQLE                        RDV+VDIDGEN PQ
Sbjct: 301 HMAVVVREKENPEKPISGNQLEGKHLPSSFLFNLVDDTYNTNAHAARDVKVDIDGENHPQ 360

Query: 361 EKSLKSKRSLKRLNTFVDRSNSYRKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGI 420
           EKSLKSKRSLKRLNTFVDRSN +RKFSGSKKWSKDF+SEVLHIADD+L KL+EEGEAIGI
Sbjct: 361 EKSLKSKRSLKRLNTFVDRSNYHRKFSGSKKWSKDFDSEVLHIADDMLAKLTEEGEAIGI 420

BLAST of Sed0021149 vs. TAIR 10
Match: AT4G33700.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 554.7 bits (1428), Expect = 6.5e-158
Identity = 290/420 (69.05%), Postives = 350/420 (83.33%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAA 60
           M VEY CC+  FF  I +I+FLVLFAGLMSGLTLGLMS+SLV++EVLAKSG P +RK+AA
Sbjct: 1   MAVEYVCCSPNFFIHIAVIVFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPEHRKYAA 60

Query: 61  KILPVVRRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           KILPVV+ QHLLL TLLICNAAAMETLPIFLD LVTAWGAILISVTLILLFGEIIPQ++C
Sbjct: 61  KILPVVKNQHLLLVTLLICNAAAMETLPIFLDGLVTAWGAILISVTLILLFGEIIPQSIC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNE 180
           SRYGLAIGATVAPFVRVLV+IC PVA+PISKLLDF LG    AL RR+ELKTLVDFHGNE
Sbjct: 121 SRYGLAIGATVAPFVRVLVFICLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSR 240
           AGKGGELT DETTII GALEL+EK+ +D MTP+S+ F IDINA LD +L+ LI+EKGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELSEKMVKDAMTPISDIFVIDINAKLDRDLMNLILEKGHSR 240

Query: 241 LPVFYEHPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           +PV+YE P NIIGLVLVKNL+T    + +P+KN  IR+IPRV E +PLYDILN+FQKG S
Sbjct: 241 VPVYYEQPTNIIGLVLVKNLLTINPDEEIPVKNVTIRRIPRVPEILPLYDILNEFQKGLS 300

Query: 301 HMAVVIREKETLEKSISSNQLETRDVRVDIDGEN--QPQEKSLKSKRSLKRLNTFVDRSN 360
           HMAVV+R+ + +    S N    ++ RVD+D E    PQE+ L++KRSL++  +F +R++
Sbjct: 301 HMAVVVRQCDKIHPLPSKNG-SVKEARVDVDSEGTPTPQERMLRTKRSLQKWKSFPNRAS 360

Query: 361 SYRKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETDY 419
           S++  S SKKWSKD ++++L +  + LPKL+EE EA+GIIT+EDVIEELLQEEI+DETD+
Sbjct: 361 SFKGGSKSKKWSKDNDADILQLNGNPLPKLAEEEEAVGIITMEDVIEELLQEEIFDETDH 419

BLAST of Sed0021149 vs. TAIR 10
Match: AT2G14520.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 553.1 bits (1424), Expect = 1.9e-157
Identity = 292/421 (69.36%), Postives = 349/421 (82.90%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAA 60
           M VEY CC T FF  I +I+ LVLFAGLMSGLTLGLMSMSLV++EVLAKSG P +R HAA
Sbjct: 1   MAVEYECCGTSFFIHIAVIVLLVLFAGLMSGLTLGLMSMSLVDLEVLAKSGTPRDRIHAA 60

Query: 61  KILPVVRRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           KILPVV+ QHLLLCTLLICNAAAME LPIFLD+LVTAWGAILISVTLILLFGEIIPQ+VC
Sbjct: 61  KILPVVKNQHLLLCTLLICNAAAMEALPIFLDALVTAWGAILISVTLILLFGEIIPQSVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNE 180
           SR+GLAIGATVAPFVRVLVWIC PVA+PISKLLDF LG    AL RR+ELKTLVD HGNE
Sbjct: 121 SRHGLAIGATVAPFVRVLVWICLPVAWPISKLLDFLLGHGRVALFRRAELKTLVDLHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSR 240
           AGKGGELT DETTII GALEL+EK+A+D MTP+S+TF IDINA LD +L+ LI++KGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELSEKMAKDAMTPISDTFVIDINAKLDRDLMNLILDKGHSR 240

Query: 241 LPVFYEHPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           +PV+YE   NIIGLVLVKNL+T    + + +KN  IR+IPRV ET+PLYDILN+FQKGHS
Sbjct: 241 VPVYYEQRTNIIGLVLVKNLLTINPDEEIQVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300

Query: 301 HMAVVIREKE---TLEKSISSNQLETRDVRVDIDGENQPQEKSLKSKRSLKRLNTFVDRS 360
           HMAVV+R+ +    L+ + ++N+    +VRVD+D E  PQE  LK +RSL++  +F +R+
Sbjct: 301 HMAVVVRQCDKIHPLQSNDAANE-TVNEVRVDVDYERSPQETKLKRRRSLQKWKSFPNRA 360

Query: 361 NSYRKFSGSKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETD 419
           NS    S SK+WSKD ++++L + +  LPKL EE +A+GIIT+EDVIEELLQEEI+DETD
Sbjct: 361 NSLG--SRSKRWSKDNDADILQLNEHPLPKLDEEEDAVGIITMEDVIEELLQEEIFDETD 418

BLAST of Sed0021149 vs. TAIR 10
Match: AT1G47330.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 410.6 bits (1054), Expect = 1.5e-114
Identity = 232/437 (53.09%), Postives = 309/437 (70.71%), Query Frame = 0

Query: 1   MGVEYSCCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAA 60
           M  +  CC T F   + III LV FAGLM+GLTLGLMS+ LV++EVL KSG+P +R +A 
Sbjct: 1   MSSDIPCCGTTFSLYVVIIIALVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRINAG 60

Query: 61  KILPVVRRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVC 120
           KI PVV+ QHLLLCTLLI N+ AME LPIFLD +V  W AIL+SVTLIL+FGEI+PQAVC
Sbjct: 61  KIFPVVKNQHLLLCTLLIGNSMAMEALPIFLDKIVPPWLAILLSVTLILVFGEIMPQAVC 120

Query: 121 SRYGLAIGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNE 180
           +RYGL +GA +APFVRVL+ + FP++YPISK+LD+ LGK H  LLRR+ELKT V+FHGNE
Sbjct: 121 TRYGLKVGAIMAPFVRVLLVLFFPISYPISKVLDWMLGKGHGVLLRRAELKTFVNFHGNE 180

Query: 181 AGKGGELTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSR 240
           AGKGG+LT DET+II GALELTEK A+D MTP+S  F+++++  L+   +  IM  GHSR
Sbjct: 181 AGKGGDLTTDETSIITGALELTEKTAKDAMTPISNAFSLELDTPLNLETLNTIMSVGHSR 240

Query: 241 LPVFYEHPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHS 300
           +PV++ +P +IIGL+LVKNL+   +   VP++   +RKIPRVSETMPLYDILN+FQKGHS
Sbjct: 241 VPVYFRNPTHIIGLILVKNLLAVDARKEVPLRKMSMRKIPRVSETMPLYDILNEFQKGHS 300

Query: 301 HMAVVIREKETLEKS--ISSNQLETRDVR-----VDIDGENQPQEKSLKSKRSLKRLNTF 360
           H+AVV ++ +  E+S   S N +E R  +     +  D   +P+ +   S++ + ++ T 
Sbjct: 301 HIAVVYKDLDEQEQSPETSENGIERRKNKKTKDELFKDSCRKPKAQFEVSEKEVFKIETG 360

Query: 361 VDRS-----NSYRKFSG--------SKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITL 418
             +S        ++ SG        +KK  +  +  +L I +  +P      E +G+IT+
Sbjct: 361 DAKSGKSENGEEQQGSGKTSLLAAPAKKRHRGCSFCILDIENTPIPDFPTNEEVVGVITM 420

BLAST of Sed0021149 vs. TAIR 10
Match: AT5G52790.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 385.2 bits (988), Expect = 6.8e-107
Identity = 224/412 (54.37%), Postives = 286/412 (69.42%), Query Frame = 0

Query: 7   CCTTGFFSRIGIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAAKILPVV 66
           CC T F+  + + + LV+FAGLMSGLTLGLMS+S+VE+EV+ K+G+P +RK+A KILP+V
Sbjct: 8   CCETMFWVYLLVCVALVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNAEKILPLV 67

Query: 67  RRQHLLLCTLLICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVCSRYGLA 126
           + QHLLLCTLLI NA AME LPIF+DSL+ AWGAILISVTLIL FGEIIPQAVCSRYGL+
Sbjct: 68  KNQHLLLCTLLIGNALAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAVCSRYGLS 127

Query: 127 IGATVAPFVRVLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNEAGKGGE 186
           IGA ++  VR+++ + FP++YPISKLLD  LGK H  LL R+ELK+LV  HGNEAGKGGE
Sbjct: 128 IGAKLSFLVRLIIIVFFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGNEAGKGGE 187

Query: 187 LTRDETTIIGGALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSRLPVFYE 246
           LT DETTII GAL++++K A+D MTP+S+ F++DIN  LD   + LI   GHSR+P++  
Sbjct: 188 LTHDETTIISGALDMSQKSAKDAMTPVSQIFSLDINFKLDEKTMGLIASAGHSRIPIYSV 247

Query: 247 HPENIIGLVLVKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHSHMAVVI 306
           +P  IIG +LVKNLI  +  D   I++ PIR++P+V   +PLYDILN FQ G SHMA V+
Sbjct: 248 NPNVIIGFILVKNLIKVRPEDETSIRDLPIRRMPKVDLNLPLYDILNIFQTGRSHMAAVV 307

Query: 307 REKETLEKSISSNQLETRDVRVDIDGENQP-QEKSLKSKRSLKRLNTFVDRSNSYRKFSG 366
             K         N   T          N P  EKS+    + K  N F+    S    + 
Sbjct: 308 GTK---------NHTNT----------NTPVHEKSINGSPN-KDANVFL----SIPALNS 367

Query: 367 SKKWSKDFNSEVLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETD 418
           S+   +     +  I+D       E+ E IGIITLEDV+EEL+QEEIYDETD
Sbjct: 368 SETSHQSPIRYIDSISD-------EDEEVIGIITLEDVMEELIQEEIYDETD 388

BLAST of Sed0021149 vs. TAIR 10
Match: AT4G14240.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 362.5 bits (929), Expect = 4.7e-100
Identity = 205/401 (51.12%), Postives = 274/401 (68.33%), Query Frame = 0

Query: 17  GIIIFLVLFAGLMSGLTLGLMSMSLVEIEVLAKSGKPSNRKHAAKILPVVRRQHLLLCTL 76
           GI  FLVLFAG+MSGLTLGLMS+ LVE+E+L +SG P+ +K AA I PVV++QH LL TL
Sbjct: 40  GISCFLVLFAGIMSGLTLGLMSLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTL 99

Query: 77  LICNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQAVCSRYGLAIGATVAPFVR 136
           L+CNA AME LPI+LD L   + AI++SVT +L FGE+IPQA+C+RYGLA+GA     VR
Sbjct: 100 LLCNAMAMEGLPIYLDKLFNEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVR 159

Query: 137 VLVWICFPVAYPISKLLDFSLGKEHKALLRRSELKTLVDFHGNEAGKGGELTRDETTIIG 196
           +L+ +C+P+A+PI K+LD  LG  + AL RR++LK LV  H  EAGKGGELT DETTII 
Sbjct: 160 ILMTLCYPIAFPIGKILDLVLG-HNDALFRRAQLKALVSIHSQEAGKGGELTHDETTIIS 219

Query: 197 GALELTEKVARDVMTPLSETFAIDINAHLDSNLIKLIMEKGHSRLPVFYEHPENIIGLVL 256
           GAL+LTEK A++ MTP+  TF++D+N+ LD   +  I+ +GHSR+PV+  +P+N+IGL+L
Sbjct: 220 GALDLTEKTAQEAMTPIESTFSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLL 279

Query: 257 VKNLITRQSPDGVPIKNFPIRKIPRVSETMPLYDILNDFQKGHSHMAVVIREKETLEKSI 316
           VK+L+T +      +    IR+IPRV   MPLYDILN+FQKG SHMA V++ K   +   
Sbjct: 280 VKSLLTVRPETETLVSAVCIRRIPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSKVPP 339

Query: 317 SSNQLETRDVRVDIDGENQPQEKSLKSKRSLKRLNTFVDRSNSYRKFSGSKKWSKDFNSE 376
           S+   E  D   D D         LK + +   +   +D++N    F  ++     F+  
Sbjct: 340 STLLEEHTDESNDSD---LTAPLLLKREGNHDNVIVTIDKANGQSFFQNNESGPHGFS-- 399

Query: 377 VLHIADDLLPKLSEEGEAIGIITLEDVIEELLQEEIYDETD 418
             H ++ +     E+GE IGIITLEDV EELLQEEI DETD
Sbjct: 400 --HTSEAI-----EDGEVIGIITLEDVFEELLQEEIVDETD 427

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022963200.18.7e-21392.36DUF21 domain-containing protein At2g14520-like isoform X2 [Cucurbita moschata][more]
XP_023518289.15.6e-21292.36DUF21 domain-containing protein At4g33700-like isoform X2 [Cucurbita pepo subsp.... [more]
KAG6594951.16.2e-21192.36DUF21 domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia][more]
KAG7026912.16.2e-21192.36DUF21 domain-containing protein, partial [Cucurbita argyrosperma subsp. argyrosp... [more]
XP_008440429.12.4e-21092.36PREDICTED: DUF21 domain-containing protein At4g33700-like isoform X1 [Cucumis me... [more]
Match NameE-valueIdentityDescription
Q8VZI29.1e-15769.05DUF21 domain-containing protein At4g33700 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Q9ZQR42.7e-15669.36DUF21 domain-containing protein At2g14520 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Q8RY602.1e-11353.09DUF21 domain-containing protein At1g47330 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Q9LTD89.6e-10654.37DUF21 domain-containing protein At5g52790 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Q67XQ06.7e-9951.12DUF21 domain-containing protein At4g14240 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Match NameE-valueIdentityDescription
A0A6J1HHB34.2e-21392.36DUF21 domain-containing protein At2g14520-like isoform X2 OS=Cucurbita moschata ... [more]
A0A5A7SYT71.1e-21092.36DUF21 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_... [more]
A0A1S3B1471.1e-21092.36DUF21 domain-containing protein At4g33700-like isoform X1 OS=Cucumis melo OX=365... [more]
A0A6J1KTE83.7e-20990.95DUF21 domain-containing protein At2g14520-like isoform X2 OS=Cucurbita maxima OX... [more]
A0A6J1HJD93.7e-20987.56DUF21 domain-containing protein At2g14520-like isoform X1 OS=Cucurbita moschata ... [more]
Match NameE-valueIdentityDescription
AT4G33700.16.5e-15869.05CBS domain-containing protein with a domain of unknown function (DUF21) [more]
AT2G14520.11.9e-15769.36CBS domain-containing protein with a domain of unknown function (DUF21) [more]
AT1G47330.11.5e-11453.09CBS domain-containing protein with a domain of unknown function (DUF21) [more]
AT5G52790.16.8e-10754.37CBS domain-containing protein with a domain of unknown function (DUF21) [more]
AT4G14240.14.7e-10051.12CBS domain-containing protein with a domain of unknown function (DUF21) [more]
InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D3.10.580.10coord: 191..323
e-value: 1.4E-38
score: 134.1
NoneNo IPR availablePANTHERPTHR12064:SF57AND COBALT EFFLUX PROTEIN CORC, PUTATIVE-RELATEDcoord: 1..412
NoneNo IPR availableSUPERFAMILY54631CBS-domain paircoord: 197..408
IPR002550CNNM, transmembrane domainPFAMPF01595DUF21coord: 18..185
e-value: 2.6E-36
score: 124.9
IPR002550CNNM, transmembrane domainPROSITEPS51846CNNMcoord: 8..191
score: 54.812424
IPR045095Ancient conserved domain protein familyPANTHERPTHR12064ANCIENT CONSERVED DOMAIN PROTEIN-RELATEDcoord: 1..412
IPR044751Ion transporter-like, CBS domainCDDcd04590CBS_pair_CorC_HlyC_assoccoord: 205..308
e-value: 3.54088E-28
score: 105.27

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sed0021149.1Sed0021149.1mRNA
Sed0021149.2Sed0021149.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010960 magnesium ion homeostasis
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0043231 intracellular membrane-bounded organelle