MS008845.1 (mRNA) Bitter gourd (TR) v1

Overview
NameMS008845.1
TypemRNA
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionDUF21 domain-containing protein
Locationscaffold4: 4203733 .. 4207581 (-)
Sequence length1431
RNA-Seq ExpressionMS008845.1
SyntenyMS008845.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCAGAAGATGTCCGGTGTTGCGAGCCGGAGTTCTGGTTGTACATGTTGGCAATCGTAGGTCTGGTGACCTTCGCTGGACTGATGGCTGGGCTCACTTTGGGTCTCATGTCTCTCGGCCTCGTTGACCTGGAAGTTCTTATAAAGTCCGGCCGCCCTCAGGATCGCAAACACGCCGGTAATCTCTATTATCACCTCCCGCAACTATGGAAATTCATTTCGTATCTGTGGCCAGAAACTTGAATCCCGACGATTTCCGTATTGTCACTTGAAACTTTTCAAGAACATTTTAGTTACCTGTAAAAATTATTGTTATTATTTTAAGTAACCACACGGCAGCTGAAGGACCTGTACATTGAATTATGCAATTCCGTGTTAGGATCTGGGAATTAGGTAGTAGTGTGCGCGTGCCTATGAACCGAGAGTACTAAGTAGGATTGGGGGAGTTTTAGAAGCTTAAGCGAAATGTTTCATACCCATGGGATCACCTAGAGTAGTAATGTTATGATTTCTTTGGCAGCTAAGATATTCCCAGTGGTAAAGAATCAGCATCTGCTTCTCTGCACACTTTTGATTGGTAACTCTTTAGCAATGGAGGTAACTATATAAACGTTTGCACATCTCTGTTTGTGGAATCTACATAATTTTCTAAGTGTAAACTTAATGATGTTGTATGTGTCCCCTGTAGGCTCTTCCGATATTCTTGGACAAGATTGTACCTCCTTGGGCAGCTATCCTTGTTTCAGTTACTCTAATCCTCATGTTCGGAGAGGTACTTTTGATCACACCAGTTTGAGAATAGTTTTTGCCATTGGATATTATTGGGTTAATGTGACGTAGACTTGACTGCAGATATTGCCACAAGCGATTTGTACTCGTTATGGGTTGAAAGTTGGAGCAATAATGGCACCTTTTGTTCGCATTCTTCTTATGTTATTCTTTCCCATTTCATATCCAATTAGTAAGGTAAAATTCCCTTACACCTCTTCTAAGTTCAAGCATGGAAATGAAAATTAAAGTTTGAGACTGATAAGATAATAGTTCTATAGGCCATATATACATGATATTTGGGATGCTCATTTGAATATCTTATTTAACTGCAGGTAGCCAATAATAGTGCTATAGGCCATATATACATGATATTTAGTTTCTTTTACATCTTTTTTTTTTTTAAAAATAAATTCAGGTTCTTGATTGGATGTTGGGTAAAGGGCACGCTGTCCTCTTAAGGAGAGCAGAGCTCAAGACATTTGTGAACTTTCATGGCAATGAGGTAATCAACTTGGGGATGTGCTCATGGGATCATTTGCATGAGTTAATATTCTAGTATTTGAGTGCTCTAACGGAACATTATCCTTATCTTGTGTGCTCTGAATTTTCCTTCTGCTCCAATCACCATTATTTGATGGATTTGCCATGTCTAAAATATATGTTGTCTTAGGCTGGAAAAGGTGGAGATTTAACTCACGACGAGACTACTATTATTGCTGGGGCACTTGAATTGACTGAAAAGACAGCAAAAGATGCCATGACTCCTATATCAAATGCATTTTCCCTTGATCTGGATGCGACTCTTGATTTGTGGGTTTTTTCTTTCTTCCAGTTCTTGTATGTCTGTCTAATATCATTTTTCCGTTTTTCTAAAGCCCCTTCTGTATATGGCAGGGATACACTCAATGCTATAATGACGAAAGGGCATAGCAGAGTTCCTGTATATTCTGGAGATCCAAAAAATATAATTGGACTAGTTCTGGTAATTCACTTGCATCATCCCTAGCTGTTATAAACAATAACATTACATGATAGGTTGAAATGAATCCTCTTTCTTGCTCTCTCTATTGTATCACCTTCTAATGTAGCAGGATAACGAGACCTGAGAATAAAATTGAGAAATGGCTAAGAGGGAGTTCTCAGGTCCCTCTTGCTTCATTTTTCAAAATCAGGGTGGCTGTTTACATGCAAAACTGAAGTGCTCTGCATTGTGAATAAACAAACTCTTCTAGTAAAATAGAACTGTCACCGTGTTTTTGGACTAAGGAGTTTAAGCATGTGGCCTAAGTTGCCGGGCTGCAGACTAATGAATAGGAAAACAGAGGGCCTTGGCCAAGCTGAGTTCTTTCAAATTAGAAACACCAACCTTGAGCCTAATTGGACAGACATCACTAGTTTAAATGACATAGGCGACATACCTCTAATGTGGGAATGGAGGTTGCTTTCGCATATTTTTCTCAATCTGTTCATTGGAGTGAGCGGAAGTTGATGAAATTGTTGCTTTTTATTTATTTATTTTAAATTTCCTGATTAATATATATATTTTTATTTGGTGCGTGTTCATTGTGTCATATTATGATACTCAGACTTAAGGTATAGTTCATGCCATCGAGCATCTGCTCATTTGTTCATCGTTTTAACATTTTTCAGGTTAAAAATCTTTTGACTGTTGATCCAGACGACAGAATTTCCCTCAAAAAAATGATTATTAGAAAAATTCCACGGTATGAGAATATTCATTGCTTGTATCTTTATGCATTTAGATGAAATGACCTGTTGGCAGTTTTGGACCTGGAAAAAAAGCTTTATTATTTTTTCTTCCTCAACTACTGCAGGGTTTCTGAAGACATGCCTCTATATGACATTTTAAATGAATTTCAAAAGGGGCATAGTCACATTGCTGTGGTATTCAAGAAACATGGTTACCAATCTGAGGCATTGCTGAAAAAAGGTGAGAATGTGCAAAGCAGAAATTTTATATTAATTGGATAAACAAAGAAAGTAATTGCGAACAAATAAACTATTCAGTTATGTTTGGGGCAGGAGGGTTTGGGATCTTAAAATTTGTTTGTATGCTATCCATTTATTTAAGGCTCAGATTTTATAGTCATGATATTTCTCATCCAAATTTGGGAAGGTTCTACACCTTCGTCTAGCATTGAACTCGTATTTACTCATGTTCTTGAGGTCCAAATTCCTTGGAACTGTCTAGATCCACCAATTATAGGGCTTCACATGAAATGCTCTTAGTTGTGCTACTTTTGCTGTGAACATATCTTATTCGTTTGTTACTTTTGACATACCCTTTTCCTGCCTGGTTGATTCTCAGACAATGGAGTTGACTCTGGTGCTGATGCTGCTACTCAAAACTTAGTGATGAAACTGGAATCAGTTGATGCTCAAACAACAGCTGAAAAGGGTGGAGGCCAACAGACAAAGAAAAGTCCACCAGCTACTCCTGCGTTTAAAAAACGACACAAAGGTTGTTCATTTTGCATTTTAGATGTCGAAAATGCACCTCTTCCCATCTTTCCACCTAGTGAAGAGGTGGTTGGTGTCATTACTATGGAGGATGTGATTGAAGAACTTCTTCAGGTGATGACAAATTCTTTAACTTGCAACTAAGCATGGTTTACAATGTTGGGAGGTTAGTTCTGCCAGCACTGTTATTCAATGTTGATCTTACGTATCATATAATCATCTTGTCAGGAGGAGATATTAGACGAAACAGATGAGTATGTCAATATCCACAACAGGTACGTGTCGGTCTCCCTAGCTTTATAAAAGCACTTGTATGCTTTATGTCTACCTGTTTCGCTATAGCAACATATAGCTAGCCTGGCTAGAACTTGTGCATCGTTTTGCTTTCTTGCCGAGAGTGGATCAATATGGATCTAAGGAATTGGATTACATTTTGTTCTTGTAAAACATATTTCTTATGGATTCATTTGATGTTCCTGTTGGATGGTCACAAAGATAATAGGGAATTTTGCTGAATGCAGAATAAAAATCAACATGCAAGCATCTCCAGAAAAACCAGGCACCAACCCATCGCAGCTTTCCTCATATGTTAAAATG

mRNA sequence

ATGGCAGAAGATGTCCGGTGTTGCGAGCCGGAGTTCTGGTTGTACATGTTGGCAATCGTAGGTCTGGTGACCTTCGCTGGACTGATGGCTGGGCTCACTTTGGGTCTCATGTCTCTCGGCCTCGTTGACCTGGAAGTTCTTATAAAGTCCGGCCGCCCTCAGGATCGCAAACACGCCGCTAAGATATTCCCAGTGGTAAAGAATCAGCATCTGCTTCTCTGCACACTTTTGATTGGTAACTCTTTAGCAATGGAGGCTCTTCCGATATTCTTGGACAAGATTGTACCTCCTTGGGCAGCTATCCTTGTTTCAGTTACTCTAATCCTCATGTTCGGAGAGATATTGCCACAAGCGATTTGTACTCGTTATGGGTTGAAAGTTGGAGCAATAATGGCACCTTTTGTTCGCATTCTTCTTATGTTATTCTTTCCCATTTCATATCCAATTAGTAAGGTTCTTGATTGGATGTTGGGTAAAGGGCACGCTGTCCTCTTAAGGAGAGCAGAGCTCAAGACATTTGTGAACTTTCATGGCAATGAGGCTGGAAAAGGTGGAGATTTAACTCACGACGAGACTACTATTATTGCTGGGGCACTTGAATTGACTGAAAAGACAGCAAAAGATGCCATGACTCCTATATCAAATGCATTTTCCCTTGATCTGGATGCGACTCTTGATTTGTGGGTTTTTTCTTTCTTCCAGTTCTTGTATGTCTGTCTAATATCATTTTTCCGTTTTTCTAAAGCCCCTTCTGTATATGGCAGGGATACACTCAATGCTATAATGACGAAAGGGCATAGCAGAGTTCCTGTATATTCTGGAGATCCAAAAAATATAATTGGACTAGTTCTGGTTAAAAATCTTTTGACTGTTGATCCAGACGACAGAATTTCCCTCAAAAAAATGATTATTAGAAAAATTCCACGGGTTTCTGAAGACATGCCTCTATATGACATTTTAAATGAATTTCAAAAGGGGCATAGTCACATTGCTGTGGTATTCAAGAAACATGGTTACCAATCTGAGGCATTGCTGAAAAAAGACAATGGAGTTGACTCTGGTGCTGATGCTGCTACTCAAAACTTAGTGATGAAACTGGAATCAGTTGATGCTCAAACAACAGCTGAAAAGGGTGGAGGCCAACAGACAAAGAAAAGTCCACCAGCTACTCCTGCGTTTAAAAAACGACACAAAGGTTGTTCATTTTGCATTTTAGATGTCGAAAATGCACCTCTTCCCATCTTTCCACCTAGTGAAGAGGTGGTTGGTGTCATTACTATGGAGGATGTGATTGAAGAACTTCTTCAGGAGGAGATATTAGACGAAACAGATGAGTATGTCAATATCCACAACAGAATAAAAATCAACATGCAAGCATCTCCAGAAAAACCAGGCACCAACCCATCGCAGCTTTCCTCATATGTTAAAATG

Coding sequence (CDS)

ATGGCAGAAGATGTCCGGTGTTGCGAGCCGGAGTTCTGGTTGTACATGTTGGCAATCGTAGGTCTGGTGACCTTCGCTGGACTGATGGCTGGGCTCACTTTGGGTCTCATGTCTCTCGGCCTCGTTGACCTGGAAGTTCTTATAAAGTCCGGCCGCCCTCAGGATCGCAAACACGCCGCTAAGATATTCCCAGTGGTAAAGAATCAGCATCTGCTTCTCTGCACACTTTTGATTGGTAACTCTTTAGCAATGGAGGCTCTTCCGATATTCTTGGACAAGATTGTACCTCCTTGGGCAGCTATCCTTGTTTCAGTTACTCTAATCCTCATGTTCGGAGAGATATTGCCACAAGCGATTTGTACTCGTTATGGGTTGAAAGTTGGAGCAATAATGGCACCTTTTGTTCGCATTCTTCTTATGTTATTCTTTCCCATTTCATATCCAATTAGTAAGGTTCTTGATTGGATGTTGGGTAAAGGGCACGCTGTCCTCTTAAGGAGAGCAGAGCTCAAGACATTTGTGAACTTTCATGGCAATGAGGCTGGAAAAGGTGGAGATTTAACTCACGACGAGACTACTATTATTGCTGGGGCACTTGAATTGACTGAAAAGACAGCAAAAGATGCCATGACTCCTATATCAAATGCATTTTCCCTTGATCTGGATGCGACTCTTGATTTGTGGGTTTTTTCTTTCTTCCAGTTCTTGTATGTCTGTCTAATATCATTTTTCCGTTTTTCTAAAGCCCCTTCTGTATATGGCAGGGATACACTCAATGCTATAATGACGAAAGGGCATAGCAGAGTTCCTGTATATTCTGGAGATCCAAAAAATATAATTGGACTAGTTCTGGTTAAAAATCTTTTGACTGTTGATCCAGACGACAGAATTTCCCTCAAAAAAATGATTATTAGAAAAATTCCACGGGTTTCTGAAGACATGCCTCTATATGACATTTTAAATGAATTTCAAAAGGGGCATAGTCACATTGCTGTGGTATTCAAGAAACATGGTTACCAATCTGAGGCATTGCTGAAAAAAGACAATGGAGTTGACTCTGGTGCTGATGCTGCTACTCAAAACTTAGTGATGAAACTGGAATCAGTTGATGCTCAAACAACAGCTGAAAAGGGTGGAGGCCAACAGACAAAGAAAAGTCCACCAGCTACTCCTGCGTTTAAAAAACGACACAAAGGTTGTTCATTTTGCATTTTAGATGTCGAAAATGCACCTCTTCCCATCTTTCCACCTAGTGAAGAGGTGGTTGGTGTCATTACTATGGAGGATGTGATTGAAGAACTTCTTCAGGAGGAGATATTAGACGAAACAGATGAGTATGTCAATATCCACAACAGAATAAAAATCAACATGCAAGCATCTCCAGAAAAACCAGGCACCAACCCATCGCAGCTTTCCTCATATGTTAAAATG

Protein sequence

MAEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCLISFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLKKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSGADAATQNLVMKLESVDAQTTAEKGGGQQTKKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGTNPSQLSSYVKM
Homology
BLAST of MS008845.1 vs. NCBI nr
Match: XP_022140386.1 (DUF21 domain-containing protein At1g47330-like [Momordica charantia])

HSP 1 Score: 853.6 bits (2204), Expect = 8.2e-244
Identity = 447/477 (93.71%), Postives = 447/477 (93.71%), Query Frame = 0

Query: 1   MAEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60
           MAEDVR CEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA
Sbjct: 82  MAEDVRSCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 141

Query: 61  KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 120
           KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC
Sbjct: 142 KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 201

Query: 121 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180
           TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE
Sbjct: 202 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 261

Query: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCL 240
           AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDL             
Sbjct: 262 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDL------------- 321

Query: 241 ISFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 300
                          DTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK
Sbjct: 322 ---------------DTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 381

Query: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSGADAATQ 360
           KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSGADAATQ
Sbjct: 382 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSGADAATQ 441

Query: 361 NLVMKLESVDAQTTAEKGGGQQTKKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSEE 420
           NLVMKLESVDAQTTAEKGGGQQ KKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSEE
Sbjct: 442 NLVMKLESVDAQTTAEKGGGQQIKKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSEE 501

Query: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGTNPSQLSSYVKM 478
           VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGTNPSQLSSYVKM
Sbjct: 502 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGTNPSQLSSYVKM 530

BLAST of MS008845.1 vs. NCBI nr
Match: XP_022941204.1 (DUF21 domain-containing protein At1g47330 [Cucurbita moschata])

HSP 1 Score: 789.6 bits (2038), Expect = 1.5e-224
Identity = 410/475 (86.32%), Postives = 426/475 (89.68%), Query Frame = 0

Query: 1   MAEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60
           MAEDVRCCE +F+L++L IVGLV FAGLMAGLTLGLMSLGLVDLEVL+KSGRPQDRKHAA
Sbjct: 1   MAEDVRCCESKFFLFLLIIVGLVAFAGLMAGLTLGLMSLGLVDLEVLMKSGRPQDRKHAA 60

Query: 61  KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 120
           KI PVVKNQHLLLCTLLIGNSLAMEALPIFLD IVPPW A+LVSVTLILMFGEILPQAIC
Sbjct: 61  KILPVVKNQHLLLCTLLIGNSLAMEALPIFLDMIVPPWVAVLVSVTLILMFGEILPQAIC 120

Query: 121 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180
           TRYGLKVGAIMAPFVR+LLM+FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE
Sbjct: 121 TRYGLKVGAIMAPFVRVLLMVFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180

Query: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCL 240
           AGKGGDLTHDETTIIAGALELTEKTAK+AMT ISNAFSLDLDATLDL             
Sbjct: 181 AGKGGDLTHDETTIIAGALELTEKTAKNAMTSISNAFSLDLDATLDL------------- 240

Query: 241 ISFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 300
                          +TLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDP+DR+ L+
Sbjct: 241 ---------------ETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPEDRVPLR 300

Query: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSGADAATQ 360
           KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDS A AAT 
Sbjct: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSAAGAATH 360

Query: 361 NLVMKLESVDAQTTAEKGGGQQTKKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSEE 420
           NL MK+ESVDAQT AEK GGQQTKKSPPATPAFKKRH+GCSFCILDVENAPLP+ PP EE
Sbjct: 361 NLAMKMESVDAQTIAEKAGGQQTKKSPPATPAFKKRHRGCSFCILDVENAPLPVLPPGEE 420

Query: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGTNPSQLSSYV 476
           VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQ SPEKP TNP QLS  V
Sbjct: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNPPQLSPNV 447

BLAST of MS008845.1 vs. NCBI nr
Match: KAG6608536.1 (DUF21 domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 789.6 bits (2038), Expect = 1.5e-224
Identity = 410/475 (86.32%), Postives = 426/475 (89.68%), Query Frame = 0

Query: 1   MAEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60
           MAEDVRCCE +F+L++L IVGLV FAGLMAGLTLGLMSLGLVDLEVL+KSGRPQDRKHAA
Sbjct: 1   MAEDVRCCESKFFLFLLIIVGLVAFAGLMAGLTLGLMSLGLVDLEVLMKSGRPQDRKHAA 60

Query: 61  KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 120
           KI PVVKNQHLLLCTLLIGNSLAMEALPIFLD IVPPW A+LVSVTLILMFGEILPQAIC
Sbjct: 61  KILPVVKNQHLLLCTLLIGNSLAMEALPIFLDMIVPPWVAVLVSVTLILMFGEILPQAIC 120

Query: 121 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180
           TRYGLKVGAIMAPFVR+LLM+FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE
Sbjct: 121 TRYGLKVGAIMAPFVRVLLMVFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180

Query: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCL 240
           AGKGGDLTHDETTIIAGALELTEKTAK+AMT ISNAFSLDLDATLDL             
Sbjct: 181 AGKGGDLTHDETTIIAGALELTEKTAKNAMTSISNAFSLDLDATLDL------------- 240

Query: 241 ISFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 300
                          +TLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDP+DR+ L+
Sbjct: 241 ---------------ETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPEDRVPLR 300

Query: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSGADAATQ 360
           KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDS A AAT 
Sbjct: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSAAGAATH 360

Query: 361 NLVMKLESVDAQTTAEKGGGQQTKKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSEE 420
           NL MK+ESVDAQT AEK GGQQTKKSPPATPAFKKRH+GCSFCILDVENAPLP+ PP EE
Sbjct: 361 NLAMKMESVDAQTIAEKAGGQQTKKSPPATPAFKKRHRGCSFCILDVENAPLPVLPPGEE 420

Query: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGTNPSQLSSYV 476
           VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQ SPEKP TNP QLS  V
Sbjct: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNPPQLSPKV 447

BLAST of MS008845.1 vs. NCBI nr
Match: XP_023525037.1 (DUF21 domain-containing protein At1g47330 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 788.1 bits (2034), Expect = 4.2e-224
Identity = 410/475 (86.32%), Postives = 425/475 (89.47%), Query Frame = 0

Query: 1   MAEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60
           MAEDVRCCE +F+L++L IVGLV FAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA
Sbjct: 1   MAEDVRCCESKFFLFLLIIVGLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60

Query: 61  KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 120
           KI PVVKNQHLLLCTLLIGNSLAMEALPIFLD IVPPW A+LVSVTLILMFGEILPQAIC
Sbjct: 61  KILPVVKNQHLLLCTLLIGNSLAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAIC 120

Query: 121 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180
           TRYGLKVGAIMAPFVR+LLM+FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE
Sbjct: 121 TRYGLKVGAIMAPFVRVLLMIFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180

Query: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCL 240
           AGKGGDLTHDETTIIAGALELTEKTAKDAMT ISNAFSLDLDATLDL             
Sbjct: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTSISNAFSLDLDATLDL------------- 240

Query: 241 ISFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 300
                          +TLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDP+DR+ L+
Sbjct: 241 ---------------ETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPEDRVLLR 300

Query: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSGADAATQ 360
           KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDS A AAT 
Sbjct: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSTAGAATH 360

Query: 361 NLVMKLESVDAQTTAEKGGGQQTKKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSEE 420
           NL MK+E VDAQT AEK GG+QTKKSPPATPAFKKRH+GCSFCILDVENAPLP+ PP EE
Sbjct: 361 NLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGCSFCILDVENAPLPVLPPGEE 420

Query: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGTNPSQLSSYV 476
           VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQ SPEKP TNP QLS  V
Sbjct: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNPPQLSPNV 447

BLAST of MS008845.1 vs. NCBI nr
Match: XP_022981956.1 (DUF21 domain-containing protein At1g47330 [Cucurbita maxima])

HSP 1 Score: 781.2 bits (2016), Expect = 5.2e-222
Identity = 405/475 (85.26%), Postives = 422/475 (88.84%), Query Frame = 0

Query: 1   MAEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60
           MAEDVRCCE +F+L++L IVGLV FAGLMAGLTLGLMSLGLVDLEVL+KSGRPQDRKHAA
Sbjct: 1   MAEDVRCCESKFFLFLLIIVGLVAFAGLMAGLTLGLMSLGLVDLEVLMKSGRPQDRKHAA 60

Query: 61  KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 120
           KI PVVKNQHLLLCTLLIGNSLAMEALP+FLD IVPPW A+LVSVTLILMFGEILPQAIC
Sbjct: 61  KILPVVKNQHLLLCTLLIGNSLAMEALPVFLDMIVPPWVAVLVSVTLILMFGEILPQAIC 120

Query: 121 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180
           TRYGLKVGAIMAPFVR+LL +FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE
Sbjct: 121 TRYGLKVGAIMAPFVRVLLTVFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180

Query: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCL 240
           AGKGGDLTHDETTIIAGALELTEKTAKDAMT ISNAFSLDLDATLDL             
Sbjct: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTSISNAFSLDLDATLDL------------- 240

Query: 241 ISFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 300
                          +TLNAIMTKGHSRVPVYSGDPKNI+GLVLVKNLLTVDP+D + L+
Sbjct: 241 ---------------ETLNAIMTKGHSRVPVYSGDPKNIVGLVLVKNLLTVDPEDGVPLR 300

Query: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSGADAATQ 360
           KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDS A AAT 
Sbjct: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSAAGAATH 360

Query: 361 NLVMKLESVDAQTTAEKGGGQQTKKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSEE 420
           N  MK+ESVDAQT AEK GGQQTKKSPPATPAFKKRH+GCSFCILDVENAPLP+ P  EE
Sbjct: 361 NFAMKMESVDAQTIAEKAGGQQTKKSPPATPAFKKRHRGCSFCILDVENAPLPVLPAGEE 420

Query: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGTNPSQLSSYV 476
           VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQ SPEKP TNP QLS  V
Sbjct: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNPPQLSPNV 447

BLAST of MS008845.1 vs. ExPASy Swiss-Prot
Match: Q8RY60 (DUF21 domain-containing protein At1g47330 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF7 PE=1 SV=1)

HSP 1 Score: 601.3 bits (1549), Expect = 9.6e-171
Identity = 327/484 (67.56%), Postives = 371/484 (76.65%), Query Frame = 0

Query: 1   MAEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60
           M+ D+ CC   F LY++ I+ LV FAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDR +A 
Sbjct: 1   MSSDIPCCGTTFSLYVVIIIALVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRINAG 60

Query: 61  KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 120
           KIFPVVKNQHLLLCTLLIGNS+AMEALPIFLDKIVPPW AIL+SVTLIL+FGEI+PQA+C
Sbjct: 61  KIFPVVKNQHLLLCTLLIGNSMAMEALPIFLDKIVPPWLAILLSVTLILVFGEIMPQAVC 120

Query: 121 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180
           TRYGLKVGAIMAPFVR+LL+LFFPISYPISKVLDWMLGKGH VLLRRAELKTFVNFHGNE
Sbjct: 121 TRYGLKVGAIMAPFVRVLLVLFFPISYPISKVLDWMLGKGHGVLLRRAELKTFVNFHGNE 180

Query: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCL 240
           AGKGGDLT DET+II GALELTEKTAKDAMTPISNAFSL+LD  L+L             
Sbjct: 181 AGKGGDLTTDETSIITGALELTEKTAKDAMTPISNAFSLELDTPLNL------------- 240

Query: 241 ISFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 300
                          +TLN IM+ GHSRVPVY  +P +IIGL+LVKNLL VD    + L+
Sbjct: 241 ---------------ETLNTIMSVGHSRVPVYFRNPTHIIGLILVKNLLAVDARKEVPLR 300

Query: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSGADAAT- 360
           KM +RKIPRVSE MPLYDILNEFQKGHSHIAVV+K    Q ++    +NG++   +  T 
Sbjct: 301 KMSMRKIPRVSETMPLYDILNEFQKGHSHIAVVYKDLDEQEQSPETSENGIERRKNKKTK 360

Query: 361 ------------------QNLVMKLESVDAQTTAEKGGGQQT---KKSPPATPAFKKRHK 420
                             +  V K+E+ DA++   + G +Q    K S  A PA KKRH+
Sbjct: 361 DELFKDSCRKPKAQFEVSEKEVFKIETGDAKSGKSENGEEQQGSGKTSLLAAPA-KKRHR 420

Query: 421 GCSFCILDVENAPLPIFPPSEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQ 463
           GCSFCILD+EN P+P FP +EEVVGVITMEDVIEELLQEEILDETDEYVNIHNRI++NM 
Sbjct: 421 GCSFCILDIENTPIPDFPTNEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIRVNMH 455

BLAST of MS008845.1 vs. ExPASy Swiss-Prot
Match: Q8VZI2 (DUF21 domain-containing protein At4g33700 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF6 PE=1 SV=1)

HSP 1 Score: 442.6 bits (1137), Expect = 5.7e-123
Identity = 252/459 (54.90%), Postives = 311/459 (67.76%), Query Frame = 0

Query: 1   MAEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60
           MA +  CC P F++++  IV LV FAGLM+GLTLGLMSL LVDLEVL KSG P+ RK+AA
Sbjct: 1   MAVEYVCCSPNFFIHIAVIVFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPEHRKYAA 60

Query: 61  KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 120
           KI PVVKNQHLLL TLLI N+ AME LPIFLD +V  W AIL+SVTLIL+FGEI+PQ+IC
Sbjct: 61  KILPVVKNQHLLLVTLLICNAAAMETLPIFLDGLVTAWGAILISVTLILLFGEIIPQSIC 120

Query: 121 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180
           +RYGL +GA +APFVR+L+ +  P+++PISK+LD++LG   A L RRAELKT V+FHGNE
Sbjct: 121 SRYGLAIGATVAPFVRVLVFICLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCL 240
           AGKGG+LTHDETTIIAGALEL+EK  KDAMTPIS+ F +D++A LD              
Sbjct: 181 AGKGGELTHDETTIIAGALELSEKMVKDAMTPISDIFVIDINAKLD-------------- 240

Query: 241 ISFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 300
                         RD +N I+ KGHSRVPVY   P NIIGLVLVKNLLT++PD+ I +K
Sbjct: 241 --------------RDLMNLILEKGHSRVPVYYEQPTNIIGLVLVKNLLTINPDEEIPVK 300

Query: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKK----HGYQSEALLKKDNGVDSGAD 360
            + IR+IPRV E +PLYDILNEFQKG SH+AVV ++    H   S+    K+  VD  ++
Sbjct: 301 NVTIRRIPRVPEILPLYDILNEFQKGLSHMAVVVRQCDKIHPLPSKNGSVKEARVDVDSE 360

Query: 361 AA---------TQNLVMKLESVDAQTTAEKGGGQQTKKSPPATPAFKKRHKGCSFCILDV 420
                      T+  + K +S   + ++ KGG +            KK  K     IL +
Sbjct: 361 GTPTPQERMLRTKRSLQKWKSFPNRASSFKGGSKS-----------KKWSKDNDADILQL 420

Query: 421 ENAPLPIFPPSEEVVGVITMEDVIEELLQEEILDETDEY 447
              PLP     EE VG+ITMEDVIEELLQEEI DETD +
Sbjct: 421 NGNPLPKLAEEEEAVGIITMEDVIEELLQEEIFDETDHH 420

BLAST of MS008845.1 vs. ExPASy Swiss-Prot
Match: Q9ZQR4 (DUF21 domain-containing protein At2g14520 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF3 PE=2 SV=2)

HSP 1 Score: 439.1 bits (1128), Expect = 6.3e-122
Identity = 251/455 (55.16%), Postives = 308/455 (67.69%), Query Frame = 0

Query: 1   MAEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60
           MA +  CC   F++++  IV LV FAGLM+GLTLGLMS+ LVDLEVL KSG P+DR HAA
Sbjct: 1   MAVEYECCGTSFFIHIAVIVLLVLFAGLMSGLTLGLMSMSLVDLEVLAKSGTPRDRIHAA 60

Query: 61  KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 120
           KI PVVKNQHLLLCTLLI N+ AMEALPIFLD +V  W AIL+SVTLIL+FGEI+PQ++C
Sbjct: 61  KILPVVKNQHLLLCTLLICNAAAMEALPIFLDALVTAWGAILISVTLILLFGEIIPQSVC 120

Query: 121 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180
           +R+GL +GA +APFVR+L+ +  P+++PISK+LD++LG G   L RRAELKT V+ HGNE
Sbjct: 121 SRHGLAIGATVAPFVRVLVWICLPVAWPISKLLDFLLGHGRVALFRRAELKTLVDLHGNE 180

Query: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCL 240
           AGKGG+LTHDETTIIAGALEL+EK AKDAMTPIS+ F +D++A LD              
Sbjct: 181 AGKGGELTHDETTIIAGALELSEKMAKDAMTPISDTFVIDINAKLD-------------- 240

Query: 241 ISFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 300
                         RD +N I+ KGHSRVPVY     NIIGLVLVKNLLT++PD+ I +K
Sbjct: 241 --------------RDLMNLILDKGHSRVPVYYEQRTNIIGLVLVKNLLTINPDEEIQVK 300

Query: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKK----HGYQS-EALLKKDNGVDSGA 360
            + IR+IPRV E +PLYDILNEFQKGHSH+AVV ++    H  QS +A  +  N V    
Sbjct: 301 NVTIRRIPRVPETLPLYDILNEFQKGHSHMAVVVRQCDKIHPLQSNDAANETVNEVRVDV 360

Query: 361 DAATQNLVMKLESVDAQTTAEKGGGQQTKKSPPATPAFKKRHKGCS----FCILDVENAP 420
           D        KL+         +   Q+ K  P    +   R K  S      IL +   P
Sbjct: 361 DYERSPQETKLK--------RRRSLQKWKSFPNRANSLGSRSKRWSKDNDADILQLNEHP 419

Query: 421 LPIFPPSEEVVGVITMEDVIEELLQEEILDETDEY 447
           LP     E+ VG+ITMEDVIEELLQEEI DETD +
Sbjct: 421 LPKLDEEEDAVGIITMEDVIEELLQEEIFDETDHH 419

BLAST of MS008845.1 vs. ExPASy Swiss-Prot
Match: Q9LTD8 (DUF21 domain-containing protein At5g52790 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF5 PE=2 SV=2)

HSP 1 Score: 428.7 bits (1101), Expect = 8.6e-119
Identity = 235/467 (50.32%), Postives = 314/467 (67.24%), Query Frame = 0

Query: 2   AEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAK 61
           A DV CCE  FW+Y+L  V LV FAGLM+GLTLGLMSL +V+LEV+IK+G P DRK+A K
Sbjct: 3   ANDVPCCETMFWVYLLVCVALVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNAEK 62

Query: 62  IFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAICT 121
           I P+VKNQHLLLCTLLIGN+LAMEALPIF+D ++P W AIL+SVTLIL FGEI+PQA+C+
Sbjct: 63  ILPLVKNQHLLLCTLLIGNALAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAVCS 122

Query: 122 RYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEA 181
           RYGL +GA ++  VR+++++FFP+SYPISK+LD +LGK H+ LL RAELK+ V  HGNEA
Sbjct: 123 RYGLSIGAKLSFLVRLIIIVFFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGNEA 182

Query: 182 GKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCLI 241
           GKGG+LTHDETTII+GAL++++K+AKDAMTP+S  FSLD++  LD               
Sbjct: 183 GKGGELTHDETTIISGALDMSQKSAKDAMTPVSQIFSLDINFKLD--------------- 242

Query: 242 SFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLKK 301
                          T+  I + GHSR+P+YS +P  IIG +LVKNL+ V P+D  S++ 
Sbjct: 243 -------------EKTMGLIASAGHSRIPIYSVNPNVIIGFILVKNLIKVRPEDETSIRD 302

Query: 302 MIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVF--KKHGYQSEALLKKDNGVDSGADAAT 361
           + IR++P+V  ++PLYDILN FQ G SH+A V   K H   +  + +K        DA  
Sbjct: 303 LPIRRMPKVDLNLPLYDILNIFQTGRSHMAAVVGTKNHTNTNTPVHEKSINGSPNKDA-- 362

Query: 362 QNLVMKLESVDAQTTAEKGGGQQTKKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSE 421
            N+ + + ++++  T+         +SP                I  +++    I    E
Sbjct: 363 -NVFLSIPALNSSETSH--------QSP----------------IRYIDS----ISDEDE 410

Query: 422 EVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGT 467
           EV+G+IT+EDV+EEL+QEEI DETD+YV +H RI INM  S   P T
Sbjct: 423 EVIGIITLEDVMEELIQEEIYDETDQYVELHKRITINMPMSGNSPET 410

BLAST of MS008845.1 vs. ExPASy Swiss-Prot
Match: Q67XQ0 (DUF21 domain-containing protein At4g14240 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF1 PE=1 SV=1)

HSP 1 Score: 382.5 bits (981), Expect = 7.0e-105
Identity = 224/452 (49.56%), Postives = 296/452 (65.49%), Query Frame = 0

Query: 22  LVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKIFPVVKNQHLLLCTLLIGNS 81
           LV FAG+M+GLTLGLMSLGLV+LE+L +SG P ++K AA IFPVV+ QH LL TLL+ N+
Sbjct: 45  LVLFAGIMSGLTLGLMSLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLLLCNA 104

Query: 82  LAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRILLML 141
           +AME LPI+LDK+   + AI++SVT +L FGE++PQAICTRYGL VGA     VRIL+ L
Sbjct: 105 MAMEGLPIYLDKLFNEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTL 164

Query: 142 FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALEL 201
            +PI++PI K+LD +LG   A L RRA+LK  V+ H  EAGKGG+LTHDETTII+GAL+L
Sbjct: 165 CYPIAFPIGKILDLVLGHNDA-LFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDL 224

Query: 202 TEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCLISFFRFSKAPSVYGRDTLNAI 261
           TEKTA++AMTPI + FSLD+++ LD W                           + +  I
Sbjct: 225 TEKTAQEAMTPIESTFSLDVNSKLD-W---------------------------EAMGKI 284

Query: 262 MTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLKKMIIRKIPRVSEDMPLYDILN 321
           + +GHSRVPVYSG+PKN+IGL+LVK+LLTV P+    +  + IR+IPRV  DMPLYDILN
Sbjct: 285 LARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRRIPRVPADMPLYDILN 344

Query: 322 EFQKGHSHIAVVFKKHGYQS--EALLKKDNGVDSGADAATQNLVMKLES--VDAQTTAEK 381
           EFQKG SH+A V K  G      + L +++  +S     T  L++K E    +   T +K
Sbjct: 345 EFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLLKREGNHDNVIVTIDK 404

Query: 382 GGGQQTKKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSEEVVGVITMEDVIEELLQE 441
             GQ   ++  + P       G S     +E+          EV+G+IT+EDV EELLQE
Sbjct: 405 ANGQSFFQNNESGP------HGFSHTSEAIEDG---------EVIGIITLEDVFEELLQE 452

Query: 442 EILDETDEYVNIHNRIKINMQASPEKPGTNPS 470
           EI+DETDEYV++H RI++   A+       PS
Sbjct: 465 EIVDETDEYVDVHKRIRVAAAAAASSIARAPS 452

BLAST of MS008845.1 vs. ExPASy TrEMBL
Match: A0A6J1CFY3 (DUF21 domain-containing protein At1g47330-like OS=Momordica charantia OX=3673 GN=LOC111011071 PE=4 SV=1)

HSP 1 Score: 853.6 bits (2204), Expect = 4.0e-244
Identity = 447/477 (93.71%), Postives = 447/477 (93.71%), Query Frame = 0

Query: 1   MAEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60
           MAEDVR CEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA
Sbjct: 82  MAEDVRSCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 141

Query: 61  KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 120
           KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC
Sbjct: 142 KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 201

Query: 121 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180
           TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE
Sbjct: 202 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 261

Query: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCL 240
           AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDL             
Sbjct: 262 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDL------------- 321

Query: 241 ISFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 300
                          DTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK
Sbjct: 322 ---------------DTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 381

Query: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSGADAATQ 360
           KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSGADAATQ
Sbjct: 382 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSGADAATQ 441

Query: 361 NLVMKLESVDAQTTAEKGGGQQTKKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSEE 420
           NLVMKLESVDAQTTAEKGGGQQ KKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSEE
Sbjct: 442 NLVMKLESVDAQTTAEKGGGQQIKKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSEE 501

Query: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGTNPSQLSSYVKM 478
           VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGTNPSQLSSYVKM
Sbjct: 502 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGTNPSQLSSYVKM 530

BLAST of MS008845.1 vs. ExPASy TrEMBL
Match: A0A6J1FMM0 (DUF21 domain-containing protein At1g47330 OS=Cucurbita moschata OX=3662 GN=LOC111446582 PE=4 SV=1)

HSP 1 Score: 789.6 bits (2038), Expect = 7.1e-225
Identity = 410/475 (86.32%), Postives = 426/475 (89.68%), Query Frame = 0

Query: 1   MAEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60
           MAEDVRCCE +F+L++L IVGLV FAGLMAGLTLGLMSLGLVDLEVL+KSGRPQDRKHAA
Sbjct: 1   MAEDVRCCESKFFLFLLIIVGLVAFAGLMAGLTLGLMSLGLVDLEVLMKSGRPQDRKHAA 60

Query: 61  KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 120
           KI PVVKNQHLLLCTLLIGNSLAMEALPIFLD IVPPW A+LVSVTLILMFGEILPQAIC
Sbjct: 61  KILPVVKNQHLLLCTLLIGNSLAMEALPIFLDMIVPPWVAVLVSVTLILMFGEILPQAIC 120

Query: 121 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180
           TRYGLKVGAIMAPFVR+LLM+FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE
Sbjct: 121 TRYGLKVGAIMAPFVRVLLMVFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180

Query: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCL 240
           AGKGGDLTHDETTIIAGALELTEKTAK+AMT ISNAFSLDLDATLDL             
Sbjct: 181 AGKGGDLTHDETTIIAGALELTEKTAKNAMTSISNAFSLDLDATLDL------------- 240

Query: 241 ISFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 300
                          +TLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDP+DR+ L+
Sbjct: 241 ---------------ETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPEDRVPLR 300

Query: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSGADAATQ 360
           KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDS A AAT 
Sbjct: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSAAGAATH 360

Query: 361 NLVMKLESVDAQTTAEKGGGQQTKKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSEE 420
           NL MK+ESVDAQT AEK GGQQTKKSPPATPAFKKRH+GCSFCILDVENAPLP+ PP EE
Sbjct: 361 NLAMKMESVDAQTIAEKAGGQQTKKSPPATPAFKKRHRGCSFCILDVENAPLPVLPPGEE 420

Query: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGTNPSQLSSYV 476
           VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQ SPEKP TNP QLS  V
Sbjct: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNPPQLSPNV 447

BLAST of MS008845.1 vs. ExPASy TrEMBL
Match: A0A6J1IXZ7 (DUF21 domain-containing protein At1g47330 OS=Cucurbita maxima OX=3661 GN=LOC111480949 PE=4 SV=1)

HSP 1 Score: 781.2 bits (2016), Expect = 2.5e-222
Identity = 405/475 (85.26%), Postives = 422/475 (88.84%), Query Frame = 0

Query: 1   MAEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60
           MAEDVRCCE +F+L++L IVGLV FAGLMAGLTLGLMSLGLVDLEVL+KSGRPQDRKHAA
Sbjct: 1   MAEDVRCCESKFFLFLLIIVGLVAFAGLMAGLTLGLMSLGLVDLEVLMKSGRPQDRKHAA 60

Query: 61  KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 120
           KI PVVKNQHLLLCTLLIGNSLAMEALP+FLD IVPPW A+LVSVTLILMFGEILPQAIC
Sbjct: 61  KILPVVKNQHLLLCTLLIGNSLAMEALPVFLDMIVPPWVAVLVSVTLILMFGEILPQAIC 120

Query: 121 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180
           TRYGLKVGAIMAPFVR+LL +FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE
Sbjct: 121 TRYGLKVGAIMAPFVRVLLTVFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180

Query: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCL 240
           AGKGGDLTHDETTIIAGALELTEKTAKDAMT ISNAFSLDLDATLDL             
Sbjct: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTSISNAFSLDLDATLDL------------- 240

Query: 241 ISFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 300
                          +TLNAIMTKGHSRVPVYSGDPKNI+GLVLVKNLLTVDP+D + L+
Sbjct: 241 ---------------ETLNAIMTKGHSRVPVYSGDPKNIVGLVLVKNLLTVDPEDGVPLR 300

Query: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSGADAATQ 360
           KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDS A AAT 
Sbjct: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSAAGAATH 360

Query: 361 NLVMKLESVDAQTTAEKGGGQQTKKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSEE 420
           N  MK+ESVDAQT AEK GGQQTKKSPPATPAFKKRH+GCSFCILDVENAPLP+ P  EE
Sbjct: 361 NFAMKMESVDAQTIAEKAGGQQTKKSPPATPAFKKRHRGCSFCILDVENAPLPVLPAGEE 420

Query: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGTNPSQLSSYV 476
           VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQ SPEKP TNP QLS  V
Sbjct: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNPPQLSPNV 447

BLAST of MS008845.1 vs. ExPASy TrEMBL
Match: A0A1S3CRQ3 (DUF21 domain-containing protein At1g47330 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103503947 PE=4 SV=1)

HSP 1 Score: 781.2 bits (2016), Expect = 2.5e-222
Identity = 408/477 (85.53%), Postives = 424/477 (88.89%), Query Frame = 0

Query: 1   MAEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60
           MAEDVRCCE +F+L++L I+GLV FAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA
Sbjct: 1   MAEDVRCCESKFFLFLLIIIGLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60

Query: 61  KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 120
           KI PVVKNQHLLLCTLLIGNSLAMEALPIFLD IVPPWAA+LVSVTLILMFGEILPQAIC
Sbjct: 61  KILPVVKNQHLLLCTLLIGNSLAMEALPIFLDMIVPPWAAVLVSVTLILMFGEILPQAIC 120

Query: 121 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180
           TRYGLKVGAIMAP VRILLM+FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE
Sbjct: 121 TRYGLKVGAIMAPLVRILLMVFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180

Query: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCL 240
           AGKGGDLTHDETTIIAGALELTEKTAKDAMT ISNAFSLDLDATLDL             
Sbjct: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTSISNAFSLDLDATLDL------------- 240

Query: 241 ISFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 300
                          +TLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDP+DR+SLK
Sbjct: 241 ---------------ETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPEDRVSLK 300

Query: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSGADAATQ 360
           KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHG+QSE L  KDNGVDSG  AA Q
Sbjct: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGHQSETLPNKDNGVDSGDAAAAQ 360

Query: 361 NLVMKLESVDAQTTAEKGGGQQTKKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSEE 420
           N+ MK+ESVDAQT AEK GGQQTKKSPPATPAFKKRH+GCSFCILDVENAPLP+FPP EE
Sbjct: 361 NIGMKMESVDAQTVAEKAGGQQTKKSPPATPAFKKRHRGCSFCILDVENAPLPVFPPREE 420

Query: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGTNPSQLSSYVKM 478
           VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQ SPEKP  N SQ S  V +
Sbjct: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSINQSQPSPNVNL 449

BLAST of MS008845.1 vs. ExPASy TrEMBL
Match: A0A0A0LGK3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G856000 PE=4 SV=1)

HSP 1 Score: 765.8 bits (1976), Expect = 1.1e-217
Identity = 403/477 (84.49%), Postives = 420/477 (88.05%), Query Frame = 0

Query: 1   MAEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60
           MAEDVRCCE +F+L++L I GLV FAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA
Sbjct: 1   MAEDVRCCESKFFLFLLIIAGLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60

Query: 61  KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 120
           KI PVVKNQHLLLCTLLIGNSLAMEALPIFLD IVPPWAA+LVSVTLILMFGEILPQAIC
Sbjct: 61  KILPVVKNQHLLLCTLLIGNSLAMEALPIFLDMIVPPWAAVLVSVTLILMFGEILPQAIC 120

Query: 121 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180
           TRYGLKVGAIMAP VRILL++FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE
Sbjct: 121 TRYGLKVGAIMAPLVRILLIVFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180

Query: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCL 240
           AGKGGDLTHDETTIIAGALELTEKTAKDAMT ISNAFSLDLDATLDL             
Sbjct: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTSISNAFSLDLDATLDL------------- 240

Query: 241 ISFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 300
                          +TLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDP+DR+SLK
Sbjct: 241 ---------------ETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPEDRVSLK 300

Query: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSGADAATQ 360
           KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHG+QSE L KKD GV+SG  AA Q
Sbjct: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGHQSETLPKKDIGVNSGDAAAAQ 360

Query: 361 NLVMKLESVDAQTTAEKGGGQQTKKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSEE 420
           N+ MK+ESVDAQT AEK GG QTKKSPPATPAFKKRH+GCSFCILDVENAPLP+FP  EE
Sbjct: 361 NIGMKMESVDAQTVAEKAGGLQTKKSPPATPAFKKRHRGCSFCILDVENAPLPVFPLGEE 420

Query: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGTNPSQLSSYVKM 478
           VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQ SPEK   N  QLS  V +
Sbjct: 421 VVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKLSINQPQLSPNVNL 449

BLAST of MS008845.1 vs. TAIR 10
Match: AT1G47330.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 601.3 bits (1549), Expect = 6.8e-172
Identity = 327/484 (67.56%), Postives = 371/484 (76.65%), Query Frame = 0

Query: 1   MAEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60
           M+ D+ CC   F LY++ I+ LV FAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDR +A 
Sbjct: 1   MSSDIPCCGTTFSLYVVIIIALVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRINAG 60

Query: 61  KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 120
           KIFPVVKNQHLLLCTLLIGNS+AMEALPIFLDKIVPPW AIL+SVTLIL+FGEI+PQA+C
Sbjct: 61  KIFPVVKNQHLLLCTLLIGNSMAMEALPIFLDKIVPPWLAILLSVTLILVFGEIMPQAVC 120

Query: 121 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180
           TRYGLKVGAIMAPFVR+LL+LFFPISYPISKVLDWMLGKGH VLLRRAELKTFVNFHGNE
Sbjct: 121 TRYGLKVGAIMAPFVRVLLVLFFPISYPISKVLDWMLGKGHGVLLRRAELKTFVNFHGNE 180

Query: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCL 240
           AGKGGDLT DET+II GALELTEKTAKDAMTPISNAFSL+LD  L+L             
Sbjct: 181 AGKGGDLTTDETSIITGALELTEKTAKDAMTPISNAFSLELDTPLNL------------- 240

Query: 241 ISFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 300
                          +TLN IM+ GHSRVPVY  +P +IIGL+LVKNLL VD    + L+
Sbjct: 241 ---------------ETLNTIMSVGHSRVPVYFRNPTHIIGLILVKNLLAVDARKEVPLR 300

Query: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSGADAAT- 360
           KM +RKIPRVSE MPLYDILNEFQKGHSHIAVV+K    Q ++    +NG++   +  T 
Sbjct: 301 KMSMRKIPRVSETMPLYDILNEFQKGHSHIAVVYKDLDEQEQSPETSENGIERRKNKKTK 360

Query: 361 ------------------QNLVMKLESVDAQTTAEKGGGQQT---KKSPPATPAFKKRHK 420
                             +  V K+E+ DA++   + G +Q    K S  A PA KKRH+
Sbjct: 361 DELFKDSCRKPKAQFEVSEKEVFKIETGDAKSGKSENGEEQQGSGKTSLLAAPA-KKRHR 420

Query: 421 GCSFCILDVENAPLPIFPPSEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQ 463
           GCSFCILD+EN P+P FP +EEVVGVITMEDVIEELLQEEILDETDEYVNIHNRI++NM 
Sbjct: 421 GCSFCILDIENTPIPDFPTNEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIRVNMH 455

BLAST of MS008845.1 vs. TAIR 10
Match: AT4G33700.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 442.6 bits (1137), Expect = 4.1e-124
Identity = 252/459 (54.90%), Postives = 311/459 (67.76%), Query Frame = 0

Query: 1   MAEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60
           MA +  CC P F++++  IV LV FAGLM+GLTLGLMSL LVDLEVL KSG P+ RK+AA
Sbjct: 1   MAVEYVCCSPNFFIHIAVIVFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPEHRKYAA 60

Query: 61  KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 120
           KI PVVKNQHLLL TLLI N+ AME LPIFLD +V  W AIL+SVTLIL+FGEI+PQ+IC
Sbjct: 61  KILPVVKNQHLLLVTLLICNAAAMETLPIFLDGLVTAWGAILISVTLILLFGEIIPQSIC 120

Query: 121 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180
           +RYGL +GA +APFVR+L+ +  P+++PISK+LD++LG   A L RRAELKT V+FHGNE
Sbjct: 121 SRYGLAIGATVAPFVRVLVFICLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCL 240
           AGKGG+LTHDETTIIAGALEL+EK  KDAMTPIS+ F +D++A LD              
Sbjct: 181 AGKGGELTHDETTIIAGALELSEKMVKDAMTPISDIFVIDINAKLD-------------- 240

Query: 241 ISFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 300
                         RD +N I+ KGHSRVPVY   P NIIGLVLVKNLLT++PD+ I +K
Sbjct: 241 --------------RDLMNLILEKGHSRVPVYYEQPTNIIGLVLVKNLLTINPDEEIPVK 300

Query: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKK----HGYQSEALLKKDNGVDSGAD 360
            + IR+IPRV E +PLYDILNEFQKG SH+AVV ++    H   S+    K+  VD  ++
Sbjct: 301 NVTIRRIPRVPEILPLYDILNEFQKGLSHMAVVVRQCDKIHPLPSKNGSVKEARVDVDSE 360

Query: 361 AA---------TQNLVMKLESVDAQTTAEKGGGQQTKKSPPATPAFKKRHKGCSFCILDV 420
                      T+  + K +S   + ++ KGG +            KK  K     IL +
Sbjct: 361 GTPTPQERMLRTKRSLQKWKSFPNRASSFKGGSKS-----------KKWSKDNDADILQL 420

Query: 421 ENAPLPIFPPSEEVVGVITMEDVIEELLQEEILDETDEY 447
              PLP     EE VG+ITMEDVIEELLQEEI DETD +
Sbjct: 421 NGNPLPKLAEEEEAVGIITMEDVIEELLQEEIFDETDHH 420

BLAST of MS008845.1 vs. TAIR 10
Match: AT2G14520.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 439.1 bits (1128), Expect = 4.5e-123
Identity = 251/455 (55.16%), Postives = 308/455 (67.69%), Query Frame = 0

Query: 1   MAEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAA 60
           MA +  CC   F++++  IV LV FAGLM+GLTLGLMS+ LVDLEVL KSG P+DR HAA
Sbjct: 1   MAVEYECCGTSFFIHIAVIVLLVLFAGLMSGLTLGLMSMSLVDLEVLAKSGTPRDRIHAA 60

Query: 61  KIFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAIC 120
           KI PVVKNQHLLLCTLLI N+ AMEALPIFLD +V  W AIL+SVTLIL+FGEI+PQ++C
Sbjct: 61  KILPVVKNQHLLLCTLLICNAAAMEALPIFLDALVTAWGAILISVTLILLFGEIIPQSVC 120

Query: 121 TRYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNE 180
           +R+GL +GA +APFVR+L+ +  P+++PISK+LD++LG G   L RRAELKT V+ HGNE
Sbjct: 121 SRHGLAIGATVAPFVRVLVWICLPVAWPISKLLDFLLGHGRVALFRRAELKTLVDLHGNE 180

Query: 181 AGKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCL 240
           AGKGG+LTHDETTIIAGALEL+EK AKDAMTPIS+ F +D++A LD              
Sbjct: 181 AGKGGELTHDETTIIAGALELSEKMAKDAMTPISDTFVIDINAKLD-------------- 240

Query: 241 ISFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLK 300
                         RD +N I+ KGHSRVPVY     NIIGLVLVKNLLT++PD+ I +K
Sbjct: 241 --------------RDLMNLILDKGHSRVPVYYEQRTNIIGLVLVKNLLTINPDEEIQVK 300

Query: 301 KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKK----HGYQS-EALLKKDNGVDSGA 360
            + IR+IPRV E +PLYDILNEFQKGHSH+AVV ++    H  QS +A  +  N V    
Sbjct: 301 NVTIRRIPRVPETLPLYDILNEFQKGHSHMAVVVRQCDKIHPLQSNDAANETVNEVRVDV 360

Query: 361 DAATQNLVMKLESVDAQTTAEKGGGQQTKKSPPATPAFKKRHKGCS----FCILDVENAP 420
           D        KL+         +   Q+ K  P    +   R K  S      IL +   P
Sbjct: 361 DYERSPQETKLK--------RRRSLQKWKSFPNRANSLGSRSKRWSKDNDADILQLNEHP 419

Query: 421 LPIFPPSEEVVGVITMEDVIEELLQEEILDETDEY 447
           LP     E+ VG+ITMEDVIEELLQEEI DETD +
Sbjct: 421 LPKLDEEEDAVGIITMEDVIEELLQEEIFDETDHH 419

BLAST of MS008845.1 vs. TAIR 10
Match: AT5G52790.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 428.7 bits (1101), Expect = 6.1e-120
Identity = 235/467 (50.32%), Postives = 314/467 (67.24%), Query Frame = 0

Query: 2   AEDVRCCEPEFWLYMLAIVGLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAK 61
           A DV CCE  FW+Y+L  V LV FAGLM+GLTLGLMSL +V+LEV+IK+G P DRK+A K
Sbjct: 3   ANDVPCCETMFWVYLLVCVALVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNAEK 62

Query: 62  IFPVVKNQHLLLCTLLIGNSLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAICT 121
           I P+VKNQHLLLCTLLIGN+LAMEALPIF+D ++P W AIL+SVTLIL FGEI+PQA+C+
Sbjct: 63  ILPLVKNQHLLLCTLLIGNALAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAVCS 122

Query: 122 RYGLKVGAIMAPFVRILLMLFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEA 181
           RYGL +GA ++  VR+++++FFP+SYPISK+LD +LGK H+ LL RAELK+ V  HGNEA
Sbjct: 123 RYGLSIGAKLSFLVRLIIIVFFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGNEA 182

Query: 182 GKGGDLTHDETTIIAGALELTEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCLI 241
           GKGG+LTHDETTII+GAL++++K+AKDAMTP+S  FSLD++  LD               
Sbjct: 183 GKGGELTHDETTIISGALDMSQKSAKDAMTPVSQIFSLDINFKLD--------------- 242

Query: 242 SFFRFSKAPSVYGRDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLKK 301
                          T+  I + GHSR+P+YS +P  IIG +LVKNL+ V P+D  S++ 
Sbjct: 243 -------------EKTMGLIASAGHSRIPIYSVNPNVIIGFILVKNLIKVRPEDETSIRD 302

Query: 302 MIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVF--KKHGYQSEALLKKDNGVDSGADAAT 361
           + IR++P+V  ++PLYDILN FQ G SH+A V   K H   +  + +K        DA  
Sbjct: 303 LPIRRMPKVDLNLPLYDILNIFQTGRSHMAAVVGTKNHTNTNTPVHEKSINGSPNKDA-- 362

Query: 362 QNLVMKLESVDAQTTAEKGGGQQTKKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSE 421
            N+ + + ++++  T+         +SP                I  +++    I    E
Sbjct: 363 -NVFLSIPALNSSETSH--------QSP----------------IRYIDS----ISDEDE 410

Query: 422 EVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGT 467
           EV+G+IT+EDV+EEL+QEEI DETD+YV +H RI INM  S   P T
Sbjct: 423 EVIGIITLEDVMEELIQEEIYDETDQYVELHKRITINMPMSGNSPET 410

BLAST of MS008845.1 vs. TAIR 10
Match: AT4G14240.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 382.5 bits (981), Expect = 5.0e-106
Identity = 224/452 (49.56%), Postives = 296/452 (65.49%), Query Frame = 0

Query: 22  LVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKIFPVVKNQHLLLCTLLIGNS 81
           LV FAG+M+GLTLGLMSLGLV+LE+L +SG P ++K AA IFPVV+ QH LL TLL+ N+
Sbjct: 45  LVLFAGIMSGLTLGLMSLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLLLCNA 104

Query: 82  LAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRILLML 141
           +AME LPI+LDK+   + AI++SVT +L FGE++PQAICTRYGL VGA     VRIL+ L
Sbjct: 105 MAMEGLPIYLDKLFNEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTL 164

Query: 142 FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALEL 201
            +PI++PI K+LD +LG   A L RRA+LK  V+ H  EAGKGG+LTHDETTII+GAL+L
Sbjct: 165 CYPIAFPIGKILDLVLGHNDA-LFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDL 224

Query: 202 TEKTAKDAMTPISNAFSLDLDATLDLWVFSFFQFLYVCLISFFRFSKAPSVYGRDTLNAI 261
           TEKTA++AMTPI + FSLD+++ LD W                           + +  I
Sbjct: 225 TEKTAQEAMTPIESTFSLDVNSKLD-W---------------------------EAMGKI 284

Query: 262 MTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPDDRISLKKMIIRKIPRVSEDMPLYDILN 321
           + +GHSRVPVYSG+PKN+IGL+LVK+LLTV P+    +  + IR+IPRV  DMPLYDILN
Sbjct: 285 LARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRRIPRVPADMPLYDILN 344

Query: 322 EFQKGHSHIAVVFKKHGYQS--EALLKKDNGVDSGADAATQNLVMKLES--VDAQTTAEK 381
           EFQKG SH+A V K  G      + L +++  +S     T  L++K E    +   T +K
Sbjct: 345 EFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLLKREGNHDNVIVTIDK 404

Query: 382 GGGQQTKKSPPATPAFKKRHKGCSFCILDVENAPLPIFPPSEEVVGVITMEDVIEELLQE 441
             GQ   ++  + P       G S     +E+          EV+G+IT+EDV EELLQE
Sbjct: 405 ANGQSFFQNNESGP------HGFSHTSEAIEDG---------EVIGIITLEDVFEELLQE 452

Query: 442 EILDETDEYVNIHNRIKINMQASPEKPGTNPS 470
           EI+DETDEYV++H RI++   A+       PS
Sbjct: 465 EIVDETDEYVDVHKRIRVAAAAAASSIARAPS 452

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022140386.18.2e-24493.71DUF21 domain-containing protein At1g47330-like [Momordica charantia][more]
XP_022941204.11.5e-22486.32DUF21 domain-containing protein At1g47330 [Cucurbita moschata][more]
KAG6608536.11.5e-22486.32DUF21 domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia][more]
XP_023525037.14.2e-22486.32DUF21 domain-containing protein At1g47330 [Cucurbita pepo subsp. pepo][more]
XP_022981956.15.2e-22285.26DUF21 domain-containing protein At1g47330 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
Q8RY609.6e-17167.56DUF21 domain-containing protein At1g47330 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Q8VZI25.7e-12354.90DUF21 domain-containing protein At4g33700 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Q9ZQR46.3e-12255.16DUF21 domain-containing protein At2g14520 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Q9LTD88.6e-11950.32DUF21 domain-containing protein At5g52790 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Q67XQ07.0e-10549.56DUF21 domain-containing protein At4g14240 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Match NameE-valueIdentityDescription
A0A6J1CFY34.0e-24493.71DUF21 domain-containing protein At1g47330-like OS=Momordica charantia OX=3673 GN... [more]
A0A6J1FMM07.1e-22586.32DUF21 domain-containing protein At1g47330 OS=Cucurbita moschata OX=3662 GN=LOC11... [more]
A0A6J1IXZ72.5e-22285.26DUF21 domain-containing protein At1g47330 OS=Cucurbita maxima OX=3661 GN=LOC1114... [more]
A0A1S3CRQ32.5e-22285.53DUF21 domain-containing protein At1g47330 isoform X1 OS=Cucumis melo OX=3656 GN=... [more]
A0A0A0LGK31.1e-21784.49Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G856000 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G47330.16.8e-17267.56CBS domain-containing protein with a domain of unknown function (DUF21) [more]
AT4G33700.14.1e-12454.90CBS domain-containing protein with a domain of unknown function (DUF21) [more]
AT2G14520.14.5e-12355.16CBS domain-containing protein with a domain of unknown function (DUF21) [more]
AT5G52790.16.1e-12050.32CBS domain-containing protein with a domain of unknown function (DUF21) [more]
AT4G14240.15.0e-10649.56CBS domain-containing protein with a domain of unknown function (DUF21) [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002550CNNM, transmembrane domainPFAMPF01595DUF21coord: 18..189
e-value: 1.2E-34
score: 119.5
IPR002550CNNM, transmembrane domainPROSITEPS51846CNNMcoord: 8..191
score: 56.445137
NoneNo IPR availableGENE3D3.10.580.10coord: 388..442
e-value: 2.9E-6
score: 29.2
coord: 189..351
e-value: 1.3E-36
score: 127.9
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 371..394
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 374..388
NoneNo IPR availablePANTHERPTHR12064:SF36DOMAIN-CONTAINING PROTEIN, PUTATIVE, EXPRESSED-RELATEDcoord: 1..470
NoneNo IPR availableSUPERFAMILY54631CBS-domain paircoord: 255..436
IPR045095Ancient conserved domain protein familyPANTHERPTHR12064ANCIENT CONSERVED DOMAIN PROTEIN-RELATEDcoord: 1..470
IPR044751Ion transporter-like, CBS domainCDDcd04590CBS_pair_CorC_HlyC_assoccoord: 205..338
e-value: 3.62855E-27
score: 103.344

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
MS008845MS008845gene


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
MS008845.1-cdsMS008845.1-cds-scaffold4:4203733..4203808CDS
MS008845.1-cdsMS008845.1-cds-scaffold4:4204055..4204101CDS
MS008845.1-cdsMS008845.1-cds-scaffold4:4204216..4204481CDS
MS008845.1-cdsMS008845.1-cds-scaffold4:4204860..4204975CDS
MS008845.1-cdsMS008845.1-cds-scaffold4:4205087..4205160CDS
MS008845.1-cdsMS008845.1-cds-scaffold4:4205829..4206140CDS
MS008845.1-cdsMS008845.1-cds-scaffold4:4206309..4206395CDS
MS008845.1-cdsMS008845.1-cds-scaffold4:4206616..4206729CDS
MS008845.1-cdsMS008845.1-cds-scaffold4:4206810..4206893CDS
MS008845.1-cdsMS008845.1-cds-scaffold4:4206985..4207061CDS
MS008845.1-cdsMS008845.1-cds-scaffold4:4207404..4207581CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
MS008845.1MS008845.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010960 magnesium ion homeostasis