CmoCh19G000510 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh19G000510
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionDUF21 domain-containing protein At4g14240
LocationCmo_Chr19: 293532 .. 299003 (+)
RNA-Seq ExpressionCmoCh19G000510
SyntenyCmoCh19G000510
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTCGCTTTCTCGTTGCTCATACCGCCCCTTTCCCTTCTGTCTGCCGCCCCTTTTTCCGGCAATGCTGCTCCTTAATTCCTTGACATTGGCCACCACCGCCGCCGCCTTCTTCTTCAACAACAACCACAATCATTACCAGGTCCCTGTGGTGGAAGCCGATGATATTCCTTTTCCCACTGCATGGTGGTTCGTTTACGCTGGCCTCTCCTGTCTCCTCGTTTTGTTCGCTGGTATTATGTCTGGCCTCACCCTTGGCCTCATGTCTCTCAACCTCGTCGAGCTTGAAATCTTGCAGCGCACTGGTTCTTCCACCGAGAAAAAGCAAGCTGGTCCGCTGTCTTTCCACTCTCCCTCTGGTTCAATTCTTTTGTAATTATGCTCGTACTCGCTTTTCATTGCTTGATTTCTATTTGCGTAGCTACTATTATTCCGGTGCTGCAGAAGCAGCACCAATTGCTAGTTACGCTGCTGCTTTGTAATGCTTGTGCTATGGAGGTATTCGATTCTGCACTCTCAAATTCTGTTCTAATTACGACTTTTCTTTTTCTTCAATTCAATCTATGGATTCAATGCTGAATACTATTAATTTCTGGTTACTTGCATCCCGCTCTTATTATTTGGAGCGGTCAGATTTATGGCAAGTCGATAATTCATTGCTGACGTTCATTTCGTTTTTAAATTTTCTTGAAGGCTCTTCCTATATACCTTGATAAAATCTTCCATCCTTTTGTCGCGGTTTTGCTATCTGTGACCTTTGTTCTTGTCTTTGGCGAGGTATAACATTTTTCCTATTGACAATACGTTTCATTCTATAACAAGCCTATGCCATATTGCATCTAATTTTCATTATGTAGTAGATTATTCCGCAAGCAATATGCTCGAGATATGGACTTGCTGTTGGTGCAAATTTTATATGGCTAGTGCGTATTTTGATGATCATCTGTTATCCAATTTCATACCCGGTTGGAAAGGTAACCTCTGAAGTAGGCAGAAAGCTGCTGCTGCTGCTGCTGCTGCATTCCTTTTTTTCTCTTGCATATGAAATTTACTATGTGCATGATTGCTTGTAGATTGTACTCTATTTCCTTATTCCACTCTACTCTGATCATGCATTGACTCCAAAAAGGTCGTCTATTAACCTTTTGTTGTTTTCCCTCTAATTGAAGTCGTCTATTTACTTTTCTATAAATTTCATGATTGGTGATTATATTAGGGTAATCATGAAATCCAGAATTTGAGTAAGCCTTAAATCTGCTAGTTTGTCTCGTATGCGTGCTGCATTATTTGAGCTGAACCCATGGGTTGTAGCGACCTATGATTGAACTAAACAATAAAGCATCGACTTTAAGTCTTTACTCATGTATTTCTTCTTTGGCTTCTAATGTAGGCTTCACTTTTGCAGTGTCAAAAAGTTAACCTGGACCTTCTTCCATATTCTCAACCTTCTCTTGCAGTATATATCTCCTTAGTTCTACCTTTTCAAAGAGAAAAGGAATAACTGTACTTTTTTTTTTCTTTTTTTTGTCAATGAAAAGGAGTAATTTGTCTTCCTTTTTTCTATCTTGTTTAATTAACTGTTTTGCGAACTAGGTGATTGCATATTTGAGATTGAAGAGATGTACACTGGGTTATTTCACTTTCTTCATGTGGCCTTTGTTATTACTTACCGATATTCTGGCTCTAGGTCTTGGATGCATTACTTGGTCACCATGATGCTCTGTTTAGGAGAGCTCAGTTGAAAGCCCTTGTTTCTATCCATGGACAAGAGGTAACAAGAAGTATAATATAGTAGCATTTGAAGTTAATGAGATATATTCTGGTTTGGACAAAGTATGAAAACCGTGCTATAACATTGTGGAAGGGTGCAAGATTTGATTAAGAGCAAAATGTCAGGGATTTATTATGAAATGGTCCTTAAGTTACTTTTTCATTTAGGCTGGGAAGGGAGGTGAACTCACACACGATGAGACGACCATCATCAGTGGGGCATTGGACTTGACAGAAAAGGTATCTATTCTCATAAGATGAGCCTTGTAATCTTTATGCATATTATGATTTATCTACCATCTTTGTTTTATCATGCCATTTTGATTTAGGGAAACGCCTGCATTTTGGTTTGGTATAAGAAAATACTTGTTCTGAAAACACGTGTATAGAATCTTAATAAAGGGTTAGTCTAATATAACAAGTATTCATTTTTTTCCTCTAGTAAATTGCCTGCATTTTGGTTTGGTATAAGAAAATACTTGTTCTGAAAACACGTGTATAGAATCTTAATAAAGGGTTAGTCTAATATAACAAGTATTCAGTTTTTTCCTCTAGTAAATTGGGAGTTTCAAGGATTTTTGCTTGCAAAATCTACTGGATATGTGGATATTAGTTGCTTTTTATACCATCACCATCTGGTTGTGTCCCTAGTCATTATGTATTTATTCATATGAACAAAATTTTCCTTGTACTTTTCAATATAACTTTACAGCTAATATATATCATATATGTGCCTCCTTAACATATTAACCATTTTCTATTCTGTTTTTTCATTTTCAAAGTTCCAATTTAATTCCTCACGTATTGCTTTTTTTATTTTTCAATCAAGTCCTTTTTCTGTAAACACTAAAAAGAAAGGTGTGGTAATTAAGGATGGATTTTTTTATTATTTTTATTGAGACCTTCGGCTGTTAGTCTCACTCATAAGCTTGACTCCTTTAAGATTATGAGACATCTTTTCGATTTACTTTGTTGTAAAGAAAATTTACAGATCAAATGACTGAACTTTCTGAATGCAGACTGCAGAGGCGGCTATGACACCAATAGAATCAACATTTTCTTTGGATGTGAATTCCAACTTGGACTGGTTAGTTTGGCATCAAAATCATGATTAATTTATTTCTTGTTAAACCTTGCCTTTCTGTTCATTTATATTAATAATGCTGATGATTGCTAATATCTTTTCTGGCAGGTTTAAATTCCTTGAATCCTCTTCTCTATCTCTATACAATTACTTTCCAAGATTTTTCTCTAAAAACTTTATCTCTTCTTCTCTTCACTTTTATTTCCCTGCCTCCACATATGTTTCTCTCCTCTTTAAAAAGCGGAAAACAAAAAATGGTTTCATGGTTTTTGTTTTTACAAACAGAAAAACCAAAAACAGAAAACAGAAATGATTATCAAACTAAACTGGCACCCAAGTAATTGCAAACTGATTTTTACCCTATTTCTTGTACAGGGAAGCAATTGGAAAAATACTTGCACGTGGTCATAGTCGTGTCCCAGTCTATTCTGAGAATCCAAAGAATATAATTGGTCTCCTATTGGTATCAGTAGAATTCTTTGCAATTACATTTATGCCTTCTCGTTTTCAACCTCCATTTGACTTGTATGTTTCTTTTCAAATGCAATGGCAAGCTCTCAAGAGTAGGAAATAACATCGCTCATTTTCTTAAAAAGAAATTCCAATTTTCTTTTGATTGTTGAGCCCTGCACTATCTTTTTCAATTAGTTGCAAGTTTTTTTTTTCCCCCTGGATTCAGGTGAAAAGTCTTCTGACAGTAAGAGCAGAAACGGAAACTCCAGTCAGTGCTGTTTCCATACGGAGAATTCCTAGGTAATGAATGTTCTTCCCTCATATCTAATTAATTATAAATGACTTCACCTCTTAACATGTTATGCAGGGTTCCTTCAGATATGCCATTATATGATATCCTAAATGAGTTCCAAAAGGGAAGTAGTCATATGGCGGCTGTAGTCAAGGTCAAGGAGAAGAATAAGAACTCAGTGTTTGTCAGTGATGGAGATAAACATGAGAAAAATAAATTTACCTCTAAGATATCTCCGCTTTTCATTCCCTTGCTCTCAAAACATGACAATGATTTGGAACGTGTTGATGTTGACATTGAAAAAGCTCCAAGGAATACTGATATCAGGCAAACCATGCAGCATAATGTGGTTGCAACAAATGGAGTGTGCAATTTGTTTGAAGATATTGAAGATGGGGAAGTTATTGGGATAATCACTTTAGAGGATGTTTTTGAAGAACTTCTGCAAGTGAGCACAACTTCCACGATTTATATAGTAGAAAATAATTATTAATTTGTTCTATCATGACAGGTAGGATTAGGCAGCTGAACTATCAACTTAAATTTTAACTCCTGGTGACCAATCTTTTTGTGCAAATAAAATTCATTTACGTTTTATCTTTTCTGTGTTTCTTCATAAAAAATGCACGTATCCATGTTTCAACGTATATTGTAAAAATAGATAATTGGCTTCCACTGAATTGCCTCATAATGCAGGAGGAAATTGTAGATGAGACAGACGTATATGTGGATGTCCATAAAAGGTACATATTTTTTTTCTTTTAGCATCAGGCTTATTTGGCTGATTGTGTGCTTGAATGTAGAATGTTTCTCACGTTTGTAGGATACGTGTGGCTGCGGCTGCAGCTGTGGCTGCTTCATCTGTGGCCCGAGCTCCATCAACTCGTAGGTTGCCACTTGACAGGTCAAAAGTCCACCGTAAGTCAAAGTCCAATTCTCAAGTTGTTGAGTACTTACTTTTATGTAGCTATTAGCAGCATTTCTAAAAGCTAAAATTGATCCTACATCAAAATATTCTTTTCTTGGTTTGTTCTACTCTATTTGATGAGTCTTTTTGGCCAATTATATCCATTACGGCTGAGTTAAATTGCATAAGATTTTGGAAAATGAACGAACTCCCAAGGTGTAATTGCCGTATCAAGAATACAGGCAACAGGAAGTGCTTTGTTATGACGTGTTTGGGAATGATTTTGTTTTGTTTTTTTTTTGTTATTCATTTACATTCTTTAATAAACAGGTATTTTAGCGATTCTATGTTTATTTTATCCCATTTCCATGATAACTTTCCAAGATCACACTTTTGTTTCTCCTTCCAAAATAATAAGAAATAGAGGGAAAGGTAGCAGAAATGTTCCCCAACGGGCCTTTATTTGCACTACTTTTTGACTAATAAATGTCTCTAATATTGTGAGCGTGTAAGTTAAATAATTTGTAATTCACAGGGAGTTCGAAGTAGGGTGGTCAAGATCCAGGAAGGCTGCTGGAGATGTAGATTCGATGAGATTTTAAGGAATTGCGCTGGTACTTCTACCTAGTATTAAAAGATGAATCAACAAAGTATCAAACGGAATTCATTACGTGCTTCACCAAGCTTTAAACTCTCCAGCTTCCGTAGTATCGATCAGAAGATTTTTAATTTTAGCTTTAGAATAAATAACCATCCTAAGAAAGCTCTGTCGTCATTACTTCAAGCAAATCAATAACGAGAATAGTATGAAATTTTCTCTTTCTTGTACACAGTAGAAGATAATAGTAGCTGCTGATGTAGAGAACGAAAGAGCAATATAATATGACTCGCTTTTGTTAATTCATCTTCAATGGGTTTTTGTGGGGGGAGAGTTACTGTAAATATATGTTTAGATAATTTATTATTCAT

mRNA sequence

TTCGCTTTCTCGTTGCTCATACCGCCCCTTTCCCTTCTGTCTGCCGCCCCTTTTTCCGGCAATGCTGCTCCTTAATTCCTTGACATTGGCCACCACCGCCGCCGCCTTCTTCTTCAACAACAACCACAATCATTACCAGGTCCCTGTGGTGGAAGCCGATGATATTCCTTTTCCCACTGCATGGTGGTTCGTTTACGCTGGCCTCTCCTGTCTCCTCGTTTTGTTCGCTGGTATTATGTCTGGCCTCACCCTTGGCCTCATGTCTCTCAACCTCGTCGAGCTTGAAATCTTGCAGCGCACTGGTTCTTCCACCGAGAAAAAGCAAGCTGCTACTATTATTCCGGTGCTGCAGAAGCAGCACCAATTGCTAGTTACGCTGCTGCTTTGTAATGCTTGTGCTATGGAGGCTCTTCCTATATACCTTGATAAAATCTTCCATCCTTTTGTCGCGGTTTTGCTATCTGTGACCTTTGTTCTTGTCTTTGGCGAGATTATTCCGCAAGCAATATGCTCGAGATATGGACTTGCTGTTGGTGCAAATTTTATATGGCTAGTGCGTATTTTGATGATCATCTGTTATCCAATTTCATACCCGGTTGGAAAGGTCTTGGATGCATTACTTGGTCACCATGATGCTCTGTTTAGGAGAGCTCAGTTGAAAGCCCTTGTTTCTATCCATGGACAAGAGGCTGGGAAGGGAGGTGAACTCACACACGATGAGACGACCATCATCAGTGGGGCATTGGACTTGACAGAAAAGACTGCAGAGGCGGCTATGACACCAATAGAATCAACATTTTCTTTGGATGTGAATTCCAACTTGGACTGGGAAGCAATTGGAAAAATACTTGCACGTGGTCATAGTCGTGTCCCAGTCTATTCTGAGAATCCAAAGAATATAATTGGTCTCCTATTGGTGAAAAGTCTTCTGACAGTAAGAGCAGAAACGGAAACTCCAGTCAGTGCTGTTTCCATACGGAGAATTCCTAGGGTTCCTTCAGATATGCCATTATATGATATCCTAAATGAGTTCCAAAAGGGAAGTAGTCATATGGCGGCTGTAGTCAAGGTCAAGGAGAAGAATAAGAACTCAGTGTTTGTCAGTGATGGAGATAAACATGAGAAAAATAAATTTACCTCTAAGATATCTCCGCTTTTCATTCCCTTGCTCTCAAAACATGACAATGATTTGGAACGTGTTGATGTTGACATTGAAAAAGCTCCAAGGAATACTGATATCAGGCAAACCATGCAGCATAATGTGGTTGCAACAAATGGAGTGTGCAATTTGTTTGAAGATATTGAAGATGGGGAAGTTATTGGGATAATCACTTTAGAGGATGTTTTTGAAGAACTTCTGCAAGAGGAAATTGTAGATGAGACAGACGTATATGTGGATGTCCATAAAAGGATACGTGTGGCTGCGGCTGCAGCTGTGGCTGCTTCATCTGTGGCCCGAGCTCCATCAACTCGTAGGTTGCCACTTGACAGGTCAAAAGTCCACCGGAGTTCGAAGTAGGGTGGTCAAGATCCAGGAAGGCTGCTGGAGATGTAGATTCGATGAGATTTTAAGGAATTGCGCTGGTACTTCTACCTAGTATTAAAAGATGAATCAACAAAGTATCAAACGGAATTCATTACGTGCTTCACCAAGCTTTAAACTCTCCAGCTTCCGTAGTATCGATCAGAAGATTTTTAATTTTAGCTTTAGAATAAATAACCATCCTAAGAAAGCTCTGTCGTCATTACTTCAAGCAAATCAATAACGAGAATAGTATGAAATTTTCTCTTTCTTGTACACAGTAGAAGATAATAGTAGCTGCTGATGTAGAGAACGAAAGAGCAATATAATATGACTCGCTTTTGTTAATTCATCTTCAATGGGTTTTTGTGGGGGGAGAGTTACTGTAAATATATGTTTAGATAATTTATTATTCAT

Coding sequence (CDS)

ATGCTGCTCCTTAATTCCTTGACATTGGCCACCACCGCCGCCGCCTTCTTCTTCAACAACAACCACAATCATTACCAGGTCCCTGTGGTGGAAGCCGATGATATTCCTTTTCCCACTGCATGGTGGTTCGTTTACGCTGGCCTCTCCTGTCTCCTCGTTTTGTTCGCTGGTATTATGTCTGGCCTCACCCTTGGCCTCATGTCTCTCAACCTCGTCGAGCTTGAAATCTTGCAGCGCACTGGTTCTTCCACCGAGAAAAAGCAAGCTGCTACTATTATTCCGGTGCTGCAGAAGCAGCACCAATTGCTAGTTACGCTGCTGCTTTGTAATGCTTGTGCTATGGAGGCTCTTCCTATATACCTTGATAAAATCTTCCATCCTTTTGTCGCGGTTTTGCTATCTGTGACCTTTGTTCTTGTCTTTGGCGAGATTATTCCGCAAGCAATATGCTCGAGATATGGACTTGCTGTTGGTGCAAATTTTATATGGCTAGTGCGTATTTTGATGATCATCTGTTATCCAATTTCATACCCGGTTGGAAAGGTCTTGGATGCATTACTTGGTCACCATGATGCTCTGTTTAGGAGAGCTCAGTTGAAAGCCCTTGTTTCTATCCATGGACAAGAGGCTGGGAAGGGAGGTGAACTCACACACGATGAGACGACCATCATCAGTGGGGCATTGGACTTGACAGAAAAGACTGCAGAGGCGGCTATGACACCAATAGAATCAACATTTTCTTTGGATGTGAATTCCAACTTGGACTGGGAAGCAATTGGAAAAATACTTGCACGTGGTCATAGTCGTGTCCCAGTCTATTCTGAGAATCCAAAGAATATAATTGGTCTCCTATTGGTGAAAAGTCTTCTGACAGTAAGAGCAGAAACGGAAACTCCAGTCAGTGCTGTTTCCATACGGAGAATTCCTAGGGTTCCTTCAGATATGCCATTATATGATATCCTAAATGAGTTCCAAAAGGGAAGTAGTCATATGGCGGCTGTAGTCAAGGTCAAGGAGAAGAATAAGAACTCAGTGTTTGTCAGTGATGGAGATAAACATGAGAAAAATAAATTTACCTCTAAGATATCTCCGCTTTTCATTCCCTTGCTCTCAAAACATGACAATGATTTGGAACGTGTTGATGTTGACATTGAAAAAGCTCCAAGGAATACTGATATCAGGCAAACCATGCAGCATAATGTGGTTGCAACAAATGGAGTGTGCAATTTGTTTGAAGATATTGAAGATGGGGAAGTTATTGGGATAATCACTTTAGAGGATGTTTTTGAAGAACTTCTGCAAGAGGAAATTGTAGATGAGACAGACGTATATGTGGATGTCCATAAAAGGATACGTGTGGCTGCGGCTGCAGCTGTGGCTGCTTCATCTGTGGCCCGAGCTCCATCAACTCGTAGGTTGCCACTTGACAGGTCAAAAGTCCACCGGAGTTCGAAGTAG

Protein sequence

MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMSGLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIYLDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVGKVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMTPIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTSKISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVIGIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRLPLDRSKVHRSSK
Homology
BLAST of CmoCh19G000510 vs. ExPASy Swiss-Prot
Match: Q9ZVS8 (Putative DUF21 domain-containing protein At1g03270 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF4 PE=4 SV=2)

HSP 1 Score: 621.3 bits (1601), Expect = 9.1e-177
Identity = 341/463 (73.65%), Postives = 383/463 (82.72%), Query Frame = 0

Query: 8   TLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMSGLTLGLM 67
           TLA   AA+  N+        V EA+DI F + WWFV  G++C LVLFAGIMSGLTLGLM
Sbjct: 6   TLALVRAAYSLNSF-------VFEAEDIRFGSPWWFVVVGVACFLVLFAGIMSGLTLGLM 65

Query: 68  SLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIYLDKIFHP 127
           SL LVELEILQ++GSS EKKQAA I+PV++KQHQLLVTLLLCNA AMEALPI LDKIFHP
Sbjct: 66  SLGLVELEILQQSGSSAEKKQAAAILPVVKKQHQLLVTLLLCNAAAMEALPICLDKIFHP 125

Query: 128 FVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVGKVLDALL 187
           FVAVLLSVTFVL FGEIIPQAICSRYGLAVGANF+WLVRILMIICYPI+YP+GKVLDA++
Sbjct: 126 FVAVLLSVTFVLAFGEIIPQAICSRYGLAVGANFLWLVRILMIICYPIAYPIGKVLDAVI 185

Query: 188 GHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMTPIESTFS 247
           GH+D LFRRAQLKALVSIH QEAGKGGELTH+ET IISGALDL++KTAE AMTPIESTFS
Sbjct: 186 GHNDTLFRRAQLKALVSIHSQEAGKGGELTHEETMIISGALDLSQKTAEEAMTPIESTFS 245

Query: 248 LDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPVSAVSIRR 307
           LDVN+ LDWE IGKIL+RGHSR+PVY  NPKNIIGLLLVKSLLTVRAETE PVS+VSIR+
Sbjct: 246 LDVNTKLDWETIGKILSRGHSRIPVYLGNPKNIIGLLLVKSLLTVRAETEAPVSSVSIRK 305

Query: 308 IPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNK--NSVFVSDGDKHEKNKFTSKISPL 367
           IPRVPSDMPLYDILNEFQKGSSHMAAVVKVK+K+K  N   +S+G+  ++N    + S L
Sbjct: 306 IPRVPSDMPLYDILNEFQKGSSHMAAVVKVKDKDKKNNMQLLSNGETPKENMKFYQSSNL 365

Query: 368 FIPLLSKHDNDLERVDVDIEKAPRNTDIR-QTMQHNVVATNGVCNLFEDIEDGEVIGIIT 427
             PLL    +D   V VDI+K P++   R +  Q N   T  +  L ED ED EVIGIIT
Sbjct: 366 TAPLLKHESHD---VVVDIDKVPKHVKNRGRNFQQNGTVTRDLPCLLEDNEDAEVIGIIT 425

Query: 428 LEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARA 468
           LEDVFEELLQ EIVDETDVY+DVHKR+RVAAAAA A SS+ RA
Sbjct: 426 LEDVFEELLQAEIVDETDVYIDVHKRVRVAAAAAAAVSSITRA 458

BLAST of CmoCh19G000510 vs. ExPASy Swiss-Prot
Match: Q67XQ0 (DUF21 domain-containing protein At4g14240 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF1 PE=1 SV=1)

HSP 1 Score: 601.3 bits (1549), Expect = 9.8e-171
Identity = 336/473 (71.04%), Postives = 382/473 (80.76%), Query Frame = 0

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           M L+N++  A   +    +N +N         + IPF +  W  YAG+SC LVLFAGIMS
Sbjct: 1   MHLINAVAAARILSGIGQSNGNN-------GGEAIPFGSFEWITYAGISCFLVLFAGIMS 60

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSL LVELEILQR+G+  EKKQAA I PV+QKQHQLLVTLLLCNA AME LPIY
Sbjct: 61  GLTLGLMSLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIY 120

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDK+F+ +VA++LSVTFVL FGE+IPQAIC+RYGLAVGANF+WLVRILM +CYPI++P+G
Sbjct: 121 LDKLFNEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIG 180

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           K+LD +LGH+DALFRRAQLKALVSIH QEAGKGGELTHDETTIISGALDLTEKTA+ AMT
Sbjct: 181 KILDLVLGHNDALFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMT 240

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDVNS LDWEA+GKILARGHSRVPVYS NPKN+IGLLLVKSLLTVR ETET V
Sbjct: 241 PIESTFSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLV 300

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAV IRRIPRVP+DMPLYDILNEFQKGSSHMAAVVKVK K+K    V      E++   S
Sbjct: 301 SAVCIRRIPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSK----VPPSTLLEEHTDES 360

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420
             S L  PLL K + + + V V I+KA    + +   Q+N    +G  +  E IEDGEVI
Sbjct: 361 NDSDLTAPLLLKREGNHDNVIVTIDKA----NGQSFFQNNESGPHGFSHTSEAIEDGEVI 420

Query: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRL 474
           GIITLEDVFEELLQEEIVDETD YVDVHKRIRVAAAA  AASS+ARAPS+R+L
Sbjct: 421 GIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAA--AASSIARAPSSRKL 456

BLAST of CmoCh19G000510 vs. ExPASy Swiss-Prot
Match: Q4V3C7 (DUF21 domain-containing protein At4g14230 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF2 PE=2 SV=1)

HSP 1 Score: 573.2 bits (1476), Expect = 2.9e-162
Identity = 315/448 (70.31%), Postives = 362/448 (80.80%), Query Frame = 0

Query: 26  QVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMSGLTLGLMSLNLVELEILQRTGSSTE 85
           Q   ++++ IPF +  W  YAG+SC LVLFAGIMSGLTLGLMSL LVELEILQR+G+  E
Sbjct: 18  QSNALQSEAIPFGSLEWITYAGISCFLVLFAGIMSGLTLGLMSLGLVELEILQRSGTPKE 77

Query: 86  KKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIYLDKIFHPFVAVLLSVTFVLVFGEII 145
           KKQ+A I PV+QKQHQLLVTLLL NA AME LPIYLDKIF+ +VA++LSVTFVL  GE+I
Sbjct: 78  KKQSAAIFPVVQKQHQLLVTLLLFNALAMEGLPIYLDKIFNEYVAIILSVTFVLFVGEVI 137

Query: 146 PQAICSRYGLAVGANFIWLVRILMIICYPISYPVGKVLDALLGHHDALFRRAQLKALVSI 205
           PQAIC+RYGLAVGAN +WLVRILM++ YPIS+P+ K+LD +LGH+D LFRRAQLKALVSI
Sbjct: 138 PQAICTRYGLAVGANLVWLVRILMVLSYPISFPIAKMLDWVLGHNDPLFRRAQLKALVSI 197

Query: 206 HGQEAGKGGELTHDETTIISGALDLTEKTAEAAMTPIESTFSLDVNSNLDWEAIGKILAR 265
           HG+ AGKGGELTHDETTIISGALDLTEKTA+ AMTPIESTFSLDVNS LD EA+ KI AR
Sbjct: 198 HGEAAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFSLDVNSKLDREAMDKIQAR 257

Query: 266 GHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPVSAVSIRRIPRVPSDMPLYDILNEFQ 325
           GHSRVPVYS+NPKN+IGLLLVKSLLTVR ET T VSAV IRRIPRVP++MPLYDILNEFQ
Sbjct: 258 GHSRVPVYSDNPKNVIGLLLVKSLLTVRPETGTLVSAVGIRRIPRVPANMPLYDILNEFQ 317

Query: 326 KGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTSKISPLFIPLLSKHDNDLERVDVDIE 385
           KGSSHMAAVVKVK K+K        +   ++  +S  S L  PLL K + + + V V I+
Sbjct: 318 KGSSHMAAVVKVKGKSKGHPSTLHEENSGESNVSSNNSELTAPLLLKREGNHDSVIVRID 377

Query: 386 KAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVIGIITLEDVFEELLQEEIVDETDVYV 445
           KA   + I +          G  +  E+IEDG+VIGIITLEDVFEELLQEEIVDETD Y+
Sbjct: 378 KANGQSFISE------AGRQGFSHTSEEIEDGDVIGIITLEDVFEELLQEEIVDETDEYI 437

Query: 446 DVHKRIRVAAAAAVAASSVARAPSTRRL 474
           DVHKRIRVA  AAVA SS+ARAPS RRL
Sbjct: 438 DVHKRIRVATVAAVAISSLARAPSGRRL 459

BLAST of CmoCh19G000510 vs. ExPASy Swiss-Prot
Match: Q9LTD8 (DUF21 domain-containing protein At5g52790 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF5 PE=2 SV=2)

HSP 1 Score: 375.9 bits (964), Expect = 6.7e-103
Identity = 212/425 (49.88%), Postives = 289/425 (68.00%), Query Frame = 0

Query: 32  ADDIPFPTAWWFVYAGLSCLLVLFAGIMSGLTLGLMSLNLVELEILQRTGSSTEKKQAAT 91
           A+D+P     ++VY  +   LV+FAG+MSGLTLGLMSL++VELE++ + G   ++K A  
Sbjct: 3   ANDVPCCETMFWVYLLVCVALVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNAEK 62

Query: 92  IIPVLQKQHQLLVTLLLCNACAMEALPIYLDKIFHPFVAVLLSVTFVLVFGEIIPQAICS 151
           I+P+++ QH LL TLL+ NA AMEALPI++D +   + A+L+SVT +L FGEIIPQA+CS
Sbjct: 63  ILPLVKNQHLLLCTLLIGNALAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAVCS 122

Query: 152 RYGLAVGANFIWLVRILMIICYPISYPVGKVLDALLG-HHDALFRRAQLKALVSIHGQEA 211
           RYGL++GA   +LVR+++I+ +P+SYP+ K+LD LLG  H  L  RA+LK+LV +HG EA
Sbjct: 123 RYGLSIGAKLSFLVRLIIIVFFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGNEA 182

Query: 212 GKGGELTHDETTIISGALDLTEKTAEAAMTPIESTFSLDVNSNLDWEAIGKILARGHSRV 271
           GKGGELTHDETTIISGALD+++K+A+ AMTP+   FSLD+N  LD + +G I + GHSR+
Sbjct: 183 GKGGELTHDETTIISGALDMSQKSAKDAMTPVSQIFSLDINFKLDEKTMGLIASAGHSRI 242

Query: 272 PVYSENPKNIIGLLLVKSLLTVRAETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSH 331
           P+YS NP  IIG +LVK+L+ VR E ET +  + IRR+P+V  ++PLYDILN FQ G SH
Sbjct: 243 PIYSVNPNVIIGFILVKNLIKVRPEDETSIRDLPIRRMPKVDLNLPLYDILNIFQTGRSH 302

Query: 332 MAAVVKVKEKNKNSVFVSDGDKHEK--NKFTSKISPLFIPLLSKHDNDLERVDVDIEKAP 391
           MAAVV  K     +  V     HEK  N   +K + +F+ +                  P
Sbjct: 303 MAAVVGTKNHTNTNTPV-----HEKSINGSPNKDANVFLSI------------------P 362

Query: 392 RNTDIRQTMQHNVVATNGVCNLFEDIEDGEVIGIITLEDVFEELLQEEIVDETDVYVDVH 451
                  + Q  +   + + +     ED EVIGIITLEDV EEL+QEEI DETD YV++H
Sbjct: 363 ALNSSETSHQSPIRYIDSISD-----EDEEVIGIITLEDVMEELIQEEIYDETDQYVELH 399

Query: 452 KRIRV 454
           KRI +
Sbjct: 423 KRITI 399

BLAST of CmoCh19G000510 vs. ExPASy Swiss-Prot
Match: Q9ZQR4 (DUF21 domain-containing protein At2g14520 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF3 PE=2 SV=2)

HSP 1 Score: 375.2 bits (962), Expect = 1.1e-102
Identity = 218/427 (51.05%), Postives = 287/427 (67.21%), Query Frame = 0

Query: 42  WFVYAGLSCLLVLFAGIMSGLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQ 101
           +F++  +  LLVLFAG+MSGLTLGLMS++LV+LE+L ++G+  ++  AA I+PV++ QH 
Sbjct: 12  FFIHIAVIVLLVLFAGLMSGLTLGLMSMSLVDLEVLAKSGTPRDRIHAAKILPVVKNQHL 71

Query: 102 LLVTLLLCNACAMEALPIYLDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANF 161
           LL TLL+CNA AMEALPI+LD +   + A+L+SVT +L+FGEIIPQ++CSR+GLA+GA  
Sbjct: 72  LLCTLLICNAAAMEALPIFLDALVTAWGAILISVTLILLFGEIIPQSVCSRHGLAIGATV 131

Query: 162 IWLVRILMIICYPISYPVGKVLDALLGH-HDALFRRAQLKALVSIHGQEAGKGGELTHDE 221
              VR+L+ IC P+++P+ K+LD LLGH   ALFRRA+LK LV +HG EAGKGGELTHDE
Sbjct: 132 APFVRVLVWICLPVAWPISKLLDFLLGHGRVALFRRAELKTLVDLHGNEAGKGGELTHDE 191

Query: 222 TTIISGALDLTEKTAEAAMTPIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNI 281
           TTII+GAL+L+EK A+ AMTPI  TF +D+N+ LD + +  IL +GHSRVPVY E   NI
Sbjct: 192 TTIIAGALELSEKMAKDAMTPISDTFVIDINAKLDRDLMNLILDKGHSRVPVYYEQRTNI 251

Query: 282 IGLLLVKSLLTVRAETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEK 341
           IGL+LVK+LLT+  + E  V  V+IRRIPRVP  +PLYDILNEFQKG SHMA VV+    
Sbjct: 252 IGLVLVKNLLTINPDEEIQVKNVTIRRIPRVPETLPLYDILNEFQKGHSHMAVVVRQ--- 311

Query: 342 NKNSVFVSDGDKHEKNKFTSKISPLFIPLLSKHDNDLERVDVDIEKAPRNTDI--RQTMQ 401
                               KI PL     +    +  RVDVD E++P+ T +  R+++Q
Sbjct: 312 ------------------CDKIHPLQSNDAANETVNEVRVDVDYERSPQETKLKRRRSLQ 371

Query: 402 HNVVATNGVCNLFE-----------DI------------EDGEVIGIITLEDVFEELLQE 443
                 N   +L             DI            E+ + +GIIT+EDV EELLQE
Sbjct: 372 KWKSFPNRANSLGSRSKRWSKDNDADILQLNEHPLPKLDEEEDAVGIITMEDVIEELLQE 417

BLAST of CmoCh19G000510 vs. ExPASy TrEMBL
Match: A0A6J1HI72 (putative DUF21 domain-containing protein At1g03270 OS=Cucurbita moschata OX=3662 GN=LOC111463815 PE=4 SV=1)

HSP 1 Score: 924.9 bits (2389), Expect = 1.4e-265
Identity = 485/485 (100.00%), Postives = 485/485 (100.00%), Query Frame = 0

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS
Sbjct: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY
Sbjct: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG
Sbjct: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT
Sbjct: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV
Sbjct: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS
Sbjct: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420
           KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI
Sbjct: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420

Query: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRLPLDRSKV 480
           GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRLPLDRSKV
Sbjct: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRLPLDRSKV 480

Query: 481 HRSSK 486
           HRSSK
Sbjct: 481 HRSSK 485

BLAST of CmoCh19G000510 vs. ExPASy TrEMBL
Match: A0A6J1HSD5 (putative DUF21 domain-containing protein At1g03270 OS=Cucurbita maxima OX=3661 GN=LOC111467005 PE=4 SV=1)

HSP 1 Score: 912.5 bits (2357), Expect = 7.3e-262
Identity = 478/485 (98.56%), Postives = 481/485 (99.18%), Query Frame = 0

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS
Sbjct: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSLNLVELEILQRTGSSTEKKQAA IIPVLQKQHQLLVTLLLCNACAMEALPIY
Sbjct: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAAAIIPVLQKQHQLLVTLLLCNACAMEALPIY 120

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG
Sbjct: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT
Sbjct: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV
Sbjct: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEK KFTS
Sbjct: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKKKFTS 360

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420
           KISPLF PLLS+HDNDL+ VDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI
Sbjct: 361 KISPLFTPLLSEHDNDLDSVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420

Query: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRLPLDRSKV 480
           GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRLPLDRSKV
Sbjct: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRLPLDRSKV 480

Query: 481 HRSSK 486
           HR+SK
Sbjct: 481 HRNSK 485

BLAST of CmoCh19G000510 vs. ExPASy TrEMBL
Match: A0A6J1DS73 (putative DUF21 domain-containing protein At1g03270 OS=Momordica charantia OX=3673 GN=LOC111022702 PE=4 SV=1)

HSP 1 Score: 762.3 bits (1967), Expect = 1.2e-216
Identity = 412/485 (84.95%), Postives = 435/485 (89.69%), Query Frame = 0

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           ML L++LT   +A          + QV V   DDIPFPTA WF+YAGLSC+LVLFAGIMS
Sbjct: 1   MLFLDTLTTLASA---------RNKQVGVGVGDDIPFPTACWFLYAGLSCVLVLFAGIMS 60

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSLNLV+LEILQRTG+ST+K  AATIIPVL KQHQLLVTLLLCNACAMEALPIY
Sbjct: 61  GLTLGLMSLNLVDLEILQRTGTSTQKNHAATIIPVLHKQHQLLVTLLLCNACAMEALPIY 120

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDKIFHPFVAV LSVTFVL FGEIIPQAICSRYGLAVGANF+WLVRILMIICYPISYPVG
Sbjct: 121 LDKIFHPFVAVFLSVTFVLAFGEIIPQAICSRYGLAVGANFVWLVRILMIICYPISYPVG 180

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           KVLDA+LGH DALFRRAQLKALVSIHGQEAGKGGELTH+ETTIISGALDLTEKTAEAAMT
Sbjct: 181 KVLDAVLGHDDALFRRAQLKALVSIHGQEAGKGGELTHNETTIISGALDLTEKTAEAAMT 240

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDVNS LDWEAIGKILARGHSRVPVYS N KNIIGLLLVKSLLTVRAETETPV
Sbjct: 241 PIESTFSLDVNSKLDWEAIGKILARGHSRVPVYSGNSKNIIGLLLVKSLLTVRAETETPV 300

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNS+  +DG+K+E+ K  S
Sbjct: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSMLTNDGEKYEEEKIIS 360

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420
            ISPL IPLL+KHDNDLE V VDIEKAP NT  +QTM+ NV A NGVCNL E+IED EV+
Sbjct: 361 GISPLAIPLLTKHDNDLESVYVDIEKAPINTGHKQTMKPNVAAMNGVCNLLEEIEDEEVV 420

Query: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRLPLDRSKV 480
           GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAA+SVA APS RRL LDRSKV
Sbjct: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAATSVAWAPSARRLTLDRSKV 476

Query: 481 HRSSK 486
           H SSK
Sbjct: 481 HWSSK 476

BLAST of CmoCh19G000510 vs. ExPASy TrEMBL
Match: A0A6P9E664 (putative DUF21 domain-containing protein At1g03270 isoform X2 OS=Juglans regia OX=51240 GN=LOC109015012 PE=4 SV=1)

HSP 1 Score: 694.1 bits (1790), Expect = 4.1e-196
Identity = 378/474 (79.75%), Postives = 410/474 (86.50%), Query Frame = 0

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           ML LN+LTLA T    FF+N+       V EA+DI F T WWFVYAG SCLLVLFAGIMS
Sbjct: 1   MLWLNALTLART---MFFSND------IVFEAEDIRFGTPWWFVYAGASCLLVLFAGIMS 60

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSL LVELEILQR+G+ TEKKQAA I+PV+QKQHQLLVTLLLCNACAMEALP+Y
Sbjct: 61  GLTLGLMSLGLVELEILQRSGTPTEKKQAAVILPVVQKQHQLLVTLLLCNACAMEALPLY 120

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDKIFHPFVAVLLSVTFVL FGEIIPQAICSRYGL+VGANF+WLVRILMIICYPI+YP+G
Sbjct: 121 LDKIFHPFVAVLLSVTFVLAFGEIIPQAICSRYGLSVGANFVWLVRILMIICYPIAYPIG 180

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           KVLDA+LGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLT KTAE AMT
Sbjct: 181 KVLDAVLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTGKTAEEAMT 240

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDVNS LDWEAIGKILARGHSRVPVYS N KNIIGLLLVKSLLTVRAETETPV
Sbjct: 241 PIESTFSLDVNSKLDWEAIGKILARGHSRVPVYSGNQKNIIGLLLVKSLLTVRAETETPV 300

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVK K+KN    ++G+K ++ K  +
Sbjct: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKGKSKNPQSTAEGEKFKEGKVAN 360

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRN-TDIRQTMQHNVVATNGVCNLFEDIEDGEV 420
           + S L  PLL+ +D    RV +DIEK PR  T+  +   +   ATN +C+L EDIEDGEV
Sbjct: 361 RNSQLTTPLLTDNDEKSGRVIIDIEKHPRPITNEHKVRPNGTAATNNLCHLSEDIEDGEV 420

Query: 421 IGIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRL 474
           IGIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAA   SSV R+PS R+L
Sbjct: 421 IGIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAAVYSSVMRSPSNRKL 465

BLAST of CmoCh19G000510 vs. ExPASy TrEMBL
Match: W9QKK0 (Putative DUF21 domain-containing protein OS=Morus notabilis OX=981085 GN=L484_025039 PE=4 SV=1)

HSP 1 Score: 691.8 bits (1784), Expect = 2.0e-195
Identity = 379/473 (80.13%), Postives = 410/473 (86.68%), Query Frame = 0

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           +LLLN+LTLA T      +         V EA+DI F   WWFV+AG+SCLLVLFAGIMS
Sbjct: 6   VLLLNALTLARTMTVSTSD--------LVFEAEDIEFGQPWWFVFAGVSCLLVLFAGIMS 65

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSLNLVELEILQR+G+STEKKQAATI+PV+QKQHQLLVTLLLCNACAMEALPIY
Sbjct: 66  GLTLGLMSLNLVELEILQRSGTSTEKKQAATILPVVQKQHQLLVTLLLCNACAMEALPIY 125

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDKIFHPFVAVLLSVTFVL FGEIIPQAICSRYGLAVGANF+WLVRILMIICYPISYP+G
Sbjct: 126 LDKIFHPFVAVLLSVTFVLAFGEIIPQAICSRYGLAVGANFVWLVRILMIICYPISYPIG 185

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           KVLDA+LGH++ LFRRAQLKALVSIH QEAGKGGELTHDETTIISGALDLTEKTAE AMT
Sbjct: 186 KVLDAVLGHNEVLFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAEEAMT 245

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDV+S LDWEAIGKILARGHSRVPV+S NPKNIIGLLLVKSLLTVRAETETPV
Sbjct: 246 PIESTFSLDVSSKLDWEAIGKILARGHSRVPVFSGNPKNIIGLLLVKSLLTVRAETETPV 305

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAVSIRR+PRVP+DMPLYDILNEFQKGSSHMAAVVK+K K+K      DG+K E++ FT+
Sbjct: 306 SAVSIRRMPRVPADMPLYDILNEFQKGSSHMAAVVKIKGKSKIPQPALDGEKCEEDTFTN 365

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420
             S L  PLL+KHD+    + +D+EKA R     QT+Q N V TNG     EDIEDGEVI
Sbjct: 366 AKSQLTTPLLTKHDDKSGSIVIDVEKASRPLTNMQTLQQNGVTTNGFPYSSEDIEDGEVI 425

Query: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRL 474
           GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAA  AASSVARAPS RRL
Sbjct: 426 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAA--AASSVARAPSNRRL 468

BLAST of CmoCh19G000510 vs. TAIR 10
Match: AT1G03270.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 621.3 bits (1601), Expect = 6.5e-178
Identity = 341/463 (73.65%), Postives = 383/463 (82.72%), Query Frame = 0

Query: 8   TLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMSGLTLGLM 67
           TLA   AA+  N+        V EA+DI F + WWFV  G++C LVLFAGIMSGLTLGLM
Sbjct: 6   TLALVRAAYSLNSF-------VFEAEDIRFGSPWWFVVVGVACFLVLFAGIMSGLTLGLM 65

Query: 68  SLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIYLDKIFHP 127
           SL LVELEILQ++GSS EKKQAA I+PV++KQHQLLVTLLLCNA AMEALPI LDKIFHP
Sbjct: 66  SLGLVELEILQQSGSSAEKKQAAAILPVVKKQHQLLVTLLLCNAAAMEALPICLDKIFHP 125

Query: 128 FVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVGKVLDALL 187
           FVAVLLSVTFVL FGEIIPQAICSRYGLAVGANF+WLVRILMIICYPI+YP+GKVLDA++
Sbjct: 126 FVAVLLSVTFVLAFGEIIPQAICSRYGLAVGANFLWLVRILMIICYPIAYPIGKVLDAVI 185

Query: 188 GHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMTPIESTFS 247
           GH+D LFRRAQLKALVSIH QEAGKGGELTH+ET IISGALDL++KTAE AMTPIESTFS
Sbjct: 186 GHNDTLFRRAQLKALVSIHSQEAGKGGELTHEETMIISGALDLSQKTAEEAMTPIESTFS 245

Query: 248 LDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPVSAVSIRR 307
           LDVN+ LDWE IGKIL+RGHSR+PVY  NPKNIIGLLLVKSLLTVRAETE PVS+VSIR+
Sbjct: 246 LDVNTKLDWETIGKILSRGHSRIPVYLGNPKNIIGLLLVKSLLTVRAETEAPVSSVSIRK 305

Query: 308 IPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNK--NSVFVSDGDKHEKNKFTSKISPL 367
           IPRVPSDMPLYDILNEFQKGSSHMAAVVKVK+K+K  N   +S+G+  ++N    + S L
Sbjct: 306 IPRVPSDMPLYDILNEFQKGSSHMAAVVKVKDKDKKNNMQLLSNGETPKENMKFYQSSNL 365

Query: 368 FIPLLSKHDNDLERVDVDIEKAPRNTDIR-QTMQHNVVATNGVCNLFEDIEDGEVIGIIT 427
             PLL    +D   V VDI+K P++   R +  Q N   T  +  L ED ED EVIGIIT
Sbjct: 366 TAPLLKHESHD---VVVDIDKVPKHVKNRGRNFQQNGTVTRDLPCLLEDNEDAEVIGIIT 425

Query: 428 LEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARA 468
           LEDVFEELLQ EIVDETDVY+DVHKR+RVAAAAA A SS+ RA
Sbjct: 426 LEDVFEELLQAEIVDETDVYIDVHKRVRVAAAAAAAVSSITRA 458

BLAST of CmoCh19G000510 vs. TAIR 10
Match: AT4G14240.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 601.3 bits (1549), Expect = 7.0e-172
Identity = 336/473 (71.04%), Postives = 382/473 (80.76%), Query Frame = 0

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           M L+N++  A   +    +N +N         + IPF +  W  YAG+SC LVLFAGIMS
Sbjct: 1   MHLINAVAAARILSGIGQSNGNN-------GGEAIPFGSFEWITYAGISCFLVLFAGIMS 60

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSL LVELEILQR+G+  EKKQAA I PV+QKQHQLLVTLLLCNA AME LPIY
Sbjct: 61  GLTLGLMSLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIY 120

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDK+F+ +VA++LSVTFVL FGE+IPQAIC+RYGLAVGANF+WLVRILM +CYPI++P+G
Sbjct: 121 LDKLFNEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIG 180

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           K+LD +LGH+DALFRRAQLKALVSIH QEAGKGGELTHDETTIISGALDLTEKTA+ AMT
Sbjct: 181 KILDLVLGHNDALFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMT 240

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDVNS LDWEA+GKILARGHSRVPVYS NPKN+IGLLLVKSLLTVR ETET V
Sbjct: 241 PIESTFSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLV 300

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAV IRRIPRVP+DMPLYDILNEFQKGSSHMAAVVKVK K+K    V      E++   S
Sbjct: 301 SAVCIRRIPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSK----VPPSTLLEEHTDES 360

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420
             S L  PLL K + + + V V I+KA    + +   Q+N    +G  +  E IEDGEVI
Sbjct: 361 NDSDLTAPLLLKREGNHDNVIVTIDKA----NGQSFFQNNESGPHGFSHTSEAIEDGEVI 420

Query: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRL 474
           GIITLEDVFEELLQEEIVDETD YVDVHKRIRVAAAA  AASS+ARAPS+R+L
Sbjct: 421 GIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAA--AASSIARAPSSRKL 456

BLAST of CmoCh19G000510 vs. TAIR 10
Match: AT4G14240.2 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 582.0 bits (1499), Expect = 4.4e-166
Identity = 330/473 (69.77%), Postives = 375/473 (79.28%), Query Frame = 0

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           M L+N++  A   +    +N +N         + IPF +  W  YAG+SC LVLFAGIMS
Sbjct: 1   MHLINAVAAARILSGIGQSNGNN-------GGEAIPFGSFEWITYAGISCFLVLFAGIMS 60

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSL LVELEILQR         +A I PV+QKQHQLLVTLLLCNA AME LPIY
Sbjct: 61  GLTLGLMSLGLVELEILQR---------SAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIY 120

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDK+F+ +VA++LSVTFVL FGE+IPQAIC+RYGLAVGANF+WLVRILM +CYPI++P+G
Sbjct: 121 LDKLFNEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIG 180

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           K+LD +LGH+DALFRRAQLKALVSIH QEAGKGGELTHDETTIISGALDLTEKTA+ AMT
Sbjct: 181 KILDLVLGHNDALFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMT 240

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDVNS LDWEA+GKILARGHSRVPVYS NPKN+IGLLLVKSLLTVR ETET V
Sbjct: 241 PIESTFSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLV 300

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAV IRRIPRVP+DMPLYDILNEFQKGSSHMAAVVKVK K+K    V      E++   S
Sbjct: 301 SAVCIRRIPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSK----VPPSTLLEEHTDES 360

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420
             S L  PLL K + + + V V I+KA    + +   Q+N    +G  +  E IEDGEVI
Sbjct: 361 NDSDLTAPLLLKREGNHDNVIVTIDKA----NGQSFFQNNESGPHGFSHTSEAIEDGEVI 420

Query: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRL 474
           GIITLEDVFEELLQEEIVDETD YVDVHKRIRVAAAA  AASS+ARAPS+R+L
Sbjct: 421 GIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAA--AASSIARAPSSRKL 447

BLAST of CmoCh19G000510 vs. TAIR 10
Match: AT4G14230.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 573.2 bits (1476), Expect = 2.0e-163
Identity = 315/448 (70.31%), Postives = 362/448 (80.80%), Query Frame = 0

Query: 26  QVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMSGLTLGLMSLNLVELEILQRTGSSTE 85
           Q   ++++ IPF +  W  YAG+SC LVLFAGIMSGLTLGLMSL LVELEILQR+G+  E
Sbjct: 18  QSNALQSEAIPFGSLEWITYAGISCFLVLFAGIMSGLTLGLMSLGLVELEILQRSGTPKE 77

Query: 86  KKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIYLDKIFHPFVAVLLSVTFVLVFGEII 145
           KKQ+A I PV+QKQHQLLVTLLL NA AME LPIYLDKIF+ +VA++LSVTFVL  GE+I
Sbjct: 78  KKQSAAIFPVVQKQHQLLVTLLLFNALAMEGLPIYLDKIFNEYVAIILSVTFVLFVGEVI 137

Query: 146 PQAICSRYGLAVGANFIWLVRILMIICYPISYPVGKVLDALLGHHDALFRRAQLKALVSI 205
           PQAIC+RYGLAVGAN +WLVRILM++ YPIS+P+ K+LD +LGH+D LFRRAQLKALVSI
Sbjct: 138 PQAICTRYGLAVGANLVWLVRILMVLSYPISFPIAKMLDWVLGHNDPLFRRAQLKALVSI 197

Query: 206 HGQEAGKGGELTHDETTIISGALDLTEKTAEAAMTPIESTFSLDVNSNLDWEAIGKILAR 265
           HG+ AGKGGELTHDETTIISGALDLTEKTA+ AMTPIESTFSLDVNS LD EA+ KI AR
Sbjct: 198 HGEAAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFSLDVNSKLDREAMDKIQAR 257

Query: 266 GHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPVSAVSIRRIPRVPSDMPLYDILNEFQ 325
           GHSRVPVYS+NPKN+IGLLLVKSLLTVR ET T VSAV IRRIPRVP++MPLYDILNEFQ
Sbjct: 258 GHSRVPVYSDNPKNVIGLLLVKSLLTVRPETGTLVSAVGIRRIPRVPANMPLYDILNEFQ 317

Query: 326 KGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTSKISPLFIPLLSKHDNDLERVDVDIE 385
           KGSSHMAAVVKVK K+K        +   ++  +S  S L  PLL K + + + V V I+
Sbjct: 318 KGSSHMAAVVKVKGKSKGHPSTLHEENSGESNVSSNNSELTAPLLLKREGNHDSVIVRID 377

Query: 386 KAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVIGIITLEDVFEELLQEEIVDETDVYV 445
           KA   + I +          G  +  E+IEDG+VIGIITLEDVFEELLQEEIVDETD Y+
Sbjct: 378 KANGQSFISE------AGRQGFSHTSEEIEDGDVIGIITLEDVFEELLQEEIVDETDEYI 437

Query: 446 DVHKRIRVAAAAAVAASSVARAPSTRRL 474
           DVHKRIRVA  AAVA SS+ARAPS RRL
Sbjct: 438 DVHKRIRVATVAAVAISSLARAPSGRRL 459

BLAST of CmoCh19G000510 vs. TAIR 10
Match: AT5G52790.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 375.9 bits (964), Expect = 4.8e-104
Identity = 212/425 (49.88%), Postives = 289/425 (68.00%), Query Frame = 0

Query: 32  ADDIPFPTAWWFVYAGLSCLLVLFAGIMSGLTLGLMSLNLVELEILQRTGSSTEKKQAAT 91
           A+D+P     ++VY  +   LV+FAG+MSGLTLGLMSL++VELE++ + G   ++K A  
Sbjct: 3   ANDVPCCETMFWVYLLVCVALVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNAEK 62

Query: 92  IIPVLQKQHQLLVTLLLCNACAMEALPIYLDKIFHPFVAVLLSVTFVLVFGEIIPQAICS 151
           I+P+++ QH LL TLL+ NA AMEALPI++D +   + A+L+SVT +L FGEIIPQA+CS
Sbjct: 63  ILPLVKNQHLLLCTLLIGNALAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAVCS 122

Query: 152 RYGLAVGANFIWLVRILMIICYPISYPVGKVLDALLG-HHDALFRRAQLKALVSIHGQEA 211
           RYGL++GA   +LVR+++I+ +P+SYP+ K+LD LLG  H  L  RA+LK+LV +HG EA
Sbjct: 123 RYGLSIGAKLSFLVRLIIIVFFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGNEA 182

Query: 212 GKGGELTHDETTIISGALDLTEKTAEAAMTPIESTFSLDVNSNLDWEAIGKILARGHSRV 271
           GKGGELTHDETTIISGALD+++K+A+ AMTP+   FSLD+N  LD + +G I + GHSR+
Sbjct: 183 GKGGELTHDETTIISGALDMSQKSAKDAMTPVSQIFSLDINFKLDEKTMGLIASAGHSRI 242

Query: 272 PVYSENPKNIIGLLLVKSLLTVRAETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSH 331
           P+YS NP  IIG +LVK+L+ VR E ET +  + IRR+P+V  ++PLYDILN FQ G SH
Sbjct: 243 PIYSVNPNVIIGFILVKNLIKVRPEDETSIRDLPIRRMPKVDLNLPLYDILNIFQTGRSH 302

Query: 332 MAAVVKVKEKNKNSVFVSDGDKHEK--NKFTSKISPLFIPLLSKHDNDLERVDVDIEKAP 391
           MAAVV  K     +  V     HEK  N   +K + +F+ +                  P
Sbjct: 303 MAAVVGTKNHTNTNTPV-----HEKSINGSPNKDANVFLSI------------------P 362

Query: 392 RNTDIRQTMQHNVVATNGVCNLFEDIEDGEVIGIITLEDVFEELLQEEIVDETDVYVDVH 451
                  + Q  +   + + +     ED EVIGIITLEDV EEL+QEEI DETD YV++H
Sbjct: 363 ALNSSETSHQSPIRYIDSISD-----EDEEVIGIITLEDVMEELIQEEIYDETDQYVELH 399

Query: 452 KRIRV 454
           KRI +
Sbjct: 423 KRITI 399

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9ZVS89.1e-17773.65Putative DUF21 domain-containing protein At1g03270 OS=Arabidopsis thaliana OX=37... [more]
Q67XQ09.8e-17171.04DUF21 domain-containing protein At4g14240 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Q4V3C72.9e-16270.31DUF21 domain-containing protein At4g14230 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Q9LTD86.7e-10349.88DUF21 domain-containing protein At5g52790 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Q9ZQR41.1e-10251.05DUF21 domain-containing protein At2g14520 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Match NameE-valueIdentityDescription
A0A6J1HI721.4e-265100.00putative DUF21 domain-containing protein At1g03270 OS=Cucurbita moschata OX=3662... [more]
A0A6J1HSD57.3e-26298.56putative DUF21 domain-containing protein At1g03270 OS=Cucurbita maxima OX=3661 G... [more]
A0A6J1DS731.2e-21684.95putative DUF21 domain-containing protein At1g03270 OS=Momordica charantia OX=367... [more]
A0A6P9E6644.1e-19679.75putative DUF21 domain-containing protein At1g03270 isoform X2 OS=Juglans regia O... [more]
W9QKK02.0e-19580.13Putative DUF21 domain-containing protein OS=Morus notabilis OX=981085 GN=L484_02... [more]
Match NameE-valueIdentityDescription
AT1G03270.16.5e-17873.65CBS domain-containing protein with a domain of unknown function (DUF21) [more]
AT4G14240.17.0e-17271.04CBS domain-containing protein with a domain of unknown function (DUF21) [more]
AT4G14240.24.4e-16669.77CBS domain-containing protein with a domain of unknown function (DUF21) [more]
AT4G14230.12.0e-16370.31CBS domain-containing protein with a domain of unknown function (DUF21) [more]
AT5G52790.14.8e-10449.88CBS domain-containing protein with a domain of unknown function (DUF21) [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D3.10.580.10coord: 220..352
e-value: 1.7E-43
score: 150.1
NoneNo IPR availableGENE3D3.10.580.10coord: 384..440
e-value: 1.3E-6
score: 30.4
NoneNo IPR availablePANTHERPTHR12064:SF69BNAC05G01850D PROTEINcoord: 1..479
NoneNo IPR availableSUPERFAMILY54631CBS-domain paircoord: 224..434
IPR002550CNNM, transmembrane domainPFAMPF01595DUF21coord: 48..219
e-value: 4.2E-37
score: 127.5
IPR002550CNNM, transmembrane domainPROSITEPS51846CNNMcoord: 38..220
score: 49.011486
IPR045095Ancient conserved domain protein familyPANTHERPTHR12064ANCIENT CONSERVED DOMAIN PROTEIN-RELATEDcoord: 1..479
IPR000644CBS domainPROSITEPS51371CBScoord: 239..300
score: 8.690958
IPR044751Ion transporter-like, CBS domainCDDcd04590CBS_pair_CorC_HlyC_assoccoord: 234..335
e-value: 1.0927E-25
score: 99.4924

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh19G000510.1CmoCh19G000510.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010960 magnesium ion homeostasis
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0043130 ubiquitin binding