CmoCh19G000510 (gene) Cucurbita moschata (Rifu)

NameCmoCh19G000510
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionDUF21 domain-containing-like protein
LocationCmo_Chr19 : 293532 .. 299003 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTCGCTTTCTCGTTGCTCATACCGCCCCTTTCCCTTCTGTCTGCCGCCCCTTTTTCCGGCAATGCTGCTCCTTAATTCCTTGACATTGGCCACCACCGCCGCCGCCTTCTTCTTCAACAACAACCACAATCATTACCAGGTCCCTGTGGTGGAAGCCGATGATATTCCTTTTCCCACTGCATGGTGGTTCGTTTACGCTGGCCTCTCCTGTCTCCTCGTTTTGTTCGCTGGTATTATGTCTGGCCTCACCCTTGGCCTCATGTCTCTCAACCTCGTCGAGCTTGAAATCTTGCAGCGCACTGGTTCTTCCACCGAGAAAAAGCAAGCTGGTCCGCTGTCTTTCCACTCTCCCTCTGGTTCAATTCTTTTGTAATTATGCTCGTACTCGCTTTTCATTGCTTGATTTCTATTTGCGTAGCTACTATTATTCCGGTGCTGCAGAAGCAGCACCAATTGCTAGTTACGCTGCTGCTTTGTAATGCTTGTGCTATGGAGGTATTCGATTCTGCACTCTCAAATTCTGTTCTAATTACGACTTTTCTTTTTCTTCAATTCAATCTATGGATTCAATGCTGAATACTATTAATTTCTGGTTACTTGCATCCCGCTCTTATTATTTGGAGCGGTCAGATTTATGGCAAGTCGATAATTCATTGCTGACGTTCATTTCGTTTTTAAATTTTCTTGAAGGCTCTTCCTATATACCTTGATAAAATCTTCCATCCTTTTGTCGCGGTTTTGCTATCTGTGACCTTTGTTCTTGTCTTTGGCGAGGTATAACATTTTTCCTATTGACAATACGTTTCATTCTATAACAAGCCTATGCCATATTGCATCTAATTTTCATTATGTAGTAGATTATTCCGCAAGCAATATGCTCGAGATATGGACTTGCTGTTGGTGCAAATTTTATATGGCTAGTGCGTATTTTGATGATCATCTGTTATCCAATTTCATACCCGGTTGGAAAGGTAACCTCTGAAGTAGGCAGAAAGCTGCTGCTGCTGCTGCTGCTGCATTCCTTTTTTTCTCTTGCATATGAAATTTACTATGTGCATGATTGCTTGTAGATTGTACTCTATTTCCTTATTCCACTCTACTCTGATCATGCATTGACTCCAAAAAGGTCGTCTATTAACCTTTTGTTGTTTTCCCTCTAATTGAAGTCGTCTATTTACTTTTCTATAAATTTCATGATTGGTGATTATATTAGGGTAATCATGAAATCCAGAATTTGAGTAAGCCTTAAATCTGCTAGTTTGTCTCGTATGCGTGCTGCATTATTTGAGCTGAACCCATGGGTTGTAGCGACCTATGATTGAACTAAACAATAAAGCATCGACTTTAAGTCTTTACTCATGTATTTCTTCTTTGGCTTCTAATGTAGGCTTCACTTTTGCAGTGTCAAAAAGTTAACCTGGACCTTCTTCCATATTCTCAACCTTCTCTTGCAGTATATATCTCCTTAGTTCTACCTTTTCAAAGAGAAAAGGAATAACTGTACTTTTTTTTTTCTTTTTTTTGTCAATGAAAAGGAGTAATTTGTCTTCCTTTTTTCTATCTTGTTTAATTAACTGTTTTGCGAACTAGGTGATTGCATATTTGAGATTGAAGAGATGTACACTGGGTTATTTCACTTTCTTCATGTGGCCTTTGTTATTACTTACCGATATTCTGGCTCTAGGTCTTGGATGCATTACTTGGTCACCATGATGCTCTGTTTAGGAGAGCTCAGTTGAAAGCCCTTGTTTCTATCCATGGACAAGAGGTAACAAGAAGTATAATATAGTAGCATTTGAAGTTAATGAGATATATTCTGGTTTGGACAAAGTATGAAAACCGTGCTATAACATTGTGGAAGGGTGCAAGATTTGATTAAGAGCAAAATGTCAGGGATTTATTATGAAATGGTCCTTAAGTTACTTTTTCATTTAGGCTGGGAAGGGAGGTGAACTCACACACGATGAGACGACCATCATCAGTGGGGCATTGGACTTGACAGAAAAGGTATCTATTCTCATAAGATGAGCCTTGTAATCTTTATGCATATTATGATTTATCTACCATCTTTGTTTTATCATGCCATTTTGATTTAGGGAAACGCCTGCATTTTGGTTTGGTATAAGAAAATACTTGTTCTGAAAACACGTGTATAGAATCTTAATAAAGGGTTAGTCTAATATAACAAGTATTCATTTTTTTCCTCTAGTAAATTGCCTGCATTTTGGTTTGGTATAAGAAAATACTTGTTCTGAAAACACGTGTATAGAATCTTAATAAAGGGTTAGTCTAATATAACAAGTATTCAGTTTTTTCCTCTAGTAAATTGGGAGTTTCAAGGATTTTTGCTTGCAAAATCTACTGGATATGTGGATATTAGTTGCTTTTTATACCATCACCATCTGGTTGTGTCCCTAGTCATTATGTATTTATTCATATGAACAAAATTTTCCTTGTACTTTTCAATATAACTTTACAGCTAATATATATCATATATGTGCCTCCTTAACATATTAACCATTTTCTATTCTGTTTTTTCATTTTCAAAGTTCCAATTTAATTCCTCACGTATTGCTTTTTTTATTTTTCAATCAAGTCCTTTTTCTGTAAACACTAAAAAGAAAGGTGTGGTAATTAAGGATGGATTTTTTTATTATTTTTATTGAGACCTTCGGCTGTTAGTCTCACTCATAAGCTTGACTCCTTTAAGATTATGAGACATCTTTTCGATTTACTTTGTTGTAAAGAAAATTTACAGATCAAATGACTGAACTTTCTGAATGCAGACTGCAGAGGCGGCTATGACACCAATAGAATCAACATTTTCTTTGGATGTGAATTCCAACTTGGACTGGTTAGTTTGGCATCAAAATCATGATTAATTTATTTCTTGTTAAACCTTGCCTTTCTGTTCATTTATATTAATAATGCTGATGATTGCTAATATCTTTTCTGGCAGGTTTAAATTCCTTGAATCCTCTTCTCTATCTCTATACAATTACTTTCCAAGATTTTTCTCTAAAAACTTTATCTCTTCTTCTCTTCACTTTTATTTCCCTGCCTCCACATATGTTTCTCTCCTCTTTAAAAAGCGGAAAACAAAAAATGGTTTCATGGTTTTTGTTTTTACAAACAGAAAAACCAAAAACAGAAAACAGAAATGATTATCAAACTAAACTGGCACCCAAGTAATTGCAAACTGATTTTTACCCTATTTCTTGTACAGGGAAGCAATTGGAAAAATACTTGCACGTGGTCATAGTCGTGTCCCAGTCTATTCTGAGAATCCAAAGAATATAATTGGTCTCCTATTGGTATCAGTAGAATTCTTTGCAATTACATTTATGCCTTCTCGTTTTCAACCTCCATTTGACTTGTATGTTTCTTTTCAAATGCAATGGCAAGCTCTCAAGAGTAGGAAATAACATCGCTCATTTTCTTAAAAAGAAATTCCAATTTTCTTTTGATTGTTGAGCCCTGCACTATCTTTTTCAATTAGTTGCAAGTTTTTTTTTTCCCCCTGGATTCAGGTGAAAAGTCTTCTGACAGTAAGAGCAGAAACGGAAACTCCAGTCAGTGCTGTTTCCATACGGAGAATTCCTAGGTAATGAATGTTCTTCCCTCATATCTAATTAATTATAAATGACTTCACCTCTTAACATGTTATGCAGGGTTCCTTCAGATATGCCATTATATGATATCCTAAATGAGTTCCAAAAGGGAAGTAGTCATATGGCGGCTGTAGTCAAGGTCAAGGAGAAGAATAAGAACTCAGTGTTTGTCAGTGATGGAGATAAACATGAGAAAAATAAATTTACCTCTAAGATATCTCCGCTTTTCATTCCCTTGCTCTCAAAACATGACAATGATTTGGAACGTGTTGATGTTGACATTGAAAAAGCTCCAAGGAATACTGATATCAGGCAAACCATGCAGCATAATGTGGTTGCAACAAATGGAGTGTGCAATTTGTTTGAAGATATTGAAGATGGGGAAGTTATTGGGATAATCACTTTAGAGGATGTTTTTGAAGAACTTCTGCAAGTGAGCACAACTTCCACGATTTATATAGTAGAAAATAATTATTAATTTGTTCTATCATGACAGGTAGGATTAGGCAGCTGAACTATCAACTTAAATTTTAACTCCTGGTGACCAATCTTTTTGTGCAAATAAAATTCATTTACGTTTTATCTTTTCTGTGTTTCTTCATAAAAAATGCACGTATCCATGTTTCAACGTATATTGTAAAAATAGATAATTGGCTTCCACTGAATTGCCTCATAATGCAGGAGGAAATTGTAGATGAGACAGACGTATATGTGGATGTCCATAAAAGGTACATATTTTTTTTCTTTTAGCATCAGGCTTATTTGGCTGATTGTGTGCTTGAATGTAGAATGTTTCTCACGTTTGTAGGATACGTGTGGCTGCGGCTGCAGCTGTGGCTGCTTCATCTGTGGCCCGAGCTCCATCAACTCGTAGGTTGCCACTTGACAGGTCAAAAGTCCACCGTAAGTCAAAGTCCAATTCTCAAGTTGTTGAGTACTTACTTTTATGTAGCTATTAGCAGCATTTCTAAAAGCTAAAATTGATCCTACATCAAAATATTCTTTTCTTGGTTTGTTCTACTCTATTTGATGAGTCTTTTTGGCCAATTATATCCATTACGGCTGAGTTAAATTGCATAAGATTTTGGAAAATGAACGAACTCCCAAGGTGTAATTGCCGTATCAAGAATACAGGCAACAGGAAGTGCTTTGTTATGACGTGTTTGGGAATGATTTTGTTTTGTTTTTTTTTTGTTATTCATTTACATTCTTTAATAAACAGGTATTTTAGCGATTCTATGTTTATTTTATCCCATTTCCATGATAACTTTCCAAGATCACACTTTTGTTTCTCCTTCCAAAATAATAAGAAATAGAGGGAAAGGTAGCAGAAATGTTCCCCAACGGGCCTTTATTTGCACTACTTTTTGACTAATAAATGTCTCTAATATTGTGAGCGTGTAAGTTAAATAATTTGTAATTCACAGGGAGTTCGAAGTAGGGTGGTCAAGATCCAGGAAGGCTGCTGGAGATGTAGATTCGATGAGATTTTAAGGAATTGCGCTGGTACTTCTACCTAGTATTAAAAGATGAATCAACAAAGTATCAAACGGAATTCATTACGTGCTTCACCAAGCTTTAAACTCTCCAGCTTCCGTAGTATCGATCAGAAGATTTTTAATTTTAGCTTTAGAATAAATAACCATCCTAAGAAAGCTCTGTCGTCATTACTTCAAGCAAATCAATAACGAGAATAGTATGAAATTTTCTCTTTCTTGTACACAGTAGAAGATAATAGTAGCTGCTGATGTAGAGAACGAAAGAGCAATATAATATGACTCGCTTTTGTTAATTCATCTTCAATGGGTTTTTGTGGGGGGAGAGTTACTGTAAATATATGTTTAGATAATTTATTATTCAT

mRNA sequence

TTCGCTTTCTCGTTGCTCATACCGCCCCTTTCCCTTCTGTCTGCCGCCCCTTTTTCCGGCAATGCTGCTCCTTAATTCCTTGACATTGGCCACCACCGCCGCCGCCTTCTTCTTCAACAACAACCACAATCATTACCAGGTCCCTGTGGTGGAAGCCGATGATATTCCTTTTCCCACTGCATGGTGGTTCGTTTACGCTGGCCTCTCCTGTCTCCTCGTTTTGTTCGCTGGTATTATGTCTGGCCTCACCCTTGGCCTCATGTCTCTCAACCTCGTCGAGCTTGAAATCTTGCAGCGCACTGGTTCTTCCACCGAGAAAAAGCAAGCTGCTACTATTATTCCGGTGCTGCAGAAGCAGCACCAATTGCTAGTTACGCTGCTGCTTTGTAATGCTTGTGCTATGGAGGCTCTTCCTATATACCTTGATAAAATCTTCCATCCTTTTGTCGCGGTTTTGCTATCTGTGACCTTTGTTCTTGTCTTTGGCGAGATTATTCCGCAAGCAATATGCTCGAGATATGGACTTGCTGTTGGTGCAAATTTTATATGGCTAGTGCGTATTTTGATGATCATCTGTTATCCAATTTCATACCCGGTTGGAAAGGTCTTGGATGCATTACTTGGTCACCATGATGCTCTGTTTAGGAGAGCTCAGTTGAAAGCCCTTGTTTCTATCCATGGACAAGAGGCTGGGAAGGGAGGTGAACTCACACACGATGAGACGACCATCATCAGTGGGGCATTGGACTTGACAGAAAAGACTGCAGAGGCGGCTATGACACCAATAGAATCAACATTTTCTTTGGATGTGAATTCCAACTTGGACTGGGAAGCAATTGGAAAAATACTTGCACGTGGTCATAGTCGTGTCCCAGTCTATTCTGAGAATCCAAAGAATATAATTGGTCTCCTATTGGTGAAAAGTCTTCTGACAGTAAGAGCAGAAACGGAAACTCCAGTCAGTGCTGTTTCCATACGGAGAATTCCTAGGGTTCCTTCAGATATGCCATTATATGATATCCTAAATGAGTTCCAAAAGGGAAGTAGTCATATGGCGGCTGTAGTCAAGGTCAAGGAGAAGAATAAGAACTCAGTGTTTGTCAGTGATGGAGATAAACATGAGAAAAATAAATTTACCTCTAAGATATCTCCGCTTTTCATTCCCTTGCTCTCAAAACATGACAATGATTTGGAACGTGTTGATGTTGACATTGAAAAAGCTCCAAGGAATACTGATATCAGGCAAACCATGCAGCATAATGTGGTTGCAACAAATGGAGTGTGCAATTTGTTTGAAGATATTGAAGATGGGGAAGTTATTGGGATAATCACTTTAGAGGATGTTTTTGAAGAACTTCTGCAAGAGGAAATTGTAGATGAGACAGACGTATATGTGGATGTCCATAAAAGGATACGTGTGGCTGCGGCTGCAGCTGTGGCTGCTTCATCTGTGGCCCGAGCTCCATCAACTCGTAGGTTGCCACTTGACAGGTCAAAAGTCCACCGGAGTTCGAAGTAGGGTGGTCAAGATCCAGGAAGGCTGCTGGAGATGTAGATTCGATGAGATTTTAAGGAATTGCGCTGGTACTTCTACCTAGTATTAAAAGATGAATCAACAAAGTATCAAACGGAATTCATTACGTGCTTCACCAAGCTTTAAACTCTCCAGCTTCCGTAGTATCGATCAGAAGATTTTTAATTTTAGCTTTAGAATAAATAACCATCCTAAGAAAGCTCTGTCGTCATTACTTCAAGCAAATCAATAACGAGAATAGTATGAAATTTTCTCTTTCTTGTACACAGTAGAAGATAATAGTAGCTGCTGATGTAGAGAACGAAAGAGCAATATAATATGACTCGCTTTTGTTAATTCATCTTCAATGGGTTTTTGTGGGGGGAGAGTTACTGTAAATATATGTTTAGATAATTTATTATTCAT

Coding sequence (CDS)

ATGCTGCTCCTTAATTCCTTGACATTGGCCACCACCGCCGCCGCCTTCTTCTTCAACAACAACCACAATCATTACCAGGTCCCTGTGGTGGAAGCCGATGATATTCCTTTTCCCACTGCATGGTGGTTCGTTTACGCTGGCCTCTCCTGTCTCCTCGTTTTGTTCGCTGGTATTATGTCTGGCCTCACCCTTGGCCTCATGTCTCTCAACCTCGTCGAGCTTGAAATCTTGCAGCGCACTGGTTCTTCCACCGAGAAAAAGCAAGCTGCTACTATTATTCCGGTGCTGCAGAAGCAGCACCAATTGCTAGTTACGCTGCTGCTTTGTAATGCTTGTGCTATGGAGGCTCTTCCTATATACCTTGATAAAATCTTCCATCCTTTTGTCGCGGTTTTGCTATCTGTGACCTTTGTTCTTGTCTTTGGCGAGATTATTCCGCAAGCAATATGCTCGAGATATGGACTTGCTGTTGGTGCAAATTTTATATGGCTAGTGCGTATTTTGATGATCATCTGTTATCCAATTTCATACCCGGTTGGAAAGGTCTTGGATGCATTACTTGGTCACCATGATGCTCTGTTTAGGAGAGCTCAGTTGAAAGCCCTTGTTTCTATCCATGGACAAGAGGCTGGGAAGGGAGGTGAACTCACACACGATGAGACGACCATCATCAGTGGGGCATTGGACTTGACAGAAAAGACTGCAGAGGCGGCTATGACACCAATAGAATCAACATTTTCTTTGGATGTGAATTCCAACTTGGACTGGGAAGCAATTGGAAAAATACTTGCACGTGGTCATAGTCGTGTCCCAGTCTATTCTGAGAATCCAAAGAATATAATTGGTCTCCTATTGGTGAAAAGTCTTCTGACAGTAAGAGCAGAAACGGAAACTCCAGTCAGTGCTGTTTCCATACGGAGAATTCCTAGGGTTCCTTCAGATATGCCATTATATGATATCCTAAATGAGTTCCAAAAGGGAAGTAGTCATATGGCGGCTGTAGTCAAGGTCAAGGAGAAGAATAAGAACTCAGTGTTTGTCAGTGATGGAGATAAACATGAGAAAAATAAATTTACCTCTAAGATATCTCCGCTTTTCATTCCCTTGCTCTCAAAACATGACAATGATTTGGAACGTGTTGATGTTGACATTGAAAAAGCTCCAAGGAATACTGATATCAGGCAAACCATGCAGCATAATGTGGTTGCAACAAATGGAGTGTGCAATTTGTTTGAAGATATTGAAGATGGGGAAGTTATTGGGATAATCACTTTAGAGGATGTTTTTGAAGAACTTCTGCAAGAGGAAATTGTAGATGAGACAGACGTATATGTGGATGTCCATAAAAGGATACGTGTGGCTGCGGCTGCAGCTGTGGCTGCTTCATCTGTGGCCCGAGCTCCATCAACTCGTAGGTTGCCACTTGACAGGTCAAAAGTCCACCGGAGTTCGAAGTAG
BLAST of CmoCh19G000510 vs. Swiss-Prot
Match: Y1327_ARATH (Putative DUF21 domain-containing protein At1g03270 OS=Arabidopsis thaliana GN=CBSDUF4 PE=4 SV=2)

HSP 1 Score: 621.3 bits (1601), Expect = 8.8e-177
Identity = 341/463 (73.65%), Postives = 383/463 (82.72%), Query Frame = 1

Query: 8   TLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMSGLTLGLM 67
           TLA   AA+  N+        V EA+DI F + WWFV  G++C LVLFAGIMSGLTLGLM
Sbjct: 6   TLALVRAAYSLNSF-------VFEAEDIRFGSPWWFVVVGVACFLVLFAGIMSGLTLGLM 65

Query: 68  SLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIYLDKIFHP 127
           SL LVELEILQ++GSS EKKQAA I+PV++KQHQLLVTLLLCNA AMEALPI LDKIFHP
Sbjct: 66  SLGLVELEILQQSGSSAEKKQAAAILPVVKKQHQLLVTLLLCNAAAMEALPICLDKIFHP 125

Query: 128 FVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVGKVLDALL 187
           FVAVLLSVTFVL FGEIIPQAICSRYGLAVGANF+WLVRILMIICYPI+YP+GKVLDA++
Sbjct: 126 FVAVLLSVTFVLAFGEIIPQAICSRYGLAVGANFLWLVRILMIICYPIAYPIGKVLDAVI 185

Query: 188 GHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMTPIESTFS 247
           GH+D LFRRAQLKALVSIH QEAGKGGELTH+ET IISGALDL++KTAE AMTPIESTFS
Sbjct: 186 GHNDTLFRRAQLKALVSIHSQEAGKGGELTHEETMIISGALDLSQKTAEEAMTPIESTFS 245

Query: 248 LDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPVSAVSIRR 307
           LDVN+ LDWE IGKIL+RGHSR+PVY  NPKNIIGLLLVKSLLTVRAETE PVS+VSIR+
Sbjct: 246 LDVNTKLDWETIGKILSRGHSRIPVYLGNPKNIIGLLLVKSLLTVRAETEAPVSSVSIRK 305

Query: 308 IPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNK--NSVFVSDGDKHEKNKFTSKISPL 367
           IPRVPSDMPLYDILNEFQKGSSHMAAVVKVK+K+K  N   +S+G+  ++N    + S L
Sbjct: 306 IPRVPSDMPLYDILNEFQKGSSHMAAVVKVKDKDKKNNMQLLSNGETPKENMKFYQSSNL 365

Query: 368 FIPLLSKHDNDLERVDVDIEKAPRNTDIR-QTMQHNVVATNGVCNLFEDIEDGEVIGIIT 427
             PLL    +D   V VDI+K P++   R +  Q N   T  +  L ED ED EVIGIIT
Sbjct: 366 TAPLLKHESHD---VVVDIDKVPKHVKNRGRNFQQNGTVTRDLPCLLEDNEDAEVIGIIT 425

Query: 428 LEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARA 468
           LEDVFEELLQ EIVDETDVY+DVHKR+RVAAAAA A SS+ RA
Sbjct: 426 LEDVFEELLQAEIVDETDVYIDVHKRVRVAAAAAAAVSSITRA 458

BLAST of CmoCh19G000510 vs. Swiss-Prot
Match: Y4424_ARATH (DUF21 domain-containing protein At4g14240 OS=Arabidopsis thaliana GN=CBSDUF1 PE=1 SV=1)

HSP 1 Score: 600.9 bits (1548), Expect = 1.2e-170
Identity = 336/473 (71.04%), Postives = 382/473 (80.76%), Query Frame = 1

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           M L+N++  A   +    +N +N         + IPF +  W  YAG+SC LVLFAGIMS
Sbjct: 1   MHLINAVAAARILSGIGQSNGNNG-------GEAIPFGSFEWITYAGISCFLVLFAGIMS 60

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSL LVELEILQR+G+  EKKQAA I PV+QKQHQLLVTLLLCNA AME LPIY
Sbjct: 61  GLTLGLMSLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIY 120

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDK+F+ +VA++LSVTFVL FGE+IPQAIC+RYGLAVGANF+WLVRILM +CYPI++P+G
Sbjct: 121 LDKLFNEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIG 180

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           K+LD +LGH+DALFRRAQLKALVSIH QEAGKGGELTHDETTIISGALDLTEKTA+ AMT
Sbjct: 181 KILDLVLGHNDALFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMT 240

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDVNS LDWEA+GKILARGHSRVPVYS NPKN+IGLLLVKSLLTVR ETET V
Sbjct: 241 PIESTFSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLV 300

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAV IRRIPRVP+DMPLYDILNEFQKGSSHMAAVVKVK K+K    V      E++   S
Sbjct: 301 SAVCIRRIPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSK----VPPSTLLEEHTDES 360

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420
             S L  PLL K + + + V V I+KA    + +   Q+N    +G  +  E IEDGEVI
Sbjct: 361 NDSDLTAPLLLKREGNHDNVIVTIDKA----NGQSFFQNNESGPHGFSHTSEAIEDGEVI 420

Query: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRL 474
           GIITLEDVFEELLQEEIVDETD YVDVHKRIRVAAAA  AASS+ARAPS+R+L
Sbjct: 421 GIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAA--AASSIARAPSSRKL 456

BLAST of CmoCh19G000510 vs. Swiss-Prot
Match: Y4423_ARATH (DUF21 domain-containing protein At4g14230 OS=Arabidopsis thaliana GN=CBSDUF2 PE=2 SV=1)

HSP 1 Score: 573.2 bits (1476), Expect = 2.8e-162
Identity = 315/448 (70.31%), Postives = 362/448 (80.80%), Query Frame = 1

Query: 26  QVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMSGLTLGLMSLNLVELEILQRTGSSTE 85
           Q   ++++ IPF +  W  YAG+SC LVLFAGIMSGLTLGLMSL LVELEILQR+G+  E
Sbjct: 18  QSNALQSEAIPFGSLEWITYAGISCFLVLFAGIMSGLTLGLMSLGLVELEILQRSGTPKE 77

Query: 86  KKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIYLDKIFHPFVAVLLSVTFVLVFGEII 145
           KKQ+A I PV+QKQHQLLVTLLL NA AME LPIYLDKIF+ +VA++LSVTFVL  GE+I
Sbjct: 78  KKQSAAIFPVVQKQHQLLVTLLLFNALAMEGLPIYLDKIFNEYVAIILSVTFVLFVGEVI 137

Query: 146 PQAICSRYGLAVGANFIWLVRILMIICYPISYPVGKVLDALLGHHDALFRRAQLKALVSI 205
           PQAIC+RYGLAVGAN +WLVRILM++ YPIS+P+ K+LD +LGH+D LFRRAQLKALVSI
Sbjct: 138 PQAICTRYGLAVGANLVWLVRILMVLSYPISFPIAKMLDWVLGHNDPLFRRAQLKALVSI 197

Query: 206 HGQEAGKGGELTHDETTIISGALDLTEKTAEAAMTPIESTFSLDVNSNLDWEAIGKILAR 265
           HG+ AGKGGELTHDETTIISGALDLTEKTA+ AMTPIESTFSLDVNS LD EA+ KI AR
Sbjct: 198 HGEAAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFSLDVNSKLDREAMDKIQAR 257

Query: 266 GHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPVSAVSIRRIPRVPSDMPLYDILNEFQ 325
           GHSRVPVYS+NPKN+IGLLLVKSLLTVR ET T VSAV IRRIPRVP++MPLYDILNEFQ
Sbjct: 258 GHSRVPVYSDNPKNVIGLLLVKSLLTVRPETGTLVSAVGIRRIPRVPANMPLYDILNEFQ 317

Query: 326 KGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTSKISPLFIPLLSKHDNDLERVDVDIE 385
           KGSSHMAAVVKVK K+K        +   ++  +S  S L  PLL K + + + V V I+
Sbjct: 318 KGSSHMAAVVKVKGKSKGHPSTLHEENSGESNVSSNNSELTAPLLLKREGNHDSVIVRID 377

Query: 386 KAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVIGIITLEDVFEELLQEEIVDETDVYV 445
           KA   + I +          G  +  E+IEDG+VIGIITLEDVFEELLQEEIVDETD Y+
Sbjct: 378 KANGQSFISE------AGRQGFSHTSEEIEDGDVIGIITLEDVFEELLQEEIVDETDEYI 437

Query: 446 DVHKRIRVAAAAAVAASSVARAPSTRRL 474
           DVHKRIRVA  AAVA SS+ARAPS RRL
Sbjct: 438 DVHKRIRVATVAAVAISSLARAPSGRRL 459

BLAST of CmoCh19G000510 vs. Swiss-Prot
Match: Y5279_ARATH (DUF21 domain-containing protein At5g52790 OS=Arabidopsis thaliana GN=CBSDUF5 PE=2 SV=2)

HSP 1 Score: 375.6 bits (963), Expect = 8.5e-103
Identity = 211/425 (49.65%), Postives = 291/425 (68.47%), Query Frame = 1

Query: 32  ADDIPFPTAWWFVYAGLSCLLVLFAGIMSGLTLGLMSLNLVELEILQRTGSSTEKKQAAT 91
           A+D+P     ++VY  +   LV+FAG+MSGLTLGLMSL++VELE++ + G   ++K A  
Sbjct: 3   ANDVPCCETMFWVYLLVCVALVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNAEK 62

Query: 92  IIPVLQKQHQLLVTLLLCNACAMEALPIYLDKIFHPFVAVLLSVTFVLVFGEIIPQAICS 151
           I+P+++ QH LL TLL+ NA AMEALPI++D +   + A+L+SVT +L FGEIIPQA+CS
Sbjct: 63  ILPLVKNQHLLLCTLLIGNALAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAVCS 122

Query: 152 RYGLAVGANFIWLVRILMIICYPISYPVGKVLDALLG-HHDALFRRAQLKALVSIHGQEA 211
           RYGL++GA   +LVR+++I+ +P+SYP+ K+LD LLG  H  L  RA+LK+LV +HG EA
Sbjct: 123 RYGLSIGAKLSFLVRLIIIVFFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGNEA 182

Query: 212 GKGGELTHDETTIISGALDLTEKTAEAAMTPIESTFSLDVNSNLDWEAIGKILARGHSRV 271
           GKGGELTHDETTIISGALD+++K+A+ AMTP+   FSLD+N  LD + +G I + GHSR+
Sbjct: 183 GKGGELTHDETTIISGALDMSQKSAKDAMTPVSQIFSLDINFKLDEKTMGLIASAGHSRI 242

Query: 272 PVYSENPKNIIGLLLVKSLLTVRAETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSH 331
           P+YS NP  IIG +LVK+L+ VR E ET +  + IRR+P+V  ++PLYDILN FQ G SH
Sbjct: 243 PIYSVNPNVIIGFILVKNLIKVRPEDETSIRDLPIRRMPKVDLNLPLYDILNIFQTGRSH 302

Query: 332 MAAVVKVKEKNKNSVFVSDGDKHEK--NKFTSKISPLFIPLLSKHDNDLERVDVDIEKAP 391
           MAAVV  K     +  V     HEK  N   +K + +F+ + + + ++            
Sbjct: 303 MAAVVGTKNHTNTNTPV-----HEKSINGSPNKDANVFLSIPALNSSETSH--------- 362

Query: 392 RNTDIRQTMQHNVVATNGVCNLFEDIEDGEVIGIITLEDVFEELLQEEIVDETDVYVDVH 451
                    Q  +   + + +     ED EVIGIITLEDV EEL+QEEI DETD YV++H
Sbjct: 363 ---------QSPIRYIDSISD-----EDEEVIGIITLEDVMEELIQEEIYDETDQYVELH 399

Query: 452 KRIRV 454
           KRI +
Sbjct: 423 KRITI 399

BLAST of CmoCh19G000510 vs. Swiss-Prot
Match: Y2452_ARATH (DUF21 domain-containing protein At2g14520 OS=Arabidopsis thaliana GN=CBSDUF3 PE=2 SV=2)

HSP 1 Score: 375.2 bits (962), Expect = 1.1e-102
Identity = 218/427 (51.05%), Postives = 287/427 (67.21%), Query Frame = 1

Query: 42  WFVYAGLSCLLVLFAGIMSGLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQ 101
           +F++  +  LLVLFAG+MSGLTLGLMS++LV+LE+L ++G+  ++  AA I+PV++ QH 
Sbjct: 12  FFIHIAVIVLLVLFAGLMSGLTLGLMSMSLVDLEVLAKSGTPRDRIHAAKILPVVKNQHL 71

Query: 102 LLVTLLLCNACAMEALPIYLDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANF 161
           LL TLL+CNA AMEALPI+LD +   + A+L+SVT +L+FGEIIPQ++CSR+GLA+GA  
Sbjct: 72  LLCTLLICNAAAMEALPIFLDALVTAWGAILISVTLILLFGEIIPQSVCSRHGLAIGATV 131

Query: 162 IWLVRILMIICYPISYPVGKVLDALLGH-HDALFRRAQLKALVSIHGQEAGKGGELTHDE 221
              VR+L+ IC P+++P+ K+LD LLGH   ALFRRA+LK LV +HG EAGKGGELTHDE
Sbjct: 132 APFVRVLVWICLPVAWPISKLLDFLLGHGRVALFRRAELKTLVDLHGNEAGKGGELTHDE 191

Query: 222 TTIISGALDLTEKTAEAAMTPIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNI 281
           TTII+GAL+L+EK A+ AMTPI  TF +D+N+ LD + +  IL +GHSRVPVY E   NI
Sbjct: 192 TTIIAGALELSEKMAKDAMTPISDTFVIDINAKLDRDLMNLILDKGHSRVPVYYEQRTNI 251

Query: 282 IGLLLVKSLLTVRAETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEK 341
           IGL+LVK+LLT+  + E  V  V+IRRIPRVP  +PLYDILNEFQKG SHMA VV+    
Sbjct: 252 IGLVLVKNLLTINPDEEIQVKNVTIRRIPRVPETLPLYDILNEFQKGHSHMAVVVRQ--- 311

Query: 342 NKNSVFVSDGDKHEKNKFTSKISPLFIPLLSKHDNDLERVDVDIEKAPRNTDI--RQTMQ 401
                               KI PL     +    +  RVDVD E++P+ T +  R+++Q
Sbjct: 312 ------------------CDKIHPLQSNDAANETVNEVRVDVDYERSPQETKLKRRRSLQ 371

Query: 402 HNVVATNGVCNLFE-----------DI------------EDGEVIGIITLEDVFEELLQE 443
                 N   +L             DI            E+ + +GIIT+EDV EELLQE
Sbjct: 372 KWKSFPNRANSLGSRSKRWSKDNDADILQLNEHPLPKLDEEEDAVGIITMEDVIEELLQE 417

BLAST of CmoCh19G000510 vs. TrEMBL
Match: W9QKK0_9ROSA (Putative DUF21 domain-containing protein OS=Morus notabilis GN=L484_025039 PE=4 SV=1)

HSP 1 Score: 691.8 bits (1784), Expect = 5.9e-196
Identity = 379/473 (80.13%), Postives = 410/473 (86.68%), Query Frame = 1

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           +LLLN+LTLA T      +         V EA+DI F   WWFV+AG+SCLLVLFAGIMS
Sbjct: 6   VLLLNALTLARTMTVSTSDL--------VFEAEDIEFGQPWWFVFAGVSCLLVLFAGIMS 65

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSLNLVELEILQR+G+STEKKQAATI+PV+QKQHQLLVTLLLCNACAMEALPIY
Sbjct: 66  GLTLGLMSLNLVELEILQRSGTSTEKKQAATILPVVQKQHQLLVTLLLCNACAMEALPIY 125

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDKIFHPFVAVLLSVTFVL FGEIIPQAICSRYGLAVGANF+WLVRILMIICYPISYP+G
Sbjct: 126 LDKIFHPFVAVLLSVTFVLAFGEIIPQAICSRYGLAVGANFVWLVRILMIICYPISYPIG 185

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           KVLDA+LGH++ LFRRAQLKALVSIH QEAGKGGELTHDETTIISGALDLTEKTAE AMT
Sbjct: 186 KVLDAVLGHNEVLFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAEEAMT 245

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDV+S LDWEAIGKILARGHSRVPV+S NPKNIIGLLLVKSLLTVRAETETPV
Sbjct: 246 PIESTFSLDVSSKLDWEAIGKILARGHSRVPVFSGNPKNIIGLLLVKSLLTVRAETETPV 305

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAVSIRR+PRVP+DMPLYDILNEFQKGSSHMAAVVK+K K+K      DG+K E++ FT+
Sbjct: 306 SAVSIRRMPRVPADMPLYDILNEFQKGSSHMAAVVKIKGKSKIPQPALDGEKCEEDTFTN 365

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420
             S L  PLL+KHD+    + +D+EKA R     QT+Q N V TNG     EDIEDGEVI
Sbjct: 366 AKSQLTTPLLTKHDDKSGSIVIDVEKASRPLTNMQTLQQNGVTTNGFPYSSEDIEDGEVI 425

Query: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRL 474
           GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAA  AASSVARAPS RRL
Sbjct: 426 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAA--AASSVARAPSNRRL 468

BLAST of CmoCh19G000510 vs. TrEMBL
Match: B9T4A2_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0176310 PE=4 SV=1)

HSP 1 Score: 688.3 bits (1775), Expect = 6.6e-195
Identity = 380/474 (80.17%), Postives = 405/474 (85.44%), Query Frame = 1

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           MLLLN+LTLA T     F+ NH      V EADDI F T WWF+YAG+SCLLVLFAGIMS
Sbjct: 1   MLLLNALTLART----MFSINHI-----VFEADDIKFATLWWFIYAGISCLLVLFAGIMS 60

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSL LVELEILQR+GS TEKKQAATI+PV+QKQHQLLVTLLLCNACAMEALPIY
Sbjct: 61  GLTLGLMSLGLVELEILQRSGSFTEKKQAATILPVVQKQHQLLVTLLLCNACAMEALPIY 120

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDKIFHPFVAV+LSVTFVL FGEIIPQAICSRYGL VGAN +WLVRILM ICYPI+YP+G
Sbjct: 121 LDKIFHPFVAVVLSVTFVLAFGEIIPQAICSRYGLYVGANLVWLVRILMFICYPIAYPIG 180

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           KVLDA LGH DALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAE AMT
Sbjct: 181 KVLDAALGHDDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEEAMT 240

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDVNS LDWEAIGKILARGHSRVPVYS  PKNIIGLLLVKSLLTVRAETETPV
Sbjct: 241 PIESTFSLDVNSKLDWEAIGKILARGHSRVPVYSGCPKNIIGLLLVKSLLTVRAETETPV 300

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAVSIRRIPRVPS+MPLYDILNEFQKGSSHMAAVVKV  K+KN+   SDG+K  + KF +
Sbjct: 301 SAVSIRRIPRVPSNMPLYDILNEFQKGSSHMAAVVKVHAKSKNAQPTSDGEKFNEIKFAN 360

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420
             S L  PLL+KHD   E + +D+EKA R   I+Q   H++        L ED+EDGEVI
Sbjct: 361 GDSQLNAPLLTKHDGKSEHLLIDVEKAARPMTIKQQKTHDIP------RLSEDVEDGEVI 420

Query: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRLP 475
           GIITLEDVFEELLQEEIVDETDVYVDVHKRIRV AAA  AAS VARAPS RRLP
Sbjct: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVVAAA--AASYVARAPSNRRLP 457

BLAST of CmoCh19G000510 vs. TrEMBL
Match: A0A061DQY0_THECC (CBS domain-containing protein with a domain of Uncharacterized protein function isoform 1 OS=Theobroma cacao GN=TCM_004436 PE=4 SV=1)

HSP 1 Score: 684.9 bits (1766), Expect = 7.3e-194
Identity = 370/473 (78.22%), Postives = 410/473 (86.68%), Query Frame = 1

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           MLL N++ LA T      N+        + E DDIPF +  WFVYAG SCLLVLFAGIMS
Sbjct: 1   MLLQNAIVLARTIMTLSPNDI-------LFEPDDIPFGSVKWFVYAGFSCLLVLFAGIMS 60

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMS++LVELEILQR+G+ TEKKQAATI+PV+++QHQLLVTLLLCNACAMEALPI 
Sbjct: 61  GLTLGLMSMSLVELEILQRSGTITEKKQAATILPVVKRQHQLLVTLLLCNACAMEALPIS 120

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDKIFHPFVAVLLSVTFVL FGEIIPQAICSRYGL+VGANF+WLVRILMIICYPI+YP+G
Sbjct: 121 LDKIFHPFVAVLLSVTFVLAFGEIIPQAICSRYGLSVGANFVWLVRILMIICYPIAYPIG 180

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           KVLDA++GH DALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAE AMT
Sbjct: 181 KVLDAVIGHGDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEEAMT 240

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDVNS LDWEA+GKILARGHSR+PVY+ NPKNIIGLLLVKSLLTVRAETETPV
Sbjct: 241 PIESTFSLDVNSKLDWEAVGKILARGHSRIPVYAGNPKNIIGLLLVKSLLTVRAETETPV 300

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAVSIRRIPRVP+ MPLYDILNEFQKGSSHMAAVVKVKEK K+  F  DG+K ++++ T+
Sbjct: 301 SAVSIRRIPRVPAHMPLYDILNEFQKGSSHMAAVVKVKEKTKDPEFFDDGEKFDEHRVTN 360

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420
             S L  PLL+K+D  L  V VD+EK  R   +++T+Q N V  N + +  EDIEDGEVI
Sbjct: 361 GNSQLTTPLLTKYDTKLNSVAVDVEKPSRPITVKKTLQENGVTANTLHHFTEDIEDGEVI 420

Query: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRL 474
           GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAA  AASSVARAPS+RRL
Sbjct: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAA--AASSVARAPSSRRL 464

BLAST of CmoCh19G000510 vs. TrEMBL
Match: F6HQ68_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0104g00300 PE=4 SV=1)

HSP 1 Score: 679.5 bits (1752), Expect = 3.0e-192
Identity = 371/475 (78.11%), Postives = 407/475 (85.68%), Query Frame = 1

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           M LLN+LTL +      F          V+  +DI F + WWFVYAG+SC LVLFAGIMS
Sbjct: 1   MSLLNALTLGSMPTTGEF----------VLRTEDIEFGSLWWFVYAGVSCFLVLFAGIMS 60

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSL LVELEILQR+G+S EKKQAA I+PV+QKQHQLLVTLLLCNACAMEALPIY
Sbjct: 61  GLTLGLMSLGLVELEILQRSGTSAEKKQAAAILPVVQKQHQLLVTLLLCNACAMEALPIY 120

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDKIFHPFVA+LLSVTFVL FGEIIPQAIC+RYGL+VGANF+WLVRILMIICYPI++P+G
Sbjct: 121 LDKIFHPFVAILLSVTFVLAFGEIIPQAICTRYGLSVGANFVWLVRILMIICYPIAFPIG 180

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           KVLDA+LGH+DALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAE AMT
Sbjct: 181 KVLDAVLGHNDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEEAMT 240

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDVNS LDWEAIGKILARGHSRVPVYS NPKNIIGLLLVKSLLTVRAETETPV
Sbjct: 241 PIESTFSLDVNSKLDWEAIGKILARGHSRVPVYSGNPKNIIGLLLVKSLLTVRAETETPV 300

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAVSIR+IPRVP+DMPLYDILNEFQKGSSHMAAVVKVK KNKN +   DG++ E+NK  +
Sbjct: 301 SAVSIRKIPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKNKNPLPKGDGERFEENKVAN 360

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPR--NTDIRQTMQHNVVATNGVCNLFEDIEDGE 420
             S    PLL+  ++  E V VDI+K P+  NT+ +   Q N   TN + +L EDIEDGE
Sbjct: 361 GNSQYTTPLLANDNDKSENVVVDIDKVPKPTNTNKQTPSQQNGATTNSLPHLPEDIEDGE 420

Query: 421 VIGIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRL 474
           VIGIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAA  AAS VARAPS+RRL
Sbjct: 421 VIGIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAA--AASFVARAPSSRRL 463

BLAST of CmoCh19G000510 vs. TrEMBL
Match: A0A067JCB9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21962 PE=4 SV=1)

HSP 1 Score: 676.0 bits (1743), Expect = 3.4e-191
Identity = 380/475 (80.00%), Postives = 405/475 (85.26%), Query Frame = 1

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           ML++N+L LA T   F  N+        V E DDI F   WWFVYAG+SCLLVLFAGIMS
Sbjct: 1   MLIVNALALARTM--FSINDI-------VFEPDDIEFGNVWWFVYAGVSCLLVLFAGIMS 60

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSL LVELEILQR+GSSTEKKQAA IIPV+QKQHQLLVTLLLCNACAMEALPI 
Sbjct: 61  GLTLGLMSLGLVELEILQRSGSSTEKKQAAVIIPVVQKQHQLLVTLLLCNACAMEALPIC 120

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDKIFHPFVAVLLSVTFVL FGE+IPQAICSRYGL VGANF+WLVRILMIICYPI+YP+G
Sbjct: 121 LDKIFHPFVAVLLSVTFVLAFGEVIPQAICSRYGLFVGANFVWLVRILMIICYPIAYPIG 180

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           K+LDA LGH DALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAE AMT
Sbjct: 181 KILDAALGHSDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEEAMT 240

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDVNS LDWEAIGKILARGHSRVPVYS + KNIIGLLLVKSLLTVRAETETPV
Sbjct: 241 PIESTFSLDVNSKLDWEAIGKILARGHSRVPVYSGSQKNIIGLLLVKSLLTVRAETETPV 300

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAVSIRRIPRV SDMPLYDILNEFQKGSSHMAAVVKV  K+K         K   +KF +
Sbjct: 301 SAVSIRRIPRVTSDMPLYDILNEFQKGSSHMAAVVKVHAKSK---------KFNNSKFAN 360

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQ--TMQHNVVATNGVCNLFEDIEDGE 420
             S L  PLL+KHD+  E V +DIEKA R T I++  T++ N VATN + +L EDIEDGE
Sbjct: 361 GDSELNTPLLNKHDDKSESVIIDIEKAARPTTIKENLTLEPNGVATNMMPHLSEDIEDGE 420

Query: 421 VIGIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRL 474
           VIGIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAA  AASSVARAPS RRL
Sbjct: 421 VIGIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAA--AASSVARAPSNRRL 455

BLAST of CmoCh19G000510 vs. TAIR10
Match: AT1G03270.1 (AT1G03270.1 CBS domain-containing protein with a domain of unknown function (DUF21))

HSP 1 Score: 621.3 bits (1601), Expect = 5.0e-178
Identity = 341/463 (73.65%), Postives = 383/463 (82.72%), Query Frame = 1

Query: 8   TLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMSGLTLGLM 67
           TLA   AA+  N+        V EA+DI F + WWFV  G++C LVLFAGIMSGLTLGLM
Sbjct: 6   TLALVRAAYSLNSF-------VFEAEDIRFGSPWWFVVVGVACFLVLFAGIMSGLTLGLM 65

Query: 68  SLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIYLDKIFHP 127
           SL LVELEILQ++GSS EKKQAA I+PV++KQHQLLVTLLLCNA AMEALPI LDKIFHP
Sbjct: 66  SLGLVELEILQQSGSSAEKKQAAAILPVVKKQHQLLVTLLLCNAAAMEALPICLDKIFHP 125

Query: 128 FVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVGKVLDALL 187
           FVAVLLSVTFVL FGEIIPQAICSRYGLAVGANF+WLVRILMIICYPI+YP+GKVLDA++
Sbjct: 126 FVAVLLSVTFVLAFGEIIPQAICSRYGLAVGANFLWLVRILMIICYPIAYPIGKVLDAVI 185

Query: 188 GHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMTPIESTFS 247
           GH+D LFRRAQLKALVSIH QEAGKGGELTH+ET IISGALDL++KTAE AMTPIESTFS
Sbjct: 186 GHNDTLFRRAQLKALVSIHSQEAGKGGELTHEETMIISGALDLSQKTAEEAMTPIESTFS 245

Query: 248 LDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPVSAVSIRR 307
           LDVN+ LDWE IGKIL+RGHSR+PVY  NPKNIIGLLLVKSLLTVRAETE PVS+VSIR+
Sbjct: 246 LDVNTKLDWETIGKILSRGHSRIPVYLGNPKNIIGLLLVKSLLTVRAETEAPVSSVSIRK 305

Query: 308 IPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNK--NSVFVSDGDKHEKNKFTSKISPL 367
           IPRVPSDMPLYDILNEFQKGSSHMAAVVKVK+K+K  N   +S+G+  ++N    + S L
Sbjct: 306 IPRVPSDMPLYDILNEFQKGSSHMAAVVKVKDKDKKNNMQLLSNGETPKENMKFYQSSNL 365

Query: 368 FIPLLSKHDNDLERVDVDIEKAPRNTDIR-QTMQHNVVATNGVCNLFEDIEDGEVIGIIT 427
             PLL    +D   V VDI+K P++   R +  Q N   T  +  L ED ED EVIGIIT
Sbjct: 366 TAPLLKHESHD---VVVDIDKVPKHVKNRGRNFQQNGTVTRDLPCLLEDNEDAEVIGIIT 425

Query: 428 LEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARA 468
           LEDVFEELLQ EIVDETDVY+DVHKR+RVAAAAA A SS+ RA
Sbjct: 426 LEDVFEELLQAEIVDETDVYIDVHKRVRVAAAAAAAVSSITRA 458

BLAST of CmoCh19G000510 vs. TAIR10
Match: AT4G14240.1 (AT4G14240.1 CBS domain-containing protein with a domain of unknown function (DUF21))

HSP 1 Score: 600.9 bits (1548), Expect = 7.0e-172
Identity = 336/473 (71.04%), Postives = 382/473 (80.76%), Query Frame = 1

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           M L+N++  A   +    +N +N         + IPF +  W  YAG+SC LVLFAGIMS
Sbjct: 1   MHLINAVAAARILSGIGQSNGNNG-------GEAIPFGSFEWITYAGISCFLVLFAGIMS 60

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSL LVELEILQR+G+  EKKQAA I PV+QKQHQLLVTLLLCNA AME LPIY
Sbjct: 61  GLTLGLMSLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIY 120

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDK+F+ +VA++LSVTFVL FGE+IPQAIC+RYGLAVGANF+WLVRILM +CYPI++P+G
Sbjct: 121 LDKLFNEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIG 180

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           K+LD +LGH+DALFRRAQLKALVSIH QEAGKGGELTHDETTIISGALDLTEKTA+ AMT
Sbjct: 181 KILDLVLGHNDALFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMT 240

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDVNS LDWEA+GKILARGHSRVPVYS NPKN+IGLLLVKSLLTVR ETET V
Sbjct: 241 PIESTFSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLV 300

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAV IRRIPRVP+DMPLYDILNEFQKGSSHMAAVVKVK K+K    V      E++   S
Sbjct: 301 SAVCIRRIPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSK----VPPSTLLEEHTDES 360

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420
             S L  PLL K + + + V V I+KA    + +   Q+N    +G  +  E IEDGEVI
Sbjct: 361 NDSDLTAPLLLKREGNHDNVIVTIDKA----NGQSFFQNNESGPHGFSHTSEAIEDGEVI 420

Query: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRL 474
           GIITLEDVFEELLQEEIVDETD YVDVHKRIRVAAAA  AASS+ARAPS+R+L
Sbjct: 421 GIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAA--AASSIARAPSSRKL 456

BLAST of CmoCh19G000510 vs. TAIR10
Match: AT4G14230.1 (AT4G14230.1 CBS domain-containing protein with a domain of unknown function (DUF21))

HSP 1 Score: 573.2 bits (1476), Expect = 1.6e-163
Identity = 315/448 (70.31%), Postives = 362/448 (80.80%), Query Frame = 1

Query: 26  QVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMSGLTLGLMSLNLVELEILQRTGSSTE 85
           Q   ++++ IPF +  W  YAG+SC LVLFAGIMSGLTLGLMSL LVELEILQR+G+  E
Sbjct: 18  QSNALQSEAIPFGSLEWITYAGISCFLVLFAGIMSGLTLGLMSLGLVELEILQRSGTPKE 77

Query: 86  KKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIYLDKIFHPFVAVLLSVTFVLVFGEII 145
           KKQ+A I PV+QKQHQLLVTLLL NA AME LPIYLDKIF+ +VA++LSVTFVL  GE+I
Sbjct: 78  KKQSAAIFPVVQKQHQLLVTLLLFNALAMEGLPIYLDKIFNEYVAIILSVTFVLFVGEVI 137

Query: 146 PQAICSRYGLAVGANFIWLVRILMIICYPISYPVGKVLDALLGHHDALFRRAQLKALVSI 205
           PQAIC+RYGLAVGAN +WLVRILM++ YPIS+P+ K+LD +LGH+D LFRRAQLKALVSI
Sbjct: 138 PQAICTRYGLAVGANLVWLVRILMVLSYPISFPIAKMLDWVLGHNDPLFRRAQLKALVSI 197

Query: 206 HGQEAGKGGELTHDETTIISGALDLTEKTAEAAMTPIESTFSLDVNSNLDWEAIGKILAR 265
           HG+ AGKGGELTHDETTIISGALDLTEKTA+ AMTPIESTFSLDVNS LD EA+ KI AR
Sbjct: 198 HGEAAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFSLDVNSKLDREAMDKIQAR 257

Query: 266 GHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPVSAVSIRRIPRVPSDMPLYDILNEFQ 325
           GHSRVPVYS+NPKN+IGLLLVKSLLTVR ET T VSAV IRRIPRVP++MPLYDILNEFQ
Sbjct: 258 GHSRVPVYSDNPKNVIGLLLVKSLLTVRPETGTLVSAVGIRRIPRVPANMPLYDILNEFQ 317

Query: 326 KGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTSKISPLFIPLLSKHDNDLERVDVDIE 385
           KGSSHMAAVVKVK K+K        +   ++  +S  S L  PLL K + + + V V I+
Sbjct: 318 KGSSHMAAVVKVKGKSKGHPSTLHEENSGESNVSSNNSELTAPLLLKREGNHDSVIVRID 377

Query: 386 KAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVIGIITLEDVFEELLQEEIVDETDVYV 445
           KA   + I +          G  +  E+IEDG+VIGIITLEDVFEELLQEEIVDETD Y+
Sbjct: 378 KANGQSFISE------AGRQGFSHTSEEIEDGDVIGIITLEDVFEELLQEEIVDETDEYI 437

Query: 446 DVHKRIRVAAAAAVAASSVARAPSTRRL 474
           DVHKRIRVA  AAVA SS+ARAPS RRL
Sbjct: 438 DVHKRIRVATVAAVAISSLARAPSGRRL 459

BLAST of CmoCh19G000510 vs. TAIR10
Match: AT5G52790.1 (AT5G52790.1 CBS domain-containing protein with a domain of unknown function (DUF21))

HSP 1 Score: 375.6 bits (963), Expect = 4.8e-104
Identity = 211/425 (49.65%), Postives = 291/425 (68.47%), Query Frame = 1

Query: 32  ADDIPFPTAWWFVYAGLSCLLVLFAGIMSGLTLGLMSLNLVELEILQRTGSSTEKKQAAT 91
           A+D+P     ++VY  +   LV+FAG+MSGLTLGLMSL++VELE++ + G   ++K A  
Sbjct: 3   ANDVPCCETMFWVYLLVCVALVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNAEK 62

Query: 92  IIPVLQKQHQLLVTLLLCNACAMEALPIYLDKIFHPFVAVLLSVTFVLVFGEIIPQAICS 151
           I+P+++ QH LL TLL+ NA AMEALPI++D +   + A+L+SVT +L FGEIIPQA+CS
Sbjct: 63  ILPLVKNQHLLLCTLLIGNALAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAVCS 122

Query: 152 RYGLAVGANFIWLVRILMIICYPISYPVGKVLDALLG-HHDALFRRAQLKALVSIHGQEA 211
           RYGL++GA   +LVR+++I+ +P+SYP+ K+LD LLG  H  L  RA+LK+LV +HG EA
Sbjct: 123 RYGLSIGAKLSFLVRLIIIVFFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGNEA 182

Query: 212 GKGGELTHDETTIISGALDLTEKTAEAAMTPIESTFSLDVNSNLDWEAIGKILARGHSRV 271
           GKGGELTHDETTIISGALD+++K+A+ AMTP+   FSLD+N  LD + +G I + GHSR+
Sbjct: 183 GKGGELTHDETTIISGALDMSQKSAKDAMTPVSQIFSLDINFKLDEKTMGLIASAGHSRI 242

Query: 272 PVYSENPKNIIGLLLVKSLLTVRAETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSH 331
           P+YS NP  IIG +LVK+L+ VR E ET +  + IRR+P+V  ++PLYDILN FQ G SH
Sbjct: 243 PIYSVNPNVIIGFILVKNLIKVRPEDETSIRDLPIRRMPKVDLNLPLYDILNIFQTGRSH 302

Query: 332 MAAVVKVKEKNKNSVFVSDGDKHEK--NKFTSKISPLFIPLLSKHDNDLERVDVDIEKAP 391
           MAAVV  K     +  V     HEK  N   +K + +F+ + + + ++            
Sbjct: 303 MAAVVGTKNHTNTNTPV-----HEKSINGSPNKDANVFLSIPALNSSETSH--------- 362

Query: 392 RNTDIRQTMQHNVVATNGVCNLFEDIEDGEVIGIITLEDVFEELLQEEIVDETDVYVDVH 451
                    Q  +   + + +     ED EVIGIITLEDV EEL+QEEI DETD YV++H
Sbjct: 363 ---------QSPIRYIDSISD-----EDEEVIGIITLEDVMEELIQEEIYDETDQYVELH 399

Query: 452 KRIRV 454
           KRI +
Sbjct: 423 KRITI 399

BLAST of CmoCh19G000510 vs. TAIR10
Match: AT2G14520.1 (AT2G14520.1 CBS domain-containing protein with a domain of unknown function (DUF21))

HSP 1 Score: 375.2 bits (962), Expect = 6.2e-104
Identity = 218/427 (51.05%), Postives = 287/427 (67.21%), Query Frame = 1

Query: 42  WFVYAGLSCLLVLFAGIMSGLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQ 101
           +F++  +  LLVLFAG+MSGLTLGLMS++LV+LE+L ++G+  ++  AA I+PV++ QH 
Sbjct: 12  FFIHIAVIVLLVLFAGLMSGLTLGLMSMSLVDLEVLAKSGTPRDRIHAAKILPVVKNQHL 71

Query: 102 LLVTLLLCNACAMEALPIYLDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANF 161
           LL TLL+CNA AMEALPI+LD +   + A+L+SVT +L+FGEIIPQ++CSR+GLA+GA  
Sbjct: 72  LLCTLLICNAAAMEALPIFLDALVTAWGAILISVTLILLFGEIIPQSVCSRHGLAIGATV 131

Query: 162 IWLVRILMIICYPISYPVGKVLDALLGH-HDALFRRAQLKALVSIHGQEAGKGGELTHDE 221
              VR+L+ IC P+++P+ K+LD LLGH   ALFRRA+LK LV +HG EAGKGGELTHDE
Sbjct: 132 APFVRVLVWICLPVAWPISKLLDFLLGHGRVALFRRAELKTLVDLHGNEAGKGGELTHDE 191

Query: 222 TTIISGALDLTEKTAEAAMTPIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNI 281
           TTII+GAL+L+EK A+ AMTPI  TF +D+N+ LD + +  IL +GHSRVPVY E   NI
Sbjct: 192 TTIIAGALELSEKMAKDAMTPISDTFVIDINAKLDRDLMNLILDKGHSRVPVYYEQRTNI 251

Query: 282 IGLLLVKSLLTVRAETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEK 341
           IGL+LVK+LLT+  + E  V  V+IRRIPRVP  +PLYDILNEFQKG SHMA VV+    
Sbjct: 252 IGLVLVKNLLTINPDEEIQVKNVTIRRIPRVPETLPLYDILNEFQKGHSHMAVVVRQ--- 311

Query: 342 NKNSVFVSDGDKHEKNKFTSKISPLFIPLLSKHDNDLERVDVDIEKAPRNTDI--RQTMQ 401
                               KI PL     +    +  RVDVD E++P+ T +  R+++Q
Sbjct: 312 ------------------CDKIHPLQSNDAANETVNEVRVDVDYERSPQETKLKRRRSLQ 371

Query: 402 HNVVATNGVCNLFE-----------DI------------EDGEVIGIITLEDVFEELLQE 443
                 N   +L             DI            E+ + +GIIT+EDV EELLQE
Sbjct: 372 KWKSFPNRANSLGSRSKRWSKDNDADILQLNEHPLPKLDEEEDAVGIITMEDVIEELLQE 417

BLAST of CmoCh19G000510 vs. NCBI nr
Match: gi|703076596|ref|XP_010090374.1| (Putative DUF21 domain-containing protein [Morus notabilis])

HSP 1 Score: 691.8 bits (1784), Expect = 8.5e-196
Identity = 379/473 (80.13%), Postives = 410/473 (86.68%), Query Frame = 1

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           +LLLN+LTLA T      +         V EA+DI F   WWFV+AG+SCLLVLFAGIMS
Sbjct: 6   VLLLNALTLARTMTVSTSDL--------VFEAEDIEFGQPWWFVFAGVSCLLVLFAGIMS 65

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSLNLVELEILQR+G+STEKKQAATI+PV+QKQHQLLVTLLLCNACAMEALPIY
Sbjct: 66  GLTLGLMSLNLVELEILQRSGTSTEKKQAATILPVVQKQHQLLVTLLLCNACAMEALPIY 125

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDKIFHPFVAVLLSVTFVL FGEIIPQAICSRYGLAVGANF+WLVRILMIICYPISYP+G
Sbjct: 126 LDKIFHPFVAVLLSVTFVLAFGEIIPQAICSRYGLAVGANFVWLVRILMIICYPISYPIG 185

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           KVLDA+LGH++ LFRRAQLKALVSIH QEAGKGGELTHDETTIISGALDLTEKTAE AMT
Sbjct: 186 KVLDAVLGHNEVLFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAEEAMT 245

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDV+S LDWEAIGKILARGHSRVPV+S NPKNIIGLLLVKSLLTVRAETETPV
Sbjct: 246 PIESTFSLDVSSKLDWEAIGKILARGHSRVPVFSGNPKNIIGLLLVKSLLTVRAETETPV 305

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAVSIRR+PRVP+DMPLYDILNEFQKGSSHMAAVVK+K K+K      DG+K E++ FT+
Sbjct: 306 SAVSIRRMPRVPADMPLYDILNEFQKGSSHMAAVVKIKGKSKIPQPALDGEKCEEDTFTN 365

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420
             S L  PLL+KHD+    + +D+EKA R     QT+Q N V TNG     EDIEDGEVI
Sbjct: 366 AKSQLTTPLLTKHDDKSGSIVIDVEKASRPLTNMQTLQQNGVTTNGFPYSSEDIEDGEVI 425

Query: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRL 474
           GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAA  AASSVARAPS RRL
Sbjct: 426 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAA--AASSVARAPSNRRL 468

BLAST of CmoCh19G000510 vs. NCBI nr
Match: gi|223527135|gb|EEF29310.1| (conserved hypothetical protein [Ricinus communis])

HSP 1 Score: 688.3 bits (1775), Expect = 9.4e-195
Identity = 380/474 (80.17%), Postives = 405/474 (85.44%), Query Frame = 1

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           MLLLN+LTLA T     F+ NH      V EADDI F T WWF+YAG+SCLLVLFAGIMS
Sbjct: 1   MLLLNALTLART----MFSINHI-----VFEADDIKFATLWWFIYAGISCLLVLFAGIMS 60

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSL LVELEILQR+GS TEKKQAATI+PV+QKQHQLLVTLLLCNACAMEALPIY
Sbjct: 61  GLTLGLMSLGLVELEILQRSGSFTEKKQAATILPVVQKQHQLLVTLLLCNACAMEALPIY 120

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDKIFHPFVAV+LSVTFVL FGEIIPQAICSRYGL VGAN +WLVRILM ICYPI+YP+G
Sbjct: 121 LDKIFHPFVAVVLSVTFVLAFGEIIPQAICSRYGLYVGANLVWLVRILMFICYPIAYPIG 180

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           KVLDA LGH DALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAE AMT
Sbjct: 181 KVLDAALGHDDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEEAMT 240

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDVNS LDWEAIGKILARGHSRVPVYS  PKNIIGLLLVKSLLTVRAETETPV
Sbjct: 241 PIESTFSLDVNSKLDWEAIGKILARGHSRVPVYSGCPKNIIGLLLVKSLLTVRAETETPV 300

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAVSIRRIPRVPS+MPLYDILNEFQKGSSHMAAVVKV  K+KN+   SDG+K  + KF +
Sbjct: 301 SAVSIRRIPRVPSNMPLYDILNEFQKGSSHMAAVVKVHAKSKNAQPTSDGEKFNEIKFAN 360

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420
             S L  PLL+KHD   E + +D+EKA R   I+Q   H++        L ED+EDGEVI
Sbjct: 361 GDSQLNAPLLTKHDGKSEHLLIDVEKAARPMTIKQQKTHDIP------RLSEDVEDGEVI 420

Query: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRLP 475
           GIITLEDVFEELLQEEIVDETDVYVDVHKRIRV AAA  AAS VARAPS RRLP
Sbjct: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVVAAA--AASYVARAPSNRRLP 457

BLAST of CmoCh19G000510 vs. NCBI nr
Match: gi|1000939687|ref|XP_015583232.1| (PREDICTED: uncharacterized protein LOC8266776 [Ricinus communis])

HSP 1 Score: 688.3 bits (1775), Expect = 9.4e-195
Identity = 380/474 (80.17%), Postives = 405/474 (85.44%), Query Frame = 1

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           MLLLN+LTLA T     F+ NH      V EADDI F T WWF+YAG+SCLLVLFAGIMS
Sbjct: 1   MLLLNALTLART----MFSINHI-----VFEADDIKFATLWWFIYAGISCLLVLFAGIMS 60

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSL LVELEILQR+GS TEKKQAATI+PV+QKQHQLLVTLLLCNACAMEALPIY
Sbjct: 61  GLTLGLMSLGLVELEILQRSGSFTEKKQAATILPVVQKQHQLLVTLLLCNACAMEALPIY 120

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDKIFHPFVAV+LSVTFVL FGEIIPQAICSRYGL VGAN +WLVRILM ICYPI+YP+G
Sbjct: 121 LDKIFHPFVAVVLSVTFVLAFGEIIPQAICSRYGLYVGANLVWLVRILMFICYPIAYPIG 180

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           KVLDA LGH DALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAE AMT
Sbjct: 181 KVLDAALGHDDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEEAMT 240

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDVNS LDWEAIGKILARGHSRVPVYS  PKNIIGLLLVKSLLTVRAETETPV
Sbjct: 241 PIESTFSLDVNSKLDWEAIGKILARGHSRVPVYSGCPKNIIGLLLVKSLLTVRAETETPV 300

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAVSIRRIPRVPS+MPLYDILNEFQKGSSHMAAVVKV  K+KN+   SDG+K  + KF +
Sbjct: 301 SAVSIRRIPRVPSNMPLYDILNEFQKGSSHMAAVVKVHAKSKNAQPTSDGEKFNEIKFAN 360

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420
             S L  PLL+KHD   E + +D+EKA R   I+Q   H++        L ED+EDGEVI
Sbjct: 361 GDSQLNAPLLTKHDGKSEHLLIDVEKAARPMTIKQQKTHDIP------RLSEDVEDGEVI 420

Query: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRLP 475
           GIITLEDVFEELLQEEIVDETDVYVDVHKRIRV AAA  AAS VARAPS RRLP
Sbjct: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVVAAA--AASYVARAPSNRRLP 457

BLAST of CmoCh19G000510 vs. NCBI nr
Match: gi|1009147752|ref|XP_015891576.1| (PREDICTED: DUF21 domain-containing protein At4g14240 [Ziziphus jujuba])

HSP 1 Score: 686.4 bits (1770), Expect = 3.6e-194
Identity = 378/473 (79.92%), Postives = 408/473 (86.26%), Query Frame = 1

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           +LL+N+LTL  T      +         V EA+DI F   WWFVYAG+SCL+VLFAGIMS
Sbjct: 2   VLLINALTLPRTMMVSSTDL--------VFEAEDIEFGNPWWFVYAGVSCLMVLFAGIMS 61

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMSLNLVELEILQR+G+  EKKQAA+I+PV+QKQHQLLVTLLLCNACAMEALPIY
Sbjct: 62  GLTLGLMSLNLVELEILQRSGTCAEKKQAASILPVVQKQHQLLVTLLLCNACAMEALPIY 121

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDKIFHPFVAVLLSVTFVL FGEIIPQAICSRYGLAVGANF+WLVR+LM+ICYPISYP+G
Sbjct: 122 LDKIFHPFVAVLLSVTFVLAFGEIIPQAICSRYGLAVGANFVWLVRVLMVICYPISYPIG 181

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           KVLDA+LGH+DALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAE AMT
Sbjct: 182 KVLDAVLGHNDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEEAMT 241

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDVNS LDWEAIGKILARGHSRVPVYS N KNIIGLLLVKSLLTVRAETETPV
Sbjct: 242 PIESTFSLDVNSKLDWEAIGKILARGHSRVPVYSGNAKNIIGLLLVKSLLTVRAETETPV 301

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAVSIRR+PRVP+DMPLYDILNEFQKGSSHMAAVVKVK K+K      D +K E+   T+
Sbjct: 302 SAVSIRRMPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSKIPQPTVDKEKFEE---TN 361

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420
             S L  PLL+KHD+    V VDIEK PR T  +Q++Q N V TNG+    EDIEDGEVI
Sbjct: 362 AKSDLTAPLLTKHDDKSGTVFVDIEKTPRTTTNKQSVQQNGVTTNGLPQPSEDIEDGEVI 421

Query: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRL 474
           GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAA  AASSVARAPS RRL
Sbjct: 422 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAA--AASSVARAPSNRRL 461

BLAST of CmoCh19G000510 vs. NCBI nr
Match: gi|590717685|ref|XP_007050665.1| (CBS domain-containing protein with a domain of Uncharacterized protein function isoform 1 [Theobroma cacao])

HSP 1 Score: 684.9 bits (1766), Expect = 1.0e-193
Identity = 370/473 (78.22%), Postives = 410/473 (86.68%), Query Frame = 1

Query: 1   MLLLNSLTLATTAAAFFFNNNHNHYQVPVVEADDIPFPTAWWFVYAGLSCLLVLFAGIMS 60
           MLL N++ LA T      N+        + E DDIPF +  WFVYAG SCLLVLFAGIMS
Sbjct: 1   MLLQNAIVLARTIMTLSPNDI-------LFEPDDIPFGSVKWFVYAGFSCLLVLFAGIMS 60

Query: 61  GLTLGLMSLNLVELEILQRTGSSTEKKQAATIIPVLQKQHQLLVTLLLCNACAMEALPIY 120
           GLTLGLMS++LVELEILQR+G+ TEKKQAATI+PV+++QHQLLVTLLLCNACAMEALPI 
Sbjct: 61  GLTLGLMSMSLVELEILQRSGTITEKKQAATILPVVKRQHQLLVTLLLCNACAMEALPIS 120

Query: 121 LDKIFHPFVAVLLSVTFVLVFGEIIPQAICSRYGLAVGANFIWLVRILMIICYPISYPVG 180
           LDKIFHPFVAVLLSVTFVL FGEIIPQAICSRYGL+VGANF+WLVRILMIICYPI+YP+G
Sbjct: 121 LDKIFHPFVAVLLSVTFVLAFGEIIPQAICSRYGLSVGANFVWLVRILMIICYPIAYPIG 180

Query: 181 KVLDALLGHHDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEAAMT 240
           KVLDA++GH DALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAE AMT
Sbjct: 181 KVLDAVIGHGDALFRRAQLKALVSIHGQEAGKGGELTHDETTIISGALDLTEKTAEEAMT 240

Query: 241 PIESTFSLDVNSNLDWEAIGKILARGHSRVPVYSENPKNIIGLLLVKSLLTVRAETETPV 300
           PIESTFSLDVNS LDWEA+GKILARGHSR+PVY+ NPKNIIGLLLVKSLLTVRAETETPV
Sbjct: 241 PIESTFSLDVNSKLDWEAVGKILARGHSRIPVYAGNPKNIIGLLLVKSLLTVRAETETPV 300

Query: 301 SAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKEKNKNSVFVSDGDKHEKNKFTS 360
           SAVSIRRIPRVP+ MPLYDILNEFQKGSSHMAAVVKVKEK K+  F  DG+K ++++ T+
Sbjct: 301 SAVSIRRIPRVPAHMPLYDILNEFQKGSSHMAAVVKVKEKTKDPEFFDDGEKFDEHRVTN 360

Query: 361 KISPLFIPLLSKHDNDLERVDVDIEKAPRNTDIRQTMQHNVVATNGVCNLFEDIEDGEVI 420
             S L  PLL+K+D  L  V VD+EK  R   +++T+Q N V  N + +  EDIEDGEVI
Sbjct: 361 GNSQLTTPLLTKYDTKLNSVAVDVEKPSRPITVKKTLQENGVTANTLHHFTEDIEDGEVI 420

Query: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAAAVAASSVARAPSTRRL 474
           GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAA  AASSVARAPS+RRL
Sbjct: 421 GIITLEDVFEELLQEEIVDETDVYVDVHKRIRVAAAA--AASSVARAPSSRRL 464

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Y1327_ARATH8.8e-17773.65Putative DUF21 domain-containing protein At1g03270 OS=Arabidopsis thaliana GN=CB... [more]
Y4424_ARATH1.2e-17071.04DUF21 domain-containing protein At4g14240 OS=Arabidopsis thaliana GN=CBSDUF1 PE=... [more]
Y4423_ARATH2.8e-16270.31DUF21 domain-containing protein At4g14230 OS=Arabidopsis thaliana GN=CBSDUF2 PE=... [more]
Y5279_ARATH8.5e-10349.65DUF21 domain-containing protein At5g52790 OS=Arabidopsis thaliana GN=CBSDUF5 PE=... [more]
Y2452_ARATH1.1e-10251.05DUF21 domain-containing protein At2g14520 OS=Arabidopsis thaliana GN=CBSDUF3 PE=... [more]
Match NameE-valueIdentityDescription
W9QKK0_9ROSA5.9e-19680.13Putative DUF21 domain-containing protein OS=Morus notabilis GN=L484_025039 PE=4 ... [more]
B9T4A2_RICCO6.6e-19580.17Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0176310 PE=4 SV=1[more]
A0A061DQY0_THECC7.3e-19478.22CBS domain-containing protein with a domain of Uncharacterized protein function ... [more]
F6HQ68_VITVI3.0e-19278.11Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0104g00300 PE=4 SV=... [more]
A0A067JCB9_JATCU3.4e-19180.00Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21962 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G03270.15.0e-17873.65 CBS domain-containing protein with a domain of unknown function (DUF... [more]
AT4G14240.17.0e-17271.04 CBS domain-containing protein with a domain of unknown function (DUF... [more]
AT4G14230.11.6e-16370.31 CBS domain-containing protein with a domain of unknown function (DUF... [more]
AT5G52790.14.8e-10449.65 CBS domain-containing protein with a domain of unknown function (DUF... [more]
AT2G14520.16.2e-10451.05 CBS domain-containing protein with a domain of unknown function (DUF... [more]
Match NameE-valueIdentityDescription
gi|703076596|ref|XP_010090374.1|8.5e-19680.13Putative DUF21 domain-containing protein [Morus notabilis][more]
gi|223527135|gb|EEF29310.1|9.4e-19580.17conserved hypothetical protein [Ricinus communis][more]
gi|1000939687|ref|XP_015583232.1|9.4e-19580.17PREDICTED: uncharacterized protein LOC8266776 [Ricinus communis][more]
gi|1009147752|ref|XP_015891576.1|3.6e-19479.92PREDICTED: DUF21 domain-containing protein At4g14240 [Ziziphus jujuba][more]
gi|590717685|ref|XP_007050665.1|1.0e-19378.22CBS domain-containing protein with a domain of Uncharacterized protein function ... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR000644CBS_dom
IPR002550CNNM
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh19G000510.1CmoCh19G000510.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000644CBS domainPROFILEPS51371CBScoord: 370..440
score: 7.207coord: 239..300
score: 8.691coord: 304..365
score: 6
IPR002550Domain of unknown function DUF21PFAMPF01595DUF21coord: 48..219
score: 4.7
NoneNo IPR availableGENE3DG3DSA:3.10.580.10coord: 222..335
score: 5.
NoneNo IPR availablePANTHERPTHR12064ANCIENT CONSERVED DOMAIN PROTEIN-RELATEDcoord: 26..340
score: 1.1E-293coord: 410..473
score: 1.1E
NoneNo IPR availablePANTHERPTHR12064:SF32SUBFAMILY NOT NAMEDcoord: 26..340
score: 1.1E-293coord: 410..473
score: 1.1E
NoneNo IPR availableunknownSSF54631CBS-domain paircoord: 224..337
score: 1.08E-18coord: 415..434
score: 1.08