CmaCh01G020540 (gene) Cucurbita maxima (Rimu)

NameCmaCh01G020540
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionDomain of Uncharacterized protein function (DUF23)
LocationCma_Chr01 : 13045384 .. 13049037 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
GGAAAAAAAAAAAAAAGAAAAAGAAAAAGAAAAAAGAGCAATGAATACACTAAATATGATATGATATGCGCATATGGCCACTAATCTCATCGCACCGACAGACANAGCAAAAGAGACTGGCGGAGGCCAAGGACTGAAGTTCCCCTTTCTGCCTTCATGAGGAAAGACGGTCCGGCTACCTCCATCGGCGACGCCCAAGTCGGCAAACTCTTCACCTGCTTCGAGACCAAGCCTCTGGTCGCCACATGTCTCGCCCTCACTCTTGTCATGTTCCTCTGGAATCTTCCACCTTACTACCAGAATCTCCTCTTCACCACCGCCCGCTCCTGCTCCGCCCCCACCAACACCGCCGACGCCGCCCTCGCCAATCTCACCATTCCTCCCAATACCTCTCTCCCCTTTACTGCATCCGCCGCTGCCAAAAAGTACTCCACCACCCTCCCAACCGATCCCAACAACCGAATTTTCCAGCCATTTGGTAATGCTGCGGCTCTATTCGTTTTAATGGGAGCCTATAGAGGCGGACCACGCACTTTTGCCATCGTCGGCCTTGCTTCCAAACCCATTCACGTCTACGGCCATCCTTGGTACAAATGCGAGTGGATCTTCAACAATGGATCTTCCATTAGGGCCAAGGCCTACAAGATTCTTCCCGATTGGGGCTATGGCCGTGTCTACACCGTCGTCGTCGTCAATTGCACTTTTCCCCTCAATCCCAATCACGACAATTCGGGTGGAAAGCTTACGGTTAATGCCTACTACGGTCAGTCTCAGAAAAAGTACGAAAAGTTCACTGCTCTTGAGGAATTGCCAGGCTCTTACAACGCATCCAAGTTCCGTCCTCCTTATGACTACGAATATTTGTACTGCGGTTCGTCTCTCTACGGGAACCTCAGTGCCGCCAGGATCAGAGAATGGATGGCATACCATGCCTGGTTCTTCGGGTCTAAATCCCATTTTGTGTTTCACGACGCCGGCGGGGTGTCCCCAGAGGTCAGGGCTGTCCTCGAGCCGTGGGTGCGAGCTGGAAGGGTCACAATTCAGGACATCAGAGCACAGTCAGAGTACGATGGGTATTACTATAATCAGTTTCTGGTAGTGAACGACTGCCTCCACCGGTACCGACACGCGGCAAACTGGACATTTTATTTCGATGTGGACGAGTACATATATTTGCCCGAGGGGAGTAGCTTGGAGTCCGTGTTGGAAGAGTTTTCCGCATTTACTCAATTTACAATCGAGCAGAACCCAATGTCCAGTATGCTGTGTTTGAACGATTCCGCTCAAAATTACTCCAGGTACGTGTTCGATCTAATTATGTTTGGAAGATACTTGTTGGATCTGAGGTCAGGTAGGATGTAGATAGGGATGTATGAATTGATATATATATATTTGATCCCCTTGTCAGGAAATGGGGGTTTGAAAAGCTGCTGTTTAAAGACATAAAGTCTGGAATCTGGAGAGACCGGAAGTACGCGATACAAGCCAAGAACGCGTATGCTACGGGGGTACACATGTCGGAAAATGTGATTGGAAATACAACGCACAAAACAGAGTCCAAGATTAGATACTATCATTACCACAACTCCATCATGGTGAGAGGGGAACTGTGCAGGGAATTCCTCCCCAACTCCGCCATTCATAACGTCACAATCTTCAACCAAACCCCTTTTGTGTACGACGACAAGATGAAGAAGCTTGCTGACACCATTAAGGAGTTCGAGCGCCACGCCATCGGTACTGCTAATGCGAAGCTCTTCTCCTGATGTGATTACAGCTCGAGAACATCCATCCGCTTTCATCATCTTCCAAGTTGCAAATTTGGGTGTAGGAATGTGTGAATTGTAAGTAGAAGTCGTGAATGTATGTGGTAGACAGAATTGGGGGCAAATCCGATTGGAGTGGCTGGAGTTTCTGAATGTGGATGTAATGATGGGGGAGGGGAATTATATTTGCCCAAACTTTACTCCTCAGTTACTAACTTACTCCTCCTCCGTTACTAATCTACCCTGGTACTGGTGGTGGTAAAAGAACAGTGACTTTGAATCCAGTGATAGAAAGGGATACATAGTTGCTTTAAGATTGTAATGAGTAGATTAAGGGACTTTTTTTTTTCTTTTCATTCTTTCTACTTTGAGCTCTAGCCTTGCAATTTTAAGGGACTTTTTTCTTTTTTCTTTTTTCTTTTCTTTTCGTGTATTATGAATATGATTATGAATGGGTTGAGACAGAGAAAATTGGTGAGGGGTAGTTTGTGCCTTCCTCTGCTTCCGCTTCTTTGGAGCGTGGAAGTGGAAGTACAGGGAAAGAGAACTGTGTCCTTACCGTTGCTACTTACTATGCGTATGTAAATCAACTCCACAGTAACAAAAGTGGATTCTTTGATATTTGCTTTCTGAGGACCTAAAGTAATAAAAGGCAATAATTATATATAATTACCGTTGTGTCCTTTGAATTGTTGTAATTAATTATGGTCTTTTATCTTCTTATGTGACAATTGTTATGGAAATTACTTTATTTAGAAACTTAAAAAACATCAGACAAGAAGAAGGTAGTGTGTTTTTAAATAAATAAATAAGGTCATAAATAAGAATGATGTTGATGGATTATGCATTGATTAGTAAGGGCATTTTTCAATTTGTGTGCTCGGGGAACAAGATTGTGGTCATCTTTTATTTTTATTTTTGTGTGGGTGTGTGTTGGTTTGGCGTGTCCGTTGACTACGTCGGGTTTCTTTTTGATTGGTATTAGCATTAAATGGAAAGGCCGCGTTGAAGGTAGTTTTATGACGTAGACGATGACCAATTTTCTGTTTCCTCATCGCTTACCTCGCACCAGCATCACATTTCAATAAATGATCCATCCCCTCCTGGTTATACCAATATCCTTCTCCTTCTCCTTCTCCTTCTCCTTCTCCTCCTCGGCTCCTTCTCCTTCTCCTTCTCCTTCTCCTTCTCCTTCTCCTCCTCGGCTCCTCCTCCCATTCGCAATTTCTCATTTCTGACTGCCCATAATCCCACAAAATTCCGTGTCTGAAACGTAGACAGACATCAGCAACAGCTAGGCTTGATACTGATGGGTGCCTGTTTATCCGACTGCCTCAATCATCCCAAACCCTCTTCTGTTTCTCCACCTCCTCCCACCGCCAAAGTGATCTCTTTGCAAGGCCATCTCCGCGAATACCCTGTTCCCATCTCCGTCTCCCGCGTTCTCCAGACCGAAAACTCCTCTTCTTCACTTTCCGACTCCTTTCTTTGCAACTCCGACCGCTTATACTACGATGACTTCATCCCCCCTTTGCCACTCGATGAACAGCTTCTCCCTAATCAGATCTATTTCCTCCTTCCTTCCTCCAACCTCCACCACCGATTGAGCGCCTCCCAAATGGCTGCCTTGGCTGTCAAAGCCAGCCTCGCCCTCCAAAATGCCTCCCCCAACGATCGCCGTAAAAAGGGTCGTGTTTCTCCCCTCCTCAACCTCTCGGATTCCGACCACATCATCTCCAAGGAGCCCTCCAAAAAGAACGCCGCCGCGGATACCTCTGCCTCACCCTCCGTTAGAAAATTGCAGAGATTGACGTCCAGAAGAGCAAAAATGGCTGTTCGTTCCTTTAAACTCAAATTGAGCACCATCTATGAAGGCGCCGTTCTGTAG

mRNA sequence

GGAAAAAAAAAAAAAAGAAAAAGAAAAAGAAAAAAGAGCAATGAATACACTAAATATGATATGATATGCGCATATGGCCACTAATCTCATCGCACCGACAGACANAGCAAAAGAGACTGGCGGAGGCCAAGGACTGAAGTTCCCCTTTCTGCCTTCATGAGGAAAGACGGTCCGGCTACCTCCATCGGCGACGCCCAAGTCGGCAAACTCTTCACCTGCTTCGAGACCAAGCCTCTGGTCGCCACATGTCTCGCCCTCACTCTTGTCATGTTCCTCTGGAATCTTCCACCTTACTACCAGAATCTCCTCTTCACCACCGCCCGCTCCTGCTCCGCCCCCACCAACACCGCCGACGCCGCCCTCGCCAATCTCACCATTCCTCCCAATACCTCTCTCCCCTTTACTGCATCCGCCGCTGCCAAAAAGTACTCCACCACCCTCCCAACCGATCCCAACAACCGAATTTTCCAGCCATTTGGTAATGCTGCGGCTCTATTCGTTTTAATGGGAGCCTATAGAGGCGGACCACGCACTTTTGCCATCGTCGGCCTTGCTTCCAAACCCATTCACGTCTACGGCCATCCTTGGTACAAATGCGAGTGGATCTTCAACAATGGATCTTCCATTAGGGCCAAGGCCTACAAGATTCTTCCCGATTGGGGCTATGGCCGTGTCTACACCGTCGTCGTCGTCAATTGCACTTTTCCCCTCAATCCCAATCACGACAATTCGGGTGGAAAGCTTACGGTTAATGCCTACTACGGTCAGTCTCAGAAAAAGTACGAAAAGTTCACTGCTCTTGAGGAATTGCCAGGCTCTTACAACGCATCCAAGTTCCGTCCTCCTTATGACTACGAATATTTGTACTGCGGTTCGTCTCTCTACGGGAACCTCAGTGCCGCCAGGATCAGAGAATGGATGGCATACCATGCCTGGTTCTTCGGGTCTAAATCCCATTTTGTGTTTCACGACGCCGGCGGGGTGTCCCCAGAGGTCAGGGCTGTCCTCGAGCCGTGGGTGCGAGCTGGAAGGGTCACAATTCAGGACATCAGAGCACAGTCAGAGTACGATGGGTATTACTATAATCAGTTTCTGGTAGTGAACGACTGCCTCCACCGGTACCGACACGCGGCAAACTGGACATTTTATTTCGATGTGGACGAGTACATATATTTGCCCGAGGGGAGTAGCTTGGAGTCCGTGTTGGAAGAGTTTTCCGCATTTACTCAATTTACAATCGAGCAGAACCCAATGTCCAGTATGCTGTGTTTGAACGATTCCGCTCAAAATTACTCCAGGAAATGGGGGTTTGAAAAGCTGCTGTTTAAAGACATAAAGTCTGGAATCTGGAGAGACCGGAAGTACGCGATACAAGCCAAGAACGCGTATGCTACGGGGGTACACATGTCGGAAAATGTGATTGGAAATACAACGCACAAAACAGAGTCCAAGATTAGATACTATCATTACCACAACTCCATCATGGTGAGAGGGGAACTGTGCAGGGAATTCCTCCCCAACTCCGCCATTCATAACGTCACAATCTTCAACCAAACCCCTTTTGTGTACGACGACAAGATGAAGAAGCTTGCTGACACCATTAAGGAGTTCGAGCGCCACGCCATCGACAGACATCAGCAACAGCTAGGCTTGATACTGATGGGTGCCTGTTTATCCGACTGCCTCAATCATCCCAAACCCTCTTCTGTTTCTCCACCTCCTCCCACCGCCAAAGTGATCTCTTTGCAAGGCCATCTCCGCGAATACCCTGTTCCCATCTCCGTCTCCCGCGTTCTCCAGACCGAAAACTCCTCTTCTTCACTTTCCGACTCCTTTCTTTGCAACTCCGACCGCTTATACTACGATGACTTCATCCCCCCTTTGCCACTCGATGAACAGCTTCTCCCTAATCAGATCTATTTCCTCCTTCCTTCCTCCAACCTCCACCACCGATTGAGCGCCTCCCAAATGGCTGCCTTGGCTGTCAAAGCCAGCCTCGCCCTCCAAAATGCCTCCCCCAACGATCGCCGTAAAAAGGGTCGTGTTTCTCCCCTCCTCAACCTCTCGGATTCCGACCACATCATCTCCAAGGAGCCCTCCAAAAAGAACGCCGCCGCGGATACCTCTGCCTCACCCTCCGTTAGAAAATTGCAGAGATTGACGTCCAGAAGAGCAAAAATGGCTGTTCGTTCCTTTAAACTCAAATTGAGCACCATCTATGAAGGCGCCGTTCTGTAG

Coding sequence (CDS)

ATGAGGAAAGACGGTCCGGCTACCTCCATCGGCGACGCCCAAGTCGGCAAACTCTTCACCTGCTTCGAGACCAAGCCTCTGGTCGCCACATGTCTCGCCCTCACTCTTGTCATGTTCCTCTGGAATCTTCCACCTTACTACCAGAATCTCCTCTTCACCACCGCCCGCTCCTGCTCCGCCCCCACCAACACCGCCGACGCCGCCCTCGCCAATCTCACCATTCCTCCCAATACCTCTCTCCCCTTTACTGCATCCGCCGCTGCCAAAAAGTACTCCACCACCCTCCCAACCGATCCCAACAACCGAATTTTCCAGCCATTTGGTAATGCTGCGGCTCTATTCGTTTTAATGGGAGCCTATAGAGGCGGACCACGCACTTTTGCCATCGTCGGCCTTGCTTCCAAACCCATTCACGTCTACGGCCATCCTTGGTACAAATGCGAGTGGATCTTCAACAATGGATCTTCCATTAGGGCCAAGGCCTACAAGATTCTTCCCGATTGGGGCTATGGCCGTGTCTACACCGTCGTCGTCGTCAATTGCACTTTTCCCCTCAATCCCAATCACGACAATTCGGGTGGAAAGCTTACGGTTAATGCCTACTACGGTCAGTCTCAGAAAAAGTACGAAAAGTTCACTGCTCTTGAGGAATTGCCAGGCTCTTACAACGCATCCAAGTTCCGTCCTCCTTATGACTACGAATATTTGTACTGCGGTTCGTCTCTCTACGGGAACCTCAGTGCCGCCAGGATCAGAGAATGGATGGCATACCATGCCTGGTTCTTCGGGTCTAAATCCCATTTTGTGTTTCACGACGCCGGCGGGGTGTCCCCAGAGGTCAGGGCTGTCCTCGAGCCGTGGGTGCGAGCTGGAAGGGTCACAATTCAGGACATCAGAGCACAGTCAGAGTACGATGGGTATTACTATAATCAGTTTCTGGTAGTGAACGACTGCCTCCACCGGTACCGACACGCGGCAAACTGGACATTTTATTTCGATGTGGACGAGTACATATATTTGCCCGAGGGGAGTAGCTTGGAGTCCGTGTTGGAAGAGTTTTCCGCATTTACTCAATTTACAATCGAGCAGAACCCAATGTCCAGTATGCTGTGTTTGAACGATTCCGCTCAAAATTACTCCAGGAAATGGGGGTTTGAAAAGCTGCTGTTTAAAGACATAAAGTCTGGAATCTGGAGAGACCGGAAGTACGCGATACAAGCCAAGAACGCGTATGCTACGGGGGTACACATGTCGGAAAATGTGATTGGAAATACAACGCACAAAACAGAGTCCAAGATTAGATACTATCATTACCACAACTCCATCATGGTGAGAGGGGAACTGTGCAGGGAATTCCTCCCCAACTCCGCCATTCATAACGTCACAATCTTCAACCAAACCCCTTTTGTGTACGACGACAAGATGAAGAAGCTTGCTGACACCATTAAGGAGTTCGAGCGCCACGCCATCGACAGACATCAGCAACAGCTAGGCTTGATACTGATGGGTGCCTGTTTATCCGACTGCCTCAATCATCCCAAACCCTCTTCTGTTTCTCCACCTCCTCCCACCGCCAAAGTGATCTCTTTGCAAGGCCATCTCCGCGAATACCCTGTTCCCATCTCCGTCTCCCGCGTTCTCCAGACCGAAAACTCCTCTTCTTCACTTTCCGACTCCTTTCTTTGCAACTCCGACCGCTTATACTACGATGACTTCATCCCCCCTTTGCCACTCGATGAACAGCTTCTCCCTAATCAGATCTATTTCCTCCTTCCTTCCTCCAACCTCCACCACCGATTGAGCGCCTCCCAAATGGCTGCCTTGGCTGTCAAAGCCAGCCTCGCCCTCCAAAATGCCTCCCCCAACGATCGCCGTAAAAAGGGTCGTGTTTCTCCCCTCCTCAACCTCTCGGATTCCGACCACATCATCTCCAAGGAGCCCTCCAAAAAGAACGCCGCCGCGGATACCTCTGCCTCACCCTCCGTTAGAAAATTGCAGAGATTGACGTCCAGAAGAGCAAAAATGGCTGTTCGTTCCTTTAAACTCAAATTGAGCACCATCTATGAAGGCGCCGTTCTGTAG

Protein sequence

MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSAPTNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAYRGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVNCTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGSSLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRAQSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFTIEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSENVIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLADTIKEFERHAIDRHQQQLGLILMGACLSDCLNHPKPSSVSPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSLSDSFLCNSDRLYYDDFIPPLPLDEQLLPNQIYFLLPSSNLHHRLSASQMAALAVKASLALQNASPNDRRKKGRVSPLLNLSDSDHIISKEPSKKNAAADTSASPSVRKLQRLTSRRAKMAVRSFKLKLSTIYEGAVL
BLAST of CmaCh01G020540 vs. Swiss-Prot
Match: GALS1_ARATH (Galactan beta-1,4-galactosyltransferase GALS1 OS=Arabidopsis thaliana GN=GALS1 PE=2 SV=2)

HSP 1 Score: 739.6 bits (1908), Expect = 3.2e-212
Identity = 349/472 (73.94%), Postives = 404/472 (85.59%), Query Frame = 1

Query: 21  CFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSAPTNTADAAL---ANLTIPPN 80
           CFE KP++AT LAL+LVM +WNLPPYY NL+ +TAR CSA T T    L   +N T   N
Sbjct: 16  CFEKKPIIATLLALSLVMIVWNLPPYYHNLI-STARPCSAVTTTTTTTLLSSSNFTSAEN 75

Query: 81  --TSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAYRGGPRTFAIVGLASK 140
             TSL  T +AA++KY +T P+DPN R+FQPFGNAAALFVLMGAYRGGP TF+++GLASK
Sbjct: 76  FTTSLSTTTAAASQKYDST-PSDPNKRVFQPFGNAAALFVLMGAYRGGPTTFSVIGLASK 135

Query: 141 PIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVNCTFPLNPNHDNSGGK 200
           PIHVYG PWYKCEWI NNG+SIRAKA KILPDWGYGRVYTVVVVNCTF  NPN DN+GGK
Sbjct: 136 PIHVYGKPWYKCEWISNNGTSIRAKAQKILPDWGYGRVYTVVVVNCTFNSNPNSDNTGGK 195

Query: 201 LTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGSSLYGNLSAARIREWM 260
           L +NAYY +S K +E+FT LEE  G Y+ SK+ PPY Y+YLYCGSSLYGN+SA+R+REWM
Sbjct: 196 LILNAYYNESPKLFERFTTLEESAGIYDESKYSPPYQYDYLYCGSSLYGNVSASRMREWM 255

Query: 261 AYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRAQSEYDGYYYNQFLVV 320
           AYHAWFFG KSHFVFHDAGGVSPEVR VLEPW+RAGRVT+Q+IR QS+YDGYYYNQFL+V
Sbjct: 256 AYHAWFFGDKSHFVFHDAGGVSPEVRKVLEPWIRAGRVTVQNIRDQSQYDGYYYNQFLIV 315

Query: 321 NDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFTIEQNPMSSMLCLNDS 380
           NDCLHRYR+AANWTF+FDVDEYIYLP G++LESVL+EFS  TQFTIEQNPMSS+LC+NDS
Sbjct: 316 NDCLHRYRYAANWTFFFDVDEYIYLPHGNTLESVLDEFSVNTQFTIEQNPMSSVLCINDS 375

Query: 381 AQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSENVIGNTTHKTESKIRY 440
           +Q+Y R+WGFEKLLFKD ++ I RDRKYAIQAKNA+ATGVHMSEN++G T HKTE+KIRY
Sbjct: 376 SQDYPRQWGFEKLLFKDSRTKIRRDRKYAIQAKNAFATGVHMSENIVGKTLHKTETKIRY 435

Query: 441 YHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLADTIKEFER 488
           YHYHN+I V  ELCRE LPNSA   VT++N+ P+VYDD MKKL  TIKEFE+
Sbjct: 436 YHYHNTITVHEELCREMLPNSAKKKVTLYNKLPYVYDDNMKKLVKTIKEFEQ 485

BLAST of CmaCh01G020540 vs. Swiss-Prot
Match: GALS2_ARATH (Galactan beta-1,4-galactosyltransferase GALS2 OS=Arabidopsis thaliana GN=GALS2 PE=2 SV=1)

HSP 1 Score: 424.1 bits (1089), Expect = 3.0e-117
Identity = 212/399 (53.13%), Postives = 262/399 (65.66%), Query Frame = 1

Query: 102 RIFQPFGNAAALFVLMGAYRGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIR--A 161
           R F  +G AA  FVLM AYRGG  TFA++GL+SKP+HVY HP Y+CEWI  N S  R   
Sbjct: 116 RTFTGYGWAAYNFVLMNAYRGGVNTFAVIGLSSKPLHVYSHPTYRCEWIPLNQSDNRILT 175

Query: 162 KAYKILPDWGYGRVYTVVVVNCTFPLNP--NHDNSGGKLTVNAYYGQSQKKY-EKFTALE 221
              KIL DWGYGRVYT VVVNCTFP N   N  N+GG L ++A  G + +   +    L 
Sbjct: 176 DGTKILTDWGYGRVYTTVVVNCTFPSNTVINPKNTGGTLLLHATTGDTDRNITDSIPVLT 235

Query: 222 ELPGSYN----ASKFRPPYDYEYLYCGSSLYGNLSAARIREWMAYHAWFFGSKSHFVFHD 281
           E P + +     S  R    Y+YLYCGSSLYGNLS  RIREW+AYH  FFG +SHFV HD
Sbjct: 236 ETPNTVDFALYESNLRRREKYDYLYCGSSLYGNLSPQRIREWIAYHVRFFGERSHFVLHD 295

Query: 282 AGGVSPEVRAVLEPWVRAGRVTIQDIRAQSEYDGYYYNQFLVVNDCLHRYRHAANWTFYF 341
           AGG++ EV  VL+PW+  GRVT+ DIR Q  +DGYY+NQF+VVNDCLHRYR  A W F+F
Sbjct: 296 AGGITEEVFEVLKPWIELGRVTVHDIREQERFDGYYHNQFMVVNDCLHRYRFMAKWMFFF 355

Query: 342 DVDEYIYLPEGSSLESVLEEFSAFTQFTIEQNPMSSMLCLN-DSAQNYSRKWGFEKLLFK 401
           DVDE+IY+P  SS+ SV+     ++QFTIEQ PMSS LC + D      RKWGFEKL ++
Sbjct: 356 DVDEFIYVPAKSSISSVMVSLEEYSQFTIEQMPMSSQLCYDGDGPARTYRKWGFEKLAYR 415

Query: 402 DIKSGIWRDRKYAIQAKNAYATGVHMSENVIGNTTHKTESKIRYYHYHNSIMVRGELCRE 461
           D+K    RDRKYA+Q +N +ATGVHMS+++ G T H+ E KIRY+HYH SI  R E CR 
Sbjct: 416 DVKKVPRRDRKYAVQPRNVFATGVHMSQHLQGKTYHRAEGKIRYFHYHGSISQRREPCRH 475

Query: 462 FLPNSAIHNVTIFNQTPFVYDDKMKKLADTIKEFERHAI 491
               + I    +    P+V D  M+ +   +K FE   I
Sbjct: 476 LYNGTRI----VHENNPYVLDTTMRDIGLAVKTFEIRTI 510

BLAST of CmaCh01G020540 vs. Swiss-Prot
Match: GALS3_ARATH (Galactan beta-1,4-galactosyltransferase GALS3 OS=Arabidopsis thaliana GN=GALS3 PE=2 SV=1)

HSP 1 Score: 414.1 bits (1063), Expect = 3.1e-114
Identity = 220/477 (46.12%), Postives = 293/477 (61.43%), Query Frame = 1

Query: 27  LVATCLALTLVMFLWNLPPYYQNLLFTTARSCSAPTNTADAALANLTIPPNTSLPFTASA 86
           L+  C   TL+ F+    P   +L  +  R C +  ++A       T+  ++S P    +
Sbjct: 35  LLVLCTLATLLPFI----PSSFSLSTSDFRFCISRFSSAVPLNTTTTVEESSSSP----S 94

Query: 87  AAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAYRGGPRTFAIVGLASKPIHVYGHPWYK 146
             K     L      R F  +G+AA  FV M AYRGG  +FA++GL+SKP+HVYGHP Y+
Sbjct: 95  PEKNLDRVLDNGVIKRTFTGYGSAAYNFVSMSAYRGGVNSFAVIGLSSKPLHVYGHPSYR 154

Query: 147 CEWIFNNGSS--IRAKAYKILPDWGYGRVYTVVVVNCTFP----LNPNHDNSGGKLTVNA 206
           CEW+  + +   I    +KIL DWGYGR+YT VVVNCTF     +NP   NSGG L ++A
Sbjct: 155 CEWVSLDPTQDPISTTGFKILTDWGYGRIYTTVVVNCTFSSISAVNPQ--NSGGTLILHA 214

Query: 207 YYGQSQKKY-EKFTALEELPGS-----YNASKFRPPYDYEYLYCGSSLYGNLSAARIREW 266
             G       +  + L E P S     YN++K    YDY  LYCGSSLYGNLS  R+REW
Sbjct: 215 TTGDPTLNLTDSISVLTEPPKSVDFDLYNSTKKTKKYDY--LYCGSSLYGNLSPQRVREW 274

Query: 267 MAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRAQSEYDGYYYNQFLV 326
           +AYH  FFG +SHFV HDAGG+  EV  VL+PW+  GRVT+ DIR Q  +DGYY+NQF++
Sbjct: 275 IAYHVRFFGERSHFVLHDAGGIHEEVFEVLKPWIELGRVTLHDIRDQERFDGYYHNQFMI 334

Query: 327 VNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFTIEQNPMSSMLCLN- 386
           VNDCLHRYR    W F+FDVDE++++P   ++ SV+E    ++QFTIEQ PMSS +C + 
Sbjct: 335 VNDCLHRYRFMTKWMFFFDVDEFLHVPVKETISSVMESLEEYSQFTIEQMPMSSRICYSG 394

Query: 387 DSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSENVIGNTTHKTESKI 446
           D      RKWG EKL ++D+K    RDRKYA+Q +N +ATGVHMS+N+ G T HK ESKI
Sbjct: 395 DGPARTYRKWGIEKLAYRDVKKVPRRDRKYAVQPENVFATGVHMSQNLQGKTYHKAESKI 454

Query: 447 RYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLADTIKEFERHAI 491
           RY+HYH SI  R E CR+   +S +    +F  TP+V D  +  +   ++ FE   I
Sbjct: 455 RYFHYHGSISQRREPCRQLFNDSRV----VFENTPYVLDTTICDVGLAVRTFELRTI 495

BLAST of CmaCh01G020540 vs. TrEMBL
Match: A0A067KKI4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_08012 PE=4 SV=1)

HSP 1 Score: 793.5 bits (2048), Expect = 2.1e-226
Identity = 381/492 (77.44%), Postives = 423/492 (85.98%), Query Frame = 1

Query: 1   MRKD-GPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCS 60
           MRKD  P +S      GKL  CFETKPLVAT LALTLVM LWNLPPYYQNLL TT RSCS
Sbjct: 1   MRKDCPPLSSFAGGTAGKLSLCFETKPLVATVLALTLVMLLWNLPPYYQNLLSTT-RSCS 60

Query: 61  AP-TNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMG 120
           AP  +TA    +N +  P TS   T S + +KYST + TDPN RIFQ +GNAAALFV MG
Sbjct: 61  APAASTASLIASNASSLPITSYASTTSVSEQKYSTPVVTDPNKRIFQAYGNAAALFVQMG 120

Query: 121 AYRGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVV 180
           AYRGGPRTFA+VGLASKPIHV+G PWYKCEWI NNGSS+RAKAYK+LPDWGYGRVYTVVV
Sbjct: 121 AYRGGPRTFAVVGLASKPIHVFGRPWYKCEWISNNGSSLRAKAYKMLPDWGYGRVYTVVV 180

Query: 181 VNCTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYC 240
           VNCTF +NPN DN+GGKL +NAYYG+SQ+KYEKF ALEE PGSYN SK+ PPY YEYLYC
Sbjct: 181 VNCTFSVNPNEDNAGGKLMLNAYYGESQRKYEKFVALEEAPGSYNESKYHPPYQYEYLYC 240

Query: 241 GSSLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDI 300
           GSSLYGNLSAAR+REWMAYHAWFFGS SHFVFHDAGGVSPEVRA LEPWVRAGR T+QDI
Sbjct: 241 GSSLYGNLSAARMREWMAYHAWFFGSSSHFVFHDAGGVSPEVRAALEPWVRAGRATVQDI 300

Query: 301 RAQSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQ 360
           R Q+E+DGYYYNQFLVVNDCLHRYR+AANWTFYFDVDEYIYLP G++LESVL+EFS +TQ
Sbjct: 301 RGQAEFDGYYYNQFLVVNDCLHRYRYAANWTFYFDVDEYIYLPLGNTLESVLKEFSDYTQ 360

Query: 361 FTIEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMS 420
           FTIEQNPMSS+LCLNDS ++YSR+WGFEKLLF++ ++GI RDRKYAIQAK A+ATGVHMS
Sbjct: 361 FTIEQNPMSSVLCLNDSTRDYSREWGFEKLLFRESRTGIRRDRKYAIQAKKAFATGVHMS 420

Query: 421 ENVIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKL 480
           ENV+G T HKTE KIRYYHYHNSI V GELCREFLP SA  NVT++++ P+VYDD MKKL
Sbjct: 421 ENVVGKTLHKTEDKIRYYHYHNSITVPGELCREFLPPSAKKNVTLYDKLPYVYDDNMKKL 480

Query: 481 ADTIKEFERHAI 491
           A TIKEFER  I
Sbjct: 481 AATIKEFERKTI 491

BLAST of CmaCh01G020540 vs. TrEMBL
Match: A0A061F459_THECC (Domain of Uncharacterized protein function (DUF23) OS=Theobroma cacao GN=TCM_024735 PE=4 SV=1)

HSP 1 Score: 780.8 bits (2015), Expect = 1.4e-222
Identity = 375/496 (75.60%), Postives = 421/496 (84.88%), Query Frame = 1

Query: 1   MRKDGPATSIGD--AQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSC 60
           MRK+   +S     A  GKLF CFETKPLVAT LALTLVM LWNLPPYYQNLL TT R C
Sbjct: 1   MRKEAAPSSAPSTAASFGKLFVCFETKPLVATLLALTLVMLLWNLPPYYQNLLSTT-RPC 60

Query: 61  SAPTNTADAALANLTIPPNTSLPFTASAAAKK--YST--TLPTDPNNRIFQPFGNAAALF 120
           SAP+ T+ AA     +  N SLP+TA+  A+K  YS     P DPN R+F+ +GNAAALF
Sbjct: 61  SAPSLTSAAAATTSLLATNVSLPYTATPVAEKKYYSAPKAKPRDPNKRVFEAYGNAAALF 120

Query: 121 VLMGAYRGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVY 180
           V MGAYRGGP TFA+VGLASKPIHV+G PWYKCEWI NNGSS RAKAYK+LPDWGYGRVY
Sbjct: 121 VRMGAYRGGPTTFAVVGLASKPIHVFGRPWYKCEWISNNGSSYRAKAYKMLPDWGYGRVY 180

Query: 181 TVVVVNCTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYE 240
           TV+VVNCTFP NPN DN GGKL +NAYYG+SQ+KYEKFTALEE PGSYN SK+  P+ YE
Sbjct: 181 TVLVVNCTFPFNPNQDNLGGKLMINAYYGESQRKYEKFTALEEAPGSYNESKYHSPFQYE 240

Query: 241 YLYCGSSLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVT 300
           YLYCGSSLYGNLSA R+REWMAYHAWFFG  SHFVFHDAGGV+PEVRA L+PWVRAGR T
Sbjct: 241 YLYCGSSLYGNLSADRMREWMAYHAWFFGPNSHFVFHDAGGVTPEVRAALDPWVRAGRAT 300

Query: 301 IQDIRAQSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFS 360
           +QDIR Q+E+DGYYYNQFLVVNDCLHRYRHAANWTF+FDVDEYIYLP+G++LESVL EFS
Sbjct: 301 MQDIRDQAEFDGYYYNQFLVVNDCLHRYRHAANWTFFFDVDEYIYLPDGNTLESVLNEFS 360

Query: 361 AFTQFTIEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATG 420
            +TQFTIEQNPMSS+LCLN+S++ YSR+WGFEKLLF++ ++GI RDRKYAIQAKNAYATG
Sbjct: 361 DYTQFTIEQNPMSSVLCLNNSSEQYSRQWGFEKLLFRESRTGIRRDRKYAIQAKNAYATG 420

Query: 421 VHMSENVIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDK 480
           VHMSENVIG T HKTE+KIRYYHYHN+I V  ELCREFLP SA +NVT FN+ P+VYDD 
Sbjct: 421 VHMSENVIGKTLHKTETKIRYYHYHNTITVHQELCREFLPLSAKNNVTWFNKLPYVYDDN 480

Query: 481 MKKLADTIKEFERHAI 491
           MKKLA+TIKEFER  I
Sbjct: 481 MKKLANTIKEFERKTI 495

BLAST of CmaCh01G020540 vs. TrEMBL
Match: B9R8V6_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1602470 PE=4 SV=1)

HSP 1 Score: 773.9 bits (1997), Expect = 1.7e-220
Identity = 373/495 (75.35%), Postives = 420/495 (84.85%), Query Frame = 1

Query: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60
           MRKD P  S      GKL +C ETKPLVAT LALTLVM LWNLPPYYQNLL TT R CSA
Sbjct: 1   MRKDCPPLS--SVTGGKLPSCLETKPLVATLLALTLVMLLWNLPPYYQNLLSTT-RPCSA 60

Query: 61  PTNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY 120
           P   A  A +N +     SLP T S + +KYST + +DPN RIF+ +GNAAALFV MGAY
Sbjct: 61  PAAAAALAASNAS-----SLPIT-SVSEQKYSTGV-SDPNKRIFEAYGNAAALFVKMGAY 120

Query: 121 RGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVN 180
           RGGPRTFA+VGLASKPIHV+G PWYKCEWI NNGSS+RAKAYK+LPDWGYGRVYTVVVVN
Sbjct: 121 RGGPRTFAVVGLASKPIHVFGRPWYKCEWISNNGSSMRAKAYKMLPDWGYGRVYTVVVVN 180

Query: 181 CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS 240
           CTFPLNPN +N+GGKL +NAYYG+S +KYEK  ALEE PGSYN S + PPY YEYLYCGS
Sbjct: 181 CTFPLNPNRENAGGKLILNAYYGESPRKYEKIVALEEAPGSYNDSNYHPPYQYEYLYCGS 240

Query: 241 SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA 300
           SLYGNLSAAR+REWMAYHAWFFG  SHFVFHDAGGVSP+VRA LEPWVRAGR T+QDIR 
Sbjct: 241 SLYGNLSAARMREWMAYHAWFFGPSSHFVFHDAGGVSPQVRAALEPWVRAGRATVQDIRG 300

Query: 301 QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFT 360
           Q+E+DGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLP+GS+L+SVL EFS +TQFT
Sbjct: 301 QAEFDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPDGSTLQSVLAEFSDYTQFT 360

Query: 361 IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN 420
           IEQNPMSS+LCLNDS+ +Y R+WGFEKLLF++ +SGI RDRKYAIQAKNA+ATGVHMSEN
Sbjct: 361 IEQNPMSSVLCLNDSSHDYPREWGFEKLLFRESRSGIRRDRKYAIQAKNAFATGVHMSEN 420

Query: 421 VIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLAD 480
           V+G T HKTE+KIRYYHYHNSI V+GELCR+ LP SA HNVT +N+ P+VYDD MKKL  
Sbjct: 421 VVGKTLHKTETKIRYYHYHNSITVQGELCRQLLPASAKHNVTWYNKLPYVYDDNMKKLVT 480

Query: 481 TIKEFERHAIDRHQQ 496
           TI++FER+ I   +Q
Sbjct: 481 TIRDFERNTIGNVRQ 485

BLAST of CmaCh01G020540 vs. TrEMBL
Match: A0A0D2TS76_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G359700 PE=4 SV=1)

HSP 1 Score: 772.3 bits (1993), Expect = 4.9e-220
Identity = 370/508 (72.83%), Postives = 421/508 (82.87%), Query Frame = 1

Query: 1   MRKDG-----PATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTA 60
           MRK+      P+ + G   +GKLF CFETK LV T LALTLVMFLWNLPPYYQNLL TT 
Sbjct: 1   MRKEAAPSSVPSAAAGTTTLGKLFICFETKTLVTTLLALTLVMFLWNLPPYYQNLLSTT- 60

Query: 61  RSCSAPTNTAD--AALANLT---IPPNTSLPFTASAAAKKYSTTL---PTDPNNRIFQPF 120
           R CS P  +    A+ A++T   I  N S+P+ A+  AKKY+T     P DPN R+F+ +
Sbjct: 61  RPCSVPITSVSVSASAASVTASLISTNVSMPYKANPVAKKYNTATRPKPKDPNKRVFESY 120

Query: 121 GNAAALFVLMGAYRGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPD 180
           GNAAALFV MGAYRGGP TFA+VGLASKPIHVYG PW+KCEWI NNGSS RAKAYK+LPD
Sbjct: 121 GNAAALFVQMGAYRGGPTTFAVVGLASKPIHVYGKPWFKCEWISNNGSSYRAKAYKMLPD 180

Query: 181 WGYGRVYTVVVVNCTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKF 240
           WGYGRVYTVVVVNCTFP NPN DN+GGKL VNAYYG+SQ+KYEKF ALEE PGSYN SKF
Sbjct: 181 WGYGRVYTVVVVNCTFPFNPNQDNNGGKLMVNAYYGESQRKYEKFMALEESPGSYNESKF 240

Query: 241 RPPYDYEYLYCGSSLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPW 300
            PPY YEYLYCGSSLYGNLSA RIREWMAYHAWFFG  SHFVFHDAGGVSPEV AVLEPW
Sbjct: 241 NPPYQYEYLYCGSSLYGNLSADRIREWMAYHAWFFGPSSHFVFHDAGGVSPEVMAVLEPW 300

Query: 301 VRAGRVTIQDIRAQSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLE 360
           V+AGRVT+QDIR Q+EYDGYYYNQFLVVNDCLHRYR+AANWTF+FDVDEYIYLP G++LE
Sbjct: 301 VKAGRVTVQDIRDQAEYDGYYYNQFLVVNDCLHRYRYAANWTFFFDVDEYIYLPHGNTLE 360

Query: 361 SVLEEFSAFTQFTIEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQA 420
           SVL EFS +TQFTI+QNPMSSMLCLN S+Q YSR+WGFEKLLF++ ++ I RDRKYAIQA
Sbjct: 361 SVLNEFSGYTQFTIQQNPMSSMLCLNGSSQEYSRQWGFEKLLFRESRTKIRRDRKYAIQA 420

Query: 421 KNAYATGVHMSENVIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQT 480
           KNA+ATGVHMSEN++G + HKTE+KI YYHYHN+I    ELCRE+LP++A+  VT FN+ 
Sbjct: 421 KNAFATGVHMSENIVGKSLHKTETKIHYYHYHNTITQHQELCREYLPSTAVKQVTWFNKL 480

Query: 481 PFVYDDKMKKLADTIKEFERHAIDRHQQ 496
           P+VYDD MKKLA+TIK+FE   I +  Q
Sbjct: 481 PYVYDDNMKKLANTIKQFELETIGKQPQ 507

BLAST of CmaCh01G020540 vs. TrEMBL
Match: B9H5Y4_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s28030g PE=4 SV=1)

HSP 1 Score: 771.2 bits (1990), Expect = 1.1e-219
Identity = 377/501 (75.25%), Postives = 420/501 (83.83%), Query Frame = 1

Query: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTAR-SCS 60
           MRKD    S G A V KL  CFETKPLVAT LALTLVM LWNLPPYYQNLL TT R SCS
Sbjct: 1   MRKDCQPPSSG-ATVSKLPACFETKPLVATLLALTLVMLLWNLPPYYQNLLSTTTRPSCS 60

Query: 61  APTNTADAALANLTIPPNTSLPFTASAAA-KKY-----STTLPTDPNNRIFQPFGNAAAL 120
           AP  T   +        NTS  FT+++ + +KY     S++   DPN RIFQ +GNAAAL
Sbjct: 61  APETTVSIS--------NTSSSFTSTSLSDQKYLSSSSSSSSNADPNKRIFQAYGNAAAL 120

Query: 121 FVLMGAYRGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRV 180
           FV MGAYRGGP TFA+VGLASKPIHV+  PWYKCEWI NNGSSIRAKAYK+LPDWGYGRV
Sbjct: 121 FVQMGAYRGGPTTFAVVGLASKPIHVFRLPWYKCEWISNNGSSIRAKAYKMLPDWGYGRV 180

Query: 181 YTVVVVNCTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDY 240
           YTVVVVNCTFP+NPN DN+GG+L +NAYY +SQ+KYEKF ALEELPGSYN SKFRPPY Y
Sbjct: 181 YTVVVVNCTFPVNPNQDNAGGRLMLNAYYDESQRKYEKFMALEELPGSYNESKFRPPYQY 240

Query: 241 EYLYCGSSLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRV 300
           EYLYCGSSLYGNLSA+R REWMAYHAWFFG  SHFVFHDAGGVSPEVRA L+PWVRAGR 
Sbjct: 241 EYLYCGSSLYGNLSASRFREWMAYHAWFFGPSSHFVFHDAGGVSPEVRAALDPWVRAGRA 300

Query: 301 TIQDIRAQSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEF 360
           T+QDIR Q+E+DGYYYNQFLVVNDCLHRYR++ANWTFYFDVDEYIYLPEG++LESVL++F
Sbjct: 301 TVQDIRGQAEFDGYYYNQFLVVNDCLHRYRYSANWTFYFDVDEYIYLPEGNTLESVLKDF 360

Query: 361 SAFTQFTIEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYAT 420
           S +TQFTIEQNPMSS LC NDS Q+Y R+WGFEKLLF++ ++GI RDRKYAIQAKNAYAT
Sbjct: 361 SNYTQFTIEQNPMSSALCFNDSTQDYPRQWGFEKLLFRESRTGIRRDRKYAIQAKNAYAT 420

Query: 421 GVHMSENVIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDD 480
           GVHMSENVIG T H+TE+KIRYYHYHNSI V GELCREFLP SA +NVT +N  P+VYDD
Sbjct: 421 GVHMSENVIGKTLHQTETKIRYYHYHNSIQVPGELCREFLPLSAKNNVTWYNGLPYVYDD 480

Query: 481 KMKKLADTIKEFERHAIDRHQ 495
            MKKLA TIK+FER+ I   Q
Sbjct: 481 NMKKLASTIKDFERNTIGNVQ 492

BLAST of CmaCh01G020540 vs. TAIR10
Match: AT2G33570.1 (AT2G33570.1 Domain of unknown function (DUF23))

HSP 1 Score: 739.6 bits (1908), Expect = 1.8e-213
Identity = 349/472 (73.94%), Postives = 404/472 (85.59%), Query Frame = 1

Query: 21  CFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSAPTNTADAAL---ANLTIPPN 80
           CFE KP++AT LAL+LVM +WNLPPYY NL+ +TAR CSA T T    L   +N T   N
Sbjct: 16  CFEKKPIIATLLALSLVMIVWNLPPYYHNLI-STARPCSAVTTTTTTTLLSSSNFTSAEN 75

Query: 81  --TSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAYRGGPRTFAIVGLASK 140
             TSL  T +AA++KY +T P+DPN R+FQPFGNAAALFVLMGAYRGGP TF+++GLASK
Sbjct: 76  FTTSLSTTTAAASQKYDST-PSDPNKRVFQPFGNAAALFVLMGAYRGGPTTFSVIGLASK 135

Query: 141 PIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVNCTFPLNPNHDNSGGK 200
           PIHVYG PWYKCEWI NNG+SIRAKA KILPDWGYGRVYTVVVVNCTF  NPN DN+GGK
Sbjct: 136 PIHVYGKPWYKCEWISNNGTSIRAKAQKILPDWGYGRVYTVVVVNCTFNSNPNSDNTGGK 195

Query: 201 LTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGSSLYGNLSAARIREWM 260
           L +NAYY +S K +E+FT LEE  G Y+ SK+ PPY Y+YLYCGSSLYGN+SA+R+REWM
Sbjct: 196 LILNAYYNESPKLFERFTTLEESAGIYDESKYSPPYQYDYLYCGSSLYGNVSASRMREWM 255

Query: 261 AYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRAQSEYDGYYYNQFLVV 320
           AYHAWFFG KSHFVFHDAGGVSPEVR VLEPW+RAGRVT+Q+IR QS+YDGYYYNQFL+V
Sbjct: 256 AYHAWFFGDKSHFVFHDAGGVSPEVRKVLEPWIRAGRVTVQNIRDQSQYDGYYYNQFLIV 315

Query: 321 NDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFTIEQNPMSSMLCLNDS 380
           NDCLHRYR+AANWTF+FDVDEYIYLP G++LESVL+EFS  TQFTIEQNPMSS+LC+NDS
Sbjct: 316 NDCLHRYRYAANWTFFFDVDEYIYLPHGNTLESVLDEFSVNTQFTIEQNPMSSVLCINDS 375

Query: 381 AQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSENVIGNTTHKTESKIRY 440
           +Q+Y R+WGFEKLLFKD ++ I RDRKYAIQAKNA+ATGVHMSEN++G T HKTE+KIRY
Sbjct: 376 SQDYPRQWGFEKLLFKDSRTKIRRDRKYAIQAKNAFATGVHMSENIVGKTLHKTETKIRY 435

Query: 441 YHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLADTIKEFER 488
           YHYHN+I V  ELCRE LPNSA   VT++N+ P+VYDD MKKL  TIKEFE+
Sbjct: 436 YHYHNTITVHEELCREMLPNSAKKKVTLYNKLPYVYDDNMKKLVKTIKEFEQ 485

BLAST of CmaCh01G020540 vs. TAIR10
Match: AT5G44670.1 (AT5G44670.1 Domain of unknown function (DUF23))

HSP 1 Score: 424.1 bits (1089), Expect = 1.7e-118
Identity = 212/399 (53.13%), Postives = 262/399 (65.66%), Query Frame = 1

Query: 102 RIFQPFGNAAALFVLMGAYRGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIR--A 161
           R F  +G AA  FVLM AYRGG  TFA++GL+SKP+HVY HP Y+CEWI  N S  R   
Sbjct: 116 RTFTGYGWAAYNFVLMNAYRGGVNTFAVIGLSSKPLHVYSHPTYRCEWIPLNQSDNRILT 175

Query: 162 KAYKILPDWGYGRVYTVVVVNCTFPLNP--NHDNSGGKLTVNAYYGQSQKKY-EKFTALE 221
              KIL DWGYGRVYT VVVNCTFP N   N  N+GG L ++A  G + +   +    L 
Sbjct: 176 DGTKILTDWGYGRVYTTVVVNCTFPSNTVINPKNTGGTLLLHATTGDTDRNITDSIPVLT 235

Query: 222 ELPGSYN----ASKFRPPYDYEYLYCGSSLYGNLSAARIREWMAYHAWFFGSKSHFVFHD 281
           E P + +     S  R    Y+YLYCGSSLYGNLS  RIREW+AYH  FFG +SHFV HD
Sbjct: 236 ETPNTVDFALYESNLRRREKYDYLYCGSSLYGNLSPQRIREWIAYHVRFFGERSHFVLHD 295

Query: 282 AGGVSPEVRAVLEPWVRAGRVTIQDIRAQSEYDGYYYNQFLVVNDCLHRYRHAANWTFYF 341
           AGG++ EV  VL+PW+  GRVT+ DIR Q  +DGYY+NQF+VVNDCLHRYR  A W F+F
Sbjct: 296 AGGITEEVFEVLKPWIELGRVTVHDIREQERFDGYYHNQFMVVNDCLHRYRFMAKWMFFF 355

Query: 342 DVDEYIYLPEGSSLESVLEEFSAFTQFTIEQNPMSSMLCLN-DSAQNYSRKWGFEKLLFK 401
           DVDE+IY+P  SS+ SV+     ++QFTIEQ PMSS LC + D      RKWGFEKL ++
Sbjct: 356 DVDEFIYVPAKSSISSVMVSLEEYSQFTIEQMPMSSQLCYDGDGPARTYRKWGFEKLAYR 415

Query: 402 DIKSGIWRDRKYAIQAKNAYATGVHMSENVIGNTTHKTESKIRYYHYHNSIMVRGELCRE 461
           D+K    RDRKYA+Q +N +ATGVHMS+++ G T H+ E KIRY+HYH SI  R E CR 
Sbjct: 416 DVKKVPRRDRKYAVQPRNVFATGVHMSQHLQGKTYHRAEGKIRYFHYHGSISQRREPCRH 475

Query: 462 FLPNSAIHNVTIFNQTPFVYDDKMKKLADTIKEFERHAI 491
               + I    +    P+V D  M+ +   +K FE   I
Sbjct: 476 LYNGTRI----VHENNPYVLDTTMRDIGLAVKTFEIRTI 510

BLAST of CmaCh01G020540 vs. TAIR10
Match: AT4G20170.1 (AT4G20170.1 Domain of unknown function (DUF23))

HSP 1 Score: 414.1 bits (1063), Expect = 1.7e-115
Identity = 220/477 (46.12%), Postives = 293/477 (61.43%), Query Frame = 1

Query: 27  LVATCLALTLVMFLWNLPPYYQNLLFTTARSCSAPTNTADAALANLTIPPNTSLPFTASA 86
           L+  C   TL+ F+    P   +L  +  R C +  ++A       T+  ++S P    +
Sbjct: 35  LLVLCTLATLLPFI----PSSFSLSTSDFRFCISRFSSAVPLNTTTTVEESSSSP----S 94

Query: 87  AAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAYRGGPRTFAIVGLASKPIHVYGHPWYK 146
             K     L      R F  +G+AA  FV M AYRGG  +FA++GL+SKP+HVYGHP Y+
Sbjct: 95  PEKNLDRVLDNGVIKRTFTGYGSAAYNFVSMSAYRGGVNSFAVIGLSSKPLHVYGHPSYR 154

Query: 147 CEWIFNNGSS--IRAKAYKILPDWGYGRVYTVVVVNCTFP----LNPNHDNSGGKLTVNA 206
           CEW+  + +   I    +KIL DWGYGR+YT VVVNCTF     +NP   NSGG L ++A
Sbjct: 155 CEWVSLDPTQDPISTTGFKILTDWGYGRIYTTVVVNCTFSSISAVNPQ--NSGGTLILHA 214

Query: 207 YYGQSQKKY-EKFTALEELPGS-----YNASKFRPPYDYEYLYCGSSLYGNLSAARIREW 266
             G       +  + L E P S     YN++K    YDY  LYCGSSLYGNLS  R+REW
Sbjct: 215 TTGDPTLNLTDSISVLTEPPKSVDFDLYNSTKKTKKYDY--LYCGSSLYGNLSPQRVREW 274

Query: 267 MAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRAQSEYDGYYYNQFLV 326
           +AYH  FFG +SHFV HDAGG+  EV  VL+PW+  GRVT+ DIR Q  +DGYY+NQF++
Sbjct: 275 IAYHVRFFGERSHFVLHDAGGIHEEVFEVLKPWIELGRVTLHDIRDQERFDGYYHNQFMI 334

Query: 327 VNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFTIEQNPMSSMLCLN- 386
           VNDCLHRYR    W F+FDVDE++++P   ++ SV+E    ++QFTIEQ PMSS +C + 
Sbjct: 335 VNDCLHRYRFMTKWMFFFDVDEFLHVPVKETISSVMESLEEYSQFTIEQMPMSSRICYSG 394

Query: 387 DSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSENVIGNTTHKTESKI 446
           D      RKWG EKL ++D+K    RDRKYA+Q +N +ATGVHMS+N+ G T HK ESKI
Sbjct: 395 DGPARTYRKWGIEKLAYRDVKKVPRRDRKYAVQPENVFATGVHMSQNLQGKTYHKAESKI 454

Query: 447 RYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLADTIKEFERHAI 491
           RY+HYH SI  R E CR+   +S +    +F  TP+V D  +  +   ++ FE   I
Sbjct: 455 RYFHYHGSISQRREPCRQLFNDSRV----VFENTPYVLDTTICDVGLAVRTFELRTI 495

BLAST of CmaCh01G020540 vs. TAIR10
Match: AT1G76600.1 (AT1G76600.1 unknown protein)

HSP 1 Score: 158.7 bits (400), Expect = 1.3e-38
Identity = 107/218 (49.08%), Postives = 140/218 (64.22%), Query Frame = 1

Query: 502 MGACLSDCLNHPKPSSVSPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSLSDS-- 561
           MG C+S   N    SS      TAK++++ G LREY VP+  S+VL++E++SSS S S  
Sbjct: 1   MGLCVSVNRNEYVSSST-----TAKIVTINGDLREYDVPVLASQVLESESTSSSSSSSSS 60

Query: 562 --FLCNSDRLYYDDFIPPLPLDEQLLPNQIYFLLPSSNLHHRLSASQMAALAVKASLALQ 621
             FLCNSD LYYDDFIP +  DE L  NQIYF+LP S   +RLSAS MAALAVKAS+A++
Sbjct: 61  SYFLCNSDSLYYDDFIPAIESDEILQANQIYFVLPISKRQYRLSASDMAALAVKASVAIE 120

Query: 622 NAS--PNDRRKKGRVSPLLNLSD-SDHIIS------------------KEPSKKNAAADT 681
            A+   N RR+ GR+SP++ L+  +D+ I+                  K P++     DT
Sbjct: 121 KAAGKKNRRRRSGRISPVVTLNQANDNRIAAVNNRIGGEATNMMMQKGKLPNRTTPFKDT 180

Query: 682 ---SASPSVRKLQRLTSRRAKMAVRSFKLKLSTIYEGA 692
              S S SVRKL+R TS RAK+AVRSF+L+LSTIYEG+
Sbjct: 181 NGYSRSGSVRKLKRYTSGRAKLAVRSFRLRLSTIYEGS 213

BLAST of CmaCh01G020540 vs. TAIR10
Match: AT1G21010.1 (AT1G21010.1 unknown protein)

HSP 1 Score: 147.1 bits (370), Expect = 3.9e-35
Identity = 92/195 (47.18%), Postives = 128/195 (65.64%), Query Frame = 1

Query: 523 PTAKVISLQGHLREYPVPISVSRVLQTEN-------SSSSLSDSFLCNSDRLYYDDFIPP 582
           PT K++++ G LREY VP+  S+VL+ E+       SSS  S  F+C+SD LYYDDFIP 
Sbjct: 16  PTVKIVTVNGDLREYNVPVIASQVLEAESAAAYSSSSSSRPSSYFICDSDSLYYDDFIPA 75

Query: 583 LPLDEQLLPNQIYFLLPSSNLHHRLSASQMAALAVKASLALQNASPND--RRKKGRVSPL 642
           +  +E L  +QIYF+LP S    RL+AS MAALAVKAS+A+QN+   +  RRKK R+SP+
Sbjct: 76  IKSEEPLQADQIYFVLPISKRQSRLTASDMAALAVKASVAIQNSVKKESRRRKKVRISPV 135

Query: 643 LNLSDSDHIISKEPSKK---------------NAAADTSASPSVRKLQRLTSRRAKMAVR 694
           + L+ S+  ++   S+                 A++  + S SVR L+R TS+RAK+AVR
Sbjct: 136 MMLTGSNDSVNGNGSETTVKKGRPFVSKTAPVKASSGINRSGSVRNLRRYTSKRAKLAVR 195

BLAST of CmaCh01G020540 vs. NCBI nr
Match: gi|802604275|ref|XP_012073538.1| (PREDICTED: uncharacterized protein LOC105635149 [Jatropha curcas])

HSP 1 Score: 793.5 bits (2048), Expect = 3.0e-226
Identity = 381/492 (77.44%), Postives = 423/492 (85.98%), Query Frame = 1

Query: 1   MRKD-GPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCS 60
           MRKD  P +S      GKL  CFETKPLVAT LALTLVM LWNLPPYYQNLL TT RSCS
Sbjct: 1   MRKDCPPLSSFAGGTAGKLSLCFETKPLVATVLALTLVMLLWNLPPYYQNLLSTT-RSCS 60

Query: 61  AP-TNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMG 120
           AP  +TA    +N +  P TS   T S + +KYST + TDPN RIFQ +GNAAALFV MG
Sbjct: 61  APAASTASLIASNASSLPITSYASTTSVSEQKYSTPVVTDPNKRIFQAYGNAAALFVQMG 120

Query: 121 AYRGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVV 180
           AYRGGPRTFA+VGLASKPIHV+G PWYKCEWI NNGSS+RAKAYK+LPDWGYGRVYTVVV
Sbjct: 121 AYRGGPRTFAVVGLASKPIHVFGRPWYKCEWISNNGSSLRAKAYKMLPDWGYGRVYTVVV 180

Query: 181 VNCTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYC 240
           VNCTF +NPN DN+GGKL +NAYYG+SQ+KYEKF ALEE PGSYN SK+ PPY YEYLYC
Sbjct: 181 VNCTFSVNPNEDNAGGKLMLNAYYGESQRKYEKFVALEEAPGSYNESKYHPPYQYEYLYC 240

Query: 241 GSSLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDI 300
           GSSLYGNLSAAR+REWMAYHAWFFGS SHFVFHDAGGVSPEVRA LEPWVRAGR T+QDI
Sbjct: 241 GSSLYGNLSAARMREWMAYHAWFFGSSSHFVFHDAGGVSPEVRAALEPWVRAGRATVQDI 300

Query: 301 RAQSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQ 360
           R Q+E+DGYYYNQFLVVNDCLHRYR+AANWTFYFDVDEYIYLP G++LESVL+EFS +TQ
Sbjct: 301 RGQAEFDGYYYNQFLVVNDCLHRYRYAANWTFYFDVDEYIYLPLGNTLESVLKEFSDYTQ 360

Query: 361 FTIEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMS 420
           FTIEQNPMSS+LCLNDS ++YSR+WGFEKLLF++ ++GI RDRKYAIQAK A+ATGVHMS
Sbjct: 361 FTIEQNPMSSVLCLNDSTRDYSREWGFEKLLFRESRTGIRRDRKYAIQAKKAFATGVHMS 420

Query: 421 ENVIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKL 480
           ENV+G T HKTE KIRYYHYHNSI V GELCREFLP SA  NVT++++ P+VYDD MKKL
Sbjct: 421 ENVVGKTLHKTEDKIRYYHYHNSITVPGELCREFLPPSAKKNVTLYDKLPYVYDDNMKKL 480

Query: 481 ADTIKEFERHAI 491
           A TIKEFER  I
Sbjct: 481 AATIKEFERKTI 491

BLAST of CmaCh01G020540 vs. NCBI nr
Match: gi|590636240|ref|XP_007028812.1| (Domain of Uncharacterized protein function (DUF23) [Theobroma cacao])

HSP 1 Score: 780.8 bits (2015), Expect = 2.0e-222
Identity = 375/496 (75.60%), Postives = 421/496 (84.88%), Query Frame = 1

Query: 1   MRKDGPATSIGD--AQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSC 60
           MRK+   +S     A  GKLF CFETKPLVAT LALTLVM LWNLPPYYQNLL TT R C
Sbjct: 1   MRKEAAPSSAPSTAASFGKLFVCFETKPLVATLLALTLVMLLWNLPPYYQNLLSTT-RPC 60

Query: 61  SAPTNTADAALANLTIPPNTSLPFTASAAAKK--YST--TLPTDPNNRIFQPFGNAAALF 120
           SAP+ T+ AA     +  N SLP+TA+  A+K  YS     P DPN R+F+ +GNAAALF
Sbjct: 61  SAPSLTSAAAATTSLLATNVSLPYTATPVAEKKYYSAPKAKPRDPNKRVFEAYGNAAALF 120

Query: 121 VLMGAYRGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVY 180
           V MGAYRGGP TFA+VGLASKPIHV+G PWYKCEWI NNGSS RAKAYK+LPDWGYGRVY
Sbjct: 121 VRMGAYRGGPTTFAVVGLASKPIHVFGRPWYKCEWISNNGSSYRAKAYKMLPDWGYGRVY 180

Query: 181 TVVVVNCTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYE 240
           TV+VVNCTFP NPN DN GGKL +NAYYG+SQ+KYEKFTALEE PGSYN SK+  P+ YE
Sbjct: 181 TVLVVNCTFPFNPNQDNLGGKLMINAYYGESQRKYEKFTALEEAPGSYNESKYHSPFQYE 240

Query: 241 YLYCGSSLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVT 300
           YLYCGSSLYGNLSA R+REWMAYHAWFFG  SHFVFHDAGGV+PEVRA L+PWVRAGR T
Sbjct: 241 YLYCGSSLYGNLSADRMREWMAYHAWFFGPNSHFVFHDAGGVTPEVRAALDPWVRAGRAT 300

Query: 301 IQDIRAQSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFS 360
           +QDIR Q+E+DGYYYNQFLVVNDCLHRYRHAANWTF+FDVDEYIYLP+G++LESVL EFS
Sbjct: 301 MQDIRDQAEFDGYYYNQFLVVNDCLHRYRHAANWTFFFDVDEYIYLPDGNTLESVLNEFS 360

Query: 361 AFTQFTIEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATG 420
            +TQFTIEQNPMSS+LCLN+S++ YSR+WGFEKLLF++ ++GI RDRKYAIQAKNAYATG
Sbjct: 361 DYTQFTIEQNPMSSVLCLNNSSEQYSRQWGFEKLLFRESRTGIRRDRKYAIQAKNAYATG 420

Query: 421 VHMSENVIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDK 480
           VHMSENVIG T HKTE+KIRYYHYHN+I V  ELCREFLP SA +NVT FN+ P+VYDD 
Sbjct: 421 VHMSENVIGKTLHKTETKIRYYHYHNTITVHQELCREFLPLSAKNNVTWFNKLPYVYDDN 480

Query: 481 MKKLADTIKEFERHAI 491
           MKKLA+TIKEFER  I
Sbjct: 481 MKKLANTIKEFERKTI 495

BLAST of CmaCh01G020540 vs. NCBI nr
Match: gi|255539368|ref|XP_002510749.1| (PREDICTED: uncharacterized protein LOC8269962 [Ricinus communis])

HSP 1 Score: 773.9 bits (1997), Expect = 2.4e-220
Identity = 373/495 (75.35%), Postives = 420/495 (84.85%), Query Frame = 1

Query: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60
           MRKD P  S      GKL +C ETKPLVAT LALTLVM LWNLPPYYQNLL TT R CSA
Sbjct: 1   MRKDCPPLS--SVTGGKLPSCLETKPLVATLLALTLVMLLWNLPPYYQNLLSTT-RPCSA 60

Query: 61  PTNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY 120
           P   A  A +N +     SLP T S + +KYST + +DPN RIF+ +GNAAALFV MGAY
Sbjct: 61  PAAAAALAASNAS-----SLPIT-SVSEQKYSTGV-SDPNKRIFEAYGNAAALFVKMGAY 120

Query: 121 RGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVN 180
           RGGPRTFA+VGLASKPIHV+G PWYKCEWI NNGSS+RAKAYK+LPDWGYGRVYTVVVVN
Sbjct: 121 RGGPRTFAVVGLASKPIHVFGRPWYKCEWISNNGSSMRAKAYKMLPDWGYGRVYTVVVVN 180

Query: 181 CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS 240
           CTFPLNPN +N+GGKL +NAYYG+S +KYEK  ALEE PGSYN S + PPY YEYLYCGS
Sbjct: 181 CTFPLNPNRENAGGKLILNAYYGESPRKYEKIVALEEAPGSYNDSNYHPPYQYEYLYCGS 240

Query: 241 SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA 300
           SLYGNLSAAR+REWMAYHAWFFG  SHFVFHDAGGVSP+VRA LEPWVRAGR T+QDIR 
Sbjct: 241 SLYGNLSAARMREWMAYHAWFFGPSSHFVFHDAGGVSPQVRAALEPWVRAGRATVQDIRG 300

Query: 301 QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFT 360
           Q+E+DGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLP+GS+L+SVL EFS +TQFT
Sbjct: 301 QAEFDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPDGSTLQSVLAEFSDYTQFT 360

Query: 361 IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN 420
           IEQNPMSS+LCLNDS+ +Y R+WGFEKLLF++ +SGI RDRKYAIQAKNA+ATGVHMSEN
Sbjct: 361 IEQNPMSSVLCLNDSSHDYPREWGFEKLLFRESRSGIRRDRKYAIQAKNAFATGVHMSEN 420

Query: 421 VIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLAD 480
           V+G T HKTE+KIRYYHYHNSI V+GELCR+ LP SA HNVT +N+ P+VYDD MKKL  
Sbjct: 421 VVGKTLHKTETKIRYYHYHNSITVQGELCRQLLPASAKHNVTWYNKLPYVYDDNMKKLVT 480

Query: 481 TIKEFERHAIDRHQQ 496
           TI++FER+ I   +Q
Sbjct: 481 TIRDFERNTIGNVRQ 485

BLAST of CmaCh01G020540 vs. NCBI nr
Match: gi|823199590|ref|XP_012434961.1| (PREDICTED: uncharacterized protein LOC105761642 [Gossypium raimondii])

HSP 1 Score: 772.3 bits (1993), Expect = 7.1e-220
Identity = 370/508 (72.83%), Postives = 421/508 (82.87%), Query Frame = 1

Query: 1   MRKDG-----PATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTA 60
           MRK+      P+ + G   +GKLF CFETK LV T LALTLVMFLWNLPPYYQNLL TT 
Sbjct: 1   MRKEAAPSSVPSAAAGTTTLGKLFICFETKTLVTTLLALTLVMFLWNLPPYYQNLLSTT- 60

Query: 61  RSCSAPTNTAD--AALANLT---IPPNTSLPFTASAAAKKYSTTL---PTDPNNRIFQPF 120
           R CS P  +    A+ A++T   I  N S+P+ A+  AKKY+T     P DPN R+F+ +
Sbjct: 61  RPCSVPITSVSVSASAASVTASLISTNVSMPYKANPVAKKYNTATRPKPKDPNKRVFESY 120

Query: 121 GNAAALFVLMGAYRGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPD 180
           GNAAALFV MGAYRGGP TFA+VGLASKPIHVYG PW+KCEWI NNGSS RAKAYK+LPD
Sbjct: 121 GNAAALFVQMGAYRGGPTTFAVVGLASKPIHVYGKPWFKCEWISNNGSSYRAKAYKMLPD 180

Query: 181 WGYGRVYTVVVVNCTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKF 240
           WGYGRVYTVVVVNCTFP NPN DN+GGKL VNAYYG+SQ+KYEKF ALEE PGSYN SKF
Sbjct: 181 WGYGRVYTVVVVNCTFPFNPNQDNNGGKLMVNAYYGESQRKYEKFMALEESPGSYNESKF 240

Query: 241 RPPYDYEYLYCGSSLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPW 300
            PPY YEYLYCGSSLYGNLSA RIREWMAYHAWFFG  SHFVFHDAGGVSPEV AVLEPW
Sbjct: 241 NPPYQYEYLYCGSSLYGNLSADRIREWMAYHAWFFGPSSHFVFHDAGGVSPEVMAVLEPW 300

Query: 301 VRAGRVTIQDIRAQSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLE 360
           V+AGRVT+QDIR Q+EYDGYYYNQFLVVNDCLHRYR+AANWTF+FDVDEYIYLP G++LE
Sbjct: 301 VKAGRVTVQDIRDQAEYDGYYYNQFLVVNDCLHRYRYAANWTFFFDVDEYIYLPHGNTLE 360

Query: 361 SVLEEFSAFTQFTIEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQA 420
           SVL EFS +TQFTI+QNPMSSMLCLN S+Q YSR+WGFEKLLF++ ++ I RDRKYAIQA
Sbjct: 361 SVLNEFSGYTQFTIQQNPMSSMLCLNGSSQEYSRQWGFEKLLFRESRTKIRRDRKYAIQA 420

Query: 421 KNAYATGVHMSENVIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQT 480
           KNA+ATGVHMSEN++G + HKTE+KI YYHYHN+I    ELCRE+LP++A+  VT FN+ 
Sbjct: 421 KNAFATGVHMSENIVGKSLHKTETKIHYYHYHNTITQHQELCREYLPSTAVKQVTWFNKL 480

Query: 481 PFVYDDKMKKLADTIKEFERHAIDRHQQ 496
           P+VYDD MKKLA+TIK+FE   I +  Q
Sbjct: 481 PYVYDDNMKKLANTIKQFELETIGKQPQ 507

BLAST of CmaCh01G020540 vs. NCBI nr
Match: gi|224083390|ref|XP_002307008.1| (hypothetical protein POPTR_0005s28030g [Populus trichocarpa])

HSP 1 Score: 771.2 bits (1990), Expect = 1.6e-219
Identity = 377/501 (75.25%), Postives = 420/501 (83.83%), Query Frame = 1

Query: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTAR-SCS 60
           MRKD    S G A V KL  CFETKPLVAT LALTLVM LWNLPPYYQNLL TT R SCS
Sbjct: 1   MRKDCQPPSSG-ATVSKLPACFETKPLVATLLALTLVMLLWNLPPYYQNLLSTTTRPSCS 60

Query: 61  APTNTADAALANLTIPPNTSLPFTASAAA-KKY-----STTLPTDPNNRIFQPFGNAAAL 120
           AP  T   +        NTS  FT+++ + +KY     S++   DPN RIFQ +GNAAAL
Sbjct: 61  APETTVSIS--------NTSSSFTSTSLSDQKYLSSSSSSSSNADPNKRIFQAYGNAAAL 120

Query: 121 FVLMGAYRGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRV 180
           FV MGAYRGGP TFA+VGLASKPIHV+  PWYKCEWI NNGSSIRAKAYK+LPDWGYGRV
Sbjct: 121 FVQMGAYRGGPTTFAVVGLASKPIHVFRLPWYKCEWISNNGSSIRAKAYKMLPDWGYGRV 180

Query: 181 YTVVVVNCTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDY 240
           YTVVVVNCTFP+NPN DN+GG+L +NAYY +SQ+KYEKF ALEELPGSYN SKFRPPY Y
Sbjct: 181 YTVVVVNCTFPVNPNQDNAGGRLMLNAYYDESQRKYEKFMALEELPGSYNESKFRPPYQY 240

Query: 241 EYLYCGSSLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRV 300
           EYLYCGSSLYGNLSA+R REWMAYHAWFFG  SHFVFHDAGGVSPEVRA L+PWVRAGR 
Sbjct: 241 EYLYCGSSLYGNLSASRFREWMAYHAWFFGPSSHFVFHDAGGVSPEVRAALDPWVRAGRA 300

Query: 301 TIQDIRAQSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEF 360
           T+QDIR Q+E+DGYYYNQFLVVNDCLHRYR++ANWTFYFDVDEYIYLPEG++LESVL++F
Sbjct: 301 TVQDIRGQAEFDGYYYNQFLVVNDCLHRYRYSANWTFYFDVDEYIYLPEGNTLESVLKDF 360

Query: 361 SAFTQFTIEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYAT 420
           S +TQFTIEQNPMSS LC NDS Q+Y R+WGFEKLLF++ ++GI RDRKYAIQAKNAYAT
Sbjct: 361 SNYTQFTIEQNPMSSALCFNDSTQDYPRQWGFEKLLFRESRTGIRRDRKYAIQAKNAYAT 420

Query: 421 GVHMSENVIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDD 480
           GVHMSENVIG T H+TE+KIRYYHYHNSI V GELCREFLP SA +NVT +N  P+VYDD
Sbjct: 421 GVHMSENVIGKTLHQTETKIRYYHYHNSIQVPGELCREFLPLSAKNNVTWYNGLPYVYDD 480

Query: 481 KMKKLADTIKEFERHAIDRHQ 495
            MKKLA TIK+FER+ I   Q
Sbjct: 481 NMKKLASTIKDFERNTIGNVQ 492

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GALS1_ARATH3.2e-21273.94Galactan beta-1,4-galactosyltransferase GALS1 OS=Arabidopsis thaliana GN=GALS1 P... [more]
GALS2_ARATH3.0e-11753.13Galactan beta-1,4-galactosyltransferase GALS2 OS=Arabidopsis thaliana GN=GALS2 P... [more]
GALS3_ARATH3.1e-11446.12Galactan beta-1,4-galactosyltransferase GALS3 OS=Arabidopsis thaliana GN=GALS3 P... [more]
Match NameE-valueIdentityDescription
A0A067KKI4_JATCU2.1e-22677.44Uncharacterized protein OS=Jatropha curcas GN=JCGZ_08012 PE=4 SV=1[more]
A0A061F459_THECC1.4e-22275.60Domain of Uncharacterized protein function (DUF23) OS=Theobroma cacao GN=TCM_024... [more]
B9R8V6_RICCO1.7e-22075.35Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1602470 PE=4 SV=1[more]
A0A0D2TS76_GOSRA4.9e-22072.83Uncharacterized protein OS=Gossypium raimondii GN=B456_007G359700 PE=4 SV=1[more]
B9H5Y4_POPTR1.1e-21975.25Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s28030g PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G33570.11.8e-21373.94 Domain of unknown function (DUF23)[more]
AT5G44670.11.7e-11853.13 Domain of unknown function (DUF23)[more]
AT4G20170.11.7e-11546.12 Domain of unknown function (DUF23)[more]
AT1G76600.11.3e-3849.08 unknown protein[more]
AT1G21010.13.9e-3547.18 unknown protein[more]
Match NameE-valueIdentityDescription
gi|802604275|ref|XP_012073538.1|3.0e-22677.44PREDICTED: uncharacterized protein LOC105635149 [Jatropha curcas][more]
gi|590636240|ref|XP_007028812.1|2.0e-22275.60Domain of Uncharacterized protein function (DUF23) [Theobroma cacao][more]
gi|255539368|ref|XP_002510749.1|2.4e-22075.35PREDICTED: uncharacterized protein LOC8269962 [Ricinus communis][more]
gi|823199590|ref|XP_012434961.1|7.1e-22072.83PREDICTED: uncharacterized protein LOC105761642 [Gossypium raimondii][more]
gi|224083390|ref|XP_002307008.1|1.6e-21975.25hypothetical protein POPTR_0005s28030g [Populus trichocarpa][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR008166Glyco_transf_92
IPR025322DUF4228_plant
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0042546 cell wall biogenesis
biological_process GO:0045489 pectin biosynthetic process
biological_process GO:0008150 biological_process
cellular_component GO:0005794 Golgi apparatus
cellular_component GO:0005575 cellular_component
molecular_function GO:0048531 beta-1,3-galactosyltransferase activity
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh01G020540.1CmaCh01G020540.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR008166Glycosyltransferase family 92PFAMPF01697Glyco_transf_92coord: 234..449
score: 1.1
IPR025322Protein of unknown function DUF4228, plantPFAMPF14009DUF4228coord: 502..689
score: 6.7
NoneNo IPR availablePANTHERPTHR21461UNCHARACTERIZEDcoord: 1..511
score: 1.1E
NoneNo IPR availablePANTHERPTHR21461:SF11SUBFAMILY NOT NAMEDcoord: 1..511
score: 1.1E