BLAST of CmoCh03G004900 vs. Swiss-Prot
Match:
CPL1_ARATH (RNA polymerase II C-terminal domain phosphatase-like 1 OS=Arabidopsis thaliana GN=CPL1 PE=1 SV=1)
HSP 1 Score: 909.4 bits (2349), Expect = 2.7e-263
Identity = 503/812 (61.95%), Positives = 599/812 (73.77%), Query Frame = 1
Query: 6 VYQGDELLGEVEIYPE-----------EKNGYKNIEVKE-----IRISHFSQPSERCPPL 65
V+ GD LGE+EIYP ++ K EV E IRISHFSQ ERCPPL
Sbjct: 9 VFHGDGRLGELEIYPSRELNQQQDDVMKQRKKKQREVMELAKMGIRISHFSQSGERCPPL 68
Query: 66 AVLHTIAASGICFKMESKTSQSQDMPLHLLHSSCIMENKSAIMVFGMEELHLVAMYSRDH 125
A+L TI++ G+CFK+E+ S +Q+ L L +SSC+ +NK+A+M+ G EELHLVAMYS +
Sbjct: 69 AILTTISSCGLCFKLEASPSPAQES-LSLFYSSCLRDNKTAVMLLGGEELHLVAMYSENI 128
Query: 126 DKQYPCFWGFNVAMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKIEALQRKIS 185
PCFW F+VA G+Y+SCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKI+ QR+I+
Sbjct: 129 KNDRPCFWAFSVAPGIYDSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKIDGFQRRIN 188
Query: 186 SEVDPQRTAGMLAEVERYQDDKLILKQYAENDQVIENGKVIKSQSEVVPALSDNHQPFVR 245
+E+DPQR A ++AE++RYQDDK +LKQY E+DQV+ENG+VIK QSE+VPALSDNHQP VR
Sbjct: 189 NEMDPQRLAVIVAEMKRYQDDKNLLKQYIESDQVVENGEVIKVQSEIVPALSDNHQPLVR 248
Query: 246 PLIRLHEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYA 305
PLIRL EKNIILTRINP IRDTSVLVR+RP+WE+LRSYLTA+GRKRFEVYVCTMAERDYA
Sbjct: 249 PLIRLQEKNIILTRINPMIRDTSVLVRMRPSWEELRSYLTAKGRKRFEVYVCTMAERDYA 308
Query: 306 LEMWRLLDPDSNLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDE 365
LEMWRLLDP+ NLIN +LL RIVCVKSG +KSLFNVF DG CHPKMALVIDDRLKVWDE
Sbjct: 309 LEMWRLLDPEGNLINTNDLLARIVCVKSGFKKSLFNVFLDGTCHPKMALVIDDRLKVWDE 368
Query: 366 KDQPRVHVVPAFAPYYAPNAEGNNVVPVLCVARNVACYVRGGFFREFDEVLLQKIYNISY 425
KDQPRVHVVPAFAPYY+P AE PVLCVARNVAC VRGGFFR+FD+ LL +I ISY
Sbjct: 369 KDQPRVHVVPAFAPYYSPQAEA-AATPVLCVARNVACGVRGGFFRDFDDSLLPRIAEISY 428
Query: 426 EDDANDIPSPPDVSNYLGSEDEYSVSNGNKDTLTFDGMSDMEVDRRMKDAFLASSTVNSA 485
E+DA DIPSPPDVS+YL SED+ S NGNKD L+FDGM+D EV+RR+K+A ASS V A
Sbjct: 429 ENDAEDIPSPPDVSHYLVSEDDTSGLNGNKDPLSFDGMADTEVERRLKEAISASSAVLPA 488
Query: 486 ---DPRVPS-LQYTMASASG-TVPVPPY------------YPNMPLPHVDSVAQVA---- 545
DPR+ + +Q+ MASAS +VPVP +P++P +A
Sbjct: 489 ANIDPRIAAPVQFPMASASSVSVPVPVQVVQQAIQPSAMAFPSIPFQQPQQPTSIAKHLV 548
Query: 546 ASEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRERQSSEPAFLGRPPPLPQVV 605
SEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTR+ SEP+F RPP Q
Sbjct: 549 PSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDPAPSEPSFPQRPP--VQAP 608
Query: 606 GPRAQPRGSWSPMEEEMSPLQL-SWTRKEFPVDEEPIR-EKHRSNHPSFFPKNDSSFPPD 665
Q R W P+EEEM P Q+ KE+P+D E I EKHR HPSFF K D+S D
Sbjct: 609 PSHVQSRNGWFPVEEEMDPAQIRRAVSKEYPLDSEMIHMEKHRPRHPSFFSKIDNSTQSD 668
Query: 666 RIPHENQRLSKEAFYRDDRVRVSRR-PSSYPAFSGDEIPMNQSSSRSRENDIESGRSI-W 725
R+ HEN+R KE+ RD+++R + P S+P F G++ NQSSSR+ + D RS+
Sbjct: 669 RMLHENRRPPKESLRRDEQLRSNNNLPDSHP-FYGEDASWNQSSSRNSDLDFLPERSVSA 728
Query: 726 SETPVGALQEIAMKFGTKVEFKPALVSSTDLQFAVEAWFVGEKIGEGIGKTRREAQRHAA 777
+ET L IA+K G KVE+KP+LVSSTDL+F+VEAW +KIGEGIGK+RREA AA
Sbjct: 729 TETSADVLHGIAIKCGAKVEYKPSLVSSTDLRFSVEAWLSNQKIGEGIGKSRREALHKAA 788
BLAST of CmoCh03G004900 vs. Swiss-Prot
Match:
CPL2_ARATH (RNA polymerase II C-terminal domain phosphatase-like 2 OS=Arabidopsis thaliana GN=CPL2 PE=1 SV=3)
HSP 1 Score: 595.9 bits (1535), Expect = 6.5e-169
Identity = 363/785 (46.24%), Positives = 476/785 (60.64%), Query Frame = 1
Query: 2 YKSVVYQGDELLGEVEIYPEEKNGYKNIEVKEIRISHFSQPSERCPPLAVLHTIAASGIC 61
+KSVVY GD LGE+++ + EIRI H S ERCPPLA+L TIA+ +
Sbjct: 6 HKSVVYHGDLRLGELDVNHVSSSHEFRFPNDEIRIHHLSPAGERCPPLAILQTIASFAVR 65
Query: 62 FKMESKTSQSQDMPLHLLHSSCIMENKSAIMVFGMEELHLVAMYSRDHDKQYPCFWGFNV 121
K+ES +HL H+ C E K+A+++ G EE+HLVAM S++ K++PCFW F+V
Sbjct: 66 CKLESSAPVKSQELMHL-HAVCFHELKTAVVMLGDEEIHLVAMPSKE--KKFPCFWCFSV 125
Query: 122 AMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKIEALQRKISSEVDPQRTAGML 181
GLY+SCL MLN RCL IVFDLDETL+VANTM+SFED+IEAL+ IS E+DP R GM
Sbjct: 126 PSGLYDSCLRMLNTRCLSIVFDLDETLIVANTMKSFEDRIEALKSWISREMDPVRINGMS 185
Query: 182 AEVERYQDDKLILKQYAENDQVIENGKVIKSQSEVVPALSDNHQPFVRPLIRLHEKNIIL 241
AE++RY DD+++LKQY +ND +NG ++K+Q E V SD + RP+IRL EKN +L
Sbjct: 186 AELKRYMDDRMLLKQYIDNDYAFDNGVLLKAQPEEVRPTSDGQEKVCRPVIRLPEKNTVL 245
Query: 242 TRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSN 301
TRI P+IRDTSVLV+LRPAWE+LRSYLTA+ RKRFEVYVCTMAERDYALEMWRLLDP+++
Sbjct: 246 TRIKPEIRDTSVLVKLRPAWEELRSYLTAKTRKRFEVYVCTMAERDYALEMWRLLDPEAH 305
Query: 302 LINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPAF 361
LI+ KEL DRIVCVK ++KSL +VF G CHPKMA+VIDDR+KVW++KDQPRVHVV A+
Sbjct: 306 LISLKELRDRIVCVKPDAKKSLLSVFNGGICHPKMAMVIDDRMKVWEDKDQPRVHVVSAY 365
Query: 362 APYYAPNAEGNNVVPVLCVARNVACYVRGGFFREFDEVLLQKIYNISYEDDANDIPSPPD 421
PYYAP AE VVP LCVARNVAC VRG FF+EFDE L+ I + YEDD ++P PD
Sbjct: 366 LPYYAPQAETALVVPHLCVARNVACNVRGYFFKEFDESLMSSISLVYYEDDVENLPPSPD 425
Query: 422 VSNYLGSEDEYSVSNGNKDTLTF-DGMSDMEVDRRMKDAFLASSTVNSADPRVPSLQYTM 481
VSNY+ ED SNGN + +GM EV+RR+ A A + A
Sbjct: 426 VSNYVVIEDPGFASNGNINAPPINEGMCGGEVERRLNQAAAADHSTLPA----------- 485
Query: 482 ASASGTVPVPPYYPNMPLPHVDSVAQVAA----SEPSLQSSPAREEGEVPESELDPDTRR 541
S + P P +P+ S A AA +PSL +P R+ +
Sbjct: 486 TSNAEQKPETPKPQIAVIPNNASTATAAALLPSHKPSLLGAPRRDGFTFSDGG------- 545
Query: 542 RLLILQHGQDTRERQSSEPAFLGRPPPLPQVVGPRAQPRGSWSPMEEEMSPLQLSWTRKE 601
R L+++ G D R + ++P L + P P + G W +E R
Sbjct: 546 RPLMMRPGVDIRNQNFNQPPILAKIPMQPPSSSMHSP--GGWLVDDEN---------RPS 605
Query: 602 FPVDEEPIREKHRSNHPSFFPKND-SSFPPDRIPHENQRLSKEAFYRDDRVRVSRRPSSY 661
FP + +PS FP S P H + S+E DD R + PS
Sbjct: 606 FPGRPSGL-------YPSQFPHGTPGSAPVGPFAHPSHLRSEEVAMDDDLKR--QNPSRQ 665
Query: 662 PAFSGDEIPMNQSSSRSRENDIESGRSIWSETP--VGALQEIAMKFGTKVEFKPALVSST 721
G I N S RE+ + G+S ++ V ALQEI + G+KVEF+ + ++
Sbjct: 666 TTEGG--ISQNHLVSNGREHHTDGGKSNGGQSHLFVSALQEIGRRCGSKVEFRTVISTNK 725
Query: 722 DLQFAVEAWFVGEKIGEGIGKTRREAQRHAAEGSIKNLANVYVSRCKADSTSANDMNKFP 778
+LQF+VE F GEKIG G+ KT+++A + AAE ++++LA YV+ + A + K P
Sbjct: 726 ELQFSVEVLFTGEKIGIGMAKTKKDAHQQAAENALRSLAEKYVAHV---APLARETEKGP 744
BLAST of CmoCh03G004900 vs. TrEMBL
Match:
A0A0A0KLF7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G517200 PE=4 SV=1)
HSP 1 Score: 1410.2 bits (3649), Expect = 0.0e+00
Identity = 711/803 (88.54%), Positives = 752/803 (93.65%), Query Frame = 1
Query: 1 MYKSVVYQGDELLGEVEIYPEEKNGYKNIEVKEIRISHFSQPSERCPPLAVLHTIAASGI 60
MYKSVVY GDELLG+VEIYPEEKNGYKNIEVKEIRI+HFSQPSERCPPLAVLHTIAASGI
Sbjct: 1 MYKSVVYHGDELLGDVEIYPEEKNGYKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI 60
Query: 61 CFKMESKTSQSQDMPLHLLHSSCIMENKSAIMVFGMEELHLVAMYSRDHDKQYPCFWGFN 120
CFKMESKTSQSQD PL+LLHSSCIMENK+AIM+FG+EELHLVAM+SRD DKQYPCFWGFN
Sbjct: 61 CFKMESKTSQSQDTPLNLLHSSCIMENKTAIMMFGVEELHLVAMFSRDLDKQYPCFWGFN 120
Query: 121 VAMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKIEALQRKISSEVDPQRTAGM 180
VAMGLYNSCL MLNLRCLGIVFDLDETLVVANTMRSFED+IEALQRKISSEVDPQR GM
Sbjct: 121 VAMGLYNSCLDMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRANGM 180
Query: 181 LAEVERYQDDKLILKQYAENDQVIENGKVIKSQSEVVPALSDNHQPFVRPLIRLHEKNII 240
LAEV+RYQDDK+ILKQYAENDQVIENGKVIKSQSEVVPALSDNHQP VRPLIRLHEKNII
Sbjct: 181 LAEVKRYQDDKIILKQYAENDQVIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEKNII 240
Query: 241 LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 300
LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS
Sbjct: 241 LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 300
Query: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360
NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA
Sbjct: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360
Query: 361 FAPYYAPNAEGNNVVPVLCVARNVACYVRGGFFREFDEVLLQKIYNISYEDDANDIPSPP 420
FAPYYAPNAEGNN +PVLCVARNVAC VRGGFF+EFD++LLQKI +ISYEDD NDIPSPP
Sbjct: 361 FAPYYAPNAEGNNAIPVLCVARNVACNVRGGFFKEFDDILLQKISDISYEDDVNDIPSPP 420
Query: 421 DVSNYLGSEDEYSVSNGNKDTLTFDGMSDMEVDRRMKDAFLASSTVNSADPRVPSLQYTM 480
DVSNYL SEDEYS++NGNKD TFDGM DMEVDRRMKDAFLASST+NSADPRV SLQYTM
Sbjct: 421 DVSNYLVSEDEYSIANGNKDMPTFDGMPDMEVDRRMKDAFLASSTINSADPRVSSLQYTM 480
Query: 481 ASASGTVPVP------PYYPNMPLPHVDSVAQVAASEPSLQSSPAREEGEVPESELDPDT 540
ASAS +VP+P PY+PNMPLPHV+SVA VA +EPSLQSSPAREEGEVPESELDPDT
Sbjct: 481 ASASCSVPLPPKQVTMPYFPNMPLPHVNSVAHVAPNEPSLQSSPAREEGEVPESELDPDT 540
Query: 541 RRRLLILQHGQDTRERQSSEPAFLGRPPPLPQVVGPRAQPRGSWSPMEEEMSPLQLSWT- 600
RRRLLILQHGQDTRER SSEPAF RPPPL QV PRAQ RG+WSPMEEEMSP QL+ +
Sbjct: 541 RRRLLILQHGQDTRERLSSEPAFPARPPPLQQVAAPRAQSRGNWSPMEEEMSPRQLNRSA 600
Query: 601 RKEFPVDEE--PIREKHRSNHPSFFPKNDSSFPPDRIPHENQRLSKEAFYRDDRVRVSRR 660
RK+FPVD E P+REKHRSNHPSFF K D+S PDRIPH+NQRL KEAFYRDDR+RVSRR
Sbjct: 601 RKDFPVDAEPMPMREKHRSNHPSFFAKVDNSILPDRIPHDNQRLPKEAFYRDDRMRVSRR 660
Query: 661 PSSYPAFSGDEIPMNQSSSRSRENDIESGRSIWSETPVGALQEIAMKFGTKVEFKPALVS 720
PSSYPAFSG+EIPMNQSSSRSR++DIESGRSIWSETPVGALQEIAMKFGTKVEFKP LV
Sbjct: 661 PSSYPAFSGEEIPMNQSSSRSRDDDIESGRSIWSETPVGALQEIAMKFGTKVEFKPGLVP 720
Query: 721 STDLQFAVEAWFVGEKIGEGIGKTRREAQRHAAEGSIKNLANVYVSRCKADSTSANDMNK 780
STDLQF+VEAWFVGEKIGEGIG TRR+AQR AAEGSIKNLAN+YVSRCKAD +SANDMNK
Sbjct: 721 STDLQFSVEAWFVGEKIGEGIGHTRRDAQRQAAEGSIKNLANIYVSRCKADPSSANDMNK 780
Query: 781 FPNDNGSGKRMRTDFHGNLPKPK 795
FP+DNGSGKRM+ DFH +LPK K
Sbjct: 781 FPSDNGSGKRMKLDFHRHLPKTK 803
BLAST of CmoCh03G004900 vs. TrEMBL
Match:
A0A061GMH8_THECC (C-terminal domain phosphatase-like 1 isoform 3 OS=Theobroma cacao GN=TCM_029910 PE=4 SV=1)
HSP 1 Score: 1080.5 bits (2793), Expect = 0.0e+00
Identity = 586/830 (70.60%), Positives = 662/830 (79.76%), Query Frame = 1
Query: 1 MYKSVVYQGDELLGEVEIYPE----------EKNGYKNI-----EVKEIRISHFSQPSER 60
MYKSVVY+G+E+LGEVEIYP+ E+ + I E+KEIRI + +Q SER
Sbjct: 4 MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63
Query: 61 CPPLAVLHTIAASGICFKMESKT----SQSQDMP-LHLLHSSCIMENKSAIMVFGMEELH 120
CPPLAVLHTI +SGICFKMES S SQD P LHLLHS CI +NK+A+M G ELH
Sbjct: 64 CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123
Query: 121 LVAMYSRDHDKQYPCFWGFNVAMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDK 180
LVAMYSR+ D+ PCFWGFNV+ GLY+SCL+MLNLRCLGIVFDLDETL+VANTMRSFED+
Sbjct: 124 LVAMYSRNSDR--PCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 183
Query: 181 IEALQRKISSEVDPQRTAGMLAEVERYQDDKLILKQYAENDQVIENGKVIKSQSEVVPAL 240
IEALQRK+++EVDPQR AGM+AE++RYQDDK ILKQYAENDQV+ENGKVIK QSEVVPAL
Sbjct: 184 IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 243
Query: 241 SDNHQPFVRPLIRLHEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 300
SDNHQP +RPLIRL EKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV
Sbjct: 244 SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 303
Query: 301 CTMAERDYALEMWRLLDPDSNLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVI 360
CTMAERDYALEMWRLLDP+SNLIN KELLDRIVCVKSGSRKSLFNVFQDG CHPKMALVI
Sbjct: 304 CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 363
Query: 361 DDRLKVWDEKDQPRVHVVPAFAPYYAPNAEGNNVVPVLCVARNVACYVRGGFFREFDEVL 420
DDRLKVWDEKDQPRVHVVPAFAPYYAP AE NN +PVLCVARNVAC VRGGFFREFDE L
Sbjct: 364 DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 423
Query: 421 LQKIYNISYEDDANDIPSPPDVSNYLGSEDEYSVSNGNKDTLTFDGMSDMEVDRRMKDAF 480
LQ+I ISYEDD DIPSPPDV NYL SED+ S NGNKD L FDGM+D EV+RR+K+A
Sbjct: 424 LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 483
Query: 481 LASSTVNSA----DPRV-PSLQYTMASASGTVPVPPYYPNM--------PL--PHVDSVA 540
A+STV+SA DPR+ PSLQYTM S+S ++P P++ PL P V VA
Sbjct: 484 SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 543
Query: 541 QVAASEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRERQSSEPAFLGRPP--P 600
VA EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTR+ EPAF PP P
Sbjct: 544 PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAF---PPVRP 603
Query: 601 LPQVVGPRAQPRGSWSPMEEEMSPLQLSWTR-KEFPVDEEPIR-EKHRSNHPSFFPKNDS 660
QV PR Q RGSW EEEMSP QL+ KEFP+D E + EKHR HP FFPK +S
Sbjct: 604 TMQVSVPRGQSRGSWFAAEEEMSPRQLNRAAPKEFPLDSERMHIEKHR--HPPFFPKVES 663
Query: 661 SFPPDRIPHENQRLSKEAFYRDDRVRVSRRPSSYPAFSGDEIPMNQSSSRSRENDIESGR 720
S P DR+ ENQRLSKEA +RDDR+ ++ PSSY +FSG+E+P++QSSS R+ D ESGR
Sbjct: 664 SIPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGR 723
Query: 721 SIWS-ETPVGALQEIAMKFGTKVEFKPALVSSTDLQFAVEAWFVGEKIGEGIGKTRREAQ 780
++ S ET G LQ+IAMK G KVEF+PALV+S DLQF++EAWF GEK+GEG+G+TRREAQ
Sbjct: 724 TVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQ 783
Query: 781 RHAAEGSIKNLANVYVSRCKADSTSA-NDMNKFPNDNGSGKRMRTDFHGN 790
R AAE SIKNLAN Y+SR K DS SA D+++ N N +G + GN
Sbjct: 784 RQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGN 826
BLAST of CmoCh03G004900 vs. TrEMBL
Match:
A0A061GFW4_THECC (C-terminal domain phosphatase-like 1 isoform 2 OS=Theobroma cacao GN=TCM_029910 PE=4 SV=1)
HSP 1 Score: 1080.5 bits (2793), Expect = 0.0e+00
Identity = 586/830 (70.60%), Positives = 662/830 (79.76%), Query Frame = 1
Query: 1 MYKSVVYQGDELLGEVEIYPE----------EKNGYKNI-----EVKEIRISHFSQPSER 60
MYKSVVY+G+E+LGEVEIYP+ E+ + I E+KEIRI + +Q SER
Sbjct: 4 MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63
Query: 61 CPPLAVLHTIAASGICFKMESKT----SQSQDMP-LHLLHSSCIMENKSAIMVFGMEELH 120
CPPLAVLHTI +SGICFKMES S SQD P LHLLHS CI +NK+A+M G ELH
Sbjct: 64 CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123
Query: 121 LVAMYSRDHDKQYPCFWGFNVAMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDK 180
LVAMYSR+ D+ PCFWGFNV+ GLY+SCL+MLNLRCLGIVFDLDETL+VANTMRSFED+
Sbjct: 124 LVAMYSRNSDR--PCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 183
Query: 181 IEALQRKISSEVDPQRTAGMLAEVERYQDDKLILKQYAENDQVIENGKVIKSQSEVVPAL 240
IEALQRK+++EVDPQR AGM+AE++RYQDDK ILKQYAENDQV+ENGKVIK QSEVVPAL
Sbjct: 184 IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 243
Query: 241 SDNHQPFVRPLIRLHEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 300
SDNHQP +RPLIRL EKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV
Sbjct: 244 SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 303
Query: 301 CTMAERDYALEMWRLLDPDSNLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVI 360
CTMAERDYALEMWRLLDP+SNLIN KELLDRIVCVKSGSRKSLFNVFQDG CHPKMALVI
Sbjct: 304 CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 363
Query: 361 DDRLKVWDEKDQPRVHVVPAFAPYYAPNAEGNNVVPVLCVARNVACYVRGGFFREFDEVL 420
DDRLKVWDEKDQPRVHVVPAFAPYYAP AE NN +PVLCVARNVAC VRGGFFREFDE L
Sbjct: 364 DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 423
Query: 421 LQKIYNISYEDDANDIPSPPDVSNYLGSEDEYSVSNGNKDTLTFDGMSDMEVDRRMKDAF 480
LQ+I ISYEDD DIPSPPDV NYL SED+ S NGNKD L FDGM+D EV+RR+K+A
Sbjct: 424 LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 483
Query: 481 LASSTVNSA----DPRV-PSLQYTMASASGTVPVPPYYPNM--------PL--PHVDSVA 540
A+STV+SA DPR+ PSLQYTM S+S ++P P++ PL P V VA
Sbjct: 484 SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 543
Query: 541 QVAASEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRERQSSEPAFLGRPP--P 600
VA EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTR+ EPAF PP P
Sbjct: 544 PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAF---PPVRP 603
Query: 601 LPQVVGPRAQPRGSWSPMEEEMSPLQLSWTR-KEFPVDEEPIR-EKHRSNHPSFFPKNDS 660
QV PR Q RGSW EEEMSP QL+ KEFP+D E + EKHR HP FFPK +S
Sbjct: 604 TMQVSVPRGQSRGSWFAAEEEMSPRQLNRAAPKEFPLDSERMHIEKHR--HPPFFPKVES 663
Query: 661 SFPPDRIPHENQRLSKEAFYRDDRVRVSRRPSSYPAFSGDEIPMNQSSSRSRENDIESGR 720
S P DR+ ENQRLSKEA +RDDR+ ++ PSSY +FSG+E+P++QSSS R+ D ESGR
Sbjct: 664 SIPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGR 723
Query: 721 SIWS-ETPVGALQEIAMKFGTKVEFKPALVSSTDLQFAVEAWFVGEKIGEGIGKTRREAQ 780
++ S ET G LQ+IAMK G KVEF+PALV+S DLQF++EAWF GEK+GEG+G+TRREAQ
Sbjct: 724 TVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQ 783
Query: 781 RHAAEGSIKNLANVYVSRCKADSTSA-NDMNKFPNDNGSGKRMRTDFHGN 790
R AAE SIKNLAN Y+SR K DS SA D+++ N N +G + GN
Sbjct: 784 RQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGN 826
BLAST of CmoCh03G004900 vs. TrEMBL
Match:
A0A061GGL6_THECC (C-terminal domain phosphatase-like 1 isoform 1 OS=Theobroma cacao GN=TCM_029910 PE=4 SV=1)
HSP 1 Score: 1080.5 bits (2793), Expect = 0.0e+00
Identity = 586/830 (70.60%), Positives = 662/830 (79.76%), Query Frame = 1
Query: 1 MYKSVVYQGDELLGEVEIYPE----------EKNGYKNI-----EVKEIRISHFSQPSER 60
MYKSVVY+G+E+LGEVEIYP+ E+ + I E+KEIRI + +Q SER
Sbjct: 4 MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63
Query: 61 CPPLAVLHTIAASGICFKMESKT----SQSQDMP-LHLLHSSCIMENKSAIMVFGMEELH 120
CPPLAVLHTI +SGICFKMES S SQD P LHLLHS CI +NK+A+M G ELH
Sbjct: 64 CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123
Query: 121 LVAMYSRDHDKQYPCFWGFNVAMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDK 180
LVAMYSR+ D+ PCFWGFNV+ GLY+SCL+MLNLRCLGIVFDLDETL+VANTMRSFED+
Sbjct: 124 LVAMYSRNSDR--PCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 183
Query: 181 IEALQRKISSEVDPQRTAGMLAEVERYQDDKLILKQYAENDQVIENGKVIKSQSEVVPAL 240
IEALQRK+++EVDPQR AGM+AE++RYQDDK ILKQYAENDQV+ENGKVIK QSEVVPAL
Sbjct: 184 IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 243
Query: 241 SDNHQPFVRPLIRLHEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 300
SDNHQP +RPLIRL EKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV
Sbjct: 244 SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 303
Query: 301 CTMAERDYALEMWRLLDPDSNLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVI 360
CTMAERDYALEMWRLLDP+SNLIN KELLDRIVCVKSGSRKSLFNVFQDG CHPKMALVI
Sbjct: 304 CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 363
Query: 361 DDRLKVWDEKDQPRVHVVPAFAPYYAPNAEGNNVVPVLCVARNVACYVRGGFFREFDEVL 420
DDRLKVWDEKDQPRVHVVPAFAPYYAP AE NN +PVLCVARNVAC VRGGFFREFDE L
Sbjct: 364 DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 423
Query: 421 LQKIYNISYEDDANDIPSPPDVSNYLGSEDEYSVSNGNKDTLTFDGMSDMEVDRRMKDAF 480
LQ+I ISYEDD DIPSPPDV NYL SED+ S NGNKD L FDGM+D EV+RR+K+A
Sbjct: 424 LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 483
Query: 481 LASSTVNSA----DPRV-PSLQYTMASASGTVPVPPYYPNM--------PL--PHVDSVA 540
A+STV+SA DPR+ PSLQYTM S+S ++P P++ PL P V VA
Sbjct: 484 SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 543
Query: 541 QVAASEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRERQSSEPAFLGRPP--P 600
VA EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTR+ EPAF PP P
Sbjct: 544 PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAF---PPVRP 603
Query: 601 LPQVVGPRAQPRGSWSPMEEEMSPLQLSWTR-KEFPVDEEPIR-EKHRSNHPSFFPKNDS 660
QV PR Q RGSW EEEMSP QL+ KEFP+D E + EKHR HP FFPK +S
Sbjct: 604 TMQVSVPRGQSRGSWFAAEEEMSPRQLNRAAPKEFPLDSERMHIEKHR--HPPFFPKVES 663
Query: 661 SFPPDRIPHENQRLSKEAFYRDDRVRVSRRPSSYPAFSGDEIPMNQSSSRSRENDIESGR 720
S P DR+ ENQRLSKEA +RDDR+ ++ PSSY +FSG+E+P++QSSS R+ D ESGR
Sbjct: 664 SIPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGR 723
Query: 721 SIWS-ETPVGALQEIAMKFGTKVEFKPALVSSTDLQFAVEAWFVGEKIGEGIGKTRREAQ 780
++ S ET G LQ+IAMK G KVEF+PALV+S DLQF++EAWF GEK+GEG+G+TRREAQ
Sbjct: 724 TVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQ 783
Query: 781 RHAAEGSIKNLANVYVSRCKADSTSA-NDMNKFPNDNGSGKRMRTDFHGN 790
R AAE SIKNLAN Y+SR K DS SA D+++ N N +G + GN
Sbjct: 784 RQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGN 826
BLAST of CmoCh03G004900 vs. TrEMBL
Match:
A0A067JAV3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21412 PE=4 SV=1)
HSP 1 Score: 1073.9 bits (2776), Expect = 8.6e-311
Identity = 577/826 (69.85%), Positives = 668/826 (80.87%), Query Frame = 1
Query: 1 MYKSVVYQGDELLGEVEIYP------EEKNGYKNI--EV---KEIRISHFSQPSERCPPL 60
MYKS VY+G+ELLGEVEIYP EE+N K + E+ KEIRISHFSQPSERCPPL
Sbjct: 1 MYKSAVYKGEELLGEVEIYPQQHQQQEEENNKKKLIDEILMGKEIRISHFSQPSERCPPL 60
Query: 61 AVLHTIAASGICFKMESKTSQSQDMPLHLLHSSCIMENKSAIMVFGMEELHLVAMYSRDH 120
AVLHTI G+CFKMESK S S D PLHLLHSSCI ENK+A++ G EELHLVA+YSR++
Sbjct: 61 AVLHTITC-GMCFKMESKNSLSLDTPLHLLHSSCIQENKTAVVPLGGEELHLVAIYSRNN 120
Query: 121 DKQYPCFWGFNVAMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKIEALQRKIS 180
++QYPCFWGFNV+ GLYNSCLVMLNLRCLGIVFDLDETL+VANTMRSFED+IEALQRKI+
Sbjct: 121 ERQYPCFWGFNVSAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKIN 180
Query: 181 SEVDPQRTAGMLAEVERYQDDKLILKQYAENDQVIENGKVIKSQSEVVPALSDNHQPFVR 240
+EVDPQR AGML+EV+RYQDDK ILKQY ENDQVIENG+VIK+Q EVVPALSDNHQ VR
Sbjct: 181 TEVDPQRIAGMLSEVKRYQDDKTILKQYVENDQVIENGRVIKTQFEVVPALSDNHQTIVR 240
Query: 241 PLIRLHEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYA 300
PLIRL E+NIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYA
Sbjct: 241 PLIRLQERNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYA 300
Query: 301 LEMWRLLDPDSNLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDE 360
LEMWRLLDP+SNLI+ KELLDRIVCVKSG RKSLFNVFQDG CHPKMALVIDDRLKVWDE
Sbjct: 301 LEMWRLLDPESNLISSKELLDRIVCVKSGLRKSLFNVFQDGVCHPKMALVIDDRLKVWDE 360
Query: 361 KDQPRVHVVPAFAPYYAPNAEGNNVVPVLCVARNVACYVRGGFFREFDEVLLQKIYNISY 420
KDQPRVHVVPAFAPYYAP AE NN VPVLCVARNVAC VRGGFF+EFDE LLQ+I +ISY
Sbjct: 361 KDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDEGLLQRIPDISY 420
Query: 421 EDDANDIPSPPDVSNYLGSEDEYSVSNGNKDTLTFDGMSDMEVDRRMKDAFLASS----T 480
EDD NDIPSPPDVS+YL SED+ S SNG++D L+FDGM+D EV++R+K+A A+S T
Sbjct: 421 EDDFNDIPSPPDVSSYLISEDDASTSNGHRDPLSFDGMADAEVEKRLKEAISAASLFPAT 480
Query: 481 VNSADPRV-PSLQYTMASASGTVPVPPYYP------NMPLPH----VDSVAQVAASEPSL 540
VN+ DPRV P+LQY++AS+S ++PV P N+ P V +AQV EPSL
Sbjct: 481 VNNLDPRVIPALQYSLASSSSSIPVSTSQPLVMPFSNIQFPQAASLVKPLAQVGPPEPSL 540
Query: 541 QSSPAREEGEVPESELDPDTRRRLLILQHGQDTRERQSSEPAFLGRPPPLPQVVGPRAQP 600
QSSPAREEGEVPESELDPDTRRRLLILQHGQDTR+ SSE RP QV PR Q
Sbjct: 541 QSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDNVSSESQIPVRPS--MQVSVPRVQS 600
Query: 601 RGSWSPMEEEMSPLQLSWT-RKEFPVDEEPIR-EKHRSNHPSFFPKNDSSFPPDR--IPH 660
RGSW P+EEEMSP QL+ T +EFP++ EP+ EKH+ +HPSFFPK ++ DR + +
Sbjct: 601 RGSWVPVEEEMSPRQLNLTVPREFPLELEPMHIEKHQPHHPSFFPKVENPISSDRMGMVN 660
Query: 661 ENQRLSKEAFYRDDRVRVSRRPSSYPAFSGDEIPMNQSSSRSRENDIESGRSIWS-ETPV 720
EN RL K A YRDDR+R + ++Y SG+EIP+++SSS +R+ D ES R++ S ETPV
Sbjct: 661 ENLRLPKAAPYRDDRLRSNHTMANYHPLSGEEIPLSRSSSSNRDPDFESERAVSSAETPV 720
Query: 721 GALQEIAMKFGTKVEFKPALVSSTDLQFAVEAWFVGEKIGEGIGKTRREAQRHAAEGSIK 780
ALQEIAMK G KVEF+ +LV S DLQF+ EAWF GE++GEGIGKTRREAQR AAE SIK
Sbjct: 721 EALQEIAMKCGAKVEFRASLVDSRDLQFSTEAWFAGERVGEGIGKTRREAQRLAAESSIK 780
Query: 781 NLANVYVSRCKADSTSAN-DMNKFPNDNGSGKRMRTDFHGNLPKPK 795
NLAN+Y+ R K D+ + + D +++ + N +G + G+ P PK
Sbjct: 781 NLANIYMQRAKPDNGAMHGDASRYSSANDNGYLGNVNSFGSQPLPK 823
BLAST of CmoCh03G004900 vs. TAIR10
Match:
AT4G21670.1 (AT4G21670.1 C-terminal domain phosphatase-like 1)
HSP 1 Score: 909.4 bits (2349), Expect = 1.5e-264
Identity = 503/812 (61.95%), Positives = 599/812 (73.77%), Query Frame = 1
Query: 6 VYQGDELLGEVEIYPE-----------EKNGYKNIEVKE-----IRISHFSQPSERCPPL 65
V+ GD LGE+EIYP ++ K EV E IRISHFSQ ERCPPL
Sbjct: 9 VFHGDGRLGELEIYPSRELNQQQDDVMKQRKKKQREVMELAKMGIRISHFSQSGERCPPL 68
Query: 66 AVLHTIAASGICFKMESKTSQSQDMPLHLLHSSCIMENKSAIMVFGMEELHLVAMYSRDH 125
A+L TI++ G+CFK+E+ S +Q+ L L +SSC+ +NK+A+M+ G EELHLVAMYS +
Sbjct: 69 AILTTISSCGLCFKLEASPSPAQES-LSLFYSSCLRDNKTAVMLLGGEELHLVAMYSENI 128
Query: 126 DKQYPCFWGFNVAMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKIEALQRKIS 185
PCFW F+VA G+Y+SCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKI+ QR+I+
Sbjct: 129 KNDRPCFWAFSVAPGIYDSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKIDGFQRRIN 188
Query: 186 SEVDPQRTAGMLAEVERYQDDKLILKQYAENDQVIENGKVIKSQSEVVPALSDNHQPFVR 245
+E+DPQR A ++AE++RYQDDK +LKQY E+DQV+ENG+VIK QSE+VPALSDNHQP VR
Sbjct: 189 NEMDPQRLAVIVAEMKRYQDDKNLLKQYIESDQVVENGEVIKVQSEIVPALSDNHQPLVR 248
Query: 246 PLIRLHEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYA 305
PLIRL EKNIILTRINP IRDTSVLVR+RP+WE+LRSYLTA+GRKRFEVYVCTMAERDYA
Sbjct: 249 PLIRLQEKNIILTRINPMIRDTSVLVRMRPSWEELRSYLTAKGRKRFEVYVCTMAERDYA 308
Query: 306 LEMWRLLDPDSNLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDE 365
LEMWRLLDP+ NLIN +LL RIVCVKSG +KSLFNVF DG CHPKMALVIDDRLKVWDE
Sbjct: 309 LEMWRLLDPEGNLINTNDLLARIVCVKSGFKKSLFNVFLDGTCHPKMALVIDDRLKVWDE 368
Query: 366 KDQPRVHVVPAFAPYYAPNAEGNNVVPVLCVARNVACYVRGGFFREFDEVLLQKIYNISY 425
KDQPRVHVVPAFAPYY+P AE PVLCVARNVAC VRGGFFR+FD+ LL +I ISY
Sbjct: 369 KDQPRVHVVPAFAPYYSPQAEA-AATPVLCVARNVACGVRGGFFRDFDDSLLPRIAEISY 428
Query: 426 EDDANDIPSPPDVSNYLGSEDEYSVSNGNKDTLTFDGMSDMEVDRRMKDAFLASSTVNSA 485
E+DA DIPSPPDVS+YL SED+ S NGNKD L+FDGM+D EV+RR+K+A ASS V A
Sbjct: 429 ENDAEDIPSPPDVSHYLVSEDDTSGLNGNKDPLSFDGMADTEVERRLKEAISASSAVLPA 488
Query: 486 ---DPRVPS-LQYTMASASG-TVPVPPY------------YPNMPLPHVDSVAQVA---- 545
DPR+ + +Q+ MASAS +VPVP +P++P +A
Sbjct: 489 ANIDPRIAAPVQFPMASASSVSVPVPVQVVQQAIQPSAMAFPSIPFQQPQQPTSIAKHLV 548
Query: 546 ASEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRERQSSEPAFLGRPPPLPQVV 605
SEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTR+ SEP+F RPP Q
Sbjct: 549 PSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDPAPSEPSFPQRPP--VQAP 608
Query: 606 GPRAQPRGSWSPMEEEMSPLQL-SWTRKEFPVDEEPIR-EKHRSNHPSFFPKNDSSFPPD 665
Q R W P+EEEM P Q+ KE+P+D E I EKHR HPSFF K D+S D
Sbjct: 609 PSHVQSRNGWFPVEEEMDPAQIRRAVSKEYPLDSEMIHMEKHRPRHPSFFSKIDNSTQSD 668
Query: 666 RIPHENQRLSKEAFYRDDRVRVSRR-PSSYPAFSGDEIPMNQSSSRSRENDIESGRSI-W 725
R+ HEN+R KE+ RD+++R + P S+P F G++ NQSSSR+ + D RS+
Sbjct: 669 RMLHENRRPPKESLRRDEQLRSNNNLPDSHP-FYGEDASWNQSSSRNSDLDFLPERSVSA 728
Query: 726 SETPVGALQEIAMKFGTKVEFKPALVSSTDLQFAVEAWFVGEKIGEGIGKTRREAQRHAA 777
+ET L IA+K G KVE+KP+LVSSTDL+F+VEAW +KIGEGIGK+RREA AA
Sbjct: 729 TETSADVLHGIAIKCGAKVEYKPSLVSSTDLRFSVEAWLSNQKIGEGIGKSRREALHKAA 788
BLAST of CmoCh03G004900 vs. TAIR10
Match:
AT5G01270.2 (AT5G01270.2 carboxyl-terminal domain (ctd) phosphatase-like 2)
HSP 1 Score: 595.9 bits (1535), Expect = 3.7e-170
Identity = 363/785 (46.24%), Positives = 476/785 (60.64%), Query Frame = 1
Query: 2 YKSVVYQGDELLGEVEIYPEEKNGYKNIEVKEIRISHFSQPSERCPPLAVLHTIAASGIC 61
+KSVVY GD LGE+++ + EIRI H S ERCPPLA+L TIA+ +
Sbjct: 6 HKSVVYHGDLRLGELDVNHVSSSHEFRFPNDEIRIHHLSPAGERCPPLAILQTIASFAVR 65
Query: 62 FKMESKTSQSQDMPLHLLHSSCIMENKSAIMVFGMEELHLVAMYSRDHDKQYPCFWGFNV 121
K+ES +HL H+ C E K+A+++ G EE+HLVAM S++ K++PCFW F+V
Sbjct: 66 CKLESSAPVKSQELMHL-HAVCFHELKTAVVMLGDEEIHLVAMPSKE--KKFPCFWCFSV 125
Query: 122 AMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKIEALQRKISSEVDPQRTAGML 181
GLY+SCL MLN RCL IVFDLDETL+VANTM+SFED+IEAL+ IS E+DP R GM
Sbjct: 126 PSGLYDSCLRMLNTRCLSIVFDLDETLIVANTMKSFEDRIEALKSWISREMDPVRINGMS 185
Query: 182 AEVERYQDDKLILKQYAENDQVIENGKVIKSQSEVVPALSDNHQPFVRPLIRLHEKNIIL 241
AE++RY DD+++LKQY +ND +NG ++K+Q E V SD + RP+IRL EKN +L
Sbjct: 186 AELKRYMDDRMLLKQYIDNDYAFDNGVLLKAQPEEVRPTSDGQEKVCRPVIRLPEKNTVL 245
Query: 242 TRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSN 301
TRI P+IRDTSVLV+LRPAWE+LRSYLTA+ RKRFEVYVCTMAERDYALEMWRLLDP+++
Sbjct: 246 TRIKPEIRDTSVLVKLRPAWEELRSYLTAKTRKRFEVYVCTMAERDYALEMWRLLDPEAH 305
Query: 302 LINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPAF 361
LI+ KEL DRIVCVK ++KSL +VF G CHPKMA+VIDDR+KVW++KDQPRVHVV A+
Sbjct: 306 LISLKELRDRIVCVKPDAKKSLLSVFNGGICHPKMAMVIDDRMKVWEDKDQPRVHVVSAY 365
Query: 362 APYYAPNAEGNNVVPVLCVARNVACYVRGGFFREFDEVLLQKIYNISYEDDANDIPSPPD 421
PYYAP AE VVP LCVARNVAC VRG FF+EFDE L+ I + YEDD ++P PD
Sbjct: 366 LPYYAPQAETALVVPHLCVARNVACNVRGYFFKEFDESLMSSISLVYYEDDVENLPPSPD 425
Query: 422 VSNYLGSEDEYSVSNGNKDTLTF-DGMSDMEVDRRMKDAFLASSTVNSADPRVPSLQYTM 481
VSNY+ ED SNGN + +GM EV+RR+ A A + A
Sbjct: 426 VSNYVVIEDPGFASNGNINAPPINEGMCGGEVERRLNQAAAADHSTLPA----------- 485
Query: 482 ASASGTVPVPPYYPNMPLPHVDSVAQVAA----SEPSLQSSPAREEGEVPESELDPDTRR 541
S + P P +P+ S A AA +PSL +P R+ +
Sbjct: 486 TSNAEQKPETPKPQIAVIPNNASTATAAALLPSHKPSLLGAPRRDGFTFSDGG------- 545
Query: 542 RLLILQHGQDTRERQSSEPAFLGRPPPLPQVVGPRAQPRGSWSPMEEEMSPLQLSWTRKE 601
R L+++ G D R + ++P L + P P + G W +E R
Sbjct: 546 RPLMMRPGVDIRNQNFNQPPILAKIPMQPPSSSMHSP--GGWLVDDEN---------RPS 605
Query: 602 FPVDEEPIREKHRSNHPSFFPKND-SSFPPDRIPHENQRLSKEAFYRDDRVRVSRRPSSY 661
FP + +PS FP S P H + S+E DD R + PS
Sbjct: 606 FPGRPSGL-------YPSQFPHGTPGSAPVGPFAHPSHLRSEEVAMDDDLKR--QNPSRQ 665
Query: 662 PAFSGDEIPMNQSSSRSRENDIESGRSIWSETP--VGALQEIAMKFGTKVEFKPALVSST 721
G I N S RE+ + G+S ++ V ALQEI + G+KVEF+ + ++
Sbjct: 666 TTEGG--ISQNHLVSNGREHHTDGGKSNGGQSHLFVSALQEIGRRCGSKVEFRTVISTNK 725
Query: 722 DLQFAVEAWFVGEKIGEGIGKTRREAQRHAAEGSIKNLANVYVSRCKADSTSANDMNKFP 778
+LQF+VE F GEKIG G+ KT+++A + AAE ++++LA YV+ + A + K P
Sbjct: 726 ELQFSVEVLFTGEKIGIGMAKTKKDAHQQAAENALRSLAEKYVAHV---APLARETEKGP 744
BLAST of CmoCh03G004900 vs. NCBI nr
Match:
gi|659078741|ref|XP_008439881.1| (PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Cucumis melo])
HSP 1 Score: 1416.4 bits (3665), Expect = 0.0e+00
Identity = 713/804 (88.68%), Positives = 752/804 (93.53%), Query Frame = 1
Query: 1 MYKSVVYQGDELLGEVEIYPEEKNGYKNIEVKEIRISHFSQPSERCPPLAVLHTIAASGI 60
MYKSVVY GDELLG+VEIYPEEKNGYKNI+VKEIRISHFSQPSERCPPLAVLHTIAASGI
Sbjct: 1 MYKSVVYHGDELLGDVEIYPEEKNGYKNIDVKEIRISHFSQPSERCPPLAVLHTIAASGI 60
Query: 61 CFKMESKTSQSQDMPLHLLHSSCIMENKSAIMVFGMEELHLVAMYSRDHDKQYPCFWGFN 120
CFKMESKTSQSQD PL+LLHSSCIMENK+AIM+FG+EELHLVAM+SRD D+QYPCFWGFN
Sbjct: 61 CFKMESKTSQSQDTPLNLLHSSCIMENKTAIMMFGVEELHLVAMFSRDLDRQYPCFWGFN 120
Query: 121 VAMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKIEALQRKISSEVDPQRTAGM 180
VAMGLYNSCL MLNLRCLGIVFDLDETLVVANTMRSFED+IEALQRKISSEVDPQR GM
Sbjct: 121 VAMGLYNSCLDMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRANGM 180
Query: 181 LAEVERYQDDKLILKQYAENDQVIENGKVIKSQSEVVPALSDNHQPFVRPLIRLHEKNII 240
LAEV+RYQDDK+ILKQYAENDQVIENGKVIKSQSEVVPALSDNHQP VRPLIRLHEKNII
Sbjct: 181 LAEVKRYQDDKIILKQYAENDQVIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEKNII 240
Query: 241 LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 300
LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS
Sbjct: 241 LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 300
Query: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360
NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA
Sbjct: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360
Query: 361 FAPYYAPNAEGNNVVPVLCVARNVACYVRGGFFREFDEVLLQKIYNISYEDDANDIPSPP 420
F+PYYAPNAEGNN +PVLCVARNVAC VRGGFF+EFD++LLQKI +ISYED NDIPSPP
Sbjct: 361 FSPYYAPNAEGNNAIPVLCVARNVACNVRGGFFKEFDDILLQKISDISYEDGVNDIPSPP 420
Query: 421 DVSNYLGSEDEYSVSNGNKDTLTFDGMSDMEVDRRMKDAFLASSTVNSADPRVPSLQYTM 480
DVSNYL SEDEYS++NGNKD TFDGM DMEVDRRMKDAFLASST+NSADPRV SLQYTM
Sbjct: 421 DVSNYLVSEDEYSIANGNKDIPTFDGMPDMEVDRRMKDAFLASSTINSADPRVSSLQYTM 480
Query: 481 ASASGTVP-------VPPYYPNMPLPHVDSVAQVAASEPSLQSSPAREEGEVPESELDPD 540
ASASG VP +PPY+PNMP+PHV+SVA VA +EPSLQSSPAREEGEVPESELDPD
Sbjct: 481 ASASGAVPLPPKQVSMPPYFPNMPIPHVNSVAHVAPNEPSLQSSPAREEGEVPESELDPD 540
Query: 541 TRRRLLILQHGQDTRERQSSEPAFLGRPPPLPQVVGPRAQPRGSWSPMEEEMSPLQLSWT 600
TRRRLLILQHGQDTRER SSEPAF GRPPPL QV PRAQ RGSWSPMEEEMSP QLS T
Sbjct: 541 TRRRLLILQHGQDTRERLSSEPAFPGRPPPLQQVAAPRAQSRGSWSPMEEEMSPRQLSRT 600
Query: 601 -RKEFPVDEE--PIREKHRSNHPSFFPKNDSSFPPDRIPHENQRLSKEAFYRDDRVRVSR 660
RKEFPVD E P+REKHRSNHPSFFPK D+ PDRIPHENQRL K AFYRDDR+RVSR
Sbjct: 601 ARKEFPVDAEPMPMREKHRSNHPSFFPKVDNPILPDRIPHENQRLPKGAFYRDDRMRVSR 660
Query: 661 RPSSYPAFSGDEIPMNQSSSRSRENDIESGRSIWSETPVGALQEIAMKFGTKVEFKPALV 720
RPSSYPAF G+EIPMNQSSSRSR++DIESGRSIWSETPVGALQEIAMKFGTKVEFKP LV
Sbjct: 661 RPSSYPAFPGEEIPMNQSSSRSRDDDIESGRSIWSETPVGALQEIAMKFGTKVEFKPGLV 720
Query: 721 SSTDLQFAVEAWFVGEKIGEGIGKTRREAQRHAAEGSIKNLANVYVSRCKADSTSANDMN 780
STDLQF+VEAWFVGEKIGEGIG TRR+AQRHAAEGSIKNLAN+YVSRCKAD++SANDMN
Sbjct: 721 PSTDLQFSVEAWFVGEKIGEGIGNTRRDAQRHAAEGSIKNLANIYVSRCKADTSSANDMN 780
Query: 781 KFPNDNGSGKRMRTDFHGNLPKPK 795
KFP+DNGSGKRM+ DFH +LPK K
Sbjct: 781 KFPSDNGSGKRMKLDFHRHLPKTK 804
BLAST of CmoCh03G004900 vs. NCBI nr
Match:
gi|449433867|ref|XP_004134718.1| (PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Cucumis sativus])
HSP 1 Score: 1410.2 bits (3649), Expect = 0.0e+00
Identity = 711/803 (88.54%), Positives = 752/803 (93.65%), Query Frame = 1
Query: 1 MYKSVVYQGDELLGEVEIYPEEKNGYKNIEVKEIRISHFSQPSERCPPLAVLHTIAASGI 60
MYKSVVY GDELLG+VEIYPEEKNGYKNIEVKEIRI+HFSQPSERCPPLAVLHTIAASGI
Sbjct: 1 MYKSVVYHGDELLGDVEIYPEEKNGYKNIEVKEIRITHFSQPSERCPPLAVLHTIAASGI 60
Query: 61 CFKMESKTSQSQDMPLHLLHSSCIMENKSAIMVFGMEELHLVAMYSRDHDKQYPCFWGFN 120
CFKMESKTSQSQD PL+LLHSSCIMENK+AIM+FG+EELHLVAM+SRD DKQYPCFWGFN
Sbjct: 61 CFKMESKTSQSQDTPLNLLHSSCIMENKTAIMMFGVEELHLVAMFSRDLDKQYPCFWGFN 120
Query: 121 VAMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKIEALQRKISSEVDPQRTAGM 180
VAMGLYNSCL MLNLRCLGIVFDLDETLVVANTMRSFED+IEALQRKISSEVDPQR GM
Sbjct: 121 VAMGLYNSCLDMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKISSEVDPQRANGM 180
Query: 181 LAEVERYQDDKLILKQYAENDQVIENGKVIKSQSEVVPALSDNHQPFVRPLIRLHEKNII 240
LAEV+RYQDDK+ILKQYAENDQVIENGKVIKSQSEVVPALSDNHQP VRPLIRLHEKNII
Sbjct: 181 LAEVKRYQDDKIILKQYAENDQVIENGKVIKSQSEVVPALSDNHQPVVRPLIRLHEKNII 240
Query: 241 LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 300
LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS
Sbjct: 241 LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 300
Query: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360
NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA
Sbjct: 301 NLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVPA 360
Query: 361 FAPYYAPNAEGNNVVPVLCVARNVACYVRGGFFREFDEVLLQKIYNISYEDDANDIPSPP 420
FAPYYAPNAEGNN +PVLCVARNVAC VRGGFF+EFD++LLQKI +ISYEDD NDIPSPP
Sbjct: 361 FAPYYAPNAEGNNAIPVLCVARNVACNVRGGFFKEFDDILLQKISDISYEDDVNDIPSPP 420
Query: 421 DVSNYLGSEDEYSVSNGNKDTLTFDGMSDMEVDRRMKDAFLASSTVNSADPRVPSLQYTM 480
DVSNYL SEDEYS++NGNKD TFDGM DMEVDRRMKDAFLASST+NSADPRV SLQYTM
Sbjct: 421 DVSNYLVSEDEYSIANGNKDMPTFDGMPDMEVDRRMKDAFLASSTINSADPRVSSLQYTM 480
Query: 481 ASASGTVPVP------PYYPNMPLPHVDSVAQVAASEPSLQSSPAREEGEVPESELDPDT 540
ASAS +VP+P PY+PNMPLPHV+SVA VA +EPSLQSSPAREEGEVPESELDPDT
Sbjct: 481 ASASCSVPLPPKQVTMPYFPNMPLPHVNSVAHVAPNEPSLQSSPAREEGEVPESELDPDT 540
Query: 541 RRRLLILQHGQDTRERQSSEPAFLGRPPPLPQVVGPRAQPRGSWSPMEEEMSPLQLSWT- 600
RRRLLILQHGQDTRER SSEPAF RPPPL QV PRAQ RG+WSPMEEEMSP QL+ +
Sbjct: 541 RRRLLILQHGQDTRERLSSEPAFPARPPPLQQVAAPRAQSRGNWSPMEEEMSPRQLNRSA 600
Query: 601 RKEFPVDEE--PIREKHRSNHPSFFPKNDSSFPPDRIPHENQRLSKEAFYRDDRVRVSRR 660
RK+FPVD E P+REKHRSNHPSFF K D+S PDRIPH+NQRL KEAFYRDDR+RVSRR
Sbjct: 601 RKDFPVDAEPMPMREKHRSNHPSFFAKVDNSILPDRIPHDNQRLPKEAFYRDDRMRVSRR 660
Query: 661 PSSYPAFSGDEIPMNQSSSRSRENDIESGRSIWSETPVGALQEIAMKFGTKVEFKPALVS 720
PSSYPAFSG+EIPMNQSSSRSR++DIESGRSIWSETPVGALQEIAMKFGTKVEFKP LV
Sbjct: 661 PSSYPAFSGEEIPMNQSSSRSRDDDIESGRSIWSETPVGALQEIAMKFGTKVEFKPGLVP 720
Query: 721 STDLQFAVEAWFVGEKIGEGIGKTRREAQRHAAEGSIKNLANVYVSRCKADSTSANDMNK 780
STDLQF+VEAWFVGEKIGEGIG TRR+AQR AAEGSIKNLAN+YVSRCKAD +SANDMNK
Sbjct: 721 STDLQFSVEAWFVGEKIGEGIGHTRRDAQRQAAEGSIKNLANIYVSRCKADPSSANDMNK 780
Query: 781 FPNDNGSGKRMRTDFHGNLPKPK 795
FP+DNGSGKRM+ DFH +LPK K
Sbjct: 781 FPSDNGSGKRMKLDFHRHLPKTK 803
BLAST of CmoCh03G004900 vs. NCBI nr
Match:
gi|645237091|ref|XP_008225045.1| (PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Prunus mume])
HSP 1 Score: 1114.4 bits (2881), Expect = 0.0e+00
Identity = 588/817 (71.97%), Positives = 671/817 (82.13%), Query Frame = 1
Query: 1 MYKSVVYQGDELLGEVEIYPEE---KNGYKNI--EVKEIRISHFSQPSERCPPLAVLHTI 60
MYKSVVY+G+ELLGEVEIYPEE KN KN+ E+KEIRIS+FSQ SERCPP+AVLHTI
Sbjct: 1 MYKSVVYKGEELLGEVEIYPEENENKNKNKNLVDELKEIRISYFSQSSERCPPVAVLHTI 60
Query: 61 AASGICFKMESKTSQSQDMPLHLLHSSCIMENKSAIMVFGMEELHLVAMYSRDHDKQYPC 120
++ G+CFKMESKTSQSQD PL LLHSSC+MENK+A+M G EELHLVAM+SR+ DK+YPC
Sbjct: 61 SSHGVCFKMESKTSQSQDTPLFLLHSSCVMENKTAVMPLGGEELHLVAMHSRNSDKRYPC 120
Query: 121 FWGFNVAMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKIEALQRKISSEVDPQ 180
FWGF+VA GLYNSCLVMLNLRCLGIVFDLDETL+VANTMRSFED+IEALQRKISSEVD Q
Sbjct: 121 FWGFSVAPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKISSEVDSQ 180
Query: 181 RTAGMLAEVERYQDDKLILKQYAENDQVIENGKVIKSQSEVVPALSDNHQPFVRPLIRLH 240
R +GMLAE++RYQDDK ILKQYAENDQV+ENG+VIK+QSE VPALSDNHQP +RPLIRL
Sbjct: 181 RISGMLAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEAVPALSDNHQPIIRPLIRLL 240
Query: 241 EKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL 300
EKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL
Sbjct: 241 EKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL 300
Query: 301 LDPDSNLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRV 360
LDPDSNLIN +LLDRIVCVKSGSRKSLFNVFQ+ CHPKMALVIDDRLKVWD++DQPRV
Sbjct: 301 LDPDSNLINSNKLLDRIVCVKSGSRKSLFNVFQESLCHPKMALVIDDRLKVWDDRDQPRV 360
Query: 361 HVVPAFAPYYAPNAEGNNVVPVLCVARNVACYVRGGFFREFDEVLLQKIYNISYEDDAND 420
HVVPAFAPYYAP AE NN VPVLCVARNVAC VRGGFFREFD+ LLQKI + YEDD D
Sbjct: 361 HVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFREFDDSLLQKIPEVFYEDDIKD 420
Query: 421 IPSPPDVSNYLGSEDEYSVSNGNKDTLTFDGMSDMEVDRRMKD----AFLASSTVNSADP 480
+PS PDVSNYL SED+ S NGN+D L FDG++D+EV+RRMK+ A + SS V S DP
Sbjct: 421 VPS-PDVSNYLVSEDDSSALNGNRDPLPFDGITDVEVERRMKEATSAASMVSSVVTSIDP 480
Query: 481 RVPSLQYTMASASGTVPVPP------YYPNMPLPHVDSVAQ----VAASEPSLQSSPARE 540
R+ SLQYT+A +S T+ +P +P++ P S+ + V ++EPSLQSSPARE
Sbjct: 481 RLASLQYTVAPSSSTLSLPTTQPSVMSFPSIQFPQAASLVKPLGHVGSTEPSLQSSPARE 540
Query: 541 EGEVPESELDPDTRRRLLILQHGQDTRERQSSEPAFLGRPPPLPQVVGPRAQPRGSWSPM 600
EGEVPESELDPDTRRRLLILQHGQDTR++ SEP F RPP V PRAQ R W P+
Sbjct: 541 EGEVPESELDPDTRRRLLILQHGQDTRDQPPSEPPFPVRPPMQASV--PRAQSRPGWFPV 600
Query: 601 EEEMSPLQLS-WTRKEFPVDEEPIR-EKHRSNHPSFFPKNDSSFPPDRIPHENQRLSKEA 660
EEEMSP QLS K+ P+D EP++ EKHR +H SFFPK ++S P DRI ENQRL KEA
Sbjct: 601 EEEMSPRQLSRMVPKDLPLDPEPVQIEKHRPHHSSFFPKVENSIPSDRILQENQRLPKEA 660
Query: 661 FYRDDRVRVSRRPSSYPAFSGDEIPMNQSSSRSRENDIESGRSIW-SETPVGALQEIAMK 720
F+RDDR+R + S Y + SG+EIP+++SSS +R+ D ESGR+I +ETP G LQEIAMK
Sbjct: 661 FHRDDRLRFNHALSGYHSLSGEEIPLSRSSSSNRDVDFESGRAISNAETPAGVLQEIAMK 720
Query: 721 FGTKVEFKPALVSSTDLQFAVEAWFVGEKIGEGIGKTRREAQRHAAEGSIKNLANVYVSR 780
G KVEF+PALV+S +LQF VEAWF GEKIGEG GKTRREA AAEGS+KNLAN+Y+SR
Sbjct: 721 CGAKVEFRPALVASMELQFYVEAWFAGEKIGEGSGKTRREAHYQAAEGSLKNLANIYLSR 780
Query: 781 CKADSTSAN-DMNKFPNDNGSGKRMRTDFHGNLPKPK 795
K DS S + DMNKFPN N +G + G P PK
Sbjct: 781 VKPDSVSVHGDMNKFPNVNSNGFAGNLNSFGIQPFPK 814
BLAST of CmoCh03G004900 vs. NCBI nr
Match:
gi|1009109431|ref|XP_015890182.1| (PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Ziziphus jujuba])
HSP 1 Score: 1087.0 bits (2810), Expect = 0.0e+00
Identity = 583/814 (71.62%), Positives = 660/814 (81.08%), Query Frame = 1
Query: 1 MYKSVVYQGDELLGEVEIYPEEKNGYKNIEV-KEIRISHFSQPSERCPPLAVLHTIAASG 60
MYKSVVY+G+E LGEVEI+P E + K I+ KEIRISHFSQ SERCPPLAVLHTI + G
Sbjct: 1 MYKSVVYKGEEFLGEVEIFPGENDNKKIIDDGKEIRISHFSQASERCPPLAVLHTITSCG 60
Query: 61 ICFKMESKTSQSQDMPLHLLHSSCIMENKSAIMVFGMEELHLVAMYSRDHDKQYPCFWGF 120
+CFKMESKTSQSQD PL LLHSSCI ENK+A+M+ G EELHLVAMYSR+ DKQYPCFWGF
Sbjct: 61 VCFKMESKTSQSQDTPLFLLHSSCIKENKTAVMLLGGEELHLVAMYSRNSDKQYPCFWGF 120
Query: 121 NVAMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKIEALQRKISSEVDPQRTAG 180
VA GLYNSCL +LNLRCLGIVFDLDETL+VANTMRSFED+IEALQRKISSE DPQR +G
Sbjct: 121 IVAFGLYNSCLGLLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKISSEADPQRISG 180
Query: 181 MLAEVERYQDDKLILKQYAENDQVIENGKVIKSQSEVVPALSDNHQPFVRPLIRLHEKNI 240
MLAEV+RYQDDK ILKQYA++DQV+ENG+VIK QSEVVPALSD + VRPLIRL+EKNI
Sbjct: 181 MLAEVKRYQDDKNILKQYADSDQVVENGRVIKIQSEVVPALSDTYTTLVRPLIRLNEKNI 240
Query: 241 ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPD 300
ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPD
Sbjct: 241 ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPD 300
Query: 301 SNLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVHVVP 360
SNLIN KELLDRIVCVKSG RKSLFNVFQ G CHPKMALVIDDRLKVWDEKDQPRVHVVP
Sbjct: 301 SNLINSKELLDRIVCVKSGLRKSLFNVFQGGLCHPKMALVIDDRLKVWDEKDQPRVHVVP 360
Query: 361 AFAPYYAPNAEGNNVVPVLCVARNVACYVRGGFFREFDEVLLQKIYNISYEDDANDIPSP 420
AFAPYYAP AE NN VPVLCVARNVAC VRGGFF++FD+ LLQKI +ISYEDD +IPSP
Sbjct: 361 AFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKDFDDGLLQKITDISYEDDVKEIPSP 420
Query: 421 PDVSNYLGSEDEYSVSNGNKDTLTFDGMSDMEVDRRMKDAFLASSTVNSA----DPRV-P 480
PDVSNYL SED+ S SNGN+D L FDGM+D+EV+RR+K+A A+STV S+ DPR+ P
Sbjct: 421 PDVSNYLVSEDDGSTSNGNRDPLPFDGMADVEVERRLKEAISAASTVASSVTNIDPRLAP 480
Query: 481 SLQYTMASASGTVPVPP------YYPNMPLPH----VDSVAQVAASEPSLQSSPAREEGE 540
LQ T+ S+SG++P+P +PN+ P V + V + +LQ+SPAREEGE
Sbjct: 481 PLQTTIGSSSGSLPLPTTQVSVMNFPNVQFPQAASAVKPLGHVGNMDSNLQNSPAREEGE 540
Query: 541 VPESELDPDTRRRLLILQHGQDTRERQSSEPAFLGRPPPLPQVVGPRAQPRGSWSPMEEE 600
VPESELDPDTRRRLLILQHGQDTR+ SSEP F RP QV PR Q RG W EEE
Sbjct: 541 VPESELDPDTRRRLLILQHGQDTRDLTSSEPPFPVRPS--VQVSVPRVQSRGGWFLAEEE 600
Query: 601 MSPLQLS-WTRKEFPVDEEPIR-EKHRSNHPSFFPKNDSSFPPDRIPHENQRLSKEAFYR 660
MSP Q+S KEFP+D EP+ EKHR +HPSFFPK +S P DRI HENQRL KEAF R
Sbjct: 601 MSPRQVSRVVPKEFPLDSEPLHVEKHRPHHPSFFPKVESPIPSDRILHENQRLPKEAFQR 660
Query: 661 DDRVRVSRRPSSYPAFSGDEIPMNQSSSRSRENDIESGRSI-WSETPVGALQEIAMKFGT 720
+ R + Y +FSG+EIP+++SSS ++E D ES R++ +ETP GAL EIAMK GT
Sbjct: 661 E---RSNNSLPGYHSFSGEEIPLSRSSSSNKEVDFESSRAVSIAETPAGALHEIAMKCGT 720
Query: 721 KVEFKPALVSSTDLQFAVEAWFVGEKIGEGIGKTRREAQRHAAEGSIKNLANVYVSRCKA 780
KVEF+PALVSST+LQFAVEAWF GEKIGEG G+TRREAQ AAEGS+KNLAN+YVSR K
Sbjct: 721 KVEFRPALVSSTELQFAVEAWFAGEKIGEGTGRTRREAQCQAAEGSLKNLANIYVSRVKP 780
Query: 781 DSTS-ANDMNKFPNDNGSGKRMRTDFHGNLPKPK 795
DS S D +KFP+ + +G + G+ PK
Sbjct: 781 DSGSLLLDGSKFPDMSENGFLSHANSFGSRGTPK 809
BLAST of CmoCh03G004900 vs. NCBI nr
Match:
gi|590624713|ref|XP_007025681.1| (C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao])
HSP 1 Score: 1080.5 bits (2793), Expect = 0.0e+00
Identity = 586/830 (70.60%), Positives = 662/830 (79.76%), Query Frame = 1
Query: 1 MYKSVVYQGDELLGEVEIYPE----------EKNGYKNI-----EVKEIRISHFSQPSER 60
MYKSVVY+G+E+LGEVEIYP+ E+ + I E+KEIRI + +Q SER
Sbjct: 4 MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63
Query: 61 CPPLAVLHTIAASGICFKMESKT----SQSQDMP-LHLLHSSCIMENKSAIMVFGMEELH 120
CPPLAVLHTI +SGICFKMES S SQD P LHLLHS CI +NK+A+M G ELH
Sbjct: 64 CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123
Query: 121 LVAMYSRDHDKQYPCFWGFNVAMGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDK 180
LVAMYSR+ D+ PCFWGFNV+ GLY+SCL+MLNLRCLGIVFDLDETL+VANTMRSFED+
Sbjct: 124 LVAMYSRNSDR--PCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 183
Query: 181 IEALQRKISSEVDPQRTAGMLAEVERYQDDKLILKQYAENDQVIENGKVIKSQSEVVPAL 240
IEALQRK+++EVDPQR AGM+AE++RYQDDK ILKQYAENDQV+ENGKVIK QSEVVPAL
Sbjct: 184 IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 243
Query: 241 SDNHQPFVRPLIRLHEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 300
SDNHQP +RPLIRL EKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV
Sbjct: 244 SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 303
Query: 301 CTMAERDYALEMWRLLDPDSNLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVI 360
CTMAERDYALEMWRLLDP+SNLIN KELLDRIVCVKSGSRKSLFNVFQDG CHPKMALVI
Sbjct: 304 CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 363
Query: 361 DDRLKVWDEKDQPRVHVVPAFAPYYAPNAEGNNVVPVLCVARNVACYVRGGFFREFDEVL 420
DDRLKVWDEKDQPRVHVVPAFAPYYAP AE NN +PVLCVARNVAC VRGGFFREFDE L
Sbjct: 364 DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 423
Query: 421 LQKIYNISYEDDANDIPSPPDVSNYLGSEDEYSVSNGNKDTLTFDGMSDMEVDRRMKDAF 480
LQ+I ISYEDD DIPSPPDV NYL SED+ S NGNKD L FDGM+D EV+RR+K+A
Sbjct: 424 LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 483
Query: 481 LASSTVNSA----DPRV-PSLQYTMASASGTVPVPPYYPNM--------PL--PHVDSVA 540
A+STV+SA DPR+ PSLQYTM S+S ++P P++ PL P V VA
Sbjct: 484 SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 543
Query: 541 QVAASEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRERQSSEPAFLGRPP--P 600
VA EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTR+ EPAF PP P
Sbjct: 544 PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAF---PPVRP 603
Query: 601 LPQVVGPRAQPRGSWSPMEEEMSPLQLSWTR-KEFPVDEEPIR-EKHRSNHPSFFPKNDS 660
QV PR Q RGSW EEEMSP QL+ KEFP+D E + EKHR HP FFPK +S
Sbjct: 604 TMQVSVPRGQSRGSWFAAEEEMSPRQLNRAAPKEFPLDSERMHIEKHR--HPPFFPKVES 663
Query: 661 SFPPDRIPHENQRLSKEAFYRDDRVRVSRRPSSYPAFSGDEIPMNQSSSRSRENDIESGR 720
S P DR+ ENQRLSKEA +RDDR+ ++ PSSY +FSG+E+P++QSSS R+ D ESGR
Sbjct: 664 SIPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGR 723
Query: 721 SIWS-ETPVGALQEIAMKFGTKVEFKPALVSSTDLQFAVEAWFVGEKIGEGIGKTRREAQ 780
++ S ET G LQ+IAMK G KVEF+PALV+S DLQF++EAWF GEK+GEG+G+TRREAQ
Sbjct: 724 TVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQ 783
Query: 781 RHAAEGSIKNLANVYVSRCKADSTSA-NDMNKFPNDNGSGKRMRTDFHGN 790
R AAE SIKNLAN Y+SR K DS SA D+++ N N +G + GN
Sbjct: 784 RQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGN 826
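The percentages on each "Identity = ..." line follow directly from the counts next to them (for example, 711 identical positions over 803 aligned columns gives 88.54%). The Python sketch below shows one way to recompute and cross-check them. The function name parse_hsp_stats and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool, and the pattern also tolerates a report that spells the second label "Postives".

```python
import re

# Hypothetical helper for the HSP statistics line of this report.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus recomputed and reported percentages for one HSP line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "aligned": int(ident_len),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 711/803 (88.54%), Positives = 752/803 (93.65%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positive_pct"])
```

A mismatch between the recomputed and reported values would suggest a damaged or hand-edited line rather than a property of the alignment itself.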
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
CPL1_ARATH | 2.7e-263 | 61.95 | RNA polymerase II C-terminal domain phosphatase-like 1 OS=Arabidopsis thaliana G...
CPL2_ARATH | 6.5e-169 | 46.24 | RNA polymerase II C-terminal domain phosphatase-like 2 OS=Arabidopsis thaliana G...

Match Name | E-value | Identity | Description
A0A0A0KLF7_CUCSA | 0.0e+00 | 88.54 | Uncharacterized protein OS=Cucumis sativus GN=Csa_6G517200 PE=4 SV=1
A0A061GMH8_THECC | 0.0e+00 | 70.60 | C-terminal domain phosphatase-like 1 isoform 3 OS=Theobroma cacao GN=TCM_029910 ...
A0A061GFW4_THECC | 0.0e+00 | 70.60 | C-terminal domain phosphatase-like 1 isoform 2 OS=Theobroma cacao GN=TCM_029910 ...
A0A061GGL6_THECC | 0.0e+00 | 70.60 | C-terminal domain phosphatase-like 1 isoform 1 OS=Theobroma cacao GN=TCM_029910 ...
A0A067JAV3_JATCU | 8.6e-311 | 69.85 | Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21412 PE=4 SV=1

Match Name | E-value | Identity | Description
AT4G21670.1 | 1.5e-264 | 61.95 | C-terminal domain phosphatase-like 1
AT5G01270.2 | 3.7e-170 | 46.24 | carboxyl-terminal domain (ctd) phosphatase-like 2
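
The summary rows above are pipe-delimited, so they can be loaded into records with a few lines of Python. This is a minimal sketch under stated assumptions: the function name parse_summary_row and the dictionary keys are hypothetical, only the first four fields are read (any trailing columns are ignored), and truncated descriptions ending in "..." are kept exactly as they appear.

```python
def parse_summary_row(row):
    """Split one 'Match Name | E-value | Identity | Description' row into a record."""
    fields = [field.strip() for field in row.split("|")]
    if len(fields) < 4:
        raise ValueError("expected at least 4 pipe-separated fields: %r" % row)
    name, evalue, identity, description = fields[:4]
    return {
        "match": name,
        # float() handles the scientific notation used in the report;
        # an E-value printed as 0.0e+00 becomes 0.0.
        "evalue": float(evalue),
        "identity_pct": float(identity),
        "description": description,
    }


rows = [
    "AT4G21670.1 | 1.5e-264 | 61.95 | C-terminal domain phosphatase-like 1",
    "AT5G01270.2 | 3.7e-170 | 46.24 | carboxyl-terminal domain (ctd) phosphatase-like 2",
]
# Rank hits by identity, highest first.
for record in sorted(map(parse_summary_row, rows), key=lambda r: -r["identity_pct"]):
    print(record["match"], record["identity_pct"], record["evalue"])
```

Header rows ("Match Name | ...") are not numeric and would raise a ValueError in float(), so a caller would need to skip them before parsing.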