Homology
BLAST of Sgr021939 vs. NCBI nr
Match:
XP_022148889.1 (RNA polymerase II C-terminal domain phosphatase-like 3 [Momordica charantia])
HSP 1 Score: 1782.7 bits (4616), Expect = 0.0e+00
Identity = 978/1268 (77.13%), Postives = 1020/1268 (80.44%), Query Frame = 0
Query: 1 MGKDESVKIEDVEEGEISDTASVEEISEEDFNKLESGAKVASKDSSNREARVWTMSDLYK 60
MGKDESVKIEDVEEGEISDTASVEEISEEDFNKLE+GAK+ SNRE RVWTMSDLYK
Sbjct: 1 MGKDESVKIEDVEEGEISDTASVEEISEEDFNKLETGAKLVPSKDSNREPRVWTMSDLYK 60
Query: 61 NYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVVEADPEEKSKR-SSPSSFANGKEDGNST 120
NYPTMCRGYASGLYNLAWAQAVQNKPLNDIFV+EADPEEKSKR SSPS AN GNST
Sbjct: 61 NYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVMEADPEEKSKRSSSPSPLAN----GNST 120
Query: 121 KEGGKVVIDGSADEIDCDNVNVEREEGELEEGEIDMDSEFVEEVVDSKAMLSDCRDMDCD 180
KE GKV ID S+DE+D N NVEREEGELEEGEIDMD+EFVEEVV+SKAMLSD D DCD
Sbjct: 121 KEEGKVTIDDSSDEMDYGNANVEREEGELEEGEIDMDTEFVEEVVESKAMLSDSGDTDCD 180
Query: 181 SEEFDLRKKELDDKVKLIQKTLDGVTIDAAQKSFQEVCSLLHSSIETSMQLLQEKVVPRK 240
+E DL KKELDD+VKLIQKTLDGVTIDAAQKSF+EVC+ LHSSIE ++LLQEKV P K
Sbjct: 181 GQESDLVKKELDDQVKLIQKTLDGVTIDAAQKSFEEVCTQLHSSIEIFLKLLQEKVFPXK 240
Query: 241 DALIQRLYAALRIINSV-------------------ASYVKTCNPPLFSPEQIKSVEVKM 300
DALIQRLYAALRIINSV SYVK CNPPLFSPEQIKSVEVKM
Sbjct: 241 DALIQRLYAALRIINSVFCSMNLNEKEEYKQHLSRLLSYVKNCNPPLFSPEQIKSVEVKM 300
Query: 301 PST---DYLPSMRASAKESTFDFVNQVA----------FGLH------------ACWVVG 360
PST DYL +RA+AKE+ N V G H V+
Sbjct: 301 PSTDSLDYLSIIRANAKEAEIHIPNGVKNKDFYSGSTNAGPHLTSSTKLPSDSMPVGVMA 360
Query: 361 KNNLNILSDGLQSGVSNIKGRGALLPLLDLHKDHDVDSLPSPTREAPSFFPVQKSGNAPA 420
KNN NILSDG QSGVSN++GRG LLPLLDLHKDHDVDSLPSPTREAPS FPVQK GN P
Sbjct: 361 KNNPNILSDGSQSGVSNLRGRGPLLPLLDLHKDHDVDSLPSPTREAPSIFPVQKLGNTPP 420
Query: 421 KVALAMDGPRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDDGAGDVGG-- 480
KVALAMDG RSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEEC DGAGD+GG
Sbjct: 421 KVALAMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEEC-DGAGDIGGEV 480
Query: 481 ---------------------------------------SIKGLISPINVAPPSCVSNPT 540
SIKGLISPINVAPPSCVSNPT
Sbjct: 481 SSSSIIRSLKASNSPKLGQNVSNSASNISAGYFPNMESSSIKGLISPINVAPPSCVSNPT 540
Query: 541 VKPLAKSRDPRLRIVNSD-------------------AKSAATINLRKQKMDEEPNIDGP 600
VKPL KSRDPR RI+NSD A+S ATINLRKQKM EEPN+DGP
Sbjct: 541 VKPLPKSRDPRRRIINSDASALDLNPRTIASVQNSSIAESDATINLRKQKMGEEPNVDGP 600
Query: 601 ETKRQRIGSQNLAVASSDLGSVSEVVAGWKILCQLDPDFQVGIRW--------------- 660
E KRQR GSQN AVA+SD+ + S GW L+ VG R
Sbjct: 601 EMKRQRTGSQNHAVAASDVRTGS---GGW-----LEDTMPVGPRLSSRNQMEISEADATE 660
Query: 661 ------------------SANNDASLPSLLKDIVVNPTMLINLLKISQQQQLAAELKLKS 720
SA+NDASLPSLLKDI VNPTM ++LLK+SQQQ LAAELKLKS
Sbjct: 661 KLNVTNNSVAGNECTPSISASNDASLPSLLKDIAVNPTMFLSLLKMSQQQHLAAELKLKS 720
Query: 721 SEPEKNAICPTSLSPCLGSTPLANTPAVTSGNLQQSGGTPSVPSPPVATV---DDLGKVR 780
SE EKNAICPTSL+PC GS+PL NTP+VTSG LQQS GT SVPSPPVATV DDLGKVR
Sbjct: 721 SELEKNAICPTSLNPCQGSSPLVNTPSVTSGILQQSTGTSSVPSPPVATVSRQDDLGKVR 780
Query: 781 MKPRDPRRILHGNSLQKVGSLGNEQFKGIVPAAPNTEGSRDIPNGHKQEGQGDLRLASSQ 840
MKPRDPRRILHGNSLQKVG+LGNEQ KGIVP APNTEGS+D+PNGHKQEG GDLRLASSQ
Sbjct: 781 MKPRDPRRILHGNSLQKVGNLGNEQSKGIVPTAPNTEGSKDVPNGHKQEGLGDLRLASSQ 840
Query: 841 PLLPDITRQFTKNLKNIADILSVSSPSASSQTSSSKPVKLDRMDTNAVGSSSSDSKVVTT 900
+ PDITR FTKNLKNIADILS SSP SS +SSSKPVKLDRMDTN+VGSSS DSKVVTT
Sbjct: 841 SVPPDITRPFTKNLKNIADILSGSSPPTSSLSSSSKPVKLDRMDTNSVGSSSIDSKVVTT 900
Query: 901 ATQAADMVSPSRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLD 960
ATQA DMV SRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLD
Sbjct: 901 ATQAVDMVGLSRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLD 960
Query: 961 LDHTLLNSAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKA 1020
LDHTLLNSAKFVEV+PVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKA
Sbjct: 961 LDHTLLNSAKFVEVEPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKA 1020
Query: 1021 SELYELHLYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGDERVPKSKDLEGV 1080
SELYELHLYTMGNKLYATEMAKVLDPKG LFAGRVISRGDDGDPLDGDERVPKSKDLEGV
Sbjct: 1021 SELYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPLDGDERVPKSKDLEGV 1080
Query: 1081 LGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGT 1128
LGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGT
Sbjct: 1081 LGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGT 1140
BLAST of Sgr021939 vs. NCBI nr
Match:
XP_022960085.1 (RNA polymerase II C-terminal domain phosphatase-like 3 [Cucurbita moschata])
HSP 1 Score: 1701.8 bits (4406), Expect = 0.0e+00
Identity = 924/1261 (73.28%), Postives = 996/1261 (78.98%), Query Frame = 0
Query: 1 MGKDES-VKIEDVEEGEISDTASVEEISEEDFNKLESGAKVASKDSSNREARVWTMSDLY 60
MGK + VK DVEEGEISDT SVEEI+EEDFNKLE+ K+ SNRE VWTMSDLY
Sbjct: 1 MGKHTNCVKTPDVEEGEISDTPSVEEITEEDFNKLETAPKLLPSKHSNRETTVWTMSDLY 60
Query: 61 KNYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVVEADPEEKSKRSSPSSFANGKEDGNST 120
NYPTMCRGYA GLYNLAWA+AVQNKPLN+IF+++ADP++KS RSS S F N KE GN T
Sbjct: 61 NNYPTMCRGYAPGLYNLAWAKAVQNKPLNEIFLMDADPDDKSNRSSSSPFRNAKEHGNGT 120
Query: 121 KEGGKVVIDGSADEIDCDNVNVEREEGELEEGEIDMDSEFVEEVVDSKAMLSDCRDMDCD 180
KE K++ID + D+++ DN +VE+EEGELEEGEIDMD+EFVEEVVDSK MLSD D DC
Sbjct: 121 KEEAKLIIDITGDDMNSDNADVEKEEGELEEGEIDMDTEFVEEVVDSKPMLSDSLDTDC- 180
Query: 181 SEEFDLRKKELDDKVKLIQKTLDGVTIDAAQKSFQEVCSLLHSSIETSMQLLQEKVVPRK 240
+E DL+ KELDD++KLI KTLDGVTIDAAQKSFQEVCS L SSIET ++L+Q KVVPRK
Sbjct: 181 -QEIDLKNKELDDQLKLIHKTLDGVTIDAAQKSFQEVCSQLLSSIETFLELVQGKVVPRK 240
Query: 241 DALIQRLYAALRIINSV-------------------ASYVKTCNPPLFSPEQIKSVEVKM 300
D LIQRLYAALRIINSV S+VK CNPPLFSPEQIKSVEVKM
Sbjct: 241 DVLIQRLYAALRIINSVFCSMNPKEKEEYKQHLSRLLSFVKNCNPPLFSPEQIKSVEVKM 300
Query: 301 PSTDYL---PSMRASAKESTFDFVNQVA----FGLHA------------------CWVVG 360
PSTD L P MRASAK+ N V + +A V
Sbjct: 301 PSTDSLDQFPDMRASAKDVEIHIPNGVKNKDFYSAYATATPHLTSSTKLPSDSMPVGVTV 360
Query: 361 KNNLNILSDGLQSGVSNIKGRGALLPLLDLHKDHDVDSLPSPTREAPSFFPVQKSGNAPA 420
KN+LN+ SD L SGV N+KGRG LLPLLDLHKDHDVDSLPSPTREAP+ F VQKSG+ P
Sbjct: 361 KNSLNLSSDSLLSGVPNVKGRGPLLPLLDLHKDHDVDSLPSPTREAPTVFSVQKSGHIPV 420
Query: 421 KVALAMDGPRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDDGAGDVGG-- 480
KVA AMDG R HPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEEC DG GD+GG
Sbjct: 421 KVAHAMDGSRVHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEEC-DGGGDIGGEV 480
Query: 481 ---------------------------------------SIKGLISPINVAPPSCVSNPT 540
S KGLISP NVAPPSCVSNP
Sbjct: 481 SSSSILRSSKASNSSKLAQTVSQSASSISTGLFPNLESSSTKGLISPSNVAPPSCVSNPI 540
Query: 541 VKPLAKSRDPRLRIVNSDA-------------------KSAATINLRKQKMDEEPNIDGP 600
KPLAKSRDPRLR+VNS+A +SA T+NLRKQKMD EPNID P
Sbjct: 541 AKPLAKSRDPRLRMVNSEASAMDLNPRTMTSVQSPSVVESAVTVNLRKQKMDVEPNIDAP 600
Query: 601 ETKRQRIGSQNLAVASSDLGSVSEVVAGW-----KILCQLDPDFQV-------------- 660
E KRQRIGSQN A ++SDL + S GW + +L Q+
Sbjct: 601 EMKRQRIGSQNHAFSASDLRAGSG-SGGWLEDTMSAVPRLSSRNQMEIAEANATEKNNVT 660
Query: 661 ---------GIRWSANNDASLPSLLKDIVVNPTMLINLLKISQQQQLAAELKLKSSEPEK 720
G SA+ +ASLPSLLKDIVVNPTML++LLK++QQ+Q+AAELKLKSSEPEK
Sbjct: 661 NNSGAGNSCGPTISASKEASLPSLLKDIVVNPTMLLSLLKMNQQKQVAAELKLKSSEPEK 720
Query: 721 NAICPTSLSPCLGSTPLANTPAVTSGNLQQSGGTPSVPSPPVATVDDLGKVRMKPRDPRR 780
NAICPT+++PCLGS+PL N PA+TSG LQQS GTPSVPSPPV TVDD+GKVRMKPRDPRR
Sbjct: 721 NAICPTAVNPCLGSSPLVNAPALTSGILQQSAGTPSVPSPPVVTVDDVGKVRMKPRDPRR 780
Query: 781 ILHGNSLQKVGSLGNEQFKGIVPAAPNTEGSRDI-PNGHKQEGQGDLRLASSQPLLPDIT 840
ILHGNSL KVGS+GNEQ K +VPA PN EGSRDI PNGHKQEGQG+LRLASSQPLLPDI
Sbjct: 781 ILHGNSLHKVGSMGNEQLKSVVPAVPNPEGSRDIVPNGHKQEGQGNLRLASSQPLLPDIG 840
Query: 841 RQFTKNLKNIADILSVSSPSASSQTSSSKPVKLDRMDTNAVGSSSSDSKVVTTATQAADM 900
RQFT NLKNIADI+SV SP SS SSSKPVKLD DTNAVGSSS DSK+V TATQ DM
Sbjct: 841 RQFTNNLKNIADIMSVPSPPTSSHNSSSKPVKLDIKDTNAVGSSSIDSKIVATATQVVDM 900
Query: 901 VSPSRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLLN 960
V PSRS G WGDLEHLFEGYDDKQKAAIQRERARRI+EQKKMFAARKLCLVLDLDHTLLN
Sbjct: 901 VGPSRSHGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLN 960
Query: 961 SAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH 1020
SAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH
Sbjct: 961 SAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH 1020
Query: 1021 LYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAV 1080
LYTMGNKLYATEMAKVLDPKG LFAGRV+SRGDDGDPLDG+ERVPKSKDLEGVLGMESAV
Sbjct: 1021 LYTMGNKLYATEMAKVLDPKGVLFAGRVLSRGDDGDPLDGEERVPKSKDLEGVLGMESAV 1080
Query: 1081 VIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAV 1128
VIIDDS+RVWPHNK+NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAV
Sbjct: 1081 VIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAV 1140
BLAST of Sgr021939 vs. NCBI nr
Match:
XP_023514332.1 (RNA polymerase II C-terminal domain phosphatase-like 3 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1689.1 bits (4373), Expect = 0.0e+00
Identity = 916/1261 (72.64%), Postives = 994/1261 (78.83%), Query Frame = 0
Query: 1 MGKDES-VKIEDVEEGEISDTASVEEISEEDFNKLESGAKVASKDSSNREARVWTMSDLY 60
MGK + VK +DVEEGEISDT SVEEI+EEDFNKLE+ K+ SNRE VWTMSDLY
Sbjct: 1 MGKHTNCVKTQDVEEGEISDTPSVEEITEEDFNKLETAPKLLPSKHSNRETTVWTMSDLY 60
Query: 61 KNYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVVEADPEEKSKRSSPSSFANGKEDGNST 120
NYPTMCRGYASGLYNLAWA+AVQNKPLN+IF+++ADP++KS RSS S F N KE GN T
Sbjct: 61 NNYPTMCRGYASGLYNLAWAKAVQNKPLNEIFLMDADPDDKSNRSSSSPFRNAKEHGNGT 120
Query: 121 KEGGKVVIDGSADEIDCDNVNVEREEGELEEGEIDMDSEFVEEVVDSKAMLSDCRDMDCD 180
K+ K++ID + D+++ DN +VE+EEGELEEGEIDMD+EFVEEVVDSK MLSD +D D
Sbjct: 121 KQEAKLIIDITGDDMNSDNADVEKEEGELEEGEIDMDTEFVEEVVDSKPMLSD--SLDTD 180
Query: 181 SEEFDLRKKELDDKVKLIQKTLDGVTIDAAQKSFQEVCSLLHSSIETSMQLLQEKVVPRK 240
+E DL+ KELDD++KLI KTLD VTIDAAQKSF EVCS L SSIET ++L+Q KVVPRK
Sbjct: 181 YQEIDLKNKELDDQLKLIHKTLDAVTIDAAQKSFHEVCSQLLSSIETFLELVQGKVVPRK 240
Query: 241 DALIQRLYAALRIINSV-------------------ASYVKTCNPPLFSPEQIKSVEVKM 300
DALIQRLYAALRIINSV S+VK CN PLFSPEQIKSVEVKM
Sbjct: 241 DALIQRLYAALRIINSVFCSMNPKEKEECKPHLSRLLSFVKNCNTPLFSPEQIKSVEVKM 300
Query: 301 PST---DYLPSMRASAKESTFDFVNQVA----FGLHA------------------CWVVG 360
PST D+ P MR SAK+ N V + +A V
Sbjct: 301 PSTDSLDHFPHMRDSAKDVEIHIPNGVKNKDFYSAYATATPHLTSSTKLPSDSMPVGVTV 360
Query: 361 KNNLNILSDGLQSGVSNIKGRGALLPLLDLHKDHDVDSLPSPTREAPSFFPVQKSGNAPA 420
KNNLN+ SD L SGV N+KGRG LLPLLDLHKDHDVDSLPSPTREAP+ F VQKSG+ P
Sbjct: 361 KNNLNLSSDSLLSGVPNVKGRGPLLPLLDLHKDHDVDSLPSPTREAPTVFSVQKSGHIPV 420
Query: 421 KVALAMDGPRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDDGAGDVGG-- 480
KVA AMDG R HPYETDA+KAVSTYQQKFGRSSFSMADRLPSPTPSEEC DG GD+GG
Sbjct: 421 KVARAMDGSRVHPYETDAVKAVSTYQQKFGRSSFSMADRLPSPTPSEEC-DGGGDIGGEV 480
Query: 481 ---------------------------------------SIKGLISPINVAPPSCVSNPT 540
S KGLISP+NVAPPS VSNP
Sbjct: 481 SSSSIFRSSKASNSYKLAQTVSNSASSISTGLFPNLESSSTKGLISPLNVAPPSSVSNPI 540
Query: 541 VKPLAKSRDPRLRIVNSDA-------------------KSAATINLRKQKMDEEPNIDGP 600
KPLAKSRDPRLR+V S+A +SA T+N+RKQKMD EPNID P
Sbjct: 541 AKPLAKSRDPRLRMVTSEASAMDLNPRTMTSVQNPSVVESAVTVNMRKQKMDVEPNIDAP 600
Query: 601 ETKRQRIGSQNLAVASSDLGSVSEVVAGW-----KILCQLDPDFQV-------------- 660
E KRQRIGSQN A ++SDL + S GW + +L Q+
Sbjct: 601 EMKRQRIGSQNHAFSASDLRAGSG-SGGWLEDTMSAVPRLSSRNQMEIAEANATEKNNVT 660
Query: 661 ---------GIRWSANNDASLPSLLKDIVVNPTMLINLLKISQQQQLAAELKLKSSEPEK 720
G SA+ +ASLPSLLKDIVVNPTML++LLK++QQ+Q+AAELKL SSEPEK
Sbjct: 661 NNSGAGNSRGPTISASKEASLPSLLKDIVVNPTMLLSLLKMNQQKQVAAELKLNSSEPEK 720
Query: 721 NAICPTSLSPCLGSTPLANTPAVTSGNLQQSGGTPSVPSPPVATVDDLGKVRMKPRDPRR 780
NAICPT+++PCLGS+PL N PA+TSG LQQS GTPSVPSPPV TVDD+GKVRMKPRDPRR
Sbjct: 721 NAICPTAVNPCLGSSPLVNAPALTSGILQQSAGTPSVPSPPVVTVDDVGKVRMKPRDPRR 780
Query: 781 ILHGNSLQKVGSLGNEQFKGIVPAAPNTEGSRD-IPNGHKQEGQGDLRLASSQPLLPDIT 840
ILHGNSL KVGS+GNEQ K +VPA PN EGSRD IPNGHKQEGQG+LRLASSQPLLPDI
Sbjct: 781 ILHGNSLHKVGSMGNEQLKSVVPAVPNPEGSRDIIPNGHKQEGQGNLRLASSQPLLPDIG 840
Query: 841 RQFTKNLKNIADILSVSSPSASSQTSSSKPVKLDRMDTNAVGSSSSDSKVVTTATQAADM 900
RQFT NLKNIADI+SV SP SS SSSKPVKLDR DTNAVGSSS DSK+V TATQA DM
Sbjct: 841 RQFTNNLKNIADIMSVPSPPTSSHNSSSKPVKLDRKDTNAVGSSSIDSKIVATATQAVDM 900
Query: 901 VSPSRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLLN 960
V PSRS G WGDLEHLFEGYDDKQKAAIQRERARRI+EQKKMFAARKLCLVLDLDHTLLN
Sbjct: 901 VGPSRSHGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLN 960
Query: 961 SAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH 1020
SAKFVEVDPVHDEILRKKEEQDREK QRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH
Sbjct: 961 SAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH 1020
Query: 1021 LYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAV 1080
LYTMGNKLYATEMAKVLDPKG LFAGRV+SRGDDGDPLDG+ERVPKSKDLEGVLGMESAV
Sbjct: 1021 LYTMGNKLYATEMAKVLDPKGVLFAGRVLSRGDDGDPLDGEERVPKSKDLEGVLGMESAV 1080
Query: 1081 VIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAV 1128
VIIDDS+RVWPHNK+NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAV
Sbjct: 1081 VIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAV 1140
BLAST of Sgr021939 vs. NCBI nr
Match:
KAG6592819.1 (RNA polymerase II C-terminal domain phosphatase-like 3, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1688.3 bits (4371), Expect = 0.0e+00
Identity = 915/1261 (72.56%), Postives = 992/1261 (78.67%), Query Frame = 0
Query: 1 MGKDES-VKIEDVEEGEISDTASVEEISEEDFNKLESGAKVASKDSSNREARVWTMSDLY 60
MGK + VK +DVEEGEISDT SVEEI+EEDFNKLE+ K+ SNRE VWTMSDLY
Sbjct: 1 MGKHTNCVKTQDVEEGEISDTPSVEEITEEDFNKLETAPKLLPSKHSNRETTVWTMSDLY 60
Query: 61 KNYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVVEADPEEKSKRSSPSSFANGKEDGNST 120
NYPTMCRGYASGLYNLAWA+AVQNKPLN+IF+++ADP+ KS RSS S F N KE GN T
Sbjct: 61 NNYPTMCRGYASGLYNLAWAKAVQNKPLNEIFLMDADPDHKSNRSSSSPFRNAKEHGNGT 120
Query: 121 KEGGKVVIDGSADEIDCDNVNVEREEGELEEGEIDMDSEFVEEVVDSKAMLSDCRDMDCD 180
K+ K++ID + D+++ DN +VE+EEGELEEGEIDMD+EFVEEVVDSK MLSD D DC
Sbjct: 121 KQEAKLIIDITGDDMNSDNADVEKEEGELEEGEIDMDTEFVEEVVDSKPMLSDSLDTDC- 180
Query: 181 SEEFDLRKKELDDKVKLIQKTLDGVTIDAAQKSFQEVCSLLHSSIETSMQLLQEKVVPRK 240
E DL+ KELDD++KLI KTLDGVTIDAAQKSFQ+VCS L SSIET ++L+Q KVVPRK
Sbjct: 181 -REIDLKNKELDDQLKLIHKTLDGVTIDAAQKSFQQVCSQLLSSIETFLELVQGKVVPRK 240
Query: 241 DALIQRLYAALRIINSV-------------------ASYVKTCNPPLFSPEQIKSVEVKM 300
DALIQR YAALRIINSV S+VK CNPPLFSPEQIKSVEVKM
Sbjct: 241 DALIQRCYAALRIINSVFCSMNPKEKEEYKQHLSRLLSFVKNCNPPLFSPEQIKSVEVKM 300
Query: 301 PST---DYLPSMRASAKESTFDFVNQVA----FGLHA------------------CWVVG 360
PST D+ P R SAK+ N V + +A V
Sbjct: 301 PSTDSLDHFPDTRDSAKDVEIHIPNGVKNKDFYSAYATATPHLTSSTKLPSDSMPVGVTI 360
Query: 361 KNNLNILSDGLQSGVSNIKGRGALLPLLDLHKDHDVDSLPSPTREAPSFFPVQKSGNAPA 420
KNNLN+ SD L SGV N+KGRG L PLLDLHKDHDVDSLPSPTREAP+ F VQKSG+ P
Sbjct: 361 KNNLNLSSDSLLSGVPNVKGRGPLHPLLDLHKDHDVDSLPSPTREAPTVFSVQKSGHIPM 420
Query: 421 KVALAMDGPRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDDGAGDVGGSI 480
KVA MDG R HPYETDA+KAVSTYQQKFGRSSFSMADRLPSPTPSEEC DG GD+GG +
Sbjct: 421 KVAHDMDGSRVHPYETDAVKAVSTYQQKFGRSSFSMADRLPSPTPSEEC-DGGGDIGGEV 480
Query: 481 -----------------------------------------KGLISPINVAPPSCVSNPT 540
KGLISP+NVAPPS VSNP
Sbjct: 481 SSSSIFRSSKASSSSKLAQTVSNSASSISTGLFPNLESSTTKGLISPLNVAPPSSVSNPI 540
Query: 541 VKPLAKSRDPRLRIVNSDA-------------------KSAATINLRKQKMDEEPNIDGP 600
KPLAKSRDPRLR+V S+A +SA T+N+RKQKMD EPNID P
Sbjct: 541 AKPLAKSRDPRLRMVTSEASAMDLNPRTMTSVQNPSVVESAVTVNMRKQKMDVEPNIDAP 600
Query: 601 ETKRQRIGSQNLAVASSDLGSVSEVVAGW-----KILCQLDPDFQV-------------- 660
E KRQRIGSQN A ++SDL + S GW + +L Q+
Sbjct: 601 EMKRQRIGSQNHAFSASDLRAGSG-SGGWLEDTMSAVPRLSSRNQMEIAEANATEKNNVT 660
Query: 661 ---------GIRWSANNDASLPSLLKDIVVNPTMLINLLKISQQQQLAAELKLKSSEPEK 720
G SA+ +ASLPSLLKDIVVNPTML++LLK++QQ+Q+AAELKL SSEPEK
Sbjct: 661 NNSGAGNLRGPTISASKEASLPSLLKDIVVNPTMLLSLLKMNQQKQVAAELKLNSSEPEK 720
Query: 721 NAICPTSLSPCLGSTPLANTPAVTSGNLQQSGGTPSVPSPPVATVDDLGKVRMKPRDPRR 780
NAICPT+++PCLGS+PL N PA+TSG LQQS GTPSVPSPPV TVDD+GKVRMKPRDPRR
Sbjct: 721 NAICPTAVNPCLGSSPLVNAPALTSGILQQSAGTPSVPSPPVVTVDDVGKVRMKPRDPRR 780
Query: 781 ILHGNSLQKVGSLGNEQFKGIVPAAPNTEGSRD-IPNGHKQEGQGDLRLASSQPLLPDIT 840
ILHGNSL KVGS+GNEQ K +VPA PN EGSRD IPNGHKQEGQG+LRLASSQPLLPDI
Sbjct: 781 ILHGNSLHKVGSMGNEQLKSVVPAVPNPEGSRDIIPNGHKQEGQGNLRLASSQPLLPDIG 840
Query: 841 RQFTKNLKNIADILSVSSPSASSQTSSSKPVKLDRMDTNAVGSSSSDSKVVTTATQAADM 900
RQFT NLKNIADI+SV SP SS SSSKPVKLDR DTNAVGSSS DSK+V TATQA DM
Sbjct: 841 RQFTNNLKNIADIMSVPSPPTSSHNSSSKPVKLDRKDTNAVGSSSIDSKIVATATQAVDM 900
Query: 901 VSPSRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLLN 960
V PSRS G WGDLEHLFEGYDDKQKAAIQRERARRI+EQKKMFAARKLCLVLDLDHTLLN
Sbjct: 901 VGPSRSHGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLN 960
Query: 961 SAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH 1020
SAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH
Sbjct: 961 SAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH 1020
Query: 1021 LYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAV 1080
LYTMGNKLYATEMAKVLDPKG LFAGRV+SRGDDGDPLDG+ERVPKSKDLEGVLGMESAV
Sbjct: 1021 LYTMGNKLYATEMAKVLDPKGVLFAGRVLSRGDDGDPLDGEERVPKSKDLEGVLGMESAV 1080
Query: 1081 VIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAV 1128
VIIDDS+RVWPHNK+NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAV
Sbjct: 1081 VIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAV 1140
BLAST of Sgr021939 vs. NCBI nr
Match:
XP_011656791.1 (RNA polymerase II C-terminal domain phosphatase-like 3 [Cucumis sativus] >KGN46418.1 hypothetical protein Csa_005260 [Cucumis sativus])
HSP 1 Score: 1686.8 bits (4367), Expect = 0.0e+00
Identity = 929/1269 (73.21%), Postives = 989/1269 (77.94%), Query Frame = 0
Query: 1 MGKDESVKIEDVEEGEISDTASVEEISEEDFNKLESGAK----VASKDSSNREARVWTMS 60
MGKDE +KIEDVEEGEISDTASVEEISEEDFNKL+S A V SKD SNRE RVWTMS
Sbjct: 1 MGKDEILKIEDVEEGEISDTASVEEISEEDFNKLDSSASPKVVVPSKD-SNRETRVWTMS 60
Query: 61 DLYKNYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVVEADPEEKSKRSSPSSFANGKEDG 120
DLYKNYP M GYASGLYNLAWAQAVQNKPLNDIFV+EAD +EKSK SS + F N K+DG
Sbjct: 61 DLYKNYPAMRHGYASGLYNLAWAQAVQNKPLNDIFVMEADLDEKSKHSSSTPFGNAKDDG 120
Query: 121 -NSTKEGGKVVIDGSADEIDCDNVNVEREEGELEEGEIDMDSEFVEEVVDSKAMLSDCRD 180
N+TKE +VVID S DE++CDN N E+EEGELEEGEIDMD+EFVEEV DSKAMLSD RD
Sbjct: 121 SNTTKEEDRVVIDDSGDEMNCDNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSRD 180
Query: 181 MDCDSEEFDLRKKELDDKVKLIQKTLDGVTIDAAQKSFQEVCSLLHSSIETSMQLLQEKV 240
MD + +EFDL KELD+ +K IQKTLDGVTIDAAQKSFQEVCS +HSSIET ++LLQ KV
Sbjct: 181 MDINGQEFDLETKELDELLKFIQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVELLQGKV 240
Query: 241 VPRKDALIQRLYAALRIINSV-------------------ASYVKTCNPPLFSPEQIKSV 300
VPRKDALIQRLYAALR+INSV SYVK C+PPLFSPEQIKSV
Sbjct: 241 VPRKDALIQRLYAALRLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSPEQIKSV 300
Query: 301 EVKMPST---DYLPSMRASAKE---------STFDFV-------------NQVAFGLHAC 360
EVKMPST D+LPSMR SAKE DF N++A
Sbjct: 301 EVKMPSTDSLDHLPSMRGSAKEVEIHIPNGVKDMDFYSAYTSTSSQLTPSNKLASDSIPF 360
Query: 361 WVVGKNNLNILSDGLQSGVSNIKGRGALLPLLDLHKDHDVDSLPSPTREAPSFFPVQKSG 420
V GKNNLNILS+GLQSGVS+IKGRG LLPLLDLHKDHD DSLPSPTREAP+ F VQKSG
Sbjct: 361 GVKGKNNLNILSEGLQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSG 420
Query: 421 NAPAKVALAMDGPRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDDGAGDV 480
NAP K+A +DG RSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEE DG GD+
Sbjct: 421 NAPTKMAFPVDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEE-HDGGGDI 480
Query: 481 GGSIKG----------------------------------------LISPINVAPPSCVS 540
GG + LISP+NVAPPS VS
Sbjct: 481 GGEVSSSSIIRSLKSSNVSKPGQKSNSASNVSTGLFPNMDSSSTRVLISPLNVAPPSSVS 540
Query: 541 NPTVKPLAKSRDPRLRIVNSDA-------------------KSAATINLRKQKMDEEPNI 600
NPTVKPLAKSRDPRLRIVNSDA +SAAT++LRKQKMD EPN
Sbjct: 541 NPTVKPLAKSRDPRLRIVNSDASGMDLNPRTMASVQSSSILESAATLHLRKQKMDGEPNT 600
Query: 601 DGPETKRQRIGSQNLAVASSDLGSVSEVVAGWKILCQLDPDFQVGIRW------------ 660
DGPE KR RIGSQNLAVA+SD+ +VS GW L+ G R
Sbjct: 601 DGPEVKRLRIGSQNLAVAASDVRAVSG-SGGW-----LEDTMPAGPRLFNRNQMEIAEAN 660
Query: 661 ---------------------SANNDASLPSLLKDIVVNPTMLINLLKISQQQQLAAELK 720
+ +NDASLPSLLKDIVVNPTML+NLLK+SQQQQLAAELK
Sbjct: 661 ATEKSNVTNNSGSGNECTPTVNNSNDASLPSLLKDIVVNPTMLLNLLKMSQQQQLAAELK 720
Query: 721 LKSSEPEKNAICPTSLSPCLGSTPLANTPAVTSGNLQQSGGTPSV-PSPPVATVDDLGKV 780
LKSSEPEKNAICPTSL+PC GS+PL N P TSG LQQS GTPS P V DDLGKV
Sbjct: 721 LKSSEPEKNAICPTSLNPCQGSSPLINAPVATSGILQQSAGTPSASPVVAVGRQDDLGKV 780
Query: 781 RMKPRDPRRILHGNSLQKVGSLGNEQFKGIVPAAPNTEGSRDIPNGHKQEGQGDLRLASS 840
RMKPRDPRR+LHGNSLQKVGSLGN+Q KG+VP A NTEGSRDIPNGHKQEGQGD +LASS
Sbjct: 781 RMKPRDPRRVLHGNSLQKVGSLGNDQLKGVVPTASNTEGSRDIPNGHKQEGQGDSKLASS 840
Query: 841 QPLLPDITRQFTKNLKNIADILSVSSPSASSQTSSSKPVKLDRMDTNAVGSSSSDSKVVT 900
Q +LPDI RQFT NLKNIADI+SV SP SS SSSKP VGSSS DSK VT
Sbjct: 841 QTILPDIGRQFTNNLKNIADIMSVPSPPTSSPNSSSKP----------VGSSSMDSKPVT 900
Query: 901 TATQAADMVSPSRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKLCLVL 960
TA QA DM + SRSQG WGDLEHLF+ YDDKQKAAIQRERARRIEEQKKMFAARKLCLVL
Sbjct: 901 TAFQAVDMAASSRSQGAWGDLEHLFDSYDDKQKAAIQRERARRIEEQKKMFAARKLCLVL 960
Query: 961 DLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEK 1020
DLDHTLLNSAKFVEVDPVHDEILRKKEEQDREK QRHLFRFPHMGMWTKLRPGVWNFLEK
Sbjct: 961 DLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEK 1020
Query: 1021 ASELYELHLYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGDERVPKSKDLEG 1080
ASELYELHLYTMGNKLYATEMAKVLDPKG LFAGRVISRGDDGDPLDGD+RVPKSKDLEG
Sbjct: 1021 ASELYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPLDGDDRVPKSKDLEG 1080
Query: 1081 VLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDG 1128
VLGMES VVIIDDS+RVWPHNK+NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDG
Sbjct: 1081 VLGMESGVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDG 1140
BLAST of Sgr021939 vs. ExPASy Swiss-Prot
Match:
Q8LL04 (RNA polymerase II C-terminal domain phosphatase-like 3 OS=Arabidopsis thaliana OX=3702 GN=CPL3 PE=1 SV=2)
HSP 1 Score: 909.4 bits (2349), Expect = 3.9e-263
Identity = 581/1273 (45.64%), Postives = 746/1273 (58.60%), Query Frame = 0
Query: 1 MGKDESVKI-EDVEEGEISDTASVE-EISEEDFNK-----------LESGAKVASKDSSN 60
MG DE++ + DVEEGEI D+ + E E+ + + +G + SN
Sbjct: 15 MGNDENLMVMVDVEEGEIPDSVNTEIEVKHKSTTTTADVGGDVDVGVVAGGRGGGGGGSN 74
Query: 61 REARVWTMSDLYKNYPTMCRGYA-SGLYNLAWAQAVQNKPLNDIFVVEADPEEKSKRSSP 120
+RVWTM +L YP R YA SGL NLAWA+AVQNKP N+ V++ +P
Sbjct: 75 GNSRVWTMEELISQYPAY-RPYANSGLSNLAWARAVQNKPFNEGLVMDYEP--------- 134
Query: 121 SSFANGKEDGNSTKEGGKVVIDGSADEIDCDNVNVEREEGELEEGEIDM-----DSEFVE 180
+E K+VI+ S D E+EEGELEEGEID+ D VE
Sbjct: 135 -------------RESDKIVIEDSDD---------EKEEGELEEGEIDLVDNASDDNLVE 194
Query: 181 EVVDSKAMLSDCRDMDCDSEEFDLRKKELDDKVKLIQKTLDGVTIDAAQKSFQEVCSLLH 240
+ +S ++S D ++ L++++L+ KVKLI+ L+ ++ AQ F+ VCS +
Sbjct: 195 KDTESVVLIS----ADKVEDDRILKERDLEKKVKLIRGVLESTSLVEAQTGFEGVCSRIL 254
Query: 241 SSIETSMQLLQEK-VVPRKDALIQRLYAALRIINSVASYVKTCNPPLFSPEQIKSVEVKM 300
++E+ +L+ + P++D L+Q +A+L+ IN V C+ S E+ K ++
Sbjct: 255 GALESLRELVSDNDDFPKRDTLVQLSFASLQTINYV-----FCSMNNISKERNKETMSRL 314
Query: 301 PS--TDYLPSMRASAKESTFDFVNQ----VAFGLHACWVVGKNNLN-------------- 360
+ D+ + +++ + +NQ A + A + N+N
Sbjct: 315 LTLVNDHFSQFLSFNQKNEIETMNQDLSRSAIAVFA-GTSSEENVNQMTQPSNGDSFLAK 374
Query: 361 -ILSDGLQSGVSNIKGRGALLPLLDLHKDHDVDSLPSPTREAPSFFPVQ------KSGNA 420
+ S+ G + ++ R +LPLLDLHKDHD DSLPSPTRE PV + G
Sbjct: 375 KLTSESTHRGAAYLRSRLPMLPLLDLHKDHDADSLPSPTRETTPSLPVNGRHTMVRPGFP 434
Query: 421 PAKVALAMDGPRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDDGAGDVGG 480
+ + +G + + YE+DA KAVSTYQQKFG +S D LPSPTPS E +DG GDVGG
Sbjct: 435 VGRESQTTEGAKVYSYESDARKAVSTYQQKFGLNSVFKTDDLPSPTPSGEPNDGNGDVGG 494
Query: 481 SIKGLI-------------------------------SPINVAPP----------SCVSN 540
+ + S + PP S+
Sbjct: 495 EVSSSVVKSSNPGSHLIYGQDVPLPSNFNSRSMPVANSVSSTVPPHHLSIHAISAPTASD 554
Query: 541 PTVKPLAKSRDPRLRIVNSDAK--------------------SAATINLRKQKMDEEPNI 600
TVKP AKSRDPRLR+ DA SA +N RKQK +E I
Sbjct: 555 QTVKPSAKSRDPRLRLAKPDAANVTIYSYSSGDARNLSKVELSADLVNPRKQKAADEFLI 614
Query: 601 DGPETKRQRIGSQNLAVASSDLGSVSEVVAGWKILCQLDP-------------------- 660
DGP KRQ+ + A+ G + + + + + P
Sbjct: 615 DGPAWKRQK-SDTDAPKAAGTGGWLEDTESSGLLKLESKPRLIENGVTSMTSSVMPTSAV 674
Query: 661 DFQVGIRWSANNDASLPSLLKDIVVNPTMLINLLKISQQQQLAAELKLKSSEPEKNAICP 720
+R ++ + ASL SLLKDI VNPTML+NLLK+ ++Q++ + K +P + A P
Sbjct: 675 SVSQKVRTASTDTASLQSLLKDIAVNPTMLLNLLKMGERQKVPEKAIQKPMDPRRAAQLP 734
Query: 721 TSLSPCLGSTPLA--NTPAVTSGNLQQSGGTPSVPSPPVATVDDLGKVRMKPRDPRRILH 780
S STPL+ + A+ + +L S + P A + G +RMKPRDPRRILH
Sbjct: 735 GSSVQPGVSTPLSIPASNALAANSLNSGVLQDSSQNAPAA---ESGSIRMKPRDPRRILH 794
Query: 781 GNSLQKVGSLGNEQFKGIVPA----------APNTEGSRDIPNGHKQEGQGDLRLASSQP 840
G++LQ+ S +Q K P+ A + E + G ++ S
Sbjct: 795 GSTLQRTDSSMEKQTKVNDPSTLGTLTMKGKAEDLETPPQLDPRQNISQNGTSKMKISGE 854
Query: 841 LL----PDITRQFTKNLKNIADILSVSSPSASSQTS-SSKPVKLDR-MDTNAVGSSSSDS 900
LL PD + QFTKNLK+IAD++ VS + S S +K +R + N ++ D
Sbjct: 855 LLSGKTPDFSTQFTKNLKSIADMVVVSQQLGNPPASMHSVQLKTERDVKHNPSNPNAQDE 914
Query: 901 KVVTTATQAADMVSPSRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKL 960
V +A P+RS +WGD+EHLFEGYDD Q+ AIQRER RR+EEQ KMFA++KL
Sbjct: 915 DVSVSAASVTAAAGPTRSMNSWGDVEHLFEGYDDIQRVAIQRERVRRLEEQNKMFASQKL 974
Query: 961 CLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWN 1020
LVLD+DHTLLNSAKF EV+ H+EILRKKEEQDREK RHLFRF HMGMWTKLRPG+WN
Sbjct: 975 SLVLDIDHTLLNSAKFNEVESRHEEILRKKEEQDREKPYRHLFRFLHMGMWTKLRPGIWN 1034
Query: 1021 FLEKASELYELHLYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGDERVPKSK 1080
FLEKAS+LYELHLYTMGNKLYATEMAK+LDPKG LF GRVIS+GDDGDPLDGDERVPKSK
Sbjct: 1035 FLEKASKLYELHLYTMGNKLYATEMAKLLDPKGVLFNGRVISKGDDGDPLDGDERVPKSK 1094
Query: 1081 DLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDER 1128
DLEGV+GMES+VVIIDDSVRVWP +K+NLI VERY YFPCSRRQFGLLGPSLLE+D DE
Sbjct: 1095 DLEGVMGMESSVVIIDDSVRVWPQHKMNLIAVERYLYFPCSRRQFGLLGPSLLELDRDEV 1154
BLAST of Sgr021939 vs. ExPASy Swiss-Prot
Match:
Q00IB6 (RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana OX=3702 GN=CPL4 PE=1 SV=1)
HSP 1 Score: 266.2 bits (679), Expect = 1.7e-69
Identity = 150/321 (46.73%), Postives = 206/321 (64.17%), Query Frame = 0
Query: 812 RKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEE--QDREKVQ-RHLFRFPHMGMWTKL 871
RKL LVLDLDHTLLN+ ++ P +E L+ QD V LF M M TKL
Sbjct: 121 RKLYLVLDLDHTLLNTTILRDLKP-EEEYLKSHTHSLQDGCNVSGGSLFLLEFMQMMTKL 180
Query: 872 RPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGDE 931
RP V +FL++ASE++ +++YTMG++ YA +MAK+LDPKG F RVISR DDG
Sbjct: 181 RPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVISR-DDG------- 240
Query: 932 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLE 991
V K L+ VLG ESAV+I+DD+ WP +K NLIV+ERY +F S RQF SL E
Sbjct: 241 TVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFDHRYKSLSE 300
Query: 992 IDHDERPEDGTLASSLAVIQRIHQTFFSHPVLDGV---DVRNILASEQQKILAGCRIVFS 1051
+ DE DG LA+ L V+++ H FF + V +G+ DVR +L +++IL GC+IVFS
Sbjct: 301 LKSDESEPDGALATVLKVLKQAHALFFEN-VDEGISNRDVRLMLKQVRKEILKGCKIVFS 360
Query: 1052 RVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSAGRFVVHP 1111
RVFP +A P HPLW+ AE+ GA C ++D VTHVVA +GT+K WA+ ++VVH
Sbjct: 361 RVFPT-KAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAVREKKYVVHR 420
Query: 1112 GWVEASALLYRRANEQDFAIK 1127
GW++A+ L+ + E++F ++
Sbjct: 421 GWIDAANYLWMKQPEENFGLE 430
BLAST of Sgr021939 vs. ExPASy Swiss-Prot
Match:
Q95QG8 (RNA polymerase II subunit A C-terminal domain phosphatase OS=Caenorhabditis elegans OX=6239 GN=fcp-1 PE=1 SV=2)
HSP 1 Score: 140.2 bits (352), Expect = 1.4e-31
Identity = 106/338 (31.36%), Postives = 164/338 (48.52%), Query Frame = 0
Query: 804 EQKKMFAARKLCLVLDLDHTLLN-SAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHM 863
++ + RKL L++DLD T+++ S K + VD + + + K R
Sbjct: 134 DENNLITNRKLVLLVDLDQTIIHTSDKPMTVDTENHKDITKYNLHSRVYT---------- 193
Query: 864 GMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGD 923
TKLRP FL K S +YE+H+ T G + YA +A++LDP LF R++SR D
Sbjct: 194 ---TKLRPHTTEFLNKMSNMYEMHIVTYGQRQYAHRIAQILDPDARLFEQRILSR----D 253
Query: 924 PLDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF--------- 983
L + K+ +L+ + ++ VVIIDD VW +++ LI ++ Y +F
Sbjct: 254 ELFSAQH--KTNNLKALFPCGDNLVVIIDDRSDVWMYSEA-LIQIKPYRFFKEVGDINAP 313
Query: 984 PCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIQRIHQTFFSHPVLDG-----VDVRN 1043
S+ Q P +E D+ ED L V+ IH ++ L G +DV+
Sbjct: 314 KNSKEQM----PVQIE---DDAHEDKVLEEIERVLTNIHDKYYEKHDLRGSEEVLLDVKE 373
Query: 1044 ILASEQQKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSL 1103
++ E+ K+L GC IVFS + P+GE +++ QFGAV + + VTHVV
Sbjct: 374 VIKEERHKVLDGCVIVFSGIVPMGEKLERT-DIYRLCTQFGAVIVPDVTDDVTHVVGARY 433
Query: 1104 GTDKVNWALSAGRFVVHPGWVEASALLYRRANEQDFAI 1126
GT KV A +FVV WV A + +A+E F +
Sbjct: 434 GTQKVYQANRLNKFVVTVQWVYACVEKWLKADENLFQL 443
BLAST of Sgr021939 vs. ExPASy Swiss-Prot
Match:
F4JCB2 (RNA polymerase II C-terminal domain phosphatase-like 5 OS=Arabidopsis thaliana OX=3702 GN=CPL5 PE=1 SV=2)
HSP 1 Score: 120.6 bits (301), Expect = 1.2e-25
Identity = 73/219 (33.33%), Postives = 120/219 (54.79%), Query Frame = 0
Query: 812 RKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPG 871
+KL LVLDLDHTLL++ + ++ + R+ + + M TKLRP
Sbjct: 384 KKLHLVLDLDHTLLHTVMVPSLSQAEKYLIEEAGSATRDDLWKIKAVGDPMEFLTKLRPF 443
Query: 872 VWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGDERVP 931
+ +FL++A+E + +++YT G+++YA ++ +++DPK F RVI++ + P
Sbjct: 444 LRDFLKEANEFFTMYVYTKGSRVYAKQVLELIDPKKLYFGDRVITKTES----------P 503
Query: 932 KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDH 991
K L+ VL E VVI+DD+ VWP +K NL+ + +Y+YF R G E
Sbjct: 504 HMKTLDFVLAEERGVVIVDDTRNVWPDHKSNLVDISKYSYF----RLKGQDSMPYSEEKT 563
Query: 992 DERPEDGTLASSLAVIQRIHQTFFS-HPVLDGVDVRNIL 1030
DE +G LA+ L +++ +HQ FF L+ DVR++L
Sbjct: 564 DESESEGGLANVLKLLKEVHQRFFRVEEELESKDVRSLL 588
BLAST of Sgr021939 vs. ExPASy Swiss-Prot
Match:
Q9P376 (RNA polymerase II subunit A C-terminal domain phosphatase OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=fcp1 PE=1 SV=1)
HSP 1 Score: 109.0 bits (271), Expect = 3.5e-22
Identity = 121/477 (25.37%), Postives = 181/477 (37.95%), Query Frame = 0
Query: 783 FEGYDDKQKA-----------AIQRERARRIEEQ--KKMFAARKLCLVLDLDHTLLNSAK 842
+ GY D +A + E A R+E + K++ ++L L++DLD T++++
Sbjct: 121 YMGYSDMARANISMTHNTGDLTVSLEEASRLESENVKRLRQEKRLSLIVDLDQTIIHAT- 180
Query: 843 FVEVDPVHDEILRKKEEQD----REKVQRHLFRFPH---MGMWTKLRPGVWNFLEKASEL 902
VDP E + + R+ +L P + K RPG+ FL+K SEL
Sbjct: 181 ---VDPTVGEWMSDPGNVNYDVLRDVRSFNLQEGPSGYTSCYYIKFRPGLAQFLQKISEL 240
Query: 903 YELHLYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGM 962
YELH+YTMG K YA E+AK++DP G LF RV+SR D G K L +
Sbjct: 241 YELHIYTMGTKAYAKEVAKIIDPTGKLFQDRVLSRDDSGS--------LAQKSLRRLFPC 300
Query: 963 E-SAVVIIDDSVRVWPHNKLNLIVVERYTYF-----------------PCSRRQFGLLGP 1022
+ S VV+IDD VW N NLI V Y +F P + L P
Sbjct: 301 DTSMVVVIDDRGDVWDWNP-NLIKVVPYEFFVGIGDINSNFLAKSTPLPEQEQLIPLEIP 360
Query: 1023 -----------------------------------------------------------S 1082
+
Sbjct: 361 KDEPDSVDEINEENEETPEYDSSNSSYAQDSSTIPEKTLLKDTFLQNREALEEQNKERVT 420
Query: 1083 LLEIDHDERP------------------------EDGTLASSLAVIQRIHQTFFSHPVLD 1128
LE+ ERP D L V++ IH ++ +
Sbjct: 421 ALELQKSERPLAKQQNALLEDEGKPTPSHTLLHNRDHELERLEKVLKDIHAVYYEEE--N 480
BLAST of Sgr021939 vs. ExPASy TrEMBL
Match:
A0A6J1D5D6 (Protein-serine/threonine phosphatase OS=Momordica charantia OX=3673 GN=LOC111017451 PE=4 SV=1)
HSP 1 Score: 1782.7 bits (4616), Expect = 0.0e+00
Identity = 978/1268 (77.13%), Postives = 1020/1268 (80.44%), Query Frame = 0
Query: 1 MGKDESVKIEDVEEGEISDTASVEEISEEDFNKLESGAKVASKDSSNREARVWTMSDLYK 60
MGKDESVKIEDVEEGEISDTASVEEISEEDFNKLE+GAK+ SNRE RVWTMSDLYK
Sbjct: 1 MGKDESVKIEDVEEGEISDTASVEEISEEDFNKLETGAKLVPSKDSNREPRVWTMSDLYK 60
Query: 61 NYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVVEADPEEKSKR-SSPSSFANGKEDGNST 120
NYPTMCRGYASGLYNLAWAQAVQNKPLNDIFV+EADPEEKSKR SSPS AN GNST
Sbjct: 61 NYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVMEADPEEKSKRSSSPSPLAN----GNST 120
Query: 121 KEGGKVVIDGSADEIDCDNVNVEREEGELEEGEIDMDSEFVEEVVDSKAMLSDCRDMDCD 180
KE GKV ID S+DE+D N NVEREEGELEEGEIDMD+EFVEEVV+SKAMLSD D DCD
Sbjct: 121 KEEGKVTIDDSSDEMDYGNANVEREEGELEEGEIDMDTEFVEEVVESKAMLSDSGDTDCD 180
Query: 181 SEEFDLRKKELDDKVKLIQKTLDGVTIDAAQKSFQEVCSLLHSSIETSMQLLQEKVVPRK 240
+E DL KKELDD+VKLIQKTLDGVTIDAAQKSF+EVC+ LHSSIE ++LLQEKV P K
Sbjct: 181 GQESDLVKKELDDQVKLIQKTLDGVTIDAAQKSFEEVCTQLHSSIEIFLKLLQEKVFPXK 240
Query: 241 DALIQRLYAALRIINSV-------------------ASYVKTCNPPLFSPEQIKSVEVKM 300
DALIQRLYAALRIINSV SYVK CNPPLFSPEQIKSVEVKM
Sbjct: 241 DALIQRLYAALRIINSVFCSMNLNEKEEYKQHLSRLLSYVKNCNPPLFSPEQIKSVEVKM 300
Query: 301 PST---DYLPSMRASAKESTFDFVNQVA----------FGLH------------ACWVVG 360
PST DYL +RA+AKE+ N V G H V+
Sbjct: 301 PSTDSLDYLSIIRANAKEAEIHIPNGVKNKDFYSGSTNAGPHLTSSTKLPSDSMPVGVMA 360
Query: 361 KNNLNILSDGLQSGVSNIKGRGALLPLLDLHKDHDVDSLPSPTREAPSFFPVQKSGNAPA 420
KNN NILSDG QSGVSN++GRG LLPLLDLHKDHDVDSLPSPTREAPS FPVQK GN P
Sbjct: 361 KNNPNILSDGSQSGVSNLRGRGPLLPLLDLHKDHDVDSLPSPTREAPSIFPVQKLGNTPP 420
Query: 421 KVALAMDGPRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDDGAGDVGG-- 480
KVALAMDG RSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEEC DGAGD+GG
Sbjct: 421 KVALAMDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEEC-DGAGDIGGEV 480
Query: 481 ---------------------------------------SIKGLISPINVAPPSCVSNPT 540
SIKGLISPINVAPPSCVSNPT
Sbjct: 481 SSSSIIRSLKASNSPKLGQNVSNSASNISAGYFPNMESSSIKGLISPINVAPPSCVSNPT 540
Query: 541 VKPLAKSRDPRLRIVNSD-------------------AKSAATINLRKQKMDEEPNIDGP 600
VKPL KSRDPR RI+NSD A+S ATINLRKQKM EEPN+DGP
Sbjct: 541 VKPLPKSRDPRRRIINSDASALDLNPRTIASVQNSSIAESDATINLRKQKMGEEPNVDGP 600
Query: 601 ETKRQRIGSQNLAVASSDLGSVSEVVAGWKILCQLDPDFQVGIRW--------------- 660
E KRQR GSQN AVA+SD+ + S GW L+ VG R
Sbjct: 601 EMKRQRTGSQNHAVAASDVRTGS---GGW-----LEDTMPVGPRLSSRNQMEISEADATE 660
Query: 661 ------------------SANNDASLPSLLKDIVVNPTMLINLLKISQQQQLAAELKLKS 720
SA+NDASLPSLLKDI VNPTM ++LLK+SQQQ LAAELKLKS
Sbjct: 661 KLNVTNNSVAGNECTPSISASNDASLPSLLKDIAVNPTMFLSLLKMSQQQHLAAELKLKS 720
Query: 721 SEPEKNAICPTSLSPCLGSTPLANTPAVTSGNLQQSGGTPSVPSPPVATV---DDLGKVR 780
SE EKNAICPTSL+PC GS+PL NTP+VTSG LQQS GT SVPSPPVATV DDLGKVR
Sbjct: 721 SELEKNAICPTSLNPCQGSSPLVNTPSVTSGILQQSTGTSSVPSPPVATVSRQDDLGKVR 780
Query: 781 MKPRDPRRILHGNSLQKVGSLGNEQFKGIVPAAPNTEGSRDIPNGHKQEGQGDLRLASSQ 840
MKPRDPRRILHGNSLQKVG+LGNEQ KGIVP APNTEGS+D+PNGHKQEG GDLRLASSQ
Sbjct: 781 MKPRDPRRILHGNSLQKVGNLGNEQSKGIVPTAPNTEGSKDVPNGHKQEGLGDLRLASSQ 840
Query: 841 PLLPDITRQFTKNLKNIADILSVSSPSASSQTSSSKPVKLDRMDTNAVGSSSSDSKVVTT 900
+ PDITR FTKNLKNIADILS SSP SS +SSSKPVKLDRMDTN+VGSSS DSKVVTT
Sbjct: 841 SVPPDITRPFTKNLKNIADILSGSSPPTSSLSSSSKPVKLDRMDTNSVGSSSIDSKVVTT 900
Query: 901 ATQAADMVSPSRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLD 960
ATQA DMV SRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLD
Sbjct: 901 ATQAVDMVGLSRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLD 960
Query: 961 LDHTLLNSAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKA 1020
LDHTLLNSAKFVEV+PVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKA
Sbjct: 961 LDHTLLNSAKFVEVEPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKA 1020
Query: 1021 SELYELHLYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGDERVPKSKDLEGV 1080
SELYELHLYTMGNKLYATEMAKVLDPKG LFAGRVISRGDDGDPLDGDERVPKSKDLEGV
Sbjct: 1021 SELYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPLDGDERVPKSKDLEGV 1080
Query: 1081 LGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGT 1128
LGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGT
Sbjct: 1081 LGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGT 1140
BLAST of Sgr021939 vs. ExPASy TrEMBL
Match:
A0A6J1H839 (Protein-serine/threonine phosphatase OS=Cucurbita moschata OX=3662 GN=LOC111460939 PE=4 SV=1)
HSP 1 Score: 1701.8 bits (4406), Expect = 0.0e+00
Identity = 924/1261 (73.28%), Postives = 996/1261 (78.98%), Query Frame = 0
Query: 1 MGKDES-VKIEDVEEGEISDTASVEEISEEDFNKLESGAKVASKDSSNREARVWTMSDLY 60
MGK + VK DVEEGEISDT SVEEI+EEDFNKLE+ K+ SNRE VWTMSDLY
Sbjct: 1 MGKHTNCVKTPDVEEGEISDTPSVEEITEEDFNKLETAPKLLPSKHSNRETTVWTMSDLY 60
Query: 61 KNYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVVEADPEEKSKRSSPSSFANGKEDGNST 120
NYPTMCRGYA GLYNLAWA+AVQNKPLN+IF+++ADP++KS RSS S F N KE GN T
Sbjct: 61 NNYPTMCRGYAPGLYNLAWAKAVQNKPLNEIFLMDADPDDKSNRSSSSPFRNAKEHGNGT 120
Query: 121 KEGGKVVIDGSADEIDCDNVNVEREEGELEEGEIDMDSEFVEEVVDSKAMLSDCRDMDCD 180
KE K++ID + D+++ DN +VE+EEGELEEGEIDMD+EFVEEVVDSK MLSD D DC
Sbjct: 121 KEEAKLIIDITGDDMNSDNADVEKEEGELEEGEIDMDTEFVEEVVDSKPMLSDSLDTDC- 180
Query: 181 SEEFDLRKKELDDKVKLIQKTLDGVTIDAAQKSFQEVCSLLHSSIETSMQLLQEKVVPRK 240
+E DL+ KELDD++KLI KTLDGVTIDAAQKSFQEVCS L SSIET ++L+Q KVVPRK
Sbjct: 181 -QEIDLKNKELDDQLKLIHKTLDGVTIDAAQKSFQEVCSQLLSSIETFLELVQGKVVPRK 240
Query: 241 DALIQRLYAALRIINSV-------------------ASYVKTCNPPLFSPEQIKSVEVKM 300
D LIQRLYAALRIINSV S+VK CNPPLFSPEQIKSVEVKM
Sbjct: 241 DVLIQRLYAALRIINSVFCSMNPKEKEEYKQHLSRLLSFVKNCNPPLFSPEQIKSVEVKM 300
Query: 301 PSTDYL---PSMRASAKESTFDFVNQVA----FGLHA------------------CWVVG 360
PSTD L P MRASAK+ N V + +A V
Sbjct: 301 PSTDSLDQFPDMRASAKDVEIHIPNGVKNKDFYSAYATATPHLTSSTKLPSDSMPVGVTV 360
Query: 361 KNNLNILSDGLQSGVSNIKGRGALLPLLDLHKDHDVDSLPSPTREAPSFFPVQKSGNAPA 420
KN+LN+ SD L SGV N+KGRG LLPLLDLHKDHDVDSLPSPTREAP+ F VQKSG+ P
Sbjct: 361 KNSLNLSSDSLLSGVPNVKGRGPLLPLLDLHKDHDVDSLPSPTREAPTVFSVQKSGHIPV 420
Query: 421 KVALAMDGPRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDDGAGDVGG-- 480
KVA AMDG R HPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEEC DG GD+GG
Sbjct: 421 KVAHAMDGSRVHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEEC-DGGGDIGGEV 480
Query: 481 ---------------------------------------SIKGLISPINVAPPSCVSNPT 540
S KGLISP NVAPPSCVSNP
Sbjct: 481 SSSSILRSSKASNSSKLAQTVSQSASSISTGLFPNLESSSTKGLISPSNVAPPSCVSNPI 540
Query: 541 VKPLAKSRDPRLRIVNSDA-------------------KSAATINLRKQKMDEEPNIDGP 600
KPLAKSRDPRLR+VNS+A +SA T+NLRKQKMD EPNID P
Sbjct: 541 AKPLAKSRDPRLRMVNSEASAMDLNPRTMTSVQSPSVVESAVTVNLRKQKMDVEPNIDAP 600
Query: 601 ETKRQRIGSQNLAVASSDLGSVSEVVAGW-----KILCQLDPDFQV-------------- 660
E KRQRIGSQN A ++SDL + S GW + +L Q+
Sbjct: 601 EMKRQRIGSQNHAFSASDLRAGSG-SGGWLEDTMSAVPRLSSRNQMEIAEANATEKNNVT 660
Query: 661 ---------GIRWSANNDASLPSLLKDIVVNPTMLINLLKISQQQQLAAELKLKSSEPEK 720
G SA+ +ASLPSLLKDIVVNPTML++LLK++QQ+Q+AAELKLKSSEPEK
Sbjct: 661 NNSGAGNSCGPTISASKEASLPSLLKDIVVNPTMLLSLLKMNQQKQVAAELKLKSSEPEK 720
Query: 721 NAICPTSLSPCLGSTPLANTPAVTSGNLQQSGGTPSVPSPPVATVDDLGKVRMKPRDPRR 780
NAICPT+++PCLGS+PL N PA+TSG LQQS GTPSVPSPPV TVDD+GKVRMKPRDPRR
Sbjct: 721 NAICPTAVNPCLGSSPLVNAPALTSGILQQSAGTPSVPSPPVVTVDDVGKVRMKPRDPRR 780
Query: 781 ILHGNSLQKVGSLGNEQFKGIVPAAPNTEGSRDI-PNGHKQEGQGDLRLASSQPLLPDIT 840
ILHGNSL KVGS+GNEQ K +VPA PN EGSRDI PNGHKQEGQG+LRLASSQPLLPDI
Sbjct: 781 ILHGNSLHKVGSMGNEQLKSVVPAVPNPEGSRDIVPNGHKQEGQGNLRLASSQPLLPDIG 840
Query: 841 RQFTKNLKNIADILSVSSPSASSQTSSSKPVKLDRMDTNAVGSSSSDSKVVTTATQAADM 900
RQFT NLKNIADI+SV SP SS SSSKPVKLD DTNAVGSSS DSK+V TATQ DM
Sbjct: 841 RQFTNNLKNIADIMSVPSPPTSSHNSSSKPVKLDIKDTNAVGSSSIDSKIVATATQVVDM 900
Query: 901 VSPSRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLLN 960
V PSRS G WGDLEHLFEGYDDKQKAAIQRERARRI+EQKKMFAARKLCLVLDLDHTLLN
Sbjct: 901 VGPSRSHGAWGDLEHLFEGYDDKQKAAIQRERARRIDEQKKMFAARKLCLVLDLDHTLLN 960
Query: 961 SAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH 1020
SAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH
Sbjct: 961 SAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELH 1020
Query: 1021 LYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAV 1080
LYTMGNKLYATEMAKVLDPKG LFAGRV+SRGDDGDPLDG+ERVPKSKDLEGVLGMESAV
Sbjct: 1021 LYTMGNKLYATEMAKVLDPKGVLFAGRVLSRGDDGDPLDGEERVPKSKDLEGVLGMESAV 1080
Query: 1081 VIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAV 1128
VIIDDS+RVWPHNK+NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAV
Sbjct: 1081 VIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAV 1140
BLAST of Sgr021939 vs. ExPASy TrEMBL
Match:
A0A0A0KAB9 (Protein-serine/threonine phosphatase OS=Cucumis sativus OX=3659 GN=Csa_6G091910 PE=4 SV=1)
HSP 1 Score: 1686.8 bits (4367), Expect = 0.0e+00
Identity = 929/1269 (73.21%), Postives = 989/1269 (77.94%), Query Frame = 0
Query: 1 MGKDESVKIEDVEEGEISDTASVEEISEEDFNKLESGAK----VASKDSSNREARVWTMS 60
MGKDE +KIEDVEEGEISDTASVEEISEEDFNKL+S A V SKD SNRE RVWTMS
Sbjct: 1 MGKDEILKIEDVEEGEISDTASVEEISEEDFNKLDSSASPKVVVPSKD-SNRETRVWTMS 60
Query: 61 DLYKNYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVVEADPEEKSKRSSPSSFANGKEDG 120
DLYKNYP M GYASGLYNLAWAQAVQNKPLNDIFV+EAD +EKSK SS + F N K+DG
Sbjct: 61 DLYKNYPAMRHGYASGLYNLAWAQAVQNKPLNDIFVMEADLDEKSKHSSSTPFGNAKDDG 120
Query: 121 -NSTKEGGKVVIDGSADEIDCDNVNVEREEGELEEGEIDMDSEFVEEVVDSKAMLSDCRD 180
N+TKE +VVID S DE++CDN N E+EEGELEEGEIDMD+EFVEEV DSKAMLSD RD
Sbjct: 121 SNTTKEEDRVVIDDSGDEMNCDNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSRD 180
Query: 181 MDCDSEEFDLRKKELDDKVKLIQKTLDGVTIDAAQKSFQEVCSLLHSSIETSMQLLQEKV 240
MD + +EFDL KELD+ +K IQKTLDGVTIDAAQKSFQEVCS +HSSIET ++LLQ KV
Sbjct: 181 MDINGQEFDLETKELDELLKFIQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVELLQGKV 240
Query: 241 VPRKDALIQRLYAALRIINSV-------------------ASYVKTCNPPLFSPEQIKSV 300
VPRKDALIQRLYAALR+INSV SYVK C+PPLFSPEQIKSV
Sbjct: 241 VPRKDALIQRLYAALRLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSPEQIKSV 300
Query: 301 EVKMPST---DYLPSMRASAKE---------STFDFV-------------NQVAFGLHAC 360
EVKMPST D+LPSMR SAKE DF N++A
Sbjct: 301 EVKMPSTDSLDHLPSMRGSAKEVEIHIPNGVKDMDFYSAYTSTSSQLTPSNKLASDSIPF 360
Query: 361 WVVGKNNLNILSDGLQSGVSNIKGRGALLPLLDLHKDHDVDSLPSPTREAPSFFPVQKSG 420
V GKNNLNILS+GLQSGVS+IKGRG LLPLLDLHKDHD DSLPSPTREAP+ F VQKSG
Sbjct: 361 GVKGKNNLNILSEGLQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSG 420
Query: 421 NAPAKVALAMDGPRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDDGAGDV 480
NAP K+A +DG RSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEE DG GD+
Sbjct: 421 NAPTKMAFPVDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEE-HDGGGDI 480
Query: 481 GGSIKG----------------------------------------LISPINVAPPSCVS 540
GG + LISP+NVAPPS VS
Sbjct: 481 GGEVSSSSIIRSLKSSNVSKPGQKSNSASNVSTGLFPNMDSSSTRVLISPLNVAPPSSVS 540
Query: 541 NPTVKPLAKSRDPRLRIVNSDA-------------------KSAATINLRKQKMDEEPNI 600
NPTVKPLAKSRDPRLRIVNSDA +SAAT++LRKQKMD EPN
Sbjct: 541 NPTVKPLAKSRDPRLRIVNSDASGMDLNPRTMASVQSSSILESAATLHLRKQKMDGEPNT 600
Query: 601 DGPETKRQRIGSQNLAVASSDLGSVSEVVAGWKILCQLDPDFQVGIRW------------ 660
DGPE KR RIGSQNLAVA+SD+ +VS GW L+ G R
Sbjct: 601 DGPEVKRLRIGSQNLAVAASDVRAVSG-SGGW-----LEDTMPAGPRLFNRNQMEIAEAN 660
Query: 661 ---------------------SANNDASLPSLLKDIVVNPTMLINLLKISQQQQLAAELK 720
+ +NDASLPSLLKDIVVNPTML+NLLK+SQQQQLAAELK
Sbjct: 661 ATEKSNVTNNSGSGNECTPTVNNSNDASLPSLLKDIVVNPTMLLNLLKMSQQQQLAAELK 720
Query: 721 LKSSEPEKNAICPTSLSPCLGSTPLANTPAVTSGNLQQSGGTPSV-PSPPVATVDDLGKV 780
LKSSEPEKNAICPTSL+PC GS+PL N P TSG LQQS GTPS P V DDLGKV
Sbjct: 721 LKSSEPEKNAICPTSLNPCQGSSPLINAPVATSGILQQSAGTPSASPVVAVGRQDDLGKV 780
Query: 781 RMKPRDPRRILHGNSLQKVGSLGNEQFKGIVPAAPNTEGSRDIPNGHKQEGQGDLRLASS 840
RMKPRDPRR+LHGNSLQKVGSLGN+Q KG+VP A NTEGSRDIPNGHKQEGQGD +LASS
Sbjct: 781 RMKPRDPRRVLHGNSLQKVGSLGNDQLKGVVPTASNTEGSRDIPNGHKQEGQGDSKLASS 840
Query: 841 QPLLPDITRQFTKNLKNIADILSVSSPSASSQTSSSKPVKLDRMDTNAVGSSSSDSKVVT 900
Q +LPDI RQFT NLKNIADI+SV SP SS SSSKP VGSSS DSK VT
Sbjct: 841 QTILPDIGRQFTNNLKNIADIMSVPSPPTSSPNSSSKP----------VGSSSMDSKPVT 900
Query: 901 TATQAADMVSPSRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKLCLVL 960
TA QA DM + SRSQG WGDLEHLF+ YDDKQKAAIQRERARRIEEQKKMFAARKLCLVL
Sbjct: 901 TAFQAVDMAASSRSQGAWGDLEHLFDSYDDKQKAAIQRERARRIEEQKKMFAARKLCLVL 960
Query: 961 DLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEK 1020
DLDHTLLNSAKFVEVDPVHDEILRKKEEQDREK QRHLFRFPHMGMWTKLRPGVWNFLEK
Sbjct: 961 DLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEK 1020
Query: 1021 ASELYELHLYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGDERVPKSKDLEG 1080
ASELYELHLYTMGNKLYATEMAKVLDPKG LFAGRVISRGDDGDPLDGD+RVPKSKDLEG
Sbjct: 1021 ASELYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPLDGDDRVPKSKDLEG 1080
Query: 1081 VLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDG 1128
VLGMES VVIIDDS+RVWPHNK+NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDG
Sbjct: 1081 VLGMESGVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDG 1140
BLAST of Sgr021939 vs. ExPASy TrEMBL
Match:
A0A5A7TDW7 (Protein-serine/threonine phosphatase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold122G001260 PE=4 SV=1)
HSP 1 Score: 1679.5 bits (4348), Expect = 0.0e+00
Identity = 925/1263 (73.24%), Postives = 992/1263 (78.54%), Query Frame = 0
Query: 1 MGKDESVKIEDVEEGEISDTASVEEISEEDFNKLESGAK----VASKDSSNREARVWTMS 60
MGKDE +KIEDVEEGEISDTASVEEISEEDFNKL+S A V SKD SNRE RVWTMS
Sbjct: 1 MGKDEILKIEDVEEGEISDTASVEEISEEDFNKLDSSAPPKVVVPSKD-SNRE-RVWTMS 60
Query: 61 DLYKNYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVVEADPEEKSKRSSPSSFANGKEDG 120
+LYKNYP+M GYASGLYNLAWAQAVQNKPLNDIFV+EAD +EKSKRSS ++ N K+DG
Sbjct: 61 ELYKNYPSMRHGYASGLYNLAWAQAVQNKPLNDIFVMEADLDEKSKRSSSTTVGNAKDDG 120
Query: 121 -NSTKEGGKVVIDGSADEIDCDNVNVEREEGELEEGEIDMDSEFVEEVVDSKAMLSDCRD 180
N+TKE +V+ID S DE++CDN N E+EEGELEEGEIDMD+EFVEEV DSKAMLSD R+
Sbjct: 121 SNTTKEEDRVLIDDSGDEMNCDNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSRE 180
Query: 181 MDCDSEEFDLRKKELDDKVKLIQKTLDGVTIDAAQKSFQEVCSLLHSSIETSMQLLQEKV 240
MD +EFDL KELD+ +KLIQKTLDGVTIDAAQKSFQEVCS LHSSIET ++L+Q KV
Sbjct: 181 MDIHGQEFDLENKELDELLKLIQKTLDGVTIDAAQKSFQEVCSQLHSSIETFVELVQGKV 240
Query: 241 VPRKDALIQRLYAALRIINSV-------------------ASYVKTCNPPLFSPEQIKSV 300
VPRKDAL+QRLYAA R+INSV SYVK C+PPLFSPEQIKSV
Sbjct: 241 VPRKDALVQRLYAAFRLINSVFCSMNLNEKEEHKEQLSRLLSYVKNCDPPLFSPEQIKSV 300
Query: 301 EVKMPSTDYLP---SMRASAKE---------STFDFV-------------NQVAFGLHAC 360
EVKMPSTDYL SM+ SAKE DF N++A
Sbjct: 301 EVKMPSTDYLDQLLSMKGSAKEVEIHIPNGVKVKDFYSAYTDASSQLTPSNKLASDSITF 360
Query: 361 WVVGKNNLNILSDGLQSGVSNIKGRGALLPLLDLHKDHDVDSLPSPTREAPSFFPVQKSG 420
V GKNN NILS+GLQSGVS+IKGRG LLPLLDLHKDHD DSLPSPTREAP+ F VQKSG
Sbjct: 361 GVKGKNNPNILSEGLQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSG 420
Query: 421 NAPAKVALAMDGPRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDDGAGDV 480
NAP K+A A+DGPRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEE DG GD+
Sbjct: 421 NAPTKMAFAVDGPRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEE-HDGGGDI 480
Query: 481 GGSIKG----------------------------------------LISPINVAPPSCVS 540
GG + LISP+NVAPPS VS
Sbjct: 481 GGEVSSSSIIRSLKSSNASKPGQKSNSASNVSTGLFPNMDSSSTRVLISPLNVAPPSSVS 540
Query: 541 NPTVKPLAKSRDPRLRIVNSDA-------------------KSAATINLRKQKMDEEPNI 600
NPTVKPLAKSRDPRLRIVNSDA +SAAT++LRKQKMD EPN
Sbjct: 541 NPTVKPLAKSRDPRLRIVNSDASAMDLNPRTITSVQSSSILESAATLHLRKQKMDGEPNT 600
Query: 601 DGPETKRQRIGSQNLAVASSDLGSVS--------EVVAGWKILCQLDPDFQVGIRWSANN 660
DGPE KR RIGSQNLAVA+SD+ +VS + AG ++ + + N
Sbjct: 601 DGPEMKRPRIGSQNLAVAASDVRAVSGSGGWLEDTIPAGPRLFNRNQMEIAEANATEKTN 660
Query: 661 -------------------DASLPSLLKDIVVNPTMLINLLKISQQQQLAAELKLKSSEP 720
DASLPSLLKDIVVNPTML+NLLK+SQQQQLAAELKLKSSEP
Sbjct: 661 VTNNSGSENECTPTINNSKDASLPSLLKDIVVNPTMLLNLLKMSQQQQLAAELKLKSSEP 720
Query: 721 EKNAICPTSLSPCLGSTPLANTPAVTSGNLQQSGGTPSV-PSPPVATVDDLGKVRMKPRD 780
EKNAICPTSL+PC GS+PL N PAVTSG LQQS GTPS P V DDLGKVRMKPRD
Sbjct: 721 EKNAICPTSLNPCQGSSPLINAPAVTSGILQQSAGTPSASPVVAVGRQDDLGKVRMKPRD 780
Query: 781 PRRILHGNSLQKVGSLGNEQFKGIVPAAPNTEGSRDIPNGHKQEGQGDLRLASSQPLLPD 840
PRR+LHGNSLQKVGSLGN+Q KGIVP NTEGSRDI NGHKQ+GQGD +LASSQ LLPD
Sbjct: 781 PRRVLHGNSLQKVGSLGNDQLKGIVPTTSNTEGSRDILNGHKQDGQGDSKLASSQTLLPD 840
Query: 841 ITRQFTKNLKNIADILSVSSPSASSQTSSSKPVKLDRMDTNAVGSSSSDSKVVTTATQAA 900
I RQFT NLKNIADI+SV SP SSQ SSSKP VGSSS DSK VTTA+QA
Sbjct: 841 IGRQFTNNLKNIADIMSVPSPPTSSQNSSSKP----------VGSSSMDSKPVTTASQAV 900
Query: 901 DMVSPSRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTL 960
DM +PSRSQG WGDLEHLF+ YDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTL
Sbjct: 901 DMAAPSRSQGAWGDLEHLFDSYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTL 960
Query: 961 LNSAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYE 1020
LNSAKFVEVDPVHDEILRKKEEQDREK QRHLFRFPHMGMWTKLRPGVWNFLEKASELYE
Sbjct: 961 LNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYE 1020
Query: 1021 LHLYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMES 1080
LHLYTMGNKLYATEMAKVLDPKG LFAGRVISRGDDGDPLDGD+RVPKSKDLEGVLGMES
Sbjct: 1021 LHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPLDGDDRVPKSKDLEGVLGMES 1080
Query: 1081 AVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSL 1128
VVIIDDS+RVWPHNK+NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSL
Sbjct: 1081 GVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSL 1140
BLAST of Sgr021939 vs. ExPASy TrEMBL
Match:
A0A1S3CB96 (Protein-serine/threonine phosphatase OS=Cucumis melo OX=3656 GN=LOC103498885 PE=4 SV=1)
HSP 1 Score: 1677.9 bits (4344), Expect = 0.0e+00
Identity = 924/1263 (73.16%), Postives = 991/1263 (78.46%), Query Frame = 0
Query: 1 MGKDESVKIEDVEEGEISDTASVEEISEEDFNKLESGAK----VASKDSSNREARVWTMS 60
MGKDE +KIEDVEEGEISDTASVEEISEEDFNKL+S A V SKD SNRE RVWTMS
Sbjct: 1 MGKDEILKIEDVEEGEISDTASVEEISEEDFNKLDSSAPPKVVVPSKD-SNRE-RVWTMS 60
Query: 61 DLYKNYPTMCRGYASGLYNLAWAQAVQNKPLNDIFVVEADPEEKSKRSSPSSFANGKEDG 120
+LYKNYP+M GYASGLYNLAWAQAVQNKPLNDIFV+EAD +EKSKRSS ++ N K+DG
Sbjct: 61 ELYKNYPSMRHGYASGLYNLAWAQAVQNKPLNDIFVMEADLDEKSKRSSSTTVGNAKDDG 120
Query: 121 -NSTKEGGKVVIDGSADEIDCDNVNVEREEGELEEGEIDMDSEFVEEVVDSKAMLSDCRD 180
N+TKE +V+ID S DE++CDN N E+EEGELEEGEIDMD+EFVEEV DSKAMLSD R+
Sbjct: 121 SNTTKEEDRVLIDDSGDEMNCDNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSRE 180
Query: 181 MDCDSEEFDLRKKELDDKVKLIQKTLDGVTIDAAQKSFQEVCSLLHSSIETSMQLLQEKV 240
MD +EFDL KELD+ +KLIQKTLDGVTIDAAQKSFQEVCS LHSSIET ++L+Q KV
Sbjct: 181 MDIHGQEFDLENKELDELLKLIQKTLDGVTIDAAQKSFQEVCSQLHSSIETFVELVQGKV 240
Query: 241 VPRKDALIQRLYAALRIINSV-------------------ASYVKTCNPPLFSPEQIKSV 300
VPRKDAL+QRLYAA R+INSV SYVK C+PPLFSPEQIKSV
Sbjct: 241 VPRKDALVQRLYAAFRLINSVFCSMNLNEKEEHKEQLSRLLSYVKNCDPPLFSPEQIKSV 300
Query: 301 EVKMPSTDYLP---SMRASAKE---------STFDFV-------------NQVAFGLHAC 360
EVKMPSTDYL SM+ S KE DF N++A
Sbjct: 301 EVKMPSTDYLDQLLSMKGSVKEVEIHIPNGVKVKDFYSAYTDASSQLTPSNKLASDSITF 360
Query: 361 WVVGKNNLNILSDGLQSGVSNIKGRGALLPLLDLHKDHDVDSLPSPTREAPSFFPVQKSG 420
V GKNN NILS+GLQSGVS+IKGRG LLPLLDLHKDHD DSLPSPTREAP+ F VQKSG
Sbjct: 361 GVKGKNNPNILSEGLQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSG 420
Query: 421 NAPAKVALAMDGPRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDDGAGDV 480
NAP K+A A+DGPRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEE DG GD+
Sbjct: 421 NAPTKMAFAVDGPRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEE-HDGGGDI 480
Query: 481 GGSIKG----------------------------------------LISPINVAPPSCVS 540
GG + LISP+NVAPPS VS
Sbjct: 481 GGEVSSSSIIRSLKSSNASKPGQKSNFASNVSTGLFPNMDSSSTRVLISPLNVAPPSSVS 540
Query: 541 NPTVKPLAKSRDPRLRIVNSDA-------------------KSAATINLRKQKMDEEPNI 600
NPTVKPLAKSRDPRLRIVNSDA +SAAT++LRKQKMD EPN
Sbjct: 541 NPTVKPLAKSRDPRLRIVNSDASAMDLNPRTMTSVQSSSILESAATLHLRKQKMDGEPNT 600
Query: 601 DGPETKRQRIGSQNLAVASSDLGSVS--------EVVAGWKILCQLDPDFQVGIRWSANN 660
DGPE KR RIGSQNLAVA+SD+ +VS + AG ++ + + N
Sbjct: 601 DGPEMKRPRIGSQNLAVAASDVRAVSGSGGWLEDTIPAGPRLFNRNQMEIAEANATEKTN 660
Query: 661 -------------------DASLPSLLKDIVVNPTMLINLLKISQQQQLAAELKLKSSEP 720
DASLPSLLKDIVVNPTML+NLLK+SQQQQLAAELKLKSSEP
Sbjct: 661 VTNNSGSENECTPTINNSKDASLPSLLKDIVVNPTMLLNLLKMSQQQQLAAELKLKSSEP 720
Query: 721 EKNAICPTSLSPCLGSTPLANTPAVTSGNLQQSGGTPSV-PSPPVATVDDLGKVRMKPRD 780
EKNAICPTSL+PC GS+PL N PAVTSG LQQS GTPS P V DDLGKVRMKPRD
Sbjct: 721 EKNAICPTSLNPCQGSSPLINAPAVTSGILQQSAGTPSASPVVAVGRQDDLGKVRMKPRD 780
Query: 781 PRRILHGNSLQKVGSLGNEQFKGIVPAAPNTEGSRDIPNGHKQEGQGDLRLASSQPLLPD 840
PRR+LHGNSLQKVGSLGN+Q KGIVP NTEGSRDI NGHKQ+GQGD +LASSQ LLPD
Sbjct: 781 PRRVLHGNSLQKVGSLGNDQLKGIVPTTSNTEGSRDILNGHKQDGQGDSKLASSQTLLPD 840
Query: 841 ITRQFTKNLKNIADILSVSSPSASSQTSSSKPVKLDRMDTNAVGSSSSDSKVVTTATQAA 900
I RQFT NLKNIADI+SV SP SSQ SSSKP VGSSS DSK VTTA+QA
Sbjct: 841 IGRQFTNNLKNIADIMSVPSPPTSSQNSSSKP----------VGSSSMDSKPVTTASQAV 900
Query: 901 DMVSPSRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTL 960
DM +PSRSQG WGDLEHLF+ YDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTL
Sbjct: 901 DMAAPSRSQGAWGDLEHLFDSYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTL 960
Query: 961 LNSAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYE 1020
LNSAKFVEVDPVHDEILRKKEEQDREK QRHLFRFPHMGMWTKLRPGVWNFLEKASELYE
Sbjct: 961 LNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYE 1020
Query: 1021 LHLYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMES 1080
LHLYTMGNKLYATEMAKVLDPKG LFAGRVISRGDDGDPLDGD+RVPKSKDLEGVLGMES
Sbjct: 1021 LHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPLDGDDRVPKSKDLEGVLGMES 1080
Query: 1081 AVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSL 1128
VVIIDDS+RVWPHNK+NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSL
Sbjct: 1081 GVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSL 1140
BLAST of Sgr021939 vs. TAIR 10
Match:
AT2G33540.1 (C-terminal domain phosphatase-like 3 )
HSP 1 Score: 909.4 bits (2349), Expect = 2.8e-264
Identity = 581/1273 (45.64%), Postives = 746/1273 (58.60%), Query Frame = 0
Query: 1 MGKDESVKI-EDVEEGEISDTASVE-EISEEDFNK-----------LESGAKVASKDSSN 60
MG DE++ + DVEEGEI D+ + E E+ + + +G + SN
Sbjct: 15 MGNDENLMVMVDVEEGEIPDSVNTEIEVKHKSTTTTADVGGDVDVGVVAGGRGGGGGGSN 74
Query: 61 REARVWTMSDLYKNYPTMCRGYA-SGLYNLAWAQAVQNKPLNDIFVVEADPEEKSKRSSP 120
+RVWTM +L YP R YA SGL NLAWA+AVQNKP N+ V++ +P
Sbjct: 75 GNSRVWTMEELISQYPAY-RPYANSGLSNLAWARAVQNKPFNEGLVMDYEP--------- 134
Query: 121 SSFANGKEDGNSTKEGGKVVIDGSADEIDCDNVNVEREEGELEEGEIDM-----DSEFVE 180
+E K+VI+ S D E+EEGELEEGEID+ D VE
Sbjct: 135 -------------RESDKIVIEDSDD---------EKEEGELEEGEIDLVDNASDDNLVE 194
Query: 181 EVVDSKAMLSDCRDMDCDSEEFDLRKKELDDKVKLIQKTLDGVTIDAAQKSFQEVCSLLH 240
+ +S ++S D ++ L++++L+ KVKLI+ L+ ++ AQ F+ VCS +
Sbjct: 195 KDTESVVLIS----ADKVEDDRILKERDLEKKVKLIRGVLESTSLVEAQTGFEGVCSRIL 254
Query: 241 SSIETSMQLLQEK-VVPRKDALIQRLYAALRIINSVASYVKTCNPPLFSPEQIKSVEVKM 300
++E+ +L+ + P++D L+Q +A+L+ IN V C+ S E+ K ++
Sbjct: 255 GALESLRELVSDNDDFPKRDTLVQLSFASLQTINYV-----FCSMNNISKERNKETMSRL 314
Query: 301 PS--TDYLPSMRASAKESTFDFVNQ----VAFGLHACWVVGKNNLN-------------- 360
+ D+ + +++ + +NQ A + A + N+N
Sbjct: 315 LTLVNDHFSQFLSFNQKNEIETMNQDLSRSAIAVFA-GTSSEENVNQMTQPSNGDSFLAK 374
Query: 361 -ILSDGLQSGVSNIKGRGALLPLLDLHKDHDVDSLPSPTREAPSFFPVQ------KSGNA 420
+ S+ G + ++ R +LPLLDLHKDHD DSLPSPTRE PV + G
Sbjct: 375 KLTSESTHRGAAYLRSRLPMLPLLDLHKDHDADSLPSPTRETTPSLPVNGRHTMVRPGFP 434
Query: 421 PAKVALAMDGPRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEECDDGAGDVGG 480
+ + +G + + YE+DA KAVSTYQQKFG +S D LPSPTPS E +DG GDVGG
Sbjct: 435 VGRESQTTEGAKVYSYESDARKAVSTYQQKFGLNSVFKTDDLPSPTPSGEPNDGNGDVGG 494
Query: 481 SIKGLI-------------------------------SPINVAPP----------SCVSN 540
+ + S + PP S+
Sbjct: 495 EVSSSVVKSSNPGSHLIYGQDVPLPSNFNSRSMPVANSVSSTVPPHHLSIHAISAPTASD 554
Query: 541 PTVKPLAKSRDPRLRIVNSDAK--------------------SAATINLRKQKMDEEPNI 600
TVKP AKSRDPRLR+ DA SA +N RKQK +E I
Sbjct: 555 QTVKPSAKSRDPRLRLAKPDAANVTIYSYSSGDARNLSKVELSADLVNPRKQKAADEFLI 614
Query: 601 DGPETKRQRIGSQNLAVASSDLGSVSEVVAGWKILCQLDP-------------------- 660
DGP KRQ+ + A+ G + + + + + P
Sbjct: 615 DGPAWKRQK-SDTDAPKAAGTGGWLEDTESSGLLKLESKPRLIENGVTSMTSSVMPTSAV 674
Query: 661 DFQVGIRWSANNDASLPSLLKDIVVNPTMLINLLKISQQQQLAAELKLKSSEPEKNAICP 720
+R ++ + ASL SLLKDI VNPTML+NLLK+ ++Q++ + K +P + A P
Sbjct: 675 SVSQKVRTASTDTASLQSLLKDIAVNPTMLLNLLKMGERQKVPEKAIQKPMDPRRAAQLP 734
Query: 721 TSLSPCLGSTPLA--NTPAVTSGNLQQSGGTPSVPSPPVATVDDLGKVRMKPRDPRRILH 780
S STPL+ + A+ + +L S + P A + G +RMKPRDPRRILH
Sbjct: 735 GSSVQPGVSTPLSIPASNALAANSLNSGVLQDSSQNAPAA---ESGSIRMKPRDPRRILH 794
Query: 781 GNSLQKVGSLGNEQFKGIVPA----------APNTEGSRDIPNGHKQEGQGDLRLASSQP 840
G++LQ+ S +Q K P+ A + E + G ++ S
Sbjct: 795 GSTLQRTDSSMEKQTKVNDPSTLGTLTMKGKAEDLETPPQLDPRQNISQNGTSKMKISGE 854
Query: 841 LL----PDITRQFTKNLKNIADILSVSSPSASSQTS-SSKPVKLDR-MDTNAVGSSSSDS 900
LL PD + QFTKNLK+IAD++ VS + S S +K +R + N ++ D
Sbjct: 855 LLSGKTPDFSTQFTKNLKSIADMVVVSQQLGNPPASMHSVQLKTERDVKHNPSNPNAQDE 914
Query: 901 KVVTTATQAADMVSPSRSQGTWGDLEHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKL 960
V +A P+RS +WGD+EHLFEGYDD Q+ AIQRER RR+EEQ KMFA++KL
Sbjct: 915 DVSVSAASVTAAAGPTRSMNSWGDVEHLFEGYDDIQRVAIQRERVRRLEEQNKMFASQKL 974
Query: 961 CLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWN 1020
LVLD+DHTLLNSAKF EV+ H+EILRKKEEQDREK RHLFRF HMGMWTKLRPG+WN
Sbjct: 975 SLVLDIDHTLLNSAKFNEVESRHEEILRKKEEQDREKPYRHLFRFLHMGMWTKLRPGIWN 1034
Query: 1021 FLEKASELYELHLYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGDERVPKSK 1080
FLEKAS+LYELHLYTMGNKLYATEMAK+LDPKG LF GRVIS+GDDGDPLDGDERVPKSK
Sbjct: 1035 FLEKASKLYELHLYTMGNKLYATEMAKLLDPKGVLFNGRVISKGDDGDPLDGDERVPKSK 1094
Query: 1081 DLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDER 1128
DLEGV+GMES+VVIIDDSVRVWP +K+NLI VERY YFPCSRRQFGLLGPSLLE+D DE
Sbjct: 1095 DLEGVMGMESSVVIIDDSVRVWPQHKMNLIAVERYLYFPCSRRQFGLLGPSLLELDRDEV 1154
BLAST of Sgr021939 vs. TAIR 10
Match:
AT5G58003.1 (C-terminal domain phosphatase-like 4 )
HSP 1 Score: 266.2 bits (679), Expect = 1.2e-70
Identity = 150/321 (46.73%), Postives = 206/321 (64.17%), Query Frame = 0
Query: 812 RKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEE--QDREKVQ-RHLFRFPHMGMWTKL 871
RKL LVLDLDHTLLN+ ++ P +E L+ QD V LF M M TKL
Sbjct: 121 RKLYLVLDLDHTLLNTTILRDLKP-EEEYLKSHTHSLQDGCNVSGGSLFLLEFMQMMTKL 180
Query: 872 RPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGDE 931
RP V +FL++ASE++ +++YTMG++ YA +MAK+LDPKG F RVISR DDG
Sbjct: 181 RPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVISR-DDG------- 240
Query: 932 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLE 991
V K L+ VLG ESAV+I+DD+ WP +K NLIV+ERY +F S RQF SL E
Sbjct: 241 TVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFDHRYKSLSE 300
Query: 992 IDHDERPEDGTLASSLAVIQRIHQTFFSHPVLDGV---DVRNILASEQQKILAGCRIVFS 1051
+ DE DG LA+ L V+++ H FF + V +G+ DVR +L +++IL GC+IVFS
Sbjct: 301 LKSDESEPDGALATVLKVLKQAHALFFEN-VDEGISNRDVRLMLKQVRKEILKGCKIVFS 360
Query: 1052 RVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSAGRFVVHP 1111
RVFP +A P HPLW+ AE+ GA C ++D VTHVVA +GT+K WA+ ++VVH
Sbjct: 361 RVFPT-KAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAVREKKYVVHR 420
Query: 1112 GWVEASALLYRRANEQDFAIK 1127
GW++A+ L+ + E++F ++
Sbjct: 421 GWIDAANYLWMKQPEENFGLE 430
BLAST of Sgr021939 vs. TAIR 10
Match:
AT2G04930.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )
HSP 1 Score: 132.9 bits (333), Expect = 1.6e-30
Identity = 82/225 (36.44%), Postives = 129/225 (57.33%), Query Frame = 0
Query: 812 RKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKVQRHLFRFPHMG----MWTK 871
+KL LVLDLDHTLL+S + ++++ + RE L++F +G K
Sbjct: 65 KKLHLVLDLDHTLLHSKLVSNLSQAERYLIQEASSRTRE----DLWKFRPIGHPIDRLIK 124
Query: 872 LRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGD 931
LRP V +FL++A+E++ + +YTMG+++YA + +++DPK F RVI++ +
Sbjct: 125 LRPFVRDFLKEANEMFTMFVYTMGSRIYAKAILEMIDPKKLYFGNRVITKDES------- 184
Query: 932 ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLL 991
P+ K L VL E VVI+DD+ +WPH+K NLI + +Y YF R+ GL S
Sbjct: 185 ---PRMKTLNLVLAEERGVVIVDDTRDIWPHHKNNLIQIRKYKYF----RRSGLDSNSYS 244
Query: 992 EIDHDERPEDGTLASSLAVIQRIHQTFF---SHPVLDGVDVRNIL 1030
E DE DG LA+ L +++ +H+ FF VL+ +DVR++L
Sbjct: 245 EKKTDEGENDGGLANVLKLLREVHRRFFIVEVEEVLESMDVRSLL 271
BLAST of Sgr021939 vs. TAIR 10
Match:
AT1G20320.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )
HSP 1 Score: 122.9 bits (307), Expect = 1.7e-27
Identity = 85/234 (36.32%), Postives = 124/234 (52.99%), Query Frame = 0
Query: 812 RKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKE-EQDREKVQRHLFRFPHMGMWTKLRP 871
RKL LVLDLDHTLL+S + +L + + +D + R M KLRP
Sbjct: 75 RKLHLVLDLDHTLLHSIMISRLSEGEKYLLGESDFREDLWTLDRE--------MLIKLRP 134
Query: 872 GVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGGLFAGRVISRGDDGDPLDGDERV 931
V FL++A+E++ +++YTMGN+ YA + K +DPK F RVI+R + G
Sbjct: 135 FVHEFLKEANEIFSMYVYTMGNRDYAQAVLKWIDPKKVYFGDRVITRDESG--------- 194
Query: 932 PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEID 991
SK L+ VL E VVI+DD+ VWP ++ NL+ + +Y+YF S E
Sbjct: 195 -FSKTLDLVLADECGVVIVDDTRHVWPDHERNLLQITKYSYF--RDYSHDKESKSYAEEK 254
Query: 992 HDERPEDGTLASSLAVIQRIHQTFFSHPV--LDGVDVRNILASEQQKILAGCRI 1043
DE G+LA+ L V++ +HQ FF + LD DVR +L ++Q I +I
Sbjct: 255 RDESRNQGSLANVLKVLKDVHQEFFRGGIEELDSKDVRLLL--QEQHIAVSIKI 286
BLAST of Sgr021939 vs. TAIR 10
Match:
AT5G54210.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )
HSP 1 Score: 120.9 bits (302), Expect = 6.4e-27
Identity = 101/315 (32.06%), Postives = 156/315 (49.52%), Query Frame = 0
Query: 720 LSVSSPSASSQTSSSKPVKLDRMDTNAVGSSSSDSKVVTTATQAADMVSPSRSQGTWGDL 779
+SV + S + S K K+D N+ S++ D V + R +G
Sbjct: 1 MSVMQQNLSVEPKSKKR-KIDSEINNSSSSTNCDHFFVRYGICCNCRSNVERHRGR--SF 60
Query: 780 EHLFEGYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDE 839
++L +G Q + I +R+ Q F +KL LVLDLDHTLL++ V + + E
Sbjct: 61 DYLVDGL---QLSDIAVTVTKRVTTQITCFNDKKLHLVLDLDHTLLHT---VMISNLTKE 120
Query: 840 ILRKKEEQDREKVQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEM 899
EE+D + R L KLRP V FL++A++++ +++YTMG++ YA +
Sbjct: 121 ETYLIEEEDSREDLRRLNGGYSSEFLIKLRPFVHEFLKEANKMFSMYVYTMGDRDYAMNV 180
Query: 900 AKVLDPKGGLFAGRVISRGDDGDPLDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 959
++DP+ F RVI+R + P K L+ VL E VVI+DD+ VWP +
Sbjct: 181 LNLIDPEKVYFGDRVITRNES----------PYIKTLDLVLADECGVVIVDDTPHVWPDH 240
Query: 960 KLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIQRIHQTFFSHPV 1019
K NL+ + +Y YF R S E DE DG+LA+ L VI+++++ FFS V
Sbjct: 241 KRNLLEITKYNYFSDKTRHDVKYTKSYAEEKRDESRNDGSLANVLKVIKQVYEGFFSGGV 296
Query: 1020 -----LDGVDVRNIL 1030
+D DVR +L
Sbjct: 301 EKDLDIDSKDVRLLL 296
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022148889.1 | 0.0e+00 | 77.13 | RNA polymerase II C-terminal domain phosphatase-like 3 [Momordica charantia] | [more] |
XP_022960085.1 | 0.0e+00 | 73.28 | RNA polymerase II C-terminal domain phosphatase-like 3 [Cucurbita moschata] | [more] |
XP_023514332.1 | 0.0e+00 | 72.64 | RNA polymerase II C-terminal domain phosphatase-like 3 [Cucurbita pepo subsp. pe... | [more] |
KAG6592819.1 | 0.0e+00 | 72.56 | RNA polymerase II C-terminal domain phosphatase-like 3, partial [Cucurbita argyr... | [more] |
XP_011656791.1 | 0.0e+00 | 73.21 | RNA polymerase II C-terminal domain phosphatase-like 3 [Cucumis sativus] >KGN464... | [more] |
Match Name | E-value | Identity | Description | |
Q8LL04 | 3.9e-263 | 45.64 | RNA polymerase II C-terminal domain phosphatase-like 3 OS=Arabidopsis thaliana O... | [more] |
Q00IB6 | 1.7e-69 | 46.73 | RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana O... | [more] |
Q95QG8 | 1.4e-31 | 31.36 | RNA polymerase II subunit A C-terminal domain phosphatase OS=Caenorhabditis eleg... | [more] |
F4JCB2 | 1.2e-25 | 33.33 | RNA polymerase II C-terminal domain phosphatase-like 5 OS=Arabidopsis thaliana O... | [more] |
Q9P376 | 3.5e-22 | 25.37 | RNA polymerase II subunit A C-terminal domain phosphatase OS=Schizosaccharomyces... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1D5D6 | 0.0e+00 | 77.13 | Protein-serine/threonine phosphatase OS=Momordica charantia OX=3673 GN=LOC111017... | [more] |
A0A6J1H839 | 0.0e+00 | 73.28 | Protein-serine/threonine phosphatase OS=Cucurbita moschata OX=3662 GN=LOC1114609... | [more] |
A0A0A0KAB9 | 0.0e+00 | 73.21 | Protein-serine/threonine phosphatase OS=Cucumis sativus OX=3659 GN=Csa_6G091910 ... | [more] |
A0A5A7TDW7 | 0.0e+00 | 73.24 | Protein-serine/threonine phosphatase OS=Cucumis melo var. makuwa OX=1194695 GN=E... | [more] |
A0A1S3CB96 | 0.0e+00 | 73.16 | Protein-serine/threonine phosphatase OS=Cucumis melo OX=3656 GN=LOC103498885 PE=... | [more] |
Match Name | E-value | Identity | Description | |
AT2G33540.1 | 2.8e-264 | 45.64 | C-terminal domain phosphatase-like 3 | [more] |
AT5G58003.1 | 1.2e-70 | 46.73 | C-terminal domain phosphatase-like 4 | [more] |
AT2G04930.1 | 1.6e-30 | 36.44 | Haloacid dehalogenase-like hydrolase (HAD) superfamily protein | [more] |
AT1G20320.1 | 1.7e-27 | 36.32 | Haloacid dehalogenase-like hydrolase (HAD) superfamily protein | [more] |
AT5G54210.1 | 6.4e-27 | 32.06 | Haloacid dehalogenase-like hydrolase (HAD) superfamily protein | [more] |