BLAST of Cp4.1LG04g01060 vs. Swiss-Prot
Match:
C3H19_ARATH (Zinc finger CCCH domain-containing protein 19 OS=Arabidopsis thaliana GN=NERD PE=1 SV=3)
HSP 1 Score: 929.1 bits (2400), Expect = 7.2e-269
Identity = 715/1793 (39.88%), Positives = 953/1793 (53.15%), Query Frame = 1
Query: 4 EENDSSKHDQPSSPLLSVDDGNDLD-VKCHTHRELHSNEEQHCLFQSAINELEFPSNSSV 63
E + S ++PSS LSV + N +D + +RE+ EQ E+E S
Sbjct: 151 ENKEVSMEEEPSSHELSVCEVNGVDSLNDEENREVG---EQIVCGSMGGEEIESDLESKK 210
Query: 64 ESLQPSDAIRGDESLVAETCLEVEETEIAGVK--ACRNGIEDMGEDSVKLEVEPDIAAMG 123
E + D I +E A+ V EI K AC G ++ L D + G
Sbjct: 211 EKV---DVI--EEETTAQAASLVNAIEIPDDKEVACVAGFTEISSQDKGL----DESGNG 270
Query: 124 LLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNLPDNTVG 183
L E +++ + + A+ G EMD+ +++ E V D+T
Sbjct: 271 FLDEEPVKELQIGEGAKDLTDGDAKEGVDVTEDEMDIQVLKKSKEEEKV------DSTTE 330
Query: 184 CGETDTC---LSDVLAELAETTPFVHGVDTTDVANLVERKEVEENADDPKDSKDIEVAKQ 243
E +T + DV E+++ T V T KE DD K+ D +
Sbjct: 331 L-EIETMRLEVHDVATEMSDKTVISSAVVTQFTGETSNDKETV--MDDVKEDVDKD---- 390
Query: 244 ETFSMEDGKLGVPVQLVEKSELKQSLVDGAV--VEEGRTENLADRTGETLKMENDSSKTD 303
E GK + + + E +E + V+ V +EG A+ G+T+ +E +
Sbjct: 391 ----SEAGK-SLDIHVPEATEEVDTDVNYGVGIEKEGDGVGGAEEAGQTVDLEEIREENQ 450
Query: 304 EVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLADETPKIKGVHV 363
E L+ ++D E + EV ++D+ + T +LA++ + V
Sbjct: 451 E--LSKELAQVD-----ETKISEMSEVTETMIKDEDQEKDDNMT--DLAEDVENHRDSSV 510
Query: 364 TDDNIEVLKIENVEDREAGVQGLGVADESAE--VGKIENLVDETAEAENVTNYTAESMEN 423
D IE E ED E +GV + E +GK++ E T E E
Sbjct: 511 AD--IE----EGREDHE----DMGVTETQKETVLGKVDRTKIAEVSEETDTRIEDEDQEK 570
Query: 424 LDDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTE 483
D+ T E++ ++ AD ++EG S+E MT ++ A+E E
Sbjct: 571 DDEMTDVAEDVKTHGDSSVAD-----IEEGRESQEE---MTETQEDSVMADEEPE----- 630
Query: 484 EVDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHP 543
EV+E +K S+G KRKRG+N+K + KK EEDVCF+CFDGGDLVLCDRRGC KAYHP
Sbjct: 631 EVEEENK-SAGGKRKRGRNTKTVKG--TGKKKEEDVCFMCFDGGDLVLCDRRGCTKAYHP 690
Query: 544 SCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGF 603
SC++RDEAFF+ KG+WNCGWHLCS CEKTA Y+CYTC FSLCKGC K+AV C+RGNKG
Sbjct: 691 SCVDRDEAFFQTKGKWNCGWHLCSKCEKTATYLCYTCMFSLCKGCAKDAVFFCIRGNKGL 750
Query: 604 CEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNP 663
CE CM V LIE+ +Q E Q+DFNDKTSWEYLFK+YW DLK LSL+ +EL AK P
Sbjct: 751 CETCMETVKLIERKQQ-EKEPAQLDFNDKTSWEYLFKDYWIDLKTQLSLSPEELDQAKRP 810
Query: 664 WKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSM 723
KG ET S+ + E D DGGSD D S KKRK + RSKS + E
Sbjct: 811 LKGHETNASKQGTASET-DYVTDGGSDSD-------SSPKKRKTRSRSKSGSAEK----- 870
Query: 724 PIIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRR 783
I+ +D +EWASKELL+ V+HM+ GDR+ L +VQ LLL YIKR LRDPRR
Sbjct: 871 -ILSSGDKNLSDETMEWASKELLDLVVHMRRGDRSFLPMLEVQTLLLAYIKRYNLRDPRR 930
Query: 784 KSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTES-SQLEGDG 843
KSQ+ICDSRL+NLFGK VGHFEML LL+SHFL +E Q +D+QG + DTE + ++ D
Sbjct: 931 KSQVICDSRLQNLFGKSHVGHFEMLNLLDSHFLKKEQNQADDIQGDIVDTEEPNHVDVDE 990
Query: 844 YTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESF 903
D K+ K+KKR+ RKK ++G QSNLDD+AA+D+HNINLIYLRR+LVE L+ED +F
Sbjct: 991 NLDHPVKSGKDKKRKTRKKNVRKGRQSNLDDFAAVDMHNINLIYLRRSLVEDLLEDSTAF 1050
Query: 904 HEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVI 963
EKV +FVR+RISGN QKQDLYRLVQVVGTSKA EPYKVGKK TD +LEILNL+KTEVI
Sbjct: 1051 EEKVASAFVRLRISGN-QKQDLYRLVQVVGTSKAPEPYKVGKKTTDYVLEILNLDKTEVI 1110
Query: 964 SIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSH 1023
SIDIISNQ+FTE+ECKRL+QSIKCG++NRLTVGD+QE+A++LQ+ RVK+ +E EI+R SH
Sbjct: 1111 SIDIISNQDFTEDECKRLKQSIKCGLINRLTVGDIQEKAIALQEVRVKNLLEAEILRFSH 1170
Query: 1024 LRDRASEKGRRKEYPFY----------NIMECVEKLQLLKTPEERQRRLEELPGIHTDPN 1083
LRDRAS+ GRRKEYP+ + ECVEKLQLLK+PEERQRRLEE+P IH DP
Sbjct: 1171 LRDRASDMGRRKEYPYLLKLSNSLTMLTLRECVEKLQLLKSPEERQRRLEEIPEIHADPK 1230
Query: 1084 MDPSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSST- 1143
MDP ESEDEDE ++K +E R S F+RR R+P+SP K G + N+SW+GT N+S+T
Sbjct: 1231 MDPDCESEDEDEKEEKEKEKQLRPRSSSFNRRGRDPISPRKGGFSSNESWTGTSNYSNTS 1290
Query: 1144 -NRDLSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTA 1203
NR+LSR+ SG+G + +G+ S + ++++ W+ RE +V+ +K + E A
Sbjct: 1291 ANRELSRSYSGRGSTGRGDYLGSSDDKVSDSMWTSAREREVQPSLGSEKPRSVSIPETPA 1350
Query: 1204 RNALSGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQL 1263
R++ + A EL S S A P+V +Q N++EKIW Y+DPSGKVQGPFSM QL
Sbjct: 1351 RSSRAIAPPELSPRIASEISMAPPAV-VSQPVPKSNDSEKIWHYKDPSGKVQGPFSMAQL 1410
Query: 1264 RKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAKP 1323
RKW+NTGYFPA L +W+A++ DS+LLTD LA K
Sbjct: 1411 RKWNNTGYFPAKLEIWKANESPLDSVLLTDALA---------------------GLFQKQ 1470
Query: 1324 QGATVQSGMDVQNTGTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGDR 1383
A S M Q S GQS+ +S+ + A +IE+PR S D
Sbjct: 1471 TQAVDNSYMKAQVAAFS---------GQSS----QSEPNLGFAARIAPTTIEIPRNSQDT 1530
Query: 1384 WSSDHGNKDFTSLPSPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSH 1443
WS SLPSPTP+ Q+ TP A S + + ++
Sbjct: 1531 WSQGG------SLPSPTPN---------QITTPTAKRRNFESRWSPTKPSPQSANQSMNY 1590
Query: 1444 SGLNAAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSR 1503
S + + T I + N ++ T P D ++S N + + S
Sbjct: 1591 SVAQSGQSQTSRIDIPVVVNS------AGALQPQTYPIPTPDPINVSVNHSATLHSPTPA 1650
Query: 1504 NPPIETQTVETNIS-SSMPPGQTLHRRWGEMSPAQNAATASFSTPGLTNFSSSEPWRSMP 1563
+++T+ S+ P Q +G SP+ S S PG F S+ W+
Sbjct: 1651 GGKQSWGSMQTDHGGSNTPSSQNNSTSYGTPSPS---VLPSQSQPG---FPPSDSWKVA- 1710
Query: 1564 PIPSNPPHIQSSTPPNIPWGMGAPEGQSTVPRPGLES-QNHSWGPMPSGNPNMTWAPSAP 1623
+PS P + WGM +P + QN SWG + NPNM W A
Sbjct: 1711 -VPSQP-----NAQAQAQWGMNMVNNNQNSAQPQAPANQNSSWG-QGTVNPNMGWVGPAQ 1770
Query: 1624 PNATGMMWGSSAQSSASVGTNPGWNAPGQGPPVRNNIQGW--------QAHSSIPPQVNA 1683
GSS S+ T+ GW AP QG GW Q+ S + Q
Sbjct: 1771 TGVNVNWGGSSVPSTVQGITHSGWVAPVQGQTQAYPNPGWGPTGHPQSQSQSQVQAQAGT 1772
Query: 1684 T-PGWVAPNLGPMPPMNMNPNWHAPSANQGMWSNEHGKNGDRFSNPDSVSHGGDPGNGGK 1743
T GW+ P G + N N NW N ++ GG GN
Sbjct: 1831 TGSGWMQPGQG-IQSGNSNQNWGT-------------------QNQTAIPSGGSGGNQAG 1772
Query: 1744 SWG-MPPSYGGGGG-----SSSRLPYNNKGQKLCKY-HESGHCKKGGSCDYRH 1756
WG S G G S N KGQ++CK+ E+GHC+KG SC+Y H
Sbjct: 1891 YWGNQQQSQNGDSGYGWNRQSGGQQNNFKGQRVCKFFRENGHCRKGASCNYLH 1772
BLAST of Cp4.1LG04g01060 vs. Swiss-Prot
Match:
C3H44_ARATH (Zinc finger CCCH domain-containing protein 44 OS=Arabidopsis thaliana GN=At3g51120 PE=2 SV=3)
HSP 1 Score: 573.9 bits (1478), Expect = 5.9e-162
Identity = 340/718 (47.35%), Positives = 438/718 (61.00%), Query Frame = 1
Query: 412 ENLDDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDV 471
+ L +L +A EE+ + VD+ N + T A M
Sbjct: 6 KQLQQGVPELASLAGREESSVRGIDLMRVDQCEEIGVNQVPALSVPASTVAGAVAVPMSN 65
Query: 472 TEEV---DEASKGSSGAKRKRGKNSKAPARVP----------SRKKVEEDVCFICFDGGD 531
+EV DEA+ KRKRG+ +A A P ++ EEDVCFICFDGGD
Sbjct: 66 EQEVKVIDEAAP----IKRKRGRPPRAQANTPLHIRPPPPPPKKEDKEEDVCFICFDGGD 125
Query: 532 LVLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGC 591
LVLCDRR CPKAYHP+CI RDEAFFR +WNCGWH+C C+K + YMCYTCTFS+CK C
Sbjct: 126 LVLCDRRNCPKAYHPACIKRDEAFFRTTAKWNCGWHICGTCQKASSYMCYTCTFSVCKRC 185
Query: 592 IKNAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKG 651
IK+A + VRGN G C C++ +MLIE QG E ++DF+DK SWEYLFK YW LK
Sbjct: 186 IKDADYVIVRGNMGLCGTCIKPIMLIENIAQGDNEAVKVDFDDKLSWEYLFKVYWLCLKE 245
Query: 652 SLSLTFDELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAK 711
LSLT DEL A NPWK E N+ P + N LDV+ N G+ ++R +
Sbjct: 246 ELSLTVDELTRANNPWK--EVPNTAPKVESQNDHTN---NRALDVAVN---GTKRRRTS- 305
Query: 712 RRSKSQAKETNSPSMPIIPDSQGPST-----DNNVEWASKELLEFVMHMKNGDRTVLSQF 771
+SP++P D + PS + WA+KELLEFV MKNGD +VLSQF
Sbjct: 306 ----------DSPTLPNKLDGKNPSNILKKAPGDTSWATKELLEFVSFMKNGDTSVLSQF 365
Query: 772 DVQALLLEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQ- 831
DVQ LLL+YIK+ LRDP +KSQ++CD L LFGK RVGHFEMLKLLESH LI+E +
Sbjct: 366 DVQGLLLDYIKKKNLRDPLQKSQVLCDQMLVKLFGKQRVGHFEMLKLLESHVLIQEKPKG 425
Query: 832 INDLQGSVADTESSQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNI 891
G SQ+E D D + R+ R+MR+K D R NLD YAAID+HNI
Sbjct: 426 AKTTNGETTHAVPSQIEEDSVHDPMVRDRR---RKMRRKTDGRVQNENLDAYAAIDVHNI 485
Query: 892 NLIYLRRNLVEYLIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKV 951
NLIYLRR +E L++D EKVVG+ +RI++SG+ QK D++RLVQVVGTSKA Y++
Sbjct: 486 NLIYLRRKFLESLLDDINKVDEKVVGTILRIKVSGSDQKLDIHRLVQVVGTSKAIASYQL 545
Query: 952 GKKMTDILLEILNLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAM 1011
G K TD++LEILNL+K EVISID +S+Q TE+ECKRLRQSIKCG+ RLTV D+ + A
Sbjct: 546 GAKTTDVMLEILNLDKREVISIDQLSDQNITEDECKRLRQSIKCGLNKRLTVVDILKTAA 605
Query: 1012 SLQDARVKDWMETEIVRLSHLRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLE 1071
+LQ R+ + +E EI++L+HLRDRA +KL+LLK+PEERQR L+
Sbjct: 606 TLQAMRINEALEAEILKLNHLRDRA------------------KKLELLKSPEERQRLLQ 665
Query: 1072 ELPGIHTDPNMDPSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLND 1111
E+P +HTDP+MDPSH ++ ++Q+ + ++ G P G NLN+
Sbjct: 666 EVPEVHTDPSMDPSHALSEDAGLGTRKQDNHVKAQSKG----------PQNKGVNLNN 669
BLAST of Cp4.1LG04g01060 vs. Swiss-Prot
Match:
Y5843_ARATH (Uncharacterized protein At5g08430 OS=Arabidopsis thaliana GN=At5g08430 PE=1 SV=2)
HSP 1 Score: 183.7 bits (465), Expect = 1.7e-44
Identity = 161/562 (28.65%), Positives = 262/562 (46.62%), Query Frame = 1
Query: 728 VEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRKSQIICDSRLENLF 787
V W S++L+EF+ + ++S++DV + +YI + L DP K +++CD RL LF
Sbjct: 30 VGWGSRQLIEFLHSLGKDTSEMISRYDVSDTIAKYISKEGLLDPSNKKKVVCDKRLVLLF 89
Query: 788 GKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYTDASGKTRKEKKRR 847
G + ++ LLE H+ +D D++ L D + K KR
Sbjct: 90 GTRTIFRMKVYDLLEKHYKENQD-----------DSDFDFLYEDE-PQIICHSEKIAKRT 149
Query: 848 MRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHEKVVGSFVRIRISG 907
+ RG +AAI NI L+YLR++LV+ L++ ++F K++GSFVRI+
Sbjct: 150 SKVVKKPRGT------FAAIVSDNIKLLYLRKSLVQELLKSPDTFEGKMLGSFVRIKSDP 209
Query: 908 NAQKQDL-YRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISIDIISNQEFTEEE 967
N Q Y+LVQV G K D LL++ N K +SI ++S+ F++EE
Sbjct: 210 NDYLQKYPYQLVQVTGVKKEHGT-------DDFLLQVTNYVKD--VSISVLSDDNFSQEE 269
Query: 968 CKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLRDRASEKGRRKEY 1027
C+ L Q IK G+L + T+ +++E+A L + K W+ EI L L DRA+EKG R+E
Sbjct: 270 CEDLHQRIKNGLLKKPTIVEMEEKAKKLHKDQTKHWLGREIELLKRLIDRANEKGWRRE- 329
Query: 1028 PFYNIMECVEKLQLLKTPEERQRRLEELPGI-------HTDPNMDPSHESEDEDEADDKR 1087
+ E ++K +LL+ P+E+ R L E+P + + + + +H+S++E +
Sbjct: 330 ----LSEYLDKRELLQNPDEQARLLREVPEVIGEELVQNPEVSSPEAHKSDNEQRLSESP 389
Query: 1088 QET-YTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSTNRDLSRNLSGKGFS--- 1147
+ + + G SN + T + N+ L ++ G
Sbjct: 390 LSCIHETPEARNLFGGEDQQFNNGYVMSNPITTPGITSCATEINKGLPTWIASAGAEYLH 449
Query: 1148 ---NQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAASELP 1207
Q + I GE E S + + N + QV P+ EL
Sbjct: 450 VDVEQPANGIIGGETPTEESKVSQLQSSIPVNNVDNGSQVQPNPSEVI---------ELS 509
Query: 1208 SAARSVNSAASPSVGTTQNAATVN-ETEKI-WRYQDPSGKVQGPFSMVQLRKWSNTGYFP 1267
N ++ ++ + EK+ W Y+DP G VQGPFS+ QL+ WS+ YF
Sbjct: 510 DDDEDDNGDGETLDPKVEDVRVLSYDKEKLNWLYKDPQGLVQGPFSLTQLKAWSDAEYFT 550
Query: 1268 ADLRVWRASDKQDDSLLLTDVL 1273
RVW + + ++LLTDVL
Sbjct: 570 KQFRVWMTGESMESAVLLTDVL 550
BLAST of Cp4.1LG04g01060 vs. Swiss-Prot
Match:
NSD3_MOUSE (Histone-lysine N-methyltransferase NSD3 OS=Mus musculus GN=Whsc1l1 PE=1 SV=2)
HSP 1 Score: 88.6 bits (218), Expect = 7.5e-16
Identity = 47/113 (41.59%), Positives = 60/113 (53.10%), Query Frame = 1
Query: 472 TEEVDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAY 531
T VDE +K AK K+ + KA A K + ED CF C DGG+LV+CD++ CPKAY
Sbjct: 1296 TSAVDEKTKN---AKLKKRRKVKAEA-----KPIHEDYCFQCGDGGELVMCDKKDCPKAY 1355
Query: 532 HPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVI 585
H C+N + G+W C WH C C A C C S CK K A++
Sbjct: 1356 HLLCLNLTQP---PHGKWECPWHRCDECGSVAVSFCEFCPHSFCKAHGKGALV 1397
BLAST of Cp4.1LG04g01060 vs. Swiss-Prot
Match:
NSD3_HUMAN (Histone-lysine N-methyltransferase NSD3 OS=Homo sapiens GN=WHSC1L1 PE=1 SV=1)
HSP 1 Score: 85.1 bits (209), Expect = 8.3e-15
Identity = 42/109 (38.53%), Positives = 58/109 (53.21%), Query Frame = 1
Query: 476 DEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPSC 535
+E ++ K+KR K P K++ ED CF C DGG+LV+CD++ CPKAYH C
Sbjct: 1296 NEEKAKNAKLKQKRRKIKTEP------KQMHEDYCFQCGDGGELVMCDKKDCPKAYHLLC 1355
Query: 536 INRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVI 585
+N + + G+W C WH C C A C C S CK K A++
Sbjct: 1356 LNLTQPPY---GKWECPWHQCDECSSAAVSFCEFCPHSFCKDHEKGALV 1395
BLAST of Cp4.1LG04g01060 vs. TrEMBL
Match:
A0A0A0K4G1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G006220 PE=4 SV=1)
HSP 1 Score: 2514.6 bits (6516), Expect = 0.0e+00
Identity = 1369/1819 (75.26%), Positives = 1487/1819 (81.75%), Query Frame = 1
Query: 1 MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRE-LHSNEEQHCLFQSAINELEFPSN 60
MEAEE+DSS DQ SS L VDDG LDVKC T+RE L SNE+QHC+ +S+I E F N
Sbjct: 1 MEAEEDDSSYQDQKSSSLY-VDDGK-LDVKCDTNREELLSNEQQHCVSKSSIIETGFSPN 60
Query: 61 SSVESLQPSDAIRGDESLVAETCLE------VEETEIAGVKACRNGIEDMGEDSVKLEVE 120
+ VESL P DAI GDE L TC E VEE E RN I+DMGEDSVKLE+E
Sbjct: 61 TVVESLPPRDAILGDEILAVGTCSEMEKKDLVEERERVEENDFRNIIQDMGEDSVKLEIE 120
Query: 121 PDIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVN 180
P IA GLL + F+DVK+ EE KA++EF EG+LL M VG AENQVEGNVLM N
Sbjct: 121 PGIAKAGLLEQRAFDDVKKNTGVTEEEKALSEFAEGELLPGMVFVGVAENQVEGNVLMAN 180
Query: 181 LPDNTV-----GCGETD--TCLSDVLAE--LAETTPFVHGVDTTDVANLVERKEVEENAD 240
++TV GC ET TCLS VLAE LAETTPFV GVD T NLV++ EVEE+AD
Sbjct: 181 FSEHTVVDGSAGCVETTETTCLSYVLAEERLAETTPFVQGVDVTVATNLVQKTEVEEHAD 240
Query: 241 DPKDSKDIEVAKQETFSMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGET 300
D DSKD EV KQE F++E +LGV VQL E SELK SLVDG V EGRTENLADRTGET
Sbjct: 241 DTNDSKDTEVPKQENFAVEKMELGVQVQLEEDSELKVSLVDGVV--EGRTENLADRTGET 300
Query: 301 LKMENDSSKTDEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLA 360
LKMEN SS ++EVGL +FA EI V + N EDKT+E DGMC+E+KA D NLA
Sbjct: 301 LKMENASSTSNEVGLTHFAVEIKETVNIGNDEDKTMETDGMCVEEKATDVGMME---NLA 360
Query: 361 DETPKIKGVHVTDDNIEVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENV- 420
DETP+IKGV V D +IE LKIE++EDREAGVQGLG+ADES V K+EN+ DE AE E V
Sbjct: 361 DETPEIKGVDVADYSIEELKIEDMEDREAGVQGLGLADESPVVEKLENVADENAEPEGVQ 420
Query: 421 -TNYTAESM--ENL-DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGE 480
T+YTAE + EN+ DDKTAQ EE+AM EE E DD VYLVDEGIGSEE D NMTYLV E
Sbjct: 421 VTDYTAEEVKSENVEDDKTAQGEEVAMGEEIAEPDDMVYLVDEGIGSEETDVNMTYLVEE 480
Query: 481 TEAAEEVEEMDVTEEVDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDL 540
TEAAEEVEEMD TEEVDE + SSG+KRKRGKNSKAPARV SRKKVEEDVCFICFDGGDL
Sbjct: 481 TEAAEEVEEMDATEEVDEPNISSSGSKRKRGKNSKAPARVASRKKVEEDVCFICFDGGDL 540
Query: 541 VLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI 600
VLCDRRGCPKAYHP+CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI
Sbjct: 541 VLCDRRGCPKAYHPACINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI 600
Query: 601 KNAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGS 660
KNAVILCVRGNKGFCE CMRFV IEKNEQG+ EKGQIDFNDK SWEYLFKEYWTDLKGS
Sbjct: 601 KNAVILCVRGNKGFCETCMRFVTSIEKNEQGNKEKGQIDFNDKNSWEYLFKEYWTDLKGS 660
Query: 661 LSLTFDELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKR 720
LSLTFDELVHAKNPWKGSETL SRPDSPGEL DGNVDGGSDLDVSENEESGSSKKRKAK+
Sbjct: 661 LSLTFDELVHAKNPWKGSETLTSRPDSPGELCDGNVDGGSDLDVSENEESGSSKKRKAKK 720
Query: 721 RSKSQAKETNSPSMPIIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALL 780
RS+SQAKE +SPSMP SQG STD+NVEW SKELLEFVMHMKNGDRTVLSQFDVQALL
Sbjct: 721 RSRSQAKEMSSPSMPATA-SQGLSTDDNVEWGSKELLEFVMHMKNGDRTVLSQFDVQALL 780
Query: 781 LEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGS 840
LEYIKRNKLRDPRRKSQIICDSRLE+LFGKPRVGHFEMLKLLESHFLI+ED QINDL S
Sbjct: 781 LEYIKRNKLRDPRRKSQIICDSRLESLFGKPRVGHFEMLKLLESHFLIKEDAQINDLHVS 840
Query: 841 VADTESSQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRR 900
VA+TESSQLE DG TD SGK +KEKKRR RKK D+RGLQSNLDDYAAIDIHNINLIYL+R
Sbjct: 841 VAETESSQLEADG-TDGSGKIKKEKKRRTRKK-DERGLQSNLDDYAAIDIHNINLIYLKR 900
Query: 901 NLVEYLIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDI 960
NLVEYLIEDEESFH+KVVGSFVRIRISG+AQKQDLYRLVQVVGTSKASEPYKVGK+MTDI
Sbjct: 901 NLVEYLIEDEESFHDKVVGSFVRIRISGSAQKQDLYRLVQVVGTSKASEPYKVGKRMTDI 960
Query: 961 LLEILNLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARV 1020
LLEILNLNKTEV+SIDIISNQEFTE+ECKRLRQS+KCGI+NRLTVGDLQERAMSLQDARV
Sbjct: 961 LLEILNLNKTEVVSIDIISNQEFTEDECKRLRQSMKCGIINRLTVGDLQERAMSLQDARV 1020
Query: 1021 KDWMETEIVRLSHLRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHT 1080
KDWMETEIVRLSHL + ECVEKLQLLKTPEERQRR+EE+P IH
Sbjct: 1021 KDWMETEIVRLSHLHSLL-------------LRECVEKLQLLKTPEERQRRIEEIPEIHA 1080
Query: 1081 DPNMDPSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFS 1140
DPNMDPSHESEDEDEADDKR+ETYTLSR + F RRTREPVSPGK GS+LNDSWSGTRNFS
Sbjct: 1081 DPNMDPSHESEDEDEADDKRRETYTLSRSTSFGRRTREPVSPGKGGSHLNDSWSGTRNFS 1140
Query: 1141 STNRDLSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMT 1200
+TNRD+SRNLSGKGF+NQG+DAIGSGEIINE SW HGRE DVKK +KWDK QVSPSSE+T
Sbjct: 1141 NTNRDMSRNLSGKGFANQGDDAIGSGEIINETSWGHGRERDVKKTSKWDK-QVSPSSEIT 1200
Query: 1201 ARNALSGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQ 1260
ARNALSGAASE SAA SVN AAS SVGTTQNAATVNE+EKIW YQDPSGKVQGPFSMVQ
Sbjct: 1201 ARNALSGAASE-SSAAHSVNPAASSSVGTTQNAATVNESEKIWHYQDPSGKVQGPFSMVQ 1260
Query: 1261 LRKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAK 1320
LRKWSNTGYFP DLR+WR SD+Q+DSLLLTDVLAGKI KDT NS+Q ++S FV +
Sbjct: 1261 LRKWSNTGYFPTDLRIWRISDQQEDSLLLTDVLAGKISKDTPLTSNSLQVHPNSSPFVGR 1320
Query: 1321 PQGATVQSGMDVQNTGTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGD 1380
PQG T+QSG+D QN +SN HTNPTSY QS+GGRWKSQ EVSPTG P S SI+VPRYSGD
Sbjct: 1321 PQGGTLQSGVDGQNASSSNSHTNPTSYDQSSGGRWKSQNEVSPTGRPVSGSIKVPRYSGD 1380
Query: 1381 RWSSDHGNKDFTSLPSPTPSSGGTKEQPFQMATPF-----ASSAGGGSLHGSSLMQGSEN 1440
RWSSDHGNK+FT+LPSPTPSSGG+KEQPFQ+A F S GG LHGSS+MQGSEN
Sbjct: 1381 RWSSDHGNKNFTNLPSPTPSSGGSKEQPFQVAASFMEAKSLSGTAGGGLHGSSVMQGSEN 1440
Query: 1441 DSLRSHSGLNAAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLV 1500
DSLRSH G N++EKG G GPIN LQNH S PVR S IIDD +NPAADI+SISANL SLV
Sbjct: 1441 DSLRSHLGRNSSEKGLGSGPINALQNHQSQPVRQSPIIDDASLNPAADIRSISANLQSLV 1500
Query: 1501 QSINSRNPPIETQ------------------------TVETNISSSMPPGQTLHRRWGEM 1560
QSINSRNPPIE VE+N+SSSMPP QTLH RWGEM
Sbjct: 1501 QSINSRNPPIEAHGHGSGSILKRETDTSEAWQNAHSLKVESNVSSSMPPAQTLHSRWGEM 1560
Query: 1561 SPAQNAA---------TASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPPNIPWGMG 1620
SPAQNAA T+SFS+ G+++F SS+PWRS PI SNP HIQ STPPN+PWGMG
Sbjct: 1561 SPAQNAAVTSFSAGSSTSSFSSAGMSSFPSSDPWRSTAPISSNPQHIQCSTPPNLPWGMG 1620
Query: 1621 APEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSA-PPNATGMMWGSSAQSSASVGTNP 1680
APEGQSTVPR G ESQN +WGPMPSGNPNM W P+ PPNAT MMWG++AQSS TNP
Sbjct: 1621 APEGQSTVPRQGSESQNQTWGPMPSGNPNMGWGPTGPPPNATAMMWGATAQSSGPAATNP 1680
Query: 1681 GWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGM 1740
GW APGQGP NN+QGW AHS +PP VNATPGWV PN+ PMPPMNMNP+W PS NQ M
Sbjct: 1681 GWIAPGQGPAAGNNLQGWPAHSPMPPPVNATPGWVGPNVAPMPPMNMNPSWLVPSVNQNM 1740
Query: 1741 WSNEHGKNGDRFSN-PDSVSHGGDPGNGGKSWGMPPSY--GGGGGSSSRLPYNNKGQKLC 1757
W NEHGKNG+RFSN D SHGGDPGNG KSWGM PS+ GGGGG +SR PY N+ QKLC
Sbjct: 1741 WGNEHGKNGNRFSNQKDGGSHGGDPGNGDKSWGMQPSFGGGGGGGGNSRSPY-NRVQKLC 1793
BLAST of Cp4.1LG04g01060 vs. TrEMBL
Match:
A0A061DZP0_THECC (Nucleic acid binding,zinc ion binding,DNA binding, putative isoform 2 OS=Theobroma cacao GN=TCM_006789 PE=4 SV=1)
HSP 1 Score: 1302.0 bits (3368), Expect = 0.0e+00
Identity = 842/1751 (48.09%), Positives = 1084/1751 (61.91%), Query Frame = 1
Query: 99 GIEDMGEDSV----KLEVEPDIAAMGLLGETVFNDVKEEDAGAEEVK-AVAEFGEGDLLC 158
G+ D E V K +V D A ++ E D+ + AE ++ AVAE +L
Sbjct: 130 GVVDREEGHVAQEEKADVAEDAAVDDVMEEMEKADLSDGGGTAEGIEVAVAERQVAELAE 189
Query: 159 EMDLVGGAENQVEGNVLMVNLPDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTDVANL 218
E G + V+ ++ P++ G + AE+ T + ++ T VA++
Sbjct: 190 E---AGNEQKVVDDVQDQISSPEDKEVAGVAEERGIAEAAEVDGVTEQIVVMEETCVADV 249
Query: 219 VERKEVEENADDPKDSKDIEVAKQETFSMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGR 278
VE + + + A+ ++ I V ++ + + G+ +++SE+ V+ +++E +
Sbjct: 250 VEERGIAKAAEVGVVTEQIGVMEEAGLADMTERTGI----MDESEVAGVAVEREMLKEKQ 309
Query: 279 TENLADRT---GETL--KMENDSSKTDEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLE 338
+N ++T GET+ M S +E + + A + E VE + LE
Sbjct: 310 VDNEVEQTEILGETVVVNMVEKSESLEEKLMVDVAERF--GIGEETRVTDLVEKREL-LE 369
Query: 339 DKAADATTKTTTGNLADETPKIKGVHVTDDNIEVLKIENVEDREAGVQGLGVADESAEVG 398
DK N AD ++ V D V K +++E+ Q +G E E
Sbjct: 370 DKEEV--------NFADPNEILEDTGVVD---MVEKSQSLEE-----QLVGNVSEQTENL 429
Query: 399 KIENLVDET--AEAENVTNYTAESMENLDDKTAQLEEIAMEEETEEADDRVYLVDEGIGS 458
+ N V ET AE + VT +E E + +E++ E TE +D G G+
Sbjct: 430 EDTNAVRETGMAEVDTVTGEESEKAEGTETGNV-VEDVEKAEGTE--------IDVGDGA 489
Query: 459 EENDA-------NMTYLVGETEAAEEVEEMDVTEEVDEASKGSSGAKRKRGKNS--KAPA 518
E +A +MT V E EAAEE E+ EEV++ASK +SG KRKRGKNS K A
Sbjct: 490 EGVEAAEDTEMLDMTEEV-EMEAAEETED---AEEVEDASK-ASGGKRKRGKNSNSKVLA 549
Query: 519 RVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCS 578
R PSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYH +C+ RDEAFFRAKG+WNCGWHLCS
Sbjct: 550 RAPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHTACVGRDEAFFRAKGKWNCGWHLCS 609
Query: 579 NCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQI 638
NC+K A+YMCYTCTFSLCKGCIK+AVIL VRGNKG CE+CM +MLIE+NEQ Q+
Sbjct: 610 NCKKNAYYMCYTCTFSLCKGCIKDAVILSVRGNKGLCESCMNLIMLIERNEQA-----QV 669
Query: 639 DFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPWKGSETLNSRPDSPGELYDGNVDG 698
+F+DK+SWEYLFK+YW DLK LS+ DEL AKNPWKGSE ++ +SP E +D N G
Sbjct: 670 NFDDKSSWEYLFKDYWIDLKRRLSINSDELAQAKNPWKGSEGRAAKQESPDE-HDFNDGG 729
Query: 699 GSDLDVSE-NEESGSSKKRKAKRRSKSQAKETNSPSMPIIPDSQGPSTDNNVEWASKELL 758
GS D S N E +SK+R+ + +SKS+A+E +SPS + +G STD + EWASKELL
Sbjct: 730 GSGSDGSSGNAEVTASKRRRTRSQSKSRAREGDSPST-VTASGEGASTDESAEWASKELL 789
Query: 759 EFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFE 818
E VMHM+NGD++VLS+ ++ L+L+YI+++KLRD R KS +ICD+RL++LFGKPRVGH E
Sbjct: 790 EVVMHMRNGDKSVLSRMELSQLILDYIQKHKLRDRRNKSYVICDTRLKSLFGKPRVGHIE 849
Query: 819 MLKLLESH-FLIREDVQINDLQGSVADTESSQLEGDGYTDASGKTRKEKKRRMRKKGDQR 878
ML LL+ H F +ED Q +++QGSV D E++QLE D +DA KT K+KKR+ RKKGD R
Sbjct: 850 MLNLLDPHIFFTKEDSQTDEIQGSVVDAEANQLEADWNSDAMTKTGKDKKRKTRKKGDAR 909
Query: 879 GLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHEKVVGSFVRIRISGNAQKQDLY 938
GLQSNLDDYAAID+HNINLIYLRRNLVE LIED E+FH+KVVGSFVRIRISG QKQDLY
Sbjct: 910 GLQSNLDDYAAIDMHNINLIYLRRNLVEDLIEDTETFHDKVVGSFVRIRISGAGQKQDLY 969
Query: 939 RLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISIDIISNQEFTEEECKRLRQSIK 998
RLVQVVGT+K +E Y+VGK+ TD LLEILNLNKTE++SIDIISNQEFTE+ECKRLRQSIK
Sbjct: 970 RLVQVVGTNKVAETYRVGKRTTDFLLEILNLNKTEIVSIDIISNQEFTEDECKRLRQSIK 1029
Query: 999 CGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLRDRASEKGRRKEYPFYNIMECV 1058
CG++NRLTVGD+QE+AM++Q RVKDW+E+EI+RLSHLRDRASEKG RKE + ECV
Sbjct: 1030 CGLINRLTVGDIQEKAMAIQAVRVKDWLESEIMRLSHLRDRASEKGHRKE-----LRECV 1089
Query: 1059 EKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDEADDKRQETYTLSRGSGFSRRT 1118
EKLQ+LKTPEERQRRLEE+P IH DPNMDPS+ESE+++ DDKRQ+ Y RGSGFSRR
Sbjct: 1090 EKLQILKTPEERQRRLEEIPEIHVDPNMDPSYESEEDEGEDDKRQDNYMRPRGSGFSRRG 1149
Query: 1119 REPVSPGKAGSNLNDSWSGTRNFSSTNRDLSRNLSGKGFSNQGEDAIGSGEIINENSWSH 1178
REP+SP K G + +DSWSGTRN+SS NR+LSRNLS KG ++G+D++G+GE++NEN W+
Sbjct: 1150 REPISPRKGGLSSSDSWSGTRNYSSMNRELSRNLSNKGLMSKGDDSVGAGEMVNENLWNL 1209
Query: 1179 GREGDVKKPNKWDKQQVSPSSEMTARNALSGAASELPSAARSVNSAASPSVGTTQNAATV 1238
GRE + + PN WDK + + SSE+ RN S E S S S S G T A +
Sbjct: 1210 GRERETQ-PNSWDKPKTALSSEIGTRNTHSVVTQEPSSKVVSEISPTPLSTGVTA-AVQI 1269
Query: 1239 NETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGK 1298
NETEKIWRYQDPSGKVQGPFSMVQLRKW++TGYFPA+L++WR ++KQDDS+LLTD L GK
Sbjct: 1270 NETEKIWRYQDPSGKVQGPFSMVQLRKWNDTGYFPAELKIWRTTEKQDDSILLTDALVGK 1329
Query: 1299 IPKDTSSVDNSI-QAQAHASSFVAKPQGATVQSGMDVQ---------NTGTSNPHTNPTS 1358
KD DNS +AQ + GAT++ GM+ Q N +P +S
Sbjct: 1330 FQKDPPVADNSFPKAQV---ALYGSGVGATLKQGMENQVGERSRFDQNHVAWSPQRTLSS 1389
Query: 1359 YGQSAGGRWKSQTEV-SPTGIPASASIEVPRYSGDRWSSDHGNKDFTSLPSPTPS---SG 1418
GQSA WKSQTE S TG PA +S+E+P+YS D W SD T+LPSPTP+ SG
Sbjct: 1390 SGQSAVESWKSQTEAPSSTGRPAPSSLEMPKYSRDAWGSD------TNLPSPTPNQNPSG 1449
Query: 1419 GTKEQPFQMA---TPFASSAG---GGSLHGSSLMQGSENDSLRSHSGLNAAEKGTGLGPI 1478
G K Q F+ TP SS S G++ G + ++ SG AA +
Sbjct: 1450 GAKGQVFESKWSPTPVQSSVSVSVANSFRGAT--SGLQPPTVVLESGSPAAPVVHSHMAV 1509
Query: 1479 NGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSRNPPIET--------- 1538
+G + + S +N AD+K++ +L +LVQ ++S NP +ET
Sbjct: 1510 SGESLRTQVNAQAS-------INSGADMKNVGVSLQNLVQPVSSHNPSLETHGWGSGSVL 1569
Query: 1539 -----------------------QTVETNISSSMPPGQTLHRRWGE-MSPAQNAATASFS 1598
Q +E N S +MPP + W + + QN+A S
Sbjct: 1570 RQEVVAASSIPATGTQAWGNASAQKLEPNPSLAMPPQPASYGHWNDALQSGQNSAPLSTG 1629
Query: 1599 TP------GLTNFSSSEPWRSMPPIPSNPPHIQSSTPPNIPWGMGAPEGQSTVPRPGLES 1658
P G +S+ WR P+ SN +Q P N+PWGM + Q V R +
Sbjct: 1630 NPAGHFPTGQPTMLASDSWRPTAPVQSN---VQLPAPTNLPWGMAVADNQGAVLRQAPGN 1689
Query: 1659 QNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSSASVGTNPGWNAPGQGPPVRNNIQ 1718
Q+ WGPMP GN NM W P N + WG+S+Q SA V NP W APGQG N
Sbjct: 1690 QSTGWGPMP-GNQNMGWGAPVPANPN-VNWGASSQGSAPVNPNPSWAAPGQGQMPGNANS 1749
Query: 1719 GWQAHSSI----------PPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGMWSNEHG 1756
GW A + P VN + GWVAP G P + NP + APS N GMW NE
Sbjct: 1750 GWTAPGNAIPGWAPPGQGPAVVNTSSGWVAPGQGATPG-SANPGYVAPSGNSGMWGNEQN 1799
BLAST of Cp4.1LG04g01060 vs. TrEMBL
Match:
V7AUM1_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_009G003300g PE=4 SV=1)
HSP 1 Score: 1300.0 bits (3363), Expect = 0.0e+00
Identity = 790/1514 (52.18%), Positives = 971/1514 (64.13%), Query Frame = 1
Query: 304 GEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLADETPKIKGVHVTDDNIEVL 363
GE+ A E VE D +DA T + DE + KG VTD + L
Sbjct: 35 GELSAAAAQEVV---AVEPDATMETAVESDAGVGT---HAMDEVIEEKGTEVTDVDDMAL 94
Query: 364 KIENVEDR-----EAGVQGLGVADESAEVGKIENLVDETAEAENVTNYTAESMENLDDKT 423
++ENVE+ +A +G D + + E DE + E + + ++++
Sbjct: 95 EMENVEEEANLTIDAEEDEIGDEDANEDALMEEEEEDEQQQGEEEEEEEEKQQQGVEEEE 154
Query: 424 AQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTEEVDEA 483
+ ++ +EE EE D+ +G EE DA+ + +TE EE EE V
Sbjct: 155 EEQQQAEEDEEEEEEDEGEEEQQQG---EEEDADADAGMTKTEDTEEKEEKSV------- 214
Query: 484 SKGSSGAKRKRG--KNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPSCI 543
SG KRKRG KN+KA RV SRKK EEDVCFICFDGGDLVLCDRRGCPKAYHPSC+
Sbjct: 215 ----SGGKRKRGAGKNAKATGRVASRKKTEEDVCFICFDGGDLVLCDRRGCPKAYHPSCV 274
Query: 544 NRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFCEA 603
NRDEAFFRAKG+WNCGWHLCSNCE+ A+YMCYTCTFSLCKGCIK+AVILCVRGNKGFCE
Sbjct: 275 NRDEAFFRAKGKWNCGWHLCSNCERNANYMCYTCTFSLCKGCIKDAVILCVRGNKGFCET 334
Query: 604 CMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPWKG 663
CMR VMLIE+N QGS GQIDF+DK SWEYLFK+Y+ DLK LSLTFDE+ AKNPWKG
Sbjct: 335 CMRTVMLIEQNVQGSNV-GQIDFDDKNSWEYLFKDYYIDLKEKLSLTFDEITQAKNPWKG 394
Query: 664 SETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSMPII 723
S+ L+S+ +SP EL+D D GSD D S +S SK+RKAK+R KS++KE N +
Sbjct: 395 SDMLHSKEESPDELFDAPNDRGSDSDSSYENDSNRSKRRKAKKRGKSRSKEGNLHGAVTV 454
Query: 724 PDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRKSQ 783
+ GPS +++ EWASKELLEFVMHM+NGD++VLSQFDVQALLLEYIKRNKLRDPRRKSQ
Sbjct: 455 SGADGPSGNDSAEWASKELLEFVMHMRNGDKSVLSQFDVQALLLEYIKRNKLRDPRRKSQ 514
Query: 784 IICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYTDA 843
IICD+RL+NLFGKPRVGHFEMLKLLESHFL++ED Q D+QGSV DTE S LEGDG ++
Sbjct: 515 IICDARLQNLFGKPRVGHFEMLKLLESHFLLKEDSQAEDMQGSVVDTEVSHLEGDGNPNS 574
Query: 844 SGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHEKV 903
K K+K+R+ RKKGD+RGLQ+N+DDYAAID HNI LIYLRRNLVE L+ED E FH+KV
Sbjct: 575 YMKAGKDKRRKNRKKGDERGLQTNVDDYAAIDNHNITLIYLRRNLVEDLLEDTEKFHDKV 634
Query: 904 VGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISIDI 963
VGSFVRIRISG+ QKQDLYRLVQVVGT KA+EPYKVGK+MTD LLEILNLNKTE++SIDI
Sbjct: 635 VGSFVRIRISGSGQKQDLYRLVQVVGTCKAAEPYKVGKRMTDTLLEILNLNKTEIVSIDI 694
Query: 964 ISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLRDR 1023
ISNQEFTE+ECKRLRQSIKCG++NRLTVGD+Q++A+ LQ RVKDW+ETEIVRLSHLRDR
Sbjct: 695 ISNQEFTEDECKRLRQSIKCGLINRLTVGDIQDKALVLQAVRVKDWLETEIVRLSHLRDR 754
Query: 1024 ASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHTDPNMDPSHES-EDEDEA 1083
ASEKGRRKE + ECVEKLQLLKTPEERQRRLEE+P IH DPNMDPS+ES EDEDE
Sbjct: 755 ASEKGRRKE-----LRECVEKLQLLKTPEERQRRLEEIPEIHVDPNMDPSYESEEDEDEM 814
Query: 1084 DDKRQETYTLSRGS-GFSRRTREPVSPGKAGSNLNDSWSGTRNFSSTNRDLSRNLSGKGF 1143
DDKR+E Y RGS F RR R+ VSP ++ S NDSWSGTRN+S+ N++LSRNLS KGF
Sbjct: 815 DDKRRENYMRPRGSTSFGRRGRDIVSP-RSVSVSNDSWSGTRNYSNANQELSRNLSSKGF 874
Query: 1144 SNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAASELPSA 1203
S +GE+A E++N+ GR+ + + N W++Q++S S E A++ S S+ S
Sbjct: 875 SVKGENASNVNEVLNDTHLHPGRDRESQLSNSWERQKLSSSLESGAKSNQSLVTSDSFST 934
Query: 1204 ARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLR 1263
A SA S G T +A +NETEK W YQDPSGKVQGPFSMVQLRKWSNTGYFPADLR
Sbjct: 935 AVLEASATPSSAGITPSALKINETEKTWHYQDPSGKVQGPFSMVQLRKWSNTGYFPADLR 994
Query: 1264 VWRASDKQDDSLLLTDVLAGKIPKDTSSVDNS--IQAQAHASSFVAK-PQGATVQSGMDV 1323
+WR ++KQDDS+L+TD LAG K+ S VD + + + +S+ K QG Q G
Sbjct: 995 IWRTTEKQDDSILVTDALAGNFSKEPSMVDKAQKVHDLHYPASYSRKSAQGTEGQVGERP 1054
Query: 1324 ---QNTGTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPAS-ASIEVPRYSGDRWSSDHGN 1383
QN+G+ N H+ S GQ+ GG W+S+ ++ S ++EVP+ + W SD G+
Sbjct: 1055 SFDQNSGSLNSHSTLGSPGQTTGGSWRSKDNMNSLANRTSPLAVEVPKNPANGWGSDAGS 1114
Query: 1384 K-DFTSLPSPTPSS--GGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSHSGLN 1443
+ + T+LPSPTP + G TK Q F+ GSL G+S +H GL
Sbjct: 1115 RNEATNLPSPTPQTTPGVTKVQAFENKWSPTPVQLPGSLIGNSFP--------GNHGGLQ 1174
Query: 1444 AAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNP-----------AADIKSISANL--- 1503
A+ + + S P S+ ID++ ++P D+K N+
Sbjct: 1175 ASLVVHAEHAVQNPEKGSSQPGISSASIDNSKLHPQPAAVAPVLPSGVDLKMAGTNMQNQ 1234
Query: 1504 -------HSLVQSINSRNPPIETQTVETNISS-----SMPPGQTLHRRWGEMSPAQNAA- 1563
H+ Q S P +SS +MP H W + S QN A
Sbjct: 1235 VVRSHNSHAEAQGWGSAGVPKPELQAWGGVSSQPNPAAMPAQPASHGPWVDASSVQNTAS 1294
Query: 1564 ------TASFSTPGLTNFSSSEPWRSMPPIPSNPPHI--QSSTPPNIPWGMGAPEGQSTV 1623
+ S TPG ++SEPWR PP S+ P+I S PPN+PWGMG P
Sbjct: 1295 FNTGNPSPSLPTPGFLGMNTSEPWR--PPASSSQPNITAPSPAPPNMPWGMGMP------ 1354
Query: 1624 PRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSSASVGTNPGWNAPGQG- 1683
+QN +WG + N N TW P+ P A +NPGW AP QG
Sbjct: 1355 -----GNQNMNWGGVVPANMNATWMPTQVP--------------APGNSNPGWAAPNQGL 1414
Query: 1684 ------PPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGMWS 1743
PPV N GW VN GWV G + P N NP W P+ N GMW
Sbjct: 1415 PPSQGLPPV--NAVGWVGPGQGRSHVNVNAGWVGSGQG-LAPGNANPVWVPPAGNPGMWG 1474
Query: 1744 NEHGKNGDRFSNP-DSVSHGGDPGNGGKSWGMPPSYGGGGGSSSRLPYNNKGQKLCKYHE 1756
+E NGDRF N D +HG D G GGKSW S+ G G+ SR P+ + + +CKYHE
Sbjct: 1475 SEQSHNGDRFPNQGDRGTHGRDSGYGGKSWNRQSSF--GRGAPSRPPFGGQ-RGVCKYHE 1480
BLAST of Cp4.1LG04g01060 vs. TrEMBL
Match:
A0A0S3RA09_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.02G011300 PE=4 SV=1)
HSP 1 Score: 1290.8 bits (3339), Expect = 0.0e+00
Identity = 793/1519 (52.21%), Positives = 983/1519 (64.71%), Query Frame = 1
Query: 287 MENDSSKTDEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLADE 346
M+N EV + GE+ A + E VE D +AA + + + DE
Sbjct: 96 MQNIPYAAAEVPEPDTVGELSAAAAVH--EVAAVEPDATM---EAAVESDEGVGAQVMDE 155
Query: 347 TPKIKGVHVTDDNIEVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENVTNY 406
+ KG VTD + L++ENVE+ L + E E+G + D E E+
Sbjct: 156 VIEEKGDEVTDVDDVALEMENVEEEG----NLAIDAEEDEIGDEDANEDALMEEEDDEQQ 215
Query: 407 TAESMENLDDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEV 466
E E +++ Q + + EEE ++ +G EE + + GE EA E+
Sbjct: 216 QGEEEEEGEEEEKQQQGVEEEEEEQQ---------QGEEEEEEEEEEEHQQGE-EAEEDA 275
Query: 467 E----EMDVTEEVDEASKGSSGAKRKRG--KNSKAPARVPSRKKVEEDVCFICFDGGDLV 526
+ + D TEE +E K SG KRKRG KN+K RV SRKK EEDVCFICFDGGDLV
Sbjct: 276 DAGMAKTDDTEEKEE--KSVSGGKRKRGAGKNAKTTGRVASRKKTEEDVCFICFDGGDLV 335
Query: 527 LCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIK 586
LCDRRGCPKAYHPSC+NRDEAFFRAKG+WNCGWHLCSNCE+ A+YMCYTCTFSLCKGCIK
Sbjct: 336 LCDRRGCPKAYHPSCVNRDEAFFRAKGKWNCGWHLCSNCERNANYMCYTCTFSLCKGCIK 395
Query: 587 NAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSL 646
+AVILCVRGNKGFCE CMR VMLIE+N QGS GQ+DF+DK SWEYLFK+Y+ DLK L
Sbjct: 396 DAVILCVRGNKGFCETCMRTVMLIEQNVQGSNV-GQVDFDDKNSWEYLFKDYYIDLKEKL 455
Query: 647 SLTFDELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRR 706
SLTFDE+ AKNPWKGS+ L+S+ +SP EL+D D GSD D S +S K+RKAK+R
Sbjct: 456 SLTFDEISQAKNPWKGSDMLHSKEESPDELFDATNDRGSDSDSSYENDSNRPKRRKAKKR 515
Query: 707 SKSQAKETNSPSMPIIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLL 766
K ++KE NS + + GPS D++ EWASKELLEFV+HM+NGD++VLSQFDVQALLL
Sbjct: 516 GKPRSKEGNSNGAVTVSGADGPSGDDSSEWASKELLEFVIHMRNGDKSVLSQFDVQALLL 575
Query: 767 EYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSV 826
EYIKRNKLRDPRRKSQIICD+RL+NLFGKPRVGHFEMLKLLESHFL++ED Q DLQGSV
Sbjct: 576 EYIKRNKLRDPRRKSQIICDARLQNLFGKPRVGHFEMLKLLESHFLLKEDSQAEDLQGSV 635
Query: 827 ADTESSQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRN 886
DTE S LEGDG ++ K K+KKR+ RKKGD RGLQ+N+DDYAAID HNINLIYLRRN
Sbjct: 636 VDTEVSHLEGDGNPNSYTKAGKDKKRKNRKKGDDRGLQTNVDDYAAIDNHNINLIYLRRN 695
Query: 887 LVEYLIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDIL 946
LVE L+ED E FH+KVVG+FVRIRISG+ QKQDLYRLVQVVGT KA+EPYKVGK+MTD L
Sbjct: 696 LVEDLLEDTEKFHDKVVGAFVRIRISGSGQKQDLYRLVQVVGTCKAAEPYKVGKRMTDTL 755
Query: 947 LEILNLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVK 1006
LEILNLNKTE++SIDIISNQEFTE+ECKRLRQSIKCG++NRLTVGD+Q++A+ LQ RVK
Sbjct: 756 LEILNLNKTEIVSIDIISNQEFTEDECKRLRQSIKCGLINRLTVGDIQDKALVLQAVRVK 815
Query: 1007 DWMETEIVRLSHLRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHTD 1066
DW+ETEIVRLSHLRDRASEKGRRKE + ECVEKLQLLKTPEERQRRLEE+P IH D
Sbjct: 816 DWLETEIVRLSHLRDRASEKGRRKE-----LRECVEKLQLLKTPEERQRRLEEIPEIHVD 875
Query: 1067 PNMDPSHES-EDEDEADDKRQETYTLSRGS-GFSRRTREPVSPGKAGSNLNDSWSGTRNF 1126
PNMDPS+ES EDEDE DDKR+E Y RGS F RR R+ SP ++ S N+SWSGTRN+
Sbjct: 876 PNMDPSYESEEDEDEMDDKRRENYMRPRGSTSFGRRGRDIASP-RSVSISNESWSGTRNY 935
Query: 1127 SSTNRDLSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEM 1186
S+TN++L RNLS KGFS +GE+A E++N+ GR+ + + N W++Q++S S E
Sbjct: 936 SNTNQELGRNLSNKGFSIKGENASNVNEVLNDTHLLQGRDRESQLSNSWERQKLSSSLES 995
Query: 1187 TARNALSGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMV 1246
A++ S S+ S A SAA S G T + +NETEK+W YQDPSGK+QGPFSMV
Sbjct: 996 GAKSTQSLVTSDSFSTAVLEASAAPSSAGITPSTLKINETEKMWHYQDPSGKIQGPFSMV 1055
Query: 1247 QLRKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDTSSVDNS--IQAQAHASSF 1306
QLRKWSNTGYFPADLRVWR ++KQDDS+L+TD LAG K+ S VD + + + +S+
Sbjct: 1056 QLRKWSNTGYFPADLRVWRTTEKQDDSILVTDALAGNFSKEPSMVDKAQKVHDLHYPASY 1115
Query: 1307 VAK-PQGATVQSGMDV---QNTGTSNPHTNPTSYGQSAGGRWKSQTEV-SPTGIPASASI 1366
K QG Q+G QN+G+ N H+ S Q+ GG W+S+ + S P+ ++
Sbjct: 1116 SRKSAQGMEGQAGERPTFDQNSGSLNSHSTLGSPAQTTGGSWRSKDNMNSLASRPSPLAV 1175
Query: 1367 EVPRYSGDRWSSDHGNK-DFTSLPSPTPSS--GGTKEQPFQMATPFASSAGGGSLHGSSL 1426
EVP+ + W SD G++ + T+LPSPTP + G +K Q F+ GSL G+S
Sbjct: 1176 EVPKNPANGWGSDAGSRNETTNLPSPTPQTTPGVSKGQAFENKWSPTPVQLPGSLVGNSF 1235
Query: 1427 --MQGSENDSLRSH--SGLNAAEKGTGLGPINGLQNHHS-LPVRPSSIIDDTLVNPAADI 1486
G SL H + EKG+ I+ + + +S L +P+ + +++ D+
Sbjct: 1236 PSNHGGLQASLVVHPEHAVQNPEKGSSQPGISSVSSDNSRLHPQPAPVA--PVLHSGLDL 1295
Query: 1487 KS----------ISANLHSLVQSINSRNPPIETQTVETNISS-----SMPPGQTLHRRWG 1546
K +S N H+ Q S P +SS +MP H W
Sbjct: 1296 KMAGTNMQNQVVLSHNSHAEAQGWGSAGVPRPELQAWGGVSSQPNSATMPAQPASHGPWV 1355
Query: 1547 EMSPAQNAA-------TASFSTPGLTNFSSSEPWRSMPPIPSNPPHI--QSSTPPNIPWG 1606
+ S QN A +A TPG ++ EPWR PP S+ P+I S PPN+PWG
Sbjct: 1356 DASSVQNTASFNTGNPSAGLPTPGFLGMNTPEPWR--PPASSSQPNITGPSPAPPNMPWG 1415
Query: 1607 MGAPEGQSTVPRPGLESQNHSW-GPMPSGNPNMTWAPSAPPNATGMMWGSSAQSSASVGT 1666
MG P +QN +W G +P+ N N W P+ Q A +
Sbjct: 1416 MGMP-----------GNQNMNWGGVVPAANMNANWIPT--------------QGPAPGNS 1475
Query: 1667 NPGWNAPGQG-PPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSAN 1726
NPGW AP QG PPV N GW VN GWV P G +PP N NP W P+ N
Sbjct: 1476 NPGWAAPSQGLPPV--NAVGWVGPGQGRSHVNVNAGWVGPGQG-VPPGNANPVWVPPAGN 1535
Query: 1727 QGMWSNEHGKNGDRFSNP-DSVSHGGDPGNGGKSWGMPPSYGGGGGSSSRLPYNNKGQKL 1756
G+W +E NGDRF N D S D G GGKSW S G G+ SR P+ + + +
Sbjct: 1536 PGVWGSEQSHNGDRFPNQGDRGSQSRDSGYGGKSWNNRQS-SFGRGAPSRPPFGGQ-RGV 1552
BLAST of Cp4.1LG04g01060 vs. TrEMBL
Match:
A0A061E0K8_THECC (Nucleic acid binding,zinc ion binding,DNA binding, putative isoform 1 OS=Theobroma cacao GN=TCM_006789 PE=4 SV=1)
HSP 1 Score: 1285.8 bits (3326), Expect = 0.0e+00
Identity = 839/1771 (47.37%), Positives = 1082/1771 (61.10%), Query Frame = 1
Query: 99 GIEDMGEDSV----KLEVEPDIAAMGLLGETVFNDVKEEDAGAEEVK-AVAEFGEGDLLC 158
G+ D E V K +V D A ++ E D+ + AE ++ AVAE +L
Sbjct: 130 GVVDREEGHVAQEEKADVAEDAAVDDVMEEMEKADLSDGGGTAEGIEVAVAERQVAELAE 189
Query: 159 EMDLVGGAENQVEGNVLMVNLPDNTVGCGETDTCLSDVLAELAETTPFVHGVDTTDVANL 218
E G + V+ ++ P++ G + AE+ T + ++ T VA++
Sbjct: 190 E---AGNEQKVVDDVQDQISSPEDKEVAGVAEERGIAEAAEVDGVTEQIVVMEETCVADV 249
Query: 219 VERKEVEENADDPKDSKDIEVAKQETFSMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGR 278
VE + + + A+ ++ I V ++ + + G+ +++SE+ V+ +++E +
Sbjct: 250 VEERGIAKAAEVGVVTEQIGVMEEAGLADMTERTGI----MDESEVAGVAVEREMLKEKQ 309
Query: 279 TENLADRT---GETL--KMENDSSKTDEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLE 338
+N ++T GET+ M S +E + + A + E VE + LE
Sbjct: 310 VDNEVEQTEILGETVVVNMVEKSESLEEKLMVDVAERF--GIGEETRVTDLVEKREL-LE 369
Query: 339 DKAADATTKTTTGNLADETPKIKGVHVTDDNIEVLKIENVEDREAGVQGLGVADESAEVG 398
DK N AD ++ V D V K +++E+ Q +G E E
Sbjct: 370 DKEEV--------NFADPNEILEDTGVVD---MVEKSQSLEE-----QLVGNVSEQTENL 429
Query: 399 KIENLVDET--AEAENVTNYTAESMENLDDKTAQLEEIAMEEETEEADDRVYLVDEGIGS 458
+ N V ET AE + VT +E E + +E++ E TE +D G G+
Sbjct: 430 EDTNAVRETGMAEVDTVTGEESEKAEGTETGNV-VEDVEKAEGTE--------IDVGDGA 489
Query: 459 EENDA-------NMTYLVGETEAAEEVEEMDVTEEVDEASKGSSGAKRKRGKNS--KAPA 518
E +A +MT V E EAAEE E+ EEV++ASK +SG KRKRGKNS K A
Sbjct: 490 EGVEAAEDTEMLDMTEEV-EMEAAEETED---AEEVEDASK-ASGGKRKRGKNSNSKVLA 549
Query: 519 RVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCS 578
R PSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYH +C+ RDEAFFRAKG+WNCGWHLCS
Sbjct: 550 RAPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHTACVGRDEAFFRAKGKWNCGWHLCS 609
Query: 579 NCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQI 638
NC+K A+YMCYTCTFSLCKGCIK+AVIL VRGNKG CE+CM +MLIE+NEQ Q+
Sbjct: 610 NCKKNAYYMCYTCTFSLCKGCIKDAVILSVRGNKGLCESCMNLIMLIERNEQA-----QV 669
Query: 639 DFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNPWKGSETLNSRPDSPGELYDGNVDG 698
+F+DK+SWEYLFK+YW DLK LS+ DEL AKNPWKGSE ++ +SP E +D N G
Sbjct: 670 NFDDKSSWEYLFKDYWIDLKRRLSINSDELAQAKNPWKGSEGRAAKQESPDE-HDFNDGG 729
Query: 699 GSDLDVSE-NEESGSSKKRKAKRRSKSQAKETNSPSMPIIPDSQGPSTDNNVEWASKELL 758
GS D S N E +SK+R+ + +SKS+A+E +SPS + +G STD + EWASKELL
Sbjct: 730 GSGSDGSSGNAEVTASKRRRTRSQSKSRAREGDSPST-VTASGEGASTDESAEWASKELL 789
Query: 759 EFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFE 818
E VMHM+NGD++VLS+ ++ L+L+YI+++KLRD R KS +ICD+RL++LFGKPRVGH E
Sbjct: 790 EVVMHMRNGDKSVLSRMELSQLILDYIQKHKLRDRRNKSYVICDTRLKSLFGKPRVGHIE 849
Query: 819 MLKLLESH-FLIREDVQINDLQGSVADTESSQLEGDGYTDASGKTRKEKKRRMRKKGDQR 878
ML LL+ H F +ED Q +++QGSV D E++QLE D +DA KT K+KKR+ RKKGD R
Sbjct: 850 MLNLLDPHIFFTKEDSQTDEIQGSVVDAEANQLEADWNSDAMTKTGKDKKRKTRKKGDAR 909
Query: 879 GLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHEKVVGSFVRIRISGNAQKQDLY 938
GLQSNLDDYAAID+HNINLIYLRRNLVE LIED E+FH+KVVGSFVRIRISG QKQDLY
Sbjct: 910 GLQSNLDDYAAIDMHNINLIYLRRNLVEDLIEDTETFHDKVVGSFVRIRISGAGQKQDLY 969
Query: 939 RLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISIDIISNQEFTEEECKRLRQSIK 998
RLVQVVGT+K +E Y+VGK+ TD LLEILNLNKTE++SIDIISNQEFTE+ECKRLRQSIK
Sbjct: 970 RLVQVVGTNKVAETYRVGKRTTDFLLEILNLNKTEIVSIDIISNQEFTEDECKRLRQSIK 1029
Query: 999 CGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLRDRASEKGRRKEYPFYNIMECV 1058
CG++NRLTVGD+QE+AM++Q RVKDW+E+EI+RLSHLRDRASEKG RKEYP I+ V
Sbjct: 1030 CGLINRLTVGDIQEKAMAIQAVRVKDWLESEIMRLSHLRDRASEKGHRKEYPLLVILLSV 1089
Query: 1059 --------------------EKLQLLKTPEERQRRLEELPGIHTDPNMDPSHESEDEDEA 1118
+ +LKTPEERQRRLEE+P IH DPNMDPS+ESE+++
Sbjct: 1090 LLSNSWMLVYIFFMAYGILLTFVVILKTPEERQRRLEEIPEIHVDPNMDPSYESEEDEGE 1149
Query: 1119 DDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSTNRDLSRNLSGKGFS 1178
DDKRQ+ Y RGSGFSRR REP+SP K G + +DSWSGTRN+SS NR+LSRNLS KG
Sbjct: 1150 DDKRQDNYMRPRGSGFSRRGREPISPRKGGLSSSDSWSGTRNYSSMNRELSRNLSNKGLM 1209
Query: 1179 NQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAASELPSAA 1238
++G+D++G+GE++NEN W+ GRE + + PN WDK + + SSE+ RN S E S
Sbjct: 1210 SKGDDSVGAGEMVNENLWNLGRERETQ-PNSWDKPKTALSSEIGTRNTHSVVTQEPSSKV 1269
Query: 1239 RSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWSNTGYFPADLRV 1298
S S S G T A +NETEKIWRYQDPSGKVQGPFSMVQLRKW++TGYFPA+L++
Sbjct: 1270 VSEISPTPLSTGVTA-AVQINETEKIWRYQDPSGKVQGPFSMVQLRKWNDTGYFPAELKI 1329
Query: 1299 WRASDKQDDSLLLTDVLAGKIPKDTSSVDNSI-QAQAHASSFVAKPQGATVQSGMDVQ-- 1358
WR ++KQDDS+LLTD L GK KD DNS +AQ + GAT++ GM+ Q
Sbjct: 1330 WRTTEKQDDSILLTDALVGKFQKDPPVADNSFPKAQV---ALYGSGVGATLKQGMENQVG 1389
Query: 1359 -------NTGTSNPHTNPTSYGQSAGGRWKSQTEV-SPTGIPASASIEVPRYSGDRWSSD 1418
N +P +S GQSA WKSQTE S TG PA +S+E+P+YS D W SD
Sbjct: 1390 ERSRFDQNHVAWSPQRTLSSSGQSAVESWKSQTEAPSSTGRPAPSSLEMPKYSRDAWGSD 1449
Query: 1419 HGNKDFTSLPSPTPS---SGGTKEQPFQMA---TPFASSAG---GGSLHGSSLMQGSEND 1478
T+LPSPTP+ SGG K Q F+ TP SS S G++ G +
Sbjct: 1450 ------TNLPSPTPNQNPSGGAKGQVFESKWSPTPVQSSVSVSVANSFRGAT--SGLQPP 1509
Query: 1479 SLRSHSGLNAAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQ 1538
++ SG AA ++G + + S +N AD+K++ +L +LVQ
Sbjct: 1510 TVVLESGSPAAPVVHSHMAVSGESLRTQVNAQAS-------INSGADMKNVGVSLQNLVQ 1569
Query: 1539 SINSRNPPIET--------------------------------QTVETNISSSMPPGQTL 1598
++S NP +ET Q +E N S +MPP
Sbjct: 1570 PVSSHNPSLETHGWGSGSVLRQEVVAASSIPATGTQAWGNASAQKLEPNPSLAMPPQPAS 1629
Query: 1599 HRRWGE-MSPAQNAATASFSTP------GLTNFSSSEPWRSMPPIPSNPPHIQSSTPPNI 1658
+ W + + QN+A S P G +S+ WR P+ SN +Q P N+
Sbjct: 1630 YGHWNDALQSGQNSAPLSTGNPAGHFPTGQPTMLASDSWRPTAPVQSN---VQLPAPTNL 1689
Query: 1659 PWGMGAPEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSAPPNATGMMWGSSAQSSASV 1718
PWGM + Q V R +Q+ WGPMP GN NM W P N + WG+S+Q SA V
Sbjct: 1690 PWGMAVADNQGAVLRQAPGNQSTGWGPMP-GNQNMGWGAPVPANPN-VNWGASSQGSAPV 1749
Query: 1719 GTNPGWNAPGQGPPVRNNIQGWQAHSSI----------PPQVNATPGWVAPNLGPMPPMN 1756
NP W APGQG N GW A + P VN + GWVAP G P +
Sbjct: 1750 NPNPSWAAPGQGQMPGNANSGWTAPGNAIPGWAPPGQGPAVVNTSSGWVAPGQGATPG-S 1809
BLAST of Cp4.1LG04g01060 vs. TAIR10
Match:
AT2G16485.1 (AT2G16485.1 nucleic acid binding;zinc ion binding;DNA binding)
HSP 1 Score: 929.1 bits (2400), Expect = 4.0e-270
Identity = 715/1793 (39.88%), Positives = 953/1793 (53.15%), Query Frame = 1
Query: 4 EENDSSKHDQPSSPLLSVDDGNDLD-VKCHTHRELHSNEEQHCLFQSAINELEFPSNSSV 63
E + S ++PSS LSV + N +D + +RE+ EQ E+E S
Sbjct: 151 ENKEVSMEEEPSSHELSVCEVNGVDSLNDEENREVG---EQIVCGSMGGEEIESDLESKK 210
Query: 64 ESLQPSDAIRGDESLVAETCLEVEETEIAGVK--ACRNGIEDMGEDSVKLEVEPDIAAMG 123
E + D I +E A+ V EI K AC G ++ L D + G
Sbjct: 211 EKV---DVI--EEETTAQAASLVNAIEIPDDKEVACVAGFTEISSQDKGL----DESGNG 270
Query: 124 LLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVNLPDNTVG 183
L E +++ + + A+ G EMD+ +++ E V D+T
Sbjct: 271 FLDEEPVKELQIGEGAKDLTDGDAKEGVDVTEDEMDIQVLKKSKEEEKV------DSTTE 330
Query: 184 CGETDTC---LSDVLAELAETTPFVHGVDTTDVANLVERKEVEENADDPKDSKDIEVAKQ 243
E +T + DV E+++ T V T KE DD K+ D +
Sbjct: 331 L-EIETMRLEVHDVATEMSDKTVISSAVVTQFTGETSNDKETV--MDDVKEDVDKD---- 390
Query: 244 ETFSMEDGKLGVPVQLVEKSELKQSLVDGAV--VEEGRTENLADRTGETLKMENDSSKTD 303
E GK + + + E +E + V+ V +EG A+ G+T+ +E +
Sbjct: 391 ----SEAGK-SLDIHVPEATEEVDTDVNYGVGIEKEGDGVGGAEEAGQTVDLEEIREENQ 450
Query: 304 EVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLADETPKIKGVHV 363
E L+ ++D E + EV ++D+ + T +LA++ + V
Sbjct: 451 E--LSKELAQVD-----ETKISEMSEVTETMIKDEDQEKDDNMT--DLAEDVENHRDSSV 510
Query: 364 TDDNIEVLKIENVEDREAGVQGLGVADESAE--VGKIENLVDETAEAENVTNYTAESMEN 423
D IE E ED E +GV + E +GK++ E T E E
Sbjct: 511 AD--IE----EGREDHE----DMGVTETQKETVLGKVDRTKIAEVSEETDTRIEDEDQEK 570
Query: 424 LDDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDVTE 483
D+ T E++ ++ AD ++EG S+E MT ++ A+E E
Sbjct: 571 DDEMTDVAEDVKTHGDSSVAD-----IEEGRESQEE---MTETQEDSVMADEEPE----- 630
Query: 484 EVDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDRRGCPKAYHP 543
EV+E +K S+G KRKRG+N+K + KK EEDVCF+CFDGGDLVLCDRRGC KAYHP
Sbjct: 631 EVEEENK-SAGGKRKRGRNTKTVKG--TGKKKEEDVCFMCFDGGDLVLCDRRGCTKAYHP 690
Query: 544 SCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVILCVRGNKGF 603
SC++RDEAFF+ KG+WNCGWHLCS CEKTA Y+CYTC FSLCKGC K+AV C+RGNKG
Sbjct: 691 SCVDRDEAFFQTKGKWNCGWHLCSKCEKTATYLCYTCMFSLCKGCAKDAVFFCIRGNKGL 750
Query: 604 CEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTFDELVHAKNP 663
CE CM V LIE+ +Q E Q+DFNDKTSWEYLFK+YW DLK LSL+ +EL AK P
Sbjct: 751 CETCMETVKLIERKQQ-EKEPAQLDFNDKTSWEYLFKDYWIDLKTQLSLSPEELDQAKRP 810
Query: 664 WKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQAKETNSPSM 723
KG ET S+ + E D DGGSD D S KKRK + RSKS + E
Sbjct: 811 LKGHETNASKQGTASET-DYVTDGGSDSD-------SSPKKRKTRSRSKSGSAEK----- 870
Query: 724 PIIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRR 783
I+ +D +EWASKELL+ V+HM+ GDR+ L +VQ LLL YIKR LRDPRR
Sbjct: 871 -ILSSGDKNLSDETMEWASKELLDLVVHMRRGDRSFLPMLEVQTLLLAYIKRYNLRDPRR 930
Query: 784 KSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTES-SQLEGDG 843
KSQ+ICDSRL+NLFGK VGHFEML LL+SHFL +E Q +D+QG + DTE + ++ D
Sbjct: 931 KSQVICDSRLQNLFGKSHVGHFEMLNLLDSHFLKKEQNQADDIQGDIVDTEEPNHVDVDE 990
Query: 844 YTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESF 903
D K+ K+KKR+ RKK ++G QSNLDD+AA+D+HNINLIYLRR+LVE L+ED +F
Sbjct: 991 NLDHPVKSGKDKKRKTRKKNVRKGRQSNLDDFAAVDMHNINLIYLRRSLVEDLLEDSTAF 1050
Query: 904 HEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVI 963
EKV +FVR+RISGN QKQDLYRLVQVVGTSKA EPYKVGKK TD +LEILNL+KTEVI
Sbjct: 1051 EEKVASAFVRLRISGN-QKQDLYRLVQVVGTSKAPEPYKVGKKTTDYVLEILNLDKTEVI 1110
Query: 964 SIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSH 1023
SIDIISNQ+FTE+ECKRL+QSIKCG++NRLTVGD+QE+A++LQ+ RVK+ +E EI+R SH
Sbjct: 1111 SIDIISNQDFTEDECKRLKQSIKCGLINRLTVGDIQEKAIALQEVRVKNLLEAEILRFSH 1170
Query: 1024 LRDRASEKGRRKEYPFY----------NIMECVEKLQLLKTPEERQRRLEELPGIHTDPN 1083
LRDRAS+ GRRKEYP+ + ECVEKLQLLK+PEERQRRLEE+P IH DP
Sbjct: 1171 LRDRASDMGRRKEYPYLLKLSNSLTMLTLRECVEKLQLLKSPEERQRRLEEIPEIHADPK 1230
Query: 1084 MDPSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSST- 1143
MDP ESEDEDE ++K +E R S F+RR R+P+SP K G + N+SW+GT N+S+T
Sbjct: 1231 MDPDCESEDEDEKEEKEKEKQLRPRSSSFNRRGRDPISPRKGGFSSNESWTGTSNYSNTS 1290
Query: 1144 -NRDLSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTA 1203
NR+LSR+ SG+G + +G+ S + ++++ W+ RE +V+ +K + E A
Sbjct: 1291 ANRELSRSYSGRGSTGRGDYLGSSDDKVSDSMWTSAREREVQPSLGSEKPRSVSIPETPA 1350
Query: 1204 RNALSGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQL 1263
R++ + A EL S S A P+V +Q N++EKIW Y+DPSGKVQGPFSM QL
Sbjct: 1351 RSSRAIAPPELSPRIASEISMAPPAV-VSQPVPKSNDSEKIWHYKDPSGKVQGPFSMAQL 1410
Query: 1264 RKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAKP 1323
RKW+NTGYFPA L +W+A++ DS+LLTD LA K
Sbjct: 1411 RKWNNTGYFPAKLEIWKANESPLDSVLLTDALA---------------------GLFQKQ 1470
Query: 1324 QGATVQSGMDVQNTGTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGDR 1383
A S M Q S GQS+ +S+ + A +IE+PR S D
Sbjct: 1471 TQAVDNSYMKAQVAAFS---------GQSS----QSEPNLGFAARIAPTTIEIPRNSQDT 1530
Query: 1384 WSSDHGNKDFTSLPSPTPSSGGTKEQPFQMATPFASSAGGGSLHGSSLMQGSENDSLRSH 1443
WS SLPSPTP+ Q+ TP A S + + ++
Sbjct: 1531 WSQGG------SLPSPTPN---------QITTPTAKRRNFESRWSPTKPSPQSANQSMNY 1590
Query: 1444 SGLNAAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINSR 1503
S + + T I + N ++ T P D ++S N + + S
Sbjct: 1591 SVAQSGQSQTSRIDIPVVVNS------AGALQPQTYPIPTPDPINVSVNHSATLHSPTPA 1650
Query: 1504 NPPIETQTVETNIS-SSMPPGQTLHRRWGEMSPAQNAATASFSTPGLTNFSSSEPWRSMP 1563
+++T+ S+ P Q +G SP+ S S PG F S+ W+
Sbjct: 1651 GGKQSWGSMQTDHGGSNTPSSQNNSTSYGTPSPS---VLPSQSQPG---FPPSDSWKVA- 1710
Query: 1564 PIPSNPPHIQSSTPPNIPWGMGAPEGQSTVPRPGLES-QNHSWGPMPSGNPNMTWAPSAP 1623
+PS P + WGM +P + QN SWG + NPNM W A
Sbjct: 1711 -VPSQP-----NAQAQAQWGMNMVNNNQNSAQPQAPANQNSSWG-QGTVNPNMGWVGPAQ 1770
Query: 1624 PNATGMMWGSSAQSSASVGTNPGWNAPGQGPPVRNNIQGW--------QAHSSIPPQVNA 1683
GSS S+ T+ GW AP QG GW Q+ S + Q
Sbjct: 1771 TGVNVNWGGSSVPSTVQGITHSGWVAPVQGQTQAYPNPGWGPTGHPQSQSQSQVQAQAGT 1772
Query: 1684 T-PGWVAPNLGPMPPMNMNPNWHAPSANQGMWSNEHGKNGDRFSNPDSVSHGGDPGNGGK 1743
T GW+ P G + N N NW N ++ GG GN
Sbjct: 1831 TGSGWMQPGQG-IQSGNSNQNWGT-------------------QNQTAIPSGGSGGNQAG 1772
Query: 1744 SWG-MPPSYGGGGG-----SSSRLPYNNKGQKLCKY-HESGHCKKGGSCDYRH 1756
WG S G G S N KGQ++CK+ E+GHC+KG SC+Y H
Sbjct: 1891 YWGNQQQSQNGDSGYGWNRQSGGQQNNFKGQRVCKFFRENGHCRKGASCNYLH 1772
BLAST of Cp4.1LG04g01060 vs. TAIR10
Match:
AT3G51120.1 (AT3G51120.1 DNA binding;zinc ion binding;nucleic acid binding;nucleic acid binding)
HSP 1 Score: 573.9 bits (1478), Expect = 3.3e-163
Identity = 340/718 (47.35%), Positives = 438/718 (61.00%), Query Frame = 1
Query: 412 ENLDDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAEEVEEMDV 471
+ L +L +A EE+ + VD+ N + T A M
Sbjct: 6 KQLQQGVPELASLAGREESSVRGIDLMRVDQCEEIGVNQVPALSVPASTVAGAVAVPMSN 65
Query: 472 TEEV---DEASKGSSGAKRKRGKNSKAPARVP----------SRKKVEEDVCFICFDGGD 531
+EV DEA+ KRKRG+ +A A P ++ EEDVCFICFDGGD
Sbjct: 66 EQEVKVIDEAAP----IKRKRGRPPRAQANTPLHIRPPPPPPKKEDKEEDVCFICFDGGD 125
Query: 532 LVLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGC 591
LVLCDRR CPKAYHP+CI RDEAFFR +WNCGWH+C C+K + YMCYTCTFS+CK C
Sbjct: 126 LVLCDRRNCPKAYHPACIKRDEAFFRTTAKWNCGWHICGTCQKASSYMCYTCTFSVCKRC 185
Query: 592 IKNAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKG 651
IK+A + VRGN G C C++ +MLIE QG E ++DF+DK SWEYLFK YW LK
Sbjct: 186 IKDADYVIVRGNMGLCGTCIKPIMLIENIAQGDNEAVKVDFDDKLSWEYLFKVYWLCLKE 245
Query: 652 SLSLTFDELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAK 711
LSLT DEL A NPWK E N+ P + N LDV+ N G+ ++R +
Sbjct: 246 ELSLTVDELTRANNPWK--EVPNTAPKVESQNDHTN---NRALDVAVN---GTKRRRTS- 305
Query: 712 RRSKSQAKETNSPSMPIIPDSQGPST-----DNNVEWASKELLEFVMHMKNGDRTVLSQF 771
+SP++P D + PS + WA+KELLEFV MKNGD +VLSQF
Sbjct: 306 ----------DSPTLPNKLDGKNPSNILKKAPGDTSWATKELLEFVSFMKNGDTSVLSQF 365
Query: 772 DVQALLLEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQ- 831
DVQ LLL+YIK+ LRDP +KSQ++CD L LFGK RVGHFEMLKLLESH LI+E +
Sbjct: 366 DVQGLLLDYIKKKNLRDPLQKSQVLCDQMLVKLFGKQRVGHFEMLKLLESHVLIQEKPKG 425
Query: 832 INDLQGSVADTESSQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNI 891
G SQ+E D D + R+ R+MR+K D R NLD YAAID+HNI
Sbjct: 426 AKTTNGETTHAVPSQIEEDSVHDPMVRDRR---RKMRRKTDGRVQNENLDAYAAIDVHNI 485
Query: 892 NLIYLRRNLVEYLIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKV 951
NLIYLRR +E L++D EKVVG+ +RI++SG+ QK D++RLVQVVGTSKA Y++
Sbjct: 486 NLIYLRRKFLESLLDDINKVDEKVVGTILRIKVSGSDQKLDIHRLVQVVGTSKAIASYQL 545
Query: 952 GKKMTDILLEILNLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAM 1011
G K TD++LEILNL+K EVISID +S+Q TE+ECKRLRQSIKCG+ RLTV D+ + A
Sbjct: 546 GAKTTDVMLEILNLDKREVISIDQLSDQNITEDECKRLRQSIKCGLNKRLTVVDILKTAA 605
Query: 1012 SLQDARVKDWMETEIVRLSHLRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLE 1071
+LQ R+ + +E EI++L+HLRDRA +KL+LLK+PEERQR L+
Sbjct: 606 TLQAMRINEALEAEILKLNHLRDRA------------------KKLELLKSPEERQRLLQ 665
Query: 1072 ELPGIHTDPNMDPSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLND 1111
E+P +HTDP+MDPSH ++ ++Q+ + ++ G P G NLN+
Sbjct: 666 EVPEVHTDPSMDPSHALSEDAGLGTRKQDNHVKAQSKG----------PQNKGVNLNN 669
BLAST of Cp4.1LG04g01060 vs. TAIR10
Match:
AT2G18090.1 (AT2G18090.1 PHD finger family protein / SWIB complex BAF60b domain-containing protein / GYF domain-containing protein)
HSP 1 Score: 322.4 bits (825), Expect = 1.7e-87
Identity = 169/380 (44.47%), Positives = 239/380 (62.89%), Query Frame = 1
Query: 469 MDVTEEVDEASKGSSGAKRKRGKNSKAPARVPS-----RKKVEEDVCFICFDGGDLVLCD 528
+D ++DE S + +RG+ + A+ S +++ +EDVCF+CFDGG LVLCD
Sbjct: 36 LDSDVKLDEEDSDSLKKRGRRGRPPRILAKASSPPISRKRREDEDVCFVCFDGGSLVLCD 95
Query: 529 RRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAV 588
RRGCPKAYHP+C+ R EAFFR++ +WNCGWH+C+ C+K + YMCYTC +S+CK C++++
Sbjct: 96 RRGCPKAYHPACVKRTEAFFRSRSKWNCGWHICTTCQKDSFYMCYTCPYSVCKRCVRSSE 155
Query: 589 ILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLT 648
+ VR NKGFC CM+ +MLIE + + EK Q+DF+D+ SWEYLFK YW LK L L+
Sbjct: 156 YVVVRENKGFCGICMKTIMLIENAAEANKEKVQVDFDDQGSWEYLFKIYWVSLKEKLGLS 215
Query: 649 FDELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKS 708
D+L AKNPWK S + ++ + +++ + DG S G K R+AK R
Sbjct: 216 LDDLTKAKNPWKSSSSTAAKRRTTSRVHEKD-DGNS---------PGVMKIRRAKVRKMD 275
Query: 709 QAKETNSPSMPIIPDSQGPSTDNN-------------VEWASKELLEFVMHMKNGDRTVL 768
+N GPS D+N WA+ ELL+FV +MKNGD +VL
Sbjct: 276 AVSVSN----------LGPSLDSNCSLGDRLPQLTSAATWATNELLDFVGYMKNGDISVL 335
Query: 769 SQFDVQALLLEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFL--IR 828
S++DVQ L+LEY++RN L++ + S+I+CDS+L LFGK RV + EMLKLL+SHF+ +R
Sbjct: 336 SKYDVQTLVLEYVRRNNLQNSPQNSEIMCDSKLMRLFGKERVDNLEMLKLLDSHFIDQVR 394
BLAST of Cp4.1LG04g01060 vs. TAIR10
Match:
AT5G63700.1 (AT5G63700.1 zinc ion binding;DNA binding)
HSP 1 Score: 233.0 bits (593), Expect = 1.4e-60
Identity = 162/588 (27.55%), Positives = 291/588 (49.49%), Query Frame = 1
Query: 507 EDVCFICFDGGDLVLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYM 566
ED CFIC DGG+L+LCD + CPK YH SC+ +D + + + C WH C C+KT
Sbjct: 22 EDWCFICKDGGNLMLCDFKDCPKVYHESCVEKDSSASKNGDSYICMWHSCYLCKKTPKLC 81
Query: 567 CYTCTFSLCKGCIKNAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWE 626
C C+ ++C+GC+ +A + ++G+KG C C +V +E+ ++ ++D D+ ++E
Sbjct: 82 CLCCSHAVCEGCVTHAEFIQLKGDKGLCNQCQEYVFALEEIQEYDAAGDKLDLTDRNTFE 141
Query: 627 YLFKEYWTDLKGSLSLTFDEL--VHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVS 686
LF EYW K LTFD++ V A P K + D L D+ S
Sbjct: 142 CLFLEYWEIAKKQEGLTFDDVRKVCASKPQKKGVKSKYKDDPKFSL--------GDVHTS 201
Query: 687 ENEESGSSKKRKAKRR--------SKS-----QAKETNSPSMPIIPDSQGPSTDNNV--- 746
++++ G K K + SKS + K + P + + + D
Sbjct: 202 KSQKKGDKLKNKDDPKFALGDAHTSKSGKKGVKLKNKDDPKFLVSDHAVEDAVDYKKVGK 261
Query: 747 -------EWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRKSQIICDS 806
W SK L++F+ + R +SQ V++++ YI+ L D +K ++ CD
Sbjct: 262 NKRMEFIRWGSKPLIDFLTSIGEDTREAMSQHSVESVIRRYIREKNLLDREKKKKVHCDE 321
Query: 807 RLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGD--GYTDASGK 866
+L ++F K + + LL +H ++E++ D E +E + +++ + K
Sbjct: 322 KLYSIFRKKSINQKRIYTLLNTH--LKENL---DQVEYFTPLELGFIEKNEKRFSEKNDK 381
Query: 867 TRKEKKRRMRKKGDQRGLQSNLD------DYAAIDIHNINLIYLRRNLVEYLIEDEESFH 926
K++ + D + + +A I+ N+ L+YLR++LV L++ +SF
Sbjct: 382 VMMPCKKQKTESSDDEICEKEVQPEMRATGFATINADNLKLVYLRKSLVLELLKQNDSFV 441
Query: 927 EKVVGSFVRIRISGNAQKQDL-YRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVI 986
+KVVGSFV+++ N + + Y+++QV G A + + +LL + + +
Sbjct: 442 DKVVGSFVKVK---NGPRDFMAYQILQVTGIKNADD------QSEGVLLHVSGM--ASGV 501
Query: 987 SIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSH 1046
SI + + + EEE K L+Q + G+L + TV +++++A +L K W+ ++ L
Sbjct: 502 SISKLDDSDIREEEIKDLKQKVMNGLLRQTTVVEMEQKAKALHYDITKHWIARQLNILQK 561
Query: 1047 LRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHTD 1061
+ A+EKG R+E + E +E+ +LL+ P E++R L+E+P I D
Sbjct: 562 RINCANEKGWRRE-----LEEYLEQRELLEKPSEQERLLKEIPRIIED 580
BLAST of Cp4.1LG04g01060 vs. TAIR10
Match:
AT5G08430.1 (AT5G08430.1 SWIB/MDM2 domain;Plus-3;GYF)
HSP 1 Score: 183.7 bits (465), Expect = 9.6e-46
Identity = 161/562 (28.65%), Positives = 262/562 (46.62%), Query Frame = 1
Query: 728 VEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIKRNKLRDPRRKSQIICDSRLENLF 787
V W S++L+EF+ + ++S++DV + +YI + L DP K +++CD RL LF
Sbjct: 30 VGWGSRQLIEFLHSLGKDTSEMISRYDVSDTIAKYISKEGLLDPSNKKKVVCDKRLVLLF 89
Query: 788 GKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTESSQLEGDGYTDASGKTRKEKKRR 847
G + ++ LLE H+ +D D++ L D + K KR
Sbjct: 90 GTRTIFRMKVYDLLEKHYKENQD-----------DSDFDFLYEDE-PQIICHSEKIAKRT 149
Query: 848 MRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEYLIEDEESFHEKVVGSFVRIRISG 907
+ RG +AAI NI L+YLR++LV+ L++ ++F K++GSFVRI+
Sbjct: 150 SKVVKKPRGT------FAAIVSDNIKLLYLRKSLVQELLKSPDTFEGKMLGSFVRIKSDP 209
Query: 908 NAQKQDL-YRLVQVVGTSKASEPYKVGKKMTDILLEILNLNKTEVISIDIISNQEFTEEE 967
N Q Y+LVQV G K D LL++ N K +SI ++S+ F++EE
Sbjct: 210 NDYLQKYPYQLVQVTGVKKEHGT-------DDFLLQVTNYVKD--VSISVLSDDNFSQEE 269
Query: 968 CKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWMETEIVRLSHLRDRASEKGRRKEY 1027
C+ L Q IK G+L + T+ +++E+A L + K W+ EI L L DRA+EKG R+E
Sbjct: 270 CEDLHQRIKNGLLKKPTIVEMEEKAKKLHKDQTKHWLGREIELLKRLIDRANEKGWRRE- 329
Query: 1028 PFYNIMECVEKLQLLKTPEERQRRLEELPGI-------HTDPNMDPSHESEDEDEADDKR 1087
+ E ++K +LL+ P+E+ R L E+P + + + + +H+S++E +
Sbjct: 330 ----LSEYLDKRELLQNPDEQARLLREVPEVIGEELVQNPEVSSPEAHKSDNEQRLSESP 389
Query: 1088 QET-YTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSTNRDLSRNLSGKGFS--- 1147
+ + + G SN + T + N+ L ++ G
Sbjct: 390 LSCIHETPEARNLFGGEDQQFNNGYVMSNPITTPGITSCATEINKGLPTWIASAGAEYLH 449
Query: 1148 ---NQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNALSGAASELP 1207
Q + I GE E S + + N + QV P+ EL
Sbjct: 450 VDVEQPANGIIGGETPTEESKVSQLQSSIPVNNVDNGSQVQPNPSEVI---------ELS 509
Query: 1208 SAARSVNSAASPSVGTTQNAATVN-ETEKI-WRYQDPSGKVQGPFSMVQLRKWSNTGYFP 1267
N ++ ++ + EK+ W Y+DP G VQGPFS+ QL+ WS+ YF
Sbjct: 510 DDDEDDNGDGETLDPKVEDVRVLSYDKEKLNWLYKDPQGLVQGPFSLTQLKAWSDAEYFT 550
Query: 1268 ADLRVWRASDKQDDSLLLTDVL 1273
RVW + + ++LLTDVL
Sbjct: 570 KQFRVWMTGESMESAVLLTDVL 550
BLAST of Cp4.1LG04g01060 vs. NCBI nr
Match:
gi|778722712|ref|XP_004148557.2| (PREDICTED: zinc finger CCCH domain-containing protein 19 isoform X1 [Cucumis sativus])
HSP 1 Score: 2541.1 bits (6585), Expect = 0.0e+00
Identity = 1381/1819 (75.92%), Positives = 1499/1819 (82.41%), Query Frame = 1
Query: 1 MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRE-LHSNEEQHCLFQSAINELEFPSN 60
MEAEE+DSS DQ SS L VDDG LDVKC T+RE L SNE+QHC+ +S+I E F N
Sbjct: 1 MEAEEDDSSYQDQKSSSLY-VDDGK-LDVKCDTNREELLSNEQQHCVSKSSIIETGFSPN 60
Query: 61 SSVESLQPSDAIRGDESLVAETCLE------VEETEIAGVKACRNGIEDMGEDSVKLEVE 120
+ VESL P DAI GDE L TC E VEE E RN I+DMGEDSVKLE+E
Sbjct: 61 TVVESLPPRDAILGDEILAVGTCSEMEKKDLVEERERVEENDFRNIIQDMGEDSVKLEIE 120
Query: 121 PDIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVN 180
P IA GLL + F+DVK+ EE KA++EF EG+LL M VG AENQVEGNVLM N
Sbjct: 121 PGIAKAGLLEQRAFDDVKKNTGVTEEEKALSEFAEGELLPGMVFVGVAENQVEGNVLMAN 180
Query: 181 LPDNTV-----GCGETD--TCLSDVLAE--LAETTPFVHGVDTTDVANLVERKEVEENAD 240
++TV GC ET TCLS VLAE LAETTPFV GVD T NLV++ EVEE+AD
Sbjct: 181 FSEHTVVDGSAGCVETTETTCLSYVLAEERLAETTPFVQGVDVTVATNLVQKTEVEEHAD 240
Query: 241 DPKDSKDIEVAKQETFSMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGET 300
D DSKD EV KQE F++E +LGV VQL E SELK SLVDG V EGRTENLADRTGET
Sbjct: 241 DTNDSKDTEVPKQENFAVEKMELGVQVQLEEDSELKVSLVDGVV--EGRTENLADRTGET 300
Query: 301 LKMENDSSKTDEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLA 360
LKMEN SS ++EVGL +FA EI V + N EDKT+E DGMC+E+KA D NLA
Sbjct: 301 LKMENASSTSNEVGLTHFAVEIKETVNIGNDEDKTMETDGMCVEEKATDVGMME---NLA 360
Query: 361 DETPKIKGVHVTDDNIEVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENV- 420
DETP+IKGV V D +IE LKIE++EDREAGVQGLG+ADES V K+EN+ DE AE E V
Sbjct: 361 DETPEIKGVDVADYSIEELKIEDMEDREAGVQGLGLADESPVVEKLENVADENAEPEGVQ 420
Query: 421 -TNYTAESM--ENL-DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGE 480
T+YTAE + EN+ DDKTAQ EE+AM EE E DD VYLVDEGIGSEE D NMTYLV E
Sbjct: 421 VTDYTAEEVKSENVEDDKTAQGEEVAMGEEIAEPDDMVYLVDEGIGSEETDVNMTYLVEE 480
Query: 481 TEAAEEVEEMDVTEEVDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDL 540
TEAAEEVEEMD TEEVDE + SSG+KRKRGKNSKAPARV SRKKVEEDVCFICFDGGDL
Sbjct: 481 TEAAEEVEEMDATEEVDEPNISSSGSKRKRGKNSKAPARVASRKKVEEDVCFICFDGGDL 540
Query: 541 VLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI 600
VLCDRRGCPKAYHP+CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI
Sbjct: 541 VLCDRRGCPKAYHPACINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI 600
Query: 601 KNAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGS 660
KNAVILCVRGNKGFCE CMRFV IEKNEQG+ EKGQIDFNDK SWEYLFKEYWTDLKGS
Sbjct: 601 KNAVILCVRGNKGFCETCMRFVTSIEKNEQGNKEKGQIDFNDKNSWEYLFKEYWTDLKGS 660
Query: 661 LSLTFDELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKR 720
LSLTFDELVHAKNPWKGSETL SRPDSPGEL DGNVDGGSDLDVSENEESGSSKKRKAK+
Sbjct: 661 LSLTFDELVHAKNPWKGSETLTSRPDSPGELCDGNVDGGSDLDVSENEESGSSKKRKAKK 720
Query: 721 RSKSQAKETNSPSMPIIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALL 780
RS+SQAKE +SPSMP SQG STD+NVEW SKELLEFVMHMKNGDRTVLSQFDVQALL
Sbjct: 721 RSRSQAKEMSSPSMPATA-SQGLSTDDNVEWGSKELLEFVMHMKNGDRTVLSQFDVQALL 780
Query: 781 LEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGS 840
LEYIKRNKLRDPRRKSQIICDSRLE+LFGKPRVGHFEMLKLLESHFLI+ED QINDL S
Sbjct: 781 LEYIKRNKLRDPRRKSQIICDSRLESLFGKPRVGHFEMLKLLESHFLIKEDAQINDLHVS 840
Query: 841 VADTESSQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRR 900
VA+TESSQLE DG TD SGK +KEKKRR RKK D+RGLQSNLDDYAAIDIHNINLIYL+R
Sbjct: 841 VAETESSQLEADG-TDGSGKIKKEKKRRTRKK-DERGLQSNLDDYAAIDIHNINLIYLKR 900
Query: 901 NLVEYLIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDI 960
NLVEYLIEDEESFH+KVVGSFVRIRISG+AQKQDLYRLVQVVGTSKASEPYKVGK+MTDI
Sbjct: 901 NLVEYLIEDEESFHDKVVGSFVRIRISGSAQKQDLYRLVQVVGTSKASEPYKVGKRMTDI 960
Query: 961 LLEILNLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARV 1020
LLEILNLNKTEV+SIDIISNQEFTE+ECKRLRQS+KCGI+NRLTVGDLQERAMSLQDARV
Sbjct: 961 LLEILNLNKTEVVSIDIISNQEFTEDECKRLRQSMKCGIINRLTVGDLQERAMSLQDARV 1020
Query: 1021 KDWMETEIVRLSHLRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHT 1080
KDWMETEIVRLSHLRDRASEKGRRKE + ECVEKLQLLKTPEERQRR+EE+P IH
Sbjct: 1021 KDWMETEIVRLSHLRDRASEKGRRKE-----LRECVEKLQLLKTPEERQRRIEEIPEIHA 1080
Query: 1081 DPNMDPSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFS 1140
DPNMDPSHESEDEDEADDKR+ETYTLSR + F RRTREPVSPGK GS+LNDSWSGTRNFS
Sbjct: 1081 DPNMDPSHESEDEDEADDKRRETYTLSRSTSFGRRTREPVSPGKGGSHLNDSWSGTRNFS 1140
Query: 1141 STNRDLSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMT 1200
+TNRD+SRNLSGKGF+NQG+DAIGSGEIINE SW HGRE DVKK +KWDK QVSPSSE+T
Sbjct: 1141 NTNRDMSRNLSGKGFANQGDDAIGSGEIINETSWGHGRERDVKKTSKWDK-QVSPSSEIT 1200
Query: 1201 ARNALSGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQ 1260
ARNALSGAASE SAA SVN AAS SVGTTQNAATVNE+EKIW YQDPSGKVQGPFSMVQ
Sbjct: 1201 ARNALSGAASE-SSAAHSVNPAASSSVGTTQNAATVNESEKIWHYQDPSGKVQGPFSMVQ 1260
Query: 1261 LRKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAK 1320
LRKWSNTGYFP DLR+WR SD+Q+DSLLLTDVLAGKI KDT NS+Q ++S FV +
Sbjct: 1261 LRKWSNTGYFPTDLRIWRISDQQEDSLLLTDVLAGKISKDTPLTSNSLQVHPNSSPFVGR 1320
Query: 1321 PQGATVQSGMDVQNTGTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGD 1380
PQG T+QSG+D QN +SN HTNPTSY QS+GGRWKSQ EVSPTG P S SI+VPRYSGD
Sbjct: 1321 PQGGTLQSGVDGQNASSSNSHTNPTSYDQSSGGRWKSQNEVSPTGRPVSGSIKVPRYSGD 1380
Query: 1381 RWSSDHGNKDFTSLPSPTPSSGGTKEQPFQMATPF-----ASSAGGGSLHGSSLMQGSEN 1440
RWSSDHGNK+FT+LPSPTPSSGG+KEQPFQ+A F S GG LHGSS+MQGSEN
Sbjct: 1381 RWSSDHGNKNFTNLPSPTPSSGGSKEQPFQVAASFMEAKSLSGTAGGGLHGSSVMQGSEN 1440
Query: 1441 DSLRSHSGLNAAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLV 1500
DSLRSH G N++EKG G GPIN LQNH S PVR S IIDD +NPAADI+SISANL SLV
Sbjct: 1441 DSLRSHLGRNSSEKGLGSGPINALQNHQSQPVRQSPIIDDASLNPAADIRSISANLQSLV 1500
Query: 1501 QSINSRNPPIETQ------------------------TVETNISSSMPPGQTLHRRWGEM 1560
QSINSRNPPIE VE+N+SSSMPP QTLH RWGEM
Sbjct: 1501 QSINSRNPPIEAHGHGSGSILKRETDTSEAWQNAHSLKVESNVSSSMPPAQTLHSRWGEM 1560
Query: 1561 SPAQNAA---------TASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPPNIPWGMG 1620
SPAQNAA T+SFS+ G+++F SS+PWRS PI SNP HIQ STPPN+PWGMG
Sbjct: 1561 SPAQNAAVTSFSAGSSTSSFSSAGMSSFPSSDPWRSTAPISSNPQHIQCSTPPNLPWGMG 1620
Query: 1621 APEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSA-PPNATGMMWGSSAQSSASVGTNP 1680
APEGQSTVPR G ESQN +WGPMPSGNPNM W P+ PPNAT MMWG++AQSS TNP
Sbjct: 1621 APEGQSTVPRQGSESQNQTWGPMPSGNPNMGWGPTGPPPNATAMMWGATAQSSGPAATNP 1680
Query: 1681 GWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGM 1740
GW APGQGP NN+QGW AHS +PP VNATPGWV PN+ PMPPMNMNP+W PS NQ M
Sbjct: 1681 GWIAPGQGPAAGNNLQGWPAHSPMPPPVNATPGWVGPNVAPMPPMNMNPSWLVPSVNQNM 1740
Query: 1741 WSNEHGKNGDRFSN-PDSVSHGGDPGNGGKSWGMPPSY--GGGGGSSSRLPYNNKGQKLC 1757
W NEHGKNG+RFSN D SHGGDPGNG KSWGM PS+ GGGGG +SR PY N+ QKLC
Sbjct: 1741 WGNEHGKNGNRFSNQKDGGSHGGDPGNGDKSWGMQPSFGGGGGGGGNSRSPY-NRVQKLC 1800
BLAST of Cp4.1LG04g01060 vs. NCBI nr
Match:
gi|700187939|gb|KGN43172.1| (hypothetical protein Csa_7G006220 [Cucumis sativus])
HSP 1 Score: 2514.6 bits (6516), Expect = 0.0e+00
Identity = 1369/1819 (75.26%), Positives = 1487/1819 (81.75%), Query Frame = 1
Query: 1 MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRE-LHSNEEQHCLFQSAINELEFPSN 60
MEAEE+DSS DQ SS L VDDG LDVKC T+RE L SNE+QHC+ +S+I E F N
Sbjct: 1 MEAEEDDSSYQDQKSSSLY-VDDGK-LDVKCDTNREELLSNEQQHCVSKSSIIETGFSPN 60
Query: 61 SSVESLQPSDAIRGDESLVAETCLE------VEETEIAGVKACRNGIEDMGEDSVKLEVE 120
+ VESL P DAI GDE L TC E VEE E RN I+DMGEDSVKLE+E
Sbjct: 61 TVVESLPPRDAILGDEILAVGTCSEMEKKDLVEERERVEENDFRNIIQDMGEDSVKLEIE 120
Query: 121 PDIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVN 180
P IA GLL + F+DVK+ EE KA++EF EG+LL M VG AENQVEGNVLM N
Sbjct: 121 PGIAKAGLLEQRAFDDVKKNTGVTEEEKALSEFAEGELLPGMVFVGVAENQVEGNVLMAN 180
Query: 181 LPDNTV-----GCGETD--TCLSDVLAE--LAETTPFVHGVDTTDVANLVERKEVEENAD 240
++TV GC ET TCLS VLAE LAETTPFV GVD T NLV++ EVEE+AD
Sbjct: 181 FSEHTVVDGSAGCVETTETTCLSYVLAEERLAETTPFVQGVDVTVATNLVQKTEVEEHAD 240
Query: 241 DPKDSKDIEVAKQETFSMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGET 300
D DSKD EV KQE F++E +LGV VQL E SELK SLVDG V EGRTENLADRTGET
Sbjct: 241 DTNDSKDTEVPKQENFAVEKMELGVQVQLEEDSELKVSLVDGVV--EGRTENLADRTGET 300
Query: 301 LKMENDSSKTDEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLA 360
LKMEN SS ++EVGL +FA EI V + N EDKT+E DGMC+E+KA D NLA
Sbjct: 301 LKMENASSTSNEVGLTHFAVEIKETVNIGNDEDKTMETDGMCVEEKATDVGMME---NLA 360
Query: 361 DETPKIKGVHVTDDNIEVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENV- 420
DETP+IKGV V D +IE LKIE++EDREAGVQGLG+ADES V K+EN+ DE AE E V
Sbjct: 361 DETPEIKGVDVADYSIEELKIEDMEDREAGVQGLGLADESPVVEKLENVADENAEPEGVQ 420
Query: 421 -TNYTAESM--ENL-DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGE 480
T+YTAE + EN+ DDKTAQ EE+AM EE E DD VYLVDEGIGSEE D NMTYLV E
Sbjct: 421 VTDYTAEEVKSENVEDDKTAQGEEVAMGEEIAEPDDMVYLVDEGIGSEETDVNMTYLVEE 480
Query: 481 TEAAEEVEEMDVTEEVDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDL 540
TEAAEEVEEMD TEEVDE + SSG+KRKRGKNSKAPARV SRKKVEEDVCFICFDGGDL
Sbjct: 481 TEAAEEVEEMDATEEVDEPNISSSGSKRKRGKNSKAPARVASRKKVEEDVCFICFDGGDL 540
Query: 541 VLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI 600
VLCDRRGCPKAYHP+CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI
Sbjct: 541 VLCDRRGCPKAYHPACINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI 600
Query: 601 KNAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGS 660
KNAVILCVRGNKGFCE CMRFV IEKNEQG+ EKGQIDFNDK SWEYLFKEYWTDLKGS
Sbjct: 601 KNAVILCVRGNKGFCETCMRFVTSIEKNEQGNKEKGQIDFNDKNSWEYLFKEYWTDLKGS 660
Query: 661 LSLTFDELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKR 720
LSLTFDELVHAKNPWKGSETL SRPDSPGEL DGNVDGGSDLDVSENEESGSSKKRKAK+
Sbjct: 661 LSLTFDELVHAKNPWKGSETLTSRPDSPGELCDGNVDGGSDLDVSENEESGSSKKRKAKK 720
Query: 721 RSKSQAKETNSPSMPIIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALL 780
RS+SQAKE +SPSMP SQG STD+NVEW SKELLEFVMHMKNGDRTVLSQFDVQALL
Sbjct: 721 RSRSQAKEMSSPSMPATA-SQGLSTDDNVEWGSKELLEFVMHMKNGDRTVLSQFDVQALL 780
Query: 781 LEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGS 840
LEYIKRNKLRDPRRKSQIICDSRLE+LFGKPRVGHFEMLKLLESHFLI+ED QINDL S
Sbjct: 781 LEYIKRNKLRDPRRKSQIICDSRLESLFGKPRVGHFEMLKLLESHFLIKEDAQINDLHVS 840
Query: 841 VADTESSQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRR 900
VA+TESSQLE DG TD SGK +KEKKRR RKK D+RGLQSNLDDYAAIDIHNINLIYL+R
Sbjct: 841 VAETESSQLEADG-TDGSGKIKKEKKRRTRKK-DERGLQSNLDDYAAIDIHNINLIYLKR 900
Query: 901 NLVEYLIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDI 960
NLVEYLIEDEESFH+KVVGSFVRIRISG+AQKQDLYRLVQVVGTSKASEPYKVGK+MTDI
Sbjct: 901 NLVEYLIEDEESFHDKVVGSFVRIRISGSAQKQDLYRLVQVVGTSKASEPYKVGKRMTDI 960
Query: 961 LLEILNLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARV 1020
LLEILNLNKTEV+SIDIISNQEFTE+ECKRLRQS+KCGI+NRLTVGDLQERAMSLQDARV
Sbjct: 961 LLEILNLNKTEVVSIDIISNQEFTEDECKRLRQSMKCGIINRLTVGDLQERAMSLQDARV 1020
Query: 1021 KDWMETEIVRLSHLRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHT 1080
KDWMETEIVRLSHL + ECVEKLQLLKTPEERQRR+EE+P IH
Sbjct: 1021 KDWMETEIVRLSHLHSLL-------------LRECVEKLQLLKTPEERQRRIEEIPEIHA 1080
Query: 1081 DPNMDPSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFS 1140
DPNMDPSHESEDEDEADDKR+ETYTLSR + F RRTREPVSPGK GS+LNDSWSGTRNFS
Sbjct: 1081 DPNMDPSHESEDEDEADDKRRETYTLSRSTSFGRRTREPVSPGKGGSHLNDSWSGTRNFS 1140
Query: 1141 STNRDLSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMT 1200
+TNRD+SRNLSGKGF+NQG+DAIGSGEIINE SW HGRE DVKK +KWDK QVSPSSE+T
Sbjct: 1141 NTNRDMSRNLSGKGFANQGDDAIGSGEIINETSWGHGRERDVKKTSKWDK-QVSPSSEIT 1200
Query: 1201 ARNALSGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQ 1260
ARNALSGAASE SAA SVN AAS SVGTTQNAATVNE+EKIW YQDPSGKVQGPFSMVQ
Sbjct: 1201 ARNALSGAASE-SSAAHSVNPAASSSVGTTQNAATVNESEKIWHYQDPSGKVQGPFSMVQ 1260
Query: 1261 LRKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAK 1320
LRKWSNTGYFP DLR+WR SD+Q+DSLLLTDVLAGKI KDT NS+Q ++S FV +
Sbjct: 1261 LRKWSNTGYFPTDLRIWRISDQQEDSLLLTDVLAGKISKDTPLTSNSLQVHPNSSPFVGR 1320
Query: 1321 PQGATVQSGMDVQNTGTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGD 1380
PQG T+QSG+D QN +SN HTNPTSY QS+GGRWKSQ EVSPTG P S SI+VPRYSGD
Sbjct: 1321 PQGGTLQSGVDGQNASSSNSHTNPTSYDQSSGGRWKSQNEVSPTGRPVSGSIKVPRYSGD 1380
Query: 1381 RWSSDHGNKDFTSLPSPTPSSGGTKEQPFQMATPF-----ASSAGGGSLHGSSLMQGSEN 1440
RWSSDHGNK+FT+LPSPTPSSGG+KEQPFQ+A F S GG LHGSS+MQGSEN
Sbjct: 1381 RWSSDHGNKNFTNLPSPTPSSGGSKEQPFQVAASFMEAKSLSGTAGGGLHGSSVMQGSEN 1440
Query: 1441 DSLRSHSGLNAAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLV 1500
DSLRSH G N++EKG G GPIN LQNH S PVR S IIDD +NPAADI+SISANL SLV
Sbjct: 1441 DSLRSHLGRNSSEKGLGSGPINALQNHQSQPVRQSPIIDDASLNPAADIRSISANLQSLV 1500
Query: 1501 QSINSRNPPIETQ------------------------TVETNISSSMPPGQTLHRRWGEM 1560
QSINSRNPPIE VE+N+SSSMPP QTLH RWGEM
Sbjct: 1501 QSINSRNPPIEAHGHGSGSILKRETDTSEAWQNAHSLKVESNVSSSMPPAQTLHSRWGEM 1560
Query: 1561 SPAQNAA---------TASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPPNIPWGMG 1620
SPAQNAA T+SFS+ G+++F SS+PWRS PI SNP HIQ STPPN+PWGMG
Sbjct: 1561 SPAQNAAVTSFSAGSSTSSFSSAGMSSFPSSDPWRSTAPISSNPQHIQCSTPPNLPWGMG 1620
Query: 1621 APEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSA-PPNATGMMWGSSAQSSASVGTNP 1680
APEGQSTVPR G ESQN +WGPMPSGNPNM W P+ PPNAT MMWG++AQSS TNP
Sbjct: 1621 APEGQSTVPRQGSESQNQTWGPMPSGNPNMGWGPTGPPPNATAMMWGATAQSSGPAATNP 1680
Query: 1681 GWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGM 1740
GW APGQGP NN+QGW AHS +PP VNATPGWV PN+ PMPPMNMNP+W PS NQ M
Sbjct: 1681 GWIAPGQGPAAGNNLQGWPAHSPMPPPVNATPGWVGPNVAPMPPMNMNPSWLVPSVNQNM 1740
Query: 1741 WSNEHGKNGDRFSN-PDSVSHGGDPGNGGKSWGMPPSY--GGGGGSSSRLPYNNKGQKLC 1757
W NEHGKNG+RFSN D SHGGDPGNG KSWGM PS+ GGGGG +SR PY N+ QKLC
Sbjct: 1741 WGNEHGKNGNRFSNQKDGGSHGGDPGNGDKSWGMQPSFGGGGGGGGNSRSPY-NRVQKLC 1793
BLAST of Cp4.1LG04g01060 vs. NCBI nr
Match:
gi|659094430|ref|XP_008448056.1| (PREDICTED: zinc finger CCCH domain-containing protein 19 isoform X1 [Cucumis melo])
HSP 1 Score: 2502.6 bits (6485), Expect = 0.0e+00
Identity = 1357/1791 (75.77%), Positives = 1471/1791 (82.13%), Query Frame = 1
Query: 1 MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRE-LHSNEEQHCLFQSAINELEFPSN 60
MEAEE+DSS HDQ SS LDVKC T+RE LHSNE+QHC +S+I E EF N
Sbjct: 1 MEAEEDDSSYHDQKSS---------SLDVKCDTNREELHSNEQQHCASKSSIIETEFSPN 60
Query: 61 SSVESLQPSDAIRGDESLVAETCLE------VEETEIAGVKACRNGIEDMGEDSVKLEVE 120
+ VESL P DAI GDE L +TC E VEE EI K RN I+DM EDSVKLE+E
Sbjct: 61 TVVESLPPRDAILGDEILAVDTCSEMEKKDLVEEKEIKEEKDSRNIIQDMAEDSVKLEIE 120
Query: 121 PDIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVN 180
PDI GL + F+DVKE EE KA++EF +G+LL EM VG AENQ EGNVLM N
Sbjct: 121 PDIEKTGLSEQRAFDDVKENTGVTEEEKALSEFAQGELLPEMVFVGVAENQAEGNVLMAN 180
Query: 181 LPDNTV-----GCGETD--TCLSDVLAE--LAETTPFVHGVDTTDVANLVERKEVEENAD 240
++TV GC ET TCLSDVLAE LAETT FV VD TD NLV++ +VEE+AD
Sbjct: 181 FSEHTVVDGSAGCVETTETTCLSDVLAEETLAETTLFVQDVDVTDAINLVQKTKVEEHAD 240
Query: 241 DPKDSKDIEVAKQETFSMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGET 300
D DSKD EV KQE FS+E +LGV VQL E SELK SLVDGAV EGRTENLADR GET
Sbjct: 241 DANDSKDTEVPKQENFSVEKMELGVRVQLEENSELKGSLVDGAV--EGRTENLADRPGET 300
Query: 301 LKMENDSSKTDEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLA 360
LK EN SS T+EVGL + A EI V + N EDKT+E+DGMC+EDKA T NL
Sbjct: 301 LKRENASSTTNEVGLTHIAVEIKETVNVGNAEDKTIEMDGMCMEDKA---TAVGMMENLT 360
Query: 361 DETPKIKGVHVTDDNIEVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENV- 420
DETP+IKGV V D +IE LKIE++EDREAGVQGLG+AD+S V K+EN+ DE AEAE V
Sbjct: 361 DETPEIKGVDVADYSIEELKIEDMEDREAGVQGLGLADKSPVVEKLENVADENAEAEGVQ 420
Query: 421 -TNYTAESM--ENL-DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGE 480
T+YTAE + EN+ DDKTAQ EEIAM EE E DD VYLVDEGIGSEE D NMTYLV E
Sbjct: 421 VTDYTAEEVKSENVEDDKTAQGEEIAMAEEIAEPDDMVYLVDEGIGSEETDVNMTYLVEE 480
Query: 481 TEAAEEVEEMDVTEEVDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDL 540
TEAAEEVEEMDVTEE+DE + SSG+KRKRGKNSKAPARV SRKKVEEDVCFICFDGGDL
Sbjct: 481 TEAAEEVEEMDVTEEMDEPNISSSGSKRKRGKNSKAPARVASRKKVEEDVCFICFDGGDL 540
Query: 541 VLCDRRGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI 600
VLCDRRGCPKAYHP+CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI
Sbjct: 541 VLCDRRGCPKAYHPACINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCI 600
Query: 601 KNAVILCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGS 660
KNAVI CVRGNKGFCE CMRFV IEKNEQGSTEKGQIDFNDK SWEYLFKEYW DLKGS
Sbjct: 601 KNAVIFCVRGNKGFCETCMRFVTSIEKNEQGSTEKGQIDFNDKNSWEYLFKEYWIDLKGS 660
Query: 661 LSLTFDELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKR 720
LSLTFDELVHAKNPWKGSETL SRPDSPGEL DGNVDGGSDLDVSENEESGSSKKRKAK+
Sbjct: 661 LSLTFDELVHAKNPWKGSETLTSRPDSPGELCDGNVDGGSDLDVSENEESGSSKKRKAKK 720
Query: 721 RSKSQAKETNSPSMPIIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALL 780
RS+SQAKE +SPSMP I DSQG S D+NVEWASKELLEFVMHMKNGDRTVLSQFDVQALL
Sbjct: 721 RSRSQAKEMSSPSMPAIADSQGLSADDNVEWASKELLEFVMHMKNGDRTVLSQFDVQALL 780
Query: 781 LEYIKRNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGS 840
LEYIKRNKLRDPRRKSQIICDSRLE+LFGKPRVGHFEMLKLLESHFLI+ED QINDL GS
Sbjct: 781 LEYIKRNKLRDPRRKSQIICDSRLESLFGKPRVGHFEMLKLLESHFLIKEDAQINDLHGS 840
Query: 841 VADTESSQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRR 900
VA+TESSQLE DG TD SGK +KEKKRR RKK D+RGLQSNLDDYAAIDIHNINLIYL+R
Sbjct: 841 VAETESSQLEADG-TDGSGKIKKEKKRRTRKK-DERGLQSNLDDYAAIDIHNINLIYLKR 900
Query: 901 NLVEYLIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDI 960
NLVEYLIEDEESFH+KVVGSFVRIRISG+AQKQDLYRLVQVVGTSKASEPYKVGK+MTDI
Sbjct: 901 NLVEYLIEDEESFHDKVVGSFVRIRISGSAQKQDLYRLVQVVGTSKASEPYKVGKRMTDI 960
Query: 961 LLEILNLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARV 1020
LLEILNLNKTEV+SIDIISNQEFTE+ECKRLRQS+KCGI+NRLTVG+LQERAMSLQDARV
Sbjct: 961 LLEILNLNKTEVVSIDIISNQEFTEDECKRLRQSMKCGIINRLTVGELQERAMSLQDARV 1020
Query: 1021 KDWMETEIVRLSHLRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHT 1080
KDWMETEIVRLSHLRDRASEKGRRKE + ECVEKLQLLKTPEERQRR+EE+P IH
Sbjct: 1021 KDWMETEIVRLSHLRDRASEKGRRKE-----LRECVEKLQLLKTPEERQRRIEEIPEIHA 1080
Query: 1081 DPNMDPSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFS 1140
DPNMDPSHESEDEDEADDKR+ETYTLSR + F RRTREPVSPGK GS+LNDSWSGTRNFS
Sbjct: 1081 DPNMDPSHESEDEDEADDKRRETYTLSRSTSFGRRTREPVSPGKGGSHLNDSWSGTRNFS 1140
Query: 1141 STNRDLSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMT 1200
+TNRD+SRNLSGKGFSNQG+DAIGSGEIINE SW HGRE DVKK +KWDK QVSPSSEMT
Sbjct: 1141 NTNRDMSRNLSGKGFSNQGDDAIGSGEIINETSWGHGRERDVKKTSKWDK-QVSPSSEMT 1200
Query: 1201 ARNALSGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQ 1260
ARNALSGAASE SAA SVN S SVGTTQNAAT NE+EKIW YQDPSGKVQGPFSMVQ
Sbjct: 1201 ARNALSGAASE-SSAAHSVNPTVSSSVGTTQNAATANESEKIWHYQDPSGKVQGPFSMVQ 1260
Query: 1261 LRKWSNTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAK 1320
LRKWSNTGYFP DLR+WR SD+Q+DSLLLTDVLAGKI KDT NS+Q ++S FV +
Sbjct: 1261 LRKWSNTGYFPTDLRIWRISDQQEDSLLLTDVLAGKISKDTPLTSNSLQVHPNSSPFVGR 1320
Query: 1321 PQGATVQSGMDVQNTGTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGD 1380
PQG T+QSG+D QN +SN HTNPTSY QS+GGRWKSQ EVSPTG P S SI+VPRYSG+
Sbjct: 1321 PQGGTLQSGVDGQNASSSNSHTNPTSYDQSSGGRWKSQNEVSPTGRPVSGSIKVPRYSGE 1380
Query: 1381 RWSSDHGNKDFTSLPSPTPSSGGTKEQPFQMATPF-----ASSAGGGSLHGSSLMQGSEN 1440
RWSSDHGNK+FT+LPSPTPSSGGTKEQPFQ+A F S GGG LHGSS+MQGSEN
Sbjct: 1381 RWSSDHGNKNFTNLPSPTPSSGGTKEQPFQVAASFMEAKSLSGTGGGGLHGSSVMQGSEN 1440
Query: 1441 DSLRSHSGLNAAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLV 1500
D LRSH G N++EKG G GPIN LQNH S PVR S IIDD +NPAADI+SISANL SLV
Sbjct: 1441 DPLRSHLGRNSSEKGMGSGPINALQNHQSQPVRQSPIIDDASLNPAADIRSISANLQSLV 1500
Query: 1501 QSINSRNPPIE------------------------TQTVETNISSSMPPGQTLHRRWGEM 1560
QSINSRNPPIE + VE+N+SSSMPP QTLH RWGEM
Sbjct: 1501 QSINSRNPPIEAHGRGSGSILKRETDTSEAWQNAQSHKVESNVSSSMPPAQTLHSRWGEM 1560
Query: 1561 SPAQNAA---------TASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPPNIPWGMG 1620
SPAQNAA T+SFS+ GL+NF SS+PWRS PI +NP HIQ STPPN+ WGMG
Sbjct: 1561 SPAQNAAVTSFSAGSSTSSFSSAGLSNFPSSDPWRSTAPISNNPQHIQCSTPPNLAWGMG 1620
Query: 1621 APEGQSTVPRPGLESQNHSWGPMPSGNPNMTWAPSA-PPNATGMMWGSSAQSSASVGTNP 1680
APEGQSTVPRPG ESQN +WGPMPSGNPNM W P+A PPNA+ MMWG++AQSS TNP
Sbjct: 1621 APEGQSTVPRPGSESQNQTWGPMPSGNPNMGWGPTAPPPNASAMMWGTTAQSSGPAATNP 1680
Query: 1681 GWNAPGQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGM 1731
GW APGQGP NNIQGW AHS +PP VNATPGWV N+ PMPPMNMNP+W PS NQ M
Sbjct: 1681 GWIAPGQGPAAGNNIQGWPAHSPMPPPVNATPGWVGSNVAPMPPMNMNPSWLVPSVNQNM 1740
BLAST of Cp4.1LG04g01060 vs. NCBI nr
Match:
gi|778722715|ref|XP_011658553.1| (PREDICTED: zinc finger CCCH domain-containing protein 19 isoform X2 [Cucumis sativus])
HSP 1 Score: 2501.1 bits (6481), Expect = 0.0e+00
Identity = 1364/1814 (75.19%), Positives = 1481/1814 (81.64%), Query Frame = 1
Query: 1 MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRE-LHSNEEQHCLFQSAINELEFPSN 60
MEAEE+DSS DQ SS L VDDG LDVKC T+RE L SNE+QHC+ +S+I E F N
Sbjct: 1 MEAEEDDSSYQDQKSSSLY-VDDGK-LDVKCDTNREELLSNEQQHCVSKSSIIETGFSPN 60
Query: 61 SSVESLQPSDAIRGDESLVAETCLE------VEETEIAGVKACRNGIEDMGEDSVKLEVE 120
+ VESL P DAI GDE L TC E VEE E RN I+DMGEDSVKLE+E
Sbjct: 61 TVVESLPPRDAILGDEILAVGTCSEMEKKDLVEERERVEENDFRNIIQDMGEDSVKLEIE 120
Query: 121 PDIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVN 180
P IA GLL + F+DVK+ EE KA++EF E E V+
Sbjct: 121 PGIAKAGLLEQRAFDDVKKNTGVTEEEKALSEFAE-----------------EHTVV--- 180
Query: 181 LPDNTVGCGETD--TCLSDVLAE--LAETTPFVHGVDTTDVANLVERKEVEENADDPKDS 240
D + GC ET TCLS VLAE LAETTPFV GVD T NLV++ EVEE+ADD DS
Sbjct: 181 --DGSAGCVETTETTCLSYVLAEERLAETTPFVQGVDVTVATNLVQKTEVEEHADDTNDS 240
Query: 241 KDIEVAKQETFSMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGETLKMEN 300
KD EV KQE F++E +LGV VQL E SELK SLVDG V EGRTENLADRTGETLKMEN
Sbjct: 241 KDTEVPKQENFAVEKMELGVQVQLEEDSELKVSLVDGVV--EGRTENLADRTGETLKMEN 300
Query: 301 DSSKTDEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLADETPK 360
SS ++EVGL +FA EI V + N EDKT+E DGMC+E+KA D NLADETP+
Sbjct: 301 ASSTSNEVGLTHFAVEIKETVNIGNDEDKTMETDGMCVEEKATDVGMME---NLADETPE 360
Query: 361 IKGVHVTDDNIEVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENV--TNYT 420
IKGV V D +IE LKIE++EDREAGVQGLG+ADES V K+EN+ DE AE E V T+YT
Sbjct: 361 IKGVDVADYSIEELKIEDMEDREAGVQGLGLADESPVVEKLENVADENAEPEGVQVTDYT 420
Query: 421 AESM--ENL-DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAE 480
AE + EN+ DDKTAQ EE+AM EE E DD VYLVDEGIGSEE D NMTYLV ETEAAE
Sbjct: 421 AEEVKSENVEDDKTAQGEEVAMGEEIAEPDDMVYLVDEGIGSEETDVNMTYLVEETEAAE 480
Query: 481 EVEEMDVTEEVDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDR 540
EVEEMD TEEVDE + SSG+KRKRGKNSKAPARV SRKKVEEDVCFICFDGGDLVLCDR
Sbjct: 481 EVEEMDATEEVDEPNISSSGSKRKRGKNSKAPARVASRKKVEEDVCFICFDGGDLVLCDR 540
Query: 541 RGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVI 600
RGCPKAYHP+CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVI
Sbjct: 541 RGCPKAYHPACINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVI 600
Query: 601 LCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTF 660
LCVRGNKGFCE CMRFV IEKNEQG+ EKGQIDFNDK SWEYLFKEYWTDLKGSLSLTF
Sbjct: 601 LCVRGNKGFCETCMRFVTSIEKNEQGNKEKGQIDFNDKNSWEYLFKEYWTDLKGSLSLTF 660
Query: 661 DELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQ 720
DELVHAKNPWKGSETL SRPDSPGEL DGNVDGGSDLDVSENEESGSSKKRKAK+RS+SQ
Sbjct: 661 DELVHAKNPWKGSETLTSRPDSPGELCDGNVDGGSDLDVSENEESGSSKKRKAKKRSRSQ 720
Query: 721 AKETNSPSMPIIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIK 780
AKE +SPSMP SQG STD+NVEW SKELLEFVMHMKNGDRTVLSQFDVQALLLEYIK
Sbjct: 721 AKEMSSPSMPATA-SQGLSTDDNVEWGSKELLEFVMHMKNGDRTVLSQFDVQALLLEYIK 780
Query: 781 RNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTE 840
RNKLRDPRRKSQIICDSRLE+LFGKPRVGHFEMLKLLESHFLI+ED QINDL SVA+TE
Sbjct: 781 RNKLRDPRRKSQIICDSRLESLFGKPRVGHFEMLKLLESHFLIKEDAQINDLHVSVAETE 840
Query: 841 SSQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEY 900
SSQLE DG TD SGK +KEKKRR RKK D+RGLQSNLDDYAAIDIHNINLIYL+RNLVEY
Sbjct: 841 SSQLEADG-TDGSGKIKKEKKRRTRKK-DERGLQSNLDDYAAIDIHNINLIYLKRNLVEY 900
Query: 901 LIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEIL 960
LIEDEESFH+KVVGSFVRIRISG+AQKQDLYRLVQVVGTSKASEPYKVGK+MTDILLEIL
Sbjct: 901 LIEDEESFHDKVVGSFVRIRISGSAQKQDLYRLVQVVGTSKASEPYKVGKRMTDILLEIL 960
Query: 961 NLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWME 1020
NLNKTEV+SIDIISNQEFTE+ECKRLRQS+KCGI+NRLTVGDLQERAMSLQDARVKDWME
Sbjct: 961 NLNKTEVVSIDIISNQEFTEDECKRLRQSMKCGIINRLTVGDLQERAMSLQDARVKDWME 1020
Query: 1021 TEIVRLSHLRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHTDPNMD 1080
TEIVRLSHLRDRASEKGRRKE + ECVEKLQLLKTPEERQRR+EE+P IH DPNMD
Sbjct: 1021 TEIVRLSHLRDRASEKGRRKE-----LRECVEKLQLLKTPEERQRRIEEIPEIHADPNMD 1080
Query: 1081 PSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSTNRD 1140
PSHESEDEDEADDKR+ETYTLSR + F RRTREPVSPGK GS+LNDSWSGTRNFS+TNRD
Sbjct: 1081 PSHESEDEDEADDKRRETYTLSRSTSFGRRTREPVSPGKGGSHLNDSWSGTRNFSNTNRD 1140
Query: 1141 LSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNAL 1200
+SRNLSGKGF+NQG+DAIGSGEIINE SW HGRE DVKK +KWDK QVSPSSE+TARNAL
Sbjct: 1141 MSRNLSGKGFANQGDDAIGSGEIINETSWGHGRERDVKKTSKWDK-QVSPSSEITARNAL 1200
Query: 1201 SGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWS 1260
SGAASE SAA SVN AAS SVGTTQNAATVNE+EKIW YQDPSGKVQGPFSMVQLRKWS
Sbjct: 1201 SGAASE-SSAAHSVNPAASSSVGTTQNAATVNESEKIWHYQDPSGKVQGPFSMVQLRKWS 1260
Query: 1261 NTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAKPQGAT 1320
NTGYFP DLR+WR SD+Q+DSLLLTDVLAGKI KDT NS+Q ++S FV +PQG T
Sbjct: 1261 NTGYFPTDLRIWRISDQQEDSLLLTDVLAGKISKDTPLTSNSLQVHPNSSPFVGRPQGGT 1320
Query: 1321 VQSGMDVQNTGTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGDRWSSD 1380
+QSG+D QN +SN HTNPTSY QS+GGRWKSQ EVSPTG P S SI+VPRYSGDRWSSD
Sbjct: 1321 LQSGVDGQNASSSNSHTNPTSYDQSSGGRWKSQNEVSPTGRPVSGSIKVPRYSGDRWSSD 1380
Query: 1381 HGNKDFTSLPSPTPSSGGTKEQPFQMATPF-----ASSAGGGSLHGSSLMQGSENDSLRS 1440
HGNK+FT+LPSPTPSSGG+KEQPFQ+A F S GG LHGSS+MQGSENDSLRS
Sbjct: 1381 HGNKNFTNLPSPTPSSGGSKEQPFQVAASFMEAKSLSGTAGGGLHGSSVMQGSENDSLRS 1440
Query: 1441 HSGLNAAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINS 1500
H G N++EKG G GPIN LQNH S PVR S IIDD +NPAADI+SISANL SLVQSINS
Sbjct: 1441 HLGRNSSEKGLGSGPINALQNHQSQPVRQSPIIDDASLNPAADIRSISANLQSLVQSINS 1500
Query: 1501 RNPPIETQ------------------------TVETNISSSMPPGQTLHRRWGEMSPAQN 1560
RNPPIE VE+N+SSSMPP QTLH RWGEMSPAQN
Sbjct: 1501 RNPPIEAHGHGSGSILKRETDTSEAWQNAHSLKVESNVSSSMPPAQTLHSRWGEMSPAQN 1560
Query: 1561 AA---------TASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPPNIPWGMGAPEGQ 1620
AA T+SFS+ G+++F SS+PWRS PI SNP HIQ STPPN+PWGMGAPEGQ
Sbjct: 1561 AAVTSFSAGSSTSSFSSAGMSSFPSSDPWRSTAPISSNPQHIQCSTPPNLPWGMGAPEGQ 1620
Query: 1621 STVPRPGLESQNHSWGPMPSGNPNMTWAPSA-PPNATGMMWGSSAQSSASVGTNPGWNAP 1680
STVPR G ESQN +WGPMPSGNPNM W P+ PPNAT MMWG++AQSS TNPGW AP
Sbjct: 1621 STVPRQGSESQNQTWGPMPSGNPNMGWGPTGPPPNATAMMWGATAQSSGPAATNPGWIAP 1680
Query: 1681 GQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGMWSNEH 1740
GQGP NN+QGW AHS +PP VNATPGWV PN+ PMPPMNMNP+W PS NQ MW NEH
Sbjct: 1681 GQGPAAGNNLQGWPAHSPMPPPVNATPGWVGPNVAPMPPMNMNPSWLVPSVNQNMWGNEH 1740
Query: 1741 GKNGDRFSN-PDSVSHGGDPGNGGKSWGMPPSY--GGGGGSSSRLPYNNKGQKLCKYHES 1757
GKNG+RFSN D SHGGDPGNG KSWGM PS+ GGGGG +SR PY N+ QKLCKYHES
Sbjct: 1741 GKNGNRFSNQKDGGSHGGDPGNGDKSWGMQPSFGGGGGGGGNSRSPY-NRVQKLCKYHES 1774
BLAST of Cp4.1LG04g01060 vs. NCBI nr
Match:
gi|659094432|ref|XP_008448057.1| (PREDICTED: zinc finger CCCH domain-containing protein 19 isoform X2 [Cucumis melo])
HSP 1 Score: 2461.4 bits (6378), Expect = 0.0e+00
Identity = 1340/1786 (75.03%), Positives = 1453/1786 (81.35%), Query Frame = 1
Query: 1 MEAEENDSSKHDQPSSPLLSVDDGNDLDVKCHTHRE-LHSNEEQHCLFQSAINELEFPSN 60
MEAEE+DSS HDQ SS LDVKC T+RE LHSNE+QHC +S+I E EF N
Sbjct: 1 MEAEEDDSSYHDQKSS---------SLDVKCDTNREELHSNEQQHCASKSSIIETEFSPN 60
Query: 61 SSVESLQPSDAIRGDESLVAETCLE------VEETEIAGVKACRNGIEDMGEDSVKLEVE 120
+ VESL P DAI GDE L +TC E VEE EI K RN I+DM EDSVKLE+E
Sbjct: 61 TVVESLPPRDAILGDEILAVDTCSEMEKKDLVEEKEIKEEKDSRNIIQDMAEDSVKLEIE 120
Query: 121 PDIAAMGLLGETVFNDVKEEDAGAEEVKAVAEFGEGDLLCEMDLVGGAENQVEGNVLMVN 180
PDI GL + F+DVKE EE KA++EF + E V+
Sbjct: 121 PDIEKTGLSEQRAFDDVKENTGVTEEEKALSEFAQ-----------------EHTVV--- 180
Query: 181 LPDNTVGCGETD--TCLSDVLAE--LAETTPFVHGVDTTDVANLVERKEVEENADDPKDS 240
D + GC ET TCLSDVLAE LAETT FV VD TD NLV++ +VEE+ADD DS
Sbjct: 181 --DGSAGCVETTETTCLSDVLAEETLAETTLFVQDVDVTDAINLVQKTKVEEHADDANDS 240
Query: 241 KDIEVAKQETFSMEDGKLGVPVQLVEKSELKQSLVDGAVVEEGRTENLADRTGETLKMEN 300
KD EV KQE FS+E +LGV VQL E SELK SLVDGAV EGRTENLADR GETLK EN
Sbjct: 241 KDTEVPKQENFSVEKMELGVRVQLEENSELKGSLVDGAV--EGRTENLADRPGETLKREN 300
Query: 301 DSSKTDEVGLANFAGEIDGAVTMENTEDKTVEVDGMCLEDKAADATTKTTTGNLADETPK 360
SS T+EVGL + A EI V + N EDKT+E+DGMC+EDKA T NL DETP+
Sbjct: 301 ASSTTNEVGLTHIAVEIKETVNVGNAEDKTIEMDGMCMEDKA---TAVGMMENLTDETPE 360
Query: 361 IKGVHVTDDNIEVLKIENVEDREAGVQGLGVADESAEVGKIENLVDETAEAENV--TNYT 420
IKGV V D +IE LKIE++EDREAGVQGLG+AD+S V K+EN+ DE AEAE V T+YT
Sbjct: 361 IKGVDVADYSIEELKIEDMEDREAGVQGLGLADKSPVVEKLENVADENAEAEGVQVTDYT 420
Query: 421 AESM--ENL-DDKTAQLEEIAMEEETEEADDRVYLVDEGIGSEENDANMTYLVGETEAAE 480
AE + EN+ DDKTAQ EEIAM EE E DD VYLVDEGIGSEE D NMTYLV ETEAAE
Sbjct: 421 AEEVKSENVEDDKTAQGEEIAMAEEIAEPDDMVYLVDEGIGSEETDVNMTYLVEETEAAE 480
Query: 481 EVEEMDVTEEVDEASKGSSGAKRKRGKNSKAPARVPSRKKVEEDVCFICFDGGDLVLCDR 540
EVEEMDVTEE+DE + SSG+KRKRGKNSKAPARV SRKKVEEDVCFICFDGGDLVLCDR
Sbjct: 481 EVEEMDVTEEMDEPNISSSGSKRKRGKNSKAPARVASRKKVEEDVCFICFDGGDLVLCDR 540
Query: 541 RGCPKAYHPSCINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVI 600
RGCPKAYHP+CINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVI
Sbjct: 541 RGCPKAYHPACINRDEAFFRAKGRWNCGWHLCSNCEKTAHYMCYTCTFSLCKGCIKNAVI 600
Query: 601 LCVRGNKGFCEACMRFVMLIEKNEQGSTEKGQIDFNDKTSWEYLFKEYWTDLKGSLSLTF 660
CVRGNKGFCE CMRFV IEKNEQGSTEKGQIDFNDK SWEYLFKEYW DLKGSLSLTF
Sbjct: 601 FCVRGNKGFCETCMRFVTSIEKNEQGSTEKGQIDFNDKNSWEYLFKEYWIDLKGSLSLTF 660
Query: 661 DELVHAKNPWKGSETLNSRPDSPGELYDGNVDGGSDLDVSENEESGSSKKRKAKRRSKSQ 720
DELVHAKNPWKGSETL SRPDSPGEL DGNVDGGSDLDVSENEESGSSKKRKAK+RS+SQ
Sbjct: 661 DELVHAKNPWKGSETLTSRPDSPGELCDGNVDGGSDLDVSENEESGSSKKRKAKKRSRSQ 720
Query: 721 AKETNSPSMPIIPDSQGPSTDNNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIK 780
AKE +SPSMP I DSQG S D+NVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIK
Sbjct: 721 AKEMSSPSMPAIADSQGLSADDNVEWASKELLEFVMHMKNGDRTVLSQFDVQALLLEYIK 780
Query: 781 RNKLRDPRRKSQIICDSRLENLFGKPRVGHFEMLKLLESHFLIREDVQINDLQGSVADTE 840
RNKLRDPRRKSQIICDSRLE+LFGKPRVGHFEMLKLLESHFLI+ED QINDL GSVA+TE
Sbjct: 781 RNKLRDPRRKSQIICDSRLESLFGKPRVGHFEMLKLLESHFLIKEDAQINDLHGSVAETE 840
Query: 841 SSQLEGDGYTDASGKTRKEKKRRMRKKGDQRGLQSNLDDYAAIDIHNINLIYLRRNLVEY 900
SSQLE DG TD SGK +KEKKRR RKK D+RGLQSNLDDYAAIDIHNINLIYL+RNLVEY
Sbjct: 841 SSQLEADG-TDGSGKIKKEKKRRTRKK-DERGLQSNLDDYAAIDIHNINLIYLKRNLVEY 900
Query: 901 LIEDEESFHEKVVGSFVRIRISGNAQKQDLYRLVQVVGTSKASEPYKVGKKMTDILLEIL 960
LIEDEESFH+KVVGSFVRIRISG+AQKQDLYRLVQVVGTSKASEPYKVGK+MTDILLEIL
Sbjct: 901 LIEDEESFHDKVVGSFVRIRISGSAQKQDLYRLVQVVGTSKASEPYKVGKRMTDILLEIL 960
Query: 961 NLNKTEVISIDIISNQEFTEEECKRLRQSIKCGILNRLTVGDLQERAMSLQDARVKDWME 1020
NLNKTEV+SIDIISNQEFTE+ECKRLRQS+KCGI+NRLTVG+LQERAMSLQDARVKDWME
Sbjct: 961 NLNKTEVVSIDIISNQEFTEDECKRLRQSMKCGIINRLTVGELQERAMSLQDARVKDWME 1020
Query: 1021 TEIVRLSHLRDRASEKGRRKEYPFYNIMECVEKLQLLKTPEERQRRLEELPGIHTDPNMD 1080
TEIVRLSHLRDRASEKGRRKE + ECVEKLQLLKTPEERQRR+EE+P IH DPNMD
Sbjct: 1021 TEIVRLSHLRDRASEKGRRKE-----LRECVEKLQLLKTPEERQRRIEEIPEIHADPNMD 1080
Query: 1081 PSHESEDEDEADDKRQETYTLSRGSGFSRRTREPVSPGKAGSNLNDSWSGTRNFSSTNRD 1140
PSHESEDEDEADDKR+ETYTLSR + F RRTREPVSPGK GS+LNDSWSGTRNFS+TNRD
Sbjct: 1081 PSHESEDEDEADDKRRETYTLSRSTSFGRRTREPVSPGKGGSHLNDSWSGTRNFSNTNRD 1140
Query: 1141 LSRNLSGKGFSNQGEDAIGSGEIINENSWSHGREGDVKKPNKWDKQQVSPSSEMTARNAL 1200
+SRNLSGKGFSNQG+DAIGSGEIINE SW HGRE DVKK +KWDK QVSPSSEMTARNAL
Sbjct: 1141 MSRNLSGKGFSNQGDDAIGSGEIINETSWGHGRERDVKKTSKWDK-QVSPSSEMTARNAL 1200
Query: 1201 SGAASELPSAARSVNSAASPSVGTTQNAATVNETEKIWRYQDPSGKVQGPFSMVQLRKWS 1260
SGAASE SAA SVN S SVGTTQNAAT NE+EKIW YQDPSGKVQGPFSMVQLRKWS
Sbjct: 1201 SGAASE-SSAAHSVNPTVSSSVGTTQNAATANESEKIWHYQDPSGKVQGPFSMVQLRKWS 1260
Query: 1261 NTGYFPADLRVWRASDKQDDSLLLTDVLAGKIPKDTSSVDNSIQAQAHASSFVAKPQGAT 1320
NTGYFP DLR+WR SD+Q+DSLLLTDVLAGKI KDT NS+Q ++S FV +PQG T
Sbjct: 1261 NTGYFPTDLRIWRISDQQEDSLLLTDVLAGKISKDTPLTSNSLQVHPNSSPFVGRPQGGT 1320
Query: 1321 VQSGMDVQNTGTSNPHTNPTSYGQSAGGRWKSQTEVSPTGIPASASIEVPRYSGDRWSSD 1380
+QSG+D QN +SN HTNPTSY QS+GGRWKSQ EVSPTG P S SI+VPRYSG+RWSSD
Sbjct: 1321 LQSGVDGQNASSSNSHTNPTSYDQSSGGRWKSQNEVSPTGRPVSGSIKVPRYSGERWSSD 1380
Query: 1381 HGNKDFTSLPSPTPSSGGTKEQPFQMATPF-----ASSAGGGSLHGSSLMQGSENDSLRS 1440
HGNK+FT+LPSPTPSSGGTKEQPFQ+A F S GGG LHGSS+MQGSEND LRS
Sbjct: 1381 HGNKNFTNLPSPTPSSGGTKEQPFQVAASFMEAKSLSGTGGGGLHGSSVMQGSENDPLRS 1440
Query: 1441 HSGLNAAEKGTGLGPINGLQNHHSLPVRPSSIIDDTLVNPAADIKSISANLHSLVQSINS 1500
H G N++EKG G GPIN LQNH S PVR S IIDD +NPAADI+SISANL SLVQSINS
Sbjct: 1441 HLGRNSSEKGMGSGPINALQNHQSQPVRQSPIIDDASLNPAADIRSISANLQSLVQSINS 1500
Query: 1501 RNPPIE------------------------TQTVETNISSSMPPGQTLHRRWGEMSPAQN 1560
RNPPIE + VE+N+SSSMPP QTLH RWGEMSPAQN
Sbjct: 1501 RNPPIEAHGRGSGSILKRETDTSEAWQNAQSHKVESNVSSSMPPAQTLHSRWGEMSPAQN 1560
Query: 1561 AA---------TASFSTPGLTNFSSSEPWRSMPPIPSNPPHIQSSTPPNIPWGMGAPEGQ 1620
AA T+SFS+ GL+NF SS+PWRS PI +NP HIQ STPPN+ WGMGAPEGQ
Sbjct: 1561 AAVTSFSAGSSTSSFSSAGLSNFPSSDPWRSTAPISNNPQHIQCSTPPNLAWGMGAPEGQ 1620
Query: 1621 STVPRPGLESQNHSWGPMPSGNPNMTWAPSA-PPNATGMMWGSSAQSSASVGTNPGWNAP 1680
STVPRPG ESQN +WGPMPSGNPNM W P+A PPNA+ MMWG++AQSS TNPGW AP
Sbjct: 1621 STVPRPGSESQNQTWGPMPSGNPNMGWGPTAPPPNASAMMWGTTAQSSGPAATNPGWIAP 1680
Query: 1681 GQGPPVRNNIQGWQAHSSIPPQVNATPGWVAPNLGPMPPMNMNPNWHAPSANQGMWSNEH 1731
GQGP NNIQGW AHS +PP VNATPGWV N+ PMPPMNMNP+W PS NQ MW NEH
Sbjct: 1681 GQGPAAGNNIQGWPAHSPMPPPVNATPGWVGSNVAPMPPMNMNPSWLVPSVNQNMWGNEH 1738
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
C3H19_ARATH | 7.2e-269 | 39.88 | Zinc finger CCCH domain-containing protein 19 OS=Arabidopsis thaliana GN=NERD PE...
C3H44_ARATH | 5.9e-162 | 47.35 | Zinc finger CCCH domain-containing protein 44 OS=Arabidopsis thaliana GN=At3g511...
Y5843_ARATH | 1.7e-44 | 28.65 | Uncharacterized protein At5g08430 OS=Arabidopsis thaliana GN=At5g08430 PE=1 SV=2
NSD3_MOUSE | 7.5e-16 | 41.59 | Histone-lysine N-methyltransferase NSD3 OS=Mus musculus GN=Whsc1l1 PE=1 SV=2
NSD3_HUMAN | 8.3e-15 | 38.53 | Histone-lysine N-methyltransferase NSD3 OS=Homo sapiens GN=WHSC1L1 PE=1 SV=1
Match Name | E-value | Identity | Description
A0A0A0K4G1_CUCSA | 0.0e+00 | 75.26 | Uncharacterized protein OS=Cucumis sativus GN=Csa_7G006220 PE=4 SV=1
A0A061DZP0_THECC | 0.0e+00 | 48.09 | Nucleic acid binding,zinc ion binding,DNA binding, putative isoform 2 OS=Theobro...
V7AUM1_PHAVU | 0.0e+00 | 52.18 | Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_009G003300g PE=4 SV=1
A0A0S3RA09_PHAAN | 0.0e+00 | 52.21 | Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.02G011300 PE=...
A0A061E0K8_THECC | 0.0e+00 | 47.37 | Nucleic acid binding,zinc ion binding,DNA binding, putative isoform 1 OS=Theobro...
Match Name | E-value | Identity | Description
AT2G16485.1 | 4.0e-270 | 39.88 | nucleic acid binding;zinc ion binding;DNA binding
AT3G51120.1 | 3.3e-163 | 47.35 | DNA binding;zinc ion binding;nucleic acid binding;nucleic acid bindi...
AT2G18090.1 | 1.7e-87 | 44.47 | PHD finger family protein / SWIB complex BAF60b domain-containing pr...
AT5G63700.1 | 1.4e-60 | 27.55 | zinc ion binding;DNA binding
AT5G08430.1 | 9.6e-46 | 28.65 | SWIB/MDM2 domain;Plus-3;GYF