BLAST of CmaCh04G012940 vs. Swiss-Prot
Match:
POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)
HSP 1 Score: 557.4 bits (1435), Expect = 4.6e-157
Identity = 411/1321 (31.11%), Postives = 654/1321 (49.51%), Query Frame = 1
Query: 79 WYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLERQFASTSRARAMQIRMELSTIQ 138
W D+ S I ++++V++ ++ TAR W LE + S + + ++ +L +
Sbjct: 52 WADLDERAASAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALH 111
Query: 139 KKDMT-IADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYT 198
+ T + L LA +G +IE+E+ +L L YD L T+I
Sbjct: 112 MSEGTNFLSHLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIE 171
Query: 199 VSDVYAHMLSYEMRHLRKGTFEQLSSANNVNRISIRGGANGGR-GSRGRSRQLNSGHGQS 258
+ DV + +L E + Q R R N GR G+RG+S+ + ++
Sbjct: 172 LKDVTSALLLNEKMRKKPENQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSRVRN 231
Query: 259 RRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAENNLKQAALATSGYT 318
N PG + + G+ N D + + N ++ + SG
Sbjct: 232 CYNCNQPGHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPE 291
Query: 319 SDTNWYVDTGATDHITNDLERLTTRERYTGTD--QIQVANGAGLSISHIGNSLIS---GS 378
S+ W VDT A+ H T + RY D +++ N + I+ IG+ I G
Sbjct: 292 SE--WVVDTAASHHATPVRDLFC---RYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGC 351
Query: 379 SLVLKHILYAPKINKHLISVQRLASDNNAVVEFHPNYFLVKDRVTKKLLL--HGRCKNGL 438
+LVLK + + P + +LIS +A D + + N K R+TK L+ G + L
Sbjct: 352 TLVLKDVRHVPDLRMNLIS--GIALDRDGYESYFANQ---KWRLTKGSLVIAKGVARGTL 411
Query: 439 YVLPHNFSQALLTA---KLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNAC 498
Y Q L A ++S + WH+R+GH S ++IL +L + C+ C
Sbjct: 412 YRTNAEICQGELNAAQDEISVDLWHKRMGH-MSEKGLQILAKKSLISYAKGTTVKPCDYC 471
Query: 499 QLGKAHQLPFGSSQHVSTAPLQLIHTDVWGP-SIASVNNSKYYVSFVDDFSRYVWIYFLR 558
GK H++ F +S L L+++DV GP I S+ +KY+V+F+DD SR +W+Y L+
Sbjct: 472 LFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILK 531
Query: 559 CKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEY--HRLHNYFKSTGIEHHISCPHTHQ 618
K V VF +F VE K++ ++SD GGEY Y S GI H + P T Q
Sbjct: 532 TKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQ 591
Query: 619 QNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFG 678
NG+ ER +R IVE ++L A +P S+W EA TAC+LINR PS + + P
Sbjct: 592 HNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTN 651
Query: 679 KSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISR 738
K YS L+VFGC + ++ KL ++ CIF+GY
Sbjct: 652 KEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGY--------------------G 711
Query: 739 DVVFDENIFPFEESKPPNKTTNPHHPVLLPALAKLASFYTENAL---TDIEPVVSNSHMN 798
D F ++ PV + + E+ + D+ V N +
Sbjct: 712 DEEFGYRLWD---------------PVKKKVIRSRDVVFRESEVRTAADMSEKVKNGIIP 771
Query: 799 DGQTDNIASDNLSGVSLSSADNTRSSEEIAEYEAESSSINAQNQTHEHVSDQPTEAASQH 858
+ T S+N + ++ + + E+ E + ++ + EH PT+ QH
Sbjct: 772 NFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEH----PTQGEEQH 831
Query: 859 -PMRTRLRNNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHP-- 918
P+R R + SR++ S + +I EP +L+E + HP
Sbjct: 832 QPLRRSERPRV---------------ESRRYPSTEYV---LISDDREPESLKEVLSHPEK 891
Query: 919 -RWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGF 978
+ AM +E+ +L++N T+ LV G + KWV+K+K+ D + R KARLV KGF
Sbjct: 892 NQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGF 951
Query: 979 KQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPG 1038
+Q+ G+D+ + FSPV+K ++IR ILSLA + + Q+D++ AFLHG L+EE+YM QP G
Sbjct: 952 EQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEG 1011
Query: 1039 FQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREIT- 1098
F+ + K K+ +CKL K+LYGLKQAP+ W+ + + + + +D ++ + E
Sbjct: 1012 FEVAGK-KHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNF 1071
Query: 1099 IYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGDLEYFLGIEV--KKTRDGIILSQ 1158
I +L+YVDD++IV +L L F +KDLG + LG+++ ++T + LSQ
Sbjct: 1072 IILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQ 1131
Query: 1159 RRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEE-----QFKYRSTVGALQY- 1218
+Y +L+R NM+ KP+STP+ KL ++ P + EE + Y S VG+L Y
Sbjct: 1132 EKYIERVLERFNMKNAKPVSTPLAGHLKLSKKM-CPTTVEEKGNMAKVPYSSAVGSLMYA 1191
Query: 1219 LTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKSTMMLSGFSDA 1278
+ TRPD+A AV V ++L P HW AVK ILRY++GT + S +L G++DA
Sbjct: 1192 MVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPILKGYTDA 1251
Query: 1279 DWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLL 1338
D AG D+R+S++G+ ISW S+ Q V+ S+TEAEY A EMIW+K L
Sbjct: 1252 DMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFL 1301
Query: 1339 KELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSS 1369
+ELG++Q K ++CD+ A L+ N ++HARTKHI+V +H++RE V ++++V IS++
Sbjct: 1312 QELGLHQ-KEYVVYCDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTN 1301
BLAST of CmaCh04G012940 vs. Swiss-Prot
Match:
COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)
HSP 1 Score: 327.8 bits (839), Expect = 6.0e-88
Identity = 182/485 (37.53%), Postives = 279/485 (57.53%), Query Frame = 1
Query: 899 WRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQ 958
W A+N EL+A K N TW + N++DS+WV+ VK G+ R KARLVA+GF Q
Sbjct: 906 WEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQ 965
Query: 959 RFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQ 1018
++ +DY +TF+PV + S+ R ILSL + + Q+D++ AFL+G LKEE+YMR P G
Sbjct: 966 KYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGIS 1025
Query: 1019 DSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREI--TI 1078
++ + +CKL KA+YGLKQA + W L E F S D ++IL I I
Sbjct: 1026 CNS---DNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENI 1085
Query: 1079 YMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGDLEYFLGIEVKKTRDGIILSQRRY 1138
Y+L+YVDD++I + + L F + DL ++++F+GI ++ D I LSQ Y
Sbjct: 1086 YVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAY 1145
Query: 1139 ALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFK--YRSTVGALQYLTM-TRP 1198
+L + NME C +STP+ S K+ E L+++E RS +G L Y+ + TRP
Sbjct: 1146 VKKILSKFNMENCNAVSTPLPS--KINYEL---LNSDEDCNTPCRSLIGCLMYIMLCTRP 1205
Query: 1199 DLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKSTMM---LSGFSDADWA 1258
DL AVN + +Y W +KR+LRY+KGT+ + + +K+ + G+ D+DWA
Sbjct: 1206 DLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWA 1265
Query: 1259 GCPDDRRSTSGFAV-FLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKE 1318
G DR+ST+G+ NLI W++++Q +V+ SSTEAEY A+ E +W+K LL
Sbjct: 1266 GSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTS 1325
Query: 1319 LGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQ 1375
+ + +++ DN G + +NP H R KHI++ +HF REQV + + +I + +Q
Sbjct: 1326 INIKLENPIKIYEDNQGCISIANNPSCHKRAKHIDIKYHFAREQVQNNVICLEYIPTENQ 1382
BLAST of CmaCh04G012940 vs. Swiss-Prot
Match:
M810_ARATH (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg00810 PE=4 SV=1)
HSP 1 Score: 218.8 bits (556), Expect = 3.9e-55
Identity = 111/228 (48.68%), Postives = 152/228 (66.67%), Query Frame = 1
Query: 1076 IYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGDLEYFLGIEVKKTRDGIILSQRR 1135
+Y+L+YVDDI++ SS+ LI +L F++KDLG + YFLGI++K G+ LSQ +
Sbjct: 1 MYLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTK 60
Query: 1136 YALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDL 1195
YA +L M CKPMSTP+ + +RS VGALQYLT+TRPD+
Sbjct: 61 YAEQILNNAGMLDCKPMSTPLPLK---LNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDI 120
Query: 1196 AFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKSTMM-LSGFSDADWAGCPD 1255
++AVN VCQ +H PT A + +KR+LRYVKGT+ G+ I K++ + + F D+DWAGC
Sbjct: 121 SYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTS 180
Query: 1256 DRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIW 1303
RRST+GF FLG N+ISWS+++Q TVSRSSTE EY+A+A AE+ W
Sbjct: 181 TRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225
BLAST of CmaCh04G012940 vs. Swiss-Prot
Match:
YCH4_YEAST (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY5A PE=5 SV=2)
HSP 1 Score: 156.4 bits (394), Expect = 2.4e-36
Identity = 93/311 (29.90%), Postives = 168/311 (54.02%), Query Frame = 1
Query: 994 VDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLI 1053
+D+ AFL+ + E +Y++QPPGF + P +Y+ +L +YGLKQAP W+ + L
Sbjct: 1 MDVDTAFLNSTMDEPIYVKQPPGFVNERNP-DYVWELYGGMYGLKQAPLLWNEHINNTLK 60
Query: 1054 ELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGD 1113
++GF + L+ + IY+ +YVDD+++ + S + +R+ Q+L +++KDLG
Sbjct: 61 KIGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGK 120
Query: 1114 LEYFLGIEVKKTRDG-IILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSA 1173
++ FLG+ + ++ +G I LS + Y + K TP+ +++ LF L
Sbjct: 121 VDKFLGLNIHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHL-- 180
Query: 1174 EEQFKYRSTVGALQYLTMT-RPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALG 1233
++ Y+S VG L + T RPD+++ V+ + ++L P H + +R+LRY+ T ++
Sbjct: 181 KDITPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMC 240
Query: 1234 VKIQK-STMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRK-QATVSRSSTEAE 1293
+K + S + L+ + DA D ST G+ L ++WSS+K + + STEAE
Sbjct: 241 LKYRSGSQLALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPSTEAE 300
Query: 1294 YKAIANLTAEM 1301
Y + E+
Sbjct: 301 YITASETVMEI 308
BLAST of CmaCh04G012940 vs. Swiss-Prot
Match:
YJ41B_YEAST (Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-J PE=3 SV=3)
HSP 1 Score: 125.9 bits (315), Expect = 3.4e-27
Identity = 159/660 (24.09%), Postives = 286/660 (43.33%), Query Frame = 1
Query: 786 NIASDNLSGVSLSSADNTRSS-------EEIAEYEAESSSINAQNQTHEHVSDQPTEAAS 845
N+ +DN + VS N +S + A E SS+N + + H D+ S
Sbjct: 1181 NVPTDNGTNVSPRLEQNIEASGSPVQTVNKSAFLNKEFSSLNMKRKRKRH--DKNNSLTS 1240
Query: 846 QHPMRTRLRNNIVQAKQFTDGTIRYSETSRKFASAVTITTPII------ETATEPRNLQE 905
R + R+ + K D + T++ P I E ++ +L+E
Sbjct: 1241 YELERDKKRSKKNRVKLIPD-------------NMETVSAPKIRAIYYNEAISKNPDLKE 1300
Query: 906 AMQHPRWRGAMNDELSALKRNATWDLVPPK-----PGINLIDSKWVYKVKRKADGSVERL 965
++ + A + EL LK +D+ P ++ + ++ KR
Sbjct: 1301 KHEYKQ---AYHKELQNLKDMKVFDVDVKYSRSEIPDNLIVPTNTIFTKKRNGI-----Y 1360
Query: 966 KARLVAKGFKQRFGVDYTDTFSPVIKPST----IRVILSLAVTKGWNMRQVDIQNAFLHG 1025
KAR+V +G Q DT+S + S I++ L +A + M+ +DI +AFL+
Sbjct: 1361 KARIVCRGDTQS-----PDTYSVITTESLNHNHIKIFLMIANNRNMFMKTLDINHAFLYA 1420
Query: 1026 ILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVAD 1085
L+EE+Y+ P + + KL KALYGLKQ+PK W+ L L +G K +
Sbjct: 1421 KLEEEIYIPHP-------HDRRCVVKLNKALYGLKQSPKEWNDHLRQYLNGIGLKDNSYT 1480
Query: 1086 SSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGDL------EYF 1145
L+ +++ + I +YVDD +I +S++Q + I KLK +F +K G L
Sbjct: 1481 PGLYQTEDKNLMI--AVYVDDCVIAASNEQRLDEFINKLKSNFELKITGTLIDDVLDTDI 1540
Query: 1146 LGIEV--KKTRDGIILSQRRYALDLLKRVN--MEKCKPMSTPMGSAEKLFREQGIPLSAE 1205
LG+++ K I L+ + + + K+ N ++K + S P S K+ ++ + +E
Sbjct: 1541 LGMDLVYNKRLGTIDLTLKSFINRMDKKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMSE 1600
Query: 1206 EQF-----KYRSTVGALQYLT-MTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGT 1265
E+F K + +G L Y+ R D+ FAV KV + ++ P + + + +I++Y+
Sbjct: 1601 EEFRQGVLKLQQLLGELNYVRHKCRYDIEFAVKKVARLVNYPHERVFYMIYKIIQYLVRY 1660
Query: 1266 LALGVKIQK---STMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRS 1325
+G+ + + +DA G D +S G ++ G N+ + S K S
Sbjct: 1661 KDIGIHYDRDCNKDKKVIAITDAS-VGSEYDAQSRIGVILWYGMNIFNVYSNKSTNRCVS 1720
Query: 1326 STEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIE 1385
STEAE AI A+ +K LKELG + + D+ A + + K
Sbjct: 1721 STEAELHAIYEGYADSETLKVTLKELGEGDNNDIVMITDSKPAIQGLNRSYQQPKEKFTW 1780
Query: 1386 VDFHFVREQVARKAMEVRFISSSDQAREKFNMSVRSLAHPFSISEMARTWDMCARDVISE 1405
+ ++E++ K++++ I+ + L P S S+ R + + S+
Sbjct: 1781 IKTEIIKEKIKEKSIKLLKITGKGNIAD-------LLTKPVSASDFKRFIQVLKNKITSQ 1795
BLAST of CmaCh04G012940 vs. TrEMBL
Match:
Q75LJ1_ORYSJ (Putative copia-like retrotransposon protein OS=Oryza sativa subsp. japonica GN=OSJNBa0096I06.16 PE=4 SV=1)
HSP 1 Score: 1263.8 bits (3269), Expect = 0.0e+00
Identity = 670/1425 (47.02%), Postives = 928/1425 (65.12%), Query Frame = 1
Query: 2 SIPTNTMSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEP 61
S + ++P +S KLT+ NY LWS Q+L +R L G + G+ APS I
Sbjct: 4 SSSSTASTNPLFGVQVSEKLTKGNYALWSAQVLAAIRGARLDGHITGATAAPSMEIEKTA 63
Query: 62 SEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLERQFAS 121
S++T + I+NP + W+ DQ VL + S+++ ++L+ + +TA +AW + F +
Sbjct: 64 SDKTTEK--IVNPAYQEWFASDQQVLGFLLSTLSRDILTQVATASTAAQAWQQVCAMFTA 123
Query: 122 TSRARAMQIRMELSTIQKKDMTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPD 181
++AR++ +R+ L+ QK +M+I++Y K+K L D +A+ GK +++E+L+AY+L GL D
Sbjct: 124 QTKARSLNVRLTLTNTQKGNMSISEYCGKMKALADEIASSGKPLDEEDLVAYVLNGLDDD 183
Query: 182 YDPLVTSITTRTDVYTVSDVYAHMLSYEMRHLRKGTFEQLSSANNVNRISIRGGANGGRG 241
++P+V++I R + T+++VY+ +L++E R Q ++ N + A
Sbjct: 184 FEPVVSAIVARNESTTMAEVYSQLLNFENRQ----ALRQAHASANAAVVVFSVVAAAAVA 243
Query: 242 SRGRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQA 301
G + L G + + G+ CWHR+D+ Y
Sbjct: 244 HVGAAILLLQDVGAALEEIR------------LGV--------------CWHRYDETYVP 303
Query: 302 ENNLKQAALATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSIS 361
+ +++A A + Y DTNWY+DTGATDHITN+L++L RE+Y G D+I A+GAG+ I
Sbjct: 304 D---ERSAAAAAAYGIDTNWYLDTGATDHITNELDKLDVREKYKGGDKIHTASGAGMEIK 363
Query: 362 HIGNSLISGSS--LVLKHILYAPKINKHLISVQRLASDNNAVVEFHPNYFLVKDRVTKKL 421
HIG+S+I + L LK+IL+ P+ K+LIS RLA DN A +E H NYFL+KDR T+
Sbjct: 364 HIGDSVIHTPTRELHLKNILHVPQAKKNLISAHRLAMDNFAFLEVHSNYFLIKDRATRNT 423
Query: 422 LLHGRCKNGLYVLPHNFSQALLTAKL-SKEQWHRRLGHPASPITIRILQDNNLAIDTNIP 481
+L GRC+ LY LP + ++ + A S +WH RLGHPA PI R+L NNL T
Sbjct: 424 ILKGRCRRRLYSLPTSPAKQVHAATTPSFSRWHSRLGHPAVPIVTRVLSKNNLPCSTVAN 483
Query: 482 SSSICNACQLGKAHQLPFGSSQHVSTAPLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRY 541
SIC+ACQ GK+HQLP+ S VS+ PL+LI ++VWGP+ SV K+YVSF+DD+S++
Sbjct: 484 KDSICDACQKGKSHQLPYPKSSSVSSQPLELIFSNVWGPAPISVGGKKFYVSFIDDYSKF 543
Query: 542 VWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISC 601
W+Y L+ KS+V F +FQ VE KI +VQ+DWGGEY +L+ +F GI HH+SC
Sbjct: 544 TWVYLLKHKSEVFQKFQEFQTLVERFFGHKILAVQTDWGGEYQKLNTFFAKIGISHHVSC 603
Query: 602 PHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPL 661
P+ HQQNG ERKHRH++E L LLA A+MP+ +WDEA A +LINR PS+ I PL
Sbjct: 604 PYAHQQNGSAERKHRHLIEVALTLLAHASMPIKFWDEAVLAAAYLINRTPSKVINFACPL 663
Query: 662 HKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGR 721
+LF + P+Y+ LR+FGCA WPNLRPYN KL+FR+ RC+FLGYS+ HKG+KCL +TGR
Sbjct: 664 EQLFKEKPNYTALRIFGCAVWPNLRPYNKHKLAFRSKRCVFLGYSNLHKGFKCLEIATGR 723
Query: 722 IYISRDVVFDENIFPFEE-----SKPPNKTTNPHHPVLLPALAKLASFYTENALTDIEPV 781
+Y+SRDV FDE+IFPF E + P L+P L+ L N + + P
Sbjct: 724 VYVSRDVTFDESIFPFSELHSNAGARLRAEISLLPPSLVPHLSSLGG-EQNNHVLNYPPN 783
Query: 782 VSNSHMNDGQTDNIASDNLS-GVSLSSADNTRSSEEIAEYEAESSSINAQNQTH------ 841
V TD +N G + + ++ E A +++ AQ+ H
Sbjct: 784 V---------TDQFGEENAEIGEEIVANGEENAAAAADENAAAAANGGAQDDVHGVAYDA 843
Query: 842 --EHVS----DQPTEAASQHPMRTRLRNNIVQAKQFTDGTIRYSETSRKFASAVTITTPI 901
EH S D AA QH + ++VQA T S TS AS+ + +
Sbjct: 844 SPEHSSPVTDDAMASAAEQHGNPIQ-EEHLVQASPQT-----ASSTSPSVASSAGVHDDV 903
Query: 902 I------------ETATEP--------------RNLQEAMQHPRWRGAMNDELSALKRNA 961
E A P ++L+EA+ + W+ AM+ E AL N
Sbjct: 904 TTDQSDQTDQAMPEAAVAPIRPKTRLQSGIRKEKSLEEAVNNKHWKEAMDAEYMALIENK 963
Query: 962 TWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKP 1021
TW LVPP+ G N+ID KWVYKVKRKADGS++R KARLVAKGFKQR+G+DY DTFSPV+K
Sbjct: 964 TWHLVPPQKGRNVIDCKWVYKVKRKADGSLDRYKARLVAKGFKQRYGIDYEDTFSPVVKA 1023
Query: 1022 STIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKAL 1081
+TIR++LSLAV++GW++RQ+D++NAFLHG+L+EEVYM+QPPG++ + P NY+CKL KAL
Sbjct: 1024 ATIRIVLSLAVSRGWSLRQLDVKNAFLHGVLEEEVYMKQPPGYEKKSMP-NYVCKLDKAL 1083
Query: 1082 YGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQA 1141
YGLKQAP+AW+SRL+ KL ELGF S AD+SLF K +++I++LIYVDDII+ SS A
Sbjct: 1084 YGLKQAPRAWYSRLSTKLSELGFVPSKADTSLFFYKKGQVSIFLLIYVDDIIVASSVPDA 1143
Query: 1142 TERLIQKLKIDFAVKDLGDLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMST 1201
T L+Q+L DFA+KDLGDL YFLGIEV K +DG++LSQ +YA DLL+RV M +CKP+ST
Sbjct: 1144 TSTLLQELSKDFALKDLGDLHYFLGIEVHKVKDGLMLSQEKYASDLLRRVGMYECKPVST 1203
Query: 1202 PMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHW 1261
P+ ++EKL +G L ++ +YRS VGALQYLT+TRPD++F++NKVCQ+LH PT HW
Sbjct: 1204 PLSTSEKLSVNEGTLLGPQDSTQYRSVVGALQYLTLTRPDISFSINKVCQFLHAPTTTHW 1263
Query: 1262 GAVKRILRYVKGTLALGVKI-QKSTMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISW 1321
AVKRILRYVK T+ G+K + ++++SGFSDADWAG PDDRRST GFAVFLG NL+SW
Sbjct: 1264 AAVKRILRYVKYTVDTGLKFCRNPSLLVSGFSDADWAGSPDDRRSTGGFAVFLGPNLVSW 1323
Query: 1322 SSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSN 1379
S+RKQATVSRSS EAEYKA+AN TAE++W+++LL+ELGV +A +LWCDNLGA YL++N
Sbjct: 1324 SARKQATVSRSSIEAEYKALANATAEIMWVQTLLQELGVESPRAAKLWCDNLGAKYLSAN 1376
BLAST of CmaCh04G012940 vs. TrEMBL
Match:
Q2QRW4_ORYSJ (Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. japonica GN=LOC_Os12g26180 PE=4 SV=1)
HSP 1 Score: 1237.6 bits (3201), Expect = 0.0e+00
Identity = 682/1463 (46.62%), Postives = 928/1463 (63.43%), Query Frame = 1
Query: 2 SIPTNTMSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEP 61
S + ++P IS KL + N+ LW Q+ +R L G + G+ P IAV
Sbjct: 3 SSSSTAATNPFQGHPISEKLGKANHALWKVQVSAAVRGARLQGHLTGATKRPPAEIAVT- 62
Query: 62 SEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLERQFAS 121
+ G K NP W DQ VL + SS+T EVL + TA E W +E+ +++
Sbjct: 63 --KDGATKKEPNPAHEDWEATDQQVLGYLLSSLTREVLMQVATCDTAAEVWSAIEQMYST 122
Query: 122 TSRARAMQIRMELSTIQKKDMTIADYFRKVKHLGDTLAAIGKR-IEDEELIAYMLQGLGP 181
+RARA+ R L+ +K +M+ +YF K+K LGD +A G R I++EELI Y++ GLG
Sbjct: 123 HTRARAINTRFALTNTKKGNMSTPEYFAKMKSLGDEMATAGGRPIDEEELIQYIITGLGE 182
Query: 182 DYDPLVTSITTRTDVYTVSDVYAHMLSYEMRH-LRKGTFEQLSSANNVNRISIRGGANGG 241
Y +V+++ R + +VSD+Y+ +L++E R + +G E + N S GG
Sbjct: 183 GYSEVVSAVCARVEPISVSDLYSQVLNFEARQAIYRGAQEVTVNVANRGGFS-HGGRGNS 242
Query: 242 RGSRGRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGI----VCQICGKPNHDALQCWHRF 301
G +G SR G G R N G + ++ G+ +CQ+C K H A CW+R+
Sbjct: 243 NGGQGGSRGGGGGGGHGR---GNGGGGHGRGRTPGGVDKRPICQVCFKRGHTAADCWYRY 302
Query: 302 DQAYQAENNLKQAALATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANG 361
D+ Y + AA A + Y DTNWY+DTGATDH+T +L++LT +E+Y G +QI A+G
Sbjct: 303 DEDYVPDAK-HVAAAAVNSYGVDTNWYIDTGATDHVTGELDKLTMKEKYNGGEQIHTASG 362
Query: 362 AGLSISHIGNSLI---SGSSLVLKHILYAPKINKHLISVQRLASDNNAVVEFHPNYFLVK 421
AG+ ISHIG++++ + ++ L+++LY P+ K+L+SV RL +DN+A +E H +YF +K
Sbjct: 363 AGMDISHIGHAIVHNPNSRNIHLRNVLYVPQAKKNLVSVHRLVNDNSAFLELHRDYFFLK 422
Query: 422 DRVTKKLLLHGRCKNGLYVLPHN-FSQALLTAKLSKEQWHRRLGHPASPITIRILQDNNL 481
D++T+K LL GR LY LPH+ QA KLS ++WH RLGHP+ +++ +L
Sbjct: 423 DQITRKTLLKGRSWRRLYPLPHSSLKQAYSAIKLSLDRWHHRLGHPSITTVKQVVNKFSL 482
Query: 482 AIDTNIPSSSICNACQLGKAHQLPFGSSQHVSTAPLQLIHTDVWGPSIASVNNSKYYVSF 541
+++ S S+CNACQ K+HQLPF S VS PL+L+ +DVWGP+ SV KYYVSF
Sbjct: 483 PC-SDLSSESVCNACQQAKSHQLPFPISSSVSKHPLELVFSDVWGPAPESVGRKKYYVSF 542
Query: 542 VDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTG 601
+DDFS++ WIY ++ KS V F +FQ VE + KI ++Q+DWGGEY L+++F G
Sbjct: 543 IDDFSKFTWIYLIQHKSQVFDKFQEFQVMVERQFDRKILAMQTDWGGEYQSLNSFFSKIG 602
Query: 602 IEHHISCPHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRT 661
I HH+SCPHTHQQNG ERKHRHIVE GL+LL+ A+MPL +WDEAF A +LINR+PSR
Sbjct: 603 IVHHVSCPHTHQQNGSAERKHRHIVEIGLSLLSHASMPLKFWDEAFQVATYLINRVPSRV 662
Query: 662 IQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKC 721
I +PL KLF + PDYS LR+FGCACWPNLRPYNN KL FR+ RC+FLG+S+ HKG+KC
Sbjct: 663 IHNISPLEKLFKQKPDYSSLRIFGCACWPNLRPYNNHKLQFRSKRCVFLGFSTMHKGFKC 722
Query: 722 LNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALAKLASFYTENALTDIE 781
L STGRIYISRDVVFDENIFPF E H L T L I
Sbjct: 723 LEVSTGRIYISRDVVFDENIFPFTEL----------HANAGARLRSEIDILTPELLGPIR 782
Query: 782 PVVSNSHMNDGQTDNIASDNLSGVSLSSADNTRSSEEIAEYEAE--------SSSINAQN 841
V + M+ +N S ++S + A+ + +AE +S +
Sbjct: 783 SVGNEQCMH--PVNNPLSADVSAALSNRANEPHRDGAVHPADAEDPPATPPLDASSGPEP 842
Query: 842 QTHEHVSDQPTEAASQHPMRT---------------RLRNNIVQAKQFTDGTI----RYS 901
H S T ++ +HP T +++ QA Q + + S
Sbjct: 843 DRVVHHSPAAT-SSGRHPGPTPGSVPRGAASSLAEETAEDSVSQAVQEQESQVVQEQEQS 902
Query: 902 ETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWR------------------------ 961
+++ A AVT T ++ A EA PR R
Sbjct: 903 SPAQEHAQAVTDETNTLQHADVTDTGSEAPAGPRTRLQSGVRKEKVYTDGTIKYKHSWFT 962
Query: 962 --GAMNDELSALKRNATWDLVPPKPGINLIDS-----------------KWVYKVKRKAD 1021
G ++L ALK + W L L+ + KWVYK+KRKAD
Sbjct: 963 ASGEPTNDLEALK-DKNWKLAMDSEYDALVKNKTWHLVPPQRGRNIIGCKWVYKIKRKAD 1022
Query: 1022 GSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFL 1081
G+++R KARLVAKGFKQR+G+DY DTFSPV+K +TIR+ILSLAV+KGW++RQ+D+QNAFL
Sbjct: 1023 GTLDRYKARLVAKGFKQRYGIDYEDTFSPVVKAATIRIILSLAVSKGWSLRQLDVQNAFL 1082
Query: 1082 HGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASV 1141
HG L+EEVYM QPPGF+D KP +++CKL KALYGLKQAP+AW SRL+ KL++LGFK S
Sbjct: 1083 HGYLEEEVYMLQPPGFEDPTKP-HHVCKLDKALYGLKQAPRAWFSRLSKKLMDLGFKGSK 1142
Query: 1142 ADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGDLEYFLGIE 1201
D+SLF L +IT+++L+YVDDII+ SSS++AT L+Q LK +FA+KDLG+L YFLGIE
Sbjct: 1143 PDTSLFFLNKGDITMFVLVYVDDIIVASSSEKATAALLQDLKGEFALKDLGELHYFLGIE 1202
Query: 1202 VKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRST 1261
V K ++GI+L+Q +YA DLLK+V M CKP +TP+ +EKL +G L E+ YRS
Sbjct: 1203 VSKVQNGIVLNQDKYANDLLKKVGMIDCKPANTPLSVSEKLSLHEGSLLGPEDASHYRSV 1262
Query: 1262 VGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMM 1321
VGALQYLT+TRPD+AF+VNKVCQ+LH PT HW AVKRILRY+K LG++I KS + +
Sbjct: 1263 VGALQYLTLTRPDIAFSVNKVCQFLHAPTTVHWIAVKRILRYLKQCTRLGLEIHKSGSTL 1322
Query: 1322 LSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEM 1381
+SGFSDADWAGC DDRRST GFA+FLG+NL+SW++RKQATVSRSSTE+EYKAIAN TAE+
Sbjct: 1323 VSGFSDADWAGCLDDRRSTGGFAIFLGSNLVSWNARKQATVSRSSTESEYKAIANATAEI 1382
BLAST of CmaCh04G012940 vs. TrEMBL
Match:
A5BFR8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_012106 PE=4 SV=1)
HSP 1 Score: 1048.9 bits (2711), Expect = 5.6e-303
Identity = 582/1414 (41.16%), Postives = 844/1414 (59.69%), Query Frame = 1
Query: 2 SIPTNTMSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEP 61
SIP+ M ++ + VKL + NY+LW +QI + + F+DG+ P + ++
Sbjct: 25 SIPSYQM----LNHTLPVKLDRTNYILWRSQIDNVIFANGFEDFIDGTSICPEKDLSPG- 84
Query: 62 SEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLERQFAS 121
++NP F W QD+ +LS I SS+T +++ ++G T+ AW LE F+S
Sbjct: 85 ---------VMNPAFVAWRRQDRTILSWIYSSLTPGIMAQIIGHNTSHSAWNALESIFSS 144
Query: 122 TSRARAMQIRMELSTIQKKDMTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPD 181
+SRAR MQ+R+EL + +K M++ DY K+K D LAAIG+ + +++ + +L GLG D
Sbjct: 145 SSRARIMQLRLELQSTKKGSMSMIDYIMKIKGAADNLAAIGEPVSEQDQVMNLLGGLGSD 204
Query: 182 YDPLVTSITTRTDVYTVSDVYAHMLSYEMRHLRKGTFEQLSSANNVNRISIRGGA---NG 241
Y+ +VT+I R D ++ +++ +L++E R ++ + EQ+S AN + + RGG NG
Sbjct: 205 YNAVVTAINIRDDKISLEAIHSMLLAFEHRLEQQSSIEQMS-ANYASSSNNRGGGRKFNG 264
Query: 242 GRGS-----------RGRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHD 301
GRG RGR R +GQ R ++P +P CQ+CGK H
Sbjct: 265 GRGQGYSPNNNNYTYRGRGR--GGRNGQGGRQNSSPSEKPQ---------CQLCGKFGHT 324
Query: 302 ALQCWHRFDQAYQA-------------ENNLKQAALATSGYTSDTNWYVDTGATDHITND 361
A C+HRFD ++Q +NN+ + S +D +WY+D+GA+ H+T +
Sbjct: 325 AQICYHRFDISFQGGQTTISHSLNNGNQNNIPAMVASASNNPADESWYLDSGASHHLTQN 384
Query: 362 LERLTTRERYTGTDQIQVANGAGLSISHIGNSLISGS--SLVLKHILYAPKINKHLISVQ 421
L LT+ YTGTD++ + NG LSIS+IG+ + S LK + + P I+ +LISV
Sbjct: 385 LGNLTSTSPYTGTDKVTIGNGKHLSISNIGSKQLHSHTHSFRLKKVFHVPFISANLISVA 444
Query: 422 RLASDNNAVVEFHPNYFLVKDRVTKKLLLHGRCKNGLYVLP-----------HN----FS 481
+ S+NNA++EFH N F VKD TK +L G+ +NGLY P +N S
Sbjct: 445 KFCSENNALIEFHSNAFFVKDLHTKMVLAQGKLENGLYKFPVFSNLKPYSSINNASAFHS 504
Query: 482 QALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFG 541
Q T + E WH RLGH + I +++ N+A + S +C+ CQL K+H+LP
Sbjct: 505 QFSSTVENKAELWHNRLGHASFDIVSKVMNTCNVA--SGKYKSFVCSDCQLAKSHRLPTQ 564
Query: 542 SSQHVSTAPLQLIHTDVWGP-SIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQ 601
S ++ PL+L++TD+WGP SI S + ++Y++ FVDD+SRY W Y L+ K +F
Sbjct: 565 LSNFHASKPLELVYTDIWGPASIKSTSGARYFILFVDDYSRYTWFYSLQTKDQALPIFKX 624
Query: 602 FQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPHTHQQNGLVERKHRHIV 661
F+ +E +TKI+ +QSD GGE+ ++ ++ GI H SCP+ QNG VERKHRH+V
Sbjct: 625 FKLQMENQFDTKIKCLQSDNGGEFRSFTSFLQAVGIAHRFSCPYNSXQNGRVERKHRHVV 684
Query: 662 ETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGC 721
ETGLALL+ A++P+ YW AF T FLINRMPS+ ++ D+P LF + PDY RVFGC
Sbjct: 685 ETGLALLSHASLPMKYWHYAFQTXTFLINRMPSKVLEYDSPYFTLFRRHPDYKSFRVFGC 744
Query: 722 ACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEE 781
C+P +RPYN KL +R+ +C+FLGYS +HKG+ CL+ +TGR+YI+ VVFDE+ FP +
Sbjct: 745 LCYPFIRPYNTHKLQYRSVQCLFLGYSLNHKGFLCLDYATGRVYITPHVVFDESTFPLAQ 804
Query: 782 SKPPNKTTNPHHPVLLPALAKLASFYTENALTDIEPVVSNSHMNDGQTDNIASDNLSGVS 841
SK + + + PAL SF + P SH + S++ +
Sbjct: 805 SKSSSSSNDTSAEGSTPALITPPSF------PCLLPDSKISHASIDSHSLSTSESPIPTT 864
Query: 842 LSSADNTRSSEEIAEYEAESSSINAQNQTHEHVSDQPTEAASQHPMRTRLRNNIVQAKQF 901
SS +T SS + +S +P A M TR I + K
Sbjct: 865 SSSPLDTSSSSPAIDLSPKS-------------VPEPQITALAPRMTTRSMRGITKKKTI 924
Query: 902 TDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATW 961
D SA+ + +EP L++A + P W AM E++AL RN TW
Sbjct: 925 LDL------------SAIKV--------SEPSTLKQAFKDPNWTKAMEMEIAALHRNHTW 984
Query: 962 DLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPST 1021
DLV P +N+I KWVYK+K K DGS+ER KARLVAKG+ Q G+DY +TFSPV+K +T
Sbjct: 985 DLVEQPPNVNVIGCKWVYKLKHKPDGSIERYKARLVAKGYNQTHGLDYFETFSPVVKAAT 1044
Query: 1022 IRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYG 1081
IR+IL++A++ W +RQ+D+ NAFL+G L+E+VYM QPPG+ D P N +C+LKKALYG
Sbjct: 1045 IRIILTVALSFKWEIRQLDVHNAFLNGELEEQVYMSQPPGYFDPQFP-NRVCRLKKALYG 1104
Query: 1082 LKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATE 1141
LKQAP+AW RL+ L++ GF S DSS+F+ + T+ +L+YVDDI++ SS
Sbjct: 1105 LKQAPRAWFQRLSSALLQWGFSMSRTDSSMFLHFGKATTLIVLVYVDDILVTGSSSTQIS 1164
Query: 1142 RLIQKLKIDFAVKDLGDLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPM 1201
LI KL FA++DLG L +FLGIEV + LSQ +Y DLL R + KP +TP
Sbjct: 1165 SLIAKLDSVFALRDLGQLSFFLGIEVSYNEGSMTLSQTKYISDLLHRTELFDTKPANTPG 1224
Query: 1202 GSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGA 1261
+ L + G P++ + YRS VGALQY+T+TRPD+AFAVNK CQ++ PT AHW +
Sbjct: 1225 AVGKNLSKFDGDPMT--DVTHYRSVVGALQYVTLTRPDIAFAVNKACQFMQQPTTAHWLS 1284
Query: 1262 VKRILRYVKGTLALGVKIQ-KSTMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSS 1321
VKRILRY++GT+ G+ S + + GF+DADW DDRRS+SG+ V+LG NL+SWSS
Sbjct: 1285 VKRILRYLRGTMQDGLLFSPSSNLTIEGFTDADWGAHLDDRRSSSGYLVYLGGNLVSWSS 1344
Query: 1322 RKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPV 1370
KQ VSRSS E+EY+ + TAE++W+++LL+EL V P LW DN+ A ++ NPV
Sbjct: 1345 TKQKVVSRSSAESEYRGLVFATAEIVWMQALLQELCVPIPAIPLLWYDNISAYHMAKNPV 1368
BLAST of CmaCh04G012940 vs. TrEMBL
Match:
A5BQ73_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_039158 PE=4 SV=1)
HSP 1 Score: 1014.2 bits (2621), Expect = 1.5e-292
Identity = 578/1408 (41.05%), Postives = 832/1408 (59.09%), Query Frame = 1
Query: 2 SIPTNTMSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEP 61
SIP+ M + S+ VKL + NY+LW +QI + + F+DG+ P + + P
Sbjct: 22 SIPSYQMLNYSLP----VKLDRTNYILWRSQIDNVIFANGFEDFIDGTSVCPEKEL--RP 81
Query: 62 SEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLERQFAS 121
E INP F QD+ +LS I SS+T +++ ++G ++ AW LE+ F+S
Sbjct: 82 GE--------INPAFVAXRRQDRTILSWIYSSLTPGIMAQIIGHNSSHSAWNALEKIFSS 141
Query: 122 TSRARAMQIRMELSTIQKKDMTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPD 181
SRAR MQ+ +E + +K M++ DY KVK D+LAAIG+ + +++ I +L GLG D
Sbjct: 142 CSRARIMQLXLEFQSTKKGSMSMIDYIMKVKGAADSLAAIGEPVSEQDQIMNLLGGLGSD 201
Query: 182 YDPLVTSITTRTDVYTVSDVYAHMLSYEMRHLRKGTFEQLS--SANNVNRISIRGGA--- 241
Y+ +VT+I R D ++ V++ +L++E R ++G+ EQL SAN + + RGG
Sbjct: 202 YNAVVTAINIREDKISLEAVHSMLLAFEQRLEQQGSIEQLPAMSANYASXSNNRGGGRKY 261
Query: 242 NGGRGSRGRSRQLN-SGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWHRF 301
NGGRG N G G+ R GRQ S SS CQ+CGK H C+HRF
Sbjct: 262 NGGRGPNFMMTNSNFRGRGRGXR-YGQSGRQNSS--SSERPQCQLCGKFGHTVQVCYHRF 321
Query: 302 DQAYQA------------ENNLKQAALATSGYTSDTNWYVDTGATDHITNDLERLTTRER 361
D +Q+ +N A +A S +D NWY+D+GA+ H+T ++ LT
Sbjct: 322 DITFQSTQNNTTGVSNSGNSNXMPAMVAXSNNXADDNWYLDSGASHHLTQNVANLTNATP 381
Query: 362 YTGTDQIQVANGAGLSISHIGNSLISGS--SLVLKHILYAPKINKHLISVQRLASDNNAV 421
YTG D++ + NG L+IS+ + + + S LK + + P I+ +LISV + SDNNA+
Sbjct: 382 YTGADKVTIGNGKHLTISNTXFTRLFSNPHSFQLKKVFHVPFISANLISVAKFCSDNNAL 441
Query: 422 VEFHPNYFLVKDRVTKKLLLHGRCKNGLYVLP--HNFSQALL-----------TAKLSKE 481
+EFH N F +KD TK++L G+ +NGLY P N A + + +E
Sbjct: 442 IEFHSNGFFLKDLHTKRVLAQGKLENGLYKFPVISNKKTAYVGITNDSTFQCSNIENKRE 501
Query: 482 QWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQHVSTAPLQ 541
WH RLGH A+ I RI+ + N++ +++C++CQL K+H+LP S ++ PL+
Sbjct: 502 LWHHRLGHAATDIVTRIMHNCNVSCGKY--KATVCSSCQLAKSHRLPTHLSSFHASKPLE 561
Query: 542 LIHTDVWGP-SIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNT 601
L++TD+WGP S+ S + +KY++ FVDD+SRY W+Y L+ K +F F+ VE
Sbjct: 562 LVYTDIWGPASVTSTSGAKYFILFVDDYSRYTWLYLLQSKDQALPIFKXFKLQVENQFEA 621
Query: 602 KIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPHTHQQNGLVERKHRHIVETGLALLAQAN 661
KI+ +Q D GGE+ ++ + +GI H SCP+ QNG VERKHRH+VETGLALLA A
Sbjct: 622 KIKCLQXDNGGEFRSFMSFLQESGILHRFSCPYNSSQNGRVERKHRHVVETGLALLAHAG 681
Query: 662 MPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNN 721
+PL +W AF TA FLINRMPS+ + +P LF ++PDY LRVFGC C+P +RPYNN
Sbjct: 682 LPLKFWSYAFQTATFLINRMPSKVLXNASPYFALFKRNPDYKFLRVFGCLCYPFIRPYNN 741
Query: 722 KKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPH 781
KL +R+ +C+FLGYS HKGY CL+ TGR+Y+S VVFDE FPF ++ + + +
Sbjct: 742 HKLQYRSLKCVFLGYSLHHKGYLCLDNLTGRVYVSPHVVFDETQFPFAQNISSSPSKDAS 801
Query: 782 HPVLLPALAKLASFYTENALTDIEPVVSNSHMNDGQTDNIASDNLSG-----VSLSSADN 841
++P + + P + H G ++AS NL+ S +
Sbjct: 802 DESIIPXI-----------IVSSNPSTLSFH---GSNHSMASPNLTSTLTHPTSPTDTPT 861
Query: 842 TRSSEEIAEYEAESSSINAQNQTHEHVSDQPTEAASQHPMRTRLRNNIVQAKQFTDGTIR 901
TR S E EAE + AQ Q V P + TR + I + K
Sbjct: 862 TRXSRE-PVLEAE-VXLPAQQQV---VVPPPR-------VTTRSMSGITKRK-------- 921
Query: 902 YSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPK 961
I +EP L++A++ P W AM+ E++AL +N TWDLV P
Sbjct: 922 ------------PIFNLAASKISEPTTLKQAIKDPNWAEAMHTEIAALHKNQTWDLVDPP 981
Query: 962 PGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILS 1021
+N+I KWVYK+K K DGSV+R KARLVA+GF Q FG+DY +TFSPV+K +TIR++L+
Sbjct: 982 KDVNIIGCKWVYKLKYKXDGSVDRYKARLVARGFNQTFGLDYFETFSPVVKAATIRIVLT 1041
Query: 1022 LAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPK 1081
+ ++ W +RQ+D VYM QPPGF P N +CKLKKALYGLKQ+P+
Sbjct: 1042 IXLSYRWELRQLD-------------VYMAQPPGFLHPNHP-NKVCKLKKALYGLKQSPR 1101
Query: 1082 AWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKL 1141
AW ++L+ L+ GF +S DSS+F+ R T+ +L+YVDDII+ SS ++LI KL
Sbjct: 1102 AWFTKLSSALLSXGFNSSRTDSSMFVHFGRHSTLIVLVYVDDIIVTGSSPILIQQLIHKL 1161
Query: 1142 KIDFAVKDLGDLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKL 1201
FA++DLG L YFLGIEV + LSQR+Y DLL+R +M K +TP L
Sbjct: 1162 HSLFALRDLGQLSYFLGIEVTYDGGSLHLSQRKYITDLLQRTSMLDSKAAATPGTVGLSL 1221
Query: 1202 FREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILR 1261
+ G + ++ YRS VGALQY T+TRPD+AF++NK CQ++H PT HW +VKRILR
Sbjct: 1222 SQFDGDLM--DDVTMYRSVVGALQYATLTRPDIAFSINKACQFMHRPTSTHWSSVKRILR 1281
Query: 1262 YVKGTLALGVKIQKST-MMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATV 1321
Y+KGT+ G+ +Q S + ++DADW PDDRRS+SG+ V+LG NL+SW++ KQ V
Sbjct: 1282 YLKGTITHGLLLQPSAHFTIQAYTDADWGAQPDDRRSSSGYLVYLGNNLVSWTASKQKVV 1341
Query: 1322 SRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTK 1370
SRSS E+EY+ +A TAE+IW ++LL+EL + + P L+ DN+ A Y+ NPVFHARTK
Sbjct: 1342 SRSSAESEYRGLAIATAEIIWTQALLRELCISITSIPTLYYDNISAYYMAKNPVFHARTK 1348
BLAST of CmaCh04G012940 vs. TrEMBL
Match:
A0A151U9E7_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) OS=Cajanus cajan GN=KK1_020152 PE=4 SV=1)
HSP 1 Score: 995.7 bits (2573), Expect = 5.7e-287
Identity = 570/1425 (40.00%), Postives = 821/1425 (57.61%), Query Frame = 1
Query: 2 SIPTNTMSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEP 61
S T + S IS KL+++N+L W Q +R L + G+M P+Q
Sbjct: 5 SATTKQFIPQAFSNPISSKLSEDNFLTWRQQAESTIRGYRLKKHILGAMHVPAQH----- 64
Query: 62 SEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLERQFAS 121
+ K +++PE+ + QD L+ S + S+ + MVG + W LE FAS
Sbjct: 65 KTDEDKAKGVLSPEYEDYDQQDNLLKSWLLESMEPQFKVRMVGYEWCHQIWNNLEIYFAS 124
Query: 122 TSRARAMQIRMELSTIQKKDMTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPD 181
++AR Q++++L I KK I Y ++K + + LAAIG + E I + GL +
Sbjct: 125 QTKARVKQLKIQLRGI-KKTTAINQYLLEIKKIVNNLAAIGSPLNTAEHIDAIFDGLSEE 184
Query: 182 YDPLVTSITTRTDVYTVSDVYAHMLSYEMRHLRKGTFEQL--------SSANNVNRISIR 241
YDP +TS+ TRT+ Y+V + + +++ E R + E + S N+ + + +
Sbjct: 185 YDPFITSVLTRTEEYSVEQIESLLMAQEERLEKHKASESVPLQANLAQGSFNSRKQFTNQ 244
Query: 242 GGANGGRGSRGRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWH 301
G + G+ GR G G+S + +QP CQ+CGK H A CWH
Sbjct: 245 GRGSNNNGTHGRGPHRGRGRGRSAQHFGRGNKQP----------CQVCGKIGHIAFHCWH 304
Query: 302 RFDQAYQAEN-----------NLKQAALATSGYTS---DTNWYVDTGATDHITNDLERLT 361
R+DQ Y N N +Q +G + D WY D+GAT+H+T+DL L
Sbjct: 305 RYDQQYTEPNLNHNTNVYNPGNQQQMQAMIAGSQNMVYDDQWYPDSGATNHLTSDLNNLG 364
Query: 362 TRERYTGTDQIQVANGAGLSISHIGNSL----ISGSSLVLKHILYAPKINKHLISVQRLA 421
++ YTG D+I + NG + I+H G S +S LK +L+ P I K+L+SV +
Sbjct: 365 SKTDYTGQDKIHMGNGQAIGINHTGTSFFHTPMSSKIFTLKELLHVPHITKNLLSVSKFC 424
Query: 422 SDNNAVVEFHPNYFLVKDRVTKKLLLHGRCKNGLYV------------------LPHNFS 481
DNN VEFH NY LVK + +K+ LL G KNGLYV +P N +
Sbjct: 425 KDNNVYVEFHTNYCLVKSQDSKETLLRGNLKNGLYVFDEVQILKPDVYEHTVDSVPTNRT 484
Query: 482 QALLT-AKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPF 541
+ + S + WH RL H + I +++ N+ + N S +C +C +GK+H LPF
Sbjct: 485 TLYVQRTRKSYDVWHNRLAHASRKIVQAVMKTCNVPMPINHQDSIVCKSCCIGKSHTLPF 544
Query: 542 GSSQHVSTAPLQLIHTDVWGPS-IASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFL 601
S V +PL+L+++DVWGPS AS +YYV F D FS+Y WIYF+ KS+ F+
Sbjct: 545 SDSYTVYNSPLELVYSDVWGPSHYASREGFRYYVHFTDAFSKYTWIYFMHNKSETAHHFI 604
Query: 602 QFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPHTHQQNGLVERKHRHI 661
F+ VE KI+ QSD G E+ L F GI H +SCPH+HQQNG ERKHRHI
Sbjct: 605 HFKSMVENQFGHKIKMFQSDGGKEFTCLTKLFNENGITHRLSCPHSHQQNGTAERKHRHI 664
Query: 662 VETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFG 721
E GL LL+ + +P +W +AF+TA +INR+P+ + +P LF K PDY +LR FG
Sbjct: 665 TEVGLTLLSSSGLPHIFWSDAFSTAIHVINRLPTPILGHKSPYEVLFNKIPDYLLLRPFG 724
Query: 722 CACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFE 781
CAC+P+LRP+N+ K+ FR+++C+F+GYS HKGYKC + G+I ISR+VVFDE+ FP
Sbjct: 725 CACYPHLRPFNSHKMDFRSSQCVFIGYSMQHKGYKCFT-TNGKIIISRNVVFDEDTFPGP 784
Query: 782 ESKPPNKT---TNPHHPVLLPALAKLAS-FYTENALTDIEPVVSNSHMNDGQTDNIASDN 841
N ++ H P +P++ L S N+ T + P+VS+ +++
Sbjct: 785 SYFNTNNVAGLSSSHSPQPVPSIHVLPSQSLAPNSATTV-PLVSSI------DESLQDSQ 844
Query: 842 LSGVSLSSADNTRSSEEIAEYEAESSSINAQN--QTHEHVSDQPTEAASQHPMRTRLRNN 901
L+ + +S DN +S SS N Q T + T +QHPM TR +
Sbjct: 845 LTSTTDTSQDNLITS---------SSQPNPQPIITTVPDPASPATRTTNQHPMITRAKIG 904
Query: 902 IVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSA 961
+ + K F +EP ++ +A+ + W+ AM DE +A
Sbjct: 905 VFKPKAFL-------------------------ATSEPISVAQALDNKHWKQAMQDEYNA 964
Query: 962 LKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFS 1021
L +N TWDLV PG I KW++K K K DGS+ R KARLVA+GF Q+ G+DYTDTFS
Sbjct: 965 LLKNHTWDLVSLPPGRTAITCKWIFKNKYKQDGSILRHKARLVARGFSQQQGLDYTDTFS 1024
Query: 1022 PVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICK 1081
PV+KP +IR++L+LAV+KGW + Q+D+ NAFL+G LKE++YM QP GF + A N +CK
Sbjct: 1025 PVVKPVSIRIVLALAVSKGWPIHQIDVDNAFLNGDLKEDIYMLQPQGFHNGA--PNTVCK 1084
Query: 1082 LKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVS 1141
L+KALYGLKQAP+AW +LT L++LGFK + +DSSLF ++ +IY+LIYVDDIII S
Sbjct: 1085 LRKALYGLKQAPRAWFQKLTLILLQLGFKQARSDSSLFTFTSKVTSIYILIYVDDIIITS 1144
Query: 1142 SSDQATERLIQKLKIDFAVKDLGDLEYFLGIEVKKT-RDGIILSQRRYALDLLKRVNMEK 1201
+S A + LI L F +KDLG L YFLGIEV GI LSQ +Y D+L ++NM
Sbjct: 1145 NSPAAIQTLISTLHRFFPLKDLGSLHYFLGIEVNSDGAGGIHLSQTKYLKDILDKLNMSN 1204
Query: 1202 CKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHT 1261
KP+ TPM + KL + G + + YRS +GALQYLT+TRPD+A+ VNK+CQ++H
Sbjct: 1205 AKPVKTPMAPSLKLTND-GTAVCPDPSL-YRSVIGALQYLTITRPDIAYTVNKLCQFMHH 1264
Query: 1262 PTDAHWGAVKRILRYVKGTLALGVKIQKST-MMLSGFSDADWAGCPDDRRSTSGFAVFLG 1321
P ++HW AVKR+LRY++GTL G+ +KST + + D+DWA +D RSTSG V+LG
Sbjct: 1265 PLESHWKAVKRLLRYLQGTLQHGLHYKKSTDTGIHAYCDSDWASDHEDMRSTSGNCVYLG 1324
Query: 1322 ANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGA 1373
N++SW ++KQ VSRSSTEAE++++A+L E+ +I++LL EL ++ P +WCDN GA
Sbjct: 1325 PNIVSWMAKKQRVVSRSSTEAEFRSLASLVTEVQYIQNLLSELHNSSTQPPLIWCDNQGA 1367
BLAST of CmaCh04G012940 vs. TAIR10
Match:
AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)
HSP 1 Score: 408.3 bits (1048), Expect = 2.0e-113
Identity = 212/482 (43.98%), Postives = 305/482 (63.28%), Query Frame = 1
Query: 881 IETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKA 940
I A EP EA + W GAM+DE+ A++ TW++ P I KWVYK+K +
Sbjct: 80 IAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNS 139
Query: 941 DGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAF 1000
DG++ER KARLVAKG+ Q+ G+D+ +TFSPV K +++++IL+++ + + Q+DI NAF
Sbjct: 140 DGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAF 199
Query: 1001 LHGILKEEVYMRQPPGF---QDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGF 1060
L+G L EE+YM+ PPG+ Q + P N +C LKK++YGLKQA + W + + LI GF
Sbjct: 200 LNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGF 259
Query: 1061 KASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGDLEYF 1120
S +D + F+ + + +L+YVDDIII S++D A + L +LK F ++DLG L+YF
Sbjct: 260 VQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYF 319
Query: 1121 LGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFK 1180
LG+E+ ++ GI + QR+YALDLL + CKP S PM + G +
Sbjct: 320 LGLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDF--VDAKA 379
Query: 1181 YRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGV-KIQK 1240
YR +G L YL +TR D++FAVNK+ Q+ P AH AV +IL Y+KGT+ G+ +
Sbjct: 380 YRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQ 439
Query: 1241 STMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANL 1300
+ M L FSDA + C D RRST+G+ +FLG +LISW S+KQ VS+SS EAEY+A++
Sbjct: 440 AEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFA 499
Query: 1301 TAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVAR 1359
T EM+W+ +EL + SK L+CDN A ++ +N VFH RTKHIE D H VRE+
Sbjct: 500 TDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCHSVRERSVY 559
BLAST of CmaCh04G012940 vs. TAIR10
Match:
ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)
HSP 1 Score: 218.8 bits (556), Expect = 2.2e-56
Identity = 111/228 (48.68%), Postives = 152/228 (66.67%), Query Frame = 1
Query: 1076 IYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGDLEYFLGIEVKKTRDGIILSQRR 1135
+Y+L+YVDDI++ SS+ LI +L F++KDLG + YFLGI++K G+ LSQ +
Sbjct: 1 MYLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTK 60
Query: 1136 YALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDL 1195
YA +L M CKPMSTP+ + +RS VGALQYLT+TRPD+
Sbjct: 61 YAEQILNNAGMLDCKPMSTPLPLK---LNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDI 120
Query: 1196 AFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKSTMM-LSGFSDADWAGCPD 1255
++AVN VCQ +H PT A + +KR+LRYVKGT+ G+ I K++ + + F D+DWAGC
Sbjct: 121 SYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTS 180
Query: 1256 DRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIW 1303
RRST+GF FLG N+ISWS+++Q TVSRSSTE EY+A+A AE+ W
Sbjct: 181 TRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225
BLAST of CmaCh04G012940 vs. TAIR10
Match:
ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))
HSP 1 Score: 111.7 bits (278), Expect = 3.8e-24
Identity = 55/112 (49.11%), Postives = 80/112 (71.43%), Query Frame = 1
Query: 873 AVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKW 932
++TITT I + EP+++ A++ P W AM +EL AL RN TW LVPP N++ KW
Sbjct: 17 SLTITTTIKK---EPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKW 76
Query: 933 VYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLA 985
V+K K +DG+++RLKARLVAKGF Q G+ + +T+SPV++ +TIR IL++A
Sbjct: 77 VFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125
BLAST of CmaCh04G012940 vs. TAIR10
Match:
ATMG00240.1 (ATMG00240.1 Gag-Pol-related retrotransposon family protein)
HSP 1 Score: 80.5 bits (197), Expect = 9.3e-15
Identity = 42/88 (47.73%), Postives = 54/88 (61.36%), Query Frame = 1
Query: 1187 YLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQ-KSTMMLSGFS 1246
YLT+TRPDL FAVN++ Q+ A AV ++L YVKGT+ G+ S + L F+
Sbjct: 2 YLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAFA 61
Query: 1247 DADWAGCPDDRRSTSGFAV-----FLGA 1269
D+DWA CPD RRS +GF FLGA
Sbjct: 62 DSDWASCPDTRRSVTGFCSLVPLWFLGA 89
BLAST of CmaCh04G012940 vs. TAIR10
Match:
AT1G34070.1 (AT1G34070.1 Retrotransposon gag protein (InterPro:IPR005162))
HSP 1 Score: 79.7 bits (195), Expect = 1.6e-14
Identity = 66/261 (25.29%), Postives = 108/261 (41.38%), Query Frame = 1
Query: 17 ISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEF 76
+ + + + NY W L + S +++G +DG++ + N
Sbjct: 22 VMLDIEESNYDAWRELFLTHCLSFDVMGHIDGTL-------------------LPTNAND 81
Query: 77 TVWYPQDQLV-LSLINSSVTEEVLSTMVGITTAREAWITLERQFASTSRARAMQIRMELS 136
W +D +V LSL + ++ + V +T+R+ W+ ++ QF + ARA+++ EL
Sbjct: 82 VNWQKRDGIVKLSLYGTLTPKQFQGSFVTSSTSRDIWLRIKNQFRNNKDARALRLDSELR 141
Query: 137 TIQKKDMTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDV 196
T DM +ADY+RK+K L D+L + + D L+ Y+L GL P +D ++ I R
Sbjct: 142 TKDIGDMRVADYYRKMKKLADSLRNVDVPVTDRNLVMYVLNGLNPKFDNIINVIKHRQPF 201
Query: 197 YTVSDVYAHMLSYEMR-------------HLRKGTFEQLSSANNVNRISIRGGANGGRGS 256
+ D + E R H T S A V GG G
Sbjct: 202 PSFDDAATMLQEEEDRLKRAIKPNPTHVDHSSSSTVLACSEAPPVTNFQRSGGNQMGYRG 261
Query: 257 RGRSRQLNSGHGQSRRTVNNP 264
RGR + G G N P
Sbjct: 262 RGRGNNIFRGRGGRFSYYNMP 263
BLAST of CmaCh04G012940 vs. NCBI nr
Match:
gi|40714683|gb|AAR88589.1| (putative copia-like retrotransposon protein [Oryza sativa Japonica Group])
HSP 1 Score: 1263.8 bits (3269), Expect = 0.0e+00
Identity = 670/1425 (47.02%), Postives = 928/1425 (65.12%), Query Frame = 1
Query: 2 SIPTNTMSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEP 61
S + ++P +S KLT+ NY LWS Q+L +R L G + G+ APS I
Sbjct: 4 SSSSTASTNPLFGVQVSEKLTKGNYALWSAQVLAAIRGARLDGHITGATAAPSMEIEKTA 63
Query: 62 SEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLERQFAS 121
S++T + I+NP + W+ DQ VL + S+++ ++L+ + +TA +AW + F +
Sbjct: 64 SDKTTEK--IVNPAYQEWFASDQQVLGFLLSTLSRDILTQVATASTAAQAWQQVCAMFTA 123
Query: 122 TSRARAMQIRMELSTIQKKDMTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPD 181
++AR++ +R+ L+ QK +M+I++Y K+K L D +A+ GK +++E+L+AY+L GL D
Sbjct: 124 QTKARSLNVRLTLTNTQKGNMSISEYCGKMKALADEIASSGKPLDEEDLVAYVLNGLDDD 183
Query: 182 YDPLVTSITTRTDVYTVSDVYAHMLSYEMRHLRKGTFEQLSSANNVNRISIRGGANGGRG 241
++P+V++I R + T+++VY+ +L++E R Q ++ N + A
Sbjct: 184 FEPVVSAIVARNESTTMAEVYSQLLNFENRQ----ALRQAHASANAAVVVFSVVAAAAVA 243
Query: 242 SRGRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQA 301
G + L G + + G+ CWHR+D+ Y
Sbjct: 244 HVGAAILLLQDVGAALEEIR------------LGV--------------CWHRYDETYVP 303
Query: 302 ENNLKQAALATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSIS 361
+ +++A A + Y DTNWY+DTGATDHITN+L++L RE+Y G D+I A+GAG+ I
Sbjct: 304 D---ERSAAAAAAYGIDTNWYLDTGATDHITNELDKLDVREKYKGGDKIHTASGAGMEIK 363
Query: 362 HIGNSLISGSS--LVLKHILYAPKINKHLISVQRLASDNNAVVEFHPNYFLVKDRVTKKL 421
HIG+S+I + L LK+IL+ P+ K+LIS RLA DN A +E H NYFL+KDR T+
Sbjct: 364 HIGDSVIHTPTRELHLKNILHVPQAKKNLISAHRLAMDNFAFLEVHSNYFLIKDRATRNT 423
Query: 422 LLHGRCKNGLYVLPHNFSQALLTAKL-SKEQWHRRLGHPASPITIRILQDNNLAIDTNIP 481
+L GRC+ LY LP + ++ + A S +WH RLGHPA PI R+L NNL T
Sbjct: 424 ILKGRCRRRLYSLPTSPAKQVHAATTPSFSRWHSRLGHPAVPIVTRVLSKNNLPCSTVAN 483
Query: 482 SSSICNACQLGKAHQLPFGSSQHVSTAPLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRY 541
SIC+ACQ GK+HQLP+ S VS+ PL+LI ++VWGP+ SV K+YVSF+DD+S++
Sbjct: 484 KDSICDACQKGKSHQLPYPKSSSVSSQPLELIFSNVWGPAPISVGGKKFYVSFIDDYSKF 543
Query: 542 VWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISC 601
W+Y L+ KS+V F +FQ VE KI +VQ+DWGGEY +L+ +F GI HH+SC
Sbjct: 544 TWVYLLKHKSEVFQKFQEFQTLVERFFGHKILAVQTDWGGEYQKLNTFFAKIGISHHVSC 603
Query: 602 PHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPL 661
P+ HQQNG ERKHRH++E L LLA A+MP+ +WDEA A +LINR PS+ I PL
Sbjct: 604 PYAHQQNGSAERKHRHLIEVALTLLAHASMPIKFWDEAVLAAAYLINRTPSKVINFACPL 663
Query: 662 HKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGR 721
+LF + P+Y+ LR+FGCA WPNLRPYN KL+FR+ RC+FLGYS+ HKG+KCL +TGR
Sbjct: 664 EQLFKEKPNYTALRIFGCAVWPNLRPYNKHKLAFRSKRCVFLGYSNLHKGFKCLEIATGR 723
Query: 722 IYISRDVVFDENIFPFEE-----SKPPNKTTNPHHPVLLPALAKLASFYTENALTDIEPV 781
+Y+SRDV FDE+IFPF E + P L+P L+ L N + + P
Sbjct: 724 VYVSRDVTFDESIFPFSELHSNAGARLRAEISLLPPSLVPHLSSLGG-EQNNHVLNYPPN 783
Query: 782 VSNSHMNDGQTDNIASDNLS-GVSLSSADNTRSSEEIAEYEAESSSINAQNQTH------ 841
V TD +N G + + ++ E A +++ AQ+ H
Sbjct: 784 V---------TDQFGEENAEIGEEIVANGEENAAAAADENAAAAANGGAQDDVHGVAYDA 843
Query: 842 --EHVS----DQPTEAASQHPMRTRLRNNIVQAKQFTDGTIRYSETSRKFASAVTITTPI 901
EH S D AA QH + ++VQA T S TS AS+ + +
Sbjct: 844 SPEHSSPVTDDAMASAAEQHGNPIQ-EEHLVQASPQT-----ASSTSPSVASSAGVHDDV 903
Query: 902 I------------ETATEP--------------RNLQEAMQHPRWRGAMNDELSALKRNA 961
E A P ++L+EA+ + W+ AM+ E AL N
Sbjct: 904 TTDQSDQTDQAMPEAAVAPIRPKTRLQSGIRKEKSLEEAVNNKHWKEAMDAEYMALIENK 963
Query: 962 TWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKP 1021
TW LVPP+ G N+ID KWVYKVKRKADGS++R KARLVAKGFKQR+G+DY DTFSPV+K
Sbjct: 964 TWHLVPPQKGRNVIDCKWVYKVKRKADGSLDRYKARLVAKGFKQRYGIDYEDTFSPVVKA 1023
Query: 1022 STIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKAL 1081
+TIR++LSLAV++GW++RQ+D++NAFLHG+L+EEVYM+QPPG++ + P NY+CKL KAL
Sbjct: 1024 ATIRIVLSLAVSRGWSLRQLDVKNAFLHGVLEEEVYMKQPPGYEKKSMP-NYVCKLDKAL 1083
Query: 1082 YGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQA 1141
YGLKQAP+AW+SRL+ KL ELGF S AD+SLF K +++I++LIYVDDII+ SS A
Sbjct: 1084 YGLKQAPRAWYSRLSTKLSELGFVPSKADTSLFFYKKGQVSIFLLIYVDDIIVASSVPDA 1143
Query: 1142 TERLIQKLKIDFAVKDLGDLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMST 1201
T L+Q+L DFA+KDLGDL YFLGIEV K +DG++LSQ +YA DLL+RV M +CKP+ST
Sbjct: 1144 TSTLLQELSKDFALKDLGDLHYFLGIEVHKVKDGLMLSQEKYASDLLRRVGMYECKPVST 1203
Query: 1202 PMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHW 1261
P+ ++EKL +G L ++ +YRS VGALQYLT+TRPD++F++NKVCQ+LH PT HW
Sbjct: 1204 PLSTSEKLSVNEGTLLGPQDSTQYRSVVGALQYLTLTRPDISFSINKVCQFLHAPTTTHW 1263
Query: 1262 GAVKRILRYVKGTLALGVKI-QKSTMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISW 1321
AVKRILRYVK T+ G+K + ++++SGFSDADWAG PDDRRST GFAVFLG NL+SW
Sbjct: 1264 AAVKRILRYVKYTVDTGLKFCRNPSLLVSGFSDADWAGSPDDRRSTGGFAVFLGPNLVSW 1323
Query: 1322 SSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSN 1379
S+RKQATVSRSS EAEYKA+AN TAE++W+++LL+ELGV +A +LWCDNLGA YL++N
Sbjct: 1324 SARKQATVSRSSIEAEYKALANATAEIMWVQTLLQELGVESPRAAKLWCDNLGAKYLSAN 1376
BLAST of CmaCh04G012940 vs. NCBI nr
Match:
gi|77555253|gb|ABA98049.1| (retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group])
HSP 1 Score: 1237.6 bits (3201), Expect = 0.0e+00
Identity = 682/1463 (46.62%), Postives = 928/1463 (63.43%), Query Frame = 1
Query: 2 SIPTNTMSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEP 61
S + ++P IS KL + N+ LW Q+ +R L G + G+ P IAV
Sbjct: 3 SSSSTAATNPFQGHPISEKLGKANHALWKVQVSAAVRGARLQGHLTGATKRPPAEIAVT- 62
Query: 62 SEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLERQFAS 121
+ G K NP W DQ VL + SS+T EVL + TA E W +E+ +++
Sbjct: 63 --KDGATKKEPNPAHEDWEATDQQVLGYLLSSLTREVLMQVATCDTAAEVWSAIEQMYST 122
Query: 122 TSRARAMQIRMELSTIQKKDMTIADYFRKVKHLGDTLAAIGKR-IEDEELIAYMLQGLGP 181
+RARA+ R L+ +K +M+ +YF K+K LGD +A G R I++EELI Y++ GLG
Sbjct: 123 HTRARAINTRFALTNTKKGNMSTPEYFAKMKSLGDEMATAGGRPIDEEELIQYIITGLGE 182
Query: 182 DYDPLVTSITTRTDVYTVSDVYAHMLSYEMRH-LRKGTFEQLSSANNVNRISIRGGANGG 241
Y +V+++ R + +VSD+Y+ +L++E R + +G E + N S GG
Sbjct: 183 GYSEVVSAVCARVEPISVSDLYSQVLNFEARQAIYRGAQEVTVNVANRGGFS-HGGRGNS 242
Query: 242 RGSRGRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGI----VCQICGKPNHDALQCWHRF 301
G +G SR G G R N G + ++ G+ +CQ+C K H A CW+R+
Sbjct: 243 NGGQGGSRGGGGGGGHGR---GNGGGGHGRGRTPGGVDKRPICQVCFKRGHTAADCWYRY 302
Query: 302 DQAYQAENNLKQAALATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANG 361
D+ Y + AA A + Y DTNWY+DTGATDH+T +L++LT +E+Y G +QI A+G
Sbjct: 303 DEDYVPDAK-HVAAAAVNSYGVDTNWYIDTGATDHVTGELDKLTMKEKYNGGEQIHTASG 362
Query: 362 AGLSISHIGNSLI---SGSSLVLKHILYAPKINKHLISVQRLASDNNAVVEFHPNYFLVK 421
AG+ ISHIG++++ + ++ L+++LY P+ K+L+SV RL +DN+A +E H +YF +K
Sbjct: 363 AGMDISHIGHAIVHNPNSRNIHLRNVLYVPQAKKNLVSVHRLVNDNSAFLELHRDYFFLK 422
Query: 422 DRVTKKLLLHGRCKNGLYVLPHN-FSQALLTAKLSKEQWHRRLGHPASPITIRILQDNNL 481
D++T+K LL GR LY LPH+ QA KLS ++WH RLGHP+ +++ +L
Sbjct: 423 DQITRKTLLKGRSWRRLYPLPHSSLKQAYSAIKLSLDRWHHRLGHPSITTVKQVVNKFSL 482
Query: 482 AIDTNIPSSSICNACQLGKAHQLPFGSSQHVSTAPLQLIHTDVWGPSIASVNNSKYYVSF 541
+++ S S+CNACQ K+HQLPF S VS PL+L+ +DVWGP+ SV KYYVSF
Sbjct: 483 PC-SDLSSESVCNACQQAKSHQLPFPISSSVSKHPLELVFSDVWGPAPESVGRKKYYVSF 542
Query: 542 VDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTG 601
+DDFS++ WIY ++ KS V F +FQ VE + KI ++Q+DWGGEY L+++F G
Sbjct: 543 IDDFSKFTWIYLIQHKSQVFDKFQEFQVMVERQFDRKILAMQTDWGGEYQSLNSFFSKIG 602
Query: 602 IEHHISCPHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRT 661
I HH+SCPHTHQQNG ERKHRHIVE GL+LL+ A+MPL +WDEAF A +LINR+PSR
Sbjct: 603 IVHHVSCPHTHQQNGSAERKHRHIVEIGLSLLSHASMPLKFWDEAFQVATYLINRVPSRV 662
Query: 662 IQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKC 721
I +PL KLF + PDYS LR+FGCACWPNLRPYNN KL FR+ RC+FLG+S+ HKG+KC
Sbjct: 663 IHNISPLEKLFKQKPDYSSLRIFGCACWPNLRPYNNHKLQFRSKRCVFLGFSTMHKGFKC 722
Query: 722 LNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALAKLASFYTENALTDIE 781
L STGRIYISRDVVFDENIFPF E H L T L I
Sbjct: 723 LEVSTGRIYISRDVVFDENIFPFTEL----------HANAGARLRSEIDILTPELLGPIR 782
Query: 782 PVVSNSHMNDGQTDNIASDNLSGVSLSSADNTRSSEEIAEYEAE--------SSSINAQN 841
V + M+ +N S ++S + A+ + +AE +S +
Sbjct: 783 SVGNEQCMH--PVNNPLSADVSAALSNRANEPHRDGAVHPADAEDPPATPPLDASSGPEP 842
Query: 842 QTHEHVSDQPTEAASQHPMRT---------------RLRNNIVQAKQFTDGTI----RYS 901
H S T ++ +HP T +++ QA Q + + S
Sbjct: 843 DRVVHHSPAAT-SSGRHPGPTPGSVPRGAASSLAEETAEDSVSQAVQEQESQVVQEQEQS 902
Query: 902 ETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWR------------------------ 961
+++ A AVT T ++ A EA PR R
Sbjct: 903 SPAQEHAQAVTDETNTLQHADVTDTGSEAPAGPRTRLQSGVRKEKVYTDGTIKYKHSWFT 962
Query: 962 --GAMNDELSALKRNATWDLVPPKPGINLIDS-----------------KWVYKVKRKAD 1021
G ++L ALK + W L L+ + KWVYK+KRKAD
Sbjct: 963 ASGEPTNDLEALK-DKNWKLAMDSEYDALVKNKTWHLVPPQRGRNIIGCKWVYKIKRKAD 1022
Query: 1022 GSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFL 1081
G+++R KARLVAKGFKQR+G+DY DTFSPV+K +TIR+ILSLAV+KGW++RQ+D+QNAFL
Sbjct: 1023 GTLDRYKARLVAKGFKQRYGIDYEDTFSPVVKAATIRIILSLAVSKGWSLRQLDVQNAFL 1082
Query: 1082 HGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASV 1141
HG L+EEVYM QPPGF+D KP +++CKL KALYGLKQAP+AW SRL+ KL++LGFK S
Sbjct: 1083 HGYLEEEVYMLQPPGFEDPTKP-HHVCKLDKALYGLKQAPRAWFSRLSKKLMDLGFKGSK 1142
Query: 1142 ADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGDLEYFLGIE 1201
D+SLF L +IT+++L+YVDDII+ SSS++AT L+Q LK +FA+KDLG+L YFLGIE
Sbjct: 1143 PDTSLFFLNKGDITMFVLVYVDDIIVASSSEKATAALLQDLKGEFALKDLGELHYFLGIE 1202
Query: 1202 VKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRST 1261
V K ++GI+L+Q +YA DLLK+V M CKP +TP+ +EKL +G L E+ YRS
Sbjct: 1203 VSKVQNGIVLNQDKYANDLLKKVGMIDCKPANTPLSVSEKLSLHEGSLLGPEDASHYRSV 1262
Query: 1262 VGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMM 1321
VGALQYLT+TRPD+AF+VNKVCQ+LH PT HW AVKRILRY+K LG++I KS + +
Sbjct: 1263 VGALQYLTLTRPDIAFSVNKVCQFLHAPTTVHWIAVKRILRYLKQCTRLGLEIHKSGSTL 1322
Query: 1322 LSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEM 1381
+SGFSDADWAGC DDRRST GFA+FLG+NL+SW++RKQATVSRSSTE+EYKAIAN TAE+
Sbjct: 1323 VSGFSDADWAGCLDDRRSTGGFAIFLGSNLVSWNARKQATVSRSSTESEYKAIANATAEI 1382
BLAST of CmaCh04G012940 vs. NCBI nr
Match:
gi|147790209|emb|CAN61322.1| (hypothetical protein VITISV_012106 [Vitis vinifera])
HSP 1 Score: 1048.9 bits (2711), Expect = 8.1e-303
Identity = 582/1414 (41.16%), Postives = 844/1414 (59.69%), Query Frame = 1
Query: 2 SIPTNTMSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEP 61
SIP+ M ++ + VKL + NY+LW +QI + + F+DG+ P + ++
Sbjct: 25 SIPSYQM----LNHTLPVKLDRTNYILWRSQIDNVIFANGFEDFIDGTSICPEKDLSPG- 84
Query: 62 SEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLERQFAS 121
++NP F W QD+ +LS I SS+T +++ ++G T+ AW LE F+S
Sbjct: 85 ---------VMNPAFVAWRRQDRTILSWIYSSLTPGIMAQIIGHNTSHSAWNALESIFSS 144
Query: 122 TSRARAMQIRMELSTIQKKDMTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPD 181
+SRAR MQ+R+EL + +K M++ DY K+K D LAAIG+ + +++ + +L GLG D
Sbjct: 145 SSRARIMQLRLELQSTKKGSMSMIDYIMKIKGAADNLAAIGEPVSEQDQVMNLLGGLGSD 204
Query: 182 YDPLVTSITTRTDVYTVSDVYAHMLSYEMRHLRKGTFEQLSSANNVNRISIRGGA---NG 241
Y+ +VT+I R D ++ +++ +L++E R ++ + EQ+S AN + + RGG NG
Sbjct: 205 YNAVVTAINIRDDKISLEAIHSMLLAFEHRLEQQSSIEQMS-ANYASSSNNRGGGRKFNG 264
Query: 242 GRGS-----------RGRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHD 301
GRG RGR R +GQ R ++P +P CQ+CGK H
Sbjct: 265 GRGQGYSPNNNNYTYRGRGR--GGRNGQGGRQNSSPSEKPQ---------CQLCGKFGHT 324
Query: 302 ALQCWHRFDQAYQA-------------ENNLKQAALATSGYTSDTNWYVDTGATDHITND 361
A C+HRFD ++Q +NN+ + S +D +WY+D+GA+ H+T +
Sbjct: 325 AQICYHRFDISFQGGQTTISHSLNNGNQNNIPAMVASASNNPADESWYLDSGASHHLTQN 384
Query: 362 LERLTTRERYTGTDQIQVANGAGLSISHIGNSLISGS--SLVLKHILYAPKINKHLISVQ 421
L LT+ YTGTD++ + NG LSIS+IG+ + S LK + + P I+ +LISV
Sbjct: 385 LGNLTSTSPYTGTDKVTIGNGKHLSISNIGSKQLHSHTHSFRLKKVFHVPFISANLISVA 444
Query: 422 RLASDNNAVVEFHPNYFLVKDRVTKKLLLHGRCKNGLYVLP-----------HN----FS 481
+ S+NNA++EFH N F VKD TK +L G+ +NGLY P +N S
Sbjct: 445 KFCSENNALIEFHSNAFFVKDLHTKMVLAQGKLENGLYKFPVFSNLKPYSSINNASAFHS 504
Query: 482 QALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFG 541
Q T + E WH RLGH + I +++ N+A + S +C+ CQL K+H+LP
Sbjct: 505 QFSSTVENKAELWHNRLGHASFDIVSKVMNTCNVA--SGKYKSFVCSDCQLAKSHRLPTQ 564
Query: 542 SSQHVSTAPLQLIHTDVWGP-SIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQ 601
S ++ PL+L++TD+WGP SI S + ++Y++ FVDD+SRY W Y L+ K +F
Sbjct: 565 LSNFHASKPLELVYTDIWGPASIKSTSGARYFILFVDDYSRYTWFYSLQTKDQALPIFKX 624
Query: 602 FQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPHTHQQNGLVERKHRHIV 661
F+ +E +TKI+ +QSD GGE+ ++ ++ GI H SCP+ QNG VERKHRH+V
Sbjct: 625 FKLQMENQFDTKIKCLQSDNGGEFRSFTSFLQAVGIAHRFSCPYNSXQNGRVERKHRHVV 684
Query: 662 ETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGC 721
ETGLALL+ A++P+ YW AF T FLINRMPS+ ++ D+P LF + PDY RVFGC
Sbjct: 685 ETGLALLSHASLPMKYWHYAFQTXTFLINRMPSKVLEYDSPYFTLFRRHPDYKSFRVFGC 744
Query: 722 ACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEE 781
C+P +RPYN KL +R+ +C+FLGYS +HKG+ CL+ +TGR+YI+ VVFDE+ FP +
Sbjct: 745 LCYPFIRPYNTHKLQYRSVQCLFLGYSLNHKGFLCLDYATGRVYITPHVVFDESTFPLAQ 804
Query: 782 SKPPNKTTNPHHPVLLPALAKLASFYTENALTDIEPVVSNSHMNDGQTDNIASDNLSGVS 841
SK + + + PAL SF + P SH + S++ +
Sbjct: 805 SKSSSSSNDTSAEGSTPALITPPSF------PCLLPDSKISHASIDSHSLSTSESPIPTT 864
Query: 842 LSSADNTRSSEEIAEYEAESSSINAQNQTHEHVSDQPTEAASQHPMRTRLRNNIVQAKQF 901
SS +T SS + +S +P A M TR I + K
Sbjct: 865 SSSPLDTSSSSPAIDLSPKS-------------VPEPQITALAPRMTTRSMRGITKKKTI 924
Query: 902 TDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATW 961
D SA+ + +EP L++A + P W AM E++AL RN TW
Sbjct: 925 LDL------------SAIKV--------SEPSTLKQAFKDPNWTKAMEMEIAALHRNHTW 984
Query: 962 DLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPST 1021
DLV P +N+I KWVYK+K K DGS+ER KARLVAKG+ Q G+DY +TFSPV+K +T
Sbjct: 985 DLVEQPPNVNVIGCKWVYKLKHKPDGSIERYKARLVAKGYNQTHGLDYFETFSPVVKAAT 1044
Query: 1022 IRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYG 1081
IR+IL++A++ W +RQ+D+ NAFL+G L+E+VYM QPPG+ D P N +C+LKKALYG
Sbjct: 1045 IRIILTVALSFKWEIRQLDVHNAFLNGELEEQVYMSQPPGYFDPQFP-NRVCRLKKALYG 1104
Query: 1082 LKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATE 1141
LKQAP+AW RL+ L++ GF S DSS+F+ + T+ +L+YVDDI++ SS
Sbjct: 1105 LKQAPRAWFQRLSSALLQWGFSMSRTDSSMFLHFGKATTLIVLVYVDDILVTGSSSTQIS 1164
Query: 1142 RLIQKLKIDFAVKDLGDLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPM 1201
LI KL FA++DLG L +FLGIEV + LSQ +Y DLL R + KP +TP
Sbjct: 1165 SLIAKLDSVFALRDLGQLSFFLGIEVSYNEGSMTLSQTKYISDLLHRTELFDTKPANTPG 1224
Query: 1202 GSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGA 1261
+ L + G P++ + YRS VGALQY+T+TRPD+AFAVNK CQ++ PT AHW +
Sbjct: 1225 AVGKNLSKFDGDPMT--DVTHYRSVVGALQYVTLTRPDIAFAVNKACQFMQQPTTAHWLS 1284
Query: 1262 VKRILRYVKGTLALGVKIQ-KSTMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSS 1321
VKRILRY++GT+ G+ S + + GF+DADW DDRRS+SG+ V+LG NL+SWSS
Sbjct: 1285 VKRILRYLRGTMQDGLLFSPSSNLTIEGFTDADWGAHLDDRRSSSGYLVYLGGNLVSWSS 1344
Query: 1322 RKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPV 1370
KQ VSRSS E+EY+ + TAE++W+++LL+EL V P LW DN+ A ++ NPV
Sbjct: 1345 TKQKVVSRSSAESEYRGLVFATAEIVWMQALLQELCVPIPAIPLLWYDNISAYHMAKNPV 1368
BLAST of CmaCh04G012940 vs. NCBI nr
Match:
gi|147856699|emb|CAN81355.1| (hypothetical protein VITISV_039158 [Vitis vinifera])
HSP 1 Score: 1014.2 bits (2621), Expect = 2.2e-292
Identity = 578/1408 (41.05%), Postives = 832/1408 (59.09%), Query Frame = 1
Query: 2 SIPTNTMSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEP 61
SIP+ M + S+ VKL + NY+LW +QI + + F+DG+ P + + P
Sbjct: 22 SIPSYQMLNYSLP----VKLDRTNYILWRSQIDNVIFANGFEDFIDGTSVCPEKEL--RP 81
Query: 62 SEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLERQFAS 121
E INP F QD+ +LS I SS+T +++ ++G ++ AW LE+ F+S
Sbjct: 82 GE--------INPAFVAXRRQDRTILSWIYSSLTPGIMAQIIGHNSSHSAWNALEKIFSS 141
Query: 122 TSRARAMQIRMELSTIQKKDMTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPD 181
SRAR MQ+ +E + +K M++ DY KVK D+LAAIG+ + +++ I +L GLG D
Sbjct: 142 CSRARIMQLXLEFQSTKKGSMSMIDYIMKVKGAADSLAAIGEPVSEQDQIMNLLGGLGSD 201
Query: 182 YDPLVTSITTRTDVYTVSDVYAHMLSYEMRHLRKGTFEQLS--SANNVNRISIRGGA--- 241
Y+ +VT+I R D ++ V++ +L++E R ++G+ EQL SAN + + RGG
Sbjct: 202 YNAVVTAINIREDKISLEAVHSMLLAFEQRLEQQGSIEQLPAMSANYASXSNNRGGGRKY 261
Query: 242 NGGRGSRGRSRQLN-SGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWHRF 301
NGGRG N G G+ R GRQ S SS CQ+CGK H C+HRF
Sbjct: 262 NGGRGPNFMMTNSNFRGRGRGXR-YGQSGRQNSS--SSERPQCQLCGKFGHTVQVCYHRF 321
Query: 302 DQAYQA------------ENNLKQAALATSGYTSDTNWYVDTGATDHITNDLERLTTRER 361
D +Q+ +N A +A S +D NWY+D+GA+ H+T ++ LT
Sbjct: 322 DITFQSTQNNTTGVSNSGNSNXMPAMVAXSNNXADDNWYLDSGASHHLTQNVANLTNATP 381
Query: 362 YTGTDQIQVANGAGLSISHIGNSLISGS--SLVLKHILYAPKINKHLISVQRLASDNNAV 421
YTG D++ + NG L+IS+ + + + S LK + + P I+ +LISV + SDNNA+
Sbjct: 382 YTGADKVTIGNGKHLTISNTXFTRLFSNPHSFQLKKVFHVPFISANLISVAKFCSDNNAL 441
Query: 422 VEFHPNYFLVKDRVTKKLLLHGRCKNGLYVLP--HNFSQALL-----------TAKLSKE 481
+EFH N F +KD TK++L G+ +NGLY P N A + + +E
Sbjct: 442 IEFHSNGFFLKDLHTKRVLAQGKLENGLYKFPVISNKKTAYVGITNDSTFQCSNIENKRE 501
Query: 482 QWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQHVSTAPLQ 541
WH RLGH A+ I RI+ + N++ +++C++CQL K+H+LP S ++ PL+
Sbjct: 502 LWHHRLGHAATDIVTRIMHNCNVSCGKY--KATVCSSCQLAKSHRLPTHLSSFHASKPLE 561
Query: 542 LIHTDVWGP-SIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNT 601
L++TD+WGP S+ S + +KY++ FVDD+SRY W+Y L+ K +F F+ VE
Sbjct: 562 LVYTDIWGPASVTSTSGAKYFILFVDDYSRYTWLYLLQSKDQALPIFKXFKLQVENQFEA 621
Query: 602 KIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPHTHQQNGLVERKHRHIVETGLALLAQAN 661
KI+ +Q D GGE+ ++ + +GI H SCP+ QNG VERKHRH+VETGLALLA A
Sbjct: 622 KIKCLQXDNGGEFRSFMSFLQESGILHRFSCPYNSSQNGRVERKHRHVVETGLALLAHAG 681
Query: 662 MPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNN 721
+PL +W AF TA FLINRMPS+ + +P LF ++PDY LRVFGC C+P +RPYNN
Sbjct: 682 LPLKFWSYAFQTATFLINRMPSKVLXNASPYFALFKRNPDYKFLRVFGCLCYPFIRPYNN 741
Query: 722 KKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPH 781
KL +R+ +C+FLGYS HKGY CL+ TGR+Y+S VVFDE FPF ++ + + +
Sbjct: 742 HKLQYRSLKCVFLGYSLHHKGYLCLDNLTGRVYVSPHVVFDETQFPFAQNISSSPSKDAS 801
Query: 782 HPVLLPALAKLASFYTENALTDIEPVVSNSHMNDGQTDNIASDNLSG-----VSLSSADN 841
++P + + P + H G ++AS NL+ S +
Sbjct: 802 DESIIPXI-----------IVSSNPSTLSFH---GSNHSMASPNLTSTLTHPTSPTDTPT 861
Query: 842 TRSSEEIAEYEAESSSINAQNQTHEHVSDQPTEAASQHPMRTRLRNNIVQAKQFTDGTIR 901
TR S E EAE + AQ Q V P + TR + I + K
Sbjct: 862 TRXSRE-PVLEAE-VXLPAQQQV---VVPPPR-------VTTRSMSGITKRK-------- 921
Query: 902 YSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPK 961
I +EP L++A++ P W AM+ E++AL +N TWDLV P
Sbjct: 922 ------------PIFNLAASKISEPTTLKQAIKDPNWAEAMHTEIAALHKNQTWDLVDPP 981
Query: 962 PGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILS 1021
+N+I KWVYK+K K DGSV+R KARLVA+GF Q FG+DY +TFSPV+K +TIR++L+
Sbjct: 982 KDVNIIGCKWVYKLKYKXDGSVDRYKARLVARGFNQTFGLDYFETFSPVVKAATIRIVLT 1041
Query: 1022 LAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPK 1081
+ ++ W +RQ+D VYM QPPGF P N +CKLKKALYGLKQ+P+
Sbjct: 1042 IXLSYRWELRQLD-------------VYMAQPPGFLHPNHP-NKVCKLKKALYGLKQSPR 1101
Query: 1082 AWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKL 1141
AW ++L+ L+ GF +S DSS+F+ R T+ +L+YVDDII+ SS ++LI KL
Sbjct: 1102 AWFTKLSSALLSXGFNSSRTDSSMFVHFGRHSTLIVLVYVDDIIVTGSSPILIQQLIHKL 1161
Query: 1142 KIDFAVKDLGDLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKL 1201
FA++DLG L YFLGIEV + LSQR+Y DLL+R +M K +TP L
Sbjct: 1162 HSLFALRDLGQLSYFLGIEVTYDGGSLHLSQRKYITDLLQRTSMLDSKAAATPGTVGLSL 1221
Query: 1202 FREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILR 1261
+ G + ++ YRS VGALQY T+TRPD+AF++NK CQ++H PT HW +VKRILR
Sbjct: 1222 SQFDGDLM--DDVTMYRSVVGALQYATLTRPDIAFSINKACQFMHRPTSTHWSSVKRILR 1281
Query: 1262 YVKGTLALGVKIQKST-MMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATV 1321
Y+KGT+ G+ +Q S + ++DADW PDDRRS+SG+ V+LG NL+SW++ KQ V
Sbjct: 1282 YLKGTITHGLLLQPSAHFTIQAYTDADWGAQPDDRRSSSGYLVYLGNNLVSWTASKQKVV 1341
Query: 1322 SRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTK 1370
SRSS E+EY+ +A TAE+IW ++LL+EL + + P L+ DN+ A Y+ NPVFHARTK
Sbjct: 1342 SRSSAESEYRGLAIATAEIIWTQALLRELCISITSIPTLYYDNISAYYMAKNPVFHARTK 1348
BLAST of CmaCh04G012940 vs. NCBI nr
Match:
gi|1012364758|gb|KYP75940.1| (Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan])
HSP 1 Score: 995.7 bits (2573), Expect = 8.1e-287
Identity = 570/1425 (40.00%), Postives = 821/1425 (57.61%), Query Frame = 1
Query: 2 SIPTNTMSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEP 61
S T + S IS KL+++N+L W Q +R L + G+M P+Q
Sbjct: 5 SATTKQFIPQAFSNPISSKLSEDNFLTWRQQAESTIRGYRLKKHILGAMHVPAQH----- 64
Query: 62 SEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLERQFAS 121
+ K +++PE+ + QD L+ S + S+ + MVG + W LE FAS
Sbjct: 65 KTDEDKAKGVLSPEYEDYDQQDNLLKSWLLESMEPQFKVRMVGYEWCHQIWNNLEIYFAS 124
Query: 122 TSRARAMQIRMELSTIQKKDMTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPD 181
++AR Q++++L I KK I Y ++K + + LAAIG + E I + GL +
Sbjct: 125 QTKARVKQLKIQLRGI-KKTTAINQYLLEIKKIVNNLAAIGSPLNTAEHIDAIFDGLSEE 184
Query: 182 YDPLVTSITTRTDVYTVSDVYAHMLSYEMRHLRKGTFEQL--------SSANNVNRISIR 241
YDP +TS+ TRT+ Y+V + + +++ E R + E + S N+ + + +
Sbjct: 185 YDPFITSVLTRTEEYSVEQIESLLMAQEERLEKHKASESVPLQANLAQGSFNSRKQFTNQ 244
Query: 242 GGANGGRGSRGRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWH 301
G + G+ GR G G+S + +QP CQ+CGK H A CWH
Sbjct: 245 GRGSNNNGTHGRGPHRGRGRGRSAQHFGRGNKQP----------CQVCGKIGHIAFHCWH 304
Query: 302 RFDQAYQAEN-----------NLKQAALATSGYTS---DTNWYVDTGATDHITNDLERLT 361
R+DQ Y N N +Q +G + D WY D+GAT+H+T+DL L
Sbjct: 305 RYDQQYTEPNLNHNTNVYNPGNQQQMQAMIAGSQNMVYDDQWYPDSGATNHLTSDLNNLG 364
Query: 362 TRERYTGTDQIQVANGAGLSISHIGNSL----ISGSSLVLKHILYAPKINKHLISVQRLA 421
++ YTG D+I + NG + I+H G S +S LK +L+ P I K+L+SV +
Sbjct: 365 SKTDYTGQDKIHMGNGQAIGINHTGTSFFHTPMSSKIFTLKELLHVPHITKNLLSVSKFC 424
Query: 422 SDNNAVVEFHPNYFLVKDRVTKKLLLHGRCKNGLYV------------------LPHNFS 481
DNN VEFH NY LVK + +K+ LL G KNGLYV +P N +
Sbjct: 425 KDNNVYVEFHTNYCLVKSQDSKETLLRGNLKNGLYVFDEVQILKPDVYEHTVDSVPTNRT 484
Query: 482 QALLT-AKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPF 541
+ + S + WH RL H + I +++ N+ + N S +C +C +GK+H LPF
Sbjct: 485 TLYVQRTRKSYDVWHNRLAHASRKIVQAVMKTCNVPMPINHQDSIVCKSCCIGKSHTLPF 544
Query: 542 GSSQHVSTAPLQLIHTDVWGPS-IASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFL 601
S V +PL+L+++DVWGPS AS +YYV F D FS+Y WIYF+ KS+ F+
Sbjct: 545 SDSYTVYNSPLELVYSDVWGPSHYASREGFRYYVHFTDAFSKYTWIYFMHNKSETAHHFI 604
Query: 602 QFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPHTHQQNGLVERKHRHI 661
F+ VE KI+ QSD G E+ L F GI H +SCPH+HQQNG ERKHRHI
Sbjct: 605 HFKSMVENQFGHKIKMFQSDGGKEFTCLTKLFNENGITHRLSCPHSHQQNGTAERKHRHI 664
Query: 662 VETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFG 721
E GL LL+ + +P +W +AF+TA +INR+P+ + +P LF K PDY +LR FG
Sbjct: 665 TEVGLTLLSSSGLPHIFWSDAFSTAIHVINRLPTPILGHKSPYEVLFNKIPDYLLLRPFG 724
Query: 722 CACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFE 781
CAC+P+LRP+N+ K+ FR+++C+F+GYS HKGYKC + G+I ISR+VVFDE+ FP
Sbjct: 725 CACYPHLRPFNSHKMDFRSSQCVFIGYSMQHKGYKCFT-TNGKIIISRNVVFDEDTFPGP 784
Query: 782 ESKPPNKT---TNPHHPVLLPALAKLAS-FYTENALTDIEPVVSNSHMNDGQTDNIASDN 841
N ++ H P +P++ L S N+ T + P+VS+ +++
Sbjct: 785 SYFNTNNVAGLSSSHSPQPVPSIHVLPSQSLAPNSATTV-PLVSSI------DESLQDSQ 844
Query: 842 LSGVSLSSADNTRSSEEIAEYEAESSSINAQN--QTHEHVSDQPTEAASQHPMRTRLRNN 901
L+ + +S DN +S SS N Q T + T +QHPM TR +
Sbjct: 845 LTSTTDTSQDNLITS---------SSQPNPQPIITTVPDPASPATRTTNQHPMITRAKIG 904
Query: 902 IVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSA 961
+ + K F +EP ++ +A+ + W+ AM DE +A
Sbjct: 905 VFKPKAFL-------------------------ATSEPISVAQALDNKHWKQAMQDEYNA 964
Query: 962 LKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFS 1021
L +N TWDLV PG I KW++K K K DGS+ R KARLVA+GF Q+ G+DYTDTFS
Sbjct: 965 LLKNHTWDLVSLPPGRTAITCKWIFKNKYKQDGSILRHKARLVARGFSQQQGLDYTDTFS 1024
Query: 1022 PVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICK 1081
PV+KP +IR++L+LAV+KGW + Q+D+ NAFL+G LKE++YM QP GF + A N +CK
Sbjct: 1025 PVVKPVSIRIVLALAVSKGWPIHQIDVDNAFLNGDLKEDIYMLQPQGFHNGA--PNTVCK 1084
Query: 1082 LKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVS 1141
L+KALYGLKQAP+AW +LT L++LGFK + +DSSLF ++ +IY+LIYVDDIII S
Sbjct: 1085 LRKALYGLKQAPRAWFQKLTLILLQLGFKQARSDSSLFTFTSKVTSIYILIYVDDIIITS 1144
Query: 1142 SSDQATERLIQKLKIDFAVKDLGDLEYFLGIEVKKT-RDGIILSQRRYALDLLKRVNMEK 1201
+S A + LI L F +KDLG L YFLGIEV GI LSQ +Y D+L ++NM
Sbjct: 1145 NSPAAIQTLISTLHRFFPLKDLGSLHYFLGIEVNSDGAGGIHLSQTKYLKDILDKLNMSN 1204
Query: 1202 CKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHT 1261
KP+ TPM + KL + G + + YRS +GALQYLT+TRPD+A+ VNK+CQ++H
Sbjct: 1205 AKPVKTPMAPSLKLTND-GTAVCPDPSL-YRSVIGALQYLTITRPDIAYTVNKLCQFMHH 1264
Query: 1262 PTDAHWGAVKRILRYVKGTLALGVKIQKST-MMLSGFSDADWAGCPDDRRSTSGFAVFLG 1321
P ++HW AVKR+LRY++GTL G+ +KST + + D+DWA +D RSTSG V+LG
Sbjct: 1265 PLESHWKAVKRLLRYLQGTLQHGLHYKKSTDTGIHAYCDSDWASDHEDMRSTSGNCVYLG 1324
Query: 1322 ANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGA 1373
N++SW ++KQ VSRSSTEAE++++A+L E+ +I++LL EL ++ P +WCDN GA
Sbjct: 1325 PNIVSWMAKKQRVVSRSSTEAEFRSLASLVTEVQYIQNLLSELHNSSTQPPLIWCDNQGA 1367
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
POLX_TOBAC | 4.6e-157 | 31.11 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... | [more] |
COPIA_DROME | 6.0e-88 | 37.53 | Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3 | [more] |
M810_ARATH | 3.9e-55 | 48.68 | Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg0... | [more] |
YCH4_YEAST | 2.4e-36 | 29.90 | Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... | [more] |
YJ41B_YEAST | 3.4e-27 | 24.09 | Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... | [more] |
Match Name | E-value | Identity | Description | |
Q75LJ1_ORYSJ | 0.0e+00 | 47.02 | Putative copia-like retrotransposon protein OS=Oryza sativa subsp. japonica GN=O... | [more] |
Q2QRW4_ORYSJ | 0.0e+00 | 46.62 | Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. jap... | [more] |
A5BFR8_VITVI | 5.6e-303 | 41.16 | Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_012106 PE=4 SV=1 | [more] |
A5BQ73_VITVI | 1.5e-292 | 41.05 | Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_039158 PE=4 SV=1 | [more] |
A0A151U9E7_CAJCA | 5.7e-287 | 40.00 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) OS=Cajanu... | [more] |
Match Name | E-value | Identity | Description | |
AT4G23160.1 | 2.0e-113 | 43.98 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | [more] |
ATMG00810.1 | 2.2e-56 | 48.68 | ATMG00810.1 DNA/RNA polymerases superfamily protein | [more] |
ATMG00820.1 | 3.8e-24 | 49.11 | ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | [more] |
ATMG00240.1 | 9.3e-15 | 47.73 | ATMG00240.1 Gag-Pol-related retrotransposon family protein | [more] |
AT1G34070.1 | 1.6e-14 | 25.29 | Retrotransposon gag protein (InterPro:IPR005162) | [more] |