Homology
BLAST of CSPI04G22620 vs. ExPASy Swiss-Prot
Match:
Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)
HSP 1 Score: 737.3 bits (1902), Expect = 2.9e-211
Identity = 422/964 (43.78%), Positives = 574/964 (59.54%), Query Frame = 0
Query: 276 LGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQ 335
+ KS+ +PF S ++ P IYSD+W +P S+D +RYY++F+D ++ YTW+YPLKQ
Sbjct: 505 INKSNKVPFSQSTINSTRPLEYIYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLYPLKQ 564
Query: 336 KSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQHLCLNLGINCRFSCPYTSAQNG 395
KS E F F ++N+F I F SDNGGE+ + GI+ S P+T NG
Sbjct: 565 KSQVKETFITFKNLLENRFQTRIGTFYSDNGGEFVALWEYFSQHGISHLTSPPHTPEHNG 624
Query: 396 RAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKL 455
+ERKHRHIVETGLTLL+ A++ YW AF V LIN + TP LQ SP ++LF
Sbjct: 625 LSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLFGTSP 684
Query: 456 KVENLKIFGCVCFPCLRPYQPTKFSYHSEKCVYLGPSPTHKGFKCLS-KTGRIFISRHVK 515
+ L++FGC C+P LRPY K S +CV+LG S T + CL +T R++ISRHV+
Sbjct: 685 NYDKLRVFGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQSAYLCLHLQTSRLYISRHVR 744
Query: 516 FNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQIT 575
F+EN FPFS+ + S Q S + P +T AP+ P +T +
Sbjct: 745 FDENCFPFSN-YLATLSPVQEQRRESSCVWSPHTTLPTRTPVLPAPSCSDPHHAATPPSS 804
Query: 576 LPFPFPIPPM----------SSIPSSPINITP--NNP----------PSVHSTA-----N 635
PF + SS PSSP P N P HS+ N
Sbjct: 805 PSAPFRNSQVSSSNLDSSFSSSFPSSPEPTAPRQNGPQPTTQPTQTQTQTHSSQNTSQNN 864
Query: 636 PTNSNP-------NLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSN-S 695
PTN +P + P S S + T +S +S +PPS L P + V +N +
Sbjct: 865 PTNESPSQLAQSLSTPAQSSSSSPSPTTSASSSSTSPTPPSILIHPPPPLAQIVNNNNQA 924
Query: 696 HIPTHSMITRAKADIFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLAT 755
+ THSM TRAKA I KP S + + +EP +AL +W+ AM +E A +
Sbjct: 925 PLNTHSMGTRAKAGIIKPNPKYSLAVSLAAESEPRTAIQALKDERWRNAMGSEINAQIGN 984
Query: 756 NTWSLVPPSSSQ-NIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFETFSPVV 815
+TW LVPP S IVG +WIF K NSDGS+ RYKARLVAKG++Q PG+D+ ETFSPV+
Sbjct: 985 HTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVI 1044
Query: 816 KASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKA 875
K++++ +VL + + R W +RQLD NNAFL G L + VY++QPP ++ +YVCKL KA
Sbjct: 1045 KSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDDVYMSQPPGFIDKDRPNYVCKLRKA 1104
Query: 876 IYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFILRCQHSIILLLAYVDDAIITGNDPS 935
+YGLKQAPRAW L LL GFVNS SD+SLF+L+ SI+ +L YVDD +ITGNDP+
Sbjct: 1105 LYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQRGKSIVYMLVYVDDILITGNDPT 1164
Query: 936 LIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAP 995
L+ + + +L ++F++KD L YFL + K + G LS+ +Y+ DLL R M K
Sbjct: 1165 LLHNTLDNLSQRFSVKDHEELHYFLGIEAKRVPTGLHLSQRRYILDLLARTNMITAKPVT 1224
Query: 996 SPSVVGKTLSALDSKLLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQ 1055
+P LS L DP+ YR +G+LQYL TRPDI+Y VN LSQF+ PT+ H Q
Sbjct: 1225 TPMAPSPKLSLYSGTKLTDPTEYRGIVGSLQYLAFTRPDISYAVNRLSQFMHMPTEEHLQ 1284
Query: 1056 AVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNNLVSWS 1115
A+KRILRY++GT G+ + LS+ A+SDADWA + DD + Y V++G++ +SWS
Sbjct: 1285 ALKRILRYLAGTPNHGIFLKKGNTLSLHAYSDADWAGDKDDYVSTNGYIVYLGHHPISWS 1344
Query: 1116 SKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNP 1175
SKKQ V SSTE+EYR++A + E+ W+ LL EL + + PVI+CDN+ A L NP
Sbjct: 1345 SKKQKGVVRSSTEAEYRSVANTSSEMQWICSLLTELGIRLTRPPVIYCDNVGATYLCANP 1404
Query: 1176 VFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRSKLGVVI 1203
VFH+R KHI ID FIR+QV GAL V +V + DQLA+ LTKPL+ + F+N SK+GV
Sbjct: 1405 VFHSRMKHIAIDYHFIRNQVQSGALRVVHVSTHDQLADTLTKPLSRTAFQNFASKIGVTR 1464
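The percentages on each HSP summary line follow directly from the reported counts: for the first hit above, 422/964 gives 43.78% identity and 574/964 gives 59.54% positives. A minimal Python sketch for reading such a line and recomputing both values is shown below. The function name `parse_hsp_summary` and the returned dictionary keys are hypothetical, chosen only for illustration, and the pattern assumes the summary-line layout used in this report.

```python
import re

# Pattern for the per-HSP summary line as laid out in this report.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
SUMMARY_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Extract counts from a summary line and recompute the percentages."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_length": int(ident_len),
        # Recomputed from the counts, rounded to two decimals as in the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


summary = parse_hsp_summary(
    "Identity = 422/964 (43.78%), Positives = 574/964 (59.54%), Query Frame = 0"
)
print(summary["identity_pct"], summary["positive_pct"])
```

Comparing the recomputed and reported percentages is a quick way to catch a summary line that was damaged in copying.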
BLAST of CSPI04G22620 vs. ExPASy Swiss-Prot
Match:
Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)
HSP 1 Score: 728.0 bits (1878), Expect = 1.8e-208
Identity = 420/970 (43.30%), Positives = 573/970 (59.07%), Query Frame = 0
Query: 276 LGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQ 335
+ KSH +PF NS + +P IYSD+W +P S D +RYY++F+D ++ YTW+YPLKQ
Sbjct: 484 INKSHKVPFSNSTITSSKPLEYIYSDVWS-SPILSIDNYRYYVIFVDHFTRYTWLYPLKQ 543
Query: 336 KSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQHLCLNLGINCRFSCPYTSAQNG 395
KS + F F V+N+F I SDNGGE+ ++ GI+ S P+T NG
Sbjct: 544 KSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFVVLRDYLSQHGISHFTSPPHTPEHNG 603
Query: 396 RAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKL 455
+ERKHRHIVE GLTLL+ A++ YW AF V LIN + TP LQ SP ++LF Q
Sbjct: 604 LSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPP 663
Query: 456 KVENLKIFGCVCFPCLRPYQPTKFSYHSEKCVYLGPSPTHKGFKCLS-KTGRIFISRHVK 515
E LK+FGC C+P LRPY K S++C ++G S T + CL TGR++ SRHV+
Sbjct: 664 NYEKLKVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQ 723
Query: 516 FNENDFPFSDLFYPAQSTCSTQSAS----PSLAFFKSWPSPNQTQSNMAPN-------PQ 575
F+E FPFS + ++ +S S PS + P + P+ P
Sbjct: 724 FDERCFPFSTTNFGVSTSQEQRSDSAPNWPSHTTLPTTPLVLPAPPCLGPHLDTSPRPPS 783
Query: 576 GPQLTSTTQIT---LPFPFPIPPMSSIPSSPINITP-------------------NNPPS 635
P TTQ++ LP P SS P++P + P NNP
Sbjct: 784 SPSPLCTTQVSSSNLPSSSISSPSSSEPTAPSHNGPQPTAQPHQTQNSNSNSPILNNPNP 843
Query: 636 VHSTANPTNSNPNLPHNPLS------PSTTIT-PRENSPYSSSSPPSPLSL-GPASIDHT 695
+ N N N LP +P+S PST+I+ P S S+S+PP P L P I
Sbjct: 844 NSPSPNSPNQNSPLPQSPISSPHIPTPSTSISEPNSPSSSSTSTPPLPPVLPAPPIIQVN 903
Query: 696 VQPSNSHIPTHSMITRAKADIFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEY 755
Q + + THSM TRAK I KP S + + +EP +A+ +W++AM +E
Sbjct: 904 AQ---APVNTHSMATRAKDGIRKPNQKYSYATSLAANSEPRTAIQAMKDDRWRQAMGSEI 963
Query: 756 CALLATNTWSLV-PPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFE 815
A + +TW LV PP S IVG +WIF K NSDGS+ RYKARLVAKG++Q PG+D+ E
Sbjct: 964 NAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSDGSLNRYKARLVAKGYNQRPGLDYAE 1023
Query: 816 TFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYV 875
TFSPV+K++++ +VL + + R W +RQLD NNAFL G L + VY++QPP +V YV
Sbjct: 1024 TFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDEVYMSQPPGFVDKDRPDYV 1083
Query: 876 CKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFILRCQHSIILLLAYVDDAII 935
C+L KAIYGLKQAPRAW L LL GFVNS SD+SLF+L+ SII +L YVDD +I
Sbjct: 1084 CRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNSISDTSLFVLQRGRSIIYMLVYVDDILI 1143
Query: 936 TGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMT 995
TGND L++ + +L ++F++K+ L YFL + K + G LS+ +Y DLL R M
Sbjct: 1144 TGNDTVLLKHTLDALSQRFSVKEHEDLHYFLGIEAKRVPQGLHLSQRRYTLDLLARTNML 1203
Query: 996 DLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRP 1055
K +P L+ L DP+ YR +G+LQYL TRPD++Y VN LSQ++ P
Sbjct: 1204 TAKPVATPMATSPKLTLHSGTKLPDPTEYRGIVGSLQYLAFTRPDLSYAVNRLSQYMHMP 1263
Query: 1056 TDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGN 1115
TD HW A+KR+LRY++GT G+ + LS+ A+SDADWA + DD + Y V++G+
Sbjct: 1264 TDDHWNALKRVLRYLAGTPDHGIFLKKGNTLSLHAYSDADWAGDTDDYVSTNGYIVYLGH 1323
Query: 1116 NLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAG 1175
+ +SWSSKKQ V SSTE+EYR++A + E+ W+ LL EL + S PVI+CDN+ A
Sbjct: 1324 HPISWSSKKQKGVVRSSTEAEYRSVANTSSELQWICSLLTELGIQLSHPPVIYCDNVGAT 1383
Query: 1176 ALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRS 1203
L NPVFH+R KHI +D FIR+QV GAL V +V + DQLA+ LTKPL+ F+N
Sbjct: 1384 YLCANPVFHSRMKHIALDYHFIRNQVQSGALRVVHVSTHDQLADTLTKPLSRVAFQNFSR 1443
BLAST of CSPI04G22620 vs. ExPASy Swiss-Prot
Match:
P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)
HSP 1 Score: 411.0 bits (1055), Expect = 4.8e-113
Identity = 296/940 (31.49%), Positives = 447/940 (47.55%), Query Frame = 0
Query: 277 GKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQK 336
GK H + F S ++YSD+ GP S +Y++ F+DD S W+Y LK K
Sbjct: 463 GKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTK 522
Query: 337 SSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEY--KKIQHLCLNLGINCRFSCPYTSAQN 396
+ FQ F V+ + +K +SDNGGEY ++ + C + GI + P T N
Sbjct: 523 DQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHN 582
Query: 397 GRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQK 456
G AER +R IVE ++L A + ++W +A T LIN + L P N++
Sbjct: 583 GVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKE 642
Query: 457 LKVENLKIFGCVCFPCLRPYQPTKFSYHSEKCVYLGPSPTHKGFKCLSKT-GRIFISRHV 516
+ +LK+FGC F + Q TK S C+++G G++ ++ SR V
Sbjct: 643 VSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDV 702
Query: 517 KFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQI 576
F E++ +T ++M+ +
Sbjct: 703 VFRESEV--------------------------------RTAADMSEKVKNG-------- 762
Query: 577 TLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSP 636
IP +IPS T NNP S ST + + P +
Sbjct: 763 ------IIPNFVTIPS-----TSNNPTSAESTTDEVSEQGEQPGEVIEQG---------- 822
Query: 637 YSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADIFKPKACISKSFTDWTL--- 696
L G ++H Q H P + R++ + + S T++ L
Sbjct: 823 -------EQLDEGVEEVEHPTQGEEQHQP----LRRSERPRVESRRYPS---TEYVLISD 882
Query: 697 -TEPTRIKEALI---TLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNS 756
EP +KE L Q KAM E +L T+ LV + + KW+FKLK++
Sbjct: 883 DREPESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDG 942
Query: 757 DGSIQRYKARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAF 816
D + RYKARLV KGF Q G+DF E FSPVVK +++ +LS+ + + QLD AF
Sbjct: 943 DCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAF 1002
Query: 817 LNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSK 876
L+G LEE +Y+ QP + + H VCKLNK++YGLKQAPR W + + ++ +
Sbjct: 1003 LHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTY 1062
Query: 877 SDSSLFILR-CQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSF 936
SD ++ R +++ I+LL YVDD +I G D LI L L K F +KDLG L
Sbjct: 1063 SDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGM 1122
Query: 937 QV--KYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLS------ALDSKLLDD 996
++ + LS+EKY++ +L R M + K +P LS ++ K
Sbjct: 1123 KIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMA 1182
Query: 997 PSLYRSTIGALQY-LTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLL 1056
Y S +G+L Y + TRPDIA+ V +S+FL+ P HW+AVK ILRY+ GT L
Sbjct: 1183 KVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLC 1242
Query: 1057 FQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRA 1116
F S D + ++DAD A +ID+RK + Y +SW SK Q VA S+TE+EY A
Sbjct: 1243 FGGS-DPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIA 1302
Query: 1117 LALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRD 1176
E+IWLK+ L EL + + V++CD+ SA L+ N ++H RTKHI++ +IR+
Sbjct: 1303 ATETGKEMIWLKRFLQELGLHQK-EYVVYCDSQSAIDLSKNSMYHARTKHIDVRYHWIRE 1325
Query: 1177 QVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRSKLGV 1197
V +L V + + + A+ LTK + ++F + +G+
Sbjct: 1363 MVDDESLKVLKISTNENPADMLTKVVPRNKFELCKELVGM 1325
BLAST of CSPI04G22620 vs. ExPASy Swiss-Prot
Match:
P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)
HSP 1 Score: 380.9 bits (977), Expect = 5.3e-104
Identity = 298/968 (30.79%), Positives = 460/968 (47.52%), Query Frame = 0
Query: 277 GKSHNLPFP--NSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLK 336
GK LPF ++H K P +++SD+ GP + D Y+++F+D ++ Y Y +K
Sbjct: 461 GKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIK 520
Query: 337 QKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEY--KKIQHLCLNLGINCRFSCPYTSA 396
KS FQ FV + FN + DNG EY +++ C+ GI+ + P+T
Sbjct: 521 YKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQ 580
Query: 397 QNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTL--QGLSPIERL 456
NG +ER R I E T+++ A + ++W +A LT LIN + + L +P E
Sbjct: 581 LNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMW 640
Query: 457 FNQKLKVENLKIFGCVCFPCLRPYQPTKFSYHSEKCVYLGPSPTHKGFKCLSKTGRIFI- 516
N+K +++L++FG + ++ Q KF S K +++G P GFK FI
Sbjct: 641 HNKKPYLKHLRVFGATVYVHIKNKQ-GKFDDKSFKSIFVGYEP--NGFKLWDAVNEKFIV 700
Query: 517 SRHVKFNENDF------PFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQ-------- 576
+R V +E + F +F ++ PN+++
Sbjct: 701 ARDVVVDETNMVNSRAVKFETVFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFL 760
Query: 577 --SNMAPNPQGPQLTSTTQITLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNP 636
S + N P S I F P S I ++ S N +
Sbjct: 761 KDSKESENKNFPN-DSRKIIQTEF-----PNESKECDNIQFLKDSKESNKYFLNESKKRK 820
Query: 637 NLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKAD 696
H S + +P S + L ID+ + N I I +++
Sbjct: 821 RDDHLNESKGS------GNPNESRESETAEHLKEIGIDNPTK--NDGIE----IINRRSE 880
Query: 697 IFKPKACISKSFTDWTLTE------------PTRIKEALI---TLQWKKAMDAEYCALLA 756
K K IS + D +L + P E W++A++ E A
Sbjct: 881 RLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKI 940
Query: 757 TNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFETFSPVV 816
NTW++ ++NIV S+W+F +K N G+ RYKARLVA+GF Q +D+ ETF+PV
Sbjct: 941 NNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVA 1000
Query: 817 KASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKA 876
+ S+ +LS+ + + Q+D AFLNG L+E +Y+ P S +S VCKLNKA
Sbjct: 1001 RISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGI--SCNSDNVCKLNKA 1060
Query: 877 IYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFIL---RCQHSIILLLAYVDDAIITGN 936
IYGLKQA R W +AL FVNS D ++IL +I +LL YVDD +I
Sbjct: 1061 IYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVLL-YVDDVVIATG 1120
Query: 937 DPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLK 996
D + + + L ++F + DL + +F+ +++ + LS+ YV +L + M +
Sbjct: 1121 DMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENCN 1180
Query: 997 A--APSPSVVGKTLSALDSKLLDDPSLYRSTIGALQY-LTNTRPDIAYMVNHLSQFLQRP 1056
A P PS + L D D + RS IG L Y + TRPD+ VN LS++ +
Sbjct: 1181 AVSTPLPSKINYELLNSDE---DCNTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKN 1240
Query: 1057 TDLHWQAVKRILRYISGTKQFGLLFQH--SFDLSISAFSDADWASNIDDRKFVSAYCV-F 1116
WQ +KR+LRY+ GT L+F+ +F+ I + D+DWA + DRK + Y
Sbjct: 1241 NSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKM 1300
Query: 1117 IGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNL 1176
NL+ W++K+Q VA SSTE+EY AL A E +WLK LL +++ I+ DN
Sbjct: 1301 FDFNLICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENPIKIYEDNQ 1360
Query: 1177 SAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQLANCLTKPLTHSQFRN 1198
++A NP H R KHI+I F R+QV + + Y+P+ +QLA+ TKPL ++F
Sbjct: 1361 GCISIANNPSCHKRAKHIDIKYHFAREQVQNNVICLEYIPTENQLADIFTKPLPAARFVE 1401
BLAST of CSPI04G22620 vs. ExPASy Swiss-Prot
Match:
P92519 (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 GN=AtMg00810 PE=4 SV=1)
HSP 1 Score: 191.0 bits (484), Expect = 7.8e-47
Identity = 97/224 (43.30%), Positives = 140/224 (62.50%), Query Frame = 0
Query: 883 LLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYV 942
LL YVDD ++TG+ +L+ L+ L F++KDLG + YFL Q+K G LS+ KY
Sbjct: 3 LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62
Query: 943 DDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQYLTNTRPDIAYMV 1002
+ +L+ M D K +P + K S++ + DPS +RS +GALQYLT TRPDI+Y V
Sbjct: 63 EQILNNAGMLDCKPMSTPLPL-KLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAV 122
Query: 1003 NHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKF 1062
N + Q + PT + +KR+LRY+ GT GL + L++ AF D+DWA R+
Sbjct: 123 NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182
Query: 1063 VSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIW 1107
+ +C F+G N++SWS+K+Q V+ SSTE+EYRALAL A E+ W
Sbjct: 183 TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225
BLAST of CSPI04G22620 vs. ExPASy TrEMBL
Match:
A0A2Z6MBG6 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_77270 PE=4 SV=1)
HSP 1 Score: 877.5 bits (2266), Expect = 6.6e-251
Identity = 456/928 (49.14%), Positives = 609/928 (65.62%), Query Frame = 0
Query: 275 QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLK 334
Q GK H LPF +S SHA+EP ++++D+WGPAP ++ F+YY+ F+DD+S +TWIYPLK
Sbjct: 472 QYGKMHLLPFKSSSSHAQEPLELVHTDVWGPAPIMTSSGFKYYVHFVDDFSRFTWIYPLK 531
Query: 335 QKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQHLCLNLGINCRFSCPYTSAQN 394
QKS V+AF F +NQFN IKV Q D GGEYK +Q L + GI R SCPYTS QN
Sbjct: 532 QKSETVQAFIQFKNLTENQFNKRIKVIQCDGGGEYKPVQKLAVEAGIQFRMSCPYTSQQN 591
Query: 395 GRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQK 454
GRAERKHRHI E GLTLLAQA M L+YWW+AF T V LIN + + Q SP + ++
Sbjct: 592 GRAERKHRHITEFGLTLLAQAQMPLHYWWEAFSTAVYLINRLPSQVTQNESPYSLMLQKE 651
Query: 455 LKVENLKIFGCVCFPCLRPYQPTKFSYHSEKCVYLGPSPTHKGFKCLSKTGRIFISRHVK 514
+ LK FGC C+PCL+PY K YH+ +CV+LG S +HKG+KCL+ GRIFISRHV
Sbjct: 652 PDYKLLKTFGCACYPCLKPYNQHKLQYHTTRCVFLGYSNSHKGYKCLNSHGRIFISRHVI 711
Query: 515 FNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQIT 574
FNE+ FPF D F +S T PS +F P T N+ + P L +
Sbjct: 712 FNEDHFPFHDGFLNTRSPLKTTINVPSTSF------PLCTAGNVIDDASMPILEA----- 771
Query: 575 LPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPY 634
+P + V+S TN+ P+ + + IT ++
Sbjct: 772 --------------ENPAETNTEDSQDVNSDTEQTNNGPSEDNTTHEETLDITQQQ---- 831
Query: 635 SSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADIFKPK-ACISKSFTDWTLTEP 694
S+G AS Q +N+ +H++ TR+K+ I KPK I + T EP
Sbjct: 832 ---------SVGEAS-----QNTNT---SHAIHTRSKSGIHKPKLPYIGLTETYKDTMEP 891
Query: 695 TRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRY 754
KEAL WK+AM E+ AL++ TW LVP + +NIV SKW+FK K DGS++R
Sbjct: 892 ANAKEALSRPLWKEAMQKEFEALMSNKTWILVPYQNQENIVDSKWVFKTKYKPDGSLERR 951
Query: 755 KARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEE 814
KARLVAKGF Q G+D+ ETFSPV+KAST+ ++LSI + W +RQLD NNAFLNG L+E
Sbjct: 952 KARLVAKGFQQTAGIDYEETFSPVIKASTVRIILSIAVHLNWEVRQLDINNAFLNGHLKE 1011
Query: 815 SVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI 874
+V++ QP +V S+ +++CKL+KAIYGLKQAPRAW ++L ALLNWGF N+KSDSSLF+
Sbjct: 1012 TVFMHQPEGFVDSTKPNHICKLSKAIYGLKQAPRAWFDSLKTALLNWGFQNTKSDSSLFL 1071
Query: 875 LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCG 934
L+ + I LL YVDD I+TG++ +Q+ + L+ F+LKDLG L YFL +V+ G
Sbjct: 1072 LKGKDHITFLLIYVDDIIVTGSNGKFLQAFIKQLNDAFSLKDLGHLHYFLGIEVQRDASG 1131
Query: 935 FILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQYLTN 994
L + KY+ DLL + +M + P+P + G+ + ++ + L DP+++R IG LQYLT+
Sbjct: 1132 MYLKQSKYIGDLLKKFKMDNASPCPTPMITGRQFT-VEGEKLKDPTVFRQAIGGLQYLTH 1191
Query: 995 TRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADW 1054
T PDIA+ VN LSQ++ P+ HWQ +KRILRY+ GT + L + S DL I+ FSDADW
Sbjct: 1192 TTPDIAFSVNKLSQYMSSPSIDHWQGIKRILRYLQGTINYCLHIKPSTDLDITGFSDADW 1251
Query: 1055 ASNIDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNE 1114
A++IDDRK +S CVF+G L+SWSS+KQ VV+ SSTESEYRALA A E+ W++ LL E
Sbjct: 1252 ATSIDDRKSMSGQCVFLGETLISWSSRKQKVVSRSSTESEYRALADLAAEIAWIRSLLTE 1311
Query: 1115 LDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ 1174
L++ KP++WCDNLSA ALA+NPV H R+KHIEIDV +IRDQVL+ + V YVP+TDQ
Sbjct: 1312 LELPLPRKPILWCDNLSAKALASNPVLHARSKHIEIDVHYIRDQVLQNEVVVAYVPTTDQ 1352
Query: 1175 LANCLTKPLTHSQFRNLRSKLGVVISPP 1202
+A+CLTKPL+H++F LR KLGV++SPP
Sbjct: 1372 IADCLTKPLSHTRFSQLRDKLGVILSPP 1352
BLAST of CSPI04G22620 vs. ExPASy TrEMBL
Match:
A0A2Z6P4D5 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_412550 PE=4 SV=1)
HSP 1 Score: 835.9 bits (2158), Expect = 2.2e-238
Identity = 451/946 (47.67%), Positives = 597/946 (63.11%), Query Frame = 0
Query: 275 QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLK 334
Q GK H LPF S SH +EP A+I+SD+WGPAP S F+YY+ F+DD+S +TWI+PLK
Sbjct: 472 QFGKLHLLPFKPSSSHVQEPLALIHSDVWGPAPILSPSGFKYYVHFIDDFSRFTWIFPLK 531
Query: 335 QKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQHLCLNLGINCRFSCPYTSAQN 394
QKS + AF F +NQFN IK+ Q D GGEYK +Q + + GI R SCPYTS QN
Sbjct: 532 QKSDTIHAFIQFKNLAENQFNKKIKIIQCDGGGEYKAVQKVSIEAGIQFRMSCPYTSQQN 591
Query: 395 GRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQK 454
GRAERKHRH+ E GLTLLAQA M L YWW+AF T V LIN + + SP +F ++
Sbjct: 592 GRAERKHRHVAELGLTLLAQAKMPLRYWWEAFSTAVYLINRLPSSVNPNESPYSLMFKRE 651
Query: 455 LKVENLKIFGCVCFPCLRPYQPTKFSYHSEKCVYLGPSPTHKGFKCLSKTGRIFISRHVK 514
LK FGC C+PCL+PY K +H+ +CV++G S +HKG+KC++ GRIF+SRHV
Sbjct: 652 PDYNALKPFGCACYPCLKPYNQHKLQFHTTRCVFVGYSNSHKGYKCINSHGRIFVSRHVI 711
Query: 515 FNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQIT 574
FNEN FPF F ++ T + + S+ + + TQ + P+ T++ Q T
Sbjct: 712 FNENHFPFHGGFLDTKNPLKTLTDNSSI-LLPTCSAGATTQDAIEPDNN----TTSDQNT 771
Query: 575 LPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPY 634
SI SS N N V S+ N+N + ST +NS
Sbjct: 772 ----------HSIESSDNN---ENEEQVDSSEFFVNTN--------NSSTQDIEADNSVD 831
Query: 635 SSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADIFKPK-ACISKSFTDWTLTEP 694
S S ++ +I Q NS+ TH M TR+K I KPK + + TD EP
Sbjct: 832 SEDRNNSTMT---GTIQQQAQQDNSN--THWMRTRSKDGIHKPKIPYVGMAETDSEEKEP 891
Query: 695 TRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRY 754
+KEAL WK+AMD EY AL++ +TW+LVP +NI+ SKWIFK K SDGSI+R
Sbjct: 892 KSVKEALGRPMWKEAMDKEYKALVSNHTWTLVPYQEQENIIDSKWIFKTKYKSDGSIERR 951
Query: 755 KARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEE 814
KARLVAKGF Q G+DF ETFSPVVK+ST+ ++L+I + W +RQLD NNAFLNG+L+E
Sbjct: 952 KARLVAKGFQQTAGLDFGETFSPVVKSSTVRIILTIAVHFNWEVRQLDINNAFLNGKLKE 1011
Query: 815 SVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI 874
+V++ QP Y+ ++ +++CKL+KAIYGLKQAPRAW ++L L+NWGF N+K+D+SLF
Sbjct: 1012 TVFMHQPEGYIDAAKPNHICKLSKAIYGLKQAPRAWYDSLRSTLVNWGFQNAKNDTSLFF 1071
Query: 875 LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCG 934
L+ LL YVDD I+TG++ +++ L+ ++LKDLG L YFL +V D G
Sbjct: 1072 LKGADHTTFLLIYVDDIIVTGSNIKFLEAFTNQLNTAYSLKDLGPLHYFLGVEVHRDDSG 1131
Query: 935 FILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQYLTN 994
L + KY+ D+L + M + A P+P V G+ A + +L+ +P+LYR IGALQYLTN
Sbjct: 1132 MYLRQTKYIRDVLKKFNMENTSACPTPMVTGRQFIA-EGELMSNPTLYRQAIGALQYLTN 1191
Query: 995 TRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADW 1054
TRPDIA+ VN LSQ++ PT HWQ +KRILRY+ GTK L + S +L I+ F DADW
Sbjct: 1192 TRPDIAFAVNKLSQYMSTPTIEHWQGIKRILRYLQGTKNHSLHIKPSTNLHIAGFLDADW 1251
Query: 1055 ASNIDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLK----- 1114
A++ DDRK CVF+G LVSW+S+KQ VV+ SSTESEYR+LA EV
Sbjct: 1252 ATSTDDRKSTGGQCVFLGETLVSWASRKQKVVSRSSTESEYRSLADLVAEVSTSSVATLL 1311
Query: 1115 --------------QLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFI 1174
LL EL + KPV+WCDNLSA ALA+NPV H R+KHIEID+ +I
Sbjct: 1312 SSERFLLAHFSTRFTLLEELKLPILRKPVLWCDNLSAKALASNPVMHARSKHIEIDMHYI 1371
Query: 1175 RDQVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRSKLGVVISP 1201
RDQVL+ + + YVP+ DQ+A+CLTKPL H++F +R KLGV +SP
Sbjct: 1372 RDQVLENKVTIAYVPTADQIADCLTKPLPHTRFNIMRDKLGVTVSP 1385
BLAST of CSPI04G22620 vs. ExPASy TrEMBL
Match:
A5AYB0 (Integrase catalytic domain-containing protein OS=Vitis vinifera OX=29760 GN=VITISV_041509 PE=4 SV=1)
HSP 1 Score: 815.1 bits (2104), Expect = 4.1e-232
Identity = 453/944 (47.99%), Positives = 591/944 (62.61%), Query Frame = 0
Query: 272 LISQLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIY 331
+I L KSH+LP+ S SHA P A+I++DLWGPAPS S RY+++F+DDYS +TWIY
Sbjct: 514 IICPLAKSHSLPYSLSSSHASHPLALIHTDLWGPAPSTSITGARYFLIFIDDYSRHTWIY 573
Query: 332 PLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQHLCLNLGINCRFSCPYTS 391
L K A+++F F V+NQ TIK QSDNGGE+ + GI +FSCP+T
Sbjct: 574 FLSTKDQALQSFITFRKMVENQLQTTIKCIQSDNGGEFLAFKPYLEAHGILHQFSCPHTP 633
Query: 392 AQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLF 451
QNGRAERK RH+VETGL L+AQ+ + YW AF T V LIN + L SP + LF
Sbjct: 634 QQNGRAERKIRHLVETGLALMAQSFLPSKYWTYAFQTAVYLINLLPAKLLHFQSPTQTLF 693
Query: 452 NQKLKVENLKIFGCVCFPCLRPYQPTKFSYHSEKCVYLGPSPTHKGFKCLS-KTGRIFIS 511
++ +L++FGC+CFP LRPY K Y S CV+LG +P HKG+ CL T RI+IS
Sbjct: 694 HKLPNYHHLRVFGCLCFPSLRPYTQHKLCYRSTACVFLGYAPAHKGYLCLDVSTNRIYIS 753
Query: 512 RHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTST 571
R+V F+E+ FPF QS+SP P+ P L S+
Sbjct: 754 RNVIFHESSFPF-------------QSSSP-------------------PSSPSPHLPSS 813
Query: 572 TQITLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRE 631
T + P P S SSPI IT ++ P PL P T
Sbjct: 814 TPALINSPSLSAPSSPAVSSPI-ITSDSXP------------------PLIPVPFAT--- 873
Query: 632 NSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADIFKPKACISKSFTDWTL 691
+SP + S PP PL+ TH M+TRAK+ I K +SF
Sbjct: 874 SSPAAPSPPPLPLN------------------THPMVTRAKSGIHK-----KRSFIVQHT 933
Query: 692 TEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSI 751
TEP +A W +AM++EY ALL NTWSLVPP SS +IVG +WI+KLK DGSI
Sbjct: 934 TEPRTYSQASKNDSWVQAMNSEYQALLRNNTWSLVPPPSSAHIVGCRWIYKLKYRPDGSI 993
Query: 752 QRYKARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQ 811
R+KARLVA+GF Q PG+D+F+TFSPVVK T+ ++L++ ++ W++RQLD NAFLNG
Sbjct: 994 DRHKARLVAQGFTQTPGIDYFDTFSPVVKPCTIRLILALAVSFQWSVRQLDVENAFLNGD 1053
Query: 812 LEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSS 871
LEE V++TQP +V ++ YVCKL+KA+YGLKQAPRAW L ALL++GF +S++D+S
Sbjct: 1054 LEEEVFMTQPQGFVNPTYPTYVCKLHKALYGLKQAPRAWFQKLRIALLDYGFQSSRADTS 1113
Query: 872 LFILRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYL 931
LFI I++LL YVDD ++TG++P L+ ++ L +FAL+DLG LSYFL Q + L
Sbjct: 1114 LFIFHTATDILILLVYVDDILVTGSNPMLVSHFISYLRTKFALRDLGPLSYFLGIQAQQL 1173
Query: 932 DCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQY 991
L++ KY+ DLL+R QM K AP+P +G+TLS D L DPS YR T+GALQY
Sbjct: 1174 GSVLHLNQHKYIADLLNRTQMETSKPAPTPGRLGRTLSQSDGMSLSDPSEYRRTVGALQY 1233
Query: 992 LTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSD 1051
+T TRPDIA+ VN QF+ +P+D+HW AVKRILRY+ GT GL FQ + + + +SD
Sbjct: 1234 VTLTRPDIAFAVNKACQFMAKPSDVHWMAVKRILRYLKGTIHLGLHFQPAASMELQGYSD 1293
Query: 1052 ADWASNIDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQL 1111
ADWAS DDR+ S YCVF+G+NL+SWSS KQ +V+ SS ESEYR L E++W++ L
Sbjct: 1294 ADWASCPDDRRSTSGYCVFLGSNLISWSSSKQRLVSKSSAESEYRGLVSLTAELVWIQSL 1353
Query: 1112 LNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPS 1171
L EL + TS P++WCDN SA LA NPVFH+R+KHIE+D+ FIR++VL+ L + YVPS
Sbjct: 1354 LQELCLPTS-PPILWCDNQSAAHLAANPVFHSRSKHIELDLHFIREKVLRQELQICYVPS 1376
Query: 1172 TDQLANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVRDNNCD 1215
DQLA+ TK L +QF NLRSKL V PP LRGD DN D
Sbjct: 1414 GDQLADIFTKHLPITQFCNLRSKL-TVTYPPLSLRGD--DNQTD 1376
BLAST of CSPI04G22620 vs. ExPASy TrEMBL
Match:
A0A438FWJ3 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_333 PE=4 SV=1)
HSP 1 Score: 810.8 bits (2093), Expect = 7.6e-231
Identity = 440/936 (47.01%), Positives = 582/936 (62.18%), Query Frame = 0
Query: 275 QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLK 334
Q KSH LPF S S A P A++++DLWGPA S RY+ILF+DD+S ++WIYPL
Sbjct: 511 QFAKSHKLPFNVSVSRASHPLALLHADLWGPASIPSTTGARYFILFVDDFSRFSWIYPLH 570
Query: 335 QKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQHLCLNLGINCRFSCPYTSAQN 394
K A+ F F V+NQFN+ I+ +SDNGGE+K GI +FSCPYT QN
Sbjct: 571 SKDQALSVFIKFKSLVENQFNSRIQCLRSDNGGEFKAFSSYLATHGIKSQFSCPYTPEQN 630
Query: 395 GRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQK 454
GRAERK RHI+ETGL LLA A++ +W AF T + LIN + T L SP + LF +
Sbjct: 631 GRAERKLRHIIETGLALLATASLPFKFWLYAFHTAIFLINRLPTKVLNYQSPFQILFGKS 690
Query: 455 LKVENLKIFGCVCFPCLRPYQPTKFSYHSEKCVYLGPSPTHKGFKCLSK-TGRIFISRHV 514
KIFGC+C+P +RPY K SY S +CV+LG S HKG+ CL+ TGR++++RHV
Sbjct: 691 PNYHIFKIFGCLCYPYIRPYNKNKLSYRSSQCVFLGYSSNHKGYMCLNPLTGRLYVTRHV 750
Query: 515 KFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQI 574
F+E FPF QST P+Q S++ +
Sbjct: 751 VFHETVFPF-------QST------------------PDQ---------------SSSVV 810
Query: 575 TLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSP 634
T+P P +P S P V S + T +PST+ P N P
Sbjct: 811 TIPTPAFLPCSS--------------PPVSSLRSHT-----------TPSTSSPPLTNMP 870
Query: 635 YSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADIFKPKACISKSFTDWTLTEP 694
S+ S P + + A I T +P ++ H M+TRAK I K K S ++EP
Sbjct: 871 SSTISLPDLIQVPFADIS-TSEPHPTN--QHPMVTRAKNGISKKKVYFSSH-----ISEP 930
Query: 695 TRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRY 754
T +A+ W AM+ E+ AL NTW LVPP S+ NI+G KW++KLK DG++ RY
Sbjct: 931 TTFTQAVKDSNWVLAMEKEFSALQRNNTWHLVPPPSNGNIIGCKWVYKLKYKPDGTVDRY 990
Query: 755 KARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEE 814
KARLVA+GF Q G+D+FETFSPVVKAST+ ++L++ L+ W++ QLD NAFL+G LEE
Sbjct: 991 KARLVAQGFTQTLGLDYFETFSPVVKASTIRIILAVALSFNWSVHQLDVQNAFLHGDLEE 1050
Query: 815 SVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI 874
V++ QPP ++ S + +VCKLNKA+YGLKQAPRAW N LS +LL WGF S++DSS+FI
Sbjct: 1051 HVFMQQPPGFINSQYPSHVCKLNKALYGLKQAPRAWYNKLSTSLLGWGFQASRADSSMFI 1110
Query: 875 LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCG 934
H +++LL YVDD ++TG+ + + S +T L+ FAL+DLG ++YFL +V
Sbjct: 1111 HHSTHDVLILLIYVDDILVTGSSSAQVSSFITRLNSSFALRDLGYVNYFLGIEVVRSGTM 1170
Query: 935 FILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQYLTN 994
F LS+ KY DLL R M D K A +P ++G+TLS LD + D +LYRST+GALQYLT
Sbjct: 1171 FHLSQHKYTQDLLSRTAMLDSKPATTPGLLGQTLSHLDGEPFSDATLYRSTVGALQYLTL 1230
Query: 995 TRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADW 1054
TRPDI++ VN QF+ PT HW AVKRILRY+ GT +G+ Q S L I ++DADW
Sbjct: 1231 TRPDISFAVNKACQFMATPTTTHWLAVKRILRYLKGTLSYGIQMQQSTSLDIHGYTDADW 1290
Query: 1055 ASNIDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNE 1114
AS DDR+ Y +F+G NLVSWSS KQ VV+ SS ESEYRALA A E+IW++ +L E
Sbjct: 1291 ASCPDDRRSTGGYGIFLGPNLVSWSSNKQKVVSRSSAESEYRALASATSEMIWIQYVLQE 1350
Query: 1115 LDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ 1174
L +S+S P++WCDN SA LA NPVFH RTKHIE+D+ FIRD VL+ L ++Y+PS +Q
Sbjct: 1351 LCLSSSSPPLLWCDNKSAAHLAANPVFHARTKHIEMDLHFIRDHVLRKQLVIQYLPSAEQ 1372
Query: 1175 LANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVR 1210
+A+ TK ++ SQF + R+KL VV S P LRGD R
Sbjct: 1411 VADIFTKHISSSQFLSFRTKLSVVPS-PVSLRGDDR 1372
BLAST of CSPI04G22620 vs. ExPASy TrEMBL
Match:
A0A438JAU4 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2773 PE=3 SV=1)
HSP 1 Score: 810.1 bits (2091), Expect = 1.3e-230
Identity = 440/936 (47.01%), Positives = 582/936 (62.18%), Query Frame = 0
Query: 275 QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLK 334
Q KSH LPF S S A P A++++DLWGPA S RY+ILF+DD+S ++WIYPL
Sbjct: 618 QFAKSHKLPFNVSVSRASHPLALLHADLWGPASIPSTTGARYFILFVDDFSRFSWIYPLH 677
Query: 335 QKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQHLCLNLGINCRFSCPYTSAQN 394
K A+ F F V+NQFN+ I+ +SDNGGE+K GI +FSCPYT QN
Sbjct: 678 SKDQALSVFIKFKSLVENQFNSRIQCLRSDNGGEFKAFSSYLATHGIKSQFSCPYTPEQN 737
Query: 395 GRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQK 454
GRAERK RHI+ETGL LLA A++ +W AF T + LIN + T L SP + LF +
Sbjct: 738 GRAERKLRHIIETGLALLATASLPFKFWLYAFHTTIFLINRLPTKVLNYQSPFQILFGKS 797
Query: 455 LKVENLKIFGCVCFPCLRPYQPTKFSYHSEKCVYLGPSPTHKGFKCLSK-TGRIFISRHV 514
KIFGC+C+P +RPY K SY S +CV+LG S HKG+ CL+ TGR++++RHV
Sbjct: 798 PNYHIFKIFGCLCYPYIRPYNKNKLSYRSSQCVFLGYSSNHKGYMCLNPLTGRLYVTRHV 857
Query: 515 KFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQI 574
F+E FPF QST P+Q S++ +
Sbjct: 858 VFHETVFPF-------QST------------------PDQ---------------SSSVV 917
Query: 575 TLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSP 634
T+P P +P S P V S + T +PST+ P N P
Sbjct: 918 TIPTPAFLPCSS--------------PPVSSLRSHT-----------TPSTSSPPLTNMP 977
Query: 635 YSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADIFKPKACISKSFTDWTLTEP 694
S+ S P + + A I T +P ++ H M+TRAK I K K S ++EP
Sbjct: 978 SSTISLPDLIQVPFADIS-TSEPHPTN--QHPMVTRAKNGISKKKVYFSSH-----ISEP 1037
Query: 695 TRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRY 754
T +A+ W AM+ E+ AL NTW LVPP S+ NI+G KW++KLK DG++ RY
Sbjct: 1038 TTFTQAVKDSNWVLAMEKEFSALQRNNTWHLVPPPSNGNIIGCKWVYKLKYKPDGTVDRY 1097
Query: 755 KARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEE 814
KARLVA+GF Q G+D+FETFSPVVKAST+ ++L++ L+ W++ QLD NAFL+G LEE
Sbjct: 1098 KARLVAQGFTQTLGLDYFETFSPVVKASTIRIILAVALSFNWSVHQLDVQNAFLHGDLEE 1157
Query: 815 SVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI 874
V++ QPP ++ S + +VCKLNKA+YGLKQAPRAW N LS +LL WGF S++DSS+FI
Sbjct: 1158 HVFMQQPPGFINSQYPSHVCKLNKALYGLKQAPRAWYNKLSTSLLGWGFQASRADSSMFI 1217
Query: 875 LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCG 934
H +++LL YVDD ++TG+ + + S +T L+ FAL+DLG ++YFL +V
Sbjct: 1218 HHSTHDVLILLIYVDDILVTGSSSAQVSSFITRLNYSFALRDLGYVNYFLGIEVVSSGTM 1277
Query: 935 FILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQYLTN 994
F LS+ KY DLL R M D K A +P ++G+TLS LD + D +LYRST+GALQYLT
Sbjct: 1278 FHLSQHKYTQDLLSRTAMLDSKPATTPGLLGQTLSHLDGEPFSDATLYRSTVGALQYLTL 1337
Query: 995 TRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADW 1054
TRPDI++ VN QF+ PT HW AVKRILRY+ GT +G+ Q S L I ++DADW
Sbjct: 1338 TRPDISFAVNKACQFMATPTTTHWLAVKRILRYLKGTLSYGIQMQQSTSLDIHGYTDADW 1397
Query: 1055 ASNIDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNE 1114
AS DDR+ Y +F+G NLVSWSS KQ VV+ SS ESEYRALA A E+IW++ +L E
Sbjct: 1398 ASCPDDRRSTGGYGIFLGPNLVSWSSNKQKVVSRSSAESEYRALASATSEMIWIQYVLQE 1457
Query: 1115 LDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ 1174
L +S+S P++WCDN SA LA NPVFH RTKHIE+D+ FIRD VL+ L ++Y+PS +Q
Sbjct: 1458 LCLSSSSPPLLWCDNKSAAHLAANPVFHARTKHIEMDLHFIRDHVLRKQLVIQYLPSAEQ 1479
Query: 1175 LANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVR 1210
+A+ TK ++ SQF + R+KL VV S P LRGD R
Sbjct: 1518 VADIFTKHISSSQFLSFRTKLSVVPS-PVSLRGDDR 1479
BLAST of CSPI04G22620 vs. NCBI nr
Match:
GAU19483.1 (hypothetical protein TSUD_77270 [Trifolium subterraneum])
HSP 1 Score: 877.5 bits (2266), Expect = 1.4e-250
Identity = 456/928 (49.14%), Positives = 609/928 (65.62%), Query Frame = 0
Query: 275 QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLK 334
Q GK H LPF +S SHA+EP ++++D+WGPAP ++ F+YY+ F+DD+S +TWIYPLK
Sbjct: 472 QYGKMHLLPFKSSSSHAQEPLELVHTDVWGPAPIMTSSGFKYYVHFVDDFSRFTWIYPLK 531
Query: 335 QKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQHLCLNLGINCRFSCPYTSAQN 394
QKS V+AF F +NQFN IKV Q D GGEYK +Q L + GI R SCPYTS QN
Sbjct: 532 QKSETVQAFIQFKNLTENQFNKRIKVIQCDGGGEYKPVQKLAVEAGIQFRMSCPYTSQQN 591
Query: 395 GRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQK 454
GRAERKHRHI E GLTLLAQA M L+YWW+AF T V LIN + + Q SP + ++
Sbjct: 592 GRAERKHRHITEFGLTLLAQAQMPLHYWWEAFSTAVYLINRLPSQVTQNESPYSLMLQKE 651
Query: 455 LKVENLKIFGCVCFPCLRPYQPTKFSYHSEKCVYLGPSPTHKGFKCLSKTGRIFISRHVK 514
+ LK FGC C+PCL+PY K YH+ +CV+LG S +HKG+KCL+ GRIFISRHV
Sbjct: 652 PDYKLLKTFGCACYPCLKPYNQHKLQYHTTRCVFLGYSNSHKGYKCLNSHGRIFISRHVI 711
Query: 515 FNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQIT 574
FNE+ FPF D F +S T PS +F P T N+ + P L +
Sbjct: 712 FNEDHFPFHDGFLNTRSPLKTTINVPSTSF------PLCTAGNVIDDASMPILEA----- 771
Query: 575 LPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPY 634
+P + V+S TN+ P+ + + IT ++
Sbjct: 772 --------------ENPAETNTEDSQDVNSDTEQTNNGPSEDNTTHEETLDITQQQ---- 831
Query: 635 SSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADIFKPK-ACISKSFTDWTLTEP 694
S+G AS Q +N+ +H++ TR+K+ I KPK I + T EP
Sbjct: 832 ---------SVGEAS-----QNTNT---SHAIHTRSKSGIHKPKLPYIGLTETYKDTMEP 891
Query: 695 TRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRY 754
KEAL WK+AM E+ AL++ TW LVP + +NIV SKW+FK K DGS++R
Sbjct: 892 ANAKEALSRPLWKEAMQKEFEALMSNKTWILVPYQNQENIVDSKWVFKTKYKPDGSLERR 951
Query: 755 KARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEE 814
KARLVAKGF Q G+D+ ETFSPV+KAST+ ++LSI + W +RQLD NNAFLNG L+E
Sbjct: 952 KARLVAKGFQQTAGIDYEETFSPVIKASTVRIILSIAVHLNWEVRQLDINNAFLNGHLKE 1011
Query: 815 SVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI 874
+V++ QP +V S+ +++CKL+KAIYGLKQAPRAW ++L ALLNWGF N+KSDSSLF+
Sbjct: 1012 TVFMHQPEGFVDSTKPNHICKLSKAIYGLKQAPRAWFDSLKTALLNWGFQNTKSDSSLFL 1071
Query: 875 LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCG 934
L+ + I LL YVDD I+TG++ +Q+ + L+ F+LKDLG L YFL +V+ G
Sbjct: 1072 LKGKDHITFLLIYVDDIIVTGSNGKFLQAFIKQLNDAFSLKDLGHLHYFLGIEVQRDASG 1131
Query: 935 FILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQYLTN 994
L + KY+ DLL + +M + P+P + G+ + ++ + L DP+++R IG LQYLT+
Sbjct: 1132 MYLKQSKYIGDLLKKFKMDNASPCPTPMITGRQFT-VEGEKLKDPTVFRQAIGGLQYLTH 1191
Query: 995 TRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADW 1054
T PDIA+ VN LSQ++ P+ HWQ +KRILRY+ GT + L + S DL I+ FSDADW
Sbjct: 1192 TTPDIAFSVNKLSQYMSSPSIDHWQGIKRILRYLQGTINYCLHIKPSTDLDITGFSDADW 1251
Query: 1055 ASNIDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNE 1114
A++IDDRK +S CVF+G L+SWSS+KQ VV+ SSTESEYRALA A E+ W++ LL E
Sbjct: 1252 ATSIDDRKSMSGQCVFLGETLISWSSRKQKVVSRSSTESEYRALADLAAEIAWIRSLLTE 1311
Query: 1115 LDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ 1174
L++ KP++WCDNLSA ALA+NPV H R+KHIEIDV +IRDQVL+ + V YVP+TDQ
Sbjct: 1312 LELPLPRKPILWCDNLSAKALASNPVLHARSKHIEIDVHYIRDQVLQNEVVVAYVPTTDQ 1352
Query: 1175 LANCLTKPLTHSQFRNLRSKLGVVISPP 1202
+A+CLTKPL+H++F LR KLGV++SPP
Sbjct: 1372 IADCLTKPLSHTRFSQLRDKLGVILSPP 1352
BLAST of CSPI04G22620 vs. NCBI nr
Match:
GAU51268.1 (hypothetical protein TSUD_412550 [Trifolium subterraneum])
HSP 1 Score: 835.9 bits (2158), Expect = 4.6e-238
Identity = 451/946 (47.67%), Positives = 597/946 (63.11%), Query Frame = 0
Query: 275 QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLK 334
Q GK H LPF S SH +EP A+I+SD+WGPAP S F+YY+ F+DD+S +TWI+PLK
Sbjct: 472 QFGKLHLLPFKPSSSHVQEPLALIHSDVWGPAPILSPSGFKYYVHFIDDFSRFTWIFPLK 531
Query: 335 QKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQHLCLNLGINCRFSCPYTSAQN 394
QKS + AF F +NQFN IK+ Q D GGEYK +Q + + GI R SCPYTS QN
Sbjct: 532 QKSDTIHAFIQFKNLAENQFNKKIKIIQCDGGGEYKAVQKVSIEAGIQFRMSCPYTSQQN 591
Query: 395 GRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQK 454
GRAERKHRH+ E GLTLLAQA M L YWW+AF T V LIN + + SP +F ++
Sbjct: 592 GRAERKHRHVAELGLTLLAQAKMPLRYWWEAFSTAVYLINRLPSSVNPNESPYSLMFKRE 651
Query: 455 LKVENLKIFGCVCFPCLRPYQPTKFSYHSEKCVYLGPSPTHKGFKCLSKTGRIFISRHVK 514
LK FGC C+PCL+PY K +H+ +CV++G S +HKG+KC++ GRIF+SRHV
Sbjct: 652 PDYNALKPFGCACYPCLKPYNQHKLQFHTTRCVFVGYSNSHKGYKCINSHGRIFVSRHVI 711
Query: 515 FNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQIT 574
FNEN FPF F ++ T + + S+ + + TQ + P+ T++ Q T
Sbjct: 712 FNENHFPFHGGFLDTKNPLKTLTDNSSI-LLPTCSAGATTQDAIEPDNN----TTSDQNT 771
Query: 575 LPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPY 634
SI SS N N V S+ N+N + ST +NS
Sbjct: 772 ----------HSIESSDNN---ENEEQVDSSEFFVNTN--------NSSTQDIEADNSVD 831
Query: 635 SSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADIFKPK-ACISKSFTDWTLTEP 694
S S ++ +I Q NS+ TH M TR+K I KPK + + TD EP
Sbjct: 832 SEDRNNSTMT---GTIQQQAQQDNSN--THWMRTRSKDGIHKPKIPYVGMAETDSEEKEP 891
Query: 695 TRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRY 754
+KEAL WK+AMD EY AL++ +TW+LVP +NI+ SKWIFK K SDGSI+R
Sbjct: 892 KSVKEALGRPMWKEAMDKEYKALVSNHTWTLVPYQEQENIIDSKWIFKTKYKSDGSIERR 951
Query: 755 KARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEE 814
KARLVAKGF Q G+DF ETFSPVVK+ST+ ++L+I + W +RQLD NNAFLNG+L+E
Sbjct: 952 KARLVAKGFQQTAGLDFGETFSPVVKSSTVRIILTIAVHFNWEVRQLDINNAFLNGKLKE 1011
Query: 815 SVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI 874
+V++ QP Y+ ++ +++CKL+KAIYGLKQAPRAW ++L L+NWGF N+K+D+SLF
Sbjct: 1012 TVFMHQPEGYIDAAKPNHICKLSKAIYGLKQAPRAWYDSLRSTLVNWGFQNAKNDTSLFF 1071
Query: 875 LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCG 934
L+ LL YVDD I+TG++ +++ L+ ++LKDLG L YFL +V D G
Sbjct: 1072 LKGADHTTFLLIYVDDIIVTGSNIKFLEAFTNQLNTAYSLKDLGPLHYFLGVEVHRDDSG 1131
Query: 935 FILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQYLTN 994
L + KY+ D+L + M + A P+P V G+ A + +L+ +P+LYR IGALQYLTN
Sbjct: 1132 MYLRQTKYIRDVLKKFNMENTSACPTPMVTGRQFIA-EGELMSNPTLYRQAIGALQYLTN 1191
Query: 995 TRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADW 1054
TRPDIA+ VN LSQ++ PT HWQ +KRILRY+ GTK L + S +L I+ F DADW
Sbjct: 1192 TRPDIAFAVNKLSQYMSTPTIEHWQGIKRILRYLQGTKNHSLHIKPSTNLHIAGFLDADW 1251
Query: 1055 ASNIDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLK----- 1114
A++ DDRK CVF+G LVSW+S+KQ VV+ SSTESEYR+LA EV
Sbjct: 1252 ATSTDDRKSTGGQCVFLGETLVSWASRKQKVVSRSSTESEYRSLADLVAEVSTSSVATLL 1311
Query: 1115 --------------QLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFI 1174
LL EL + KPV+WCDNLSA ALA+NPV H R+KHIEID+ +I
Sbjct: 1312 SSERFLLAHFSTRFTLLEELKLPILRKPVLWCDNLSAKALASNPVMHARSKHIEIDMHYI 1371
Query: 1175 RDQVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRSKLGVVISP 1201
RDQVL+ + + YVP+ DQ+A+CLTKPL H++F +R KLGV +SP
Sbjct: 1372 RDQVLENKVTIAYVPTADQIADCLTKPLPHTRFNIMRDKLGVTVSP 1385
BLAST of CSPI04G22620 vs. NCBI nr
Match:
CAN73924.1 (hypothetical protein VITISV_041509 [Vitis vinifera])
HSP 1 Score: 815.1 bits (2104), Expect = 8.4e-232
Identity = 453/944 (47.99%), Positives = 591/944 (62.61%), Query Frame = 0
Query: 272 LISQLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIY 331
+I L KSH+LP+ S SHA P A+I++DLWGPAPS S RY+++F+DDYS +TWIY
Sbjct: 514 IICPLAKSHSLPYSLSSSHASHPLALIHTDLWGPAPSTSITGARYFLIFIDDYSRHTWIY 573
Query: 332 PLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQHLCLNLGINCRFSCPYTS 391
L K A+++F F V+NQ TIK QSDNGGE+ + GI +FSCP+T
Sbjct: 574 FLSTKDQALQSFITFRKMVENQLQTTIKCIQSDNGGEFLAFKPYLEAHGILHQFSCPHTP 633
Query: 392 AQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLF 451
QNGRAERK RH+VETGL L+AQ+ + YW AF T V LIN + L SP + LF
Sbjct: 634 QQNGRAERKIRHLVETGLALMAQSFLPSKYWTYAFQTAVYLINLLPAKLLHFQSPTQTLF 693
Query: 452 NQKLKVENLKIFGCVCFPCLRPYQPTKFSYHSEKCVYLGPSPTHKGFKCLS-KTGRIFIS 511
++ +L++FGC+CFP LRPY K Y S CV+LG +P HKG+ CL T RI+IS
Sbjct: 694 HKLPNYHHLRVFGCLCFPSLRPYTQHKLCYRSTACVFLGYAPAHKGYLCLDVSTNRIYIS 753
Query: 512 RHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTST 571
R+V F+E+ FPF QS+SP P+ P L S+
Sbjct: 754 RNVIFHESSFPF-------------QSSSP-------------------PSSPSPHLPSS 813
Query: 572 TQITLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRE 631
T + P P S SSPI IT ++ P PL P T
Sbjct: 814 TPALINSPSLSAPSSPAVSSPI-ITSDSXP------------------PLIPVPFAT--- 873
Query: 632 NSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADIFKPKACISKSFTDWTL 691
+SP + S PP PL+ TH M+TRAK+ I K +SF
Sbjct: 874 SSPAAPSPPPLPLN------------------THPMVTRAKSGIHK-----KRSFIVQHT 933
Query: 692 TEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSI 751
TEP +A W +AM++EY ALL NTWSLVPP SS +IVG +WI+KLK DGSI
Sbjct: 934 TEPRTYSQASKNDSWVQAMNSEYQALLRNNTWSLVPPPSSAHIVGCRWIYKLKYRPDGSI 993
Query: 752 QRYKARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQ 811
R+KARLVA+GF Q PG+D+F+TFSPVVK T+ ++L++ ++ W++RQLD NAFLNG
Sbjct: 994 DRHKARLVAQGFTQTPGIDYFDTFSPVVKPCTIRLILALAVSFQWSVRQLDVENAFLNGD 1053
Query: 812 LEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSS 871
LEE V++TQP +V ++ YVCKL+KA+YGLKQAPRAW L ALL++GF +S++D+S
Sbjct: 1054 LEEEVFMTQPQGFVNPTYPTYVCKLHKALYGLKQAPRAWFQKLRIALLDYGFQSSRADTS 1113
Query: 872 LFILRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYL 931
LFI I++LL YVDD ++TG++P L+ ++ L +FAL+DLG LSYFL Q + L
Sbjct: 1114 LFIFHTATDILILLVYVDDILVTGSNPMLVSHFISYLRTKFALRDLGPLSYFLGIQAQQL 1173
Query: 932 DCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQY 991
L++ KY+ DLL+R QM K AP+P +G+TLS D L DPS YR T+GALQY
Sbjct: 1174 GSVLHLNQHKYIADLLNRTQMETSKPAPTPGRLGRTLSQSDGMSLSDPSEYRRTVGALQY 1233
Query: 992 LTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSD 1051
+T TRPDIA+ VN QF+ +P+D+HW AVKRILRY+ GT GL FQ + + + +SD
Sbjct: 1234 VTLTRPDIAFAVNKACQFMAKPSDVHWMAVKRILRYLKGTIHLGLHFQPAASMELQGYSD 1293
Query: 1052 ADWASNIDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQL 1111
ADWAS DDR+ S YCVF+G+NL+SWSS KQ +V+ SS ESEYR L E++W++ L
Sbjct: 1294 ADWASCPDDRRSTSGYCVFLGSNLISWSSSKQRLVSKSSAESEYRGLVSLTAELVWIQSL 1353
Query: 1112 LNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPS 1171
L EL + TS P++WCDN SA LA NPVFH+R+KHIE+D+ FIR++VL+ L + YVPS
Sbjct: 1354 LQELCLPTS-PPILWCDNQSAAHLAANPVFHSRSKHIELDLHFIREKVLRQELQICYVPS 1376
Query: 1172 TDQLANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVRDNNCD 1215
DQLA+ TK L +QF NLRSKL V PP LRGD DN D
Sbjct: 1414 GDQLADIFTKHLPITQFCNLRSKL-TVTYPPLSLRGD--DNQTD 1376
BLAST of CSPI04G22620 vs. NCBI nr
Match:
RVW64314.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])
HSP 1 Score: 810.8 bits (2093), Expect = 1.6e-230
Identity = 440/936 (47.01%), Positives = 582/936 (62.18%), Query Frame = 0
Query: 275 QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLK 334
Q KSH LPF S S A P A++++DLWGPA S RY+ILF+DD+S ++WIYPL
Sbjct: 511 QFAKSHKLPFNVSVSRASHPLALLHADLWGPASIPSTTGARYFILFVDDFSRFSWIYPLH 570
Query: 335 QKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQHLCLNLGINCRFSCPYTSAQN 394
K A+ F F V+NQFN+ I+ +SDNGGE+K GI +FSCPYT QN
Sbjct: 571 SKDQALSVFIKFKSLVENQFNSRIQCLRSDNGGEFKAFSSYLATHGIKSQFSCPYTPEQN 630
Query: 395 GRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQK 454
GRAERK RHI+ETGL LLA A++ +W AF T + LIN + T L SP + LF +
Sbjct: 631 GRAERKLRHIIETGLALLATASLPFKFWLYAFHTAIFLINRLPTKVLNYQSPFQILFGKS 690
Query: 455 LKVENLKIFGCVCFPCLRPYQPTKFSYHSEKCVYLGPSPTHKGFKCLSK-TGRIFISRHV 514
KIFGC+C+P +RPY K SY S +CV+LG S HKG+ CL+ TGR++++RHV
Sbjct: 691 PNYHIFKIFGCLCYPYIRPYNKNKLSYRSSQCVFLGYSSNHKGYMCLNPLTGRLYVTRHV 750
Query: 515 KFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQI 574
F+E FPF QST P+Q S++ +
Sbjct: 751 VFHETVFPF-------QST------------------PDQ---------------SSSVV 810
Query: 575 TLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSP 634
T+P P +P S P V S + T +PST+ P N P
Sbjct: 811 TIPTPAFLPCSS--------------PPVSSLRSHT-----------TPSTSSPPLTNMP 870
Query: 635 YSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADIFKPKACISKSFTDWTLTEP 694
S+ S P + + A I T +P ++ H M+TRAK I K K S ++EP
Sbjct: 871 SSTISLPDLIQVPFADIS-TSEPHPTN--QHPMVTRAKNGISKKKVYFSSH-----ISEP 930
Query: 695 TRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRY 754
T +A+ W AM+ E+ AL NTW LVPP S+ NI+G KW++KLK DG++ RY
Sbjct: 931 TTFTQAVKDSNWVLAMEKEFSALQRNNTWHLVPPPSNGNIIGCKWVYKLKYKPDGTVDRY 990
Query: 755 KARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEE 814
KARLVA+GF Q G+D+FETFSPVVKAST+ ++L++ L+ W++ QLD NAFL+G LEE
Sbjct: 991 KARLVAQGFTQTLGLDYFETFSPVVKASTIRIILAVALSFNWSVHQLDVQNAFLHGDLEE 1050
Query: 815 SVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI 874
V++ QPP ++ S + +VCKLNKA+YGLKQAPRAW N LS +LL WGF S++DSS+FI
Sbjct: 1051 HVFMQQPPGFINSQYPSHVCKLNKALYGLKQAPRAWYNKLSTSLLGWGFQASRADSSMFI 1110
Query: 875 LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCG 934
H +++LL YVDD ++TG+ + + S +T L+ FAL+DLG ++YFL +V
Sbjct: 1111 HHSTHDVLILLIYVDDILVTGSSSAQVSSFITRLNSSFALRDLGYVNYFLGIEVVRSGTM 1170
Query: 935 FILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQYLTN 994
F LS+ KY DLL R M D K A +P ++G+TLS LD + D +LYRST+GALQYLT
Sbjct: 1171 FHLSQHKYTQDLLSRTAMLDSKPATTPGLLGQTLSHLDGEPFSDATLYRSTVGALQYLTL 1230
Query: 995 TRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADW 1054
TRPDI++ VN QF+ PT HW AVKRILRY+ GT +G+ Q S L I ++DADW
Sbjct: 1231 TRPDISFAVNKACQFMATPTTTHWLAVKRILRYLKGTLSYGIQMQQSTSLDIHGYTDADW 1290
Query: 1055 ASNIDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNE 1114
AS DDR+ Y +F+G NLVSWSS KQ VV+ SS ESEYRALA A E+IW++ +L E
Sbjct: 1291 ASCPDDRRSTGGYGIFLGPNLVSWSSNKQKVVSRSSAESEYRALASATSEMIWIQYVLQE 1350
Query: 1115 LDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ 1174
L +S+S P++WCDN SA LA NPVFH RTKHIE+D+ FIRD VL+ L ++Y+PS +Q
Sbjct: 1351 LCLSSSSPPLLWCDNKSAAHLAANPVFHARTKHIEMDLHFIRDHVLRKQLVIQYLPSAEQ 1372
Query: 1175 LANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVR 1210
+A+ TK ++ SQF + R+KL VV S P LRGD R
Sbjct: 1411 VADIFTKHISSSQFLSFRTKLSVVPS-PVSLRGDDR 1372
BLAST of CSPI04G22620 vs. NCBI nr
Match:
RVX06084.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])
HSP 1 Score: 810.1 bits (2091), Expect = 2.7e-230
Identity = 440/936 (47.01%), Positives = 582/936 (62.18%), Query Frame = 0
Query: 275 QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLK 334
Q KSH LPF S S A P A++++DLWGPA S RY+ILF+DD+S ++WIYPL
Sbjct: 618 QFAKSHKLPFNVSVSRASHPLALLHADLWGPASIPSTTGARYFILFVDDFSRFSWIYPLH 677
Query: 335 QKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQHLCLNLGINCRFSCPYTSAQN 394
K A+ F F V+NQFN+ I+ +SDNGGE+K GI +FSCPYT QN
Sbjct: 678 SKDQALSVFIKFKSLVENQFNSRIQCLRSDNGGEFKAFSSYLATHGIKSQFSCPYTPEQN 737
Query: 395 GRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQK 454
GRAERK RHI+ETGL LLA A++ +W AF T + LIN + T L SP + LF +
Sbjct: 738 GRAERKLRHIIETGLALLATASLPFKFWLYAFHTTIFLINRLPTKVLNYQSPFQILFGKS 797
Query: 455 LKVENLKIFGCVCFPCLRPYQPTKFSYHSEKCVYLGPSPTHKGFKCLSK-TGRIFISRHV 514
KIFGC+C+P +RPY K SY S +CV+LG S HKG+ CL+ TGR++++RHV
Sbjct: 798 PNYHIFKIFGCLCYPYIRPYNKNKLSYRSSQCVFLGYSSNHKGYMCLNPLTGRLYVTRHV 857
Query: 515 KFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQI 574
F+E FPF QST P+Q S++ +
Sbjct: 858 VFHETVFPF-------QST------------------PDQ---------------SSSVV 917
Query: 575 TLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSP 634
T+P P +P S P V S + T +PST+ P N P
Sbjct: 918 TIPTPAFLPCSS--------------PPVSSLRSHT-----------TPSTSSPPLTNMP 977
Query: 635 YSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADIFKPKACISKSFTDWTLTEP 694
S+ S P + + A I T +P ++ H M+TRAK I K K S ++EP
Sbjct: 978 SSTISLPDLIQVPFADIS-TSEPHPTN--QHPMVTRAKNGISKKKVYFSSH-----ISEP 1037
Query: 695 TRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRY 754
T +A+ W AM+ E+ AL NTW LVPP S+ NI+G KW++KLK DG++ RY
Sbjct: 1038 TTFTQAVKDSNWVLAMEKEFSALQRNNTWHLVPPPSNGNIIGCKWVYKLKYKPDGTVDRY 1097
Query: 755 KARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEE 814
KARLVA+GF Q G+D+FETFSPVVKAST+ ++L++ L+ W++ QLD NAFL+G LEE
Sbjct: 1098 KARLVAQGFTQTLGLDYFETFSPVVKASTIRIILAVALSFNWSVHQLDVQNAFLHGDLEE 1157
Query: 815 SVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI 874
V++ QPP ++ S + +VCKLNKA+YGLKQAPRAW N LS +LL WGF S++DSS+FI
Sbjct: 1158 HVFMQQPPGFINSQYPSHVCKLNKALYGLKQAPRAWYNKLSTSLLGWGFQASRADSSMFI 1217
Query: 875 LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCG 934
H +++LL YVDD ++TG+ + + S +T L+ FAL+DLG ++YFL +V
Sbjct: 1218 HHSTHDVLILLIYVDDILVTGSSSAQVSSFITRLNYSFALRDLGYVNYFLGIEVVSSGTM 1277
Query: 935 FILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQYLTN 994
F LS+ KY DLL R M D K A +P ++G+TLS LD + D +LYRST+GALQYLT
Sbjct: 1278 FHLSQHKYTQDLLSRTAMLDSKPATTPGLLGQTLSHLDGEPFSDATLYRSTVGALQYLTL 1337
Query: 995 TRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADW 1054
TRPDI++ VN QF+ PT HW AVKRILRY+ GT +G+ Q S L I ++DADW
Sbjct: 1338 TRPDISFAVNKACQFMATPTTTHWLAVKRILRYLKGTLSYGIQMQQSTSLDIHGYTDADW 1397
Query: 1055 ASNIDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNE 1114
AS DDR+ Y +F+G NLVSWSS KQ VV+ SS ESEYRALA A E+IW++ +L E
Sbjct: 1398 ASCPDDRRSTGGYGIFLGPNLVSWSSNKQKVVSRSSAESEYRALASATSEMIWIQYVLQE 1457
Query: 1115 LDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ 1174
L +S+S P++WCDN SA LA NPVFH RTKHIE+D+ FIRD VL+ L ++Y+PS +Q
Sbjct: 1458 LCLSSSSPPLLWCDNKSAAHLAANPVFHARTKHIEMDLHFIRDHVLRKQLVIQYLPSAEQ 1479
Query: 1175 LANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVR 1210
+A+ TK ++ SQF + R+KL VV S P LRGD R
Sbjct: 1518 VADIFTKHISSSQFLSFRTKLSVVPS-PVSLRGDDR 1479
BLAST of CSPI04G22620 vs. TAIR 10
Match:
AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8)
HSP 1 Score: 376.3 bits (965), Expect = 9.3e-104
Identity = 205/482 (42.53%), Positives = 280/482 (58.09%), Query Frame = 0
Query: 680 CISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWI 739
CI+K+ EP+ EA L W AMD E A+ T+TW + ++ +G KW+
Sbjct: 79 CIAKA------KEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWV 138
Query: 740 FKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQ 799
+K+K NSDG+I+RYKARLVAKG+ Q G+DF ETFSPV K +++ ++L+I +TL Q
Sbjct: 139 YKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQ 198
Query: 800 LDFNNAFLNGQLEESVYITQPPEYVG----SSHSHYVCKLNKAIYGLKQAPRAWNNTLSK 859
LD +NAFLNG L+E +Y+ PP Y S + VC L K+IYGLKQA R W S
Sbjct: 199 LDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSV 258
Query: 860 ALLNWGFVNSKSDSSLFILRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKD 919
L+ +GFV S SD + F+ + +L YVDD II N+ + + L + L F L+D
Sbjct: 259 TLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRD 318
Query: 920 LGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLL 979
LG L YFL ++ G + + KY DLL + K + P T SA
Sbjct: 319 LGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDF 378
Query: 980 DDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGL 1039
D YR IG L YL TR DI++ VN LSQF + P H QAV +IL YI GT GL
Sbjct: 379 VDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGL 438
Query: 1040 LFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYR 1099
+ ++ + FSDA + S D R+ + YC+F+G +L+SW SKKQ VV+ SS E+EYR
Sbjct: 439 FYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYR 498
Query: 1100 ALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIR 1158
AL+ A E++WL Q EL + S +++CDN +A +ATN VFH RTKHIE D +R
Sbjct: 499 ALSFATDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCHSVR 554
BLAST of CSPI04G22620 vs. TAIR 10
Match:
ATMG00810.1 (DNA/RNA polymerases superfamily protein)
HSP 1 Score: 191.0 bits (484), Expect = 5.5e-48
Identity = 97/224 (43.30%), Positives = 140/224 (62.50%), Query Frame = 0
Query: 883 LLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYV 942
LL YVDD ++TG+ +L+ L+ L F++KDLG + YFL Q+K G LS+ KY
Sbjct: 3 LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62
Query: 943 DDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQYLTNTRPDIAYMV 1002
+ +L+ M D K +P + K S++ + DPS +RS +GALQYLT TRPDI+Y V
Sbjct: 63 EQILNNAGMLDCKPMSTPLPL-KLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAV 122
Query: 1003 NHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKF 1062
N + Q + PT + +KR+LRY+ GT GL + L++ AF D+DWA R+
Sbjct: 123 NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182
Query: 1063 VSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIW 1107
+ +C F+G N++SWS+K+Q V+ SSTE+EYRALAL A E+ W
Sbjct: 183 TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225
BLAST of CSPI04G22620 vs. TAIR 10
Match:
ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase))
HSP 1 Score: 113.6 bits (283), Expect = 1.1e-24
Identity = 59/124 (47.58%), Positives = 78/124 (62.90%), Query Frame = 0
Query: 666 MITRAKADIFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLV 725
M+TR+KA I K S + T EP + AL W +AM E AL TW LV
Sbjct: 1 MLTRSKAGINKLNPKYSLTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILV 60
Query: 726 PPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFETFSPVVKASTLWV 785
PP +QNI+G KW+FK K +SDG++ R KARLVAKGFHQ G+ F ET+SPVV+ +T+
Sbjct: 61 PPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRT 120
Query: 786 VLSI 790
+L++
Sbjct: 121 ILNV 124
BLAST of CSPI04G22620 vs. TAIR 10
Match:
ATMG00240.1 (Gag-Pol-related retrotransposon family protein)
HSP 1 Score: 75.1 bits (183), Expect = 4.4e-13
Identity = 35/81 (43.21%), Positives = 49/81 (60.49%), Query Frame = 0
Query: 990 YLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFS 1049
YLT TRPD+ + VN LSQF QAV ++L Y+ GT GL + + DL + AF+
Sbjct: 2 YLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAFA 61
Query: 1050 DADWASNIDDRKFVSAYCVFI 1071
D+DWAS D R+ V+ +C +
Sbjct: 62 DSDWASCPDTRRSVTGFCSLV 82
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q94HW2 | 2.9e-211 | 43.78 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94 | 1.8e-208 | 43.30 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P10978 | 4.8e-113 | 31.49 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146 | 5.3e-104 | 30.79 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
P92519 | 7.8e-47 | 43.30 | Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 ...
Match Name | E-value | Identity (%) | Description
A0A2Z6MBG6 | 6.6e-251 | 49.14 | Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ...
A0A2Z6P4D5 | 2.2e-238 | 47.67 | Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ...
A5AYB0 | 4.1e-232 | 47.99 | Integrase catalytic domain-containing protein OS=Vitis vinifera OX=29760 GN=VITI...
A0A438FWJ3 | 7.6e-231 | 47.01 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX...
A0A438JAU4 | 1.3e-230 | 47.01 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX...
Match Name | E-value | Identity (%) | Description
GAU19483.1 | 1.4e-250 | 49.14 | hypothetical protein TSUD_77270 [Trifolium subterraneum]
GAU51268.1 | 4.6e-238 | 47.67 | hypothetical protein TSUD_412550 [Trifolium subterraneum]
CAN73924.1 | 8.4e-232 | 47.99 | hypothetical protein VITISV_041509 [Vitis vinifera]
RVW64314.1 | 1.6e-230 | 47.01 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]
RVX06084.1 | 2.7e-230 | 47.01 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]
Match Name | E-value | Identity (%) | Description
AT4G23160.1 | 9.3e-104 | 42.53 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1 | 5.5e-48 | 43.30 | DNA/RNA polymerases superfamily protein
ATMG00820.1 | 1.1e-24 | 47.58 | Reverse transcriptase (RNA-dependent DNA polymerase)
ATMG00240.1 | 4.4e-13 | 43.21 | Gag-Pol-related retrotransposon family protein