Homology
BLAST of CSPI04G12000 vs. ExPASy Swiss-Prot
Match:
P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)
HSP 1 Score: 726.9 bits (1875), Expect = 4.1e-208
Identity = 459/1308 (35.09%), Postives = 708/1308 (54.13%), Query Frame = 0
Query: 25 NEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEG 84
+E+A I +SD + TA+ I L ++ + + K +Y M EG
Sbjct: 56 DERAASAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEG 115
Query: 85 TSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNL--- 144
T+ H+ G IEE ++ +L SLP S+ T K L
Sbjct: 116 TNFLSHLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDV 175
Query: 145 --TTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGK 204
LLNE R + G+ E R + R S++ ++G A+ K K + K
Sbjct: 176 TSALLLNEKMRKKPENQGQALITEGR----GRSYQRSSNNYGRSG-----ARGKSKNRSK 235
Query: 205 TPKQNKGKKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD-----LLV----- 264
+ +N CY+C Q GH+ R+CP +K + ET G+ + +V
Sbjct: 236 SRVRN----------CYNCNQPGHFKRDCPN---PRKGKGETSGQKNDDNTAAMVQNNDN 295
Query: 265 VETCLVEYENC--------TWILDSGATNHICFSFQENSSWKKLSEGEI-TLKVGTGEMV 324
V + E E C W++D+ A++H + + + G+ T+K+G
Sbjct: 296 VVLFINEEEECMHLSGPESEWVVDTAASHH---ATPVRDLFCRYVAGDFGTVKMGNTSYS 355
Query: 325 SASAVGDLKLFFR-DRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKG- 384
+ +GD+ + ++LK+V +VP ++ NLIS + Y S+ N+ + KG
Sbjct: 356 KIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALDRDGYE-SYFANQKWRLTKGS 415
Query: 385 ILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRI 444
+++ + LY+ N EI + E + ++S + LWH R+GH++ +
Sbjct: 416 LVIAKGVARGTLYR---------TNAEICQ-GELNAAQDEISVD--LWHKRMGHMSEKGL 475
Query: 445 GRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKAR 504
L K L+S + ++ PC+ CL GK + SF R L+LV+SD+CGPM +++
Sbjct: 476 QILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESM 535
Query: 505 GGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMD 564
GG +YF++FIDD SR +Y++ K + F+++ A VE E G+ +K LRSD GGEY
Sbjct: 536 GGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTS 595
Query: 565 LRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETA 624
F +Y +GI+ + + P TPQ NGV+ER NRT+++ VRSM+ +++ SFWG A++TA
Sbjct: 596 REFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTA 655
Query: 625 AYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCP--AHVLVQNPKKLEHRSKLCFFI 684
Y++N PS ++ E P +W ++ S H +++GC AHV + KL+ +S C FI
Sbjct: 656 CYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFI 715
Query: 685 GYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI------SKSAIDK 744
GY E G +DP + K+ S + F E +R S+ V I S +
Sbjct: 716 GYGDEEFGYRLWDPVKKKVIRSRDVVF-RESEVRTAADMSEKVKNGIIPNFVTIPSTSNN 775
Query: 745 PSSSTKVVDKTRKSGQ-------------------SHPSQ---QLREPRRSGRVVHQPDR 804
P+S+ D+ + G+ HP+Q Q + RRS R + R
Sbjct: 776 PTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEHPTQGEEQHQPLRRSERPRVESRR 835
Query: 805 YLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDV 864
Y V+I DD +P + K+ + +++Q +KAM EMES+ N + LV+ P
Sbjct: 836 YPS--TEYVLISDD--REPESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGK 895
Query: 865 KPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIAT 924
+P+ CKW++K K+D K+ +KARLV KG+ Q++G+D++E FSPV + SIR +LS+A
Sbjct: 896 RPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAA 955
Query: 925 FYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQACSRSWNI 984
D E+ Q+DVKTAFL+G+LEE IYM QPEGF ++ VCKL KS+YGL+ R W +
Sbjct: 956 SLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYM 1015
Query: 985 RFDTAIKSYGFEQNVDEPCVY-KKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQ 1044
+FD+ +KS + + +PCVY K+ + L+LYVDD+L++G D G + +K L+
Sbjct: 1016 KFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKS 1075
Query: 1045 FQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLS 1104
F MKDLG AQ +LG++IVR R ++ L +SQ YI+++L R+ M+N+K P + LS
Sbjct: 1076 FDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLS 1135
Query: 1105 KEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVK 1164
K+ CP T +E +M +PY+SAVGSLMYAM+CTRPDI ++VG+VSR+ NPG++HW AVK
Sbjct: 1136 KKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVK 1195
Query: 1165 NILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQ 1224
IL+YLR T L +G D IL GYTD+D D D RKS++G +FT +G A+ W+S Q
Sbjct: 1196 WILRYLRGTTGDCLCFGGSDPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQ 1255
Query: 1225 ICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANSREPRS 1275
C+ ST EAEY+A E KE +W ++FL +L + ++ +YCD+ A+ S+
Sbjct: 1256 KCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQKEYV---VYCDSQSAIDLSKNSMY 1315
BLAST of CSPI04G12000 vs. ExPASy Swiss-Prot
Match:
P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)
HSP 1 Score: 520.8 bits (1340), Expect = 4.5e-146
Identity = 392/1409 (27.82%), Postives = 666/1409 (47.27%), Query Frame = 0
Query: 23 KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMK 82
KA A+ I+ +SD S TA++I+++L ++ + + + K + + ++
Sbjct: 50 KAERCAKSTIIEYLSDSFLNFATSDITARQILENLDAVYERKSLASQLALRKRLLSLKLS 109
Query: 83 EGTSVRE--HVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSF----IPFQTNASLN- 142
S+ H+ D ++ +A G IEE++++S +L +LP + +T + N
Sbjct: 110 SEMSLLSHFHIFDELISELLAA--GAKIEEMDKISHLLITLPSCYDGIITAIETLSEENL 169
Query: 143 KIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKK 202
+ F LL++ + +N K+V + + + K + +KP K
Sbjct: 170 TLAFVKNRLLDQEIKIKNDHNDTSKKVMNAIVHNNNNTYKNNLFKNRV--TKPKKIFKGN 229
Query: 203 GKGKTPKQNKGKKGAEKGKCYHCGQNGHWLRNCPKYLA---------EKKAEKETQGKYD 262
K K KC+HCG+ GH ++C Y EK+ + T
Sbjct: 230 SKYKV-------------KCHHCGREGHIKKDCFHYKRILNNKNKENEKQVQTATSHGIA 289
Query: 263 LLVVETCLVE-YENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGT---GEMVSA 322
+V E +NC ++LDSGA++H+ + S + E LK+ GE + A
Sbjct: 290 FMVKEVNNTSVMDNCGFVLDSGASDHL---INDESLYTDSVEVVPPLKIAVAKQGEFIYA 349
Query: 323 SAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVC 382
+ G ++L D + L++VL+ + NL+S+ + E I F+ + + G++V
Sbjct: 350 TKRGIVRL-RNDHEITLEDVLFCKEAAGNLMSVKRLQEAGMSIEFDKSGVTISKNGLMV- 409
Query: 383 SAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLV 442
+ + +LN + + K +N LWH R GHI+ ++ +
Sbjct: 410 -------------VKNSGMLNNVPVINFQAYSINAKHKNNFRLWHERFGHISDGKLLEIK 469
Query: 443 KSGLLSPLE-----DNSLPPCESCLEGKMTKRSFTGKGLR----AKGPLELVHSDLCGPM 502
+ + S + S CE CL GK + F K L+ K PL +VHSD+CGP+
Sbjct: 470 RKNMFSDQSLLNNLELSCEICEPCLNGKQARLPF--KQLKDKTHIKRPLFVVHSDVCGPI 529
Query: 503 NVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRG 562
YF+ F+D ++ Y YLI +KS+ F+++ A+ E + L D G
Sbjct: 530 TPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNG 589
Query: 563 GEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGY 622
EY+ R + ++ GI L+ P TPQ NGVSER RT+ + R+M+S +++ SFWG
Sbjct: 590 REYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGE 649
Query: 623 ALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRIWGCPAHVLVQNPK-KLEHRS 682
A+ TA Y++N +PS+++ S+TPYE+W +K L+H R++G +V ++N + K + +S
Sbjct: 650 AVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVHIKNKQGKFDDKS 709
Query: 683 KLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDK 742
F+GY E G +D K V+ + E + + + + V + SK + +K
Sbjct: 710 FKSIFVGY--EPNGFKLWDAVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDSKESENK 769
Query: 743 --PSSSTKVV------------------DKTRKSGQSHPS-------------------- 802
P+ S K++ D ++ P+
Sbjct: 770 NFPNDSRKIIQTEFPNESKECDNIQFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNI 829
Query: 803 QQLREPRRSGRVV------HQPDRYLG------------LIETQVVIPDDGIEDP----- 862
Q L++ + S + + D +L ET + + GI++P
Sbjct: 830 QFLKDSKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDG 889
Query: 863 ----------------LTYKQ--------------AMKDV-----------DRDQWIKAM 922
++Y + DV D+ W +A+
Sbjct: 890 IEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAI 949
Query: 923 DLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD 982
+ E+ + N+ WT+ +P + + +W++ K + G +KARLVA+G+TQ+ +D
Sbjct: 950 NTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQID 1009
Query: 983 YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQE 1042
YEETF+PVA + S R +LS+ Y+ ++ QMDVKTAFLNG L+E IYM P+G
Sbjct: 1010 YEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGI--SCNS 1069
Query: 1043 QKVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVY---KKVVNSIIAFLVLY 1102
VCKL K+IYGL+ +R W F+ A+K F + + C+Y K +N I +++LY
Sbjct: 1070 DNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENI-YVLLY 1129
Query: 1103 VDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDK 1162
VDD+++ D+ + + K++L +F+M DL + ++ +GI+I + + +SQ++Y+ K
Sbjct: 1130 VDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRI--EMQEDKIYLSQSAYVKK 1189
Query: 1163 MLSRYKMQNSKKGLLP----YRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLC 1222
+LS++ M+N P Y + S E C N P S +G LMY MLC
Sbjct: 1190 ILSKFNMENCNAVSTPLPSKINYELLNSDEDC-----------NTPCRSLIGCLMYIMLC 1249
Query: 1223 TRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMYGTKDLI----LTGYTD 1282
TRPD+ +V ++SRY S + W +K +L+YL+ T D L++ K+L + GY D
Sbjct: 1250 TRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIF-KKNLAFENKIIGYVD 1309
BLAST of CSPI04G12000 vs. ExPASy Swiss-Prot
Match:
Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)
HSP 1 Score: 469.2 bits (1206), Expect = 1.5e-130
Identity = 385/1417 (27.17%), Postives = 615/1417 (43.40%), Query Frame = 0
Query: 32 ILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHV 91
IL ++S + TA +I ++LR ++ P + HV
Sbjct: 88 ILGAISMSVQPAVSRATTAAQIWETLRKIYANPSYG----------------------HV 147
Query: 92 --LDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT----TLL 151
L + F+ + G P++ QV +LE+LP + P + +LT L+
Sbjct: 148 TQLRFITRFDQLALLGKPMDHDEQVERVLENLPDDYKPVIDQIAAKDTPPSLTEIHERLI 207
Query: 152 NELQRFQNLTMGKGKQVEANVA------TTKRKFIRGSSSKTKAGPSKPNA-QIKKKGKG 211
N + L + + ANV T + + RG + ++ N+ Q G
Sbjct: 208 NRESKLLALNSAEVVPITANVVTHRNTNTNRNQNNRGDNRNYNNNNNRSNSWQPSSSGSR 267
Query: 212 KTPKQNKGKKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDL--------LVV 271
+Q K G+C C GH + CP+ + + Q L V
Sbjct: 268 SDNRQPK----PYLGRCQICSVQGHSAKRCPQLHQFQSTTNQQQSTSPFTPWQPRANLAV 327
Query: 272 ETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKL 331
+ Y W+LDSGAT+HI F N S+ + G + + G + + G L
Sbjct: 328 NS---PYNANNWLLDSGATHHITSDF-NNLSFHQPYTGGDDVMIADGSTIPITHTGSASL 387
Query: 332 FFRDRYVILKNVLYVPQMKRNLISI------SCILEQMYRISFEINEAFVFYKGILVCSA 391
R + L VLYVP + +NLIS+ + + + + SF++ + G+ +
Sbjct: 388 PTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKD---LNTGVPLLQG 447
Query: 392 ILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKS 451
+D LY+ + +++ + + SS WH RLGH +L + ++ +
Sbjct: 448 KTKDELYEWP------IASSQAVSMFASPCSKATHSS----WHSRLGHPSLAILNSVISN 507
Query: 452 GLLSPLE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEY 511
L L + L C C K K F+ + + PLE ++SD+ + + Y Y
Sbjct: 508 HSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWS-SPILSIDNYRY 567
Query: 512 FISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRD 571
++ F+D ++RY +Y + KS + F +K+ VEN I L SD GGE++ L RD
Sbjct: 568 YVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFVVL--RD 627
Query: 572 YLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILN 631
YL ++GI S P TP+ NG+SER++R +++M +++S + + ++W YA A Y++N
Sbjct: 628 YLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLIN 687
Query: 632 NVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKE 691
+P+ + ++P++ G+ + +++GC + ++ N KLE +SK C F+GY
Sbjct: 688 RLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLT 747
Query: 692 SRGGLFYDPQENKIFVSTNATFLEE----------------------------------- 751
L +++ S + F E
Sbjct: 748 QSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQRSDSAPNWPSHTTLPTTP 807
Query: 752 ----------DHIRDHQPR-----SKLVLKEIS-----KSAIDKPSSSTKVV---DKTRK 811
H+ D PR S L ++S S+I PSSS + +
Sbjct: 808 LVLPAPPCLGPHL-DTSPRPPSSPSPLCTTQVSSSNLPSSSISSPSSSEPTAPSHNGPQP 867
Query: 812 SGQSHPSQQ-------LREPRRSGRVVHQPDRYLGLIETQVVIP---------------- 871
+ Q H +Q L P + + P++ L ++ + P
Sbjct: 868 TAQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSISEPNSPS 927
Query: 872 ------------------------------------DDGI----------------EDPL 931
DGI +P
Sbjct: 928 SSSTSTPPLPPVLPAPPIIQVNAQAPVNTHSMATRAKDGIRKPNQKYSYATSLAANSEPR 987
Query: 932 TYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDHAGKV 991
T QAMKD D+W +AM E+ + N W LV P V +GC+WI+ +K + G +
Sbjct: 988 TAIQAMKD---DRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSDGSL 1047
Query: 992 QTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGN 1051
+KARLVAKGY QR G+DY ETFSPV SIRI+L +A + I Q+DV AFL G
Sbjct: 1048 NRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGT 1107
Query: 1052 LEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPC 1111
L + +YMSQP GF+++D+ VC+L+K+IYGL+ R+W + T + + GF ++ +
Sbjct: 1108 LTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNSISDTS 1167
Query: 1112 VYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRN 1171
++ I ++++YVDDIL+ GND L L+ +F +K+ D Y LGI+
Sbjct: 1168 LFVLQRGRSIIYMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDLHYFLGIE--AK 1227
Query: 1172 RKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYA 1231
R + L +SQ Y +L+R M +K P L+ K P E Y
Sbjct: 1228 RVPQGLHLSQRRYTLDLLARTNMLTAKPVATPMATSPKLTLHSGTKLPDPTE------YR 1287
Query: 1232 SAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDY-MLMYGTK 1283
VGSL Y + TRPD+ Y+V +S+Y P DHW A+K +L+YL T D+ + +
Sbjct: 1288 GIVGSLQY-LAFTRPDLSYAVNRLSQYMHMPTDDHWNALKRVLRYLAGTPDHGIFLKKGN 1347
BLAST of CSPI04G12000 vs. ExPASy Swiss-Prot
Match:
Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)
HSP 1 Score: 461.5 bits (1186), Expect = 3.2e-128
Identity = 374/1410 (26.52%), Postives = 619/1410 (43.90%), Query Frame = 0
Query: 32 ILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGT-SVREH 91
+L ++S + TA +I ++LR ++ P + H K+ +GT ++ ++
Sbjct: 88 VLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYG--HVTQLRTQLKQWTKGTKTIDDY 147
Query: 92 VLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT----TLLN 151
+ ++ F+ + G P++ QV +LE+LP+ + P + LT LLN
Sbjct: 148 MQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIAAKDTPPTLTEIHERLLN 207
Query: 152 ELQRFQNLTMGKGKQVEANV-----ATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTP 211
+ ++ + AN TT G+ + + N +
Sbjct: 208 HESKILAVSSATVIPITANAVSHRNTTTTNNNNNGNRNNRYDNRNNNNNSKPWQQSSTNF 267
Query: 212 KQNKGKKGAEKGKCYHCGQNGHWLRNCPK---YLAEKKAEKET------QGKYDLLVVET 271
N + GKC CG GH + C + +L+ +++ Q + +L +
Sbjct: 268 HPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPPSPFTPWQPRANLALGS- 327
Query: 272 CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFF 331
Y + W+LDSGAT+HI F S + + G+ + V G + S G L
Sbjct: 328 ---PYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGD-DVMVADGSTIPISHTGSTSLST 387
Query: 332 RDRYVILKNVLYVPQMKRNLISI------SCILEQMYRISFEINEAFVFYKGILVCSAIL 391
+ R + L N+LYVP + +NLIS+ + + + + SF++ + G+ +
Sbjct: 388 KSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKD---LNTGVPLLQGKT 447
Query: 392 EDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGL 451
+D LY+ + +++ + + + SS WH RLGH + + ++ +
Sbjct: 448 KDELYEWP------IASSQPVSLFASPSSKATHSS----WHARLGHPAPSILNSVISNYS 507
Query: 452 LSPLE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFI 511
LS L + C CL K K F+ + + PLE ++SD+ + + Y Y++
Sbjct: 508 LSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWS-SPILSHDNYRYYV 567
Query: 512 SFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL 571
F+D ++RY +Y + KS E F +K +EN I SD GGE++ L +Y
Sbjct: 568 IFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEFVAL--WEYF 627
Query: 572 IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNV 631
++GI S P TP+ NG+SER++R +++ +++S + + ++W YA A Y++N +
Sbjct: 628 SQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAVYLINRL 687
Query: 632 PSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESR 691
P+ + E+P++ G + R++GC + ++ N KL+ +S+ C F+GY
Sbjct: 688 PTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQS 747
Query: 692 GGLFYDPQENKIFVSTNATFLEE-----------DHIRDHQPRSKLVL------------ 751
L Q +++++S + F E +++ + S V
Sbjct: 748 AYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQEQRRESSCVWSPHTTLPTRTPV 807
Query: 752 -------------------------KEISKSAIDKPSSST----------------KVVD 811
++S S +D SS+
Sbjct: 808 LPAPSCSDPHHAATPPSSPSAPFRNSQVSSSNLDSSFSSSFPSSPEPTAPRQNGPQPTTQ 867
Query: 812 KTRKSGQSHPS-----------------QQLREPRRSGR--------------------- 871
T+ Q+H S Q L P +S
Sbjct: 868 PTQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPAQSSSSSPSPTTSASSSSTSPTPPSI 927
Query: 872 VVHQPDRYLGLIETQVVIP----------DDGI----------------EDPLTYKQAMK 931
++H P ++ P GI +P T QA+K
Sbjct: 928 LIHPPPPLAQIVNNNNQAPLNTHSMGTRAKAGIIKPNPKYSLAVSLAAESEPRTAIQALK 987
Query: 932 DVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARL 991
D ++W AM E+ + N W LV P+ V +GC+WI+ +K + G + +KARL
Sbjct: 988 D---ERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARL 1047
Query: 992 VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYM 1051
VAKGY QR G+DY ETFSPV SIRI+L +A + I Q+DV AFL G L + +YM
Sbjct: 1048 VAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDDVYM 1107
Query: 1052 SQPEGFIEQDQEQKVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVN 1111
SQP GFI++D+ VCKL+K++YGL+ R+W + + + GF +V + ++
Sbjct: 1108 SQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQRG 1167
Query: 1112 SIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLA 1171
I ++++YVDDIL+ GND L + L+ +F +KD + Y LGI+ R L
Sbjct: 1168 KSIVYMLVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEELHYFLGIE--AKRVPTGLH 1227
Query: 1172 MSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLM 1231
+SQ YI +L+R M +K P LS K E Y VGSL
Sbjct: 1228 LSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLYSGTKLTDPTE------YRGIVGSLQ 1287
Query: 1232 YAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDY-MLMYGTKDLILTGY 1283
Y + TRPDI Y+V +S++ P +H A+K IL+YL T ++ + + L L Y
Sbjct: 1288 Y-LAFTRPDISYAVNRLSQFMHMPTEEHLQALKRILRYLAGTPNHGIFLKKGNTLSLHAY 1347
BLAST of CSPI04G12000 vs. ExPASy Swiss-Prot
Match:
Q99337 (Transposon Ty1-NL2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY1B-NL2 PE=3 SV=1)
HSP 1 Score: 151.8 bits (382), Expect = 5.4e-35
Identity = 285/1326 (21.49%), Postives = 498/1326 (37.56%), Query Frame = 0
Query: 263 ILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNV 322
+LDSGA+ + S S S +I + + +A+GDL+ F+D V
Sbjct: 459 LLDSGASRTLIRSAHHIHS--ASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIKV 518
Query: 323 LYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVL 382
L+ P + +L+S++ + F N G ++ + + Y + ++L
Sbjct: 519 LHTPNIAYDLLSLNELAAVDITACFTKN-VLERSDGTVLAPIVKYGDFYWV---SKKYLL 578
Query: 383 NTEIFRTAETQNKRQKVSSNAYLW---HLRLGHINLNRIGRLVKSGLLSPLEDNSLP--- 442
+ I N S+ Y + H L H N I +K+ ++ ++ +
Sbjct: 579 PSNI-SVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSS 638
Query: 443 ----PCESCLEGKMTK-RSFTGKGLRAKG---PLELVHSDLCGPMNVKARGGYEYFISFI 502
C CL GK TK R G L+ + P + +H+D+ GP++ + YFISF
Sbjct: 639 AIDYQCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFT 698
Query: 503 DDYSRYGHIYLIH--HKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLI 562
D+ +++ +Y +H + + L+ F A ++N+ ++ +++ DRG EY + +L
Sbjct: 699 DETTKFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLE 758
Query: 563 ENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVP 622
+NGI + + + +GV+ER NRTLLD R+ + S + + W A+E + + N++
Sbjct: 759 KNGITPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLA 818
Query: 623 SKSVSETPYELWKGRKGSLRHFRI----------WGCPAHVLVQNPKKLEH--------- 682
S K +K + +H + +G P V NP H
Sbjct: 819 SP----------KSKKSARQHAGLAGLDISTLLPFGQPVIVNDHNPNSKIHPRGIPGYAL 878
Query: 683 ---RSKLCFFIGYP----------------KESR------GGLFYDPQENKI------FV 742
R+ + I P KESR L +D N++ F+
Sbjct: 879 HPSRNSYGYIIYLPSLKKTVDTTNYVILQGKESRLDQFNYDALTFDEDLNRLTASYQSFI 938
Query: 743 STN-------ATFLEEDHIRDHQPR---------------------SKLVLK-------- 802
++N F + + QPR SK V K
Sbjct: 939 ASNEIQQSDDLNFQSDIELHPEQPRNVLSKAVSPTDSTPPSTHTEDSKRVSKTNIRAPRE 998
Query: 803 ---EISKSAI--DKPSSSTKVVDKTRKSG----------------------QSHPSQ--- 862
ISKS I K SST + +G SH S+
Sbjct: 999 VDPNISKSDILPSKKRSSTPQISDIESTGSGGMHRLDVPLLAPMSQSNTHESSHTSKSKD 1058
Query: 863 -------------------------------QLREPRRSGRVVHQ--------------- 922
Q E R++H+
Sbjct: 1059 FRHSDSYSDNETNHTNVPISSTGGTNNKTVPQTSEQETEKRIIHRSPSIDTSSSESNSLH 1118
Query: 923 --------------------------PD-------------------------------- 982
PD
Sbjct: 1119 HVVPIKTSDTCPKENTEESIIADLPLPDLPPEPPTKLSDSFKELPPINSRQTNSSLGGIG 1178
Query: 983 ------------RYLGLIETQVVIPDD--------GIEDP-------------------- 1042
R L ET++ + D +E P
Sbjct: 1179 DSNAYTTINSKKRSLEDNETEIKVSRDTWNTKNMRSLEPPRSKKRIHLIAAVKAVKSIKP 1238
Query: 1043 ----LTYKQAM---KDV-DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCK----- 1102
L Y +A+ KD+ +++++I+A E+ + W D+ D K I K
Sbjct: 1239 IRTTLRYDEAITYNKDIKEKEKYIEAYHKEVNQLLKMKTWD-TDKYYDRKEIDPKRVINS 1298
Query: 1103 -WIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYE 1162
+I+ RKRD T KAR VA+G Q S ++ LS+A +Y
Sbjct: 1299 MFIFNRKRD-----GTHKARFVARGDIQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYY 1358
Query: 1163 IWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQACSRSWNIRFDTA 1222
I Q+D+ +A+L +++E +Y+ P D K+ +LKKS+YGL+ +W ++T
Sbjct: 1359 ITQLDISSAYLYADIKEELYIRPPPHLGMND---KLIRLKKSLYGLKQSGANW---YET- 1418
Query: 1223 IKSYGFEQ-NVDE----PCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQF 1275
IKSY EQ +++E CV+K NS + + L+VDD++L D+ I L Q+
Sbjct: 1419 IKSYLIEQCDMEEVRGWSCVFK---NSQVT-ICLFVDDMILFSKDLNANKKIITTLKKQY 1478
BLAST of CSPI04G12000 vs. ExPASy TrEMBL
Match:
E2GK51 (Gag/pol protein (Fragment) OS=Bryonia dioica OX=3652 PE=4 SV=1)
HSP 1 Score: 2192.5 bits (5680), Expect = 0.0e+00
Identity = 1075/1232 (87.26%), Postives = 1154/1232 (93.67%), Query Frame = 0
Query: 22 IKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRM 81
+KAN+KARVYILASM+DVLAKKH+S+ATAK IMDSLR MFGQP WSLRHEA+K+IYTKRM
Sbjct: 63 VKANDKARVYILASMTDVLAKKHDSIATAKGIMDSLREMFGQPSWSLRHEAIKHIYTKRM 122
Query: 82 KEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNL 141
KEGTSVREHVLDMMMHFNIAEVNGGPI+E NQVSFIL+SLPKSF+PFQTNASLNKIEFNL
Sbjct: 123 KEGTSVREHVLDMMMHFNIAEVNGGPIDEANQVSFILQSLPKSFVPFQTNASLNKIEFNL 182
Query: 142 TTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTP 201
TTLLNELQRFQNLT+ KGK+VEANVA TKRKFIRGSSSK K GPSK AQ+KKKGKGK P
Sbjct: 183 TTLLNELQRFQNLTLSKGKEVEANVAVTKRKFIRGSSSKNKVGPSK--AQMKKKGKGKAP 242
Query: 202 KQNKGKKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCT 261
+K KK A+KGKC+HC Q+GHW RNCPKYLAEKKAEK TQGKYDLLVVETCLVE + T
Sbjct: 243 NTSKVKKNADKGKCFHCNQDGHWKRNCPKYLAEKKAEKATQGKYDLLVVETCLVECDAST 302
Query: 262 WILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKN 321
WILDSGATNHICFSFQE SSWKKL EGEITLKVGTGE+VSA AVGDL LFF+DRY+ILK+
Sbjct: 303 WILDSGATNHICFSFQETSSWKKLKEGEITLKVGTGEVVSAEAVGDLTLFFQDRYLILKD 362
Query: 322 VLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFV 381
VLYVP MKRNLISI+CILE +Y ISFE+NE F+ KGI +CSAI E+NLYKLRPTRAN V
Sbjct: 363 VLYVPLMKRNLISIACILEHIYTISFEVNEVFILCKGIQICSAIRENNLYKLRPTRANVV 422
Query: 382 LNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESC 441
LNTE+FRT ETQNK+QKVSSNAYLWHLRLGHINLNRI RLVKSG+L+ LEDNSLPPCESC
Sbjct: 423 LNTEMFRTLETQNKKQKVSSNAYLWHLRLGHINLNRIERLVKSGILNQLEDNSLPPCESC 482
Query: 442 LEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIH 501
LEGKMTKRSFTGKGLRAK PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+H
Sbjct: 483 LEGKMTKRSFTGKGLRAKVPLELVHSDLCGPMNVKARGGYEYFISFIDDFSRYGHVYLLH 542
Query: 502 HKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQ 561
HKS S EKFKEYKAEVENE+GKTIK LRSDRGGEYMD +F+DYLIE GIQSQLSAPSTPQ
Sbjct: 543 HKSESFEKFKEYKAEVENEIGKTIKTLRSDRGGEYMDSKFQDYLIEFGIQSQLSAPSTPQ 602
Query: 562 QNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGR 621
QNGVSERRNRTLLDMVRSMMS++Q+ DSFWGYALETA +ILNNVPSKSV ETPYELWKGR
Sbjct: 603 QNGVSERRNRTLLDMVRSMMSYAQLPDSFWGYALETAIHILNNVPSKSVLETPYELWKGR 662
Query: 622 KGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNAT 681
K SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+GYPKESRGGLFY PQENK+FVSTNAT
Sbjct: 663 KSSLRYFRIWGCPAHVLVQNPKKLEPRSKLCLFVGYPKESRGGLFYHPQENKVFVSTNAT 722
Query: 682 FLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV 741
FLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKVVDK S QSH SQ+LR PRRSGRV
Sbjct: 723 FLEEDHXRNHQPRSKIVLKEMFKNATDKPSSSTKVVDKANISDQSHTSQELRVPRRSGRV 782
Query: 742 VHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV 801
VHQP+RYLGL+ETQ++IPDDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLV
Sbjct: 783 VHQPNRYLGLVETQIIIPDDGVEDPLTYKQAMNDVDRDQWIKAMNLEMESMYFNSVWTLV 842
Query: 802 DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRI 861
D P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSIRI
Sbjct: 843 DLPSDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRI 902
Query: 862 LLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQAC 921
LLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYGL+
Sbjct: 903 LLSIATFYNYEIWQMDVKTAFLNGNLEESIYMVQPEGFIAQDQEQKVCKLQKSIYGLKQA 962
Query: 922 SRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKK 981
SRSWNIRFDTAIKSYGFEQNVDEPCVYKK+VNS++AFL+LYVDDILLIGNDV YLTD+KK
Sbjct: 963 SRSWNIRFDTAIKSYGFEQNVDEPCVYKKIVNSVVAFLILYVDDILLIGNDVEYLTDVKK 1022
Query: 982 WLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRY 1041
WL QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQASYIDK+LSRYKMQNSKKG LP+R+
Sbjct: 1023 WLNTQFQMKDLGEAQYILGIQIVRNRKNKTLAMSQASYIDKVLSRYKMQNSKKGQLPFRH 1082
Query: 1042 GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDH 1101
GIHLSKEQCPKTPQEVEDMRNIPY+SAVGSLMYAMLCTRPDICYSVG+VSRYQSNPGRDH
Sbjct: 1083 GIHLSKEQCPKTPQEVEDMRNIPYSSAVGSLMYAMLCTRPDICYSVGIVSRYQSNPGRDH 1142
Query: 1102 WTAVKNILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVW 1161
WTAVKNILKYLR T++YML+YG KDLILTGYTDSDFQ+DKDARKSTSGSVFTLNG AVVW
Sbjct: 1143 WTAVKNILKYLRRTRNYMLVYGAKDLILTGYTDSDFQSDKDARKSTSGSVFTLNGGAVVW 1202
Query: 1162 RSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANS 1221
RS+KQ CI DSTMEAEYVA CEAAKE VW RKFLTDLEVVPNMHLPITLYCDNSGAVANS
Sbjct: 1203 RSVKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANS 1262
Query: 1222 REPRSHKRGKHIERKYHLIREIVHRGDVVVTQ 1254
+EPRSHKRGKHIERKYHLIREIVHRGDVVVTQ
Sbjct: 1263 KEPRSHKRGKHIERKYHLIREIVHRGDVVVTQ 1292
BLAST of CSPI04G12000 vs. ExPASy TrEMBL
Match:
A0A5A7SMH8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G002560 PE=4 SV=1)
HSP 1 Score: 1879.0 bits (4866), Expect = 0.0e+00
Identity = 933/1277 (73.06%), Postives = 1083/1277 (84.81%), Query Frame = 0
Query: 23 KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMK 82
KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ + ++H+A+KYIY RM
Sbjct: 64 KANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNARMN 123
Query: 83 EGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT 142
EG SVREHVL+MM+HFN+AE+NG I+E +QVSFILESLP+SF+ F++NA +NKI + LT
Sbjct: 124 EGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLT 183
Query: 143 TLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTP 202
TLLNELQ F++L KG++ EANVAT+ RKF RGS+S TK+ PS N + KKK G+
Sbjct: 184 TLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGGQGN 243
Query: 203 KQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLV 262
K N K A KG C+HC Q GHW RNCPKYLAEKK K QGKYDLLV+ETCLV
Sbjct: 244 KANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKK--KAKQGKYDLLVLETCLV 303
Query: 263 EYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR 322
E ++ WI+DSGATNH+C SFQ SSW++L GE+T++VGTG +VSA AVG L+L +
Sbjct: 304 ENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKS 363
Query: 323 YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRP 382
+++L+NV VP +KRNLIS+ C+LEQ Y ++F +N+ F++ G+ +CSA LE+NLY LR
Sbjct: 364 FLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRS 423
Query: 383 TRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDN 442
+ +LNTE+F+TA TQNKR K+S NA+LWHLRLGHINLNRI RLVK+GLLS LE+N
Sbjct: 424 LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEEN 483
Query: 443 SLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSR 502
SLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSR
Sbjct: 484 SLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSR 543
Query: 503 YGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQ 562
YG++YL+ HKS +LEKFKEYKAEVEN L KTIK RSDRGGEYMDL+F++YL+E GI SQ
Sbjct: 544 YGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQ 603
Query: 563 LSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET 622
LSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSET
Sbjct: 604 LSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSET 663
Query: 623 PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENK 682
P +LW GRKGSLRHFRIWGCPAHVL NPKKLE RSKLC F+GYPK +RGG FYDP++NK
Sbjct: 664 PLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNK 723
Query: 683 IFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQS 742
+FVSTNATFLEEDHIR+H+PRSK+VL E+SK +++PS+ T+VV S ++
Sbjct: 724 VFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVV-HVGSSTRT 783
Query: 743 HPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMD 802
H Q LREPRRSGRV + P RY+ L ET VI D IEDPLT+K+AM+DVD+D+WIKAM+
Sbjct: 784 HQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMN 843
Query: 803 LEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDY 862
LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR GKVQTFKARLVAKGYTQ EGVDY
Sbjct: 844 LELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY 903
Query: 863 EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ 922
EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI QEQ
Sbjct: 904 EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQ 963
Query: 923 KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDI 982
K+CKL +SIYGL+ SRSWNIRFDTAIKSYGF+Q VDEPCVYK+++N +AFLVLYVDDI
Sbjct: 964 KICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDI 1023
Query: 983 LLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSR 1042
LLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDK++ +
Sbjct: 1024 LLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIVVK 1083
Query: 1043 YKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYS 1102
Y MQNSK+GLLP+R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+
Sbjct: 1084 YSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYA 1143
Query: 1103 VGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKS 1162
VG+VSRYQSNPG HWTAVK ILKYLR T+DY L+YG+KDLILTGYTDSDFQTD+D+RKS
Sbjct: 1144 VGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYTLVYGSKDLILTGYTDSDFQTDRDSRKS 1203
Query: 1163 TSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL 1222
TSGSVFTLNG AVVWRSIKQ CI DSTMEAEYVA CEAAKE VW R FL DLEVVPNM
Sbjct: 1204 TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSK 1263
Query: 1223 PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTK 1282
PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDV+VTQI+S N+ADPFTK
Sbjct: 1264 PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTQIASTHNVADPFTK 1323
Query: 1283 ALTAKVFEGHLHNLGLR 1284
LTAKVFEGHL +LGLR
Sbjct: 1324 PLTAKVFEGHLESLGLR 1337
BLAST of CSPI04G12000 vs. ExPASy TrEMBL
Match:
A0A5D3CPJ6 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G00040 PE=4 SV=1)
HSP 1 Score: 1879.0 bits (4866), Expect = 0.0e+00
Identity = 933/1277 (73.06%), Postives = 1083/1277 (84.81%), Query Frame = 0
Query: 23 KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMK 82
KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ + ++H+A+KYIY RM
Sbjct: 64 KANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNARMN 123
Query: 83 EGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT 142
EG SVREHVL+MM+HFN+AE+NG I+E +QVSFILESLP+SF+ F++NA +NKI + LT
Sbjct: 124 EGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLT 183
Query: 143 TLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTP 202
TLLNELQ F++L KG++ EANVAT+ RKF RGS+S TK+ PS N + KKK G+
Sbjct: 184 TLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGGQGN 243
Query: 203 KQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLV 262
K N K A KG C+HC Q GHW RNCPKYLAEKK K QGKYDLLV+ETCLV
Sbjct: 244 KANLAAAKTTKKTKAAKGICFHCNQEGHWKRNCPKYLAEKK--KAKQGKYDLLVLETCLV 303
Query: 263 EYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR 322
E ++ WI+DSGATNH+C SFQ SSW++L GE+T++VGTG +VSA AVG L+L +
Sbjct: 304 ENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKS 363
Query: 323 YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRP 382
+++L+NV VP +KRNLIS+ C+LEQ Y ++F +N+ F++ G+ +CSA LE+NLY LR
Sbjct: 364 FLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRS 423
Query: 383 TRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDN 442
+ +LNTE+F+TA TQNKR K+S NA+LWHLRLGHINLNRI RLVK+GLLS LE+N
Sbjct: 424 LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEEN 483
Query: 443 SLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSR 502
SLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSR
Sbjct: 484 SLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSR 543
Query: 503 YGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQ 562
YG++YL+ HKS +LEKFKEYKAEVEN L KTIK RSDRGGEYMDL+F++YL+E GI SQ
Sbjct: 544 YGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQ 603
Query: 563 LSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET 622
LSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSET
Sbjct: 604 LSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSET 663
Query: 623 PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENK 682
P +LW GRKGSLRHFRIWGCPAHVL NPKKLE RSKLC F+GYPK +RGG FYDP++NK
Sbjct: 664 PLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNK 723
Query: 683 IFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQS 742
+FVSTNATFLEEDHIR+H+PRSK+VL E+SK +++PS+ T+VV S ++
Sbjct: 724 VFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVV-HVGSSTRT 783
Query: 743 HPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMD 802
H Q LREPRRSGRV + P RY+ L ET VI D IEDPLT+K+AM+DVD+D+WIKAM+
Sbjct: 784 HQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMN 843
Query: 803 LEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDY 862
LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR GKVQTFKARLVAKGYTQ EGVDY
Sbjct: 844 LELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY 903
Query: 863 EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ 922
EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI QEQ
Sbjct: 904 EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQ 963
Query: 923 KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDI 982
K+CKL +SIYGL+ SRSWNIRFDTAIKSYGF+Q VDEPCVYK+++N +AFLVLYVDDI
Sbjct: 964 KICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDI 1023
Query: 983 LLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSR 1042
LLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDK++ +
Sbjct: 1024 LLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIVVK 1083
Query: 1043 YKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYS 1102
Y MQNSK+GLLP+R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+
Sbjct: 1084 YSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYA 1143
Query: 1103 VGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKS 1162
VG+VSRYQSNPG HWTAVK ILKYLR T+DY L+YG+KDLILTGYTDSDFQTD+D+RKS
Sbjct: 1144 VGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYTLVYGSKDLILTGYTDSDFQTDRDSRKS 1203
Query: 1163 TSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL 1222
TSGSVFTLNG AVVWRSIKQ CI DSTMEAEYVA CEAAKE VW R FL DLEVVPNM
Sbjct: 1204 TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSK 1263
Query: 1223 PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTK 1282
PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDV+VTQI+S N+ADPFTK
Sbjct: 1264 PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTQIASTHNVADPFTK 1323
Query: 1283 ALTAKVFEGHLHNLGLR 1284
LTAKVFEGHL +LGLR
Sbjct: 1324 PLTAKVFEGHLESLGLR 1337
BLAST of CSPI04G12000 vs. ExPASy TrEMBL
Match:
A0A5A7TZD7 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold264G001300 PE=4 SV=1)
HSP 1 Score: 1879.0 bits (4866), Expect = 0.0e+00
Identity = 933/1277 (73.06%), Postives = 1083/1277 (84.81%), Query Frame = 0
Query: 23 KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMK 82
KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ + ++H+A+KYIY RM
Sbjct: 63 KANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNARMN 122
Query: 83 EGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT 142
EG SVREHVL+MM+HFN+AE+NG I+E +QVSFILESLP+SF+ F++NA +NKI + LT
Sbjct: 123 EGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLT 182
Query: 143 TLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTP 202
TLLNELQ F++L KG++ EANVAT+ RKF RGS+S TK+ PS N + KKK G+
Sbjct: 183 TLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGGQGN 242
Query: 203 KQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLV 262
K N K A KG C+HC Q GHW RNCPKYLAEKK K QGKYDLLV+ETCLV
Sbjct: 243 KANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKK--KAKQGKYDLLVLETCLV 302
Query: 263 EYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR 322
E ++ WI+DSGATNH+C SFQ SSW++L GE+T++VGTG +VSA AVG L+L +
Sbjct: 303 ENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKS 362
Query: 323 YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRP 382
+++L+NV VP +KRNLIS+ C+LEQ Y ++F +N+ F++ G+ +CSA LE+NLY LR
Sbjct: 363 FLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRS 422
Query: 383 TRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDN 442
+ +LNTE+F+TA TQNKR K+S NA+LWHLRLGHINLNRI RLVK+GLLS LE+N
Sbjct: 423 LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEEN 482
Query: 443 SLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSR 502
SLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSR
Sbjct: 483 SLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSR 542
Query: 503 YGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQ 562
YG++YL+ HKS +LEKFKEYKAEVEN L KTIK RSDRGGEYMDL+F++YL+E GI SQ
Sbjct: 543 YGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQ 602
Query: 563 LSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET 622
LSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSET
Sbjct: 603 LSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSET 662
Query: 623 PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENK 682
P +LW GRKGSLRHFRIWGCPAHVL NPKKLE RSKLC F+GYPK +RGG FYDP++NK
Sbjct: 663 PLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNK 722
Query: 683 IFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQS 742
+FVSTNATFLEEDHIR+H+PRSK+VL E+SK +++PS+ T+VV S ++
Sbjct: 723 VFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVV-HVGSSTRT 782
Query: 743 HPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMD 802
H Q LREPRRSGRV + P RY+ L ET VI D IEDPLT+K+AM+DVD+D+WIKAM+
Sbjct: 783 HQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMN 842
Query: 803 LEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDY 862
LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR GKVQTFKARLVAKGYTQ EGVDY
Sbjct: 843 LELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY 902
Query: 863 EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ 922
EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI QEQ
Sbjct: 903 EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQ 962
Query: 923 KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDI 982
K+CKL +SIYGL+ SRSWNIRFDTAIKSYGF+Q VDEPCVYK+++N +AFLVLYVDDI
Sbjct: 963 KICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDI 1022
Query: 983 LLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSR 1042
LLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDK++ +
Sbjct: 1023 LLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIVVK 1082
Query: 1043 YKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYS 1102
Y MQNSK+GLLP+R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+
Sbjct: 1083 YSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYA 1142
Query: 1103 VGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKS 1162
VG+VSRYQSNPG HWTAVK ILKYLR T+DY L+YG+KDLILTGYTDSDFQTD+D+RKS
Sbjct: 1143 VGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYTLVYGSKDLILTGYTDSDFQTDRDSRKS 1202
Query: 1163 TSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL 1222
TSGSVFTLNG AVVWRSIKQ CI DSTMEAEYVA CEAAKE VW R FL DLEVVPNM
Sbjct: 1203 TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSK 1262
Query: 1223 PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTK 1282
PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDV+VTQI+S N+ADPFTK
Sbjct: 1263 PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTQIASTHNVADPFTK 1322
Query: 1283 ALTAKVFEGHLHNLGLR 1284
LTAKVFEGHL +LGLR
Sbjct: 1323 PLTAKVFEGHLESLGLR 1336
BLAST of CSPI04G12000 vs. ExPASy TrEMBL
Match:
A0A5A7TWB9 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold133G00310 PE=4 SV=1)
HSP 1 Score: 1857.4 bits (4810), Expect = 0.0e+00
Identity = 923/1265 (72.96%), Postives = 1072/1265 (84.74%), Query Frame = 0
Query: 23 KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMK 82
KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ + ++H+A+KYIY RM
Sbjct: 64 KANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNARMN 123
Query: 83 EGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT 142
EG SVREHVL+MM+HFN+AE+NG I+E +QVSFILESLP+SF+ F++NA +NKI + LT
Sbjct: 124 EGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLT 183
Query: 143 TLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTP 202
TLLNELQ F++L KG++ EANVAT+ RKF RGS+S TK+ PS N + KKK G+
Sbjct: 184 TLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGGQGN 243
Query: 203 KQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLV 262
K N K A KG C+HC Q GHW RNCPKYLAEKK K QGKYDLLV+ETCLV
Sbjct: 244 KANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKK--KAKQGKYDLLVLETCLV 303
Query: 263 EYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR 322
E ++ WI+DSGATNH+C SFQ SSW++L GE+T++VGTG +VSA AVG L+L +
Sbjct: 304 ENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKS 363
Query: 323 YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRP 382
+++L+NV VP +KRNLIS+ C+LEQ Y ++F +N+ F++ G+ +CSA LE+NLY LR
Sbjct: 364 FLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRS 423
Query: 383 TRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDN 442
+ +LNTE+F+TA TQNKR K+S NA+LWHLRLGHINLNRI RLVK+GLLS LE+N
Sbjct: 424 LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEEN 483
Query: 443 SLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSR 502
SLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSR
Sbjct: 484 SLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSR 543
Query: 503 YGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQ 562
YG++YL+ HKS +LEKFKEYKAEVEN L KTIK RSDRGGEYMDL+F++YL+E GI SQ
Sbjct: 544 YGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQ 603
Query: 563 LSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET 622
LSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSET
Sbjct: 604 LSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSET 663
Query: 623 PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENK 682
P +LW GRKGSLRHFRIWGCPAHVL NPKKLE RSKLC F+GYPK +RGG FYDP++NK
Sbjct: 664 PLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNK 723
Query: 683 IFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQS 742
+FVSTNATFLEEDHIR+H+PRSK+VL E+SK +++PS+ T+VV S ++
Sbjct: 724 VFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVV-HVGSSTRT 783
Query: 743 HPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMD 802
H Q LREPRRSGRV + P RY+ L ET VI D IEDPLT+K+AM+DVD+D+WIKAM+
Sbjct: 784 HQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMN 843
Query: 803 LEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDY 862
LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR GKVQTFKARLVAKGYTQ EGVDY
Sbjct: 844 LELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY 903
Query: 863 EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ 922
EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI QEQ
Sbjct: 904 EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQ 963
Query: 923 KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDI 982
K+CKL +SIYGL+ SRSWNIRFDTAIKSYGF+Q VDEPCVYK+++N +AFLVLYVDDI
Sbjct: 964 KICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDI 1023
Query: 983 LLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSR 1042
LLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDK++ +
Sbjct: 1024 LLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIVVK 1083
Query: 1043 YKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYS 1102
Y MQNSK+GLLP+R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+
Sbjct: 1084 YSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYA 1143
Query: 1103 VGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKS 1162
VG+VSRYQSNPG HWTAVK ILKYLR T+DYML+YG+KDLILTGYTDSDFQTD+D+RKS
Sbjct: 1144 VGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS 1203
Query: 1163 TSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL 1222
TSGSVF LNG AVVWRSIKQ CI DSTMEAEYVA CEAAKE VW R FL DLEVVPNM
Sbjct: 1204 TSGSVFILNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSK 1263
Query: 1223 PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTK 1272
PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDV+VTQI+S N+ADPFTK
Sbjct: 1264 PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTQIASTHNVADPFTK 1323
BLAST of CSPI04G12000 vs. NCBI nr
Match:
ADJ18449.1 (gag/pol protein, partial [Bryonia dioica])
HSP 1 Score: 2192.5 bits (5680), Expect = 0.0e+00
Identity = 1075/1232 (87.26%), Postives = 1154/1232 (93.67%), Query Frame = 0
Query: 22 IKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRM 81
+KAN+KARVYILASM+DVLAKKH+S+ATAK IMDSLR MFGQP WSLRHEA+K+IYTKRM
Sbjct: 63 VKANDKARVYILASMTDVLAKKHDSIATAKGIMDSLREMFGQPSWSLRHEAIKHIYTKRM 122
Query: 82 KEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNL 141
KEGTSVREHVLDMMMHFNIAEVNGGPI+E NQVSFIL+SLPKSF+PFQTNASLNKIEFNL
Sbjct: 123 KEGTSVREHVLDMMMHFNIAEVNGGPIDEANQVSFILQSLPKSFVPFQTNASLNKIEFNL 182
Query: 142 TTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTP 201
TTLLNELQRFQNLT+ KGK+VEANVA TKRKFIRGSSSK K GPSK AQ+KKKGKGK P
Sbjct: 183 TTLLNELQRFQNLTLSKGKEVEANVAVTKRKFIRGSSSKNKVGPSK--AQMKKKGKGKAP 242
Query: 202 KQNKGKKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCT 261
+K KK A+KGKC+HC Q+GHW RNCPKYLAEKKAEK TQGKYDLLVVETCLVE + T
Sbjct: 243 NTSKVKKNADKGKCFHCNQDGHWKRNCPKYLAEKKAEKATQGKYDLLVVETCLVECDAST 302
Query: 262 WILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKN 321
WILDSGATNHICFSFQE SSWKKL EGEITLKVGTGE+VSA AVGDL LFF+DRY+ILK+
Sbjct: 303 WILDSGATNHICFSFQETSSWKKLKEGEITLKVGTGEVVSAEAVGDLTLFFQDRYLILKD 362
Query: 322 VLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFV 381
VLYVP MKRNLISI+CILE +Y ISFE+NE F+ KGI +CSAI E+NLYKLRPTRAN V
Sbjct: 363 VLYVPLMKRNLISIACILEHIYTISFEVNEVFILCKGIQICSAIRENNLYKLRPTRANVV 422
Query: 382 LNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESC 441
LNTE+FRT ETQNK+QKVSSNAYLWHLRLGHINLNRI RLVKSG+L+ LEDNSLPPCESC
Sbjct: 423 LNTEMFRTLETQNKKQKVSSNAYLWHLRLGHINLNRIERLVKSGILNQLEDNSLPPCESC 482
Query: 442 LEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIH 501
LEGKMTKRSFTGKGLRAK PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+H
Sbjct: 483 LEGKMTKRSFTGKGLRAKVPLELVHSDLCGPMNVKARGGYEYFISFIDDFSRYGHVYLLH 542
Query: 502 HKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQ 561
HKS S EKFKEYKAEVENE+GKTIK LRSDRGGEYMD +F+DYLIE GIQSQLSAPSTPQ
Sbjct: 543 HKSESFEKFKEYKAEVENEIGKTIKTLRSDRGGEYMDSKFQDYLIEFGIQSQLSAPSTPQ 602
Query: 562 QNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGR 621
QNGVSERRNRTLLDMVRSMMS++Q+ DSFWGYALETA +ILNNVPSKSV ETPYELWKGR
Sbjct: 603 QNGVSERRNRTLLDMVRSMMSYAQLPDSFWGYALETAIHILNNVPSKSVLETPYELWKGR 662
Query: 622 KGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNAT 681
K SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+GYPKESRGGLFY PQENK+FVSTNAT
Sbjct: 663 KSSLRYFRIWGCPAHVLVQNPKKLEPRSKLCLFVGYPKESRGGLFYHPQENKVFVSTNAT 722
Query: 682 FLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV 741
FLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKVVDK S QSH SQ+LR PRRSGRV
Sbjct: 723 FLEEDHXRNHQPRSKIVLKEMFKNATDKPSSSTKVVDKANISDQSHTSQELRVPRRSGRV 782
Query: 742 VHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV 801
VHQP+RYLGL+ETQ++IPDDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLV
Sbjct: 783 VHQPNRYLGLVETQIIIPDDGVEDPLTYKQAMNDVDRDQWIKAMNLEMESMYFNSVWTLV 842
Query: 802 DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRI 861
D P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSIRI
Sbjct: 843 DLPSDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRI 902
Query: 862 LLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQAC 921
LLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYGL+
Sbjct: 903 LLSIATFYNYEIWQMDVKTAFLNGNLEESIYMVQPEGFIAQDQEQKVCKLQKSIYGLKQA 962
Query: 922 SRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKK 981
SRSWNIRFDTAIKSYGFEQNVDEPCVYKK+VNS++AFL+LYVDDILLIGNDV YLTD+KK
Sbjct: 963 SRSWNIRFDTAIKSYGFEQNVDEPCVYKKIVNSVVAFLILYVDDILLIGNDVEYLTDVKK 1022
Query: 982 WLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRY 1041
WL QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQASYIDK+LSRYKMQNSKKG LP+R+
Sbjct: 1023 WLNTQFQMKDLGEAQYILGIQIVRNRKNKTLAMSQASYIDKVLSRYKMQNSKKGQLPFRH 1082
Query: 1042 GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDH 1101
GIHLSKEQCPKTPQEVEDMRNIPY+SAVGSLMYAMLCTRPDICYSVG+VSRYQSNPGRDH
Sbjct: 1083 GIHLSKEQCPKTPQEVEDMRNIPYSSAVGSLMYAMLCTRPDICYSVGIVSRYQSNPGRDH 1142
Query: 1102 WTAVKNILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVW 1161
WTAVKNILKYLR T++YML+YG KDLILTGYTDSDFQ+DKDARKSTSGSVFTLNG AVVW
Sbjct: 1143 WTAVKNILKYLRRTRNYMLVYGAKDLILTGYTDSDFQSDKDARKSTSGSVFTLNGGAVVW 1202
Query: 1162 RSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANS 1221
RS+KQ CI DSTMEAEYVA CEAAKE VW RKFLTDLEVVPNMHLPITLYCDNSGAVANS
Sbjct: 1203 RSVKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANS 1262
Query: 1222 REPRSHKRGKHIERKYHLIREIVHRGDVVVTQ 1254
+EPRSHKRGKHIERKYHLIREIVHRGDVVVTQ
Sbjct: 1263 KEPRSHKRGKHIERKYHLIREIVHRGDVVVTQ 1292
BLAST of CSPI04G12000 vs. NCBI nr
Match:
KAA0048404.1 (gag/pol protein [Cucumis melo var. makuwa])
HSP 1 Score: 1879.0 bits (4866), Expect = 0.0e+00
Identity = 933/1277 (73.06%), Postives = 1083/1277 (84.81%), Query Frame = 0
Query: 23 KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMK 82
KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ + ++H+A+KYIY RM
Sbjct: 63 KANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNARMN 122
Query: 83 EGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT 142
EG SVREHVL+MM+HFN+AE+NG I+E +QVSFILESLP+SF+ F++NA +NKI + LT
Sbjct: 123 EGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLT 182
Query: 143 TLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTP 202
TLLNELQ F++L KG++ EANVAT+ RKF RGS+S TK+ PS N + KKK G+
Sbjct: 183 TLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGGQGN 242
Query: 203 KQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLV 262
K N K A KG C+HC Q GHW RNCPKYLAEKK K QGKYDLLV+ETCLV
Sbjct: 243 KANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKK--KAKQGKYDLLVLETCLV 302
Query: 263 EYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR 322
E ++ WI+DSGATNH+C SFQ SSW++L GE+T++VGTG +VSA AVG L+L +
Sbjct: 303 ENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKS 362
Query: 323 YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRP 382
+++L+NV VP +KRNLIS+ C+LEQ Y ++F +N+ F++ G+ +CSA LE+NLY LR
Sbjct: 363 FLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRS 422
Query: 383 TRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDN 442
+ +LNTE+F+TA TQNKR K+S NA+LWHLRLGHINLNRI RLVK+GLLS LE+N
Sbjct: 423 LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEEN 482
Query: 443 SLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSR 502
SLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSR
Sbjct: 483 SLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSR 542
Query: 503 YGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQ 562
YG++YL+ HKS +LEKFKEYKAEVEN L KTIK RSDRGGEYMDL+F++YL+E GI SQ
Sbjct: 543 YGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQ 602
Query: 563 LSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET 622
LSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSET
Sbjct: 603 LSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSET 662
Query: 623 PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENK 682
P +LW GRKGSLRHFRIWGCPAHVL NPKKLE RSKLC F+GYPK +RGG FYDP++NK
Sbjct: 663 PLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNK 722
Query: 683 IFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQS 742
+FVSTNATFLEEDHIR+H+PRSK+VL E+SK +++PS+ T+VV S ++
Sbjct: 723 VFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVV-HVGSSTRT 782
Query: 743 HPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMD 802
H Q LREPRRSGRV + P RY+ L ET VI D IEDPLT+K+AM+DVD+D+WIKAM+
Sbjct: 783 HQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMN 842
Query: 803 LEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDY 862
LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR GKVQTFKARLVAKGYTQ EGVDY
Sbjct: 843 LELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY 902
Query: 863 EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ 922
EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI QEQ
Sbjct: 903 EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQ 962
Query: 923 KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDI 982
K+CKL +SIYGL+ SRSWNIRFDTAIKSYGF+Q VDEPCVYK+++N +AFLVLYVDDI
Sbjct: 963 KICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDI 1022
Query: 983 LLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSR 1042
LLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDK++ +
Sbjct: 1023 LLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIVVK 1082
Query: 1043 YKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYS 1102
Y MQNSK+GLLP+R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+
Sbjct: 1083 YSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYA 1142
Query: 1103 VGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKS 1162
VG+VSRYQSNPG HWTAVK ILKYLR T+DY L+YG+KDLILTGYTDSDFQTD+D+RKS
Sbjct: 1143 VGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYTLVYGSKDLILTGYTDSDFQTDRDSRKS 1202
Query: 1163 TSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL 1222
TSGSVFTLNG AVVWRSIKQ CI DSTMEAEYVA CEAAKE VW R FL DLEVVPNM
Sbjct: 1203 TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSK 1262
Query: 1223 PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTK 1282
PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDV+VTQI+S N+ADPFTK
Sbjct: 1263 PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTQIASTHNVADPFTK 1322
Query: 1283 ALTAKVFEGHLHNLGLR 1284
LTAKVFEGHL +LGLR
Sbjct: 1323 PLTAKVFEGHLESLGLR 1336
BLAST of CSPI04G12000 vs. NCBI nr
Match:
KAA0035879.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0044276.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0051221.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0051893.1 gag/pol protein [Cucumis melo var. makuwa] >TYK00551.1 gag/pol protein [Cucumis melo var. makuwa])
HSP 1 Score: 1879.0 bits (4866), Expect = 0.0e+00
Identity = 933/1277 (73.06%), Postives = 1083/1277 (84.81%), Query Frame = 0
Query: 23 KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMK 82
KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ + ++H+A+KYIY RM
Sbjct: 64 KANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNARMN 123
Query: 83 EGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT 142
EG SVREHVL+MM+HFN+AE+NG I+E +QVSFILESLP+SF+ F++NA +NKI + LT
Sbjct: 124 EGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLT 183
Query: 143 TLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTP 202
TLLNELQ F++L KG++ EANVAT+ RKF RGS+S TK+ PS N + KKK G+
Sbjct: 184 TLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGGQGN 243
Query: 203 KQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLV 262
K N K A KG C+HC Q GHW RNCPKYLAEKK K QGKYDLLV+ETCLV
Sbjct: 244 KANLAAAKTTKKTKAAKGICFHCNQEGHWKRNCPKYLAEKK--KAKQGKYDLLVLETCLV 303
Query: 263 EYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR 322
E ++ WI+DSGATNH+C SFQ SSW++L GE+T++VGTG +VSA AVG L+L +
Sbjct: 304 ENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKS 363
Query: 323 YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRP 382
+++L+NV VP +KRNLIS+ C+LEQ Y ++F +N+ F++ G+ +CSA LE+NLY LR
Sbjct: 364 FLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRS 423
Query: 383 TRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDN 442
+ +LNTE+F+TA TQNKR K+S NA+LWHLRLGHINLNRI RLVK+GLLS LE+N
Sbjct: 424 LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEEN 483
Query: 443 SLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSR 502
SLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSR
Sbjct: 484 SLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSR 543
Query: 503 YGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQ 562
YG++YL+ HKS +LEKFKEYKAEVEN L KTIK RSDRGGEYMDL+F++YL+E GI SQ
Sbjct: 544 YGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQ 603
Query: 563 LSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET 622
LSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSET
Sbjct: 604 LSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSET 663
Query: 623 PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENK 682
P +LW GRKGSLRHFRIWGCPAHVL NPKKLE RSKLC F+GYPK +RGG FYDP++NK
Sbjct: 664 PLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNK 723
Query: 683 IFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQS 742
+FVSTNATFLEEDHIR+H+PRSK+VL E+SK +++PS+ T+VV S ++
Sbjct: 724 VFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVV-HVGSSTRT 783
Query: 743 HPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMD 802
H Q LREPRRSGRV + P RY+ L ET VI D IEDPLT+K+AM+DVD+D+WIKAM+
Sbjct: 784 HQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMN 843
Query: 803 LEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDY 862
LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR GKVQTFKARLVAKGYTQ EGVDY
Sbjct: 844 LELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY 903
Query: 863 EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ 922
EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI QEQ
Sbjct: 904 EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQ 963
Query: 923 KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDI 982
K+CKL +SIYGL+ SRSWNIRFDTAIKSYGF+Q VDEPCVYK+++N +AFLVLYVDDI
Sbjct: 964 KICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDI 1023
Query: 983 LLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSR 1042
LLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDK++ +
Sbjct: 1024 LLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIVVK 1083
Query: 1043 YKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYS 1102
Y MQNSK+GLLP+R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+
Sbjct: 1084 YSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYA 1143
Query: 1103 VGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKS 1162
VG+VSRYQSNPG HWTAVK ILKYLR T+DY L+YG+KDLILTGYTDSDFQTD+D+RKS
Sbjct: 1144 VGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYTLVYGSKDLILTGYTDSDFQTDRDSRKS 1203
Query: 1163 TSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL 1222
TSGSVFTLNG AVVWRSIKQ CI DSTMEAEYVA CEAAKE VW R FL DLEVVPNM
Sbjct: 1204 TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSK 1263
Query: 1223 PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTK 1282
PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDV+VTQI+S N+ADPFTK
Sbjct: 1264 PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTQIASTHNVADPFTK 1323
Query: 1283 ALTAKVFEGHLHNLGLR 1284
LTAKVFEGHL +LGLR
Sbjct: 1324 PLTAKVFEGHLESLGLR 1337
BLAST of CSPI04G12000 vs. NCBI nr
Match:
KAA0031826.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0032384.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0039313.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0043789.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0048789.1 gag/pol protein [Cucumis melo var. makuwa])
HSP 1 Score: 1879.0 bits (4866), Expect = 0.0e+00
Identity = 933/1277 (73.06%), Postives = 1083/1277 (84.81%), Query Frame = 0
Query: 23 KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMK 82
KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ + ++H+A+KYIY RM
Sbjct: 64 KANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNARMN 123
Query: 83 EGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT 142
EG SVREHVL+MM+HFN+AE+NG I+E +QVSFILESLP+SF+ F++NA +NKI + LT
Sbjct: 124 EGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLT 183
Query: 143 TLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTP 202
TLLNELQ F++L KG++ EANVAT+ RKF RGS+S TK+ PS N + KKK G+
Sbjct: 184 TLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGGQGN 243
Query: 203 KQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLV 262
K N K A KG C+HC Q GHW RNCPKYLAEKK K QGKYDLLV+ETCLV
Sbjct: 244 KANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKK--KAKQGKYDLLVLETCLV 303
Query: 263 EYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR 322
E ++ WI+DSGATNH+C SFQ SSW++L GE+T++VGTG +VSA AVG L+L +
Sbjct: 304 ENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKS 363
Query: 323 YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRP 382
+++L+NV VP +KRNLIS+ C+LEQ Y ++F +N+ F++ G+ +CSA LE+NLY LR
Sbjct: 364 FLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRS 423
Query: 383 TRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDN 442
+ +LNTE+F+TA TQNKR K+S NA+LWHLRLGHINLNRI RLVK+GLLS LE+N
Sbjct: 424 LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEEN 483
Query: 443 SLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSR 502
SLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSR
Sbjct: 484 SLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSR 543
Query: 503 YGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQ 562
YG++YL+ HKS +LEKFKEYKAEVEN L KTIK RSDRGGEYMDL+F++YL+E GI SQ
Sbjct: 544 YGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQ 603
Query: 563 LSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET 622
LSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSET
Sbjct: 604 LSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSET 663
Query: 623 PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENK 682
P +LW GRKGSLRHFRIWGCPAHVL NPKKLE RSKLC F+GYPK +RGG FYDP++NK
Sbjct: 664 PLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNK 723
Query: 683 IFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQS 742
+FVSTNATFLEEDHIR+H+PRSK+VL E+SK +++PS+ T+VV S ++
Sbjct: 724 VFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVV-HVGSSTRT 783
Query: 743 HPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMD 802
H Q LREPRRSGRV + P RY+ L ET VI D IEDPLT+K+AM+DVD+D+WIKAM+
Sbjct: 784 HQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMN 843
Query: 803 LEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDY 862
LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR GKVQTFKARLVAKGYTQ EGVDY
Sbjct: 844 LELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY 903
Query: 863 EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ 922
EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI QEQ
Sbjct: 904 EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQ 963
Query: 923 KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDI 982
K+CKL +SIYGL+ SRSWNIRFDTAIKSYGF+Q VDEPCVYK+++N +AFLVLYVDDI
Sbjct: 964 KICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDI 1023
Query: 983 LLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSR 1042
LLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDK++ +
Sbjct: 1024 LLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIVVK 1083
Query: 1043 YKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYS 1102
Y MQNSK+GLLP+R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+
Sbjct: 1084 YSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYA 1143
Query: 1103 VGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKS 1162
VG+VSRYQSNPG HWTAVK ILKYLR T+DY L+YG+KDLILTGYTDSDFQTD+D+RKS
Sbjct: 1144 VGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYTLVYGSKDLILTGYTDSDFQTDRDSRKS 1203
Query: 1163 TSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL 1222
TSGSVFTLNG AVVWRSIKQ CI DSTMEAEYVA CEAAKE VW R FL DLEVVPNM
Sbjct: 1204 TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSK 1263
Query: 1223 PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTK 1282
PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDV+VTQI+S N+ADPFTK
Sbjct: 1264 PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTQIASTHNVADPFTK 1323
Query: 1283 ALTAKVFEGHLHNLGLR 1284
LTAKVFEGHL +LGLR
Sbjct: 1324 PLTAKVFEGHLESLGLR 1337
BLAST of CSPI04G12000 vs. NCBI nr
Match:
TYK14550.1 (gag/pol protein [Cucumis melo var. makuwa])
HSP 1 Score: 1857.4 bits (4810), Expect = 0.0e+00
Identity = 923/1265 (72.96%), Postives = 1072/1265 (84.74%), Query Frame = 0
Query: 23 KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMK 82
KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ + ++H+A+KYIY RM
Sbjct: 64 KANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNARMN 123
Query: 83 EGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT 142
EG SVREHVL+MM+HFN+AE+NG I+E +QVSFILESLP+SF+ F++NA +NKI + LT
Sbjct: 124 EGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLT 183
Query: 143 TLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTP 202
TLLNELQ F++L KG++ EANVAT+ RKF RGS+S TK+ PS N + KKK G+
Sbjct: 184 TLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGGQGN 243
Query: 203 KQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLV 262
K N K A KG C+HC Q GHW RNCPKYLAEKK K QGKYDLLV+ETCLV
Sbjct: 244 KANLAAAKTTKKTKAAKGICFHCNQEGHWKRNCPKYLAEKK--KAKQGKYDLLVLETCLV 303
Query: 263 EYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR 322
E ++ WI+DSGATNH+C SFQ SSW++L GE+T++VGTG +VSA AVG L+L +
Sbjct: 304 ENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKS 363
Query: 323 YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRP 382
+++L+NV VP +KRNLIS+ C+LEQ Y ++F +N+ F++ G+ +CSA LE+NLY LR
Sbjct: 364 FLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRS 423
Query: 383 TRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDN 442
+ +LNTE+F+TA TQNKR K+S NA+LWHLRLGHINLNRI RLVK+GLLS LE+N
Sbjct: 424 LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEEN 483
Query: 443 SLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSR 502
SLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSR
Sbjct: 484 SLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSR 543
Query: 503 YGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQ 562
YG++YL+ HKS +LEKFKEYKAEVEN L KTIK RSDRGGEYMDL+F++YL+E GI SQ
Sbjct: 544 YGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQ 603
Query: 563 LSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET 622
LSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSET
Sbjct: 604 LSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSET 663
Query: 623 PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENK 682
P +LW GRKGSLRHFRIWGCPAHVL NPKKLE RSKLC F+GYPK +RGG FYDP++NK
Sbjct: 664 PLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNK 723
Query: 683 IFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQS 742
+FVSTNATFLEEDHIR+H+PRSK+VL E+SK +++PS+ T+VV S ++
Sbjct: 724 VFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVV-HVGSSTRT 783
Query: 743 HPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMD 802
H Q LREPRRSGRV + P RY+ L ET VI D IEDPLT+K+AM+DVD+D+WIKAM+
Sbjct: 784 HQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMN 843
Query: 803 LEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDY 862
LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR GKVQTFKARLVAKGYTQ EGVDY
Sbjct: 844 LELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY 903
Query: 863 EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ 922
EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI QEQ
Sbjct: 904 EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQ 963
Query: 923 KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDI 982
K+CKL +SIYGL+ SRSWNIRFDTAIKSYGF+Q VDEPCVYK+++N +AFLVLYVDDI
Sbjct: 964 KICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDI 1023
Query: 983 LLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSR 1042
LLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDK++ +
Sbjct: 1024 LLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIVVK 1083
Query: 1043 YKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYS 1102
Y MQNSK+GLLP+R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+
Sbjct: 1084 YSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYA 1143
Query: 1103 VGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKS 1162
VG+VSRYQSNPG HWTAVK ILKYLR T+DY L+YG+KDLILTGYTDSDFQTD+D+RKS
Sbjct: 1144 VGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYTLVYGSKDLILTGYTDSDFQTDRDSRKS 1203
Query: 1163 TSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL 1222
TSGSVFTLNG AVVWRSIKQ CI DSTMEAEYVA CEAAKE VW R FL DLEVVPNM
Sbjct: 1204 TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSK 1263
Query: 1223 PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTK 1272
PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDV+VTQI+S N+ADPFTK
Sbjct: 1264 PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTQIASTHNVADPFTK 1323
BLAST of CSPI04G12000 vs. TAIR 10
Match:
AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )
HSP 1 Score: 295.0 bits (754), Expect = 2.8e-79
Identity = 166/484 (34.30%), Postives = 270/484 (55.79%), Query Frame = 0
Query: 764 EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHA 823
++P TY +A + + W AMD E+ +M W + P + KPIGCKW+YK K +
Sbjct: 84 KEPSTYNEAKEFL---VWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSD 143
Query: 824 GKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFL 883
G ++ +KARLVAKGYTQ+EG+D+ ETFSPV L S++++L+I+ Y++ + Q+D+ AFL
Sbjct: 144 GTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFL 203
Query: 884 NGNLEESIYMSQPEGFIEQDQE----QKVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFE 943
NG+L+E IYM P G+ + + VC LKKSIYGL+ SR W ++F + +GF
Sbjct: 204 NGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFV 263
Query: 944 QNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVL 1003
Q+ + + K+ ++ +++YVDDI++ N+ + ++K L F+++DLG +Y L
Sbjct: 264 QSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFL 323
Query: 1004 GIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVED 1063
G++I R+ + + Q Y +L + K +P + S + + D
Sbjct: 324 GLEIARSAAG--INICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAH----SGGDFVD 383
Query: 1064 MRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYM 1123
+ Y +G LMY + TR DI ++V +S++ P H AV IL Y++ T
Sbjct: 384 AK--AYRRLIGRLMYLQI-TRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQG 443
Query: 1124 LMYGTK-DLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEY 1183
L Y ++ ++ L ++D+ FQ+ KD R+ST+G L + W+S KQ + S+ EAEY
Sbjct: 444 LFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEY 503
Query: 1184 VATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYH 1243
A A E +W +F +L++ + P L+CDN+ A+ + H+R KHIE H
Sbjct: 504 RALSFATDEMMWLAQFFRELQL--PLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCH 553
BLAST of CSPI04G12000 vs. TAIR 10
Match:
ATMG00810.1 (DNA/RNA polymerases superfamily protein )
HSP 1 Score: 96.7 bits (239), Expect = 1.5e-19
Identity = 75/236 (31.78%), Postives = 116/236 (49.15%), Query Frame = 0
Query: 958 FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQA 1017
+L+LYVDDILL G+ L + L+ F MKDLG Y LGIQI + L +SQ
Sbjct: 2 YLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSG--LFLSQT 61
Query: 1018 SYIDKMLSRYKMQNSK--KGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYA 1077
Y +++L+ M + K LP + +S + P + D R+I VG+L Y
Sbjct: 62 KYAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP----DPSDFRSI-----VGALQYL 121
Query: 1078 MLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDY-MLMYGTKDLILTGYTD 1137
L TRPDI Y+V +V + P + +K +L+Y++ T + + ++ L + + D
Sbjct: 122 TL-TRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCD 181
Query: 1138 SDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVW 1191
SD+ R+ST+G L + W + +Q + S+ E EY A A E W
Sbjct: 182 SDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225
BLAST of CSPI04G12000 vs. TAIR 10
Match:
ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )
HSP 1 Score: 82.0 bits (201), Expect = 3.8e-15
Identity = 40/103 (38.83%), Postives = 62/103 (60.19%), Query Frame = 0
Query: 764 EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHA 823
++P + A+KD W +AM E++++ N W LV P + +GCKW++K K
Sbjct: 26 KEPKSVIFALKD---PGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSD 85
Query: 824 GKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA 867
G + KARLVAKG+ Q EG+ + ET+SPV +IR +L++A
Sbjct: 86 GTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125
BLAST of CSPI04G12000 vs. TAIR 10
Match:
ATMG00240.1 (Gag-Pol-related retrotransposon family protein )
HSP 1 Score: 46.2 bits (108), Expect = 2.3e-04
Identity = 25/72 (34.72%), Postives = 39/72 (54.17%), Query Frame = 0
Query: 1079 TRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMY-GTKDLILTGYTDSDF 1138
TRPD+ ++V +S++ S AV +L Y++ T L Y T DL L + DSD+
Sbjct: 6 TRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAFADSDW 65
Query: 1139 QTDKDARKSTSG 1150
+ D R+S +G
Sbjct: 66 ASCPDTRRSVTG 77
BLAST of CSPI04G12000 vs. TAIR 10
Match:
ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )
HSP 1 Score: 45.8 bits (107), Expect = 3.0e-04
Identity = 26/82 (31.71%), Postives = 46/82 (56.10%), Query Frame = 0
Query: 570 NRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHF 629
NRT+++ VRSM+ + +F A TA +I+N PS +++ P E+W + +
Sbjct: 2 NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61
Query: 630 RIWGCPAHVLVQNPKKLEHRSK 651
R +GC A++ + KL+ R+K
Sbjct: 62 RRFGCVAYIHC-DEGKLKPRAK 82
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
P10978 | 4.1e-208 | 35.09 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... | [more] |
P04146 | 4.5e-146 | 27.82 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3 | [more] |
Q9ZT94 | 1.5e-130 | 27.17 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... | [more] |
Q94HW2 | 3.2e-128 | 26.52 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... | [more] |
Q99337 | 5.4e-35 | 21.49 | Transposon Ty1-NL2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ... | [more] |
Match Name | E-value | Identity | Description | |
E2GK51 | 0.0e+00 | 87.26 | Gag/pol protein (Fragment) OS=Bryonia dioica OX=3652 PE=4 SV=1 | [more] |
A0A5A7SMH8 | 0.0e+00 | 73.06 | Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G0025... | [more] |
A0A5D3CPJ6 | 0.0e+00 | 73.06 | Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G0004... | [more] |
A0A5A7TZD7 | 0.0e+00 | 73.06 | Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold264G0013... | [more] |
A0A5A7TWB9 | 0.0e+00 | 72.96 | Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold133G0031... | [more] |
Match Name | E-value | Identity | Description | |
ADJ18449.1 | 0.0e+00 | 87.26 | gag/pol protein, partial [Bryonia dioica] | [more] |
KAA0048404.1 | 0.0e+00 | 73.06 | gag/pol protein [Cucumis melo var. makuwa] | [more] |
KAA0035879.1 | 0.0e+00 | 73.06 | gag/pol protein [Cucumis melo var. makuwa] >KAA0044276.1 gag/pol protein [Cucumi... | [more] |
KAA0031826.1 | 0.0e+00 | 73.06 | gag/pol protein [Cucumis melo var. makuwa] >KAA0032384.1 gag/pol protein [Cucumi... | [more] |
TYK14550.1 | 0.0e+00 | 72.96 | gag/pol protein [Cucumis melo var. makuwa] | [more] |
Match Name | E-value | Identity | Description | |
AT4G23160.1 | 2.8e-79 | 34.30 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | [more] |
ATMG00810.1 | 1.5e-19 | 31.78 | DNA/RNA polymerases superfamily protein | [more] |
ATMG00820.1 | 3.8e-15 | 38.83 | Reverse transcriptase (RNA-dependent DNA polymerase) | [more] |
ATMG00240.1 | 2.3e-04 | 34.72 | Gag-Pol-related retrotransposon family protein | [more] |
ATMG00710.1 | 3.0e-04 | 31.71 | Polynucleotidyl transferase, ribonuclease H-like superfamily protein | [more] |