CSPI06G15660 (gene) Wild cucumber (PI 183967)

Name: CSPI06G15660
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Gag/pol protein
Location: Chr6 : 14003487 .. 14005298 (-)
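
As a quick sanity check on the Location coordinates, the feature length can be worked out from the start and end positions. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the record itself does not state this), and `feature_length` is a hypothetical helper name.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


# Coordinates taken from the Location field of this record.
length = feature_length(14003487, 14005298)
print(length)           # 1812
print(length % 3 == 0)  # True: the span is a whole number of codons
```

A span of 1812 bp corresponds to 604 codons including the stop codon, which agrees with alignments below that reach query position 600.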
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAACTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTTTTTCTCAGATGTCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCTAAGAAGTTGGAACATCGTTCAAAATTATGCTTTTTCATAGGTTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTCAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCCGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAACTTTTATGAACGGTAATCTTGAAGAGAGTATCTATATGTGTCAAACAAAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGGATTAAAACAAGCATCTAGATCCTGGAATATGAGATTTGATACTGCGATAAAATCTTATGGCTTTGAACAAAATGTTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATTCCATTATAGCATTTTTAGTCTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGAATATCTTATTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGATCTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCCATGTCTCAAGCATCTTACATAGACAAAATGTTGTCTAGATATAAAATGCAGAATTCCAAAAAGGGTCTGCTGTCGTACATATATGGAATTCATTTGTCAAAAGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTCCCTATGCTTCCATTGTTGGAAGTTTAATGTATGCAAAGTTATGTACCAGACCTGACATTTGCTACTCAGTAGGGATGGTCAGTAGGTATCAATCCAATCCTGGACGTGATCACTGGACAACCGTTAAAAACATTCTAAAATATCTTCGAAGAACAAAAGACTACATGCTCATGTATCGTACAAAGGATAAGATGCTAGAAAGTCTACATCATGATCAGTATTTAGTCTAA

mRNA sequence

ATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAACTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTTTTTCTCAGATGTCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCTAAGAAGTTGGAACATCGTTCAAAATTATGCTTTTTCATAGGTTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTCAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCCGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAACTTTTATGAACGGTAATCTTGAAGAGAGTATCTATATGTGTCAAACAAAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGGATTAAAACAAGCATCTAGATCCTGGAATATGAGATTTGATACTGCGATAAAATCTTATGGCTTTGAACAAAATGTTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATTCCATTATAGCATTTTTAGTCTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGAATATCTTATTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGATCTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCCATGTCTCAAGCATCTTACATAGACAAAATGTTGTCTAGATATAAAATGCAGAATTCCAAAAAGGGTCTGCTGTCGTACATATATGGAATTCATTTGTCAAAAGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTCCCTATGCTTCCATTGTTGGAAGTTTAATGTATGCAAAGTTATGTACCAGACCTGACATTTGCTACTCAGTAGGGATGGTCAGTAGGTATCAATCCAATCCTGGACGTGATCACTGGACAACCGTTAAAAACATTCTAAAATATCTTCGAAGAACAAAAGACTACATGCTCATGTATCGTACAAAGGATAAGATGCTAGAAAGTCTACATCATGATCAGTATTTAGTCTAA

Coding sequence (CDS)

ATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAACTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTTTTTCTCAGATGTCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCTAAGAAGTTGGAACATCGTTCAAAATTATGCTTTTTCATAGGTTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTCAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCCGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAACTTTTATGAACGGTAATCTTGAAGAGAGTATCTATATGTGTCAAACAAAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGGATTAAAACAAGCATCTAGATCCTGGAATATGAGATTTGATACTGCGATAAAATCTTATGGCTTTGAACAAAATGTTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATTCCATTATAGCATTTTTAGTCTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGAATATCTTATTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGATCTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCCATGTCTCAAGCATCTTACATAGACAAAATGTTGTCTAGATATAAAATGCAGAATTCCAAAAAGGGTCTGCTGTCGTACATATATGGAATTCATTTGTCAAAAGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTCCCTATGCTTCCATTGTTGGAAGTTTAATGTATGCAAAGTTATGTACCAGACCTGACATTTGCTACTCAGTAGGGATGGTCAGTAGGTATCAATCCAATCCTGGACGTGATCACTGGACAACCGTTAAAAACATTCTAAAATATCTTCGAAGAACAAAAGACTACATGCTCATGTATCGTACAAAGGATAAGATGCTAGAAAGTCTACATCATGATCAGTATTTAGTCTAA
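
The BLAST reports that follow use the protein translation of this coding sequence as the query. The sketch below shows one way to translate the start of the CDS, assuming the standard genetic code (NCBI translation table 1); `translate` and the table layout are illustrative choices, not part of the record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nucleotides of the CDS above.
cds_start = "ATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGA"
print(translate(cds_start))  # MDLRFRDYLIENG
```

The result matches the first residues of the Query lines in the alignments that start at position 1.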
BLAST of CSPI06G15660 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 440.3 bits (1131), Expect = 3.5e-122
Identity = 244/622 (39.23%), Positives = 371/622 (59.65%), Query Frame = 1

Query: 5    FRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAY 64
            F +Y   +GI+ + + P TPQ NGV+ER NRT+++ VRSM+  +++  SFWG A++TA Y
Sbjct: 560  FEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACY 619

Query: 65   ILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPA--HVLVQNPKKLEHRSKLCFFIGY 124
            ++N  PS  ++ E P  +W  ++ S  H +++GC A  HV  +   KL+ +S  C FIGY
Sbjct: 620  LINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGY 679

Query: 125  PKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAI------DKPS 184
              E  G   +DP + K+  S +  F E + +R     S+ V   I  + +      + P+
Sbjct: 680  GDEEFGYRLWDPVKKKVIRSRDVVFRESE-VRTAADMSEKVKNGIIPNFVTIPSTSNNPT 739

Query: 185  SSTKVVDKTRKSGQS-------------------HPSQ---QLREPRRSGRVVHQPDRYL 244
            S+    D+  + G+                    HP+Q   Q +  RRS R   +  RY 
Sbjct: 740  SAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEHPTQGEEQHQPLRRSERPRVESRRYP 799

Query: 245  GLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKP 304
                  V+I DD   +P + K+ +   +++Q +KAM  EMES+  N  + LV+ P   +P
Sbjct: 800  ST--EYVLISDD--REPESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRP 859

Query: 305  IGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFY 364
            + CKW++K K+D   K+  +KARLV KG+ Q++G+D++E FSPV  + SIR +LS+A   
Sbjct: 860  LKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASL 919

Query: 365  DYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRF 424
            D E+ Q+DVKT F++G+LEE IYM Q +GF    ++  VCKL KS+YGLKQA R W M+F
Sbjct: 920  DLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKF 979

Query: 425  DTAIKSYGFEQNVDEPCVY-KKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQ 484
            D+ +KS  + +   +PCVY K+   +    L+LYVDD+L++G D   +  +K  L+  F 
Sbjct: 980  DSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFD 1039

Query: 485  MKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKE 544
            MKDLG AQ +LG++IVR R ++ L +SQ  YI+++L R+ M+N+K         + LSK+
Sbjct: 1040 MKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKK 1099

Query: 545  QCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNI 595
             CP T +E  +M  +PY+S VGSLMYA +CTRPDI ++VG+VSR+  NPG++HW  VK I
Sbjct: 1100 MCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWI 1159
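
Each HSP summary reports identity and positives as matched/aligned counts followed by a percentage. The following sketch shows how those figures could be parsed and recomputed; `parse_hsp_stats` is a hypothetical helper, and the regular expression is an assumption about the line layout based only on the summaries in this record.

```python
import re

# Matches e.g. "Identity = 244/622 (39.23%), Positives = 371/622 (59.65%)".
# "Pos\w*tives" tolerates spelling variants of the second label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w*tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP summary line and recompute the percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, aligned, positives, aligned_pos = map(int, match.groups())
    return {
        "identical": identical,
        "positives": positives,
        "aligned": aligned,
        "pct_identity": 100 * identical / aligned,
        "pct_positives": 100 * positives / aligned_pos,
    }


stats = parse_hsp_stats(
    "Identity = 244/622 (39.23%), Positives = 371/622 (59.65%), Query Frame = 1"
)
print(round(stats["pct_identity"], 2))   # 39.23
print(round(stats["pct_positives"], 2))  # 59.65
```

Recomputing the percentages from the counts is a cheap way to confirm that a summary line was copied or parsed correctly.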

BLAST of CSPI06G15660 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 237.7 bits (605), Expect = 3.4e-61
Identity = 150/493 (30.43%), Positives = 261/493 (52.94%), Query Frame = 1

Query: 111  HRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRD-------HQPRSKLVL 170
            + SK C  I + K+S+       + NK F++ +     +DH+ +       ++ R     
Sbjct: 782  NESKECDNIQFLKDSK-------ESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETA 841

Query: 171  KEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLG--LIETQVV 230
            + + +  ID P+ +  +    R+S +     Q+          ++ D  L   ++    +
Sbjct: 842  EHLKEIGIDNPTKNDGIEIINRRSERLKTKPQIS--------YNEEDNSLNKVVLNAHTI 901

Query: 231  IPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYK 290
              D     P ++ +     D+  W +A++ E+ +   N+ WT+  +P +   +  +W++ 
Sbjct: 902  FNDV----PNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFS 961

Query: 291  RKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMD 350
             K +  G    +KARLVA+G+TQ+  +DYEETF+PVA + S R +LS+   Y+ ++ QMD
Sbjct: 962  VKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMD 1021

Query: 351  VKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSYG 410
            VKT F+NG L+E IYM   +G         VCKL K+IYGLKQA+R W   F+ A+K   
Sbjct: 1022 VKTAFLNGTLKEEIYMRLPQGI--SCNSDNVCKLNKAIYGLKQAARCWFEVFEQALKECE 1081

Query: 411  FEQNVDEPCVY---KKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGD 470
            F  +  + C+Y   K  +N  I +++LYVDD+++   D+  + + K++L  +F+M DL +
Sbjct: 1082 FVNSSVDRCIYILDKGNINENI-YVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNE 1141

Query: 471  AQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNS---KKGLLSYI-YGIHLSKEQC 530
             ++ +GI+I    +   + +SQ++Y+ K+LS++ M+N       L S I Y +  S E C
Sbjct: 1142 IKHFIGIRI--EMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELLNSDEDC 1201

Query: 531  PKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILK 588
                       N P  S++G LMY  LCTRPD+  +V ++SRY S    + W  +K +L+
Sbjct: 1202 -----------NTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLR 1239

BLAST of CSPI06G15660 vs. Swiss-Prot
Match: YCH4_YEAST (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY5A PE=5 SV=2)

HSP 1 Score: 120.6 bits (301), Expect = 6.1e-26
Identity = 78/260 (30.00%), Positives = 127/260 (48.85%), Query Frame = 1

Query: 340 MDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKS 399
           MDV T F+N  ++E IY+ Q  GF+ +     V +L   +YGLKQA   WN   +  +K 
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 400 YGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDA 459
            GF ++  E  +Y +  +    ++ +YVDD+L+     +    +K+ L   + MKDLG  
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 460 QYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQ 519
              LG+ I     N  + +S   YI K  S  ++   K         +  SK     T  
Sbjct: 121 DKFLGLNI-HQSSNGDITLSLQDYIAKAASESEINTFKLTQTP----LCNSKPLFETTSP 180

Query: 520 EVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRT 579
            ++D+   PY SIVG L++     RPDI Y V ++SR+   P   H  + + +L+YL  T
Sbjct: 181 HLKDI--TPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTT 240

Query: 580 KDYMLMYRTKDKMLESLHHD 600
           +   L YR+  ++  +++ D
Sbjct: 241 RSMCLKYRSGSQLALTVYCD 253

BLAST of CSPI06G15660 vs. Swiss-Prot
Match: YH11B_YEAST (Transposon Ty1-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY1B-H PE=3 SV=1)

HSP 1 Score: 87.8 bits (216), Expect = 4.4e-16
Identity = 95/419 (22.67%), Positives = 194/419 (46.30%), Query Frame = 1

Query: 198  EPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMY 257
            EP RS + +H       +   + +      ++ +TY + +K+  ++++I+A   E+  + 
Sbjct: 1243 EPPRSKKRIHLIAAVKAVKSIKPIRTTLRYDEAITYNKDIKE--KEKYIQAYHKEVNQLL 1302

Query: 258  FNSVWTLVDQPNDVKPIGCK------WIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYE 317
                W   D+  D K I  K      +I+ RKRD      T KAR VA+G      + + 
Sbjct: 1303 MMKTWD-TDRYYDRKEIDPKRVINSMFIFNRKRDG-----THKARFVARG-----DIQHP 1362

Query: 318  ETFSPVAMLKSIR-----ILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQ 377
            +T+ P     ++        LS+A   +Y I Q+D+ + ++  +++E +Y+         
Sbjct: 1363 DTYDPGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEELYIRPPPHL--- 1422

Query: 378  DQEQKVCKLKKSIYGLKQASRSWNMRFDT-AIKSYGFEQNVDEPCVYKKVVNSIIAFLVL 437
                K+ +LKKS+YGLKQ+  +W     +  IK  G E+     CV+K   NS +  + L
Sbjct: 1423 GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIKQCGMEEVRGWSCVFK---NSQVT-ICL 1482

Query: 438  YVDDILLIGNDVEYLIDIKKWLAMQFQMK--DLGDA----QY-VLGIQIVRNRKNKTLAM 497
            +VDD++L   D+     I   L  Q+  K  +LG++    QY +LG++I + ++ K + +
Sbjct: 1483 FVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDILGLEI-KYQRGKYMKL 1542

Query: 498  SQASYIDKMLSRYKMQNSKKGLLSYI---YGIHLSKEQCPKTPQEVEDMRNIPYASIVGS 557
               + + + + +  +  + KG         G+++ +++      E ++  +     ++G 
Sbjct: 1543 GMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQDELEIDEDEYKEKVH-EMQKLIGL 1602

Query: 558  LMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKMLE 595
              Y     R D+ Y +  ++++   P R        +++++  T+D  L++  K+K  E
Sbjct: 1603 ASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRDKQLIWH-KNKPTE 1638

BLAST of CSPI06G15660 vs. Swiss-Prot
Match: YG13B_YEAST (Transposon Ty1-GR3 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY1B-GR3 PE=3 SV=3)

HSP 1 Score: 87.0 bits (214), Expect = 7.5e-16
Identity = 92/406 (22.66%), Positives = 190/406 (46.80%), Query Frame = 1

Query: 198  EPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMY 257
            EP RS + +H       +   + +      ++ +TY + +K+  ++++I+A   E+  + 
Sbjct: 1205 EPPRSKKRIHLIAAVKAVKSIKPIRTTLRYDEAITYNKDIKE--KEKYIEAYHKEVNQLL 1264

Query: 258  FNSVWTLVDQPNDVKPIGCK------WIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYE 317
              + W   D+  D K I  K      +I+ RKRD      T KAR VA+G  Q       
Sbjct: 1265 KMNTWD-TDKYYDRKEIDPKRVINSMFIFNRKRDG-----THKARFVARGDIQHPDTYDS 1324

Query: 318  ETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQK 377
               S      ++   LS+A   +Y I Q+D+ + ++  +++E +Y+             K
Sbjct: 1325 GMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEELYIRPPPHL---GMNDK 1384

Query: 378  VCKLKKSIYGLKQASRSWNMRFDT-AIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDI 437
            + +LKKS+YGLKQ+  +W     +  IK  G E+     CV+K   NS +  + L+VDD+
Sbjct: 1385 LIRLKKSLYGLKQSGANWYETIKSYLIKQCGMEEVRGWSCVFK---NSQVT-ICLFVDDM 1444

Query: 438  LLIGNDVEYLIDIKKWLAMQFQMK--DLGDA----QY-VLGIQIVRNRKNKTLAMSQASY 497
            +L   D+     I   L MQ+  K  +LG++    QY +LG++I + ++ K + +   + 
Sbjct: 1445 ILFSKDLNSNKRIIAKLKMQYDTKIINLGESDDEIQYDILGLEI-KYQRGKYMKLGMENS 1504

Query: 498  IDKMLSRYKM---QNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAK 557
            + + + +  +    N +K       G+++++++  +  ++   M+      ++G   Y  
Sbjct: 1505 LTEKIPKLNVPLNPNGRKLGAPGQPGLYINQQEL-ELEEDDYKMKVHEMQKLIGLASYVG 1564

Query: 558  LCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMY 587
               R D+ Y +  ++++   P +        +++++  T+D  L++
Sbjct: 1565 YKFRFDLLYYINTLAQHILFPSKQVLDMTYELIQFIWNTRDKQLIW 1593

BLAST of CSPI06G15660 vs. TrEMBL
Match: E2GK51_BRYDI (Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1)

HSP 1 Score: 1060.8 bits (2742), Expect = 6.1e-307
Identity = 513/593 (86.51%), Positives = 556/593 (93.76%), Query Frame = 1

Query: 1    MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALE 60
            MD +F+DYLIE GIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMS++Q+ DSFWGYALE
Sbjct: 576  MDSKFQDYLIEFGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSYAQLPDSFWGYALE 635

Query: 61   TAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIG 120
            TA +ILNNVPSKSV ETPYELWKGRK SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+G
Sbjct: 636  TAIHILNNVPSKSVLETPYELWKGRKSSLRYFRIWGCPAHVLVQNPKKLEPRSKLCLFVG 695

Query: 121  YPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKV 180
            YPKESRGGLFY PQENK+FVSTNATFLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKV
Sbjct: 696  YPKESRGGLFYHPQENKVFVSTNATFLEEDHXRNHQPRSKIVLKEMFKNATDKPSSSTKV 755

Query: 181  VDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDV 240
            VDK   S QSH SQ+LR PRRSGRVVHQP+RYLGL+ETQ++IPDDG+EDPLTYKQAM DV
Sbjct: 756  VDKANISDQSHTSQELRVPRRSGRVVHQPNRYLGLVETQIIIPDDGVEDPLTYKQAMNDV 815

Query: 241  DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAK 300
            DRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAK
Sbjct: 816  DRDQWIKAMNLEMESMYFNSVWTLVDLPSDVKPIGCKWIYKRKRDQAGKVQTFKARLVAK 875

Query: 301  GYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQT 360
            GYTQ+EGVDYEETFSPVAMLKSIRILLSIATFY+YEIWQMDVKT F+NGNLEESIYM Q 
Sbjct: 876  GYTQKEGVDYEETFSPVAMLKSIRILLSIATFYNYEIWQMDVKTAFLNGNLEESIYMVQP 935

Query: 361  KGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKKVVNSII 420
            +GFI QDQEQKVCKL+KSIYGLKQASRSWN+RFDTAIKSYGFEQNVDEPCVYKK+VNS++
Sbjct: 936  EGFIAQDQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKIVNSVV 995

Query: 421  AFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQ 480
            AFL+LYVDDILLIGNDVEYL D+KKWL  QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQ
Sbjct: 996  AFLILYVDDILLIGNDVEYLTDVKKWLNTQFQMKDLGEAQYILGIQIVRNRKNKTLAMSQ 1055

Query: 481  ASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAK 540
            ASYIDK+LSRYKMQNSKKG L + +GIHLSKEQCPKTPQEVEDMRNIPY+S VGSLMYA 
Sbjct: 1056 ASYIDKVLSRYKMQNSKKGQLPFRHGIHLSKEQCPKTPQEVEDMRNIPYSSAVGSLMYAM 1115

Query: 541  LCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML 594
            LCTRPDICYSVG+VSRYQSNPGRDHWT VKNILKYLRRT++YML+Y  KD +L
Sbjct: 1116 LCTRPDICYSVGIVSRYQSNPGRDHWTAVKNILKYLRRTRNYMLVYGAKDLIL 1168

BLAST of CSPI06G15660 vs. TrEMBL
Match: A0A165U314_9ROSI (Gag/pol protein OS=Momordica dioica PE=4 SV=1)

HSP 1 Score: 838.6 bits (2165), Expect = 4.9e-240
Identity = 415/595 (69.75%), Positives = 480/595 (80.67%), Query Frame = 1

Query: 1    MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALE 60
            M   F D+L E GI SQLSAP TPQ NGVSERRNRTLLDMVRSMMS++ + DSFWGYA E
Sbjct: 583  MSSEFGDHLREFGIVSQLSAPGTPQCNGVSERRNRTLLDMVRSMMSYADLPDSFWGYARE 642

Query: 61   TAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIG 120
                ILN VPSKSV ETPYELW GRK SL   +IWGCPAHV    PKKLE RS+ C F+G
Sbjct: 643  RERAILNRVPSKSVEETPYELWYGRKSSLSFLKIWGCPAHVKKLQPKKLEPRSEKCLFVG 702

Query: 121  YPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAI-----DKPS 180
            YPKE+RG  FY PQENK+FV+TN  FLE++ +  HQP SK+VLK + +  I     DKPS
Sbjct: 703  YPKETRGYYFYHPQENKVFVATNEAFLEKEFLSRHQPGSKIVLKAVVEPLIPLDGTDKPS 762

Query: 181  SSTKVV-DKTR-KSGQSHP--SQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPL 240
            SSTKVV DK      QSH    Q+LR PRRSGR    P+RYLGL+ETQ++I D+G EDP 
Sbjct: 763  SSTKVVVDKAEVNDDQSHTPDQQELRVPRRSGRSRRAPNRYLGLVETQIMILDNGEEDPT 822

Query: 241  TYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQ 300
             YKQAM   D DQW+KAM+ EMESMY N VWTLVD P+DVKPIGCKWIYK+KRD    V 
Sbjct: 823  NYKQAMVGPDSDQWLKAMNSEMESMYDNKVWTLVDLPSDVKPIGCKWIYKKKRDQDSNVT 882

Query: 301  TFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNL 360
             FKARLVAKG+T+   + YEETFSPVAMLKSIRI+L+IA F+DYEIWQMDVKT F+NGNL
Sbjct: 883  VFKARLVAKGFTRSLSLSYEETFSPVAMLKSIRIILAIAAFFDYEIWQMDVKTAFLNGNL 942

Query: 361  EESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCV 420
            EESIYM Q +GF+ QDQEQK CKL+ SIYGLKQASRSWN+RFD  IK++GF QNVDE CV
Sbjct: 943  EESIYMIQPEGFVAQDQEQKACKLQGSIYGLKQASRSWNIRFDEVIKAFGFIQNVDESCV 1002

Query: 421  YKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNR 480
            YKK+  S++AFL+LYVDDILLIGNDVEYL D+KKWL   F MKDLG+AQY+LGI+I R+R
Sbjct: 1003 YKKISGSVVAFLILYVDDILLIGNDVEYLEDVKKWLNTSFSMKDLGEAQYILGIRIYRDR 1062

Query: 481  KNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYAS 540
             NKT+ MSQ++YIDK+LSR+KMQ+SKKGLL + +GIHLSKEQCPKTPQEVEDMRNIPY+S
Sbjct: 1063 SNKTIGMSQSTYIDKVLSRFKMQDSKKGLLPFRHGIHLSKEQCPKTPQEVEDMRNIPYSS 1122

Query: 541  IVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMY 587
             +GSLMYA LCTRPD+CY++ +VSRYQSNPGRDHWT VKNILKYLRRT++  L+Y
Sbjct: 1123 AIGSLMYAMLCTRPDVCYALSIVSRYQSNPGRDHWTAVKNILKYLRRTRNMFLVY 1177

BLAST of CSPI06G15660 vs. TrEMBL
Match: O23864_9ORYZ (Polyprotein OS=Oryza australiensis PE=4 SV=1)

HSP 1 Score: 633.3 bits (1632), Expect = 3.1e-178
Identity = 317/595 (53.28%), Positives = 420/595 (70.59%), Query Frame = 1

Query: 5    FRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAY 64
            F ++L + GI  QL+ P TPQ NGVSERRNRTLLDMVRSMMS S +  SFWGYALETAA 
Sbjct: 575  FGNHLKDCGIVPQLTPPGTPQWNGVSERRNRTLLDMVRSMMSQSDLPLSFWGYALETAAL 634

Query: 65   ILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKE 124
             LN VPSKSV +TPYE+W G+  SL   +IWGC A+V      KL  +S  CF +GYPKE
Sbjct: 635  TLNRVPSKSVEKTPYEIWTGQPPSLSFLKIWGCEAYVKRLQSDKLTPKSDKCFVVGYPKE 694

Query: 125  SRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKT 184
            ++G  FY+ ++ K+FV+ +  FLE++ +       ++ L+E+ ++  +  S++T+   + 
Sbjct: 695  TKGYYFYNREQAKVFVARHGVFLEKEFLSRRVSGIRVHLEEVQETP-ETVSATTE--PQQ 754

Query: 185  RKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQ 244
                 + P      PRRS R    PDRY G  +  +++ D+  ++P TY++AM   D ++
Sbjct: 755  EDQSVAPPVVDTPAPRRSERSRRAPDRYTGAEQRDILLLDN--DEPKTYEEAMVGHDSNK 814

Query: 245  WIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQ 304
            W+ AM  E+ESMY N VW LVD P+ VK I CKW++K+K D  G V  +KARLVAKG+ Q
Sbjct: 815  WLGAMKSEIESMYDNQVWNLVDPPDGVKTIECKWLFKKKADMDGNVHIYKARLVAKGFKQ 874

Query: 305  REGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFI 364
             +GVDY+ETFSPVAMLKSIRI+L+IA ++DYEIWQMDVKT F+NGNL E +YM Q +GF+
Sbjct: 875  IQGVDYDETFSPVAMLKSIRIILAIAAYFDYEIWQMDVKTAFLNGNLSEDVYMIQPQGFV 934

Query: 365  EQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLV 424
            + +   K+CKL+KSIYGLKQASRSWN+RFD  IK +GF +N +E CVYKKV  S I FL+
Sbjct: 935  DPESPGKICKLQKSIYGLKQASRSWNIRFDEVIKGFGFIKNEEEACVYKKVSGSAIVFLI 994

Query: 425  LYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYI 484
            LYVDDILLIGND+  L  +K  L   F MKDLG+A Y+LGI+I R+R  + + +SQ++YI
Sbjct: 995  LYVDDILLIGNDIPMLESVKSSLKNSFSMKDLGEAAYILGIRIYRDRSKRLIGLSQSTYI 1054

Query: 485  DKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTR 544
            DK+L R+ M +SKKG L   +GI+LSK QCP+T  E   M  +PYAS +GS+MYA LCTR
Sbjct: 1055 DKVLKRFNMHDSKKGFLPMSHGINLSKNQCPQTHDERNKMGMVPYASAIGSIMYAMLCTR 1114

Query: 545  PDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKMLESLHHD 600
            PD+ Y++   SRYQS+PG  HWT VKNILKYLRRTKD  L+Y  ++ ++ S + D
Sbjct: 1115 PDVSYALSATSRYQSDPGEGHWTAVKNILKYLRRTKDMFLVYGGEEDLVVSGYTD 1164

BLAST of CSPI06G15660 vs. TrEMBL
Match: O81506_ARATH (Putative retrotransposon protein OS=Arabidopsis thaliana GN=T7M24.7 PE=4 SV=1)

HSP 1 Score: 632.1 bits (1629), Expect = 6.9e-178
Identity = 317/595 (53.28%), Positives = 412/595 (69.24%), Query Frame = 1

Query: 5   FRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAY 64
           F D+L E GI SQL+ P TPQ NGVSERRNRTLLDMVRSMMS + +   FWGYALET+A+
Sbjct: 219 FSDHLRECGIVSQLTPPGTPQWNGVSERRNRTLLDMVRSMMSHTDLPSPFWGYALETSAF 278

Query: 65  ILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKE 124
           +LN  PSKSV +TPYE+W G+  +L   +IWGC ++       KL  +S  C+F+GYPKE
Sbjct: 279 MLNRCPSKSVEKTPYEIWTGKVPNLSFLKIWGCESYAKRLITDKLGPKSDKCYFVGYPKE 338

Query: 125 SRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKT 184
           ++G  FY P +NK+FV  N  FLE + +      SK++L+E+ +   D P+S  +     
Sbjct: 339 TKGYYFYHPTDNKVFVVRNGAFLEREFLSKGTSGSKVLLEEVREPQGDVPTSQEEHQLDL 398

Query: 185 RKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQ 244
           R+  +  P     E RRS R  H+PDR+   +     +     ++P +Y++A+   D D+
Sbjct: 399 RRVVE--PILVEPEVRRSERSRHEPDRFRDWVMDDHALFMIESDEPTSYEEALMGPDSDK 458

Query: 245 WIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQ 304
           W++A   EMESM  N VWTLVD P+ VKPI CKWI+K+K D  G +Q +KA LVAKGY Q
Sbjct: 459 WLEAAKSEMESMSQNKVWTLVDLPDGVKPIECKWIFKKKIDMDGNIQIYKAGLVAKGYKQ 518

Query: 305 REGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFI 364
             G+DY+ET+SPVAMLKSIRILL+ A  YDYEIWQMDVKT F+NGNLEE +YM Q +GF 
Sbjct: 519 VHGIDYDETYSPVAMLKSIRILLATAAHYDYEIWQMDVKTAFLNGNLEEHVYMTQPEGFT 578

Query: 365 EQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLV 424
             +  +KVCKL +SIYGLKQASRSWN+RF+ AIK + F +N +EPCVYKK   S +AFLV
Sbjct: 579 VPEAARKVCKLHRSIYGLKQASRSWNLRFNEAIKEFDFIRNEEEPCVYKKTSGSAVAFLV 638

Query: 425 LYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYI 484
           LYVDDILL+GND+  L  +K WL   F MKD+G+A Y+LGI+I R+R NK + +SQ +YI
Sbjct: 639 LYVDDILLLGNDIPLLQSVKTWLGSCFSMKDMGEAAYILGIRIYRDRLNKIIGLSQDTYI 698

Query: 485 DKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTR 544
           DK+L R+ M +SKKG +   +GI LSK QCP T  E E M  IPYAS +GS+MYA L TR
Sbjct: 699 DKVLHRFNMHDSKKGFIPMSHGITLSKTQCPSTHDERERMSKIPYASAIGSIMYAMLYTR 758

Query: 545 PDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKMLESLHHD 600
           PD+  ++ M SRYQS+PG  HW  V+NI KYLRRTKD  L+Y   ++++ S + D
Sbjct: 759 PDVACALSMTSRYQSDPGESHWIVVRNIFKYLRRTKDKFLVYGGSEELVVSGYTD 811

BLAST of CSPI06G15660 vs. TrEMBL
Match: A5AUE7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_021035 PE=4 SV=1)

HSP 1 Score: 623.6 bits (1607), Expect = 2.5e-175
Identity = 309/470 (65.74%), Positives = 375/470 (79.79%), Query Frame = 1

Query: 121 YPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKV 180
           YPK +RGGLFY  QENK+FVSTNATFLE +++ D +P SK+VL+E+    I    + T V
Sbjct: 39  YPKGTRGGLFYSAQENKVFVSTNATFLEYNYMADFKPISKVVLEELLADEISP--TPTTV 98

Query: 181 VDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDV 240
           V++ RK   +        PRRSGR +  P RY    E QV + D   +DPLT+K AM DV
Sbjct: 99  VERQRKETTAQDLTP-PPPRRSGREIRLPIRYRENGEAQVAVTDGSDDDPLTFKMAMDDV 158

Query: 241 DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAK 300
           DR++W +AM LE+ESMY NSVW LVD P  +KPIGCKWIYK KR   GKV+TFKARLVAK
Sbjct: 159 DREKWQEAMKLEIESMYSNSVWKLVDLPEGIKPIGCKWIYKXKRGPNGKVETFKARLVAK 218

Query: 301 GYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQT 360
           G+TQ+EGVDYE+TFSPV MLKSIRILLSI  +YDYEIWQMDVKT F+NG+LEE+IYM Q 
Sbjct: 219 GFTQKEGVDYEDTFSPVXMLKSIRILLSIXAYYDYEIWQMDVKTXFLNGHLEETIYMVQP 278

Query: 361 KGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKKVVNSII 420
           +GF+ +DQEQKVCKL++SIYGLKQASRSWN+ F+ AIKSYGFEQN+ EPCVYK++    +
Sbjct: 279 EGFVVKDQEQKVCKLQRSIYGLKQASRSWNIIFNEAIKSYGFEQNLGEPCVYKQIGGDKV 338

Query: 421 AFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQ 480
            FLVLYVDDILLIGNDVE L  +K WLA QFQMKDLG+A Y+LGIQ+ R+RKN+ LA+SQ
Sbjct: 339 VFLVLYVDDILLIGNDVESLSKVKNWLASQFQMKDLGEASYILGIQMTRDRKNRLLALSQ 398

Query: 481 ASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAK 540
           A+YIDK+L ++ M+NSKKG L   +G+HLSKEQCPKTPQ+ E MR +PYAS VGSLMYA 
Sbjct: 399 AAYIDKVLVKFAMENSKKGNLPSRHGVHLSKEQCPKTPQDEEKMRRVPYASAVGSLMYAM 458

Query: 541 LCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKD 591
           LCTRPDIC++VG+VSRYQSNPG DHW  VK+ILKYLRRT++YML+Y  ++
Sbjct: 459 LCTRPDICFAVGVVSRYQSNPGLDHWVAVKHILKYLRRTRNYMLVYSGRE 505

BLAST of CSPI06G15660 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 220.7 bits (561), Expect = 2.4e-57
Identity = 124/369 (33.60%), Positives = 207/369 (56.10%), Query Frame = 1

Query: 228 EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHA 287
           ++P TY +A + +    W  AMD E+ +M     W +   P + KPIGCKW+YK K +  
Sbjct: 84  KEPSTYNEAKEFL---VWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSD 143

Query: 288 GKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFM 347
           G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L+I+  Y++ + Q+D+   F+
Sbjct: 144 GTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFL 203

Query: 348 NGNLEESIYMCQTKGFIEQDQE----QKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFE 407
           NG+L+E IYM    G+  +  +      VC LKKSIYGLKQASR W ++F   +  +GF 
Sbjct: 204 NGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFV 263

Query: 408 QNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVL 467
           Q+  +   + K+  ++   +++YVDDI++  N+   + ++K  L   F+++DLG  +Y L
Sbjct: 264 QSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFL 323

Query: 468 GIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVED 527
           G++I R+     + + Q  Y   +L    +   K   +     +  S      +  +  D
Sbjct: 324 GLEIARSAAG--INICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAH----SGGDFVD 383

Query: 528 MRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYM 587
            +   Y  ++G LMY ++ TR DI ++V  +S++   P   H   V  IL Y++ T    
Sbjct: 384 AK--AYRRLIGRLMYLQI-TRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQG 440

Query: 588 LMYRTKDKM 593
           L Y ++ +M
Sbjct: 444 LFYSSQAEM 440

BLAST of CSPI06G15660 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 82.0 bits (201), Expect = 1.4e-15
Identity = 40/103 (38.83%), Positives = 62/103 (60.19%), Query Frame = 1

Query: 228 EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHA 287
           ++P +   A+KD     W +AM  E++++  N  W LV  P +   +GCKW++K K    
Sbjct: 26  KEPKSVIFALKDPG---WCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSD 85

Query: 288 GKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA 331
           G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L++A
Sbjct: 86  GTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CSPI06G15660 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 71.2 bits (173), Expect = 2.4e-12
Identity = 57/171 (33.33%), Positives = 85/171 (49.71%), Query Frame = 1

Query: 422 FLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQA 481
           +L+LYVDDILL G+    L  +   L+  F MKDLG   Y LGIQI  +     L +SQ 
Sbjct: 2   YLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSG--LFLSQT 61

Query: 482 SYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKL 541
            Y +++L+   M + K   +S    + L+         +  D R     SIVG+L Y  L
Sbjct: 62  KYAEQILNNAGMLDCKP--MSTPLPLKLNSSVSTAKYPDPSDFR-----SIVGALQYLTL 121

Query: 542 CTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKM 593
            TRPDI Y+V +V +    P    +  +K +L+Y++ T  + L      K+
Sbjct: 122 -TRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKL 162

BLAST of CSPI06G15660 vs. NCBI nr
Match: gi|299474487|gb|ADJ18449.1| (gag/pol protein [Bryonia dioica])

HSP 1 Score: 1060.8 bits (2742), Expect = 8.7e-307
Identity = 513/593 (86.51%), Positives = 556/593 (93.76%), Query Frame = 1

Query: 1    MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALE 60
            MD +F+DYLIE GIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMS++Q+ DSFWGYALE
Sbjct: 576  MDSKFQDYLIEFGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSYAQLPDSFWGYALE 635

Query: 61   TAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIG 120
            TA +ILNNVPSKSV ETPYELWKGRK SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+G
Sbjct: 636  TAIHILNNVPSKSVLETPYELWKGRKSSLRYFRIWGCPAHVLVQNPKKLEPRSKLCLFVG 695

Query: 121  YPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKV 180
            YPKESRGGLFY PQENK+FVSTNATFLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKV
Sbjct: 696  YPKESRGGLFYHPQENKVFVSTNATFLEEDHXRNHQPRSKIVLKEMFKNATDKPSSSTKV 755

Query: 181  VDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDV 240
            VDK   S QSH SQ+LR PRRSGRVVHQP+RYLGL+ETQ++IPDDG+EDPLTYKQAM DV
Sbjct: 756  VDKANISDQSHTSQELRVPRRSGRVVHQPNRYLGLVETQIIIPDDGVEDPLTYKQAMNDV 815

Query: 241  DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAK 300
            DRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAK
Sbjct: 816  DRDQWIKAMNLEMESMYFNSVWTLVDLPSDVKPIGCKWIYKRKRDQAGKVQTFKARLVAK 875

Query: 301  GYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQT 360
            GYTQ+EGVDYEETFSPVAMLKSIRILLSIATFY+YEIWQMDVKT F+NGNLEESIYM Q 
Sbjct: 876  GYTQKEGVDYEETFSPVAMLKSIRILLSIATFYNYEIWQMDVKTAFLNGNLEESIYMVQP 935

Query: 361  KGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKKVVNSII 420
            +GFI QDQEQKVCKL+KSIYGLKQASRSWN+RFDTAIKSYGFEQNVDEPCVYKK+VNS++
Sbjct: 936  EGFIAQDQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKIVNSVV 995

Query: 421  AFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQ 480
            AFL+LYVDDILLIGNDVEYL D+KKWL  QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQ
Sbjct: 996  AFLILYVDDILLIGNDVEYLTDVKKWLNTQFQMKDLGEAQYILGIQIVRNRKNKTLAMSQ 1055

Query: 481  ASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAK 540
            ASYIDK+LSRYKMQNSKKG L + +GIHLSKEQCPKTPQEVEDMRNIPY+S VGSLMYA 
Sbjct: 1056 ASYIDKVLSRYKMQNSKKGQLPFRHGIHLSKEQCPKTPQEVEDMRNIPYSSAVGSLMYAM 1115

Query: 541  LCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML 594
            LCTRPDICYSVG+VSRYQSNPGRDHWT VKNILKYLRRT++YML+Y  KD +L
Sbjct: 1116 LCTRPDICYSVGIVSRYQSNPGRDHWTAVKNILKYLRRTRNYMLVYGAKDLIL 1168
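
The percentages on each HSP statistics line are the matched counts divided by the alignment length. The following is a minimal sketch, assuming the line layout shown in these reports, that recomputes both values from the raw counts. The helper name `hsp_percentages` and the regular expression are illustrative only and are not part of any BLAST toolkit.

```python
import re

# Hypothetical pattern for one HSP statistics line as printed in this report.
# "Posi?tives" also tolerates the label being spelled without the "i".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return (identity_pct, positives_pct) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    # Each percentage is matches / alignment length, rounded to two decimals.
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct

# Statistics line of the Bryonia dioica hit above.
line = "Identity = 513/593 (86.51%), Positives = 556/593 (93.76%), Query Frame = 1"
print(hsp_percentages(line))
```

Recomputing the values this way is a quick consistency check when counts and percentages are copied between the alignment blocks and the summary tables.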

BLAST of CSPI06G15660 vs. NCBI nr
Match: gi|1019597807|gb|AMY96445.1| (gag/pol protein [Momordica dioica])

HSP 1 Score: 838.6 bits (2165), Expect = 7.0e-240
Identity = 415/595 (69.75%), Positives = 480/595 (80.67%), Query Frame = 1

Query: 1    MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALE 60
            M   F D+L E GI SQLSAP TPQ NGVSERRNRTLLDMVRSMMS++ + DSFWGYA E
Sbjct: 583  MSSEFGDHLREFGIVSQLSAPGTPQCNGVSERRNRTLLDMVRSMMSYADLPDSFWGYARE 642

Query: 61   TAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIG 120
                ILN VPSKSV ETPYELW GRK SL   +IWGCPAHV    PKKLE RS+ C F+G
Sbjct: 643  RERAILNRVPSKSVEETPYELWYGRKSSLSFLKIWGCPAHVKKLQPKKLEPRSEKCLFVG 702

Query: 121  YPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAI-----DKPS 180
            YPKE+RG  FY PQENK+FV+TN  FLE++ +  HQP SK+VLK + +  I     DKPS
Sbjct: 703  YPKETRGYYFYHPQENKVFVATNEAFLEKEFLSRHQPGSKIVLKAVVEPLIPLDGTDKPS 762

Query: 181  SSTKVV-DKTR-KSGQSHP--SQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPL 240
            SSTKVV DK      QSH    Q+LR PRRSGR    P+RYLGL+ETQ++I D+G EDP 
Sbjct: 763  SSTKVVVDKAEVNDDQSHTPDQQELRVPRRSGRSRRAPNRYLGLVETQIMILDNGEEDPT 822

Query: 241  TYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQ 300
             YKQAM   D DQW+KAM+ EMESMY N VWTLVD P+DVKPIGCKWIYK+KRD    V 
Sbjct: 823  NYKQAMVGPDSDQWLKAMNSEMESMYDNKVWTLVDLPSDVKPIGCKWIYKKKRDQDSNVT 882

Query: 301  TFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNL 360
             FKARLVAKG+T+   + YEETFSPVAMLKSIRI+L+IA F+DYEIWQMDVKT F+NGNL
Sbjct: 883  VFKARLVAKGFTRSLSLSYEETFSPVAMLKSIRIILAIAAFFDYEIWQMDVKTAFLNGNL 942

Query: 361  EESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCV 420
            EESIYM Q +GF+ QDQEQK CKL+ SIYGLKQASRSWN+RFD  IK++GF QNVDE CV
Sbjct: 943  EESIYMIQPEGFVAQDQEQKACKLQGSIYGLKQASRSWNIRFDEVIKAFGFIQNVDESCV 1002

Query: 421  YKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNR 480
            YKK+  S++AFL+LYVDDILLIGNDVEYL D+KKWL   F MKDLG+AQY+LGI+I R+R
Sbjct: 1003 YKKISGSVVAFLILYVDDILLIGNDVEYLEDVKKWLNTSFSMKDLGEAQYILGIRIYRDR 1062

Query: 481  KNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYAS 540
             NKT+ MSQ++YIDK+LSR+KMQ+SKKGLL + +GIHLSKEQCPKTPQEVEDMRNIPY+S
Sbjct: 1063 SNKTIGMSQSTYIDKVLSRFKMQDSKKGLLPFRHGIHLSKEQCPKTPQEVEDMRNIPYSS 1122

Query: 541  IVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMY 587
             +GSLMYA LCTRPD+CY++ +VSRYQSNPGRDHWT VKNILKYLRRT++  L+Y
Sbjct: 1123 AIGSLMYAMLCTRPDVCYALSIVSRYQSNPGRDHWTAVKNILKYLRRTRNMFLVY 1177

BLAST of CSPI06G15660 vs. NCBI nr
Match: gi|2443320|dbj|BAA22288.1| (polyprotein [Oryza australiensis])

HSP 1 Score: 633.3 bits (1632), Expect = 4.5e-178
Identity = 317/595 (53.28%), Positives = 420/595 (70.59%), Query Frame = 1

Query: 5    FRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAY 64
            F ++L + GI  QL+ P TPQ NGVSERRNRTLLDMVRSMMS S +  SFWGYALETAA 
Sbjct: 575  FGNHLKDCGIVPQLTPPGTPQWNGVSERRNRTLLDMVRSMMSQSDLPLSFWGYALETAAL 634

Query: 65   ILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKE 124
             LN VPSKSV +TPYE+W G+  SL   +IWGC A+V      KL  +S  CF +GYPKE
Sbjct: 635  TLNRVPSKSVEKTPYEIWTGQPPSLSFLKIWGCEAYVKRLQSDKLTPKSDKCFVVGYPKE 694

Query: 125  SRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKT 184
            ++G  FY+ ++ K+FV+ +  FLE++ +       ++ L+E+ ++  +  S++T+   + 
Sbjct: 695  TKGYYFYNREQAKVFVARHGVFLEKEFLSRRVSGIRVHLEEVQETP-ETVSATTE--PQQ 754

Query: 185  RKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQ 244
                 + P      PRRS R    PDRY G  +  +++ D+  ++P TY++AM   D ++
Sbjct: 755  EDQSVAPPVVDTPAPRRSERSRRAPDRYTGAEQRDILLLDN--DEPKTYEEAMVGHDSNK 814

Query: 245  WIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQ 304
            W+ AM  E+ESMY N VW LVD P+ VK I CKW++K+K D  G V  +KARLVAKG+ Q
Sbjct: 815  WLGAMKSEIESMYDNQVWNLVDPPDGVKTIECKWLFKKKADMDGNVHIYKARLVAKGFKQ 874

Query: 305  REGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFI 364
             +GVDY+ETFSPVAMLKSIRI+L+IA ++DYEIWQMDVKT F+NGNL E +YM Q +GF+
Sbjct: 875  IQGVDYDETFSPVAMLKSIRIILAIAAYFDYEIWQMDVKTAFLNGNLSEDVYMIQPQGFV 934

Query: 365  EQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLV 424
            + +   K+CKL+KSIYGLKQASRSWN+RFD  IK +GF +N +E CVYKKV  S I FL+
Sbjct: 935  DPESPGKICKLQKSIYGLKQASRSWNIRFDEVIKGFGFIKNEEEACVYKKVSGSAIVFLI 994

Query: 425  LYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYI 484
            LYVDDILLIGND+  L  +K  L   F MKDLG+A Y+LGI+I R+R  + + +SQ++YI
Sbjct: 995  LYVDDILLIGNDIPMLESVKSSLKNSFSMKDLGEAAYILGIRIYRDRSKRLIGLSQSTYI 1054

Query: 485  DKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTR 544
            DK+L R+ M +SKKG L   +GI+LSK QCP+T  E   M  +PYAS +GS+MYA LCTR
Sbjct: 1055 DKVLKRFNMHDSKKGFLPMSHGINLSKNQCPQTHDERNKMGMVPYASAIGSIMYAMLCTR 1114

Query: 545  PDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKMLESLHHD 600
            PD+ Y++   SRYQS+PG  HWT VKNILKYLRRTKD  L+Y  ++ ++ S + D
Sbjct: 1115 PDVSYALSATSRYQSDPGEGHWTAVKNILKYLRRTKDMFLVYGGEEDLVVSGYTD 1164

BLAST of CSPI06G15660 vs. NCBI nr
Match: gi|3319362|gb|AAC26250.1| (contains similarity to reverse transcriptase (Pfam: rvt.hmm, score 19.29) [Arabidopsis thaliana])

HSP 1 Score: 632.1 bits (1629), Expect = 1.0e-177
Identity = 317/595 (53.28%), Positives = 412/595 (69.24%), Query Frame = 1

Query: 5   FRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAY 64
           F D+L E GI SQL+ P TPQ NGVSERRNRTLLDMVRSMMS + +   FWGYALET+A+
Sbjct: 219 FSDHLRECGIVSQLTPPGTPQWNGVSERRNRTLLDMVRSMMSHTDLPSPFWGYALETSAF 278

Query: 65  ILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKE 124
           +LN  PSKSV +TPYE+W G+  +L   +IWGC ++       KL  +S  C+F+GYPKE
Sbjct: 279 MLNRCPSKSVEKTPYEIWTGKVPNLSFLKIWGCESYAKRLITDKLGPKSDKCYFVGYPKE 338

Query: 125 SRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKT 184
           ++G  FY P +NK+FV  N  FLE + +      SK++L+E+ +   D P+S  +     
Sbjct: 339 TKGYYFYHPTDNKVFVVRNGAFLEREFLSKGTSGSKVLLEEVREPQGDVPTSQEEHQLDL 398

Query: 185 RKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQ 244
           R+  +  P     E RRS R  H+PDR+   +     +     ++P +Y++A+   D D+
Sbjct: 399 RRVVE--PILVEPEVRRSERSRHEPDRFRDWVMDDHALFMIESDEPTSYEEALMGPDSDK 458

Query: 245 WIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQ 304
           W++A   EMESM  N VWTLVD P+ VKPI CKWI+K+K D  G +Q +KA LVAKGY Q
Sbjct: 459 WLEAAKSEMESMSQNKVWTLVDLPDGVKPIECKWIFKKKIDMDGNIQIYKAGLVAKGYKQ 518

Query: 305 REGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFI 364
             G+DY+ET+SPVAMLKSIRILL+ A  YDYEIWQMDVKT F+NGNLEE +YM Q +GF 
Sbjct: 519 VHGIDYDETYSPVAMLKSIRILLATAAHYDYEIWQMDVKTAFLNGNLEEHVYMTQPEGFT 578

Query: 365 EQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLV 424
             +  +KVCKL +SIYGLKQASRSWN+RF+ AIK + F +N +EPCVYKK   S +AFLV
Sbjct: 579 VPEAARKVCKLHRSIYGLKQASRSWNLRFNEAIKEFDFIRNEEEPCVYKKTSGSAVAFLV 638

Query: 425 LYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYI 484
           LYVDDILL+GND+  L  +K WL   F MKD+G+A Y+LGI+I R+R NK + +SQ +YI
Sbjct: 639 LYVDDILLLGNDIPLLQSVKTWLGSCFSMKDMGEAAYILGIRIYRDRLNKIIGLSQDTYI 698

Query: 485 DKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTR 544
           DK+L R+ M +SKKG +   +GI LSK QCP T  E E M  IPYAS +GS+MYA L TR
Sbjct: 699 DKVLHRFNMHDSKKGFIPMSHGITLSKTQCPSTHDERERMSKIPYASAIGSIMYAMLYTR 758

Query: 545 PDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKMLESLHHD 600
           PD+  ++ M SRYQS+PG  HW  V+NI KYLRRTKD  L+Y   ++++ S + D
Sbjct: 759 PDVACALSMTSRYQSDPGESHWIVVRNIFKYLRRTKDKFLVYGGSEELVVSGYTD 811

BLAST of CSPI06G15660 vs. NCBI nr
Match: gi|147768021|emb|CAN69397.1| (hypothetical protein VITISV_021035 [Vitis vinifera])

HSP 1 Score: 623.6 bits (1607), Expect = 3.5e-175
Identity = 309/470 (65.74%), Positives = 375/470 (79.79%), Query Frame = 1

Query: 121 YPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKV 180
           YPK +RGGLFY  QENK+FVSTNATFLE +++ D +P SK+VL+E+    I    + T V
Sbjct: 39  YPKGTRGGLFYSAQENKVFVSTNATFLEYNYMADFKPISKVVLEELLADEISP--TPTTV 98

Query: 181 VDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDV 240
           V++ RK   +        PRRSGR +  P RY    E QV + D   +DPLT+K AM DV
Sbjct: 99  VERQRKETTAQDLTP-PPPRRSGREIRLPIRYRENGEAQVAVTDGSDDDPLTFKMAMDDV 158

Query: 241 DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAK 300
           DR++W +AM LE+ESMY NSVW LVD P  +KPIGCKWIYK KR   GKV+TFKARLVAK
Sbjct: 159 DREKWQEAMKLEIESMYSNSVWKLVDLPEGIKPIGCKWIYKXKRGPNGKVETFKARLVAK 218

Query: 301 GYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQT 360
           G+TQ+EGVDYE+TFSPV MLKSIRILLSI  +YDYEIWQMDVKT F+NG+LEE+IYM Q 
Sbjct: 219 GFTQKEGVDYEDTFSPVXMLKSIRILLSIXAYYDYEIWQMDVKTXFLNGHLEETIYMVQP 278

Query: 361 KGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKKVVNSII 420
           +GF+ +DQEQKVCKL++SIYGLKQASRSWN+ F+ AIKSYGFEQN+ EPCVYK++    +
Sbjct: 279 EGFVVKDQEQKVCKLQRSIYGLKQASRSWNIIFNEAIKSYGFEQNLGEPCVYKQIGGDKV 338

Query: 421 AFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQ 480
            FLVLYVDDILLIGNDVE L  +K WLA QFQMKDLG+A Y+LGIQ+ R+RKN+ LA+SQ
Sbjct: 339 VFLVLYVDDILLIGNDVESLSKVKNWLASQFQMKDLGEASYILGIQMTRDRKNRLLALSQ 398

Query: 481 ASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAK 540
           A+YIDK+L ++ M+NSKKG L   +G+HLSKEQCPKTPQ+ E MR +PYAS VGSLMYA 
Sbjct: 399 AAYIDKVLVKFAMENSKKGNLPSRHGVHLSKEQCPKTPQDEEKMRRVPYASAVGSLMYAM 458

Query: 541 LCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKD 591
           LCTRPDIC++VG+VSRYQSNPG DHW  VK+ILKYLRRT++YML+Y  ++
Sbjct: 459 LCTRPDICFAVGVVSRYQSNPGLDHWVAVKHILKYLRRTRNYMLVYSGRE 505

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
POLX_TOBAC   3.5e-122  39.23     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
COPIA_DROME  3.4e-61   30.43     Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3
YCH4_YEAST   6.1e-26   30.00     Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT...
YH11B_YEAST  4.4e-16   22.67     Transposon Ty1-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
YG13B_YEAST  7.5e-16   22.66     Transposon Ty1-GR3 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ...

Match Name        E-value   Identity  Description
E2GK51_BRYDI      6.1e-307  86.51     Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1
A0A165U314_9ROSI  4.9e-240  69.75     Gag/pol protein OS=Momordica dioica PE=4 SV=1
O23864_9ORYZ      3.1e-178  53.28     Polyprotein OS=Oryza australiensis PE=4 SV=1
O81506_ARATH      6.9e-178  53.28     Putative retrotransposon protein OS=Arabidopsis thaliana GN=T7M24.7 PE=4 SV=1
A5AUE7_VITVI      2.5e-175  65.74     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_021035 PE=4 SV=1

Match Name   E-value   Identity  Description
AT4G23160.1  2.4e-57   33.60     cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00820.1  1.4e-15   38.83     ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)
ATMG00810.1  2.4e-12   33.33     ATMG00810.1 DNA/RNA polymerases superfamily protein

Match Name                    E-value   Identity  Description
gi|299474487|gb|ADJ18449.1|   8.7e-307  86.51     gag/pol protein [Bryonia dioica]
gi|1019597807|gb|AMY96445.1|  7.0e-240  69.75     gag/pol protein [Momordica dioica]
gi|2443320|dbj|BAA22288.1|    4.5e-178  53.28     polyprotein [Oryza australiensis]
gi|3319362|gb|AAC26250.1|     1.0e-177  53.28     contains similarity to reverse transcriptase (Pfam: rvt.hmm, score 19.29) [Arabi...
gi|147768021|emb|CAN69397.1|  3.5e-175  65.74     hypothetical protein VITISV_021035 [Vitis vinifera]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001584  Integrase_cat-core
IPR012337  RNaseH-like_sf
IPR013103  RVT_2
Vocabulary: Biological Process
Term        Definition
GO:0015074  DNA integration
Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
biological_process  GO:0006310      DNA recombination
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003677      DNA binding
molecular_function  GO:0008270      zinc ion binding
molecular_function  GO:0003676      nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI06G15660.1  CSPI06G15660.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                                      Source   Source Term        Source Description               Alignment
IPR001584  Integrase, catalytic core                            PROFILE  PS50994            INTEGRASE                        coord: 1..86, score: 14
IPR012337  Ribonuclease H-like domain                           GENE3D   G3DSA:3.30.420.10                                   coord: 4..89, score: 1.3
IPR012337  Ribonuclease H-like domain                           unknown  SSF53098           Ribonuclease H-like              coord: 4..94, score: 8.04
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase  PFAM     PF07727            RVT_2                            coord: 259..498, score: 4.9
None       No IPR available                                     PANTHER  PTHR11439          GAG-POL-RELATED RETROTRANSPOSON  coord: 5..210, score: 2.8E-242; coord: 228..594, score: 2.8E
None       No IPR available                                     PANTHER  PTHR11439:SF192    SUBFAMILY NOT NAMED              coord: 5..210, score: 2.8E-242; coord: 228..594, score: 2.8E
None       No IPR available                                     unknown  SSF56672           DNA/RNA polymerases              coord: 512..590, score: 9.78E-13; coord: 259..483, score: 9.78

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None