CSPI04G20730 (gene) Wild cucumber (PI 183967)

NameCSPI04G20730
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionLINE-1 reverse transcriptase like
LocationChr4 : 18992295 .. 18993560 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAAAGCTAAAAGGACTGAAAGCCATCCTAAAGAGTTGGAATAAGGAGACTTTTGGTAAGATTTTCTCCCAAAAACAGGTGCTGATTGATAAGATTAACTATCTTGACTCACTTGAAGAGTCAAGTTGTCTCAACGAGGAAAATGTGAAGGAAAGAGAAAATTGTAGAGGGGCTCTGCTTGATTTGATTGTGAAAGAGCAAAAGTTGTGGATTCAGAAGTCGAAGCTTCATTGGCTTAGAGAGGGGGAGGAGAACTCAAGCTTCTTCCACATATGGGTTTCGGCTCGTAAAAGTAAAAGTATTCTTTCTTCCTTGGTTAGTATCGAAGGGAAGACTCTTGTCACAGAGAAGGAAATTGTGGATGAGATCCTTAGTTTCTTTTCAAATTTATATGGCACAAGGATCTCCTCGCCGTTTATTTGTGACATTCTTAATTGGAGAGGCCTTAGCTTACAGGATTCGAGTTTACTTGAGGTTCCCTTTACCGAAAAAGAAATTAGAGAAGTTGTATTTGAGATGGGTTGTCTCAAGTCCCCTGGCCCTGATGGCTTGACTGGAGAGTTTTATAAAAAGTCATGGAACATTTTGAAGTCCGACCTCGTAAGGGTGTTCCAAGATTTTTTTAAAAACGGAATTATTAACAGAAGATGTAATGAGACTTATATTTATCTCATCCCCAAAAAGAAAGAGGCGGCCCGTGTCAGTGACTTCAGACCCATTAGCTTAATTACCTCCTTGTATAAAGTTATCTCCAAGGTGCTTCCAACAAGACTTAAAAAAGTTCTTCCTTCGATAATTAATGATTCTCAAATGGCTTTTGTGGAAGGAAGGCAAATCCTTGATGCTATTCTAACTGCTTCTGAGGCTGTTGACGAATGGTCTTTAAGAGGCAGAAAAGGTGTTCTTTTAAAGCTCGATTTGGAGAAAGCTTATGATAAGGTGGATTGGTCTTTTCTTGATATGGCCATGAAACTTAAAGGCTTTGGTAAGAGATGTAGGAAGTGGATATGGGGATGCTTGTCGACAACTAATTTTTCCATAATTGTCAACGGCAGGCCTAGAGGAAAGATTATTGCTAAAAGGGGCATTCGTCAAGGTGATCCTCTTGCCCCTTTTCTTTTTACGATAGTGGGAGATGCTCCAAGTTGCCTTATTCACTACTGTAATGAGAAAAGGAGTTTAAAAGGCTTTCATTTTGAGAACCTGTCAGAGGATTTAACCCATCTTCAGTATGCAGACGACACTCTTCTTTCTTCTTCCTAG

mRNA sequence

ATGGAAAAGCTAAAAGGACTGAAAGCCATCCTAAAGAGTTGGAATAAGGAGACTTTTGGTAAGATTTTCTCCCAAAAACAGGTGCTGATTGATAAGATTAACTATCTTGACTCACTTGAAGAGTCAAGTTGTCTCAACGAGGAAAATGTGAAGGAAAGAGAAAATTGTAGAGGGGCTCTGCTTGATTTGATTGTGAAAGAGCAAAAGTTGTGGATTCAGAAGTCGAAGCTTCATTGGCTTAGAGAGGGGGAGGAGAACTCAAGCTTCTTCCACATATGGGTTTCGGCTCGTAAAAGTAAAAGTATTCTTTCTTCCTTGGTTAGTATCGAAGGGAAGACTCTTGTCACAGAGAAGGAAATTGTGGATGAGATCCTTAGTTTCTTTTCAAATTTATATGGCACAAGGATCTCCTCGCCGTTTATTTGTGACATTCTTAATTGGAGAGGCCTTAGCTTACAGGATTCGAGTTTACTTGAGGTTCCCTTTACCGAAAAAGAAATTAGAGAAGTTGTATTTGAGATGGGTTGTCTCAAGTCCCCTGGCCCTGATGGCTTGACTGGAGAGTTTTATAAAAAGTCATGGAACATTTTGAAGTCCGACCTCGTAAGGGTGTTCCAAGATTTTTTTAAAAACGGAATTATTAACAGAAGATGTAATGAGACTTATATTTATCTCATCCCCAAAAAGAAAGAGGCGGCCCGTGTCAGTGACTTCAGACCCATTAGCTTAATTACCTCCTTGTATAAAGTTATCTCCAAGGTGCTTCCAACAAGACTTAAAAAAGTTCTTCCTTCGATAATTAATGATTCTCAAATGGCTTTTGTGGAAGGAAGGCAAATCCTTGATGCTATTCTAACTGCTTCTGAGGCTGTTGACGAATGGTCTTTAAGAGGCAGAAAAGGTGTTCTTTTAAAGCTCGATTTGGAGAAAGCTTATGATAAGGTGGATTGGTCTTTTCTTGATATGGCCATGAAACTTAAAGGCTTTGGTAAGAGATGTAGGAAGTGGATATGGGGATGCTTGTCGACAACTAATTTTTCCATAATTGTCAACGGCAGGCCTAGAGGAAAGATTATTGCTAAAAGGGGCATTCGTCAAGGTGATCCTCTTGCCCCTTTTCTTTTTACGATAGTGGGAGATGCTCCAAGTTGCCTTATTCACTACTGTAATGAGAAAAGGAGTTTAAAAGGCTTTCATTTTGAGAACCTGTCAGAGGATTTAACCCATCTTCAGTATGCAGACGACACTCTTCTTTCTTCTTCCTAG

Coding sequence (CDS)

ATGGAAAAGCTAAAAGGACTGAAAGCCATCCTAAAGAGTTGGAATAAGGAGACTTTTGGTAAGATTTTCTCCCAAAAACAGGTGCTGATTGATAAGATTAACTATCTTGACTCACTTGAAGAGTCAAGTTGTCTCAACGAGGAAAATGTGAAGGAAAGAGAAAATTGTAGAGGGGCTCTGCTTGATTTGATTGTGAAAGAGCAAAAGTTGTGGATTCAGAAGTCGAAGCTTCATTGGCTTAGAGAGGGGGAGGAGAACTCAAGCTTCTTCCACATATGGGTTTCGGCTCGTAAAAGTAAAAGTATTCTTTCTTCCTTGGTTAGTATCGAAGGGAAGACTCTTGTCACAGAGAAGGAAATTGTGGATGAGATCCTTAGTTTCTTTTCAAATTTATATGGCACAAGGATCTCCTCGCCGTTTATTTGTGACATTCTTAATTGGAGAGGCCTTAGCTTACAGGATTCGAGTTTACTTGAGGTTCCCTTTACCGAAAAAGAAATTAGAGAAGTTGTATTTGAGATGGGTTGTCTCAAGTCCCCTGGCCCTGATGGCTTGACTGGAGAGTTTTATAAAAAGTCATGGAACATTTTGAAGTCCGACCTCGTAAGGGTGTTCCAAGATTTTTTTAAAAACGGAATTATTAACAGAAGATGTAATGAGACTTATATTTATCTCATCCCCAAAAAGAAAGAGGCGGCCCGTGTCAGTGACTTCAGACCCATTAGCTTAATTACCTCCTTGTATAAAGTTATCTCCAAGGTGCTTCCAACAAGACTTAAAAAAGTTCTTCCTTCGATAATTAATGATTCTCAAATGGCTTTTGTGGAAGGAAGGCAAATCCTTGATGCTATTCTAACTGCTTCTGAGGCTGTTGACGAATGGTCTTTAAGAGGCAGAAAAGGTGTTCTTTTAAAGCTCGATTTGGAGAAAGCTTATGATAAGGTGGATTGGTCTTTTCTTGATATGGCCATGAAACTTAAAGGCTTTGGTAAGAGATGTAGGAAGTGGATATGGGGATGCTTGTCGACAACTAATTTTTCCATAATTGTCAACGGCAGGCCTAGAGGAAAGATTATTGCTAAAAGGGGCATTCGTCAAGGTGATCCTCTTGCCCCTTTTCTTTTTACGATAGTGGGAGATGCTCCAAGTTGCCTTATTCACTACTGTAATGAGAAAAGGAGTTTAAAAGGCTTTCATTTTGAGAACCTGTCAGAGGATTTAACCCATCTTCAGTATGCAGACGACACTCTTCTTTCTTCTTCCTAG
BLAST of CSPI04G20730 vs. Swiss-Prot
Match: YTX2_XENLA (Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis PE=3 SV=1)

HSP 1 Score: 161.4 bits (407), Expect = 2.2e-38
Identity = 114/418 (27.27%), Postives = 202/418 (48.33%), Query Frame = 1

Query: 7   LKAILKSWNKETFGKIFSQKQVLIDKINYLD---SLEESSCLNEENVKERENCRGALLDL 66
           LK + + + K   G+  ++ + L  ++  L+   S  E   L  E ++ +E    AL ++
Sbjct: 293 LKLLCQEYTKSVSGQRNAEIEALNGEVLDLEQRLSGSEDQALQCEYLERKE----ALRNM 352

Query: 67  IVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEIVDE 126
             ++ +    +S++  L + +  S FF+     + ++  ++ L + +G  L   + I D 
Sbjct: 353 EQRQARGAFVRSRMQLLCDMDRGSRFFYALEKKKGNRKQITCLFAEDGTPLEDPEAIRDR 412

Query: 127 ILSFFSNLYGTRISSPFICDILNWRGL---SLQDSSLLEVPFTEKEIREVVFEMGCLKSP 186
             SF+ NL+     SP  C+ L W GL   S +    LE P T  E+ + +  M   KSP
Sbjct: 413 ARSFYQNLFSPDPISPDACEEL-WDGLPVVSERRKERLETPITLDELSQALRLMPHNKSP 472

Query: 187 GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 246
           G DGLT EF++  W+ L  D  RV  + FK G +   C    + L+PKK +   + ++RP
Sbjct: 473 GLDGLTIEFFQFFWDTLGPDFHRVLTEAFKKGELPLSCRRAVLSLLPKKGDLRLIKNWRP 532

Query: 247 ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 306
           +SL+++ YK+++K +  RLK VL  +I+  Q   V GR I D +    + +      G  
Sbjct: 533 VSLLSTDYKIVAKAISLRLKSVLAEVIHPDQSYTVPGRTIFDNVFLIRDLLHFARRTGLS 592

Query: 307 GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 366
              L LD EKA+D+VD  +L   ++   FG +   ++    ++    + +N      +  
Sbjct: 593 LAFLSLDQEKAFDRVDHQYLIGTLQAYSFGPQFVGYLKTMYASAECLVKINWSLTAPLAF 652

Query: 367 KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLL 419
            RG+RQG PL+  L+++  +   CL+     ++ L G   +     +    YADD +L
Sbjct: 653 GRGVRQGCPLSGQLYSLAIEPFLCLL-----RKRLTGLVLKEPDMRVVLSAYADDVIL 700

BLAST of CSPI04G20730 vs. Swiss-Prot
Match: LIN1_NYCCO (LINE-1 reverse transcriptase homolog OS=Nycticebus coucang PE=3 SV=1)

HSP 1 Score: 139.8 bits (351), Expect = 6.8e-32
Identity = 114/422 (27.01%), Postives = 189/422 (44.79%), Query Frame = 1

Query: 3   KLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLD 62
           K   L+A LK   +E    +    + L          EE S       KE    R  L +
Sbjct: 297 KFIALQAFLKKTEREEVNNLMGHLKQL--------EKEEHSNPKPSRRKEITKIRAELNE 356

Query: 63  LIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEIVD 122
           +  K     I KSK  +  +  +           ++ KS++SS+ +   +      EI  
Sbjct: 357 IENKRIIQQINKSKSWFFEKINKIDKPLANLTRKKRVKSLISSIRNGNDEITTDPSEIQK 416

Query: 123 EILSFFSNLYGTRISSPFICD----ILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLK 182
            +  ++  LY  +  +    D      +   LS ++  +L  P +  EI   +  +   K
Sbjct: 417 ILNEYYKKLYSHKYENLKEIDQYLEACHLPRLSQKEVEMLNRPISSSEIASTIQNLPKKK 476

Query: 183 SPGPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKK-KEAARVSD 242
           SPGPDG T EFY+     L   L+ +FQ+  K GI+     E  I LIPK  K+  R  +
Sbjct: 477 SPGPDGFTSEFYQTFKEELVPILLNLFQNIEKEGILPNTFYEANITLIPKPGKDPTRKEN 536

Query: 243 FRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWS-L 302
           +RPISL+    K+++K+L  R+++ +  II+  Q+ F+ G Q    I  +   +   + L
Sbjct: 537 YRPISLMNIDAKILNKILTNRIQQHIKKIIHHDQVGFIPGSQGWFNIRKSINVIQHINKL 596

Query: 303 RGRKGVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRG 362
           + +  ++L +D EKA+D +   F+   +K  G      K I    S    +II+NG    
Sbjct: 597 KNKDHMILSIDAEKAFDNIQHPFMIRTLKKIGIEGTFLKLIEAIYSKPTANIILNGVKLK 656

Query: 363 KIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDT 419
               + G RQG PL+P LF IV +  +  I    E++++KG H    SE++    +ADD 
Sbjct: 657 SFPLRSGTRQGCPLSPLLFNIVMEVLAIAI---REEKAIKGIHIG--SEEIKLSLFADDM 705

BLAST of CSPI04G20730 vs. Swiss-Prot
Match: LORF2_HUMAN (LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens PE=1 SV=1)

HSP 1 Score: 135.2 bits (339), Expect = 1.7e-30
Identity = 88/331 (26.59%), Postives = 158/331 (47.73%), Query Frame = 1

Query: 94  VSARKSKSILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISS----PFICDILNWRG 153
           +  ++ K+ + ++ + +G       EI   I  ++ +LY  ++ +        D      
Sbjct: 381 IKKKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPR 440

Query: 154 LSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSDLVRVFQDFF 213
           L+ ++   L  P T  EI  ++  +   KSPGPDG T EFY++    L   L+++FQ   
Sbjct: 441 LNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIE 500

Query: 214 KNGIINRRCNETYIYLIPKK-KEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIIN 273
           K GI+     E  I LIPK  ++  +  +FRPISL+    K+++K+L  R+++ +  +I+
Sbjct: 501 KEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIH 560

Query: 274 DSQMAFVEGRQILDAILTASEAVDEWS-LRGRKGVLLKLDLEKAYDKVDWSFLDMAMKLK 333
             Q+ F+ G Q    I  +   +   +  + +  V++ +D EKA+DK+   F+   +   
Sbjct: 561 HDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHVIISIDAEKAFDKIQQPFMLKTLNKL 620

Query: 334 GFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIH 393
           G      K I         +II+NG+       K G RQG PL+P LF IV +    L  
Sbjct: 621 GIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEV---LAR 680

Query: 394 YCNEKRSLKGFHFENLSEDLTHLQYADDTLL 419
              +++ +KG       E++    +ADD ++
Sbjct: 681 AIRQEKEIKGIQLG--KEEVKLSLFADDMIV 706

BLAST of CSPI04G20730 vs. Swiss-Prot
Match: LORF2_MOUSE (LINE-1 retrotransposable element ORF2 protein OS=Mus musculus GN=Pol PE=1 SV=2)

HSP 1 Score: 131.3 bits (329), Expect = 2.4e-29
Identity = 92/327 (28.13%), Postives = 157/327 (48.01%), Query Frame = 1

Query: 98  KSKSILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISS----PFICDILNWRGLSLQ 157
           + K +++ + + +G      +EI + I SF+  LY T++ +        D      L+  
Sbjct: 392 RDKILINKIRNEKGDITTDPEEIQNTIRSFYKRLYSTKLENLDEMDKFLDRYQVPKLNQD 451

Query: 158 DSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGI 217
               L  P + KEI  V+  +   KSPGPDG + EFY+     L   L ++F      G 
Sbjct: 452 QVDHLNSPISPKEIEAVINSLPTKKSPGPDGFSAEFYQTFKEDLIPILHKLFHKIEVEGT 511

Query: 218 INRRCNETYIYLIPK-KKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQM 277
           +     E  I LIPK +K+  ++ +FRPISL+    K+++K+L  R+++ + +II+  Q+
Sbjct: 512 LPNSFYEATITLIPKPQKDPTKIENFRPISLMNIDAKILNKILANRIQEHIKAIIHPDQV 571

Query: 278 AFVEGRQILDAILTASEAVDEWS-LRGRKGVLLKLDLEKAYDKVDWSFLDMAMKLKGFGK 337
            F+ G Q    I  +   +   + L+ +  +++ LD EKA+DK+   F+   ++  G   
Sbjct: 572 GFIPGMQGWFNIRKSINVIHYINKLKDKNHMIISLDAEKAFDKIQHPFMIKVLERSGIQG 631

Query: 338 RCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNE 397
                I    S    +I VNG     I  K G RQG PL+P+LF IV +    L     +
Sbjct: 632 PYLNMIKAIYSKPVANIKVNGEKLEAIPLKSGTRQGCPLSPYLFNIVLEV---LARAIRQ 691

Query: 398 KRSLKGFHFENLSEDLTHLQYADDTLL 419
           ++ +KG       E++     ADD ++
Sbjct: 692 QKEIKGIQIG--KEEVKISLLADDMIV 713

BLAST of CSPI04G20730 vs. Swiss-Prot
Match: PO22_POPJA (Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment) OS=Popillia japonica PE=3 SV=1)

HSP 1 Score: 85.1 bits (209), Expect = 2.0e-15
Identity = 72/246 (29.27%), Postives = 119/246 (48.37%), Query Frame = 1

Query: 179 SPGPDGLTGEFYKKSWNILKSDLVRVF-QDFFKNGIINRRCNETYIYLIPKKKEAARVSD 238
           +PG DGLT +       I ++ L R F Q     G +          LIPK  +    S+
Sbjct: 22  APGSDGLTVQA------ITRTRLPRNFVQLHLLRGHVPTPWTAMRTTLIPKDGDLENPSN 81

Query: 239 FRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEW--S 298
           +RPI++ ++L +++ ++L  RL+  +   ++ +Q  +      +D  L  S  +D +  S
Sbjct: 82  WRPITIASALQRLLHRILAKRLEAAVE--LHPAQKGYAR----IDGTLVNSLLLDTYISS 141

Query: 299 LRGRKGV--LLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVN-G 358
            R ++    ++ LD+ KA+D V  S +  A++  G  +    +I G LS +  +I V  G
Sbjct: 142 RREQRKTYNVVSLDVRKAFDTVSHSSICRALQRLGIDEGTSNYITGSLSDSTTTIRVGPG 201

Query: 359 RPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQY 418
               KI  +RG++QGDPL+PFLF  V D   C +      +S  G       E +  L +
Sbjct: 202 SQTRKICIRRGVKQGDPLSPFLFNAVLDELLCSL------QSTPGIGGTIGEEKIPVLAF 249

BLAST of CSPI04G20730 vs. TrEMBL
Match: A5BV95_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_026478 PE=4 SV=1)

HSP 1 Score: 393.3 bits (1009), Expect = 3.8e-106
Identity = 190/420 (45.24%), Postives = 285/420 (67.86%), Query Frame = 1

Query: 1    MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGAL 60
            M KLK +K+ LK WN  TFG +  +K++++  ++ +D +E+   LN + V ER   R  L
Sbjct: 897  MRKLKFVKSKLKEWNIMTFGDLKERKKLILTDLSRIDLIEQEGNLNSDLVLERTLKRREL 956

Query: 61   LDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEI 120
             D+++KE+  W QKS++ W++EG+ NS FFH   + R+S+  + SL+S  G+TL   ++I
Sbjct: 957  EDVLLKEEVQWRQKSRVKWIKEGDCNSKFFHRVATGRRSRKFIKSLISERGETLNNIEDI 1016

Query: 121  VDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSP 180
             +EI++FF NLY   +   +  + ++W  +S +    L+ PFTE+E+R  VF++   K+P
Sbjct: 1017 SEEIVNFFGNLYSKPVGESWRXEGIDWVPISGESGGWLDRPFTEEEVRRAVFQLNKEKAP 1076

Query: 181  GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 240
            GPDG T   Y++ W+++K DL+RVF +F  NG+IN+  N T+I L+PKK ++ ++SD+RP
Sbjct: 1077 GPDGFTIAVYQECWDVIKEDLMRVFLEFHTNGVINQSTNATFIALVPKKSQSVKISDYRP 1136

Query: 241  ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 300
            ISL+TSLYK+I+KVL  RL+KVL   I+DSQ AFVEGR ILDA+L A+E VDE    G +
Sbjct: 1137 ISLVTSLYKIIAKVLSGRLRKVLHETISDSQGAFVEGRHILDAVLIANEVVDEKRRSGEE 1196

Query: 301  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 360
            G++ K+D EKAYD VDW FLD  ++ KGF ++ R WI GCLS+++F+I+VNG  +G + A
Sbjct: 1197 GIVFKIDFEKAYDHVDWGFLDHVLQRKGFSQKWRLWIRGCLSSSSFAILVNGNAKGWVKA 1256

Query: 361  KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLLSS 420
             RG+RQGDPL+PFLFT+V D  S ++    E    +GF        ++ LQ+ADDT+  S
Sbjct: 1257 SRGLRQGDPLSPFLFTLVADVLSRMLFRAEETGLTEGFSVGRDRTRVSLLQFADDTIFFS 1316

BLAST of CSPI04G20730 vs. TrEMBL
Match: M5XUF8_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa015473mg PE=4 SV=1)

HSP 1 Score: 383.6 bits (984), Expect = 3.0e-103
Identity = 191/415 (46.02%), Postives = 270/415 (65.06%), Query Frame = 1

Query: 3   KLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLD 62
           +L+ +K  +K WNKE FG + S K+    +I  LD +E    L+    KERE+    + D
Sbjct: 530 RLRTIKQKIKDWNKEVFGDLVSAKKEAEARIAALDLMEGQGGLDNILRKEREDLYFMVSD 589

Query: 63  LIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEIVD 122
           L+ KE+  W Q+ K+ W R+G+ N+ FFH     R+ ++ +  L       +V E EI  
Sbjct: 590 LVHKEELKWRQRGKIQWARDGDSNTKFFHRIARGRRKRNFIQKLEVAGAGVVVNEWEIEL 649

Query: 123 EILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGP 182
           EI++FF NLY +   + +  + LNW  +S++++  LE PF E+E++  VF+ G  KSPGP
Sbjct: 650 EIINFFKNLYSSNAEAGWCLEGLNWNAISVEEAEWLERPFEEEEVKRAVFDCGIDKSPGP 709

Query: 183 DGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPIS 242
           DG +   ++  W  +K DL++V  DFF  GIIN   NET+I LIPKKKE+ +VSDFRPIS
Sbjct: 710 DGFSMLLFQSCWEYVKEDLMKVMADFFNCGIINAITNETFICLIPKKKESIKVSDFRPIS 769

Query: 243 LITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRKGV 302
           L+TSLYK++SKVL +RL++VL S I+  Q AFV+GRQILDA L A+E V+E     + G+
Sbjct: 770 LVTSLYKMVSKVLASRLREVLGSTISSYQSAFVQGRQILDAALIANEVVEESRRLNKSGM 829

Query: 303 LLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKR 362
           + K+DLEKAYD V+W F+D  +  KGFG R R WI GCL T NFS+++NGRPRGKI A R
Sbjct: 830 VFKIDLEKAYDHVEWRFVDEVLIRKGFGDRWRSWIRGCLETANFSVMINGRPRGKIRASR 889

Query: 363 GIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTL 418
           G+RQGDPL+PFLFT+V D  S ++    +     G    +   +++HLQ+ADDT+
Sbjct: 890 GLRQGDPLSPFLFTLVMDVLSRIMEKAQDTDQFHGLSPGHGMVEVSHLQFADDTI 944

BLAST of CSPI04G20730 vs. TrEMBL
Match: A5CAA2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_030956 PE=4 SV=1)

HSP 1 Score: 382.9 bits (982), Expect = 5.1e-103
Identity = 188/421 (44.66%), Postives = 283/421 (67.22%), Query Frame = 1

Query: 1    MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGAL 60
            M KL+ +KA LK WNK +FG++  +K+ ++  +   DSLE+   L+ E + +R   +G L
Sbjct: 1104 MRKLQFVKAKLKVWNKASFGELSKRKEDILSALVNFDSLEQEGGLSHELLAQRAIKKGEL 1163

Query: 61   LDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEI 120
             +LI++E+  W QK+++ W++EG+ NS FFH   + R+++  +  L +  G+ +   + I
Sbjct: 1164 EELILREEIHWRQKARVKWVKEGDCNSKFFHKVANGRRNRKFIKELENENGQMMNNSESI 1223

Query: 121  VDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSP 180
             +EIL +F  LY +     +  + L+W  +S + +  LE PFTE+EI + +F+M   K+P
Sbjct: 1224 KEEILRYFEKLYTSPSGESWRVEGLDWSPISGESAVRLESPFTEEEICKAIFQMDRDKAP 1283

Query: 181  GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 240
            GPDG T   ++  W ++K DLV+VF +F ++GIIN+  N ++I L+PKK  + R+SDFRP
Sbjct: 1284 GPDGFTIAVFQDCWEVIKEDLVKVFTEFHRSGIINQSTNASFIVLLPKKSMSRRISDFRP 1343

Query: 241  ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 300
            ISLITSLYK+I+KVL  R+++VL   I+ +Q AFV+GRQILDA+L A+E VDE    G +
Sbjct: 1344 ISLITSLYKIIAKVLAGRIREVLHETIHSTQGAFVQGRQILDAVLIANEIVDEKRRSGEE 1403

Query: 301  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 360
            GV+ K+D EKAYD V W FLD  M++KGFG R RKW+ GCLS+ +F+++VNG  +G + A
Sbjct: 1404 GVVFKIDFEKAYDHVSWDFLDHVMEMKGFGIRWRKWMRGCLSSVSFAVLVNGNAKGWVKA 1463

Query: 361  KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLLSS 420
             RG+RQGDPL+PFLFTIV D  S ++    E+  L+GF        ++HLQ+ADDT+  S
Sbjct: 1464 SRGLRQGDPLSPFLFTIVADVLSRMLLKAEERNVLEGFKVGRNRTRVSHLQFADDTIFFS 1523

Query: 421  S 422
            S
Sbjct: 1524 S 1524

BLAST of CSPI04G20730 vs. TrEMBL
Match: A5BH71_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_013913 PE=4 SV=1)

HSP 1 Score: 381.3 bits (978), Expect = 1.5e-102
Identity = 189/421 (44.89%), Postives = 281/421 (66.75%), Query Frame = 1

Query: 1    MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGAL 60
            + KL+ +KA LK WNK +FG++  +K+ ++  +   DSLE+   L+ E + +R   +G L
Sbjct: 668  IRKLQFVKAKLKEWNKTSFGELSKRKKYILSDLANFDSLEQEGGLSHELLVQRALRKGEL 727

Query: 61   LDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEI 120
             +LI++E+  W QK+++ W++EG+ NS+FFH   + R+++  +  L +  G  L   K I
Sbjct: 728  EELILREEIHWRQKARVKWVKEGDCNSNFFHKVANGRQNRKFIKELENESGLMLKDSKSI 787

Query: 121  VDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSP 180
             +EIL +F  LY +    P+  + L+W  +S + +S LE PFTE+EI + VF+M   K+P
Sbjct: 788  KEEILRYFEKLYVSPSGEPWRVEGLDWSPISGESASRLESPFTEEEIYKAVFQMDRDKAP 847

Query: 181  GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 240
            GPDG T   ++  W ++K DLVRVF +F ++GIIN+  N  +I L+PKK  + R+SDFRP
Sbjct: 848  GPDGFTIAVFQDCWKVIKEDLVRVFAEFHRSGIINQSTNAFFIVLLPKKSMSRRISDFRP 907

Query: 241  ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 300
            ISLITSLYK+I+KVL  RL+ VL   I+ +Q AFV+GRQILDA+L A+E VD+    G +
Sbjct: 908  ISLITSLYKIIAKVLAGRLRGVLHETIHSTQGAFVQGRQILDAVLIANEIVDDKRRSGEE 967

Query: 301  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 360
            GV+ K+D EKAYD   W FLD  +++KGF  R RKW+ GCLS+ +F+++VNG  +G + A
Sbjct: 968  GVVFKIDFEKAYDHASWDFLDHVLEMKGFSLRWRKWMRGCLSSVSFAVLVNGNAKGWVKA 1027

Query: 361  KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLLSS 420
             RG+RQGDPL+PFLFTIV D  S ++    E+  L+GF        ++HLQ+A+DT+  S
Sbjct: 1028 SRGLRQGDPLSPFLFTIVADVLSRMLLKVEERNVLEGFRVGRNRTRVSHLQFANDTIFFS 1087

Query: 421  S 422
            S
Sbjct: 1088 S 1088

BLAST of CSPI04G20730 vs. TrEMBL
Match: A5BQD9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_000232 PE=4 SV=1)

HSP 1 Score: 380.9 bits (977), Expect = 1.9e-102
Identity = 191/421 (45.37%), Postives = 276/421 (65.56%), Query Frame = 1

Query: 1    MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGAL 60
            M KL+ LKA LK WNK  FG +  +K+ ++  I   DS+E+   L+ E + +R   +G L
Sbjct: 1002 MRKLQFLKAKLKEWNKNAFGDLIERKKCILLDIANFDSMEQEGGLSPELLIQRAVRKGEL 1061

Query: 61   LDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEI 120
             +LI++E+  W QK+++ W++EG+ NS FFH   + R+++  +  L +  G  L     I
Sbjct: 1062 EELILREEIHWRQKARVKWVKEGDCNSKFFHKVANGRRNRKFIKVLENERGLVLDNSDSI 1121

Query: 121  VDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSP 180
             +EIL +F  LY +     +  + L+W  +S + +S LE PFTE+EI + +F+M   K+P
Sbjct: 1122 KEEILRYFEKLYASPSGESWRVEGLDWSPISSESASRLESPFTEEEISKAIFQMDRDKAP 1181

Query: 181  GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 240
            GPDG T   ++  W+++K DLVRVF +F ++GIIN+  N ++I L+PKK  A ++SD+RP
Sbjct: 1182 GPDGFTIAVFQDCWDVIKEDLVRVFDEFHRSGIINQSTNASFIVLLPKKSMAKKLSDYRP 1241

Query: 241  ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 300
            ISLITSLYK+I+KVL  RL+ VL   I+ +Q AFV+GRQILDA+L A+E VDE      +
Sbjct: 1242 ISLITSLYKIIAKVLAGRLRGVLHETIHSTQGAFVQGRQILDAVLIANEIVDEKKRSXEE 1301

Query: 301  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 360
            GV+ K+D EKAYD V W FLD  M+ KGF  R RKWI GCLS+ +F+I+VNG  +G + A
Sbjct: 1302 GVVFKIDFEKAYDHVSWDFLDHVMEKKGFNPRWRKWIRGCLSSVSFAILVNGNAKGWVKA 1361

Query: 361  KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLLSS 420
             RG+RQGDPL+PFLFTIV D  S ++    E+   +GF        ++HLQ+ADDT+  S
Sbjct: 1362 SRGLRQGDPLSPFLFTIVADVMSRMLLRAEERNVFEGFRVGRNRTRVSHLQFADDTIFFS 1421

Query: 421  S 422
            S
Sbjct: 1422 S 1422

BLAST of CSPI04G20730 vs. TAIR10
Match: AT1G43760.1 (AT1G43760.1 DNAse I-like superfamily protein)

HSP 1 Score: 114.8 bits (286), Expect = 1.3e-25
Identity = 79/262 (30.15%), Postives = 135/262 (51.53%), Query Frame = 1

Query: 2   EKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSL----EESSCLNEENVKERE-NC 61
           E LK  K   K  N++ FG I  + +  +D +  + S        S    E+V  ++ N 
Sbjct: 367 EHLKAAKKCCKLLNRQGFGNIQHKTKEALDSLESIQSQLLTNPSDSLFRVEHVARKKWNF 426

Query: 62  RGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVT 121
             A L+   +      QKS++ WL++G+ N+ FFH  + A ++K+++  L   +   +  
Sbjct: 427 FAAALESFYR------QKSRIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVEN 486

Query: 122 EKEIVDEILSFFSNLYG------TRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREV 181
             ++ + I++++++L G      T  S   I DI  +R      S L  +P ++KEI   
Sbjct: 487 VTQVKEMIVAYYTHLLGSDSDILTPDSVQRIKDIHPFRCNDTLASRLSALP-SDKEITAA 546

Query: 182 VFEMGCLKSPGPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKK 241
           VF M   K+PGPD  T EF+ +SW ++K   +   ++FF+ G + +R N T I LIPK  
Sbjct: 547 VFAMPRNKAPGPDSFTAEFFWESWFVVKDSTIAAVKEFFRTGHLLKRFNATAITLIPKVT 606

Query: 242 EAARVSDFRPISLITSLYKVIS 253
              ++S FRP+S  T +YK+I+
Sbjct: 607 GVDQLSMFRPVSCCTVVYKIIT 621

BLAST of CSPI04G20730 vs. TAIR10
Match: ATMG01250.1 (ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase))

HSP 1 Score: 65.9 bits (159), Expect = 7.0e-11
Identity = 29/68 (42.65%), Postives = 40/68 (58.82%), Query Frame = 1

Query: 349 IVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLT 408
           I+NG P+G +   RG+RQGDPL+P+LF +  +  S L     E+  L G    N S  + 
Sbjct: 13  IINGAPQGLVTPSRGLRQGDPLSPYLFILCTEVLSGLCRRAQEQGRLPGIRVSNNSPRIN 72

Query: 409 HLQYADDT 417
           HL +ADDT
Sbjct: 73  HLLFADDT 80

BLAST of CSPI04G20730 vs. TAIR10
Match: AT4G20520.1 (AT4G20520.1 RNA binding;RNA-directed DNA polymerases)

HSP 1 Score: 59.3 bits (142), Expect = 6.6e-09
Identity = 33/76 (43.42%), Postives = 48/76 (63.16%), Query Frame = 1

Query: 258 RLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRKGV----LLKLDLEKAYD 317
           RLK ++ ++I  +Q +F+ GR   D I+   EAV   S+R +KGV    LLKLDLEKAYD
Sbjct: 4   RLKPLMTNLIGPAQASFIPGRVSTDNIVFVQEAVH--SMRRKKGVKGWMLLKLDLEKAYD 63

Query: 318 KVDWSFLDMAMKLKGF 330
           ++ W +L+  +   GF
Sbjct: 64  RIRWDYLEDTLISAGF 77

BLAST of CSPI04G20730 vs. NCBI nr
Match: gi|147784237|emb|CAN75040.1| (hypothetical protein VITISV_026478 [Vitis vinifera])

HSP 1 Score: 393.3 bits (1009), Expect = 5.4e-106
Identity = 190/420 (45.24%), Postives = 285/420 (67.86%), Query Frame = 1

Query: 1    MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGAL 60
            M KLK +K+ LK WN  TFG +  +K++++  ++ +D +E+   LN + V ER   R  L
Sbjct: 897  MRKLKFVKSKLKEWNIMTFGDLKERKKLILTDLSRIDLIEQEGNLNSDLVLERTLKRREL 956

Query: 61   LDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEI 120
             D+++KE+  W QKS++ W++EG+ NS FFH   + R+S+  + SL+S  G+TL   ++I
Sbjct: 957  EDVLLKEEVQWRQKSRVKWIKEGDCNSKFFHRVATGRRSRKFIKSLISERGETLNNIEDI 1016

Query: 121  VDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSP 180
             +EI++FF NLY   +   +  + ++W  +S +    L+ PFTE+E+R  VF++   K+P
Sbjct: 1017 SEEIVNFFGNLYSKPVGESWRXEGIDWVPISGESGGWLDRPFTEEEVRRAVFQLNKEKAP 1076

Query: 181  GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 240
            GPDG T   Y++ W+++K DL+RVF +F  NG+IN+  N T+I L+PKK ++ ++SD+RP
Sbjct: 1077 GPDGFTIAVYQECWDVIKEDLMRVFLEFHTNGVINQSTNATFIALVPKKSQSVKISDYRP 1136

Query: 241  ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 300
            ISL+TSLYK+I+KVL  RL+KVL   I+DSQ AFVEGR ILDA+L A+E VDE    G +
Sbjct: 1137 ISLVTSLYKIIAKVLSGRLRKVLHETISDSQGAFVEGRHILDAVLIANEVVDEKRRSGEE 1196

Query: 301  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 360
            G++ K+D EKAYD VDW FLD  ++ KGF ++ R WI GCLS+++F+I+VNG  +G + A
Sbjct: 1197 GIVFKIDFEKAYDHVDWGFLDHVLQRKGFSQKWRLWIRGCLSSSSFAILVNGNAKGWVKA 1256

Query: 361  KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLLSS 420
             RG+RQGDPL+PFLFT+V D  S ++    E    +GF        ++ LQ+ADDT+  S
Sbjct: 1257 SRGLRQGDPLSPFLFTLVADVLSRMLFRAEETGLTEGFSVGRDRTRVSLLQFADDTIFFS 1316

BLAST of CSPI04G20730 vs. NCBI nr
Match: gi|596221970|ref|XP_007224079.1| (hypothetical protein PRUPE_ppa015473mg, partial [Prunus persica])

HSP 1 Score: 383.6 bits (984), Expect = 4.3e-103
Identity = 191/415 (46.02%), Postives = 270/415 (65.06%), Query Frame = 1

Query: 3   KLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLD 62
           +L+ +K  +K WNKE FG + S K+    +I  LD +E    L+    KERE+    + D
Sbjct: 530 RLRTIKQKIKDWNKEVFGDLVSAKKEAEARIAALDLMEGQGGLDNILRKEREDLYFMVSD 589

Query: 63  LIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEIVD 122
           L+ KE+  W Q+ K+ W R+G+ N+ FFH     R+ ++ +  L       +V E EI  
Sbjct: 590 LVHKEELKWRQRGKIQWARDGDSNTKFFHRIARGRRKRNFIQKLEVAGAGVVVNEWEIEL 649

Query: 123 EILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGP 182
           EI++FF NLY +   + +  + LNW  +S++++  LE PF E+E++  VF+ G  KSPGP
Sbjct: 650 EIINFFKNLYSSNAEAGWCLEGLNWNAISVEEAEWLERPFEEEEVKRAVFDCGIDKSPGP 709

Query: 183 DGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPIS 242
           DG +   ++  W  +K DL++V  DFF  GIIN   NET+I LIPKKKE+ +VSDFRPIS
Sbjct: 710 DGFSMLLFQSCWEYVKEDLMKVMADFFNCGIINAITNETFICLIPKKKESIKVSDFRPIS 769

Query: 243 LITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRKGV 302
           L+TSLYK++SKVL +RL++VL S I+  Q AFV+GRQILDA L A+E V+E     + G+
Sbjct: 770 LVTSLYKMVSKVLASRLREVLGSTISSYQSAFVQGRQILDAALIANEVVEESRRLNKSGM 829

Query: 303 LLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKR 362
           + K+DLEKAYD V+W F+D  +  KGFG R R WI GCL T NFS+++NGRPRGKI A R
Sbjct: 830 VFKIDLEKAYDHVEWRFVDEVLIRKGFGDRWRSWIRGCLETANFSVMINGRPRGKIRASR 889

Query: 363 GIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTL 418
           G+RQGDPL+PFLFT+V D  S ++    +     G    +   +++HLQ+ADDT+
Sbjct: 890 GLRQGDPLSPFLFTLVMDVLSRIMEKAQDTDQFHGLSPGHGMVEVSHLQFADDTI 944

BLAST of CSPI04G20730 vs. NCBI nr
Match: gi|147803328|emb|CAN68838.1| (hypothetical protein VITISV_030956 [Vitis vinifera])

HSP 1 Score: 382.9 bits (982), Expect = 7.4e-103
Identity = 188/421 (44.66%), Postives = 283/421 (67.22%), Query Frame = 1

Query: 1    MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGAL 60
            M KL+ +KA LK WNK +FG++  +K+ ++  +   DSLE+   L+ E + +R   +G L
Sbjct: 1104 MRKLQFVKAKLKVWNKASFGELSKRKEDILSALVNFDSLEQEGGLSHELLAQRAIKKGEL 1163

Query: 61   LDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEI 120
             +LI++E+  W QK+++ W++EG+ NS FFH   + R+++  +  L +  G+ +   + I
Sbjct: 1164 EELILREEIHWRQKARVKWVKEGDCNSKFFHKVANGRRNRKFIKELENENGQMMNNSESI 1223

Query: 121  VDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSP 180
             +EIL +F  LY +     +  + L+W  +S + +  LE PFTE+EI + +F+M   K+P
Sbjct: 1224 KEEILRYFEKLYTSPSGESWRVEGLDWSPISGESAVRLESPFTEEEICKAIFQMDRDKAP 1283

Query: 181  GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 240
            GPDG T   ++  W ++K DLV+VF +F ++GIIN+  N ++I L+PKK  + R+SDFRP
Sbjct: 1284 GPDGFTIAVFQDCWEVIKEDLVKVFTEFHRSGIINQSTNASFIVLLPKKSMSRRISDFRP 1343

Query: 241  ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 300
            ISLITSLYK+I+KVL  R+++VL   I+ +Q AFV+GRQILDA+L A+E VDE    G +
Sbjct: 1344 ISLITSLYKIIAKVLAGRIREVLHETIHSTQGAFVQGRQILDAVLIANEIVDEKRRSGEE 1403

Query: 301  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 360
            GV+ K+D EKAYD V W FLD  M++KGFG R RKW+ GCLS+ +F+++VNG  +G + A
Sbjct: 1404 GVVFKIDFEKAYDHVSWDFLDHVMEMKGFGIRWRKWMRGCLSSVSFAVLVNGNAKGWVKA 1463

Query: 361  KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLLSS 420
             RG+RQGDPL+PFLFTIV D  S ++    E+  L+GF        ++HLQ+ADDT+  S
Sbjct: 1464 SRGLRQGDPLSPFLFTIVADVLSRMLLKAEERNVLEGFKVGRNRTRVSHLQFADDTIFFS 1523

Query: 421  S 422
            S
Sbjct: 1524 S 1524

BLAST of CSPI04G20730 vs. NCBI nr
Match: gi|147815144|emb|CAN67932.1| (hypothetical protein VITISV_013913 [Vitis vinifera])

HSP 1 Score: 381.3 bits (978), Expect = 2.1e-102
Identity = 189/421 (44.89%), Postives = 281/421 (66.75%), Query Frame = 1

Query: 1    MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGAL 60
            + KL+ +KA LK WNK +FG++  +K+ ++  +   DSLE+   L+ E + +R   +G L
Sbjct: 668  IRKLQFVKAKLKEWNKTSFGELSKRKKYILSDLANFDSLEQEGGLSHELLVQRALRKGEL 727

Query: 61   LDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEI 120
             +LI++E+  W QK+++ W++EG+ NS+FFH   + R+++  +  L +  G  L   K I
Sbjct: 728  EELILREEIHWRQKARVKWVKEGDCNSNFFHKVANGRQNRKFIKELENESGLMLKDSKSI 787

Query: 121  VDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSP 180
             +EIL +F  LY +    P+  + L+W  +S + +S LE PFTE+EI + VF+M   K+P
Sbjct: 788  KEEILRYFEKLYVSPSGEPWRVEGLDWSPISGESASRLESPFTEEEIYKAVFQMDRDKAP 847

Query: 181  GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 240
            GPDG T   ++  W ++K DLVRVF +F ++GIIN+  N  +I L+PKK  + R+SDFRP
Sbjct: 848  GPDGFTIAVFQDCWKVIKEDLVRVFAEFHRSGIINQSTNAFFIVLLPKKSMSRRISDFRP 907

Query: 241  ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 300
            ISLITSLYK+I+KVL  RL+ VL   I+ +Q AFV+GRQILDA+L A+E VD+    G +
Sbjct: 908  ISLITSLYKIIAKVLAGRLRGVLHETIHSTQGAFVQGRQILDAVLIANEIVDDKRRSGEE 967

Query: 301  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 360
            GV+ K+D EKAYD   W FLD  +++KGF  R RKW+ GCLS+ +F+++VNG  +G + A
Sbjct: 968  GVVFKIDFEKAYDHASWDFLDHVLEMKGFSLRWRKWMRGCLSSVSFAVLVNGNAKGWVKA 1027

Query: 361  KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLLSS 420
             RG+RQGDPL+PFLFTIV D  S ++    E+  L+GF        ++HLQ+A+DT+  S
Sbjct: 1028 SRGLRQGDPLSPFLFTIVADVLSRMLLKVEERNVLEGFRVGRNRTRVSHLQFANDTIFFS 1087

Query: 421  S 422
            S
Sbjct: 1088 S 1088

BLAST of CSPI04G20730 vs. NCBI nr
Match: gi|147857332|emb|CAN79190.1| (hypothetical protein VITISV_000232 [Vitis vinifera])

HSP 1 Score: 380.9 bits (977), Expect = 2.8e-102
Identity = 191/421 (45.37%), Postives = 276/421 (65.56%), Query Frame = 1

Query: 1    MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGAL 60
            M KL+ LKA LK WNK  FG +  +K+ ++  I   DS+E+   L+ E + +R   +G L
Sbjct: 1002 MRKLQFLKAKLKEWNKNAFGDLIERKKCILLDIANFDSMEQEGGLSPELLIQRAVRKGEL 1061

Query: 61   LDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEI 120
             +LI++E+  W QK+++ W++EG+ NS FFH   + R+++  +  L +  G  L     I
Sbjct: 1062 EELILREEIHWRQKARVKWVKEGDCNSKFFHKVANGRRNRKFIKVLENERGLVLDNSDSI 1121

Query: 121  VDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSP 180
             +EIL +F  LY +     +  + L+W  +S + +S LE PFTE+EI + +F+M   K+P
Sbjct: 1122 KEEILRYFEKLYASPSGESWRVEGLDWSPISSESASRLESPFTEEEISKAIFQMDRDKAP 1181

Query: 181  GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 240
            GPDG T   ++  W+++K DLVRVF +F ++GIIN+  N ++I L+PKK  A ++SD+RP
Sbjct: 1182 GPDGFTIAVFQDCWDVIKEDLVRVFDEFHRSGIINQSTNASFIVLLPKKSMAKKLSDYRP 1241

Query: 241  ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 300
            ISLITSLYK+I+KVL  RL+ VL   I+ +Q AFV+GRQILDA+L A+E VDE      +
Sbjct: 1242 ISLITSLYKIIAKVLAGRLRGVLHETIHSTQGAFVQGRQILDAVLIANEIVDEKKRSXEE 1301

Query: 301  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 360
            GV+ K+D EKAYD V W FLD  M+ KGF  R RKWI GCLS+ +F+I+VNG  +G + A
Sbjct: 1302 GVVFKIDFEKAYDHVSWDFLDHVMEKKGFNPRWRKWIRGCLSSVSFAILVNGNAKGWVKA 1361

Query: 361  KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLLSS 420
             RG+RQGDPL+PFLFTIV D  S ++    E+   +GF        ++HLQ+ADDT+  S
Sbjct: 1362 SRGLRQGDPLSPFLFTIVADVMSRMLLRAEERNVFEGFRVGRNRTRVSHLQFADDTIFFS 1421

Query: 421  S 422
            S
Sbjct: 1422 S 1422

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
YTX2_XENLA2.2e-3827.27Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis PE=3 SV=1[more]
LIN1_NYCCO6.8e-3227.01LINE-1 reverse transcriptase homolog OS=Nycticebus coucang PE=3 SV=1[more]
LORF2_HUMAN1.7e-3026.59LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens PE=1 SV=1[more]
LORF2_MOUSE2.4e-2928.13LINE-1 retrotransposable element ORF2 protein OS=Mus musculus GN=Pol PE=1 SV=2[more]
PO22_POPJA2.0e-1529.27Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fra... [more]
Match NameE-valueIdentityDescription
A5BV95_VITVI3.8e-10645.24Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_026478 PE=4 SV=1[more]
M5XUF8_PRUPE3.0e-10346.02Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa015473mg PE=4 S... [more]
A5CAA2_VITVI5.1e-10344.66Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_030956 PE=4 SV=1[more]
A5BH71_VITVI1.5e-10244.89Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_013913 PE=4 SV=1[more]
A5BQD9_VITVI1.9e-10245.37Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_000232 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G43760.11.3e-2530.15 DNAse I-like superfamily protein[more]
ATMG01250.17.0e-1142.65ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)[more]
AT4G20520.16.6e-0943.42 RNA binding;RNA-directed DNA polymerases[more]
Match NameE-valueIdentityDescription
gi|147784237|emb|CAN75040.1|5.4e-10645.24hypothetical protein VITISV_026478 [Vitis vinifera][more]
gi|596221970|ref|XP_007224079.1|4.3e-10346.02hypothetical protein PRUPE_ppa015473mg, partial [Prunus persica][more]
gi|147803328|emb|CAN68838.1|7.4e-10344.66hypothetical protein VITISV_030956 [Vitis vinifera][more]
gi|147815144|emb|CAN67932.1|2.1e-10244.89hypothetical protein VITISV_013913 [Vitis vinifera][more]
gi|147857332|emb|CAN79190.1|2.8e-10245.37hypothetical protein VITISV_000232 [Vitis vinifera][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR000477RT_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI04G20730.1CSPI04G20730.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000477Reverse transcriptase domainPFAMPF00078RVT_1coord: 226..420
score: 1.3
IPR000477Reverse transcriptase domainPROFILEPS50878RT_POLcoord: 207..421
score: 15
NoneNo IPR availablePANTHERPTHR19446REVERSE TRANSCRIPTASEScoord: 48..378
score: 4.1E-101coord: 2..31
score: 4.1E
NoneNo IPR availablePANTHERPTHR19446:SF368SUBFAMILY NOT NAMEDcoord: 2..31
score: 4.1E-101coord: 48..378
score: 4.1E
NoneNo IPR availableunknownSSF56672DNA/RNA polymerasescoord: 178..420
score: 1.78

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
CSPI04G20730Cucsa.357490Cucumber (Gy14) v1cgycpiB529
The following gene(s) are paralogous to this gene:

None