CSPI07G08330 (gene) Wild cucumber (PI 183967)

Name: CSPI07G08330
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Chr7 : 6092900 .. 6094937 (+)
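
The length of the feature follows from the Location field. A minimal sketch, assuming the coordinates are 1-based and inclusive (a common GFF-style convention that this page does not state); `parse_location` is a hypothetical helper name used only for illustration:

```python
import re

def parse_location(text):
    """Split a 'Chr7 : 6092900 .. 6094937 (+)' style string into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: both endpoints belong to the feature, hence the +1.
    return chrom, start, end, strand, end - start + 1

print(parse_location("Chr7 : 6092900 .. 6094937 (+)"))
```

Under that assumption the gene spans 2038 bp on the plus strand of Chr7.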
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACACCTGAAGTAGCCATCCAACTTATGGGTTTTACAAATGCTAAGGACCTATGGGAAGTCACTCGAGATTTGTTTGGAATCCAGTCAAGAGCTGAGGAAGATTTTCTTCGCCAAACATTCCAAACGACGAGGAAATGTAATCTTAAAATGGAAGAGTATTTATGAACTATGAAAAATAATGCCGATAACCTTGGGCAAGCCGGCAGTCCTGTACCTCTGTGTGCTCTTGTTTCACAGGTGTTGTTGGGGTTAGATGAGGTTTACAACCCAATCATCGCTGTAATCCAAGGTAAACCAGACATATCGTGGTTGGATATGCAATCTGAACTTCTCATTTTTTAGAAAAGATTAGAGCATCAGAATAGCCAACGAAACACTGGAAATGCAGTCCGAAATGTCACTATCAATGTAGCTCAGAGGACAAATTCAAACGAACATAAATCCCACAATGGCCAACAATTTTATGGACAGCGAGGAAATTCAAACAATGGCAGAGGACGCGGTCGGGGAAGAGGAAACAAACCCACATGTCAAGTATGCGATAAGTATGATCACTATGCACTTGCTTGTTATAATCAATTTAACAGAGATTTTATGAGTCCTTTAGTTCAAGATCGAGGACAAAACTTGAGTGCATCTGCAAACCCAAATCCTTCTGCCTTTGTGACTACCCAGAGTCCCACTCCGTTTGCTACTTCTGAAACTGTAATTGATCCAAATTGGTATATTGATAGCGGAGCTACAAATCACGTCACGACAAAATCGTTTGGCTTGACCAATCCTACTGAATACTCAGGTAAAGAAAAAGCAACAGTAGGAAATGGAAATAACTTGAATATCTCTTGTGTTGGACATTCTTATTTGACTGATGCAAAACAGTTTTTAGCTCTTAAAAATGTACTTTGTGTTCCTGATATTACAAAGAACTTGGTTAGTGGTTCAAAACTTGCCCAAGATAATCATATATTTCTAGAGTTTCATAATTATTATTGTCTTGTTAAGGACAAAGCTACAGGGGAAATACTGCTGAAAGAAACACTTAAGGATGGTCTATATCACCTGGAAAATGTTGGTTTGATGGTTGCTGCTGAGTTAGAACATAGTACTATCAAGAAACAGCTTATACATAAAAATAAGGAGACATCGACATTTATTTTGTCAGGAGGAAAAAATCTTGTCAGTATCAATGTTGCTGTTTGGCATAAATGTTTAGGCCATCATTCTTCAAAGATTTTGAACACTTTAGTAAAGAAATGTAATGCATGTCAACTGGAAAAAGCACACAACCTCCCCTTTACTATTTCTCAATCTAGAGCTGCTGAATCTTTTGATATTATTTACTCTGATTTGTGGGGTCCTACACCAATTTGCTCATCCGATGGATTCCGTTATTACATAATTTTTGTGGATGATTATAGCAGATATACTTTAGATTTATCTTTTGAGCAGAAAAGTACTGCTATTGATGCTTTTAAGCACTTTATTACATATGTCAAAAACCAATTCAATAAATCCATCAAAGTTTTTCAGTCAGACAATGGGGGAGAGTATAAGAAGATACATCTACTGTGCACTACCTTAGGGATCAATTGGGATGCCCACACCTATTTTACAAGGTATGTCACCTATTGAATTGTTGTTTGCTCACAAATTATATTTTTCTCAATTGAAGGTGTTCGGGTGCTCTTGCTTTCCATGCTTAAGACCTTACCAATCCAATAAGTTTCAACAACGTTCTGAAAAGTGTGTGTTACTTGGCCCAAGCCCAATACACAAGGGGTTCAAGTGTTTGTCCAAGTCGGGCAAAGTGTTTATCTCACGACATGTTCAATTTAACGAGCATGAATTCCCCTTTTCTCAAATGTTCCCTCCAACCCATTTACCGTCGGCCCAATCTAATCCACCTTCTCTTTCCCTTGCCATCCCAATTAACTTCAACTCCATGGCCCAAAACAATTCAATTCCATCGGCCCAATTTTATTCCCACTCCACCCCCA
TCACAAATTCCTCACAATGTCTCCTCTCATTCACTTGA

mRNA sequence

ATGACACCTGAAGTAGCCATCCAACTTATGGGTTTTACAAATGCTAAGGACCTATGGGAAGTCACTCGAGATTTGTTTGGAATCCAGTCAAGAGCTGAGGAAGATTTTCTTCGCCAAACATTCCAAACGACGAGGAAATCCGGCAGTCCTGTACCTCTGTGTGCTCTTGTTTCACAGGTGTTGTTGGGGTTAGATGAGAAAAGATTAGAGCATCAGAATAGCCAACGAAACACTGGAAATGCAGTCCGAAATGTCACTATCAATGTAGCTCAGAGGACAAATTCAAACGAACATAAATCCCACAATGGCCAACAATTTTATGGACAGCGAGGAAATTCAAACAATGGCAGAGGACGCGGTCGGGGAAGAGGAAACAAACCCACATGTCAAGTATGCGATAAGTATGATCACTATGCACTTGCTTGTTATAATCAATTTAACAGAGATTTTATGAGTCCTTTAGTTCAAGATCGAGGACAAAACTTGAGTGCATCTGCAAACCCAAATCCTTCTGCCTTTGTGACTACCCAGAGTCCCACTCCGTTTGCTACTTCTGAAACTGTAATTGATCCAAATTGGTATATTGATAGCGGAGCTACAAATCACGTCACGACAAAATCGTTTGGCTTGACCAATCCTACTGAATACTCAGGTAAAGAAAAAGCAACAGTAGGAAATGGAAATAACTTGAATATCTCTTGTGTTGGACATTCTTATTTGACTGATGCAAAACAGTTTTTAGCTCTTAAAAATGTACTTTGTGTTCCTGATATTACAAAGAACTTGGTTAGTGGTTCAAAACTTGCCCAAGATAATCATATATTTCTAGAGTTTCATAATTATTATTGTCTTGTTAAGGACAAAGCTACAGGGGAAATACTGCTGAAAGAAACACTTAAGGATGGTCTATATCACCTGGAAAATGTTGGTTTGATGGTTGCTGCTGAGTTAGAACATAGTACTATCAAGAAACAGCTTATACATAAAAATAAGGAGACATCGACATTTATTTTGTCAGGAGGAAAAAATCTTGTCAGTATCAATGTTGCTGTTTGGCATAAATGTTTAGGCCATCATTCTTCAAAGATTTTGAACACTTTAGTAAAGAAATGTAATGCATGTCAACTGGAAAAAGCACACAACCTCCCCTTTACTATTTCTCAATCTAGAGCTGCTGAATCTTTTGATATTATTTACTCTGATTTGTGGGGTCCTACACCAATTTGCTCATCCGATGGATTCCGTTATTACATAATTTTTGTGGATGATTATAGCAGATATACTTTAGATTTATCTTTTGAGCAGAAAAGTACTGCTATTGATGCTTTTAAGCACTTTATTACATATGTCAAAAACCAATTCAATAAATCCATCAAAGTTTTTCAGTCAGACAATGGGGGAGAGTATAAGAAGATACATCTACTGTGCACTACCTTAGGGATCAATTGGGATGCCCACACCTATTTTACAAGACCTTACCAATCCAATAAGTTTCAACAACGTTCTGAAAAGTGTGTGTTACTTGGCCCAAGCCCAATACACAAGGGGTTCAAGTGTTTGTCCAAGTCGGGCAAAGTGTTTATCTCACGACATGTTCAATTTAACGAGCATGAATTCCCCTTTTCTCAAATGTTCCCTCCAACCCATTTACCGTCGGCCCAATCTAATCCACCTTCTCTTTCCCTTGCCATCCCAATTAACTTCAACTCCATGGCCCAAAACAATTCAATTCCATCGGCCCAATTTTATTCCCACTCCACCCCCATCACAAATTCCTCACAATGTCTCCTCTCATTCACTTGA

Coding sequence (CDS)

ATGACACCTGAAGTAGCCATCCAACTTATGGGTTTTACAAATGCTAAGGACCTATGGGAAGTCACTCGAGATTTGTTTGGAATCCAGTCAAGAGCTGAGGAAGATTTTCTTCGCCAAACATTCCAAACGACGAGGAAATCCGGCAGTCCTGTACCTCTGTGTGCTCTTGTTTCACAGGTGTTGTTGGGGTTAGATGAGAAAAGATTAGAGCATCAGAATAGCCAACGAAACACTGGAAATGCAGTCCGAAATGTCACTATCAATGTAGCTCAGAGGACAAATTCAAACGAACATAAATCCCACAATGGCCAACAATTTTATGGACAGCGAGGAAATTCAAACAATGGCAGAGGACGCGGTCGGGGAAGAGGAAACAAACCCACATGTCAAGTATGCGATAAGTATGATCACTATGCACTTGCTTGTTATAATCAATTTAACAGAGATTTTATGAGTCCTTTAGTTCAAGATCGAGGACAAAACTTGAGTGCATCTGCAAACCCAAATCCTTCTGCCTTTGTGACTACCCAGAGTCCCACTCCGTTTGCTACTTCTGAAACTGTAATTGATCCAAATTGGTATATTGATAGCGGAGCTACAAATCACGTCACGACAAAATCGTTTGGCTTGACCAATCCTACTGAATACTCAGGTAAAGAAAAAGCAACAGTAGGAAATGGAAATAACTTGAATATCTCTTGTGTTGGACATTCTTATTTGACTGATGCAAAACAGTTTTTAGCTCTTAAAAATGTACTTTGTGTTCCTGATATTACAAAGAACTTGGTTAGTGGTTCAAAACTTGCCCAAGATAATCATATATTTCTAGAGTTTCATAATTATTATTGTCTTGTTAAGGACAAAGCTACAGGGGAAATACTGCTGAAAGAAACACTTAAGGATGGTCTATATCACCTGGAAAATGTTGGTTTGATGGTTGCTGCTGAGTTAGAACATAGTACTATCAAGAAACAGCTTATACATAAAAATAAGGAGACATCGACATTTATTTTGTCAGGAGGAAAAAATCTTGTCAGTATCAATGTTGCTGTTTGGCATAAATGTTTAGGCCATCATTCTTCAAAGATTTTGAACACTTTAGTAAAGAAATGTAATGCATGTCAACTGGAAAAAGCACACAACCTCCCCTTTACTATTTCTCAATCTAGAGCTGCTGAATCTTTTGATATTATTTACTCTGATTTGTGGGGTCCTACACCAATTTGCTCATCCGATGGATTCCGTTATTACATAATTTTTGTGGATGATTATAGCAGATATACTTTAGATTTATCTTTTGAGCAGAAAAGTACTGCTATTGATGCTTTTAAGCACTTTATTACATATGTCAAAAACCAATTCAATAAATCCATCAAAGTTTTTCAGTCAGACAATGGGGGAGAGTATAAGAAGATACATCTACTGTGCACTACCTTAGGGATCAATTGGGATGCCCACACCTATTTTACAAGACCTTACCAATCCAATAAGTTTCAACAACGTTCTGAAAAGTGTGTGTTACTTGGCCCAAGCCCAATACACAAGGGGTTCAAGTGTTTGTCCAAGTCGGGCAAAGTGTTTATCTCACGACATGTTCAATTTAACGAGCATGAATTCCCCTTTTCTCAAATGTTCCCTCCAACCCATTTACCGTCGGCCCAATCTAATCCACCTTCTCTTTCCCTTGCCATCCCAATTAACTTCAACTCCATGGCCCAAAACAATTCAATTCCATCGGCCCAATTTTATTCCCACTCCACCCCCATCACAAATTCCTCACAATGTCTCCTCTCATTCACTTGA
BLAST of CSPI07G08330 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 108.6 bits (270), Expect = 2.4e-22
Identity = 106/409 (25.92%), Positives = 167/409 (40.83%), Query Frame = 1

Query: 109 QRGNSNNGRGRGRGRGNKPT------CQVCDKYDHYALACYN----------QFNRDFMS 168
           QR ++N GR   RG+    +      C  C++  H+   C N          Q N D  +
Sbjct: 206 QRSSNNYGRSGARGKSKNRSKSRVRNCYNCNQPGHFKRDCPNPRKGKGETSGQKNDDNTA 265

Query: 169 PLVQDRGQNLSASANPNPSAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTN 228
            +VQ+         N N   F+  +      +     +  W +D+ A++H T        
Sbjct: 266 AMVQN---------NDNVVLFINEEEECMHLSGP---ESEWVVDTAASHHATPVRDLFCR 325

Query: 229 PTEYSGKEKATV--GNGNNLNISCVGHSYL-TDAKQFLALKNVLCVPDITKNLVSGSKLA 288
              Y   +  TV  GN +   I+ +G   + T+    L LK+V  VPD+  NL+SG  L 
Sbjct: 326 ---YVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALD 385

Query: 289 QDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHK 348
           +D      + +Y+   K                 + L    L++A  +   T+ +     
Sbjct: 386 RDG-----YESYFANQK-----------------WRLTKGSLVIAKGVARGTLYR----- 445

Query: 349 NKETSTFILSGGKNLVS--INVAVWHKCLGHHSSKILNTLVKK-------------CNAC 408
              T+  I  G  N     I+V +WHK +GH S K L  L KK             C+ C
Sbjct: 446 ---TNAEICQGELNAAQDEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYC 505

Query: 409 QLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFE 468
              K H + F  S  R     D++YSD+ GP  I S  G +Y++ F+DD SR       +
Sbjct: 506 LFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILK 565

Query: 469 QKSTAIDAFKHFITYVKNQFNKSIKVFQSDNGGEY--KKIHLLCTTLGI 482
            K      F+ F   V+ +  + +K  +SDNGGEY  ++    C++ GI
Sbjct: 566 TKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGI 569
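
The percentages on an HSP statistics line are the match counts divided by the alignment length (for this HSP, 106/409 is about 25.92% identity). A minimal sketch that recomputes them from the printed line, assuming the line layout shown in this report; `check_hsp_stats` is a hypothetical helper, not part of any BLAST tool:

```python
import re

# Tolerates the misspelling "Postives" that some report renderings emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def check_hsp_stats(line):
    """Recompute identity/positive percentages and compare with the printed values."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    recomputed = (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )
    printed = (float(ident_pct), float(pos_pct))
    # Allow for rounding in the printed report.
    consistent = all(abs(r - p) < 0.01 for r, p in zip(recomputed, printed))
    return recomputed, consistent

line = "Identity = 106/409 (25.92%), Positives = 167/409 (40.83%), Query Frame = 1"
print(check_hsp_stats(line))
```

The same check can be run over every HSP in the report to catch lines damaged during copying.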

BLAST of CSPI07G08330 vs. Swiss-Prot
Match: YG22B_YEAST (Transposon Ty2-GR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY2B-GR2 PE=3 SV=1)

HSP 1 Score: 63.9 bits (154), Expect = 6.7e-09
Identity = 68/320 (21.25%), Positives = 132/320 (41.25%), Query Frame = 1

Query: 179 PTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHS 238
           PT    S   +  +  IDSGA+  +   +  L + T  S +         ++ I+ +G+ 
Sbjct: 440 PTHTIDSNDELPDHLLIDSGASQTLVRSAHYLHHATPNS-EINIVDAQKQDIPINAIGNL 499

Query: 239 YLTDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKET 298
           +             L  P+I  +L+S S+LA  N       N      +++ G +L    
Sbjct: 500 HFNFQNGTKTSIKALHTPNIAYDLLSLSELANQNITACFTRN----TLERSDGTVLAP-I 559

Query: 299 LKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFI------LSGGKNLVSINVAVW 358
           +K G ++  +   ++ + +   TI    ++K+K  + +       + G  N  SI  ++ 
Sbjct: 560 VKHGDFYWLSKKYLIPSHISKLTINN--VNKSKSVNKYPYPLIHRMLGHANFRSIQKSLK 619

Query: 359 HKCLGHHSSKIL---NTLVKKCNACQLEKA----HNLPFTISQSRAAESFDIIYSDLWGP 418
              + +     +   N    +C  C + K+    H     +    + E F  +++D++GP
Sbjct: 620 KNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLKYQESYEPFQYLHTDIFGP 679

Query: 419 TPICSSDGFRYYIIFVDDYSRY--TLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQS 478
                     Y+I F D+ +R+     L   ++ + ++ F   + ++KNQFN  + V Q 
Sbjct: 680 VHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQM 739

Query: 479 DNGGEY--KKIHLLCTTLGI 482
           D G EY  K +H   T  GI
Sbjct: 740 DRGSEYTNKTLHKFFTNRGI 751

BLAST of CSPI07G08330 vs. Swiss-Prot
Match: YG21B_YEAST (Transposon Ty2-GR1 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY2B-GR1 PE=5 SV=2)

HSP 1 Score: 63.9 bits (154), Expect = 6.7e-09
Identity = 68/320 (21.25%), Positives = 132/320 (41.25%), Query Frame = 1

Query: 179 PTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHS 238
           PT    S   +  +  IDSGA+  +   +  L + T  S +         ++ I+ +G+ 
Sbjct: 440 PTHTIDSNDELPDHLLIDSGASQTLVRSAHYLHHATPNS-EINIVDAQKQDIPINAIGNL 499

Query: 239 YLTDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKET 298
           +             L  P+I  +L+S S+LA  N       N      +++ G +L    
Sbjct: 500 HFNFQNGTKTSIKALHTPNIAYDLLSLSELANQNITACFTRN----TLERSDGTVLAP-I 559

Query: 299 LKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFI------LSGGKNLVSINVAVW 358
           +K G ++  +   ++ + +   TI    ++K+K  + +       + G  N  SI  ++ 
Sbjct: 560 VKHGDFYWLSKKYLIPSHISKLTINN--VNKSKSVNKYPYPLIHRMLGHANFRSIQKSLK 619

Query: 359 HKCLGHHSSKIL---NTLVKKCNACQLEKA----HNLPFTISQSRAAESFDIIYSDLWGP 418
              + +     +   N    +C  C + K+    H     +    + E F  +++D++GP
Sbjct: 620 KNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLKYQESYEPFQYLHTDIFGP 679

Query: 419 TPICSSDGFRYYIIFVDDYSRY--TLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQS 478
                     Y+I F D+ +R+     L   ++ + ++ F   + ++KNQFN  + V Q 
Sbjct: 680 VHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQM 739

Query: 479 DNGGEY--KKIHLLCTTLGI 482
           D G EY  K +H   T  GI
Sbjct: 740 DRGSEYTNKTLHKFFTNRGI 751

BLAST of CSPI07G08330 vs. Swiss-Prot
Match: YO21B_YEAST (Transposon Ty2-OR1 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY2B-OR1 PE=3 SV=1)

HSP 1 Score: 63.9 bits (154), Expect = 6.7e-09
Identity = 68/320 (21.25%), Positives = 132/320 (41.25%), Query Frame = 1

Query: 179 PTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHS 238
           PT    S   +  +  IDSGA+  +   +  L + T  S +         ++ I+ +G+ 
Sbjct: 440 PTRTIDSNDELPDHLLIDSGASQTLVRSAHYLHHATPNS-EINIVDAQKQDIPINAIGNL 499

Query: 239 YLTDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKET 298
           +             L  P+I  +L+S S+LA  N       N      +++ G +L    
Sbjct: 500 HFNFQNGTKTSIKALHTPNIAYDLLSLSELANQNITACFTRN----TLERSDGTVLAP-I 559

Query: 299 LKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFI------LSGGKNLVSINVAVW 358
           +K G ++  +   ++ + +   TI    ++K+K  + +       + G  N  SI  ++ 
Sbjct: 560 VKHGDFYWLSKKYLIPSHISKLTINN--VNKSKSVNKYPYPLIHRMLGHANFRSIQKSLK 619

Query: 359 HKCLGHHSSKIL---NTLVKKCNACQLEKA----HNLPFTISQSRAAESFDIIYSDLWGP 418
              + +     +   N    +C  C + K+    H     +    + E F  +++D++GP
Sbjct: 620 KNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLKYQESYEPFQYLHTDIFGP 679

Query: 419 TPICSSDGFRYYIIFVDDYSRY--TLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQS 478
                     Y+I F D+ +R+     L   ++ + ++ F   + ++KNQFN  + V Q 
Sbjct: 680 VHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQM 739

Query: 479 DNGGEY--KKIHLLCTTLGI 482
           D G EY  K +H   T  GI
Sbjct: 740 DRGSEYTNKTLHKFFTNRGI 751

BLAST of CSPI07G08330 vs. Swiss-Prot
Match: YD22B_YEAST (Transposon Ty2-DR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY2B-DR2 PE=3 SV=2)

HSP 1 Score: 63.9 bits (154), Expect = 6.7e-09
Identity = 68/320 (21.25%), Positives = 132/320 (41.25%), Query Frame = 1

Query: 179 PTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHS 238
           PT    S   +  +  IDSGA+  +   +  L + T  S +         ++ I+ +G+ 
Sbjct: 440 PTRTIDSNDELPDHLLIDSGASQTLVRSAHYLHHATPNS-EINIVDAQKQDIPINAIGNL 499

Query: 239 YLTDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKET 298
           +             L  P+I  +L+S S+LA  N       N      +++ G +L    
Sbjct: 500 HFNFQNGTKTSIKALHTPNIAYDLLSLSELANQNITACFTRN----TLERSDGTVLAP-I 559

Query: 299 LKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFI------LSGGKNLVSINVAVW 358
           +K G ++  +   ++ + +   TI    ++K+K  + +       + G  N  SI  ++ 
Sbjct: 560 VKHGDFYWLSKKYLIPSHISKLTINN--VNKSKSVNKYPYPLIHRMLGHANFRSIQKSLK 619

Query: 359 HKCLGHHSSKIL---NTLVKKCNACQLEKA----HNLPFTISQSRAAESFDIIYSDLWGP 418
              + +     +   N    +C  C + K+    H     +    + E F  +++D++GP
Sbjct: 620 KNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLKYQESYEPFQYLHTDIFGP 679

Query: 419 TPICSSDGFRYYIIFVDDYSRY--TLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQS 478
                     Y+I F D+ +R+     L   ++ + ++ F   + ++KNQFN  + V Q 
Sbjct: 680 VHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQM 739

Query: 479 DNGGEY--KKIHLLCTTLGI 482
           D G EY  K +H   T  GI
Sbjct: 740 DRGSEYTNKTLHKFFTNRGI 751

BLAST of CSPI07G08330 vs. TrEMBL
Match: A0A151S6M8_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_027809 PE=4 SV=1)

HSP 1 Score: 301.6 bits (771), Expect = 2.1e-78
Identity = 218/618 (35.28%), Positives = 295/618 (47.73%), Query Frame = 1

Query: 1   MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK-------------- 60
           MT EVA QL+    ++ +WE  + L G  +R+   FL+  F  TRK              
Sbjct: 1   MTQEVATQLLHCETSQQIWEDAQSLAGAHTRSRITFLKTEFHRTRKGGLKMEEYLTKMKE 60

Query: 61  -------SGSPVPLCALVSQVLLGLD----------------------------EKRLEH 120
                  +GS V    LV+Q L GLD                            E RLE 
Sbjct: 61  IADDLALAGSSVSTMDLVTQTLAGLDNEYNPIVVQLSDKEHLTWVEMQAQLLTYENRLEQ 120

Query: 121 QNSQRN-TGNAVRNV-TINVAQRTNSNEHKSHNGQQFY-GQRGNSNNGRGRGRGRGNKPT 180
            N+Q N T N   N+ TI   +R  SN      G Q   G RG    GRGRGR   ++  
Sbjct: 121 INNQSNLTLNPSSNISTILYNRRGKSNAFGGGRGGQINRGARG----GRGRGRATKDRIV 180

Query: 181 CQVCDKYDHYALACYNQFNRDFMSPLVQDRGQNLSASANPNPSAFVTTQSPTPFATSETV 240
           CQVC K  H A  CY++FN++++     ++        N N +A+V + S        TV
Sbjct: 181 CQVCCKPGHAASHCYHRFNKNYIGQNSDEQKSEKDKEQNYNFNAYVASPS--------TV 240

Query: 241 IDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLA 300
            D +WY DSGA+NHVT     +    E  GK   TVGNG NL I   G S L   ++ L 
Sbjct: 241 EDLDWYFDSGASNHVTYDQNKVQEVNENDGKSFLTVGNGANLKIIACGDSSLDTQQKSLN 300

Query: 301 LKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLEN 360
           LK++L VP ITKNL+S SKL  DN I++EFH+  C VKDK TG ILL+  +KDGLY L  
Sbjct: 301 LKDILYVPKITKNLLSISKLTFDNDIYVEFHDVACFVKDKLTGRILLEGKIKDGLYQLPG 360

Query: 361 VGLMVAAELEHSTIKK-QLIHKNKETSTFILSGGKNLVSINVAVWHKCLGHHSSKILNTL 420
                      ST K+  +    KET                  WH+ LGH +SK+LN +
Sbjct: 361 GST--------STNKRPHVFFSIKET------------------WHRKLGHPNSKVLNEV 420

Query: 421 VKKCN-------------ACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGF 480
           +K CN             ACQ  KAHNLPF  S S A E  D+++SD+WGP PI S  GF
Sbjct: 421 MKLCNIEASPCENFEFCEACQFGKAHNLPFQNSVSCAKEPLDLVHSDVWGPAPISSVSGF 480

Query: 481 RYYIIFVDDYSRYTLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQSDNGGEYKKIHL 540
           +YY++F+DD+SR+T     +QKS    AF  F   V+NQFNK IK  Q D GGE+K +  
Sbjct: 481 KYYVLFLDDWSRFTWIYPLKQKSDVFQAFIQFRNLVENQFNKRIKTLQCDGGGEFKSLSK 540

Query: 541 LCTTLGINWDAHTYFTRPYQSNKFQQRSEKCVLLG-------PSPIHKGFKCLSKSGKVF 546
           +    GI       +T   Q+ + +++    V  G         P+H  ++  S +  + 
Sbjct: 541 VLIKTGIQLRESCPYTSA-QNGRAERKHRHVVESGLTLLAQAKMPLHYWWEAFSTAVFLI 579

BLAST of CSPI07G08330 vs. TrEMBL
Match: A0A151U9E7_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) OS=Cajanus cajan GN=KK1_020152 PE=4 SV=1)

HSP 1 Score: 232.3 bits (591), Expect = 1.6e-57
Identity = 154/446 (34.53%), Positives = 229/446 (51.35%), Query Frame = 1

Query: 61  LLGLDEKRLEHQNSQRNT---GNAVRNVTINVAQRTNSNEHKSHNGQQFYGQRGNSNNGR 120
           LL   E+RLE   +  +     N  +    +  Q TN     ++NG    G     + GR
Sbjct: 201 LLMAQEERLEKHKASESVPLQANLAQGSFNSRKQFTNQGRGSNNNGTHGRGP----HRGR 260

Query: 121 GRGR-----GRGNKPTCQVCDKYDHYALACYNQFNRDFMSPLVQDRGQNLSASANP-NPS 180
           GRGR     GRGNK  CQVC K  H A  C++++++ +  P       NL+ + N  NP 
Sbjct: 261 GRGRSAQHFGRGNKQPCQVCGKIGHIAFHCWHRYDQQYTEP-------NLNHNTNVYNPG 320

Query: 181 AFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLN 240
                Q+     +   V D  WY DSGATNH+T+    L + T+Y+G++K  +GNG  + 
Sbjct: 321 NQQQMQAMIA-GSQNMVYDDQWYPDSGATNHLTSDLNNLGSKTDYTGQDKIHMGNGQAIG 380

Query: 241 ISCVGHSYLTD--AKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKA 300
           I+  G S+     + +   LK +L VP ITKNL+S SK  +DN++++EFH  YCLVK + 
Sbjct: 381 INHTGTSFFHTPMSSKIFTLKELLHVPHITKNLLSVSKFCKDNNVYVEFHTNYCLVKSQD 440

Query: 301 TGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVSINV 360
           + E LL+  LK+GLY  + V ++     EH+      +  N+ T+ ++    K+      
Sbjct: 441 SKETLLRGNLKNGLYVFDEVQILKPDVYEHTV---DSVPTNR-TTLYVQRTRKSY----- 500

Query: 361 AVWHKCLGHHSSKILNTLVKKCN--------------ACQLEKAHNLPFTISQSRAAESF 420
            VWH  L H S KI+  ++K CN              +C + K+H LPF+ S +      
Sbjct: 501 DVWHNRLAHASRKIVQAVMKTCNVPMPINHQDSIVCKSCCIGKSHTLPFSDSYTVYNSPL 560

Query: 421 DIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQKSTAIDAFKHFITYVKNQFN 480
           +++YSD+WGP+   S +GFRYY+ F D +S+YT       KS     F HF + V+NQF 
Sbjct: 561 ELVYSDVWGPSHYASREGFRYYVHFTDAFSKYTWIYFMHNKSETAHHFIHFKSMVENQFG 620

Query: 481 KSIKVFQSDNGGEYKKIHLLCTTLGI 482
             IK+FQSD G E+  +  L    GI
Sbjct: 621 HKIKMFQSDGGKEFTCLTKLFNENGI 625

BLAST of CSPI07G08330 vs. TrEMBL
Match: A0A151SEF4_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_024838 PE=4 SV=1)

HSP 1 Score: 232.3 bits (591), Expect = 1.6e-57
Identity = 154/446 (34.53%), Positives = 229/446 (51.35%), Query Frame = 1

Query: 61  LLGLDEKRLEHQNSQRNT---GNAVRNVTINVAQRTNSNEHKSHNGQQFYGQRGNSNNGR 120
           LL   E+RLE   +  +     N  +    +  Q TN     ++NG    G     + GR
Sbjct: 67  LLMAQEERLEKHKASESVPLQANLAQGSFNSRKQFTNQGRGSNNNGTHGRGP----HRGR 126

Query: 121 GRGR-----GRGNKPTCQVCDKYDHYALACYNQFNRDFMSPLVQDRGQNLSASANP-NPS 180
           GRGR     GRGNK  CQVC K  H A  C++++++ +  P       NL+ + N  NP 
Sbjct: 127 GRGRSAQHFGRGNKQPCQVCGKIGHIAFHCWHRYDQQYTEP-------NLNHNTNVYNPG 186

Query: 181 AFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLN 240
                Q+     +   V D  WY DSGATNH+T+    L + T+Y+G++K  +GNG  + 
Sbjct: 187 NQQQMQAMIA-GSQNMVYDDQWYPDSGATNHLTSDLNNLGSKTDYTGQDKIHMGNGQAIG 246

Query: 241 ISCVGHSYLTD--AKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKA 300
           I+  G S+     + +   LK +L VP ITKNL+S SK  +DN++++EFH  YCLVK + 
Sbjct: 247 INHTGTSFFHTPMSSKIFTLKELLHVPHITKNLLSVSKFCKDNNVYVEFHTNYCLVKSQD 306

Query: 301 TGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVSINV 360
           + E LL+  LK+GLY  + V ++     EH+      +  N+ T+ ++    K+      
Sbjct: 307 SKETLLRGNLKNGLYVFDEVQILKPDVYEHTV---DSVPTNR-TTLYVQRTRKSY----- 366

Query: 361 AVWHKCLGHHSSKILNTLVKKCN--------------ACQLEKAHNLPFTISQSRAAESF 420
            VWH  L H S KI+  ++K CN              +C + K+H LPF+ S +      
Sbjct: 367 DVWHNRLAHASRKIVQAVMKTCNVPMPINHQDSIVCKSCCIGKSHTLPFSDSYTVYNSPL 426

Query: 421 DIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQKSTAIDAFKHFITYVKNQFN 480
           +++YSD+WGP+   S +GFRYY+ F D +S+YT       KS     F HF + V+NQF 
Sbjct: 427 ELVYSDVWGPSHYASREGFRYYVHFTDAFSKYTWIYFMHNKSETAHHFIHFKSMVENQFG 486

Query: 481 KSIKVFQSDNGGEYKKIHLLCTTLGI 482
             IK+FQSD G E+  +  L    GI
Sbjct: 487 HKIKMFQSDGGKEFTCLTKLFNENGI 491

BLAST of CSPI07G08330 vs. TrEMBL
Match: A5BUG6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_005638 PE=4 SV=1)

HSP 1 Score: 229.9 bits (585), Expect = 7.9e-57
Identity = 171/579 (29.53%), Positives = 263/579 (45.42%), Query Frame = 1

Query: 8   QLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRKS-------------------- 67
           Q++G  ++   W      F   SRA    LR   Q+T+K                     
Sbjct: 3   QIIGHNSSHSAWNALEKTFSSSSRARIMQLRLELQSTKKGSLSMIDYIMKVKGAADSLAA 62

Query: 68  -GSPVPLCALVSQVLLGL--------------DEK-----------RLEHQNSQRNTGNA 127
            G PV     V  +L GL              D+K             EH+  Q+++   
Sbjct: 63  IGEPVSEQDQVMNLLGGLGSDYNAVVTAINIKDDKISIEVVHSMLLAFEHRLEQQSSIEQ 122

Query: 128 VRNVTINVAQRTNSN-EHKSHNGQQFYGQRGNSNNGRGRGRGRGN--------------K 187
             +++ N A  +NS    + +NG +      N +N   RGRGRG               K
Sbjct: 123 FSSISANYASSSNSRGSGRRYNGGRGQNHTPNISNYTYRGRGRGGRYGQNGRHNSNSSEK 182

Query: 188 PTCQVCDKYDHYALACYNQFNRDFMSPLVQDRGQNLSASANPNPSAFVTTQSPTPFATSE 247
           P CQ+C K+ H    CY++F+  + S     +  N S S   NP++      P   A+S 
Sbjct: 183 PQCQLCGKFGHTVQICYHKFDISYQS----SQSSNTSPSNASNPNSI-----PAMVASSN 242

Query: 248 TVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVG-HSYLTDAKQ 307
            + +  WY+DSGA +H+T     LT+ + Y+G +K T+GNG +L+IS  G H  L+D++ 
Sbjct: 243 NLAEDTWYLDSGANHHLTQSVGNLTSSSPYTGIDKVTIGNGKHLSISNTGSHRLLSDSRS 302

Query: 308 FLALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYH 367
           F  LK V  V  I+ NL+S +K   DN+   EF +    VKD  T ++L +  L++GLY 
Sbjct: 303 F-HLKKVFHVHFISANLISVAKFYLDNNALFEFRSNSFFVKDLHTKKVLAQGKLENGLYR 362

Query: 368 LENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVSINVAVWHKCLGHHSSKILN 427
              +              K++       S+   S   ++    V +WH  LGH S+ I+ 
Sbjct: 363 FPVLN------------SKKVAFVGAINSSTFYSHNSSIFDNKVKLWHHRLGHASTNIVT 422

Query: 428 TLVKKCN--------------------ACQLEKAHNLPFTISQSRAAESFDIIYSDLWGP 487
            +++ CN                    +CQL K+H LP  +S S A++  +++++DLWGP
Sbjct: 423 QIMQSCNVSFEKNKNTVCSTVCSTVCSSCQLAKSHRLPTHLSLSCASKPLELVHTDLWGP 482

Query: 488 TPICSSDGFRYYIIFVDDYSRYTLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQSDN 505
             + S+ G RY+I+F+DDYSRYT     + K  A+ AFK F   V+NQF+  IK  QSDN
Sbjct: 483 ASVKSTSGARYFILFLDDYSRYTWFYPLQTKDQALPAFKKFKLQVENQFDAKIKCLQSDN 542

BLAST of CSPI07G08330 vs. TrEMBL
Match: A5BFT3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_017741 PE=4 SV=1)

HSP 1 Score: 229.6 bits (584), Expect = 1.0e-56
Identity = 162/484 (33.47%), Positives = 232/484 (47.93%), Query Frame = 1

Query: 59  QVLLGLDEKRLEHQNSQRNTGNAVRNVTINVAQRTN--------SNEHKSHNGQQ----- 118
           + LL   E R+E  N+  ++  +    + N  ++ N        +N   SH+G       
Sbjct: 196 EALLMAHESRVEKNNNSLDSSPSAHVASSNAVEKGNRFKQDYYAANSQGSHSGYNGGFGR 255

Query: 119 ---------FYGQRG-----NSNNGRGRGRGRGNK-----------------PTCQVCDK 178
                    FYG RG     N  + RG  RGRGNK                 P CQ+C K
Sbjct: 256 GGDFGRRGGFYGGRGFNWNYNGRSNRGGFRGRGNKGSFQARPPWNSDNQNEKPACQLCGK 315

Query: 179 YDHYALACYNQFNRDFMSPLVQDRGQNLSASANPNPSAFVT--TQSPTPFATSETVIDPN 238
             H    CY +F+  F  P      QNLS S N +P A+ +   Q      TSE   D N
Sbjct: 316 IGHVVAQCYYRFDHTFQVP------QNLS-SRNSSPRAYYSFSPQVNGVIPTSEVFSDDN 375

Query: 239 WYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTD--AKQFLALK 298
           WY DSGA+NHVT     L    E++G+ +  VGNG  L+I  +G S      + + L L 
Sbjct: 376 WYPDSGASNHVTPNPENLMKSAEFAGQNQVHVGNGTGLSIKHIGQSEFLSPFSSKPLLLN 435

Query: 299 NVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVG 358
           ++L VP ITKNL+S SK A+DN +F EFH+  C VKD+ T  +L+   ++DGLY  ++  
Sbjct: 436 HLLHVPSITKNLLSVSKFAKDNKVFFEFHSDSCFVKDQVTQAVLMVGKVRDGLYAFDSSH 495

Query: 359 LMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVSINVAVWHKCLGHHSSKILNTLVKK 418
           L  A     S  K   +  +  +S    +     +S    +WHK LGH S+  +  ++ K
Sbjct: 496 L--ALRPTQSLSKSPSVVASSFSSKVCTTS----LSSTFDLWHKRLGHPSAATIKNVLSK 555

Query: 419 CN-------------ACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYY 478
           CN             +C L K H  PF++S +   +  ++I+ DLWGPT + S+ G+RYY
Sbjct: 556 CNVAHINKMDSNFCSSCCLGKIHRFPFSLSHTTYTKPLELIHLDLWGPTLVLSNSGYRYY 615

Query: 479 IIFVDDYSRYTLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQSDNGGEYKKIHLLCT 482
           I FVD +SR++       KS AI  F +F T V+ QF+  IK  Q+D GGE++       
Sbjct: 616 IHFVDAFSRFSWIFLLRNKSEAIKTFVNFKTQVELQFDLKIKSLQTDWGGEFRAFQSYLA 666

BLAST of CSPI07G08330 vs. NCBI nr
Match: gi|1012339207|gb|KYP50444.1| (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 301.6 bits (771), Expect = 3.1e-78
Identity = 218/618 (35.28%), Positives = 295/618 (47.73%), Query Frame = 1

Query: 1   MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK-------------- 60
           MT EVA QL+    ++ +WE  + L G  +R+   FL+  F  TRK              
Sbjct: 1   MTQEVATQLLHCETSQQIWEDAQSLAGAHTRSRITFLKTEFHRTRKGGLKMEEYLTKMKE 60

Query: 61  -------SGSPVPLCALVSQVLLGLD----------------------------EKRLEH 120
                  +GS V    LV+Q L GLD                            E RLE 
Sbjct: 61  IADDLALAGSSVSTMDLVTQTLAGLDNEYNPIVVQLSDKEHLTWVEMQAQLLTYENRLEQ 120

Query: 121 QNSQRN-TGNAVRNV-TINVAQRTNSNEHKSHNGQQFY-GQRGNSNNGRGRGRGRGNKPT 180
            N+Q N T N   N+ TI   +R  SN      G Q   G RG    GRGRGR   ++  
Sbjct: 121 INNQSNLTLNPSSNISTILYNRRGKSNAFGGGRGGQINRGARG----GRGRGRATKDRIV 180

Query: 181 CQVCDKYDHYALACYNQFNRDFMSPLVQDRGQNLSASANPNPSAFVTTQSPTPFATSETV 240
           CQVC K  H A  CY++FN++++     ++        N N +A+V + S        TV
Sbjct: 181 CQVCCKPGHAASHCYHRFNKNYIGQNSDEQKSEKDKEQNYNFNAYVASPS--------TV 240

Query: 241 IDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLA 300
            D +WY DSGA+NHVT     +    E  GK   TVGNG NL I   G S L   ++ L 
Sbjct: 241 EDLDWYFDSGASNHVTYDQNKVQEVNENDGKSFLTVGNGANLKIIACGDSSLDTQQKSLN 300

Query: 301 LKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLEN 360
           LK++L VP ITKNL+S SKL  DN I++EFH+  C VKDK TG ILL+  +KDGLY L  
Sbjct: 301 LKDILYVPKITKNLLSISKLTFDNDIYVEFHDVACFVKDKLTGRILLEGKIKDGLYQLPG 360

Query: 361 VGLMVAAELEHSTIKK-QLIHKNKETSTFILSGGKNLVSINVAVWHKCLGHHSSKILNTL 420
                      ST K+  +    KET                  WH+ LGH +SK+LN +
Sbjct: 361 GST--------STNKRPHVFFSIKET------------------WHRKLGHPNSKVLNEV 420

Query: 421 VKKCN-------------ACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGF 480
           +K CN             ACQ  KAHNLPF  S S A E  D+++SD+WGP PI S  GF
Sbjct: 421 MKLCNIEASPCENFEFCEACQFGKAHNLPFQNSVSCAKEPLDLVHSDVWGPAPISSVSGF 480

Query: 481 RYYIIFVDDYSRYTLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQSDNGGEYKKIHL 540
           +YY++F+DD+SR+T     +QKS    AF  F   V+NQFNK IK  Q D GGE+K +  
Sbjct: 481 KYYVLFLDDWSRFTWIYPLKQKSDVFQAFIQFRNLVENQFNKRIKTLQCDGGGEFKSLSK 540

Query: 541 LCTTLGINWDAHTYFTRPYQSNKFQQRSEKCVLLG-------PSPIHKGFKCLSKSGKVF 546
           +    GI       +T   Q+ + +++    V  G         P+H  ++  S +  + 
Sbjct: 541 VLIKTGIQLRESCPYTSA-QNGRAERKHRHVVESGLTLLAQAKMPLHYWWEAFSTAVFLI 579

BLAST of CSPI07G08330 vs. NCBI nr
Match: gi|147816383|emb|CAN68489.1| (hypothetical protein VITISV_037543 [Vitis vinifera])

HSP 1 Score: 237.3 bits (604), Expect = 7.1e-59
Identity = 164/536 (30.60%), Positives = 244/536 (45.52%), Query Frame = 1

Query: 1   MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRKS------------- 60
           +TPE+  Q++G+ ++   W      F   SRA    LR  FQTTRK              
Sbjct: 164 LTPEIMGQIVGYQSSHAXWFALEXXFXASSRARVMQLRLEFQTTRKGSLTMMEYILKLKS 223

Query: 61  --------GSPVPLCALVSQVLLGLDEK----------RLEHQNSQRNTGNAVRNVTINV 120
                   G PV     + Q+L GL             R +  NS         N+    
Sbjct: 224 LADNLAAIGEPVTDRDQILQLLGGLGADYNSIVASLTAREDEDNSVAEDNVISANLATPQ 283

Query: 121 AQRTNSNEHKSHNGQQFYGQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFNRD 180
            Q  N+      N Q  +  R  +N GR +     ++P CQ+C K+ H  + CY++F+ +
Sbjct: 284 YQHFNNKRSSGQNRQSGFNTRRGTNGGRSQSSQ--HRPQCQLCGKFGHTVVRCYHRFDIN 343

Query: 181 FMSPLVQDRGQNLSASANPNPSAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFG 240
           F     Q    N+       P+A    Q     A+  T+ D  W+ D+GAT+H++     
Sbjct: 344 F-----QGYNPNMDTVQTNKPNA--KNQVQAMMASPSTISDEAWFFDTGATHHLSQSIDP 403

Query: 241 LTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLALKNVLCVPDITKNLVSGSKLA 300
           L++   Y G +K  VGNG +L I   G ++   + +   L+ VL VPDI  NL+S S+  
Sbjct: 404 LSDVQPYMGNDKVIVGNGKHLRILHTGTTFFPSSSKTFQLRQVLHVPDIATNLISVSQFC 463

Query: 301 QDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHK 360
            DN+ F EFH  +  VKD+ T +ILL+ +L+ GLY      +   A    S+  +     
Sbjct: 464 ADNNTFFEFHPRFFFVKDQVTKKILLQGSLEHGLYRFPARFVPSPAAFVSSSYDRSSNLS 523

Query: 361 NKETSTFILSGGKNLVSINVAVWHKCLGHHSSKILNTLVKKCN------------ACQLE 420
              T+T               +WH  LGH +  IL  ++  CN            ACQ  
Sbjct: 524 LTTTTT---------------LWHSRLGHPADNILKHILTSCNISHQCHKNNVCCACQFA 583

Query: 421 KAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQKS 480
           K+H LPF +  SRA+    ++++DLWGP  I S+ G RY+I+FVDD+SR++       K 
Sbjct: 584 KSHKLPFNVXVSRASHPLALLHADLWGPXSIPSTTGARYFILFVDDFSRFSWIYPLHSKD 643

Query: 481 TAIDAFKHFITYVKNQFNKSIKVFQSDNGGEYKKIHLLCTTLGINWDAHTYFTRPY 494
            A+  F  F + V+NQFN  I+  +SDNGGE+K       T GI     + F+ PY
Sbjct: 644 QALSVFIKFKSLVENQFNSRIQCLRSDNGGEFKAFSSYLATHGIK----SQFSCPY 671

BLAST of CSPI07G08330 vs. NCBI nr
Match: gi|1012342015|gb|KYP53212.1| (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 232.3 bits (591), Expect = 2.3e-57
Identity = 154/446 (34.53%), Positives = 229/446 (51.35%), Query Frame = 1

Query: 61  LLGLDEKRLEHQNSQRNT---GNAVRNVTINVAQRTNSNEHKSHNGQQFYGQRGNSNNGR 120
           LL   E+RLE   +  +     N  +    +  Q TN     ++NG    G     + GR
Sbjct: 67  LLMAQEERLEKHKASESVPLQANLAQGSFNSRKQFTNQGRGSNNNGTHGRGP----HRGR 126

Query: 121 GRGR-----GRGNKPTCQVCDKYDHYALACYNQFNRDFMSPLVQDRGQNLSASANP-NPS 180
           GRGR     GRGNK  CQVC K  H A  C++++++ +  P       NL+ + N  NP 
Sbjct: 127 GRGRSAQHFGRGNKQPCQVCGKIGHIAFHCWHRYDQQYTEP-------NLNHNTNVYNPG 186

Query: 181 AFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLN 240
                Q+     +   V D  WY DSGATNH+T+    L + T+Y+G++K  +GNG  + 
Sbjct: 187 NQQQMQAMIA-GSQNMVYDDQWYPDSGATNHLTSDLNNLGSKTDYTGQDKIHMGNGQAIG 246

Query: 241 ISCVGHSYLTD--AKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKA 300
           I+  G S+     + +   LK +L VP ITKNL+S SK  +DN++++EFH  YCLVK + 
Sbjct: 247 INHTGTSFFHTPMSSKIFTLKELLHVPHITKNLLSVSKFCKDNNVYVEFHTNYCLVKSQD 306

Query: 301 TGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVSINV 360
           + E LL+  LK+GLY  + V ++     EH+      +  N+ T+ ++    K+      
Sbjct: 307 SKETLLRGNLKNGLYVFDEVQILKPDVYEHTV---DSVPTNR-TTLYVQRTRKSY----- 366

Query: 361 AVWHKCLGHHSSKILNTLVKKCN--------------ACQLEKAHNLPFTISQSRAAESF 420
            VWH  L H S KI+  ++K CN              +C + K+H LPF+ S +      
Sbjct: 367 DVWHNRLAHASRKIVQAVMKTCNVPMPINHQDSIVCKSCCIGKSHTLPFSDSYTVYNSPL 426

Query: 421 DIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQKSTAIDAFKHFITYVKNQFN 480
           +++YSD+WGP+   S +GFRYY+ F D +S+YT       KS     F HF + V+NQF 
Sbjct: 427 ELVYSDVWGPSHYASREGFRYYVHFTDAFSKYTWIYFMHNKSETAHHFIHFKSMVENQFG 486

Query: 481 KSIKVFQSDNGGEYKKIHLLCTTLGI 482
             IK+FQSD G E+  +  L    GI
Sbjct: 487 HKIKMFQSDGGKEFTCLTKLFNENGI 491

BLAST of CSPI07G08330 vs. NCBI nr
Match: gi|1012364758|gb|KYP75940.1| (Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan])

HSP 1 Score: 232.3 bits (591), Expect = 2.3e-57
Identity = 154/446 (34.53%), Positives = 229/446 (51.35%), Query Frame = 1

Query: 61  LLGLDEKRLEHQNSQRNT---GNAVRNVTINVAQRTNSNEHKSHNGQQFYGQRGNSNNGR 120
           LL   E+RLE   +  +     N  +    +  Q TN     ++NG    G     + GR
Sbjct: 201 LLMAQEERLEKHKASESVPLQANLAQGSFNSRKQFTNQGRGSNNNGTHGRGP----HRGR 260

Query: 121 GRGR-----GRGNKPTCQVCDKYDHYALACYNQFNRDFMSPLVQDRGQNLSASANP-NPS 180
           GRGR     GRGNK  CQVC K  H A  C++++++ +  P       NL+ + N  NP 
Sbjct: 261 GRGRSAQHFGRGNKQPCQVCGKIGHIAFHCWHRYDQQYTEP-------NLNHNTNVYNPG 320

Query: 181 AFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLN 240
                Q+     +   V D  WY DSGATNH+T+    L + T+Y+G++K  +GNG  + 
Sbjct: 321 NQQQMQAMIA-GSQNMVYDDQWYPDSGATNHLTSDLNNLGSKTDYTGQDKIHMGNGQAIG 380

Query: 241 ISCVGHSYLTD--AKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKA 300
           I+  G S+     + +   LK +L VP ITKNL+S SK  +DN++++EFH  YCLVK + 
Sbjct: 381 INHTGTSFFHTPMSSKIFTLKELLHVPHITKNLLSVSKFCKDNNVYVEFHTNYCLVKSQD 440

Query: 301 TGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVSINV 360
           + E LL+  LK+GLY  + V ++     EH+      +  N+ T+ ++    K+      
Sbjct: 441 SKETLLRGNLKNGLYVFDEVQILKPDVYEHTV---DSVPTNR-TTLYVQRTRKSY----- 500

Query: 361 AVWHKCLGHHSSKILNTLVKKCN--------------ACQLEKAHNLPFTISQSRAAESF 420
            VWH  L H S KI+  ++K CN              +C + K+H LPF+ S +      
Sbjct: 501 DVWHNRLAHASRKIVQAVMKTCNVPMPINHQDSIVCKSCCIGKSHTLPFSDSYTVYNSPL 560

Query: 421 DIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQKSTAIDAFKHFITYVKNQFN 480
           +++YSD+WGP+   S +GFRYY+ F D +S+YT       KS     F HF + V+NQF 
Sbjct: 561 ELVYSDVWGPSHYASREGFRYYVHFTDAFSKYTWIYFMHNKSETAHHFIHFKSMVENQFG 620

Query: 481 KSIKVFQSDNGGEYKKIHLLCTTLGI 482
             IK+FQSD G E+  +  L    GI
Sbjct: 621 HKIKMFQSDGGKEFTCLTKLFNENGI 625

BLAST of CSPI07G08330 vs. NCBI nr
Match: gi|147790209|emb|CAN61322.1| (hypothetical protein VITISV_012106 [Vitis vinifera])

HSP 1 Score: 230.3 bits (586), Expect = 8.7e-57
Identity = 156/458 (34.06%), Positives = 230/458 (50.22%), Query Frame = 1

Query: 66  EKRLEHQNSQRNTGNAVRNVTINVAQRTNSNEH-KSHNGQQFYGQRGNSNNGRGRGRGRG 125
           E RLE Q+S       +  ++ N A  +N+    +  NG +  G   N+NN   RGRGRG
Sbjct: 218 EHRLEQQSS-------IEQMSANYASSSNNRGGGRKFNGGRGQGYSPNNNNYTYRGRGRG 277

Query: 126 N--------------KPTCQVCDKYDHYALACYNQFNRDFMSPLVQDRGQNLSASANPNP 185
                          KP CQ+C K+ H A  CY++F+  F          +L+     N 
Sbjct: 278 GRNGQGGRQNSSPSEKPQCQLCGKFGHTAQICYHRFDISFQGGQTTI-SHSLNNGNQNNI 337

Query: 186 SAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNL 245
            A V + S  P        D +WY+DSGA++H+T     LT+ + Y+G +K T+GNG +L
Sbjct: 338 PAMVASASNNP-------ADESWYLDSGASHHLTQNLGNLTSTSPYTGTDKVTIGNGKHL 397

Query: 246 NISCVGHSYLTDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKAT 305
           +IS +G   L        LK V  VP I+ NL+S +K   +N+  +EFH+    VKD  T
Sbjct: 398 SISNIGSKQLHSHTHSFRLKKVFHVPFISANLISVAKFCSENNALIEFHSNAFFVKDLHT 457

Query: 306 GEILLKETLKDGLYHLENVGLMVAAELE-HSTIKKQLIHKNKETSTFILSGGKNLVSINV 365
             +L +  L++GLY        V + L+ +S+I       ++ +ST         V    
Sbjct: 458 KMVLAQGKLENGLYKFP-----VFSNLKPYSSINNASAFHSQFSST---------VENKA 517

Query: 366 AVWHKCLGHHSSKILNTLVKKCNA------------CQLEKAHNLPFTISQSRAAESFDI 425
            +WH  LGH S  I++ ++  CN             CQL K+H LP  +S   A++  ++
Sbjct: 518 ELWHNRLGHASFDIVSKVMNTCNVASGKYKSFVCSDCQLAKSHRLPTQLSNFHASKPLEL 577

Query: 426 IYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQKSTAIDAFKHFITYVKNQFNKS 485
           +Y+D+WGP  I S+ G RY+I+FVDDYSRYT   S + K  A+  FK F   ++NQF+  
Sbjct: 578 VYTDIWGPASIKSTSGARYFILFVDDYSRYTWFYSLQTKDQALPIFKXFKLQMENQFDTK 637

Query: 486 IKVFQSDNGGEYKKIHLLCTTLGINWDAHTYFTRPYQS 496
           IK  QSDNGGE++        +GI   AH  F+ PY S
Sbjct: 638 IKCLQSDNGGEFRSFTSFLQAVGI---AHR-FSCPYNS 642
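
The percentages in the HSP summary lines follow directly from the counts shown next to them: matching (or positively scoring) positions divided by the alignment length. The short Python sketch below reproduces them for the last two HSPs; the function name `percent` and the dictionary layout are illustrative assumptions, not part of the BLAST output.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the HSP summary lines above (hypothetical layout).
hsps = {
    "KYP75940.1": {"identities": 154, "positives": 229, "length": 446},
    "CAN61322.1": {"identities": 156, "positives": 230, "length": 458},
}

for accession, h in hsps.items():
    identity = percent(h["identities"], h["length"])
    positives = percent(h["positives"], h["length"])
    print(f"{accession}: identity {identity}%, positives {positives}%")
```

The alignment length in the denominator includes gap columns, so it can differ from the number of query residues covered.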

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
POLX_TOBAC   2.4e-22  25.92         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
YG22B_YEAST  6.7e-09  21.25         Transposon Ty2-GR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ...
YG21B_YEAST  6.7e-09  21.25         Transposon Ty2-GR1 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ...
YO21B_YEAST  6.7e-09  21.25         Transposon Ty2-OR1 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ...
YD22B_YEAST  6.7e-09  21.25         Transposon Ty2-DR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ...

Match Name        E-value  Identity (%)  Description
A0A151S6M8_CAJCA  2.1e-78  35.28         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=...
A0A151U9E7_CAJCA  1.6e-57  34.53         Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) OS=Cajanu...
A0A151SEF4_CAJCA  1.6e-57  34.53         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=...
A5BUG6_VITVI      7.9e-57  29.53         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_005638 PE=4 SV=1
A5BFT3_VITVI      1.0e-56  33.47         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_017741 PE=4 SV=1

Match Name                     E-value  Identity (%)  Description
gi|1012339207|gb|KYP50444.1|   3.1e-78  35.28         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
gi|147816383|emb|CAN68489.1|   7.1e-59  30.60         hypothetical protein VITISV_037543 [Vitis vinifera]
gi|1012342015|gb|KYP53212.1|   2.3e-57  34.53         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
gi|1012364758|gb|KYP75940.1|   2.3e-57  34.53         Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus ca...
gi|147790209|emb|CAN61322.1|   8.7e-57  34.06         hypothetical protein VITISV_012106 [Vitis vinifera]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001584   Integrase_cat-core
IPR012337   RNaseH-like_sf
Vocabulary: Biological Process
Term        Definition
GO:0015074  DNA integration
Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003676      nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI07G08330.1  CSPI07G08330.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description             Source   Source Term        Source Description               Alignment
IPR001584  Integrase, catalytic core   PFAM     PF00665            rve                              coord: 395..494, score: 1.4
IPR001584  Integrase, catalytic core   PROFILE  PS50994            INTEGRASE                        coord: 379..481, score: 10
IPR012337  Ribonuclease H-like domain  GENE3D   G3DSA:3.30.420.10                                   coord: 388..487, score: 1.7
IPR012337  Ribonuclease H-like domain  unknown  SSF53098           Ribonuclease H-like              coord: 390..487, score: 4.1
None       No IPR available            PANTHER  PTHR11439          GAG-POL-RELATED RETROTRANSPOSON  coord: 349..544, score: 3.3E-101
                                                                                                    coord: 1..323, score: 3.3E
None       No IPR available            PANTHER  PTHR11439:SF185    SUBFAMILY NOT NAMED              coord: 349..544, score: 3.3E-101
                                                                                                    coord: 1..323, score: 3.3E

The following gene(s) are orthologous to this gene:
Gene          Orthologue    Organism            Block
CSPI07G08330  Cucsa.247860  Cucumber (Gy14) v1  cgycpiB370
The following gene(s) are paralogous to this gene:

None