CSPI07G13050 (gene) Wild cucumber (PI 183967)

NameCSPI07G13050
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
LocationChr7 : 11493755 .. 11495774 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGGAGAAAGCCACATCTTGCACACTTGAGAGTCTTTGGTTGTGTGGCATATGTAAAGACCACAACCCCTCACCTCAAGAAACTCGACGATCGAAGCTCACCGGTGGTATATTTTGGTGTCGAAGAAGGATGTAAAGCTCATCGCTTATATGACCCAGGTCGTGAAAAACTACAAATTAGTAGAGATGTTGTTTTTCAAGAGAATCTTGAATGGCCTTGGAACGAAGTCGTTAGTGACGGTAAGGAGATTACAGAGTTTCAGGTGATGGACCAATTTTGTTCTGACGAGTTCGAAAACTTGGAGGATGCAGAAACTAGGGTTGAAAATGTCATACCACATGCAACTGAGATACCTGCGATTGGAGCAACCGGTCCATCTCCTCCATCAACGAACACACCGGTCCGTCTAAGATCTCTCACTCATATCTACATCAACACAGAGGAAGTTGTAGGTGGTGATGAACAAGAGAATGAGGTGATGATGGTAGTGTCTGAAGGATTGATTTGTTACCAAGAAGTTGTTAAAGAGGCCCAACTGGTACAAAGTAATGGAGAACGAATTAAAATCCATTGAGAAAAACAACACATGGAGTCTGACCAAGCTTCCACCAGGACACAAACCCATTGGTCTAAAATGGGTGTTCAAATTGAAGAAAGACCCTAGTGTTGAAGTTGTCAAGCACAATGCAAGATTGGTTGCTAAAGGCATTGACTTTGAAGAAGTTTTAGCACCAGTTGCAAGACTTGACACCGTTCGAGTCATTCTTGTACTAGTTGCAAATCAAAGTTGGGAGGTACACCATCTAGATGTGAAGTCGACATTTCTCAATGGAGAACTGGAAGAGGAAGTATATGTTACTCAATCAGAGGGTTTTGAGGTCCCAAATAAAAAACACAAGGTGCATAGATTGTCGAAGGCTCTCTACGGATTAAGGCAAGCTCCACGAGCTTGGAACATTCAACTTGATAGGAGTCTCAAAGAGCTTGGTTTTGGAAAATGCACTCAAAAGCAAGTAGTCTACACAAGAAGTGAAGGAGAAGAATGTGTTCTTGTTGAAGTGTATGTTGACAATCTCATTGTAGCAGGAAATAGCACTGAAAAGGTCAATAAGTTCAAGCAGCAAATGATGGCAGAATTTGAAATGAGCCACTTAGGTCTTCTCTCTTACTACTTAGGAATTGAAGTTGAACAACAGAAGGGTCGAATCCTGCTCAAACAACCAACTTATGCCAAAAGAATTTTGTCCCAATTTGGAATGGCTGATTGCAATGCCACAAAGTACCCGATGGAACCCAAGGCACAACTTCACAAAGACATGGAAGGAGCACCGATTGAAGCTACGGAGTACAGAAACATCGTTGGTTGTCTTAGATACTTACTGAACACAAGGCCAAATCTTTCATATGTTTTTGGGATGGCGAGTAAGTATATGGAAAGGCCTACAACCATACATTACAAGGTTGTCAAGCAAATACTTAGGTATTTGAGAGGGACAATTCACTTTGGGCTCACTTATACGAAAGGTCCTAGACAATTCGATATATTAGGTTACTCTGACAGTGATTTAGTCAGTGATCTCGACGGGAGGAAAAGTACAAGTGGAATGAAATTTTACTTAAACGAAAGCTTGGTTTCATGGAATTCGCAAAAGCAAAAGACAGTGACACTCTCATCTTGCAAAGCCGAGTTCATTGCAGCTACTACCGCAGCTTGCCAAGCGTTGTGGTTAAGATGCATTGTTAGCGAGATAGTCAGAATGGAGCCAAGATCAGTAACATTATTCGTGGACAACAAATCCGCGATAGCTCTCATGAAGAATCCTGTATTTCATGGTTGTGGCAAGCACATAGATACATGTTTTCATTTCATTCAAGAGTGTGTTGAGAATGGACAAATTATCGTTGAATTTGTCAATACTAGAGAACAACGAGCCGATGTTTTGACTAAAGCATTGACAAGAGTAAAGTTAGCTGCTATGTGTCAGCTACTTGGTGTTCGTAATTTAGAATCATGTTAG

mRNA sequence

ATGGGGAGAAAGCCACATCTTGCACACTTGAGAGTCTTTGGTTGTGTGGCATATGTAAAGACCACAACCCCTCACCTCAAGAAACTCGACGATCGAAGCTCACCGGTGGTATATTTTGGTGTCGAAGAAGGATGTAAAGCTCATCGCTTATATGACCCAGGTCGTGAAAAACTACAAATTAGTAGAGATGTTGTTTTTCAAGAGAATCTTGAATGGCCTTGGAACGAAGTCGTTAGTGACGGTAAGGAGATTACAGAGTTTCAGGTGATGGACCAATTTTGTTCTGACGAGTTCGAAAACTTGGAGGATGCAGAAACTAGGGTTGAAAATGTCATACCACATGCAACTGAGATACCTGCGATTGGAGCAACCGGTCCATCTCCTCCATCAACGAACACACCGGTCCGTCTAAGATCTCTCACTCATATCTACATCAACACAGAGGAAGTTGTAGGTGGTGATGAACAAGAGAATGAGAAGTTGTTAAAGAGGCCCAACTGGTACAAAGTAATGGAGAACGAATTAAAATCCATTGAGAAAAACAACACATGGAGTCTGACCAAGCTTCCACCAGGACACAAACCCATTGGTCTAAAATGGGTGTTCAAATTGAAGAAAGACCCTAGTGTTGAAGTTGTCAAGCACAATGCAAGATTGGTTGCTAAAGGCATTGACTTTGAAGAAGTTTTAGCACCAGTTGCAAGACTTGACACCGTTCGAGTCATTCTTGTACTAGTTGCAAATCAAAGTTGGGAGGTACACCATCTAGATGTGAAGTCGACATTTCTCAATGGAGAACTGGAAGAGGAAGTATATGTTACTCAATCAGAGGGTTTTGAGGTCCCAAATAAAAAACACAAGGTGCATAGATTGTCGAAGGCTCTCTACGGATTAAGGCAAGCTCCACGAGCTTGGAACATTCAACTTGATAGGAGTCTCAAAGAGCTTGGTTTTGGAAAATGCACTCAAAAGCAAGTAGTCTACACAAGAAGTGAAGGAGAAGAATGTGTTCTTGTTGAAGTGTATGTTGACAATCTCATTGTAGCAGGAAATAGCACTGAAAAGGTCAATAAGTTCAAGCAGCAAATGATGGCAGAATTTGAAATGAGCCACTTAGGTCTTCTCTCTTACTACTTAGGAATTGAAGTTGAACAACAGAAGGGTCGAATCCTGCTCAAACAACCAACTTATGCCAAAAGAATTTTGTCCCAATTTGGAATGGCTGATTGCAATGCCACAAAGTACCCGATGGAACCCAAGGCACAACTTCACAAAGACATGGAAGGAGCACCGATTGAAGCTACGGAGTACAGAAACATCGTTGGTTGTCTTAGATACTTACTGAACACAAGGCCAAATCTTTCATATGTTTTTGGGATGGCGAGTAAGTATATGGAAAGGCCTACAACCATACATTACAAGGTTGTCAAGCAAATACTTAGGTATTTGAGAGGGACAATTCACTTTGGGCTCACTTATACGAAAGGTCCTAGACAATTCGATATATTAGGTTACTCTGACAGTGATTTAGTCAGTGATCTCGACGGGAGGAAAAGTACAAGTGGAATGAAATTTTACTTAAACGAAAGCTTGGTTTCATGGAATTCGCAAAAGCAAAAGACAGTGACACTCTCATCTTGCAAAGCCGAGTTCATTGCAGCTACTACCGCAGCTTGCCAAGCGTTGTGGTTAAGATGCATTGTTAGCGAGATAGTCAGAATGGAGCCAAGATCAGTAACATTATTCGTGGACAACAAATCCGCGATAGCTCTCATGAAGAATCCTGTATTTCATGGTTGTGGCAAGCACATAGATACATGTTTTCATTTCATTCAAGAGTGTGTTGAGAATGGACAAATTATCGTTGAATTTGTCAATACTAGAGAACAACGAGCCGATGTTTTGACTAAAGCATTGACAAGAGTAAAGTTAGCTGCTATGTGTCAGCTACTTGGTGTTCGTAATTTAGAATCATGTTAG

Coding sequence (CDS)

ATGGGGAGAAAGCCACATCTTGCACACTTGAGAGTCTTTGGTTGTGTGGCATATGTAAAGACCACAACCCCTCACCTCAAGAAACTCGACGATCGAAGCTCACCGGTGGTATATTTTGGTGTCGAAGAAGGATGTAAAGCTCATCGCTTATATGACCCAGGTCGTGAAAAACTACAAATTAGTAGAGATGTTGTTTTTCAAGAGAATCTTGAATGGCCTTGGAACGAAGTCGTTAGTGACGGTAAGGAGATTACAGAGTTTCAGGTGATGGACCAATTTTGTTCTGACGAGTTCGAAAACTTGGAGGATGCAGAAACTAGGGTTGAAAATGTCATACCACATGCAACTGAGATACCTGCGATTGGAGCAACCGGTCCATCTCCTCCATCAACGAACACACCGGTCCGTCTAAGATCTCTCACTCATATCTACATCAACACAGAGGAAGTTGTAGGTGGTGATGAACAAGAGAATGAGAAGTTGTTAAAGAGGCCCAACTGGTACAAAGTAATGGAGAACGAATTAAAATCCATTGAGAAAAACAACACATGGAGTCTGACCAAGCTTCCACCAGGACACAAACCCATTGGTCTAAAATGGGTGTTCAAATTGAAGAAAGACCCTAGTGTTGAAGTTGTCAAGCACAATGCAAGATTGGTTGCTAAAGGCATTGACTTTGAAGAAGTTTTAGCACCAGTTGCAAGACTTGACACCGTTCGAGTCATTCTTGTACTAGTTGCAAATCAAAGTTGGGAGGTACACCATCTAGATGTGAAGTCGACATTTCTCAATGGAGAACTGGAAGAGGAAGTATATGTTACTCAATCAGAGGGTTTTGAGGTCCCAAATAAAAAACACAAGGTGCATAGATTGTCGAAGGCTCTCTACGGATTAAGGCAAGCTCCACGAGCTTGGAACATTCAACTTGATAGGAGTCTCAAAGAGCTTGGTTTTGGAAAATGCACTCAAAAGCAAGTAGTCTACACAAGAAGTGAAGGAGAAGAATGTGTTCTTGTTGAAGTGTATGTTGACAATCTCATTGTAGCAGGAAATAGCACTGAAAAGGTCAATAAGTTCAAGCAGCAAATGATGGCAGAATTTGAAATGAGCCACTTAGGTCTTCTCTCTTACTACTTAGGAATTGAAGTTGAACAACAGAAGGGTCGAATCCTGCTCAAACAACCAACTTATGCCAAAAGAATTTTGTCCCAATTTGGAATGGCTGATTGCAATGCCACAAAGTACCCGATGGAACCCAAGGCACAACTTCACAAAGACATGGAAGGAGCACCGATTGAAGCTACGGAGTACAGAAACATCGTTGGTTGTCTTAGATACTTACTGAACACAAGGCCAAATCTTTCATATGTTTTTGGGATGGCGAGTAAGTATATGGAAAGGCCTACAACCATACATTACAAGGTTGTCAAGCAAATACTTAGGTATTTGAGAGGGACAATTCACTTTGGGCTCACTTATACGAAAGGTCCTAGACAATTCGATATATTAGGTTACTCTGACAGTGATTTAGTCAGTGATCTCGACGGGAGGAAAAGTACAAGTGGAATGAAATTTTACTTAAACGAAAGCTTGGTTTCATGGAATTCGCAAAAGCAAAAGACAGTGACACTCTCATCTTGCAAAGCCGAGTTCATTGCAGCTACTACCGCAGCTTGCCAAGCGTTGTGGTTAAGATGCATTGTTAGCGAGATAGTCAGAATGGAGCCAAGATCAGTAACATTATTCGTGGACAACAAATCCGCGATAGCTCTCATGAAGAATCCTGTATTTCATGGTTGTGGCAAGCACATAGATACATGTTTTCATTTCATTCAAGAGTGTGTTGAGAATGGACAAATTATCGTTGAATTTGTCAATACTAGAGAACAACGAGCCGATGTTTTGACTAAAGCATTGACAAGAGTAAAGTTAGCTGCTATGTGTCAGCTACTTGGTGTTCGTAATTTAGAATCATGTTAG
BLAST of CSPI07G13050 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 380.6 bits (976), Expect = 3.6e-104
Identity = 240/690 (34.78%), Postives = 378/690 (54.78%), Query Frame = 1

Query: 8    AHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQ 67
            +HL+VFGC A+         KLDD+S P ++ G  +    +RL+DP ++K+  SRDVVF+
Sbjct: 646  SHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFR 705

Query: 68   EN---LEWPWNEVVSDG-----------------KEITEFQVMDQFCSDEFENLEDAETR 127
            E+        +E V +G                  E T  +V +Q      E +E  E  
Sbjct: 706  ESEVRTAADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQG-EQPGEVIEQGEQL 765

Query: 128  VENVIPHATEIPAIGATGPSPPSTNTPVRLRSLTHIYINTEEVVGGDEQENEKL---LKR 187
             E V     E P  G     P   +   R+ S    Y +TE V+  D++E E L   L  
Sbjct: 766  DEGV--EEVEHPTQGEEQHQPLRRSERPRVESRR--YPSTEYVLISDDREPESLKEVLSH 825

Query: 188  PN---WYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVA 247
            P      K M+ E++S++KN T+ L +LP G +P+  KWVFKLKKD   ++V++ ARLV 
Sbjct: 826  PEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVV 885

Query: 248  KG------IDFEEVLAPVARLDTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQ 307
            KG      IDF+E+ +PV ++ ++R IL L A+   EV  LDVK+ FL+G+LEEE+Y+ Q
Sbjct: 886  KGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQ 945

Query: 308  SEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGE- 367
             EGFEV  KKH V +L+K+LYGL+QAPR W ++ D  +K   + K      VY +   E 
Sbjct: 946  PEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSEN 1005

Query: 368  ECVLVEVYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEV--EQQKGRILL 427
              +++ +YVD++++ G     + K K  +   F+M  LG     LG+++  E+   ++ L
Sbjct: 1006 NFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWL 1065

Query: 428  KQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDM------EGAPIEATEYRNIVGCLRY 487
             Q  Y +R+L +F M +      P+    +L K M      E   +    Y + VG L Y
Sbjct: 1066 SQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMY 1125

Query: 488  -LLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGY 547
             ++ TRP++++  G+ S+++E P   H++ VK ILRYLRGT    L +  G     + GY
Sbjct: 1126 AMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCF--GGSDPILKGY 1185

Query: 548  SDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVTLSSCKAEFIAATTAACQALWLR 607
            +D+D+  D+D RKS++G  F  +   +SW S+ QK V LS+ +AE+IAAT    + +WL+
Sbjct: 1186 TDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLK 1245

Query: 608  CIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFV 656
              + E+  +  +   ++ D++SAI L KN ++H   KHID  +H+I+E V++  + V  +
Sbjct: 1246 RFLQEL-GLHQKEYVVYCDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKI 1305

BLAST of CSPI07G13050 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 312.0 bits (798), Expect = 1.6e-83
Identity = 176/504 (34.92%), Postives = 291/504 (57.74%), Query Frame = 1

Query: 164  RPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAKG 223
            + +W + +  EL + + NNTW++TK P     +  +WVF +K +     +++ ARLVA+G
Sbjct: 903  KSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARG 962

Query: 224  ------IDFEEVLAPVARLDTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSE 283
                  ID+EE  APVAR+ + R IL LV   + +VH +DVK+ FLNG L+EE+Y+   +
Sbjct: 963  FTQKYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQ 1022

Query: 284  GFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGE--E 343
            G         V +L+KA+YGL+QA R W    +++LKE  F   +  + +Y   +G   E
Sbjct: 1023 GISC--NSDNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINE 1082

Query: 344  CVLVEVYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQP 403
             + V +YVD++++A     ++N FK+ +M +F M+ L  + +++GI +E Q+ +I L Q 
Sbjct: 1083 NIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQS 1142

Query: 404  TYAKRILSQFGMADCNATKYPMEPKAQ---LHKDMEGAPIEATEYRNIVGCLRYL-LNTR 463
             Y K+ILS+F M +CNA   P+  K     L+ D +      T  R+++GCL Y+ L TR
Sbjct: 1143 AYVKKILSKFNMENCNAVSTPLPSKINYELLNSDEDCN----TPCRSLIGCLMYIMLCTR 1202

Query: 464  PNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPR-QFDILGYSDSDL 523
            P+L+    + S+Y  +  +  ++ +K++LRYL+GTI   L + K    +  I+GY DSD 
Sbjct: 1203 PDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDW 1262

Query: 524  VSDLDGRKSTSGMKFYLNE-SLVSWNSQKQKTVTLSSCKAEFIAATTAACQALWLRCIVS 583
                  RKST+G  F + + +L+ WN+++Q +V  SS +AE++A   A  +ALWL+ +++
Sbjct: 1263 AGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLT 1322

Query: 584  EIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTRE 643
             I       + ++ DN+  I++  NP  H   KHID  +HF +E V+N  I +E++ T  
Sbjct: 1323 SINIKLENPIKIYEDNQGCISIANNPSCHKRAKHIDIKYHFAREQVQNNVICLEYIPTEN 1382

Query: 644  QRADVLTKALTRVKLAAMCQLLGV 654
            Q AD+ TK L   +   +   LG+
Sbjct: 1383 QLADIFTKPLPAARFVELRDKLGL 1400

BLAST of CSPI07G13050 vs. Swiss-Prot
Match: YCH4_YEAST (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY5A PE=5 SV=2)

HSP 1 Score: 145.2 bits (365), Expect = 2.5e-33
Identity = 94/303 (31.02%), Postives = 156/303 (51.49%), Query Frame = 1

Query: 256 LDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKE 315
           +DV + FLN  ++E +YV Q  GF        V  L   +YGL+QAP  WN  ++ +LK+
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 316 LGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLL 375
           +GF +   +  +Y RS  +  + + VYVD+L+VA  S +  ++ KQ++   + M  LG +
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 376 SYYLGIEVEQ-QKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEA 435
             +LG+ + Q   G I L    Y  +  S+  +     T+ P+     L +       + 
Sbjct: 121 DKFLGLNIHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKDI 180

Query: 436 TEYRNIVGCLRYLLNT-RPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTY 495
           T Y++IVG L +  NT RP++SY   + S+++  P  IH +  +++LRYL  T    L Y
Sbjct: 181 TPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLKY 240

Query: 496 TKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQK-TVTLSSCKAEFI 555
             G  Q  +  Y D+   +  D   ST G    L  + V+W+S+K K  + + S +AE+I
Sbjct: 241 RSG-SQLALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPSTEAEYI 300

BLAST of CSPI07G13050 vs. Swiss-Prot
Match: M810_ARATH (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 138.7 bits (348), Expect = 2.4e-31
Identity = 72/223 (32.29%), Postives = 126/223 (56.50%), Query Frame = 1

Query: 341 VYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKR 400
           +YVD++++ G+S   +N    Q+ + F M  LG + Y+LGI+++     + L Q  YA++
Sbjct: 5   LYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYAEQ 64

Query: 401 ILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGCLRYLLNTRPNLSYVFGM 460
           IL+  GM DC     P+  K          P + +++R+IVG L+YL  TRP++SY   +
Sbjct: 65  ILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP-DPSDFRSIVGALQYLTLTRPDISYAVNI 124

Query: 461 ASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKST 520
             + M  PT   + ++K++LRY++GTI  GL Y     + ++  + DSD       R+ST
Sbjct: 125 VCQRMHEPTLADFDLLKRVLRYVKGTIFHGL-YIHKNSKLNVQAFCDSDWAGCTSTRRST 184

Query: 521 SGMKFYLNESLVSWNSQKQKTVTLSSCKAEFIAATTAACQALW 564
           +G   +L  +++SW++++Q TV+ SS + E+ A    A +  W
Sbjct: 185 TGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI07G13050 vs. Swiss-Prot
Match: YH41B_YEAST (Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-H PE=3 SV=1)

HSP 1 Score: 99.4 bits (246), Expect = 1.6e-19
Identity = 111/552 (20.11%), Postives = 249/552 (45.11%), Query Frame = 1

Query: 129  PSTNTPVRLRSLTHIYINTEEVVGGDEQENEKLLKRPNWYKVMEN--ELKSIEKNNTWSL 188
            P     V  + +  IY N  E +  +    EK   +  ++K ++N  ++K  + +  +S 
Sbjct: 1256 PDNMETVSAQKIRAIYYN--EAISKNPDLKEKHEYKQAYHKELQNLKDMKVFDVDVKYSR 1315

Query: 189  TKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAKGID-----FEEVLAPVARLDTVRV 248
            +++P  +  +    +F  K++       + AR+V +G       +  +       + +++
Sbjct: 1316 SEIPD-NLIVPTNTIFTKKRNGI-----YKARIVCRGDTQSPDTYSVITTESLNHNHIKI 1375

Query: 249  ILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQA 308
             L++  N++  +  LD+   FL  +LEEE+Y+        P+ +  V +L+KALYGL+Q+
Sbjct: 1376 FLMIANNRNMFMKTLDINHAFLYAKLEEEIYIPH------PHDRRCVVKLNKALYGLKQS 1435

Query: 309  PRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTEKVNKFKQ 368
            P+ WN  L + L  +G    +    +Y     ++ +++ VYVD+ ++A ++ +++++F  
Sbjct: 1436 PKEWNDHLRQYLNGIGLKDNSYTPGLYQTE--DKNLMIAVYVDDCVIAASNEQRLDEFIN 1495

Query: 369  QMMAEFEMSHLGLL------SYYLGIEVEQQK--GRILLKQPTYAKRILSQFG------- 428
            ++ + FE+   G L      +  LG+++   K  G I L   ++  R+  ++        
Sbjct: 1496 KLKSNFELKITGTLIDDVLDTDILGMDLVYNKRLGTIDLTLKSFINRMDKKYNEELKKIR 1555

Query: 429  -MADCNATKYPMEPKAQ-LHKDMEGAPIEATEYRNIVGCLRYLLN-TRPNLSYVFGMASK 488
              +  + + Y ++PK   L    E       + + ++G L Y+ +  R ++++     ++
Sbjct: 1556 KSSIPHMSTYKIDPKKDVLQMSEEEFRQGVLKLQQLLGELNYVRHKCRYDINFAVKKVAR 1615

Query: 489  YM----ERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKS 548
             +    ER   + YK+++ ++RY    IH+     K  +   ++  +D+ + S+ D  +S
Sbjct: 1616 LVNYPHERVFYMIYKIIQYLVRYKDIGIHYDRDCNKDKK---VIAITDASVGSEYDA-QS 1675

Query: 549  TSGMKFYLNESLVSWNSQKQKTVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSV 608
              G+  +   ++ +  S K     +SS +AE  A       +  L+  + E+   +   +
Sbjct: 1676 RIGVILWYGMNIFNVYSNKSTNRCVSSTEAELHAIYEGYADSETLKVTLKELGEGDNNDI 1735

Query: 609  TLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKAL 652
             +  D+K AI  +         K        I+E ++   I +  +  +   AD+LTK +
Sbjct: 1736 VMITDSKPAIQGLNRSYQQPKEKFTWIKTEIIKEKIKEKSIKLLKITGKGNIADLLTKPV 1787

BLAST of CSPI07G13050 vs. TrEMBL
Match: B8BDZ6_ORYSI (Putative uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_30754 PE=4 SV=1)

HSP 1 Score: 763.8 bits (1971), Expect = 1.7e-217
Identity = 402/716 (56.15%), Postives = 494/716 (68.99%), Query Frame = 1

Query: 2    GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQIS 61
            GRKP L HLRVFGC+A+ K TTP+ KKLDDRS+P VY GVEEG KAHRL+DP   ++ +S
Sbjct: 713  GRKPQLGHLRVFGCIAHAKITTPNQKKLDDRSAPYVYLGVEEGSKAHRLFDPRCGRIHVS 772

Query: 62   RDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENLEDAETRVENVIPHATEIPAI 121
            RDV+F+EN+ W W+ VV+  +  TEF V      ++  +   A      V  +    PA+
Sbjct: 773  RDVIFEENVPWQWS-VVAGEQNSTEFTV-----EEDGVDAPPAGAPAYPVPRYRAPSPAV 832

Query: 122  -----------------------GATGPSPPSTNT------------------PVRLRSL 181
                                    +T PS P+T +                  PVR RSL
Sbjct: 833  PQSPLASPVGASPSLPTSPQSSPSSTPPSTPATGSVGPVASPGSSGDLRSDEGPVRFRSL 892

Query: 182  THIYINTEEV-VGGDEQENEKLLK-------------RPNWYKVMENELKSIEKNNTWSL 241
              I      V +  DE + + LL              +P W   M  EL++IEKN+TW+L
Sbjct: 893  EDIMREAPRVDLVEDEHDGDALLAEMEEPSSYREAAGQPAWENAMAQELQAIEKNSTWAL 952

Query: 242  TKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVR 301
            T LP GHKPIGLKWV+KLKK+ + EV+KH ARLVAK      G+DFEEV APVARLDTVR
Sbjct: 953  TALPAGHKPIGLKWVYKLKKNTAGEVIKHKARLVAKGYVQRQGVDFEEVFAPVARLDTVR 1012

Query: 302  VILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQ 361
            VIL + A++ WEVHHLDVKS FLNG+LEEEVYV Q EGF    ++H V RLSKALYGLRQ
Sbjct: 1013 VILAIAADRRWEVHHLDVKSAFLNGDLEEEVYVAQPEGFVKRGEEHLVLRLSKALYGLRQ 1072

Query: 362  APRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTEKVNKFK 421
            APRAWN +LD+ LKELGF +CTQ+Q VYTR +G+  V+V VYVD+LIV G + +++  FK
Sbjct: 1073 APRAWNTRLDKCLKELGFARCTQEQAVYTRGKGQAGVIVGVYVDDLIVTGENPQEIAMFK 1132

Query: 422  QQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPK 481
            QQMM EFEMS LGLLSYYLGIEV Q +  I +KQ  YAK+ILSQFGM  CN T  PMEP+
Sbjct: 1133 QQMMGEFEMSDLGLLSYYLGIEVIQGENGIAIKQAAYAKKILSQFGMQGCNPTSIPMEPR 1192

Query: 482  AQLHKDMEGAPIEATEYRNIVGCLRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQIL 541
            + LHKD +G PI+ATEYR ++GCLRYLL+TRP+LSY  G+AS++MERPTT+H K VK IL
Sbjct: 1193 SLLHKDADGNPIDATEYRRVIGCLRYLLHTRPDLSYAVGVASRFMERPTTMHLKAVKMIL 1252

Query: 542  RYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQK 601
            RYL+GT+  GL +  G    DI G++DSDL  D+D R+ST GM FY+N SLVSW SQKQK
Sbjct: 1253 RYLKGTLDSGLVFASGSGSLDITGFTDSDLAGDMDDRRSTGGMAFYVNSSLVSWCSQKQK 1312

Query: 602  TVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGC 657
            TV LSSC+AEF+AAT AAC ALWLR ++SE++  E + V LFVDNKSAIALMKNPVFHG 
Sbjct: 1313 TVALSSCEAEFMAATAAACHALWLRALLSEMMGTEAKPVKLFVDNKSAIALMKNPVFHGR 1372

BLAST of CSPI07G13050 vs. TrEMBL
Match: Q10RM4_ORYSJ (Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica GN=LOC_Os03g05850 PE=4 SV=1)

HSP 1 Score: 763.1 bits (1969), Expect = 2.9e-217
Identity = 396/722 (54.85%), Postives = 495/722 (68.56%), Query Frame = 1

Query: 2    GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQIS 61
            GRKP L HL+VFGC A+ K T PHLKKLDDRS+P VY GVEEG KAHRL+DP R ++ +S
Sbjct: 691  GRKPQLGHLKVFGCTAHAKNTQPHLKKLDDRSAPYVYLGVEEGSKAHRLFDPRRGRIHVS 750

Query: 62   RDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENLEDAETRVENVIPH------- 121
            RDVVF+EN+ W W      G+E T+F + ++      E L    T    V P+       
Sbjct: 751  RDVVFEENVPWEWTSAA--GQEPTDFAMEEE----PGEQLPSPATAAGVVPPYQAPSPGR 810

Query: 122  ---------ATEIPAI-----------GATGPSPPSTNT------------------PVR 181
                     A E+P+            G   P  PSTN+                  P R
Sbjct: 811  RAGKEAVVAAEEVPSPASPVAASPTLPGTPTPGSPSTNSAGVVPSPGTDDNIDTDDGPRR 870

Query: 182  LRSLTHIYINTEEV-VGGDEQENEKLLK-------------RPNWYKVMENELKSIEKNN 241
             RSL  +      V +  DE + E LL              +P W + M+ E+++IEKN 
Sbjct: 871  YRSLADVLREAPRVDLVEDECDGEALLAESEEPSSYREAAGQPAWEEAMQREMEAIEKNK 930

Query: 242  TWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAKG------IDFEEVLAPVARL 301
            TW L  LP GH+ IGLKWV+KLKK+ + E++KH ARLVAKG      +DFEEV APVARL
Sbjct: 931  TWELAMLPAGHRAIGLKWVYKLKKNTAGEIIKHKARLVAKGYVQKQGVDFEEVFAPVARL 990

Query: 302  DTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALY 361
            DTVRV+L + A++ W+VHHLDVKS FLNGELEEEVYV Q EGF    K+H V +L KALY
Sbjct: 991  DTVRVVLAVAADRRWQVHHLDVKSAFLNGELEEEVYVAQPEGFARSGKEHLVLKLHKALY 1050

Query: 362  GLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTEKV 421
            GLRQAPRAWNI+LDRSL+ELGF +CTQ+Q VYTR  G + ++V VYVD+LIV G +  ++
Sbjct: 1051 GLRQAPRAWNIRLDRSLRELGFDRCTQEQAVYTRGRGSDGIIVGVYVDDLIVTGENPSEL 1110

Query: 422  NKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYP 481
              FK+QMM EFEMS LGLL+YYLGIEV+Q +    LKQ  YAK++LSQFGM +CN+   P
Sbjct: 1111 KVFKEQMMGEFEMSDLGLLTYYLGIEVDQDESATTLKQTAYAKKLLSQFGMMECNSVSIP 1170

Query: 482  MEPKAQLHKDMEGAPIEATEYRNIVGCLRYLLNTRPNLSYVFGMASKYMERPTTIHYKVV 541
            ++P++QL KD EG P++ATEYR I+G LRYLL+TRP+LSY  G+AS++MERPT +H+K V
Sbjct: 1171 IDPRSQLSKDPEGHPVDATEYRRIIGSLRYLLHTRPDLSYAVGVASRFMERPTVMHFKAV 1230

Query: 542  KQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNS 601
            KQILRY++GT+ +GL Y  G     I GY+DSDL  DLD R+ST GM FY+N+SLV+W+S
Sbjct: 1231 KQILRYIKGTMDYGLVYAAGTGALKITGYTDSDLAGDLDDRRSTGGMAFYINQSLVAWSS 1290

Query: 602  QKQKTVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPV 659
            QKQKTV LSSC+AEF+AATTAACQALWLR +++E+  +E ++V LFVDN+SAIALMKNPV
Sbjct: 1291 QKQKTVALSSCEAEFMAATTAACQALWLRLLLAEVAGVEEKAVKLFVDNRSAIALMKNPV 1350

BLAST of CSPI07G13050 vs. TrEMBL
Match: Q0J8A6_ORYSJ (Os08g0125300 protein OS=Oryza sativa subsp. japonica GN=Os08g0125300 PE=2 SV=1)

HSP 1 Score: 762.7 bits (1968), Expect = 3.7e-217
Identity = 402/716 (56.15%), Postives = 492/716 (68.72%), Query Frame = 1

Query: 2    GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQIS 61
            GRKP L HLRVFGC+A+ K TTP+ KKLDDRS+P VY GVEEG KAHRL+DP   ++ +S
Sbjct: 713  GRKPQLGHLRVFGCIAHAKITTPNQKKLDDRSAPYVYLGVEEGSKAHRLFDPRCGRIHVS 772

Query: 62   RDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENLEDAETRVENVIPHATEIPAI 121
            RDV+F+EN+ W W+ VV+  +  TEF V      ++  +   A      V  +    PA+
Sbjct: 773  RDVIFEENVPWQWS-VVAGEQNSTEFTV-----EEDGVDAPPAGAPAYPVPRYRAPSPAV 832

Query: 122  -----------------------GATGPSPPSTNT------------------PVRLRSL 181
                                    +T PS P+T +                  PVR RSL
Sbjct: 833  PQSPPASPVGASSSLPTSPQSSPSSTPPSTPATGSAGPVASPGSGGDLRSDEGPVRFRSL 892

Query: 182  THIYINTEEV-VGGDEQENEKLLK-------------RPNWYKVMENELKSIEKNNTWSL 241
              I      V +  DE + + LL              +P W   M  EL++IEKN+TW+L
Sbjct: 893  EDIMREAPRVDLVEDEHDGDALLAEMEEPSSYREAAGQPAWENAMAQELQAIEKNSTWAL 952

Query: 242  TKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVR 301
            T LP GHKPIGLKWV+KLKK+ + EV+KH ARLVAK      G+DFEEV APVARLDTVR
Sbjct: 953  TALPAGHKPIGLKWVYKLKKNTAGEVIKHKARLVAKGYVQRQGVDFEEVFAPVARLDTVR 1012

Query: 302  VILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQ 361
            VIL + A++ WEVHHLDVKS FLNG+LEEEVYV Q EGF    ++H V RLSKALYGLRQ
Sbjct: 1013 VILAIAADRRWEVHHLDVKSAFLNGDLEEEVYVAQPEGFVKRGEEHLVLRLSKALYGLRQ 1072

Query: 362  APRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTEKVNKFK 421
            APRAWN +LD+ LKELGF +CTQ+Q VYTR +G+  V+V VYVD+LIV G +  ++  FK
Sbjct: 1073 APRAWNTRLDKCLKELGFARCTQEQAVYTRGKGQAGVIVGVYVDDLIVTGENPHEIAMFK 1132

Query: 422  QQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPK 481
            QQMM EFEMS LGLLSYYLGIEV Q +  I +KQ  YAK+ILSQFGM  CN T  PMEP+
Sbjct: 1133 QQMMGEFEMSDLGLLSYYLGIEVIQGENGIAIKQAAYAKKILSQFGMQGCNPTSIPMEPR 1192

Query: 482  AQLHKDMEGAPIEATEYRNIVGCLRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQIL 541
            + LHKD +G PI+ATEYR ++GCLRYLL+TRP+LSY  G+AS++MERPTT+H K VK IL
Sbjct: 1193 SLLHKDADGNPIDATEYRRVIGCLRYLLHTRPDLSYAVGVASRFMERPTTMHLKAVKMIL 1252

Query: 542  RYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQK 601
            RYL+GT+  GL +  G    DI G++DSDL  D+D R+ST GM FY+N SLVSW SQKQK
Sbjct: 1253 RYLKGTLDSGLVFASGSGSLDITGFTDSDLAGDMDDRRSTGGMAFYVNSSLVSWCSQKQK 1312

Query: 602  TVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGC 657
            TV LSSC+AEF+AAT AAC ALWLR ++SE++  E + V LFVDNKSAIALMKNPVFHG 
Sbjct: 1313 TVALSSCEAEFMAATAAACHALWLRALLSEMMGTEAKRVKLFVDNKSAIALMKNPVFHGR 1372

BLAST of CSPI07G13050 vs. TrEMBL
Match: Q338J6_ORYSJ (Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica GN=LOC_Os10g26030 PE=4 SV=2)

HSP 1 Score: 647.5 bits (1669), Expect = 1.7e-182
Identity = 348/662 (52.57%), Postives = 436/662 (65.86%), Query Frame = 1

Query: 2    GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQIS 61
            G+KPHL HLRVFGC A+ K T PHLKKLDDRS+P VY GVEEG KAHRL+DP R ++ +S
Sbjct: 560  GKKPHLGHLRVFGCTAHAKVTAPHLKKLDDRSNPFVYLGVEEGSKAHRLFDPRRRQIIVS 619

Query: 62   RDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENLEDAETRVENVIPHATEIPAI 121
            RDVVF EN  W W+    +    TEF+V +   +++    E A                 
Sbjct: 620  RDVVFDENTPWQWSAAAGEVTS-TEFEVEEPVGAEQPALAEQA----------------- 679

Query: 122  GATGPSPPSTNT-PVRLRSLTHIYINTEEVVGGDEQENEKLLK--------------RPN 181
            G   P    ++  PVR RSL  I +    V   ++ ++ + L                P 
Sbjct: 680  GLASPHTAGSDVGPVRYRSLAEIMLEAPRVDLVEDDDDARALLAEMEEPLSYREATGEPA 739

Query: 182  WYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVA----- 241
            W   M  EL++IEKN TWSL  LP GHK IGLKWVFKLKK+ + EV+KH ARLVA     
Sbjct: 740  WVNAMNKELEAIEKNKTWSLCMLPAGHKAIGLKWVFKLKKNTAGEVIKHKARLVANGYVQ 799

Query: 242  -KGIDFEEVLAPVARLDTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFE 301
             +G+DF+EV APVARLDTVR IL + A++ W+VHHLDVKS FLNG+LEEEVYV+Q EGF 
Sbjct: 800  QQGVDFDEVFAPVARLDTVRAILAVAADRRWQVHHLDVKSAFLNGDLEEEVYVSQLEGFV 859

Query: 302  VPNKKHKVHRLSKALYGLRQAPRAWNIQLDR---SLKELGFGKCTQKQVVYTRSEGEECV 361
               K+H V+ LSKALYGLR   +A      R   S+KELGF + ++              
Sbjct: 860  EKGKEHLVYELSKALYGLR---QAPRAWNTRLDRSMKELGFSRPSE-------------- 919

Query: 362  LVEVYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTY 421
                              +  FKQQMM EFE+S LGLL+YYLGIEV Q    I +KQ  Y
Sbjct: 920  ------------------ITAFKQQMMGEFEISDLGLLTYYLGIEVLQGTDGIAIKQAAY 979

Query: 422  AKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGCLRYLLNTRPNLSYV 481
            A++IL+QFGM DCN+T  P+E ++QLHK  EG+ ++ TEYR ++GCLRYLL+T+P+LSY 
Sbjct: 980  ARKILTQFGMLDCNSTSIPIEHRSQLHKVAEGSTVDPTEYRRVIGCLRYLLHTQPDLSYA 1039

Query: 482  FGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGR 541
             G+ SK+ME+ T +H+K VKQILRYL+GTI+ GL ++ G    +I G++DSDL  D D R
Sbjct: 1040 VGVVSKFMEQLTVMHFKAVKQILRYLKGTINCGLMFSGGNDAVEITGFTDSDLAGDSDDR 1099

Query: 542  KSTSGMKFYLNESLVSWNSQKQKTVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEPR 601
            +STSGM FY N SLVSW+SQKQKTV LSSC+AEF+AAT AACQALWLR ++ E++  E R
Sbjct: 1100 RSTSGMAFYFNSSLVSWSSQKQKTVALSSCEAEFMAATAAACQALWLRGLLIEMIGAEAR 1159

Query: 602  SVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTK 640
             V L+VDNKSAIALMKNPVFHG  KHIDT +HFI+ECVE+G+I +EFV T EQRAD LTK
Sbjct: 1160 PVKLYVDNKSAIALMKNPVFHGRSKHIDTRYHFIRECVESGKIQIEFVRTEEQRADALTK 1168

BLAST of CSPI07G13050 vs. TrEMBL
Match: Q84SW8_ORYSJ (Gag-pol polyprotein OS=Oryza sativa subsp. japonica GN=LOC_Os03g47410 PE=4 SV=1)

HSP 1 Score: 643.7 bits (1659), Expect = 2.5e-181
Identity = 359/712 (50.42%), Postives = 442/712 (62.08%), Query Frame = 1

Query: 2    GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQIS 61
            G+KPHL HLRVFGC A+ K T PHLKKLDDRS+PVVY GVEEG KAHRL+DP R ++ +S
Sbjct: 666  GKKPHLGHLRVFGCTAHAKVTAPHLKKLDDRSNPVVYLGVEEGSKAHRLFDPRRRQIIVS 725

Query: 62   RDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENLEDAET----RVENVIPHATE 121
            RDVVF EN  W W+    +    TEF+V +   +++    E A +    R       A +
Sbjct: 726  RDVVFDENTPWQWSAAAGEVTS-TEFEVEEPVGAEQPAPAEQAGSVPWYRAPPAGRRAGK 785

Query: 122  IPAIGATGPSPP----------------------------------STNTPVRLRSLTHI 181
             P +     +PP                                  S + PVR RSL  I
Sbjct: 786  EPEVAEQRGTPPASPARFSPTLPSTPTLGSSSTHSAEVQASPRTAGSDDEPVRYRSLAEI 845

Query: 182  YINTEEV-VGGDEQENEKLL-------------KRPNWYKVMENELKSIEKNNTWSLTKL 241
             I    V +  D+ + E LL               P W   M  EL++IEKN TWSL  L
Sbjct: 846  MIEAPRVDLVEDDDDTEALLVEMEEPTSYREAAGEPAWVNAMNKELEAIEKNKTWSLCML 905

Query: 242  PPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL 301
            P  HK IGLKWVFKLKK+ + EV+KH ARLVAK      G+DF+EV APVARLDTVR IL
Sbjct: 906  PASHKAIGLKWVFKLKKNTAGEVIKHKARLVAKGYVQRQGVDFDEVFAPVARLDTVRAIL 965

Query: 302  VLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPR 361
             +  ++ W+VHHLDVKS FLNG+LEEEVYV+Q EGF    K+H V++LSKALYGLRQAPR
Sbjct: 966  PVAVDRRWQVHHLDVKSAFLNGDLEEEVYVSQPEGFVEKGKEHLVYKLSKALYGLRQAPR 1025

Query: 362  AWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTEKVNKFKQQM 421
            AWN +LDRS+KELGF +C Q+Q VYTR  G   ++V VYVD+LIV G S   +  FKQQM
Sbjct: 1026 AWNTRLDRSMKELGFSRCAQEQAVYTRGTGSTGIIVGVYVDDLIVTGESPSDITAFKQQM 1085

Query: 422  MAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQL 481
            M EFEMS LGLL+YYLGIE+ +      +  PT  +R++                 +  L
Sbjct: 1086 MGEFEMSDLGLLTYYLGIELHKDAQGSTV-DPTEYRRVIGCL--------------RYLL 1145

Query: 482  HKDMEGAPIEATEYRNIVGCLRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYL 541
            H                          RP+LSY  G+AS++MERPT +H+K VKQILRYL
Sbjct: 1146 HT-------------------------RPDLSYAVGVASRFMERPTVMHFKAVKQILRYL 1205

Query: 542  RGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVT 601
            +GTI+ GL ++ G    +I G++DSDL  D D R+STSGM FY N SLVSW+SQKQKTV 
Sbjct: 1206 KGTINCGLMFSGGNGAVEITGFTDSDLAGDSDDRRSTSGMAFYFNGSLVSWSSQKQKTVA 1265

Query: 602  LSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKH 656
            LSSC+AEF+AAT AAC ALWLR ++ E++  E R V L+VDNKSAIALMKNPVFHG  KH
Sbjct: 1266 LSSCEAEFMAATAAACHALWLRGLLIEMIGAEARPVKLYVDNKSAIALMKNPVFHGRSKH 1325

BLAST of CSPI07G13050 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 280.4 bits (716), Expect = 2.8e-75
Identity = 154/457 (33.70%), Postives = 253/457 (55.36%), Query Frame = 1

Query: 167 WYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAKG--- 226
           W   M++E+ ++E  +TW +  LPP  KPIG KWV+K+K +    + ++ ARLVAKG   
Sbjct: 98  WCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQ 157

Query: 227 ---IDFEEVLAPVARLDTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFE 286
              IDF E  +PV +L +V++IL + A  ++ +H LD+ + FLNG+L+EE+Y+    G+ 
Sbjct: 158 QEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYA 217

Query: 287 VPN----KKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEEC 346
                    + V  L K++YGL+QA R W ++   +L   GF +       + +      
Sbjct: 218 ARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLF 277

Query: 347 VLVEVYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPT 406
           + V VYVD++I+  N+   V++ K Q+ + F++  LG L Y+LG+E+ +    I + Q  
Sbjct: 278 LCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRK 337

Query: 407 YAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGCLRYLLNTRPNLSY 466
           YA  +L + G+  C  +  PM+P         G  ++A  YR ++G L YL  TR ++S+
Sbjct: 338 YALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDFVDAKAYRRLIGRLMYLQITRLDISF 397

Query: 467 VFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDG 526
                S++ E P   H + V +IL Y++GT+  GL Y+    +  +  +SD+   S  D 
Sbjct: 398 AVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYS-SQAEMQLQVFSDASFQSCKDT 457

Query: 527 RKSTSGMKFYLNESLVSWNSQKQKTVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEP 586
           R+ST+G   +L  SL+SW S+KQ+ V+ SS +AE+ A + A  + +WL     E+     
Sbjct: 458 RRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMWLAQFFRELQLPLS 517

Query: 587 RSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQE 614
           +   LF DN +AI +  N VFH   KHI++  H ++E
Sbjct: 518 KPTLLFCDNTAAIHIATNAVFHERTKHIESDCHSVRE 553

BLAST of CSPI07G13050 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 138.7 bits (348), Expect = 1.3e-32
Identity = 72/223 (32.29%), Postives = 126/223 (56.50%), Query Frame = 1

Query: 341 VYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKR 400
           +YVD++++ G+S   +N    Q+ + F M  LG + Y+LGI+++     + L Q  YA++
Sbjct: 5   LYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYAEQ 64

Query: 401 ILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGCLRYLLNTRPNLSYVFGM 460
           IL+  GM DC     P+  K          P + +++R+IVG L+YL  TRP++SY   +
Sbjct: 65  ILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP-DPSDFRSIVGALQYLTLTRPDISYAVNI 124

Query: 461 ASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKST 520
             + M  PT   + ++K++LRY++GTI  GL Y     + ++  + DSD       R+ST
Sbjct: 125 VCQRMHEPTLADFDLLKRVLRYVKGTIFHGL-YIHKNSKLNVQAFCDSDWAGCTSTRRST 184

Query: 521 SGMKFYLNESLVSWNSQKQKTVTLSSCKAEFIAATTAACQALW 564
           +G   +L  +++SW++++Q TV+ SS + E+ A    A +  W
Sbjct: 185 TGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI07G13050 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 63.5 bits (153), Expect = 5.5e-10
Identity = 36/88 (40.91%), Postives = 46/88 (52.27%), Query Frame = 1

Query: 162 LKRPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVA 221
           LK P W + M+ EL ++ +N TW L   P     +G KWVFK K      + +  ARLVA
Sbjct: 35  LKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVA 94

Query: 222 K------GIDFEEVLAPVARLDTVRVIL 244
           K      GI F E  +PV R  T+R IL
Sbjct: 95  KGFHQEEGIYFVETYSPVVRTATIRTIL 122

BLAST of CSPI07G13050 vs. NCBI nr
Match: gi|218201855|gb|EEC84282.1| (hypothetical protein OsI_30754 [Oryza sativa Indica Group])

HSP 1 Score: 763.8 bits (1971), Expect = 2.4e-217
Identity = 402/716 (56.15%), Postives = 494/716 (68.99%), Query Frame = 1

Query: 2    GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQIS 61
            GRKP L HLRVFGC+A+ K TTP+ KKLDDRS+P VY GVEEG KAHRL+DP   ++ +S
Sbjct: 713  GRKPQLGHLRVFGCIAHAKITTPNQKKLDDRSAPYVYLGVEEGSKAHRLFDPRCGRIHVS 772

Query: 62   RDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENLEDAETRVENVIPHATEIPAI 121
            RDV+F+EN+ W W+ VV+  +  TEF V      ++  +   A      V  +    PA+
Sbjct: 773  RDVIFEENVPWQWS-VVAGEQNSTEFTV-----EEDGVDAPPAGAPAYPVPRYRAPSPAV 832

Query: 122  -----------------------GATGPSPPSTNT------------------PVRLRSL 181
                                    +T PS P+T +                  PVR RSL
Sbjct: 833  PQSPLASPVGASPSLPTSPQSSPSSTPPSTPATGSVGPVASPGSSGDLRSDEGPVRFRSL 892

Query: 182  THIYINTEEV-VGGDEQENEKLLK-------------RPNWYKVMENELKSIEKNNTWSL 241
              I      V +  DE + + LL              +P W   M  EL++IEKN+TW+L
Sbjct: 893  EDIMREAPRVDLVEDEHDGDALLAEMEEPSSYREAAGQPAWENAMAQELQAIEKNSTWAL 952

Query: 242  TKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVR 301
            T LP GHKPIGLKWV+KLKK+ + EV+KH ARLVAK      G+DFEEV APVARLDTVR
Sbjct: 953  TALPAGHKPIGLKWVYKLKKNTAGEVIKHKARLVAKGYVQRQGVDFEEVFAPVARLDTVR 1012

Query: 302  VILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQ 361
            VIL + A++ WEVHHLDVKS FLNG+LEEEVYV Q EGF    ++H V RLSKALYGLRQ
Sbjct: 1013 VILAIAADRRWEVHHLDVKSAFLNGDLEEEVYVAQPEGFVKRGEEHLVLRLSKALYGLRQ 1072

Query: 362  APRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTEKVNKFK 421
            APRAWN +LD+ LKELGF +CTQ+Q VYTR +G+  V+V VYVD+LIV G + +++  FK
Sbjct: 1073 APRAWNTRLDKCLKELGFARCTQEQAVYTRGKGQAGVIVGVYVDDLIVTGENPQEIAMFK 1132

Query: 422  QQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPK 481
            QQMM EFEMS LGLLSYYLGIEV Q +  I +KQ  YAK+ILSQFGM  CN T  PMEP+
Sbjct: 1133 QQMMGEFEMSDLGLLSYYLGIEVIQGENGIAIKQAAYAKKILSQFGMQGCNPTSIPMEPR 1192

Query: 482  AQLHKDMEGAPIEATEYRNIVGCLRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQIL 541
            + LHKD +G PI+ATEYR ++GCLRYLL+TRP+LSY  G+AS++MERPTT+H K VK IL
Sbjct: 1193 SLLHKDADGNPIDATEYRRVIGCLRYLLHTRPDLSYAVGVASRFMERPTTMHLKAVKMIL 1252

Query: 542  RYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQK 601
            RYL+GT+  GL +  G    DI G++DSDL  D+D R+ST GM FY+N SLVSW SQKQK
Sbjct: 1253 RYLKGTLDSGLVFASGSGSLDITGFTDSDLAGDMDDRRSTGGMAFYVNSSLVSWCSQKQK 1312

Query: 602  TVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGC 657
            TV LSSC+AEF+AAT AAC ALWLR ++SE++  E + V LFVDNKSAIALMKNPVFHG 
Sbjct: 1313 TVALSSCEAEFMAATAAACHALWLRALLSEMMGTEAKPVKLFVDNKSAIALMKNPVFHGR 1372

BLAST of CSPI07G13050 vs. NCBI nr
Match: gi|108706239|gb|ABF94034.1| (retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group])

HSP 1 Score: 763.1 bits (1969), Expect = 4.1e-217
Identity = 396/722 (54.85%), Postives = 495/722 (68.56%), Query Frame = 1

Query: 2    GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQIS 61
            GRKP L HL+VFGC A+ K T PHLKKLDDRS+P VY GVEEG KAHRL+DP R ++ +S
Sbjct: 691  GRKPQLGHLKVFGCTAHAKNTQPHLKKLDDRSAPYVYLGVEEGSKAHRLFDPRRGRIHVS 750

Query: 62   RDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENLEDAETRVENVIPH------- 121
            RDVVF+EN+ W W      G+E T+F + ++      E L    T    V P+       
Sbjct: 751  RDVVFEENVPWEWTSAA--GQEPTDFAMEEE----PGEQLPSPATAAGVVPPYQAPSPGR 810

Query: 122  ---------ATEIPAI-----------GATGPSPPSTNT------------------PVR 181
                     A E+P+            G   P  PSTN+                  P R
Sbjct: 811  RAGKEAVVAAEEVPSPASPVAASPTLPGTPTPGSPSTNSAGVVPSPGTDDNIDTDDGPRR 870

Query: 182  LRSLTHIYINTEEV-VGGDEQENEKLLK-------------RPNWYKVMENELKSIEKNN 241
             RSL  +      V +  DE + E LL              +P W + M+ E+++IEKN 
Sbjct: 871  YRSLADVLREAPRVDLVEDECDGEALLAESEEPSSYREAAGQPAWEEAMQREMEAIEKNK 930

Query: 242  TWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAKG------IDFEEVLAPVARL 301
            TW L  LP GH+ IGLKWV+KLKK+ + E++KH ARLVAKG      +DFEEV APVARL
Sbjct: 931  TWELAMLPAGHRAIGLKWVYKLKKNTAGEIIKHKARLVAKGYVQKQGVDFEEVFAPVARL 990

Query: 302  DTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALY 361
            DTVRV+L + A++ W+VHHLDVKS FLNGELEEEVYV Q EGF    K+H V +L KALY
Sbjct: 991  DTVRVVLAVAADRRWQVHHLDVKSAFLNGELEEEVYVAQPEGFARSGKEHLVLKLHKALY 1050

Query: 362  GLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTEKV 421
            GLRQAPRAWNI+LDRSL+ELGF +CTQ+Q VYTR  G + ++V VYVD+LIV G +  ++
Sbjct: 1051 GLRQAPRAWNIRLDRSLRELGFDRCTQEQAVYTRGRGSDGIIVGVYVDDLIVTGENPSEL 1110

Query: 422  NKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYP 481
              FK+QMM EFEMS LGLL+YYLGIEV+Q +    LKQ  YAK++LSQFGM +CN+   P
Sbjct: 1111 KVFKEQMMGEFEMSDLGLLTYYLGIEVDQDESATTLKQTAYAKKLLSQFGMMECNSVSIP 1170

Query: 482  MEPKAQLHKDMEGAPIEATEYRNIVGCLRYLLNTRPNLSYVFGMASKYMERPTTIHYKVV 541
            ++P++QL KD EG P++ATEYR I+G LRYLL+TRP+LSY  G+AS++MERPT +H+K V
Sbjct: 1171 IDPRSQLSKDPEGHPVDATEYRRIIGSLRYLLHTRPDLSYAVGVASRFMERPTVMHFKAV 1230

Query: 542  KQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNS 601
            KQILRY++GT+ +GL Y  G     I GY+DSDL  DLD R+ST GM FY+N+SLV+W+S
Sbjct: 1231 KQILRYIKGTMDYGLVYAAGTGALKITGYTDSDLAGDLDDRRSTGGMAFYINQSLVAWSS 1290

Query: 602  QKQKTVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPV 659
            QKQKTV LSSC+AEF+AATTAACQALWLR +++E+  +E ++V LFVDN+SAIALMKNPV
Sbjct: 1291 QKQKTVALSSCEAEFMAATTAACQALWLRLLLAEVAGVEEKAVKLFVDNRSAIALMKNPV 1350

BLAST of CSPI07G13050 vs. NCBI nr
Match: gi|113622864|dbj|BAF22809.1| (Os08g0125300 [Oryza sativa Japonica Group])

HSP 1 Score: 762.7 bits (1968), Expect = 5.3e-217
Identity = 402/716 (56.15%), Postives = 492/716 (68.72%), Query Frame = 1

Query: 2    GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQIS 61
            GRKP L HLRVFGC+A+ K TTP+ KKLDDRS+P VY GVEEG KAHRL+DP   ++ +S
Sbjct: 713  GRKPQLGHLRVFGCIAHAKITTPNQKKLDDRSAPYVYLGVEEGSKAHRLFDPRCGRIHVS 772

Query: 62   RDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENLEDAETRVENVIPHATEIPAI 121
            RDV+F+EN+ W W+ VV+  +  TEF V      ++  +   A      V  +    PA+
Sbjct: 773  RDVIFEENVPWQWS-VVAGEQNSTEFTV-----EEDGVDAPPAGAPAYPVPRYRAPSPAV 832

Query: 122  -----------------------GATGPSPPSTNT------------------PVRLRSL 181
                                    +T PS P+T +                  PVR RSL
Sbjct: 833  PQSPPASPVGASSSLPTSPQSSPSSTPPSTPATGSAGPVASPGSGGDLRSDEGPVRFRSL 892

Query: 182  THIYINTEEV-VGGDEQENEKLLK-------------RPNWYKVMENELKSIEKNNTWSL 241
              I      V +  DE + + LL              +P W   M  EL++IEKN+TW+L
Sbjct: 893  EDIMREAPRVDLVEDEHDGDALLAEMEEPSSYREAAGQPAWENAMAQELQAIEKNSTWAL 952

Query: 242  TKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVR 301
            T LP GHKPIGLKWV+KLKK+ + EV+KH ARLVAK      G+DFEEV APVARLDTVR
Sbjct: 953  TALPAGHKPIGLKWVYKLKKNTAGEVIKHKARLVAKGYVQRQGVDFEEVFAPVARLDTVR 1012

Query: 302  VILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQ 361
            VIL + A++ WEVHHLDVKS FLNG+LEEEVYV Q EGF    ++H V RLSKALYGLRQ
Sbjct: 1013 VILAIAADRRWEVHHLDVKSAFLNGDLEEEVYVAQPEGFVKRGEEHLVLRLSKALYGLRQ 1072

Query: 362  APRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTEKVNKFK 421
            APRAWN +LD+ LKELGF +CTQ+Q VYTR +G+  V+V VYVD+LIV G +  ++  FK
Sbjct: 1073 APRAWNTRLDKCLKELGFARCTQEQAVYTRGKGQAGVIVGVYVDDLIVTGENPHEIAMFK 1132

Query: 422  QQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPK 481
            QQMM EFEMS LGLLSYYLGIEV Q +  I +KQ  YAK+ILSQFGM  CN T  PMEP+
Sbjct: 1133 QQMMGEFEMSDLGLLSYYLGIEVIQGENGIAIKQAAYAKKILSQFGMQGCNPTSIPMEPR 1192

Query: 482  AQLHKDMEGAPIEATEYRNIVGCLRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQIL 541
            + LHKD +G PI+ATEYR ++GCLRYLL+TRP+LSY  G+AS++MERPTT+H K VK IL
Sbjct: 1193 SLLHKDADGNPIDATEYRRVIGCLRYLLHTRPDLSYAVGVASRFMERPTTMHLKAVKMIL 1252

Query: 542  RYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQK 601
            RYL+GT+  GL +  G    DI G++DSDL  D+D R+ST GM FY+N SLVSW SQKQK
Sbjct: 1253 RYLKGTLDSGLVFASGSGSLDITGFTDSDLAGDMDDRRSTGGMAFYVNSSLVSWCSQKQK 1312

Query: 602  TVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGC 657
            TV LSSC+AEF+AAT AAC ALWLR ++SE++  E + V LFVDNKSAIALMKNPVFHG 
Sbjct: 1313 TVALSSCEAEFMAATAAACHALWLRALLSEMMGTEAKRVKLFVDNKSAIALMKNPVFHGR 1372

BLAST of CSPI07G13050 vs. NCBI nr
Match: gi|110289052|gb|ABB47537.2| (retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group])

HSP 1 Score: 647.5 bits (1669), Expect = 2.5e-182
Identity = 348/662 (52.57%), Postives = 436/662 (65.86%), Query Frame = 1

Query: 2    GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQIS 61
            G+KPHL HLRVFGC A+ K T PHLKKLDDRS+P VY GVEEG KAHRL+DP R ++ +S
Sbjct: 560  GKKPHLGHLRVFGCTAHAKVTAPHLKKLDDRSNPFVYLGVEEGSKAHRLFDPRRRQIIVS 619

Query: 62   RDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENLEDAETRVENVIPHATEIPAI 121
            RDVVF EN  W W+    +    TEF+V +   +++    E A                 
Sbjct: 620  RDVVFDENTPWQWSAAAGEVTS-TEFEVEEPVGAEQPALAEQA----------------- 679

Query: 122  GATGPSPPSTNT-PVRLRSLTHIYINTEEVVGGDEQENEKLLK--------------RPN 181
            G   P    ++  PVR RSL  I +    V   ++ ++ + L                P 
Sbjct: 680  GLASPHTAGSDVGPVRYRSLAEIMLEAPRVDLVEDDDDARALLAEMEEPLSYREATGEPA 739

Query: 182  WYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVA----- 241
            W   M  EL++IEKN TWSL  LP GHK IGLKWVFKLKK+ + EV+KH ARLVA     
Sbjct: 740  WVNAMNKELEAIEKNKTWSLCMLPAGHKAIGLKWVFKLKKNTAGEVIKHKARLVANGYVQ 799

Query: 242  -KGIDFEEVLAPVARLDTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFE 301
             +G+DF+EV APVARLDTVR IL + A++ W+VHHLDVKS FLNG+LEEEVYV+Q EGF 
Sbjct: 800  QQGVDFDEVFAPVARLDTVRAILAVAADRRWQVHHLDVKSAFLNGDLEEEVYVSQLEGFV 859

Query: 302  VPNKKHKVHRLSKALYGLRQAPRAWNIQLDR---SLKELGFGKCTQKQVVYTRSEGEECV 361
               K+H V+ LSKALYGLR   +A      R   S+KELGF + ++              
Sbjct: 860  EKGKEHLVYELSKALYGLR---QAPRAWNTRLDRSMKELGFSRPSE-------------- 919

Query: 362  LVEVYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTY 421
                              +  FKQQMM EFE+S LGLL+YYLGIEV Q    I +KQ  Y
Sbjct: 920  ------------------ITAFKQQMMGEFEISDLGLLTYYLGIEVLQGTDGIAIKQAAY 979

Query: 422  AKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGCLRYLLNTRPNLSYV 481
            A++IL+QFGM DCN+T  P+E ++QLHK  EG+ ++ TEYR ++GCLRYLL+T+P+LSY 
Sbjct: 980  ARKILTQFGMLDCNSTSIPIEHRSQLHKVAEGSTVDPTEYRRVIGCLRYLLHTQPDLSYA 1039

Query: 482  FGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGR 541
             G+ SK+ME+ T +H+K VKQILRYL+GTI+ GL ++ G    +I G++DSDL  D D R
Sbjct: 1040 VGVVSKFMEQLTVMHFKAVKQILRYLKGTINCGLMFSGGNDAVEITGFTDSDLAGDSDDR 1099

Query: 542  KSTSGMKFYLNESLVSWNSQKQKTVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEPR 601
            +STSGM FY N SLVSW+SQKQKTV LSSC+AEF+AAT AACQALWLR ++ E++  E R
Sbjct: 1100 RSTSGMAFYFNSSLVSWSSQKQKTVALSSCEAEFMAATAAACQALWLRGLLIEMIGAEAR 1159

Query: 602  SVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTK 640
             V L+VDNKSAIALMKNPVFHG  KHIDT +HFI+ECVE+G+I +EFV T EQRAD LTK
Sbjct: 1160 PVKLYVDNKSAIALMKNPVFHGRSKHIDTRYHFIRECVESGKIQIEFVRTEEQRADALTK 1168

BLAST of CSPI07G13050 vs. NCBI nr
Match: gi|29150404|gb|AAO72413.1| (gag-pol polyprotein [Oryza sativa Japonica Group])

HSP 1 Score: 643.7 bits (1659), Expect = 3.6e-181
Identity = 359/712 (50.42%), Postives = 442/712 (62.08%), Query Frame = 1

Query: 2    GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQIS 61
            G+KPHL HLRVFGC A+ K T PHLKKLDDRS+PVVY GVEEG KAHRL+DP R ++ +S
Sbjct: 666  GKKPHLGHLRVFGCTAHAKVTAPHLKKLDDRSNPVVYLGVEEGSKAHRLFDPRRRQIIVS 725

Query: 62   RDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENLEDAET----RVENVIPHATE 121
            RDVVF EN  W W+    +    TEF+V +   +++    E A +    R       A +
Sbjct: 726  RDVVFDENTPWQWSAAAGEVTS-TEFEVEEPVGAEQPAPAEQAGSVPWYRAPPAGRRAGK 785

Query: 122  IPAIGATGPSPP----------------------------------STNTPVRLRSLTHI 181
             P +     +PP                                  S + PVR RSL  I
Sbjct: 786  EPEVAEQRGTPPASPARFSPTLPSTPTLGSSSTHSAEVQASPRTAGSDDEPVRYRSLAEI 845

Query: 182  YINTEEV-VGGDEQENEKLL-------------KRPNWYKVMENELKSIEKNNTWSLTKL 241
             I    V +  D+ + E LL               P W   M  EL++IEKN TWSL  L
Sbjct: 846  MIEAPRVDLVEDDDDTEALLVEMEEPTSYREAAGEPAWVNAMNKELEAIEKNKTWSLCML 905

Query: 242  PPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL 301
            P  HK IGLKWVFKLKK+ + EV+KH ARLVAK      G+DF+EV APVARLDTVR IL
Sbjct: 906  PASHKAIGLKWVFKLKKNTAGEVIKHKARLVAKGYVQRQGVDFDEVFAPVARLDTVRAIL 965

Query: 302  VLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPR 361
             +  ++ W+VHHLDVKS FLNG+LEEEVYV+Q EGF    K+H V++LSKALYGLRQAPR
Sbjct: 966  PVAVDRRWQVHHLDVKSAFLNGDLEEEVYVSQPEGFVEKGKEHLVYKLSKALYGLRQAPR 1025

Query: 362  AWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTEKVNKFKQQM 421
            AWN +LDRS+KELGF +C Q+Q VYTR  G   ++V VYVD+LIV G S   +  FKQQM
Sbjct: 1026 AWNTRLDRSMKELGFSRCAQEQAVYTRGTGSTGIIVGVYVDDLIVTGESPSDITAFKQQM 1085

Query: 422  MAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQL 481
            M EFEMS LGLL+YYLGIE+ +      +  PT  +R++                 +  L
Sbjct: 1086 MGEFEMSDLGLLTYYLGIELHKDAQGSTV-DPTEYRRVIGCL--------------RYLL 1145

Query: 482  HKDMEGAPIEATEYRNIVGCLRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYL 541
            H                          RP+LSY  G+AS++MERPT +H+K VKQILRYL
Sbjct: 1146 HT-------------------------RPDLSYAVGVASRFMERPTVMHFKAVKQILRYL 1205

Query: 542  RGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVT 601
            +GTI+ GL ++ G    +I G++DSDL  D D R+STSGM FY N SLVSW+SQKQKTV 
Sbjct: 1206 KGTINCGLMFSGGNGAVEITGFTDSDLAGDSDDRRSTSGMAFYFNGSLVSWSSQKQKTVA 1265

Query: 602  LSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKH 656
            LSSC+AEF+AAT AAC ALWLR ++ E++  E R V L+VDNKSAIALMKNPVFHG  KH
Sbjct: 1266 LSSCEAEFMAATAAACHALWLRGLLIEMIGAEARPVKLYVDNKSAIALMKNPVFHGRSKH 1325

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
POLX_TOBAC3.6e-10434.78Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
COPIA_DROME1.6e-8334.92Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3[more]
YCH4_YEAST2.5e-3331.02Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
M810_ARATH2.4e-3132.29Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg0... [more]
YH41B_YEAST1.6e-1920.11Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Match NameE-valueIdentityDescription
B8BDZ6_ORYSI1.7e-21756.15Putative uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_30754 PE=4... [more]
Q10RM4_ORYSJ2.9e-21754.85Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica ... [more]
Q0J8A6_ORYSJ3.7e-21756.15Os08g0125300 protein OS=Oryza sativa subsp. japonica GN=Os08g0125300 PE=2 SV=1[more]
Q338J6_ORYSJ1.7e-18252.57Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica ... [more]
Q84SW8_ORYSJ2.5e-18150.42Gag-pol polyprotein OS=Oryza sativa subsp. japonica GN=LOC_Os03g47410 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G23160.12.8e-7533.70 cysteine-rich RLK (RECEPTOR-like protein kinase) 8[more]
ATMG00810.11.3e-3232.29ATMG00810.1 DNA/RNA polymerases superfamily protein[more]
ATMG00820.15.5e-1040.91ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)[more]
Match NameE-valueIdentityDescription
gi|218201855|gb|EEC84282.1|2.4e-21756.15hypothetical protein OsI_30754 [Oryza sativa Indica Group][more]
gi|108706239|gb|ABF94034.1|4.1e-21754.85retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group][more]
gi|113622864|dbj|BAF22809.1|5.3e-21756.15Os08g0125300 [Oryza sativa Japonica Group][more]
gi|110289052|gb|ABB47537.2|2.5e-18252.57retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group][more]
gi|29150404|gb|AAO72413.1|3.6e-18150.42gag-pol polyprotein [Oryza sativa Japonica Group][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR013103RVT_2
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005739 mitochondrion
cellular_component GO:0009536 plastid
molecular_function GO:0005488 binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI07G13050.1CSPI07G13050.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 181..418
score: 8.5
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 1..578
score: 4.7E
NoneNo IPR availablePANTHERPTHR11439:SF127SUBFAMILY NOT NAMEDcoord: 1..578
score: 4.7E
NoneNo IPR availableunknownSSF56672DNA/RNA polymerasescoord: 180..397
score: 7.78E-26coord: 422..609
score: 7.78

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None