CSPI01G13260 (gene) Wild cucumber (PI 183967)

Name: CSPI01G13260
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Chr1 : 8734055 .. 8737038 (-)
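
As a quick sanity check on the location field, the span length can be derived from the two coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is a common genome-browser convention but is not stated on this page; `parse_location` is a hypothetical helper name.

```python
import re

def parse_location(text):
    """Split a location such as 'Chr1 : 8734055 .. 8737038 (-)' into its parts."""
    m = re.search(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates.
        "length": end - start + 1,
    }

loc = parse_location("Chr1 : 8734055 .. 8737038 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 2984 bp on the minus strand of Chr1.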
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCATCTAACATGTTGCAGCCTCAACTTCCTCGTTTTGAGGGAAAAAACTATATGCGGTGGAGCCAGCAAATGAAAGTTCTTTATGGATCTCAAGATCTTTGGGATATTGTTGACAACGGATATTCAAAGCCAGAAAGTGAGAATGATCTTTTAGCACAACGACTCAATGAGTTGAGAGATGCTAGAAAAAAGAATAAGAAGGCATTATTTTTCATCTACCAAGCTGTGGATGAAAATATTTTTGAAAGAATATCAGGAGTCTCTACTGCTAAAGCAGCATGGGATGCATTGCAAAATTTGTATGAATGTGAAGAAAATGCAAAATTGGTCCGATTACAAACACTTCGAGCTGAATTTGATACAATTCGAATGAAAGATTCTGAAACTATTGAAGAAATTTTCAACTGTGTTCTCTTATTGTTAATCAATTGAGATCAAATGGAGAAACAATTGAAGATCAAAGGATTGTTGAGAAGATTCTTAGAAGCATGACTAGAAGATATGGGCATATTGTTGTAGCAATTGAAGAATCCAAAGATTTGTCAACTCTCTCTATAAATAGTTTAATGGGATCTCTTCATTCTCATGAGCTCTGATTGAAGATGTTTGATTCTACTCCTTCAGAAGAAGCTTTTCATATGCAGTCCTCCTATTGAGGTCGATCCAATGGAAGAAGATGTGGACGTGGTGGCAGTGGCAATGGACGATCCAACGTTGTAACAAATACAAAGTCAAAAAGCAGAGACAATCAATCCTTTTCAAATAGAGGACGGGGAAGAAGTTCAAATAGAGGAAGTGGTAGAAGTAGTGGTCGTGGAGATTTTTCTCACATACAATGTTTCAATTGTAGACGTTATGGACATTTTCAAGCAGACTGTTGGTCTAAGAAGACTAATTCTAATCAAGCAGAAACCACACTAATGCATGAGCAATCAAATAATGATCAAGGTCTTCTCTTCCTCACTCTCAATGTTCAAGAATCAAGCACTGAAGAAATATGGTATCTTGATAGTGGTTGTAGTAATCACATGACAAGAAGAAAGGATATTTTTATATCTGTAGATGAATCTCATCAAAATGTAGTGAAGATTGGTGACAACAAGAAGCTTGAAGTCAAATGAAAATGAGATATTCTTATCAAGAAAAAAAAGGGGAACAAAAAGAATTACTGATGTGTATTATGTTTCAGGTCTCAAACACAATCTTTTAAGTGTTGGACAACTTCTCCTAAGAGGACATGAGGTTATCTTTAAAGATAACCTCATGTCCTCTTAGGAGAAGTTGTCCAACACTTAAAAGATTGTGTTTGAGACCTGAAACATAATACACATCAGTAATTCTTTTTGTTCCCCTTTTTTTTCTTGATAAGAATATCTCATTTTCCTTTGACTTCAAGCATCTTGTTGTCACCAGTCTTCACTACATTTTGATGAGATTCATCTAAAGATATAAAAATATCCTTTCTTCCTGTCATGTGGTTACTACAACCACTATCAAGATACCATATTTCTTCAGTGCTTGATTTTTGAGCTTGTTCATACAGACTTATGTGGACCTATGAGAACTACTACGCATGGAGGTAACTGTTATTTTCTCACATTTATTGATGACTAGAGTCGAAAAACATGGATTTATCTACTAAAAGAAAAGAGTGCTACTTTCGAATGTTTCAAGACATTCAAAGCAATGGTGGAAAATGAAAGTAACTTGAAATTGAAATCATTATGTTCGGATCGTGGAGGAGAATATATTGTTTTTGCAGATTTCTTGAAGGAAAATGGAATCAAGCATCAGAAGACTGTTCGAAGAACTCCTCAACAAAACGGAGTTGCAGAGAGGAAAAATAGAATAATAATGGAACTTGCAAGAAGTATGTTGAAGGCAAAGAAGCTTCCTGATCAATTTTGGGGGTGGGGAGACACAGTAACTTATGCTATTTATCTTCTAAATAGAGCTTCAACGGAAAGTGTGCAATGTATTACTCCTCAAGAAGCATG
GAGCGGATTGAAACCAACTGTTAGTCACCTAAGAATGTTTGGGTGCATTGCTTACTCTCACATTTCAGATGAGAAAAAAGGTAAGCTAGATGATAAATCAGAAAAATGCATTTTTGCTGGGTACAGTGAGAACTCCAAGGCCTACAGACTATACAACCCGATAAGTAAGAAAGTTATTATTAGTCGAGATGTCAAGTTCGATGAAGCAAAATTGTGGCAATGGAATGCACCAAATGAAGACCAAAATCCATTACATGTTGATATGGATGGAAAAAAAGATGCTCGAGACTTGGAGCTTGAAGTAACTCAACCACTGACTTCACCTTCTTCATCACACTCCACAAGTGATGAAGAAACTACTCCAAGGAAGACCAGAAATATTCAAGAGATCTATAATACTTCAAGAAGGATACTAGATGAAGAACATGTTGATTTTGCTTTATTTGCAAATGTTGATCCTGTATACTTTGAAGAAGCAATTCAAGATGAAAATTGGAAAGATGCAATGAATCAAGAGATTGATGCAATAAGAAGAAACGAGATATGGGAGTTAGTAAAATTACCAGAAAATAAAAGGGCTCTTGGAGTCAAATGGATCTATAGAACAAAGCTAAAGCAAAACGGAGAAGTGCAAAAATACAAAGCCAGACTCGTTGTAAAAGGTTACAAACAAAAGTTTGGTATGGATTATAAAGAAGTTTTTGCACCGGTAACTCGCTTGGAGACTGTTCGTTTGTTGTTAGCCCTTGCAGCAAAAAATAACTGGAAAGTTCATCAAATGGATGTAAAGTCAGCATTCCTAAATGGGTATTTAGAGGATGAAATATATGTTGAGCAGCCCACCGGTTATGCAAAGATTGGAGAAGAAAATAAGGTGTATCGATTAAAGAAAGTCTTGTATGGGCTAAAGCAAGCACCAAGGGCTTGGTACAGTCGCATCGACAATTTTTTCTTAAAGGATGGTTTCAGAAGATGTCCATATGA

mRNA sequence

ATGTCATCTAACATGTTGCAGCCTCAACTTCCTCGTTTTGAGGGAAAAAACTATATGCGGTGGAGCCAGCAAATGAAAGTTCTTTATGGATCTCAAGATCTTTGGGATATTGTTGACAACGGATATTCAAAGCCAGAAAGTGAGAATGATCTTTTAGCACAACGACTCAATGAGTTGAGAGATGCTAGAAAAAAGAATAAGAAGGCATTATTTTTCATCTACCAAGCTGTGGATGAAAATATTTTTGAAAGAATATCAGGAGTCTCTACTGCTAAAGCAGCATGGGATGCATTGCAAAATTTGTATGAATGTGAAGAAAATGCAAAATTGGTCCGATTACAAACACTTCGAGCTGAATTTGATACAATTCGAATGAAAGATTCTGAAACTATTGAAGAAATTTTCAACTGTAAGAAGCTTTTCATATGCAGTCCTCCTATTGAGGTCGATCCAATGGAAGAAGATGTGGACGTGGTGGCAGTGGCAATGGACGATCCAACGTTACGTTATGGACATTTTCAAGCAGACTGTTGGTCTAAGAAGACTAATTCTAATCAAGCAGAAACCACACTAATGCATGAGCAATCAAATAATGATCAAGGTCTTCTCTTCCTCACTCTCAATGTTCAAGAATCAAGCACTGAAGAAATATGGTATCTTGATAGTGGTTGTAGTAATCACATGACAAGAAGAAAGGATATTTTTATATCTGTAGATGAATCTCATCAAAATGTAGTGAAGATTGGTCTCAAACACAATCTTTTAAGTGTTGGACAACTTCTCCTAAGAGGACATGAGTGCTTGATTTTTGAGCTTGTTCATACAGACTTATGTGGACCTATGAGAACTACTACGCATGGAGATTTCTTGAAGGAAAATGGAATCAAGCATCAGAAGACTGTTCGAAGAACTCCTCAACAAAACGGAGTTGCAGAGAGGAAAAATAGAATAATAATGGAACTTGCAAGAAGTATGTTGAAGGCAAAGAAGCTTCCTGATCAATTTTGGGGGTGGGGAGACACAGTAACTTATGCTATTTATCTTCTAAATAGAGCTTCAACGGAAAGTGTGCAATGTATTACTCCTCAAGAAGCATGGAGCGGATTGAAACCAACTGTTAGTCACCTAAGAATGTTTGGGTGCATTGCTTACTCTCACATTTCAGATGAGAAAAAAGGTAAGCTAGATGATAAATCAGAAAAATGCATTTTTGCTGGGTACAGTGAGAACTCCAAGGCCTACAGACTATACAACCCGATAAGTAAGAAAGTTATTATTAGTCGAGATGTCAAGTTCGATGAAGCAAAATTGTGGCAATGGAATGCACCAAATGAAGACCAAAATCCATTACATGTTGATATGGATGGAAAAAAAGATGCTCGAGACTTGGAGCTTGAAGTAACTCAACCACTGACTTCACCTTCTTCATCACACTCCACAAGTGATGAAGAAACTACTCCAAGGAAGACCAGAAATATTCAAGAGATCTATAATACTTCAAGAAGGATACTAGATGAAGAACATGTTGATTTTGCTTTATTTGCAAATGTTGATCCTGTATACTTTGAAGAAGCAATTCAAGATGAAAATTGGAAAGATGCAATGAATCAAGAGATTGATGCAATAAGAAGAAACGAGATATGGGAGTTAGTAAAATTACCAGAAAATAAAAGGGCTCTTGGAGTCAAATGGATCTATAGAACAAAGCTAAAGCAAAACGGAGAAGTGCAAAAATACAAAGCCAGACTCGTTGTAAAAGGTTACAAACAAAAGTTTGGTATGGATTATAAAGAAGTTTTTGCACCGGTAACTCGCTTGGAGACTGTTCGTTTGTTGTTAGCCCTTGCAGCAAAAAATAACTGGAAAGTTCATCAAATGGATGTAAAGTCAGCATTCCTAAATGGGTATTTAGAGGATGAAATATATGTTGAGCAGCCCACCGGTTATGCAAAGATTGGAGAAGAAAATAAGAAGATGTCCATATGA

Coding sequence (CDS)

ATGTCATCTAACATGTTGCAGCCTCAACTTCCTCGTTTTGAGGGAAAAAACTATATGCGGTGGAGCCAGCAAATGAAAGTTCTTTATGGATCTCAAGATCTTTGGGATATTGTTGACAACGGATATTCAAAGCCAGAAAGTGAGAATGATCTTTTAGCACAACGACTCAATGAGTTGAGAGATGCTAGAAAAAAGAATAAGAAGGCATTATTTTTCATCTACCAAGCTGTGGATGAAAATATTTTTGAAAGAATATCAGGAGTCTCTACTGCTAAAGCAGCATGGGATGCATTGCAAAATTTGTATGAATGTGAAGAAAATGCAAAATTGGTCCGATTACAAACACTTCGAGCTGAATTTGATACAATTCGAATGAAAGATTCTGAAACTATTGAAGAAATTTTCAACTGTAAGAAGCTTTTCATATGCAGTCCTCCTATTGAGGTCGATCCAATGGAAGAAGATGTGGACGTGGTGGCAGTGGCAATGGACGATCCAACGTTACGTTATGGACATTTTCAAGCAGACTGTTGGTCTAAGAAGACTAATTCTAATCAAGCAGAAACCACACTAATGCATGAGCAATCAAATAATGATCAAGGTCTTCTCTTCCTCACTCTCAATGTTCAAGAATCAAGCACTGAAGAAATATGGTATCTTGATAGTGGTTGTAGTAATCACATGACAAGAAGAAAGGATATTTTTATATCTGTAGATGAATCTCATCAAAATGTAGTGAAGATTGGTCTCAAACACAATCTTTTAAGTGTTGGACAACTTCTCCTAAGAGGACATGAGTGCTTGATTTTTGAGCTTGTTCATACAGACTTATGTGGACCTATGAGAACTACTACGCATGGAGATTTCTTGAAGGAAAATGGAATCAAGCATCAGAAGACTGTTCGAAGAACTCCTCAACAAAACGGAGTTGCAGAGAGGAAAAATAGAATAATAATGGAACTTGCAAGAAGTATGTTGAAGGCAAAGAAGCTTCCTGATCAATTTTGGGGGTGGGGAGACACAGTAACTTATGCTATTTATCTTCTAAATAGAGCTTCAACGGAAAGTGTGCAATGTATTACTCCTCAAGAAGCATGGAGCGGATTGAAACCAACTGTTAGTCACCTAAGAATGTTTGGGTGCATTGCTTACTCTCACATTTCAGATGAGAAAAAAGGTAAGCTAGATGATAAATCAGAAAAATGCATTTTTGCTGGGTACAGTGAGAACTCCAAGGCCTACAGACTATACAACCCGATAAGTAAGAAAGTTATTATTAGTCGAGATGTCAAGTTCGATGAAGCAAAATTGTGGCAATGGAATGCACCAAATGAAGACCAAAATCCATTACATGTTGATATGGATGGAAAAAAAGATGCTCGAGACTTGGAGCTTGAAGTAACTCAACCACTGACTTCACCTTCTTCATCACACTCCACAAGTGATGAAGAAACTACTCCAAGGAAGACCAGAAATATTCAAGAGATCTATAATACTTCAAGAAGGATACTAGATGAAGAACATGTTGATTTTGCTTTATTTGCAAATGTTGATCCTGTATACTTTGAAGAAGCAATTCAAGATGAAAATTGGAAAGATGCAATGAATCAAGAGATTGATGCAATAAGAAGAAACGAGATATGGGAGTTAGTAAAATTACCAGAAAATAAAAGGGCTCTTGGAGTCAAATGGATCTATAGAACAAAGCTAAAGCAAAACGGAGAAGTGCAAAAATACAAAGCCAGACTCGTTGTAAAAGGTTACAAACAAAAGTTTGGTATGGATTATAAAGAAGTTTTTGCACCGGTAACTCGCTTGGAGACTGTTCGTTTGTTGTTAGCCCTTGCAGCAAAAAATAACTGGAAAGTTCATCAAATGGATGTAAAGTCAGCATTCCTAAATGGGTATTTAGAGGATGAAATATATGTTGAGCAGCCCACCGGTTATGCAAAGATTGGAGAAGAAAATAAGAAGATGTCCATATGA
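
The protein coordinates used in the BLAST alignments refer to the translation of this CDS. A minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); `translate` is an illustrative helper, and only the first 30 nucleotides of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

prefix = "ATGTCATCTAACATGTTGCAGCCTCAACTT"  # first 30 nt of the CDS
print(translate(prefix))  # MSSNMLQPQL
```

The result agrees with the first ten query residues shown in the AT1G48720.1 alignment (`MSSNMLQPQL`).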
BLAST of CSPI01G13260 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 246.9 bits (629), Expect = 6.2e-64
Identity = 147/418 (35.17%), Positives = 240/418 (57.42%), Query Frame = 1

Query: 273 VHTDLCGPMRTTTHGDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLP 332
           + +D  G   +    ++   +GI+H+KTV  TPQ NGVAER NR I+E  RSML+  KLP
Sbjct: 547 LRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLP 606

Query: 333 DQFWGWGDTVTYAIYLLNRASTESVQCITPQEAWSGLKPTVSHLRMFGCIAYSHISDEKK 392
             FWG  + V  A YL+NR+ +  +    P+  W+  + + SHL++FGC A++H+  E++
Sbjct: 607 KSFWG--EAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQR 666

Query: 393 GKLDDKSEKCIFAGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAPNEDQNPLHV 452
            KLDDKS  CIF GY +    YRL++P+ KKVI SRDV F E+++               
Sbjct: 667 TKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEV-----------RTAA 726

Query: 453 DMDGKKDARDLELEVTQPLTS--PSSSHSTSDE------------ETTPRKTRNIQEIYN 512
           DM  K     +   VT P TS  P+S+ ST+DE            E   +    ++E+ +
Sbjct: 727 DMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEH 786

Query: 513 TSR--------RILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQ-EIDAI------- 572
            ++        R  +   V+   + + + V   +  + E+ K+ ++  E + +       
Sbjct: 787 PTQGEEQHQPLRRSERPRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQLMKAMQEE 846

Query: 573 ----RRNEIWELVKLPENKRALGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGMDYKE 632
               ++N  ++LV+LP+ KR L  KW+++ K   + ++ +YKARLVVKG++QK G+D+ E
Sbjct: 847 MESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDE 906

Query: 633 VFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPTGYAKIGEEN 657
           +F+PV ++ ++R +L+LAA  + +V Q+DVK+AFL+G LE+EIY+EQP G+   G+++
Sbjct: 907 IFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKH 951
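
The percentages in each HSP header follow directly from the residue counts. The sketch below recomputes them for the POLX_TOBAC hit; `parse_hsp_stats` is a hypothetical helper, and the regular expression keys on the layout of the statistics line as it appears on this page rather than on any official BLAST output specification.

```python
import re

# Matches "Identity = a/b (x%), <label> = c/d (y%)" regardless of label spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), \w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Recompute identity and positive percentages from an HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"unrecognised HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": 100 * int(ident) / int(ident_len),
        "positives": 100 * int(pos) / int(pos_len),
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }

stats = parse_hsp_stats(
    "Identity = 147/418 (35.17%), Positives = 240/418 (57.42%), Query Frame = 1"
)
print(round(stats["identity"], 2), round(stats["positives"], 2))
```

The recomputed values round to the reported 35.17% and 57.42%.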

BLAST of CSPI01G13260 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 122.1 bits (305), Expect = 2.3e-26
Identity = 79/253 (31.23%), Positives = 126/253 (49.80%), Query Frame = 1

Query: 433  DEAKLWQWNAPNEDQNPLHVDM-------------DGKKDARDLELEVTQPLTSPSSS-- 492
            D  K+ Q   PNE +   ++               + KK  RD  L  ++   +P+ S  
Sbjct: 771  DSRKIIQTEFPNESKECDNIQFLKDSKESNKYFLNESKKRKRDDHLNESKGSGNPNESRE 830

Query: 493  -----HSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDF---------------ALFANV 552
                 H        P K   I+ I   S R+  +  + +                +F +V
Sbjct: 831  SETAEHLKEIGIDNPTKNDGIEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDV 890

Query: 553  DPVYFEEAIQDE--NWKDAMNQEIDAIRRNEIWELVKLPENKRALGVKWIYRTKLKQNGE 612
               + E   +D+  +W++A+N E++A + N  W + K PENK  +  +W++  K  + G 
Sbjct: 891  PNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGN 950

Query: 613  VQKYKARLVVKGYKQKFGMDYKEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNG 649
              +YKARLV +G+ QK+ +DY+E FAPV R+ + R +L+L  + N KVHQMDVK+AFLNG
Sbjct: 951  PIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNG 1010

BLAST of CSPI01G13260 vs. Swiss-Prot
Match: M820_ARATH (Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana GN=AtMg00820 PE=4 SV=1)

HSP 1 Score: 88.2 bits (217), Expect = 3.7e-16
Identity = 40/92 (43.48%), Positives = 62/92 (67.39%), Query Frame = 1

Query: 525 AIQDENWKDAMNQEIDAIRRNEIWELVKLPENKRALGVKWIYRTKLKQNGEVQKYKARLV 584
           A++D  W  AM +E+DA+ RN+ W LV  P N+  LG KW+++TKL  +G + + KARLV
Sbjct: 34  ALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLV 93

Query: 585 VKGYKQKFGMDYKEVFAPVTRLETVRLLLALA 617
            KG+ Q+ G+ + E ++PV R  T+R +L +A
Sbjct: 94  AKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CSPI01G13260 vs. Swiss-Prot
Match: M710_ARATH (Uncharacterized mitochondrial protein AtMg00710 OS=Arabidopsis thaliana GN=AtMg00710 PE=4 SV=1)

HSP 1 Score: 60.8 bits (146), Expect = 6.3e-08
Identity = 32/87 (36.78%), Positives = 47/87 (54.02%), Query Frame = 1

Query: 315 NRIIMELARSMLKAKKLPDQFWGWGDTVTYAIYLLNRASTESVQCITPQEAWSGLKPTVS 374
           NR I+E  RSML    LP  F    D    A++++N+  + ++    P E W    PT S
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRA--DAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYS 61

Query: 375 HLRMFGCIAYSHISDEKKGKLDDKSEK 402
           +LR FGC+AY H  +   GKL  +++K
Sbjct: 62  YLRRFGCVAYIHCDE---GKLKPRAKK 83

BLAST of CSPI01G13260 vs. TrEMBL
Match: A6YTD9_CUCME (Integrase OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 477.6 bits (1228), Expect = 2.4e-131
Identity = 237/374 (63.37%), Positives = 298/374 (79.68%), Query Frame = 1

Query: 287 GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGWGDTVTYAI 346
           G+F KE GI HQ T R T QQNGVAERKNR IME+ARSMLKAK LP++FWG  D V   +
Sbjct: 567 GNFFKEQGIHHQMTARMTTQQNGVAERKNRTIMEMARSMLKAKNLPNEFWG--DAVACTV 626

Query: 347 YLLNRASTESVQCITPQEAWSGLKPTVSHLRMFGCIAYSHISDEKKGKLDDKSEKCIFAG 406
           Y+LNRA T+SV  +TP EAW   KP+VSHL++F  IAYSHI ++ +GKLDDKSEKCI  G
Sbjct: 627 YILNRAPTKSVPGMTPYEAWCDEKPSVSHLKVFRSIAYSHIPNQLRGKLDDKSEKCIMVG 686

Query: 407 YSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLEL 466
           Y+ENSKAYRLYNP+S+K+II+RDV F E + W WN   +E ++P HV+++  + A++LE 
Sbjct: 687 YNENSKAYRLYNPVSRKIIINRDVIFSEDESWNWNDDVDEAKSPFHVNINENEVAQELEQ 746

Query: 467 EVTQPLTSPSSS--HSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFE 526
              Q + S SSS   STS++E +PR+ R+IQEIYN + RI  +   +FALFA V PV F+
Sbjct: 747 AKIQAVESSSSSTSSSTSNDEISPRRMRSIQEIYNNTNRINVDHFANFALFAGVGPVTFD 806

Query: 527 EAIQDENWKDAMNQEIDAIRRNEIWELVKLPENKRALGVKWIYRTKLKQNGEVQKYKARL 586
           EAIQDE WK AM+QEIDAIRRNE WEL++LP NK+ALGVKW+YRTKLK +G V+ YKARL
Sbjct: 807 EAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEIYKARL 866

Query: 587 VVKGYKQKFGMDYKEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYV 646
           VVKGYKQ++G+DY+E+FAPVTR+ET+RL+L+LAA+N WKVHQMD+KSAFLNG+L+DEI+V
Sbjct: 867 VVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVHQMDIKSAFLNGHLKDEIFV 926

Query: 647 EQPTGYAKIGEENK 658
            QP GY + GEE K
Sbjct: 927 AQPLGYVQRGEEEK 938

BLAST of CSPI01G13260 vs. TrEMBL
Match: A0A068B703_GOSBA (Polyprotein OS=Gossypium barbadense PE=4 SV=1)

HSP 1 Score: 418.7 bits (1075), Expect = 1.3e-113
Identity = 203/392 (51.79%), Positives = 284/392 (72.45%), Query Frame = 1

Query: 271 ELVHTDLCGPMRTTTHGDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKK 330
           +++ +D  G      +  F K++GI HQ T RRTPQQNGVAERKNR I+++ARSM+K K 
Sbjct: 608 KILRSDRGGEYTAKLYESFCKDHGIIHQLTARRTPQQNGVAERKNRTILDMARSMIKGKH 667

Query: 331 LPDQFWGWGDTVTYAIYLLNRASTESVQCITPQEAWSGLKPTVSHLRMFGCIAYSHISDE 390
           LP  FW   + V  A+YLLN+  T+SV+  TP+EAWSG KP V HL++FGCIAY+H+ ++
Sbjct: 668 LPRTFWA--EAVECAVYLLNQCPTKSVRHKTPEEAWSGHKPRVGHLKIFGCIAYAHVPEQ 727

Query: 391 KKGKLDDKSEKCIFAGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAPNEDQNPL 450
           ++ KLDD+ EKCIF GY + SKAYRLYNP++KK+IISRDV+FDEA  W+W+   +    L
Sbjct: 728 QRKKLDDRGEKCIFIGYDKRSKAYRLYNPLTKKLIISRDVEFDEADYWRWSEEEKKVEGL 787

Query: 451 HVDMD--GKKDARDLELEVTQPLTSPSSSHSTSDEETTPRKTRNIQEIYNTSRRILDEEH 510
             + D   +++  D +   T   +SP+SS  +S  +  P +TR++ +IYN++  +  E  
Sbjct: 788 FFNEDDNNQEEQGDDQSPGTTAPSSPTSSSGSSSLDEAPTRTRSLNDIYNSTEPV--ETQ 847

Query: 511 VDFALFA---NVDPVYFEEAIQDENWKDAMNQEIDAIRRNEIWELVKLPENKRALGVKWI 570
            D++LF      DPV +EEAI++  WK AM++EI AIRRN+ WEL  LPE    +GVKW+
Sbjct: 848 FDYSLFCLMTECDPVTYEEAIENNKWKKAMDEEIAAIRRNDTWELTSLPEGHSPIGVKWV 907

Query: 571 YRTKLKQNGEVQKYKARLVVKGYKQKFGMDYKEVFAPVTRLETVRLLLALAAKNNWKVHQ 630
           Y+TK  + G+V+KYKARLV KGYKQ+ G+DY E+FAPV R++T+RLL+A+AA+  WK++Q
Sbjct: 908 YKTKTNKEGKVEKYKARLVAKGYKQRQGVDYDEIFAPVARIDTIRLLIAVAAQYKWKIYQ 967

Query: 631 MDVKSAFLNGYLEDEIYVEQPTGYAKIGEENK 658
           MDVKSAFLNGYLE+E+Y+EQP GY+  G+E+K
Sbjct: 968 MDVKSAFLNGYLEEEVYIEQPPGYSIQGKEDK 995

BLAST of CSPI01G13260 vs. TrEMBL
Match: V9H0W6_ARATH (Lectin receptor kinase (Fragment) OS=Arabidopsis thaliana PE=4 SV=1)

HSP 1 Score: 413.7 bits (1062), Expect = 4.3e-112
Identity = 211/398 (53.02%), Positives = 280/398 (70.35%), Query Frame = 1

Query: 268 LIFELVHTDLCGPMRTTTHGDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLK 327
           L+ + + +D  G   +     + ++NGI+ Q TV R+PQQNGVAERKNR I+E+ARSMLK
Sbjct: 85  LVIKTMRSDSGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAERKNRTILEMARSMLK 144

Query: 328 AKKLPDQFWGWGDTVTYAIYLLNRASTESVQCITPQEAWSGLKPTVSHLRMFGCIAYSHI 387
           +K+LP + W   + V  A+YLLNR+ T+SV   TPQEAWSG KP VSHLR+FG IA++H+
Sbjct: 145 SKRLPKELWA--EAVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHV 204

Query: 388 SDEKKGKLDDKSEKCIFAGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAPNEDQ 447
            DEK+ KLDDKSEK IF GY  NSK Y+LYNP +KK IISR++ FDE   W WN+  ED 
Sbjct: 205 PDEKRNKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEEDY 264

Query: 448 NPL-HVDMDGKKDARDL--ELEVTQPLTSPSSSH-STSDEETTPRKTRNIQEIYNTSRRI 507
           N   H + D  +  R+     E T P TSP+SS    S  E TPR  R+IQE+Y  +   
Sbjct: 265 NFFPHFEEDKPEPTREEPPSEEPTTPPTSPTSSQIEESSSERTPR-FRSIQELYEVTEN- 324

Query: 508 LDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNEIWELVKLPENKRALGVK 567
             E    F LFA  +P+ F+EAI+ + W++AM++EI +I++N+ WEL  LP   +A+GVK
Sbjct: 325 -QENLTLFCLFAECEPMDFQEAIEKKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVK 384

Query: 568 WIYRTKLKQNGEVQKYKARLVVKGYKQKFGMDYKEVFAPVTRLETVRLLLALAAKNNWKV 627
           W+Y+ K    GEV++YKARLV KGY Q+ G+DY E+FAPV RLETVRL+++LAA+N WK+
Sbjct: 385 WVYKAKKNSKGEVERYKARLVAKGYSQRAGIDYDEIFAPVARLETVRLIISLAAQNKWKI 444

Query: 628 HQMDVKSAFLNGYLEDEIYVEQPTGYAKIGEENKKMSI 662
           HQMDVKSAFLNG LE+E+Y+EQP GY   GEE+K + +
Sbjct: 445 HQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKVLRL 477

BLAST of CSPI01G13260 vs. TrEMBL
Match: Q9M197_ARATH (Copia-type reverse transcriptase-like protein OS=Arabidopsis thaliana GN=T16L24.270 PE=4 SV=1)

HSP 1 Score: 413.7 bits (1062), Expect = 4.3e-112
Identity = 211/398 (53.02%), Positives = 280/398 (70.35%), Query Frame = 1

Query: 268 LIFELVHTDLCGPMRTTTHGDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLK 327
           L+ + + +D  G   +     + ++NGI+ Q TV R+PQQNGVAERKNR I+E+ARSMLK
Sbjct: 588 LVIKTMRSDSGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAERKNRTILEMARSMLK 647

Query: 328 AKKLPDQFWGWGDTVTYAIYLLNRASTESVQCITPQEAWSGLKPTVSHLRMFGCIAYSHI 387
           +K+LP + W   + V  A+YLLNR+ T+SV   TPQEAWSG KP VSHLR+FG IA++H+
Sbjct: 648 SKRLPKELWA--EAVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHV 707

Query: 388 SDEKKGKLDDKSEKCIFAGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAPNEDQ 447
            DEK+ KLDDKSEK IF GY  NSK Y+LYNP +KK IISR++ FDE   W WN+  ED 
Sbjct: 708 PDEKRNKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEEDY 767

Query: 448 NPL-HVDMDGKKDARDL--ELEVTQPLTSPSSSH-STSDEETTPRKTRNIQEIYNTSRRI 507
           N   H + D  +  R+     E T P TSP+SS    S  E TPR  R+IQE+Y  +   
Sbjct: 768 NFFPHFEEDKPEPTREEPPSEEPTTPPTSPTSSQIEESSSERTPR-FRSIQELYEVTEN- 827

Query: 508 LDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNEIWELVKLPENKRALGVK 567
             E    F LFA  +P+ F+EAI+ + W++AM++EI +I++N+ WEL  LP   +A+GVK
Sbjct: 828 -QENLTLFCLFAECEPMDFQEAIEKKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVK 887

Query: 568 WIYRTKLKQNGEVQKYKARLVVKGYKQKFGMDYKEVFAPVTRLETVRLLLALAAKNNWKV 627
           W+Y+ K    GEV++YKARLV KGY Q+ G+DY E+FAPV RLETVRL+++LAA+N WK+
Sbjct: 888 WVYKAKKNSKGEVERYKARLVAKGYSQRAGIDYDEIFAPVARLETVRLIISLAAQNKWKI 947

Query: 628 HQMDVKSAFLNGYLEDEIYVEQPTGYAKIGEENKKMSI 662
           HQMDVKSAFLNG LE+E+Y+EQP GY   GEE+K + +
Sbjct: 948 HQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKVLRL 980

BLAST of CSPI01G13260 vs. TrEMBL
Match: Q9C536_ARATH (Copia-type polyprotein, putative OS=Arabidopsis thaliana GN=T18I24.5 PE=4 SV=1)

HSP 1 Score: 387.5 bits (994), Expect = 3.3e-104
Identity = 197/395 (49.87%), Positives = 264/395 (66.84%), Query Frame = 1

Query: 268 LIFELVHTDLCGPMRTTTHGDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLK 327
           L+ + + +D  G   +     + ++NGI+ Q TV R+PQQNGVAERKNR I+E+ARSMLK
Sbjct: 588 LVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAERKNRTILEMARSMLK 647

Query: 328 AKKLPDQFWGWGDTVTYAIYLLNRASTESVQCITPQEAWSGLKPTVSHLRMFGCIAYSHI 387
           +K+LP + W   + V  A+YLLNR+ T+SV   TPQEAWSG KP VSHLR+FG IA++H+
Sbjct: 648 SKRLPKELWA--EAVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHV 707

Query: 388 SDEKKGKLDDKSEKCIFAGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAPNEDQ 447
            DEK+ KLDDKSEK IF GY  NSK Y+LYNP +KK IISR++ FDE   W WN+  ED 
Sbjct: 708 PDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEEDY 767

Query: 448 NPL-HVDMDGKKDARDLELEVTQPLTSPSSSHSTSDEETTPRKTRNIQEIYNTSRRILDE 507
           N   H + D  +  R+ E    +P T P+S  S+  EE                      
Sbjct: 768 NFFPHFEEDKPEPTRE-EPPSEEPTTPPTSPTSSQIEE---------------------- 827

Query: 508 EHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNEIWELVKLPENKRALGVKWIY 567
                      +P+ F+EAI+ + W++AM++EI +I++N+ WEL  LP   +A+GVKW+Y
Sbjct: 828 ---------KCEPMDFQEAIEKKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVKWVY 887

Query: 568 RTKLKQNGEVQKYKARLVVKGYKQKFGMDYKEVFAPVTRLETVRLLLALAAKNNWKVHQM 627
           + K    GEV++YKARLV KGY Q+ G+DY EVFAPV RLETVRL+++LAA+N WK+HQM
Sbjct: 888 KAKKNSKGEVERYKARLVAKGYSQRAGIDYDEVFAPVARLETVRLIISLAAQNKWKIHQM 947

Query: 628 DVKSAFLNGYLEDEIYVEQPTGYAKIGEENKKMSI 662
           DVKSAFLNG LE+E+Y+EQP GY   GEE+K + +
Sbjct: 948 DVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKVLRL 948

BLAST of CSPI01G13260 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 130.2 bits (326), Expect = 4.8e-30
Identity = 74/193 (38.34%), Positives = 111/193 (57.51%), Query Frame = 1

Query: 474 PSSSHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVD-FALFANVDPVY----------- 533
           P  S  TS   T  RK   +Q+ Y  S   L    +  F  +  V P+Y           
Sbjct: 27  PEPSVHTSHRRT--RKPAYLQDYYCHSVASLTIHDISQFLSYEKVSPLYHSFLVCIAKAK 86

Query: 534 ----FEEAIQDENWKDAMNQEIDAIRRNEIWELVKLPENKRALGVKWIYRTKLKQNGEVQ 593
               + EA +   W  AM+ EI A+     WE+  LP NK+ +G KW+Y+ K   +G ++
Sbjct: 87  EPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIE 146

Query: 594 KYKARLVVKGYKQKFGMDYKEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYL 651
           +YKARLV KGY Q+ G+D+ E F+PV +L +V+L+LA++A  N+ +HQ+D+ +AFLNG L
Sbjct: 147 RYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDL 206

BLAST of CSPI01G13260 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 88.2 bits (217), Expect = 2.1e-17
Identity = 40/92 (43.48%), Positives = 62/92 (67.39%), Query Frame = 1

Query: 525 AIQDENWKDAMNQEIDAIRRNEIWELVKLPENKRALGVKWIYRTKLKQNGEVQKYKARLV 584
           A++D  W  AM +E+DA+ RN+ W LV  P N+  LG KW+++TKL  +G + + KARLV
Sbjct: 34  ALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLV 93

Query: 585 VKGYKQKFGMDYKEVFAPVTRLETVRLLLALA 617
            KG+ Q+ G+ + E ++PV R  T+R +L +A
Sbjct: 94  AKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CSPI01G13260 vs. TAIR10
Match: AT1G48720.1 (AT1G48720.1 unknown protein)

HSP 1 Score: 84.3 bits (207), Expect = 3.0e-16
Identity = 39/92 (42.39%), Positives = 62/92 (67.39%), Query Frame = 1

Query: 1  MSSNMLQPQLPRFEGKNYMRWSQQMKVLYGSQDLWDIVDNGYSKPESENDLLAQRLNELR 60
          M+SN +  Q+P     NY  WS +MK + G+ D+W+IV+ G+ +PE+E  L   + + LR
Sbjct: 1  MASNNVPFQVPVLTKSNYDNWSLRMKAILGAHDVWEIVEKGFIEPENEGSLSQTQKDGLR 60

Query: 61 DARKKNKKALFFIYQAVDENIFERISGVSTAK 93
          D+RK++KKAL  IYQ +DE+ FE++   ++AK
Sbjct: 61 DSRKRDKKALCLIYQGLDEDTFEKVVEATSAK 92

BLAST of CSPI01G13260 vs. TAIR10
Match: ATMG00710.1 (ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein)

HSP 1 Score: 60.8 bits (146), Expect = 3.5e-09
Identity = 32/87 (36.78%), Positives = 47/87 (54.02%), Query Frame = 1

Query: 315 NRIIMELARSMLKAKKLPDQFWGWGDTVTYAIYLLNRASTESVQCITPQEAWSGLKPTVS 374
           NR I+E  RSML    LP  F    D    A++++N+  + ++    P E W    PT S
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRA--DAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYS 61

Query: 375 HLRMFGCIAYSHISDEKKGKLDDKSEK 402
           +LR FGC+AY H  +   GKL  +++K
Sbjct: 62  YLRRFGCVAYIHCDE---GKLKPRAKK 83

BLAST of CSPI01G13260 vs. NCBI nr
Match: gi|150036244|gb|ABR67407.1| (integrase [Cucumis melo subsp. melo])

HSP 1 Score: 477.6 bits (1228), Expect = 3.4e-131
Identity = 237/374 (63.37%), Positives = 298/374 (79.68%), Query Frame = 1

Query: 287 GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGWGDTVTYAI 346
           G+F KE GI HQ T R T QQNGVAERKNR IME+ARSMLKAK LP++FWG  D V   +
Sbjct: 567 GNFFKEQGIHHQMTARMTTQQNGVAERKNRTIMEMARSMLKAKNLPNEFWG--DAVACTV 626

Query: 347 YLLNRASTESVQCITPQEAWSGLKPTVSHLRMFGCIAYSHISDEKKGKLDDKSEKCIFAG 406
           Y+LNRA T+SV  +TP EAW   KP+VSHL++F  IAYSHI ++ +GKLDDKSEKCI  G
Sbjct: 627 YILNRAPTKSVPGMTPYEAWCDEKPSVSHLKVFRSIAYSHIPNQLRGKLDDKSEKCIMVG 686

Query: 407 YSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLEL 466
           Y+ENSKAYRLYNP+S+K+II+RDV F E + W WN   +E ++P HV+++  + A++LE 
Sbjct: 687 YNENSKAYRLYNPVSRKIIINRDVIFSEDESWNWNDDVDEAKSPFHVNINENEVAQELEQ 746

Query: 467 EVTQPLTSPSSS--HSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFE 526
              Q + S SSS   STS++E +PR+ R+IQEIYN + RI  +   +FALFA V PV F+
Sbjct: 747 AKIQAVESSSSSTSSSTSNDEISPRRMRSIQEIYNNTNRINVDHFANFALFAGVGPVTFD 806

Query: 527 EAIQDENWKDAMNQEIDAIRRNEIWELVKLPENKRALGVKWIYRTKLKQNGEVQKYKARL 586
           EAIQDE WK AM+QEIDAIRRNE WEL++LP NK+ALGVKW+YRTKLK +G V+ YKARL
Sbjct: 807 EAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEIYKARL 866

Query: 587 VVKGYKQKFGMDYKEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYV 646
           VVKGYKQ++G+DY+E+FAPVTR+ET+RL+L+LAA+N WKVHQMD+KSAFLNG+L+DEI+V
Sbjct: 867 VVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVHQMDIKSAFLNGHLKDEIFV 926

Query: 647 EQPTGYAKIGEENK 658
            QP GY + GEE K
Sbjct: 927 AQPLGYVQRGEEEK 938

BLAST of CSPI01G13260 vs. NCBI nr
Match: gi|651219311|gb|AIC77183.1| (polyprotein [Gossypium barbadense])

HSP 1 Score: 418.7 bits (1075), Expect = 1.9e-113
Identity = 203/392 (51.79%), Positives = 284/392 (72.45%), Query Frame = 1

Query: 271 ELVHTDLCGPMRTTTHGDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKK 330
           +++ +D  G      +  F K++GI HQ T RRTPQQNGVAERKNR I+++ARSM+K K 
Sbjct: 608 KILRSDRGGEYTAKLYESFCKDHGIIHQLTARRTPQQNGVAERKNRTILDMARSMIKGKH 667

Query: 331 LPDQFWGWGDTVTYAIYLLNRASTESVQCITPQEAWSGLKPTVSHLRMFGCIAYSHISDE 390
           LP  FW   + V  A+YLLN+  T+SV+  TP+EAWSG KP V HL++FGCIAY+H+ ++
Sbjct: 668 LPRTFWA--EAVECAVYLLNQCPTKSVRHKTPEEAWSGHKPRVGHLKIFGCIAYAHVPEQ 727

Query: 391 KKGKLDDKSEKCIFAGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAPNEDQNPL 450
           ++ KLDD+ EKCIF GY + SKAYRLYNP++KK+IISRDV+FDEA  W+W+   +    L
Sbjct: 728 QRKKLDDRGEKCIFIGYDKRSKAYRLYNPLTKKLIISRDVEFDEADYWRWSEEEKKVEGL 787

Query: 451 HVDMD--GKKDARDLELEVTQPLTSPSSSHSTSDEETTPRKTRNIQEIYNTSRRILDEEH 510
             + D   +++  D +   T   +SP+SS  +S  +  P +TR++ +IYN++  +  E  
Sbjct: 788 FFNEDDNNQEEQGDDQSPGTTAPSSPTSSSGSSSLDEAPTRTRSLNDIYNSTEPV--ETQ 847

Query: 511 VDFALFA---NVDPVYFEEAIQDENWKDAMNQEIDAIRRNEIWELVKLPENKRALGVKWI 570
            D++LF      DPV +EEAI++  WK AM++EI AIRRN+ WEL  LPE    +GVKW+
Sbjct: 848 FDYSLFCLMTECDPVTYEEAIENNKWKKAMDEEIAAIRRNDTWELTSLPEGHSPIGVKWV 907

Query: 571 YRTKLKQNGEVQKYKARLVVKGYKQKFGMDYKEVFAPVTRLETVRLLLALAAKNNWKVHQ 630
           Y+TK  + G+V+KYKARLV KGYKQ+ G+DY E+FAPV R++T+RLL+A+AA+  WK++Q
Sbjct: 908 YKTKTNKEGKVEKYKARLVAKGYKQRQGVDYDEIFAPVARIDTIRLLIAVAAQYKWKIYQ 967

Query: 631 MDVKSAFLNGYLEDEIYVEQPTGYAKIGEENK 658
           MDVKSAFLNGYLE+E+Y+EQP GY+  G+E+K
Sbjct: 968 MDVKSAFLNGYLEEEVYIEQPPGYSIQGKEDK 995

BLAST of CSPI01G13260 vs. NCBI nr
Match: gi|1769898|emb|CAA69272.1| (lectin receptor kinase [Arabidopsis thaliana])

HSP 1 Score: 413.7 bits (1062), Expect = 6.1e-112
Identity = 211/398 (53.02%), Positives = 280/398 (70.35%), Query Frame = 1

Query: 268 LIFELVHTDLCGPMRTTTHGDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLK 327
           L+ + + +D  G   +     + ++NGI+ Q TV R+PQQNGVAERKNR I+E+ARSMLK
Sbjct: 85  LVIKTMRSDSGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAERKNRTILEMARSMLK 144

Query: 328 AKKLPDQFWGWGDTVTYAIYLLNRASTESVQCITPQEAWSGLKPTVSHLRMFGCIAYSHI 387
           +K+LP + W   + V  A+YLLNR+ T+SV   TPQEAWSG KP VSHLR+FG IA++H+
Sbjct: 145 SKRLPKELWA--EAVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHV 204

Query: 388 SDEKKGKLDDKSEKCIFAGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAPNEDQ 447
            DEK+ KLDDKSEK IF GY  NSK Y+LYNP +KK IISR++ FDE   W WN+  ED 
Sbjct: 205 PDEKRNKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEEDY 264

Query: 448 NPL-HVDMDGKKDARDL--ELEVTQPLTSPSSSH-STSDEETTPRKTRNIQEIYNTSRRI 507
           N   H + D  +  R+     E T P TSP+SS    S  E TPR  R+IQE+Y  +   
Sbjct: 265 NFFPHFEEDKPEPTREEPPSEEPTTPPTSPTSSQIEESSSERTPR-FRSIQELYEVTEN- 324

Query: 508 LDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNEIWELVKLPENKRALGVK 567
             E    F LFA  +P+ F+EAI+ + W++AM++EI +I++N+ WEL  LP   +A+GVK
Sbjct: 325 -QENLTLFCLFAECEPMDFQEAIEKKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVK 384

Query: 568 WIYRTKLKQNGEVQKYKARLVVKGYKQKFGMDYKEVFAPVTRLETVRLLLALAAKNNWKV 627
           W+Y+ K    GEV++YKARLV KGY Q+ G+DY E+FAPV RLETVRL+++LAA+N WK+
Sbjct: 385 WVYKAKKNSKGEVERYKARLVAKGYSQRAGIDYDEIFAPVARLETVRLIISLAAQNKWKI 444

Query: 628 HQMDVKSAFLNGYLEDEIYVEQPTGYAKIGEENKKMSI 662
           HQMDVKSAFLNG LE+E+Y+EQP GY   GEE+K + +
Sbjct: 445 HQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKVLRL 477

BLAST of CSPI01G13260 vs. NCBI nr
Match: gi|6996308|emb|CAB75469.1| (copia-type reverse transcriptase-like protein [Arabidopsis thaliana])

HSP 1 Score: 413.7 bits (1062), Expect = 6.1e-112
Identity = 211/398 (53.02%), Positives = 280/398 (70.35%), Query Frame = 1

Query: 268 LIFELVHTDLCGPMRTTTHGDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLK 327
           L+ + + +D  G   +     + ++NGI+ Q TV R+PQQNGVAERKNR I+E+ARSMLK
Sbjct: 588 LVIKTMRSDSGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAERKNRTILEMARSMLK 647

Query: 328 AKKLPDQFWGWGDTVTYAIYLLNRASTESVQCITPQEAWSGLKPTVSHLRMFGCIAYSHI 387
           +K+LP + W   + V  A+YLLNR+ T+SV   TPQEAWSG KP VSHLR+FG IA++H+
Sbjct: 648 SKRLPKELWA--EAVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHV 707

Query: 388 SDEKKGKLDDKSEKCIFAGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAPNEDQ 447
            DEK+ KLDDKSEK IF GY  NSK Y+LYNP +KK IISR++ FDE   W WN+  ED 
Sbjct: 708 PDEKRNKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEEDY 767

Query: 448 NPL-HVDMDGKKDARDL--ELEVTQPLTSPSSSH-STSDEETTPRKTRNIQEIYNTSRRI 507
           N   H + D  +  R+     E T P TSP+SS    S  E TPR  R+IQE+Y  +   
Sbjct: 768 NFFPHFEEDKPEPTREEPPSEEPTTPPTSPTSSQIEESSSERTPR-FRSIQELYEVTEN- 827

Query: 508 LDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNEIWELVKLPENKRALGVK 567
             E    F LFA  +P+ F+EAI+ + W++AM++EI +I++N+ WEL  LP   +A+GVK
Sbjct: 828 -QENLTLFCLFAECEPMDFQEAIEKKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVK 887

Query: 568 WIYRTKLKQNGEVQKYKARLVVKGYKQKFGMDYKEVFAPVTRLETVRLLLALAAKNNWKV 627
           W+Y+ K    GEV++YKARLV KGY Q+ G+DY E+FAPV RLETVRL+++LAA+N WK+
Sbjct: 888 WVYKAKKNSKGEVERYKARLVAKGYSQRAGIDYDEIFAPVARLETVRLIISLAAQNKWKI 947

Query: 628 HQMDVKSAFLNGYLEDEIYVEQPTGYAKIGEENKKMSI 662
           HQMDVKSAFLNG LE+E+Y+EQP GY   GEE+K + +
Sbjct: 948 HQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKVLRL 980

BLAST of CSPI01G13260 vs. NCBI nr
Match: gi|6850900|emb|CAB71063.1| (copia-type polyprotein [Arabidopsis thaliana])

HSP 1 Score: 411.4 bits (1056), Expect = 3.0e-111
Identity = 210/398 (52.76%), Positives = 279/398 (70.10%), Query Frame = 1

Query: 268 LIFELVHTDLCGPMRTTTHGDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLK 327
           L+ + + +D  G   +     + ++NGI+ Q TV R+PQQNGV ERKNR I+E+ARSMLK
Sbjct: 588 LVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVVERKNRTILEMARSMLK 647

Query: 328 AKKLPDQFWGWGDTVTYAIYLLNRASTESVQCITPQEAWSGLKPTVSHLRMFGCIAYSHI 387
           +K+LP + W   + V  A+YLLNR+ T+SV   TPQEAWSG KP VSHLR+FG IA++H+
Sbjct: 648 SKRLPKELWA--EAVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHV 707

Query: 388 SDEKKGKLDDKSEKCIFAGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAPNEDQ 447
            DEK+ KLDDKSEK IF GY  NSK Y+LYNP +KK IISR++ FDE   W WN+  ED 
Sbjct: 708 PDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEEDY 767

Query: 448 NPL-HVDMDGKKDARDL--ELEVTQPLTSPSSSH-STSDEETTPRKTRNIQEIYNTSRRI 507
           N   H + D  +  R+     E T P TSP+SS    S  E TPR  R+IQE+Y  +   
Sbjct: 768 NFFPHFEEDEPEPTREEPPSEEPTTPPTSPTSSQIEESSSERTPR-FRSIQELYEVTEN- 827

Query: 508 LDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNEIWELVKLPENKRALGVK 567
             E    F LFA  +P+ F++AI+ + W++AM++EI +I++N+ WEL  LP   +A+GVK
Sbjct: 828 -QENLTLFCLFAECEPMDFQKAIEKKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVK 887

Query: 568 WIYRTKLKQNGEVQKYKARLVVKGYKQKFGMDYKEVFAPVTRLETVRLLLALAAKNNWKV 627
           W+Y+ K    GEV++YKARLV KGY Q+ G+DY EVFAPV RLETVRL+++LAA+N WK+
Sbjct: 888 WVYKAKKNSKGEVERYKARLVAKGYSQRVGIDYDEVFAPVARLETVRLIISLAAQNKWKI 947

Query: 628 HQMDVKSAFLNGYLEDEIYVEQPTGYAKIGEENKKMSI 662
           HQMDVKSAFLNG LE+E+Y+EQP GY   GEE+K + +
Sbjct: 948 HQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKVLRL 980

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name   E-value   Identity (%)  Description
POLX_TOBAC   6.2e-64   35.17         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
COPIA_DROME  2.3e-26   31.23         Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3
M820_ARATH   3.7e-16   43.48         Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana GN=AtMg0...
M710_ARATH   6.3e-08   36.78         Uncharacterized mitochondrial protein AtMg00710 OS=Arabidopsis thaliana GN=AtMg0...

vs. TrEMBL
Match Name        E-value   Identity (%)  Description
A6YTD9_CUCME      2.4e-131  63.37         Integrase OS=Cucumis melo subsp. melo PE=4 SV=1
A0A068B703_GOSBA  1.3e-113  51.79         Polyprotein OS=Gossypium barbadense PE=4 SV=1
V9H0W6_ARATH      4.3e-112  53.02         Lectin receptor kinase (Fragment) OS=Arabidopsis thaliana PE=4 SV=1
Q9M197_ARATH      4.3e-112  53.02         Copia-type reverse transcriptase-like protein OS=Arabidopsis thaliana GN=T16L24....
Q9C536_ARATH      3.3e-104  49.87         Copia-type polyprotein, putative OS=Arabidopsis thaliana GN=T18I24.5 PE=4 SV=1

vs. TAIR10
Match Name   E-value   Identity (%)  Description
AT4G23160.1  4.8e-30   38.34         cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00820.1  2.1e-17   43.48         ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)
AT1G48720.1  3.0e-16   42.39         unknown protein
ATMG00710.1  3.5e-09   36.78         ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein

vs. NCBI nr
Match Name                   E-value   Identity (%)  Description
gi|150036244|gb|ABR67407.1|  3.4e-131  63.37         integrase [Cucumis melo subsp. melo]
gi|651219311|gb|AIC77183.1|  1.9e-113  51.79         polyprotein [Gossypium barbadense]
gi|1769898|emb|CAA69272.1|   6.1e-112  53.02         lectin receptor kinase [Arabidopsis thaliana]
gi|6996308|emb|CAB75469.1|   6.1e-112  53.02         copia-type reverse transcriptase-like protein [Arabidopsis thaliana]
gi|6850900|emb|CAB71063.1|   3.0e-111  52.76         copia-type polyprotein [Arabidopsis thaliana]

The following terms have been associated with this gene:

Vocabulary: INTERPRO
Term        Definition
IPR001584   Integrase_cat-core
IPR012337   RNaseH-like_sf
IPR013103   RVT_2

Vocabulary: Biological Process
Term        Definition
GO:0015074  DNA integration

Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI01G13260.1  CSPI01G13260.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                                      Source   Source Term        Source Description               Alignment
IPR001584  Integrase, catalytic core                            PROFILE  PS50994            INTEGRASE                        coord: 210..370; score: 13
IPR012337  Ribonuclease H-like domain                           GENE3D   G3DSA:3.30.420.10                                   coord: 234..373; score: 1.6
IPR012337  Ribonuclease H-like domain                           unknown  SSF53098           Ribonuclease H-like              coord: 271..379; score: 1.02
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase  PFAM     PF07727            RVT_2                            coord: 545..658; score: 6.6
None       No IPR available                                     PANTHER  PTHR11439          GAG-POL-RELATED RETROTRANSPOSON  coord: 11..657; score: 1.9E
None       No IPR available                                     PANTHER  PTHR11439:SF127    SUBFAMILY NOT NAMED              coord: 11..657; score: 1.9E
None       No IPR available                                     PFAM     PF14223            Retrotran_gag_2                  coord: 63..136; score: 3.3

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None