CSPI04G19830 (gene) Wild cucumber (PI 183967)

Name: CSPI04G19830
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Retrovirus-related Pol polyprotein from transposon 297 family
Location: Chr4 : 17654720 .. 17655679 (-)
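
The location field can be turned into a span length with a few lines of Python. This is only a sketch: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(text):
    """Parse a location such as 'Chr4 : 17654720 .. 17655679 (-)'.

    Returns (chromosome, start, end, strand).
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1


chrom, start, end, strand = parse_location("Chr4 : 17654720 .. 17655679 (-)")
print(chrom, strand, span_length(start, end))  # Chr4 - 960
```

Under that assumption the feature spans 960 bp on the minus strand of Chr4.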
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCATGATGACTTCACCAAATTTAGCACTGCCAAATTTTACATTGCCGTTTAAAATAGAGACTGACACATCAGGTTATGGAATAGGTGATGTACTTACTCAGAATAATCGACCAATACCCTATTTCAGTCACACTTTGGCCATGGGAGATAGAGCCAAGCCAGTCTATGAACGAGAATTAATGGTTGTAGTATACGCAAACCAACGATGGAAACCATATCTTTTGGGAAGGAGATTTGTAGTTAAAACGGATCAGCGCTCATTGAAGTTTTTATTAGAGCAGAGGGTTATTCAACCTCAACATCAAAGGTGGATGGCTAAATTGCTGGGGTACAATTTTGATGTAGTGTACAAACCTGGCCTTGAAAACAAAGCAGCAGACGCACTATCGAGGGTCCCTCCAACGGTACATCTAAATCATCTCATTGCTCCAGCCTTGCTGGACACATCAGTCCTTAAAACAGAAGTGGAAGAGGATGTCAAGTTGAAGGAAATCATCGCAAAGTTAGAAAAGAGTGAAGCAATTTCTGGTTTTGCTATGCATCAAGGCATGCTGAGGAAGAGACGATTGGTGACATGAAAGAATTCGGTCCTATTGCCTATGGTTCTTCACACTTATCATGACTCAGTCTTTGGAGGCCATTTGGGATGCTTGACGGCTTATAAAAGATTAACAGGTGAGCTATATCGGGAAGGGATGAAAGTTGATGTGCAAAAATACTGTGAGGAATGTTTGGCCTGTCAACGTAACAAAACAATGGCTTTATCACCTGCCGGATTACTCATGTCCCTAGAAATAGCAGATGCGGTTTGGAGCGAGATATCTATGGACTTCATTGATGGCCTACCTAAATCCAAAGGACATGAGGTAATACTTGTTGTAGATATATTTTTGAAACGGAGACAAACCTCTTTATTAAATGAGACTAATGCTCAAAGTATAAGAGAATTATACTAA

mRNA sequence

ATGGCCATGATGACTTCACCAAATTTAGCACTGCCAAATTTTACATTGCCGTTTAAAATAGAGACTGACACATCAGGTTATGGAATAGGTGATGTACTTACTCAGAATAATCGACCAATACCCTATTTCAGTCACACTTTGGCCATGGGAGATAGAGCCAAGCCAGTCTATGAACGAGAATTAATGGTTGTAGTATACGCAAACCAACGATGGAAACCATATCTTTTGGGAAGGAGATTTGTAGTTAAAACGGATCAGCGCTCATTGAAGTTTTTATTAGAGCAGAGGGTTATTCAACCTCAACATCAAAGGTGGATGGCTAAATTGCTGGGGTACAATTTTGATGTAGTGTACAAACCTGGCCTTGAAAACAAAGCAGCAGACGCACTATCGAGGGTCCCTCCAACGGTACATCTAAATCATCTCATTGCTCCAGCCTTGCTGGACACATCAGTCCTTAAAACAGAAGTGGAAGAGGATGTCAAGTTGAAGGAAATCATCGCAAAGTTAGAAAAGAGTGAAGCAATTTCTGTCTTTGGAGGCCATTTGGGATGCTTGACGGCTTATAAAAGATTAACAGGTGAGCTATATCGGGAAGGGATGAAAGTTGATGTGCAAAAATACTGTGAGGAATGTTTGGCCTGTCAACGTAACAAAACAATGGCTTTATCACCTGCCGGATTACTCATGTCCCTAGAAATAGCAGATGCGGTTTGGAGCGAGATATCTATGGACTTCATTGATGGCCTACCTAAATCCAAAGGACATGAGGTAATACTTGTTGTAGATATATTTTTGAAACGGAGACAAACCTCTTTATTAAATGAGACTAATGCTCAAAGTATAAGAGAATTATACTAA

Coding sequence (CDS)

ATGGCCATGATGACTTCACCAAATTTAGCACTGCCAAATTTTACATTGCCGTTTAAAATAGAGACTGACACATCAGGTTATGGAATAGGTGATGTACTTACTCAGAATAATCGACCAATACCCTATTTCAGTCACACTTTGGCCATGGGAGATAGAGCCAAGCCAGTCTATGAACGAGAATTAATGGTTGTAGTATACGCAAACCAACGATGGAAACCATATCTTTTGGGAAGGAGATTTGTAGTTAAAACGGATCAGCGCTCATTGAAGTTTTTATTAGAGCAGAGGGTTATTCAACCTCAACATCAAAGGTGGATGGCTAAATTGCTGGGGTACAATTTTGATGTAGTGTACAAACCTGGCCTTGAAAACAAAGCAGCAGACGCACTATCGAGGGTCCCTCCAACGGTACATCTAAATCATCTCATTGCTCCAGCCTTGCTGGACACATCAGTCCTTAAAACAGAAGTGGAAGAGGATGTCAAGTTGAAGGAAATCATCGCAAAGTTAGAAAAGAGTGAAGCAATTTCTGTCTTTGGAGGCCATTTGGGATGCTTGACGGCTTATAAAAGATTAACAGGTGAGCTATATCGGGAAGGGATGAAAGTTGATGTGCAAAAATACTGTGAGGAATGTTTGGCCTGTCAACGTAACAAAACAATGGCTTTATCACCTGCCGGATTACTCATGTCCCTAGAAATAGCAGATGCGGTTTGGAGCGAGATATCTATGGACTTCATTGATGGCCTACCTAAATCCAAAGGACATGAGGTAATACTTGTTGTAGATATATTTTTGAAACGGAGACAAACCTCTTTATTAAATGAGACTAATGCTCAAAGTATAAGAGAATTATACTAA
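
As a quick consistency check, the start of the CDS can be translated and compared with the query residues shown in the BLAST alignments. The sketch below assumes the standard genetic code (NCBI translation table 1); the `translate` helper and the choice of the first 36 nucleotides are illustrative, not part of the original record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(nt):
    """Translate a nucleotide string in frame 1; unknown codons become 'X'."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(nt[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 36 nucleotides of the coding sequence listed above.
cds_prefix = "ATGGCCATGATGACTTCACCAAATTTAGCACTGCCA"
print(translate(cds_prefix))  # MAMMTSPNLALP
```

Residues 3 onward of this translation (MMTSPNLALP) agree with the query lines that start at position 3 in the alignments.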
BLAST of CSPI04G19830 vs. Swiss-Prot
Match: POL5_DROME (Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 96.7 bits (239), Expect = 4.5e-19
Identity = 52/141 (36.88%), Positives = 81/141 (57.45%), Query Frame = 1

Query: 3   MMTSPNLALPNFTLPFKIETDTSGYGIGDVLTQNN----RPIPYFSHTLAMGDRAKPVYE 62
           + +S  LA P FT PF + TD S + IG VL+Q++    RPI Y S +L   +      E
Sbjct: 420 LCSSEILAFPCFTKPFHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIE 479

Query: 63  RELMVVVYANQRWKPYLLGRRFV-VKTDQRSLKFLLEQRVIQPQHQRWMAKLLGYNFDVV 122
           +E++ ++++    + YL G   + V TD + L F L  R    + +RW A++  YN +++
Sbjct: 480 KEMLAIIWSLDNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELI 539

Query: 123 YKPGLENKAADALSRVPPTVH 139
           YKPG  N  ADALSR+PP ++
Sbjct: 540 YKPGKSNVVADALSRIPPQLN 560
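
The percentages on each HSP summary line are plain ratios of matching columns to alignment length, so they can be recomputed from the counts. The following is a minimal sketch; the regular expression and the `check_hsp_stats` helper are hypothetical, and the pattern deliberately accepts any label beginning with "Pos" so that spelling variants of the positives label still parse.

```python
import re

# Matches e.g. "Identity = 52/141 (36.88%), Positives = 81/141 (57.45%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w* = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line, tolerance=0.005):
    """Recompute identity/positives percentages and compare with the report.

    Returns (identity_pct, positives_pct, consistent).
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, ident_rep, pos, pos_len, pos_rep = match.groups()
    ident_pct = 100 * int(ident) / int(ident_len)
    pos_pct = 100 * int(pos) / int(pos_len)
    consistent = (
        abs(ident_pct - float(ident_rep)) <= tolerance
        and abs(pos_pct - float(pos_rep)) <= tolerance
    )
    return ident_pct, pos_pct, consistent


line = "Identity = 52/141 (36.88%), Positives = 81/141 (57.45%), Query Frame = 1"
ident_pct, pos_pct, ok = check_hsp_stats(line)
print(f"{ident_pct:.2f} {pos_pct:.2f} {ok}")  # 36.88 57.45 True
```

The same function can be applied to every HSP summary line in the report to flag any figure that does not follow from its counts.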

BLAST of CSPI04G19830 vs. Swiss-Prot
Match: POL3_DROME (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 95.9 bits (237), Expect = 7.7e-19
Identity = 48/127 (37.80%), Positives = 73/127 (57.48%), Query Frame = 1

Query: 7   PNLALPNFTLPFKIETDTSGYGIGDVLTQNNRPIPYFSHTLAMGDRAKPVYERELMVVVY 66
           P L +P+FT  F + TD S   +G VL+Q+  P+ Y S TL   +      E+EL+ +V+
Sbjct: 498 PILKVPDFTKKFTLTTDASDVALGAVLSQDGHPLSYISRTLNEHEINYSTIEKELLAIVW 557

Query: 67  ANQRWKPYLLGRRFVVKTDQRSLKFLLEQRVIQPQHQRWMAKLLGYNFDVVYKPGLENKA 126
           A + ++ YLLGR F + +D + L +L   +    +  RW  KL  ++FD+ Y  G EN  
Sbjct: 558 ATKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCV 617

Query: 127 ADALSRV 134
           ADALSR+
Sbjct: 618 ADALSRI 624

BLAST of CSPI04G19830 vs. Swiss-Prot
Match: POL2_DROME (Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 95.1 bits (235), Expect = 1.3e-18
Identity = 48/131 (36.64%), Positives = 76/131 (58.02%), Query Frame = 1

Query: 3   MMTSPNLALPNFTLPFKIETDTSGYGIGDVLTQNNRPIPYFSHTLAMGDRAKPVYERELM 62
           ++  P L LP+F   F + TD S   +G VL+QN  PI + S TL   +      E+EL+
Sbjct: 493 IIRDPILQLPDFEKKFVLTTDASNLALGAVLSQNGHPISFISRTLNDHELNYSAIEKELL 552

Query: 63  VVVYANQRWKPYLLGRRFVVKTDQRSLKFLLEQRVIQPQHQRWMAKLLGYNFDVVYKPGL 122
            +V+A + ++ YLLGR+F++ +D + L++L   +    + +RW  +L  Y F + Y  G 
Sbjct: 553 AIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKGK 612

Query: 123 ENKAADALSRV 134
           EN  ADALSR+
Sbjct: 613 ENSVADALSRI 623

BLAST of CSPI04G19830 vs. Swiss-Prot
Match: POLY_DROME (Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 88.2 bits (217), Expect = 1.6e-16
Identity = 50/125 (40.00%), Positives = 70/125 (56.00%), Query Frame = 1

Query: 9   LALPNFTLPFKIETDTSGYGIGDVLTQNNRPIPYFSHTLAMGDRAKPVYERELMVVVYAN 68
           L  P+F  PF + TD S  GIG VL+Q  RPI   S TL   ++     EREL+ +V+A 
Sbjct: 486 LKYPDFKKPFDLTTDASASGIGAVLSQEGRPITMISRTLKQPEQNYATNERELLAIVWAL 545

Query: 69  QRWKPYLLGRRFV-VKTDQRSLKFLLEQRVIQPQHQRWMAKLLGYNFDVVYKPGLENKAA 128
            + + +L G R + + TD + L F +  R    + +RW + +  +N  V YKPG EN  A
Sbjct: 546 GKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKPGKENFVA 605

Query: 129 DALSR 133
           DALSR
Sbjct: 606 DALSR 610

BLAST of CSPI04G19830 vs. Swiss-Prot
Match: POL4_DROME (Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaster GN=POL PE=3 SV=1)

HSP 1 Score: 83.6 bits (205), Expect = 3.9e-15
Identity = 49/129 (37.98%), Positives = 66/129 (51.16%), Query Frame = 1

Query: 9   LALPNFTLPFKIETDTSGYGIGDVLTQNNR----PIPYFSHTLAMGDRAKPVYERELMVV 68
           L  P+F+  F I TD S    G VLTQN+     P+ Y S     G+  K   E+EL  +
Sbjct: 607 LQYPDFSKEFCITTDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAI 666

Query: 69  VYANQRWKPYLLGRRFVVKTDQRSLKFLLEQRVIQPQHQRWMAKLLGYNFDVVYKPGLEN 128
            +A   ++PY+ G+ F VKTD R L +L        +  R   +L  YNF V Y  G +N
Sbjct: 667 HWAIIHFRPYIYGKHFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKGKDN 726

Query: 129 KAADALSRV 134
             ADALSR+
Sbjct: 727 HVADALSRI 735

BLAST of CSPI04G19830 vs. TrEMBL
Match: A5B2I6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_043911 PE=4 SV=1)

HSP 1 Score: 266.9 bits (681), Expect = 2.8e-68
Identity = 136/299 (45.48%), Positives = 189/299 (63.21%), Query Frame = 1

Query: 3    MMTSPNLALPNFTLPFKIETDTSGYGIGDVLTQNNRPIPYFSHTLAMGDRAKPVYERELM 62
            M T P LALPNF+  F +E D SGYG+G VL Q++RP+ YFS  L   +R K +YERELM
Sbjct: 1519 MTTIPVLALPNFSQLFIVEMDASGYGLGTVLMQSHRPVAYFSQVLTARERQKSIYERELM 1578

Query: 63   VVVYANQRWKPYLLGRRFVVKTDQRSLKFLLEQRVIQPQHQRWMAKLLGYNFDVVYKPGL 122
             +V A Q+W+ YLLGR F+V+TDQ SLKFLLEQR++   +Q+W+AKL GY+F++ ++PG 
Sbjct: 1579 AIVLAVQKWRHYLLGRHFIVRTDQSSLKFLLEQRIVNESYQKWVAKLFGYDFEIQFRPGX 1638

Query: 123  ENKAADALSRVPPTVHLNHLIAPALLDTSVLKTEVEEDVKLKEI---------------- 182
            ENKAADALSR+P ++ L  L+ P+ +DT ++ ++VE D  L +I                
Sbjct: 1639 ENKAADALSRIPISMELXALMVPSRIDTXLISSQVEADPHLXKIKQRLLXDPDAYPRYSL 1698

Query: 183  -------------------IAKLEKSEAISVFGGHLGCLTAYKRLTGELYREGMKVDVQK 242
                               +  L +    SV GGH G L  YKRLT + +  GMK D+++
Sbjct: 1699 DHGILLYKGRLVLPKASPLVPALLQEGHASVVGGHSGFLXTYKRLTRDFFWVGMKNDIKE 1758

Query: 243  YCEECLACQRNKTMALSPAGLLMSLEIADAVWSEISMDFIDGLPKSKGHEVILVVDIFL 267
            + E+CL CQ+NKT+ LSPAGLL  L I D +W +++MDFI+GLPKS+    ++V   FL
Sbjct: 1759 FVEKCLVCQQNKTLTLSPAGLLQPLPIPDKIWDDVTMDFIEGLPKSE----VIVTRYFL 1813

BLAST of CSPI04G19830 vs. TrEMBL
Match: E2DMZ5_BETVU (Putative uncharacterized protein OS=Beta vulgaris PE=4 SV=1)

HSP 1 Score: 240.7 bits (613), Expect = 2.1e-60
Identity = 131/297 (44.11%), Positives = 171/297 (57.58%), Query Frame = 1

Query: 2    AMMTSPNLALPNFTLPFKIETDTSGYGIGDVLTQNNRPIPYFSHTLAMGDRAKPVYEREL 61
            A+  +P L +PNF+LPF IE D SGYG+G VL Q   PI YFS TL    RAK +YE+EL
Sbjct: 902  ALTEAPVLQMPNFSLPFVIEADASGYGLGAVLLQQGHPIAYFSKTLGERARAKSIYEKEL 961

Query: 62   MVVVYANQRWKPYLLGRRFVVKTDQRSLKFLLEQRVIQPQHQRWMAKLLGYNFDVVYKPG 121
            M VV A Q+WK +LLGR FV+ +DQ+SL+ LL QR I P +Q+W+ KLLG++F++ YKPG
Sbjct: 962  MAVVMAVQKWKHFLLGRHFVIHSDQQSLRHLLNQREIGPAYQKWVGKLLGFDFEIKYKPG 1021

Query: 122  LENKAADALSRV-PPTVHLNHLIAPALLDTSVLKTEVEEDVKLKEIIAKLEKSEA----- 181
              NK ADALSR  PP    N L +       ++   + +D  L+ ++A++          
Sbjct: 1022 GHNKVADALSRKHPPEAEYNLLTSSHSPHQELIAQAIRQDADLQHLMAEVTAGRTPLQGF 1081

Query: 182  ------------------------------ISVFGGHLGCLTAYKRLTGELYREGMKVDV 241
                                           S  GGH G    YKRL GE Y +GMK DV
Sbjct: 1082 TVEHGLLKYNGRLVIPKNVPLTTTLLEEYHSSPMGGHSGIFKTYKRLAGEWYWKGMKKDV 1141

Query: 242  QKYCEECLACQRNKTMALSPAGLLMSLEIADAVWSEISMDFIDGLPKSKGHEVILVV 263
              + + C  CQ+ KT  LSPAGLL  L I  A+W +ISMDF++GLPKS+G + ILVV
Sbjct: 1142 TTFVQNCQICQQFKTSTLSPAGLLQPLPIPLAIWEDISMDFVEGLPKSQGWDTILVV 1198

BLAST of CSPI04G19830 vs. TrEMBL
Match: A0A087HFW3_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA2G074100 PE=4 SV=1)

HSP 1 Score: 240.0 bits (611), Expect = 3.7e-60
Identity = 133/296 (44.93%), Positives = 175/296 (59.12%), Query Frame = 1

Query: 2    AMMTSPNLALPNFTLPFKIETDTSGYGIGDVLTQNNRPIPYFSHTLAMGDRAKPVYEREL 61
            AM T P LAL +FT  F +E+D SG G+G VL Q  RP+ YFS  L    R K VYEREL
Sbjct: 915  AMSTVPVLALVDFTEQFVVESDASGTGLGAVLMQQQRPLAYFSQALTERQRLKSVYEREL 974

Query: 62   MVVVYANQRWKPYLLGRRFVVKTDQRSLKFLLEQRVIQPQHQRWMAKLLGYNFDVVYKPG 121
            M +V+A Q+W+ YLLGR+FVV+TDQ+SLKFLLEQR I  ++Q+W+ KLLG++F++ YKPG
Sbjct: 975  MAIVFAIQKWRHYLLGRKFVVRTDQKSLKFLLEQREINLEYQKWLTKLLGFDFEIQYKPG 1034

Query: 122  LENKAADALSRVPPTVHLNHLIAPALLDTSVLKTEVEEDVKL------------------ 181
            LENKAADALSR    VH+  L  PA +  + + TEV++D  L                  
Sbjct: 1035 LENKAADALSRKDIAVHMCALSVPAAIQLAHINTEVDKDPDLHKLKQEVLLDTAAHSEFS 1094

Query: 182  --------KEIIAKLEKSEAISVF---------GGHLGCLTAYKRLTGELYREGMKVDVQ 241
                    K  +     S  + V          GGH G L   KR+    Y +GM   ++
Sbjct: 1095 VVQGRLLRKGKLVLPATSLLVDVILQEFHTGKLGGHGGVLKTQKRIAEVFYWKGMMSRIR 1154

Query: 242  KYCEECLACQRNKTMALSPAGLLMSLEIADAVWSEISMDFIDGLPKSKGHEVILVV 263
             +   C  CQR+K   L+PAGLL  L I + VW +ISMDF++GLPKS+G  V++VV
Sbjct: 1155 DFVAACQVCQRHKYSTLAPAGLLQPLPIPEQVWEDISMDFVEGLPKSEGFAVVMVV 1210

BLAST of CSPI04G19830 vs. TrEMBL
Match: A0A0V0HQ75_SOLCH (Putative ovule protein (Fragment) OS=Solanum chacoense PE=4 SV=1)

HSP 1 Score: 237.3 bits (604), Expect = 2.4e-59
Identity = 132/295 (44.75%), Positives = 165/295 (55.93%), Query Frame = 1

Query: 3   MMTSPNLALPNFTLPFKIETDTSGYGIGDVLTQNNRPIPYFSHTLAMGDRAKPVYERELM 62
           M   P LA+PNF+ PF IE D SG+G+G VL QN +PI YFS  L+   R   VYERELM
Sbjct: 1   MTQVPVLAMPNFSQPFVIEADASGFGVGAVLMQNGKPISYFSRMLSSRARQCSVYERELM 60

Query: 63  VVVYANQRWKPYLLGRRFVVKTDQRSLKFLLEQRVIQPQHQRWMAKLLGYNFDVVYKPGL 122
            +V A +RW  YLLG  F++KTDQ++LKFLL QRV+    Q+W++KL+GY F++ YKPG+
Sbjct: 61  AIVLAVKRWNHYLLGHHFIIKTDQKALKFLLGQRVMDENQQKWVSKLMGYKFEIKYKPGV 120

Query: 123 ENKAADALSRVPPTVHLNHLIAPALLDTSVLKTEVEEDVKLKEIIAK------------- 182
           EN+ ADALSR   +  L         D      EV+ D K+  I+ K             
Sbjct: 121 ENRVADALSRRGESSELYAFSIWQYKDKEEWDQEVQRDTKMASILQKSISGQQSDDNYSL 180

Query: 183 ----------------------LEKSEAISVFGGHLGCLTAYKRLTGELYREGMKVDVQK 242
                                 L K    S  GGH G L  YKRL+  +Y EGMK DVQ 
Sbjct: 181 KNGCLLYKGRLVLPKGSSRIPGLLKEFHSSPIGGHSGYLRTYKRLSENIYWEGMKRDVQD 240

Query: 243 YCEECLACQRNKTMALSPAGLLMSLEIADAVWSEISMDFIDGLPKSKGHEVILVV 263
           +   C  CQ+NK+  LSPAGLL  L I   VW ++SMDFI GLPKS   + ILVV
Sbjct: 241 FVARCEICQKNKSQTLSPAGLLQPLPIPHHVWEDVSMDFITGLPKSHRFDTILVV 295

BLAST of CSPI04G19830 vs. TrEMBL
Match: A0A087GZN3_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA5G271700 PE=4 SV=1)

HSP 1 Score: 236.9 bits (603), Expect = 3.1e-59
Identity = 126/301 (41.86%), Positives = 178/301 (59.14%), Query Frame = 1

Query: 2    AMMTSPNLALPNFTLPFKIETDTSGYGIGDVLTQNNRPIPYFSHTLAMGDRAKPVYEREL 61
            AM ++P LALP+F+  F +E+D SG+G+G VL Q + PI YFSH L   ++ KP+YEREL
Sbjct: 1061 AMSSAPVLALPDFSESFIVESDASGFGVGAVLMQRHNPIAYFSHGLTEREQLKPIYEREL 1120

Query: 62   MVVVYANQRWKPYLLGRRFVVKTDQRSLKFLLEQRVIQPQHQRWMAKLLGYNFDVVYKPG 121
            M +V A Q+WK YLLG++FVV+TDQ+SLKFLLEQR +   +QRW+ KLLGY FD++YKPG
Sbjct: 1121 MAIVMAIQKWKHYLLGKKFVVRTDQKSLKFLLEQREVSLDYQRWLVKLLGYEFDIIYKPG 1180

Query: 122  LENKAADALSRVP-----PTVHLNHLIAPALLDTSVLKTEVEEDVKLKEIIAKLEK---- 181
            +EN AAD LSR+        V L+    P  +    +  E+EE   ++++  +L++    
Sbjct: 1181 VENAAADGLSRIERVELSEAVMLSAFSVPTAIQLQDIFKEIEESDYIQKLSKQLDEGVPL 1240

Query: 182  ----------------------SEAI---------SVFGGHLGCLTAYKRLTGELYREGM 241
                                  S AI          + GGH G L   KR+  + Y   M
Sbjct: 1241 KPGYTRHRGRMFYKGRLVLPPGSAAIPWILEEFHAGIQGGHSGVLKTQKRIQAQFYWPKM 1300

Query: 242  KVDVQKYCEECLACQRNKTMALSPAGLLMSLEIADAVWSEISMDFIDGLPKSKGHEVILV 263
            + D+Q++   C  CQ +K   L+PAGLL  L I + +W +++MDFI+GLP S G  VI+V
Sbjct: 1301 RQDIQEFVATCQICQTHKYSTLAPAGLLQPLPIPEKIWEDVAMDFIEGLPTSNGVNVIMV 1360

BLAST of CSPI04G19830 vs. NCBI nr
Match: gi|147854459|emb|CAN78588.1| (hypothetical protein VITISV_043911 [Vitis vinifera])

HSP 1 Score: 266.9 bits (681), Expect = 4.0e-68
Identity = 136/299 (45.48%), Positives = 189/299 (63.21%), Query Frame = 1

Query: 3    MMTSPNLALPNFTLPFKIETDTSGYGIGDVLTQNNRPIPYFSHTLAMGDRAKPVYERELM 62
            M T P LALPNF+  F +E D SGYG+G VL Q++RP+ YFS  L   +R K +YERELM
Sbjct: 1519 MTTIPVLALPNFSQLFIVEMDASGYGLGTVLMQSHRPVAYFSQVLTARERQKSIYERELM 1578

Query: 63   VVVYANQRWKPYLLGRRFVVKTDQRSLKFLLEQRVIQPQHQRWMAKLLGYNFDVVYKPGL 122
             +V A Q+W+ YLLGR F+V+TDQ SLKFLLEQR++   +Q+W+AKL GY+F++ ++PG 
Sbjct: 1579 AIVLAVQKWRHYLLGRHFIVRTDQSSLKFLLEQRIVNESYQKWVAKLFGYDFEIQFRPGX 1638

Query: 123  ENKAADALSRVPPTVHLNHLIAPALLDTSVLKTEVEEDVKLKEI---------------- 182
            ENKAADALSR+P ++ L  L+ P+ +DT ++ ++VE D  L +I                
Sbjct: 1639 ENKAADALSRIPISMELXALMVPSRIDTXLISSQVEADPHLXKIKQRLLXDPDAYPRYSL 1698

Query: 183  -------------------IAKLEKSEAISVFGGHLGCLTAYKRLTGELYREGMKVDVQK 242
                               +  L +    SV GGH G L  YKRLT + +  GMK D+++
Sbjct: 1699 DHGILLYKGRLVLPKASPLVPALLQEGHASVVGGHSGFLXTYKRLTRDFFWVGMKNDIKE 1758

Query: 243  YCEECLACQRNKTMALSPAGLLMSLEIADAVWSEISMDFIDGLPKSKGHEVILVVDIFL 267
            + E+CL CQ+NKT+ LSPAGLL  L I D +W +++MDFI+GLPKS+    ++V   FL
Sbjct: 1759 FVEKCLVCQQNKTLTLSPAGLLQPLPIPDKIWDDVTMDFIEGLPKSE----VIVTRYFL 1813

BLAST of CSPI04G19830 vs. NCBI nr
Match: gi|923869199|ref|XP_013709039.1| (PREDICTED: uncharacterized protein LOC106412673 [Brassica napus])

HSP 1 Score: 248.8 bits (634), Expect = 1.1e-62
Identity = 134/303 (44.22%), Positives = 181/303 (59.74%), Query Frame = 1

Query: 1    MAMMTSPNLALPNFTLPFKIETDTSGYGIGDVLTQNNRPIPYFSHTLAMGDRAKPVYERE 60
            +AM+T+P LALP+FT PF +E+D SG+G+G VL QNN PI YFSH L   ++ KP+YERE
Sbjct: 873  IAMVTAPVLALPDFTKPFIVESDASGFGLGAVLMQNNHPIAYFSHGLTPREQLKPIYERE 932

Query: 61   LMVVVYANQRWKPYLLGRRFVVKTDQRSLKFLLEQRVIQPQHQRWMAKLLGYNFDVVYKP 120
            LM +V + Q+W+ YLLGRRFVV+TDQ+SLK+LLEQR I   +QRW+ ++LGY FD+ YK 
Sbjct: 933  LMAIVMSIQKWRHYLLGRRFVVRTDQQSLKYLLEQREITLDYQRWLTRILGYEFDIEYKV 992

Query: 121  GLENKAADALSRVPPTV------HLNHLIAPALLDTSVLKTEVEEDVKLKEIIAKLEKSE 180
            G ENK AD LSR+  TV       L  L  P  L    L  E++ED +++ +IAKL + E
Sbjct: 993  GSENKVADGLSRIDHTVIDEAGLTLLALTVPVTLQMQDLYREIDEDEEIQGMIAKLLQGE 1052

Query: 181  AI-----------------------------------SVFGGHLGCLTAYKRLTGELYRE 240
             +                                   ++ GGH G L   +R+    Y  
Sbjct: 1053 GVKQGFCLVHGRLFYKQKLVIPRSSNQIPVILQECHDTIMGGHAGVLRTLQRVKAMFYWP 1112

Query: 241  GMKVDVQKYCEECLACQRNKTMALSPAGLLMSLEIADAVWSEISMDFIDGLPKSKGHEVI 263
             M+  VQ+Y   C  CQ +K   LSPAGLL  +E+   +W +I+MDF++GLP S+G  VI
Sbjct: 1113 KMRSVVQEYVAACSVCQTHKYSTLSPAGLLQPIELPVRIWEDIAMDFVEGLPVSQGVNVI 1172

BLAST of CSPI04G19830 vs. NCBI nr
Match: gi|923614274|ref|XP_013745228.1| (PREDICTED: uncharacterized protein LOC106447810 [Brassica napus])

HSP 1 Score: 244.2 bits (622), Expect = 2.8e-61
Identity = 137/296 (46.28%), Positives = 177/296 (59.80%), Query Frame = 1

Query: 2    AMMTSPNLALPNFTLPFKIETDTSGYGIGDVLTQNNRPIPYFSHTLAMGDRAKPVYEREL 61
            AM T P LALP+F   F IE+D SG G+G VL Q  RPI YFS  L    + K VYEREL
Sbjct: 923  AMATVPVLALPDFNEQFVIESDASGVGLGAVLMQRQRPIAYFSQALTERQQMKSVYEREL 982

Query: 62   MVVVYANQRWKPYLLGRRFVVKTDQRSLKFLLEQRVIQPQHQRWMAKLLGYNFDVVYKPG 121
            M +V+A Q+W+ YLLGR+FVV+TDQ+SLKFLLEQR I  ++QRW+ K+LG++FD+ YKPG
Sbjct: 983  MAIVFAIQKWRHYLLGRKFVVRTDQKSLKFLLEQREINMEYQRWLTKILGFDFDIHYKPG 1042

Query: 122  LENKAADALSR-----------VPPTVHLNHLIAPALLDTSVLKT--EVEEDV------- 181
            LENKAADALSR           VP ++ L  + +    D+ + K   E+ +D        
Sbjct: 1043 LENKAADALSRKSPVTELFAVSVPVSIQLEEVGSEVERDSELSKLIQELTQDPSSHPDYT 1102

Query: 182  ---------------KLKEIIAKLEKSEAISVFGGHLGCLTAYKRLTGELYREGMKVDVQ 241
                           K  ++I  + K    S +GGH G L   KR+ G  Y  GM  D++
Sbjct: 1103 LVQGRLLRHGKLVLPKTSKLIELILKEYHDSKYGGHGGVLKTQKRIGGLFYWAGMMTDIR 1162

Query: 242  KYCEECLACQRNKTMALSPAGLLMSLEIADAVWSEISMDFIDGLPKSKGHEVILVV 263
            KY   C  CQR+K   L+P GLL  L +   VW +IS+DFI+GLPKS+G  VILVV
Sbjct: 1163 KYVASCQTCQRHKYSTLAPGGLLQPLPVPTNVWEDISLDFIEGLPKSEGVNVILVV 1218

BLAST of CSPI04G19830 vs. NCBI nr
Match: gi|261865347|gb|ACY01928.1| (hypothetical protein [Beta vulgaris])

HSP 1 Score: 240.7 bits (613), Expect = 3.1e-60
Identity = 131/297 (44.11%), Positives = 171/297 (57.58%), Query Frame = 1

Query: 2    AMMTSPNLALPNFTLPFKIETDTSGYGIGDVLTQNNRPIPYFSHTLAMGDRAKPVYEREL 61
            A+  +P L +PNF+LPF IE D SGYG+G VL Q   PI YFS TL    RAK +YE+EL
Sbjct: 902  ALTEAPVLQMPNFSLPFVIEADASGYGLGAVLLQQGHPIAYFSKTLGERARAKSIYEKEL 961

Query: 62   MVVVYANQRWKPYLLGRRFVVKTDQRSLKFLLEQRVIQPQHQRWMAKLLGYNFDVVYKPG 121
            M VV A Q+WK +LLGR FV+ +DQ+SL+ LL QR I P +Q+W+ KLLG++F++ YKPG
Sbjct: 962  MAVVMAVQKWKHFLLGRHFVIHSDQQSLRHLLNQREIGPAYQKWVGKLLGFDFEIKYKPG 1021

Query: 122  LENKAADALSRV-PPTVHLNHLIAPALLDTSVLKTEVEEDVKLKEIIAKLEKSEA----- 181
              NK ADALSR  PP    N L +       ++   + +D  L+ ++A++          
Sbjct: 1022 GHNKVADALSRKHPPEAEYNLLTSSHSPHQELIAQAIRQDADLQHLMAEVTAGRTPLQGF 1081

Query: 182  ------------------------------ISVFGGHLGCLTAYKRLTGELYREGMKVDV 241
                                           S  GGH G    YKRL GE Y +GMK DV
Sbjct: 1082 TVEHGLLKYNGRLVIPKNVPLTTTLLEEYHSSPMGGHSGIFKTYKRLAGEWYWKGMKKDV 1141

Query: 242  QKYCEECLACQRNKTMALSPAGLLMSLEIADAVWSEISMDFIDGLPKSKGHEVILVV 263
              + + C  CQ+ KT  LSPAGLL  L I  A+W +ISMDF++GLPKS+G + ILVV
Sbjct: 1142 TTFVQNCQICQQFKTSTLSPAGLLQPLPIPLAIWEDISMDFVEGLPKSQGWDTILVV 1198

BLAST of CSPI04G19830 vs. NCBI nr
Match: gi|674248250|gb|KFK41015.1| (hypothetical protein AALP_AA2G074100 [Arabis alpina])

HSP 1 Score: 240.0 bits (611), Expect = 5.2e-60
Identity = 133/296 (44.93%), Positives = 175/296 (59.12%), Query Frame = 1

Query: 2    AMMTSPNLALPNFTLPFKIETDTSGYGIGDVLTQNNRPIPYFSHTLAMGDRAKPVYEREL 61
            AM T P LAL +FT  F +E+D SG G+G VL Q  RP+ YFS  L    R K VYEREL
Sbjct: 915  AMSTVPVLALVDFTEQFVVESDASGTGLGAVLMQQQRPLAYFSQALTERQRLKSVYEREL 974

Query: 62   MVVVYANQRWKPYLLGRRFVVKTDQRSLKFLLEQRVIQPQHQRWMAKLLGYNFDVVYKPG 121
            M +V+A Q+W+ YLLGR+FVV+TDQ+SLKFLLEQR I  ++Q+W+ KLLG++F++ YKPG
Sbjct: 975  MAIVFAIQKWRHYLLGRKFVVRTDQKSLKFLLEQREINLEYQKWLTKLLGFDFEIQYKPG 1034

Query: 122  LENKAADALSRVPPTVHLNHLIAPALLDTSVLKTEVEEDVKL------------------ 181
            LENKAADALSR    VH+  L  PA +  + + TEV++D  L                  
Sbjct: 1035 LENKAADALSRKDIAVHMCALSVPAAIQLAHINTEVDKDPDLHKLKQEVLLDTAAHSEFS 1094

Query: 182  --------KEIIAKLEKSEAISVF---------GGHLGCLTAYKRLTGELYREGMKVDVQ 241
                    K  +     S  + V          GGH G L   KR+    Y +GM   ++
Sbjct: 1095 VVQGRLLRKGKLVLPATSLLVDVILQEFHTGKLGGHGGVLKTQKRIAEVFYWKGMMSRIR 1154

Query: 242  KYCEECLACQRNKTMALSPAGLLMSLEIADAVWSEISMDFIDGLPKSKGHEVILVV 263
             +   C  CQR+K   L+PAGLL  L I + VW +ISMDF++GLPKS+G  V++VV
Sbjct: 1155 DFVAACQVCQRHKYSTLAPAGLLQPLPIPEQVWEDISMDFVEGLPKSEGFAVVMVV 1210

The following BLAST results are available for this feature:

Swiss-Prot
Match Name    E-value    Identity (%)    Description
POL5_DROME    4.5e-19    36.88           Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogast...
POL3_DROME    7.7e-19    37.80           Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogast...
POL2_DROME    1.3e-18    36.64           Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaste...
POLY_DROME    1.6e-16    40.00           Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogas...
POL4_DROME    3.9e-15    37.98           Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaste...

TrEMBL
Match Name          E-value    Identity (%)    Description
A5B2I6_VITVI        2.8e-68    45.48           Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_043911 PE=4 SV=1
E2DMZ5_BETVU        2.1e-60    44.11           Putative uncharacterized protein OS=Beta vulgaris PE=4 SV=1
A0A087HFW3_ARAAL    3.7e-60    44.93           Uncharacterized protein OS=Arabis alpina GN=AALP_AA2G074100 PE=4 SV=1
A0A0V0HQ75_SOLCH    2.4e-59    44.75           Putative ovule protein (Fragment) OS=Solanum chacoense PE=4 SV=1
A0A087GZN3_ARAAL    3.1e-59    41.86           Uncharacterized protein OS=Arabis alpina GN=AALP_AA5G271700 PE=4 SV=1

NCBI nr
Match Name                          E-value    Identity (%)    Description
gi|147854459|emb|CAN78588.1|        4.0e-68    45.48           hypothetical protein VITISV_043911 [Vitis vinifera]
gi|923869199|ref|XP_013709039.1|    1.1e-62    44.22           PREDICTED: uncharacterized protein LOC106412673 [Brassica napus]
gi|923614274|ref|XP_013745228.1|    2.8e-61    46.28           PREDICTED: uncharacterized protein LOC106447810 [Brassica napus]
gi|261865347|gb|ACY01928.1|         3.1e-60    44.11           hypothetical protein [Beta vulgaris]
gi|674248250|gb|KFK41015.1|         5.2e-60    44.93           hypothetical protein AALP_AA2G074100 [Arabis alpina]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term    Definition
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0003674        molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CSPI04G19830.1    CSPI04G19830.1    mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term    IPR Description     Source     Source Term        Source Description     Alignment
None        No IPR available    PANTHER    PTHR24559          FAMILY NOT NAMED       coord: 2..262, score: 3.6
None        No IPR available    PANTHER    PTHR24559:SF201    SUBFAMILY NOT NAMED    coord: 2..262, score: 3.6
None        No IPR available    unknown    SSF56672           DNA/RNA polymerases    coord: 2..119, score: 1.27