CSPI01G33130 (gene) Wild cucumber (PI 183967)

Name: CSPI01G33130
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Transposon Ty3-I Gag-Pol polyprotein
Location: Chr1 : 28017536 .. 28019301 (+)
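
The Location field packs chromosome, start, end and strand into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention, but not stated on this page), the string can be split and the span length computed as follows. The function name `parse_location` is hypothetical and used only for illustration.

```python
import re

def parse_location(text):
    """Split a 'Chr : start .. end (strand)' string into its parts."""
    m = re.search(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Chr1 : 28017536 .. 28019301 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that coordinate assumption the gene span is 1766 bases on the plus strand of Chr1.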
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGAAATTGGTTGATGAGATGCTAATTTCAGGAGTAATTCGTCCGAGCAATAGTCCTTATTCCAGTCCGGTTTTGTTGGTGAGGAAGAAAGACGGAAGTTAGAGATGTTGTGTTGACTACAGGGCATTAAATAATGTCACAATCCCTGACAAATTCCCCATTCCTGTTATTGAGGAGTTGTTTGATGAACTCAATGGGGCAAAATGGTTTTCTAAGATTGACCTTAGGGCTGGATACCATTAAATTCGGATGCGAGGTGAAGACATAGAGAAAACGGCATTCCGGACACATGAGGGACATTATGAATTCATGGTCATGCCATTTGGGTTGACTAATGCTCCATCTACGTTGCAGTCATTGATGAACACGATTTTTCGACCATATTTGCGGAAATTCATATTAGTATTCTTTGATGATATTCTGATATACAATCAGGATTTAGAAGAGCACCTGCAGCATGCGTTGGAAGTTCTAAGGAAGAGTGAACTGTATGCTAACATAAAAAAATGTAGCTTTGCGAAATAACGGGTGGATTATTTAGGCCATGTCATCTCTCGACAAGGGGTGGAGGTAAACCCGGAGAAAATCCGAGCAATCAAAGAATGGCCCATACCGACGAATGTGAGGGAAGTTCGGGGGTTCCTTGGATTGACAGGGTACTATAGGAAATTTGTACAACACTATGGATCTATTGCAGCCCCGTTAACTCAGCTAATGAAGAATGGAGGATTCAGATGGACCGAGGAAACAAATGAAGCCTTTCGGCATTTGCAGGATGCAATGGTGACCCTTCCAGTCTTGGCACAACCAGATTTCAGCTACACTTTTGAACTCGAAACTGATGCTTCGGGGTATGGGATAGGGCAGTCTTGATGCAAGCCAAGCGACCTATCGCGTATTTCAGCCACACTTTGACTTTGAGAGATAGAGCTAAACCAGTGTACGAGAGGGAGTTGATGGCAGTAGTATTAGCAGTACAGTGATGGCGACCTTACCTACTAGGGAATAAATTTGTGGTCCGAACAGATCAAAAGTCTTTAAAGTTCTTGTTAGAACAATGAGTCATACAACCGCAATACCAGAAATGGGTAGCCAAATTACTAGGTTACTCCTTCGAAGTGGTGTATAAACCACGTCTGGACAATAAGGCAGCTGACGCTTTGTCAAGAATACCTCATACCATTGAACTATGCAATTTGACAGCACCGGCACTGGTGGACATAGAGGTAATCAAGAAAGAAGTAGAGGAAGATGAGAAGTTGAGCAAAATGTTGATTTAATTGCAAATAGAGGAAGGGGACAATACGAATACATTCTCAATCCAACAGGGGATGCTTAAGTATAAAGGAAGATTAGTGCTATCTAGACAATCAACACTAATTCCAAACATACTGCATACATATCACGATTTTGTATTAGGAGGTCATTCGGGATTTCTGCGAACGTATAAGAGAATGATGGGAGAATTGTATTGGGAAGGCATGAAGGAGGATGTTAAGAAGTATTGTAAGGAATGCATAGTTTGTCAAAAAAATAAAACATTGGCTCTGTCCCCAGCACGGTTGCTAATGCCTTTAGAGATTCCAAACAGTGTTTGGAGCGATATTTCTATGGATTTCATTGAAGGTTTACCAAGGTCGAGTGGATTTGAAGTGATATTTGTAGTGGTTGATAGATTCAGCAAATACGGTCATTTCCTACCTCTTAAATATCCTTTTACAGCTAAAACAGTGGCTGACTTGTTTGTGAAGGAGATCGTATGA

mRNA sequence

ATGGAGAAATTGGTTGATGAGATGCTAATTTCAGGAGTAATTCGTCCGAGCAATAGTCCTTATTCCAGTCCGGTTTTGTTGGTGAGGAAGAAAGACGGAAGTGAAGACATAGAGAAAACGGCATTCCGGACACATGAGGGACATTATGAATTCATGGTCATGCCATTTGGGTTGACTAATGCTCCATCTACGTTGCAGTCATTGATGAACACGATTTTTCGACCATATTTGCGGAAATTCATATTAGTATTCTTTGATGATATTCTGATATACAATCAGGATTTAGAAGAGCACCTGCAGCATGCGTTGGAAGTTCTAAGGAAGAGGGTGGAGGTAAACCCGGAGAAAATCCGAGCAATCAAAGAATGGCCCATACCGACGAATGTGAGGGAAGTTCGGGGGTTCCTTGGATTGACAGGGTACTATAGGAAATTTGTACAACACTATGGATCTATTGCAGCCCCGTTAACTCAGCTAATGAAGAATGGAGGATTCAGATGGACCGAGGAAACAAATGAAGCCTTTCGGCATTTGCAGGATGCAATGGTGACCCTTCCAGTCTTGGCACAACCAGATTTCAGCTACACTTTTGAACTCGAAACTGATGCTTCGGGCCACACTTTGACTTTGAGAGATAGAGCTAAACCAGTGTACGAGAGGGAGTTGATGGCAGTAGTATTAGCAAAATGGGTAGCCAAATTACTAGGTTACTCCTTCGAAGTGGTGTATAAACCACGTCTGGACAATAAGGCAGCTGACGCTTTGTCAAGAATACCTCATACCATTGAACTATGCAATTTGACAGCACCGGCACTGGTGGACATAGAGGGGATGCTTAAGTATAAAGGAAGATTAGTGCTATCTAGACAATCAACACTAATTCCAAACATACTGCATACATATCACGATTTTGTATTAGGAGGTCATTCGGGATTTCTGCGAACGTATAAGAGAATGATGGGAGAATTGTATTGGGAAGGCATGAAGGAGGATGTTAAGAAGTATTGTAAGGAATGCATAGTTTGTCAAAAAAATAAAACATTGGCTCTGTCCCCAGCACGGTTGCTAATGCCTTTAGAGATTCCAAACAGTGTTTGGAGCGATATTTCTATGGATTTCATTGAAGGTTTACCAAGGTCGAGTGGATTTGAAGTGATATTTGTAGTGGTTGATAGATTCAGCAAATACGGTCATTTCCTACCTCTTAAATATCCTTTTACAGCTAAAACAGTGGCTGACTTGTTTGTGAAGGAGATCGTATGA

Coding sequence (CDS)

ATGGAGAAATTGGTTGATGAGATGCTAATTTCAGGAGTAATTCGTCCGAGCAATAGTCCTTATTCCAGTCCGGTTTTGTTGGTGAGGAAGAAAGACGGAAGTGAAGACATAGAGAAAACGGCATTCCGGACACATGAGGGACATTATGAATTCATGGTCATGCCATTTGGGTTGACTAATGCTCCATCTACGTTGCAGTCATTGATGAACACGATTTTTCGACCATATTTGCGGAAATTCATATTAGTATTCTTTGATGATATTCTGATATACAATCAGGATTTAGAAGAGCACCTGCAGCATGCGTTGGAAGTTCTAAGGAAGAGGGTGGAGGTAAACCCGGAGAAAATCCGAGCAATCAAAGAATGGCCCATACCGACGAATGTGAGGGAAGTTCGGGGGTTCCTTGGATTGACAGGGTACTATAGGAAATTTGTACAACACTATGGATCTATTGCAGCCCCGTTAACTCAGCTAATGAAGAATGGAGGATTCAGATGGACCGAGGAAACAAATGAAGCCTTTCGGCATTTGCAGGATGCAATGGTGACCCTTCCAGTCTTGGCACAACCAGATTTCAGCTACACTTTTGAACTCGAAACTGATGCTTCGGGCCACACTTTGACTTTGAGAGATAGAGCTAAACCAGTGTACGAGAGGGAGTTGATGGCAGTAGTATTAGCAAAATGGGTAGCCAAATTACTAGGTTACTCCTTCGAAGTGGTGTATAAACCACGTCTGGACAATAAGGCAGCTGACGCTTTGTCAAGAATACCTCATACCATTGAACTATGCAATTTGACAGCACCGGCACTGGTGGACATAGAGGGGATGCTTAAGTATAAAGGAAGATTAGTGCTATCTAGACAATCAACACTAATTCCAAACATACTGCATACATATCACGATTTTGTATTAGGAGGTCATTCGGGATTTCTGCGAACGTATAAGAGAATGATGGGAGAATTGTATTGGGAAGGCATGAAGGAGGATGTTAAGAAGTATTGTAAGGAATGCATAGTTTGTCAAAAAAATAAAACATTGGCTCTGTCCCCAGCACGGTTGCTAATGCCTTTAGAGATTCCAAACAGTGTTTGGAGCGATATTTCTATGGATTTCATTGAAGGTTTACCAAGGTCGAGTGGATTTGAAGTGATATTTGTAGTGGTTGATAGATTCAGCAAATACGGTCATTTCCTACCTCTTAAATATCCTTTTACAGCTAAAACAGTGGCTGACTTGTTTGTGAAGGAGATCGTATGA
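
The BLAST query lines in this record show the protein translation of this coding sequence. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the helper below translates a CDS in frame 1 and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 27 bases of the CDS above are used as input.

```python
# Standard genetic code, built from the conventional T, C, A, G base ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1 up to the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as 'X'.
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First nine codons of the CDS shown above.
print(translate("ATGGAGAAATTGGTTGATGAGATGCTA"))  # MEKLVDEML
```

The result, MEKLVDEML, matches query residues 1 to 9 in the alignments that start at position 1.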
BLAST of CSPI01G33130 vs. Swiss-Prot
Match: POL3_DROME (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 157.5 bits (397), Expect = 3.1e-37
Identity = 101/262 (38.55%), Positives = 134/262 (51.15%), Query Frame = 1

Query: 35  EDIEKTAFRTHEGHYEFMVMPFGLTNAPSTLQSLMNTIFRPYLRKFILVFFDDILIYNQD 94
           E + KTAF T  GHYE++ MPFGL NAP+T Q  MN I RP L K  LV+ DDI++++  
Sbjct: 315 ESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTS 374

Query: 95  LEEHLQHALEVLRKRVEV------------------------------NPEKIRAIKEWP 154
           L+EHLQ    V  K  +                               NPEKI AI+++P
Sbjct: 375 LDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYP 434

Query: 155 IPTNVREVRGFLGLTGYYRKFVQHYGSIAAPLTQ-LMKNGGFRWTE-ETNEAFRHLQDAM 214
           IPT  +E++ FLGLTGYYRKF+ ++  IA P+T+ L KN     T  E + AF+ L+  +
Sbjct: 435 IPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLI 494

Query: 215 VTLPVLAQPDFSYTFELETDAS-----------GHTLTLRDRA-------KPVYERELMA 242
              P+L  PDF+  F L TDAS           GH L+   R            E+EL+A
Sbjct: 495 SEDPILKVPDFTKKFTLTTDASDVALGAVLSQDGHPLSYISRTLNEHEINYSTIEKELLA 554

BLAST of CSPI01G33130 vs. Swiss-Prot
Match: POL2_DROME (Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 152.9 bits (385), Expect = 7.7e-36
Identity = 102/260 (39.23%), Positives = 130/260 (50.00%), Query Frame = 1

Query: 35  EDIEKTAFRTHEGHYEFMVMPFGLTNAPSTLQSLMNTIFRPYLRKFILVFFDDILIYNQD 94
           E I KTAF T  GHYE++ MPFGL NAP+T Q  MN I RP L K  LV+ DDI+I++  
Sbjct: 314 ESISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTS 373

Query: 95  LEEHLQHA------------------LEVLRKR------------VEVNPEKIRAIKEWP 154
           L EHL                      E L+K             ++ NP K++AI  +P
Sbjct: 374 LTEHLNSIQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYP 433

Query: 155 IPTNVREVRGFLGLTGYYRKFVQHYGSIAAPLTQLMKNGGFRWTE--ETNEAFRHLQDAM 214
           IPT  +E+R FLGLTGYYRKF+ +Y  IA P+T  +K      T+  E  EAF  L+  +
Sbjct: 434 IPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALI 493

Query: 215 VTLPVLAQPDFSYTFELETDAS-----------GHTL-----TLRDRA--KPVYERELMA 240
           +  P+L  PDF   F L TDAS           GH +     TL D        E+EL+A
Sbjct: 494 IRDPILQLPDFEKKFVLTTDASNLALGAVLSQNGHPISFISRTLNDHELNYSAIEKELLA 553

BLAST of CSPI01G33130 vs. Swiss-Prot
Match: POL5_DROME (Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 141.7 bits (356), Expect = 1.8e-32
Identity = 104/363 (28.65%), Positives = 159/363 (43.80%), Query Frame = 1

Query: 36  DIEKTAFRTHEGHYEFMVMPFGLTNAPSTLQSLMNTIFRPYLRKFILVFFDDILIYNQDL 95
           DI KTAF T  G YEF+ +PFGL NAP+  Q +++ I R ++ K   V+ DDI+++++D 
Sbjct: 232 DIPKTAFSTLNGKYEFLRLPFGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDY 291

Query: 96  EEH---------------LQHALE---------------VLRKRVEVNPEKIRAIKEWPI 155
           + H               LQ  LE               V    ++ +P+K+RAI E P 
Sbjct: 292 DTHWKNLRLVLASLSKANLQVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPP 351

Query: 156 PTNVREVRGFLGLTGYYRKFVQHYGSIAAPLTQLMK------------NGGFRWTEETNE 215
           PT+V+E++ FLG+T YYRKF+Q Y  +A PLT L +                   E   +
Sbjct: 352 PTSVKELKRFLGMTSYYRKFIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQ 411

Query: 216 AFRHLQDAMVTLPVLAQPDFSYTFELETDAS----GHTLTLRD--RAKPV---------- 275
           +F  L+  + +  +LA P F+  F L TDAS    G  L+  D  R +P+          
Sbjct: 412 SFNDLKSILCSSEILAFPCFTKPFHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKT 471

Query: 276 ------YERELMAVV-------------------------------------LAKWVAKL 298
                  E+E++A++                                     L +W A++
Sbjct: 472 EENYATIEKEMLAIIWSLDNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARI 531

BLAST of CSPI01G33130 vs. Swiss-Prot
Match: POLY_DROME (Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 117.1 bits (292), Expect = 4.7e-25
Identity = 78/254 (30.71%), Positives = 122/254 (48.03%), Query Frame = 1

Query: 36  DIEKTAFRTHEGHYEFMVMPFGLTNAPSTLQSLMNTIFRPYLRKFILVFFDDILIYNQDL 95
           D EKT+F  + G YEF  +PFGL NA S  Q  ++ + R  + K   V+ DD++I++++ 
Sbjct: 291 DREKTSFSVNGGKYEFCRLPFGLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENE 350

Query: 96  EEHLQHALEVLRKRVEV------------------------------NPEKIRAIKEWPI 155
            +H++H   VL+  ++                               +PEK++AI+E+P 
Sbjct: 351 SDHVRHIDTVLKCLIDANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPE 410

Query: 156 PTNVREVRGFLGLTGYYRKFVQHYGSIAAPLTQLMK--NGG----------FRWTEETNE 215
           P  V +VR FLGL  YYR F++ + +IA P+T ++K  NG             + E    
Sbjct: 411 PDCVYKVRSFLGLASYYRVFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRN 470

Query: 216 AFRHLQDAMVTLPV-LAQPDFSYTFELETD--ASGHTLTLRDRAKPV------------- 229
           AF+ L++ + +  V L  PDF   F+L TD  ASG    L    +P+             
Sbjct: 471 AFQRLRNILASEDVILKYPDFKKPFDLTTDASASGIGAVLSQEGRPITMISRTLKQPEQN 530

BLAST of CSPI01G33130 vs. Swiss-Prot
Match: TF23_SCHPO (Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-3 PE=1 SV=1)

HSP 1 Score: 112.1 bits (279), Expect = 1.5e-23
Identity = 66/209 (31.58%), Positives = 101/209 (48.33%), Query Frame = 1

Query: 27  LVRKKDGSEDIEKTAFRTHEGHYEFMVMPFGLTNAPSTLQSLMNTIFRPYLRKFILVFFD 86
           L+R + G E   K AFR   G +E++VMP+G++ AP+  Q  +NTI        ++ + D
Sbjct: 509 LIRVRKGDE--HKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMD 568

Query: 87  DILIYNQDLEEHLQHALEVLRKRVEVN------------------------------PEK 146
           DILI+++   EH++H  +VL+K    N                               E 
Sbjct: 569 DILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQEN 628

Query: 147 IRAIKEWPIPTNVREVRGFLGLTGYYRKFVQHYGSIAAPLTQLMKNG-GFRWTEETNEAF 205
           I  + +W  P N +E+R FLG   Y RKF+     +  PL  L+K    ++WT    +A 
Sbjct: 629 IDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAI 688

BLAST of CSPI01G33130 vs. TrEMBL
Match: A0A087FZ16_ARAAL (Uncharacterized protein (Fragment) OS=Arabis alpina GN=AALP_AAs42979U000100 PE=4 SV=1)

HSP 1 Score: 362.1 bits (928), Expect = 9.3e-97
Identity = 223/569 (39.19%), Positives = 295/569 (51.85%), Query Frame = 1

Query: 1    MEKLVDEMLISGVIRPSNSPYSSPVLLVRKK----------------------------- 60
            +EK V  ML +G+I+ S SP+SS VLLV+KK                             
Sbjct: 443  IEKQVASMLAAGIIQASGSPFSSHVLLVKKKDGSWRFCVDYRALNKVTIPDSFPIPMIDQ 502

Query: 61   -----------------DG-------SEDIEKTAFRTHEGHYEFMVMPFGLTNAPSTLQS 120
                              G       SED+ KTAF TH+GHYEF+VMPF LTNAP+T QS
Sbjct: 503  LLEELHGATIFSKLDLKSGYHQILVKSEDVPKTAFHTHDGHYEFLVMPFSLTNAPATFQS 562

Query: 121  LMNTIFRPYLRKFILVFFDDILIYNQDLEEHLQH---ALEVLRKR--------------- 180
            LMN +FR YLRKF+LVFFDDIL+Y++ L EH QH    L +L++                
Sbjct: 563  LMNDVFRGYLRKFVLVFFDDILVYSKSLREHQQHLGLVLALLQQHQLFANQRKCEFGRTK 622

Query: 181  ------------VEVNPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVQHYGSIAAPLT 240
                        V  +PEKI+A+  WP P NV+ +RGFLGLTGYYRKFVQ YG IA PLT
Sbjct: 623  LEYLGHVVSGQGVAADPEKIQAMVSWPEPQNVKALRGFLGLTGYYRKFVQKYGEIARPLT 682

Query: 241  QLMKNGGFRWTEETNEAFRHLQDAMVTLPVLAQPDFSYTFELETDASG------------ 300
             L+K   F+W  E   AF+ L++AM T+PVLA  DF+  F +E+DASG            
Sbjct: 683  ALLKKDQFQWNAEATVAFQKLKEAMSTVPVLALVDFTEQFVVESDASGTGLGAVLMQQQR 742

Query: 301  ------HTLTLRDRAKP------------VYERELMAVVLAKWVAKLLGYSFEVVYKPRL 360
                    LT R R K             + E+  + +   KW+ KLLG+ FE+ YKP L
Sbjct: 743  PLAYFSQALTERQRLKKFVVRTDQKSLKFLLEQREINMEYQKWLTKLLGFDFEIHYKPGL 802

Query: 361  DNKAADALSRIPHTIELCNLTAPALVDIEGMLKYKGR---LVLSRQSTLIPNILHTYHDF 420
            +NKAADALSR    ++LC L+ PA + +E +     +   L   ++  L+    H+    
Sbjct: 803  ENKAADALSRRDMALQLCALSVPAAIQLEHINTEVDKDPVLHKLKEEVLLDAASHSEFSV 862

BLAST of CSPI01G33130 vs. TrEMBL
Match: A0A087HBU4_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA3G264600 PE=4 SV=1)

HSP 1 Score: 336.7 bits (862), Expect = 4.2e-89
Identity = 203/542 (37.45%), Positives = 279/542 (51.48%), Query Frame = 1

Query: 1    MEKLVDEMLISGVIRPSNSPYSSPVLLVRKKDGSEDIEKTAFRTHEGHYEFMVMPFGLTN 60
            +++L+DE+  + V    +       +LV+    + D+ KTAF TH+GHYEF+VMPFGLTN
Sbjct: 574  IDQLLDELNGAAVFSKLDLRSGYHQILVK----AADVPKTAFHTHDGHYEFLVMPFGLTN 633

Query: 61   APSTLQS--------LMNTIFRPYLRKFILVFFDDILIYNQDLEEHLQHALEVL------ 120
            AP+T QS         +      +    ILV+   +  +   L+  LQ  L++L      
Sbjct: 634  APATFQSVMNDVFRKYLRKFVLVFFDD-ILVYSRTLAEHKDHLQTVLQTVLQLLADNQMF 693

Query: 121  ---------------------RKRVEVNPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKF 180
                                 ++ V  +P KI+A+ +WP+P  ++ +RGFLGLTGYYRKF
Sbjct: 694  ANKNKCQFGSAEVEYLGHVITQQGVAADPSKIKAMTDWPVPKTIKALRGFLGLTGYYRKF 753

Query: 181  VQHYGSIAAPLTQLMKNGGFRWTEETNEAFRHLQDAMVTLPVLAQPDFSYTFELETDASG 240
            V+ YG+I  PLT L+K   F W+EE  +AF  L+ AM T+PVLA  DFS  F +E+DASG
Sbjct: 754  VRGYGNIVKPLTSLLKKDKFGWSEEAEQAFEALKPAMSTVPVLALADFSELFVVESDASG 813

Query: 241  ------------------HTLTLRDRAKPVYERELMAVVLA------------------- 300
                                LT   + K VYERELMA+V A                   
Sbjct: 814  IGLGAVLMQQQKPIALFSQALTDIQKLKSVYERELMAIVFAIQKWRHYLLGCKFLVITDQ 873

Query: 301  -----------------KWVAKLLGYSFEVVYKPRLDNKAADALSRI---PHTI------ 360
                             KW+ K+LG+ F++ YKPRL+NKAADALSR+   PH        
Sbjct: 874  KSLKFLLEQREVNLEYQKWLTKILGFDFDIHYKPRLENKAADALSRVEAVPHLFALSVPE 933

Query: 361  ---------------ELCNLTAPALVD---------IEGMLKYKGRLVLSRQSTLIPNIL 420
                           EL  L    + D         + G L   GRLVL ++S ++  IL
Sbjct: 934  ALQLKEIDREVEQNPELGKLKLEVIADPTAHDEFTVVNGRLLRNGRLVLPKESPMVKLIL 993

BLAST of CSPI01G33130 vs. TrEMBL
Match: Q9LP90_ARATH (T32E20.30 OS=Arabidopsis thaliana PE=4 SV=1)

HSP 1 Score: 317.0 bits (811), Expect = 3.4e-83
Identity = 203/557 (36.45%), Positives = 282/557 (50.63%), Query Frame = 1

Query: 1    MEKLVDEMLISGVIRPSNSPYSSPVLLVRKK----------------------------- 60
            ME LV EML +G+IR S SP+SSPVLLV+KK                             
Sbjct: 544  MEGLVSEMLDNGIIRASKSPFSSPVLLVKKKDQSWRFCVDYRALNRATIPNKFPIPMIDQ 603

Query: 61   -----DGS-------------------EDIEKTAFRTHEGHYEFMVMPFGLTNAPSTLQS 120
                  G+                   EDIEKT FRTH+GH+EF+VMPFGL+NAP+T QS
Sbjct: 604  LLDELHGAIIFSKLDLRAGYHQIRMKVEDIEKTTFRTHDGHFEFLVMPFGLSNAPATFQS 663

Query: 121  LMNTIFRPYLRKFILVFFDDILIYNQDLEEHLQHALEVLR------------------KR 180
             MN + RP+LRKF+LVFFDDILIY+++ +EH +H   VL+                  + 
Sbjct: 664  SMNDMLRPFLRKFVLVFFDDILIYSRNEQEHEEHLAMVLKVLEEHQFYANRKKPYHITQG 723

Query: 181  VEVNPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVQHYGSIAAPLTQLMKNGGFRWTE 240
            V  +P K  A+ +W  P +V+E+RGFLGLTGYYR+F++ YG++A PLT+L+K   F W+E
Sbjct: 724  VSTDPTKTVAMTKWVTPQSVKELRGFLGLTGYYRRFLKGYGTLARPLTELLKKDSFVWSE 783

Query: 241  ETNEAFRHLQDAM-----VTLP-------VLAQPDFSYTFELETDA------------SG 300
               EAF  L+ AM     + LP       + ++      +E E  A             G
Sbjct: 784  SAQEAFDALKRAMSTAPVLALPDFGKVHGLTSKEQLKPVYERELMAIVLSIQKWKHYLMG 843

Query: 301  HTLTLRDRAKPV---YERELMAVVLAKWVAKLLGYSFEVVYKPRLDNKAADALSRIPH-- 360
                L    K +    E+  +++   KW+ KLL Y F+++YK  +DNKAAD LSR+    
Sbjct: 844  RRFVLHTDQKSLKFLQEQREVSMDYQKWLTKLLHYEFDILYKLGVDNKAADGLSRMVQPT 903

Query: 361  ----TIELCNLTAPAL---------VDIEGMLKYKGRLVLSRQS-----TLIPNILHTYH 420
                ++ L   T P +         +D    L++  +  LS +      T+    L    
Sbjct: 904  GSFSSMLLMAFTVPTVLQLHDLYEEIDSNAHLQHLVKECLSAKQGTSAYTVKEGRLWKKQ 963

BLAST of CSPI01G33130 vs. TrEMBL
Match: A0A151RRN1_CAJCA (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Cajanus cajan GN=KK1_033279 PE=4 SV=1)

HSP 1 Score: 315.1 bits (806), Expect = 1.3e-82
Identity = 182/445 (40.90%), Positives = 241/445 (54.16%), Query Frame = 1

Query: 35  EDIEKTAFRTHEGHYEFMVMPFGLTNAPSTLQSLMNTIFRPYLRK-------FILVF--- 94
           ED  KTAFRTH+GHYE++VMPFGLTNAP+T Q LMN +F+  LRK        ILV+   
Sbjct: 85  EDRHKTAFRTHQGHYEWLVMPFGLTNAPATFQQLMNRVFQKLLRKCVLVFFDDILVYSPN 144

Query: 95  ----FDDILIYNQDLEEHLQHAL----------------EVLRKRVEVNPEKIRAIKEWP 154
                  +    Q L+ H+ +A                  V  K V ++  K++AI  WP
Sbjct: 145 WSSHLQHLEAVLQLLQSHVLYAKLSKCTFATQQVDYLGHTVSAKGVSMDKAKVQAILNWP 204

Query: 155 IPTNVREVRGFLGLTGYYRKFVQHYGSIAAPLTQLMKNGGFRWTEETNEAFRHLQDAMVT 214
            PTN++++RGFLG+TGYYR+F+++Y ++A PLT L+K   F W++  ++ F+ L++A+ T
Sbjct: 205 EPTNLKQLRGFLGITGYYRRFIKNYAALAEPLTNLLKKDAFHWSDIASKTFQSLREAITT 264

Query: 215 LPVLAQPDFSYTFELETDASGHTLTLRDRAKPVYERELMAVVLAKWVAKLLGYSFEVVYK 274
            PVLA P+F+  F LETDASG  +                                  YK
Sbjct: 265 APVLALPNFNQPFILETDASGTGIG--------------------------------AYK 324

Query: 275 PRLDNKAADALSR----------------IPHTI-------------ELCNLTAPALVDI 334
           P  DN  ADALSR                + H I             EL N   P L   
Sbjct: 325 PGKDNIPADALSRSFYMAWSETQPTFLQELKHDIATDEYWKQQLQDCELGNNQNPHLSSK 384

Query: 335 EGMLKYKGRLVLSRQSTLIPNILHTYHDFVLGGHSGFLRTYKRMMGELYWEGMKEDVKKY 394
           + +L +KGRLV+ +QS LI  IL  YH   +GGHSG  RT  R+  E YW  MKE + ++
Sbjct: 385 DQLLFWKGRLVIPQQSPLISKILEEYHCSPIGGHSGIARTISRVKAEFYWPKMKEQIHRF 444

Query: 395 CKECIVCQKNKTLALSPARLLMPLEIPNSVWSDISMDFIEGLPRSSGFEVIFVVVDRFSK 421
            + C +CQ+ K  A+ PA LL PL IP+ +W DISMDFI GLP S GF VI V+VDR SK
Sbjct: 445 VQHCSICQQAKYAAVQPAGLLQPLPIPSQIWEDISMDFITGLPVSKGFTVILVIVDRLSK 497

BLAST of CSPI01G33130 vs. TrEMBL
Match: A0A087HA04_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA3G181900 PE=4 SV=1)

HSP 1 Score: 304.7 bits (779), Expect = 1.8e-79
Identity = 190/482 (39.42%), Positives = 253/482 (52.49%), Query Frame = 1

Query: 1    MEKLVDEMLISGVIRPSNSPYSSPVLLVRKKDGSEDIEKTAFRTHEGHYEFMVMPFGLTN 60
            +++L+DE+  + V    +       +LV+K    ED+ KTAFRTHEGHYEF+V PFGL+N
Sbjct: 703  IDQLLDELHGAKVFSKLDLKAGYHQILVKK----EDVPKTAFRTHEGHYEFLVRPFGLSN 762

Query: 61   APSTLQSLMNTIFRPYLRKFILVFFDDILIYNQDLEEHLQHALEVL-----------RKR 120
            AP+T QSLMN +F+ YLR+F+LVFFDDIL+Y+Q L EH  H   VL           RK+
Sbjct: 763  APATFQSLMNEVFKKYLRRFVLVFFDDILVYSQTLAEHQDHLRTVLGVLEEHQLYANRKK 822

Query: 121  -------------------VEVNPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVQHYG 180
                               V  +PEKI A+++WP+P NV+ +RGFLGLTGYYRKFVQ YG
Sbjct: 823  CYFGCESVEYLGHVISAEGVSADPEKISAMEKWPVPRNVKALRGFLGLTGYYRKFVQRYG 882

Query: 181  SIAAPLTQLMKNGGFRWTEETNEAFRHLQDAMVTLPVLAQPDFSYTFELETDASG----- 240
             IA  LT L+K   F W  E +EAF  L+ AMVT+PVLA  DF+  F +E+DASG     
Sbjct: 883  EIARTLTALLKKDKFSWGPEADEAFLKLKRAMVTVPVLAMADFTTLFVVESDASGVGLGA 942

Query: 241  -------------HTLTLRDRAKPVYERELMAVVLA------------------------ 300
                           LT R   K +YERELMA+V A                        
Sbjct: 943  VLMQNQRPIAYFSQALTERQMLKSIYERELMAIVFAIQKWRHYLLVRTDQKSLKFLLEQR 1002

Query: 301  -------KWVAKLLGYSFEVVYKPRLDNKAADALSRIPHTIELCNLTAPALVDIEGM--- 360
                   +W+ K+LG+ FE+ YKP L+NKAADALSR     +L  L+ P  + +E +   
Sbjct: 1003 EINVEYQRWLTKILGFDFEIHYKPGLENKAADALSRKEAMPQLFALSVPTAIQLEDIGSE 1062

Query: 361  ------LKYKGRLVLSRQST-----------------LIPN-------ILHTYHDFVLGG 367
                  LK     VL   ST                 ++P        IL  +HD  +GG
Sbjct: 1063 VDKDPQLKKLKEEVLRDPSTHPNYAVVQGRLLRQGKLVLPRTSQLVGVILREFHDGKVGG 1122

BLAST of CSPI01G33130 vs. TAIR10
Match: ATMG00860.1 (ATMG00860.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 104.4 bits (259), Expect = 1.8e-22
Identity = 49/107 (45.79%), Positives = 66/107 (61.68%), Query Frame = 1

Query: 91  YNQDLEEHLQHALEVLRKRVEVNPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVQHYG 150
           + Q    +L H   +  + V  +P K+ A+  WP P N  E+RGFLGLTGYYR+FV++YG
Sbjct: 25  FGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKNTTELRGFLGLTGYYRRFVKNYG 84

Query: 151 SIAAPLTQLMKNGGFRWTEETNEAFRHLQDAMVTLPVLAQPDFSYTF 198
            I  PLT+L+K    +WTE    AF+ L+ A+ TLPVLA PD    F
Sbjct: 85  KIVRPLTELLKKNSLKWTEMAALAFKALKGAVTTLPVLALPDLKLPF 131

BLAST of CSPI01G33130 vs. NCBI nr
Match: gi|659113889|ref|XP_008456800.1| (PREDICTED: uncharacterized protein LOC103496640 [Cucumis melo])

HSP 1 Score: 396.0 bits (1016), Expect = 8.4e-107
Identity = 237/517 (45.84%), Positives = 293/517 (56.67%), Query Frame = 1

Query: 1   MEKLVDEMLISGVIRP---------------SNSPYSSPVLLVRKKDGSEDIEKTAFRTH 60
           ME+LV+EML S VIRP               S     S    +R  D  EDI KT FRTH
Sbjct: 251 MERLVEEMLSSRVIRPMVEELFDELNRATVFSKIDLKSGYHQIRMVD--EDIPKTTFRTH 310

Query: 61  EGHYEFMVMPFGLTNAPSTLQSLMNTIFRPYLRKFILVFFDDILIYNQDLEEHLQHALEV 120
           EGHYEF+VMPFGLTNAP+T Q+LMNTIF+PYLRKF+LVFFDDILIYN+D  +H+ H  +V
Sbjct: 311 EGHYEFLVMPFGLTNAPATFQALMNTIFKPYLRKFVLVFFDDILIYNKDERDHVGHIEKV 370

Query: 121 L------------------------------RKRVEVNPEKIRAIKEWPIPTNVREVRGF 180
                                          R+ VEV+P+KI+AI +WP           
Sbjct: 371 FLTLRRHALYANKKKCSFAQLKIEYLGHVISREGVEVDPDKIKAIADWP----------- 430

Query: 181 LGLTGYYRKFVQHYGSIAAPLTQLMKNGGFRWTEETNEAFRHLQDAMVTLPVLAQPDFSY 240
                             + LTQL+K GGF+W EE  EAF  L+ AM+TLPVLA P F +
Sbjct: 431 ------------------SHLTQLLKKGGFKWNEEAEEAFLKLKTAMLTLPVLALPSFDH 490

Query: 241 TFELETDASG------------------HTLTLRDRAKPVYERELMAVVLAKWVAKLLGY 300
            FE+ETDASG                  HT ++RDRA+PVY                   
Sbjct: 491 PFEIETDASGYGVGAMLVQSKRLIAFYSHTSSMRDRARPVY------------------- 550

Query: 301 SFEVVYKPRLDNKAADALSRIPHTIELCNLTAPALVD----------------IEGMLKY 360
             E+VYKP L+NKA DALSR P  I+L  ++AP LVD                I   L  
Sbjct: 551 --ELVYKPSLENKAIDALSRKPPDIQLSVISAPYLVDLKIIKDEVEKDEKPQKITTALCA 610

Query: 361 KGRLVLSR------------------QSTLIPNILHTYHDFVLGGHSGFLRTYKRMMGEL 420
            G L  S+                   S+LIP++L+ ++D V+GGHSGFLRTYKR+  EL
Sbjct: 611 DGGLQNSKFSLRNGFLHYKNRLVLSKTSSLIPSMLNIFNDSVVGGHSGFLRTYKRVASEL 670

BLAST of CSPI01G33130 vs. NCBI nr
Match: gi|731403730|ref|XP_010655166.1| (PREDICTED: uncharacterized protein LOC104880390 [Vitis vinifera])

HSP 1 Score: 369.8 bits (948), Expect = 6.4e-99
Identity = 203/447 (45.41%), Positives = 269/447 (60.18%), Query Frame = 1

Query: 36  DIEKTAFRTHEGHYEFMVMPFGLTNAPSTLQSLMNTIFRPYLRKFILVFFDDILIYNQDL 95
           DI KTAFRTH GHYE++VMPFGL NAP T Q++MN+IFRP LRK ILVFFDDILIY+   
Sbjct: 556 DIPKTAFRTHNGHYEYLVMPFGLCNAPYTFQAIMNSIFRPSLRKLILVFFDDILIYSPTW 615

Query: 96  EEHLQH---ALEVLRK-------------RVEV--------------NPEKIRAIKEWPI 155
           E+HL+H    L VLR+             + E+              + +KI A+  WP 
Sbjct: 616 EQHLEHVQLTLAVLRQHQFYVKMSKCAFGKQELEYLGHIITHRGVKVDEKKIEAMVAWPR 675

Query: 156 PTNVREVRGFLGLTGYYRKFVQHYGSIAAPLTQLMKNGGFRWTEETNEAFRHLQDAMVTL 215
           P+N+ E+ GFLGLTGYYRKFVQ YG IA PLT L+K G F+W +E   AF  L+ AM + 
Sbjct: 676 PSNITELLGFLGLTGYYRKFVQGYGLIARPLTNLLKKGKFQWNDEAEAAFLALKQAMTST 735

Query: 216 PVLAQPDFSYTFELETDASGHTLTLRDRAKPVYERELMAVVLAKWVAKLLGYSFEVVYKP 275
           P LA P+F+  F +ETDASG+ +   D+     E++       KWVAKLLGY +E++++P
Sbjct: 736 PTLAMPNFTEPFTIETDASGNRI---DQRVATPEQQ-------KWVAKLLGYDYEIIFRP 795

Query: 276 RLDNKAADALSRIPHTIELCNLTAPALVDI------------------------------ 335
             +N AADALSR   +  L  L   + VDI                              
Sbjct: 796 GRENSAADALSRRQESPLLAALHF-SEVDIWKQIREASKSDSYVQLLGKKAGDPPHGNLT 855

Query: 336 --EGMLKYKGRLVLSRQSTLIPNILHTYHDFVLGGHSGFLRTYKRMMGELYWEGMKEDVK 395
             +G+L YKG++V+    +L   +L+  HD  +GGHSG LRTY+R+  + YW  M + V+
Sbjct: 856 WRDGLLFYKGKVVVPADHSLRAKLLYEVHDSKVGGHSGILRTYRRLQQQFYWPKMHKAVQ 915

Query: 396 KYCKECIVCQKNKTLALSPARLLMPLEIPNSVWSDISMDFIEGLPRSSGFEVIFVVVDRF 421
           KY ++C VCQ+ K    +PA LL PL IP  VW DI++DFIEGLP S G + I VVVDR 
Sbjct: 916 KYVQKCEVCQRIKPETKAPAGLLQPLPIPAQVWEDITLDFIEGLPTSHGKDTILVVVDRL 975

BLAST of CSPI01G33130 vs. NCBI nr
Match: gi|727485485|ref|XP_010418661.1| (PREDICTED: uncharacterized protein LOC104704240 [Camelina sativa])

HSP 1 Score: 363.6 bits (932), Expect = 4.6e-97
Identity = 199/456 (43.64%), Positives = 269/456 (58.99%), Query Frame = 1

Query: 34  SEDIEKTAFRTHEGHYEFMVMPFGLTNAPSTLQSLMNTIFRPYLRKFILVFFDDILIYNQ 93
           +ED+  TAFR+H+GHYEF+VM FGLTNAP+T QSLMN IFRP+LR+ +LVFFDDILIY++
Sbjct: 31  AEDVPNTAFRSHDGHYEFLVMSFGLTNAPTTFQSLMNEIFRPFLRRCVLVFFDDILIYSR 90

Query: 94  DLEEHL-----------QHALEVLRKR-------------------VEVNPEKIRAIKEW 153
            +EEH            QH L   RK+                   V  +P KI+A+ EW
Sbjct: 91  TMEEHQNHLREVLKLLRQHQLYANRKKCQFGTTRIAYLGHVISSEGVAADPSKIQAMLEW 150

Query: 154 PIPTNVREVRGFLGLTGYYRKFVQHYGSIAAPLTQLMKNGGFRWTEETNEAFRHLQDAMV 213
             PTN++ +RGFLGLTGYYR+FV  YG IA PLT +++   F W+E++  AF  L+ AM 
Sbjct: 151 EPPTNIKTLRGFLGLTGYYRRFVLGYGEIAKPLTDMLRKDQFEWSEKSEAAFDKLRTAMT 210

Query: 214 TLPVLAQPDFSYTFELETDASG------------------HTLTLRDRAKPVYERELMAV 273
           T+PVLA P+FS TF +E+DASG                    L+ R R K +YERELMA+
Sbjct: 211 TVPVLALPNFSETFVVESDASGFGLGAVLMQKERPIAYYNQALSDRQRLKSIYERELMAI 270

Query: 274 VLA--KWVAKLLG--------------YSFEVVYKPRLDNKAAD-----ALSRIPHTIEL 333
           VL+  KW   +LG              ++  V    +L+   A+     AL  I   +  
Sbjct: 271 VLSVQKWCHYILGRRIGEQGGRCSLSLFAISVPAAIQLEEICAEVDKDPALQVIIRDLRK 330

Query: 334 CNLTAPALVDIEGMLKYKGRLVLSRQSTLIPNILHTYHDFVLGGHSGFLRTYKRMMGELY 393
              + P    + G L  +G+LV+   S L   I+  +HD  LGGH G L+T KR+    +
Sbjct: 331 DGSSHPEFSLVNGRLLRQGKLVIPSGSALTGLIMREFHDGKLGGHGGVLKTQKRIGELFF 390

Query: 394 WEGMKEDVKKYCKECIVCQKNKTLALSPARLLMPLEIPNSVWSDISMDFIEGLPRSSGFE 421
           WEGM  D++++   C+VCQ+ K   L+P  LL PL +P  +W DISMDF+EGLP+S G  
Sbjct: 391 WEGMMTDIRRHVAACLVCQRYKYSTLAPGGLLQPLPVPEKIWKDISMDFVEGLPKSGGNN 450

BLAST of CSPI01G33130 vs. NCBI nr
Match: gi|674229083|gb|KFK22868.1| (hypothetical protein AALP_AAs42979U000100, partial [Arabis alpina])

HSP 1 Score: 362.1 bits (928), Expect = 1.3e-96
Identity = 223/569 (39.19%), Positives = 295/569 (51.85%), Query Frame = 1

Query: 1    MEKLVDEMLISGVIRPSNSPYSSPVLLVRKK----------------------------- 60
            +EK V  ML +G+I+ S SP+SS VLLV+KK                             
Sbjct: 443  IEKQVASMLAAGIIQASGSPFSSHVLLVKKKDGSWRFCVDYRALNKVTIPDSFPIPMIDQ 502

Query: 61   -----------------DG-------SEDIEKTAFRTHEGHYEFMVMPFGLTNAPSTLQS 120
                              G       SED+ KTAF TH+GHYEF+VMPF LTNAP+T QS
Sbjct: 503  LLEELHGATIFSKLDLKSGYHQILVKSEDVPKTAFHTHDGHYEFLVMPFSLTNAPATFQS 562

Query: 121  LMNTIFRPYLRKFILVFFDDILIYNQDLEEHLQH---ALEVLRKR--------------- 180
            LMN +FR YLRKF+LVFFDDIL+Y++ L EH QH    L +L++                
Sbjct: 563  LMNDVFRGYLRKFVLVFFDDILVYSKSLREHQQHLGLVLALLQQHQLFANQRKCEFGRTK 622

Query: 181  ------------VEVNPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVQHYGSIAAPLT 240
                        V  +PEKI+A+  WP P NV+ +RGFLGLTGYYRKFVQ YG IA PLT
Sbjct: 623  LEYLGHVVSGQGVAADPEKIQAMVSWPEPQNVKALRGFLGLTGYYRKFVQKYGEIARPLT 682

Query: 241  QLMKNGGFRWTEETNEAFRHLQDAMVTLPVLAQPDFSYTFELETDASG------------ 300
             L+K   F+W  E   AF+ L++AM T+PVLA  DF+  F +E+DASG            
Sbjct: 683  ALLKKDQFQWNAEATVAFQKLKEAMSTVPVLALVDFTEQFVVESDASGTGLGAVLMQQQR 742

Query: 301  ------HTLTLRDRAKP------------VYERELMAVVLAKWVAKLLGYSFEVVYKPRL 360
                    LT R R K             + E+  + +   KW+ KLLG+ FE+ YKP L
Sbjct: 743  PLAYFSQALTERQRLKKFVVRTDQKSLKFLLEQREINMEYQKWLTKLLGFDFEIHYKPGL 802

Query: 361  DNKAADALSRIPHTIELCNLTAPALVDIEGMLKYKGR---LVLSRQSTLIPNILHTYHDF 420
            +NKAADALSR    ++LC L+ PA + +E +     +   L   ++  L+    H+    
Sbjct: 803  ENKAADALSRRDMALQLCALSVPAAIQLEHINTEVDKDPVLHKLKEEVLLDAASHSEFSV 862

BLAST of CSPI01G33130 vs. NCBI nr
Match: gi|659098000|ref|XP_008449923.1| (PREDICTED: uncharacterized protein LOC103491653 [Cucumis melo])

HSP 1 Score: 355.9 bits (912), Expect = 9.6e-95
Identity = 198/410 (48.29%), Positives = 248/410 (60.49%), Query Frame = 1

Query: 45  HEGHYEFMVMPFGLTNAPSTLQSLMNTIFRPYLRKFILVFFDDILIYNQDLEEHLQH--- 104
           HEGHYEF+VMPFGLTNAP+T Q LMNTIF+PYLRKFILVFFDDIL+Y+ + +EH+ H   
Sbjct: 2   HEGHYEFLVMPFGLTNAPTTFQPLMNTIFKPYLRKFILVFFDDILVYSNNEKEHVSHMEK 61

Query: 105 ALEVLRKRVEVNPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVQ-----HYGSIAAPL 164
            L  LR       +K  +  +  I      + G  G+     K        HYGSIAAPL
Sbjct: 62  VLSTLRDHALYANKKKYSFAQLKIEYLGHVISGE-GVEVDLEKIKSIADKLHYGSIAAPL 121

Query: 165 TQLMKNGGFRWTEETNEAFRHLQDAMVTLPVLAQPDFSYTFELETDASG----------- 224
           TQL K GGF+W EE  E F+ L+ AM++LPV A P+F++  E+E DASG           
Sbjct: 122 TQLHKKGGFKWIEEVKEEFKKLKKAMLSLPVFAIPNFNHPVEIEIDASGFGVGVVLTQLE 181

Query: 225 -------HTLTLRDRAKPVYERELMAVVLAKWVAKLLGYSFEVVYKPRLDNKAADALSRI 284
                  HTL +RDR K VYERELM                       L+NKAAD LSR 
Sbjct: 182 RPIAFYSHTLAIRDRPKLVYERELMVC---------------------LENKAADTLSRK 241

Query: 285 PHTIELCNLTAPALVDIEG----------------------------------MLKYKGR 344
           P  ++LC ++AP L+D++                                   ML YK R
Sbjct: 242 PPKVQLCRISAPILIDLKTIKEEVEKDEKLHMLVTEWKKDSKQKNNNFFMKNEMLHYKDR 301

Query: 345 LVLSRQSTLIPNILHTYHDFVLGGHSGFLRTYKRMMGELYWEGMKEDVKKYCKECIVCQK 395
           LVLS+ S+LI  +L+T+HD V+G HS FLRT KR+  ELYW+GMK DVK++C+EC++CQ+
Sbjct: 302 LVLSKTSSLIRAMLNTFHDSVVGRHSSFLRTSKRVADELYWQGMKADVKRHCEECVICQR 361

The following BLAST results are available for this feature:
Swiss-Prot:
Match Name   E-value   Identity (%)   Description
POL3_DROME   3.1e-37   38.55          Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogast...
POL2_DROME   7.7e-36   39.23          Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaste...
POL5_DROME   1.8e-32   28.65          Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogast...
POLY_DROME   4.7e-25   30.71          Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogas...
TF23_SCHPO   1.5e-23   31.58          Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...

TrEMBL:
Match Name         E-value   Identity (%)   Description
A0A087FZ16_ARAAL   9.3e-97   39.19          Uncharacterized protein (Fragment) OS=Arabis alpina GN=AALP_AAs42979U000100 PE=4...
A0A087HBU4_ARAAL   4.2e-89   37.45          Uncharacterized protein OS=Arabis alpina GN=AALP_AA3G264600 PE=4 SV=1
Q9LP90_ARATH       3.4e-83   36.45          T32E20.30 OS=Arabidopsis thaliana PE=4 SV=1
A0A151RRN1_CAJCA   1.3e-82   40.90          Retrovirus-related Pol polyprotein from transposon 17.6 OS=Cajanus cajan GN=KK1_...
A0A087HA04_ARAAL   1.8e-79   39.42          Uncharacterized protein OS=Arabis alpina GN=AALP_AA3G181900 PE=4 SV=1

TAIR10:
Match Name    E-value   Identity (%)   Description
ATMG00860.1   1.8e-22   45.79          ATMG00860.1 DNA/RNA polymerases superfamily protein

NCBI nr:
Match Name                         E-value    Identity (%)   Description
gi|659113889|ref|XP_008456800.1|   8.4e-107   45.84          PREDICTED: uncharacterized protein LOC103496640 [Cucumis melo]
gi|731403730|ref|XP_010655166.1|   6.4e-99    45.41          PREDICTED: uncharacterized protein LOC104880390 [Vitis vinifera]
gi|727485485|ref|XP_010418661.1|   4.6e-97    43.64          PREDICTED: uncharacterized protein LOC104704240 [Camelina sativa]
gi|674229083|gb|KFK22868.1|        1.3e-96    39.19          hypothetical protein AALP_AAs42979U000100, partial [Arabis alpina]
gi|659098000|ref|XP_008449923.1|   9.6e-95    48.29          PREDICTED: uncharacterized protein LOC103491653 [Cucumis melo]
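
The Identity column in these summaries is the percentage form of the "Identity = matches/alignment length" figure in each HSP header. As a minimal sketch, the hypothetical helper `parse_hsp_stats` below recomputes both percentages from one header line; the line format is taken from this page, and the regular expression is an assumption about how consistently that format is used.

```python
import re

def parse_hsp_stats(line):
    """Return (matches, alignment_length, percent) for identity and positives."""
    stats = {}
    # 'Pos\w*' tolerates spelling variants of the positives label.
    for key, num, den in re.findall(r"(Identity|Pos\w*)\s*=\s*(\d+)/(\d+)", line):
        name = "identity" if key == "Identity" else "positives"
        stats[name] = (int(num), int(den), round(100 * int(num) / int(den), 2))
    return stats

# Header of the first Swiss-Prot HSP (POL3_DROME).
line = "Identity = 101/262 (38.55%), Positives = 134/262 (51.15%), Query Frame = 1"
stats = parse_hsp_stats(line)
print(stats["identity"][2], stats["positives"][2])  # 38.55 51.15
```

The recomputed 38.55 agrees with the value listed for POL3_DROME in the Swiss-Prot summary.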
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR000477   RT_dom
IPR012337   RNaseH-like_sf
Vocabulary: Molecular Function
Term         Definition
GO:0003676   nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003676       nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CSPI01G33130.1   CSPI01G33130.1   mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term    IPR Description                Source    Source Term         Source Description    Alignment
IPR000477   Reverse transcriptase domain   PFAM      PF00078             RVT_1                 coord: 44..120; score: 5.2
IPR000477   Reverse transcriptase domain   PROFILE   PS50878             RT_POL                coord: 1..140; score: 8
IPR012337   Ribonuclease H-like domain     GENE3D    G3DSA:3.30.420.10   -                     coord: 362..417; score: 5.
IPR012337   Ribonuclease H-like domain     unknown   SSF53098            Ribonuclease H-like   coord: 359..419; score: 9.8
None        No IPR available               GENE3D    G3DSA:3.10.10.10    -                     coord: 2..33; score: 4.
None        No IPR available               GENE3D    G3DSA:3.30.70.270   -                     coord: 40..109; score: 1.
None        No IPR available               PANTHER   PTHR24559           FAMILY NOT NAMED      coord: 21..419; score: 7.9E
None        No IPR available               PANTHER   PTHR24559:SF186     SUBFAMILY NOT NAMED   coord: 21..419; score: 7.9E
None        No IPR available               unknown   SSF56672            DNA/RNA polymerases   coord: 2..244; score: 7.0

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene           Organism                     Block
CSPI01G33130   Melon (DHL92) v3.6.1         cpimedB038
CSPI01G33130   Cucumber (Gy14) v2           cgybcpiB002
CSPI01G33130   Cucumber (Chinese Long) v3   cpicucB000
CSPI01G33130   Cucumber (Gy14) v1           cgycpiB584
CSPI01G33130   Cucumber (Chinese Long) v2   cpicuB001
CSPI01G33130   Melon (DHL92) v3.5.1         cpimeB044
CSPI01G33130   Watermelon (97103) v1        cpiwmB020
CSPI01G33130   Bottle gourd (USVL1VR-Ls)    cpilsiB059