CSPI03G25080 (gene) Wild cucumber (PI 183967)

NameCSPI03G25080
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
LocationChr3 : 22290261 .. 22291553 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACAGAAAACAGAAACAAGGTGCAAATGATAGAAGATATCATGAAAGGGTCTGATAGGAGTGGGAAGCTTTTGATGTCGGTGAAGCAAACTCAAAATCGTTTGTACAAGATAACTTTGAAGACACTCAAGCAAGTCTGCCTTCTGACAAGCCTAGAAGATCCAACATGGTTATGGCACGTGAGACTTGGCCATGTAAATTTTCATGACTTGAAGCTCATGGGGGAGAAGAAATTGGTAGTTGGAGTACCACTAGTGACTCAACCGAACAAGTTATGTGAAGCGTGCGTGATTACCAAACAAGCCAGATTGCCCTTCCCCCGTCAATCAACATATAGAGCAGAGAAGCCATTAGAACTCCTCCATGCTGATATATGCGGACCGATTTCACCACGTACTCTTGCTGGAAACAAGTATTTTCTGTTGATCGTTGACGATTCCACGAGATGGATGTGGTTGTATATGTTGGAGGCAAAAAGTGATGGATTTGAAGCATTCAATAAATTCAAACTCTTAATGGAGAACAAAACGGAGTACAAGATCAGAACGCTCCGGATGGATCGAGGTGGTGAGTTCTTATCTGCAGAGTTCACTCAATTTTGCAAAAAAGAAGGAATCGAACGACCCCTCACCGCTCCATATTCACCACAACAAAATGGCATTATAGAGCGTCGTAACCGCACCGTAATGGCGAGGACGAGATCACTCCTCAAAAGCATGCATGTGCCTGCAAAATTTTGGGGAGAGGCATTGAGACACACGGTTTATTTGTTAAATTGTCTTCCAACGAAGGCCCTTGGAGAACGCACACCATTTGAAGCTTGGATGGGGAGAAAGCCACATCTTGCACACTTGAGAGTCTTTGGTTGTGTGGCATATGTAAAGAACACAACCCCTCACTTCAAGAAACTCGATGATCGAAGCTCACCAATGGTATATTTTGGTGTCGAAGAAGGATGCAAAGCCCATCGCTTATATGACCCAGGCCGTGGAAAACTACAAATCAGTAGAGATGTTCTTTTTCAAGAGAATCTTGAATGGGCTTGGAATGAAGTTGTCAGTGACGGTAAGGAGATTACAGAGTTTCAGGTGATGGACCAATTTTATTCTGACGAGTTCGAAAACTTGGAGGATGCAGAAACTTGGGTTGAAAATGCCTTCCCACATGCAACTGAGATACCTGCGATTGGAGAGACCAGTTCATCTCCTCCATCGACGAACACACCGGTTCGTCTAAGATCTCTCAGTGACATCTACGCCAACACAGAGGAAGTTGTAGGTGGTGATGAATAA

mRNA sequence

ATGACAGAAAACAGAAACAAGGTGCAAATGATAGAAGATATCATGAAAGGGTCTGATAGGAGTGGGAAGCTTTTGATGTCGGTGAAGCAAACTCAAAATCGTTTGTACAAGATAACTTTGAAGACACTCAAGCAAGTCTGCCTTCTGACAAGCCTAGAAGATCCAACATGGTTATGGCACGTGAGACTTGGCCATGTAAATTTTCATGACTTGAAGCTCATGGGGGAGAAGAAATTGGTAGTTGGAGTACCACTAGTGACTCAACCGAACAAGTTATGTGAAGCGTGCGTGATTACCAAACAAGCCAGATTGCCCTTCCCCCGTCAATCAACATATAGAGCAGAGAAGCCATTAGAACTCCTCCATGCTGATATATGCGGACCGATTTCACCACGTACTCTTGCTGGAAACAAGTATTTTCTGTTGATCGTTGACGATTCCACGAGATGGATGTGGTTGTATATGTTGGAGGCAAAAAGTGATGGATTTGAAGCATTCAATAAATTCAAACTCTTAATGGAGAACAAAACGGAGTACAAGATCAGAACGCTCCGGATGGATCGAGGTGGTGAGTTCTTATCTGCAGAGTTCACTCAATTTTGCAAAAAAGAAGGAATCGAACGACCCCTCACCGCTCCATATTCACCACAACAAAATGGCATTATAGAGCGTCGTAACCGCACCGTAATGGCGAGGACGAGATCACTCCTCAAAAGCATGCATGTGCCTGCAAAATTTTGGGGAGAGGCATTGAGACACACGGTTTATTTGTTAAATTGTCTTCCAACGAAGGCCCTTGGAGAACGCACACCATTTGAAGCTTGGATGGGGAGAAAGCCACATCTTGCACACTTGAGAGTCTTTGGTTGTGTGGCATATGTAAAGAACACAACCCCTCACTTCAAGAAACTCGATGATCGAAGCTCACCAATGGTATATTTTGGTGTCGAAGAAGGATGCAAAGCCCATCGCTTATATGACCCAGGCCGTGGAAAACTACAAATCAGTAGAGATGTTCTTTTTCAAGAGAATCTTGAATGGGCTTGGAATGAAGTTGTCAGTGACGGTAAGGAGATTACAGAGTTTCAGGTGATGGACCAATTTTATTCTGACGAGTTCGAAAACTTGGAGGATGCAGAAACTTGGGTTGAAAATGCCTTCCCACATGCAACTGAGATACCTGCGATTGGAGAGACCAGTTCATCTCCTCCATCGACGAACACACCGGTTCGTCTAAGATCTCTCAGTGACATCTACGCCAACACAGAGGAAGTTGTAGGTGGTGATGAATAA

Coding sequence (CDS)

ATGACAGAAAACAGAAACAAGGTGCAAATGATAGAAGATATCATGAAAGGGTCTGATAGGAGTGGGAAGCTTTTGATGTCGGTGAAGCAAACTCAAAATCGTTTGTACAAGATAACTTTGAAGACACTCAAGCAAGTCTGCCTTCTGACAAGCCTAGAAGATCCAACATGGTTATGGCACGTGAGACTTGGCCATGTAAATTTTCATGACTTGAAGCTCATGGGGGAGAAGAAATTGGTAGTTGGAGTACCACTAGTGACTCAACCGAACAAGTTATGTGAAGCGTGCGTGATTACCAAACAAGCCAGATTGCCCTTCCCCCGTCAATCAACATATAGAGCAGAGAAGCCATTAGAACTCCTCCATGCTGATATATGCGGACCGATTTCACCACGTACTCTTGCTGGAAACAAGTATTTTCTGTTGATCGTTGACGATTCCACGAGATGGATGTGGTTGTATATGTTGGAGGCAAAAAGTGATGGATTTGAAGCATTCAATAAATTCAAACTCTTAATGGAGAACAAAACGGAGTACAAGATCAGAACGCTCCGGATGGATCGAGGTGGTGAGTTCTTATCTGCAGAGTTCACTCAATTTTGCAAAAAAGAAGGAATCGAACGACCCCTCACCGCTCCATATTCACCACAACAAAATGGCATTATAGAGCGTCGTAACCGCACCGTAATGGCGAGGACGAGATCACTCCTCAAAAGCATGCATGTGCCTGCAAAATTTTGGGGAGAGGCATTGAGACACACGGTTTATTTGTTAAATTGTCTTCCAACGAAGGCCCTTGGAGAACGCACACCATTTGAAGCTTGGATGGGGAGAAAGCCACATCTTGCACACTTGAGAGTCTTTGGTTGTGTGGCATATGTAAAGAACACAACCCCTCACTTCAAGAAACTCGATGATCGAAGCTCACCAATGGTATATTTTGGTGTCGAAGAAGGATGCAAAGCCCATCGCTTATATGACCCAGGCCGTGGAAAACTACAAATCAGTAGAGATGTTCTTTTTCAAGAGAATCTTGAATGGGCTTGGAATGAAGTTGTCAGTGACGGTAAGGAGATTACAGAGTTTCAGGTGATGGACCAATTTTATTCTGACGAGTTCGAAAACTTGGAGGATGCAGAAACTTGGGTTGAAAATGCCTTCCCACATGCAACTGAGATACCTGCGATTGGAGAGACCAGTTCATCTCCTCCATCGACGAACACACCGGTTCGTCTAAGATCTCTCAGTGACATCTACGCCAACACAGAGGAAGTTGTAGGTGGTGATGAATAA
BLAST of CSPI03G25080 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 235.3 bits (599), Expect = 1.2e-60
Identity = 114/323 (35.29%), Postives = 186/323 (57.59%), Query Frame = 1

Query: 22  GKLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVV 81
           G L+++    +  LY+   +  +        E    LWH R+GH++   L+++ +K L+ 
Sbjct: 388 GSLVIAKGVARGTLYRTNAEICQGELNAAQDEISVDLWHKRMGHMSEKGLQILAKKSLIS 447

Query: 82  GVPLVTQPNKLCEACVITKQARLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFL 141
                T   K C+ C+  KQ R+ F + S+ R    L+L+++D+CGP+   ++ GNKYF+
Sbjct: 448 YAKGTTV--KPCDYCLFGKQHRVSF-QTSSERKLNILDLVYSDVCGPMEIESMGGNKYFV 507

Query: 142 LIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQFC 201
             +DD++R +W+Y+L+ K   F+ F KF  L+E +T  K++ LR D GGE+ S EF ++C
Sbjct: 508 TFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYC 567

Query: 202 KKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCL 261
              GI    T P +PQ NG+ ER NRT++ + RS+L+   +P  FWGEA++   YL+N  
Sbjct: 568 SSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRS 627

Query: 262 PTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPHFKKLDDRSSPMVYFGVEEGCK 321
           P+  L    P   W  ++   +HL+VFGC A+         KLDD+S P ++ G  +   
Sbjct: 628 PSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEF 687

Query: 322 AHRLYDPGRGKLQISRDVLFQEN 345
            +RL+DP + K+  SRDV+F+E+
Sbjct: 688 GYRLWDPVKKKVIRSRDVVFRES 707

BLAST of CSPI03G25080 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 206.5 bits (524), Expect = 6.0e-52
Identity = 118/345 (34.20%), Postives = 180/345 (52.17%), Query Frame = 1

Query: 58  LWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPN---KLCEACVITKQARLPFPR-QSTYR 117
           LWH R GH++   L  +  K +     L+       ++CE C+  KQARLPF + +    
Sbjct: 417 LWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEICEPCLNGKQARLPFKQLKDKTH 476

Query: 118 AEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLM 177
            ++PL ++H+D+CGPI+P TL    YF++ VD  T +   Y+++ KSD F  F  F    
Sbjct: 477 IKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKS 536

Query: 178 ENKTEYKIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMART 237
           E     K+  L +D G E+LS E  QFC K+GI   LT P++PQ NG+ ER  RT+  + 
Sbjct: 537 EAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKA 596

Query: 238 RSLLKSMHVPAKFWGEALRHTVYLLNCLPTKAL--GERTPFEAWMGRKPHLAHLRVFGCV 297
           R+++    +   FWGEA+    YL+N +P++AL    +TP+E W  +KP+L HLRVFG  
Sbjct: 597 RTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGAT 656

Query: 298 AYVKNTTPHFK----KLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEW 357
            YV     H K    K DD+S   ++ G E      +L+D    K  ++RDV+  E    
Sbjct: 657 VYV-----HIKNKQGKFDDKSFKSIFVGYEP--NGFKLWDAVNEKFIVARDVVVDET--- 716

Query: 358 AWNEVVSDGKEITEFQVMDQFYSDEFENLEDAETWVENAFPHATE 393
             N V S   +     + D   S+      D+   ++  FP+ ++
Sbjct: 717 --NMVNSRAVKFETVFLKDSKESENKNFPNDSRKIIQTEFPNESK 749

BLAST of CSPI03G25080 vs. Swiss-Prot
Match: YP41B_YEAST (Transposon Ty4-P Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-P PE=5 SV=2)

HSP 1 Score: 93.6 bits (231), Expect = 5.7e-18
Identity = 82/339 (24.19%), Postives = 148/339 (43.66%), Query Frame = 1

Query: 8   VQMIEDIMKGSDRSGKLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVN 67
           V+M E I + SD S   + ++K T +  +K+  +++       +LED     H R+GH  
Sbjct: 522 VKMNELIERPSDDSK--INAIKPTSSPGFKLNKRSI-------TLEDA----HKRMGHTG 581

Query: 68  FHDLK-LMGEKKLVVGVPLVTQPNKL-CEACVI---TKQARLPFPRQSTYRAEKPLELLH 127
              ++  +        + L+ +PN+  C+ C I   TK+        +     +P     
Sbjct: 582 IQQIENSIKHNHYEESLDLIKEPNEFWCQTCKISKATKRNHYTGSMNNHSTDHEPGSSWC 641

Query: 128 ADICGPISPRTLAGNKYFLLIVDDSTRWMWL--YMLEAKSDGFEAFNKFKLLMENKTEYK 187
            DI GP+S       +Y L++VD++TR+     +  +          K    +E + + K
Sbjct: 642 MDIFGPVSSSNADTKRYMLIMVDNNTRYCMTSTHFNKNAETILAQIRKNIQYVETQFDRK 701

Query: 188 IRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSM 247
           +R +  DRG EF + +  ++   +GI   LT+      NG  ER  RT++    +LL+  
Sbjct: 702 VREINSDRGTEFTNDQIEEYFISKGIHHILTSTQDHAANGRAERYIRTIVTDATTLLRQS 761

Query: 248 HVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKP---HLAHLRVFGCVAYVKNT 307
           ++  KFW  A+     + NCL  K+ G + P +A + R+P    L     FG    + N 
Sbjct: 762 NLRVKFWEYAVTSATNIRNCLEHKSTG-KLPLKA-ISRQPVTVRLMSFLPFGEKGIIWN- 821

Query: 308 TPHFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQIS 337
             + KKL     P +    +     ++ + P + K+  S
Sbjct: 822 -HNHKKLKPSGLPSIILCKDPNSYGYKFFIPSKNKIVTS 843

BLAST of CSPI03G25080 vs. Swiss-Prot
Match: YH41B_YEAST (Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-H PE=3 SV=1)

HSP 1 Score: 93.6 bits (231), Expect = 5.7e-18
Identity = 82/339 (24.19%), Postives = 148/339 (43.66%), Query Frame = 1

Query: 8   VQMIEDIMKGSDRSGKLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVN 67
           V+M E I + SD S   + ++K T +  +K+  +++       +LED     H R+GH  
Sbjct: 522 VKMNELIERPSDDSK--INAIKPTSSPGFKLNKRSI-------TLEDA----HKRMGHTG 581

Query: 68  FHDLK-LMGEKKLVVGVPLVTQPNKL-CEACVI---TKQARLPFPRQSTYRAEKPLELLH 127
              ++  +        + L+ +PN+  C+ C I   TK+        +     +P     
Sbjct: 582 IQQIENSIKHNHYEESLDLIKEPNEFWCQTCKISKATKRNHYTGSMNNHSTDHEPGSSWC 641

Query: 128 ADICGPISPRTLAGNKYFLLIVDDSTRWMWL--YMLEAKSDGFEAFNKFKLLMENKTEYK 187
            DI GP+S       +Y L++VD++TR+     +  +          K    +E + + K
Sbjct: 642 MDIFGPVSSSNADTKRYMLIMVDNNTRYCMTSTHFNKNAETILAQIRKNIQYVETQFDRK 701

Query: 188 IRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSM 247
           +R +  DRG EF + +  ++   +GI   LT+      NG  ER  RT++    +LL+  
Sbjct: 702 VREINSDRGTEFTNDQIEEYFISKGIHHILTSTQDHAANGRAERYIRTIVTDATTLLRQS 761

Query: 248 HVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKP---HLAHLRVFGCVAYVKNT 307
           ++  KFW  A+     + NCL  K+ G + P +A + R+P    L     FG    + N 
Sbjct: 762 NLRVKFWEYAVTSATNIRNCLEHKSTG-KLPLKA-ISRQPVTVRLMSFLPFGEKGIIWN- 821

Query: 308 TPHFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQIS 337
             + KKL     P +    +     ++ + P + K+  S
Sbjct: 822 -HNHKKLKPSGLPSIILCKDPNSYGYKFFIPSKNKIVTS 843

BLAST of CSPI03G25080 vs. Swiss-Prot
Match: YJ41B_YEAST (Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-J PE=3 SV=3)

HSP 1 Score: 89.0 bits (219), Expect = 1.4e-16
Identity = 81/339 (23.89%), Postives = 147/339 (43.36%), Query Frame = 1

Query: 8   VQMIEDIMKGSDRSGKLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVN 67
           V+M E I + SD S   + ++K T +  +K+  +++       +LED     H R+GH  
Sbjct: 523 VKMNELIERPSDDSK--INAIKPTSSPGFKLNKRSI-------TLEDA----HKRMGHTG 582

Query: 68  FHDLK-LMGEKKLVVGVPLVTQPNKL-CEACVI---TKQARLPFPRQSTYRAEKPLELLH 127
              ++  +        + L+ +PN+  C+ C I   TK+        +     +P     
Sbjct: 583 IQQIENSIKHNHYEESLDLIKEPNEFWCQTCKISKATKRNHYTGSMNNHSTDHEPGSSWC 642

Query: 128 ADICGPISPRTLAGNKYFLLIVDDSTRWMWL--YMLEAKSDGFEAFNKFKLLMENKTEYK 187
            DI GP+S       +Y L++VD++TR+     +  +          K    +E + + K
Sbjct: 643 MDIFGPVSSSNADTKRYMLIMVDNNTRYCMTSTHFNKNAETILAQVRKNIQYVETQFDRK 702

Query: 188 IRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSM 247
           +R +  DRG EF + +  ++   +GI   LT+      NG  ER  RT++    +LL+  
Sbjct: 703 VREINSDRGTEFTNDQIEEYFISKGIHHILTSTQDHAANGRAERYIRTIITDATTLLRQS 762

Query: 248 HVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKP---HLAHLRVFGCVAYVKNT 307
           ++  KFW  A+     + N L  K+ G + P +A + R+P    L     FG    + N 
Sbjct: 763 NLRVKFWEYAVTSATNIRNYLEHKSTG-KLPLKA-ISRQPVTVRLMSFLPFGEKGIIWN- 822

Query: 308 TPHFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQIS 337
             + KKL     P +    +     ++ + P + K+  S
Sbjct: 823 -HNHKKLKPSGLPSIILCKDPNSYGYKFFIPSKNKIVTS 844

BLAST of CSPI03G25080 vs. TrEMBL
Match: Q0J8A6_ORYSJ (Os08g0125300 protein OS=Oryza sativa subsp. japonica GN=Os08g0125300 PE=2 SV=1)

HSP 1 Score: 486.1 bits (1250), Expect = 4.4e-134
Identity = 219/365 (60.00%), Postives = 285/365 (78.08%), Query Frame = 1

Query: 1   MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLW 60
           +TE  ++V M ED+++  D+S  +L+M V++T NRLY+I LK    VCLLT +++P WLW
Sbjct: 436 LTETGHRVVMDEDVLEVFDKSPLRLVMRVRRTPNRLYRIELKLATPVCLLTRMDEPAWLW 495

Query: 61  HVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQARLPFPRQSTYRAEKPLE 120
           H RLGHVNF  +KL+ +K +  G+P +T PN+LC+AC++ KQ R PFP  + +RAE+PLE
Sbjct: 496 HARLGHVNFQAMKLLADKGMAGGIPAITHPNQLCQACLVAKQIRQPFPATANFRAEEPLE 555

Query: 121 LLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEY 180
           LLH D+CGPI+P T+AGN+YF+LIVDD +RWMW+++++ K    EAF KFK L EN    
Sbjct: 556 LLHIDLCGPITPTTMAGNRYFMLIVDDFSRWMWMFVIKTKDQALEAFTKFKPLAENTAGR 615

Query: 181 KIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKS 240
           +I+TLR DRGGEFLS EF Q C++ GI+R LTAPYSPQQNG++ERRNR+VMA  RSL+K 
Sbjct: 616 RIKTLRSDRGGEFLSGEFAQLCEQAGIQRHLTAPYSPQQNGVVERRNRSVMAMARSLMKG 675

Query: 241 MHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP 300
           M VP +FWGEA+RH VYLLN LPTKA+G+RTPFEAW GRKP L HLRVFGC+A+ K TTP
Sbjct: 676 MSVPGRFWGEAVRHAVYLLNRLPTKAMGDRTPFEAWTGRKPQLGHLRVFGCIAHAKITTP 735

Query: 301 HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEI 360
           + KKLDDRS+P VY GVEEG KAHRL+DP  G++ +SRDV+F+EN+ W W+ VV+  +  
Sbjct: 736 NQKKLDDRSAPYVYLGVEEGSKAHRLFDPRCGRIHVSRDVIFEENVPWQWS-VVAGEQNS 795

Query: 361 TEFQV 365
           TEF V
Sbjct: 796 TEFTV 799

BLAST of CSPI03G25080 vs. TrEMBL
Match: B8BDZ6_ORYSI (Putative uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_30754 PE=4 SV=1)

HSP 1 Score: 486.1 bits (1250), Expect = 4.4e-134
Identity = 219/365 (60.00%), Postives = 285/365 (78.08%), Query Frame = 1

Query: 1   MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLW 60
           +TE  ++V M ED+++  D+S  +L+M V++T NRLY+I LK    VCLLT +++P WLW
Sbjct: 436 LTETGHRVVMDEDVLEVFDKSPLRLVMRVRRTPNRLYRIELKLATPVCLLTRMDEPAWLW 495

Query: 61  HVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQARLPFPRQSTYRAEKPLE 120
           H RLGHVNF  +KL+ +K +  G+P +T PN+LC+AC++ KQ R PFP  + +RAE+PLE
Sbjct: 496 HARLGHVNFQAMKLLADKGMAGGIPAITHPNQLCQACLVAKQIRQPFPATANFRAEEPLE 555

Query: 121 LLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEY 180
           LLH D+CGPI+P T+AGN+YF+LIVDD +RWMW+++++ K    EAF KFK L EN    
Sbjct: 556 LLHIDLCGPITPTTMAGNRYFMLIVDDFSRWMWMFVIKTKDQALEAFTKFKPLAENTAGR 615

Query: 181 KIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKS 240
           +I+TLR DRGGEFLS EF Q C++ GI+R LTAPYSPQQNG++ERRNR+VMA  RSL+K 
Sbjct: 616 RIKTLRSDRGGEFLSGEFAQLCEQAGIQRHLTAPYSPQQNGVVERRNRSVMAMARSLMKG 675

Query: 241 MHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP 300
           M VP +FWGEA+RH VYLLN LPTKA+G+RTPFEAW GRKP L HLRVFGC+A+ K TTP
Sbjct: 676 MSVPGRFWGEAVRHAVYLLNRLPTKAMGDRTPFEAWTGRKPQLGHLRVFGCIAHAKITTP 735

Query: 301 HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEI 360
           + KKLDDRS+P VY GVEEG KAHRL+DP  G++ +SRDV+F+EN+ W W+ VV+  +  
Sbjct: 736 NQKKLDDRSAPYVYLGVEEGSKAHRLFDPRCGRIHVSRDVIFEENVPWQWS-VVAGEQNS 795

Query: 361 TEFQV 365
           TEF V
Sbjct: 796 TEFTV 799

BLAST of CSPI03G25080 vs. TrEMBL
Match: Q338J6_ORYSJ (Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica GN=LOC_Os10g26030 PE=4 SV=2)

HSP 1 Score: 466.8 bits (1200), Expect = 2.8e-128
Identity = 230/431 (53.36%), Postives = 292/431 (67.75%), Query Frame = 1

Query: 1   MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLW 60
           +TE  ++V M  D +K  D++  +L+M V++T NRLY+I L+   QVCLL SL++P WLW
Sbjct: 283 LTETGHRVMMDGDDLKVFDKNPWRLVMKVRRTSNRLYRIELQLASQVCLLASLDNPAWLW 342

Query: 61  HVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQARLPFPRQSTYRAEKPLE 120
           H R+GHVNFH LKL+ +K++  GVP V  PN+LC+AC++ KQ R PFP  + YRAE PLE
Sbjct: 343 HARIGHVNFHALKLLVDKEMASGVPTVHHPNQLCQACLVAKQVRQPFPGMANYRAEAPLE 402

Query: 121 LLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEY 180
           LLH D+CGPI+P T AGN+YF+LIVDD + WMW+++++ K      F KFK L +N    
Sbjct: 403 LLHMDLCGPITPSTFAGNRYFMLIVDDFSNWMWVFVIKLKDQALAVFEKFKPLAKNTVGR 462

Query: 181 KIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKS 240
            I+TLR DRGGEFLS +F + C    IER LTAPYSPQQN ++ERRNRTVMA  RSLLK 
Sbjct: 463 TIKTLRTDRGGEFLSGKFARVCDAASIERHLTAPYSPQQNDVVERRNRTVMAMARSLLKG 522

Query: 241 MHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP 300
           M VP + WGEA+RH ++LLN LPTKA+G RTPFEAW G+KPHL HLRVFGC A+ K T P
Sbjct: 523 MSVPGRMWGEAVRHAIFLLNWLPTKAMGNRTPFEAWTGKKPHLGHLRVFGCTAHAKVTAP 582

Query: 301 HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEI 360
           H KKLDDRS+P VY GVEEG KAHRL+DP R ++ +SRDV+F EN  W W+       E+
Sbjct: 583 HLKKLDDRSNPFVYLGVEEGSKAHRLFDPRRRQIIVSRDVVFDENTPWQWSAAAG---EV 642

Query: 361 TEFQVMDQFYSDEFENLEDAETWVENAFPHATEIPAIGETS--SSPPSTNT---PVRLRS 420
           T         S EFE  E          P   E PA+ E +  +SP +  +   PVR RS
Sbjct: 643 T---------STEFEVEE----------PVGAEQPALAEQAGLASPHTAGSDVGPVRYRS 691

Query: 421 LSDIYANTEEV 426
           L++I      V
Sbjct: 703 LAEIMLEAPRV 691

BLAST of CSPI03G25080 vs. TrEMBL
Match: Q84SW8_ORYSJ (Gag-pol polyprotein OS=Oryza sativa subsp. japonica GN=LOC_Os03g47410 PE=4 SV=1)

HSP 1 Score: 466.1 bits (1198), Expect = 4.7e-128
Identity = 223/411 (54.26%), Postives = 288/411 (70.07%), Query Frame = 1

Query: 1   MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLW 60
           +TE  ++V M  D ++  D++  +L+M V+++ NRLY+I L+    VCLL SL+DP WLW
Sbjct: 389 LTETGHRVVMDGDDLEVFDKNPWRLVMKVRRSSNRLYRIELQLASPVCLLASLDDPAWLW 448

Query: 61  HVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQARLPFPRQSTYRAEKPLE 120
           H RLGHVNFH LKL+ +K++  GVP V  PN+LC+AC++ KQ R PFP  + Y AE PLE
Sbjct: 449 HARLGHVNFHALKLLVDKEMAAGVPAVHHPNQLCQACLVAKQVRQPFPGMANYLAEAPLE 508

Query: 121 LLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEY 180
           LLH D+CGPI+P T  GN+YF+LIVDD + WMW++++++K     AF KFK L EN    
Sbjct: 509 LLHMDLCGPITPSTFTGNRYFMLIVDDFSHWMWVFVIKSKDQALAAFEKFKPLAENTAGR 568

Query: 181 KIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKS 240
            I+TLR DRGGEFLS EF + C   GIER LT PYSPQQNG++ERRNRTVMA  RSLLK 
Sbjct: 569 TIKTLRTDRGGEFLSGEFARVCDAAGIERHLTVPYSPQQNGVVERRNRTVMAMARSLLKG 628

Query: 241 MHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP 300
           M VP + WGEA+RH V+LLN LPTKA+G RTPFEAW G+KPHL HLRVFGC A+ K T P
Sbjct: 629 MSVPGRMWGEAVRHAVFLLNRLPTKAMGNRTPFEAWTGKKPHLGHLRVFGCTAHAKVTAP 688

Query: 301 HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEI 360
           H KKLDDRS+P+VY GVEEG KAHRL+DP R ++ +SRDV+F EN  W W+    +    
Sbjct: 689 HLKKLDDRSNPVVYLGVEEGSKAHRLFDPRRRQIIVSRDVVFDENTPWQWSAAAGEVTS- 748

Query: 361 TEFQVMDQFYSDEFENLEDAET--W--VENAFPHATEIPAIGETSSSPPST 407
           TEF+V +   +++    E A +  W     A   A + P + E   +PP++
Sbjct: 749 TEFEVEEPVGAEQPAPAEQAGSVPWYRAPPAGRRAGKEPEVAEQRGTPPAS 798

BLAST of CSPI03G25080 vs. TrEMBL
Match: Q7XMW2_ORYSJ (OSJNBb0040D15.12 protein OS=Oryza sativa subsp. japonica GN=OSJNBb0040D15.12 PE=4 SV=2)

HSP 1 Score: 459.5 bits (1181), Expect = 4.4e-126
Identity = 230/458 (50.22%), Postives = 299/458 (65.28%), Query Frame = 1

Query: 1   MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLW 60
           +TE  ++V M  D ++  D++  +L+M V++T NRLY+I L+    VCLL SL+DP WLW
Sbjct: 249 LTETGHRVVMDGDDLEVFDKNPWRLVMKVRRTSNRLYRIELQLASPVCLLASLDDPAWLW 308

Query: 61  HVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQARLPFPRQSTYRAEKPLE 120
           H RLGHVNFH LKL+ +K++V GVP V  PN+LC+AC++ KQ R  FP  + YRAE PLE
Sbjct: 309 HARLGHVNFHALKLLVDKEMVAGVPAVHHPNQLCQACLVAKQVRQSFPGMANYRAEAPLE 368

Query: 121 LLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEY 180
           LLH D+CGPI+P T AGN+YF+LIVDD +RWMW++++++K     A  KFK L EN    
Sbjct: 369 LLHMDLCGPITPSTFAGNRYFMLIVDDFSRWMWVFVIKSKDQALAASEKFKPLAENTAGR 428

Query: 181 KIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKS 240
            I+TLR DRG EFLS EF + C   GIER LTAPYSPQQNG++E RNRTVMA  RSLLK 
Sbjct: 429 TIKTLRTDRGSEFLSGEFARVCDAAGIERHLTAPYSPQQNGVVEHRNRTVMAMARSLLKG 488

Query: 241 MHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP 300
           M VP + WGEA+RH V+LLN LPTKA+G RT FEAWMG+KPHL HL VFGC A+ K T P
Sbjct: 489 MSVPGRMWGEAVRHAVFLLNRLPTKAMGNRTSFEAWMGKKPHLGHLWVFGCTAHTKVTAP 548

Query: 301 HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEI 360
           H  KLDDRS+P VY GVEEG KAHRL+DP R ++ +SRDV+F EN  W W+    +    
Sbjct: 549 HLMKLDDRSNPFVYLGVEEGSKAHRLFDPRRRQIIVSRDVVFDENTPWQWSAAAGEVTS- 608

Query: 361 TEFQVMDQFYSDEFENL-------------EDAETWVENAFPHAT---------EIPAIG 420
           TEF+V +   +++  ++             ++ E   +   P A+           P +G
Sbjct: 609 TEFEVEEPVGAEQAASVPWYRAPPAGRRAGKEPEVAEQRGTPPASPARFSPTLPSTPTLG 668

Query: 421 ETSS----------SPPSTNTPVRLRSLSDIYANTEEV 426
            +S+          +P S + PVR RSL +I      V
Sbjct: 669 SSSTHSAEVQASPRTPGSDDGPVRYRSLVEIMLEAPRV 705

BLAST of CSPI03G25080 vs. NCBI nr
Match: gi|113622864|dbj|BAF22809.1| (Os08g0125300 [Oryza sativa Japonica Group])

HSP 1 Score: 486.1 bits (1250), Expect = 6.3e-134
Identity = 219/365 (60.00%), Postives = 285/365 (78.08%), Query Frame = 1

Query: 1   MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLW 60
           +TE  ++V M ED+++  D+S  +L+M V++T NRLY+I LK    VCLLT +++P WLW
Sbjct: 436 LTETGHRVVMDEDVLEVFDKSPLRLVMRVRRTPNRLYRIELKLATPVCLLTRMDEPAWLW 495

Query: 61  HVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQARLPFPRQSTYRAEKPLE 120
           H RLGHVNF  +KL+ +K +  G+P +T PN+LC+AC++ KQ R PFP  + +RAE+PLE
Sbjct: 496 HARLGHVNFQAMKLLADKGMAGGIPAITHPNQLCQACLVAKQIRQPFPATANFRAEEPLE 555

Query: 121 LLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEY 180
           LLH D+CGPI+P T+AGN+YF+LIVDD +RWMW+++++ K    EAF KFK L EN    
Sbjct: 556 LLHIDLCGPITPTTMAGNRYFMLIVDDFSRWMWMFVIKTKDQALEAFTKFKPLAENTAGR 615

Query: 181 KIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKS 240
           +I+TLR DRGGEFLS EF Q C++ GI+R LTAPYSPQQNG++ERRNR+VMA  RSL+K 
Sbjct: 616 RIKTLRSDRGGEFLSGEFAQLCEQAGIQRHLTAPYSPQQNGVVERRNRSVMAMARSLMKG 675

Query: 241 MHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP 300
           M VP +FWGEA+RH VYLLN LPTKA+G+RTPFEAW GRKP L HLRVFGC+A+ K TTP
Sbjct: 676 MSVPGRFWGEAVRHAVYLLNRLPTKAMGDRTPFEAWTGRKPQLGHLRVFGCIAHAKITTP 735

Query: 301 HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEI 360
           + KKLDDRS+P VY GVEEG KAHRL+DP  G++ +SRDV+F+EN+ W W+ VV+  +  
Sbjct: 736 NQKKLDDRSAPYVYLGVEEGSKAHRLFDPRCGRIHVSRDVIFEENVPWQWS-VVAGEQNS 795

Query: 361 TEFQV 365
           TEF V
Sbjct: 796 TEFTV 799

BLAST of CSPI03G25080 vs. NCBI nr
Match: gi|218201855|gb|EEC84282.1| (hypothetical protein OsI_30754 [Oryza sativa Indica Group])

HSP 1 Score: 486.1 bits (1250), Expect = 6.3e-134
Identity = 219/365 (60.00%), Postives = 285/365 (78.08%), Query Frame = 1

Query: 1   MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLW 60
           +TE  ++V M ED+++  D+S  +L+M V++T NRLY+I LK    VCLLT +++P WLW
Sbjct: 436 LTETGHRVVMDEDVLEVFDKSPLRLVMRVRRTPNRLYRIELKLATPVCLLTRMDEPAWLW 495

Query: 61  HVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQARLPFPRQSTYRAEKPLE 120
           H RLGHVNF  +KL+ +K +  G+P +T PN+LC+AC++ KQ R PFP  + +RAE+PLE
Sbjct: 496 HARLGHVNFQAMKLLADKGMAGGIPAITHPNQLCQACLVAKQIRQPFPATANFRAEEPLE 555

Query: 121 LLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEY 180
           LLH D+CGPI+P T+AGN+YF+LIVDD +RWMW+++++ K    EAF KFK L EN    
Sbjct: 556 LLHIDLCGPITPTTMAGNRYFMLIVDDFSRWMWMFVIKTKDQALEAFTKFKPLAENTAGR 615

Query: 181 KIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKS 240
           +I+TLR DRGGEFLS EF Q C++ GI+R LTAPYSPQQNG++ERRNR+VMA  RSL+K 
Sbjct: 616 RIKTLRSDRGGEFLSGEFAQLCEQAGIQRHLTAPYSPQQNGVVERRNRSVMAMARSLMKG 675

Query: 241 MHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP 300
           M VP +FWGEA+RH VYLLN LPTKA+G+RTPFEAW GRKP L HLRVFGC+A+ K TTP
Sbjct: 676 MSVPGRFWGEAVRHAVYLLNRLPTKAMGDRTPFEAWTGRKPQLGHLRVFGCIAHAKITTP 735

Query: 301 HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEI 360
           + KKLDDRS+P VY GVEEG KAHRL+DP  G++ +SRDV+F+EN+ W W+ VV+  +  
Sbjct: 736 NQKKLDDRSAPYVYLGVEEGSKAHRLFDPRCGRIHVSRDVIFEENVPWQWS-VVAGEQNS 795

Query: 361 TEFQV 365
           TEF V
Sbjct: 796 TEFTV 799

BLAST of CSPI03G25080 vs. NCBI nr
Match: gi|110289052|gb|ABB47537.2| (retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group])

HSP 1 Score: 466.8 bits (1200), Expect = 4.0e-128
Identity = 230/431 (53.36%), Postives = 292/431 (67.75%), Query Frame = 1

Query: 1   MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLW 60
           +TE  ++V M  D +K  D++  +L+M V++T NRLY+I L+   QVCLL SL++P WLW
Sbjct: 283 LTETGHRVMMDGDDLKVFDKNPWRLVMKVRRTSNRLYRIELQLASQVCLLASLDNPAWLW 342

Query: 61  HVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQARLPFPRQSTYRAEKPLE 120
           H R+GHVNFH LKL+ +K++  GVP V  PN+LC+AC++ KQ R PFP  + YRAE PLE
Sbjct: 343 HARIGHVNFHALKLLVDKEMASGVPTVHHPNQLCQACLVAKQVRQPFPGMANYRAEAPLE 402

Query: 121 LLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEY 180
           LLH D+CGPI+P T AGN+YF+LIVDD + WMW+++++ K      F KFK L +N    
Sbjct: 403 LLHMDLCGPITPSTFAGNRYFMLIVDDFSNWMWVFVIKLKDQALAVFEKFKPLAKNTVGR 462

Query: 181 KIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKS 240
            I+TLR DRGGEFLS +F + C    IER LTAPYSPQQN ++ERRNRTVMA  RSLLK 
Sbjct: 463 TIKTLRTDRGGEFLSGKFARVCDAASIERHLTAPYSPQQNDVVERRNRTVMAMARSLLKG 522

Query: 241 MHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP 300
           M VP + WGEA+RH ++LLN LPTKA+G RTPFEAW G+KPHL HLRVFGC A+ K T P
Sbjct: 523 MSVPGRMWGEAVRHAIFLLNWLPTKAMGNRTPFEAWTGKKPHLGHLRVFGCTAHAKVTAP 582

Query: 301 HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEI 360
           H KKLDDRS+P VY GVEEG KAHRL+DP R ++ +SRDV+F EN  W W+       E+
Sbjct: 583 HLKKLDDRSNPFVYLGVEEGSKAHRLFDPRRRQIIVSRDVVFDENTPWQWSAAAG---EV 642

Query: 361 TEFQVMDQFYSDEFENLEDAETWVENAFPHATEIPAIGETS--SSPPSTNT---PVRLRS 420
           T         S EFE  E          P   E PA+ E +  +SP +  +   PVR RS
Sbjct: 643 T---------STEFEVEE----------PVGAEQPALAEQAGLASPHTAGSDVGPVRYRS 691

Query: 421 LSDIYANTEEV 426
           L++I      V
Sbjct: 703 LAEIMLEAPRV 691

BLAST of CSPI03G25080 vs. NCBI nr
Match: gi|29150404|gb|AAO72413.1| (gag-pol polyprotein [Oryza sativa Japonica Group])

HSP 1 Score: 466.1 bits (1198), Expect = 6.8e-128
Identity = 223/411 (54.26%), Postives = 288/411 (70.07%), Query Frame = 1

Query: 1   MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLW 60
           +TE  ++V M  D ++  D++  +L+M V+++ NRLY+I L+    VCLL SL+DP WLW
Sbjct: 389 LTETGHRVVMDGDDLEVFDKNPWRLVMKVRRSSNRLYRIELQLASPVCLLASLDDPAWLW 448

Query: 61  HVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQARLPFPRQSTYRAEKPLE 120
           H RLGHVNFH LKL+ +K++  GVP V  PN+LC+AC++ KQ R PFP  + Y AE PLE
Sbjct: 449 HARLGHVNFHALKLLVDKEMAAGVPAVHHPNQLCQACLVAKQVRQPFPGMANYLAEAPLE 508

Query: 121 LLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEY 180
           LLH D+CGPI+P T  GN+YF+LIVDD + WMW++++++K     AF KFK L EN    
Sbjct: 509 LLHMDLCGPITPSTFTGNRYFMLIVDDFSHWMWVFVIKSKDQALAAFEKFKPLAENTAGR 568

Query: 181 KIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKS 240
            I+TLR DRGGEFLS EF + C   GIER LT PYSPQQNG++ERRNRTVMA  RSLLK 
Sbjct: 569 TIKTLRTDRGGEFLSGEFARVCDAAGIERHLTVPYSPQQNGVVERRNRTVMAMARSLLKG 628

Query: 241 MHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP 300
           M VP + WGEA+RH V+LLN LPTKA+G RTPFEAW G+KPHL HLRVFGC A+ K T P
Sbjct: 629 MSVPGRMWGEAVRHAVFLLNRLPTKAMGNRTPFEAWTGKKPHLGHLRVFGCTAHAKVTAP 688

Query: 301 HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEI 360
           H KKLDDRS+P+VY GVEEG KAHRL+DP R ++ +SRDV+F EN  W W+    +    
Sbjct: 689 HLKKLDDRSNPVVYLGVEEGSKAHRLFDPRRRQIIVSRDVVFDENTPWQWSAAAGEVTS- 748

Query: 361 TEFQVMDQFYSDEFENLEDAET--W--VENAFPHATEIPAIGETSSSPPST 407
           TEF+V +   +++    E A +  W     A   A + P + E   +PP++
Sbjct: 749 TEFEVEEPVGAEQPAPAEQAGSVPWYRAPPAGRRAGKEPEVAEQRGTPPAS 798

BLAST of CSPI03G25080 vs. NCBI nr
Match: gi|38345658|emb|CAE04422.2| (OSJNBb0040D15.12 [Oryza sativa Japonica Group])

HSP 1 Score: 459.5 bits (1181), Expect = 6.3e-126
Identity = 230/458 (50.22%), Postives = 299/458 (65.28%), Query Frame = 1

Query: 1   MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLW 60
           +TE  ++V M  D ++  D++  +L+M V++T NRLY+I L+    VCLL SL+DP WLW
Sbjct: 249 LTETGHRVVMDGDDLEVFDKNPWRLVMKVRRTSNRLYRIELQLASPVCLLASLDDPAWLW 308

Query: 61  HVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQARLPFPRQSTYRAEKPLE 120
           H RLGHVNFH LKL+ +K++V GVP V  PN+LC+AC++ KQ R  FP  + YRAE PLE
Sbjct: 309 HARLGHVNFHALKLLVDKEMVAGVPAVHHPNQLCQACLVAKQVRQSFPGMANYRAEAPLE 368

Query: 121 LLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEY 180
           LLH D+CGPI+P T AGN+YF+LIVDD +RWMW++++++K     A  KFK L EN    
Sbjct: 369 LLHMDLCGPITPSTFAGNRYFMLIVDDFSRWMWVFVIKSKDQALAASEKFKPLAENTAGR 428

Query: 181 KIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKS 240
            I+TLR DRG EFLS EF + C   GIER LTAPYSPQQNG++E RNRTVMA  RSLLK 
Sbjct: 429 TIKTLRTDRGSEFLSGEFARVCDAAGIERHLTAPYSPQQNGVVEHRNRTVMAMARSLLKG 488

Query: 241 MHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP 300
           M VP + WGEA+RH V+LLN LPTKA+G RT FEAWMG+KPHL HL VFGC A+ K T P
Sbjct: 489 MSVPGRMWGEAVRHAVFLLNRLPTKAMGNRTSFEAWMGKKPHLGHLWVFGCTAHTKVTAP 548

Query: 301 HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEI 360
           H  KLDDRS+P VY GVEEG KAHRL+DP R ++ +SRDV+F EN  W W+    +    
Sbjct: 549 HLMKLDDRSNPFVYLGVEEGSKAHRLFDPRRRQIIVSRDVVFDENTPWQWSAAAGEVTS- 608

Query: 361 TEFQVMDQFYSDEFENL-------------EDAETWVENAFPHAT---------EIPAIG 420
           TEF+V +   +++  ++             ++ E   +   P A+           P +G
Sbjct: 609 TEFEVEEPVGAEQAASVPWYRAPPAGRRAGKEPEVAEQRGTPPASPARFSPTLPSTPTLG 668

Query: 421 ETSS----------SPPSTNTPVRLRSLSDIYANTEEV 426
            +S+          +P S + PVR RSL +I      V
Sbjct: 669 SSSTHSAEVQASPRTPGSDDGPVRYRSLVEIMLEAPRV 705

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
POLX_TOBAC1.2e-6035.29Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
COPIA_DROME6.0e-5234.20Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3[more]
YP41B_YEAST5.7e-1824.19Transposon Ty4-P Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
YH41B_YEAST5.7e-1824.19Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
YJ41B_YEAST1.4e-1623.89Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Match NameE-valueIdentityDescription
Q0J8A6_ORYSJ4.4e-13460.00Os08g0125300 protein OS=Oryza sativa subsp. japonica GN=Os08g0125300 PE=2 SV=1[more]
B8BDZ6_ORYSI4.4e-13460.00Putative uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_30754 PE=4... [more]
Q338J6_ORYSJ2.8e-12853.36Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica ... [more]
Q84SW8_ORYSJ4.7e-12854.26Gag-pol polyprotein OS=Oryza sativa subsp. japonica GN=LOC_Os03g47410 PE=4 SV=1[more]
Q7XMW2_ORYSJ4.4e-12650.22OSJNBb0040D15.12 protein OS=Oryza sativa subsp. japonica GN=OSJNBb0040D15.12 PE=... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
gi|113622864|dbj|BAF22809.1|6.3e-13460.00Os08g0125300 [Oryza sativa Japonica Group][more]
gi|218201855|gb|EEC84282.1|6.3e-13460.00hypothetical protein OsI_30754 [Oryza sativa Indica Group][more]
gi|110289052|gb|ABB47537.2|4.0e-12853.36retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group][more]
gi|29150404|gb|AAO72413.1|6.8e-12854.26gag-pol polyprotein [Oryza sativa Japonica Group][more]
gi|38345658|emb|CAE04422.2|6.3e-12650.22OSJNBb0040D15.12 [Oryza sativa Japonica Group][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001584Integrase_cat-core
IPR012337RNaseH-like_sf
IPR025724GAG-pre-integrase_dom
Vocabulary: Biological Process
TermDefinition
GO:0015074DNA integration
Vocabulary: Molecular Function
TermDefinition
GO:0003676nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0006310 DNA recombination
cellular_component GO:0005739 mitochondrion
molecular_function GO:0003677 DNA binding
molecular_function GO:0046872 metal ion binding
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI03G25080.1CSPI03G25080.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 115..232
score: 6.0
IPR001584Integrase, catalytic corePROFILEPS50994INTEGRASEcoord: 113..279
score: 26
IPR012337Ribonuclease H-like domainGENE3DG3DSA:3.30.420.10coord: 117..283
score: 7.5
IPR012337Ribonuclease H-like domainunknownSSF53098Ribonuclease H-likecoord: 116..288
score: 8.47
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 34..101
score: 4.4
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 1..425
score: 9.8E
NoneNo IPR availablePANTHERPTHR11439:SF127SUBFAMILY NOT NAMEDcoord: 1..425
score: 9.8E

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
CSPI03G25080Melon (DHL92) v3.6.1cpimedB197
CSPI03G25080Cucumber (Gy14) v2cgybcpiB110
CSPI03G25080Wax gourdcpiwgoB298
CSPI03G25080Cucumber (Chinese Long) v3cpicucB144
CSPI03G25080Cucumber (Gy14) v1cgycpiB451
CSPI03G25080Cucumber (Chinese Long) v2cpicuB120
CSPI03G25080Melon (DHL92) v3.5.1cpimeB206
CSPI03G25080Watermelon (97103) v1cpiwmB229
CSPI03G25080Bottle gourd (USVL1VR-Ls)cpilsiB230