CmoCh05G002090 (gene) Cucurbita moschata (Rifu)

NameCmoCh05G002090
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionTransposon Ty1-BL Gag-Pol polyprotein
LocationCmo_Chr05 : 905025 .. 907163 (-)
   



The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACCGGGGCTAGATCTGCGTTCTCCGAGCTCGACTCGGGGATCCGTGGGACGGTGAAATTCGGCGACGGCTCCGTCGTCGAGATCGAAGGGCGCGGCACCATTCTGTTCATCAGTAAGGGAGGCGAGCATCGCAAGCTGACCGACGTCTACTTCATCCCGAGGCTCAAGGCTAACCTTGTGAGCCTGGGTCAACTCGATGAGACAGGTTGCTTCATTTCCATCGAGCGCGGACTACTCAAAATCTGCGATAATCAACGACGGCTGCTCACGCAGGCAAGGCGCACGACAAACCGCCTTTACGTCCTGGAGTTAGAGATAGACCAACCCGTTAGCCTCTCGGCCAAGACCGAAGAGGTATCTTGGAGGTGGCACGCAAGGTACGGACACTTAAACTTTCCTGCCCTAGAAAAGCTACAGAAGAAGGAGTTGGTGCACGGCTTGCCAGAAATCAAAGGCGTGAACAAGCTGTGCGATGGGTGCCTCATCGGCAAACAGAGGCGCACACCCTTTCCGTCCCGAACAGCCTACCGAGCCGATGAGCCATTGGAGCTTGTACACGGCGATATTTGCGGGCCCATCAAGCCGGCGACCCCAGGTGGTAAGAGTCTCTTCCTCCTATTAGTCGATGACAAAAGCCGCTTCATGTGGCTGACCCTGCTGCAAGCGAAAAGTGAGGCGGCAGAGGCGGTTAAGCGCATTAAAGCGCGAGCGGAGGCCGAATGTGAGAAGAAGATGCGAGTGCTGCGTACAGACCGAGGCGGAGAATTCACCTCGGCAAGTTTCAGTAAGTACTGCGACGAGATCGGCATACAACGGCACCTAACGGCGCCCTACTCCCCCCAACAGAACGGAGTGGTAGAGCGCCGAAATCAGACCATTGTCGGGACAGCGAGGTCATTGTTGGTGACGGCCGGGATGCCTGGGAGATTCTGGGGAGAGGCAGTAATGACGGCTGTCTATCTCCTCAATCGGTCACCAACCCGAAGCCTCGACGGAAAGACGCCATATGAGGCCTGGTACAACAAAAAACCAACAGTACATCACTTTCGCGTGTTCGGCTGCGTCGCATACATGAAGGTAACACGTCCCCATCTCGCCAAGCTCGATCCCAGGGGGCTGAAGGTCGTCTTCATCGGCTACGAACCCGGGAGCAAGGCGTACAGACTCTATGATCCTGTAGGGGGGCGAGCTCACGTGTCTCGCGACGTCGTCTTCGACGAAAACACCTTCTGGCAGTGGAATGACGTGATCGAGGCAGACCGTGATCCAAATCAATTCACGGTGGAGTACCTCGTCACCGAGCCTGAAGAAGGAGGAGCCCAGCATCAGGAGACGTCACCGCCGCCAGCAGGTGCACCACCTGAACCAGTGGAATTCGCAACACCACGGACTGCGGATTCGACGCTGGATGCCGATCACGATACTGATCTGGAGGCTAGGTATCGGAGGATGGATGACCTAGTGGGAGGAGGTGAACCACCTGGACTAGCAGCGCGCGAACTCGAAGAAGTGGCCGAACTACATGCTGTCAGTGCAGATGAACCAAACACCTTCGCCGAAGCAGAAAAGAACCCGTGCTGGCGGAAGGCAATGCAGGAGGAGATGACATCCATCACTGAGAACCAGACGTGGAGTCTAGAGGATATGCCACCGGGACACCGAGCCATAGGGCTCAAATGGGTCTTCAAACTGAAGCGCAACGAAAAAGGAGAAGTTGTGAAGCACAAGGCTCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGACTTCGAAGAGGTATTTGCGCCAGTAGCAAGGTTAGAATCCGTTCGTTTCTTGCTAGCAATTGCAGCACATCACTCTTGGGAGGTTCACCATATGGACGTAAAGTCTGCTTTCCTTAACGGAGAGTTGAAGGAGACCGTCTATGTTCGACAACCACCTGGCTTCCTGGATAACGACAACCCTAATAAGGTTCTGCGCCTGCACAAAGCACTCTACGGGCTTCGACAAGCCCCACGAGCCTGGAACGCGAAGCTCGACGGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGAGCATGGCATGTACACGTACGGCCACGGGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGA

mRNA sequence

ATGACCGGGGCTAGATCTGCGTTCTCCGAGCTCGACTCGGGGATCCGTGGGACGGTGAAATTCGGCGACGGCTCCGTCGTCGAGATCGAAGGGCGCGGCACCATTCTGTTCATCAGTAAGGGAGGCGAGCATCGCAAGCTGACCGACGTCTACTTCATCCCGAGGCTCAAGGCTAACCTTGTGAGCCTGGGTCAACTCGATGAGACAGGTTGCTTCATTTCCATCGAGCGCGGACTACTCAAAATCTGCGATAATCAACGACGGCTGCTCACGCAGGCAAGGCGCACGACAAACCGCCTTTACGTCCTGGAGTTAGAGATAGACCAACCCGTTAGCCTCTCGGCCAAGACCGAAGAGGTATCTTGGAGGTGGCACGCAAGGTACGGACACTTAAACTTTCCTGCCCTAGAAAAGCTACAGAAGAAGGAGTTGGTGCACGGCTTGCCAGAAATCAAAGGCGTGAACAAGCTGTGCGATGGGTGCCTCATCGGCAAACAGAGGCGCACACCCTTTCCGTCCCGAACAGCCTACCGAGCCGATGAGCCATTGGAGCTTGTACACGGCGATATTTGCGGGCCCATCAAGCCGGCGACCCCAGGTGGTAAGAGTCTCTTCCTCCTATTAGTCGATGACAAAAGCCGCTTCATGTGGCTGACCCTGCTGCAAGCGAAAAGTGAGGCGGCAGAGGCGGTTAAGCGCATTAAAGCGCGAGCGGAGGCCGAATGTGAGAAGAAGATGCGAGTGCTGCGTACAGACCGAGGCGGAGAATTCACCTCGGCAAGTTTCAGTAAGTACTGCGACGAGATCGGCATACAACGGCACCTAACGGCGCCCTACTCCCCCCAACAGAACGGAGTGGTAGAGCGCCGAAATCAGACCATTGTCGGGACAGCGAGGTCATTGTTGGTGACGGCCGGGATGCCTGGGAGATTCTGGGGAGAGGCAGTAATGACGGCTGTCTATCTCCTCAATCGGTCACCAACCCGAAGCCTCGACGGAAAGACGCCATATGAGGCCTGGTACAACAAAAAACCAACAGTACATCACTTTCGCGTGTTCGGCTGCGTCGCATACATGAAGGTAACACGTCCCCATCTCGCCAAGCTCGATCCCAGGGGGCTGAAGGTCGTCTTCATCGGCTACGAACCCGGGAGCAAGGCGTACAGACTCTATGATCCTGTAGGGGGGCGAGCTCACGTGTCTCGCGACGTCGTCTTCGACGAAAACACCTTCTGGCAGTGGAATGACGTGATCGAGGCAGACCGTGATCCAAATCAATTCACGGTGGAGTACCTCGTCACCGAGCCTGAAGAAGGAGGAGCCCAGCATCAGGAGACGTCACCGCCGCCAGCAGGTGCACCACCTGAACCAGTGGAATTCGCAACACCACGGACTGCGGATTCGACGCTGGATGCCGATCACGATACTGATCTGGAGGCTAGGTATCGGAGGATGGATGACCTAGTGGGAGGAGGTGAACCACCTGGACTAGCAGCGCGCGAACTCGAAGAAGTGGCCGAACTACATGCTGTCAGTGCAGATGAACCAAACACCTTCGCCGAAGCAGAAAAGAACCCGTGCTGGCGGAAGGCAATGCAGGAGGAGATGACATCCATCACTGAGAACCAGACGTGGAGTCTAGAGGATATGCCACCGGGACACCGAGCCATAGGGCTCAAATGGGTCTTCAAACTGAAGCGCAACGAAAAAGGAGAAGTTGTGAAGCACAAGGCTCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGACTTCGAAGAGGTATTTGCGCCAGTAGCAAGGTTAGAATCCGTTCGTTTCTTGCTAGCAATTGCAGCACATCACTCTTGGGAGGTTCACCATATGGACGTAAAGTCTGCTTTCCTTAACGGAGAGTTGAAGGAGACCGTCTATGTTCGACAACCACCTGGCTTCCTGGATAACGACAACCCTAATAAGGTTCTGCGCCTGCACAAAGCACTCTACGGGCTTCGACAAGCCCCACGAGCCTGGAACGCGAAGCTCGACGGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGAGCATGGCATGTACACGTACGGCCACGGGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGA

Coding sequence (CDS)

ATGACCGGGGCTAGATCTGCGTTCTCCGAGCTCGACTCGGGGATCCGTGGGACGGTGAAATTCGGCGACGGCTCCGTCGTCGAGATCGAAGGGCGCGGCACCATTCTGTTCATCAGTAAGGGAGGCGAGCATCGCAAGCTGACCGACGTCTACTTCATCCCGAGGCTCAAGGCTAACCTTGTGAGCCTGGGTCAACTCGATGAGACAGGTTGCTTCATTTCCATCGAGCGCGGACTACTCAAAATCTGCGATAATCAACGACGGCTGCTCACGCAGGCAAGGCGCACGACAAACCGCCTTTACGTCCTGGAGTTAGAGATAGACCAACCCGTTAGCCTCTCGGCCAAGACCGAAGAGGTATCTTGGAGGTGGCACGCAAGGTACGGACACTTAAACTTTCCTGCCCTAGAAAAGCTACAGAAGAAGGAGTTGGTGCACGGCTTGCCAGAAATCAAAGGCGTGAACAAGCTGTGCGATGGGTGCCTCATCGGCAAACAGAGGCGCACACCCTTTCCGTCCCGAACAGCCTACCGAGCCGATGAGCCATTGGAGCTTGTACACGGCGATATTTGCGGGCCCATCAAGCCGGCGACCCCAGGTGGTAAGAGTCTCTTCCTCCTATTAGTCGATGACAAAAGCCGCTTCATGTGGCTGACCCTGCTGCAAGCGAAAAGTGAGGCGGCAGAGGCGGTTAAGCGCATTAAAGCGCGAGCGGAGGCCGAATGTGAGAAGAAGATGCGAGTGCTGCGTACAGACCGAGGCGGAGAATTCACCTCGGCAAGTTTCAGTAAGTACTGCGACGAGATCGGCATACAACGGCACCTAACGGCGCCCTACTCCCCCCAACAGAACGGAGTGGTAGAGCGCCGAAATCAGACCATTGTCGGGACAGCGAGGTCATTGTTGGTGACGGCCGGGATGCCTGGGAGATTCTGGGGAGAGGCAGTAATGACGGCTGTCTATCTCCTCAATCGGTCACCAACCCGAAGCCTCGACGGAAAGACGCCATATGAGGCCTGGTACAACAAAAAACCAACAGTACATCACTTTCGCGTGTTCGGCTGCGTCGCATACATGAAGGTAACACGTCCCCATCTCGCCAAGCTCGATCCCAGGGGGCTGAAGGTCGTCTTCATCGGCTACGAACCCGGGAGCAAGGCGTACAGACTCTATGATCCTGTAGGGGGGCGAGCTCACGTGTCTCGCGACGTCGTCTTCGACGAAAACACCTTCTGGCAGTGGAATGACGTGATCGAGGCAGACCGTGATCCAAATCAATTCACGGTGGAGTACCTCGTCACCGAGCCTGAAGAAGGAGGAGCCCAGCATCAGGAGACGTCACCGCCGCCAGCAGGTGCACCACCTGAACCAGTGGAATTCGCAACACCACGGACTGCGGATTCGACGCTGGATGCCGATCACGATACTGATCTGGAGGCTAGGTATCGGAGGATGGATGACCTAGTGGGAGGAGGTGAACCACCTGGACTAGCAGCGCGCGAACTCGAAGAAGTGGCCGAACTACATGCTGTCAGTGCAGATGAACCAAACACCTTCGCCGAAGCAGAAAAGAACCCGTGCTGGCGGAAGGCAATGCAGGAGGAGATGACATCCATCACTGAGAACCAGACGTGGAGTCTAGAGGATATGCCACCGGGACACCGAGCCATAGGGCTCAAATGGGTCTTCAAACTGAAGCGCAACGAAAAAGGAGAAGTTGTGAAGCACAAGGCTCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGACTTCGAAGAGGTATTTGCGCCAGTAGCAAGGTTAGAATCCGTTCGTTTCTTGCTAGCAATTGCAGCACATCACTCTTGGGAGGTTCACCATATGGACGTAAAGTCTGCTTTCCTTAACGGAGAGTTGAAGGAGACCGTCTATGTTCGACAACCACCTGGCTTCCTGGATAACGACAACCCTAATAAGGTTCTGCGCCTGCACAAAGCACTCTACGGGCTTCGACAAGCCCCACGAGCCTGGAACGCGAAGCTCGACGGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGAGCATGGCATGTACACGTACGGCCACGGGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGA
BLAST of CmoCh05G002090 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 414.8 bits (1065), Expect = 1.8e-114
Identity = 257/703 (36.56%), Postives = 376/703 (53.49%), Query Frame = 1

Query: 2   TGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLV 61
           T  R  F    +G  GTVK G+ S  +I G G I   +  G    L DV  +P L+ NL+
Sbjct: 305 TPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLI 364

Query: 62  SLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEVS 121
           S   LD  G          ++      +     R T  LY    EI Q   L+A  +E+S
Sbjct: 365 SGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGT--LYRTNAEICQG-ELNAAQDEIS 424

Query: 122 WR-WHARYGHLNFPALEKLQKKELVHGLPEIKGVN-KLCDGCLIGKQRRTPFPSRTAYRA 181
              WH R GH++   L+ L KK L+      KG   K CD CL GKQ R  F + ++ R 
Sbjct: 425 VDLWHKRMGHMSEKGLQILAKKSLIS---YAKGTTVKPCDYCLFGKQHRVSFQT-SSERK 484

Query: 182 DEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAE 241
              L+LV+ D+CGP++  + GG   F+  +DD SR +W+ +L+ K +  +  ++  A  E
Sbjct: 485 LNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVE 544

Query: 242 AECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTAR 301
            E  +K++ LR+D GGE+TS  F +YC   GI+   T P +PQ NGV ER N+TIV   R
Sbjct: 545 RETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVR 604

Query: 302 SLLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYM 361
           S+L  A +P  FWGEAV TA YL+NRSP+  L  + P   W NK+ +  H +VFGC A+ 
Sbjct: 605 SMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFA 664

Query: 362 KVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIE 421
            V +    KLD + +  +FIGY      YRL+DPV  +   SRDVVF E+      D+ E
Sbjct: 665 HVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADMSE 724

Query: 422 ADRD---PNQFTVEYLVTEPEEGGAQHQETSPPPAGAPPEPVEFATPRTADSTLDADHDT 481
             ++   PN  T+      P    +   E S    G  P  V     +  +   + +H T
Sbjct: 725 KVKNGIIPNFVTIPSTSNNPTSAESTTDEVS--EQGEQPGEVIEQGEQLDEGVEEVEHPT 784

Query: 482 DLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEA----EKNPCWRKAM 541
             E +++ +       E P + +R       +      EP +  E     EKN    KAM
Sbjct: 785 QGEEQHQPL----RRSERPRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQL-MKAM 844

Query: 542 QEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVD 601
           QEEM S+ +N T+ L ++P G R +  KWVFKLK++   ++V++KARLV KG+ QK+G+D
Sbjct: 845 QEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGID 904

Query: 602 FEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNP 661
           F+E+F+PV ++ S+R +L++AA    EV  +DVK+AFL+G+L+E +Y+ QP GF      
Sbjct: 905 FDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKK 964

Query: 662 NKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMY 696
           + V +L+K+LYGL+QAPR W  K D  + S  + +  S+  +Y
Sbjct: 965 HMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVY 993

BLAST of CmoCh05G002090 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 233.0 bits (593), Expect = 9.9e-60
Identity = 134/395 (33.92%), Postives = 209/395 (52.91%), Query Frame = 1

Query: 27  VEIEGRGTILFISKGGEHR-------KLTDVYFIPRLKANLVSLGQLDETGCFISIERGL 86
           + +  +G  ++ +K G  R        L DV F      NL+S+ +L E G  I  ++  
Sbjct: 318 IAVAKQGEFIYATKRGIVRLRNDHEITLEDVLFCKEAAGNLMSVKRLQEAGMSIEFDKSG 377

Query: 87  LKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKL 146
           + I  N   ++  +    N + V+     Q  S++AK +     WH R+GH++   L ++
Sbjct: 378 VTISKNGLMVVKNSGMLNN-VPVINF---QAYSINAKHKNNFRLWHERFGHISDGKLLEI 437

Query: 147 QKKELVHGLPEIKGVN---KLCDGCLIGKQRRTPFPS-RTAYRADEPLELVHGDICGPIK 206
           ++K +      +  +    ++C+ CL GKQ R PF   +       PL +VH D+CGPI 
Sbjct: 438 KRKNMFSDQSLLNNLELSCEICEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPIT 497

Query: 207 PATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGG 266
           P T   K+ F++ VD  + +    L++ KS+     +   A++EA    K+  L  D G 
Sbjct: 498 PVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGR 557

Query: 267 EFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFWGEA 326
           E+ S    ++C + GI  HLT P++PQ NGV ER  +TI   AR+++  A +   FWGEA
Sbjct: 558 EYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEA 617

Query: 327 VMTAVYLLNRSPTRSL--DGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRG 386
           V+TA YL+NR P+R+L    KTPYE W+NKKP + H RVFG   Y+ +      K D + 
Sbjct: 618 VLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVHIKNKQ-GKFDDKS 677

Query: 387 LKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDE 409
            K +F+GYEP    ++L+D V  +  V+RDVV DE
Sbjct: 678 FKSIFVGYEP--NGFKLWDAVNEKFIVARDVVVDE 705

BLAST of CmoCh05G002090 vs. Swiss-Prot
Match: YJ41B_YEAST (Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-J PE=3 SV=3)

HSP 1 Score: 90.1 bits (222), Expect = 1.0e-16
Identity = 80/278 (28.78%), Postives = 123/278 (44.24%), Query Frame = 1

Query: 125 HARYGHLNFPALEK-LQKKELVHGLPEIKGVNKL-CDGCLIGK--QRRTPFPSRTAYRAD 184
           H R GH     +E  ++       L  IK  N+  C  C I K  +R     S   +  D
Sbjct: 562 HKRMGHTGIQQIENSIKHNHYEESLDLIKEPNEFWCQTCKISKATKRNHYTGSMNNHSTD 621

Query: 185 -EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRF-MWLTLLQAKSEAAEA-VKRIKAR 244
            EP      DI GP+  +    K   L++VD+ +R+ M  T     +E   A V++    
Sbjct: 622 HEPGSSWCMDIFGPVSSSNADTKRYMLIMVDNNTRYCMTSTHFNKNAETILAQVRKNIQY 681

Query: 245 AEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGT 304
            E + ++K+R + +DRG EFT+    +Y    GI   LT+      NG  ER  +TI+  
Sbjct: 682 VETQFDRKVREINSDRGTEFTNDQIEEYFISKGIHHILTSTQDHAANGRAERYIRTIITD 741

Query: 305 ARSLLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVH--HFRVFGC 364
           A +LL  + +  +FW  AV +A  + N    +S  GK P +A   +  TV    F  FG 
Sbjct: 742 ATTLLRQSNLRVKFWEYAVTSATNIRNYLEHKS-TGKLPLKAISRQPVTVRLMSFLPFGE 801

Query: 365 VAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDP 394
                +   +  KL P GL  + +  +P S  Y+ + P
Sbjct: 802 KGI--IWNHNHKKLKPSGLPSIILCKDPNSYGYKFFIP 836

BLAST of CmoCh05G002090 vs. Swiss-Prot
Match: YP41B_YEAST (Transposon Ty4-P Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-P PE=5 SV=2)

HSP 1 Score: 89.7 bits (221), Expect = 1.4e-16
Identity = 79/279 (28.32%), Postives = 122/279 (43.73%), Query Frame = 1

Query: 125 HARYGHLNFPALEK-LQKKELVHGLPEIKGVNKL-CDGCLIGK--QRRTPFPSRTAYRAD 184
           H R GH     +E  ++       L  IK  N+  C  C I K  +R     S   +  D
Sbjct: 561 HKRMGHTGIQQIENSIKHNHYEESLDLIKEPNEFWCQTCKISKATKRNHYTGSMNNHSTD 620

Query: 185 -EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKAR-- 244
            EP      DI GP+  +    K   L++VD+ +R+  +T       A   + +I+    
Sbjct: 621 HEPGSSWCMDIFGPVSSSNADTKRYMLIMVDNNTRYC-MTSTHFNKNAETILAQIRKNIQ 680

Query: 245 -AEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVG 304
             E + ++K+R + +DRG EFT+    +Y    GI   LT+      NG  ER  +TIV 
Sbjct: 681 YVETQFDRKVREINSDRGTEFTNDQIEEYFISKGIHHILTSTQDHAANGRAERYIRTIVT 740

Query: 305 TARSLLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVH--HFRVFG 364
            A +LL  + +  +FW  AV +A  + N    +S  GK P +A   +  TV    F  FG
Sbjct: 741 DATTLLRQSNLRVKFWEYAVTSATNIRNCLEHKS-TGKLPLKAISRQPVTVRLMSFLPFG 800

Query: 365 CVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDP 394
                 +   +  KL P GL  + +  +P S  Y+ + P
Sbjct: 801 EKGI--IWNHNHKKLKPSGLPSIILCKDPNSYGYKFFIP 835

BLAST of CmoCh05G002090 vs. Swiss-Prot
Match: YH41B_YEAST (Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-H PE=3 SV=1)

HSP 1 Score: 89.7 bits (221), Expect = 1.4e-16
Identity = 79/279 (28.32%), Postives = 122/279 (43.73%), Query Frame = 1

Query: 125 HARYGHLNFPALEK-LQKKELVHGLPEIKGVNKL-CDGCLIGK--QRRTPFPSRTAYRAD 184
           H R GH     +E  ++       L  IK  N+  C  C I K  +R     S   +  D
Sbjct: 561 HKRMGHTGIQQIENSIKHNHYEESLDLIKEPNEFWCQTCKISKATKRNHYTGSMNNHSTD 620

Query: 185 -EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKAR-- 244
            EP      DI GP+  +    K   L++VD+ +R+  +T       A   + +I+    
Sbjct: 621 HEPGSSWCMDIFGPVSSSNADTKRYMLIMVDNNTRYC-MTSTHFNKNAETILAQIRKNIQ 680

Query: 245 -AEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVG 304
             E + ++K+R + +DRG EFT+    +Y    GI   LT+      NG  ER  +TIV 
Sbjct: 681 YVETQFDRKVREINSDRGTEFTNDQIEEYFISKGIHHILTSTQDHAANGRAERYIRTIVT 740

Query: 305 TARSLLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVH--HFRVFG 364
            A +LL  + +  +FW  AV +A  + N    +S  GK P +A   +  TV    F  FG
Sbjct: 741 DATTLLRQSNLRVKFWEYAVTSATNIRNCLEHKS-TGKLPLKAISRQPVTVRLMSFLPFG 800

Query: 365 CVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDP 394
                 +   +  KL P GL  + +  +P S  Y+ + P
Sbjct: 801 EKGI--IWNHNHKKLKPSGLPSIILCKDPNSYGYKFFIP 835

BLAST of CmoCh05G002090 vs. TrEMBL
Match: Q7XPB1_ORYSJ (OSJNBb0026E15.10 protein OS=Oryza sativa subsp. japonica GN=OSJNBb0026E15.10 PE=4 SV=2)

HSP 1 Score: 905.2 bits (2338), Expect = 5.0e-260
Identity = 441/715 (61.68%), Postives = 540/715 (75.52%), Query Frame = 1

Query: 1    MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
            MTG+RSAF+ELD+ + GTV+FGDGSVV IEGR T+LF  + GEHR +  VY+IPRL AN+
Sbjct: 405  MTGSRSAFAELDTAVTGTVRFGDGSVVRIEGRVTVLFSCRFGEHRGIAGVYYIPRLTANI 464

Query: 61   VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
            VSLGQLD +G  + I  G+L + D +  LL + RR+ + LY ++L+ID+PV L+A++ E 
Sbjct: 465  VSLGQLDRSGSKVLIHHGILHVWDPRGHLLVRVRRSDDCLYTIKLDIDRPVCLAARSAEP 524

Query: 121  SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
            +WRWHARYGHLNFPAL KL ++E+V GLP ++ V ++CDGCL+GKQRR  FP+++ YRAD
Sbjct: 525  AWRWHARYGHLNFPALRKLAQQEMVRGLPLLQQVTQVCDGCLLGKQRRAAFPTQSKYRAD 584

Query: 181  EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
            E LELVHGD+CGPI+PATP G   FLLLVDD SR+MWLT++++K EAA A+K  +ARAE 
Sbjct: 585  EHLELVHGDLCGPIEPATPAGNRYFLLLVDDMSRYMWLTMIRSKDEAANAIKHFQARAEV 644

Query: 241  ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
            E  +K+R LR DRG EFTS  F +YC  +G+ R LTAPYSPQQNGVVERRNQTIV TARS
Sbjct: 645  ESGRKLRALRMDRGSEFTSIEFGEYCANLGVGRQLTAPYSPQQNGVVERRNQTIVATARS 704

Query: 301  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
            ++   G+PGRFWGEA+ TAV+LLNRSPT+SLD +TPYEAWY + P VH  R FGCV ++K
Sbjct: 705  MMKAKGVPGRFWGEAMSTAVFLLNRSPTKSLDNQTPYEAWYGQWPAVHFLRTFGCVGHVK 764

Query: 361  VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVI-E 420
            +T+P L KLD R   +V +GYE GSKAYRLYDPV  R HVSRDVVFDE+  W W  V  +
Sbjct: 765  ITKPGLKKLDDRSAPMVLLGYEQGSKAYRLYDPVSERVHVSRDVVFDEDIAWDWGPVTPD 824

Query: 421  ADRDPNQFTVEYLVT--------------EPEEGGAQHQETSPPPAGAPPEPVEFATPRT 480
                   FTVE +VT               P         T  PP+   PE VEF TP T
Sbjct: 825  GAPQLEPFTVEQVVTTTIGTAPASSPTPPSPPSPAPSAPTTPAPPSPPSPEAVEFVTPPT 884

Query: 481  ADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKN 540
             DS LDAD D D+  RYR +D+L+G   PPG A R LE++ ELH VSADEP + AEAE +
Sbjct: 885  QDSILDADADDDVVPRYRLVDNLLGNASPPGHAPRVLEQL-ELHVVSADEPASLAEAEAD 944

Query: 541  PCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGY 600
            P WR AMQ+E+ +I +N TWSL D+P GHRAIGLKWV+KLKR+E+G +V++KARLVAKGY
Sbjct: 945  PSWRGAMQDELNAIVDNDTWSLTDLPHGHRAIGLKWVYKLKRDEQGAIVRYKARLVAKGY 1004

Query: 601  VQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPG 660
            VQ+QGVDF+EVFA VARLESVR LLA+AAH  W+VHHMDVKSAFLNGEL E VYV QPPG
Sbjct: 1005 VQRQGVDFDEVFALVARLESVRLLLAVAAHQGWQVHHMDVKSAFLNGELLEEVYVSQPPG 1064

Query: 661  FLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG 701
            F+D+++ NKV RLHKALYGLRQAPRAWNAKLD +LLSL F R +SEHG+YT   G
Sbjct: 1065 FVDDNHKNKVYRLHKALYGLRQAPRAWNAKLDSSLLSLGFHRSSSEHGVYTRTRG 1118

BLAST of CmoCh05G002090 vs. TrEMBL
Match: A0B9X7_ORYSA (OSIGBa0135C09.3 protein OS=Oryza sativa GN=OSIGBa0135C09.3 PE=4 SV=1)

HSP 1 Score: 873.2 bits (2255), Expect = 2.1e-250
Identity = 431/716 (60.20%), Postives = 526/716 (73.46%), Query Frame = 1

Query: 1    MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
            MTG+RSAF++LD+ + GTV+FGDGSVV IEGRGT+LF  + GEHR +  VY+IPRL AN+
Sbjct: 338  MTGSRSAFAKLDTAVTGTVRFGDGSVVRIEGRGTVLFSCRFGEHRGIAGVYYIPRLTANI 397

Query: 61   VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
            VSLGQLD +G  + I  G+L++ D +  LL + RR+ + LY ++L ID+PV L+A++ + 
Sbjct: 398  VSLGQLDRSGSKVLIHHGVLRVWDPRGHLLVRVRRSDDCLYTIKLNIDRPVYLAARSAKP 457

Query: 121  SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
            +WRWHARYGHLNFP+L KL ++E+V GLP ++ V ++CDGCL+GKQRR  FP+++ YRAD
Sbjct: 458  AWRWHARYGHLNFPSLRKLAQQEMVRGLPLLQQVTQVCDGCLLGKQRRAAFPTQSKYRAD 517

Query: 181  EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
            E LELVHGD+CGPI+PATP G   FLLLVDD SR+MWLTL+++K EAA A+K  +A AE 
Sbjct: 518  EHLELVHGDLCGPIEPATPAGNRYFLLLVDDMSRYMWLTLIRSKDEAANAIKHFQAHAEV 577

Query: 241  ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
            E  +K+R LRTDRGGEFTS  F +YC  + + R LTAPYSPQQNGVVERRNQTIV TARS
Sbjct: 578  ESGRKLRALRTDRGGEFTSIEFGEYCANLRVGRQLTAPYSPQQNGVVERRNQTIVATARS 637

Query: 301  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
            ++   G+PGRFWGEA+ TAV+LLNRSPT+SLD +TPYEAWY ++P VH  R FGCV ++K
Sbjct: 638  MMKAKGVPGRFWGEAMSTAVFLLNRSPTKSLDNQTPYEAWYGQRPAVHFLRTFGCVGHVK 697

Query: 361  VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIEA 420
            +T+P L KLD R   +V +GYE GSKAYRLYDPV  R HVSRDVVFDE+  W W   +  
Sbjct: 698  ITKPGLKKLDDRSAPMVLLGYEQGSKAYRLYDPVSERVHVSRDVVFDEDAAWDWGP-LTP 757

Query: 421  DRDP--NQFTVEYLVT--------------EPEEGGAQHQETSPPPAGAPPEPVEFATPR 480
            D  P    FTVE +VT               P         T  PP+   PE VEF TP 
Sbjct: 758  DGAPQLEPFTVEQVVTTTIGTAPASSLTPPSPPSPAPSAPTTPAPPSPPSPEAVEFVTPP 817

Query: 481  TADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEK 540
            T DS LDAD D D+  RY  +D+L+G   PPG A R LE++ ELH VSADEP + AEAE 
Sbjct: 818  TQDSILDADADDDVVPRYHLVDNLLGNASPPGHAPRVLEQL-ELHVVSADEPASLAEAEA 877

Query: 541  NPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKG 600
            +P WR AMQ+E+ +I +N TWSL D+P GHRAIGLKW                ARLVAKG
Sbjct: 878  DPNWRGAMQDELNAIVDNDTWSLTDLPHGHRAIGLKW----------------ARLVAKG 937

Query: 601  YVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPP 660
            YVQ+QGVDF+EVFAPVARLE VR LLAIAAH  W+VHHMDVKSAFLNGEL E VYV QPP
Sbjct: 938  YVQRQGVDFDEVFAPVARLELVRLLLAIAAHQGWQVHHMDVKSAFLNGELLEEVYVSQPP 997

Query: 661  GFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG 701
            GF+D+++ NKV RLHKALYGLRQAPRAWN KLD +LLSL F R +SEHG+YT   G
Sbjct: 998  GFVDDNHKNKVYRLHKALYGLRQAPRAWNTKLDSSLLSLGFHRSSSEHGVYTRTRG 1035

BLAST of CmoCh05G002090 vs. TrEMBL
Match: Q0J5Y3_ORYSJ (Os08g0389500 protein OS=Oryza sativa subsp. japonica GN=Os08g0389500 PE=4 SV=1)

HSP 1 Score: 854.4 bits (2206), Expect = 1.0e-244
Identity = 433/733 (59.07%), Postives = 526/733 (71.76%), Query Frame = 1

Query: 1    MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
            MTG RSAFSEL++GIRGTVKFGDGSVV IEGRGT+LF  K GEH+ L  VY IPRL  N+
Sbjct: 399  MTGTRSAFSELNTGIRGTVKFGDGSVVGIEGRGTVLFKCKDGEHQALEGVYHIPRLTTNI 458

Query: 61   VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
            VSLGQLDE     S E G+LKI + QRRLL +  R+ NRLYV++L I +PV L+A+  ++
Sbjct: 459  VSLGQLDEEKFKWSCEDGVLKIWNKQRRLLAKVVRSPNRLYVVKLNIGRPVCLAAQGGDI 518

Query: 121  SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
            +WRWHAR+GHLNF ALEKL +  +V GLP I  V+++CD CL+GKQRR PFPS+  YRA 
Sbjct: 519  AWRWHARFGHLNFRALEKLGRAVMVRGLPLINHVDQVCDSCLVGKQRRLPFPSKAKYRAK 578

Query: 181  EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
            E LELVHGDICGP+ PATP G  LFLLLVDD SR+MWL LL +K +A+ A+KR  A AEA
Sbjct: 579  EKLELVHGDICGPVTPATPSGNKLFLLLVDDLSRYMWLILLSSKDQASVAIKRFLACAEA 638

Query: 241  ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
            E  +K+R LRTDRGGEFT+ +F++YC E GIQRHLTAPY+PQQNGVVERRNQT++G ARS
Sbjct: 639  EAGRKLRTLRTDRGGEFTAHAFAEYCAEHGIQRHLTAPYTPQQNGVVERRNQTVMGMARS 698

Query: 301  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
            ++    +PG FWGEAV TAV+LLNR+PT+ +DGKTP+E W+  KP VH  R FGCVA++K
Sbjct: 699  MMKAKSLPGWFWGEAVNTAVFLLNRAPTQCVDGKTPFEVWHGVKPPVHFLRTFGCVAHVK 758

Query: 361  VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIEA 420
                 LAKLD R + +VF+GYE G+KAYR Y+PV  R HVSRD VF+E   W+W     A
Sbjct: 759  NGGQRLAKLDDRSMPMVFVGYEAGTKAYRFYNPVSRRVHVSRDAVFEEERSWEWGAEKGA 818

Query: 421  --DRDPNQFTVEYLVTEPE-EGG-------AQHQETSPP--------------------- 480
              D D   F VE+L T P  +GG       A  + TS P                     
Sbjct: 819  GPDDDIEPFVVEHLATGPTGQGGPVAATPTATQRSTSAPAPMAPPATPSQAGTPTHGAGP 878

Query: 481  --PAGAPPEPVEFATPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAE 540
              PA A    +EFA+P   D  LD DHD D+  R+R +D+L+G   PPGLA RE+ E   
Sbjct: 879  RTPASASSPAIEFASPPQGDLDLDNDHDDDVPLRFRTVDNLLGASSPPGLAEREVTE--G 938

Query: 541  LHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKR 600
            L     DEP T  EA++   WR+AM EEM SI  N+TWSL ++P G RAIGLKWVFK+K+
Sbjct: 939  LMVAIEDEPATAEEAKQVKEWREAMIEEMASIEHNKTWSLVELPAGQRAIGLKWVFKIKK 998

Query: 601  NEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKS 660
            +E G + KHKARLVAKGYVQ+QG+D+EEVFAPVAR+ESVR LLA+AAH SW VHHMDVKS
Sbjct: 999  DEHGNITKHKARLVAKGYVQRQGIDYEEVFAPVARIESVRVLLAVAAHRSWSVHHMDVKS 1058

Query: 661  AFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKR 701
            AFLNG+L E VYV+QPPGF+   +  KVL+LHKALYGL+QAPRAWN+KLD +LL L F R
Sbjct: 1059 AFLNGDLAEEVYVQQPPGFVAAGHERKVLKLHKALYGLKQAPRAWNSKLDSSLLMLGFAR 1118

BLAST of CmoCh05G002090 vs. TrEMBL
Match: B8BH06_ORYSI (Uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_33720 PE=4 SV=1)

HSP 1 Score: 829.3 bits (2141), Expect = 3.5e-237
Identity = 414/723 (57.26%), Postives = 514/723 (71.09%), Query Frame = 1

Query: 1   MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
           MTG+R AF++LD+ I G V+ GDGSVV I GRGTILF  K GEHR L++ Y++PRL AN+
Sbjct: 28  MTGSRMAFADLDTNITGNVRLGDGSVVRIAGRGTILFACKNGEHRTLSNTYYLPRLAANI 87

Query: 61  VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
           +S+GQLDETG  +  E G++++ D QRRLL +  RT  RLY+L++ + +PV L+A  +E 
Sbjct: 88  ISIGQLDETGFKVLAEDGIMRVWDEQRRLLARIPRTPGRLYMLDINLARPVCLAAHADED 147

Query: 121 SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
           +WRWHAR GH+NF  L K+ K+ELV GLP +  V+++C+ CL GK RR+PFP +   R+D
Sbjct: 148 AWRWHARLGHINFRVLCKMGKEELVRGLPCLSQVDQVCEACLAGKHRRSPFPRQALCRSD 207

Query: 181 EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
           EPL L+HGD+CGPI PATP G   FLLLVDD SR+MW+ LL  K  A  A+KRI+A AE 
Sbjct: 208 EPLALLHGDLCGPITPATPSGNRYFLLLVDDYSRYMWVALLSTKDAAPAAIKRIQAAAER 267

Query: 241 ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
           +  +K+R LRTDRGGEFTS  F++YC E+G++R LTAPYSPQQNGVVERRNQ++VGTARS
Sbjct: 268 KSGRKLRALRTDRGGEFTSTQFAEYCAELGMRRELTAPYSPQQNGVVERRNQSVVGTARS 327

Query: 301 LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
           +L   G+PG FWGEA+ TAVYLLNRS ++ + GKTPY  W    P VHH R FGCVA++K
Sbjct: 328 MLKAKGLPGMFWGEAINTAVYLLNRSSSKGIGGKTPYALWNGVPPAVHHLRTFGCVAHVK 387

Query: 361 VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIEA 420
            T P+L KLD R   ++F+GYEPGSKAYR YDP   R H+SRD+VFDE   W W+    A
Sbjct: 388 TTTPNLKKLDDRSRPMIFVGYEPGSKAYRAYDPATRRVHISRDIVFDEAAQWDWDAEAAA 447

Query: 421 DRDPNQFTVEY-LVTEPEEGGAQHQETSPPPAGAPPEP---------------------V 480
           D D   F VEY  V  P       Q+   PPA +   P                     V
Sbjct: 448 DLD-TDFVVEYTTVYHPGSLSGTRQDAWEPPARSSSSPRTPSDSPTAGRTPSVHGDAPAV 507

Query: 481 EFATPRT-ADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPN 540
           EF +P T A + LDADHD D   R+R MD+++G    PGLA RE++E  EL  VS +EP 
Sbjct: 508 EFVSPPTGAAANLDADHD-DAPLRFRTMDNVLGPAMLPGLANREVQE--ELMMVSGEEPA 567

Query: 541 TFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHK 600
           TFA+AE++  WR+AM +E++SI EN+TW L D+P GHR IGLKWV+KLK++ +G VVKHK
Sbjct: 568 TFAQAERDEDWRRAMLDEISSIEENKTWRLVDLPSGHRPIGLKWVYKLKKDAQGVVVKHK 627

Query: 601 ARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKET 660
           ARLVAKGYVQ+ G+DF+EVFAPVARL+SVR LLA+AA   W VHHMDVKSAFLNGEL E 
Sbjct: 628 ARLVAKGYVQRAGIDFDEVFAPVARLDSVRLLLALAAQEGWMVHHMDVKSAFLNGELIEE 687

Query: 661 VYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTY 701
           VYV QPPGF  +   NKV RL KALYGLRQAPRAWN KLD TL  L FK+   EHG+Y  
Sbjct: 688 VYVVQPPGFEIDGQENKVYRLDKALYGLRQAPRAWNTKLDCTLKKLGFKQSPLEHGLYAR 746

BLAST of CmoCh05G002090 vs. TrEMBL
Match: Q7XEA3_ORYSJ (Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. japonica GN=LOC_Os10g29420 PE=4 SV=2)

HSP 1 Score: 828.9 bits (2140), Expect = 4.6e-237
Identity = 413/723 (57.12%), Postives = 514/723 (71.09%), Query Frame = 1

Query: 1    MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
            MTG+R AF++LD+ I G V+ GDGSVV I GRGTILF  K GEHR L++ Y++PRL AN+
Sbjct: 585  MTGSRMAFADLDTNITGNVRLGDGSVVRIAGRGTILFACKNGEHRTLSNTYYLPRLTANI 644

Query: 61   VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
            +S+GQLDETG  +  E G++++ D QRRLL +  RT  RLY+L++ + +PV L+A  +E 
Sbjct: 645  ISIGQLDETGFKVLAEDGIMRVWDEQRRLLARIPRTPGRLYMLDINLARPVCLAAHADED 704

Query: 121  SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
            +WRWHAR GH+NF AL K+ K+ELV GLP +  V+++C+ CL GK RR+PFP +   R+D
Sbjct: 705  AWRWHARLGHINFRALCKMGKEELVRGLPCLSQVDQVCEACLAGKHRRSPFPRQALCRSD 764

Query: 181  EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
            EPL L+HGD+CGPI PATP G   FLLLVDD SR+MW+ LL  K  A  A+KRI+A AE 
Sbjct: 765  EPLALLHGDLCGPITPATPSGNRYFLLLVDDYSRYMWVALLSTKDAAPAAIKRIQAAAER 824

Query: 241  ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
            +  +K+R LRTDRGGEFTS  F++YC E+G++R LTAPYSPQQNGVVERRNQ++VGTARS
Sbjct: 825  KSGRKLRALRTDRGGEFTSTQFAEYCAELGMRRELTAPYSPQQNGVVERRNQSVVGTARS 884

Query: 301  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
            +L   G+PG FWGEA+ TAVYLLNRS ++ + GKTPY  W    P VHH R FGCVA++K
Sbjct: 885  MLKAKGLPGMFWGEAINTAVYLLNRSSSKGIGGKTPYALWNGVPPAVHHLRTFGCVAHVK 944

Query: 361  VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIEA 420
             T P+L KLD R   ++F+GY+PGSKAYR YDP   R H+SRD+VFDE   W W+    A
Sbjct: 945  TTTPNLKKLDDRSRPMIFVGYKPGSKAYRAYDPATRRVHISRDIVFDEAAQWDWDAEAAA 1004

Query: 421  DRDPNQFTVEY-LVTEPEEGGAQHQETSPPPAGAPPEP---------------------V 480
            D D   F VEY  V  P       Q+   PPA +   P                     V
Sbjct: 1005 DLD-TDFVVEYTTVYHPGSLSGTRQDAGEPPARSSSSPRTPSDSPTAGRTPSVHGDALAV 1064

Query: 481  EFATPRT-ADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPN 540
            EF +P T A + LDADHD D   R+R MD+++G    PGLA RE++E  EL  VS +EP 
Sbjct: 1065 EFVSPPTGAAANLDADHD-DAPLRFRTMDNVLGPAMLPGLANREVQE--ELMMVSGEEPA 1124

Query: 541  TFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHK 600
            TF +AE++  WR+AM +E++SI EN+TW L D+P GHR IGLKWV+KLK++ +G VVKHK
Sbjct: 1125 TFGQAERDEDWRRAMLDEISSIEENKTWRLVDLPSGHRPIGLKWVYKLKKDAQGVVVKHK 1184

Query: 601  ARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKET 660
            ARLVAKGYVQ+ G+DF+EVFAPVARL+SVR LLA+AA   W VHHMDVKSAFLNGEL E 
Sbjct: 1185 ARLVAKGYVQRAGIDFDEVFAPVARLDSVRLLLALAAQEGWMVHHMDVKSAFLNGELIEE 1244

Query: 661  VYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTY 701
            VYV QPPGF  +   NKV RL KALYGLRQAPRAWN KLD TL  L FK+   EHG+Y  
Sbjct: 1245 VYVVQPPGFEIDGQENKVYRLDKALYGLRQAPRAWNTKLDCTLKKLGFKQSPLEHGLYAR 1303

BLAST of CmoCh05G002090 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 169.9 bits (429), Expect = 5.8e-42
Identity = 79/184 (42.93%), Postives = 122/184 (66.30%), Query Frame = 1

Query: 513 ADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGE 572
           A EP+T+ EA++   W  AM +E+ ++    TW +  +PP  + IG KWV+K+K N  G 
Sbjct: 83  AKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGT 142

Query: 573 VVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNG 632
           + ++KARLVAKGY Q++G+DF E F+PV +L SV+ +LAI+A +++ +H +D+ +AFLNG
Sbjct: 143 IERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNG 202

Query: 633 ELKETVYVRQPPGFL----DNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRC 692
           +L E +Y++ PPG+     D+  PN V  L K++YGL+QA R W  K   TL+   F + 
Sbjct: 203 DLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQS 262

BLAST of CmoCh05G002090 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 87.4 bits (215), Expect = 3.8e-17
Identity = 42/99 (42.42%), Postives = 64/99 (64.65%), Query Frame = 1

Query: 515 EPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVV 574
           EP +   A K+P W +AMQEE+ +++ N+TW L   P     +G KWVFK K +  G + 
Sbjct: 27  EPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLD 86

Query: 575 KHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIA 614
           + KARLVAKG+ Q++G+ F E ++PV R  ++R +L +A
Sbjct: 87  RLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CmoCh05G002090 vs. TAIR10
Match: ATMG00710.1 (ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein)

HSP 1 Score: 68.9 bits (167), Expect = 1.4e-11
Identity = 36/95 (37.89%), Postives = 52/95 (54.74%), Query Frame = 1

Query: 291 NQTIVGTARSLLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHF 350
           N+TI+   RS+L   G+P  F  +A  TAV+++N+ P+ +++   P E W+   PT  + 
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61

Query: 351 RVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGS 386
           R FGCVAY+        KL PR  K    G E GS
Sbjct: 62  RRFGCVAYIHCDE---GKLKPRAKK----GEEKGS 89

BLAST of CmoCh05G002090 vs. NCBI nr
Match: gi|38344222|emb|CAE03692.2| (OSJNBb0026E15.10 [Oryza sativa Japonica Group])

HSP 1 Score: 905.2 bits (2338), Expect = 7.2e-260
Identity = 441/715 (61.68%), Postives = 540/715 (75.52%), Query Frame = 1

Query: 1    MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
            MTG+RSAF+ELD+ + GTV+FGDGSVV IEGR T+LF  + GEHR +  VY+IPRL AN+
Sbjct: 405  MTGSRSAFAELDTAVTGTVRFGDGSVVRIEGRVTVLFSCRFGEHRGIAGVYYIPRLTANI 464

Query: 61   VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
            VSLGQLD +G  + I  G+L + D +  LL + RR+ + LY ++L+ID+PV L+A++ E 
Sbjct: 465  VSLGQLDRSGSKVLIHHGILHVWDPRGHLLVRVRRSDDCLYTIKLDIDRPVCLAARSAEP 524

Query: 121  SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
            +WRWHARYGHLNFPAL KL ++E+V GLP ++ V ++CDGCL+GKQRR  FP+++ YRAD
Sbjct: 525  AWRWHARYGHLNFPALRKLAQQEMVRGLPLLQQVTQVCDGCLLGKQRRAAFPTQSKYRAD 584

Query: 181  EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
            E LELVHGD+CGPI+PATP G   FLLLVDD SR+MWLT++++K EAA A+K  +ARAE 
Sbjct: 585  EHLELVHGDLCGPIEPATPAGNRYFLLLVDDMSRYMWLTMIRSKDEAANAIKHFQARAEV 644

Query: 241  ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
            E  +K+R LR DRG EFTS  F +YC  +G+ R LTAPYSPQQNGVVERRNQTIV TARS
Sbjct: 645  ESGRKLRALRMDRGSEFTSIEFGEYCANLGVGRQLTAPYSPQQNGVVERRNQTIVATARS 704

Query: 301  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
            ++   G+PGRFWGEA+ TAV+LLNRSPT+SLD +TPYEAWY + P VH  R FGCV ++K
Sbjct: 705  MMKAKGVPGRFWGEAMSTAVFLLNRSPTKSLDNQTPYEAWYGQWPAVHFLRTFGCVGHVK 764

Query: 361  VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVI-E 420
            +T+P L KLD R   +V +GYE GSKAYRLYDPV  R HVSRDVVFDE+  W W  V  +
Sbjct: 765  ITKPGLKKLDDRSAPMVLLGYEQGSKAYRLYDPVSERVHVSRDVVFDEDIAWDWGPVTPD 824

Query: 421  ADRDPNQFTVEYLVT--------------EPEEGGAQHQETSPPPAGAPPEPVEFATPRT 480
                   FTVE +VT               P         T  PP+   PE VEF TP T
Sbjct: 825  GAPQLEPFTVEQVVTTTIGTAPASSPTPPSPPSPAPSAPTTPAPPSPPSPEAVEFVTPPT 884

Query: 481  ADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKN 540
             DS LDAD D D+  RYR +D+L+G   PPG A R LE++ ELH VSADEP + AEAE +
Sbjct: 885  QDSILDADADDDVVPRYRLVDNLLGNASPPGHAPRVLEQL-ELHVVSADEPASLAEAEAD 944

Query: 541  PCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGY 600
            P WR AMQ+E+ +I +N TWSL D+P GHRAIGLKWV+KLKR+E+G +V++KARLVAKGY
Sbjct: 945  PSWRGAMQDELNAIVDNDTWSLTDLPHGHRAIGLKWVYKLKRDEQGAIVRYKARLVAKGY 1004

Query: 601  VQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPG 660
            VQ+QGVDF+EVFA VARLESVR LLA+AAH  W+VHHMDVKSAFLNGEL E VYV QPPG
Sbjct: 1005 VQRQGVDFDEVFALVARLESVRLLLAVAAHQGWQVHHMDVKSAFLNGELLEEVYVSQPPG 1064

Query: 661  FLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG 701
            F+D+++ NKV RLHKALYGLRQAPRAWNAKLD +LLSL F R +SEHG+YT   G
Sbjct: 1065 FVDDNHKNKVYRLHKALYGLRQAPRAWNAKLDSSLLSLGFHRSSSEHGVYTRTRG 1118

BLAST of CmoCh05G002090 vs. NCBI nr
Match: gi|116634828|emb|CAH66352.1| (OSIGBa0135C09.3 [Oryza sativa Indica Group])

HSP 1 Score: 873.2 bits (2255), Expect = 3.0e-250
Identity = 431/716 (60.20%), Postives = 526/716 (73.46%), Query Frame = 1

Query: 1    MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
            MTG+RSAF++LD+ + GTV+FGDGSVV IEGRGT+LF  + GEHR +  VY+IPRL AN+
Sbjct: 338  MTGSRSAFAKLDTAVTGTVRFGDGSVVRIEGRGTVLFSCRFGEHRGIAGVYYIPRLTANI 397

Query: 61   VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
            VSLGQLD +G  + I  G+L++ D +  LL + RR+ + LY ++L ID+PV L+A++ + 
Sbjct: 398  VSLGQLDRSGSKVLIHHGVLRVWDPRGHLLVRVRRSDDCLYTIKLNIDRPVYLAARSAKP 457

Query: 121  SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
            +WRWHARYGHLNFP+L KL ++E+V GLP ++ V ++CDGCL+GKQRR  FP+++ YRAD
Sbjct: 458  AWRWHARYGHLNFPSLRKLAQQEMVRGLPLLQQVTQVCDGCLLGKQRRAAFPTQSKYRAD 517

Query: 181  EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
            E LELVHGD+CGPI+PATP G   FLLLVDD SR+MWLTL+++K EAA A+K  +A AE 
Sbjct: 518  EHLELVHGDLCGPIEPATPAGNRYFLLLVDDMSRYMWLTLIRSKDEAANAIKHFQAHAEV 577

Query: 241  ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
            E  +K+R LRTDRGGEFTS  F +YC  + + R LTAPYSPQQNGVVERRNQTIV TARS
Sbjct: 578  ESGRKLRALRTDRGGEFTSIEFGEYCANLRVGRQLTAPYSPQQNGVVERRNQTIVATARS 637

Query: 301  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
            ++   G+PGRFWGEA+ TAV+LLNRSPT+SLD +TPYEAWY ++P VH  R FGCV ++K
Sbjct: 638  MMKAKGVPGRFWGEAMSTAVFLLNRSPTKSLDNQTPYEAWYGQRPAVHFLRTFGCVGHVK 697

Query: 361  VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIEA 420
            +T+P L KLD R   +V +GYE GSKAYRLYDPV  R HVSRDVVFDE+  W W   +  
Sbjct: 698  ITKPGLKKLDDRSAPMVLLGYEQGSKAYRLYDPVSERVHVSRDVVFDEDAAWDWGP-LTP 757

Query: 421  DRDP--NQFTVEYLVT--------------EPEEGGAQHQETSPPPAGAPPEPVEFATPR 480
            D  P    FTVE +VT               P         T  PP+   PE VEF TP 
Sbjct: 758  DGAPQLEPFTVEQVVTTTIGTAPASSLTPPSPPSPAPSAPTTPAPPSPPSPEAVEFVTPP 817

Query: 481  TADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEK 540
            T DS LDAD D D+  RY  +D+L+G   PPG A R LE++ ELH VSADEP + AEAE 
Sbjct: 818  TQDSILDADADDDVVPRYHLVDNLLGNASPPGHAPRVLEQL-ELHVVSADEPASLAEAEA 877

Query: 541  NPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKG 600
            +P WR AMQ+E+ +I +N TWSL D+P GHRAIGLKW                ARLVAKG
Sbjct: 878  DPNWRGAMQDELNAIVDNDTWSLTDLPHGHRAIGLKW----------------ARLVAKG 937

Query: 601  YVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPP 660
            YVQ+QGVDF+EVFAPVARLE VR LLAIAAH  W+VHHMDVKSAFLNGEL E VYV QPP
Sbjct: 938  YVQRQGVDFDEVFAPVARLELVRLLLAIAAHQGWQVHHMDVKSAFLNGELLEEVYVSQPP 997

Query: 661  GFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG 701
            GF+D+++ NKV RLHKALYGLRQAPRAWN KLD +LLSL F R +SEHG+YT   G
Sbjct: 998  GFVDDNHKNKVYRLHKALYGLRQAPRAWNTKLDSSLLSLGFHRSSSEHGVYTRTRG 1035

BLAST of CmoCh05G002090 vs. NCBI nr
Match: gi|113623687|dbj|BAF23632.1| (Os08g0389500 [Oryza sativa Japonica Group])

HSP 1 Score: 854.4 bits (2206), Expect = 1.5e-244
Identity = 433/733 (59.07%), Postives = 526/733 (71.76%), Query Frame = 1

Query: 1    MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
            MTG RSAFSEL++GIRGTVKFGDGSVV IEGRGT+LF  K GEH+ L  VY IPRL  N+
Sbjct: 399  MTGTRSAFSELNTGIRGTVKFGDGSVVGIEGRGTVLFKCKDGEHQALEGVYHIPRLTTNI 458

Query: 61   VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
            VSLGQLDE     S E G+LKI + QRRLL +  R+ NRLYV++L I +PV L+A+  ++
Sbjct: 459  VSLGQLDEEKFKWSCEDGVLKIWNKQRRLLAKVVRSPNRLYVVKLNIGRPVCLAAQGGDI 518

Query: 121  SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
            +WRWHAR+GHLNF ALEKL +  +V GLP I  V+++CD CL+GKQRR PFPS+  YRA 
Sbjct: 519  AWRWHARFGHLNFRALEKLGRAVMVRGLPLINHVDQVCDSCLVGKQRRLPFPSKAKYRAK 578

Query: 181  EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
            E LELVHGDICGP+ PATP G  LFLLLVDD SR+MWL LL +K +A+ A+KR  A AEA
Sbjct: 579  EKLELVHGDICGPVTPATPSGNKLFLLLVDDLSRYMWLILLSSKDQASVAIKRFLACAEA 638

Query: 241  ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
            E  +K+R LRTDRGGEFT+ +F++YC E GIQRHLTAPY+PQQNGVVERRNQT++G ARS
Sbjct: 639  EAGRKLRTLRTDRGGEFTAHAFAEYCAEHGIQRHLTAPYTPQQNGVVERRNQTVMGMARS 698

Query: 301  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
            ++    +PG FWGEAV TAV+LLNR+PT+ +DGKTP+E W+  KP VH  R FGCVA++K
Sbjct: 699  MMKAKSLPGWFWGEAVNTAVFLLNRAPTQCVDGKTPFEVWHGVKPPVHFLRTFGCVAHVK 758

Query: 361  VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIEA 420
                 LAKLD R + +VF+GYE G+KAYR Y+PV  R HVSRD VF+E   W+W     A
Sbjct: 759  NGGQRLAKLDDRSMPMVFVGYEAGTKAYRFYNPVSRRVHVSRDAVFEEERSWEWGAEKGA 818

Query: 421  --DRDPNQFTVEYLVTEPE-EGG-------AQHQETSPP--------------------- 480
              D D   F VE+L T P  +GG       A  + TS P                     
Sbjct: 819  GPDDDIEPFVVEHLATGPTGQGGPVAATPTATQRSTSAPAPMAPPATPSQAGTPTHGAGP 878

Query: 481  --PAGAPPEPVEFATPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAE 540
              PA A    +EFA+P   D  LD DHD D+  R+R +D+L+G   PPGLA RE+ E   
Sbjct: 879  RTPASASSPAIEFASPPQGDLDLDNDHDDDVPLRFRTVDNLLGASSPPGLAEREVTE--G 938

Query: 541  LHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKR 600
            L     DEP T  EA++   WR+AM EEM SI  N+TWSL ++P G RAIGLKWVFK+K+
Sbjct: 939  LMVAIEDEPATAEEAKQVKEWREAMIEEMASIEHNKTWSLVELPAGQRAIGLKWVFKIKK 998

Query: 601  NEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKS 660
            +E G + KHKARLVAKGYVQ+QG+D+EEVFAPVAR+ESVR LLA+AAH SW VHHMDVKS
Sbjct: 999  DEHGNITKHKARLVAKGYVQRQGIDYEEVFAPVARIESVRVLLAVAAHRSWSVHHMDVKS 1058

Query: 661  AFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKR 701
            AFLNG+L E VYV+QPPGF+   +  KVL+LHKALYGL+QAPRAWN+KLD +LL L F R
Sbjct: 1059 AFLNGDLAEEVYVQQPPGFVAAGHERKVLKLHKALYGLKQAPRAWNSKLDSSLLMLGFAR 1118

BLAST of CmoCh05G002090 vs. NCBI nr
Match: gi|218184581|gb|EEC67008.1| (hypothetical protein OsI_33720 [Oryza sativa Indica Group])

HSP 1 Score: 829.3 bits (2141), Expect = 5.0e-237
Identity = 414/723 (57.26%), Postives = 514/723 (71.09%), Query Frame = 1

Query: 1   MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
           MTG+R AF++LD+ I G V+ GDGSVV I GRGTILF  K GEHR L++ Y++PRL AN+
Sbjct: 28  MTGSRMAFADLDTNITGNVRLGDGSVVRIAGRGTILFACKNGEHRTLSNTYYLPRLAANI 87

Query: 61  VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
           +S+GQLDETG  +  E G++++ D QRRLL +  RT  RLY+L++ + +PV L+A  +E 
Sbjct: 88  ISIGQLDETGFKVLAEDGIMRVWDEQRRLLARIPRTPGRLYMLDINLARPVCLAAHADED 147

Query: 121 SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
           +WRWHAR GH+NF  L K+ K+ELV GLP +  V+++C+ CL GK RR+PFP +   R+D
Sbjct: 148 AWRWHARLGHINFRVLCKMGKEELVRGLPCLSQVDQVCEACLAGKHRRSPFPRQALCRSD 207

Query: 181 EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
           EPL L+HGD+CGPI PATP G   FLLLVDD SR+MW+ LL  K  A  A+KRI+A AE 
Sbjct: 208 EPLALLHGDLCGPITPATPSGNRYFLLLVDDYSRYMWVALLSTKDAAPAAIKRIQAAAER 267

Query: 241 ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
           +  +K+R LRTDRGGEFTS  F++YC E+G++R LTAPYSPQQNGVVERRNQ++VGTARS
Sbjct: 268 KSGRKLRALRTDRGGEFTSTQFAEYCAELGMRRELTAPYSPQQNGVVERRNQSVVGTARS 327

Query: 301 LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
           +L   G+PG FWGEA+ TAVYLLNRS ++ + GKTPY  W    P VHH R FGCVA++K
Sbjct: 328 MLKAKGLPGMFWGEAINTAVYLLNRSSSKGIGGKTPYALWNGVPPAVHHLRTFGCVAHVK 387

Query: 361 VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIEA 420
            T P+L KLD R   ++F+GYEPGSKAYR YDP   R H+SRD+VFDE   W W+    A
Sbjct: 388 TTTPNLKKLDDRSRPMIFVGYEPGSKAYRAYDPATRRVHISRDIVFDEAAQWDWDAEAAA 447

Query: 421 DRDPNQFTVEY-LVTEPEEGGAQHQETSPPPAGAPPEP---------------------V 480
           D D   F VEY  V  P       Q+   PPA +   P                     V
Sbjct: 448 DLD-TDFVVEYTTVYHPGSLSGTRQDAWEPPARSSSSPRTPSDSPTAGRTPSVHGDAPAV 507

Query: 481 EFATPRT-ADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPN 540
           EF +P T A + LDADHD D   R+R MD+++G    PGLA RE++E  EL  VS +EP 
Sbjct: 508 EFVSPPTGAAANLDADHD-DAPLRFRTMDNVLGPAMLPGLANREVQE--ELMMVSGEEPA 567

Query: 541 TFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHK 600
           TFA+AE++  WR+AM +E++SI EN+TW L D+P GHR IGLKWV+KLK++ +G VVKHK
Sbjct: 568 TFAQAERDEDWRRAMLDEISSIEENKTWRLVDLPSGHRPIGLKWVYKLKKDAQGVVVKHK 627

Query: 601 ARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKET 660
           ARLVAKGYVQ+ G+DF+EVFAPVARL+SVR LLA+AA   W VHHMDVKSAFLNGEL E 
Sbjct: 628 ARLVAKGYVQRAGIDFDEVFAPVARLDSVRLLLALAAQEGWMVHHMDVKSAFLNGELIEE 687

Query: 661 VYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTY 701
           VYV QPPGF  +   NKV RL KALYGLRQAPRAWN KLD TL  L FK+   EHG+Y  
Sbjct: 688 VYVVQPPGFEIDGQENKVYRLDKALYGLRQAPRAWNTKLDCTLKKLGFKQSPLEHGLYAR 746

BLAST of CmoCh05G002090 vs. NCBI nr
Match: gi|110289120|gb|AAP53887.2| (retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group])

HSP 1 Score: 828.9 bits (2140), Expect = 6.5e-237
Identity = 413/723 (57.12%), Postives = 514/723 (71.09%), Query Frame = 1

Query: 1    MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
            MTG+R AF++LD+ I G V+ GDGSVV I GRGTILF  K GEHR L++ Y++PRL AN+
Sbjct: 585  MTGSRMAFADLDTNITGNVRLGDGSVVRIAGRGTILFACKNGEHRTLSNTYYLPRLTANI 644

Query: 61   VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
            +S+GQLDETG  +  E G++++ D QRRLL +  RT  RLY+L++ + +PV L+A  +E 
Sbjct: 645  ISIGQLDETGFKVLAEDGIMRVWDEQRRLLARIPRTPGRLYMLDINLARPVCLAAHADED 704

Query: 121  SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
            +WRWHAR GH+NF AL K+ K+ELV GLP +  V+++C+ CL GK RR+PFP +   R+D
Sbjct: 705  AWRWHARLGHINFRALCKMGKEELVRGLPCLSQVDQVCEACLAGKHRRSPFPRQALCRSD 764

Query: 181  EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
            EPL L+HGD+CGPI PATP G   FLLLVDD SR+MW+ LL  K  A  A+KRI+A AE 
Sbjct: 765  EPLALLHGDLCGPITPATPSGNRYFLLLVDDYSRYMWVALLSTKDAAPAAIKRIQAAAER 824

Query: 241  ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
            +  +K+R LRTDRGGEFTS  F++YC E+G++R LTAPYSPQQNGVVERRNQ++VGTARS
Sbjct: 825  KSGRKLRALRTDRGGEFTSTQFAEYCAELGMRRELTAPYSPQQNGVVERRNQSVVGTARS 884

Query: 301  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
            +L   G+PG FWGEA+ TAVYLLNRS ++ + GKTPY  W    P VHH R FGCVA++K
Sbjct: 885  MLKAKGLPGMFWGEAINTAVYLLNRSSSKGIGGKTPYALWNGVPPAVHHLRTFGCVAHVK 944

Query: 361  VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIEA 420
             T P+L KLD R   ++F+GY+PGSKAYR YDP   R H+SRD+VFDE   W W+    A
Sbjct: 945  TTTPNLKKLDDRSRPMIFVGYKPGSKAYRAYDPATRRVHISRDIVFDEAAQWDWDAEAAA 1004

Query: 421  DRDPNQFTVEY-LVTEPEEGGAQHQETSPPPAGAPPEP---------------------V 480
            D D   F VEY  V  P       Q+   PPA +   P                     V
Sbjct: 1005 DLD-TDFVVEYTTVYHPGSLSGTRQDAGEPPARSSSSPRTPSDSPTAGRTPSVHGDALAV 1064

Query: 481  EFATPRT-ADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPN 540
            EF +P T A + LDADHD D   R+R MD+++G    PGLA RE++E  EL  VS +EP 
Sbjct: 1065 EFVSPPTGAAANLDADHD-DAPLRFRTMDNVLGPAMLPGLANREVQE--ELMMVSGEEPA 1124

Query: 541  TFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHK 600
            TF +AE++  WR+AM +E++SI EN+TW L D+P GHR IGLKWV+KLK++ +G VVKHK
Sbjct: 1125 TFGQAERDEDWRRAMLDEISSIEENKTWRLVDLPSGHRPIGLKWVYKLKKDAQGVVVKHK 1184

Query: 601  ARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKET 660
            ARLVAKGYVQ+ G+DF+EVFAPVARL+SVR LLA+AA   W VHHMDVKSAFLNGEL E 
Sbjct: 1185 ARLVAKGYVQRAGIDFDEVFAPVARLDSVRLLLALAAQEGWMVHHMDVKSAFLNGELIEE 1244

Query: 661  VYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTY 701
            VYV QPPGF  +   NKV RL KALYGLRQAPRAWN KLD TL  L FK+   EHG+Y  
Sbjct: 1245 VYVVQPPGFEIDGQENKVYRLDKALYGLRQAPRAWNTKLDCTLKKLGFKQSPLEHGLYAR 1303

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
POLX_TOBAC1.8e-11436.56Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
COPIA_DROME9.9e-6033.92Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3[more]
YJ41B_YEAST1.0e-1628.78Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
YP41B_YEAST1.4e-1628.32Transposon Ty4-P Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
YH41B_YEAST1.4e-1628.32Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Match NameE-valueIdentityDescription
Q7XPB1_ORYSJ5.0e-26061.68OSJNBb0026E15.10 protein OS=Oryza sativa subsp. japonica GN=OSJNBb0026E15.10 PE=... [more]
A0B9X7_ORYSA2.1e-25060.20OSIGBa0135C09.3 protein OS=Oryza sativa GN=OSIGBa0135C09.3 PE=4 SV=1[more]
Q0J5Y3_ORYSJ1.0e-24459.07Os08g0389500 protein OS=Oryza sativa subsp. japonica GN=Os08g0389500 PE=4 SV=1[more]
B8BH06_ORYSI3.5e-23757.26Uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_33720 PE=4 SV=1[more]
Q7XEA3_ORYSJ4.6e-23757.12Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. jap... [more]
Match NameE-valueIdentityDescription
AT4G23160.15.8e-4242.93 cysteine-rich RLK (RECEPTOR-like protein kinase) 8[more]
ATMG00820.13.8e-1742.42ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)[more]
ATMG00710.11.4e-1137.89ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|38344222|emb|CAE03692.2|7.2e-26061.68OSJNBb0026E15.10 [Oryza sativa Japonica Group][more]
gi|116634828|emb|CAH66352.1|3.0e-25060.20OSIGBa0135C09.3 [Oryza sativa Indica Group][more]
gi|113623687|dbj|BAF23632.1|1.5e-24459.07Os08g0389500 [Oryza sativa Japonica Group][more]
gi|218184581|gb|EEC67008.1|5.0e-23757.26hypothetical protein OsI_33720 [Oryza sativa Indica Group][more]
gi|110289120|gb|AAP53887.2|6.5e-23757.12retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Gro... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001584Integrase_cat-core
IPR012337RNaseH-like_sf
IPR013103RVT_2
IPR025724GAG-pre-integrase_dom
Vocabulary: Biological Process
TermDefinition
GO:0015074DNA integration
Vocabulary: Molecular Function
TermDefinition
GO:0003676nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
cellular_component GO:0005739 mitochondrion
cellular_component GO:0009536 plastid
molecular_function GO:0046872 metal ion binding
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh05G002090.1CmoCh05G002090.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 180..294
score: 3.9
IPR001584Integrase, catalytic corePROFILEPS50994INTEGRASEcoord: 178..344
score: 27
IPR012337Ribonuclease H-like domainGENE3DG3DSA:3.30.420.10coord: 182..348
score: 3.7
IPR012337Ribonuclease H-like domainunknownSSF53098Ribonuclease H-likecoord: 181..353
score: 1.64
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 542..700
score: 2.9
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 99..166
score: 5.8
NoneNo IPR availableunknownCoilCoilcoord: 221..241
scor
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 1..700
score:
NoneNo IPR availablePANTHERPTHR11439:SF127SUBFAMILY NOT NAMEDcoord: 1..700
score:
NoneNo IPR availableunknownSSF56672DNA/RNA polymerasescoord: 542..686
score: 8.

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
CmoCh05G002090Silver-seed gourdcarcmoB1047
CmoCh05G002090Silver-seed gourdcarcmoB1097
CmoCh05G002090Cucumber (Chinese Long) v3cmocucB0918
CmoCh05G002090Cucumber (Chinese Long) v3cmocucB0933
CmoCh05G002090Cucumber (Chinese Long) v3cmocucB0945
CmoCh05G002090Watermelon (97103) v2cmowmbB777
CmoCh05G002090Watermelon (97103) v2cmowmbB802
CmoCh05G002090Wax gourdcmowgoB0944
CmoCh05G002090Wax gourdcmowgoB0953
CmoCh05G002090Wax gourdcmowgoB0960
CmoCh05G002090Cucurbita maxima (Rimu)cmacmoB483
CmoCh05G002090Cucurbita moschata (Rifu)cmocmoB033
CmoCh05G002090Cucurbita moschata (Rifu)cmocmoB156
CmoCh05G002090Cucurbita moschata (Rifu)cmocmoB369
CmoCh05G002090Cucurbita moschata (Rifu)cmocmoB481
CmoCh05G002090Cucumber (Gy14) v1cgycmoB0232
CmoCh05G002090Cucumber (Gy14) v1cgycmoB0643
CmoCh05G002090Cucurbita maxima (Rimu)cmacmoB199
CmoCh05G002090Cucurbita maxima (Rimu)cmacmoB662
CmoCh05G002090Cucurbita maxima (Rimu)cmacmoB786
CmoCh05G002090Cucurbita maxima (Rimu)cmacmoB871
CmoCh05G002090Wild cucumber (PI 183967)cmocpiB782
CmoCh05G002090Wild cucumber (PI 183967)cmocpiB807
CmoCh05G002090Cucumber (Chinese Long) v2cmocuB777
CmoCh05G002090Cucumber (Chinese Long) v2cmocuB802
CmoCh05G002090Melon (DHL92) v3.5.1cmomeB718
CmoCh05G002090Melon (DHL92) v3.5.1cmomeB735
CmoCh05G002090Watermelon (Charleston Gray)cmowcgB716
CmoCh05G002090Watermelon (97103) v1cmowmB748
CmoCh05G002090Watermelon (97103) v1cmowmB753
CmoCh05G002090Cucurbita pepo (Zucchini)cmocpeB720
CmoCh05G002090Cucurbita pepo (Zucchini)cmocpeB734
CmoCh05G002090Cucurbita pepo (Zucchini)cmocpeB740
CmoCh05G002090Cucurbita pepo (Zucchini)cmocpeB750
CmoCh05G002090Bottle gourd (USVL1VR-Ls)cmolsiB708
CmoCh05G002090Bottle gourd (USVL1VR-Ls)cmolsiB715
CmoCh05G002090Bottle gourd (USVL1VR-Ls)cmolsiB731
CmoCh05G002090Cucumber (Gy14) v2cgybcmoB238
CmoCh05G002090Cucumber (Gy14) v2cgybcmoB832
CmoCh05G002090Melon (DHL92) v3.6.1cmomedB815
CmoCh05G002090Melon (DHL92) v3.6.1cmomedB826
CmoCh05G002090Melon (DHL92) v3.6.1cmomedB832
CmoCh05G002090Silver-seed gourdcarcmoB0147
CmoCh05G002090Silver-seed gourdcarcmoB0263
CmoCh05G002090Silver-seed gourdcarcmoB0375