CmoCh05G002090.1 (mRNA) Cucurbita moschata (Rifu)

NameCmoCh05G002090.1
TypemRNA
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionTransposon Ty1-BL Gag-Pol polyprotein
LocationCmo_Chr05 : 905025 .. 907163 (-)
Sequence length2139
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACCGGGGCTAGATCTGCGTTCTCCGAGCTCGACTCGGGGATCCGTGGGACGGTGAAATTCGGCGACGGCTCCGTCGTCGAGATCGAAGGGCGCGGCACCATTCTGTTCATCAGTAAGGGAGGCGAGCATCGCAAGCTGACCGACGTCTACTTCATCCCGAGGCTCAAGGCTAACCTTGTGAGCCTGGGTCAACTCGATGAGACAGGTTGCTTCATTTCCATCGAGCGCGGACTACTCAAAATCTGCGATAATCAACGACGGCTGCTCACGCAGGCAAGGCGCACGACAAACCGCCTTTACGTCCTGGAGTTAGAGATAGACCAACCCGTTAGCCTCTCGGCCAAGACCGAAGAGGTATCTTGGAGGTGGCACGCAAGGTACGGACACTTAAACTTTCCTGCCCTAGAAAAGCTACAGAAGAAGGAGTTGGTGCACGGCTTGCCAGAAATCAAAGGCGTGAACAAGCTGTGCGATGGGTGCCTCATCGGCAAACAGAGGCGCACACCCTTTCCGTCCCGAACAGCCTACCGAGCCGATGAGCCATTGGAGCTTGTACACGGCGATATTTGCGGGCCCATCAAGCCGGCGACCCCAGGTGGTAAGAGTCTCTTCCTCCTATTAGTCGATGACAAAAGCCGCTTCATGTGGCTGACCCTGCTGCAAGCGAAAAGTGAGGCGGCAGAGGCGGTTAAGCGCATTAAAGCGCGAGCGGAGGCCGAATGTGAGAAGAAGATGCGAGTGCTGCGTACAGACCGAGGCGGAGAATTCACCTCGGCAAGTTTCAGTAAGTACTGCGACGAGATCGGCATACAACGGCACCTAACGGCGCCCTACTCCCCCCAACAGAACGGAGTGGTAGAGCGCCGAAATCAGACCATTGTCGGGACAGCGAGGTCATTGTTGGTGACGGCCGGGATGCCTGGGAGATTCTGGGGAGAGGCAGTAATGACGGCTGTCTATCTCCTCAATCGGTCACCAACCCGAAGCCTCGACGGAAAGACGCCATATGAGGCCTGGTACAACAAAAAACCAACAGTACATCACTTTCGCGTGTTCGGCTGCGTCGCATACATGAAGGTAACACGTCCCCATCTCGCCAAGCTCGATCCCAGGGGGCTGAAGGTCGTCTTCATCGGCTACGAACCCGGGAGCAAGGCGTACAGACTCTATGATCCTGTAGGGGGGCGAGCTCACGTGTCTCGCGACGTCGTCTTCGACGAAAACACCTTCTGGCAGTGGAATGACGTGATCGAGGCAGACCGTGATCCAAATCAATTCACGGTGGAGTACCTCGTCACCGAGCCTGAAGAAGGAGGAGCCCAGCATCAGGAGACGTCACCGCCGCCAGCAGGTGCACCACCTGAACCAGTGGAATTCGCAACACCACGGACTGCGGATTCGACGCTGGATGCCGATCACGATACTGATCTGGAGGCTAGGTATCGGAGGATGGATGACCTAGTGGGAGGAGGTGAACCACCTGGACTAGCAGCGCGCGAACTCGAAGAAGTGGCCGAACTACATGCTGTCAGTGCAGATGAACCAAACACCTTCGCCGAAGCAGAAAAGAACCCGTGCTGGCGGAAGGCAATGCAGGAGGAGATGACATCCATCACTGAGAACCAGACGTGGAGTCTAGAGGATATGCCACCGGGACACCGAGCCATAGGGCTCAAATGGGTCTTCAAACTGAAGCGCAACGAAAAAGGAGAAGTTGTGAAGCACAAGGCTCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGACTTCGAAGAGGTATTTGCGCCAGTAGCAAGGTTAGAATCCGTTCGTTTCTTGCTAGCAATTGCAGCACATCACTCTTGGGAGGTTCACCATATGGACGTAAAGTCTGCTTTCCTTAACGGAGAGTTGAAGGAGACCGTCTATGTTCGACAACCACCTGGCTTCCTGGATAACGACAACCCTAATAAGGTTCTGCGCCTGCACAAAGCACTCTACGGGCTTCGACAAGCCCCACGAGCCTGGAACGCGAAGCTCGACGGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGAGCATGGCATGTACACGTACGGCCACGGGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGA

mRNA sequence

ATGACCGGGGCTAGATCTGCGTTCTCCGAGCTCGACTCGGGGATCCGTGGGACGGTGAAATTCGGCGACGGCTCCGTCGTCGAGATCGAAGGGCGCGGCACCATTCTGTTCATCAGTAAGGGAGGCGAGCATCGCAAGCTGACCGACGTCTACTTCATCCCGAGGCTCAAGGCTAACCTTGTGAGCCTGGGTCAACTCGATGAGACAGGTTGCTTCATTTCCATCGAGCGCGGACTACTCAAAATCTGCGATAATCAACGACGGCTGCTCACGCAGGCAAGGCGCACGACAAACCGCCTTTACGTCCTGGAGTTAGAGATAGACCAACCCGTTAGCCTCTCGGCCAAGACCGAAGAGGTATCTTGGAGGTGGCACGCAAGGTACGGACACTTAAACTTTCCTGCCCTAGAAAAGCTACAGAAGAAGGAGTTGGTGCACGGCTTGCCAGAAATCAAAGGCGTGAACAAGCTGTGCGATGGGTGCCTCATCGGCAAACAGAGGCGCACACCCTTTCCGTCCCGAACAGCCTACCGAGCCGATGAGCCATTGGAGCTTGTACACGGCGATATTTGCGGGCCCATCAAGCCGGCGACCCCAGGTGGTAAGAGTCTCTTCCTCCTATTAGTCGATGACAAAAGCCGCTTCATGTGGCTGACCCTGCTGCAAGCGAAAAGTGAGGCGGCAGAGGCGGTTAAGCGCATTAAAGCGCGAGCGGAGGCCGAATGTGAGAAGAAGATGCGAGTGCTGCGTACAGACCGAGGCGGAGAATTCACCTCGGCAAGTTTCAGTAAGTACTGCGACGAGATCGGCATACAACGGCACCTAACGGCGCCCTACTCCCCCCAACAGAACGGAGTGGTAGAGCGCCGAAATCAGACCATTGTCGGGACAGCGAGGTCATTGTTGGTGACGGCCGGGATGCCTGGGAGATTCTGGGGAGAGGCAGTAATGACGGCTGTCTATCTCCTCAATCGGTCACCAACCCGAAGCCTCGACGGAAAGACGCCATATGAGGCCTGGTACAACAAAAAACCAACAGTACATCACTTTCGCGTGTTCGGCTGCGTCGCATACATGAAGGTAACACGTCCCCATCTCGCCAAGCTCGATCCCAGGGGGCTGAAGGTCGTCTTCATCGGCTACGAACCCGGGAGCAAGGCGTACAGACTCTATGATCCTGTAGGGGGGCGAGCTCACGTGTCTCGCGACGTCGTCTTCGACGAAAACACCTTCTGGCAGTGGAATGACGTGATCGAGGCAGACCGTGATCCAAATCAATTCACGGTGGAGTACCTCGTCACCGAGCCTGAAGAAGGAGGAGCCCAGCATCAGGAGACGTCACCGCCGCCAGCAGGTGCACCACCTGAACCAGTGGAATTCGCAACACCACGGACTGCGGATTCGACGCTGGATGCCGATCACGATACTGATCTGGAGGCTAGGTATCGGAGGATGGATGACCTAGTGGGAGGAGGTGAACCACCTGGACTAGCAGCGCGCGAACTCGAAGAAGTGGCCGAACTACATGCTGTCAGTGCAGATGAACCAAACACCTTCGCCGAAGCAGAAAAGAACCCGTGCTGGCGGAAGGCAATGCAGGAGGAGATGACATCCATCACTGAGAACCAGACGTGGAGTCTAGAGGATATGCCACCGGGACACCGAGCCATAGGGCTCAAATGGGTCTTCAAACTGAAGCGCAACGAAAAAGGAGAAGTTGTGAAGCACAAGGCTCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGACTTCGAAGAGGTATTTGCGCCAGTAGCAAGGTTAGAATCCGTTCGTTTCTTGCTAGCAATTGCAGCACATCACTCTTGGGAGGTTCACCATATGGACGTAAAGTCTGCTTTCCTTAACGGAGAGTTGAAGGAGACCGTCTATGTTCGACAACCACCTGGCTTCCTGGATAACGACAACCCTAATAAGGTTCTGCGCCTGCACAAAGCACTCTACGGGCTTCGACAAGCCCCACGAGCCTGGAACGCGAAGCTCGACGGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGAGCATGGCATGTACACGTACGGCCACGGGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGA

Coding sequence (CDS)

ATGACCGGGGCTAGATCTGCGTTCTCCGAGCTCGACTCGGGGATCCGTGGGACGGTGAAATTCGGCGACGGCTCCGTCGTCGAGATCGAAGGGCGCGGCACCATTCTGTTCATCAGTAAGGGAGGCGAGCATCGCAAGCTGACCGACGTCTACTTCATCCCGAGGCTCAAGGCTAACCTTGTGAGCCTGGGTCAACTCGATGAGACAGGTTGCTTCATTTCCATCGAGCGCGGACTACTCAAAATCTGCGATAATCAACGACGGCTGCTCACGCAGGCAAGGCGCACGACAAACCGCCTTTACGTCCTGGAGTTAGAGATAGACCAACCCGTTAGCCTCTCGGCCAAGACCGAAGAGGTATCTTGGAGGTGGCACGCAAGGTACGGACACTTAAACTTTCCTGCCCTAGAAAAGCTACAGAAGAAGGAGTTGGTGCACGGCTTGCCAGAAATCAAAGGCGTGAACAAGCTGTGCGATGGGTGCCTCATCGGCAAACAGAGGCGCACACCCTTTCCGTCCCGAACAGCCTACCGAGCCGATGAGCCATTGGAGCTTGTACACGGCGATATTTGCGGGCCCATCAAGCCGGCGACCCCAGGTGGTAAGAGTCTCTTCCTCCTATTAGTCGATGACAAAAGCCGCTTCATGTGGCTGACCCTGCTGCAAGCGAAAAGTGAGGCGGCAGAGGCGGTTAAGCGCATTAAAGCGCGAGCGGAGGCCGAATGTGAGAAGAAGATGCGAGTGCTGCGTACAGACCGAGGCGGAGAATTCACCTCGGCAAGTTTCAGTAAGTACTGCGACGAGATCGGCATACAACGGCACCTAACGGCGCCCTACTCCCCCCAACAGAACGGAGTGGTAGAGCGCCGAAATCAGACCATTGTCGGGACAGCGAGGTCATTGTTGGTGACGGCCGGGATGCCTGGGAGATTCTGGGGAGAGGCAGTAATGACGGCTGTCTATCTCCTCAATCGGTCACCAACCCGAAGCCTCGACGGAAAGACGCCATATGAGGCCTGGTACAACAAAAAACCAACAGTACATCACTTTCGCGTGTTCGGCTGCGTCGCATACATGAAGGTAACACGTCCCCATCTCGCCAAGCTCGATCCCAGGGGGCTGAAGGTCGTCTTCATCGGCTACGAACCCGGGAGCAAGGCGTACAGACTCTATGATCCTGTAGGGGGGCGAGCTCACGTGTCTCGCGACGTCGTCTTCGACGAAAACACCTTCTGGCAGTGGAATGACGTGATCGAGGCAGACCGTGATCCAAATCAATTCACGGTGGAGTACCTCGTCACCGAGCCTGAAGAAGGAGGAGCCCAGCATCAGGAGACGTCACCGCCGCCAGCAGGTGCACCACCTGAACCAGTGGAATTCGCAACACCACGGACTGCGGATTCGACGCTGGATGCCGATCACGATACTGATCTGGAGGCTAGGTATCGGAGGATGGATGACCTAGTGGGAGGAGGTGAACCACCTGGACTAGCAGCGCGCGAACTCGAAGAAGTGGCCGAACTACATGCTGTCAGTGCAGATGAACCAAACACCTTCGCCGAAGCAGAAAAGAACCCGTGCTGGCGGAAGGCAATGCAGGAGGAGATGACATCCATCACTGAGAACCAGACGTGGAGTCTAGAGGATATGCCACCGGGACACCGAGCCATAGGGCTCAAATGGGTCTTCAAACTGAAGCGCAACGAAAAAGGAGAAGTTGTGAAGCACAAGGCTCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGACTTCGAAGAGGTATTTGCGCCAGTAGCAAGGTTAGAATCCGTTCGTTTCTTGCTAGCAATTGCAGCACATCACTCTTGGGAGGTTCACCATATGGACGTAAAGTCTGCTTTCCTTAACGGAGAGTTGAAGGAGACCGTCTATGTTCGACAACCACCTGGCTTCCTGGATAACGACAACCCTAATAAGGTTCTGCGCCTGCACAAAGCACTCTACGGGCTTCGACAAGCCCCACGAGCCTGGAACGCGAAGCTCGACGGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGAGCATGGCATGTACACGTACGGCCACGGGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGA
BLAST of CmoCh05G002090.1 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 414.8 bits (1065), Expect = 1.8e-114
Identity = 257/703 (36.56%), Postives = 376/703 (53.49%), Query Frame = 1

Query: 2   TGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLV 61
           T  R  F    +G  GTVK G+ S  +I G G I   +  G    L DV  +P L+ NL+
Sbjct: 305 TPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLI 364

Query: 62  SLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEVS 121
           S   LD  G          ++      +     R T  LY    EI Q   L+A  +E+S
Sbjct: 365 SGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGT--LYRTNAEICQG-ELNAAQDEIS 424

Query: 122 WR-WHARYGHLNFPALEKLQKKELVHGLPEIKGVN-KLCDGCLIGKQRRTPFPSRTAYRA 181
              WH R GH++   L+ L KK L+      KG   K CD CL GKQ R  F + ++ R 
Sbjct: 425 VDLWHKRMGHMSEKGLQILAKKSLIS---YAKGTTVKPCDYCLFGKQHRVSFQT-SSERK 484

Query: 182 DEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAE 241
              L+LV+ D+CGP++  + GG   F+  +DD SR +W+ +L+ K +  +  ++  A  E
Sbjct: 485 LNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVE 544

Query: 242 AECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTAR 301
            E  +K++ LR+D GGE+TS  F +YC   GI+   T P +PQ NGV ER N+TIV   R
Sbjct: 545 RETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVR 604

Query: 302 SLLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYM 361
           S+L  A +P  FWGEAV TA YL+NRSP+  L  + P   W NK+ +  H +VFGC A+ 
Sbjct: 605 SMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFA 664

Query: 362 KVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIE 421
            V +    KLD + +  +FIGY      YRL+DPV  +   SRDVVF E+      D+ E
Sbjct: 665 HVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADMSE 724

Query: 422 ADRD---PNQFTVEYLVTEPEEGGAQHQETSPPPAGAPPEPVEFATPRTADSTLDADHDT 481
             ++   PN  T+      P    +   E S    G  P  V     +  +   + +H T
Sbjct: 725 KVKNGIIPNFVTIPSTSNNPTSAESTTDEVS--EQGEQPGEVIEQGEQLDEGVEEVEHPT 784

Query: 482 DLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEA----EKNPCWRKAM 541
             E +++ +       E P + +R       +      EP +  E     EKN    KAM
Sbjct: 785 QGEEQHQPL----RRSERPRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQL-MKAM 844

Query: 542 QEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVD 601
           QEEM S+ +N T+ L ++P G R +  KWVFKLK++   ++V++KARLV KG+ QK+G+D
Sbjct: 845 QEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGID 904

Query: 602 FEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNP 661
           F+E+F+PV ++ S+R +L++AA    EV  +DVK+AFL+G+L+E +Y+ QP GF      
Sbjct: 905 FDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKK 964

Query: 662 NKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMY 696
           + V +L+K+LYGL+QAPR W  K D  + S  + +  S+  +Y
Sbjct: 965 HMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVY 993

BLAST of CmoCh05G002090.1 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 233.0 bits (593), Expect = 9.9e-60
Identity = 134/395 (33.92%), Postives = 209/395 (52.91%), Query Frame = 1

Query: 27  VEIEGRGTILFISKGGEHR-------KLTDVYFIPRLKANLVSLGQLDETGCFISIERGL 86
           + +  +G  ++ +K G  R        L DV F      NL+S+ +L E G  I  ++  
Sbjct: 318 IAVAKQGEFIYATKRGIVRLRNDHEITLEDVLFCKEAAGNLMSVKRLQEAGMSIEFDKSG 377

Query: 87  LKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKL 146
           + I  N   ++  +    N + V+     Q  S++AK +     WH R+GH++   L ++
Sbjct: 378 VTISKNGLMVVKNSGMLNN-VPVINF---QAYSINAKHKNNFRLWHERFGHISDGKLLEI 437

Query: 147 QKKELVHGLPEIKGVN---KLCDGCLIGKQRRTPFPS-RTAYRADEPLELVHGDICGPIK 206
           ++K +      +  +    ++C+ CL GKQ R PF   +       PL +VH D+CGPI 
Sbjct: 438 KRKNMFSDQSLLNNLELSCEICEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPIT 497

Query: 207 PATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGG 266
           P T   K+ F++ VD  + +    L++ KS+     +   A++EA    K+  L  D G 
Sbjct: 498 PVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGR 557

Query: 267 EFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFWGEA 326
           E+ S    ++C + GI  HLT P++PQ NGV ER  +TI   AR+++  A +   FWGEA
Sbjct: 558 EYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEA 617

Query: 327 VMTAVYLLNRSPTRSL--DGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRG 386
           V+TA YL+NR P+R+L    KTPYE W+NKKP + H RVFG   Y+ +      K D + 
Sbjct: 618 VLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVHIKNKQ-GKFDDKS 677

Query: 387 LKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDE 409
            K +F+GYEP    ++L+D V  +  V+RDVV DE
Sbjct: 678 FKSIFVGYEP--NGFKLWDAVNEKFIVARDVVVDE 705

BLAST of CmoCh05G002090.1 vs. Swiss-Prot
Match: YJ41B_YEAST (Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-J PE=3 SV=3)

HSP 1 Score: 90.1 bits (222), Expect = 1.0e-16
Identity = 80/278 (28.78%), Postives = 123/278 (44.24%), Query Frame = 1

Query: 125 HARYGHLNFPALEK-LQKKELVHGLPEIKGVNKL-CDGCLIGK--QRRTPFPSRTAYRAD 184
           H R GH     +E  ++       L  IK  N+  C  C I K  +R     S   +  D
Sbjct: 562 HKRMGHTGIQQIENSIKHNHYEESLDLIKEPNEFWCQTCKISKATKRNHYTGSMNNHSTD 621

Query: 185 -EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRF-MWLTLLQAKSEAAEA-VKRIKAR 244
            EP      DI GP+  +    K   L++VD+ +R+ M  T     +E   A V++    
Sbjct: 622 HEPGSSWCMDIFGPVSSSNADTKRYMLIMVDNNTRYCMTSTHFNKNAETILAQVRKNIQY 681

Query: 245 AEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGT 304
            E + ++K+R + +DRG EFT+    +Y    GI   LT+      NG  ER  +TI+  
Sbjct: 682 VETQFDRKVREINSDRGTEFTNDQIEEYFISKGIHHILTSTQDHAANGRAERYIRTIITD 741

Query: 305 ARSLLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVH--HFRVFGC 364
           A +LL  + +  +FW  AV +A  + N    +S  GK P +A   +  TV    F  FG 
Sbjct: 742 ATTLLRQSNLRVKFWEYAVTSATNIRNYLEHKS-TGKLPLKAISRQPVTVRLMSFLPFGE 801

Query: 365 VAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDP 394
                +   +  KL P GL  + +  +P S  Y+ + P
Sbjct: 802 KGI--IWNHNHKKLKPSGLPSIILCKDPNSYGYKFFIP 836

BLAST of CmoCh05G002090.1 vs. Swiss-Prot
Match: YP41B_YEAST (Transposon Ty4-P Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-P PE=5 SV=2)

HSP 1 Score: 89.7 bits (221), Expect = 1.4e-16
Identity = 79/279 (28.32%), Postives = 122/279 (43.73%), Query Frame = 1

Query: 125 HARYGHLNFPALEK-LQKKELVHGLPEIKGVNKL-CDGCLIGK--QRRTPFPSRTAYRAD 184
           H R GH     +E  ++       L  IK  N+  C  C I K  +R     S   +  D
Sbjct: 561 HKRMGHTGIQQIENSIKHNHYEESLDLIKEPNEFWCQTCKISKATKRNHYTGSMNNHSTD 620

Query: 185 -EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKAR-- 244
            EP      DI GP+  +    K   L++VD+ +R+  +T       A   + +I+    
Sbjct: 621 HEPGSSWCMDIFGPVSSSNADTKRYMLIMVDNNTRYC-MTSTHFNKNAETILAQIRKNIQ 680

Query: 245 -AEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVG 304
             E + ++K+R + +DRG EFT+    +Y    GI   LT+      NG  ER  +TIV 
Sbjct: 681 YVETQFDRKVREINSDRGTEFTNDQIEEYFISKGIHHILTSTQDHAANGRAERYIRTIVT 740

Query: 305 TARSLLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVH--HFRVFG 364
            A +LL  + +  +FW  AV +A  + N    +S  GK P +A   +  TV    F  FG
Sbjct: 741 DATTLLRQSNLRVKFWEYAVTSATNIRNCLEHKS-TGKLPLKAISRQPVTVRLMSFLPFG 800

Query: 365 CVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDP 394
                 +   +  KL P GL  + +  +P S  Y+ + P
Sbjct: 801 EKGI--IWNHNHKKLKPSGLPSIILCKDPNSYGYKFFIP 835

BLAST of CmoCh05G002090.1 vs. Swiss-Prot
Match: YH41B_YEAST (Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-H PE=3 SV=1)

HSP 1 Score: 89.7 bits (221), Expect = 1.4e-16
Identity = 79/279 (28.32%), Postives = 122/279 (43.73%), Query Frame = 1

Query: 125 HARYGHLNFPALEK-LQKKELVHGLPEIKGVNKL-CDGCLIGK--QRRTPFPSRTAYRAD 184
           H R GH     +E  ++       L  IK  N+  C  C I K  +R     S   +  D
Sbjct: 561 HKRMGHTGIQQIENSIKHNHYEESLDLIKEPNEFWCQTCKISKATKRNHYTGSMNNHSTD 620

Query: 185 -EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKAR-- 244
            EP      DI GP+  +    K   L++VD+ +R+  +T       A   + +I+    
Sbjct: 621 HEPGSSWCMDIFGPVSSSNADTKRYMLIMVDNNTRYC-MTSTHFNKNAETILAQIRKNIQ 680

Query: 245 -AEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVG 304
             E + ++K+R + +DRG EFT+    +Y    GI   LT+      NG  ER  +TIV 
Sbjct: 681 YVETQFDRKVREINSDRGTEFTNDQIEEYFISKGIHHILTSTQDHAANGRAERYIRTIVT 740

Query: 305 TARSLLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVH--HFRVFG 364
            A +LL  + +  +FW  AV +A  + N    +S  GK P +A   +  TV    F  FG
Sbjct: 741 DATTLLRQSNLRVKFWEYAVTSATNIRNCLEHKS-TGKLPLKAISRQPVTVRLMSFLPFG 800

Query: 365 CVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDP 394
                 +   +  KL P GL  + +  +P S  Y+ + P
Sbjct: 801 EKGI--IWNHNHKKLKPSGLPSIILCKDPNSYGYKFFIP 835

BLAST of CmoCh05G002090.1 vs. TrEMBL
Match: Q7XPB1_ORYSJ (OSJNBb0026E15.10 protein OS=Oryza sativa subsp. japonica GN=OSJNBb0026E15.10 PE=4 SV=2)

HSP 1 Score: 905.2 bits (2338), Expect = 5.0e-260
Identity = 441/715 (61.68%), Postives = 540/715 (75.52%), Query Frame = 1

Query: 1    MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
            MTG+RSAF+ELD+ + GTV+FGDGSVV IEGR T+LF  + GEHR +  VY+IPRL AN+
Sbjct: 405  MTGSRSAFAELDTAVTGTVRFGDGSVVRIEGRVTVLFSCRFGEHRGIAGVYYIPRLTANI 464

Query: 61   VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
            VSLGQLD +G  + I  G+L + D +  LL + RR+ + LY ++L+ID+PV L+A++ E 
Sbjct: 465  VSLGQLDRSGSKVLIHHGILHVWDPRGHLLVRVRRSDDCLYTIKLDIDRPVCLAARSAEP 524

Query: 121  SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
            +WRWHARYGHLNFPAL KL ++E+V GLP ++ V ++CDGCL+GKQRR  FP+++ YRAD
Sbjct: 525  AWRWHARYGHLNFPALRKLAQQEMVRGLPLLQQVTQVCDGCLLGKQRRAAFPTQSKYRAD 584

Query: 181  EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
            E LELVHGD+CGPI+PATP G   FLLLVDD SR+MWLT++++K EAA A+K  +ARAE 
Sbjct: 585  EHLELVHGDLCGPIEPATPAGNRYFLLLVDDMSRYMWLTMIRSKDEAANAIKHFQARAEV 644

Query: 241  ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
            E  +K+R LR DRG EFTS  F +YC  +G+ R LTAPYSPQQNGVVERRNQTIV TARS
Sbjct: 645  ESGRKLRALRMDRGSEFTSIEFGEYCANLGVGRQLTAPYSPQQNGVVERRNQTIVATARS 704

Query: 301  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
            ++   G+PGRFWGEA+ TAV+LLNRSPT+SLD +TPYEAWY + P VH  R FGCV ++K
Sbjct: 705  MMKAKGVPGRFWGEAMSTAVFLLNRSPTKSLDNQTPYEAWYGQWPAVHFLRTFGCVGHVK 764

Query: 361  VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVI-E 420
            +T+P L KLD R   +V +GYE GSKAYRLYDPV  R HVSRDVVFDE+  W W  V  +
Sbjct: 765  ITKPGLKKLDDRSAPMVLLGYEQGSKAYRLYDPVSERVHVSRDVVFDEDIAWDWGPVTPD 824

Query: 421  ADRDPNQFTVEYLVT--------------EPEEGGAQHQETSPPPAGAPPEPVEFATPRT 480
                   FTVE +VT               P         T  PP+   PE VEF TP T
Sbjct: 825  GAPQLEPFTVEQVVTTTIGTAPASSPTPPSPPSPAPSAPTTPAPPSPPSPEAVEFVTPPT 884

Query: 481  ADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKN 540
             DS LDAD D D+  RYR +D+L+G   PPG A R LE++ ELH VSADEP + AEAE +
Sbjct: 885  QDSILDADADDDVVPRYRLVDNLLGNASPPGHAPRVLEQL-ELHVVSADEPASLAEAEAD 944

Query: 541  PCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGY 600
            P WR AMQ+E+ +I +N TWSL D+P GHRAIGLKWV+KLKR+E+G +V++KARLVAKGY
Sbjct: 945  PSWRGAMQDELNAIVDNDTWSLTDLPHGHRAIGLKWVYKLKRDEQGAIVRYKARLVAKGY 1004

Query: 601  VQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPG 660
            VQ+QGVDF+EVFA VARLESVR LLA+AAH  W+VHHMDVKSAFLNGEL E VYV QPPG
Sbjct: 1005 VQRQGVDFDEVFALVARLESVRLLLAVAAHQGWQVHHMDVKSAFLNGELLEEVYVSQPPG 1064

Query: 661  FLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG 701
            F+D+++ NKV RLHKALYGLRQAPRAWNAKLD +LLSL F R +SEHG+YT   G
Sbjct: 1065 FVDDNHKNKVYRLHKALYGLRQAPRAWNAKLDSSLLSLGFHRSSSEHGVYTRTRG 1118

BLAST of CmoCh05G002090.1 vs. TrEMBL
Match: A0B9X7_ORYSA (OSIGBa0135C09.3 protein OS=Oryza sativa GN=OSIGBa0135C09.3 PE=4 SV=1)

HSP 1 Score: 873.2 bits (2255), Expect = 2.1e-250
Identity = 431/716 (60.20%), Postives = 526/716 (73.46%), Query Frame = 1

Query: 1    MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
            MTG+RSAF++LD+ + GTV+FGDGSVV IEGRGT+LF  + GEHR +  VY+IPRL AN+
Sbjct: 338  MTGSRSAFAKLDTAVTGTVRFGDGSVVRIEGRGTVLFSCRFGEHRGIAGVYYIPRLTANI 397

Query: 61   VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
            VSLGQLD +G  + I  G+L++ D +  LL + RR+ + LY ++L ID+PV L+A++ + 
Sbjct: 398  VSLGQLDRSGSKVLIHHGVLRVWDPRGHLLVRVRRSDDCLYTIKLNIDRPVYLAARSAKP 457

Query: 121  SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
            +WRWHARYGHLNFP+L KL ++E+V GLP ++ V ++CDGCL+GKQRR  FP+++ YRAD
Sbjct: 458  AWRWHARYGHLNFPSLRKLAQQEMVRGLPLLQQVTQVCDGCLLGKQRRAAFPTQSKYRAD 517

Query: 181  EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
            E LELVHGD+CGPI+PATP G   FLLLVDD SR+MWLTL+++K EAA A+K  +A AE 
Sbjct: 518  EHLELVHGDLCGPIEPATPAGNRYFLLLVDDMSRYMWLTLIRSKDEAANAIKHFQAHAEV 577

Query: 241  ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
            E  +K+R LRTDRGGEFTS  F +YC  + + R LTAPYSPQQNGVVERRNQTIV TARS
Sbjct: 578  ESGRKLRALRTDRGGEFTSIEFGEYCANLRVGRQLTAPYSPQQNGVVERRNQTIVATARS 637

Query: 301  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
            ++   G+PGRFWGEA+ TAV+LLNRSPT+SLD +TPYEAWY ++P VH  R FGCV ++K
Sbjct: 638  MMKAKGVPGRFWGEAMSTAVFLLNRSPTKSLDNQTPYEAWYGQRPAVHFLRTFGCVGHVK 697

Query: 361  VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIEA 420
            +T+P L KLD R   +V +GYE GSKAYRLYDPV  R HVSRDVVFDE+  W W   +  
Sbjct: 698  ITKPGLKKLDDRSAPMVLLGYEQGSKAYRLYDPVSERVHVSRDVVFDEDAAWDWGP-LTP 757

Query: 421  DRDP--NQFTVEYLVT--------------EPEEGGAQHQETSPPPAGAPPEPVEFATPR 480
            D  P    FTVE +VT               P         T  PP+   PE VEF TP 
Sbjct: 758  DGAPQLEPFTVEQVVTTTIGTAPASSLTPPSPPSPAPSAPTTPAPPSPPSPEAVEFVTPP 817

Query: 481  TADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEK 540
            T DS LDAD D D+  RY  +D+L+G   PPG A R LE++ ELH VSADEP + AEAE 
Sbjct: 818  TQDSILDADADDDVVPRYHLVDNLLGNASPPGHAPRVLEQL-ELHVVSADEPASLAEAEA 877

Query: 541  NPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKG 600
            +P WR AMQ+E+ +I +N TWSL D+P GHRAIGLKW                ARLVAKG
Sbjct: 878  DPNWRGAMQDELNAIVDNDTWSLTDLPHGHRAIGLKW----------------ARLVAKG 937

Query: 601  YVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPP 660
            YVQ+QGVDF+EVFAPVARLE VR LLAIAAH  W+VHHMDVKSAFLNGEL E VYV QPP
Sbjct: 938  YVQRQGVDFDEVFAPVARLELVRLLLAIAAHQGWQVHHMDVKSAFLNGELLEEVYVSQPP 997

Query: 661  GFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG 701
            GF+D+++ NKV RLHKALYGLRQAPRAWN KLD +LLSL F R +SEHG+YT   G
Sbjct: 998  GFVDDNHKNKVYRLHKALYGLRQAPRAWNTKLDSSLLSLGFHRSSSEHGVYTRTRG 1035

BLAST of CmoCh05G002090.1 vs. TrEMBL
Match: Q0J5Y3_ORYSJ (Os08g0389500 protein OS=Oryza sativa subsp. japonica GN=Os08g0389500 PE=4 SV=1)

HSP 1 Score: 854.4 bits (2206), Expect = 1.0e-244
Identity = 433/733 (59.07%), Postives = 526/733 (71.76%), Query Frame = 1

Query: 1    MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
            MTG RSAFSEL++GIRGTVKFGDGSVV IEGRGT+LF  K GEH+ L  VY IPRL  N+
Sbjct: 399  MTGTRSAFSELNTGIRGTVKFGDGSVVGIEGRGTVLFKCKDGEHQALEGVYHIPRLTTNI 458

Query: 61   VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
            VSLGQLDE     S E G+LKI + QRRLL +  R+ NRLYV++L I +PV L+A+  ++
Sbjct: 459  VSLGQLDEEKFKWSCEDGVLKIWNKQRRLLAKVVRSPNRLYVVKLNIGRPVCLAAQGGDI 518

Query: 121  SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
            +WRWHAR+GHLNF ALEKL +  +V GLP I  V+++CD CL+GKQRR PFPS+  YRA 
Sbjct: 519  AWRWHARFGHLNFRALEKLGRAVMVRGLPLINHVDQVCDSCLVGKQRRLPFPSKAKYRAK 578

Query: 181  EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
            E LELVHGDICGP+ PATP G  LFLLLVDD SR+MWL LL +K +A+ A+KR  A AEA
Sbjct: 579  EKLELVHGDICGPVTPATPSGNKLFLLLVDDLSRYMWLILLSSKDQASVAIKRFLACAEA 638

Query: 241  ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
            E  +K+R LRTDRGGEFT+ +F++YC E GIQRHLTAPY+PQQNGVVERRNQT++G ARS
Sbjct: 639  EAGRKLRTLRTDRGGEFTAHAFAEYCAEHGIQRHLTAPYTPQQNGVVERRNQTVMGMARS 698

Query: 301  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
            ++    +PG FWGEAV TAV+LLNR+PT+ +DGKTP+E W+  KP VH  R FGCVA++K
Sbjct: 699  MMKAKSLPGWFWGEAVNTAVFLLNRAPTQCVDGKTPFEVWHGVKPPVHFLRTFGCVAHVK 758

Query: 361  VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIEA 420
                 LAKLD R + +VF+GYE G+KAYR Y+PV  R HVSRD VF+E   W+W     A
Sbjct: 759  NGGQRLAKLDDRSMPMVFVGYEAGTKAYRFYNPVSRRVHVSRDAVFEEERSWEWGAEKGA 818

Query: 421  --DRDPNQFTVEYLVTEPE-EGG-------AQHQETSPP--------------------- 480
              D D   F VE+L T P  +GG       A  + TS P                     
Sbjct: 819  GPDDDIEPFVVEHLATGPTGQGGPVAATPTATQRSTSAPAPMAPPATPSQAGTPTHGAGP 878

Query: 481  --PAGAPPEPVEFATPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAE 540
              PA A    +EFA+P   D  LD DHD D+  R+R +D+L+G   PPGLA RE+ E   
Sbjct: 879  RTPASASSPAIEFASPPQGDLDLDNDHDDDVPLRFRTVDNLLGASSPPGLAEREVTE--G 938

Query: 541  LHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKR 600
            L     DEP T  EA++   WR+AM EEM SI  N+TWSL ++P G RAIGLKWVFK+K+
Sbjct: 939  LMVAIEDEPATAEEAKQVKEWREAMIEEMASIEHNKTWSLVELPAGQRAIGLKWVFKIKK 998

Query: 601  NEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKS 660
            +E G + KHKARLVAKGYVQ+QG+D+EEVFAPVAR+ESVR LLA+AAH SW VHHMDVKS
Sbjct: 999  DEHGNITKHKARLVAKGYVQRQGIDYEEVFAPVARIESVRVLLAVAAHRSWSVHHMDVKS 1058

Query: 661  AFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKR 701
            AFLNG+L E VYV+QPPGF+   +  KVL+LHKALYGL+QAPRAWN+KLD +LL L F R
Sbjct: 1059 AFLNGDLAEEVYVQQPPGFVAAGHERKVLKLHKALYGLKQAPRAWNSKLDSSLLMLGFAR 1118

BLAST of CmoCh05G002090.1 vs. TrEMBL
Match: B8BH06_ORYSI (Uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_33720 PE=4 SV=1)

HSP 1 Score: 829.3 bits (2141), Expect = 3.5e-237
Identity = 414/723 (57.26%), Postives = 514/723 (71.09%), Query Frame = 1

Query: 1   MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
           MTG+R AF++LD+ I G V+ GDGSVV I GRGTILF  K GEHR L++ Y++PRL AN+
Sbjct: 28  MTGSRMAFADLDTNITGNVRLGDGSVVRIAGRGTILFACKNGEHRTLSNTYYLPRLAANI 87

Query: 61  VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
           +S+GQLDETG  +  E G++++ D QRRLL +  RT  RLY+L++ + +PV L+A  +E 
Sbjct: 88  ISIGQLDETGFKVLAEDGIMRVWDEQRRLLARIPRTPGRLYMLDINLARPVCLAAHADED 147

Query: 121 SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
           +WRWHAR GH+NF  L K+ K+ELV GLP +  V+++C+ CL GK RR+PFP +   R+D
Sbjct: 148 AWRWHARLGHINFRVLCKMGKEELVRGLPCLSQVDQVCEACLAGKHRRSPFPRQALCRSD 207

Query: 181 EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
           EPL L+HGD+CGPI PATP G   FLLLVDD SR+MW+ LL  K  A  A+KRI+A AE 
Sbjct: 208 EPLALLHGDLCGPITPATPSGNRYFLLLVDDYSRYMWVALLSTKDAAPAAIKRIQAAAER 267

Query: 241 ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
           +  +K+R LRTDRGGEFTS  F++YC E+G++R LTAPYSPQQNGVVERRNQ++VGTARS
Sbjct: 268 KSGRKLRALRTDRGGEFTSTQFAEYCAELGMRRELTAPYSPQQNGVVERRNQSVVGTARS 327

Query: 301 LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
           +L   G+PG FWGEA+ TAVYLLNRS ++ + GKTPY  W    P VHH R FGCVA++K
Sbjct: 328 MLKAKGLPGMFWGEAINTAVYLLNRSSSKGIGGKTPYALWNGVPPAVHHLRTFGCVAHVK 387

Query: 361 VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIEA 420
            T P+L KLD R   ++F+GYEPGSKAYR YDP   R H+SRD+VFDE   W W+    A
Sbjct: 388 TTTPNLKKLDDRSRPMIFVGYEPGSKAYRAYDPATRRVHISRDIVFDEAAQWDWDAEAAA 447

Query: 421 DRDPNQFTVEY-LVTEPEEGGAQHQETSPPPAGAPPEP---------------------V 480
           D D   F VEY  V  P       Q+   PPA +   P                     V
Sbjct: 448 DLD-TDFVVEYTTVYHPGSLSGTRQDAWEPPARSSSSPRTPSDSPTAGRTPSVHGDAPAV 507

Query: 481 EFATPRT-ADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPN 540
           EF +P T A + LDADHD D   R+R MD+++G    PGLA RE++E  EL  VS +EP 
Sbjct: 508 EFVSPPTGAAANLDADHD-DAPLRFRTMDNVLGPAMLPGLANREVQE--ELMMVSGEEPA 567

Query: 541 TFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHK 600
           TFA+AE++  WR+AM +E++SI EN+TW L D+P GHR IGLKWV+KLK++ +G VVKHK
Sbjct: 568 TFAQAERDEDWRRAMLDEISSIEENKTWRLVDLPSGHRPIGLKWVYKLKKDAQGVVVKHK 627

Query: 601 ARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKET 660
           ARLVAKGYVQ+ G+DF+EVFAPVARL+SVR LLA+AA   W VHHMDVKSAFLNGEL E 
Sbjct: 628 ARLVAKGYVQRAGIDFDEVFAPVARLDSVRLLLALAAQEGWMVHHMDVKSAFLNGELIEE 687

Query: 661 VYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTY 701
           VYV QPPGF  +   NKV RL KALYGLRQAPRAWN KLD TL  L FK+   EHG+Y  
Sbjct: 688 VYVVQPPGFEIDGQENKVYRLDKALYGLRQAPRAWNTKLDCTLKKLGFKQSPLEHGLYAR 746

BLAST of CmoCh05G002090.1 vs. TrEMBL
Match: Q7XEA3_ORYSJ (Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. japonica GN=LOC_Os10g29420 PE=4 SV=2)

HSP 1 Score: 828.9 bits (2140), Expect = 4.6e-237
Identity = 413/723 (57.12%), Postives = 514/723 (71.09%), Query Frame = 1

Query: 1    MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
            MTG+R AF++LD+ I G V+ GDGSVV I GRGTILF  K GEHR L++ Y++PRL AN+
Sbjct: 585  MTGSRMAFADLDTNITGNVRLGDGSVVRIAGRGTILFACKNGEHRTLSNTYYLPRLTANI 644

Query: 61   VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
            +S+GQLDETG  +  E G++++ D QRRLL +  RT  RLY+L++ + +PV L+A  +E 
Sbjct: 645  ISIGQLDETGFKVLAEDGIMRVWDEQRRLLARIPRTPGRLYMLDINLARPVCLAAHADED 704

Query: 121  SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
            +WRWHAR GH+NF AL K+ K+ELV GLP +  V+++C+ CL GK RR+PFP +   R+D
Sbjct: 705  AWRWHARLGHINFRALCKMGKEELVRGLPCLSQVDQVCEACLAGKHRRSPFPRQALCRSD 764

Query: 181  EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
            EPL L+HGD+CGPI PATP G   FLLLVDD SR+MW+ LL  K  A  A+KRI+A AE 
Sbjct: 765  EPLALLHGDLCGPITPATPSGNRYFLLLVDDYSRYMWVALLSTKDAAPAAIKRIQAAAER 824

Query: 241  ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
            +  +K+R LRTDRGGEFTS  F++YC E+G++R LTAPYSPQQNGVVERRNQ++VGTARS
Sbjct: 825  KSGRKLRALRTDRGGEFTSTQFAEYCAELGMRRELTAPYSPQQNGVVERRNQSVVGTARS 884

Query: 301  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
            +L   G+PG FWGEA+ TAVYLLNRS ++ + GKTPY  W    P VHH R FGCVA++K
Sbjct: 885  MLKAKGLPGMFWGEAINTAVYLLNRSSSKGIGGKTPYALWNGVPPAVHHLRTFGCVAHVK 944

Query: 361  VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIEA 420
             T P+L KLD R   ++F+GY+PGSKAYR YDP   R H+SRD+VFDE   W W+    A
Sbjct: 945  TTTPNLKKLDDRSRPMIFVGYKPGSKAYRAYDPATRRVHISRDIVFDEAAQWDWDAEAAA 1004

Query: 421  DRDPNQFTVEY-LVTEPEEGGAQHQETSPPPAGAPPEP---------------------V 480
            D D   F VEY  V  P       Q+   PPA +   P                     V
Sbjct: 1005 DLD-TDFVVEYTTVYHPGSLSGTRQDAGEPPARSSSSPRTPSDSPTAGRTPSVHGDALAV 1064

Query: 481  EFATPRT-ADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPN 540
            EF +P T A + LDADHD D   R+R MD+++G    PGLA RE++E  EL  VS +EP 
Sbjct: 1065 EFVSPPTGAAANLDADHD-DAPLRFRTMDNVLGPAMLPGLANREVQE--ELMMVSGEEPA 1124

Query: 541  TFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHK 600
            TF +AE++  WR+AM +E++SI EN+TW L D+P GHR IGLKWV+KLK++ +G VVKHK
Sbjct: 1125 TFGQAERDEDWRRAMLDEISSIEENKTWRLVDLPSGHRPIGLKWVYKLKKDAQGVVVKHK 1184

Query: 601  ARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKET 660
            ARLVAKGYVQ+ G+DF+EVFAPVARL+SVR LLA+AA   W VHHMDVKSAFLNGEL E 
Sbjct: 1185 ARLVAKGYVQRAGIDFDEVFAPVARLDSVRLLLALAAQEGWMVHHMDVKSAFLNGELIEE 1244

Query: 661  VYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTY 701
            VYV QPPGF  +   NKV RL KALYGLRQAPRAWN KLD TL  L FK+   EHG+Y  
Sbjct: 1245 VYVVQPPGFEIDGQENKVYRLDKALYGLRQAPRAWNTKLDCTLKKLGFKQSPLEHGLYAR 1303

BLAST of CmoCh05G002090.1 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 169.9 bits (429), Expect = 5.8e-42
Identity = 79/184 (42.93%), Postives = 122/184 (66.30%), Query Frame = 1

Query: 513 ADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGE 572
           A EP+T+ EA++   W  AM +E+ ++    TW +  +PP  + IG KWV+K+K N  G 
Sbjct: 83  AKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGT 142

Query: 573 VVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNG 632
           + ++KARLVAKGY Q++G+DF E F+PV +L SV+ +LAI+A +++ +H +D+ +AFLNG
Sbjct: 143 IERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNG 202

Query: 633 ELKETVYVRQPPGFL----DNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRC 692
           +L E +Y++ PPG+     D+  PN V  L K++YGL+QA R W  K   TL+   F + 
Sbjct: 203 DLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQS 262

BLAST of CmoCh05G002090.1 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 87.4 bits (215), Expect = 3.8e-17
Identity = 42/99 (42.42%), Postives = 64/99 (64.65%), Query Frame = 1

Query: 515 EPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVV 574
           EP +   A K+P W +AMQEE+ +++ N+TW L   P     +G KWVFK K +  G + 
Sbjct: 27  EPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLD 86

Query: 575 KHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIA 614
           + KARLVAKG+ Q++G+ F E ++PV R  ++R +L +A
Sbjct: 87  RLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CmoCh05G002090.1 vs. TAIR10
Match: ATMG00710.1 (ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein)

HSP 1 Score: 68.9 bits (167), Expect = 1.4e-11
Identity = 36/95 (37.89%), Postives = 52/95 (54.74%), Query Frame = 1

Query: 291 NQTIVGTARSLLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHF 350
           N+TI+   RS+L   G+P  F  +A  TAV+++N+ P+ +++   P E W+   PT  + 
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61

Query: 351 RVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGS 386
           R FGCVAY+        KL PR  K    G E GS
Sbjct: 62  RRFGCVAYIHCDE---GKLKPRAKK----GEEKGS 89

BLAST of CmoCh05G002090.1 vs. NCBI nr
Match: gi|38344222|emb|CAE03692.2| (OSJNBb0026E15.10 [Oryza sativa Japonica Group])

HSP 1 Score: 905.2 bits (2338), Expect = 7.2e-260
Identity = 441/715 (61.68%), Postives = 540/715 (75.52%), Query Frame = 1

Query: 1    MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
            MTG+RSAF+ELD+ + GTV+FGDGSVV IEGR T+LF  + GEHR +  VY+IPRL AN+
Sbjct: 405  MTGSRSAFAELDTAVTGTVRFGDGSVVRIEGRVTVLFSCRFGEHRGIAGVYYIPRLTANI 464

Query: 61   VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
            VSLGQLD +G  + I  G+L + D +  LL + RR+ + LY ++L+ID+PV L+A++ E 
Sbjct: 465  VSLGQLDRSGSKVLIHHGILHVWDPRGHLLVRVRRSDDCLYTIKLDIDRPVCLAARSAEP 524

Query: 121  SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
            +WRWHARYGHLNFPAL KL ++E+V GLP ++ V ++CDGCL+GKQRR  FP+++ YRAD
Sbjct: 525  AWRWHARYGHLNFPALRKLAQQEMVRGLPLLQQVTQVCDGCLLGKQRRAAFPTQSKYRAD 584

Query: 181  EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
            E LELVHGD+CGPI+PATP G   FLLLVDD SR+MWLT++++K EAA A+K  +ARAE 
Sbjct: 585  EHLELVHGDLCGPIEPATPAGNRYFLLLVDDMSRYMWLTMIRSKDEAANAIKHFQARAEV 644

Query: 241  ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
            E  +K+R LR DRG EFTS  F +YC  +G+ R LTAPYSPQQNGVVERRNQTIV TARS
Sbjct: 645  ESGRKLRALRMDRGSEFTSIEFGEYCANLGVGRQLTAPYSPQQNGVVERRNQTIVATARS 704

Query: 301  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
            ++   G+PGRFWGEA+ TAV+LLNRSPT+SLD +TPYEAWY + P VH  R FGCV ++K
Sbjct: 705  MMKAKGVPGRFWGEAMSTAVFLLNRSPTKSLDNQTPYEAWYGQWPAVHFLRTFGCVGHVK 764

Query: 361  VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVI-E 420
            +T+P L KLD R   +V +GYE GSKAYRLYDPV  R HVSRDVVFDE+  W W  V  +
Sbjct: 765  ITKPGLKKLDDRSAPMVLLGYEQGSKAYRLYDPVSERVHVSRDVVFDEDIAWDWGPVTPD 824

Query: 421  ADRDPNQFTVEYLVT--------------EPEEGGAQHQETSPPPAGAPPEPVEFATPRT 480
                   FTVE +VT               P         T  PP+   PE VEF TP T
Sbjct: 825  GAPQLEPFTVEQVVTTTIGTAPASSPTPPSPPSPAPSAPTTPAPPSPPSPEAVEFVTPPT 884

Query: 481  ADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKN 540
             DS LDAD D D+  RYR +D+L+G   PPG A R LE++ ELH VSADEP + AEAE +
Sbjct: 885  QDSILDADADDDVVPRYRLVDNLLGNASPPGHAPRVLEQL-ELHVVSADEPASLAEAEAD 944

Query: 541  PCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGY 600
            P WR AMQ+E+ +I +N TWSL D+P GHRAIGLKWV+KLKR+E+G +V++KARLVAKGY
Sbjct: 945  PSWRGAMQDELNAIVDNDTWSLTDLPHGHRAIGLKWVYKLKRDEQGAIVRYKARLVAKGY 1004

Query: 601  VQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPG 660
            VQ+QGVDF+EVFA VARLESVR LLA+AAH  W+VHHMDVKSAFLNGEL E VYV QPPG
Sbjct: 1005 VQRQGVDFDEVFALVARLESVRLLLAVAAHQGWQVHHMDVKSAFLNGELLEEVYVSQPPG 1064

Query: 661  FLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG 701
            F+D+++ NKV RLHKALYGLRQAPRAWNAKLD +LLSL F R +SEHG+YT   G
Sbjct: 1065 FVDDNHKNKVYRLHKALYGLRQAPRAWNAKLDSSLLSLGFHRSSSEHGVYTRTRG 1118

BLAST of CmoCh05G002090.1 vs. NCBI nr
Match: gi|116634828|emb|CAH66352.1| (OSIGBa0135C09.3 [Oryza sativa Indica Group])

HSP 1 Score: 873.2 bits (2255), Expect = 3.0e-250
Identity = 431/716 (60.20%), Postives = 526/716 (73.46%), Query Frame = 1

Query: 1    MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
            MTG+RSAF++LD+ + GTV+FGDGSVV IEGRGT+LF  + GEHR +  VY+IPRL AN+
Sbjct: 338  MTGSRSAFAKLDTAVTGTVRFGDGSVVRIEGRGTVLFSCRFGEHRGIAGVYYIPRLTANI 397

Query: 61   VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
            VSLGQLD +G  + I  G+L++ D +  LL + RR+ + LY ++L ID+PV L+A++ + 
Sbjct: 398  VSLGQLDRSGSKVLIHHGVLRVWDPRGHLLVRVRRSDDCLYTIKLNIDRPVYLAARSAKP 457

Query: 121  SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
            +WRWHARYGHLNFP+L KL ++E+V GLP ++ V ++CDGCL+GKQRR  FP+++ YRAD
Sbjct: 458  AWRWHARYGHLNFPSLRKLAQQEMVRGLPLLQQVTQVCDGCLLGKQRRAAFPTQSKYRAD 517

Query: 181  EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
            E LELVHGD+CGPI+PATP G   FLLLVDD SR+MWLTL+++K EAA A+K  +A AE 
Sbjct: 518  EHLELVHGDLCGPIEPATPAGNRYFLLLVDDMSRYMWLTLIRSKDEAANAIKHFQAHAEV 577

Query: 241  ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
            E  +K+R LRTDRGGEFTS  F +YC  + + R LTAPYSPQQNGVVERRNQTIV TARS
Sbjct: 578  ESGRKLRALRTDRGGEFTSIEFGEYCANLRVGRQLTAPYSPQQNGVVERRNQTIVATARS 637

Query: 301  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
            ++   G+PGRFWGEA+ TAV+LLNRSPT+SLD +TPYEAWY ++P VH  R FGCV ++K
Sbjct: 638  MMKAKGVPGRFWGEAMSTAVFLLNRSPTKSLDNQTPYEAWYGQRPAVHFLRTFGCVGHVK 697

Query: 361  VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIEA 420
            +T+P L KLD R   +V +GYE GSKAYRLYDPV  R HVSRDVVFDE+  W W   +  
Sbjct: 698  ITKPGLKKLDDRSAPMVLLGYEQGSKAYRLYDPVSERVHVSRDVVFDEDAAWDWGP-LTP 757

Query: 421  DRDP--NQFTVEYLVT--------------EPEEGGAQHQETSPPPAGAPPEPVEFATPR 480
            D  P    FTVE +VT               P         T  PP+   PE VEF TP 
Sbjct: 758  DGAPQLEPFTVEQVVTTTIGTAPASSLTPPSPPSPAPSAPTTPAPPSPPSPEAVEFVTPP 817

Query: 481  TADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEK 540
            T DS LDAD D D+  RY  +D+L+G   PPG A R LE++ ELH VSADEP + AEAE 
Sbjct: 818  TQDSILDADADDDVVPRYHLVDNLLGNASPPGHAPRVLEQL-ELHVVSADEPASLAEAEA 877

Query: 541  NPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKG 600
            +P WR AMQ+E+ +I +N TWSL D+P GHRAIGLKW                ARLVAKG
Sbjct: 878  DPNWRGAMQDELNAIVDNDTWSLTDLPHGHRAIGLKW----------------ARLVAKG 937

Query: 601  YVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPP 660
            YVQ+QGVDF+EVFAPVARLE VR LLAIAAH  W+VHHMDVKSAFLNGEL E VYV QPP
Sbjct: 938  YVQRQGVDFDEVFAPVARLELVRLLLAIAAHQGWQVHHMDVKSAFLNGELLEEVYVSQPP 997

Query: 661  GFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG 701
            GF+D+++ NKV RLHKALYGLRQAPRAWN KLD +LLSL F R +SEHG+YT   G
Sbjct: 998  GFVDDNHKNKVYRLHKALYGLRQAPRAWNTKLDSSLLSLGFHRSSSEHGVYTRTRG 1035

BLAST of CmoCh05G002090.1 vs. NCBI nr
Match: gi|113623687|dbj|BAF23632.1| (Os08g0389500 [Oryza sativa Japonica Group])

HSP 1 Score: 854.4 bits (2206), Expect = 1.5e-244
Identity = 433/733 (59.07%), Postives = 526/733 (71.76%), Query Frame = 1

Query: 1    MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
            MTG RSAFSEL++GIRGTVKFGDGSVV IEGRGT+LF  K GEH+ L  VY IPRL  N+
Sbjct: 399  MTGTRSAFSELNTGIRGTVKFGDGSVVGIEGRGTVLFKCKDGEHQALEGVYHIPRLTTNI 458

Query: 61   VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
            VSLGQLDE     S E G+LKI + QRRLL +  R+ NRLYV++L I +PV L+A+  ++
Sbjct: 459  VSLGQLDEEKFKWSCEDGVLKIWNKQRRLLAKVVRSPNRLYVVKLNIGRPVCLAAQGGDI 518

Query: 121  SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
            +WRWHAR+GHLNF ALEKL +  +V GLP I  V+++CD CL+GKQRR PFPS+  YRA 
Sbjct: 519  AWRWHARFGHLNFRALEKLGRAVMVRGLPLINHVDQVCDSCLVGKQRRLPFPSKAKYRAK 578

Query: 181  EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
            E LELVHGDICGP+ PATP G  LFLLLVDD SR+MWL LL +K +A+ A+KR  A AEA
Sbjct: 579  EKLELVHGDICGPVTPATPSGNKLFLLLVDDLSRYMWLILLSSKDQASVAIKRFLACAEA 638

Query: 241  ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
            E  +K+R LRTDRGGEFT+ +F++YC E GIQRHLTAPY+PQQNGVVERRNQT++G ARS
Sbjct: 639  EAGRKLRTLRTDRGGEFTAHAFAEYCAEHGIQRHLTAPYTPQQNGVVERRNQTVMGMARS 698

Query: 301  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
            ++    +PG FWGEAV TAV+LLNR+PT+ +DGKTP+E W+  KP VH  R FGCVA++K
Sbjct: 699  MMKAKSLPGWFWGEAVNTAVFLLNRAPTQCVDGKTPFEVWHGVKPPVHFLRTFGCVAHVK 758

Query: 361  VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIEA 420
                 LAKLD R + +VF+GYE G+KAYR Y+PV  R HVSRD VF+E   W+W     A
Sbjct: 759  NGGQRLAKLDDRSMPMVFVGYEAGTKAYRFYNPVSRRVHVSRDAVFEEERSWEWGAEKGA 818

Query: 421  --DRDPNQFTVEYLVTEPE-EGG-------AQHQETSPP--------------------- 480
              D D   F VE+L T P  +GG       A  + TS P                     
Sbjct: 819  GPDDDIEPFVVEHLATGPTGQGGPVAATPTATQRSTSAPAPMAPPATPSQAGTPTHGAGP 878

Query: 481  --PAGAPPEPVEFATPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAE 540
              PA A    +EFA+P   D  LD DHD D+  R+R +D+L+G   PPGLA RE+ E   
Sbjct: 879  RTPASASSPAIEFASPPQGDLDLDNDHDDDVPLRFRTVDNLLGASSPPGLAEREVTE--G 938

Query: 541  LHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKR 600
            L     DEP T  EA++   WR+AM EEM SI  N+TWSL ++P G RAIGLKWVFK+K+
Sbjct: 939  LMVAIEDEPATAEEAKQVKEWREAMIEEMASIEHNKTWSLVELPAGQRAIGLKWVFKIKK 998

Query: 601  NEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKS 660
            +E G + KHKARLVAKGYVQ+QG+D+EEVFAPVAR+ESVR LLA+AAH SW VHHMDVKS
Sbjct: 999  DEHGNITKHKARLVAKGYVQRQGIDYEEVFAPVARIESVRVLLAVAAHRSWSVHHMDVKS 1058

Query: 661  AFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKR 701
            AFLNG+L E VYV+QPPGF+   +  KVL+LHKALYGL+QAPRAWN+KLD +LL L F R
Sbjct: 1059 AFLNGDLAEEVYVQQPPGFVAAGHERKVLKLHKALYGLKQAPRAWNSKLDSSLLMLGFAR 1118

BLAST of CmoCh05G002090.1 vs. NCBI nr
Match: gi|218184581|gb|EEC67008.1| (hypothetical protein OsI_33720 [Oryza sativa Indica Group])

HSP 1 Score: 829.3 bits (2141), Expect = 5.0e-237
Identity = 414/723 (57.26%), Postives = 514/723 (71.09%), Query Frame = 1

Query: 1   MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
           MTG+R AF++LD+ I G V+ GDGSVV I GRGTILF  K GEHR L++ Y++PRL AN+
Sbjct: 28  MTGSRMAFADLDTNITGNVRLGDGSVVRIAGRGTILFACKNGEHRTLSNTYYLPRLAANI 87

Query: 61  VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
           +S+GQLDETG  +  E G++++ D QRRLL +  RT  RLY+L++ + +PV L+A  +E 
Sbjct: 88  ISIGQLDETGFKVLAEDGIMRVWDEQRRLLARIPRTPGRLYMLDINLARPVCLAAHADED 147

Query: 121 SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
           +WRWHAR GH+NF  L K+ K+ELV GLP +  V+++C+ CL GK RR+PFP +   R+D
Sbjct: 148 AWRWHARLGHINFRVLCKMGKEELVRGLPCLSQVDQVCEACLAGKHRRSPFPRQALCRSD 207

Query: 181 EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
           EPL L+HGD+CGPI PATP G   FLLLVDD SR+MW+ LL  K  A  A+KRI+A AE 
Sbjct: 208 EPLALLHGDLCGPITPATPSGNRYFLLLVDDYSRYMWVALLSTKDAAPAAIKRIQAAAER 267

Query: 241 ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
           +  +K+R LRTDRGGEFTS  F++YC E+G++R LTAPYSPQQNGVVERRNQ++VGTARS
Sbjct: 268 KSGRKLRALRTDRGGEFTSTQFAEYCAELGMRRELTAPYSPQQNGVVERRNQSVVGTARS 327

Query: 301 LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
           +L   G+PG FWGEA+ TAVYLLNRS ++ + GKTPY  W    P VHH R FGCVA++K
Sbjct: 328 MLKAKGLPGMFWGEAINTAVYLLNRSSSKGIGGKTPYALWNGVPPAVHHLRTFGCVAHVK 387

Query: 361 VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIEA 420
            T P+L KLD R   ++F+GYEPGSKAYR YDP   R H+SRD+VFDE   W W+    A
Sbjct: 388 TTTPNLKKLDDRSRPMIFVGYEPGSKAYRAYDPATRRVHISRDIVFDEAAQWDWDAEAAA 447

Query: 421 DRDPNQFTVEY-LVTEPEEGGAQHQETSPPPAGAPPEP---------------------V 480
           D D   F VEY  V  P       Q+   PPA +   P                     V
Sbjct: 448 DLD-TDFVVEYTTVYHPGSLSGTRQDAWEPPARSSSSPRTPSDSPTAGRTPSVHGDAPAV 507

Query: 481 EFATPRT-ADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPN 540
           EF +P T A + LDADHD D   R+R MD+++G    PGLA RE++E  EL  VS +EP 
Sbjct: 508 EFVSPPTGAAANLDADHD-DAPLRFRTMDNVLGPAMLPGLANREVQE--ELMMVSGEEPA 567

Query: 541 TFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHK 600
           TFA+AE++  WR+AM +E++SI EN+TW L D+P GHR IGLKWV+KLK++ +G VVKHK
Sbjct: 568 TFAQAERDEDWRRAMLDEISSIEENKTWRLVDLPSGHRPIGLKWVYKLKKDAQGVVVKHK 627

Query: 601 ARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKET 660
           ARLVAKGYVQ+ G+DF+EVFAPVARL+SVR LLA+AA   W VHHMDVKSAFLNGEL E 
Sbjct: 628 ARLVAKGYVQRAGIDFDEVFAPVARLDSVRLLLALAAQEGWMVHHMDVKSAFLNGELIEE 687

Query: 661 VYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTY 701
           VYV QPPGF  +   NKV RL KALYGLRQAPRAWN KLD TL  L FK+   EHG+Y  
Sbjct: 688 VYVVQPPGFEIDGQENKVYRLDKALYGLRQAPRAWNTKLDCTLKKLGFKQSPLEHGLYAR 746

BLAST of CmoCh05G002090.1 vs. NCBI nr
Match: gi|110289120|gb|AAP53887.2| (retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group])

HSP 1 Score: 828.9 bits (2140), Expect = 6.5e-237
Identity = 413/723 (57.12%), Postives = 514/723 (71.09%), Query Frame = 1

Query: 1    MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANL 60
            MTG+R AF++LD+ I G V+ GDGSVV I GRGTILF  K GEHR L++ Y++PRL AN+
Sbjct: 585  MTGSRMAFADLDTNITGNVRLGDGSVVRIAGRGTILFACKNGEHRTLSNTYYLPRLTANI 644

Query: 61   VSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEEV 120
            +S+GQLDETG  +  E G++++ D QRRLL +  RT  RLY+L++ + +PV L+A  +E 
Sbjct: 645  ISIGQLDETGFKVLAEDGIMRVWDEQRRLLARIPRTPGRLYMLDINLARPVCLAAHADED 704

Query: 121  SWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRAD 180
            +WRWHAR GH+NF AL K+ K+ELV GLP +  V+++C+ CL GK RR+PFP +   R+D
Sbjct: 705  AWRWHARLGHINFRALCKMGKEELVRGLPCLSQVDQVCEACLAGKHRRSPFPRQALCRSD 764

Query: 181  EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEA 240
            EPL L+HGD+CGPI PATP G   FLLLVDD SR+MW+ LL  K  A  A+KRI+A AE 
Sbjct: 765  EPLALLHGDLCGPITPATPSGNRYFLLLVDDYSRYMWVALLSTKDAAPAAIKRIQAAAER 824

Query: 241  ECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS 300
            +  +K+R LRTDRGGEFTS  F++YC E+G++R LTAPYSPQQNGVVERRNQ++VGTARS
Sbjct: 825  KSGRKLRALRTDRGGEFTSTQFAEYCAELGMRRELTAPYSPQQNGVVERRNQSVVGTARS 884

Query: 301  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMK 360
            +L   G+PG FWGEA+ TAVYLLNRS ++ + GKTPY  W    P VHH R FGCVA++K
Sbjct: 885  MLKAKGLPGMFWGEAINTAVYLLNRSSSKGIGGKTPYALWNGVPPAVHHLRTFGCVAHVK 944

Query: 361  VTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIEA 420
             T P+L KLD R   ++F+GY+PGSKAYR YDP   R H+SRD+VFDE   W W+    A
Sbjct: 945  TTTPNLKKLDDRSRPMIFVGYKPGSKAYRAYDPATRRVHISRDIVFDEAAQWDWDAEAAA 1004

Query: 421  DRDPNQFTVEY-LVTEPEEGGAQHQETSPPPAGAPPEP---------------------V 480
            D D   F VEY  V  P       Q+   PPA +   P                     V
Sbjct: 1005 DLD-TDFVVEYTTVYHPGSLSGTRQDAGEPPARSSSSPRTPSDSPTAGRTPSVHGDALAV 1064

Query: 481  EFATPRT-ADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPN 540
            EF +P T A + LDADHD D   R+R MD+++G    PGLA RE++E  EL  VS +EP 
Sbjct: 1065 EFVSPPTGAAANLDADHD-DAPLRFRTMDNVLGPAMLPGLANREVQE--ELMMVSGEEPA 1124

Query: 541  TFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHK 600
            TF +AE++  WR+AM +E++SI EN+TW L D+P GHR IGLKWV+KLK++ +G VVKHK
Sbjct: 1125 TFGQAERDEDWRRAMLDEISSIEENKTWRLVDLPSGHRPIGLKWVYKLKKDAQGVVVKHK 1184

Query: 601  ARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKET 660
            ARLVAKGYVQ+ G+DF+EVFAPVARL+SVR LLA+AA   W VHHMDVKSAFLNGEL E 
Sbjct: 1185 ARLVAKGYVQRAGIDFDEVFAPVARLDSVRLLLALAAQEGWMVHHMDVKSAFLNGELIEE 1244

Query: 661  VYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTY 701
            VYV QPPGF  +   NKV RL KALYGLRQAPRAWN KLD TL  L FK+   EHG+Y  
Sbjct: 1245 VYVVQPPGFEIDGQENKVYRLDKALYGLRQAPRAWNTKLDCTLKKLGFKQSPLEHGLYAR 1303

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
POLX_TOBAC1.8e-11436.56Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
COPIA_DROME9.9e-6033.92Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3[more]
YJ41B_YEAST1.0e-1628.78Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
YP41B_YEAST1.4e-1628.32Transposon Ty4-P Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
YH41B_YEAST1.4e-1628.32Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Match NameE-valueIdentityDescription
Q7XPB1_ORYSJ5.0e-26061.68OSJNBb0026E15.10 protein OS=Oryza sativa subsp. japonica GN=OSJNBb0026E15.10 PE=... [more]
A0B9X7_ORYSA2.1e-25060.20OSIGBa0135C09.3 protein OS=Oryza sativa GN=OSIGBa0135C09.3 PE=4 SV=1[more]
Q0J5Y3_ORYSJ1.0e-24459.07Os08g0389500 protein OS=Oryza sativa subsp. japonica GN=Os08g0389500 PE=4 SV=1[more]
B8BH06_ORYSI3.5e-23757.26Uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_33720 PE=4 SV=1[more]
Q7XEA3_ORYSJ4.6e-23757.12Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. jap... [more]
Match NameE-valueIdentityDescription
AT4G23160.15.8e-4242.93 cysteine-rich RLK (RECEPTOR-like protein kinase) 8[more]
ATMG00820.13.8e-1742.42ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)[more]
ATMG00710.11.4e-1137.89ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|38344222|emb|CAE03692.2|7.2e-26061.68OSJNBb0026E15.10 [Oryza sativa Japonica Group][more]
gi|116634828|emb|CAH66352.1|3.0e-25060.20OSIGBa0135C09.3 [Oryza sativa Indica Group][more]
gi|113623687|dbj|BAF23632.1|1.5e-24459.07Os08g0389500 [Oryza sativa Japonica Group][more]
gi|218184581|gb|EEC67008.1|5.0e-23757.26hypothetical protein OsI_33720 [Oryza sativa Indica Group][more]
gi|110289120|gb|AAP53887.2|6.5e-23757.12retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Gro... [more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR001584Integrase_cat-core
IPR012337RNaseH-like_sf
IPR013103RVT_2
IPR025724GAG-pre-integrase_dom
Vocabulary: Biological Process
TermDefinition
GO:0015074DNA integration
Vocabulary: Molecular Function
TermDefinition
GO:0003676nucleic acid binding
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
cellular_component GO:0005739 mitochondrion
cellular_component GO:0009536 plastid
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0046872 metal ion binding

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CmoCh05G002090CmoCh05G002090gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CmoCh05G002090.1CmoCh05G002090.1-proteinpolypeptide


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh05G002090.1.CDS.1CmoCh05G002090.1.CDS.1CDS


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh05G002090.1.exon.1CmoCh05G002090.1.exon.1exon


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 180..294
score: 3.9
IPR001584Integrase, catalytic corePROFILEPS50994INTEGRASEcoord: 178..344
score: 27
IPR012337Ribonuclease H-like domainGENE3DG3DSA:3.30.420.10coord: 182..348
score: 3.7
IPR012337Ribonuclease H-like domainunknownSSF53098Ribonuclease H-likecoord: 181..353
score: 1.64
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 542..700
score: 2.9
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 99..166
score: 5.8
NoneNo IPR availableunknownCoilCoilcoord: 221..241
scor
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 1..700
score:
NoneNo IPR availablePANTHERPTHR11439:SF127SUBFAMILY NOT NAMEDcoord: 1..700
score:
NoneNo IPR availableunknownSSF56672DNA/RNA polymerasescoord: 542..686
score: 8.