CmoCh05G000040 (gene) Cucurbita moschata (Rifu)

Name: CmoCh05G000040
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Transposon Ty1-OL Gag-Pol polyprotein
Location: Cmo_Chr05 : 15211 .. 18601 (+)
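
The span implied by the location above can be checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text: str):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cmo_Chr05 : 15211 .. 18601 (+)")
# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 3391 bases on the forward strand of Cmo_Chr05.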
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGGCTCAATCCGAGGAGGACGAGCCAGCTCTCTTCATGGTAACCGCATGCGTCCCCAATTTCGATTCCAAATCCGACGACGATGTTGAACCCAAGAAAGAGCTCCAGTTGGGCGTAGCAAAGCTGGCACCAACTGGGATGTTGAACCCAAGAAAGAGCTCCAGTTGGGCGTAGCAAAGCTGGCACCAACTGGGGAGCCAATTCAACTGGAGGAGGAGCGAGTGTTCGCTCAGATCGGCGAAAGGGACGAGCAACACGAGCACCAGCAATGGATCCTTGACACAGGGGCAACAAACCATATGACCGGGGCTAGATCTGCGTTCTCCGAGCTCGACTCGGGGATCCGTGGGACGGTGAAATTCGGCGACGGCTCCGTCGTCGAGATCGAAGGGCGCGGCACCATCCTGTTCGTCAGTAAGGGAGGCGAGCATCGCAAGCTGACCGACGTCTACTTCATCCCGAGGCTCAAGGCTAACCTTGTGAGCCTGGGTCAGCTCGATGAGACAGGCTGCTTCATTTCCATCGAGCGCGGACTACTGAAAATCTGCGATAATCAACGACGGCTGCTCACGCAGGCAAGGCGCACCACAAACCGTCTTTACATCCTGGAGTTAGAGATAGACCAACCCGTTAGCCTCTCGGCCAAGACCGAAGAGGTATCTTGGAGGTGGCACGCAAGGTACGGACACTTAAACTTTCCTGCCCTAGAAAAGCTACAGAAGAAGGAGTTGGTGCACGGCTTGCCAGAAATCAAAGGCGTGAACAAGCTGTGCGACGGGTGCTTCATAGGCAAACAGAGGCGCACACCCTTTCCGTCTCGAACAGCCTACCGAGCCGATGAGCCATTGGAGCTTGTACACGGCGATATTTGCGGGCCCATCAAGCCGGCGACCCCAGGCGGTAAGAGTCTCTTCCTCCTGTTAGTCGATGACAAAAGCCGCTTCATGTGGCTGACTCTGCTGCAAGCGAAAAGTGAGGCGGCAGAGGCGATTAAGCGCATTAAAGCGCGAGCGGAGGCCGAATGTGAGAAGAAGATGCGAGTGCTGCGCACAGACCGAGGCGGAGAATTCACCTCGGCAAGTTTCAATAAGTACTGCGACGAGATCGGCATACAACGGCACCTAACGGCGCCCTACTCCCCCCAACAGAACGGAGTGGTAGAGCGCCGAAATCAGACCATTGTCGGGACAGCGAGGTCATTGTTGGTGACGGCCGGGATGCCTGGGAGATTCTGGGGAGAGGCAGTAATGACGGCTGTTTATCTCCTCAATCGGTCACCAACCCGAAGCCTCGACGGGAAGACGCCATATGAAGCCTGGTACAACAAAAAACCAACAGTACATCATTTTCGCGTGTTCGGCTGCGTCGCATACATGAAGGTAACACGTCCTCATCTCGCCAAGCTCGATCCCAGGGGGCTGAAGGTCGTCTTCATCGGCTACGAACCCGGGAGCAAGGCGTACAGACTCTATGATCCTGTGGGGGGGCGAGCTCACGTGTCTCGCGACGTCGTCTTCGACGAAAGCACCTTCTGGCAGTGGAATGACGTGATCGAGACAGACCGTAATCCAAATCAATTCACGGTGGAGTACCTCGTCACCGAGCCTGAAGAAGGAGGAGCCCAGCATCAGGAGACGTCACCGCCGCCAGCAGGTGCACCACCTGAACCAGTGGAATTCGCAACACCACGGACTGCGGATTCGACGCTGGATGCCGATCACGATACTGATCTGGAGGCTAGGTACCGGAGGATGGATGACCTAGTGGGAGGAGGTGAACCACCTGGACTAGCAGCGCGCGAGCTCGAGGAAGTGGCCGAACTACATGCCGTCAGTGCAGATGAACCAAACACCTTCGCCGAAGCAGAAAAGAACCCGTGCTGGCGGAAGGCAATGCAGGAGGAGATGACATCCATCACTGAGAACCAGACGTGGAGTCTGGAGGATATGCCACCGGGACACCGAGCCATAGGGCTCAAATGGGTCTTCAAACTGAAGC
GCAACGAAAAAGGAGAAGTTGTGAAGCACAAGGCTCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGATTTCGAAGAGGTATTTGCGCCAGTAGCAAGGTTAGAATCCGTTCGTTTCCTGCTGGCAATTGCAGCACATCACTCTTGGGAGGTTCACCATATGGACGTAAAGTCTGCTTTCCTTAACGGAGAGTTGAAGGAGACCGTCTATGTTCGACAACCACCTGGCTTCCTGGATAACGACAACCCTAATAAGGTACTGCGCCTGCACAAAGCACTCTACGGGCTTCGACAAGCCCCACGAGCCTGGAACGCGAAGCTCGACAGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGAGCATGGCATGTACACGTACGGCCACGGCAAAAAGCGACTGATCGTGGGAGTGTACGTCGACGACCTCATAATCACTGGAGGCGACGTGGGAGTCCTCGGAAGGTTCAAGAAGGAGATGTCGAAAAACTTCGAGATGAGCGATCTCGGTGTGCTCAGCTACTACCTCGGCATCGAAGTGCAACAGAACTCTTCCGGCATCTCCATCTGCCAAAGTGCATACGCGAGGAAGCTGCTGGACACAACTGGGCTTGTGGACAGTAATCCTACCAGGACGCCAATGGAGGCCCGACTTCAACTAAGGAAGGCCGGCACTACGACGACAGTCGACTCCACCAATTACCGCAGCATTGTCGGGAGTCTGCGCTATCTGGTAAACTCTCGTCCTGATCTTGCTTATTCTGTTGGATATGTGAGCAGGTTTATGGAAGCACCTAGGGAGGAGCATCTAGTGGCTGTCAAGCGCATCCTGCGCTATGTAGCCGGAACCAGAGGCTGGGGCGTAAGATACTGCGCTGGGAGCGAAAAGGAGAAACTCAAACTGGTCGGCTACAGCGATAGTGACATGGCCGGTGACGTTGATGATCGTAAGAGCACCAGCGGAATGATCTACTTTCTCTCAGGCGGCGCGATCTGCTGGCAATCAACAAAACAGAAGGTAGTTGCTTTGTCTTCCTGCGAAGCAGAATACATCGCCGCTTCGATGGCAGCAACTCAAGGGATCTGGCTTGCACGACTAATGGAAGAACTCATCGGGAGAGAAAGCGATTCACCAATGCTATACGTAGACAACAAAGCTACGATCTCCCTGATCAAAAATCCAGTTTTGCACGACCGGAGCAAGCACATAGAAACCAGATTCCACTACATTCGAGAATGTGCAGATCGAGGGCTCATCAAAATTGATTTCATCCGAACAGAGGAACAACTTGGAGACATTTTCACCAAATCCCTGGCGCGGGTGAAATTTGAAGAACTACGCTCAAAGATCGGAGTTCAAATAATAAGGTACAGCTTTTAG

mRNA sequence

ATGTGGCTCAATCCGAGGAGGACGAGCCAGCTCTCTTCATGCAAAGCTGGCACCAACTGGGATGTTGAACCCAAGAAAGAGCTCCAGTTGGGCGTAGCAAAGCTGGCACCAACTGGGGAGCCAATTCAACTGGAGGAGGAGCGAGTGTTCGCTCAGATCGGCGAAAGGGACGAGCAACACGAGCACCAGCAATGGATCCTTGACACAGGGGCAACAAACCATATGACCGGGGCTAGATCTGCGTTCTCCGAGCTCGACTCGGGGATCCGTGGGACGGTGAAATTCGGCGACGGCTCCGTCGTCGAGATCGAAGGGCGCGGCACCATCCTGTTCGTCAGTAAGGGAGGCGAGCATCGCAAGCTGACCGACGTCTACTTCATCCCGAGGCTCAAGGCTAACCTTGTGAGCCTGGGTCAGCTCGATGAGACAGGCTGCTTCATTTCCATCGAGCGCGGACTACTGAAAATCTGCGATAATCAACGACGGCTGCTCACGCAGGCAAGGCGCACCACAAACCGTCTTTACATCCTGGAGTTAGAGATAGACCAACCCGTTAGCCTCTCGGCCAAGACCGAAGAGGTATCTTGGAGGTGGCACGCAAGGTACGGACACTTAAACTTTCCTGCCCTAGAAAAGCTACAGAAGAAGGAGTTGGTGCACGGCTTGCCAGAAATCAAAGGCGTGAACAAGCTGTGCGACGGGTGCTTCATAGGCAAACAGAGGCGCACACCCTTTCCGTCTCGAACAGCCTACCGAGCCGATGAGCCATTGGAGCTTGTACACGGCGATATTTGCGGGCCCATCAAGCCGGCGACCCCAGGCGGTAAGAGTCTCTTCCTCCTGTTAGTCGATGACAAAAGCCGCTTCATGTGGCTGACTCTGCTGCAAGCGAAAAGTGAGGCGGCAGAGGCGATTAAGCGCATTAAAGCGCGAGCGGAGGCCGAATGTGAGAAGAAGATGCGAGTGCTGCGCACAGACCGAGGCGGAGAATTCACCTCGGCAAGTTTCAATAAGTACTGCGACGAGATCGGCATACAACGGCACCTAACGGCGCCCTACTCCCCCCAACAGAACGGAGTGGTAGAGCGCCGAAATCAGACCATTGTCGGGACAGCGAGGTCATTGTTGGTGACGGCCGGGATGCCTGGGAGATTCTGGGGAGAGGCAGTAATGACGGCTGTTTATCTCCTCAATCGGTCACCAACCCGAAGCCTCGACGGGAAGACGCCATATGAAGCCTGGTACAACAAAAAACCAACAGTACATCATTTTCGCGTGTTCGGCTGCGTCGCATACATGAAGGTAACACGTCCTCATCTCGCCAAGCTCGATCCCAGGGGGCTGAAGGTCGTCTTCATCGGCTACGAACCCGGGAGCAAGGCGTACAGACTCTATGATCCTGTGGGGGGGCGAGCTCACGTGTCTCGCGACGTCGTCTTCGACGAAAGCACCTTCTGGCAGTGGAATGACGTGATCGAGACAGACCGTAATCCAAATCAATTCACGGTGGAGTACCTCGTCACCGAGCCTGAAGAAGGAGGAGCCCAGCATCAGGAGACGTCACCGCCGCCAGCAGGTGCACCACCTGAACCAGTGGAATTCGCAACACCACGGACTGCGGATTCGACGCTGGATGCCGATCACGATACTGATCTGGAGGCTAGGTACCGGAGGATGGATGACCTAGTGGGAGGAGGTGAACCACCTGGACTAGCAGCGCGCGAGCTCGAGGAAGTGGCCGAACTACATGCCGTCAGTGCAGATGAACCAAACACCTTCGCCGAAGCAGAAAAGAACCCGTGCTGGCGGAAGGCAATGCAGGAGGAGATGACATCCATCACTGAGAACCAGACGTGGAGTCTGGAGGATATGCCACCGGGACACCGAGCCATAGGGCTCAAATGGGTCTTCAAACTGAAGCGCAACGAAAAAGGAGAAGTTGTGAAGCACAAGGCTCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGATTT
CGAAGAGGTATTTGCGCCAGTAGCAAGGTTAGAATCCGTTCGTTTCCTGCTGGCAATTGCAGCACATCACTCTTGGGAGGTTCACCATATGGACGTAAAGTCTGCTTTCCTTAACGGAGAGTTGAAGGAGACCGTCTATGTTCGACAACCACCTGGCTTCCTGGATAACGACAACCCTAATAAGGTACTGCGCCTGCACAAAGCACTCTACGGGCTTCGACAAGCCCCACGAGCCTGGAACGCGAAGCTCGACAGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGAGCATGGCATGTACACGTACGGCCACGGCAAAAAGCGACTGATCGTGGGAGTGTACGTCGACGACCTCATAATCACTGGAGGCGACGTGGGAGTCCTCGGAAGGTTCAAGAAGGAGATGTCGAAAAACTTCGAGATGAGCGATCTCGGTGTGCTCAGCTACTACCTCGGCATCGAAGTGCAACAGAACTCTTCCGGCATCTCCATCTGCCAAAGTGCATACGCGAGGAAGCTGCTGGACACAACTGGGCTTGTGGACAGTAATCCTACCAGGACGCCAATGGAGGCCCGACTTCAACTAAGGAAGGCCGGCACTACGACGACAGTCGACTCCACCAATTACCGCAGCATTGTCGGGAGTCTGCGCTATCTGGTAAACTCTCGTCCTGATCTTGCTTATTCTGTTGGATATGTGAGCAGGTTTATGGAAGCACCTAGGGAGGAGCATCTAGTGGCTGTCAAGCGCATCCTGCGCTATGTAGCCGGAACCAGAGGCTGGGGCGTAAGATACTGCGCTGGGAGCGAAAAGGAGAAACTCAAACTGGTCGGCTACAGCGATAGTGACATGGCCGGTGACGTTGATGATCGTAAGAGCACCAGCGGAATGATCTACTTTCTCTCAGGCGGCGCGATCTGCTGGCAATCAACAAAACAGAAGGTAGTTGCTTTGTCTTCCTGCGAAGCAGAATACATCGCCGCTTCGATGGCAGCAACTCAAGGGATCTGGCTTGCACGACTAATGGAAGAACTCATCGGGAGAGAAAGCGATTCACCAATGCTATACGTAGACAACAAAGCTACGATCTCCCTGATCAAAAATCCAGTTTTGCACGACCGGAGCAAGCACATAGAAACCAGATTCCACTACATTCGAGAATGTGCAGATCGAGGGCTCATCAAAATTGATTTCATCCGAACAGAGGAACAACTTGGAGACATTTTCACCAAATCCCTGGCGCGGGTGAAATTTGAAGAACTACGCTCAAAGATCGGAGTTCAAATAATAAGGTACAGCTTTTAG

Coding sequence (CDS)

ATGTGGCTCAATCCGAGGAGGACGAGCCAGCTCTCTTCATGCAAAGCTGGCACCAACTGGGATGTTGAACCCAAGAAAGAGCTCCAGTTGGGCGTAGCAAAGCTGGCACCAACTGGGGAGCCAATTCAACTGGAGGAGGAGCGAGTGTTCGCTCAGATCGGCGAAAGGGACGAGCAACACGAGCACCAGCAATGGATCCTTGACACAGGGGCAACAAACCATATGACCGGGGCTAGATCTGCGTTCTCCGAGCTCGACTCGGGGATCCGTGGGACGGTGAAATTCGGCGACGGCTCCGTCGTCGAGATCGAAGGGCGCGGCACCATCCTGTTCGTCAGTAAGGGAGGCGAGCATCGCAAGCTGACCGACGTCTACTTCATCCCGAGGCTCAAGGCTAACCTTGTGAGCCTGGGTCAGCTCGATGAGACAGGCTGCTTCATTTCCATCGAGCGCGGACTACTGAAAATCTGCGATAATCAACGACGGCTGCTCACGCAGGCAAGGCGCACCACAAACCGTCTTTACATCCTGGAGTTAGAGATAGACCAACCCGTTAGCCTCTCGGCCAAGACCGAAGAGGTATCTTGGAGGTGGCACGCAAGGTACGGACACTTAAACTTTCCTGCCCTAGAAAAGCTACAGAAGAAGGAGTTGGTGCACGGCTTGCCAGAAATCAAAGGCGTGAACAAGCTGTGCGACGGGTGCTTCATAGGCAAACAGAGGCGCACACCCTTTCCGTCTCGAACAGCCTACCGAGCCGATGAGCCATTGGAGCTTGTACACGGCGATATTTGCGGGCCCATCAAGCCGGCGACCCCAGGCGGTAAGAGTCTCTTCCTCCTGTTAGTCGATGACAAAAGCCGCTTCATGTGGCTGACTCTGCTGCAAGCGAAAAGTGAGGCGGCAGAGGCGATTAAGCGCATTAAAGCGCGAGCGGAGGCCGAATGTGAGAAGAAGATGCGAGTGCTGCGCACAGACCGAGGCGGAGAATTCACCTCGGCAAGTTTCAATAAGTACTGCGACGAGATCGGCATACAACGGCACCTAACGGCGCCCTACTCCCCCCAACAGAACGGAGTGGTAGAGCGCCGAAATCAGACCATTGTCGGGACAGCGAGGTCATTGTTGGTGACGGCCGGGATGCCTGGGAGATTCTGGGGAGAGGCAGTAATGACGGCTGTTTATCTCCTCAATCGGTCACCAACCCGAAGCCTCGACGGGAAGACGCCATATGAAGCCTGGTACAACAAAAAACCAACAGTACATCATTTTCGCGTGTTCGGCTGCGTCGCATACATGAAGGTAACACGTCCTCATCTCGCCAAGCTCGATCCCAGGGGGCTGAAGGTCGTCTTCATCGGCTACGAACCCGGGAGCAAGGCGTACAGACTCTATGATCCTGTGGGGGGGCGAGCTCACGTGTCTCGCGACGTCGTCTTCGACGAAAGCACCTTCTGGCAGTGGAATGACGTGATCGAGACAGACCGTAATCCAAATCAATTCACGGTGGAGTACCTCGTCACCGAGCCTGAAGAAGGAGGAGCCCAGCATCAGGAGACGTCACCGCCGCCAGCAGGTGCACCACCTGAACCAGTGGAATTCGCAACACCACGGACTGCGGATTCGACGCTGGATGCCGATCACGATACTGATCTGGAGGCTAGGTACCGGAGGATGGATGACCTAGTGGGAGGAGGTGAACCACCTGGACTAGCAGCGCGCGAGCTCGAGGAAGTGGCCGAACTACATGCCGTCAGTGCAGATGAACCAAACACCTTCGCCGAAGCAGAAAAGAACCCGTGCTGGCGGAAGGCAATGCAGGAGGAGATGACATCCATCACTGAGAACCAGACGTGGAGTCTGGAGGATATGCCACCGGGACACCGAGCCATAGGGCTCAAATGGGTCTTCAAACTGAAGCGCAACGAAAAAGGAGAAGTTGTGAAGCACAAGGCTCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGATTT
CGAAGAGGTATTTGCGCCAGTAGCAAGGTTAGAATCCGTTCGTTTCCTGCTGGCAATTGCAGCACATCACTCTTGGGAGGTTCACCATATGGACGTAAAGTCTGCTTTCCTTAACGGAGAGTTGAAGGAGACCGTCTATGTTCGACAACCACCTGGCTTCCTGGATAACGACAACCCTAATAAGGTACTGCGCCTGCACAAAGCACTCTACGGGCTTCGACAAGCCCCACGAGCCTGGAACGCGAAGCTCGACAGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGAGCATGGCATGTACACGTACGGCCACGGCAAAAAGCGACTGATCGTGGGAGTGTACGTCGACGACCTCATAATCACTGGAGGCGACGTGGGAGTCCTCGGAAGGTTCAAGAAGGAGATGTCGAAAAACTTCGAGATGAGCGATCTCGGTGTGCTCAGCTACTACCTCGGCATCGAAGTGCAACAGAACTCTTCCGGCATCTCCATCTGCCAAAGTGCATACGCGAGGAAGCTGCTGGACACAACTGGGCTTGTGGACAGTAATCCTACCAGGACGCCAATGGAGGCCCGACTTCAACTAAGGAAGGCCGGCACTACGACGACAGTCGACTCCACCAATTACCGCAGCATTGTCGGGAGTCTGCGCTATCTGGTAAACTCTCGTCCTGATCTTGCTTATTCTGTTGGATATGTGAGCAGGTTTATGGAAGCACCTAGGGAGGAGCATCTAGTGGCTGTCAAGCGCATCCTGCGCTATGTAGCCGGAACCAGAGGCTGGGGCGTAAGATACTGCGCTGGGAGCGAAAAGGAGAAACTCAAACTGGTCGGCTACAGCGATAGTGACATGGCCGGTGACGTTGATGATCGTAAGAGCACCAGCGGAATGATCTACTTTCTCTCAGGCGGCGCGATCTGCTGGCAATCAACAAAACAGAAGGTAGTTGCTTTGTCTTCCTGCGAAGCAGAATACATCGCCGCTTCGATGGCAGCAACTCAAGGGATCTGGCTTGCACGACTAATGGAAGAACTCATCGGGAGAGAAAGCGATTCACCAATGCTATACGTAGACAACAAAGCTACGATCTCCCTGATCAAAAATCCAGTTTTGCACGACCGGAGCAAGCACATAGAAACCAGATTCCACTACATTCGAGAATGTGCAGATCGAGGGCTCATCAAAATTGATTTCATCCGAACAGAGGAACAACTTGGAGACATTTTCACCAAATCCCTGGCGCGGGTGAAATTTGAAGAACTACGCTCAAAGATCGGAGTTCAAATAATAAGGTACAGCTTTTAG
BLAST of CmoCh05G000040 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 667.2 bits (1720), Expect = 3.2e-190
Identity = 402/1052 (38.21%), Positives = 592/1052 (56.27%), Query Frame = 1

Query: 64   QWILDTGATNHMTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFVSKGGEHRKLTD 123
            +W++DT A++H T  R  F    +G  GTVK G+ S  +I G G I   +  G    L D
Sbjct: 293  EWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKD 352

Query: 124  VYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYILELEIDQ 183
            V  +P L+ NL+S   LD  G          ++      +     R T  LY    EI Q
Sbjct: 353  VRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGT--LYRTNAEICQ 412

Query: 184  PVSLSAKTEEVSWR-WHARYGHLNFPALEKLQKKELVHGLPEIKGVN-KLCDGCFIGKQR 243
               L+A  +E+S   WH R GH++   L+ L KK L+      KG   K CD C  GKQ 
Sbjct: 413  G-ELNAAQDEISVDLWHKRMGHMSEKGLQILAKKSLIS---YAKGTTVKPCDYCLFGKQH 472

Query: 244  RTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKSEA 303
            R  F + ++ R    L+LV+ D+CGP++  + GG   F+  +DD SR +W+ +L+ K + 
Sbjct: 473  RVSFQT-SSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQV 532

Query: 304  AEAIKRIKARAEAECEKKMRVLRTDRGGEFTSASFNKYCDEIGIQRHLTAPYSPQQNGVV 363
             +  ++  A  E E  +K++ LR+D GGE+TS  F +YC   GI+   T P +PQ NGV 
Sbjct: 533  FQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVA 592

Query: 364  ERRNQTIVGTARSLLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTV 423
            ER N+TIV   RS+L  A +P  FWGEAV TA YL+NRSP+  L  + P   W NK+ + 
Sbjct: 593  ERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSY 652

Query: 424  HHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFD 483
             H +VFGC A+  V +    KLD + +  +FIGY      YRL+DPV  +   SRDVVF 
Sbjct: 653  SHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFR 712

Query: 484  ESTFWQWNDVIETDRN---PNQFTVEYLVTEPEEGGAQHQETSPPPAGAPPEPVEFATPR 543
            ES      D+ E  +N   PN  T+      P    +   E S    G  P  V     +
Sbjct: 713  ESEVRTAADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVS--EQGEQPGEVIEQGEQ 772

Query: 544  TADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEA-- 603
              +   + +H T  E +++ +       E P + +R       +      EP +  E   
Sbjct: 773  LDEGVEEVEHPTQGEEQHQPL----RRSERPRVESRRYPSTEYVLISDDREPESLKEVLS 832

Query: 604  --EKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARL 663
              EKN    KAMQEEM S+ +N T+ L ++P G R +  KWVFKLK++   ++V++KARL
Sbjct: 833  HPEKNQL-MKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARL 892

Query: 664  VAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYV 723
            V KG+ QK+G+DF+E+F+PV ++ S+R +L++AA    EV  +DVK+AFL+G+L+E +Y+
Sbjct: 893  VVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYM 952

Query: 724  RQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDSTLLSLNFKRCASEHGMYTYGHG 783
             QP GF      + V +L+K+LYGL+QAPR W  K DS + S  + +  S+  +Y     
Sbjct: 953  EQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFS 1012

Query: 784  KKR-LIVGVYVDDLIITGGDVGVLGRFKKEMSKNFEMSDLGVLSYYLGIEV--QQNSSGI 843
            +   +I+ +YVDD++I G D G++ + K ++SK+F+M DLG     LG+++  ++ S  +
Sbjct: 1013 ENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKL 1072

Query: 844  SICQSAYARKLLDTTGLVDSNPTRTPMEARLQLRKAGTTTTVDSTN------YRSIVGSL 903
             + Q  Y  ++L+   + ++ P  TP+   L+L K    TTV+         Y S VGSL
Sbjct: 1073 WLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSL 1132

Query: 904  RY-LVNSRPDLAYSVGYVSRFMEAPREEHLVAVKRILRYVAGTRGWGVRYCAGSEKEKLK 963
             Y +V +RPD+A++VG VSRF+E P +EH  AVK ILRY+ GT G  +  C G     LK
Sbjct: 1133 MYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCL--CFGGSDPILK 1192

Query: 964  LVGYSDSDMAGDVDDRKSTSGMIYFLSGGAICWQSTKQKVVALSSCEAEYIAASMAATQG 1023
              GY+D+DMAGD+D+RKS++G ++  SGGAI WQS  QK VALS+ EAEYIAA+    + 
Sbjct: 1193 --GYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEM 1252

Query: 1024 IWLARLMEELIGRESDSPMLYVDNKATISLIKNPVLHDRSKHIETRFHYIRECADRGLIK 1083
            IWL R ++EL G      ++Y D+++ I L KN + H R+KHI+ R+H+IRE  D   +K
Sbjct: 1253 IWLKRFLQEL-GLHQKEYVVYCDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLK 1312

Query: 1084 IDFIRTEEQLGDIFTKSLARVKFEELRSKIGV 1097
            +  I T E   D+ TK + R KFE  +  +G+
Sbjct: 1313 VLKISTNENPADMLTKVVPRNKFELCKELVGM 1325

BLAST of CmoCh05G000040 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 351.3 bits (900), Expect = 3.9e-95
Identity = 189/525 (36.00%), Positives = 298/525 (56.76%), Query Frame = 1

Query: 579  VAELHAVSADEPNTFAEAE---KNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKW 638
            V   H +  D PN+F E +       W +A+  E+ +   N TW++   P     +  +W
Sbjct: 880  VLNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRW 939

Query: 639  VFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVH 698
            VF +K NE G  +++KARLVA+G+ QK  +D+EE FAPVAR+ S RF+L++   ++ +VH
Sbjct: 940  VFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNLKVH 999

Query: 699  HMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDSTLL 758
             MDVK+AFLNG LKE +Y+R P G   + N + V +L+KA+YGL+QA R W    +  L 
Sbjct: 1000 QMDVKTAFLNGTLKEEIYMRLPQGI--SCNSDNVCKLNKAIYGLKQAARCWFEVFEQALK 1059

Query: 759  SLNFKRCASEHGMYTYGHG--KKRLIVGVYVDDLIITGGDVGVLGRFKKEMSKNFEMSDL 818
               F   + +  +Y    G   + + V +YVDD++I  GD+  +  FK+ + + F M+DL
Sbjct: 1060 ECEFVNSSVDRCIYILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDL 1119

Query: 819  GVLSYYLGIEVQQNSSGISICQSAYARKLLDTTGLVDSNPTRTPMEARLQLRKAGTTTTV 878
              + +++GI ++     I + QSAY +K+L    + + N   TP+ +++      +    
Sbjct: 1120 NEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELLNSDEDC 1179

Query: 879  DSTNYRSIVGSLRY-LVNSRPDLAYSVGYVSRFMEAPREEHLVAVKRILRYVAGTRGWGV 938
             +T  RS++G L Y ++ +RPDL  +V  +SR+      E    +KR+LRY+ GT    +
Sbjct: 1180 -NTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKL 1239

Query: 939  RYCAGSEKEKLKLVGYSDSDMAGDVDDRKSTSGMIY-FLSGGAICWQSTKQKVVALSSCE 998
             +      E  K++GY DSD AG   DRKST+G ++       ICW + +Q  VA SS E
Sbjct: 1240 IFKKNLAFEN-KIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTE 1299

Query: 999  AEYIAASMAATQGIWLARLMEELIGRESDSPMLYVDNKATISLIKNPVLHDRSKHIETRF 1058
            AEY+A   A  + +WL  L+  +  +  +   +Y DN+  IS+  NP  H R+KHI+ ++
Sbjct: 1300 AEYMALFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCISIANNPSCHKRAKHIDIKY 1359

Query: 1059 HYIRECADRGLIKIDFIRTEEQLGDIFTKSLARVKFEELRSKIGV 1097
            H+ RE     +I +++I TE QL DIFTK L   +F ELR K+G+
Sbjct: 1360 HFAREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLGL 1400

BLAST of CmoCh05G000040 vs. Swiss-Prot
Match: YCH4_YEAST (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY5A PE=5 SV=2)

HSP 1 Score: 204.1 bits (518), Expect = 7.7e-51
Identity = 114/305 (37.38%), Positives = 173/305 (56.72%), Query Frame = 1

Query: 697 MDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDSTLLS 756
           MDV +AFLN  + E +YV+QPPGF++  NP+ V  L+  +YGL+QAP  WN  +++TL  
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 757 LNFKRCASEHGMYTYGHGKKRLIVGVYVDDLIITGGDVGVLGRFKKEMSKNFEMSDLGVL 816
           + F R   EHG+Y        + + VYVDDL++      +  R K+E++K + M DLG +
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 817 SYYLGIEVQQNSSG-ISICQSAYARKLLDTTGLVDSNPTRTPMEARLQLRKAGTTTTVDS 876
             +LG+ + Q+S+G I++    Y  K    + +     T+TP+     L +  +    D 
Sbjct: 121 DKFLGLNIHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKDI 180

Query: 877 TNYRSIVGSLRYLVNS-RPDLAYSVGYVSRFMEAPREEHLVAVKRILRYVAGTRGWGVRY 936
           T Y+SIVG L +  N+ RPD++Y V  +SRF+  PR  HL + +R+LRY+  TR   ++Y
Sbjct: 181 TPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLKY 240

Query: 937 CAGSEKEKLKLVGYSDSDMAGDVDDRKSTSGMIYFLSGGAICWQSTKQK-VVALSSCEAE 996
            +GS+   L L  Y D+      D   ST G +  L+G  + W S K K V+ + S EAE
Sbjct: 241 RSGSQ---LALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPSTEAE 300

Query: 997 YIAAS 999
           YI AS
Sbjct: 301 YITAS 302

BLAST of CmoCh05G000040 vs. Swiss-Prot
Match: M810_ARATH (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 150.6 bits (379), Expect = 1.0e-34
Identity = 81/225 (36.00%), Positives = 128/225 (56.89%), Query Frame = 1

Query: 782  VYVDDLIITGGDVGVLGRFKKEMSKNFEMSDLGVLSYYLGIEVQQNSSGISICQSAYARK 841
            +YVDD+++TG    +L     ++S  F M DLG + Y+LGI+++ + SG+ + Q+ YA +
Sbjct: 5    LYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYAEQ 64

Query: 842  LLDTTGLVDSNPTRTPMEARLQLRKAGTTTTVDSTNYRSIVGSLRYLVNSRPDLAYSVGY 901
            +L+  G++D  P  TP+  +L      T    D +++RSIVG+L+YL  +RPD++Y+V  
Sbjct: 65   ILNNAGMLDCKPMSTPLPLKLN-SSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAVNI 124

Query: 902  VSRFMEAPREEHLVAVKRILRYVAGTRGWGVRYCAGSEKEKLKLVGYSDSDMAGDVDDRK 961
            V + M  P       +KR+LRYV GT   G+      +  KL +  + DSD AG    R+
Sbjct: 125  VCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYI---HKNSKLNVQAFCDSDWAGCTSTRR 184

Query: 962  STSGMIYFLSGGAICWQSTKQKVVALSSCEAEYIAASMAATQGIW 1007
            ST+G   FL    I W + +Q  V+ SS E EY A ++ A +  W
Sbjct: 185  STTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CmoCh05G000040 vs. Swiss-Prot
Match: YD14B_YEAST (Transposon Ty1-DR5 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY1B-DR5 PE=3 SV=1)

HSP 1 Score: 119.0 bits (297), Expect = 3.2e-25
Identity = 128/518 (24.71%), Positives = 238/518 (45.95%), Query Frame = 1

Query: 604  KAMQEEMTSITENQTWSLE------DMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAK 663
            +A  +E+  + +  TW  +      ++ P  R I   ++F  KR+       HKAR VA+
Sbjct: 1252 EAYHKEVNQLLKMNTWDTDKYYDRKEIDP-KRVINSMFIFNRKRDGT-----HKARFVAR 1311

Query: 664  GYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQP 723
            G +Q        + +      ++   L++A  +++ +  +D+ SA+L  ++KE +Y+R P
Sbjct: 1312 GDIQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEELYIRPP 1371

Query: 724  PGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDSTLLSLNFKRCASEHGM-YTYGHGKK 783
            P    ND   K++RL K+LYGL+Q+   W   + S L+    K+C  E    ++      
Sbjct: 1372 PHLGMND---KLIRLKKSLYGLKQSGANWYETIKSYLI----KQCGMEEVRGWSCVFKNS 1431

Query: 784  RLIVGVYVDDLIITGGDVG----VLGRFKKEMSK---NFEMSDLGVLSYYLGIEVQ-QNS 843
            ++ + ++VDD+I+   D+     ++   KK+      N   SD  +    LG+E++ Q  
Sbjct: 1432 QVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDILGLEIKYQRG 1491

Query: 844  SGISI-CQSAYARKLLDTTGLVDSNPTRTPMEARLQ--LRKAGTTTTVDSTNYRSIVGSL 903
              + +  +++   K+      V  NP    + A  Q  L        +D   Y+  V  +
Sbjct: 1492 KYMKLGMENSLTEKIPKLN--VPLNPKGRKLSAPGQPGLYIDQDELEIDEDEYKEKVHEM 1551

Query: 904  RYLV--------NSRPDLAYSVGYVSRFMEAPREEHLVAVKRILRYVAGTRGWGVRYCAG 963
            + L+          R DL Y +  +++ +  P  + L     +++++  TR   + +   
Sbjct: 1552 QKLIGLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRDKQLIWHKN 1611

Query: 964  SEKE-KLKLVGYSDSDMAGDVDDRKSTSGMIYFLSGGAICWQSTKQKVVALSSCEAEYIA 1023
               E   KLV  SD+   G+    KS  G IY L+G  I  +STK  +   S+ EAE  A
Sbjct: 1612 KPTEPDNKLVAISDASY-GNQPYYKSQIGNIYLLNGKVIGGKSTKASLTCTSTTEAEIHA 1671

Query: 1024 ASMAATQGIWLARLMEELIGRESDSPMLYVDNKATIS-LIKNPVLHDRSKHIETRFHYIR 1083
             S +      L+ L++EL  ++  +  L  D+K+TIS +I N     R++   T+   +R
Sbjct: 1672 ISESVPLLNNLSHLVQEL-NKKPITKGLLTDSKSTISIIISNNEEKFRNRFFGTKAMRLR 1731

Query: 1084 ECADRGLIKIDFIRTEEQLGDIFTKSLARVKFEELRSK 1094
            +      + + +I T++ + D+ TK L    F+ L +K
Sbjct: 1732 DEVSGNHLHVCYIETKKNIADVMTKPLPIKTFKLLTNK 1752

BLAST of CmoCh05G000040 vs. TrEMBL
Match: Q7XPB1_ORYSJ (OSJNBb0026E15.10 protein OS=Oryza sativa subsp. japonica GN=OSJNBb0026E15.10 PE=4 SV=2)

HSP 1 Score: 1363.2 bits (3527), Expect = 0.0e+00
Identity = 678/1077 (62.95%), Positives = 829/1077 (76.97%), Query Frame = 1

Query: 36   APTGEPIQLEEERVFAQIGERDEQHEHQQWILDTGATNHMTGARSAFSELDSGIRGTVKF 95
            +P GE + + E +VFAQ+ +  E H+   WILDTGATNHMTG+RSAF+ELD+ + GTV+F
Sbjct: 368  SPIGE-LAVVEAKVFAQLDDGGE-HDPAMWILDTGATNHMTGSRSAFAELDTAVTGTVRF 427

Query: 96   GDGSVVEIEGRGTILFVSKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLK 155
            GDGSVV IEGR T+LF  + GEHR +  VY+IPRL AN+VSLGQLD +G  + I  G+L 
Sbjct: 428  GDGSVVRIEGRVTVLFSCRFGEHRGIAGVYYIPRLTANIVSLGQLDRSGSKVLIHHGILH 487

Query: 156  ICDNQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQK 215
            + D +  LL + RR+ + LY ++L+ID+PV L+A++ E +WRWHARYGHLNFPAL KL +
Sbjct: 488  VWDPRGHLLVRVRRSDDCLYTIKLDIDRPVCLAARSAEPAWRWHARYGHLNFPALRKLAQ 547

Query: 216  KELVHGLPEIKGVNKLCDGCFIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGG 275
            +E+V GLP ++ V ++CDGC +GKQRR  FP+++ YRADE LELVHGD+CGPI+PATP G
Sbjct: 548  QEMVRGLPLLQQVTQVCDGCLLGKQRRAAFPTQSKYRADEHLELVHGDLCGPIEPATPAG 607

Query: 276  KSLFLLLVDDKSRFMWLTLLQAKSEAAEAIKRIKARAEAECEKKMRVLRTDRGGEFTSAS 335
               FLLLVDD SR+MWLT++++K EAA AIK  +ARAE E  +K+R LR DRG EFTS  
Sbjct: 608  NRYFLLLVDDMSRYMWLTMIRSKDEAANAIKHFQARAEVESGRKLRALRMDRGSEFTSIE 667

Query: 336  FNKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFWGEAVMTAVY 395
            F +YC  +G+ R LTAPYSPQQNGVVERRNQTIV TARS++   G+PGRFWGEA+ TAV+
Sbjct: 668  FGEYCANLGVGRQLTAPYSPQQNGVVERRNQTIVATARSMMKAKGVPGRFWGEAMSTAVF 727

Query: 396  LLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGY 455
            LLNRSPT+SLD +TPYEAWY + P VH  R FGCV ++K+T+P L KLD R   +V +GY
Sbjct: 728  LLNRSPTKSLDNQTPYEAWYGQWPAVHFLRTFGCVGHVKITKPGLKKLDDRSAPMVLLGY 787

Query: 456  EPGSKAYRLYDPVGGRAHVSRDVVFDESTFWQWNDVIETDRNPN--QFTVEYLVT----- 515
            E GSKAYRLYDPV  R HVSRDVVFDE   W W  V   D  P    FTVE +VT     
Sbjct: 788  EQGSKAYRLYDPVSERVHVSRDVVFDEDIAWDWGPVTP-DGAPQLEPFTVEQVVTTTIGT 847

Query: 516  ---------EPEEGGAQHQETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRRM 575
                      P         T  PP+   PE VEF TP T DS LDAD D D+  RYR +
Sbjct: 848  APASSPTPPSPPSPAPSAPTTPAPPSPPSPEAVEFVTPPTQDSILDADADDDVVPRYRLV 907

Query: 576  DDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTW 635
            D+L+G   PPG A R LE++ ELH VSADEP + AEAE +P WR AMQ+E+ +I +N TW
Sbjct: 908  DNLLGNASPPGHAPRVLEQL-ELHVVSADEPASLAEAEADPSWRGAMQDELNAIVDNDTW 967

Query: 636  SLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLES 695
            SL D+P GHRAIGLKWV+KLKR+E+G +V++KARLVAKGYVQ+QGVDF+EVFA VARLES
Sbjct: 968  SLTDLPHGHRAIGLKWVYKLKRDEQGAIVRYKARLVAKGYVQRQGVDFDEVFALVARLES 1027

Query: 696  VRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGL 755
            VR LLA+AAH  W+VHHMDVKSAFLNGEL E VYV QPPGF+D+++ NKV RLHKALYGL
Sbjct: 1028 VRLLLAVAAHQGWQVHHMDVKSAFLNGELLEEVYVSQPPGFVDDNHKNKVYRLHKALYGL 1087

Query: 756  RQAPRAWNAKLDSTLLSLNFKRCASEHGMYTYGHGKKRLIVGVYVDDLIITGGDVGVLGR 815
            RQAPRAWNAKLDS+LLSL F R +SEHG+YT   G +RL+VGVYVDDLIITG     +  
Sbjct: 1088 RQAPRAWNAKLDSSLLSLGFHRSSSEHGVYTRTRGGRRLMVGVYVDDLIITGDHDDEIRS 1147

Query: 816  FKKEMSKNFEMSDLGVLSYYLGIEVQQNSSGISICQSAYARKLLDTTGLVDSNPTRTPME 875
            FK EM K F+MSDLG L YYLGIEV Q+S GI++ Q+AYA K+L+  GL D NP +TPME
Sbjct: 1148 FKGEMMKLFKMSDLGALRYYLGIEVTQDSDGITLGQAAYAGKILEKAGLKDCNPCQTPME 1207

Query: 876  ARLQLRKAGTTTTVDSTNYRSIVGSLRYLVNSRPDLAYSVGYVSRFMEAPREEHLVAVKR 935
             RL+LRK      VD+T YRS+VGSLRYLVN+RPDLA+SVGYVSRFME+PRE+HL AV+R
Sbjct: 1208 VRLKLRKGSDFPLVDATLYRSLVGSLRYLVNTRPDLAFSVGYVSRFMESPREDHLAAVRR 1267

Query: 936  ILRYVAGTRGWGVRYCAGSEKEKLKLVGYSDSDMAGDVDDRKSTSGMIYFLSGGAICWQS 995
            ILRYVAGTR WG+R+  G+      LVGYSDSD+AGD D+RKSTSG I+F++GG + WQS
Sbjct: 1268 ILRYVAGTRCWGIRFGPGARCALPMLVGYSDSDLAGDPDERKSTSGQIFFINGGPVTWQS 1327

Query: 996  TKQKVVALSSCEAEYIAASMAATQGIWLARLMEELIGRESDSPMLYVDNKATISLIKNPV 1055
            +KQKVVALSSCEAEYIAA+ A  QG+WLARL+ E++G E  +P+L VDN++TISLIKNPV
Sbjct: 1328 SKQKVVALSSCEAEYIAAAAATCQGVWLARLLAEVLGDEITAPLLKVDNQSTISLIKNPV 1387

Query: 1056 LHDRSKHIETRFHYIRECADRGLIKIDFIRTEEQLGDIFTKSLARVKFEELRSKIGV 1097
             HDRSKHI+ ++HYIRECA++ LI++ F+ T EQLGDIFTKSL R +F+ELRSKIGV
Sbjct: 1388 HHDRSKHIDVKYHYIRECAEKKLIEMMFVGTAEQLGDIFTKSLGRTRFQELRSKIGV 1440

BLAST of CmoCh05G000040 vs. TrEMBL
Match: Q0J5Y3_ORYSJ (Os08g0389500 protein OS=Oryza sativa subsp. japonica GN=Os08g0389500 PE=4 SV=1)

HSP 1 Score: 1288.5 bits (3333), Expect = 0.0e+00
Identity = 647/1095 (59.09%), Positives = 801/1095 (73.15%), Query Frame = 1

Query: 37   PTGEPIQLEEERVFAQIGERDEQHEHQQWILDTGATNHMTGARSAFSELDSGIRGTVKFG 96
            P  E I L+E ++F Q+G  +   E  +WILDTGATNHMTG RSAFSEL++GIRGTVKFG
Sbjct: 362  PALEQIHLDESKLFVQLGG-EHGGEATRWILDTGATNHMTGTRSAFSELNTGIRGTVKFG 421

Query: 97   DGSVVEIEGRGTILFVSKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKI 156
            DGSVV IEGRGT+LF  K GEH+ L  VY IPRL  N+VSLGQLDE     S E G+LKI
Sbjct: 422  DGSVVGIEGRGTVLFKCKDGEHQALEGVYHIPRLTTNIVSLGQLDEEKFKWSCEDGVLKI 481

Query: 157  CDNQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKK 216
             + QRRLL +  R+ NRLY+++L I +PV L+A+  +++WRWHAR+GHLNF ALEKL + 
Sbjct: 482  WNKQRRLLAKVVRSPNRLYVVKLNIGRPVCLAAQGGDIAWRWHARFGHLNFRALEKLGRA 541

Query: 217  ELVHGLPEIKGVNKLCDGCFIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGK 276
             +V GLP I  V+++CD C +GKQRR PFPS+  YRA E LELVHGDICGP+ PATP G 
Sbjct: 542  VMVRGLPLINHVDQVCDSCLVGKQRRLPFPSKAKYRAKEKLELVHGDICGPVTPATPSGN 601

Query: 277  SLFLLLVDDKSRFMWLTLLQAKSEAAEAIKRIKARAEAECEKKMRVLRTDRGGEFTSASF 336
             LFLLLVDD SR+MWL LL +K +A+ AIKR  A AEAE  +K+R LRTDRGGEFT+ +F
Sbjct: 602  KLFLLLVDDLSRYMWLILLSSKDQASVAIKRFLACAEAEAGRKLRTLRTDRGGEFTAHAF 661

Query: 337  NKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFWGEAVMTAVYL 396
             +YC E GIQRHLTAPY+PQQNGVVERRNQT++G ARS++    +PG FWGEAV TAV+L
Sbjct: 662  AEYCAEHGIQRHLTAPYTPQQNGVVERRNQTVMGMARSMMKAKSLPGWFWGEAVNTAVFL 721

Query: 397  LNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYE 456
            LNR+PT+ +DGKTP+E W+  KP VH  R FGCVA++K     LAKLD R + +VF+GYE
Sbjct: 722  LNRAPTQCVDGKTPFEVWHGVKPPVHFLRTFGCVAHVKNGGQRLAKLDDRSMPMVFVGYE 781

Query: 457  PGSKAYRLYDPVGGRAHVSRDVVFDESTFWQWNDVIETDRNPNQ----FTVEYLVTEPE- 516
             G+KAYR Y+PV  R HVSRD VF+E   W+W    E    P+     F VE+L T P  
Sbjct: 782  AGTKAYRFYNPVSRRVHVSRDAVFEEERSWEWG--AEKGAGPDDDIEPFVVEHLATGPTG 841

Query: 517  EGG-------AQHQETSPP-----------------------PAGAPPEPVEFATPRTAD 576
            +GG       A  + TS P                       PA A    +EFA+P   D
Sbjct: 842  QGGPVAATPTATQRSTSAPAPMAPPATPSQAGTPTHGAGPRTPASASSPAIEFASPPQGD 901

Query: 577  STLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPC 636
              LD DHD D+  R+R +D+L+G   PPGLA RE+ E   L     DEP T  EA++   
Sbjct: 902  LDLDNDHDDDVPLRFRTVDNLLGASSPPGLAEREVTE--GLMVAIEDEPATAEEAKQVKE 961

Query: 637  WRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQ 696
            WR+AM EEM SI  N+TWSL ++P G RAIGLKWVFK+K++E G + KHKARLVAKGYVQ
Sbjct: 962  WREAMIEEMASIEHNKTWSLVELPAGQRAIGLKWVFKIKKDEHGNITKHKARLVAKGYVQ 1021

Query: 697  KQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFL 756
            +QG+D+EEVFAPVAR+ESVR LLA+AAH SW VHHMDVKSAFLNG+L E VYV+QPPGF+
Sbjct: 1022 RQGIDYEEVFAPVARIESVRVLLAVAAHRSWSVHHMDVKSAFLNGDLAEEVYVQQPPGFV 1081

Query: 757  DNDNPNKVLRLHKALYGLRQAPRAWNAKLDSTLLSLNFKRCASEHGMYTYGHGKKRLIVG 816
               +  KVL+LHKALYGL+QAPRAWN+KLDS+LL L F R   EHG+YT   G+KRL+VG
Sbjct: 1082 AAGHERKVLKLHKALYGLKQAPRAWNSKLDSSLLMLGFARSECEHGLYTRSDGEKRLVVG 1141

Query: 817  VYVDDLIITGGDVGVLGRFKKEMSKNFEMSDLGVLSYYLGIEVQQNSSGISICQSAYARK 876
            +YVDDLIITGG   V+  FK EM   F MSDLGVLSYYLGIEV+Q   GI + Q+AYA+K
Sbjct: 1142 IYVDDLIITGGSTEVINTFKTEMKTLFRMSDLGVLSYYLGIEVRQGRRGIELLQAAYAKK 1201

Query: 877  LLDTTGLVDSNPTRTPMEARLQLRKAGTTTTVDSTNYRSIVGSLRYLVNSRPDLAYSVGY 936
            +L+  G+   NP  TPMEARL+L K  T+  VD+T YRS++GSLRYL+N+RPD+A++VGY
Sbjct: 1202 ILEKAGMGTCNPCATPMEARLKLSKQSTSPAVDATEYRSLIGSLRYLMNTRPDMAFAVGY 1261

Query: 937  VSRFMEAPREEHLVAVKRILRYVAGTRGWGVRYCAGSEKEKLKLVGYSDSDMAGDVDDRK 996
            +SRFME PR+EHL A+K +LRYVAGT  +G+ Y +G    +  LVGYSDSDMAGD+DDRK
Sbjct: 1262 LSRFMENPRQEHLAAMKHLLRYVAGTIDYGLVYTSGD--TEFNLVGYSDSDMAGDIDDRK 1321

Query: 997  STSGMIYFLSGGAICWQSTKQKVVALSSCEAEYIAASMAATQGIWLARLMEELIGRESDS 1056
            STSG+IYFL G  + WQS KQ+VVALSSCEAEYIA + AA QG+WL RL+++++G     
Sbjct: 1322 STSGIIYFLGGNPVAWQSQKQRVVALSSCEAEYIAGAAAACQGVWLRRLLQDVVGVSGPP 1381

Query: 1057 PMLYVDNKATISLIKNPVLHDRSKHIETRFHYIRECADRGLIKIDFIRTEEQLGDIFTKS 1097
            P L +DN++ I+L KNPVLHDRSKHI+T+FH++REC D G +++ F+ T+ QL DI TK+
Sbjct: 1382 PQLKMDNQSAIALSKNPVLHDRSKHIDTKFHFLRECVDSGAVRLAFVSTQAQLADIMTKA 1441

BLAST of CmoCh05G000040 vs. TrEMBL
Match: Q7XTU6_ORYSJ (OSJNBb0034I13.10 protein OS=Oryza sativa subsp. japonica GN=OSJNBb0034I13.10 PE=4 SV=1)

HSP 1 Score: 1228.8 bits (3178), Expect = 0.0e+00
Identity = 616/1075 (57.30%), Positives = 771/1075 (71.72%), Query Frame = 1

Query: 37   PTGEPIQLEEERVFAQIGERDEQHEHQQWILDTGATNHMTGARSAFSELDSGIRGTVKFG 96
            P+G+ + L E++V     +  E+     W LDTGATNHMTG RSAF+ELD+G+ GTVKFG
Sbjct: 356  PSGQEVHLTEKKVILDHEDGGEEEVTGDWFLDTGATNHMTGVRSAFAELDTGVVGTVKFG 415

Query: 97   DGSVVEIEGRGTILFVSKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKI 156
            DGSV+EI+GRGT++F  K G+HR L  VY+IP+L+ N++S+G+LD  G    I  G+  +
Sbjct: 416  DGSVIEIQGRGTVVFRCKNGDHRSLDAVYYIPKLRKNIISVGRLDARGYDAHIWGGVCTL 475

Query: 157  CDNQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKK 216
             D    LL + +R  N LYIL+L I  PV ++A   + +WRWHAR+GHLNF +L +L + 
Sbjct: 476  RDPNGLLLAKVKRDINYLYILKLHIANPVCMAASGGDTAWRWHARFGHLNFQSLRRLAQG 535

Query: 217  ELVHGLPEIKGVNKLCDGCFIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGK 276
             +V GLP I   ++LCDGC  GKQRR PFP    +RA E LELVHGD+CGPI PATPGG+
Sbjct: 536  NMVRGLPTIDHTDQLCDGCLAGKQRRLPFPEEAKFRAQEALELVHGDLCGPITPATPGGR 595

Query: 277  SLFLLLVDDKSRFMWLTLLQAKSEAAEAIKRIKARAEAECEKKMRVLRTDRGGEFTSASF 336
              FLLLVDD SR MW+ LL  K EAA AIK+ +A  E E  +K+R LRTDRGGEFTS  F
Sbjct: 596  KYFLLLVDDMSRHMWIRLLSGKHEAATAIKQFQAGVELESGRKLRALRTDRGGEFTSVEF 655

Query: 337  NKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFWGEAVMTAVYL 396
              YC + G++R LTAPYSPQQN VVERRNQT+V  ARS+L  AG+P RFWGEAV+ AVY+
Sbjct: 656  MDYCTDRGMRRELTAPYSPQQNRVVERRNQTVVAAARSMLKAAGLPARFWGEAVVAAVYV 715

Query: 397  LNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYE 456
            LNRSPT++LDG TPYEAW+ ++P+V H RVFGCV Y+K  +P+L KLD RG ++VFIGYE
Sbjct: 716  LNRSPTKALDGVTPYEAWHGRRPSVEHLRVFGCVGYVKTVKPNLRKLDDRGTRMVFIGYE 775

Query: 457  PGSKAYRLYDPVGGRAHVSRDVVFDESTFWQWNDVIETDRNPNQFTVEYLVT--EPEEGG 516
             GSKAYR+YDPV  R  VSRDVVFDE+  W W D  +      +FTV++ V+   P    
Sbjct: 776  QGSKAYRMYDPVAQRVCVSRDVVFDETATWAWRDPEDAATEEEEFTVDFFVSPVAPSVAD 835

Query: 517  AQHQETSPPPAGAPPEPV------------EFATPRTADSTLDADHDTDLEARYRRMDDL 576
            A  Q  +P  AG  P               EF TP T+  T + D       RYRR+ D+
Sbjct: 836  AGEQTGTPVQAGVSPVSTGVLSSPPRAPNGEFCTPPTS-VTPETDGG---PVRYRRVQDI 895

Query: 577  VGGGEPPGLAARELE-EVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSL 636
            +   EP       L+ + ++   ++ +EP +F EAEK+ CWR+AM EE+ S+ ENQTWSL
Sbjct: 896  LSTTEP------VLDFDYSDQCLIATEEPTSFVEAEKHECWRRAMVEELRSVEENQTWSL 955

Query: 637  EDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVR 696
             ++P GH+AIGLKWV+KLK++  G +VKHKARLVAKGYVQ+QGVDF+EVFAPVAR+E+VR
Sbjct: 956  AELPAGHKAIGLKWVYKLKKDPSGAIVKHKARLVAKGYVQQQGVDFDEVFAPVARMETVR 1015

Query: 697  FLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQ 756
             L+A+AA   WE+HHMDVKSAFLNGEL+E VYV QPPGF D  N +KVL+L KALYGLRQ
Sbjct: 1016 LLVALAAQKGWEIHHMDVKSAFLNGELEEEVYVVQPPGFDDKTNASKVLKLRKALYGLRQ 1075

Query: 757  APRAWNAKLDSTLLSLNFKRCASEHGMYTYGHGKKRLIVGVYVDDLIITGGDVGVLGRFK 816
            APRAWNAKLD+TLLSL F + A+E  +Y  G G  +LIVGVYVDDLIITG     +  FK
Sbjct: 1076 APRAWNAKLDNTLLSLKFNKSATESAVYVRGVGDSKLIVGVYVDDLIITGSQKKEIDAFK 1135

Query: 817  KEMSKNFEMSDLGVLSYYLGIEVQQNSSGISICQSAYARKLLDTTGLVDSNPTRTPMEAR 876
             +M + F MSDLG LSYYLG+EV Q   GI + QSAYA K+L+ TG+   NPT+ PMEAR
Sbjct: 1136 LQMKQRFNMSDLGFLSYYLGMEVVQKGEGIFLSQSAYAGKILEKTGMEGCNPTQVPMEAR 1195

Query: 877  LQLRKAGTTTTVDSTNYRSIVGSLRYLVNSRPDLAYSVGYVSRFMEAPREEHLVAVKRIL 936
            L+L K GT   VD T YRSIVGSLRYLVN+RPDLAYSVGYVSRFME P  EH  AVK IL
Sbjct: 1196 LKLSKEGTGECVDPTEYRSIVGSLRYLVNTRPDLAYSVGYVSRFMEKPTSEHWAAVKHIL 1255

Query: 937  RYVAGTRGWGVRYCAGSEKEKLKLVGYSDSDMAGDVDDRKSTSGMIYFLSGGAICWQSTK 996
            RY++GT   G  Y    E    KLVG+SDSDMAGD+DDRKST+G+++   G  I WQS K
Sbjct: 1256 RYISGTIKTGCWY-GREEVGNAKLVGFSDSDMAGDLDDRKSTTGVLFRYGGSLISWQSQK 1315

Query: 997  QKVVALSSCEAEYIAASMAATQGIWLARLMEELIGRESDSPMLYVDNKATISLIKNPVLH 1056
            QKVVALSSCEAEYIAA+ AA QGIWL+RL+ EL+  E     L +DNK+ I+L KNPV H
Sbjct: 1316 QKVVALSSCEAEYIAATTAACQGIWLSRLIAELLDAEPGQTTLMIDNKSAINLCKNPVFH 1375

Query: 1057 DRSKHIETRFHYIRECADRGLIKIDFIRTEEQLGDIFTKSLARVKFEELRSKIGV 1097
            DRSKHI+TR+H+IREC ++  I ++++ +E+QL D+ TK + RV+F+ELR K+G+
Sbjct: 1376 DRSKHIDTRYHFIRECVEKKQIAVEYVCSEDQLADLLTKPVGRVRFKELRRKMGL 1419

BLAST of CmoCh05G000040 vs. TrEMBL
Match: Q7XEA3_ORYSJ (Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. japonica GN=LOC_Os10g29420 PE=4 SV=2)

HSP 1 Score: 1224.2 bits (3166), Expect = 0.0e+00
Identity = 616/1084 (56.83%), Positives = 780/1084 (71.96%), Query Frame = 1

Query: 42   IQLEEERVFAQIGERDEQHEHQQWILDTGATNHMTGARSAFSELDSGIRGTVKFGDGSVV 101
            ++L E +VFA + +  + H+  +WI+D+GA+NHMTG+R AF++LD+ I G V+ GDGSVV
Sbjct: 553  VELVEMKVFAALDDAAD-HDPGRWIMDSGASNHMTGSRMAFADLDTNITGNVRLGDGSVV 612

Query: 102  EIEGRGTILFVSKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQR 161
             I GRGTILF  K GEHR L++ Y++PRL AN++S+GQLDETG  +  E G++++ D QR
Sbjct: 613  RIAGRGTILFACKNGEHRTLSNTYYLPRLTANIISIGQLDETGFKVLAEDGIMRVWDEQR 672

Query: 162  RLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHG 221
            RLL +  RT  RLY+L++ + +PV L+A  +E +WRWHAR GH+NF AL K+ K+ELV G
Sbjct: 673  RLLARIPRTPGRLYMLDINLARPVCLAAHADEDAWRWHARLGHINFRALCKMGKEELVRG 732

Query: 222  LPEIKGVNKLCDGCFIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLL 281
            LP +  V+++C+ C  GK RR+PFP +   R+DEPL L+HGD+CGPI PATP G   FLL
Sbjct: 733  LPCLSQVDQVCEACLAGKHRRSPFPRQALCRSDEPLALLHGDLCGPITPATPSGNRYFLL 792

Query: 282  LVDDKSRFMWLTLLQAKSEAAEAIKRIKARAEAECEKKMRVLRTDRGGEFTSASFNKYCD 341
            LVDD SR+MW+ LL  K  A  AIKRI+A AE +  +K+R LRTDRGGEFTS  F +YC 
Sbjct: 793  LVDDYSRYMWVALLSTKDAAPAAIKRIQAAAERKSGRKLRALRTDRGGEFTSTQFAEYCA 852

Query: 342  EIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFWGEAVMTAVYLLNRSP 401
            E+G++R LTAPYSPQQNGVVERRNQ++VGTARS+L   G+PG FWGEA+ TAVYLLNRS 
Sbjct: 853  ELGMRRELTAPYSPQQNGVVERRNQSVVGTARSMLKAKGLPGMFWGEAINTAVYLLNRSS 912

Query: 402  TRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKA 461
            ++ + GKTPY  W    P VHH R FGCVA++K T P+L KLD R   ++F+GY+PGSKA
Sbjct: 913  SKGIGGKTPYALWNGVPPAVHHLRTFGCVAHVKTTTPNLKKLDDRSRPMIFVGYKPGSKA 972

Query: 462  YRLYDPVGGRAHVSRDVVFDESTFWQWNDVIETDRNPNQFTVEYL-VTEPEEGGAQHQET 521
            YR YDP   R H+SRD+VFDE+  W W+     D + + F VEY  V  P       Q+ 
Sbjct: 973  YRAYDPATRRVHISRDIVFDEAAQWDWDAEAAADLDTD-FVVEYTTVYHPGSLSGTRQDA 1032

Query: 522  SPPPAGAPPEP---------------------VEFATPRT-ADSTLDADHDTDLEARYRR 581
              PPA +   P                     VEF +P T A + LDADHD D   R+R 
Sbjct: 1033 GEPPARSSSSPRTPSDSPTAGRTPSVHGDALAVEFVSPPTGAAANLDADHD-DAPLRFRT 1092

Query: 582  MDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQT 641
            MD+++G    PGLA RE++E  EL  VS +EP TF +AE++  WR+AM +E++SI EN+T
Sbjct: 1093 MDNVLGPAMLPGLANREVQE--ELMMVSGEEPATFGQAERDEDWRRAMLDEISSIEENKT 1152

Query: 642  WSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLE 701
            W L D+P GHR IGLKWV+KLK++ +G VVKHKARLVAKGYVQ+ G+DF+EVFAPVARL+
Sbjct: 1153 WRLVDLPSGHRPIGLKWVYKLKKDAQGVVVKHKARLVAKGYVQRAGIDFDEVFAPVARLD 1212

Query: 702  SVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYG 761
            SVR LLA+AA   W VHHMDVKSAFLNGEL E VYV QPPGF  +   NKV RL KALYG
Sbjct: 1213 SVRLLLALAAQEGWMVHHMDVKSAFLNGELIEEVYVVQPPGFEIDGQENKVYRLDKALYG 1272

Query: 762  LRQAPRAWNAKLDSTLLSLNFKRCASEHGMYTYGHGKKRLIVGVYVDDLIITGGDVGVLG 821
            LRQAPRAWN KLD TL  L FK+   EHG+Y  G G  RL+VGVYVDDL+I GGD G++ 
Sbjct: 1273 LRQAPRAWNTKLDCTLKKLGFKQSPLEHGLYARGDGSGRLLVGVYVDDLVIVGGDSGMIK 1332

Query: 822  RFKKEMSKNFEMSDLGVLSYYLGIEVQQNSSGISICQSAYARKLLDTTGLVDSNPTRTPM 881
             FK++M   F+MSDLG LS+YLGIEV Q +  I++ Q+AY  ++++  GL   NP  TPM
Sbjct: 1333 GFKEQMKAEFKMSDLGPLSFYLGIEVHQEAGIITLKQAAYVSRIVEKAGLTGCNPCATPM 1392

Query: 882  EARLQLRKAGTTTTVDSTNYRSIVGSLRYLVNSRPDLAYSVGYVSRFMEAPREEHLVAVK 941
            E RL+L K    + VD+T YRS+VGSLRYLVN+RPDLAYSVGYVSRFME P +EHL AVK
Sbjct: 1393 EPRLKLSKESAGSLVDATEYRSLVGSLRYLVNTRPDLAYSVGYVSRFMEKPTDEHLAAVK 1452

Query: 942  RILRYVAGTRGWGVRYCAGSEKEKLKLVGYSDSDMAGDVDDRKSTSGMIYFLSGGAICWQ 1001
            RI+RYVAGT   G RY    E     L GYSDSDMAGD+D RKST+G+I+FL    + WQ
Sbjct: 1453 RIIRYVAGTIHLGCRYVKEGEG---GLQGYSDSDMAGDIDTRKSTTGVIFFLGKNPVSWQ 1512

Query: 1002 STKQKVVALSSCEAEYIAASMAATQGIWLARLMEELIGRESDSPMLYVDNKATISLIKNP 1061
            S KQ+VVALSSCE+EYIAA+ AA QGIWLARL+ +L    ++   L VDN++ ++L+KNP
Sbjct: 1513 SQKQRVVALSSCESEYIAAATAACQGIWLARLLGDLRNAATEVVDLRVDNQSALALMKNP 1572

Query: 1062 VLHDRSKHIETRFHYIRECADRGLIKIDFIRTEEQLGDIFTKSLARVKFEELRSKIGVQI 1103
            V HDRSKHI+T+FH+IRE  + G I   +I TE QL DI TK L+R+KF+ELR +IG+  
Sbjct: 1573 VFHDRSKHIQTKFHFIREAVENGEITPSYIGTEGQLADILTKPLSRIKFQELREQIGLAT 1628

BLAST of CmoCh05G000040 vs. TrEMBL
Match: Q7FAB9_ORYSJ (OSJNBa0033H08.2 protein OS=Oryza sativa subsp. japonica GN=OSJNBa0033H08.2 PE=4 SV=1)

HSP 1 Score: 1219.9 bits (3155), Expect = 0.0e+00
Identity = 615/1082 (56.84%), Positives = 778/1082 (71.90%), Query Frame = 1

Query: 42   IQLEEERVFAQIGERDEQHEHQQWILDTGATNHMTGARSAFSELDSGIRGTVKFGDGSVV 101
            ++L E +VFA + +  + H+  +WI+D+GA+NHMTG+R AF++LD+ I G V+ GDGSVV
Sbjct: 397  VELVEMKVFAALDDAAD-HDPGRWIMDSGASNHMTGSRMAFADLDTNITGNVRLGDGSVV 456

Query: 102  EIEGRGTILFVSKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQR 161
             I GRGTILF  K GEHR L++ Y++PRL AN++S+GQLDETG  +  E G++++ D QR
Sbjct: 457  RIAGRGTILFACKNGEHRTLSNTYYLPRLAANIISIGQLDETGFKVLAEDGIMRVWDEQR 516

Query: 162  RLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHG 221
            RLL +  RT  RLY+L++ + +PV L+A  +E +WRWHAR GH+NF AL K+ K+ELV G
Sbjct: 517  RLLARIPRTPGRLYMLDINLARPVCLAAHADEDAWRWHARLGHINFRALCKMGKEELVRG 576

Query: 222  LPEIKGVNKLCDGCFIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLL 281
            LP +  V+++C+ C  GK RR+PFP +   R+D PL L+HGD+CGPI PATP G   FLL
Sbjct: 577  LPCLSQVDQVCEACLAGKHRRSPFPRQALCRSDVPLALLHGDLCGPITPATPSGNRYFLL 636

Query: 282  LVDDKSRFMWLTLLQAKSEAAEAIKRIKARAEAECEKKMRVLRTDRGGEFTSASFNKYCD 341
            LVDD SR+MW+ LL  K  A  AIKR +A AE +  +K+R LRTDRGGEFTS  F +YC 
Sbjct: 637  LVDDYSRYMWVALLSTKDAAPAAIKRTQAAAERKSGRKLRALRTDRGGEFTSTQFAEYCA 696

Query: 342  EIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFWGEAVMTAVYLLNRSP 401
            E+G++R LTAPYSPQQNGVVERRNQ++VGTARS+L   G+PG FWGEA+ TAVYLLNRS 
Sbjct: 697  ELGMRRELTAPYSPQQNGVVERRNQSVVGTARSMLKAKGLPGMFWGEAINTAVYLLNRSS 756

Query: 402  TRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKA 461
            ++ + GKTPY  W    P VHH R FGCVA++K T P+L KLD R   ++F+GYEPGSKA
Sbjct: 757  SKGIGGKTPYALWNGVPPAVHHLRTFGCVAHVKTTTPNLKKLDDRSRPMIFVGYEPGSKA 816

Query: 462  YRLYDPVGGRAHVSRDVVFDESTFWQWNDVIETDRNPNQFTVEYL-VTEPEEGGAQHQET 521
            YR YDP   R H+SRD+VFDE+  W W+     D + + F VEY  V  P       Q+ 
Sbjct: 817  YRAYDPATRRVHISRDIVFDEAAQWDWDAEAAADLDTD-FVVEYTTVYHPGSLSGTRQDA 876

Query: 522  SPPPAGAPPEP---------------------VEFATPRT-ADSTLDADHDTDLEARYRR 581
              PPA +   P                     VEF +P T A + LDADHD D   R+R 
Sbjct: 877  WEPPARSSSSPRTPSDSPTAGRTPSVHGDAPAVEFVSPPTGAAANLDADHD-DAPLRFRT 936

Query: 582  MDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQT 641
            MD+++G    PGLA RE++E  EL  VS +EP TFA+AE++  WR+AM +E++SI EN+T
Sbjct: 937  MDNVLGPAMLPGLANREVQE--ELMMVSGEEPATFAQAERDEDWRRAMLDEISSIEENKT 996

Query: 642  WSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLE 701
            W L D+P GHR IGLKWV+KLK++ +G VVKHKARLVAKGYVQ+ G+DF+EVFAPVARL+
Sbjct: 997  WRLVDLPSGHRPIGLKWVYKLKKDAQGVVVKHKARLVAKGYVQRAGIDFDEVFAPVARLD 1056

Query: 702  SVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYG 761
            SVR LLA+AA   W VHHMDVKSAFLNGEL E VYV QPPGF  +   NKV RL KALYG
Sbjct: 1057 SVRLLLALAAQEGWMVHHMDVKSAFLNGELIEEVYVVQPPGFEIDGQENKVYRLDKALYG 1116

Query: 762  LRQAPRAWNAKLDSTLLSLNFKRCASEHGMYTYGHGKKRLIVGVYVDDLIITGGDVGVLG 821
            LRQAPRAWN KLD TL  L FK+   EHG+Y  G G  RL+VGVYVDDL+I GGD G++ 
Sbjct: 1117 LRQAPRAWNTKLDCTLKKLGFKQSPLEHGLYARGDGSGRLLVGVYVDDLVIVGGDSGMIK 1176

Query: 822  RFKKEMSKNFEMSDLGVLSYYLGIEVQQNSSGISICQSAYARKLLDTTGLVDSNPTRTPM 881
             FK++M   F+MSDLG LS+YLGIEV Q +  I++ Q+AYA ++++  GL   NP  TPM
Sbjct: 1177 GFKEQMKAEFKMSDLGPLSFYLGIEVHQEAGIITLKQAAYASRIVEKAGLTGCNPCATPM 1236

Query: 882  EARLQLRKAGTTTTVDSTNYRSIVGSLRYLVNSRPDLAYSVGYVSRFMEAPREEHLVAVK 941
            E RL+L K    + VD+T YRS+VGSL YLVN+RPDLAYSVGYVSRFME P +EHL AVK
Sbjct: 1237 EPRLKLSKESAGSLVDATEYRSLVGSLHYLVNTRPDLAYSVGYVSRFMEKPTDEHLAAVK 1296

Query: 942  RILRYVAGTRGWGVRYCAGSEKEKLKLVGYSDSDMAGDVDDRKSTSGMIYFLSGGAICWQ 1001
            RI+RYVAGT   G RY    E     L GYSDSDMAGD+D RKST+G+I+FL    + WQ
Sbjct: 1297 RIIRYVAGTIHLGCRYVKEGEG---GLQGYSDSDMAGDIDTRKSTTGVIFFLGKNPVSWQ 1356

Query: 1002 STKQKVVALSSCEAEYIAASMAATQGIWLARLMEELIGRESDSPMLYVDNKATISLIKNP 1061
            S KQ+VVALSSCE+EYIAA+ AA QGIWLARL+ +L    ++   L VDN++ ++L+KNP
Sbjct: 1357 SQKQRVVALSSCESEYIAAATAACQGIWLARLLGDLRNAATEVVDLRVDNQSALALMKNP 1416

Query: 1062 VLHDRSKHIETRFHYIRECADRGLIKIDFIRTEEQLGDIFTKSLARVKFEELRSKIGVQI 1101
            V HDRSKHI+T+FH+IRE  + G I   +I TE QL DI TK L+R+KF+ELR +IG+  
Sbjct: 1417 VFHDRSKHIQTKFHFIREAVENGEITPSYIGTEGQLADILTKPLSRIKFQELREQIGLAT 1470

BLAST of CmoCh05G000040 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 365.5 bits (937), Expect = 1.1e-100
Identity = 198/504 (39.29%), Positives = 301/504 (59.72%), Query Frame = 1

Query: 587  ADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGE 646
            A EP+T+ EA++   W  AM +E+ ++    TW +  +PP  + IG KWV+K+K N  G 
Sbjct: 83   AKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGT 142

Query: 647  VVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNG 706
            + ++KARLVAKGY Q++G+DF E F+PV +L SV+ +LAI+A +++ +H +D+ +AFLNG
Sbjct: 143  IERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNG 202

Query: 707  ELKETVYVRQPPGFL----DNDNPNKVLRLHKALYGLRQAPRAWNAKLDSTLLSLNFKRC 766
            +L E +Y++ PPG+     D+  PN V  L K++YGL+QA R W  K   TL+   F + 
Sbjct: 203  DLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQS 262

Query: 767  ASEHGMYTYGHGKKRLIVGVYVDDLIITGGDVGVLGRFKKEMSKNFEMSDLGVLSYYLGI 826
             S+H  +        L V VYVDD+II   +   +   K ++   F++ DLG L Y+LG+
Sbjct: 263  HSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGL 322

Query: 827  EVQQNSSGISICQSAYARKLLDTTGLVDSNPTRTPMEARLQLRKAGTTTTVDSTNYRSIV 886
            E+ ++++GI+ICQ  YA  LLD TGL+   P+  PM+  +          VD+  YR ++
Sbjct: 323  EIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDFVDAKAYRRLI 382

Query: 887  GSLRYLVNSRPDLAYSVGYVSRFMEAPREEHLVAVKRILRYVAGTRGWGVRYCAGSEKEK 946
            G L YL  +R D++++V  +S+F EAPR  H  AV +IL Y+ GT G G+ Y   S + +
Sbjct: 383  GRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFY---SSQAE 442

Query: 947  LKLVGYSDSDMAGDVDDRKSTSGMIYFLSGGAICWQSTKQKVVALSSCEAEYIAASMAAT 1006
            ++L  +SD+      D R+ST+G   FL    I W+S KQ+VV+ SS EAEY A S A  
Sbjct: 443  MQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATD 502

Query: 1007 QGIWLARLMEELIGRESDSPMLYVDNKATISLIKNPVLHDRSKHIETRFHYIRE-CADRG 1066
            + +WLA+   EL    S   +L+ DN A I +  N V H+R+KHIE+  H +RE    + 
Sbjct: 503  EMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCHSVRERSVYQA 562

Query: 1067 LIKIDFIRTEEQLGDIFTKSLARV 1086
             +   F   +EQ  D FT+ L+ +
Sbjct: 563  TLSYSFQAYDEQ--DGFTEYLSPI 581

BLAST of CmoCh05G000040 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 150.6 bits (379), Expect = 5.7e-36
Identity = 81/225 (36.00%), Positives = 128/225 (56.89%), Query Frame = 1

Query: 782  VYVDDLIITGGDVGVLGRFKKEMSKNFEMSDLGVLSYYLGIEVQQNSSGISICQSAYARK 841
            +YVDD+++TG    +L     ++S  F M DLG + Y+LGI+++ + SG+ + Q+ YA +
Sbjct: 5    LYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYAEQ 64

Query: 842  LLDTTGLVDSNPTRTPMEARLQLRKAGTTTTVDSTNYRSIVGSLRYLVNSRPDLAYSVGY 901
            +L+  G++D  P  TP+  +L      T    D +++RSIVG+L+YL  +RPD++Y+V  
Sbjct: 65   ILNNAGMLDCKPMSTPLPLKLN-SSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAVNI 124

Query: 902  VSRFMEAPREEHLVAVKRILRYVAGTRGWGVRYCAGSEKEKLKLVGYSDSDMAGDVDDRK 961
            V + M  P       +KR+LRYV GT   G+      +  KL +  + DSD AG    R+
Sbjct: 125  VCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYI---HKNSKLNVQAFCDSDWAGCTSTRR 184

Query: 962  STSGMIYFLSGGAICWQSTKQKVVALSSCEAEYIAASMAATQGIW 1007
            ST+G   FL    I W + +Q  V+ SS E EY A ++ A +  W
Sbjct: 185  STTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CmoCh05G000040 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 87.4 bits (215), Expect = 5.9e-17
Identity = 42/99 (42.42%), Positives = 64/99 (64.65%), Query Frame = 1

Query: 589 EPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVV 648
           EP +   A K+P W +AMQEE+ +++ N+TW L   P     +G KWVFK K +  G + 
Sbjct: 27  EPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLD 86

Query: 649 KHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIA 688
           + KARLVAKG+ Q++G+ F E ++PV R  ++R +L +A
Sbjct: 87  RLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CmoCh05G000040 vs. TAIR10
Match: ATMG00710.1 (ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein)

HSP 1 Score: 68.9 bits (167), Expect = 2.2e-11
Identity = 36/95 (37.89%), Positives = 52/95 (54.74%), Query Frame = 1

Query: 365 NQTIVGTARSLLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHF 424
           N+TI+   RS+L   G+P  F  +A  TAV+++N+ P+ +++   P E W+   PT  + 
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61

Query: 425 RVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGS 460
           R FGCVAY+        KL PR  K    G E GS
Sbjct: 62  RRFGCVAYIHCDE---GKLKPRAKK----GEEKGS 89

BLAST of CmoCh05G000040 vs. TAIR10
Match: ATMG00240.1 (ATMG00240.1 Gag-Pol-related retrotransposon family protein)

HSP 1 Score: 60.5 bits (145), Expect = 7.7e-09
Identity = 33/79 (41.77%), Positives = 48/79 (60.76%), Query Frame = 1

Query: 887 YLVNSRPDLAYSVGYVSRFMEAPREEHLVAVKRILRYVAGTRGWGVRYCAGSEKEKLKLV 946
           YL  +RPDL ++V  +S+F  A R   + AV ++L YV GT G G+ Y A S+   L+L 
Sbjct: 2   YLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSD---LQLK 61

Query: 947 GYSDSDMAGDVDDRKSTSG 966
            ++DSD A   D R+S +G
Sbjct: 62  AFADSDWASCPDTRRSVTG 77

BLAST of CmoCh05G000040 vs. NCBI nr
Match: gi|38344222|emb|CAE03692.2| (OSJNBb0026E15.10 [Oryza sativa Japonica Group])

HSP 1 Score: 1363.2 bits (3527), Expect = 0.0e+00
Identity = 678/1077 (62.95%), Positives = 829/1077 (76.97%), Query Frame = 1

Query: 36   APTGEPIQLEEERVFAQIGERDEQHEHQQWILDTGATNHMTGARSAFSELDSGIRGTVKF 95
            +P GE + + E +VFAQ+ +  E H+   WILDTGATNHMTG+RSAF+ELD+ + GTV+F
Sbjct: 368  SPIGE-LAVVEAKVFAQLDDGGE-HDPAMWILDTGATNHMTGSRSAFAELDTAVTGTVRF 427

Query: 96   GDGSVVEIEGRGTILFVSKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLK 155
            GDGSVV IEGR T+LF  + GEHR +  VY+IPRL AN+VSLGQLD +G  + I  G+L 
Sbjct: 428  GDGSVVRIEGRVTVLFSCRFGEHRGIAGVYYIPRLTANIVSLGQLDRSGSKVLIHHGILH 487

Query: 156  ICDNQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQK 215
            + D +  LL + RR+ + LY ++L+ID+PV L+A++ E +WRWHARYGHLNFPAL KL +
Sbjct: 488  VWDPRGHLLVRVRRSDDCLYTIKLDIDRPVCLAARSAEPAWRWHARYGHLNFPALRKLAQ 547

Query: 216  KELVHGLPEIKGVNKLCDGCFIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGG 275
            +E+V GLP ++ V ++CDGC +GKQRR  FP+++ YRADE LELVHGD+CGPI+PATP G
Sbjct: 548  QEMVRGLPLLQQVTQVCDGCLLGKQRRAAFPTQSKYRADEHLELVHGDLCGPIEPATPAG 607

Query: 276  KSLFLLLVDDKSRFMWLTLLQAKSEAAEAIKRIKARAEAECEKKMRVLRTDRGGEFTSAS 335
               FLLLVDD SR+MWLT++++K EAA AIK  +ARAE E  +K+R LR DRG EFTS  
Sbjct: 608  NRYFLLLVDDMSRYMWLTMIRSKDEAANAIKHFQARAEVESGRKLRALRMDRGSEFTSIE 667

Query: 336  FNKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFWGEAVMTAVY 395
            F +YC  +G+ R LTAPYSPQQNGVVERRNQTIV TARS++   G+PGRFWGEA+ TAV+
Sbjct: 668  FGEYCANLGVGRQLTAPYSPQQNGVVERRNQTIVATARSMMKAKGVPGRFWGEAMSTAVF 727

Query: 396  LLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGY 455
            LLNRSPT+SLD +TPYEAWY + P VH  R FGCV ++K+T+P L KLD R   +V +GY
Sbjct: 728  LLNRSPTKSLDNQTPYEAWYGQWPAVHFLRTFGCVGHVKITKPGLKKLDDRSAPMVLLGY 787

Query: 456  EPGSKAYRLYDPVGGRAHVSRDVVFDESTFWQWNDVIETDRNPN--QFTVEYLVT----- 515
            E GSKAYRLYDPV  R HVSRDVVFDE   W W  V   D  P    FTVE +VT     
Sbjct: 788  EQGSKAYRLYDPVSERVHVSRDVVFDEDIAWDWGPVTP-DGAPQLEPFTVEQVVTTTIGT 847

Query: 516  ---------EPEEGGAQHQETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRRM 575
                      P         T  PP+   PE VEF TP T DS LDAD D D+  RYR +
Sbjct: 848  APASSPTPPSPPSPAPSAPTTPAPPSPPSPEAVEFVTPPTQDSILDADADDDVVPRYRLV 907

Query: 576  DDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTW 635
            D+L+G   PPG A R LE++ ELH VSADEP + AEAE +P WR AMQ+E+ +I +N TW
Sbjct: 908  DNLLGNASPPGHAPRVLEQL-ELHVVSADEPASLAEAEADPSWRGAMQDELNAIVDNDTW 967

Query: 636  SLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLES 695
            SL D+P GHRAIGLKWV+KLKR+E+G +V++KARLVAKGYVQ+QGVDF+EVFA VARLES
Sbjct: 968  SLTDLPHGHRAIGLKWVYKLKRDEQGAIVRYKARLVAKGYVQRQGVDFDEVFALVARLES 1027

Query: 696  VRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGL 755
            VR LLA+AAH  W+VHHMDVKSAFLNGEL E VYV QPPGF+D+++ NKV RLHKALYGL
Sbjct: 1028 VRLLLAVAAHQGWQVHHMDVKSAFLNGELLEEVYVSQPPGFVDDNHKNKVYRLHKALYGL 1087

Query: 756  RQAPRAWNAKLDSTLLSLNFKRCASEHGMYTYGHGKKRLIVGVYVDDLIITGGDVGVLGR 815
            RQAPRAWNAKLDS+LLSL F R +SEHG+YT   G +RL+VGVYVDDLIITG     +  
Sbjct: 1088 RQAPRAWNAKLDSSLLSLGFHRSSSEHGVYTRTRGGRRLMVGVYVDDLIITGDHDDEIRS 1147

Query: 816  FKKEMSKNFEMSDLGVLSYYLGIEVQQNSSGISICQSAYARKLLDTTGLVDSNPTRTPME 875
            FK EM K F+MSDLG L YYLGIEV Q+S GI++ Q+AYA K+L+  GL D NP +TPME
Sbjct: 1148 FKGEMMKLFKMSDLGALRYYLGIEVTQDSDGITLGQAAYAGKILEKAGLKDCNPCQTPME 1207

Query: 876  ARLQLRKAGTTTTVDSTNYRSIVGSLRYLVNSRPDLAYSVGYVSRFMEAPREEHLVAVKR 935
             RL+LRK      VD+T YRS+VGSLRYLVN+RPDLA+SVGYVSRFME+PRE+HL AV+R
Sbjct: 1208 VRLKLRKGSDFPLVDATLYRSLVGSLRYLVNTRPDLAFSVGYVSRFMESPREDHLAAVRR 1267

Query: 936  ILRYVAGTRGWGVRYCAGSEKEKLKLVGYSDSDMAGDVDDRKSTSGMIYFLSGGAICWQS 995
            ILRYVAGTR WG+R+  G+      LVGYSDSD+AGD D+RKSTSG I+F++GG + WQS
Sbjct: 1268 ILRYVAGTRCWGIRFGPGARCALPMLVGYSDSDLAGDPDERKSTSGQIFFINGGPVTWQS 1327

Query: 996  TKQKVVALSSCEAEYIAASMAATQGIWLARLMEELIGRESDSPMLYVDNKATISLIKNPV 1055
            +KQKVVALSSCEAEYIAA+ A  QG+WLARL+ E++G E  +P+L VDN++TISLIKNPV
Sbjct: 1328 SKQKVVALSSCEAEYIAAAAATCQGVWLARLLAEVLGDEITAPLLKVDNQSTISLIKNPV 1387

Query: 1056 LHDRSKHIETRFHYIRECADRGLIKIDFIRTEEQLGDIFTKSLARVKFEELRSKIGV 1097
             HDRSKHI+ ++HYIRECA++ LI++ F+ T EQLGDIFTKSL R +F+ELRSKIGV
Sbjct: 1388 HHDRSKHIDVKYHYIRECAEKKLIEMMFVGTAEQLGDIFTKSLGRTRFQELRSKIGV 1440

BLAST of CmoCh05G000040 vs. NCBI nr
Match: gi|116634828|emb|CAH66352.1| (OSIGBa0135C09.3 [Oryza sativa Indica Group])

HSP 1 Score: 1326.6 bits (3432), Expect = 0.0e+00
Identity = 664/1077 (61.65%), Positives = 812/1077 (75.39%), Query Frame = 1

Query: 36   APTGEPIQLEEERVFAQIGERDEQHEHQQWILDTGATNHMTGARSAFSELDSGIRGTVKF 95
            +P GE + + E +VFAQ+ +  E H+   WILDTGATNHMTG+RSAF++LD+ + GTV+F
Sbjct: 301  SPMGE-LAVVEAKVFAQLDDGGE-HDPAMWILDTGATNHMTGSRSAFAKLDTAVTGTVRF 360

Query: 96   GDGSVVEIEGRGTILFVSKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLK 155
            GDGSVV IEGRGT+LF  + GEHR +  VY+IPRL AN+VSLGQLD +G  + I  G+L+
Sbjct: 361  GDGSVVRIEGRGTVLFSCRFGEHRGIAGVYYIPRLTANIVSLGQLDRSGSKVLIHHGVLR 420

Query: 156  ICDNQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQK 215
            + D +  LL + RR+ + LY ++L ID+PV L+A++ + +WRWHARYGHLNFP+L KL +
Sbjct: 421  VWDPRGHLLVRVRRSDDCLYTIKLNIDRPVYLAARSAKPAWRWHARYGHLNFPSLRKLAQ 480

Query: 216  KELVHGLPEIKGVNKLCDGCFIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGG 275
            +E+V GLP ++ V ++CDGC +GKQRR  FP+++ YRADE LELVHGD+CGPI+PATP G
Sbjct: 481  QEMVRGLPLLQQVTQVCDGCLLGKQRRAAFPTQSKYRADEHLELVHGDLCGPIEPATPAG 540

Query: 276  KSLFLLLVDDKSRFMWLTLLQAKSEAAEAIKRIKARAEAECEKKMRVLRTDRGGEFTSAS 335
               FLLLVDD SR+MWLTL+++K EAA AIK  +A AE E  +K+R LRTDRGGEFTS  
Sbjct: 541  NRYFLLLVDDMSRYMWLTLIRSKDEAANAIKHFQAHAEVESGRKLRALRTDRGGEFTSIE 600

Query: 336  FNKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFWGEAVMTAVY 395
            F +YC  + + R LTAPYSPQQNGVVERRNQTIV TARS++   G+PGRFWGEA+ TAV+
Sbjct: 601  FGEYCANLRVGRQLTAPYSPQQNGVVERRNQTIVATARSMMKAKGVPGRFWGEAMSTAVF 660

Query: 396  LLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGY 455
            LLNRSPT+SLD +TPYEAWY ++P VH  R FGCV ++K+T+P L KLD R   +V +GY
Sbjct: 661  LLNRSPTKSLDNQTPYEAWYGQRPAVHFLRTFGCVGHVKITKPGLKKLDDRSAPMVLLGY 720

Query: 456  EPGSKAYRLYDPVGGRAHVSRDVVFDESTFWQWNDVIETDRNPN--QFTVEYLVT----- 515
            E GSKAYRLYDPV  R HVSRDVVFDE   W W   +  D  P    FTVE +VT     
Sbjct: 721  EQGSKAYRLYDPVSERVHVSRDVVFDEDAAWDWGP-LTPDGAPQLEPFTVEQVVTTTIGT 780

Query: 516  ---------EPEEGGAQHQETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRRM 575
                      P         T  PP+   PE VEF TP T DS LDAD D D+  RY  +
Sbjct: 781  APASSLTPPSPPSPAPSAPTTPAPPSPPSPEAVEFVTPPTQDSILDADADDDVVPRYHLV 840

Query: 576  DDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTW 635
            D+L+G   PPG A R LE++ ELH VSADEP + AEAE +P WR AMQ+E+ +I +N TW
Sbjct: 841  DNLLGNASPPGHAPRVLEQL-ELHVVSADEPASLAEAEADPNWRGAMQDELNAIVDNDTW 900

Query: 636  SLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLES 695
            SL D+P GHRAIGLKW                ARLVAKGYVQ+QGVDF+EVFAPVARLE 
Sbjct: 901  SLTDLPHGHRAIGLKW----------------ARLVAKGYVQRQGVDFDEVFAPVARLEL 960

Query: 696  VRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGL 755
            VR LLAIAAH  W+VHHMDVKSAFLNGEL E VYV QPPGF+D+++ NKV RLHKALYGL
Sbjct: 961  VRLLLAIAAHQGWQVHHMDVKSAFLNGELLEEVYVSQPPGFVDDNHKNKVYRLHKALYGL 1020

Query: 756  RQAPRAWNAKLDSTLLSLNFKRCASEHGMYTYGHGKKRLIVGVYVDDLIITGGDVGVLGR 815
            RQAPRAWN KLDS+LLSL F R +SEHG+YT   G +RL VGVYVDDLIITG     +  
Sbjct: 1021 RQAPRAWNTKLDSSLLSLGFHRSSSEHGVYTRTRGGRRLTVGVYVDDLIITGDHDDEIRS 1080

Query: 816  FKKEMSKNFEMSDLGVLSYYLGIEVQQNSSGISICQSAYARKLLDTTGLVDSNPTRTPME 875
            FK EM K F+MSDLG L YYLGIEV Q+S GI++ Q+AYA K+L+  GL D NP +TPME
Sbjct: 1081 FKGEMMKLFKMSDLGALRYYLGIEVTQDSDGITLGQAAYAGKILEKAGLKDCNPCQTPME 1140

Query: 876  ARLQLRKAGTTTTVDSTNYRSIVGSLRYLVNSRPDLAYSVGYVSRFMEAPREEHLVAVKR 935
             RL+LRK      VD+T YRS+VGSLRYLVN+RPDLA+SVGYVSRFME+PRE+HL AV+R
Sbjct: 1141 VRLKLRKGSDFPLVDATLYRSLVGSLRYLVNTRPDLAFSVGYVSRFMESPREDHLAAVRR 1200

Query: 936  ILRYVAGTRGWGVRYCAGSEKEKLKLVGYSDSDMAGDVDDRKSTSGMIYFLSGGAICWQS 995
            ILRYVAGTR WG+R+  G+      LVGYSDSD+AGD D+RKSTSG I+F++GG + WQS
Sbjct: 1201 ILRYVAGTRCWGIRFGPGARCALPMLVGYSDSDLAGDPDERKSTSGQIFFINGGPVTWQS 1260

Query: 996  TKQKVVALSSCEAEYIAASMAATQGIWLARLMEELIGRESDSPMLYVDNKATISLIKNPV 1055
            +K+KVVALSSCEAEYIAA+    QG+WLARL+ E++G E  +P+L VDN++TISLIKNPV
Sbjct: 1261 SKKKVVALSSCEAEYIAAAATTCQGVWLARLLAEVLGDEIAAPLLKVDNQSTISLIKNPV 1320

Query: 1056 LHDRSKHIETRFHYIRECADRGLIKIDFIRTEEQLGDIFTKSLARVKFEELRSKIGV 1097
             HDRSKHI+ ++HYIRECA++ LI++ F+ T EQLGDIFTKSL R +F+ELRSKIGV
Sbjct: 1321 HHDRSKHIDVKYHYIRECAEKKLIEMMFVGTAEQLGDIFTKSLGRTRFQELRSKIGV 1357

BLAST of CmoCh05G000040 vs. NCBI nr
Match: gi|113623687|dbj|BAF23632.1| (Os08g0389500 [Oryza sativa Japonica Group])

HSP 1 Score: 1288.5 bits (3333), Expect = 0.0e+00
Identity = 647/1095 (59.09%), Positives = 801/1095 (73.15%), Query Frame = 1

Query: 37   PTGEPIQLEEERVFAQIGERDEQHEHQQWILDTGATNHMTGARSAFSELDSGIRGTVKFG 96
            P  E I L+E ++F Q+G  +   E  +WILDTGATNHMTG RSAFSEL++GIRGTVKFG
Sbjct: 362  PALEQIHLDESKLFVQLGG-EHGGEATRWILDTGATNHMTGTRSAFSELNTGIRGTVKFG 421

Query: 97   DGSVVEIEGRGTILFVSKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKI 156
            DGSVV IEGRGT+LF  K GEH+ L  VY IPRL  N+VSLGQLDE     S E G+LKI
Sbjct: 422  DGSVVGIEGRGTVLFKCKDGEHQALEGVYHIPRLTTNIVSLGQLDEEKFKWSCEDGVLKI 481

Query: 157  CDNQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKK 216
             + QRRLL +  R+ NRLY+++L I +PV L+A+  +++WRWHAR+GHLNF ALEKL + 
Sbjct: 482  WNKQRRLLAKVVRSPNRLYVVKLNIGRPVCLAAQGGDIAWRWHARFGHLNFRALEKLGRA 541

Query: 217  ELVHGLPEIKGVNKLCDGCFIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGK 276
             +V GLP I  V+++CD C +GKQRR PFPS+  YRA E LELVHGDICGP+ PATP G 
Sbjct: 542  VMVRGLPLINHVDQVCDSCLVGKQRRLPFPSKAKYRAKEKLELVHGDICGPVTPATPSGN 601

Query: 277  SLFLLLVDDKSRFMWLTLLQAKSEAAEAIKRIKARAEAECEKKMRVLRTDRGGEFTSASF 336
             LFLLLVDD SR+MWL LL +K +A+ AIKR  A AEAE  +K+R LRTDRGGEFT+ +F
Sbjct: 602  KLFLLLVDDLSRYMWLILLSSKDQASVAIKRFLACAEAEAGRKLRTLRTDRGGEFTAHAF 661

Query: 337  NKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFWGEAVMTAVYL 396
             +YC E GIQRHLTAPY+PQQNGVVERRNQT++G ARS++    +PG FWGEAV TAV+L
Sbjct: 662  AEYCAEHGIQRHLTAPYTPQQNGVVERRNQTVMGMARSMMKAKSLPGWFWGEAVNTAVFL 721

Query: 397  LNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYE 456
            LNR+PT+ +DGKTP+E W+  KP VH  R FGCVA++K     LAKLD R + +VF+GYE
Sbjct: 722  LNRAPTQCVDGKTPFEVWHGVKPPVHFLRTFGCVAHVKNGGQRLAKLDDRSMPMVFVGYE 781

Query: 457  PGSKAYRLYDPVGGRAHVSRDVVFDESTFWQWNDVIETDRNPNQ----FTVEYLVTEPE- 516
             G+KAYR Y+PV  R HVSRD VF+E   W+W    E    P+     F VE+L T P  
Sbjct: 782  AGTKAYRFYNPVSRRVHVSRDAVFEEERSWEWG--AEKGAGPDDDIEPFVVEHLATGPTG 841

Query: 517  EGG-------AQHQETSPP-----------------------PAGAPPEPVEFATPRTAD 576
            +GG       A  + TS P                       PA A    +EFA+P   D
Sbjct: 842  QGGPVAATPTATQRSTSAPAPMAPPATPSQAGTPTHGAGPRTPASASSPAIEFASPPQGD 901

Query: 577  STLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPC 636
              LD DHD D+  R+R +D+L+G   PPGLA RE+ E   L     DEP T  EA++   
Sbjct: 902  LDLDNDHDDDVPLRFRTVDNLLGASSPPGLAEREVTE--GLMVAIEDEPATAEEAKQVKE 961

Query: 637  WRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQ 696
            WR+AM EEM SI  N+TWSL ++P G RAIGLKWVFK+K++E G + KHKARLVAKGYVQ
Sbjct: 962  WREAMIEEMASIEHNKTWSLVELPAGQRAIGLKWVFKIKKDEHGNITKHKARLVAKGYVQ 1021

Query: 697  KQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFL 756
            +QG+D+EEVFAPVAR+ESVR LLA+AAH SW VHHMDVKSAFLNG+L E VYV+QPPGF+
Sbjct: 1022 RQGIDYEEVFAPVARIESVRVLLAVAAHRSWSVHHMDVKSAFLNGDLAEEVYVQQPPGFV 1081

Query: 757  DNDNPNKVLRLHKALYGLRQAPRAWNAKLDSTLLSLNFKRCASEHGMYTYGHGKKRLIVG 816
               +  KVL+LHKALYGL+QAPRAWN+KLDS+LL L F R   EHG+YT   G+KRL+VG
Sbjct: 1082 AAGHERKVLKLHKALYGLKQAPRAWNSKLDSSLLMLGFARSECEHGLYTRSDGEKRLVVG 1141

Query: 817  VYVDDLIITGGDVGVLGRFKKEMSKNFEMSDLGVLSYYLGIEVQQNSSGISICQSAYARK 876
            +YVDDLIITGG   V+  FK EM   F MSDLGVLSYYLGIEV+Q   GI + Q+AYA+K
Sbjct: 1142 IYVDDLIITGGSTEVINTFKTEMKTLFRMSDLGVLSYYLGIEVRQGRRGIELLQAAYAKK 1201

Query: 877  LLDTTGLVDSNPTRTPMEARLQLRKAGTTTTVDSTNYRSIVGSLRYLVNSRPDLAYSVGY 936
            +L+  G+   NP  TPMEARL+L K  T+  VD+T YRS++GSLRYL+N+RPD+A++VGY
Sbjct: 1202 ILEKAGMGTCNPCATPMEARLKLSKQSTSPAVDATEYRSLIGSLRYLMNTRPDMAFAVGY 1261

Query: 937  VSRFMEAPREEHLVAVKRILRYVAGTRGWGVRYCAGSEKEKLKLVGYSDSDMAGDVDDRK 996
            +SRFME PR+EHL A+K +LRYVAGT  +G+ Y +G    +  LVGYSDSDMAGD+DDRK
Sbjct: 1262 LSRFMENPRQEHLAAMKHLLRYVAGTIDYGLVYTSGD--TEFNLVGYSDSDMAGDIDDRK 1321

Query: 997  STSGMIYFLSGGAICWQSTKQKVVALSSCEAEYIAASMAATQGIWLARLMEELIGRESDS 1056
            STSG+IYFL G  + WQS KQ+VVALSSCEAEYIA + AA QG+WL RL+++++G     
Sbjct: 1322 STSGIIYFLGGNPVAWQSQKQRVVALSSCEAEYIAGAAAACQGVWLRRLLQDVVGVSGPP 1381

Query: 1057 PMLYVDNKATISLIKNPVLHDRSKHIETRFHYIRECADRGLIKIDFIRTEEQLGDIFTKS 1097
            P L +DN++ I+L KNPVLHDRSKHI+T+FH++REC D G +++ F+ T+ QL DI TK+
Sbjct: 1382 PQLKMDNQSAIALSKNPVLHDRSKHIDTKFHFLRECVDSGAVRLAFVSTQAQLADIMTKA 1441

BLAST of CmoCh05G000040 vs. NCBI nr
Match: gi|113611032|dbj|BAF21410.1| (Os07g0434200, partial [Oryza sativa Japonica Group])

HSP 1 Score: 1251.1 bits (3236), Expect = 0.0e+00
Identity = 630/1079 (58.39%), Positives = 783/1079 (72.57%), Query Frame = 1

Query: 37   PTGEPIQLEEERVFAQIGERDEQHEHQQWILDTGATNHMTGARSAFSELDSGIRGTVKFG 96
            P  + ++L E++VFA + +  + H++++WILDTGA+NHMTG+R+AFS++D+ + G V+ G
Sbjct: 134  PRLDVVELVEQKVFAALDDATD-HDNKRWILDTGASNHMTGSRAAFSDIDTNVTGNVRLG 193

Query: 97   DGSVVEIEGRGTILFVSKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKI 156
            DGS+V I GR TILF  K GEH  L   Y++P L AN++S+GQLDETG  + +E G++++
Sbjct: 194  DGSLVRIGGRRTILFACKNGEHHMLHKAYYLPCLAANIISVGQLDETGFKVLVEDGVMRV 253

Query: 157  CDNQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKK 216
             D Q RLL +  RT  RLY+L++ + +PV L A+  E +WRWHAR+GH+NF AL K+ ++
Sbjct: 254  WDEQHRLLARITRTPGRLYVLDINLARPVYLMARAGEDAWRWHARFGHVNFTALRKMGRE 313

Query: 217  ELVHGLPEIKGVNKLCDGCFIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGK 276
             LV GLP +  V ++C+ C  GKQRR PFP +  +RA EPL L+HGD+CGP+ PATP G 
Sbjct: 314  ALVRGLPVLSQVEQVCEACLAGKQRRAPFPQQALHRATEPLALLHGDLCGPVMPATPSGN 373

Query: 277  SLFLLLVDDKSRFMWLTLLQAKSEAAEAIKRIKARAEAECEKKMRVLRTDRGGEFTSASF 336
              F LLVDD SR+MWL LL  K  A +A+KR++A AE +  +K+R LRTDRGGEFT   F
Sbjct: 374  RYFPLLVDDYSRYMWLVLLATKDAAPDAMKRVQAAAERKSGRKLRALRTDRGGEFTVGHF 433

Query: 337  NKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFWGEAVMTAVYL 396
             +YC E+G++R LTAPYSPQQNGVVERRNQ++V TARS+L   G+PG FWGEAV TAVYL
Sbjct: 434  TEYCAELGLRRELTAPYSPQQNGVVERRNQSVVSTARSMLKAKGLPGMFWGEAVNTAVYL 493

Query: 397  LNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYE 456
            LNR  ++S+DGKTPYE W    P VHH R FGCVA++KVT P   KLD R   ++F+GYE
Sbjct: 494  LNRCSSKSIDGKTPYELWNGVTPAVHHLRTFGCVAHVKVTAP-TKKLDDRSRPMIFVGYE 553

Query: 457  PGSKAYRLYDPVGGRAHVSRDVVFDESTFWQWNDVIETDRNPNQFTVEYLV--------- 516
             GSKAYR+YDP   R HVSRDVVFDE   W W+    T+ + + FT+EY           
Sbjct: 554  LGSKAYRVYDPATRRVHVSRDVVFDEEAQWNWDGEAATNVD-SDFTIEYTTVYHPATATP 613

Query: 517  --TEPEEGGAQHQETSPPPAGAPP-------EPVEFAT-PRTADSTLDADHDTDLEARYR 576
              T  E GGA     SP     P         PVEF + P   +  LDADHD D   R+R
Sbjct: 614  TQTGTEHGGAPASPRSPASGSTPTTPPVAEVSPVEFVSPPPDVEDDLDADHD-DAPLRFR 673

Query: 577  RMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQ 636
            R+DD++G   PPG A REL E  EL AV+A+EP +FAEAE+  CWR+AM EEM SI  N+
Sbjct: 674  RIDDVLGPATPPGQAVRELSE--ELFAVTAEEPASFAEAEQLSCWRQAMIEEMRSIEANK 733

Query: 637  TWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARL 696
            TW L D     R IGLKWV+K K++  G + K+KARLVAKGYVQ+QG+DF+EVFAPVARL
Sbjct: 734  TWRLVDPLARQRPIGLKWVYKAKKDAAGNITKYKARLVAKGYVQRQGIDFDEVFAPVARL 793

Query: 697  ESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALY 756
            ESVR LLA AA   W VHHMDVKSAFLNGEL E VYV QPPGF+ +   +KVLRL KALY
Sbjct: 794  ESVRLLLAHAACEGWAVHHMDVKSAFLNGELLEEVYVAQPPGFVVDGQEHKVLRLDKALY 853

Query: 757  GLRQAPRAWNAKLDSTLLSLNFKRCASEHGMYTYGHGKKRLIVGVYVDDLIITGGDVGVL 816
            GLRQAPRAW +KLD++LLSL F R  SEH +Y  G G++RL+VGVYVDDLIITGG+ G L
Sbjct: 854  GLRQAPRAWYSKLDASLLSLGFHRSDSEHAVYMRGTGEQRLVVGVYVDDLIITGGNPGEL 913

Query: 817  GRFKKEMSKNFEMSDLGVLSYYLGIEVQQNSSGISICQSAYARKLLDTTGLVDSNPTRTP 876
             +FK+EM   F+MSDLG+L YYLG+EV Q   GI++ Q AYA K+L T G+V SNP+ TP
Sbjct: 914  KQFKEEMKGTFQMSDLGLLQYYLGLEVNQTEDGITVNQRAYAEKILQTAGMVASNPSLTP 973

Query: 877  MEARLQLRKAGTTTTVDSTNYRSIVGSLRYLVNSRPDLAYSVGYVSRFMEAPREEHLVAV 936
            ME RL+L K     +VD+T+YR IVGSLRYLVNSRPDLAYSVGYVSRFME P  EHL AV
Sbjct: 974  METRLKLSKMSNAPSVDATDYRWIVGSLRYLVNSRPDLAYSVGYVSRFMEKPTTEHLAAV 1033

Query: 937  KRILRYVAGTRGWGVRYCAGSEKEKLKLVGYSDSDMAGDVDDRKSTSGMIYFLSGGAICW 996
            KR+LRYVAG+ G+G  Y     K+   LVGYSDSD+AGDVD RKSTSG+ +FL    I W
Sbjct: 1034 KRVLRYVAGSIGYGCHY---KRKKDASLVGYSDSDLAGDVDTRKSTSGVFFFLGDNLITW 1093

Query: 997  QSTKQKVVALSSCEAEYIAASMAATQGIWLARLMEELIGRESDSPMLYVDNKATISLIKN 1056
            QS KQKVVALSSCEAEYIAA+ AA QG+WLARL+ EL G E+D+  L +DN++ I L KN
Sbjct: 1094 QSQKQKVVALSSCEAEYIAATTAACQGVWLARLLAELQGEEADAVTLRIDNQSAIMLSKN 1153

Query: 1057 PVLHDRSKHIETRFHYIRECADRGLIKIDFIRTEEQLGDIFTKSLARVKFEELRSKIGV 1097
            PV HDRSKHI+TR+HYIREC + G +K++FI T EQL DI TKSL R +F ELRS+IG+
Sbjct: 1154 PVFHDRSKHIDTRYHYIRECIEEGRVKVEFIGTNEQLADILTKSLGRDRFMELRSQIGL 1203

BLAST of CmoCh05G000040 vs. NCBI nr
Match: gi|218199506|gb|EEC81933.1| (hypothetical protein OsI_25798 [Oryza sativa Indica Group])

HSP 1 Score: 1245.0 bits (3220), Expect = 0.0e+00
Identity = 626/1079 (58.02%), Positives = 782/1079 (72.47%), Query Frame = 1

Query: 37   PTGEPIQLEEERVFAQIGERDEQHEHQQWILDTGATNHMTGARSAFSELDSGIRGTVKFG 96
            P  + ++L E++VFA + +  + H++++WILDTGA+NHMTG+R+AFS++D+ + G V+ G
Sbjct: 26   PRLDVVELVEQKVFAALDDATD-HDNKRWILDTGASNHMTGSRAAFSDIDTNVTGNVRLG 85

Query: 97   DGSVVEIEGRGTILFVSKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKI 156
            DGS+V I GRGTILF  K GEHR L + Y++P L AN++S+GQLDETG  + +E G++++
Sbjct: 86   DGSLVRIGGRGTILFACKNGEHRMLHNAYYLPCLAANIISVGQLDETGFKVLVEDGVMRV 145

Query: 157  CDNQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKK 216
             D Q RLL +  RT  RL++L++ + +PV L A+  E +WRWHA +GH+NF AL K+ ++
Sbjct: 146  WDEQHRLLARITRTPGRLFVLDINLARPVYLMARAGEDAWRWHACFGHVNFTALRKMGRE 205

Query: 217  ELVHGLPEIKGVNKLCDGCFIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGK 276
             LV GLP +  V ++C+ C  GKQRR PFP +  +RA EPL L+HGD+CGP+ PATP G 
Sbjct: 206  ALVRGLPVLSQVEQVCEACLAGKQRRAPFPQQALHRATEPLALLHGDLCGPVMPATPSGN 265

Query: 277  SLFLLLVDDKSRFMWLTLLQAKSEAAEAIKRIKARAEAECEKKMRVLRTDRGGEFTSASF 336
              F LLVDD SR+MWL LL  K  A +A+KR++A AE +   K+R LRTDRGGEFT   F
Sbjct: 266  RYFPLLVDDYSRYMWLVLLATKDVAPDAMKRVQAAAERKSGSKLRALRTDRGGEFTVGHF 325

Query: 337  NKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFWGEAVMTAVYL 396
             +YC E+G++R LTAPYSPQQNGVVE RNQ++V TARS+L   G+PG FWGEAV TAVYL
Sbjct: 326  TEYCAELGLRRELTAPYSPQQNGVVECRNQSVVSTARSMLKAKGLPGMFWGEAVNTAVYL 385

Query: 397  LNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYE 456
            LNR  ++S+DGKTPYE W    P VHH R FGCVA++KVT P   KLD R   ++F+GYE
Sbjct: 386  LNRCSSKSIDGKTPYELWNRVTPAVHHLRTFGCVAHVKVTAP-TKKLDDRSRPMIFVGYE 445

Query: 457  PGSKAYRLYDPVGGRAHVSRDVVFDESTFWQWNDVIETDRNPNQFTVEYLV--------- 516
            PGSKAYR+YDP   R HVSRDVVFDE   W W+     + + + FT+EY           
Sbjct: 446  PGSKAYRVYDPATRRVHVSRDVVFDEEAQWNWDGEAAANVD-SDFTIEYTTVYHPATATP 505

Query: 517  --TEPEEGGAQHQETSPPPAGAPPE-------PVEFAT-PRTADSTLDADHDTDLEARYR 576
              T  E GGA     SP     P         PVEF + P   +  LDADHD D   R+R
Sbjct: 506  TQTGTEHGGAPASPRSPASGSTPTTPPVAEVLPVEFVSPPPDVEDDLDADHD-DAPLRFR 565

Query: 577  RMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQ 636
            R+DD++G   PPG A REL E  EL AV+A+EP +FAEAE+  CWR+AM EEM SI  N+
Sbjct: 566  RIDDVLGPATPPGQAVRELSE--ELFAVTAEEPASFAEAEQLSCWRQAMIEEMRSIEANK 625

Query: 637  TWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARL 696
            TW L D P     IGLKWV+K K++  G + K+KARLVAKGYVQ+QG+ F+EVFAPVARL
Sbjct: 626  TWRLVDPPARQCPIGLKWVYKAKKDAAGNITKYKARLVAKGYVQRQGIYFDEVFAPVARL 685

Query: 697  ESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALY 756
            ESVR LLA AA   W VHHMDVKSAFLNGEL E VYV QPPGF+ +   +KVLRL KALY
Sbjct: 686  ESVRLLLAHAACEGWAVHHMDVKSAFLNGELLEEVYVAQPPGFVVDGQEHKVLRLDKALY 745

Query: 757  GLRQAPRAWNAKLDSTLLSLNFKRCASEHGMYTYGHGKKRLIVGVYVDDLIITGGDVGVL 816
            GLRQAPRAW +KLD++LLSL F R  SEH +Y  G G++RL+VGVYVDDLIITGG+ G L
Sbjct: 746  GLRQAPRAWYSKLDASLLSLGFHRSDSEHAVYMRGTGEQRLVVGVYVDDLIITGGNPGEL 805

Query: 817  GRFKKEMSKNFEMSDLGVLSYYLGIEVQQNSSGISICQSAYARKLLDTTGLVDSNPTRTP 876
             +FK+EM   F+MSDLG+L YYLG+EV Q   GI++ Q AYA K+L T G+V SNP+ TP
Sbjct: 806  KQFKEEMKGTFQMSDLGLLQYYLGLEVNQTEDGITVNQRAYAEKILQTAGMVASNPSLTP 865

Query: 877  MEARLQLRKAGTTTTVDSTNYRSIVGSLRYLVNSRPDLAYSVGYVSRFMEAPREEHLVAV 936
            ME RL+L K     ++D+T+YR IVGSLRYLVNSRPDLAYSVGYVSRFME P  EHL AV
Sbjct: 866  METRLKLSKMSNAPSIDATDYRWIVGSLRYLVNSRPDLAYSVGYVSRFMEKPTTEHLAAV 925

Query: 937  KRILRYVAGTRGWGVRYCAGSEKEKLKLVGYSDSDMAGDVDDRKSTSGMIYFLSGGAICW 996
            K++LRYVAG+ G+G  Y     K+   LVGYSDSD+AGDVD RKSTSG+ +FL    I W
Sbjct: 926  KQVLRYVAGSIGYGCHY---KRKKDASLVGYSDSDLAGDVDTRKSTSGVFFFLGDNLITW 985

Query: 997  QSTKQKVVALSSCEAEYIAASMAATQGIWLARLMEELIGRESDSPMLYVDNKATISLIKN 1056
            QS KQKVVALSSCEAEYIAA+ AA QG+WLARL+ EL G E+D+  L +DN++ I L KN
Sbjct: 986  QSQKQKVVALSSCEAEYIAATTAACQGVWLARLLAELKGEEADAVTLRIDNQSAIMLSKN 1045

Query: 1057 PVLHDRSKHIETRFHYIRECADRGLIKIDFIRTEEQLGDIFTKSLARVKFEELRSKIGV 1097
            PV HDRSKHI+TR+HYIREC + G +K++FI T EQL DI TKSL R +F ELRS+IG+
Sbjct: 1046 PVFHDRSKHIDTRYHYIRECIEEGRVKVEFIGTNEQLADILTKSLGRDRFMELRSQIGL 1095
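
The identity and positives percentages in the HSP summary lines above are the reported counts divided by the alignment length (1079 columns, gaps included). The following is a minimal sketch, assuming the summary line keeps the layout shown in this report, that parses such a line and recomputes both percentages. The names `HSP_STATS` and `parse_hsp_stats` are illustrative only and are not part of any BLAST tool.

```python
import re

# Matches the HSP summary line of the report. "Pos\w*" accepts both
# "Positives" and the "Postives" spelling that some report versions print.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w*\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, length, _, positives, pos_length, _ = match.groups()
    identical, length = int(identical), int(length)
    positives, pos_length = int(positives), int(pos_length)
    return {
        "identical": identical,
        "positives": positives,
        "aligned_length": length,
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / pos_length, 2),
    }


if __name__ == "__main__":
    line = "Identity = 630/1079 (58.39%), Positives = 783/1079 (72.57%), Query Frame = 1"
    print(parse_hsp_stats(line))
```

Recomputing the percentages from the counts gives a quick consistency check between the summary line and the identity column of the hit tables.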

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
POLX_TOBAC   3.2e-190  38.21         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
COPIA_DROME  3.9e-95   36.00         Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3
YCH4_YEAST   7.7e-51   37.38         Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT...
M810_ARATH   1.0e-34   36.00         Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg0...
YD14B_YEAST  3.2e-25   24.71         Transposon Ty1-DR5 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ...

Match Name    E-value  Identity (%)  Description
Q7XPB1_ORYSJ  0.0e+00  62.95         OSJNBb0026E15.10 protein OS=Oryza sativa subsp. japonica GN=OSJNBb0026E15.10 PE=...
Q0J5Y3_ORYSJ  0.0e+00  59.09         Os08g0389500 protein OS=Oryza sativa subsp. japonica GN=Os08g0389500 PE=4 SV=1
Q7XTU6_ORYSJ  0.0e+00  57.30         OSJNBb0034I13.10 protein OS=Oryza sativa subsp. japonica GN=OSJNBb0034I13.10 PE=...
Q7XEA3_ORYSJ  0.0e+00  56.83         Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. jap...
Q7FAB9_ORYSJ  0.0e+00  56.84         OSJNBa0033H08.2 protein OS=Oryza sativa subsp. japonica GN=OSJNBa0033H08.2 PE=4 ...

Match Name   E-value   Identity (%)  Description
AT4G23160.1  1.1e-100  39.29         cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1  5.7e-36   36.00         ATMG00810.1 DNA/RNA polymerases superfamily protein
ATMG00820.1  5.9e-17   42.42         ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)
ATMG00710.1  2.2e-11   37.89         ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein
ATMG00240.1  7.7e-09   41.77         ATMG00240.1 Gag-Pol-related retrotransposon family protein

Match Name                    E-value  Identity (%)  Description
gi|38344222|emb|CAE03692.2|   0.0e+00  62.95         OSJNBb0026E15.10 [Oryza sativa Japonica Group]
gi|116634828|emb|CAH66352.1|  0.0e+00  61.65         OSIGBa0135C09.3 [Oryza sativa Indica Group]
gi|113623687|dbj|BAF23632.1|  0.0e+00  59.09         Os08g0389500 [Oryza sativa Japonica Group]
gi|113611032|dbj|BAF21410.1|  0.0e+00  58.39         Os07g0434200, partial [Oryza sativa Japonica Group]
gi|218199506|gb|EEC81933.1|   0.0e+00  58.02         hypothetical protein OsI_25798 [Oryza sativa Indica Group]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001584   Integrase_cat-core
IPR012337   RNaseH-like_sf
IPR013103   RVT_2
IPR025724   GAG-pre-integrase_dom

Vocabulary: Biological Process
Term        Definition
GO:0015074  DNA integration

Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
cellular_component  GO:0005739      mitochondrion
cellular_component  GO:0009536      plastid
molecular_function  GO:0046872      metal ion binding
molecular_function  GO:0003676      nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh05G000040.1  CmoCh05G000040.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                                     Source   Source Term        Source Description               Alignment
IPR001584  Integrase, catalytic core                           PFAM     PF00665            rve                              coord: 254..368, score: 1.0
IPR001584  Integrase, catalytic core                           PROFILE  PS50994            INTEGRASE                        coord: 252..418, score: 27
IPR012337  Ribonuclease H-like domain                          GENE3D   G3DSA:3.30.420.10                                   coord: 256..422, score: 5.9
IPR012337  Ribonuclease H-like domain                          unknown  SSF53098           Ribonuclease H-like              coord: 255..427, score: 7.59
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase PFAM     PF07727            RVT_2                            coord: 616..860, score: 4.7E
IPR025724  GAG-pre-integrase domain                            PFAM     PF13976            gag_pre-integrs                  coord: 173..240, score: 2.5
None       No IPR available                                    unknown  Coil               Coil                             coord: 295..315, scor
None       No IPR available                                    PANTHER  PTHR11439          GAG-POL-RELATED RETROTRANSPOSON  coord: 42..1021, score:
None       No IPR available                                    PANTHER  PTHR11439:SF127    SUBFAMILY NOT NAMED              coord: 42..1021, score:
None       No IPR available                                    unknown  SSF56672           DNA/RNA polymerases              coord: 616..838, score: 2.62E-25
                                                                                                                            coord: 865..1052, score: 2.62
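
The `coord: start..end` values in the InterPro annotations above give each domain's position on the protein. As a minimal sketch, the snippet below extracts those ranges and orders the three Pfam hits from N- to C-terminus. The helper names (`parse_coords`, `domain_order`) and the `PFAM_HITS` dictionary are hypothetical; only the coordinates are taken from the table.

```python
import re

# "coord: 254..368" style ranges as they appear in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) range found in an Alignment cell."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def domain_order(hits):
    """Return domain names sorted by start coordinate along the protein."""
    spans = [
        (start, end, name)
        for name, cell in hits.items()
        for start, end in parse_coords(cell)
    ]
    return [name for _start, _end, name in sorted(spans)]


# Pfam rows copied from the annotation table (coordinates only).
PFAM_HITS = {
    "rve (PF00665)": "coord: 254..368",
    "RVT_2 (PF07727)": "coord: 616..860",
    "gag_pre-integrs (PF13976)": "coord: 173..240",
}

if __name__ == "__main__":
    for name in domain_order(PFAM_HITS):
        print(name, parse_coords(PFAM_HITS[name]))
```

Sorted this way, the Pfam hits run gag_pre-integrs, rve, RVT_2 along the protein.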

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None