CmoCh13G003440 (gene) Cucurbita moschata (Rifu)

Name: CmoCh13G003440
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Retrotransposon protein, putative, Ty3-gypsy subclass
Location: Cmo_Chr13 : 4417165 .. 4420958 (-)
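The location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome feature records, but not stated on this page); the function names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re

def parse_location(text):
    """Split a 'chrom : start .. end (strand)' string into its parts."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Cmo_Chr13 : 4417165 .. 4420958 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 3794 bp on the minus strand of Cmo_Chr13.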
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGGATGTGTATTGATTACAGAGAATTAAACAAGGTCACTATAAAGAATAAGTACCCTCTCCCGAGAATTGACGACCTGTTTGATCAGCTACAGGGAGCAACAGTTTTCTCCAAAATTGACCTAAGGTCGGGGTATCATCAGTTGAGGGTTAGAGAAGGGGATATACTCAAGACAGCCTTCAGAAGTCGTTATGGGCATTACGAGTTTCTTGTTATGTCTTTTGGCCTTACCAATGCACCAGCTATTTTCATGGAGCTAATGAACAGGGTGTTTAAGGAATTCTTAGACACCTTTGTCATAGTGTTCATCGACAACATTCTGGTATACTCTAAGTCAGAGGTAGATCATGAAATACACCTCAGAAAAGTCTTGACAATACTGAGAGCTCAGCAGTTGTATGCCAAGTTCTCCAAGTGTGAGTTTTGGTTGTCTGAAGTTGCGTTTCTGGGTCACGTGGTGTCAAGCAAGGGGATCACAGTGGACCCAGCTAAGATAGAAGCAGTGATGAGGTGGCCACAGCCGACCACAGTCACAGAGGTGAGGAGTTTTCTTGGGCTAGCTGGTTATTACAGAAGGTTTGTTCAGGATTTCTCCAAAATTTCCTCGGCGCTGACTCAGCTAACCAAGAAGGGCAAGCCCTTTGCTTGGACTCCAGTCTGCGAACAAAGTTTCCAGGAACTCAAGAAGAGGTTGGTGACTGCACCAGTCCTTACAGTTCCAGACGGGTCAGGTAATCTCGTGGTGTACAGTGATGCATCAGGGAAAGGCTTGGGGTGTGTGCTCATGCAGAAAGGTAAGGTGATAGCGTATGCTTCTCGACAATTGAAAGAATATGAACGAAACTACCCCACGCATGATCTCGAGTTAGCAGCGGTAGTGTTTGCTCTAAAAACGTGGCGACACTACCTGTATGGGGAAAAAGTACAAGTCTTCACTGATCATAAGAGCCTCAAGTACTTATTCACGCAGAAGGAGCTCAATATGAGACAAAGGCGATGGTTGGAGCTGGTAAAGGATTATGACATAGAGATTCTGTACCATCCAGGCAAAGCCAACGTGGTAGCAGCATGCATTGAGCAGGAAGGCTGTGCACACTTCTGTGATGATCACCACACAGGAAAAACTACAAGATGAGATGAAGAGGGCTGGGATAGACGTGGTGATTAAAGGTGGTAGTGTTCAGATAGCACAGTTAACTATACAGCCTACCCTACGAAAGAAATTTATCGACGCTCAGAGGTCTGATGAACACCTCAGTAAAGTGTGGAGTCAGATTGAGACAGAGAGGCCAGCAGGGTATTCTATCTCCCTAGACGGGGGTCTGCTATGGCAAAACCGCCTCTGCGTTCCCCGAGACGAGGGAATCTTAAAAGATATTATGACCGAAGCCCACGATACATCTTATGTGTTCCACCCTGGAAGTACAAAGATGTATCAGGATCTGAAGAGATTTTACTGGTGGTCTGGAATGAAGAGGGACATAGCGGATTTCGTAAGCCGTTGCTTGACCTGCCAGCAGGTGAAGGCCCCGAGGCAGCGCCCAGCAGGATTGCTACAGCCCCTGAGCGTCCCTCAGTGGAAATGGGTAGCAGTCTGTATGGATTTCATTTCGGGTTTGCCAAAGACAAAGCAGGGTTTCAACGTCATCTGGGTAATTGTGGACAGACTGACTAAGACAGCCCACTTCATTCTAGGAAAGTCCACATATCGAGTAGACCGGTGGGCTCAGTTATATATCAAGGAGATAGTACGCCTGCACGGGGTACCAGTGTCCATAGTATCAGACCGGGACACCAGGTTCACCTCTCAGTTCTGGAGGAGTCTTCAGAAGGCACTAAGAACTCAGTTGAGGTTCAGTACAGCATTCCATCCTCAAACGGACGGACAGACCGAAAGGCTGAATCAGGTTTTAGAGGACATGTTGCGAGCCTGCTCCTTAGATTTCGCTGGGTGTTGGGACGAACATCTGCCTTTAATGGAGTTTGCCTACAACAATA
GTTATCAAGCGACCATTCAGATGGCCCCCTTCGAGGCACTGTATGGGCGTAGGTGTCGAACACCAGTGTTTTGGGAAGAGGTAGGCACGCAGCAGCTAATGGGACCAGAGTTGGTCCAGGTCACCAACGCAGCGGTGCAGAAAATCAAACAGAGGATACTCACCGCACAGAGTCGACAGAAAAGCTATGCAGATGTGCGTAGAAGGAACCTCGAATTTGAGGTGGGTGACCACGTGTTCCTGAAGGTGGCCCCTATGAGGGGGGTGTGGAGGTTCGGGAAGGAAGGGAAATTGAGCCCAAGGTTCATAGGCCCCTTCGAGATTCTGGACAGAGTCGGGGCTGTGGCTTACAGAATTGCCCTACCACCGAACCTTGCCGCCGTGCACAATGTGTTCCACGTATCCATGCTGCGAAAGTATACTCCCGACCCTACNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTGAATTGTGTAAATTACATGAGTGTTGCTTGTATGCCTTAGGCTTCCGCCGTATGTATGCATGCGTTATTTTATGTGTAGAATTGAGTATGCCGGGTTTCGGGTGTTTGATGGACGAATTCGCCCCATTGAGCGCGAACTTCGTGCTCTGGGTGCTTGA

mRNA sequence

ATGCGGATGTGTATTGATTACAGAGAATTAAACAAGGTCACTATAAAGAATAAGTACCCTCTCCCGAGAATTGACGACCTGTTTGATCAGCTACAGGGAGCAACAGTTTTCTCCAAAATTGACCTAAGGTCGGGGTATCATCAGTTGAGGGTTAGAGAAGGGGATATACTCAAGACAGCCTTCAGAAGTCGTTATGGGCATTACGAGTTTCTTGTTATGTCTTTTGGCCTTACCAATGCACCAGCTATTTTCATGGAGCTAATGAACAGGGTGTTTAAGGAATTCTTAGACACCTTTGTCATAGTGTTCATCGACAACATTCTGGTATACTCTAAGTCAGAGGTAGATCATGAAATACACCTCAGAAAAGTCTTGACAATACTGAGAGCTCAGCAGTTGTATGCCAAGTTCTCCAAGTGTGAGTTTTGGTTGTCTGAAGTTGCGTTTCTGGGTCACGTGGTGTCAAGCAAGGGGATCACAGTGGACCCAGCTAAGATAGAAGCAGTGATGAGGTGGCCACAGCCGACCACAGTCACAGAGGTGAGGAGTTTTCTTGGGCTAGCTGGTTATTACAGAAGGTTTGTTCAGGATTTCTCCAAAATTTCCTCGGCGCTGACTCAGCTAACCAAGAAGGGCAAGCCCTTTGCTTGGACTCCAGTCTGCGAACAAAGTTTCCAGGAACTCAAGAAGAGGTTGGTGACTGCACCAGTCCTTACAGTTCCAGACGGGTCAGGTAATCTCGTGGTGTACAGTGATGCATCAGGGAAAGGCTTGGGGTGTGTGCTCATGCAGAAAGGTAAGGTGATAGCGTATGCTTCTCGACAATTGAAAGAATATGAACGAAACTACCCCACGCATGATCTCGAGTTAGCAGCGGTAGTGTTTGCTCTAAAAACGTGGCGACACTACCTGTATGGGGAAAAAGTACAAGTCTTCACTGATCATAAGAGCCTCAAGTACTTATTCACGCAGAAGGAGCTCAATATGAGACAAAGGCGATGGTTGGAGCTGGTAAAGGATTATGACATAGAGATTCTGTACCATCCAGGCAAAGCCAACGTGGAAAAACTACAAGATGAGATGAAGAGGGCTGGGATAGACGTGGTGATTAAAGGTGGTAGTGTTCAGATAGCACAGTTAACTATACAGCCTACCCTACGAAAGAAATTTATCGACGCTCAGAGGTCTGATGAACACCTCAGTAAAGTGTGGAGTCAGATTGAGACAGAGAGGCCAGCAGGGTATTCTATCTCCCTAGACGGGGGTCTGCTATGGCAAAACCGCCTCTGCGTTCCCCGAGACGAGGGAATCTTAAAAGATATTATGACCGAAGCCCACGATACATCTTATGTGTTCCACCCTGGAAGTACAAAGATGTATCAGGATCTGAAGAGATTTTACTGGTGGTCTGGAATGAAGAGGGACATAGCGGATTTCGTAAGCCGTTGCTTGACCTGCCAGCAGGTGAAGGCCCCGAGGCAGCGCCCAGCAGGATTGCTACAGCCCCTGAGCGTCCCTCAGTGGAAATGGGTAGCAGTCTGTATGGATTTCATTTCGGGTTTGCCAAAGACAAAGCAGGGTTTCAACGTCATCTGGGTAATTGTGGACAGACTGACTAAGACAGCCCACTTCATTCTAGGAAAGTCCACATATCGAGTAGACCGGTGGGCTCAGTTATATATCAAGGAGATAGTACGCCTGCACGGGGTACCAGTGTCCATAGTATCAGACCGGGACACCAGGTTCACCTCTCAGTTCTGGAGGAGTCTTCAGAAGGCACTAAGAACTCAGTTGAGGTTCAGTACAGCATTCCATCCTCAAACGGACGGACAGACCGAAAGGCTGAATCAGGTTTTAGAGGACATGTTGCGAGCCTGCTCCTTAGATTTCGCTGGGTGTTGGGACGAACATCTGCCTTTAATGGAGTTTGCCTACAACAATAGTTATCAAGCGACCATTCAGATGGCCCCCTTCGAGGCACTGTATGGGCGTAGGTGTCG
AACACCAGTGTTTTGGGAAGAGGTAGGCACGCAGCAGCTAATGGGACCAGAGTTGGTCCAGGTCACCAACGCAGCGGTGCAGAAAATCAAACAGAGGATACTCACCGCACAGAGTCGACAGAAAAGCTATGCAGATGTGCGTAGAAGGAACCTCGAATTTGAGGTGGGTGACCACGTGTTCCTGAAGGTGGCCCCTATGAGGGGGGTGTGGAGGTTCGGGAAGGAAGGGAAATTGAGCCCAAGGTTCATAGGCCCCTTCGAGATTCTGGACAGAGTCGGGGCTGTGGCTTACAGAATTGCCCTACCACCGAACCTTGCCGCCGTGCACAATGTGTTCCACGTATCCATGCTGCGAAAAATTGAGTATGCCGGGTTTCGGGTGTTTGATGGACGAATTCGCCCCATTGAGCGCGAACTTCGTGCTCTGGGTGCTTGA

Coding sequence (CDS)

ATGCGGATGTGTATTGATTACAGAGAATTAAACAAGGTCACTATAAAGAATAAGTACCCTCTCCCGAGAATTGACGACCTGTTTGATCAGCTACAGGGAGCAACAGTTTTCTCCAAAATTGACCTAAGGTCGGGGTATCATCAGTTGAGGGTTAGAGAAGGGGATATACTCAAGACAGCCTTCAGAAGTCGTTATGGGCATTACGAGTTTCTTGTTATGTCTTTTGGCCTTACCAATGCACCAGCTATTTTCATGGAGCTAATGAACAGGGTGTTTAAGGAATTCTTAGACACCTTTGTCATAGTGTTCATCGACAACATTCTGGTATACTCTAAGTCAGAGGTAGATCATGAAATACACCTCAGAAAAGTCTTGACAATACTGAGAGCTCAGCAGTTGTATGCCAAGTTCTCCAAGTGTGAGTTTTGGTTGTCTGAAGTTGCGTTTCTGGGTCACGTGGTGTCAAGCAAGGGGATCACAGTGGACCCAGCTAAGATAGAAGCAGTGATGAGGTGGCCACAGCCGACCACAGTCACAGAGGTGAGGAGTTTTCTTGGGCTAGCTGGTTATTACAGAAGGTTTGTTCAGGATTTCTCCAAAATTTCCTCGGCGCTGACTCAGCTAACCAAGAAGGGCAAGCCCTTTGCTTGGACTCCAGTCTGCGAACAAAGTTTCCAGGAACTCAAGAAGAGGTTGGTGACTGCACCAGTCCTTACAGTTCCAGACGGGTCAGGTAATCTCGTGGTGTACAGTGATGCATCAGGGAAAGGCTTGGGGTGTGTGCTCATGCAGAAAGGTAAGGTGATAGCGTATGCTTCTCGACAATTGAAAGAATATGAACGAAACTACCCCACGCATGATCTCGAGTTAGCAGCGGTAGTGTTTGCTCTAAAAACGTGGCGACACTACCTGTATGGGGAAAAAGTACAAGTCTTCACTGATCATAAGAGCCTCAAGTACTTATTCACGCAGAAGGAGCTCAATATGAGACAAAGGCGATGGTTGGAGCTGGTAAAGGATTATGACATAGAGATTCTGTACCATCCAGGCAAAGCCAACGTGGAAAAACTACAAGATGAGATGAAGAGGGCTGGGATAGACGTGGTGATTAAAGGTGGTAGTGTTCAGATAGCACAGTTAACTATACAGCCTACCCTACGAAAGAAATTTATCGACGCTCAGAGGTCTGATGAACACCTCAGTAAAGTGTGGAGTCAGATTGAGACAGAGAGGCCAGCAGGGTATTCTATCTCCCTAGACGGGGGTCTGCTATGGCAAAACCGCCTCTGCGTTCCCCGAGACGAGGGAATCTTAAAAGATATTATGACCGAAGCCCACGATACATCTTATGTGTTCCACCCTGGAAGTACAAAGATGTATCAGGATCTGAAGAGATTTTACTGGTGGTCTGGAATGAAGAGGGACATAGCGGATTTCGTAAGCCGTTGCTTGACCTGCCAGCAGGTGAAGGCCCCGAGGCAGCGCCCAGCAGGATTGCTACAGCCCCTGAGCGTCCCTCAGTGGAAATGGGTAGCAGTCTGTATGGATTTCATTTCGGGTTTGCCAAAGACAAAGCAGGGTTTCAACGTCATCTGGGTAATTGTGGACAGACTGACTAAGACAGCCCACTTCATTCTAGGAAAGTCCACATATCGAGTAGACCGGTGGGCTCAGTTATATATCAAGGAGATAGTACGCCTGCACGGGGTACCAGTGTCCATAGTATCAGACCGGGACACCAGGTTCACCTCTCAGTTCTGGAGGAGTCTTCAGAAGGCACTAAGAACTCAGTTGAGGTTCAGTACAGCATTCCATCCTCAAACGGACGGACAGACCGAAAGGCTGAATCAGGTTTTAGAGGACATGTTGCGAGCCTGCTCCTTAGATTTCGCTGGGTGTTGGGACGAACATCTGCCTTTAATGGAGTTTGCCTACAACAATAGTTATCAAGCGACCATTCAGATGGCCCCCTTCGAGGCACTGTATGGGCGTAGGTGTCG
AACACCAGTGTTTTGGGAAGAGGTAGGCACGCAGCAGCTAATGGGACCAGAGTTGGTCCAGGTCACCAACGCAGCGGTGCAGAAAATCAAACAGAGGATACTCACCGCACAGAGTCGACAGAAAAGCTATGCAGATGTGCGTAGAAGGAACCTCGAATTTGAGGTGGGTGACCACGTGTTCCTGAAGGTGGCCCCTATGAGGGGGGTGTGGAGGTTCGGGAAGGAAGGGAAATTGAGCCCAAGGTTCATAGGCCCCTTCGAGATTCTGGACAGAGTCGGGGCTGTGGCTTACAGAATTGCCCTACCACCGAACCTTGCCGCCGTGCACAATGTGTTCCACGTATCCATGCTGCGAAAAATTGAGTATGCCGGGTTTCGGGTGTTTGATGGACGAATTCGCCCCATTGAGCGCGAACTTCGTGCTCTGGGTGCTTGA
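
As a quick consistency check, the start of the coding sequence can be translated and compared with the protein query used in the BLAST alignments. This is a minimal sketch that assumes the standard nuclear genetic code; `translate` is a hypothetical helper, not part of the database, and codons containing ambiguous bases are reported as `X`.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[pos:pos + 3], "X") for pos in range(0, usable, 3)
    )

# First 39 nucleotides of the CDS listed above.
cds_start = "ATGCGGATGTGTATTGATTACAGAGAATTAAACAAGGTC"
print(translate(cds_start))  # MRMCIDYRELNKV
```

The translated prefix agrees with the first residues of the query in the alignments (`MRMCIDYRELNKV...`), and the final codon of the CDS, `TGA`, translates to a stop.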
BLAST of CmoCh13G003440 vs. Swiss-Prot
Match: TF211_SCHPO (Transposon Tf2-11 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-11 PE=3 SV=1)

HSP 1 Score: 461.8 bits (1187), Expect = 1.5e-128
Identity = 266/808 (32.92%), Positives = 440/808 (54.46%), Query Frame = 1

Query: 1    MRMCIDYRELNKVTIKNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVREGDILKTA 60
            +RM +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +RVR+GD  K A
Sbjct: 462  LRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLA 521

Query: 61   FRSRYGHYEFLVMSFGLTNAPAIFMELMNRVFKEFLDTFVIVFIDNILVYSKSEVDHEIH 120
            FR   G +E+LVM +G++ APA F   +N +  E  ++ V+ ++DNIL++SKSE +H  H
Sbjct: 522  FRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKH 581

Query: 121  LRKVLTILRAQQLYAKFSKCEFWLSEVAFLGHVVSSKGITVDPAKIEAVMRWPQPTTVTE 180
            ++ VL  L+   L    +KCEF  S+V F+G+ +S KG T     I+ V++W QP    E
Sbjct: 582  VKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKE 641

Query: 181  VRSFLGLAGYYRRFVQDFSKISSALTQLTKKGKPFAWTPVCEQSFQELKKRLVTAPVLTV 240
            +R FLG   Y R+F+   S+++  L  L KK   + WTP   Q+ + +K+ LV+ PVL  
Sbjct: 642  LRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRH 701

Query: 241  PDGSGNLVVYSDASGKGLGCVLMQKGK-----VIAYASRQLKEYERNYPTHDLELAAVVF 300
             D S  +++ +DAS   +G VL QK        + Y S ++ + + NY   D E+ A++ 
Sbjct: 702  FDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIK 761

Query: 301  ALKTWRHYLYG--EKVQVFTDHKSLKYLFTQKE--LNMRQRRWLELVKDYDIEILYHPGK 360
            +LK WRHYL    E  ++ TDH++L    T +    N R  RW   ++D++ EI Y PG 
Sbjct: 762  SLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGS 821

Query: 361  AN-----VEKLQDEMKRAGIDVVIKGGSVQ-IAQLTIQPTLRKKFIDAQRSDEHLSKVWS 420
            AN     + ++ DE +    D   +  S+  + Q++I    + + +    +D  L  + +
Sbjct: 822  ANHIADALSRIVDETEPIPKD--SEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLN 881

Query: 421  QIETERPAGYSISLDGGLL--WQNRLCVPRDEGILKDIMTEAHDTSYVFHPGSTKMYQDL 480
                ++    +I L  GLL   ++++ +P D  + + I+ + H+   + HPG   +   +
Sbjct: 882  --NEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNII 941

Query: 481  KRFYWWSGMKRDIADFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWVAVCMDFISGLPK 540
             R + W G+++ I ++V  C TCQ  K+   +P G LQP+   +  W ++ MDFI+ LP+
Sbjct: 942  LRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPE 1001

Query: 541  TKQGFNVIWVIVDRLTKTAHFILGKSTYRVDRWAQLYIKEIVRLHGVPVSIVSDRDTRFT 600
            +  G+N ++V+VDR +K A  +    +   ++ A+++ + ++   G P  I++D D  FT
Sbjct: 1002 S-SGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFT 1061

Query: 601  SQFWRSLQKALRTQLRFSTAFHPQTDGQTERLNQVLEDMLRACSLDFAGCWDEHLPLMEF 660
            SQ W+         ++FS  + PQTDGQTER NQ +E +LR         W +H+ L++ 
Sbjct: 1062 SQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQ 1121

Query: 661  AYNNSYQATIQMAPFEALYGRRCRTPVFWEEVGTQQLMGPELVQVTNAAVQKIKQRILTA 720
            +YNN+  +  QM PFE ++  R    +   E+ +      E  Q T    Q +K+ + T 
Sbjct: 1122 SYNNAIHSATQMTPFEIVH--RYSPALSPLELPSFSDKTDENSQETIQVFQTVKEHLNTN 1181

Query: 721  QSRQKSYADVRRRNL-EFEVGDHVFLKVAPMRGVWRFGKEGKLSPRFIGPFEILDRVGAV 780
              + K Y D++ + + EF+ GD V +K           K  KL+P F GPF +L + G  
Sbjct: 1182 NIKMKKYFDMKIQEIEEFQPGDLVMVK---RTKTGFLHKSNKLAPSFAGPFYVLQKSGPN 1241

Query: 781  AYRIALPPNLAAV-HNVFHVSMLRKIEY 790
             Y + LP ++  +  + FHVS L K  +
Sbjct: 1242 NYELDLPDSIKHMFSSTFHVSHLEKYRH 1259
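
The percentages on an HSP summary line are simply the match counts divided by the alignment length. The sketch below recomputes them from the counts; `hsp_percentages` is a hypothetical helper written for this illustration, and the pattern accepts both the spelling "Positives" and the variant "Postives" that some BLAST report renderings emit.

```python
import re

# Matches e.g. "Identity = 266/808 (32.92%)"; tolerates the "Postives" variant.
HSP_FIELD = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def hsp_percentages(line):
    """Return {label: (matches, length, reported_pct, recomputed_pct)}."""
    result = {}
    for label, matches, length, reported in HSP_FIELD.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        recomputed = round(100 * int(matches) / int(length), 2)
        result[key] = (int(matches), int(length), float(reported), recomputed)
    return result

line = "Identity = 266/808 (32.92%), Positives = 440/808 (54.46%), Query Frame = 1"
for label, values in hsp_percentages(line).items():
    print(label, values)
```

For the hit above, 266 identical positions over an 808-column alignment gives 32.92%, and 440 conservative-or-identical positions gives 54.46%, matching the reported values.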

BLAST of CmoCh13G003440 vs. Swiss-Prot
Match: TF27_SCHPO (Transposon Tf2-7 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-7 PE=3 SV=1)

HSP 1 Score: 461.8 bits (1187), Expect = 1.5e-128
Identity = 266/808 (32.92%), Positives = 440/808 (54.46%), Query Frame = 1

Query: 1    MRMCIDYRELNKVTIKNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVREGDILKTA 60
            +RM +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +RVR+GD  K A
Sbjct: 462  LRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLA 521

Query: 61   FRSRYGHYEFLVMSFGLTNAPAIFMELMNRVFKEFLDTFVIVFIDNILVYSKSEVDHEIH 120
            FR   G +E+LVM +G++ APA F   +N +  E  ++ V+ ++DNIL++SKSE +H  H
Sbjct: 522  FRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKH 581

Query: 121  LRKVLTILRAQQLYAKFSKCEFWLSEVAFLGHVVSSKGITVDPAKIEAVMRWPQPTTVTE 180
            ++ VL  L+   L    +KCEF  S+V F+G+ +S KG T     I+ V++W QP    E
Sbjct: 582  VKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKE 641

Query: 181  VRSFLGLAGYYRRFVQDFSKISSALTQLTKKGKPFAWTPVCEQSFQELKKRLVTAPVLTV 240
            +R FLG   Y R+F+   S+++  L  L KK   + WTP   Q+ + +K+ LV+ PVL  
Sbjct: 642  LRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRH 701

Query: 241  PDGSGNLVVYSDASGKGLGCVLMQKGK-----VIAYASRQLKEYERNYPTHDLELAAVVF 300
             D S  +++ +DAS   +G VL QK        + Y S ++ + + NY   D E+ A++ 
Sbjct: 702  FDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIK 761

Query: 301  ALKTWRHYLYG--EKVQVFTDHKSLKYLFTQKE--LNMRQRRWLELVKDYDIEILYHPGK 360
            +LK WRHYL    E  ++ TDH++L    T +    N R  RW   ++D++ EI Y PG 
Sbjct: 762  SLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGS 821

Query: 361  AN-----VEKLQDEMKRAGIDVVIKGGSVQ-IAQLTIQPTLRKKFIDAQRSDEHLSKVWS 420
            AN     + ++ DE +    D   +  S+  + Q++I    + + +    +D  L  + +
Sbjct: 822  ANHIADALSRIVDETEPIPKD--SEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLN 881

Query: 421  QIETERPAGYSISLDGGLL--WQNRLCVPRDEGILKDIMTEAHDTSYVFHPGSTKMYQDL 480
                ++    +I L  GLL   ++++ +P D  + + I+ + H+   + HPG   +   +
Sbjct: 882  --NEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNII 941

Query: 481  KRFYWWSGMKRDIADFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWVAVCMDFISGLPK 540
             R + W G+++ I ++V  C TCQ  K+   +P G LQP+   +  W ++ MDFI+ LP+
Sbjct: 942  LRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPE 1001

Query: 541  TKQGFNVIWVIVDRLTKTAHFILGKSTYRVDRWAQLYIKEIVRLHGVPVSIVSDRDTRFT 600
            +  G+N ++V+VDR +K A  +    +   ++ A+++ + ++   G P  I++D D  FT
Sbjct: 1002 S-SGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFT 1061

Query: 601  SQFWRSLQKALRTQLRFSTAFHPQTDGQTERLNQVLEDMLRACSLDFAGCWDEHLPLMEF 660
            SQ W+         ++FS  + PQTDGQTER NQ +E +LR         W +H+ L++ 
Sbjct: 1062 SQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQ 1121

Query: 661  AYNNSYQATIQMAPFEALYGRRCRTPVFWEEVGTQQLMGPELVQVTNAAVQKIKQRILTA 720
            +YNN+  +  QM PFE ++  R    +   E+ +      E  Q T    Q +K+ + T 
Sbjct: 1122 SYNNAIHSATQMTPFEIVH--RYSPALSPLELPSFSDKTDENSQETIQVFQTVKEHLNTN 1181

Query: 721  QSRQKSYADVRRRNL-EFEVGDHVFLKVAPMRGVWRFGKEGKLSPRFIGPFEILDRVGAV 780
              + K Y D++ + + EF+ GD V +K           K  KL+P F GPF +L + G  
Sbjct: 1182 NIKMKKYFDMKIQEIEEFQPGDLVMVK---RTKTGFLHKSNKLAPSFAGPFYVLQKSGPN 1241

Query: 781  AYRIALPPNLAAV-HNVFHVSMLRKIEY 790
             Y + LP ++  +  + FHVS L K  +
Sbjct: 1242 NYELDLPDSIKHMFSSTFHVSHLEKYRH 1259

BLAST of CmoCh13G003440 vs. Swiss-Prot
Match: TF28_SCHPO (Transposon Tf2-8 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-8 PE=3 SV=1)

HSP 1 Score: 461.8 bits (1187), Expect = 1.5e-128
Identity = 266/808 (32.92%), Positives = 440/808 (54.46%), Query Frame = 1

Query: 1    MRMCIDYRELNKVTIKNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVREGDILKTA 60
            +RM +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +RVR+GD  K A
Sbjct: 462  LRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLA 521

Query: 61   FRSRYGHYEFLVMSFGLTNAPAIFMELMNRVFKEFLDTFVIVFIDNILVYSKSEVDHEIH 120
            FR   G +E+LVM +G++ APA F   +N +  E  ++ V+ ++DNIL++SKSE +H  H
Sbjct: 522  FRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKH 581

Query: 121  LRKVLTILRAQQLYAKFSKCEFWLSEVAFLGHVVSSKGITVDPAKIEAVMRWPQPTTVTE 180
            ++ VL  L+   L    +KCEF  S+V F+G+ +S KG T     I+ V++W QP    E
Sbjct: 582  VKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKE 641

Query: 181  VRSFLGLAGYYRRFVQDFSKISSALTQLTKKGKPFAWTPVCEQSFQELKKRLVTAPVLTV 240
            +R FLG   Y R+F+   S+++  L  L KK   + WTP   Q+ + +K+ LV+ PVL  
Sbjct: 642  LRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRH 701

Query: 241  PDGSGNLVVYSDASGKGLGCVLMQKGK-----VIAYASRQLKEYERNYPTHDLELAAVVF 300
             D S  +++ +DAS   +G VL QK        + Y S ++ + + NY   D E+ A++ 
Sbjct: 702  FDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIK 761

Query: 301  ALKTWRHYLYG--EKVQVFTDHKSLKYLFTQKE--LNMRQRRWLELVKDYDIEILYHPGK 360
            +LK WRHYL    E  ++ TDH++L    T +    N R  RW   ++D++ EI Y PG 
Sbjct: 762  SLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGS 821

Query: 361  AN-----VEKLQDEMKRAGIDVVIKGGSVQ-IAQLTIQPTLRKKFIDAQRSDEHLSKVWS 420
            AN     + ++ DE +    D   +  S+  + Q++I    + + +    +D  L  + +
Sbjct: 822  ANHIADALSRIVDETEPIPKD--SEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLN 881

Query: 421  QIETERPAGYSISLDGGLL--WQNRLCVPRDEGILKDIMTEAHDTSYVFHPGSTKMYQDL 480
                ++    +I L  GLL   ++++ +P D  + + I+ + H+   + HPG   +   +
Sbjct: 882  --NEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNII 941

Query: 481  KRFYWWSGMKRDIADFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWVAVCMDFISGLPK 540
             R + W G+++ I ++V  C TCQ  K+   +P G LQP+   +  W ++ MDFI+ LP+
Sbjct: 942  LRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPE 1001

Query: 541  TKQGFNVIWVIVDRLTKTAHFILGKSTYRVDRWAQLYIKEIVRLHGVPVSIVSDRDTRFT 600
            +  G+N ++V+VDR +K A  +    +   ++ A+++ + ++   G P  I++D D  FT
Sbjct: 1002 S-SGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFT 1061

Query: 601  SQFWRSLQKALRTQLRFSTAFHPQTDGQTERLNQVLEDMLRACSLDFAGCWDEHLPLMEF 660
            SQ W+         ++FS  + PQTDGQTER NQ +E +LR         W +H+ L++ 
Sbjct: 1062 SQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQ 1121

Query: 661  AYNNSYQATIQMAPFEALYGRRCRTPVFWEEVGTQQLMGPELVQVTNAAVQKIKQRILTA 720
            +YNN+  +  QM PFE ++  R    +   E+ +      E  Q T    Q +K+ + T 
Sbjct: 1122 SYNNAIHSATQMTPFEIVH--RYSPALSPLELPSFSDKTDENSQETIQVFQTVKEHLNTN 1181

Query: 721  QSRQKSYADVRRRNL-EFEVGDHVFLKVAPMRGVWRFGKEGKLSPRFIGPFEILDRVGAV 780
              + K Y D++ + + EF+ GD V +K           K  KL+P F GPF +L + G  
Sbjct: 1182 NIKMKKYFDMKIQEIEEFQPGDLVMVK---RTKTGFLHKSNKLAPSFAGPFYVLQKSGPN 1241

Query: 781  AYRIALPPNLAAV-HNVFHVSMLRKIEY 790
             Y + LP ++  +  + FHVS L K  +
Sbjct: 1242 NYELDLPDSIKHMFSSTFHVSHLEKYRH 1259

BLAST of CmoCh13G003440 vs. Swiss-Prot
Match: TF25_SCHPO (Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-5 PE=3 SV=1)

HSP 1 Score: 460.7 bits (1184), Expect = 3.3e-128
Identity = 265/808 (32.80%), Positives = 440/808 (54.46%), Query Frame = 1

Query: 1    MRMCIDYRELNKVTIKNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVREGDILKTA 60
            +RM +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +RVR+GD  K A
Sbjct: 462  LRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLA 521

Query: 61   FRSRYGHYEFLVMSFGLTNAPAIFMELMNRVFKEFLDTFVIVFIDNILVYSKSEVDHEIH 120
            FR   G +E+LVM +G++ APA F   +N +  E  ++ V+ ++D+IL++SKSE +H  H
Sbjct: 522  FRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKH 581

Query: 121  LRKVLTILRAQQLYAKFSKCEFWLSEVAFLGHVVSSKGITVDPAKIEAVMRWPQPTTVTE 180
            ++ VL  L+   L    +KCEF  S+V F+G+ +S KG T     I+ V++W QP    E
Sbjct: 582  VKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKE 641

Query: 181  VRSFLGLAGYYRRFVQDFSKISSALTQLTKKGKPFAWTPVCEQSFQELKKRLVTAPVLTV 240
            +R FLG   Y R+F+   S+++  L  L KK   + WTP   Q+ + +K+ LV+ PVL  
Sbjct: 642  LRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRH 701

Query: 241  PDGSGNLVVYSDASGKGLGCVLMQKGK-----VIAYASRQLKEYERNYPTHDLELAAVVF 300
             D S  +++ +DAS   +G VL QK        + Y S ++ + + NY   D E+ A++ 
Sbjct: 702  FDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIK 761

Query: 301  ALKTWRHYLYG--EKVQVFTDHKSLKYLFTQKE--LNMRQRRWLELVKDYDIEILYHPGK 360
            +LK WRHYL    E  ++ TDH++L    T +    N R  RW   ++D++ EI Y PG 
Sbjct: 762  SLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGS 821

Query: 361  AN-----VEKLQDEMKRAGIDVVIKGGSVQ-IAQLTIQPTLRKKFIDAQRSDEHLSKVWS 420
            AN     + ++ DE +    D   +  S+  + Q++I    + + +    +D  L  + +
Sbjct: 822  ANHIADALSRIVDETEPIPKD--SEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLN 881

Query: 421  QIETERPAGYSISLDGGLL--WQNRLCVPRDEGILKDIMTEAHDTSYVFHPGSTKMYQDL 480
                ++    +I L  GLL   ++++ +P D  + + I+ + H+   + HPG   +   +
Sbjct: 882  --NEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNII 941

Query: 481  KRFYWWSGMKRDIADFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWVAVCMDFISGLPK 540
             R + W G+++ I ++V  C TCQ  K+   +P G LQP+   +  W ++ MDFI+ LP+
Sbjct: 942  LRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPE 1001

Query: 541  TKQGFNVIWVIVDRLTKTAHFILGKSTYRVDRWAQLYIKEIVRLHGVPVSIVSDRDTRFT 600
            +  G+N ++V+VDR +K A  +    +   ++ A+++ + ++   G P  I++D D  FT
Sbjct: 1002 S-SGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFT 1061

Query: 601  SQFWRSLQKALRTQLRFSTAFHPQTDGQTERLNQVLEDMLRACSLDFAGCWDEHLPLMEF 660
            SQ W+         ++FS  + PQTDGQTER NQ +E +LR         W +H+ L++ 
Sbjct: 1062 SQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQ 1121

Query: 661  AYNNSYQATIQMAPFEALYGRRCRTPVFWEEVGTQQLMGPELVQVTNAAVQKIKQRILTA 720
            +YNN+  +  QM PFE ++  R    +   E+ +      E  Q T    Q +K+ + T 
Sbjct: 1122 SYNNAIHSATQMTPFEIVH--RYSPALSPLELPSFSDKTDENSQETIQVFQTVKEHLNTN 1181

Query: 721  QSRQKSYADVRRRNL-EFEVGDHVFLKVAPMRGVWRFGKEGKLSPRFIGPFEILDRVGAV 780
              + K Y D++ + + EF+ GD V +K           K  KL+P F GPF +L + G  
Sbjct: 1182 NIKMKKYFDMKIQEIEEFQPGDLVMVK---RTKTGFLHKSNKLAPSFAGPFYVLQKSGPN 1241

Query: 781  AYRIALPPNLAAV-HNVFHVSMLRKIEY 790
             Y + LP ++  +  + FHVS L K  +
Sbjct: 1242 NYELDLPDSIKHMFSSTFHVSHLEKYRH 1259

BLAST of CmoCh13G003440 vs. Swiss-Prot
Match: TF24_SCHPO (Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-4 PE=3 SV=1)

HSP 1 Score: 460.7 bits (1184), Expect = 3.3e-128
Identity = 265/808 (32.80%), Positives = 440/808 (54.46%), Query Frame = 1

Query: 1    MRMCIDYRELNKVTIKNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVREGDILKTA 60
            +RM +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +RVR+GD  K A
Sbjct: 462  LRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLA 521

Query: 61   FRSRYGHYEFLVMSFGLTNAPAIFMELMNRVFKEFLDTFVIVFIDNILVYSKSEVDHEIH 120
            FR   G +E+LVM +G++ APA F   +N +  E  ++ V+ ++D+IL++SKSE +H  H
Sbjct: 522  FRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKH 581

Query: 121  LRKVLTILRAQQLYAKFSKCEFWLSEVAFLGHVVSSKGITVDPAKIEAVMRWPQPTTVTE 180
            ++ VL  L+   L    +KCEF  S+V F+G+ +S KG T     I+ V++W QP    E
Sbjct: 582  VKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKE 641

Query: 181  VRSFLGLAGYYRRFVQDFSKISSALTQLTKKGKPFAWTPVCEQSFQELKKRLVTAPVLTV 240
            +R FLG   Y R+F+   S+++  L  L KK   + WTP   Q+ + +K+ LV+ PVL  
Sbjct: 642  LRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRH 701

Query: 241  PDGSGNLVVYSDASGKGLGCVLMQKGK-----VIAYASRQLKEYERNYPTHDLELAAVVF 300
             D S  +++ +DAS   +G VL QK        + Y S ++ + + NY   D E+ A++ 
Sbjct: 702  FDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIK 761

Query: 301  ALKTWRHYLYG--EKVQVFTDHKSLKYLFTQKE--LNMRQRRWLELVKDYDIEILYHPGK 360
            +LK WRHYL    E  ++ TDH++L    T +    N R  RW   ++D++ EI Y PG 
Sbjct: 762  SLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGS 821

Query: 361  AN-----VEKLQDEMKRAGIDVVIKGGSVQ-IAQLTIQPTLRKKFIDAQRSDEHLSKVWS 420
            AN     + ++ DE +    D   +  S+  + Q++I    + + +    +D  L  + +
Sbjct: 822  ANHIADALSRIVDETEPIPKD--SEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLN 881

Query: 421  QIETERPAGYSISLDGGLL--WQNRLCVPRDEGILKDIMTEAHDTSYVFHPGSTKMYQDL 480
                ++    +I L  GLL   ++++ +P D  + + I+ + H+   + HPG   +   +
Sbjct: 882  --NEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNII 941

Query: 481  KRFYWWSGMKRDIADFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWVAVCMDFISGLPK 540
             R + W G+++ I ++V  C TCQ  K+   +P G LQP+   +  W ++ MDFI+ LP+
Sbjct: 942  LRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPE 1001

Query: 541  TKQGFNVIWVIVDRLTKTAHFILGKSTYRVDRWAQLYIKEIVRLHGVPVSIVSDRDTRFT 600
            +  G+N ++V+VDR +K A  +    +   ++ A+++ + ++   G P  I++D D  FT
Sbjct: 1002 S-SGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFT 1061

Query: 601  SQFWRSLQKALRTQLRFSTAFHPQTDGQTERLNQVLEDMLRACSLDFAGCWDEHLPLMEF 660
            SQ W+         ++FS  + PQTDGQTER NQ +E +LR         W +H+ L++ 
Sbjct: 1062 SQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQ 1121

Query: 661  AYNNSYQATIQMAPFEALYGRRCRTPVFWEEVGTQQLMGPELVQVTNAAVQKIKQRILTA 720
            +YNN+  +  QM PFE ++  R    +   E+ +      E  Q T    Q +K+ + T 
Sbjct: 1122 SYNNAIHSATQMTPFEIVH--RYSPALSPLELPSFSDKTDENSQETIQVFQTVKEHLNTN 1181

Query: 721  QSRQKSYADVRRRNL-EFEVGDHVFLKVAPMRGVWRFGKEGKLSPRFIGPFEILDRVGAV 780
              + K Y D++ + + EF+ GD V +K           K  KL+P F GPF +L + G  
Sbjct: 1182 NIKMKKYFDMKIQEIEEFQPGDLVMVK---RTKTGFLHKSNKLAPSFAGPFYVLQKSGPN 1241

Query: 781  AYRIALPPNLAAV-HNVFHVSMLRKIEY 790
             Y + LP ++  +  + FHVS L K  +
Sbjct: 1242 NYELDLPDSIKHMFSSTFHVSHLEKYRH 1259

BLAST of CmoCh13G003440 vs. TrEMBL
Match: Q84KB0_CUCME (Pol protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 1196.4 bits (3094), Expect = 0.0e+00
Identity = 574/806 (71.22%), Positives = 679/806 (84.24%), Query Frame = 1

Query: 1   MRMCIDYRELNKVTIKNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVREGDILKTA 60
           MR+CIDYRELNKVT+KN+YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLR+++ D+ KTA
Sbjct: 43  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTA 102

Query: 61  FRSRYGHYEFLVMSFGLTNAPAIFMELMNRVFKEFLDTFVIVFIDNILVYSKSEVDHEIH 120
           FRSRYGHY+F+VMSFGLTNAPA+FM+LMNRVF+EFLDTFVIVFID+IL+YSK+E +HE H
Sbjct: 103 FRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEH 162

Query: 121 LRKVLTILRAQQLYAKFSKCEFWLSEVAFLGHVVSSKGITVDPAKIEAVMRWPQPTTVTE 180
           LR VL  LR  +LYAKFSKCEFWL +V+FLGHVVS  G++VDPAKIEAV  W +P+TV+E
Sbjct: 163 LRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSE 222

Query: 181 VRSFLGLAGYYRRFVQDFSKISSALTQLTKKGKPFAWTPVCEQSFQELKKRLVTAPVLTV 240
           VRSFLGLAGYYRRFV++FS+I++ LTQLT+KG PF W+  CE SFQ LK++LVTAPVLTV
Sbjct: 223 VRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTV 282

Query: 241 PDGSGNLVVYSDASGKGLGCVLMQKGKVIAYASRQLKEYERNYPTHDLELAAVVFALKTW 300
           PDGSGN V+YSDAS KGLGCVLMQ+GKV+AYASRQLK +E+NYPTHDLELAAVVFALK W
Sbjct: 283 PDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW 342

Query: 301 RHYLYGEKVQVFTDHKSLKYLFTQKELNMRQRRWLELVKDYDIEILYHPGKANV------ 360
           RHYLYGEK+Q+FTDHKSLKY FTQKELNMRQRRWLELVKDYD EILYHPGKANV      
Sbjct: 343 RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALS 402

Query: 361 -------------EKLQDEMKRAGIDVVIKGGSVQIAQLTIQPTLRKKFIDAQRSDEHLS 420
                          L  +++RA I V++   ++Q+AQLT+QPTLR++ IDAQ +D +L 
Sbjct: 403 RKVSHSAALITRQAPLHRDLERAEIAVLVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLV 462

Query: 421 KVWSQIETERPAGYSISLDGGLLWQNRLCVPRDEGILKDIMTEAHDTSYVFHPGSTKMYQ 480
           +     E  + A +S+S DGGLL++ RLCVP D  +  ++++EAH + +  HPGST+   
Sbjct: 463 EKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTEDVS 522

Query: 481 DLKR-FYWWSGMKRDIADFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWVAVCMDFISG 540
             +  F     MKR++A+FVS+CL CQQVKAPRQ+PAGLLQPLS+P+WKW  V MDFI+G
Sbjct: 523 GPEAGFIGGRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITG 582

Query: 541 LPKTKQGFNVIWVIVDRLTKTAHFILGKSTYRVDRWAQLYIKEIVRLHGVPVSIVSDRDT 600
           LP+T +GF VIWV+VDRLTK+AHF+ GKSTY   +WAQLY+ EIVRLHGVPVSIVSDRD 
Sbjct: 583 LPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDA 642

Query: 601 RFTSQFWRSLQKALRTQLRFSTAFHPQTDGQTERLNQVLEDMLRACSLDFAGCWDEHLPL 660
           RFTS+FW+ LQ A+ T+L FSTAFHPQTDGQTERLNQVLEDMLRAC+L+F G WD HL L
Sbjct: 643 RFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHL 702

Query: 661 MEFAYNNSYQATIQMAPFEALYGRRCRTPVFWEEVGTQQLMGPELVQVTNAAVQKIKQRI 720
           MEFAYNNSYQATI MAPFEALYGR CR+PV W EVG Q+LMGPELVQ TN A+QKI+ R+
Sbjct: 703 MEFAYNNSYQATIGMAPFEALYGRCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRM 762

Query: 721 LTAQSRQKSYADVRRRNLEFEVGDHVFLKVAPMRGVWRFGKEGKLSPRFIGPFEILDRVG 780
            TAQSRQKSYADVRR++LEFEVGD VFLKVAPM+GV RF + GKLSPRF+GPFEIL+R+G
Sbjct: 763 HTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIG 822

Query: 781 AVAYRIALPPNLAAVHNVFHVSMLRK 787
            VAYR+ALPP+L+ VH+VFHVSMLRK
Sbjct: 823 PVAYRLALPPSLSTVHDVFHVSMLRK 848

BLAST of CmoCh13G003440 vs. TrEMBL
Match: O64892_ANACO (Polyprotein (Fragment) OS=Ananas comosus PE=4 SV=1)

HSP 1 Score: 1087.4 bits (2811), Expect = 0.0e+00
Identity = 515/805 (63.98%), Positives = 633/805 (78.63%), Query Frame = 1

Query: 1   MRMCIDYRELNKVTIKNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVREGDILKTA 60
           +R+C+DYRELNKVTIKNKYPLPRIDDLFDQLQG+ V+SKIDL+SGYHQL+++  D+ KTA
Sbjct: 67  LRLCVDYRELNKVTIKNKYPLPRIDDLFDQLQGSCVYSKIDLQSGYHQLKIKPEDVSKTA 126

Query: 61  FRSRYGHYEFLVMSFGLTNAPAIFMELMNRVFKEFLDTFVIVFIDNILVYSKSEVDHEIH 120
           FR+RYGHYEF VM FGLTNAP  FM+LMNRVFK +LD FV+VFID+ILVYS+S+ DHE H
Sbjct: 127 FRTRYGHYEFAVMPFGLTNAPTAFMDLMNRVFKPYLDRFVVVFIDDILVYSRSDADHEEH 186

Query: 121 LRKVLTILRAQQLYAKFSKCEFWLSEVAFLGHVVSSKGITVDPAKIEAVMRWPQPTTVTE 180
           LR VL +LR ++LY K  KCEFWL EVAFLGH++S  GI VDP KIEA+  WP+ T+VTE
Sbjct: 187 LRIVLQVLREKELYVKLKKCEFWLREVAFLGHLISGSGIAVDPKKIEAIKDWPRLTSVTE 246

Query: 181 VRSFLGLAGYYRRFVQDFSKISSALTQLTKKGKPFAWTPVCEQSFQELKKRLVTAPVLTV 240
           +RSFLGLAGYYRRFV+ F+K+S+ LT+LT KG  F W   CE+SFQELK+RL TAP+LT+
Sbjct: 247 IRSFLGLAGYYRRFVERFAKLSTPLTRLTHKGVKFIWNDACERSFQELKQRLTTAPILTL 306

Query: 241 PDGSGNLVVYSDASGKGLGCVLMQKGKVIAYASRQLKEYERNYPTHDLELAAVVFALKTW 300
           P      VVYSDAS  GLGCVLMQ  KVIAYASRQLKEYE+NYPTHDLELAAVVFALK W
Sbjct: 307 PVAGAGYVVYSDASLNGLGCVLMQDDKVIAYASRQLKEYEKNYPTHDLELAAVVFALKLW 366

Query: 301 RHYLYGEKVQVFTDHKSLKYLFTQKELNMRQRRWLELVKDYDIEILYHPGKANV------ 360
           RHYLYGE+ +V+TDHKSLKYLFTQKELN+RQRRWLEL+KDYD+ ILYHPGKANV      
Sbjct: 367 RHYLYGERCEVYTDHKSLKYLFTQKELNLRQRRWLELLKDYDLTILYHPGKANVVADALS 426

Query: 361 --------------EKLQDEMKRAGIDVVIKGGSVQIAQLTIQPTLRKKFIDAQRSDEHL 420
                          +L ++MKR  +++V     +++  L +QPTL  +  + Q SD  L
Sbjct: 427 RKSMENLAMHVVTQPRLIEQMKRLELEIVTPDTPMRLMTLVVQPTLLDRIKEKQASDVEL 486

Query: 421 SKVWSQIETERPAGYSISLDGGLLWQNRLCVPRDEGILKDIMTEAHDTSYVFHPGSTKMY 480
            K+  ++       +++  DG + ++ R+CVP D GI +DI+ EAH   Y  HPG TKMY
Sbjct: 487 QKIKGKMVDGCTGDFTLDGDGLMRFRGRICVPADSGIKEDILQEAHRAPYAIHPGGTKMY 546

Query: 481 QDLKRFYWWSGMKRDIADFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWVAVCMDFISG 540
           +DLK  YWW G+K+D+ +FV++CLTCQQVKA  + PAG LQ L +P WKW  + MDF++G
Sbjct: 547 KDLKLLYWWPGIKKDVGEFVAKCLTCQQVKAEHRVPAGKLQSLPIPVWKWEKITMDFVTG 606

Query: 541 LPKTKQGFNVIWVIVDRLTKTAHFILGKSTYRVDRWAQLYIKEIVRLHGVPVSIVSDRDT 600
           LP+++ G + IWVIVDRLTK+AHFI   +T+  +R AQ+Y+ EIVRLHGVP SIVSDRDT
Sbjct: 607 LPRSQAGHDAIWVIVDRLTKSAHFIPIHTTWTGERLAQVYLDEIVRLHGVPTSIVSDRDT 666

Query: 601 RFTSQFWRSLQKALRTQLRFSTAFHPQTDGQTERLNQVLEDMLRACSLDFAGCWDEHLPL 660
           RF S FWRSLQ AL T+L FSTAFHPQ+DGQ+ER  Q LEDMLRAC +DF G W +HLP+
Sbjct: 667 RFVSHFWRSLQDALGTRLDFSTAFHPQSDGQSERTIQTLEDMLRACVIDFQGGWSQHLPM 726

Query: 661 MEFAYNNSYQATIQMAPFEALYGRRCRTPVFWEEVGTQQLMGPELVQVTNAAVQKIKQRI 720
            EFAYNNSYQA+I+MAPFEALYGR+CR+P+ W EVG    +GP+++Q     V+  ++R+
Sbjct: 727 AEFAYNNSYQASIKMAPFEALYGRKCRSPLHWSEVGESLALGPDVLQEAEVKVRIARERL 786

Query: 721 LTAQSRQKSYADVRRRNLEFEVGDHVFLKVAPMRGVWRFGKEGKLSPRFIGPFEILDRVG 780
           LTAQSRQ+SYAD RRR+LEF+VGDHVFLKV+P RG+ RFG  GKLSPRFIGP+EIL+RVG
Sbjct: 787 LTAQSRQRSYADRRRRDLEFQVGDHVFLKVSPTRGIKRFGIRGKLSPRFIGPYEILERVG 846

Query: 781 AVAYRIALPPNLAAVHNVFHVSMLR 786
            VAYR+ALPPNL+ VHNVFHVS++R
Sbjct: 847 PVAYRLALPPNLSGVHNVFHVSVVR 871

BLAST of CmoCh13G003440 vs. TrEMBL
Match: A0A061EEG7_THECC (DNA/RNA polymerases superfamily protein OS=Theobroma cacao GN=TCM_018243 PE=4 SV=1)

HSP 1 Score: 1077.4 bits (2785), Expect = 0.0e+00
Identity = 511/786 (65.01%), Positives = 633/786 (80.53%), Query Frame = 1

Query: 1    MRMCIDYRELNKVTIKNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVREGDILKTA 60
            +R+CIDYR+LNKVT+KNKYPLPRIDDLFDQLQGA  FSKIDLRSGYHQLR+R  DI KTA
Sbjct: 596  LRLCIDYRQLNKVTVKNKYPLPRIDDLFDQLQGAQCFSKIDLRSGYHQLRIRNEDIPKTA 655

Query: 61   FRSRYGHYEFLVMSFGLTNAPAIFMELMNRVFKEFLDTFVIVFIDNILVYSKSEVDHEIH 120
            FR+RYGHYEFLVMSFGLTNAPA FM+LMNRVFK +LD FV+VFID+IL+YSKS  +HE H
Sbjct: 656  FRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFKPYLDKFVVVFIDDILIYSKSREEHEQH 715

Query: 121  LRKVLTILRAQQLYAKFSKCEFWLSEVAFLGHVVSSKGITVDPAKIEAVMRWPQPTTVTE 180
            L+ VL ILR  +LYAKFSKCEFWL  VAFLGHVVS +GI VD  KIEAV +WP+PT+V+E
Sbjct: 716  LKIVLQILREHRLYAKFSKCEFWLESVAFLGHVVSKEGIRVDTKKIEAVEKWPRPTSVSE 775

Query: 181  VRSFLGLAGYYRRFVQDFSKISSALTQLTKKGKPFAWTPVCEQSFQELKKRLVTAPVLTV 240
            +RSF+GLAGYYRRFV+DFSKI + LT+LT+K   F W+  CE SF++LK  L TAPVL++
Sbjct: 776  IRSFVGLAGYYRRFVKDFSKIVAPLTKLTRKDTKFEWSDACENSFEKLKACLTTAPVLSL 835

Query: 241  PDGSGNLVVYSDASGKGLGCVLMQKGKVIAYASRQLKEYERNYPTHDLELAAVVFALKTW 300
            P G+G   ++ DASG GLGCVLMQ GKVIAYASRQLK +E+NYP HDLE+AA+VFALK W
Sbjct: 836  PQGTGGYTMFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIW 895

Query: 301  RHYLYGEKVQVFTDHKSLKYLFTQKELNMRQRRWLELVKDYDIEILYHPGKANVEKLQDE 360
            RHYLYGE  +++TDHKSLKY+F Q++LN+RQ RW+EL+KDYD  ILYHPGKANV  + D 
Sbjct: 896  RHYLYGETCEIYTDHKSLKYIFQQRDLNLRQCRWMELLKDYDCTILYHPGKANV--VADA 955

Query: 361  MKRAGIDVVIKGGSVQIAQLTIQPTLRKKFIDAQRSDEHLSKVWSQIETERPAGYSISLD 420
            + R  +     G    I+   ++P L  K  +AQ  DE + K     +  +   ++   D
Sbjct: 956  LSRKSM-----GSLAHIS--IVRPILMDKIKEAQSKDEFVIKALEDPQGRKGKMFTKGTD 1015

Query: 421  GGLLWQNRLCVPRDEGILKDIMTEAHDTSYVFHPGSTKMYQDLKRFYWWSGMKRDIADFV 480
            G L +  RL VP  +G+ ++I+ EAH  +YV HPG+TKMYQDLK  YWW G+KRD+A+FV
Sbjct: 1016 GVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFV 1075

Query: 481  SRCLTCQQVKAPRQRPAGLLQPLSVPQWKWVAVCMDFISGLPKTKQGFNVIWVIVDRLTK 540
            S+CL CQQVKA  Q+PAGLLQPL VP+WKW  + MDF++GLP+T  G++ IW++VDRLTK
Sbjct: 1076 SKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTK 1135

Query: 541  TAHFILGKSTYRVDRWAQLYIKEIVRLHGVPVSIVSDRDTRFTSQFWRSLQKALRTQLRF 600
            +AHF+  K+TY   ++A++Y+ EIVRLHG+P+SIVSDR  +FTS+FW  LQ+AL T+L F
Sbjct: 1136 SAHFLPVKTTYGAAQYARVYVDEIVRLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDF 1195

Query: 601  STAFHPQTDGQTERLNQVLEDMLRACSLDFAGCWDEHLPLMEFAYNNSYQATIQMAPFEA 660
            STAFHPQTDGQ+ER  Q LE MLRAC +D    W+++LPL+EFAYNNS+Q +IQMAPFEA
Sbjct: 1196 STAFHPQTDGQSERTIQTLEAMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFEA 1255

Query: 661  LYGRRCRTPVFWEEVGTQQLMGPELVQVTNAAVQKIKQRILTAQSRQKSYADVRRRNLEF 720
            LYGRRCR+P+ W EVG ++L+GPELVQ     +  I+QR+LTAQSRQKSYAD RRR+LEF
Sbjct: 1256 LYGRRCRSPIGWLEVGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSYADNRRRDLEF 1315

Query: 721  EVGDHVFLKVAPMRGVWRFGKEGKLSPRFIGPFEILDRVGAVAYRIALPPNLAAVHNVFH 780
            +VGDHVFLKV+P +GV RFGK+GKLSPR+IGPFEIL++VGAVAYR+ALPP+L+ +H VFH
Sbjct: 1316 QVGDHVFLKVSPTKGVMRFGKKGKLSPRYIGPFEILEKVGAVAYRLALPPDLSNIHPVFH 1372

Query: 781  VSMLRK 787
            VSMLRK
Sbjct: 1376 VSMLRK 1372

BLAST of CmoCh13G003440 vs. TrEMBL
Match: Q6L3S2_SOLDE (Putative retrotransposon protein, identical OS=Solanum demissum GN=SDM1_41t00017 PE=4 SV=1)

HSP 1 Score: 1050.4 bits (2715), Expect = 1.1e-303
Identity = 508/811 (62.64%), Positives = 628/811 (77.44%), Query Frame = 1

Query: 1    MRMCIDYRELNKVTIKNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVREGDILKTA 60
            +R+CIDYR+LNKVTIKNKYPLPRIDDLFDQLQGAT FSKIDLRSGYHQLRVRE DI KTA
Sbjct: 714  LRICIDYRQLNKVTIKNKYPLPRIDDLFDQLQGATCFSKIDLRSGYHQLRVRERDIPKTA 773

Query: 61   FRSRYGHYEFLVMSFGLTNAPAIFMELMNRVFKEFLDTFVIVFIDNILVYSKSEVDHEIH 120
            FR+RYGHYEFLVMSFGLTNAPA FM+LMNRVF+ +LD FVI+FID+IL+YS++E DH  H
Sbjct: 774  FRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFRPYLDMFVIIFIDDILIYSRNEEDHASH 833

Query: 121  LRKVLTILRAQQLYAKFSKCEFWLSEVAFLGHVVSSKGITVDPAKIEAVMRWPQPTTVTE 180
            LR VL  L+ ++LYAKFSKCEFWL  VAFLGH+VS  GI VD  KIEAV  WP+PT+ TE
Sbjct: 834  LRTVLQTLKDKELYAKFSKCEFWLKSVAFLGHIVSGDGIKVDTRKIEAVQNWPRPTSPTE 893

Query: 181  VRSFLGLAGYYRRFVQDFSKISSALTQLTKKGKPFAWTPVCEQSFQELKKRLVTAPVLTV 240
            +RSFLGLAGYYRRFV+ FS I+S LT+LT+K   F W+  CE+SFQELKKRL+TAPVLT+
Sbjct: 894  IRSFLGLAGYYRRFVEGFSSIASPLTKLTQKTGKFQWSEACEKSFQELKKRLITAPVLTL 953

Query: 241  PDGSGNLVVYSDASGKGLGCVLMQKGKVIAYASRQLKEYERNYPTHDLELAAVVFALKTW 300
            P+G+  LVVY DAS  GLGCVLMQ GKVIAYASRQLK +E+NYPTHDLELA VVFALK W
Sbjct: 954  PEGTQGLVVYCDASRIGLGCVLMQNGKVIAYASRQLKVHEKNYPTHDLELAVVVFALKLW 1013

Query: 301  RHYLYGEKVQVFTDHKSLKYLFTQKELNMRQRRWLELVKDYDIEILYHPGKANVEKLQDE 360
            RHYLYG  V +FTDHKSL+Y+ TQK LN+RQRRWLEL+KDYD+ ILYHPGKANV  + D 
Sbjct: 1014 RHYLYGVHVDIFTDHKSLQYVLTQKALNLRQRRWLELLKDYDLSILYHPGKANV--VADS 1073

Query: 361  MKR--AGIDVVIKGGSVQIAQ-----------------------LTIQPTLRKKFIDAQR 420
            + R   G    I+ G  ++A+                          + +L  +  + Q 
Sbjct: 1074 LSRLSMGSTTHIEEGRRELAKDMHRLACLGVRFTDSTEGGIAVTSKAESSLMSEVKEKQD 1133

Query: 421  SDEHLSKVWSQIETERPAGYSISLDGGLLWQNRLCVPRDEGILKDIMTEAHDTSYVFHPG 480
             D  L ++ + ++ +R   +    DG L +Q RLCVP  +G+ + +M EAH + Y  HPG
Sbjct: 1134 QDPILLELKANVQKQRVLAFEQGGDGVLRYQGRLCVPMVDGLQERVMEEAHSSRYSVHPG 1193

Query: 481  STKMYQDLKRFYWWSGMKRDIADFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWVAVCM 540
            STKMY+DL+ FYWW+GMK+ IA+FV++C  CQQVK   QRP GL Q + +P+WKW  + M
Sbjct: 1194 STKMYRDLREFYWWNGMKKGIAEFVAKCPNCQQVKVEHQRPGGLAQNIELPEWKWEMINM 1253

Query: 541  DFISGLPKTKQGFNVIWVIVDRLTKTAHFILGKSTYRVDRWAQLYIKEIVRLHGVPVSIV 600
            DFI+GLP++++  + IWVIVDR+TK+AHF+  K+T+  + +A+LYI+EIVRLHGVP+SI+
Sbjct: 1254 DFITGLPRSRRQHDSIWVIVDRMTKSAHFLPVKTTHSAEDYAKLYIQEIVRLHGVPISII 1313

Query: 601  SDRDTRFTSQFWRSLQKALRTQLRFSTAFHPQTDGQTERLNQVLEDMLRACSLDFAGCWD 660
            SDR  +FT+QFW+S QK L +++  STAFHPQTDGQ ER  Q LEDMLRAC +DF   WD
Sbjct: 1314 SDRGAQFTAQFWKSFQKGLGSKVSLSTAFHPQTDGQAERTIQTLEDMLRACVIDFKSNWD 1373

Query: 661  EHLPLMEFAYNNSYQATIQMAPFEALYGRRCRTPVFWEEVGTQQLMGPELVQVTNAAVQK 720
            +HLPL+EFAYNNSY ++IQMAP+EALYGRRCR+P+ W EVG  +L+GP+LV      V+ 
Sbjct: 1374 DHLPLIEFAYNNSYHSSIQMAPYEALYGRRCRSPIGWFEVGEARLIGPDLVHQAMEKVKV 1433

Query: 721  IKQRILTAQSRQKSYADVRRRNLEFEVGDHVFLKVAPMRGVWRFGKEGKLSPRFIGPFEI 780
            I++R+ TAQSRQKSY DVRRR LEFEV D V+LKV+PM+GV RFGK+GKLSPR+IGP+ I
Sbjct: 1434 IQERLKTAQSRQKSYTDVRRRALEFEVDDWVYLKVSPMKGVMRFGKKGKLSPRYIGPYRI 1493

Query: 781  LDRVGAVAYRIALPPNLAAVHNVFHVSMLRK 787
            + RVG+VAY + LP  LAAVH VFH+SML+K
Sbjct: 1494 VQRVGSVAYELELPQELAAVHPVFHISMLKK 1522

BLAST of CmoCh13G003440 vs. TrEMBL
Match: M5WLY8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021229mg PE=4 SV=1)

HSP 1 Score: 1049.7 bits (2713), Expect = 1.9e-303
Identity = 505/806 (62.66%), Positives = 627/806 (77.79%), Query Frame = 1

Query: 1    MRMCIDYRELNKVTIKNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVREGDILKTA 60
            MR+C+DYR+LNK+T++N+YPLPRIDDLFDQL+GA VFSKIDLRSGYHQLRVRE D+ KTA
Sbjct: 315  MRLCVDYRQLNKITVRNRYPLPRIDDLFDQLKGAKVFSKIDLRSGYHQLRVREEDMPKTA 374

Query: 61   FRSRYGHYEFLVMSFGLTNAPAIFMELMNRVFKEFLDTFVIVFIDNILVYSKSEVDHEIH 120
            FR+RYGHYEFLVM FGLTNAPA FM+LMNRVF+ +LD FVIVFID+ILVYSKS+  H  H
Sbjct: 375  FRTRYGHYEFLVMPFGLTNAPAAFMDLMNRVFRRYLDRFVIVFIDDILVYSKSQKAHMKH 434

Query: 121  LRKVLTILRAQQLYAKFSKCEFWLSEVAFLGHVVSSKGITVDPAKIEAVMRWPQPTTVTE 180
            L  VL  LR +QLYAKFSKC+FWL  V+FLGHV+S++GI VDP KIEAV+ W +PT+VTE
Sbjct: 435  LNLVLRTLRRRQLYAKFSKCQFWLDRVSFLGHVISAEGIYVDPQKIEAVVNWLRPTSVTE 494

Query: 181  VRSFLGLAGYYRRFVQDFSKISSALTQLTKKGKPFAWTPVCEQSFQELKKRLVTAPVLTV 240
            +RSFLGLAGYYRRFV+ FS I++ LT LT+KG  F W+  CE+SF ELK RL TAPVL +
Sbjct: 495  IRSFLGLAGYYRRFVEGFSTIAAPLTYLTRKGVKFVWSDKCEESFIELKTRLTTAPVLAL 554

Query: 241  PDGSGNLVVYSDASGKGLGCVLMQKGKVIAYASRQLKEYERNYPTHDLELAAVVFALKTW 300
            PD SGN V+YSDAS +GLGCVLMQ G+VIAYASRQLK++E NYP HDLELAAVVFALK W
Sbjct: 555  PDDSGNFVIYSDASQQGLGCVLMQHGRVIAYASRQLKKHELNYPVHDLELAAVVFALKIW 614

Query: 301  RHYLYGEKVQVFTDHKSLKYLFTQKELNMRQRRWLELVKDYDIEILYHPGKANVE----- 360
            RHYLYGE  Q+FTDHKSLKYLFTQKELN+RQRRWLEL+KDYD  I +HPG+ANV      
Sbjct: 615  RHYLYGETCQIFTDHKSLKYLFTQKELNLRQRRWLELIKDYDCTIEHHPGRANVVADALS 674

Query: 361  ---------------KLQDEMKRAGIDVVIKGGSVQIAQLTIQPTLRKKFIDAQRSDEHL 420
                            L  EM++  I + +      +A L ++P L ++ + AQ  D  +
Sbjct: 675  RKSSGSIAYLRGRYLPLMVEMRKLRIGLDVDNQGALLATLHVRPVLVERILAAQSQDPLI 734

Query: 421  SKVWSQIETERPAGYSISLDGGLLWQNRLCVPRDEGILKDIMTEAHDTSYVFHPGSTKMY 480
              +  ++        S+  DG L+  NRL VP DE + ++I+ EAH++++  HPGSTKMY
Sbjct: 735  CTLRVEVANGDRTDCSVRNDGALMVGNRLYVPNDEALKREILEEAHESAFAMHPGSTKMY 794

Query: 481  QDLKRFYWWSGMKRDIADFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWVAVCMDFISG 540
              L+  YWW  MK+ IA++V RCL CQQVKA RQ+P+GLLQPL +P+WKW  + MDF+  
Sbjct: 795  HTLREHYWWPFMKKQIAEYVRRCLICQQVKAERQKPSGLLQPLPIPEWKWERITMDFVFK 854

Query: 541  LPKTKQGFNVIWVIVDRLTKTAHFILGKSTYRVDRWAQLYIKEIVRLHGVPVSIVSDRDT 600
            LP+T+   + +WVIVDRLTK+AHF+  ++ Y +++ A+++I EIVRLHGVPVSIVSDRD 
Sbjct: 855  LPQTQSKHDGVWVIVDRLTKSAHFLPVRANYSLNKLAKIFIDEIVRLHGVPVSIVSDRDP 914

Query: 601  RFTSQFWRSLQKALRTQLRFSTAFHPQTDGQTERLNQVLEDMLRACSLDFAGCWDEHLPL 660
            RFTS+FW  L +A  TQL+FSTAFHPQTDGQ+ER  Q LE MLRAC+L F G WDE LPL
Sbjct: 915  RFTSRFWTKLNEAFGTQLQFSTAFHPQTDGQSERTIQTLEHMLRACALQFRGDWDEKLPL 974

Query: 661  MEFAYNNSYQATIQMAPFEALYGRRCRTPVFWEEVGTQQLMGPELVQVTNAAVQKIKQRI 720
            MEFAYNNSYQ +I M+PF+ALYGR+CRTP +W+EVG  +L+  E V++T   VQ I++R+
Sbjct: 975  MEFAYNNSYQVSIGMSPFDALYGRQCRTPFYWDEVGEHRLVVSEDVELTKKQVQIIRERL 1034

Query: 721  LTAQSRQKSYADVRRRNLEFEVGDHVFLKVAPMRGVWRFGKEGKLSPRFIGPFEILDRVG 780
             TAQ RQKSYAD RR++L+FEVGD VFLK++P +GV RFGK GKLSPR+IGP+EI++ VG
Sbjct: 1035 KTAQDRQKSYADNRRKDLQFEVGDWVFLKLSPWKGVVRFGKRGKLSPRYIGPYEIIECVG 1094

Query: 781  AVAYRIALPPNLAAVHNVFHVSMLRK 787
             VAYR+ LP +LA +H+VFHVSMLRK
Sbjct: 1095 PVAYRLTLPSDLARLHDVFHVSMLRK 1120

BLAST of CmoCh13G003440 vs. TAIR10
Match: ATMG00860.1 (ATMG00860.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 120.6 bits (301), Expect = 4.6e-27
Identity = 58/125 (46.40%), Positives = 80/125 (64.00%), Query Frame = 1

Query: 120 HLRKVLTILRAQQLYAKFSKCEFWLSEVAFLGH--VVSSKGITVDPAKIEAVMRWPQPTT 179
           HL  VL I    Q YA   KC F   ++A+LGH  ++S +G++ DPAK+EA++ WP+P  
Sbjct: 3   HLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKN 62

Query: 180 VTEVRSFLGLAGYYRRFVQDFSKISSALTQLTKKGKPFAWTPVCEQSFQELKKRLVTAPV 239
            TE+R FLGL GYYRRFV+++ KI   LT+L KK     WT +   +F+ LK  + T PV
Sbjct: 63  TTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNS-LKWTEMAALAFKALKGAVTTLPV 122

Query: 240 LTVPD 243
           L +PD
Sbjct: 123 LALPD 126

BLAST of CmoCh13G003440 vs. NCBI nr
Match: gi|28558781|gb|AAO45752.1| (pol protein [Cucumis melo subsp. melo])

HSP 1 Score: 1196.4 bits (3094), Expect = 0.0e+00
Identity = 574/806 (71.22%), Positives = 679/806 (84.24%), Query Frame = 1

Query: 1   MRMCIDYRELNKVTIKNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVREGDILKTA 60
           MR+CIDYRELNKVT+KN+YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLR+++ D+ KTA
Sbjct: 43  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTA 102

Query: 61  FRSRYGHYEFLVMSFGLTNAPAIFMELMNRVFKEFLDTFVIVFIDNILVYSKSEVDHEIH 120
           FRSRYGHY+F+VMSFGLTNAPA+FM+LMNRVF+EFLDTFVIVFID+IL+YSK+E +HE H
Sbjct: 103 FRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEH 162

Query: 121 LRKVLTILRAQQLYAKFSKCEFWLSEVAFLGHVVSSKGITVDPAKIEAVMRWPQPTTVTE 180
           LR VL  LR  +LYAKFSKCEFWL +V+FLGHVVS  G++VDPAKIEAV  W +P+TV+E
Sbjct: 163 LRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSE 222

Query: 181 VRSFLGLAGYYRRFVQDFSKISSALTQLTKKGKPFAWTPVCEQSFQELKKRLVTAPVLTV 240
           VRSFLGLAGYYRRFV++FS+I++ LTQLT+KG PF W+  CE SFQ LK++LVTAPVLTV
Sbjct: 223 VRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTV 282

Query: 241 PDGSGNLVVYSDASGKGLGCVLMQKGKVIAYASRQLKEYERNYPTHDLELAAVVFALKTW 300
           PDGSGN V+YSDAS KGLGCVLMQ+GKV+AYASRQLK +E+NYPTHDLELAAVVFALK W
Sbjct: 283 PDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW 342

Query: 301 RHYLYGEKVQVFTDHKSLKYLFTQKELNMRQRRWLELVKDYDIEILYHPGKANV------ 360
           RHYLYGEK+Q+FTDHKSLKY FTQKELNMRQRRWLELVKDYD EILYHPGKANV      
Sbjct: 343 RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALS 402

Query: 361 -------------EKLQDEMKRAGIDVVIKGGSVQIAQLTIQPTLRKKFIDAQRSDEHLS 420
                          L  +++RA I V++   ++Q+AQLT+QPTLR++ IDAQ +D +L 
Sbjct: 403 RKVSHSAALITRQAPLHRDLERAEIAVLVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLV 462

Query: 421 KVWSQIETERPAGYSISLDGGLLWQNRLCVPRDEGILKDIMTEAHDTSYVFHPGSTKMYQ 480
           +     E  + A +S+S DGGLL++ RLCVP D  +  ++++EAH + +  HPGST+   
Sbjct: 463 EKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTEDVS 522

Query: 481 DLKR-FYWWSGMKRDIADFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWVAVCMDFISG 540
             +  F     MKR++A+FVS+CL CQQVKAPRQ+PAGLLQPLS+P+WKW  V MDFI+G
Sbjct: 523 GPEAGFIGGRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITG 582

Query: 541 LPKTKQGFNVIWVIVDRLTKTAHFILGKSTYRVDRWAQLYIKEIVRLHGVPVSIVSDRDT 600
           LP+T +GF VIWV+VDRLTK+AHF+ GKSTY   +WAQLY+ EIVRLHGVPVSIVSDRD 
Sbjct: 583 LPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDA 642

Query: 601 RFTSQFWRSLQKALRTQLRFSTAFHPQTDGQTERLNQVLEDMLRACSLDFAGCWDEHLPL 660
           RFTS+FW+ LQ A+ T+L FSTAFHPQTDGQTERLNQVLEDMLRAC+L+F G WD HL L
Sbjct: 643 RFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHL 702

Query: 661 MEFAYNNSYQATIQMAPFEALYGRRCRTPVFWEEVGTQQLMGPELVQVTNAAVQKIKQRI 720
           MEFAYNNSYQATI MAPFEALYGR CR+PV W EVG Q+LMGPELVQ TN A+QKI+ R+
Sbjct: 703 MEFAYNNSYQATIGMAPFEALYGRCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRM 762

Query: 721 LTAQSRQKSYADVRRRNLEFEVGDHVFLKVAPMRGVWRFGKEGKLSPRFIGPFEILDRVG 780
            TAQSRQKSYADVRR++LEFEVGD VFLKVAPM+GV RF + GKLSPRF+GPFEIL+R+G
Sbjct: 763 HTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIG 822

Query: 781 AVAYRIALPPNLAAVHNVFHVSMLRK 787
            VAYR+ALPP+L+ VH+VFHVSMLRK
Sbjct: 823 PVAYRLALPPSLSTVHDVFHVSMLRK 848

BLAST of CmoCh13G003440 vs. NCBI nr
Match: gi|2995405|emb|CAA73042.1| (polyprotein [Ananas comosus])

HSP 1 Score: 1087.4 bits (2811), Expect = 0.0e+00
Identity = 515/805 (63.98%), Positives = 633/805 (78.63%), Query Frame = 1

Query: 1   MRMCIDYRELNKVTIKNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVREGDILKTA 60
           +R+C+DYRELNKVTIKNKYPLPRIDDLFDQLQG+ V+SKIDL+SGYHQL+++  D+ KTA
Sbjct: 67  LRLCVDYRELNKVTIKNKYPLPRIDDLFDQLQGSCVYSKIDLQSGYHQLKIKPEDVSKTA 126

Query: 61  FRSRYGHYEFLVMSFGLTNAPAIFMELMNRVFKEFLDTFVIVFIDNILVYSKSEVDHEIH 120
           FR+RYGHYEF VM FGLTNAP  FM+LMNRVFK +LD FV+VFID+ILVYS+S+ DHE H
Sbjct: 127 FRTRYGHYEFAVMPFGLTNAPTAFMDLMNRVFKPYLDRFVVVFIDDILVYSRSDADHEEH 186

Query: 121 LRKVLTILRAQQLYAKFSKCEFWLSEVAFLGHVVSSKGITVDPAKIEAVMRWPQPTTVTE 180
           LR VL +LR ++LY K  KCEFWL EVAFLGH++S  GI VDP KIEA+  WP+ T+VTE
Sbjct: 187 LRIVLQVLREKELYVKLKKCEFWLREVAFLGHLISGSGIAVDPKKIEAIKDWPRLTSVTE 246

Query: 181 VRSFLGLAGYYRRFVQDFSKISSALTQLTKKGKPFAWTPVCEQSFQELKKRLVTAPVLTV 240
           +RSFLGLAGYYRRFV+ F+K+S+ LT+LT KG  F W   CE+SFQELK+RL TAP+LT+
Sbjct: 247 IRSFLGLAGYYRRFVERFAKLSTPLTRLTHKGVKFIWNDACERSFQELKQRLTTAPILTL 306

Query: 241 PDGSGNLVVYSDASGKGLGCVLMQKGKVIAYASRQLKEYERNYPTHDLELAAVVFALKTW 300
           P      VVYSDAS  GLGCVLMQ  KVIAYASRQLKEYE+NYPTHDLELAAVVFALK W
Sbjct: 307 PVAGAGYVVYSDASLNGLGCVLMQDDKVIAYASRQLKEYEKNYPTHDLELAAVVFALKLW 366

Query: 301 RHYLYGEKVQVFTDHKSLKYLFTQKELNMRQRRWLELVKDYDIEILYHPGKANV------ 360
           RHYLYGE+ +V+TDHKSLKYLFTQKELN+RQRRWLEL+KDYD+ ILYHPGKANV      
Sbjct: 367 RHYLYGERCEVYTDHKSLKYLFTQKELNLRQRRWLELLKDYDLTILYHPGKANVVADALS 426

Query: 361 --------------EKLQDEMKRAGIDVVIKGGSVQIAQLTIQPTLRKKFIDAQRSDEHL 420
                          +L ++MKR  +++V     +++  L +QPTL  +  + Q SD  L
Sbjct: 427 RKSMENLAMHVVTQPRLIEQMKRLELEIVTPDTPMRLMTLVVQPTLLDRIKEKQASDVEL 486

Query: 421 SKVWSQIETERPAGYSISLDGGLLWQNRLCVPRDEGILKDIMTEAHDTSYVFHPGSTKMY 480
            K+  ++       +++  DG + ++ R+CVP D GI +DI+ EAH   Y  HPG TKMY
Sbjct: 487 QKIKGKMVDGCTGDFTLDGDGLMRFRGRICVPADSGIKEDILQEAHRAPYAIHPGGTKMY 546

Query: 481 QDLKRFYWWSGMKRDIADFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWVAVCMDFISG 540
           +DLK  YWW G+K+D+ +FV++CLTCQQVKA  + PAG LQ L +P WKW  + MDF++G
Sbjct: 547 KDLKLLYWWPGIKKDVGEFVAKCLTCQQVKAEHRVPAGKLQSLPIPVWKWEKITMDFVTG 606

Query: 541 LPKTKQGFNVIWVIVDRLTKTAHFILGKSTYRVDRWAQLYIKEIVRLHGVPVSIVSDRDT 600
           LP+++ G + IWVIVDRLTK+AHFI   +T+  +R AQ+Y+ EIVRLHGVP SIVSDRDT
Sbjct: 607 LPRSQAGHDAIWVIVDRLTKSAHFIPIHTTWTGERLAQVYLDEIVRLHGVPTSIVSDRDT 666

Query: 601 RFTSQFWRSLQKALRTQLRFSTAFHPQTDGQTERLNQVLEDMLRACSLDFAGCWDEHLPL 660
           RF S FWRSLQ AL T+L FSTAFHPQ+DGQ+ER  Q LEDMLRAC +DF G W +HLP+
Sbjct: 667 RFVSHFWRSLQDALGTRLDFSTAFHPQSDGQSERTIQTLEDMLRACVIDFQGGWSQHLPM 726

Query: 661 MEFAYNNSYQATIQMAPFEALYGRRCRTPVFWEEVGTQQLMGPELVQVTNAAVQKIKQRI 720
            EFAYNNSYQA+I+MAPFEALYGR+CR+P+ W EVG    +GP+++Q     V+  ++R+
Sbjct: 727 AEFAYNNSYQASIKMAPFEALYGRKCRSPLHWSEVGESLALGPDVLQEAEVKVRIARERL 786

Query: 721 LTAQSRQKSYADVRRRNLEFEVGDHVFLKVAPMRGVWRFGKEGKLSPRFIGPFEILDRVG 780
           LTAQSRQ+SYAD RRR+LEF+VGDHVFLKV+P RG+ RFG  GKLSPRFIGP+EIL+RVG
Sbjct: 787 LTAQSRQRSYADRRRRDLEFQVGDHVFLKVSPTRGIKRFGIRGKLSPRFIGPYEILERVG 846

Query: 781 AVAYRIALPPNLAAVHNVFHVSMLR 786
            VAYR+ALPPNL+ VHNVFHVS++R
Sbjct: 847 PVAYRLALPPNLSGVHNVFHVSVVR 871

BLAST of CmoCh13G003440 vs. NCBI nr
Match: gi|590649404|ref|XP_007032400.1| (DNA/RNA polymerases superfamily protein [Theobroma cacao])

HSP 1 Score: 1077.4 bits (2785), Expect = 0.0e+00
Identity = 511/786 (65.01%), Positives = 633/786 (80.53%), Query Frame = 1

Query: 1    MRMCIDYRELNKVTIKNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVREGDILKTA 60
            +R+CIDYR+LNKVT+KNKYPLPRIDDLFDQLQGA  FSKIDLRSGYHQLR+R  DI KTA
Sbjct: 596  LRLCIDYRQLNKVTVKNKYPLPRIDDLFDQLQGAQCFSKIDLRSGYHQLRIRNEDIPKTA 655

Query: 61   FRSRYGHYEFLVMSFGLTNAPAIFMELMNRVFKEFLDTFVIVFIDNILVYSKSEVDHEIH 120
            FR+RYGHYEFLVMSFGLTNAPA FM+LMNRVFK +LD FV+VFID+IL+YSKS  +HE H
Sbjct: 656  FRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFKPYLDKFVVVFIDDILIYSKSREEHEQH 715

Query: 121  LRKVLTILRAQQLYAKFSKCEFWLSEVAFLGHVVSSKGITVDPAKIEAVMRWPQPTTVTE 180
            L+ VL ILR  +LYAKFSKCEFWL  VAFLGHVVS +GI VD  KIEAV +WP+PT+V+E
Sbjct: 716  LKIVLQILREHRLYAKFSKCEFWLESVAFLGHVVSKEGIRVDTKKIEAVEKWPRPTSVSE 775

Query: 181  VRSFLGLAGYYRRFVQDFSKISSALTQLTKKGKPFAWTPVCEQSFQELKKRLVTAPVLTV 240
            +RSF+GLAGYYRRFV+DFSKI + LT+LT+K   F W+  CE SF++LK  L TAPVL++
Sbjct: 776  IRSFVGLAGYYRRFVKDFSKIVAPLTKLTRKDTKFEWSDACENSFEKLKACLTTAPVLSL 835

Query: 241  PDGSGNLVVYSDASGKGLGCVLMQKGKVIAYASRQLKEYERNYPTHDLELAAVVFALKTW 300
            P G+G   ++ DASG GLGCVLMQ GKVIAYASRQLK +E+NYP HDLE+AA+VFALK W
Sbjct: 836  PQGTGGYTMFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIW 895

Query: 301  RHYLYGEKVQVFTDHKSLKYLFTQKELNMRQRRWLELVKDYDIEILYHPGKANVEKLQDE 360
            RHYLYGE  +++TDHKSLKY+F Q++LN+RQ RW+EL+KDYD  ILYHPGKANV  + D 
Sbjct: 896  RHYLYGETCEIYTDHKSLKYIFQQRDLNLRQCRWMELLKDYDCTILYHPGKANV--VADA 955

Query: 361  MKRAGIDVVIKGGSVQIAQLTIQPTLRKKFIDAQRSDEHLSKVWSQIETERPAGYSISLD 420
            + R  +     G    I+   ++P L  K  +AQ  DE + K     +  +   ++   D
Sbjct: 956  LSRKSM-----GSLAHIS--IVRPILMDKIKEAQSKDEFVIKALEDPQGRKGKMFTKGTD 1015

Query: 421  GGLLWQNRLCVPRDEGILKDIMTEAHDTSYVFHPGSTKMYQDLKRFYWWSGMKRDIADFV 480
            G L +  RL VP  +G+ ++I+ EAH  +YV HPG+TKMYQDLK  YWW G+KRD+A+FV
Sbjct: 1016 GVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFV 1075

Query: 481  SRCLTCQQVKAPRQRPAGLLQPLSVPQWKWVAVCMDFISGLPKTKQGFNVIWVIVDRLTK 540
            S+CL CQQVKA  Q+PAGLLQPL VP+WKW  + MDF++GLP+T  G++ IW++VDRLTK
Sbjct: 1076 SKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTK 1135

Query: 541  TAHFILGKSTYRVDRWAQLYIKEIVRLHGVPVSIVSDRDTRFTSQFWRSLQKALRTQLRF 600
            +AHF+  K+TY   ++A++Y+ EIVRLHG+P+SIVSDR  +FTS+FW  LQ+AL T+L F
Sbjct: 1136 SAHFLPVKTTYGAAQYARVYVDEIVRLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDF 1195

Query: 601  STAFHPQTDGQTERLNQVLEDMLRACSLDFAGCWDEHLPLMEFAYNNSYQATIQMAPFEA 660
            STAFHPQTDGQ+ER  Q LE MLRAC +D    W+++LPL+EFAYNNS+Q +IQMAPFEA
Sbjct: 1196 STAFHPQTDGQSERTIQTLEAMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFEA 1255

Query: 661  LYGRRCRTPVFWEEVGTQQLMGPELVQVTNAAVQKIKQRILTAQSRQKSYADVRRRNLEF 720
            LYGRRCR+P+ W EVG ++L+GPELVQ     +  I+QR+LTAQSRQKSYAD RRR+LEF
Sbjct: 1256 LYGRRCRSPIGWLEVGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSYADNRRRDLEF 1315

Query: 721  EVGDHVFLKVAPMRGVWRFGKEGKLSPRFIGPFEILDRVGAVAYRIALPPNLAAVHNVFH 780
            +VGDHVFLKV+P +GV RFGK+GKLSPR+IGPFEIL++VGAVAYR+ALPP+L+ +H VFH
Sbjct: 1316 QVGDHVFLKVSPTKGVMRFGKKGKLSPRYIGPFEILEKVGAVAYRLALPPDLSNIHPVFH 1372

Query: 781  VSMLRK 787
            VSMLRK
Sbjct: 1376 VSMLRK 1372

BLAST of CmoCh13G003440 vs. NCBI nr
Match: gi|47824950|gb|AAT38724.1| (Putative retrotransposon protein, identical [Solanum demissum])

HSP 1 Score: 1050.4 bits (2715), Expect = 1.6e-303
Identity = 508/811 (62.64%), Positives = 628/811 (77.44%), Query Frame = 1

Query: 1    MRMCIDYRELNKVTIKNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVREGDILKTA 60
            +R+CIDYR+LNKVTIKNKYPLPRIDDLFDQLQGAT FSKIDLRSGYHQLRVRE DI KTA
Sbjct: 714  LRICIDYRQLNKVTIKNKYPLPRIDDLFDQLQGATCFSKIDLRSGYHQLRVRERDIPKTA 773

Query: 61   FRSRYGHYEFLVMSFGLTNAPAIFMELMNRVFKEFLDTFVIVFIDNILVYSKSEVDHEIH 120
            FR+RYGHYEFLVMSFGLTNAPA FM+LMNRVF+ +LD FVI+FID+IL+YS++E DH  H
Sbjct: 774  FRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFRPYLDMFVIIFIDDILIYSRNEEDHASH 833

Query: 121  LRKVLTILRAQQLYAKFSKCEFWLSEVAFLGHVVSSKGITVDPAKIEAVMRWPQPTTVTE 180
            LR VL  L+ ++LYAKFSKCEFWL  VAFLGH+VS  GI VD  KIEAV  WP+PT+ TE
Sbjct: 834  LRTVLQTLKDKELYAKFSKCEFWLKSVAFLGHIVSGDGIKVDTRKIEAVQNWPRPTSPTE 893

Query: 181  VRSFLGLAGYYRRFVQDFSKISSALTQLTKKGKPFAWTPVCEQSFQELKKRLVTAPVLTV 240
            +RSFLGLAGYYRRFV+ FS I+S LT+LT+K   F W+  CE+SFQELKKRL+TAPVLT+
Sbjct: 894  IRSFLGLAGYYRRFVEGFSSIASPLTKLTQKTGKFQWSEACEKSFQELKKRLITAPVLTL 953

Query: 241  PDGSGNLVVYSDASGKGLGCVLMQKGKVIAYASRQLKEYERNYPTHDLELAAVVFALKTW 300
            P+G+  LVVY DAS  GLGCVLMQ GKVIAYASRQLK +E+NYPTHDLELA VVFALK W
Sbjct: 954  PEGTQGLVVYCDASRIGLGCVLMQNGKVIAYASRQLKVHEKNYPTHDLELAVVVFALKLW 1013

Query: 301  RHYLYGEKVQVFTDHKSLKYLFTQKELNMRQRRWLELVKDYDIEILYHPGKANVEKLQDE 360
            RHYLYG  V +FTDHKSL+Y+ TQK LN+RQRRWLEL+KDYD+ ILYHPGKANV  + D 
Sbjct: 1014 RHYLYGVHVDIFTDHKSLQYVLTQKALNLRQRRWLELLKDYDLSILYHPGKANV--VADS 1073

Query: 361  MKR--AGIDVVIKGGSVQIAQ-----------------------LTIQPTLRKKFIDAQR 420
            + R   G    I+ G  ++A+                          + +L  +  + Q 
Sbjct: 1074 LSRLSMGSTTHIEEGRRELAKDMHRLACLGVRFTDSTEGGIAVTSKAESSLMSEVKEKQD 1133

Query: 421  SDEHLSKVWSQIETERPAGYSISLDGGLLWQNRLCVPRDEGILKDIMTEAHDTSYVFHPG 480
             D  L ++ + ++ +R   +    DG L +Q RLCVP  +G+ + +M EAH + Y  HPG
Sbjct: 1134 QDPILLELKANVQKQRVLAFEQGGDGVLRYQGRLCVPMVDGLQERVMEEAHSSRYSVHPG 1193

Query: 481  STKMYQDLKRFYWWSGMKRDIADFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWVAVCM 540
            STKMY+DL+ FYWW+GMK+ IA+FV++C  CQQVK   QRP GL Q + +P+WKW  + M
Sbjct: 1194 STKMYRDLREFYWWNGMKKGIAEFVAKCPNCQQVKVEHQRPGGLAQNIELPEWKWEMINM 1253

Query: 541  DFISGLPKTKQGFNVIWVIVDRLTKTAHFILGKSTYRVDRWAQLYIKEIVRLHGVPVSIV 600
            DFI+GLP++++  + IWVIVDR+TK+AHF+  K+T+  + +A+LYI+EIVRLHGVP+SI+
Sbjct: 1254 DFITGLPRSRRQHDSIWVIVDRMTKSAHFLPVKTTHSAEDYAKLYIQEIVRLHGVPISII 1313

Query: 601  SDRDTRFTSQFWRSLQKALRTQLRFSTAFHPQTDGQTERLNQVLEDMLRACSLDFAGCWD 660
            SDR  +FT+QFW+S QK L +++  STAFHPQTDGQ ER  Q LEDMLRAC +DF   WD
Sbjct: 1314 SDRGAQFTAQFWKSFQKGLGSKVSLSTAFHPQTDGQAERTIQTLEDMLRACVIDFKSNWD 1373

Query: 661  EHLPLMEFAYNNSYQATIQMAPFEALYGRRCRTPVFWEEVGTQQLMGPELVQVTNAAVQK 720
            +HLPL+EFAYNNSY ++IQMAP+EALYGRRCR+P+ W EVG  +L+GP+LV      V+ 
Sbjct: 1374 DHLPLIEFAYNNSYHSSIQMAPYEALYGRRCRSPIGWFEVGEARLIGPDLVHQAMEKVKV 1433

Query: 721  IKQRILTAQSRQKSYADVRRRNLEFEVGDHVFLKVAPMRGVWRFGKEGKLSPRFIGPFEI 780
            I++R+ TAQSRQKSY DVRRR LEFEV D V+LKV+PM+GV RFGK+GKLSPR+IGP+ I
Sbjct: 1434 IQERLKTAQSRQKSYTDVRRRALEFEVDDWVYLKVSPMKGVMRFGKKGKLSPRYIGPYRI 1493

Query: 781  LDRVGAVAYRIALPPNLAAVHNVFHVSMLRK 787
            + RVG+VAY + LP  LAAVH VFH+SML+K
Sbjct: 1494 VQRVGSVAYELELPQELAAVHPVFHISMLKK 1522

BLAST of CmoCh13G003440 vs. NCBI nr
Match: gi|595885005|ref|XP_007213082.1| (hypothetical protein PRUPE_ppa021229mg [Prunus persica])

HSP 1 Score: 1049.7 bits (2713), Expect = 2.7e-303
Identity = 505/806 (62.66%), Positives = 627/806 (77.79%), Query Frame = 1

Query: 1    MRMCIDYRELNKVTIKNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVREGDILKTA 60
            MR+C+DYR+LNK+T++N+YPLPRIDDLFDQL+GA VFSKIDLRSGYHQLRVRE D+ KTA
Sbjct: 315  MRLCVDYRQLNKITVRNRYPLPRIDDLFDQLKGAKVFSKIDLRSGYHQLRVREEDMPKTA 374

Query: 61   FRSRYGHYEFLVMSFGLTNAPAIFMELMNRVFKEFLDTFVIVFIDNILVYSKSEVDHEIH 120
            FR+RYGHYEFLVM FGLTNAPA FM+LMNRVF+ +LD FVIVFID+ILVYSKS+  H  H
Sbjct: 375  FRTRYGHYEFLVMPFGLTNAPAAFMDLMNRVFRRYLDRFVIVFIDDILVYSKSQKAHMKH 434

Query: 121  LRKVLTILRAQQLYAKFSKCEFWLSEVAFLGHVVSSKGITVDPAKIEAVMRWPQPTTVTE 180
            L  VL  LR +QLYAKFSKC+FWL  V+FLGHV+S++GI VDP KIEAV+ W +PT+VTE
Sbjct: 435  LNLVLRTLRRRQLYAKFSKCQFWLDRVSFLGHVISAEGIYVDPQKIEAVVNWLRPTSVTE 494

Query: 181  VRSFLGLAGYYRRFVQDFSKISSALTQLTKKGKPFAWTPVCEQSFQELKKRLVTAPVLTV 240
            +RSFLGLAGYYRRFV+ FS I++ LT LT+KG  F W+  CE+SF ELK RL TAPVL +
Sbjct: 495  IRSFLGLAGYYRRFVEGFSTIAAPLTYLTRKGVKFVWSDKCEESFIELKTRLTTAPVLAL 554

Query: 241  PDGSGNLVVYSDASGKGLGCVLMQKGKVIAYASRQLKEYERNYPTHDLELAAVVFALKTW 300
            PD SGN V+YSDAS +GLGCVLMQ G+VIAYASRQLK++E NYP HDLELAAVVFALK W
Sbjct: 555  PDDSGNFVIYSDASQQGLGCVLMQHGRVIAYASRQLKKHELNYPVHDLELAAVVFALKIW 614

Query: 301  RHYLYGEKVQVFTDHKSLKYLFTQKELNMRQRRWLELVKDYDIEILYHPGKANVE----- 360
            RHYLYGE  Q+FTDHKSLKYLFTQKELN+RQRRWLEL+KDYD  I +HPG+ANV      
Sbjct: 615  RHYLYGETCQIFTDHKSLKYLFTQKELNLRQRRWLELIKDYDCTIEHHPGRANVVADALS 674

Query: 361  ---------------KLQDEMKRAGIDVVIKGGSVQIAQLTIQPTLRKKFIDAQRSDEHL 420
                            L  EM++  I + +      +A L ++P L ++ + AQ  D  +
Sbjct: 675  RKSSGSIAYLRGRYLPLMVEMRKLRIGLDVDNQGALLATLHVRPVLVERILAAQSQDPLI 734

Query: 421  SKVWSQIETERPAGYSISLDGGLLWQNRLCVPRDEGILKDIMTEAHDTSYVFHPGSTKMY 480
              +  ++        S+  DG L+  NRL VP DE + ++I+ EAH++++  HPGSTKMY
Sbjct: 735  CTLRVEVANGDRTDCSVRNDGALMVGNRLYVPNDEALKREILEEAHESAFAMHPGSTKMY 794

Query: 481  QDLKRFYWWSGMKRDIADFVSRCLTCQQVKAPRQRPAGLLQPLSVPQWKWVAVCMDFISG 540
              L+  YWW  MK+ IA++V RCL CQQVKA RQ+P+GLLQPL +P+WKW  + MDF+  
Sbjct: 795  HTLREHYWWPFMKKQIAEYVRRCLICQQVKAERQKPSGLLQPLPIPEWKWERITMDFVFK 854

Query: 541  LPKTKQGFNVIWVIVDRLTKTAHFILGKSTYRVDRWAQLYIKEIVRLHGVPVSIVSDRDT 600
            LP+T+   + +WVIVDRLTK+AHF+  ++ Y +++ A+++I EIVRLHGVPVSIVSDRD 
Sbjct: 855  LPQTQSKHDGVWVIVDRLTKSAHFLPVRANYSLNKLAKIFIDEIVRLHGVPVSIVSDRDP 914

Query: 601  RFTSQFWRSLQKALRTQLRFSTAFHPQTDGQTERLNQVLEDMLRACSLDFAGCWDEHLPL 660
            RFTS+FW  L +A  TQL+FSTAFHPQTDGQ+ER  Q LE MLRAC+L F G WDE LPL
Sbjct: 915  RFTSRFWTKLNEAFGTQLQFSTAFHPQTDGQSERTIQTLEHMLRACALQFRGDWDEKLPL 974

Query: 661  MEFAYNNSYQATIQMAPFEALYGRRCRTPVFWEEVGTQQLMGPELVQVTNAAVQKIKQRI 720
            MEFAYNNSYQ +I M+PF+ALYGR+CRTP +W+EVG  +L+  E V++T   VQ I++R+
Sbjct: 975  MEFAYNNSYQVSIGMSPFDALYGRQCRTPFYWDEVGEHRLVVSEDVELTKKQVQIIRERL 1034

Query: 721  LTAQSRQKSYADVRRRNLEFEVGDHVFLKVAPMRGVWRFGKEGKLSPRFIGPFEILDRVG 780
             TAQ RQKSYAD RR++L+FEVGD VFLK++P +GV RFGK GKLSPR+IGP+EI++ VG
Sbjct: 1035 KTAQDRQKSYADNRRKDLQFEVGDWVFLKLSPWKGVVRFGKRGKLSPRYIGPYEIIECVG 1094

Query: 781  AVAYRIALPPNLAAVHNVFHVSMLRK 787
             VAYR+ LP +LA +H+VFHVSMLRK
Sbjct: 1095 PVAYRLTLPSDLARLHDVFHVSMLRK 1120
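
The percentages in each HSP summary line follow directly from the counts printed next to them (matches divided by alignment length). As a minimal sketch, assuming the summary line keeps the layout shown in the alignments above, the Python below parses one such line and recomputes both percentages. The names `parse_hsp_summary` and `percent` are hypothetical helpers introduced only for this illustration; they are not part of BLAST or of this database.

```python
import re

# Layout assumed from the HSP summary lines on this page.
# "Posi?tives" tolerates a missing "i" in the label.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \((\d+\.\d+)%\), "
    r"Posi?tives = (\d+)/(\d+) \((\d+\.\d+)%\)"
)


def percent(numerator, denominator):
    """Return numerator/denominator as a percentage rounded to two decimals."""
    return round(100.0 * numerator / denominator, 2)


def parse_hsp_summary(line):
    """Extract identity and positive counts from one HSP summary line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: " + line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


# Summary line of the Theobroma cacao hit, copied from the alignments above.
hsp = parse_hsp_summary(
    "Identity = 511/786 (65.01%), Positives = 633/786 (80.53%), Query Frame = 1"
)
identity_pct = percent(hsp["identities"], hsp["alignment_length"])
positive_pct = percent(hsp["positives"], hsp["alignment_length"])
print(identity_pct, positive_pct)
```

The recomputed values can be compared with the reported ones to catch copy or truncation errors when these records are processed in bulk.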

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
TF211_SCHPO  1.5e-128  32.92     Transposon Tf2-11 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24...
TF27_SCHPO   1.5e-128  32.92     Transposon Tf2-7 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF28_SCHPO   1.5e-128  32.92     Transposon Tf2-8 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF25_SCHPO   3.3e-128  32.80     Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF24_SCHPO   3.3e-128  32.80     Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...

Match Name        E-value   Identity  Description
Q84KB0_CUCME      0.0e+00   71.22     Pol protein OS=Cucumis melo subsp. melo PE=4 SV=1
O64892_ANACO      0.0e+00   63.98     Polyprotein (Fragment) OS=Ananas comosus PE=4 SV=1
A0A061EEG7_THECC  0.0e+00   65.01     DNA/RNA polymerases superfamily protein OS=Theobroma cacao GN=TCM_018243 PE=4 SV...
Q6L3S2_SOLDE      1.1e-303  62.64     Putative retrotransposon protein, identical OS=Solanum demissum GN=SDM1_41t00017...
M5WLY8_PRUPE      1.9e-303  62.66     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021229mg PE=4 SV=1

Match Name   E-value  Identity  Description
ATMG00860.1  4.6e-27  46.40     ATMG00860.1 DNA/RNA polymerases superfamily protein

Match Name                        E-value   Identity  Description
gi|28558781|gb|AAO45752.1|        0.0e+00   71.22     pol protein [Cucumis melo subsp. melo]
gi|2995405|emb|CAA73042.1|        0.0e+00   63.98     polyprotein [Ananas comosus]
gi|590649404|ref|XP_007032400.1|  0.0e+00   65.01     DNA/RNA polymerases superfamily protein [Theobroma cacao]
gi|47824950|gb|AAT38724.1|        1.6e-303  62.64     Putative retrotransposon protein, identical [Solanum demissum]
gi|595885005|ref|XP_007213082.1|  2.7e-303  62.66     hypothetical protein PRUPE_ppa021229mg [Prunus persica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR000477  RT_dom
IPR001584  Integrase_cat-core
IPR012337  RNaseH-like_sf
Vocabulary: Biological Process
Term        Definition
GO:0015074  DNA integration
Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003824      catalytic activity
molecular_function  GO:0046872      metal ion binding
molecular_function  GO:0003676      nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh13G003440.1  CmoCh13G003440.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description               Source   Source Term        Source Description   Alignment
IPR000477  Reverse transcriptase domain  PFAM     PF00078            RVT_1                coord: 2..153; score: 2.7
IPR000477  Reverse transcriptase domain  PROFILE  PS50878            RT_POL               coord: 1..154; score: 10
IPR001584  Integrase, catalytic core     PFAM     PF00665            rve                  coord: 513..622; score: 9.5
IPR001584  Integrase, catalytic core     PROFILE  PS50994            INTEGRASE            coord: 502..665; score: 16
IPR012337  Ribonuclease H-like domain    GENE3D   G3DSA:3.30.420.10  -                    coord: 245..343; score: 2.9E-5
                                                                                          coord: 513..669; score: 1.3
IPR012337  Ribonuclease H-like domain    unknown  SSF53098           Ribonuclease H-like  coord: 503..659; score: 5.77
None       No IPR available              GENE3D   G3DSA:3.10.10.10   -                    coord: 1..75; score: 3.5
None       No IPR available              GENE3D   G3DSA:3.30.70.270  -                    coord: 76..154; score: 2.
None       No IPR available              PANTHER  PTHR24559          FAMILY NOT NAMED     coord: 1..787; score:
None       No IPR available              PANTHER  PTHR24559:SF207    SUBFAMILY NOT NAMED  coord: 1..787; score:
None       No IPR available              unknown  SSF56672           DNA/RNA polymerases  coord: 1..347; score: 3.34E
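
The `coord` values in the InterPro table are inclusive start..end positions on the protein, so a domain's length and its share of the protein can be derived from them. The Python below is a minimal sketch of that arithmetic, assuming the protein is 787 residues long (taken from the PANTHER rows, which span 1..787). The helpers `span_length` and `coverage` are hypothetical names introduced for this illustration only.

```python
# Protein length assumed from the PANTHER rows, which span coord 1..787.
PROTEIN_LENGTH = 787

# (source term, source description, coord) copied from the InterPro table.
DOMAINS = [
    ("PF00078", "RVT_1", "2..153"),
    ("PF00665", "rve", "513..622"),
    ("PS50994", "INTEGRASE", "502..665"),
]


def span_length(coord):
    """Length in residues of an inclusive 'start..end' coordinate string."""
    start, end = (int(part) for part in coord.split(".."))
    return end - start + 1


def coverage(coord, protein_length=PROTEIN_LENGTH):
    """Fraction of the protein covered by the given coordinate range."""
    return span_length(coord) / protein_length


for term, name, coord in DOMAINS:
    print(f"{term} {name}: {span_length(coord)} aa, "
          f"{coverage(coord):.1%} of the protein")
```

Because the ranges are inclusive, the length is end minus start plus one; omitting the plus one is a common off-by-one error when working with these tables.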

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None