CmoCh20G009950 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh20G009950
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Reverse transcriptase
Location: Cmo_Chr20: 5925057 .. 5928087 (-)
RNA-Seq Expression: CmoCh20G009950
Synteny: CmoCh20G009950
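
The Location field packs the chromosome, the start and end coordinates, and the strand into one string. A minimal Python sketch of splitting it apart follows. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` is a hypothetical helper name rather than part of any database API.

```python
import re

def parse_location(text):
    """Split a location string such as 'Cmo_Chr20: 5925057 .. 5928087 (-)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so span = end - start + 1.
        "span_bp": end - start + 1,
    }

loc = parse_location("Cmo_Chr20: 5925057 .. 5928087 (-)")
print(loc["chrom"], loc["strand"], loc["span_bp"])
```

Under that assumption the gene spans 3031 bp on the minus strand of Cmo_Chr20.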
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTCATACCACTGATAGAAGAAGCAACCACCGAAGAGACTGTCCTAGAGGAAATCAAGGAAGTACTAGACAGCTATACCGACATAATGCCAGAGAGTCTACAACAAACACTACCACCTCGTCGAGGCATTGACCACGAAATCGAACTCCTCCCCGGGGTTAAGCAGCCAGCGAAGAACGCATACCGGATGGCTTCGCCCGAGCTAGCCGAATTAAGGAAACAACTGGACGAGTTGCTGAAGGCGGGATTCATTCGCCCGACAAAGGCACCTTATGGAGCCCCCGTACTGTTCCAGAAGAAGAAGAATGGGACGTTACGTCTGTGCATAGACTATAGAGCCTTAAACAAGGTGACGGTACGCAACAAATATCCTCTGCCGATAATATCCGACTTGTTCGACAAACTTCACGGGGCCAAGTACTTCACGAAGTTGGACTTACGATCAGGATATTATCAAGTACGTATTGCCGAAGGGGACGAACCCAAGACGACGTGCGTAACAAGATATGGGGCCTTTGAATTCCTAGTAATGCCCTTTGACTTGACAAACGCCCCAGCTACGTTCTGCACGTTGATGAACCAGGTTTTCTACGAATACCTGGATCAGTTCGTCATAGTATACCTCGACGACATTGTTGTTTACAGCACAACCCTCGAGGAACACAAAGTGCACTTAAAGCTAGTGTTTGACAAGCTCCGGCAGAACCAGCTGTATGTGAAGAAGGAGAAATGTGCATTCGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAGATGTGGACAAATTAGTATGGATAGCGACAAGATAAAAGCTATCCAAGAATGGAAGGTTCCTACTTCCGTATCTGAGTTGCGGTCCTTCTTAGGACTAGCCAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACGAGCCGCCCCATTGACAGAGCTGTTGAAGAAAGACCACCCTTGGTCGTGGTCGAATGATTGTCAAATGGCCTTTGAAAATCTGAAAACAACCATGACGAGGGGTCCTGTCCTCGGGTTGGTAGACGTCACAAAGCCATTTGAAGTAGAAACCGATGCTTCCGACTTTGCTCTAGGTGGGGTCCTTATTCAAGAAGGCCACCCCATCGCTTATGAAAGTCGAAAGCTCAACGATGCCGAGCGAAGATACACTGTCTCCGAGAAAGAAATGCTGGCTGTAGTCCATTGCCTTTGAGTCTGGAGACAGTATCTCTTAGGATCACAATTCGTAGTGAAGACGGATAACAGCGCCACTTGCCACTTCTTTGATCAACCAAAATTAACAGCAAAACAAGCCCGGTGGCAGGAGTCGTTGGCCGAATTCGACTTCAAGTTCGAACACAAAGCAGGAAAGAGTAATCAAGCAGCCGACGCGCTGAGTCGGAAGGGCGAACATGCGGCCCTGTGCATGTTAGCCCATATTCATTCAAGTAAGATCGATGGATCGATACGCGACATCATCAAGGAACATTTACACAAAGACCCATCGGCCAAAGCTGTTGTCGAACTAGCTAAGGCTGGAAAGACACGACAGTTTTGGGTTGAGGGGGACCTCCTGATAACCAAAGGAAACAGATTGTACGTCCCAAGAACGGGGGAACTGAGGAAGAAGCTCATTCAGAAATGTCATGATACCTTATGGGCCGGACACCCTGGGTGGCAAAGAACATACGCCCTAATAAAGAAGGGGTACTTTTGGCCAAATATGCGAGACGACATCATGCAATACACCAAGACGTGCCTCATCTGCCAACAGGATAAGGTCGAGAAAGCCAAAGTCTTTGGACTCTTGGAACCTCTACCCGTGCCAACAAGACCCTGGGAAAGTGTATCTCTGGACTTCATAACACACCGCCCAAAAGTCGGGGAATATGACGCCATCTTGGTTATTGTAGACCGGTTCTCAAAATATGCGACATTCATCCCCACTCCCAAATTATGCTCGGCGGAACTCACAGCCCAACTATTTTTCAAACACGTTGTAAA
GCTATCGGGCATTCCATCGAGCATCATCAGTGATAGGGATGGTAGATTCATTGGGACATTCTGGACTGATTTATTCGCTTTCTTGGGAACAACCTTAAACATCTCCTCGAGTTATCACCCTCAAACCGATGCACAGATAGAACGGTTCAATTGCTTGCTCGAAGAATACTTACGTCACTTCGTCGATGCTCGCCAAAAGAACTGGATACAATTGTTAGATGTCGCCCAATTTTGCTTCAATTGTCAAACAAGCTCGTCTACAGGGAAGAGTCCCTTCGAAATTGTAAGTGGACGACAACCGGCCTTACCCCATATTATTGATTATCCTTATGCAGGAAAAAACCCTCAAGCTCACAACTTTACAAGAGAATGGAAGCAGACGACAGATATAGCCCGGGCATATTTAGAGAAGGCTTCCAAGCATATGAAGAAATGGGCGGACAAGAAGCGTCGCCCCCTTCAATTCCGTGCAGGAGATCAAGTCCTTATCAAGCTGAGACCAGAACAGATCAGATTTCGTAGCCGAAAGGACCAAAGACTAGTTCGTAAATATGAAGGCCCGGTTGAAGTCCTTAAAAAGATCGGGGCTACCTCCTACCGAGTTGCACTACCCACATGGATGAAAATCCACCCCGTCATTCATGTGAGCAACTTGAAGCCCTATCACCCTGATCCAGACGACGACCAACGAAATGCAACCACAAGACTGAACATCGATCTGCAGCAAAAAGAAACAAAGGAAGTGGAAGAGATCCTAGTCGACAGGGTTAGGAAGATAGGAAGACCTGTACGGACGATTCGTGAATTCCTTATCAAATGGAAGAATCTCCCTACAGAGGAAACAAGCTGGGAACGCGCCGAAGATCTGAACTCCGCCGCCACCCACATCGCAAGGTATGAAAGTAGTCGGTTGACAGGGACGTCAACCAATTAGGTGGGGGAGGATGTCACGGTCATGCTTGTCCAGGGCATGTTGCCCATGGCCGCATGCCCATGACAAGCCAAGCCCTTATCATGTCTTTGCATTCTAG

mRNA sequence

ATGGTCATACCACTGATAGAAGAAGCAACCACCGAAGAGACTGTCCTAGAGGAAATCAAGGAAGTACTAGACAGCTATACCGACATAATGCCAGAGAGTCTACAACAAACACTACCACCTCGTCGAGGCATTGACCACGAAATCGAACTCCTCCCCGGGGTTAAGCAGCCAGCGAAGAACGCATACCGGATGGCTTCGCCCGAGCTAGCCGAATTAAGGAAACAACTGGACGAGTTGCTGAAGGCGGGATTCATTCGCCCGACAAAGGCACCTTATGGAGCCCCCGTACTGTTCCAGAAGAAGAAGAATGGGACGTTACGTCTGTGCATAGACTATAGAGCCTTAAACAAGGTGACGGTACGCAACAAATATCCTCTGCCGATAATATCCGACTTGTTCGACAAACTTCACGGGGCCAAGTACTTCACGAAGTTGGACTTACGATCAGGATATTATCAAGTACGTATTGCCGAAGGGGACGAACCCAAGACGACGTGCGTAACAAGATATGGGGCCTTTGAATTCCTAGTAATGCCCTTTGACTTGACAAACGCCCCAGCTACGTTCTGCACGTTGATGAACCAGGTTTTCTACGAATACCTGGATCAGTTCGTCATAGTATACCTCGACGACATTGTTGTTTACAGCACAACCCTCGAGGAACACAAAGTGCACTTAAAGCTAGTGTTTGACAAGCTCCGGCAGAACCAGCTGTATGTGAAGAAGGAGAAATGTGCATTCGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAGATGTGGACAAATTAGTATGGATAGCGACAAGATAAAAGCTATCCAAGAATGGAAGGTTCCTACTTCCGTATCTGAGTTGCGGTCCTTCTTAGGACTAGCCAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACGAGCCGCCCCATTGACAGAGCTGTTGAAGAAAGACCACCCTTGGTCGTGGTCGAATGATTGTCAAATGGCCTTTGAAAATCTGAAAACAACCATGACGAGGGGTCCTGTCCTCGGGTTGGTAGACGTCACAAAGCCATTTGAAGTAGAAACCGATGCTTCCGACTTTGCTCTAGTGAAGACGGATAACAGCGCCACTTGCCACTTCTTTGATCAACCAAAATTAACAGCAAAACAAGCCCGGTGGCAGGAGTCGTTGGCCGAATTCGACTTCAAGTTCGAACACAAAGCAGGAAAGAGTAATCAAGCAGCCGACGCGCTGAGTCGGAAGGGCGAACATGCGGCCCTGTGCATGTTAGCCCATATTCATTCAAGTAAGATCGATGGATCGATACGCGACATCATCAAGGAACATTTACACAAAGACCCATCGGCCAAAGCTGTTGTCGAACTAGCTAAGGCTGGAAAGACACGACAGTTTTGGGTTGAGGGGGACCTCCTGATAACCAAAGGAAACAGATTGTACGTCCCAAGAACGGGGGAACTGAGGAAGAAGCTCATTCAGAAATGTCATGATACCTTATGGGCCGGACACCCTGGGTGGCAAAGAACATACGCCCTAATAAAGAAGGGGTACTTTTGGCCAAATATGCGAGACGACATCATGCAATACACCAAGACGTGCCTCATCTGCCAACAGGATAAGGTCGAGAAAGCCAAAGTCTTTGGACTCTTGGAACCTCTACCCGTGCCAACAAGACCCTGGGAAAGTGTATCTCTGGACTTCATAACACACCGCCCAAAAGTCGGGGAATATGACGCCATCTTGGTTATTGTAGACCGGTTCTCAAAATATGCGACATTCATCCCCACTCCCAAATTATGCTCGGCGGAACTCACAGCCCAACTATTTTTCAAACACGTTGTAAAGCTATCGGGCATTCCATCGAGCATCATCAGTGATAGGGATGGTAGATTCATTGGGACATTCTGGACTGATTTATTCGCTTTCTTGGGAACAACCTTAAACATCTCCTCGAGTTATCACCCTCAAACCGATGCACAGATAGAACGGTTCAATTGCTT
GCTCGAAGAATACTTACGTCACTTCGTCGATGCTCGCCAAAAGAACTGGATACAATTGTTAGATGTCGCCCAATTTTGCTTCAATTGTCAAACAAGCTCGTCTACAGGGAAGAGTCCCTTCGAAATTGTAAGTGGACGACAACCGGCCTTACCCCATATTATTGATTATCCTTATGCAGGAAAAAACCCTCAAGCTCACAACTTTACAAGAGAATGGAAGCAGACGACAGATATAGCCCGGGCATATTTAGAGAAGGCTTCCAAGCATATGAAGAAATGGGCGGACAAGAAGCGTCGCCCCCTTCAATTCCGTGCAGGAGATCAAGTCCTTATCAAGCTGAGACCAGAACAGATCAGATTTCGTAGCCGAAAGGACCAAAGACTAGTTCGTAAATATGAAGGCCCGGTTGAAGTCCTTAAAAAGATCGGGGCTACCTCCTACCGAGTTGCACTACCCACATGGATGAAAATCCACCCCGTCATTCATGTGAGCAACTTGAAGCCCTATCACCCTGATCCAGACGACGACCAACGAAATGCAACCACAAGACTGAACATCGATCTGCAGCAAAAAGAAACAAAGGAAGTGGAAGAGATCCTAGTCGACAGGGTTAGGAAGATAGGAAGACCTGTACGGACGATTCGTGAATTCCTTATCAAATGGAAGAATCTCCCTACAGAGGAAACAAGCTGGGAACGCGCCGAAGATCTGAACTCCGCCGCCACCCACATCGCAAGGGCATGTTGCCCATGGCCGCATGCCCATGACAAGCCAAGCCCTTATCATGTCTTTGCATTCTAG

Coding sequence (CDS)

ATGGTCATACCACTGATAGAAGAAGCAACCACCGAAGAGACTGTCCTAGAGGAAATCAAGGAAGTACTAGACAGCTATACCGACATAATGCCAGAGAGTCTACAACAAACACTACCACCTCGTCGAGGCATTGACCACGAAATCGAACTCCTCCCCGGGGTTAAGCAGCCAGCGAAGAACGCATACCGGATGGCTTCGCCCGAGCTAGCCGAATTAAGGAAACAACTGGACGAGTTGCTGAAGGCGGGATTCATTCGCCCGACAAAGGCACCTTATGGAGCCCCCGTACTGTTCCAGAAGAAGAAGAATGGGACGTTACGTCTGTGCATAGACTATAGAGCCTTAAACAAGGTGACGGTACGCAACAAATATCCTCTGCCGATAATATCCGACTTGTTCGACAAACTTCACGGGGCCAAGTACTTCACGAAGTTGGACTTACGATCAGGATATTATCAAGTACGTATTGCCGAAGGGGACGAACCCAAGACGACGTGCGTAACAAGATATGGGGCCTTTGAATTCCTAGTAATGCCCTTTGACTTGACAAACGCCCCAGCTACGTTCTGCACGTTGATGAACCAGGTTTTCTACGAATACCTGGATCAGTTCGTCATAGTATACCTCGACGACATTGTTGTTTACAGCACAACCCTCGAGGAACACAAAGTGCACTTAAAGCTAGTGTTTGACAAGCTCCGGCAGAACCAGCTGTATGTGAAGAAGGAGAAATGTGCATTCGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAGATGTGGACAAATTAGTATGGATAGCGACAAGATAAAAGCTATCCAAGAATGGAAGGTTCCTACTTCCGTATCTGAGTTGCGGTCCTTCTTAGGACTAGCCAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACGAGCCGCCCCATTGACAGAGCTGTTGAAGAAAGACCACCCTTGGTCGTGGTCGAATGATTGTCAAATGGCCTTTGAAAATCTGAAAACAACCATGACGAGGGGTCCTGTCCTCGGGTTGGTAGACGTCACAAAGCCATTTGAAGTAGAAACCGATGCTTCCGACTTTGCTCTAGTGAAGACGGATAACAGCGCCACTTGCCACTTCTTTGATCAACCAAAATTAACAGCAAAACAAGCCCGGTGGCAGGAGTCGTTGGCCGAATTCGACTTCAAGTTCGAACACAAAGCAGGAAAGAGTAATCAAGCAGCCGACGCGCTGAGTCGGAAGGGCGAACATGCGGCCCTGTGCATGTTAGCCCATATTCATTCAAGTAAGATCGATGGATCGATACGCGACATCATCAAGGAACATTTACACAAAGACCCATCGGCCAAAGCTGTTGTCGAACTAGCTAAGGCTGGAAAGACACGACAGTTTTGGGTTGAGGGGGACCTCCTGATAACCAAAGGAAACAGATTGTACGTCCCAAGAACGGGGGAACTGAGGAAGAAGCTCATTCAGAAATGTCATGATACCTTATGGGCCGGACACCCTGGGTGGCAAAGAACATACGCCCTAATAAAGAAGGGGTACTTTTGGCCAAATATGCGAGACGACATCATGCAATACACCAAGACGTGCCTCATCTGCCAACAGGATAAGGTCGAGAAAGCCAAAGTCTTTGGACTCTTGGAACCTCTACCCGTGCCAACAAGACCCTGGGAAAGTGTATCTCTGGACTTCATAACACACCGCCCAAAAGTCGGGGAATATGACGCCATCTTGGTTATTGTAGACCGGTTCTCAAAATATGCGACATTCATCCCCACTCCCAAATTATGCTCGGCGGAACTCACAGCCCAACTATTTTTCAAACACGTTGTAAAGCTATCGGGCATTCCATCGAGCATCATCAGTGATAGGGATGGTAGATTCATTGGGACATTCTGGACTGATTTATTCGCTTTCTTGGGAACAACCTTAAACATCTCCTCGAGTTATCACCCTCAAACCGATGCACAGATAGAACGGTTCAATTGCTT
GCTCGAAGAATACTTACGTCACTTCGTCGATGCTCGCCAAAAGAACTGGATACAATTGTTAGATGTCGCCCAATTTTGCTTCAATTGTCAAACAAGCTCGTCTACAGGGAAGAGTCCCTTCGAAATTGTAAGTGGACGACAACCGGCCTTACCCCATATTATTGATTATCCTTATGCAGGAAAAAACCCTCAAGCTCACAACTTTACAAGAGAATGGAAGCAGACGACAGATATAGCCCGGGCATATTTAGAGAAGGCTTCCAAGCATATGAAGAAATGGGCGGACAAGAAGCGTCGCCCCCTTCAATTCCGTGCAGGAGATCAAGTCCTTATCAAGCTGAGACCAGAACAGATCAGATTTCGTAGCCGAAAGGACCAAAGACTAGTTCGTAAATATGAAGGCCCGGTTGAAGTCCTTAAAAAGATCGGGGCTACCTCCTACCGAGTTGCACTACCCACATGGATGAAAATCCACCCCGTCATTCATGTGAGCAACTTGAAGCCCTATCACCCTGATCCAGACGACGACCAACGAAATGCAACCACAAGACTGAACATCGATCTGCAGCAAAAAGAAACAAAGGAAGTGGAAGAGATCCTAGTCGACAGGGTTAGGAAGATAGGAAGACCTGTACGGACGATTCGTGAATTCCTTATCAAATGGAAGAATCTCCCTACAGAGGAAACAAGCTGGGAACGCGCCGAAGATCTGAACTCCGCCGCCACCCACATCGCAAGGGCATGTTGCCCATGGCCGCATGCCCATGACAAGCCAAGCCCTTATCATGTCTTTGCATTCTAG

Protein sequence

MVIPLIEEATTEETVLEEIKEVLDSYTDIMPESLQQTLPPRRGIDHEIELLPGVKQPAKNAYRMASPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDLFDKLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTLMNQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSIRDIIKEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLIQKCHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVFGLLEPLPVPTRPWESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSAELTAQLFFKHVVKLSGIPSSIISDRDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIERFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIIDYPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPLQFRAGDQVLIKLRPEQIRFRSRKDQRLVRKYEGPVEVLKKIGATSYRVALPTWMKIHPVIHVSNLKPYHPDPDDDQRNATTRLNIDLQQKETKEVEEILVDRVRKIGRPVRTIREFLIKWKNLPTEETSWERAEDLNSAATHIARACCPWPHAHDKPSPYHVFAF
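
The protein sequence above corresponds to the coding sequence read three bases at a time. The sketch below illustrates this on the first and last few codons of the CDS, assuming the standard nuclear genetic code (NCBI translation table 1); the page does not say which table was used. `translate` is a hypothetical helper and does not handle ambiguous bases such as N.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS listed above.
print(translate("ATGGTCATACCACTGATAGAAGAA"))
# Last five codons of the CDS, ending in the TAG stop codon.
print(translate("GTCTTTGCATTCTAG"))
```

The two fragments translate to MVIPLIEE and VFAF, which match the start and end of the protein sequence listed above.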
Homology
BLAST of CmoCh20G009950 vs. ExPASy Swiss-Prot
Match: P0CT41 (Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-12 PE=3 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 1.2e-119
Identity = 283/929 (30.46%), Positives = 445/929 (47.90%), Query Frame = 0

Query: 18   EIKEVLDSYTDIMPESLQQTLP-PRRGIDHEIELL-PGVKQPAKNAYRMASPELAELRKQ 77
            E+ ++   + DI  E+  + LP P +G++ E+EL     + P +N Y +   ++  +  +
Sbjct: 373  ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432

Query: 78   LDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDLFDK 137
            +++ LK+G IR +KA    PV+F  KK GTLR+ +DY+ LNK    N YPLP+I  L  K
Sbjct: 433  INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492

Query: 138  LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTLMNQ 197
            + G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+ ++ APA F   +N 
Sbjct: 493  IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552

Query: 198  VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 257
            +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+
Sbjct: 553  ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612

Query: 258  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLK 317
            G+ +     +   + I  + +WK P +  ELR FLG  NY R+F+   S+   PL  LLK
Sbjct: 613  GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672

Query: 318  KDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFAL------------ 377
            KD  W W+     A EN+K  +   PVL   D +K   +ETDASD A+            
Sbjct: 673  KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDK 732

Query: 378  --------VKTDNSATCHFFDQPKLTA--------------------------------- 437
                     K   +   +     ++ A                                 
Sbjct: 733  YYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRIT 792

Query: 438  --------KQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDG 497
                    + ARWQ  L +F+F+  ++ G +N  ADALSR  +         I     D 
Sbjct: 793  NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDET-----EPIPKDSEDN 852

Query: 498  SIRDIIKEHLHKDPSAKAVVELAKAGK------------TRQFWVEGDLLITKGNRLYVP 557
            SI  + +  +  D   + V E     K                 ++  LLI   +++ +P
Sbjct: 853  SINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLP 912

Query: 558  RTGELRKKLIQKCHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVE 617
               +L + +I+K H+     HPG +    +I + + W  +R  I +Y + C  CQ +K  
Sbjct: 913  NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 972

Query: 618  KAKVFGLLEPLPVPTRPWESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSA 677
              K +G L+P+P   RPWES+S+DFIT  P+   Y+A+ V+VDRFSK A  +P  K  +A
Sbjct: 973  NHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITA 1032

Query: 678  ELTAQLFFKHVVKLSGIPSSIISDRDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIE 737
            E TA++F + V+   G P  II+D D  F    W D        +  S  Y PQTD Q E
Sbjct: 1033 EQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTE 1092

Query: 738  RFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIID 797
            R N  +E+ LR         W+  + + Q  +N    S+T  +PFEIV    PAL  +  
Sbjct: 1093 RTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL 1152

Query: 798  YPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPL-QFRAGDQVLIKLR 857
              ++ K  +    ++E  Q     + +L   +  MKK+ D K + + +F+ GD V++K  
Sbjct: 1153 PSFSDKTDEN---SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRT 1212

Query: 858  PEQIRFRSRKDQRLVRKYEGPVEVLKKIGATSYRVALPTWMK--IHPVIHVSNLKPYHPD 864
                  +S K   L   + GP  VL+K G  +Y + LP  +K       HVS+L+ Y  +
Sbjct: 1213 KTGFLHKSNK---LAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKYRHN 1272
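
The percentages in an HSP summary line are the matched positions divided by the alignment length, which includes gap columns. A minimal sketch of recomputing them from the line's text follows. `hsp_fractions` is a hypothetical helper, and the pattern accepts both spellings of the second label ("Positives" and "Postives").

```python
import re

def hsp_fractions(summary):
    """Return identity/positives percentages from a BLAST HSP summary line."""
    result = {}
    pattern = r"(Identity|Posi?tives) = (\d+)/(\d+)"
    for label, matched, length in re.findall(pattern, summary):
        key = "identity" if label == "Identity" else "positives"
        # Percentage of alignment columns (gaps included) that match.
        result[key] = round(100 * int(matched) / int(length), 2)
    return result

line = "Identity = 283/929 (30.46%), Positives = 445/929 (47.90%), Query Frame = 0"
print(hsp_fractions(line))
```

For the hit above, 283 identical and 445 similar positions over 929 alignment columns give 30.46% and 47.90%.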

BLAST of CmoCh20G009950 vs. ExPASy Swiss-Prot
Match: P0CT34 (Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-1 PE=3 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 1.2e-119
Identity = 283/929 (30.46%), Positives = 445/929 (47.90%), Query Frame = 0

Query: 18   EIKEVLDSYTDIMPESLQQTLP-PRRGIDHEIELL-PGVKQPAKNAYRMASPELAELRKQ 77
            E+ ++   + DI  E+  + LP P +G++ E+EL     + P +N Y +   ++  +  +
Sbjct: 373  ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432

Query: 78   LDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDLFDK 137
            +++ LK+G IR +KA    PV+F  KK GTLR+ +DY+ LNK    N YPLP+I  L  K
Sbjct: 433  INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492

Query: 138  LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTLMNQ 197
            + G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+ ++ APA F   +N 
Sbjct: 493  IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552

Query: 198  VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 257
            +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+
Sbjct: 553  ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612

Query: 258  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLK 317
            G+ +     +   + I  + +WK P +  ELR FLG  NY R+F+   S+   PL  LLK
Sbjct: 613  GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672

Query: 318  KDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFAL------------ 377
            KD  W W+     A EN+K  +   PVL   D +K   +ETDASD A+            
Sbjct: 673  KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDK 732

Query: 378  --------VKTDNSATCHFFDQPKLTA--------------------------------- 437
                     K   +   +     ++ A                                 
Sbjct: 733  YYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRIT 792

Query: 438  --------KQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDG 497
                    + ARWQ  L +F+F+  ++ G +N  ADALSR  +         I     D 
Sbjct: 793  NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDET-----EPIPKDSEDN 852

Query: 498  SIRDIIKEHLHKDPSAKAVVELAKAGK------------TRQFWVEGDLLITKGNRLYVP 557
            SI  + +  +  D   + V E     K                 ++  LLI   +++ +P
Sbjct: 853  SINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLP 912

Query: 558  RTGELRKKLIQKCHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVE 617
               +L + +I+K H+     HPG +    +I + + W  +R  I +Y + C  CQ +K  
Sbjct: 913  NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 972

Query: 618  KAKVFGLLEPLPVPTRPWESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSA 677
              K +G L+P+P   RPWES+S+DFIT  P+   Y+A+ V+VDRFSK A  +P  K  +A
Sbjct: 973  NHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITA 1032

Query: 678  ELTAQLFFKHVVKLSGIPSSIISDRDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIE 737
            E TA++F + V+   G P  II+D D  F    W D        +  S  Y PQTD Q E
Sbjct: 1033 EQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTE 1092

Query: 738  RFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIID 797
            R N  +E+ LR         W+  + + Q  +N    S+T  +PFEIV    PAL  +  
Sbjct: 1093 RTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL 1152

Query: 798  YPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPL-QFRAGDQVLIKLR 857
              ++ K  +    ++E  Q     + +L   +  MKK+ D K + + +F+ GD V++K  
Sbjct: 1153 PSFSDKTDEN---SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRT 1212

Query: 858  PEQIRFRSRKDQRLVRKYEGPVEVLKKIGATSYRVALPTWMK--IHPVIHVSNLKPYHPD 864
                  +S K   L   + GP  VL+K G  +Y + LP  +K       HVS+L+ Y  +
Sbjct: 1213 KTGFLHKSNK---LAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKYRHN 1272

BLAST of CmoCh20G009950 vs. ExPASy Swiss-Prot
Match: P0CT35 (Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-2 PE=3 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 1.2e-119
Identity = 283/929 (30.46%), Positives = 445/929 (47.90%), Query Frame = 0

Query: 18   EIKEVLDSYTDIMPESLQQTLP-PRRGIDHEIELL-PGVKQPAKNAYRMASPELAELRKQ 77
            E+ ++   + DI  E+  + LP P +G++ E+EL     + P +N Y +   ++  +  +
Sbjct: 373  ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432

Query: 78   LDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDLFDK 137
            +++ LK+G IR +KA    PV+F  KK GTLR+ +DY+ LNK    N YPLP+I  L  K
Sbjct: 433  INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492

Query: 138  LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTLMNQ 197
            + G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+ ++ APA F   +N 
Sbjct: 493  IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552

Query: 198  VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 257
            +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+
Sbjct: 553  ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612

Query: 258  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLK 317
            G+ +     +   + I  + +WK P +  ELR FLG  NY R+F+   S+   PL  LLK
Sbjct: 613  GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672

Query: 318  KDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFAL------------ 377
            KD  W W+     A EN+K  +   PVL   D +K   +ETDASD A+            
Sbjct: 673  KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDK 732

Query: 378  --------VKTDNSATCHFFDQPKLTA--------------------------------- 437
                     K   +   +     ++ A                                 
Sbjct: 733  YYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRIT 792

Query: 438  --------KQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDG 497
                    + ARWQ  L +F+F+  ++ G +N  ADALSR  +         I     D 
Sbjct: 793  NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDET-----EPIPKDSEDN 852

Query: 498  SIRDIIKEHLHKDPSAKAVVELAKAGK------------TRQFWVEGDLLITKGNRLYVP 557
            SI  + +  +  D   + V E     K                 ++  LLI   +++ +P
Sbjct: 853  SINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLP 912

Query: 558  RTGELRKKLIQKCHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVE 617
               +L + +I+K H+     HPG +    +I + + W  +R  I +Y + C  CQ +K  
Sbjct: 913  NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 972

Query: 618  KAKVFGLLEPLPVPTRPWESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSA 677
              K +G L+P+P   RPWES+S+DFIT  P+   Y+A+ V+VDRFSK A  +P  K  +A
Sbjct: 973  NHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITA 1032

Query: 678  ELTAQLFFKHVVKLSGIPSSIISDRDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIE 737
            E TA++F + V+   G P  II+D D  F    W D        +  S  Y PQTD Q E
Sbjct: 1033 EQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTE 1092

Query: 738  RFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIID 797
            R N  +E+ LR         W+  + + Q  +N    S+T  +PFEIV    PAL  +  
Sbjct: 1093 RTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL 1152

Query: 798  YPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPL-QFRAGDQVLIKLR 857
              ++ K  +    ++E  Q     + +L   +  MKK+ D K + + +F+ GD V++K  
Sbjct: 1153 PSFSDKTDEN---SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRT 1212

Query: 858  PEQIRFRSRKDQRLVRKYEGPVEVLKKIGATSYRVALPTWMK--IHPVIHVSNLKPYHPD 864
                  +S K   L   + GP  VL+K G  +Y + LP  +K       HVS+L+ Y  +
Sbjct: 1213 KTGFLHKSNK---LAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKYRHN 1272

BLAST of CmoCh20G009950 vs. ExPASy Swiss-Prot
Match: P0CT36 (Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-3 PE=1 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 1.2e-119
Identity = 283/929 (30.46%), Positives = 445/929 (47.90%), Query Frame = 0

Query: 18   EIKEVLDSYTDIMPESLQQTLP-PRRGIDHEIELL-PGVKQPAKNAYRMASPELAELRKQ 77
            E+ ++   + DI  E+  + LP P +G++ E+EL     + P +N Y +   ++  +  +
Sbjct: 373  ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432

Query: 78   LDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDLFDK 137
            +++ LK+G IR +KA    PV+F  KK GTLR+ +DY+ LNK    N YPLP+I  L  K
Sbjct: 433  INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492

Query: 138  LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTLMNQ 197
            + G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+ ++ APA F   +N 
Sbjct: 493  IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552

Query: 198  VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 257
            +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+
Sbjct: 553  ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612

Query: 258  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLK 317
            G+ +     +   + I  + +WK P +  ELR FLG  NY R+F+   S+   PL  LLK
Sbjct: 613  GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672

Query: 318  KDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFAL------------ 377
            KD  W W+     A EN+K  +   PVL   D +K   +ETDASD A+            
Sbjct: 673  KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDK 732

Query: 378  --------VKTDNSATCHFFDQPKLTA--------------------------------- 437
                     K   +   +     ++ A                                 
Sbjct: 733  YYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRIT 792

Query: 438  --------KQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDG 497
                    + ARWQ  L +F+F+  ++ G +N  ADALSR  +         I     D 
Sbjct: 793  NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDET-----EPIPKDSEDN 852

Query: 498  SIRDIIKEHLHKDPSAKAVVELAKAGK------------TRQFWVEGDLLITKGNRLYVP 557
            SI  + +  +  D   + V E     K                 ++  LLI   +++ +P
Sbjct: 853  SINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLP 912

Query: 558  RTGELRKKLIQKCHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVE 617
               +L + +I+K H+     HPG +    +I + + W  +R  I +Y + C  CQ +K  
Sbjct: 913  NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 972

Query: 618  KAKVFGLLEPLPVPTRPWESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSA 677
              K +G L+P+P   RPWES+S+DFIT  P+   Y+A+ V+VDRFSK A  +P  K  +A
Sbjct: 973  NHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITA 1032

Query: 678  ELTAQLFFKHVVKLSGIPSSIISDRDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIE 737
            E TA++F + V+   G P  II+D D  F    W D        +  S  Y PQTD Q E
Sbjct: 1033 EQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTE 1092

Query: 738  RFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIID 797
            R N  +E+ LR         W+  + + Q  +N    S+T  +PFEIV    PAL  +  
Sbjct: 1093 RTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL 1152

Query: 798  YPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPL-QFRAGDQVLIKLR 857
              ++ K  +    ++E  Q     + +L   +  MKK+ D K + + +F+ GD V++K  
Sbjct: 1153 PSFSDKTDEN---SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRT 1212

Query: 858  PEQIRFRSRKDQRLVRKYEGPVEVLKKIGATSYRVALPTWMK--IHPVIHVSNLKPYHPD 864
                  +S K   L   + GP  VL+K G  +Y + LP  +K       HVS+L+ Y  +
Sbjct: 1213 KTGFLHKSNK---LAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKYRHN 1272

BLAST of CmoCh20G009950 vs. ExPASy Swiss-Prot
Match: P0CT37 (Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-4 PE=3 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 1.2e-119
Identity = 283/929 (30.46%), Positives = 445/929 (47.90%), Query Frame = 0

Query: 18   EIKEVLDSYTDIMPESLQQTLP-PRRGIDHEIELL-PGVKQPAKNAYRMASPELAELRKQ 77
            E+ ++   + DI  E+  + LP P +G++ E+EL     + P +N Y +   ++  +  +
Sbjct: 373  ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432

Query: 78   LDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTVRNKYPLPIISDLFDK 137
            +++ LK+G IR +KA    PV+F  KK GTLR+ +DY+ LNK    N YPLP+I  L  K
Sbjct: 433  INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492

Query: 138  LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFDLTNAPATFCTLMNQ 197
            + G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+ ++ APA F   +N 
Sbjct: 493  IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552

Query: 198  VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 257
            +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+
Sbjct: 553  ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612

Query: 258  GHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLK 317
            G+ +     +   + I  + +WK P +  ELR FLG  NY R+F+   S+   PL  LLK
Sbjct: 613  GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672

Query: 318  KDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFAL------------ 377
            KD  W W+     A EN+K  +   PVL   D +K   +ETDASD A+            
Sbjct: 673  KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDK 732

Query: 378  --------VKTDNSATCHFFDQPKLTA--------------------------------- 437
                     K   +   +     ++ A                                 
Sbjct: 733  YYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRIT 792

Query: 438  --------KQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDG 497
                    + ARWQ  L +F+F+  ++ G +N  ADALSR  +         I     D 
Sbjct: 793  NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDET-----EPIPKDSEDN 852

Query: 498  SIRDIIKEHLHKDPSAKAVVELAKAGK------------TRQFWVEGDLLITKGNRLYVP 557
            SI  + +  +  D   + V E     K                 ++  LLI   +++ +P
Sbjct: 853  SINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLP 912

Query: 558  RTGELRKKLIQKCHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVE 617
               +L + +I+K H+     HPG +    +I + + W  +R  I +Y + C  CQ +K  
Sbjct: 913  NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 972

Query: 618  KAKVFGLLEPLPVPTRPWESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSA 677
              K +G L+P+P   RPWES+S+DFIT  P+   Y+A+ V+VDRFSK A  +P  K  +A
Sbjct: 973  NHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITA 1032

Query: 678  ELTAQLFFKHVVKLSGIPSSIISDRDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIE 737
            E TA++F + V+   G P  II+D D  F    W D        +  S  Y PQTD Q E
Sbjct: 1033 EQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTE 1092

Query: 738  RFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIID 797
            R N  +E+ LR         W+  + + Q  +N    S+T  +PFEIV    PAL  +  
Sbjct: 1093 RTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL 1152

Query: 798  YPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPL-QFRAGDQVLIKLR 857
              ++ K  +    ++E  Q     + +L   +  MKK+ D K + + +F+ GD V++K  
Sbjct: 1153 PSFSDKTDEN---SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRT 1212

Query: 858  PEQIRFRSRKDQRLVRKYEGPVEVLKKIGATSYRVALPTWMK--IHPVIHVSNLKPYHPD 864
                  +S K   L   + GP  VL+K G  +Y + LP  +K       HVS+L+ Y  +
Sbjct: 1213 KTGFLHKSNK---LAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKYRHN 1272

BLAST of CmoCh20G009950 vs. ExPASy TrEMBL
Match: A0A5A7SUK4 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold81G00100 PE=4 SV=1)

HSP 1 Score: 1488.0 bits (3851), Expect = 0.0e+00
Identity = 719/958 (75.05%), Positives = 809/958 (84.45%), Query Frame = 0

Query: 1    MVIPLIEEATTEETVLEEIKEVLDSYTDIMPESLQQTLPPRRGIDHEIELLPGVKQPAKN 60
            M IPL     + ETV +EI  VL+ Y D+MP+SL ++LPPRR IDHEIEL+PG K PAKN
Sbjct: 611  MAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSLPKSLPPRRMIDHEIELVPGAKPPAKN 670

Query: 61   AYRMASPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTV 120
            AYRMA PELAELRKQLDELL AGFIRP KAPYGAPVLFQ+KK+G+LRLCIDYRALNK+TV
Sbjct: 671  AYRMAPPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTV 730

Query: 121  RNKYPLPIISDLFDKLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPF 180
            RNKYPLPII+DLFD+LHGAKYF+KLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPF
Sbjct: 731  RNKYPLPIITDLFDRLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPF 790

Query: 181  DLTNAPATFCTLMNQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYV 240
             LTNAPATFCTLMNQVF+EYLD+FV+VYLDDIVVYSTT+EEH+ HL+ VF KL++NQLYV
Sbjct: 791  GLTNAPATFCTLMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLKENQLYV 850

Query: 241  KKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFV 300
            K+EKC+FAQ  INFLGHV+ CG+I M+  KI AI++W +P SVSELRSFLGLANYYRRFV
Sbjct: 851  KREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFV 910

Query: 301  EGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASD 360
            EGFS+RA+PLTELLKKD  W+W  +CQ AF+ LK  +  GP+LG+ DVTKPFEVETDASD
Sbjct: 911  EGFSKRASPLTELLKKDVHWNWDPECQTAFDGLKQALMEGPLLGIADVTKPFEVETDASD 970

Query: 361  FAL----------------------------------------------------VKTDN 420
            +AL                                                    VKTDN
Sbjct: 971  YALGGVLLQNGHPIAYESRKLNAAERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDN 1030

Query: 421  SATCHFFDQPKLTAKQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIH 480
            SATCHFF QPKLT+KQARWQE LAEFDF+FEHK G SNQAADALSRK EHAA+C+LAH+ 
Sbjct: 1031 SATCHFFTQPKLTSKQARWQEFLAEFDFEFEHKKGSSNQAADALSRKQEHAAICLLAHLQ 1090

Query: 481  SSKIDGSIRDIIKEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELR 540
             S+I GS+RD ++E L KD +A+ V+ LAKAGKTRQFWVE DLL+TKGNRLYVPR G LR
Sbjct: 1091 GSEIGGSVRDTLREFLQKDHAAQNVMNLAKAGKTRQFWVEEDLLVTKGNRLYVPRAGGLR 1150

Query: 541  KKLIQKCHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVFG 600
            KKL+ +CHDTLWAGHPGWQRTYAL+KKGYFWPNMRDD+MQYTKTCLICQQDKVEK KV G
Sbjct: 1151 KKLLYECHDTLWAGHPGWQRTYALLKKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAG 1210

Query: 601  LLEPLPVPTRPWESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSAELTAQL 660
            LL+PLPVPTRPWESVS+DFITH PKVG+++AILVI+DRFSKYATFIP  K CSAE TAQL
Sbjct: 1211 LLDPLPVPTRPWESVSMDFITHLPKVGDFEAILVIIDRFSKYATFIPATKQCSAETTAQL 1270

Query: 661  FFKHVVKLSGIPSSIISDRDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIERFNCLL 720
            FFKHVVKL G+P+SI+SDRDGRFIG+FWT+LF+FLGT+LNISSSYHPQTD Q ERFN +L
Sbjct: 1271 FFKHVVKLWGVPTSIVSDRDGRFIGSFWTELFSFLGTSLNISSSYHPQTDGQTERFNSML 1330

Query: 721  EEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIIDYPYAGK 780
            EEYLRHFV+ARQKNW+QLLDVAQFCFN QTSSSTG+SPFEIVSGRQP LPH++D+P+AGK
Sbjct: 1331 EEYLRHFVNARQKNWVQLLDVAQFCFNAQTSSSTGRSPFEIVSGRQPVLPHLVDHPFAGK 1390

Query: 781  NPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPLQFRAGDQVLIKLRPEQIRFR 840
            NPQA NFT+EW+QT DIARAYLEKASK MKKWADKKRRPL+FRAGDQVLIKLRPEQ+RFR
Sbjct: 1391 NPQALNFTKEWRQTNDIARAYLEKASKRMKKWADKKRRPLEFRAGDQVLIKLRPEQVRFR 1450

Query: 841  SRKDQRLVRKYEGPVEVLKKIGATSYRVALPTWMKIHPVIHVSNLKPYHPDPDDDQRNAT 900
             RKDQRLVRKYEGPVEVLKK+G TSYRVALPTWMKI+PVIHVSNLKPYH D +D QRN  
Sbjct: 1451 GRKDQRLVRKYEGPVEVLKKVGNTSYRVALPTWMKIYPVIHVSNLKPYHQDTEDLQRNVV 1510

Query: 901  TRLNIDLQQKETKEVEEILVDRVRKIGRPVRTIREFLIKWKNLPTEETSWERAEDLNS 907
            TR  IDL QKE K+VEEIL +RVRK  RP R I E+L+KWKNLP EETSWER EDL +
Sbjct: 1511 TRPIIDLSQKEDKDVEEILAERVRKSRRPARRIHEYLVKWKNLPVEETSWERVEDLEA 1568

BLAST of CmoCh20G009950 vs. ExPASy TrEMBL
Match: A0A5D3C9P8 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold265G00750 PE=4 SV=1)

HSP 1 Score: 1488.0 bits (3851), Expect = 0.0e+00
Identity = 719/958 (75.05%), Positives = 809/958 (84.45%), Query Frame = 0

Query: 1    MVIPLIEEATTEETVLEEIKEVLDSYTDIMPESLQQTLPPRRGIDHEIELLPGVKQPAKN 60
            M IPL     + ETV +EI  VL+ Y D+MP+SL ++LPPRR IDHEIEL+PG K PAKN
Sbjct: 581  MAIPLNSSENSGETVPKEIVRVLEKYRDVMPDSLPKSLPPRRMIDHEIELVPGAKPPAKN 640

Query: 61   AYRMASPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTV 120
            AYRMA PELAELRKQLDELL AGFIRP KAPYGAPVLFQKKK+G+LRLCIDYRALNK+TV
Sbjct: 641  AYRMAPPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQKKKDGSLRLCIDYRALNKLTV 700

Query: 121  RNKYPLPIISDLFDKLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPF 180
            RNKYPLPII+DLFD+LHGAKYF+KLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPF
Sbjct: 701  RNKYPLPIITDLFDRLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPF 760

Query: 181  DLTNAPATFCTLMNQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYV 240
             LTNAPATFCTLMNQVF+EYLD+FV+VYLDDIVVYSTT+EEH+ HL+ VF KL++NQLYV
Sbjct: 761  GLTNAPATFCTLMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLKENQLYV 820

Query: 241  KKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFV 300
            K+EKC+FAQ  INFLGHV+ CG+I M+  KI AI++W +P SVSELRSFLGLANYYRRFV
Sbjct: 821  KREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFV 880

Query: 301  EGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASD 360
            EGFS+RA+PLTELLKKD  W+W  +CQ AF+ LK  +  GP+LG+ DVTKPFEVETDASD
Sbjct: 881  EGFSKRASPLTELLKKDVHWNWDPECQAAFDGLKQALMEGPLLGIADVTKPFEVETDASD 940

Query: 361  FAL----------------------------------------------------VKTDN 420
            +AL                                                    VKTDN
Sbjct: 941  YALGGVLLQNGHPIAYESRKLNAAERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDN 1000

Query: 421  SATCHFFDQPKLTAKQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIH 480
            SATCHFF QPKLT+KQARWQE LAEFDF+FEHK G SNQAADALSRK EHAA+C+LAH+ 
Sbjct: 1001 SATCHFFTQPKLTSKQARWQEFLAEFDFEFEHKKGSSNQAADALSRKQEHAAICLLAHLQ 1060

Query: 481  SSKIDGSIRDIIKEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELR 540
             S+I GS+RD ++E L KD +A+ V+ LAKAGKTRQFWVE DLL+TKGNRLYVPR G LR
Sbjct: 1061 GSEIGGSVRDTLREFLQKDHAAQNVMNLAKAGKTRQFWVEEDLLVTKGNRLYVPRAGGLR 1120

Query: 541  KKLIQKCHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVFG 600
            KKL+ +CHDTLWAGHPGWQRTYAL+KKGYFWPNMRDD+MQYTKTCLICQQDKVEK KV G
Sbjct: 1121 KKLLYECHDTLWAGHPGWQRTYALLKKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAG 1180

Query: 601  LLEPLPVPTRPWESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSAELTAQL 660
            LL+PLPVPTRPWESVS+DFITH PKVG+++AILVI+DRFSKYATFIP  K CSAE TAQL
Sbjct: 1181 LLDPLPVPTRPWESVSMDFITHLPKVGDFEAILVIIDRFSKYATFIPATKQCSAETTAQL 1240

Query: 661  FFKHVVKLSGIPSSIISDRDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIERFNCLL 720
            FFKHVVKL G+P+SI+SDRDGRFIG+FWT+LF+FLGT+LNISSSYHPQTD Q ERFN +L
Sbjct: 1241 FFKHVVKLWGVPTSIVSDRDGRFIGSFWTELFSFLGTSLNISSSYHPQTDGQTERFNSML 1300

Query: 721  EEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIIDYPYAGK 780
            EEYLRHFV+ARQKNW+QLLDVAQFCFN QTSSSTG+SPFEIVSGRQP LPH++D+P+AGK
Sbjct: 1301 EEYLRHFVNARQKNWVQLLDVAQFCFNAQTSSSTGRSPFEIVSGRQPVLPHLVDHPFAGK 1360

Query: 781  NPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPLQFRAGDQVLIKLRPEQIRFR 840
            NPQA NFT+EW+QT DIARAYLEKASK MKKWADKKRRPL+FRAGDQVLIKLRPEQ+RFR
Sbjct: 1361 NPQALNFTKEWRQTNDIARAYLEKASKRMKKWADKKRRPLEFRAGDQVLIKLRPEQVRFR 1420

Query: 841  SRKDQRLVRKYEGPVEVLKKIGATSYRVALPTWMKIHPVIHVSNLKPYHPDPDDDQRNAT 900
             RKDQRLVRKYEGPVEVLKK+G TSYRVALPTWMKI+PVIHVSNLKPYH D +D QRN  
Sbjct: 1421 GRKDQRLVRKYEGPVEVLKKVGNTSYRVALPTWMKIYPVIHVSNLKPYHQDTEDLQRNVV 1480

Query: 901  TRLNIDLQQKETKEVEEILVDRVRKIGRPVRTIREFLIKWKNLPTEETSWERAEDLNS 907
            TR  IDL QKE K+VEEIL +RVR+  RP R I E+L+KWKNLP EETSWER EDL +
Sbjct: 1481 TRPTIDLSQKEDKDVEEILAERVRRGRRPARRIHEYLVKWKNLPVEETSWERVEDLEA 1538

BLAST of CmoCh20G009950 vs. ExPASy TrEMBL
Match: A0A5D3C4R1 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold110G001990 PE=4 SV=1)

HSP 1 Score: 1487.2 bits (3849), Expect = 0.0e+00
Identity = 718/958 (74.95%), Positives = 809/958 (84.45%), Query Frame = 0

Query: 1    MVIPLIEEATTEETVLEEIKEVLDSYTDIMPESLQQTLPPRRGIDHEIELLPGVKQPAKN 60
            M IPL     + ETV +EI  VL+ Y D+MP+SL ++LPPRR IDHEIEL+PG K PAKN
Sbjct: 611  MAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSLPKSLPPRRMIDHEIELVPGAKPPAKN 670

Query: 61   AYRMASPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTV 120
            AYRMA PELAELRKQLDELL AGFIRP KAPYGAPVLFQ+KK+G+LRLCIDYRALNK+TV
Sbjct: 671  AYRMAPPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTV 730

Query: 121  RNKYPLPIISDLFDKLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPF 180
            RNKYPLPII+DLFD+LHGAKYF+KLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPF
Sbjct: 731  RNKYPLPIITDLFDRLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPF 790

Query: 181  DLTNAPATFCTLMNQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYV 240
             LTNAPATFCTLMNQVF+EYLD+FV+VYLDDIVVYSTT+EEH+ HL+ VF KL++NQLYV
Sbjct: 791  GLTNAPATFCTLMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLKENQLYV 850

Query: 241  KKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFV 300
            K+EKC+FAQ  INFLGHV+ CG+I M+  KI AI++W +P SVSELRSFLGLANYYRRFV
Sbjct: 851  KREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFV 910

Query: 301  EGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASD 360
            EGFS+RA+PLTELLKKD  W+W  +CQ AF+ LK  +  GP+LG+ DVTKPFEVETDASD
Sbjct: 911  EGFSKRASPLTELLKKDVHWNWDPECQTAFDGLKQALMEGPLLGIADVTKPFEVETDASD 970

Query: 361  FAL----------------------------------------------------VKTDN 420
            +AL                                                    VKTDN
Sbjct: 971  YALGGVLLQNGHPIAYESRKLNAAERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDN 1030

Query: 421  SATCHFFDQPKLTAKQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIH 480
            SATCHFF QPKLT+KQARWQE LAEFDF+FEHK G SNQAADALSRK EHAA+C+LAH+ 
Sbjct: 1031 SATCHFFTQPKLTSKQARWQEFLAEFDFEFEHKKGSSNQAADALSRKQEHAAICLLAHLQ 1090

Query: 481  SSKIDGSIRDIIKEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELR 540
             S+I GS+RD ++E L KD +A+ V+ LAKAGKTRQFWVE DLL+TKGNRLYVPR G LR
Sbjct: 1091 GSEIGGSVRDTLREFLQKDHAAQNVMNLAKAGKTRQFWVEEDLLVTKGNRLYVPRAGGLR 1150

Query: 541  KKLIQKCHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVFG 600
            KKL+ +CHDTLWAGHPGWQRTYAL+KKGYFWPNMRDD+MQYTKTCLICQQDKVEK KV G
Sbjct: 1151 KKLLYECHDTLWAGHPGWQRTYALLKKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAG 1210

Query: 601  LLEPLPVPTRPWESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSAELTAQL 660
            LL+PLPVPTRPWESVS+DFITH PKVG+++AILVI+DRFSKYATFIP  K CSAE TAQL
Sbjct: 1211 LLDPLPVPTRPWESVSMDFITHLPKVGDFEAILVIIDRFSKYATFIPATKQCSAETTAQL 1270

Query: 661  FFKHVVKLSGIPSSIISDRDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIERFNCLL 720
            FFKHVVKL G+P+SI+SDRDGRFIG+FWT+LF+FLGT+LNISSSYHPQTD Q ERFN +L
Sbjct: 1271 FFKHVVKLWGVPTSIVSDRDGRFIGSFWTELFSFLGTSLNISSSYHPQTDGQTERFNSML 1330

Query: 721  EEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIIDYPYAGK 780
            EEYLRHFV+ARQKNW+QLLDVAQFCFN QTSSSTG+SPFEIVSGRQP LPH++D+P+AGK
Sbjct: 1331 EEYLRHFVNARQKNWVQLLDVAQFCFNAQTSSSTGRSPFEIVSGRQPVLPHLVDHPFAGK 1390

Query: 781  NPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPLQFRAGDQVLIKLRPEQIRFR 840
            NPQA NFT+EW+QT DIARAYLEKASK MKKWADKKRRPL+FRAGDQVLIKLRPEQ+RFR
Sbjct: 1391 NPQALNFTKEWRQTNDIARAYLEKASKRMKKWADKKRRPLEFRAGDQVLIKLRPEQVRFR 1450

Query: 841  SRKDQRLVRKYEGPVEVLKKIGATSYRVALPTWMKIHPVIHVSNLKPYHPDPDDDQRNAT 900
             RKDQRLVRKYEGPVEVLKK+G TSYRVALPTWMKI+PVIHVSNLKPYH D +D QRN  
Sbjct: 1451 GRKDQRLVRKYEGPVEVLKKVGNTSYRVALPTWMKIYPVIHVSNLKPYHQDTEDLQRNVV 1510

Query: 901  TRLNIDLQQKETKEVEEILVDRVRKIGRPVRTIREFLIKWKNLPTEETSWERAEDLNS 907
            TR  IDL QKE K+VEEIL +RVR+  RP R I E+L+KWKNLP EETSWER EDL +
Sbjct: 1511 TRPTIDLSQKEDKDVEEILAERVRRGRRPARRIHEYLVKWKNLPVEETSWERVEDLEA 1568

BLAST of CmoCh20G009950 vs. ExPASy TrEMBL
Match: A0A5A7UXR6 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold22G005550 PE=4 SV=1)

HSP 1 Score: 1487.2 bits (3849), Expect = 0.0e+00
Identity = 718/958 (74.95%), Positives = 809/958 (84.45%), Query Frame = 0

Query: 1    MVIPLIEEATTEETVLEEIKEVLDSYTDIMPESLQQTLPPRRGIDHEIELLPGVKQPAKN 60
            M IPL     + ETV +EI  VL+ Y D+MP+SL ++LPPRR IDHEIEL+PG K PAKN
Sbjct: 611  MAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSLPKSLPPRRMIDHEIELVPGAKPPAKN 670

Query: 61   AYRMASPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTV 120
            AYRMA PELAELRKQLDELL AGFIRP KAPYGAPVLFQ+KK+G+LRLCIDYRALNK+TV
Sbjct: 671  AYRMAPPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTV 730

Query: 121  RNKYPLPIISDLFDKLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPF 180
            RNKYPLPII+DLFD+LHGAKYF+KLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPF
Sbjct: 731  RNKYPLPIITDLFDRLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPF 790

Query: 181  DLTNAPATFCTLMNQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYV 240
             LTNAPATFCTLMNQVF+EYLD+FV+VYLDDIVVYSTT+EEH+ HL+ VF KL++NQLYV
Sbjct: 791  GLTNAPATFCTLMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLKENQLYV 850

Query: 241  KKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFV 300
            K+EKC+FAQ  INFLGHV+ CG+I M+  KI AI++W +P SVSELRSFLGLANYYRRFV
Sbjct: 851  KREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFV 910

Query: 301  EGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASD 360
            EGFS+RA+PLTELLKKD  W+W  +CQ AF+ LK  +  GP+LG+ DVTKPFEVETDASD
Sbjct: 911  EGFSKRASPLTELLKKDVHWNWDPECQTAFDGLKQALMEGPLLGIADVTKPFEVETDASD 970

Query: 361  FAL----------------------------------------------------VKTDN 420
            +AL                                                    VKTDN
Sbjct: 971  YALGGVLLQNGHPIAYESRKLNAAERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDN 1030

Query: 421  SATCHFFDQPKLTAKQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIH 480
            SATCHFF QPKLT+KQARWQE LAEFDF+FEHK G SNQAADALSRK EHAA+C+LAH+ 
Sbjct: 1031 SATCHFFTQPKLTSKQARWQEFLAEFDFEFEHKKGSSNQAADALSRKQEHAAICLLAHLQ 1090

Query: 481  SSKIDGSIRDIIKEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELR 540
             S+I GS+RD ++E L KD +A+ V+ LAKAGKTRQFWVE DLL+TKGNRLYVPR G LR
Sbjct: 1091 GSEIGGSVRDTLREFLQKDHAAQNVMNLAKAGKTRQFWVEEDLLVTKGNRLYVPRAGGLR 1150

Query: 541  KKLIQKCHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVFG 600
            KKL+ +CHDTLWAGHPGWQRTYAL+KKGYFWPNMRDD+MQYTKTCLICQQDKVEK KV G
Sbjct: 1151 KKLLYECHDTLWAGHPGWQRTYALLKKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAG 1210

Query: 601  LLEPLPVPTRPWESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSAELTAQL 660
            LL+PLPVPTRPWESVS+DFITH PKVG+++AILVI+DRFSKYATFIP  K CSAE TAQL
Sbjct: 1211 LLDPLPVPTRPWESVSMDFITHLPKVGDFEAILVIIDRFSKYATFIPATKQCSAETTAQL 1270

Query: 661  FFKHVVKLSGIPSSIISDRDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIERFNCLL 720
            FFKHVVKL G+P+SI+SDRDGRFIG+FWT+LF+FLGT+LNISSSYHPQTD Q ERFN +L
Sbjct: 1271 FFKHVVKLWGVPTSIVSDRDGRFIGSFWTELFSFLGTSLNISSSYHPQTDGQTERFNSML 1330

Query: 721  EEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIIDYPYAGK 780
            EEYLRHFV+ARQKNW+QLLDVAQFCFN QTSSSTG+SPFEIVSGRQP LPH++D+P+AGK
Sbjct: 1331 EEYLRHFVNARQKNWVQLLDVAQFCFNAQTSSSTGRSPFEIVSGRQPVLPHLVDHPFAGK 1390

Query: 781  NPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPLQFRAGDQVLIKLRPEQIRFR 840
            NPQA NFT+EW+QT DIARAYLEKASK MKKWADKKRRPL+FRAGDQVLIKLRPEQ+RFR
Sbjct: 1391 NPQALNFTKEWRQTNDIARAYLEKASKRMKKWADKKRRPLEFRAGDQVLIKLRPEQVRFR 1450

Query: 841  SRKDQRLVRKYEGPVEVLKKIGATSYRVALPTWMKIHPVIHVSNLKPYHPDPDDDQRNAT 900
             RKDQRLVRKYEGPVEVLKK+G TSYRVALPTWMKI+PVIHVSNLKPYH D +D QRN  
Sbjct: 1451 GRKDQRLVRKYEGPVEVLKKVGNTSYRVALPTWMKIYPVIHVSNLKPYHQDTEDLQRNVV 1510

Query: 901  TRLNIDLQQKETKEVEEILVDRVRKIGRPVRTIREFLIKWKNLPTEETSWERAEDLNS 907
            TR  IDL QKE K+VEEIL +RVR+  RP R I E+L+KWKNLP EETSWER EDL +
Sbjct: 1511 TRPTIDLSQKEDKDVEEILAERVRRGRRPARRIHEYLVKWKNLPVEETSWERVEDLEA 1568

BLAST of CmoCh20G009950 vs. ExPASy TrEMBL
Match: A0A5D3B7E7 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold110G00360 PE=4 SV=1)

HSP 1 Score: 1487.2 bits (3849), Expect = 0.0e+00
Identity = 718/958 (74.95%), Positives = 809/958 (84.45%), Query Frame = 0

Query: 1    MVIPLIEEATTEETVLEEIKEVLDSYTDIMPESLQQTLPPRRGIDHEIELLPGVKQPAKN 60
            M IPL     + ETV +EI  VL+ Y D+MP+SL ++LPPRR IDHEIEL+PG K PAKN
Sbjct: 611  MAIPLKSSENSGETVPKEIMRVLEKYRDVMPDSLPKSLPPRRMIDHEIELVPGAKPPAKN 670

Query: 61   AYRMASPELAELRKQLDELLKAGFIRPTKAPYGAPVLFQKKKNGTLRLCIDYRALNKVTV 120
            AYRMA PELAELRKQLDELL AGFIRP KAPYGAPVLFQ+KK+G+LRLCIDYRALNK+TV
Sbjct: 671  AYRMAPPELAELRKQLDELLNAGFIRPAKAPYGAPVLFQRKKDGSLRLCIDYRALNKLTV 730

Query: 121  RNKYPLPIISDLFDKLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPF 180
            RNKYPLPII+DLFD+LHGAKYF+KLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPF
Sbjct: 731  RNKYPLPIITDLFDRLHGAKYFSKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPF 790

Query: 181  DLTNAPATFCTLMNQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYV 240
             LTNAPATFCTLMNQVF+EYLD+FV+VYLDDIVVYSTT+EEH+ HL+ VF KL++NQLYV
Sbjct: 791  GLTNAPATFCTLMNQVFHEYLDKFVVVYLDDIVVYSTTMEEHRDHLQKVFQKLKENQLYV 850

Query: 241  KKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFV 300
            K+EKC+FAQ  INFLGHV+ CG+I M+  KI AI++W +P SVSELRSFLGLANYYRRFV
Sbjct: 851  KREKCSFAQERINFLGHVIECGRIGMEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFV 910

Query: 301  EGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASD 360
            EGFS+RA+PLTELLKKD  W+W  +CQ AF+ LK  +  GP+LG+ DVTKPFEVETDASD
Sbjct: 911  EGFSKRASPLTELLKKDVHWNWDPECQTAFDGLKQALMEGPLLGIADVTKPFEVETDASD 970

Query: 361  FAL----------------------------------------------------VKTDN 420
            +AL                                                    VKTDN
Sbjct: 971  YALGGVLLQNGHPIAYESRKLNAAERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDN 1030

Query: 421  SATCHFFDQPKLTAKQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIH 480
            SATCHFF QPKLT+KQARWQE LAEFDF+FEHK G SNQAADALSRK EHAA+C+LAH+ 
Sbjct: 1031 SATCHFFTQPKLTSKQARWQEFLAEFDFEFEHKKGSSNQAADALSRKQEHAAICLLAHLQ 1090

Query: 481  SSKIDGSIRDIIKEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELR 540
             S+I GS+RD ++E L KD +A+ V+ LAKAGKTRQFWVE DLL+TKGNRLYVPR G LR
Sbjct: 1091 GSEIGGSVRDTLREFLQKDHAAQNVMNLAKAGKTRQFWVEEDLLVTKGNRLYVPRAGGLR 1150

Query: 541  KKLIQKCHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVFG 600
            KKL+ +CHDTLWAGHPGWQRTYAL+KKGYFWPNMRDD+MQYTKTCLICQQDKVEK KV G
Sbjct: 1151 KKLLYECHDTLWAGHPGWQRTYALLKKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAG 1210

Query: 601  LLEPLPVPTRPWESVSLDFITHRPKVGEYDAILVIVDRFSKYATFIPTPKLCSAELTAQL 660
            LL+PLPVPTRPWESVS+DFITH PKVG+++AILVI+DRFSKYATFIP  K CSAE TAQL
Sbjct: 1211 LLDPLPVPTRPWESVSMDFITHLPKVGDFEAILVIIDRFSKYATFIPATKQCSAETTAQL 1270

Query: 661  FFKHVVKLSGIPSSIISDRDGRFIGTFWTDLFAFLGTTLNISSSYHPQTDAQIERFNCLL 720
            FFKHVVKL G+P+SI+SDRDGRFIG+FWT+LF+FLGT+LNISSSYHPQTD Q ERFN +L
Sbjct: 1271 FFKHVVKLWGVPTSIVSDRDGRFIGSFWTELFSFLGTSLNISSSYHPQTDGQTERFNSML 1330

Query: 721  EEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIIDYPYAGK 780
            EEYLRHFV+ARQKNW+QLLDVAQFCFN QTSSSTG+SPFEIVSGRQP LPH++D+P+AGK
Sbjct: 1331 EEYLRHFVNARQKNWVQLLDVAQFCFNAQTSSSTGRSPFEIVSGRQPVLPHLVDHPFAGK 1390

Query: 781  NPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPLQFRAGDQVLIKLRPEQIRFR 840
            NPQA NFT+EW+QT DIARAYLEKASK MKKWADKKRRPL+FRAGDQVLIKLRPEQ+RFR
Sbjct: 1391 NPQALNFTKEWRQTNDIARAYLEKASKRMKKWADKKRRPLEFRAGDQVLIKLRPEQVRFR 1450

Query: 841  SRKDQRLVRKYEGPVEVLKKIGATSYRVALPTWMKIHPVIHVSNLKPYHPDPDDDQRNAT 900
             RKDQRLVRKYEGPVEVLKK+G TSYRVALPTWMKI+PVIHVSNLKPYH D +D QRN  
Sbjct: 1451 GRKDQRLVRKYEGPVEVLKKVGNTSYRVALPTWMKIYPVIHVSNLKPYHQDTEDLQRNVV 1510

Query: 901  TRLNIDLQQKETKEVEEILVDRVRKIGRPVRTIREFLIKWKNLPTEETSWERAEDLNS 907
            TR  IDL QKE K+VEEIL +RVR+  RP R I E+L+KWKNLP EETSWER EDL +
Sbjct: 1511 TRPTIDLSQKEDKDVEEILAERVRRGRRPARRIHEYLVKWKNLPVEETSWERVEDLEA 1568

BLAST of CmoCh20G009950 vs. TAIR 10
Match: ATMG00860.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 108.2 bits (269), Expect = 3.6e-23
Identity = 60/158 (37.97%), Positives = 87/158 (55.06%), Query Frame = 0

Query: 225 HLKLVFDKLRQNQLYVKKEKCAFAQTCINFLG--HVVRCGQISMDSDKIKAIQEWKVPTS 284
           HL +V     Q+Q Y  ++KCAF Q  I +LG  H++    +S D  K++A+  W  P +
Sbjct: 3   HLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKN 62

Query: 285 VSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPV 344
            +ELR FLGL  YYRRFV+ + +   PLTELLKK +   W+    +AF+ LK  +T  PV
Sbjct: 63  TTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKK-NSLKWTEMAALAFKALKGAVTTLPV 122

Query: 345 LGLVDVTKPFEVETDASDFALVKTDNSATCHFFDQPKL 381
           L L D+  PF       +++   T   A C    QP++
Sbjct: 123 LALPDLKLPFVTRVGKWNWSCFITREQACC--VSQPRV 157
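
The percentages in each HSP header follow directly from the fraction printed beside them (matching or positive columns divided by alignment length). A minimal sketch of that arithmetic is given here; the helper name `parse_hsp_counts` and the regular expression are illustrative assumptions, not part of the database's own tooling.

```python
import re

# Hypothetical pattern: captures "matches/length" for the Identity and
# Positives fields. "Posi?tives" also tolerates the "Postives" spelling
# that appears in some exports of these pages.
HSP_FIELD = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")

def parse_hsp_counts(line: str) -> dict:
    """Return {field: (matches, aligned_length, percent)} for one HSP summary line."""
    result = {}
    for name, matches, length in HSP_FIELD.findall(line):
        m, n = int(matches), int(length)
        # Percent is matches over alignment length, rounded to two decimals.
        result[name] = (m, n, round(100.0 * m / n, 2))
    return result

# Summary line of the TAIR 10 hit shown above.
header = "Identity = 60/158 (37.97%), Positives = 87/158 (55.06%), Query Frame = 0"
counts = parse_hsp_counts(header)
print(counts)
```

The denominator is the alignment length (gap columns included), not the length of the query protein, which is why the TrEMBL hits report 958 columns.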

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
P0CT41 | 1.2e-119 | 30.46 | Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24...
P0CT34 | 1.2e-119 | 30.46 | Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
P0CT35 | 1.2e-119 | 30.46 | Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
P0CT36 | 1.2e-119 | 30.46 | Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
P0CT37 | 1.2e-119 | 30.46 | Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...

Match Name | E-value | Identity (%) | Description
A0A5A7SUK4 | 0.0e+00 | 75.05 | Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold81...
A0A5D3C9P8 | 0.0e+00 | 75.05 | Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold26...
A0A5D3C4R1 | 0.0e+00 | 74.95 | Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold11...
A0A5A7UXR6 | 0.0e+00 | 74.95 | Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold22...
A0A5D3B7E7 | 0.0e+00 | 74.95 | Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold11...

Match Name | E-value | Identity (%) | Description
ATMG00860.1 | 3.6e-23 | 37.97 | DNA/RNA polymerases superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 | - | coord: 268..360; e-value: 1.4E-29; score: 103.9
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 | - | coord: 124..259; e-value: 3.5E-86; score: 289.9
IPR041577 | Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain | PFAM | PF17919 | RT_RNaseH_2 | coord: 322..363; e-value: 4.4E-9; score: 36.3
None | No IPR available | GENE3D | 3.10.10.10 | HIV Type 1 Reverse Transcriptase, subunit A, domain 1 | coord: 44..184; e-value: 3.5E-86; score: 289.9
None | No IPR available | GENE3D | 1.10.340.70 | - | coord: 452..540; e-value: 1.9E-20; score: 74.9
None | No IPR available | GENE3D | 2.40.50.40 | - | coord: 856..913; e-value: 1.4E-9; score: 39.5
None | No IPR available | PANTHER | PTHR47266 | FAMILY NOT NAMED | coord: 76..905
None | No IPR available | CDD | cd01647 | RT_LTR | coord: 83..259; e-value: 6.27543E-86; score: 271.006
None | No IPR available | CDD | cd00024 | CD_CSD | coord: 862..907; e-value: 1.18926E-7; score: 47.0865
IPR023780 | Chromo domain | PFAM | PF00385 | Chromo | coord: 862..909; e-value: 5.3E-9; score: 35.9
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | - | coord: 551..749; e-value: 6.1E-43; score: 148.4
IPR041588 | Integrase zinc-binding domain | PFAM | PF17921 | Integrase_H2C2 | coord: 486..540; e-value: 8.4E-21; score: 73.8
IPR000477 | Reverse transcriptase domain | PFAM | PF00078 | RVT_1 | coord: 100..258; e-value: 3.5E-26; score: 92.1
IPR000477 | Reverse transcriptase domain | PROSITE | PS50878 | RT_POL | coord: 80..259; score: 12.626934
IPR000953 | Chromo/chromo shadow domain | PROSITE | PS50013 | CHROMO_2 | coord: 861..933; score: 11.254301
IPR001584 | Integrase, catalytic core | PROSITE | PS50994 | INTEGRASE | coord: 555..714; score: 21.807405
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 22..400
IPR016197 | Chromo-like domain superfamily | SUPERFAMILY | 54160 | Chromo domain-like | coord: 822..906
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 552..708
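
The `coord` ranges in the InterPro table can be sorted to read off the order of domains along the protein. The following is a minimal sketch using only the four Pfam hits listed above; the tuple layout and the helper names `domain_order` and `overlaps` are illustrative assumptions, not an InterPro API.

```python
# Pfam hits copied from the InterPro table above:
# (source term, source description, start, end), coordinates in residues.
pfam_hits = [
    ("PF17919", "RT_RNaseH_2", 322, 363),
    ("PF00385", "Chromo", 862, 909),
    ("PF17921", "Integrase_H2C2", 486, 540),
    ("PF00078", "RVT_1", 100, 258),
]

def domain_order(hits):
    """Return domain names sorted by start coordinate (N- to C-terminus)."""
    return [name for _, name, _, _ in sorted(hits, key=lambda h: h[2])]

def overlaps(a, b):
    """True if two closed (start, end) ranges share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]

architecture = domain_order(pfam_hits)
print(" -> ".join(architecture))
```

The same `overlaps` check can be used to see which hits from different sources describe the same region, for example the Pfam Chromo hit (862..909) and the PROSITE CHROMO_2 hit (861..933).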

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmoCh20G009950.1 | CmoCh20G009950.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0006508 proteolysis
biological_process GO:0006278 RNA-dependent DNA biosynthetic process
biological_process GO:0006413 translational initiation
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003964 RNA-directed DNA polymerase activity
molecular_function GO:0003743 translation initiation factor activity
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003676 nucleic acid binding
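
For downstream use, the GO rows above can be grouped by category. This is a minimal sketch that assumes each row keeps the "category, accession, term name" layout with single spaces between the first two fields; the function name `group_go_terms` is hypothetical, and only three of the rows are reproduced as sample input.

```python
from collections import defaultdict

# Three rows copied from the GO table above, used as sample input.
go_lines = [
    "biological_process GO:0015074 DNA integration",
    "molecular_function GO:0003964 RNA-directed DNA polymerase activity",
    "molecular_function GO:0008270 zinc ion binding",
]

def group_go_terms(lines):
    """Group (accession, term name) pairs under their GO category."""
    grouped = defaultdict(list)
    for line in lines:
        # Split only twice so that spaces inside the term name are preserved.
        category, accession, name = line.split(" ", 2)
        grouped[category].append((accession, name))
    return dict(grouped)

grouped = group_go_terms(go_lines)
for category, terms in grouped.items():
    print(category, [accession for accession, _ in terms])
```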