CSPI03G21020 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI03G21020
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Ty3/gypsy retrotransposon protein
Location: Chr3: 17108049 .. 17110999 (-)
RNA-Seq Expression: CSPI03G21020
Synteny: CSPI03G21020
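The location field gives the chromosome, start, end, and strand of the gene. A minimal sketch of how the genomic span could be derived from it follows. The helper names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a location string such as 'Chr3: 17108049 .. 17110999 (-)'
    into (chromosome, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr3: 17108049 .. 17110999 (-)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive assumption the gene spans 2951 bp on the minus strand of Chr3; with half-open coordinates the figure would be one less.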
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTCAACGACTGCTCGAGCAATATGCAGACATCTTTAGGCTGCCCACAGATTTACCTCCAAAGAGAACCATAGACCATCGTATTTTGACCTTGCCCGATCAGAAACCGATTAATGTACGACCATATAAGTATGGGCATGTACAAAAGGAGGAGATTGAAAAATTGGTATTAGAAATGCTACAAGCAGGGGTTATTCGTCCAAGCCACAACCTATATTCGAGCCCAATCCTCTTAGTGAAGAAAACAGATAGGGGTGGAGGTTCTGTGTAGACTACCGTAAACTCAATCAAGTAACTACGGGTGACAAATTCCCAATCCCAGTGATTGAAGAACTCTTAGATGAACTTCATGGGGCGACAATTTTCTCAAAGTTAGATCTTAAATCTGGATACCACCAGATTAGGATGAGGGAGGAAGATGTGGAGAAAACAGCTTTTCGCACGCATGAAGGACATTATGAGTTCTTGGTGATGCCTTTCGGTCTTACGAATGCTCCTGCCACCTTCCAATCTCTCATGAACGAGGTGTTCAAACCATTCCTTAAAGGTGTGTCCTGGTTTTTTTTTTTTATGACATTTTAGTTTATAGTATGGACATAGATGAACATATGAAACATTTAGGGATGATATTCGCTATTTTGAGGGACCATAAGTTGTTTGCAAATAGAACTAAATGTGTAATTGCTCATTCCCAAGTTCAATATTTGGGACATCTGATTTCCAATAGAGGAGTGGAGGCTGATGAGGAAAAGATTCGTAGTATGGTCAATTGGCCACGACCGAAAGATATAACGGGGCTGAGAGGATTCCTTGGACTGACTGGGTATTATAGAAGATTTGTGAAAAGCTATGGAGAGATAGCTGCACCCATAACTAAACTACTTCAGAAAAATGCATTCCAGTGGAATGAGGAAGCCACGATAGCTTTTGATCAACTGAAGCTAGCAATGACAACCATACCCGTGTTAGCATTGCTGGATTGGTCTCAGCCCTTGACAATTGAAACTGATGCTTCAGGAGTAGGTTTAGGCGCAGTTTTATCACAAGATGGTCATCCTATCGCATTCTTCAGCCAGAAACTGTCCCCAAGGGCCCAGGGTAAGTCGATCTATGAAAGGGAATTGATGGCGGTTGTCCTTTCGGTGCAAAAATGGAGGCATTACCTCCTGGGCAAAAGGTTCACAATTATTTCAGATCAGAAGGCTCTAAAATTTTTGTTGGAACTGAGGGAAGTTCAACCTCAATTCCAAAAGTGGCTCACAAAACTTTTAGGGTATGACTTTGAGATTTTGTATCAACCGGGTCTACAGAATAAAGTGGCAGATGCTCTCTCAAGGAAGGACCAATCAGCTGAGTTAAACACAATGACAACCACAGGCATAGTTGATATAGAGCTGATAGAGAAGGAAGTCGAGAAGGATCAAGAACTTCATAAAATTATTGCCGAACTTAAGGGAGGGGTGGATCAAGGTGGAAAATACCAGTGGAACAATGGCAGGTTGTTATATAAAGGAAGGATGGTGCTGTCGCGTAATTCTTCCCTCATTCCGAGACTTCTACACACGTTCCATGATTCCATATTAGGAGGGCACTCGAGGTTCTTGAGAACCTACAAAAGGATGAGTGGGGAATTATTTTGGAAGGGGATGAGGGCTGATGTCAAGAGATATGTGGAAGAATGTGACATATGTCAATGCAACAAATTCGAAGCTACCAAGCCTGTTGGAGTTCTTCAACCCATTCCTATCCCCGACAAGATATTGGAAGATTGGACCATGGACTTCATCGAAGGTCTGCCAATAGCAGGAGGATACAACATGATTATGGTAGTCGTCGATCGCCTAAGTAAGTACTCCTATTTCTTGCCTCTTAAACACCCGTACACAGCCAAGCAAGTGGCTTCCATTTTTTTTGGAAAAAGTGGTCAGCAAACGTGGAATACCCAAGTCCATTATTGCTGATCGCAATAAGATCTTCCTTAGCAATTTCTGGAAAGAG
TTGTTCACCACCATGGGCACAATTTTGAAGAGAAGCACGACATTCCATCCCTAAACCGATGAACAAACCGAGAGAGTAAACAGATGCCTAGAGACTTATTTGAGATGCTTTTGCAATGAGCAACCGAAGAAATAGGATAAACTAATTCCTTGGGGGGAATTATGGTATAACACAACCTTCCATGCGTCCACCAAAACTACTCCTTACCAATCGATCTTTGGAAGAATTCCCCCGCCATTGTTGTTGTACGGATGGAAGAAATCTCCTAACAATGACGTAGAAGTCATGTTGAAGGAGAGAGACCTAGCAATCAATGCATTGAAAGAAAACCTATGCATAGCTCAAAACATAATGAAGAAAATGGCAGACCGGAATCGCAGGGAGCTTAGATTCAAAATCGGTGACGAAGTGTATTTGAAACTACGACCCTATCGACAAAGATCACTAGCTAGAAAGAAATGTGAAAAGCTATCCTCGAAATTTTATGGGCTGTATGAGATTATTGAGGAGATAGGAGAGGTCGCTTATCGGTTGAAACTACCGCCGGAGGCTGCCATTCACAATGTCTTCCACGTTTCTCAATTGAAACTGAAGTTGGGAAAACAACATGTAGTCCAACAACAATAACCCATCCTAACTGAGGATTTTGAATTACAATTGTGGCCGAAAAACGTGCTGGGAATACGGTGGAACAAGGAGTTAGGAGGCAATGAGTGGTTGGTCAAGTGGAAGAACTTACCAGACAGTGAAGCAACGTGGGAATCAGTCTACCTATTGAACCAGGAGTTTCCTCACTTTCACCTTGAGGACAAGGTGAACTTAGAACCCCGGGGTATTGTAAGACCCTCAATCATTTACACATATCAAATGAGGGACAAAAAAGTAAATGGACAAATAATTAATGATGAAGGAAAAAGAGGAGGAGATCGTGCGTGTGGGGCCCAGGCATAG

mRNA sequence

ATGGTTCAACGACTGCTCGAGCAATATGCAGACATCTTTAGGCTGCCCACAGATTTACCTCCAAAGAGAACCATAGACCATCGTATTTTGACCTTGCCCGATCAGAAACCGATTAATGTACGACCATATAAGTATGGGCATGTACAAAAGGAGGAGATTGAAAAATTGGTATTAGAAATGCTACAAGCAGGGGTTATTCGTCCAAGCCACAACCTATATTCGAGCCCAATCCTCTTAGTGAAGAAAACAGATAGGGGTGGAGTGATTGAAGAACTCTTAGATGAACTTCATGGGGCGACAATTTTCTCAAAGTTAGATCTTAAATCTGGATACCACCAGATTAGGATGAGGGAGGAAGATGTGGAGAAAACAGCTTTTCGCACGCATGAAGGACATTATGAGTTCTTGGTGATGCCTTTCGGTCTTACGAATGCTCCTGCCACCTTCCAATCTCTCATGAACGAGGTGTTCAAACCATTCCTTAAAGGGATGATATTCGCTATTTTGAGGGACCATAAGTTGTTTGCAAATAGAACTAAATGTGTAATTGCTCATTCCCAAGTTCAATATTTGGGACATCTGATTTCCAATAGAGGAGTGGAGGCTGATGAGGAAAAGATTCGTAGTATGGTCAATTGGCCACGACCGAAAGATATAACGGGGCTGAGAGGATTCCTTGGACTGACTGGGTATTATAGAAGATTTGTGAAAAGCTATGGAGAGATAGCTGCACCCATAACTAAACTACTTCAGAAAAATGCATTCCAGTGGAATGAGGAAGCCACGATAGCTTTTGATCAACTGAAGCTAGCAATGACAACCATACCCGTGTTAGCATTGCTGGATTGGTCTCAGCCCTTGACAATTGAAACTGATGCTTCAGGAGTAGGTTTAGGCGCAGTTTTATCACAAGATGGTCATCCTATCGCATTCTTCAGCCAGAAACTGTCCCCAAGGGCCCAGGGTAAGTCGATCTATGAAAGGGAATTGATGGCGGTTGTCCTTTCGGTGCAAAAATGGAGGCATTACCTCCTGGGCAAAAGGTTCACAATTATTTCAGATCAGAAGGCTCTAAAATTTTTGTTGGAACTGAGGGAAGTTCAACCTCAATTCCAAAAGTGGCTCACAAAACTTTTAGGGTATGACTTTGAGATTTTGTATCAACCGGGTCTACAGAATAAAGTGGCAGATGCTCTCTCAAGGAAGGACCAATCAGCTGAGTTAAACACAATGACAACCACAGGCATAGTTGATATAGAGCTGATAGAGAAGGAAGTCGAGAAGGATCAAGAACTTCATAAAATTATTGCCGAACTTAAGGGAGGGGTGGATCAAGGTGGAAAATACCAGTGGAACAATGGCAGGTTGTTATATAAAGGAAGGATGGTGCTGTCGCGTAATTCTTCCCTCATTCCGAGACTTCTACACACGTTCCATGATTCCATATTAGGAGGGCACTCGAGGTTCTTGAGAACCTACAAAAGGATGAGTGGGGAATTATTTTGGAAGGGGATGAGGGCTGATGTCAAGAGATATGTGGAAGAATGTGACATATGTCAATGCAACAAATTCGAAGCTACCAAGCCTGTTGGAGTTCTTCAACCCATTCCTATCCCCGACAAGATATTGGAAGATTGGACCATGGACTTCATCGAAGGTCTGCCAATAGCAGGAGGATACAACATGATTATGGTAGTCGTCGATCGCCTAAGTAAAATTCCCCCGCCATTGTTGTTGTACGGATGGAAGAAATCTCCTAACAATGACGTAGAAGTCATGTTGAAGGAGAGAGACCTAGCAATCAATGCATTGAAAGAAAACCTATGCATAGCTCAAAACATAATGAAGAAAATGGCAGACCGGAATCGCAGGGAGCTTAGATTCAAAATCGGTGACGAAGTGTATTTGAAACTACGACCCTATCGACAAAGATCACTAGCTAGAAAGAAATGTGAAAAGCTATCCTCGAAATTTTATGGGCTGTATGAGATTATTGAGGA
GATAGGAGAGGTCGCTTATCGGTTGAAACTACCGCCGGAGGCTGCCATTCACAATGTCTTCCACGTTTCTCAATTGAAACTGAAGTTGGGAAAACAACATGAGTTAGGAGGCAATGAGTGGTTGGTCAAGTGGAAGAACTTACCAGACAGTGAAGCAACGTGGGAATCAGTCTACCTATTGAACCAGGAGTTTCCTCACTTTCACCTTGAGGACAAGGTGAACTTAGAACCCCGGGGTATTGTAAGACCCTCAATCATTTACACATATCAAATGAGGGACAAAAAAGTAAATGGACAAATAATTAATGATGAAGGAAAAAGAGGAGGAGATCGTGCGTGTGGGGCCCAGGCATAG

Coding sequence (CDS)

ATGGTTCAACGACTGCTCGAGCAATATGCAGACATCTTTAGGCTGCCCACAGATTTACCTCCAAAGAGAACCATAGACCATCGTATTTTGACCTTGCCCGATCAGAAACCGATTAATGTACGACCATATAAGTATGGGCATGTACAAAAGGAGGAGATTGAAAAATTGGTATTAGAAATGCTACAAGCAGGGGTTATTCGTCCAAGCCACAACCTATATTCGAGCCCAATCCTCTTAGTGAAGAAAACAGATAGGGGTGGAGTGATTGAAGAACTCTTAGATGAACTTCATGGGGCGACAATTTTCTCAAAGTTAGATCTTAAATCTGGATACCACCAGATTAGGATGAGGGAGGAAGATGTGGAGAAAACAGCTTTTCGCACGCATGAAGGACATTATGAGTTCTTGGTGATGCCTTTCGGTCTTACGAATGCTCCTGCCACCTTCCAATCTCTCATGAACGAGGTGTTCAAACCATTCCTTAAAGGGATGATATTCGCTATTTTGAGGGACCATAAGTTGTTTGCAAATAGAACTAAATGTGTAATTGCTCATTCCCAAGTTCAATATTTGGGACATCTGATTTCCAATAGAGGAGTGGAGGCTGATGAGGAAAAGATTCGTAGTATGGTCAATTGGCCACGACCGAAAGATATAACGGGGCTGAGAGGATTCCTTGGACTGACTGGGTATTATAGAAGATTTGTGAAAAGCTATGGAGAGATAGCTGCACCCATAACTAAACTACTTCAGAAAAATGCATTCCAGTGGAATGAGGAAGCCACGATAGCTTTTGATCAACTGAAGCTAGCAATGACAACCATACCCGTGTTAGCATTGCTGGATTGGTCTCAGCCCTTGACAATTGAAACTGATGCTTCAGGAGTAGGTTTAGGCGCAGTTTTATCACAAGATGGTCATCCTATCGCATTCTTCAGCCAGAAACTGTCCCCAAGGGCCCAGGGTAAGTCGATCTATGAAAGGGAATTGATGGCGGTTGTCCTTTCGGTGCAAAAATGGAGGCATTACCTCCTGGGCAAAAGGTTCACAATTATTTCAGATCAGAAGGCTCTAAAATTTTTGTTGGAACTGAGGGAAGTTCAACCTCAATTCCAAAAGTGGCTCACAAAACTTTTAGGGTATGACTTTGAGATTTTGTATCAACCGGGTCTACAGAATAAAGTGGCAGATGCTCTCTCAAGGAAGGACCAATCAGCTGAGTTAAACACAATGACAACCACAGGCATAGTTGATATAGAGCTGATAGAGAAGGAAGTCGAGAAGGATCAAGAACTTCATAAAATTATTGCCGAACTTAAGGGAGGGGTGGATCAAGGTGGAAAATACCAGTGGAACAATGGCAGGTTGTTATATAAAGGAAGGATGGTGCTGTCGCGTAATTCTTCCCTCATTCCGAGACTTCTACACACGTTCCATGATTCCATATTAGGAGGGCACTCGAGGTTCTTGAGAACCTACAAAAGGATGAGTGGGGAATTATTTTGGAAGGGGATGAGGGCTGATGTCAAGAGATATGTGGAAGAATGTGACATATGTCAATGCAACAAATTCGAAGCTACCAAGCCTGTTGGAGTTCTTCAACCCATTCCTATCCCCGACAAGATATTGGAAGATTGGACCATGGACTTCATCGAAGGTCTGCCAATAGCAGGAGGATACAACATGATTATGGTAGTCGTCGATCGCCTAAGTAAAATTCCCCCGCCATTGTTGTTGTACGGATGGAAGAAATCTCCTAACAATGACGTAGAAGTCATGTTGAAGGAGAGAGACCTAGCAATCAATGCATTGAAAGAAAACCTATGCATAGCTCAAAACATAATGAAGAAAATGGCAGACCGGAATCGCAGGGAGCTTAGATTCAAAATCGGTGACGAAGTGTATTTGAAACTACGACCCTATCGACAAAGATCACTAGCTAGAAAGAAATGTGAAAAGCTATCCTCGAAATTTTATGGGCTGTATGAGATTATTGAGGA
GATAGGAGAGGTCGCTTATCGGTTGAAACTACCGCCGGAGGCTGCCATTCACAATGTCTTCCACGTTTCTCAATTGAAACTGAAGTTGGGAAAACAACATGAGTTAGGAGGCAATGAGTGGTTGGTCAAGTGGAAGAACTTACCAGACAGTGAAGCAACGTGGGAATCAGTCTACCTATTGAACCAGGAGTTTCCTCACTTTCACCTTGAGGACAAGGTGAACTTAGAACCCCGGGGTATTGTAAGACCCTCAATCATTTACACATATCAAATGAGGGACAAAAAAGTAAATGGACAAATAATTAATGATGAAGGAAAAAGAGGAGGAGATCGTGCGTGTGGGGCCCAGGCATAG

Protein sequence

MVQRLLEQYADIFRLPTDLPPKRTIDHRILTLPDQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSHNLYSSPILLVKKTDRGGVIEELLDELHGATIFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLKGMIFAILRDHKLFANRTKCVIAHSQVQYLGHLISNRGVEADEEKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPITKLLQKNAFQWNEEATIAFDQLKLAMTTIPVLALLDWSQPLTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQKWRHYLLGKRFTIISDQKALKFLLELREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDQSAELNTMTTTGIVDIELIEKEVEKDQELHKIIAELKGGVDQGGKYQWNNGRLLYKGRMVLSRNSSLIPRLLHTFHDSILGGHSRFLRTYKRMSGELFWKGMRADVKRYVEECDICQCNKFEATKPVGVLQPIPIPDKILEDWTMDFIEGLPIAGGYNMIMVVVDRLSKIPPPLLLYGWKKSPNNDVEVMLKERDLAINALKENLCIAQNIMKKMADRNRRELRFKIGDEVYLKLRPYRQRSLARKKCEKLSSKFYGLYEIIEEIGEVAYRLKLPPEAAIHNVFHVSQLKLKLGKQHELGGNEWLVKWKNLPDSEATWESVYLLNQEFPHFHLEDKVNLEPRGIVRPSIIYTYQMRDKKVNGQIINDEGKRGGDRACGAQA*
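The protein sequence is the conceptual translation of the CDS, with `*` marking the stop codon. As an illustrative sketch only, the first codons of the CDS can be translated with the standard genetic code to check that they reproduce the start of the protein. The function name `translate` is hypothetical, and the sketch assumes the standard code (NCBI translation table 1) applies to this nuclear gene.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon.
    Trailing bases that do not fill a codon are ignored;
    codons with unknown bases are rendered as 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First 21 nucleotides of the CDS listed in this record.
print(translate("ATGGTTCAACGACTGCTCGAG"))  # MVQRLLE
```

The output matches the first seven residues of the protein sequence above. The same function can be applied to the full CDS once its two wrapped lines are joined.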
Homology
BLAST of CSPI03G21020 vs. ExPASy Swiss-Prot
Match: P0CT41 (Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-12 PE=3 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 1.5e-75
Identity = 187/605 (30.91%), Positives = 302/605 (49.92%), Query Frame = 0

Query: 37   PINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSHNLYSSPILLVKKTDRG---------- 96
            PI   P   G +Q    E  + + L++G+IR S  + + P++ V K +            
Sbjct: 414  PIRNYPLPPGKMQAMNDE--INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPL 473

Query: 97   -----------GVIEELLDELHGATIFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEF 156
                        +IE+LL ++ G+TIF+KLDLKS YH IR+R+ D  K AFR   G +E+
Sbjct: 474  NKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEY 533

Query: 157  LVMPFGLTNAPATFQSLMNEVFKPFLKGMIFAI------------------------LRD 216
            LVMP+G++ APA FQ  +N +     +  +                           L++
Sbjct: 534  LVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKN 593

Query: 217  HKLFANRTKCVIAHSQVQYLGHLISNRGVEADEEKIRSMVNWPRPKDITGLRGFLGLTGY 276
              L  N+ KC    SQV+++G+ IS +G    +E I  ++ W +PK+   LR FLG   Y
Sbjct: 594  ANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNY 653

Query: 277  YRRFVKSYGEIAAPITKLLQKNA-FQWNEEATIAFDQLKLAMTTIPVLALLDWSQPLTIE 336
             R+F+    ++  P+  LL+K+  ++W    T A + +K  + + PVL   D+S+ + +E
Sbjct: 654  LRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLE 713

Query: 337  TDASGVGLGAVLSQDG-----HPIAFFSQKLSPRAQGKSIYERELMAVVLSVQKWRHYLL 396
            TDAS V +GAVLSQ       +P+ ++S K+S      S+ ++E++A++ S++ WRHYL 
Sbjct: 714  TDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLE 773

Query: 397  G--KRFTIISDQKAL--KFLLELREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSR 456
               + F I++D + L  +   E      +  +W   L  ++FEI Y+PG  N +ADALSR
Sbjct: 774  STIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSR 833

Query: 457  ---------KD-QSAELNTMTTTGIVD--IELIEKEVEKDQELHKIIAELKGGVDQGGKY 516
                     KD +   +N +    I D     +  E   D +L  ++      V++    
Sbjct: 834  IVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEE--NI 893

Query: 517  QWNNGRLL-YKGRMVLSRNSSLIPRLLHTFHDSILGGHSRFLRTYKRMSGELFWKGMRAD 574
            Q  +G L+  K +++L  ++ L   ++  +H+     H         +     WKG+R  
Sbjct: 894  QLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQ 953
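
The percentages in each HSP header are ratios over the alignment length: identical positions for identity, and identical plus conservatively substituted positions for positives. A small sketch of that arithmetic, using the counts reported in this record (the function name `hsp_percent` is hypothetical):

```python
def hsp_percent(count, alignment_length):
    """Return count / alignment_length as a percentage string
    with two decimal places, as shown in the HSP headers."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * count / alignment_length:.2f}"

# Swiss-Prot hit (Tf2 polyproteins): 605 aligned columns.
print(hsp_percent(187, 605))  # 30.91
print(hsp_percent(302, 605))  # 49.92

# Top TrEMBL hit (A0A5D3BBH7): 975 aligned columns.
print(hsp_percent(604, 975))  # 61.95
print(hsp_percent(683, 975))  # 70.05
```

The alignment length counts gap columns as well, which is why it can exceed the number of query residues covered by the HSP.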

BLAST of CSPI03G21020 vs. ExPASy Swiss-Prot
Match: P0CT34 (Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-1 PE=3 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 1.5e-75
Identity = 187/605 (30.91%), Positives = 302/605 (49.92%), Query Frame = 0

Query: 37   PINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSHNLYSSPILLVKKTDRG---------- 96
            PI   P   G +Q    E  + + L++G+IR S  + + P++ V K +            
Sbjct: 414  PIRNYPLPPGKMQAMNDE--INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPL 473

Query: 97   -----------GVIEELLDELHGATIFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEF 156
                        +IE+LL ++ G+TIF+KLDLKS YH IR+R+ D  K AFR   G +E+
Sbjct: 474  NKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEY 533

Query: 157  LVMPFGLTNAPATFQSLMNEVFKPFLKGMIFAI------------------------LRD 216
            LVMP+G++ APA FQ  +N +     +  +                           L++
Sbjct: 534  LVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKN 593

Query: 217  HKLFANRTKCVIAHSQVQYLGHLISNRGVEADEEKIRSMVNWPRPKDITGLRGFLGLTGY 276
              L  N+ KC    SQV+++G+ IS +G    +E I  ++ W +PK+   LR FLG   Y
Sbjct: 594  ANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNY 653

Query: 277  YRRFVKSYGEIAAPITKLLQKNA-FQWNEEATIAFDQLKLAMTTIPVLALLDWSQPLTIE 336
             R+F+    ++  P+  LL+K+  ++W    T A + +K  + + PVL   D+S+ + +E
Sbjct: 654  LRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLE 713

Query: 337  TDASGVGLGAVLSQDG-----HPIAFFSQKLSPRAQGKSIYERELMAVVLSVQKWRHYLL 396
            TDAS V +GAVLSQ       +P+ ++S K+S      S+ ++E++A++ S++ WRHYL 
Sbjct: 714  TDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLE 773

Query: 397  G--KRFTIISDQKAL--KFLLELREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSR 456
               + F I++D + L  +   E      +  +W   L  ++FEI Y+PG  N +ADALSR
Sbjct: 774  STIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSR 833

Query: 457  ---------KD-QSAELNTMTTTGIVD--IELIEKEVEKDQELHKIIAELKGGVDQGGKY 516
                     KD +   +N +    I D     +  E   D +L  ++      V++    
Sbjct: 834  IVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEE--NI 893

Query: 517  QWNNGRLL-YKGRMVLSRNSSLIPRLLHTFHDSILGGHSRFLRTYKRMSGELFWKGMRAD 574
            Q  +G L+  K +++L  ++ L   ++  +H+     H         +     WKG+R  
Sbjct: 894  QLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQ 953

BLAST of CSPI03G21020 vs. ExPASy Swiss-Prot
Match: P0CT35 (Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-2 PE=3 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 1.5e-75
Identity = 187/605 (30.91%), Positives = 302/605 (49.92%), Query Frame = 0

Query: 37   PINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSHNLYSSPILLVKKTDRG---------- 96
            PI   P   G +Q    E  + + L++G+IR S  + + P++ V K +            
Sbjct: 414  PIRNYPLPPGKMQAMNDE--INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPL 473

Query: 97   -----------GVIEELLDELHGATIFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEF 156
                        +IE+LL ++ G+TIF+KLDLKS YH IR+R+ D  K AFR   G +E+
Sbjct: 474  NKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEY 533

Query: 157  LVMPFGLTNAPATFQSLMNEVFKPFLKGMIFAI------------------------LRD 216
            LVMP+G++ APA FQ  +N +     +  +                           L++
Sbjct: 534  LVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKN 593

Query: 217  HKLFANRTKCVIAHSQVQYLGHLISNRGVEADEEKIRSMVNWPRPKDITGLRGFLGLTGY 276
              L  N+ KC    SQV+++G+ IS +G    +E I  ++ W +PK+   LR FLG   Y
Sbjct: 594  ANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNY 653

Query: 277  YRRFVKSYGEIAAPITKLLQKNA-FQWNEEATIAFDQLKLAMTTIPVLALLDWSQPLTIE 336
             R+F+    ++  P+  LL+K+  ++W    T A + +K  + + PVL   D+S+ + +E
Sbjct: 654  LRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLE 713

Query: 337  TDASGVGLGAVLSQDG-----HPIAFFSQKLSPRAQGKSIYERELMAVVLSVQKWRHYLL 396
            TDAS V +GAVLSQ       +P+ ++S K+S      S+ ++E++A++ S++ WRHYL 
Sbjct: 714  TDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLE 773

Query: 397  G--KRFTIISDQKAL--KFLLELREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSR 456
               + F I++D + L  +   E      +  +W   L  ++FEI Y+PG  N +ADALSR
Sbjct: 774  STIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSR 833

Query: 457  ---------KD-QSAELNTMTTTGIVD--IELIEKEVEKDQELHKIIAELKGGVDQGGKY 516
                     KD +   +N +    I D     +  E   D +L  ++      V++    
Sbjct: 834  IVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEE--NI 893

Query: 517  QWNNGRLL-YKGRMVLSRNSSLIPRLLHTFHDSILGGHSRFLRTYKRMSGELFWKGMRAD 574
            Q  +G L+  K +++L  ++ L   ++  +H+     H         +     WKG+R  
Sbjct: 894  QLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQ 953

BLAST of CSPI03G21020 vs. ExPASy Swiss-Prot
Match: P0CT36 (Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-3 PE=1 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 1.5e-75
Identity = 187/605 (30.91%), Positives = 302/605 (49.92%), Query Frame = 0

Query: 37   PINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSHNLYSSPILLVKKTDRG---------- 96
            PI   P   G +Q    E  + + L++G+IR S  + + P++ V K +            
Sbjct: 414  PIRNYPLPPGKMQAMNDE--INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPL 473

Query: 97   -----------GVIEELLDELHGATIFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEF 156
                        +IE+LL ++ G+TIF+KLDLKS YH IR+R+ D  K AFR   G +E+
Sbjct: 474  NKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEY 533

Query: 157  LVMPFGLTNAPATFQSLMNEVFKPFLKGMIFAI------------------------LRD 216
            LVMP+G++ APA FQ  +N +     +  +                           L++
Sbjct: 534  LVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKN 593

Query: 217  HKLFANRTKCVIAHSQVQYLGHLISNRGVEADEEKIRSMVNWPRPKDITGLRGFLGLTGY 276
              L  N+ KC    SQV+++G+ IS +G    +E I  ++ W +PK+   LR FLG   Y
Sbjct: 594  ANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNY 653

Query: 277  YRRFVKSYGEIAAPITKLLQKNA-FQWNEEATIAFDQLKLAMTTIPVLALLDWSQPLTIE 336
             R+F+    ++  P+  LL+K+  ++W    T A + +K  + + PVL   D+S+ + +E
Sbjct: 654  LRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLE 713

Query: 337  TDASGVGLGAVLSQDG-----HPIAFFSQKLSPRAQGKSIYERELMAVVLSVQKWRHYLL 396
            TDAS V +GAVLSQ       +P+ ++S K+S      S+ ++E++A++ S++ WRHYL 
Sbjct: 714  TDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLE 773

Query: 397  G--KRFTIISDQKAL--KFLLELREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSR 456
               + F I++D + L  +   E      +  +W   L  ++FEI Y+PG  N +ADALSR
Sbjct: 774  STIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSR 833

Query: 457  ---------KD-QSAELNTMTTTGIVD--IELIEKEVEKDQELHKIIAELKGGVDQGGKY 516
                     KD +   +N +    I D     +  E   D +L  ++      V++    
Sbjct: 834  IVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEE--NI 893

Query: 517  QWNNGRLL-YKGRMVLSRNSSLIPRLLHTFHDSILGGHSRFLRTYKRMSGELFWKGMRAD 574
            Q  +G L+  K +++L  ++ L   ++  +H+     H         +     WKG+R  
Sbjct: 894  QLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQ 953

BLAST of CSPI03G21020 vs. ExPASy Swiss-Prot
Match: P0CT37 (Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-4 PE=3 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 1.5e-75
Identity = 187/605 (30.91%), Positives = 302/605 (49.92%), Query Frame = 0

Query: 37   PINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSHNLYSSPILLVKKTDRG---------- 96
            PI   P   G +Q    E  + + L++G+IR S  + + P++ V K +            
Sbjct: 414  PIRNYPLPPGKMQAMNDE--INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPL 473

Query: 97   -----------GVIEELLDELHGATIFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEF 156
                        +IE+LL ++ G+TIF+KLDLKS YH IR+R+ D  K AFR   G +E+
Sbjct: 474  NKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEY 533

Query: 157  LVMPFGLTNAPATFQSLMNEVFKPFLKGMIFAI------------------------LRD 216
            LVMP+G++ APA FQ  +N +     +  +                           L++
Sbjct: 534  LVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKN 593

Query: 217  HKLFANRTKCVIAHSQVQYLGHLISNRGVEADEEKIRSMVNWPRPKDITGLRGFLGLTGY 276
              L  N+ KC    SQV+++G+ IS +G    +E I  ++ W +PK+   LR FLG   Y
Sbjct: 594  ANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNY 653

Query: 277  YRRFVKSYGEIAAPITKLLQKNA-FQWNEEATIAFDQLKLAMTTIPVLALLDWSQPLTIE 336
             R+F+    ++  P+  LL+K+  ++W    T A + +K  + + PVL   D+S+ + +E
Sbjct: 654  LRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLE 713

Query: 337  TDASGVGLGAVLSQDG-----HPIAFFSQKLSPRAQGKSIYERELMAVVLSVQKWRHYLL 396
            TDAS V +GAVLSQ       +P+ ++S K+S      S+ ++E++A++ S++ WRHYL 
Sbjct: 714  TDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLE 773

Query: 397  G--KRFTIISDQKAL--KFLLELREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSR 456
               + F I++D + L  +   E      +  +W   L  ++FEI Y+PG  N +ADALSR
Sbjct: 774  STIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSR 833

Query: 457  ---------KD-QSAELNTMTTTGIVD--IELIEKEVEKDQELHKIIAELKGGVDQGGKY 516
                     KD +   +N +    I D     +  E   D +L  ++      V++    
Sbjct: 834  IVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEE--NI 893

Query: 517  QWNNGRLL-YKGRMVLSRNSSLIPRLLHTFHDSILGGHSRFLRTYKRMSGELFWKGMRAD 574
            Q  +G L+  K +++L  ++ L   ++  +H+     H         +     WKG+R  
Sbjct: 894  QLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQ 953

BLAST of CSPI03G21020 vs. ExPASy TrEMBL
Match: A0A5D3BBH7 (Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold549G00100 PE=4 SV=1)

HSP 1 Score: 1155.6 bits (2988), Expect = 0.0e+00
Identity = 604/975 (61.95%), Positives = 683/975 (70.05%), Query Frame = 0

Query: 1    MVQRLLEQYADIFRLPTDLPPKRTIDHRILTLPDQKPINVRPYKYGHVQKEEIEKLVLEM 60
            M+Q LL QY+D+F+ PT LPPKR+IDHRILTLP QKPINVRPYKYGH QKEEIEKLV+EM
Sbjct: 559  MIQFLLHQYSDVFKSPTTLPPKRSIDHRILTLPGQKPINVRPYKYGHQQKEEIEKLVIEM 618

Query: 61   LQAGVIRPSHNLYSSPILLVKKTDRG---------------------GVIEELLDELHGA 120
            LQ G+IRPSH+ +SSP+LLVKK D G                      VIEELLDELHGA
Sbjct: 619  LQTGIIRPSHSPFSSPVLLVKKKDGGWRFCVDYRKLNKITIADKFPIPVIEELLDELHGA 678

Query: 121  TIFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKP 180
            T+FSKLDLKSGYHQIRMREED+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMN+VFKP
Sbjct: 679  TVFSKLDLKSGYHQIRMREEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQSLMNQVFKP 738

Query: 181  FLK------------------------GMIFAILRDHKLFANRTKCVIAHSQVQYLGHLI 240
            FL+                        GM+FA LRD++L+ANR KCV AHSQ+ YLGH+I
Sbjct: 739  FLRRCVLVFFDDILVYSSDITEHEKHLGMVFATLRDNQLYANRKKCVFAHSQIHYLGHVI 798

Query: 241  SNRGVEADEEKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPITKLLQKNAF 300
            S  GVEAD++K++SM+ WP+PKD+TGLRGFLGLTGYYRRFVK YGEIAAP+TKLLQKNAF
Sbjct: 799  SKHGVEADQDKVKSMLQWPKPKDVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNAF 858

Query: 301  QWNEEATIAFDQLKLAMTTIPVLALLDWSQPLTIETDASGVGLGAVLSQDGHPIAFFSQK 360
            +W+E AT+AF+ LK AM+TIPVLAL DWS P  IETDASG GLGAVLSQ+ HPIAFFSQK
Sbjct: 859  KWDENATLAFESLKSAMSTIPVLALPDWSLPFMIETDASGSGLGAVLSQNSHPIAFFSQK 918

Query: 361  LSPRAQGKSIYERELMAVVLSVQKWRHYLLGKRFTIISDQKALKFLLELREVQPQFQKWL 420
            LS RAQ KSIYERELMAVVLSVQKWRHYLLG+RFTI+SDQKALKFLLE REVQPQFQKWL
Sbjct: 919  LSTRAQAKSIYERELMAVVLSVQKWRHYLLGRRFTIMSDQKALKFLLEQREVQPQFQKWL 978

Query: 421  TKLLGYDFEILYQPGLQNKVADALSRKDQSAELNTMTTTGIVDIELIEKEVEKDQELHKI 480
            TKLLGYDFEILYQPGLQNK ADALSR D S EL  ++TTGIVD+E++ KEVEKD+EL  +
Sbjct: 979  TKLLGYDFEILYQPGLQNKAADALSRMDHSIELKALSTTGIVDMEVVTKEVEKDEELQLL 1038

Query: 481  IAELKGGVDQGGKYQWNNGRLLYKGRMVLSRNSSLIPRLLHTFHDSILGGHSRFLRTYKR 540
            I +L+      GKY   NG L+YKGR+VLS++SS+IP LLHTFHDSILGGHS FLRTYKR
Sbjct: 1039 IQQLQNNPALEGKYSLTNGTLMYKGRVVLSKSSSIIPSLLHTFHDSILGGHSGFLRTYKR 1098

Query: 541  MSGELFWKGMRADVKRYVEECDICQCNKFEATKPVGVLQPIPIPDKILEDWTMDFIEGLP 600
            MSGELFWKGM+ D+K+YVE+C+ICQ NK EATKP GVLQP+PIPD+ILEDWTMDFIEGLP
Sbjct: 1099 MSGELFWKGMKEDIKKYVEQCEICQRNKSEATKPAGVLQPLPIPDRILEDWTMDFIEGLP 1158

Query: 601  IAGGYNMIMVVVDRLSKI------------------------------------------ 660
             AGG N+IMVVVDRLSK                                           
Sbjct: 1159 KAGGMNVIMVVVDRLSKYAYFVTMKHPFSAKQVAMEFIDKIVRRHGIPKSIISDRDKIFV 1218

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 1219 SNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAEL 1278

Query: 721  ----------------------PPPLLLYGWKKSPNNDVEVMLKERDLAINALKENLCIA 778
                                  PPPL+ YG KK+PN++VE +LKERDLAI+ALKENL IA
Sbjct: 1279 WYNTTFHSSTRTTPFQTVYGRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIA 1338

BLAST of CSPI03G21020 vs. ExPASy TrEMBL
Match: A0A5D3DWA9 (Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold384G001670 PE=4 SV=1)

HSP 1 Score: 1152.1 bits (2979), Expect = 0.0e+00
Identity = 603/975 (61.85%), Positives = 681/975 (69.85%), Query Frame = 0

Query: 1    MVQRLLEQYADIFRLPTDLPPKRTIDHRILTLPDQKPINVRPYKYGHVQKEEIEKLVLEM 60
            M+Q LL QY+D+F+ PT LPPKR+IDHRILTLP QKPINVRPYKYGH QKEEIEKLV+EM
Sbjct: 559  MIQFLLHQYSDVFKSPTTLPPKRSIDHRILTLPGQKPINVRPYKYGHQQKEEIEKLVIEM 618

Query: 61   LQAGVIRPSHNLYSSPILLVKKTDRG---------------------GVIEELLDELHGA 120
            LQ G+IRPSH+ +SSP+LLVKK D G                      VIEELLDELHGA
Sbjct: 619  LQTGIIRPSHSPFSSPVLLVKKKDGGWRFCVDYRKLNKITIADKFPIPVIEELLDELHGA 678

Query: 121  TIFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKP 180
            T+FSKLDLKSGYHQIRMREED+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMN+VFKP
Sbjct: 679  TVFSKLDLKSGYHQIRMREEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQSLMNQVFKP 738

Query: 181  FLK------------------------GMIFAILRDHKLFANRTKCVIAHSQVQYLGHLI 240
            FL+                        GM+FA LRD++L+ANR KCV AHSQ+ YLGH+I
Sbjct: 739  FLRRCVLVFFDDILVYSSDITEHEKHLGMVFATLRDNQLYANRKKCVFAHSQIHYLGHVI 798

Query: 241  SNRGVEADEEKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPITKLLQKNAF 300
            S  GVEAD++K++SM+ WP+PKD+TGLRGFLGLTGYYRRFVK YGEIAAP+TKLLQKNAF
Sbjct: 799  SKHGVEADQDKVKSMLQWPKPKDVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNAF 858

Query: 301  QWNEEATIAFDQLKLAMTTIPVLALLDWSQPLTIETDASGVGLGAVLSQDGHPIAFFSQK 360
            +W+E AT+AF+ LK AM+TIPVLAL DWS P  IETDASG GLGAVLSQ+ HPIAFFSQK
Sbjct: 859  KWDENATLAFESLKSAMSTIPVLALPDWSLPFMIETDASGSGLGAVLSQNSHPIAFFSQK 918

Query: 361  LSPRAQGKSIYERELMAVVLSVQKWRHYLLGKRFTIISDQKALKFLLELREVQPQFQKWL 420
            LS RAQ KSIYERELMAVVLSVQKWRHYLLG+RFTI+SDQKALKFLLE REVQPQFQKWL
Sbjct: 919  LSTRAQAKSIYERELMAVVLSVQKWRHYLLGRRFTIMSDQKALKFLLEQREVQPQFQKWL 978

Query: 421  TKLLGYDFEILYQPGLQNKVADALSRKDQSAELNTMTTTGIVDIELIEKEVEKDQELHKI 480
            TKLLGYDFEILYQPGLQNK ADALSR D S EL  ++TTGIVD+E++ KEVEKD+EL  +
Sbjct: 979  TKLLGYDFEILYQPGLQNKAADALSRMDHSIELKALSTTGIVDMEVVTKEVEKDEELQLL 1038

Query: 481  IAELKGGVDQGGKYQWNNGRLLYKGRMVLSRNSSLIPRLLHTFHDSILGGHSRFLRTYKR 540
            I +L+      GKY   NG L+YKGR+VLS++SS+IP LLHTFHDSILGGHS FLRTYKR
Sbjct: 1039 IQQLQNNPALEGKYSLTNGTLMYKGRVVLSKSSSIIPSLLHTFHDSILGGHSGFLRTYKR 1098

Query: 541  MSGELFWKGMRADVKRYVEECDICQCNKFEATKPVGVLQPIPIPDKILEDWTMDFIEGLP 600
            MSGELFWKGM+ D+K+YVE+C+ICQ NK EATKP GVLQP+PIPD+ILEDWTMDFIEGLP
Sbjct: 1099 MSGELFWKGMKEDIKKYVEQCEICQRNKSEATKPAGVLQPLPIPDRILEDWTMDFIEGLP 1158

Query: 601  IAGGYNMIMVVVDRLSKI------------------------------------------ 660
             AGG N+IMVVVDRLSK                                           
Sbjct: 1159 KAGGMNVIMVVVDRLSKYAYFVTMKHPFSAKQVAMEFIDKIVRRHGIPKSIISDRDKIFV 1218

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 1219 SNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAEL 1278

Query: 721  ----------------------PPPLLLYGWKKSPNNDVEVMLKERDLAINALKENLCIA 778
                                  PPPL+ YG KK+PN++VE +LKERDLAI+ALKENL IA
Sbjct: 1279 WYNTTFHSSTRTTPFQTVYGRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIA 1338

BLAST of CSPI03G21020 vs. ExPASy TrEMBL
Match: A0A5D3DU86 (Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold95G00470 PE=4 SV=1)

HSP 1 Score: 1152.1 bits (2979), Expect = 0.0e+00
Identity = 603/975 (61.85%), Positives = 681/975 (69.85%), Query Frame = 0

Query: 1    MVQRLLEQYADIFRLPTDLPPKRTIDHRILTLPDQKPINVRPYKYGHVQKEEIEKLVLEM 60
            M+Q LL QY+D+F+ PT LPPKR+IDHRILTLP QKPINVRPYKYGH QKEEIEKLV+EM
Sbjct: 559  MIQFLLHQYSDVFKSPTTLPPKRSIDHRILTLPGQKPINVRPYKYGHQQKEEIEKLVIEM 618

Query: 61   LQAGVIRPSHNLYSSPILLVKKTDRG---------------------GVIEELLDELHGA 120
            LQ G+IRPSH+ +SSP+LLVKK D G                      VIEELLDELHGA
Sbjct: 619  LQTGIIRPSHSPFSSPVLLVKKKDGGWRFCVDYRKLNKITIADKFPIPVIEELLDELHGA 678

Query: 121  TIFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKP 180
            T+FSKLDLKSGYHQIRMREED+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMN+VFKP
Sbjct: 679  TVFSKLDLKSGYHQIRMREEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQSLMNQVFKP 738

Query: 181  FLK------------------------GMIFAILRDHKLFANRTKCVIAHSQVQYLGHLI 240
            FL+                        GM+FA LRD++L+ANR KCV AHSQ+ YLGH+I
Sbjct: 739  FLRRCVLVFFDDILVYSSDITEHEKHLGMVFATLRDNQLYANRKKCVFAHSQIHYLGHVI 798

Query: 241  SNRGVEADEEKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPITKLLQKNAF 300
            S  GVEAD++K++SM+ WP+PKD+TGLRGFLGLTGYYRRFVK YGEIAAP+TKLLQKNAF
Sbjct: 799  SKHGVEADQDKVKSMLQWPKPKDVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNAF 858

Query: 301  QWNEEATIAFDQLKLAMTTIPVLALLDWSQPLTIETDASGVGLGAVLSQDGHPIAFFSQK 360
            +W+E AT+AF+ LK AM+TIPVLAL DWS P  IETDASG GLGAVLSQ+ HPIAFFSQK
Sbjct: 859  KWDENATLAFESLKSAMSTIPVLALPDWSLPFMIETDASGSGLGAVLSQNSHPIAFFSQK 918

Query: 361  LSPRAQGKSIYERELMAVVLSVQKWRHYLLGKRFTIISDQKALKFLLELREVQPQFQKWL 420
            LS RAQ KSIYERELMAVVLSVQKWRHYLLG+RFTI+SDQKALKFLLE REVQPQFQKWL
Sbjct: 919  LSTRAQAKSIYERELMAVVLSVQKWRHYLLGRRFTIMSDQKALKFLLEQREVQPQFQKWL 978

Query: 421  TKLLGYDFEILYQPGLQNKVADALSRKDQSAELNTMTTTGIVDIELIEKEVEKDQELHKI 480
            TKLLGYDFEILYQPGLQNK ADALSR D S EL  ++TTGIVD+E++ KEVEKD+EL  +
Sbjct: 979  TKLLGYDFEILYQPGLQNKAADALSRMDHSIELKALSTTGIVDMEVVTKEVEKDEELQLL 1038

Query: 481  IAELKGGVDQGGKYQWNNGRLLYKGRMVLSRNSSLIPRLLHTFHDSILGGHSRFLRTYKR 540
            I +L+      GKY   NG L+YKGR+VLS++SS+IP LLHTFHDSILGGHS FLRTYKR
Sbjct: 1039 IQQLQNNPALEGKYSLTNGTLMYKGRVVLSKSSSIIPSLLHTFHDSILGGHSGFLRTYKR 1098

Query: 541  MSGELFWKGMRADVKRYVEECDICQCNKFEATKPVGVLQPIPIPDKILEDWTMDFIEGLP 600
            MSGELFWKGM+ D+K+YVE+C+ICQ NK EATKP GVLQP+PIPD+ILEDWTMDFIEGLP
Sbjct: 1099 MSGELFWKGMKEDIKKYVEQCEICQRNKSEATKPAGVLQPLPIPDRILEDWTMDFIEGLP 1158

Query: 601  IAGGYNMIMVVVDRLSKI------------------------------------------ 660
             AGG N+IMVVVDRLSK                                           
Sbjct: 1159 KAGGMNVIMVVVDRLSKYAYFVTMKHPFSAKQVAMEFIDKIVRRHGIPKSIISDRDKIFV 1218

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 1219 SNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAEL 1278

Query: 721  ----------------------PPPLLLYGWKKSPNNDVEVMLKERDLAINALKENLCIA 778
                                  PPPL+ YG KK+PN++VE +LKERDLAI+ALKENL IA
Sbjct: 1279 WYNTTFHSSTRTTPFQTVYGRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIA 1338

BLAST of CSPI03G21020 vs. ExPASy TrEMBL
Match: A0A5D3E325 (Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold426G00690 PE=4 SV=1)

HSP 1 Score: 1150.6 bits (2975), Expect = 0.0e+00
Identity = 603/975 (61.85%), Positives = 679/975 (69.64%), Query Frame = 0

Query: 1    MVQRLLEQYADIFRLPTDLPPKRTIDHRILTLPDQKPINVRPYKYGHVQKEEIEKLVLEM 60
            M+Q LL QY+D+F  PT LPPKR IDHRILTLP QKPINVRPYKYGH QKEEIEKLV+EM
Sbjct: 559  MIQFLLHQYSDVFNSPTTLPPKRIIDHRILTLPGQKPINVRPYKYGHQQKEEIEKLVIEM 618

Query: 61   LQAGVIRPSHNLYSSPILLVKKTDRG---------------------GVIEELLDELHGA 120
            LQ G+IRPSH+ +SSP+LLVKK D G                      VIEELLDELHGA
Sbjct: 619  LQTGIIRPSHSPFSSPVLLVKKKDGGWRFCVDYRKLNKITIADKFPIPVIEELLDELHGA 678

Query: 121  TIFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKP 180
            T+FSKLDLKSGYHQIRMREED+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMN+VFKP
Sbjct: 679  TVFSKLDLKSGYHQIRMREEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQSLMNQVFKP 738

Query: 181  FLK------------------------GMIFAILRDHKLFANRTKCVIAHSQVQYLGHLI 240
            FL+                        GM+FA LRD++L+ANR KCV AHSQ+ YLGH+I
Sbjct: 739  FLRRCVLVFFDDILVYSSDITEHEKHLGMVFATLRDNQLYANRKKCVFAHSQIHYLGHVI 798

Query: 241  SNRGVEADEEKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPITKLLQKNAF 300
            S  GVEAD++K++SM+ WP+PKD+TGLRGFLGLTGYYRRFVK YGEIAAP+TKLLQKNAF
Sbjct: 799  SKHGVEADQDKVKSMLQWPKPKDVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNAF 858

Query: 301  QWNEEATIAFDQLKLAMTTIPVLALLDWSQPLTIETDASGVGLGAVLSQDGHPIAFFSQK 360
            +W+E AT+AF+ LK AM+TIPVLAL DWS P  IETDASG GLGAVLSQ+ HPIAFFSQK
Sbjct: 859  KWDENATLAFESLKSAMSTIPVLALPDWSLPFMIETDASGSGLGAVLSQNSHPIAFFSQK 918

Query: 361  LSPRAQGKSIYERELMAVVLSVQKWRHYLLGKRFTIISDQKALKFLLELREVQPQFQKWL 420
            LS RAQ KSIYERELMAVVLSVQKWRHYLLG+RFTI+SDQKALKFLLE REVQPQFQKWL
Sbjct: 919  LSTRAQAKSIYERELMAVVLSVQKWRHYLLGRRFTIMSDQKALKFLLEQREVQPQFQKWL 978

Query: 421  TKLLGYDFEILYQPGLQNKVADALSRKDQSAELNTMTTTGIVDIELIEKEVEKDQELHKI 480
            TKLLGYDFEILYQPGLQNK ADALSR D S EL  ++TTGIVD+E++ KEVEKD+EL  +
Sbjct: 979  TKLLGYDFEILYQPGLQNKAADALSRMDHSIELKALSTTGIVDMEVVTKEVEKDEELQLL 1038

Query: 481  IAELKGGVDQGGKYQWNNGRLLYKGRMVLSRNSSLIPRLLHTFHDSILGGHSRFLRTYKR 540
            I +L+      GKY   NG L+YKGR+VLS++SS+IP LLHTFHDSILGGHS FLRTYKR
Sbjct: 1039 IQQLQNNPALEGKYSLTNGTLMYKGRVVLSKSSSIIPSLLHTFHDSILGGHSGFLRTYKR 1098

Query: 541  MSGELFWKGMRADVKRYVEECDICQCNKFEATKPVGVLQPIPIPDKILEDWTMDFIEGLP 600
            MSGELFWKGM+ D+K+YVE+C+ICQ NK EATKP GVLQP+PIPD+ILEDWTMDFIEGLP
Sbjct: 1099 MSGELFWKGMKEDIKKYVEQCEICQRNKSEATKPAGVLQPLPIPDRILEDWTMDFIEGLP 1158

Query: 601  IAGGYNMIMVVVDRLSKI------------------------------------------ 660
             AGG N+IMVVVDRLSK                                           
Sbjct: 1159 KAGGMNVIMVVVDRLSKYAYFVTMKHPFSAKQVAMEFIDKIVRRHGIPKSIISDRDKIFV 1218

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 1219 SNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAEL 1278

Query: 721  ----------------------PPPLLLYGWKKSPNNDVEVMLKERDLAINALKENLCIA 778
                                  PPPL+ YG KK+PN++VE +LKERDLAI+ALKENL IA
Sbjct: 1279 WYNTTFHSSTRTTPFQTVYGRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIA 1338
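
Each HSP header on this page gives identities and positives as counts over the alignment length, followed by a percentage. The sketch below is a minimal illustration, assuming the header layout shown above: it parses one header and recomputes both percentages from the counts. The function name `parse_hsp_header` and the dictionary keys are hypothetical and are not part of any BLAST tool.

```python
import re

# Patterns for the two HSP header lines as they appear on this page.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling
# that some exports of this page use.
HSP_SCORE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_header(text):
    """Return the numeric fields of one HSP header (hypothetical helper)."""
    score = HSP_SCORE.search(text)
    stats = HSP_STATS.search(text)
    if score is None or stats is None:
        raise ValueError("text does not look like an HSP header")
    identities = int(stats.group(1))
    align_len = int(stats.group(2))
    positives = int(stats.group(4))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
        "identities": identities,
        "positives": positives,
        "align_len": align_len,
        # Percentages recomputed from the counts, rounded to two decimals.
        "pct_identity": round(100 * identities / align_len, 2),
        "pct_positives": round(100 * positives / align_len, 2),
    }


# Header copied from the A0A5D3E325 match above.
header = """HSP 1 Score: 1150.6 bits (2975), Expect = 0.0e+00
Identity = 603/975 (61.85%), Postives = 679/975 (69.64%), Query Frame = 0"""

hsp = parse_hsp_header(header)
print(hsp["pct_identity"], hsp["pct_positives"])
```

Recomputing the percentages from the counts is a cheap consistency check on a scraped record: here 603/975 and 679/975 reproduce the 61.85% and 69.64% printed in the header.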

BLAST of CSPI03G21020 vs. ExPASy TrEMBL
Match: A0A5D3CT96 (Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold45G001230 PE=4 SV=1)

HSP 1 Score: 1149.4 bits (2972), Expect = 0.0e+00
Identity = 601/975 (61.64%), Positives = 680/975 (69.74%), Query Frame = 0

Query: 1    MVQRLLEQYADIFRLPTDLPPKRTIDHRILTLPDQKPINVRPYKYGHVQKEEIEKLVLEM 60
            M+Q LL QY+D+F  PT LPPKR+IDHRILTLP QKPINVRPYKYGH QKEEIEKLV+EM
Sbjct: 559  MIQFLLHQYSDVFESPTTLPPKRSIDHRILTLPGQKPINVRPYKYGHQQKEEIEKLVMEM 618

Query: 61   LQAGVIRPSHNLYSSPILLVKKTDRG---------------------GVIEELLDELHGA 120
            LQ G+IRPSH+ +SSP+LLVKK D G                      VIEELLDELHGA
Sbjct: 619  LQTGIIRPSHSPFSSPVLLVKKKDGGWRFCVDYRKLNTITIADKFPIPVIEELLDELHGA 678

Query: 121  TIFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKP 180
            T+FSKLDLKSGYHQIRMREED+EKTAFRTHEGHYEF+VMPFGLTNAPATFQ+LMN+VFKP
Sbjct: 679  TVFSKLDLKSGYHQIRMREEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQALMNQVFKP 738

Query: 181  FLK------------------------GMIFAILRDHKLFANRTKCVIAHSQVQYLGHLI 240
            FL+                        GM+FA LRD++L+ANR KCV AHSQ+ YLGH+I
Sbjct: 739  FLRRCVLVFFDDILVYSSDITEHEKHLGMVFATLRDNQLYANRKKCVFAHSQIHYLGHVI 798

Query: 241  SNRGVEADEEKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPITKLLQKNAF 300
            S  GVEAD++K++SM+ WP+PKD+TGLRGFLGLTGYYRRFVK YGEIAAP+TKLLQKNAF
Sbjct: 799  SKHGVEADQDKVKSMLQWPKPKDVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNAF 858

Query: 301  QWNEEATIAFDQLKLAMTTIPVLALLDWSQPLTIETDASGVGLGAVLSQDGHPIAFFSQK 360
            +W+E AT+AF+ LK AM+TIPVLAL DWS P  IETDASG GLGAVLSQ+ HPIAFFSQK
Sbjct: 859  KWDENATLAFENLKSAMSTIPVLALPDWSLPFMIETDASGSGLGAVLSQNSHPIAFFSQK 918

Query: 361  LSPRAQGKSIYERELMAVVLSVQKWRHYLLGKRFTIISDQKALKFLLELREVQPQFQKWL 420
            LS RAQ KSIYERELMAVVLSVQKWRHYLLG+RFTI+SDQKALKFLLE REVQPQFQKWL
Sbjct: 919  LSTRAQAKSIYERELMAVVLSVQKWRHYLLGRRFTIMSDQKALKFLLEQREVQPQFQKWL 978

Query: 421  TKLLGYDFEILYQPGLQNKVADALSRKDQSAELNTMTTTGIVDIELIEKEVEKDQELHKI 480
            TKLLGYDFEILYQPGLQNK ADALSR D S EL  ++TTGIVD+ ++ KE+EKD+EL  +
Sbjct: 979  TKLLGYDFEILYQPGLQNKAADALSRMDHSIELKALSTTGIVDMAVVTKEIEKDEELQLL 1038

Query: 481  IAELKGGVDQGGKYQWNNGRLLYKGRMVLSRNSSLIPRLLHTFHDSILGGHSRFLRTYKR 540
            I +L+      G Y   NG L+YKGR+VLS++SS+IP LLHTFHDSILGGHS FLRTYKR
Sbjct: 1039 IQQLQNNPALEGNYSLTNGTLMYKGRVVLSKSSSIIPSLLHTFHDSILGGHSGFLRTYKR 1098

Query: 541  MSGELFWKGMRADVKRYVEECDICQCNKFEATKPVGVLQPIPIPDKILEDWTMDFIEGLP 600
            MSGELFWKGM+ D+K+YVE+C+ICQ NK EATKP GVLQP+PIPD+ILEDWTMDFIEGLP
Sbjct: 1099 MSGELFWKGMKEDIKKYVEQCEICQRNKSEATKPAGVLQPLPIPDRILEDWTMDFIEGLP 1158

Query: 601  IAGGYNMIMVVVDRLSKI------------------------------------------ 660
             AGG N+IMVVVDRLSK                                           
Sbjct: 1159 KAGGMNVIMVVVDRLSKYAYFVTMKHPFSAKQVAMEFIDKIVRRHGIPKSIISDRDKIFV 1218

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 1219 SNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAEL 1278

Query: 721  ----------------------PPPLLLYGWKKSPNNDVEVMLKERDLAINALKENLCIA 778
                                  PPPL+ YG KK+PN++VE +LKERDLAI+ALKENL IA
Sbjct: 1279 WYNTTFHSSTRTTPFQTVYGRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIA 1338

BLAST of CSPI03G21020 vs. NCBI nr
Match: KAE8637598.1 (hypothetical protein CSA_022681 [Cucumis sativus])

HSP 1 Score: 1363.2 bits (3527), Expect = 0.0e+00
Identity = 715/982 (72.81%), Positives = 746/982 (75.97%), Query Frame = 0

Query: 1    MVQRLLEQYADIFRLPTDLPPKRTIDHRILTLPDQKPINVRPYKYGHVQKEEIEKLVLEM 60
            MVQRLLEQYAD+FRLPT LPP+R IDHRILT+ DQKPINVRPYKYGHVQKEEIEKLVLEM
Sbjct: 87   MVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEM 146

Query: 61   LQAGVIRPSHNLYSSPILLVKKTDRG---------------------GVIEELLDELHGA 120
            LQAGVIRPS + YSSP+LLVKK D G                      VIEELLDELHGA
Sbjct: 147  LQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGA 206

Query: 121  TIFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKP 180
            T FSKLDLKSGYHQIRMREEDVEKTAF THEGHYEFLVMPFGLTNAPATFQSLMNEVFKP
Sbjct: 207  TAFSKLDLKSGYHQIRMREEDVEKTAFHTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKP 266

Query: 181  FLK------------------------GMIFAILRDHKLFANRTKCVIAHSQVQYLGHLI 240
            FL+                        GM+FAILRDH+LFANR+KCVIAHSQVQYLGHLI
Sbjct: 267  FLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLI 326

Query: 241  SNRGVEADEEKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPITKLLQKNAF 300
            S+RGVEADE+KIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAP+TKLLQKNAF
Sbjct: 327  SSRGVEADEDKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAF 386

Query: 301  QWNEEATIAFDQLKLAMTTIPVLALLDWSQPLTIETDASGVGLGAVLSQDGHPIAFFSQK 360
             WNEEATIAFDQLKLAMTT+PVLAL DWSQP TIETDASGVGLGAVLSQDGHPIAFFSQK
Sbjct: 387  HWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQK 446

Query: 361  LSPRAQGKSIYERELMAVVLSVQKWRHYLLGKRFTIISDQKALKFLLELREVQPQFQKWL 420
            LSPRAQGKSIYERELMAVVLSVQKWRHYLLG++FTI+SDQKALKFLLE REVQPQFQKWL
Sbjct: 447  LSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWL 506

Query: 421  TKLLGYDFEILYQPGLQNKVADALSRKDQSAELNTMTTTGIVDIELIEKEVEKDQELHKI 480
            TKLLGYDFEILYQPGLQNKVADALSRKD S ELNTMTTTGIVDIE+IEKEVE DQEL KI
Sbjct: 507  TKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVEMDQELQKI 566

Query: 481  IAELKGGVDQGGKYQWNNGRLLYKGRMVLSRNSSLIPRLLHTFHDSILGGHSRFLRTYKR 540
            IAELKG VDQGGKYQWNNGRLLYKGRMVL RNSSLIP LLHTFHDSILGGHS FLRTYKR
Sbjct: 567  IAELKGEVDQGGKYQWNNGRLLYKGRMVLPRNSSLIPSLLHTFHDSILGGHSGFLRTYKR 626

Query: 541  MSGELFWKGMRADVKRYVEECDICQCNKFEATKPVGVLQPIPIPDKILEDWTMDFIEGLP 600
            MSGELFWKGM+AD+KRYVEECD CQ NKFEATKP GVLQPIPIPDKILEDWTMDFIEGLP
Sbjct: 627  MSGELFWKGMKADIKRYVEECDTCQRNKFEATKPAGVLQPIPIPDKILEDWTMDFIEGLP 686

Query: 601  IAGGYNMIMVVVDRLSK------------------------------------------- 660
            IAGGYN+IMVVVDRLSK                                           
Sbjct: 687  IAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQVASIFLEKVVSKHGIPKSIITDRDKIFL 746

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 747  SNFWKELFTTMGTILKRSTAFHPQTDGQTERVNRCLETYLRCFCNEQPKKWDKLIPWAEL 806

Query: 721  ---------------------IPPPLLLYGWKKSPNNDVEVMLKERDLAINALKENLCIA 780
                                  PPPLL YGWK+SPNNDVEVMLKERDLA+NAL+ENLCIA
Sbjct: 807  WYNTTFHASTKTTPYQSVFGRTPPPLLSYGWKQSPNNDVEVMLKERDLALNALEENLCIA 866

Query: 781  QNIMKKMADRNRRELRFKIGDEVYLKLRPYRQRSLARKKCEKLSSKFYGLYEIIEEIGEV 785
            QN MKKMADRNRREL+FKIGDEVYLKLRPYRQRSLARKKCEKLS KFYG YE+IEEIGEV
Sbjct: 867  QNRMKKMADRNRRELKFKIGDEVYLKLRPYRQRSLARKKCEKLSPKFYGPYEVIEEIGEV 926

BLAST of CSPI03G21020 vs. NCBI nr
Match: TYJ96663.1 (Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 1155.6 bits (2988), Expect = 0.0e+00
Identity = 604/975 (61.95%), Positives = 683/975 (70.05%), Query Frame = 0

Query: 1    MVQRLLEQYADIFRLPTDLPPKRTIDHRILTLPDQKPINVRPYKYGHVQKEEIEKLVLEM 60
            M+Q LL QY+D+F+ PT LPPKR+IDHRILTLP QKPINVRPYKYGH QKEEIEKLV+EM
Sbjct: 559  MIQFLLHQYSDVFKSPTTLPPKRSIDHRILTLPGQKPINVRPYKYGHQQKEEIEKLVIEM 618

Query: 61   LQAGVIRPSHNLYSSPILLVKKTDRG---------------------GVIEELLDELHGA 120
            LQ G+IRPSH+ +SSP+LLVKK D G                      VIEELLDELHGA
Sbjct: 619  LQTGIIRPSHSPFSSPVLLVKKKDGGWRFCVDYRKLNKITIADKFPIPVIEELLDELHGA 678

Query: 121  TIFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKP 180
            T+FSKLDLKSGYHQIRMREED+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMN+VFKP
Sbjct: 679  TVFSKLDLKSGYHQIRMREEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQSLMNQVFKP 738

Query: 181  FLK------------------------GMIFAILRDHKLFANRTKCVIAHSQVQYLGHLI 240
            FL+                        GM+FA LRD++L+ANR KCV AHSQ+ YLGH+I
Sbjct: 739  FLRRCVLVFFDDILVYSSDITEHEKHLGMVFATLRDNQLYANRKKCVFAHSQIHYLGHVI 798

Query: 241  SNRGVEADEEKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPITKLLQKNAF 300
            S  GVEAD++K++SM+ WP+PKD+TGLRGFLGLTGYYRRFVK YGEIAAP+TKLLQKNAF
Sbjct: 799  SKHGVEADQDKVKSMLQWPKPKDVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNAF 858

Query: 301  QWNEEATIAFDQLKLAMTTIPVLALLDWSQPLTIETDASGVGLGAVLSQDGHPIAFFSQK 360
            +W+E AT+AF+ LK AM+TIPVLAL DWS P  IETDASG GLGAVLSQ+ HPIAFFSQK
Sbjct: 859  KWDENATLAFESLKSAMSTIPVLALPDWSLPFMIETDASGSGLGAVLSQNSHPIAFFSQK 918

Query: 361  LSPRAQGKSIYERELMAVVLSVQKWRHYLLGKRFTIISDQKALKFLLELREVQPQFQKWL 420
            LS RAQ KSIYERELMAVVLSVQKWRHYLLG+RFTI+SDQKALKFLLE REVQPQFQKWL
Sbjct: 919  LSTRAQAKSIYERELMAVVLSVQKWRHYLLGRRFTIMSDQKALKFLLEQREVQPQFQKWL 978

Query: 421  TKLLGYDFEILYQPGLQNKVADALSRKDQSAELNTMTTTGIVDIELIEKEVEKDQELHKI 480
            TKLLGYDFEILYQPGLQNK ADALSR D S EL  ++TTGIVD+E++ KEVEKD+EL  +
Sbjct: 979  TKLLGYDFEILYQPGLQNKAADALSRMDHSIELKALSTTGIVDMEVVTKEVEKDEELQLL 1038

Query: 481  IAELKGGVDQGGKYQWNNGRLLYKGRMVLSRNSSLIPRLLHTFHDSILGGHSRFLRTYKR 540
            I +L+      GKY   NG L+YKGR+VLS++SS+IP LLHTFHDSILGGHS FLRTYKR
Sbjct: 1039 IQQLQNNPALEGKYSLTNGTLMYKGRVVLSKSSSIIPSLLHTFHDSILGGHSGFLRTYKR 1098

Query: 541  MSGELFWKGMRADVKRYVEECDICQCNKFEATKPVGVLQPIPIPDKILEDWTMDFIEGLP 600
            MSGELFWKGM+ D+K+YVE+C+ICQ NK EATKP GVLQP+PIPD+ILEDWTMDFIEGLP
Sbjct: 1099 MSGELFWKGMKEDIKKYVEQCEICQRNKSEATKPAGVLQPLPIPDRILEDWTMDFIEGLP 1158

Query: 601  IAGGYNMIMVVVDRLSKI------------------------------------------ 660
             AGG N+IMVVVDRLSK                                           
Sbjct: 1159 KAGGMNVIMVVVDRLSKYAYFVTMKHPFSAKQVAMEFIDKIVRRHGIPKSIISDRDKIFV 1218

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 1219 SNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAEL 1278

Query: 721  ----------------------PPPLLLYGWKKSPNNDVEVMLKERDLAINALKENLCIA 778
                                  PPPL+ YG KK+PN++VE +LKERDLAI+ALKENL IA
Sbjct: 1279 WYNTTFHSSTRTTPFQTVYGRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIA 1338

BLAST of CSPI03G21020 vs. NCBI nr
Match: KAE8637561.1 (hypothetical protein CSA_017659 [Cucumis sativus])

HSP 1 Score: 1152.9 bits (2981), Expect = 0.0e+00
Identity = 599/968 (61.88%), Positives = 687/968 (70.97%), Query Frame = 0

Query: 1    MVQRLLEQYADIFRLPTDLPPKRTIDHRILTLPDQKPINVRPYKYGHVQKEEIEKLVLEM 60
            M++ LL+QYA+IF  P  LPPKR IDHRIL LPDQ+PINVRPYKYG+VQKEEIEKLV+EM
Sbjct: 179  MIKNLLQQYANIFEDPKKLPPKREIDHRILVLPDQRPINVRPYKYGYVQKEEIEKLVVEM 238

Query: 61   LQAGVIRPSHNLYSSPILLVKKTDRG---------------------GVIEELLDELHGA 120
            LQAGVIRPSH+ YSSP+LLVKK D G                      VIEELLDELHGA
Sbjct: 239  LQAGVIRPSHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTISDKFPIPVIEELLDELHGA 298

Query: 121  TIFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKP 180
            T+FSKLD+KS YHQIRM+EEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMN+VFKP
Sbjct: 299  TVFSKLDMKSDYHQIRMQEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKP 358

Query: 181  FLK------------------------GMIFAILRDHKLFANRTKCVIAHSQVQYLGHLI 240
            FL+                        GM+FA+LRD+ LFAN+ KCVIAHS++QYLGH+I
Sbjct: 359  FLRRCVLVFFDDILVYSKDISEHEKHLGMVFAVLRDNHLFANKKKCVIAHSKIQYLGHII 418

Query: 241  SNRGVEADEEKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPITKLLQKNAF 300
            S++GV+ADEEKI+ MV WP+PKD+TGLRGFLGL+GYYRRFVK YGEIAAP+T+LLQKN+F
Sbjct: 419  SSKGVQADEEKIKDMVKWPQPKDVTGLRGFLGLSGYYRRFVKGYGEIAAPLTRLLQKNSF 478

Query: 301  QWNEEATIAFDQLKLAMTTIPVLALLDWSQPLTIETDASGVGLGAVLSQDGHPIAFFSQK 360
             W+E+AT+AF++LK AMTTIPVLAL +W  P  IETDASG GLGAVLSQ+GHPIAFFSQK
Sbjct: 479  VWDEQATVAFEKLKTAMTTIPVLALPNWDLPFLIETDASGTGLGAVLSQNGHPIAFFSQK 538

Query: 361  LSPRAQGKSIYERELMAVVLSVQKWRHYLLGKRFTIISDQKALKFLLELREVQPQFQKWL 420
            LS RAQ KSIYERELM VVLSVQKWRHYLLG++FTIISDQKALKFLLE REVQPQFQKWL
Sbjct: 539  LSIRAQAKSIYERELMVVVLSVQKWRHYLLGRKFTIISDQKALKFLLEQREVQPQFQKWL 598

Query: 421  TKLLGYDFEILYQPGLQNKVADALSRKDQSAELNTMTTTGIVDIELIEKEVEKDQELHKI 480
            TKLLGYDFEILYQPGLQNK ADALSR + S E+N++TT GIVD+E+I+KEV +D+EL K 
Sbjct: 599  TKLLGYDFEILYQPGLQNKAADALSRMEYSLEVNSLTTNGIVDMEVIDKEVNQDEELQKT 658

Query: 481  IAELKGGVDQGGKYQWNNGRLLYKGRMVLSRNSSLIPRLLHTFHDSILGGHSRFLRTYKR 540
            I ELK       K+ W NG+LLYK R+VLS+NSS+IP LLHTFHDSILGGHS FLRTYKR
Sbjct: 659  IKELKQNPKGISKFSWENGKLLYKKRVVLSKNSSVIPTLLHTFHDSILGGHSGFLRTYKR 718

Query: 541  MSGELFWKGMRADVKRYVEECDICQCNKFEATKPVGVLQPIPIPDKILEDWTMDFIEGLP 600
            MSGEL+W+GM+AD+K+YVE+C+ICQ NK+EATKP GVL PIP PD ILE+W+MDFIEGLP
Sbjct: 719  MSGELYWEGMKADIKKYVEQCEICQRNKYEATKPAGVLHPIPTPDAILEEWSMDFIEGLP 778

Query: 601  IAGGYNMIMVVVDRLSKI------------------------------------------ 660
             AGG N+IMVVVDRLSK                                           
Sbjct: 779  KAGGMNVIMVVVDRLSKYAYFITMKHPFTAKQVATTFIEKIVSKHGVPKSILSDRDEVFI 838

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 839  SHFWNELFATMGTKLKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPQKWHEFISWAEL 898

Query: 721  ----------------------PPPLLLYGWKKSPNNDVEVMLKERDLAINALKENLCIA 771
                                  PPP+L YG +K+ N++VEVMLKERDLA+NALKENL +A
Sbjct: 899  WYNTTFHSSIRSNPFKIVYGRQPPPILSYGTQKTQNDEVEVMLKERDLALNALKENLHLA 958

BLAST of CSPI03G21020 vs. NCBI nr
Match: TYK27058.1 (Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 1152.1 bits (2979), Expect = 0.0e+00
Identity = 603/975 (61.85%), Positives = 681/975 (69.85%), Query Frame = 0

Query: 1    MVQRLLEQYADIFRLPTDLPPKRTIDHRILTLPDQKPINVRPYKYGHVQKEEIEKLVLEM 60
            M+Q LL QY+D+F+ PT LPPKR+IDHRILTLP QKPINVRPYKYGH QKEEIEKLV+EM
Sbjct: 559  MIQFLLHQYSDVFKSPTTLPPKRSIDHRILTLPGQKPINVRPYKYGHQQKEEIEKLVIEM 618

Query: 61   LQAGVIRPSHNLYSSPILLVKKTDRG---------------------GVIEELLDELHGA 120
            LQ G+IRPSH+ +SSP+LLVKK D G                      VIEELLDELHGA
Sbjct: 619  LQTGIIRPSHSPFSSPVLLVKKKDGGWRFCVDYRKLNKITIADKFPIPVIEELLDELHGA 678

Query: 121  TIFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKP 180
            T+FSKLDLKSGYHQIRMREED+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMN+VFKP
Sbjct: 679  TVFSKLDLKSGYHQIRMREEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQSLMNQVFKP 738

Query: 181  FLK------------------------GMIFAILRDHKLFANRTKCVIAHSQVQYLGHLI 240
            FL+                        GM+FA LRD++L+ANR KCV AHSQ+ YLGH+I
Sbjct: 739  FLRRCVLVFFDDILVYSSDITEHEKHLGMVFATLRDNQLYANRKKCVFAHSQIHYLGHVI 798

Query: 241  SNRGVEADEEKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPITKLLQKNAF 300
            S  GVEAD++K++SM+ WP+PKD+TGLRGFLGLTGYYRRFVK YGEIAAP+TKLLQKNAF
Sbjct: 799  SKHGVEADQDKVKSMLQWPKPKDVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNAF 858

Query: 301  QWNEEATIAFDQLKLAMTTIPVLALLDWSQPLTIETDASGVGLGAVLSQDGHPIAFFSQK 360
            +W+E AT+AF+ LK AM+TIPVLAL DWS P  IETDASG GLGAVLSQ+ HPIAFFSQK
Sbjct: 859  KWDENATLAFESLKSAMSTIPVLALPDWSLPFMIETDASGSGLGAVLSQNSHPIAFFSQK 918

Query: 361  LSPRAQGKSIYERELMAVVLSVQKWRHYLLGKRFTIISDQKALKFLLELREVQPQFQKWL 420
            LS RAQ KSIYERELMAVVLSVQKWRHYLLG+RFTI+SDQKALKFLLE REVQPQFQKWL
Sbjct: 919  LSTRAQAKSIYERELMAVVLSVQKWRHYLLGRRFTIMSDQKALKFLLEQREVQPQFQKWL 978

Query: 421  TKLLGYDFEILYQPGLQNKVADALSRKDQSAELNTMTTTGIVDIELIEKEVEKDQELHKI 480
            TKLLGYDFEILYQPGLQNK ADALSR D S EL  ++TTGIVD+E++ KEVEKD+EL  +
Sbjct: 979  TKLLGYDFEILYQPGLQNKAADALSRMDHSIELKALSTTGIVDMEVVTKEVEKDEELQLL 1038

Query: 481  IAELKGGVDQGGKYQWNNGRLLYKGRMVLSRNSSLIPRLLHTFHDSILGGHSRFLRTYKR 540
            I +L+      GKY   NG L+YKGR+VLS++SS+IP LLHTFHDSILGGHS FLRTYKR
Sbjct: 1039 IQQLQNNPALEGKYSLTNGTLMYKGRVVLSKSSSIIPSLLHTFHDSILGGHSGFLRTYKR 1098

Query: 541  MSGELFWKGMRADVKRYVEECDICQCNKFEATKPVGVLQPIPIPDKILEDWTMDFIEGLP 600
            MSGELFWKGM+ D+K+YVE+C+ICQ NK EATKP GVLQP+PIPD+ILEDWTMDFIEGLP
Sbjct: 1099 MSGELFWKGMKEDIKKYVEQCEICQRNKSEATKPAGVLQPLPIPDRILEDWTMDFIEGLP 1158

Query: 601  IAGGYNMIMVVVDRLSKI------------------------------------------ 660
             AGG N+IMVVVDRLSK                                           
Sbjct: 1159 KAGGMNVIMVVVDRLSKYAYFVTMKHPFSAKQVAMEFIDKIVRRHGIPKSIISDRDKIFV 1218

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 1219 SNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAEL 1278

Query: 721  ----------------------PPPLLLYGWKKSPNNDVEVMLKERDLAINALKENLCIA 778
                                  PPPL+ YG KK+PN++VE +LKERDLAI+ALKENL IA
Sbjct: 1279 WYNTTFHSSTRTTPFQTVYGRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIA 1338

BLAST of CSPI03G21020 vs. NCBI nr
Match: TYK27963.1 (Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 1152.1 bits (2979), Expect = 0.0e+00
Identity = 603/975 (61.85%), Positives = 681/975 (69.85%), Query Frame = 0

Query: 1    MVQRLLEQYADIFRLPTDLPPKRTIDHRILTLPDQKPINVRPYKYGHVQKEEIEKLVLEM 60
            M+Q LL QY+D+F+ PT LPPKR+IDHRILTLP QKPINVRPYKYGH QKEEIEKLV+EM
Sbjct: 559  MIQFLLHQYSDVFKSPTTLPPKRSIDHRILTLPGQKPINVRPYKYGHQQKEEIEKLVIEM 618

Query: 61   LQAGVIRPSHNLYSSPILLVKKTDRG---------------------GVIEELLDELHGA 120
            LQ G+IRPSH+ +SSP+LLVKK D G                      VIEELLDELHGA
Sbjct: 619  LQTGIIRPSHSPFSSPVLLVKKKDGGWRFCVDYRKLNKITIADKFPIPVIEELLDELHGA 678

Query: 121  TIFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKP 180
            T+FSKLDLKSGYHQIRMREED+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMN+VFKP
Sbjct: 679  TVFSKLDLKSGYHQIRMREEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQSLMNQVFKP 738

Query: 181  FLK------------------------GMIFAILRDHKLFANRTKCVIAHSQVQYLGHLI 240
            FL+                        GM+FA LRD++L+ANR KCV AHSQ+ YLGH+I
Sbjct: 739  FLRRCVLVFFDDILVYSSDITEHEKHLGMVFATLRDNQLYANRKKCVFAHSQIHYLGHVI 798

Query: 241  SNRGVEADEEKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPITKLLQKNAF 300
            S  GVEAD++K++SM+ WP+PKD+TGLRGFLGLTGYYRRFVK YGEIAAP+TKLLQKNAF
Sbjct: 799  SKHGVEADQDKVKSMLQWPKPKDVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNAF 858

Query: 301  QWNEEATIAFDQLKLAMTTIPVLALLDWSQPLTIETDASGVGLGAVLSQDGHPIAFFSQK 360
            +W+E AT+AF+ LK AM+TIPVLAL DWS P  IETDASG GLGAVLSQ+ HPIAFFSQK
Sbjct: 859  KWDENATLAFESLKSAMSTIPVLALPDWSLPFMIETDASGSGLGAVLSQNSHPIAFFSQK 918

Query: 361  LSPRAQGKSIYERELMAVVLSVQKWRHYLLGKRFTIISDQKALKFLLELREVQPQFQKWL 420
            LS RAQ KSIYERELMAVVLSVQKWRHYLLG+RFTI+SDQKALKFLLE REVQPQFQKWL
Sbjct: 919  LSTRAQAKSIYERELMAVVLSVQKWRHYLLGRRFTIMSDQKALKFLLEQREVQPQFQKWL 978

Query: 421  TKLLGYDFEILYQPGLQNKVADALSRKDQSAELNTMTTTGIVDIELIEKEVEKDQELHKI 480
            TKLLGYDFEILYQPGLQNK ADALSR D S EL  ++TTGIVD+E++ KEVEKD+EL  +
Sbjct: 979  TKLLGYDFEILYQPGLQNKAADALSRMDHSIELKALSTTGIVDMEVVTKEVEKDEELQLL 1038

Query: 481  IAELKGGVDQGGKYQWNNGRLLYKGRMVLSRNSSLIPRLLHTFHDSILGGHSRFLRTYKR 540
            I +L+      GKY   NG L+YKGR+VLS++SS+IP LLHTFHDSILGGHS FLRTYKR
Sbjct: 1039 IQQLQNNPALEGKYSLTNGTLMYKGRVVLSKSSSIIPSLLHTFHDSILGGHSGFLRTYKR 1098

Query: 541  MSGELFWKGMRADVKRYVEECDICQCNKFEATKPVGVLQPIPIPDKILEDWTMDFIEGLP 600
            MSGELFWKGM+ D+K+YVE+C+ICQ NK EATKP GVLQP+PIPD+ILEDWTMDFIEGLP
Sbjct: 1099 MSGELFWKGMKEDIKKYVEQCEICQRNKSEATKPAGVLQPLPIPDRILEDWTMDFIEGLP 1158

Query: 601  IAGGYNMIMVVVDRLSKI------------------------------------------ 660
             AGG N+IMVVVDRLSK                                           
Sbjct: 1159 KAGGMNVIMVVVDRLSKYAYFVTMKHPFSAKQVAMEFIDKIVRRHGIPKSIISDRDKIFV 1218

Query: 661  ------------------------------------------------------------ 720
                                                                        
Sbjct: 1219 SNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAEL 1278

Query: 721  ----------------------PPPLLLYGWKKSPNNDVEVMLKERDLAINALKENLCIA 778
                                  PPPL+ YG KK+PN++VE +LKERDLAI+ALKENL IA
Sbjct: 1279 WYNTTFHSSTRTTPFQTVYGRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIA 1338

BLAST of CSPI03G21020 vs. TAIR 10
Match: ATMG00860.1 (DNA/RNA polymerases superfamily protein)

HSP 1 Score: 149.4 bits (376), Expect = 1.2e-35
Identity = 70/126 (55.56%), Positives = 88/126 (69.84%), Query Frame = 0

Query: 163 GMIFAILRDHKLFANRTKCVIAHSQVQYLG--HLISNRGVEADEEKIRSMVNWPRPKDIT 222
           GM+  I   H+ +ANR KC     Q+ YLG  H+IS  GV AD  K+ +MV WP PK+ T
Sbjct: 5   GMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKNTT 64

Query: 223 GLRGFLGLTGYYRRFVKSYGEIAAPITKLLQKNAFQWNEEATIAFDQLKLAMTTIPVLAL 282
            LRGFLGLTGYYRRFVK+YG+I  P+T+LL+KN+ +W E A +AF  LK A+TT+PVLAL
Sbjct: 65  ELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNSLKWTEMAALAFKALKGAVTTLPVLAL 124

Query: 283 LDWSQP 287
            D   P
Sbjct: 125 PDLKLP 130

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
P0CT41 | 1.5e-75 | 30.91 | Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24...
P0CT34 | 1.5e-75 | 30.91 | Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
P0CT35 | 1.5e-75 | 30.91 | Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
P0CT36 | 1.5e-75 | 30.91 | Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
P0CT37 | 1.5e-75 | 30.91 | Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...

Match Name | E-value | Identity (%) | Description
A0A5D3BBH7 | 0.0e+00 | 61.95 | Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A5D3DWA9 | 0.0e+00 | 61.85 | Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A5D3DU86 | 0.0e+00 | 61.85 | Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A5D3E325 | 0.0e+00 | 61.85 | Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A5D3CT96 | 0.0e+00 | 61.64 | Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...

Match Name | E-value | Identity (%) | Description
KAE8637598.1 | 0.0e+00 | 72.81 | hypothetical protein CSA_022681 [Cucumis sativus]
TYJ96663.1 | 0.0e+00 | 61.95 | Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]
KAE8637561.1 | 0.0e+00 | 61.88 | hypothetical protein CSA_017659 [Cucumis sativus]
TYK27058.1 | 0.0e+00 | 61.85 | Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]
TYK27963.1 | 0.0e+00 | 61.85 | Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]

Match Name | E-value | Identity (%) | Description
ATMG00860.1 | 1.2e-35 | 55.56 | DNA/RNA polymerases superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR023780 | Chromo domain | PFAM | PF00385 | Chromo | coord: 703..732; e-value: 5.9E-6; score: 26.1
IPR000477 | Reverse transcriptase domain | PFAM | PF00078 | RVT_1 | coord: 40..221; e-value: 5.2E-15; score: 55.6
IPR000477 | Reverse transcriptase domain | PROSITE | PS50878 | RT_POL | coord: 1..230; score: 10.023561
IPR041588 | Integrase zinc-binding domain | PFAM | PF17921 | Integrase_H2C2 | coord: 468..523; e-value: 3.5E-16; score: 59.0
None | No IPR available | GENE3D | 3.10.10.10 | HIV Type 1 Reverse Transcriptase, subunit A, domain 1 | coord: 25..144; e-value: 7.3E-38; score: 132.0
None | No IPR available | GENE3D | 3.10.20.370 | | coord: 287..352; e-value: 3.0E-9; score: 38.7
None | No IPR available | GENE3D | 1.10.340.70 | | coord: 437..522; e-value: 1.2E-14; score: 56.3
None | No IPR available | GENE3D | 2.40.50.40 | | coord: 693..736; e-value: 4.9E-6; score: 28.2
None | No IPR available | PANTHER | PTHR24559 | TRANSPOSON TY3-I GAG-POL POLYPROTEIN | coord: 2..84
None | No IPR available | PANTHER | PTHR24559:SF319 | SUBFAMILY NOT NAMED | coord: 2..84; coord: 233..572
None | No IPR available | PANTHER | PTHR24559 | TRANSPOSON TY3-I GAG-POL POLYPROTEIN | coord: 233..572; coord: 89..171
None | No IPR available | PANTHER | PTHR24559:SF319 | SUBFAMILY NOT NAMED | coord: 89..171
None | No IPR available | CDD | cd01647 | RT_LTR | coord: 64..195; e-value: 6.14056E-50; score: 171.239
None | No IPR available | CDD | cd09274 | RNase_HI_RT_Ty3 | coord: 288..402; e-value: 5.60249E-50; score: 169.21
IPR041577 | Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain | PFAM | PF17919 | RT_RNaseH_2 | coord: 257..351; e-value: 1.9E-30; score: 104.8
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 | | coord: 87..162; e-value: 7.3E-38; score: 132.0
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 | | coord: 204..286; e-value: 1.8E-25; score: 90.8
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 | | coord: 165..202; e-value: 4.4E-6; score: 28.9
IPR000953 | Chromo/chromo shadow domain | PROSITE | PS50013 | CHROMO_2 | coord: 677..723; score: 8.611701
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 5..387
IPR016197 | Chromo-like domain superfamily | SUPERFAMILY | 54160 | Chromo domain-like | coord: 682..729
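
The `coord: start..end` entries in the InterPro annotations give the span of each hit on the protein. As a minimal sketch, the four PFAM hits listed above can be ordered along the sequence and their lengths computed. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_coord` and `domain_length` are hypothetical helper names.

```python
import re

# Matches the "coord: start..end" notation used in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract (start, end) from a 'coord: start..end' string."""
    match = COORD.search(text)
    if match is None:
        raise ValueError("no coord field found")
    return int(match.group(1)), int(match.group(2))


def domain_length(start, end):
    """Length of a hit, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# PFAM hits copied from the InterPro annotations on this page.
pfam_hits = {
    "PF00385 Chromo": "coord: 703..732",
    "PF00078 RVT_1": "coord: 40..221",
    "PF17921 Integrase_H2C2": "coord: 468..523",
    "PF17919 RT_RNaseH_2": "coord: 257..351",
}

# Sort the hits by start position to see their order along the protein.
ordered = sorted(
    ((name, parse_coord(coord)) for name, coord in pfam_hits.items()),
    key=lambda item: item[1][0],
)

for name, (start, end) in ordered:
    print(f"{name}: {start}..{end} ({domain_length(start, end)} aa)")
```

Sorted by start position, the PFAM hits run RVT_1, RT_RNaseH_2, Integrase_H2C2, Chromo.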

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CSPI03G21020.1 | CSPI03G21020.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006355 | regulation of transcription, DNA-templated
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0000976 | transcription cis-regulatory region binding