CSPI03G21630 (gene) Wild cucumber (PI 183967)

Name: CSPI03G21630
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Retrotransposon protein, putative, Ty3-gypsy subclass
Location: Chr3 : 17919094 .. 17921629 (+)
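
For readers who want to work with the location field programmatically, here is a minimal sketch of how it might be parsed. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a 'Chr3 : 17919094 .. 17921629 (+)' style string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr3 : 17919094 .. 17921629 (+)")
print(chrom, strand, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.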
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTCTCTAAGATTGACTTACGCTCAGGTTATCACCAGTTGAGGATTAGAGACAGTAATATTCCTAAGACTTCTTTTCGTTCGAGGTATGGGCATTATGAATTCATTGTAATGTCTTTTGGTTTGACCAACGCACCTGTAGTATTTATGGATTTGATGAATAGGGTGTTTAAGGATTTCTTAGACACTTTTGTGATAATTTTCATTGATGATATTTTGGTTTATTCCAAGACCGAGGCCGAACATGAGGAACATTTACATAAAGTATTAGAACTCTTCGAGTGAATAAACTTTATGCTAAATTATCTAAGTGCGAATTTTGGTTGAAGCAAGTGGCTTTTCTTGGTCATGTGGTTTCCAGTGAGAGAGTTTCTGTAGATCCTGCAAAGATTTAAGCGGTTACCAGTTGGTCTCGACCCTCTACAGTTAGTGAGGTTCGTAGTTTTCTGGGTTTAGCAGTGTATTACCGGAGGTTTGTGGAGGATTTTTCACATTTGGCTACTCCCTTGACTCAATTGACCAGGAAGAGAACTCTGTTTTTTTGGAGTCCAGCTTGTGAGGATAGTTTTCAGAACCTTAAGCAAAGGTTAGTACTGCACCGATCCTTAAAGTACCAGATGGATCTGGAAGCTTTGTGATTTACAGTGATGCTTCCAAGAGAGGACTTGGTTTTGTTTTGATGCAGCAAGATAAGGTAGTTGCTAATGCCTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATTTGGAGTTGGCAACAGTGGTTTTTGCACTAAAGATATGGAGACATTACTTGTATGGTGAAAAGATACAAATCTTTACTGACCATAAGAGCCTGAAGTATTTCTTCACCCAGAAGGAGTTAAATACGAGACAGCGAAGGTGGCTCGAGTTGGTAAAAGATCATGACTATGAGATATTGTACCATCCTGGTAAGGCGAATGTGATAGCAGATGCTCTTAGTAGAAAAGTATCACATTCGGCAGCACTTATTACTAGATAGACACCGTTACATCGAAACTTTGAGAGAGCTAAGATTACAGTATCCGTGGGGACGGTCACCTCACAGTTGGCTCAGTTAATGGTGCAACCAACGCTGAGGCGGAAGGTTATTGATGCTCAGAGTAGTGATCCTTATTTGGTGGAGAGACGTCGCCTCGTTGAAACAGGTCAGATTGATGAGTTTTCCATATCCTCTGATGGTGGACTAATGTTAGAGAGATGTTTGTGTGTGCCAACAAACAGTGCAATTAAGATTGACTTACTAGATGAAGCCCATAATTCCCTGTTTTCCATGCATCTTGGTAGTACGAAAATGTATCAGGATCTAAAATGATTTTATTAGTGGCGGAATATGAAAAGGAAAGTGGCAGAATTTGTTAGTAAGTGTCTAGTATGTCAGCAGGTTAAGGCACCTAGACAAAAACCAGCACGTTTGTTACAACCTTTAAGTGTGCCAGAGTGGAAGTGGGAGTATGTGTCGATGGATTTTATTACAGGGCTGCCTAGAACCCTGAAGGGTTTCACAGTGATTTGGGTTGTTGTAGACAGGCTTACGAAATCGGCACATTTCGTACCAGGGAAATCCACTTATACTGATAGTAAGTGGGCACAGTTGTATTTAACTGAGATTGTGAGACTGCATGGAGTGCCTGTGCCGATCGTTTCTGACAGGGATGCACGATTTACTTCCAAGTTTTGGAAGGGACTTCAGATTGCCATGGGTACGAGGTTAGATTTCAGTACAGCTTTTCATCCACAGACTGATGGTCAAACTGAACGTTTGAACCAAATTTTAGAGGATATGCTACGAGCTTGTGTTCTAGAGTTTCCAGGTAGTTGGGACTCTCATTTACATTTGATGGAGTTTGCTTATAACAACAGTTTCCATGCTACCATTGGTATGGCACCATTTGAGGCTTTATATGGTAAATGTTGTAGAACCCCTGTTTGTTGGAGTGAGGTTGGTGAACAAAGGTTGATGGGACCTG
AGTTGGTTCAGTCCACAAATGAGGCTATTCAGAAGATTAGAACACGTATGCAGATAGCGCACAGTAGACAGAAGAGTTATGCGGATGTGAGACGGAAAAACCTTGAGTTTGCGGTGGGAGATAAGGTATTCTTTAAGGTAGCACCTATAAAAGGTGTTATGCGTTTTGAGAAGAAAGCGAAGTTGAGTCCTCGTTTTGTTGGACCGTTTGAGATCTTAGAGAGAATTGGTGTTGTGGCGTATCGCTTGGCGCTACCACCATCTCTCTCCGCACTTCATAATGTTTTCCATGTTTCGATGTTGAGGAAGTATGTGGCAGATATATCTCATGTAGTGGACTATGAACCCTTGGAGATTGATGAGAATTTGAGCTATGTGGAACAACCTGTGGAGATTCTGGCTAGAGAGGTAAAGATGCTTCGTAATAGAAGCATTCCATTAATAAAGGTTTTGTGGCAGAATCATCGAATTGAAGAGGCGACATGGGAGCGAGAGGCTGAGATGAAAACTCGATATCCGGAGTTATTTCAGGATTAG

mRNA sequence

ATGTTCTCTAAGATTGACTTACGCTCAGGTTATCACCAGTTGAGGATTAGAGACAGTAATATTCCTAAGACTTCTTTTCGTTCGAGGTATGGGCATTATGAATTCATTGTAATGTCTTTTGGTTTGACCAACGCACCTGTAGTATTTATGGATTTGATGAATAGGGTGTTTAAGGATTTCTTAGACACTTTTGTGATAATTTTCATTGATGATATTTTGGTTTATTCCAAGACCGAGGCCGAACATGAGGAACATTTACATAAAGTATTAGAACTCTTCGATTGGTCTCGACCCTCTACAGTTAGTGAGGTTCGTAGTTTTCTGGGTTTAGCAGTGTATTACCGGAGGTTTGTGGAGGATTTTTCACATTTGGCTACTCCCTTGACTCAATTGACCAGGAAGAGAACTCTGTTTTTTTGGAGTCCAGCTTGTGAGGATAGTTTTCAGAACCTTAAGCAAAGTGATGCTTCCAAGAGAGGACTTGGTTTTGTTTTGATGCAGCAAGATAAGGTAGTTGCTAATGCCTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATTTGGAGTTGGCAACAGTGGTTTTTGCACTAAAGATATGGAGACATTACTTGTATGGTGAAAAGATACAAATCTTTACTGACCATAAGAGCCTGAAGTATTTCTTCACCCAGAAGGAGTTAAATACGAGACAGCGAAGGTGGCTCGAGTTGGTAAAAGATCATGACTATGAGATATTGTACCATCCTGGTAAGGCGAATGTGATAGCAGATGCTCTTAGTAGAAAAAGAGCTAAGATTACAGTATCCGTGGGGACGGTCACCTCACAGTTGGCTCAGTTAATGGTGCAACCAACGCTGAGGCGGAAGGTTATTGATGCTCAGAGTAGTGATCCTTATTTGGTGGAGAGACGTCGCCTCGTTGAAACAGGTCAGATTGATGAGTTTTCCATATCCTCTGATGGTGGACTAATGTTAGAGAGATGTTTGTGTGTGCCAACAAACAGTGCAATTAAGATTGACTTACTAGATGAAGCCCATAATTCCCTGTTTTCCATGCATCTTGGTATATGTCAGCAGGTTAAGGCACCTAGACAAAAACCAGCACGTTTGTTACAACCTTTAAGTGTGCCAGAGTGGAAGTGGGAGTATGTGTCGATGGATTTTATTACAGGGCTGCCTAGAACCCTGAAGGGTTTCACAGTGATTTGGGTTGTTGTAGACAGGCTTACGAAATCGGCACATTTCGTACCAGGGAAATCCACTTATACTGATAGTAAGTGGGCACAGTTGTATTTAACTGAGATTGTGAGACTGCATGGAGTGCCTGTGCCGATCGTTTCTGACAGGGATGCACGATTTACTTCCAAGTTTTGGAAGGGACTTCAGATTGCCATGGGTACGAGGTTAGATTTCAGTACAGCTTTTCATCCACAGACTGATGGTCAAACTGAACGTTTGAACCAAATTTTAGAGGATATGCTACGAGCTTGTGTTCTAGAGTTTCCAGGTAGTTGGGACTCTCATTTACATTTGATGGAGTTTGCTTATAACAACAGTTTCCATGCTACCATTGGTATGGCACCATTTGAGGCTTTATATGGTAAATGTTGTAGAACCCCTGTTTGTTGGAGTGAGGTTGGTGAACAAAGGTTGATGGGACCTGAGTTGGTTCAGTCCACAAATGAGGCTATTCAGAAGATTAGAACACGTATGCAGATAGCGCACAGTAGACAGAAGAGTTATGCGGATGTGAGACGGAAAAACCTTGAGTTTGCGGTGGGAGATAAGGTATTCTTTAAGGTAGCACCTATAAAAGGTGTTATGCGTTTTGAGAAGAAAGCGAAGTTGAGTCCTCGTTTTGTTGGACCGTTTGAGATCTTAGAGAGAATTGGTGTTGTGGCGTATCGCTTGGCGCTACCACCATCTCTCTCCGCACTTCATAATGTTTTCCATGTTTCGATGTTGAGGAAGTATGTGGCAGATATATCTCA
TGTAGTGGACTATGAACCCTTGGAGATTGATGAGAATTTGAGCTATGTGGAACAACCTGTGGAGATTCTGGCTAGAGAGGTAAAGATGCTTCGTAATAGAAGCATTCCATTAATAAAGGTTTTGTGGCAGAATCATCGAATTGAAGAGGCGACATGGGAGCGAGAGGCTGAGATGAAAACTCGATATCCGGAGTTATTTCAGGATTAG

Coding sequence (CDS)

ATGTTCTCTAAGATTGACTTACGCTCAGGTTATCACCAGTTGAGGATTAGAGACAGTAATATTCCTAAGACTTCTTTTCGTTCGAGGTATGGGCATTATGAATTCATTGTAATGTCTTTTGGTTTGACCAACGCACCTGTAGTATTTATGGATTTGATGAATAGGGTGTTTAAGGATTTCTTAGACACTTTTGTGATAATTTTCATTGATGATATTTTGGTTTATTCCAAGACCGAGGCCGAACATGAGGAACATTTACATAAAGTATTAGAACTCTTCGATTGGTCTCGACCCTCTACAGTTAGTGAGGTTCGTAGTTTTCTGGGTTTAGCAGTGTATTACCGGAGGTTTGTGGAGGATTTTTCACATTTGGCTACTCCCTTGACTCAATTGACCAGGAAGAGAACTCTGTTTTTTTGGAGTCCAGCTTGTGAGGATAGTTTTCAGAACCTTAAGCAAAGTGATGCTTCCAAGAGAGGACTTGGTTTTGTTTTGATGCAGCAAGATAAGGTAGTTGCTAATGCCTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATTTGGAGTTGGCAACAGTGGTTTTTGCACTAAAGATATGGAGACATTACTTGTATGGTGAAAAGATACAAATCTTTACTGACCATAAGAGCCTGAAGTATTTCTTCACCCAGAAGGAGTTAAATACGAGACAGCGAAGGTGGCTCGAGTTGGTAAAAGATCATGACTATGAGATATTGTACCATCCTGGTAAGGCGAATGTGATAGCAGATGCTCTTAGTAGAAAAAGAGCTAAGATTACAGTATCCGTGGGGACGGTCACCTCACAGTTGGCTCAGTTAATGGTGCAACCAACGCTGAGGCGGAAGGTTATTGATGCTCAGAGTAGTGATCCTTATTTGGTGGAGAGACGTCGCCTCGTTGAAACAGGTCAGATTGATGAGTTTTCCATATCCTCTGATGGTGGACTAATGTTAGAGAGATGTTTGTGTGTGCCAACAAACAGTGCAATTAAGATTGACTTACTAGATGAAGCCCATAATTCCCTGTTTTCCATGCATCTTGGTATATGTCAGCAGGTTAAGGCACCTAGACAAAAACCAGCACGTTTGTTACAACCTTTAAGTGTGCCAGAGTGGAAGTGGGAGTATGTGTCGATGGATTTTATTACAGGGCTGCCTAGAACCCTGAAGGGTTTCACAGTGATTTGGGTTGTTGTAGACAGGCTTACGAAATCGGCACATTTCGTACCAGGGAAATCCACTTATACTGATAGTAAGTGGGCACAGTTGTATTTAACTGAGATTGTGAGACTGCATGGAGTGCCTGTGCCGATCGTTTCTGACAGGGATGCACGATTTACTTCCAAGTTTTGGAAGGGACTTCAGATTGCCATGGGTACGAGGTTAGATTTCAGTACAGCTTTTCATCCACAGACTGATGGTCAAACTGAACGTTTGAACCAAATTTTAGAGGATATGCTACGAGCTTGTGTTCTAGAGTTTCCAGGTAGTTGGGACTCTCATTTACATTTGATGGAGTTTGCTTATAACAACAGTTTCCATGCTACCATTGGTATGGCACCATTTGAGGCTTTATATGGTAAATGTTGTAGAACCCCTGTTTGTTGGAGTGAGGTTGGTGAACAAAGGTTGATGGGACCTGAGTTGGTTCAGTCCACAAATGAGGCTATTCAGAAGATTAGAACACGTATGCAGATAGCGCACAGTAGACAGAAGAGTTATGCGGATGTGAGACGGAAAAACCTTGAGTTTGCGGTGGGAGATAAGGTATTCTTTAAGGTAGCACCTATAAAAGGTGTTATGCGTTTTGAGAAGAAAGCGAAGTTGAGTCCTCGTTTTGTTGGACCGTTTGAGATCTTAGAGAGAATTGGTGTTGTGGCGTATCGCTTGGCGCTACCACCATCTCTCTCCGCACTTCATAATGTTTTCCATGTTTCGATGTTGAGGAAGTATGTGGCAGATATATCTCA
TGTAGTGGACTATGAACCCTTGGAGATTGATGAGAATTTGAGCTATGTGGAACAACCTGTGGAGATTCTGGCTAGAGAGGTAAAGATGCTTCGTAATAGAAGCATTCCATTAATAAAGGTTTTGTGGCAGAATCATCGAATTGAAGAGGCGACATGGGAGCGAGAGGCTGAGATGAAAACTCGATATCCGGAGTTATTTCAGGATTAG
BLAST of CSPI03G21630 vs. Swiss-Prot
Match: TF25_SCHPO (Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-5 PE=3 SV=1)

HSP 1 Score: 176.0 bits (445), Expect = 1.5e-42
Identity = 106/305 (34.75%), Positives = 159/305 (52.13%), Query Frame = 1

Query: 360  CQQVKAPRQKPARLLQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLTKSAHFV 419
            CQ  K+   KP   LQP+   E  WE +SMDFIT LP +  G+  ++VVVDR +K A  V
Sbjct: 960  CQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILV 1019

Query: 420  PGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFH 479
            P   + T  + A+++   ++   G P  I++D D  FTS+ WK         + FS  + 
Sbjct: 1020 PCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYR 1079

Query: 480  PQTDGQTERLNQILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYG-K 539
            PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+ H+   M PFE ++   
Sbjct: 1080 PQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYS 1139

Query: 540  CCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNL-EFAVG 599
               +P+      ++     E  Q T +  Q ++  +   + + K Y D++ + + EF  G
Sbjct: 1140 PALSPLELPSFSDKT---DENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPG 1199

Query: 600  DKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSAL-HNVFHVS 659
            D V  K     G +   K  KL+P F GPF +L++ G   Y L LP S+  +  + FHVS
Sbjct: 1200 DLVMVKRTK-TGFL--HKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVS 1257

Query: 660  MLRKY 662
             L KY
Sbjct: 1260 HLEKY 1257
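
The percentages on an HSP summary line follow directly from the residue counts, so they can be recomputed as a sanity check. The sketch below is illustrative only: `hsp_percentages` is a hypothetical helper, and the regular expression assumes the summary line has the layout shown in this record (it accepts both the "Positives" and "Postives" spellings).

```python
import re

# Matches e.g. "Identity = 106/305 (34.75%), Positives = 159/305 (52.13%)"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Recompute identity and positive percentages from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )

line = "Identity = 106/305 (34.75%), Positives = 159/305 (52.13%), Query Frame = 1"
print(hsp_percentages(line))
```

Comparing the recomputed values with the printed ones is a quick way to catch a summary line that was damaged during copying.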

BLAST of CSPI03G21630 vs. Swiss-Prot
Match: TF26_SCHPO (Transposon Tf2-6 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-6 PE=3 SV=1)

HSP 1 Score: 176.0 bits (445), Expect = 1.5e-42
Identity = 106/305 (34.75%), Positives = 159/305 (52.13%), Query Frame = 1

Query: 360  CQQVKAPRQKPARLLQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLTKSAHFV 419
            CQ  K+   KP   LQP+   E  WE +SMDFIT LP +  G+  ++VVVDR +K A  V
Sbjct: 960  CQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILV 1019

Query: 420  PGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFH 479
            P   + T  + A+++   ++   G P  I++D D  FTS+ WK         + FS  + 
Sbjct: 1020 PCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYR 1079

Query: 480  PQTDGQTERLNQILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYG-K 539
            PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+ H+   M PFE ++   
Sbjct: 1080 PQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYS 1139

Query: 540  CCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNL-EFAVG 599
               +P+      ++     E  Q T +  Q ++  +   + + K Y D++ + + EF  G
Sbjct: 1140 PALSPLELPSFSDKT---DENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPG 1199

Query: 600  DKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSAL-HNVFHVS 659
            D V  K     G +   K  KL+P F GPF +L++ G   Y L LP S+  +  + FHVS
Sbjct: 1200 DLVMVKRTK-TGFL--HKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVS 1257

Query: 660  MLRKY 662
             L KY
Sbjct: 1260 HLEKY 1257

BLAST of CSPI03G21630 vs. Swiss-Prot
Match: TF24_SCHPO (Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-4 PE=3 SV=1)

HSP 1 Score: 176.0 bits (445), Expect = 1.5e-42
Identity = 106/305 (34.75%), Positives = 159/305 (52.13%), Query Frame = 1

Query: 360  CQQVKAPRQKPARLLQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLTKSAHFV 419
            CQ  K+   KP   LQP+   E  WE +SMDFIT LP +  G+  ++VVVDR +K A  V
Sbjct: 960  CQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILV 1019

Query: 420  PGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFH 479
            P   + T  + A+++   ++   G P  I++D D  FTS+ WK         + FS  + 
Sbjct: 1020 PCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYR 1079

Query: 480  PQTDGQTERLNQILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYG-K 539
            PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+ H+   M PFE ++   
Sbjct: 1080 PQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYS 1139

Query: 540  CCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNL-EFAVG 599
               +P+      ++     E  Q T +  Q ++  +   + + K Y D++ + + EF  G
Sbjct: 1140 PALSPLELPSFSDKT---DENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPG 1199

Query: 600  DKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSAL-HNVFHVS 659
            D V  K     G +   K  KL+P F GPF +L++ G   Y L LP S+  +  + FHVS
Sbjct: 1200 DLVMVKRTK-TGFL--HKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVS 1257

Query: 660  MLRKY 662
             L KY
Sbjct: 1260 HLEKY 1257

BLAST of CSPI03G21630 vs. Swiss-Prot
Match: TF29_SCHPO (Transposon Tf2-9 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-9 PE=3 SV=1)

HSP 1 Score: 176.0 bits (445), Expect = 1.5e-42
Identity = 106/305 (34.75%), Positives = 159/305 (52.13%), Query Frame = 1

Query: 360  CQQVKAPRQKPARLLQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLTKSAHFV 419
            CQ  K+   KP   LQP+   E  WE +SMDFIT LP +  G+  ++VVVDR +K A  V
Sbjct: 960  CQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILV 1019

Query: 420  PGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFH 479
            P   + T  + A+++   ++   G P  I++D D  FTS+ WK         + FS  + 
Sbjct: 1020 PCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYR 1079

Query: 480  PQTDGQTERLNQILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYG-K 539
            PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+ H+   M PFE ++   
Sbjct: 1080 PQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYS 1139

Query: 540  CCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNL-EFAVG 599
               +P+      ++     E  Q T +  Q ++  +   + + K Y D++ + + EF  G
Sbjct: 1140 PALSPLELPSFSDKT---DENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPG 1199

Query: 600  DKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSAL-HNVFHVS 659
            D V  K     G +   K  KL+P F GPF +L++ G   Y L LP S+  +  + FHVS
Sbjct: 1200 DLVMVKRTK-TGFL--HKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVS 1257

Query: 660  MLRKY 662
             L KY
Sbjct: 1260 HLEKY 1257

BLAST of CSPI03G21630 vs. Swiss-Prot
Match: TF23_SCHPO (Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-3 PE=1 SV=1)

HSP 1 Score: 176.0 bits (445), Expect = 1.5e-42
Identity = 106/305 (34.75%), Positives = 159/305 (52.13%), Query Frame = 1

Query: 360  CQQVKAPRQKPARLLQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLTKSAHFV 419
            CQ  K+   KP   LQP+   E  WE +SMDFIT LP +  G+  ++VVVDR +K A  V
Sbjct: 960  CQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILV 1019

Query: 420  PGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFH 479
            P   + T  + A+++   ++   G P  I++D D  FTS+ WK         + FS  + 
Sbjct: 1020 PCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYR 1079

Query: 480  PQTDGQTERLNQILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYG-K 539
            PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+ H+   M PFE ++   
Sbjct: 1080 PQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYS 1139

Query: 540  CCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNL-EFAVG 599
               +P+      ++     E  Q T +  Q ++  +   + + K Y D++ + + EF  G
Sbjct: 1140 PALSPLELPSFSDKT---DENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPG 1199

Query: 600  DKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSAL-HNVFHVS 659
            D V  K     G +   K  KL+P F GPF +L++ G   Y L LP S+  +  + FHVS
Sbjct: 1200 DLVMVKRTK-TGFL--HKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVS 1257

Query: 660  MLRKY 662
             L KY
Sbjct: 1260 HLEKY 1257

BLAST of CSPI03G21630 vs. TrEMBL
Match: Q84KB0_CUCME (Pol protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 1146.3 bits (2964), Expect = 0.0e+00
Identity = 596/844 (70.62%), Positives = 653/844 (77.37%), Query Frame = 1

Query: 1   MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDF 60
           +FSKIDLRSGYHQLRI+D ++PKT+FRSRYGHY+FIVMSFGLTNAP VFMDLMNRVF++F
Sbjct: 78  VFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREF 137

Query: 61  LDTFVIIFIDDILVYSKTEAEHEEHLHKVLEL--------------FDWSRPSTVSEVRS 120
           LDTFVI+FIDDIL+YSKTEAEHEEHL  VL+               F   + S +  V S
Sbjct: 138 LDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVS 197

Query: 121 FLGLAVYYRRF--------------VEDFSHL--------------ATPLTQLTRKRTLF 180
             G++V   +               V  F  L              ATPLTQLTRK   F
Sbjct: 198 KAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPF 257

Query: 181 FWSPACEDSFQNLKQ--------------------SDASKRGLGFVLMQQDKVVANASRQ 240
            WS ACEDSFQ LKQ                    SDASK+GLG VLMQQ KVVA ASRQ
Sbjct: 258 VWSKACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQ 317

Query: 241 LKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL 300
           LKSHEQNYPTHDLELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELN RQRRWL
Sbjct: 318 LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWL 377

Query: 301 ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQ 360
           ELVKD+D EILYHPGKANV+ADALSRK                   RA+I V VG VT Q
Sbjct: 378 ELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVLVGAVTMQ 437

Query: 361 LAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFSISSDGGLMLERCLCVPTNSA 420
           LAQL VQPTLR+++IDAQS+DPYLVE+R L E GQ  EFS+SSDGGL+ ER LCVP++SA
Sbjct: 438 LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSA 497

Query: 421 IKIDLLDEAHNSLFSMHLGICQQVKAPRQ------------------------------K 480
           +K +LL EAH+S FSMH G  + V  P                                K
Sbjct: 498 VKTELLSEAHSSPFSMHPGSTEDVSGPEAGFIGGRNMKREVAEFVSKCLVCQQVKAPRQK 557

Query: 481 PARLLQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSK 540
           PA LLQPLS+PEWKWE VSMDFITGLPRTL+GFTVIWVVVDRLTKSAHFVPGKSTYT SK
Sbjct: 558 PAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASK 617

Query: 541 WAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERL 600
           WAQLY++EIVRLHGVPV IVSDRDARFTSKFWKGLQ AMGTRLDFSTAFHPQTDGQTERL
Sbjct: 618 WAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERL 677

Query: 601 NQILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEV 660
           NQ+LEDMLRAC LEFPGSWDSHLHLMEFAYNNS+ ATIGMAPFEALYG+CCR+PVCW EV
Sbjct: 678 NQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGRCCRSPVCWGEV 737

Query: 661 GEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNLEFAVGDKVFFKVAPIKG 720
           GEQRLMGPELVQSTNEAIQKIR+RM  A SRQKSYADVRRK+LEF VGDKVF KVAP+KG
Sbjct: 738 GEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMKG 797

Query: 721 VMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVV 734
           V+RFE++ KLSPRFVGPFEILERIG VAYRLALPPSLS +H+VFHVSMLRKYV D SHVV
Sbjct: 798 VLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVV 857

BLAST of CSPI03G21630 vs. TrEMBL
Match: E5GCE2_CUCME (Ty3-gypsy retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 872.1 bits (2252), Expect = 4.9e-250
Identity = 474/724 (65.47%), Positives = 531/724 (73.34%), Query Frame = 1

Query: 1    MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDF 60
            +FSKIDLRSGYHQLRI+D ++PKT+F SRYGHYEFIVMSF LTNAP VFMDLMNRVF++F
Sbjct: 538  LFSKIDLRSGYHQLRIKDRDVPKTAFHSRYGHYEFIVMSFALTNAPSVFMDLMNRVFREF 597

Query: 61   LDTFVIIFIDDILVYSKTEAEHEEHLHKVLELFDWSRPST--------VSEVRSFLGLAV 120
            LDTFVI+FI+DIL+YSK EAEHEEHL  VL+    ++           + +V SFLG  V
Sbjct: 598  LDTFVIVFINDILIYSKIEAEHEEHLRMVLQTLQDNKLYAKFLKCEFWLKQV-SFLGHVV 657

Query: 121  YYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ------------------- 180
                   D + +   +T   R  T+   S ACEDSFQNLKQ                   
Sbjct: 658  SKAGVSVDLAKIEA-VTSWPRPSTV---SEACEDSFQNLKQKLVTTPVLTVPDGSGSFVI 717

Query: 181  -SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKI 240
             SDASK+G G VLMQQ KVVA ASRQLKSHEQNYPTHDLELA VVFALKIWRHYLYG+KI
Sbjct: 718  YSDASKKGFGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGQKI 777

Query: 241  QIFTDHKSLKYFFTQKELNTRQRRWLELVKDHDYEILYHPGKANVIADALSRKRAKITVS 300
            QIFT HK+LKYFFTQKELN RQRRWLELVKD+D EILYHPGKANV+ADALSRK       
Sbjct: 778  QIFTYHKNLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRK------- 837

Query: 301  VGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFSISSDGGLMLERCL 360
                   LAQL VQPTLR+K+IDAQS++PYLV +R L ETGQ  EFSISSDGGL+ ER L
Sbjct: 838  -------LAQLTVQPTLRQKIIDAQSNNPYLVGKRGLAETGQAVEFSISSDGGLLFERRL 897

Query: 361  CVPTNSAIKIDLLDEAHNSLFSMHLG---ICQQVKAP------RQKPARL---------- 420
             VP++SA+K +LL EAH+S FSMH G   + Q +K        +++ A            
Sbjct: 898  YVPSDSAVKTELLSEAHSSPFSMHSGSTKMYQHLKRVYWWSNIKREVAEFVSKCLVCQQV 957

Query: 421  ----------LQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKS 480
                      LQPLSVP+WKWE VSMDFITGLPRTL+GFTVIWVVVDRLTKSAHF+ GKS
Sbjct: 958  KAPRQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFIQGKS 1017

Query: 481  TYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTD 540
            TYT SKWAQLY++EIVRLHGVPV IVS+RDARFTSKF KGLQ AMGTRLDFST FHPQTD
Sbjct: 1018 TYTASKWAQLYMSEIVRLHGVPVSIVSNRDARFTSKFLKGLQAAMGTRLDFSTTFHPQTD 1077

Query: 541  GQTERLNQILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTP 600
             QTERLNQ+LEDMLRA  L FPGSWDSHLHLMEFAYNNSF ATI MAPFEALY K CR+P
Sbjct: 1078 CQTERLNQVLEDMLRAYALGFPGSWDSHLHLMEFAYNNSFQATIDMAPFEALYSKRCRSP 1137

Query: 601  VCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNLEFAVGDKVFFK 660
            +CW E                              SRQKSYADVR K+LEF VGDKVF K
Sbjct: 1138 LCWGE------------------------------SRQKSYADVRWKDLEFDVGDKVFLK 1197

Query: 661  VAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVA 668
            VAP+KGV+RFE+  KLSPRFV PFEILERIG VAYRLALPPSLSA+H+VFHVSMLRKY+ 
Sbjct: 1198 VAPMKGVLRFERSGKLSPRFVRPFEILERIGPVAYRLALPPSLSAVHDVFHVSMLRKYMP 1212

BLAST of CSPI03G21630 vs. TrEMBL
Match: A5BA10_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_009489 PE=4 SV=1)

HSP 1 Score: 839.3 bits (2167), Expect = 3.5e-240
Identity = 434/792 (54.80%), Positives = 551/792 (69.57%), Query Frame = 1

Query: 1   MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDF 60
           +FSKIDL+SGYHQL +R  ++PKT+FR+RYGHYEF+VM FGLTNAP  FMDLMNRVFK +
Sbjct: 193 VFSKIDLQSGYHQLMVRSEDVPKTAFRTRYGHYEFLVMPFGLTNAPAAFMDLMNRVFKPY 252

Query: 61  LDTFVIIFIDDILVYSKTEAEHEEHLHKVLELFDWSRPSTVSEVRSFLGLA-VYYRRFVE 120
           LD FV +FIDDIL+YS++  EHE HL  VL+        T+ + + +  L    + RF+E
Sbjct: 253 LDQFVAVFIDDILIYSRSREEHEGHLSIVLQ--------TLRDKQLYAKLKKCEFWRFIE 312

Query: 121 DFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ--------------------SDASKR 180
            FS +  PLT+LT+K   F WS  CE SFQ LK                     SDAS +
Sbjct: 313 GFSKIVLPLTKLTQKGVKFEWSDDCECSFQELKNRLVSAPILTIPSGSGGFVVYSDASHQ 372

Query: 181 GLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHK 240
           GLG VLMQ  +VVA ASRQLK +E+NYPTHD ELA VVFALKIWRH+L+GE  +IFTDHK
Sbjct: 373 GLGCVLMQHGRVVAYASRQLKPYERNYPTHDSELADVVFALKIWRHFLFGETCEIFTDHK 432

Query: 241 SLKYFFTQKELNTRQRRWLELVKDHDYEILYHPGKANVIADALSRKRAKITVSVGTVTSQ 300
           SLKY F+QK+LN RQRRW+EL+KD+DY I YH  KANV+ADALSRK      ++     Q
Sbjct: 433 SLKYLFSQKKLNMRQRRWIELLKDYDYIIQYHSRKANVVADALSRKSVGSLTAIRGCQRQ 492

Query: 301 L--------------------AQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS 360
           L                    A   VQP L  ++   Q +D  LV+    V+ G   +F 
Sbjct: 493 LLEDLRSLQVHMRVLDSGALIANFRVQPDLVGRIKALQKNDLNLVQLMEEVKKGSKLDFV 552

Query: 361 ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLGICQQVKAPRQK---------- 420
           +S DG L     LCVP +  ++ +LL+EAH S F++H    +  K  RQ           
Sbjct: 553 LSDDGILRFGTRLCVPNDEDLRRELLEEAHCSKFAIHPERTKMYKDLRQNYWWSGMKCDI 612

Query: 421 ---PARLL---QPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKS 480
               A+ L   QPL++PEWKWE+++MDF+ GLPRTL G   IWV+VDRLTKSAHF+P K 
Sbjct: 613 AQFVAQCLVCQQPLAIPEWKWEHITMDFVIGLPRTLGGNNAIWVIVDRLTKSAHFLPMKV 672

Query: 481 TYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTD 540
            ++  + A LY+ EIVR+HGVPV IVSDRD RFTS+FW  LQ ++GT+L FSTAFHPQTD
Sbjct: 673 NFSLDRLASLYVKEIVRMHGVPVSIVSDRDPRFTSRFWHSLQKSLGTKLSFSTAFHPQTD 732

Query: 541 GQTERLNQILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTP 600
           GQ+ER+ Q+LED+ RAC+L+  G+WD HL L+EFAYNNSF A+IGMAPFEALYG+ CR+P
Sbjct: 733 GQSERVIQVLEDLFRACILDLQGNWDDHLPLVEFAYNNSFQASIGMAPFEALYGRKCRSP 792

Query: 601 VCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNLEFAVGDKVFFK 660
           +CW++VGE++L+GPELVQ T E +  I+ R++ A SR KSY D RR++LEF VGD VF K
Sbjct: 793 ICWNDVGERKLLGPELVQLTVEKVALIKERLKAAQSRHKSYVDHRRRDLEFEVGDHVFLK 852

Query: 661 VAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVA 720
           V+P+K VMRF +K KLSPRFVG FEILER+G +AY++ALPPSLS +HNVFHVS LRKY+ 
Sbjct: 853 VSPMKSVMRFGRKGKLSPRFVGLFEILERVGTLAYKVALPPSLSKVHNVFHVSTLRKYIY 912

Query: 721 DISHVVDYEPLEIDENLSYVEQPVEILAREVKMLRNRSIPLIKVLWQNHRIEEATWEREA 736
           D SHVVD EP++I E+L+Y E PV+I+    K+LR+  + L+KV W NH I EATWE E 
Sbjct: 913 DPSHVVDLEPIQIFEDLTYEEVPVQIVDMMDKVLRHAVVKLVKVQWSNHSIREATWELEE 972

BLAST of CSPI03G21630 vs. TrEMBL
Match: Q2QTC6_ORYSJ (Retrotransposon protein, putative, Ty3-gypsy subclass OS=Oryza sativa subsp. japonica GN=LOC_Os12g20110 PE=4 SV=1)

HSP 1 Score: 839.3 bits (2167), Expect = 3.5e-240
Identity = 431/841 (51.25%), Positives = 560/841 (66.59%), Query Frame = 1

Query: 1    MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDF 60
            +FSKIDLRSGYHQLRIR+ +IPKT+F +RYG +E  VMSFGLTNAP  FM+LMN+VF ++
Sbjct: 850  VFSKIDLRSGYHQLRIREEDIPKTAFTTRYGLFECTVMSFGLTNAPAFFMNLMNKVFMEY 909

Query: 61   LDTFVIIFIDDILVYSKTEAEHEEHLHKVLE----------------------------- 120
            LD FV++FIDDIL+YSKT+ EHEEHL   LE                             
Sbjct: 910  LDKFVVVFIDDILIYSKTKEEHEEHLRLALEKLREHQLYAKFSKCEFWLSEVKFLGHVIS 969

Query: 121  -------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLF 180
                         +  W +P TVSE+RSFLGLA YYRRF+E+FS +A P+T+L +K   +
Sbjct: 970  SGGVAVDPSNVESVLSWKQPKTVSEIRSFLGLARYYRRFIENFSKIARPMTRLLQKEVKY 1029

Query: 181  FWSPACEDSFQNLKQS--------------------DASKRGLGFVLMQQDKVVANASRQ 240
             W+  CE SFQ LK+                     DAS+ GLG VLMQ+ KVVA ASRQ
Sbjct: 1030 KWTEDCERSFQELKKRLVTAPVLILPNSRKGFQVYCDASRHGLGCVLMQEGKVVAYASRQ 1089

Query: 241  LKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL 300
            L+ HE NYPTHDLELA VV ALKIWRHYL+G + +++TDHKSLKY FTQ +LN RQRRWL
Sbjct: 1090 LRPHENNYPTHDLELAAVVHALKIWRHYLFGNRTEMYTDHKSLKYIFTQPDLNMRQRRWL 1149

Query: 301  ELVKDHDYEILYHPGKANVIADALSRK----------------RAKITVSVGTVTSQ-LA 360
            EL+KD+D EI YHPGKANV+ADALSRK                +    +++G V+   +A
Sbjct: 1150 ELIKDYDMEIHYHPGKANVVADALSRKSYCNMSEGRRLPWKLCQEFEKLNLGIVSKGFVA 1209

Query: 361  QLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFSISSDGGLMLERCLCVPTNSAIK 420
             L  QPTL  +V +AQ +DP + E ++ +  G+   +     G + L   +CVP N  +K
Sbjct: 1210 TLEAQPTLFDQVREAQVNDPDIQEIKKNMRRGKAIGYVEDEQGTVWLGERICVPENKELK 1269

Query: 421  IDLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPAR 480
              ++ EAH +L+S+H G                             +CQ+VKA  QKPA 
Sbjct: 1270 DTIMKEAHETLYSIHPGSTKMYQDLKQQFWWASMRREIAEYVALCDVCQRVKAEHQKPAG 1329

Query: 481  LLQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQ 540
            LLQPL +PEWKWE + MDFITGLPRT  G   IWVVVDRLTK AHF+P K+TYT +K A+
Sbjct: 1330 LLQPLKIPEWKWEEIGMDFITGLPRTSAGHDSIWVVVDRLTKVAHFIPVKTTYTGNKLAE 1389

Query: 541  LYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLNQI 600
            LY+  +V LHGVP  IVSDR ++FTSKFW+ LQ+ MGTRL+FSTA+HPQTDGQTER+NQI
Sbjct: 1390 LYMARVVCLHGVPKKIVSDRGSQFTSKFWQKLQVEMGTRLNFSTAYHPQTDGQTERVNQI 1449

Query: 601  LEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQ 660
            LEDMLRACVL+F GSWD +L   EF+YNNS+ A++ MAP+EALYG+ CRTP+ W + GE+
Sbjct: 1450 LEDMLRACVLDFGGSWDKNLPYAEFSYNNSYQASLQMAPYEALYGRKCRTPILWDQTGER 1509

Query: 661  RLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNLEFAVGDKVFFKVAPIKGVMR 720
            ++ G ++++   E ++ I+ R+++A SRQKSYAD RR++L F  GD V+ +V P++GV R
Sbjct: 1510 QVFGTDILREAEEKVKIIQERLRVAQSRQKSYADNRRRDLAFEEGDYVYLRVTPLRGVHR 1569

Query: 721  FEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYE 734
            F+ K KL+PRFVGPF+I+ R G VAY+L LPPS++ +H+VFHVS L+K +   +   D +
Sbjct: 1570 FQTKGKLAPRFVGPFKIVSRRGEVAYQLELPPSMAGIHDVFHVSQLKKCLRVPTEEADPD 1629

BLAST of CSPI03G21630 vs. TrEMBL
Match: Q2R8I6_ORYSJ (Retrotransposon protein, putative, Ty3-gypsy subclass OS=Oryza sativa subsp. japonica GN=LOC_Os11g12060 PE=4 SV=1)

HSP 1 Score: 834.7 bits (2155), Expect = 8.6e-239
Identity = 430/841 (51.13%), Positives = 560/841 (66.59%), Query Frame = 1

Query: 1    MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDF 60
            +FSKIDLRSGYHQLRIR+ +IPKT+F +RYG +E  VMSFGLTNAP  FM+LMN+VF ++
Sbjct: 631  VFSKIDLRSGYHQLRIREEDIPKTAFTTRYGLFECTVMSFGLTNAPAFFMNLMNKVFMEY 690

Query: 61   LDTFVIIFIDDILVYSKTEAEHEEHLHKVLE----------------------------- 120
            LD FV++FIDDIL+YSKT+ EHEEHL   LE                             
Sbjct: 691  LDKFVVVFIDDILIYSKTKEEHEEHLRLALEKLREHQLYAKFSKCEFWLSEVKFLGHVIS 750

Query: 121  -------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLF 180
                         +  W +P TVSE+RSFLGLA YYRRF+E+FS +A P+T+L +K   +
Sbjct: 751  SGGVAVDPSNVESVLSWKQPKTVSEIRSFLGLAGYYRRFIENFSKIARPMTRLLQKEVKY 810

Query: 181  FWSPACEDSFQNLKQS--------------------DASKRGLGFVLMQQDKVVANASRQ 240
             W+  CE SFQ LK+                     DAS+ GLG VLMQ+ KVVA ASRQ
Sbjct: 811  KWTEDCERSFQELKKRLVTAPVLILPDSRKGFQVYCDASRLGLGCVLMQEGKVVAYASRQ 870

Query: 241  LKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL 300
            L+ HE NYPTHDLELA VV ALKIWRHYL+G + +I+TDHKSLKY FTQ +LN RQRRWL
Sbjct: 871  LRPHENNYPTHDLELAAVVHALKIWRHYLFGNRTEIYTDHKSLKYIFTQPDLNMRQRRWL 930

Query: 301  ELVKDHDYEILYHPGKANVIADALSRK----------------RAKITVSVGTVTSQ-LA 360
            EL+KD+D EI YHPGKANV+ADALSRK                +    +++G V++  +A
Sbjct: 931  ELIKDYDMEIHYHPGKANVVADALSRKSYCNMSEGRRLPWELCQEFEKLNLGIVSNGFVA 990

Query: 361  QLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFSISSDGGLMLERCLCVPTNSAIK 420
             L  +PTL  +V +AQ +DP + E ++ +  G+   +     G + L   +CVP N  +K
Sbjct: 991  ALEAKPTLFDQVREAQVNDPDIQEIKKNMRRGKAIGYVEDEQGTVWLGERICVPENKGLK 1050

Query: 421  IDLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPAR 480
              ++ EAH +L+S+H G                             +CQ+VKA  QKPA 
Sbjct: 1051 DTIMKEAHETLYSIHPGSTKMYQDLKQQFWWASMRREIAEYVALCDVCQRVKAEHQKPAG 1110

Query: 481  LLQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQ 540
            LLQPL +PEWKWE + MDFITGLPRT  G   IWVVVDRLTK AHF+P K+TYT +K A+
Sbjct: 1111 LLQPLKIPEWKWEEIGMDFITGLPRTSAGHDSIWVVVDRLTKVAHFIPVKTTYTGNKLAE 1170

Query: 541  LYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLNQI 600
            LY+  +V LHGVP  IVSDR ++FTSKFW+ LQ+ MGTRL+FSTA+HPQTDGQTER+NQI
Sbjct: 1171 LYMARVVCLHGVPKKIVSDRGSQFTSKFWQKLQLEMGTRLNFSTAYHPQTDGQTERINQI 1230

Query: 601  LEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQ 660
            LEDMLRACVL+F GSWD +L   EF+YNNS+ A++ MAP+EALYG+ CRTP+ W + GE+
Sbjct: 1231 LEDMLRACVLDFGGSWDKNLPYAEFSYNNSYQASLQMAPYEALYGRKCRTPLLWDQTGER 1290

Query: 661  RLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNLEFAVGDKVFFKVAPIKGVMR 720
            ++ G ++++   E ++ I+ R+++A SRQKSYAD RR++L F  GD V+ +V P++GV R
Sbjct: 1291 QVFGTDILREAEEKVKIIQERLRVAQSRQKSYADNRRRDLAFEEGDYVYLRVTPLRGVHR 1350

Query: 721  FEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYE 734
            F+ K KL+PRFVGPF+I+ R G VAY+L LPPS++ +H+VFHVS L+K +   +   D +
Sbjct: 1351 FQTKGKLAPRFVGPFKIVSRRGEVAYQLELPPSMAGIHDVFHVSQLKKCLRVPTEEADPD 1410

BLAST of CSPI03G21630 vs. TAIR10
Match: ATMG00860.1 (ATMG00860.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 53.5 bits (127), Expect = 6.3e-07
Identity = 25/65 (38.46%), Positives = 39/65 (60.00%), Query Frame = 1

Query: 88  KVLELFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDS 147
           K+  +  W  P   +E+R FLGL  YYRRFV+++  +  PLT+L +K +L  W+     +
Sbjct: 50  KLEAMVGWPEPKNTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNSL-KWTEMAALA 109

Query: 148 FQNLK 153
           F+ LK
Sbjct: 110 FKALK 113

BLAST of CSPI03G21630 vs. NCBI nr
Match: gi|28558781|gb|AAO45752.1| (pol protein [Cucumis melo subsp. melo])

HSP 1 Score: 1146.3 bits (2964), Expect = 0.0e+00
Identity = 596/844 (70.62%), Positives = 653/844 (77.37%), Query Frame = 1

Query: 1   MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDF 60
           +FSKIDLRSGYHQLRI+D ++PKT+FRSRYGHY+FIVMSFGLTNAP VFMDLMNRVF++F
Sbjct: 78  VFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREF 137

Query: 61  LDTFVIIFIDDILVYSKTEAEHEEHLHKVLEL--------------FDWSRPSTVSEVRS 120
           LDTFVI+FIDDIL+YSKTEAEHEEHL  VL+               F   + S +  V S
Sbjct: 138 LDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVS 197

Query: 121 FLGLAVYYRRF--------------VEDFSHL--------------ATPLTQLTRKRTLF 180
             G++V   +               V  F  L              ATPLTQLTRK   F
Sbjct: 198 KAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPF 257

Query: 181 FWSPACEDSFQNLKQ--------------------SDASKRGLGFVLMQQDKVVANASRQ 240
            WS ACEDSFQ LKQ                    SDASK+GLG VLMQQ KVVA ASRQ
Sbjct: 258 VWSKACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQ 317

Query: 241 LKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL 300
           LKSHEQNYPTHDLELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELN RQRRWL
Sbjct: 318 LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWL 377

Query: 301 ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQ 360
           ELVKD+D EILYHPGKANV+ADALSRK                   RA+I V VG VT Q
Sbjct: 378 ELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVLVGAVTMQ 437

Query: 361 LAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFSISSDGGLMLERCLCVPTNSA 420
           LAQL VQPTLR+++IDAQS+DPYLVE+R L E GQ  EFS+SSDGGL+ ER LCVP++SA
Sbjct: 438 LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSA 497

Query: 421 IKIDLLDEAHNSLFSMHLGICQQVKAPRQ------------------------------K 480
           +K +LL EAH+S FSMH G  + V  P                                K
Sbjct: 498 VKTELLSEAHSSPFSMHPGSTEDVSGPEAGFIGGRNMKREVAEFVSKCLVCQQVKAPRQK 557

Query: 481 PARLLQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSK 540
           PA LLQPLS+PEWKWE VSMDFITGLPRTL+GFTVIWVVVDRLTKSAHFVPGKSTYT SK
Sbjct: 558 PAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASK 617

Query: 541 WAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERL 600
           WAQLY++EIVRLHGVPV IVSDRDARFTSKFWKGLQ AMGTRLDFSTAFHPQTDGQTERL
Sbjct: 618 WAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERL 677

Query: 601 NQILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEV 660
           NQ+LEDMLRAC LEFPGSWDSHLHLMEFAYNNS+ ATIGMAPFEALYG+CCR+PVCW EV
Sbjct: 678 NQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGRCCRSPVCWGEV 737

Query: 661 GEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNLEFAVGDKVFFKVAPIKG 720
           GEQRLMGPELVQSTNEAIQKIR+RM  A SRQKSYADVRRK+LEF VGDKVF KVAP+KG
Sbjct: 738 GEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMKG 797

Query: 721 VMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVV 734
           V+RFE++ KLSPRFVGPFEILERIG VAYRLALPPSLS +H+VFHVSMLRKYV D SHVV
Sbjct: 798 VLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVV 857

BLAST of CSPI03G21630 vs. NCBI nr
Match: gi|307136318|gb|ADN34141.1| (ty3-gypsy retrotransposon protein [Cucumis melo subsp. melo])

HSP 1 Score: 872.1 bits (2252), Expect = 7.0e-250
Identity = 474/724 (65.47%), Positives = 531/724 (73.34%), Query Frame = 1

Query: 1    MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDF 60
            +FSKIDLRSGYHQLRI+D ++PKT+F SRYGHYEFIVMSF LTNAP VFMDLMNRVF++F
Sbjct: 538  LFSKIDLRSGYHQLRIKDRDVPKTAFHSRYGHYEFIVMSFALTNAPSVFMDLMNRVFREF 597

Query: 61   LDTFVIIFIDDILVYSKTEAEHEEHLHKVLELFDWSRPST--------VSEVRSFLGLAV 120
            LDTFVI+FI+DIL+YSK EAEHEEHL  VL+    ++           + +V SFLG  V
Sbjct: 598  LDTFVIVFINDILIYSKIEAEHEEHLRMVLQTLQDNKLYAKFLKCEFWLKQV-SFLGHVV 657

Query: 121  YYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ------------------- 180
                   D + +   +T   R  T+   S ACEDSFQNLKQ                   
Sbjct: 658  SKAGVSVDLAKIEA-VTSWPRPSTV---SEACEDSFQNLKQKLVTTPVLTVPDGSGSFVI 717

Query: 181  -SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKI 240
             SDASK+G G VLMQQ KVVA ASRQLKSHEQNYPTHDLELA VVFALKIWRHYLYG+KI
Sbjct: 718  YSDASKKGFGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGQKI 777

Query: 241  QIFTDHKSLKYFFTQKELNTRQRRWLELVKDHDYEILYHPGKANVIADALSRKRAKITVS 300
            QIFT HK+LKYFFTQKELN RQRRWLELVKD+D EILYHPGKANV+ADALSRK       
Sbjct: 778  QIFTYHKNLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRK------- 837

Query: 301  VGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFSISSDGGLMLERCL 360
                   LAQL VQPTLR+K+IDAQS++PYLV +R L ETGQ  EFSISSDGGL+ ER L
Sbjct: 838  -------LAQLTVQPTLRQKIIDAQSNNPYLVGKRGLAETGQAVEFSISSDGGLLFERRL 897

Query: 361  CVPTNSAIKIDLLDEAHNSLFSMHLG---ICQQVKAP------RQKPARL---------- 420
             VP++SA+K +LL EAH+S FSMH G   + Q +K        +++ A            
Sbjct: 898  YVPSDSAVKTELLSEAHSSPFSMHSGSTKMYQHLKRVYWWSNIKREVAEFVSKCLVCQQV 957

Query: 421  ----------LQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKS 480
                      LQPLSVP+WKWE VSMDFITGLPRTL+GFTVIWVVVDRLTKSAHF+ GKS
Sbjct: 958  KAPRQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFIQGKS 1017

Query: 481  TYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTD 540
            TYT SKWAQLY++EIVRLHGVPV IVS+RDARFTSKF KGLQ AMGTRLDFST FHPQTD
Sbjct: 1018 TYTASKWAQLYMSEIVRLHGVPVSIVSNRDARFTSKFLKGLQAAMGTRLDFSTTFHPQTD 1077

Query: 541  GQTERLNQILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTP 600
             QTERLNQ+LEDMLRA  L FPGSWDSHLHLMEFAYNNSF ATI MAPFEALY K CR+P
Sbjct: 1078 CQTERLNQVLEDMLRAYALGFPGSWDSHLHLMEFAYNNSFQATIDMAPFEALYSKRCRSP 1137

Query: 601  VCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNLEFAVGDKVFFK 660
            +CW E                              SRQKSYADVR K+LEF VGDKVF K
Sbjct: 1138 LCWGE------------------------------SRQKSYADVRWKDLEFDVGDKVFLK 1197

Query: 661  VAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVA 668
            VAP+KGV+RFE+  KLSPRFV PFEILERIG VAYRLALPPSLSA+H+VFHVSMLRKY+ 
Sbjct: 1198 VAPMKGVLRFERSGKLSPRFVRPFEILERIGPVAYRLALPPSLSAVHDVFHVSMLRKYMP 1212

BLAST of CSPI03G21630 vs. NCBI nr
Match: gi|77555016|gb|ABA97812.1| (retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group])

HSP 1 Score: 839.3 bits (2167), Expect = 5.0e-240
Identity = 431/841 (51.25%), Positives = 560/841 (66.59%), Query Frame = 1

Query: 1    MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDF 60
            +FSKIDLRSGYHQLRIR+ +IPKT+F +RYG +E  VMSFGLTNAP  FM+LMN+VF ++
Sbjct: 850  VFSKIDLRSGYHQLRIREEDIPKTAFTTRYGLFECTVMSFGLTNAPAFFMNLMNKVFMEY 909

Query: 61   LDTFVIIFIDDILVYSKTEAEHEEHLHKVLE----------------------------- 120
            LD FV++FIDDIL+YSKT+ EHEEHL   LE                             
Sbjct: 910  LDKFVVVFIDDILIYSKTKEEHEEHLRLALEKLREHQLYAKFSKCEFWLSEVKFLGHVIS 969

Query: 121  -------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLF 180
                         +  W +P TVSE+RSFLGLA YYRRF+E+FS +A P+T+L +K   +
Sbjct: 970  SGGVAVDPSNVESVLSWKQPKTVSEIRSFLGLARYYRRFIENFSKIARPMTRLLQKEVKY 1029

Query: 181  FWSPACEDSFQNLKQS--------------------DASKRGLGFVLMQQDKVVANASRQ 240
             W+  CE SFQ LK+                     DAS+ GLG VLMQ+ KVVA ASRQ
Sbjct: 1030 KWTEDCERSFQELKKRLVTAPVLILPNSRKGFQVYCDASRHGLGCVLMQEGKVVAYASRQ 1089

Query: 241  LKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL 300
            L+ HE NYPTHDLELA VV ALKIWRHYL+G + +++TDHKSLKY FTQ +LN RQRRWL
Sbjct: 1090 LRPHENNYPTHDLELAAVVHALKIWRHYLFGNRTEMYTDHKSLKYIFTQPDLNMRQRRWL 1149

Query: 301  ELVKDHDYEILYHPGKANVIADALSRK----------------RAKITVSVGTVTSQ-LA 360
            EL+KD+D EI YHPGKANV+ADALSRK                +    +++G V+   +A
Sbjct: 1150 ELIKDYDMEIHYHPGKANVVADALSRKSYCNMSEGRRLPWKLCQEFEKLNLGIVSKGFVA 1209

Query: 361  QLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFSISSDGGLMLERCLCVPTNSAIK 420
             L  QPTL  +V +AQ +DP + E ++ +  G+   +     G + L   +CVP N  +K
Sbjct: 1210 TLEAQPTLFDQVREAQVNDPDIQEIKKNMRRGKAIGYVEDEQGTVWLGERICVPENKELK 1269

Query: 421  IDLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPAR 480
              ++ EAH +L+S+H G                             +CQ+VKA  QKPA 
Sbjct: 1270 DTIMKEAHETLYSIHPGSTKMYQDLKQQFWWASMRREIAEYVALCDVCQRVKAEHQKPAG 1329

Query: 481  LLQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQ 540
            LLQPL +PEWKWE + MDFITGLPRT  G   IWVVVDRLTK AHF+P K+TYT +K A+
Sbjct: 1330 LLQPLKIPEWKWEEIGMDFITGLPRTSAGHDSIWVVVDRLTKVAHFIPVKTTYTGNKLAE 1389

Query: 541  LYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLNQI 600
            LY+  +V LHGVP  IVSDR ++FTSKFW+ LQ+ MGTRL+FSTA+HPQTDGQTER+NQI
Sbjct: 1390 LYMARVVCLHGVPKKIVSDRGSQFTSKFWQKLQVEMGTRLNFSTAYHPQTDGQTERVNQI 1449

Query: 601  LEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQ 660
            LEDMLRACVL+F GSWD +L   EF+YNNS+ A++ MAP+EALYG+ CRTP+ W + GE+
Sbjct: 1450 LEDMLRACVLDFGGSWDKNLPYAEFSYNNSYQASLQMAPYEALYGRKCRTPILWDQTGER 1509

Query: 661  RLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNLEFAVGDKVFFKVAPIKGVMR 720
            ++ G ++++   E ++ I+ R+++A SRQKSYAD RR++L F  GD V+ +V P++GV R
Sbjct: 1510 QVFGTDILREAEEKVKIIQERLRVAQSRQKSYADNRRRDLAFEEGDYVYLRVTPLRGVHR 1569

Query: 721  FEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYE 734
            F+ K KL+PRFVGPF+I+ R G VAY+L LPPS++ +H+VFHVS L+K +   +   D +
Sbjct: 1570 FQTKGKLAPRFVGPFKIVSRRGEVAYQLELPPSMAGIHDVFHVSQLKKCLRVPTEEADPD 1629

BLAST of CSPI03G21630 vs. NCBI nr
Match: gi|147769978|emb|CAN61139.1| (hypothetical protein VITISV_009489 [Vitis vinifera])

HSP 1 Score: 839.3 bits (2167), Expect = 5.0e-240
Identity = 434/792 (54.80%), Positives = 551/792 (69.57%), Query Frame = 1

Query: 1   MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDF 60
           +FSKIDL+SGYHQL +R  ++PKT+FR+RYGHYEF+VM FGLTNAP  FMDLMNRVFK +
Sbjct: 193 VFSKIDLQSGYHQLMVRSEDVPKTAFRTRYGHYEFLVMPFGLTNAPAAFMDLMNRVFKPY 252

Query: 61  LDTFVIIFIDDILVYSKTEAEHEEHLHKVLELFDWSRPSTVSEVRSFLGLA-VYYRRFVE 120
           LD FV +FIDDIL+YS++  EHE HL  VL+        T+ + + +  L    + RF+E
Sbjct: 253 LDQFVAVFIDDILIYSRSREEHEGHLSIVLQ--------TLRDKQLYAKLKKCEFWRFIE 312

Query: 121 DFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ--------------------SDASKR 180
            FS +  PLT+LT+K   F WS  CE SFQ LK                     SDAS +
Sbjct: 313 GFSKIVLPLTKLTQKGVKFEWSDDCECSFQELKNRLVSAPILTIPSGSGGFVVYSDASHQ 372

Query: 181 GLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHK 240
           GLG VLMQ  +VVA ASRQLK +E+NYPTHD ELA VVFALKIWRH+L+GE  +IFTDHK
Sbjct: 373 GLGCVLMQHGRVVAYASRQLKPYERNYPTHDSELADVVFALKIWRHFLFGETCEIFTDHK 432

Query: 241 SLKYFFTQKELNTRQRRWLELVKDHDYEILYHPGKANVIADALSRKRAKITVSVGTVTSQ 300
           SLKY F+QK+LN RQRRW+EL+KD+DY I YH  KANV+ADALSRK      ++     Q
Sbjct: 433 SLKYLFSQKKLNMRQRRWIELLKDYDYIIQYHSRKANVVADALSRKSVGSLTAIRGCQRQ 492

Query: 301 L--------------------AQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS 360
           L                    A   VQP L  ++   Q +D  LV+    V+ G   +F 
Sbjct: 493 LLEDLRSLQVHMRVLDSGALIANFRVQPDLVGRIKALQKNDLNLVQLMEEVKKGSKLDFV 552

Query: 361 ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLGICQQVKAPRQK---------- 420
           +S DG L     LCVP +  ++ +LL+EAH S F++H    +  K  RQ           
Sbjct: 553 LSDDGILRFGTRLCVPNDEDLRRELLEEAHCSKFAIHPERTKMYKDLRQNYWWSGMKCDI 612

Query: 421 ---PARLL---QPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKS 480
               A+ L   QPL++PEWKWE+++MDF+ GLPRTL G   IWV+VDRLTKSAHF+P K 
Sbjct: 613 AQFVAQCLVCQQPLAIPEWKWEHITMDFVIGLPRTLGGNNAIWVIVDRLTKSAHFLPMKV 672

Query: 481 TYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTD 540
            ++  + A LY+ EIVR+HGVPV IVSDRD RFTS+FW  LQ ++GT+L FSTAFHPQTD
Sbjct: 673 NFSLDRLASLYVKEIVRMHGVPVSIVSDRDPRFTSRFWHSLQKSLGTKLSFSTAFHPQTD 732

Query: 541 GQTERLNQILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTP 600
           GQ+ER+ Q+LED+ RAC+L+  G+WD HL L+EFAYNNSF A+IGMAPFEALYG+ CR+P
Sbjct: 733 GQSERVIQVLEDLFRACILDLQGNWDDHLPLVEFAYNNSFQASIGMAPFEALYGRKCRSP 792

Query: 601 VCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNLEFAVGDKVFFK 660
           +CW++VGE++L+GPELVQ T E +  I+ R++ A SR KSY D RR++LEF VGD VF K
Sbjct: 793 ICWNDVGERKLLGPELVQLTVEKVALIKERLKAAQSRHKSYVDHRRRDLEFEVGDHVFLK 852

Query: 661 VAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVA 720
           V+P+K VMRF +K KLSPRFVG FEILER+G +AY++ALPPSLS +HNVFHVS LRKY+ 
Sbjct: 853 VSPMKSVMRFGRKGKLSPRFVGLFEILERVGTLAYKVALPPSLSKVHNVFHVSTLRKYIY 912

Query: 721 DISHVVDYEPLEIDENLSYVEQPVEILAREVKMLRNRSIPLIKVLWQNHRIEEATWEREA 736
           D SHVVD EP++I E+L+Y E PV+I+    K+LR+  + L+KV W NH I EATWE E 
Sbjct: 913 DPSHVVDLEPIQIFEDLTYEEVPVQIVDMMDKVLRHAVVKLVKVQWSNHSIREATWELEE 972

BLAST of CSPI03G21630 vs. NCBI nr
Match: gi|1012113262|ref|XP_015960510.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC107484454 [Arachis duranensis])

HSP 1 Score: 837.0 bits (2161), Expect = 2.5e-239
Identity = 437/799 (54.69%), Positives = 551/799 (68.96%), Query Frame = 1

Query: 2    FSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFL 61
            FSKIDLRSGYHQL+I++ +IPKT+FR+RY HYEF+VMSF LTNAP  FMDLMNRVFK FL
Sbjct: 547  FSKIDLRSGYHQLKIKEEDIPKTTFRTRYRHYEFLVMSFCLTNAPAAFMDLMNRVFKPFL 606

Query: 62   DTFVIIFIDDILVYSKTEAEHEEHL---------HKVLELFD----W------------- 121
            D FVIIFIDDILVYSK+  EHE HL         HK+   F     W             
Sbjct: 607  DRFVIIFIDDILVYSKSATEHEYHLRIVLQTLRDHKLYAKFSKCEFWLDQVTFLGHVISK 666

Query: 122  ----SRPSTVSEVRSFL------------GLAVYYRRFVEDFSHLATPLTQLTRKRTLFF 181
                  P  V  V+ +             GLA YYRRF++DFS ++TPLT+LT+K   F 
Sbjct: 667  DGIMVDPKKVEAVQKWPRPTTVTEIRSFLGLAGYYRRFIKDFSRISTPLTKLTQKNVKFQ 726

Query: 182  WSPACEDSFQNLKQ--------------------SDASKRGLGFVLMQQDKVVANASRQL 241
            WS ACE+ FQ LK                      DAS+ GLG VLMQ  +V+A ASRQL
Sbjct: 727  WSEACEEGFQTLKACLTSAPVLVLPSGSGGFSVFCDASRIGLGCVLMQHGRVIAYASRQL 786

Query: 242  KSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWLE 301
            K HEQNYPTHD+E+A VVFALKIWRHYLYGE  +I+TDHKSLKY F QK+LN RQRRW+E
Sbjct: 787  KKHEQNYPTHDMEMAAVVFALKIWRHYLYGETCEIYTDHKSLKYIFQQKDLNLRQRRWME 846

Query: 302  LVKDHDYEILYHPGKANVIADALSRKR----AKITVSVGTVTSQLAQLMVQPTLRRKVID 361
            L+KD+D  ILYHPGKANV+ADALSRK     A IT+    +  ++ QL           D
Sbjct: 847  LLKDYDCTILYHPGKANVVADALSRKSMGSLAHITLGRRPIVEEVHQLEASGI----QFD 906

Query: 362  AQSSDPYLVERRRLVETGQIDEFSISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSM 421
               S  +L   R   ++  I++  ++                  +K D+       +F  
Sbjct: 907  LGESGVFLAHVR--TQSSLIEQIKVAQSDWW-----------EGMKKDI------GVFVS 966

Query: 422  HLGICQQVKAPRQKPARLLQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLTKS 481
            H   CQQVKA  Q+PA LLQ + +PEWKWE ++MDF+TGLPR+ KGF  IWV+VDR+TKS
Sbjct: 967  HCLTCQQVKAEHQRPAGLLQQIEIPEWKWERITMDFVTGLPRSFKGFDSIWVIVDRMTKS 1026

Query: 482  AHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFS 541
            AHF+P K+T++ +++AQLY+ EIV+LHG+PV I+SDR  +FTS FWK  Q A+GTRLD S
Sbjct: 1027 AHFLPVKTTFSAARYAQLYVDEIVKLHGIPVSIISDRGPQFTSHFWKSFQKALGTRLDLS 1086

Query: 542  TAFHPQTDGQTERLNQILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEAL 601
            TAFHPQTDGQ+ER  QILEDMLR CVL+F G+WDS+L L+EF+YNNS+ A+I MAPFEAL
Sbjct: 1087 TAFHPQTDGQSERTIQILEDMLRCCVLDFGGNWDSYLPLIEFSYNNSYQASIQMAPFEAL 1146

Query: 602  YGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNLEFA 661
            YG+ CR+P+ W EVGE +L+GP LVQ   E ++ IR R+  A SRQK+Y D RR+NLEF+
Sbjct: 1147 YGRRCRSPIGWFEVGEVKLLGPNLVQDAVEKVRIIRERLLAAQSRQKAYVDNRRRNLEFS 1206

Query: 662  VGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHV 721
            VGD+VF KV+P+KGVMRF K+ KLSPR++GPFEIL+RIGVVAYRLALP  LS +H VFH+
Sbjct: 1207 VGDQVFLKVSPMKGVMRFGKRGKLSPRYIGPFEILDRIGVVAYRLALPSELSMIHPVFHM 1266

Query: 722  SMLRKYVADISHVVDYEPLEIDENLSYVEQPVEILAREVKMLRNRSIPLIKVLWQNHRIE 735
            SMLRKY++D SHV+  + +E+ E+LS+ E+PV I+ R+VK LR++ I  +KV+ +NH +E
Sbjct: 1267 SMLRKYLSDPSHVLTPQAIELKEDLSFEEEPVAIVDRQVKKLRSKEIASVKVVXKNHSVE 1322

The following BLAST results are available for this feature:

Match Name  E-value  Identity  Description
TF25_SCHPO  1.5e-42  34.75     Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF26_SCHPO  1.5e-42  34.75     Transposon Tf2-6 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF24_SCHPO  1.5e-42  34.75     Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF29_SCHPO  1.5e-42  34.75     Transposon Tf2-9 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF23_SCHPO  1.5e-42  34.75     Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...

Match Name    E-value   Identity  Description
Q84KB0_CUCME  0.0e+00   70.62     Pol protein OS=Cucumis melo subsp. melo PE=4 SV=1
E5GCE2_CUCME  4.9e-250  65.47     Ty3-gypsy retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1
A5BA10_VITVI  3.5e-240  54.80     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_009489 PE=4 SV=1
Q2QTC6_ORYSJ  3.5e-240  51.25     Retrotransposon protein, putative, Ty3-gypsy subclass OS=Oryza sativa subsp. jap...
Q2R8I6_ORYSJ  8.6e-239  51.13     Retrotransposon protein, putative, Ty3-gypsy subclass OS=Oryza sativa subsp. jap...

Match Name   E-value  Identity  Description
ATMG00860.1  6.3e-07  38.46     ATMG00860.1 DNA/RNA polymerases superfamily protein

Match Name                         E-value   Identity  Description
gi|28558781|gb|AAO45752.1|         0.0e+00   70.62     pol protein [Cucumis melo subsp. melo]
gi|307136318|gb|ADN34141.1|        7.0e-250  65.47     ty3-gypsy retrotransposon protein [Cucumis melo subsp. melo]
gi|77555016|gb|ABA97812.1|         5.0e-240  51.25     retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Gro...
gi|147769978|emb|CAN61139.1|       5.0e-240  54.80     hypothetical protein VITISV_009489 [Vitis vinifera]
gi|1012113262|ref|XP_015960510.1|  2.5e-239  54.69     PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC107484454 [Arachis du...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR000477  RT_dom
IPR001584  Integrase_cat-core
IPR012337  RNaseH-like_sf
IPR023780  Chromo_domain
Vocabulary: Biological Process
Term        Definition
GO:0015074  DNA integration
Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0006310 DNA recombination
biological_process GO:0006508 proteolysis
biological_process GO:0006278 RNA-dependent DNA biosynthetic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0003677 DNA binding
molecular_function GO:0003723 RNA binding
molecular_function GO:0003964 RNA-directed DNA polymerase activity
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI03G21630.1  CSPI03G21630.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description               Source   Source Term        Source Description   Alignment
IPR000477  Reverse transcriptase domain  PFAM     PF00078            RVT_1                coord: 1..95; score: 4.3
IPR001584  Integrase, catalytic core     PFAM     PF00665            rve                  coord: 385..496; score: 3.2
IPR001584  Integrase, catalytic core     PROFILE  PS50994            INTEGRASE            coord: 372..539; score: 19
IPR012337  Ribonuclease H-like domain    GENE3D   G3DSA:3.30.420.10  -                    coord: 384..543; score: 7.8
IPR012337  Ribonuclease H-like domain    unknown  SSF53098           Ribonuclease H-like  coord: 377..533; score: 5.31
IPR023780  Chromo domain                 PFAM     PF00385            Chromo               coord: 686..734; score: 7.
None       No IPR available              unknown  Coil               Coil                 coord: 165..185; score:
None       No IPR available              GENE3D   G3DSA:3.30.70.270  -                    coord: 4..91; score: 9.
None       No IPR available              PANTHER  PTHR24559          FAMILY NOT NAMED     coord: 1..733; score:
None       No IPR available              PANTHER  PTHR24559:SF207    SUBFAMILY NOT NAMED  coord: 1..733; score:
None       No IPR available              unknown  SSF56672           DNA/RNA polymerases  coord: 2..250; score: 6.23
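
Each alignment entry above locates a domain on the protein as `coord: start..end`. The following Python sketch turns those strings into domain lengths. It assumes this text format and assumes the coordinates are 1-based and inclusive. The helper name `domain_length` is hypothetical, and the three inputs are copied from the Pfam rows of the table:

```python
import re

# Matches the "coord: start..end" fragment used in the Alignment column.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_length(alignment: str) -> int:
    """Return the span length of a 'coord: start..end' entry.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    match = COORD_RE.search(alignment)
    if match is None:
        raise ValueError(f"no coordinates found in {alignment!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return end - start + 1


# Pfam hits from the InterPro table (source description -> alignment text).
pfam_hits = {
    "RVT_1": "coord: 1..95",
    "rve": "coord: 385..496",
    "Chromo": "coord: 686..734",
}
lengths = {name: domain_length(text) for name, text in pfam_hits.items()}
print(lengths)
```

Under the inclusive-coordinate assumption, the lengths are in amino acid residues, since the coordinates refer to the translated protein.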

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None