CSPI03G21010 (gene) Wild cucumber (PI 183967)

Name: CSPI03G21010
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Transposon Ty3-I Gag-Pol polyprotein
Location: Chr3 : 17104476 .. 17107556 (+)
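
The location above can be read as chromosome, start, end and strand. The following is a minimal sketch of parsing it, not part of the database record: the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

# Hypothetical helper, not provided by the database: parses a location
# string such as "Chr3 : 17104476 .. 17107556 (+)".
LOCATION_RE = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Return chromosome, start, end, strand and span length as a dict."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }


loc = parse_location("Chr3 : 17104476 .. 17107556 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 3081 bp on the plus strand of Chr3.
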
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAATTTTCATTGGGAGAATTTTTGGTGATTCTACAAGGGGACAGAAGCCTCGTAAAATCACAGGTGTCACTAAAATCAATGATAAAGACTTTTGAGAAGGAAGATCAAGGAGTGTTAATCGAGTTGAGTACAATCGAACAATAGGGAGCGGAAGAGTCAAAGAATAATTTAGCGGACTGTCTGAGTAATCTAAAACTCGAAGTTCAAAGAATTCTTTTGTCTTTTGGTAGTGTGTTTGAATCCAGAAATCAGTTGACACCACCTCGCAATCATGATCCAGGGGCCCGGGCGGTGAATGTGCGGCCTTATCGTTATCCCCAATTCCAGAAAGATGAAATTGAAAAATTAGTAAAGGAAATGTTGTAAGCCGAAATTATCCAGCCAAGCAAAAGTGCCTTTTCAAGTCCGGTGCTCCTTGTAAAGAAGAAAGATGGTAGCTGGTGATTCTGCGTGGATTACCGAGCACTGAATCTTGGCACCATACCAGATAAGTATCCAATTCCCGTTGTTGATGAACTACTAGATGAATTATTCGGGGCAACCATATTTTCAAAAATTGATCTAAAGTCAGGCTATCACCAGATTAGAGTGCGAGTCGCCGATGTTCACAAAACGGCGCTTCGTACCCATGAAGGGCACTACGAATTTCTCGTGATGCCATTTGGGTTGAAGAATGCTCCAGCCACTTTTCAATCAGGAATGAATGATATTCTCCGCCCATACTTACGCAAATTTGTTTTGGTTTTCTTCGACGATATCCTAATCTACAACAAGTCACTAGAAGAACATTTGCAGCAATTGGTCGTGGTTTTAGAAACCTTAGTAGTTCATAAACTGGTGGCGAATTTTAAAAAATGTCAATTCGCAGTAGATCGAATTGAATATTTGGGTCACATTATCTCATCTGATGGGGTGGCGGCGGATCCAACGAAAATAATGGCTATGGTTAAATGACCAGCACCCAAGAATGTGAAGGAATTACAAGGTTTCTTGGGTCTTACAGGTTATTATCGGAAATTTGTGGCAAACTATGGTTCTATTGCACTACCCTTGACACAATTGTTAAAGAAAGGAAAGTTTCAGTGGAATGAGACAGCAAAAAATGCGTTCCAACAATTAAAATTTGCTATGATGTCGGTACCGGTGTTAGGTATTCCAGATTTTCCCAAGGTTTTGTGCTGGAAACTGATGTTTCAGGTGTTGATATTGGGGCTGTCCTAATGCATCATCAACGTCCAGTGGCTTTTTCAGTCAAGCTTTGCCCGATACCCATAGGTTTAAAGCTGTGTATGAACGGGAACTTATGGCAATCGTGCGGGCTGTTCAAAAATGGTGCCCATATCTGTTAGGTAAGCCTTTTGTGGTGCGTACAGATAAAAAAAGTTTGAAGTTTCTTCTTGAACAACGAGCCATTGGGGGAGAATATCAGAGATGGATCACTAAACTATTGGGGTATGATTTTGTGATTGAATACAAGAAAGGAATGGAAAATAAAGCGGCTAATGCTTTATCACGATTACCACCGGTATTTGAATTGGGCCTCATAAGTGTTGTGAGCGGGCTCAATCCTTCGATTTTCATTGATCAAGTGGCTGGGAATGAAGCTCTGAATAGTATTCGGTTATCCTTGATAAATGGACAGCCAACCCCGGAGGGATATTCACTACATAGAGAGGTTCTATGCTATCACGGCCGGTTGGTTCTTTCGGAGGATTCCCCTACTATCCCCTTATTATTAGCGGAGTTTCATAACAGCCCGATAGGGGGACATCATGGCGCGTTGACAACATATCAAAGGCTAGCAAGGGAGGTATATTGGCGAGGTATGAAAGCACACATCCGCACATATGTTGCCGAATGTTCGGTTTGTCAACAAGCCAAATATTTGTCATTAACTCCAGCCGGTCTTCTTCAAGCATTACCCATTCCGGACCGTGTATGGGAGGATATTAGTATGGATTTTATTGAGGGTCTTCCTAAATCTGAGTGCTATGATG
TCATATTGGTGGTTGTGGATAGACTTTCCAAGTATGCACATTTCATTCCTTTGAAGCATCCTTTTGGTGCAAAAGTTGTAGCTGCTGTTTTTATGCGGGAAATCATTCATTTGCATGGATGCCCCCGAAGTATAGTGTCGGATCGTGATCGTATTTTCACAAGTCTATTTTGGGAGGAGTTGTGGCGGTTGTTGGGAACTTAGTTAAGACTAAGCACCACCTACCATCCCCAAACAGATGGTCAAACCGAAGTTGTCAATCGAGGGGTGGAGACGTATCTACGATGTTTTGCTATGAACACACCTAAGCAGTGGGCTAAATGGCTAGCTTGGGCTGAATTGAGCTATAATACAACTGTCCATACAGCCATATTGATGACCCCCTTCGAGGCTCACTATGGTCGACCACCTCCCTCTATATTACCATGCGTGAAGGAGTCGAGCCCGGTGAATGAAGTAGATCATTTGATGAAGGAAAGAGATGGAATTCTGAGAATTTTGAAGGCCAATTTATTGAAGGCACAACAGCGAATGGTCAAATACGCCAAATTGACCGACGGGAGGTACAGTTTGAGGTGGATGATTGGGTTTATGTGAAGTTACGCCCTTACCGCCAGTCCTCTCTCACCCATTTTAAGCACCCCAAATTGGCCCCAAGATTCATTGGCCCATTTGTCATTGTGGCACGAATTGGTCTGGTTGCATATCGTTTGGCTCTTCCCAAGGAGTCGCTTATTCACCCGGTCTTTCATGTTTCGATTCTTTAAAAAGCCGTAGGTTCAAACAGGCCTCTATTTCCAATTCCAAAAGATTTGGCTGCTGATTTGTCGATGCAATTGTCTCCGGTAGAGGTGTTGGGAGTTCGCAAAACCAGAGAAAAGGGGGACAAGTTAGAAGTTTTGATTTTCTGGAGTGACGGTACACCTGAAAGTGCTACATGGGAACAAGCCACTCTTATTCAAGAACAATTTCTTGAATTCCACCTTGAGGACAAGGTGGCTCTTTGGGGGGCGGGCAATGATAGACCCCACATAGTAAAGGTGTATTCACGTAGGAATAAAAGTGGGAAGAAGAAAGAATGA

mRNA sequence

ATGGAATTTTCATTGGGAGAATTTTTGGTGATTCTACAAGGGGACAGAAGCCTCGTAAAATCACAGGGAGCGGAAGAGTCAAAGAATAATTTAGCGGACTGTCTGAGTAATCTAAAACTCGAAGTTCAAAGAATTCTTTTGTCTTTTGGTAGTGTGTTTGAATCCAGAAATCAGTTGACACCACCTCGCAATCATGATCCAGGGGCCCGGGCGGTGAATGTGCGGCCTTATCCACTGAATCTTGGCACCATACCAGATAAGTATCCAATTCCCGTTGTTGATGAACTACTAGATGAATTATTCGGGGCAACCATATTTTCAAAAATTGATCTAAAGTCAGGCTATCACCAGATTAGAGTGCGAGTCGCCGATGTTCACAAAACGGCGCTTCGTACCCATGAAGGGCACTACGAATTTCTCGTGATGCCATTTGGGTTGAAGAATGCTCCAGCCACTTTTCAATCAGGAATGAATGATATTCTCCGCCCATACTTACGCAAATTTGTTTTGGTTTTCTTCGACGATATCCTAATCTACAACAAGTCACTAGAAGAACATTTGCAGCAATTGGTCGTGGTTTTAGAAACCTTAGTAGTTCATAAACTGGTGGCGAATTTTAAAAAATGTCAATTCGCAGTAGATCGAATTGAATATTTGGGTCACATTATCTCATCTGATGGGGTGGCGGCGGATCCAACGAAAATAATGGCTATGGAATTACAAGGTTTCTTGGGTCTTACAGGTTATTATCGGAAATTTGTGGCAAACTATGGTTCTATTGCACTACCCTTGACACAATTGTTAAAGAAAGGAAAGTTTCAGTGGAATGAGACAGCAAAAAATGCGTTCCAACAATTAAAATTTGCTATGATGTCGGTACCGGTGTTAGGTATTCCAGATTTTCCCAAGGTTTTGTGCTGGAAACTGATGTTTCAGGTGTTGATATTGGGGCTTCAAGCTTTGCCCGATACCCATAGGTTTAAAGCTGTGTATGAACGGGAACTTATGGCAATCGTGCGGGCTGTTCAAAAATGGTGCCCATATCTGTTAGGTAAGCCTTTTGTGGTGCGTACAGATAAAAAAAGTTTGAAGTTTCTTCTTGAACAACGAGCCATTGGGGGAGAATATCAGAGATGGATCACTAAACTATTGGGGTATGATTTTGTGATTGAATACAAGAAAGGAATGGAAAATAAAGCGGCTAATGCTTTATCACGATTACCACCGGTATTTGAATTGGGCCTCATAAGTGTTGTGAGCGGGCTCAATCCTTCGATTTTCATTGATCAAGTGGCTGGGAATGAAGCTCTGAATAGTATTCGGTTATCCTTGATAAATGGACAGCCAACCCCGGAGGGATATTCACTACATAGAGAGGTTCTATGCTATCACGGCCGGTTGGTTCTTTCGGAGGATTCCCCTACTATCCCCTTATTATTAGCGGAGTTTCATAACAGCCCGATAGGGGGACATCATGGCGCGTTGACAACATATCAAAGGCTAGCAAGGGAGGTATATTGGCGAGGTATGAAAGCACACATCCGCACATATGTTGCCGAATGTTCGGTTTGTCAACAAGCCAAATATTTGTCATTAACTCCAGCCGGTCTTCTTCAAGCATTACCCATTCCGGACCGTGTATGGGAGGATATTAGTATGGATTTTATTGAGGGTCTTCCTAAATCTGAGTGCTATGATGTCATATTGGTGGTTGTGGATAGACTTTCCAAGTATGCACATTTCATTCCTTTGAAGCATCCTTTTGGTGCAAAAGTTGTAGCTGCTGTTTTTATGCGGGAAATCATTCATTTGCATGGATGCCCCCGAAGTATAGTGTCGGATCGTGATCATGGTCAAACCGAAGTTGTCAATCGAGGGGTGGAGACGTATCTACGATGTTTTGCTATGAACACACCTAAGCAGTGGGCTAAATGGCTAGCTTGGGCTGAATTGAGCTATAATACAACTGTCCATACAGCCATATTGATGACCCCCTTCGA
GGCTCACTATGGTCGACCACCTCCCTCTATATTACCATGCGTGAAGGAGTCGAGCCCGGTGAATGAAGTAGATCATTTGATGAAGGAAAGAGATGGAATTCTGAGAATTTTGAAGGCCAATTTATTGAAGGCACAACAGCGAATGGTCAAATACGCCAAATTGACCGACGGGAGGCCTCTATTTCCAATTCCAAAAGATTTGGCTGCTGATTTGTCGATGCAATTGTCTCCGGTAGAGGTGTTGGGAGTTCGCAAAACCAGAGAAAAGGGGGACAAGTTAGAAGTTTTGATTTTCTGGAGTGACGGTACACCTGAAAGTGCTACATGGGAACAAGCCACTCTTATTCAAGAACAATTTCTTGAATTCCACCTTGAGGACAAGGTGGCTCTTTGGGGGGCGGGCAATGATAGACCCCACATAGTAAAGGTGTATTCACGTAGGAATAAAAGTGGGAAGAAGAAAGAATGA

Coding sequence (CDS)

ATGGAATTTTCATTGGGAGAATTTTTGGTGATTCTACAAGGGGACAGAAGCCTCGTAAAATCACAGGGAGCGGAAGAGTCAAAGAATAATTTAGCGGACTGTCTGAGTAATCTAAAACTCGAAGTTCAAAGAATTCTTTTGTCTTTTGGTAGTGTGTTTGAATCCAGAAATCAGTTGACACCACCTCGCAATCATGATCCAGGGGCCCGGGCGGTGAATGTGCGGCCTTATCCACTGAATCTTGGCACCATACCAGATAAGTATCCAATTCCCGTTGTTGATGAACTACTAGATGAATTATTCGGGGCAACCATATTTTCAAAAATTGATCTAAAGTCAGGCTATCACCAGATTAGAGTGCGAGTCGCCGATGTTCACAAAACGGCGCTTCGTACCCATGAAGGGCACTACGAATTTCTCGTGATGCCATTTGGGTTGAAGAATGCTCCAGCCACTTTTCAATCAGGAATGAATGATATTCTCCGCCCATACTTACGCAAATTTGTTTTGGTTTTCTTCGACGATATCCTAATCTACAACAAGTCACTAGAAGAACATTTGCAGCAATTGGTCGTGGTTTTAGAAACCTTAGTAGTTCATAAACTGGTGGCGAATTTTAAAAAATGTCAATTCGCAGTAGATCGAATTGAATATTTGGGTCACATTATCTCATCTGATGGGGTGGCGGCGGATCCAACGAAAATAATGGCTATGGAATTACAAGGTTTCTTGGGTCTTACAGGTTATTATCGGAAATTTGTGGCAAACTATGGTTCTATTGCACTACCCTTGACACAATTGTTAAAGAAAGGAAAGTTTCAGTGGAATGAGACAGCAAAAAATGCGTTCCAACAATTAAAATTTGCTATGATGTCGGTACCGGTGTTAGGTATTCCAGATTTTCCCAAGGTTTTGTGCTGGAAACTGATGTTTCAGGTGTTGATATTGGGGCTTCAAGCTTTGCCCGATACCCATAGGTTTAAAGCTGTGTATGAACGGGAACTTATGGCAATCGTGCGGGCTGTTCAAAAATGGTGCCCATATCTGTTAGGTAAGCCTTTTGTGGTGCGTACAGATAAAAAAAGTTTGAAGTTTCTTCTTGAACAACGAGCCATTGGGGGAGAATATCAGAGATGGATCACTAAACTATTGGGGTATGATTTTGTGATTGAATACAAGAAAGGAATGGAAAATAAAGCGGCTAATGCTTTATCACGATTACCACCGGTATTTGAATTGGGCCTCATAAGTGTTGTGAGCGGGCTCAATCCTTCGATTTTCATTGATCAAGTGGCTGGGAATGAAGCTCTGAATAGTATTCGGTTATCCTTGATAAATGGACAGCCAACCCCGGAGGGATATTCACTACATAGAGAGGTTCTATGCTATCACGGCCGGTTGGTTCTTTCGGAGGATTCCCCTACTATCCCCTTATTATTAGCGGAGTTTCATAACAGCCCGATAGGGGGACATCATGGCGCGTTGACAACATATCAAAGGCTAGCAAGGGAGGTATATTGGCGAGGTATGAAAGCACACATCCGCACATATGTTGCCGAATGTTCGGTTTGTCAACAAGCCAAATATTTGTCATTAACTCCAGCCGGTCTTCTTCAAGCATTACCCATTCCGGACCGTGTATGGGAGGATATTAGTATGGATTTTATTGAGGGTCTTCCTAAATCTGAGTGCTATGATGTCATATTGGTGGTTGTGGATAGACTTTCCAAGTATGCACATTTCATTCCTTTGAAGCATCCTTTTGGTGCAAAAGTTGTAGCTGCTGTTTTTATGCGGGAAATCATTCATTTGCATGGATGCCCCCGAAGTATAGTGTCGGATCGTGATCATGGTCAAACCGAAGTTGTCAATCGAGGGGTGGAGACGTATCTACGATGTTTTGCTATGAACACACCTAAGCAGTGGGCTAAATGGCTAGCTTGGGCTGAATTGAGCTATAATACAACTGTCCATACAGCCATATTGATGACCCCCTTCGA
GGCTCACTATGGTCGACCACCTCCCTCTATATTACCATGCGTGAAGGAGTCGAGCCCGGTGAATGAAGTAGATCATTTGATGAAGGAAAGAGATGGAATTCTGAGAATTTTGAAGGCCAATTTATTGAAGGCACAACAGCGAATGGTCAAATACGCCAAATTGACCGACGGGAGGCCTCTATTTCCAATTCCAAAAGATTTGGCTGCTGATTTGTCGATGCAATTGTCTCCGGTAGAGGTGTTGGGAGTTCGCAAAACCAGAGAAAAGGGGGACAAGTTAGAAGTTTTGATTTTCTGGAGTGACGGTACACCTGAAAGTGCTACATGGGAACAAGCCACTCTTATTCAAGAACAATTTCTTGAATTCCACCTTGAGGACAAGGTGGCTCTTTGGGGGGCGGGCAATGATAGACCCCACATAGTAAAGGTGTATTCACGTAGGAATAAAAGTGGGAAGAAGAAAGAATGA
BLAST of CSPI03G21010 vs. Swiss-Prot
Match: TF22_SCHPO (Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-2 PE=3 SV=1)

HSP 1 Score: 321.6 bits (823), Expect = 2.5e-86
Identity = 220/718 (30.64%), Positives = 346/718 (48.19%), Query Frame = 1

Query: 78   PLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHY 137
            PLN    P+ YP+P++++LL ++ G+TIF+K+DLKS YH IRVR  D HK A R   G +
Sbjct: 470  PLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVF 529

Query: 138  EFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETL 197
            E+LVMP+G+  APA FQ  +N IL       V+ + DDILI++KS  EH++ +  VL+ L
Sbjct: 530  EYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKL 589

Query: 198  VVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLT 257
                L+ N  KC+F   +++++G+ IS  G       I  +          EL+ FLG  
Sbjct: 590  KNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSV 649

Query: 258  GYYRKFVANYGSIALPLTQLLKKG-KFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLC 317
             Y RKF+     +  PL  LLKK  +++W  T   A + +K  ++S PVL   DF K + 
Sbjct: 650  NYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKIL 709

Query: 318  WKLMFQVLILG--LQALPDTHRFK----------------AVYERELMAIVRAVQKWCPY 377
             +     + +G  L    D  ++                 +V ++E++AI+++++ W  Y
Sbjct: 710  LETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHY 769

Query: 378  LLG--KPFVVRTDKKSL--KFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANAL 437
            L    +PF + TD ++L  +   E         RW   L  ++F I Y+ G  N  A+AL
Sbjct: 770  LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADAL 829

Query: 438  SRL-------PPVFELGLISVVSGLN-PSIFIDQVAGNEALNSIRLSLINGQPTPEGYSL 497
            SR+       P   E   I+ V+ ++    F +QV      ++  L+L+N     E   +
Sbjct: 830  SRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNN----EDKRV 889

Query: 498  HREVLCYHGRLVLSEDSPTIP-------LLLAEFHNSPIGGHHGALTTYQRLAREVYWRG 557
               +    G L+ S+D   +P        ++ ++H      H G       + R   W+G
Sbjct: 890  EENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKG 949

Query: 558  MKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVIL 617
            ++  I+ YV  C  CQ  K  +  P G LQ +P  +R WE +SMDFI  LP+S  Y+ + 
Sbjct: 950  IRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALF 1009

Query: 618  VVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDH------------ 677
            VVVDR SK A  +P      A+  A +F + +I   G P+ I++D DH            
Sbjct: 1010 VVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAH 1069

Query: 678  -----------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTA 719
                             GQTE  N+ VE  LRC     P  W   ++  + SYN  +H+A
Sbjct: 1070 KYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSA 1129
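
The percentages in an HSP summary line are the identical and positive-scoring counts divided by the alignment length. The following is a minimal sketch that parses one such line and recomputes both values; the helper name `parse_hsp_stats` is hypothetical, and the pattern is an assumption based only on the lines shown in this record.

```python
import re

# Hypothetical helper, not part of BLAST or the database. The pattern
# accepts "Positives" and also the "Postives" spelling found in some exports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts from an HSP summary line and recompute percentages."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"unrecognised HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned": int(ident_len),
        # Recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the summary line, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 220/718 (30.64%), Positives = 346/718 (48.19%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the Tf2-2 hit above, 220 identical and 346 positive positions over 718 aligned columns reproduce the reported 30.64% and 48.19%.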

BLAST of CSPI03G21010 vs. Swiss-Prot
Match: TF212_SCHPO (Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-12 PE=3 SV=1)

HSP 1 Score: 321.6 bits (823), Expect = 2.5e-86
Identity = 220/718 (30.64%), Positives = 346/718 (48.19%), Query Frame = 1

Query: 78   PLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHY 137
            PLN    P+ YP+P++++LL ++ G+TIF+K+DLKS YH IRVR  D HK A R   G +
Sbjct: 470  PLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVF 529

Query: 138  EFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETL 197
            E+LVMP+G+  APA FQ  +N IL       V+ + DDILI++KS  EH++ +  VL+ L
Sbjct: 530  EYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKL 589

Query: 198  VVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLT 257
                L+ N  KC+F   +++++G+ IS  G       I  +          EL+ FLG  
Sbjct: 590  KNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSV 649

Query: 258  GYYRKFVANYGSIALPLTQLLKKG-KFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLC 317
             Y RKF+     +  PL  LLKK  +++W  T   A + +K  ++S PVL   DF K + 
Sbjct: 650  NYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKIL 709

Query: 318  WKLMFQVLILG--LQALPDTHRFK----------------AVYERELMAIVRAVQKWCPY 377
             +     + +G  L    D  ++                 +V ++E++AI+++++ W  Y
Sbjct: 710  LETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHY 769

Query: 378  LLG--KPFVVRTDKKSL--KFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANAL 437
            L    +PF + TD ++L  +   E         RW   L  ++F I Y+ G  N  A+AL
Sbjct: 770  LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADAL 829

Query: 438  SRL-------PPVFELGLISVVSGLN-PSIFIDQVAGNEALNSIRLSLINGQPTPEGYSL 497
            SR+       P   E   I+ V+ ++    F +QV      ++  L+L+N     E   +
Sbjct: 830  SRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNN----EDKRV 889

Query: 498  HREVLCYHGRLVLSEDSPTIP-------LLLAEFHNSPIGGHHGALTTYQRLAREVYWRG 557
               +    G L+ S+D   +P        ++ ++H      H G       + R   W+G
Sbjct: 890  EENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKG 949

Query: 558  MKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVIL 617
            ++  I+ YV  C  CQ  K  +  P G LQ +P  +R WE +SMDFI  LP+S  Y+ + 
Sbjct: 950  IRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALF 1009

Query: 618  VVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDH------------ 677
            VVVDR SK A  +P      A+  A +F + +I   G P+ I++D DH            
Sbjct: 1010 VVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAH 1069

Query: 678  -----------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTA 719
                             GQTE  N+ VE  LRC     P  W   ++  + SYN  +H+A
Sbjct: 1070 KYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSA 1129

BLAST of CSPI03G21010 vs. Swiss-Prot
Match: TF23_SCHPO (Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-3 PE=1 SV=1)

HSP 1 Score: 321.6 bits (823), Expect = 2.5e-86
Identity = 220/718 (30.64%), Positives = 346/718 (48.19%), Query Frame = 1

Query: 78   PLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHY 137
            PLN    P+ YP+P++++LL ++ G+TIF+K+DLKS YH IRVR  D HK A R   G +
Sbjct: 470  PLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVF 529

Query: 138  EFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETL 197
            E+LVMP+G+  APA FQ  +N IL       V+ + DDILI++KS  EH++ +  VL+ L
Sbjct: 530  EYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKL 589

Query: 198  VVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLT 257
                L+ N  KC+F   +++++G+ IS  G       I  +          EL+ FLG  
Sbjct: 590  KNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSV 649

Query: 258  GYYRKFVANYGSIALPLTQLLKKG-KFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLC 317
             Y RKF+     +  PL  LLKK  +++W  T   A + +K  ++S PVL   DF K + 
Sbjct: 650  NYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKIL 709

Query: 318  WKLMFQVLILG--LQALPDTHRFK----------------AVYERELMAIVRAVQKWCPY 377
             +     + +G  L    D  ++                 +V ++E++AI+++++ W  Y
Sbjct: 710  LETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHY 769

Query: 378  LLG--KPFVVRTDKKSL--KFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANAL 437
            L    +PF + TD ++L  +   E         RW   L  ++F I Y+ G  N  A+AL
Sbjct: 770  LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADAL 829

Query: 438  SRL-------PPVFELGLISVVSGLN-PSIFIDQVAGNEALNSIRLSLINGQPTPEGYSL 497
            SR+       P   E   I+ V+ ++    F +QV      ++  L+L+N     E   +
Sbjct: 830  SRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNN----EDKRV 889

Query: 498  HREVLCYHGRLVLSEDSPTIP-------LLLAEFHNSPIGGHHGALTTYQRLAREVYWRG 557
               +    G L+ S+D   +P        ++ ++H      H G       + R   W+G
Sbjct: 890  EENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKG 949

Query: 558  MKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVIL 617
            ++  I+ YV  C  CQ  K  +  P G LQ +P  +R WE +SMDFI  LP+S  Y+ + 
Sbjct: 950  IRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALF 1009

Query: 618  VVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDH------------ 677
            VVVDR SK A  +P      A+  A +F + +I   G P+ I++D DH            
Sbjct: 1010 VVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAH 1069

Query: 678  -----------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTA 719
                             GQTE  N+ VE  LRC     P  W   ++  + SYN  +H+A
Sbjct: 1070 KYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSA 1129

BLAST of CSPI03G21010 vs. Swiss-Prot
Match: TF24_SCHPO (Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-4 PE=3 SV=1)

HSP 1 Score: 321.6 bits (823), Expect = 2.5e-86
Identity = 220/718 (30.64%), Positives = 346/718 (48.19%), Query Frame = 1

Query: 78   PLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHY 137
            PLN    P+ YP+P++++LL ++ G+TIF+K+DLKS YH IRVR  D HK A R   G +
Sbjct: 470  PLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVF 529

Query: 138  EFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETL 197
            E+LVMP+G+  APA FQ  +N IL       V+ + DDILI++KS  EH++ +  VL+ L
Sbjct: 530  EYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKL 589

Query: 198  VVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLT 257
                L+ N  KC+F   +++++G+ IS  G       I  +          EL+ FLG  
Sbjct: 590  KNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSV 649

Query: 258  GYYRKFVANYGSIALPLTQLLKKG-KFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLC 317
             Y RKF+     +  PL  LLKK  +++W  T   A + +K  ++S PVL   DF K + 
Sbjct: 650  NYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKIL 709

Query: 318  WKLMFQVLILG--LQALPDTHRFK----------------AVYERELMAIVRAVQKWCPY 377
             +     + +G  L    D  ++                 +V ++E++AI+++++ W  Y
Sbjct: 710  LETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHY 769

Query: 378  LLG--KPFVVRTDKKSL--KFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANAL 437
            L    +PF + TD ++L  +   E         RW   L  ++F I Y+ G  N  A+AL
Sbjct: 770  LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADAL 829

Query: 438  SRL-------PPVFELGLISVVSGLN-PSIFIDQVAGNEALNSIRLSLINGQPTPEGYSL 497
            SR+       P   E   I+ V+ ++    F +QV      ++  L+L+N     E   +
Sbjct: 830  SRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNN----EDKRV 889

Query: 498  HREVLCYHGRLVLSEDSPTIP-------LLLAEFHNSPIGGHHGALTTYQRLAREVYWRG 557
               +    G L+ S+D   +P        ++ ++H      H G       + R   W+G
Sbjct: 890  EENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKG 949

Query: 558  MKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVIL 617
            ++  I+ YV  C  CQ  K  +  P G LQ +P  +R WE +SMDFI  LP+S  Y+ + 
Sbjct: 950  IRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALF 1009

Query: 618  VVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDH------------ 677
            VVVDR SK A  +P      A+  A +F + +I   G P+ I++D DH            
Sbjct: 1010 VVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAH 1069

Query: 678  -----------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTA 719
                             GQTE  N+ VE  LRC     P  W   ++  + SYN  +H+A
Sbjct: 1070 KYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSA 1129

BLAST of CSPI03G21010 vs. Swiss-Prot
Match: TF25_SCHPO (Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-5 PE=3 SV=1)

HSP 1 Score: 321.6 bits (823), Expect = 2.5e-86
Identity = 220/718 (30.64%), Positives = 346/718 (48.19%), Query Frame = 1

Query: 78   PLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHY 137
            PLN    P+ YP+P++++LL ++ G+TIF+K+DLKS YH IRVR  D HK A R   G +
Sbjct: 470  PLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVF 529

Query: 138  EFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETL 197
            E+LVMP+G+  APA FQ  +N IL       V+ + DDILI++KS  EH++ +  VL+ L
Sbjct: 530  EYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKL 589

Query: 198  VVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLT 257
                L+ N  KC+F   +++++G+ IS  G       I  +          EL+ FLG  
Sbjct: 590  KNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSV 649

Query: 258  GYYRKFVANYGSIALPLTQLLKKG-KFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLC 317
             Y RKF+     +  PL  LLKK  +++W  T   A + +K  ++S PVL   DF K + 
Sbjct: 650  NYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKIL 709

Query: 318  WKLMFQVLILG--LQALPDTHRFK----------------AVYERELMAIVRAVQKWCPY 377
             +     + +G  L    D  ++                 +V ++E++AI+++++ W  Y
Sbjct: 710  LETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHY 769

Query: 378  LLG--KPFVVRTDKKSL--KFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANAL 437
            L    +PF + TD ++L  +   E         RW   L  ++F I Y+ G  N  A+AL
Sbjct: 770  LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADAL 829

Query: 438  SRL-------PPVFELGLISVVSGLN-PSIFIDQVAGNEALNSIRLSLINGQPTPEGYSL 497
            SR+       P   E   I+ V+ ++    F +QV      ++  L+L+N     E   +
Sbjct: 830  SRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNN----EDKRV 889

Query: 498  HREVLCYHGRLVLSEDSPTIP-------LLLAEFHNSPIGGHHGALTTYQRLAREVYWRG 557
               +    G L+ S+D   +P        ++ ++H      H G       + R   W+G
Sbjct: 890  EENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKG 949

Query: 558  MKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVIL 617
            ++  I+ YV  C  CQ  K  +  P G LQ +P  +R WE +SMDFI  LP+S  Y+ + 
Sbjct: 950  IRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALF 1009

Query: 618  VVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDH------------ 677
            VVVDR SK A  +P      A+  A +F + +I   G P+ I++D DH            
Sbjct: 1010 VVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAH 1069

Query: 678  -----------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTA 719
                             GQTE  N+ VE  LRC     P  W   ++  + SYN  +H+A
Sbjct: 1070 KYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSA 1129

BLAST of CSPI03G21010 vs. TrEMBL
Match: A0A087GEK8_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA8G499800 PE=4 SV=1)

HSP 1 Score: 724.5 bits (1869), Expect = 1.4e-205
Identity = 375/689 (54.43%), Positives = 478/689 (69.38%), Query Frame = 1

Query: 79   LNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYE 138
            LN  TIPD +PIP++D+LLDEL GAT+FSK+DLKSGYHQI V+  +V KTA RTH+GHYE
Sbjct: 675  LNKVTIPDSFPIPMIDQLLDELHGATVFSKLDLKSGYHQILVKPQNVPKTAFRTHDGHYE 734

Query: 139  FLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETLV 198
            FLVMPFGL NAP TFQ+ MN++ R +LRKFVLVFFDDIL+Y+ SL+EH + L VVL+ L 
Sbjct: 735  FLVMPFGLTNAPTTFQALMNEVFRAHLRKFVLVFFDDILVYSSSLQEHQEHLRVVLQILF 794

Query: 199  VHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTG 258
              +L AN KKCQF    IEYLGH+IS +GV+ADP+K+ AM           L+GFLGLTG
Sbjct: 795  QQQLFANKKKCQFGSSSIEYLGHVISGEGVSADPSKLQAMVSWPLPKNIKALRGFLGLTG 854

Query: 259  YYRKFVANYGSIALPLTQLLKKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWK 318
            YYR+FV  YGSIA PLT LLKK KFQW+E A  AF++LK AM +VPVL + DF ++   +
Sbjct: 855  YYRRFVQGYGSIAKPLTSLLKKDKFQWSEEATVAFEKLKVAMSTVPVLALVDFSELFVVE 914

Query: 319  LMFQVLILGL-------------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFV 378
                 + LG              QAL D  + K+VYERELMAIV A+QKW  YLLG+ F+
Sbjct: 915  SDASGIGLGAVLLQKQKPVAYFSQALTDRQKLKSVYERELMAIVFAIQKWRHYLLGRKFL 974

Query: 379  VRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGL 438
            VRTD+KSLKFLLEQR +  EYQ+W+TK+LG++F I YK G+ENKAA+ALSR+  + +L  
Sbjct: 975  VRTDQKSLKFLLEQREVNLEYQQWLTKILGFNFDIHYKPGLENKAADALSRVEGLPQLYA 1034

Query: 439  ISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPT 498
            +SV + +      ++V  N     I+  ++    T  GYS+ +  L Y+G+LVL ++S  
Sbjct: 1035 LSVPAAIQLEEINEEVDRNPVSKKIKEEVLLDASTHSGYSVVQGRLLYNGKLVLPKESYL 1094

Query: 499  IPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPA 558
            I +LL EFHNS +GGH G L T + L    YW+GM A I+T+VAEC VCQ+ KY +L P+
Sbjct: 1095 IKVLLHEFHNSRMGGHGGVLKTQRHLGALFYWQGMMADIKTFVAECVVCQKHKYSTLAPS 1154

Query: 559  GLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAA 618
            GLLQ LPIP +VWEDIS+DF+EGLPKSE +D ILVVVDRL+KYAHFI L+HPFGAK +AA
Sbjct: 1155 GLLQPLPIPTQVWEDISLDFVEGLPKSEGFDAILVVVDRLTKYAHFIKLQHPFGAKEIAA 1214

Query: 619  VFMREIIHLHGCPRSIVSDRD-----------------------------HGQTEVVNRG 678
            VF++EI+ LHG P ++VSDRD                              GQTEV NRG
Sbjct: 1215 VFIQEIVRLHGYPSTMVSDRDTLFTGMFWTELFRLAGTSLNFSTAYHPQTDGQTEVTNRG 1274

Query: 679  VETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSP 716
            +ET LRCF  + PK+WA +L WAE  YN++ H+AI MTPF+A YGR PPS+L     S+ 
Sbjct: 1275 LETILRCFTSDKPKKWAAYLPWAEFCYNSSYHSAIQMTPFKALYGRDPPSLLRFEDGSTT 1334

BLAST of CSPI03G21010 vs. TrEMBL
Match: A0A087G3S6_ARAAL (Uncharacterized protein (Fragment) OS=Arabis alpina GN=AALP_AAs46225U000100 PE=4 SV=1)

HSP 1 Score: 710.7 bits (1833), Expect = 2.1e-201
Identity = 367/693 (52.96%), Positives = 468/693 (67.53%), Query Frame = 1

Query: 79   LNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYE 138
            LN  TIPD +PIP++D+LLDEL GATIFSK+DLKSGYHQI V+  DV KTA RTH+GHYE
Sbjct: 701  LNKVTIPDSFPIPMIDQLLDELHGATIFSKLDLKSGYHQILVKAEDVAKTAFRTHDGHYE 760

Query: 139  FLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETLV 198
            FLVMPFGL NAPATFQS MND+ R YLRKFVLVFFDDIL+Y+KSL+EH Q L +VLE L 
Sbjct: 761  FLVMPFGLTNAPATFQSLMNDVFRGYLRKFVLVFFDDILVYSKSLQEHQQHLGLVLELLQ 820

Query: 199  VHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTG 258
             H+L AN KKC+F    +EYLGH++S  GVAADP KI AM           L+GFLGLTG
Sbjct: 821  QHQLFANKKKCEFGRTELEYLGHVVSGKGVAADPEKIQAMVSWPEPQNVKALRGFLGLTG 880

Query: 259  YYRKFVANYGSIALPLTQLLKKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWK 318
            YYRKFV  YG IA PLT LLKK +FQW   A  AFQ+LK AM +VPVL + DF +    +
Sbjct: 881  YYRKFVQRYGEIARPLTALLKKDQFQWTAEATVAFQKLKKAMSTVPVLALVDFTEQFVVE 940

Query: 319  LMFQVLILGL-------------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFV 378
                   LG              QAL +  R K+VYERELMAIV A+QKW  YLLG+ FV
Sbjct: 941  SDASGTGLGAVLMQSQRPLAYFSQALTERQRLKSVYERELMAIVFAIQKWRHYLLGRKFV 1000

Query: 379  VRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGL 438
            VRTD+KSLKFLLEQR I  EYQ+W+TKLLG+DF I+YK G+ENKAA+ALSR     +L  
Sbjct: 1001 VRTDQKSLKFLLEQREINMEYQKWLTKLLGFDFEIQYKPGLENKAADALSRKDMALQLCA 1060

Query: 439  ISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPT 498
            +S+ + +       +V  +  L  ++  ++    +   +S+ +  L   G+LV+   S  
Sbjct: 1061 LSIPAAIQLEQINTEVDNDPDLRKLKEEVLQDAASHSEFSVVQGRLLRKGKLVVPAQSRL 1120

Query: 499  IPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPA 558
            + ++L EFHN  +GGH G L T +R+    YW+GM + IR +VA C VCQ+ KY +L PA
Sbjct: 1121 VNVILQEFHNGKLGGHGGVLKTQKRVEAIFYWKGMMSRIREFVAACQVCQRHKYSTLAPA 1180

Query: 559  GLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAA 618
            GLLQ LPIPD+VWEDISMDF+EGLPKSE ++V++VVVDRL+KYAHFI +KHP  A  VA 
Sbjct: 1181 GLLQPLPIPDQVWEDISMDFVEGLPKSEGFEVVMVVVDRLTKYAHFISMKHPVTAVEVAL 1240

Query: 619  VFMREIIHLHGCPRSIVSDRD-----------------------------HGQTEVVNRG 678
            +F +E++ LHG P++IVSDRD                              GQTEV NRG
Sbjct: 1241 IFTKEVVKLHGFPKTIVSDRDPLFTGRFWTEMFRLAGTSLCFSTAYHPQSDGQTEVTNRG 1300

Query: 679  VETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSP 720
            +ET LRCF+ + P+ W ++L WAEL YNT+ HTAI M+PF+A YGR PP+++     S+ 
Sbjct: 1301 METLLRCFSSDKPRCWVQFLHWAELCYNTSYHTAIKMSPFQAVYGREPPTLIKFETGSTS 1360

BLAST of CSPI03G21010 vs. TrEMBL
Match: A0A087H8D5_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA3G106900 PE=4 SV=1)

HSP 1 Score: 701.4 bits (1809), Expect = 1.3e-198
Identity = 375/741 (50.61%), Positives = 484/741 (65.32%), Query Frame = 1

Query: 39   KLEVQR---ILLSFGSVFESRNQLTPP----RNHDPGAR-AVNVRPYPLNLGTIPDKYPI 98
            K E+++   ++L+ G + ES +  + P    R  D   R  V+ R   LN  T+ D YPI
Sbjct: 580  KAEIEKQVAVMLAAGIIRESTSPYSSPVLLVRKKDGSWRFCVDYRA--LNKATVGDSYPI 639

Query: 99   PVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAP 158
            P++D+LLDEL GA +FSK+DL+SGYHQIRVR  DV KTA RTH+GHYEFLVMPFGL NAP
Sbjct: 640  PMIDQLLDELHGACVFSKLDLRSGYHQIRVRAEDVPKTAFRTHDGHYEFLVMPFGLTNAP 699

Query: 159  ATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQ 218
            ATFQ+ MND+ R +LRKFVLVFFDDIL+Y+KS  EH   L +VL+ L  H+L AN +KCQ
Sbjct: 700  ATFQALMNDVFRQHLRKFVLVFFDDILVYSKSASEHRNHLQLVLQLLQDHQLYANKRKCQ 759

Query: 219  FAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSI 278
            F    IEYLGH+I+++GV+AD +KI AM           L+GFLGLTGYYRKFV  YGSI
Sbjct: 760  FGSRSIEYLGHVITAEGVSADASKIQAMVDWPEPRNVKALRGFLGLTGYYRKFVRGYGSI 819

Query: 279  ALPLTQLLKKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGL-- 338
            A PLT LL+K +F+W+  A  AF  LK AM++VPVL + DF      +       LG   
Sbjct: 820  AKPLTSLLQKDQFRWSPEASTAFNNLKQAMVTVPVLTMADFDAQFVVESDASGTGLGAVL 879

Query: 339  -----------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFVVRTDKKSLKFLL 398
                       QAL D  + K+VYERELMAIV A+QKW  YLLG+ FVVRTD+KSLKFLL
Sbjct: 880  MQHQKPLAYFSQALTDRQKLKSVYERELMAIVFAIQKWRHYLLGRKFVVRTDQKSLKFLL 939

Query: 399  EQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIF 458
            EQR I  EYQ+W+TK+LG+DF I+YK G+ENKAA+ALSR   + +L  +S+ + +     
Sbjct: 940  EQRQINMEYQKWLTKILGFDFNIQYKSGLENKAADALSRRDAIPQLFALSIPAAIQLEDI 999

Query: 459  IDQVAGNEALNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIPLLLAEFHNSP 518
              +V  +  L  I+  ++    +  G+++ +  L   G+LV+   S  + L+L EFH S 
Sbjct: 1000 SSEVDKDLKLQKIKAEVLADPKSHAGFTVVQGRLLRQGKLVVPAQSHLVELILKEFHGSK 1059

Query: 519  IGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRV 578
            IGGH G L T +R+    YW GM   IRTYVAEC VCQ+ KY +L PAGLLQ LPIP +V
Sbjct: 1060 IGGHGGVLKTQKRITAVFYWEGMLNTIRTYVAECQVCQRHKYSTLAPAGLLQPLPIPTQV 1119

Query: 579  WEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGC 638
            W DISMDF EGLPK E +DVI+VVVDRL+KY+HFI L HPFGA  VA VF+ EI+ LHG 
Sbjct: 1120 WADISMDFFEGLPKFEGFDVIMVVVDRLTKYSHFISLAHPFGAPQVAMVFILEIVRLHGF 1179

Query: 639  PRSIVSDRD-----------------------------HGQTEVVNRGVETYLRCFAMNT 698
            P ++VSDRD                              GQTEV NRG+ETYLRCFA + 
Sbjct: 1180 PETLVSDRDTLFTGLFWTELFRLAGTKLCFSTAYHPQSDGQTEVTNRGLETYLRCFASDK 1239

Query: 699  PKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERD 720
            PK W ++L WAE SYN++ H++I M+PF+A YGR PP +L     S+    +++ + ERD
Sbjct: 1240 PKSWVRFLPWAEFSYNSSYHSSIKMSPFQALYGREPPVLLKFENGSTVNATLENRLSERD 1299

BLAST of CSPI03G21010 vs. TrEMBL
Match: J3SDF5_BETVU (Ty3/gypsy retrotransposon protein OS=Beta vulgaris subsp. vulgaris PE=4 SV=1)

HSP 1 Score: 698.4 bits (1801), Expect = 1.1e-197
Identity = 359/694 (51.73%), Positives = 456/694 (65.71%), Query Frame = 1

Query: 79   LNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYE 138
            LN  T+PDKYPIPV+DELLDEL GAT+FSK+DL++GYHQI VR  D HKTA RTHEGHYE
Sbjct: 750  LNKETVPDKYPIPVIDELLDELHGATVFSKLDLRAGYHQILVRPEDTHKTAFRTHEGHYE 809

Query: 139  FLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETLV 198
            FLVMPFGL NAPATFQS MN++ RP+LR+FVLVF DDILIY++S EEH+  L +VL  L 
Sbjct: 810  FLVMPFGLTNAPATFQSLMNEVFRPFLRRFVLVFLDDILIYSRSDEEHVGHLEMVLGMLA 869

Query: 199  VHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTG 258
             H L  N KKC+F    + YLGH+IS  GVA D  K+ A+          EL+GFLGLTG
Sbjct: 870  QHALFVNKKKCEFGKREVAYLGHVISEGGVAMDTEKVKAVLEWEVPKNLRELRGFLGLTG 929

Query: 259  YYRKFVANYGSIALPLTQLLKKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWK 318
            YYRKFVANY  IA PLT+ LKK  F+W+ TA  AF+QLK AM+S PVL +P+F      +
Sbjct: 930  YYRKFVANYAHIARPLTEQLKKDNFKWSATATEAFKQLKSAMVSAPVLAMPNFQLTFVVE 989

Query: 319  LMFQVLILGLQALPDTH-------------RFKAVYERELMAIVRAVQKWCPYLLGKPFV 378
                   +G   + D               + K+VYE+ELMAI  AVQKW  YLLG+ FV
Sbjct: 990  TDASGYGMGAVLMQDNRPIAYYSKLLGTRAQLKSVYEKELMAICFAVQKWKYYLLGRHFV 1049

Query: 379  VRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSR-LPPVFELG 438
            VRTD++SL+++ +QR IG E+Q+W++KL+GYDF I YK G+ N+ A+ALSR      ELG
Sbjct: 1050 VRTDQQSLRYITQQREIGAEFQKWVSKLMGYDFEIHYKPGLSNRVADALSRKTVGEVELG 1109

Query: 439  LISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSP 498
             I  V G+  +    ++ G+  L  +R  L  G+ TP  ++L    L + GR V+   S 
Sbjct: 1110 AIVAVQGVEWAELRREITGDSFLTQVRKELQEGR-TPSHFTLVDGNLLFKGRYVIPSSST 1169

Query: 499  TIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTP 558
             IP LL E+H++P+GGH G L TY RLA E YWRGM+  +  YV +C +CQQ K     P
Sbjct: 1170 IIPKLLYEYHDAPMGGHAGELKTYLRLAAEWYWRGMRQEVARYVHQCLICQQQKVSQQHP 1229

Query: 559  AGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVA 618
             GLLQ LPIP  VWEDISMDFIEGLP S+  D ILV+VDRLSKYAHF+ L+HPF A +VA
Sbjct: 1230 RGLLQPLPIPSLVWEDISMDFIEGLPVSKGVDTILVIVDRLSKYAHFLTLRHPFTALMVA 1289

Query: 619  AVFMREIIHLHGCPRSIVSDRDH-----------------------------GQTEVVNR 678
             +F++E++ LHG P SIVSDRD                              GQTE+VNR
Sbjct: 1290 DLFVKEVVRLHGFPSSIVSDRDRIFLSLFWKELFRLHGTTLKRSSAYHPQTDGQTEIVNR 1349

Query: 679  GVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESS 720
             +ETYLRCF    P+ WAKWL WAE SYNT+ HT+  M+PF+  YGR PP ++   K  +
Sbjct: 1350 ALETYLRCFVGGHPRSWAKWLPWAEFSYNTSPHTSTKMSPFKVLYGRDPPHVVRAPKGQT 1409
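
The percentages in an HSP header follow directly from the residue counts printed beside them. The sketch below recomputes them for the J3SDF5_BETVU match above. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool; the second label is matched loosely so that minor spelling variants in a report still parse.

```python
import re

# Pattern for an HSP statistics line such as the one in the report above.
# The second label is matched loosely (Pos...) to tolerate spelling variants.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return residue counts plus recomputed and reported percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    ident, length, pos, pos_length = map(int, (ident, length, pos, pos_length))
    return {
        "identical": ident,
        "positives": pos,
        "alignment_length": length,
        # Percent = matching columns / aligned columns (gaps included).
        "identity_pct": round(100 * ident / length, 2),
        "positives_pct": round(100 * pos / pos_length, 2),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }

stats = parse_hsp_stats(
    "Identity = 359/694 (51.73%), Positives = 456/694 (65.71%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

The denominator (694) is the number of alignment columns, including gap columns, not the length of either sequence.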

BLAST of CSPI03G21010 vs. TrEMBL
Match: A0A087HNF3_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA1G155400 PE=4 SV=1)

HSP 1 Score: 685.3 bits (1767), Expect = 9.4e-194
Identity = 369/735 (50.20%), Positives = 480/735 (65.31%), Query Frame = 1

Query: 39   KLEVQR---ILLSFGSVFESRNQLTPPR---NHDPGARAVNVRPYPLNLGTIPDKYPIPV 98
            K E++R    +L  G V +SR+  + P        G+    V    LN  T+ D YPIP+
Sbjct: 592  KEEIERQVASMLGAGIVRDSRSPFSSPVLLVKKKDGSWRFCVDYRALNKATVSDSYPIPM 651

Query: 99   VDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPAT 158
            +D+LLDEL GA IFSK+DL+SGYHQI V+  DV KTA RTH+GHYEFLVMPFGLKNAPAT
Sbjct: 652  IDQLLDELHGANIFSKLDLRSGYHQILVKAEDVAKTAFRTHDGHYEFLVMPFGLKNAPAT 711

Query: 159  FQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFA 218
            FQ+ MND+ RP+LR+FVLVFFDDIL+Y+ +LEEH + L +VL+ L  +KL AN KKCQF 
Sbjct: 712  FQALMNDLFRPHLRRFVLVFFDDILVYSSNLEEHKEHLTMVLQILQNNKLFANPKKCQFG 771

Query: 219  VDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIAL 278
               IEYLGHIIS  GV+AD  KI AM           L+GFLGLTGYYRKFV+ YG  A 
Sbjct: 772  SSEIEYLGHIISGQGVSADQEKIKAMIEWPEPRNVKALRGFLGLTGYYRKFVSRYGEKAK 831

Query: 279  PLTQLLKKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGL---- 338
            PLT LLKK +F+W + A  AF  LK AM SV VL + DF ++   +       LG     
Sbjct: 832  PLTTLLKKDQFKWGKEAAVAFTTLKEAMTSVSVLALADFNELFVVESDASGTGLGAVLMQ 891

Query: 339  ---------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFVVRTDKKSLKFLLEQ 398
                     QAL +  R K+VYERELMAIV A+QKW  YLLG+ F+VRTD+KSLKFL EQ
Sbjct: 892  KQKPLAFFSQALTERQRMKSVYERELMAIVFAIQKWRHYLLGRRFLVRTDQKSLKFLFEQ 951

Query: 399  RAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIFID 458
            R I  EYQ+W+TK+LG++F I+YK G+EN+AA+ALSR   V  L  +S+ + L  +    
Sbjct: 952  REINLEYQKWLTKILGFNFEIQYKPGLENRAADALSRKEAVPLLFALSIPAVLQLNEIES 1011

Query: 459  QVAGNEALNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIG 518
             V  +  L  I+   +    +   Y++ +  L + GRLV+   S  I ++L EFH+  +G
Sbjct: 1012 AVDQDPVLKKIKDDWLQDPSSQPDYTVVQGRLLWKGRLVIPTGSAWIEVILKEFHDGKVG 1071

Query: 519  GHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWE 578
            GH G L T +R++   +W+GM   IR YVA C VCQ+ KY +L PAGLLQ LPIP+ VWE
Sbjct: 1072 GHGGVLKTQRRISALFFWKGMLGKIREYVAACHVCQRHKYSTLAPAGLLQPLPIPEAVWE 1131

Query: 579  DISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPR 638
            DISMDFIEGLPKS   ++I+VVVDRL+KY HF+ LKHP  A  VA+VF++EI+ LHG P+
Sbjct: 1132 DISMDFIEGLPKSAGMELIMVVVDRLTKYGHFVGLKHPLDATTVASVFIQEIVRLHGFPK 1191

Query: 639  SIVSDRDH-----------------------------GQTEVVNRGVETYLRCFAMNTPK 698
            ++VSDRD                              GQ+EV NRG+ETY RCF  + P+
Sbjct: 1192 TLVSDRDRLFTGKFWGEMFKLVGTKLCFSTAYHPQSDGQSEVTNRGLETYPRCFTSDKPQ 1251

Query: 699  QWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGI 716
             WA++L WAELSYNT+ H++I MTPF+A YGR PP++      S+ V +++  ++ERD +
Sbjct: 1252 TWAQFLPWAELSYNTSYHSSIHMTPFQAVYGREPPALRRYENGSTHVADLETKLQERDSM 1311

BLAST of CSPI03G21010 vs. TAIR10
Match: ATMG00860.1 (ATMG00860.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 119.4 bits (298), Expect = 1.0e-26
Identity = 64/126 (50.79%), Positives = 81/126 (64.29%), Query Frame = 1

Query: 187 LQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGH--IISSDGVAADPTKIMAM------ 246
           +  L +VL+    H+  AN KKC F   +I YLGH  IIS +GV+ADP K+ AM      
Sbjct: 1   MNHLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEP 60

Query: 247 ----ELQGFLGLTGYYRKFVANYGSIALPLTQLLKKGKFQWNETAKNAFQQLKFAMMSVP 301
               EL+GFLGLTGYYR+FV NYG I  PLT+LLKK   +W E A  AF+ LK A+ ++P
Sbjct: 61  KNTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNSLKWTEMAALAFKALKGAVTTLP 120

BLAST of CSPI03G21010 vs. NCBI nr
Match: gi|674235545|gb|KFK28310.1| (hypothetical protein AALP_AA8G499800 [Arabis alpina])

HSP 1 Score: 724.5 bits (1869), Expect = 2.0e-205
Identity = 375/689 (54.43%), Positives = 478/689 (69.38%), Query Frame = 1

Query: 79   LNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYE 138
            LN  TIPD +PIP++D+LLDEL GAT+FSK+DLKSGYHQI V+  +V KTA RTH+GHYE
Sbjct: 675  LNKVTIPDSFPIPMIDQLLDELHGATVFSKLDLKSGYHQILVKPQNVPKTAFRTHDGHYE 734

Query: 139  FLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETLV 198
            FLVMPFGL NAP TFQ+ MN++ R +LRKFVLVFFDDIL+Y+ SL+EH + L VVL+ L 
Sbjct: 735  FLVMPFGLTNAPTTFQALMNEVFRAHLRKFVLVFFDDILVYSSSLQEHQEHLRVVLQILF 794

Query: 199  VHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTG 258
              +L AN KKCQF    IEYLGH+IS +GV+ADP+K+ AM           L+GFLGLTG
Sbjct: 795  QQQLFANKKKCQFGSSSIEYLGHVISGEGVSADPSKLQAMVSWPLPKNIKALRGFLGLTG 854

Query: 259  YYRKFVANYGSIALPLTQLLKKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWK 318
            YYR+FV  YGSIA PLT LLKK KFQW+E A  AF++LK AM +VPVL + DF ++   +
Sbjct: 855  YYRRFVQGYGSIAKPLTSLLKKDKFQWSEEATVAFEKLKVAMSTVPVLALVDFSELFVVE 914

Query: 319  LMFQVLILGL-------------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFV 378
                 + LG              QAL D  + K+VYERELMAIV A+QKW  YLLG+ F+
Sbjct: 915  SDASGIGLGAVLLQKQKPVAYFSQALTDRQKLKSVYERELMAIVFAIQKWRHYLLGRKFL 974

Query: 379  VRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGL 438
            VRTD+KSLKFLLEQR +  EYQ+W+TK+LG++F I YK G+ENKAA+ALSR+  + +L  
Sbjct: 975  VRTDQKSLKFLLEQREVNLEYQQWLTKILGFNFDIHYKPGLENKAADALSRVEGLPQLYA 1034

Query: 439  ISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPT 498
            +SV + +      ++V  N     I+  ++    T  GYS+ +  L Y+G+LVL ++S  
Sbjct: 1035 LSVPAAIQLEEINEEVDRNPVSKKIKEEVLLDASTHSGYSVVQGRLLYNGKLVLPKESYL 1094

Query: 499  IPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPA 558
            I +LL EFHNS +GGH G L T + L    YW+GM A I+T+VAEC VCQ+ KY +L P+
Sbjct: 1095 IKVLLHEFHNSRMGGHGGVLKTQRHLGALFYWQGMMADIKTFVAECVVCQKHKYSTLAPS 1154

Query: 559  GLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAA 618
            GLLQ LPIP +VWEDIS+DF+EGLPKSE +D ILVVVDRL+KYAHFI L+HPFGAK +AA
Sbjct: 1155 GLLQPLPIPTQVWEDISLDFVEGLPKSEGFDAILVVVDRLTKYAHFIKLQHPFGAKEIAA 1214

Query: 619  VFMREIIHLHGCPRSIVSDRD-----------------------------HGQTEVVNRG 678
            VF++EI+ LHG P ++VSDRD                              GQTEV NRG
Sbjct: 1215 VFIQEIVRLHGYPSTMVSDRDTLFTGMFWTELFRLAGTSLNFSTAYHPQTDGQTEVTNRG 1274

Query: 679  VETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSP 716
            +ET LRCF  + PK+WA +L WAE  YN++ H+AI MTPF+A YGR PPS+L     S+ 
Sbjct: 1275 LETILRCFTSDKPKKWAAYLPWAEFCYNSSYHSAIQMTPFKALYGRDPPSLLRFEDGSTT 1334

BLAST of CSPI03G21010 vs. NCBI nr
Match: gi|674230743|gb|KFK24528.1| (hypothetical protein AALP_AAs46225U000100, partial [Arabis alpina])

HSP 1 Score: 710.7 bits (1833), Expect = 3.0e-201
Identity = 367/693 (52.96%), Positives = 468/693 (67.53%), Query Frame = 1

Query: 79   LNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYE 138
            LN  TIPD +PIP++D+LLDEL GATIFSK+DLKSGYHQI V+  DV KTA RTH+GHYE
Sbjct: 701  LNKVTIPDSFPIPMIDQLLDELHGATIFSKLDLKSGYHQILVKAEDVAKTAFRTHDGHYE 760

Query: 139  FLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETLV 198
            FLVMPFGL NAPATFQS MND+ R YLRKFVLVFFDDIL+Y+KSL+EH Q L +VLE L 
Sbjct: 761  FLVMPFGLTNAPATFQSLMNDVFRGYLRKFVLVFFDDILVYSKSLQEHQQHLGLVLELLQ 820

Query: 199  VHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTG 258
             H+L AN KKC+F    +EYLGH++S  GVAADP KI AM           L+GFLGLTG
Sbjct: 821  QHQLFANKKKCEFGRTELEYLGHVVSGKGVAADPEKIQAMVSWPEPQNVKALRGFLGLTG 880

Query: 259  YYRKFVANYGSIALPLTQLLKKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWK 318
            YYRKFV  YG IA PLT LLKK +FQW   A  AFQ+LK AM +VPVL + DF +    +
Sbjct: 881  YYRKFVQRYGEIARPLTALLKKDQFQWTAEATVAFQKLKKAMSTVPVLALVDFTEQFVVE 940

Query: 319  LMFQVLILGL-------------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFV 378
                   LG              QAL +  R K+VYERELMAIV A+QKW  YLLG+ FV
Sbjct: 941  SDASGTGLGAVLMQSQRPLAYFSQALTERQRLKSVYERELMAIVFAIQKWRHYLLGRKFV 1000

Query: 379  VRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGL 438
            VRTD+KSLKFLLEQR I  EYQ+W+TKLLG+DF I+YK G+ENKAA+ALSR     +L  
Sbjct: 1001 VRTDQKSLKFLLEQREINMEYQKWLTKLLGFDFEIQYKPGLENKAADALSRKDMALQLCA 1060

Query: 439  ISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPT 498
            +S+ + +       +V  +  L  ++  ++    +   +S+ +  L   G+LV+   S  
Sbjct: 1061 LSIPAAIQLEQINTEVDNDPDLRKLKEEVLQDAASHSEFSVVQGRLLRKGKLVVPAQSRL 1120

Query: 499  IPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPA 558
            + ++L EFHN  +GGH G L T +R+    YW+GM + IR +VA C VCQ+ KY +L PA
Sbjct: 1121 VNVILQEFHNGKLGGHGGVLKTQKRVEAIFYWKGMMSRIREFVAACQVCQRHKYSTLAPA 1180

Query: 559  GLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAA 618
            GLLQ LPIPD+VWEDISMDF+EGLPKSE ++V++VVVDRL+KYAHFI +KHP  A  VA 
Sbjct: 1181 GLLQPLPIPDQVWEDISMDFVEGLPKSEGFEVVMVVVDRLTKYAHFISMKHPVTAVEVAL 1240

Query: 619  VFMREIIHLHGCPRSIVSDRD-----------------------------HGQTEVVNRG 678
            +F +E++ LHG P++IVSDRD                              GQTEV NRG
Sbjct: 1241 IFTKEVVKLHGFPKTIVSDRDPLFTGRFWTEMFRLAGTSLCFSTAYHPQSDGQTEVTNRG 1300

Query: 679  VETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSP 720
            +ET LRCF+ + P+ W ++L WAEL YNT+ HTAI M+PF+A YGR PP+++     S+ 
Sbjct: 1301 METLLRCFSSDKPRCWVQFLHWAELCYNTSYHTAIKMSPFQAVYGREPPTLIKFETGSTS 1360

BLAST of CSPI03G21010 vs. NCBI nr
Match: gi|674245622|gb|KFK38387.1| (hypothetical protein AALP_AA3G106900 [Arabis alpina])

HSP 1 Score: 701.4 bits (1809), Expect = 1.8e-198
Identity = 375/741 (50.61%), Positives = 484/741 (65.32%), Query Frame = 1

Query: 39   KLEVQR---ILLSFGSVFESRNQLTPP----RNHDPGAR-AVNVRPYPLNLGTIPDKYPI 98
            K E+++   ++L+ G + ES +  + P    R  D   R  V+ R   LN  T+ D YPI
Sbjct: 580  KAEIEKQVAVMLAAGIIRESTSPYSSPVLLVRKKDGSWRFCVDYRA--LNKATVGDSYPI 639

Query: 99   PVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAP 158
            P++D+LLDEL GA +FSK+DL+SGYHQIRVR  DV KTA RTH+GHYEFLVMPFGL NAP
Sbjct: 640  PMIDQLLDELHGACVFSKLDLRSGYHQIRVRAEDVPKTAFRTHDGHYEFLVMPFGLTNAP 699

Query: 159  ATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQ 218
            ATFQ+ MND+ R +LRKFVLVFFDDIL+Y+KS  EH   L +VL+ L  H+L AN +KCQ
Sbjct: 700  ATFQALMNDVFRQHLRKFVLVFFDDILVYSKSASEHRNHLQLVLQLLQDHQLYANKRKCQ 759

Query: 219  FAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSI 278
            F    IEYLGH+I+++GV+AD +KI AM           L+GFLGLTGYYRKFV  YGSI
Sbjct: 760  FGSRSIEYLGHVITAEGVSADASKIQAMVDWPEPRNVKALRGFLGLTGYYRKFVRGYGSI 819

Query: 279  ALPLTQLLKKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGL-- 338
            A PLT LL+K +F+W+  A  AF  LK AM++VPVL + DF      +       LG   
Sbjct: 820  AKPLTSLLQKDQFRWSPEASTAFNNLKQAMVTVPVLTMADFDAQFVVESDASGTGLGAVL 879

Query: 339  -----------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFVVRTDKKSLKFLL 398
                       QAL D  + K+VYERELMAIV A+QKW  YLLG+ FVVRTD+KSLKFLL
Sbjct: 880  MQHQKPLAYFSQALTDRQKLKSVYERELMAIVFAIQKWRHYLLGRKFVVRTDQKSLKFLL 939

Query: 399  EQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIF 458
            EQR I  EYQ+W+TK+LG+DF I+YK G+ENKAA+ALSR   + +L  +S+ + +     
Sbjct: 940  EQRQINMEYQKWLTKILGFDFNIQYKSGLENKAADALSRRDAIPQLFALSIPAAIQLEDI 999

Query: 459  IDQVAGNEALNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIPLLLAEFHNSP 518
              +V  +  L  I+  ++    +  G+++ +  L   G+LV+   S  + L+L EFH S 
Sbjct: 1000 SSEVDKDLKLQKIKAEVLADPKSHAGFTVVQGRLLRQGKLVVPAQSHLVELILKEFHGSK 1059

Query: 519  IGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRV 578
            IGGH G L T +R+    YW GM   IRTYVAEC VCQ+ KY +L PAGLLQ LPIP +V
Sbjct: 1060 IGGHGGVLKTQKRITAVFYWEGMLNTIRTYVAECQVCQRHKYSTLAPAGLLQPLPIPTQV 1119

Query: 579  WEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGC 638
            W DISMDF EGLPK E +DVI+VVVDRL+KY+HFI L HPFGA  VA VF+ EI+ LHG 
Sbjct: 1120 WADISMDFFEGLPKFEGFDVIMVVVDRLTKYSHFISLAHPFGAPQVAMVFILEIVRLHGF 1179

Query: 639  PRSIVSDRD-----------------------------HGQTEVVNRGVETYLRCFAMNT 698
            P ++VSDRD                              GQTEV NRG+ETYLRCFA + 
Sbjct: 1180 PETLVSDRDTLFTGLFWTELFRLAGTKLCFSTAYHPQSDGQTEVTNRGLETYLRCFASDK 1239

Query: 699  PKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERD 720
            PK W ++L WAE SYN++ H++I M+PF+A YGR PP +L     S+    +++ + ERD
Sbjct: 1240 PKSWVRFLPWAEFSYNSSYHSSIKMSPFQALYGREPPVLLKFENGSTVNATLENRLSERD 1299

BLAST of CSPI03G21010 vs. NCBI nr
Match: gi|387965727|gb|AFK13856.1| (Ty3/gypsy retrotransposon protein [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 698.4 bits (1801), Expect = 1.5e-197
Identity = 359/694 (51.73%), Positives = 456/694 (65.71%), Query Frame = 1

Query: 79   LNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYE 138
            LN  T+PDKYPIPV+DELLDEL GAT+FSK+DL++GYHQI VR  D HKTA RTHEGHYE
Sbjct: 750  LNKETVPDKYPIPVIDELLDELHGATVFSKLDLRAGYHQILVRPEDTHKTAFRTHEGHYE 809

Query: 139  FLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETLV 198
            FLVMPFGL NAPATFQS MN++ RP+LR+FVLVF DDILIY++S EEH+  L +VL  L 
Sbjct: 810  FLVMPFGLTNAPATFQSLMNEVFRPFLRRFVLVFLDDILIYSRSDEEHVGHLEMVLGMLA 869

Query: 199  VHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTG 258
             H L  N KKC+F    + YLGH+IS  GVA D  K+ A+          EL+GFLGLTG
Sbjct: 870  QHALFVNKKKCEFGKREVAYLGHVISEGGVAMDTEKVKAVLEWEVPKNLRELRGFLGLTG 929

Query: 259  YYRKFVANYGSIALPLTQLLKKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWK 318
            YYRKFVANY  IA PLT+ LKK  F+W+ TA  AF+QLK AM+S PVL +P+F      +
Sbjct: 930  YYRKFVANYAHIARPLTEQLKKDNFKWSATATEAFKQLKSAMVSAPVLAMPNFQLTFVVE 989

Query: 319  LMFQVLILGLQALPDTH-------------RFKAVYERELMAIVRAVQKWCPYLLGKPFV 378
                   +G   + D               + K+VYE+ELMAI  AVQKW  YLLG+ FV
Sbjct: 990  TDASGYGMGAVLMQDNRPIAYYSKLLGTRAQLKSVYEKELMAICFAVQKWKYYLLGRHFV 1049

Query: 379  VRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSR-LPPVFELG 438
            VRTD++SL+++ +QR IG E+Q+W++KL+GYDF I YK G+ N+ A+ALSR      ELG
Sbjct: 1050 VRTDQQSLRYITQQREIGAEFQKWVSKLMGYDFEIHYKPGLSNRVADALSRKTVGEVELG 1109

Query: 439  LISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSP 498
             I  V G+  +    ++ G+  L  +R  L  G+ TP  ++L    L + GR V+   S 
Sbjct: 1110 AIVAVQGVEWAELRREITGDSFLTQVRKELQEGR-TPSHFTLVDGNLLFKGRYVIPSSST 1169

Query: 499  TIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTP 558
             IP LL E+H++P+GGH G L TY RLA E YWRGM+  +  YV +C +CQQ K     P
Sbjct: 1170 IIPKLLYEYHDAPMGGHAGELKTYLRLAAEWYWRGMRQEVARYVHQCLICQQQKVSQQHP 1229

Query: 559  AGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVA 618
             GLLQ LPIP  VWEDISMDFIEGLP S+  D ILV+VDRLSKYAHF+ L+HPF A +VA
Sbjct: 1230 RGLLQPLPIPSLVWEDISMDFIEGLPVSKGVDTILVIVDRLSKYAHFLTLRHPFTALMVA 1289

Query: 619  AVFMREIIHLHGCPRSIVSDRDH-----------------------------GQTEVVNR 678
             +F++E++ LHG P SIVSDRD                              GQTE+VNR
Sbjct: 1290 DLFVKEVVRLHGFPSSIVSDRDRIFLSLFWKELFRLHGTTLKRSSAYHPQTDGQTEIVNR 1349

Query: 679  GVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESS 720
             +ETYLRCF    P+ WAKWL WAE SYNT+ HT+  M+PF+  YGR PP ++   K  +
Sbjct: 1350 ALETYLRCFVGGHPRSWAKWLPWAEFSYNTSPHTSTKMSPFKVLYGRDPPHVVRAPKGQT 1409

BLAST of CSPI03G21010 vs. NCBI nr
Match: gi|674250890|gb|KFK43655.1| (hypothetical protein AALP_AA1G155400 [Arabis alpina])

HSP 1 Score: 685.3 bits (1767), Expect = 1.4e-193
Identity = 369/735 (50.20%), Positives = 480/735 (65.31%), Query Frame = 1

Query: 39   KLEVQR---ILLSFGSVFESRNQLTPPR---NHDPGARAVNVRPYPLNLGTIPDKYPIPV 98
            K E++R    +L  G V +SR+  + P        G+    V    LN  T+ D YPIP+
Sbjct: 592  KEEIERQVASMLGAGIVRDSRSPFSSPVLLVKKKDGSWRFCVDYRALNKATVSDSYPIPM 651

Query: 99   VDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPAT 158
            +D+LLDEL GA IFSK+DL+SGYHQI V+  DV KTA RTH+GHYEFLVMPFGLKNAPAT
Sbjct: 652  IDQLLDELHGANIFSKLDLRSGYHQILVKAEDVAKTAFRTHDGHYEFLVMPFGLKNAPAT 711

Query: 159  FQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFA 218
            FQ+ MND+ RP+LR+FVLVFFDDIL+Y+ +LEEH + L +VL+ L  +KL AN KKCQF 
Sbjct: 712  FQALMNDLFRPHLRRFVLVFFDDILVYSSNLEEHKEHLTMVLQILQNNKLFANPKKCQFG 771

Query: 219  VDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIAL 278
               IEYLGHIIS  GV+AD  KI AM           L+GFLGLTGYYRKFV+ YG  A 
Sbjct: 772  SSEIEYLGHIISGQGVSADQEKIKAMIEWPEPRNVKALRGFLGLTGYYRKFVSRYGEKAK 831

Query: 279  PLTQLLKKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGL---- 338
            PLT LLKK +F+W + A  AF  LK AM SV VL + DF ++   +       LG     
Sbjct: 832  PLTTLLKKDQFKWGKEAAVAFTTLKEAMTSVSVLALADFNELFVVESDASGTGLGAVLMQ 891

Query: 339  ---------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFVVRTDKKSLKFLLEQ 398
                     QAL +  R K+VYERELMAIV A+QKW  YLLG+ F+VRTD+KSLKFL EQ
Sbjct: 892  KQKPLAFFSQALTERQRMKSVYERELMAIVFAIQKWRHYLLGRRFLVRTDQKSLKFLFEQ 951

Query: 399  RAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIFID 458
            R I  EYQ+W+TK+LG++F I+YK G+EN+AA+ALSR   V  L  +S+ + L  +    
Sbjct: 952  REINLEYQKWLTKILGFNFEIQYKPGLENRAADALSRKEAVPLLFALSIPAVLQLNEIES 1011

Query: 459  QVAGNEALNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIG 518
             V  +  L  I+   +    +   Y++ +  L + GRLV+   S  I ++L EFH+  +G
Sbjct: 1012 AVDQDPVLKKIKDDWLQDPSSQPDYTVVQGRLLWKGRLVIPTGSAWIEVILKEFHDGKVG 1071

Query: 519  GHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWE 578
            GH G L T +R++   +W+GM   IR YVA C VCQ+ KY +L PAGLLQ LPIP+ VWE
Sbjct: 1072 GHGGVLKTQRRISALFFWKGMLGKIREYVAACHVCQRHKYSTLAPAGLLQPLPIPEAVWE 1131

Query: 579  DISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPR 638
            DISMDFIEGLPKS   ++I+VVVDRL+KY HF+ LKHP  A  VA+VF++EI+ LHG P+
Sbjct: 1132 DISMDFIEGLPKSAGMELIMVVVDRLTKYGHFVGLKHPLDATTVASVFIQEIVRLHGFPK 1191

Query: 639  SIVSDRDH-----------------------------GQTEVVNRGVETYLRCFAMNTPK 698
            ++VSDRD                              GQ+EV NRG+ETY RCF  + P+
Sbjct: 1192 TLVSDRDRLFTGKFWGEMFKLVGTKLCFSTAYHPQSDGQSEVTNRGLETYPRCFTSDKPQ 1251

Query: 699  QWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGI 716
             WA++L WAELSYNT+ H++I MTPF+A YGR PP++      S+ V +++  ++ERD +
Sbjct: 1252 TWAQFLPWAELSYNTSYHSSIHMTPFQAVYGREPPALRRYENGSTHVADLETKLQERDSM 1311

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
TF22_SCHPO	2.5e-86	30.64	Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF212_SCHPO	2.5e-86	30.64	Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24...
TF23_SCHPO	2.5e-86	30.64	Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF24_SCHPO	2.5e-86	30.64	Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF25_SCHPO	2.5e-86	30.64	Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
Match Name	E-value	Identity	Description
A0A087GEK8_ARAAL	1.4e-205	54.43	Uncharacterized protein OS=Arabis alpina GN=AALP_AA8G499800 PE=4 SV=1
A0A087G3S6_ARAAL	2.1e-201	52.96	Uncharacterized protein (Fragment) OS=Arabis alpina GN=AALP_AAs46225U000100 PE=4...
A0A087H8D5_ARAAL	1.3e-198	50.61	Uncharacterized protein OS=Arabis alpina GN=AALP_AA3G106900 PE=4 SV=1
J3SDF5_BETVU	1.1e-197	51.73	Ty3/gypsy retrotransposon protein OS=Beta vulgaris subsp. vulgaris PE=4 SV=1
A0A087HNF3_ARAAL	9.4e-194	50.20	Uncharacterized protein OS=Arabis alpina GN=AALP_AA1G155400 PE=4 SV=1
Match Name	E-value	Identity	Description
ATMG00860.1	1.0e-26	50.79	ATMG00860.1 DNA/RNA polymerases superfamily protein
Match Name	E-value	Identity	Description
gi|674235545|gb|KFK28310.1|	2.0e-205	54.43	hypothetical protein AALP_AA8G499800 [Arabis alpina]
gi|674230743|gb|KFK24528.1|	3.0e-201	52.96	hypothetical protein AALP_AAs46225U000100, partial [Arabis alpina]
gi|674245622|gb|KFK38387.1|	1.8e-198	50.61	hypothetical protein AALP_AA3G106900 [Arabis alpina]
gi|387965727|gb|AFK13856.1|	1.5e-197	51.73	Ty3/gypsy retrotransposon protein [Beta vulgaris subsp. vulgaris]
gi|674250890|gb|KFK43655.1|	1.4e-193	50.20	hypothetical protein AALP_AA1G155400 [Arabis alpina]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR000477	RT_dom
IPR001584	Integrase_cat-core
IPR012337	RNaseH-like_sf
Vocabulary: Biological Process
Term	Definition
GO:0015074	DNA integration
Vocabulary: Molecular Function
Term	Definition
GO:0003676	nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0044238 primary metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CSPI03G21010.1	CSPI03G21010.1	mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000477	Reverse transcriptase domain	PFAM	PF00078	RVT_1	coord: 97..223; score: 1.3
IPR000477	Reverse transcriptase domain	PROFILE	PS50878	RT_POL	coord: 1..223; score: 11
IPR001584	Integrase, catalytic core	PROFILE	PS50994	INTEGRASE	coord: 538..632; score:
IPR012337	Ribonuclease H-like domain	GENE3D	G3DSA:3.30.420.10		coord: 551..674; score: 1.6
IPR012337	Ribonuclease H-like domain	unknown	SSF53098	Ribonuclease H-like	coord: 541..670; score: 2.09
None	No IPR available	GENE3D	G3DSA:3.10.10.10		coord: 79..143; score: 7.3
None	No IPR available	GENE3D	G3DSA:3.30.70.270		coord: 144..226; score: 2.3
None	No IPR available	PANTHER	PTHR24559	FAMILY NOT NAMED	coord: 79..786; score: 7.8E
None	No IPR available	PANTHER	PTHR24559:SF186	SUBFAMILY NOT NAMED	coord: 79..786; score: 7.8E
None	No IPR available	unknown	SSF56672	DNA/RNA polymerases	coord: 65..393; score: 2.78E
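
The `coord` values above can be turned into domain lengths and overlaps with simple interval arithmetic. The sketch below assumes the coordinates are 1-based, inclusive residue positions on the predicted protein, which this page does not state explicitly. The helper names `parse_coord`, `span_length` and `overlap` are hypothetical.

```python
import re

def parse_coord(text):
    """Parse a 'coord: START..END' string into an inclusive (start, end) tuple."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(m.group(1)), int(m.group(2))

def span_length(span):
    """Number of residues covered, assuming both ends are inclusive."""
    start, end = span
    return end - start + 1

def overlap(a, b):
    """Number of residues shared by two inclusive spans (0 if disjoint)."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)

# Coordinate strings copied from the InterPro table above.
rt_pfam = parse_coord("coord: 97..223")      # PF00078, RVT_1
rt_profile = parse_coord("coord: 1..223")    # PS50878, RT_POL
integrase = parse_coord("coord: 538..632")   # PS50994, INTEGRASE

print(span_length(rt_pfam))          # residues covered by the Pfam RT hit
print(overlap(rt_pfam, rt_profile))  # the Pfam hit lies inside the PROSITE profile hit
print(overlap(rt_pfam, integrase))   # RT and integrase hits do not overlap
```

Under that assumption, the reverse transcriptase and integrase hits occupy separate parts of the protein.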

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None