CSPI01G33150 (gene) Wild cucumber (PI 183967)

Name: CSPI01G33150
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Transposon Ty3-I Gag-Pol polyprotein
Location: Chr1 : 28025513 .. 28029031 (+)
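
As a quick sanity check, the length of the genomic span implied by the location can be computed from its two coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is an assumption about this database's convention and is not stated in the record; `span_length` is a hypothetical helper name.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a closed interval [start, end].

    Assumes 1-based, inclusive coordinates (an assumption, not
    something the record itself states).
    """
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field of this record.
gene_length = span_length(28025513, 28029031)
print(gene_length)
```

If the coordinates turned out to be 0-based or half-open, the result would differ by one, so the value is only as reliable as that assumption.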
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTCTGAAGGAGATGTTATTAGAGATGAAAAAATCGATGGAGAGAATGGTTGAGGATCTGAGAGAGAGCCATAGCAACAAAAGTAGAGAGGAGTTTGGGACCTCCGATGGATCGGTCATGAAACTGAAGGTGAAGGCCGAGGAAACCGATACTACCAATGAAGGGAACTTGACCATGATCGACCACAGCAAATACAAGAAACTGGAAATGTCGATGTTCTTGAGAGAAAATCCGAAATCTTGGGTCTACAGAGCAGAGCATTTCTTTGAAATCAATAATTTACCCAAAGCAAAAAAGGTCATGGTGGCAGTAGTAAGCTTTGGACAAGATGAGGTAGACTGGTACAGATGGTCACATAATAGGAAGAAGGTGGAATCGTAGGAGGATTTGAAGGAAAGGATGTTTGAATTCTTTAAAGACACAGGCCAGAAGAGTTTAGTGTCGAGACTGATCAAAATTCAATAGGATGGTTCCTACAACGATTACGTAAAGAAATTTGTTAATTATTTGGCACCCCTACCCCACATGACAGAGAGTGTATTAAGGGATGCGTTCCTGACCTGGTTAGAGCCGACTCTCCAAGCCGAAGTGGTAAGCCGCAATCCACAAACTCTGGAAGAATGTATGAGATAGGCACAGTTGGTCAACGATAGAAATTTAGCCATGAAATGGTCAAAGGCGGAATGGGGAGGAAGGGATCACAAAAAAGGAGAAGGAAGCACGGGCAAAGGTCCAGAAGGAGTTGAAAAGGGAATAACCAGGAAAACATAATTTCCTTTGAAACAGGTCACAATTCCCATTAGAGGCAACTACTAGAAGAGTGAACCACCAGTGAAACATTTATCTGATGCAGAATTCAGAGCAAGACTGGACAAGGGTTTGTGTTTTAAATGCAATGAAAGATATTCACCAAGACATCGATGCAAAATGAAGGAAAAACGAGAGTTGATGTTGTTTATCATGAATGAATAAGAAAGTCTAGAAGAGGAAGGCAGAACAGAGGAGTCTAATGAGGATGTGTTAGAATTGAAACAGATTAAATTGGAGAAGGGAACCAAAATTGAGTTGAAAGCAATACATGGACTAACAAGTAAAGGAACCATGAAGATAAAGGGTGAAATCAAAGGGAGGGAAGTACTAGTACTGATTGATAGCAAGGCCACCCACAACTTCATACACAACAAGATTGTTGAGGGAATGGGATTAGCACTAGAGAAGGGTACTCCATTCGGGGTCACTACTGGAGATGGTACGAGATGCCAAGGAAGAGGGGTGTGTAAGAGATTAGAACTGAAACTAAAGGAGATCACAATTGTGGCGGATTTTTTAGCAATTGAAGTCGGGAATGTGGATCTCATCTTGGGAATGCAATGGCTGGATACAACTGGAACTATGAAGATACATTGGCCATCCCTAACCATGACATTTTCGATGGGGGAAAACAGATTACCCTCAAGGGTGATCCTTCACTTATTAGAGTTGAATGTTCTATGAAAATCATAGAAAAAACATGAGAAGAGGAGGATCAAGGCTTCCTATTAGAACTACAAAGTAATGATGCAGAGAGGGATGAGGAATCTGATGAAGAACAAAGGGTGAAGAGCGATGAGGATTTATTTGAGGAACCTAAGGGGTTACCACCTAAAAGAGAAGTTGACCATCGTATTTTGTTGTTGCCCGGACAGAAACCTATCAACATGAGACCCTACAAGTATGGTCACACCCAGAAGGAAGAAATTGAGAAGTTGGTAACCAAGATGCTCCAAACTGGGATAAACCGACCCAACCACAGCCCTTACTCTAGTCCCATTTTGATGGCTAGAAAGAAAGATGGAGGGTGGAGGTTCTGTGTTGACTACAGGAAATTAAACCAAGTAACCGTATCTGATAAATTTCCAATACCTATGATAGAGGAATTGTTAGATGAGTTGCACGGGGCTACAGTCTTCTCAAAGCTGGATCTAAAATCAGGGTATCATCAGAGTTGAAAGAATGAT
AGTCCCGTTTTGCTGGTTAGAAAGAAAGATGGAGGGTGGAGGTTCTGTGTTGACTACAGGAAATTAAACCAAGTAACCGTATCTGATAAATTTCCAATACCTGTGATAGAGGAATTGTTAGATGAGTTGCACGGGGCTACAGTCTTCTCAAAACTACATTCAGGACCCATTAGGGACACTATGAGTTCTTGGTCATGTCGTTCGGCCTCACCAACGCACCAACTACTTTCCAATCATTGATGAATCGGGTATTTAAACCTTTTCTAAGATGTGTATTGGTTTTTTTTACGATATACTTGTGTATAGCACGGACCTCACAGAGCATGAGAAGCATTTGGGCATGGTATTTGCAGTAATGAGGGATAACCAGTTCTTTGCCAACAAAAAAAAATGTGTGATGCCTCACTCGCAGATCCAATACTTGGGACATTTAATATCTAGTAAGGGTGTAGAGGCAAATGAAGAGAAGATTAAGAACATGGTAAATTGAACCCAGCCCAAAGATGTAACTAAATTGAGGGGGTTTTTAGGCTTGACTGGTTACTACAGAAGATTCGTCAAGGGATATGGGAAGATTGCAGGCCCTTTGACCAAGCTACTGCAAAAGAACTCATTCCTATGGAATGAAGAAGCAACAAAGGCATTTGATAAGTTGAAGTTAGCCATGACAACCATACCCGTGCTAGCCCTACCGGACTAGAACCTACCCTTCATCATTGAAACGGATGCTTCCGGGATTGCTCTAAGGGCAGTTCTACCTCAAAATGGCCACCCCATAGCCTTCTTTAGTCAGAAACTTTCATTCAGACCTCAAACCAAATCCATATACGAGAGGGAATTGATAGCAGTAGTCCTTTCAGTACAAAAGTGGAGATATTACCTTTTGGGAAGGAATTTTACAATCGTCTCGAATCAGAGAGCCCTCATATTTCTTTTAGAACAAAGGGAAGTGCAACCCCAGTTCCAAAAGTGGTTGACTAAACTCCTAGGGTACTACTTCGAAATACTTTACCAACCAAGGTTACAAAACAAAGCAGCAAATGCCCTCTCCCGAATAGAACAGCCACTAGAAGTGAGGAGTATGTGCACCACGGGTATTGTTAACATGGAAGTGATTGAGAAGGAGGTCAAGTTAGATGAGGATCTCAAGAGAATCATCGAAGAATTAAAGAAAAATCCTGATGAGTCTAGTAAATTCCAATGGATTAATGGAAACCTACTGTATAAGAAACGAATAGTTTTGTCCAAGAGATCCTCCTTGATTCCCACCCTGCTACATACGTTTCATGACTCAATTTTGAGAGGCCACTCCGGATTCTTAAGAACCTATAAAAGAATGTGTGGGGAACTTTATTGGAAGGGCATGAAAACGGATGTAAAAAAATATGTAGAGCAGTGTGAGGTATGCCAAAGAAATAAGTTGGAAGCAACTAAACCAGCTGGAGTTTTGCAGCCAATTCCAATCCCCGAAAGAATTTTGGAGGAATGGTCCATGGACTTCATTGAAGGATTATGTTAG

mRNA sequence

ATGGGTCTGAAGGAGATGTTATTAGAGATGAAAAAATCGATGGAGAGAATGGTTGAGGATCTGAGAGAGAGCCATAGCAACAAAAGTAGAGAGGAGTTTGGGACCTCCGATGGATCGGTCATGAAACTGAAGGTGAAGGCCGAGGAAACCGATACTACCAATGAAGGGAACTTGACCATGATCGACCACAGCAAATACAAGAAACTGGAAATGTCGATGTTCTTGAGAGAAAATCCGAAATCTTGGGTCTACAGAGCAGAGCATTTCTTTGAAATCAATAATTTACCCAAAGCAAAAAAGGTCATGGTGGCAGTAGTAAGCTTTGGACAAGATGAGGTAGACTGGTACAGATGGTCACATAATAGGAAGAAGGTGGAATCAGAGTGTATTAAGGGATGCGTTCCTGACCTGGTTAGAGCCGACTCTCCAAGCCGAAGTGGTAAGCCGCAATCCACAAACTCTGGAAGAATGCGGAATGGGGAGGAAGGGATCACAAAAAAGGAGAAGGAAGCACGGGCAAAGAAGAGTGAACCACCAGTGAAACATTTATCTGATGCAGAATTCAGAGCAAGACTGGACAAGGAAAGTCTAGAAGAGGAAGGCAGAACAGAGGAGTCTAATGAGGATGTGTTAGAATTGAAACAGATTAAATTGGAGAAGGGAACCAAAATTGAGTTGAAAGCAATACATGGACTAACAAGTAAAGGAACCATGAAGATAAAGGGTGAAATCAAAGGGAGGGAAGTACTAGTACTGATTGATAGCAAGGCCACCCACAACTTCATACACAACAAGATTGTTGAGGGAATGGGATTAGCACTAGAGAAGGGTACTCCATTCGGGGTCACTACTGGAGATGGTACGAGATGCCAAGGAAGAGGGGTGTGTAAGAGATTAGAACTGAAACTAAAGGAGATCACAATTGTGGCGGATTTTTTAGCAATTGAAGTCGGGAATGTGGATCTCATCTTGGGAATGCAATGGCTGGATACAACTGGAACTATGAAGATACATTGGCCATCCCTAACCATGACATTTTCGATGGGGGAAAACAGATTACCCTCAAGGAGGGATGAGGAATCTGATGAAGAACAAAGGGTGAAGAGCGATGAGGATTTATTTGAGGAACCTAAGGGGTTACCACCTAAAAGAGAAGTTGACCATCGTATTTTGTTGTTGCCCGGACAGAAACCTATCAACATGAGACCCTACAAGTATGGTCACACCCAGAAGGAAGAAATTGAGAAGTTGGTAACCAAGATGCTCCAAACTGGGATAAACCGACCCAACCACAGCCCTTACTCTAGTCCCATTTTGATGGCTAGAAAGAAAGATGGAGGGTGGAGGTTCTGTGTTGACTACAGGAAATTAAACCAAGTAACCGTATCTGATAAATTTCCAATACCTATGATAGAGGAATTGTTAGATGAGTTGCACGGGGCTACAGTCTTCTCAAAGCTGGATCTAAAATCAGGGAAATTAAACCAAGTAACCGTATCTGATAAATTTCCAATACCTGTGATAGAGGAATTGTTAGATGAGTTGCACGGGGCTACAGTCTTCTCAAAACTACATTCAGGACCCATTAGGGACACTATGAGTTCTTGCACGGACCTCACAGAGCATGAGAAGCATTTGGGCATGGTATTTGCAGTAATGAGGGATAACCAGTTCTTTGCCAACAAAAAAAAATGTGTGATGCCTCACTCGCAGATCCAATACTTGGGACATTTAATATCTAGTAAGGGTGTAGAGGCAAATGAAGAGAAGATTAAGAACATGAACCTACCCTTCATCATTGAAACGGATGCTTCCGGGATTGCTCTAAGGGCAGTTCTACCTCAAAATGGCCACCCCATAGCCTTCTTTAGTCAGAAACTTTCATTCAGACCTCAAACCAAATCCATATACGAGAGGGAATTGATAGCAGTAGTCCTTTCAGTACAAAAGTGGAGATATTACCTTTTGGGAAGGAATTTTACAATCGTCTCGAATCAGAG
AGCCCTCATATTTCTTTTAGAACAAAGGGAAGTGCAACCCCAGTTCCAAAAGTGGTTGACTAAACTCCTAGGGTACTACTTCGAAATACTTTACCAACCAAGGTTACAAAACAAAGCAGCAAATGCCCTCTCCCGAATAGAACAGCCACTAGAAGTGAGGAGTATGTGCACCACGGGTATTGTTAACATGGAAGTGATTGAGAAGGAGGTCAAGTTAGATGAGGATCTCAAGAGAATCATCGAAGAATTAAAGAAAAATCCTGATGAGTCTAGTAAATTCCAATGGATTAATGGAAACCTACTGTATAAGAAACGAATAGTTTTGTCCAAGAGATCCTCCTTGATTCCCACCCTGCTACATACGTTTCATGACTCAATTTTGAGAGGCCACTCCGGATTCTTAAGAACCTATAAAAGAATGTGTGGGGAACTTTATTGGAAGGGCATGAAAACGGATGTAAAAAAATATGTAGAGCAGTGTGAGGTATGCCAAAGAAATAAGTTGGAAGCAACTAAACCAGCTGGAGTTTTGCAGCCAATTCCAATCCCCGAAAGAATTTTGGAGGAATGGTCCATGGACTTCATTGAAGGATTATGTTAG

Coding sequence (CDS)

ATGGGTCTGAAGGAGATGTTATTAGAGATGAAAAAATCGATGGAGAGAATGGTTGAGGATCTGAGAGAGAGCCATAGCAACAAAAGTAGAGAGGAGTTTGGGACCTCCGATGGATCGGTCATGAAACTGAAGGTGAAGGCCGAGGAAACCGATACTACCAATGAAGGGAACTTGACCATGATCGACCACAGCAAATACAAGAAACTGGAAATGTCGATGTTCTTGAGAGAAAATCCGAAATCTTGGGTCTACAGAGCAGAGCATTTCTTTGAAATCAATAATTTACCCAAAGCAAAAAAGGTCATGGTGGCAGTAGTAAGCTTTGGACAAGATGAGGTAGACTGGTACAGATGGTCACATAATAGGAAGAAGGTGGAATCAGAGTGTATTAAGGGATGCGTTCCTGACCTGGTTAGAGCCGACTCTCCAAGCCGAAGTGGTAAGCCGCAATCCACAAACTCTGGAAGAATGCGGAATGGGGAGGAAGGGATCACAAAAAAGGAGAAGGAAGCACGGGCAAAGAAGAGTGAACCACCAGTGAAACATTTATCTGATGCAGAATTCAGAGCAAGACTGGACAAGGAAAGTCTAGAAGAGGAAGGCAGAACAGAGGAGTCTAATGAGGATGTGTTAGAATTGAAACAGATTAAATTGGAGAAGGGAACCAAAATTGAGTTGAAAGCAATACATGGACTAACAAGTAAAGGAACCATGAAGATAAAGGGTGAAATCAAAGGGAGGGAAGTACTAGTACTGATTGATAGCAAGGCCACCCACAACTTCATACACAACAAGATTGTTGAGGGAATGGGATTAGCACTAGAGAAGGGTACTCCATTCGGGGTCACTACTGGAGATGGTACGAGATGCCAAGGAAGAGGGGTGTGTAAGAGATTAGAACTGAAACTAAAGGAGATCACAATTGTGGCGGATTTTTTAGCAATTGAAGTCGGGAATGTGGATCTCATCTTGGGAATGCAATGGCTGGATACAACTGGAACTATGAAGATACATTGGCCATCCCTAACCATGACATTTTCGATGGGGGAAAACAGATTACCCTCAAGGAGGGATGAGGAATCTGATGAAGAACAAAGGGTGAAGAGCGATGAGGATTTATTTGAGGAACCTAAGGGGTTACCACCTAAAAGAGAAGTTGACCATCGTATTTTGTTGTTGCCCGGACAGAAACCTATCAACATGAGACCCTACAAGTATGGTCACACCCAGAAGGAAGAAATTGAGAAGTTGGTAACCAAGATGCTCCAAACTGGGATAAACCGACCCAACCACAGCCCTTACTCTAGTCCCATTTTGATGGCTAGAAAGAAAGATGGAGGGTGGAGGTTCTGTGTTGACTACAGGAAATTAAACCAAGTAACCGTATCTGATAAATTTCCAATACCTATGATAGAGGAATTGTTAGATGAGTTGCACGGGGCTACAGTCTTCTCAAAGCTGGATCTAAAATCAGGGAAATTAAACCAAGTAACCGTATCTGATAAATTTCCAATACCTGTGATAGAGGAATTGTTAGATGAGTTGCACGGGGCTACAGTCTTCTCAAAACTACATTCAGGACCCATTAGGGACACTATGAGTTCTTGCACGGACCTCACAGAGCATGAGAAGCATTTGGGCATGGTATTTGCAGTAATGAGGGATAACCAGTTCTTTGCCAACAAAAAAAAATGTGTGATGCCTCACTCGCAGATCCAATACTTGGGACATTTAATATCTAGTAAGGGTGTAGAGGCAAATGAAGAGAAGATTAAGAACATGAACCTACCCTTCATCATTGAAACGGATGCTTCCGGGATTGCTCTAAGGGCAGTTCTACCTCAAAATGGCCACCCCATAGCCTTCTTTAGTCAGAAACTTTCATTCAGACCTCAAACCAAATCCATATACGAGAGGGAATTGATAGCAGTAGTCCTTTCAGTACAAAAGTGGAGATATTACCTTTTGGGAAGGAATTTTACAATCGTCTCGAATCAGAG
AGCCCTCATATTTCTTTTAGAACAAAGGGAAGTGCAACCCCAGTTCCAAAAGTGGTTGACTAAACTCCTAGGGTACTACTTCGAAATACTTTACCAACCAAGGTTACAAAACAAAGCAGCAAATGCCCTCTCCCGAATAGAACAGCCACTAGAAGTGAGGAGTATGTGCACCACGGGTATTGTTAACATGGAAGTGATTGAGAAGGAGGTCAAGTTAGATGAGGATCTCAAGAGAATCATCGAAGAATTAAAGAAAAATCCTGATGAGTCTAGTAAATTCCAATGGATTAATGGAAACCTACTGTATAAGAAACGAATAGTTTTGTCCAAGAGATCCTCCTTGATTCCCACCCTGCTACATACGTTTCATGACTCAATTTTGAGAGGCCACTCCGGATTCTTAAGAACCTATAAAAGAATGTGTGGGGAACTTTATTGGAAGGGCATGAAAACGGATGTAAAAAAATATGTAGAGCAGTGTGAGGTATGCCAAAGAAATAAGTTGGAAGCAACTAAACCAGCTGGAGTTTTGCAGCCAATTCCAATCCCCGAAAGAATTTTGGAGGAATGGTCCATGGACTTCATTGAAGGATTATGTTAG
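
The BLAST reports that follow compare the protein translated from this coding sequence against protein databases. A minimal sketch of that translation step is given here, assuming the standard genetic code (NCBI translation table 1) and reading frame 1; `translate` and `CODON_TABLE` are illustrative names, not part of any tool used to build this page.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons containing characters other than A/C/G/T become 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGGGTCTGAAGGAGATG"))
```

Passing the full CDS, with the line break removed, would give the complete predicted protein under the same assumptions.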
BLAST of CSPI01G33150 vs. Swiss-Prot
Match: YG31B_YEAST (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 119.8 bits (299), Expect = 1.5e-25
Identity = 71/227 (31.28%), Positives = 110/227 (48.46%), Query Frame = 1

Query: 386 VDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKD 445
           V H I + PG +   ++PY      ++EI K+V K+L      P+ SP SSP+++  KKD
Sbjct: 584 VKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSPCSSPVVLVPKKD 643

Query: 446 GGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSG------------K 505
           G +R CVDYR LN+ T+SD FP+P I+ LL  +  A +F+ LDL SG            K
Sbjct: 644 GTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYK 703

Query: 506 LNQVTVSDKFPIPVIEELLDELHGATVFSKLHSGPIR----------DTMSSCTDLTEHE 565
              VT S K+   V+   L  ++  + F++  +   R          D +       EH 
Sbjct: 704 TAFVTPSGKYEYTVMPFGL--VNAPSTFARYMADTFRDLRFVNVYLDDILIFSESPEEHW 763

Query: 566 KHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEK 591
           KHL  V   +++      KKKC     + ++LG+ I  + +   + K
Sbjct: 764 KHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHK 808
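
The percentages on each HSP summary line are the identity and positive counts divided by the alignment length. The sketch below parses such a line and recomputes both values; the regular expression and the helper name `hsp_percentages` are assumptions made for illustration, and the pattern tolerates either spelling of the "Positives" label.

```python
import re

# Matches "Identity = a/b ... P<label> = c/d" on an HSP summary line.
HSP_STATS = re.compile(r"Identity = (\d+)/(\d+).*?P\w+ = (\d+)/(\d+)")


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


# Summary line copied from the first Swiss-Prot hit above.
line = "Identity = 71/227 (31.28%), Positives = 110/227 (48.46%), Query Frame = 1"
print(hsp_percentages(line))
```

Recomputing the values this way is a cheap consistency check when HSP lines have been copied or reformatted by hand.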

BLAST of CSPI01G33150 vs. Swiss-Prot
Match: YI31B_YEAST (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY3B-I PE=3 SV=2)

HSP 1 Score: 119.8 bits (299), Expect = 1.5e-25
Identity = 71/227 (31.28%), Positives = 110/227 (48.46%), Query Frame = 1

Query: 386 VDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKD 445
           V H I + PG +   ++PY      ++EI K+V K+L      P+ SP SSP+++  KKD
Sbjct: 610 VKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSPCSSPVVLVPKKD 669

Query: 446 GGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSG------------K 505
           G +R CVDYR LN+ T+SD FP+P I+ LL  +  A +F+ LDL SG            K
Sbjct: 670 GTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYK 729

Query: 506 LNQVTVSDKFPIPVIEELLDELHGATVFSKLHSGPIR----------DTMSSCTDLTEHE 565
              VT S K+   V+   L  ++  + F++  +   R          D +       EH 
Sbjct: 730 TAFVTPSGKYEYTVMPFGL--VNAPSTFARYMADTFRDLRFVNVYLDDILIFSESPEEHW 789

Query: 566 KHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEK 591
           KHL  V   +++      KKKC     + ++LG+ I  + +   + K
Sbjct: 790 KHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHK 834

BLAST of CSPI01G33150 vs. Swiss-Prot
Match: TF28_SCHPO (Transposon Tf2-8 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-8 PE=3 SV=1)

HSP 1 Score: 117.1 bits (292), Expect = 9.7e-25
Identity = 88/290 (30.34%), Positives = 141/290 (48.62%), Query Frame = 1

Query: 599 IIETDASGIALRAVLPQNG-----HPIAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRY 658
           ++ETDAS +A+ AVL Q       +P+ ++S K+S      S+ ++E++A++ S++ WR+
Sbjct: 709 LLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRH 768

Query: 659 YLLG--RNFTIVSNQRALIFLL--EQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANA 718
           YL      F I+++ R LI  +  E      +  +W   L  + FEI Y+P   N  A+A
Sbjct: 769 YLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADA 828

Query: 719 LSRIEQPLE-------------VRSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDE 778
           LSRI    E             V  +  T     +V+  E   D  L  ++    K  +E
Sbjct: 829 LSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVT-EYTNDTKLLNLLNNEDKRVEE 888

Query: 779 SSKFQWINGNLLYKK-RIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKG 838
           + + +  +G L+  K +I+L   + L  T++  +H+     H G       +     WKG
Sbjct: 889 NIQLK--DGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKG 948

Query: 839 MKTDVKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL 866
           ++  +++YV+ C  CQ NK    KP G LQPIP  ER  E  SMDFI  L
Sbjct: 949 IRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITAL 995

BLAST of CSPI01G33150 vs. Swiss-Prot
Match: TF29_SCHPO (Transposon Tf2-9 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-9 PE=3 SV=1)

HSP 1 Score: 117.1 bits (292), Expect = 9.7e-25
Identity = 88/290 (30.34%), Positives = 141/290 (48.62%), Query Frame = 1

Query: 599 IIETDASGIALRAVLPQNG-----HPIAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRY 658
           ++ETDAS +A+ AVL Q       +P+ ++S K+S      S+ ++E++A++ S++ WR+
Sbjct: 709 LLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRH 768

Query: 659 YLLG--RNFTIVSNQRALIFLL--EQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANA 718
           YL      F I+++ R LI  +  E      +  +W   L  + FEI Y+P   N  A+A
Sbjct: 769 YLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADA 828

Query: 719 LSRIEQPLE-------------VRSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDE 778
           LSRI    E             V  +  T     +V+  E   D  L  ++    K  +E
Sbjct: 829 LSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVT-EYTNDTKLLNLLNNEDKRVEE 888

Query: 779 SSKFQWINGNLLYKK-RIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKG 838
           + + +  +G L+  K +I+L   + L  T++  +H+     H G       +     WKG
Sbjct: 889 NIQLK--DGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKG 948

Query: 839 MKTDVKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL 866
           ++  +++YV+ C  CQ NK    KP G LQPIP  ER  E  SMDFI  L
Sbjct: 949 IRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITAL 995

BLAST of CSPI01G33150 vs. Swiss-Prot
Match: TF27_SCHPO (Transposon Tf2-7 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-7 PE=3 SV=1)

HSP 1 Score: 117.1 bits (292), Expect = 9.7e-25
Identity = 88/290 (30.34%), Positives = 141/290 (48.62%), Query Frame = 1

Query: 599 IIETDASGIALRAVLPQNG-----HPIAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRY 658
           ++ETDAS +A+ AVL Q       +P+ ++S K+S      S+ ++E++A++ S++ WR+
Sbjct: 709 LLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRH 768

Query: 659 YLLG--RNFTIVSNQRALIFLL--EQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANA 718
           YL      F I+++ R LI  +  E      +  +W   L  + FEI Y+P   N  A+A
Sbjct: 769 YLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADA 828

Query: 719 LSRIEQPLE-------------VRSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNPDE 778
           LSRI    E             V  +  T     +V+  E   D  L  ++    K  +E
Sbjct: 829 LSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVT-EYTNDTKLLNLLNNEDKRVEE 888

Query: 779 SSKFQWINGNLLYKK-RIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWKG 838
           + + +  +G L+  K +I+L   + L  T++  +H+     H G       +     WKG
Sbjct: 889 NIQLK--DGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKG 948

Query: 839 MKTDVKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL 866
           ++  +++YV+ C  CQ NK    KP G LQPIP  ER  E  SMDFI  L
Sbjct: 949 IRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITAL 995

BLAST of CSPI01G33150 vs. TrEMBL
Match: A5CAG1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_018166 PE=4 SV=1)

HSP 1 Score: 402.5 bits (1033), Expect = 1.3e-108
Identity = 222/531 (41.81%), Positives = 319/531 (60.08%), Query Frame = 1

Query: 341 SLTMTFSMGENRLPSRRDEESDEEQRVKSDEDLFEEPKGLPPKREVDHRILLLPGQKPIN 400
           S T   S G   +P    E   + Q++      FE    LPP R++DH I L+PG  P+N
Sbjct: 153 STTSDLSEGVQEVPKIVKEVLAQHQQI------FEPITXLPPSRDIDHAIQLIPGASPVN 212

Query: 401 MRPYKYGHTQKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQV 460
           +RPY+Y H  K EIE+LV +ML+ GI RP+ SP+SSP+L+ +KKDGGWRFC+DY  LN+V
Sbjct: 213 VRPYRYPHILKNEIERLVQEMLEAGIVRPSLSPFSSPVLLVKKKDGGWRFCIDYCALNKV 272

Query: 461 TVSDKFPIPM------IEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDE 520
           TV D+FPI +      I +     H    +  L +  G  N +         +    L +
Sbjct: 273 TVXDRFPISIRVRQQDIPKTXFRTHEGH-YEFLVMPFGLTNALATFQSLMNRIFRPHLRK 332

Query: 521 LHGATVFSKLHSGPIRDTMSSCTDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQY 580
                VF         D +    DL EH  HL  V +++ ++    N KKC+    Q++Y
Sbjct: 333 F--VLVF-------FDDILVYNKDLKEHCDHLQSVLSILANHHLHVNGKKCLFAKPQLEY 392

Query: 581 LGHLISSKGVEANEEKIKNMNLPFIIETDASGIALRAVLPQNGHPIAFFSQKLSFRPQTK 640
           LGHL+S+KGV A+  KI  M     +E    G  L A+L Q+  P+A+FSQ L  R + K
Sbjct: 393 LGHLVSAKGVAADPNKISAM-----VEDGCLGYGLGAILMQSHKPVAYFSQVLRARKRQK 452

Query: 641 SIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYF 700
           SIYEREL+A+VL+VQKWR+YLLGR+F + ++Q +L FLLEQR V   +QKW+ KL GY F
Sbjct: 453 SIYERELMAIVLAVQKWRHYLLGRHFIVRTDQSSLKFLLEQRIVNESYQKWVAKLFGYDF 512

Query: 701 EILYQPRLQNKAANALSRIEQPLEVRSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNP 760
           EI ++P ++NKA +ALSRI   +E+ ++     ++  +I  +V+ D  L +I + L  + 
Sbjct: 513 EIQFRPGMKNKAVDALSRIPISMELAALMVPSRIDTSLISSQVEADPCLAKIKQRLLDDL 572

Query: 761 DESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWK 820
           D   ++   +G L+YK  +V  K S L+P LL   H S++ GHSGFL TYKR+  + +W 
Sbjct: 573 DAYPRYALDHGILIYKGCLVFPKASPLVPALLQEGHASVVGGHSGFLWTYKRLTRDFFWV 632

Query: 821 GMKTDVKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL 866
           GMK D+K++VE+C VCQ+NK      AG+LQP+PIP++I ++ +MDFIEGL
Sbjct: 633 GMKNDIKEFVEKCLVCQQNKTLTLSLAGLLQPLPIPDKIWDDVTMDFIEGL 662

BLAST of CSPI01G33150 vs. TrEMBL
Match: A5C5K2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_039388 PE=4 SV=1)

HSP 1 Score: 401.0 bits (1029), Expect = 3.7e-108
Identity = 244/727 (33.56%), Positives = 380/727 (52.27%), Query Frame = 1

Query: 223  KIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVE-GMGLALEKGTPFG 282
            +I   AI G     T+ + G++K + V+VLID  +THNFI   I+    GL + +   F 
Sbjct: 349  EIYFHAIAGTEHPQTICVMGKLKNKNVMVLIDGGSTHNFIDQAIIVFKFGLPVIRDRKFE 408

Query: 283  VTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPS 342
            V   +  + +  G C+ L L ++  ++ AD+  + V    L+LG+QWL+T G +++ +  
Sbjct: 409  VMVANREKIECAGQCRSLTLTIQGYSVTADYYILPVAACQLVLGVQWLETLGPIEMDYKQ 468

Query: 343  LTMTFSM----------GENRLPSRRDEESDEEQ------RVKSDEDLFEEPKGLPPKR- 402
            LTM F M          G   + +  ++ES+  Q      ++        EP   P K  
Sbjct: 469  LTMNFKMEGTSHTFQGLGRTGIEALSNKESNGLQGTGLFFQIIPSSSSSSEPNSYPSKIG 528

Query: 403  --------------------EVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVTKMLQT 462
                                  DH+I L P   P+++RPY+Y + QK EIEK+V ++LQ+
Sbjct: 529  QLLAKFSHVFESPTTLPPRRSHDHKIPLQPSAGPVSVRPYRYPYYQKTEIEKMVKELLQS 588

Query: 463  GINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVF 522
            G+ RP++SP+SSPIL+ +K DG WRFCVDYR LN +T+ DK+PIP+I+ELLDELHGA  +
Sbjct: 589  GLIRPSNSPFSSPILLVKKADGAWRFCVDYRALNDITIKDKYPIPVIDELLDELHGAKFY 648

Query: 523  SKLDLKSGKLNQVTVSDKFPIPVIEELLDELH------------GATVFSKLHSGPIR-- 582
            SKLDL+SG  +Q+ V +   IP       E H              T F  L +   R  
Sbjct: 649  SKLDLRSG-YHQIRVHEA-DIPKTAFRTHEGHYEFIVMPFGLTNAPTTFQSLMNDLFRPY 708

Query: 583  ----------DTMSSCTDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLG---- 642
                      D +       +H  HL +V  ++  N  FA + KC     +++YL     
Sbjct: 709  LQKFILVFFYDILIYSRSWEDHLTHLQIVLQILSANSLFAKESKCRFGVLZVEYLASPLT 768

Query: 643  HLISSKGVEANEEK------------------IKNMNLPFIIETDASGIALRAVLPQNGH 702
             L+S +G   NE                    + + + PF+IE DASG+ + A+L Q   
Sbjct: 769  RLLSKEGFHXNEAAEMAFKQLKEALTSPPILCLPDFSQPFVIECDASGLGIGAILTQQNQ 828

Query: 703  PIAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREV 762
             +A+FS+ L       S YE+E++A++ +++KWR YLLG+ FT  ++ ++L +LLEQR  
Sbjct: 829  XVAYFSEALKGSALALSTYEKEMLAIIKAIKKWRPYLLGKPFTXRTBHKSLKYLLEQRIT 888

Query: 763  QPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEVRSMCTTGIVNMEVIEKEVK 822
             P   +WL KLLGY ++I Y+   +N+ A++LSR+ +  ++ S+         +++KE++
Sbjct: 889  TPAQTRWLPKLLGYDYKIEYKRGPENQGADSLSRVVE-FQILSLSMPHADWWSILQKEIQ 948

Query: 823  LDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHS 866
             D   ++ IE  K       K    +G    + ++ LS  SSLIP +L   H S + GH 
Sbjct: 949  QDSFYEKXIE--KSTSQSGHKLLQHDGVWFKRDKVYLSPTSSLIPKILXDCHSSSIGGHF 1008

BLAST of CSPI01G33150 vs. TrEMBL
Match: Q9LP90_ARATH (T32E20.30 OS=Arabidopsis thaliana PE=4 SV=1)

HSP 1 Score: 373.2 bits (957), Expect = 8.3e-100
Identity = 253/777 (32.56%), Positives = 390/777 (50.19%), Query Frame = 1

Query: 173  AKKSEPPVKHLSDAEFRARLDKESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGL 232
            +K+ + P K L         + E LE     EE ++ V +  ++           +  GL
Sbjct: 312  SKEHKCPNKELRVLTVINGFEMEVLESNSVEEEFHDSVAQFAELSFS--------SYMGL 371

Query: 233  TSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEGMGLALEKGTPFGVTTGDGTRCQG 292
             S  T+K+KG I   E         T  +  N               F +  G G   QG
Sbjct: 372  PSYTTIKMKGSICKGEWC------HTQFYFPN---------------FHIRLGTGITVQG 431

Query: 293  RGVCKRLELKL-----KEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFS 352
             G+C ++ + L     +E+ +   F+ +++G VD+ILG+ WL T G  K++W    ++F 
Sbjct: 432  LGLCDKVTMTLPVGCGQELELTTHFITLDLGPVDVILGIAWLRTLGDCKVNWERHELSFL 491

Query: 353  MGENRLPSRRDEESDE-EQRVKSDEDLF--------------EEPKGLPPKREVDHRILL 412
                 +  R D E D  +  +KS    F              +  KGLPP +  +H I L
Sbjct: 492  YHGRTVTLRGDPELDTFKMSLKSFSTKFRLQNKELEVSLNSHQNLKGLPPIKGNEHAISL 551

Query: 413  LPGQKPINMRPYKYGHTQKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKDGGWRFCV 472
            LPG + I++RPY+Y H  KE +E LV++ML  GI R + SP+SSP+L+ +KKD  WRFCV
Sbjct: 552  LPGTRAISVRPYRYPHAHKEAMEGLVSEMLDNGIIRASKSPFSSPVLLVKKKDQSWRFCV 611

Query: 473  DYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSG------KLNQV------TVS 532
            DYR LN+ T+ +KFPIPMI++LLDELHGA +FSKLDL++G      K+  +      T  
Sbjct: 612  DYRALNRATIPNKFPIPMIDQLLDELHGAIIFSKLDLRAGYHQIRMKVEDIEKTTFRTHD 671

Query: 533  DKFPIPVIEELLDELHGATVFSKLHSGPIR------------DTMSSCTDLTEHEKHLGM 592
              F   V+   L   +    F    +  +R            D +    +  EHE+HL M
Sbjct: 672  GHFEFLVMPFGLS--NAPATFQSSMNDMLRPFLRKFVLVFFDDILIYSRNEQEHEEHLAM 731

Query: 593  VFAVMRDNQF--------------------FANKKKCVMPHSQIQYLGHL---------I 652
            V  V+ ++QF                         K V P S  +  G L         +
Sbjct: 732  VLKVLEEHQFYANRKKPYHITQGVSTDPTKTVAMTKWVTPQSVKELRGFLGLTGYYRRFL 791

Query: 653  SSKGVEANE--EKIKNMNLPFIIETDASGIALRAVL---PQNGHPIAFFSQKLSFRPQTK 712
               G  A    E +K  +  +      +  AL+  +   P    P       L+ + Q K
Sbjct: 792  KGYGTLARPLTELLKKDSFVWSESAQEAFDALKRAMSTAPVLALPDFGKVHGLTSKEQLK 851

Query: 713  SIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYF 772
             +YEREL+A+VLS+QKW++YL+GR F + ++Q++L FL EQREV   +QKWLTKLL Y F
Sbjct: 852  PVYERELMAIVLSIQKWKHYLMGRRFVLHTDQKSLKFLQEQREVSMDYQKWLTKLLHYEF 911

Query: 773  EILYQPRLQNKAANALSRIEQP------LEVRSMCTTGIVNMEVIEKEVKLDEDLKRIIE 832
            +ILY+  + NKAA+ LSR+ QP      + + +     ++ +  + +E+  +  L+ +++
Sbjct: 912  DILYKLGVDNKAADGLSRMVQPTGSFSSMLLMAFTVPTVLQLHDLYEEIDSNAHLQHLVK 971

Query: 833  ELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMC 866
            E       +S +    G L  K+R+++ K S  +P +L  +H  +L GHSG L+T KR+ 
Sbjct: 972  ECLSAKQGTSAYTVKEGRLWKKQRLIIPKDSKFLPLILAEYHSGLLGGHSGVLKTMKRIQ 1031

BLAST of CSPI01G33150 vs. TrEMBL
Match: Q7XCD6_ORYSJ (Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica GN=LOC_Os10g40400 PE=4 SV=2)

HSP 1 Score: 362.1 bits (928), Expect = 1.9e-96
Identity = 249/779 (31.96%), Positives = 382/779 (49.04%), Query Frame = 1

Query: 199  EEGRTEES--NEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSK 258
            E+  TE S   ED++ +          I  +A++G  S+ +++++G I+G E+L+LIDS 
Sbjct: 392  EDSPTESSPEREDIIVMA---------ISQQALNGTESRNSIRLRGWIQGTELLMLIDSG 451

Query: 259  ATHNFIHNKIVEGMGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIE 318
            ++H+FI  KI + M        P  V   DG       V        +      DF  I 
Sbjct: 452  SSHSFIDEKIGKSMSGVKLLTKPLKVQIADGGELVCSQVIPNCSWWTQGHNFSNDFKLIP 511

Query: 319  VGNVDLILGMQWLDTTGTMKIHW--------------------PSLTMTFSMGENRLPSR 378
            +G  D+ILGM WL+    MKI+W                     S T    +   +L S 
Sbjct: 512  LGGYDIILGMDWLEQYSPMKIYWVNKWVEFQYQNQWLRVLGISSSTTSYAEISVEQLMSL 571

Query: 379  R------------DEESDEEQRVKSD--------EDLFEEPKGLPPKREVDHRILLLPGQ 438
                         D ++ EEQ              D+F+EP  LPP+R  DH+I L+ G 
Sbjct: 572  AKMGSIMYMVKITDHQNTEEQSYPDSIKQILLEYHDVFDEPSELPPERYCDHQIPLIEGA 631

Query: 439  KPINMRPYKYGHTQKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRK 498
            KP+++RPYKY    K+EIE+ V +ML +G+ +P+ S +SSP L+ RKKDG WR CVDYR 
Sbjct: 632  KPVSLRPYKYNPELKDEIERQVKEMLDSGVIQPSQSAWSSPALLVRKKDGTWRLCVDYRH 691

Query: 499  LNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSGKLNQVTVS--DKFPIPVIEELLDE 558
            LN +TV  K+P+P+IEELLDEL G+  F+KLDL++G  +Q+ +   ++        L   
Sbjct: 692  LNALTVKSKYPVPIIEELLDELSGSKWFTKLDLRAG-YHQIRMKPGEEHKTAFQTHLGHY 751

Query: 559  LHGATVFSKLHS-----GPIRDTMSSCT----------------DLTEHEKHLGMVFAVM 618
             +    F  + +     G + DT+SS                  DL  H +HL  V  ++
Sbjct: 752  EYKVMSFGLIGAPASFQGAMNDTLSSVLRKCALVFFDDILIYSPDLQSHCQHLTQVLQLL 811

Query: 619  RDNQFFANKKKCVMPHSQIQYLG-----HLISSK----------GVEANEEKIK------ 678
            R + +     KC     Q+ YLG     H +S++           V    +K++      
Sbjct: 812  RRDHWQVKISKCSFAQQQVSYLGHVIGVHGVSTEPRKILDVQNWAVPTTVKKLRGFLGLA 871

Query: 679  ------------------------NMNLPFIIETDASGIALRAVLPQNGHPIAFFSQKLS 738
                                    N N PFI+ETDAS   + AVL Q GHPIA+ S+ L 
Sbjct: 872  GYYRKFVKDFDQASSHYCSCLSITNFNFPFIVETDASDGGIGAVLSQEGHPIAYLSKALG 931

Query: 739  FRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTK 798
             R +  S YE+E +A++L+V+ WR YL  + F I+++  +L  L +QR   P  QK  TK
Sbjct: 932  PRSKGLSTYEKECMAILLAVEHWRSYLQHQEFMILTDHHSLTHLSDQRLHTPWQQKAFTK 991

Query: 799  LLGYYFEILYQPRLQNKAANALSR--IEQPLEVRSMCTTGIVNMEVIEKEVKLDEDLKRI 858
            LLG  + I+Y+    N AA+ALSR  +    ++ ++ +     ++ + +  + D+   ++
Sbjct: 992  LLGLQYRIVYRKGSANSAADALSRKDLGDSAQILAVSSCSPSWLQEVIQGYEQDKFSSQL 1051

Query: 859  IEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKR 866
            + EL  NP     +    G + YK RI +   + L   L+   HD+   GHSGF  TY+R
Sbjct: 1052 LAELSLNPKAREHYTLQQGLIRYKGRIWVGNNTDLQLKLIKELHDNPAGGHSGFPVTYRR 1111

BLAST of CSPI01G33150 vs. TrEMBL
Match: A0A087HNU1_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA1G173500 PE=4 SV=1)

HSP 1 Score: 352.1 bits (902), Expect = 2.0e-93
Identity = 232/678 (34.22%), Positives = 369/678 (54.42%), Query Frame = 1

Query: 193 DKESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVL 252
           D   LE E   EES+E+ ++    ++     + L ++ G++S  T+KI+G I+G  V+VL
Sbjct: 341 DGSELELEEADEESDEETIQ----EIVGMATLSLNSMVGISSPRTVKIRGVIQGEPVVVL 400

Query: 253 IDSKATHNFIHNKIVEGMGLALEKGTPFGVTTGDGTRCQGRGVCK-RLELKL--KEITIV 312
           IDS ATHNFI  KIV  + L  E+   +GV TG G   QG+G+CK +L  K+  +++ + 
Sbjct: 401 IDSGATHNFISEKIVTLLRLRTEETKGYGVVTGTGLTVQGQGICKAKLSFKVEGRQVELQ 460

Query: 313 ADFLAIEVGNVDLILGMQWLDTTGT-MKIHWPSLTMTFSMGENRLPSRRDEESDEEQRVK 372
            D   I    V +    + LD  G  + I +  L    + GEN +         E Q+V 
Sbjct: 461 GD-PGICCSPVTMKGLWKALDQEGQGVIIEYAGLQAQRAEGENPVTEGVQSILREFQQV- 520

Query: 373 SDEDLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVTKMLQTGINR 432
                FEEP+GLPP R  +H I                               L +G   
Sbjct: 521 -----FEEPQGLPPSRGREHTI------------------------------ELTSGATP 580

Query: 433 PNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLD 492
            +  P+  P +   + +      +    + +       P+ ++++      G +    +D
Sbjct: 581 VSVRPFRYPQIQREELEKQVAAMLAAGIIQESISLFSSPVLLVKK-----KGGSWRFCVD 640

Query: 493 LKSGKLNQVTVSDKFPIPVIEELLDELHGATVFSKLHSGPIRDTMSSCTDLTEHEKHLGM 552
            ++  LN+VTV D + IP+I++LLDELHG+ +FSKL             DL      +  
Sbjct: 641 YRA--LNKVTVGDSYHIPMIDQLLDELHGSIIFSKL-------------DLRAGYHQI-- 700

Query: 553 VFAVMRDNQFFANKKKCVMPHSQIQYLGHLISSKGVEANEEKIKNMNLPFIIETDASGIA 612
              V+ DNQ FAN KKC     ++ YLGH+IS++GV A+  K++ M + + +  +    A
Sbjct: 701 --RVLADNQLFANSKKCQFGSQKVDYLGHVISAEGVSADPAKVQAM-VDWPVPKNVK--A 760

Query: 613 LRAVLPQNGHPIAFFS-QKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQR 672
           LR  L   G+   F   Q L+ R + KS+YEREL+A+V ++QKW++YLLGR F + ++Q+
Sbjct: 761 LRGFLGLTGYYRKFVKGQALTERQRLKSVYERELMAIVFAIQKWKHYLLGRKFIVKTDQK 820

Query: 673 ALIFLLEQREVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEVRSMCTTGI 732
           +L FLLEQRE+  ++Q+WLTK+LG+ FEI Y+P L+NKAA+ALSRI+   ++ ++     
Sbjct: 821 SLKFLLEQREINLEYQRWLTKILGFDFEIQYKPGLENKAADALSRIDAVPQLCALSMPVA 880

Query: 733 VNMEVIEKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLH 792
           + +  +++ V+ D +L ++ +E+ K+      F  + G LL K R+VL   S L+  +L 
Sbjct: 881 IQLAEVDEAVEKDAELSKLKQEVMKDATSHPDFSVVQGRLLRKGRLVLPAASPLVKLVLQ 940

Query: 793 TFHDSILRGHSGFLRTYKRMCGELYWKGMKTDVKKYVEQCEVCQRNKLEATKPAGVLQPI 852
            FHD  + GH G L+T  R+    +WK M  D+++YV +C+VCQR+K     PAG+LQP+
Sbjct: 941 EFHDGKMGGHGGVLKTQMRISEMFFWKKMMADIRQYVAECQVCQRHKYSTLAPAGLLQPL 950

Query: 853 PIPERILEEWSMDFIEGL 866
           P+P+++ E+ SMDF+EGL
Sbjct: 1001 PVPKQVWEDISMDFVEGL 950

BLAST of CSPI01G33150 vs. TAIR10
Match: AT3G29750.1 (AT3G29750.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 56.2 bits (134), Expect = 1.1e-07
Identity = 54/207 (26.09%), Positives = 94/207 (45.41%), Query Frame = 1

Query: 189 RARLDKESLEEEGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGRE 248
           +A+LD    +++G   E  E  LE     L +G +   + +  LT    M+  G I   +
Sbjct: 81  QAKLDVVK-KKKGVINELEE--LEQDSYTLRQGME---QLVIDLTRNKGMRFYGFILDHK 140

Query: 249 VLVLIDSKATHNFIHNKIVEGMGLALEKGTPFGVTTGDGTRCQGRGVCKRLELKLKEITI 308
           V+V IDS AT NFI  ++   + L         V  G     Q  G C  + L ++E+ I
Sbjct: 141 VVVAIDSGATDNFILVELAFSLKLPTSITNQASVLLGQRQCIQSVGTCLGIRLWVQEVEI 200

Query: 309 VADFLAIEVG--NVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSRRDEESDEEQR 368
             +FL +++   +VD+ILG +WL   G   ++W +   +FS  +  +    + E  E+  
Sbjct: 201 TENFLLLDLAKTDVDVILGYEWLSKLGETMVNWQNQDFSFSHNQQWITLCAEHEELEQVT 260

Query: 369 VKSDEDLFEEPKGLPPKREVDHRILLL 394
            K       E + +  +R  D  +L++
Sbjct: 261 TKVKMKSENEQEDIEEQRNNDGEMLVV 281

BLAST of CSPI01G33150 vs. TAIR10
Match: ATMG00860.1 (ATMG00860.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 53.5 bits (127), Expect = 7.4e-07
Identity = 25/52 (48.08%), Positives = 34/52 (65.38%), Query Frame = 1

Query: 545 HLGMVFAVMRDNQFFANKKKCVMPHSQIQYLG--HLISSKGVEANEEKIKNM 595
           HLGMV  +   +QF+AN+KKC     QI YLG  H+IS +GV A+  K++ M
Sbjct: 3   HLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAM 54

BLAST of CSPI01G33150 vs. NCBI nr
Match: gi|778697580|ref|XP_011654353.1| (PREDICTED: uncharacterized protein LOC105435354 [Cucumis sativus])

HSP 1 Score: 545.8 bits (1405), Expect = 1.3e-151
Identity = 320/559 (57.25%), Positives = 365/559 (65.30%), Query Frame = 1

Query: 151 STNSGRMRNGEEGITKKEK----------EARAKKSEPPVKHLSDAEFRARLDK------ 210
           STN G    GE+GIT+K +          +   +KSEPPVK L D EF+ARLDK      
Sbjct: 258 STNKGP-EGGEKGITRKTEFPLKQVTIPIKGNYQKSEPPVKRLLDVEFKARLDKGLCFKC 317

Query: 211 --------------------------ESLEEEGRTEESNEDVLELKQIKLEKGTKIELKA 270
                                     ESLE+E RTEE+NE+VLEL Q+ LE+GT+IELKA
Sbjct: 318 NERYSPGHRCKMKDKRELMLFIMNEEESLEDEDRTEETNEEVLELNQLTLEEGTEIELKA 377

Query: 271 IHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVEGMGLALEKGTPFGVTTGDGT 330
           IHGLTSKGTMKIKGEIKG+EVL+LIDS ATHNFIHNKIVE +GL LE  TPFGVT GDGT
Sbjct: 378 IHGLTSKGTMKIKGEIKGKEVLILIDSGATHNFIHNKIVEEVGLELENHTPFGVTIGDGT 437

Query: 331 RCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPS------- 390
           RCQGRGVC RLELKLKEITIVADFLAIE+G+VD+ILGMQWL+TTGTMKIHWPS       
Sbjct: 438 RCQGRGVCNRLELKLKEITIVADFLAIELGSVDVILGMQWLNTTGTMKIHWPSLTMTFRM 497

Query: 391 ----------------------LTMT-------FSMGENRLPSRRDEESDEEQRVKSDE- 450
                                 +  T       F +      +  D E DE QRVK DE 
Sbjct: 498 GKKQFILKGDPSLIRAECSLKTIEKTWEEDDQGFLLEMQNYEAEEDGELDEVQRVKGDEE 557

Query: 451 -------------DLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLV 510
                        DLFEEPKGLPPKRE DHRILL+ GQKPIN+RPYKYGHTQKEEIEKL+
Sbjct: 558 ESPMIQVLLQQYTDLFEEPKGLPPKRECDHRILLVTGQKPINVRPYKYGHTQKEEIEKLI 617

Query: 511 TKMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDEL 570
           ++MLQ GI RP+HSPYSSP+L+ RKKDGGWRFCVDYRKLNQVT+SDKFPIP+IEELLDEL
Sbjct: 618 SEMLQVGIIRPSHSPYSSPVLLVRKKDGGWRFCVDYRKLNQVTISDKFPIPVIEELLDEL 677

Query: 571 HGATVFSKLDLKSG----KLNQVTVSDKFPIPVIE---ELLDELHGAT----VFSKLHSG 595
           HGATVFSKLDLKSG    ++ +  V +K      E   E L    G T     F  L + 
Sbjct: 678 HGATVFSKLDLKSGYHQIRMKEEDV-EKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNL 737

BLAST of CSPI01G33150 vs. NCBI nr
Match: gi|922559973|ref|XP_013608212.1| (PREDICTED: uncharacterized protein LOC106314967 [Brassica oleracea var. oleracea])

HSP 1 Score: 419.9 bits (1078), Expect = 1.1e-113
Identity = 286/846 (33.81%), Positives = 445/846 (52.60%), Query Frame = 1

Query: 91   EINNLPKAKKVMVAV---VSFGQDEVDWYRWSHNRKKVESECIKGCVPDLVRADSPSRSG 150
            E+ NL K   V   V   V++G         S   K V+     G  P + R+  PS   
Sbjct: 463  ELRNLRKMMSVAKLVEEWVNYGDPSPA----SQTGKWVKGGGQTGSSPFMSRS-GPSVGN 522

Query: 151  KPQSTNSGRMRN--GEEGITKKEKEARAK------KSEPPVKHLSDAEFRARLDKESLEE 210
             P   N+ R +   G+E I   EK+   +      ++  P + LS        +++  EE
Sbjct: 523  GPGPNNNQRPKTFPGQEKIAGMEKKPTTQNPNFNGRNRAPFRRLSGGV----CEEDEGEE 582

Query: 211  EGRTEESNEDVLELKQIKLEKGTKIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATH 270
              + EE  +D  E + ++  +   +  K++ G++S  T+                + ATH
Sbjct: 583  AEQFEEDEKD--EEEAVEFAECAALSTKSVMGISSPKTI----------------NGATH 642

Query: 271  NFIHNKIVEGMGLALEKGTPFGVTTGDGTRCQGRGVCK-----RLELKLKEITIVADFLA 330
            NFI   +++ +GL  E+   FGV TG G   +G GV +     R+  KL+ + I  +   
Sbjct: 643  NFIDYWLMQELGLVAEETQSFGVITGSGKPVRG-GVARNVGETRVNWKLQTLKIPVEGRL 702

Query: 331  IEVGNVDLILGMQWLDTTGTMKIHWPSLTMTFSMGENRLPSR-RDEESDEEQRVKSD-ED 390
            + +  V  +   +         +    +++     E     + + + S   QR+ S  + 
Sbjct: 703  VSLQGVPNLCSSEVSCKAMQKLLDQAEVSVVVKCREVVAEMKGKAKYSGAMQRLLSQFQQ 762

Query: 391  LFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVTKMLQTGINRPNHS 450
            +F+EP+GLPP R  +H I L+ G  P+++R ++Y H QKEEIEK VT +L+ GI + + S
Sbjct: 763  VFQEPQGLPPTRGREHAINLVRGSNPVSVRLFRYPHAQKEEIEKQVTTILKAGIIQESIS 822

Query: 451  PYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVFSKLDLKSG 510
            P+SSP+L+ +KKDG WRFCVDYR +N+VTV D +PIPMI++LLDEL GA VFSKLDL+S 
Sbjct: 823  PFSSPVLLVKKKDGSWRFCVDYRSINKVTVMDSYPIPMIDQLLDELRGARVFSKLDLRS- 882

Query: 511  KLNQVTVSDKFPIPVIEELLDELH------------GATVFSKLHSGPIR---------- 570
            + +Q+ V  +  IP       + H                F  L +   R          
Sbjct: 883  RYHQIRVKTE-DIPKTSFRTHDGHYEFLVMPFGLSNAPATFQSLMNEIFRPYLRRFVLVF 942

Query: 571  --DTMSSCTDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQ--------------- 630
              D +    +  EH  H+  V   ++ +Q +AN KKC    S I+               
Sbjct: 943  FDDILVFSKNHQEHTAHVRHVLETLQQHQLYANMKKCEFGCSSIEKFVKDYGIIARPLTE 1002

Query: 631  -----YLGHLISSKG----VEANEEKIKNMNLP-----FIIETDASGIALRAVLPQNGHP 690
                   G  + ++G    ++     I  + LP     F++E+DASG  L AVL QN  P
Sbjct: 1003 LLKKDRFGWNVVAEGAFSALKVAMSTIPVLALPDFQEQFVVESDASGKGLGAVLIQNQRP 1062

Query: 691  IAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQ 750
            IA++SQ LS R Q KS+YEREL+A+V ++QKWR+YLLGR F + ++Q++L FLLEQR++ 
Sbjct: 1063 IAYYSQALSERQQLKSVYERELMAIVFAIQKWRHYLLGRKFLVRTDQKSLKFLLEQRKIN 1122

Query: 751  PQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEVRSMCTTGIVNMEVIEKEVKL 810
             ++QKWLTKLLG+ F+I Y+P L+NKA +ALSR     E+ ++     + ++ IEKE+ L
Sbjct: 1123 VEYQKWLTKLLGFDFDIQYKPGLENKAEDALSRRGVATELMALSVPTAIQLQDIEKELSL 1182

Query: 811  DEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSG 866
            D  L+++ EE+  N  +  +++ + G LL + ++VL K S LI  ++   HD ++ GH G
Sbjct: 1183 DPTLQKLKEEIVANKGDHKEYEVVQGRLLRRGKLVLPKESPLIGIIMKELHDGLMGGHRG 1242

BLAST of CSPI01G33150 vs. NCBI nr
Match: gi|147807720|emb|CAN66553.1| (hypothetical protein VITISV_018166 [Vitis vinifera])

HSP 1 Score: 402.5 bits (1033), Expect = 1.8e-108
Identity = 222/531 (41.81%), Positives = 319/531 (60.08%), Query Frame = 1

Query: 341 SLTMTFSMGENRLPSRRDEESDEEQRVKSDEDLFEEPKGLPPKREVDHRILLLPGQKPIN 400
           S T   S G   +P    E   + Q++      FE    LPP R++DH I L+PG  P+N
Sbjct: 153 STTSDLSEGVQEVPKIVKEVLAQHQQI------FEPITXLPPSRDIDHAIQLIPGASPVN 212

Query: 401 MRPYKYGHTQKEEIEKLVTKMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQV 460
           +RPY+Y H  K EIE+LV +ML+ GI RP+ SP+SSP+L+ +KKDGGWRFC+DY  LN+V
Sbjct: 213 VRPYRYPHILKNEIERLVQEMLEAGIVRPSLSPFSSPVLLVKKKDGGWRFCIDYCALNKV 272

Query: 461 TVSDKFPIPM------IEELLDELHGATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDE 520
           TV D+FPI +      I +     H    +  L +  G  N +         +    L +
Sbjct: 273 TVXDRFPISIRVRQQDIPKTXFRTHEGH-YEFLVMPFGLTNALATFQSLMNRIFRPHLRK 332

Query: 521 LHGATVFSKLHSGPIRDTMSSCTDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQY 580
                VF         D +    DL EH  HL  V +++ ++    N KKC+    Q++Y
Sbjct: 333 F--VLVF-------FDDILVYNKDLKEHCDHLQSVLSILANHHLHVNGKKCLFAKPQLEY 392

Query: 581 LGHLISSKGVEANEEKIKNMNLPFIIETDASGIALRAVLPQNGHPIAFFSQKLSFRPQTK 640
           LGHL+S+KGV A+  KI  M     +E    G  L A+L Q+  P+A+FSQ L  R + K
Sbjct: 393 LGHLVSAKGVAADPNKISAM-----VEDGCLGYGLGAILMQSHKPVAYFSQVLRARKRQK 452

Query: 641 SIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREVQPQFQKWLTKLLGYYF 700
           SIYEREL+A+VL+VQKWR+YLLGR+F + ++Q +L FLLEQR V   +QKW+ KL GY F
Sbjct: 453 SIYERELMAIVLAVQKWRHYLLGRHFIVRTDQSSLKFLLEQRIVNESYQKWVAKLFGYDF 512

Query: 701 EILYQPRLQNKAANALSRIEQPLEVRSMCTTGIVNMEVIEKEVKLDEDLKRIIEELKKNP 760
           EI ++P ++NKA +ALSRI   +E+ ++     ++  +I  +V+ D  L +I + L  + 
Sbjct: 513 EIQFRPGMKNKAVDALSRIPISMELAALMVPSRIDTSLISSQVEADPCLAKIKQRLLDDL 572

Query: 761 DESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHSGFLRTYKRMCGELYWK 820
           D   ++   +G L+YK  +V  K S L+P LL   H S++ GHSGFL TYKR+  + +W 
Sbjct: 573 DAYPRYALDHGILIYKGCLVFPKASPLVPALLQEGHASVVGGHSGFLWTYKRLTRDFFWV 632

Query: 821 GMKTDVKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERILEEWSMDFIEGL 866
           GMK D+K++VE+C VCQ+NK      AG+LQP+PIP++I ++ +MDFIEGL
Sbjct: 633 GMKNDIKEFVEKCLVCQQNKTLTLSLAGLLQPLPIPDKIWDDVTMDFIEGL 662

BLAST of CSPI01G33150 vs. NCBI nr
Match: gi|147783182|emb|CAN68669.1| (hypothetical protein VITISV_039388 [Vitis vinifera])

HSP 1 Score: 401.0 bits (1029), Expect = 5.4e-108
Identity = 244/727 (33.56%), Positives = 380/727 (52.27%), Query Frame = 1

Query: 223  KIELKAIHGLTSKGTMKIKGEIKGREVLVLIDSKATHNFIHNKIVE-GMGLALEKGTPFG 282
            +I   AI G     T+ + G++K + V+VLID  +THNFI   I+    GL + +   F 
Sbjct: 349  EIYFHAIAGTEHPQTICVMGKLKNKNVMVLIDGGSTHNFIDQAIIVFKFGLPVIRDRKFE 408

Query: 283  VTTGDGTRCQGRGVCKRLELKLKEITIVADFLAIEVGNVDLILGMQWLDTTGTMKIHWPS 342
            V   +  + +  G C+ L L ++  ++ AD+  + V    L+LG+QWL+T G +++ +  
Sbjct: 409  VMVANREKIECAGQCRSLTLTIQGYSVTADYYILPVAACQLVLGVQWLETLGPIEMDYKQ 468

Query: 343  LTMTFSM----------GENRLPSRRDEESDEEQ------RVKSDEDLFEEPKGLPPKR- 402
            LTM F M          G   + +  ++ES+  Q      ++        EP   P K  
Sbjct: 469  LTMNFKMEGTSHTFQGLGRTGIEALSNKESNGLQGTGLFFQIIPSSSSSSEPNSYPSKIG 528

Query: 403  --------------------EVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVTKMLQT 462
                                  DH+I L P   P+++RPY+Y + QK EIEK+V ++LQ+
Sbjct: 529  QLLAKFSHVFESPTTLPPRRSHDHKIPLQPSAGPVSVRPYRYPYYQKTEIEKMVKELLQS 588

Query: 463  GINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELHGATVF 522
            G+ RP++SP+SSPIL+ +K DG WRFCVDYR LN +T+ DK+PIP+I+ELLDELHGA  +
Sbjct: 589  GLIRPSNSPFSSPILLVKKADGAWRFCVDYRALNDITIKDKYPIPVIDELLDELHGAKFY 648

Query: 523  SKLDLKSGKLNQVTVSDKFPIPVIEELLDELH------------GATVFSKLHSGPIR-- 582
            SKLDL+SG  +Q+ V +   IP       E H              T F  L +   R  
Sbjct: 649  SKLDLRSG-YHQIRVHEA-DIPKTAFRTHEGHYEFIVMPFGLTNAPTTFQSLMNDLFRPY 708

Query: 583  ----------DTMSSCTDLTEHEKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLG---- 642
                      D +       +H  HL +V  ++  N  FA + KC     +++YL     
Sbjct: 709  LQKFILVFFYDILIYSRSWEDHLTHLQIVLQILSANSLFAKESKCRFGVLZVEYLASPLT 768

Query: 643  HLISSKGVEANEEK------------------IKNMNLPFIIETDASGIALRAVLPQNGH 702
             L+S +G   NE                    + + + PF+IE DASG+ + A+L Q   
Sbjct: 769  RLLSKEGFHXNEAAEMAFKQLKEALTSPPILCLPDFSQPFVIECDASGLGIGAILTQQNQ 828

Query: 703  PIAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQREV 762
             +A+FS+ L       S YE+E++A++ +++KWR YLLG+ FT  ++ ++L +LLEQR  
Sbjct: 829  XVAYFSEALKGSALALSTYEKEMLAIIKAIKKWRPYLLGKPFTXRTBHKSLKYLLEQRIT 888

Query: 763  QPQFQKWLTKLLGYYFEILYQPRLQNKAANALSRIEQPLEVRSMCTTGIVNMEVIEKEVK 822
             P   +WL KLLGY ++I Y+   +N+ A++LSR+ +  ++ S+         +++KE++
Sbjct: 889  TPAQTRWLPKLLGYDYKIEYKRGPENQGADSLSRVVE-FQILSLSMPHADWWSILQKEIQ 948

Query: 823  LDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSILRGHS 866
             D   ++ IE  K       K    +G    + ++ LS  SSLIP +L   H S + GH 
Sbjct: 949  QDSFYEKXIE--KSTSQSGHKLLQHDGVWFKRDKVYLSPTSSLIPKILXDCHSSSIGGHF 1008

BLAST of CSPI01G33150 vs. NCBI nr
Match: gi|727620440|ref|XP_010481012.1| (PREDICTED: uncharacterized protein LOC104759832 [Camelina sativa])

HSP 1 Score: 387.9 bits (995), Expect = 4.7e-104
Identity = 216/552 (39.13%), Positives = 320/552 (57.97%), Query Frame = 1

Query: 360 ESDEEQRVKSDEDLFEEPKGLPPKREVDHRILLLPGQKPINMRPYKYGHTQKEEIEKLVT 419
           E  E QRV     +FE P+GLPP R  +H   L  G  P+N+RPY+Y   QK EIEKL+ 
Sbjct: 369 ELPEVQRVLERYSVFEMPQGLPPVRNSEHTTTLKEGAGPVNLRPYRYSFVQKNEIEKLIQ 428

Query: 420 KMLQTGINRPNHSPYSSPILMARKKDGGWRFCVDYRKLNQVTVSDKFPIPMIEELLDELH 479
           +ML   I +P+ SPYSSP+L+ +KKDGGWRFCVDYR L  VTV D++PI  IEELLDE +
Sbjct: 429 EMLAAHIIKPSVSPYSSPVLLVKKKDGGWRFCVDYRALKTVTVLDRYPISGIEELLDEFN 488

Query: 480 GATVFSKLDLKSGKLNQVTVSDKFPIPVIEELLDELH--------GATVFSKLHSGPIRD 539
           GA  FSKLDLKSG  +Q+ V  ++ +     ++ + H        G T         + D
Sbjct: 489 GAVFFSKLDLKSG-YHQIRVR-RYDVEKTAFIMHQGHYEFLVMPFGLTNAPSTFQNVMND 548

Query: 540 ---------TMSSCTDLTEH----EKHLGMVFAVMRDNQFFANKKKCVMPHSQIQYLGHL 599
                     +    D+  +    E HL +V  +++ NQF+ANKKKC    +++ YLGH+
Sbjct: 549 LFRPYLRRFVLVFFDDILVYSPSVESHLQLVLKLLQQNQFYANKKKCSFGQTKVSYLGHV 608

Query: 600 ISSKGVEANEEKIKNM-----------------------NLPFIIETDASGIALRAVLPQ 659
           I   GV A+ + I  M                          F IET+ASG  + AVL Q
Sbjct: 609 ILRDGVAADPKNIVTMVDWHEPRSVTELRGFLGLTGYYRRKEFTIETNASGAGIGAVLSQ 668

Query: 660 NGHPIAFFSQKLSFRPQTKSIYERELIAVVLSVQKWRYYLLGRNFTIVSNQRALIFLLEQ 719
           +  P+AF SQ  S + + KS+YEREL+ +V +V KW++YL+G  F I ++Q++L  LL+Q
Sbjct: 669 DWRPVAFISQAFSSQGRIKSVYERELLVIVKAVTKWKHYLIGEEFVIKTDQKSLRHLLDQ 728

Query: 720 REVQPQFQKWLTKLLGYYFEILYQPRLQNKAANALSR--IEQPLEVRSMCTTGIVNMEVI 779
           + +    Q+W  KL+G  ++I Y+PR+QN+ ANALSR  + + L   ++     ++++ +
Sbjct: 729 KAISTVQQRWAAKLIGLNYKIEYKPRVQNRVANALSRRPLTETLFQLTLVAPLTIDLKEL 788

Query: 780 EKEVKLDEDLKRIIEELKKNPDESSKFQWINGNLLYKKRIVLSKRSSLIPTLLHTFHDSI 839
           +++V  D +L  I++ELK+       +      LL + R+V+  RS  IP LL  FH S+
Sbjct: 789 KEQVSKDMELGEILKELKRGEKVKGGYSLEKEMLLKEGRLVVPCRSLFIPRLLEQFHSSV 848

Query: 840 LRGHSGFLRTYKRMCGELYWKGMKTDVKKYVEQCEVCQRNKLEATKPAGVLQPIPIPERI 866
              H G  +TYKR+  E+YW+GM+ DV +++ +C+V Q NK     PAG+L P+ IP++I
Sbjct: 849 TGEHEGAFKTYKRLIQEIYWRGMQKDVVEFIVKCQVSQENKYSTLSPAGLLAPLTIPKQI 908

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
YG31B_YEAST  1.5e-25  31.28     Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
YI31B_YEAST  1.5e-25  31.28     Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
TF28_SCHPO   9.7e-25  30.34     Transposon Tf2-8 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF29_SCHPO   9.7e-25  30.34     Transposon Tf2-9 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF27_SCHPO   9.7e-25  30.34     Transposon Tf2-7 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
Match Name        E-value   Identity  Description
A5CAG1_VITVI      1.3e-108  41.81     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_018166 PE=4 SV=1
A5C5K2_VITVI      3.7e-108  33.56     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_039388 PE=4 SV=1
Q9LP90_ARATH      8.3e-100  32.56     T32E20.30 OS=Arabidopsis thaliana PE=4 SV=1
Q7XCD6_ORYSJ      1.9e-96   31.96     Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica ...
A0A087HNU1_ARAAL  2.0e-93   34.22     Uncharacterized protein OS=Arabis alpina GN=AALP_AA1G173500 PE=4 SV=1
Match Name   E-value  Identity  Description
AT3G29750.1  1.1e-07  26.09     Eukaryotic aspartyl protease family protein
ATMG00860.1  7.4e-07  48.08     ATMG00860.1 DNA/RNA polymerases superfamily protein
Match Name                        E-value   Identity  Description
gi|778697580|ref|XP_011654353.1|  1.3e-151  57.25     PREDICTED: uncharacterized protein LOC105435354 [Cucumis sativus]
gi|922559973|ref|XP_013608212.1|  1.1e-113  33.81     PREDICTED: uncharacterized protein LOC106314967 [Brassica oleracea var. oleracea...
gi|147807720|emb|CAN66553.1|      1.8e-108  41.81     hypothetical protein VITISV_018166 [Vitis vinifera]
gi|147783182|emb|CAN68669.1|      5.4e-108  33.56     hypothetical protein VITISV_039388 [Vitis vinifera]
gi|727620440|ref|XP_010481012.1|  4.7e-104  39.13     PREDICTED: uncharacterized protein LOC104759832 [Camelina sativa]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR013242  Retroviral aspartyl protease
IPR021109  Peptidase_aspartic_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0043170      macromolecule metabolic process
biological_process  GO:0044238      primary metabolic process
cellular_component  GO:0005622      intracellular
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI01G33150.1  CSPI01G33150.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description               Source   Source Term       Source Description   Alignment
IPR013242  Retroviral aspartyl protease  PFAM     PF08284           RVP_2                coord: 249..330, score: 6.4
IPR021109  Aspartic peptidase domain     unknown  SSF50630          Acid proteases       coord: 234..333, score: 6.9
None       No IPR available              unknown  Coil              Coil                 coord: 733..753, score: -; coord: 4..31, scor
None       No IPR available              GENE3D   G3DSA:3.10.10.10                       coord: 393..481, score: 1.4
None       No IPR available              PANTHER  PTHR24559         FAMILY NOT NAMED     coord: 434..865, score: 2.1E
None       No IPR available              PANTHER  PTHR24559:SF186   SUBFAMILY NOT NAMED  coord: 434..865, score: 2.1E
None       No IPR available              unknown  SSF56672          DNA/RNA polymerases  coord: 375..492, score: 4.67E-38; coord: 493..698, score: 9.34
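
The `coord: A..B` entries in the InterPro table give the span of each predicted domain on the protein. As a rough sketch (not part of the original record), the spans can be pulled out and their lengths computed as below. `domain_spans` is a hypothetical helper, and the lengths assume the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

# Matches "coord: 249..330" style entries; also works when the entry is
# fused to neighbouring text, e.g. "score: 4.67E-38coord: 493..698".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_spans(text):
    """Hypothetical helper: return (start, end, length) for each coord entry.

    Length assumes 1-based inclusive coordinates (an assumption, not
    something stated on the page).
    """
    spans = []
    for match in COORD.finditer(text):
        start, end = int(match.group(1)), int(match.group(2))
        spans.append((start, end, end - start + 1))
    return spans


# The two SSF56672 (DNA/RNA polymerases) matches from the table above.
print(domain_spans("coord: 375..492 score: 4.67E-38coord: 493..698"))
# [(375, 492, 118), (493, 698, 206)]
```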

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene          Organism                    Block
CSPI01G33150  Cucumber (Chinese Long) v3  cpicucB000
CSPI01G33150  Cucumber (Gy14) v1          cgycpiB584
CSPI01G33150  Cucumber (Chinese Long) v2  cpicuB001
CSPI01G33150  Melon (DHL92) v3.5.1        cpimeB044
CSPI01G33150  Watermelon (97103) v1       cpiwmB020
CSPI01G33150  Bottle gourd (USVL1VR-Ls)   cpilsiB059
CSPI01G33150  Melon (DHL92) v3.6.1        cpimedB038
CSPI01G33150  Cucumber (Gy14) v2          cgybcpiB002